KR20200064129A - Transgenic selection methods and compositions - Google Patents
Transgenic selection methods and compositions
- Publication number
- KR20200064129A KR1020207013411A
- Authority
- KR
- South Korea
- Prior art keywords
- intein
- terminal fragment
- protein
- fragment
- nucleotide sequence
- Prior art date
Links
- 230000009261 transgenic effect Effects 0.000 title claims abstract description 59
- 239000000203 mixture Substances 0.000 title description 15
- 238000010187 selection method Methods 0.000 title description 7
- 230000017730 intein-mediated protein splicing Effects 0.000 claims abstract description 586
- 239000003550 marker Substances 0.000 claims abstract description 168
- 108090000623 proteins and genes Proteins 0.000 claims description 663
- 102000004169 proteins and genes Human genes 0.000 claims description 498
- 239000013598 vector Substances 0.000 claims description 372
- 239000012634 fragment Substances 0.000 claims description 288
- 210000004900 c-terminal fragment Anatomy 0.000 claims description 286
- 210000004898 n-terminal fragment Anatomy 0.000 claims description 280
- 239000002773 nucleotide Substances 0.000 claims description 254
- 125000003729 nucleotide group Chemical group 0.000 claims description 254
- 238000000034 method Methods 0.000 claims description 170
- 230000003115 biocidal effect Effects 0.000 claims description 138
- 238000011144 upstream manufacturing Methods 0.000 claims description 132
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 110
- 108091006047 fluorescent proteins Proteins 0.000 claims description 81
- 102000034287 fluorescent proteins Human genes 0.000 claims description 79
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 claims description 56
- 239000004055 small Interfering RNA Substances 0.000 claims description 52
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 claims description 42
- 239000003242 anti bacterial agent Substances 0.000 claims description 29
- 108700011259 MicroRNAs Proteins 0.000 claims description 26
- 108091027967 Small hairpin RNA Proteins 0.000 claims description 26
- 108020004459 Small interfering RNA Proteins 0.000 claims description 26
- 239000002679 microRNA Substances 0.000 claims description 26
- 239000000178 monomer Substances 0.000 claims description 24
- 101150111388 pac gene Proteins 0.000 claims description 24
- 229930189065 blasticidin Natural products 0.000 claims description 23
- 108010048367 enhanced green fluorescent protein Proteins 0.000 claims description 21
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 claims description 21
- 229950010131 puromycin Drugs 0.000 claims description 21
- 241001045988 Neogene Species 0.000 claims description 19
- 101150091879 neo gene Proteins 0.000 claims description 19
- 239000013603 viral vector Substances 0.000 claims description 18
- 101150084954 bsr gene Proteins 0.000 claims description 16
- 108091027963 non-coding RNA Proteins 0.000 claims description 16
- 102000042567 non-coding RNA Human genes 0.000 claims description 16
- 229910052594 sapphire Inorganic materials 0.000 claims description 16
- 239000010980 sapphire Substances 0.000 claims description 16
- 210000004899 c-terminal region Anatomy 0.000 claims description 15
- 108020005544 Antisense RNA Proteins 0.000 claims description 13
- 239000003184 complementary RNA Substances 0.000 claims description 13
- 210000004962 mammalian cell Anatomy 0.000 claims description 13
- 229920002477 rna polymer Polymers 0.000 claims description 13
- 239000013600 plasmid vector Substances 0.000 claims description 12
- 108010021843 fluorescent protein 583 Proteins 0.000 claims description 9
- 108010054624 red fluorescent protein Proteins 0.000 claims description 9
- YMHOBZXQZVXHBM-UHFFFAOYSA-N 2,5-dimethoxy-4-bromophenethylamine Chemical compound COC1=CC(CCN)=C(OC)C=C1Br YMHOBZXQZVXHBM-UHFFFAOYSA-N 0.000 claims description 8
- 108091005944 Cerulean Proteins 0.000 claims description 8
- 108091005960 Citrine Proteins 0.000 claims description 8
- 108010054814 DNA Gyrase Proteins 0.000 claims description 8
- 108091005942 ECFP Proteins 0.000 claims description 8
- 241000219793 Trifolium Species 0.000 claims description 8
- 241000545067 Venus Species 0.000 claims description 8
- 239000011035 citrine Substances 0.000 claims description 8
- 108091005949 mKalama1 Proteins 0.000 claims description 8
- 108091005958 mTurquoise2 Proteins 0.000 claims description 8
- 108010082025 cyan fluorescent protein Proteins 0.000 claims description 7
- 108010013829 alpha subunit DNA polymerase III Proteins 0.000 claims 3
- 241000424623 Nostoc punctiforme Species 0.000 claims 1
- 241000192581 Synechocystis sp. Species 0.000 claims 1
- 235000018102 proteins Nutrition 0.000 description 453
- 150000001413 amino acids Chemical group 0.000 description 264
- 210000004027 cell Anatomy 0.000 description 252
- 239000013612 plasmid Substances 0.000 description 167
- 229940024606 amino acid Drugs 0.000 description 104
- 235000001014 amino acid Nutrition 0.000 description 102
- 108090000765 processed proteins & peptides Proteins 0.000 description 50
- 108700019146 Transgenes Proteins 0.000 description 47
- 108010071146 DNA Polymerase III Proteins 0.000 description 44
- 102000004196 processed proteins & peptides Human genes 0.000 description 44
- 102000007528 DNA Polymerase III Human genes 0.000 description 43
- 229920001184 polypeptide Polymers 0.000 description 42
- 239000005090 green fluorescent protein Substances 0.000 description 28
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 26
- 108020004414 DNA Proteins 0.000 description 20
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 20
- 102000040430 polynucleotide Human genes 0.000 description 17
- 108091033319 polynucleotide Proteins 0.000 description 17
- 239000002157 polynucleotide Substances 0.000 description 17
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 16
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 16
- 108010044940 alanylglutamine Proteins 0.000 description 15
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 14
- 108010057821 leucylproline Proteins 0.000 description 14
- 230000008685 targeting Effects 0.000 description 14
- 108010034529 leucyl-lysine Proteins 0.000 description 13
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 12
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 12
- 230000002195 synergetic effect Effects 0.000 description 12
- 238000010361 transduction Methods 0.000 description 12
- 230000026683 transduction Effects 0.000 description 12
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 11
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 11
- 229940088710 antibiotic agent Drugs 0.000 description 11
- 102000039446 nucleic acids Human genes 0.000 description 11
- 108020004707 nucleic acids Proteins 0.000 description 11
- 150000007523 nucleic acids Chemical class 0.000 description 11
- 108700028369 Alleles Proteins 0.000 description 10
- 108010068380 arginylarginine Proteins 0.000 description 10
- 238000000684 flow cytometry Methods 0.000 description 10
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 9
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 9
- 101001000998 Homo sapiens Protein phosphatase 1 regulatory subunit 12C Proteins 0.000 description 9
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 9
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 9
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 230000001404 mediated effect Effects 0.000 description 9
- 239000002609 medium Substances 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 8
- 241000700605 Viruses Species 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 7
- PWYFCPCBOYMOGB-LKTVYLICSA-N Ala-Gln-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWYFCPCBOYMOGB-LKTVYLICSA-N 0.000 description 7
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 7
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 7
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 7
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 7
- 229930193140 Neomycin Natural products 0.000 description 7
- 102100035620 Protein phosphatase 1 regulatory subunit 12C Human genes 0.000 description 7
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 7
- 235000018417 cysteine Nutrition 0.000 description 7
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 7
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 229960004927 neomycin Drugs 0.000 description 7
- 108010051242 phenylalanylserine Proteins 0.000 description 7
- 230000006798 recombination Effects 0.000 description 7
- 238000005215 recombination Methods 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- 230000014616 translation Effects 0.000 description 7
- 108010029384 tryptophyl-histidine Proteins 0.000 description 7
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 6
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 6
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 6
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 6
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 6
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 6
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 6
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 6
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 6
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 6
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 6
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 6
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 6
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 6
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 6
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 6
- 108010090461 DFG peptide Proteins 0.000 description 6
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 6
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 6
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 6
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 6
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 6
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 6
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 6
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 6
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 6
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 6
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 6
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 6
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 6
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 6
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 6
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 6
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 6
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 6
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 6
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 6
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 6
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 6
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 6
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 6
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 6
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 6
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 6
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 6
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 6
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 6
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 6
- 108010050848 glycylleucine Proteins 0.000 description 6
- 235000011475 lollipops Nutrition 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 230000016434 protein splicing Effects 0.000 description 6
- 241000894007 species Species 0.000 description 6
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 5
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 5
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 5
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 5
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 5
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 5
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 5
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 5
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 5
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 5
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 5
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 5
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 5
- 102100021277 Beta-secretase 2 Human genes 0.000 description 5
- 108091033409 CRISPR Proteins 0.000 description 5
- 241000579895 Chlorostilbon Species 0.000 description 5
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 5
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 5
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 5
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 5
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 5
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 5
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 5
- UXZMINKIEWBEQU-SZMVWBNQSA-N His-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N UXZMINKIEWBEQU-SZMVWBNQSA-N 0.000 description 5
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 5
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 5
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 5
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 5
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 5
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 5
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 5
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 5
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 5
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 5
- 101800000135 N-terminal protein Proteins 0.000 description 5
- 101800001452 P1 proteinase Proteins 0.000 description 5
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 5
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 5
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 5
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 5
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 5
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 5
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 5
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 5
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 5
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 5
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 5
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 5
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 5
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 239000010976 emerald Substances 0.000 description 5
- 229910052876 emerald Inorganic materials 0.000 description 5
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 5
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 5
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 5
- 108010077515 glycylproline Proteins 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 210000001236 prokaryotic cell Anatomy 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 4
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 4
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 4
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 4
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 4
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 4
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 4
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 4
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 4
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 4
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 4
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 4
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 4
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 4
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 4
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 4
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 4
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 4
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 4
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 4
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 4
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 4
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 4
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 4
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 4
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 4
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 4
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 4
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 4
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 4
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 4
- WURZLPSMYZLEGH-UNQGMJICSA-N Phe-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N)O WURZLPSMYZLEGH-UNQGMJICSA-N 0.000 description 4
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 4
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 4
- 101800001978 Ssp dnaB intein Proteins 0.000 description 4
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 4
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 4
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 4
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 4
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 4
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 4
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 229910052698 phosphorus Inorganic materials 0.000 description 4
- 239000011574 phosphorus Substances 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 229960005322 streptomycin Drugs 0.000 description 4
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 3
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 3
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 3
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 3
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 3
- 101710154825 Aminoglycoside 3'-phosphotransferase Proteins 0.000 description 3
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 3
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 3
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 3
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 3
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 3
- 241000304886 Bacilli Species 0.000 description 3
- 101710150190 Beta-secretase 2 Proteins 0.000 description 3
- 108010045123 Blasticidin-S deaminase Proteins 0.000 description 3
- 238000010453 CRISPR/Cas method Methods 0.000 description 3
- POSRGGKLRWCUBE-CIUDSAMLSA-N Cys-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N POSRGGKLRWCUBE-CIUDSAMLSA-N 0.000 description 3
- 101001091269 Escherichia coli Hygromycin-B 4-O-kinase Proteins 0.000 description 3
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 3
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 3
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 3
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 3
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 3
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 3
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 3
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 3
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 3
- LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 3
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 3
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 3
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 3
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 3
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 3
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 3
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 3
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 3
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 3
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 3
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 3
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 3
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 3
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 3
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 3
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 3
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 3
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 3
- 241000607142 Salmonella Species 0.000 description 3
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 3
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 3
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 3
- 241000607768 Shigella Species 0.000 description 3
- 241000194017 Streptococcus Species 0.000 description 3
- 101001091268 Streptomyces hygroscopicus Hygromycin-B 7''-O-kinase Proteins 0.000 description 3
- 108091027544 Subgenomic mRNA Proteins 0.000 description 3
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 3
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 3
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 3
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 3
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 3
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 3
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 3
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 3
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 3
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 3
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 3
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 3
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 3
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 3
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- 241000607598 Vibrio Species 0.000 description 3
- 108010084455 Zeocin Proteins 0.000 description 3
- 125000002252 acyl group Chemical group 0.000 description 3
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 101150046240 bsd gene Proteins 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 3
- 230000009977 dual effect Effects 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 230000002401 inhibitory effect Effects 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 3
- -1 phleomycin D1 Natural products 0.000 description 3
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 3
- 108010045647 puromycin N-acetyltransferase Proteins 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 239000006152 selective media Substances 0.000 description 3
- 239000012096 transfection reagent Substances 0.000 description 3
- 102000007469 Actins Human genes 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 2
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 2
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108010006654 Bleomycin Proteins 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 2
- 241000193403 Clostridium Species 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- ZQHQTSONVIANQR-BQBZGAKWSA-N Cys-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N ZQHQTSONVIANQR-BQBZGAKWSA-N 0.000 description 2
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 2
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 2
- 239000006144 Dulbecco's modified Eagle's medium Substances 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 2
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- XFHMVFKCQSHLKW-HJGDQZAQSA-N Gln-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XFHMVFKCQSHLKW-HJGDQZAQSA-N 0.000 description 2
- FVEMBYKESRUFBG-SZMVWBNQSA-N Gln-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FVEMBYKESRUFBG-SZMVWBNQSA-N 0.000 description 2
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 2
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- ZCFNZTVIDMLUQC-SXNHZJKMSA-N Glu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZCFNZTVIDMLUQC-SXNHZJKMSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 2
- ORZGPQXISSXQGW-IHRRRGAJSA-N His-His-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O ORZGPQXISSXQGW-IHRRRGAJSA-N 0.000 description 2
- IGBBXBFSLKRHJB-BZSNNMDCSA-N His-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 IGBBXBFSLKRHJB-BZSNNMDCSA-N 0.000 description 2
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 2
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 2
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 2
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysine Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- 241000589248 Legionella Species 0.000 description 2
- 208000007764 Legionnaires' Disease Diseases 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 2
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 2
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 2
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 2
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 2
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 2
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 2
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 2
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 241000588653 Neisseria Species 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 2
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 2
- PSBJZLMFFTULDX-IXOXFDKPSA-N Phe-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N)O PSBJZLMFFTULDX-IXOXFDKPSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 2
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- FDINZVJXLPILKV-DCAQKATOSA-N Pro-His-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O FDINZVJXLPILKV-DCAQKATOSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- 241000607720 Serratia Species 0.000 description 2
- 241000607762 Shigella flexneri Species 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 2
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- WFZYXGSAPWKTHR-XEGUGMAKSA-N Trp-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WFZYXGSAPWKTHR-XEGUGMAKSA-N 0.000 description 2
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 2
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 2
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 2
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 2
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 2
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 2
- NAHUCETZGZZSEX-IHPCNDPISA-N Tyr-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NAHUCETZGZZSEX-IHPCNDPISA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- 241000607626 Vibrio cholerae Species 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 101150038738 ble gene Proteins 0.000 description 2
- 229960001561 bleomycin Drugs 0.000 description 2
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 2
- 101150060238 bls gene Proteins 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 101150102092 ccdB gene Proteins 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 229960005091 chloramphenicol Drugs 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- 101150047832 hpt gene Proteins 0.000 description 2
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 2
- 229940097277 hygromycin b Drugs 0.000 description 2
- 108010002685 hygromycin-B kinase Proteins 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000008707 rearrangement Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- CXNPLSGKWMLZPZ-GIFSMMMISA-N (2r,3r,6s)-3-[[(3s)-3-amino-5-[carbamimidoyl(methyl)amino]pentanoyl]amino]-6-(4-amino-2-oxopyrimidin-1-yl)-3,6-dihydro-2h-pyran-2-carboxylic acid Chemical compound O1[C@@H](C(O)=O)[C@H](NC(=O)C[C@@H](N)CCN(C)C(N)=N)C=C[C@H]1N1C(=O)N=C(N)C=C1 CXNPLSGKWMLZPZ-GIFSMMMISA-N 0.000 description 1
- ABCLPPNEPBAKRL-ROQIPNNMSA-N (2r,3s,4s,5r,6s)-2-[(1r)-1-aminoethyl]-6-[(1r,2r,3s,4r,6s)-4,6-diamino-3-[(2r,3r,4r,5r)-3,5-dihydroxy-5-methyl-4-(methylamino)oxan-2-yl]oxy-2-hydroxycyclohexyl]oxyoxane-3,4,5-triol Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H]([C@@H](C)N)O2)O)[C@@H](N)C[C@H]1N ABCLPPNEPBAKRL-ROQIPNNMSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 1
- SGKRLCUYIXIAHR-AKNGSSGZSA-N (4s,4ar,5s,5ar,6r,12ar)-4-(dimethylamino)-1,5,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4a,5,5a,6-tetrahydro-4h-tetracene-2-carboxamide Chemical compound C1=CC=C2[C@H](C)[C@@H]([C@H](O)[C@@H]3[C@](C(O)=C(C(N)=O)C(=O)[C@H]3N(C)C)(O)C3=O)C3=C(O)C2=C1O SGKRLCUYIXIAHR-AKNGSSGZSA-N 0.000 description 1
- DQPMXYDFWRYWQV-UHFFFAOYSA-N 2-[[6-amino-2-[[2-[(2-amino-3-methylbutanoyl)amino]-3-hydroxybutanoyl]amino]hexanoyl]amino]acetic acid Chemical compound CC(C)C(N)C(=O)NC(C(C)O)C(=O)NC(CCCCN)C(=O)NCC(O)=O DQPMXYDFWRYWQV-UHFFFAOYSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- RYSMHWILUNYBFW-GRIPGOBMSA-N 3'-amino-3'-deoxy-N(6),N(6)-dimethyladenosine Chemical compound C1=NC=2C(N(C)C)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](N)[C@H]1O RYSMHWILUNYBFW-GRIPGOBMSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- XYKDZXKKYOOTGC-FXQIFTODSA-N Ala-Cys-Met Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N XYKDZXKKYOOTGC-FXQIFTODSA-N 0.000 description 1
- HJCMDXDYPOUFDY-WHFBIAKZSA-N Ala-Gln Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O HJCMDXDYPOUFDY-WHFBIAKZSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UFBFGSQYSA-N Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UFBFGSQYSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- VQBULXOHAZSTQY-GKCIPKSASA-N Ala-Trp-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VQBULXOHAZSTQY-GKCIPKSASA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 1
- DHVMIHWNDBFTHB-FXQIFTODSA-N Asn-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N DHVMIHWNDBFTHB-FXQIFTODSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 1
- LNENWJXDHCFVOF-DCAQKATOSA-N Asp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LNENWJXDHCFVOF-DCAQKATOSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- 241001465318 Aspergillus terreus Species 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000509998 Basiliscus Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 241000589562 Brucella Species 0.000 description 1
- 241000589567 Brucella abortus Species 0.000 description 1
- 241000722910 Burkholderia mallei Species 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 101100352418 Caenorhabditis elegans plp-1 gene Proteins 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 241000193449 Clostridium tetani Species 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 241000192700 Cyanobacteria Species 0.000 description 1
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 241000588877 Eikenella Species 0.000 description 1
- 241000588878 Eikenella corrodens Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- 241001501603 Haemophilus aegyptius Species 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- 241001479528 Hygrophila corymbosa Species 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- DJQUZZAFLFQVFL-UHFFFAOYSA-N Ile-Gly-Leu-Pro Chemical compound CCC(C)C(N)C(=O)NCC(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O DJQUZZAFLFQVFL-UHFFFAOYSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- JERJIYYCOGBAIJ-OBAATPRFSA-N Ile-Tyr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JERJIYYCOGBAIJ-OBAATPRFSA-N 0.000 description 1
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 241000187722 Micromonospora echinospora Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 241000186359 Mycobacterium Species 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 101710118186 Neomycin resistance protein Proteins 0.000 description 1
- 241000894763 Nostoc punctiforme PCC 73102 Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- RGZYXNFHYRFNNS-MXAVVETBSA-N Phe-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGZYXNFHYRFNNS-MXAVVETBSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- YMTMNYNEZDAGMW-RNXOBYDBSA-N Phe-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YMTMNYNEZDAGMW-RNXOBYDBSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241001354013 Salmonella enterica subsp. enterica serovar Enteritidis Species 0.000 description 1
- 241000293871 Salmonella enterica subsp. enterica serovar Typhi Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000607715 Serratia marcescens Species 0.000 description 1
- 241000607764 Shigella dysenteriae Species 0.000 description 1
- 241000203644 Streptoalloteichus hindustanus Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 241001312524 Streptococcus viridans Species 0.000 description 1
- 241000913727 Streptomyces alboniger Species 0.000 description 1
- 241000187759 Streptomyces albus Species 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 241000970979 Streptomyces griseochromogenes Species 0.000 description 1
- 241000187391 Streptomyces hygroscopicus Species 0.000 description 1
- 241000187398 Streptomyces lividans Species 0.000 description 1
- 241001147844 Streptomyces verticillus Species 0.000 description 1
- 241000192584 Synechocystis Species 0.000 description 1
- 241000589262 Tatlockia micdadei Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/60—Fusion polypeptide containing spectroscopic/fluorescent detection, e.g. green fluorescent protein [GFP]
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/61—Fusion polypeptide containing an enzyme fusion for detection (lacZ, luciferase)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/90—Fusion polypeptide containing a motif for post-translational modification
- C07K2319/92—Fusion polypeptide containing a motif for post-translational modification containing an intein ("protein splicing")domain
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/30—Chemical structure
- C12N2310/35—Nature of the modification
- C12N2310/351—Conjugate
- C12N2310/3517—Marker; Tag
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/30—Chemical structure
- C12N2310/35—Nature of the modification
- C12N2310/351—Conjugate
- C12N2310/3519—Fusion with another nucleic acid
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16041—Use of virus, viral particle or viral elements as a vector
- C12N2740/16043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Cell Biology (AREA)
- Mycology (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
The present disclosure provides split intein selectable marker systems for the generation and selection of transgenic cells.
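Description
<Related Applications>
This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 62/616,281, filed January 11, 2018; U.S. Provisional Application No. 62/608,478, filed December 20, 2017; U.S. Provisional Application No. 62/624,629, filed January 31, 2018; and U.S. Provisional Application No. 62/571,672, filed October 12, 2017, each of which is incorporated herein by reference in its entirety.
<Sequence Listing>
This application contains a Sequence Listing in computer-readable form (file name: J022770007WO00-SEQ-HJD; 1.50 MB, ASCII text file; created October 3, 2018), which is incorporated herein by reference in its entirety and forms part of the present disclosure.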
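Selectable markers are widely adopted in transgenesis and genome editing to select engineered cells that have a desired genotype. Antibiotic resistance genes (which encode antibiotic resistance proteins) confer resistance to specific antibiotics, so that only cells expressing these resistance genes survive and proliferate. Antibiotic resistance gene/antibiotic pairs available for use in eukaryotic cells include hygB/hygromycin, neo/Geneticin®/G418, pac/puromycin, Sh bla/phleomycin D1 (Zeocin™), and bsd/blasticidin. Fluorescent proteins, such as green fluorescent protein (GFP), provide another means of cell selection, for example through fluorescence-activated cell sorting (FACS) or fluorescence microscopy.
<Summary>
Because the number of antibiotic resistance gene/antibiotic pairs available for use in eukaryotic (e.g., mammalian) cells is limited, the selection schemes available for identifying cells that contain multiple transgenes are also limited. Not only is the number of distinct genes that confer antibiotic resistance in eukaryotic cells limited, but the simultaneous use of as few as three different antibiotic resistance genes can have a detrimental effect on the health of transgenic cells. Antibiotic selection can be performed sequentially, but that process is time-consuming. These limitations on selection schemes are problematic when cells into which multiple transgenes have been introduced need to be identified (e.g., to generate transgenic organisms, for example animal models such as mouse models).
Provided herein are methods, compositions, and kits useful, for example, for generating and/or identifying cells and/or organisms that carry two or more transgenes (e.g., double-transgenic, triple-transgenic, etc.). For example, the compositions and kits can be used to generate and/or identify cells and/or organisms that carry 2, 3, or 4 transgenes. The technology is based, at least in part, on a protein splicing mechanism initiated by intein auto-processing domains, which facilitates the specific joining (ligation) of multiple (e.g., 2, 3, or 4) separate selectable marker protein fragments in multi-transgenic cells (double-transgenic, triple-transgenic, or quadruple-transgenic cells). Joining 2, 3, 4, or more separate selectable marker protein fragments in a multi-transgenic cell produces a full-length selectable marker protein that can, for example, confer antibiotic resistance (an antibiotic resistance protein) or fluoresce under light of an appropriate wavelength (a fluorescent protein). Cells that express the full-length antibiotic resistance protein survive in the presence of the corresponding antibiotic and are therefore selected as multi-transgenic (e.g., double-transgenic, triple-transgenic, or quadruple-transgenic) cells. Likewise, cells that express a full-length functional fluorescent protein fluoresce under light of an appropriate wavelength and are therefore selected as multi-transgenic (e.g., double-transgenic, triple-transgenic, or quadruple-transgenic) cells.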
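Thus, in some embodiments, the present disclosure provides methods that comprise delivering two or more vectors to a composition comprising eukaryotic cells, wherein each vector comprises (i) a nucleotide sequence encoding a selectable marker protein fragment linked to an N-terminal intein protein fragment and/or a C-terminal intein protein fragment and (ii) a nucleotide sequence encoding a molecule of interest, and wherein the intein protein fragments catalyze joining of the selectable marker protein fragments, which when joined in frame form a full-length functional protein, to produce a full-length selectable marker protein. For example, when two vectors are delivered to a population of cells (e.g., under transfection conditions), some cells will receive the first vector (i.e., the vector enters the cell), some cells will receive the second vector, and some cells will receive both vectors. Only cells that receive both vectors can express the full-length functional selectable marker protein, so only these cells are selected as double-transgenic cells.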
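In some embodiments, provided herein are methods that comprise delivering to a composition comprising eukaryotic cells (a) a first vector comprising (i) a nucleotide sequence encoding a first selectable marker protein fragment (e.g., an antibiotic resistance protein fragment or a fluorescent protein fragment) upstream of a nucleotide sequence encoding an N-terminal intein protein fragment and (ii) a nucleotide sequence encoding a first molecule, and (b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal intein protein fragment upstream of a nucleotide sequence encoding a second selectable marker protein fragment (e.g., an antibiotic resistance protein fragment or a fluorescent protein fragment) and (ii) a nucleotide sequence encoding a second molecule, wherein the N-terminal intein protein fragment and the C-terminal intein protein fragment catalyze joining of the first selectable marker protein fragment to the second selectable marker protein fragment to produce a full-length selectable marker protein. When the two vectors are delivered to a population of cells (e.g., under transfection conditions), some cells will receive the first vector (i.e., the vector enters the cell), some cells will receive the second vector, and some cells will receive both vectors. Only cells that receive both vectors can express the full-length functional selectable marker protein, so only these cells are selected as double-transgenic cells.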
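The selection logic of the two-vector embodiment can be illustrated with a small simulation. This is only a sketch under stated assumptions: the function names, the uptake probabilities, and the idea that each vector enters a cell independently with a fixed probability are all hypothetical and are not taken from the disclosure. Selection is also idealized, so a cell survives only if it received both vectors.

```python
import random


def simulate_co_delivery(n_cells, p_vector1, p_vector2, seed=0):
    """Tally which vectors each simulated cell receives.

    Assumption (hypothetical): uptake of each vector is an independent
    random event with a fixed per-cell probability.
    """
    rng = random.Random(seed)
    counts = {"neither": 0, "vector1_only": 0, "vector2_only": 0, "both": 0}
    for _ in range(n_cells):
        has1 = rng.random() < p_vector1
        has2 = rng.random() < p_vector2
        if has1 and has2:
            counts["both"] += 1
        elif has1:
            counts["vector1_only"] += 1
        elif has2:
            counts["vector2_only"] += 1
        else:
            counts["neither"] += 1
    return counts


def surviving_cells(counts):
    """Idealized selection: only cells holding both marker fragments can
    reconstitute the full-length marker, so only they survive."""
    return counts["both"]


if __name__ == "__main__":
    counts = simulate_co_delivery(10_000, p_vector1=0.3, p_vector2=0.3)
    print(counts)
    print("survivors after selection:", surviving_cells(counts))
```

Under these assumptions the surviving population consists entirely of double-transgenic cells, whatever fraction of the starting population they represented.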
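In other embodiments, the methods comprise delivering to eukaryotic cells (a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein (e.g., an antibiotic resistance protein or a fluorescent protein) upstream of a nucleotide sequence encoding an N-terminal fragment of a first intein and (ii) a nucleotide sequence encoding a first molecule of interest, (b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the first intein, upstream of a nucleotide sequence encoding a central fragment of the selectable marker protein, upstream of a nucleotide sequence encoding an N-terminal fragment of a second intein and (ii) a nucleotide sequence encoding a second molecule of interest, and (c) a third vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the second intein upstream of a nucleotide sequence encoding a C-terminal fragment of the selectable marker protein and (ii) a nucleotide sequence encoding a third molecule of interest, wherein the N-terminal and C-terminal fragments of the first intein catalyze joining of the N-terminal fragment of the selectable marker protein to the central fragment of the selectable marker protein, and the N-terminal and C-terminal fragments of the second intein catalyze joining of the central fragment of the selectable marker protein to the C-terminal fragment of the selectable marker protein, to produce a full-length selectable marker protein. When the three vectors are delivered to a population of cells (e.g., under transfection conditions), some cells will receive the first vector (i.e., the vector enters the cell), some cells will receive the second vector, some cells will receive the third vector, some cells will receive two different vectors, and some cells will receive all three vectors. Only cells that receive all three vectors can express the full-length functional selectable marker protein, so only these cells are selected as triple-transgenic cells.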
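In yet other embodiments, the methods comprise delivering to eukaryotic cells (a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein (e.g., an antibiotic resistance protein or a fluorescent protein) upstream of a nucleotide sequence encoding an N-terminal fragment of a first intein and (ii) a nucleotide sequence encoding a first molecule of interest, (b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the first intein, upstream of a nucleotide sequence encoding a first central fragment of the selectable marker protein, upstream of a nucleotide sequence encoding an N-terminal fragment of a second intein and (ii) a nucleotide sequence encoding a second molecule of interest, (c) a third vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the second intein, upstream of a nucleotide sequence encoding a second central fragment of the selectable marker protein, upstream of a nucleotide sequence encoding an N-terminal fragment of a third intein and (ii) a nucleotide sequence encoding a third molecule of interest, and (d) a fourth vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the third intein upstream of a nucleotide sequence encoding a C-terminal fragment of the selectable marker protein and (ii) a nucleotide sequence encoding a fourth molecule of interest, wherein the N-terminal and C-terminal fragments of the first intein catalyze joining of the N-terminal fragment of the selectable marker protein to the first central fragment of the selectable marker protein, the N-terminal and C-terminal fragments of the second intein catalyze joining of the first central fragment of the selectable marker protein to the second central fragment of the selectable marker protein, and the N-terminal and C-terminal fragments of the third intein catalyze joining of the second central fragment of the selectable marker protein to the C-terminal fragment of the selectable marker protein, to produce a full-length selectable marker protein. When the four vectors are delivered to a population of cells (e.g., under transfection conditions), some cells will receive the first vector (i.e., the vector enters the cell), some cells will receive the second vector, some cells will receive the third vector, some cells will receive the fourth vector, some cells will receive two different vectors, some cells will receive three different vectors, and some cells will receive all four vectors. Only cells that receive all four vectors can express the full-length functional selectable marker protein, so only these cells are selected as quadruple-transgenic cells.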
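The two-, three-, and four-vector arrangements follow one pattern: marker fragment k carries the C-terminal half of intein k-1 upstream (except on the first vector) and the N-terminal half of intein k downstream (except on the last vector). The sketch below lays out that pattern for any number of vectors and checks whether a given set of delivered vectors could reconstitute the full-length marker. The class, function names, and labels such as `Int1N` are hypothetical and chosen only for illustration. The check is purely combinatorial and says nothing about whether a real split point or intein pair works.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass(frozen=True)
class SplitCassette:
    marker_fragment: int             # 0-based position of the marker fragment
    upstream_intc: Optional[int]     # intein whose C-terminal half precedes it
    downstream_intn: Optional[int]   # intein whose N-terminal half follows it


def design_cassettes(n_vectors: int) -> List[SplitCassette]:
    """Lay out one marker cassette per vector; intein k joins fragments k and k+1."""
    if n_vectors < 2:
        raise ValueError("at least two vectors are needed to split a marker")
    return [
        SplitCassette(
            marker_fragment=i,
            upstream_intc=i - 1 if i > 0 else None,
            downstream_intn=i if i < n_vectors - 1 else None,
        )
        for i in range(n_vectors)
    ]


def describe(cassette: SplitCassette) -> str:
    """Render a cassette as a 5'-to-3' label using 1-based numbering."""
    parts = []
    if cassette.upstream_intc is not None:
        parts.append(f"Int{cassette.upstream_intc + 1}C")
    parts.append(f"Marker[{cassette.marker_fragment + 1}]")
    if cassette.downstream_intn is not None:
        parts.append(f"Int{cassette.downstream_intn + 1}N")
    return "-".join(parts)


def reconstitutes_full_length(cassettes: List[SplitCassette], delivered: Set[int]) -> bool:
    """True only if both halves of every junction intein are present in the cell."""
    present = [c for c in cassettes if c.marker_fragment in delivered]
    intn = {c.downstream_intn for c in present if c.downstream_intn is not None}
    intc = {c.upstream_intc for c in present if c.upstream_intc is not None}
    return all(k in intn and k in intc for k in range(len(cassettes) - 1))


if __name__ == "__main__":
    for cassette in design_cassettes(4):
        print(describe(cassette))
```

Because each junction uses its own intein pair in this layout, a cell missing any one vector leaves at least one junction without a partner, which mirrors the statement that only cells receiving every vector are selected.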
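It is to be understood that any one embodiment described herein, including those disclosed only in the Examples or in a single section of the specification, is intended to be combinable with any one or more other embodiments unless explicitly disclaimed.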
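FIGS. 1A-1B. Split selectable markers for antibiotic co-selection of two separate transgenic vectors. (FIG. 1A) The coding sequence of a selectable marker is split into an N-terminal fragment (MarN) and a C-terminal fragment (kerC), which are cloned separately upstream of the N-terminal fragment of a split intein (IntN) and downstream of the C-terminal fragment of the split intein (IntC), respectively, on two different vectors that each carry a different transgene. The vectors are delivered into cells, yielding subpopulations of cells that contain one or both of the vectors. Protein trans-splicing occurs only in cells that contain both vectors and therefore express both intein-split selectable marker fragments ("markertrons"); it reconstitutes the full-length selectable marker and allows specific selection and enrichment of double-transgenic cells. (FIG. 1B) To screen for intein-compatible split points in antibiotic resistance genes, we identified potential split points according to the junction requirements of the type of intein tested, and then cloned the corresponding N-terminal and C-terminal fragments into split intein scaffolds on lentiviral vectors equipped with the TagBFP2 or mCherry fluorescent protein, which serve as test transgenes for evaluating selection efficiency. These vectors were delivered into cells by lentiviral transduction. The cells were then split into replicate plates; one underwent antibiotic selection while the other was maintained in non-selective medium. After antibiotic selection, the replicate cultures were analyzed by flow cytometry.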
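The replicate-plate comparison described for FIG. 1B comes down to comparing the percentage of double-positive cells in the selected culture with that in the non-selected culture. A minimal sketch of that arithmetic follows. The function names and the fold-enrichment summary are assumptions made for illustration and are not metrics defined in the disclosure; the cell counts in the example are invented placeholders, not data.

```python
def percent_double_positive(n_double_positive: int, n_total: int) -> float:
    """Percentage of analyzed cells positive for both test transgenes."""
    if n_total <= 0:
        raise ValueError("n_total must be positive")
    return 100.0 * n_double_positive / n_total


def fold_enrichment(pct_unselected: float, pct_selected: float) -> float:
    """Ratio of double-positive percentages, selected over non-selected."""
    if pct_unselected <= 0:
        raise ValueError("pct_unselected must be positive")
    return pct_selected / pct_unselected


if __name__ == "__main__":
    # Placeholder counts for illustration only.
    unselected = percent_double_positive(1_000, 10_000)
    selected = percent_double_positive(9_000, 10_000)
    print(unselected, selected, fold_enrichment(unselected, selected))
```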
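FIGS. 2a-2f. Details of the split points of the intein-split resistance (Intres) genes (also referred to as selectable marker genes) and plasmids. (FIG. 2a) Split points for the hygromycin resistance protein (SEQ ID NO: 1). The amino acid sequence of the hygromycin resistance protein is annotated with clouds labeling the split points characterized in this study. Within each label, the top row indicates the plasmid numbers corresponding to Table 1. The bottom row indicates the residue number of the last amino acid in the N-terminal fragment, the species of intein used, and the residue number of the first amino acid in the C-terminal fragment. "^C" indicates the insertion of a cysteine. (FIG. 2b) Split points for the puromycin resistance protein (SEQ ID NO: 2). The amino acid sequence of the puromycin resistance protein is annotated with clouds labeling the split points characterized in this study. Within each label, the top row indicates the plasmid numbers corresponding to Table 1. The bottom row indicates the residue number of the last amino acid in the N-terminal fragment, the species of intein used, and the residue number of the first amino acid in the C-terminal fragment. "^C" indicates the insertion of a cysteine. (FIG. 2c) Split points for the neomycin resistance protein (SEQ ID NO: 3). The amino acid sequence of the neomycin resistance protein is annotated with clouds labeling the split points characterized in this study. Within each label, the top row indicates the plasmid numbers corresponding to Table 1. The bottom row indicates the residue number of the last amino acid in the N-terminal fragment, the species of intein used, and the residue number of the first amino acid in the C-terminal fragment. (FIG. 2d) Split points for the blasticidin resistance protein (SEQ ID NO: 4). The amino acid sequence of the blasticidin resistance protein is annotated with clouds labeling the split points characterized in this study. Within each label, the top row indicates the plasmid numbers corresponding to Table 1. The bottom row indicates the residue number of the last amino acid in the N-terminal fragment, the species of intein used, and the residue number of the first amino acid in the C-terminal fragment. (FIG. 2e) Split points for the green fluorescent protein (SEQ ID NO: 5). (FIG. 2f) Split points for the mScarlet fluorescent protein (SEQ ID NO: 6). The amino acid sequence of the mScarlet protein is annotated with clouds labeling the split points characterized in this study. Within each label, the top row indicates the plasmid numbers corresponding to Table 1. The bottom row indicates the residue number of the last amino acid in the N-terminal fragment, the species of intein used, and the residue number of the first amino acid in the C-terminal fragment. "^C" indicates the insertion of a cysteine.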
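FIG. 3. 2-markertron hygromycin (Hygro) intein-split resistance (Intres) genes. The top schematic shows the split points tested for the hygromycin resistance gene. The last residue of the N-terminal fragment is indicated above each lollipop. Round lollipops indicate split points that use the NpuDnaE intein, and square lollipops indicate split points that use the SspDnaB intein. Crossed-out and shaded lollipops indicate split pairs that failed to confer hygromycin resistance on cells. The column plot below shows the percentage of double-transgenic cells (BFP+ mCherry+) in non-selected (white columns) and selected (blue columns) cultures, as analyzed by flow cytometry.
FIG. 4. 2-markertron puromycin (Puro) Intres genes. The top schematic shows the split points tested for the puromycin resistance gene, while the bottom column plot shows the percentage of double transgenic cells in non-selected (white columns) and selected (brown columns) cultures.
FIG. 5. 2-markertron neomycin (Neo) Intres genes. The top schematic shows the split points tested for the neomycin resistance gene, while the bottom column plot shows the percentage of double transgenic cells in non-selected (white columns) and selected (orange columns) cultures.
FIG. 6. 2-markertron blasticidin (Blast) Intres genes. The top schematic shows the split points tested for the blasticidin resistance gene, while the bottom column plot shows the percentage of double transgenic cells in non-selected (white columns) and selected (cyan columns) cultures.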
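FIGS. 7a-7c. Gateway-compatible lentiviral destination vectors with 2-markertron Intres markers. (FIG. 7a) The Gateway-compatible lentiviral destination vector kit for each split Intres marker consists of an N-vector and a C-vector. The N-vector contains viral LTRs, a CAGGS promoter, and a Gateway destination cassette (AttL, ccdB gene, chloramphenicol resistance gene) that allows LR clonase-mediated recombination with a Gateway donor vector carrying a transgene, followed by an internal ribosome entry site (IRES) that allows polycistronic expression of the N-markertron. Similarly, the C-vector contains the C-markertron and allows recombination of another transgene. (FIG. 7b) TagBFP2 (as transgene 1) and mCherry (as transgene 2) were cloned into the 2-markertron Intres plasmids by Gateway recombination and delivered into cells by lentiviral transduction, followed by antibiotic selection and flow cytometry analysis. The column plot shows the percentage of BFP+ mCherry+ double-positive cells in selected cultures from the 2-markertron hygromycin (Hygro, blue columns), puromycin (Puro, brown columns), and neomycin (Neo, orange columns) experiments versus their corresponding non-selected cultures (white columns). (FIG. 7c) NLS-GFP (as transgene 1), which labels nuclei with GFP fluorescence, and lifeAct-mScarlet (as transgene 2), which labels F-actin with mScarlet fluorescence, were recombined into lentiviral vectors expressing the full non-split hygromycin resistance gene or into lentiviral vectors carrying the 2-markertron hygromycin Intres genes, and were used to transduce U2OS cells to produce double-labeled cells. Representative fluorescence microscopy images show the GFP, mScarlet, and merged channels of the cells after two weeks of hygromycin selection.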
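FIGS. 8a-8c. Split mScarlet for fluorescence-mediated co-selection of two separate transgenic vectors. (FIG. 8a) 2-markertron mScarlet protein. The top schematic shows the split points tested for mScarlet. The last residue of the N-terminal fragment is indicated above each lollipop. (FIG. 8b) To screen for NpuDnaE intein-compatible split points for mScarlet, we identified potential split points according to the junctional requirements of the NpuDnaE intein, and then cloned the corresponding N-terminal and C-terminal fragments into split intein scaffolds on lentiviral vectors equipped with TagBFP2 or EGFP fluorescent proteins, which serve as test transgenes for assessing selection efficiency. These were delivered into cells by lentiviral transduction. Cells carrying both lentiviruses contain the protein splicing machinery and the mScarlet fragments required to reconstitute the full-length mScarlet fluorescent protein, and also express both the TagBFP2 and EGFP transgenes. The cells were subjected to FACS analysis. The boxed schematic shows an example of the FACS analysis for plasmid pair 33+34. The P1 population was gated on forward scatter and side scatter for live singlet cells. Of these, 17.8% of cells are double positive for the TagBFP and EGFP transgenes. When the P1 cells are further gated for mScarlet positivity (mCherry channel), 99.4% of cells are double positive for TagBFP and EGFP. (FIG. 8c) The lower column plot shows the percentage of mScarlet-positive cells for each of the indicated split points. The upper column plot shows the percentage of TagBFP+ EGFP+ cells among the P1 cells (white columns) and among the mScarlet-positive subset of the P1 cells (red columns).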
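FIGS. 9a-9d. Multi-split selectable markers for co-selection of three or more transgenic vectors. (FIG. 9a) The selectable marker is partitioned into three fragments (M1, M2, and M3). The first marker fragment (M1) is fused upstream of the N-terminal fragment of the first split intein (IN1). The second marker fragment (M2) is fused downstream of the C-terminal fragment of the first split intein (IC1) and upstream of the N-terminal fragment of the second split intein (IN2). The third marker fragment (M3) is fused downstream of the C-terminal fragment of the second split intein (IC2). The first split intein catalyzes the joining of M1 to M2, while the second split intein catalyzes the joining of M2 to M3, effectively reconstituting the full selectable marker. (FIG. 9b) Design of a k-split selectable marker through an "intein chain" mechanism. As in the 3-split scenario, the selectable marker is partitioned into k fragments and reconstituted through protein trans-splicing mediated by the intervening split inteins. (FIG. 9c) Split points identified from the 2-split selectable markers were used in combination to generate 3-split selectable markers. The corresponding fragments were cloned into lentiviral vectors to generate, per vector, a 3-split selectable marker construct and a reporter fluorescent transgene. The cells were then transduced with viruses prepared from these vectors and split into selective or non-selective medium. After an appropriate selection period, the cultures were analyzed by flow cytometry. (FIG. 9d) 3-markertron hygromycin (Hygro) Intres. The top schematic shows the split points tested for the hygromycin resistance gene, with the residue number of the last amino acid of the N-terminal fragment shown above round or square lollipops, which represent the NpuDnaE and SspDnaB inteins, respectively. Six 3-markertron hygromycin Intres were tested, each represented by a numbered line with circles or squares indicating the two split points used in that case. The column plot below shows the percentage of triple transgenic (BFP+ GFP+ mCherry+) cells from non-selected (white columns) and selected (blue columns) cultures for the 3-markertron hygromycin Intres indicated by the numbers beneath.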
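FIGS. 10a-10c. Gateway-compatible lentiviral destination vectors with 3-markertron hygromycin Intres genes. (FIG. 10a) Gateway-compatible lentiviral destination vectors with viral LTRs, a CAGGS promoter, and a Gateway destination cassette (AttL, ccdB gene, chloramphenicol resistance gene) that allows LR clonase-mediated recombination with a Gateway donor vector carrying a transgene, followed by an internal ribosome entry site (IRES) that allows polycistronic expression of each of the three 3-split hygromycin markertrons. (FIG. 10b) TagBFP2 (as transgene 1), EGFP (as transgene 2), and mCherry (as transgene 3) were cloned into the 3-split Intres plasmids by Gateway recombination and delivered into cells by lentiviral transduction, followed by antibiotic selection and flow cytometry analysis. (FIG. 10c) The column plot shows the percentage of BFP+ GFP+ mCherry+ triple-positive cells in hygromycin-selected cultures (blue columns) versus their corresponding non-selected cultures (white columns).
FIG. 11. 4-split Hygro Intres. (a) Markertrons of the 4-split Hygro Intres expressed by four different plasmids. Plasmid 115 expresses a markertron generated by fusing amino acids 1-89 of the hygromycin resistance gene [Hygro(1-89)] to NpuDnaE(N) and a leucine zipper A motif (LZA). Plasmid 116 expresses a markertron generated by fusing, from N-terminus to C-terminus, a leucine zipper B motif (LZB)-NpuDnaGEP(C), Hygro(90-200), and SspDnaB(N). Plasmid 117 expresses a markertron generated by fusing, from N-terminus to C-terminus, SspDnaB(C), Hygro(201-240), and NpuDnaE(N)-LZA. Plasmid 118 expresses a markertron generated by fusing LZB-NpuDnaGEP(C) to Hygro(241-341).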
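FIGS. 12a-12e. Intres markers allow enrichment of biallelically targeted cells from CRISPR/Cas-mediated knock-in experiments. Pairs of targeting constructs containing homology arms to the AAVS1 safe harbor locus were designed to contain full-length (FL) non-split or split Intres markers and were tested for their ability to enrich biallelically targeted cells through antibiotic selection. (FIG. 12a) Plasmids 107 and 108 contain an FL neomycin (Neo) resistance gene driven by the endogenous PPP1R12C promoter at the AAVS1 locus, an FL hygromycin (Hygro) gene and an rtTA Dox-responsive transactivator driven by the EF1a promoter, and FL blasticidin (Blast) together with EGFP (plasmid 107) or mScarlet (plasmid 108) expressed from a dox-inducible TetO promoter. Plasmid 106 contains Cas9 and an sgRNA targeting the AAVS1 locus. 2A: self-cleaving 2A peptide. Plasmids 106, 107, and 108 were co-transfected into HEK293T cells, which were split, passaged for two weeks in dox-containing hygromycin, blasticidin, or non-selective medium, and analyzed by flow cytometry to assess the efficiency of biallelic targeting. (FIG. 12b) Plasmids 109 and 110 have a structure similar to that of plasmids 107 and 108 but carry the split Blast Intres in place of FL Blast. (FIG. 12c) Plasmids 111 and 112 contain EF1a-driven FL Blast and TetO-driven FL Hygro, nitroreductase (NTR), and a fluorescent protein (EGFP or mCherry) separated by 2A peptides. (FIG. 12d) Plasmids 113 and 114 are similar to plasmids 111 and 112 but carry the Hygro Intres in place of FL Hygro. (FIG. 12e) Flow cytometry analysis of cells transfected with plasmid 106 (Cas9+AAVS-sgRNA) and the indicated pairs of targeting constructs, two weeks after culture in dox-containing non-selective medium (selection: none), blasticidin selection medium (Blast), or hygromycin selection medium (Hygro).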
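<Detailed Description>
In some aspects, provided herein are methods of producing transgenic (e.g., multi-transgenic, such as double transgenic or triple transgenic) organisms into which more than one transgene (or other genetic element) has been introduced. As shown in FIG. 1a, an exemplary method of the present disclosure comprises delivering to a population of cells (a) a vector encoding a first selectable marker protein fragment upstream from an N-terminal intein protein fragment, together with a first transgene of interest, and (b) another vector encoding a C-terminal intein protein fragment upstream from a second selectable marker protein fragment, together with a second (e.g., different) transgene of interest. Some cells of the population will receive a single vector (carrying only one intein fragment, one selectable marker protein fragment, and a single transgene), while other cells of the population will receive both vectors (and therefore both intein fragments, both selectable marker protein fragments, and both transgenes of interest). In cells that receive both vectors, following translation, the intein protein fragments spontaneously and non-covalently assemble (cooperatively fold) into the intein structure and catalyze the joining of the first selectable marker protein fragment to the second selectable marker protein fragment, producing a full-length selectable marker protein that enables specific selection of these double transgenic cells. For example, if the selectable marker protein is an antibiotic resistance protein, only double-transgenic cells expressing the full-length (functional) antibiotic resistance protein will survive selection in the presence of the particular antibiotic. As another example, if the selectable marker protein is a fluorescent protein, only double-transgenic cells expressing the full-length (functional) fluorescent protein will emit a detectable signal, so that only signal-emitting cells are selected.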
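Another exemplary method of the present disclosure comprises delivering to a population of cells (a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of an antibiotic resistance protein upstream from a nucleotide sequence encoding an N-terminal fragment of a first intein and (ii) a nucleotide sequence encoding a first molecule of interest, (b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the first intein upstream from a nucleotide sequence encoding a central fragment of the antibiotic resistance protein upstream from a nucleotide sequence encoding an N-terminal fragment of a second intein and (ii) a nucleotide sequence encoding a second molecule of interest, and (c) a third vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the second intein upstream from a nucleotide sequence encoding a C-terminal fragment of the antibiotic resistance protein and (ii) a nucleotide sequence encoding a third molecule of interest. Some cells of the population will receive a single vector (carrying only one intein fragment, one selectable marker protein fragment, and a single transgene), while other cells of the population will receive two of the vectors or all three vectors (and, in the latter case, all of the intein fragments, all of the selectable marker protein fragments, and all of the transgenes of interest). In cells that receive all three vectors, following translation, the intein protein fragments spontaneously and non-covalently assemble (cooperatively fold) into the intein structure and catalyze the joining of the N-terminal fragment of the selectable marker protein to the central fragment and the joining of the central fragment to the C-terminal fragment of the selectable marker protein, producing a full-length selectable marker protein that enables specific selection of these triple-transgenic cells. For example, if the selectable marker protein is an antibiotic resistance protein, only triple-transgenic cells expressing the full-length (functional) antibiotic resistance protein will survive selection in the presence of the particular antibiotic. As another example, if the selectable marker protein is a fluorescent protein, only triple-transgenic cells expressing the full-length (functional) fluorescent protein will emit a detectable signal, so that only signal-emitting cells are selected.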
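The chained arrangement described above (marker fragment, then intein N-fragment on one vector; matching intein C-fragment, then the next marker fragment on the following vector) can be modeled as a simple bookkeeping exercise. The following is a minimal sketch, not the disclosed method: the class `Markertron`, the function `reconstitute`, and the placeholder fragment strings are hypothetical, and the sketch assumes that each junction uses a distinctly named intein so that partners can be matched by name.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Markertron:
    """Marker cassette of one vector: [IntC of previous intein] - marker fragment - [IntN of next intein]."""
    left_intein_c: Optional[str]   # intein whose C-terminal fragment sits upstream of the marker fragment
    marker_fragment: str           # placeholder amino acid string
    right_intein_n: Optional[str]  # intein whose N-terminal fragment sits downstream of the marker fragment


def reconstitute(markertrons: Sequence[Markertron]) -> Optional[str]:
    """Return the spliced full-length marker if the cassettes present in one
    cell form a complete chain; otherwise return None (no functional marker).

    Assumption (illustrative): inteins at different junctions have different names.
    """
    starts = [m for m in markertrons if m.left_intein_c is None]
    if len(starts) != 1:
        return None
    by_left = {m.left_intein_c: m for m in markertrons if m.left_intein_c is not None}
    current, product, used = starts[0], starts[0].marker_fragment, 1
    while current.right_intein_n is not None:
        nxt = by_left.get(current.right_intein_n)
        if nxt is None or used >= len(markertrons):
            return None  # partner fragment missing, or a cycle was detected
        product += nxt.marker_fragment  # trans-splicing joins the two exteins
        current, used = nxt, used + 1
    return product if used == len(markertrons) else None


if __name__ == "__main__":
    v1 = Markertron(None, "MKK", "intein1")
    v2 = Markertron("intein1", "PEL", "intein2")
    v3 = Markertron("intein2", "TAT", None)
    print(reconstitute([v1, v2, v3]))  # all three vectors present
    print(reconstitute([v1, v3]))      # central fragment missing
```

The design choice here is to return `None` for any incomplete set of cassettes, mirroring the point in the text that cells lacking any one vector cannot produce the full-length marker.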
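Inteins
Inteins (intervening proteins) carry out a unique auto-processing event known as protein splicing, in which they excise themselves from a larger precursor polypeptide through the cleavage of two peptide bonds and, in the process, ligate the flanking extein (external protein) sequences through the formation of a new peptide bond. This rearrangement occurs post-translationally (or possibly co-translationally), because intein genes are found embedded in frame within other protein-coding genes. Furthermore, intein-mediated protein splicing is spontaneous; it requires no external factor or energy source, only the folding of the intein domain. In effect, the precursor protein contains three segments: an N-extein (the N-terminal portion of the protein), followed by the intein, followed by a C-extein (the C-terminal portion of the protein). After splicing, the resulting protein contains the N-extein linked to the C-extein.
There are two types of inteins: cis-splicing inteins are single polypeptides embedded in a host protein, whereas trans-splicing inteins (referred to as split inteins) are separate polypeptides that mediate protein splicing after the intein pieces and their protein cargo associate (see, e.g., Paulus, H Annu Rev Biochem 69:447-496 (2000); and Saleh L, Perler FB Chem Rec 6:183-193 (2006)). Split inteins catalyze a series of chemical rearrangements that require the intein to be properly assembled and folded. The first step in splicing involves an N-S acyl shift in which the N-extein polypeptide is transferred to the side chain of the first residue of the intein. This is followed by a trans-(thio)esterification reaction in which this acyl unit is transferred to the first residue of the C-extein (which is serine, threonine, or cysteine) to form a branched intermediate. In the penultimate step of the process, this branched intermediate is cleaved from the intein by a transamidation reaction involving the C-terminal asparagine residue of the intein. This sets up the final step of the process, which involves an S-N acyl transfer to create a normal peptide bond between the two exteins (Lockless, SW, Muir, TW PNAS 106(27): 10999-11004 (2009)).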
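To date, there are at least 70 different intein alleles, distinguished by the type of host gene in which the intein is embedded as well as by the integration point within that host gene (Perler, FB Nucleic Acids Res. 30: 383-384 (2002); Pietrokovski, S Trends Genet. 17: 465-472 (2001)). A small fraction (less than 5%) of the identified intein genes encode split inteins. Unlike the more common contiguous inteins, split inteins are transcribed and translated as two separate polypeptides, the N-intein and the C-intein, each fused to one extein. Upon translation, the intein fragments spontaneously and non-covalently assemble (cooperatively fold) into the canonical intein structure to carry out protein splicing in trans. The first two split inteins to be characterized, from the cyanobacteria Synechocystis species PCC6803 (Ssp) and Nostoc punctiforme PCC73102 (Npu), are orthologs naturally found inserted in the α subunit of DNA polymerase III (DnaE). Npu is especially notable for its remarkably fast rate of protein trans-splicing (t1/2 = 50 s at 30 °C). This half-life is significantly shorter than that of Ssp (t1/2 = 80 min at 30 °C) (Shah, NH et al. J. Am. Chem. Soc. 135: 5839 (2013)).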
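To give a feel for what the two reported half-lives imply, the sketch below converts a half-life into the fraction of precursor spliced after a given time. It assumes simple first-order kinetics, which is an illustrative simplification rather than a claim about the cited measurements; the constant and function names are hypothetical, and only the two half-life values come from the text above.

```python
# Half-lives at 30 °C as quoted in the text above.
NPU_HALF_LIFE_S = 50.0          # NpuDnaE: t1/2 = 50 s
SSP_HALF_LIFE_S = 80.0 * 60.0   # SspDnaE: t1/2 = 80 min, converted to seconds


def fraction_spliced(t_seconds: float, half_life_seconds: float) -> float:
    """Fraction of precursor converted to spliced product after t seconds,
    assuming first-order decay of the precursor (illustrative assumption)."""
    if t_seconds < 0 or half_life_seconds <= 0:
        raise ValueError("time must be non-negative and half-life positive")
    return 1.0 - 0.5 ** (t_seconds / half_life_seconds)


if __name__ == "__main__":
    for minutes in (1, 5, 80):
        t = minutes * 60.0
        print(
            f"{minutes:>3} min: Npu {fraction_spliced(t, NPU_HALF_LIFE_S):.3f}, "
            f"Ssp {fraction_spliced(t, SSP_HALF_LIFE_S):.3f}"
        )
```

Under this assumption, the Npu reaction is essentially complete within minutes, whereas the Ssp reaction reaches the halfway point only after 80 minutes.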
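Here, split inteins are used to catalyze the joining of two fragments (e.g., an N-terminal fragment and a C-terminal fragment) of a selectable marker protein, such as an antibiotic resistance protein or a fluorescent protein, to produce a functional full-length protein (see, e.g., FIGS. 1a and 1b).
A split intein may be a natural split intein or an engineered split intein. Natural split inteins occur naturally in a variety of different organisms. The largest known family of split inteins is found within the DnaE genes of at least 20 cyanobacterial species (Caspi J, et al. Mol. Microbiol. 50: 1569-1577 (2003)). Thus, in some embodiments of the present disclosure, the natural split intein is selected from DnaE inteins. Non-limiting examples of DnaE inteins include the Synechocystis sp. DnaE (SspDnaE) intein and the Nostoc punctiforme (NpuDnaE) intein.
In some embodiments, the split intein is an engineered split intein. An engineered split intein may be generated from a contiguous intein (in which case the contiguous intein is artificially split), or may be a modified natural split intein that, for example, promotes efficient protein purification, ligation, modification, and cyclization (e.g., NpuGEP and CfaGEP, as described in AJ PNAS 114(32): 8538-8543 (2017)). Methods of engineering split inteins are described, for example, in Aranko, AS et al. Protein Eng Des Sel. 27(8): 263-271 (2014), which is incorporated herein by reference. In some embodiments, the engineered split intein is engineered from a DnaB intein (Wu, H, et al. Biochim Biophys Acta 1387(1-2): 422-432 (1998)). For example, the engineered split intein may be the SspDnaB S1 intein. In some embodiments, the engineered split intein is engineered from a GyrB intein. For example, the engineered split intein may be the SspGyrB S11 intein.
In some embodiments in which triple-transgenics are produced, for example, the first intein may be the same as the second intein (e.g., both are DnaE inteins). In other embodiments, two different inteins may be used (e.g., a DnaE intein and a DnaB intein). In some embodiments, the first intein is an NpuDnaE intein and the second intein is an NpuDnaE intein.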
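Selectable marker proteins
Transgenic (e.g., double and/or triple transgenic) cells of the present disclosure are selected on the basis of their expression of a full-length selectable marker protein. A selectable marker protein generally confers a trait suitable for artificial selection. Examples of selectable marker proteins include antibiotic resistance proteins and fluorescent proteins.
An antibiotic resistance gene is a gene encoding a protein that confers resistance to a particular antibiotic or class of antibiotics. Non-limiting examples of antibiotic resistance genes for use in eukaryotic cells include genes encoding proteins that confer resistance to hygromycin, G418, puromycin, phleomycin D1, or blasticidin. Non-limiting examples of antibiotic resistance genes for use in prokaryotic cells include genes encoding proteins that confer resistance to hygromycin, G418, puromycin, phleomycin D1, blasticidin, kanamycin, spectinomycin, streptomycin, ampicillin, carbenicillin, bleomycin, erythromycin, polymyxin D, tetracycline, and chloramphenicol.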
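Hygromycin B is an antibiotic produced by the bacterium Streptomyces hygroscopicus. It is an aminoglycoside that kills bacteria, fungi, and higher eukaryotic cells by inhibiting protein synthesis. Hygromycin phosphotransferase (HPT), encoded by the hpt gene (also known as the hph or aphIV gene) originally derived from Escherichia coli, detoxifies the aminocyclitol antibiotic hygromycin B. Thus, in some embodiments, the selectable marker gene of the present disclosure is the hpt gene.
G418 (Geneticin®) is an aminoglycoside antibiotic similar in structure to gentamicin B1. It is produced by Micromonospora rhodorangea. G418 blocks polypeptide synthesis by inhibiting the elongation step in both prokaryotic and eukaryotic cells. Resistance to G418 is conferred by the neo gene from Tn5, which encodes an aminoglycoside 3'-phosphotransferase, APH 3' II. G418 is an analog of neomycin sulfate and has a mechanism similar to that of neomycin. Thus, in some embodiments, the selectable marker gene of the present disclosure is the neo gene.
Puromycin is an aminonucleoside antibiotic derived from Streptomyces alboniger that causes premature chain termination during translation taking place in the ribosome. Puromycin is selective for either prokaryotes or eukaryotes. Resistance to puromycin is conferred through expression of the puromycin N-acetyl-transferase (pac) gene. Thus, in some embodiments, the selectable marker gene of the present disclosure is the pac gene.
Phleomycin D1 (e.g., Zeocin®) is a glycopeptide antibiotic and one of the phleomycins from Streptomyces verticillus, belonging to the bleomycin family of antibiotics. It is a broad-spectrum antibiotic that is effective against most bacteria, filamentous fungi, yeast, plant, and animal cells. It causes cell death by intercalating into DNA and inducing double-strand breaks in the DNA. Resistance to phleomycin D1 is conferred by the product of the Sh ble gene, first isolated from Streptoalloteichus hindustanus. Thus, in some embodiments, the selectable marker gene of the present disclosure is the Sh ble gene.
Blasticidin S is an antibiotic produced by Streptomyces griseochromogenes. Blasticidin prevents the growth of both eukaryotic and prokaryotic cells by inhibiting the termination step of translation and (to a lesser extent) peptide bond formation by the ribosome. Resistance to blasticidin is conferred by at least three different genes: bls (an acetyltransferase) from Streptoverticillum spp.; bsr (a blasticidin-S deaminase) from Bacillus cereus (other bsr genes are also known); and bsd (another deaminase) from Aspergillus terreus. Thus, in some embodiments, the selectable marker gene of the present disclosure is the bls gene, the bsr gene, or the bsd gene.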
Non-limiting examples of fluorescent proteins that may be used as provided herein include TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mScarlet, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4, and iRFP.
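In some embodiments, a full-length selectable marker protein is produced by joining two selectable marker protein fragments in the same cell. In some embodiments, with respect to any full-length protein, one of the fragments is an N-terminal fragment (N-extein), while the other fragment is a C-terminal fragment (C-extein). Thus, in some embodiments, the first antibiotic resistance protein fragment is an N-terminal antibiotic resistance protein fragment, and the second antibiotic resistance protein fragment is a C-terminal antibiotic resistance protein fragment. In other embodiments, the first fluorescent protein fragment is an N-terminal fluorescent protein fragment, and the second fluorescent protein fragment is a C-terminal fluorescent protein fragment.
In other embodiments, a full-length selectable marker protein is produced by joining three or more selectable marker protein fragments in the same cell. In some embodiments, with respect to any full-length protein, one of the fragments is an N-terminal fragment, one or more (e.g., 1, 2, or 3) of the fragments are central fragments, and one of the fragments is a C-terminal fragment.
An N-terminal fragment may be any protein fragment that includes the free amine group (-NH2) of the full-length protein. A C-terminal fragment may be any protein fragment that includes the free carboxyl group (-COOH) of the full-length protein. A central fragment may be any protein fragment located between the N-terminal fragment and the C-terminal fragment of the full-length protein.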
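For example, amino acids 1-89 of the protein encoded by the hygromycin resistance gene (a 341-amino acid protein) may be referred to as the N-terminal protein fragment, while amino acids 90-341 may be referred to as the C-terminal fragment. Similarly, with reference to FIG. 5, amino acids 1-200 of the protein encoded by the hygromycin resistance gene may be referred to as the N-terminal protein fragment, while amino acids 201-341 may be referred to as the C-terminal fragment. FIG. 6 shows further examples in which amino acids 1-53, 1-240, or 1-292 are regarded as the N-terminal protein fragment of the full-length hygromycin resistance protein, with amino acids 54-341, 241-341, or 293-341 as the respective C-terminal fragments.
As another example, amino acids 1-52 of the protein encoded by the hygromycin resistance gene (a 341-amino acid protein) may be referred to as the N-terminal protein fragment, amino acids 53-89 may be referred to as the central protein fragment, and amino acids 90-341 may be referred to as the C-terminal fragment. Similarly, amino acids 1-89 of the protein encoded by the hygromycin resistance gene may be referred to as the N-terminal protein fragment, amino acids 90-240 may be referred to as the central fragment, and amino acids 241-341 may be referred to as the C-terminal fragment.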
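The residue ranges in the examples above follow mechanically from the length of the protein and the last residue of each upstream fragment. The sketch below shows that bookkeeping under stated assumptions: the function name `split_marker` is hypothetical, and a placeholder string of 341 characters stands in for SEQ ID NO: 1, which is not reproduced here.

```python
def split_marker(sequence: str, last_residues: list[int]) -> list[tuple[int, int, str]]:
    """Cut a protein sequence after each 1-based residue number in last_residues.

    Returns (first_residue, last_residue, fragment) tuples ordered from the
    N-terminal fragment, through any central fragments, to the C-terminal fragment.
    """
    n = len(sequence)
    if len(set(last_residues)) != len(last_residues):
        raise ValueError("split points must be distinct")
    if any(r < 1 or r >= n for r in last_residues):
        raise ValueError("each split point must lie strictly inside the protein")
    bounds = [0, *sorted(last_residues), n]
    return [
        (start + 1, end, sequence[start:end])
        for start, end in zip(bounds, bounds[1:])
    ]


if __name__ == "__main__":
    placeholder = "X" * 341  # stand-in for the 341-amino acid protein, not the real sequence
    for first, last, fragment in split_marker(placeholder, [89, 240]):
        print(f"residues {first}-{last} ({len(fragment)} aa)")
```

With split points after residues 89 and 240, the sketch reproduces the 1-89, 90-240, and 241-341 ranges given in the text; a single split point gives the two-fragment case.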
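Transgenes and other molecules of interest
In some embodiments, the methods and compositions of the present disclosure are used to produce multi-transgenic (e.g., double and/or triple transgenic) cells and/or organisms. Thus, in some embodiments, the methods use one vector encoding a first molecule (a first molecule of interest) and another vector encoding a second molecule (a second molecule of interest). In some embodiments, the methods use yet another vector encoding a third molecule of interest. Additional vectors (e.g., encoding additional central fragments of the selectable marker protein) may encode additional molecules of interest. A molecule of interest may be, for example, a polypeptide (e.g., a protein or a peptide) or a polynucleotide (e.g., a nucleic acid, such as DNA or RNA).
In some embodiments, the first molecule (e.g., located on the first vector) is a protein. In some embodiments, the second molecule (e.g., located on the second vector) is a protein. In some embodiments, the third molecule (e.g., located on the third vector) is a protein. Examples of proteins of interest include, but are not limited to, enzymes, cytokines, transcription factors, hormones, growth factors, blood factors, antigens, and antibodies.
In some embodiments, the first molecule is a peptide. In some embodiments, the second molecule is a peptide. In some embodiments, the third molecule is a peptide.
In some embodiments, the first molecule is a messenger RNA (mRNA). In some embodiments, the second molecule is an mRNA. In some embodiments, the third molecule is an mRNA. In some embodiments, the mRNA encodes a vaccine or another antigenic molecule.
In some embodiments, the first molecule is a non-coding RNA (an RNA that does not encode a protein). In some embodiments, the second molecule is a non-coding RNA. In some embodiments, the third molecule is a non-coding RNA. Examples of non-coding RNAs include, but are not limited to, RNA interference molecules, such as microRNA (miRNA), antisense RNA, short-interfering RNA (siRNA), or short-hairpin RNA (shRNA).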
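Vectors
The methods of the present disclosure include the use of at least two or at least three different vectors. A vector is any nucleic acid that can be used as a vehicle to carry exogenous (foreign) genetic material into a cell. In some embodiments, a vector is a DNA sequence that includes an insert (e.g., a transgene) and a larger sequence that serves as the backbone of the vector. Non-limiting examples of vectors include plasmids, viruses/viral vectors, cosmids, and artificial chromosomes, any of which may be used as provided herein. In some embodiments, the vector is a viral vector, such as a viral particle. In some embodiments, the vector is an RNA-based vector, such as a self-replicating RNA vector. In some embodiments, the first vector is a plasmid, and/or the second vector is a plasmid, and/or the third vector is a plasmid. A vector as provided herein includes a promoter operably linked to a nucleic acid encoding a fragment of an intein and a fragment of a selectable marker protein. In some embodiments, the vector also includes a promoter operably linked to a nucleic acid encoding a molecule of interest, such as a transgene.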
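In some embodiments, one vector (e.g., a first vector) comprises a nucleotide sequence encoding a first selectable marker protein fragment upstream from a nucleotide sequence encoding an N-terminal intein protein fragment, while another vector (e.g., a second vector) comprises a nucleotide sequence encoding a C-terminal intein protein fragment upstream from a nucleotide sequence encoding a second selectable marker protein fragment (see, e.g., FIG. 1a). This configuration is equivalent to one vector (e.g., a first vector) comprising a nucleotide sequence encoding an N-terminal intein protein fragment downstream from a nucleotide sequence encoding a first selectable marker protein fragment, and another vector (e.g., a second vector) comprising a nucleotide sequence encoding a second selectable marker protein fragment downstream from a nucleotide sequence encoding a C-terminal intein protein fragment. The terms "upstream" and "downstream" refer to relative positions in a nucleic acid. Each nucleic acid has a 5' end and a 3' end, named for the carbon positions on the deoxyribose (or ribose) ring. For example, when considering double-stranded DNA, upstream is toward the 5' end of the coding strand and downstream is toward the 3' end.
In some embodiments, (a) the first vector comprises a nucleotide sequence encoding an N-terminal fragment of an antibiotic resistance protein upstream from a nucleotide sequence encoding an N-terminal fragment of a first intein, (b) the second vector comprises a nucleotide sequence encoding a C-terminal fragment of the first intein upstream from a nucleotide sequence encoding a central fragment of the antibiotic resistance protein upstream from a nucleotide sequence encoding an N-terminal fragment of a second intein, and (c) the third vector comprises a nucleotide sequence encoding a C-terminal fragment of the second intein upstream from a nucleotide sequence encoding a C-terminal fragment of the antibiotic resistance protein. This configuration is equivalent to (a) a first vector comprising a nucleotide sequence encoding an N-terminal fragment of a first intein downstream from a nucleotide sequence encoding an N-terminal fragment of an antibiotic resistance protein, (b) a second vector comprising a nucleotide sequence encoding an N-terminal fragment of a second intein downstream from a nucleotide sequence encoding a central fragment of the antibiotic resistance protein downstream from a nucleotide sequence encoding a C-terminal fragment of the first intein, and (c) a third vector comprising a nucleotide sequence encoding a C-terminal fragment of the antibiotic resistance protein downstream from a nucleotide sequence encoding a C-terminal fragment of the second intein.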
Cells
The methods of the present disclosure may be used to produce transgenic cells and organisms by introducing the vectors described herein (e.g., the first and second vectors) into host cells. The cells into which the vectors are introduced may be eukaryotic or prokaryotic. In some embodiments, the cells are eukaryotic. Examples of eukaryotic cells for use as provided herein include mammalian cells, plant cells (e.g., crop cells), insect cells (e.g., Drosophila), and fungal cells (e.g., Saccharomyces). Mammalian cells may be, for example, human cells (stem cells or cells from established cell lines), primate cells, equine cells, bovine cells, porcine cells, canine cells, feline cells, or rodent cells (e.g., mouse or rat). Examples of mammalian cells for use as provided herein include, but are not limited to, Chinese hamster ovary (CHO) cells, human embryonic kidney (HEK) 293 cells, HeLa cells, and NS0 cells. In some embodiments, the cells are prokaryotic. Examples of prokaryotic cells for use as provided herein include bacterial cells. Bacterial cells may be, for example, Escherichia spp. (e.g., Escherichia coli), Streptococcus spp. (e.g., Streptococcus pyogenes, Streptococcus viridans, Streptococcus pneumoniae), Neisseria spp. (e.g., Neisseria gonorrhoeae, Neisseria meningitidis), Corynebacterium spp. (e.g., Corynebacterium diphtheriae), Bacillus spp. (e.g., Bacillus anthracis, Bacillus subtilis), Lactobacillus spp., Clostridium spp. (e.g., Clostridium tetani, Clostridium perfringens, Clostridium novyi), Mycobacterium spp. (e.g., Mycobacterium tuberculosis), Shigella spp. (e.g., Shigella flexneri, Shigella dysenteriae), Salmonella spp. (e.g., Salmonella typhi, Salmonella enteritidis), Klebsiella spp. (e.g., Klebsiella pneumoniae), Yersinia spp. (e.g., Yersinia pestis), Serratia spp. (e.g., Serratia marcescens), Pseudomonas spp. (e.g., Pseudomonas aeruginosa, Pseudomonas mallei), Eikenella spp. (e.g., Eikenella corrodens), Haemophilus spp. (e.g., Haemophilus influenzae, Haemophilus ducreyi, Haemophilus aegyptius), Vibrio spp. (e.g., Vibrio cholerae, Vibrio natriegens), Legionella spp. (e.g., Legionella micdadei, Legionella bozemani), Brucella spp. (e.g., Brucella abortus), Mycoplasma spp. (e.g., Mycoplasma pneumoniae), or Streptomyces spp. (e.g., Streptomyces coelicolor, Streptomyces lividans, Streptomyces albus).
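Delivery and selection methods
In some embodiments, the methods of the present disclosure comprise delivering the vectors to a composition comprising cells, and maintaining the composition under conditions that permit introduction of the nucleic acids (e.g., the first, second, and third vectors) into the cells and permit nucleic acid expression in the cells, to produce transgenic cells. The conditions required for introducing nucleic acids (e.g., vectors) into cells are well known. These conditions include, for example, transformation conditions (for prokaryotic cells), transfection conditions (for eukaryotic cells), transduction conditions (via viruses/viral vectors), and electroporation conditions, any of which may be used as provided herein. Thus, in some embodiments, the methods of the present disclosure comprise transfecting eukaryotic (e.g., mammalian) cells, while in other embodiments, the methods comprise transforming prokaryotic (e.g., bacterial) cells.
Selection of transgenic cells, for example multi-transgenic cells such as double, triple, and/or quadruple transgenic cells, depends on the type of selectable marker used. For example, if the selectable marker protein is an antibiotic resistance protein, the selection step may include exposing the cells to the particular antibiotic and selecting only the cells that survive. If the selectable marker protein is a fluorescent protein, the selection step may include simply viewing the cells under a microscope and selecting the cells that fluoresce, or the selection step may include another fluorescence-based selection method, such as fluorescence-activated cell sorting (FACS).
In some embodiments, cells are transduced with viral vectors (e.g., viruses) carrying the nucleic acids described herein. In some embodiments, prior to transduction (or another transfection method), cells are seeded onto well plates (e.g., 12-well plates) at a density of, for example, 1×10^4 to 1×10^6 cells per well. In some embodiments, 100 μL to 500 μL, for example 100, 150, 200, 250, 300, 350, 400, 450, or 500 μL, of each viral vector is added to each well.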
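Kits
The present disclosure also provides kits that may be used, for example, to produce and screen transgenic cells and/or organisms. A kit may include any two or more components as described herein. For example, a kit may include (a) a first vector comprising a nucleotide sequence encoding a first selectable marker protein fragment upstream from a nucleotide sequence encoding an N-terminal intein protein fragment; and (b) a second vector comprising a nucleotide sequence encoding a C-terminal intein protein fragment upstream from a nucleotide sequence encoding a second selectable marker protein fragment, wherein the N-terminal intein protein fragment and the C-terminal intein protein fragment catalyze the joining of the first selectable marker protein fragment to the second selectable marker protein fragment to produce a full-length selectable marker protein.
In some embodiments, a kit includes any two or more components as described herein. For example, a kit may include (a) a first vector comprising a nucleotide sequence encoding an N-terminal fragment of an antibiotic resistance protein upstream from a nucleotide sequence encoding an N-terminal fragment of a first intein, (b) a second vector comprising a nucleotide sequence encoding a C-terminal fragment of the first intein upstream from a nucleotide sequence encoding a central fragment of the antibiotic resistance protein upstream from a nucleotide sequence encoding an N-terminal fragment of a second intein, and (c) a third vector comprising a nucleotide sequence encoding a C-terminal fragment of the second intein upstream from a nucleotide sequence encoding a C-terminal fragment of the antibiotic resistance protein, wherein the N-terminal fragment and the C-terminal fragment of the first intein catalyze the joining of the N-terminal fragment of the antibiotic resistance protein to the central fragment of the antibiotic resistance protein, and the N-terminal fragment and the C-terminal fragment of the second intein catalyze the joining of the central fragment of the antibiotic resistance protein to the C-terminal fragment of the antibiotic resistance protein to produce a full-length antibiotic resistance protein.
In some embodiments, a kit further includes any one or more of the following components: buffers, salts, cloning enzymes (e.g., LR clonase), competent cells (e.g., competent bacterial cells), transfection reagents, antibiotics, and/or instructions for performing the methods described herein.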
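<Additional Embodiments>
Additional embodiments of the present disclosure are encompassed by the following numbered paragraphs:
1. A method comprising delivering to a eukaryotic cell (a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of an antibiotic resistance protein upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a nucleotide sequence encoding a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a nucleotide sequence encoding a C-terminal fragment of the antibiotic resistance protein and (ii) a nucleotide sequence encoding a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze the joining of the N-terminal fragment and the C-terminal fragment of the antibiotic resistance protein to produce a full-length antibiotic resistance protein.
2. The method of paragraph 1, further comprising maintaining the eukaryotic cell under conditions that permit introduction of the first and second vectors into the eukaryotic cell to produce a transgenic eukaryotic cell.
3. The method of paragraph 2, further comprising selecting a transgenic eukaryotic cell that comprises the full-length antibiotic resistance protein.
4. The method of any one of paragraphs 1 to 3, wherein the eukaryotic cell is a mammalian cell.
5. The method of any one of paragraphs 1 to 4, wherein the antibiotic resistance protein confers resistance to hygromycin, G418, puromycin, phleomycin D1, or blasticidin.
6. The method of any one of paragraphs 1 to 5, wherein the intein is a split intein.
7. The method of paragraph 6, wherein the split intein is a natural split intein.
8. The method of paragraph 7, wherein the natural split intein is selected from DnaE inteins.
9. The method of paragraph 8, wherein the DnaE intein is selected from the Synechocystis sp. DnaE (SspDnaE) intein and the Nostoc punctiforme (NpuDnaE) intein.
10. The method of paragraph 6, wherein the split intein is an engineered split intein.
11. The method of paragraph 10, wherein the engineered split intein is engineered from a DnaB intein.
12. The method of paragraph 11, wherein the engineered split intein is the SspDnaB S1 intein.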
13. 단락 12에 있어서, 조작된 분할 인테인이 GyrB 인테인으로부터 조작된 것인 방법.
14. 단락 13에 있어서, 조작된 분할 인테인이 SspGyrB S11 인테인인 방법.
15. 단락 1 내지 14 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 단백질인 방법.
16. 단락 1 내지 15 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 비-코딩 리보핵산 (RNA)인 방법.
17. 단락 16에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 방법.
18. 단락 1 내지 17 중 어느 한 단락에 있어서, 제1 및/또는 제2 벡터가 플라스미드 벡터 또는 바이러스 벡터인 방법.
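19. A method comprising delivering to a eukaryotic cell
(a) a first vector comprising (i) an N-terminal fragment of a hygB gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the hygB gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the hygB gene to the protein fragment encoded by the C-terminal fragment of the hygB gene to produce a full-length hygromycin B phosphotransferase.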
20. 단락 19에 있어서, 제2 hygB 유전자 단편에 의해 코딩된 단백질 단편의 첫 번째 아미노산이 시스테인인 방법.
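21. The method of paragraph 20, wherein:
the protein fragment encoded by the N-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 1-89 of SEQ ID NO: 1, and the protein fragment encoded by the C-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 90-341 of SEQ ID NO: 1;
the protein fragment encoded by the N-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 1-200 of SEQ ID NO: 1, and the protein fragment encoded by the C-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 201-341 of SEQ ID NO: 1;
the protein fragment encoded by the N-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 1-53 of SEQ ID NO: 1, and the protein fragment encoded by the C-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 54-341 of SEQ ID NO: 1;
the protein fragment encoded by the N-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 1-240 of SEQ ID NO: 1, and the protein fragment encoded by the C-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 241-341 of SEQ ID NO: 1; or
the protein fragment encoded by the N-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 1-292 of SEQ ID NO: 1, and the protein fragment encoded by the C-terminal fragment of the hygB gene comprises the amino acid sequence identified by amino acids 293-341 of SEQ ID NO: 1.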
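22. The method of any one of paragraphs 19-21, wherein:
the N-terminal fragment of the intein is identified by SEQ ID NO: 16 and the C-terminal fragment of the intein is identified by SEQ ID NO: 17;
the N-terminal fragment of the intein is identified by SEQ ID NO: 7 and the C-terminal fragment of the intein is identified by SEQ ID NO: 8; or
the N-terminal fragment of the intein is identified by SEQ ID NO: 18 or SEQ ID NO: 9 and the C-terminal fragment of the intein is identified by SEQ ID NO: 19 or SEQ ID NO: 10.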
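23. A method comprising delivering to a eukaryotic cell
(a) a first vector comprising (i) an N-terminal fragment of a bsr gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the bsr gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the bsr gene to the protein fragment encoded by the C-terminal fragment of the bsr gene to produce a full-length blasticidin-S deaminase.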
24. 단락 23에 있어서, bsr 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 4의 아미노산 1-102에 의해 확인된 아미노산 서열을 포함하고, bsr 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 4의 아미노산 103-140에 의해 확인된 아미노산 서열을 포함하는 것인 방법.
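25. The method of paragraph 23 or 24, wherein:
the N-terminal fragment of the intein is identified by SEQ ID NO: 16 and the C-terminal fragment of the intein is identified by SEQ ID NO: 17;
the N-terminal fragment of the intein is identified by SEQ ID NO: 7 and the C-terminal fragment of the intein is identified by SEQ ID NO: 8; or
the N-terminal fragment of the intein is identified by SEQ ID NO: 18 or SEQ ID NO: 9 and the C-terminal fragment of the intein is identified by SEQ ID NO: 19 or SEQ ID NO: 10.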
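26. A method comprising delivering to a eukaryotic cell
(a) a first vector comprising (i) an N-terminal fragment of a pac gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the pac gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the pac gene to the protein fragment encoded by the C-terminal fragment of the pac gene to produce a full-length puromycin N-acetyl-transferase.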
27. 단락 26에 있어서,
pac 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 1-63에 의해 확인된 아미노산 서열을 포함하고, pac 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 64-199에 의해 확인된 아미노산 서열을 포함하거나;
pac 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 1-119에 의해 확인된 아미노산 서열을 포함하고, pac 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 120-199에 의해 확인된 아미노산 서열을 포함하거나;
pac 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 1-100에 의해 확인된 아미노산 서열을 포함하고, pac 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 101-199에 의해 확인된 아미노산 서열을 포함하는 것인 방법.
28. 단락 26 또는 27에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 방법.
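29. A method comprising delivering to a eukaryotic cell
(a) a first vector comprising (i) an N-terminal fragment of a neo gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the neo gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the neo gene to the protein fragment encoded by the C-terminal fragment of the neo gene to produce a full-length aminoglycoside 3'-phosphotransferase.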
30. 단락 29에 있어서,
neo 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 1-133에 의해 확인된 아미노산 서열을 포함하고, neo 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 134-267에 의해 확인된 아미노산 서열을 포함하거나; 또는
neo 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 1-194에 의해 확인된 아미노산 서열을 포함하고, neo 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 195-267에 의해 확인된 아미노산 서열을 포함하는 것인 방법.
31. 단락 29 또는 30에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 방법.
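32. A method comprising delivering to a composition comprising eukaryotic cells
(a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a fluorescent protein upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a nucleotide sequence encoding a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the fluorescent protein and (ii) a nucleotide sequence encoding a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the N-terminal fragment of the fluorescent protein to the C-terminal fragment of the fluorescent protein to produce a full-length fluorescent protein.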
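33. The method of paragraph 32, further comprising maintaining the eukaryotic cells under conditions that permit introduction of the first and second vectors into the eukaryotic cells to produce transgenic eukaryotic cells.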
34. 단락 33에 있어서, 전장 형광 단백질을 포함하는 트랜스제닉 진핵생물 세포를 선택하는 것을 추가로 포함하는 방법.
35. 단락 32 내지 34 중 어느 한 단락에 있어서, 진핵생물 세포가 포유동물 세포인 방법.
36. The method of any one of paragraphs 32-35, wherein the fluorescent protein is selected from TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4, and iRFP.
37. 단락 32 내지 36 중 어느 한 단락에 있어서, 인테인이 분할 인테인인 방법.
38. 단락 37에 있어서, 분할 인테인이 자연 분할 인테인인 방법.
39. 단락 38에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 방법.
40. 단락 39에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 방법.
41. 단락 40에 있어서, 분할 인테인이 조작된 분할 인테인인 방법.
42. 단락 41에 있어서, 조작된 분할 인테인이 DnaB 인테인으로부터 조작된 것인 방법.
43. 단락 42에 있어서, 조작된 분할 인테인이 SspDnaB S1 인테인인 방법.
44. 단락 42에 있어서, 조작된 분할 인테인이 GyrB 인테인으로부터 조작된 것인 방법.
45. 단락 44에 있어서, 조작된 분할 인테인이 SspGyrB S11 인테인인 방법.
46. 단락 32 내지 45 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 단백질인 방법.
47. 단락 32 내지 46 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 비-코딩 리보핵산 (RNA)인 방법.
48. 단락 47에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 방법.
49. 단락 32 내지 48 중 어느 한 단락에 있어서, 제1 및/또는 제2 벡터가 플라스미드 벡터 또는 바이러스 벡터인 방법.
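50. A method comprising delivering to a eukaryotic cell
(a) a first vector comprising (i) an N-terminal fragment of an egfp gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the egfp gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the egfp gene to the protein fragment encoded by the C-terminal fragment of the egfp gene to produce a full-length EGFP protein.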
51. 단락 50에 있어서, egfp 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 5의 아미노산 1-175에 의해 확인된 아미노산 서열을 포함하고, egfp 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 5의 아미노산 175-239에 의해 확인된 아미노산 서열을 포함하는 것인 방법.
52. 단락 50 또는 51에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 방법.
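53. A method comprising delivering to a eukaryotic cell
(a) a first vector comprising (i) an N-terminal fragment of an mScarlet gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the mScarlet gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the mScarlet gene to the protein fragment encoded by the C-terminal fragment of the mScarlet gene to produce a full-length mScarlet protein.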
54. 단락 53에 있어서,
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-46에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 47-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-48에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 49-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-51에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 52-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-75에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 76-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-122에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 123-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-140에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 141-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-163에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 164-232에 의해 확인된 아미노산 서열을 포함하는 것인 방법.
55. 단락 53 또는 54에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 방법.
56. (a) (i) 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제1 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터; 및
(b) (i) 항생제 내성 단백질의 C-말단 단편으로부터 상류에 있는, 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제2 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터
를 포함하며, 여기서 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 N-말단 단편 및 C-말단 단편의 연결을 촉매하여 전장 항생제 내성 단백질을 생성하는 것인 진핵생물 세포.
57. 단락 56에 있어서, 진핵생물 세포가 포유동물 세포인 세포.
58. 단락 56 또는 57에 있어서, 항생제 내성 단백질이 히그로마이신, G418, 퓨로마이신, 플레오마이신 D1 또는 블라스티시딘에 대한 내성을 부여하는 것인 세포.
59. 단락 56 내지 58 중 어느 한 단락에 있어서, 인테인이 분할 인테인인 세포.
60. 단락 59에 있어서, 분할 인테인이 자연 분할 인테인인 세포.
61. 단락 60에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 세포.
62. 단락 61에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 세포.
63. 단락 59에 있어서, 분할 인테인이 조작된 분할 인테인인 세포.
64. 단락 63에 있어서, 조작된 분할 인테인이 DnaB 인테인으로부터 조작된 것인 세포.
65. 단락 64에 있어서, 조작된 분할 인테인이 SspDnaB S1 인테인인 세포.
66. 단락 65에 있어서, 조작된 분할 인테인이 GyrB 인테인으로부터 조작된 것인 세포.
67. 단락 66에 있어서, 조작된 분할 인테인이 SspGyrB S11 인테인인 세포.
68. 단락 56 내지 67 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 단백질인 세포.
69. 단락 56 내지 68 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 비-코딩 리보핵산 (RNA)인 세포.
70. 단락 69에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 세포.
71. 단락 56 내지 70 중 어느 한 단락에 있어서, 제1 및/또는 제2 벡터가 플라스미드 벡터 또는 바이러스 벡터인 세포.
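72. A cell comprising
(a) a first vector comprising (i) an N-terminal fragment of a hygB gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the hygB gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the hygB gene to the protein fragment encoded by the C-terminal fragment of the hygB gene to produce a full-length hygromycin B phosphotransferase.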
73. 단락 72에 있어서, 제2 hygB 유전자 단편에 의해 코딩된 단백질 단편의 첫 번째 아미노산이 시스테인인 세포.
74. 단락 73에 있어서,
hygB 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 1-89에 의해 확인된 아미노산 서열을 포함하고, hygB 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 90-341에 의해 확인된 아미노산 서열을 포함하거나;
hygB 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 1-200에 의해 확인된 아미노산 서열을 포함하고, hygB 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 201-341에 의해 확인된 아미노산 서열을 포함하거나;
hygB 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 1-53에 의해 확인된 아미노산 서열을 포함하고, hygB 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 54-341에 의해 확인된 아미노산 서열을 포함하거나;
hygB 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 1-240에 의해 확인된 아미노산 서열을 포함하고, hygB 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 241-341에 의해 확인된 아미노산 서열을 포함하거나;
hygB 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 1-292에 의해 확인된 아미노산 서열을 포함하고, hygB 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 1의 아미노산 293-341에 의해 확인된 아미노산 서열을 포함하는 것인 세포.
75. 단락 72 내지 74 중 어느 한 단락에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 세포.
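76. A eukaryotic cell comprising
(a) a first vector comprising (i) an N-terminal fragment of a bsr gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the bsr gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the bsr gene to the protein fragment encoded by the C-terminal fragment of the bsr gene to produce a full-length blasticidin-S deaminase.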
77. 단락 76에 있어서, bsr 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 4의 아미노산 1-102에 의해 확인된 아미노산 서열을 포함하고, bsr 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 4의 아미노산 103-140에 의해 확인된 아미노산 서열을 포함하는 것인 세포.
78. 단락 76 또는 77에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 세포.
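79. A eukaryotic cell comprising
(a) a first vector comprising (i) an N-terminal fragment of a pac gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the pac gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the pac gene to the protein fragment encoded by the C-terminal fragment of the pac gene to produce a full-length puromycin N-acetyl-transferase.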
80. 단락 79에 있어서,
pac 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 1-63에 의해 확인된 아미노산 서열을 포함하고, pac 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 64-199에 의해 확인된 아미노산 서열을 포함하거나;
pac 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 1-119에 의해 확인된 아미노산 서열을 포함하고, pac 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 120-199에 의해 확인된 아미노산 서열을 포함하거나;
pac 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 1-100에 의해 확인된 아미노산 서열을 포함하고, pac 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 2의 아미노산 101-199에 의해 확인된 아미노산 서열을 포함하는 것인 세포.
81. 단락 79 또는 80에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 세포.
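82. A eukaryotic cell comprising
(a) a first vector comprising (i) an N-terminal fragment of a neo gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the neo gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the neo gene to the protein fragment encoded by the C-terminal fragment of the neo gene to produce a full-length aminoglycoside 3'-phosphotransferase.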
83. 단락 82에 있어서,
neo 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 1-133에 의해 확인된 아미노산 서열을 포함하고, neo 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 134-267에 의해 확인된 아미노산 서열을 포함하거나; 또는
neo 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 1-194에 의해 확인된 아미노산 서열을 포함하고, neo 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 3의 아미노산 195-267에 의해 확인된 아미노산 서열을 포함하는 것인 세포.
84. 단락 82 또는 83에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 세포.
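85. A eukaryotic cell comprising
(a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a fluorescent protein upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a nucleotide sequence encoding a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the fluorescent protein and (ii) a nucleotide sequence encoding a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the N-terminal fragment of the fluorescent protein to the C-terminal fragment of the fluorescent protein to produce a full-length fluorescent protein.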
86. 단락 85에 있어서, 트랜스제닉 진핵생물 세포를 생성하기 위해 진핵생물 세포로의 제1 및 제2 벡터의 도입을 허용하는 조건 하에 진핵생물 세포를 유지하는 것을 추가로 포함하는 세포.
87. 단락 86에 있어서, 전장 형광 단백질을 포함하는 트랜스제닉 진핵생물 세포를 선택하는 것을 추가로 포함하는 세포.
88. 단락 85 내지 87 중 어느 한 단락에 있어서, 진핵생물 세포가 포유동물 세포인 세포.
89. The cell of any one of paragraphs 85-88, wherein the fluorescent protein is selected from TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4, and iRFP.
90. 단락 85 내지 89 중 어느 한 단락에 있어서, 인테인이 분할 인테인인 세포.
91. 단락 90에 있어서, 분할 인테인이 자연 분할 인테인인 세포.
92. 단락 91에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 세포.
93. 단락 92에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 세포.
94. 단락 93에 있어서, 분할 인테인이 조작된 분할 인테인인 세포.
95. 단락 94에 있어서, 조작된 분할 인테인이 DnaB 인테인으로부터 조작된 것인 세포.
96. 단락 95에 있어서, 조작된 분할 인테인이 SspDnaB S1 인테인인 세포.
97. 단락 95에 있어서, 조작된 분할 인테인이 GyrB 인테인으로부터 조작된 것인 세포.
98. 단락 97에 있어서, 조작된 분할 인테인이 SspGyrB S11 인테인인 세포.
99. 단락 85 내지 98 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 단백질인 세포.
100. 단락 85 내지 99 중 어느 한 단락에 있어서, 제1 및/또는 제2 분자가 비-코딩 리보핵산 (RNA)인 세포.
101. 단락 100에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 세포.
102. 단락 85 내지 101 중 어느 한 단락에 있어서, 제1 및/또는 제2 벡터가 플라스미드 벡터 또는 바이러스 벡터인 세포.
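103. A eukaryotic cell comprising
(a) a first vector comprising (i) an N-terminal fragment of an egfp gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the egfp gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the egfp gene to the protein fragment encoded by the C-terminal fragment of the egfp gene to produce a full-length EGFP protein.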
104. 단락 103에 있어서, egfp 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 5의 아미노산 1-175에 의해 확인된 아미노산 서열을 포함하고, egfp 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 5의 아미노산 175-239에 의해 확인된 아미노산 서열을 포함하는 것인 세포.
105. 단락 103 또는 104에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 세포.
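106. A eukaryotic cell comprising
(a) a first vector comprising (i) an N-terminal fragment of an mScarlet gene upstream from a nucleotide sequence encoding an N-terminal fragment of an intein and (ii) a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein upstream from a C-terminal fragment of the mScarlet gene and (ii) a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the protein fragment encoded by the N-terminal fragment of the mScarlet gene to the protein fragment encoded by the C-terminal fragment of the mScarlet gene to produce a full-length mScarlet protein.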
107. 단락 106에 있어서,
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-46에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 47-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-48에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 49-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-51에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 52-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-75에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 76-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-122에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 123-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-140에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 141-232에 의해 확인된 아미노산 서열을 포함하거나;
mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 1-163에 의해 확인된 아미노산 서열을 포함하고, mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편이 서열식별번호: 6의 아미노산 164-232에 의해 확인된 아미노산 서열을 포함하는 것인 세포.
108. 단락 106 또는 107에 있어서,
인테인의 N-말단 단편이 서열식별번호: 16에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 17에 의해 확인되거나;
인테인의 N-말단 단편이 서열식별번호: 7에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 8에 의해 확인되거나; 또는
인테인의 N-말단 단편이 서열식별번호: 18 또는 서열식별번호: 9에 의해 확인되고, 인테인의 C-말단 단편이 서열식별번호: 19 또는 서열식별번호: 10에 의해 확인된 것인 세포.
109. 단락 85 내지 108 중 어느 한 단락의 세포를 포함하는 조성물.
110. (a) 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터; 및
(b) 항생제 내성 단백질의 C-말단 단편으로부터 상류에 있는, 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터
를 포함하며, 여기서 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 N-말단 단편 및 C-말단 단편의 연결을 촉매하여 전장 항생제 내성 단백질을 생성하는 것인 키트.
111. 단락 110에 있어서, 항생제 내성 단백질이 히그로마이신, G418, 퓨로마이신, 플레오마이신 D1 또는 블라스티시딘에 대한 내성을 부여하는 것인 키트.
112. (a) 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 형광 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터; 및
(b) 형광 단백질의 C-말단 단편으로부터 상류에 있는, 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터
를 포함하며, 여기서 인테인의 N-말단 단편 및 C-말단 단편은 형광 단백질의 C-말단 단편으로의 형광 단백질의 N-말단 단편의 연결을 촉매하여 전장 형광 단백질을 생성하는 것인 키트.
113. The kit of paragraph 112, wherein the fluorescent protein is selected from TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4, and iRFP.
114. 단락 110 내지 113 중 어느 한 단락에 있어서, 인테인이 분할 인테인인 키트.
115. 단락 114에 있어서, 분할 인테인이 자연 분할 인테인 또는 조작된 분할 인테인인 키트.
116. 단락 115에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 키트.
117. 단락 116에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 키트.
118. 단락 115에 있어서, 조작된 분할 인테인이 DnaB 인테인 또는 GyrB 인테인으로부터 조작된 것인 키트.
119. 단락 118에 있어서, 조작된 분할 인테인이 SspDnaB S1 인테인인 키트.
120. 단락 118에 있어서, 조작된 분할 인테인이 SspGyrB S11 인테인인 키트.
121. 단락 112 내지 120 중 어느 한 단락에 있어서, 다음 구성성분: 완충제, 염, 클로닝 효소, 감응성 세포, 형질감염 시약, 항생제, 및/또는 본원에 기재된 방법의 수행에 대한 지침서 중 임의의 하나 이상을 추가로 포함하는 키트.
122. (a) (i) 제1 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제1 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터,
(b) (i) 제2 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 중심 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제1 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제2 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터, 및
(c) (i) 항생제 내성 단백질의 C-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제2 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제3 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제3 벡터
를 진핵생물 세포로 전달하는 것을 포함하며, 여기서 제1 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 중심 단편으로의 항생제 내성 단백질의 N-말단 단편의 연결을 촉매하고, 제2 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 C-말단 단편으로의 항생제 내성 단백질의 중심 단편의 연결을 촉매하여 전장 항생제 내성 단백질을 생성하는 것인 방법.
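123. The method of paragraph 122, further comprising maintaining the eukaryotic cell under conditions that permit introduction of the first, second, and third vectors into the eukaryotic cell to produce a transgenic eukaryotic cell.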
124. 단락 123에 있어서, 전장 항생제 내성 단백질을 포함하는 트랜스제닉 진핵생물 세포를 선택하는 것을 추가로 포함하는 방법.
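125. The method of any one of paragraphs 122-124, wherein the eukaryotic cell is a mammalian cell.
126. The method of any one of paragraphs 122-125, wherein the antibiotic resistance protein confers resistance to hygromycin, G418, puromycin, phleomycin D1, or blasticidin.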
127. 단락 126에 있어서, 항생제 내성 단백질이 히그로마이신에 대한 내성을 부여하는 것인 방법.
128. The method of any one of paragraphs 122-127, wherein the first intein is a split intein.
129. The method of any one of paragraphs 122-128, wherein the second intein is a split intein.
130. 단락 128 또는 129에 있어서, 분할 인테인이 자연 분할 인테인인 방법.
131. 단락 130에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 방법.
132. 단락 131에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 방법.
133. 단락 132에 있어서, 제1 인테인이 NpuDnaE 인테인이고 제2 인테인이 NpuDnaE 인테인인 방법.
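134. The method of any one of paragraphs 122-133, wherein the first molecule of interest, the second molecule of interest, the third molecule of interest, or any combination thereof is a protein.
135. The method of any one of paragraphs 122-133, wherein the first molecule of interest, the second molecule of interest, the third molecule of interest, or any combination thereof is a non-coding ribonucleic acid (RNA).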
136. 단락 135에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 방법.
137. The method of any one of paragraphs 122-136, wherein the first vector, the second vector, the third vector, or any combination thereof is a plasmid vector or a viral vector.
138. (a) (i) 제1 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, hygB 유전자의 N-말단 단편 및 (ii) 관심있는 제1 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터,
(b) (i) 제2 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, hygB 유전자의 중심 단편으로부터 상류에 있는, 제1 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제2 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터, 및
(c) (i) hygB 유전자의 C-말단 단편으로부터 상류에 있는, 제2 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제3 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제3 벡터
를 진핵생물 세포로 전달하는 것을 포함하며, 여기서 제1 인테인의 N-말단 단편 및 C-말단 단편은 hygB 유전자의 중심 단편에 의해 코딩된 단백질 단편으로의 hygB 유전자의 N-말단 단편에 의해 코딩된 단백질 단편의 연결을 촉매하고, 제2 인테인의 N-말단 단편 및 C-말단 단편은 hygB 유전자의 C-말단 단편에 의해 코딩된 단백질 단편으로의 hygB 유전자의 중심 단편에 의해 코딩된 단백질 단편의 연결을 촉매하여 전장 히그로마이신 B 포스포트랜스퍼라제를 생성하는 것인 방법.
139. 단락 138에 있어서, 제1 벡터가 서열식별번호: 29에 의해 확인된 서열을 코딩하고, 제2 벡터가 서열식별번호: 61에 의해 확인된 서열을 코딩하고, 제3 벡터가 서열식별번호: 23에 의해 확인된 서열을 코딩하는 것인 방법.
140. 단락 138에 있어서, 제1 벡터가 서열식별번호: 21에 의해 확인된 서열을 코딩하고, 제2 벡터가 서열식별번호: 61에 의해 확인된 서열을 코딩하고, 제3 벡터가 서열식별번호: 35에 의해 확인된 서열을 코딩하는 것인 방법.
141. (a) (i) 제1 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제1 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터,
(b) (i) 제2 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 중심 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제1 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제2 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터, 및
(c) (i) 항생제 내성 단백질의 C-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제2 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제3 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제3 벡터
를 포함하며, 여기서 제1 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 중심 단편으로의 항생제 내성 단백질의 N-말단 단편의 연결을 촉매하고, 제2 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 C-말단 단편으로의 항생제 내성 단백질의 중심 단편의 연결을 촉매하여 전장 항생제 내성 단백질을 생성하는 것인 진핵생물 세포.
142. The eukaryotic cell of paragraph 141, wherein the eukaryotic cell is a mammalian cell.
143. 단락 141 또는 142에 있어서, 항생제 내성 단백질이 히그로마이신, G418, 퓨로마이신, 플레오마이신 D1 또는 블라스티시딘에 대한 내성을 부여하는 것인 진핵생물 세포.
144. 단락 143에 있어서, 항생제 내성 단백질이 히그로마이신에 대한 내성을 부여하는 것인 진핵생물 세포.
145. 단락 141 내지 144 중 어느 한 단락에 있어서, 제1 인테인이 분할 인테인인 진핵생물 세포.
146. 단락 142 내지 145 중 어느 한 단락에 있어서, 제2 인테인이 분할 인테인인 진핵생물 세포.
147. 단락 145 또는 146에 있어서, 분할 인테인이 자연 분할 인테인인 진핵생물 세포.
148. 단락 147에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 진핵생물 세포.
149. 단락 148에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 진핵생물 세포.
150. 단락 149에 있어서, 제1 인테인이 NpuDnaE 인테인이고 제2 인테인이 NpuDnaE 인테인인 진핵생물 세포.
151. 단락 142 내지 150 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 단백질인 진핵생물 세포.
152. 단락 142 내지 150 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 비-코딩 리보핵산 (RNA)인 진핵생물 세포.
153. 단락 152에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 진핵생물 세포.
154. 단락 142 내지 153 중 어느 한 단락에 있어서, 제1 벡터, 제2 벡터, 제3 벡터 또는 이들의 임의의 조합이 플라스미드 벡터 또는 바이러스 벡터인 진핵생물 세포.
155. 단락 142 내지 154 중 어느 한 단락의 진핵생물 세포를 포함하는 조성물.
156. (a) 제1 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터,
(b) 제2 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 항생제 내성 단백질의 중심 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제1 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터, 및
(c) 항생제 내성 단백질의 C-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제2 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열을 포함하는 제3 벡터
를 포함하며, 여기서 제1 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 중심 단편으로의 항생제 내성 단백질의 N-말단 단편의 연결을 촉매하고, 제2 인테인의 N-말단 단편 및 C-말단 단편은 항생제 내성 단백질의 C-말단 단편으로의 항생제 내성 단백질의 중심 단편의 연결을 촉매하여 전장 항생제 내성 단백질을 생성하는 것인 키트.
157. 단락 156에 있어서, 항생제 내성 단백질이 히그로마이신, G418, 퓨로마이신, 플레오마이신 D1 또는 블라스티시딘에 대한 내성을 부여하는 것인 키트.
158. 단락 157에 있어서, 항생제 내성 단백질이 히그로마이신에 대한 내성을 부여하는 것인 키트.
159. 단락 156 내지 158 중 어느 한 단락에 있어서, 제1 인테인이 분할 인테인인 키트.
160. 단락 156 내지 159 중 어느 한 단락에 있어서, 제2 인테인이 분할 인테인인 키트.
161. 단락 159 또는 160에 있어서, 분할 인테인이 자연 분할 인테인인 키트.
162. 단락 161에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 키트.
163. 단락 162에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 키트.
164. 단락 163에 있어서, 제1 인테인이 NpuDnaE 인테인이고 제2 인테인이 NpuDnaE 인테인인 키트.
165. 단락 156 내지 164 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 단백질인 키트.
166. 단락 156 내지 164 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 비-코딩 리보핵산 (RNA)인 키트.
167. 단락 166에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 키트.
168. 단락 156 내지 167 중 어느 한 단락에 있어서, 제1 벡터, 제2 벡터, 제3 벡터 또는 이들의 임의의 조합이 플라스미드 벡터 또는 바이러스 벡터인 키트.
169. (a) (i) 제1 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 형광 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제1 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터,
(b) (i) 제2 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 형광 단백질의 중심 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제1 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제2 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터, 및
(c) (i) 형광 단백질의 C-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제2 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제3 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제3 벡터
를 진핵생물 세포로 전달하는 것을 포함하며, 여기서 제1 인테인의 N-말단 단편 및 C-말단 단편은 형광 단백질의 중심 단편으로의 형광 단백질의 N-말단 단편의 연결을 촉매하고, 제2 인테인의 N-말단 단편 및 C-말단 단편은 형광 단백질의 C-말단 단편으로의 형광 단백질의 중심 단편의 연결을 촉매하여 전장 형광 단백질을 생성하는 것인 방법.
170. 단락 169에 있어서, 트랜스제닉 진핵생물 세포를 생성하기 위해 진핵생물 세포로의 제1, 제2 및 제3 벡터의 도입을 허용하는 조건 하에 진핵생물 세포를 유지하는 것을 추가로 포함하는 방법.
171. 단락 170에 있어서, 전장 형광 단백질을 포함하는 트랜스제닉 진핵생물 세포를 선택하는 것을 추가로 포함하는 방법.
172. 단락 169 내지 171 중 어느 한 단락에 있어서, 진핵생물 세포가 포유동물 세포인 방법.
173. The method of any one of paragraphs 169-172, wherein the fluorescent protein is selected from TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mScarlet, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4, and iRFP.
174. 단락 173에 있어서, 형광 단백질이 mScarlet인 방법.
175. 단락 169 내지 174 중 어느 한 단락에 있어서, 제1 인테인이 분할 인테인인 방법.
176. 단락 169 내지 175 중 어느 한 단락에 있어서, 제2 인테인이 분할 인테인인 방법.
177. 단락 175 또는 176에 있어서, 분할 인테인이 자연 분할 인테인인 방법.
178. 단락 177에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 방법.
179. 단락 178에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 방법.
180. 단락 179에 있어서, 제1 인테인이 NpuDnaE 인테인이고 제2 인테인이 NpuDnaE 인테인인 방법.
181. 단락 169 내지 170 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 단백질인 방법.
182. 단락 169 내지 180 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 비-코딩 리보핵산 (RNA)인 방법.
183. 단락 182에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 방법.
184. 단락 169 내지 183 중 어느 한 단락에 있어서, 제1 벡터, 제2 벡터, 제3 벡터 또는 이들의 임의의 조합이 플라스미드 벡터 또는 바이러스 벡터인 방법.
185. (a) (i) 제1 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, mScarlet 유전자의 N-말단 단편 및 (ii) 관심있는 제1 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터,
(b) (i) 제2 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, mScarlet 유전자의 중심 단편으로부터 상류에 있는, 제1 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제2 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터, 및
(c) (i) mScarlet 유전자의 C-말단 단편으로부터 상류에 있는, 제2 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제3 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제3 벡터
를 진핵생물 세포로 전달하는 것을 포함하며, 여기서 제1 인테인의 N-말단 단편 및 C-말단 단편은 mScarlet 유전자의 중심 단편에 의해 코딩된 단백질 단편으로의 mScarlet 유전자의 N-말단 단편에 의해 코딩된 단백질 단편의 연결을 촉매하고, 제2 인테인의 N-말단 단편 및 C-말단 단편은 mScarlet 유전자의 C-말단 단편에 의해 코딩된 단백질 단편으로의 mScarlet 유전자의 중심 단편에 의해 코딩된 단백질 단편의 연결을 촉매하여 전장 mScarlet 단백질을 생성하는 것인 방법.
186. 단락 185에 있어서, 제1 벡터가 서열식별번호: 121에 의해 확인된 서열을 코딩하고, 제2 벡터가 서열식별번호: 123에 의해 확인된 서열을 코딩하고, 제3 벡터가 서열식별번호: 125에 의해 확인된 서열을 코딩하는 것인 방법.
187. (a) (i) 제1 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 형광 단백질의 N-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제1 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제1 벡터,
(b) (i) 제2 인테인의 N-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 형광 단백질의 중심 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제1 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제2 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제2 벡터, 및
(c) (i) 형광 단백질의 C-말단 단편을 코딩하는 뉴클레오티드 서열로부터 상류에 있는, 제2 인테인의 C-말단 단편을 코딩하는 뉴클레오티드 서열 및 (ii) 관심있는 제3 분자를 코딩하는 뉴클레오티드 서열을 포함하는 제3 벡터
를 포함하며, 여기서 제1 인테인의 N-말단 단편 및 C-말단 단편은 형광 단백질의 중심 단편으로의 형광 단백질의 N-말단 단편의 연결을 촉매하고, 제2 인테인의 N-말단 단편 및 C-말단 단편은 형광 단백질의 C-말단 단편으로의 형광 단백질의 중심 단편의 연결을 촉매하여 전장 형광 단백질을 생성하는 것인 진핵생물 세포.
188. 단락 187에 있어서, 진핵생물 세포가 포유동물 세포인 진핵생물 세포.
189. The eukaryotic cell of paragraph 187 or 188, wherein the fluorescent protein is selected from TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mScarlet, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4, and iRFP.
190. 단락 189에 있어서, 형광 단백질이 mScarlet인 진핵생물 세포.
191. 단락 187 내지 190 중 어느 한 단락에 있어서, 제1 인테인이 분할 인테인인 진핵생물 세포.
192. 단락 185 내지 191 중 어느 한 단락에 있어서, 제2 인테인이 분할 인테인인 진핵생물 세포.
193. 단락 191 또는 192에 있어서, 분할 인테인이 자연 분할 인테인인 진핵생물 세포.
194. 단락 193에 있어서, 자연 분할 인테인이 DnaE 인테인으로부터 선택된 것인 진핵생물 세포.
195. 단락 194에 있어서, DnaE 인테인이 시네코시스티스 sp. DnaE (SspDnaE) 인테인 및 노스톡 푼크티포르메 (NpuDnaE) 인테인으로부터 선택된 것인 진핵생물 세포.
196. 단락 195에 있어서, 제1 인테인이 NpuDnaE 인테인이고 제2 인테인이 NpuDnaE 인테인인 진핵생물 세포.
197. 단락 185 내지 196 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 단백질인 진핵생물 세포.
198. 단락 185 내지 196 중 어느 한 단락에 있어서, 관심있는 제1 분자, 관심있는 제2 분자, 관심있는 제3 분자 또는 이들의 임의의 조합이 비-코딩 리보핵산 (RNA)인 진핵생물 세포.
199. 단락 198에 있어서, 비-코딩 RNA가 마이크로RNA (miRNA), 안티센스 RNA, 짧은-간섭 RNA (siRNA) 또는 짧은-헤어핀 RNA (shRNA)인 진핵생물 세포.
200. 단락 185 내지 199 중 어느 한 단락에 있어서, 제1 벡터, 제2 벡터, 제3 벡터 또는 이들의 임의의 조합이 플라스미드 벡터 또는 바이러스 벡터인 진핵생물 세포.
201. 단락 185 내지 200 중 어느 한 단락의 진핵생물 세포를 포함하는 조성물.
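202. A kit comprising
(a) a first vector comprising a nucleotide sequence encoding an N-terminal fragment of a fluorescent protein, upstream from a nucleotide sequence encoding an N-terminal fragment of a first intein,
(b) a second vector comprising a nucleotide sequence encoding a C-terminal fragment of the first intein, upstream from a nucleotide sequence encoding a central fragment of the fluorescent protein, upstream from a nucleotide sequence encoding an N-terminal fragment of a second intein, and
(c) a third vector comprising a nucleotide sequence encoding a C-terminal fragment of the second intein, upstream from a nucleotide sequence encoding a C-terminal fragment of the fluorescent protein,
wherein the N-terminal fragment and C-terminal fragment of the first intein catalyze joining of the N-terminal fragment of the fluorescent protein to the central fragment of the fluorescent protein, and the N-terminal fragment and C-terminal fragment of the second intein catalyze joining of the central fragment of the fluorescent protein to the C-terminal fragment of the fluorescent protein, to produce a full-length fluorescent protein.
203. The kit of paragraph 202, wherein the fluorescent protein is selected from TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mScarlet, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4 and iRFP.
204. The kit of paragraph 203, wherein the fluorescent protein is mScarlet.
205. The kit of any one of paragraphs 202 to 204, wherein the first intein is a split intein.
206. The kit of any one of paragraphs 202 to 205, wherein the second intein is a split intein.
207. The kit of paragraph 206, wherein the split intein is a natural split intein.
208. The kit of paragraph 207, wherein the natural split intein is selected from DnaE inteins.
209. The kit of paragraph 208, wherein the DnaE intein is selected from the Synechocystis sp. DnaE (SspDnaE) intein and the Nostoc punctiforme (NpuDnaE) intein.
210. The kit of paragraph 209, wherein the first intein is an NpuDnaE intein and the second intein is an NpuDnaE intein.
211. The kit of any one of paragraphs 202 to 210, wherein the first molecule of interest, the second molecule of interest, the third molecule of interest, or any combination thereof is a protein.
212. The kit of any one of paragraphs 202 to 210, wherein the first molecule of interest, the second molecule of interest, the third molecule of interest, or any combination thereof is a non-coding ribonucleic acid (RNA).
213. The kit of paragraph 212, wherein the non-coding RNA is a microRNA (miRNA), an antisense RNA, a short-interfering RNA (siRNA) or a short-hairpin RNA (shRNA).
214. The kit of any one of paragraphs 202 to 213, wherein the first vector, the second vector, the third vector, or any combination thereof is a plasmid vector or a viral vector.
215. The kit of any one of paragraphs 202 to 214, further comprising any one or more of the following components: buffers, salts, cloning enzymes, competent cells, transfection reagents, antibiotics, and/or instructions for performing the methods described herein.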
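216. A transgenic selection method comprising delivering to a composition comprising eukaryotic cells (a) a first vector comprising (i) a nucleotide sequence encoding a first selectable marker protein fragment (e.g., an antibiotic resistance protein fragment or a fluorescent protein fragment), upstream from a nucleotide sequence encoding an N-terminal intein protein fragment, and (ii) a nucleotide sequence encoding a first molecule, and (b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal intein protein fragment, upstream from a second selectable marker protein fragment (e.g., an antibiotic resistance protein fragment or a fluorescent protein fragment), and (ii) a nucleotide sequence encoding a second molecule, wherein the N-terminal intein protein fragment and the C-terminal intein protein fragment catalyze joining of the first selectable marker protein fragment to the second selectable marker protein fragment to produce a full-length selectable marker protein.
217. A transgenic selection method comprising delivering to a eukaryotic cell (a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein (e.g., an antibiotic resistance protein or a fluorescent protein), upstream from a nucleotide sequence encoding an N-terminal fragment of a first intein, and (ii) a nucleotide sequence encoding a first molecule of interest, (b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the first intein, upstream from a nucleotide sequence encoding a central fragment of the selectable marker protein, upstream from a nucleotide sequence encoding an N-terminal fragment of a second intein, and (ii) a nucleotide sequence encoding a second molecule of interest, and (c) a third vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the second intein, upstream from a nucleotide sequence encoding a C-terminal fragment of the selectable marker protein, and (ii) a nucleotide sequence encoding a third molecule of interest, wherein the N-terminal fragment and C-terminal fragment of the first intein catalyze joining of the N-terminal fragment of the selectable marker protein to the central fragment of the selectable marker protein, and the N-terminal fragment and C-terminal fragment of the second intein catalyze joining of the central fragment of the selectable marker protein to the C-terminal fragment of the selectable marker protein, to produce a full-length selectable marker protein.
218. A transgenic selection method comprising delivering to a eukaryotic cell (a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein (e.g., an antibiotic resistance protein or a fluorescent protein), upstream from a nucleotide sequence encoding an N-terminal fragment of a first intein, and (ii) a nucleotide sequence encoding a first molecule of interest, (b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the first intein, upstream from a nucleotide sequence encoding a first central fragment of the selectable marker protein, upstream from a nucleotide sequence encoding an N-terminal fragment of a second intein, and (ii) a nucleotide sequence encoding a second molecule of interest, (c) a third vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the second intein, upstream from a nucleotide sequence encoding a second central fragment of the selectable marker protein, upstream from a nucleotide sequence encoding an N-terminal fragment of a third intein, and (ii) a nucleotide sequence encoding a third molecule of interest, and (d) a fourth vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the third intein, upstream from a nucleotide sequence encoding a C-terminal fragment of the selectable marker protein, and (ii) a nucleotide sequence encoding a third molecule of interest, wherein the N-terminal fragment and C-terminal fragment of the first intein catalyze joining of the N-terminal fragment of the selectable marker protein to the first central fragment of the selectable marker protein, the N-terminal fragment and C-terminal fragment of the second intein catalyze joining of the first central fragment of the selectable marker protein to the second central fragment of the selectable marker protein, and the N-terminal fragment and C-terminal fragment of the third intein catalyze joining of the second central fragment of the selectable marker protein to the C-terminal fragment of the selectable marker protein, to produce a full-length selectable marker protein.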
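219. The method of any one of paragraphs 216 to 218, further comprising maintaining the eukaryotic cells under conditions that permit introduction of the vectors into the eukaryotic cells to produce transgenic eukaryotic cells.
220. The method of paragraph 219, further comprising selecting transgenic eukaryotic cells that comprise the full-length selectable marker protein.
221. The method of any one of paragraphs 216 to 220, wherein the eukaryotic cells are mammalian cells.
222. The method of any one of paragraphs 216 to 221, wherein the antibiotic resistance protein confers resistance to hygromycin, G418, puromycin, phleomycin D1 or blasticidin.
223. The method of any one of paragraphs 216 to 222, wherein the intein is a split intein.
224. The method of paragraph 223, wherein the split intein is a natural split intein.
225. The method of paragraph 224, wherein the natural split intein is selected from DnaE inteins.
226. The method of paragraph 225, wherein the DnaE intein is selected from the Synechocystis sp. DnaE (SspDnaE) intein and the Nostoc punctiforme (NpuDnaE) intein.
227. The method of paragraph 223, wherein the split intein is an engineered split intein.
228. The method of paragraph 227, wherein the engineered split intein is engineered from a DnaB intein.
229. The method of paragraph 228, wherein the engineered split intein is the SspDnaB S1 intein.
230. The method of paragraph 229, wherein the engineered split intein is engineered from a GyrB intein.
231. The method of paragraph 230, wherein the engineered split intein is the SspGyrB S11 intein.
232. The method of any one of paragraphs 216 to 231, wherein the molecule is selected from proteins.
233. The method of any one of paragraphs 216 to 231, wherein the molecule is selected from non-coding ribonucleic acids (RNAs).
234. The method of paragraph 233, wherein the non-coding RNA is selected from microRNA (miRNA), antisense RNA, short-interfering RNA (siRNA) and short-hairpin RNA (shRNA).
235. The method of any one of paragraphs 216 to 234, wherein the vector is selected from plasmid vectors and viral vectors.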
<Examples>
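The present disclosure is further illustrated by the following examples. These examples are provided to aid understanding of the disclosure and should not be construed as limiting it.
Example 1. Antibiotic resistance markers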
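Selectable markers are often used in genetic engineering to isolate cells with a desired genotype [1]. However, the number of well-characterized antibiotic resistance genes for use in eukaryotic cells is limited, as is the number of fluorescent proteins whose spectra can be unambiguously distinguished by the equipment of a typical laboratory. Researchers often run out of selectable markers when incorporating multiple transgenes into cells. Selecting with multiple antibiotics simultaneously, on the other hand, is often harsh on cells. "Selectable marker recycling" offers a work-around, but it requires multiple rounds of transgenesis, selection, and removal of the selection marker [2]. To allow multiple transgenes to be selected simultaneously by a single selection scheme, we generated split antibiotic resistance and fluorescent protein genes, in which the gene encoding an antibiotic resistance or fluorescent protein is split into two or more segments, each fused to an intein that can rejoin the segments by protein trans-splicing ("markertrons") [3] (FIG. 1A). Each markertron is inserted into a transgenic vector carrying a specific transgene. Delivery of the transgenic vectors containing a set of markertrons yields cells carrying either a subset or the complete set of markertrons. Only cells containing the complete set produce a fully reconstituted marker protein through protein splicing and therefore pass selection, whereas cells with a partial set are eliminated, achieving co-selection of cells containing all intended transgenes.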
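The co-selection logic described above can be sketched in a few lines of Python. This is only an illustrative model, not code from the disclosure: the names `MARKERTRON_SET`, `survives_selection` and the example cells are hypothetical, and the model assumes that splicing always succeeds when every markertron is present.

```python
# Illustrative model of markertron co-selection (hypothetical names throughout).
# A marker split into several markertrons is reconstituted only when a cell
# carries every markertron of the set.

MARKERTRON_SET = frozenset({"N", "C"})  # assumed 2-markertron marker


def survives_selection(delivered, required=MARKERTRON_SET):
    """Return True if the cell carries the complete markertron set."""
    return required <= set(delivered)


# Possible outcomes after delivering two vectors to a cell population.
cells = {
    "untransduced": set(),
    "N only": {"N"},
    "C only": {"C"},
    "N + C": {"N", "C"},
}

survivors = [name for name, carried in cells.items() if survives_selection(carried)]
print(survivors)
```

The same function covers higher-degree splits by passing a larger `required` set, for example `frozenset({"N", "M", "C"})` for a 3-markertron marker.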
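We began by engineering 2-markertron intein-split resistance (Intres) genes for double transgenesis. Because flanking residues and local protein folding can affect the efficiency of intein-mediated trans-splicing, we set out to identify split points in each of four commonly used antibiotic resistance genes that are compatible with two well-characterized split inteins derived from NpuDnaE [4, 5] and SspDnaB [6]. To facilitate evaluation of the effectiveness of double transgenic selection, we cloned the markertrons into lentiviral vectors expressing a TagBFP2 or mCherry fluorescent protein as a test transgene (FIG. 1B). Viral preparations were transduced into U2OS cells, which were then split into replicate plates with non-selective or selective medium. After appropriate passaging for antibiotic selection, the two cell cultures were analyzed by flow cytometry. For the hygromycin (Hygro) resistance gene, we tested one "native" SspDnaB split point with flanking residues "GS" (G200:S201) and one "native" NpuDnaE split point with "YC" residues (Y89:C90). Both enabled successful selection when both the N- and C-markertrons were transduced, yielding >99% BFP+ mCherry+ double transgenic cells in selected cultures compared with <10% double-positive cells in non-selected cultures (FIG. 3; plasmid pairs 3,4 and 5,6). Cells transduced with only one of the two markertrons did not survive hygromycin selection. In contrast, double transgenesis using conventional full-length, non-split hygromycin vectors allowed only about 20% enrichment of BFP+ mCherry+ cells (plasmid pairs 97,98). We screened three additional potential split points for NpuDnaE, (52S:53C), (240A:241C) and (292R:293C), which have the obligatory cysteine residue on the C-extein junction and, on the N-extein junction, residues shown in a previous report [7] to support substantial trans-splicing activity. In addition, we incorporated six more NpuDnaE split points by inserting an "artificial" cysteine on the C-extein junction to support splicing at ectopic sites, creating additional split points. In total, 8 of the 11 split points tested supported hygromycin selection (FIG. 3). Similarly, for the puromycin (Puro) (FIG. 4), neomycin (Neo) (FIG. 5) and blasticidin (Blast) (FIG. 6) resistance genes, we identified 4, 2 and 1 functional Intres pair(s), respectively. In all of these cases, cells transduced with a single markertron did not survive selection, whereas cells transduced with both yielded >95% double transgenic cells in selected cultures compared with <50% in non-selected cultures, except for the Blast(102) Intres, which achieved a lower but still significant enrichment of 91% double transgenic cells (FIGS. 3-6). Details of the split points of the Intres genes and of the plasmids are provided in FIGS. 2A-2D and Table 1.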
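The "native" NpuDnaE split points named above can be checked against the sequence listing with a short script. The sketch below copies the first 96 residues of SEQ ID NO: 1 from the sequence listing and assumes that SEQ ID NO: 1 is the hygromycin resistance protein, which the listing itself does not state; the helper names `split_at` and `junction` are hypothetical.

```python
# First 96 residues of SEQ ID NO: 1 (three-letter codes, as in the listing).
# Assumption: SEQ ID NO: 1 is the hygromycin resistance protein.
SEQ1_PREFIX = """
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
""".split()


def split_at(residues, n_last):
    """Split after 1-based position n_last; return (N-fragment, C-fragment)."""
    if not 1 <= n_last < len(residues):
        raise ValueError("split point outside sequence")
    return residues[:n_last], residues[n_last:]


def junction(residues, n_last):
    """Return the residues flanking the split: (last N-extein, first C-extein)."""
    n_frag, c_frag = split_at(residues, n_last)
    return n_frag[-1], c_frag[0]


# Split points quoted in the text: Y89:C90 and 52S:53C.
for position in (89, 52):
    print(position, junction(SEQ1_PREFIX, position))
```

Both junctions place a cysteine at the first position of the C-extein, consistent with the requirement described for NpuDnaE split points.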
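Example 2. Gateway-compatible lentiviral vectors
To facilitate adoption of the Intres markers, we generated Gateway-compatible lentiviral vectors for convenient restriction-ligation-independent LR clonase recombination of transgenes [8] (FIG. 7A). We tested the functionality of these vectors by recombining TagBFP2 and mCherry into the N- and C-Intres vectors, respectively, and found robust selection of double transgenic cells (FIG. 7B). One potential use of Intres vectors is to install different fluorescent markers in cells to label different cellular compartments. To explore this use, we cloned NLS-GFP and LifeAct-mScarlet [9], which label the nucleus and F-actin, respectively, by Gateway recombination into either conventional full-length (FL) non-split hygromycin-selectable vectors or 2-markertron hygromycin Intres vectors; cells transduced with each set of plasmids were then subjected to antibiotic selection (FIG. 7C). Samples transduced with the non-split selectable plasmids contained both singly and doubly labeled cells, whereas cells transduced with the Intres plasmids were all doubly labeled (FIG. 7C).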
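Example 3. Fluorescent markers
To test whether split fluorescent markers can be used for transgene selection, we screened NpuDnaE split points for the mScarlet fluorescent protein (FIG. 8A). Four split points allowed >96% enrichment of double transgenic cells, and three other split points enabled >60% enrichment of double transgenic cells in the mScarlet-gated population, compared with <20% double transgenic cells in the non-gated population (FIG. 8B).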
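Example 4. Higher-degree split markers
We next set out to engineer higher-degree split markers using the split points identified for the 2-markertron Intres genes. We tested combinations of split points to partition a marker gene into three or more markertrons, allowing co-selection of more than two "unlinked" transgenes with a single antibiotic (FIGS. 9A-9B). To identify pairs of split points that permit such "Intres chains," we cloned 3-split markertrons into three lentiviral vectors, each carrying one of three fluorescent transgenes, TagBFP2, EGFP or mCherry, which enabled us to evaluate the effectiveness of selection by flow cytometry (FIG. 9C). Because the hygromycin resistance gene is the longest and offers the largest number of split points to test, we focused on engineering 3-markertron hygromycin Intres. We tested two 3-markertron hygromycin Intres using two intervening NpuDnaE inteins, as well as designs using NpuDnaE for the first intein and SspDnaB for the second, and SspDnaB for the first intein and NpuDnaE for the second, for six designs in total (FIG. 9D). Compared with <15% triple transgenic cells in non-selected cultures, five of these six 3-markertron hygromycin Intres enabled >97% triple transgenic selection in hygromycin-selected cultures, and the remaining one enabled 80%. Samples with leave-one-out transductions yielded no viable cells after hygromycin selection, whereas cells transduced with non-split hygromycin vectors yielded only 7% triple transgenic cells after selection.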
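To facilitate the use of 3-markertron Intres, we generated Gateway-compatible lentiviral vectors carrying these markers (FIG. 10A). We tested three sets of these vectors by recombining TagBFP (as transgene 1), EGFP (as transgene 2) and mCherry (as transgene 3) into the N-, M- and C-Intres Gateway destination vectors, respectively, and used them to transduce U2OS cells, which were then split and cultured in hygromycin-selective or non-selective medium (FIG. 10B). Two weeks after selection, the cells were analyzed by flow cytometry. All three sets of 3-markertron hygromycin Intres plasmids supported selection of >99% triple transgenic cells, compared with <25% in non-selected cultures (FIG. 10C).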
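We further tested the feasibility of a 4-markertron hygromycin Intres gene (FIG. 11). Here, we used an enhanced variant of the NpuDnaE intein known as NpuDnaGEP [10], fused to leucine zipper motifs [11], in combination with the SspDnaB intein. Transduction of all four plasmids containing the component markertrons yielded cells that survived hygromycin selection, whereas leave-one-out transductions yielded no survivors (Table 2).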
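The requirement that an "Intres chain" be complete can be illustrated with a small consistency check. The sketch below is a simplified model under stated assumptions: the fragment boundaries are read from the names of plasmids 115-118 in the construct list (assumed here to be the 4-markertron set), NpuDnaE(N) and NpuDnaGEP(C) are treated as one matching pair labeled "Npu", and the `Markertron` type and `reconstitutes` function are hypothetical. The model ignores splicing efficiency and any cross-reactivity between identical inteins at different junctions.

```python
from typing import NamedTuple, Optional


class Markertron(NamedTuple):
    start: int                 # first marker residue carried (1-based)
    end: int                   # last marker residue carried
    upstream: Optional[str]    # intein whose C-terminal half precedes the fragment
    downstream: Optional[str]  # intein whose N-terminal half follows the fragment


def reconstitutes(markertrons, full_length):
    """True if the fragments tile residues 1..full_length and every
    junction is bridged by matching halves of the same intein."""
    ordered = sorted(markertrons, key=lambda m: m.start)
    if not ordered:
        return False
    if ordered[0].start != 1 or ordered[-1].end != full_length:
        return False
    if ordered[0].upstream is not None or ordered[-1].downstream is not None:
        return False
    for left, right in zip(ordered, ordered[1:]):
        if right.start != left.end + 1:
            return False
        if left.downstream is None or left.downstream != right.upstream:
            return False
    return True


HYGRO_LENGTH = 341  # length of the Hygro protein per the plasmid names

four_set = [
    Markertron(1, 89, None, "Npu"),           # cf. plasmid 115
    Markertron(90, 200, "Npu", "SspDnaB"),    # cf. plasmid 116
    Markertron(201, 240, "SspDnaB", "Npu"),   # cf. plasmid 117
    Markertron(241, 341, "Npu", None),        # cf. plasmid 118
]

print(reconstitutes(four_set, HYGRO_LENGTH))
leave_one_out = [
    reconstitutes(four_set[:i] + four_set[i + 1:], HYGRO_LENGTH)
    for i in range(len(four_set))
]
print(leave_one_out)
```

In this model, dropping any one markertron breaks either the residue coverage or an intein junction, which mirrors the leave-one-out result reported above.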
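Example 5. Biallelic knock-in at the AAVS1 locus
CRISPR/Cas has recently emerged as a powerful technology for genome engineering and editing. Gene knockout based on NHEJ-mediated insertions/deletions (indels) occurs at high frequency, but precise editing and knock-in based on homology-directed repair (HDR) using an exogenous repair template (also known as a targeting construct) is inefficient. We tested whether split selectable markers could be used to enrich for cells with biallelic knock-in at the AAVS1 locus. We built targeting constructs with homology arms flanking the target site and a splice acceptor-2A peptide to trap the markertrons within one of the introns of the host gene PPP1R12C. However, we did not obtain any live cells after CRISPR/Cas knock-in experiments using these targeting constructs and two weeks of antibiotic selection (data not shown). We suspected that the endogenous promoter of the host gene PPP1R12C could not drive sufficient expression of the markertrons to reconstitute enough antibiotic resistance protein to counteract the action of the antibiotic. We therefore tested an alternative strategy in which the Intres markertrons are expressed from a TetO promoter, whose activity can be titrated by doxycycline (dox) concentration. To allow comparison of Intres-mediated biallelic selection with full-length (FL) non-split selectable markers, we implemented several different targeting construct designs. First, we drove expression of a full-length (FL) resistance gene (e.g., Hygro) together with rtTA under the constitutive EF1a promoter, and a separate test Intres (e.g., Blast Intres) under the dox-inducible TetO promoter (FIG. 12B, plasmids 109 and 110). This allowed comparison of full-length and split selectable markers within the same construct. To allow a fair comparison of full-length versus split markers driven by the same TetO promoter, we built two similar plasmids, 107 and 108 (cf. plasmids 109 and 110), in which a full-length antibiotic resistance gene (Blast) was placed downstream of the TetO promoter (FIG. 12A). To enable single-cell quantification of biallelic targeting and to demonstrate the feasibility of incorporating two transgenes into the two AAVS1 alleles, we appended EGFP and mScarlet fluorescent genes downstream of the test split or non-split markers via self-cleaving 2A peptides. Similarly, to test the Hygro Intres, we swapped the EF1a- and TetO-driven markers so that FL Hygro or Hygro Intres was placed downstream of TetO and FL Blast downstream of EF1a (FIGS. 12C-12D; plasmids 111-114). We co-transfected pX330-AAVS1 (plasmid 106), which contains Cas9 and an sgRNA targeting AAVS1, together with different pairs of targeting constructs into HEK293T cells, and in subsequent passaging split the cells into triplicate doxycycline-containing media with no antibiotic, with blasticidin, or with hygromycin. Two weeks after selection, we analyzed the cultures for biallelic targeting by flow cytometric measurement of GFP and RFP fluorescence (FIG. 12E). As expected, non-selected cultures harbored a small fraction (<1%) of biallelic knock-in GFP+/RFP+ cells (FIG. 12E; selection = none). Selection with an antibiotic for which the corresponding FL antibiotic resistance gene was present on the targeting constructs yielded <30% biallelic knock-in cells (FIG. 12E; Blast: TC a,c,d; Hygro: TC a,b,c). In contrast, selection with an antibiotic for which the corresponding Intres was present on the targeting constructs yielded 75% (FIG. 12E; Blast Intres: TC b) and 88% (FIG. 12E; Hygro Intres: TC d) biallelic knock-in cells.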
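In the above examples, we engineered split antibiotic resistance and fluorescent protein genes that allow selection for two or more "unlinked" transgenes. By inserting non-native residues into selectable markers, we showed that the positions available for engineering can be expanded with new, highly efficient split points. We demonstrated that split selectable markers can be incorporated into lentiviral vectors, or into gene targeting constructs in CRISPR/Cas9 genome editing experiments, to enable enrichment of cells with double transgenesis or biallelic knock-in. By combining two or more split points, we showed that 3- and 4-split markers can be generated to enable higher-degree transgenic selection. Future development of even higher-degree split selectable markers may enable "hyper-engineering" of cells containing dozens of transgenes or targeted knock-ins.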
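Materials and Methods
Cloning
To generate the test plasmids for each markertron, we first generated a Gateway donor plasmid containing its ORF and then recombined it into lentiviral destination vectors carrying a TagBFP2 (plasmid 94: pLX-DEST-IRES-TagBFP2), EGFP (plasmid 95: pLX-DEST-IRES-EGFP), or mCherry (plasmid 96: pLX-DEST-IRES-mCherry) reporter. These destination vectors were derived from pLX302 (addgene.org/25896/) by removing the puromycin resistance gene and inserting an IRES-fluorescent gene downstream of the Gateway cassette. The markertron-ORF Gateway donor plasmids were generated either by a nested fusion PCR procedure that combines the intein with the coding sequence of a fragment of the selectable marker, followed by insertion into the pCR8-GW-TOPO plasmid by sequence- and ligation-independent cloning (SLIC) (Li, M.Z. & Elledge, S.J. SLIC: a method for sequence- and ligation-independent cloning. Gene Synthesis: Methods and Protocols, 51-59 (2012)), or by PCR-amplifying the relevant fragment of the selectable marker and inserting it by SLIC into "scaffold" plasmids (plasmids 27-32) containing the intein sequences. DNA sequences encoding the inteins were codon-optimized for Homo sapiens and synthesized as gBlocks (IDT), with AC1947GB encoding the NpuDnaE intein and AC1949GB encoding the SspDnaB intein. Selectable marker fragments were amplified from plasmids containing these markers. See Table 1 for plasmids.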
Cell culture
All cells were cultured in Dulbecco's modified Eagle's medium (DMEM) (Sigma) with 10% fetal bovine serum (FBS) (Lonza), 4% Glutamax (Gibco), 1% sodium pyruvate (Gibco) and penicillin-streptomycin (Gibco). Incubator conditions were 37°C and 5% CO2.
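Virus production
A viral packaging mix of pLP1, pLP2 and VSV-G was co-transfected with each lentiviral vector, using Lipofectamine 3000, into Lenti-X 293T cells (Clontech) that had been seeded the day before in 6-well plates at 1.2 × 10^6 cells per well. The medium was changed 6 hours after transfection, followed by overnight incubation. At 28 hours after transfection, the virus-containing medium supernatant was filtered through a 0.45 μm PES filter and stored at -80°C until use.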
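Transduction
The day before transduction, target cells (HEK293T, MCF7, U2-OS) were seeded in 12-well plates at a density of 1.5 × 10^5 cells per well. Before transduction, the medium was replaced with 1 mL per well of medium containing 10 μg/mL polybrene. 250 μL of each virus (500 μL in total for experimental samples receiving two viruses) was added to each well and incubated overnight. The medium was changed 24 hours after infection. Cells were split into duplicate plates 4 days after infection. At 5 days after infection, medium containing antibiotic (hygromycin) was added to each well of one replicate plate (the other was maintained without selection). Antibiotic selection continued for 2 weeks before analysis by FACS.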
Fluorescence-activated cell sorting
Cells were trypsinized, suspended in medium, and analyzed on an LSRFortessa X-20 (BD Biosciences) flow cytometer using FACSDiVa software, version 8, on an HP Z230 workstation. 50,000 events were collected in each run.
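The percentages of double transgenic cells reported in the examples are fractions of flow cytometry events that fall above a gate on both fluorescence channels. A minimal sketch of that calculation follows. The function name, the toy event values and the threshold values are all hypothetical; real gates would be set from untransduced and single-color controls, and the disclosure does not describe its gating software settings.

```python
def percent_double_positive(events, bfp_threshold, mcherry_threshold):
    """Percentage of events above both gate thresholds.

    events: iterable of (bfp_intensity, mcherry_intensity) pairs.
    Thresholds are assumed gate cutoffs, not values from the disclosure.
    """
    events = list(events)
    if not events:
        raise ValueError("no events")
    hits = sum(
        1
        for bfp, mcherry in events
        if bfp > bfp_threshold and mcherry > mcherry_threshold
    )
    return 100.0 * hits / len(events)


# Toy data: one double-positive, two single-positive, one negative event.
toy_events = [(900, 1200), (50, 1100), (800, 40), (30, 20)]
print(percent_double_positive(toy_events, 200, 200))
```

Comparing this value between the selected and non-selected replicate plates gives the enrichment figures quoted in the examples.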
<Constructs and Sequences>
Plasmid 3: pLX-Hygro(1-89)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-89)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 20)
Amino acid sequence (SEQ ID NO: 21)
Plasmid 4: pLX-NpuDnaE(C)-Hygro(90-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(90-341)
Vector sequence (SEQ ID NO: 22)
Amino acid sequence (SEQ ID NO: 23)
Plasmid 5: pLX-Hygro(1-200)-SspDnaB(N)-IRES-TagBFP2
Protein = Hygro(1-200)-SspDnaB(N)
Vector sequence (SEQ ID NO: 24)
Amino acid sequence (SEQ ID NO: 25)
Plasmid 6: pLX-SspDnaB(C)-Hygro(201-341)-IRES-mCherry
Protein = SspDnaB(C)-Hygro(201-341)
Vector sequence (SEQ ID NO: 26)
Amino acid sequence (SEQ ID NO: 27)
Plasmid 7: pLX-Hygro(1-52)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-52)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 28)
Amino acid sequence (SEQ ID NO: 29)
Plasmid 8: pLX-NpuDnaE(C)-Hygro(53-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(53-341)
Vector sequence (SEQ ID NO: 30)
Amino acid sequence (SEQ ID NO: 31)
Plasmid 9: pLX-Hygro(1-240)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-240)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 32)
Amino acid sequence (SEQ ID NO: 33)
Plasmid 10: pLX-NpuDnaE(C)-Hygro(241-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(241-341)
Vector sequence (SEQ ID NO: 34)
Amino acid sequence (SEQ ID NO: 35)
Plasmid 11: pLX-Hygro(1-292)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-292)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 36)
Amino acid sequence (SEQ ID NO: 37)
Plasmid 12: pLX-NpuDnaE(C)-Hygro(293-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(293-341)
Vector sequence (SEQ ID NO: 38)
Amino acid sequence (SEQ ID NO: 39)
Plasmid 13: pLX-Blast(1-102)-NpuDnaE(N)-IRES-TagBFP2
Protein = Blast(1-102)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 40)
Amino acid sequence (SEQ ID NO: 41)
Plasmid 14: pLX-NpuDnaE(C)-Blast(103-140)-IRES-mCherry
Protein = NpuDnaE(C)-Blast(103-140)
Vector sequence (SEQ ID NO: 42)
Amino acid sequence (SEQ ID NO: 43)
Plasmid 17: pLX-Puro(1-119)-NpuDnaE(N)-IRES-TagBFP2
Protein = Puro(1-119)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 44)
Amino acid sequence (SEQ ID NO: 45)
Plasmid 18: pLX-NpuDnaE(C)-Puro(insCys;120-199)-IRES-mCherry
Protein = NpuDnaE(C)-Puro(insCys;120-199)
Vector sequence (SEQ ID NO: 46)
Amino acid sequence (SEQ ID NO: 47)
Plasmid 19: pLX-Puro(1-100)-SspDnaB(N-S0)-IRES-TagBFP2
Protein = Puro(1-100)-SspDnaB(N-S0)
Vector sequence (SEQ ID NO: 48)
Amino acid sequence (SEQ ID NO: 49)
Plasmid 20: pLX-SspDnaB(C-S0)-Puro(101-199)-IRES-mCherry
Protein = SspDnaB(C-S0)-Puro(101-199)
Vector sequence (SEQ ID NO: 50)
Amino acid sequence (SEQ ID NO: 51)
Plasmid 21: pLX-Neo(1-133)-NpuDnaE(N)-IRES-TagBFP2
Protein = Neo(1-133)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 52)
Amino acid sequence (SEQ ID NO: 53)
Plasmid 22: pLX-NpuDnaE(C)-Neo(134-267)-IRES-mCherry
Protein = NpuDnaE(C)-Neo(134-267)
Vector sequence (SEQ ID NO: 54)
Amino acid sequence (SEQ ID NO: 55)
Plasmid 23: pLX-Neo(1-194)-NpuDnaE(N)-IRES-TagBFP2
Protein = Neo(1-194)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 56)
Amino acid sequence (SEQ ID NO: 57)
Plasmid 24: pLX-NpuDnaE(C)-Neo(195-267)-IRES-mCherry
Protein = NpuDnaE(C)-Neo(195-267)
Vector sequence (SEQ ID NO: 58)
Amino acid sequence (SEQ ID NO: 59)
Plasmid 25: pLX-NpuDnaE(C)_Hygro(53-89)-NpuDnaE(N)-IRES-GFP
Protein = NpuDnaE(C)_Hygro(53-89)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 60)
Amino acid sequence (SEQ ID NO: 61)
Plasmid 26: pLX-NpuDnaE(C)_Hygro(53-239)-NpuDnaE(N)-IRES-GFP
Protein = NpuDnaE(C)_Hygro(53-239)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 62)
Amino acid sequence (SEQ ID NO: 63)
Plasmid 27: pCR8-BsaI->ccdbCam<-BsaI-NpuDnaE(N)-MD1-68-15 (SEQ ID NO: 64)
Plasmid 28: pCR8-NpuDnaE(C)_BsaI->ccdbCam<-BsaI-MD1-68-18 (SEQ ID NO: 65)
Plasmid 29: pCR8-BsaI->ccdbCam<-BsaI-SspDnaE(N)-MD1-68-12 (SEQ ID NO: 66)
Plasmid 30: pCR8-SspDnaE(C)_BsaI->ccdbCam<-BsaI-MD1-68-13 (SEQ ID NO: 67)
Plasmid 31: pCR8-BsaI->ccdbCam<-BsaI-SspDnaB(N-S0)-25-135-18 (SEQ ID NO: 68)
Plasmid 32: pCR8-SspDnaB(C-S0)_BsaI->ccdbCam<-BsaI-25-155-41 (SEQ ID NO: 69)
Plasmid 33: pLX-mScarlet(1-46)-NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = mScarlet(1-46)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 70)
Amino acid sequence (SEQ ID NO: 71)
Plasmid 34: pLX-LZB_NpuDnaE(C)-mScarlet(insCys;47-232)-IRES-TagBFP2
Protein = LZB_NpuDnaE(C)-mScarlet(insCys;47-232)
Vector sequence (SEQ ID NO: 72)
Amino acid sequence (SEQ ID NO: 73)
Plasmid 35: pLX-mScarlet(1-48)-NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = mScarlet(1-48)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 74)
Amino acid sequence (SEQ ID NO: 75)
Plasmid 36: pLX-LZB_NpuDnaE(C)-mScarlet(insCys;49-232)-IRES-GFP
Protein = LZB_NpuDnaE(C)-mScarlet(insCys;49-232)
Vector sequence (SEQ ID NO: 76)
Amino acid sequence (SEQ ID NO: 77)
Plasmid 37: pLX-mScarlet(1-51)-NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = mScarlet(1-51)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 78)
Amino acid sequence (SEQ ID NO: 79)
Plasmid 38: pLX-LZB_NpuDnaE(C)-mScarlet(insCys;52-232)-IRES-GFP
Protein = LZB_NpuDnaE(C)-mScarlet(insCys;52-232)
Vector sequence (SEQ ID NO: 80)
Amino acid sequence (SEQ ID NO: 81)
Plasmid 39: pLX-mScarlet(1-75)-NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = mScarlet(1-75)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 82)
Amino acid sequence (SEQ ID NO: 83)
Plasmid 40: pLX-LZB_NpuDnaE(C)-mScarlet(insCys;76-232)-IRES-GFP
Protein = LZB_NpuDnaE(C)-mScarlet(insCys;76-232)
Vector sequence (SEQ ID NO: 84)
Amino acid sequence (SEQ ID NO: 85)
Plasmid 41: pLX-mScarlet(1-122)-NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = mScarlet(1-122)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 86)
Amino acid sequence (SEQ ID NO: 87)
Plasmid 42: pLX-LZB_NpuDnaE(C)-mScarlet(insCys;123-232)-IRES-GFP
Protein = LZB_NpuDnaE(C)-mScarlet(insCys;123-232)
Vector sequence (SEQ ID NO: 88)
Amino acid sequence (SEQ ID NO: 89)
Plasmid 43: pLX-mScarlet(1-140)-NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = mScarlet(1-140)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 90)
Amino acid sequence (SEQ ID NO: 91)
Plasmid 44: pLX-LZB_NpuDnaE(C)-mScarlet(insCys;141-232)-IRES-GFP
Protein = LZB_NpuDnaE(C)-mScarlet(insCys;141-232)
Vector sequence (SEQ ID NO: 92)
Amino acid sequence (SEQ ID NO: 93)
Plasmid 45: pLX-mScarlet(1-163)-NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = mScarlet(1-163)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 94)
Amino acid sequence (SEQ ID NO: 95)
Plasmid 46: pLX-LZB_NpuDnaE(C)-mScarlet(insCys;164-232)-IRES-GFP
Protein = LZB_NpuDnaE(C)-mScarlet(insCys;164-232)
Vector sequence (SEQ ID NO: 96)
Amino acid sequence (SEQ ID NO: 97)
Plasmid 47: pCR8-TagBFP2
Protein = TagBFP2
Vector sequence (SEQ ID NO: 98)
Amino acid sequence (SEQ ID NO: 99)
Plasmid 48: pCR8-mCherry
Protein = mCherry
Vector sequence (SEQ ID NO: 100)
Amino acid sequence (SEQ ID NO: 101)
Plasmid 49: pLX-DEST-IRES-Hygro(1-89)-NpuDnaE(N)
Protein = Hygro(1-89)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 102)
Amino acid sequence (SEQ ID NO: 103)
Plasmid 50: pLX-DEST-IRES-NpuDnaE(C)-Hygro(90-341)
Protein = NpuDnaE(C)-Hygro(90-341)
Vector sequence (SEQ ID NO: 104)
Amino acid sequence (SEQ ID NO: 105)
Plasmid 51: pLX-[TagBFP2]-IRES-Hygro(1-89)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 106)
Plasmid 52: pLX-[mCherry]-IRES-NpuDnaE(C)-Hygro(90-341)
Vector sequence (SEQ ID NO: 107)
Plasmid 53: pLX-DEST-IRES-Puro(1-119)-NpuDnaE(N)
Protein = Puro(1-119)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 108)
Amino acid sequence (SEQ ID NO: 109)
Plasmid 54: pLX-DEST-IRES-NpuDnaE(C)-Puro(120-199)
Protein = NpuDnaE(C)-Puro(120-199)
Vector sequence (SEQ ID NO: 110)
Amino acid sequence (SEQ ID NO: 111)
Plasmid 55: pLX-[TagBFP2]-IRES-Puro(1-119)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 112)
Plasmid 56: pLX-[mCherry]-IRES-NpuDnaE(C)-Puro(120-199)
Vector sequence (SEQ ID NO: 113)
Plasmid 57: pLX-DEST-IRES-Neo(1-194)-NpuDnaE(N)
Protein = Neo(1-194)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 114)
Amino acid sequence (SEQ ID NO: 115)
Plasmid 58: pLX-DEST-IRES-NpuDnaE(C)-Neo(195-267)
Protein = NpuDnaE(C)-Neo(195-267)
Vector sequence (SEQ ID NO: 116)
Amino acid sequence (SEQ ID NO: 117)
Plasmid 59: pLX-[TagBFP2]-IRES-Neo(1-194)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 118)
Plasmid 60: pLX-[mCherry]-IRES-NpuDnaE(C)-Neo(195-267)
Vector sequence (SEQ ID NO: 119)
Plasmid 61: pLX-mScarlet(1-51)-NpuDnaE(N)-LZA-IRES-TagBFP2
Protein = mScarlet(1-51)-NpuDnaE(N)-LZA
Vector sequence (SEQ ID NO: 120)
Amino acid sequence (SEQ ID NO: 121)
Plasmid 62: pLX-LZB-NpuDnaE(C)-mScarlet(^C;52-163)-NpuDnaE(N)_LZA-IRES-EGFP
Protein = LZB-NpuDnaE(C)-mScarlet(^C;52-163)-NpuDnaE(N)_LZA
Vector sequence (SEQ ID NO: 122)
Amino acid sequence (SEQ ID NO: 123)
Plasmid 63: pLX-LZB-NpuDnaE(C)-mScarlet(^C;164-232)-IRES-EGFP
Protein = LZB-NpuDnaE(C)-mScarlet(^C;164-232)
Vector sequence (SEQ ID NO: 124)
Amino acid sequence (SEQ ID NO: 125)
Plasmid 64: pLX-Hygro(1-69)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-69)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 126)
Amino acid sequence (SEQ ID NO: 127)
Plasmid 65: pLX-NpuDnaE(C)-Hygro(^C;70-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(^C;70-341)
Vector sequence (SEQ ID NO: 128)
Amino acid sequence (SEQ ID NO: 129)
Plasmid 66: pLX-Hygro(1-131)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-131)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 130)
Amino acid sequence (SEQ ID NO: 131)
Plasmid 67: pLX-NpuDnaE(C)-Hygro(^C;132-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(^C;132-341)
Vector sequence (SEQ ID NO: 132)
Amino acid sequence (SEQ ID NO: 133)
Plasmid 68: pLX-Hygro(1-171)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-171)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 134)
Amino acid sequence (SEQ ID NO: 135)
Plasmid 69: pLX-NpuDnaE(C)-Hygro(^C;172-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(^C;172-341)
Vector sequence (SEQ ID NO: 136)
Amino acid sequence (SEQ ID NO: 137)
Plasmid 70: pLX-Hygro(1-218)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-218)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 138)
Amino acid sequence (SEQ ID NO: 139)
Plasmid 71: pLX-NpuDnaE(C)-Hygro(^C;219-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(^C;219-341)
Vector sequence (SEQ ID NO: 140)
Amino acid sequence (SEQ ID NO: 141)
Plasmid 72: pLX-Hygro(1-259)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-259)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 142)
Amino acid sequence (SEQ ID NO: 143)
Plasmid 73: pLX-NpuDnaE(C)-Hygro(^C;260-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(^C;260-341)
Vector sequence (SEQ ID NO: 144)
Amino acid sequence (SEQ ID NO: 145)
Plasmid 74: pLX-Hygro(1-277)-NpuDnaE(N)-IRES-TagBFP2
Protein = Hygro(1-277)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 146)
Amino acid sequence (SEQ ID NO: 147)
Plasmid 75: pLX-NpuDnaE(C)-Hygro(^C;278-341)-IRES-mCherry
Protein = NpuDnaE(C)-Hygro(^C;278-341)
Vector sequence (SEQ ID NO: 148)
Amino acid sequence (SEQ ID NO: 149)
Plasmid 76: pLX-Puro(1-32)-NpuDnaE(N)-IRES-TagBFP2
Protein = Puro(1-32)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 150)
Amino acid sequence (SEQ ID NO: 151)
Plasmid 77: pLX-NpuDnaE(C)-Puro(^C;33-199)-IRES-mCherry
Protein = NpuDnaE(C)-Puro(^C;33-199)
Vector sequence (SEQ ID NO: 152)
Amino acid sequence (SEQ ID NO: 153)
Plasmid 78: pLX-Puro(1-84)-NpuDnaE(N)-IRES-TagBFP2
Protein = Puro(1-84)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 154)
Amino acid sequence (SEQ ID NO: 155)
Plasmid 79: pLX-NpuDnaE(C)-Puro(^C;85-199)-IRES-mCherry
Protein = NpuDnaE(C)-Puro(^C;85-199)
Vector sequence (SEQ ID NO: 156)
Amino acid sequence (SEQ ID NO: 157)
Plasmid 80: pLX-Puro(1-137)-NpuDnaE(N)-IRES-TagBFP2
Protein = Puro(1-137)-NpuDnaE(N)
File = pLX-[PuroKC3(N)-NpuDnaE(N)-25-131-29"]-IRES-TagBFP2-25-133-6
Vector sequence (SEQ ID NO: 158)
Amino acid sequence (SEQ ID NO: 159)
Plasmid 81: pLX-NpuDnaE(C)-Puro(^C;138-199)-IRES-mCherry
Protein = NpuDnaE(C)-Puro(^C;138-199)
Vector sequence (SEQ ID NO: 160)
Amino acid sequence (SEQ ID NO: 161)
Plasmid 82: pLX-Puro(1-158)-NpuDnaE(N)-IRES-TagBFP2
Protein = Puro(1-158)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 162)
Amino acid sequence (SEQ ID NO: 163)
Plasmid 83: pLX-NpuDnaE(C)-Puro(^C;159-199)-IRES-mCherry
Protein = NpuDnaE(C)-Puro(^C;159-199)
Vector sequence (SEQ ID NO: 164)
Amino acid sequence (SEQ ID NO: 165)
Plasmid 84: pLX-Puro(1-180)-NpuDnaE(N)-IRES-TagBFP2
Protein = Puro(1-180)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 166)
Amino acid sequence (SEQ ID NO: 167)
Plasmid 85: pLX-NpuDnaE(C)-Puro(^C;181-199)-IRES-mCherry
Protein = NpuDnaE(C)-Puro(^C;181-199)
Vector sequence (SEQ ID NO: 168)
Amino acid sequence (SEQ ID NO: 169)
Plasmid 86: pLX-Blast(1-58)-NpuDnaE(N)-IRES-TagBFP2
Protein = Blast(1-58)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 170)
Amino acid sequence (SEQ ID NO: 171)
Plasmid 87: pLX-NpuDnaE(C)-Blast(59-140)-IRES-mCherry
Protein = NpuDnaE(C)-Blast(59-140)
Vector sequence (SEQ ID NO: 172)
Amino acid sequence (SEQ ID NO: 173)
Plasmid 88: pLX-NpuDnaE(C)-HygroBA-SspDnaB(N-S0)-IRES-EGFP
Protein = NpuDnaE(C)-Hygro(53-200)-SspDnaB(N-S0)
Vector sequence (SEQ ID NO: 174)
Amino acid sequence (SEQ ID NO: 175)
Plasmid 89: pLX-SspDnaB(C-S0)-Hygro(201-341)-IRES-mCherry
Protein = SspDnaB(C-S0)-Hygro(201-341)
Vector sequence (SEQ ID NO: 176)
Amino acid sequence (SEQ ID NO: 177)
Plasmid 90: pLX-NpuDnaE(C)-Hygro(90-200)-SspDnaB(N-S0)-IRES-EGFP
Protein = NpuDnaE(C)-Hygro(90-200)-SspDnaB(N-S0)
Vector sequence (SEQ ID NO: 178)
Amino acid sequence (SEQ ID NO: 179)
Plasmid 91: pLX-Hygro(1-200)-SspDnaB(N-S0)-IRES-TagBFP2
Protein = Hygro(1-200)-SspDnaB(N-S0)
Vector sequence (SEQ ID NO: 180)
Amino acid sequence (SEQ ID NO: 181)
Plasmid 92: pLX-SspDnaB(C-S0)-Hygro(201-240)-NpuDnaE(N)-IRES-EGFP
Protein = SspDnaB(C-S0)-Hygro(201-240)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 182)
Amino acid sequence (SEQ ID NO: 183)
Plasmid 93: pLX-SspDnaB(C-S0)-Hygro(201-292)-NpuDnaE(N)-IRES-EGFP
Protein = SspDnaB(C-S0)-Hygro(201-292)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 184)
Amino acid sequence (SEQ ID NO: 185)
Plasmid 94: pLX-DEST-IRES-TagBFP2 (SEQ ID NO: 186)
Plasmid 95: pLX-DEST-IRES-EGFP (SEQ ID NO: 187)
Plasmid 96: pLX-DEST-IRES-mCherry (SEQ ID NO: 188)
Plasmid 97: pLX-Hygro-IRES-TagBFP2
Vector sequence (SEQ ID NO: 189)
Plasmid 98: pLX-Hygro-IRES-mCherry
Vector sequence (SEQ ID NO: 190)
Plasmid 99: pLX-Puro-IRES-TagBFP2
Vector sequence (SEQ ID NO: 191)
Plasmid 100: pLX-Puro-IRES-mCherry
Vector sequence (SEQ ID NO: 192)
Plasmid 101: pLX-Hygro-IRES-EGFP
Vector sequence (SEQ ID NO: 193)
Plasmid 102: pLX-NLS_GFP-IRES-Hygro
Vector sequence (SEQ ID NO: 194)
Plasmid 103: pLX-LifeAct_mCherry-IRES-Hygro
Vector sequence (SEQ ID NO: 195)
Plasmid 104: pLX-NLS_GFP-IRES-Hygro(1-89)-NpuDnaE(N)
Vector sequence (SEQ ID NO: 196)
Plasmid 105: pLX-LifeAct_mScarlet-IRES-NpuDnaE(C)-Hygro(90-341)
Vector sequence (SEQ ID NO: 197)
Plasmid 106: pX330-AAVS1
sgRNA spacer sequence: gACCCCACAGTGGGGCCACTA (the first g does not match the genome) (SEQ ID NO: 198)
Vector sequence (SEQ ID NO: 199)
Plasmid 107: pAAVS1-Nst-EF1aHygro2ArtTA3(-)_TetO-Blast-P2A-EGFP
Vector sequence (SEQ ID NO: 200)
Plasmid 108: pAAVS1-Nst-EF1aHygro2ArtTA3(-)_TetO-Blast-P2A-mScarlet
Vector sequence (SEQ ID NO: 201)
Plasmid 109: pAAVS1-Nst-EF1aHygro2ArtTA3(-)_TetO-Blast(1-102)_NpuDnaE(N)-P2A-EGFP
Vector sequence (SEQ ID NO: 202)
Plasmid 110: pAAVS1-Nst-EF1aHygro2ArtTA3(-)_TetO-NpuDnaE(C)_Blast(103-140)-P2A-mScarlet
Vector sequence (SEQ ID NO: 203)
Plasmid 111: pAAVS1-Nst-EF1aBlast2ArtTA3(-)_TetO-Hygro-P2A-NTR-E2A-EGFP
Vector sequence (SEQ ID NO: 204)
Plasmid 112: pAAVS1-Nst-EF1aBlast2ArtTA3(-)_TetO-Hygro-P2A-NTR-E2A-mCherry
Vector sequence (SEQ ID NO: 205)
Plasmid 113: pAAVS1-Nst-EF1aBlast2ArtTA3(-)_TetO-Hygro(1-89)-NpuDnaE(N)-P2A-NTR-E2A-EGFP
Vector sequence (SEQ ID NO: 206)
Plasmid 114: pAAVS1-Nst-EF1aBlast2ArtTA3(-)_TetO-NpuDnaE(C)-Hygro(90-341)-P2A-NTR-E2A-mCherry
Vector sequence (SEQ ID NO: 207)
Plasmid 115: pLX-Hygro(1-89)_NpuDnaE(N)_LZA-IRES-TagBFP2
Protein = Hygro(1-89)-NpuDnaE(N)-LZA
Vector sequence (SEQ ID NO: 208)
Amino acid sequence (SEQ ID NO: 209)
Plasmid 116: pLX-LZB_NpuDnaGEP(C)_Hygro(90-200)_SspDnaB(N-S0)-IRES-GFP
Protein = LZB-NpuDnaGEP(C)-Hygro(90-200)-SspDnaB(N-S0)
Vector sequence (SEQ ID NO: 210)
Amino acid sequence (SEQ ID NO: 211)
Plasmid 117: pLX-SspDnaB(C-S0)_Hygro(201-240)_NpuDnaE(N)_LZA-IRES-GFP
Protein = SspDnaB(C-S0)-Hygro(201-240)-NpuDnaE(N)-LZA
Vector sequence (SEQ ID NO: 212)
Amino acid sequence (SEQ ID NO: 213)
Plasmid 118: pLX-LZB_NpuDnaGEP(C)_Hygro(241-341)-IRES-mCherry
Protein = LZB-NpuDnaGEP(C)-Hygro(241-341)
Vector sequence (SEQ ID NO: 214)
Amino acid sequence (SEQ ID NO: 215)
AC1947GB (SEQ ID NO: 216)
AC1949GB (SEQ ID NO: 217)
pCR8-ccdbCam (SEQ ID NO: 218)
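References
All references, patents and patent applications disclosed herein are incorporated by reference with respect to the subject matter for which each is cited, which in some cases may encompass the entirety of the document.
The indefinite articles "a" and "an," as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to mean "at least one."
It should also be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one step or act, the order of the steps or acts of the method is not necessarily limited to the order in which the steps or acts of the method are recited.
In the claims, as well as in the specification above, all transitional phrases such as "comprising," "including," "carrying," "having," "containing," "involving," "holding," "composed of," and the like are to be understood to be open-ended, i.e., to mean including but not limited to. Only the transitional phrases "consisting of" and "consisting essentially of" are closed or semi-closed transitional phrases, respectively, as set forth in the United States Patent Office Manual of Patent Examining Procedures, Section 2111.03.
The terms "about" and "substantially" preceding a numerical value mean ±10% of the recited numerical value.
Where a range of values is provided, each value between the upper and lower ends of the range is specifically contemplated and described herein.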
SEQUENCE LISTING
<110> The Jackson Laboratory
<120> TRANSGENIC SELECTION METHODS AND COMPOSITIONS
<130> J0227.70007WO00
<140> Not Yet Assigned
<141> Concurrently Herewith
<150> US 62/624,629
<151> 2018-01-31
<150> US 62/616,281
<151> 2018-01-11
<150> US 62/608,478
<151> 2017-12-20
<150> US 62/571,672
<151> 2017-10-12
<160> 218
<170> PatentIn version 3.5
<210> 1
<211> 341
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 1
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
195 200 205
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
210 215 220
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
225 230 235 240
Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu
245 250 255
Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp
260 265 270
Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp
275 280 285
Ala Gln Gly Arg Cys Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val
290 295 300
Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly
305 310 315 320
Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg
325 330 335
Pro Arg Ala Lys Glu
340
<210> 2
<211> 199
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 2
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu
35 40 45
Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val
50 55 60
Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu
65 70 75 80
Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met Ala
85 90 95
Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly Leu
100 105 110
Leu Ala Pro His Arg Pro Lys Glu Pro Ala Trp Phe Leu Ala Thr Val
115 120 125
Gly Val Ser Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val Val
130 135 140
Leu Pro Gly Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe Leu
145 150 155 160
Glu Thr Ser Ala Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly Phe
165 170 175
Thr Val Thr Ala Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp Cys
180 185 190
Met Thr Arg Lys Pro Gly Ala
195
<210> 3
<211> 264
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 3
Met Ile Glu Gln Asp Gly Leu His Ala Gly Ser Pro Ala Ala Trp Val
1 5 10 15
Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gln Gln Thr Ile Gly Cys Ser
20 25 30
Asp Ala Ala Val Phe Arg Leu Ser Ala Gln Gly Arg Pro Val Leu Phe
35 40 45
Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gln Asp Glu Ala
50 55 60
Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys Ala Ala Val
65 70 75 80
Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu Leu Gly Glu
85 90 95
Val Pro Gly Gln Asp Leu Leu Ser Ser His Leu Ala Pro Ala Glu Lys
100 105 110
Val Ser Ile Met Ala Asp Ala Met Arg Arg Leu His Thr Leu Asp Pro
115 120 125
Ala Thr Cys Pro Phe Asp His Gln Ala Lys His Arg Ile Glu Arg Ala
130 135 140
Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gln Asp Asp Leu Asp Glu
145 150 155 160
Glu His Gln Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu Lys Ala
165 170 175
Arg Met Pro Asp Gly Glu Asp Leu Val Val Thr His Gly Asp Ala Cys
180 185 190
Leu Pro Asn Ile Met Val Glu Asn Gly Arg Phe Ser Gly Phe Ile Asp
195 200 205
Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gln Asp Ile Ala Leu Ala
210 215 220
Thr Arg Asp Ile Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp Arg Phe
225 230 235 240
Leu Val Leu Tyr Gly Ile Ala Ala Pro Asp Ser Gln Arg Ile Ala Phe
245 250 255
Tyr Arg Leu Leu Asp Glu Phe Phe
260
<210> 4
<211> 140
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 4
Met Lys Thr Phe Asn Ile Ser Gln Gln Asp Leu Glu Leu Val Glu Val
1 5 10 15
Ala Thr Glu Lys Ile Thr Met Leu Tyr Glu Asp Asn Lys His His Val
20 25 30
Gly Ala Ala Ile Arg Thr Lys Thr Gly Glu Ile Ile Ser Ala Val His
35 40 45
Ile Glu Ala Tyr Ile Gly Arg Val Thr Val Cys Ala Glu Ala Ile Ala
50 55 60
Ile Gly Ser Ala Val Ser Asn Gly Gln Lys Asp Phe Asp Thr Ile Val
65 70 75 80
Ala Val Arg His Pro Tyr Ser Asp Glu Val Asp Arg Ser Ile Arg Val
85 90 95
Val Ser Pro Cys Gly Met Cys Arg Glu Leu Ile Ser Asp Tyr Ala Pro
100 105 110
Asp Cys Phe Val Leu Ile Glu Met Asn Gly Lys Leu Val Lys Thr Thr
115 120 125
Ile Glu Glu Leu Ile Pro Leu Lys Tyr Thr Arg Asn
130 135 140
<210> 5
<211> 239
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 5
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60
Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235
<210> 6
<211> 232
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 6
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Val Thr Lys Gly Gly Pro Leu Pro Phe Ser Trp Asp Ile Leu Ser Pro
50 55 60
Gln Phe Met Tyr Gly Ser Arg Ala Phe Thr Lys His Pro Ala Asp Ile
65 70 75 80
Pro Asp Tyr Tyr Lys Gln Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg
85 90 95
Val Met Asn Phe Glu Asp Gly Gly Ala Val Thr Val Thr Gln Asp Thr
100 105 110
Ser Leu Glu Asp Gly Thr Leu Ile Tyr Lys Val Lys Leu Arg Gly Thr
115 120 125
Asn Phe Pro Pro Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp
130 135 140
Glu Ala Ser Thr Glu Arg Leu Tyr Pro Glu Asp Gly Val Leu Lys Gly
145 150 155 160
Asp Ile Lys Met Ala Leu Arg Leu Lys Asp Gly Gly Arg Tyr Leu Ala
165 170 175
Asp Phe Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Met Pro Gly
180 185 190
Ala Tyr Asn Val Asp Arg Lys Leu Asp Ile Thr Ser His Asn Glu Asp
195 200 205
Tyr Thr Val Val Glu Gln Tyr Glu Arg Ser Glu Gly Arg His Ser Thr
210 215 220
Gly Gly Met Asp Glu Leu Tyr Lys
225 230
<210> 7
<211> 102
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 7
Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu
1 5 10 15
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser
20 25 30
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His
35 40 45
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser
50 55 60
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln
65 70 75 80
Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg
85 90 95
Val Asp Asn Leu Pro Asn
100
<210> 8
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 8
Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr Asp
1 5 10 15
Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe Ile
20 25 30
Ala Ser Asn
35
<210> 9
<211> 106
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 9
Cys Ile Ser Gly Asp Ser Leu Ile Ser Leu Ala Ser Thr Gly Lys Arg
1 5 10 15
Val Ser Ile Lys Asp Leu Leu Asp Glu Lys Asp Phe Glu Ile Trp Ala
20 25 30
Ile Asn Glu Gln Thr Met Lys Leu Glu Ser Ala Lys Val Ser Arg Val
35 40 45
Phe Cys Thr Gly Lys Lys Leu Val Tyr Ile Leu Lys Thr Arg Leu Gly
50 55 60
Arg Thr Ile Lys Ala Thr Ala Asn His Arg Phe Leu Thr Ile Asp Gly
65 70 75 80
Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys Glu His Ile Ala Leu Pro
85 90 95
Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu
100 105
<210> 10
<211> 48
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 10
Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp Ser
1 5 10 15
Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu Thr
20 25 30
Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His Asn
35 40 45
<210> 11
<211> 139
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 11
Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu
1 5 10 15
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser
20 25 30
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His
35 40 45
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser
50 55 60
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln
65 70 75 80
Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg
85 90 95
Val Asp Asn Leu Pro Asn Gly Gly Gly Gly Ser Gly Ser Ala Gln Leu
100 105 110
Glu Lys Glu Leu Gln Ala Leu Glu Lys Lys Leu Ala Gln Leu Glu Trp
115 120 125
Glu Asn Gln Ala Leu Glu Lys Glu Leu Ala Gln
130 135
<210> 12
<211> 68
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 12
Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala Gln
1 5 10 15
Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly Gly
20 25 30
Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly
35 40 45
Lys Gln Asn Val Tyr Asp Ile Gly Val Gly Glu Pro His Asn Phe Ala
50 55 60
Leu Lys Asn Gly
65
<210> 13
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 13
Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr Asp
1 5 10 15
Ile Gly Val Gly Glu Pro His Asn Phe Ala Leu Lys Asn Gly Phe Ile
20 25 30
Ala Ser Asn
35
<210> 14
<211> 30
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 14
Ala Gln Leu Glu Lys Glu Leu Gln Ala Leu Glu Lys Lys Leu Ala Gln
1 5 10 15
Leu Glu Trp Glu Asn Gln Ala Leu Glu Lys Glu Leu Ala Gln
20 25 30
<210> 15
<211> 29
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 15
Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala Gln
1 5 10 15
Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala
20 25
<210> 16
<211> 123
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 16
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
1 5 10 15
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
20 25 30
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
35 40 45
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
50 55 60
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
65 70 75 80
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
85 90 95
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
100 105 110
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys
115 120
<210> 17
<211> 35
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 17
Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp
1 5 10 15
Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile
20 25 30
Ala Ala Asn
35
<210> 18
<211> 11
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 18
Cys Ile Ser Gly Asp Ser Leu Ile Ser Leu Ala
1 5 10
<210> 19
<211> 143
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 19
Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp Glu Lys Asp
1 5 10 15
Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu Glu Ser Ala
20 25 30
Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val Tyr Ile Leu
35 40 45
Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn His Arg Phe
50 55 60
Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys Glu
65 70 75 80
His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu Ser
85 90 95
Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp Ser Ile
100 105 110
Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu Thr Val
115 120 125
Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His Asn
130 135 140
<210> 20
<211> 9219
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 20
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgcc tttcatacga gaccgagatc ctgactgtcg agtacggatt gcttcctatc 960
ggcaaaatcg tggagaagag gattgaatgt accgtctatt cagtcgataa taatgggaac 1020
atctacacac agcccgtggc tcaatggcac gacagaggag agcaggaagt ttttgaatac 1080
tgtctcgagg acggatccct catccgcgct actaaagatc ataagtttat gaccgtggac 1140
ggccagatgc tgccaattga cgaaattttt gaacgagagc tggatctgat gagagtcgac 1200
aaccttccaa actgattaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1260
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1320
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1380
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1440
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1500
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1560
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1620
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1680
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 1740
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 1800
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 1860
acggggacgt ggttttcctt tgaaaaacac gataatacca tggccatgag cgagctgatt 1920
aaggagaaca tgcacatgaa gctgtacatg gagggcaccg tggacaacca tcacttcaag 1980
tgcacatccg agggcgaagg caagccctac gagggcaccc agaccatgag aatcaaggtg 2040
gtcgagggcg gccctctccc cttcgccttc gacatcctgg ctactagctt cctctacggc 2100
agcaagacct tcatcaacca cacccagggc atccccgact tcttcaagca gtccttccct 2160
gagggcttca catgggagag agtcaccaca tacgaagacg ggggcgtgct gaccgctacc 2220
caggacacca gcctccagga cggctgcctc atctacaacg tcaagatcag aggggtgaac 2280
ttcacatcca acggccctgt gatgcagaag aaaacactcg gctgggaggc cttcaccgag 2340
acgctgtacc ccgctgacgg cggcctggaa ggcagaaacg acatggccct gaagctcgtg 2400
ggcgggagcc atctgatcgc aaacatcaag accacatata gatccaagaa acccgctaag 2460
aacctcaaga tgcctggcgt ctactatgtg gactacagac tggaaagaat caaggaggcc 2520
aacaacgaga cctacgtcga gcagcacgag gtggcagtgg ccagatactg cgacctccct 2580
agcaaactgg ggcacaagct taattaacac cggtggcgcg ttaagtcgac aatcaacctc 2640
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 2700
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 2760
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 2820
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 2880
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 2940
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 3000
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 3060
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 3120
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 3180
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3240
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3300
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3360
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3420
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3480
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3540
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 3600
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 3660
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 3720
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 3780
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 3840
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 3900
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 3960
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4020
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4080
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 4140
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 4200
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4260
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4320
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4380
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4440
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4500
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4560
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 4620
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 4680
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 4740
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 4800
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 4860
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 4920
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 4980
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 5040
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 5100
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 5160
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 5220
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5280
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5340
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5400
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5460
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5520
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 5580
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 5640
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 5700
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 5760
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 5820
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 5880
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 5940
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 6000
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 6060
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 6120
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 6180
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6240
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6300
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6360
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6420
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6480
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6540
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6600
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6660
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6720
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 6780
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 6840
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 6900
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 6960
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 7020
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 7080
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 7140
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 7200
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7260
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7320
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7380
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7440
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7500
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7560
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 7620
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 7680
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 7740
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 7800
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 7860
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 7920
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 7980
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 8040
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 8100
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 8160
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 8220
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8280
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8340
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8400
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8460
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8520
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 8580
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 8640
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 8700
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 8760
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 8820
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 8880
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 8940
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 9000
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 9060
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 9120
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 9180
gggacagcag agatccactt tggcgccggc tcgaggggg 9219
<210> 21
<211> 191
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 21
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Leu Ser Tyr Glu Thr Glu
85 90 95
Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu
100 105 110
Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile
115 120 125
Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val
130 135 140
Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp
145 150 155 160
His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile
165 170 175
Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
180 185 190
<210> 22
<211> 9516
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 22
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcatctccc gccgtgcaca gggtgtcacg 780
ttgcaagacc tgcctgaaac cgaactgccc gctgttctgc agccggtcgc ggaggccatg 840
gatgcgatcg ctgcggccga tcttagccag acgagcgggt tcggcccatt cggaccgcaa 900
ggaatcggtc aatacactac atggcgtgat ttcatatgcg cgattgctga tccccatgtg 960
tatcactggc aaactgtgat ggacgacacc gtcagtgcgt ccgtcgcgca ggctctcgat 1020
gagctgatgc tttgggccga ggactgcccc gaagtccggc acctcgtgca cgcggatttc 1080
ggctccaaca atgtcctgac ggacaatggc cgcataacag cggtcattga ctggagcgag 1140
gcgatgttcg gggattccca atacgaggtc gccaacatct tcttctggag gccgtggttg 1200
gcttgtatgg agcagcagac gcgctacttc gagcggaggc atccggagct tgcaggatcg 1260
ccgcggctcc gggcgtatat gctccgcatt ggtcttgacc aactctatca gagcttggtt 1320
gacggcaatt tcgatgatgc agcttgggcg cagggtcgat gcgacgcaat cgtccgatcc 1380
ggagccggga ctgtcgggcg tacacaaatc gcccgcagaa gcgcggccgt ctggaccgat 1440
ggctgtgtag aagtactcgc cgatagtgga aaccgacgcc ccagcactcg tccgagggca 1500
aaggaatagt taattaagaa ttcgacccag ctttcttgta caaagtggtt ggtaagccta 1560
tccctaaccc tctcctcggt ctcgattcta cgtagtaatg agctagcagt ctcgaggtta 1620
acgaattccg ccccccccct aacgttactg gccgaagccg cttggaataa ggccggtgtg 1680
cgcttgtcta tatgttattt tccaccatat tgccgtcttt tggcaatgtg agggcccgga 1740
aacctggccc tgtcttcttg acgagcattc ctaggggtct ttcccctctc gccaaaggaa 1800
tgcaaggtct gttgaatgtc gtgaaggaag cagttcctct ggaagcttct tgaagacaaa 1860
caacgtctgt agcgaccctt tgcaggcagc ggaacccccc acctggcgac aggtgcccct 1920
gcggccaaaa gccacgtgta taagatacac ctgcaaaggc ggcacaaccc cagtgccacg 1980
ttgtgagttg gatagttgtg gaaagagtca aatggctctc ctcaagcgta ttcaacaagg 2040
ggctgaagga tgcccagaag gtaccccatt gtatgggatc tgatctgggg cctcggtgca 2100
catgctttac atgtgtttag tcgaggttaa aaaaacgtct aggccccccg aaccacgggg 2160
acgtggtttt cctttgaaaa acacgataat accatggtga gcaagggcga ggaggataac 2220
atggccatca tcaaggagtt catgcgcttc aaggtgcaca tggagggctc cgtgaacggc 2280
cacgagttcg agatcgaggg cgagggcgag ggccgcccct acgagggcac ccagaccgcc 2340
aagctgaagg tgaccaaggg tggccccctg cccttcgcct gggacatcct gtcccctcag 2400
ttcatgtacg gctccaaggc ctacgtgaag caccccgccg acatccccga ctacttgaag 2460
ctgtccttcc ccgagggctt caagtgggag cgcgtgatga acttcgagga cggcggcgtg 2520
gtgaccgtga cccaggactc ctccctgcag gacggcgagt tcatctacaa ggtgaagctg 2580
cgcggcacca acttcccctc cgacggcccc gtaatgcaga agaagaccat gggctgggag 2640
gcctcctccg agcggatgta ccccgaggac ggcgccctga agggcgagat caagcagagg 2700
ctgaagctga aggacggcgg ccactacgac gctgaggtca agaccaccta caaggccaag 2760
aagcccgtgc agctgcccgg cgcctacaac gtcaacatca agttggacat cacctcccac 2820
aacgaggact acaccatcgt ggaacagtac gaacgcgccg agggccgcca ctccaccggc 2880
ggcatggacg agctgtacaa gtaacaccgg tggcgcgtta agtcgacaat caacctctgg 2940
attacaaaat ttgtgaaaga ttgactggta ttcttaacta tgttgctcct tttacgctat 3000
gtggatacgc tgctttaatg cctttgtatc atgctattgc ttcccgtatg gctttcattt 3060
tctcctcctt gtataaatcc tggttgctgt ctctttatga ggagttgtgg cccgttgtca 3120
ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac ccccactggt tggggcattg 3180
ccaccacctg tcagctcctt tccgggactt tcgctttccc cctccctatt gccacggcgg 3240
aactcatcgc cgcctgcctt gcccgctgct ggacaggggc tcggctgttg ggcactgaca 3300
attccgtggt gttgtcgggg aaatcatcgt cctttccttg gctgctcgcc tgtgttgcca 3360
cctggattct gcgcgggacg tccttctgct acgtcccttc ggccctcaat ccagcggacc 3420
ttccttcccg cggcctgctg ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc 3480
agacgagtcg gatctccctt tgggccgcct ccccgcgtcg actttaagac caatgactta 3540
caaggcagct gtagatctta gccacttttt aaaagaaaag gggggactgg aagggctaat 3600
tcactcccaa cgaagacaag atctgctttt tgcttgtact gggtctctct ggttagacca 3660
gatctgagcc tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag 3720
cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag 3780
atccctcaga cccttttagt cagtgtggaa aatctctagc agtacgtata gtagttcatg 3840
tcatcttatt attcagtatt tataacttgc aaagaaatga atatcagaga gtgagaggaa 3900
cttgtttatt gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa 3960
taaagcattt ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta 4020
tcatgtctgg ctctagctat cccgccccta actccgccca tcccgcccct aactccgccc 4080
agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 4140
gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggg 4200
acgtacccaa ttcgccctat agtgagtcgt attacgcgcg ctcactggcc gtcgttttac 4260
aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 4320
ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 4380
gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 4440
tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt 4500
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 4560
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg 4620
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 4680
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 4740
cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg 4800
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaatttagg 4860
tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 4920
aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 4980
gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 5040
ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 5100
gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 5160
tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 5220
attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 5280
tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 5340
agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 5400
aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 5460
tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 5520
cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac 5580
tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact 5640
tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg 5700
tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt 5760
tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat 5820
aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta 5880
gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa 5940
tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 6000
aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 6060
aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 6120
tccgaaggta actggcttca gcagagcgca gataccaaat actgttcttc tagtgtagcc 6180
gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 6240
cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 6300
acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 6360
cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag 6420
cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 6480
aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 6540
gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 6600
atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 6660
tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga 6720
gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 6780
agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga ttcattaatg 6840
cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg caattaatgt 6900
gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt 6960
gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc 7020
caagcgcgca attaaccctc actaaaggga acaaaagctg gagctgcaag cttaatgtag 7080
tcttatgcaa tactcttgta gtcttgcaac atggtaacga tgagttagca acatgcctta 7140
caaggagaga aaaagcaccg tgcatgccga ttggtggaag taaggtggta cgatcgtgcc 7200
ttattaggaa ggcaacagac gggtctgaca tggattggac gaaccactga attgccgcat 7260
tgcagagata ttgtatttaa gtgcctagct cgatacataa acgggtctct ctggttagac 7320
cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa 7380
agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc tggtaactag 7440
agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtggcgc ccgaacaggg 7500
acttgaaagc gaaagggaaa ccagaggagc tctctcgacg caggactcgg cttgctgaag 7560
cgcgcacggc aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg 7620
gaggctagaa ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat 7680
cgcgatggga aaaaattcgg ttaaggccag ggggaaagaa aaaatataaa ttaaaacata 7740
tagtatgggc aagcagggag ctagaacgat tcgcagttaa tcctggcctg ttagaaacat 7800
cagaaggctg tagacaaata ctgggacagc tacaaccatc ccttcagaca ggatcagaag 7860
aacttagatc attatataat acagtagcaa ccctctattg tgtgcatcaa aggatagaga 7920
taaaagacac caaggaagct ttagacaaga tagaggaaga gcaaaacaaa agtaagacca 7980
ccgcacagca agcggccgct gatcttcaga cctggaggag gagatatgag ggacaattgg 8040
agaagtgaat tatataaata taaagtagta aaaattgaac cattaggagt agcacccacc 8100
aaggcaaaga gaagagtggt gcagagagaa aaaagagcag tgggaatagg agctttgttc 8160
cttgggttct tgggagcagc aggaagcact atgggcgcag cgtcaatgac gctgacggta 8220
caggccagac aattattgtc tggtatagtg cagcagcaga acaatttgct gagggctatt 8280
gaggcgcaac agcatctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga 8340
atcctggctg tggaaagata cctaaaggat caacagctcc tggggatttg gggttgctct 8400
ggaaaactca tttgcaccac tgctgtgcct tggaatgcta gttggagtaa taaatctctg 8460
gaacagattt ggaatcacac gacctggatg gagtgggaca gagaaattaa caattacaca 8520
agcttaatac actccttaat tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa 8580
ttattggaat tagataaatg ggcaagtttg tggaattggt ttaacataac aaattggctg 8640
tggtatataa aattattcat aatgatagta ggaggcttgg taggtttaag aatagttttt 8700
gctgtacttt ctatagtgaa tagagttagg cagggatatt caccattatc gtttcagacc 8760
cacctcccaa ccccgagggg acccttgcgc cttttccaag gcagccctgg gtttgcgcag 8820
ggacgcggct gctctgggcg tggttccggg aaacgcagcg gcgccgaccc tgggtctcgc 8880
acattcttca cgtccgttcg cagcgtcacc cggatcttcg ccgctaccct tgtgggcccc 8940
ccggcgacgc ttcctgctcc gcccctaagt cgggaaggtt ccttgcggtt cgcggcgtgc 9000
cggacgtgac aaacggaagc cgcacgtctc actagtaccc tcgcagacgg acagcgccag 9060
ggagcaatgg cagcgcgccg accgcgatgg gctgtggcca atagcggctg ctcagcaggg 9120
cgcgccgaga gcagcggccg ggaaggggcg gtgcgggagg cggggtgtgg ggcggtagtg 9180
tgggccctgt tcctgcccgc gcggtgttcc gcattctgca agcctccgga gcgcacgtcg 9240
gcagtcggct ccctcgttga ccgaatcacc gacctctctc cccagggggt acccagctgt 9300
ctagagaatt ctagatcttg agacaaatgg cagtattcat ccacaatttt aaaagaaaag 9360
gggggattgg ggggtacagt gcaggggaaa gaatagtaga cataatagca acagacatac 9420
aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt tattacaggg 9480
acagcagaga tccactttgg cgccggctcg aggggg 9516
<210> 23
<211> 288
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 23
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ile Ser Arg Arg Ala Gln Gly Val Thr Leu Gln
35 40 45
Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu Gln Pro Val Ala Glu
50 55 60
Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser Gln Thr Ser Gly Phe
65 70 75 80
Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr Thr Thr Trp Arg Asp
85 90 95
Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr His Trp Gln Thr Val
100 105 110
Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln Ala Leu Asp Glu Leu
115 120 125
Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg His Leu Val His Ala
130 135 140
Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala
145 150 155 160
Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val
165 170 175
Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Met Glu Gln Gln
180 185 190
Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser Pro Arg
195 200 205
Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr Gln Ser
210 215 220
Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly Arg Cys
225 230 235 240
Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val Gly Arg Thr Gln Ile
245 250 255
Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly Cys Val Glu Val Leu
260 265 270
Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg Pro Arg Ala Lys Glu
275 280 285
<210> 24
<211> 9279
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 24
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct gtatcagtgg cgactccctg 1260
atctcactcg catgattaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1320
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1380
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1440
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1500
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1560
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1620
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1680
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1740
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 1800
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 1860
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 1920
acggggacgt ggttttcctt tgaaaaacac gataatacca tggccatgag cgagctgatt 1980
aaggagaaca tgcacatgaa gctgtacatg gagggcaccg tggacaacca tcacttcaag 2040
tgcacatccg agggcgaagg caagccctac gagggcaccc agaccatgag aatcaaggtg 2100
gtcgagggcg gccctctccc cttcgccttc gacatcctgg ctactagctt cctctacggc 2160
agcaagacct tcatcaacca cacccagggc atccccgact tcttcaagca gtccttccct 2220
gagggcttca catgggagag agtcaccaca tacgaagacg ggggcgtgct gaccgctacc 2280
caggacacca gcctccagga cggctgcctc atctacaacg tcaagatcag aggggtgaac 2340
ttcacatcca acggccctgt gatgcagaag aaaacactcg gctgggaggc cttcaccgag 2400
acgctgtacc ccgctgacgg cggcctggaa ggcagaaacg acatggccct gaagctcgtg 2460
ggcgggagcc atctgatcgc aaacatcaag accacatata gatccaagaa acccgctaag 2520
aacctcaaga tgcctggcgt ctactatgtg gactacagac tggaaagaat caaggaggcc 2580
aacaacgaga cctacgtcga gcagcacgag gtggcagtgg ccagatactg cgacctccct 2640
agcaaactgg ggcacaagct taattaacac cggtggcgcg ttaagtcgac aatcaacctc 2700
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 2760
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 2820
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 2880
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 2940
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 3000
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 3060
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 3120
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 3180
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 3240
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3300
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3360
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3420
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3480
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3540
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3600
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 3660
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 3720
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 3780
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 3840
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 3900
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 3960
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 4020
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4080
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4140
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 4200
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 4260
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4320
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4380
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4440
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4500
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4560
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4620
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 4680
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 4740
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 4800
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 4860
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 4920
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 4980
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 5040
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 5100
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 5160
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 5220
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 5280
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5340
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5400
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5460
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5520
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5580
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 5640
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 5700
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 5760
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 5820
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 5880
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 5940
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 6000
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 6060
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 6120
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 6180
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 6240
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6300
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6360
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6420
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6480
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6540
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6600
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6660
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6720
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6780
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 6840
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 6900
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 6960
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 7020
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 7080
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 7140
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 7200
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 7260
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7320
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7380
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7440
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7500
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7560
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7620
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 7680
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 7740
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 7800
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 7860
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 7920
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 7980
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 8040
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 8100
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 8160
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 8220
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 8280
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8340
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8400
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8460
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8520
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8580
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 8640
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 8700
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 8760
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 8820
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 8880
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 8940
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 9000
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 9060
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 9120
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 9180
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 9240
gggacagcag agatccactt tggcgccggc tcgaggggg 9279
<210> 25
<211> 211
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 25
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Cys Ile Ser Gly Asp Ser Leu Ile
195 200 205
Ser Leu Ala
210
<210> 26
<211> 9507
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 26
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagcac tggaaagcga 660
gttagcatca aggacttgct ggacgaaaag gatttcgaaa tttgggcaat caatgagcag 720
accatgaaac tggagtctgc aaaggtgtcc cgggtgtttt gcacgggtaa gaagcttgtt 780
tatatcctta aaactagact gggccggacg atcaaagcca ccgcgaacca cagattcttg 840
acaatcgacg ggtggaaacg gctggacgaa ctgagcttga aggagcacat cgcccttcct 900
cggaagctcg agtcatcttc cctgcagctg agtcccgaaa tcgaaaagct ctctcagagc 960
gatatatatt gggactccat cgtaagcata acagagacgg gggtcgagga ggtgttcgat 1020
ctgacagttc ctgggcctca taatttcgta gcgaacgaca tcattgtaca taactccaac 1080
aatgtcctga cggacaatgg ccgcataaca gcggtcattg actggagcga ggcgatgttc 1140
ggggattccc aatacgaggt cgccaacatc ttcttctgga ggccgtggtt ggcttgtatg 1200
gagcagcaga cgcgctactt cgagcggagg catccggagc ttgcaggatc gccgcggctc 1260
cgggcgtata tgctccgcat tggtcttgac caactctatc agagcttggt tgacggcaat 1320
ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc cggagccggg 1380
actgtcgggc gtacacaaat cgcccgcaga agcgcggccg tctggaccga tggctgtgta 1440
gaagtactcg ccgatagtgg aaaccgacgc cccagcactc gtccgagggc aaaggaatag 1500
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 1560
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1620
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1680
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1740
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1800
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1860
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 1920
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 1980
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 2040
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 2100
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 2160
tcctttgaaa aacacgataa taccatggtg agcaagggcg aggaggataa catggccatc 2220
atcaaggagt tcatgcgctt caaggtgcac atggagggct ccgtgaacgg ccacgagttc 2280
gagatcgagg gcgagggcga gggccgcccc tacgagggca cccagaccgc caagctgaag 2340
gtgaccaagg gtggccccct gcccttcgcc tgggacatcc tgtcccctca gttcatgtac 2400
ggctccaagg cctacgtgaa gcaccccgcc gacatccccg actacttgaa gctgtccttc 2460
cccgagggct tcaagtggga gcgcgtgatg aacttcgagg acggcggcgt ggtgaccgtg 2520
acccaggact cctccctgca ggacggcgag ttcatctaca aggtgaagct gcgcggcacc 2580
aacttcccct ccgacggccc cgtaatgcag aagaagacca tgggctggga ggcctcctcc 2640
gagcggatgt accccgagga cggcgccctg aagggcgaga tcaagcagag gctgaagctg 2700
aaggacggcg gccactacga cgctgaggtc aagaccacct acaaggccaa gaagcccgtg 2760
cagctgcccg gcgcctacaa cgtcaacatc aagttggaca tcacctccca caacgaggac 2820
tacaccatcg tggaacagta cgaacgcgcc gagggccgcc actccaccgg cggcatggac 2880
gagctgtaca agtaacaccg gtggcgcgtt aagtcgacaa tcaacctctg gattacaaaa 2940
tttgtgaaag attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg 3000
ctgctttaat gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct 3060
tgtataaatc ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg 3120
gcgtggtgtg cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct 3180
gtcagctcct ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg 3240
ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg 3300
tgttgtcggg gaaatcatcg tcctttcctt ggctgctcgc ctgtgttgcc acctggattc 3360
tgcgcgggac gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc 3420
gcggcctgct gccggctctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc 3480
ggatctccct ttgggccgcc tccccgcgtc gactttaaga ccaatgactt acaaggcagc 3540
tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca 3600
acgaagacaa gatctgcttt ttgcttgtac tgggtctctc tggttagacc agatctgagc 3660
ctgggagctc tctggctaac tagggaaccc actgcttaag cctcaataaa gcttgccttg 3720
agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga gatccctcag 3780
acccttttag tcagtgtgga aaatctctag cagtacgtat agtagttcat gtcatcttat 3840
tattcagtat ttataacttg caaagaaatg aatatcagag agtgagagga acttgtttat 3900
tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt 3960
tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatgtctg 4020
gctctagcta tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc 4080
cattctccgc cccatggctg actaattttt tttatttatg cagaggccga ggccgcctcg 4140
gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg gacgtaccca 4200
attcgcccta tagtgagtcg tattacgcgc gctcactggc cgtcgtttta caacgtcgtg 4260
actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 4320
gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 4380
atggcgaatg ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc 4440
gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt 4500
cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag 4560
ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt 4620
cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt 4680
tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt 4740
cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt 4800
aacaaaaatt taacgcgaat tttaacaaaa tattaacgct tacaatttag gtggcacttt 4860
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 4920
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 4980
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 5040
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 5100
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 5160
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 5220
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 5280
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 5340
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 5400
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 5460
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 5520
tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 5580
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 5640
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 5700
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 5760
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 5820
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 5880
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 5940
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 6000
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 6060
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 6120
aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagg 6180
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 6240
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 6300
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 6360
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 6420
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 6480
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 6540
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 6600
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 6660
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 6720
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 6780
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 6840
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 6900
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 6960
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg ccaagcgcgc 7020
aattaaccct cactaaaggg aacaaaagct ggagctgcaa gcttaatgta gtcttatgca 7080
atactcttgt agtcttgcaa catggtaacg atgagttagc aacatgcctt acaaggagag 7140
aaaaagcacc gtgcatgccg attggtggaa gtaaggtggt acgatcgtgc cttattagga 7200
aggcaacaga cgggtctgac atggattgga cgaaccactg aattgccgca ttgcagagat 7260
attgtattta agtgcctagc tcgatacata aacgggtctc tctggttaga ccagatctga 7320
gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 7380
tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 7440
agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag 7500
cgaaagggaa accagaggag ctctctcgac gcaggactcg gcttgctgaa gcgcgcacgg 7560
caagaggcga ggggcggcga ctggtgagta cgccaaaaat tttgactagc ggaggctaga 7620
aggagagaga tgggtgcgag agcgtcagta ttaagcgggg gagaattaga tcgcgatggg 7680
aaaaaattcg gttaaggcca gggggaaaga aaaaatataa attaaaacat atagtatggg 7740
caagcaggga gctagaacga ttcgcagtta atcctggcct gttagaaaca tcagaaggct 7800
gtagacaaat actgggacag ctacaaccat cccttcagac aggatcagaa gaacttagat 7860
cattatataa tacagtagca accctctatt gtgtgcatca aaggatagag ataaaagaca 7920
ccaaggaagc tttagacaag atagaggaag agcaaaacaa aagtaagacc accgcacagc 7980
aagcggccgc tgatcttcag acctggagga ggagatatga gggacaattg gagaagtgaa 8040
ttatataaat ataaagtagt aaaaattgaa ccattaggag tagcacccac caaggcaaag 8100
agaagagtgg tgcagagaga aaaaagagca gtgggaatag gagctttgtt ccttgggttc 8160
ttgggagcag caggaagcac tatgggcgca gcgtcaatga cgctgacggt acaggccaga 8220
caattattgt ctggtatagt gcagcagcag aacaatttgc tgagggctat tgaggcgcaa 8280
cagcatctgt tgcaactcac agtctggggc atcaagcagc tccaggcaag aatcctggct 8340
gtggaaagat acctaaagga tcaacagctc ctggggattt ggggttgctc tggaaaactc 8400
atttgcacca ctgctgtgcc ttggaatgct agttggagta ataaatctct ggaacagatt 8460
tggaatcaca cgacctggat ggagtgggac agagaaatta acaattacac aagcttaata 8520
cactccttaa ttgaagaatc gcaaaaccag caagaaaaga atgaacaaga attattggaa 8580
ttagataaat gggcaagttt gtggaattgg tttaacataa caaattggct gtggtatata 8640
aaattattca taatgatagt aggaggcttg gtaggtttaa gaatagtttt tgctgtactt 8700
tctatagtga atagagttag gcagggatat tcaccattat cgtttcagac ccacctccca 8760
accccgaggg gacccttgcg ccttttccaa ggcagccctg ggtttgcgca gggacgcggc 8820
tgctctgggc gtggttccgg gaaacgcagc ggcgccgacc ctgggtctcg cacattcttc 8880
acgtccgttc gcagcgtcac ccggatcttc gccgctaccc ttgtgggccc cccggcgacg 8940
cttcctgctc cgcccctaag tcgggaaggt tccttgcggt tcgcggcgtg ccggacgtga 9000
caaacggaag ccgcacgtct cactagtacc ctcgcagacg gacagcgcca gggagcaatg 9060
gcagcgcgcc gaccgcgatg ggctgtggcc aatagcggct gctcagcagg gcgcgccgag 9120
agcagcggcc gggaaggggc ggtgcgggag gcggggtgtg gggcggtagt gtgggccctg 9180
ttcctgcccg cgcggtgttc cgcattctgc aagcctccgg agcgcacgtc ggcagtcggc 9240
tccctcgttg accgaatcac cgacctctct ccccaggggg tacccagctg tctagagaat 9300
tctagatctt gagacaaatg gcagtattca tccacaattt taaaagaaaa ggggggattg 9360
gggggtacag tgcaggggaa agaatagtag acataatagc aacagacata caaactaaag 9420
aattacaaaa acaaattaca aaaattcaaa attttcgggt ttattacagg gacagcagag 9480
atccactttg gcgccggctc gaggggg 9507
<210> 27
<211> 285
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 27
Met Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp Glu Lys
1 5 10 15
Asp Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu Glu Ser
20 25 30
Ala Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val Tyr Ile
35 40 45
Leu Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn His Arg
50 55 60
Phe Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys
65 70 75 80
Glu His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu
85 90 95
Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp Ser
100 105 110
Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu Thr
115 120 125
Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His Asn
130 135 140
Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala Val Ile Asp
145 150 155 160
Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val Ala Asn Ile
165 170 175
Phe Phe Trp Arg Pro Trp Leu Ala Cys Met Glu Gln Gln Thr Arg Tyr
180 185 190
Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser Pro Arg Leu Arg Ala
195 200 205
Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr Gln Ser Leu Val Asp
210 215 220
Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly Arg Cys Asp Ala Ile
225 230 235 240
Val Arg Ser Gly Ala Gly Thr Val Gly Arg Thr Gln Ile Ala Arg Arg
245 250 255
Ser Ala Ala Val Trp Thr Asp Gly Cys Val Glu Val Leu Ala Asp Ser
260 265 270
Gly Asn Arg Arg Pro Ser Thr Arg Pro Arg Ala Lys Glu
275 280 285
<210> 28
<211> 9108
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 28
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcct ttcatacgag accgagatcc tgactgtcga gtacggattg 840
cttcctatcg gcaaaatcgt ggagaagagg attgaatgta ccgtctattc agtcgataat 900
aatgggaaca tctacacaca gcccgtggct caatggcacg acagaggaga gcaggaagtt 960
tttgaatact gtctcgagga cggatccctc atccgcgcta ctaaagatca taagtttatg 1020
accgtggacg gccagatgct gccaattgac gaaatttttg aacgagagct ggatctgatg 1080
agagtcgaca accttccaaa ctgattaatt aagaattcga cccagctttc ttgtacaaag 1140
tggttggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtag taatgagcta 1200
gcagtctcga ggttaacgaa ttccgccccc cccctaacgt tactggccga agccgcttgg 1260
aataaggccg gtgtgcgctt gtctatatgt tattttccac catattgccg tcttttggca 1320
atgtgagggc ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc 1380
ctctcgccaa aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag 1440
cttcttgaag acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg 1500
gcgacaggtg cccctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac 1560
aaccccagtg ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa 1620
gcgtattcaa caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc 1680
tggggcctcg gtgcacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc 1740
ccccgaacca cggggacgtg gttttccttt gaaaaacacg ataataccat ggccatgagc 1800
gagctgatta aggagaacat gcacatgaag ctgtacatgg agggcaccgt ggacaaccat 1860
cacttcaagt gcacatccga gggcgaaggc aagccctacg agggcaccca gaccatgaga 1920
atcaaggtgg tcgagggcgg ccctctcccc ttcgccttcg acatcctggc tactagcttc 1980
ctctacggca gcaagacctt catcaaccac acccagggca tccccgactt cttcaagcag 2040
tccttccctg agggcttcac atgggagaga gtcaccacat acgaagacgg gggcgtgctg 2100
accgctaccc aggacaccag cctccaggac ggctgcctca tctacaacgt caagatcaga 2160
ggggtgaact tcacatccaa cggccctgtg atgcagaaga aaacactcgg ctgggaggcc 2220
ttcaccgaga cgctgtaccc cgctgacggc ggcctggaag gcagaaacga catggccctg 2280
aagctcgtgg gcgggagcca tctgatcgca aacatcaaga ccacatatag atccaagaaa 2340
cccgctaaga acctcaagat gcctggcgtc tactatgtgg actacagact ggaaagaatc 2400
aaggaggcca acaacgagac ctacgtcgag cagcacgagg tggcagtggc cagatactgc 2460
gacctcccta gcaaactggg gcacaagctt aattaacacc ggtggcgcgt taagtcgaca 2520
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc 2580
cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta 2640
tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt 2700
ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg 2760
gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta 2820
ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt 2880
tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg 2940
cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca 3000
atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc 3060
gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgcgt cgactttaag 3120
accaatgact tacaaggcag ctgtagatct tagccacttt ttaaaagaaa aggggggact 3180
ggaagggcta attcactccc aacgaagaca agatctgctt tttgcttgta ctgggtctct 3240
ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa 3300
gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc 3360
tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtacgta 3420
tagtagttca tgtcatctta ttattcagta tttataactt gcaaagaaat gaatatcaga 3480
gagtgagagg aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 3540
aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 3600
caatgtatct tatcatgtct ggctctagct atcccgcccc taactccgcc catcccgccc 3660
ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 3720
gcagaggccg aggccgcctc ggcctctgag ctattccaga agtagtgagg aggctttttt 3780
ggaggcctag ggacgtaccc aattcgccct atagtgagtc gtattacgcg cgctcactgg 3840
ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 3900
cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 3960
cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc gcattaagcg 4020
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 4080
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 4140
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 4200
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 4260
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 4320
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 4380
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 4440
ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 4500
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 4560
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 4620
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 4680
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 4740
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 4800
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 4860
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 4920
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 4980
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5040
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5100
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5160
cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5220
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5280
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5340
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5400
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5460
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5520
cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 5580
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 5640
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 5700
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 5760
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 5820
cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 5880
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 5940
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6000
gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6060
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6120
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6180
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6240
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6300
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6360
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6420
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6480
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6540
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga 6600
ccatgattac gccaagcgcg caattaaccc tcactaaagg gaacaaaagc tggagctgca 6660
agcttaatgt agtcttatgc aatactcttg tagtcttgca acatggtaac gatgagttag 6720
caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc gattggtgga agtaaggtgg 6780
tacgatcgtg ccttattagg aaggcaacag acgggtctga catggattgg acgaaccact 6840
gaattgccgc attgcagaga tattgtattt aagtgcctag ctcgatacat aaacgggtct 6900
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 6960
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7020
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 7080
gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 7140
ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 7200
ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 7260
ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 7320
aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 7380
tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 7440
caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 7500
aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 7560
aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 7620
agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 7680
gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 7740
ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 7800
acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 7860
ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 7920
ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 7980
tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 8040
aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 8100
aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 8160
aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 8220
acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 8280
agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 8340
tcgtttcaga cccacctccc aaccccgagg ggacccttgc gccttttcca aggcagccct 8400
gggtttgcgc agggacgcgg ctgctctggg cgtggttccg ggaaacgcag cggcgccgac 8460
cctgggtctc gcacattctt cacgtccgtt cgcagcgtca cccggatctt cgccgctacc 8520
cttgtgggcc ccccggcgac gcttcctgct ccgcccctaa gtcgggaagg ttccttgcgg 8580
ttcgcggcgt gccggacgtg acaaacggaa gccgcacgtc tcactagtac cctcgcagac 8640
ggacagcgcc agggagcaat ggcagcgcgc cgaccgcgat gggctgtggc caatagcggc 8700
tgctcagcag ggcgcgccga gagcagcggc cgggaagggg cggtgcggga ggcggggtgt 8760
ggggcggtag tgtgggccct gttcctgccc gcgcggtgtt ccgcattctg caagcctccg 8820
gagcgcacgt cggcagtcgg ctccctcgtt gaccgaatca ccgacctctc tccccagggg 8880
gtacccagct gtctagagaa ttctagatct tgagacaaat ggcagtattc atccacaatt 8940
ttaaaagaaa aggggggatt ggggggtaca gtgcagggga aagaatagta gacataatag 9000
caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 9060
tttattacag ggacagcaga gatccacttt ggcgccggct cgaggggg 9108
<210> 29
<211> 154
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 29
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu
50 55 60
Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys
65 70 75 80
Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val
85 90 95
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu
100 105 110
Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr
115 120 125
Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu
130 135 140
Asp Leu Met Arg Val Asp Asn Leu Pro Asn
145 150
<210> 30
<211> 9627
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 30
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgccgatg gtttctacaa agatcgttat 780
gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga cattggggaa 840
tttagcgaga gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac gttgcaagac 900
ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg cggaggccat ggatgcgatc 960
gctgcggccg atcttagcca gacgagcggg ttcggcccat tcggaccaca aggaatcggt 1020
caatacacta catggcgtga tttcatatgc gcgattgctg atccccatgt gtatcactgg 1080
caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga tgagctgatg 1140
ctttgggccg aggactgccc cgaagtccgg cacctcgtgc acgcggattt cggctccaac 1200
aatgtcctga cggacaatgg ccgcataaca gcggtcattg actggagcga ggcgatgttc 1260
ggggattccc aatacgaggt cgccaacatc ttcttctgga ggccgtggtt ggcttgtatg 1320
gagcagcaga cgcgctactt cgagcggagg catccggagc ttgcaggatc gccgcggctc 1380
cgggcgtata tgctccgcat tggtcttgac caactctatc agagcttggt tgacggcaat 1440
ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc cggagccggg 1500
actgtcgggc gtacacaaat cgcccgcaga agcgcggccg tctggaccga tggctgtgta 1560
gaagtactcg ccgatagtgg aaaccgacgc cccagcactc gtccgagggc aaaggaatag 1620
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 1680
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1740
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1800
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1860
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1920
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1980
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 2040
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 2100
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 2160
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 2220
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 2280
tcctttgaaa aacacgataa taccatggtg agcaagggcg aggaggataa catggccatc 2340
atcaaggagt tcatgcgctt caaggtgcac atggagggct ccgtgaacgg ccacgagttc 2400
gagatcgagg gcgagggcga gggccgcccc tacgagggca cccagaccgc caagctgaag 2460
gtgaccaagg gtggccccct gcccttcgcc tgggacatcc tgtcccctca gttcatgtac 2520
ggctccaagg cctacgtgaa gcaccccgcc gacatccccg actacttgaa gctgtccttc 2580
cccgagggct tcaagtggga gcgcgtgatg aacttcgagg acggcggcgt ggtgaccgtg 2640
acccaggact cctccctgca ggacggcgag ttcatctaca aggtgaagct gcgcggcacc 2700
aacttcccct ccgacggccc cgtaatgcag aagaagacca tgggctggga ggcctcctcc 2760
gagcggatgt accccgagga cggcgccctg aagggcgaga tcaagcagag gctgaagctg 2820
aaggacggcg gccactacga cgctgaggtc aagaccacct acaaggccaa gaagcccgtg 2880
cagctgcccg gcgcctacaa cgtcaacatc aagttggaca tcacctccca caacgaggac 2940
tacaccatcg tggaacagta cgaacgcgcc gagggccgcc actccaccgg cggcatggac 3000
gagctgtaca agtaacaccg gtggcgcgtt aagtcgacaa tcaacctctg gattacaaaa 3060
tttgtgaaag attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg 3120
ctgctttaat gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct 3180
tgtataaatc ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg 3240
gcgtggtgtg cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct 3300
gtcagctcct ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg 3360
ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg 3420
tgttgtcggg gaaatcatcg tcctttcctt ggctgctcgc ctgtgttgcc acctggattc 3480
tgcgcgggac gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc 3540
gcggcctgct gccggctctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc 3600
ggatctccct ttgggccgcc tccccgcgtc gactttaaga ccaatgactt acaaggcagc 3660
tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca 3720
acgaagacaa gatctgcttt ttgcttgtac tgggtctctc tggttagacc agatctgagc 3780
ctgggagctc tctggctaac tagggaaccc actgcttaag cctcaataaa gcttgccttg 3840
agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga gatccctcag 3900
acccttttag tcagtgtgga aaatctctag cagtacgtat agtagttcat gtcatcttat 3960
tattcagtat ttataacttg caaagaaatg aatatcagag agtgagagga acttgtttat 4020
tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt 4080
tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatgtctg 4140
gctctagcta tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc 4200
cattctccgc cccatggctg actaattttt tttatttatg cagaggccga ggccgcctcg 4260
gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg gacgtaccca 4320
attcgcccta tagtgagtcg tattacgcgc gctcactggc cgtcgtttta caacgtcgtg 4380
actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 4440
gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 4500
atggcgaatg ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc 4560
gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt 4620
cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag 4680
ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt 4740
cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt 4800
tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt 4860
cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt 4920
aacaaaaatt taacgcgaat tttaacaaaa tattaacgct tacaatttag gtggcacttt 4980
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 5040
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 5100
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 5160
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 5220
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 5280
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 5340
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 5400
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 5460
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 5520
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 5580
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 5640
tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 5700
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 5760
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 5820
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 5880
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 5940
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 6000
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 6060
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 6120
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 6180
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 6240
aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagg 6300
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 6360
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 6420
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 6480
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 6540
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 6600
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 6660
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 6720
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 6780
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 6840
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 6900
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 6960
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 7020
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 7080
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg ccaagcgcgc 7140
aattaaccct cactaaaggg aacaaaagct ggagctgcaa gcttaatgta gtcttatgca 7200
atactcttgt agtcttgcaa catggtaacg atgagttagc aacatgcctt acaaggagag 7260
aaaaagcacc gtgcatgccg attggtggaa gtaaggtggt acgatcgtgc cttattagga 7320
aggcaacaga cgggtctgac atggattgga cgaaccactg aattgccgca ttgcagagat 7380
attgtattta agtgcctagc tcgatacata aacgggtctc tctggttaga ccagatctga 7440
gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 7500
tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 7560
agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag 7620
cgaaagggaa accagaggag ctctctcgac gcaggactcg gcttgctgaa gcgcgcacgg 7680
caagaggcga ggggcggcga ctggtgagta cgccaaaaat tttgactagc ggaggctaga 7740
aggagagaga tgggtgcgag agcgtcagta ttaagcgggg gagaattaga tcgcgatggg 7800
aaaaaattcg gttaaggcca gggggaaaga aaaaatataa attaaaacat atagtatggg 7860
caagcaggga gctagaacga ttcgcagtta atcctggcct gttagaaaca tcagaaggct 7920
gtagacaaat actgggacag ctacaaccat cccttcagac aggatcagaa gaacttagat 7980
cattatataa tacagtagca accctctatt gtgtgcatca aaggatagag ataaaagaca 8040
ccaaggaagc tttagacaag atagaggaag agcaaaacaa aagtaagacc accgcacagc 8100
aagcggccgc tgatcttcag acctggagga ggagatatga gggacaattg gagaagtgaa 8160
ttatataaat ataaagtagt aaaaattgaa ccattaggag tagcacccac caaggcaaag 8220
agaagagtgg tgcagagaga aaaaagagca gtgggaatag gagctttgtt ccttgggttc 8280
ttgggagcag caggaagcac tatgggcgca gcgtcaatga cgctgacggt acaggccaga 8340
caattattgt ctggtatagt gcagcagcag aacaatttgc tgagggctat tgaggcgcaa 8400
cagcatctgt tgcaactcac agtctggggc atcaagcagc tccaggcaag aatcctggct 8460
gtggaaagat acctaaagga tcaacagctc ctggggattt ggggttgctc tggaaaactc 8520
atttgcacca ctgctgtgcc ttggaatgct agttggagta ataaatctct ggaacagatt 8580
tggaatcaca cgacctggat ggagtgggac agagaaatta acaattacac aagcttaata 8640
cactccttaa ttgaagaatc gcaaaaccag caagaaaaga atgaacaaga attattggaa 8700
ttagataaat gggcaagttt gtggaattgg tttaacataa caaattggct gtggtatata 8760
aaattattca taatgatagt aggaggcttg gtaggtttaa gaatagtttt tgctgtactt 8820
tctatagtga atagagttag gcagggatat tcaccattat cgtttcagac ccacctccca 8880
accccgaggg gacccttgcg ccttttccaa ggcagccctg ggtttgcgca gggacgcggc 8940
tgctctgggc gtggttccgg gaaacgcagc ggcgccgacc ctgggtctcg cacattcttc 9000
acgtccgttc gcagcgtcac ccggatcttc gccgctaccc ttgtgggccc cccggcgacg 9060
cttcctgctc cgcccctaag tcgggaaggt tccttgcggt tcgcggcgtg ccggacgtga 9120
caaacggaag ccgcacgtct cactagtacc ctcgcagacg gacagcgcca gggagcaatg 9180
gcagcgcgcc gaccgcgatg ggctgtggcc aatagcggct gctcagcagg gcgcgccgag 9240
agcagcggcc gggaaggggc ggtgcgggag gcggggtgtg gggcggtagt gtgggccctg 9300
ttcctgcccg cgcggtgttc cgcattctgc aagcctccgg agcgcacgtc ggcagtcggc 9360
tccctcgttg accgaatcac cgacctctct ccccaggggg tacccagctg tctagagaat 9420
tctagatctt gagacaaatg gcagtattca tccacaattt taaaagaaaa ggggggattg 9480
gggggtacag tgcaggggaa agaatagtag acataatagc aacagacata caaactaaag 9540
aattacaaaa acaaattaca aaaattcaaa attttcgggt ttattacagg gacagcagag 9600
atccactttg gcgccggctc gaggggg 9627
<210> 31
<211> 325
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 31
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
35 40 45
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
50 55 60
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
65 70 75 80
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
85 90 95
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
100 105 110
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
115 120 125
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
130 135 140
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
145 150 155 160
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
165 170 175
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
180 185 190
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
195 200 205
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
210 215 220
Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu
225 230 235 240
Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp
245 250 255
Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp
260 265 270
Ala Gln Gly Arg Cys Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val
275 280 285
Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly
290 295 300
Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg
305 310 315 320
Pro Arg Ala Lys Glu
325
<210> 32
<211> 9672
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 32
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 1320
gaggtcgcca acatcttctt ctggaggccg tggttggctt gcctttcata cgagaccgag 1380
atcctgactg tcgagtacgg attgcttcct atcggcaaaa tcgtggagaa gaggattgaa 1440
tgtaccgtct attcagtcga taataatggg aacatctaca cacagcccgt ggctcaatgg 1500
cacgacagag gagagcagga agtttttgaa tactgtctcg aggacggatc cctcatccgc 1560
gctactaaag atcataagtt tatgaccgtg gacggccaga tgctgccaat tgacgaaatt 1620
tttgaacgag agctggatct gatgagagtc gacaaccttc caaactgatt aattaagaat 1680
tcgacccagc tttcttgtac aaagtggttg gtaagcctat ccctaaccct ctcctcggtc 1740
tcgattctac gtagtaatga gctagcagtc tcgaggttaa cgaattccgc ccccccccta 1800
acgttactgg ccgaagccgc ttggaataag gccggtgtgc gcttgtctat atgttatttt 1860
ccaccatatt gccgtctttt ggcaatgtga gggcccggaa acctggccct gtcttcttga 1920
cgagcattcc taggggtctt tcccctctcg ccaaaggaat gcaaggtctg ttgaatgtcg 1980
tgaaggaagc agttcctctg gaagcttctt gaagacaaac aacgtctgta gcgacccttt 2040
gcaggcagcg gaacccccca cctggcgaca ggtgcccctg cggccaaaag ccacgtgtat 2100
aagatacacc tgcaaaggcg gcacaacccc agtgccacgt tgtgagttgg atagttgtgg 2160
aaagagtcaa atggctctcc tcaagcgtat tcaacaaggg gctgaaggat gcccagaagg 2220
taccccattg tatgggatct gatctggggc ctcggtgcac atgctttaca tgtgtttagt 2280
cgaggttaaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 2340
cacgataata ccatggccat gagcgagctg attaaggaga acatgcacat gaagctgtac 2400
atggagggca ccgtggacaa ccatcacttc aagtgcacat ccgagggcga aggcaagccc 2460
tacgagggca cccagaccat gagaatcaag gtggtcgagg gcggccctct ccccttcgcc 2520
ttcgacatcc tggctactag cttcctctac ggcagcaaga ccttcatcaa ccacacccag 2580
ggcatccccg acttcttcaa gcagtccttc cctgagggct tcacatggga gagagtcacc 2640
acatacgaag acgggggcgt gctgaccgct acccaggaca ccagcctcca ggacggctgc 2700
ctcatctaca acgtcaagat cagaggggtg aacttcacat ccaacggccc tgtgatgcag 2760
aagaaaacac tcggctggga ggccttcacc gagacgctgt accccgctga cggcggcctg 2820
gaaggcagaa acgacatggc cctgaagctc gtgggcggga gccatctgat cgcaaacatc 2880
aagaccacat atagatccaa gaaacccgct aagaacctca agatgcctgg cgtctactat 2940
gtggactaca gactggaaag aatcaaggag gccaacaacg agacctacgt cgagcagcac 3000
gaggtggcag tggccagata ctgcgacctc cctagcaaac tggggcacaa gcttaattaa 3060
caccggtggc gcgttaagtc gacaatcaac ctctggatta caaaatttgt gaaagattga 3120
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 3180
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 3240
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3300
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3360
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3420
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3480
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3540
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3600
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3660
ccgcctcccc gcgtcgactt taagaccaat gacttacaag gcagctgtag atcttagcca 3720
ctttttaaaa gaaaaggggg gactggaagg gctaattcac tcccaacgaa gacaagatct 3780
gctttttgct tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg 3840
ctaactaggg aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt 3900
gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt 3960
gtggaaaatc tctagcagta cgtatagtag ttcatgtcat cttattattc agtatttata 4020
acttgcaaag aaatgaatat cagagagtga gaggaacttg tttattgcag cttataatgg 4080
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 4140
tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctggctct agctatcccg 4200
cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat 4260
ggctgactaa ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc 4320
cagaagtagt gaggaggctt ttttggaggc ctagggacgt acccaattcg ccctatagtg 4380
agtcgtatta cgcgcgctca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 4440
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 4500
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatgggacg 4560
cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4620
cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4680
tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4740
ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat 4800
cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4860
tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag 4920
ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4980
cgaattttaa caaaatatta acgcttacaa tttaggtggc acttttcggg gaaatgtgcg 5040
cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 5100
ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 5160
ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 5220
aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 5280
actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 5340
gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 5400
agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 5460
cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 5520
catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5580
aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5640
gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5700
aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5760
agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5820
ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5880
actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5940
aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 6000
gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 6060
atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 6120
tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 6180
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 6240
ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 6300
agcgcagata ccaaatactg ttcttctagt gtagccgtag ttaggccacc acttcaagaa 6360
ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 6420
tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 6480
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6540
cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa 6600
ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6660
agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6720
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6780
ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6840
ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6900
ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6960
accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 7020
ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 7080
ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 7140
atttcacaca ggaaacagct atgaccatga ttacgccaag cgcgcaatta accctcacta 7200
aagggaacaa aagctggagc tgcaagctta atgtagtctt atgcaatact cttgtagtct 7260
tgcaacatgg taacgatgag ttagcaacat gccttacaag gagagaaaaa gcaccgtgca 7320
tgccgattgg tggaagtaag gtggtacgat cgtgccttat taggaaggca acagacgggt 7380
ctgacatgga ttggacgaac cactgaattg ccgcattgca gagatattgt atttaagtgc 7440
ctagctcgat acataaacgg gtctctctgg ttagaccaga tctgagcctg ggagctctct 7500
ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt gcttcaagta 7560
gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc cttttagtca 7620
gtgtggaaaa tctctagcag tggcgcccga acagggactt gaaagcgaaa gggaaaccag 7680
aggagctctc tcgacgcagg actcggcttg ctgaagcgcg cacggcaaga ggcgaggggc 7740
ggcgactggt gagtacgcca aaaattttga ctagcggagg ctagaaggag agagatgggt 7800
gcgagagcgt cagtattaag cgggggagaa ttagatcgcg atgggaaaaa attcggttaa 7860
ggccaggggg aaagaaaaaa tataaattaa aacatatagt atgggcaagc agggagctag 7920
aacgattcgc agttaatcct ggcctgttag aaacatcaga aggctgtaga caaatactgg 7980
gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacag 8040
tagcaaccct ctattgtgtg catcaaagga tagagataaa agacaccaag gaagctttag 8100
acaagataga ggaagagcaa aacaaaagta agaccaccgc acagcaagcg gccgctgatc 8160
ttcagacctg gaggaggaga tatgagggac aattggagaa gtgaattata taaatataaa 8220
gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg caaagagaag agtggtgcag 8280
agagaaaaaa gagcagtggg aataggagct ttgttccttg ggttcttggg agcagcagga 8340
agcactatgg gcgcagcgtc aatgacgctg acggtacagg ccagacaatt attgtctggt 8400
atagtgcagc agcagaacaa tttgctgagg gctattgagg cgcaacagca tctgttgcaa 8460
ctcacagtct ggggcatcaa gcagctccag gcaagaatcc tggctgtgga aagataccta 8520
aaggatcaac agctcctggg gatttggggt tgctctggaa aactcatttg caccactgct 8580
gtgccttgga atgctagttg gagtaataaa tctctggaac agatttggaa tcacacgacc 8640
tggatggagt gggacagaga aattaacaat tacacaagct taatacactc cttaattgaa 8700
gaatcgcaaa accagcaaga aaagaatgaa caagaattat tggaattaga taaatgggca 8760
agtttgtgga attggtttaa cataacaaat tggctgtggt atataaaatt attcataatg 8820
atagtaggag gcttggtagg tttaagaata gtttttgctg tactttctat agtgaataga 8880
gttaggcagg gatattcacc attatcgttt cagacccacc tcccaacccc gaggggaccc 8940
ttgcgccttt tccaaggcag ccctgggttt gcgcagggac gcggctgctc tgggcgtggt 9000
tccgggaaac gcagcggcgc cgaccctggg tctcgcacat tcttcacgtc cgttcgcagc 9060
gtcacccgga tcttcgccgc tacccttgtg ggccccccgg cgacgcttcc tgctccgccc 9120
ctaagtcggg aaggttcctt gcggttcgcg gcgtgccgga cgtgacaaac ggaagccgca 9180
cgtctcacta gtaccctcgc agacggacag cgccagggag caatggcagc gcgccgaccg 9240
cgatgggctg tggccaatag cggctgctca gcagggcgcg ccgagagcag cggccgggaa 9300
ggggcggtgc gggaggcggg gtgtggggcg gtagtgtggg ccctgttcct gcccgcgcgg 9360
tgttccgcat tctgcaagcc tccggagcgc acgtcggcag tcggctccct cgttgaccga 9420
atcaccgacc tctctcccca gggggtaccc agctgtctag agaattctag atcttgagac 9480
aaatggcagt attcatccac aattttaaaa gaaaaggggg gattgggggg tacagtgcag 9540
gggaaagaat agtagacata atagcaacag acatacaaac taaagaatta caaaaacaaa 9600
ttacaaaaat tcaaaatttt cgggtttatt acagggacag cagagatcca ctttggcgcc 9660
ggctcgaggg gg 9672
<210> 33
<211> 342
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 33
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
195 200 205
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
210 215 220
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
225 230 235 240
Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu
245 250 255
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser
260 265 270
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His
275 280 285
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser
290 295 300
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln
305 310 315 320
Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg
325 330 335
Val Asp Asn Leu Pro Asn
340
<210> 34
<211> 9063
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 34
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cgtaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgtatggagc agcagacgcg ctacttcgag 780
cggaggcatc cggagcttgc aggatcgccg cggctccggg cgtatatgct ccgcattggt 840
cttgaccaac tctatcagag cttggttgac ggcaatttcg atgatgcagc ttgggcgcag 900
ggtcgatgcg acgcaatcgt ccgatccgga gccgggactg tcgggcgtac acaaatcgcc 960
cgcagaagcg cggccgtctg gaccgatggc tgtgtagaag tactcgccga tagtggaaac 1020
cgacgcccca gcactcgtcc gagggcaaag gaatagttaa ttaagaattc gacccagctt 1080
tcttgtacaa agtggttggt aagcctatcc ctaaccctct cctcggtctc gattctacgt 1140
agtaatgagc tagcagtctc gaggttaacg aattccgccc cccccctaac gttactggcc 1200
gaagccgctt ggaataaggc cggtgtgcgc ttgtctatat gttattttcc accatattgc 1260
cgtcttttgg caatgtgagg gcccggaaac ctggccctgt cttcttgacg agcattccta 1320
ggggtctttc ccctctcgcc aaaggaatgc aaggtctgtt gaatgtcgtg aaggaagcag 1380
ttcctctgga agcttcttga agacaaacaa cgtctgtagc gaccctttgc aggcagcgga 1440
accccccacc tggcgacagg tgcccctgcg gccaaaagcc acgtgtataa gatacacctg 1500
caaaggcggc acaaccccag tgccacgttg tgagttggat agttgtggaa agagtcaaat 1560
ggctctcctc aagcgtattc aacaaggggc tgaaggatgc ccagaaggta ccccattgta 1620
tgggatctga tctggggcct cggtgcacat gctttacatg tgtttagtcg aggttaaaaa 1680
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1740
atggtgagca agggcgagga ggataacatg gccatcatca aggagttcat gcgcttcaag 1800
gtgcacatgg agggctccgt gaacggccac gagttcgaga tcgagggcga gggcgagggc 1860
cgcccctacg agggcaccca gaccgccaag ctgaaggtga ccaagggtgg ccccctgccc 1920
ttcgcctggg acatcctgtc ccctcagttc atgtacggct ccaaggccta cgtgaagcac 1980
cccgccgaca tccccgacta cttgaagctg tccttccccg agggcttcaa gtgggagcgc 2040
gtgatgaact tcgaggacgg cggcgtggtg accgtgaccc aggactcctc cctgcaggac 2100
ggcgagttca tctacaaggt gaagctgcgc ggcaccaact tcccctccga cggccccgta 2160
atgcagaaga agaccatggg ctgggaggcc tcctccgagc ggatgtaccc cgaggacggc 2220
gccctgaagg gcgagatcaa gcagaggctg aagctgaagg acggcggcca ctacgacgct 2280
gaggtcaaga ccacctacaa ggccaagaag cccgtgcagc tgcccggcgc ctacaacgtc 2340
aacatcaagt tggacatcac ctcccacaac gaggactaca ccatcgtgga acagtacgaa 2400
cgcgccgagg gccgccactc caccggcggc atggacgagc tgtacaagta acaccggtgg 2460
cgcgttaagt cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc 2520
ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg 2580
ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc 2640
tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg 2700
acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg 2760
ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga 2820
caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct 2880
ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg 2940
tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc 3000
ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc 3060
cgcgtcgact ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa 3120
agaaaagggg ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc 3180
ttgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg 3240
gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg 3300
tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat 3360
ctctagcagt acgtatagta gttcatgtca tcttattatt cagtatttat aacttgcaaa 3420
gaaatgaata tcagagagtg agaggaactt gtttattgca gcttataatg gttacaaata 3480
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 3540
tttgtccaaa ctcatcaatg tatcttatca tgtctggctc tagctatccc gcccctaact 3600
ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta 3660
atttttttta tttatgcaga ggccgaggcc gcctcggcct ctgagctatt ccagaagtag 3720
tgaggaggct tttttggagg cctagggacg tacccaattc gccctatagt gagtcgtatt 3780
acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 3840
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 3900
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta 3960
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 4020
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 4080
ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc 4140
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 4200
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4260
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc 4320
cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta 4380
acaaaatatt aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc 4440
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 4500
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 4560
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 4620
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 4680
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 4740
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 4800
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 4860
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 4920
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 4980
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 5040
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 5100
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 5160
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 5220
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 5280
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 5340
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 5400
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 5460
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 5520
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 5580
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 5640
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 5700
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 5760
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 5820
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 5880
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 5940
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 6000
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 6060
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 6120
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 6180
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 6240
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 6300
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 6360
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 6420
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 6480
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 6540
aggaaacagc tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca 6600
aaagctggag ctgcaagctt aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg 6660
gtaacgatga gttagcaaca tgccttacaa ggagagaaaa agcaccgtgc atgccgattg 6720
gtggaagtaa ggtggtacga tcgtgcctta ttaggaaggc aacagacggg tctgacatgg 6780
attggacgaa ccactgaatt gccgcattgc agagatattg tatttaagtg cctagctcga 6840
tacataaacg ggtctctctg gttagaccag atctgagcct gggagctctc tggctaacta 6900
gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc 6960
cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa 7020
atctctagca gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct 7080
ctcgacgcag gactcggctt gctgaagcgc gcacggcaag aggcgagggg cggcgactgg 7140
tgagtacgcc aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg 7200
tcagtattaa gcgggggaga attagatcgc gatgggaaaa aattcggtta aggccagggg 7260
gaaagaaaaa atataaatta aaacatatag tatgggcaag cagggagcta gaacgattcg 7320
cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg ggacagctac 7380
aaccatccct tcagacagga tcagaagaac ttagatcatt atataataca gtagcaaccc 7440
tctattgtgt gcatcaaagg atagagataa aagacaccaa ggaagcttta gacaagatag 7500
aggaagagca aaacaaaagt aagaccaccg cacagcaagc ggccgctgat cttcagacct 7560
ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa 7620
attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa 7680
agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg 7740
ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag 7800
cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc 7860
tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa 7920
cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg 7980
aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag 8040
tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa 8100
aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg 8160
aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga 8220
ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag 8280
ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc cttgcgcctt 8340
ttccaaggca gccctgggtt tgcgcaggga cgcggctgct ctgggcgtgg ttccgggaaa 8400
cgcagcggcg ccgaccctgg gtctcgcaca ttcttcacgt ccgttcgcag cgtcacccgg 8460
atcttcgccg ctacccttgt gggccccccg gcgacgcttc ctgctccgcc cctaagtcgg 8520
gaaggttcct tgcggttcgc ggcgtgccgg acgtgacaaa cggaagccgc acgtctcact 8580
agtaccctcg cagacggaca gcgccaggga gcaatggcag cgcgccgacc gcgatgggct 8640
gtggccaata gcggctgctc agcagggcgc gccgagagca gcggccggga aggggcggtg 8700
cgggaggcgg ggtgtggggc ggtagtgtgg gccctgttcc tgcccgcgcg gtgttccgca 8760
ttctgcaagc ctccggagcg cacgtcggca gtcggctccc tcgttgaccg aatcaccgac 8820
ctctctcccc agggggtacc cagctgtcta gagaattcta gatcttgaga caaatggcag 8880
tattcatcca caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa 8940
tagtagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa attacaaaaa 9000
ttcaaaattt tcgggtttat tacagggaca gcagagatcc actttggcgc cggctcgagg 9060
ggg 9063
<210> 35
<211> 137
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 35
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg
35 40 45
His Pro Glu Leu Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg
50 55 60
Ile Gly Leu Asp Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp
65 70 75 80
Asp Ala Ala Trp Ala Gln Gly Arg Cys Asp Ala Ile Val Arg Ser Gly
85 90 95
Ala Gly Thr Val Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val
100 105 110
Trp Thr Asp Gly Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg
115 120 125
Pro Ser Thr Arg Pro Arg Ala Lys Glu
130 135
<210> 36
<211> 9828
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 36
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgacacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 1320
gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 1380
tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 1440
cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct 1500
tgggcgcagg gtcgatgcct ttcatacgag accgagatcc tgactgtcga gtacggattg 1560
cttcctatcg gcaaaatcgt ggagaagagg attgaatgta ccgtctattc agtcgataat 1620
aatgggaaca tctacacaca gcccgtggct caatggcacg acagaggaga gcaggaagtt 1680
tttgaatact gtctcgagga cggatccctc atccgcgcta ctaaagatca taagtttatg 1740
accgtggacg gccagatgct gccaattgac gaaatttttg aacgagagct ggatctgatg 1800
agagtcgaca accttccaaa ctgattaatt aagaattcga cccagctttc ttgtacaaag 1860
tggttggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtag taatgagcta 1920
gcagtctcga ggttaacgaa ttccgccccc cccctaacgt tactggccga agccgcttgg 1980
aataaggccg gtgtgcgctt gtctatatgt tattttccac catattgccg tcttttggca 2040
atgtgagggc ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc 2100
ctctcgccaa aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag 2160
cttcttgaag acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg 2220
gcgacaggtg cccctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac 2280
aaccccagtg ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa 2340
gcgtattcaa caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc 2400
tggggcctcg gtgcacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc 2460
ccccgaacca cggggacgtg gttttccttt gaaaaacacg ataataccat ggccatgagc 2520
gagctgatta aggagaacat gcacatgaag ctgtacatgg agggcaccgt ggacaaccat 2580
cacttcaagt gcacatccga gggcgaaggc aagccctacg agggcaccca gaccatgaga 2640
atcaaggtgg tcgagggcgg ccctctcccc ttcgccttcg acatcctggc tactagcttc 2700
ctctacggca gcaagacctt catcaaccac acccagggca tccccgactt cttcaagcag 2760
tccttccctg agggcttcac atgggagaga gtcaccacat acgaagacgg gggcgtgctg 2820
accgctaccc aggacaccag cctccaggac ggctgcctca tctacaacgt caagatcaga 2880
ggggtgaact tcacatccaa cggccctgtg atgcagaaga aaacactcgg ctgggaggcc 2940
ttcaccgaga cgctgtaccc cgctgacggc ggcctggaag gcagaaacga catggccctg 3000
aagctcgtgg gcgggagcca tctgatcgca aacatcaaga ccacatatag atccaagaaa 3060
cccgctaaga acctcaagat gcctggcgtc tactatgtgg actacagact ggaaagaatc 3120
aaggaggcca acaacgagac ctacgtcgag cagcacgagg tggcagtggc cagatactgc 3180
gacctcccta gcaaactggg gcacaagctt aattaacacc ggtggcgcgt taagtcgaca 3240
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc 3300
cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta 3360
tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt 3420
ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg 3480
gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta 3540
ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt 3600
tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg 3660
cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca 3720
atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc 3780
gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgcgt cgactttaag 3840
accaatgact tacaaggcag ctgtagatct tagccacttt ttaaaagaaa aggggggact 3900
ggaagggcta attcactccc aacgaagaca agatctgctt tttgcttgta ctgggtctct 3960
ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa 4020
gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc 4080
tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtacgta 4140
tagtagttca tgtcatctta ttattcagta tttataactt gcaaagaaat gaatatcaga 4200
gagtgagagg aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 4260
aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 4320
caatgtatct tatcatgtct ggctctagct atcccgcccc taactccgcc catcccgccc 4380
ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 4440
gcagaggccg aggccgcctc ggcctctgag ctattccaga agtagtgagg aggctttttt 4500
ggaggcctag ggacgtaccc aattcgccct atagtgagtc gtattacgcg cgctcactgg 4560
ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 4620
cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 4680
cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc gcattaagcg 4740
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 4800
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 4860
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 4920
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 4980
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 5040
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 5100
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 5160
ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 5220
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 5280
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 5340
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 5400
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 5460
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 5520
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 5580
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 5640
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 5700
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5760
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5820
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5880
cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5940
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 6000
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 6060
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 6120
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 6180
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 6240
cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 6300
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 6360
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 6420
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 6480
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 6540
cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 6600
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 6660
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6720
gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6780
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6840
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6900
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6960
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 7020
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 7080
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 7140
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 7200
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 7260
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga 7320
ccatgattac gccaagcgcg caattaaccc tcactaaagg gaacaaaagc tggagctgca 7380
agcttaatgt agtcttatgc aatactcttg tagtcttgca acatggtaac gatgagttag 7440
caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc gattggtgga agtaaggtgg 7500
tacgatcgtg ccttattagg aaggcaacag acgggtctga catggattgg acgaaccact 7560
gaattgccgc attgcagaga tattgtattt aagtgcctag ctcgatacat aaacgggtct 7620
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 7680
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7740
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 7800
gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 7860
ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 7920
ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 7980
ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 8040
aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 8100
tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 8160
caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 8220
aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 8280
aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 8340
agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 8400
gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 8460
ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 8520
acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 8580
ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 8640
ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 8700
tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 8760
aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 8820
aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 8880
aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 8940
acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 9000
agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 9060
tcgtttcaga cccacctccc aaccccgagg ggacccttgc gccttttcca aggcagccct 9120
gggtttgcgc agggacgcgg ctgctctggg cgtggttccg ggaaacgcag cggcgccgac 9180
cctgggtctc gcacattctt cacgtccgtt cgcagcgtca cccggatctt cgccgctacc 9240
cttgtgggcc ccccggcgac gcttcctgct ccgcccctaa gtcgggaagg ttccttgcgg 9300
ttcgcggcgt gccggacgtg acaaacggaa gccgcacgtc tcactagtac cctcgcagac 9360
ggacagcgcc agggagcaat ggcagcgcgc cgaccgcgat gggctgtggc caatagcggc 9420
tgctcagcag ggcgcgccga gagcagcggc cgggaagggg cggtgcggga ggcggggtgt 9480
ggggcggtag tgtgggccct gttcctgccc gcgcggtgtt ccgcattctg caagcctccg 9540
gagcgcacgt cggcagtcgg ctccctcgtt gaccgaatca ccgacctctc tccccagggg 9600
gtacccagct gtctagagaa ttctagatct tgagacaaat ggcagtattc atccacaatt 9660
ttaaaagaaa aggggggatt ggggggtaca gtgcagggga aagaatagta gacataatag 9720
caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 9780
tttattacag ggacagcaga gatccacttt ggcgccggct cgaggggg 9828
<210> 37
<211> 394
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 37
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
195 200 205
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
210 215 220
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
225 230 235 240
Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu
245 250 255
Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp
260 265 270
Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp
275 280 285
Ala Gln Gly Arg Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu
290 295 300
Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys
305 310 315 320
Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val
325 330 335
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu
340 345 350
Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr
355 360 365
Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu
370 375 380
Asp Leu Met Arg Val Asp Asn Leu Pro Asn
385 390
<210> 38
<211> 8907
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 38
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgacgcaa tcgtccgatc cggagccggg 780
actgtcgggc gtacacaaat cgcccgcaga agcgcggccg tctggaccga tggctgtgta 840
gaagtactcg ccgatagtgg aaaccgacgc cccagcactc gtccgagggc aaaggaatag 900
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 960
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1020
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1080
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1140
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1200
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1260
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 1320
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 1380
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 1440
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 1500
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 1560
tcctttgaaa aacacgataa taccatggtg agcaagggcg aggaggataa catggccatc 1620
atcaaggagt tcatgcgctt caaggtgcac atggagggct ccgtgaacgg ccacgagttc 1680
gagatcgagg gcgagggcga gggccgcccc tacgagggca cccagaccgc caagctgaag 1740
gtgaccaagg gtggccccct gcccttcgcc tgggacatcc tgtcccctca gttcatgtac 1800
ggctccaagg cctacgtgaa gcaccccgcc gacatccccg actacttgaa gctgtccttc 1860
cccgagggct tcaagtggga gcgcgtgatg aacttcgagg acggcggcgt ggtgaccgtg 1920
acccaggact cctccctgca ggacggcgag ttcatctaca aggtgaagct gcgcggcacc 1980
aacttcccct ccgacggccc cgtaatgcag aagaagacca tgggctggga ggcctcctcc 2040
gagcggatgt accccgagga cggcgccctg aagggcgaga tcaagcagag gctgaagctg 2100
aaggacggcg gccactacga cgctgaggtc aagaccacct acaaggccaa gaagcccgtg 2160
cagctgcccg gcgcctacaa cgtcaacatc aagttggaca tcacctccca caacgaggac 2220
tacaccatcg tggaacagta cgaacgcgcc gagggccgcc actccaccgg cggcatggac 2280
gagctgtaca agtaacaccg gtggcgcgtt aagtcgacaa tcaacctctg gattacaaaa 2340
tttgtgaaag attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg 2400
ctgctttaat gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct 2460
tgtataaatc ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg 2520
gcgtggtgtg cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct 2580
gtcagctcct ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg 2640
ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg 2700
tgttgtcggg gaaatcatcg tcctttcctt ggctgctcgc ctgtgttgcc acctggattc 2760
tgcgcgggac gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc 2820
gcggcctgct gccggctctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc 2880
ggatctccct ttgggccgcc tccccgcgtc gactttaaga ccaatgactt acaaggcagc 2940
tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca 3000
acgaagacaa gatctgcttt ttgcttgtac tgggtctctc tggttagacc agatctgagc 3060
ctgggagctc tctggctaac tagggaaccc actgcttaag cctcaataaa gcttgccttg 3120
agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga gatccctcag 3180
acccttttag tcagtgtgga aaatctctag cagtacgtat agtagttcat gtcatcttat 3240
tattcagtat ttataacttg caaagaaatg aatatcagag agtgagagga acttgtttat 3300
tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt 3360
tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatgtctg 3420
gctctagcta tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc 3480
cattctccgc cccatggctg actaattttt tttatttatg cagaggccga ggccgcctcg 3540
gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg gacgtaccca 3600
attcgcccta tagtgagtcg tattacgcgc gctcactggc cgtcgtttta caacgtcgtg 3660
actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 3720
gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 3780
atggcgaatg ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc 3840
gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt 3900
cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag 3960
ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt 4020
cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt 4080
tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt 4140
cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt 4200
aacaaaaatt taacgcgaat tttaacaaaa tattaacgct tacaatttag gtggcacttt 4260
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 4320
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 4380
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 4440
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 4500
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 4560
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 4620
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 4680
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 4740
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 4800
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 4860
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 4920
tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 4980
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 5040
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 5100
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 5160
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 5220
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 5280
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 5340
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 5400
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 5460
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 5520
aactggcttc agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagg 5580
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 5640
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 5700
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 5760
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 5820
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 5880
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 5940
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 6000
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 6060
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 6120
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 6180
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 6240
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 6300
cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 6360
tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg ccaagcgcgc 6420
aattaaccct cactaaaggg aacaaaagct ggagctgcaa gcttaatgta gtcttatgca 6480
atactcttgt agtcttgcaa catggtaacg atgagttagc aacatgcctt acaaggagag 6540
aaaaagcacc gtgcatgccg attggtggaa gtaaggtggt acgatcgtgc cttattagga 6600
aggcaacaga cgggtctgac atggattgga cgaaccactg aattgccgca ttgcagagat 6660
attgtattta agtgcctagc tcgatacata aacgggtctc tctggttaga ccagatctga 6720
gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 6780
tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 6840
agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag 6900
cgaaagggaa accagaggag ctctctcgac gcaggactcg gcttgctgaa gcgcgcacgg 6960
caagaggcga ggggcggcga ctggtgagta cgccaaaaat tttgactagc ggaggctaga 7020
aggagagaga tgggtgcgag agcgtcagta ttaagcgggg gagaattaga tcgcgatggg 7080
aaaaaattcg gttaaggcca gggggaaaga aaaaatataa attaaaacat atagtatggg 7140
caagcaggga gctagaacga ttcgcagtta atcctggcct gttagaaaca tcagaaggct 7200
gtagacaaat actgggacag ctacaaccat cccttcagac aggatcagaa gaacttagat 7260
cattatataa tacagtagca accctctatt gtgtgcatca aaggatagag ataaaagaca 7320
ccaaggaagc tttagacaag atagaggaag agcaaaacaa aagtaagacc accgcacagc 7380
aagcggccgc tgatcttcag acctggagga ggagatatga gggacaattg gagaagtgaa 7440
ttatataaat ataaagtagt aaaaattgaa ccattaggag tagcacccac caaggcaaag 7500
agaagagtgg tgcagagaga aaaaagagca gtgggaatag gagctttgtt ccttgggttc 7560
ttgggagcag caggaagcac tatgggcgca gcgtcaatga cgctgacggt acaggccaga 7620
caattattgt ctggtatagt gcagcagcag aacaatttgc tgagggctat tgaggcgcaa 7680
cagcatctgt tgcaactcac agtctggggc atcaagcagc tccaggcaag aatcctggct 7740
gtggaaagat acctaaagga tcaacagctc ctggggattt ggggttgctc tggaaaactc 7800
atttgcacca ctgctgtgcc ttggaatgct agttggagta ataaatctct ggaacagatt 7860
tggaatcaca cgacctggat ggagtgggac agagaaatta acaattacac aagcttaata 7920
cactccttaa ttgaagaatc gcaaaaccag caagaaaaga atgaacaaga attattggaa 7980
ttagataaat gggcaagttt gtggaattgg tttaacataa caaattggct gtggtatata 8040
aaattattca taatgatagt aggaggcttg gtaggtttaa gaatagtttt tgctgtactt 8100
tctatagtga atagagttag gcagggatat tcaccattat cgtttcagac ccacctccca 8160
accccgaggg gacccttgcg ccttttccaa ggcagccctg ggtttgcgca gggacgcggc 8220
tgctctgggc gtggttccgg gaaacgcagc ggcgccgacc ctgggtctcg cacattcttc 8280
acgtccgttc gcagcgtcac ccggatcttc gccgctaccc ttgtgggccc cccggcgacg 8340
cttcctgctc cgcccctaag tcgggaaggt tccttgcggt tcgcggcgtg ccggacgtga 8400
caaacggaag ccgcacgtct cactagtacc ctcgcagacg gacagcgcca gggagcaatg 8460
gcagcgcgcc gaccgcgatg ggctgtggcc aatagcggct gctcagcagg gcgcgccgag 8520
agcagcggcc gggaaggggc ggtgcgggag gcggggtgtg gggcggtagt gtgggccctg 8580
ttcctgcccg cgcggtgttc cgcattctgc aagcctccgg agcgcacgtc ggcagtcggc 8640
tccctcgttg accgaatcac cgacctctct ccccaggggg tacccagctg tctagagaat 8700
tctagatctt gagacaaatg gcagtattca tccacaattt taaaagaaaa ggggggattg 8760
gggggtacag tgcaggggaa agaatagtag acataatagc aacagacata caaactaaag 8820
aattacaaaa acaaattaca aaaattcaaa attttcgggt ttattacagg gacagcagag 8880
atccactttg gcgccggctc gaggggg 8907
<210> 39
<211> 85
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 39
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val
35 40 45
Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly
50 55 60
Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg
65 70 75 80
Pro Arg Ala Lys Glu
85
<210> 40
<211> 9261
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 40
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgaaaac atttaacatt 660
tctcaacagg atctagaatt agtagaagta gcgacagaga agattacaat gctttatgag 720
gataataaac atcatgtggg agcggcaatt cgtacgaaaa caggagaaat catttcggca 780
gtacatattg aagcgtatat aggacgagta actgtttgtg cagaagccat tgcgattggt 840
agtgcagttt cgaatggaca aaaggatttt gacacgattg tagctgttag acacccttat 900
tctgacgaag tagatagaag tattcgagtg gtaagtcctt gtggtatgtg cctttcatac 960
gagaccgaga tcctgactgt cgagtacgga ttgcttccta tcggcaaaat cgtggagaag 1020
aggattgaat gtaccgtcta ttcagtcgat aataatggga acatctacac acagcccgtg 1080
gctcaatggc acgacagagg agagcaggaa gtttttgaat actgtctcga ggacggatcc 1140
ctcatccgcg ctactaaaga tcataagttt atgaccgtgg acggccagat gctgccaatt 1200
gacgaaattt ttgaacgaga gctggatctg atgagagtcg acaaccttcc aaactgatta 1260
attaagaatt cgacccagct ttcttgtaca aagtggttgg taagcctatc cctaaccctc 1320
tcctcggtct cgattctacg tagtaatgag ctagcagtct cgaggttaac gaattccgcc 1380
ccccccctaa cgttactggc cgaagccgct tggaataagg ccggtgtgcg cttgtctata 1440
tgttattttc caccatattg ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg 1500
tcttcttgac gagcattcct aggggtcttt cccctctcgc caaaggaatg caaggtctgt 1560
tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg aagacaaaca acgtctgtag 1620
cgaccctttg caggcagcgg aaccccccac ctggcgacag gtgcccctgc ggccaaaagc 1680
cacgtgtata agatacacct gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga 1740
tagttgtgga aagagtcaaa tggctctcct caagcgtatt caacaagggg ctgaaggatg 1800
cccagaaggt accccattgt atgggatctg atctggggcc tcggtgcaca tgctttacat 1860
gtgtttagtc gaggttaaaa aaacgtctag gccccccgaa ccacggggac gtggttttcc 1920
tttgaaaaac acgataatac catggccatg agcgagctga ttaaggagaa catgcacatg 1980
aagctgtaca tggagggcac cgtggacaac catcacttca agtgcacatc cgagggcgaa 2040
ggcaagccct acgagggcac ccagaccatg agaatcaagg tggtcgaggg cggccctctc 2100
cccttcgcct tcgacatcct ggctactagc ttcctctacg gcagcaagac cttcatcaac 2160
cacacccagg gcatccccga cttcttcaag cagtccttcc ctgagggctt cacatgggag 2220
agagtcacca catacgaaga cgggggcgtg ctgaccgcta cccaggacac cagcctccag 2280
gacggctgcc tcatctacaa cgtcaagatc agaggggtga acttcacatc caacggccct 2340
gtgatgcaga agaaaacact cggctgggag gccttcaccg agacgctgta ccccgctgac 2400
ggcggcctgg aaggcagaaa cgacatggcc ctgaagctcg tgggcgggag ccatctgatc 2460
gcaaacatca agaccacata tagatccaag aaacccgcta agaacctcaa gatgcctggc 2520
gtctactatg tggactacag actggaaaga atcaaggagg ccaacaacga gacctacgtc 2580
gagcagcacg aggtggcagt ggccagatac tgcgacctcc ctagcaaact ggggcacaag 2640
cttaattaac accggtggcg cgttaagtcg acaatcaacc tctggattac aaaatttgtg 2700
aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt 2760
taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata 2820
aatcctggtt gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg 2880
tgtgcactgt gtttgctgac gcaaccccca ctggttgggg cattgccacc acctgtcagc 2940
tcctttccgg gactttcgct ttccccctcc ctattgccac ggcggaactc atcgccgcct 3000
gccttgcccg ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgttgt 3060
cggggaaatc atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg 3120
ggacgtcctt ctgctacgtc ccttcggccc tcaatccagc ggaccttcct tcccgcggcc 3180
tgctgccggc tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg agtcggatct 3240
ccctttgggc cgcctccccg cgtcgacttt aagaccaatg acttacaagg cagctgtaga 3300
tcttagccac tttttaaaag aaaagggggg actggaaggg ctaattcact cccaacgaag 3360
acaagatctg ctttttgctt gtactgggtc tctctggtta gaccagatct gagcctggga 3420
gctctctggc taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct 3480
tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt 3540
ttagtcagtg tggaaaatct ctagcagtac gtatagtagt tcatgtcatc ttattattca 3600
gtatttataa cttgcaaaga aatgaatatc agagagtgag aggaacttgt ttattgcagc 3660
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 3720
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggctcta 3780
gctatcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct 3840
ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc ctcggcctct 3900
gagctattcc agaagtagtg aggaggcttt tttggaggcc tagggacgta cccaattcgc 3960
cctatagtga gtcgtattac gcgcgctcac tggccgtcgt tttacaacgt cgtgactggg 4020
aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc 4080
gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg 4140
aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 4200
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 4260
tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc 4320
gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 4380
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 4440
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 4500
atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 4560
aatttaacgc gaattttaac aaaatattaa cgcttacaat ttaggtggca cttttcgggg 4620
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct 4680
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat 4740
tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc 4800
tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg 4860
ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg 4920
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga 4980
cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta 5040
ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc 5100
tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc 5160
gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg 5220
ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc 5280
aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca 5340
acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct 5400
tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat 5460
cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg 5520
gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat 5580
taagcattgg taactgtcag accaagttta ctcatatata ctttagattg atttaaaact 5640
tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat 5700
cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc 5760
ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct 5820
accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg 5880
cttcagcaga gcgcagatac caaatactgt tcttctagtg tagccgtagt taggccacca 5940
cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc 6000
tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga 6060
taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac 6120
gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga 6180
agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag 6240
ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg 6300
acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag 6360
caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc 6420
tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc 6480
tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc 6540
aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag 6600
gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca 6660
ttaggcaccc caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag 6720
cggataacaa tttcacacag gaaacagcta tgaccatgat tacgccaagc gcgcaattaa 6780
ccctcactaa agggaacaaa agctggagct gcaagcttaa tgtagtctta tgcaatactc 6840
ttgtagtctt gcaacatggt aacgatgagt tagcaacatg ccttacaagg agagaaaaag 6900
caccgtgcat gccgattggt ggaagtaagg tggtacgatc gtgccttatt aggaaggcaa 6960
cagacgggtc tgacatggat tggacgaacc actgaattgc cgcattgcag agatattgta 7020
tttaagtgcc tagctcgata cataaacggg tctctctggt tagaccagat ctgagcctgg 7080
gagctctctg gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg 7140
cttcaagtag tgtgtgcccg tctgttgtgt gactctggta actagagatc cctcagaccc 7200
ttttagtcag tgtggaaaat ctctagcagt ggcgcccgaa cagggacttg aaagcgaaag 7260
ggaaaccaga ggagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 7320
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 7380
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 7440
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 7500
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 7560
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 7620
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 7680
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 7740
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 7800
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 7860
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 7920
gcagcaggaa gcactatggg cgcagcgtca atgacgctga cggtacaggc cagacaatta 7980
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 8040
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 8100
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 8160
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 8220
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 8280
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 8340
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 8400
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 8460
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 8520
aggggaccct tgcgcctttt ccaaggcagc cctgggtttg cgcagggacg cggctgctct 8580
gggcgtggtt ccgggaaacg cagcggcgcc gaccctgggt ctcgcacatt cttcacgtcc 8640
gttcgcagcg tcacccggat cttcgccgct acccttgtgg gccccccggc gacgcttcct 8700
gctccgcccc taagtcggga aggttccttg cggttcgcgg cgtgccggac gtgacaaacg 8760
gaagccgcac gtctcactag taccctcgca gacggacagc gccagggagc aatggcagcg 8820
cgccgaccgc gatgggctgt ggccaatagc ggctgctcag cagggcgcgc cgagagcagc 8880
ggccgggaag gggcggtgcg ggaggcgggg tgtggggcgg tagtgtgggc cctgttcctg 8940
cccgcgcggt gttccgcatt ctgcaagcct ccggagcgca cgtcggcagt cggctccctc 9000
gttgaccgaa tcaccgacct ctctccccag ggggtaccca gctgtctaga gaattctaga 9060
tcttgagaca aatggcagta ttcatccaca attttaaaag aaaagggggg attggggggt 9120
acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact aaagaattac 9180
aaaaacaaat tacaaaaatt caaaattttc gggtttatta cagggacagc agagatccac 9240
tttggcgccg gctcgagggg g 9261
<210> 41
<211> 204
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 41
Met Lys Thr Phe Asn Ile Ser Gln Gln Asp Leu Glu Leu Val Glu Val
1 5 10 15
Ala Thr Glu Lys Ile Thr Met Leu Tyr Glu Asp Asn Lys His His Val
20 25 30
Gly Ala Ala Ile Arg Thr Lys Thr Gly Glu Ile Ile Ser Ala Val His
35 40 45
Ile Glu Ala Tyr Ile Gly Arg Val Thr Val Cys Ala Glu Ala Ile Ala
50 55 60
Ile Gly Ser Ala Val Ser Asn Gly Gln Lys Asp Phe Asp Thr Ile Val
65 70 75 80
Ala Val Arg His Pro Tyr Ser Asp Glu Val Asp Arg Ser Ile Arg Val
85 90 95
Val Ser Pro Cys Gly Met Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr
100 105 110
Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile
115 120 125
Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln
130 135 140
Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr
145 150 155 160
Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe
165 170 175
Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg
180 185 190
Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
195 200
<210> 42
<211> 8873
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 42
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgtagggagt tgatttcaga ctatgcacca 780
gattgttttg tgttaataga aatgaatggc aagttagtca aaactacgat tgaagaactc 840
attccactca aatatacccg aaattaatta ttaagaattc gacccagctt tcttgtacaa 900
agtggttggt aagcctatcc ctaaccctct cctcggtctc gattctacgt agtaatgagc 960
tagcagtctc gaggttaacg aattccgccc cccccctaac gttactggcc gaagccgctt 1020
ggaataaggc cggtgtgcgc ttgtctatat gttattttcc accatattgc cgtcttttgg 1080
caatgtgagg gcccggaaac ctggccctgt cttcttgacg agcattccta ggggtctttc 1140
ccctctcgcc aaaggaatgc aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga 1200
agcttcttga agacaaacaa cgtctgtagc gaccctttgc aggcagcgga accccccacc 1260
tggcgacagg tgcccctgcg gccaaaagcc acgtgtataa gatacacctg caaaggcggc 1320
acaaccccag tgccacgttg tgagttggat agttgtggaa agagtcaaat ggctctcctc 1380
aagcgtattc aacaaggggc tgaaggatgc ccagaaggta ccccattgta tgggatctga 1440
tctggggcct cggtgcacat gctttacatg tgtttagtcg aggttaaaaa aacgtctagg 1500
ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc atggtgagca 1560
agggcgagga ggataacatg gccatcatca aggagttcat gcgcttcaag gtgcacatgg 1620
agggctccgt gaacggccac gagttcgaga tcgagggcga gggcgagggc cgcccctacg 1680
agggcaccca gaccgccaag ctgaaggtga ccaagggtgg ccccctgccc ttcgcctggg 1740
acatcctgtc ccctcagttc atgtacggct ccaaggccta cgtgaagcac cccgccgaca 1800
tccccgacta cttgaagctg tccttccccg agggcttcaa gtgggagcgc gtgatgaact 1860
tcgaggacgg cggcgtggtg accgtgaccc aggactcctc cctgcaggac ggcgagttca 1920
tctacaaggt gaagctgcgc ggcaccaact tcccctccga cggccccgta atgcagaaga 1980
agaccatggg ctgggaggcc tcctccgagc ggatgtaccc cgaggacggc gccctgaagg 2040
gcgagatcaa gcagaggctg aagctgaagg acggcggcca ctacgacgct gaggtcaaga 2100
ccacctacaa ggccaagaag cccgtgcagc tgcccggcgc ctacaacgtc aacatcaagt 2160
tggacatcac ctcccacaac gaggactaca ccatcgtgga acagtacgaa cgcgccgagg 2220
gccgccactc caccggcggc atggacgagc tgtacaagta acaccggtgg cgcgttaagt 2280
cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt 2340
tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc 2400
ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc tttatgagga 2460
gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc 2520
cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg ctttccccct 2580
ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga caggggctcg 2640
gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct 2700
gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg tcccttcggc 2760
cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg 2820
tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc cgcgtcgact 2880
ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa agaaaagggg 2940
ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc ttgtactggg 3000
tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg 3060
cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt 3120
gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcagt 3180
acgtatagta gttcatgtca tcttattatt cagtatttat aacttgcaaa gaaatgaata 3240
tcagagagtg agaggaactt gtttattgca gcttataatg gttacaaata aagcaatagc 3300
atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa 3360
ctcatcaatg tatcttatca tgtctggctc tagctatccc gcccctaact ccgcccatcc 3420
cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta atttttttta 3480
tttatgcaga ggccgaggcc gcctcggcct ctgagctatt ccagaagtag tgaggaggct 3540
tttttggagg cctagggacg tacccaattc gccctatagt gagtcgtatt acgcgcgctc 3600
actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg 3660
ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg 3720
cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta gcggcgcatt 3780
aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 3840
gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 3900
agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 3960
caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 4020
tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 4080
aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc cgatttcggc 4140
ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt 4200
aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta 4260
tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt 4320
caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc 4380
ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa 4440
gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt 4500
aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt 4560
ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc 4620
atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg 4680
gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg 4740
gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt tttgcacaac 4800
atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca 4860
aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta 4920
actggcgaac tacttactct agcttcccgg caacaattaa tagactggat ggaggcggat 4980
aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat tgctgataaa 5040
tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc agatggtaag 5100
ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga tgaacgaaat 5160
agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt 5220
tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg 5280
aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga 5340
gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta 5400
atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 5460
gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact 5520
gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca 5580
tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt 5640
accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg 5700
ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag 5760
cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta 5820
agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat 5880
ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg 5940
tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc 6000
ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac 6060
cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc 6120
gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt 6180
tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc gggcagtgag 6240
cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt acactttatg 6300
cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac aggaaacagc 6360
tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca aaagctggag 6420
ctgcaagctt aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga 6480
gttagcaaca tgccttacaa ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa 6540
ggtggtacga tcgtgcctta ttaggaaggc aacagacggg tctgacatgg attggacgaa 6600
ccactgaatt gccgcattgc agagatattg tatttaagtg cctagctcga tacataaacg 6660
ggtctctctg gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac 6720
tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 6780
gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca 6840
gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag 6900
gactcggctt gctgaagcgc gcacggcaag aggcgagggg cggcgactgg tgagtacgcc 6960
aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 7020
gcgggggaga attagatcgc gatgggaaaa aattcggtta aggccagggg gaaagaaaaa 7080
atataaatta aaacatatag tatgggcaag cagggagcta gaacgattcg cagttaatcc 7140
tggcctgtta gaaacatcag aaggctgtag acaaatactg ggacagctac aaccatccct 7200
tcagacagga tcagaagaac ttagatcatt atataataca gtagcaaccc tctattgtgt 7260
gcatcaaagg atagagataa aagacaccaa ggaagcttta gacaagatag aggaagagca 7320
aaacaaaagt aagaccaccg cacagcaagc ggccgctgat cttcagacct ggaggaggag 7380
atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa attgaaccat 7440
taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg 7500
gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg ggcgcagcgt 7560
caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag cagcagaaca 7620
atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc tggggcatca 7680
agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa cagctcctgg 7740
ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg aatgctagtt 7800
ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag tgggacagag 7860
aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa aaccagcaag 7920
aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg aattggttta 7980
acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga ggcttggtag 8040
gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag ggatattcac 8100
cattatcgtt tcagacccac ctcccaaccc cgaggggacc cttgcgcctt ttccaaggca 8160
gccctgggtt tgcgcaggga cgcggctgct ctgggcgtgg ttccgggaaa cgcagcggcg 8220
ccgaccctgg gtctcgcaca ttcttcacgt ccgttcgcag cgtcacccgg atcttcgccg 8280
ctacccttgt gggccccccg gcgacgcttc ctgctccgcc cctaagtcgg gaaggttcct 8340
tgcggttcgc ggcgtgccgg acgtgacaaa cggaagccgc acgtctcact agtaccctcg 8400
cagacggaca gcgccaggga gcaatggcag cgcgccgacc gcgatgggct gtggccaata 8460
gcggctgctc agcagggcgc gccgagagca gcggccggga aggggcggtg cgggaggcgg 8520
ggtgtggggc ggtagtgtgg gccctgttcc tgcccgcgcg gtgttccgca ttctgcaagc 8580
ctccggagcg cacgtcggca gtcggctccc tcgttgaccg aatcaccgac ctctctcccc 8640
agggggtacc cagctgtcta gagaattcta gatcttgaga caaatggcag tattcatcca 8700
caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa tagtagacat 8760
aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa attacaaaaa ttcaaaattt 8820
tcgggtttat tacagggaca gcagagatcc actttggcgc cggctcgagg ggg 8873
<210> 43
<211> 74
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 43
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Arg Glu Leu Ile Ser Asp Tyr Ala Pro Asp Cys
35 40 45
Phe Val Leu Ile Glu Met Asn Gly Lys Leu Val Lys Thr Thr Ile Glu
50 55 60
Glu Leu Ile Pro Leu Lys Tyr Thr Arg Asn
65 70
<210> 44
<211> 9309
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 44
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gggggcggtg 900
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt cccggctggc cgcgcagcaa 960
cagatggaag gcctcctggc gccgcaccgg cccaagtgcc tttcatacga gaccgagatc 1020
ctgactgtcg agtacggatt gcttcctatc ggcaaaatcg tggagaagag gattgaatgt 1080
accgtctatt cagtcgataa taatgggaac atctacacac agcccgtggc tcaatggcac 1140
gacagaggag agcaggaagt ttttgaatac tgtctcgagg acggatccct catccgcgct 1200
actaaagatc ataagtttat gaccgtggac ggccagatgc tgccaattga cgaaattttt 1260
gaacgagagc tggatctgat gagagtcgac aaccttccaa actgattaat taagaattcg 1320
acccagcttt cttgtacaaa gtggttggta agcctatccc taaccctctc ctcggtctcg 1380
attctacgta gtaatgagct agcagtctcg aggttaacga attccgcccc ccccctaacg 1440
ttactggccg aagccgcttg gaataaggcc ggtgtgcgct tgtctatatg ttattttcca 1500
ccatattgcc gtcttttggc aatgtgaggg cccggaaacc tggccctgtc ttcttgacga 1560
gcattcctag gggtctttcc cctctcgcca aaggaatgca aggtctgttg aatgtcgtga 1620
aggaagcagt tcctctggaa gcttcttgaa gacaaacaac gtctgtagcg accctttgca 1680
ggcagcggaa ccccccacct ggcgacaggt gcccctgcgg ccaaaagcca cgtgtataag 1740
atacacctgc aaaggcggca caaccccagt gccacgttgt gagttggata gttgtggaaa 1800
gagtcaaatg gctctcctca agcgtattca acaaggggct gaaggatgcc cagaaggtac 1860
cccattgtat gggatctgat ctggggcctc ggtgcacatg ctttacatgt gtttagtcga 1920
ggttaaaaaa acgtctaggc cccccgaacc acggggacgt ggttttcctt tgaaaaacac 1980
gataatacca tggccatgag cgagctgatt aaggagaaca tgcacatgaa gctgtacatg 2040
gagggcaccg tggacaacca tcacttcaag tgcacatccg agggcgaagg caagccctac 2100
gagggcaccc agaccatgag aatcaaggtg gtcgagggcg gccctctccc cttcgccttc 2160
gacatcctgg ctactagctt cctctacggc agcaagacct tcatcaacca cacccagggc 2220
atccccgact tcttcaagca gtccttccct gagggcttca catgggagag agtcaccaca 2280
tacgaagacg ggggcgtgct gaccgctacc caggacacca gcctccagga cggctgcctc 2340
atctacaacg tcaagatcag aggggtgaac ttcacatcca acggccctgt gatgcagaag 2400
aaaacactcg gctgggaggc cttcaccgag acgctgtacc ccgctgacgg cggcctggaa 2460
ggcagaaacg acatggccct gaagctcgtg ggcgggagcc atctgatcgc aaacatcaag 2520
accacatata gatccaagaa acccgctaag aacctcaaga tgcctggcgt ctactatgtg 2580
gactacagac tggaaagaat caaggaggcc aacaacgaga cctacgtcga gcagcacgag 2640
gtggcagtgg ccagatactg cgacctccct agcaaactgg ggcacaagct taattaacac 2700
cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 2760
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 2820
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 2880
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 2940
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 3000
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 3060
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 3120
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 3180
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 3240
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 3300
cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt 3360
tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac aagatctgct 3420
ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta 3480
actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg 3540
tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg 3600
gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt atttataact 3660
tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt ataatggtta 3720
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 3780
ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc tatcccgccc 3840
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 3900
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 3960
aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc tatagtgagt 4020
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 4080
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 4140
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc 4200
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 4260
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 4320
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 4380
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 4440
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 4500
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 4560
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 4620
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa atgtgcgcgg 4680
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 4740
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 4800
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 4860
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 4920
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 4980
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 5040
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 5100
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 5160
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 5220
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 5280
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac 5340
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga 5400
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg 5460
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact 5520
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 5580
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta 5640
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt 5700
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga 5760
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc 5820
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 5880
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc 5940
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc 6000
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg 6060
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg 6120
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 6180
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc 6240
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg 6300
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg 6360
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt 6420
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 6480
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg 6540
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc 6600
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 6660
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 6720
ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 6780
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc ctcactaaag 6840
ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt gtagtcttgc 6900
aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc 6960
cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg 7020
acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt taagtgccta 7080
gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga gctctctggc 7140
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 7200
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 7260
tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg aaaccagagg 7320
agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 7380
gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 7440
agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt cggttaaggc 7500
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 7560
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 7620
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 7680
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 7740
agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc gctgatcttc 7800
agacctggag gaggagatat gagggacaat tggagaagtg aattatataa atataaagta 7860
gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga 7920
gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc 7980
actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt gtctggtata 8040
gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc 8100
acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag atacctaaag 8160
gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg 8220
ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca cacgacctgg 8280
atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa 8340
tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt 8400
ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata 8460
gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt 8520
aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag gggacccttg 8580
cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg gcgtggttcc 8640
gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt tcgcagcgtc 8700
acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc tccgccccta 8760
agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga agccgcacgt 8820
ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg ccgaccgcga 8880
tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg ccgggaaggg 8940
gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc cgcgcggtgt 9000
tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt tgaccgaatc 9060
accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc ttgagacaaa 9120
tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac agtgcagggg 9180
aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa aaacaaatta 9240
caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt tggcgccggc 9300
tcgaggggg 9309
<210> 45
<211> 221
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 45
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu
35 40 45
Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val
50 55 60
Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu
65 70 75 80
Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met Ala
85 90 95
Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly Leu
100 105 110
Leu Ala Pro His Arg Pro Lys Cys Leu Ser Tyr Glu Thr Glu Ile Leu
115 120 125
Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg
130 135 140
Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr
145 150 155 160
Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu
165 170 175
Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys
180 185 190
Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu
195 200 205
Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
210 215 220
<210> 46
<211> 9003
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 46
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgagcccg cgtggttcct ggccaccgtc 780
ggcgtctcgc ccgaccacca gggcaagggt ctgggcagcg ccgtcgtgct ccccggagtg 840
gaggcggccg agcgcgccgg ggtgcccgcc ttcctggaga cctccgcgcc ccgcaacctc 900
cccttctacg agcggctcgg cttcaccgtc accgccgacg tcgaggtgcc cgaaggaccg 960
cgcacctggt gcatgacccg caagcccggt gcctgattaa ttaagaattc gacccagctt 1020
tcttgtacaa agtggttggt aagcctatcc ctaaccctct cctcggtctc gattctacgt 1080
agtaatgagc tagcagtctc gaggttaacg aattccgccc cccccctaac gttactggcc 1140
gaagccgctt ggaataaggc cggtgtgcgc ttgtctatat gttattttcc accatattgc 1200
cgtcttttgg caatgtgagg gcccggaaac ctggccctgt cttcttgacg agcattccta 1260
ggggtctttc ccctctcgcc aaaggaatgc aaggtctgtt gaatgtcgtg aaggaagcag 1320
ttcctctgga agcttcttga agacaaacaa cgtctgtagc gaccctttgc aggcagcgga 1380
accccccacc tggcgacagg tgcccctgcg gccaaaagcc acgtgtataa gatacacctg 1440
caaaggcggc acaaccccag tgccacgttg tgagttggat agttgtggaa agagtcaaat 1500
ggctctcctc aagcgtattc aacaaggggc tgaaggatgc ccagaaggta ccccattgta 1560
tgggatctga tctggggcct cggtgcacat gctttacatg tgtttagtcg aggttaaaaa 1620
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1680
atggtgagca agggcgagga ggataacatg gccatcatca aggagttcat gcgcttcaag 1740
gtgcacatgg agggctccgt gaacggccac gagttcgaga tcgagggcga gggcgagggc 1800
cgcccctacg agggcaccca gaccgccaag ctgaaggtga ccaagggtgg ccccctgccc 1860
ttcgcctggg acatcctgtc ccctcagttc atgtacggct ccaaggccta cgtgaagcac 1920
cccgccgaca tccccgacta cttgaagctg tccttccccg agggcttcaa gtgggagcgc 1980
gtgatgaact tcgaggacgg cggcgtggtg accgtgaccc aggactcctc cctgcaggac 2040
ggcgagttca tctacaaggt gaagctgcgc ggcaccaact tcccctccga cggccccgta 2100
atgcagaaga agaccatggg ctgggaggcc tcctccgagc ggatgtaccc cgaggacggc 2160
gccctgaagg gcgagatcaa gcagaggctg aagctgaagg acggcggcca ctacgacgct 2220
gaggtcaaga ccacctacaa ggccaagaag cccgtgcagc tgcccggcgc ctacaacgtc 2280
aacatcaagt tggacatcac ctcccacaac gaggactaca ccatcgtgga acagtacgaa 2340
cgcgccgagg gccgccactc caccggcggc atggacgagc tgtacaagta acaccggtgg 2400
cgcgttaagt cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc 2460
ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg 2520
ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc 2580
tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg 2640
acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg 2700
ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga 2760
caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct 2820
ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg 2880
tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc 2940
ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc 3000
cgcgtcgact ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa 3060
agaaaagggg ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc 3120
ttgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg 3180
gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg 3240
tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat 3300
ctctagcagt acgtatagta gttcatgtca tcttattatt cagtatttat aacttgcaaa 3360
gaaatgaata tcagagagtg agaggaactt gtttattgca gcttataatg gttacaaata 3420
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 3480
tttgtccaaa ctcatcaatg tatcttatca tgtctggctc tagctatccc gcccctaact 3540
ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta 3600
atttttttta tttatgcaga ggccgaggcc gcctcggcct ctgagctatt ccagaagtag 3660
tgaggaggct tttttggagg cctagggacg tacccaattc gccctatagt gagtcgtatt 3720
acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 3780
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 3840
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta 3900
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 3960
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 4020
ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc 4080
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 4140
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4200
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc 4260
cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta 4320
acaaaatatt aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc 4380
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 4440
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 4500
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 4560
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 4620
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 4680
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 4740
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 4800
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 4860
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 4920
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 4980
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 5040
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 5100
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 5160
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 5220
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 5280
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 5340
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 5400
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 5460
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 5520
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 5580
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 5640
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 5700
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 5760
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 5820
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 5880
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 5940
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 6000
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 6060
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 6120
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 6180
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 6240
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 6300
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 6360
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 6420
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 6480
aggaaacagc tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca 6540
aaagctggag ctgcaagctt aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg 6600
gtaacgatga gttagcaaca tgccttacaa ggagagaaaa agcaccgtgc atgccgattg 6660
gtggaagtaa ggtggtacga tcgtgcctta ttaggaaggc aacagacggg tctgacatgg 6720
attggacgaa ccactgaatt gccgcattgc agagatattg tatttaagtg cctagctcga 6780
tacataaacg ggtctctctg gttagaccag atctgagcct gggagctctc tggctaacta 6840
gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc 6900
cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa 6960
atctctagca gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct 7020
ctcgacgcag gactcggctt gctgaagcgc gcacggcaag aggcgagggg cggcgactgg 7080
tgagtacgcc aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg 7140
tcagtattaa gcgggggaga attagatcgc gatgggaaaa aattcggtta aggccagggg 7200
gaaagaaaaa atataaatta aaacatatag tatgggcaag cagggagcta gaacgattcg 7260
cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg ggacagctac 7320
aaccatccct tcagacagga tcagaagaac ttagatcatt atataataca gtagcaaccc 7380
tctattgtgt gcatcaaagg atagagataa aagacaccaa ggaagcttta gacaagatag 7440
aggaagagca aaacaaaagt aagaccaccg cacagcaagc ggccgctgat cttcagacct 7500
ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa 7560
attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa 7620
agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg 7680
ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag 7740
cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc 7800
tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa 7860
cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg 7920
aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag 7980
tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa 8040
aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg 8100
aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga 8160
ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag 8220
ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc cttgcgcctt 8280
ttccaaggca gccctgggtt tgcgcaggga cgcggctgct ctgggcgtgg ttccgggaaa 8340
cgcagcggcg ccgaccctgg gtctcgcaca ttcttcacgt ccgttcgcag cgtcacccgg 8400
atcttcgccg ctacccttgt gggccccccg gcgacgcttc ctgctccgcc cctaagtcgg 8460
gaaggttcct tgcggttcgc ggcgtgccgg acgtgacaaa cggaagccgc acgtctcact 8520
agtaccctcg cagacggaca gcgccaggga gcaatggcag cgcgccgacc gcgatgggct 8580
gtggccaata gcggctgctc agcagggcgc gccgagagca gcggccggga aggggcggtg 8640
cgggaggcgg ggtgtggggc ggtagtgtgg gccctgttcc tgcccgcgcg gtgttccgca 8700
ttctgcaagc ctccggagcg cacgtcggca gtcggctccc tcgttgaccg aatcaccgac 8760
ctctctcccc agggggtacc cagctgtcta gagaattcta gatcttgaga caaatggcag 8820
tattcatcca caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa 8880
tagtagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa attacaaaaa 8940
ttcaaaattt tcgggtttat tacagggaca gcagagatcc actttggcgc cggctcgagg 9000
ggg 9003
<210> 47
<211> 117
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 47
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Glu Pro Ala Trp Phe Leu Ala Thr Val Gly Val
35 40 45
Ser Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val Val Leu Pro
50 55 60
Gly Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe Leu Glu Thr
65 70 75 80
Ser Ala Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly Phe Thr Val
85 90 95
Thr Ala Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp Cys Met Thr
100 105 110
Arg Lys Pro Gly Ala
115
<210> 48
<211> 9264
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 48
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gggggcggtg 900
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt gtatcagtgg cgactccctg 960
atctcactcg caagcactgg aaagcgagtt agcatcaagg acttgctgga cgaaaaggat 1020
ttcgaaattt gggcaatcaa tgagcagacc atgaaactgg agtctgcaaa ggtgtcccgg 1080
gtgttttgca cgggtaagaa gcttgtttat atccttaaaa ctagactggg ccggacgatc 1140
aaagccaccg cgaaccacag attcttgaca atcgacgggt ggaaacggct ggacgaactg 1200
agcttgaagg agcacatcgc ccttcctcgg aagctcgagt catcttccct gcagctgtga 1260
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 1320
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1380
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1440
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1500
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1560
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1620
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 1680
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 1740
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 1800
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 1860
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 1920
tcctttgaaa aacacgataa taccatggcc atgagcgagc tgattaagga gaacatgcac 1980
atgaagctgt acatggaggg caccgtggac aaccatcact tcaagtgcac atccgagggc 2040
gaaggcaagc cctacgaggg cacccagacc atgagaatca aggtggtcga gggcggccct 2100
ctccccttcg ccttcgacat cctggctact agcttcctct acggcagcaa gaccttcatc 2160
aaccacaccc agggcatccc cgacttcttc aagcagtcct tccctgaggg cttcacatgg 2220
gagagagtca ccacatacga agacgggggc gtgctgaccg ctacccagga caccagcctc 2280
caggacggct gcctcatcta caacgtcaag atcagagggg tgaacttcac atccaacggc 2340
cctgtgatgc agaagaaaac actcggctgg gaggccttca ccgagacgct gtaccccgct 2400
gacggcggcc tggaaggcag aaacgacatg gccctgaagc tcgtgggcgg gagccatctg 2460
atcgcaaaca tcaagaccac atatagatcc aagaaacccg ctaagaacct caagatgcct 2520
ggcgtctact atgtggacta cagactggaa agaatcaagg aggccaacaa cgagacctac 2580
gtcgagcagc acgaggtggc agtggccaga tactgcgacc tccctagcaa actggggcac 2640
aagcttaatt aacaccggtg gcgcgttaag tcgacaatca acctctggat tacaaaattt 2700
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 2760
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 2820
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 2880
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 2940
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 3000
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 3060
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 3120
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 3180
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 3240
tctccctttg ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt 3300
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 3360
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 3420
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 3480
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 3540
cttttagtca gtgtggaaaa tctctagcag tacgtatagt agttcatgtc atcttattat 3600
tcagtattta taacttgcaa agaaatgaat atcagagagt gagaggaact tgtttattgc 3660
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 3720
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggct 3780
ctagctatcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat 3840
tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc 3900
tctgagctat tccagaagta gtgaggaggc ttttttggag gcctagggac gtacccaatt 3960
cgccctatag tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 4020
gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 4080
ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg 4140
gcgaatggga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4200
gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct 4260
ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt 4320
tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac 4380
gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct 4440
ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4500
ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac 4560
aaaaatttaa cgcgaatttt aacaaaatat taacgcttac aatttaggtg gcacttttcg 4620
gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc 4680
gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag 4740
tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt 4800
tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 4860
gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga 4920
acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat 4980
tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga 5040
gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag 5100
tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg 5160
accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg 5220
ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5280
agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg 5340
gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc 5400
ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg 5460
tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac 5520
ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact 5580
gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa 5640
acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa 5700
aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg 5760
atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc 5820
gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac 5880
tggcttcagc agagcgcaga taccaaatac tgttcttcta gtgtagccgt agttaggcca 5940
ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt 6000
ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc 6060
ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 6120
aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6180
cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6240
gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct 6300
ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 6360
cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt 6420
tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac 6480
cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6540
cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga 6600
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac 6660
tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt 6720
gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca agcgcgcaat 6780
taaccctcac taaagggaac aaaagctgga gctgcaagct taatgtagtc ttatgcaata 6840
ctcttgtagt cttgcaacat ggtaacgatg agttagcaac atgccttaca aggagagaaa 6900
aagcaccgtg catgccgatt ggtggaagta aggtggtacg atcgtgcctt attaggaagg 6960
caacagacgg gtctgacatg gattggacga accactgaat tgccgcattg cagagatatt 7020
gtatttaagt gcctagctcg atacataaac gggtctctct ggttagacca gatctgagcc 7080
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 7140
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 7200
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 7260
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 7320
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 7380
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 7440
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 7500
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 7560
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 7620
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 7680
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 7740
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 7800
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 7860
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 7920
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 7980
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 8040
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 8100
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 8160
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 8220
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 8280
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 8340
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 8400
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 8460
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 8520
ccgaggggac ccttgcgcct tttccaaggc agccctgggt ttgcgcaggg acgcggctgc 8580
tctgggcgtg gttccgggaa acgcagcggc gccgaccctg ggtctcgcac attcttcacg 8640
tccgttcgca gcgtcacccg gatcttcgcc gctacccttg tgggcccccc ggcgacgctt 8700
cctgctccgc ccctaagtcg ggaaggttcc ttgcggttcg cggcgtgccg gacgtgacaa 8760
acggaagccg cacgtctcac tagtaccctc gcagacggac agcgccaggg agcaatggca 8820
gcgcgccgac cgcgatgggc tgtggccaat agcggctgct cagcagggcg cgccgagagc 8880
agcggccggg aaggggcggt gcgggaggcg gggtgtgggg cggtagtgtg ggccctgttc 8940
ctgcccgcgc ggtgttccgc attctgcaag cctccggagc gcacgtcggc agtcggctcc 9000
ctcgttgacc gaatcaccga cctctctccc cagggggtac ccagctgtct agagaattct 9060
agatcttgag acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg 9120
ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat 9180
tacaaaaaca aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc 9240
cactttggcg ccggctcgag gggg 9264
<210> 49
<211> 206
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 49
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu
35 40 45
Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val
50 55 60
Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu
65 70 75 80
Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met Ala
85 90 95
Glu Leu Ser Gly Cys Ile Ser Gly Asp Ser Leu Ile Ser Leu Ala Ser
100 105 110
Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp Glu Lys Asp Phe
115 120 125
Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu Glu Ser Ala Lys
130 135 140
Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val Tyr Ile Leu Lys
145 150 155 160
Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn His Arg Phe Leu
165 170 175
Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys Glu His
180 185 190
Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu
195 200 205
<210> 50
<211> 9096
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 50
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagtcc cgaaatcgaa 660
aagctctctc agagcgatat atattgggac tccatcgtaa gcataacaga gacgggggtc 720
gaggaggtgt tcgatctgac agttcctggg cctcataatt tcgtagcgaa cgacatcatt 780
gtacataact cccggctggc cgcgcagcaa cagatggaag gcctcctggc gccgcaccgg 840
cccaaggagc ccgcgtggtt cctggccacc gtcggcgtct cgcccgacca ccagggcaag 900
ggtctgggca gcgccgtcgt gctccccgga gtggaggcgg ccgagcgcgc cggggtgccc 960
gccttcctgg agacctccgc gccccgcaac ctccccttct acgagcggct cggcttcacc 1020
gtcaccgccg acgtcgaggt gcccgaagga ccgcgcacct ggtgcatgac ccgcaagccc 1080
ggtgcctgat taattaagaa ttcgacccag ctttcttgta caaagtggtt ggtaagccta 1140
tccctaaccc tctcctcggt ctcgattcta cgtagtaatg agctagcagt ctcgaggtta 1200
acgaattccg ccccccccct aacgttactg gccgaagccg cttggaataa ggccggtgtg 1260
cgcttgtcta tatgttattt tccaccatat tgccgtcttt tggcaatgtg agggcccgga 1320
aacctggccc tgtcttcttg acgagcattc ctaggggtct ttcccctctc gccaaaggaa 1380
tgcaaggtct gttgaatgtc gtgaaggaag cagttcctct ggaagcttct tgaagacaaa 1440
caacgtctgt agcgaccctt tgcaggcagc ggaacccccc acctggcgac aggtgcccct 1500
gcggccaaaa gccacgtgta taagatacac ctgcaaaggc ggcacaaccc cagtgccacg 1560
ttgtgagttg gatagttgtg gaaagagtca aatggctctc ctcaagcgta ttcaacaagg 1620
ggctgaagga tgcccagaag gtaccccatt gtatgggatc tgatctgggg cctcggtgca 1680
catgctttac atgtgtttag tcgaggttaa aaaaacgtct aggccccccg aaccacgggg 1740
acgtggtttt cctttgaaaa acacgataat accatggtga gcaagggcga ggaggataac 1800
atggccatca tcaaggagtt catgcgcttc aaggtgcaca tggagggctc cgtgaacggc 1860
cacgagttcg agatcgaggg cgagggcgag ggccgcccct acgagggcac ccagaccgcc 1920
aagctgaagg tgaccaaggg tggccccctg cccttcgcct gggacatcct gtcccctcag 1980
ttcatgtacg gctccaaggc ctacgtgaag caccccgccg acatccccga ctacttgaag 2040
ctgtccttcc ccgagggctt caagtgggag cgcgtgatga acttcgagga cggcggcgtg 2100
gtgaccgtga cccaggactc ctccctgcag gacggcgagt tcatctacaa ggtgaagctg 2160
cgcggcacca acttcccctc cgacggcccc gtaatgcaga agaagaccat gggctgggag 2220
gcctcctccg agcggatgta ccccgaggac ggcgccctga agggcgagat caagcagagg 2280
ctgaagctga aggacggcgg ccactacgac gctgaggtca agaccaccta caaggccaag 2340
aagcccgtgc agctgcccgg cgcctacaac gtcaacatca agttggacat cacctcccac 2400
aacgaggact acaccatcgt ggaacagtac gaacgcgccg agggccgcca ctccaccggc 2460
ggcatggacg agctgtacaa gtaacaccgg tggcgcgtta agtcgacaat caacctctgg 2520
attacaaaat ttgtgaaaga ttgactggta ttcttaacta tgttgctcct tttacgctat 2580
gtggatacgc tgctttaatg cctttgtatc atgctattgc ttcccgtatg gctttcattt 2640
tctcctcctt gtataaatcc tggttgctgt ctctttatga ggagttgtgg cccgttgtca 2700
ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac ccccactggt tggggcattg 2760
ccaccacctg tcagctcctt tccgggactt tcgctttccc cctccctatt gccacggcgg 2820
aactcatcgc cgcctgcctt gcccgctgct ggacaggggc tcggctgttg ggcactgaca 2880
attccgtggt gttgtcgggg aaatcatcgt cctttccttg gctgctcgcc tgtgttgcca 2940
cctggattct gcgcgggacg tccttctgct acgtcccttc ggccctcaat ccagcggacc 3000
ttccttcccg cggcctgctg ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc 3060
agacgagtcg gatctccctt tgggccgcct ccccgcgtcg actttaagac caatgactta 3120
caaggcagct gtagatctta gccacttttt aaaagaaaag gggggactgg aagggctaat 3180
tcactcccaa cgaagacaag atctgctttt tgcttgtact gggtctctct ggttagacca 3240
gatctgagcc tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag 3300
cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag 3360
atccctcaga cccttttagt cagtgtggaa aatctctagc agtacgtata gtagttcatg 3420
tcatcttatt attcagtatt tataacttgc aaagaaatga atatcagaga gtgagaggaa 3480
cttgtttatt gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa 3540
taaagcattt ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta 3600
tcatgtctgg ctctagctat cccgccccta actccgccca tcccgcccct aactccgccc 3660
agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 3720
gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggg 3780
acgtacccaa ttcgccctat agtgagtcgt attacgcgcg ctcactggcc gtcgttttac 3840
aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 3900
ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 3960
gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 4020
tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt 4080
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 4140
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg 4200
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 4260
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 4320
cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg 4380
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaatttagg 4440
tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 4500
aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 4560
gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 4620
ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 4680
gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 4740
tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 4800
attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 4860
tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 4920
agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 4980
aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 5040
tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 5100
cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac 5160
tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact 5220
tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg 5280
tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt 5340
tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat 5400
aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta 5460
gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa 5520
tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 5580
aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 5640
aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 5700
tccgaaggta actggcttca gcagagcgca gataccaaat actgttcttc tagtgtagcc 5760
gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 5820
cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 5880
acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 5940
cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag 6000
cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 6060
aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 6120
gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 6180
atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 6240
tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga 6300
gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 6360
agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga ttcattaatg 6420
cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg caattaatgt 6480
gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt 6540
gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc 6600
caagcgcgca attaaccctc actaaaggga acaaaagctg gagctgcaag cttaatgtag 6660
tcttatgcaa tactcttgta gtcttgcaac atggtaacga tgagttagca acatgcctta 6720
caaggagaga aaaagcaccg tgcatgccga ttggtggaag taaggtggta cgatcgtgcc 6780
ttattaggaa ggcaacagac gggtctgaca tggattggac gaaccactga attgccgcat 6840
tgcagagata ttgtatttaa gtgcctagct cgatacataa acgggtctct ctggttagac 6900
cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa 6960
agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc tggtaactag 7020
agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtggcgc ccgaacaggg 7080
acttgaaagc gaaagggaaa ccagaggagc tctctcgacg caggactcgg cttgctgaag 7140
cgcgcacggc aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg 7200
gaggctagaa ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat 7260
cgcgatggga aaaaattcgg ttaaggccag ggggaaagaa aaaatataaa ttaaaacata 7320
tagtatgggc aagcagggag ctagaacgat tcgcagttaa tcctggcctg ttagaaacat 7380
cagaaggctg tagacaaata ctgggacagc tacaaccatc ccttcagaca ggatcagaag 7440
aacttagatc attatataat acagtagcaa ccctctattg tgtgcatcaa aggatagaga 7500
taaaagacac caaggaagct ttagacaaga tagaggaaga gcaaaacaaa agtaagacca 7560
ccgcacagca agcggccgct gatcttcaga cctggaggag gagatatgag ggacaattgg 7620
agaagtgaat tatataaata taaagtagta aaaattgaac cattaggagt agcacccacc 7680
aaggcaaaga gaagagtggt gcagagagaa aaaagagcag tgggaatagg agctttgttc 7740
cttgggttct tgggagcagc aggaagcact atgggcgcag cgtcaatgac gctgacggta 7800
caggccagac aattattgtc tggtatagtg cagcagcaga acaatttgct gagggctatt 7860
gaggcgcaac agcatctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga 7920
atcctggctg tggaaagata cctaaaggat caacagctcc tggggatttg gggttgctct 7980
ggaaaactca tttgcaccac tgctgtgcct tggaatgcta gttggagtaa taaatctctg 8040
gaacagattt ggaatcacac gacctggatg gagtgggaca gagaaattaa caattacaca 8100
agcttaatac actccttaat tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa 8160
ttattggaat tagataaatg ggcaagtttg tggaattggt ttaacataac aaattggctg 8220
tggtatataa aattattcat aatgatagta ggaggcttgg taggtttaag aatagttttt 8280
gctgtacttt ctatagtgaa tagagttagg cagggatatt caccattatc gtttcagacc 8340
cacctcccaa ccccgagggg acccttgcgc cttttccaag gcagccctgg gtttgcgcag 8400
ggacgcggct gctctgggcg tggttccggg aaacgcagcg gcgccgaccc tgggtctcgc 8460
acattcttca cgtccgttcg cagcgtcacc cggatcttcg ccgctaccct tgtgggcccc 8520
ccggcgacgc ttcctgctcc gcccctaagt cgggaaggtt ccttgcggtt cgcggcgtgc 8580
cggacgtgac aaacggaagc cgcacgtctc actagtaccc tcgcagacgg acagcgccag 8640
ggagcaatgg cagcgcgccg accgcgatgg gctgtggcca atagcggctg ctcagcaggg 8700
cgcgccgaga gcagcggccg ggaaggggcg gtgcgggagg cggggtgtgg ggcggtagtg 8760
tgggccctgt tcctgcccgc gcggtgttcc gcattctgca agcctccgga gcgcacgtcg 8820
gcagtcggct ccctcgttga ccgaatcacc gacctctctc cccagggggt acccagctgt 8880
ctagagaatt ctagatcttg agacaaatgg cagtattcat ccacaatttt aaaagaaaag 8940
gggggattgg ggggtacagt gcaggggaaa gaatagtaga cataatagca acagacatac 9000
aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt tattacaggg 9060
acagcagaga tccactttgg cgccggctcg aggggg 9096
<210> 51
<211> 148
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 51
Met Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp
1 5 10 15
Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu
20 25 30
Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His
35 40 45
Asn Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly Leu Leu Ala Pro
50 55 60
His Arg Pro Lys Glu Pro Ala Trp Phe Leu Ala Thr Val Gly Val Ser
65 70 75 80
Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val Val Leu Pro Gly
85 90 95
Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe Leu Glu Thr Ser
100 105 110
Ala Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly Phe Thr Val Thr
115 120 125
Ala Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp Cys Met Thr Arg
130 135 140
Lys Pro Gly Ala
145
<210> 52
<211> 9351
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 52
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgggatcggc cattgaacaa 660
gatggattgc acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg 720
gcacaacaga caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc 780
ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca 840
gcgcggctat cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc 900
actgaagcgg gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca 960
tctcaccttg ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat 1020
acgcttgatc cggctacctg cctttcatac gagaccgaga tcctgactgt cgagtacgga 1080
ttgcttccta tcggcaaaat cgtggagaag aggattgaat gtaccgtcta ttcagtcgat 1140
aataatggga acatctacac acagcccgtg gctcaatggc acgacagagg agagcaggaa 1200
gtttttgaat actgtctcga ggacggatcc ctcatccgcg ctactaaaga tcataagttt 1260
atgaccgtgg acggccagat gctgccaatt gacgaaattt ttgaacgaga gctggatctg 1320
atgagagtcg acaaccttcc aaactgatta attaagaatt cgacccagct ttcttgtaca 1380
aagtggttgg taagcctatc cctaaccctc tcctcggtct cgattctacg tagtaatgag 1440
ctagcagtct cgaggttaac gaattccgcc ccccccctaa cgttactggc cgaagccgct 1500
tggaataagg ccggtgtgcg cttgtctata tgttattttc caccatattg ccgtcttttg 1560
gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct aggggtcttt 1620
cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca gttcctctgg 1680
aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg aaccccccac 1740
ctggcgacag gtgcccctgc ggccaaaagc cacgtgtata agatacacct gcaaaggcgg 1800
cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa tggctctcct 1860
caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt atgggatctg 1920
atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa aaacgtctag 1980
gccccccgaa ccacggggac gtggttttcc tttgaaaaac acgataatac catggccatg 2040
agcgagctga ttaaggagaa catgcacatg aagctgtaca tggagggcac cgtggacaac 2100
catcacttca agtgcacatc cgagggcgaa ggcaagccct acgagggcac ccagaccatg 2160
agaatcaagg tggtcgaggg cggccctctc cccttcgcct tcgacatcct ggctactagc 2220
ttcctctacg gcagcaagac cttcatcaac cacacccagg gcatccccga cttcttcaag 2280
cagtccttcc ctgagggctt cacatgggag agagtcacca catacgaaga cgggggcgtg 2340
ctgaccgcta cccaggacac cagcctccag gacggctgcc tcatctacaa cgtcaagatc 2400
agaggggtga acttcacatc caacggccct gtgatgcaga agaaaacact cggctgggag 2460
gccttcaccg agacgctgta ccccgctgac ggcggcctgg aaggcagaaa cgacatggcc 2520
ctgaagctcg tgggcgggag ccatctgatc gcaaacatca agaccacata tagatccaag 2580
aaacccgcta agaacctcaa gatgcctggc gtctactatg tggactacag actggaaaga 2640
atcaaggagg ccaacaacga gacctacgtc gagcagcacg aggtggcagt ggccagatac 2700
tgcgacctcc ctagcaaact ggggcacaag cttaattaac accggtggcg cgttaagtcg 2760
acaatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 2820
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 2880
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 2940
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 3000
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 3060
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 3120
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 3180
tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 3240
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 3300
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg cgtcgacttt 3360
aagaccaatg acttacaagg cagctgtaga tcttagccac tttttaaaag aaaagggggg 3420
actggaaggg ctaattcact cccaacgaag acaagatctg ctttttgctt gtactgggtc 3480
tctctggtta gaccagatct gagcctggga gctctctggc taactaggga acccactgct 3540
taagcctcaa taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga 3600
ctctggtaac tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtac 3660
gtatagtagt tcatgtcatc ttattattca gtatttataa cttgcaaaga aatgaatatc 3720
agagagtgag aggaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat 3780
cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact 3840
catcaatgta tcttatcatg tctggctcta gctatcccgc ccctaactcc gcccatcccg 3900
cccctaactc cgcccagttc cgcccattct ccgccccatg gctgactaat tttttttatt 3960
tatgcagagg ccgaggccgc ctcggcctct gagctattcc agaagtagtg aggaggcttt 4020
tttggaggcc tagggacgta cccaattcgc cctatagtga gtcgtattac gcgcgctcac 4080
tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc 4140
ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc 4200
cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa 4260
gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc 4320
ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag 4380
ctctaaatcg ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca 4440
aaaaacttga ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc 4500
gccctttgac gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa 4560
cactcaaccc tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct 4620
attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa 4680
cgcttacaat ttaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt 4740
tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca 4800
ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt 4860
ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga 4920
tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa 4980
gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct 5040
gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat 5100
acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga 5160
tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc 5220
caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat 5280
gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa 5340
cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac 5400
tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa 5460
agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc 5520
tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc 5580
ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag 5640
acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta 5700
ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa 5760
gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc 5820
gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat 5880
ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga 5940
gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt 6000
tcttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata 6060
cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac 6120
cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg 6180
ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg 6240
tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag 6300
cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct 6360
ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc 6420
aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt 6480
ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg 6540
tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga 6600
gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg 6660
gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg 6720
caacgcaatt aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct 6780
tccggctcgt atgttgtgtg gaattgtgag cggataacaa tttcacacag gaaacagcta 6840
tgaccatgat tacgccaagc gcgcaattaa ccctcactaa agggaacaaa agctggagct 6900
gcaagcttaa tgtagtctta tgcaatactc ttgtagtctt gcaacatggt aacgatgagt 6960
tagcaacatg ccttacaagg agagaaaaag caccgtgcat gccgattggt ggaagtaagg 7020
tggtacgatc gtgccttatt aggaaggcaa cagacgggtc tgacatggat tggacgaacc 7080
actgaattgc cgcattgcag agatattgta tttaagtgcc tagctcgata cataaacggg 7140
tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg 7200
cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt 7260
gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcagt 7320
ggcgcccgaa cagggacttg aaagcgaaag ggaaaccaga ggagctctct cgacgcagga 7380
ctcggcttgc tgaagcgcgc acggcaagag gcgaggggcg gcgactggtg agtacgccaa 7440
aaattttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc agtattaagc 7500
gggggagaat tagatcgcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 7560
ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg 7620
gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc 7680
agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc 7740
atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa 7800
acaaaagtaa gaccaccgca cagcaagcgg ccgctgatct tcagacctgg aggaggagat 7860
atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat tgaaccatta 7920
ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga gagaaaaaag agcagtggga 7980
ataggagctt tgttccttgg gttcttggga gcagcaggaa gcactatggg cgcagcgtca 8040
atgacgctga cggtacaggc cagacaatta ttgtctggta tagtgcagca gcagaacaat 8100
ttgctgaggg ctattgaggc gcaacagcat ctgttgcaac tcacagtctg gggcatcaag 8160
cagctccagg caagaatcct ggctgtggaa agatacctaa aggatcaaca gctcctgggg 8220
atttggggtt gctctggaaa actcatttgc accactgctg tgccttggaa tgctagttgg 8280
agtaataaat ctctggaaca gatttggaat cacacgacct ggatggagtg ggacagagaa 8340
attaacaatt acacaagctt aatacactcc ttaattgaag aatcgcaaaa ccagcaagaa 8400
aagaatgaac aagaattatt ggaattagat aaatgggcaa gtttgtggaa ttggtttaac 8460
ataacaaatt ggctgtggta tataaaatta ttcataatga tagtaggagg cttggtaggt 8520
ttaagaatag tttttgctgt actttctata gtgaatagag ttaggcaggg atattcacca 8580
ttatcgtttc agacccacct cccaaccccg aggggaccct tgcgcctttt ccaaggcagc 8640
cctgggtttg cgcagggacg cggctgctct gggcgtggtt ccgggaaacg cagcggcgcc 8700
gaccctgggt ctcgcacatt cttcacgtcc gttcgcagcg tcacccggat cttcgccgct 8760
acccttgtgg gccccccggc gacgcttcct gctccgcccc taagtcggga aggttccttg 8820
cggttcgcgg cgtgccggac gtgacaaacg gaagccgcac gtctcactag taccctcgca 8880
gacggacagc gccagggagc aatggcagcg cgccgaccgc gatgggctgt ggccaatagc 8940
ggctgctcag cagggcgcgc cgagagcagc ggccgggaag gggcggtgcg ggaggcgggg 9000
tgtggggcgg tagtgtgggc cctgttcctg cccgcgcggt gttccgcatt ctgcaagcct 9060
ccggagcgca cgtcggcagt cggctccctc gttgaccgaa tcaccgacct ctctccccag 9120
ggggtaccca gctgtctaga gaattctaga tcttgagaca aatggcagta ttcatccaca 9180
attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata gtagacataa 9240
tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt caaaattttc 9300
gggtttatta cagggacagc agagatccac tttggcgccg gctcgagggg g 9351
<210> 53
<211> 235
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 53
Met Gly Ser Ala Ile Glu Gln Asp Gly Leu His Ala Gly Ser Pro Ala
1 5 10 15
Ala Trp Val Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gln Gln Thr Ile
20 25 30
Gly Cys Ser Asp Ala Ala Val Phe Arg Leu Ser Ala Gln Gly Arg Pro
35 40 45
Val Leu Phe Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gln
50 55 60
Asp Glu Ala Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys
65 70 75 80
Ala Ala Val Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu
85 90 95
Leu Gly Glu Val Pro Gly Gln Asp Leu Leu Ser Ser His Leu Ala Pro
100 105 110
Ala Glu Lys Val Ser Ile Met Ala Asp Ala Met Arg Arg Leu His Thr
115 120 125
Leu Asp Pro Ala Thr Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val
130 135 140
Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu
145 150 155 160
Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro
165 170 175
Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys
180 185 190
Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met
195 200 205
Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu
210 215 220
Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
225 230 235
<210> 54
<211> 9162
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 54
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcccattcg accaccaagc gaaacatcgc 780
atcgagcgag cacgtactcg gatggaagcc ggtcttgtcg atcaggatga tctggacgaa 840
gagcatcagg ggctcgcgcc agccgaactg ttcgccaggc tcaaggcgcg catgcccgac 900
ggcgatgatc tcgtcgtgac ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat 960
ggccgctttt ctggattcat cgactgtggc cggctgggtg tggcggaccg ctatcaggac 1020
atagcgttgg ctacccgtga tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc 1080
ctcgtgcttt acggtatcgc cgctcccgat tcgcagcgca tcgccttcta tcgccttctt 1140
gacgagttct tctgattaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1200
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1260
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1320
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1380
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1440
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1500
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1560
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1620
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 1680
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 1740
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 1800
acggggacgt ggttttcctt tgaaaaacac gataatacca tggtgagcaa gggcgaggag 1860
gataacatgg ccatcatcaa ggagttcatg cgcttcaagg tgcacatgga gggctccgtg 1920
aacggccacg agttcgagat cgagggcgag ggcgagggcc gcccctacga gggcacccag 1980
accgccaagc tgaaggtgac caagggtggc cccctgccct tcgcctggga catcctgtcc 2040
cctcagttca tgtacggctc caaggcctac gtgaagcacc ccgccgacat ccccgactac 2100
ttgaagctgt ccttccccga gggcttcaag tgggagcgcg tgatgaactt cgaggacggc 2160
ggcgtggtga ccgtgaccca ggactcctcc ctgcaggacg gcgagttcat ctacaaggtg 2220
aagctgcgcg gcaccaactt cccctccgac ggccccgtaa tgcagaagaa gaccatgggc 2280
tgggaggcct cctccgagcg gatgtacccc gaggacggcg ccctgaaggg cgagatcaag 2340
cagaggctga agctgaagga cggcggccac tacgacgctg aggtcaagac cacctacaag 2400
gccaagaagc ccgtgcagct gcccggcgcc tacaacgtca acatcaagtt ggacatcacc 2460
tcccacaacg aggactacac catcgtggaa cagtacgaac gcgccgaggg ccgccactcc 2520
accggcggca tggacgagct gtacaagtaa caccggtggc gcgttaagtc gacaatcaac 2580
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 2640
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 2700
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 2760
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 2820
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 2880
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 2940
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 3000
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 3060
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 3120
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 3180
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 3240
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 3300
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 3360
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 3420
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcagta cgtatagtag 3480
ttcatgtcat cttattattc agtatttata acttgcaaag aaatgaatat cagagagtga 3540
gaggaacttg tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt 3600
cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt 3660
atcttatcat gtctggctct agctatcccg cccctaactc cgcccatccc gcccctaact 3720
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 3780
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 3840
ctagggacgt acccaattcg ccctatagtg agtcgtatta cgcgcgctca ctggccgtcg 3900
ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 3960
atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 4020
agttgcgcag cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg 4080
gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt 4140
tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc 4200
gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg 4260
attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga 4320
cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc 4380
ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa 4440
aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa 4500
tttaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4560
acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4620
aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 4680
attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 4740
tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 4800
gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 4860
cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 4920
tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 4980
agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5040
tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5100
tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5160
tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5220
acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5280
accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5340
tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5400
cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5460
tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5520
actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5580
tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 5640
cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 5700
gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 5760
tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg ttcttctagt 5820
gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 5880
gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 5940
ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6000
acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg 6060
agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6120
cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6180
tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6240
gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6300
ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6360
ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6420
cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6480
ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6540
taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6600
tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct atgaccatga 6660
ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc tgcaagctta 6720
atgtagtctt atgcaatact cttgtagtct tgcaacatgg taacgatgag ttagcaacat 6780
gccttacaag gagagaaaaa gcaccgtgca tgccgattgg tggaagtaag gtggtacgat 6840
cgtgccttat taggaaggca acagacgggt ctgacatgga ttggacgaac cactgaattg 6900
ccgcattgca gagatattgt atttaagtgc ctagctcgat acataaacgg gtctctctgg 6960
ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct 7020
caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 7080
aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag tggcgcccga 7140
acagggactt gaaagcgaaa gggaaaccag aggagctctc tcgacgcagg actcggcttg 7200
ctgaagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca aaaattttga 7260
ctagcggagg ctagaaggag agagatgggt gcgagagcgt cagtattaag cgggggagaa 7320
ttagatcgcg atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa tataaattaa 7380
aacatatagt atgggcaagc agggagctag aacgattcgc agttaatcct ggcctgttag 7440
aaacatcaga aggctgtaga caaatactgg gacagctaca accatccctt cagacaggat 7500
cagaagaact tagatcatta tataatacag tagcaaccct ctattgtgtg catcaaagga 7560
tagagataaa agacaccaag gaagctttag acaagataga ggaagagcaa aacaaaagta 7620
agaccaccgc acagcaagcg gccgctgatc ttcagacctg gaggaggaga tatgagggac 7680
aattggagaa gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca 7740
cccaccaagg caaagagaag agtggtgcag agagaaaaaa gagcagtggg aataggagct 7800
ttgttccttg ggttcttggg agcagcagga agcactatgg gcgcagcgtc aatgacgctg 7860
acggtacagg ccagacaatt attgtctggt atagtgcagc agcagaacaa tttgctgagg 7920
gctattgagg cgcaacagca tctgttgcaa ctcacagtct ggggcatcaa gcagctccag 7980
gcaagaatcc tggctgtgga aagataccta aaggatcaac agctcctggg gatttggggt 8040
tgctctggaa aactcatttg caccactgct gtgccttgga atgctagttg gagtaataaa 8100
tctctggaac agatttggaa tcacacgacc tggatggagt gggacagaga aattaacaat 8160
tacacaagct taatacactc cttaattgaa gaatcgcaaa accagcaaga aaagaatgaa 8220
caagaattat tggaattaga taaatgggca agtttgtgga attggtttaa cataacaaat 8280
tggctgtggt atataaaatt attcataatg atagtaggag gcttggtagg tttaagaata 8340
gtttttgctg tactttctat agtgaataga gttaggcagg gatattcacc attatcgttt 8400
cagacccacc tcccaacccc gaggggaccc ttgcgccttt tccaaggcag ccctgggttt 8460
gcgcagggac gcggctgctc tgggcgtggt tccgggaaac gcagcggcgc cgaccctggg 8520
tctcgcacat tcttcacgtc cgttcgcagc gtcacccgga tcttcgccgc tacccttgtg 8580
ggccccccgg cgacgcttcc tgctccgccc ctaagtcggg aaggttcctt gcggttcgcg 8640
gcgtgccgga cgtgacaaac ggaagccgca cgtctcacta gtaccctcgc agacggacag 8700
cgccagggag caatggcagc gcgccgaccg cgatgggctg tggccaatag cggctgctca 8760
gcagggcgcg ccgagagcag cggccgggaa ggggcggtgc gggaggcggg gtgtggggcg 8820
gtagtgtggg ccctgttcct gcccgcgcgg tgttccgcat tctgcaagcc tccggagcgc 8880
acgtcggcag tcggctccct cgttgaccga atcaccgacc tctctcccca gggggtaccc 8940
agctgtctag agaattctag atcttgagac aaatggcagt attcatccac aattttaaaa 9000
gaaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 9060
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 9120
acagggacag cagagatcca ctttggcgcc ggctcgaggg gg 9162
<210> 55
<211> 170
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 55
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Pro Phe Asp His Gln Ala Lys His Arg Ile Glu
35 40 45
Arg Ala Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gln Asp Asp Leu
50 55 60
Asp Glu Glu His Gln Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg Leu
65 70 75 80
Lys Ala Arg Met Pro Asp Gly Asp Asp Leu Val Val Thr His Gly Asp
85 90 95
Ala Cys Leu Pro Asn Ile Met Val Glu Asn Gly Arg Phe Ser Gly Phe
100 105 110
Ile Asp Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gln Asp Ile Ala
115 120 125
Leu Ala Thr Arg Asp Ile Ala Glu Glu Leu Gly Gly Glu Trp Ala Asp
130 135 140
Arg Phe Leu Val Leu Tyr Gly Ile Ala Ala Pro Asp Ser Gln Arg Ile
145 150 155 160
Ala Phe Tyr Arg Leu Leu Asp Glu Phe Phe
165 170
<210> 56
<211> 9534
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 56
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgggatcggc cattgaacaa 660
gatggattgc acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg 720
gcacaacaga caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc 780
ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca 840
gcgcggctat cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc 900
actgaagcgg gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca 960
tctcaccttg ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat 1020
acgcttgatc cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca 1080
cgtactcgga tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg 1140
ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgatgatctc 1200
gtcgtgaccc atggcgatgc ctgcctttca tacgagaccg agatcctgac tgtcgagtac 1260
ggattgcttc ctatcggcaa aatcgtggag aagaggattg aatgtaccgt ctattcagtc 1320
gataataatg ggaacatcta cacacagccc gtggctcaat ggcacgacag aggagagcag 1380
gaagtttttg aatactgtct cgaggacgga tccctcatcc gcgctactaa agatcataag 1440
tttatgaccg tggacggcca gatgctgcca attgacgaaa tttttgaacg agagctggat 1500
ctgatgagag tcgacaacct tccaaactga ttaattaaga attcgaccca gctttcttgt 1560
acaaagtggt tggtaagcct atccctaacc ctctcctcgg tctcgattct acgtagtaat 1620
gagctagcag tctcgaggtt aacgaattcc gccccccccc taacgttact ggccgaagcc 1680
gcttggaata aggccggtgt gcgcttgtct atatgttatt ttccaccata ttgccgtctt 1740
ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt gacgagcatt cctaggggtc 1800
tttcccctct cgccaaagga atgcaaggtc tgttgaatgt cgtgaaggaa gcagttcctc 1860
tggaagcttc ttgaagacaa acaacgtctg tagcgaccct ttgcaggcag cggaaccccc 1920
cacctggcga caggtgcccc tgcggccaaa agccacgtgt ataagataca cctgcaaagg 1980
cggcacaacc ccagtgccac gttgtgagtt ggatagttgt ggaaagagtc aaatggctct 2040
cctcaagcgt attcaacaag gggctgaagg atgcccagaa ggtaccccat tgtatgggat 2100
ctgatctggg gcctcggtgc acatgcttta catgtgttta gtcgaggtta aaaaaacgtc 2160
taggcccccc gaaccacggg gacgtggttt tcctttgaaa aacacgataa taccatggcc 2220
atgagcgagc tgattaagga gaacatgcac atgaagctgt acatggaggg caccgtggac 2280
aaccatcact tcaagtgcac atccgagggc gaaggcaagc cctacgaggg cacccagacc 2340
atgagaatca aggtggtcga gggcggccct ctccccttcg ccttcgacat cctggctact 2400
agcttcctct acggcagcaa gaccttcatc aaccacaccc agggcatccc cgacttcttc 2460
aagcagtcct tccctgaggg cttcacatgg gagagagtca ccacatacga agacgggggc 2520
gtgctgaccg ctacccagga caccagcctc caggacggct gcctcatcta caacgtcaag 2580
atcagagggg tgaacttcac atccaacggc cctgtgatgc agaagaaaac actcggctgg 2640
gaggccttca ccgagacgct gtaccccgct gacggcggcc tggaaggcag aaacgacatg 2700
gccctgaagc tcgtgggcgg gagccatctg atcgcaaaca tcaagaccac atatagatcc 2760
aagaaacccg ctaagaacct caagatgcct ggcgtctact atgtggacta cagactggaa 2820
agaatcaagg aggccaacaa cgagacctac gtcgagcagc acgaggtggc agtggccaga 2880
tactgcgacc tccctagcaa actggggcac aagcttaatt aacaccggtg gcgcgttaag 2940
tcgacaatca acctctggat tacaaaattt gtgaaagatt gactggtatt cttaactatg 3000
ttgctccttt tacgctatgt ggatacgctg ctttaatgcc tttgtatcat gctattgctt 3060
cccgtatggc tttcattttc tcctccttgt ataaatcctg gttgctgtct ctttatgagg 3120
agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc 3180
ccactggttg gggcattgcc accacctgtc agctcctttc cgggactttc gctttccccc 3240
tccctattgc cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg acaggggctc 3300
ggctgttggg cactgacaat tccgtggtgt tgtcggggaa atcatcgtcc tttccttggc 3360
tgctcgcctg tgttgccacc tggattctgc gcgggacgtc cttctgctac gtcccttcgg 3420
ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc 3480
gtcttcgcct tcgccctcag acgagtcgga tctccctttg ggccgcctcc ccgcgtcgac 3540
tttaagacca atgacttaca aggcagctgt agatcttagc cactttttaa aagaaaaggg 3600
gggactggaa gggctaattc actcccaacg aagacaagat ctgctttttg cttgtactgg 3660
gtctctctgg ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact 3720
gcttaagcct caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg 3780
tgactctggt aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag 3840
tacgtatagt agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat 3900
atcagagagt gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag 3960
catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa 4020
actcatcaat gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccatc 4080
ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt 4140
atttatgcag aggccgaggc cgcctcggcc tctgagctat tccagaagta gtgaggaggc 4200
ttttttggag gcctagggac gtacccaatt cgccctatag tgagtcgtat tacgcgcgct 4260
cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc 4320
gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc 4380
gcccttccca acagttgcgc agcctgaatg gcgaatggga cgcgccctgt agcggcgcat 4440
taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 4500
cgcccgctcc tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc 4560
aagctctaaa tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 4620
ccaaaaaact tgattagggt gatggttcac gtagtgggcc atcgccctga tagacggttt 4680
ttcgcccttt gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa 4740
caacactcaa ccctatctcg gtctattctt ttgatttata agggattttg ccgatttcgg 4800
cctattggtt aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat 4860
taacgcttac aatttaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 4920
atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 4980
tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc 5040
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa 5100
agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg 5160
taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt 5220
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg 5280
catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac 5340
ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc 5400
ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa 5460
catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc 5520
aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt 5580
aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga 5640
taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa 5700
atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa 5760
gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa 5820
tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt 5880
ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt 5940
gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg 6000
agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt 6060
aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca 6120
agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac 6180
tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac 6240
atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct 6300
taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg 6360
gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga gatacctaca 6420
gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt 6480
aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta 6540
tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc 6600
gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc 6660
cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa 6720
ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag 6780
cgagtcagtg agcgaggaag cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg 6840
ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga 6900
gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 6960
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 7020
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctgga 7080
gctgcaagct taatgtagtc ttatgcaata ctcttgtagt cttgcaacat ggtaacgatg 7140
agttagcaac atgccttaca aggagagaaa aagcaccgtg catgccgatt ggtggaagta 7200
aggtggtacg atcgtgcctt attaggaagg caacagacgg gtctgacatg gattggacga 7260
accactgaat tgccgcattg cagagatatt gtatttaagt gcctagctcg atacataaac 7320
gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact agggaaccca 7380
ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg 7440
tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa aatctctagc 7500
agtggcgccc gaacagggac ttgaaagcga aagggaaacc agaggagctc tctcgacgca 7560
ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg gcggcgactg gtgagtacgc 7620
caaaaatttt gactagcgga ggctagaagg agagagatgg gtgcgagagc gtcagtatta 7680
agcgggggag aattagatcg cgatgggaaa aaattcggtt aaggccaggg ggaaagaaaa 7740
aatataaatt aaaacatata gtatgggcaa gcagggagct agaacgattc gcagttaatc 7800
ctggcctgtt agaaacatca gaaggctgta gacaaatact gggacagcta caaccatccc 7860
ttcagacagg atcagaagaa cttagatcat tatataatac agtagcaacc ctctattgtg 7920
tgcatcaaag gatagagata aaagacacca aggaagcttt agacaagata gaggaagagc 7980
aaaacaaaag taagaccacc gcacagcaag cggccgctga tcttcagacc tggaggagga 8040
gatatgaggg acaattggag aagtgaatta tataaatata aagtagtaaa aattgaacca 8100
ttaggagtag cacccaccaa ggcaaagaga agagtggtgc agagagaaaa aagagcagtg 8160
ggaataggag ctttgttcct tgggttcttg ggagcagcag gaagcactat gggcgcagcg 8220
tcaatgacgc tgacggtaca ggccagacaa ttattgtctg gtatagtgca gcagcagaac 8280
aatttgctga gggctattga ggcgcaacag catctgttgc aactcacagt ctggggcatc 8340
aagcagctcc aggcaagaat cctggctgtg gaaagatacc taaaggatca acagctcctg 8400
gggatttggg gttgctctgg aaaactcatt tgcaccactg ctgtgccttg gaatgctagt 8460
tggagtaata aatctctgga acagatttgg aatcacacga cctggatgga gtgggacaga 8520
gaaattaaca attacacaag cttaatacac tccttaattg aagaatcgca aaaccagcaa 8580
gaaaagaatg aacaagaatt attggaatta gataaatggg caagtttgtg gaattggttt 8640
aacataacaa attggctgtg gtatataaaa ttattcataa tgatagtagg aggcttggta 8700
ggtttaagaa tagtttttgc tgtactttct atagtgaata gagttaggca gggatattca 8760
ccattatcgt ttcagaccca cctcccaacc ccgaggggac ccttgcgcct tttccaaggc 8820
agccctgggt ttgcgcaggg acgcggctgc tctgggcgtg gttccgggaa acgcagcggc 8880
gccgaccctg ggtctcgcac attcttcacg tccgttcgca gcgtcacccg gatcttcgcc 8940
gctacccttg tgggcccccc ggcgacgctt cctgctccgc ccctaagtcg ggaaggttcc 9000
ttgcggttcg cggcgtgccg gacgtgacaa acggaagccg cacgtctcac tagtaccctc 9060
gcagacggac agcgccaggg agcaatggca gcgcgccgac cgcgatgggc tgtggccaat 9120
agcggctgct cagcagggcg cgccgagagc agcggccggg aaggggcggt gcgggaggcg 9180
gggtgtgggg cggtagtgtg ggccctgttc ctgcccgcgc ggtgttccgc attctgcaag 9240
cctccggagc gcacgtcggc agtcggctcc ctcgttgacc gaatcaccga cctctctccc 9300
cagggggtac ccagctgtct agagaattct agatcttgag acaaatggca gtattcatcc 9360
acaattttaa aagaaaaggg gggattgggg ggtacagtgc aggggaaaga atagtagaca 9420
taatagcaac agacatacaa actaaagaat tacaaaaaca aattacaaaa attcaaaatt 9480
ttcgggttta ttacagggac agcagagatc cactttggcg ccggctcgag gggg 9534
<210> 57
<211> 296
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 57
Met Gly Ser Ala Ile Glu Gln Asp Gly Leu His Ala Gly Ser Pro Ala
1 5 10 15
Ala Trp Val Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gln Gln Thr Ile
20 25 30
Gly Cys Ser Asp Ala Ala Val Phe Arg Leu Ser Ala Gln Gly Arg Pro
35 40 45
Val Leu Phe Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gln
50 55 60
Asp Glu Ala Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys
65 70 75 80
Ala Ala Val Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu
85 90 95
Leu Gly Glu Val Pro Gly Gln Asp Leu Leu Ser Ser His Leu Ala Pro
100 105 110
Ala Glu Lys Val Ser Ile Met Ala Asp Ala Met Arg Arg Leu His Thr
115 120 125
Leu Asp Pro Ala Thr Cys Pro Phe Asp His Gln Ala Lys His Arg Ile
130 135 140
Glu Arg Ala Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gln Asp Asp
145 150 155 160
Leu Asp Glu Glu His Gln Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg
165 170 175
Leu Lys Ala Arg Met Pro Asp Gly Asp Asp Leu Val Val Thr His Gly
180 185 190
Asp Ala Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly
195 200 205
Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val
210 215 220
Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln
225 230 235 240
Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp
245 250 255
Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp
260 265 270
Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu
275 280 285
Met Arg Val Asp Asn Leu Pro Asn
290 295
<210> 58
<211> 8979
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 58
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcttgccga atatcatggt ggaaaatggc 780
cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata 840
gcgttggcta cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc 900
gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac 960
gagttcttct gattaattaa gaattcgacc cagctttctt gtacaaagtg gttggtaagc 1020
ctatccctaa ccctctcctc ggtctcgatt ctacgtagta atgagctagc agtctcgagg 1080
ttaacgaatt ccgccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt 1140
gtgcgcttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc 1200
ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag 1260
gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac 1320
aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc 1380
cctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc 1440
acgttgtgag ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca 1500
aggggctgaa ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt 1560
gcacatgctt tacatgtgtt tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg 1620
gggacgtggt tttcctttga aaaacacgat aataccatgg tgagcaaggg cgaggaggat 1680
aacatggcca tcatcaagga gttcatgcgc ttcaaggtgc acatggaggg ctccgtgaac 1740
ggccacgagt tcgagatcga gggcgagggc gagggccgcc cctacgaggg cacccagacc 1800
gccaagctga aggtgaccaa gggtggcccc ctgcccttcg cctgggacat cctgtcccct 1860
cagttcatgt acggctccaa ggcctacgtg aagcaccccg ccgacatccc cgactacttg 1920
aagctgtcct tccccgaggg cttcaagtgg gagcgcgtga tgaacttcga ggacggcggc 1980
gtggtgaccg tgacccagga ctcctccctg caggacggcg agttcatcta caaggtgaag 2040
ctgcgcggca ccaacttccc ctccgacggc cccgtaatgc agaagaagac catgggctgg 2100
gaggcctcct ccgagcggat gtaccccgag gacggcgccc tgaagggcga gatcaagcag 2160
aggctgaagc tgaaggacgg cggccactac gacgctgagg tcaagaccac ctacaaggcc 2220
aagaagcccg tgcagctgcc cggcgcctac aacgtcaaca tcaagttgga catcacctcc 2280
cacaacgagg actacaccat cgtggaacag tacgaacgcg ccgagggccg ccactccacc 2340
ggcggcatgg acgagctgta caagtaacac cggtggcgcg ttaagtcgac aatcaacctc 2400
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 2460
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 2520
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 2580
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 2640
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 2700
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 2760
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 2820
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 2880
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 2940
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3000
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3060
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3120
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3180
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3240
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3300
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 3360
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 3420
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 3480
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 3540
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 3600
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 3660
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 3720
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 3780
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 3840
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 3900
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 3960
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4020
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4080
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4140
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4200
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4260
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4320
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 4380
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 4440
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 4500
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 4560
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 4620
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 4680
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 4740
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 4800
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 4860
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 4920
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 4980
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5040
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5100
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5160
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5220
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5280
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 5340
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 5400
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 5460
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 5520
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 5580
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 5640
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 5700
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 5760
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 5820
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 5880
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 5940
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6000
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6060
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6120
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6180
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6240
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6300
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6360
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6420
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6480
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 6540
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 6600
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 6660
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 6720
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 6780
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 6840
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 6900
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 6960
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7020
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7080
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7140
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7200
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7260
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7320
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 7380
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 7440
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 7500
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 7560
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 7620
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 7680
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 7740
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 7800
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 7860
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 7920
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 7980
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8040
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8100
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8160
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8220
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8280
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 8340
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 8400
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 8460
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 8520
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 8580
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 8640
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 8700
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 8760
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 8820
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 8880
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 8940
gggacagcag agatccactt tggcgccggc tcgaggggg 8979
<210> 59
<211> 109
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 59
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Leu Pro Asn Ile Met Val Glu Asn Gly Arg Phe
35 40 45
Ser Gly Phe Ile Asp Cys Gly Arg Leu Gly Val Ala Asp Arg Tyr Gln
50 55 60
Asp Ile Ala Leu Ala Thr Arg Asp Ile Ala Glu Glu Leu Gly Gly Glu
65 70 75 80
Trp Ala Asp Arg Phe Leu Val Leu Tyr Gly Ile Ala Ala Pro Asp Ser
85 90 95
Gln Arg Ile Ala Phe Tyr Arg Leu Leu Asp Glu Phe Phe
100 105
<210> 60
<211> 9186
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 60
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgccgatg gtttctacaa agatcgttat 780
gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga cattggggaa 840
tttagcgaga gcctgaccta ttgcctttca tacgagaccg agatcctgac tgtcgagtac 900
ggattgcttc ctatcggcaa aatcgtggag aagaggattg aatgtaccgt ctattcagtc 960
gataataatg ggaacatcta cacacagccc gtggctcaat ggcacgacag aggagagcag 1020
gaagtttttg aatactgtct cgaggacgga tccctcatcc gcgctactaa agatcataag 1080
tttatgaccg tggacggcca gatgctgcca attgacgaaa tttttgaacg agagctggat 1140
ctgatgagag tcgacaacct tccaaactga ttaattaaga attcgaccca gctttcttgt 1200
acaaagtggt tggtaagcct atccctaacc ctctcctcgg tctcgattct acgtagtaat 1260
gagctagcag tctcgaggtt aacgaattcc gccccccccc taacgttact ggccgaagcc 1320
gcttggaata aggccggtgt gcgcttgtct atatgttatt ttccaccata ttgccgtctt 1380
ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt gacgagcatt cctaggggtc 1440
tttcccctct cgccaaagga atgcaaggtc tgttgaatgt cgtgaaggaa gcagttcctc 1500
tggaagcttc ttgaagacaa acaacgtctg tagcgaccct ttgcaggcag cggaaccccc 1560
cacctggcga caggtgcccc tgcggccaaa agccacgtgt ataagataca cctgcaaagg 1620
cggcacaacc ccagtgccac gttgtgagtt ggatagttgt ggaaagagtc aaatggctct 1680
cctcaagcgt attcaacaag gggctgaagg atgcccagaa ggtaccccat tgtatgggat 1740
ctgatctggg gcctcggtgc acatgcttta catgtgttta gtcgaggtta aaaaaacgtc 1800
taggcccccc gaaccacggg gacgtggttt tcctttgaaa aacacgataa taccatggtg 1860
agcaagggcg aggagctgtt caccggggtg gtgcccatcc tggtcgagct ggacggcgac 1920
gtaaacggcc acaagttcag cgtgtccggc gagggcgagg gcgatgccac ctacggcaag 1980
ctgaccctga agttcatctg caccaccggc aagctgcccg tgccctggcc caccctcgtg 2040
accaccctga cctacggcgt gcagtgcttc agccgctacc ccgaccacat gaagcagcac 2100
gacttcttca agtccgccat gcccgaaggc tacgtccagg agcgcaccat cttcttcaag 2160
gacgacggca actacaagac ccgcgccgag gtgaagttcg agggcgacac cctggtgaac 2220
cgcatcgagc tgaagggcat cgacttcaag gaggacggca acatcctggg gcacaagctg 2280
gagtacaact acaacagcca caacgtctat atcatggccg acaagcagaa gaacggcatc 2340
aaggtgaact tcaagatccg ccacaacatc gaggacggca gcgtgcagct cgccgaccac 2400
taccagcaga acacccccat cggcgacggc cccgtgctgc tgcccgacaa ccactacctg 2460
agcacccagt ccgccctgag caaagacccc aacgagaagc gcgatcacat ggtcctgctg 2520
gagttcgtga ccgccgccgg gatcactctc ggcatggacg agctgtacaa gtaacaccgg 2580
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 2640
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 2700
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 2760
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 2820
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 2880
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 2940
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 3000
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 3060
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 3120
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 3180
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 3240
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 3300
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 3360
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 3420
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 3480
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 3540
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 3600
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3660
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 3720
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 3780
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 3840
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 3900
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3960
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 4020
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 4080
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 4140
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 4200
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 4260
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 4320
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4380
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4440
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4500
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 4560
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 4620
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 4680
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 4740
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 4800
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 4860
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 4920
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 4980
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 5040
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 5100
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 5160
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 5220
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 5280
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 5340
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 5400
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 5460
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 5520
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 5580
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 5640
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 5700
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 5760
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 5820
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 5880
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 5940
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 6000
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 6060
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 6120
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 6180
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 6240
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 6300
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 6360
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 6420
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 6480
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 6540
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 6600
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 6660
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 6720
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 6780
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 6840
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 6900
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 6960
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 7020
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 7080
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 7140
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 7200
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 7260
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 7320
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 7380
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 7440
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 7500
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 7560
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 7620
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 7680
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 7740
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 7800
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 7860
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 7920
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 7980
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 8040
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 8100
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 8160
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 8220
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 8280
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 8340
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 8400
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 8460
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 8520
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 8580
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 8640
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 8700
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 8760
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 8820
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 8880
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 8940
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 9000
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 9060
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 9120
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 9180
aggggg 9186
<210> 61
<211> 175
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 61
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
35 40 45
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
50 55 60
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Leu Ser Tyr Glu Thr Glu
65 70 75 80
Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu
85 90 95
Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile
100 105 110
Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val
115 120 125
Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp
130 135 140
His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile
145 150 155 160
Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
165 170 175
<210> 62
<211> 9528
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 62
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcatctccc gccgtgcaca gggtgtcacg 780
ttgcaagacc tgcctgaaac cgaactgccc gctgttctgc agccggtcgc ggaggccatg 840
gatgcgatcg ctgcggccga tcttagccag acgagcgggt tcggcccatt cggaccgcaa 900
ggaatcggtc aatacactac atggcgtgat ttcatatgcg cgattgctga tccccatgtg 960
tatcactggc aaactgtgat ggacgacacc gtcagtgcgt ccgtcgcgca ggctctcgat 1020
gagctgatgc tttgggccga ggactgcccc gaagtccggc acctcgtgca cgcggatttc 1080
ggctccaaca atgtcctgac ggacaatggc cgcataacag cggtcattga ctggagcgag 1140
gcgatgttcg gggattccca atacgaggtc gccaacatct tcttctggag gccgtggttg 1200
gcttgccttt catacgagac cgagatcctg actgtcgagt acggattgct tcctatcggc 1260
aaaatcgtgg agaagaggat tgaatgtacc gtctattcag tcgataataa tgggaacatc 1320
tacacacagc ccgtggctca atggcacgac agaggagagc aggaagtttt tgaatactgt 1380
ctcgaggacg gatccctcat ccgcgctact aaagatcata agtttatgac cgtggacggc 1440
cagatgctgc caattgacga aatttttgaa cgagagctgg atctgatgag agtcgacaac 1500
cttccaaact gattaattaa gaattcgacc cagctttctt gtacaaagtg gttggtaagc 1560
ctatccctaa ccctctcctc ggtctcgatt ctacgtagta atgagctagc agtctcgagg 1620
ttaacgaatt ccgccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt 1680
gtgcgcttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc 1740
ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag 1800
gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac 1860
aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc 1920
cctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc 1980
acgttgtgag ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca 2040
aggggctgaa ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt 2100
gcacatgctt tacatgtgtt tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg 2160
gggacgtggt tttcctttga aaaacacgat aataccatgg tgagcaaggg cgaggagctg 2220
ttcaccgggg tggtgcccat cctggtcgag ctggacggcg acgtaaacgg ccacaagttc 2280
agcgtgtccg gcgagggcga gggcgatgcc acctacggca agctgaccct gaagttcatc 2340
tgcaccaccg gcaagctgcc cgtgccctgg cccaccctcg tgaccaccct gacctacggc 2400
gtgcagtgct tcagccgcta ccccgaccac atgaagcagc acgacttctt caagtccgcc 2460
atgcccgaag gctacgtcca ggagcgcacc atcttcttca aggacgacgg caactacaag 2520
acccgcgccg aggtgaagtt cgagggcgac accctggtga accgcatcga gctgaagggc 2580
atcgacttca aggaggacgg caacatcctg gggcacaagc tggagtacaa ctacaacagc 2640
cacaacgtct atatcatggc cgacaagcag aagaacggca tcaaggtgaa cttcaagatc 2700
cgccacaaca tcgaggacgg cagcgtgcag ctcgccgacc actaccagca gaacaccccc 2760
atcggcgacg gccccgtgct gctgcccgac aaccactacc tgagcaccca gtccgccctg 2820
agcaaagacc ccaacgagaa gcgcgatcac atggtcctgc tggagttcgt gaccgccgcc 2880
gggatcactc tcggcatgga cgagctgtac aagtaacacc ggtggcgcgt taagtcgaca 2940
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc 3000
cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta 3060
tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt 3120
ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg 3180
gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta 3240
ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt 3300
tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg 3360
cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca 3420
atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc 3480
gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgcgt cgactttaag 3540
accaatgact tacaaggcag ctgtagatct tagccacttt ttaaaagaaa aggggggact 3600
ggaagggcta attcactccc aacgaagaca agatctgctt tttgcttgta ctgggtctct 3660
ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa 3720
gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc 3780
tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtacgta 3840
tagtagttca tgtcatctta ttattcagta tttataactt gcaaagaaat gaatatcaga 3900
gagtgagagg aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 3960
aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 4020
caatgtatct tatcatgtct ggctctagct atcccgcccc taactccgcc catcccgccc 4080
ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 4140
gcagaggccg aggccgcctc ggcctctgag ctattccaga agtagtgagg aggctttttt 4200
ggaggcctag ggacgtaccc aattcgccct atagtgagtc gtattacgcg cgctcactgg 4260
ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 4320
cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 4380
cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc gcattaagcg 4440
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 4500
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 4560
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 4620
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 4680
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 4740
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 4800
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 4860
ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 4920
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 4980
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 5040
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 5100
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 5160
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 5220
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 5280
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 5340
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 5400
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5460
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5520
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5580
cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5640
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5700
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5760
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5820
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5880
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5940
cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 6000
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 6060
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 6120
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 6180
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 6240
cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 6300
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 6360
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6420
gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6480
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6540
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6600
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6660
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6720
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6780
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6840
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6900
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6960
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga 7020
ccatgattac gccaagcgcg caattaaccc tcactaaagg gaacaaaagc tggagctgca 7080
agcttaatgt agtcttatgc aatactcttg tagtcttgca acatggtaac gatgagttag 7140
caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc gattggtgga agtaaggtgg 7200
tacgatcgtg ccttattagg aaggcaacag acgggtctga catggattgg acgaaccact 7260
gaattgccgc attgcagaga tattgtattt aagtgcctag ctcgatacat aaacgggtct 7320
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 7380
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7440
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 7500
gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 7560
ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 7620
ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 7680
ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 7740
aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 7800
tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 7860
caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 7920
aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 7980
aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 8040
agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 8100
gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 8160
ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 8220
acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 8280
ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 8340
ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 8400
tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 8460
aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 8520
aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 8580
aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 8640
acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 8700
agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 8760
tcgtttcaga cccacctccc aaccccgagg ggacccttgc gccttttcca aggcagccct 8820
gggtttgcgc agggacgcgg ctgctctggg cgtggttccg ggaaacgcag cggcgccgac 8880
cctgggtctc gcacattctt cacgtccgtt cgcagcgtca cccggatctt cgccgctacc 8940
cttgtgggcc ccccggcgac gcttcctgct ccgcccctaa gtcgggaagg ttccttgcgg 9000
ttcgcggcgt gccggacgtg acaaacggaa gccgcacgtc tcactagtac cctcgcagac 9060
ggacagcgcc agggagcaat ggcagcgcgc cgaccgcgat gggctgtggc caatagcggc 9120
tgctcagcag ggcgcgccga gagcagcggc cgggaagggg cggtgcggga ggcggggtgt 9180
ggggcggtag tgtgggccct gttcctgccc gcgcggtgtt ccgcattctg caagcctccg 9240
gagcgcacgt cggcagtcgg ctccctcgtt gaccgaatca ccgacctctc tccccagggg 9300
gtacccagct gtctagagaa ttctagatct tgagacaaat ggcagtattc atccacaatt 9360
ttaaaagaaa aggggggatt ggggggtaca gtgcagggga aagaatagta gacataatag 9420
caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 9480
tttattacag ggacagcaga gatccacttt ggcgccggct cgaggggg 9528
<210> 63
<211> 289
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 63
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ile Ser Arg Arg Ala Gln Gly Val Thr Leu Gln
35 40 45
Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu Gln Pro Val Ala Glu
50 55 60
Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser Gln Thr Ser Gly Phe
65 70 75 80
Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr Thr Thr Trp Arg Asp
85 90 95
Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr His Trp Gln Thr Val
100 105 110
Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln Ala Leu Asp Glu Leu
115 120 125
Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg His Leu Val His Ala
130 135 140
Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala
145 150 155 160
Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val
165 170 175
Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Leu Ser Tyr Glu
180 185 190
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile
195 200 205
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly
210 215 220
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln
225 230 235 240
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr
245 250 255
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp
260 265 270
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro
275 280 285
Asn
<210> 64
<211> 4596
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 64
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgagaccgt cgacctgcag actggctgtg tataagggag 720
cctgacattt atattcccca gaacatcagg ttaatggcgt ttttgatgtc attttcgcgg 780
tggctgagat cagccacttc ttccccgata acggagaccg gcacactggc catatcggtg 840
gtcatcatgc gccagctttc atccccgata tgcaccaccg ggtaaagttc acgggagact 900
ttatctgaca gcagacgtgc actggccagg gggatcacca tccgtcgccc gggcgtgtca 960
ataatatcac tctgtacatc cacaaacaga cgataacggc tctctctttt ataggtgtaa 1020
accttaaact gcatttcacc agcccctgtt ctcgtcagca aaagagccgt tcatttcaat 1080
aaaccgggcg acctcagcca tcccttcctg attttccgct ttccagcgtt cggcacgcag 1140
acgacgggct tcattctgca tggttgtgct taccagaccg gagatattga catcatatat 1200
gccttgagca actgatagct gtcgctgtca actgtcactg taatacgctg cttcatagca 1260
tacctctttt tgacatactt cgggtataca tatcagtata tattcttata ccgcaaaaat 1320
cagcgcgcaa atacgcatac tgttatctgg cttttagtaa gccggatcca gatctttacg 1380
ccccgccctg ccactcatcg cagtactgtt gtaattcatt aagcattctg ccgacatgga 1440
agccatcaca aacggcatga tgaacctgaa tcgccagcgg catcagcacc ttgtcgcctt 1500
gcgtataata tttgcccatg gtgaaaacgg gggcgaagaa gttgtccata ttggccacgt 1560
ttaaatcaaa actggtgaaa ctcacccagg gattggctga gacgaaaaac atattctcaa 1620
taaacccttt agggaaatag gccaggtttt caccgtaaca cgccacatct tgcgaatata 1680
tgtgtagaaa ctgccggaaa tcgtcgtggt attcactcca gagcgatgaa aaggtttcag 1740
tttgctcatg gaaaacggtg taacaagggt gaacactatc ccatatcacc agctcaccgt 1800
ctttcattgc catacggaat tccggatgag cattcatcag gcgggcaaga atgtgaataa 1860
aggccggata aaacttgtgc ttatttttct ttacggtctt taaaaaggcc gtaatatcca 1920
gctgaacggt ctggttatag gtacattgag caactgactg aaatgcctca aaatgttctt 1980
tacgatgcca ttgggatata tcaacggtgg tatatccagt gatttttttc tccattttag 2040
cttccttagc tcctgaaaat ctcgacggat cctaactcaa aatccacaca ttatacgagc 2100
cggaagcata aagtgtaaag cctggggtgc ctaatgcggc cgcggtctca tgcctttcat 2160
acgagaccga gatcctgact gtcgagtacg gattgcttcc tatcggcaaa atcgtggaga 2220
agaggattga atgtaccgtc tattcagtcg ataataatgg gaacatctac acacagcccg 2280
tggctcaatg gcacgacaga ggagagcagg aagtttttga atactgtctc gaggacggat 2340
ccctcatccg cgctactaaa gatcataagt ttatgaccgt ggacggccag atgctgccaa 2400
ttgacgaaat ttttgaacga gagctggatc tgatgagagt cgacaacctt ccaaactgat 2460
taattaagaa ttcgacccag ctttcttgta caaagttggc attataaaaa ataattgctc 2520
atcaatttgt tgcaacgaac aggtcactat cagtcaaaat aaaatcatta tttgccatcc 2580
agctgatatc ccctatagtg agtcgtatta catggtcata gctgtttcct ggcagctctg 2640
gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat catcatgcct 2700
cctctagacc agccaggaca gaaatgcctc gacttcgctg ctgcccaagg ttgccgggtg 2760
acgcacaccg tggaaacgga tgaaggcacg aacccagtgg acataagcct gttcggttcg 2820
taagctgtaa tgcaagtagc gtatgcgctc acgcaactgg tccagaacct tgaccgaacg 2880
cagcggtggt aacggcgcag tggcggtttt catggcttgt tatgactgtt tttttggggt 2940
acagtctatg cctcgggcat ccaagcagca agcgcgttac gccgtgggtc gatgtttgat 3000
gttatggagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa agttaaacat 3060
catgagggaa gcggtgatcg ccgaagtatc gactcaacta tcagaggtag ttggcgtcat 3120
cgagcgccat ctcgaaccga cgttgctggc cgtacatttg tacggctccg cagtggatgg 3180
cggcctgaag ccacacagtg atattgattt gctggttacg gtgaccgtaa ggcttgatga 3240
aacaacgcgg cgagctttga tcaacgacct tttggaaact tcggcttccc ctggagagag 3300
cgagattctc cgcgctgtag aagtcaccat tgttgtgcac gacgacatca ttccgtggcg 3360
ttatccagct aagcgcgaac tgcaatttgg agaatggcag cgcaatgaca ttcttgcagg 3420
tatcttcgag ccagccacga tcgacattga tctggctatc ttgctgacaa aagcaagaga 3480
acatagcgtt gccttggtag gtccagcggc ggaggaactc tttgatccgg ttcctgaaca 3540
ggatctattt gaggcgctaa atgaaacctt aacgctatgg aactcgccgc ccgactgggc 3600
tggcgatgag cgaaatgtag tgcttacgtt gtcccgcatt tggtacagcg cagtaaccgg 3660
caaaatcgcg ccgaaggatg tcgctgccga ctgggcaatg gagcgcctgc cggcccagta 3720
tcagcccgtc atacttgaag ctagacaggc ttatcttgga caagaagaag atcgcttggc 3780
ctcgcgcgca gatcagttgg aagaatttgt ccactacgtg aaaggcgaga tcaccaaggt 3840
agtcggcaaa taaccctcga gccacccatg accaaaatcc cttaacgtga gttacgcgtc 3900
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 3960
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 4020
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 4080
accaaatact gtccttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 4140
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 4200
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 4260
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 4320
atacctacag cgtgagcatt gagaaagcgc cacgcttccc gaagggagaa aggcggacag 4380
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 4440
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 4500
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 4560
gttcctggcc ttttgctggc cttttgctca catgtt 4596
<210> 65
<211> 4405
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 65
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgccgccac catgattaag atcgctacgc ggaagtacct 720
ggggaaacag aacgtctacg acataggtgt ggagcgcgat cacaactttg ctctgaaaaa 780
tggatttatc gccagcaact gagaccgtcg acctgcagac tggctgtgta taagggagcc 840
tgacatttat attccccaga acatcaggtt aatggcgttt ttgatgtcat tttcgcggtg 900
gctgagatca gccacttctt ccccgataac ggagaccggc acactggcca tatcggtggt 960
catcatgcgc cagctttcat ccccgatatg caccaccggg taaagttcac gggagacttt 1020
atctgacagc agacgtgcac tggccagggg gatcaccatc cgtcgcccgg gcgtgtcaat 1080
aatatcactc tgtacatcca caaacagacg ataacggctc tctcttttat aggtgtaaac 1140
cttaaactgc atttcaccag cccctgttct cgtcagcaaa agagccgttc atttcaataa 1200
accgggcgac ctcagccatc ccttcctgat tttccgcttt ccagcgttcg gcacgcagac 1260
gacgggcttc attctgcatg gttgtgctta ccagaccgga gatattgaca tcatatatgc 1320
cttgagcaac tgatagctgt cgctgtcaac tgtcactgta atacgctgct tcatagcata 1380
cctctttttg acatacttcg ggtatacata tcagtatata ttcttatacc gcaaaaatca 1440
gcgcgcaaat acgcatactg ttatctggct tttagtaagc cggatccaga tctttacgcc 1500
ccgccctgcc actcatcgca gtactgttgt aattcattaa gcattctgcc gacatggaag 1560
ccatcacaaa cggcatgatg aacctgaatc gccagcggca tcagcacctt gtcgccttgc 1620
gtataatatt tgcccatggt gaaaacgggg gcgaagaagt tgtccatatt ggccacgttt 1680
aaatcaaaac tggtgaaact cacccaggga ttggctgaga cgaaaaacat attctcaata 1740
aaccctttag ggaaataggc caggttttca ccgtaacacg ccacatcttg cgaatatatg 1800
tgtagaaact gccggaaatc gtcgtggtat tcactccaga gcgatgaaaa ggtttcagtt 1860
tgctcatgga aaacggtgta acaagggtga acactatccc atatcaccag ctcaccgtct 1920
ttcattgcca tacggaattc cggatgagca ttcatcaggc gggcaagaat gtgaataaag 1980
gccggataaa acttgtgctt atttttcttt acggtcttta aaaaggccgt aatatccagc 2040
tgaacggtct ggttataggt acattgagca actgactgaa atgcctcaaa atgttcttta 2100
cgatgccatt gggatatatc aacggtggta tatccagtga tttttttctc cattttagct 2160
tccttagctc ctgaaaatct cgacggatcc taactcaaaa tccacacatt atacgagccg 2220
gaagcataaa gtgtaaagcc tggggtgcct aatgcggccg cggtctcatt aattaagaat 2280
tcgacccagc tttcttgtac aaagttggca ttataaaaaa taattgctca tcaatttgtt 2340
gcaacgaaca ggtcactatc agtcaaaata aaatcattat ttgccatcca gctgatatcc 2400
cctatagtga gtcgtattac atggtcatag ctgtttcctg gcagctctgg cccgtgtctc 2460
aaaatctctg atgttacatt gcacaagata aaaatatatc atcatgcctc ctctagacca 2520
gccaggacag aaatgcctcg acttcgctgc tgcccaaggt tgccgggtga cgcacaccgt 2580
ggaaacggat gaaggcacga acccagtgga cataagcctg ttcggttcgt aagctgtaat 2640
gcaagtagcg tatgcgctca cgcaactggt ccagaacctt gaccgaacgc agcggtggta 2700
acggcgcagt ggcggttttc atggcttgtt atgactgttt ttttggggta cagtctatgc 2760
ctcgggcatc caagcagcaa gcgcgttacg ccgtgggtcg atgtttgatg ttatggagca 2820
gcaacgatgt tacgcagcag ggcagtcgcc ctaaaacaaa gttaaacatc atgagggaag 2880
cggtgatcgc cgaagtatcg actcaactat cagaggtagt tggcgtcatc gagcgccatc 2940
tcgaaccgac gttgctggcc gtacatttgt acggctccgc agtggatggc ggcctgaagc 3000
cacacagtga tattgatttg ctggttacgg tgaccgtaag gcttgatgaa acaacgcggc 3060
gagctttgat caacgacctt ttggaaactt cggcttcccc tggagagagc gagattctcc 3120
gcgctgtaga agtcaccatt gttgtgcacg acgacatcat tccgtggcgt tatccagcta 3180
agcgcgaact gcaatttgga gaatggcagc gcaatgacat tcttgcaggt atcttcgagc 3240
cagccacgat cgacattgat ctggctatct tgctgacaaa agcaagagaa catagcgttg 3300
ccttggtagg tccagcggcg gaggaactct ttgatccggt tcctgaacag gatctatttg 3360
aggcgctaaa tgaaacctta acgctatgga actcgccgcc cgactgggct ggcgatgagc 3420
gaaatgtagt gcttacgttg tcccgcattt ggtacagcgc agtaaccggc aaaatcgcgc 3480
cgaaggatgt cgctgccgac tgggcaatgg agcgcctgcc ggcccagtat cagcccgtca 3540
tacttgaagc tagacaggct tatcttggac aagaagaaga tcgcttggcc tcgcgcgcag 3600
atcagttgga agaatttgtc cactacgtga aaggcgagat caccaaggta gtcggcaaat 3660
aaccctcgag ccacccatga ccaaaatccc ttaacgtgag ttacgcgtcg ttccactgag 3720
cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 3780
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 3840
agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 3900
tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 3960
acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 4020
ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 4080
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 4140
gtgagcattg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 4200
gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 4260
tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 4320
caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 4380
tttgctggcc ttttgctcac atgtt 4405
<210> 66
<211> 4659
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 66
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgagaccgt cgacctgcag actggctgtg tataagggag 720
cctgacattt atattcccca gaacatcagg ttaatggcgt ttttgatgtc attttcgcgg 780
tggctgagat cagccacttc ttccccgata acggagaccg gcacactggc catatcggtg 840
gtcatcatgc gccagctttc atccccgata tgcaccaccg ggtaaagttc acgggagact 900
ttatctgaca gcagacgtgc actggccagg gggatcacca tccgtcgccc gggcgtgtca 960
ataatatcac tctgtacatc cacaaacaga cgataacggc tctctctttt ataggtgtaa 1020
accttaaact gcatttcacc agcccctgtt ctcgtcagca aaagagccgt tcatttcaat 1080
aaaccgggcg acctcagcca tcccttcctg attttccgct ttccagcgtt cggcacgcag 1140
acgacgggct tcattctgca tggttgtgct taccagaccg gagatattga catcatatat 1200
gccttgagca actgatagct gtcgctgtca actgtcactg taatacgctg cttcatagca 1260
tacctctttt tgacatactt cgggtataca tatcagtata tattcttata ccgcaaaaat 1320
cagcgcgcaa atacgcatac tgttatctgg cttttagtaa gccggatcca gatctttacg 1380
ccccgccctg ccactcatcg cagtactgtt gtaattcatt aagcattctg ccgacatgga 1440
agccatcaca aacggcatga tgaacctgaa tcgccagcgg catcagcacc ttgtcgcctt 1500
gcgtataata tttgcccatg gtgaaaacgg gggcgaagaa gttgtccata ttggccacgt 1560
ttaaatcaaa actggtgaaa ctcacccagg gattggctga gacgaaaaac atattctcaa 1620
taaacccttt agggaaatag gccaggtttt caccgtaaca cgccacatct tgcgaatata 1680
tgtgtagaaa ctgccggaaa tcgtcgtggt attcactcca gagcgatgaa aaggtttcag 1740
tttgctcatg gaaaacggtg taacaagggt gaacactatc ccatatcacc agctcaccgt 1800
ctttcattgc catacggaat tccggatgag cattcatcag gcgggcaaga atgtgaataa 1860
aggccggata aaacttgtgc ttatttttct ttacggtctt taaaaaggcc gtaatatcca 1920
gctgaacggt ctggttatag gtacattgag caactgactg aaatgcctca aaatgttctt 1980
tacgatgcca ttgggatata tcaacggtgg tatatccagt gatttttttc tccattttag 2040
cttccttagc tcctgaaaat ctcgacggat cctaactcaa aatccacaca ttatacgagc 2100
cggaagcata aagtgtaaag cctggggtgc ctaatgcggc cgcggtctca tgccttagct 2160
tcggtaccga gatactcacc gttgagtacg gaccactgcc aattggcaag atcgttagcg 2220
aagagattaa ttgttccgtc tactctgttg atcctgaggg cagggtatac acacaggcta 2280
ttgctcagtg gcatgacaga ggcgagcagg aggttcttga gtacgagctg gaggatgggt 2340
cagttatccg ggcgacatca gatcaccgat tcctgactac agattatcag ctgctcgcca 2400
ttgaagaaat cttcgcaaga cagttggatc tcctgactct ggaaaatatc aagcagaccg 2460
aggaagctct ggacaaccac cgcctgcctt ttccgctgct tgatgccggc accattaagt 2520
gattaattaa gaattcgacc cagctttctt gtacaaagtt ggcattataa aaaataattg 2580
ctcatcaatt tgttgcaacg aacaggtcac tatcagtcaa aataaaatca ttatttgcca 2640
tccagctgat atcccctata gtgagtcgta ttacatggtc atagctgttt cctggcagct 2700
ctggcccgtg tctcaaaatc tctgatgtta cattgcacaa gataaaaata tatcatcatg 2760
cctcctctag accagccagg acagaaatgc ctcgacttcg ctgctgccca aggttgccgg 2820
gtgacgcaca ccgtggaaac ggatgaaggc acgaacccag tggacataag cctgttcggt 2880
tcgtaagctg taatgcaagt agcgtatgcg ctcacgcaac tggtccagaa ccttgaccga 2940
acgcagcggt ggtaacggcg cagtggcggt tttcatggct tgttatgact gtttttttgg 3000
ggtacagtct atgcctcggg catccaagca gcaagcgcgt tacgccgtgg gtcgatgttt 3060
gatgttatgg agcagcaacg atgttacgca gcagggcagt cgccctaaaa caaagttaaa 3120
catcatgagg gaagcggtga tcgccgaagt atcgactcaa ctatcagagg tagttggcgt 3180
catcgagcgc catctcgaac cgacgttgct ggccgtacat ttgtacggct ccgcagtgga 3240
tggcggcctg aagccacaca gtgatattga tttgctggtt acggtgaccg taaggcttga 3300
tgaaacaacg cggcgagctt tgatcaacga ccttttggaa acttcggctt cccctggaga 3360
gagcgagatt ctccgcgctg tagaagtcac cattgttgtg cacgacgaca tcattccgtg 3420
gcgttatcca gctaagcgcg aactgcaatt tggagaatgg cagcgcaatg acattcttgc 3480
aggtatcttc gagccagcca cgatcgacat tgatctggct atcttgctga caaaagcaag 3540
agaacatagc gttgccttgg taggtccagc ggcggaggaa ctctttgatc cggttcctga 3600
acaggatcta tttgaggcgc taaatgaaac cttaacgcta tggaactcgc cgcccgactg 3660
ggctggcgat gagcgaaatg tagtgcttac gttgtcccgc atttggtaca gcgcagtaac 3720
cggcaaaatc gcgccgaagg atgtcgctgc cgactgggca atggagcgcc tgccggccca 3780
gtatcagccc gtcatacttg aagctagaca ggcttatctt ggacaagaag aagatcgctt 3840
ggcctcgcgc gcagatcagt tggaagaatt tgtccactac gtgaaaggcg agatcaccaa 3900
ggtagtcggc aaataaccct cgagccaccc atgaccaaaa tcccttaacg tgagttacgc 3960
gtcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 4020
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 4080
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 4140
gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt 4200
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 4260
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 4320
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 4380
gagataccta cagcgtgagc attgagaaag cgccacgctt cccgaaggga gaaaggcgga 4440
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 4500
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 4560
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 4620
acggttcctg gccttttgct ggccttttgc tcacatgtt 4659
<210> 67
<211> 4405
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 67
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgccgccac catggtgaaa gtgattggcc ggcgatcact 720
cggcgtccag aggattttcg atatcggcct cccgcaggac cacaattttc tgctcgctaa 780
tggtgctatt gcagctaact gagaccgtcg acctgcagac tggctgtgta taagggagcc 840
tgacatttat attccccaga acatcaggtt aatggcgttt ttgatgtcat tttcgcggtg 900
gctgagatca gccacttctt ccccgataac ggagaccggc acactggcca tatcggtggt 960
catcatgcgc cagctttcat ccccgatatg caccaccggg taaagttcac gggagacttt 1020
atctgacagc agacgtgcac tggccagggg gatcaccatc cgtcgcccgg gcgtgtcaat 1080
aatatcactc tgtacatcca caaacagacg ataacggctc tctcttttat aggtgtaaac 1140
cttaaactgc atttcaccag cccctgttct cgtcagcaaa agagccgttc atttcaataa 1200
accgggcgac ctcagccatc ccttcctgat tttccgcttt ccagcgttcg gcacgcagac 1260
gacgggcttc attctgcatg gttgtgctta ccagaccgga gatattgaca tcatatatgc 1320
cttgagcaac tgatagctgt cgctgtcaac tgtcactgta atacgctgct tcatagcata 1380
cctctttttg acatacttcg ggtatacata tcagtatata ttcttatacc gcaaaaatca 1440
gcgcgcaaat acgcatactg ttatctggct tttagtaagc cggatccaga tctttacgcc 1500
ccgccctgcc actcatcgca gtactgttgt aattcattaa gcattctgcc gacatggaag 1560
ccatcacaaa cggcatgatg aacctgaatc gccagcggca tcagcacctt gtcgccttgc 1620
gtataatatt tgcccatggt gaaaacgggg gcgaagaagt tgtccatatt ggccacgttt 1680
aaatcaaaac tggtgaaact cacccaggga ttggctgaga cgaaaaacat attctcaata 1740
aaccctttag ggaaataggc caggttttca ccgtaacacg ccacatcttg cgaatatatg 1800
tgtagaaact gccggaaatc gtcgtggtat tcactccaga gcgatgaaaa cgtttcagtt 1860
tgctcatgga aaacggtgta acaagggtga acactatccc atatcaccag ctcaccgtct 1920
ttcattgcca tacggaattc cggatgagca ttcatcaggc gggcaagaat gtgaataaag 1980
gccggataaa acttgtgctt atttttcttt acggtcttta aaaaggccgt aatatccagc 2040
tgaacggtct ggttataggt acattgagca actgactgaa atgcctcaaa atgttcttta 2100
cgatgccatt gggatatatc aacggtggta tatccagtga tttttttctc cattttagct 2160
tccttagctc ctgaaaatct cgacggatcc taactcaaaa tccacacatt atacgagccg 2220
gaagcataaa gtgtaaagcc tggggtgcct aatgcggccg cggtctcatt aattaagaat 2280
tcgacccagc tttcttgtac aaagttggca ttataaaaaa taattgctca tcaatttgtt 2340
gcaacgaaca ggtcactatc agtcaaaata aaatcattat ttgccatcca gctgatatcc 2400
cctatagtga gtcgtattac atggtcatag ctgtttcctg gcagctctgg cccgtgtctc 2460
aaaatctctg atgttacatt gcacaagata aaaatatatc atcatgcctc ctctagacca 2520
gccaggacag aaatgcctcg acttcgctgc tgcccaaggt tgccgggtga cgcacaccgt 2580
ggaaacggat gaaggcacga acccagtgga cataagcctg ttcggttcgt aagctgtaat 2640
gcaagtagcg tatgcgctca cgcaactggt ccagaacctt gaccgaacgc agcggtggta 2700
acggcgcagt ggcggttttc atggcttgtt atgactgttt ttttggggta cagtctatgc 2760
ctcgggcatc caagcagcaa gcgcgttacg ccgtgggtcg atgtttgatg ttatggagca 2820
gcaacgatgt tacgcagcag ggcagtcgcc ctaaaacaaa gttaaacatc atgagggaag 2880
cggtgatcgc cgaagtatcg actcaactat cagaggtagt tggcgtcatc gagcgccatc 2940
tcgaaccgac gttgctggcc gtacatttgt acggctccgc agtggatggc ggcctgaagc 3000
cacacagtga tattgatttg ctggttacgg tgaccgtaag gcttgatgaa acaacgcggc 3060
gagctttgat caacgacctt ttggaaactt cggcttcccc tggagagagc gagattctcc 3120
gcgctgtaga agtcaccatt gttgtgcacg acgacatcat tccgtggcgt tatccagcta 3180
agcgcgaact gcaatttgga gaatggcagc gcaatgacat tcttgcaggt atcttcgagc 3240
cagccacgat cgacattgat ctggctatct tgctgacaaa agcaagagaa catagcgttg 3300
ccttggtagg tccagcggcg gaggaactct ttgatccggt tcctgaacag gatctatttg 3360
aggcgctaaa tgaaacctta acgctatgga actcgccgcc cgactgggct ggcgatgagc 3420
gaaatgtagt gcttacgttg tcccgcattt ggtacagcgc agtaaccggc aaaatcgcgc 3480
cgaaggatgt cgctgccgac tgggcaatgg agcgcctgcc ggcccagtat cagcccgtca 3540
tacttgaagc tagacaggct tatcttggac aagaagaaga tcgcttggcc tcgcgcgcag 3600
atcagttgga agaatttgtc cactacgtga aaggcgagat caccaaggta gtcggcaaat 3660
aaccctcgag ccacccatga ccaaaatccc ttaacgtgag ttacgcgtcg ttccactgag 3720
cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 3780
tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 3840
agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 3900
tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 3960
acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 4020
ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 4080
gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 4140
gtgagcattg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 4200
gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 4260
tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 4320
caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 4380
tttgctggcc ttttgctcac atgtt 4405
<210> 68
<211> 4607
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 68
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgagacctc gacctgcaga ctggctgtgt ataagggagc 720
ctgacattta tattccccag aacatcaggt taatggcgtt tttgatgtca ttttcgcggt 780
ggctgagatc agccacttct tccccgataa cggagaccgg cacactggcc atatcggtgg 840
tcatcatgcg ccagctttca tccccgatat gcaccaccgg gtaaagttca cgggagactt 900
tatctgacag cagacgtgca ctggccaggg ggatcaccat ccgtcgcccg ggcgtgtcaa 960
taatatcact ctgtacatcc acaaacagac gataacggct ctctctttta taggtgtaaa 1020
ccttaaactg catttcacca gcccctgttc tcgtcagcaa aagagccgtt catttcaata 1080
aaccgggcga cctcagccat cccttcctga ttttccgctt tccagcgttc ggcacgcaga 1140
cgacgggctt cattctgcat ggttgtgctt accagaccgg agatattgac atcatatatg 1200
ccttgagcaa ctgatagctg tcgctgtcaa ctgtcactgt aatacgctgc ttcatagcat 1260
acctcttttt gacatacttc gggtatacat atcagtatat attcttatac cgcaaaaatc 1320
agcgcgcaaa tacgcatact gttatctggc ttttagtaag ccggatccag atctttacgc 1380
cccgccctgc cactcatcgc agtactgttg taattcatta agcattctgc cgacatggaa 1440
gccatcacaa acggcatgat gaacctgaat cgccagcggc atcagcacct tgtcgccttg 1500
cgtataatat ttgcccatgg tgaaaacggg ggcgaagaag ttgtccatat tggccacgtt 1560
taaatcaaaa ctggtgaaac tcacccaggg attggctgag acgaaaaaca tattctcaat 1620
aaacccttta gggaaatagg ccaggttttc accgtaacac gccacatctt gcgaatatat 1680
gtgtagaaac tgccggaaat cgtcgtggta ttcactccag agcgatgaaa aggtttcagt 1740
ttgctcatgg aaaacggtgt aacaagggtg aacactatcc catatcacca gctcaccgtc 1800
tttcattgcc atacggaatt ccggatgagc attcatcagg cgggcaagaa tgtgaataaa 1860
ggccggataa aacttgtgct tatttttctt tacggtcttt aaaaaggccg taatatccag 1920
ctgaacggtc tggttatagg tacattgagc aactgactga aatgcctcaa aatgttcttt 1980
acgatgccat tgggatatat caacggtggt atatccagtg atttttttct ccattttagc 2040
ttccttagct cctgaaaatc tcgacggatc ctaactcaaa atccacacat tatacgagcc 2100
ggaagcataa agtgtaaagc ctggggtgcc taatgcggcc gcggtctcat gtatcagtgg 2160
cgactccctg atctcactcg caagcactgg aaagcgagtt agcatcaagg acttgctgga 2220
cgaaaaggat ttcgaaattt gggcaatcaa tgagcagacc atgaaactgg agtctgcaaa 2280
ggtgtcccgg gtgttttgca cgggtaagaa gcttgtttat atccttaaaa ctagactggg 2340
ccggacgatc aaagccaccg cgaaccacag attcttgaca atcgacgggt ggaaacggct 2400
ggacgaactg agcttgaagg agcacatcgc ccttcctcgg aagctcgagt catcttccct 2460
gcagctgtga ttaattaaga attcgaccca gctttcttgt acaaagttgg cattataaaa 2520
aataattgct catcaatttg ttgcaacgaa caggtcacta tcagtcaaaa taaaatcatt 2580
atttgccatc cagctgatat cccctatagt gagtcgtatt acatggtcat agctgtttcc 2640
tggcagctct ggcccgtgtc tcaaaatctc tgatgttaca ttgcacaaga taaaaatata 2700
tcatcatgcc tcctctagac cagccaggac agaaatgcct cgacttcgct gctgcccaag 2760
gttgccgggt gacgcacacc gtggaaacgg atgaaggcac gaacccagtg gacataagcc 2820
tgttcggttc gtaagctgta atgcaagtag cgtatgcgct cacgcaactg gtccagaacc 2880
ttgaccgaac gcagcggtgg taacggcgca gtggcggttt tcatggcttg ttatgactgt 2940
ttttttgggg tacagtctat gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt 3000
cgatgtttga tgttatggag cagcaacgat gttacgcagc agggcagtcg ccctaaaaca 3060
aagttaaaca tcatgaggga agcggtgatc gccgaagtat cgactcaact atcagaggta 3120
gttggcgtca tcgagcgcca tctcgaaccg acgttgctgg ccgtacattt gtacggctcc 3180
gcagtggatg gcggcctgaa gccacacagt gatattgatt tgctggttac ggtgaccgta 3240
aggcttgatg aaacaacgcg gcgagctttg atcaacgacc ttttggaaac ttcggcttcc 3300
cctggagaga gcgagattct ccgcgctgta gaagtcacca ttgttgtgca cgacgacatc 3360
attccgtggc gttatccagc taagcgcgaa ctgcaatttg gagaatggca gcgcaatgac 3420
attcttgcag gtatcttcga gccagccacg atcgacattg atctggctat cttgctgaca 3480
aaagcaagag aacatagcgt tgccttggta ggtccagcgg cggaggaact ctttgatccg 3540
gttcctgaac aggatctatt tgaggcgcta aatgaaacct taacgctatg gaactcgccg 3600
cccgactggg ctggcgatga gcgaaatgta gtgcttacgt tgtcccgcat ttggtacagc 3660
gcagtaaccg gcaaaatcgc gccgaaggat gtcgctgccg actgggcaat ggagcgcctg 3720
ccggcccagt atcagcccgt catacttgaa gctagacagg cttatcttgg acaagaagaa 3780
gatcgcttgg cctcgcgcgc agatcagttg gaagaatttg tccactacgt gaaaggcgag 3840
atcaccaagg tagtcggcaa ataaccctcg agccacccat gaccaaaatc ccttaacgtg 3900
agttacgcgt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 3960
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 4020
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 4080
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 4140
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 4200
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 4260
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 4320
accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga 4380
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 4440
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 4500
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 4560
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgtt 4607
<210> 69
<211> 4444
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 69
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgccgccac catgagtccc gaaatcgaaa agctctctca 720
gagcgatata tattgggact ccatcgtaag cataacagag acgggggtcg aggaggtgtt 780
cgatctgaca gttcctgggc ctcataattt cgtagcgaac gacatcattg tacataactg 840
agaccgtcga cctgcagact ggctgtgtat aagggagcct gacatttata ttccccagaa 900
catcaggtta atggcgtttt tgatgtcatt ttcgcggtgg ctgagatcag ccacttcttc 960
cccgataacg gagaccggca cactggccat atcggtggtc atcatgcgcc agctttcatc 1020
cccgatatgc accaccgggt aaagttcacg ggagacttta tctgacagca gacgtgcact 1080
ggccaggggg atcaccatcc gtcgcccggg cgtgtcaata atatcactct gtacatccac 1140
aaacagacga taacggctct ctcttttata ggtgtaaacc ttaaactgca tttcaccagc 1200
ccctgttctc gtcagcaaaa gagccgttca tttcaataaa ccgggcgacc tcagccatcc 1260
cttcctgatt ttccgctttc cagcgttcgg cacgcagacg acgggcttca ttctgcatgg 1320
ttgtgcttac cagaccggag atattgacat catatatgcc ttgagcaact gatagctgtc 1380
gctgtcaact gtcactgtaa tacgctgctt catagcatac ctctttttga catacttcgg 1440
gtatacatat cagtatatat tcttataccg caaaaatcag cgcgcaaata cgcatactgt 1500
tatctggctt ttagtaagcc ggatccagat ctttacgccc cgccctgcca ctcatcgcag 1560
tactgttgta attcattaag cattctgccg acatggaagc catcacaaac ggcatgatga 1620
acctgaatcg ccagcggcat cagcaccttg tcgccttgcg tataatattt gcccatggtg 1680
aaaacggggg cgaagaagtt gtccatattg gccacgttta aatcaaaact ggtgaaactc 1740
acccagggat tggctgagac gaaaaacata ttctcaataa accctttagg gaaataggcc 1800
aggttttcac cgtaacacgc cacatcttgc gaatatatgt gtagaaactg ccggaaatcg 1860
tcgtggtatt cactccagag cgatgaaaag gtttcagttt gctcatggaa aacggtgtaa 1920
caagggtgaa cactatccca tatcaccagc tcaccgtctt tcattgccat acggaattcc 1980
ggatgagcat tcatcaggcg ggcaagaatg tgaataaagg ccggataaaa cttgtgctta 2040
tttttcttta cggtctttaa aaaggccgta atatccagct gaacggtctg gttataggta 2100
cattgagcaa ctgactgaaa tgcctcaaaa tgttctttac gatgccattg ggatatatca 2160
acggtggtat atccagtgat ttttttctcc attttagctt ccttagctcc tgaaaatctc 2220
gacggatcct aactcaaaat ccacacatta tacgagccgg aagcataaag tgtaaagcct 2280
ggggtgccta atgcggccgc ggtctcatta attaagaatt cgacccagct ttcttgtaca 2340
aagttggcat tataaaaaat aattgctcat caatttgttg caacgaacag gtcactatca 2400
gtcaaaataa aatcattatt tgccatccag ctgatatccc ctatagtgag tcgtattaca 2460
tggtcatagc tgtttcctgg cagctctggc ccgtgtctca aaatctctga tgttacattg 2520
cacaagataa aaatatatca tcatgcctcc tctagaccag ccaggacaga aatgcctcga 2580
cttcgctgct gcccaaggtt gccgggtgac gcacaccgtg gaaacggatg aaggcacgaa 2640
cccagtggac ataagcctgt tcggttcgta agctgtaatg caagtagcgt atgcgctcac 2700
gcaactggtc cagaaccttg accgaacgca gcggtggtaa cggcgcagtg gcggttttca 2760
tggcttgtta tgactgtttt tttggggtac agtctatgcc tcgggcatcc aagcagcaag 2820
cgcgttacgc cgtgggtcga tgtttgatgt tatggagcag caacgatgtt acgcagcagg 2880
gcagtcgccc taaaacaaag ttaaacatca tgagggaagc ggtgatcgcc gaagtatcga 2940
ctcaactatc agaggtagtt ggcgtcatcg agcgccatct cgaaccgacg ttgctggccg 3000
tacatttgta cggctccgca gtggatggcg gcctgaagcc acacagtgat attgatttgc 3060
tggttacggt gaccgtaagg cttgatgaaa caacgcggcg agctttgatc aacgaccttt 3120
tggaaacttc ggcttcccct ggagagagcg agattctccg cgctgtagaa gtcaccattg 3180
ttgtgcacga cgacatcatt ccgtggcgtt atccagctaa gcgcgaactg caatttggag 3240
aatggcagcg caatgacatt cttgcaggta tcttcgagcc agccacgatc gacattgatc 3300
tggctatctt gctgacaaaa gcaagagaac atagcgttgc cttggtaggt ccagcggcgg 3360
aggaactctt tgatccggtt cctgaacagg atctatttga ggcgctaaat gaaaccttaa 3420
cgctatggaa ctcgccgccc gactgggctg gcgatgagcg aaatgtagtg cttacgttgt 3480
cccgcatttg gtacagcgca gtaaccggca aaatcgcgcc gaaggatgtc gctgccgact 3540
gggcaatgga gcgcctgccg gcccagtatc agcccgtcat acttgaagct agacaggctt 3600
atcttggaca agaagaagat cgcttggcct cgcgcgcaga tcagttggaa gaatttgtcc 3660
actacgtgaa aggcgagatc accaaggtag tcggcaaata accctcgagc cacccatgac 3720
caaaatccct taacgtgagt tacgcgtcgt tccactgagc gtcagacccc gtagaaaaga 3780
tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa 3840
aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga 3900
aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt 3960
taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt 4020
taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat 4080
agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct 4140
tggagcgaac gacctacacc gaactgagat acctacagcg tgagcattga gaaagcgcca 4200
cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 4260
agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 4320
gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 4380
aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca 4440
tgtt 4444
<210> 70
<211> 9204
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 70
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
tgcctttcat acgagaccga gatcctgact gtcgagtacg gattgcttcc tatcggcaaa 840
atcgtggaga agaggattga atgtaccgtc tattcagtcg ataataatgg gaacatctac 900
acacagcccg tggctcaatg gcacgacaga ggagagcagg aagtttttga atactgtctc 960
gaggacggat ccctcatccg cgctactaaa gatcataagt ttatgaccgt ggacggccag 1020
atgctgccaa ttgacgaaat ttttgaacga gagctggatc tgatgagagt cgacaacctt 1080
ccaaacggtg gaggggggtc aggctctgcg cagctggaaa aggagcttca agccctcgaa 1140
aaaaagttgg cccagctcga gtgggagaac caggctctgg agaaagaact ggcccagtga 1200
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 1260
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1320
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1380
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1440
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1500
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1560
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 1620
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 1680
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 1740
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 1800
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 1860
tcctttgaaa aacacgataa taccatggcc atgagcgagc tgattaagga gaacatgcac 1920
atgaagctgt acatggaggg caccgtggac aaccatcact tcaagtgcac atccgagggc 1980
gaaggcaagc cctacgaggg cacccagacc atgagaatca aggtggtcga gggcggccct 2040
ctccccttcg ccttcgacat cctggctact agcttcctct acggcagcaa gaccttcatc 2100
aaccacaccc agggcatccc cgacttcttc aagcagtcct tccctgaggg cttcacatgg 2160
gagagagtca ccacatacga agacgggggc gtgctgaccg ctacccagga caccagcctc 2220
caggacggct gcctcatcta caacgtcaag atcagagggg tgaacttcac atccaacggc 2280
cctgtgatgc agaagaaaac actcggctgg gaggccttca ccgagacgct gtaccccgct 2340
gacggcggcc tggaaggcag aaacgacatg gccctgaagc tcgtgggcgg gagccatctg 2400
atcgcaaaca tcaagaccac atatagatcc aagaaacccg ctaagaacct caagatgcct 2460
ggcgtctact atgtggacta cagactggaa agaatcaagg aggccaacaa cgagacctac 2520
gtcgagcagc acgaggtggc agtggccaga tactgcgacc tccctagcaa actggggcac 2580
aagcttaatt aacaccggtg gcgcgttaag tcgacaatca acctctggat tacaaaattt 2640
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 2700
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 2760
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 2820
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 2880
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 2940
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 3000
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 3060
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 3120
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 3180
tctccctttg ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt 3240
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 3300
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 3360
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 3420
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 3480
cttttagtca gtgtggaaaa tctctagcag tacgtatagt agttcatgtc atcttattat 3540
tcagtattta taacttgcaa agaaatgaat atcagagagt gagaggaact tgtttattgc 3600
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 3660
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggct 3720
ctagctatcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat 3780
tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc 3840
tctgagctat tccagaagta gtgaggaggc ttttttggag gcctagggac gtacccaatt 3900
cgccctatag tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 3960
gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 4020
ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg 4080
gcgaatggga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4140
gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct 4200
ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt 4260
tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac 4320
gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct 4380
ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4440
ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac 4500
aaaaatttaa cgcgaatttt aacaaaatat taacgcttac aatttaggtg gcacttttcg 4560
gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc 4620
gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag 4680
tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt 4740
tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 4800
gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga 4860
acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat 4920
tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga 4980
gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag 5040
tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg 5100
accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg 5160
ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5220
agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg 5280
gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc 5340
ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg 5400
tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac 5460
ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact 5520
gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa 5580
acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa 5640
aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg 5700
atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc 5760
gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac 5820
tggcttcagc agagcgcaga taccaaatac tgttcttcta gtgtagccgt agttaggcca 5880
ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt 5940
ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc 6000
ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 6060
aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6120
cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6180
gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct 6240
ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 6300
cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt 6360
tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac 6420
cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6480
cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga 6540
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac 6600
tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt 6660
gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca agcgcgcaat 6720
taaccctcac taaagggaac aaaagctgga gctgcaagct taatgtagtc ttatgcaata 6780
ctcttgtagt cttgcaacat ggtaacgatg agttagcaac atgccttaca aggagagaaa 6840
aagcaccgtg catgccgatt ggtggaagta aggtggtacg atcgtgcctt attaggaagg 6900
caacagacgg gtctgacatg gattggacga accactgaat tgccgcattg cagagatatt 6960
gtatttaagt gcctagctcg atacataaac gggtctctct ggttagacca gatctgagcc 7020
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 7080
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 7140
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 7200
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 7260
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 7320
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 7380
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 7440
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 7500
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 7560
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 7620
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 7680
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 7740
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 7800
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 7860
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 7920
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 7980
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 8040
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 8100
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 8160
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 8220
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 8280
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 8340
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 8400
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 8460
ccgaggggac ccttgcgcct tttccaaggc agccctgggt ttgcgcaggg acgcggctgc 8520
tctgggcgtg gttccgggaa acgcagcggc gccgaccctg ggtctcgcac attcttcacg 8580
tccgttcgca gcgtcacccg gatcttcgcc gctacccttg tgggcccccc ggcgacgctt 8640
cctgctccgc ccctaagtcg ggaaggttcc ttgcggttcg cggcgtgccg gacgtgacaa 8700
acggaagccg cacgtctcac tagtaccctc gcagacggac agcgccaggg agcaatggca 8760
gcgcgccgac cgcgatgggc tgtggccaat agcggctgct cagcagggcg cgccgagagc 8820
agcggccggg aaggggcggt gcgggaggcg gggtgtgggg cggtagtgtg ggccctgttc 8880
ctgcccgcgc ggtgttccgc attctgcaag cctccggagc gcacgtcggc agtcggctcc 8940
ctcgttgacc gaatcaccga cctctctccc cagggggtac ccagctgtct agagaattct 9000
agatcttgag acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg 9060
ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat 9120
tacaaaaaca aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc 9180
cactttggcg ccggctcgag gggg 9204
<210> 71
<211> 185
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 71
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Cys Leu
35 40 45
Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile
50 55 60
Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp
65 70 75 80
Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg
85 90 95
Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile
100 105 110
Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu
115 120 125
Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp
130 135 140
Asn Leu Pro Asn Gly Gly Gly Gly Ser Gly Ser Ala Gln Leu Glu Lys
145 150 155 160
Glu Leu Gln Ala Leu Glu Lys Lys Leu Ala Gln Leu Glu Trp Glu Asn
165 170 175
Gln Ala Leu Glu Lys Glu Leu Ala Gln
180 185
<210> 72
<211> 9444
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 72
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcctg aaggtgacca agggtggccc cctgcccttc 900
tcctgggaca tcctgtcccc tcagttcatg tacggctcca gggccttcac caagcacccc 960
gccgacatcc ccgactacta taagcagtcc ttccccgagg gcttcaagtg ggagcgcgtg 1020
atgaacttcg aggacggcgg cgccgtgacc gtgacccagg acacctccct ggaggacggc 1080
accctgatct acaaggtgaa gctccgcggc accaacttcc ctcctgacgg ccccgtaatg 1140
cagaagaaga caatgggctg ggaagcgtcc accgagcggt tgtaccccga ggacggcgtg 1200
ctgaagggcg acattaagat ggccctgcgc ctgaaggacg gcggccgcta cctggcggac 1260
ttcaagacca cctacaaggc caagaagccc gtgcagatgc ccggcgccta caacgtcgac 1320
cgcaagttgg acatcacctc ccacaacgag gactacaccg tggtggaaca gtacgaacgc 1380
tccgagggcc gccactccac cggcggcatg gacgagctgt acaagtgatt aattaagaat 1440
tcgacccagc tttcttgtac aaagtggttg gtaagcctat ccctaaccct ctcctcggtc 1500
tcgattctac gtagtaatga gctagcagtc tcgaggttaa cgaattccgc ccccccccta 1560
acgttactgg ccgaagccgc ttggaataag gccggtgtgc gcttgtctat atgttatttt 1620
ccaccatatt gccgtctttt ggcaatgtga gggcccggaa acctggccct gtcttcttga 1680
cgagcattcc taggggtctt tcccctctcg ccaaaggaat gcaaggtctg ttgaatgtcg 1740
tgaaggaagc agttcctctg gaagcttctt gaagacaaac aacgtctgta gcgacccttt 1800
gcaggcagcg gaacccccca cctggcgaca ggtgcccctg cggccaaaag ccacgtgtat 1860
aagatacacc tgcaaaggcg gcacaacccc agtgccacgt tgtgagttgg atagttgtgg 1920
aaagagtcaa atggctctcc tcaagcgtat tcaacaaggg gctgaaggat gcccagaagg 1980
taccccattg tatgggatct gatctggggc ctcggtgcac atgctttaca tgtgtttagt 2040
cgaggttaaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 2100
cacgataata ccatggtgag caagggcgag gagctgttca ccggggtggt gcccatcctg 2160
gtcgagctgg acggcgacgt aaacggccac aagttcagcg tgtccggcga gggcgagggc 2220
gatgccacct acggcaagct gaccctgaag ttcatctgca ccaccggcaa gctgcccgtg 2280
ccctggccca ccctcgtgac caccctgacc tacggcgtgc agtgcttcag ccgctacccc 2340
gaccacatga agcagcacga cttcttcaag tccgccatgc ccgaaggcta cgtccaggag 2400
cgcaccatct tcttcaagga cgacggcaac tacaagaccc gcgccgaggt gaagttcgag 2460
ggcgacaccc tggtgaaccg catcgagctg aagggcatcg acttcaagga ggacggcaac 2520
atcctggggc acaagctgga gtacaactac aacagccaca acgtctatat catggccgac 2580
aagcagaaga acggcatcaa ggtgaacttc aagatccgcc acaacatcga ggacggcagc 2640
gtgcagctcg ccgaccacta ccagcagaac acccccatcg gcgacggccc cgtgctgctg 2700
cccgacaacc actacctgag cacccagtcc gccctgagca aagaccccaa cgagaagcgc 2760
gatcacatgg tcctgctgga gttcgtgacc gccgccggga tcactctcgg catggacgag 2820
ctgtacaagt aacaccggtg gcgcgttaag tcgacaatca acctctggat tacaaaattt 2880
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 2940
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 3000
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 3060
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 3120
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 3180
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 3240
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 3300
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 3360
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 3420
tctccctttg ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt 3480
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 3540
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 3600
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 3660
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 3720
cttttagtca gtgtggaaaa tctctagcag tacgtatagt agttcatgtc atcttattat 3780
tcagtattta taacttgcaa agaaatgaat atcagagagt gagaggaact tgtttattgc 3840
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 3900
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggct 3960
ctagctatcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat 4020
tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc 4080
tctgagctat tccagaagta gtgaggaggc ttttttggag gcctagggac gtacccaatt 4140
cgccctatag tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 4200
gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 4260
ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg 4320
gcgaatggga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4380
gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct 4440
ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt 4500
tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac 4560
gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct 4620
ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4680
ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac 4740
aaaaatttaa cgcgaatttt aacaaaatat taacgcttac aatttaggtg gcacttttcg 4800
gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc 4860
gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag 4920
tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt 4980
tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 5040
gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga 5100
acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat 5160
tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga 5220
gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag 5280
tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg 5340
accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg 5400
ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5460
agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg 5520
gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc 5580
ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg 5640
tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac 5700
ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact 5760
gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa 5820
acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa 5880
aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg 5940
atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc 6000
gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac 6060
tggcttcagc agagcgcaga taccaaatac tgttcttcta gtgtagccgt agttaggcca 6120
ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt 6180
ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc 6240
ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 6300
aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6360
cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6420
gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct 6480
ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 6540
cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt 6600
tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac 6660
cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6720
cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga 6780
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac 6840
tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt 6900
gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca agcgcgcaat 6960
taaccctcac taaagggaac aaaagctgga gctgcaagct taatgtagtc ttatgcaata 7020
ctcttgtagt cttgcaacat ggtaacgatg agttagcaac atgccttaca aggagagaaa 7080
aagcaccgtg catgccgatt ggtggaagta aggtggtacg atcgtgcctt attaggaagg 7140
caacagacgg gtctgacatg gattggacga accactgaat tgccgcattg cagagatatt 7200
gtatttaagt gcctagctcg atacataaac gggtctctct ggttagacca gatctgagcc 7260
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 7320
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 7380
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 7440
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 7500
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 7560
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 7620
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 7680
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 7740
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 7800
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 7860
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 7920
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 7980
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 8040
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 8100
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 8160
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 8220
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 8280
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 8340
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 8400
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 8460
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 8520
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 8580
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 8640
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 8700
ccgaggggac ccttgcgcct tttccaaggc agccctgggt ttgcgcaggg acgcggctgc 8760
tctgggcgtg gttccgggaa acgcagcggc gccgaccctg ggtctcgcac attcttcacg 8820
tccgttcgca gcgtcacccg gatcttcgcc gctacccttg tgggcccccc ggcgacgctt 8880
cctgctccgc ccctaagtcg ggaaggttcc ttgcggttcg cggcgtgccg gacgtgacaa 8940
acggaagccg cacgtctcac tagtaccctc gcagacggac agcgccaggg agcaatggca 9000
gcgcgccgac cgcgatgggc tgtggccaat agcggctgct cagcagggcg cgccgagagc 9060
agcggccggg aaggggcggt gcgggaggcg gggtgtgggg cggtagtgtg ggccctgttc 9120
ctgcccgcgc ggtgttccgc attctgcaag cctccggagc gcacgtcggc agtcggctcc 9180
ctcgttgacc gaatcaccga cctctctccc cagggggtac ccagctgtct agagaattct 9240
agatcttgag acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg 9300
ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat 9360
tacaaaaaca aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc 9420
cactttggcg ccggctcgag gggg 9444
<210> 73
<211> 261
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 73
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Leu Lys Val Thr Lys
65 70 75 80
Gly Gly Pro Leu Pro Phe Ser Trp Asp Ile Leu Ser Pro Gln Phe Met
85 90 95
Tyr Gly Ser Arg Ala Phe Thr Lys His Pro Ala Asp Ile Pro Asp Tyr
100 105 110
Tyr Lys Gln Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn
115 120 125
Phe Glu Asp Gly Gly Ala Val Thr Val Thr Gln Asp Thr Ser Leu Glu
130 135 140
Asp Gly Thr Leu Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro
145 150 155 160
Pro Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser
165 170 175
Thr Glu Arg Leu Tyr Pro Glu Asp Gly Val Leu Lys Gly Asp Ile Lys
180 185 190
Met Ala Leu Arg Leu Lys Asp Gly Gly Arg Tyr Leu Ala Asp Phe Lys
195 200 205
Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Met Pro Gly Ala Tyr Asn
210 215 220
Val Asp Arg Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Val
225 230 235 240
Val Glu Gln Tyr Glu Arg Ser Glu Gly Arg His Ser Thr Gly Gly Met
245 250 255
Asp Glu Leu Tyr Lys
260
<210> 74
<211> 9210
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 74
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
ctgaagtgcc tttcatacga gaccgagatc ctgactgtcg agtacggatt gcttcctatc 840
ggcaaaatcg tggagaagag gattgaatgt accgtctatt cagtcgataa taatgggaac 900
atctacacac agcccgtggc tcaatggcac gacagaggag agcaggaagt ttttgaatac 960
tgtctcgagg acggatccct catccgcgct actaaagatc ataagtttat gaccgtggac 1020
ggccagatgc tgccaattga cgaaattttt gaacgagagc tggatctgat gagagtcgac 1080
aaccttccaa acggtggagg ggggtcaggc tctgcgcagc tggaaaagga gcttcaagcc 1140
ctcgaaaaaa agttggccca gctcgagtgg gagaaccagg ctctggagaa agaactggcc 1200
cagtgattaa ttaagaattc gacccagctt tcttgtacaa agtggttggt aagcctatcc 1260
ctaaccctct cctcggtctc gattctacgt agtaatgagc tagcagtctc gaggttaacg 1320
aattccgccc cccccctaac gttactggcc gaagccgctt ggaataaggc cggtgtgcgc 1380
ttgtctatat gttattttcc accatattgc cgtcttttgg caatgtgagg gcccggaaac 1440
ctggccctgt cttcttgacg agcattccta ggggtctttc ccctctcgcc aaaggaatgc 1500
aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga agcttcttga agacaaacaa 1560
cgtctgtagc gaccctttgc aggcagcgga accccccacc tggcgacagg tgcccctgcg 1620
gccaaaagcc acgtgtataa gatacacctg caaaggcggc acaaccccag tgccacgttg 1680
tgagttggat agttgtggaa agagtcaaat ggctctcctc aagcgtattc aacaaggggc 1740
tgaaggatgc ccagaaggta ccccattgta tgggatctga tctggggcct cggtgcacat 1800
gctttacatg tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac cacggggacg 1860
tggttttcct ttgaaaaaca cgataatacc atggccatga gcgagctgat taaggagaac 1920
atgcacatga agctgtacat ggagggcacc gtggacaacc atcacttcaa gtgcacatcc 1980
gagggcgaag gcaagcccta cgagggcacc cagaccatga gaatcaaggt ggtcgagggc 2040
ggccctctcc ccttcgcctt cgacatcctg gctactagct tcctctacgg cagcaagacc 2100
ttcatcaacc acacccaggg catccccgac ttcttcaagc agtccttccc tgagggcttc 2160
acatgggaga gagtcaccac atacgaagac gggggcgtgc tgaccgctac ccaggacacc 2220
agcctccagg acggctgcct catctacaac gtcaagatca gaggggtgaa cttcacatcc 2280
aacggccctg tgatgcagaa gaaaacactc ggctgggagg ccttcaccga gacgctgtac 2340
cccgctgacg gcggcctgga aggcagaaac gacatggccc tgaagctcgt gggcgggagc 2400
catctgatcg caaacatcaa gaccacatat agatccaaga aacccgctaa gaacctcaag 2460
atgcctggcg tctactatgt ggactacaga ctggaaagaa tcaaggaggc caacaacgag 2520
acctacgtcg agcagcacga ggtggcagtg gccagatact gcgacctccc tagcaaactg 2580
gggcacaagc ttaattaaca ccggtggcgc gttaagtcga caatcaacct ctggattaca 2640
aaatttgtga aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat 2700
acgctgcttt aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct 2760
ccttgtataa atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac 2820
gtggcgtggt gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca 2880
cctgtcagct cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca 2940
tcgccgcctg ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg 3000
tggtgttgtc ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga 3060
ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt 3120
cccgcggcct gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga 3180
gtcggatctc cctttgggcc gcctccccgc gtcgacttta agaccaatga cttacaaggc 3240
agctgtagat cttagccact ttttaaaaga aaagggggga ctggaagggc taattcactc 3300
ccaacgaaga caagatctgc tttttgcttg tactgggtct ctctggttag accagatctg 3360
agcctgggag ctctctggct aactagggaa cccactgctt aagcctcaat aaagcttgcc 3420
ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac tctggtaact agagatccct 3480
cagacccttt tagtcagtgt ggaaaatctc tagcagtacg tatagtagtt catgtcatct 3540
tattattcag tatttataac ttgcaaagaa atgaatatca gagagtgaga ggaacttgtt 3600
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 3660
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 3720
ctggctctag ctatcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc 3780
gcccattctc cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc 3840
tcggcctctg agctattcca gaagtagtga ggaggctttt ttggaggcct agggacgtac 3900
ccaattcgcc ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc 3960
gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg 4020
ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc 4080
tgaatggcga atgggacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta 4140
cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc 4200
cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt 4260
tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg 4320
gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca 4380
cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct 4440
attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga 4500
tttaacaaaa atttaacgcg aattttaaca aaatattaac gcttacaatt taggtggcac 4560
ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat 4620
gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag 4680
tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc 4740
tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc 4800
acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc 4860
cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc 4920
ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt 4980
ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt 5040
atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat 5100
cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct 5160
tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat 5220
gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc 5280
ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg 5340
ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc 5400
tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta 5460
cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc 5520
ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga 5580
tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat 5640
gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat 5700
caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa 5760
accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa 5820
ggtaactggc ttcagcagag cgcagatacc aaatactgtt cttctagtgt agccgtagtt 5880
aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt 5940
accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata 6000
gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt 6060
ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac 6120
gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga 6180
gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg 6240
ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa 6300
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat 6360
gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc 6420
tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga 6480
agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg 6540
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta 6600
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg 6660
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagcg 6720
cgcaattaac cctcactaaa gggaacaaaa gctggagctg caagcttaat gtagtcttat 6780
gcaatactct tgtagtcttg caacatggta acgatgagtt agcaacatgc cttacaagga 6840
gagaaaaagc accgtgcatg ccgattggtg gaagtaaggt ggtacgatcg tgccttatta 6900
ggaaggcaac agacgggtct gacatggatt ggacgaacca ctgaattgcc gcattgcaga 6960
gatattgtat ttaagtgcct agctcgatac ataaacgggt ctctctggtt agaccagatc 7020
tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca ataaagcttg 7080
ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc 7140
ctcagaccct tttagtcagt gtggaaaatc tctagcagtg gcgcccgaac agggacttga 7200
aagcgaaagg gaaaccagag gagctctctc gacgcaggac tcggcttgct gaagcgcgca 7260
cggcaagagg cgaggggcgg cgactggtga gtacgccaaa aattttgact agcggaggct 7320
agaaggagag agatgggtgc gagagcgtca gtattaagcg ggggagaatt agatcgcgat 7380
gggaaaaaat tcggttaagg ccagggggaa agaaaaaata taaattaaaa catatagtat 7440
gggcaagcag ggagctagaa cgattcgcag ttaatcctgg cctgttagaa acatcagaag 7500
gctgtagaca aatactggga cagctacaac catcccttca gacaggatca gaagaactta 7560
gatcattata taatacagta gcaaccctct attgtgtgca tcaaaggata gagataaaag 7620
acaccaagga agctttagac aagatagagg aagagcaaaa caaaagtaag accaccgcac 7680
agcaagcggc cgctgatctt cagacctgga ggaggagata tgagggacaa ttggagaagt 7740
gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 7800
aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg 7860
ttcttgggag cagcaggaag cactatgggc gcagcgtcaa tgacgctgac ggtacaggcc 7920
agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg 7980
caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg 8040
gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa 8100
ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag 8160
atttggaatc acacgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta 8220
atacactcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg 8280
gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat 8340
ataaaattat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta 8400
ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc 8460
ccaaccccga ggggaccctt gcgccttttc caaggcagcc ctgggtttgc gcagggacgc 8520
ggctgctctg ggcgtggttc cgggaaacgc agcggcgccg accctgggtc tcgcacattc 8580
ttcacgtccg ttcgcagcgt cacccggatc ttcgccgcta cccttgtggg ccccccggcg 8640
acgcttcctg ctccgcccct aagtcgggaa ggttccttgc ggttcgcggc gtgccggacg 8700
tgacaaacgg aagccgcacg tctcactagt accctcgcag acggacagcg ccagggagca 8760
atggcagcgc gccgaccgcg atgggctgtg gccaatagcg gctgctcagc agggcgcgcc 8820
gagagcagcg gccgggaagg ggcggtgcgg gaggcggggt gtggggcggt agtgtgggcc 8880
ctgttcctgc ccgcgcggtg ttccgcattc tgcaagcctc cggagcgcac gtcggcagtc 8940
ggctccctcg ttgaccgaat caccgacctc tctccccagg gggtacccag ctgtctagag 9000
aattctagat cttgagacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga 9060
ttggggggta cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta 9120
aagaattaca aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca 9180
gagatccact ttggcgccgg ctcgaggggg 9210
<210> 75
<211> 187
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 75
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu
50 55 60
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser
65 70 75 80
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His
85 90 95
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser
100 105 110
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln
115 120 125
Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg
130 135 140
Val Asp Asn Leu Pro Asn Gly Gly Gly Gly Ser Gly Ser Ala Gln Leu
145 150 155 160
Glu Lys Glu Leu Gln Ala Leu Glu Lys Lys Leu Ala Gln Leu Glu Trp
165 170 175
Glu Asn Gln Ala Leu Glu Lys Glu Leu Ala Gln
180 185
<210> 76
<211> 9438
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 76
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcgtg accaagggtg gccccctgcc cttctcctgg 900
gacatcctgt cccctcagtt catgtacggc tccagggcct tcaccaagca ccccgccgac 960
atccccgact actataagca gtccttcccc gagggcttca agtgggagcg cgtgatgaac 1020
ttcgaggacg gcggcgccgt gaccgtgacc caggacacct ccctggagga cggcaccctg 1080
atctacaagg tgaagctccg cggcaccaac ttccctcctg acggccccgt aatgcagaag 1140
aagacaatgg gctgggaagc gtccaccgag cggttgtacc ccgaggacgg cgtgctgaag 1200
ggcgacatta agatggccct gcgcctgaag gacggcggcc gctacctggc ggacttcaag 1260
accacctaca aggccaagaa gcccgtgcag atgcccggcg cctacaacgt cgaccgcaag 1320
ttggacatca cctcccacaa cgaggactac accgtggtgg aacagtacga acgctccgag 1380
ggccgccact ccaccggcgg catggacgag ctgtacaagt gattaattaa gaattcgacc 1440
cagctttctt gtacaaagtg gttggtaagc ctatccctaa ccctctcctc ggtctcgatt 1500
ctacgtagta atgagctagc agtctcgagg ttaacgaatt ccgccccccc cctaacgtta 1560
ctggccgaag ccgcttggaa taaggccggt gtgcgcttgt ctatatgtta ttttccacca 1620
tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 1680
ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 1740
aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 1800
agcggaaccc cccacctggc gacaggtgcc cctgcggcca aaagccacgt gtataagata 1860
cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 1920
tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 1980
attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 2040
taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat 2100
aataccatgg tgagcaaggg cgaggagctg ttcaccgggg tggtgcccat cctggtcgag 2160
ctggacggcg acgtaaacgg ccacaagttc agcgtgtccg gcgagggcga gggcgatgcc 2220
acctacggca agctgaccct gaagttcatc tgcaccaccg gcaagctgcc cgtgccctgg 2280
cccaccctcg tgaccaccct gacctacggc gtgcagtgct tcagccgcta ccccgaccac 2340
atgaagcagc acgacttctt caagtccgcc atgcccgaag gctacgtcca ggagcgcacc 2400
atcttcttca aggacgacgg caactacaag acccgcgccg aggtgaagtt cgagggcgac 2460
accctggtga accgcatcga gctgaagggc atcgacttca aggaggacgg caacatcctg 2520
gggcacaagc tggagtacaa ctacaacagc cacaacgtct atatcatggc cgacaagcag 2580
aagaacggca tcaaggtgaa cttcaagatc cgccacaaca tcgaggacgg cagcgtgcag 2640
ctcgccgacc actaccagca gaacaccccc atcggcgacg gccccgtgct gctgcccgac 2700
aaccactacc tgagcaccca gtccgccctg agcaaagacc ccaacgagaa gcgcgatcac 2760
atggtcctgc tggagttcgt gaccgccgcc gggatcactc tcggcatgga cgagctgtac 2820
aagtaacacc ggtggcgcgt taagtcgaca atcaacctct ggattacaaa atttgtgaaa 2880
gattgactgg tattcttaac tatgttgctc cttttacgct atgtggatac gctgctttaa 2940
tgcctttgta tcatgctatt gcttcccgta tggctttcat tttctcctcc ttgtataaat 3000
cctggttgct gtctctttat gaggagttgt ggcccgttgt caggcaacgt ggcgtggtgt 3060
gcactgtgtt tgctgacgca acccccactg gttggggcat tgccaccacc tgtcagctcc 3120
tttccgggac tttcgctttc cccctcccta ttgccacggc ggaactcatc gccgcctgcc 3180
ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg gtgttgtcgg 3240
ggaaatcatc gtcctttcct tggctgctcg cctgtgttgc cacctggatt ctgcgcggga 3300
cgtccttctg ctacgtccct tcggccctca atccagcgga ccttccttcc cgcggcctgc 3360
tgccggctct gcggcctctt ccgcgtcttc gccttcgccc tcagacgagt cggatctccc 3420
tttgggccgc ctccccgcgt cgactttaag accaatgact tacaaggcag ctgtagatct 3480
tagccacttt ttaaaagaaa aggggggact ggaagggcta attcactccc aacgaagaca 3540
agatctgctt tttgcttgta ctgggtctct ctggttagac cagatctgag cctgggagct 3600
ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca 3660
agtagtgtgt gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta 3720
gtcagtgtgg aaaatctcta gcagtacgta tagtagttca tgtcatctta ttattcagta 3780
tttataactt gcaaagaaat gaatatcaga gagtgagagg aacttgttta ttgcagctta 3840
taatggttac aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact 3900
gcattctagt tgtggtttgt ccaaactcat caatgtatct tatcatgtct ggctctagct 3960
atcccgcccc taactccgcc catcccgccc ctaactccgc ccagttccgc ccattctccg 4020
ccccatggct gactaatttt ttttatttat gcagaggccg aggccgcctc ggcctctgag 4080
ctattccaga agtagtgagg aggctttttt ggaggcctag ggacgtaccc aattcgccct 4140
atagtgagtc gtattacgcg cgctcactgg ccgtcgtttt acaacgtcgt gactgggaaa 4200
accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta 4260
atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 4320
gggacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 4380
ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 4440
ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat 4500
ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 4560
ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 4620
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 4680
tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 4740
ttaacgcgaa ttttaacaaa atattaacgc ttacaattta ggtggcactt ttcggggaaa 4800
tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat 4860
gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca 4920
acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca 4980
cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta 5040
catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt 5100
tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtattgacgc 5160
cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc 5220
accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc 5280
cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa 5340
ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga 5400
accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgtagcaat 5460
ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca 5520
attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc 5580
ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat 5640
tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag 5700
tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa 5760
gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca 5820
tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc 5880
ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc 5940
ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc 6000
agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt 6060
cagcagagcg cagataccaa atactgttct tctagtgtag ccgtagttag gccaccactt 6120
caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc 6180
tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa 6240
ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac 6300
ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg 6360
gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga 6420
gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact 6480
tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa 6540
cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc 6600
gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg 6660
ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcccaat 6720
acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc acgacaggtt 6780
tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc tcactcatta 6840
ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg 6900
ataacaattt cacacaggaa acagctatga ccatgattac gccaagcgcg caattaaccc 6960
tcactaaagg gaacaaaagc tggagctgca agcttaatgt agtcttatgc aatactcttg 7020
tagtcttgca acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac 7080
cgtgcatgcc gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag 7140
acgggtctga catggattgg acgaaccact gaattgccgc attgcagaga tattgtattt 7200
aagtgcctag ctcgatacat aaacgggtct ctctggttag accagatctg agcctgggag 7260
ctctctggct aactagggaa cccactgctt aagcctcaat aaagcttgcc ttgagtgctt 7320
caagtagtgt gtgcccgtct gttgtgtgac tctggtaact agagatccct cagacccttt 7380
tagtcagtgt ggaaaatctc tagcagtggc gcccgaacag ggacttgaaa gcgaaaggga 7440
aaccagagga gctctctcga cgcaggactc ggcttgctga agcgcgcacg gcaagaggcg 7500
aggggcggcg actggtgagt acgccaaaaa ttttgactag cggaggctag aaggagagag 7560
atgggtgcga gagcgtcagt attaagcggg ggagaattag atcgcgatgg gaaaaaattc 7620
ggttaaggcc agggggaaag aaaaaatata aattaaaaca tatagtatgg gcaagcaggg 7680
agctagaacg attcgcagtt aatcctggcc tgttagaaac atcagaaggc tgtagacaaa 7740
tactgggaca gctacaacca tcccttcaga caggatcaga agaacttaga tcattatata 7800
atacagtagc aaccctctat tgtgtgcatc aaaggataga gataaaagac accaaggaag 7860
ctttagacaa gatagaggaa gagcaaaaca aaagtaagac caccgcacag caagcggccg 7920
ctgatcttca gacctggagg aggagatatg agggacaatt ggagaagtga attatataaa 7980
tataaagtag taaaaattga accattagga gtagcaccca ccaaggcaaa gagaagagtg 8040
gtgcagagag aaaaaagagc agtgggaata ggagctttgt tccttgggtt cttgggagca 8100
gcaggaagca ctatgggcgc agcgtcaatg acgctgacgg tacaggccag acaattattg 8160
tctggtatag tgcagcagca gaacaatttg ctgagggcta ttgaggcgca acagcatctg 8220
ttgcaactca cagtctgggg catcaagcag ctccaggcaa gaatcctggc tgtggaaaga 8280
tacctaaagg atcaacagct cctggggatt tggggttgct ctggaaaact catttgcacc 8340
actgctgtgc cttggaatgc tagttggagt aataaatctc tggaacagat ttggaatcac 8400
acgacctgga tggagtggga cagagaaatt aacaattaca caagcttaat acactcctta 8460
attgaagaat cgcaaaacca gcaagaaaag aatgaacaag aattattgga attagataaa 8520
tgggcaagtt tgtggaattg gtttaacata acaaattggc tgtggtatat aaaattattc 8580
ataatgatag taggaggctt ggtaggttta agaatagttt ttgctgtact ttctatagtg 8640
aatagagtta ggcagggata ttcaccatta tcgtttcaga cccacctccc aaccccgagg 8700
ggacccttgc gccttttcca aggcagccct gggtttgcgc agggacgcgg ctgctctggg 8760
cgtggttccg ggaaacgcag cggcgccgac cctgggtctc gcacattctt cacgtccgtt 8820
cgcagcgtca cccggatctt cgccgctacc cttgtgggcc ccccggcgac gcttcctgct 8880
ccgcccctaa gtcgggaagg ttccttgcgg ttcgcggcgt gccggacgtg acaaacggaa 8940
gccgcacgtc tcactagtac cctcgcagac ggacagcgcc agggagcaat ggcagcgcgc 9000
cgaccgcgat gggctgtggc caatagcggc tgctcagcag ggcgcgccga gagcagcggc 9060
cgggaagggg cggtgcggga ggcggggtgt ggggcggtag tgtgggccct gttcctgccc 9120
gcgcggtgtt ccgcattctg caagcctccg gagcgcacgt cggcagtcgg ctccctcgtt 9180
gaccgaatca ccgacctctc tccccagggg gtacccagct gtctagagaa ttctagatct 9240
tgagacaaat ggcagtattc atccacaatt ttaaaagaaa aggggggatt ggggggtaca 9300
gtgcagggga aagaatagta gacataatag caacagacat acaaactaaa gaattacaaa 9360
aacaaattac aaaaattcaa aattttcggg tttattacag ggacagcaga gatccacttt 9420
ggcgccggct cgaggggg 9438
<210> 77
<211> 259
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 77
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Val Thr Lys Gly Gly
65 70 75 80
Pro Leu Pro Phe Ser Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly
85 90 95
Ser Arg Ala Phe Thr Lys His Pro Ala Asp Ile Pro Asp Tyr Tyr Lys
100 105 110
Gln Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu
115 120 125
Asp Gly Gly Ala Val Thr Val Thr Gln Asp Thr Ser Leu Glu Asp Gly
130 135 140
Thr Leu Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Pro Asp
145 150 155 160
Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Thr Glu
165 170 175
Arg Leu Tyr Pro Glu Asp Gly Val Leu Lys Gly Asp Ile Lys Met Ala
180 185 190
Leu Arg Leu Lys Asp Gly Gly Arg Tyr Leu Ala Asp Phe Lys Thr Thr
195 200 205
Tyr Lys Ala Lys Lys Pro Val Gln Met Pro Gly Ala Tyr Asn Val Asp
210 215 220
Arg Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Val Val Glu
225 230 235 240
Gln Tyr Glu Arg Ser Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu
245 250 255
Leu Tyr Lys
<210> 78
<211> 9219
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 78
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
ctgaaggtga ccaagtgcct ttcatacgag accgagatcc tgactgtcga gtacggattg 840
cttcctatcg gcaaaatcgt ggagaagagg attgaatgta ccgtctattc agtcgataat 900
aatgggaaca tctacacaca gcccgtggct caatggcacg acagaggaga gcaggaagtt 960
tttgaatact gtctcgagga cggatccctc atccgcgcta ctaaagatca taagtttatg 1020
accgtggacg gccagatgct gccaattgac gaaatttttg aacgagagct ggatctgatg 1080
agagtcgaca accttccaaa cggtggaggg gggtcaggct ctgcgcagct ggaaaaggag 1140
cttcaagccc tcgaaaaaaa gttggcccag ctcgagtggg agaaccaggc tctggagaaa 1200
gaactggccc agtgattaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1260
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1320
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1380
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1440
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1500
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1560
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1620
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1680
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 1740
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 1800
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 1860
acggggacgt ggttttcctt tgaaaaacac gataatacca tggccatgag cgagctgatt 1920
aaggagaaca tgcacatgaa gctgtacatg gagggcaccg tggacaacca tcacttcaag 1980
tgcacatccg agggcgaagg caagccctac gagggcaccc agaccatgag aatcaaggtg 2040
gtcgagggcg gccctctccc cttcgccttc gacatcctgg ctactagctt cctctacggc 2100
agcaagacct tcatcaacca cacccagggc atccccgact tcttcaagca gtccttccct 2160
gagggcttca catgggagag agtcaccaca tacgaagacg ggggcgtgct gaccgctacc 2220
caggacacca gcctccagga cggctgcctc atctacaacg tcaagatcag aggggtgaac 2280
ttcacatcca acggccctgt gatgcagaag aaaacactcg gctgggaggc cttcaccgag 2340
acgctgtacc ccgctgacgg cggcctggaa ggcagaaacg acatggccct gaagctcgtg 2400
ggcgggagcc atctgatcgc aaacatcaag accacatata gatccaagaa acccgctaag 2460
aacctcaaga tgcctggcgt ctactatgtg gactacagac tggaaagaat caaggaggcc 2520
aacaacgaga cctacgtcga gcagcacgag gtggcagtgg ccagatactg cgacctccct 2580
agcaaactgg ggcacaagct taattaacac cggtggcgcg ttaagtcgac aatcaacctc 2640
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 2700
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 2760
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 2820
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 2880
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 2940
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 3000
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 3060
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 3120
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 3180
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3240
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3300
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3360
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3420
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3480
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3540
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 3600
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 3660
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 3720
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 3780
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 3840
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 3900
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 3960
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4020
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4080
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 4140
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 4200
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4260
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4320
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4380
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4440
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4500
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4560
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 4620
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 4680
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 4740
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 4800
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 4860
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 4920
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 4980
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 5040
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 5100
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 5160
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 5220
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5280
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5340
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5400
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5460
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5520
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 5580
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 5640
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 5700
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 5760
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 5820
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 5880
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 5940
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 6000
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 6060
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 6120
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 6180
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6240
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6300
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6360
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6420
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6480
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6540
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6600
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6660
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6720
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 6780
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 6840
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 6900
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 6960
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 7020
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 7080
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 7140
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 7200
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7260
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7320
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7380
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7440
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7500
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7560
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 7620
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 7680
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 7740
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 7800
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 7860
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 7920
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 7980
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 8040
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 8100
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 8160
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 8220
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8280
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8340
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8400
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8460
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8520
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 8580
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 8640
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 8700
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 8760
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 8820
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 8880
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 8940
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 9000
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 9060
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 9120
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 9180
gggacagcag agatccactt tggcgccggc tcgaggggg 9219
<210> 79
<211> 190
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 79
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Val Thr Lys Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr
50 55 60
Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr
65 70 75 80
Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala
85 90 95
Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu
100 105 110
Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val
115 120 125
Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp
130 135 140
Leu Met Arg Val Asp Asn Leu Pro Asn Gly Gly Gly Gly Ser Gly Ser
145 150 155 160
Ala Gln Leu Glu Lys Glu Leu Gln Ala Leu Glu Lys Lys Leu Ala Gln
165 170 175
Leu Glu Trp Glu Asn Gln Ala Leu Glu Lys Glu Leu Ala Gln
180 185 190
<210> 80
<211> 9429
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 80
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcggt ggccccctgc ccttctcctg ggacatcctg 900
tcccctcagt tcatgtacgg ctccagggcc ttcaccaagc accccgccga catccccgac 960
tactataagc agtccttccc cgagggcttc aagtgggagc gcgtgatgaa cttcgaggac 1020
ggcggcgccg tgaccgtgac ccaggacacc tccctggagg acggcaccct gatctacaag 1080
gtgaagctcc gcggcaccaa cttccctcct gacggccccg taatgcagaa gaagacaatg 1140
ggctgggaag cgtccaccga gcggttgtac cccgaggacg gcgtgctgaa gggcgacatt 1200
aagatggccc tgcgcctgaa ggacggcggc cgctacctgg cggacttcaa gaccacctac 1260
aaggccaaga agcccgtgca gatgcccggc gcctacaacg tcgaccgcaa gttggacatc 1320
acctcccaca acgaggacta caccgtggtg gaacagtacg aacgctccga gggccgccac 1380
tccaccggcg gcatggacga gctgtacaag tgattaatta agaattcgac ccagctttct 1440
tgtacaaagt ggttggtaag cctatcccta accctctcct cggtctcgat tctacgtagt 1500
aatgagctag cagtctcgag gttaacgaat tccgcccccc ccctaacgtt actggccgaa 1560
gccgcttgga ataaggccgg tgtgcgcttg tctatatgtt attttccacc atattgccgt 1620
cttttggcaa tgtgagggcc cggaaacctg gccctgtctt cttgacgagc attcctaggg 1680
gtctttcccc tctcgccaaa ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc 1740
ctctggaagc ttcttgaaga caaacaacgt ctgtagcgac cctttgcagg cagcggaacc 1800
ccccacctgg cgacaggtgc ccctgcggcc aaaagccacg tgtataagat acacctgcaa 1860
aggcggcaca accccagtgc cacgttgtga gttggatagt tgtggaaaga gtcaaatggc 1920
tctcctcaag cgtattcaac aaggggctga aggatgccca gaaggtaccc cattgtatgg 1980
gatctgatct ggggcctcgg tgcacatgct ttacatgtgt ttagtcgagg ttaaaaaaac 2040
gtctaggccc cccgaaccac ggggacgtgg ttttcctttg aaaaacacga taataccatg 2100
gtgagcaagg gcgaggagct gttcaccggg gtggtgccca tcctggtcga gctggacggc 2160
gacgtaaacg gccacaagtt cagcgtgtcc ggcgagggcg agggcgatgc cacctacggc 2220
aagctgaccc tgaagttcat ctgcaccacc ggcaagctgc ccgtgccctg gcccaccctc 2280
gtgaccaccc tgacctacgg cgtgcagtgc ttcagccgct accccgacca catgaagcag 2340
cacgacttct tcaagtccgc catgcccgaa ggctacgtcc aggagcgcac catcttcttc 2400
aaggacgacg gcaactacaa gacccgcgcc gaggtgaagt tcgagggcga caccctggtg 2460
aaccgcatcg agctgaaggg catcgacttc aaggaggacg gcaacatcct ggggcacaag 2520
ctggagtaca actacaacag ccacaacgtc tatatcatgg ccgacaagca gaagaacggc 2580
atcaaggtga acttcaagat ccgccacaac atcgaggacg gcagcgtgca gctcgccgac 2640
cactaccagc agaacacccc catcggcgac ggccccgtgc tgctgcccga caaccactac 2700
ctgagcaccc agtccgccct gagcaaagac cccaacgaga agcgcgatca catggtcctg 2760
ctggagttcg tgaccgccgc cgggatcact ctcggcatgg acgagctgta caagtaacac 2820
cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 2880
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 2940
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 3000
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 3060
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 3120
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 3180
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 3240
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 3300
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 3360
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 3420
cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt 3480
tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac aagatctgct 3540
ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta 3600
actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg 3660
tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg 3720
gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt atttataact 3780
tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt ataatggtta 3840
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 3900
ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc tatcccgccc 3960
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 4020
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 4080
aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc tatagtgagt 4140
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 4200
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 4260
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc 4320
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 4380
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 4440
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 4500
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 4560
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 4620
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 4680
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 4740
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa atgtgcgcgg 4800
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 4860
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 4920
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 4980
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 5040
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 5100
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 5160
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 5220
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 5280
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 5340
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 5400
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac 5460
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga 5520
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg 5580
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact 5640
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 5700
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta 5760
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt 5820
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga 5880
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc 5940
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 6000
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc 6060
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc 6120
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg 6180
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg 6240
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 6300
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc 6360
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg 6420
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg 6480
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt 6540
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 6600
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg 6660
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc 6720
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 6780
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 6840
ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 6900
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc ctcactaaag 6960
ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt gtagtcttgc 7020
aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc 7080
cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg 7140
acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt taagtgccta 7200
gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga gctctctggc 7260
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 7320
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 7380
tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg aaaccagagg 7440
agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 7500
gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 7560
agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt cggttaaggc 7620
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 7680
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 7740
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 7800
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 7860
agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc gctgatcttc 7920
agacctggag gaggagatat gagggacaat tggagaagtg aattatataa atataaagta 7980
gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga 8040
gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc 8100
actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt gtctggtata 8160
gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc 8220
acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag atacctaaag 8280
gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg 8340
ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca cacgacctgg 8400
atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa 8460
tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt 8520
ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata 8580
gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt 8640
aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag gggacccttg 8700
cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg gcgtggttcc 8760
gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt tcgcagcgtc 8820
acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc tccgccccta 8880
agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga agccgcacgt 8940
ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg ccgaccgcga 9000
tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg ccgggaaggg 9060
gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc cgcgcggtgt 9120
tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt tgaccgaatc 9180
accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc ttgagacaaa 9240
tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac agtgcagggg 9300
aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa aaacaaatta 9360
caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt tggcgccggc 9420
tcgaggggg 9429
<210> 81
<211> 256
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 81
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Gly Gly Pro Leu Pro
65 70 75 80
Phe Ser Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Arg Ala
85 90 95
Phe Thr Lys His Pro Ala Asp Ile Pro Asp Tyr Tyr Lys Gln Ser Phe
100 105 110
Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly
115 120 125
Ala Val Thr Val Thr Gln Asp Thr Ser Leu Glu Asp Gly Thr Leu Ile
130 135 140
Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Pro Asp Gly Pro Val
145 150 155 160
Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Thr Glu Arg Leu Tyr
165 170 175
Pro Glu Asp Gly Val Leu Lys Gly Asp Ile Lys Met Ala Leu Arg Leu
180 185 190
Lys Asp Gly Gly Arg Tyr Leu Ala Asp Phe Lys Thr Thr Tyr Lys Ala
195 200 205
Lys Lys Pro Val Gln Met Pro Gly Ala Tyr Asn Val Asp Arg Lys Leu
210 215 220
Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Val Val Glu Gln Tyr Glu
225 230 235 240
Arg Ser Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys
245 250 255
<210> 82
<211> 9291
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 82
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
ctgaaggtga ccaagggtgg ccccctgccc ttctcctggg acatcctgtc ccctcagttc 840
atgtacggct ccagggcctt caccaagtgc ctttcatacg agaccgagat cctgactgtc 900
gagtacggat tgcttcctat cggcaaaatc gtggagaaga ggattgaatg taccgtctat 960
tcagtcgata ataatgggaa catctacaca cagcccgtgg ctcaatggca cgacagagga 1020
gagcaggaag tttttgaata ctgtctcgag gacggatccc tcatccgcgc tactaaagat 1080
cataagttta tgaccgtgga cggccagatg ctgccaattg acgaaatttt tgaacgagag 1140
ctggatctga tgagagtcga caaccttcca aacggtggag gggggtcagg ctctgcgcag 1200
ctggaaaagg agcttcaagc cctcgaaaaa aagttggccc agctcgagtg ggagaaccag 1260
gctctggaga aagaactggc ccagtgatta attaagaatt cgacccagct ttcttgtaca 1320
aagtggttgg taagcctatc cctaaccctc tcctcggtct cgattctacg tagtaatgag 1380
ctagcagtct cgaggttaac gaattccgcc ccccccctaa cgttactggc cgaagccgct 1440
tggaataagg ccggtgtgcg cttgtctata tgttattttc caccatattg ccgtcttttg 1500
gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct aggggtcttt 1560
cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca gttcctctgg 1620
aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg aaccccccac 1680
ctggcgacag gtgcccctgc ggccaaaagc cacgtgtata agatacacct gcaaaggcgg 1740
cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa tggctctcct 1800
caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt atgggatctg 1860
atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa aaacgtctag 1920
gccccccgaa ccacggggac gtggttttcc tttgaaaaac acgataatac catggccatg 1980
agcgagctga ttaaggagaa catgcacatg aagctgtaca tggagggcac cgtggacaac 2040
catcacttca agtgcacatc cgagggcgaa ggcaagccct acgagggcac ccagaccatg 2100
agaatcaagg tggtcgaggg cggccctctc cccttcgcct tcgacatcct ggctactagc 2160
ttcctctacg gcagcaagac cttcatcaac cacacccagg gcatccccga cttcttcaag 2220
cagtccttcc ctgagggctt cacatgggag agagtcacca catacgaaga cgggggcgtg 2280
ctgaccgcta cccaggacac cagcctccag gacggctgcc tcatctacaa cgtcaagatc 2340
agaggggtga acttcacatc caacggccct gtgatgcaga agaaaacact cggctgggag 2400
gccttcaccg agacgctgta ccccgctgac ggcggcctgg aaggcagaaa cgacatggcc 2460
ctgaagctcg tgggcgggag ccatctgatc gcaaacatca agaccacata tagatccaag 2520
aaacccgcta agaacctcaa gatgcctggc gtctactatg tggactacag actggaaaga 2580
atcaaggagg ccaacaacga gacctacgtc gagcagcacg aggtggcagt ggccagatac 2640
tgcgacctcc ctagcaaact ggggcacaag cttaattaac accggtggcg cgttaagtcg 2700
acaatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 2760
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 2820
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 2880
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 2940
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 3000
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 3060
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 3120
tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 3180
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 3240
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg cgtcgacttt 3300
aagaccaatg acttacaagg cagctgtaga tcttagccac tttttaaaag aaaagggggg 3360
actggaaggg ctaattcact cccaacgaag acaagatctg ctttttgctt gtactgggtc 3420
tctctggtta gaccagatct gagcctggga gctctctggc taactaggga acccactgct 3480
taagcctcaa taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga 3540
ctctggtaac tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtac 3600
gtatagtagt tcatgtcatc ttattattca gtatttataa cttgcaaaga aatgaatatc 3660
agagagtgag aggaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat 3720
cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact 3780
catcaatgta tcttatcatg tctggctcta gctatcccgc ccctaactcc gcccatcccg 3840
cccctaactc cgcccagttc cgcccattct ccgccccatg gctgactaat tttttttatt 3900
tatgcagagg ccgaggccgc ctcggcctct gagctattcc agaagtagtg aggaggcttt 3960
tttggaggcc tagggacgta cccaattcgc cctatagtga gtcgtattac gcgcgctcac 4020
tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc 4080
ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc 4140
cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa 4200
gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc 4260
ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag 4320
ctctaaatcg ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca 4380
aaaaacttga ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc 4440
gccctttgac gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa 4500
cactcaaccc tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct 4560
attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa 4620
cgcttacaat ttaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt 4680
tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca 4740
ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt 4800
ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga 4860
tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa 4920
gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct 4980
gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat 5040
acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga 5100
tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc 5160
caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat 5220
gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa 5280
cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac 5340
tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa 5400
agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc 5460
tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc 5520
ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag 5580
acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta 5640
ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa 5700
gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc 5760
gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat 5820
ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga 5880
gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt 5940
tcttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata 6000
cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac 6060
cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg 6120
ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg 6180
tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag 6240
cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct 6300
ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc 6360
aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt 6420
ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg 6480
tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga 6540
gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg 6600
gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg 6660
caacgcaatt aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct 6720
tccggctcgt atgttgtgtg gaattgtgag cggataacaa tttcacacag gaaacagcta 6780
tgaccatgat tacgccaagc gcgcaattaa ccctcactaa agggaacaaa agctggagct 6840
gcaagcttaa tgtagtctta tgcaatactc ttgtagtctt gcaacatggt aacgatgagt 6900
tagcaacatg ccttacaagg agagaaaaag caccgtgcat gccgattggt ggaagtaagg 6960
tggtacgatc gtgccttatt aggaaggcaa cagacgggtc tgacatggat tggacgaacc 7020
actgaattgc cgcattgcag agatattgta tttaagtgcc tagctcgata cataaacggg 7080
tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg 7140
cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt 7200
gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcagt 7260
ggcgcccgaa cagggacttg aaagcgaaag ggaaaccaga ggagctctct cgacgcagga 7320
ctcggcttgc tgaagcgcgc acggcaagag gcgaggggcg gcgactggtg agtacgccaa 7380
aaattttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc agtattaagc 7440
gggggagaat tagatcgcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 7500
ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg 7560
gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc 7620
agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc 7680
atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa 7740
acaaaagtaa gaccaccgca cagcaagcgg ccgctgatct tcagacctgg aggaggagat 7800
atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat tgaaccatta 7860
ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga gagaaaaaag agcagtggga 7920
ataggagctt tgttccttgg gttcttggga gcagcaggaa gcactatggg cgcagcgtca 7980
atgacgctga cggtacaggc cagacaatta ttgtctggta tagtgcagca gcagaacaat 8040
ttgctgaggg ctattgaggc gcaacagcat ctgttgcaac tcacagtctg gggcatcaag 8100
cagctccagg caagaatcct ggctgtggaa agatacctaa aggatcaaca gctcctgggg 8160
atttggggtt gctctggaaa actcatttgc accactgctg tgccttggaa tgctagttgg 8220
agtaataaat ctctggaaca gatttggaat cacacgacct ggatggagtg ggacagagaa 8280
attaacaatt acacaagctt aatacactcc ttaattgaag aatcgcaaaa ccagcaagaa 8340
aagaatgaac aagaattatt ggaattagat aaatgggcaa gtttgtggaa ttggtttaac 8400
ataacaaatt ggctgtggta tataaaatta ttcataatga tagtaggagg cttggtaggt 8460
ttaagaatag tttttgctgt actttctata gtgaatagag ttaggcaggg atattcacca 8520
ttatcgtttc agacccacct cccaaccccg aggggaccct tgcgcctttt ccaaggcagc 8580
cctgggtttg cgcagggacg cggctgctct gggcgtggtt ccgggaaacg cagcggcgcc 8640
gaccctgggt ctcgcacatt cttcacgtcc gttcgcagcg tcacccggat cttcgccgct 8700
acccttgtgg gccccccggc gacgcttcct gctccgcccc taagtcggga aggttccttg 8760
cggttcgcgg cgtgccggac gtgacaaacg gaagccgcac gtctcactag taccctcgca 8820
gacggacagc gccagggagc aatggcagcg cgccgaccgc gatgggctgt ggccaatagc 8880
ggctgctcag cagggcgcgc cgagagcagc ggccgggaag gggcggtgcg ggaggcgggg 8940
tgtggggcgg tagtgtgggc cctgttcctg cccgcgcggt gttccgcatt ctgcaagcct 9000
ccggagcgca cgtcggcagt cggctccctc gttgaccgaa tcaccgacct ctctccccag 9060
ggggtaccca gctgtctaga gaattctaga tcttgagaca aatggcagta ttcatccaca 9120
attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata gtagacataa 9180
tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt caaaattttc 9240
gggtttatta cagggacagc agagatccac tttggcgccg gctcgagggg g 9291
<210> 83
<211> 214
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 83
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Val Thr Lys Gly Gly Pro Leu Pro Phe Ser Trp Asp Ile Leu Ser Pro
50 55 60
Gln Phe Met Tyr Gly Ser Arg Ala Phe Thr Lys Cys Leu Ser Tyr Glu
65 70 75 80
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile
85 90 95
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly
100 105 110
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln
115 120 125
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr
130 135 140
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp
145 150 155 160
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro
165 170 175
Asn Gly Gly Gly Gly Ser Gly Ser Ala Gln Leu Glu Lys Glu Leu Gln
180 185 190
Ala Leu Glu Lys Lys Leu Ala Gln Leu Glu Trp Glu Asn Gln Ala Leu
195 200 205
Glu Lys Glu Leu Ala Gln
210
<210> 84
<211> 9357
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 84
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgccac cccgccgaca tccccgacta ctataagcag 900
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgccgtg 960
accgtgaccc aggacacctc cctggaggac ggcaccctga tctacaaggt gaagctccgc 1020
ggcaccaact tccctcctga cggccccgta atgcagaaga agacaatggg ctgggaagcg 1080
tccaccgagc ggttgtaccc cgaggacggc gtgctgaagg gcgacattaa gatggccctg 1140
cgcctgaagg acggcggccg ctacctggcg gacttcaaga ccacctacaa ggccaagaag 1200
cccgtgcaga tgcccggcgc ctacaacgtc gaccgcaagt tggacatcac ctcccacaac 1260
gaggactaca ccgtggtgga acagtacgaa cgctccgagg gccgccactc caccggcggc 1320
atggacgagc tgtacaagtg attaattaag aattcgaccc agctttcttg tacaaagtgg 1380
ttggtaagcc tatccctaac cctctcctcg gtctcgattc tacgtagtaa tgagctagca 1440
gtctcgaggt taacgaattc cgcccccccc ctaacgttac tggccgaagc cgcttggaat 1500
aaggccggtg tgcgcttgtc tatatgttat tttccaccat attgccgtct tttggcaatg 1560
tgagggcccg gaaacctggc cctgtcttct tgacgagcat tcctaggggt ctttcccctc 1620
tcgccaaagg aatgcaaggt ctgttgaatg tcgtgaagga agcagttcct ctggaagctt 1680
cttgaagaca aacaacgtct gtagcgaccc tttgcaggca gcggaacccc ccacctggcg 1740
acaggtgccc ctgcggccaa aagccacgtg tataagatac acctgcaaag gcggcacaac 1800
cccagtgcca cgttgtgagt tggatagttg tggaaagagt caaatggctc tcctcaagcg 1860
tattcaacaa ggggctgaag gatgcccaga aggtacccca ttgtatggga tctgatctgg 1920
ggcctcggtg cacatgcttt acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc 1980
cgaaccacgg ggacgtggtt ttcctttgaa aaacacgata ataccatggt gagcaagggc 2040
gaggagctgt tcaccggggt ggtgcccatc ctggtcgagc tggacggcga cgtaaacggc 2100
cacaagttca gcgtgtccgg cgagggcgag ggcgatgcca cctacggcaa gctgaccctg 2160
aagttcatct gcaccaccgg caagctgccc gtgccctggc ccaccctcgt gaccaccctg 2220
acctacggcg tgcagtgctt cagccgctac cccgaccaca tgaagcagca cgacttcttc 2280
aagtccgcca tgcccgaagg ctacgtccag gagcgcacca tcttcttcaa ggacgacggc 2340
aactacaaga cccgcgccga ggtgaagttc gagggcgaca ccctggtgaa ccgcatcgag 2400
ctgaagggca tcgacttcaa ggaggacggc aacatcctgg ggcacaagct ggagtacaac 2460
tacaacagcc acaacgtcta tatcatggcc gacaagcaga agaacggcat caaggtgaac 2520
ttcaagatcc gccacaacat cgaggacggc agcgtgcagc tcgccgacca ctaccagcag 2580
aacaccccca tcggcgacgg ccccgtgctg ctgcccgaca accactacct gagcacccag 2640
tccgccctga gcaaagaccc caacgagaag cgcgatcaca tggtcctgct ggagttcgtg 2700
accgccgccg ggatcactct cggcatggac gagctgtaca agtaacaccg gtggcgcgtt 2760
aagtcgacaa tcaacctctg gattacaaaa tttgtgaaag attgactggt attcttaact 2820
atgttgctcc ttttacgcta tgtggatacg ctgctttaat gcctttgtat catgctattg 2880
cttcccgtat ggctttcatt ttctcctcct tgtataaatc ctggttgctg tctctttatg 2940
aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt gctgacgcaa 3000
cccccactgg ttggggcatt gccaccacct gtcagctcct ttccgggact ttcgctttcc 3060
ccctccctat tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg 3120
ctcggctgtt gggcactgac aattccgtgg tgttgtcggg gaaatcatcg tcctttcctt 3180
ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac gtccttctgc tacgtccctt 3240
cggccctcaa tccagcggac cttccttccc gcggcctgct gccggctctg cggcctcttc 3300
cgcgtcttcg ccttcgccct cagacgagtc ggatctccct ttgggccgcc tccccgcgtc 3360
gactttaaga ccaatgactt acaaggcagc tgtagatctt agccactttt taaaagaaaa 3420
ggggggactg gaagggctaa ttcactccca acgaagacaa gatctgcttt ttgcttgtac 3480
tgggtctctc tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc 3540
actgcttaag cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt 3600
gtgtgactct ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag 3660
cagtacgtat agtagttcat gtcatcttat tattcagtat ttataacttg caaagaaatg 3720
aatatcagag agtgagagga acttgtttat tgcagcttat aatggttaca aataaagcaa 3780
tagcatcaca aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc 3840
caaactcatc aatgtatctt atcatgtctg gctctagcta tcccgcccct aactccgccc 3900
atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 3960
tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 4020
ggcttttttg gaggcctagg gacgtaccca attcgcccta tagtgagtcg tattacgcgc 4080
gctcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 4140
atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 4200
atcgcccttc ccaacagttg cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg 4260
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 4320
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc 4380
gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg 4440
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg 4500
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg 4560
gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt 4620
cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa 4680
tattaacgct tacaatttag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 4740
tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 4800
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 4860
tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 4920
aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 4980
cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 5040
agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg 5100
ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 5160
tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 5220
tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 5280
caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 5340
accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact 5400
attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc 5460
ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga 5520
taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg 5580
taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg 5640
aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca 5700
agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta 5760
ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca 5820
ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg 5880
cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga 5940
tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa 6000
tactgttctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc 6060
tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg 6120
tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 6180
ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct 6240
acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc 6300
ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg 6360
gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg 6420
ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct 6480
ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga 6540
taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 6600
cagcgagtca gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc 6660
gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga aagcgggcag 6720
tgagcgcaac gcaattaatg tgagttagct cactcattag gcaccccagg ctttacactt 6780
tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc acacaggaaa 6840
cagctatgac catgattacg ccaagcgcgc aattaaccct cactaaaggg aacaaaagct 6900
ggagctgcaa gcttaatgta gtcttatgca atactcttgt agtcttgcaa catggtaacg 6960
atgagttagc aacatgcctt acaaggagag aaaaagcacc gtgcatgccg attggtggaa 7020
gtaaggtggt acgatcgtgc cttattagga aggcaacaga cgggtctgac atggattgga 7080
cgaaccactg aattgccgca ttgcagagat attgtattta agtgcctagc tcgatacata 7140
aacgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta actagggaac 7200
ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg 7260
ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg gaaaatctct 7320
agcagtggcg cccgaacagg gacttgaaag cgaaagggaa accagaggag ctctctcgac 7380
gcaggactcg gcttgctgaa gcgcgcacgg caagaggcga ggggcggcga ctggtgagta 7440
cgccaaaaat tttgactagc ggaggctaga aggagagaga tgggtgcgag agcgtcagta 7500
ttaagcgggg gagaattaga tcgcgatggg aaaaaattcg gttaaggcca gggggaaaga 7560
aaaaatataa attaaaacat atagtatggg caagcaggga gctagaacga ttcgcagtta 7620
atcctggcct gttagaaaca tcagaaggct gtagacaaat actgggacag ctacaaccat 7680
cccttcagac aggatcagaa gaacttagat cattatataa tacagtagca accctctatt 7740
gtgtgcatca aaggatagag ataaaagaca ccaaggaagc tttagacaag atagaggaag 7800
agcaaaacaa aagtaagacc accgcacagc aagcggccgc tgatcttcag acctggagga 7860
ggagatatga gggacaattg gagaagtgaa ttatataaat ataaagtagt aaaaattgaa 7920
ccattaggag tagcacccac caaggcaaag agaagagtgg tgcagagaga aaaaagagca 7980
gtgggaatag gagctttgtt ccttgggttc ttgggagcag caggaagcac tatgggcgca 8040
gcgtcaatga cgctgacggt acaggccaga caattattgt ctggtatagt gcagcagcag 8100
aacaatttgc tgagggctat tgaggcgcaa cagcatctgt tgcaactcac agtctggggc 8160
atcaagcagc tccaggcaag aatcctggct gtggaaagat acctaaagga tcaacagctc 8220
ctggggattt ggggttgctc tggaaaactc atttgcacca ctgctgtgcc ttggaatgct 8280
agttggagta ataaatctct ggaacagatt tggaatcaca cgacctggat ggagtgggac 8340
agagaaatta acaattacac aagcttaata cactccttaa ttgaagaatc gcaaaaccag 8400
caagaaaaga atgaacaaga attattggaa ttagataaat gggcaagttt gtggaattgg 8460
tttaacataa caaattggct gtggtatata aaattattca taatgatagt aggaggcttg 8520
gtaggtttaa gaatagtttt tgctgtactt tctatagtga atagagttag gcagggatat 8580
tcaccattat cgtttcagac ccacctccca accccgaggg gacccttgcg ccttttccaa 8640
ggcagccctg ggtttgcgca gggacgcggc tgctctgggc gtggttccgg gaaacgcagc 8700
ggcgccgacc ctgggtctcg cacattcttc acgtccgttc gcagcgtcac ccggatcttc 8760
gccgctaccc ttgtgggccc cccggcgacg cttcctgctc cgcccctaag tcgggaaggt 8820
tccttgcggt tcgcggcgtg ccggacgtga caaacggaag ccgcacgtct cactagtacc 8880
ctcgcagacg gacagcgcca gggagcaatg gcagcgcgcc gaccgcgatg ggctgtggcc 8940
aatagcggct gctcagcagg gcgcgccgag agcagcggcc gggaaggggc ggtgcgggag 9000
gcggggtgtg gggcggtagt gtgggccctg ttcctgcccg cgcggtgttc cgcattctgc 9060
aagcctccgg agcgcacgtc ggcagtcggc tccctcgttg accgaatcac cgacctctct 9120
ccccaggggg tacccagctg tctagagaat tctagatctt gagacaaatg gcagtattca 9180
tccacaattt taaaagaaaa ggggggattg gggggtacag tgcaggggaa agaatagtag 9240
acataatagc aacagacata caaactaaag aattacaaaa acaaattaca aaaattcaaa 9300
attttcgggt ttattacagg gacagcagag atccactttg gcgccggctc gaggggg 9357
<210> 85
<211> 232
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 85
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys His Pro Ala Asp Ile
65 70 75 80
Pro Asp Tyr Tyr Lys Gln Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg
85 90 95
Val Met Asn Phe Glu Asp Gly Gly Ala Val Thr Val Thr Gln Asp Thr
100 105 110
Ser Leu Glu Asp Gly Thr Leu Ile Tyr Lys Val Lys Leu Arg Gly Thr
115 120 125
Asn Phe Pro Pro Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp
130 135 140
Glu Ala Ser Thr Glu Arg Leu Tyr Pro Glu Asp Gly Val Leu Lys Gly
145 150 155 160
Asp Ile Lys Met Ala Leu Arg Leu Lys Asp Gly Gly Arg Tyr Leu Ala
165 170 175
Asp Phe Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Met Pro Gly
180 185 190
Ala Tyr Asn Val Asp Arg Lys Leu Asp Ile Thr Ser His Asn Glu Asp
195 200 205
Tyr Thr Val Val Glu Gln Tyr Glu Arg Ser Glu Gly Arg His Ser Thr
210 215 220
Gly Gly Met Asp Glu Leu Tyr Lys
225 230
<210> 86
<211> 9432
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 86
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
ctgaaggtga ccaagggtgg ccccctgccc ttctcctggg acatcctgtc ccctcagttc 840
atgtacggct ccagggcctt caccaagcac cccgccgaca tccccgacta ctataagcag 900
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgccgtg 960
accgtgaccc aggacacctc cctggaggac ggcaccctga tctacaagtg cctttcatac 1020
gagaccgaga tcctgactgt cgagtacgga ttgcttccta tcggcaaaat cgtggagaag 1080
aggattgaat gtaccgtcta ttcagtcgat aataatggga acatctacac acagcccgtg 1140
gctcaatggc acgacagagg agagcaggaa gtttttgaat actgtctcga ggacggatcc 1200
ctcatccgcg ctactaaaga tcataagttt atgaccgtgg acggccagat gctgccaatt 1260
gacgaaattt ttgaacgaga gctggatctg atgagagtcg acaaccttcc aaacggtgga 1320
ggggggtcag gctctgcgca gctggaaaag gagcttcaag ccctcgaaaa aaagttggcc 1380
cagctcgagt gggagaacca ggctctggag aaagaactgg cccagtgatt aattaagaat 1440
tcgacccagc tttcttgtac aaagtggttg gtaagcctat ccctaaccct ctcctcggtc 1500
tcgattctac gtagtaatga gctagcagtc tcgaggttaa cgaattccgc ccccccccta 1560
acgttactgg ccgaagccgc ttggaataag gccggtgtgc gcttgtctat atgttatttt 1620
ccaccatatt gccgtctttt ggcaatgtga gggcccggaa acctggccct gtcttcttga 1680
cgagcattcc taggggtctt tcccctctcg ccaaaggaat gcaaggtctg ttgaatgtcg 1740
tgaaggaagc agttcctctg gaagcttctt gaagacaaac aacgtctgta gcgacccttt 1800
gcaggcagcg gaacccccca cctggcgaca ggtgcccctg cggccaaaag ccacgtgtat 1860
aagatacacc tgcaaaggcg gcacaacccc agtgccacgt tgtgagttgg atagttgtgg 1920
aaagagtcaa atggctctcc tcaagcgtat tcaacaaggg gctgaaggat gcccagaagg 1980
taccccattg tatgggatct gatctggggc ctcggtgcac atgctttaca tgtgtttagt 2040
cgaggttaaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 2100
cacgataata ccatggccat gagcgagctg attaaggaga acatgcacat gaagctgtac 2160
atggagggca ccgtggacaa ccatcacttc aagtgcacat ccgagggcga aggcaagccc 2220
tacgagggca cccagaccat gagaatcaag gtggtcgagg gcggccctct ccccttcgcc 2280
ttcgacatcc tggctactag cttcctctac ggcagcaaga ccttcatcaa ccacacccag 2340
ggcatccccg acttcttcaa gcagtccttc cctgagggct tcacatggga gagagtcacc 2400
acatacgaag acgggggcgt gctgaccgct acccaggaca ccagcctcca ggacggctgc 2460
ctcatctaca acgtcaagat cagaggggtg aacttcacat ccaacggccc tgtgatgcag 2520
aagaaaacac tcggctggga ggccttcacc gagacgctgt accccgctga cggcggcctg 2580
gaaggcagaa acgacatggc cctgaagctc gtgggcggga gccatctgat cgcaaacatc 2640
aagaccacat atagatccaa gaaacccgct aagaacctca agatgcctgg cgtctactat 2700
gtggactaca gactggaaag aatcaaggag gccaacaacg agacctacgt cgagcagcac 2760
gaggtggcag tggccagata ctgcgacctc cctagcaaac tggggcacaa gcttaattaa 2820
caccggtggc gcgttaagtc gacaatcaac ctctggatta caaaatttgt gaaagattga 2880
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2940
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 3000
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3060
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3120
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3180
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3240
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3300
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3360
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3420
ccgcctcccc gcgtcgactt taagaccaat gacttacaag gcagctgtag atcttagcca 3480
ctttttaaaa gaaaaggggg gactggaagg gctaattcac tcccaacgaa gacaagatct 3540
gctttttgct tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg 3600
ctaactaggg aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt 3660
gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt 3720
gtggaaaatc tctagcagta cgtatagtag ttcatgtcat cttattattc agtatttata 3780
acttgcaaag aaatgaatat cagagagtga gaggaacttg tttattgcag cttataatgg 3840
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 3900
tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctggctct agctatcccg 3960
cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat 4020
ggctgactaa ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc 4080
cagaagtagt gaggaggctt ttttggaggc ctagggacgt acccaattcg ccctatagtg 4140
agtcgtatta cgcgcgctca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 4200
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 4260
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatgggacg 4320
cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4380
cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4440
tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4500
ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat 4560
cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4620
tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag 4680
ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4740
cgaattttaa caaaatatta acgcttacaa tttaggtggc acttttcggg gaaatgtgcg 4800
cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4860
ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 4920
ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 4980
aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 5040
actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 5100
gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 5160
agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 5220
cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 5280
catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5340
aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5400
gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5460
aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5520
agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5580
ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5640
actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5700
aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5760
gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5820
atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 5880
tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 5940
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 6000
ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 6060
agcgcagata ccaaatactg ttcttctagt gtagccgtag ttaggccacc acttcaagaa 6120
ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 6180
tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 6240
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6300
cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa 6360
ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6420
agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6480
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6540
ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6600
ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6660
ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6720
accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6780
ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6840
ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 6900
atttcacaca ggaaacagct atgaccatga ttacgccaag cgcgcaatta accctcacta 6960
aagggaacaa aagctggagc tgcaagctta atgtagtctt atgcaatact cttgtagtct 7020
tgcaacatgg taacgatgag ttagcaacat gccttacaag gagagaaaaa gcaccgtgca 7080
tgccgattgg tggaagtaag gtggtacgat cgtgccttat taggaaggca acagacgggt 7140
ctgacatgga ttggacgaac cactgaattg ccgcattgca gagatattgt atttaagtgc 7200
ctagctcgat acataaacgg gtctctctgg ttagaccaga tctgagcctg ggagctctct 7260
ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt gcttcaagta 7320
gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc cttttagtca 7380
gtgtggaaaa tctctagcag tggcgcccga acagggactt gaaagcgaaa gggaaaccag 7440
aggagctctc tcgacgcagg actcggcttg ctgaagcgcg cacggcaaga ggcgaggggc 7500
ggcgactggt gagtacgcca aaaattttga ctagcggagg ctagaaggag agagatgggt 7560
gcgagagcgt cagtattaag cgggggagaa ttagatcgcg atgggaaaaa attcggttaa 7620
ggccaggggg aaagaaaaaa tataaattaa aacatatagt atgggcaagc agggagctag 7680
aacgattcgc agttaatcct ggcctgttag aaacatcaga aggctgtaga caaatactgg 7740
gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacag 7800
tagcaaccct ctattgtgtg catcaaagga tagagataaa agacaccaag gaagctttag 7860
acaagataga ggaagagcaa aacaaaagta agaccaccgc acagcaagcg gccgctgatc 7920
ttcagacctg gaggaggaga tatgagggac aattggagaa gtgaattata taaatataaa 7980
gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg caaagagaag agtggtgcag 8040
agagaaaaaa gagcagtggg aataggagct ttgttccttg ggttcttggg agcagcagga 8100
agcactatgg gcgcagcgtc aatgacgctg acggtacagg ccagacaatt attgtctggt 8160
atagtgcagc agcagaacaa tttgctgagg gctattgagg cgcaacagca tctgttgcaa 8220
ctcacagtct ggggcatcaa gcagctccag gcaagaatcc tggctgtgga aagataccta 8280
aaggatcaac agctcctggg gatttggggt tgctctggaa aactcatttg caccactgct 8340
gtgccttgga atgctagttg gagtaataaa tctctggaac agatttggaa tcacacgacc 8400
tggatggagt gggacagaga aattaacaat tacacaagct taatacactc cttaattgaa 8460
gaatcgcaaa accagcaaga aaagaatgaa caagaattat tggaattaga taaatgggca 8520
agtttgtgga attggtttaa cataacaaat tggctgtggt atataaaatt attcataatg 8580
atagtaggag gcttggtagg tttaagaata gtttttgctg tactttctat agtgaataga 8640
gttaggcagg gatattcacc attatcgttt cagacccacc tcccaacccc gaggggaccc 8700
ttgcgccttt tccaaggcag ccctgggttt gcgcagggac gcggctgctc tgggcgtggt 8760
tccgggaaac gcagcggcgc cgaccctggg tctcgcacat tcttcacgtc cgttcgcagc 8820
gtcacccgga tcttcgccgc tacccttgtg ggccccccgg cgacgcttcc tgctccgccc 8880
ctaagtcggg aaggttcctt gcggttcgcg gcgtgccgga cgtgacaaac ggaagccgca 8940
cgtctcacta gtaccctcgc agacggacag cgccagggag caatggcagc gcgccgaccg 9000
cgatgggctg tggccaatag cggctgctca gcagggcgcg ccgagagcag cggccgggaa 9060
ggggcggtgc gggaggcggg gtgtggggcg gtagtgtggg ccctgttcct gcccgcgcgg 9120
tgttccgcat tctgcaagcc tccggagcgc acgtcggcag tcggctccct cgttgaccga 9180
atcaccgacc tctctcccca gggggtaccc agctgtctag agaattctag atcttgagac 9240
aaatggcagt attcatccac aattttaaaa gaaaaggggg gattgggggg tacagtgcag 9300
gggaaagaat agtagacata atagcaacag acatacaaac taaagaatta caaaaacaaa 9360
ttacaaaaat tcaaaatttt cgggtttatt acagggacag cagagatcca ctttggcgcc 9420
ggctcgaggg gg 9432
<210> 87
<211> 261
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 87
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Val Thr Lys Gly Gly Pro Leu Pro Phe Ser Trp Asp Ile Leu Ser Pro
50 55 60
Gln Phe Met Tyr Gly Ser Arg Ala Phe Thr Lys His Pro Ala Asp Ile
65 70 75 80
Pro Asp Tyr Tyr Lys Gln Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg
85 90 95
Val Met Asn Phe Glu Asp Gly Gly Ala Val Thr Val Thr Gln Asp Thr
100 105 110
Ser Leu Glu Asp Gly Thr Leu Ile Tyr Lys Cys Leu Ser Tyr Glu Thr
115 120 125
Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val
130 135 140
Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn
145 150 155 160
Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu
165 170 175
Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys
180 185 190
Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu
195 200 205
Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
210 215 220
Gly Gly Gly Gly Ser Gly Ser Ala Gln Leu Glu Lys Glu Leu Gln Ala
225 230 235 240
Leu Glu Lys Lys Leu Ala Gln Leu Glu Trp Glu Asn Gln Ala Leu Glu
245 250 255
Lys Glu Leu Ala Gln
260
<210> 88
<211> 9216
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 88
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcgtg aagctccgcg gcaccaactt ccctcctgac 900
ggccccgtaa tgcagaagaa gacaatgggc tgggaagcgt ccaccgagcg gttgtacccc 960
gaggacggcg tgctgaaggg cgacattaag atggccctgc gcctgaagga cggcggccgc 1020
tacctggcgg acttcaagac cacctacaag gccaagaagc ccgtgcagat gcccggcgcc 1080
tacaacgtcg accgcaagtt ggacatcacc tcccacaacg aggactacac cgtggtggaa 1140
cagtacgaac gctccgaggg ccgccactcc accggcggca tggacgagct gtacaagtga 1200
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 1260
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1320
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1380
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1440
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1500
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1560
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 1620
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 1680
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 1740
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 1800
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 1860
tcctttgaaa aacacgataa taccatggtg agcaagggcg aggagctgtt caccggggtg 1920
gtgcccatcc tggtcgagct ggacggcgac gtaaacggcc acaagttcag cgtgtccggc 1980
gagggcgagg gcgatgccac ctacggcaag ctgaccctga agttcatctg caccaccggc 2040
aagctgcccg tgccctggcc caccctcgtg accaccctga cctacggcgt gcagtgcttc 2100
agccgctacc ccgaccacat gaagcagcac gacttcttca agtccgccat gcccgaaggc 2160
tacgtccagg agcgcaccat cttcttcaag gacgacggca actacaagac ccgcgccgag 2220
gtgaagttcg agggcgacac cctggtgaac cgcatcgagc tgaagggcat cgacttcaag 2280
gaggacggca acatcctggg gcacaagctg gagtacaact acaacagcca caacgtctat 2340
atcatggccg acaagcagaa gaacggcatc aaggtgaact tcaagatccg ccacaacatc 2400
gaggacggca gcgtgcagct cgccgaccac taccagcaga acacccccat cggcgacggc 2460
cccgtgctgc tgcccgacaa ccactacctg agcacccagt ccgccctgag caaagacccc 2520
aacgagaagc gcgatcacat ggtcctgctg gagttcgtga ccgccgccgg gatcactctc 2580
ggcatggacg agctgtacaa gtaacaccgg tggcgcgtta agtcgacaat caacctctgg 2640
attacaaaat ttgtgaaaga ttgactggta ttcttaacta tgttgctcct tttacgctat 2700
gtggatacgc tgctttaatg cctttgtatc atgctattgc ttcccgtatg gctttcattt 2760
tctcctcctt gtataaatcc tggttgctgt ctctttatga ggagttgtgg cccgttgtca 2820
ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac ccccactggt tggggcattg 2880
ccaccacctg tcagctcctt tccgggactt tcgctttccc cctccctatt gccacggcgg 2940
aactcatcgc cgcctgcctt gcccgctgct ggacaggggc tcggctgttg ggcactgaca 3000
attccgtggt gttgtcgggg aaatcatcgt cctttccttg gctgctcgcc tgtgttgcca 3060
cctggattct gcgcgggacg tccttctgct acgtcccttc ggccctcaat ccagcggacc 3120
ttccttcccg cggcctgctg ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc 3180
agacgagtcg gatctccctt tgggccgcct ccccgcgtcg actttaagac caatgactta 3240
caaggcagct gtagatctta gccacttttt aaaagaaaag gggggactgg aagggctaat 3300
tcactcccaa cgaagacaag atctgctttt tgcttgtact gggtctctct ggttagacca 3360
gatctgagcc tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag 3420
cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag 3480
atccctcaga cccttttagt cagtgtggaa aatctctagc agtacgtata gtagttcatg 3540
tcatcttatt attcagtatt tataacttgc aaagaaatga atatcagaga gtgagaggaa 3600
cttgtttatt gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa 3660
taaagcattt ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta 3720
tcatgtctgg ctctagctat cccgccccta actccgccca tcccgcccct aactccgccc 3780
agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 3840
gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggg 3900
acgtacccaa ttcgccctat agtgagtcgt attacgcgcg ctcactggcc gtcgttttac 3960
aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 4020
ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 4080
gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg 4140
tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt 4200
tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 4260
tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg 4320
gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 4380
agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 4440
cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg 4500
agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaatttagg 4560
tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 4620
aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 4680
gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 4740
ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 4800
gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 4860
tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 4920
attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 4980
tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 5040
agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 5100
aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 5160
tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 5220
cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac 5280
tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact 5340
tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg 5400
tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt 5460
tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat 5520
aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta 5580
gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa 5640
tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 5700
aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 5760
aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 5820
tccgaaggta actggcttca gcagagcgca gataccaaat actgttcttc tagtgtagcc 5880
gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 5940
cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 6000
acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 6060
cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag 6120
cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 6180
aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 6240
gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 6300
atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 6360
tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga 6420
gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 6480
agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga ttcattaatg 6540
cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg caattaatgt 6600
gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt 6660
gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc 6720
caagcgcgca attaaccctc actaaaggga acaaaagctg gagctgcaag cttaatgtag 6780
tcttatgcaa tactcttgta gtcttgcaac atggtaacga tgagttagca acatgcctta 6840
caaggagaga aaaagcaccg tgcatgccga ttggtggaag taaggtggta cgatcgtgcc 6900
ttattaggaa ggcaacagac gggtctgaca tggattggac gaaccactga attgccgcat 6960
tgcagagata ttgtatttaa gtgcctagct cgatacataa acgggtctct ctggttagac 7020
cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa 7080
agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc tggtaactag 7140
agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtggcgc ccgaacaggg 7200
acttgaaagc gaaagggaaa ccagaggagc tctctcgacg caggactcgg cttgctgaag 7260
cgcgcacggc aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg 7320
gaggctagaa ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat 7380
cgcgatggga aaaaattcgg ttaaggccag ggggaaagaa aaaatataaa ttaaaacata 7440
tagtatgggc aagcagggag ctagaacgat tcgcagttaa tcctggcctg ttagaaacat 7500
cagaaggctg tagacaaata ctgggacagc tacaaccatc ccttcagaca ggatcagaag 7560
aacttagatc attatataat acagtagcaa ccctctattg tgtgcatcaa aggatagaga 7620
taaaagacac caaggaagct ttagacaaga tagaggaaga gcaaaacaaa agtaagacca 7680
ccgcacagca agcggccgct gatcttcaga cctggaggag gagatatgag ggacaattgg 7740
agaagtgaat tatataaata taaagtagta aaaattgaac cattaggagt agcacccacc 7800
aaggcaaaga gaagagtggt gcagagagaa aaaagagcag tgggaatagg agctttgttc 7860
cttgggttct tgggagcagc aggaagcact atgggcgcag cgtcaatgac gctgacggta 7920
caggccagac aattattgtc tggtatagtg cagcagcaga acaatttgct gagggctatt 7980
gaggcgcaac agcatctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga 8040
atcctggctg tggaaagata cctaaaggat caacagctcc tggggatttg gggttgctct 8100
ggaaaactca tttgcaccac tgctgtgcct tggaatgcta gttggagtaa taaatctctg 8160
gaacagattt ggaatcacac gacctggatg gagtgggaca gagaaattaa caattacaca 8220
agcttaatac actccttaat tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa 8280
ttattggaat tagataaatg ggcaagtttg tggaattggt ttaacataac aaattggctg 8340
tggtatataa aattattcat aatgatagta ggaggcttgg taggtttaag aatagttttt 8400
gctgtacttt ctatagtgaa tagagttagg cagggatatt caccattatc gtttcagacc 8460
cacctcccaa ccccgagggg acccttgcgc cttttccaag gcagccctgg gtttgcgcag 8520
ggacgcggct gctctgggcg tggttccggg aaacgcagcg gcgccgaccc tgggtctcgc 8580
acattcttca cgtccgttcg cagcgtcacc cggatcttcg ccgctaccct tgtgggcccc 8640
ccggcgacgc ttcctgctcc gcccctaagt cgggaaggtt ccttgcggtt cgcggcgtgc 8700
cggacgtgac aaacggaagc cgcacgtctc actagtaccc tcgcagacgg acagcgccag 8760
ggagcaatgg cagcgcgccg accgcgatgg gctgtggcca atagcggctg ctcagcaggg 8820
cgcgccgaga gcagcggccg ggaaggggcg gtgcgggagg cggggtgtgg ggcggtagtg 8880
tgggccctgt tcctgcccgc gcggtgttcc gcattctgca agcctccgga gcgcacgtcg 8940
gcagtcggct ccctcgttga ccgaatcacc gacctctctc cccagggggt acccagctgt 9000
ctagagaatt ctagatcttg agacaaatgg cagtattcat ccacaatttt aaaagaaaag 9060
gggggattgg ggggtacagt gcaggggaaa gaatagtaga cataatagca acagacatac 9120
aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt tattacaggg 9180
acagcagaga tccactttgg cgccggctcg aggggg 9216
<210> 89
<211> 185
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 89
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Val Lys Leu Arg Gly
65 70 75 80
Thr Asn Phe Pro Pro Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly
85 90 95
Trp Glu Ala Ser Thr Glu Arg Leu Tyr Pro Glu Asp Gly Val Leu Lys
100 105 110
Gly Asp Ile Lys Met Ala Leu Arg Leu Lys Asp Gly Gly Arg Tyr Leu
115 120 125
Ala Asp Phe Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Met Pro
130 135 140
Gly Ala Tyr Asn Val Asp Arg Lys Leu Asp Ile Thr Ser His Asn Glu
145 150 155 160
Asp Tyr Thr Val Val Glu Gln Tyr Glu Arg Ser Glu Gly Arg His Ser
165 170 175
Thr Gly Gly Met Asp Glu Leu Tyr Lys
180 185
<210> 90
<211> 9486
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 90
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
ctgaaggtga ccaagggtgg ccccctgccc ttctcctggg acatcctgtc ccctcagttc 840
atgtacggct ccagggcctt caccaagcac cccgccgaca tccccgacta ctataagcag 900
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgccgtg 960
accgtgaccc aggacacctc cctggaggac ggcaccctga tctacaaggt gaagctccgc 1020
ggcaccaact tccctcctga cggccccgta atgcagaaga agtgcctttc atacgagacc 1080
gagatcctga ctgtcgagta cggattgctt cctatcggca aaatcgtgga gaagaggatt 1140
gaatgtaccg tctattcagt cgataataat gggaacatct acacacagcc cgtggctcaa 1200
tggcacgaca gaggagagca ggaagttttt gaatactgtc tcgaggacgg atccctcatc 1260
cgcgctacta aagatcataa gtttatgacc gtggacggcc agatgctgcc aattgacgaa 1320
atttttgaac gagagctgga tctgatgaga gtcgacaacc ttccaaacgg tggagggggg 1380
tcaggctctg cgcagctgga aaaggagctt caagccctcg aaaaaaagtt ggcccagctc 1440
gagtgggaga accaggctct ggagaaagaa ctggcccagt gattaattaa gaattcgacc 1500
cagctttctt gtacaaagtg gttggtaagc ctatccctaa ccctctcctc ggtctcgatt 1560
ctacgtagta atgagctagc agtctcgagg ttaacgaatt ccgccccccc cctaacgtta 1620
ctggccgaag ccgcttggaa taaggccggt gtgcgcttgt ctatatgtta ttttccacca 1680
tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 1740
ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 1800
aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 1860
agcggaaccc cccacctggc gacaggtgcc cctgcggcca aaagccacgt gtataagata 1920
cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 1980
tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 2040
attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 2100
taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat 2160
aataccatgg ccatgagcga gctgattaag gagaacatgc acatgaagct gtacatggag 2220
ggcaccgtgg acaaccatca cttcaagtgc acatccgagg gcgaaggcaa gccctacgag 2280
ggcacccaga ccatgagaat caaggtggtc gagggcggcc ctctcccctt cgccttcgac 2340
atcctggcta ctagcttcct ctacggcagc aagaccttca tcaaccacac ccagggcatc 2400
cccgacttct tcaagcagtc cttccctgag ggcttcacat gggagagagt caccacatac 2460
gaagacgggg gcgtgctgac cgctacccag gacaccagcc tccaggacgg ctgcctcatc 2520
tacaacgtca agatcagagg ggtgaacttc acatccaacg gccctgtgat gcagaagaaa 2580
acactcggct gggaggcctt caccgagacg ctgtaccccg ctgacggcgg cctggaaggc 2640
agaaacgaca tggccctgaa gctcgtgggc gggagccatc tgatcgcaaa catcaagacc 2700
acatatagat ccaagaaacc cgctaagaac ctcaagatgc ctggcgtcta ctatgtggac 2760
tacagactgg aaagaatcaa ggaggccaac aacgagacct acgtcgagca gcacgaggtg 2820
gcagtggcca gatactgcga cctccctagc aaactggggc acaagcttaa ttaacaccgg 2880
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 2940
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 3000
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 3060
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 3120
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 3180
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 3240
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 3300
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 3360
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 3420
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 3480
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 3540
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 3600
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 3660
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 3720
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 3780
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 3840
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 3900
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3960
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 4020
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 4080
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 4140
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 4200
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 4260
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 4320
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 4380
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 4440
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 4500
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 4560
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 4620
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4680
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4740
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4800
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 4860
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 4920
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 4980
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 5040
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 5100
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 5160
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 5220
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 5280
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 5340
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 5400
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 5460
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 5520
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 5580
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 5640
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 5700
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 5760
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 5820
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 5880
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 5940
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 6000
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 6060
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 6120
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 6180
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 6240
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 6300
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 6360
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 6420
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 6480
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 6540
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 6600
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 6660
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 6720
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 6780
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 6840
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 6900
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 6960
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 7020
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 7080
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 7140
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 7200
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 7260
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 7320
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 7380
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 7440
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 7500
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 7560
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 7620
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 7680
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 7740
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 7800
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 7860
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 7920
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 7980
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 8040
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 8100
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 8160
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 8220
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 8280
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 8340
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 8400
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 8460
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 8520
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 8580
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 8640
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 8700
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 8760
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 8820
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 8880
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 8940
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 9000
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 9060
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 9120
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 9180
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 9240
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 9300
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 9360
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 9420
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 9480
aggggg 9486
<210> 91
<211> 279
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 91
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Val Thr Lys Gly Gly Pro Leu Pro Phe Ser Trp Asp Ile Leu Ser Pro
50 55 60
Gln Phe Met Tyr Gly Ser Arg Ala Phe Thr Lys His Pro Ala Asp Ile
65 70 75 80
Pro Asp Tyr Tyr Lys Gln Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg
85 90 95
Val Met Asn Phe Glu Asp Gly Gly Ala Val Thr Val Thr Gln Asp Thr
100 105 110
Ser Leu Glu Asp Gly Thr Leu Ile Tyr Lys Val Lys Leu Arg Gly Thr
115 120 125
Asn Phe Pro Pro Asp Gly Pro Val Met Gln Lys Lys Cys Leu Ser Tyr
130 135 140
Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys
145 150 155 160
Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn
165 170 175
Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu
180 185 190
Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala
195 200 205
Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile
210 215 220
Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu
225 230 235 240
Pro Asn Gly Gly Gly Gly Ser Gly Ser Ala Gln Leu Glu Lys Glu Leu
245 250 255
Gln Ala Leu Glu Lys Lys Leu Ala Gln Leu Glu Trp Glu Asn Gln Ala
260 265 270
Leu Glu Lys Glu Leu Ala Gln
275
<210> 92
<211> 9162
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 92
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcaca atgggctggg aagcgtccac cgagcggttg 900
taccccgagg acggcgtgct gaagggcgac attaagatgg ccctgcgcct gaaggacggc 960
ggccgctacc tggcggactt caagaccacc tacaaggcca agaagcccgt gcagatgccc 1020
ggcgcctaca acgtcgaccg caagttggac atcacctccc acaacgagga ctacaccgtg 1080
gtggaacagt acgaacgctc cgagggccgc cactccaccg gcggcatgga cgagctgtac 1140
aagtgattaa ttaagaattc gacccagctt tcttgtacaa agtggttggt aagcctatcc 1200
ctaaccctct cctcggtctc gattctacgt agtaatgagc tagcagtctc gaggttaacg 1260
aattccgccc cccccctaac gttactggcc gaagccgctt ggaataaggc cggtgtgcgc 1320
ttgtctatat gttattttcc accatattgc cgtcttttgg caatgtgagg gcccggaaac 1380
ctggccctgt cttcttgacg agcattccta ggggtctttc ccctctcgcc aaaggaatgc 1440
aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga agcttcttga agacaaacaa 1500
cgtctgtagc gaccctttgc aggcagcgga accccccacc tggcgacagg tgcccctgcg 1560
gccaaaagcc acgtgtataa gatacacctg caaaggcggc acaaccccag tgccacgttg 1620
tgagttggat agttgtggaa agagtcaaat ggctctcctc aagcgtattc aacaaggggc 1680
tgaaggatgc ccagaaggta ccccattgta tgggatctga tctggggcct cggtgcacat 1740
gctttacatg tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac cacggggacg 1800
tggttttcct ttgaaaaaca cgataatacc atggtgagca agggcgagga gctgttcacc 1860
ggggtggtgc ccatcctggt cgagctggac ggcgacgtaa acggccacaa gttcagcgtg 1920
tccggcgagg gcgagggcga tgccacctac ggcaagctga ccctgaagtt catctgcacc 1980
accggcaagc tgcccgtgcc ctggcccacc ctcgtgacca ccctgaccta cggcgtgcag 2040
tgcttcagcc gctaccccga ccacatgaag cagcacgact tcttcaagtc cgccatgccc 2100
gaaggctacg tccaggagcg caccatcttc ttcaaggacg acggcaacta caagacccgc 2160
gccgaggtga agttcgaggg cgacaccctg gtgaaccgca tcgagctgaa gggcatcgac 2220
ttcaaggagg acggcaacat cctggggcac aagctggagt acaactacaa cagccacaac 2280
gtctatatca tggccgacaa gcagaagaac ggcatcaagg tgaacttcaa gatccgccac 2340
aacatcgagg acggcagcgt gcagctcgcc gaccactacc agcagaacac ccccatcggc 2400
gacggccccg tgctgctgcc cgacaaccac tacctgagca cccagtccgc cctgagcaaa 2460
gaccccaacg agaagcgcga tcacatggtc ctgctggagt tcgtgaccgc cgccgggatc 2520
actctcggca tggacgagct gtacaagtaa caccggtggc gcgttaagtc gacaatcaac 2580
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 2640
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 2700
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 2760
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 2820
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 2880
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 2940
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 3000
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 3060
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 3120
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 3180
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 3240
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 3300
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 3360
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 3420
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcagta cgtatagtag 3480
ttcatgtcat cttattattc agtatttata acttgcaaag aaatgaatat cagagagtga 3540
gaggaacttg tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt 3600
cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt 3660
atcttatcat gtctggctct agctatcccg cccctaactc cgcccatccc gcccctaact 3720
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 3780
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 3840
ctagggacgt acccaattcg ccctatagtg agtcgtatta cgcgcgctca ctggccgtcg 3900
ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 3960
atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 4020
agttgcgcag cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg 4080
gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt 4140
tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc 4200
gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg 4260
attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga 4320
cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc 4380
ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa 4440
aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa 4500
tttaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4560
acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4620
aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 4680
attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 4740
tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 4800
gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 4860
cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 4920
tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 4980
agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5040
tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5100
tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5160
tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5220
acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5280
accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5340
tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5400
cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5460
tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5520
actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5580
tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 5640
cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 5700
gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 5760
tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg ttcttctagt 5820
gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 5880
gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 5940
ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6000
acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg 6060
agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6120
cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6180
tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6240
gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6300
ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6360
ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6420
cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6480
ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6540
taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6600
tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct atgaccatga 6660
ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc tgcaagctta 6720
atgtagtctt atgcaatact cttgtagtct tgcaacatgg taacgatgag ttagcaacat 6780
gccttacaag gagagaaaaa gcaccgtgca tgccgattgg tggaagtaag gtggtacgat 6840
cgtgccttat taggaaggca acagacgggt ctgacatgga ttggacgaac cactgaattg 6900
ccgcattgca gagatattgt atttaagtgc ctagctcgat acataaacgg gtctctctgg 6960
ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct 7020
caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 7080
aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag tggcgcccga 7140
acagggactt gaaagcgaaa gggaaaccag aggagctctc tcgacgcagg actcggcttg 7200
ctgaagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca aaaattttga 7260
ctagcggagg ctagaaggag agagatgggt gcgagagcgt cagtattaag cgggggagaa 7320
ttagatcgcg atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa tataaattaa 7380
aacatatagt atgggcaagc agggagctag aacgattcgc agttaatcct ggcctgttag 7440
aaacatcaga aggctgtaga caaatactgg gacagctaca accatccctt cagacaggat 7500
cagaagaact tagatcatta tataatacag tagcaaccct ctattgtgtg catcaaagga 7560
tagagataaa agacaccaag gaagctttag acaagataga ggaagagcaa aacaaaagta 7620
agaccaccgc acagcaagcg gccgctgatc ttcagacctg gaggaggaga tatgagggac 7680
aattggagaa gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca 7740
cccaccaagg caaagagaag agtggtgcag agagaaaaaa gagcagtggg aataggagct 7800
ttgttccttg ggttcttggg agcagcagga agcactatgg gcgcagcgtc aatgacgctg 7860
acggtacagg ccagacaatt attgtctggt atagtgcagc agcagaacaa tttgctgagg 7920
gctattgagg cgcaacagca tctgttgcaa ctcacagtct ggggcatcaa gcagctccag 7980
gcaagaatcc tggctgtgga aagataccta aaggatcaac agctcctggg gatttggggt 8040
tgctctggaa aactcatttg caccactgct gtgccttgga atgctagttg gagtaataaa 8100
tctctggaac agatttggaa tcacacgacc tggatggagt gggacagaga aattaacaat 8160
tacacaagct taatacactc cttaattgaa gaatcgcaaa accagcaaga aaagaatgaa 8220
caagaattat tggaattaga taaatgggca agtttgtgga attggtttaa cataacaaat 8280
tggctgtggt atataaaatt attcataatg atagtaggag gcttggtagg tttaagaata 8340
gtttttgctg tactttctat agtgaataga gttaggcagg gatattcacc attatcgttt 8400
cagacccacc tcccaacccc gaggggaccc ttgcgccttt tccaaggcag ccctgggttt 8460
gcgcagggac gcggctgctc tgggcgtggt tccgggaaac gcagcggcgc cgaccctggg 8520
tctcgcacat tcttcacgtc cgttcgcagc gtcacccgga tcttcgccgc tacccttgtg 8580
ggccccccgg cgacgcttcc tgctccgccc ctaagtcggg aaggttcctt gcggttcgcg 8640
gcgtgccgga cgtgacaaac ggaagccgca cgtctcacta gtaccctcgc agacggacag 8700
cgccagggag caatggcagc gcgccgaccg cgatgggctg tggccaatag cggctgctca 8760
gcagggcgcg ccgagagcag cggccgggaa ggggcggtgc gggaggcggg gtgtggggcg 8820
gtagtgtggg ccctgttcct gcccgcgcgg tgttccgcat tctgcaagcc tccggagcgc 8880
acgtcggcag tcggctccct cgttgaccga atcaccgacc tctctcccca gggggtaccc 8940
agctgtctag agaattctag atcttgagac aaatggcagt attcatccac aattttaaaa 9000
gaaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 9060
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 9120
acagggacag cagagatcca ctttggcgcc ggctcgaggg gg 9162
<210> 93
<211> 167
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 93
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Thr Met Gly Trp Glu
65 70 75 80
Ala Ser Thr Glu Arg Leu Tyr Pro Glu Asp Gly Val Leu Lys Gly Asp
85 90 95
Ile Lys Met Ala Leu Arg Leu Lys Asp Gly Gly Arg Tyr Leu Ala Asp
100 105 110
Phe Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Met Pro Gly Ala
115 120 125
Tyr Asn Val Asp Arg Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr
130 135 140
Thr Val Val Glu Gln Tyr Glu Arg Ser Glu Gly Arg His Ser Thr Gly
145 150 155 160
Gly Met Asp Glu Leu Tyr Lys
165
<210> 94
<211> 9555
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 94
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
ctgaaggtga ccaagggtgg ccccctgccc ttctcctggg acatcctgtc ccctcagttc 840
atgtacggct ccagggcctt caccaagcac cccgccgaca tccccgacta ctataagcag 900
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgccgtg 960
accgtgaccc aggacacctc cctggaggac ggcaccctga tctacaaggt gaagctccgc 1020
ggcaccaact tccctcctga cggccccgta atgcagaaga agacaatggg ctgggaagcg 1080
tccaccgagc ggttgtaccc cgaggacggc gtgctgaagg gcgacattaa gtgcctttca 1140
tacgagaccg agatcctgac tgtcgagtac ggattgcttc ctatcggcaa aatcgtggag 1200
aagaggattg aatgtaccgt ctattcagtc gataataatg ggaacatcta cacacagccc 1260
gtggctcaat ggcacgacag aggagagcag gaagtttttg aatactgtct cgaggacgga 1320
tccctcatcc gcgctactaa agatcataag tttatgaccg tggacggcca gatgctgcca 1380
attgacgaaa tttttgaacg agagctggat ctgatgagag tcgacaacct tccaaacggt 1440
ggaggggggt caggctctgc gcagctggaa aaggagcttc aagccctcga aaaaaagttg 1500
gcccagctcg agtgggagaa ccaggctctg gagaaagaac tggcccagtg attaattaag 1560
aattcgaccc agctttcttg tacaaagtgg ttggtaagcc tatccctaac cctctcctcg 1620
gtctcgattc tacgtagtaa tgagctagca gtctcgaggt taacgaattc cgcccccccc 1680
ctaacgttac tggccgaagc cgcttggaat aaggccggtg tgcgcttgtc tatatgttat 1740
tttccaccat attgccgtct tttggcaatg tgagggcccg gaaacctggc cctgtcttct 1800
tgacgagcat tcctaggggt ctttcccctc tcgccaaagg aatgcaaggt ctgttgaatg 1860
tcgtgaagga agcagttcct ctggaagctt cttgaagaca aacaacgtct gtagcgaccc 1920
tttgcaggca gcggaacccc ccacctggcg acaggtgccc ctgcggccaa aagccacgtg 1980
tataagatac acctgcaaag gcggcacaac cccagtgcca cgttgtgagt tggatagttg 2040
tggaaagagt caaatggctc tcctcaagcg tattcaacaa ggggctgaag gatgcccaga 2100
aggtacccca ttgtatggga tctgatctgg ggcctcggtg cacatgcttt acatgtgttt 2160
agtcgaggtt aaaaaaacgt ctaggccccc cgaaccacgg ggacgtggtt ttcctttgaa 2220
aaacacgata ataccatggc catgagcgag ctgattaagg agaacatgca catgaagctg 2280
tacatggagg gcaccgtgga caaccatcac ttcaagtgca catccgaggg cgaaggcaag 2340
ccctacgagg gcacccagac catgagaatc aaggtggtcg agggcggccc tctccccttc 2400
gccttcgaca tcctggctac tagcttcctc tacggcagca agaccttcat caaccacacc 2460
cagggcatcc ccgacttctt caagcagtcc ttccctgagg gcttcacatg ggagagagtc 2520
accacatacg aagacggggg cgtgctgacc gctacccagg acaccagcct ccaggacggc 2580
tgcctcatct acaacgtcaa gatcagaggg gtgaacttca catccaacgg ccctgtgatg 2640
cagaagaaaa cactcggctg ggaggccttc accgagacgc tgtaccccgc tgacggcggc 2700
ctggaaggca gaaacgacat ggccctgaag ctcgtgggcg ggagccatct gatcgcaaac 2760
atcaagacca catatagatc caagaaaccc gctaagaacc tcaagatgcc tggcgtctac 2820
tatgtggact acagactgga aagaatcaag gaggccaaca acgagaccta cgtcgagcag 2880
cacgaggtgg cagtggccag atactgcgac ctccctagca aactggggca caagcttaat 2940
taacaccggt ggcgcgttaa gtcgacaatc aacctctgga ttacaaaatt tgtgaaagat 3000
tgactggtat tcttaactat gttgctcctt ttacgctatg tggatacgct gctttaatgc 3060
ctttgtatca tgctattgct tcccgtatgg ctttcatttt ctcctccttg tataaatcct 3120
ggttgctgtc tctttatgag gagttgtggc ccgttgtcag gcaacgtggc gtggtgtgca 3180
ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc caccacctgt cagctccttt 3240
ccgggacttt cgctttcccc ctccctattg ccacggcgga actcatcgcc gcctgccttg 3300
cccgctgctg gacaggggct cggctgttgg gcactgacaa ttccgtggtg ttgtcgggga 3360
aatcatcgtc ctttccttgg ctgctcgcct gtgttgccac ctggattctg cgcgggacgt 3420
ccttctgcta cgtcccttcg gccctcaatc cagcggacct tccttcccgc ggcctgctgc 3480
cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca gacgagtcgg atctcccttt 3540
gggccgcctc cccgcgtcga ctttaagacc aatgacttac aaggcagctg tagatcttag 3600
ccacttttta aaagaaaagg ggggactgga agggctaatt cactcccaac gaagacaaga 3660
tctgcttttt gcttgtactg ggtctctctg gttagaccag atctgagcct gggagctctc 3720
tggctaacta gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt 3780
agtgtgtgcc cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc 3840
agtgtggaaa atctctagca gtacgtatag tagttcatgt catcttatta ttcagtattt 3900
ataacttgca aagaaatgaa tatcagagag tgagaggaac ttgtttattg cagcttataa 3960
tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca 4020
ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctggc tctagctatc 4080
ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 4140
catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta 4200
ttccagaagt agtgaggagg cttttttgga ggcctaggga cgtacccaat tcgccctata 4260
gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc 4320
ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 4380
gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggg 4440
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 4500
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 4560
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 4620
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 4680
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 4740
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 4800
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 4860
acgcgaattt taacaaaata ttaacgctta caatttaggt ggcacttttc ggggaaatgt 4920
gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 4980
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 5040
tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 5100
agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 5160
cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 5220
aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 5280
gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 5340
agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 5400
aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 5460
gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 5520
ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 5580
aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 5640
aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 5700
tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 5760
agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 5820
ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 5880
ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 5940
ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 6000
acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 6060
agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 6120
ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 6180
cagagcgcag ataccaaata ctgttcttct agtgtagccg tagttaggcc accacttcaa 6240
gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 6300
cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 6360
gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 6420
caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag 6480
aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 6540
tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 6600
gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 6660
ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 6720
atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 6780
cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 6840
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 6900
cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 6960
accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 7020
acaatttcac acaggaaaca gctatgacca tgattacgcc aagcgcgcaa ttaaccctca 7080
ctaaagggaa caaaagctgg agctgcaagc ttaatgtagt cttatgcaat actcttgtag 7140
tcttgcaaca tggtaacgat gagttagcaa catgccttac aaggagagaa aaagcaccgt 7200
gcatgccgat tggtggaagt aaggtggtac gatcgtgcct tattaggaag gcaacagacg 7260
ggtctgacat ggattggacg aaccactgaa ttgccgcatt gcagagatat tgtatttaag 7320
tgcctagctc gatacataaa cgggtctctc tggttagacc agatctgagc ctgggagctc 7380
tctggctaac tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttcaa 7440
gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga gatccctcag acccttttag 7500
tcagtgtgga aaatctctag cagtggcgcc cgaacaggga cttgaaagcg aaagggaaac 7560
cagaggagct ctctcgacgc aggactcggc ttgctgaagc gcgcacggca agaggcgagg 7620
ggcggcgact ggtgagtacg ccaaaaattt tgactagcgg aggctagaag gagagagatg 7680
ggtgcgagag cgtcagtatt aagcggggga gaattagatc gcgatgggaa aaaattcggt 7740
taaggccagg gggaaagaaa aaatataaat taaaacatat agtatgggca agcagggagc 7800
tagaacgatt cgcagttaat cctggcctgt tagaaacatc agaaggctgt agacaaatac 7860
tgggacagct acaaccatcc cttcagacag gatcagaaga acttagatca ttatataata 7920
cagtagcaac cctctattgt gtgcatcaaa ggatagagat aaaagacacc aaggaagctt 7980
tagacaagat agaggaagag caaaacaaaa gtaagaccac cgcacagcaa gcggccgctg 8040
atcttcagac ctggaggagg agatatgagg gacaattgga gaagtgaatt atataaatat 8100
aaagtagtaa aaattgaacc attaggagta gcacccacca aggcaaagag aagagtggtg 8160
cagagagaaa aaagagcagt gggaatagga gctttgttcc ttgggttctt gggagcagca 8220
ggaagcacta tgggcgcagc gtcaatgacg ctgacggtac aggccagaca attattgtct 8280
ggtatagtgc agcagcagaa caatttgctg agggctattg aggcgcaaca gcatctgttg 8340
caactcacag tctggggcat caagcagctc caggcaagaa tcctggctgt ggaaagatac 8400
ctaaaggatc aacagctcct ggggatttgg ggttgctctg gaaaactcat ttgcaccact 8460
gctgtgcctt ggaatgctag ttggagtaat aaatctctgg aacagatttg gaatcacacg 8520
acctggatgg agtgggacag agaaattaac aattacacaa gcttaataca ctccttaatt 8580
gaagaatcgc aaaaccagca agaaaagaat gaacaagaat tattggaatt agataaatgg 8640
gcaagtttgt ggaattggtt taacataaca aattggctgt ggtatataaa attattcata 8700
atgatagtag gaggcttggt aggtttaaga atagtttttg ctgtactttc tatagtgaat 8760
agagttaggc agggatattc accattatcg tttcagaccc acctcccaac cccgagggga 8820
cccttgcgcc ttttccaagg cagccctggg tttgcgcagg gacgcggctg ctctgggcgt 8880
ggttccggga aacgcagcgg cgccgaccct gggtctcgca cattcttcac gtccgttcgc 8940
agcgtcaccc ggatcttcgc cgctaccctt gtgggccccc cggcgacgct tcctgctccg 9000
cccctaagtc gggaaggttc cttgcggttc gcggcgtgcc ggacgtgaca aacggaagcc 9060
gcacgtctca ctagtaccct cgcagacgga cagcgccagg gagcaatggc agcgcgccga 9120
ccgcgatggg ctgtggccaa tagcggctgc tcagcagggc gcgccgagag cagcggccgg 9180
gaaggggcgg tgcgggaggc ggggtgtggg gcggtagtgt gggccctgtt cctgcccgcg 9240
cggtgttccg cattctgcaa gcctccggag cgcacgtcgg cagtcggctc cctcgttgac 9300
cgaatcaccg acctctctcc ccagggggta cccagctgtc tagagaattc tagatcttga 9360
gacaaatggc agtattcatc cacaatttta aaagaaaagg ggggattggg gggtacagtg 9420
caggggaaag aatagtagac ataatagcaa cagacataca aactaaagaa ttacaaaaac 9480
aaattacaaa aattcaaaat tttcgggttt attacaggga cagcagagat ccactttggc 9540
gccggctcga ggggg 9555
<210> 95
<211> 302
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 95
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Val Thr Lys Gly Gly Pro Leu Pro Phe Ser Trp Asp Ile Leu Ser Pro
50 55 60
Gln Phe Met Tyr Gly Ser Arg Ala Phe Thr Lys His Pro Ala Asp Ile
65 70 75 80
Pro Asp Tyr Tyr Lys Gln Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg
85 90 95
Val Met Asn Phe Glu Asp Gly Gly Ala Val Thr Val Thr Gln Asp Thr
100 105 110
Ser Leu Glu Asp Gly Thr Leu Ile Tyr Lys Val Lys Leu Arg Gly Thr
115 120 125
Asn Phe Pro Pro Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp
130 135 140
Glu Ala Ser Thr Glu Arg Leu Tyr Pro Glu Asp Gly Val Leu Lys Gly
145 150 155 160
Asp Ile Lys Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr
165 170 175
Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr
180 185 190
Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala
195 200 205
Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu
210 215 220
Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val
225 230 235 240
Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp
245 250 255
Leu Met Arg Val Asp Asn Leu Pro Asn Gly Gly Gly Gly Ser Gly Ser
260 265 270
Ala Gln Leu Glu Lys Glu Leu Gln Ala Leu Glu Lys Lys Leu Ala Gln
275 280 285
Leu Glu Trp Glu Asn Gln Ala Leu Glu Lys Glu Leu Ala Gln
290 295 300
<210> 96
<211> 9093
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 96
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcatg gccctgcgcc tgaaggacgg cggccgctac 900
ctggcggact tcaagaccac ctacaaggcc aagaagcccg tgcagatgcc cggcgcctac 960
aacgtcgacc gcaagttgga catcacctcc cacaacgagg actacaccgt ggtggaacag 1020
tacgaacgct ccgagggccg ccactccacc ggcggcatgg acgagctgta caagtgatta 1080
attaagaatt cgacccagct ttcttgtaca aagtggttgg taagcctatc cctaaccctc 1140
tcctcggtct cgattctacg tagtaatgag ctagcagtct cgaggttaac gaattccgcc 1200
ccccccctaa cgttactggc cgaagccgct tggaataagg ccggtgtgcg cttgtctata 1260
tgttattttc caccatattg ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg 1320
tcttcttgac gagcattcct aggggtcttt cccctctcgc caaaggaatg caaggtctgt 1380
tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg aagacaaaca acgtctgtag 1440
cgaccctttg caggcagcgg aaccccccac ctggcgacag gtgcccctgc ggccaaaagc 1500
cacgtgtata agatacacct gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga 1560
tagttgtgga aagagtcaaa tggctctcct caagcgtatt caacaagggg ctgaaggatg 1620
cccagaaggt accccattgt atgggatctg atctggggcc tcggtgcaca tgctttacat 1680
gtgtttagtc gaggttaaaa aaacgtctag gccccccgaa ccacggggac gtggttttcc 1740
tttgaaaaac acgataatac catggtgagc aagggcgagg agctgttcac cggggtggtg 1800
cccatcctgg tcgagctgga cggcgacgta aacggccaca agttcagcgt gtccggcgag 1860
ggcgagggcg atgccaccta cggcaagctg accctgaagt tcatctgcac caccggcaag 1920
ctgcccgtgc cctggcccac cctcgtgacc accctgacct acggcgtgca gtgcttcagc 1980
cgctaccccg accacatgaa gcagcacgac ttcttcaagt ccgccatgcc cgaaggctac 2040
gtccaggagc gcaccatctt cttcaaggac gacggcaact acaagacccg cgccgaggtg 2100
aagttcgagg gcgacaccct ggtgaaccgc atcgagctga agggcatcga cttcaaggag 2160
gacggcaaca tcctggggca caagctggag tacaactaca acagccacaa cgtctatatc 2220
atggccgaca agcagaagaa cggcatcaag gtgaacttca agatccgcca caacatcgag 2280
gacggcagcg tgcagctcgc cgaccactac cagcagaaca cccccatcgg cgacggcccc 2340
gtgctgctgc ccgacaacca ctacctgagc acccagtccg ccctgagcaa agaccccaac 2400
gagaagcgcg atcacatggt cctgctggag ttcgtgaccg ccgccgggat cactctcggc 2460
atggacgagc tgtacaagta acaccggtgg cgcgttaagt cgacaatcaa cctctggatt 2520
acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg 2580
gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct 2640
cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc 2700
aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca 2760
ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac 2820
tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt 2880
ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct 2940
ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc 3000
cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga 3060
cgagtcggat ctccctttgg gccgcctccc cgcgtcgact ttaagaccaa tgacttacaa 3120
ggcagctgta gatcttagcc actttttaaa agaaaagggg ggactggaag ggctaattca 3180
ctcccaacga agacaagatc tgctttttgc ttgtactggg tctctctggt tagaccagat 3240
ctgagcctgg gagctctctg gctaactagg gaacccactg cttaagcctc aataaagctt 3300
gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt gactctggta actagagatc 3360
cctcagaccc ttttagtcag tgtggaaaat ctctagcagt acgtatagta gttcatgtca 3420
tcttattatt cagtatttat aacttgcaaa gaaatgaata tcagagagtg agaggaactt 3480
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 3540
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 3600
tgtctggctc tagctatccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt 3660
tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc 3720
gcctcggcct ctgagctatt ccagaagtag tgaggaggct tttttggagg cctagggacg 3780
tacccaattc gccctatagt gagtcgtatt acgcgcgctc actggccgtc gttttacaac 3840
gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt 3900
tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca 3960
gcctgaatgg cgaatgggac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg 4020
ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct 4080
tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc 4140
ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg 4200
atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt 4260
ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg 4320
tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc 4380
tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttaggtgg 4440
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 4500
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 4560
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 4620
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 4680
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 4740
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 4800
atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 4860
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 4920
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 4980
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 5040
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 5100
gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 5160
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 5220
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 5280
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 5340
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 5400
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 5460
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 5520
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 5580
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 5640
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 5700
gaaggtaact ggcttcagca gagcgcagat accaaatact gttcttctag tgtagccgta 5760
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 5820
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 5880
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 5940
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 6000
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 6060
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 6120
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 6180
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 6240
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 6300
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 6360
ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag 6420
ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag 6480
ttagctcact cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg 6540
tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa 6600
gcgcgcaatt aaccctcact aaagggaaca aaagctggag ctgcaagctt aatgtagtct 6660
tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca tgccttacaa 6720
ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga tcgtgcctta 6780
ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt gccgcattgc 6840
agagatattg tatttaagtg cctagctcga tacataaacg ggtctctctg gttagaccag 6900
atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc tcaataaagc 6960
ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg taactagaga 7020
tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg aacagggact 7080
tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt gctgaagcgc 7140
gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg actagcggag 7200
gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga attagatcgc 7260
gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta aaacatatag 7320
tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta gaaacatcag 7380
aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga tcagaagaac 7440
ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg atagagataa 7500
aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt aagaccaccg 7560
cacagcaagc ggccgctgat cttcagacct ggaggaggag atatgaggga caattggaga 7620
agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag 7680
gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg gaataggagc tttgttcctt 7740
gggttcttgg gagcagcagg aagcactatg ggcgcagcgt caatgacgct gacggtacag 7800
gccagacaat tattgtctgg tatagtgcag cagcagaaca atttgctgag ggctattgag 7860
gcgcaacagc atctgttgca actcacagtc tggggcatca agcagctcca ggcaagaatc 7920
ctggctgtgg aaagatacct aaaggatcaa cagctcctgg ggatttgggg ttgctctgga 7980
aaactcattt gcaccactgc tgtgccttgg aatgctagtt ggagtaataa atctctggaa 8040
cagatttgga atcacacgac ctggatggag tgggacagag aaattaacaa ttacacaagc 8100
ttaatacact ccttaattga agaatcgcaa aaccagcaag aaaagaatga acaagaatta 8160
ttggaattag ataaatgggc aagtttgtgg aattggttta acataacaaa ttggctgtgg 8220
tatataaaat tattcataat gatagtagga ggcttggtag gtttaagaat agtttttgct 8280
gtactttcta tagtgaatag agttaggcag ggatattcac cattatcgtt tcagacccac 8340
ctcccaaccc cgaggggacc cttgcgcctt ttccaaggca gccctgggtt tgcgcaggga 8400
cgcggctgct ctgggcgtgg ttccgggaaa cgcagcggcg ccgaccctgg gtctcgcaca 8460
ttcttcacgt ccgttcgcag cgtcacccgg atcttcgccg ctacccttgt gggccccccg 8520
gcgacgcttc ctgctccgcc cctaagtcgg gaaggttcct tgcggttcgc ggcgtgccgg 8580
acgtgacaaa cggaagccgc acgtctcact agtaccctcg cagacggaca gcgccaggga 8640
gcaatggcag cgcgccgacc gcgatgggct gtggccaata gcggctgctc agcagggcgc 8700
gccgagagca gcggccggga aggggcggtg cgggaggcgg ggtgtggggc ggtagtgtgg 8760
gccctgttcc tgcccgcgcg gtgttccgca ttctgcaagc ctccggagcg cacgtcggca 8820
gtcggctccc tcgttgaccg aatcaccgac ctctctcccc agggggtacc cagctgtcta 8880
gagaattcta gatcttgaga caaatggcag tattcatcca caattttaaa agaaaagggg 8940
ggattggggg gtacagtgca ggggaaagaa tagtagacat aatagcaaca gacatacaaa 9000
ctaaagaatt acaaaaacaa attacaaaaa ttcaaaattt tcgggtttat tacagggaca 9060
gcagagatcc actttggcgc cggctcgagg ggg 9093
<210> 97
<211> 144
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 97
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Met Ala Leu Arg Leu
65 70 75 80
Lys Asp Gly Gly Arg Tyr Leu Ala Asp Phe Lys Thr Thr Tyr Lys Ala
85 90 95
Lys Lys Pro Val Gln Met Pro Gly Ala Tyr Asn Val Asp Arg Lys Leu
100 105 110
Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Val Val Glu Gln Tyr Glu
115 120 125
Arg Ser Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys
130 135 140
<210> 98
<211> 3530
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 98
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgccgccac catgagcgag ctgattaagg agaacatgca 720
catgaagctg tacatggagg gcaccgtgga caaccatcac ttcaagtgca catccgaggg 780
cgaaggcaag ccctacgagg gcacccagac catgagaatc aaggtggtcg agggcggccc 840
tctccccttc gccttcgaca tcctggctac tagcttcctc tacggcagca agaccttcat 900
caaccacacc cagggcatcc ccgacttctt caagcagtcc ttccctgagg gcttcacatg 960
ggagagagtc accacatacg aagacggggg cgtgctgacc gctacccagg acaccagcct 1020
ccaggacggc tgcctcatct acaacgtcaa gatcagaggg gtgaacttca catccaacgg 1080
ccctgtgatg cagaagaaaa cactcggctg ggaggccttc accgagacgc tgtaccccgc 1140
tgacggcggc ctggaaggca gaaacgacat ggccctgaag ctcgtgggcg ggagccatct 1200
gatcgcaaac atcaagacca catatagatc caagaaaccc gctaagaacc tcaagatgcc 1260
tggcgtctac tatgtggact acagactgga aagaatcaag gaggccaaca acgagaccta 1320
cgtcgagcag cacgaggtgg cagtggccag atactgcgac ctccctagca aactggggca 1380
caagcttaat taattaatta agaattcgac ccagctttct tgtacaaagt tggcattata 1440
aaaaataatt gctcatcaat ttgttgcaac gaacaggtca ctatcagtca aaataaaatc 1500
attatttgcc atccagctga tatcccctat agtgagtcgt attacatggt catagctgtt 1560
tcctggcagc tctggcccgt gtctcaaaat ctctgatgtt acattgcaca agataaaaat 1620
atatcatcat gcctcctcta gaccagccag gacagaaatg cctcgacttc gctgctgccc 1680
aaggttgccg ggtgacgcac accgtggaaa cggatgaagg cacgaaccca gtggacataa 1740
gcctgttcgg ttcgtaagct gtaatgcaag tagcgtatgc gctcacgcaa ctggtccaga 1800
accttgaccg aacgcagcgg tggtaacggc gcagtggcgg ttttcatggc ttgttatgac 1860
tgtttttttg gggtacagtc tatgcctcgg gcatccaagc agcaagcgcg ttacgccgtg 1920
ggtcgatgtt tgatgttatg gagcagcaac gatgttacgc agcagggcag tcgccctaaa 1980
acaaagttaa acatcatgag ggaagcggtg atcgccgaag tatcgactca actatcagag 2040
gtagttggcg tcatcgagcg ccatctcgaa ccgacgttgc tggccgtaca tttgtacggc 2100
tccgcagtgg atggcggcct gaagccacac agtgatattg atttgctggt tacggtgacc 2160
gtaaggcttg atgaaacaac gcggcgagct ttgatcaacg accttttgga aacttcggct 2220
tcccctggag agagcgagat tctccgcgct gtagaagtca ccattgttgt gcacgacgac 2280
atcattccgt ggcgttatcc agctaagcgc gaactgcaat ttggagaatg gcagcgcaat 2340
gacattcttg caggtatctt cgagccagcc acgatcgaca ttgatctggc tatcttgctg 2400
acaaaagcaa gagaacatag cgttgccttg gtaggtccag cggcggagga actctttgat 2460
ccggttcctg aacaggatct atttgaggcg ctaaatgaaa ccttaacgct atggaactcg 2520
ccgcccgact gggctggcga tgagcgaaat gtagtgctta cgttgtcccg catttggtac 2580
agcgcagtaa ccggcaaaat cgcgccgaag gatgtcgctg ccgactgggc aatggagcgc 2640
ctgccggccc agtatcagcc cgtcatactt gaagctagac aggcttatct tggacaagaa 2700
gaagatcgct tggcctcgcg cgcagatcag ttggaagaat ttgtccacta cgtgaaaggc 2760
gagatcacca aggtagtcgg caaataaccc tcgagccacc catgaccaaa atcccttaac 2820
gtgagttacg cgtcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 2880
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 2940
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 3000
agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 3060
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 3120
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 3180
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 3240
tacaccgaac tgagatacct acagcgtgag cattgagaaa gcgccacgct tcccgaaggg 3300
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 3360
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 3420
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 3480
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 3530
<210> 99
<211> 233
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 99
Met Ser Glu Leu Ile Lys Glu Asn Met His Met Lys Leu Tyr Met Glu
1 5 10 15
Gly Thr Val Asp Asn His His Phe Lys Cys Thr Ser Glu Gly Glu Gly
20 25 30
Lys Pro Tyr Glu Gly Thr Gln Thr Met Arg Ile Lys Val Val Glu Gly
35 40 45
Gly Pro Leu Pro Phe Ala Phe Asp Ile Leu Ala Thr Ser Phe Leu Tyr
50 55 60
Gly Ser Lys Thr Phe Ile Asn His Thr Gln Gly Ile Pro Asp Phe Phe
65 70 75 80
Lys Gln Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Val Thr Thr Tyr
85 90 95
Glu Asp Gly Gly Val Leu Thr Ala Thr Gln Asp Thr Ser Leu Gln Asp
100 105 110
Gly Cys Leu Ile Tyr Asn Val Lys Ile Arg Gly Val Asn Phe Thr Ser
115 120 125
Asn Gly Pro Val Met Gln Lys Lys Thr Leu Gly Trp Glu Ala Phe Thr
130 135 140
Glu Thr Leu Tyr Pro Ala Asp Gly Gly Leu Glu Gly Arg Asn Asp Met
145 150 155 160
Ala Leu Lys Leu Val Gly Gly Ser His Leu Ile Ala Asn Ile Lys Thr
165 170 175
Thr Tyr Arg Ser Lys Lys Pro Ala Lys Asn Leu Lys Met Pro Gly Val
180 185 190
Tyr Tyr Val Asp Tyr Arg Leu Glu Arg Ile Lys Glu Ala Asn Asn Glu
195 200 205
Thr Tyr Val Glu Gln His Glu Val Ala Val Ala Arg Tyr Cys Asp Leu
210 215 220
Pro Ser Lys Leu Gly His Lys Leu Asn
225 230
<210> 100
<211> 3539
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 100
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgccgccac catggtgagc aagggcgagg aggataacat 720
ggccatcatc aaggagttca tgcgcttcaa ggtgcacatg gagggctccg tgaacggcca 780
cgagttcgag atcgagggcg agggcgaggg ccgcccctac gagggcaccc agaccgccaa 840
gctgaaggtg accaagggtg gccccctgcc cttcgcctgg gacatcctgt cccctcagtt 900
catgtacggc tccaaggcct acgtgaagca ccccgccgac atccccgact acttgaagct 960
gtccttcccc gagggcttca agtgggagcg cgtgatgaac ttcgaggacg gcggcgtggt 1020
gaccgtgacc caggactcct ccctgcagga cggcgagttc atctacaagg tgaagctgcg 1080
cggcaccaac ttcccctccg acggccccgt aatgcagaag aagaccatgg gctgggaggc 1140
ctcctccgag cggatgtacc ccgaggacgg cgccctgaag ggcgagatca agcagaggct 1200
gaagctgaag gacggcggcc actacgacgc tgaggtcaag accacctaca aggccaagaa 1260
gcccgtgcag ctgcccggcg cctacaacgt caacattaag ttggacatca cctcccacaa 1320
cgaggactac accatcgtgg aacagtacga acgcgccgag ggccgccact ccaccggcgg 1380
catggacgag ctgtacaagt gattaattaa gaattcgacc cagctttctt gtacaaagtt 1440
ggcattataa aaaataattg ctcatcaatt tgttgcaacg aacaggtcac tatcagtcaa 1500
aataaaatca ttatttgcca tccagctgat atcccctata gtgagtcgta ttacatggtc 1560
atagctgttt cctggcagct ctggcccgtg tctcaaaatc tctgatgtta cattgcacaa 1620
gataaaaata tatcatcatg cctcctctag accagccagg acagaaatgc ctcgacttcg 1680
ctgctgccca aggttgccgg gtgacgcaca ccgtggaaac ggatgaaggc acgaacccag 1740
tggacataag cctgttcggt tcgtaagctg taatgcaagt agcgtatgcg ctcacgcaac 1800
tggtccagaa ccttgaccga acgcagcggt ggtaacggcg cagtggcggt tttcatggct 1860
tgttatgact gtttttttgg ggtacagtct atgcctcggg catccaagca gcaagcgcgt 1920
tacgccgtgg gtcgatgttt gatgttatgg agcagcaacg atgttacgca gcagggcagt 1980
cgccctaaaa caaagttaaa catcatgagg gaagcggtga tcgccgaagt atcgactcaa 2040
ctatcagagg tagttggcgt catcgagcgc catctcgaac cgacgttgct ggccgtacat 2100
ttgtacggct ccgcagtgga tggcggcctg aagccacaca gtgatattga tttgctggtt 2160
acggtgaccg taaggcttga tgaaacaacg cggcgagctt tgatcaacga ccttttggaa 2220
acttcggctt cccctggaga gagcgagatt ctccgcgctg tagaagtcac cattgttgtg 2280
cacgacgaca tcattccgtg gcgttatcca gctaagcgcg aactgcaatt tggagaatgg 2340
cagcgcaatg acattcttgc aggtatcttc gagccagcca cgatcgacat tgatctggct 2400
atcttgctga caaaagcaag agaacatagc gttgccttgg taggtccagc ggcggaggaa 2460
ctctttgatc cggttcctga acaggatcta tttgaggcgc taaatgaaac cttaacgcta 2520
tggaactcgc cgcccgactg ggctggcgat gagcgaaatg tagtgcttac gttgtcccgc 2580
atttggtaca gcgcagtaac cggcaaaatc gcgccgaagg atgtcgctgc cgactgggca 2640
atggagcgcc tgccggccca gtatcagccc gtcatacttg aagctagaca ggcttatctt 2700
ggacaagaag aagatcgctt ggcctcgcgc gcagatcagt tggaagaatt tgtccactac 2760
gtgaaaggcg agatcaccaa ggtagtcggc aaataaccct cgagccaccc atgaccaaaa 2820
tcccttaacg tgagttacgc gtcgttccac tgagcgtcag accccgtaga aaagatcaaa 2880
ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca 2940
ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta 3000
actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc 3060
caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca 3120
gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta 3180
ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag 3240
cgaacgacct acaccgaact gagataccta cagcgtgagc attgagaaag cgccacgctt 3300
cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc 3360
acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac 3420
ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac 3480
gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgtt 3539
<210> 101
<211> 236
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 101
Met Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe
1 5 10 15
Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His Glu Phe
20 25 30
Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr
35 40 45
Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp
50 55 60
Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His
65 70 75 80
Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe
85 90 95
Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val Thr Val
100 105 110
Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys
115 120 125
Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys
130 135 140
Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly
145 150 155 160
Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly
165 170 175
His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val
180 185 190
Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser
195 200 205
His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly
210 215 220
Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys
225 230 235
<210> 102
<211> 10137
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 102
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggcca tgaaaaagcc tgaactcacc gcgacgtctg tcgagaagtt tctgatcgaa 3000
aagttcgaca gcgtctccga cctgatgcag ctctcggagg gcgaagaatc tcgtgctttc 3060
agcttcgatg taggagggcg tggatatgtc ctgcgggtaa atagctgcgc cgatggtttc 3120
tacaaagatc gttatgttta tcggcacttt gcatcggccg cgctcccgat tccggaagtg 3180
cttgacattg gggaatttag cgagagcctg acctattgcc tttcatacga gaccgagatc 3240
ctgactgtcg agtacggatt gcttcctatc ggcaaaatcg tggagaagag gattgaatgt 3300
accgtctatt cagtcgataa taatgggaac atctacacac agcccgtggc tcaatggcac 3360
gacagaggag agcaggaagt ttttgaatac tgtctcgagg acggatccct catccgcgct 3420
actaaagatc ataagtttat gaccgtggac ggccagatgc tgccaattga cgaaattttt 3480
gaacgagagc tggatctgat gagagtcgac aaccttccaa actgacaccg gtggcgcgtt 3540
aagtcgacaa tcaacctctg gattacaaaa tttgtgaaag attgactggt attcttaact 3600
atgttgctcc ttttacgcta tgtggatacg ctgctttaat gcctttgtat catgctattg 3660
cttcccgtat ggctttcatt ttctcctcct tgtataaatc ctggttgctg tctctttatg 3720
aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt gctgacgcaa 3780
cccccactgg ttggggcatt gccaccacct gtcagctcct ttccgggact ttcgctttcc 3840
ccctccctat tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg 3900
ctcggctgtt gggcactgac aattccgtgg tgttgtcggg gaaatcatcg tcctttcctt 3960
ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac gtccttctgc tacgtccctt 4020
cggccctcaa tccagcggac cttccttccc gcggcctgct gccggctctg cggcctcttc 4080
cgcgtcttcg ccttcgccct cagacgagtc ggatctccct ttgggccgcc tccccgcgtc 4140
gactttaaga ccaatgactt acaaggcagc tgtagatctt agccactttt taaaagaaaa 4200
ggggggactg gaagggctaa ttcactccca acgaagacaa gatctgcttt ttgcttgtac 4260
tgggtctctc tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc 4320
actgcttaag cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt 4380
gtgtgactct ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag 4440
cagtacgtat agtagttcat gtcatcttat tattcagtat ttataacttg caaagaaatg 4500
aatatcagag agtgagagga acttgtttat tgcagcttat aatggttaca aataaagcaa 4560
tagcatcaca aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc 4620
caaactcatc aatgtatctt atcatgtctg gctctagcta tcccgcccct aactccgccc 4680
atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 4740
tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 4800
ggcttttttg gaggcctagg gacgtaccca attcgcccta tagtgagtcg tattacgcgc 4860
gctcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 4920
atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 4980
atcgcccttc ccaacagttg cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg 5040
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 5100
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc 5160
gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg 5220
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg 5280
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg 5340
gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt 5400
cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa 5460
tattaacgct tacaatttag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 5520
tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 5580
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 5640
tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 5700
aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 5760
cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 5820
agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg 5880
ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 5940
tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 6000
tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 6060
caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 6120
accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact 6180
attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc 6240
ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga 6300
taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg 6360
taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg 6420
aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca 6480
agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta 6540
ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca 6600
ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg 6660
cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga 6720
tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa 6780
tactgttctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc 6840
tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg 6900
tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 6960
ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct 7020
acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc 7080
ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg 7140
gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg 7200
ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct 7260
ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga 7320
taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 7380
cagcgagtca gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc 7440
gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga aagcgggcag 7500
tgagcgcaac gcaattaatg tgagttagct cactcattag gcaccccagg ctttacactt 7560
tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc acacaggaaa 7620
cagctatgac catgattacg ccaagcgcgc aattaaccct cactaaaggg aacaaaagct 7680
ggagctgcaa gcttaatgta gtcttatgca atactcttgt agtcttgcaa catggtaacg 7740
atgagttagc aacatgcctt acaaggagag aaaaagcacc gtgcatgccg attggtggaa 7800
gtaaggtggt acgatcgtgc cttattagga aggcaacaga cgggtctgac atggattgga 7860
cgaaccactg aattgccgca ttgcagagat attgtattta agtgcctagc tcgatacata 7920
aacgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta actagggaac 7980
ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg 8040
ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg gaaaatctct 8100
agcagtggcg cccgaacagg gacttgaaag cgaaagggaa accagaggag ctctctcgac 8160
gcaggactcg gcttgctgaa gcgcgcacgg caagaggcga ggggcggcga ctggtgagta 8220
cgccaaaaat tttgactagc ggaggctaga aggagagaga tgggtgcgag agcgtcagta 8280
ttaagcgggg gagaattaga tcgcgatggg aaaaaattcg gttaaggcca gggggaaaga 8340
aaaaatataa attaaaacat atagtatggg caagcaggga gctagaacga ttcgcagtta 8400
atcctggcct gttagaaaca tcagaaggct gtagacaaat actgggacag ctacaaccat 8460
cccttcagac aggatcagaa gaacttagat cattatataa tacagtagca accctctatt 8520
gtgtgcatca aaggatagag ataaaagaca ccaaggaagc tttagacaag atagaggaag 8580
agcaaaacaa aagtaagacc accgcacagc aagcggccgc tgatcttcag acctggagga 8640
ggagatatga gggacaattg gagaagtgaa ttatataaat ataaagtagt aaaaattgaa 8700
ccattaggag tagcacccac caaggcaaag agaagagtgg tgcagagaga aaaaagagca 8760
gtgggaatag gagctttgtt ccttgggttc ttgggagcag caggaagcac tatgggcgca 8820
gcgtcaatga cgctgacggt acaggccaga caattattgt ctggtatagt gcagcagcag 8880
aacaatttgc tgagggctat tgaggcgcaa cagcatctgt tgcaactcac agtctggggc 8940
atcaagcagc tccaggcaag aatcctggct gtggaaagat acctaaagga tcaacagctc 9000
ctggggattt ggggttgctc tggaaaactc atttgcacca ctgctgtgcc ttggaatgct 9060
agttggagta ataaatctct ggaacagatt tggaatcaca cgacctggat ggagtgggac 9120
agagaaatta acaattacac aagcttaata cactccttaa ttgaagaatc gcaaaaccag 9180
caagaaaaga atgaacaaga attattggaa ttagataaat gggcaagttt gtggaattgg 9240
tttaacataa caaattggct gtggtatata aaattattca taatgatagt aggaggcttg 9300
gtaggtttaa gaatagtttt tgctgtactt tctatagtga atagagttag gcagggatat 9360
tcaccattat cgtttcagac ccacctccca accccgaggg gacccttgcg ccttttccaa 9420
ggcagccctg ggtttgcgca gggacgcggc tgctctgggc gtggttccgg gaaacgcagc 9480
ggcgccgacc ctgggtctcg cacattcttc acgtccgttc gcagcgtcac ccggatcttc 9540
gccgctaccc ttgtgggccc cccggcgacg cttcctgctc cgcccctaag tcgggaaggt 9600
tccttgcggt tcgcggcgtg ccggacgtga caaacggaag ccgcacgtct cactagtacc 9660
ctcgcagacg gacagcgcca gggagcaatg gcagcgcgcc gaccgcgatg ggctgtggcc 9720
aatagcggct gctcagcagg gcgcgccgag agcagcggcc gggaaggggc ggtgcgggag 9780
gcggggtgtg gggcggtagt gtgggccctg ttcctgcccg cgcggtgttc cgcattctgc 9840
aagcctccgg agcgcacgtc ggcagtcggc tccctcgttg accgaatcac cgacctctct 9900
ccccaggggg tacccagctg tctagagaat tctagatctt gagacaaatg gcagtattca 9960
tccacaattt taaaagaaaa ggggggattg gggggtacag tgcaggggaa agaatagtag 10020
acataatagc aacagacata caaactaaag aattacaaaa acaaattaca aaaattcaaa 10080
attttcgggt ttattacagg gacagcagag atccactttg gcgccggctc gaggggg 10137
<210> 103
<211> 193
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 103
Met Ala Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe
1 5 10 15
Leu Ile Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu
20 25 30
Gly Glu Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr
35 40 45
Val Leu Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr
50 55 60
Val Tyr Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu
65 70 75 80
Asp Ile Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Leu Ser Tyr Glu
85 90 95
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile
100 105 110
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly
115 120 125
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln
130 135 140
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr
145 150 155 160
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp
165 170 175
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro
180 185 190
Asn
<210> 104
<211> 10428
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 104
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggcca tgattaagat cgctacgcgg aagtacctgg ggaaacagaa cgtctacgac 3000
ataggtgtgg agcgcgatca caactttgct ctgaaaaatg gatttatcgc cagcaactgc 3060
atctcccgcc gtgcacaggg tgtcacgttg caagacctgc ctgaaaccga actgcccgct 3120
gttctgcagc cggtcgcgga ggccatggat gcgatcgctg cggccgatct tagccagacg 3180
agcgggttcg gcccattcgg accgcaagga atcggtcaat acactacatg gcgtgatttc 3240
atatgcgcga ttgctgatcc ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc 3300
agtgcgtccg tcgcgcaggc tctcgatgag ctgatgcttt gggccgagga ctgccccgaa 3360
gtccggcacc tcgtgcacgc ggatttcggc tccaacaatg tcctgacgga caatggccgc 3420
ataacagcgg tcattgactg gagcgaggcg atgttcgggg attcccaata cgaggtcgcc 3480
aacatcttct tctggaggcc gtggttggct tgtatggagc agcagacgcg ctacttcgag 3540
cggaggcatc cggagcttgc aggatcgccg cggctccggg cgtatatgct ccgcattggt 3600
cttgaccaac tctatcagag cttggttgac ggcaatttcg atgatgcagc ttgggcgcag 3660
ggtcgatgcg acgcaatcgt ccgatccgga gccgggactg tcgggcgtac acaaatcgcc 3720
cgcagaagcg cggccgtctg gaccgatggc tgtgtagaag tactcgccga tagtggaaac 3780
cgacgcccca gcactcgtcc gagggcaaag gaatagcacc ggtggcgcgt taagtcgaca 3840
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc 3900
cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta 3960
tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt 4020
ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg 4080
gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta 4140
ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt 4200
tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg 4260
cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca 4320
atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc 4380
gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgcgt cgactttaag 4440
accaatgact tacaaggcag ctgtagatct tagccacttt ttaaaagaaa aggggggact 4500
ggaagggcta attcactccc aacgaagaca agatctgctt tttgcttgta ctgggtctct 4560
ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa 4620
gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc 4680
tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtacgta 4740
tagtagttca tgtcatctta ttattcagta tttataactt gcaaagaaat gaatatcaga 4800
gagtgagagg aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 4860
aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 4920
caatgtatct tatcatgtct ggctctagct atcccgcccc taactccgcc catcccgccc 4980
ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 5040
gcagaggccg aggccgcctc ggcctctgag ctattccaga agtagtgagg aggctttttt 5100
ggaggcctag ggacgtaccc aattcgccct atagtgagtc gtattacgcg cgctcactgg 5160
ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 5220
cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 5280
cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc gcattaagcg 5340
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 5400
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 5460
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 5520
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 5580
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 5640
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 5700
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 5760
ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 5820
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 5880
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 5940
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 6000
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 6060
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 6120
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 6180
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 6240
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 6300
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 6360
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 6420
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 6480
cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 6540
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 6600
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 6660
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 6720
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 6780
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 6840
cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 6900
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 6960
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 7020
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 7080
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 7140
cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 7200
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 7260
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 7320
gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 7380
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 7440
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 7500
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 7560
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 7620
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 7680
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 7740
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 7800
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 7860
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga 7920
ccatgattac gccaagcgcg caattaaccc tcactaaagg gaacaaaagc tggagctgca 7980
agcttaatgt agtcttatgc aatactcttg tagtcttgca acatggtaac gatgagttag 8040
caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc gattggtgga agtaaggtgg 8100
tacgatcgtg ccttattagg aaggcaacag acgggtctga catggattgg acgaaccact 8160
gaattgccgc attgcagaga tattgtattt aagtgcctag ctcgatacat aaacgggtct 8220
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 8280
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 8340
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 8400
gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 8460
ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 8520
ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 8580
ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 8640
aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 8700
tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 8760
caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 8820
aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 8880
aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 8940
agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 9000
gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 9060
ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 9120
acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 9180
ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 9240
ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 9300
tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 9360
aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 9420
aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 9480
aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 9540
acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 9600
agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 9660
tcgtttcaga cccacctccc aaccccgagg ggacccttgc gccttttcca aggcagccct 9720
gggtttgcgc agggacgcgg ctgctctggg cgtggttccg ggaaacgcag cggcgccgac 9780
cctgggtctc gcacattctt cacgtccgtt cgcagcgtca cccggatctt cgccgctacc 9840
cttgtgggcc ccccggcgac gcttcctgct ccgcccctaa gtcgggaagg ttccttgcgg 9900
ttcgcggcgt gccggacgtg acaaacggaa gccgcacgtc tcactagtac cctcgcagac 9960
ggacagcgcc agggagcaat ggcagcgcgc cgaccgcgat gggctgtggc caatagcggc 10020
tgctcagcag ggcgcgccga gagcagcggc cgggaagggg cggtgcggga ggcggggtgt 10080
ggggcggtag tgtgggccct gttcctgccc gcgcggtgtt ccgcattctg caagcctccg 10140
gagcgcacgt cggcagtcgg ctccctcgtt gaccgaatca ccgacctctc tccccagggg 10200
gtacccagct gtctagagaa ttctagatct tgagacaaat ggcagtattc atccacaatt 10260
ttaaaagaaa aggggggatt ggggggtaca gtgcagggga aagaatagta gacataatag 10320
caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 10380
tttattacag ggacagcaga gatccacttt ggcgccggct cgaggggg 10428
<210> 105
<211> 290
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 105
Met Ala Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn
1 5 10 15
Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn
20 25 30
Gly Phe Ile Ala Ser Asn Cys Ile Ser Arg Arg Ala Gln Gly Val Thr
35 40 45
Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu Gln Pro Val
50 55 60
Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser Gln Thr Ser
65 70 75 80
Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr Thr Thr Trp
85 90 95
Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr His Trp Gln
100 105 110
Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln Ala Leu Asp
115 120 125
Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg His Leu Val
130 135 140
His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile
145 150 155 160
Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr
165 170 175
Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Met Glu
180 185 190
Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser
195 200 205
Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr
210 215 220
Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly
225 230 235 240
Arg Cys Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val Gly Arg Thr
245 250 255
Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly Cys Val Glu
260 265 270
Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg Pro Arg Ala
275 280 285
Lys Glu
290
<210> 106
<211> 9222
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 106
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagcga gctgattaag 660
gagaacatgc acatgaagct gtacatggag ggcaccgtgg acaaccatca cttcaagtgc 720
acatccgagg gcgaaggcaa gccctacgag ggcacccaga ccatgagaat caaggtggtc 780
gagggcggcc ctctcccctt cgccttcgac atcctggcta ctagcttcct ctacggcagc 840
aagaccttca tcaaccacac ccagggcatc cccgacttct tcaagcagtc cttccctgag 900
ggcttcacat gggagagagt caccacatac gaagacgggg gcgtgctgac cgctacccag 960
gacaccagcc tccaggacgg ctgcctcatc tacaacgtca agatcagagg ggtgaacttc 1020
acatccaacg gccctgtgat gcagaagaaa acactcggct gggaggcctt caccgagacg 1080
ctgtaccccg ctgacggcgg cctggaaggc agaaacgaca tggccctgaa gctcgtgggc 1140
gggagccatc tgatcgcaaa catcaagacc acatatagat ccaagaaacc cgctaagaac 1200
ctcaagatgc ctggcgtcta ctatgtggac tacagactgg aaagaatcaa ggaggccaac 1260
aacgagacct acgtcgagca gcacgaggtg gcagtggcca gatactgcga cctccctagc 1320
aaactggggc acaagcttaa ttaattaatt aagaattcga cccagctttc ttgtacaaag 1380
tggttggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtag taatgagcta 1440
gcagtctcga ggttaacgaa ttccgccccc cccctaacgt tactggccga agccgcttgg 1500
aataaggccg gtgtgcgctt gtctatatgt tattttccac catattgccg tcttttggca 1560
atgtgagggc ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc 1620
ctctcgccaa aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag 1680
cttcttgaag acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg 1740
gcgacaggtg cccctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac 1800
aaccccagtg ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa 1860
gcgtattcaa caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc 1920
tggggcctcg gtgcacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc 1980
ccccgaacca cggggacgtg gttttccttt gaaaaacacg ataataccat ggccatgaaa 2040
aagcctgaac tcaccgcgac gtctgtcgag aagtttctga tcgaaaagtt cgacagcgtc 2100
tccgacctga tgcagctctc ggagggcgaa gaatctcgtg ctttcagctt cgatgtagga 2160
gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg gtttctacaa agatcgttat 2220
gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga cattggggaa 2280
tttagcgaga gcctgaccta ttgcctttca tacgagaccg agatcctgac tgtcgagtac 2340
ggattgcttc ctatcggcaa aatcgtggag aagaggattg aatgtaccgt ctattcagtc 2400
gataataatg ggaacatcta cacacagccc gtggctcaat ggcacgacag aggagagcag 2460
gaagtttttg aatactgtct cgaggacgga tccctcatcc gcgctactaa agatcataag 2520
tttatgaccg tggacggcca gatgctgcca attgacgaaa tttttgaacg agagctggat 2580
ctgatgagag tcgacaacct tccaaactga caccggtggc gcgttaagtc gacaatcaac 2640
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 2700
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 2760
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 2820
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 2880
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 2940
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 3000
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 3060
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 3120
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 3180
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 3240
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 3300
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 3360
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 3420
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 3480
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcagta cgtatagtag 3540
ttcatgtcat cttattattc agtatttata acttgcaaag aaatgaatat cagagagtga 3600
gaggaacttg tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt 3660
cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt 3720
atcttatcat gtctggctct agctatcccg cccctaactc cgcccatccc gcccctaact 3780
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 3840
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 3900
ctagggacgt acccaattcg ccctatagtg agtcgtatta cgcgcgctca ctggccgtcg 3960
ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 4020
atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 4080
agttgcgcag cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg 4140
gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt 4200
tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc 4260
gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg 4320
attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga 4380
cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc 4440
ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa 4500
aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa 4560
tttaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4620
acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4680
aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 4740
attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 4800
tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 4860
gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 4920
cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 4980
tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 5040
agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5100
tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5160
tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5220
tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5280
acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5340
accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5400
tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5460
cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5520
tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5580
actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5640
tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 5700
cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 5760
gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 5820
tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg ttcttctagt 5880
gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 5940
gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 6000
ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6060
acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg 6120
agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6180
cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6240
tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6300
gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6360
ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6420
ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6480
cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6540
ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6600
taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6660
tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct atgaccatga 6720
ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc tgcaagctta 6780
atgtagtctt atgcaatact cttgtagtct tgcaacatgg taacgatgag ttagcaacat 6840
gccttacaag gagagaaaaa gcaccgtgca tgccgattgg tggaagtaag gtggtacgat 6900
cgtgccttat taggaaggca acagacgggt ctgacatgga ttggacgaac cactgaattg 6960
ccgcattgca gagatattgt atttaagtgc ctagctcgat acataaacgg gtctctctgg 7020
ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct 7080
caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 7140
aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag tggcgcccga 7200
acagggactt gaaagcgaaa gggaaaccag aggagctctc tcgacgcagg actcggcttg 7260
ctgaagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca aaaattttga 7320
ctagcggagg ctagaaggag agagatgggt gcgagagcgt cagtattaag cgggggagaa 7380
ttagatcgcg atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa tataaattaa 7440
aacatatagt atgggcaagc agggagctag aacgattcgc agttaatcct ggcctgttag 7500
aaacatcaga aggctgtaga caaatactgg gacagctaca accatccctt cagacaggat 7560
cagaagaact tagatcatta tataatacag tagcaaccct ctattgtgtg catcaaagga 7620
tagagataaa agacaccaag gaagctttag acaagataga ggaagagcaa aacaaaagta 7680
agaccaccgc acagcaagcg gccgctgatc ttcagacctg gaggaggaga tatgagggac 7740
aattggagaa gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca 7800
cccaccaagg caaagagaag agtggtgcag agagaaaaaa gagcagtggg aataggagct 7860
ttgttccttg ggttcttggg agcagcagga agcactatgg gcgcagcgtc aatgacgctg 7920
acggtacagg ccagacaatt attgtctggt atagtgcagc agcagaacaa tttgctgagg 7980
gctattgagg cgcaacagca tctgttgcaa ctcacagtct ggggcatcaa gcagctccag 8040
gcaagaatcc tggctgtgga aagataccta aaggatcaac agctcctggg gatttggggt 8100
tgctctggaa aactcatttg caccactgct gtgccttgga atgctagttg gagtaataaa 8160
tctctggaac agatttggaa tcacacgacc tggatggagt gggacagaga aattaacaat 8220
tacacaagct taatacactc cttaattgaa gaatcgcaaa accagcaaga aaagaatgaa 8280
caagaattat tggaattaga taaatgggca agtttgtgga attggtttaa cataacaaat 8340
tggctgtggt atataaaatt attcataatg atagtaggag gcttggtagg tttaagaata 8400
gtttttgctg tactttctat agtgaataga gttaggcagg gatattcacc attatcgttt 8460
cagacccacc tcccaacccc gaggggaccc ttgcgccttt tccaaggcag ccctgggttt 8520
gcgcagggac gcggctgctc tgggcgtggt tccgggaaac gcagcggcgc cgaccctggg 8580
tctcgcacat tcttcacgtc cgttcgcagc gtcacccgga tcttcgccgc tacccttgtg 8640
ggccccccgg cgacgcttcc tgctccgccc ctaagtcggg aaggttcctt gcggttcgcg 8700
gcgtgccgga cgtgacaaac ggaagccgca cgtctcacta gtaccctcgc agacggacag 8760
cgccagggag caatggcagc gcgccgaccg cgatgggctg tggccaatag cggctgctca 8820
gcagggcgcg ccgagagcag cggccgggaa ggggcggtgc gggaggcggg gtgtggggcg 8880
gtagtgtggg ccctgttcct gcccgcgcgg tgttccgcat tctgcaagcc tccggagcgc 8940
acgtcggcag tcggctccct cgttgaccga atcaccgacc tctctcccca gggggtaccc 9000
agctgtctag agaattctag atcttgagac aaatggcagt attcatccac aattttaaaa 9060
gaaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 9120
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 9180
acagggacag cagagatcca ctttggcgcc ggctcgaggg gg 9222
<210> 107
<211> 9522
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 107
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gaggataaca tggccatcat caaggagttc atgcgcttca aggtgcacat ggagggctcc 720
gtgaacggcc acgagttcga gatcgagggc gagggcgagg gccgccccta cgagggcacc 780
cagaccgcca agctgaaggt gaccaagggt ggccccctgc ccttcgcctg ggacatcctg 840
tcccctcagt tcatgtacgg ctccaaggcc tacgtgaagc accccgccga catccccgac 900
tacttgaagc tgtccttccc cgagggcttc aagtgggagc gcgtgatgaa cttcgaggac 960
ggcggcgtgg tgaccgtgac ccaggactcc tccctgcagg acggcgagtt catctacaag 1020
gtgaagctgc gcggcaccaa cttcccctcc gacggccccg taatgcagaa gaagaccatg 1080
ggctgggagg cctcctccga gcggatgtac cccgaggacg gcgccctgaa gggcgagatc 1140
aagcagaggc tgaagctgaa ggacggcggc cactacgacg ctgaggtcaa gaccacctac 1200
aaggccaaga agcccgtgca gctgcccggc gcctacaacg tcaacattaa gttggacatc 1260
acctcccaca acgaggacta caccatcgtg gaacagtacg aacgcgccga gggccgccac 1320
tccaccggcg gcatggacga gctgtacaag tgattaatta agaattcgac ccagctttct 1380
tgtacaaagt ggttggtaag cctatcccta accctctcct cggtctcgat tctacgtagt 1440
aatgagctag cagtctcgag gttaacgaat tccgcccccc ccctaacgtt actggccgaa 1500
gccgcttgga ataaggccgg tgtgcgcttg tctatatgtt attttccacc atattgccgt 1560
cttttggcaa tgtgagggcc cggaaacctg gccctgtctt cttgacgagc attcctaggg 1620
gtctttcccc tctcgccaaa ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc 1680
ctctggaagc ttcttgaaga caaacaacgt ctgtagcgac cctttgcagg cagcggaacc 1740
ccccacctgg cgacaggtgc ccctgcggcc aaaagccacg tgtataagat acacctgcaa 1800
aggcggcaca accccagtgc cacgttgtga gttggatagt tgtggaaaga gtcaaatggc 1860
tctcctcaag cgtattcaac aaggggctga aggatgccca gaaggtaccc cattgtatgg 1920
gatctgatct ggggcctcgg tgcacatgct ttacatgtgt ttagtcgagg ttaaaaaaac 1980
gtctaggccc cccgaaccac ggggacgtgg ttttcctttg aaaaacacga taataccatg 2040
gccatgatta agatcgctac gcggaagtac ctggggaaac agaacgtcta cgacataggt 2100
gtggagcgcg atcacaactt tgctctgaaa aatggattta tcgccagcaa ctgcatctcc 2160
cgccgtgcac agggtgtcac gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg 2220
cagccggtcg cggaggccat ggatgcgatc gctgcggccg atcttagcca gacgagcggg 2280
ttcggcccat tcggaccgca aggaatcggt caatacacta catggcgtga tttcatatgc 2340
gcgattgctg atccccatgt gtatcactgg caaactgtga tggacgacac cgtcagtgcg 2400
tccgtcgcgc aggctctcga tgagctgatg ctttgggccg aggactgccc cgaagtccgg 2460
cacctcgtgc acgcggattt cggctccaac aatgtcctga cggacaatgg ccgcataaca 2520
gcggtcattg actggagcga ggcgatgttc ggggattccc aatacgaggt cgccaacatc 2580
ttcttctgga ggccgtggtt ggcttgtatg gagcagcaga cgcgctactt cgagcggagg 2640
catccggagc ttgcaggatc gccgcggctc cgggcgtata tgctccgcat tggtcttgac 2700
caactctatc agagcttggt tgacggcaat ttcgatgatg cagcttgggc gcagggtcga 2760
tgcgacgcaa tcgtccgatc cggagccggg actgtcgggc gtacacaaat cgcccgcaga 2820
agcgcggccg tctggaccga tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc 2880
cccagcactc gtccgagggc aaaggaatag caccggtggc gcgttaagtc gacaatcaac 2940
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 3000
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 3060
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 3120
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 3180
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 3240
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 3300
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 3360
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 3420
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 3480
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 3540
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 3600
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 3660
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 3720
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 3780
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcagta cgtatagtag 3840
ttcatgtcat cttattattc agtatttata acttgcaaag aaatgaatat cagagagtga 3900
gaggaacttg tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt 3960
cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt 4020
atcttatcat gtctggctct agctatcccg cccctaactc cgcccatccc gcccctaact 4080
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 4140
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 4200
ctagggacgt acccaattcg ccctatagtg agtcgtatta cgcgcgctca ctggccgtcg 4260
ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 4320
atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 4380
agttgcgcag cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg 4440
gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt 4500
tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc 4560
gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg 4620
attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga 4680
cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc 4740
ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa 4800
aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa 4860
tttaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4920
acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4980
aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 5040
attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 5100
tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 5160
gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 5220
cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 5280
tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 5340
agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5400
tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5460
tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5520
tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5580
acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5640
accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5700
tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5760
cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5820
tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5880
actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5940
tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 6000
cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 6060
gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 6120
tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg ttcttctagt 6180
gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 6240
gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 6300
ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6360
acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg 6420
agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6480
cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6540
tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6600
gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6660
ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6720
ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6780
cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6840
ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6900
taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6960
tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct atgaccatga 7020
ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc tgcaagctta 7080
atgtagtctt atgcaatact cttgtagtct tgcaacatgg taacgatgag ttagcaacat 7140
gccttacaag gagagaaaaa gcaccgtgca tgccgattgg tggaagtaag gtggtacgat 7200
cgtgccttat taggaaggca acagacgggt ctgacatgga ttggacgaac cactgaattg 7260
ccgcattgca gagatattgt atttaagtgc ctagctcgat acataaacgg gtctctctgg 7320
ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct 7380
caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 7440
aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag tggcgcccga 7500
acagggactt gaaagcgaaa gggaaaccag aggagctctc tcgacgcagg actcggcttg 7560
ctgaagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca aaaattttga 7620
ctagcggagg ctagaaggag agagatgggt gcgagagcgt cagtattaag cgggggagaa 7680
ttagatcgcg atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa tataaattaa 7740
aacatatagt atgggcaagc agggagctag aacgattcgc agttaatcct ggcctgttag 7800
aaacatcaga aggctgtaga caaatactgg gacagctaca accatccctt cagacaggat 7860
cagaagaact tagatcatta tataatacag tagcaaccct ctattgtgtg catcaaagga 7920
tagagataaa agacaccaag gaagctttag acaagataga ggaagagcaa aacaaaagta 7980
agaccaccgc acagcaagcg gccgctgatc ttcagacctg gaggaggaga tatgagggac 8040
aattggagaa gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca 8100
cccaccaagg caaagagaag agtggtgcag agagaaaaaa gagcagtggg aataggagct 8160
ttgttccttg ggttcttggg agcagcagga agcactatgg gcgcagcgtc aatgacgctg 8220
acggtacagg ccagacaatt attgtctggt atagtgcagc agcagaacaa tttgctgagg 8280
gctattgagg cgcaacagca tctgttgcaa ctcacagtct ggggcatcaa gcagctccag 8340
gcaagaatcc tggctgtgga aagataccta aaggatcaac agctcctggg gatttggggt 8400
tgctctggaa aactcatttg caccactgct gtgccttgga atgctagttg gagtaataaa 8460
tctctggaac agatttggaa tcacacgacc tggatggagt gggacagaga aattaacaat 8520
tacacaagct taatacactc cttaattgaa gaatcgcaaa accagcaaga aaagaatgaa 8580
caagaattat tggaattaga taaatgggca agtttgtgga attggtttaa cataacaaat 8640
tggctgtggt atataaaatt attcataatg atagtaggag gcttggtagg tttaagaata 8700
gtttttgctg tactttctat agtgaataga gttaggcagg gatattcacc attatcgttt 8760
cagacccacc tcccaacccc gaggggaccc ttgcgccttt tccaaggcag ccctgggttt 8820
gcgcagggac gcggctgctc tgggcgtggt tccgggaaac gcagcggcgc cgaccctggg 8880
tctcgcacat tcttcacgtc cgttcgcagc gtcacccgga tcttcgccgc tacccttgtg 8940
ggccccccgg cgacgcttcc tgctccgccc ctaagtcggg aaggttcctt gcggttcgcg 9000
gcgtgccgga cgtgacaaac ggaagccgca cgtctcacta gtaccctcgc agacggacag 9060
cgccagggag caatggcagc gcgccgaccg cgatgggctg tggccaatag cggctgctca 9120
gcagggcgcg ccgagagcag cggccgggaa ggggcggtgc gggaggcggg gtgtggggcg 9180
gtagtgtggg ccctgttcct gcccgcgcgg tgttccgcat tctgcaagcc tccggagcgc 9240
acgtcggcag tcggctccct cgttgaccga atcaccgacc tctctcccca gggggtaccc 9300
agctgtctag agaattctag atcttgagac aaatggcagt attcatccac aattttaaaa 9360
gaaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 9420
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 9480
acagggacag cagagatcca ctttggcgcc ggctcgaggg gg 9522
<210> 108
<211> 10232
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 108
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggcca tgaccgagta caagcccacg gtgcgcctcg ccacccgcga cgacgtcccc 3000
agggccgtac gcaccctcgc cgccgcgttc gccgactacc ccgccacgcg ccacaccgtc 3060
gatccggacc gccacatcga gcgggtcacc gagctgcaag aactcttcct cacgcgcgtc 3120
gggctcgaca tcggcaaggt gtgggtcgcg gacgacggcg ccgcggtggc ggtctggacc 3180
acgccggaga gcgtcgaagc gggggcggtg ttcgccgaga tcggcccgcg catggccgag 3240
ttgagcggtt cccggctggc cgcgcagcaa cagatggaag gcctcctggc gccgcaccgg 3300
cccaagtgcc tttcatacga gaccgagatc ctgactgtcg agtacggatt gcttcctatc 3360
ggcaaaatcg tggagaagag gattgaatgt accgtctatt cagtcgataa taatgggaac 3420
atctacacac agcccgtggc tcaatggcac gacagaggag agcaggaagt ttttgaatac 3480
tgtctcgagg acggatccct catccgcgct actaaagatc ataagtttat gaccgtggac 3540
ggccagatgc tgccaattga cgaaattttt gaacgagagc tggatctgat gagagtcgac 3600
aaccttccaa actgagaatt caccggtggc gcgttaagtc gacaatcaac ctctggatta 3660
caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg 3720
atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc 3780
ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca 3840
acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac 3900
cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact 3960
catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc 4020
cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg ttgccacctg 4080
gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc 4140
ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac 4200
gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat gacttacaag 4260
gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg gctaattcac 4320
tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt agaccagatc 4380
tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca ataaagcttg 4440
ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc 4500
ctcagaccct tttagtcagt gtggaaaatc tctagcagta cgtatagtag ttcatgtcat 4560
cttattattc agtatttata acttgcaaag aaatgaatat cagagagtga gaggaacttg 4620
tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt cacaaataaa 4680
gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt atcttatcat 4740
gtctggctct agctatcccg cccctaactc cgcccatccc gcccctaact ccgcccagtt 4800
ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag gccgaggccg 4860
cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc ctagggacgt 4920
acccaattcg ccctatagtg agtcgtatta cgcgcgctca ctggccgtcg ttttacaacg 4980
tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt 5040
cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag 5100
cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt 5160
tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt 5220
cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc 5280
tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg attagggtga 5340
tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc 5400
cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggt 5460
ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct 5520
gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa tttaggtggc 5580
acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat 5640
atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag 5700
agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt 5760
cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt 5820
gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga gagttttcgc 5880
cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta 5940
tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac 6000
ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa 6060
ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg 6120
atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 6180
cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg 6240
atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta 6300
gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg accacttctg 6360
cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg 6420
tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc 6480
tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt 6540
gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt 6600
gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc 6660
atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag 6720
atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa 6780
aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg 6840
aaggtaactg gcttcagcag agcgcagata ccaaatactg ttcttctagt gtagccgtag 6900
ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg 6960
ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga 7020
tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc 7080
ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc 7140
acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga 7200
gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt 7260
cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg 7320
aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac 7380
atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga 7440
gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg 7500
gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc 7560
tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt 7620
tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt 7680
ggaattgtga gcggataaca atttcacaca ggaaacagct atgaccatga ttacgccaag 7740
cgcgcaatta accctcacta aagggaacaa aagctggagc tgcaagctta atgtagtctt 7800
atgcaatact cttgtagtct tgcaacatgg taacgatgag ttagcaacat gccttacaag 7860
gagagaaaaa gcaccgtgca tgccgattgg tggaagtaag gtggtacgat cgtgccttat 7920
taggaaggca acagacgggt ctgacatgga ttggacgaac cactgaattg ccgcattgca 7980
gagatattgt atttaagtgc ctagctcgat acataaacgg gtctctctgg ttagaccaga 8040
tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct 8100
tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat 8160
ccctcagacc cttttagtca gtgtggaaaa tctctagcag tggcgcccga acagggactt 8220
gaaagcgaaa gggaaaccag aggagctctc tcgacgcagg actcggcttg ctgaagcgcg 8280
cacggcaaga ggcgaggggc ggcgactggt gagtacgcca aaaattttga ctagcggagg 8340
ctagaaggag agagatgggt gcgagagcgt cagtattaag cgggggagaa ttagatcgcg 8400
atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa tataaattaa aacatatagt 8460
atgggcaagc agggagctag aacgattcgc agttaatcct ggcctgttag aaacatcaga 8520
aggctgtaga caaatactgg gacagctaca accatccctt cagacaggat cagaagaact 8580
tagatcatta tataatacag tagcaaccct ctattgtgtg catcaaagga tagagataaa 8640
agacaccaag gaagctttag acaagataga ggaagagcaa aacaaaagta agaccaccgc 8700
acagcaagcg gccgctgatc ttcagacctg gaggaggaga tatgagggac aattggagaa 8760
gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg 8820
caaagagaag agtggtgcag agagaaaaaa gagcagtggg aataggagct ttgttccttg 8880
ggttcttggg agcagcagga agcactatgg gcgcagcgtc aatgacgctg acggtacagg 8940
ccagacaatt attgtctggt atagtgcagc agcagaacaa tttgctgagg gctattgagg 9000
cgcaacagca tctgttgcaa ctcacagtct ggggcatcaa gcagctccag gcaagaatcc 9060
tggctgtgga aagataccta aaggatcaac agctcctggg gatttggggt tgctctggaa 9120
aactcatttg caccactgct gtgccttgga atgctagttg gagtaataaa tctctggaac 9180
agatttggaa tcacacgacc tggatggagt gggacagaga aattaacaat tacacaagct 9240
taatacactc cttaattgaa gaatcgcaaa accagcaaga aaagaatgaa caagaattat 9300
tggaattaga taaatgggca agtttgtgga attggtttaa cataacaaat tggctgtggt 9360
atataaaatt attcataatg atagtaggag gcttggtagg tttaagaata gtttttgctg 9420
tactttctat agtgaataga gttaggcagg gatattcacc attatcgttt cagacccacc 9480
tcccaacccc gaggggaccc ttgcgccttt tccaaggcag ccctgggttt gcgcagggac 9540
gcggctgctc tgggcgtggt tccgggaaac gcagcggcgc cgaccctggg tctcgcacat 9600
tcttcacgtc cgttcgcagc gtcacccgga tcttcgccgc tacccttgtg ggccccccgg 9660
cgacgcttcc tgctccgccc ctaagtcggg aaggttcctt gcggttcgcg gcgtgccgga 9720
cgtgacaaac ggaagccgca cgtctcacta gtaccctcgc agacggacag cgccagggag 9780
caatggcagc gcgccgaccg cgatgggctg tggccaatag cggctgctca gcagggcgcg 9840
ccgagagcag cggccgggaa ggggcggtgc gggaggcggg gtgtggggcg gtagtgtggg 9900
ccctgttcct gcccgcgcgg tgttccgcat tctgcaagcc tccggagcgc acgtcggcag 9960
tcggctccct cgttgaccga atcaccgacc tctctcccca gggggtaccc agctgtctag 10020
agaattctag atcttgagac aaatggcagt attcatccac aattttaaaa gaaaaggggg 10080
gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac 10140
taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag 10200
cagagatcca ctttggcgcc ggctcgaggg gg 10232
<210> 109
<211> 223
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 109
Met Ala Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp
1 5 10 15
Asp Val Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr
20 25 30
Pro Ala Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val
35 40 45
Thr Glu Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly
50 55 60
Lys Val Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr
65 70 75 80
Pro Glu Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg
85 90 95
Met Ala Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu
100 105 110
Gly Leu Leu Ala Pro His Arg Pro Lys Cys Leu Ser Tyr Glu Thr Glu
115 120 125
Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu
130 135 140
Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile
145 150 155 160
Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val
165 170 175
Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp
180 185 190
His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile
195 200 205
Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
210 215 220
<210> 110
<211> 9920
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 110
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggcca tgattaagat cgctacgcgg aagtacctgg ggaaacagaa cgtctacgac 3000
ataggtgtgg agcgcgatca caactttgct ctgaaaaatg gatttatcgc cagcaactgc 3060
gagcccgcgt ggttcctggc caccgtcggc gtctcgcccg accaccaggg caagggtctg 3120
ggcagcgccg tcgtgctccc cggagtggag gcggccgagc gcgccggggt gcccgccttc 3180
ctggagacct ccgcgccccg caacctcccc ttctacgagc ggctcggctt caccgtcacc 3240
gccgacgtcg aggtgcccga aggaccgcgc acctggtgca tgacccgcaa gcccggtgcc 3300
tgagaattca ccggtggcgc gttaagtcga caatcaacct ctggattaca aaatttgtga 3360
aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt 3420
aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa 3480
atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt 3540
gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct 3600
cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg 3660
ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc 3720
ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga ttctgcgcgg 3780
gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct 3840
gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc 3900
cctttgggcc gcctccccgc gtcgacttta agaccaatga cttacaaggc agctgtagat 3960
cttagccact ttttaaaaga aaagggggga ctggaagggc taattcactc ccaacgaaga 4020
caagatctgc tttttgcttg tactgggtct ctctggttag accagatctg agcctgggag 4080
ctctctggct aactagggaa cccactgctt aagcctcaat aaagcttgcc ttgagtgctt 4140
caagtagtgt gtgcccgtct gttgtgtgac tctggtaact agagatccct cagacccttt 4200
tagtcagtgt ggaaaatctc tagcagtacg tatagtagtt catgtcatct tattattcag 4260
tatttataac ttgcaaagaa atgaatatca gagagtgaga ggaacttgtt tattgcagct 4320
tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc atttttttca 4380
ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctggctctag 4440
ctatcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc gcccattctc 4500
cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc tcggcctctg 4560
agctattcca gaagtagtga ggaggctttt ttggaggcct agggacgtac ccaattcgcc 4620
ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc gtgactggga 4680
aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 4740
taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 4800
atgggacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 4860
gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 4920
cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 4980
atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag 5040
tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 5100
tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga 5160
tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 5220
atttaacgcg aattttaaca aaatattaac gcttacaatt taggtggcac ttttcgggga 5280
aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc 5340
atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt 5400
caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct 5460
cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt 5520
tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt 5580
tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac 5640
gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac 5700
tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct 5760
gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg 5820
aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg 5880
gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca 5940
atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa 6000
caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt 6060
ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc 6120
attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg 6180
agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt 6240
aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt 6300
catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc 6360
ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct 6420
tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta 6480
ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc 6540
ttcagcagag cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac 6600
ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct 6660
gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat 6720
aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg 6780
acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa 6840
gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg 6900
gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga 6960
cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc 7020
aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct 7080
gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct 7140
cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca 7200
atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg 7260
tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta gctcactcat 7320
taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc 7380
ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagcg cgcaattaac 7440
cctcactaaa gggaacaaaa gctggagctg caagcttaat gtagtcttat gcaatactct 7500
tgtagtcttg caacatggta acgatgagtt agcaacatgc cttacaagga gagaaaaagc 7560
accgtgcatg ccgattggtg gaagtaaggt ggtacgatcg tgccttatta ggaaggcaac 7620
agacgggtct gacatggatt ggacgaacca ctgaattgcc gcattgcaga gatattgtat 7680
ttaagtgcct agctcgatac ataaacgggt ctctctggtt agaccagatc tgagcctggg 7740
agctctctgg ctaactaggg aacccactgc ttaagcctca ataaagcttg ccttgagtgc 7800
ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc ctcagaccct 7860
tttagtcagt gtggaaaatc tctagcagtg gcgcccgaac agggacttga aagcgaaagg 7920
gaaaccagag gagctctctc gacgcaggac tcggcttgct gaagcgcgca cggcaagagg 7980
cgaggggcgg cgactggtga gtacgccaaa aattttgact agcggaggct agaaggagag 8040
agatgggtgc gagagcgtca gtattaagcg ggggagaatt agatcgcgat gggaaaaaat 8100
tcggttaagg ccagggggaa agaaaaaata taaattaaaa catatagtat gggcaagcag 8160
ggagctagaa cgattcgcag ttaatcctgg cctgttagaa acatcagaag gctgtagaca 8220
aatactggga cagctacaac catcccttca gacaggatca gaagaactta gatcattata 8280
taatacagta gcaaccctct attgtgtgca tcaaaggata gagataaaag acaccaagga 8340
agctttagac aagatagagg aagagcaaaa caaaagtaag accaccgcac agcaagcggc 8400
cgctgatctt cagacctgga ggaggagata tgagggacaa ttggagaagt gaattatata 8460
aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca aagagaagag 8520
tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg ttcttgggag 8580
cagcaggaag cactatgggc gcagcgtcaa tgacgctgac ggtacaggcc agacaattat 8640
tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg caacagcatc 8700
tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg gctgtggaaa 8760
gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa ctcatttgca 8820
ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag atttggaatc 8880
acacgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta atacactcct 8940
taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg gaattagata 9000
aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat ataaaattat 9060
tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta ctttctatag 9120
tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc ccaaccccga 9180
ggggaccctt gcgccttttc caaggcagcc ctgggtttgc gcagggacgc ggctgctctg 9240
ggcgtggttc cgggaaacgc agcggcgccg accctgggtc tcgcacattc ttcacgtccg 9300
ttcgcagcgt cacccggatc ttcgccgcta cccttgtggg ccccccggcg acgcttcctg 9360
ctccgcccct aagtcgggaa ggttccttgc ggttcgcggc gtgccggacg tgacaaacgg 9420
aagccgcacg tctcactagt accctcgcag acggacagcg ccagggagca atggcagcgc 9480
gccgaccgcg atgggctgtg gccaatagcg gctgctcagc agggcgcgcc gagagcagcg 9540
gccgggaagg ggcggtgcgg gaggcggggt gtggggcggt agtgtgggcc ctgttcctgc 9600
ccgcgcggtg ttccgcattc tgcaagcctc cggagcgcac gtcggcagtc ggctccctcg 9660
ttgaccgaat caccgacctc tctccccagg gggtacccag ctgtctagag aattctagat 9720
cttgagacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga ttggggggta 9780
cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta aagaattaca 9840
aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca gagatccact 9900
ttggcgccgg ctcgaggggg 9920
<210> 111
<211> 119
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 111
Met Ala Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn
1 5 10 15
Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn
20 25 30
Gly Phe Ile Ala Ser Asn Cys Glu Pro Ala Trp Phe Leu Ala Thr Val
35 40 45
Gly Val Ser Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val Val
50 55 60
Leu Pro Gly Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe Leu
65 70 75 80
Glu Thr Ser Ala Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly Phe
85 90 95
Thr Val Thr Ala Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp Cys
100 105 110
Met Thr Arg Lys Pro Gly Ala
115
<210> 112
<211> 9317
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 112
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagcga gctgattaag 660
gagaacatgc acatgaagct gtacatggag ggcaccgtgg acaaccatca cttcaagtgc 720
acatccgagg gcgaaggcaa gccctacgag ggcacccaga ccatgagaat caaggtggtc 780
gagggcggcc ctctcccctt cgccttcgac atcctggcta ctagcttcct ctacggcagc 840
aagaccttca tcaaccacac ccagggcatc cccgacttct tcaagcagtc cttccctgag 900
ggcttcacat gggagagagt caccacatac gaagacgggg gcgtgctgac cgctacccag 960
gacaccagcc tccaggacgg ctgcctcatc tacaacgtca agatcagagg ggtgaacttc 1020
acatccaacg gccctgtgat gcagaagaaa acactcggct gggaggcctt caccgagacg 1080
ctgtaccccg ctgacggcgg cctggaaggc agaaacgaca tggccctgaa gctcgtgggc 1140
gggagccatc tgatcgcaaa catcaagacc acatatagat ccaagaaacc cgctaagaac 1200
ctcaagatgc ctggcgtcta ctatgtggac tacagactgg aaagaatcaa ggaggccaac 1260
aacgagacct acgtcgagca gcacgaggtg gcagtggcca gatactgcga cctccctagc 1320
aaactggggc acaagcttaa ttaattaatt aagaattcga cccagctttc ttgtacaaag 1380
tggttggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtag taatgagcta 1440
gcagtctcga ggttaacgaa ttccgccccc cccctaacgt tactggccga agccgcttgg 1500
aataaggccg gtgtgcgctt gtctatatgt tattttccac catattgccg tcttttggca 1560
atgtgagggc ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc 1620
ctctcgccaa aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag 1680
cttcttgaag acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg 1740
gcgacaggtg cccctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac 1800
aaccccagtg ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa 1860
gcgtattcaa caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc 1920
tggggcctcg gtgcacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc 1980
ccccgaacca cggggacgtg gttttccttt gaaaaacacg ataataccat ggccatgacc 2040
gagtacaagc ccacggtgcg cctcgccacc cgcgacgacg tccccagggc cgtacgcacc 2100
ctcgccgccg cgttcgccga ctaccccgcc acgcgccaca ccgtcgatcc ggaccgccac 2160
atcgagcggg tcaccgagct gcaagaactc ttcctcacgc gcgtcgggct cgacatcggc 2220
aaggtgtggg tcgcggacga cggcgccgcg gtggcggtct ggaccacgcc ggagagcgtc 2280
gaagcggggg cggtgttcgc cgagatcggc ccgcgcatgg ccgagttgag cggttcccgg 2340
ctggccgcgc agcaacagat ggaaggcctc ctggcgccgc accggcccaa gtgcctttca 2400
tacgagaccg agatcctgac tgtcgagtac ggattgcttc ctatcggcaa aatcgtggag 2460
aagaggattg aatgtaccgt ctattcagtc gataataatg ggaacatcta cacacagccc 2520
gtggctcaat ggcacgacag aggagagcag gaagtttttg aatactgtct cgaggacgga 2580
tccctcatcc gcgctactaa agatcataag tttatgaccg tggacggcca gatgctgcca 2640
attgacgaaa tttttgaacg agagctggat ctgatgagag tcgacaacct tccaaactga 2700
gaattcaccg gtggcgcgtt aagtcgacaa tcaacctctg gattacaaaa tttgtgaaag 2760
attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat 2820
gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct tgtataaatc 2880
ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg 2940
cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct gtcagctcct 3000
ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg ccgcctgcct 3060
tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg tgttgtcggg 3120
gaaatcatcg tcctttcctt ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac 3180
gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc gcggcctgct 3240
gccggctctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc ggatctccct 3300
ttgggccgcc tccccgcgtc gactttaaga ccaatgactt acaaggcagc tgtagatctt 3360
agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca acgaagacaa 3420
gatctgcttt ttgcttgtac tgggtctctc tggttagacc agatctgagc ctgggagctc 3480
tctggctaac tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttcaa 3540
gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga gatccctcag acccttttag 3600
tcagtgtgga aaatctctag cagtacgtat agtagttcat gtcatcttat tattcagtat 3660
ttataacttg caaagaaatg aatatcagag agtgagagga acttgtttat tgcagcttat 3720
aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt tttttcactg 3780
cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatgtctg gctctagcta 3840
tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc cattctccgc 3900
cccatggctg actaattttt tttatttatg cagaggccga ggccgcctcg gcctctgagc 3960
tattccagaa gtagtgagga ggcttttttg gaggcctagg gacgtaccca attcgcccta 4020
tagtgagtcg tattacgcgc gctcactggc cgtcgtttta caacgtcgtg actgggaaaa 4080
ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa 4140
tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg 4200
ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac 4260
cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc 4320
cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt 4380
tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt cacgtagtgg 4440
gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag 4500
tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt 4560
ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaaaatt 4620
taacgcgaat tttaacaaaa tattaacgct tacaatttag gtggcacttt tcggggaaat 4680
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 4740
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 4800
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 4860
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 4920
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 4980
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc 5040
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 5100
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 5160
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 5220
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 5280
ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg 5340
gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 5400
ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 5460
gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt 5520
gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 5580
caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 5640
cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 5700
ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 5760
taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 5820
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 5880
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 5940
agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagg ccaccacttc 6000
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 6060
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 6120
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 6180
tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 6240
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 6300
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 6360
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 6420
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg 6480
ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc 6540
cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcccaata 6600
cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca cgacaggttt 6660
cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct cactcattag 6720
gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga 6780
taacaatttc acacaggaaa cagctatgac catgattacg ccaagcgcgc aattaaccct 6840
cactaaaggg aacaaaagct ggagctgcaa gcttaatgta gtcttatgca atactcttgt 6900
agtcttgcaa catggtaacg atgagttagc aacatgcctt acaaggagag aaaaagcacc 6960
gtgcatgccg attggtggaa gtaaggtggt acgatcgtgc cttattagga aggcaacaga 7020
cgggtctgac atggattgga cgaaccactg aattgccgca ttgcagagat attgtattta 7080
agtgcctagc tcgatacata aacgggtctc tctggttaga ccagatctga gcctgggagc 7140
tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc 7200
aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt 7260
agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag cgaaagggaa 7320
accagaggag ctctctcgac gcaggactcg gcttgctgaa gcgcgcacgg caagaggcga 7380
ggggcggcga ctggtgagta cgccaaaaat tttgactagc ggaggctaga aggagagaga 7440
tgggtgcgag agcgtcagta ttaagcgggg gagaattaga tcgcgatggg aaaaaattcg 7500
gttaaggcca gggggaaaga aaaaatataa attaaaacat atagtatggg caagcaggga 7560
gctagaacga ttcgcagtta atcctggcct gttagaaaca tcagaaggct gtagacaaat 7620
actgggacag ctacaaccat cccttcagac aggatcagaa gaacttagat cattatataa 7680
tacagtagca accctctatt gtgtgcatca aaggatagag ataaaagaca ccaaggaagc 7740
tttagacaag atagaggaag agcaaaacaa aagtaagacc accgcacagc aagcggccgc 7800
tgatcttcag acctggagga ggagatatga gggacaattg gagaagtgaa ttatataaat 7860
ataaagtagt aaaaattgaa ccattaggag tagcacccac caaggcaaag agaagagtgg 7920
tgcagagaga aaaaagagca gtgggaatag gagctttgtt ccttgggttc ttgggagcag 7980
caggaagcac tatgggcgca gcgtcaatga cgctgacggt acaggccaga caattattgt 8040
ctggtatagt gcagcagcag aacaatttgc tgagggctat tgaggcgcaa cagcatctgt 8100
tgcaactcac agtctggggc atcaagcagc tccaggcaag aatcctggct gtggaaagat 8160
acctaaagga tcaacagctc ctggggattt ggggttgctc tggaaaactc atttgcacca 8220
ctgctgtgcc ttggaatgct agttggagta ataaatctct ggaacagatt tggaatcaca 8280
cgacctggat ggagtgggac agagaaatta acaattacac aagcttaata cactccttaa 8340
ttgaagaatc gcaaaaccag caagaaaaga atgaacaaga attattggaa ttagataaat 8400
gggcaagttt gtggaattgg tttaacataa caaattggct gtggtatata aaattattca 8460
taatgatagt aggaggcttg gtaggtttaa gaatagtttt tgctgtactt tctatagtga 8520
atagagttag gcagggatat tcaccattat cgtttcagac ccacctccca accccgaggg 8580
gacccttgcg ccttttccaa ggcagccctg ggtttgcgca gggacgcggc tgctctgggc 8640
gtggttccgg gaaacgcagc ggcgccgacc ctgggtctcg cacattcttc acgtccgttc 8700
gcagcgtcac ccggatcttc gccgctaccc ttgtgggccc cccggcgacg cttcctgctc 8760
cgcccctaag tcgggaaggt tccttgcggt tcgcggcgtg ccggacgtga caaacggaag 8820
ccgcacgtct cactagtacc ctcgcagacg gacagcgcca gggagcaatg gcagcgcgcc 8880
gaccgcgatg ggctgtggcc aatagcggct gctcagcagg gcgcgccgag agcagcggcc 8940
gggaaggggc ggtgcgggag gcggggtgtg gggcggtagt gtgggccctg ttcctgcccg 9000
cgcggtgttc cgcattctgc aagcctccgg agcgcacgtc ggcagtcggc tccctcgttg 9060
accgaatcac cgacctctct ccccaggggg tacccagctg tctagagaat tctagatctt 9120
gagacaaatg gcagtattca tccacaattt taaaagaaaa ggggggattg gggggtacag 9180
tgcaggggaa agaatagtag acataatagc aacagacata caaactaaag aattacaaaa 9240
acaaattaca aaaattcaaa attttcgggt ttattacagg gacagcagag atccactttg 9300
gcgccggctc gaggggg 9317
<210> 113
<211> 9014
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 113
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gaggataaca tggccatcat caaggagttc atgcgcttca aggtgcacat ggagggctcc 720
gtgaacggcc acgagttcga gatcgagggc gagggcgagg gccgccccta cgagggcacc 780
cagaccgcca agctgaaggt gaccaagggt ggccccctgc ccttcgcctg ggacatcctg 840
tcccctcagt tcatgtacgg ctccaaggcc tacgtgaagc accccgccga catccccgac 900
tacttgaagc tgtccttccc cgagggcttc aagtgggagc gcgtgatgaa cttcgaggac 960
ggcggcgtgg tgaccgtgac ccaggactcc tccctgcagg acggcgagtt catctacaag 1020
gtgaagctgc gcggcaccaa cttcccctcc gacggccccg taatgcagaa gaagaccatg 1080
ggctgggagg cctcctccga gcggatgtac cccgaggacg gcgccctgaa gggcgagatc 1140
aagcagaggc tgaagctgaa ggacggcggc cactacgacg ctgaggtcaa gaccacctac 1200
aaggccaaga agcccgtgca gctgcccggc gcctacaacg tcaacattaa gttggacatc 1260
acctcccaca acgaggacta caccatcgtg gaacagtacg aacgcgccga gggccgccac 1320
tccaccggcg gcatggacga gctgtacaag tgattaatta agaattcgac ccagctttct 1380
tgtacaaagt ggttggtaag cctatcccta accctctcct cggtctcgat tctacgtagt 1440
aatgagctag cagtctcgag gttaacgaat tccgcccccc ccctaacgtt actggccgaa 1500
gccgcttgga ataaggccgg tgtgcgcttg tctatatgtt attttccacc atattgccgt 1560
cttttggcaa tgtgagggcc cggaaacctg gccctgtctt cttgacgagc attcctaggg 1620
gtctttcccc tctcgccaaa ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc 1680
ctctggaagc ttcttgaaga caaacaacgt ctgtagcgac cctttgcagg cagcggaacc 1740
ccccacctgg cgacaggtgc ccctgcggcc aaaagccacg tgtataagat acacctgcaa 1800
aggcggcaca accccagtgc cacgttgtga gttggatagt tgtggaaaga gtcaaatggc 1860
tctcctcaag cgtattcaac aaggggctga aggatgccca gaaggtaccc cattgtatgg 1920
gatctgatct ggggcctcgg tgcacatgct ttacatgtgt ttagtcgagg ttaaaaaaac 1980
gtctaggccc cccgaaccac ggggacgtgg ttttcctttg aaaaacacga taataccatg 2040
gccatgatta agatcgctac gcggaagtac ctggggaaac agaacgtcta cgacataggt 2100
gtggagcgcg atcacaactt tgctctgaaa aatggattta tcgccagcaa ctgcgagccc 2160
gcgtggttcc tggccaccgt cggcgtctcg cccgaccacc agggcaaggg tctgggcagc 2220
gccgtcgtgc tccccggagt ggaggcggcc gagcgcgccg gggtgcccgc cttcctggag 2280
acctccgcgc cccgcaacct ccccttctac gagcggctcg gcttcaccgt caccgccgac 2340
gtcgaggtgc ccgaaggacc gcgcacctgg tgcatgaccc gcaagcccgg tgcctgagaa 2400
ttcaccggtg gcgcgttaag tcgacaatca acctctggat tacaaaattt gtgaaagatt 2460
gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg ctttaatgcc 2520
tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt ataaatcctg 2580
gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac 2640
tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc agctcctttc 2700
cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg cctgccttgc 2760
ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt tgtcggggaa 2820
atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc gcgggacgtc 2880
cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc 2940
ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga tctccctttg 3000
ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt agatcttagc 3060
cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg aagacaagat 3120
ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg ggagctctct 3180
ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt gcttcaagta 3240
gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc cttttagtca 3300
gtgtggaaaa tctctagcag tacgtatagt agttcatgtc atcttattat tcagtattta 3360
taacttgcaa agaaatgaat atcagagagt gagaggaact tgtttattgc agcttataat 3420
ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat 3480
tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggct ctagctatcc 3540
cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc 3600
atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc tctgagctat 3660
tccagaagta gtgaggaggc ttttttggag gcctagggac gtacccaatt cgccctatag 3720
tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact gggaaaaccc 3780
tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 3840
cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggga 3900
cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc 3960
tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct ttctcgccac 4020
gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt tccgatttag 4080
tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac gtagtgggcc 4140
atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct ttaatagtgg 4200
actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt ttgatttata 4260
agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac aaaaatttaa 4320
cgcgaatttt aacaaaatat taacgcttac aatttaggtg gcacttttcg gggaaatgtg 4380
cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga 4440
caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat 4500
ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca 4560
gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc 4620
gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca 4680
atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg 4740
caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca 4800
gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata 4860
accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag 4920
ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg 4980
gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca 5040
acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta 5100
atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct 5160
ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca 5220
gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag 5280
gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat 5340
tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt 5400
taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa 5460
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 5520
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 5580
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 5640
agagcgcaga taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag 5700
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 5760
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 5820
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 5880
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 5940
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 6000
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 6060
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 6120
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 6180
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 6240
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc 6300
aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc 6360
gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca 6420
ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa 6480
caatttcaca caggaaacag ctatgaccat gattacgcca agcgcgcaat taaccctcac 6540
taaagggaac aaaagctgga gctgcaagct taatgtagtc ttatgcaata ctcttgtagt 6600
cttgcaacat ggtaacgatg agttagcaac atgccttaca aggagagaaa aagcaccgtg 6660
catgccgatt ggtggaagta aggtggtacg atcgtgcctt attaggaagg caacagacgg 6720
gtctgacatg gattggacga accactgaat tgccgcattg cagagatatt gtatttaagt 6780
gcctagctcg atacataaac gggtctctct ggttagacca gatctgagcc tgggagctct 6840
ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag 6900
tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt 6960
cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga aagggaaacc 7020
agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg 7080
gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg agagagatgg 7140
gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa aaattcggtt 7200
aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa gcagggagct 7260
agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta gacaaatact 7320
gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat tatataatac 7380
agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca aggaagcttt 7440
agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag cggccgctga 7500
tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta tataaatata 7560
aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga agagtggtgc 7620
agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg ggagcagcag 7680
gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa ttattgtctg 7740
gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag catctgttgc 7800
aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg gaaagatacc 7860
taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt tgcaccactg 7920
ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg aatcacacga 7980
cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac tccttaattg 8040
aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta gataaatggg 8100
caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa ttattcataa 8160
tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct atagtgaata 8220
gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc ccgaggggac 8280
ccttgcgcct tttccaaggc agccctgggt ttgcgcaggg acgcggctgc tctgggcgtg 8340
gttccgggaa acgcagcggc gccgaccctg ggtctcgcac attcttcacg tccgttcgca 8400
gcgtcacccg gatcttcgcc gctacccttg tgggcccccc ggcgacgctt cctgctccgc 8460
ccctaagtcg ggaaggttcc ttgcggttcg cggcgtgccg gacgtgacaa acggaagccg 8520
cacgtctcac tagtaccctc gcagacggac agcgccaggg agcaatggca gcgcgccgac 8580
cgcgatgggc tgtggccaat agcggctgct cagcagggcg cgccgagagc agcggccggg 8640
aaggggcggt gcgggaggcg gggtgtgggg cggtagtgtg ggccctgttc ctgcccgcgc 8700
ggtgttccgc attctgcaag cctccggagc gcacgtcggc agtcggctcc ctcgttgacc 8760
gaatcaccga cctctctccc cagggggtac ccagctgtct agagaattct agatcttgag 8820
acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg ggtacagtgc 8880
aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat tacaaaaaca 8940
aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc cactttggcg 9000
ccggctcgag gggg 9014
<210> 114
<211> 10451
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 114
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatgggat cggccattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg 3000
gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 3060
ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 3120
ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 3180
tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 3240
gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 3300
gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 3360
gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 3420
gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 3480
cgcatgcccg acggcgatga tctcgtcgtg acccatggcg atgcctgcct ttcatacgag 3540
accgagatcc tgactgtcga gtacggattg cttcctatcg gcaaaatcgt ggagaagagg 3600
attgaatgta ccgtctattc agtcgataat aatgggaaca tctacacaca gcccgtggct 3660
caatggcacg acagaggaga gcaggaagtt tttgaatact gtctcgagga cggatccctc 3720
atccgcgcta ctaaagatca taagtttatg accgtggacg gccagatgct gccaattgac 3780
gaaatttttg aacgagagct ggatctgatg agagtcgaca accttccaaa ctgagaattc 3840
accggtggcg cgttaagtcg acaatcaacc tctggattac aaaatttgtg aaagattgac 3900
tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt 3960
gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt 4020
gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt 4080
gtttgctgac gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg 4140
gactttcgct ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg 4200
ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc 4260
atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt 4320
ctgctacgtc ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc 4380
tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc 4440
cgcctccccg cgtcgacttt aagaccaatg acttacaagg cagctgtaga tcttagccac 4500
tttttaaaag aaaagggggg actggaaggg ctaattcact cccaacgaag acaagatctg 4560
ctttttgctt gtactgggtc tctctggtta gaccagatct gagcctggga gctctctggc 4620
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 4680
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 4740
tggaaaatct ctagcagtac gtatagtagt tcatgtcatc ttattattca gtatttataa 4800
cttgcaaaga aatgaatatc agagagtgag aggaacttgt ttattgcagc ttataatggt 4860
tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct 4920
agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggctcta gctatcccgc 4980
ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg 5040
gctgactaat tttttttatt tatgcagagg ccgaggccgc ctcggcctct gagctattcc 5100
agaagtagtg aggaggcttt tttggaggcc tagggacgta cccaattcgc cctatagtga 5160
gtcgtattac gcgcgctcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg 5220
cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga 5280
agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc 5340
gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac 5400
acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt 5460
cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 5520
tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta gtgggccatc 5580
gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact 5640
cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg atttataagg 5700
gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc 5760
gaattttaac aaaatattaa cgcttacaat ttaggtggca cttttcgggg aaatgtgcgc 5820
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 5880
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 5940
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 6000
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 6060
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 6120
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 6180
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 6240
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 6300
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 6360
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 6420
ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca 6480
acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata 6540
gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc 6600
tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca 6660
ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca 6720
actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg 6780
taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa 6840
tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt 6900
gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat 6960
cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 7020
gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga 7080
gcgcagatac caaatactgt tcttctagtg tagccgtagt taggccacca cttcaagaac 7140
tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt 7200
ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag 7260
cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc 7320
gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag 7380
gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca 7440
gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt 7500
cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc 7560
tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc 7620
cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc 7680
cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa 7740
ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 7800
tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc 7860
caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa 7920
tttcacacag gaaacagcta tgaccatgat tacgccaagc gcgcaattaa ccctcactaa 7980
agggaacaaa agctggagct gcaagcttaa tgtagtctta tgcaatactc ttgtagtctt 8040
gcaacatggt aacgatgagt tagcaacatg ccttacaagg agagaaaaag caccgtgcat 8100
gccgattggt ggaagtaagg tggtacgatc gtgccttatt aggaaggcaa cagacgggtc 8160
tgacatggat tggacgaacc actgaattgc cgcattgcag agatattgta tttaagtgcc 8220
tagctcgata cataaacggg tctctctggt tagaccagat ctgagcctgg gagctctctg 8280
gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag 8340
tgtgtgcccg tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag 8400
tgtggaaaat ctctagcagt ggcgcccgaa cagggacttg aaagcgaaag ggaaaccaga 8460
ggagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag gcgaggggcg 8520
gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga gagatgggtg 8580
cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa ttcggttaag 8640
gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca gggagctaga 8700
acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac aaatactggg 8760
acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat ataatacagt 8820
agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg aagctttaga 8880
caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg ccgctgatct 8940
tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat aaatataaag 9000
tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga 9060
gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga gcagcaggaa 9120
gcactatggg cgcagcgtca atgacgctga cggtacaggc cagacaatta ttgtctggta 9180
tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat ctgttgcaac 9240
tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa agatacctaa 9300
aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc accactgctg 9360
tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat cacacgacct 9420
ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc ttaattgaag 9480
aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat aaatgggcaa 9540
gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta ttcataatga 9600
tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata gtgaatagag 9660
ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg aggggaccct 9720
tgcgcctttt ccaaggcagc cctgggtttg cgcagggacg cggctgctct gggcgtggtt 9780
ccgggaaacg cagcggcgcc gaccctgggt ctcgcacatt cttcacgtcc gttcgcagcg 9840
tcacccggat cttcgccgct acccttgtgg gccccccggc gacgcttcct gctccgcccc 9900
taagtcggga aggttccttg cggttcgcgg cgtgccggac gtgacaaacg gaagccgcac 9960
gtctcactag taccctcgca gacggacagc gccagggagc aatggcagcg cgccgaccgc 10020
gatgggctgt ggccaatagc ggctgctcag cagggcgcgc cgagagcagc ggccgggaag 10080
gggcggtgcg ggaggcgggg tgtggggcgg tagtgtgggc cctgttcctg cccgcgcggt 10140
gttccgcatt ctgcaagcct ccggagcgca cgtcggcagt cggctccctc gttgaccgaa 10200
tcaccgacct ctctccccag ggggtaccca gctgtctaga gaattctaga tcttgagaca 10260
aatggcagta ttcatccaca attttaaaag aaaagggggg attggggggt acagtgcagg 10320
ggaaagaata gtagacataa tagcaacaga catacaaact aaagaattac aaaaacaaat 10380
tacaaaaatt caaaattttc gggtttatta cagggacagc agagatccac tttggcgccg 10440
gctcgagggg g 10451
<210> 115
<211> 296
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 115
Met Gly Ser Ala Ile Glu Gln Asp Gly Leu His Ala Gly Ser Pro Ala
1 5 10 15
Ala Trp Val Glu Arg Leu Phe Gly Tyr Asp Trp Ala Gln Gln Thr Ile
20 25 30
Gly Cys Ser Asp Ala Ala Val Phe Arg Leu Ser Ala Gln Gly Arg Pro
35 40 45
Val Leu Phe Val Lys Thr Asp Leu Ser Gly Ala Leu Asn Glu Leu Gln
50 55 60
Asp Glu Ala Ala Arg Leu Ser Trp Leu Ala Thr Thr Gly Val Pro Cys
65 70 75 80
Ala Ala Val Leu Asp Val Val Thr Glu Ala Gly Arg Asp Trp Leu Leu
85 90 95
Leu Gly Glu Val Pro Gly Gln Asp Leu Leu Ser Ser His Leu Ala Pro
100 105 110
Ala Glu Lys Val Ser Ile Met Ala Asp Ala Met Arg Arg Leu His Thr
115 120 125
Leu Asp Pro Ala Thr Cys Pro Phe Asp His Gln Ala Lys His Arg Ile
130 135 140
Glu Arg Ala Arg Thr Arg Met Glu Ala Gly Leu Val Asp Gln Asp Asp
145 150 155 160
Leu Asp Glu Glu His Gln Gly Leu Ala Pro Ala Glu Leu Phe Ala Arg
165 170 175
Leu Lys Ala Arg Met Pro Asp Gly Asp Asp Leu Val Val Thr His Gly
180 185 190
Asp Ala Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly
195 200 205
Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val
210 215 220
Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln
225 230 235 240
Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp
245 250 255
Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp
260 265 270
Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu
275 280 285
Met Arg Val Asp Asn Leu Pro Asn
290 295
<210> 116
<211> 9896
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 116
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggcca tgattaagat cgctacgcgg aagtacctgg ggaaacagaa cgtctacgac 3000
ataggtgtgg agcgcgatca caactttgct ctgaaaaatg gatttatcgc cagcaactgc 3060
ttgccgaata tcatggtgga aaatggccgc ttttctggat tcatcgactg tggccggctg 3120
ggtgtggcgg accgctatca ggacatagcg ttggctaccc gtgatattgc tgaagagctt 3180
ggcggcgaat gggctgaccg cttcctcgtg ctttacggta tcgccgctcc cgattcgcag 3240
cgcatcgcct tctatcgcct tcttgacgag ttcttctgag aattcaccgg tggcgcgtta 3300
agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta ttcttaacta 3360
tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc atgctattgc 3420
ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt ctctttatga 3480
ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac 3540
ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt tcgctttccc 3600
cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct ggacaggggc 3660
tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt cctttccttg 3720
gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct acgtcccttc 3780
ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc ggcctcttcc 3840
gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct ccccgcgtcg 3900
actttaagac caatgactta caaggcagct gtagatctta gccacttttt aaaagaaaag 3960
gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt tgcttgtact 4020
gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact agggaaccca 4080
ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg 4140
tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa aatctctagc 4200
agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc aaagaaatga 4260
atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa ataaagcaat 4320
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 4380
aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta actccgccca 4440
tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt 4500
ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag tagtgaggag 4560
gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt attacgcgcg 4620
ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa 4680
tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga 4740
tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc 4800
attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct 4860
agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg 4920
tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga 4980
ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt 5040
ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg 5100
aacaacactc aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc 5160
ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat 5220
attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt 5280
ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 5340
cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 5400
cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta 5460
aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc 5520
ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa 5580
gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc 5640
cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 5700
acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 5760
gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac 5820
aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata 5880
ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta 5940
ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 6000
gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat 6060
aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt 6120
aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 6180
aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 6240
gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 6300
gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac 6360
tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 6420
gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 6480
caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 6540
actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 6600
acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 6660
cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 6720
gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 6780
cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 6840
gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg 6900
tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 6960
tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 7020
gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 7080
aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 7140
agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg 7200
cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt 7260
gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc tttacacttt 7320
atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac 7380
agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga acaaaagctg 7440
gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac atggtaacga 7500
tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga ttggtggaag 7560
taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca tggattggac 7620
gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct cgatacataa 7680
acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc 7740
cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt 7800
tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta 7860
gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc tctctcgacg 7920
caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac tggtgagtac 7980
gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcagtat 8040
taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag ggggaaagaa 8100
aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat tcgcagttaa 8160
tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc tacaaccatc 8220
ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa ccctctattg 8280
tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga tagaggaaga 8340
gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga cctggaggag 8400
gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta aaaattgaac 8460
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 8520
tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact atgggcgcag 8580
cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg cagcagcaga 8640
acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca gtctggggca 8700
tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat caacagctcc 8760
tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct tggaatgcta 8820
gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg gagtgggaca 8880
gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg caaaaccagc 8940
aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg tggaattggt 9000
ttaacataac aaattggctg tggtatataa aattattcat aatgatagta ggaggcttgg 9060
taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg cagggatatt 9120
caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc cttttccaag 9180
gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg aaacgcagcg 9240
gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc cggatcttcg 9300
ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt cgggaaggtt 9360
ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc actagtaccc 9420
tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg gctgtggcca 9480
atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg gtgcgggagg 9540
cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc gcattctgca 9600
agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc gacctctctc 9660
cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg cagtattcat 9720
ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa gaatagtaga 9780
cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa 9840
ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg aggggg 9896
<210> 117
<211> 111
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 117
Met Ala Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn
1 5 10 15
Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn
20 25 30
Gly Phe Ile Ala Ser Asn Cys Leu Pro Asn Ile Met Val Glu Asn Gly
35 40 45
Arg Phe Ser Gly Phe Ile Asp Cys Gly Arg Leu Gly Val Ala Asp Arg
50 55 60
Tyr Gln Asp Ile Ala Leu Ala Thr Arg Asp Ile Ala Glu Glu Leu Gly
65 70 75 80
Gly Glu Trp Ala Asp Arg Phe Leu Val Leu Tyr Gly Ile Ala Ala Pro
85 90 95
Asp Ser Gln Arg Ile Ala Phe Tyr Arg Leu Leu Asp Glu Phe Phe
100 105 110
<210> 118
<211> 9536
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 118
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagcga gctgattaag 660
gagaacatgc acatgaagct gtacatggag ggcaccgtgg acaaccatca cttcaagtgc 720
acatccgagg gcgaaggcaa gccctacgag ggcacccaga ccatgagaat caaggtggtc 780
gagggcggcc ctctcccctt cgccttcgac atcctggcta ctagcttcct ctacggcagc 840
aagaccttca tcaaccacac ccagggcatc cccgacttct tcaagcagtc cttccctgag 900
ggcttcacat gggagagagt caccacatac gaagacgggg gcgtgctgac cgctacccag 960
gacaccagcc tccaggacgg ctgcctcatc tacaacgtca agatcagagg ggtgaacttc 1020
acatccaacg gccctgtgat gcagaagaaa acactcggct gggaggcctt caccgagacg 1080
ctgtaccccg ctgacggcgg cctggaaggc agaaacgaca tggccctgaa gctcgtgggc 1140
gggagccatc tgatcgcaaa catcaagacc acatatagat ccaagaaacc cgctaagaac 1200
ctcaagatgc ctggcgtcta ctatgtggac tacagactgg aaagaatcaa ggaggccaac 1260
aacgagacct acgtcgagca gcacgaggtg gcagtggcca gatactgcga cctccctagc 1320
aaactggggc acaagcttaa ttaattaatt aagaattcga cccagctttc ttgtacaaag 1380
tggttggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtag taatgagcta 1440
gcagtctcga ggttaacgaa ttccgccccc cccctaacgt tactggccga agccgcttgg 1500
aataaggccg gtgtgcgctt gtctatatgt tattttccac catattgccg tcttttggca 1560
atgtgagggc ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc 1620
ctctcgccaa aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag 1680
cttcttgaag acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg 1740
gcgacaggtg cccctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac 1800
aaccccagtg ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa 1860
gcgtattcaa caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc 1920
tggggcctcg gtgcacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc 1980
ccccgaacca cggggacgtg gttttccttt gaaaaacacg ataataccat gggatcggcc 2040
attgaacaag atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc 2100
tatgactggg cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg 2160
caggggcgcc cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag 2220
gacgaggcag cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc 2280
gacgttgtca ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat 2340
ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg 2400
cggctgcata cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc 2460
gagcgagcac gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag 2520
catcaggggc tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc 2580
gatgatctcg tcgtgaccca tggcgatgcc tgcctttcat acgagaccga gatcctgact 2640
gtcgagtacg gattgcttcc tatcggcaaa atcgtggaga agaggattga atgtaccgtc 2700
tattcagtcg ataataatgg gaacatctac acacagcccg tggctcaatg gcacgacaga 2760
ggagagcagg aagtttttga atactgtctc gaggacggat ccctcatccg cgctactaaa 2820
gatcataagt ttatgaccgt ggacggccag atgctgccaa ttgacgaaat ttttgaacga 2880
gagctggatc tgatgagagt cgacaacctt ccaaactgag aattcaccgg tggcgcgtta 2940
agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta ttcttaacta 3000
tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc atgctattgc 3060
ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt ctctttatga 3120
ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac 3180
ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt tcgctttccc 3240
cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct ggacaggggc 3300
tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt cctttccttg 3360
gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct acgtcccttc 3420
ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc ggcctcttcc 3480
gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct ccccgcgtcg 3540
actttaagac caatgactta caaggcagct gtagatctta gccacttttt aaaagaaaag 3600
gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt tgcttgtact 3660
gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact agggaaccca 3720
ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg 3780
tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa aatctctagc 3840
agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc aaagaaatga 3900
atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa ataaagcaat 3960
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 4020
aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta actccgccca 4080
tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt 4140
ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag tagtgaggag 4200
gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt attacgcgcg 4260
ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa 4320
tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga 4380
tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc 4440
attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct 4500
agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg 4560
tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga 4620
ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt 4680
ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg 4740
aacaacactc aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc 4800
ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat 4860
attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt 4920
ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 4980
cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 5040
cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta 5100
aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc 5160
ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa 5220
gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc 5280
cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 5340
acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 5400
gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac 5460
aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata 5520
ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta 5580
ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 5640
gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat 5700
aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt 5760
aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 5820
aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 5880
gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 5940
gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac 6000
tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 6060
gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 6120
caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 6180
actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 6240
acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 6300
cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 6360
gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 6420
cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 6480
gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg 6540
tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 6600
tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 6660
gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 6720
aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 6780
agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg 6840
cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt 6900
gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc tttacacttt 6960
atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac 7020
agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga acaaaagctg 7080
gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac atggtaacga 7140
tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga ttggtggaag 7200
taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca tggattggac 7260
gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct cgatacataa 7320
acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc 7380
cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt 7440
tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta 7500
gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc tctctcgacg 7560
caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac tggtgagtac 7620
gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga gcgtcagtat 7680
taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag ggggaaagaa 7740
aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat tcgcagttaa 7800
tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc tacaaccatc 7860
ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa ccctctattg 7920
tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga tagaggaaga 7980
gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga cctggaggag 8040
gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta aaaattgaac 8100
cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa aaaagagcag 8160
tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact atgggcgcag 8220
cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg cagcagcaga 8280
acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca gtctggggca 8340
tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat caacagctcc 8400
tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct tggaatgcta 8460
gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg gagtgggaca 8520
gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg caaaaccagc 8580
aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg tggaattggt 8640
ttaacataac aaattggctg tggtatataa aattattcat aatgatagta ggaggcttgg 8700
taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg cagggatatt 8760
caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc cttttccaag 8820
gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg aaacgcagcg 8880
gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc cggatcttcg 8940
ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt cgggaaggtt 9000
ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc actagtaccc 9060
tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg gctgtggcca 9120
atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg gtgcgggagg 9180
cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc gcattctgca 9240
agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc gacctctctc 9300
cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg cagtattcat 9360
ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa gaatagtaga 9420
cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa 9480
ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg aggggg 9536
<210> 119
<211> 8990
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 119
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gaggataaca tggccatcat caaggagttc atgcgcttca aggtgcacat ggagggctcc 720
gtgaacggcc acgagttcga gatcgagggc gagggcgagg gccgccccta cgagggcacc 780
cagaccgcca agctgaaggt gaccaagggt ggccccctgc ccttcgcctg ggacatcctg 840
tcccctcagt tcatgtacgg ctccaaggcc tacgtgaagc accccgccga catccccgac 900
tacttgaagc tgtccttccc cgagggcttc aagtgggagc gcgtgatgaa cttcgaggac 960
ggcggcgtgg tgaccgtgac ccaggactcc tccctgcagg acggcgagtt catctacaag 1020
gtgaagctgc gcggcaccaa cttcccctcc gacggccccg taatgcagaa gaagaccatg 1080
ggctgggagg cctcctccga gcggatgtac cccgaggacg gcgccctgaa gggcgagatc 1140
aagcagaggc tgaagctgaa ggacggcggc cactacgacg ctgaggtcaa gaccacctac 1200
aaggccaaga agcccgtgca gctgcccggc gcctacaacg tcaacattaa gttggacatc 1260
acctcccaca acgaggacta caccatcgtg gaacagtacg aacgcgccga gggccgccac 1320
tccaccggcg gcatggacga gctgtacaag tgattaatta agaattcgac ccagctttct 1380
tgtacaaagt ggttggtaag cctatcccta accctctcct cggtctcgat tctacgtagt 1440
aatgagctag cagtctcgag gttaacgaat tccgcccccc ccctaacgtt actggccgaa 1500
gccgcttgga ataaggccgg tgtgcgcttg tctatatgtt attttccacc atattgccgt 1560
cttttggcaa tgtgagggcc cggaaacctg gccctgtctt cttgacgagc attcctaggg 1620
gtctttcccc tctcgccaaa ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc 1680
ctctggaagc ttcttgaaga caaacaacgt ctgtagcgac cctttgcagg cagcggaacc 1740
ccccacctgg cgacaggtgc ccctgcggcc aaaagccacg tgtataagat acacctgcaa 1800
aggcggcaca accccagtgc cacgttgtga gttggatagt tgtggaaaga gtcaaatggc 1860
tctcctcaag cgtattcaac aaggggctga aggatgccca gaaggtaccc cattgtatgg 1920
gatctgatct ggggcctcgg tgcacatgct ttacatgtgt ttagtcgagg ttaaaaaaac 1980
gtctaggccc cccgaaccac ggggacgtgg ttttcctttg aaaaacacga taataccatg 2040
gccatgatta agatcgctac gcggaagtac ctggggaaac agaacgtcta cgacataggt 2100
gtggagcgcg atcacaactt tgctctgaaa aatggattta tcgccagcaa ctgcttgccg 2160
aatatcatgg tggaaaatgg ccgcttttct ggattcatcg actgtggccg gctgggtgtg 2220
gcggaccgct atcaggacat agcgttggct acccgtgata ttgctgaaga gcttggcggc 2280
gaatgggctg accgcttcct cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc 2340
gccttctatc gccttcttga cgagttcttc tgagaattca ccggtggcgc gttaagtcga 2400
caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc 2460
tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg 2520
tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt 2580
gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac 2640
tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc 2700
tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct 2760
gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct 2820
cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct 2880
caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct 2940
tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc gtcgacttta 3000
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 3060
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 3120
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 3180
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 3240
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtacg 3300
tatagtagtt catgtcatct tattattcag tatttataac ttgcaaagaa atgaatatca 3360
gagagtgaga ggaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc 3420
acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt gtccaaactc 3480
atcaatgtat cttatcatgt ctggctctag ctatcccgcc cctaactccg cccatcccgc 3540
ccctaactcc gcccagttcc gcccattctc cgccccatgg ctgactaatt ttttttattt 3600
atgcagaggc cgaggccgcc tcggcctctg agctattcca gaagtagtga ggaggctttt 3660
ttggaggcct agggacgtac ccaattcgcc ctatagtgag tcgtattacg cgcgctcact 3720
ggccgtcgtt ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct 3780
tgcagcacat ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc 3840
ttcccaacag ttgcgcagcc tgaatggcga atgggacgcg ccctgtagcg gcgcattaag 3900
cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc 3960
cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc 4020
tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa 4080
aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg 4140
ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac 4200
actcaaccct atctcggtct attcttttga tttataaggg attttgccga tttcggccta 4260
ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca aaatattaac 4320
gcttacaatt taggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt 4380
ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa 4440
taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt 4500
tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat 4560
gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag 4620
atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg 4680
ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata 4740
cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat 4800
ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc 4860
aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg 4920
ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac 4980
gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact 5040
ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa 5100
gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct 5160
ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc 5220
tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga 5280
cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac 5340
tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag 5400
atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg 5460
tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc 5520
tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag 5580
ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtt 5640
cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac 5700
ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc 5760
gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt 5820
tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt 5880
gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc 5940
ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt 6000
tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca 6060
ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt 6120
tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt 6180
attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag 6240
tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg 6300
ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc 6360
aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt 6420
ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat 6480
gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa gctggagctg 6540
caagcttaat gtagtcttat gcaatactct tgtagtcttg caacatggta acgatgagtt 6600
agcaacatgc cttacaagga gagaaaaagc accgtgcatg ccgattggtg gaagtaaggt 6660
ggtacgatcg tgccttatta ggaaggcaac agacgggtct gacatggatt ggacgaacca 6720
ctgaattgcc gcattgcaga gatattgtat ttaagtgcct agctcgatac ataaacgggt 6780
ctctctggtt agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc 6840
ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg 6900
actctggtaa ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcagtg 6960
gcgcccgaac agggacttga aagcgaaagg gaaaccagag gagctctctc gacgcaggac 7020
tcggcttgct gaagcgcgca cggcaagagg cgaggggcgg cgactggtga gtacgccaaa 7080
aattttgact agcggaggct agaaggagag agatgggtgc gagagcgtca gtattaagcg 7140
ggggagaatt agatcgcgat gggaaaaaat tcggttaagg ccagggggaa agaaaaaata 7200
taaattaaaa catatagtat gggcaagcag ggagctagaa cgattcgcag ttaatcctgg 7260
cctgttagaa acatcagaag gctgtagaca aatactggga cagctacaac catcccttca 7320
gacaggatca gaagaactta gatcattata taatacagta gcaaccctct attgtgtgca 7380
tcaaaggata gagataaaag acaccaagga agctttagac aagatagagg aagagcaaaa 7440
caaaagtaag accaccgcac agcaagcggc cgctgatctt cagacctgga ggaggagata 7500
tgagggacaa ttggagaagt gaattatata aatataaagt agtaaaaatt gaaccattag 7560
gagtagcacc caccaaggca aagagaagag tggtgcagag agaaaaaaga gcagtgggaa 7620
taggagcttt gttccttggg ttcttgggag cagcaggaag cactatgggc gcagcgtcaa 7680
tgacgctgac ggtacaggcc agacaattat tgtctggtat agtgcagcag cagaacaatt 7740
tgctgagggc tattgaggcg caacagcatc tgttgcaact cacagtctgg ggcatcaagc 7800
agctccaggc aagaatcctg gctgtggaaa gatacctaaa ggatcaacag ctcctgggga 7860
tttggggttg ctctggaaaa ctcatttgca ccactgctgt gccttggaat gctagttgga 7920
gtaataaatc tctggaacag atttggaatc acacgacctg gatggagtgg gacagagaaa 7980
ttaacaatta cacaagctta atacactcct taattgaaga atcgcaaaac cagcaagaaa 8040
agaatgaaca agaattattg gaattagata aatgggcaag tttgtggaat tggtttaaca 8100
taacaaattg gctgtggtat ataaaattat tcataatgat agtaggaggc ttggtaggtt 8160
taagaatagt ttttgctgta ctttctatag tgaatagagt taggcaggga tattcaccat 8220
tatcgtttca gacccacctc ccaaccccga ggggaccctt gcgccttttc caaggcagcc 8280
ctgggtttgc gcagggacgc ggctgctctg ggcgtggttc cgggaaacgc agcggcgccg 8340
accctgggtc tcgcacattc ttcacgtccg ttcgcagcgt cacccggatc ttcgccgcta 8400
cccttgtggg ccccccggcg acgcttcctg ctccgcccct aagtcgggaa ggttccttgc 8460
ggttcgcggc gtgccggacg tgacaaacgg aagccgcacg tctcactagt accctcgcag 8520
acggacagcg ccagggagca atggcagcgc gccgaccgcg atgggctgtg gccaatagcg 8580
gctgctcagc agggcgcgcc gagagcagcg gccgggaagg ggcggtgcgg gaggcggggt 8640
gtggggcggt agtgtgggcc ctgttcctgc ccgcgcggtg ttccgcattc tgcaagcctc 8700
cggagcgcac gtcggcagtc ggctccctcg ttgaccgaat caccgacctc tctccccagg 8760
gggtacccag ctgtctagag aattctagat cttgagacaa atggcagtat tcatccacaa 8820
ttttaaaaga aaagggggga ttggggggta cagtgcaggg gaaagaatag tagacataat 8880
agcaacagac atacaaacta aagaattaca aaaacaaatt acaaaaattc aaaattttcg 8940
ggtttattac agggacagca gagatccact ttggcgccgg ctcgaggggg 8990
<210> 120
<211> 9219
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 120
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggtgag caagggcgag 660
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 720
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 780
ctgaaggtga ccaagtgcct ttcatacgag accgagatcc tgactgtcga gtacggattg 840
cttcctatcg gcaaaatcgt ggagaagagg attgaatgta ccgtctattc agtcgataat 900
aatgggaaca tctacacaca gcccgtggct caatggcacg acagaggaga gcaggaagtt 960
tttgaatact gtctcgagga cggatccctc atccgcgcta ctaaagatca taagtttatg 1020
accgtggacg gccagatgct gccaattgac gaaatttttg aacgagagct ggatctgatg 1080
agagtcgaca accttccaaa cggtggaggg gggtcaggct ctgcgcagct ggaaaaggag 1140
cttcaagccc tcgaaaaaaa gttggcccag ctcgagtggg agaaccaggc tctggagaaa 1200
gaactggccc agtgattaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1260
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1320
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1380
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1440
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1500
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1560
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1620
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1680
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 1740
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 1800
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 1860
acggggacgt ggttttcctt tgaaaaacac gataatacca tggccatgag cgagctgatt 1920
aaggagaaca tgcacatgaa gctgtacatg gagggcaccg tggacaacca tcacttcaag 1980
tgcacatccg agggcgaagg caagccctac gagggcaccc agaccatgag aatcaaggtg 2040
gtcgagggcg gccctctccc cttcgccttc gacatcctgg ctactagctt cctctacggc 2100
agcaagacct tcatcaacca cacccagggc atccccgact tcttcaagca gtccttccct 2160
gagggcttca catgggagag agtcaccaca tacgaagacg ggggcgtgct gaccgctacc 2220
caggacacca gcctccagga cggctgcctc atctacaacg tcaagatcag aggggtgaac 2280
ttcacatcca acggccctgt gatgcagaag aaaacactcg gctgggaggc cttcaccgag 2340
acgctgtacc ccgctgacgg cggcctggaa ggcagaaacg acatggccct gaagctcgtg 2400
ggcgggagcc atctgatcgc aaacatcaag accacatata gatccaagaa acccgctaag 2460
aacctcaaga tgcctggcgt ctactatgtg gactacagac tggaaagaat caaggaggcc 2520
aacaacgaga cctacgtcga gcagcacgag gtggcagtgg ccagatactg cgacctccct 2580
agcaaactgg ggcacaagct taattaacac cggtggcgcg ttaagtcgac aatcaacctc 2640
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 2700
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 2760
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 2820
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 2880
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 2940
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 3000
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 3060
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 3120
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 3180
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3240
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3300
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3360
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3420
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3480
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3540
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 3600
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 3660
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 3720
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 3780
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 3840
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 3900
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 3960
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4020
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4080
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 4140
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 4200
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4260
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4320
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4380
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4440
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4500
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4560
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 4620
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 4680
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 4740
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 4800
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 4860
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 4920
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 4980
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 5040
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 5100
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 5160
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 5220
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5280
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5340
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5400
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5460
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5520
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 5580
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 5640
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 5700
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 5760
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 5820
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 5880
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 5940
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 6000
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 6060
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 6120
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 6180
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6240
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6300
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6360
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6420
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6480
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6540
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6600
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6660
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6720
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 6780
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 6840
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 6900
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 6960
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 7020
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 7080
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 7140
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 7200
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7260
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7320
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7380
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7440
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7500
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7560
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 7620
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 7680
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 7740
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 7800
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 7860
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 7920
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 7980
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 8040
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 8100
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 8160
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 8220
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8280
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8340
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8400
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8460
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8520
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 8580
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 8640
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 8700
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 8760
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 8820
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 8880
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 8940
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 9000
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 9060
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 9120
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 9180
gggacagcag agatccactt tggcgccggc tcgaggggg 9219
<210> 121
<211> 190
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 121
Met Val Ser Lys Gly Glu Ala Val Ile Lys Glu Phe Met Arg Phe Lys
1 5 10 15
Val His Met Glu Gly Ser Met Asn Gly His Glu Phe Glu Ile Glu Gly
20 25 30
Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys
35 40 45
Val Thr Lys Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr
50 55 60
Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr
65 70 75 80
Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala
85 90 95
Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu
100 105 110
Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val
115 120 125
Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp
130 135 140
Leu Met Arg Val Asp Asn Leu Pro Asn Gly Gly Gly Gly Ser Gly Ser
145 150 155 160
Ala Gln Leu Glu Lys Glu Leu Gln Ala Leu Glu Lys Lys Leu Ala Gln
165 170 175
Leu Glu Trp Glu Asn Gln Ala Leu Glu Lys Glu Leu Ala Gln
180 185 190
<210> 122
<211> 9639
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 122
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcggt ggccccctgc ccttctcctg ggacatcctg 900
tcccctcagt tcatgtacgg ctccagggcc ttcaccaagc accccgccga catccccgac 960
tactataagc agtccttccc cgagggcttc aagtgggagc gcgtgatgaa cttcgaggac 1020
ggcggcgccg tgaccgtgac ccaggacacc tccctggagg acggcaccct gatctacaag 1080
gtgaagctcc gcggcaccaa cttccctcct gacggccccg taatgcagaa gaagacaatg 1140
ggctgggaag cgtccaccga gcggttgtac cccgaggacg gcgtgctgaa gggcgacatt 1200
aagtgccttt catacgagac cgagatcctg actgtcgagt acggattgct tcctatcggc 1260
aaaatcgtgg agaagaggat tgaatgtacc gtctattcag tcgataataa tgggaacatc 1320
tacacacagc ccgtggctca atggcacgac agaggagagc aggaagtttt tgaatactgt 1380
ctcgaggacg gatccctcat ccgcgctact aaagatcata agtttatgac cgtggacggc 1440
cagatgctgc caattgacga aatttttgaa cgagagctgg atctgatgag agtcgacaac 1500
cttccaaacg gtggaggggg gtcaggctct gcgcagctgg aaaaggagct tcaagccctc 1560
gaaaaaaagt tggcccagct cgagtgggag aaccaggctc tggagaaaga actggcccag 1620
tgattaatta agaattcgac ccagctttct tgtacaaagt ggttggtaag cctatcccta 1680
accctctcct cggtctcgat tctacgtagt aatgagctag cagtctcgag gttaacgaat 1740
tccgcccccc ccctaacgtt actggccgaa gccgcttgga ataaggccgg tgtgcgcttg 1800
tctatatgtt attttccacc atattgccgt cttttggcaa tgtgagggcc cggaaacctg 1860
gccctgtctt cttgacgagc attcctaggg gtctttcccc tctcgccaaa ggaatgcaag 1920
gtctgttgaa tgtcgtgaag gaagcagttc ctctggaagc ttcttgaaga caaacaacgt 1980
ctgtagcgac cctttgcagg cagcggaacc ccccacctgg cgacaggtgc ccctgcggcc 2040
aaaagccacg tgtataagat acacctgcaa aggcggcaca accccagtgc cacgttgtga 2100
gttggatagt tgtggaaaga gtcaaatggc tctcctcaag cgtattcaac aaggggctga 2160
aggatgccca gaaggtaccc cattgtatgg gatctgatct ggggcctcgg tgcacatgct 2220
ttacatgtgt ttagtcgagg ttaaaaaaac gtctaggccc cccgaaccac ggggacgtgg 2280
ttttcctttg aaaaacacga taataccatg gtgagcaagg gcgaggagct gttcaccggg 2340
gtggtgccca tcctggtcga gctggacggc gacgtaaacg gccacaagtt cagcgtgtcc 2400
ggcgagggcg agggcgatgc cacctacggc aagctgaccc tgaagttcat ctgcaccacc 2460
ggcaagctgc ccgtgccctg gcccaccctc gtgaccaccc tgacctacgg cgtgcagtgc 2520
ttcagccgct accccgacca catgaagcag cacgacttct tcaagtccgc catgcccgaa 2580
ggctacgtcc aggagcgcac catcttcttc aaggacgacg gcaactacaa gacccgcgcc 2640
gaggtgaagt tcgagggcga caccctggtg aaccgcatcg agctgaaggg catcgacttc 2700
aaggaggacg gcaacatcct ggggcacaag ctggagtaca actacaacag ccacaacgtc 2760
tatatcatgg ccgacaagca gaagaacggc atcaaggtga acttcaagat ccgccacaac 2820
atcgaggacg gcagcgtgca gctcgccgac cactaccagc agaacacccc catcggcgac 2880
ggccccgtgc tgctgcccga caaccactac ctgagcaccc agtccgccct gagcaaagac 2940
cccaacgaga agcgcgatca catggtcctg ctggagttcg tgaccgccgc cgggatcact 3000
ctcggcatgg acgagctgta caagtaacac cggtggcgcg ttaagtcgac aatcaacctc 3060
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 3120
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 3180
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 3240
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 3300
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 3360
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 3420
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 3480
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 3540
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 3600
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3660
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3720
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3780
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3840
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3900
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3960
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 4020
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 4080
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 4140
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 4200
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 4260
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 4320
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 4380
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4440
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4500
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 4560
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 4620
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4680
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4740
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4800
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4860
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4920
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4980
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 5040
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 5100
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 5160
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 5220
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 5280
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 5340
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 5400
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 5460
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 5520
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 5580
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 5640
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5700
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5760
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5820
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5880
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5940
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 6000
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 6060
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 6120
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 6180
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 6240
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 6300
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 6360
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 6420
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 6480
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 6540
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 6600
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6660
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6720
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6780
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6840
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6900
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6960
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 7020
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 7080
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 7140
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 7200
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 7260
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 7320
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 7380
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 7440
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 7500
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 7560
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 7620
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7680
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7740
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7800
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7860
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7920
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7980
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 8040
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 8100
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 8160
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 8220
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 8280
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 8340
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 8400
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 8460
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 8520
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 8580
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 8640
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8700
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8760
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8820
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8880
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8940
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 9000
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 9060
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 9120
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 9180
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 9240
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 9300
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 9360
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 9420
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 9480
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 9540
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 9600
gggacagcag agatccactt tggcgccggc tcgaggggg 9639
<210> 123
<211> 326
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 123
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Gly Gly Pro Leu Pro
65 70 75 80
Phe Ser Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Arg Ala
85 90 95
Phe Thr Lys His Pro Ala Asp Ile Pro Asp Tyr Tyr Lys Gln Ser Phe
100 105 110
Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly
115 120 125
Ala Val Thr Val Thr Gln Asp Thr Ser Leu Glu Asp Gly Thr Leu Ile
130 135 140
Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Pro Asp Gly Pro Val
145 150 155 160
Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Thr Glu Arg Leu Tyr
165 170 175
Pro Glu Asp Gly Val Leu Lys Gly Asp Ile Lys Cys Leu Ser Tyr Glu
180 185 190
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile
195 200 205
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly
210 215 220
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln
225 230 235 240
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr
245 250 255
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp
260 265 270
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro
275 280 285
Asn Gly Gly Gly Gly Ser Gly Ser Ala Gln Leu Glu Lys Glu Leu Gln
290 295 300
Ala Leu Glu Lys Lys Leu Ala Gln Leu Glu Trp Glu Asn Gln Ala Leu
305 310 315 320
Glu Lys Glu Leu Ala Gln
325
<210> 124
<211> 9093
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 124
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtggagc gcgatcacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcatg gccctgcgcc tgaaggacgg cggccgctac 900
ctggcggact tcaagaccac ctacaaggcc aagaagcccg tgcagatgcc cggcgcctac 960
aacgtcgacc gcaagttgga catcacctcc cacaacgagg actacaccgt ggtggaacag 1020
tacgaacgct ccgagggccg ccactccacc ggcggcatgg acgagctgta caagtgatta 1080
attaagaatt cgacccagct ttcttgtaca aagtggttgg taagcctatc cctaaccctc 1140
tcctcggtct cgattctacg tagtaatgag ctagcagtct cgaggttaac gaattccgcc 1200
ccccccctaa cgttactggc cgaagccgct tggaataagg ccggtgtgcg cttgtctata 1260
tgttattttc caccatattg ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg 1320
tcttcttgac gagcattcct aggggtcttt cccctctcgc caaaggaatg caaggtctgt 1380
tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg aagacaaaca acgtctgtag 1440
cgaccctttg caggcagcgg aaccccccac ctggcgacag gtgcccctgc ggccaaaagc 1500
cacgtgtata agatacacct gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga 1560
tagttgtgga aagagtcaaa tggctctcct caagcgtatt caacaagggg ctgaaggatg 1620
cccagaaggt accccattgt atgggatctg atctggggcc tcggtgcaca tgctttacat 1680
gtgtttagtc gaggttaaaa aaacgtctag gccccccgaa ccacggggac gtggttttcc 1740
tttgaaaaac acgataatac catggtgagc aagggcgagg agctgttcac cggggtggtg 1800
cccatcctgg tcgagctgga cggcgacgta aacggccaca agttcagcgt gtccggcgag 1860
ggcgagggcg atgccaccta cggcaagctg accctgaagt tcatctgcac caccggcaag 1920
ctgcccgtgc cctggcccac cctcgtgacc accctgacct acggcgtgca gtgcttcagc 1980
cgctaccccg accacatgaa gcagcacgac ttcttcaagt ccgccatgcc cgaaggctac 2040
gtccaggagc gcaccatctt cttcaaggac gacggcaact acaagacccg cgccgaggtg 2100
aagttcgagg gcgacaccct ggtgaaccgc atcgagctga agggcatcga cttcaaggag 2160
gacggcaaca tcctggggca caagctggag tacaactaca acagccacaa cgtctatatc 2220
atggccgaca agcagaagaa cggcatcaag gtgaacttca agatccgcca caacatcgag 2280
gacggcagcg tgcagctcgc cgaccactac cagcagaaca cccccatcgg cgacggcccc 2340
gtgctgctgc ccgacaacca ctacctgagc acccagtccg ccctgagcaa agaccccaac 2400
gagaagcgcg atcacatggt cctgctggag ttcgtgaccg ccgccgggat cactctcggc 2460
atggacgagc tgtacaagta acaccggtgg cgcgttaagt cgacaatcaa cctctggatt 2520
acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg 2580
gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct 2640
cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc 2700
aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca 2760
ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac 2820
tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt 2880
ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct 2940
ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc 3000
cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga 3060
cgagtcggat ctccctttgg gccgcctccc cgcgtcgact ttaagaccaa tgacttacaa 3120
ggcagctgta gatcttagcc actttttaaa agaaaagggg ggactggaag ggctaattca 3180
ctcccaacga agacaagatc tgctttttgc ttgtactggg tctctctggt tagaccagat 3240
ctgagcctgg gagctctctg gctaactagg gaacccactg cttaagcctc aataaagctt 3300
gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt gactctggta actagagatc 3360
cctcagaccc ttttagtcag tgtggaaaat ctctagcagt acgtatagta gttcatgtca 3420
tcttattatt cagtatttat aacttgcaaa gaaatgaata tcagagagtg agaggaactt 3480
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 3540
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 3600
tgtctggctc tagctatccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt 3660
tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc 3720
gcctcggcct ctgagctatt ccagaagtag tgaggaggct tttttggagg cctagggacg 3780
tacccaattc gccctatagt gagtcgtatt acgcgcgctc actggccgtc gttttacaac 3840
gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt 3900
tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca 3960
gcctgaatgg cgaatgggac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg 4020
ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct 4080
tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc 4140
ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg 4200
atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt 4260
ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg 4320
tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc 4380
tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttaggtgg 4440
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 4500
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 4560
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 4620
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 4680
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 4740
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 4800
atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 4860
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 4920
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 4980
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 5040
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 5100
gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 5160
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 5220
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 5280
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 5340
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 5400
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 5460
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 5520
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 5580
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 5640
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 5700
gaaggtaact ggcttcagca gagcgcagat accaaatact gttcttctag tgtagccgta 5760
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 5820
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 5880
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 5940
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 6000
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 6060
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 6120
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 6180
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 6240
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 6300
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 6360
ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag 6420
ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag 6480
ttagctcact cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg 6540
tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa 6600
gcgcgcaatt aaccctcact aaagggaaca aaagctggag ctgcaagctt aatgtagtct 6660
tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca tgccttacaa 6720
ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga tcgtgcctta 6780
ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt gccgcattgc 6840
agagatattg tatttaagtg cctagctcga tacataaacg ggtctctctg gttagaccag 6900
atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc tcaataaagc 6960
ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg taactagaga 7020
tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg aacagggact 7080
tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt gctgaagcgc 7140
gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg actagcggag 7200
gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga attagatcgc 7260
gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta aaacatatag 7320
tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta gaaacatcag 7380
aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga tcagaagaac 7440
ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg atagagataa 7500
aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt aagaccaccg 7560
cacagcaagc ggccgctgat cttcagacct ggaggaggag atatgaggga caattggaga 7620
agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag 7680
gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg gaataggagc tttgttcctt 7740
gggttcttgg gagcagcagg aagcactatg ggcgcagcgt caatgacgct gacggtacag 7800
gccagacaat tattgtctgg tatagtgcag cagcagaaca atttgctgag ggctattgag 7860
gcgcaacagc atctgttgca actcacagtc tggggcatca agcagctcca ggcaagaatc 7920
ctggctgtgg aaagatacct aaaggatcaa cagctcctgg ggatttgggg ttgctctgga 7980
aaactcattt gcaccactgc tgtgccttgg aatgctagtt ggagtaataa atctctggaa 8040
cagatttgga atcacacgac ctggatggag tgggacagag aaattaacaa ttacacaagc 8100
ttaatacact ccttaattga agaatcgcaa aaccagcaag aaaagaatga acaagaatta 8160
ttggaattag ataaatgggc aagtttgtgg aattggttta acataacaaa ttggctgtgg 8220
tatataaaat tattcataat gatagtagga ggcttggtag gtttaagaat agtttttgct 8280
gtactttcta tagtgaatag agttaggcag ggatattcac cattatcgtt tcagacccac 8340
ctcccaaccc cgaggggacc cttgcgcctt ttccaaggca gccctgggtt tgcgcaggga 8400
cgcggctgct ctgggcgtgg ttccgggaaa cgcagcggcg ccgaccctgg gtctcgcaca 8460
ttcttcacgt ccgttcgcag cgtcacccgg atcttcgccg ctacccttgt gggccccccg 8520
gcgacgcttc ctgctccgcc cctaagtcgg gaaggttcct tgcggttcgc ggcgtgccgg 8580
acgtgacaaa cggaagccgc acgtctcact agtaccctcg cagacggaca gcgccaggga 8640
gcaatggcag cgcgccgacc gcgatgggct gtggccaata gcggctgctc agcagggcgc 8700
gccgagagca gcggccggga aggggcggtg cgggaggcgg ggtgtggggc ggtagtgtgg 8760
gccctgttcc tgcccgcgcg gtgttccgca ttctgcaagc ctccggagcg cacgtcggca 8820
gtcggctccc tcgttgaccg aatcaccgac ctctctcccc agggggtacc cagctgtcta 8880
gagaattcta gatcttgaga caaatggcag tattcatcca caattttaaa agaaaagggg 8940
ggattggggg gtacagtgca ggggaaagaa tagtagacat aatagcaaca gacatacaaa 9000
ctaaagaatt acaaaaacaa attacaaaaa ttcaaaattt tcgggtttat tacagggaca 9060
gcagagatcc actttggcgc cggctcgagg ggg 9093
<210> 125
<211> 144
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 125
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Asp His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Met Ala Leu Arg Leu
65 70 75 80
Lys Asp Gly Gly Arg Tyr Leu Ala Asp Phe Lys Thr Thr Tyr Lys Ala
85 90 95
Lys Lys Pro Val Gln Met Pro Gly Ala Tyr Asn Val Asp Arg Lys Leu
100 105 110
Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Val Val Glu Gln Tyr Glu
115 120 125
Arg Ser Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys
130 135 140
<210> 126
<211> 9159
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 126
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcgtgcc tttcatacga gaccgagatc ctgactgtcg agtacggatt gcttcctatc 900
ggcaaaatcg tggagaagag gattgaatgt accgtctatt cagtcgataa taatgggaac 960
atctacacac agcccgtggc tcaatggcac gacagaggag agcaggaagt ttttgaatac 1020
tgtctcgagg acggatccct catccgcgct actaaagatc ataagtttat gaccgtggac 1080
ggccagatgc tgccaattga cgaaattttt gaacgagagc tggatctgat gagagtcgac 1140
aaccttccaa actgattaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1200
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1260
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1320
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1380
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1440
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1500
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1560
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1620
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 1680
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 1740
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 1800
acggggacgt ggttttcctt tgaaaaacac gataatacca tggccatgag cgagctgatt 1860
aaggagaaca tgcacatgaa gctgtacatg gagggcaccg tggacaacca tcacttcaag 1920
tgcacatccg agggcgaagg caagccctac gagggcaccc agaccatgag aatcaaggtg 1980
gtcgagggcg gccctctccc cttcgccttc gacatcctgg ctactagctt cctctacggc 2040
agcaagacct tcatcaacca cacccagggc atccccgact tcttcaagca gtccttccct 2100
gagggcttca catgggagag agtcaccaca tacgaagacg ggggcgtgct gaccgctacc 2160
caggacacca gcctccagga cggctgcctc atctacaacg tcaagatcag aggggtgaac 2220
ttcacatcca acggccctgt gatgcagaag aaaacactcg gctgggaggc cttcaccgag 2280
acgctgtacc ccgctgacgg cggcctggaa ggcagaaacg acatggccct gaagctcgtg 2340
ggcgggagcc atctgatcgc aaacatcaag accacatata gatccaagaa acccgctaag 2400
aacctcaaga tgcctggcgt ctactatgtg gactacagac tggaaagaat caaggaggcc 2460
aacaacgaga cctacgtcga gcagcacgag gtggcagtgg ccagatactg cgacctccct 2520
agcaaactgg ggcacaagct taattaacac cggtggcgcg ttaagtcgac aatcaacctc 2580
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 2640
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 2700
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 2760
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 2820
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 2880
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 2940
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 3000
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 3060
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 3120
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3180
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3240
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3300
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3360
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3420
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3480
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 3540
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 3600
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 3660
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 3720
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 3780
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 3840
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 3900
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 3960
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4020
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 4080
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 4140
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4200
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4260
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4320
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4380
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4440
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4500
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 4560
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 4620
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 4680
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 4740
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 4800
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 4860
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 4920
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 4980
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 5040
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 5100
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 5160
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5220
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5280
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5340
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5400
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5460
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 5520
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 5580
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 5640
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 5700
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 5760
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 5820
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 5880
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 5940
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 6000
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 6060
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 6120
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6180
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6240
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6300
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6360
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6420
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6480
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6540
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6600
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6660
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 6720
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 6780
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 6840
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 6900
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 6960
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 7020
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 7080
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 7140
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7200
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7260
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7320
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7380
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7440
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7500
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 7560
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 7620
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 7680
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 7740
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 7800
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 7860
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 7920
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 7980
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 8040
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 8100
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 8160
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8220
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8280
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8340
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8400
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8460
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 8520
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 8580
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 8640
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 8700
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 8760
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 8820
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 8880
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 8940
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 9000
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 9060
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 9120
gggacagcag agatccactt tggcgccggc tcgaggggg 9159
<210> 127
<211> 171
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 127
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val
65 70 75 80
Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu
85 90 95
Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro
100 105 110
Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys
115 120 125
Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met
130 135 140
Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu
145 150 155 160
Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
165 170
<210> 128
<211> 9579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 128
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgccgcgc tcccgattcc ggaagtgctt 780
gacattgggg aatttagcga gagcctgacc tattgcatct cccgccgtgc acagggtgtc 840
acgttgcaag acctgcctga aaccgaactg cccgctgttc tgcagccggt cgcggaggcc 900
atggatgcga tcgctgcggc cgatcttagc cagacgagcg ggttcggccc attcggaccg 960
caaggaatcg gtcaatacac tacatggcgt gatttcatat gcgcgattgc tgatccccat 1020
gtgtatcact ggcaaactgt gatggacgac accgtcagtg cgtccgtcgc gcaggctctc 1080
gatgagctga tgctttgggc cgaggactgc cccgaagtcc ggcacctcgt gcacgcggat 1140
ttcggctcca acaatgtcct gacggacaat ggccgcataa cagcggtcat tgactggagc 1200
gaggcgatgt tcggggattc ccaatacgag gtcgccaaca tcttcttctg gaggccgtgg 1260
ttggcttgta tggagcagca gacgcgctac ttcgagcgga ggcatccgga gcttgcagga 1320
tcgccgcggc tccgggcgta tatgctccgc attggtcttg accaactcta tcagagcttg 1380
gttgacggca atttcgatga tgcagcttgg gcgcagggtc gatgcgacgc aatcgtccga 1440
tccggagccg ggactgtcgg gcgtacacaa atcgcccgca gaagcgcggc cgtctggacc 1500
gatggctgtg tagaagtact cgccgatagt ggaaaccgac gccccagcac tcgtccgagg 1560
gcaaaggaat agttaattaa gaattcgacc cagctttctt gtacaaagtg gttggtaagc 1620
ctatccctaa ccctctcctc ggtctcgatt ctacgtagta atgagctagc agtctcgagg 1680
ttaacgaatt ccgccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt 1740
gtgcgcttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc 1800
ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag 1860
gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac 1920
aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc 1980
cctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc 2040
acgttgtgag ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca 2100
aggggctgaa ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt 2160
gcacatgctt tacatgtgtt tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg 2220
gggacgtggt tttcctttga aaaacacgat aataccatgg tgagcaaggg cgaggaggat 2280
aacatggcca tcatcaagga gttcatgcgc ttcaaggtgc acatggaggg ctccgtgaac 2340
ggccacgagt tcgagatcga gggcgagggc gagggccgcc cctacgaggg cacccagacc 2400
gccaagctga aggtgaccaa gggtggcccc ctgcccttcg cctgggacat cctgtcccct 2460
cagttcatgt acggctccaa ggcctacgtg aagcaccccg ccgacatccc cgactacttg 2520
aagctgtcct tccccgaggg cttcaagtgg gagcgcgtga tgaacttcga ggacggcggc 2580
gtggtgaccg tgacccagga ctcctccctg caggacggcg agttcatcta caaggtgaag 2640
ctgcgcggca ccaacttccc ctccgacggc cccgtaatgc agaagaagac catgggctgg 2700
gaggcctcct ccgagcggat gtaccccgag gacggcgccc tgaagggcga gatcaagcag 2760
aggctgaagc tgaaggacgg cggccactac gacgctgagg tcaagaccac ctacaaggcc 2820
aagaagcccg tgcagctgcc cggcgcctac aacgtcaaca tcaagttgga catcacctcc 2880
cacaacgagg actacaccat cgtggaacag tacgaacgcg ccgagggccg ccactccacc 2940
ggcggcatgg acgagctgta caagtaacac cggtggcgcg ttaagtcgac aatcaacctc 3000
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 3060
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 3120
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 3180
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 3240
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 3300
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 3360
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg 3420
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 3480
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 3540
ctcagacgag tcggatctcc ctttgggccg cctccccgcg tcgactttaa gaccaatgac 3600
ttacaaggca gctgtagatc ttagccactt tttaaaagaa aaggggggac tggaagggct 3660
aattcactcc caacgaagac aagatctgct ttttgcttgt actgggtctc tctggttaga 3720
ccagatctga gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata 3780
aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta 3840
gagatccctc agaccctttt agtcagtgtg gaaaatctct agcagtacgt atagtagttc 3900
atgtcatctt attattcagt atttataact tgcaaagaaa tgaatatcag agagtgagag 3960
gaacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 4020
aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 4080
ttatcatgtc tggctctagc tatcccgccc ctaactccgc ccatcccgcc cctaactccg 4140
cccagttccg cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc 4200
gaggccgcct cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta 4260
gggacgtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 4320
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4380
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4440
tgcgcagcct gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg 4500
tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg 4560
ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg 4620
ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 4680
agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt 4740
tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta 4800
tctcggtcta ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa 4860
atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt 4920
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 4980
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 5040
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 5100
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 5160
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 5220
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 5280
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 5340
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 5400
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 5460
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 5520
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 5580
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 5640
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 5700
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 5760
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 5820
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 5880
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 5940
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 6000
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 6060
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 6120
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 6180
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta 6240
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 6300
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 6360
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 6420
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 6480
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 6540
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 6600
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 6660
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 6720
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 6780
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 6840
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6900
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6960
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 7020
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 7080
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctggagctgc aagcttaatg 7140
tagtcttatg caatactctt gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc 7200
ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt 7260
gccttattag gaaggcaaca gacgggtctg acatggattg gacgaaccac tgaattgccg 7320
cattgcagag atattgtatt taagtgccta gctcgataca taaacgggtc tctctggtta 7380
gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 7440
taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 7500
tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca 7560
gggacttgaa agcgaaaggg aaaccagagg agctctctcg acgcaggact cggcttgctg 7620
aagcgcgcac ggcaagaggc gaggggcggc gactggtgag tacgccaaaa attttgacta 7680
gcggaggcta gaaggagaga gatgggtgcg agagcgtcag tattaagcgg gggagaatta 7740
gatcgcgatg ggaaaaaatt cggttaaggc cagggggaaa gaaaaaatat aaattaaaac 7800
atatagtatg ggcaagcagg gagctagaac gattcgcagt taatcctggc ctgttagaaa 7860
catcagaagg ctgtagacaa atactgggac agctacaacc atcccttcag acaggatcag 7920
aagaacttag atcattatat aatacagtag caaccctcta ttgtgtgcat caaaggatag 7980
agataaaaga caccaaggaa gctttagaca agatagagga agagcaaaac aaaagtaaga 8040
ccaccgcaca gcaagcggcc gctgatcttc agacctggag gaggagatat gagggacaat 8100
tggagaagtg aattatataa atataaagta gtaaaaattg aaccattagg agtagcaccc 8160
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag cagtgggaat aggagctttg 8220
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat gacgctgacg 8280
gtacaggcca gacaattatt gtctggtata gtgcagcagc agaacaattt gctgagggct 8340
attgaggcgc aacagcatct gttgcaactc acagtctggg gcatcaagca gctccaggca 8400
agaatcctgg ctgtggaaag atacctaaag gatcaacagc tcctggggat ttggggttgc 8460
tctggaaaac tcatttgcac cactgctgtg ccttggaatg ctagttggag taataaatct 8520
ctggaacaga tttggaatca cacgacctgg atggagtggg acagagaaat taacaattac 8580
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa 8640
gaattattgg aattagataa atgggcaagt ttgtggaatt ggtttaacat aacaaattgg 8700
ctgtggtata taaaattatt cataatgata gtaggaggct tggtaggttt aagaatagtt 8760
tttgctgtac tttctatagt gaatagagtt aggcagggat attcaccatt atcgtttcag 8820
acccacctcc caaccccgag gggacccttg cgccttttcc aaggcagccc tgggtttgcg 8880
cagggacgcg gctgctctgg gcgtggttcc gggaaacgca gcggcgccga ccctgggtct 8940
cgcacattct tcacgtccgt tcgcagcgtc acccggatct tcgccgctac ccttgtgggc 9000
cccccggcga cgcttcctgc tccgccccta agtcgggaag gttccttgcg gttcgcggcg 9060
tgccggacgt gacaaacgga agccgcacgt ctcactagta ccctcgcaga cggacagcgc 9120
cagggagcaa tggcagcgcg ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca 9180
gggcgcgccg agagcagcgg ccgggaaggg gcggtgcggg aggcggggtg tggggcggta 9240
gtgtgggccc tgttcctgcc cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg 9300
tcggcagtcg gctccctcgt tgaccgaatc accgacctct ctccccaggg ggtacccagc 9360
tgtctagaga attctagatc ttgagacaaa tggcagtatt catccacaat tttaaaagaa 9420
aaggggggat tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca 9480
tacaaactaa agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca 9540
gggacagcag agatccactt tggcgccggc tcgaggggg 9579
<210> 129
<211> 309
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 129
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
35 40 45
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
50 55 60
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
65 70 75 80
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
85 90 95
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
100 105 110
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
115 120 125
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
130 135 140
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
145 150 155 160
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
165 170 175
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
180 185 190
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
195 200 205
Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu
210 215 220
Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp
225 230 235 240
Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp
245 250 255
Ala Gln Gly Arg Cys Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val
260 265 270
Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly
275 280 285
Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg
290 295 300
Pro Arg Ala Lys Glu
305
<210> 130
<211> 9345
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 130
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gctgcctttc atacgagacc gagatcctga ctgtcgagta cggattgctt 1080
cctatcggca aaatcgtgga gaagaggatt gaatgtaccg tctattcagt cgataataat 1140
gggaacatct acacacagcc cgtggctcaa tggcacgaca gaggagagca ggaagttttt 1200
gaatactgtc tcgaggacgg atccctcatc cgcgctacta aagatcataa gtttatgacc 1260
gtggacggcc agatgctgcc aattgacgaa atttttgaac gagagctgga tctgatgaga 1320
gtcgacaacc ttccaaactg attaattaag aattcgaccc agctttcttg tacaaagtgg 1380
ttggtaagcc tatccctaac cctctcctcg gtctcgattc tacgtagtaa tgagctagca 1440
gtctcgaggt taacgaattc cgcccccccc ctaacgttac tggccgaagc cgcttggaat 1500
aaggccggtg tgcgcttgtc tatatgttat tttccaccat attgccgtct tttggcaatg 1560
tgagggcccg gaaacctggc cctgtcttct tgacgagcat tcctaggggt ctttcccctc 1620
tcgccaaagg aatgcaaggt ctgttgaatg tcgtgaagga agcagttcct ctggaagctt 1680
cttgaagaca aacaacgtct gtagcgaccc tttgcaggca gcggaacccc ccacctggcg 1740
acaggtgccc ctgcggccaa aagccacgtg tataagatac acctgcaaag gcggcacaac 1800
cccagtgcca cgttgtgagt tggatagttg tggaaagagt caaatggctc tcctcaagcg 1860
tattcaacaa ggggctgaag gatgcccaga aggtacccca ttgtatggga tctgatctgg 1920
ggcctcggtg cacatgcttt acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc 1980
cgaaccacgg ggacgtggtt ttcctttgaa aaacacgata ataccatggc catgagcgag 2040
ctgattaagg agaacatgca catgaagctg tacatggagg gcaccgtgga caaccatcac 2100
ttcaagtgca catccgaggg cgaaggcaag ccctacgagg gcacccagac catgagaatc 2160
aaggtggtcg agggcggccc tctccccttc gccttcgaca tcctggctac tagcttcctc 2220
tacggcagca agaccttcat caaccacacc cagggcatcc ccgacttctt caagcagtcc 2280
ttccctgagg gcttcacatg ggagagagtc accacatacg aagacggggg cgtgctgacc 2340
gctacccagg acaccagcct ccaggacggc tgcctcatct acaacgtcaa gatcagaggg 2400
gtgaacttca catccaacgg ccctgtgatg cagaagaaaa cactcggctg ggaggccttc 2460
accgagacgc tgtaccccgc tgacggcggc ctggaaggca gaaacgacat ggccctgaag 2520
ctcgtgggcg ggagccatct gatcgcaaac atcaagacca catatagatc caagaaaccc 2580
gctaagaacc tcaagatgcc tggcgtctac tatgtggact acagactgga aagaatcaag 2640
gaggccaaca acgagaccta cgtcgagcag cacgaggtgg cagtggccag atactgcgac 2700
ctccctagca aactggggca caagcttaat taacaccggt ggcgcgttaa gtcgacaatc 2760
aacctctgga ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt 2820
ttacgctatg tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg 2880
ctttcatttt ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc 2940
ccgttgtcag gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt 3000
ggggcattgc caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg 3060
ccacggcgga actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg 3120
gcactgacaa ttccgtggtg ttgtcgggga aatcatcgtc ctttccttgg ctgctcgcct 3180
gtgttgccac ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc 3240
cagcggacct tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc 3300
ttcgccctca gacgagtcgg atctcccttt gggccgcctc cccgcgtcga ctttaagacc 3360
aatgacttac aaggcagctg tagatcttag ccacttttta aaagaaaagg ggggactgga 3420
agggctaatt cactcccaac gaagacaaga tctgcttttt gcttgtactg ggtctctctg 3480
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 3540
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 3600
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtacgtatag 3660
tagttcatgt catcttatta ttcagtattt ataacttgca aagaaatgaa tatcagagag 3720
tgagaggaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 3780
tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa 3840
tgtatcttat catgtctggc tctagctatc ccgcccctaa ctccgcccat cccgccccta 3900
actccgccca gttccgccca ttctccgccc catggctgac taattttttt tatttatgca 3960
gaggccgagg ccgcctcggc ctctgagcta ttccagaagt agtgaggagg cttttttgga 4020
ggcctaggga cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg 4080
tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag 4140
cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc 4200
aacagttgcg cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg 4260
cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc 4320
ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa 4380
atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac 4440
ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt 4500
tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca 4560
accctatctc ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt 4620
taaaaaatga gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta 4680
caatttaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta 4740
aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata 4800
ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc 4860
ggcattttgc cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga 4920
agatcagttg ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct 4980
tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg 5040
tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta 5100
ttctcagaat gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat 5160
gacagtaaga gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt 5220
acttctgaca acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga 5280
tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga 5340
gcgtgacacc acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga 5400
actacttact ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc 5460
aggaccactt ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc 5520
cggtgagcgt gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg 5580
tatcgtagtt atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat 5640
cgctgagata ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata 5700
tatactttag attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct 5760
ttttgataat ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga 5820
ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg 5880
cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc 5940
aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgttcttct 6000
agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc 6060
tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt 6120
ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg 6180
cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct 6240
atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag 6300
ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag 6360
tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg 6420
gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg 6480
gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac 6540
cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt 6600
gagcgaggaa gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat 6660
tcattaatgc agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc 6720
aattaatgtg agttagctca ctcattaggc accccaggct ttacacttta tgcttccggc 6780
tcgtatgttg tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca 6840
tgattacgcc aagcgcgcaa ttaaccctca ctaaagggaa caaaagctgg agctgcaagc 6900
ttaatgtagt cttatgcaat actcttgtag tcttgcaaca tggtaacgat gagttagcaa 6960
catgccttac aaggagagaa aaagcaccgt gcatgccgat tggtggaagt aaggtggtac 7020
gatcgtgcct tattaggaag gcaacagacg ggtctgacat ggattggacg aaccactgaa 7080
ttgccgcatt gcagagatat tgtatttaag tgcctagctc gatacataaa cgggtctctc 7140
tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc actgcttaag 7200
cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct 7260
ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag cagtggcgcc 7320
cgaacaggga cttgaaagcg aaagggaaac cagaggagct ctctcgacgc aggactcggc 7380
ttgctgaagc gcgcacggca agaggcgagg ggcggcgact ggtgagtacg ccaaaaattt 7440
tgactagcgg aggctagaag gagagagatg ggtgcgagag cgtcagtatt aagcggggga 7500
gaattagatc gcgatgggaa aaaattcggt taaggccagg gggaaagaaa aaatataaat 7560
taaaacatat agtatgggca agcagggagc tagaacgatt cgcagttaat cctggcctgt 7620
tagaaacatc agaaggctgt agacaaatac tgggacagct acaaccatcc cttcagacag 7680
gatcagaaga acttagatca ttatataata cagtagcaac cctctattgt gtgcatcaaa 7740
ggatagagat aaaagacacc aaggaagctt tagacaagat agaggaagag caaaacaaaa 7800
gtaagaccac cgcacagcaa gcggccgctg atcttcagac ctggaggagg agatatgagg 7860
gacaattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 7920
gcacccacca aggcaaagag aagagtggtg cagagagaaa aaagagcagt gggaatagga 7980
gctttgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 8040
ctgacggtac aggccagaca attattgtct ggtatagtgc agcagcagaa caatttgctg 8100
agggctattg aggcgcaaca gcatctgttg caactcacag tctggggcat caagcagctc 8160
caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 8220
ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 8280
aaatctctgg aacagatttg gaatcacacg acctggatgg agtgggacag agaaattaac 8340
aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 8400
gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 8460
aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 8520
atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc accattatcg 8580
tttcagaccc acctcccaac cccgagggga cccttgcgcc ttttccaagg cagccctggg 8640
tttgcgcagg gacgcggctg ctctgggcgt ggttccggga aacgcagcgg cgccgaccct 8700
gggtctcgca cattcttcac gtccgttcgc agcgtcaccc ggatcttcgc cgctaccctt 8760
gtgggccccc cggcgacgct tcctgctccg cccctaagtc gggaaggttc cttgcggttc 8820
gcggcgtgcc ggacgtgaca aacggaagcc gcacgtctca ctagtaccct cgcagacgga 8880
cagcgccagg gagcaatggc agcgcgccga ccgcgatggg ctgtggccaa tagcggctgc 8940
tcagcagggc gcgccgagag cagcggccgg gaaggggcgg tgcgggaggc ggggtgtggg 9000
gcggtagtgt gggccctgtt cctgcccgcg cggtgttccg cattctgcaa gcctccggag 9060
cgcacgtcgg cagtcggctc cctcgttgac cgaatcaccg acctctctcc ccagggggta 9120
cccagctgtc tagagaattc tagatcttga gacaaatggc agtattcatc cacaatttta 9180
aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtagac ataatagcaa 9240
cagacataca aactaaagaa ttacaaaaac aaattacaaa aattcaaaat tttcgggttt 9300
attacaggga cagcagagat ccactttggc gccggctcga ggggg 9345
<210> 131
<211> 233
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 131
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr
130 135 140
Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr
145 150 155 160
Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala
165 170 175
Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu
180 185 190
Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val
195 200 205
Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp
210 215 220
Leu Met Arg Val Asp Asn Leu Pro Asn
225 230
<210> 132
<211> 9393
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 132
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgggttcg gcccattcgg accgcaagga 780
atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc ccatgtgtat 840
cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc tctcgatgag 900
ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc ggatttcggc 960
tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg gagcgaggcg 1020
atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc gtggttggct 1080
tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc aggatcgccg 1140
cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag cttggttgac 1200
ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt ccgatccgga 1260
gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg gaccgatggc 1320
tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc gagggcaaag 1380
gaatagttaa ttaagaattc gacccagctt tcttgtacaa agtggttggt aagcctatcc 1440
ctaaccctct cctcggtctc gattctacgt agtaatgagc tagcagtctc gaggttaacg 1500
aattccgccc cccccctaac gttactggcc gaagccgctt ggaataaggc cggtgtgcgc 1560
ttgtctatat gttattttcc accatattgc cgtcttttgg caatgtgagg gcccggaaac 1620
ctggccctgt cttcttgacg agcattccta ggggtctttc ccctctcgcc aaaggaatgc 1680
aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga agcttcttga agacaaacaa 1740
cgtctgtagc gaccctttgc aggcagcgga accccccacc tggcgacagg tgcccctgcg 1800
gccaaaagcc acgtgtataa gatacacctg caaaggcggc acaaccccag tgccacgttg 1860
tgagttggat agttgtggaa agagtcaaat ggctctcctc aagcgtattc aacaaggggc 1920
tgaaggatgc ccagaaggta ccccattgta tgggatctga tctggggcct cggtgcacat 1980
gctttacatg tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac cacggggacg 2040
tggttttcct ttgaaaaaca cgataatacc atggtgagca agggcgagga ggataacatg 2100
gccatcatca aggagttcat gcgcttcaag gtgcacatgg agggctccgt gaacggccac 2160
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 2220
ctgaaggtga ccaagggtgg ccccctgccc ttcgcctggg acatcctgtc ccctcagttc 2280
atgtacggct ccaaggccta cgtgaagcac cccgccgaca tccccgacta cttgaagctg 2340
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgtggtg 2400
accgtgaccc aggactcctc cctgcaggac ggcgagttca tctacaaggt gaagctgcgc 2460
ggcaccaact tcccctccga cggccccgta atgcagaaga agaccatggg ctgggaggcc 2520
tcctccgagc ggatgtaccc cgaggacggc gccctgaagg gcgagatcaa gcagaggctg 2580
aagctgaagg acggcggcca ctacgacgct gaggtcaaga ccacctacaa ggccaagaag 2640
cccgtgcagc tgcccggcgc ctacaacgtc aacatcaagt tggacatcac ctcccacaac 2700
gaggactaca ccatcgtgga acagtacgaa cgcgccgagg gccgccactc caccggcggc 2760
atggacgagc tgtacaagta acaccggtgg cgcgttaagt cgacaatcaa cctctggatt 2820
acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg 2880
gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct 2940
cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc 3000
aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca 3060
ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac 3120
tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt 3180
ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct 3240
ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc 3300
cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga 3360
cgagtcggat ctccctttgg gccgcctccc cgcgtcgact ttaagaccaa tgacttacaa 3420
ggcagctgta gatcttagcc actttttaaa agaaaagggg ggactggaag ggctaattca 3480
ctcccaacga agacaagatc tgctttttgc ttgtactggg tctctctggt tagaccagat 3540
ctgagcctgg gagctctctg gctaactagg gaacccactg cttaagcctc aataaagctt 3600
gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt gactctggta actagagatc 3660
cctcagaccc ttttagtcag tgtggaaaat ctctagcagt acgtatagta gttcatgtca 3720
tcttattatt cagtatttat aacttgcaaa gaaatgaata tcagagagtg agaggaactt 3780
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 3840
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 3900
tgtctggctc tagctatccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt 3960
tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc 4020
gcctcggcct ctgagctatt ccagaagtag tgaggaggct tttttggagg cctagggacg 4080
tacccaattc gccctatagt gagtcgtatt acgcgcgctc actggccgtc gttttacaac 4140
gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt 4200
tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca 4260
gcctgaatgg cgaatgggac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg 4320
ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct 4380
tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc 4440
ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg 4500
atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt 4560
ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg 4620
tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc 4680
tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttaggtgg 4740
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 4800
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 4860
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 4920
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 4980
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 5040
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 5100
atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 5160
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 5220
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 5280
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 5340
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 5400
gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 5460
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 5520
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 5580
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 5640
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 5700
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 5760
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 5820
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 5880
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 5940
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 6000
gaaggtaact ggcttcagca gagcgcagat accaaatact gttcttctag tgtagccgta 6060
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 6120
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 6180
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 6240
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 6300
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 6360
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 6420
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 6480
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 6540
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 6600
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 6660
ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag 6720
ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag 6780
ttagctcact cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg 6840
tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa 6900
gcgcgcaatt aaccctcact aaagggaaca aaagctggag ctgcaagctt aatgtagtct 6960
tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca tgccttacaa 7020
ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga tcgtgcctta 7080
ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt gccgcattgc 7140
agagatattg tatttaagtg cctagctcga tacataaacg ggtctctctg gttagaccag 7200
atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc tcaataaagc 7260
ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg taactagaga 7320
tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg aacagggact 7380
tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt gctgaagcgc 7440
gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg actagcggag 7500
gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga attagatcgc 7560
gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta aaacatatag 7620
tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta gaaacatcag 7680
aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga tcagaagaac 7740
ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg atagagataa 7800
aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt aagaccaccg 7860
cacagcaagc ggccgctgat cttcagacct ggaggaggag atatgaggga caattggaga 7920
agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag 7980
gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg gaataggagc tttgttcctt 8040
gggttcttgg gagcagcagg aagcactatg ggcgcagcgt caatgacgct gacggtacag 8100
gccagacaat tattgtctgg tatagtgcag cagcagaaca atttgctgag ggctattgag 8160
gcgcaacagc atctgttgca actcacagtc tggggcatca agcagctcca ggcaagaatc 8220
ctggctgtgg aaagatacct aaaggatcaa cagctcctgg ggatttgggg ttgctctgga 8280
aaactcattt gcaccactgc tgtgccttgg aatgctagtt ggagtaataa atctctggaa 8340
cagatttgga atcacacgac ctggatggag tgggacagag aaattaacaa ttacacaagc 8400
ttaatacact ccttaattga agaatcgcaa aaccagcaag aaaagaatga acaagaatta 8460
ttggaattag ataaatgggc aagtttgtgg aattggttta acataacaaa ttggctgtgg 8520
tatataaaat tattcataat gatagtagga ggcttggtag gtttaagaat agtttttgct 8580
gtactttcta tagtgaatag agttaggcag ggatattcac cattatcgtt tcagacccac 8640
ctcccaaccc cgaggggacc cttgcgcctt ttccaaggca gccctgggtt tgcgcaggga 8700
cgcggctgct ctgggcgtgg ttccgggaaa cgcagcggcg ccgaccctgg gtctcgcaca 8760
ttcttcacgt ccgttcgcag cgtcacccgg atcttcgccg ctacccttgt gggccccccg 8820
gcgacgcttc ctgctccgcc cctaagtcgg gaaggttcct tgcggttcgc ggcgtgccgg 8880
acgtgacaaa cggaagccgc acgtctcact agtaccctcg cagacggaca gcgccaggga 8940
gcaatggcag cgcgccgacc gcgatgggct gtggccaata gcggctgctc agcagggcgc 9000
gccgagagca gcggccggga aggggcggtg cgggaggcgg ggtgtggggc ggtagtgtgg 9060
gccctgttcc tgcccgcgcg gtgttccgca ttctgcaagc ctccggagcg cacgtcggca 9120
gtcggctccc tcgttgaccg aatcaccgac ctctctcccc agggggtacc cagctgtcta 9180
gagaattcta gatcttgaga caaatggcag tattcatcca caattttaaa agaaaagggg 9240
ggattggggg gtacagtgca ggggaaagaa tagtagacat aatagcaaca gacatacaaa 9300
ctaaagaatt acaaaaacaa attacaaaaa ttcaaaattt tcgggtttat tacagggaca 9360
gcagagatcc actttggcgc cggctcgagg ggg 9393
<210> 133
<211> 247
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 133
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly
35 40 45
Gln Tyr Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His
50 55 60
Val Tyr His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val
65 70 75 80
Ala Gln Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu
85 90 95
Val Arg His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr
100 105 110
Asp Asn Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe
115 120 125
Gly Asp Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp
130 135 140
Leu Ala Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro
145 150 155 160
Glu Leu Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly
165 170 175
Leu Asp Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala
180 185 190
Ala Trp Ala Gln Gly Arg Cys Asp Ala Ile Val Arg Ser Gly Ala Gly
195 200 205
Thr Val Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr
210 215 220
Asp Gly Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser
225 230 235 240
Thr Arg Pro Arg Ala Lys Glu
245
<210> 134
<211> 9465
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 134
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gttgcctttc atacgagacc gagatcctga ctgtcgagta cggattgctt 1200
cctatcggca aaatcgtgga gaagaggatt gaatgtaccg tctattcagt cgataataat 1260
gggaacatct acacacagcc cgtggctcaa tggcacgaca gaggagagca ggaagttttt 1320
gaatactgtc tcgaggacgg atccctcatc cgcgctacta aagatcataa gtttatgacc 1380
gtggacggcc agatgctgcc aattgacgaa atttttgaac gagagctgga tctgatgaga 1440
gtcgacaacc ttccaaactg attaattaag aattcgaccc agctttcttg tacaaagtgg 1500
ttggtaagcc tatccctaac cctctcctcg gtctcgattc tacgtagtaa tgagctagca 1560
gtctcgaggt taacgaattc cgcccccccc ctaacgttac tggccgaagc cgcttggaat 1620
aaggccggtg tgcgcttgtc tatatgttat tttccaccat attgccgtct tttggcaatg 1680
tgagggcccg gaaacctggc cctgtcttct tgacgagcat tcctaggggt ctttcccctc 1740
tcgccaaagg aatgcaaggt ctgttgaatg tcgtgaagga agcagttcct ctggaagctt 1800
cttgaagaca aacaacgtct gtagcgaccc tttgcaggca gcggaacccc ccacctggcg 1860
acaggtgccc ctgcggccaa aagccacgtg tataagatac acctgcaaag gcggcacaac 1920
cccagtgcca cgttgtgagt tggatagttg tggaaagagt caaatggctc tcctcaagcg 1980
tattcaacaa ggggctgaag gatgcccaga aggtacccca ttgtatggga tctgatctgg 2040
ggcctcggtg cacatgcttt acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc 2100
cgaaccacgg ggacgtggtt ttcctttgaa aaacacgata ataccatggc catgagcgag 2160
ctgattaagg agaacatgca catgaagctg tacatggagg gcaccgtgga caaccatcac 2220
ttcaagtgca catccgaggg cgaaggcaag ccctacgagg gcacccagac catgagaatc 2280
aaggtggtcg agggcggccc tctccccttc gccttcgaca tcctggctac tagcttcctc 2340
tacggcagca agaccttcat caaccacacc cagggcatcc ccgacttctt caagcagtcc 2400
ttccctgagg gcttcacatg ggagagagtc accacatacg aagacggggg cgtgctgacc 2460
gctacccagg acaccagcct ccaggacggc tgcctcatct acaacgtcaa gatcagaggg 2520
gtgaacttca catccaacgg ccctgtgatg cagaagaaaa cactcggctg ggaggccttc 2580
accgagacgc tgtaccccgc tgacggcggc ctggaaggca gaaacgacat ggccctgaag 2640
ctcgtgggcg ggagccatct gatcgcaaac atcaagacca catatagatc caagaaaccc 2700
gctaagaacc tcaagatgcc tggcgtctac tatgtggact acagactgga aagaatcaag 2760
gaggccaaca acgagaccta cgtcgagcag cacgaggtgg cagtggccag atactgcgac 2820
ctccctagca aactggggca caagcttaat taacaccggt ggcgcgttaa gtcgacaatc 2880
aacctctgga ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt 2940
ttacgctatg tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg 3000
ctttcatttt ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc 3060
ccgttgtcag gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt 3120
ggggcattgc caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg 3180
ccacggcgga actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg 3240
gcactgacaa ttccgtggtg ttgtcgggga aatcatcgtc ctttccttgg ctgctcgcct 3300
gtgttgccac ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc 3360
cagcggacct tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc 3420
ttcgccctca gacgagtcgg atctcccttt gggccgcctc cccgcgtcga ctttaagacc 3480
aatgacttac aaggcagctg tagatcttag ccacttttta aaagaaaagg ggggactgga 3540
agggctaatt cactcccaac gaagacaaga tctgcttttt gcttgtactg ggtctctctg 3600
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 3660
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 3720
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtacgtatag 3780
tagttcatgt catcttatta ttcagtattt ataacttgca aagaaatgaa tatcagagag 3840
tgagaggaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 3900
tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa 3960
tgtatcttat catgtctggc tctagctatc ccgcccctaa ctccgcccat cccgccccta 4020
actccgccca gttccgccca ttctccgccc catggctgac taattttttt tatttatgca 4080
gaggccgagg ccgcctcggc ctctgagcta ttccagaagt agtgaggagg cttttttgga 4140
ggcctaggga cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg 4200
tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag 4260
cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc 4320
aacagttgcg cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg 4380
cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc 4440
ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa 4500
atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac 4560
ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt 4620
tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca 4680
accctatctc ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt 4740
taaaaaatga gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta 4800
caatttaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta 4860
aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata 4920
ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc 4980
ggcattttgc cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga 5040
agatcagttg ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct 5100
tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg 5160
tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta 5220
ttctcagaat gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat 5280
gacagtaaga gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt 5340
acttctgaca acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga 5400
tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga 5460
gcgtgacacc acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga 5520
actacttact ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc 5580
aggaccactt ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc 5640
cggtgagcgt gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg 5700
tatcgtagtt atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat 5760
cgctgagata ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata 5820
tatactttag attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct 5880
ttttgataat ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga 5940
ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg 6000
cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc 6060
aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgttcttct 6120
agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc 6180
tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt 6240
ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg 6300
cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct 6360
atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag 6420
ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag 6480
tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg 6540
gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg 6600
gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac 6660
cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt 6720
gagcgaggaa gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat 6780
tcattaatgc agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc 6840
aattaatgtg agttagctca ctcattaggc accccaggct ttacacttta tgcttccggc 6900
tcgtatgttg tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca 6960
tgattacgcc aagcgcgcaa ttaaccctca ctaaagggaa caaaagctgg agctgcaagc 7020
ttaatgtagt cttatgcaat actcttgtag tcttgcaaca tggtaacgat gagttagcaa 7080
catgccttac aaggagagaa aaagcaccgt gcatgccgat tggtggaagt aaggtggtac 7140
gatcgtgcct tattaggaag gcaacagacg ggtctgacat ggattggacg aaccactgaa 7200
ttgccgcatt gcagagatat tgtatttaag tgcctagctc gatacataaa cgggtctctc 7260
tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc actgcttaag 7320
cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct 7380
ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag cagtggcgcc 7440
cgaacaggga cttgaaagcg aaagggaaac cagaggagct ctctcgacgc aggactcggc 7500
ttgctgaagc gcgcacggca agaggcgagg ggcggcgact ggtgagtacg ccaaaaattt 7560
tgactagcgg aggctagaag gagagagatg ggtgcgagag cgtcagtatt aagcggggga 7620
gaattagatc gcgatgggaa aaaattcggt taaggccagg gggaaagaaa aaatataaat 7680
taaaacatat agtatgggca agcagggagc tagaacgatt cgcagttaat cctggcctgt 7740
tagaaacatc agaaggctgt agacaaatac tgggacagct acaaccatcc cttcagacag 7800
gatcagaaga acttagatca ttatataata cagtagcaac cctctattgt gtgcatcaaa 7860
ggatagagat aaaagacacc aaggaagctt tagacaagat agaggaagag caaaacaaaa 7920
gtaagaccac cgcacagcaa gcggccgctg atcttcagac ctggaggagg agatatgagg 7980
gacaattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 8040
gcacccacca aggcaaagag aagagtggtg cagagagaaa aaagagcagt gggaatagga 8100
gctttgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 8160
ctgacggtac aggccagaca attattgtct ggtatagtgc agcagcagaa caatttgctg 8220
agggctattg aggcgcaaca gcatctgttg caactcacag tctggggcat caagcagctc 8280
caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 8340
ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 8400
aaatctctgg aacagatttg gaatcacacg acctggatgg agtgggacag agaaattaac 8460
aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 8520
gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 8580
aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 8640
atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc accattatcg 8700
tttcagaccc acctcccaac cccgagggga cccttgcgcc ttttccaagg cagccctggg 8760
tttgcgcagg gacgcggctg ctctgggcgt ggttccggga aacgcagcgg cgccgaccct 8820
gggtctcgca cattcttcac gtccgttcgc agcgtcaccc ggatcttcgc cgctaccctt 8880
gtgggccccc cggcgacgct tcctgctccg cccctaagtc gggaaggttc cttgcggttc 8940
gcggcgtgcc ggacgtgaca aacggaagcc gcacgtctca ctagtaccct cgcagacgga 9000
cagcgccagg gagcaatggc agcgcgccga ccgcgatggg ctgtggccaa tagcggctgc 9060
tcagcagggc gcgccgagag cagcggccgg gaaggggcgg tgcgggaggc ggggtgtggg 9120
gcggtagtgt gggccctgtt cctgcccgcg cggtgttccg cattctgcaa gcctccggag 9180
cgcacgtcgg cagtcggctc cctcgttgac cgaatcaccg acctctctcc ccagggggta 9240
cccagctgtc tagagaattc tagatcttga gacaaatggc agtattcatc cacaatttta 9300
aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtagac ataatagcaa 9360
cagacataca aactaaagaa ttacaaaaac aaattacaaa aattcaaaat tttcgggttt 9420
attacaggga cagcagagat ccactttggc gccggctcga ggggg 9465
<210> 135
<211> 273
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 135
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Cys Leu Ser Tyr Glu
165 170 175
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile
180 185 190
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly
195 200 205
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln
210 215 220
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr
225 230 235 240
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp
245 250 255
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro
260 265 270
Asn
<210> 136
<211> 9273
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 136
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgcgtccg tcgcgcaggc tctcgatgag 780
ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc ggatttcggc 840
tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg gagcgaggcg 900
atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc gtggttggct 960
tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc aggatcgccg 1020
cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag cttggttgac 1080
ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt ccgatccgga 1140
gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg gaccgatggc 1200
tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc gagggcaaag 1260
gaatagttaa ttaagaattc gacccagctt tcttgtacaa agtggttggt aagcctatcc 1320
ctaaccctct cctcggtctc gattctacgt agtaatgagc tagcagtctc gaggttaacg 1380
aattccgccc cccccctaac gttactggcc gaagccgctt ggaataaggc cggtgtgcgc 1440
ttgtctatat gttattttcc accatattgc cgtcttttgg caatgtgagg gcccggaaac 1500
ctggccctgt cttcttgacg agcattccta ggggtctttc ccctctcgcc aaaggaatgc 1560
aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga agcttcttga agacaaacaa 1620
cgtctgtagc gaccctttgc aggcagcgga accccccacc tggcgacagg tgcccctgcg 1680
gccaaaagcc acgtgtataa gatacacctg caaaggcggc acaaccccag tgccacgttg 1740
tgagttggat agttgtggaa agagtcaaat ggctctcctc aagcgtattc aacaaggggc 1800
tgaaggatgc ccagaaggta ccccattgta tgggatctga tctggggcct cggtgcacat 1860
gctttacatg tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac cacggggacg 1920
tggttttcct ttgaaaaaca cgataatacc atggtgagca agggcgagga ggataacatg 1980
gccatcatca aggagttcat gcgcttcaag gtgcacatgg agggctccgt gaacggccac 2040
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 2100
ctgaaggtga ccaagggtgg ccccctgccc ttcgcctggg acatcctgtc ccctcagttc 2160
atgtacggct ccaaggccta cgtgaagcac cccgccgaca tccccgacta cttgaagctg 2220
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgtggtg 2280
accgtgaccc aggactcctc cctgcaggac ggcgagttca tctacaaggt gaagctgcgc 2340
ggcaccaact tcccctccga cggccccgta atgcagaaga agaccatggg ctgggaggcc 2400
tcctccgagc ggatgtaccc cgaggacggc gccctgaagg gcgagatcaa gcagaggctg 2460
aagctgaagg acggcggcca ctacgacgct gaggtcaaga ccacctacaa ggccaagaag 2520
cccgtgcagc tgcccggcgc ctacaacgtc aacatcaagt tggacatcac ctcccacaac 2580
gaggactaca ccatcgtgga acagtacgaa cgcgccgagg gccgccactc caccggcggc 2640
atggacgagc tgtacaagta acaccggtgg cgcgttaagt cgacaatcaa cctctggatt 2700
acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg 2760
gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct 2820
cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc 2880
aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca 2940
ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac 3000
tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt 3060
ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct 3120
ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc 3180
cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga 3240
cgagtcggat ctccctttgg gccgcctccc cgcgtcgact ttaagaccaa tgacttacaa 3300
ggcagctgta gatcttagcc actttttaaa agaaaagggg ggactggaag ggctaattca 3360
ctcccaacga agacaagatc tgctttttgc ttgtactggg tctctctggt tagaccagat 3420
ctgagcctgg gagctctctg gctaactagg gaacccactg cttaagcctc aataaagctt 3480
gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt gactctggta actagagatc 3540
cctcagaccc ttttagtcag tgtggaaaat ctctagcagt acgtatagta gttcatgtca 3600
tcttattatt cagtatttat aacttgcaaa gaaatgaata tcagagagtg agaggaactt 3660
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 3720
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 3780
tgtctggctc tagctatccc gcccctaact ccgcccatcc cgcccctaac tccgcccagt 3840
tccgcccatt ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc 3900
gcctcggcct ctgagctatt ccagaagtag tgaggaggct tttttggagg cctagggacg 3960
tacccaattc gccctatagt gagtcgtatt acgcgcgctc actggccgtc gttttacaac 4020
gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt 4080
tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca 4140
gcctgaatgg cgaatgggac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg 4200
ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct 4260
tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc 4320
ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg 4380
atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt 4440
ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg 4500
tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc 4560
tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttaggtgg 4620
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 4680
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 4740
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 4800
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 4860
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 4920
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 4980
atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 5040
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 5100
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 5160
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 5220
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 5280
gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 5340
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 5400
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 5460
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 5520
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 5580
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 5640
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 5700
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 5760
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 5820
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 5880
gaaggtaact ggcttcagca gagcgcagat accaaatact gttcttctag tgtagccgta 5940
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 6000
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 6060
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 6120
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 6180
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 6240
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 6300
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 6360
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 6420
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 6480
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 6540
ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag 6600
ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag 6660
ttagctcact cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg 6720
tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa 6780
gcgcgcaatt aaccctcact aaagggaaca aaagctggag ctgcaagctt aatgtagtct 6840
tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca tgccttacaa 6900
ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga tcgtgcctta 6960
ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt gccgcattgc 7020
agagatattg tatttaagtg cctagctcga tacataaacg ggtctctctg gttagaccag 7080
atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc tcaataaagc 7140
ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg taactagaga 7200
tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg aacagggact 7260
tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt gctgaagcgc 7320
gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg actagcggag 7380
gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga attagatcgc 7440
gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta aaacatatag 7500
tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta gaaacatcag 7560
aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga tcagaagaac 7620
ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg atagagataa 7680
aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt aagaccaccg 7740
cacagcaagc ggccgctgat cttcagacct ggaggaggag atatgaggga caattggaga 7800
agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag 7860
gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg gaataggagc tttgttcctt 7920
gggttcttgg gagcagcagg aagcactatg ggcgcagcgt caatgacgct gacggtacag 7980
gccagacaat tattgtctgg tatagtgcag cagcagaaca atttgctgag ggctattgag 8040
gcgcaacagc atctgttgca actcacagtc tggggcatca agcagctcca ggcaagaatc 8100
ctggctgtgg aaagatacct aaaggatcaa cagctcctgg ggatttgggg ttgctctgga 8160
aaactcattt gcaccactgc tgtgccttgg aatgctagtt ggagtaataa atctctggaa 8220
cagatttgga atcacacgac ctggatggag tgggacagag aaattaacaa ttacacaagc 8280
ttaatacact ccttaattga agaatcgcaa aaccagcaag aaaagaatga acaagaatta 8340
ttggaattag ataaatgggc aagtttgtgg aattggttta acataacaaa ttggctgtgg 8400
tatataaaat tattcataat gatagtagga ggcttggtag gtttaagaat agtttttgct 8460
gtactttcta tagtgaatag agttaggcag ggatattcac cattatcgtt tcagacccac 8520
ctcccaaccc cgaggggacc cttgcgcctt ttccaaggca gccctgggtt tgcgcaggga 8580
cgcggctgct ctgggcgtgg ttccgggaaa cgcagcggcg ccgaccctgg gtctcgcaca 8640
ttcttcacgt ccgttcgcag cgtcacccgg atcttcgccg ctacccttgt gggccccccg 8700
gcgacgcttc ctgctccgcc cctaagtcgg gaaggttcct tgcggttcgc ggcgtgccgg 8760
acgtgacaaa cggaagccgc acgtctcact agtaccctcg cagacggaca gcgccaggga 8820
gcaatggcag cgcgccgacc gcgatgggct gtggccaata gcggctgctc agcagggcgc 8880
gccgagagca gcggccggga aggggcggtg cgggaggcgg ggtgtggggc ggtagtgtgg 8940
gccctgttcc tgcccgcgcg gtgttccgca ttctgcaagc ctccggagcg cacgtcggca 9000
gtcggctccc tcgttgaccg aatcaccgac ctctctcccc agggggtacc cagctgtcta 9060
gagaattcta gatcttgaga caaatggcag tattcatcca caattttaaa agaaaagggg 9120
ggattggggg gtacagtgca ggggaaagaa tagtagacat aatagcaaca gacatacaaa 9180
ctaaagaatt acaaaaacaa attacaaaaa ttcaaaattt tcgggtttat tacagggaca 9240
gcagagatcc actttggcgc cggctcgagg ggg 9273
<210> 137
<211> 207
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 137
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ala Ser Val Ala Gln Ala Leu Asp Glu Leu Met
35 40 45
Leu Trp Ala Glu Asp Cys Pro Glu Val Arg His Leu Val His Ala Asp
50 55 60
Phe Gly Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala Val
65 70 75 80
Ile Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val Ala
85 90 95
Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Met Glu Gln Gln Thr
100 105 110
Arg Tyr Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser Pro Arg Leu
115 120 125
Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr Gln Ser Leu
130 135 140
Val Asp Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly Arg Cys Asp
145 150 155 160
Ala Ile Val Arg Ser Gly Ala Gly Thr Val Gly Arg Thr Gln Ile Ala
165 170 175
Arg Arg Ser Ala Ala Val Trp Thr Asp Gly Cys Val Glu Val Leu Ala
180 185 190
Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg Pro Arg Ala Lys Glu
195 200 205
<210> 138
<211> 9606
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 138
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agctgccttt catacgagac cgagatcctg 1320
actgtcgagt acggattgct tcctatcggc aaaatcgtgg agaagaggat tgaatgtacc 1380
gtctattcag tcgataataa tgggaacatc tacacacagc ccgtggctca atggcacgac 1440
agaggagagc aggaagtttt tgaatactgt ctcgaggacg gatccctcat ccgcgctact 1500
aaagatcata agtttatgac cgtggacggc cagatgctgc caattgacga aatttttgaa 1560
cgagagctgg atctgatgag agtcgacaac cttccaaact gattaattaa gaattcgacc 1620
cagctttctt gtacaaagtg gttggtaagc ctatccctaa ccctctcctc ggtctcgatt 1680
ctacgtagta atgagctagc agtctcgagg ttaacgaatt ccgccccccc cctaacgtta 1740
ctggccgaag ccgcttggaa taaggccggt gtgcgcttgt ctatatgtta ttttccacca 1800
tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 1860
ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 1920
aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 1980
agcggaaccc cccacctggc gacaggtgcc cctgcggcca aaagccacgt gtataagata 2040
cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 2100
tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 2160
attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 2220
taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat 2280
aataccatgg ccatgagcga gctgattaag gagaacatgc acatgaagct gtacatggag 2340
ggcaccgtgg acaaccatca cttcaagtgc acatccgagg gcgaaggcaa gccctacgag 2400
ggcacccaga ccatgagaat caaggtggtc gagggcggcc ctctcccctt cgccttcgac 2460
atcctggcta ctagcttcct ctacggcagc aagaccttca tcaaccacac ccagggcatc 2520
cccgacttct tcaagcagtc cttccctgag ggcttcacat gggagagagt caccacatac 2580
gaagacgggg gcgtgctgac cgctacccag gacaccagcc tccaggacgg ctgcctcatc 2640
tacaacgtca agatcagagg ggtgaacttc acatccaacg gccctgtgat gcagaagaaa 2700
acactcggct gggaggcctt caccgagacg ctgtaccccg ctgacggcgg cctggaaggc 2760
agaaacgaca tggccctgaa gctcgtgggc gggagccatc tgatcgcaaa catcaagacc 2820
acatatagat ccaagaaacc cgctaagaac ctcaagatgc ctggcgtcta ctatgtggac 2880
tacagactgg aaagaatcaa ggaggccaac aacgagacct acgtcgagca gcacgaggtg 2940
gcagtggcca gatactgcga cctccctagc aaactggggc acaagcttaa ttaacaccgg 3000
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 3060
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 3120
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 3180
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 3240
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 3300
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 3360
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 3420
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 3480
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 3540
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 3600
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 3660
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 3720
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 3780
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 3840
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 3900
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 3960
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 4020
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 4080
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 4140
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 4200
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 4260
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 4320
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 4380
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 4440
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 4500
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 4560
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 4620
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 4680
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 4740
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4800
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4860
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4920
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 4980
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 5040
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 5100
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 5160
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 5220
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 5280
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 5340
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 5400
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 5460
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 5520
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 5580
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 5640
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 5700
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 5760
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 5820
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 5880
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 5940
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 6000
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 6060
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 6120
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 6180
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 6240
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 6300
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 6360
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 6420
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 6480
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 6540
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 6600
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 6660
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 6720
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 6780
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 6840
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 6900
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 6960
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 7020
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 7080
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 7140
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 7200
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 7260
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 7320
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 7380
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 7440
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 7500
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 7560
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 7620
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 7680
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 7740
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 7800
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 7860
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 7920
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 7980
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 8040
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 8100
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 8160
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 8220
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 8280
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 8340
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 8400
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 8460
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 8520
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 8580
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 8640
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 8700
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 8760
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 8820
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 8880
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 8940
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 9000
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 9060
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 9120
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 9180
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 9240
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 9300
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 9360
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 9420
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 9480
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 9540
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 9600
aggggg 9606
<210> 139
<211> 320
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 139
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
195 200 205
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Cys Leu Ser Tyr Glu Thr
210 215 220
Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val
225 230 235 240
Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn
245 250 255
Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu
260 265 270
Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys
275 280 285
Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu
290 295 300
Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
305 310 315 320
<210> 140
<211> 9132
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 140
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgaggcga tgttcgggga ttcccaatac 780
gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 840
tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 900
cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct 960
tgggcgcagg gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca 1020
caaatcgccc gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat 1080
agtggaaacc gacgccccag cactcgtccg agggcaaagg aatagttaat taagaattcg 1140
acccagcttt cttgtacaaa gtggttggta agcctatccc taaccctctc ctcggtctcg 1200
attctacgta gtaatgagct agcagtctcg aggttaacga attccgcccc ccccctaacg 1260
ttactggccg aagccgcttg gaataaggcc ggtgtgcgct tgtctatatg ttattttcca 1320
ccatattgcc gtcttttggc aatgtgaggg cccggaaacc tggccctgtc ttcttgacga 1380
gcattcctag gggtctttcc cctctcgcca aaggaatgca aggtctgttg aatgtcgtga 1440
aggaagcagt tcctctggaa gcttcttgaa gacaaacaac gtctgtagcg accctttgca 1500
ggcagcggaa ccccccacct ggcgacaggt gcccctgcgg ccaaaagcca cgtgtataag 1560
atacacctgc aaaggcggca caaccccagt gccacgttgt gagttggata gttgtggaaa 1620
gagtcaaatg gctctcctca agcgtattca acaaggggct gaaggatgcc cagaaggtac 1680
cccattgtat gggatctgat ctggggcctc ggtgcacatg ctttacatgt gtttagtcga 1740
ggttaaaaaa acgtctaggc cccccgaacc acggggacgt ggttttcctt tgaaaaacac 1800
gataatacca tggtgagcaa gggcgaggag gataacatgg ccatcatcaa ggagttcatg 1860
cgcttcaagg tgcacatgga gggctccgtg aacggccacg agttcgagat cgagggcgag 1920
ggcgagggcc gcccctacga gggcacccag accgccaagc tgaaggtgac caagggtggc 1980
cccctgccct tcgcctggga catcctgtcc cctcagttca tgtacggctc caaggcctac 2040
gtgaagcacc ccgccgacat ccccgactac ttgaagctgt ccttccccga gggcttcaag 2100
tgggagcgcg tgatgaactt cgaggacggc ggcgtggtga ccgtgaccca ggactcctcc 2160
ctgcaggacg gcgagttcat ctacaaggtg aagctgcgcg gcaccaactt cccctccgac 2220
ggccccgtaa tgcagaagaa gaccatgggc tgggaggcct cctccgagcg gatgtacccc 2280
gaggacggcg ccctgaaggg cgagatcaag cagaggctga agctgaagga cggcggccac 2340
tacgacgctg aggtcaagac cacctacaag gccaagaagc ccgtgcagct gcccggcgcc 2400
tacaacgtca acatcaagtt ggacatcacc tcccacaacg aggactacac catcgtggaa 2460
cagtacgaac gcgccgaggg ccgccactcc accggcggca tggacgagct gtacaagtaa 2520
caccggtggc gcgttaagtc gacaatcaac ctctggatta caaaatttgt gaaagattga 2580
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2640
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2700
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 2760
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 2820
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 2880
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 2940
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3000
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3060
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3120
ccgcctcccc gcgtcgactt taagaccaat gacttacaag gcagctgtag atcttagcca 3180
ctttttaaaa gaaaaggggg gactggaagg gctaattcac tcccaacgaa gacaagatct 3240
gctttttgct tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg 3300
ctaactaggg aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt 3360
gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt 3420
gtggaaaatc tctagcagta cgtatagtag ttcatgtcat cttattattc agtatttata 3480
acttgcaaag aaatgaatat cagagagtga gaggaacttg tttattgcag cttataatgg 3540
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 3600
tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctggctct agctatcccg 3660
cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat 3720
ggctgactaa ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc 3780
cagaagtagt gaggaggctt ttttggaggc ctagggacgt acccaattcg ccctatagtg 3840
agtcgtatta cgcgcgctca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 3900
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 3960
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatgggacg 4020
cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4080
cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4140
tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4200
ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat 4260
cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4320
tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag 4380
ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4440
cgaattttaa caaaatatta acgcttacaa tttaggtggc acttttcggg gaaatgtgcg 4500
cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4560
ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 4620
ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 4680
aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 4740
actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 4800
gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 4860
agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 4920
cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 4980
catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5040
aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5100
gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5160
aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5220
agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5280
ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5340
actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5400
aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5460
gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5520
atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 5580
tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 5640
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 5700
ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 5760
agcgcagata ccaaatactg ttcttctagt gtagccgtag ttaggccacc acttcaagaa 5820
ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 5880
tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 5940
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6000
cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa 6060
ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6120
agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6180
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6240
ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6300
ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6360
ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6420
accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6480
ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6540
ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 6600
atttcacaca ggaaacagct atgaccatga ttacgccaag cgcgcaatta accctcacta 6660
aagggaacaa aagctggagc tgcaagctta atgtagtctt atgcaatact cttgtagtct 6720
tgcaacatgg taacgatgag ttagcaacat gccttacaag gagagaaaaa gcaccgtgca 6780
tgccgattgg tggaagtaag gtggtacgat cgtgccttat taggaaggca acagacgggt 6840
ctgacatgga ttggacgaac cactgaattg ccgcattgca gagatattgt atttaagtgc 6900
ctagctcgat acataaacgg gtctctctgg ttagaccaga tctgagcctg ggagctctct 6960
ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt gcttcaagta 7020
gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc cttttagtca 7080
gtgtggaaaa tctctagcag tggcgcccga acagggactt gaaagcgaaa gggaaaccag 7140
aggagctctc tcgacgcagg actcggcttg ctgaagcgcg cacggcaaga ggcgaggggc 7200
ggcgactggt gagtacgcca aaaattttga ctagcggagg ctagaaggag agagatgggt 7260
gcgagagcgt cagtattaag cgggggagaa ttagatcgcg atgggaaaaa attcggttaa 7320
ggccaggggg aaagaaaaaa tataaattaa aacatatagt atgggcaagc agggagctag 7380
aacgattcgc agttaatcct ggcctgttag aaacatcaga aggctgtaga caaatactgg 7440
gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacag 7500
tagcaaccct ctattgtgtg catcaaagga tagagataaa agacaccaag gaagctttag 7560
acaagataga ggaagagcaa aacaaaagta agaccaccgc acagcaagcg gccgctgatc 7620
ttcagacctg gaggaggaga tatgagggac aattggagaa gtgaattata taaatataaa 7680
gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg caaagagaag agtggtgcag 7740
agagaaaaaa gagcagtggg aataggagct ttgttccttg ggttcttggg agcagcagga 7800
agcactatgg gcgcagcgtc aatgacgctg acggtacagg ccagacaatt attgtctggt 7860
atagtgcagc agcagaacaa tttgctgagg gctattgagg cgcaacagca tctgttgcaa 7920
ctcacagtct ggggcatcaa gcagctccag gcaagaatcc tggctgtgga aagataccta 7980
aaggatcaac agctcctggg gatttggggt tgctctggaa aactcatttg caccactgct 8040
gtgccttgga atgctagttg gagtaataaa tctctggaac agatttggaa tcacacgacc 8100
tggatggagt gggacagaga aattaacaat tacacaagct taatacactc cttaattgaa 8160
gaatcgcaaa accagcaaga aaagaatgaa caagaattat tggaattaga taaatgggca 8220
agtttgtgga attggtttaa cataacaaat tggctgtggt atataaaatt attcataatg 8280
atagtaggag gcttggtagg tttaagaata gtttttgctg tactttctat agtgaataga 8340
gttaggcagg gatattcacc attatcgttt cagacccacc tcccaacccc gaggggaccc 8400
ttgcgccttt tccaaggcag ccctgggttt gcgcagggac gcggctgctc tgggcgtggt 8460
tccgggaaac gcagcggcgc cgaccctggg tctcgcacat tcttcacgtc cgttcgcagc 8520
gtcacccgga tcttcgccgc tacccttgtg ggccccccgg cgacgcttcc tgctccgccc 8580
ctaagtcggg aaggttcctt gcggttcgcg gcgtgccgga cgtgacaaac ggaagccgca 8640
cgtctcacta gtaccctcgc agacggacag cgccagggag caatggcagc gcgccgaccg 8700
cgatgggctg tggccaatag cggctgctca gcagggcgcg ccgagagcag cggccgggaa 8760
ggggcggtgc gggaggcggg gtgtggggcg gtagtgtggg ccctgttcct gcccgcgcgg 8820
tgttccgcat tctgcaagcc tccggagcgc acgtcggcag tcggctccct cgttgaccga 8880
atcaccgacc tctctcccca gggggtaccc agctgtctag agaattctag atcttgagac 8940
aaatggcagt attcatccac aattttaaaa gaaaaggggg gattgggggg tacagtgcag 9000
gggaaagaat agtagacata atagcaacag acatacaaac taaagaatta caaaaacaaa 9060
ttacaaaaat tcaaaatttt cgggtttatt acagggacag cagagatcca ctttggcgcc 9120
ggctcgaggg gg 9132
<210> 141
<211> 160
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 141
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val
35 40 45
Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Met Glu Gln Gln
50 55 60
Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser Pro Arg
65 70 75 80
Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr Gln Ser
85 90 95
Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly Arg Cys
100 105 110
Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val Gly Arg Thr Gln Ile
115 120 125
Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly Cys Val Glu Val Leu
130 135 140
Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg Pro Arg Ala Lys Glu
145 150 155 160
<210> 142
<211> 9729
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 142
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 1320
gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 1380
tacttcgagc ggaggcatcc ggagcttgca ggatcgtgcc tttcatacga gaccgagatc 1440
ctgactgtcg agtacggatt gcttcctatc ggcaaaatcg tggagaagag gattgaatgt 1500
accgtctatt cagtcgataa taatgggaac atctacacac agcccgtggc tcaatggcac 1560
gacagaggag agcaggaagt ttttgaatac tgtctcgagg acggatccct catccgcgct 1620
actaaagatc ataagtttat gaccgtggac ggccagatgc tgccaattga cgaaattttt 1680
gaacgagagc tggatctgat gagagtcgac aaccttccaa actgattaat taagaattcg 1740
acccagcttt cttgtacaaa gtggttggta agcctatccc taaccctctc ctcggtctcg 1800
attctacgta gtaatgagct agcagtctcg aggttaacga attccgcccc ccccctaacg 1860
ttactggccg aagccgcttg gaataaggcc ggtgtgcgct tgtctatatg ttattttcca 1920
ccatattgcc gtcttttggc aatgtgaggg cccggaaacc tggccctgtc ttcttgacga 1980
gcattcctag gggtctttcc cctctcgcca aaggaatgca aggtctgttg aatgtcgtga 2040
aggaagcagt tcctctggaa gcttcttgaa gacaaacaac gtctgtagcg accctttgca 2100
ggcagcggaa ccccccacct ggcgacaggt gcccctgcgg ccaaaagcca cgtgtataag 2160
atacacctgc aaaggcggca caaccccagt gccacgttgt gagttggata gttgtggaaa 2220
gagtcaaatg gctctcctca agcgtattca acaaggggct gaaggatgcc cagaaggtac 2280
cccattgtat gggatctgat ctggggcctc ggtgcacatg ctttacatgt gtttagtcga 2340
ggttaaaaaa acgtctaggc cccccgaacc acggggacgt ggttttcctt tgaaaaacac 2400
gataatacca tggccatgag cgagctgatt aaggagaaca tgcacatgaa gctgtacatg 2460
gagggcaccg tggacaacca tcacttcaag tgcacatccg agggcgaagg caagccctac 2520
gagggcaccc agaccatgag aatcaaggtg gtcgagggcg gccctctccc cttcgccttc 2580
gacatcctgg ctactagctt cctctacggc agcaagacct tcatcaacca cacccagggc 2640
atccccgact tcttcaagca gtccttccct gagggcttca catgggagag agtcaccaca 2700
tacgaagacg ggggcgtgct gaccgctacc caggacacca gcctccagga cggctgcctc 2760
atctacaacg tcaagatcag aggggtgaac ttcacatcca acggccctgt gatgcagaag 2820
aaaacactcg gctgggaggc cttcaccgag acgctgtacc ccgctgacgg cggcctggaa 2880
ggcagaaacg acatggccct gaagctcgtg ggcgggagcc atctgatcgc aaacatcaag 2940
accacatata gatccaagaa acccgctaag aacctcaaga tgcctggcgt ctactatgtg 3000
gactacagac tggaaagaat caaggaggcc aacaacgaga cctacgtcga gcagcacgag 3060
gtggcagtgg ccagatactg cgacctccct agcaaactgg ggcacaagct taattaacac 3120
cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 3180
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 3240
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 3300
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 3360
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 3420
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 3480
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 3540
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 3600
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 3660
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 3720
cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt 3780
tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac aagatctgct 3840
ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta 3900
actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg 3960
tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg 4020
gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt atttataact 4080
tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt ataatggtta 4140
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 4200
ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc tatcccgccc 4260
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 4320
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 4380
aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc tatagtgagt 4440
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 4500
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 4560
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc 4620
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 4680
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 4740
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 4800
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 4860
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 4920
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 4980
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 5040
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa atgtgcgcgg 5100
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 5160
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 5220
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 5280
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 5340
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 5400
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 5460
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 5520
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 5580
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 5640
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 5700
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac 5760
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga 5820
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg 5880
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact 5940
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 6000
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta 6060
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt 6120
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga 6180
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc 6240
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 6300
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc 6360
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc 6420
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg 6480
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg 6540
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 6600
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc 6660
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg 6720
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg 6780
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt 6840
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 6900
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg 6960
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc 7020
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 7080
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 7140
ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 7200
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc ctcactaaag 7260
ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt gtagtcttgc 7320
aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc 7380
cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg 7440
acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt taagtgccta 7500
gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga gctctctggc 7560
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 7620
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 7680
tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg aaaccagagg 7740
agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 7800
gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 7860
agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt cggttaaggc 7920
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 7980
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 8040
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 8100
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 8160
agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc gctgatcttc 8220
agacctggag gaggagatat gagggacaat tggagaagtg aattatataa atataaagta 8280
gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga 8340
gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc 8400
actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt gtctggtata 8460
gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc 8520
acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag atacctaaag 8580
gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg 8640
ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca cacgacctgg 8700
atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa 8760
tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt 8820
ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata 8880
gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt 8940
aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag gggacccttg 9000
cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg gcgtggttcc 9060
gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt tcgcagcgtc 9120
acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc tccgccccta 9180
agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga agccgcacgt 9240
ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg ccgaccgcga 9300
tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg ccgggaaggg 9360
gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc cgcgcggtgt 9420
tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt tgaccgaatc 9480
accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc ttgagacaaa 9540
tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac agtgcagggg 9600
aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa aaacaaatta 9660
caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt tggcgccggc 9720
tcgaggggg 9729
<210> 143
<211> 361
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 143
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
195 200 205
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
210 215 220
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
225 230 235 240
Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu
245 250 255
Ala Gly Ser Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr
260 265 270
Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr
275 280 285
Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala
290 295 300
Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu
305 310 315 320
Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val
325 330 335
Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp
340 345 350
Leu Met Arg Val Asp Asn Leu Pro Asn
355 360
<210> 144
<211> 9009
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 144
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcccgcggc tccgggcgta tatgctccgc 780
attggtcttg accaactcta tcagagcttg gttgacggca atttcgatga tgcagcttgg 840
gcgcagggtc gatgcgacgc aatcgtccga tccggagccg ggactgtcgg gcgtacacaa 900
atcgcccgca gaagcgcggc cgtctggacc gatggctgtg tagaagtact cgccgatagt 960
ggaaaccgac gccccagcac tcgtccgagg gcaaaggaat agttaattaa gaattcgacc 1020
cagctttctt gtacaaagtg gttggtaagc ctatccctaa ccctctcctc ggtctcgatt 1080
ctacgtagta atgagctagc agtctcgagg ttaacgaatt ccgccccccc cctaacgtta 1140
ctggccgaag ccgcttggaa taaggccggt gtgcgcttgt ctatatgtta ttttccacca 1200
tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 1260
ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 1320
aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 1380
agcggaaccc cccacctggc gacaggtgcc cctgcggcca aaagccacgt gtataagata 1440
cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 1500
tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 1560
attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 1620
taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat 1680
aataccatgg tgagcaaggg cgaggaggat aacatggcca tcatcaagga gttcatgcgc 1740
ttcaaggtgc acatggaggg ctccgtgaac ggccacgagt tcgagatcga gggcgagggc 1800
gagggccgcc cctacgaggg cacccagacc gccaagctga aggtgaccaa gggtggcccc 1860
ctgcccttcg cctgggacat cctgtcccct cagttcatgt acggctccaa ggcctacgtg 1920
aagcaccccg ccgacatccc cgactacttg aagctgtcct tccccgaggg cttcaagtgg 1980
gagcgcgtga tgaacttcga ggacggcggc gtggtgaccg tgacccagga ctcctccctg 2040
caggacggcg agttcatcta caaggtgaag ctgcgcggca ccaacttccc ctccgacggc 2100
cccgtaatgc agaagaagac catgggctgg gaggcctcct ccgagcggat gtaccccgag 2160
gacggcgccc tgaagggcga gatcaagcag aggctgaagc tgaaggacgg cggccactac 2220
gacgctgagg tcaagaccac ctacaaggcc aagaagcccg tgcagctgcc cggcgcctac 2280
aacgtcaaca tcaagttgga catcacctcc cacaacgagg actacaccat cgtggaacag 2340
tacgaacgcg ccgagggccg ccactccacc ggcggcatgg acgagctgta caagtaacac 2400
cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 2460
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 2520
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 2580
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 2640
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 2700
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 2760
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 2820
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 2880
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 2940
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 3000
cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt 3060
tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac aagatctgct 3120
ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta 3180
actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg 3240
tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg 3300
gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt atttataact 3360
tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt ataatggtta 3420
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 3480
ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc tatcccgccc 3540
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 3600
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 3660
aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc tatagtgagt 3720
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 3780
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 3840
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc 3900
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 3960
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 4020
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 4080
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 4140
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 4200
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 4260
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 4320
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa atgtgcgcgg 4380
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 4440
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 4500
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 4560
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 4620
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 4680
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 4740
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 4800
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 4860
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 4920
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 4980
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac 5040
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga 5100
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg 5160
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact 5220
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 5280
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta 5340
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt 5400
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga 5460
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc 5520
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 5580
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc 5640
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc 5700
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg 5760
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg 5820
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 5880
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc 5940
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg 6000
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg 6060
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt 6120
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 6180
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg 6240
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc 6300
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 6360
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 6420
ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 6480
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc ctcactaaag 6540
ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt gtagtcttgc 6600
aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc 6660
cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg 6720
acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt taagtgccta 6780
gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga gctctctggc 6840
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 6900
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 6960
tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg aaaccagagg 7020
agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 7080
gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 7140
agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt cggttaaggc 7200
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 7260
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 7320
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 7380
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 7440
agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc gctgatcttc 7500
agacctggag gaggagatat gagggacaat tggagaagtg aattatataa atataaagta 7560
gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga 7620
gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc 7680
actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt gtctggtata 7740
gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc 7800
acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag atacctaaag 7860
gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg 7920
ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca cacgacctgg 7980
atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa 8040
tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt 8100
ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata 8160
gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt 8220
aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag gggacccttg 8280
cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg gcgtggttcc 8340
gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt tcgcagcgtc 8400
acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc tccgccccta 8460
agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga agccgcacgt 8520
ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg ccgaccgcga 8580
tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg ccgggaaggg 8640
gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc cgcgcggtgt 8700
tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt tgaccgaatc 8760
accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc ttgagacaaa 8820
tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac agtgcagggg 8880
aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa aaacaaatta 8940
caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt tggcgccggc 9000
tcgaggggg 9009
<210> 145
<211> 119
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 145
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly
35 40 45
Leu Asp Gln Leu Tyr Gln Ser Leu Val Asp Gly Asn Phe Asp Asp Ala
50 55 60
Ala Trp Ala Gln Gly Arg Cys Asp Ala Ile Val Arg Ser Gly Ala Gly
65 70 75 80
Thr Val Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr
85 90 95
Asp Gly Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser
100 105 110
Thr Arg Pro Arg Ala Lys Glu
115
<210> 146
<211> 9783
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 146
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 1320
gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 1380
tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 1440
cgcattggtc ttgaccaact ctatcagagc tgcctttcat acgagaccga gatcctgact 1500
gtcgagtacg gattgcttcc tatcggcaaa atcgtggaga agaggattga atgtaccgtc 1560
tattcagtcg ataataatgg gaacatctac acacagcccg tggctcaatg gcacgacaga 1620
ggagagcagg aagtttttga atactgtctc gaggacggat ccctcatccg cgctactaaa 1680
gatcataagt ttatgaccgt ggacggccag atgctgccaa ttgacgaaat ttttgaacga 1740
gagctggatc tgatgagagt cgacaacctt ccaaactgat taattaagaa ttcgacccag 1800
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 1860
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 1920
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 1980
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2040
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2100
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2160
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2220
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2280
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2340
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2400
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2460
accatggcca tgagcgagct gattaaggag aacatgcaca tgaagctgta catggagggc 2520
accgtggaca accatcactt caagtgcaca tccgagggcg aaggcaagcc ctacgagggc 2580
acccagacca tgagaatcaa ggtggtcgag ggcggccctc tccccttcgc cttcgacatc 2640
ctggctacta gcttcctcta cggcagcaag accttcatca accacaccca gggcatcccc 2700
gacttcttca agcagtcctt ccctgagggc ttcacatggg agagagtcac cacatacgaa 2760
gacgggggcg tgctgaccgc tacccaggac accagcctcc aggacggctg cctcatctac 2820
aacgtcaaga tcagaggggt gaacttcaca tccaacggcc ctgtgatgca gaagaaaaca 2880
ctcggctggg aggccttcac cgagacgctg taccccgctg acggcggcct ggaaggcaga 2940
aacgacatgg ccctgaagct cgtgggcggg agccatctga tcgcaaacat caagaccaca 3000
tatagatcca agaaacccgc taagaacctc aagatgcctg gcgtctacta tgtggactac 3060
agactggaaa gaatcaagga ggccaacaac gagacctacg tcgagcagca cgaggtggca 3120
gtggccagat actgcgacct ccctagcaaa ctggggcaca agcttaatta acaccggtgg 3180
cgcgttaagt cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc 3240
ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg 3300
ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc 3360
tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg 3420
acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg 3480
ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga 3540
caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct 3600
ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg 3660
tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc 3720
ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc 3780
cgcgtcgact ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa 3840
agaaaagggg ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc 3900
ttgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg 3960
gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg 4020
tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat 4080
ctctagcagt acgtatagta gttcatgtca tcttattatt cagtatttat aacttgcaaa 4140
gaaatgaata tcagagagtg agaggaactt gtttattgca gcttataatg gttacaaata 4200
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4260
tttgtccaaa ctcatcaatg tatcttatca tgtctggctc tagctatccc gcccctaact 4320
ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta 4380
atttttttta tttatgcaga ggccgaggcc gcctcggcct ctgagctatt ccagaagtag 4440
tgaggaggct tttttggagg cctagggacg tacccaattc gccctatagt gagtcgtatt 4500
acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 4560
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 4620
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta 4680
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 4740
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 4800
ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc 4860
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 4920
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4980
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc 5040
cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta 5100
acaaaatatt aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5160
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5220
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5280
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5340
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5400
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5460
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5520
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5580
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 5640
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 5700
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 5760
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 5820
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 5880
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 5940
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 6000
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 6060
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 6120
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 6180
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 6240
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 6300
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 6360
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 6420
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 6480
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 6540
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 6600
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 6660
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 6720
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 6780
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 6840
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 6900
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 6960
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 7020
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 7080
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 7140
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 7200
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 7260
aggaaacagc tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca 7320
aaagctggag ctgcaagctt aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg 7380
gtaacgatga gttagcaaca tgccttacaa ggagagaaaa agcaccgtgc atgccgattg 7440
gtggaagtaa ggtggtacga tcgtgcctta ttaggaaggc aacagacggg tctgacatgg 7500
attggacgaa ccactgaatt gccgcattgc agagatattg tatttaagtg cctagctcga 7560
tacataaacg ggtctctctg gttagaccag atctgagcct gggagctctc tggctaacta 7620
gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc 7680
cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa 7740
atctctagca gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct 7800
ctcgacgcag gactcggctt gctgaagcgc gcacggcaag aggcgagggg cggcgactgg 7860
tgagtacgcc aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg 7920
tcagtattaa gcgggggaga attagatcgc gatgggaaaa aattcggtta aggccagggg 7980
gaaagaaaaa atataaatta aaacatatag tatgggcaag cagggagcta gaacgattcg 8040
cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg ggacagctac 8100
aaccatccct tcagacagga tcagaagaac ttagatcatt atataataca gtagcaaccc 8160
tctattgtgt gcatcaaagg atagagataa aagacaccaa ggaagcttta gacaagatag 8220
aggaagagca aaacaaaagt aagaccaccg cacagcaagc ggccgctgat cttcagacct 8280
ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa 8340
attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa 8400
agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg 8460
ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag 8520
cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc 8580
tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa 8640
cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg 8700
aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag 8760
tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa 8820
aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg 8880
aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga 8940
ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag 9000
ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc cttgcgcctt 9060
ttccaaggca gccctgggtt tgcgcaggga cgcggctgct ctgggcgtgg ttccgggaaa 9120
cgcagcggcg ccgaccctgg gtctcgcaca ttcttcacgt ccgttcgcag cgtcacccgg 9180
atcttcgccg ctacccttgt gggccccccg gcgacgcttc ctgctccgcc cctaagtcgg 9240
gaaggttcct tgcggttcgc ggcgtgccgg acgtgacaaa cggaagccgc acgtctcact 9300
agtaccctcg cagacggaca gcgccaggga gcaatggcag cgcgccgacc gcgatgggct 9360
gtggccaata gcggctgctc agcagggcgc gccgagagca gcggccggga aggggcggtg 9420
cgggaggcgg ggtgtggggc ggtagtgtgg gccctgttcc tgcccgcgcg gtgttccgca 9480
ttctgcaagc ctccggagcg cacgtcggca gtcggctccc tcgttgaccg aatcaccgac 9540
ctctctcccc agggggtacc cagctgtcta gagaattcta gatcttgaga caaatggcag 9600
tattcatcca caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa 9660
tagtagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa attacaaaaa 9720
ttcaaaattt tcgggtttat tacagggaca gcagagatcc actttggcgc cggctcgagg 9780
ggg 9783
<210> 147
<211> 379
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 147
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Ser Asn Asn Val Leu Thr Asp Asn
195 200 205
Gly Arg Ile Thr Ala Val Ile Asp Trp Ser Glu Ala Met Phe Gly Asp
210 215 220
Ser Gln Tyr Glu Val Ala Asn Ile Phe Phe Trp Arg Pro Trp Leu Ala
225 230 235 240
Cys Met Glu Gln Gln Thr Arg Tyr Phe Glu Arg Arg His Pro Glu Leu
245 250 255
Ala Gly Ser Pro Arg Leu Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp
260 265 270
Gln Leu Tyr Gln Ser Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val
275 280 285
Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu
290 295 300
Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro
305 310 315 320
Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys
325 330 335
Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met
340 345 350
Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu
355 360 365
Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
370 375
<210> 148
<211> 8955
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 148
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcttggttg acggcaattt cgatgatgca 780
gcttgggcgc agggtcgatg cgacgcaatc gtccgatccg gagccgggac tgtcgggcgt 840
acacaaatcg cccgcagaag cgcggccgtc tggaccgatg gctgtgtaga agtactcgcc 900
gatagtggaa accgacgccc cagcactcgt ccgagggcaa aggaatagtt aattaagaat 960
tcgacccagc tttcttgtac aaagtggttg gtaagcctat ccctaaccct ctcctcggtc 1020
tcgattctac gtagtaatga gctagcagtc tcgaggttaa cgaattccgc ccccccccta 1080
acgttactgg ccgaagccgc ttggaataag gccggtgtgc gcttgtctat atgttatttt 1140
ccaccatatt gccgtctttt ggcaatgtga gggcccggaa acctggccct gtcttcttga 1200
cgagcattcc taggggtctt tcccctctcg ccaaaggaat gcaaggtctg ttgaatgtcg 1260
tgaaggaagc agttcctctg gaagcttctt gaagacaaac aacgtctgta gcgacccttt 1320
gcaggcagcg gaacccccca cctggcgaca ggtgcccctg cggccaaaag ccacgtgtat 1380
aagatacacc tgcaaaggcg gcacaacccc agtgccacgt tgtgagttgg atagttgtgg 1440
aaagagtcaa atggctctcc tcaagcgtat tcaacaaggg gctgaaggat gcccagaagg 1500
taccccattg tatgggatct gatctggggc ctcggtgcac atgctttaca tgtgtttagt 1560
cgaggttaaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1620
cacgataata ccatggtgag caagggcgag gaggataaca tggccatcat caaggagttc 1680
atgcgcttca aggtgcacat ggagggctcc gtgaacggcc acgagttcga gatcgagggc 1740
gagggcgagg gccgccccta cgagggcacc cagaccgcca agctgaaggt gaccaagggt 1800
ggccccctgc ccttcgcctg ggacatcctg tcccctcagt tcatgtacgg ctccaaggcc 1860
tacgtgaagc accccgccga catccccgac tacttgaagc tgtccttccc cgagggcttc 1920
aagtgggagc gcgtgatgaa cttcgaggac ggcggcgtgg tgaccgtgac ccaggactcc 1980
tccctgcagg acggcgagtt catctacaag gtgaagctgc gcggcaccaa cttcccctcc 2040
gacggccccg taatgcagaa gaagaccatg ggctgggagg cctcctccga gcggatgtac 2100
cccgaggacg gcgccctgaa gggcgagatc aagcagaggc tgaagctgaa ggacggcggc 2160
cactacgacg ctgaggtcaa gaccacctac aaggccaaga agcccgtgca gctgcccggc 2220
gcctacaacg tcaacatcaa gttggacatc acctcccaca acgaggacta caccatcgtg 2280
gaacagtacg aacgcgccga gggccgccac tccaccggcg gcatggacga gctgtacaag 2340
taacaccggt ggcgcgttaa gtcgacaatc aacctctgga ttacaaaatt tgtgaaagat 2400
tgactggtat tcttaactat gttgctcctt ttacgctatg tggatacgct gctttaatgc 2460
ctttgtatca tgctattgct tcccgtatgg ctttcatttt ctcctccttg tataaatcct 2520
ggttgctgtc tctttatgag gagttgtggc ccgttgtcag gcaacgtggc gtggtgtgca 2580
ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc caccacctgt cagctccttt 2640
ccgggacttt cgctttcccc ctccctattg ccacggcgga actcatcgcc gcctgccttg 2700
cccgctgctg gacaggggct cggctgttgg gcactgacaa ttccgtggtg ttgtcgggga 2760
aatcatcgtc ctttccttgg ctgctcgcct gtgttgccac ctggattctg cgcgggacgt 2820
ccttctgcta cgtcccttcg gccctcaatc cagcggacct tccttcccgc ggcctgctgc 2880
cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca gacgagtcgg atctcccttt 2940
gggccgcctc cccgcgtcga ctttaagacc aatgacttac aaggcagctg tagatcttag 3000
ccacttttta aaagaaaagg ggggactgga agggctaatt cactcccaac gaagacaaga 3060
tctgcttttt gcttgtactg ggtctctctg gttagaccag atctgagcct gggagctctc 3120
tggctaacta gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt 3180
agtgtgtgcc cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc 3240
agtgtggaaa atctctagca gtacgtatag tagttcatgt catcttatta ttcagtattt 3300
ataacttgca aagaaatgaa tatcagagag tgagaggaac ttgtttattg cagcttataa 3360
tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca 3420
ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctggc tctagctatc 3480
ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 3540
catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta 3600
ttccagaagt agtgaggagg cttttttgga ggcctaggga cgtacccaat tcgccctata 3660
gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc 3720
ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 3780
gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggg 3840
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 3900
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 3960
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 4020
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 4080
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 4140
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 4200
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 4260
acgcgaattt taacaaaata ttaacgctta caatttaggt ggcacttttc ggggaaatgt 4320
gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 4380
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 4440
tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 4500
agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 4560
cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 4620
aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 4680
gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 4740
agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 4800
aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 4860
gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 4920
ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 4980
aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 5040
aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 5100
tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 5160
agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 5220
ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 5280
ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 5340
ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 5400
acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 5460
agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 5520
ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 5580
cagagcgcag ataccaaata ctgttcttct agtgtagccg tagttaggcc accacttcaa 5640
gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 5700
cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 5760
gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 5820
caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag 5880
aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 5940
tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 6000
gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 6060
ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 6120
atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 6180
cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 6240
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 6300
cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 6360
accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 6420
acaatttcac acaggaaaca gctatgacca tgattacgcc aagcgcgcaa ttaaccctca 6480
ctaaagggaa caaaagctgg agctgcaagc ttaatgtagt cttatgcaat actcttgtag 6540
tcttgcaaca tggtaacgat gagttagcaa catgccttac aaggagagaa aaagcaccgt 6600
gcatgccgat tggtggaagt aaggtggtac gatcgtgcct tattaggaag gcaacagacg 6660
ggtctgacat ggattggacg aaccactgaa ttgccgcatt gcagagatat tgtatttaag 6720
tgcctagctc gatacataaa cgggtctctc tggttagacc agatctgagc ctgggagctc 6780
tctggctaac tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttcaa 6840
gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga gatccctcag acccttttag 6900
tcagtgtgga aaatctctag cagtggcgcc cgaacaggga cttgaaagcg aaagggaaac 6960
cagaggagct ctctcgacgc aggactcggc ttgctgaagc gcgcacggca agaggcgagg 7020
ggcggcgact ggtgagtacg ccaaaaattt tgactagcgg aggctagaag gagagagatg 7080
ggtgcgagag cgtcagtatt aagcggggga gaattagatc gcgatgggaa aaaattcggt 7140
taaggccagg gggaaagaaa aaatataaat taaaacatat agtatgggca agcagggagc 7200
tagaacgatt cgcagttaat cctggcctgt tagaaacatc agaaggctgt agacaaatac 7260
tgggacagct acaaccatcc cttcagacag gatcagaaga acttagatca ttatataata 7320
cagtagcaac cctctattgt gtgcatcaaa ggatagagat aaaagacacc aaggaagctt 7380
tagacaagat agaggaagag caaaacaaaa gtaagaccac cgcacagcaa gcggccgctg 7440
atcttcagac ctggaggagg agatatgagg gacaattgga gaagtgaatt atataaatat 7500
aaagtagtaa aaattgaacc attaggagta gcacccacca aggcaaagag aagagtggtg 7560
cagagagaaa aaagagcagt gggaatagga gctttgttcc ttgggttctt gggagcagca 7620
ggaagcacta tgggcgcagc gtcaatgacg ctgacggtac aggccagaca attattgtct 7680
ggtatagtgc agcagcagaa caatttgctg agggctattg aggcgcaaca gcatctgttg 7740
caactcacag tctggggcat caagcagctc caggcaagaa tcctggctgt ggaaagatac 7800
ctaaaggatc aacagctcct ggggatttgg ggttgctctg gaaaactcat ttgcaccact 7860
gctgtgcctt ggaatgctag ttggagtaat aaatctctgg aacagatttg gaatcacacg 7920
acctggatgg agtgggacag agaaattaac aattacacaa gcttaataca ctccttaatt 7980
gaagaatcgc aaaaccagca agaaaagaat gaacaagaat tattggaatt agataaatgg 8040
gcaagtttgt ggaattggtt taacataaca aattggctgt ggtatataaa attattcata 8100
atgatagtag gaggcttggt aggtttaaga atagtttttg ctgtactttc tatagtgaat 8160
agagttaggc agggatattc accattatcg tttcagaccc acctcccaac cccgagggga 8220
cccttgcgcc ttttccaagg cagccctggg tttgcgcagg gacgcggctg ctctgggcgt 8280
ggttccggga aacgcagcgg cgccgaccct gggtctcgca cattcttcac gtccgttcgc 8340
agcgtcaccc ggatcttcgc cgctaccctt gtgggccccc cggcgacgct tcctgctccg 8400
cccctaagtc gggaaggttc cttgcggttc gcggcgtgcc ggacgtgaca aacggaagcc 8460
gcacgtctca ctagtaccct cgcagacgga cagcgccagg gagcaatggc agcgcgccga 8520
ccgcgatggg ctgtggccaa tagcggctgc tcagcagggc gcgccgagag cagcggccgg 8580
gaaggggcgg tgcgggaggc ggggtgtggg gcggtagtgt gggccctgtt cctgcccgcg 8640
cggtgttccg cattctgcaa gcctccggag cgcacgtcgg cagtcggctc cctcgttgac 8700
cgaatcaccg acctctctcc ccagggggta cccagctgtc tagagaattc tagatcttga 8760
gacaaatggc agtattcatc cacaatttta aaagaaaagg ggggattggg gggtacagtg 8820
caggggaaag aatagtagac ataatagcaa cagacataca aactaaagaa ttacaaaaac 8880
aaattacaaa aattcaaaat tttcgggttt attacaggga cagcagagat ccactttggc 8940
gccggctcga ggggg 8955
<210> 149
<211> 101
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 149
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Leu Val Asp Gly Asn Phe Asp Asp Ala Ala Trp
35 40 45
Ala Gln Gly Arg Cys Asp Ala Ile Val Arg Ser Gly Ala Gly Thr Val
50 55 60
Gly Arg Thr Gln Ile Ala Arg Arg Ser Ala Ala Val Trp Thr Asp Gly
65 70 75 80
Cys Val Glu Val Leu Ala Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg
85 90 95
Pro Arg Ala Lys Glu
100
<210> 150
<211> 9048
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 150
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgcctgcct ttcatacgag accgagatcc tgactgtcga gtacggattg 780
cttcctatcg gcaaaatcgt ggagaagagg attgaatgta ccgtctattc agtcgataat 840
aatgggaaca tctacacaca gcccgtggct caatggcacg acagaggaga gcaggaagtt 900
tttgaatact gtctcgagga cggatccctc atccgcgcta ctaaagatca taagtttatg 960
accgtggacg gccagatgct gccaattgac gaaatttttg aacgagagct ggatctgatg 1020
agagtcgaca accttccaaa ctgattaatt aagaattcga cccagctttc ttgtacaaag 1080
tggttggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtag taatgagcta 1140
gcagtctcga ggttaacgaa ttccgccccc cccctaacgt tactggccga agccgcttgg 1200
aataaggccg gtgtgcgctt gtctatatgt tattttccac catattgccg tcttttggca 1260
atgtgagggc ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc 1320
ctctcgccaa aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag 1380
cttcttgaag acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg 1440
gcgacaggtg cccctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac 1500
aaccccagtg ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa 1560
gcgtattcaa caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc 1620
tggggcctcg gtgcacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc 1680
ccccgaacca cggggacgtg gttttccttt gaaaaacacg ataataccat ggccatgagc 1740
gagctgatta aggagaacat gcacatgaag ctgtacatgg agggcaccgt ggacaaccat 1800
cacttcaagt gcacatccga gggcgaaggc aagccctacg agggcaccca gaccatgaga 1860
atcaaggtgg tcgagggcgg ccctctcccc ttcgccttcg acatcctggc tactagcttc 1920
ctctacggca gcaagacctt catcaaccac acccagggca tccccgactt cttcaagcag 1980
tccttccctg agggcttcac atgggagaga gtcaccacat acgaagacgg gggcgtgctg 2040
accgctaccc aggacaccag cctccaggac ggctgcctca tctacaacgt caagatcaga 2100
ggggtgaact tcacatccaa cggccctgtg atgcagaaga aaacactcgg ctgggaggcc 2160
ttcaccgaga cgctgtaccc cgctgacggc ggcctggaag gcagaaacga catggccctg 2220
aagctcgtgg gcgggagcca tctgatcgca aacatcaaga ccacatatag atccaagaaa 2280
cccgctaaga acctcaagat gcctggcgtc tactatgtgg actacagact ggaaagaatc 2340
aaggaggcca acaacgagac ctacgtcgag cagcacgagg tggcagtggc cagatactgc 2400
gacctcccta gcaaactggg gcacaagctt aattaacacc ggtggcgcgt taagtcgaca 2460
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc 2520
cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta 2580
tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt 2640
ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg 2700
gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta 2760
ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt 2820
tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg 2880
cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca 2940
atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc 3000
gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgcgt cgactttaag 3060
accaatgact tacaaggcag ctgtagatct tagccacttt ttaaaagaaa aggggggact 3120
ggaagggcta attcactccc aacgaagaca agatctgctt tttgcttgta ctgggtctct 3180
ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa 3240
gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc 3300
tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtacgta 3360
tagtagttca tgtcatctta ttattcagta tttataactt gcaaagaaat gaatatcaga 3420
gagtgagagg aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 3480
aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 3540
caatgtatct tatcatgtct ggctctagct atcccgcccc taactccgcc catcccgccc 3600
ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 3660
gcagaggccg aggccgcctc ggcctctgag ctattccaga agtagtgagg aggctttttt 3720
ggaggcctag ggacgtaccc aattcgccct atagtgagtc gtattacgcg cgctcactgg 3780
ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 3840
cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 3900
cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc gcattaagcg 3960
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 4020
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 4080
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 4140
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 4200
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 4260
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 4320
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 4380
ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 4440
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 4500
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 4560
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 4620
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 4680
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 4740
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 4800
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 4860
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 4920
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 4980
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5040
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5100
cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5160
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5220
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5280
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5340
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5400
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5460
cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 5520
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 5580
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 5640
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 5700
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 5760
cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 5820
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 5880
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 5940
gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6000
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6060
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6120
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6180
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6240
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6300
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6360
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6420
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6480
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga 6540
ccatgattac gccaagcgcg caattaaccc tcactaaagg gaacaaaagc tggagctgca 6600
agcttaatgt agtcttatgc aatactcttg tagtcttgca acatggtaac gatgagttag 6660
caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc gattggtgga agtaaggtgg 6720
tacgatcgtg ccttattagg aaggcaacag acgggtctga catggattgg acgaaccact 6780
gaattgccgc attgcagaga tattgtattt aagtgcctag ctcgatacat aaacgggtct 6840
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 6900
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 6960
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 7020
gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 7080
ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 7140
ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 7200
ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 7260
aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 7320
tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 7380
caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 7440
aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 7500
aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 7560
agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 7620
gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 7680
ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 7740
acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 7800
ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 7860
ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 7920
tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 7980
aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 8040
aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 8100
aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 8160
acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 8220
agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 8280
tcgtttcaga cccacctccc aaccccgagg ggacccttgc gccttttcca aggcagccct 8340
gggtttgcgc agggacgcgg ctgctctggg cgtggttccg ggaaacgcag cggcgccgac 8400
cctgggtctc gcacattctt cacgtccgtt cgcagcgtca cccggatctt cgccgctacc 8460
cttgtgggcc ccccggcgac gcttcctgct ccgcccctaa gtcgggaagg ttccttgcgg 8520
ttcgcggcgt gccggacgtg acaaacggaa gccgcacgtc tcactagtac cctcgcagac 8580
ggacagcgcc agggagcaat ggcagcgcgc cgaccgcgat gggctgtggc caatagcggc 8640
tgctcagcag ggcgcgccga gagcagcggc cgggaagggg cggtgcggga ggcggggtgt 8700
ggggcggtag tgtgggccct gttcctgccc gcgcggtgtt ccgcattctg caagcctccg 8760
gagcgcacgt cggcagtcgg ctccctcgtt gaccgaatca ccgacctctc tccccagggg 8820
gtacccagct gtctagagaa ttctagatct tgagacaaat ggcagtattc atccacaatt 8880
ttaaaagaaa aggggggatt ggggggtaca gtgcagggga aagaatagta gacataatag 8940
caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 9000
tttattacag ggacagcaga gatccacttt ggcgccggct cgaggggg 9048
<210> 151
<211> 134
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 151
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu
35 40 45
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser
50 55 60
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His
65 70 75 80
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser
85 90 95
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln
100 105 110
Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg
115 120 125
Val Asp Asn Leu Pro Asn
130
<210> 152
<211> 9264
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 152
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcacgcgcc acaccgtcga tccggaccgc 780
cacatcgagc gggtcaccga gctgcaagaa ctcttcctca cgcgcgtcgg gctcgacatc 840
ggcaaggtgt gggtcgcgga cgacggcgcc gcggtggcgg tctggaccac gccggagagc 900
gtcgaagcgg gggcggtgtt cgccgagatc ggcccgcgca tggccgagtt gagcggttcc 960
cggctggccg cgcagcaaca gatggaaggc ctcctggcgc cgcaccggcc caaggagccc 1020
gcgtggttcc tggccaccgt cggcgtctcg cccgaccacc agggcaaggg tctgggcagc 1080
gccgtcgtgc tccccggagt ggaggcggcc gagcgcgccg gggtgcccgc cttcctggag 1140
acctccgcgc cccgcaacct ccccttctac gagcggctcg gcttcaccgt caccgccgac 1200
gtcgaggtgc ccgaaggacc gcgcacctgg tgcatgaccc gcaagcccgg tgcctgatta 1260
attaagaatt cgacccagct ttcttgtaca aagtggttgg taagcctatc cctaaccctc 1320
tcctcggtct cgattctacg tagtaatgag ctagcagtct cgaggttaac gaattccgcc 1380
ccccccctaa cgttactggc cgaagccgct tggaataagg ccggtgtgcg cttgtctata 1440
tgttattttc caccatattg ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg 1500
tcttcttgac gagcattcct aggggtcttt cccctctcgc caaaggaatg caaggtctgt 1560
tgaatgtcgt gaaggaagca gttcctctgg aagcttcttg aagacaaaca acgtctgtag 1620
cgaccctttg caggcagcgg aaccccccac ctggcgacag gtgcccctgc ggccaaaagc 1680
cacgtgtata agatacacct gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga 1740
tagttgtgga aagagtcaaa tggctctcct caagcgtatt caacaagggg ctgaaggatg 1800
cccagaaggt accccattgt atgggatctg atctggggcc tcggtgcaca tgctttacat 1860
gtgtttagtc gaggttaaaa aaacgtctag gccccccgaa ccacggggac gtggttttcc 1920
tttgaaaaac acgataatac catggtgagc aagggcgagg aggataacat ggccatcatc 1980
aaggagttca tgcgcttcaa ggtgcacatg gagggctccg tgaacggcca cgagttcgag 2040
atcgagggcg agggcgaggg ccgcccctac gagggcaccc agaccgccaa gctgaaggtg 2100
accaagggtg gccccctgcc cttcgcctgg gacatcctgt cccctcagtt catgtacggc 2160
tccaaggcct acgtgaagca ccccgccgac atccccgact acttgaagct gtccttcccc 2220
gagggcttca agtgggagcg cgtgatgaac ttcgaggacg gcggcgtggt gaccgtgacc 2280
caggactcct ccctgcagga cggcgagttc atctacaagg tgaagctgcg cggcaccaac 2340
ttcccctccg acggccccgt aatgcagaag aagaccatgg gctgggaggc ctcctccgag 2400
cggatgtacc ccgaggacgg cgccctgaag ggcgagatca agcagaggct gaagctgaag 2460
gacggcggcc actacgacgc tgaggtcaag accacctaca aggccaagaa gcccgtgcag 2520
ctgcccggcg cctacaacgt caacatcaag ttggacatca cctcccacaa cgaggactac 2580
accatcgtgg aacagtacga acgcgccgag ggccgccact ccaccggcgg catggacgag 2640
ctgtacaagt aacaccggtg gcgcgttaag tcgacaatca acctctggat tacaaaattt 2700
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 2760
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 2820
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 2880
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 2940
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 3000
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 3060
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 3120
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 3180
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 3240
tctccctttg ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt 3300
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 3360
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 3420
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 3480
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 3540
cttttagtca gtgtggaaaa tctctagcag tacgtatagt agttcatgtc atcttattat 3600
tcagtattta taacttgcaa agaaatgaat atcagagagt gagaggaact tgtttattgc 3660
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 3720
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggct 3780
ctagctatcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat 3840
tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc 3900
tctgagctat tccagaagta gtgaggaggc ttttttggag gcctagggac gtacccaatt 3960
cgccctatag tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 4020
gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 4080
ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg 4140
gcgaatggga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4200
gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct 4260
ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt 4320
tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac 4380
gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct 4440
ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4500
ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac 4560
aaaaatttaa cgcgaatttt aacaaaatat taacgcttac aatttaggtg gcacttttcg 4620
gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc 4680
gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag 4740
tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt 4800
tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 4860
gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga 4920
acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat 4980
tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga 5040
gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag 5100
tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg 5160
accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg 5220
ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5280
agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg 5340
gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc 5400
ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg 5460
tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac 5520
ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact 5580
gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa 5640
acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa 5700
aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg 5760
atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc 5820
gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac 5880
tggcttcagc agagcgcaga taccaaatac tgttcttcta gtgtagccgt agttaggcca 5940
ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt 6000
ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc 6060
ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 6120
aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6180
cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6240
gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct 6300
ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 6360
cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt 6420
tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac 6480
cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6540
cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga 6600
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac 6660
tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt 6720
gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca agcgcgcaat 6780
taaccctcac taaagggaac aaaagctgga gctgcaagct taatgtagtc ttatgcaata 6840
ctcttgtagt cttgcaacat ggtaacgatg agttagcaac atgccttaca aggagagaaa 6900
aagcaccgtg catgccgatt ggtggaagta aggtggtacg atcgtgcctt attaggaagg 6960
caacagacgg gtctgacatg gattggacga accactgaat tgccgcattg cagagatatt 7020
gtatttaagt gcctagctcg atacataaac gggtctctct ggttagacca gatctgagcc 7080
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 7140
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 7200
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 7260
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 7320
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 7380
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 7440
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 7500
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 7560
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 7620
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 7680
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 7740
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 7800
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 7860
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 7920
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 7980
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 8040
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 8100
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 8160
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 8220
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 8280
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 8340
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 8400
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 8460
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 8520
ccgaggggac ccttgcgcct tttccaaggc agccctgggt ttgcgcaggg acgcggctgc 8580
tctgggcgtg gttccgggaa acgcagcggc gccgaccctg ggtctcgcac attcttcacg 8640
tccgttcgca gcgtcacccg gatcttcgcc gctacccttg tgggcccccc ggcgacgctt 8700
cctgctccgc ccctaagtcg ggaaggttcc ttgcggttcg cggcgtgccg gacgtgacaa 8760
acggaagccg cacgtctcac tagtaccctc gcagacggac agcgccaggg agcaatggca 8820
gcgcgccgac cgcgatgggc tgtggccaat agcggctgct cagcagggcg cgccgagagc 8880
agcggccggg aaggggcggt gcgggaggcg gggtgtgggg cggtagtgtg ggccctgttc 8940
ctgcccgcgc ggtgttccgc attctgcaag cctccggagc gcacgtcggc agtcggctcc 9000
ctcgttgacc gaatcaccga cctctctccc cagggggtac ccagctgtct agagaattct 9060
agatcttgag acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg 9120
ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat 9180
tacaaaaaca aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc 9240
cactttggcg ccggctcgag gggg 9264
<210> 153
<211> 204
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 153
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Thr Arg His Thr Val Asp Pro Asp Arg His Ile
35 40 45
Glu Arg Val Thr Glu Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu
50 55 60
Asp Ile Gly Lys Val Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val
65 70 75 80
Trp Thr Thr Pro Glu Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile
85 90 95
Gly Pro Arg Met Ala Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln
100 105 110
Gln Met Glu Gly Leu Leu Ala Pro His Arg Pro Lys Glu Pro Ala Trp
115 120 125
Phe Leu Ala Thr Val Gly Val Ser Pro Asp His Gln Gly Lys Gly Leu
130 135 140
Gly Ser Ala Val Val Leu Pro Gly Val Glu Ala Ala Glu Arg Ala Gly
145 150 155 160
Val Pro Ala Phe Leu Glu Thr Ser Ala Pro Arg Asn Leu Pro Phe Tyr
165 170 175
Glu Arg Leu Gly Phe Thr Val Thr Ala Asp Val Glu Val Pro Glu Gly
180 185 190
Pro Arg Thr Trp Cys Met Thr Arg Lys Pro Gly Ala
195 200
<210> 154
<211> 9204
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 154
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gtgcctttca 900
tacgagaccg agatcctgac tgtcgagtac ggattgcttc ctatcggcaa aatcgtggag 960
aagaggattg aatgtaccgt ctattcagtc gataataatg ggaacatcta cacacagccc 1020
gtggctcaat ggcacgacag aggagagcag gaagtttttg aatactgtct cgaggacgga 1080
tccctcatcc gcgctactaa agatcataag tttatgaccg tggacggcca gatgctgcca 1140
attgacgaaa tttttgaacg agagctggat ctgatgagag tcgacaacct tccaaactga 1200
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 1260
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1320
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1380
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1440
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1500
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1560
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 1620
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 1680
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 1740
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 1800
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 1860
tcctttgaaa aacacgataa taccatggcc atgagcgagc tgattaagga gaacatgcac 1920
atgaagctgt acatggaggg caccgtggac aaccatcact tcaagtgcac atccgagggc 1980
gaaggcaagc cctacgaggg cacccagacc atgagaatca aggtggtcga gggcggccct 2040
ctccccttcg ccttcgacat cctggctact agcttcctct acggcagcaa gaccttcatc 2100
aaccacaccc agggcatccc cgacttcttc aagcagtcct tccctgaggg cttcacatgg 2160
gagagagtca ccacatacga agacgggggc gtgctgaccg ctacccagga caccagcctc 2220
caggacggct gcctcatcta caacgtcaag atcagagggg tgaacttcac atccaacggc 2280
cctgtgatgc agaagaaaac actcggctgg gaggccttca ccgagacgct gtaccccgct 2340
gacggcggcc tggaaggcag aaacgacatg gccctgaagc tcgtgggcgg gagccatctg 2400
atcgcaaaca tcaagaccac atatagatcc aagaaacccg ctaagaacct caagatgcct 2460
ggcgtctact atgtggacta cagactggaa agaatcaagg aggccaacaa cgagacctac 2520
gtcgagcagc acgaggtggc agtggccaga tactgcgacc tccctagcaa actggggcac 2580
aagcttaatt aacaccggtg gcgcgttaag tcgacaatca acctctggat tacaaaattt 2640
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 2700
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 2760
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 2820
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 2880
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 2940
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 3000
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 3060
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 3120
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 3180
tctccctttg ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt 3240
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 3300
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 3360
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 3420
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 3480
cttttagtca gtgtggaaaa tctctagcag tacgtatagt agttcatgtc atcttattat 3540
tcagtattta taacttgcaa agaaatgaat atcagagagt gagaggaact tgtttattgc 3600
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 3660
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggct 3720
ctagctatcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat 3780
tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc 3840
tctgagctat tccagaagta gtgaggaggc ttttttggag gcctagggac gtacccaatt 3900
cgccctatag tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 3960
gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 4020
ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg 4080
gcgaatggga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4140
gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct 4200
ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt 4260
tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac 4320
gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct 4380
ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4440
ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac 4500
aaaaatttaa cgcgaatttt aacaaaatat taacgcttac aatttaggtg gcacttttcg 4560
gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc 4620
gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag 4680
tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt 4740
tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 4800
gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga 4860
acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat 4920
tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga 4980
gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag 5040
tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg 5100
accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg 5160
ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5220
agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg 5280
gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc 5340
ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg 5400
tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac 5460
ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact 5520
gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa 5580
acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa 5640
aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg 5700
atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc 5760
gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac 5820
tggcttcagc agagcgcaga taccaaatac tgttcttcta gtgtagccgt agttaggcca 5880
ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt 5940
ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc 6000
ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 6060
aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6120
cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6180
gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct 6240
ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 6300
cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt 6360
tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac 6420
cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6480
cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga 6540
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac 6600
tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt 6660
gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca agcgcgcaat 6720
taaccctcac taaagggaac aaaagctgga gctgcaagct taatgtagtc ttatgcaata 6780
ctcttgtagt cttgcaacat ggtaacgatg agttagcaac atgccttaca aggagagaaa 6840
aagcaccgtg catgccgatt ggtggaagta aggtggtacg atcgtgcctt attaggaagg 6900
caacagacgg gtctgacatg gattggacga accactgaat tgccgcattg cagagatatt 6960
gtatttaagt gcctagctcg atacataaac gggtctctct ggttagacca gatctgagcc 7020
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 7080
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 7140
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 7200
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 7260
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 7320
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 7380
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 7440
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 7500
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 7560
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 7620
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 7680
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 7740
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 7800
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 7860
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 7920
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 7980
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 8040
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 8100
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 8160
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 8220
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 8280
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 8340
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 8400
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 8460
ccgaggggac ccttgcgcct tttccaaggc agccctgggt ttgcgcaggg acgcggctgc 8520
tctgggcgtg gttccgggaa acgcagcggc gccgaccctg ggtctcgcac attcttcacg 8580
tccgttcgca gcgtcacccg gatcttcgcc gctacccttg tgggcccccc ggcgacgctt 8640
cctgctccgc ccctaagtcg ggaaggttcc ttgcggttcg cggcgtgccg gacgtgacaa 8700
acggaagccg cacgtctcac tagtaccctc gcagacggac agcgccaggg agcaatggca 8760
gcgcgccgac cgcgatgggc tgtggccaat agcggctgct cagcagggcg cgccgagagc 8820
agcggccggg aaggggcggt gcgggaggcg gggtgtgggg cggtagtgtg ggccctgttc 8880
ctgcccgcgc ggtgttccgc attctgcaag cctccggagc gcacgtcggc agtcggctcc 8940
ctcgttgacc gaatcaccga cctctctccc cagggggtac ccagctgtct agagaattct 9000
agatcttgag acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg 9060
ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat 9120
tacaaaaaca aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc 9180
cactttggcg ccggctcgag gggg 9204
<210> 155
<211> 186
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 155
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu
35 40 45
Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val
50 55 60
Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu
65 70 75 80
Ser Val Glu Ala Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu
85 90 95
Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys
100 105 110
Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val
115 120 125
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu
130 135 140
Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr
145 150 155 160
Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu
165 170 175
Asp Leu Met Arg Val Asp Asn Leu Pro Asn
180 185
<210> 156
<211> 9108
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 156
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcggggcgg tgttcgccga gatcggcccg 780
cgcatggccg agttgagcgg ttcccggctg gccgcgcagc aacagatgga aggcctcctg 840
gcgccgcacc ggcccaagga gcccgcgtgg ttcctggcca ccgtcggcgt ctcgcccgac 900
caccagggca agggtctggg cagcgccgtc gtgctccccg gagtggaggc ggccgagcgc 960
gccggggtgc ccgccttcct ggagacctcc gcgccccgca acctcccctt ctacgagcgg 1020
ctcggcttca ccgtcaccgc cgacgtcgag gtgcccgaag gaccgcgcac ctggtgcatg 1080
acccgcaagc ccggtgcctg attaattaag aattcgaccc agctttcttg tacaaagtgg 1140
ttggtaagcc tatccctaac cctctcctcg gtctcgattc tacgtagtaa tgagctagca 1200
gtctcgaggt taacgaattc cgcccccccc ctaacgttac tggccgaagc cgcttggaat 1260
aaggccggtg tgcgcttgtc tatatgttat tttccaccat attgccgtct tttggcaatg 1320
tgagggcccg gaaacctggc cctgtcttct tgacgagcat tcctaggggt ctttcccctc 1380
tcgccaaagg aatgcaaggt ctgttgaatg tcgtgaagga agcagttcct ctggaagctt 1440
cttgaagaca aacaacgtct gtagcgaccc tttgcaggca gcggaacccc ccacctggcg 1500
acaggtgccc ctgcggccaa aagccacgtg tataagatac acctgcaaag gcggcacaac 1560
cccagtgcca cgttgtgagt tggatagttg tggaaagagt caaatggctc tcctcaagcg 1620
tattcaacaa ggggctgaag gatgcccaga aggtacccca ttgtatggga tctgatctgg 1680
ggcctcggtg cacatgcttt acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc 1740
cgaaccacgg ggacgtggtt ttcctttgaa aaacacgata ataccatggt gagcaagggc 1800
gaggaggata acatggccat catcaaggag ttcatgcgct tcaaggtgca catggagggc 1860
tccgtgaacg gccacgagtt cgagatcgag ggcgagggcg agggccgccc ctacgagggc 1920
acccagaccg ccaagctgaa ggtgaccaag ggtggccccc tgcccttcgc ctgggacatc 1980
ctgtcccctc agttcatgta cggctccaag gcctacgtga agcaccccgc cgacatcccc 2040
gactacttga agctgtcctt ccccgagggc ttcaagtggg agcgcgtgat gaacttcgag 2100
gacggcggcg tggtgaccgt gacccaggac tcctccctgc aggacggcga gttcatctac 2160
aaggtgaagc tgcgcggcac caacttcccc tccgacggcc ccgtaatgca gaagaagacc 2220
atgggctggg aggcctcctc cgagcggatg taccccgagg acggcgccct gaagggcgag 2280
atcaagcaga ggctgaagct gaaggacggc ggccactacg acgctgaggt caagaccacc 2340
tacaaggcca agaagcccgt gcagctgccc ggcgcctaca acgtcaacat caagttggac 2400
atcacctccc acaacgagga ctacaccatc gtggaacagt acgaacgcgc cgagggccgc 2460
cactccaccg gcggcatgga cgagctgtac aagtaacacc ggtggcgcgt taagtcgaca 2520
atcaacctct ggattacaaa atttgtgaaa gattgactgg tattcttaac tatgttgctc 2580
cttttacgct atgtggatac gctgctttaa tgcctttgta tcatgctatt gcttcccgta 2640
tggctttcat tttctcctcc ttgtataaat cctggttgct gtctctttat gaggagttgt 2700
ggcccgttgt caggcaacgt ggcgtggtgt gcactgtgtt tgctgacgca acccccactg 2760
gttggggcat tgccaccacc tgtcagctcc tttccgggac tttcgctttc cccctcccta 2820
ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt 2880
tgggcactga caattccgtg gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg 2940
cctgtgttgc cacctggatt ctgcgcggga cgtccttctg ctacgtccct tcggccctca 3000
atccagcgga ccttccttcc cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc 3060
gccttcgccc tcagacgagt cggatctccc tttgggccgc ctccccgcgt cgactttaag 3120
accaatgact tacaaggcag ctgtagatct tagccacttt ttaaaagaaa aggggggact 3180
ggaagggcta attcactccc aacgaagaca agatctgctt tttgcttgta ctgggtctct 3240
ctggttagac cagatctgag cctgggagct ctctggctaa ctagggaacc cactgcttaa 3300
gcctcaataa agcttgcctt gagtgcttca agtagtgtgt gcccgtctgt tgtgtgactc 3360
tggtaactag agatccctca gaccctttta gtcagtgtgg aaaatctcta gcagtacgta 3420
tagtagttca tgtcatctta ttattcagta tttataactt gcaaagaaat gaatatcaga 3480
gagtgagagg aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac 3540
aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat 3600
caatgtatct tatcatgtct ggctctagct atcccgcccc taactccgcc catcccgccc 3660
ctaactccgc ccagttccgc ccattctccg ccccatggct gactaatttt ttttatttat 3720
gcagaggccg aggccgcctc ggcctctgag ctattccaga agtagtgagg aggctttttt 3780
ggaggcctag ggacgtaccc aattcgccct atagtgagtc gtattacgcg cgctcactgg 3840
ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 3900
cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 3960
cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc ctgtagcggc gcattaagcg 4020
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 4080
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 4140
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 4200
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 4260
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 4320
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 4380
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 4440
ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 4500
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 4560
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 4620
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 4680
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 4740
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 4800
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 4860
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 4920
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 4980
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5040
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5100
cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5160
cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5220
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5280
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5340
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5400
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5460
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5520
cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 5580
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 5640
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 5700
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 5760
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 5820
cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 5880
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 5940
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6000
gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6060
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6120
tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6180
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6240
ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6300
taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6360
agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6420
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6480
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6540
ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga 6600
ccatgattac gccaagcgcg caattaaccc tcactaaagg gaacaaaagc tggagctgca 6660
agcttaatgt agtcttatgc aatactcttg tagtcttgca acatggtaac gatgagttag 6720
caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc gattggtgga agtaaggtgg 6780
tacgatcgtg ccttattagg aaggcaacag acgggtctga catggattgg acgaaccact 6840
gaattgccgc attgcagaga tattgtattt aagtgcctag ctcgatacat aaacgggtct 6900
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 6960
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 7020
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 7080
gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 7140
ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 7200
ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 7260
ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 7320
aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 7380
tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 7440
caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 7500
aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 7560
aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 7620
agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 7680
gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 7740
ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 7800
acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 7860
ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 7920
ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 7980
tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 8040
aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 8100
aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 8160
aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 8220
acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 8280
agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 8340
tcgtttcaga cccacctccc aaccccgagg ggacccttgc gccttttcca aggcagccct 8400
gggtttgcgc agggacgcgg ctgctctggg cgtggttccg ggaaacgcag cggcgccgac 8460
cctgggtctc gcacattctt cacgtccgtt cgcagcgtca cccggatctt cgccgctacc 8520
cttgtgggcc ccccggcgac gcttcctgct ccgcccctaa gtcgggaagg ttccttgcgg 8580
ttcgcggcgt gccggacgtg acaaacggaa gccgcacgtc tcactagtac cctcgcagac 8640
ggacagcgcc agggagcaat ggcagcgcgc cgaccgcgat gggctgtggc caatagcggc 8700
tgctcagcag ggcgcgccga gagcagcggc cgggaagggg cggtgcggga ggcggggtgt 8760
ggggcggtag tgtgggccct gttcctgccc gcgcggtgtt ccgcattctg caagcctccg 8820
gagcgcacgt cggcagtcgg ctccctcgtt gaccgaatca ccgacctctc tccccagggg 8880
gtacccagct gtctagagaa ttctagatct tgagacaaat ggcagtattc atccacaatt 8940
ttaaaagaaa aggggggatt ggggggtaca gtgcagggga aagaatagta gacataatag 9000
caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 9060
tttattacag ggacagcaga gatccacttt ggcgccggct cgaggggg 9108
<210> 157
<211> 152
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 157
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met
35 40 45
Ala Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly
50 55 60
Leu Leu Ala Pro His Arg Pro Lys Glu Pro Ala Trp Phe Leu Ala Thr
65 70 75 80
Val Gly Val Ser Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val
85 90 95
Val Leu Pro Gly Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe
100 105 110
Leu Glu Thr Ser Ala Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly
115 120 125
Phe Thr Val Thr Ala Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp
130 135 140
Cys Met Thr Arg Lys Pro Gly Ala
145 150
<210> 158
<211> 9363
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 158
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gggggcggtg 900
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt cccggctggc cgcgcagcaa 960
cagatggaag gcctcctggc gccgcaccgg cccaaggagc ccgcgtggtt cctggccacc 1020
gtcggcgtct cgcccgacca ccagggcaag tgcctttcat acgagaccga gatcctgact 1080
gtcgagtacg gattgcttcc tatcggcaaa atcgtggaga agaggattga atgtaccgtc 1140
tattcagtcg ataataatgg gaacatctac acacagcccg tggctcaatg gcacgacaga 1200
ggagagcagg aagtttttga atactgtctc gaggacggat ccctcatccg cgctactaaa 1260
gatcataagt ttatgaccgt ggacggccag atgctgccaa ttgacgaaat ttttgaacga 1320
gagctggatc tgatgagagt cgacaacctt ccaaactgat taattaagaa ttcgacccag 1380
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 1440
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 1500
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 1560
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 1620
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 1680
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 1740
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 1800
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 1860
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 1920
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 1980
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2040
accatggcca tgagcgagct gattaaggag aacatgcaca tgaagctgta catggagggc 2100
accgtggaca accatcactt caagtgcaca tccgagggcg aaggcaagcc ctacgagggc 2160
acccagacca tgagaatcaa ggtggtcgag ggcggccctc tccccttcgc cttcgacatc 2220
ctggctacta gcttcctcta cggcagcaag accttcatca accacaccca gggcatcccc 2280
gacttcttca agcagtcctt ccctgagggc ttcacatggg agagagtcac cacatacgaa 2340
gacgggggcg tgctgaccgc tacccaggac accagcctcc aggacggctg cctcatctac 2400
aacgtcaaga tcagaggggt gaacttcaca tccaacggcc ctgtgatgca gaagaaaaca 2460
ctcggctggg aggccttcac cgagacgctg taccccgctg acggcggcct ggaaggcaga 2520
aacgacatgg ccctgaagct cgtgggcggg agccatctga tcgcaaacat caagaccaca 2580
tatagatcca agaaacccgc taagaacctc aagatgcctg gcgtctacta tgtggactac 2640
agactggaaa gaatcaagga ggccaacaac gagacctacg tcgagcagca cgaggtggca 2700
gtggccagat actgcgacct ccctagcaaa ctggggcaca agcttaatta acaccggtgg 2760
cgcgttaagt cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc 2820
ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg 2880
ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc 2940
tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg 3000
acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg 3060
ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga 3120
caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct 3180
ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg 3240
tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc 3300
ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc 3360
cgcgtcgact ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa 3420
agaaaagggg ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc 3480
ttgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg 3540
gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg 3600
tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat 3660
ctctagcagt acgtatagta gttcatgtca tcttattatt cagtatttat aacttgcaaa 3720
gaaatgaata tcagagagtg agaggaactt gtttattgca gcttataatg gttacaaata 3780
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 3840
tttgtccaaa ctcatcaatg tatcttatca tgtctggctc tagctatccc gcccctaact 3900
ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta 3960
atttttttta tttatgcaga ggccgaggcc gcctcggcct ctgagctatt ccagaagtag 4020
tgaggaggct tttttggagg cctagggacg tacccaattc gccctatagt gagtcgtatt 4080
acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 4140
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 4200
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta 4260
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 4320
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 4380
ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc 4440
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 4500
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4560
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc 4620
cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta 4680
acaaaatatt aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc 4740
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 4800
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 4860
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 4920
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 4980
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5040
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5100
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5160
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 5220
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 5280
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 5340
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 5400
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 5460
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 5520
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 5580
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 5640
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 5700
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 5760
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 5820
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 5880
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 5940
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 6000
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 6060
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 6120
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 6180
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 6240
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 6300
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 6360
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 6420
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 6480
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 6540
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 6600
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 6660
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 6720
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 6780
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 6840
aggaaacagc tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca 6900
aaagctggag ctgcaagctt aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg 6960
gtaacgatga gttagcaaca tgccttacaa ggagagaaaa agcaccgtgc atgccgattg 7020
gtggaagtaa ggtggtacga tcgtgcctta ttaggaaggc aacagacggg tctgacatgg 7080
attggacgaa ccactgaatt gccgcattgc agagatattg tatttaagtg cctagctcga 7140
tacataaacg ggtctctctg gttagaccag atctgagcct gggagctctc tggctaacta 7200
gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc 7260
cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa 7320
atctctagca gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct 7380
ctcgacgcag gactcggctt gctgaagcgc gcacggcaag aggcgagggg cggcgactgg 7440
tgagtacgcc aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg 7500
tcagtattaa gcgggggaga attagatcgc gatgggaaaa aattcggtta aggccagggg 7560
gaaagaaaaa atataaatta aaacatatag tatgggcaag cagggagcta gaacgattcg 7620
cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg ggacagctac 7680
aaccatccct tcagacagga tcagaagaac ttagatcatt atataataca gtagcaaccc 7740
tctattgtgt gcatcaaagg atagagataa aagacaccaa ggaagcttta gacaagatag 7800
aggaagagca aaacaaaagt aagaccaccg cacagcaagc ggccgctgat cttcagacct 7860
ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa 7920
attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa 7980
agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg 8040
ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag 8100
cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc 8160
tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa 8220
cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg 8280
aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag 8340
tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa 8400
aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg 8460
aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga 8520
ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag 8580
ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc cttgcgcctt 8640
ttccaaggca gccctgggtt tgcgcaggga cgcggctgct ctgggcgtgg ttccgggaaa 8700
cgcagcggcg ccgaccctgg gtctcgcaca ttcttcacgt ccgttcgcag cgtcacccgg 8760
atcttcgccg ctacccttgt gggccccccg gcgacgcttc ctgctccgcc cctaagtcgg 8820
gaaggttcct tgcggttcgc ggcgtgccgg acgtgacaaa cggaagccgc acgtctcact 8880
agtaccctcg cagacggaca gcgccaggga gcaatggcag cgcgccgacc gcgatgggct 8940
gtggccaata gcggctgctc agcagggcgc gccgagagca gcggccggga aggggcggtg 9000
cgggaggcgg ggtgtggggc ggtagtgtgg gccctgttcc tgcccgcgcg gtgttccgca 9060
ttctgcaagc ctccggagcg cacgtcggca gtcggctccc tcgttgaccg aatcaccgac 9120
ctctctcccc agggggtacc cagctgtcta gagaattcta gatcttgaga caaatggcag 9180
tattcatcca caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa 9240
tagtagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa attacaaaaa 9300
ttcaaaattt tcgggtttat tacagggaca gcagagatcc actttggcgc cggctcgagg 9360
ggg 9363
<210> 159
<211> 239
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 159
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu
35 40 45
Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val
50 55 60
Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu
65 70 75 80
Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met Ala
85 90 95
Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly Leu
100 105 110
Leu Ala Pro His Arg Pro Lys Glu Pro Ala Trp Phe Leu Ala Thr Val
115 120 125
Gly Val Ser Pro Asp His Gln Gly Lys Cys Leu Ser Tyr Glu Thr Glu
130 135 140
Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu
145 150 155 160
Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile
165 170 175
Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val
180 185 190
Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp
195 200 205
His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile
210 215 220
Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
225 230 235
<210> 160
<211> 8949
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 160
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcggtctgg gcagcgccgt cgtgctcccc 780
ggagtggagg cggccgagcg cgccggggtg cccgccttcc tggagacctc cgcgccccgc 840
aacctcccct tctacgagcg gctcggcttc accgtcaccg ccgacgtcga ggtgcccgaa 900
ggaccgcgca cctggtgcat gacccgcaag cccggtgcct gattaattaa gaattcgacc 960
cagctttctt gtacaaagtg gttggtaagc ctatccctaa ccctctcctc ggtctcgatt 1020
ctacgtagta atgagctagc agtctcgagg ttaacgaatt ccgccccccc cctaacgtta 1080
ctggccgaag ccgcttggaa taaggccggt gtgcgcttgt ctatatgtta ttttccacca 1140
tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 1200
ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 1260
aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 1320
agcggaaccc cccacctggc gacaggtgcc cctgcggcca aaagccacgt gtataagata 1380
cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 1440
tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 1500
attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 1560
taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat 1620
aataccatgg tgagcaaggg cgaggaggat aacatggcca tcatcaagga gttcatgcgc 1680
ttcaaggtgc acatggaggg ctccgtgaac ggccacgagt tcgagatcga gggcgagggc 1740
gagggccgcc cctacgaggg cacccagacc gccaagctga aggtgaccaa gggtggcccc 1800
ctgcccttcg cctgggacat cctgtcccct cagttcatgt acggctccaa ggcctacgtg 1860
aagcaccccg ccgacatccc cgactacttg aagctgtcct tccccgaggg cttcaagtgg 1920
gagcgcgtga tgaacttcga ggacggcggc gtggtgaccg tgacccagga ctcctccctg 1980
caggacggcg agttcatcta caaggtgaag ctgcgcggca ccaacttccc ctccgacggc 2040
cccgtaatgc agaagaagac catgggctgg gaggcctcct ccgagcggat gtaccccgag 2100
gacggcgccc tgaagggcga gatcaagcag aggctgaagc tgaaggacgg cggccactac 2160
gacgctgagg tcaagaccac ctacaaggcc aagaagcccg tgcagctgcc cggcgcctac 2220
aacgtcaaca tcaagttgga catcacctcc cacaacgagg actacaccat cgtggaacag 2280
tacgaacgcg ccgagggccg ccactccacc ggcggcatgg acgagctgta caagtaacac 2340
cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 2400
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 2460
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 2520
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 2580
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 2640
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 2700
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 2760
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 2820
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 2880
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 2940
cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt 3000
tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac aagatctgct 3060
ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta 3120
actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg 3180
tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg 3240
gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt atttataact 3300
tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt ataatggtta 3360
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 3420
ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc tatcccgccc 3480
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 3540
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 3600
aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc tatagtgagt 3660
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 3720
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 3780
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc 3840
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 3900
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 3960
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 4020
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 4080
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 4140
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 4200
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 4260
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa atgtgcgcgg 4320
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 4380
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 4440
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 4500
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 4560
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 4620
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 4680
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 4740
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 4800
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 4860
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 4920
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac 4980
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga 5040
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg 5100
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact 5160
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 5220
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta 5280
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt 5340
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga 5400
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc 5460
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 5520
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc 5580
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc 5640
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg 5700
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg 5760
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 5820
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc 5880
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg 5940
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg 6000
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt 6060
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 6120
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg 6180
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc 6240
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 6300
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 6360
ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 6420
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc ctcactaaag 6480
ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt gtagtcttgc 6540
aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc 6600
cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg 6660
acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt taagtgccta 6720
gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga gctctctggc 6780
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 6840
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 6900
tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg aaaccagagg 6960
agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 7020
gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 7080
agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt cggttaaggc 7140
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 7200
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 7260
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 7320
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 7380
agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc gctgatcttc 7440
agacctggag gaggagatat gagggacaat tggagaagtg aattatataa atataaagta 7500
gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga 7560
gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc 7620
actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt gtctggtata 7680
gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc 7740
acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag atacctaaag 7800
gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg 7860
ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca cacgacctgg 7920
atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa 7980
tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt 8040
ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata 8100
gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt 8160
aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag gggacccttg 8220
cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg gcgtggttcc 8280
gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt tcgcagcgtc 8340
acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc tccgccccta 8400
agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga agccgcacgt 8460
ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg ccgaccgcga 8520
tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg ccgggaaggg 8580
gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc cgcgcggtgt 8640
tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt tgaccgaatc 8700
accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc ttgagacaaa 8760
tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac agtgcagggg 8820
aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa aaacaaatta 8880
caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt tggcgccggc 8940
tcgaggggg 8949
<210> 161
<211> 99
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 161
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Gly Leu Gly Ser Ala Val Val Leu Pro Gly Val
35 40 45
Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe Leu Glu Thr Ser Ala
50 55 60
Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly Phe Thr Val Thr Ala
65 70 75 80
Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp Cys Met Thr Arg Lys
85 90 95
Pro Gly Ala
<210> 162
<211> 9426
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 162
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gggggcggtg 900
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt cccggctggc cgcgcagcaa 960
cagatggaag gcctcctggc gccgcaccgg cccaaggagc ccgcgtggtt cctggccacc 1020
gtcggcgtct cgcccgacca ccagggcaag ggtctgggca gcgccgtcgt gctccccgga 1080
gtggaggcgg ccgagcgcgc cggggtgccc gcctgccttt catacgagac cgagatcctg 1140
actgtcgagt acggattgct tcctatcggc aaaatcgtgg agaagaggat tgaatgtacc 1200
gtctattcag tcgataataa tgggaacatc tacacacagc ccgtggctca atggcacgac 1260
agaggagagc aggaagtttt tgaatactgt ctcgaggacg gatccctcat ccgcgctact 1320
aaagatcata agtttatgac cgtggacggc cagatgctgc caattgacga aatttttgaa 1380
cgagagctgg atctgatgag agtcgacaac cttccaaact gattaattaa gaattcgacc 1440
cagctttctt gtacaaagtg gttggtaagc ctatccctaa ccctctcctc ggtctcgatt 1500
ctacgtagta atgagctagc agtctcgagg ttaacgaatt ccgccccccc cctaacgtta 1560
ctggccgaag ccgcttggaa taaggccggt gtgcgcttgt ctatatgtta ttttccacca 1620
tattgccgtc ttttggcaat gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca 1680
ttcctagggg tctttcccct ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg 1740
aagcagttcc tctggaagct tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc 1800
agcggaaccc cccacctggc gacaggtgcc cctgcggcca aaagccacgt gtataagata 1860
cacctgcaaa ggcggcacaa ccccagtgcc acgttgtgag ttggatagtt gtggaaagag 1920
tcaaatggct ctcctcaagc gtattcaaca aggggctgaa ggatgcccag aaggtacccc 1980
attgtatggg atctgatctg gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt 2040
taaaaaaacg tctaggcccc ccgaaccacg gggacgtggt tttcctttga aaaacacgat 2100
aataccatgg ccatgagcga gctgattaag gagaacatgc acatgaagct gtacatggag 2160
ggcaccgtgg acaaccatca cttcaagtgc acatccgagg gcgaaggcaa gccctacgag 2220
ggcacccaga ccatgagaat caaggtggtc gagggcggcc ctctcccctt cgccttcgac 2280
atcctggcta ctagcttcct ctacggcagc aagaccttca tcaaccacac ccagggcatc 2340
cccgacttct tcaagcagtc cttccctgag ggcttcacat gggagagagt caccacatac 2400
gaagacgggg gcgtgctgac cgctacccag gacaccagcc tccaggacgg ctgcctcatc 2460
tacaacgtca agatcagagg ggtgaacttc acatccaacg gccctgtgat gcagaagaaa 2520
acactcggct gggaggcctt caccgagacg ctgtaccccg ctgacggcgg cctggaaggc 2580
agaaacgaca tggccctgaa gctcgtgggc gggagccatc tgatcgcaaa catcaagacc 2640
acatatagat ccaagaaacc cgctaagaac ctcaagatgc ctggcgtcta ctatgtggac 2700
tacagactgg aaagaatcaa ggaggccaac aacgagacct acgtcgagca gcacgaggtg 2760
gcagtggcca gatactgcga cctccctagc aaactggggc acaagcttaa ttaacaccgg 2820
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 2880
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 2940
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 3000
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 3060
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 3120
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 3180
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 3240
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 3300
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 3360
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 3420
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 3480
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 3540
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 3600
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 3660
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 3720
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 3780
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 3840
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3900
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 3960
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 4020
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 4080
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 4140
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 4200
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 4260
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 4320
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 4380
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 4440
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 4500
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 4560
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4620
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4680
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4740
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 4800
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 4860
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 4920
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 4980
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 5040
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 5100
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 5160
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 5220
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 5280
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 5340
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 5400
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 5460
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 5520
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 5580
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 5640
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 5700
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 5760
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 5820
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 5880
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 5940
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 6000
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 6060
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 6120
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 6180
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 6240
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 6300
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 6360
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 6420
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 6480
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 6540
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 6600
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 6660
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 6720
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 6780
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 6840
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 6900
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 6960
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 7020
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 7080
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 7140
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 7200
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 7260
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 7320
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 7380
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 7440
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 7500
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 7560
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 7620
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 7680
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 7740
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 7800
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 7860
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 7920
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 7980
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 8040
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 8100
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 8160
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 8220
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 8280
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 8340
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 8400
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 8460
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 8520
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 8580
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 8640
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 8700
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 8760
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 8820
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 8880
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 8940
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 9000
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 9060
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 9120
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 9180
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 9240
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 9300
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 9360
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 9420
aggggg 9426
<210> 163
<211> 260
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 163
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu
35 40 45
Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val
50 55 60
Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu
65 70 75 80
Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met Ala
85 90 95
Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly Leu
100 105 110
Leu Ala Pro His Arg Pro Lys Glu Pro Ala Trp Phe Leu Ala Thr Val
115 120 125
Gly Val Ser Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val Val
130 135 140
Leu Pro Gly Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Cys Leu
145 150 155 160
Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile
165 170 175
Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp
180 185 190
Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg
195 200 205
Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile
210 215 220
Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu
225 230 235 240
Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp
245 250 255
Asn Leu Pro Asn
260
<210> 164
<211> 8886
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 164
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcttcctgg agacctccgc gccccgcaac 780
ctccccttct acgagcggct cggcttcacc gtcaccgccg acgtcgaggt gcccgaagga 840
ccgcgcacct ggtgcatgac ccgcaagccc ggtgcctgat taattaagaa ttcgacccag 900
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 960
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 1020
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 1080
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 1140
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 1200
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 1260
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 1320
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 1380
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 1440
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 1500
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 1560
accatggtga gcaagggcga ggaggataac atggccatca tcaaggagtt catgcgcttc 1620
aaggtgcaca tggagggctc cgtgaacggc cacgagttcg agatcgaggg cgagggcgag 1680
ggccgcccct acgagggcac ccagaccgcc aagctgaagg tgaccaaggg tggccccctg 1740
cccttcgcct gggacatcct gtcccctcag ttcatgtacg gctccaaggc ctacgtgaag 1800
caccccgccg acatccccga ctacttgaag ctgtccttcc ccgagggctt caagtgggag 1860
cgcgtgatga acttcgagga cggcggcgtg gtgaccgtga cccaggactc ctccctgcag 1920
gacggcgagt tcatctacaa ggtgaagctg cgcggcacca acttcccctc cgacggcccc 1980
gtaatgcaga agaagaccat gggctgggag gcctcctccg agcggatgta ccccgaggac 2040
ggcgccctga agggcgagat caagcagagg ctgaagctga aggacggcgg ccactacgac 2100
gctgaggtca agaccaccta caaggccaag aagcccgtgc agctgcccgg cgcctacaac 2160
gtcaacatca agttggacat cacctcccac aacgaggact acaccatcgt ggaacagtac 2220
gaacgcgccg agggccgcca ctccaccggc ggcatggacg agctgtacaa gtaacaccgg 2280
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 2340
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 2400
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 2460
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 2520
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 2580
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 2640
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 2700
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 2760
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 2820
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 2880
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 2940
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 3000
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 3060
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 3120
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 3180
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 3240
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 3300
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3360
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 3420
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 3480
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 3540
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 3600
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3660
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 3720
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 3780
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 3840
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 3900
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 3960
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 4020
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4080
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4140
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4200
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 4260
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 4320
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 4380
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 4440
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 4500
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 4560
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 4620
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 4680
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 4740
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 4800
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 4860
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 4920
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 4980
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 5040
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 5100
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 5160
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 5220
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 5280
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 5340
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 5400
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 5460
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 5520
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 5580
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 5640
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 5700
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 5760
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 5820
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 5880
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 5940
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 6000
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 6060
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 6120
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 6180
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 6240
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 6300
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 6360
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 6420
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 6480
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 6540
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 6600
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 6660
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 6720
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 6780
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 6840
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 6900
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 6960
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 7020
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 7080
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 7140
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 7200
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 7260
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 7320
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 7380
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 7440
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 7500
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 7560
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 7620
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 7680
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 7740
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 7800
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 7860
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 7920
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 7980
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 8040
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 8100
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 8160
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 8220
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 8280
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 8340
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 8400
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 8460
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 8520
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 8580
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 8640
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 8700
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 8760
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 8820
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 8880
aggggg 8886
<210> 165
<211> 78
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 165
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Phe Leu Glu Thr Ser Ala Pro Arg Asn Leu Pro
35 40 45
Phe Tyr Glu Arg Leu Gly Phe Thr Val Thr Ala Asp Val Glu Val Pro
50 55 60
Glu Gly Pro Arg Thr Trp Cys Met Thr Arg Lys Pro Gly Ala
65 70 75
<210> 166
<211> 9492
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 166
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gggggcggtg 900
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt cccggctggc cgcgcagcaa 960
cagatggaag gcctcctggc gccgcaccgg cccaaggagc ccgcgtggtt cctggccacc 1020
gtcggcgtct cgcccgacca ccagggcaag ggtctgggca gcgccgtcgt gctccccgga 1080
gtggaggcgg ccgagcgcgc cggggtgccc gccttcctgg agacctccgc gccccgcaac 1140
ctccccttct acgagcggct cggcttcacc gtcaccgcct gcctttcata cgagaccgag 1200
atcctgactg tcgagtacgg attgcttcct atcggcaaaa tcgtggagaa gaggattgaa 1260
tgtaccgtct attcagtcga taataatggg aacatctaca cacagcccgt ggctcaatgg 1320
cacgacagag gagagcagga agtttttgaa tactgtctcg aggacggatc cctcatccgc 1380
gctactaaag atcataagtt tatgaccgtg gacggccaga tgctgccaat tgacgaaatt 1440
tttgaacgag agctggatct gatgagagtc gacaaccttc caaactgatt aattaagaat 1500
tcgacccagc tttcttgtac aaagtggttg gtaagcctat ccctaaccct ctcctcggtc 1560
tcgattctac gtagtaatga gctagcagtc tcgaggttaa cgaattccgc ccccccccta 1620
acgttactgg ccgaagccgc ttggaataag gccggtgtgc gcttgtctat atgttatttt 1680
ccaccatatt gccgtctttt ggcaatgtga gggcccggaa acctggccct gtcttcttga 1740
cgagcattcc taggggtctt tcccctctcg ccaaaggaat gcaaggtctg ttgaatgtcg 1800
tgaaggaagc agttcctctg gaagcttctt gaagacaaac aacgtctgta gcgacccttt 1860
gcaggcagcg gaacccccca cctggcgaca ggtgcccctg cggccaaaag ccacgtgtat 1920
aagatacacc tgcaaaggcg gcacaacccc agtgccacgt tgtgagttgg atagttgtgg 1980
aaagagtcaa atggctctcc tcaagcgtat tcaacaaggg gctgaaggat gcccagaagg 2040
taccccattg tatgggatct gatctggggc ctcggtgcac atgctttaca tgtgtttagt 2100
cgaggttaaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 2160
cacgataata ccatggccat gagcgagctg attaaggaga acatgcacat gaagctgtac 2220
atggagggca ccgtggacaa ccatcacttc aagtgcacat ccgagggcga aggcaagccc 2280
tacgagggca cccagaccat gagaatcaag gtggtcgagg gcggccctct ccccttcgcc 2340
ttcgacatcc tggctactag cttcctctac ggcagcaaga ccttcatcaa ccacacccag 2400
ggcatccccg acttcttcaa gcagtccttc cctgagggct tcacatggga gagagtcacc 2460
acatacgaag acgggggcgt gctgaccgct acccaggaca ccagcctcca ggacggctgc 2520
ctcatctaca acgtcaagat cagaggggtg aacttcacat ccaacggccc tgtgatgcag 2580
aagaaaacac tcggctggga ggccttcacc gagacgctgt accccgctga cggcggcctg 2640
gaaggcagaa acgacatggc cctgaagctc gtgggcggga gccatctgat cgcaaacatc 2700
aagaccacat atagatccaa gaaacccgct aagaacctca agatgcctgg cgtctactat 2760
gtggactaca gactggaaag aatcaaggag gccaacaacg agacctacgt cgagcagcac 2820
gaggtggcag tggccagata ctgcgacctc cctagcaaac tggggcacaa gcttaattaa 2880
caccggtggc gcgttaagtc gacaatcaac ctctggatta caaaatttgt gaaagattga 2940
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 3000
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 3060
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3120
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3180
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3240
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3300
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3360
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3420
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3480
ccgcctcccc gcgtcgactt taagaccaat gacttacaag gcagctgtag atcttagcca 3540
ctttttaaaa gaaaaggggg gactggaagg gctaattcac tcccaacgaa gacaagatct 3600
gctttttgct tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg 3660
ctaactaggg aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt 3720
gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt 3780
gtggaaaatc tctagcagta cgtatagtag ttcatgtcat cttattattc agtatttata 3840
acttgcaaag aaatgaatat cagagagtga gaggaacttg tttattgcag cttataatgg 3900
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 3960
tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctggctct agctatcccg 4020
cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat 4080
ggctgactaa ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc 4140
cagaagtagt gaggaggctt ttttggaggc ctagggacgt acccaattcg ccctatagtg 4200
agtcgtatta cgcgcgctca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 4260
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 4320
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatgggacg 4380
cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4440
cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4500
tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4560
ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat 4620
cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4680
tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag 4740
ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4800
cgaattttaa caaaatatta acgcttacaa tttaggtggc acttttcggg gaaatgtgcg 4860
cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4920
ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 4980
ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 5040
aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 5100
actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 5160
gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 5220
agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 5280
cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 5340
catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5400
aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5460
gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5520
aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5580
agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5640
ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5700
actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5760
aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5820
gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5880
atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 5940
tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 6000
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 6060
ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 6120
agcgcagata ccaaatactg ttcttctagt gtagccgtag ttaggccacc acttcaagaa 6180
ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 6240
tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 6300
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6360
cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa 6420
ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6480
agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6540
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6600
ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6660
ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6720
ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6780
accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6840
ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6900
ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 6960
atttcacaca ggaaacagct atgaccatga ttacgccaag cgcgcaatta accctcacta 7020
aagggaacaa aagctggagc tgcaagctta atgtagtctt atgcaatact cttgtagtct 7080
tgcaacatgg taacgatgag ttagcaacat gccttacaag gagagaaaaa gcaccgtgca 7140
tgccgattgg tggaagtaag gtggtacgat cgtgccttat taggaaggca acagacgggt 7200
ctgacatgga ttggacgaac cactgaattg ccgcattgca gagatattgt atttaagtgc 7260
ctagctcgat acataaacgg gtctctctgg ttagaccaga tctgagcctg ggagctctct 7320
ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt gcttcaagta 7380
gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc cttttagtca 7440
gtgtggaaaa tctctagcag tggcgcccga acagggactt gaaagcgaaa gggaaaccag 7500
aggagctctc tcgacgcagg actcggcttg ctgaagcgcg cacggcaaga ggcgaggggc 7560
ggcgactggt gagtacgcca aaaattttga ctagcggagg ctagaaggag agagatgggt 7620
gcgagagcgt cagtattaag cgggggagaa ttagatcgcg atgggaaaaa attcggttaa 7680
ggccaggggg aaagaaaaaa tataaattaa aacatatagt atgggcaagc agggagctag 7740
aacgattcgc agttaatcct ggcctgttag aaacatcaga aggctgtaga caaatactgg 7800
gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacag 7860
tagcaaccct ctattgtgtg catcaaagga tagagataaa agacaccaag gaagctttag 7920
acaagataga ggaagagcaa aacaaaagta agaccaccgc acagcaagcg gccgctgatc 7980
ttcagacctg gaggaggaga tatgagggac aattggagaa gtgaattata taaatataaa 8040
gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg caaagagaag agtggtgcag 8100
agagaaaaaa gagcagtggg aataggagct ttgttccttg ggttcttggg agcagcagga 8160
agcactatgg gcgcagcgtc aatgacgctg acggtacagg ccagacaatt attgtctggt 8220
atagtgcagc agcagaacaa tttgctgagg gctattgagg cgcaacagca tctgttgcaa 8280
ctcacagtct ggggcatcaa gcagctccag gcaagaatcc tggctgtgga aagataccta 8340
aaggatcaac agctcctggg gatttggggt tgctctggaa aactcatttg caccactgct 8400
gtgccttgga atgctagttg gagtaataaa tctctggaac agatttggaa tcacacgacc 8460
tggatggagt gggacagaga aattaacaat tacacaagct taatacactc cttaattgaa 8520
gaatcgcaaa accagcaaga aaagaatgaa caagaattat tggaattaga taaatgggca 8580
agtttgtgga attggtttaa cataacaaat tggctgtggt atataaaatt attcataatg 8640
atagtaggag gcttggtagg tttaagaata gtttttgctg tactttctat agtgaataga 8700
gttaggcagg gatattcacc attatcgttt cagacccacc tcccaacccc gaggggaccc 8760
ttgcgccttt tccaaggcag ccctgggttt gcgcagggac gcggctgctc tgggcgtggt 8820
tccgggaaac gcagcggcgc cgaccctggg tctcgcacat tcttcacgtc cgttcgcagc 8880
gtcacccgga tcttcgccgc tacccttgtg ggccccccgg cgacgcttcc tgctccgccc 8940
ctaagtcggg aaggttcctt gcggttcgcg gcgtgccgga cgtgacaaac ggaagccgca 9000
cgtctcacta gtaccctcgc agacggacag cgccagggag caatggcagc gcgccgaccg 9060
cgatgggctg tggccaatag cggctgctca gcagggcgcg ccgagagcag cggccgggaa 9120
ggggcggtgc gggaggcggg gtgtggggcg gtagtgtggg ccctgttcct gcccgcgcgg 9180
tgttccgcat tctgcaagcc tccggagcgc acgtcggcag tcggctccct cgttgaccga 9240
atcaccgacc tctctcccca gggggtaccc agctgtctag agaattctag atcttgagac 9300
aaatggcagt attcatccac aattttaaaa gaaaaggggg gattgggggg tacagtgcag 9360
gggaaagaat agtagacata atagcaacag acatacaaac taaagaatta caaaaacaaa 9420
ttacaaaaat tcaaaatttt cgggtttatt acagggacag cagagatcca ctttggcgcc 9480
ggctcgaggg gg 9492
<210> 167
<211> 282
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 167
Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val
1 5 10 15
Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala
20 25 30
Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu
35 40 45
Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val
50 55 60
Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu
65 70 75 80
Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met Ala
85 90 95
Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly Leu
100 105 110
Leu Ala Pro His Arg Pro Lys Glu Pro Ala Trp Phe Leu Ala Thr Val
115 120 125
Gly Val Ser Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val Val
130 135 140
Leu Pro Gly Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe Leu
145 150 155 160
Glu Thr Ser Ala Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly Phe
165 170 175
Thr Val Thr Ala Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu
180 185 190
Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys
195 200 205
Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val
210 215 220
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu
225 230 235 240
Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr
245 250 255
Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu
260 265 270
Asp Leu Met Arg Val Asp Asn Leu Pro Asn
275 280
<210> 168
<211> 8820
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 168
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgacgtcg aggtgcccga aggaccgcgc 780
acctggtgca tgacccgcaa gcccggtgcc tgattaatta agaattcgac ccagctttct 840
tgtacaaagt ggttggtaag cctatcccta accctctcct cggtctcgat tctacgtagt 900
aatgagctag cagtctcgag gttaacgaat tccgcccccc ccctaacgtt actggccgaa 960
gccgcttgga ataaggccgg tgtgcgcttg tctatatgtt attttccacc atattgccgt 1020
cttttggcaa tgtgagggcc cggaaacctg gccctgtctt cttgacgagc attcctaggg 1080
gtctttcccc tctcgccaaa ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc 1140
ctctggaagc ttcttgaaga caaacaacgt ctgtagcgac cctttgcagg cagcggaacc 1200
ccccacctgg cgacaggtgc ccctgcggcc aaaagccacg tgtataagat acacctgcaa 1260
aggcggcaca accccagtgc cacgttgtga gttggatagt tgtggaaaga gtcaaatggc 1320
tctcctcaag cgtattcaac aaggggctga aggatgccca gaaggtaccc cattgtatgg 1380
gatctgatct ggggcctcgg tgcacatgct ttacatgtgt ttagtcgagg ttaaaaaaac 1440
gtctaggccc cccgaaccac ggggacgtgg ttttcctttg aaaaacacga taataccatg 1500
gtgagcaagg gcgaggagga taacatggcc atcatcaagg agttcatgcg cttcaaggtg 1560
cacatggagg gctccgtgaa cggccacgag ttcgagatcg agggcgaggg cgagggccgc 1620
ccctacgagg gcacccagac cgccaagctg aaggtgacca agggtggccc cctgcccttc 1680
gcctgggaca tcctgtcccc tcagttcatg tacggctcca aggcctacgt gaagcacccc 1740
gccgacatcc ccgactactt gaagctgtcc ttccccgagg gcttcaagtg ggagcgcgtg 1800
atgaacttcg aggacggcgg cgtggtgacc gtgacccagg actcctccct gcaggacggc 1860
gagttcatct acaaggtgaa gctgcgcggc accaacttcc cctccgacgg ccccgtaatg 1920
cagaagaaga ccatgggctg ggaggcctcc tccgagcgga tgtaccccga ggacggcgcc 1980
ctgaagggcg agatcaagca gaggctgaag ctgaaggacg gcggccacta cgacgctgag 2040
gtcaagacca cctacaaggc caagaagccc gtgcagctgc ccggcgccta caacgtcaac 2100
atcaagttgg acatcacctc ccacaacgag gactacacca tcgtggaaca gtacgaacgc 2160
gccgagggcc gccactccac cggcggcatg gacgagctgt acaagtaaca ccggtggcgc 2220
gttaagtcga caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta 2280
actatgttgc tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta 2340
ttgcttcccg tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt 2400
atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg 2460
caacccccac tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt 2520
tccccctccc tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag 2580
gggctcggct gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc 2640
cttggctgct cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc 2700
cttcggccct caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc 2760
ttccgcgtct tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc 2820
gtcgacttta agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga 2880
aaagggggga ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg 2940
tactgggtct ctctggttag accagatctg agcctgggag ctctctggct aactagggaa 3000
cccactgctt aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct 3060
gttgtgtgac tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc 3120
tagcagtacg tatagtagtt catgtcatct tattattcag tatttataac ttgcaaagaa 3180
atgaatatca gagagtgaga ggaacttgtt tattgcagct tataatggtt acaaataaag 3240
caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt 3300
gtccaaactc atcaatgtat cttatcatgt ctggctctag ctatcccgcc cctaactccg 3360
cccatcccgc ccctaactcc gcccagttcc gcccattctc cgccccatgg ctgactaatt 3420
ttttttattt atgcagaggc cgaggccgcc tcggcctctg agctattcca gaagtagtga 3480
ggaggctttt ttggaggcct agggacgtac ccaattcgcc ctatagtgag tcgtattacg 3540
cgcgctcact ggccgtcgtt ttacaacgtc gtgactggga aaaccctggc gttacccaac 3600
ttaatcgcct tgcagcacat ccccctttcg ccagctggcg taatagcgaa gaggcccgca 3660
ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga atgggacgcg ccctgtagcg 3720
gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg 3780
ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc 3840
cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc 3900
tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga 3960
cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa 4020
ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg attttgccga 4080
tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca 4140
aaatattaac gcttacaatt taggtggcac ttttcgggga aatgtgcgcg gaacccctat 4200
ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata 4260
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 4320
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 4380
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 4440
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 4500
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 4560
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 4620
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 4680
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 4740
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 4800
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 4860
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 4920
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 4980
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 5040
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 5100
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 5160
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 5220
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 5280
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 5340
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 5400
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 5460
aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 5520
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 5580
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 5640
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 5700
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 5760
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 5820
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 5880
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 5940
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 6000
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 6060
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 6120
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 6180
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 6240
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 6300
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 6360
gctggagctg caagcttaat gtagtcttat gcaatactct tgtagtcttg caacatggta 6420
acgatgagtt agcaacatgc cttacaagga gagaaaaagc accgtgcatg ccgattggtg 6480
gaagtaaggt ggtacgatcg tgccttatta ggaaggcaac agacgggtct gacatggatt 6540
ggacgaacca ctgaattgcc gcattgcaga gatattgtat ttaagtgcct agctcgatac 6600
ataaacgggt ctctctggtt agaccagatc tgagcctggg agctctctgg ctaactaggg 6660
aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt 6720
ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt gtggaaaatc 6780
tctagcagtg gcgcccgaac agggacttga aagcgaaagg gaaaccagag gagctctctc 6840
gacgcaggac tcggcttgct gaagcgcgca cggcaagagg cgaggggcgg cgactggtga 6900
gtacgccaaa aattttgact agcggaggct agaaggagag agatgggtgc gagagcgtca 6960
gtattaagcg ggggagaatt agatcgcgat gggaaaaaat tcggttaagg ccagggggaa 7020
agaaaaaata taaattaaaa catatagtat gggcaagcag ggagctagaa cgattcgcag 7080
ttaatcctgg cctgttagaa acatcagaag gctgtagaca aatactggga cagctacaac 7140
catcccttca gacaggatca gaagaactta gatcattata taatacagta gcaaccctct 7200
attgtgtgca tcaaaggata gagataaaag acaccaagga agctttagac aagatagagg 7260
aagagcaaaa caaaagtaag accaccgcac agcaagcggc cgctgatctt cagacctgga 7320
ggaggagata tgagggacaa ttggagaagt gaattatata aatataaagt agtaaaaatt 7380
gaaccattag gagtagcacc caccaaggca aagagaagag tggtgcagag agaaaaaaga 7440
gcagtgggaa taggagcttt gttccttggg ttcttgggag cagcaggaag cactatgggc 7500
gcagcgtcaa tgacgctgac ggtacaggcc agacaattat tgtctggtat agtgcagcag 7560
cagaacaatt tgctgagggc tattgaggcg caacagcatc tgttgcaact cacagtctgg 7620
ggcatcaagc agctccaggc aagaatcctg gctgtggaaa gatacctaaa ggatcaacag 7680
ctcctgggga tttggggttg ctctggaaaa ctcatttgca ccactgctgt gccttggaat 7740
gctagttgga gtaataaatc tctggaacag atttggaatc acacgacctg gatggagtgg 7800
gacagagaaa ttaacaatta cacaagctta atacactcct taattgaaga atcgcaaaac 7860
cagcaagaaa agaatgaaca agaattattg gaattagata aatgggcaag tttgtggaat 7920
tggtttaaca taacaaattg gctgtggtat ataaaattat tcataatgat agtaggaggc 7980
ttggtaggtt taagaatagt ttttgctgta ctttctatag tgaatagagt taggcaggga 8040
tattcaccat tatcgtttca gacccacctc ccaaccccga ggggaccctt gcgccttttc 8100
caaggcagcc ctgggtttgc gcagggacgc ggctgctctg ggcgtggttc cgggaaacgc 8160
agcggcgccg accctgggtc tcgcacattc ttcacgtccg ttcgcagcgt cacccggatc 8220
ttcgccgcta cccttgtggg ccccccggcg acgcttcctg ctccgcccct aagtcgggaa 8280
ggttccttgc ggttcgcggc gtgccggacg tgacaaacgg aagccgcacg tctcactagt 8340
accctcgcag acggacagcg ccagggagca atggcagcgc gccgaccgcg atgggctgtg 8400
gccaatagcg gctgctcagc agggcgcgcc gagagcagcg gccgggaagg ggcggtgcgg 8460
gaggcggggt gtggggcggt agtgtgggcc ctgttcctgc ccgcgcggtg ttccgcattc 8520
tgcaagcctc cggagcgcac gtcggcagtc ggctccctcg ttgaccgaat caccgacctc 8580
tctccccagg gggtacccag ctgtctagag aattctagat cttgagacaa atggcagtat 8640
tcatccacaa ttttaaaaga aaagggggga ttggggggta cagtgcaggg gaaagaatag 8700
tagacataat agcaacagac atacaaacta aagaattaca aaaacaaatt acaaaaattc 8760
aaaattttcg ggtttattac agggacagca gagatccact ttggcgccgg ctcgaggggg 8820
<210> 169
<211> 56
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 169
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp
35 40 45
Cys Met Thr Arg Lys Pro Gly Ala
50 55
<210> 170
<211> 9129
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 170
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgaaaac atttaacatt 660
tctcaacagg atctagaatt agtagaagta gcgacagaga agattacaat gctttatgag 720
gataataaac atcatgtggg agcggcaatt cgtacgaaaa caggagaaat catttcggca 780
gtacatattg aagcgtatat aggacgagta actgtttgcc tttcatacga gaccgagatc 840
ctgactgtcg agtacggatt gcttcctatc ggcaaaatcg tggagaagag gattgaatgt 900
accgtctatt cagtcgataa taatgggaac atctacacac agcccgtggc tcaatggcac 960
gacagaggag agcaggaagt ttttgaatac tgtctcgagg acggatccct catccgcgct 1020
actaaagatc ataagtttat gaccgtggac ggccagatgc tgccaattga cgaaattttt 1080
gaacgagagc tggatctgat gagagtcgac aaccttccaa actgattaat taagaattcg 1140
acccagcttt cttgtacaaa gtggttggta agcctatccc taaccctctc ctcggtctcg 1200
attctacgta gtaatgagct agcagtctcg aggttaacga attccgcccc ccccctaacg 1260
ttactggccg aagccgcttg gaataaggcc ggtgtgcgct tgtctatatg ttattttcca 1320
ccatattgcc gtcttttggc aatgtgaggg cccggaaacc tggccctgtc ttcttgacga 1380
gcattcctag gggtctttcc cctctcgcca aaggaatgca aggtctgttg aatgtcgtga 1440
aggaagcagt tcctctggaa gcttcttgaa gacaaacaac gtctgtagcg accctttgca 1500
ggcagcggaa ccccccacct ggcgacaggt gcccctgcgg ccaaaagcca cgtgtataag 1560
atacacctgc aaaggcggca caaccccagt gccacgttgt gagttggata gttgtggaaa 1620
gagtcaaatg gctctcctca agcgtattca acaaggggct gaaggatgcc cagaaggtac 1680
cccattgtat gggatctgat ctggggcctc ggtgcacatg ctttacatgt gtttagtcga 1740
ggttaaaaaa acgtctaggc cccccgaacc acggggacgt ggttttcctt tgaaaaacac 1800
gataatacca tggccatgag cgagctgatt aaggagaaca tgcacatgaa gctgtacatg 1860
gagggcaccg tggacaacca tcacttcaag tgcacatccg agggcgaagg caagccctac 1920
gagggcaccc agaccatgag aatcaaggtg gtcgagggcg gccctctccc cttcgccttc 1980
gacatcctgg ctactagctt cctctacggc agcaagacct tcatcaacca cacccagggc 2040
atccccgact tcttcaagca gtccttccct gagggcttca catgggagag agtcaccaca 2100
tacgaagacg ggggcgtgct gaccgctacc caggacacca gcctccagga cggctgcctc 2160
atctacaacg tcaagatcag aggggtgaac ttcacatcca acggccctgt gatgcagaag 2220
aaaacactcg gctgggaggc cttcaccgag acgctgtacc ccgctgacgg cggcctggaa 2280
ggcagaaacg acatggccct gaagctcgtg ggcgggagcc atctgatcgc aaacatcaag 2340
accacatata gatccaagaa acccgctaag aacctcaaga tgcctggcgt ctactatgtg 2400
gactacagac tggaaagaat caaggaggcc aacaacgaga cctacgtcga gcagcacgag 2460
gtggcagtgg ccagatactg cgacctccct agcaaactgg ggcacaagct taattaacac 2520
cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 2580
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 2640
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 2700
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 2760
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 2820
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 2880
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 2940
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 3000
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 3060
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 3120
cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt 3180
tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac aagatctgct 3240
ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta 3300
actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg 3360
tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg 3420
gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt atttataact 3480
tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt ataatggtta 3540
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 3600
ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc tatcccgccc 3660
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 3720
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 3780
aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc tatagtgagt 3840
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 3900
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 3960
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc 4020
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 4080
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 4140
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 4200
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 4260
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 4320
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 4380
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 4440
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa atgtgcgcgg 4500
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 4560
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 4620
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 4680
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 4740
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 4800
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 4860
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 4920
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 4980
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 5040
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 5100
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac 5160
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga 5220
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg 5280
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact 5340
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 5400
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta 5460
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt 5520
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga 5580
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc 5640
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 5700
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc 5760
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc 5820
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg 5880
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg 5940
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 6000
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc 6060
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg 6120
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg 6180
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt 6240
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 6300
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg 6360
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc 6420
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 6480
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 6540
ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 6600
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc ctcactaaag 6660
ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt gtagtcttgc 6720
aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc 6780
cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg 6840
acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt taagtgccta 6900
gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga gctctctggc 6960
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 7020
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 7080
tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg aaaccagagg 7140
agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 7200
gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 7260
agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt cggttaaggc 7320
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 7380
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 7440
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 7500
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 7560
agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc gctgatcttc 7620
agacctggag gaggagatat gagggacaat tggagaagtg aattatataa atataaagta 7680
gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga 7740
gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc 7800
actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt gtctggtata 7860
gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc 7920
acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag atacctaaag 7980
gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg 8040
ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca cacgacctgg 8100
atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa 8160
tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt 8220
ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata 8280
gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt 8340
aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag gggacccttg 8400
cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg gcgtggttcc 8460
gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt tcgcagcgtc 8520
acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc tccgccccta 8580
agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga agccgcacgt 8640
ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg ccgaccgcga 8700
tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg ccgggaaggg 8760
gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc cgcgcggtgt 8820
tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt tgaccgaatc 8880
accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc ttgagacaaa 8940
tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac agtgcagggg 9000
aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa aaacaaatta 9060
caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt tggcgccggc 9120
tcgaggggg 9129
<210> 171
<211> 160
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 171
Met Lys Thr Phe Asn Ile Ser Gln Gln Asp Leu Glu Leu Val Glu Val
1 5 10 15
Ala Thr Glu Lys Ile Thr Met Leu Tyr Glu Asp Asn Lys His His Val
20 25 30
Gly Ala Ala Ile Arg Thr Lys Thr Gly Glu Ile Ile Ser Ala Val His
35 40 45
Ile Glu Ala Tyr Ile Gly Arg Val Thr Val Cys Leu Ser Tyr Glu Thr
50 55 60
Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val
65 70 75 80
Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn
85 90 95
Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu
100 105 110
Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys
115 120 125
Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu
130 135 140
Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
145 150 155 160
<210> 172
<211> 9006
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 172
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgtgcagaag ccattgcgat tggtagtgca 780
gtttcgaatg gacaaaagga ttttgacacg attgtagctg ttagacaccc ttattctgac 840
gaagtagata gaagtattcg agtggtaagt ccttgtggta tgtgtaggga gttgatttca 900
gactatgcac cagattgttt tgtgttaata gaaatgaatg gcaagttagt caaaactacg 960
attgaagaac tcattccact caaatatacc cgaaattaat taattaagaa ttcgacccag 1020
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 1080
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 1140
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 1200
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 1260
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 1320
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 1380
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 1440
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 1500
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 1560
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 1620
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 1680
accatggtga gcaagggcga ggaggataac atggccatca tcaaggagtt catgcgcttc 1740
aaggtgcaca tggagggctc cgtgaacggc cacgagttcg agatcgaggg cgagggcgag 1800
ggccgcccct acgagggcac ccagaccgcc aagctgaagg tgaccaaggg tggccccctg 1860
cccttcgcct gggacatcct gtcccctcag ttcatgtacg gctccaaggc ctacgtgaag 1920
caccccgccg acatccccga ctacttgaag ctgtccttcc ccgagggctt caagtgggag 1980
cgcgtgatga acttcgagga cggcggcgtg gtgaccgtga cccaggactc ctccctgcag 2040
gacggcgagt tcatctacaa ggtgaagctg cgcggcacca acttcccctc cgacggcccc 2100
gtaatgcaga agaagaccat gggctgggag gcctcctccg agcggatgta ccccgaggac 2160
ggcgccctga agggcgagat caagcagagg ctgaagctga aggacggcgg ccactacgac 2220
gctgaggtca agaccaccta caaggccaag aagcccgtgc agctgcccgg cgcctacaac 2280
gtcaacatca agttggacat cacctcccac aacgaggact acaccatcgt ggaacagtac 2340
gaacgcgccg agggccgcca ctccaccggc ggcatggacg agctgtacaa gtaacaccgg 2400
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 2460
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 2520
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 2580
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 2640
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 2700
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 2760
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 2820
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 2880
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 2940
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 3000
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 3060
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 3120
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 3180
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 3240
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 3300
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 3360
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 3420
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3480
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 3540
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 3600
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 3660
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 3720
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3780
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 3840
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 3900
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 3960
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 4020
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 4080
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 4140
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4200
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4260
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4320
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 4380
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 4440
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 4500
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 4560
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 4620
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 4680
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 4740
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 4800
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 4860
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 4920
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 4980
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 5040
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 5100
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 5160
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 5220
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 5280
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 5340
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 5400
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 5460
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 5520
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 5580
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 5640
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 5700
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 5760
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 5820
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 5880
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 5940
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 6000
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 6060
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 6120
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 6180
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 6240
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 6300
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 6360
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 6420
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 6480
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 6540
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 6600
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 6660
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 6720
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 6780
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 6840
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 6900
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 6960
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 7020
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 7080
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 7140
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 7200
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 7260
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 7320
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 7380
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 7440
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 7500
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 7560
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 7620
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 7680
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 7740
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 7800
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 7860
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 7920
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 7980
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 8040
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 8100
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 8160
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 8220
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 8280
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 8340
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 8400
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 8460
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 8520
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 8580
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 8640
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 8700
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 8760
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 8820
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 8880
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 8940
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 9000
aggggg 9006
<210> 173
<211> 118
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 173
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ala Glu Ala Ile Ala Ile Gly Ser Ala Val Ser
35 40 45
Asn Gly Gln Lys Asp Phe Asp Thr Ile Val Ala Val Arg His Pro Tyr
50 55 60
Ser Asp Glu Val Asp Arg Ser Ile Arg Val Val Ser Pro Cys Gly Met
65 70 75 80
Cys Arg Glu Leu Ile Ser Asp Tyr Ala Pro Asp Cys Phe Val Leu Ile
85 90 95
Glu Met Asn Gly Lys Leu Val Lys Thr Thr Ile Glu Glu Leu Ile Pro
100 105 110
Leu Lys Tyr Thr Arg Asn
115
<210> 174
<211> 9531
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 174
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcgccgatg gtttctacaa agatcgttat 780
gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga cattggggaa 840
tttagcgaga gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac gttgcaagac 900
ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg cggaggccat ggatgcgatc 960
gctgcggccg atcttagcca gacgagcggg ttcggcccat tcggaccgca aggaatcggt 1020
caatacacta catggcgtga tttcatatgc gcgattgctg atccccatgt gtatcactgg 1080
caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga tgagctgatg 1140
ctttgggccg aggactgccc cgaagtccgg cacctcgtgc acgcggattt cggctgtatc 1200
agtggcgact ccctgatctc actcgcaagc actggaaagc gagttagcat caaggacttg 1260
ctggacgaaa aggatttcga aatttgggca atcaatgagc agaccatgaa actggagtct 1320
gcaaaggtgt cccgggtgtt ttgcacgggt aagaagcttg tttatatcct taaaactaga 1380
ctgggccgga cgatcaaagc caccgcgaac cacagattct tgacaatcga cgggtggaaa 1440
cggctggacg aactgagctt gaaggagcac atcgcccttc ctcggaagct cgagtcatct 1500
tccctgcagc tgtgattaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1560
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1620
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1680
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1740
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1800
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1860
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1920
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1980
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 2040
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 2100
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 2160
acggggacgt ggttttcctt tgaaaaacac gataatacca tggtgagcaa gggcgaggag 2220
ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa cggccacaag 2280
ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac cctgaagttc 2340
atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac cctgacctac 2400
ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt cttcaagtcc 2460
gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga cggcaactac 2520
aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat cgagctgaag 2580
ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta caactacaac 2640
agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt gaacttcaag 2700
atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca gcagaacacc 2760
cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac ccagtccgcc 2820
ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc 2880
gccgggatca ctctcggcat ggacgagctg tacaagtaac accggtggcg cgttaagtcg 2940
acaatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 3000
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 3060
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 3120
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 3180
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 3240
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 3300
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 3360
tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 3420
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 3480
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg cgtcgacttt 3540
aagaccaatg acttacaagg cagctgtaga tcttagccac tttttaaaag aaaagggggg 3600
actggaaggg ctaattcact cccaacgaag acaagatctg ctttttgctt gtactgggtc 3660
tctctggtta gaccagatct gagcctggga gctctctggc taactaggga acccactgct 3720
taagcctcaa taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga 3780
ctctggtaac tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagtac 3840
gtatagtagt tcatgtcatc ttattattca gtatttataa cttgcaaaga aatgaatatc 3900
agagagtgag aggaacttgt ttattgcagc ttataatggt tacaaataaa gcaatagcat 3960
cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt tgtccaaact 4020
catcaatgta tcttatcatg tctggctcta gctatcccgc ccctaactcc gcccatcccg 4080
cccctaactc cgcccagttc cgcccattct ccgccccatg gctgactaat tttttttatt 4140
tatgcagagg ccgaggccgc ctcggcctct gagctattcc agaagtagtg aggaggcttt 4200
tttggaggcc tagggacgta cccaattcgc cctatagtga gtcgtattac gcgcgctcac 4260
tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc 4320
ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc 4380
cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa 4440
gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc 4500
ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag 4560
ctctaaatcg ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca 4620
aaaaacttga ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc 4680
gccctttgac gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa 4740
cactcaaccc tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct 4800
attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa 4860
cgcttacaat ttaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt 4920
tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca 4980
ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt 5040
ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga 5100
tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa 5160
gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct 5220
gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat 5280
acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga 5340
tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc 5400
caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat 5460
gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa 5520
cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac 5580
tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa 5640
agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc 5700
tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc 5760
ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag 5820
acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta 5880
ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa 5940
gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc 6000
gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat 6060
ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga 6120
gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt 6180
tcttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata 6240
cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac 6300
cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg 6360
ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg 6420
tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag 6480
cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct 6540
ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc 6600
aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt 6660
ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg 6720
tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga 6780
gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg 6840
gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg 6900
caacgcaatt aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct 6960
tccggctcgt atgttgtgtg gaattgtgag cggataacaa tttcacacag gaaacagcta 7020
tgaccatgat tacgccaagc gcgcaattaa ccctcactaa agggaacaaa agctggagct 7080
gcaagcttaa tgtagtctta tgcaatactc ttgtagtctt gcaacatggt aacgatgagt 7140
tagcaacatg ccttacaagg agagaaaaag caccgtgcat gccgattggt ggaagtaagg 7200
tggtacgatc gtgccttatt aggaaggcaa cagacgggtc tgacatggat tggacgaacc 7260
actgaattgc cgcattgcag agatattgta tttaagtgcc tagctcgata cataaacggg 7320
tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg 7380
cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt 7440
gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcagt 7500
ggcgcccgaa cagggacttg aaagcgaaag ggaaaccaga ggagctctct cgacgcagga 7560
ctcggcttgc tgaagcgcgc acggcaagag gcgaggggcg gcgactggtg agtacgccaa 7620
aaattttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc agtattaagc 7680
gggggagaat tagatcgcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 7740
ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg 7800
gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc 7860
agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc 7920
atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa 7980
acaaaagtaa gaccaccgca cagcaagcgg ccgctgatct tcagacctgg aggaggagat 8040
atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat tgaaccatta 8100
ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga gagaaaaaag agcagtggga 8160
ataggagctt tgttccttgg gttcttggga gcagcaggaa gcactatggg cgcagcgtca 8220
atgacgctga cggtacaggc cagacaatta ttgtctggta tagtgcagca gcagaacaat 8280
ttgctgaggg ctattgaggc gcaacagcat ctgttgcaac tcacagtctg gggcatcaag 8340
cagctccagg caagaatcct ggctgtggaa agatacctaa aggatcaaca gctcctgggg 8400
atttggggtt gctctggaaa actcatttgc accactgctg tgccttggaa tgctagttgg 8460
agtaataaat ctctggaaca gatttggaat cacacgacct ggatggagtg ggacagagaa 8520
attaacaatt acacaagctt aatacactcc ttaattgaag aatcgcaaaa ccagcaagaa 8580
aagaatgaac aagaattatt ggaattagat aaatgggcaa gtttgtggaa ttggtttaac 8640
ataacaaatt ggctgtggta tataaaatta ttcataatga tagtaggagg cttggtaggt 8700
ttaagaatag tttttgctgt actttctata gtgaatagag ttaggcaggg atattcacca 8760
ttatcgtttc agacccacct cccaaccccg aggggaccct tgcgcctttt ccaaggcagc 8820
cctgggtttg cgcagggacg cggctgctct gggcgtggtt ccgggaaacg cagcggcgcc 8880
gaccctgggt ctcgcacatt cttcacgtcc gttcgcagcg tcacccggat cttcgccgct 8940
acccttgtgg gccccccggc gacgcttcct gctccgcccc taagtcggga aggttccttg 9000
cggttcgcgg cgtgccggac gtgacaaacg gaagccgcac gtctcactag taccctcgca 9060
gacggacagc gccagggagc aatggcagcg cgccgaccgc gatgggctgt ggccaatagc 9120
ggctgctcag cagggcgcgc cgagagcagc ggccgggaag gggcggtgcg ggaggcgggg 9180
tgtggggcgg tagtgtgggc cctgttcctg cccgcgcggt gttccgcatt ctgcaagcct 9240
ccggagcgca cgtcggcagt cggctccctc gttgaccgaa tcaccgacct ctctccccag 9300
ggggtaccca gctgtctaga gaattctaga tcttgagaca aatggcagta ttcatccaca 9360
attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata gtagacataa 9420
tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt caaaattttc 9480
gggtttatta cagggacagc agagatccac tttggcgccg gctcgagggg g 9531
<210> 175
<211> 290
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 175
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
35 40 45
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
50 55 60
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
65 70 75 80
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
85 90 95
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
100 105 110
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
115 120 125
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
130 135 140
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
145 150 155 160
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
165 170 175
His Leu Val His Ala Asp Phe Gly Cys Ile Ser Gly Asp Ser Leu Ile
180 185 190
Ser Leu Ala Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp
195 200 205
Glu Lys Asp Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu
210 215 220
Glu Ser Ala Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val
225 230 235 240
Tyr Ile Leu Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn
245 250 255
His Arg Phe Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser
260 265 270
Leu Lys Glu His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu
275 280 285
Gln Leu
290
<210> 176
<211> 9222
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 176
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagtcc cgaaatcgaa 660
aagctctctc agagcgatat atattgggac tccatcgtaa gcataacaga gacgggggtc 720
gaggaggtgt tcgatctgac agttcctggg cctcataatt tcgtagcgaa cgacatcatt 780
gtacataact ccaacaatgt cctgacggac aatggccgca taacagcggt cattgactgg 840
agcgaggcga tgttcgggga ttcccaatac gaggtcgcca acatcttctt ctggaggccg 900
tggttggctt gtatggagca gcagacgcgc tacttcgagc ggaggcatcc ggagcttgca 960
ggatcgccgc ggctccgggc gtatatgctc cgcattggtc ttgaccaact ctatcagagc 1020
ttggttgacg gcaatttcga tgatgcagct tgggcgcagg gtcgatgcga cgcaatcgtc 1080
cgatccggag ccgggactgt cgggcgtaca caaatcgccc gcagaagcgc ggccgtctgg 1140
accgatggct gtgtagaagt actcgccgat agtggaaacc gacgccccag cactcgtccg 1200
agggcaaagg aatagttaat taagaattcg acccagcttt cttgtacaaa gtggttggta 1260
agcctatccc taaccctctc ctcggtctcg attctacgta gtaatgagct agcagtctcg 1320
aggttaacga attccgcccc ccccctaacg ttactggccg aagccgcttg gaataaggcc 1380
ggtgtgcgct tgtctatatg ttattttcca ccatattgcc gtcttttggc aatgtgaggg 1440
cccggaaacc tggccctgtc ttcttgacga gcattcctag gggtctttcc cctctcgcca 1500
aaggaatgca aggtctgttg aatgtcgtga aggaagcagt tcctctggaa gcttcttgaa 1560
gacaaacaac gtctgtagcg accctttgca ggcagcggaa ccccccacct ggcgacaggt 1620
gcccctgcgg ccaaaagcca cgtgtataag atacacctgc aaaggcggca caaccccagt 1680
gccacgttgt gagttggata gttgtggaaa gagtcaaatg gctctcctca agcgtattca 1740
acaaggggct gaaggatgcc cagaaggtac cccattgtat gggatctgat ctggggcctc 1800
ggtgcacatg ctttacatgt gtttagtcga ggttaaaaaa acgtctaggc cccccgaacc 1860
acggggacgt ggttttcctt tgaaaaacac gataatacca tggtgagcaa gggcgaggag 1920
gataacatgg ccatcatcaa ggagttcatg cgcttcaagg tgcacatgga gggctccgtg 1980
aacggccacg agttcgagat cgagggcgag ggcgagggcc gcccctacga gggcacccag 2040
accgccaagc tgaaggtgac caagggtggc cccctgccct tcgcctggga catcctgtcc 2100
cctcagttca tgtacggctc caaggcctac gtgaagcacc ccgccgacat ccccgactac 2160
ttgaagctgt ccttccccga gggcttcaag tgggagcgcg tgatgaactt cgaggacggc 2220
ggcgtggtga ccgtgaccca ggactcctcc ctgcaggacg gcgagttcat ctacaaggtg 2280
aagctgcgcg gcaccaactt cccctccgac ggccccgtaa tgcagaagaa gaccatgggc 2340
tgggaggcct cctccgagcg gatgtacccc gaggacggcg ccctgaaggg cgagatcaag 2400
cagaggctga agctgaagga cggcggccac tacgacgctg aggtcaagac cacctacaag 2460
gccaagaagc ccgtgcagct gcccggcgcc tacaacgtca acatcaagtt ggacatcacc 2520
tcccacaacg aggactacac catcgtggaa cagtacgaac gcgccgaggg ccgccactcc 2580
accggcggca tggacgagct gtacaagtaa caccggtggc gcgttaagtc gacaatcaac 2640
ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta 2700
cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt 2760
tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg 2820
ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg 2880
gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca 2940
cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca 3000
ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctgtg 3060
ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag 3120
cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc 3180
gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgtcgactt taagaccaat 3240
gacttacaag gcagctgtag atcttagcca ctttttaaaa gaaaaggggg gactggaagg 3300
gctaattcac tcccaacgaa gacaagatct gctttttgct tgtactgggt ctctctggtt 3360
agaccagatc tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca 3420
ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa 3480
ctagagatcc ctcagaccct tttagtcagt gtggaaaatc tctagcagta cgtatagtag 3540
ttcatgtcat cttattattc agtatttata acttgcaaag aaatgaatat cagagagtga 3600
gaggaacttg tttattgcag cttataatgg ttacaaataa agcaatagca tcacaaattt 3660
cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac tcatcaatgt 3720
atcttatcat gtctggctct agctatcccg cccctaactc cgcccatccc gcccctaact 3780
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 3840
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 3900
ctagggacgt acccaattcg ccctatagtg agtcgtatta cgcgcgctca ctggccgtcg 3960
ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 4020
atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 4080
agttgcgcag cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg 4140
gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt 4200
tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc 4260
gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg 4320
attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga 4380
cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc 4440
ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa 4500
aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa 4560
tttaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4620
acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4680
aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 4740
attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 4800
tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 4860
gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 4920
cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 4980
tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 5040
agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5100
tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5160
tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5220
tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5280
acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5340
accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5400
tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5460
cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5520
tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5580
actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5640
tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 5700
cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 5760
gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 5820
tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg ttcttctagt 5880
gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 5940
gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 6000
ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6060
acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg 6120
agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6180
cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6240
tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6300
gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6360
ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6420
ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6480
cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6540
ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6600
taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6660
tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct atgaccatga 6720
ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc tgcaagctta 6780
atgtagtctt atgcaatact cttgtagtct tgcaacatgg taacgatgag ttagcaacat 6840
gccttacaag gagagaaaaa gcaccgtgca tgccgattgg tggaagtaag gtggtacgat 6900
cgtgccttat taggaaggca acagacgggt ctgacatgga ttggacgaac cactgaattg 6960
ccgcattgca gagatattgt atttaagtgc ctagctcgat acataaacgg gtctctctgg 7020
ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct 7080
caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 7140
aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag tggcgcccga 7200
acagggactt gaaagcgaaa gggaaaccag aggagctctc tcgacgcagg actcggcttg 7260
ctgaagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca aaaattttga 7320
ctagcggagg ctagaaggag agagatgggt gcgagagcgt cagtattaag cgggggagaa 7380
ttagatcgcg atgggaaaaa attcggttaa ggccaggggg aaagaaaaaa tataaattaa 7440
aacatatagt atgggcaagc agggagctag aacgattcgc agttaatcct ggcctgttag 7500
aaacatcaga aggctgtaga caaatactgg gacagctaca accatccctt cagacaggat 7560
cagaagaact tagatcatta tataatacag tagcaaccct ctattgtgtg catcaaagga 7620
tagagataaa agacaccaag gaagctttag acaagataga ggaagagcaa aacaaaagta 7680
agaccaccgc acagcaagcg gccgctgatc ttcagacctg gaggaggaga tatgagggac 7740
aattggagaa gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca 7800
cccaccaagg caaagagaag agtggtgcag agagaaaaaa gagcagtggg aataggagct 7860
ttgttccttg ggttcttggg agcagcagga agcactatgg gcgcagcgtc aatgacgctg 7920
acggtacagg ccagacaatt attgtctggt atagtgcagc agcagaacaa tttgctgagg 7980
gctattgagg cgcaacagca tctgttgcaa ctcacagtct ggggcatcaa gcagctccag 8040
gcaagaatcc tggctgtgga aagataccta aaggatcaac agctcctggg gatttggggt 8100
tgctctggaa aactcatttg caccactgct gtgccttgga atgctagttg gagtaataaa 8160
tctctggaac agatttggaa tcacacgacc tggatggagt gggacagaga aattaacaat 8220
tacacaagct taatacactc cttaattgaa gaatcgcaaa accagcaaga aaagaatgaa 8280
caagaattat tggaattaga taaatgggca agtttgtgga attggtttaa cataacaaat 8340
tggctgtggt atataaaatt attcataatg atagtaggag gcttggtagg tttaagaata 8400
gtttttgctg tactttctat agtgaataga gttaggcagg gatattcacc attatcgttt 8460
cagacccacc tcccaacccc gaggggaccc ttgcgccttt tccaaggcag ccctgggttt 8520
gcgcagggac gcggctgctc tgggcgtggt tccgggaaac gcagcggcgc cgaccctggg 8580
tctcgcacat tcttcacgtc cgttcgcagc gtcacccgga tcttcgccgc tacccttgtg 8640
ggccccccgg cgacgcttcc tgctccgccc ctaagtcggg aaggttcctt gcggttcgcg 8700
gcgtgccgga cgtgacaaac ggaagccgca cgtctcacta gtaccctcgc agacggacag 8760
cgccagggag caatggcagc gcgccgaccg cgatgggctg tggccaatag cggctgctca 8820
gcagggcgcg ccgagagcag cggccgggaa ggggcggtgc gggaggcggg gtgtggggcg 8880
gtagtgtggg ccctgttcct gcccgcgcgg tgttccgcat tctgcaagcc tccggagcgc 8940
acgtcggcag tcggctccct cgttgaccga atcaccgacc tctctcccca gggggtaccc 9000
agctgtctag agaattctag atcttgagac aaatggcagt attcatccac aattttaaaa 9060
gaaaaggggg gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag 9120
acatacaaac taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt 9180
acagggacag cagagatcca ctttggcgcc ggctcgaggg gg 9222
<210> 177
<211> 190
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 177
Met Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp
1 5 10 15
Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu
20 25 30
Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His
35 40 45
Asn Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala Val Ile
50 55 60
Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val Ala Asn
65 70 75 80
Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Met Glu Gln Gln Thr Arg
85 90 95
Tyr Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser Pro Arg Leu Arg
100 105 110
Ala Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr Gln Ser Leu Val
115 120 125
Asp Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly Arg Cys Asp Ala
130 135 140
Ile Val Arg Ser Gly Ala Gly Thr Val Gly Arg Thr Gln Ile Ala Arg
145 150 155 160
Arg Ser Ala Ala Val Trp Thr Asp Gly Cys Val Glu Val Leu Ala Asp
165 170 175
Ser Gly Asn Arg Arg Pro Ser Thr Arg Pro Arg Ala Lys Glu
180 185 190
<210> 178
<211> 9420
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 178
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgattaa gatcgctacg 660
cggaagtacc tggggaaaca gaacgtctac gacataggtg tggagcgcga tcacaacttt 720
gctctgaaaa atggatttat cgccagcaac tgcatctccc gccgtgcaca gggtgtcacg 780
ttgcaagacc tgcctgaaac cgaactgccc gctgttctgc agccggtcgc ggaggccatg 840
gatgcgatcg ctgcggccga tcttagccag acgagcgggt tcggcccatt cggaccgcaa 900
ggaatcggtc aatacactac atggcgtgat ttcatatgcg cgattgctga tccccatgtg 960
tatcactggc aaactgtgat ggacgacacc gtcagtgcgt ccgtcgcgca ggctctcgat 1020
gagctgatgc tttgggccga ggactgcccc gaagtccggc acctcgtgca cgcggatttc 1080
ggctgtatca gtggcgactc cctgatctca ctcgcaagca ctggaaagcg agttagcatc 1140
aaggacttgc tggacgaaaa ggatttcgaa atttgggcaa tcaatgagca gaccatgaaa 1200
ctggagtctg caaaggtgtc ccgggtgttt tgcacgggta agaagcttgt ttatatcctt 1260
aaaactagac tgggccggac gatcaaagcc accgcgaacc acagattctt gacaatcgac 1320
gggtggaaac ggctggacga actgagcttg aaggagcaca tcgcccttcc tcggaagctc 1380
gagtcatctt ccctgcagct gtgattaatt aagaattcga cccagctttc ttgtacaaag 1440
tggttggtaa gcctatccct aaccctctcc tcggtctcga ttctacgtag taatgagcta 1500
gcagtctcga ggttaacgaa ttccgccccc cccctaacgt tactggccga agccgcttgg 1560
aataaggccg gtgtgcgctt gtctatatgt tattttccac catattgccg tcttttggca 1620
atgtgagggc ccggaaacct ggccctgtct tcttgacgag cattcctagg ggtctttccc 1680
ctctcgccaa aggaatgcaa ggtctgttga atgtcgtgaa ggaagcagtt cctctggaag 1740
cttcttgaag acaaacaacg tctgtagcga ccctttgcag gcagcggaac cccccacctg 1800
gcgacaggtg cccctgcggc caaaagccac gtgtataaga tacacctgca aaggcggcac 1860
aaccccagtg ccacgttgtg agttggatag ttgtggaaag agtcaaatgg ctctcctcaa 1920
gcgtattcaa caaggggctg aaggatgccc agaaggtacc ccattgtatg ggatctgatc 1980
tggggcctcg gtgcacatgc tttacatgtg tttagtcgag gttaaaaaaa cgtctaggcc 2040
ccccgaacca cggggacgtg gttttccttt gaaaaacacg ataataccat ggtgagcaag 2100
ggcgaggagc tgttcaccgg ggtggtgccc atcctggtcg agctggacgg cgacgtaaac 2160
ggccacaagt tcagcgtgtc cggcgagggc gagggcgatg ccacctacgg caagctgacc 2220
ctgaagttca tctgcaccac cggcaagctg cccgtgccct ggcccaccct cgtgaccacc 2280
ctgacctacg gcgtgcagtg cttcagccgc taccccgacc acatgaagca gcacgacttc 2340
ttcaagtccg ccatgcccga aggctacgtc caggagcgca ccatcttctt caaggacgac 2400
ggcaactaca agacccgcgc cgaggtgaag ttcgagggcg acaccctggt gaaccgcatc 2460
gagctgaagg gcatcgactt caaggaggac ggcaacatcc tggggcacaa gctggagtac 2520
aactacaaca gccacaacgt ctatatcatg gccgacaagc agaagaacgg catcaaggtg 2580
aacttcaaga tccgccacaa catcgaggac ggcagcgtgc agctcgccga ccactaccag 2640
cagaacaccc ccatcggcga cggccccgtg ctgctgcccg acaaccacta cctgagcacc 2700
cagtccgccc tgagcaaaga ccccaacgag aagcgcgatc acatggtcct gctggagttc 2760
gtgaccgccg ccgggatcac tctcggcatg gacgagctgt acaagtaaca ccggtggcgc 2820
gttaagtcga caatcaacct ctggattaca aaatttgtga aagattgact ggtattctta 2880
actatgttgc tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta 2940
ttgcttcccg tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt 3000
atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg 3060
caacccccac tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt 3120
tccccctccc tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag 3180
gggctcggct gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc 3240
cttggctgct cgcctgtgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc 3300
cttcggccct caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc 3360
ttccgcgtct tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc 3420
gtcgacttta agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga 3480
aaagggggga ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg 3540
tactgggtct ctctggttag accagatctg agcctgggag ctctctggct aactagggaa 3600
cccactgctt aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct 3660
gttgtgtgac tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc 3720
tagcagtacg tatagtagtt catgtcatct tattattcag tatttataac ttgcaaagaa 3780
atgaatatca gagagtgaga ggaacttgtt tattgcagct tataatggtt acaaataaag 3840
caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt 3900
gtccaaactc atcaatgtat cttatcatgt ctggctctag ctatcccgcc cctaactccg 3960
cccatcccgc ccctaactcc gcccagttcc gcccattctc cgccccatgg ctgactaatt 4020
ttttttattt atgcagaggc cgaggccgcc tcggcctctg agctattcca gaagtagtga 4080
ggaggctttt ttggaggcct agggacgtac ccaattcgcc ctatagtgag tcgtattacg 4140
cgcgctcact ggccgtcgtt ttacaacgtc gtgactggga aaaccctggc gttacccaac 4200
ttaatcgcct tgcagcacat ccccctttcg ccagctggcg taatagcgaa gaggcccgca 4260
ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga atgggacgcg ccctgtagcg 4320
gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg 4380
ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc 4440
cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc 4500
tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga 4560
cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa 4620
ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg attttgccga 4680
tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca 4740
aaatattaac gcttacaatt taggtggcac ttttcgggga aatgtgcgcg gaacccctat 4800
ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata 4860
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 4920
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 4980
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 5040
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 5100
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 5160
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 5220
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 5280
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 5340
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 5400
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 5460
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 5520
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 5580
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 5640
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 5700
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 5760
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 5820
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 5880
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 5940
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 6000
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 6060
aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 6120
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 6180
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 6240
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 6300
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 6360
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 6420
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 6480
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 6540
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 6600
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 6660
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 6720
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 6780
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 6840
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 6900
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 6960
gctggagctg caagcttaat gtagtcttat gcaatactct tgtagtcttg caacatggta 7020
acgatgagtt agcaacatgc cttacaagga gagaaaaagc accgtgcatg ccgattggtg 7080
gaagtaaggt ggtacgatcg tgccttatta ggaaggcaac agacgggtct gacatggatt 7140
ggacgaacca ctgaattgcc gcattgcaga gatattgtat ttaagtgcct agctcgatac 7200
ataaacgggt ctctctggtt agaccagatc tgagcctggg agctctctgg ctaactaggg 7260
aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt 7320
ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt gtggaaaatc 7380
tctagcagtg gcgcccgaac agggacttga aagcgaaagg gaaaccagag gagctctctc 7440
gacgcaggac tcggcttgct gaagcgcgca cggcaagagg cgaggggcgg cgactggtga 7500
gtacgccaaa aattttgact agcggaggct agaaggagag agatgggtgc gagagcgtca 7560
gtattaagcg ggggagaatt agatcgcgat gggaaaaaat tcggttaagg ccagggggaa 7620
agaaaaaata taaattaaaa catatagtat gggcaagcag ggagctagaa cgattcgcag 7680
ttaatcctgg cctgttagaa acatcagaag gctgtagaca aatactggga cagctacaac 7740
catcccttca gacaggatca gaagaactta gatcattata taatacagta gcaaccctct 7800
attgtgtgca tcaaaggata gagataaaag acaccaagga agctttagac aagatagagg 7860
aagagcaaaa caaaagtaag accaccgcac agcaagcggc cgctgatctt cagacctgga 7920
ggaggagata tgagggacaa ttggagaagt gaattatata aatataaagt agtaaaaatt 7980
gaaccattag gagtagcacc caccaaggca aagagaagag tggtgcagag agaaaaaaga 8040
gcagtgggaa taggagcttt gttccttggg ttcttgggag cagcaggaag cactatgggc 8100
gcagcgtcaa tgacgctgac ggtacaggcc agacaattat tgtctggtat agtgcagcag 8160
cagaacaatt tgctgagggc tattgaggcg caacagcatc tgttgcaact cacagtctgg 8220
ggcatcaagc agctccaggc aagaatcctg gctgtggaaa gatacctaaa ggatcaacag 8280
ctcctgggga tttggggttg ctctggaaaa ctcatttgca ccactgctgt gccttggaat 8340
gctagttgga gtaataaatc tctggaacag atttggaatc acacgacctg gatggagtgg 8400
gacagagaaa ttaacaatta cacaagctta atacactcct taattgaaga atcgcaaaac 8460
cagcaagaaa agaatgaaca agaattattg gaattagata aatgggcaag tttgtggaat 8520
tggtttaaca taacaaattg gctgtggtat ataaaattat tcataatgat agtaggaggc 8580
ttggtaggtt taagaatagt ttttgctgta ctttctatag tgaatagagt taggcaggga 8640
tattcaccat tatcgtttca gacccacctc ccaaccccga ggggaccctt gcgccttttc 8700
caaggcagcc ctgggtttgc gcagggacgc ggctgctctg ggcgtggttc cgggaaacgc 8760
agcggcgccg accctgggtc tcgcacattc ttcacgtccg ttcgcagcgt cacccggatc 8820
ttcgccgcta cccttgtggg ccccccggcg acgcttcctg ctccgcccct aagtcgggaa 8880
ggttccttgc ggttcgcggc gtgccggacg tgacaaacgg aagccgcacg tctcactagt 8940
accctcgcag acggacagcg ccagggagca atggcagcgc gccgaccgcg atgggctgtg 9000
gccaatagcg gctgctcagc agggcgcgcc gagagcagcg gccgggaagg ggcggtgcgg 9060
gaggcggggt gtggggcggt agtgtgggcc ctgttcctgc ccgcgcggtg ttccgcattc 9120
tgcaagcctc cggagcgcac gtcggcagtc ggctccctcg ttgaccgaat caccgacctc 9180
tctccccagg gggtacccag ctgtctagag aattctagat cttgagacaa atggcagtat 9240
tcatccacaa ttttaaaaga aaagggggga ttggggggta cagtgcaggg gaaagaatag 9300
tagacataat agcaacagac atacaaacta aagaattaca aaaacaaatt acaaaaattc 9360
aaaattttcg ggtttattac agggacagca gagatccact ttggcgccgg ctcgaggggg 9420
<210> 179
<211> 253
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 179
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr
1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe
20 25 30
Ile Ala Ser Asn Cys Ile Ser Arg Arg Ala Gln Gly Val Thr Leu Gln
35 40 45
Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu Gln Pro Val Ala Glu
50 55 60
Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser Gln Thr Ser Gly Phe
65 70 75 80
Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr Thr Thr Trp Arg Asp
85 90 95
Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr His Trp Gln Thr Val
100 105 110
Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln Ala Leu Asp Glu Leu
115 120 125
Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg His Leu Val His Ala
130 135 140
Asp Phe Gly Cys Ile Ser Gly Asp Ser Leu Ile Ser Leu Ala Ser Thr
145 150 155 160
Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp Glu Lys Asp Phe Glu
165 170 175
Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu Glu Ser Ala Lys Val
180 185 190
Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val Tyr Ile Leu Lys Thr
195 200 205
Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn His Arg Phe Leu Thr
210 215 220
Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys Glu His Ile
225 230 235 240
Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu
245 250
<210> 180
<211> 9564
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 180
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct gtatcagtgg cgactccctg 1260
atctcactcg caagcactgg aaagcgagtt agcatcaagg acttgctgga cgaaaaggat 1320
ttcgaaattt gggcaatcaa tgagcagacc atgaaactgg agtctgcaaa ggtgtcccgg 1380
gtgttttgca cgggtaagaa gcttgtttat atccttaaaa ctagactggg ccggacgatc 1440
aaagccaccg cgaaccacag attcttgaca atcgacgggt ggaaacggct ggacgaactg 1500
agcttgaagg agcacatcgc ccttcctcgg aagctcgagt catcttccct gcagctgtga 1560
ttaattaaga attcgaccca gctttcttgt acaaagtggt tggtaagcct atccctaacc 1620
ctctcctcgg tctcgattct acgtagtaat gagctagcag tctcgaggtt aacgaattcc 1680
gccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgcttgtct 1740
atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc 1800
ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc 1860
tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg 1920
tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcccc tgcggccaaa 1980
agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt 2040
ggatagttgt ggaaagagtc aaatggctct cctcaagcgt attcaacaag gggctgaagg 2100
atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta 2160
catgtgttta gtcgaggtta aaaaaacgtc taggcccccc gaaccacggg gacgtggttt 2220
tcctttgaaa aacacgataa taccatggcc atgagcgagc tgattaagga gaacatgcac 2280
atgaagctgt acatggaggg caccgtggac aaccatcact tcaagtgcac atccgagggc 2340
gaaggcaagc cctacgaggg cacccagacc atgagaatca aggtggtcga gggcggccct 2400
ctccccttcg ccttcgacat cctggctact agcttcctct acggcagcaa gaccttcatc 2460
aaccacaccc agggcatccc cgacttcttc aagcagtcct tccctgaggg cttcacatgg 2520
gagagagtca ccacatacga agacgggggc gtgctgaccg ctacccagga caccagcctc 2580
caggacggct gcctcatcta caacgtcaag atcagagggg tgaacttcac atccaacggc 2640
cctgtgatgc agaagaaaac actcggctgg gaggccttca ccgagacgct gtaccccgct 2700
gacggcggcc tggaaggcag aaacgacatg gccctgaagc tcgtgggcgg gagccatctg 2760
atcgcaaaca tcaagaccac atatagatcc aagaaacccg ctaagaacct caagatgcct 2820
ggcgtctact atgtggacta cagactggaa agaatcaagg aggccaacaa cgagacctac 2880
gtcgagcagc acgaggtggc agtggccaga tactgcgacc tccctagcaa actggggcac 2940
aagcttaatt aacaccggtg gcgcgttaag tcgacaatca acctctggat tacaaaattt 3000
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 3060
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 3120
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 3180
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 3240
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 3300
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 3360
tgtcggggaa atcatcgtcc tttccttggc tgctcgcctg tgttgccacc tggattctgc 3420
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 3480
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 3540
tctccctttg ggccgcctcc ccgcgtcgac tttaagacca atgacttaca aggcagctgt 3600
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 3660
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 3720
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 3780
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 3840
cttttagtca gtgtggaaaa tctctagcag tacgtatagt agttcatgtc atcttattat 3900
tcagtattta taacttgcaa agaaatgaat atcagagagt gagaggaact tgtttattgc 3960
agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt 4020
ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc atgtctggct 4080
ctagctatcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat 4140
tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc 4200
tctgagctat tccagaagta gtgaggaggc ttttttggag gcctagggac gtacccaatt 4260
cgccctatag tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 4320
gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 4380
ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg 4440
gcgaatggga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4500
gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct 4560
ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt 4620
tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac 4680
gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct 4740
ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4800
ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac 4860
aaaaatttaa cgcgaatttt aacaaaatat taacgcttac aatttaggtg gcacttttcg 4920
gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc 4980
gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag 5040
tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt 5100
tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt 5160
gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga 5220
acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat 5280
tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga 5340
gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag 5400
tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg 5460
accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg 5520
ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt 5580
agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg 5640
gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc 5700
ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg 5760
tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac 5820
ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact 5880
gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa 5940
acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa 6000
aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg 6060
atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc 6120
gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac 6180
tggcttcagc agagcgcaga taccaaatac tgttcttcta gtgtagccgt agttaggcca 6240
ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt 6300
ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc 6360
ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 6420
aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc 6480
cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 6540
gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct 6600
ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 6660
cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt 6720
tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac 6780
cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg 6840
cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga 6900
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac 6960
tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt 7020
gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca agcgcgcaat 7080
taaccctcac taaagggaac aaaagctgga gctgcaagct taatgtagtc ttatgcaata 7140
ctcttgtagt cttgcaacat ggtaacgatg agttagcaac atgccttaca aggagagaaa 7200
aagcaccgtg catgccgatt ggtggaagta aggtggtacg atcgtgcctt attaggaagg 7260
caacagacgg gtctgacatg gattggacga accactgaat tgccgcattg cagagatatt 7320
gtatttaagt gcctagctcg atacataaac gggtctctct ggttagacca gatctgagcc 7380
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 7440
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 7500
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 7560
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 7620
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 7680
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 7740
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 7800
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 7860
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 7920
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 7980
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 8040
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 8100
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 8160
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 8220
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 8280
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 8340
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 8400
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 8460
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 8520
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 8580
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 8640
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 8700
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 8760
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 8820
ccgaggggac ccttgcgcct tttccaaggc agccctgggt ttgcgcaggg acgcggctgc 8880
tctgggcgtg gttccgggaa acgcagcggc gccgaccctg ggtctcgcac attcttcacg 8940
tccgttcgca gcgtcacccg gatcttcgcc gctacccttg tgggcccccc ggcgacgctt 9000
cctgctccgc ccctaagtcg ggaaggttcc ttgcggttcg cggcgtgccg gacgtgacaa 9060
acggaagccg cacgtctcac tagtaccctc gcagacggac agcgccaggg agcaatggca 9120
gcgcgccgac cgcgatgggc tgtggccaat agcggctgct cagcagggcg cgccgagagc 9180
agcggccggg aaggggcggt gcgggaggcg gggtgtgggg cggtagtgtg ggccctgttc 9240
ctgcccgcgc ggtgttccgc attctgcaag cctccggagc gcacgtcggc agtcggctcc 9300
ctcgttgacc gaatcaccga cctctctccc cagggggtac ccagctgtct agagaattct 9360
agatcttgag acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg 9420
ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat 9480
tacaaaaaca aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc 9540
cactttggcg ccggctcgag gggg 9564
<210> 181
<211> 306
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 181
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1 5 10 15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
20 25 30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
35 40 45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
50 55 60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65 70 75 80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Ile Ser Arg Arg Ala Gln
85 90 95
Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val Leu
100 105 110
Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu Ser
115 120 125
Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln Tyr
130 135 140
Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val Tyr
145 150 155 160
His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala Gln
165 170 175
Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val Arg
180 185 190
His Leu Val His Ala Asp Phe Gly Cys Ile Ser Gly Asp Ser Leu Ile
195 200 205
Ser Leu Ala Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp
210 215 220
Glu Lys Asp Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu
225 230 235 240
Glu Ser Ala Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val
245 250 255
Tyr Ile Leu Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn
260 265 270
His Arg Phe Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser
275 280 285
Leu Lys Glu His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu
290 295 300
Gln Leu
305
<210> 182
<211> 9234
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 182
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagtcc cgaaatcgaa 660
aagctctctc agagcgatat atattgggac tccatcgtaa gcataacaga gacgggggtc 720
gaggaggtgt tcgatctgac agttcctggg cctcataatt tcgtagcgaa cgacatcatt 780
gtacataact ccaacaatgt cctgacggac aatggccgca taacagcggt cattgactgg 840
agcgaggcga tgttcgggga ttcccaatac gaggtcgcca acatcttctt ctggaggccg 900
tggttggctt gcctttcata cgagaccgag atcctgactg tcgagtacgg attgcttcct 960
atcggcaaaa tcgtggagaa gaggattgaa tgtaccgtct attcagtcga taataatggg 1020
aacatctaca cacagcccgt ggctcaatgg cacgacagag gagagcagga agtttttgaa 1080
tactgtctcg aggacggatc cctcatccgc gctactaaag atcataagtt tatgaccgtg 1140
gacggccaga tgctgccaat tgacgaaatt tttgaacgag agctggatct gatgagagtc 1200
gacaaccttc caaactgatt aattaagaat tcgacccagc tttcttgtac aaagtggttg 1260
gtaagcctat ccctaaccct ctcctcggtc tcgattctac gtagtaatga gctagcagtc 1320
tcgaggttaa cgaattccgc ccccccccta acgttactgg ccgaagccgc ttggaataag 1380
gccggtgtgc gcttgtctat atgttatttt ccaccatatt gccgtctttt ggcaatgtga 1440
gggcccggaa acctggccct gtcttcttga cgagcattcc taggggtctt tcccctctcg 1500
ccaaaggaat gcaaggtctg ttgaatgtcg tgaaggaagc agttcctctg gaagcttctt 1560
gaagacaaac aacgtctgta gcgacccttt gcaggcagcg gaacccccca cctggcgaca 1620
ggtgcccctg cggccaaaag ccacgtgtat aagatacacc tgcaaaggcg gcacaacccc 1680
agtgccacgt tgtgagttgg atagttgtgg aaagagtcaa atggctctcc tcaagcgtat 1740
tcaacaaggg gctgaaggat gcccagaagg taccccattg tatgggatct gatctggggc 1800
ctcggtgcac atgctttaca tgtgtttagt cgaggttaaa aaaacgtcta ggccccccga 1860
accacgggga cgtggttttc ctttgaaaaa cacgataata ccatggtgag caagggcgag 1920
gagctgttca ccggggtggt gcccatcctg gtcgagctgg acggcgacgt aaacggccac 1980
aagttcagcg tgtccggcga gggcgagggc gatgccacct acggcaagct gaccctgaag 2040
ttcatctgca ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc 2100
tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag 2160
tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac 2220
tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg 2280
aagggcatcg acttcaagga ggacggcaac atcctggggc acaagctgga gtacaactac 2340
aacagccaca acgtctatat catggccgac aagcagaaga acggcatcaa ggtgaacttc 2400
aagatccgcc acaacatcga ggacggcagc gtgcagctcg ccgaccacta ccagcagaac 2460
acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc 2520
gccctgagca aagaccccaa cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc 2580
gccgccggga tcactctcgg catggacgag ctgtacaagt aacaccggtg gcgcgttaag 2640
tcgacaatca acctctggat tacaaaattt gtgaaagatt gactggtatt cttaactatg 2700
ttgctccttt tacgctatgt ggatacgctg ctttaatgcc tttgtatcat gctattgctt 2760
cccgtatggc tttcattttc tcctccttgt ataaatcctg gttgctgtct ctttatgagg 2820
agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc 2880
ccactggttg gggcattgcc accacctgtc agctcctttc cgggactttc gctttccccc 2940
tccctattgc cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg acaggggctc 3000
ggctgttggg cactgacaat tccgtggtgt tgtcggggaa atcatcgtcc tttccttggc 3060
tgctcgcctg tgttgccacc tggattctgc gcgggacgtc cttctgctac gtcccttcgg 3120
ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc 3180
gtcttcgcct tcgccctcag acgagtcgga tctccctttg ggccgcctcc ccgcgtcgac 3240
tttaagacca atgacttaca aggcagctgt agatcttagc cactttttaa aagaaaaggg 3300
gggactggaa gggctaattc actcccaacg aagacaagat ctgctttttg cttgtactgg 3360
gtctctctgg ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact 3420
gcttaagcct caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg 3480
tgactctggt aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag 3540
tacgtatagt agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat 3600
atcagagagt gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag 3660
catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa 3720
actcatcaat gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccatc 3780
ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt 3840
atttatgcag aggccgaggc cgcctcggcc tctgagctat tccagaagta gtgaggaggc 3900
ttttttggag gcctagggac gtacccaatt cgccctatag tgagtcgtat tacgcgcgct 3960
cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc 4020
gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc 4080
gcccttccca acagttgcgc agcctgaatg gcgaatggga cgcgccctgt agcggcgcat 4140
taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 4200
cgcccgctcc tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc 4260
aagctctaaa tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 4320
ccaaaaaact tgattagggt gatggttcac gtagtgggcc atcgccctga tagacggttt 4380
ttcgcccttt gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa 4440
caacactcaa ccctatctcg gtctattctt ttgatttata agggattttg ccgatttcgg 4500
cctattggtt aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat 4560
taacgcttac aatttaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 4620
atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 4680
tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc 4740
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa 4800
agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg 4860
taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt 4920
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg 4980
catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac 5040
ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc 5100
ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa 5160
catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc 5220
aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt 5280
aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga 5340
taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa 5400
atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa 5460
gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa 5520
tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt 5580
ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt 5640
gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg 5700
agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt 5760
aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca 5820
agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac 5880
tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac 5940
atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct 6000
taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg 6060
gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga gatacctaca 6120
gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt 6180
aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta 6240
tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc 6300
gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc 6360
cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa 6420
ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag 6480
cgagtcagtg agcgaggaag cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg 6540
ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga 6600
gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 6660
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 6720
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctgga 6780
gctgcaagct taatgtagtc ttatgcaata ctcttgtagt cttgcaacat ggtaacgatg 6840
agttagcaac atgccttaca aggagagaaa aagcaccgtg catgccgatt ggtggaagta 6900
aggtggtacg atcgtgcctt attaggaagg caacagacgg gtctgacatg gattggacga 6960
accactgaat tgccgcattg cagagatatt gtatttaagt gcctagctcg atacataaac 7020
gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact agggaaccca 7080
ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg 7140
tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa aatctctagc 7200
agtggcgccc gaacagggac ttgaaagcga aagggaaacc agaggagctc tctcgacgca 7260
ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg gcggcgactg gtgagtacgc 7320
caaaaatttt gactagcgga ggctagaagg agagagatgg gtgcgagagc gtcagtatta 7380
agcgggggag aattagatcg cgatgggaaa aaattcggtt aaggccaggg ggaaagaaaa 7440
aatataaatt aaaacatata gtatgggcaa gcagggagct agaacgattc gcagttaatc 7500
ctggcctgtt agaaacatca gaaggctgta gacaaatact gggacagcta caaccatccc 7560
ttcagacagg atcagaagaa cttagatcat tatataatac agtagcaacc ctctattgtg 7620
tgcatcaaag gatagagata aaagacacca aggaagcttt agacaagata gaggaagagc 7680
aaaacaaaag taagaccacc gcacagcaag cggccgctga tcttcagacc tggaggagga 7740
gatatgaggg acaattggag aagtgaatta tataaatata aagtagtaaa aattgaacca 7800
ttaggagtag cacccaccaa ggcaaagaga agagtggtgc agagagaaaa aagagcagtg 7860
ggaataggag ctttgttcct tgggttcttg ggagcagcag gaagcactat gggcgcagcg 7920
tcaatgacgc tgacggtaca ggccagacaa ttattgtctg gtatagtgca gcagcagaac 7980
aatttgctga gggctattga ggcgcaacag catctgttgc aactcacagt ctggggcatc 8040
aagcagctcc aggcaagaat cctggctgtg gaaagatacc taaaggatca acagctcctg 8100
gggatttggg gttgctctgg aaaactcatt tgcaccactg ctgtgccttg gaatgctagt 8160
tggagtaata aatctctgga acagatttgg aatcacacga cctggatgga gtgggacaga 8220
gaaattaaca attacacaag cttaatacac tccttaattg aagaatcgca aaaccagcaa 8280
gaaaagaatg aacaagaatt attggaatta gataaatggg caagtttgtg gaattggttt 8340
aacataacaa attggctgtg gtatataaaa ttattcataa tgatagtagg aggcttggta 8400
ggtttaagaa tagtttttgc tgtactttct atagtgaata gagttaggca gggatattca 8460
ccattatcgt ttcagaccca cctcccaacc ccgaggggac ccttgcgcct tttccaaggc 8520
agccctgggt ttgcgcaggg acgcggctgc tctgggcgtg gttccgggaa acgcagcggc 8580
gccgaccctg ggtctcgcac attcttcacg tccgttcgca gcgtcacccg gatcttcgcc 8640
gctacccttg tgggcccccc ggcgacgctt cctgctccgc ccctaagtcg ggaaggttcc 8700
ttgcggttcg cggcgtgccg gacgtgacaa acggaagccg cacgtctcac tagtaccctc 8760
gcagacggac agcgccaggg agcaatggca gcgcgccgac cgcgatgggc tgtggccaat 8820
agcggctgct cagcagggcg cgccgagagc agcggccggg aaggggcggt gcgggaggcg 8880
gggtgtgggg cggtagtgtg ggccctgttc ctgcccgcgc ggtgttccgc attctgcaag 8940
cctccggagc gcacgtcggc agtcggctcc ctcgttgacc gaatcaccga cctctctccc 9000
cagggggtac ccagctgtct agagaattct agatcttgag acaaatggca gtattcatcc 9060
acaattttaa aagaaaaggg gggattgggg ggtacagtgc aggggaaaga atagtagaca 9120
taatagcaac agacatacaa actaaagaat tacaaaaaca aattacaaaa attcaaaatt 9180
ttcgggttta ttacagggac agcagagatc cactttggcg ccggctcgag gggg 9234
<210> 183
<211> 191
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 183
Met Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp
1               5                   10                  15
Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu
            20                  25                  30
Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His
        35                  40                  45
Asn Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala Val Ile
    50                  55                  60
Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val Ala Asn
65                  70                  75                  80
Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Leu Ser Tyr Glu Thr Glu
                85                  90                  95
Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu
            100                 105                 110
Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile
        115                 120                 125
Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val
    130                 135                 140
Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp
145                 150                 155                 160
His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile
                165                 170                 175
Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn
            180                 185                 190
<210> 184
<211> 9390
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 184
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagtcc cgaaatcgaa 660
aagctctctc agagcgatat atattgggac tccatcgtaa gcataacaga gacgggggtc 720
gaggaggtgt tcgatctgac agttcctggg cctcataatt tcgtagcgaa cgacatcatt 780
gtacataact ccaacaatgt cctgacggac aatggccgca taacagcggt cattgactgg 840
agcgaggcga tgttcgggga ttcccaatac gaggtcgcca acatcttctt ctggaggccg 900
tggttggctt gtatggagca gcagacgcgc tacttcgagc ggaggcatcc ggagcttgca 960
ggatcgccgc ggctccgggc gtatatgctc cgcattggtc ttgaccaact ctatcagagc 1020
ttggttgacg gcaatttcga tgatgcagct tgggcgcagg gtcgatgcct ttcatacgag 1080
accgagatcc tgactgtcga gtacggattg cttcctatcg gcaaaatcgt ggagaagagg 1140
attgaatgta ccgtctattc agtcgataat aatgggaaca tctacacaca gcccgtggct 1200
caatggcacg acagaggaga gcaggaagtt tttgaatact gtctcgagga cggatccctc 1260
atccgcgcta ctaaagatca taagtttatg accgtggacg gccagatgct gccaattgac 1320
gaaatttttg aacgagagct ggatctgatg agagtcgaca accttccaaa ctgattaatt 1380
aagaattcga cccagctttc ttgtacaaag tggttggtaa gcctatccct aaccctctcc 1440
tcggtctcga ttctacgtag taatgagcta gcagtctcga ggttaacgaa ttccgccccc 1500
cccctaacgt tactggccga agccgcttgg aataaggccg gtgtgcgctt gtctatatgt 1560
tattttccac catattgccg tcttttggca atgtgagggc ccggaaacct ggccctgtct 1620
tcttgacgag cattcctagg ggtctttccc ctctcgccaa aggaatgcaa ggtctgttga 1680
atgtcgtgaa ggaagcagtt cctctggaag cttcttgaag acaaacaacg tctgtagcga 1740
ccctttgcag gcagcggaac cccccacctg gcgacaggtg cccctgcggc caaaagccac 1800
gtgtataaga tacacctgca aaggcggcac aaccccagtg ccacgttgtg agttggatag 1860
ttgtggaaag agtcaaatgg ctctcctcaa gcgtattcaa caaggggctg aaggatgccc 1920
agaaggtacc ccattgtatg ggatctgatc tggggcctcg gtgcacatgc tttacatgtg 1980
tttagtcgag gttaaaaaaa cgtctaggcc ccccgaacca cggggacgtg gttttccttt 2040
gaaaaacacg ataataccat ggtgagcaag ggcgaggagc tgttcaccgg ggtggtgccc 2100
atcctggtcg agctggacgg cgacgtaaac ggccacaagt tcagcgtgtc cggcgagggc 2160
gagggcgatg ccacctacgg caagctgacc ctgaagttca tctgcaccac cggcaagctg 2220
cccgtgccct ggcccaccct cgtgaccacc ctgacctacg gcgtgcagtg cttcagccgc 2280
taccccgacc acatgaagca gcacgacttc ttcaagtccg ccatgcccga aggctacgtc 2340
caggagcgca ccatcttctt caaggacgac ggcaactaca agacccgcgc cgaggtgaag 2400
ttcgagggcg acaccctggt gaaccgcatc gagctgaagg gcatcgactt caaggaggac 2460
ggcaacatcc tggggcacaa gctggagtac aactacaaca gccacaacgt ctatatcatg 2520
gccgacaagc agaagaacgg catcaaggtg aacttcaaga tccgccacaa catcgaggac 2580
ggcagcgtgc agctcgccga ccactaccag cagaacaccc ccatcggcga cggccccgtg 2640
ctgctgcccg acaaccacta cctgagcacc cagtccgccc tgagcaaaga ccccaacgag 2700
aagcgcgatc acatggtcct gctggagttc gtgaccgccg ccgggatcac tctcggcatg 2760
gacgagctgt acaagtaaca ccggtggcgc gttaagtcga caatcaacct ctggattaca 2820
aaatttgtga aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat 2880
acgctgcttt aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct 2940
ccttgtataa atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac 3000
gtggcgtggt gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca 3060
cctgtcagct cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca 3120
tcgccgcctg ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg 3180
tggtgttgtc ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga 3240
ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt 3300
cccgcggcct gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga 3360
gtcggatctc cctttgggcc gcctccccgc gtcgacttta agaccaatga cttacaaggc 3420
agctgtagat cttagccact ttttaaaaga aaagggggga ctggaagggc taattcactc 3480
ccaacgaaga caagatctgc tttttgcttg tactgggtct ctctggttag accagatctg 3540
agcctgggag ctctctggct aactagggaa cccactgctt aagcctcaat aaagcttgcc 3600
ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac tctggtaact agagatccct 3660
cagacccttt tagtcagtgt ggaaaatctc tagcagtacg tatagtagtt catgtcatct 3720
tattattcag tatttataac ttgcaaagaa atgaatatca gagagtgaga ggaacttgtt 3780
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 3840
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 3900
ctggctctag ctatcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc 3960
gcccattctc cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc 4020
tcggcctctg agctattcca gaagtagtga ggaggctttt ttggaggcct agggacgtac 4080
ccaattcgcc ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc 4140
gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg 4200
ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc 4260
tgaatggcga atgggacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta 4320
cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc 4380
cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt 4440
tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg 4500
gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca 4560
cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct 4620
attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga 4680
tttaacaaaa atttaacgcg aattttaaca aaatattaac gcttacaatt taggtggcac 4740
ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat 4800
gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag 4860
tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc 4920
tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc 4980
acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc 5040
cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc 5100
ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt 5160
ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt 5220
atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat 5280
cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct 5340
tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat 5400
gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc 5460
ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg 5520
ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc 5580
tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta 5640
cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc 5700
ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga 5760
tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat 5820
gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat 5880
caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa 5940
accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa 6000
ggtaactggc ttcagcagag cgcagatacc aaatactgtt cttctagtgt agccgtagtt 6060
aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt 6120
accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata 6180
gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt 6240
ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac 6300
gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga 6360
gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg 6420
ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa 6480
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat 6540
gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc 6600
tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga 6660
agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg 6720
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta 6780
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg 6840
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagcg 6900
cgcaattaac cctcactaaa gggaacaaaa gctggagctg caagcttaat gtagtcttat 6960
gcaatactct tgtagtcttg caacatggta acgatgagtt agcaacatgc cttacaagga 7020
gagaaaaagc accgtgcatg ccgattggtg gaagtaaggt ggtacgatcg tgccttatta 7080
ggaaggcaac agacgggtct gacatggatt ggacgaacca ctgaattgcc gcattgcaga 7140
gatattgtat ttaagtgcct agctcgatac ataaacgggt ctctctggtt agaccagatc 7200
tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca ataaagcttg 7260
ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc 7320
ctcagaccct tttagtcagt gtggaaaatc tctagcagtg gcgcccgaac agggacttga 7380
aagcgaaagg gaaaccagag gagctctctc gacgcaggac tcggcttgct gaagcgcgca 7440
cggcaagagg cgaggggcgg cgactggtga gtacgccaaa aattttgact agcggaggct 7500
agaaggagag agatgggtgc gagagcgtca gtattaagcg ggggagaatt agatcgcgat 7560
gggaaaaaat tcggttaagg ccagggggaa agaaaaaata taaattaaaa catatagtat 7620
gggcaagcag ggagctagaa cgattcgcag ttaatcctgg cctgttagaa acatcagaag 7680
gctgtagaca aatactggga cagctacaac catcccttca gacaggatca gaagaactta 7740
gatcattata taatacagta gcaaccctct attgtgtgca tcaaaggata gagataaaag 7800
acaccaagga agctttagac aagatagagg aagagcaaaa caaaagtaag accaccgcac 7860
agcaagcggc cgctgatctt cagacctgga ggaggagata tgagggacaa ttggagaagt 7920
gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 7980
aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg 8040
ttcttgggag cagcaggaag cactatgggc gcagcgtcaa tgacgctgac ggtacaggcc 8100
agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg 8160
caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg 8220
gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa 8280
ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag 8340
atttggaatc acacgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta 8400
atacactcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg 8460
gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat 8520
ataaaattat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta 8580
ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc 8640
ccaaccccga ggggaccctt gcgccttttc caaggcagcc ctgggtttgc gcagggacgc 8700
ggctgctctg ggcgtggttc cgggaaacgc agcggcgccg accctgggtc tcgcacattc 8760
ttcacgtccg ttcgcagcgt cacccggatc ttcgccgcta cccttgtggg ccccccggcg 8820
acgcttcctg ctccgcccct aagtcgggaa ggttccttgc ggttcgcggc gtgccggacg 8880
tgacaaacgg aagccgcacg tctcactagt accctcgcag acggacagcg ccagggagca 8940
atggcagcgc gccgaccgcg atgggctgtg gccaatagcg gctgctcagc agggcgcgcc 9000
gagagcagcg gccgggaagg ggcggtgcgg gaggcggggt gtggggcggt agtgtgggcc 9060
ctgttcctgc ccgcgcggtg ttccgcattc tgcaagcctc cggagcgcac gtcggcagtc 9120
ggctccctcg ttgaccgaat caccgacctc tctccccagg gggtacccag ctgtctagag 9180
aattctagat cttgagacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga 9240
ttggggggta cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta 9300
aagaattaca aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca 9360
gagatccact ttggcgccgg ctcgaggggg 9390
<210> 185
<211> 243
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 185
Met Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp
1               5                   10                  15
Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu
            20                  25                  30
Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His
        35                  40                  45
Asn Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala Val Ile
    50                  55                  60
Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val Ala Asn
65                  70                  75                  80
Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Met Glu Gln Gln Thr Arg
                85                  90                  95
Tyr Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser Pro Arg Leu Arg
            100                 105                 110
Ala Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr Gln Ser Leu Val
        115                 120                 125
Asp Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly Arg Cys Leu Ser
    130                 135                 140
Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly
145                 150                 155                 160
Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn
                165                 170                 175
Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly
            180                 185                 190
Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg
        195                 200                 205
Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro
    210                 215                 220
Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn
225                 230                 235                 240
Leu Pro Asn
<210> 186
<211> 10263
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 186
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggcca tgagcgagct gattaaggag aacatgcaca tgaagctgta catggagggc 3000
accgtggaca accatcactt caagtgcaca tccgagggcg aaggcaagcc ctacgagggc 3060
acccagacca tgagaatcaa ggtggtcgag ggcggccctc tccccttcgc cttcgacatc 3120
ctggctacta gcttcctcta cggcagcaag accttcatca accacaccca gggcatcccc 3180
gacttcttca agcagtcctt ccctgagggc ttcacatggg agagagtcac cacatacgaa 3240
gacgggggcg tgctgaccgc tacccaggac accagcctcc aggacggctg cctcatctac 3300
aacgtcaaga tcagaggggt gaacttcaca tccaacggcc ctgtgatgca gaagaaaaca 3360
ctcggctggg aggccttcac cgagacgctg taccccgctg acggcggcct ggaaggcaga 3420
aacgacatgg ccctgaagct cgtgggcggg agccatctga tcgcaaacat caagaccaca 3480
tatagatcca agaaacccgc taagaacctc aagatgcctg gcgtctacta tgtggactac 3540
agactggaaa gaatcaagga ggccaacaac gagacctacg tcgagcagca cgaggtggca 3600
gtggccagat actgcgacct ccctagcaaa ctggggcaca agcttaatta acaccggtgg 3660
cgcgttaagt cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc 3720
ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg 3780
ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc 3840
tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg 3900
acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg 3960
ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga 4020
caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct 4080
ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg 4140
tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc 4200
ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc 4260
cgcgtcgact ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa 4320
agaaaagggg ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc 4380
ttgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg 4440
gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg 4500
tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat 4560
ctctagcagt acgtatagta gttcatgtca tcttattatt cagtatttat aacttgcaaa 4620
gaaatgaata tcagagagtg agaggaactt gtttattgca gcttataatg gttacaaata 4680
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4740
tttgtccaaa ctcatcaatg tatcttatca tgtctggctc tagctatccc gcccctaact 4800
ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta 4860
atttttttta tttatgcaga ggccgaggcc gcctcggcct ctgagctatt ccagaagtag 4920
tgaggaggct tttttggagg cctagggacg tacccaattc gccctatagt gagtcgtatt 4980
acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 5040
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 5100
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta 5160
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 5220
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 5280
ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc 5340
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 5400
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 5460
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc 5520
cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta 5580
acaaaatatt aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5640
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5700
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5760
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5820
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5880
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5940
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 6000
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 6060
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 6120
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 6180
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 6240
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 6300
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 6360
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 6420
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 6480
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 6540
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 6600
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 6660
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 6720
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 6780
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 6840
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 6900
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 6960
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 7020
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 7080
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 7140
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 7200
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 7260
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 7320
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 7380
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 7440
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 7500
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 7560
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 7620
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 7680
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 7740
aggaaacagc tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca 7800
aaagctggag ctgcaagctt aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg 7860
gtaacgatga gttagcaaca tgccttacaa ggagagaaaa agcaccgtgc atgccgattg 7920
gtggaagtaa ggtggtacga tcgtgcctta ttaggaaggc aacagacggg tctgacatgg 7980
attggacgaa ccactgaatt gccgcattgc agagatattg tatttaagtg cctagctcga 8040
tacataaacg ggtctctctg gttagaccag atctgagcct gggagctctc tggctaacta 8100
gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc 8160
cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa 8220
atctctagca gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct 8280
ctcgacgcag gactcggctt gctgaagcgc gcacggcaag aggcgagggg cggcgactgg 8340
tgagtacgcc aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg 8400
tcagtattaa gcgggggaga attagatcgc gatgggaaaa aattcggtta aggccagggg 8460
gaaagaaaaa atataaatta aaacatatag tatgggcaag cagggagcta gaacgattcg 8520
cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg ggacagctac 8580
aaccatccct tcagacagga tcagaagaac ttagatcatt atataataca gtagcaaccc 8640
tctattgtgt gcatcaaagg atagagataa aagacaccaa ggaagcttta gacaagatag 8700
aggaagagca aaacaaaagt aagaccaccg cacagcaagc ggccgctgat cttcagacct 8760
ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa 8820
attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa 8880
agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg 8940
ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag 9000
cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc 9060
tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa 9120
cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg 9180
aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag 9240
tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa 9300
aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg 9360
aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga 9420
ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag 9480
ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc cttgcgcctt 9540
ttccaaggca gccctgggtt tgcgcaggga cgcggctgct ctgggcgtgg ttccgggaaa 9600
cgcagcggcg ccgaccctgg gtctcgcaca ttcttcacgt ccgttcgcag cgtcacccgg 9660
atcttcgccg ctacccttgt gggccccccg gcgacgcttc ctgctccgcc cctaagtcgg 9720
gaaggttcct tgcggttcgc ggcgtgccgg acgtgacaaa cggaagccgc acgtctcact 9780
agtaccctcg cagacggaca gcgccaggga gcaatggcag cgcgccgacc gcgatgggct 9840
gtggccaata gcggctgctc agcagggcgc gccgagagca gcggccggga aggggcggtg 9900
cgggaggcgg ggtgtggggc ggtagtgtgg gccctgttcc tgcccgcgcg gtgttccgca 9960
ttctgcaagc ctccggagcg cacgtcggca gtcggctccc tcgttgaccg aatcaccgac 10020
ctctctcccc agggggtacc cagctgtcta gagaattcta gatcttgaga caaatggcag 10080
tattcatcca caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa 10140
tagtagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa attacaaaaa 10200
ttcaaaattt tcgggtttat tacagggaca gcagagatcc actttggcgc cggctcgagg 10260
ggg 10263
<210> 187
<211> 10275
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 187
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggtga gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg 3000
gacggcgacg taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc 3060
tacggcaagc tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc 3120
accctcgtga ccaccctgac ctacggcgtg cagtgcttca gccgctaccc cgaccacatg 3180
aagcagcacg acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc 3240
ttcttcaagg acgacggcaa ctacaagacc cgcgccgagg tgaagttcga gggcgacacc 3300
ctggtgaacc gcatcgagct gaagggcatc gacttcaagg aggacggcaa catcctgggg 3360
cacaagctgg agtacaacta caacagccac aacgtctata tcatggccga caagcagaag 3420
aacggcatca aggtgaactt caagatccgc cacaacatcg aggacggcag cgtgcagctc 3480
gccgaccact accagcagaa cacccccatc ggcgacggcc ccgtgctgct gcccgacaac 3540
cactacctga gcacccagtc cgccctgagc aaagacccca acgagaagcg cgatcacatg 3600
gtcctgctgg agttcgtgac cgccgccggg atcactctcg gcatggacga gctgtacaag 3660
taacaccggt ggcgcgttaa gtcgacaatc aacctctgga ttacaaaatt tgtgaaagat 3720
tgactggtat tcttaactat gttgctcctt ttacgctatg tggatacgct gctttaatgc 3780
ctttgtatca tgctattgct tcccgtatgg ctttcatttt ctcctccttg tataaatcct 3840
ggttgctgtc tctttatgag gagttgtggc ccgttgtcag gcaacgtggc gtggtgtgca 3900
ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc caccacctgt cagctccttt 3960
ccgggacttt cgctttcccc ctccctattg ccacggcgga actcatcgcc gcctgccttg 4020
cccgctgctg gacaggggct cggctgttgg gcactgacaa ttccgtggtg ttgtcgggga 4080
aatcatcgtc ctttccttgg ctgctcgcct gtgttgccac ctggattctg cgcgggacgt 4140
ccttctgcta cgtcccttcg gccctcaatc cagcggacct tccttcccgc ggcctgctgc 4200
cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca gacgagtcgg atctcccttt 4260
gggccgcctc cccgcgtcga ctttaagacc aatgacttac aaggcagctg tagatcttag 4320
ccacttttta aaagaaaagg ggggactgga agggctaatt cactcccaac gaagacaaga 4380
tctgcttttt gcttgtactg ggtctctctg gttagaccag atctgagcct gggagctctc 4440
tggctaacta gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt 4500
agtgtgtgcc cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc 4560
agtgtggaaa atctctagca gtacgtatag tagttcatgt catcttatta ttcagtattt 4620
ataacttgca aagaaatgaa tatcagagag tgagaggaac ttgtttattg cagcttataa 4680
tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca 4740
ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctggc tctagctatc 4800
ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 4860
catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta 4920
ttccagaagt agtgaggagg cttttttgga ggcctaggga cgtacccaat tcgccctata 4980
gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc 5040
ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 5100
gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggg 5160
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 5220
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 5280
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 5340
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 5400
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 5460
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 5520
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 5580
acgcgaattt taacaaaata ttaacgctta caatttaggt ggcacttttc ggggaaatgt 5640
gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 5700
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 5760
tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 5820
agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 5880
cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 5940
aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 6000
gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 6060
agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 6120
aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 6180
gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 6240
ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 6300
aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 6360
aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 6420
tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 6480
agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 6540
ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 6600
ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 6660
ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 6720
acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 6780
agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 6840
ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 6900
cagagcgcag ataccaaata ctgttcttct agtgtagccg tagttaggcc accacttcaa 6960
gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 7020
cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 7080
gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 7140
caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag 7200
aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 7260
tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 7320
gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 7380
ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 7440
atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 7500
cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 7560
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 7620
cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 7680
accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 7740
acaatttcac acaggaaaca gctatgacca tgattacgcc aagcgcgcaa ttaaccctca 7800
ctaaagggaa caaaagctgg agctgcaagc ttaatgtagt cttatgcaat actcttgtag 7860
tcttgcaaca tggtaacgat gagttagcaa catgccttac aaggagagaa aaagcaccgt 7920
gcatgccgat tggtggaagt aaggtggtac gatcgtgcct tattaggaag gcaacagacg 7980
ggtctgacat ggattggacg aaccactgaa ttgccgcatt gcagagatat tgtatttaag 8040
tgcctagctc gatacataaa cgggtctctc tggttagacc agatctgagc ctgggagctc 8100
tctggctaac tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttcaa 8160
gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga gatccctcag acccttttag 8220
tcagtgtgga aaatctctag cagtggcgcc cgaacaggga cttgaaagcg aaagggaaac 8280
cagaggagct ctctcgacgc aggactcggc ttgctgaagc gcgcacggca agaggcgagg 8340
ggcggcgact ggtgagtacg ccaaaaattt tgactagcgg aggctagaag gagagagatg 8400
ggtgcgagag cgtcagtatt aagcggggga gaattagatc gcgatgggaa aaaattcggt 8460
taaggccagg gggaaagaaa aaatataaat taaaacatat agtatgggca agcagggagc 8520
tagaacgatt cgcagttaat cctggcctgt tagaaacatc agaaggctgt agacaaatac 8580
tgggacagct acaaccatcc cttcagacag gatcagaaga acttagatca ttatataata 8640
cagtagcaac cctctattgt gtgcatcaaa ggatagagat aaaagacacc aaggaagctt 8700
tagacaagat agaggaagag caaaacaaaa gtaagaccac cgcacagcaa gcggccgctg 8760
atcttcagac ctggaggagg agatatgagg gacaattgga gaagtgaatt atataaatat 8820
aaagtagtaa aaattgaacc attaggagta gcacccacca aggcaaagag aagagtggtg 8880
cagagagaaa aaagagcagt gggaatagga gctttgttcc ttgggttctt gggagcagca 8940
ggaagcacta tgggcgcagc gtcaatgacg ctgacggtac aggccagaca attattgtct 9000
ggtatagtgc agcagcagaa caatttgctg agggctattg aggcgcaaca gcatctgttg 9060
caactcacag tctggggcat caagcagctc caggcaagaa tcctggctgt ggaaagatac 9120
ctaaaggatc aacagctcct ggggatttgg ggttgctctg gaaaactcat ttgcaccact 9180
gctgtgcctt ggaatgctag ttggagtaat aaatctctgg aacagatttg gaatcacacg 9240
acctggatgg agtgggacag agaaattaac aattacacaa gcttaataca ctccttaatt 9300
gaagaatcgc aaaaccagca agaaaagaat gaacaagaat tattggaatt agataaatgg 9360
gcaagtttgt ggaattggtt taacataaca aattggctgt ggtatataaa attattcata 9420
atgatagtag gaggcttggt aggtttaaga atagtttttg ctgtactttc tatagtgaat 9480
agagttaggc agggatattc accattatcg tttcagaccc acctcccaac cccgagggga 9540
cccttgcgcc ttttccaagg cagccctggg tttgcgcagg gacgcggctg ctctgggcgt 9600
ggttccggga aacgcagcgg cgccgaccct gggtctcgca cattcttcac gtccgttcgc 9660
agcgtcaccc ggatcttcgc cgctaccctt gtgggccccc cggcgacgct tcctgctccg 9720
cccctaagtc gggaaggttc cttgcggttc gcggcgtgcc ggacgtgaca aacggaagcc 9780
gcacgtctca ctagtaccct cgcagacgga cagcgccagg gagcaatggc agcgcgccga 9840
ccgcgatggg ctgtggccaa tagcggctgc tcagcagggc gcgccgagag cagcggccgg 9900
gaaggggcgg tgcgggaggc ggggtgtggg gcggtagtgt gggccctgtt cctgcccgcg 9960
cggtgttccg cattctgcaa gcctccggag cgcacgtcgg cagtcggctc cctcgttgac 10020
cgaatcaccg acctctctcc ccagggggta cccagctgtc tagagaattc tagatcttga 10080
gacaaatggc agtattcatc cacaatttta aaagaaaagg ggggattggg gggtacagtg 10140
caggggaaag aatagtagac ataatagcaa cagacataca aactaaagaa ttacaaaaac 10200
aaattacaaa aattcaaaat tttcgggttt attacaggga cagcagagat ccactttggc 10260
gccggctcga ggggg 10275
<210> 188
<211> 10266
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 188
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagctgaacg agaaacgtaa aatgatataa atatcaatat attaaattag 660
attttgcata aaaaacagac tacataatac tgtaaaacac aacatatcca gtcactatgg 720
cggccgcatt aggcacccca ggctttacac tttatgcttc cggctcgtat aatgtgtgga 780
ttttgagtta ggatccgtcg agattttcag gagctaagga agctaaaatg gagaaaaaaa 840
tcactggata taccaccgtt gatatatccc aatggcatcg taaagaacat tttgaggcat 900
ttcagtcagt tgctcaatgt acctataacc agaccgttca gctggatatt acggcctttt 960
taaagaccgt aaagaaaaat aagcacaagt tttatccggc ctttattcac attcttgccc 1020
gcctgatgaa tgctcatccg gaattccgta tggcaatgaa agacggtgag ctggtgatat 1080
gggatagtgt tcacccttgt tacaccgttt tccatgagca aactgaaacg ttttcatcgc 1140
tctggagtga ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 1200
cgtgttacgg tgaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 1260
tctcagccaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 1320
acttcttcgc ccccgttttc accatgggca aatattatac gcaaggcgac aaggtgctga 1380
tgccgctggc gattcaggtt catcatgccg tttgtgatgg cttccatgtc ggcagaatgc 1440
ttaatgaatt acaacagtac tgcgatgagt ggcagggcgg ggcgtaaaga tctggatccg 1500
gcttactaaa agccagataa cagtatgcgt atttgcgcgc tgatttttgc ggtataagaa 1560
tatatactga tatgtatacc cgaagtatgt caaaaagagg tatgctatga agcagcgtat 1620
tacagtgaca gttgacagcg acagctatca gttgctcaag gcatatatga tgtcaatatc 1680
tccggtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 1740
aaagcggaaa atcaggaagg gatggctgag gtcgcccggt ttattgaaat gaacggctct 1800
tttgctgacg agaacagggg ctggtgaaat gcagtttaag gtttacacct ataaaagaga 1860
gagccgttat cgtctgtttg tggatgtaca gagtgatatt attgacacgc ccgggcgacg 1920
gatggtgatc cccctggcca gtgcacgtct gctgtcagat aaagtctccc gtgaacttta 1980
cccggtggtg catatcgggg atgaaagctg gcgcatgatg accaccgata tggccagtgt 2040
gccggtctcc gttatcgggg aagaagtggc tgatctcagc caccgcgaaa atgacatcaa 2100
aaacgccatt aacctgatgt tctggggaat ataaatgtca ggctccctta tacacagcca 2160
gtctgcaggt cgaccatagt gactggatat gttgtgtttt acagtattat gtagtctgtt 2220
ttttatgcaa aatctaattt aatatattga tatttatatc attttacgtt tctcgttcag 2280
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 2340
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 2400
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 2460
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 2520
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 2580
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 2640
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 2700
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 2760
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 2820
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 2880
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 2940
accatggtga gcaagggcga ggaggataac atggccatca tcaaggagtt catgcgcttc 3000
aaggtgcaca tggagggctc cgtgaacggc cacgagttcg agatcgaggg cgagggcgag 3060
ggccgcccct acgagggcac ccagaccgcc aagctgaagg tgaccaaggg tggccccctg 3120
cccttcgcct gggacatcct gtcccctcag ttcatgtacg gctccaaggc ctacgtgaag 3180
caccccgccg acatccccga ctacttgaag ctgtccttcc ccgagggctt caagtgggag 3240
cgcgtgatga acttcgagga cggcggcgtg gtgaccgtga cccaggactc ctccctgcag 3300
gacggcgagt tcatctacaa ggtgaagctg cgcggcacca acttcccctc cgacggcccc 3360
gtaatgcaga agaagaccat gggctgggag gcctcctccg agcggatgta ccccgaggac 3420
ggcgccctga agggcgagat caagcagagg ctgaagctga aggacggcgg ccactacgac 3480
gctgaggtca agaccaccta caaggccaag aagcccgtgc agctgcccgg cgcctacaac 3540
gtcaacatca agttggacat cacctcccac aacgaggact acaccatcgt ggaacagtac 3600
gaacgcgccg agggccgcca ctccaccggc ggcatggacg agctgtacaa gtaacaccgg 3660
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 3720
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 3780
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 3840
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 3900
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 3960
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 4020
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 4080
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 4140
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 4200
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 4260
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 4320
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 4380
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 4440
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 4500
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 4560
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 4620
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 4680
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 4740
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 4800
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 4860
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 4920
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 4980
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 5040
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 5100
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 5160
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 5220
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 5280
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 5340
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 5400
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 5460
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 5520
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 5580
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 5640
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 5700
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 5760
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 5820
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 5880
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 5940
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 6000
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 6060
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 6120
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 6180
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 6240
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 6300
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 6360
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 6420
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 6480
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 6540
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 6600
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 6660
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 6720
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 6780
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 6840
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 6900
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 6960
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 7020
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 7080
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 7140
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 7200
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 7260
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 7320
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 7380
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 7440
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 7500
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 7560
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 7620
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 7680
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 7740
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 7800
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 7860
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 7920
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 7980
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 8040
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 8100
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 8160
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 8220
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 8280
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 8340
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 8400
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 8460
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 8520
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 8580
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 8640
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 8700
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 8760
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 8820
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 8880
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 8940
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 9000
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 9060
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 9120
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 9180
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 9240
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 9300
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 9360
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 9420
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 9480
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 9540
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 9600
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 9660
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 9720
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 9780
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 9840
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 9900
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 9960
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 10020
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 10080
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 10140
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 10200
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 10260
aggggg 10266
<210> 189
<211> 9669
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 189
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 1320
gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 1380
tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 1440
cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct 1500
tgggcgcagg gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca 1560
caaatcgccc gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat 1620
agtggaaacc gacgccccag cactcgtccg agggcaaagg aatagttaat taagaattcg 1680
acccagcttt cttgtacaaa gtggttggta agcctatccc taaccctctc ctcggtctcg 1740
attctacgta gtaatgagct agcagtctcg aggttaacga attccgcccc ccccctaacg 1800
ttactggccg aagccgcttg gaataaggcc ggtgtgcgct tgtctatatg ttattttcca 1860
ccatattgcc gtcttttggc aatgtgaggg cccggaaacc tggccctgtc ttcttgacga 1920
gcattcctag gggtctttcc cctctcgcca aaggaatgca aggtctgttg aatgtcgtga 1980
aggaagcagt tcctctggaa gcttcttgaa gacaaacaac gtctgtagcg accctttgca 2040
ggcagcggaa ccccccacct ggcgacaggt gcccctgcgg ccaaaagcca cgtgtataag 2100
atacacctgc aaaggcggca caaccccagt gccacgttgt gagttggata gttgtggaaa 2160
gagtcaaatg gctctcctca agcgtattca acaaggggct gaaggatgcc cagaaggtac 2220
cccattgtat gggatctgat ctggggcctc ggtgcacatg ctttacatgt gtttagtcga 2280
ggttaaaaaa acgtctaggc cccccgaacc acggggacgt ggttttcctt tgaaaaacac 2340
gataatacca tggccatgag cgagctgatt aaggagaaca tgcacatgaa gctgtacatg 2400
gagggcaccg tggacaacca tcacttcaag tgcacatccg agggcgaagg caagccctac 2460
gagggcaccc agaccatgag aatcaaggtg gtcgagggcg gccctctccc cttcgccttc 2520
gacatcctgg ctactagctt cctctacggc agcaagacct tcatcaacca cacccagggc 2580
atccccgact tcttcaagca gtccttccct gagggcttca catgggagag agtcaccaca 2640
tacgaagacg ggggcgtgct gaccgctacc caggacacca gcctccagga cggctgcctc 2700
atctacaacg tcaagatcag aggggtgaac ttcacatcca acggccctgt gatgcagaag 2760
aaaacactcg gctgggaggc cttcaccgag acgctgtacc ccgctgacgg cggcctggaa 2820
ggcagaaacg acatggccct gaagctcgtg ggcgggagcc atctgatcgc aaacatcaag 2880
accacatata gatccaagaa acccgctaag aacctcaaga tgcctggcgt ctactatgtg 2940
gactacagac tggaaagaat caaggaggcc aacaacgaga cctacgtcga gcagcacgag 3000
gtggcagtgg ccagatactg cgacctccct agcaaactgg ggcacaagct taattaacac 3060
cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 3120
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 3180
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 3240
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 3300
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 3360
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 3420
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 3480
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 3540
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 3600
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 3660
cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc ttagccactt 3720
tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac aagatctgct 3780
ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta 3840
actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg 3900
tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg 3960
gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt atttataact 4020
tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt ataatggtta 4080
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 4140
ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc tatcccgccc 4200
ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc 4260
tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag 4320
aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc tatagtgagt 4380
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 4440
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 4500
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tgggacgcgc 4560
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 4620
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 4680
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 4740
tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt gggccatcgc 4800
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 4860
tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 4920
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 4980
attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa atgtgcgcgg 5040
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 5100
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 5160
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 5220
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 5280
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 5340
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 5400
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 5460
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 5520
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 5580
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 5640
gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac 5700
gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga 5760
ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg 5820
gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact 5880
ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 5940
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta 6000
actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt 6060
taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga 6120
gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc 6180
tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 6240
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc 6300
gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc 6360
tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg 6420
cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg 6480
gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 6540
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc 6600
ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg 6660
gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg 6720
atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt 6780
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 6840
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg 6900
aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc 6960
gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 7020
gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 7080
ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 7140
tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc ctcactaaag 7200
ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt gtagtcttgc 7260
aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc 7320
cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg 7380
acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt taagtgccta 7440
gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga gctctctggc 7500
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 7560
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 7620
tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg aaaccagagg 7680
agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 7740
gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 7800
agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt cggttaaggc 7860
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 7920
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 7980
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 8040
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 8100
agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc gctgatcttc 8160
agacctggag gaggagatat gagggacaat tggagaagtg aattatataa atataaagta 8220
gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga 8280
gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc 8340
actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt gtctggtata 8400
gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc 8460
acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag atacctaaag 8520
gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg 8580
ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca cacgacctgg 8640
atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa 8700
tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt 8760
ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata 8820
gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt 8880
aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag gggacccttg 8940
cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg gcgtggttcc 9000
gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt tcgcagcgtc 9060
acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc tccgccccta 9120
agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga agccgcacgt 9180
ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg ccgaccgcga 9240
tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg ccgggaaggg 9300
gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc cgcgcggtgt 9360
tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt tgaccgaatc 9420
accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc ttgagacaaa 9480
tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac agtgcagggg 9540
aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa aaacaaatta 9600
caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt tggcgccggc 9660
tcgaggggg 9669
<210> 190
<211> 9672
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 190
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 1320
gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 1380
tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 1440
cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct 1500
tgggcgcagg gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca 1560
caaatcgccc gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat 1620
agtggaaacc gacgccccag cactcgtccg agggcaaagg aatagttaat taagaattcg 1680
acccagcttt cttgtacaaa gtggttggta agcctatccc taaccctctc ctcggtctcg 1740
attctacgta gtaatgagct agcagtctcg aggttaacga attccgcccc ccccctaacg 1800
ttactggccg aagccgcttg gaataaggcc ggtgtgcgct tgtctatatg ttattttcca 1860
ccatattgcc gtcttttggc aatgtgaggg cccggaaacc tggccctgtc ttcttgacga 1920
gcattcctag gggtctttcc cctctcgcca aaggaatgca aggtctgttg aatgtcgtga 1980
aggaagcagt tcctctggaa gcttcttgaa gacaaacaac gtctgtagcg accctttgca 2040
ggcagcggaa ccccccacct ggcgacaggt gcccctgcgg ccaaaagcca cgtgtataag 2100
atacacctgc aaaggcggca caaccccagt gccacgttgt gagttggata gttgtggaaa 2160
gagtcaaatg gctctcctca agcgtattca acaaggggct gaaggatgcc cagaaggtac 2220
cccattgtat gggatctgat ctggggcctc ggtgcacatg ctttacatgt gtttagtcga 2280
ggttaaaaaa acgtctaggc cccccgaacc acggggacgt ggttttcctt tgaaaaacac 2340
gataatacca tggtgagcaa gggcgaggag gataacatgg ccatcatcaa ggagttcatg 2400
cgcttcaagg tgcacatgga gggctccgtg aacggccacg agttcgagat cgagggcgag 2460
ggcgagggcc gcccctacga gggcacccag accgccaagc tgaaggtgac caagggtggc 2520
cccctgccct tcgcctggga catcctgtcc cctcagttca tgtacggctc caaggcctac 2580
gtgaagcacc ccgccgacat ccccgactac ttgaagctgt ccttccccga gggcttcaag 2640
tgggagcgcg tgatgaactt cgaggacggc ggcgtggtga ccgtgaccca ggactcctcc 2700
ctgcaggacg gcgagttcat ctacaaggtg aagctgcgcg gcaccaactt cccctccgac 2760
ggccccgtaa tgcagaagaa gaccatgggc tgggaggcct cctccgagcg gatgtacccc 2820
gaggacggcg ccctgaaggg cgagatcaag cagaggctga agctgaagga cggcggccac 2880
tacgacgctg aggtcaagac cacctacaag gccaagaagc ccgtgcagct gcccggcgcc 2940
tacaacgtca acatcaagtt ggacatcacc tcccacaacg aggactacac catcgtggaa 3000
cagtacgaac gcgccgaggg ccgccactcc accggcggca tggacgagct gtacaagtaa 3060
caccggtggc gcgttaagtc gacaatcaac ctctggatta caaaatttgt gaaagattga 3120
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 3180
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 3240
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3300
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3360
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3420
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3480
catcgtcctt tccttggctg ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct 3540
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3600
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3660
ccgcctcccc gcgtcgactt taagaccaat gacttacaag gcagctgtag atcttagcca 3720
ctttttaaaa gaaaaggggg gactggaagg gctaattcac tcccaacgaa gacaagatct 3780
gctttttgct tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg 3840
ctaactaggg aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt 3900
gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt 3960
gtggaaaatc tctagcagta cgtatagtag ttcatgtcat cttattattc agtatttata 4020
acttgcaaag aaatgaatat cagagagtga gaggaacttg tttattgcag cttataatgg 4080
ttacaaataa agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc 4140
tagttgtggt ttgtccaaac tcatcaatgt atcttatcat gtctggctct agctatcccg 4200
cccctaactc cgcccatccc gcccctaact ccgcccagtt ccgcccattc tccgccccat 4260
ggctgactaa ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc 4320
cagaagtagt gaggaggctt ttttggaggc ctagggacgt acccaattcg ccctatagtg 4380
agtcgtatta cgcgcgctca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 4440
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 4500
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatgggacg 4560
cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4620
cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4680
tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4740
ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat 4800
cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4860
tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag 4920
ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4980
cgaattttaa caaaatatta acgcttacaa tttaggtggc acttttcggg gaaatgtgcg 5040
cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 5100
ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 5160
ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 5220
aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 5280
actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 5340
gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 5400
agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 5460
cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 5520
catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5580
aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5640
gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5700
aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5760
agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5820
ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5880
actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5940
aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 6000
gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 6060
atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 6120
tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 6180
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 6240
ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 6300
agcgcagata ccaaatactg ttcttctagt gtagccgtag ttaggccacc acttcaagaa 6360
ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 6420
tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 6480
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6540
cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa 6600
ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6660
agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6720
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6780
ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6840
ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6900
ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6960
accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 7020
ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 7080
ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 7140
atttcacaca ggaaacagct atgaccatga ttacgccaag cgcgcaatta accctcacta 7200
aagggaacaa aagctggagc tgcaagctta atgtagtctt atgcaatact cttgtagtct 7260
tgcaacatgg taacgatgag ttagcaacat gccttacaag gagagaaaaa gcaccgtgca 7320
tgccgattgg tggaagtaag gtggtacgat cgtgccttat taggaaggca acagacgggt 7380
ctgacatgga ttggacgaac cactgaattg ccgcattgca gagatattgt atttaagtgc 7440
ctagctcgat acataaacgg gtctctctgg ttagaccaga tctgagcctg ggagctctct 7500
ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt gcttcaagta 7560
gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc cttttagtca 7620
gtgtggaaaa tctctagcag tggcgcccga acagggactt gaaagcgaaa gggaaaccag 7680
aggagctctc tcgacgcagg actcggcttg ctgaagcgcg cacggcaaga ggcgaggggc 7740
ggcgactggt gagtacgcca aaaattttga ctagcggagg ctagaaggag agagatgggt 7800
gcgagagcgt cagtattaag cgggggagaa ttagatcgcg atgggaaaaa attcggttaa 7860
ggccaggggg aaagaaaaaa tataaattaa aacatatagt atgggcaagc agggagctag 7920
aacgattcgc agttaatcct ggcctgttag aaacatcaga aggctgtaga caaatactgg 7980
gacagctaca accatccctt cagacaggat cagaagaact tagatcatta tataatacag 8040
tagcaaccct ctattgtgtg catcaaagga tagagataaa agacaccaag gaagctttag 8100
acaagataga ggaagagcaa aacaaaagta agaccaccgc acagcaagcg gccgctgatc 8160
ttcagacctg gaggaggaga tatgagggac aattggagaa gtgaattata taaatataaa 8220
gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg caaagagaag agtggtgcag 8280
agagaaaaaa gagcagtggg aataggagct ttgttccttg ggttcttggg agcagcagga 8340
agcactatgg gcgcagcgtc aatgacgctg acggtacagg ccagacaatt attgtctggt 8400
atagtgcagc agcagaacaa tttgctgagg gctattgagg cgcaacagca tctgttgcaa 8460
ctcacagtct ggggcatcaa gcagctccag gcaagaatcc tggctgtgga aagataccta 8520
aaggatcaac agctcctggg gatttggggt tgctctggaa aactcatttg caccactgct 8580
gtgccttgga atgctagttg gagtaataaa tctctggaac agatttggaa tcacacgacc 8640
tggatggagt gggacagaga aattaacaat tacacaagct taatacactc cttaattgaa 8700
gaatcgcaaa accagcaaga aaagaatgaa caagaattat tggaattaga taaatgggca 8760
agtttgtgga attggtttaa cataacaaat tggctgtggt atataaaatt attcataatg 8820
atagtaggag gcttggtagg tttaagaata gtttttgctg tactttctat agtgaataga 8880
gttaggcagg gatattcacc attatcgttt cagacccacc tcccaacccc gaggggaccc 8940
ttgcgccttt tccaaggcag ccctgggttt gcgcagggac gcggctgctc tgggcgtggt 9000
tccgggaaac gcagcggcgc cgaccctggg tctcgcacat tcttcacgtc cgttcgcagc 9060
gtcacccgga tcttcgccgc tacccttgtg ggccccccgg cgacgcttcc tgctccgccc 9120
ctaagtcggg aaggttcctt gcggttcgcg gcgtgccgga cgtgacaaac ggaagccgca 9180
cgtctcacta gtaccctcgc agacggacag cgccagggag caatggcagc gcgccgaccg 9240
cgatgggctg tggccaatag cggctgctca gcagggcgcg ccgagagcag cggccgggaa 9300
ggggcggtgc gggaggcggg gtgtggggcg gtagtgtggg ccctgttcct gcccgcgcgg 9360
tgttccgcat tctgcaagcc tccggagcgc acgtcggcag tcggctccct cgttgaccga 9420
atcaccgacc tctctcccca gggggtaccc agctgtctag agaattctag atcttgagac 9480
aaatggcagt attcatccac aattttaaaa gaaaaggggg gattgggggg tacagtgcag 9540
gggaaagaat agtagacata atagcaacag acatacaaac taaagaatta caaaaacaaa 9600
ttacaaaaat tcaaaatttt cgggtttatt acagggacag cagagatcca ctttggcgcc 9660
ggctcgaggg gg 9672
<210> 191
<211> 9243
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 191
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gggggcggtg 900
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt cccggctggc cgcgcagcaa 960
cagatggaag gcctcctggc gccgcaccgg cccaaggagc ccgcgtggtt cctggccacc 1020
gtcggcgtct cgcccgacca ccagggcaag ggtctgggca gcgccgtcgt gctccccgga 1080
gtggaggcgg ccgagcgcgc cggggtgccc gccttcctgg agacctccgc gccccgcaac 1140
ctccccttct acgagcggct cggcttcacc gtcaccgccg acgtcgaggt gcccgaagga 1200
ccgcgcacct ggtgcatgac ccgcaagccc ggtgcctgat taattaagaa ttcgacccag 1260
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 1320
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 1380
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 1440
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 1500
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 1560
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 1620
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 1680
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 1740
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 1800
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 1860
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 1920
accatggcca tgagcgagct gattaaggag aacatgcaca tgaagctgta catggagggc 1980
accgtggaca accatcactt caagtgcaca tccgagggcg aaggcaagcc ctacgagggc 2040
acccagacca tgagaatcaa ggtggtcgag ggcggccctc tccccttcgc cttcgacatc 2100
ctggctacta gcttcctcta cggcagcaag accttcatca accacaccca gggcatcccc 2160
gacttcttca agcagtcctt ccctgagggc ttcacatggg agagagtcac cacatacgaa 2220
gacgggggcg tgctgaccgc tacccaggac accagcctcc aggacggctg cctcatctac 2280
aacgtcaaga tcagaggggt gaacttcaca tccaacggcc ctgtgatgca gaagaaaaca 2340
ctcggctggg aggccttcac cgagacgctg taccccgctg acggcggcct ggaaggcaga 2400
aacgacatgg ccctgaagct cgtgggcggg agccatctga tcgcaaacat caagaccaca 2460
tatagatcca agaaacccgc taagaacctc aagatgcctg gcgtctacta tgtggactac 2520
agactggaaa gaatcaagga ggccaacaac gagacctacg tcgagcagca cgaggtggca 2580
gtggccagat actgcgacct ccctagcaaa ctggggcaca agcttaatta acaccggtgg 2640
cgcgttaagt cgacaatcaa cctctggatt acaaaatttg tgaaagattg actggtattc 2700
ttaactatgt tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg 2760
ctattgcttc ccgtatggct ttcattttct cctccttgta taaatcctgg ttgctgtctc 2820
tttatgagga gttgtggccc gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg 2880
acgcaacccc cactggttgg ggcattgcca ccacctgtca gctcctttcc gggactttcg 2940
ctttccccct ccctattgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga 3000
caggggctcg gctgttgggc actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct 3060
ttccttggct gctcgcctgt gttgccacct ggattctgcg cgggacgtcc ttctgctacg 3120
tcccttcggc cctcaatcca gcggaccttc cttcccgcgg cctgctgccg gctctgcggc 3180
ctcttccgcg tcttcgcctt cgccctcaga cgagtcggat ctccctttgg gccgcctccc 3240
cgcgtcgact ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa 3300
agaaaagggg ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc 3360
ttgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg 3420
gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg 3480
tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat 3540
ctctagcagt acgtatagta gttcatgtca tcttattatt cagtatttat aacttgcaaa 3600
gaaatgaata tcagagagtg agaggaactt gtttattgca gcttataatg gttacaaata 3660
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 3720
tttgtccaaa ctcatcaatg tatcttatca tgtctggctc tagctatccc gcccctaact 3780
ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca tggctgacta 3840
atttttttta tttatgcaga ggccgaggcc gcctcggcct ctgagctatt ccagaagtag 3900
tgaggaggct tttttggagg cctagggacg tacccaattc gccctatagt gagtcgtatt 3960
acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 4020
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 4080
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta 4140
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 4200
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 4260
ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc 4320
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 4380
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4440
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc 4500
cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta 4560
acaaaatatt aacgcttaca atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc 4620
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 4680
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 4740
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 4800
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 4860
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 4920
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 4980
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5040
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 5100
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 5160
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 5220
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 5280
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 5340
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 5400
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 5460
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 5520
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 5580
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 5640
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 5700
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 5760
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 5820
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 5880
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 5940
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 6000
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 6060
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 6120
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 6180
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 6240
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 6300
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 6360
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 6420
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 6480
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 6540
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 6600
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 6660
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 6720
aggaaacagc tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca 6780
aaagctggag ctgcaagctt aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg 6840
gtaacgatga gttagcaaca tgccttacaa ggagagaaaa agcaccgtgc atgccgattg 6900
gtggaagtaa ggtggtacga tcgtgcctta ttaggaaggc aacagacggg tctgacatgg 6960
attggacgaa ccactgaatt gccgcattgc agagatattg tatttaagtg cctagctcga 7020
tacataaacg ggtctctctg gttagaccag atctgagcct gggagctctc tggctaacta 7080
gggaacccac tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc 7140
cgtctgttgt gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa 7200
atctctagca gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct 7260
ctcgacgcag gactcggctt gctgaagcgc gcacggcaag aggcgagggg cggcgactgg 7320
tgagtacgcc aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg 7380
tcagtattaa gcgggggaga attagatcgc gatgggaaaa aattcggtta aggccagggg 7440
gaaagaaaaa atataaatta aaacatatag tatgggcaag cagggagcta gaacgattcg 7500
cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg ggacagctac 7560
aaccatccct tcagacagga tcagaagaac ttagatcatt atataataca gtagcaaccc 7620
tctattgtgt gcatcaaagg atagagataa aagacaccaa ggaagcttta gacaagatag 7680
aggaagagca aaacaaaagt aagaccaccg cacagcaagc ggccgctgat cttcagacct 7740
ggaggaggag atatgaggga caattggaga agtgaattat ataaatataa agtagtaaaa 7800
attgaaccat taggagtagc acccaccaag gcaaagagaa gagtggtgca gagagaaaaa 7860
agagcagtgg gaataggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg 7920
ggcgcagcgt caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag 7980
cagcagaaca atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc 8040
tggggcatca agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa 8100
cagctcctgg ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg 8160
aatgctagtt ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag 8220
tgggacagag aaattaacaa ttacacaagc ttaatacact ccttaattga agaatcgcaa 8280
aaccagcaag aaaagaatga acaagaatta ttggaattag ataaatgggc aagtttgtgg 8340
aattggttta acataacaaa ttggctgtgg tatataaaat tattcataat gatagtagga 8400
ggcttggtag gtttaagaat agtttttgct gtactttcta tagtgaatag agttaggcag 8460
ggatattcac cattatcgtt tcagacccac ctcccaaccc cgaggggacc cttgcgcctt 8520
ttccaaggca gccctgggtt tgcgcaggga cgcggctgct ctgggcgtgg ttccgggaaa 8580
cgcagcggcg ccgaccctgg gtctcgcaca ttcttcacgt ccgttcgcag cgtcacccgg 8640
atcttcgccg ctacccttgt gggccccccg gcgacgcttc ctgctccgcc cctaagtcgg 8700
gaaggttcct tgcggttcgc ggcgtgccgg acgtgacaaa cggaagccgc acgtctcact 8760
agtaccctcg cagacggaca gcgccaggga gcaatggcag cgcgccgacc gcgatgggct 8820
gtggccaata gcggctgctc agcagggcgc gccgagagca gcggccggga aggggcggtg 8880
cgggaggcgg ggtgtggggc ggtagtgtgg gccctgttcc tgcccgcgcg gtgttccgca 8940
ttctgcaagc ctccggagcg cacgtcggca gtcggctccc tcgttgaccg aatcaccgac 9000
ctctctcccc agggggtacc cagctgtcta gagaattcta gatcttgaga caaatggcag 9060
tattcatcca caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa 9120
tagtagacat aatagcaaca gacatacaaa ctaaagaatt acaaaaacaa attacaaaaa 9180
ttcaaaattt tcgggtttat tacagggaca gcagagatcc actttggcgc cggctcgagg 9240
ggg 9243
<210> 192
<211> 9246
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 192
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaccgagta caagcccacg 660
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac gcaccctcgc cgccgcgttc 720
gccgactacc ccgccacgcg ccacaccgtc gatccggacc gccacatcga gcgggtcacc 780
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca tcggcaaggt gtgggtcgcg 840
gacgacggcg ccgcggtggc ggtctggacc acgccggaga gcgtcgaagc gggggcggtg 900
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt cccggctggc cgcgcagcaa 960
cagatggaag gcctcctggc gccgcaccgg cccaaggagc ccgcgtggtt cctggccacc 1020
gtcggcgtct cgcccgacca ccagggcaag ggtctgggca gcgccgtcgt gctccccgga 1080
gtggaggcgg ccgagcgcgc cggggtgccc gccttcctgg agacctccgc gccccgcaac 1140
ctccccttct acgagcggct cggcttcacc gtcaccgccg acgtcgaggt gcccgaagga 1200
ccgcgcacct ggtgcatgac ccgcaagccc ggtgcctgat taattaagaa ttcgacccag 1260
ctttcttgta caaagtggtt ggtaagccta tccctaaccc tctcctcggt ctcgattcta 1320
cgtagtaatg agctagcagt ctcgaggtta acgaattccg ccccccccct aacgttactg 1380
gccgaagccg cttggaataa ggccggtgtg cgcttgtcta tatgttattt tccaccatat 1440
tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 1500
ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 1560
cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 1620
ggaacccccc acctggcgac aggtgcccct gcggccaaaa gccacgtgta taagatacac 1680
ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 1740
aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 1800
gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 1860
aaaaacgtct aggccccccg aaccacgggg acgtggtttt cctttgaaaa acacgataat 1920
accatggtga gcaagggcga ggaggataac atggccatca tcaaggagtt catgcgcttc 1980
aaggtgcaca tggagggctc cgtgaacggc cacgagttcg agatcgaggg cgagggcgag 2040
ggccgcccct acgagggcac ccagaccgcc aagctgaagg tgaccaaggg tggccccctg 2100
cccttcgcct gggacatcct gtcccctcag ttcatgtacg gctccaaggc ctacgtgaag 2160
caccccgccg acatccccga ctacttgaag ctgtccttcc ccgagggctt caagtgggag 2220
cgcgtgatga acttcgagga cggcggcgtg gtgaccgtga cccaggactc ctccctgcag 2280
gacggcgagt tcatctacaa ggtgaagctg cgcggcacca acttcccctc cgacggcccc 2340
gtaatgcaga agaagaccat gggctgggag gcctcctccg agcggatgta ccccgaggac 2400
ggcgccctga agggcgagat caagcagagg ctgaagctga aggacggcgg ccactacgac 2460
gctgaggtca agaccaccta caaggccaag aagcccgtgc agctgcccgg cgcctacaac 2520
gtcaacatca agttggacat cacctcccac aacgaggact acaccatcgt ggaacagtac 2580
gaacgcgccg agggccgcca ctccaccggc ggcatggacg agctgtacaa gtaacaccgg 2640
tggcgcgtta agtcgacaat caacctctgg attacaaaat ttgtgaaaga ttgactggta 2700
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 2760
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 2820
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 2880
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 2940
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 3000
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 3060
cctttccttg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg tccttctgct 3120
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 3180
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 3240
ccccgcgtcg actttaagac caatgactta caaggcagct gtagatctta gccacttttt 3300
aaaagaaaag gggggactgg aagggctaat tcactcccaa cgaagacaag atctgctttt 3360
tgcttgtact gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact 3420
agggaaccca ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc 3480
ccgtctgttg tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa 3540
aatctctagc agtacgtata gtagttcatg tcatcttatt attcagtatt tataacttgc 3600
aaagaaatga atatcagaga gtgagaggaa cttgtttatt gcagcttata atggttacaa 3660
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3720
tggtttgtcc aaactcatca atgtatctta tcatgtctgg ctctagctat cccgccccta 3780
actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 3840
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag 3900
tagtgaggag gcttttttgg aggcctaggg acgtacccaa ttcgccctat agtgagtcgt 3960
attacgcgcg ctcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 4020
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 4080
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 4140
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 4200
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 4260
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 4320
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 4380
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4440
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4500
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4560
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 4620
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 4680
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 4740
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 4800
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 4860
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 4920
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 4980
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 5040
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 5100
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 5160
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 5220
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 5280
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 5340
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 5400
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 5460
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 5520
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 5580
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 5640
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 5700
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 5760
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 5820
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 5880
gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca agaactctgt 5940
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 6000
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 6060
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 6120
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 6180
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 6240
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 6300
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 6360
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 6420
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 6480
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 6540
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 6600
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 6660
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 6720
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 6780
acaaaagctg gagctgcaag cttaatgtag tcttatgcaa tactcttgta gtcttgcaac 6840
atggtaacga tgagttagca acatgcctta caaggagaga aaaagcaccg tgcatgccga 6900
ttggtggaag taaggtggta cgatcgtgcc ttattaggaa ggcaacagac gggtctgaca 6960
tggattggac gaaccactga attgccgcat tgcagagata ttgtatttaa gtgcctagct 7020
cgatacataa acgggtctct ctggttagac cagatctgag cctgggagct ctctggctaa 7080
ctagggaacc cactgcttaa gcctcaataa agcttgcctt gagtgcttca agtagtgtgt 7140
gcccgtctgt tgtgtgactc tggtaactag agatccctca gaccctttta gtcagtgtgg 7200
aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc gaaagggaaa ccagaggagc 7260
tctctcgacg caggactcgg cttgctgaag cgcgcacggc aagaggcgag gggcggcgac 7320
tggtgagtac gccaaaaatt ttgactagcg gaggctagaa ggagagagat gggtgcgaga 7380
gcgtcagtat taagcggggg agaattagat cgcgatggga aaaaattcgg ttaaggccag 7440
ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag ctagaacgat 7500
tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata ctgggacagc 7560
tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat acagtagcaa 7620
ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct ttagacaaga 7680
tagaggaaga gcaaaacaaa agtaagacca ccgcacagca agcggccgct gatcttcaga 7740
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 7800
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagagagaa 7860
aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc aggaagcact 7920
atgggcgcag cgtcaatgac gctgacggta caggccagac aattattgtc tggtatagtg 7980
cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt gcaactcaca 8040
gtctggggca tcaagcagct ccaggcaaga atcctggctg tggaaagata cctaaaggat 8100
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 8160
tggaatgcta gttggagtaa taaatctctg gaacagattt ggaatcacac gacctggatg 8220
gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat tgaagaatcg 8280
caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 8340
tggaattggt ttaacataac aaattggctg tggtatataa aattattcat aatgatagta 8400
ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa tagagttagg 8460
cagggatatt caccattatc gtttcagacc cacctcccaa ccccgagggg acccttgcgc 8520
cttttccaag gcagccctgg gtttgcgcag ggacgcggct gctctgggcg tggttccggg 8580
aaacgcagcg gcgccgaccc tgggtctcgc acattcttca cgtccgttcg cagcgtcacc 8640
cggatcttcg ccgctaccct tgtgggcccc ccggcgacgc ttcctgctcc gcccctaagt 8700
cgggaaggtt ccttgcggtt cgcggcgtgc cggacgtgac aaacggaagc cgcacgtctc 8760
actagtaccc tcgcagacgg acagcgccag ggagcaatgg cagcgcgccg accgcgatgg 8820
gctgtggcca atagcggctg ctcagcaggg cgcgccgaga gcagcggccg ggaaggggcg 8880
gtgcgggagg cggggtgtgg ggcggtagtg tgggccctgt tcctgcccgc gcggtgttcc 8940
gcattctgca agcctccgga gcgcacgtcg gcagtcggct ccctcgttga ccgaatcacc 9000
gacctctctc cccagggggt acccagctgt ctagagaatt ctagatcttg agacaaatgg 9060
cagtattcat ccacaatttt aaaagaaaag gggggattgg ggggtacagt gcaggggaaa 9120
gaatagtaga cataatagca acagacatac aaactaaaga attacaaaaa caaattacaa 9180
aaattcaaaa ttttcgggtt tattacaggg acagcagaga tccactttgg cgccggctcg 9240
aggggg 9246
<210> 193
<211> 9681
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 193
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc tgaaaccgaa 960
ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc ggccgatctt 1020
agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata cactacatgg 1080
cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac tgtgatggac 1140
gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg ggccgaggac 1200
tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt cctgacggac 1260
aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga ttcccaatac 1320
gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca gcagacgcgc 1380
tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc gtatatgctc 1440
cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga tgatgcagct 1500
tgggcgcagg gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt cgggcgtaca 1560
caaatcgccc gcagaagcgc ggccgtctgg accgatggct gtgtagaagt actcgccgat 1620
agtggaaacc gacgccccag cactcgtccg agggcaaagg aatagttaat taagaattcg 1680
acccagcttt cttgtacaaa gtggttggta agcctatccc taaccctctc ctcggtctcg 1740
attctacgta gtaatgagct agcagtctcg aggttaacga attccgcccc ccccctaacg 1800
ttactggccg aagccgcttg gaataaggcc ggtgtgcgct tgtctatatg ttattttcca 1860
ccatattgcc gtcttttggc aatgtgaggg cccggaaacc tggccctgtc ttcttgacga 1920
gcattcctag gggtctttcc cctctcgcca aaggaatgca aggtctgttg aatgtcgtga 1980
aggaagcagt tcctctggaa gcttcttgaa gacaaacaac gtctgtagcg accctttgca 2040
ggcagcggaa ccccccacct ggcgacaggt gcccctgcgg ccaaaagcca cgtgtataag 2100
atacacctgc aaaggcggca caaccccagt gccacgttgt gagttggata gttgtggaaa 2160
gagtcaaatg gctctcctca agcgtattca acaaggggct gaaggatgcc cagaaggtac 2220
cccattgtat gggatctgat ctggggcctc ggtgcacatg ctttacatgt gtttagtcga 2280
ggttaaaaaa acgtctaggc cccccgaacc acggggacgt ggttttcctt tgaaaaacac 2340
gataatacca tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc catcctggtc 2400
gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat 2460
gccacctacg gcaagctgac cctgaagttc atctgcacca ccggcaagct gcccgtgccc 2520
tggcccaccc tcgtgaccac cctgacctac ggcgtgcagt gcttcagccg ctaccccgac 2580
cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc 2640
accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa gttcgagggc 2700
gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga cggcaacatc 2760
ctggggcaca agctggagta caactacaac agccacaacg tctatatcat ggccgacaag 2820
cagaagaacg gcatcaaggt gaacttcaag atccgccaca acatcgagga cggcagcgtg 2880
cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt gctgctgccc 2940
gacaaccact acctgagcac ccagtccgcc ctgagcaaag accccaacga gaagcgcgat 3000
cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat ggacgagctg 3060
tacaagtaac accggtggcg cgttaagtcg acaatcaacc tctggattac aaaatttgtg 3120
aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt 3180
taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata 3240
aatcctggtt gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg 3300
tgtgcactgt gtttgctgac gcaaccccca ctggttgggg cattgccacc acctgtcagc 3360
tcctttccgg gactttcgct ttccccctcc ctattgccac ggcggaactc atcgccgcct 3420
gccttgcccg ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgttgt 3480
cggggaaatc atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg 3540
ggacgtcctt ctgctacgtc ccttcggccc tcaatccagc ggaccttcct tcccgcggcc 3600
tgctgccggc tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg agtcggatct 3660
ccctttgggc cgcctccccg cgtcgacttt aagaccaatg acttacaagg cagctgtaga 3720
tcttagccac tttttaaaag aaaagggggg actggaaggg ctaattcact cccaacgaag 3780
acaagatctg ctttttgctt gtactgggtc tctctggtta gaccagatct gagcctggga 3840
gctctctggc taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct 3900
tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt 3960
ttagtcagtg tggaaaatct ctagcagtac gtatagtagt tcatgtcatc ttattattca 4020
gtatttataa cttgcaaaga aatgaatatc agagagtgag aggaacttgt ttattgcagc 4080
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 4140
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggctcta 4200
gctatcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct 4260
ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc ctcggcctct 4320
gagctattcc agaagtagtg aggaggcttt tttggaggcc tagggacgta cccaattcgc 4380
cctatagtga gtcgtattac gcgcgctcac tggccgtcgt tttacaacgt cgtgactggg 4440
aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc 4500
gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg 4560
aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 4620
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 4680
tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc 4740
gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 4800
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 4860
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 4920
atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 4980
aatttaacgc gaattttaac aaaatattaa cgcttacaat ttaggtggca cttttcgggg 5040
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct 5100
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat 5160
tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc 5220
tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg 5280
ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg 5340
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga 5400
cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta 5460
ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc 5520
tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc 5580
gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg 5640
ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc 5700
aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca 5760
acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct 5820
tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat 5880
cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg 5940
gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat 6000
taagcattgg taactgtcag accaagttta ctcatatata ctttagattg atttaaaact 6060
tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat 6120
cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc 6180
ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct 6240
accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg 6300
cttcagcaga gcgcagatac caaatactgt tcttctagtg tagccgtagt taggccacca 6360
cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc 6420
tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga 6480
taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac 6540
gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga 6600
agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag 6660
ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg 6720
acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag 6780
caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc 6840
tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc 6900
tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc 6960
aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag 7020
gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca 7080
ttaggcaccc caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag 7140
cggataacaa tttcacacag gaaacagcta tgaccatgat tacgccaagc gcgcaattaa 7200
ccctcactaa agggaacaaa agctggagct gcaagcttaa tgtagtctta tgcaatactc 7260
ttgtagtctt gcaacatggt aacgatgagt tagcaacatg ccttacaagg agagaaaaag 7320
caccgtgcat gccgattggt ggaagtaagg tggtacgatc gtgccttatt aggaaggcaa 7380
cagacgggtc tgacatggat tggacgaacc actgaattgc cgcattgcag agatattgta 7440
tttaagtgcc tagctcgata cataaacggg tctctctggt tagaccagat ctgagcctgg 7500
gagctctctg gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg 7560
cttcaagtag tgtgtgcccg tctgttgtgt gactctggta actagagatc cctcagaccc 7620
ttttagtcag tgtggaaaat ctctagcagt ggcgcccgaa cagggacttg aaagcgaaag 7680
ggaaaccaga ggagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 7740
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 7800
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 7860
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 7920
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 7980
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 8040
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 8100
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 8160
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 8220
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 8280
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 8340
gcagcaggaa gcactatggg cgcagcgtca atgacgctga cggtacaggc cagacaatta 8400
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 8460
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 8520
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 8580
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 8640
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 8700
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 8760
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 8820
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 8880
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 8940
aggggaccct tgcgcctttt ccaaggcagc cctgggtttg cgcagggacg cggctgctct 9000
gggcgtggtt ccgggaaacg cagcggcgcc gaccctgggt ctcgcacatt cttcacgtcc 9060
gttcgcagcg tcacccggat cttcgccgct acccttgtgg gccccccggc gacgcttcct 9120
gctccgcccc taagtcggga aggttccttg cggttcgcgg cgtgccggac gtgacaaacg 9180
gaagccgcac gtctcactag taccctcgca gacggacagc gccagggagc aatggcagcg 9240
cgccgaccgc gatgggctgt ggccaatagc ggctgctcag cagggcgcgc cgagagcagc 9300
ggccgggaag gggcggtgcg ggaggcgggg tgtggggcgg tagtgtgggc cctgttcctg 9360
cccgcgcggt gttccgcatt ctgcaagcct ccggagcgca cgtcggcagt cggctccctc 9420
gttgaccgaa tcaccgacct ctctccccag ggggtaccca gctgtctaga gaattctaga 9480
tcttgagaca aatggcagta ttcatccaca attttaaaag aaaagggggg attggggggt 9540
acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact aaagaattac 9600
aaaaacaaat tacaaaaatt caaaattttc gggtttatta cagggacagc agagatccac 9660
tttggcgccg gctcgagggg g 9681
<210> 194
<211> 9771
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 194
ctcgaggggg cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca 60
tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac 120
gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact 180
ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa 240
gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg 300
cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta 360
gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg 420
tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg 480
caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg 540
ggcggtaggc gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg 600
atcaacaagt ttgtacaaaa aagcaggctc cgaattcgcc cttgccgcca ccatgccaaa 660
aaaaaaacgc aaagtggtga gcaagggcga ggagctgttc accggggtgg tgcccatcct 720
ggtcgagctg gacggcgacg taaacggcca caagttcagc gtgtccggcg agggcgaggg 780
cgatgccacc tacggcaagc tgaccctgaa gttcatctgc accaccggca agctgcccgt 840
gccctggccc accctcgtga ccaccctgac ctacggcgtg cagtgcttca gccgctaccc 900
cgaccacatg aagcagcacg acttcttcaa gtccgccatg cccgaaggct acgtccagga 960
gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc cgcgccgagg tgaagttcga 1020
gggcgacacc ctggtgaacc gcatcgagct gaagggcatc gacttcaagg aggacggcaa 1080
catcctgggg cacaagctgg agtacaacta caacagccac aacgtctata tcatggccga 1140
caagcagaag aacggcatca aggtgaactt caagatccgc cacaacatcg aggacggcag 1200
cgtgcagctc gccgaccact accagcagaa cacccccatc ggcgacggcc ccgtgctgct 1260
gcccgacaac cactacctga gcacccagtc cgccctgagc aaagacccca acgagaagcg 1320
cgatcacatg gtcctgctgg agttcgtgac cgccgccggg atcactctcg gcatggacga 1380
gctgtacaag taaaagggcg aattcgaccc agctttcttg tacaaagtgg ttggtaagcc 1440
tatccctaac cctctcctcg gtctcgattc tacgtagtaa tgagctagca gtctcgaggt 1500
taacgaattc cgcccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt 1560
gtgcgtttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc 1620
ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag 1680
gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac 1740
aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc 1800
tctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc 1860
acgttgtgag ttggatagtt gtggaaagag tcaaatggct ctcctcaagc gtattcaaca 1920
aggggctgaa ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt 1980
gcacatgctt tacatgtgtt tagtcgaggt taaaaaaacg tctaggcccc ccgaaccacg 2040
gggacgtggt tttcctttga aaaacacgat gataatatgg ccacaaccgc caccatgaaa 2100
aagcctgaac tcaccgcgac gtctgtcgag aagtttctga tcgaaaagtt cgacagcgtc 2160
tccgacctga tgcagctctc ggagggcgaa gaatctcgtg ctttcagctt cgatgtagga 2220
gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg gtttctacaa agatcgttat 2280
gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga cattggggaa 2340
tttagcgaga gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac gttgcaagac 2400
ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg cggaggccat ggatgcgatc 2460
gctgcggccg atcttagcca gacgagcggg ttcggcccat tcggaccgca aggaatcggt 2520
caatacacta catggcgtga tttcatatgc gcgattgctg atccccatgt gtatcactgg 2580
caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga tgagctgatg 2640
ctttgggccg aggactgccc cgaagtccgg cacctcgtgc acgcggattt cggctccaac 2700
aatgtcctga cggacaatgg ccgcataaca gcggtcattg actggagcga ggcgatgttc 2760
ggggattccc aatacgaggt cgccaacatc ttcttctgga ggccgtggtt ggcttgtatg 2820
gagcagcaga cgcgctactt cgagcggagg catccggagc ttgcaggatc gccgcggctc 2880
cgggcgtata tgctccgcat tggtcttgac caactctatc agagcttggt tgacggcaat 2940
ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc cggagccggg 3000
actgtcgggc gtacacaaat cgcccgcaga agcgcggccg tctggaccga tggctgtgta 3060
gaagtactcg ccgatagtgg aaaccgacgc cccagcactc gtccgagggc aaaggaatag 3120
acgcgtaccg gttagcaccg gtgtgatcag ggtcagacag ctgcctgcag gccggtggcg 3180
cgttaagtcg acaatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt 3240
aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct 3300
attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt 3360
tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac 3420
gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct 3480
ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca 3540
ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt 3600
ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc 3660
ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct 3720
cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg 3780
cgtcgacttt aagaccaatg acttacaagg cagctgtaga tcttagccac tttttaaaag 3840
aaaagggggg actggaaggg ctaattcact cccaacgaag acaagatctg ctttttgctt 3900
gtactgggtc tctctggtta gaccagatct gagcctggga gctctctggc taactaggga 3960
acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc 4020
tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg tggaaaatct 4080
ctagcagtac gtatagtagt tcatgtcatc ttattattca gtatttataa cttgcaaaga 4140
aatgaatatc agagagtgag aggaacttgt ttattgcagc ttataatggt tacaaataaa 4200
gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt 4260
tgtccaaact catcaatgta tcttatcatg tctggctcta gctatcccgc ccctaactcc 4320
gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg gctgactaat 4380
tttttttatt tatgcagagg ccgaggccgc ctcggcctct gagctattcc agaagtagtg 4440
aggaggcttt tttggaggcc tagggacgta cccaattcgc cctatagtga gtcgtattac 4500
gcgcgctcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 4560
cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 4620
accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc gccctgtagc 4680
ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc 4740
gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt 4800
ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc tttacggcac 4860
ctcgacccca aaaaacttga ttagggtgat ggttcacgta gtgggccatc gccctgatag 4920
acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact cttgttccaa 4980
actggaacaa cactcaaccc tatctcggtc tattcttttg atttataagg gattttgccg 5040
atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac 5100
aaaatattaa cgcttacaat ttaggtggca cttttcgggg aaatgtgcgc ggaaccccta 5160
tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat 5220
aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc 5280
ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga 5340
aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca 5400
acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt 5460
ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg 5520
gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc 5580
atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata 5640
acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt 5700
tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag 5760
ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca 5820
aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactggatgg 5880
aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg 5940
ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag 6000
atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg 6060
aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag 6120
accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga 6180
tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt 6240
tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc 6300
tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc 6360
cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac 6420
caaatactgt tcttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac 6480
cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt 6540
cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct 6600
gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat 6660
acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt 6720
atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg 6780
cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt 6840
gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt 6900
tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg 6960
tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg 7020
agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc 7080
ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg 7140
gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc caggctttac 7200
actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa tttcacacag 7260
gaaacagcta tgaccatgat tacgccaagc gcgcaattaa ccctcactaa agggaacaaa 7320
agctggagct gcaagcttaa tgtagtctta tgcaatactc ttgtagtctt gcaacatggt 7380
aacgatgagt tagcaacatg ccttacaagg agagaaaaag caccgtgcat gccgattggt 7440
ggaagtaagg tggtacgatc gtgccttatt aggaaggcaa cagacgggtc tgacatggat 7500
tggacgaacc actgaattgc cgcattgcag agatattgta tttaagtgcc tagctcgata 7560
cataaacggg tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg 7620
gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg 7680
tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat 7740
ctctagcagt ggcgcccgaa cagggacttg aaagcgaaag ggaaaccaga ggagctctct 7800
cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag gcgaggggcg gcgactggtg 7860
agtacgccaa aaattttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc 7920
agtattaagc gggggagaat tagatcgcga tgggaaaaaa ttcggttaag gccaggggga 7980
aagaaaaaat ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca 8040
gttaatcctg gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa 8100
ccatcccttc agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc 8160
tattgtgtgc atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag 8220
gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg ccgctgatct tcagacctgg 8280
aggaggagat atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat 8340
tgaaccatta ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga gagaaaaaag 8400
agcagtggga ataggagctt tgttccttgg gttcttggga gcagcaggaa gcactatggg 8460
cgcagcgtca atgacgctga cggtacaggc cagacaatta ttgtctggta tagtgcagca 8520
gcagaacaat ttgctgaggg ctattgaggc gcaacagcat ctgttgcaac tcacagtctg 8580
gggcatcaag cagctccagg caagaatcct ggctgtggaa agatacctaa aggatcaaca 8640
gctcctgggg atttggggtt gctctggaaa actcatttgc accactgctg tgccttggaa 8700
tgctagttgg agtaataaat ctctggaaca gatttggaat cacacgacct ggatggagtg 8760
ggacagagaa attaacaatt acacaagctt aatacactcc ttaattgaag aatcgcaaaa 8820
ccagcaagaa aagaatgaac aagaattatt ggaattagat aaatgggcaa gtttgtggaa 8880
ttggtttaac ataacaaatt ggctgtggta tataaaatta ttcataatga tagtaggagg 8940
cttggtaggt ttaagaatag tttttgctgt actttctata gtgaatagag ttaggcaggg 9000
atattcacca ttatcgtttc agacccacct cccaaccccg aggggaccct tgcgcctttt 9060
ccaaggcagc cctgggtttg cgcagggacg cggctgctct gggcgtggtt ccgggaaacg 9120
cagcggcgcc gaccctgggt ctcgcacatt cttcacgtcc gttcgcagcg tcacccggat 9180
cttcgccgct acccttgtgg gccccccggc gacgcttcct gctccgcccc taagtcggga 9240
aggttccttg cggttcgcgg cgtgccggac gtgacaaacg gaagccgcac gtctcactag 9300
taccctcgca gacggacagc gccagggagc aatggcagcg cgccgaccgc gatgggctgt 9360
ggccaatagc ggctgctcag cagggcgcgc cgagagcagc ggccgggaag gggcggtgcg 9420
ggaggcgggg tgtggggcgg tagtgtgggc cctgttcctg cccgcgcggt gttccgcatt 9480
ctgcaagcct ccggagcgca cgtcggcagt cggctccctc gttgaccgaa tcaccgacct 9540
ctctccccag ggggtaccca gctgtctaga gaattctaga tcttgagaca aatggcagta 9600
ttcatccaca attttaaaag aaaagggggg attggggggt acagtgcagg ggaaagaata 9660
gtagacataa tagcaacaga catacaaact aaagaattac aaaaacaaat tacaaaaatt 9720
caaaattttc gggtttatta cagggacagc agagatccac tttggcgccg g 9771
<210> 195
<211> 9841
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 195
ctcgaggggg cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca 60
tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac 120
gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact 180
ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa 240
gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg 300
cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta 360
gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg 420
tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg 480
caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg 540
ggcggtaggc gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg 600
atcaacaagt ttgtacaaaa aagcaggctc cgaattcacc ggtctagcgc taccggactc 660
agatctcgag ctcaagcttc gaattccacc atgggcgtgg ccgacctgat caagaagttc 720
gagagcatca gcaaggagga gggggatcca ccggtcgcca ccatggtgag caagggcgag 780
gcagtgatca aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac 840
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 900
ctgaaggtga ccaagggtgg ccccctgccc ttctcctggg acatcctgtc ccctcagttc 960
atgtacggct ccagggcctt caccaagcac cccgccgaca tccccgacta ctataagcag 1020
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgccgtg 1080
accgtgaccc aggacacctc cctggaggac ggcaccctga tctacaaggt gaagctccgc 1140
ggcaccaact tccctcctga cggccccgta atgcagaaga agacaatggg ctgggaagcg 1200
tccaccgagc ggttgtaccc cgaggacggc gtgctgaagg gcgacattaa gatggccctg 1260
cgcctgaagg acggcggccg ctacctggcg gacttcaaga ccacctacaa ggccaagaag 1320
cccgtgcaga tgcccggcgc ctacaacgtc gaccgcaagt tggacatcac ctcccacaac 1380
gaggactaca ccgtggtgga acagtacgaa cgctccgagg gccgccactc caccggcggc 1440
atggacgagc tgtacaagta attaattaag aattcgaccc agctttcttg tacaaagtgg 1500
ttggtaagcc tatccctaac cctctcctcg gtctcgattc tacgtagtaa tgagctagca 1560
gtctcgaggt taacgaattc cgcccccccc cctaacgtta ctggccgaag ccgcttggaa 1620
taaggccggt gtgcgtttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat 1680
gtgagggccc ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg tctttcccct 1740
ctcgccaaag gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct 1800
tcttgaagac aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc 1860
gacaggtgcc tctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa 1920
ccccagtgcc acgttgtgag ttggatagtt gtggaaagag tcaaatggct ctcctcaagc 1980
gtattcaaca aggggctgaa ggatgcccag aaggtacccc attgtatggg atctgatctg 2040
gggcctcggt gcacatgctt tacatgtgtt tagtcgaggt taaaaaaacg tctaggcccc 2100
ccgaaccacg gggacgtggt tttcctttga aaaacacgat gataatatgg ccacaaccgc 2160
caccatgaaa aagcctgaac tcaccgcgac gtctgtcgag aagtttctga tcgaaaagtt 2220
cgacagcgtc tccgacctga tgcagctctc ggagggcgaa gaatctcgtg ctttcagctt 2280
cgatgtagga gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg gtttctacaa 2340
agatcgttat gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga 2400
cattggggaa tttagcgaga gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac 2460
gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg cggaggccat 2520
ggatgcgatc gctgcggccg atcttagcca gacgagcggg ttcggcccat tcggaccgca 2580
aggaatcggt caatacacta catggcgtga tttcatatgc gcgattgctg atccccatgt 2640
gtatcactgg caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga 2700
tgagctgatg ctttgggccg aggactgccc cgaagtccgg cacctcgtgc acgcggattt 2760
cggctccaac aatgtcctga cggacaatgg ccgcataaca gcggtcattg actggagcga 2820
ggcgatgttc ggggattccc aatacgaggt cgccaacatc ttcttctgga ggccgtggtt 2880
ggcttgtatg gagcagcaga cgcgctactt cgagcggagg catccggagc ttgcaggatc 2940
gccgcggctc cgggcgtata tgctccgcat tggtcttgac caactctatc agagcttggt 3000
tgacggcaat ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc 3060
cggagccggg actgtcgggc gtacacaaat cgcccgcaga agcgcggccg tctggaccga 3120
tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc cccagcactc gtccgagggc 3180
aaaggaatag acgcgtaccg gttagcaccg gtgtgatcag ggtcagacag ctgcctgcag 3240
gccggtggcg cgttaagtcg acaatcaacc tctggattac aaaatttgtg aaagattgac 3300
tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt 3360
gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt 3420
gctgtctctt tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt 3480
gtttgctgac gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg 3540
gactttcgct ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg 3600
ctgctggaca ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc 3660
atcgtccttt ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt 3720
ctgctacgtc ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc 3780
tctgcggcct cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc 3840
cgcctccccg cgtcgacttt aagaccaatg acttacaagg cagctgtaga tcttagccac 3900
tttttaaaag aaaagggggg actggaaggg ctaattcact cccaacgaag acaagatctg 3960
ctttttgctt gtactgggtc tctctggtta gaccagatct gagcctggga gctctctggc 4020
taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg 4080
tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg 4140
tggaaaatct ctagcagtac gtatagtagt tcatgtcatc ttattattca gtatttataa 4200
cttgcaaaga aatgaatatc agagagtgag aggaacttgt ttattgcagc ttataatggt 4260
tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct 4320
agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggctcta gctatcccgc 4380
ccctaactcc gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg 4440
gctgactaat tttttttatt tatgcagagg ccgaggccgc ctcggcctct gagctattcc 4500
agaagtagtg aggaggcttt tttggaggcc tagggacgta cccaattcgc cctatagtga 4560
gtcgtattac gcgcgctcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg 4620
cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga 4680
agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc 4740
gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac 4800
acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt 4860
cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 4920
tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta gtgggccatc 4980
gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact 5040
cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg atttataagg 5100
gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc 5160
gaattttaac aaaatattaa cgcttacaat ttaggtggca cttttcgggg aaatgtgcgc 5220
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 5280
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 5340
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 5400
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 5460
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 5520
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 5580
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 5640
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 5700
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 5760
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 5820
ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca 5880
acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata 5940
gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc 6000
tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca 6060
ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca 6120
actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg 6180
taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa 6240
tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt 6300
gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat 6360
cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 6420
gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga 6480
gcgcagatac caaatactgt tcttctagtg tagccgtagt taggccacca cttcaagaac 6540
tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt 6600
ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag 6660
cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc 6720
gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag 6780
gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca 6840
gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt 6900
cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc 6960
tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc 7020
cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc 7080
cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa 7140
ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 7200
tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc 7260
caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa 7320
tttcacacag gaaacagcta tgaccatgat tacgccaagc gcgcaattaa ccctcactaa 7380
agggaacaaa agctggagct gcaagcttaa tgtagtctta tgcaatactc ttgtagtctt 7440
gcaacatggt aacgatgagt tagcaacatg ccttacaagg agagaaaaag caccgtgcat 7500
gccgattggt ggaagtaagg tggtacgatc gtgccttatt aggaaggcaa cagacgggtc 7560
tgacatggat tggacgaacc actgaattgc cgcattgcag agatattgta tttaagtgcc 7620
tagctcgata cataaacggg tctctctggt tagaccagat ctgagcctgg gagctctctg 7680
gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag 7740
tgtgtgcccg tctgttgtgt gactctggta actagagatc cctcagaccc ttttagtcag 7800
tgtggaaaat ctctagcagt ggcgcccgaa cagggacttg aaagcgaaag ggaaaccaga 7860
ggagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag gcgaggggcg 7920
gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga gagatgggtg 7980
cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa ttcggttaag 8040
gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca gggagctaga 8100
acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac aaatactggg 8160
acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat ataatacagt 8220
agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg aagctttaga 8280
caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg ccgctgatct 8340
tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat aaatataaag 8400
tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga gtggtgcaga 8460
gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga gcagcaggaa 8520
gcactatggg cgcagcgtca atgacgctga cggtacaggc cagacaatta ttgtctggta 8580
tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat ctgttgcaac 8640
tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa agatacctaa 8700
aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc accactgctg 8760
tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat cacacgacct 8820
ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc ttaattgaag 8880
aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat aaatgggcaa 8940
gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta ttcataatga 9000
tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata gtgaatagag 9060
ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg aggggaccct 9120
tgcgcctttt ccaaggcagc cctgggtttg cgcagggacg cggctgctct gggcgtggtt 9180
ccgggaaacg cagcggcgcc gaccctgggt ctcgcacatt cttcacgtcc gttcgcagcg 9240
tcacccggat cttcgccgct acccttgtgg gccccccggc gacgcttcct gctccgcccc 9300
taagtcggga aggttccttg cggttcgcgg cgtgccggac gtgacaaacg gaagccgcac 9360
gtctcactag taccctcgca gacggacagc gccagggagc aatggcagcg cgccgaccgc 9420
gatgggctgt ggccaatagc ggctgctcag cagggcgcgc cgagagcagc ggccgggaag 9480
gggcggtgcg ggaggcgggg tgtggggcgg tagtgtgggc cctgttcctg cccgcgcggt 9540
gttccgcatt ctgcaagcct ccggagcgca cgtcggcagt cggctccctc gttgaccgaa 9600
tcaccgacct ctctccccag ggggtaccca gctgtctaga gaattctaga tcttgagaca 9660
aatggcagta ttcatccaca attttaaaag aaaagggggg attggggggt acagtgcagg 9720
ggaaagaata gtagacataa tagcaacaga catacaaact aaagaattac aaaaacaaat 9780
tacaaaaatt caaaattttc gggtttatta cagggacagc agagatccac tttggcgccg 9840
g 9841
<210> 196
<211> 9259
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 196
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcgcc cttgccgcca ccatgccaaa aaaaaaacgc 660
aaagtggtga gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg 720
gacggcgacg taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc 780
tacggcaagc tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc 840
accctcgtga ccaccctgac ctacggcgtg cagtgcttca gccgctaccc cgaccacatg 900
aagcagcacg acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc 960
ttcttcaagg acgacggcaa ctacaagacc cgcgccgagg tgaagttcga gggcgacacc 1020
ctggtgaacc gcatcgagct gaagggcatc gacttcaagg aggacggcaa catcctgggg 1080
cacaagctgg agtacaacta caacagccac aacgtctata tcatggccga caagcagaag 1140
aacggcatca aggtgaactt caagatccgc cacaacatcg aggacggcag cgtgcagctc 1200
gccgaccact accagcagaa cacccccatc ggcgacggcc ccgtgctgct gcccgacaac 1260
cactacctga gcacccagtc cgccctgagc aaagacccca acgagaagcg cgatcacatg 1320
gtcctgctgg agttcgtgac cgccgccggg atcactctcg gcatggacga gctgtacaag 1380
taaaagggcg aattcgaccc agctttcttg tacaaagtgg ttggtaagcc tatccctaac 1440
cctctcctcg gtctcgattc tacgtagtaa tgagctagca gtctcgaggt taacgaattc 1500
cgcccccccc ctaacgttac tggccgaagc cgcttggaat aaggccggtg tgcgcttgtc 1560
tatatgttat tttccaccat attgccgtct tttggcaatg tgagggcccg gaaacctggc 1620
cctgtcttct tgacgagcat tcctaggggt ctttcccctc tcgccaaagg aatgcaaggt 1680
ctgttgaatg tcgtgaagga agcagttcct ctggaagctt cttgaagaca aacaacgtct 1740
gtagcgaccc tttgcaggca gcggaacccc ccacctggcg acaggtgccc ctgcggccaa 1800
aagccacgtg tataagatac acctgcaaag gcggcacaac cccagtgcca cgttgtgagt 1860
tggatagttg tggaaagagt caaatggctc tcctcaagcg tattcaacaa ggggctgaag 1920
gatgcccaga aggtacccca ttgtatggga tctgatctgg ggcctcggtg cacatgcttt 1980
acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc cgaaccacgg ggacgtggtt 2040
ttcctttgaa aaacacgata ataccatggc catgaaaaag cctgaactca ccgcgacgtc 2100
tgtcgagaag tttctgatcg aaaagttcga cagcgtctcc gacctgatgc agctctcgga 2160
gggcgaagaa tctcgtgctt tcagcttcga tgtaggaggg cgtggatatg tcctgcgggt 2220
aaatagctgc gccgatggtt tctacaaaga tcgttatgtt tatcggcact ttgcatcggc 2280
cgcgctcccg attccggaag tgcttgacat tggggaattt agcgagagcc tgacctattg 2340
cctttcatac gagaccgaga tcctgactgt cgagtacgga ttgcttccta tcggcaaaat 2400
cgtggagaag aggattgaat gtaccgtcta ttcagtcgat aataatggga acatctacac 2460
acagcccgtg gctcaatggc acgacagagg agagcaggaa gtttttgaat actgtctcga 2520
ggacggatcc ctcatccgcg ctactaaaga tcataagttt atgaccgtgg acggccagat 2580
gctgccaatt gacgaaattt ttgaacgaga gctggatctg atgagagtcg acaaccttcc 2640
aaactgacac cggtggcgcg ttaagtcgac aatcaacctc tggattacaa aatttgtgaa 2700
agattgactg gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta 2760
atgcctttgt atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa 2820
tcctggttgc tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg 2880
tgcactgtgt ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc 2940
ctttccggga ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc 3000
cttgcccgct gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg 3060
gggaaatcat cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg 3120
acgtccttct gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg 3180
ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc 3240
ctttgggccg cctccccgcg tcgactttaa gaccaatgac ttacaaggca gctgtagatc 3300
ttagccactt tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac 3360
aagatctgct ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc 3420
tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc 3480
aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt 3540
agtcagtgtg gaaaatctct agcagtacgt atagtagttc atgtcatctt attattcagt 3600
atttataact tgcaaagaaa tgaatatcag agagtgagag gaacttgttt attgcagctt 3660
ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 3720
tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggctctagc 3780
tatcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg cccattctcc 3840
gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct cggcctctga 3900
gctattccag aagtagtgag gaggcttttt tggaggccta gggacgtacc caattcgccc 3960
tatagtgagt cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa 4020
aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 4080
aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 4140
tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 4200
accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 4260
gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 4320
tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 4380
gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 4440
agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 4500
ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 4560
tttaacgcga attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa 4620
atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 4680
tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 4740
aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 4800
acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 4860
acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 4920
ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 4980
ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 5040
caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 5100
ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 5160
aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 5220
aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 5280
tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 5340
aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 5400
cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 5460
ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 5520
gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 5580
agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 5640
atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 5700
cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 5760
cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 5820
cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 5880
tcagcagagc gcagatacca aatactgttc ttctagtgta gccgtagtta ggccaccact 5940
tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 6000
ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 6060
aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 6120
cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag 6180
ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 6240
agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 6300
ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 6360
acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 6420
cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 6480
gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 6540
tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 6600
ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 6660
aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg 6720
gataacaatt tcacacagga aacagctatg accatgatta cgccaagcgc gcaattaacc 6780
ctcactaaag ggaacaaaag ctggagctgc aagcttaatg tagtcttatg caatactctt 6840
gtagtcttgc aacatggtaa cgatgagtta gcaacatgcc ttacaaggag agaaaaagca 6900
ccgtgcatgc cgattggtgg aagtaaggtg gtacgatcgt gccttattag gaaggcaaca 6960
gacgggtctg acatggattg gacgaaccac tgaattgccg cattgcagag atattgtatt 7020
taagtgccta gctcgataca taaacgggtc tctctggtta gaccagatct gagcctggga 7080
gctctctggc taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct 7140
tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt 7200
ttagtcagtg tggaaaatct ctagcagtgg cgcccgaaca gggacttgaa agcgaaaggg 7260
aaaccagagg agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc 7320
gaggggcggc gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga 7380
gatgggtgcg agagcgtcag tattaagcgg gggagaatta gatcgcgatg ggaaaaaatt 7440
cggttaaggc cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg 7500
gagctagaac gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa 7560
atactgggac agctacaacc atcccttcag acaggatcag aagaacttag atcattatat 7620
aatacagtag caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa 7680
gctttagaca agatagagga agagcaaaac aaaagtaaga ccaccgcaca gcaagcggcc 7740
gctgatcttc agacctggag gaggagatat gagggacaat tggagaagtg aattatataa 7800
atataaagta gtaaaaattg aaccattagg agtagcaccc accaaggcaa agagaagagt 7860
ggtgcagaga gaaaaaagag cagtgggaat aggagctttg ttccttgggt tcttgggagc 7920
agcaggaagc actatgggcg cagcgtcaat gacgctgacg gtacaggcca gacaattatt 7980
gtctggtata gtgcagcagc agaacaattt gctgagggct attgaggcgc aacagcatct 8040
gttgcaactc acagtctggg gcatcaagca gctccaggca agaatcctgg ctgtggaaag 8100
atacctaaag gatcaacagc tcctggggat ttggggttgc tctggaaaac tcatttgcac 8160
cactgctgtg ccttggaatg ctagttggag taataaatct ctggaacaga tttggaatca 8220
cacgacctgg atggagtggg acagagaaat taacaattac acaagcttaa tacactcctt 8280
aattgaagaa tcgcaaaacc agcaagaaaa gaatgaacaa gaattattgg aattagataa 8340
atgggcaagt ttgtggaatt ggtttaacat aacaaattgg ctgtggtata taaaattatt 8400
cataatgata gtaggaggct tggtaggttt aagaatagtt tttgctgtac tttctatagt 8460
gaatagagtt aggcagggat attcaccatt atcgtttcag acccacctcc caaccccgag 8520
gggacccttg cgccttttcc aaggcagccc tgggtttgcg cagggacgcg gctgctctgg 8580
gcgtggttcc gggaaacgca gcggcgccga ccctgggtct cgcacattct tcacgtccgt 8640
tcgcagcgtc acccggatct tcgccgctac ccttgtgggc cccccggcga cgcttcctgc 8700
tccgccccta agtcgggaag gttccttgcg gttcgcggcg tgccggacgt gacaaacgga 8760
agccgcacgt ctcactagta ccctcgcaga cggacagcgc cagggagcaa tggcagcgcg 8820
ccgaccgcga tgggctgtgg ccaatagcgg ctgctcagca gggcgcgccg agagcagcgg 8880
ccgggaaggg gcggtgcggg aggcggggtg tggggcggta gtgtgggccc tgttcctgcc 8940
cgcgcggtgt tccgcattct gcaagcctcc ggagcgcacg tcggcagtcg gctccctcgt 9000
tgaccgaatc accgacctct ctccccaggg ggtacccagc tgtctagaga attctagatc 9060
ttgagacaaa tggcagtatt catccacaat tttaaaagaa aaggggggat tggggggtac 9120
agtgcagggg aaagaatagt agacataata gcaacagaca tacaaactaa agaattacaa 9180
aaacaaatta caaaaattca aaattttcgg gtttattaca gggacagcag agatccactt 9240
tggcgccggc tcgaggggg 9259
<210> 197
<211> 9620
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 197
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtctagcgc taccggactc agatctcgag 660
ctcaagcttc gaattccacc atgggcgtgg ccgacctgat caagaagttc gagagcatca 720
gcaaggagga gggggatcca ccggtcgcca ccatggtgag caagggcgag gcagtgatca 780
aggagttcat gcggttcaag gtgcacatgg agggctccat gaacggccac gagttcgaga 840
tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag ctgaaggtga 900
ccaagggtgg ccccctgccc ttctcctggg acatcctgtc ccctcagttc atgtacggct 960
ccagggcctt caccaagcac cccgccgaca tccccgacta ctataagcag tccttccccg 1020
agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgccgtg accgtgaccc 1080
aggacacctc cctggaggac ggcaccctga tctacaaggt gaagctccgc ggcaccaact 1140
tccctcctga cggccccgta atgcagaaga agacaatggg ctgggaagcg tccaccgagc 1200
ggttgtaccc cgaggacggc gtgctgaagg gcgacattaa gatggccctg cgcctgaagg 1260
acggcggccg ctacctggcg gacttcaaga ccacctacaa ggccaagaag cccgtgcaga 1320
tgcccggcgc ctacaacgtc gaccgcaagt tggacatcac ctcccacaac gaggactaca 1380
ccgtggtgga acagtacgaa cgctccgagg gccgccactc caccggcggc atggacgagc 1440
tgtacaagta attaattaag aattcgaccc agctttcttg tacaaagtgg ttggtaagcc 1500
tatccctaac cctctcctcg gtctcgattc tacgtagtaa tgagctagca gtctcgaggt 1560
taacgaattc cgcccccccc ctaacgttac tggccgaagc cgcttggaat aaggccggtg 1620
tgcgcttgtc tatatgttat tttccaccat attgccgtct tttggcaatg tgagggcccg 1680
gaaacctggc cctgtcttct tgacgagcat tcctaggggt ctttcccctc tcgccaaagg 1740
aatgcaaggt ctgttgaatg tcgtgaagga agcagttcct ctggaagctt cttgaagaca 1800
aacaacgtct gtagcgaccc tttgcaggca gcggaacccc ccacctggcg acaggtgccc 1860
ctgcggccaa aagccacgtg tataagatac acctgcaaag gcggcacaac cccagtgcca 1920
cgttgtgagt tggatagttg tggaaagagt caaatggctc tcctcaagcg tattcaacaa 1980
ggggctgaag gatgcccaga aggtacccca ttgtatggga tctgatctgg ggcctcggtg 2040
cacatgcttt acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc cgaaccacgg 2100
ggacgtggtt ttcctttgaa aaacacgata ataccatggc catgattaag atcgctacgc 2160
ggaagtacct ggggaaacag aacgtctacg acataggtgt ggagcgcgat cacaactttg 2220
ctctgaaaaa tggatttatc gccagcaact gcatctcccg ccgtgcacag ggtgtcacgt 2280
tgcaagacct gcctgaaacc gaactgcccg ctgttctgca gccggtcgcg gaggccatgg 2340
atgcgatcgc tgcggccgat cttagccaga cgagcgggtt cggcccattc ggaccgcaag 2400
gaatcggtca atacactaca tggcgtgatt tcatatgcgc gattgctgat ccccatgtgt 2460
atcactggca aactgtgatg gacgacaccg tcagtgcgtc cgtcgcgcag gctctcgatg 2520
agctgatgct ttgggccgag gactgccccg aagtccggca cctcgtgcac gcggatttcg 2580
gctccaacaa tgtcctgacg gacaatggcc gcataacagc ggtcattgac tggagcgagg 2640
cgatgttcgg ggattcccaa tacgaggtcg ccaacatctt cttctggagg ccgtggttgg 2700
cttgtatgga gcagcagacg cgctacttcg agcggaggca tccggagctt gcaggatcgc 2760
cgcggctccg ggcgtatatg ctccgcattg gtcttgacca actctatcag agcttggttg 2820
acggcaattt cgatgatgca gcttgggcgc agggtcgatg cgacgcaatc gtccgatccg 2880
gagccgggac tgtcgggcgt acacaaatcg cccgcagaag cgcggccgtc tggaccgatg 2940
gctgtgtaga agtactcgcc gatagtggaa accgacgccc cagcactcgt ccgagggcaa 3000
aggaatagca ccggtggcgc gttaagtcga caatcaacct ctggattaca aaatttgtga 3060
aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt 3120
aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa 3180
atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt 3240
gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct 3300
cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg 3360
ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc 3420
ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga ttctgcgcgg 3480
gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct 3540
gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc 3600
cctttgggcc gcctccccgc gtcgacttta agaccaatga cttacaaggc agctgtagat 3660
cttagccact ttttaaaaga aaagggggga ctggaagggc taattcactc ccaacgaaga 3720
caagatctgc tttttgcttg tactgggtct ctctggttag accagatctg agcctgggag 3780
ctctctggct aactagggaa cccactgctt aagcctcaat aaagcttgcc ttgagtgctt 3840
caagtagtgt gtgcccgtct gttgtgtgac tctggtaact agagatccct cagacccttt 3900
tagtcagtgt ggaaaatctc tagcagtacg tatagtagtt catgtcatct tattattcag 3960
tatttataac ttgcaaagaa atgaatatca gagagtgaga ggaacttgtt tattgcagct 4020
tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc atttttttca 4080
ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctggctctag 4140
ctatcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc gcccattctc 4200
cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc tcggcctctg 4260
agctattcca gaagtagtga ggaggctttt ttggaggcct agggacgtac ccaattcgcc 4320
ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc gtgactggga 4380
aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 4440
taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 4500
atgggacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 4560
gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 4620
cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 4680
atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag 4740
tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 4800
tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga 4860
tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 4920
atttaacgcg aattttaaca aaatattaac gcttacaatt taggtggcac ttttcgggga 4980
aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc 5040
atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt 5100
caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct 5160
cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt 5220
tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt 5280
tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac 5340
gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac 5400
tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct 5460
gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg 5520
aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg 5580
gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca 5640
atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa 5700
caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt 5760
ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc 5820
attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg 5880
agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt 5940
aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt 6000
catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc 6060
ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct 6120
tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta 6180
ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc 6240
ttcagcagag cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac 6300
ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct 6360
gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat 6420
aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg 6480
acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa 6540
gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg 6600
gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga 6660
cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc 6720
aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct 6780
gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct 6840
cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca 6900
atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg 6960
tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta gctcactcat 7020
taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc 7080
ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagcg cgcaattaac 7140
cctcactaaa gggaacaaaa gctggagctg caagcttaat gtagtcttat gcaatactct 7200
tgtagtcttg caacatggta acgatgagtt agcaacatgc cttacaagga gagaaaaagc 7260
accgtgcatg ccgattggtg gaagtaaggt ggtacgatcg tgccttatta ggaaggcaac 7320
agacgggtct gacatggatt ggacgaacca ctgaattgcc gcattgcaga gatattgtat 7380
ttaagtgcct agctcgatac ataaacgggt ctctctggtt agaccagatc tgagcctggg 7440
agctctctgg ctaactaggg aacccactgc ttaagcctca ataaagcttg ccttgagtgc 7500
ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc ctcagaccct 7560
tttagtcagt gtggaaaatc tctagcagtg gcgcccgaac agggacttga aagcgaaagg 7620
gaaaccagag gagctctctc gacgcaggac tcggcttgct gaagcgcgca cggcaagagg 7680
cgaggggcgg cgactggtga gtacgccaaa aattttgact agcggaggct agaaggagag 7740
agatgggtgc gagagcgtca gtattaagcg ggggagaatt agatcgcgat gggaaaaaat 7800
tcggttaagg ccagggggaa agaaaaaata taaattaaaa catatagtat gggcaagcag 7860
ggagctagaa cgattcgcag ttaatcctgg cctgttagaa acatcagaag gctgtagaca 7920
aatactggga cagctacaac catcccttca gacaggatca gaagaactta gatcattata 7980
taatacagta gcaaccctct attgtgtgca tcaaaggata gagataaaag acaccaagga 8040
agctttagac aagatagagg aagagcaaaa caaaagtaag accaccgcac agcaagcggc 8100
cgctgatctt cagacctgga ggaggagata tgagggacaa ttggagaagt gaattatata 8160
aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca aagagaagag 8220
tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg ttcttgggag 8280
cagcaggaag cactatgggc gcagcgtcaa tgacgctgac ggtacaggcc agacaattat 8340
tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg caacagcatc 8400
tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg gctgtggaaa 8460
gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa ctcatttgca 8520
ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag atttggaatc 8580
acacgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta atacactcct 8640
taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg gaattagata 8700
aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat ataaaattat 8760
tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta ctttctatag 8820
tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc ccaaccccga 8880
ggggaccctt gcgccttttc caaggcagcc ctgggtttgc gcagggacgc ggctgctctg 8940
ggcgtggttc cgggaaacgc agcggcgccg accctgggtc tcgcacattc ttcacgtccg 9000
ttcgcagcgt cacccggatc ttcgccgcta cccttgtggg ccccccggcg acgcttcctg 9060
ctccgcccct aagtcgggaa ggttccttgc ggttcgcggc gtgccggacg tgacaaacgg 9120
aagccgcacg tctcactagt accctcgcag acggacagcg ccagggagca atggcagcgc 9180
gccgaccgcg atgggctgtg gccaatagcg gctgctcagc agggcgcgcc gagagcagcg 9240
gccgggaagg ggcggtgcgg gaggcggggt gtggggcggt agtgtgggcc ctgttcctgc 9300
ccgcgcggtg ttccgcattc tgcaagcctc cggagcgcac gtcggcagtc ggctccctcg 9360
ttgaccgaat caccgacctc tctccccagg gggtacccag ctgtctagag aattctagat 9420
cttgagacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga ttggggggta 9480
cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta aagaattaca 9540
aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca gagatccact 9600
ttggcgccgg ctcgaggggg 9620
<210> 198
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 198
gaccccacag tggggccact a 21
<210> 199
<211> 8509
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 199
gagggcctat ttcccatgat tccttcatat ttgcatatac gatacaaggc tgttagagag 60
ataattggaa ttaatttgac tgtaaacaca aagatattag tacaaaatac gtgacgtaga 120
aagtaataat ttcttgggta gtttgcagtt ttaaaattat gttttaaaat ggactatcat 180
atgcttaccg taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga 240
cgaaacaccg accccacagt ggggccacta gttttagagc tagaaatagc aagttaaaat 300
aaggctagtc cgttatcaac ttgaaaaagt ggcaccgagt cggtgctttt ttgttttaga 360
gctagaaata gcaagttaaa ataaggctag tccgttttta gcgcgtgcgc caattctgca 420
gacaaatggc tctagaggta cccgttacat aacttacggt aaatggcccg cctggctgac 480
cgcccaacga cccccgccca ttgacgtcaa tagtaacgcc aatagggact ttccattgac 540
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 600
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattgtgccc 660
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 720
ttaccatggt cgaggtgagc cccacgttct gcttcactct ccccatctcc cccccctccc 780
cacccccaat tttgtattta tttatttttt aattattttg tgcagcgatg ggggcggggg 840
gggggggggg gcgcgcgcca ggcggggcgg ggcggggcga ggggcggggc ggggcgaggc 900
ggagaggtgc ggcggcagcc aatcagagcg gcgcgctccg aaagtttcct tttatggcga 960
ggcggcggcg gcggcggccc tataaaaagc gaagcgcgcg gcgggcggga gtcgctgcga 1020
cgctgccttc gccccgtgcc ccgctccgcc gccgcctcgc gccgcccgcc ccggctctga 1080
ctgaccgcgt tactcccaca ggtgagcggg cgggacggcc cttctcctcc gggctgtaat 1140
tagctgagca agaggtaagg gtttaaggga tggttggttg gtggggtatt aatgtttaat 1200
tacctggagc acctgcctga aatcactttt tttcaggttg gaccggtgcc accatggact 1260
ataaggacca cgacggagac tacaaggatc atgatattga ttacaaagac gatgacgata 1320
agatggcccc aaagaagaag cggaaggtcg gtatccacgg agtcccagca gccgacaaga 1380
agtacagcat cggcctggac atcggcacca actctgtggg ctgggccgtg atcaccgacg 1440
agtacaaggt gcccagcaag aaattcaagg tgctgggcaa caccgaccgg cacagcatca 1500
agaagaacct gatcggagcc ctgctgttcg acagcggcga aacagccgag gccacccggc 1560
tgaagagaac cgccagaaga agatacacca gacggaagaa ccggatctgc tatctgcaag 1620
agatcttcag caacgagatg gccaaggtgg acgacagctt cttccacaga ctggaagagt 1680
ccttcctggt ggaagaggat aagaagcacg agcggcaccc catcttcggc aacatcgtgg 1740
acgaggtggc ctaccacgag aagtacccca ccatctacca cctgagaaag aaactggtgg 1800
acagcaccga caaggccgac ctgcggctga tctatctggc cctggcccac atgatcaagt 1860
tccggggcca cttcctgatc gagggcgacc tgaaccccga caacagcgac gtggacaagc 1920
tgttcatcca gctggtgcag acctacaacc agctgttcga ggaaaacccc atcaacgcca 1980
gcggcgtgga cgccaaggcc atcctgtctg ccagactgag caagagcaga cggctggaaa 2040
atctgatcgc ccagctgccc ggcgagaaga agaatggcct gttcggaaac ctgattgccc 2100
tgagcctggg cctgaccccc aacttcaaga gcaacttcga cctggccgag gatgccaaac 2160
tgcagctgag caaggacacc tacgacgacg acctggacaa cctgctggcc cagatcggcg 2220
accagtacgc cgacctgttt ctggccgcca agaacctgtc cgacgccatc ctgctgagcg 2280
acatcctgag agtgaacacc gagatcacca aggcccccct gagcgcctct atgatcaaga 2340
gatacgacga gcaccaccag gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc 2400
ctgagaagta caaagagatt ttcttcgacc agagcaagaa cggctacgcc ggctacattg 2460
acggcggagc cagccaggaa gagttctaca agttcatcaa gcccatcctg gaaaagatgg 2520
acggcaccga ggaactgctc gtgaagctga acagagagga cctgctgcgg aagcagcgga 2580
ccttcgacaa cggcagcatc ccccaccaga tccacctggg agagctgcac gccattctgc 2640
ggcggcagga agatttttac ccattcctga aggacaaccg ggaaaagatc gagaagatcc 2700
tgaccttccg catcccctac tacgtgggcc ctctggccag gggaaacagc agattcgcct 2760
ggatgaccag aaagagcgag gaaaccatca ccccctggaa cttcgaggaa gtggtggaca 2820
agggcgcttc cgcccagagc ttcatcgagc ggatgaccaa cttcgataag aacctgccca 2880
acgagaaggt gctgcccaag cacagcctgc tgtacgagta cttcaccgtg tataacgagc 2940
tgaccaaagt gaaatacgtg accgagggaa tgagaaagcc cgccttcctg agcggcgagc 3000
agaaaaaggc catcgtggac ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc 3060
tgaaagagga ctacttcaag aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg 3120
aagatcggtt caacgcctcc ctgggcacat accacgatct gctgaaaatt atcaaggaca 3180
aggacttcct ggacaatgag gaaaacgagg acattctgga agatatcgtg ctgaccctga 3240
cactgtttga ggacagagag atgatcgagg aacggctgaa aacctatgcc cacctgttcg 3300
acgacaaagt gatgaagcag ctgaagcggc ggagatacac cggctggggc aggctgagcc 3360
ggaagctgat caacggcatc cgggacaagc agtccggcaa gacaatcctg gatttcctga 3420
agtccgacgg cttcgccaac agaaacttca tgcagctgat ccacgacgac agcctgacct 3480
ttaaagagga catccagaaa gcccaggtgt ccggccaggg cgatagcctg cacgagcaca 3540
ttgccaatct ggccggcagc cccgccatta agaagggcat cctgcagaca gtgaaggtgg 3600
tggacgagct cgtgaaagtg atgggccggc acaagcccga gaacatcgtg atcgaaatgg 3660
ccagagagaa ccagaccacc cagaagggac agaagaacag ccgcgagaga atgaagcgga 3720
tcgaagaggg catcaaagag ctgggcagcc agatcctgaa agaacacccc gtggaaaaca 3780
cccagctgca gaacgagaag ctgtacctgt actacctgca gaatgggcgg gatatgtacg 3840
tggaccagga actggacatc aaccggctgt ccgactacga tgtggaccat atcgtgcctc 3900
agagctttct gaaggacgac tccatcgaca acaaggtgct gaccagaagc gacaagaacc 3960
ggggcaagag cgacaacgtg ccctccgaag aggtcgtgaa gaagatgaag aactactggc 4020
ggcagctgct gaacgccaag ctgattaccc agagaaagtt cgacaatctg accaaggccg 4080
agagaggcgg cctgagcgaa ctggataagg ccggcttcat caagagacag ctggtggaaa 4140
cccggcagat cacaaagcac gtggcacaga tcctggactc ccggatgaac actaagtacg 4200
acgagaatga caagctgatc cgggaagtga aagtgatcac cctgaagtcc aagctggtgt 4260
ccgatttccg gaaggatttc cagttttaca aagtgcgcga gatcaacaac taccaccacg 4320
cccacgacgc ctacctgaac gccgtcgtgg gaaccgccct gatcaaaaag taccctaagc 4380
tggaaagcga gttcgtgtac ggcgactaca aggtgtacga cgtgcggaag atgatcgcca 4440
agagcgagca ggaaatcggc aaggctaccg ccaagtactt cttctacagc aacatcatga 4500
actttttcaa gaccgagatt accctggcca acggcgagat ccggaagcgg cctctgatcg 4560
agacaaacgg cgaaaccggg gagatcgtgt gggataaggg ccgggatttt gccaccgtgc 4620
ggaaagtgct gagcatgccc caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg 4680
gcttcagcaa agagtctatc ctgcccaaga ggaacagcga taagctgatc gccagaaaga 4740
aggactggga ccctaagaag tacggcggct tcgacagccc caccgtggcc tattctgtgc 4800
tggtggtggc caaagtggaa aagggcaagt ccaagaaact gaagagtgtg aaagagctgc 4860
tggggatcac catcatggaa agaagcagct tcgagaagaa tcccatcgac tttctggaag 4920
ccaagggcta caaagaagtg aaaaaggacc tgatcatcaa gctgcctaag tactccctgt 4980
tcgagctgga aaacggccgg aagagaatgc tggcctctgc cggcgaactg cagaagggaa 5040
acgaactggc cctgccctcc aaatatgtga acttcctgta cctggccagc cactatgaga 5100
agctgaaggg ctcccccgag gataatgagc agaaacagct gtttgtggaa cagcacaagc 5160
actacctgga cgagatcatc gagcagatca gcgagttctc caagagagtg atcctggccg 5220
acgctaatct ggacaaagtg ctgtccgcct acaacaagca ccgggataag cccatcagag 5280
agcaggccga gaatatcatc cacctgttta ccctgaccaa tctgggagcc cctgccgcct 5340
tcaagtactt tgacaccacc atcgaccgga agaggtacac cagcaccaaa gaggtgctgg 5400
acgccaccct gatccaccag agcatcaccg gcctgtacga gacacggatc gacctgtctc 5460
agctgggagg cgacaaaagg ccggcggcca cgaaaaaggc cggccaggca aaaaagaaaa 5520
agtaagaatt cctagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc 5580
tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct 5640
ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg 5700
gggtggggtg gggcaggaca gcaaggggga ggattgggaa gagaatagca ggcatgctgg 5760
ggagcggccg caggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc 5820
gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc 5880
agtgagcgag cgagcgcgca gctgcctgca ggggcgcctg atgcggtatt ttctccttac 5940
gcatctgtgc ggtatttcac accgcatacg tcaaagcaac catagtacgc gccctgtagc 6000
ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc 6060
gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt 6120
ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc tttacggcac 6180
ctcgacccca aaaaacttga tttgggtgat ggttcacgta gtgggccatc gccctgatag 6240
acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact cttgttccaa 6300
actggaacaa cactcaaccc tatctcgggc tattcttttg atttataagg gattttgccg 6360
atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac 6420
aaaatattaa cgtttacaat tttatggtgc actctcagta caatctgctc tgatgccgca 6480
tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg ggcttgtctg 6540
ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat gtgtcagagg 6600
ttttcaccgt catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg cctattttta 6660
taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat 6720
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 6780
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 6840
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 6900
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 6960
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 7020
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc 7080
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 7140
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 7200
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 7260
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 7320
ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg 7380
gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 7440
ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 7500
gctggctggt ttattgctga taaatctgga gccggtgagc gtggaagccg cggtatcatt 7560
gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 7620
caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 7680
cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 7740
ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 7800
taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 7860
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 7920
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 7980
agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 8040
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 8100
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 8160
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 8220
tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 8280
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 8340
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 8400
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 8460
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgt 8509
<210> 200
<211> 11194
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 200
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggttccttt 5340
gccctcggac gagtgctggg gcgtcggttt ccactatcgg cgagtacttc tacacagcca 5400
tcggtccaga cggccgcgct tctgcgggcg atttgtgtac gcccgacagt cccggctccg 5460
gatcggacga ttgcgtcgca tcgaccctgc gcccaagctg catcatcgaa attgccgtca 5520
accaagctct gatagagttg gtcaagacca atgcggagca tatacgcccg gagccgcggc 5580
gatcctgcaa gctccggatg cctccgctcg aagtagcgcg tctgctgctc catacaagcc 5640
aaccacggcc tccagaagaa gatgttggcg acctcgtatt gggaatcccc gaacatcgcc 5700
tcgctccagt caatgaccgc tgttatgcgg ccattgtccg tcaggacatt gttggagccg 5760
aaatccgcgt gcacgaggtg ccggacttcg gggcagtcct cggcccaaag catcagctca 5820
tcgagagcct gcgcgacgga cgcactgacg gtgtcgtcca tcacagtttg ccagtgatac 5880
acatggggat cagcaatcgc gcatatgaaa tcacgccatg tagtgtattg accgattcct 5940
tgcggtccga atgggccgaa cccgctcgtc tggctaagat cggccgcagc gatcgcatcc 6000
atggcctccg cgaccggctg cagaacagcg ggcagttcgg tttcaggcag gtcttgcaac 6060
gtgacaccct gtgcacggcg ggagatgcaa taggtcaggc tctcgctaaa ttccccaatg 6120
tcaagcactt ccggaatcgg gagcgcggcc gatgcaaagt gccgataaac ataacgatct 6180
ttgtagaaac catcggcgca gctatttacc cgcaggacat atccacgccc tcctacatcg 6240
aagctgaaag cacgagattc ttcgccctcc gagagctgca tcaggtcgga gacgctgtcg 6300
aacttttcga tcagaaactt ctcgacagac gtcgcggtga gttcaggctt tttcatggtg 6360
gcggcactag taagggcgaa ttcggagcct gcttttttgt acaaacttgt tgatatctgc 6420
agaattccac cacactggac tagtggatcc gagctcggta ccaagcttct tcacgacacc 6480
tgaaatggaa gaaaaaaact ttgaaccact gtctgaggct tgagaatgaa ccaagatcca 6540
aactcaaaaa gggcaaattc caaggagaat tacatcaagt gccaagctgg cctaacttca 6600
gtctccaccc actcagtgtg gggaaactcc atcgcataaa acccctcccc ccaacctaaa 6660
gacgacgtac tccaaaagct cgagaactaa tcgaggtgcc tggacggcgc ccggtactcc 6720
gtggagtcac atgaagcgac ggctgaggac ggaaaggccc ttttcctttg tgtgggtgac 6780
tcacccgccc gctctcccga gcgccgcgtc ctccattttg agctccctgc agcagggccg 6840
ggaagcggcc atctttccgc tcacgcaact ggtgccgacc gggccagcct tgccgcccag 6900
ggcggggcga tacacggcgg cgcgaggcca ggcaccagag caggccggcc agcttgagac 6960
tacccccgtc cgattctcgg tggccgcgct cgcaggcccc gcctcgccga acatgtgcgc 7020
tgggacgcac gggccccgtc gccgcccgcg gccccaaaaa ccgaaatacc agtgtgcaga 7080
tcttggcccg catttacaag actatcttgc cagaaaaaaa gcgtcgcagc aggtcatcaa 7140
aaattttaaa tggctagaga cttatcgaaa gcagcgagac aggcgcgaag gtgccaccag 7200
attcgcacgc ggcggcccca gcgcccaggc caggcctcaa ctcaagcacg aggcgaaggg 7260
gctccttaag cgcaaggcct cgaactctcc cacccacttc caacccgaag ctcgggatca 7320
agaatcacgt actgcagcca ggtggaagta attcaaggca cgcaagggcc ataacccgta 7380
aagaggccag gcccgcggga accacacacg gcacttacct gtgttctggc ggcaaacccg 7440
ttgcgaaaaa gaacgttcac ggcgactact gcacttatat acggttctcc cccaccctcg 7500
ggaaaaaggc ggagccagta cacgacatca ctttcccagt ttaccccgcg ccaccttctc 7560
taggcaccgg ttcaattgcc gacccctccc cccaacttct cggggactgt gggcgatgtg 7620
cgctctgccc actgacgggc accggagcct cacgatcgat atgtcgagtt tactccctat 7680
cagtgataga gaacgtatgt cgagtttact ccctatcagt gatagagaac gatgtcgagt 7740
ttactcccta tcagtgatag agaacgtatg tcgagtttac tccctatcag tgatagagaa 7800
cgtatgtcga gtttactccc tatcagtgat agagaacgta tgtcgagttt atccctatca 7860
gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt atgtcgaggt 7920
aggcgtgtac ggtgggaggc ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 7980
tggagaattg gctaggcacc ggtgacaagt ttgtacaaaa aagcaggctc cgaattcgcc 8040
cttactagtg ccgccaccat gaaaacattt aacatttctc aacaggatct agaattagta 8100
gaagtagcga cagagaagat tacaatgctt tatgaggata ataaacatca tgtgggagcg 8160
gcaattcgta cgaaaacagg agaaatcatt tcggcagtac atattgaagc gtatatagga 8220
cgagtaactg tttgtgcaga agccattgcg attggtagtg cagtttcgaa tggacaaaag 8280
gattttgaca cgattgtagc tgttagacac ccttattctg acgaagtaga tagaagtatt 8340
cgagtggtaa gtccttgtgg tatgtgtagg gagttgattt cagactatgc accagattgt 8400
tttgtgttaa tagaaatgaa tggcaagtta gtcaaaacta cgattgaaga actcattcca 8460
ctcaaatata cccgaaatac tagtggcagc ggcgccacaa acttctctct gctaaagcaa 8520
gcaggtgatg ttgaagaaaa ccccgggcct ggcgcgccaa tggtgagcaa gggcgaggag 8580
ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa cggccacaag 8640
ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac cctgaagttc 8700
atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac cctgacctac 8760
ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt cttcaagtcc 8820
gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga cggcaactac 8880
aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat cgagctgaag 8940
ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta caactacaac 9000
agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt gaacttcaag 9060
atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca gcagaacacc 9120
cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac ccagtccgcc 9180
ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc 9240
gccgggatca ctctcggcat ggacgagctg tacaagtaat taattaagag ggcgaattcg 9300
acccagcttt cttgtacaaa gtggttgata tccagcacag tggcggccgc tcgagtctag 9360
agggcccgcg gttcgaaggt aagcctatcc ctaaccctct cctcggtctc gattctacgc 9420
gtaccggtta ggggcccgtt taaacccgct gatcagcctc gactgtgcct tctagttgcc 9480
agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt gccactccca 9540
ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg tgtcattcta 9600
ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac aatagcaggc 9660
atgctgggga tgcggtgggc tctatggctc tagaagtcga cagtactaag ctttgacaga 9720
aaagccccat ccttaggcct cctccttcct agtctcctga tattgggtct aacccccacc 9780
tcctgttagg cagattcctt atctggtgac acacccccat ttcctggagc catctctctc 9840
cttgccagaa cctctaaggt ttgcttacga tggagccaga gaggatcctg ggagggagag 9900
cttggcaggg ggtgggaggg aaggggggga tgcgtgacct gcccggttct cagtggccac 9960
cctgcgctac cctctcccag aacctgagct gctctgacgc ggctgtctgg tgcgtttcac 10020
tgatcctggt gctgcagctt ccttacactt cccaagagga gaagcagttt ggaaaaacaa 10080
aatcagaata agttggtcct gagttctaac tttggctctt cacctttcta gtccccaatt 10140
tatattgttc ctccgtgcgt cagttttacc tgtgagataa ggccagtagc cagccccgtc 10200
ctggcagggc tgtggtgagg aggggggtgt ccgtgtggaa aactcccttt gtgagaatgg 10260
tgcgtcctag gtgttcacca ggtcgtggcc gcctctactc cctttctctt tctccatcct 10320
tctttcctta aagagtcccc agtgctatct gggacatatt cctccgccca gagcagggtc 10380
ccgcttccct aaggccctgc tctgggcttc tgggtttgag tccttggcaa gcccaggaga 10440
ggcgctcagg cttccctgtc ccccttcctc gtccaccatc tcatgcccct ggctctcctg 10500
ccccttccct acaggggttc ctggctctgc tctagcgatc gccaattcgc cctatagtga 10560
gtcgtattac aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 10620
tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 10680
ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat gggacgcgcc 10740
ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact 10800
tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc 10860
cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat ttagtgcttt 10920
acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc 10980
ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt 11040
gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat 11100
tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa 11160
ttttaacaaa atattaacgc ttacaattta ggtg 11194
<210> 201
<211> 11173
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 201
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggttccttt 5340
gccctcggac gagtgctggg gcgtcggttt ccactatcgg cgagtacttc tacacagcca 5400
tcggtccaga cggccgcgct tctgcgggcg atttgtgtac gcccgacagt cccggctccg 5460
gatcggacga ttgcgtcgca tcgaccctgc gcccaagctg catcatcgaa attgccgtca 5520
accaagctct gatagagttg gtcaagacca atgcggagca tatacgcccg gagccgcggc 5580
gatcctgcaa gctccggatg cctccgctcg aagtagcgcg tctgctgctc catacaagcc 5640
aaccacggcc tccagaagaa gatgttggcg acctcgtatt gggaatcccc gaacatcgcc 5700
tcgctccagt caatgaccgc tgttatgcgg ccattgtccg tcaggacatt gttggagccg 5760
aaatccgcgt gcacgaggtg ccggacttcg gggcagtcct cggcccaaag catcagctca 5820
tcgagagcct gcgcgacgga cgcactgacg gtgtcgtcca tcacagtttg ccagtgatac 5880
acatggggat cagcaatcgc gcatatgaaa tcacgccatg tagtgtattg accgattcct 5940
tgcggtccga atgggccgaa cccgctcgtc tggctaagat cggccgcagc gatcgcatcc 6000
atggcctccg cgaccggctg cagaacagcg ggcagttcgg tttcaggcag gtcttgcaac 6060
gtgacaccct gtgcacggcg ggagatgcaa taggtcaggc tctcgctaaa ttccccaatg 6120
tcaagcactt ccggaatcgg gagcgcggcc gatgcaaagt gccgataaac ataacgatct 6180
ttgtagaaac catcggcgca gctatttacc cgcaggacat atccacgccc tcctacatcg 6240
aagctgaaag cacgagattc ttcgccctcc gagagctgca tcaggtcgga gacgctgtcg 6300
aacttttcga tcagaaactt ctcgacagac gtcgcggtga gttcaggctt tttcatggtg 6360
gcggcactag taagggcgaa ttcggagcct gcttttttgt acaaacttgt tgatatctgc 6420
agaattccac cacactggac tagtggatcc gagctcggta ccaagcttct tcacgacacc 6480
tgaaatggaa gaaaaaaact ttgaaccact gtctgaggct tgagaatgaa ccaagatcca 6540
aactcaaaaa gggcaaattc caaggagaat tacatcaagt gccaagctgg cctaacttca 6600
gtctccaccc actcagtgtg gggaaactcc atcgcataaa acccctcccc ccaacctaaa 6660
gacgacgtac tccaaaagct cgagaactaa tcgaggtgcc tggacggcgc ccggtactcc 6720
gtggagtcac atgaagcgac ggctgaggac ggaaaggccc ttttcctttg tgtgggtgac 6780
tcacccgccc gctctcccga gcgccgcgtc ctccattttg agctccctgc agcagggccg 6840
ggaagcggcc atctttccgc tcacgcaact ggtgccgacc gggccagcct tgccgcccag 6900
ggcggggcga tacacggcgg cgcgaggcca ggcaccagag caggccggcc agcttgagac 6960
tacccccgtc cgattctcgg tggccgcgct cgcaggcccc gcctcgccga acatgtgcgc 7020
tgggacgcac gggccccgtc gccgcccgcg gccccaaaaa ccgaaatacc agtgtgcaga 7080
tcttggcccg catttacaag actatcttgc cagaaaaaaa gcgtcgcagc aggtcatcaa 7140
aaattttaaa tggctagaga cttatcgaaa gcagcgagac aggcgcgaag gtgccaccag 7200
attcgcacgc ggcggcccca gcgcccaggc caggcctcaa ctcaagcacg aggcgaaggg 7260
gctccttaag cgcaaggcct cgaactctcc cacccacttc caacccgaag ctcgggatca 7320
agaatcacgt actgcagcca ggtggaagta attcaaggca cgcaagggcc ataacccgta 7380
aagaggccag gcccgcggga accacacacg gcacttacct gtgttctggc ggcaaacccg 7440
ttgcgaaaaa gaacgttcac ggcgactact gcacttatat acggttctcc cccaccctcg 7500
ggaaaaaggc ggagccagta cacgacatca ctttcccagt ttaccccgcg ccaccttctc 7560
taggcaccgg ttcaattgcc gacccctccc cccaacttct cggggactgt gggcgatgtg 7620
cgctctgccc actgacgggc accggagcct cacgatcgat atgtcgagtt tactccctat 7680
cagtgataga gaacgtatgt cgagtttact ccctatcagt gatagagaac gatgtcgagt 7740
ttactcccta tcagtgatag agaacgtatg tcgagtttac tccctatcag tgatagagaa 7800
cgtatgtcga gtttactccc tatcagtgat agagaacgta tgtcgagttt atccctatca 7860
gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt atgtcgaggt 7920
aggcgtgtac ggtgggaggc ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 7980
tggagaattg gctaggcacc ggtgacaagt ttgtacaaaa aagcaggctc cgaattcgcc 8040
cttactagtg ccgccaccat gaaaacattt aacatttctc aacaggatct agaattagta 8100
gaagtagcga cagagaagat tacaatgctt tatgaggata ataaacatca tgtgggagcg 8160
gcaattcgta cgaaaacagg agaaatcatt tcggcagtac atattgaagc gtatatagga 8220
cgagtaactg tttgtgcaga agccattgcg attggtagtg cagtttcgaa tggacaaaag 8280
gattttgaca cgattgtagc tgttagacac ccttattctg acgaagtaga tagaagtatt 8340
cgagtggtaa gtccttgtgg tatgtgtagg gagttgattt cagactatgc accagattgt 8400
tttgtgttaa tagaaatgaa tggcaagtta gtcaaaacta cgattgaaga actcattcca 8460
ctcaaatata cccgaaatac tagtggcagc ggcgccacaa acttctctct gctaaagcaa 8520
gcaggtgatg ttgaagaaaa ccccgggcct ggcgcgccaa tggtgagcaa gggcgaggca 8580
gtgatcaagg agttcatgcg gttcaaggtg cacatggagg gctccatgaa cggccacgag 8640
ttcgagatcg agggcgaggg cgagggccgc ccctacgagg gcacccagac cgccaagctg 8700
aaggtgacca agggtggccc cctgcccttc tcctgggaca tcctgtcccc tcagttcatg 8760
tacggctcca gggccttcac caagcacccc gccgacatcc ccgactacta taagcagtcc 8820
ttccccgagg gcttcaagtg ggagcgcgtg atgaacttcg aggacggcgg cgccgtgacc 8880
gtgacccagg acacctccct ggaggacggc accctgatct acaaggtgaa gctccgtggc 8940
accaacttcc ctcctgacgg ccccgtaatg cagaagaaga caatgggctg ggaagcgtcc 9000
accgagcggt tgtaccccga ggacggcgtg ctgaagggcg acattaagat ggccctgcgc 9060
ctgaaggacg gcggccgcta cctggcggac ttcaagacca cctacaaggc caagaagccc 9120
gtgcagatgc ccggcgccta caacgtcgac cgcaagttgg acatcacctc ccacaacgag 9180
gactacaccg tggtggaaca gtacgaacgc tccgagggcc gccactccac cggcggcatg 9240
gacgagctgt acaagtgatt aattaaaagg gcgaattcga cccagctttc ttgtacaaag 9300
tggttgatat ccagcacagt ggcggccgct cgagtctaga gggcccgcgg ttcgaaggta 9360
agcctatccc taaccctctc ctcggtctcg attctacgcg taccggttag gggcccgttt 9420
aaacccgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct 9480
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 9540
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 9600
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct 9660
ctatggctct agaagtcgac agtactaagc tttgacagaa aagccccatc cttaggcctc 9720
ctccttccta gtctcctgat attgggtcta acccccacct cctgttaggc agattcctta 9780
tctggtgaca cacccccatt tcctggagcc atctctctcc ttgccagaac ctctaaggtt 9840
tgcttacgat ggagccagag aggatcctgg gagggagagc ttggcagggg gtgggaggga 9900
agggggggat gcgtgacctg cccggttctc agtggccacc ctgcgctacc ctctcccaga 9960
acctgagctg ctctgacgcg gctgtctggt gcgtttcact gatcctggtg ctgcagcttc 10020
cttacacttc ccaagaggag aagcagtttg gaaaaacaaa atcagaataa gttggtcctg 10080
agttctaact ttggctcttc acctttctag tccccaattt atattgttcc tccgtgcgtc 10140
agttttacct gtgagataag gccagtagcc agccccgtcc tggcagggct gtggtgagga 10200
ggggggtgtc cgtgtggaaa actccctttg tgagaatggt gcgtcctagg tgttcaccag 10260
gtcgtggccg cctctactcc ctttctcttt ctccatcctt ctttccttaa agagtcccca 10320
gtgctatctg ggacatattc ctccgcccag agcagggtcc cgcttcccta aggccctgct 10380
ctgggcttct gggtttgagt ccttggcaag cccaggagag gcgctcaggc ttccctgtcc 10440
cccttcctcg tccaccatct catgcccctg gctctcctgc cccttcccta caggggttcc 10500
tggctctgct ctagcgatcg ccaattcgcc ctatagtgag tcgtattaca attcactggc 10560
cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 10620
agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc 10680
ccaacagttg cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg cattaagcgc 10740
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 10800
tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 10860
aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 10920
acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 10980
tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 11040
caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt cggcctattg 11100
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgct 11160
tacaatttag gtg 11173
<210> 202
<211> 11386
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 202
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggttccttt 5340
gccctcggac gagtgctggg gcgtcggttt ccactatcgg cgagtacttc tacacagcca 5400
tcggtccaga cggccgcgct tctgcgggcg atttgtgtac gcccgacagt cccggctccg 5460
gatcggacga ttgcgtcgca tcgaccctgc gcccaagctg catcatcgaa attgccgtca 5520
accaagctct gatagagttg gtcaagacca atgcggagca tatacgcccg gagccgcggc 5580
gatcctgcaa gctccggatg cctccgctcg aagtagcgcg tctgctgctc catacaagcc 5640
aaccacggcc tccagaagaa gatgttggcg acctcgtatt gggaatcccc gaacatcgcc 5700
tcgctccagt caatgaccgc tgttatgcgg ccattgtccg tcaggacatt gttggagccg 5760
aaatccgcgt gcacgaggtg ccggacttcg gggcagtcct cggcccaaag catcagctca 5820
tcgagagcct gcgcgacgga cgcactgacg gtgtcgtcca tcacagtttg ccagtgatac 5880
acatggggat cagcaatcgc gcatatgaaa tcacgccatg tagtgtattg accgattcct 5940
tgcggtccga atgggccgaa cccgctcgtc tggctaagat cggccgcagc gatcgcatcc 6000
atggcctccg cgaccggctg cagaacagcg ggcagttcgg tttcaggcag gtcttgcaac 6060
gtgacaccct gtgcacggcg ggagatgcaa taggtcaggc tctcgctaaa ttccccaatg 6120
tcaagcactt ccggaatcgg gagcgcggcc gatgcaaagt gccgataaac ataacgatct 6180
ttgtagaaac catcggcgca gctatttacc cgcaggacat atccacgccc tcctacatcg 6240
aagctgaaag cacgagattc ttcgccctcc gagagctgca tcaggtcgga gacgctgtcg 6300
aacttttcga tcagaaactt ctcgacagac gtcgcggtga gttcaggctt tttcatggtg 6360
gcggcactag taagggcgaa ttcggagcct gcttttttgt acaaacttgt tgatatctgc 6420
agaattccac cacactggac tagtggatcc gagctcggta ccaagcttct tcacgacacc 6480
tgaaatggaa gaaaaaaact ttgaaccact gtctgaggct tgagaatgaa ccaagatcca 6540
aactcaaaaa gggcaaattc caaggagaat tacatcaagt gccaagctgg cctaacttca 6600
gtctccaccc actcagtgtg gggaaactcc atcgcataaa acccctcccc ccaacctaaa 6660
gacgacgtac tccaaaagct cgagaactaa tcgaggtgcc tggacggcgc ccggtactcc 6720
gtggagtcac atgaagcgac ggctgaggac ggaaaggccc ttttcctttg tgtgggtgac 6780
tcacccgccc gctctcccga gcgccgcgtc ctccattttg agctccctgc agcagggccg 6840
ggaagcggcc atctttccgc tcacgcaact ggtgccgacc gggccagcct tgccgcccag 6900
ggcggggcga tacacggcgg cgcgaggcca ggcaccagag caggccggcc agcttgagac 6960
tacccccgtc cgattctcgg tggccgcgct cgcaggcccc gcctcgccga acatgtgcgc 7020
tgggacgcac gggccccgtc gccgcccgcg gccccaaaaa ccgaaatacc agtgtgcaga 7080
tcttggcccg catttacaag actatcttgc cagaaaaaaa gcgtcgcagc aggtcatcaa 7140
aaattttaaa tggctagaga cttatcgaaa gcagcgagac aggcgcgaag gtgccaccag 7200
attcgcacgc ggcggcccca gcgcccaggc caggcctcaa ctcaagcacg aggcgaaggg 7260
gctccttaag cgcaaggcct cgaactctcc cacccacttc caacccgaag ctcgggatca 7320
agaatcacgt actgcagcca ggtggaagta attcaaggca cgcaagggcc ataacccgta 7380
aagaggccag gcccgcggga accacacacg gcacttacct gtgttctggc ggcaaacccg 7440
ttgcgaaaaa gaacgttcac ggcgactact gcacttatat acggttctcc cccaccctcg 7500
ggaaaaaggc ggagccagta cacgacatca ctttcccagt ttaccccgcg ccaccttctc 7560
taggcaccgg ttcaattgcc gacccctccc cccaacttct cggggactgt gggcgatgtg 7620
cgctctgccc actgacgggc accggagcct cacgatcgat atgtcgagtt tactccctat 7680
cagtgataga gaacgtatgt cgagtttact ccctatcagt gatagagaac gatgtcgagt 7740
ttactcccta tcagtgatag agaacgtatg tcgagtttac tccctatcag tgatagagaa 7800
cgtatgtcga gtttactccc tatcagtgat agagaacgta tgtcgagttt atccctatca 7860
gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt atgtcgaggt 7920
aggcgtgtac ggtgggaggc ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 7980
tggagaattg gctaggcacc ggtgacaagt ttgtacaaaa aagcaggctc cgaattcgcc 8040
cttactagtg ccgccaccat gaaaacattt aacatttctc aacaggatct agaattagta 8100
gaagtagcga cagagaagat tacaatgctt tatgaggata ataaacatca tgtgggagcg 8160
gcaattcgta cgaaaacagg agaaatcatt tcggcagtac atattgaagc gtatatagga 8220
cgagtaactg tttgtgcaga agccattgcg attggtagtg cagtttcgaa tggacaaaag 8280
gattttgaca cgattgtagc tgttagacac ccttattctg acgaagtaga tagaagtatt 8340
cgagtggtaa gtccttgtgg tatgtgcctt tcatacgaga ccgagatcct gactgtcgag 8400
tacggattgc ttcctatcgg caaaatcgtg gagaagagga ttgaatgtac cgtctattca 8460
gtcgataata atgggaacat ctacacacag cccgtggctc aatggcacga cagaggagag 8520
caggaagttt ttgaatactg tctcgaggac ggatccctca tccgcgctac taaagatcat 8580
aagtttatga ccgtggacgg ccagatgctg ccaattgacg aaatttttga acgagagctg 8640
gatctgatga gagtcgacaa ccttccaaac actagtggca gcggcgccac aaacttctct 8700
ctgctaaagc aagcaggtga tgttgaagaa aaccccgggc ctggcgcgcc aatggtgagc 8760
aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta 8820
aacggccaca agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg 8880
accctgaagt tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc 8940
accctgacct acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac 9000
ttcttcaagt ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac 9060
gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc 9120
atcgagctga agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag 9180
tacaactaca acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag 9240
gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac 9300
cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc 9360
acccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag 9420
ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaagta attaattaag 9480
agggcgaatt cgacccagct ttcttgtaca aagtggttga tatccagcac agtggcggcc 9540
gctcgagtct agagggcccg cggttcgaag gtaagcctat ccctaaccct ctcctcggtc 9600
tcgattctac gcgtaccggt taggggcccg tttaaacccg ctgatcagcc tcgactgtgc 9660
cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag 9720
gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta 9780
ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag 9840
acaatagcag gcatgctggg gatgcggtgg gctctatggc tctagaagtc gacagtacta 9900
agctttgaca gaaaagcccc atccttaggc ctcctccttc ctagtctcct gatattgggt 9960
ctaaccccca cctcctgtta ggcagattcc ttatctggtg acacaccccc atttcctgga 10020
gccatctctc tccttgccag aacctctaag gtttgcttac gatggagcca gagaggatcc 10080
tgggagggag agcttggcag ggggtgggag ggaagggggg gatgcgtgac ctgcccggtt 10140
ctcagtggcc accctgcgct accctctccc agaacctgag ctgctctgac gcggctgtct 10200
ggtgcgtttc actgatcctg gtgctgcagc ttccttacac ttcccaagag gagaagcagt 10260
ttggaaaaac aaaatcagaa taagttggtc ctgagttcta actttggctc ttcacctttc 10320
tagtccccaa tttatattgt tcctccgtgc gtcagtttta cctgtgagat aaggccagta 10380
gccagccccg tcctggcagg gctgtggtga ggaggggggt gtccgtgtgg aaaactccct 10440
ttgtgagaat ggtgcgtcct aggtgttcac caggtcgtgg ccgcctctac tccctttctc 10500
tttctccatc cttctttcct taaagagtcc ccagtgctat ctgggacata ttcctccgcc 10560
cagagcaggg tcccgcttcc ctaaggccct gctctgggct tctgggtttg agtccttggc 10620
aagcccagga gaggcgctca ggcttccctg tcccccttcc tcgtccacca tctcatgccc 10680
ctggctctcc tgccccttcc ctacaggggt tcctggctct gctctagcga tcgccaattc 10740
gccctatagt gagtcgtatt acaattcact ggccgtcgtt ttacaacgtc gtgactggga 10800
aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 10860
taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 10920
atgggacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 10980
gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 11040
cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 11100
atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag 11160
tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 11220
tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga 11280
tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 11340
atttaacgcg aattttaaca aaatattaac gcttacaatt taggtg 11386
<210> 203
<211> 10975
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 203
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggttccttt 5340
gccctcggac gagtgctggg gcgtcggttt ccactatcgg cgagtacttc tacacagcca 5400
tcggtccaga cggccgcgct tctgcgggcg atttgtgtac gcccgacagt cccggctccg 5460
gatcggacga ttgcgtcgca tcgaccctgc gcccaagctg catcatcgaa attgccgtca 5520
accaagctct gatagagttg gtcaagacca atgcggagca tatacgcccg gagccgcggc 5580
gatcctgcaa gctccggatg cctccgctcg aagtagcgcg tctgctgctc catacaagcc 5640
aaccacggcc tccagaagaa gatgttggcg acctcgtatt gggaatcccc gaacatcgcc 5700
tcgctccagt caatgaccgc tgttatgcgg ccattgtccg tcaggacatt gttggagccg 5760
aaatccgcgt gcacgaggtg ccggacttcg gggcagtcct cggcccaaag catcagctca 5820
tcgagagcct gcgcgacgga cgcactgacg gtgtcgtcca tcacagtttg ccagtgatac 5880
acatggggat cagcaatcgc gcatatgaaa tcacgccatg tagtgtattg accgattcct 5940
tgcggtccga atgggccgaa cccgctcgtc tggctaagat cggccgcagc gatcgcatcc 6000
atggcctccg cgaccggctg cagaacagcg ggcagttcgg tttcaggcag gtcttgcaac 6060
gtgacaccct gtgcacggcg ggagatgcaa taggtcaggc tctcgctaaa ttccccaatg 6120
tcaagcactt ccggaatcgg gagcgcggcc gatgcaaagt gccgataaac ataacgatct 6180
ttgtagaaac catcggcgca gctatttacc cgcaggacat atccacgccc tcctacatcg 6240
aagctgaaag cacgagattc ttcgccctcc gagagctgca tcaggtcgga gacgctgtcg 6300
aacttttcga tcagaaactt ctcgacagac gtcgcggtga gttcaggctt tttcatggtg 6360
gcggcactag taagggcgaa ttcggagcct gcttttttgt acaaacttgt tgatatctgc 6420
agaattccac cacactggac tagtggatcc gagctcggta ccaagcttct tcacgacacc 6480
tgaaatggaa gaaaaaaact ttgaaccact gtctgaggct tgagaatgaa ccaagatcca 6540
aactcaaaaa gggcaaattc caaggagaat tacatcaagt gccaagctgg cctaacttca 6600
gtctccaccc actcagtgtg gggaaactcc atcgcataaa acccctcccc ccaacctaaa 6660
gacgacgtac tccaaaagct cgagaactaa tcgaggtgcc tggacggcgc ccggtactcc 6720
gtggagtcac atgaagcgac ggctgaggac ggaaaggccc ttttcctttg tgtgggtgac 6780
tcacccgccc gctctcccga gcgccgcgtc ctccattttg agctccctgc agcagggccg 6840
ggaagcggcc atctttccgc tcacgcaact ggtgccgacc gggccagcct tgccgcccag 6900
ggcggggcga tacacggcgg cgcgaggcca ggcaccagag caggccggcc agcttgagac 6960
tacccccgtc cgattctcgg tggccgcgct cgcaggcccc gcctcgccga acatgtgcgc 7020
tgggacgcac gggccccgtc gccgcccgcg gccccaaaaa ccgaaatacc agtgtgcaga 7080
tcttggcccg catttacaag actatcttgc cagaaaaaaa gcgtcgcagc aggtcatcaa 7140
aaattttaaa tggctagaga cttatcgaaa gcagcgagac aggcgcgaag gtgccaccag 7200
attcgcacgc ggcggcccca gcgcccaggc caggcctcaa ctcaagcacg aggcgaaggg 7260
gctccttaag cgcaaggcct cgaactctcc cacccacttc caacccgaag ctcgggatca 7320
agaatcacgt actgcagcca ggtggaagta attcaaggca cgcaagggcc ataacccgta 7380
aagaggccag gcccgcggga accacacacg gcacttacct gtgttctggc ggcaaacccg 7440
ttgcgaaaaa gaacgttcac ggcgactact gcacttatat acggttctcc cccaccctcg 7500
ggaaaaaggc ggagccagta cacgacatca ctttcccagt ttaccccgcg ccaccttctc 7560
taggcaccgg ttcaattgcc gacccctccc cccaacttct cggggactgt gggcgatgtg 7620
cgctctgccc actgacgggc accggagcct cacgatcgat atgtcgagtt tactccctat 7680
cagtgataga gaacgtatgt cgagtttact ccctatcagt gatagagaac gatgtcgagt 7740
ttactcccta tcagtgatag agaacgtatg tcgagtttac tccctatcag tgatagagaa 7800
cgtatgtcga gtttactccc tatcagtgat agagaacgta tgtcgagttt atccctatca 7860
gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt atgtcgaggt 7920
aggcgtgtac ggtgggaggc ctatataagc agagctcgtt tagtgaaccg tcagatcgcc 7980
tggagaattg gctaggcacc ggtgacaagt ttgtacaaaa aagcaggctc cgaattcgcc 8040
cttactagtg ccgccaccat gattaagatc gctacgcgga agtacctggg gaaacagaac 8100
gtctacgaca taggtgtgga gcgcgatcac aactttgctc tgaaaaatgg atttatcgcc 8160
agcaactgta gggagttgat ttcagactat gcaccagatt gttttgtgtt aatagaaatg 8220
aatggcaagt tagtcaaaac tacgattgaa gaactcattc cactcaaata tacccgaaat 8280
actagtggca gcggcgccac aaacttctct ctgctaaagc aagcaggtga tgttgaagaa 8340
aaccccgggc ctggcgcgcc aatggtgagc aagggcgagg cagtgatcaa ggagttcatg 8400
cggttcaagg tgcacatgga gggctccatg aacggccacg agttcgagat cgagggcgag 8460
ggcgagggcc gcccctacga gggcacccag accgccaagc tgaaggtgac caagggtggc 8520
cccctgccct tctcctggga catcctgtcc cctcagttca tgtacggctc cagggccttc 8580
accaagcacc ccgccgacat ccccgactac tataagcagt ccttccccga gggcttcaag 8640
tgggagcgcg tgatgaactt cgaggacggc ggcgccgtga ccgtgaccca ggacacctcc 8700
ctggaggacg gcaccctgat ctacaaggtg aagctccgtg gcaccaactt ccctcctgac 8760
ggccccgtaa tgcagaagaa gacaatgggc tgggaagcgt ccaccgagcg gttgtacccc 8820
gaggacggcg tgctgaaggg cgacattaag atggccctgc gcctgaagga cggcggccgc 8880
tacctggcgg acttcaagac cacctacaag gccaagaagc ccgtgcagat gcccggcgcc 8940
tacaacgtcg accgcaagtt ggacatcacc tcccacaacg aggactacac cgtggtggaa 9000
cagtacgaac gctccgaggg ccgccactcc accggcggca tggacgagct gtacaagtga 9060
ttaattaaaa gggcgaattc gacccagctt tcttgtacaa agtggttgat atccagcaca 9120
gtggcggccg ctcgagtcta gagggcccgc ggttcgaagg taagcctatc cctaaccctc 9180
tcctcggtct cgattctacg cgtaccggtt aggggcccgt ttaaacccgc tgatcagcct 9240
cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga 9300
ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt 9360
gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg 9420
attgggaaga caatagcagg catgctgggg atgcggtggg ctctatggct ctagaagtcg 9480
acagtactaa gctttgacag aaaagcccca tccttaggcc tcctccttcc tagtctcctg 9540
atattgggtc taacccccac ctcctgttag gcagattcct tatctggtga cacaccccca 9600
tttcctggag ccatctctct ccttgccaga acctctaagg tttgcttacg atggagccag 9660
agaggatcct gggagggaga gcttggcagg gggtgggagg gaaggggggg atgcgtgacc 9720
tgcccggttc tcagtggcca ccctgcgcta ccctctccca gaacctgagc tgctctgacg 9780
cggctgtctg gtgcgtttca ctgatcctgg tgctgcagct tccttacact tcccaagagg 9840
agaagcagtt tggaaaaaca aaatcagaat aagttggtcc tgagttctaa ctttggctct 9900
tcacctttct agtccccaat ttatattgtt cctccgtgcg tcagttttac ctgtgagata 9960
aggccagtag ccagccccgt cctggcaggg ctgtggtgag gaggggggtg tccgtgtgga 10020
aaactccctt tgtgagaatg gtgcgtccta ggtgttcacc aggtcgtggc cgcctctact 10080
ccctttctct ttctccatcc ttctttcctt aaagagtccc cagtgctatc tgggacatat 10140
tcctccgccc agagcagggt cccgcttccc taaggccctg ctctgggctt ctgggtttga 10200
gtccttggca agcccaggag aggcgctcag gcttccctgt cccccttcct cgtccaccat 10260
ctcatgcccc tggctctcct gccccttccc tacaggggtt cctggctctg ctctagcgat 10320
cgccaattcg ccctatagtg agtcgtatta caattcactg gccgtcgttt tacaacgtcg 10380
tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc 10440
cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct 10500
gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac 10560
gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc 10620
ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt 10680
agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg 10740
ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac 10800
gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta 10860
ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat 10920
ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt aggtg 10975
<210> 204
<211> 11947
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 204
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggatttcgg 5340
gtatatttga gtggaatgag ttcttcaatc gtagttttga ctaacttgcc attcatttct 5400
attaacacaa aacaatctgg tgcatagtct gaaatcaact ccctacacat accacaagga 5460
cttaccactc gaatacttct atctacttcg tcagaataag ggtgtctaac agctacaatc 5520
gtgtcaaaat ccttttgtcc attcgaaact gcactaccaa tcgcaatggc ttctgcacaa 5580
acagttactc gtcctatata cgcttcaata tgtactgccg aaatgatttc tcctgttttc 5640
gtacgaattg ccgctcccac atgatgttta ttatcctcat aaagcattgt aatcttctct 5700
gtcgctactt ctactaattc tagatcctgt tgagaaatgt taaatgtttt catggtggcg 5760
gcactagtaa gggcgaattc ggagcctgct tttttgtaca aacttgttga tatctgcaga 5820
attccaccac actggactag tggatccgag ctcggtacca agcttcttca cgacacctga 5880
aatggaagaa aaaaactttg aaccactgtc tgaggcttga gaatgaacca agatccaaac 5940
tcaaaaaggg caaattccaa ggagaattac atcaagtgcc aagctggcct aacttcagtc 6000
tccacccact cagtgtgggg aaactccatc gcataaaacc cctcccccca acctaaagac 6060
gacgtactcc aaaagctcga gaactaatcg aggtgcctgg acggcgcccg gtactccgtg 6120
gagtcacatg aagcgacggc tgaggacgga aaggcccttt tcctttgtgt gggtgactca 6180
cccgcccgct ctcccgagcg ccgcgtcctc cattttgagc tccctgcagc agggccggga 6240
agcggccatc tttccgctca cgcaactggt gccgaccggg ccagccttgc cgcccagggc 6300
ggggcgatac acggcggcgc gaggccaggc accagagcag gccggccagc ttgagactac 6360
ccccgtccga ttctcggtgg ccgcgctcgc aggccccgcc tcgccgaaca tgtgcgctgg 6420
gacgcacggg ccccgtcgcc gcccgcggcc ccaaaaaccg aaataccagt gtgcagatct 6480
tggcccgcat ttacaagact atcttgccag aaaaaaagcg tcgcagcagg tcatcaaaaa 6540
ttttaaatgg ctagagactt atcgaaagca gcgagacagg cgcgaaggtg ccaccagatt 6600
cgcacgcggc ggccccagcg cccaggccag gcctcaactc aagcacgagg cgaaggggct 6660
ccttaagcgc aaggcctcga actctcccac ccacttccaa cccgaagctc gggatcaaga 6720
atcacgtact gcagccaggt ggaagtaatt caaggcacgc aagggccata acccgtaaag 6780
aggccaggcc cgcgggaacc acacacggca cttacctgtg ttctggcggc aaacccgttg 6840
cgaaaaagaa cgttcacggc gactactgca cttatatacg gttctccccc accctcggga 6900
aaaaggcgga gccagtacac gacatcactt tcccagttta ccccgcgcca ccttctctag 6960
gcaccggttc aattgccgac ccctcccccc aacttctcgg ggactgtggg cgatgtgcgc 7020
tctgcccact gacgggcacc ggagcctcac gatcgatatg tcgagtttac tccctatcag 7080
tgatagagaa cgtatgtcga gtttactccc tatcagtgat agagaacgat gtcgagttta 7140
ctccctatca gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt 7200
atgtcgagtt tactccctat cagtgataga gaacgtatgt cgagtttatc cctatcagtg 7260
atagagaacg tatgtcgagt ttactcccta tcagtgatag agaacgtatg tcgaggtagg 7320
cgtgtacggt gggaggccta tataagcaga gctcgtttag tgaaccgtca gatcgcctgg 7380
agaattggct aggcaccggt gacaagtttg tacaaaaaag caggctccga attcgccctt 7440
actagtgccg ccaccatgaa aaagcctgaa ctcaccgcga cgtctgtcga gaagtttctg 7500
atcgaaaagt tcgacagcgt ctccgacctg atgcagctct cggagggcga agaatctcgt 7560
gctttcagct tcgatgtagg agggcgtgga tatgtcctgc gggtaaatag ctgcgccgat 7620
ggtttctaca aagatcgtta tgtttatcgg cactttgcat cggccgcgct cccgattccg 7680
gaagtgcttg acattgggga atttagcgag agcctgacct attgcatctc ccgccgtgca 7740
cagggtgtca cgttgcaaga cctgcctgaa accgaactgc ccgctgttct gcagccggtc 7800
gcggaggcca tggatgcgat cgctgcggcc gatcttagcc agacgagcgg gttcggccca 7860
ttcggaccgc aaggaatcgg tcaatacact acatggcgtg atttcatatg cgcgattgct 7920
gatccccatg tgtatcactg gcaaactgtg atggacgaca ccgtcagtgc gtccgtcgcg 7980
caggctctcg atgagctgat gctttgggcc gaggactgcc ccgaagtccg gcacctcgtg 8040
cacgcggatt tcggctccaa caatgtcctg acggacaatg gccgcataac agcggtcatt 8100
gactggagcg aggcgatgtt cggggattcc caatacgagg tcgccaacat cttcttctgg 8160
aggccgtggt tggcttgtat ggagcagcag acgcgctact tcgagcggag gcatccggag 8220
cttgcaggat cgccgcggct ccgggcgtat atgctccgca ttggtcttga ccaactctat 8280
cagagcttgg ttgacggcaa tttcgatgat gcagcttggg cgcagggtcg atgcgacgca 8340
atcgtccgat ccggagccgg gactgtcggg cgtacacaaa tcgcccgcag aagcgcggcc 8400
gtctggaccg atggctgtgt agaagtactc gccgatagtg gaaaccgacg ccccagcact 8460
cgtccgaggg caaaggaaac tagtggcagc ggcgccacaa acttctctct gctaaagcaa 8520
gcaggtgatg ttgaagaaaa ccccgggcct ggcgcgccaa tgaatacact cgagatggac 8580
atcatcagcg tggctctgaa gaggcactcc accaaggctt tcgacgcttc caagaaactg 8640
acccctgaac aggccgagca gatcaagacc ctgctccagt acagccctag ctccaccaac 8700
agccagcctt ggcacttcat cgtggctagc accgaggaag gcaaagctag ggtggctaag 8760
agcgccgctg gcaactacgt gttcaacgag aggaagatgc tggatgctag ccacgtggtg 8820
gtgttctgcg ctaagaccgc catggacgat gtgtggctga agctggtggt ggatcaggaa 8880
gatgctgatg gcaggttcgc tacccctgaa gctaaggccg ctaacgacaa gggcaggaag 8940
ttcttcgccg acatgcacag gaaggatctg cacgatgatg ctgagtggat ggccaagcag 9000
gtgtacctga acgtgggcaa cttcctgctc ggcgtggctg ccctgggcct cgatgctgtg 9060
cccatcgaag gcttcgatgc tgctatcctg gatgccgagt tcggcctgaa ggagaaaggc 9120
tacaccagcc tggtggtggt gcctgtgggc caccacagcg tggaggactt caacgctacc 9180
ctgcctaaga gcaggctgcc ccagaacatc accctgaccg aggtgggccg gccaggctcg 9240
ggccagtgta ctaattatgc tctcttgaaa ttggctggag atgttgagag caacccaggt 9300
cccttaatta agatggtgag caagggcgag gagctgttca ccggggtggt gcccatcctg 9360
gtcgagctgg acggcgacgt aaacggccac aagttcagcg tgtccggcga gggcgagggc 9420
gatgccacct acggcaagct gaccctgaag ttcatctgca ccaccggcaa gctgcccgtg 9480
ccctggccca ccctcgtgac caccctgacc tacggcgtgc agtgcttcag ccgctacccc 9540
gaccacatga agcagcacga cttcttcaag tccgccatgc ccgaaggcta cgtccaggag 9600
cgcaccatct tcttcaagga cgacggcaac tacaagaccc gcgccgaggt gaagttcgag 9660
ggcgacaccc tggtgaaccg catcgagctg aagggcatcg acttcaagga ggacggcaac 9720
atcctggggc acaagctgga gtacaactac aacagccaca acgtctatat catggccgac 9780
aagcagaaga acggcatcaa ggtgaacttc aagatccgcc acaacatcga ggacggcagc 9840
gtgcagctcg ccgaccacta ccagcagaac acccccatcg gcgacggccc cgtgctgctg 9900
cccgacaacc actacctgag cacccagtcc gccctgagca aagaccccaa cgagaagcgc 9960
gatcacatgg tcctgctgga gttcgtgacc gccgccggga tcactctcgg catggacgag 10020
ctgtacaagt aattaattaa gagggcgaat tcgacccagc tttcttgtac aaagtggttg 10080
atatccagca cagtggcggc cgctcgagtc tagagggccc gcggttcgaa ggtaagccta 10140
tccctaaccc tctcctcggt ctcgattcta cgcgtaccgg ttaggggccc gtttaaaccc 10200
gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg 10260
tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa 10320
ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcaggaca 10380
gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg 10440
ctctagaagt cgacagtact aagctttgac agaaaagccc catccttagg cctcctcctt 10500
cctagtctcc tgatattggg tctaaccccc acctcctgtt aggcagattc cttatctggt 10560
gacacacccc catttcctgg agccatctct ctccttgcca gaacctctaa ggtttgctta 10620
cgatggagcc agagaggatc ctgggaggga gagcttggca gggggtggga gggaaggggg 10680
ggatgcgtga cctgcccggt tctcagtggc caccctgcgc taccctctcc cagaacctga 10740
gctgctctga cgcggctgtc tggtgcgttt cactgatcct ggtgctgcag cttccttaca 10800
cttcccaaga ggagaagcag tttggaaaaa caaaatcaga ataagttggt cctgagttct 10860
aactttggct cttcaccttt ctagtcccca atttatattg ttcctccgtg cgtcagtttt 10920
acctgtgaga taaggccagt agccagcccc gtcctggcag ggctgtggtg aggagggggg 10980
tgtccgtgtg gaaaactccc tttgtgagaa tggtgcgtcc taggtgttca ccaggtcgtg 11040
gccgcctcta ctccctttct ctttctccat ccttctttcc ttaaagagtc cccagtgcta 11100
tctgggacat attcctccgc ccagagcagg gtcccgcttc cctaaggccc tgctctgggc 11160
ttctgggttt gagtccttgg caagcccagg agaggcgctc aggcttccct gtcccccttc 11220
ctcgtccacc atctcatgcc cctggctctc ctgccccttc cctacagggg ttcctggctc 11280
tgctctagcg atcgccaatt cgccctatag tgagtcgtat tacaattcac tggccgtcgt 11340
tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca 11400
tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca 11460
gttgcgcagc ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg 11520
tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 11580
cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 11640
ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 11700
ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 11760
gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 11820
tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa 11880
aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat 11940
ttaggtg 11947
<210> 205
<211> 11938
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 205
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggatttcgg 5340
gtatatttga gtggaatgag ttcttcaatc gtagttttga ctaacttgcc attcatttct 5400
attaacacaa aacaatctgg tgcatagtct gaaatcaact ccctacacat accacaagga 5460
cttaccactc gaatacttct atctacttcg tcagaataag ggtgtctaac agctacaatc 5520
gtgtcaaaat ccttttgtcc attcgaaact gcactaccaa tcgcaatggc ttctgcacaa 5580
acagttactc gtcctatata cgcttcaata tgtactgccg aaatgatttc tcctgttttc 5640
gtacgaattg ccgctcccac atgatgttta ttatcctcat aaagcattgt aatcttctct 5700
gtcgctactt ctactaattc tagatcctgt tgagaaatgt taaatgtttt catggtggcg 5760
gcactagtaa gggcgaattc ggagcctgct tttttgtaca aacttgttga tatctgcaga 5820
attccaccac actggactag tggatccgag ctcggtacca agcttcttca cgacacctga 5880
aatggaagaa aaaaactttg aaccactgtc tgaggcttga gaatgaacca agatccaaac 5940
tcaaaaaggg caaattccaa ggagaattac atcaagtgcc aagctggcct aacttcagtc 6000
tccacccact cagtgtgggg aaactccatc gcataaaacc cctcccccca acctaaagac 6060
gacgtactcc aaaagctcga gaactaatcg aggtgcctgg acggcgcccg gtactccgtg 6120
gagtcacatg aagcgacggc tgaggacgga aaggcccttt tcctttgtgt gggtgactca 6180
cccgcccgct ctcccgagcg ccgcgtcctc cattttgagc tccctgcagc agggccggga 6240
agcggccatc tttccgctca cgcaactggt gccgaccggg ccagccttgc cgcccagggc 6300
ggggcgatac acggcggcgc gaggccaggc accagagcag gccggccagc ttgagactac 6360
ccccgtccga ttctcggtgg ccgcgctcgc aggccccgcc tcgccgaaca tgtgcgctgg 6420
gacgcacggg ccccgtcgcc gcccgcggcc ccaaaaaccg aaataccagt gtgcagatct 6480
tggcccgcat ttacaagact atcttgccag aaaaaaagcg tcgcagcagg tcatcaaaaa 6540
ttttaaatgg ctagagactt atcgaaagca gcgagacagg cgcgaaggtg ccaccagatt 6600
cgcacgcggc ggccccagcg cccaggccag gcctcaactc aagcacgagg cgaaggggct 6660
ccttaagcgc aaggcctcga actctcccac ccacttccaa cccgaagctc gggatcaaga 6720
atcacgtact gcagccaggt ggaagtaatt caaggcacgc aagggccata acccgtaaag 6780
aggccaggcc cgcgggaacc acacacggca cttacctgtg ttctggcggc aaacccgttg 6840
cgaaaaagaa cgttcacggc gactactgca cttatatacg gttctccccc accctcggga 6900
aaaaggcgga gccagtacac gacatcactt tcccagttta ccccgcgcca ccttctctag 6960
gcaccggttc aattgccgac ccctcccccc aacttctcgg ggactgtggg cgatgtgcgc 7020
tctgcccact gacgggcacc ggagcctcac gatcgatatg tcgagtttac tccctatcag 7080
tgatagagaa cgtatgtcga gtttactccc tatcagtgat agagaacgat gtcgagttta 7140
ctccctatca gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt 7200
atgtcgagtt tactccctat cagtgataga gaacgtatgt cgagtttatc cctatcagtg 7260
atagagaacg tatgtcgagt ttactcccta tcagtgatag agaacgtatg tcgaggtagg 7320
cgtgtacggt gggaggccta tataagcaga gctcgtttag tgaaccgtca gatcgcctgg 7380
agaattggct aggcaccggt gacaagtttg tacaaaaaag caggctccga attcgccctt 7440
actagtgccg ccaccatgaa aaagcctgaa ctcaccgcga cgtctgtcga gaagtttctg 7500
atcgaaaagt tcgacagcgt ctccgacctg atgcagctct cggagggcga agaatctcgt 7560
gctttcagct tcgatgtagg agggcgtgga tatgtcctgc gggtaaatag ctgcgccgat 7620
ggtttctaca aagatcgtta tgtttatcgg cactttgcat cggccgcgct cccgattccg 7680
gaagtgcttg acattgggga atttagcgag agcctgacct attgcatctc ccgccgtgca 7740
cagggtgtca cgttgcaaga cctgcctgaa accgaactgc ccgctgttct gcagccggtc 7800
gcggaggcca tggatgcgat cgctgcggcc gatcttagcc agacgagcgg gttcggccca 7860
ttcggaccgc aaggaatcgg tcaatacact acatggcgtg atttcatatg cgcgattgct 7920
gatccccatg tgtatcactg gcaaactgtg atggacgaca ccgtcagtgc gtccgtcgcg 7980
caggctctcg atgagctgat gctttgggcc gaggactgcc ccgaagtccg gcacctcgtg 8040
cacgcggatt tcggctccaa caatgtcctg acggacaatg gccgcataac agcggtcatt 8100
gactggagcg aggcgatgtt cggggattcc caatacgagg tcgccaacat cttcttctgg 8160
aggccgtggt tggcttgtat ggagcagcag acgcgctact tcgagcggag gcatccggag 8220
cttgcaggat cgccgcggct ccgggcgtat atgctccgca ttggtcttga ccaactctat 8280
cagagcttgg ttgacggcaa tttcgatgat gcagcttggg cgcagggtcg atgcgacgca 8340
atcgtccgat ccggagccgg gactgtcggg cgtacacaaa tcgcccgcag aagcgcggcc 8400
gtctggaccg atggctgtgt agaagtactc gccgatagtg gaaaccgacg ccccagcact 8460
cgtccgaggg caaaggaaac tagtggcagc ggcgccacaa acttctctct gctaaagcaa 8520
gcaggtgatg ttgaagaaaa ccccgggcct ggcgcgccaa tgaatacact cgagatggac 8580
atcatcagcg tggctctgaa gaggcactcc accaaggctt tcgacgcttc caagaaactg 8640
acccctgaac aggccgagca gatcaagacc ctgctccagt acagccctag ctccaccaac 8700
agccagcctt ggcacttcat cgtggctagc accgaggaag gcaaagctag ggtggctaag 8760
agcgccgctg gcaactacgt gttcaacgag aggaagatgc tggatgctag ccacgtggtg 8820
gtgttctgcg ctaagaccgc catggacgat gtgtggctga agctggtggt ggatcaggaa 8880
gatgctgatg gcaggttcgc tacccctgaa gctaaggccg ctaacgacaa gggcaggaag 8940
ttcttcgccg acatgcacag gaaggatctg cacgatgatg ctgagtggat ggccaagcag 9000
gtgtacctga acgtgggcaa cttcctgctc ggcgtggctg ccctgggcct cgatgctgtg 9060
cccatcgaag gcttcgatgc tgctatcctg gatgccgagt tcggcctgaa ggagaaaggc 9120
tacaccagcc tggtggtggt gcctgtgggc caccacagcg tggaggactt caacgctacc 9180
ctgcctaaga gcaggctgcc ccagaacatc accctgaccg aggtgggccg gccaggctcg 9240
ggccagtgta ctaattatgc tctcttgaaa ttggctggag atgttgagag caacccaggt 9300
cccttaatta agatggtgag caagggcgag gaggataaca tggccatcat caaggagttc 9360
atgcgcttca aggtgcacat ggagggctcc gtgaacggcc acgagttcga gatcgagggc 9420
gagggcgagg gccgccccta cgagggcacc cagaccgcca agctgaaggt gaccaagggt 9480
ggccccctgc ccttcgcctg ggacatcctg tcccctcagt tcatgtacgg ctccaaggcc 9540
tacgtgaagc accccgccga catccccgac tacttgaagc tgtccttccc cgagggcttc 9600
aagtgggagc gcgtgatgaa cttcgaggac ggcggcgtgg tgaccgtgac ccaggactcc 9660
tccctgcagg acggcgagtt catctacaag gtgaagctgc gcggcaccaa cttcccctcc 9720
gacggccccg taatgcagaa gaagaccatg ggctgggagg cctcctccga gcggatgtac 9780
cccgaggacg gcgccctgaa gggcgagatc aagcagaggc tgaagctgaa ggacggcggc 9840
cactacgacg ctgaggtcaa gaccacctac aaggccaaga agcccgtgca gctgcccggc 9900
gcctacaacg tcaacatcaa gttggacatc acctcccaca acgaggacta caccatcgtg 9960
gaacagtacg aacgcgccga gggccgccac tccaccggcg gcatggacga gctgtacaag 10020
taattaatta aaagggcgaa ttcgacccag ctttcttgta caaagtggtt gatatccagc 10080
acagtggcgg ccgctcgagt ctagagggcc cgcggttcga aggtaagcct atccctaacc 10140
ctctcctcgg tctcgattct acgcgtaccg gttaggggcc cgtttaaacc cgctgatcag 10200
cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct 10260
tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc 10320
attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg 10380
aggattggga agacaatagc aggcatgctg gggatgcggt gggctctatg gctctagaag 10440
tcgacagtac taagctttga cagaaaagcc ccatccttag gcctcctcct tcctagtctc 10500
ctgatattgg gtctaacccc cacctcctgt taggcagatt ccttatctgg tgacacaccc 10560
ccatttcctg gagccatctc tctccttgcc agaacctcta aggtttgctt acgatggagc 10620
cagagaggat cctgggaggg agagcttggc agggggtggg agggaagggg gggatgcgtg 10680
acctgcccgg ttctcagtgg ccaccctgcg ctaccctctc ccagaacctg agctgctctg 10740
acgcggctgt ctggtgcgtt tcactgatcc tggtgctgca gcttccttac acttcccaag 10800
aggagaagca gtttggaaaa acaaaatcag aataagttgg tcctgagttc taactttggc 10860
tcttcacctt tctagtcccc aatttatatt gttcctccgt gcgtcagttt tacctgtgag 10920
ataaggccag tagccagccc cgtcctggca gggctgtggt gaggaggggg gtgtccgtgt 10980
ggaaaactcc ctttgtgaga atggtgcgtc ctaggtgttc accaggtcgt ggccgcctct 11040
actccctttc tctttctcca tccttctttc cttaaagagt ccccagtgct atctgggaca 11100
tattcctccg cccagagcag ggtcccgctt ccctaaggcc ctgctctggg cttctgggtt 11160
tgagtccttg gcaagcccag gagaggcgct caggcttccc tgtccccctt cctcgtccac 11220
catctcatgc ccctggctct cctgcccctt ccctacaggg gttcctggct ctgctctagc 11280
gatcgccaat tcgccctata gtgagtcgta ttacaattca ctggccgtcg ttttacaacg 11340
tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt 11400
cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag 11460
cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt 11520
tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt 11580
cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc 11640
tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg attagggtga 11700
tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc 11760
cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggt 11820
ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct 11880
gatttaacaa aaatttaacg cgaattttaa caaaatatta acgcttacaa tttaggtg 11938
<210> 206
<211> 11497
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 206
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggatttcgg 5340
gtatatttga gtggaatgag ttcttcaatc gtagttttga ctaacttgcc attcatttct 5400
attaacacaa aacaatctgg tgcatagtct gaaatcaact ccctacacat accacaagga 5460
cttaccactc gaatacttct atctacttcg tcagaataag ggtgtctaac agctacaatc 5520
gtgtcaaaat ccttttgtcc attcgaaact gcactaccaa tcgcaatggc ttctgcacaa 5580
acagttactc gtcctatata cgcttcaata tgtactgccg aaatgatttc tcctgttttc 5640
gtacgaattg ccgctcccac atgatgttta ttatcctcat aaagcattgt aatcttctct 5700
gtcgctactt ctactaattc tagatcctgt tgagaaatgt taaatgtttt catggtggcg 5760
gcactagtaa gggcgaattc ggagcctgct tttttgtaca aacttgttga tatctgcaga 5820
attccaccac actggactag tggatccgag ctcggtacca agcttcttca cgacacctga 5880
aatggaagaa aaaaactttg aaccactgtc tgaggcttga gaatgaacca agatccaaac 5940
tcaaaaaggg caaattccaa ggagaattac atcaagtgcc aagctggcct aacttcagtc 6000
tccacccact cagtgtgggg aaactccatc gcataaaacc cctcccccca acctaaagac 6060
gacgtactcc aaaagctcga gaactaatcg aggtgcctgg acggcgcccg gtactccgtg 6120
gagtcacatg aagcgacggc tgaggacgga aaggcccttt tcctttgtgt gggtgactca 6180
cccgcccgct ctcccgagcg ccgcgtcctc cattttgagc tccctgcagc agggccggga 6240
agcggccatc tttccgctca cgcaactggt gccgaccggg ccagccttgc cgcccagggc 6300
ggggcgatac acggcggcgc gaggccaggc accagagcag gccggccagc ttgagactac 6360
ccccgtccga ttctcggtgg ccgcgctcgc aggccccgcc tcgccgaaca tgtgcgctgg 6420
gacgcacggg ccccgtcgcc gcccgcggcc ccaaaaaccg aaataccagt gtgcagatct 6480
tggcccgcat ttacaagact atcttgccag aaaaaaagcg tcgcagcagg tcatcaaaaa 6540
ttttaaatgg ctagagactt atcgaaagca gcgagacagg cgcgaaggtg ccaccagatt 6600
cgcacgcggc ggccccagcg cccaggccag gcctcaactc aagcacgagg cgaaggggct 6660
ccttaagcgc aaggcctcga actctcccac ccacttccaa cccgaagctc gggatcaaga 6720
atcacgtact gcagccaggt ggaagtaatt caaggcacgc aagggccata acccgtaaag 6780
aggccaggcc cgcgggaacc acacacggca cttacctgtg ttctggcggc aaacccgttg 6840
cgaaaaagaa cgttcacggc gactactgca cttatatacg gttctccccc accctcggga 6900
aaaaggcgga gccagtacac gacatcactt tcccagttta ccccgcgcca ccttctctag 6960
gcaccggttc aattgccgac ccctcccccc aacttctcgg ggactgtggg cgatgtgcgc 7020
tctgcccact gacgggcacc ggagcctcac gatcgatatg tcgagtttac tccctatcag 7080
tgatagagaa cgtatgtcga gtttactccc tatcagtgat agagaacgat gtcgagttta 7140
ctccctatca gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt 7200
atgtcgagtt tactccctat cagtgataga gaacgtatgt cgagtttatc cctatcagtg 7260
atagagaacg tatgtcgagt ttactcccta tcagtgatag agaacgtatg tcgaggtagg 7320
cgtgtacggt gggaggccta tataagcaga gctcgtttag tgaaccgtca gatcgcctgg 7380
agaattggct aggcaccggt gacaagtttg tacaaaaaag caggctccga attcgccctt 7440
actagtgccg ccaccatgaa aaagcctgaa ctcaccgcga cgtctgtcga gaagtttctg 7500
atcgaaaagt tcgacagcgt ctccgacctg atgcagctct cggagggcga agaatctcgt 7560
gctttcagct tcgatgtagg agggcgtgga tatgtcctgc gggtaaatag ctgcgccgat 7620
ggtttctaca aagatcgtta tgtttatcgg cactttgcat cggccgcgct cccgattccg 7680
gaagtgcttg acattgggga atttagcgag agcctgacct attgcctttc atacgagacc 7740
gagatcctga ctgtcgagta cggattgctt cctatcggca aaatcgtgga gaagaggatt 7800
gaatgtaccg tctattcagt cgataataat gggaacatct acacacagcc cgtggctcaa 7860
tggcacgaca gaggagagca ggaagttttt gaatactgtc tcgaggacgg atccctcatc 7920
cgcgctacta aagatcataa gtttatgacc gtggacggcc agatgctgcc aattgacgaa 7980
atttttgaac gagagctgga tctgatgaga gtcgacaacc ttccaaacac tagtggcagc 8040
ggcgccacaa acttctctct gctaaagcaa gcaggtgatg ttgaagaaaa ccccgggcct 8100
ggcgcgccaa tgaatacact cgagatggac atcatcagcg tggctctgaa gaggcactcc 8160
accaaggctt tcgacgcttc caagaaactg acccctgaac aggccgagca gatcaagacc 8220
ctgctccagt acagccctag ctccaccaac agccagcctt ggcacttcat cgtggctagc 8280
accgaggaag gcaaagctag ggtggctaag agcgccgctg gcaactacgt gttcaacgag 8340
aggaagatgc tggatgctag ccacgtggtg gtgttctgcg ctaagaccgc catggacgat 8400
gtgtggctga agctggtggt ggatcaggaa gatgctgatg gcaggttcgc tacccctgaa 8460
gctaaggccg ctaacgacaa gggcaggaag ttcttcgccg acatgcacag gaaggatctg 8520
cacgatgatg ctgagtggat ggccaagcag gtgtacctga acgtgggcaa cttcctgctc 8580
ggcgtggctg ccctgggcct cgatgctgtg cccatcgaag gcttcgatgc tgctatcctg 8640
gatgccgagt tcggcctgaa ggagaaaggc tacaccagcc tggtggtggt gcctgtgggc 8700
caccacagcg tggaggactt caacgctacc ctgcctaaga gcaggctgcc ccagaacatc 8760
accctgaccg aggtgggccg gccaggctcg ggccagtgta ctaattatgc tctcttgaaa 8820
ttggctggag atgttgagag caacccaggt cccttaatta agatggtgag caagggcgag 8880
gagctgttca ccggggtggt gcccatcctg gtcgagctgg acggcgacgt aaacggccac 8940
aagttcagcg tgtccggcga gggcgagggc gatgccacct acggcaagct gaccctgaag 9000
ttcatctgca ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc 9060
tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag 9120
tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac 9180
tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg 9240
aagggcatcg acttcaagga ggacggcaac atcctggggc acaagctgga gtacaactac 9300
aacagccaca acgtctatat catggccgac aagcagaaga acggcatcaa ggtgaacttc 9360
aagatccgcc acaacatcga ggacggcagc gtgcagctcg ccgaccacta ccagcagaac 9420
acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc 9480
gccctgagca aagaccccaa cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc 9540
gccgccggga tcactctcgg catggacgag ctgtacaagt aattaattaa gagggcgaat 9600
tcgacccagc tttcttgtac aaagtggttg atatccagca cagtggcggc cgctcgagtc 9660
tagagggccc gcggttcgaa ggtaagccta tccctaaccc tctcctcggt ctcgattcta 9720
cgcgtaccgg ttaggggccc gtttaaaccc gctgatcagc ctcgactgtg ccttctagtt 9780
gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc 9840
ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt 9900
ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca 9960
ggcatgctgg ggatgcggtg ggctctatgg ctctagaagt cgacagtact aagctttgac 10020
agaaaagccc catccttagg cctcctcctt cctagtctcc tgatattggg tctaaccccc 10080
acctcctgtt aggcagattc cttatctggt gacacacccc catttcctgg agccatctct 10140
ctccttgcca gaacctctaa ggtttgctta cgatggagcc agagaggatc ctgggaggga 10200
gagcttggca gggggtggga gggaaggggg ggatgcgtga cctgcccggt tctcagtggc 10260
caccctgcgc taccctctcc cagaacctga gctgctctga cgcggctgtc tggtgcgttt 10320
cactgatcct ggtgctgcag cttccttaca cttcccaaga ggagaagcag tttggaaaaa 10380
caaaatcaga ataagttggt cctgagttct aactttggct cttcaccttt ctagtcccca 10440
atttatattg ttcctccgtg cgtcagtttt acctgtgaga taaggccagt agccagcccc 10500
gtcctggcag ggctgtggtg aggagggggg tgtccgtgtg gaaaactccc tttgtgagaa 10560
tggtgcgtcc taggtgttca ccaggtcgtg gccgcctcta ctccctttct ctttctccat 10620
ccttctttcc ttaaagagtc cccagtgcta tctgggacat attcctccgc ccagagcagg 10680
gtcccgcttc cctaaggccc tgctctgggc ttctgggttt gagtccttgg caagcccagg 10740
agaggcgctc aggcttccct gtcccccttc ctcgtccacc atctcatgcc cctggctctc 10800
ctgccccttc cctacagggg ttcctggctc tgctctagcg atcgccaatt cgccctatag 10860
tgagtcgtat tacaattcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg 10920
cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga 10980
agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatgggacgc 11040
gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac 11100
acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt 11160
cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 11220
tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta gtgggccatc 11280
gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact 11340
cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg atttataagg 11400
gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc 11460
gaattttaac aaaatattaa cgcttacaat ttaggtg 11497
<210> 207
<211> 11779
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 207
gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 300
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600
gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 780
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 840
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 960
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 1620
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1680
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 2040
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160
agctcgaaat taaccctcac taaagggaac aaaagctgtg ctttctctga ccagcattct 2220
ctcccctggg cctgtgccgc tttctgtctg cagcttgtgg cctgggtcac ctctacggct 2280
ggcccagatc cttccctgcc gcctccttca ggttccgtct tcctccactc cctcttcccc 2340
ttgctctctg ctgtgttgct gcccaaggat gctctttccg gagcacttcc ttctcggcgc 2400
tgcaccacgt gatgtcctct gagcggatcc tccccgtgtc tgggtcctct ccgggcatct 2460
ctcctccctc acccaacccc atgccgtctt cactcgctgg gttccctttt ccttctcctt 2520
ctggggcctg tgccatctct cgtttcttag gatggccttc tccgacggat gtctcccttg 2580
cgtcccgcct ccccttcttg taggcctgca tcatcaccgt ttttctggac aaccccaaag 2640
taccccgtct ccctggcttt agccacctct ccatcctctt gctttctttg cctggacacc 2700
ccgttctcct gtggattcgg gtcacctctc actcctttca tttgggcagc tcccctaccc 2760
cccttacctc tctagtctgt gctagctctt ccagccccct gtcatggcat cttccagggg 2820
tccgagagct cagctagtct tcttcctcca acccgggccc ctatgtccac ttcaggacag 2880
catgtttgct gcctccaggg atcctgtgtc cccgagctgg gaccacctta tattcccagg 2940
gccggttaat gtggctctgg ttctgggtac ttttatctgt cccctccacc ccacagtggg 3000
gcaagcttct gacctcttct cttcctccca cagggcctcg agagatctgg cagcggagag 3060
ggcagaggaa gtcttctaac atgcggtgac gtggaggaga atcccggccc taggctcgag 3120
ggtaccatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 3180
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 3240
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 3300
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 3360
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 3420
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 3480
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 3540
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 3600
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 3660
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 3720
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 3780
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 3840
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 3900
cttcttgacg agttcttctg agtttaaacc cgctgatcag cctcgactgt gccttctagt 3960
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 4020
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4080
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4140
aggcatgctg gggatgcggt gggctctatg ggctagcggt ggcggcctcg acattgatta 4200
ttgactagta gatctggcgc gccgcctttt tacggttcct ggccttttgc tggccttttg 4260
ctcacatgtc acgtgaggcc ttaacgtctc gccctttggt ctccccctct taagtaccac 4320
atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga acctgaaaca 4380
taaaatgaat gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata 4440
aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg 4500
tttgtccaaa ctcatcggct agcttacccg gggagcatgt caaggtcaaa atcgtcaaga 4560
gcgtcagcag gcagcatatc aaggtcaaag tcgtcaaggg catcggctgg gagcatgtct 4620
aagtcaaaat cgtcaagggc gtcggtcggc ccgccgcttt cgcactttag ctgtttctcc 4680
aggccacata tgattagttc caggccgaaa aggaaggcag gttcggctcc ctgccggtcg 4740
aacagctcaa ttgcttgtct cagaagtggg ggcatagaat cggtggtagg tgtctctctt 4800
tcctcttttg ctacttgatg ctcctgttcc tccaatacgc agcccagtgt aaagtggccc 4860
acggcggaca gagcgtacag tgcgttctcc agggagaagc cttgctgaca caggaacgcg 4920
agctgatttt ccagggtttc gtactgtttc tctgttgggc gggtgccgag atgcacttta 4980
gccccgtcgc gatgtgagag gagagcacag cggtatgact tggcgttgtt ccgcagaaag 5040
tcttgccatg actcgccttc cagggggcag aagtgggtat gatgcctgtc cagcatctcg 5100
attggcaggg catcgagcag ggcccgcttg ttcttcacgt gccagtacag ggtaggctgc 5160
tcaactccca gcttttgagc gagtttcctt gtcgtcaggc cttcgatacc gacaccattg 5220
agtaattcca gagctccgtt tatgactttg ctcttgtcca gtctagacat tggaccaggg 5280
ttttcttcaa catcaccaca agtgaggaga gaacctctac cttcggcacc gggatttcgg 5340
gtatatttga gtggaatgag ttcttcaatc gtagttttga ctaacttgcc attcatttct 5400
attaacacaa aacaatctgg tgcatagtct gaaatcaact ccctacacat accacaagga 5460
cttaccactc gaatacttct atctacttcg tcagaataag ggtgtctaac agctacaatc 5520
gtgtcaaaat ccttttgtcc attcgaaact gcactaccaa tcgcaatggc ttctgcacaa 5580
acagttactc gtcctatata cgcttcaata tgtactgccg aaatgatttc tcctgttttc 5640
gtacgaattg ccgctcccac atgatgttta ttatcctcat aaagcattgt aatcttctct 5700
gtcgctactt ctactaattc tagatcctgt tgagaaatgt taaatgtttt catggtggcg 5760
gcactagtaa gggcgaattc ggagcctgct tttttgtaca aacttgttga tatctgcaga 5820
attccaccac actggactag tggatccgag ctcggtacca agcttcttca cgacacctga 5880
aatggaagaa aaaaactttg aaccactgtc tgaggcttga gaatgaacca agatccaaac 5940
tcaaaaaggg caaattccaa ggagaattac atcaagtgcc aagctggcct aacttcagtc 6000
tccacccact cagtgtgggg aaactccatc gcataaaacc cctcccccca acctaaagac 6060
gacgtactcc aaaagctcga gaactaatcg aggtgcctgg acggcgcccg gtactccgtg 6120
gagtcacatg aagcgacggc tgaggacgga aaggcccttt tcctttgtgt gggtgactca 6180
cccgcccgct ctcccgagcg ccgcgtcctc cattttgagc tccctgcagc agggccggga 6240
agcggccatc tttccgctca cgcaactggt gccgaccggg ccagccttgc cgcccagggc 6300
ggggcgatac acggcggcgc gaggccaggc accagagcag gccggccagc ttgagactac 6360
ccccgtccga ttctcggtgg ccgcgctcgc aggccccgcc tcgccgaaca tgtgcgctgg 6420
gacgcacggg ccccgtcgcc gcccgcggcc ccaaaaaccg aaataccagt gtgcagatct 6480
tggcccgcat ttacaagact atcttgccag aaaaaaagcg tcgcagcagg tcatcaaaaa 6540
ttttaaatgg ctagagactt atcgaaagca gcgagacagg cgcgaaggtg ccaccagatt 6600
cgcacgcggc ggccccagcg cccaggccag gcctcaactc aagcacgagg cgaaggggct 6660
ccttaagcgc aaggcctcga actctcccac ccacttccaa cccgaagctc gggatcaaga 6720
atcacgtact gcagccaggt ggaagtaatt caaggcacgc aagggccata acccgtaaag 6780
aggccaggcc cgcgggaacc acacacggca cttacctgtg ttctggcggc aaacccgttg 6840
cgaaaaagaa cgttcacggc gactactgca cttatatacg gttctccccc accctcggga 6900
aaaaggcgga gccagtacac gacatcactt tcccagttta ccccgcgcca ccttctctag 6960
gcaccggttc aattgccgac ccctcccccc aacttctcgg ggactgtggg cgatgtgcgc 7020
tctgcccact gacgggcacc ggagcctcac gatcgatatg tcgagtttac tccctatcag 7080
tgatagagaa cgtatgtcga gtttactccc tatcagtgat agagaacgat gtcgagttta 7140
ctccctatca gtgatagaga acgtatgtcg agtttactcc ctatcagtga tagagaacgt 7200
atgtcgagtt tactccctat cagtgataga gaacgtatgt cgagtttatc cctatcagtg 7260
atagagaacg tatgtcgagt ttactcccta tcagtgatag agaacgtatg tcgaggtagg 7320
cgtgtacggt gggaggccta tataagcaga gctcgtttag tgaaccgtca gatcgcctgg 7380
agaattggct aggcaccggt gacaagtttg tacaaaaaag caggctccga attcgccctt 7440
actagtgccg ccaccatgat taagatcgct acgcggaagt acctggggaa acagaacgtc 7500
tacgacatag gtgtggagcg cgatcacaac tttgctctga aaaatggatt tatcgccagc 7560
aactgcatct cccgccgtgc acagggtgtc acgttgcaag acctgcctga aaccgaactg 7620
cccgctgttc tgcagccggt cgcggaggcc atggatgcga tcgctgcggc cgatcttagc 7680
cagacgagcg ggttcggccc attcggaccg caaggaatcg gtcaatacac tacatggcgt 7740
gatttcatat gcgcgattgc tgatccccat gtgtatcact ggcaaactgt gatggacgac 7800
accgtcagtg cgtccgtcgc gcaggctctc gatgagctga tgctttgggc cgaggactgc 7860
cccgaagtcc ggcacctcgt gcacgcggat ttcggctcca acaatgtcct gacggacaat 7920
ggccgcataa cagcggtcat tgactggagc gaggcgatgt tcggggattc ccaatacgag 7980
gtcgccaaca tcttcttctg gaggccgtgg ttggcttgta tggagcagca gacgcgctac 8040
ttcgagcgga ggcatccgga gcttgcagga tcgccgcggc tccgggcgta tatgctccgc 8100
attggtcttg accaactcta tcagagcttg gttgacggca atttcgatga tgcagcttgg 8160
gcgcagggtc gatgcgacgc aatcgtccga tccggagccg ggactgtcgg gcgtacacaa 8220
atcgcccgca gaagcgcggc cgtctggacc gatggctgtg tagaagtact cgccgatagt 8280
ggaaaccgac gccccagcac tcgtccgagg gcaaaggaaa ctagtggcag cggcgccaca 8340
aacttctctc tgctaaagca agcaggtgat gttgaagaaa accccgggcc tggcgcgcca 8400
atgaatacac tcgagatgga catcatcagc gtggctctga agaggcactc caccaaggct 8460
ttcgacgctt ccaagaaact gacccctgaa caggccgagc agatcaagac cctgctccag 8520
tacagcccta gctccaccaa cagccagcct tggcacttca tcgtggctag caccgaggaa 8580
ggcaaagcta gggtggctaa gagcgccgct ggcaactacg tgttcaacga gaggaagatg 8640
ctggatgcta gccacgtggt ggtgttctgc gctaagaccg ccatggacga tgtgtggctg 8700
aagctggtgg tggatcagga agatgctgat ggcaggttcg ctacccctga agctaaggcc 8760
gctaacgaca agggcaggaa gttcttcgcc gacatgcaca ggaaggatct gcacgatgat 8820
gctgagtgga tggccaagca ggtgtacctg aacgtgggca acttcctgct cggcgtggct 8880
gccctgggcc tcgatgctgt gcccatcgaa ggcttcgatg ctgctatcct ggatgccgag 8940
ttcggcctga aggagaaagg ctacaccagc ctggtggtgg tgcctgtggg ccaccacagc 9000
gtggaggact tcaacgctac cctgcctaag agcaggctgc cccagaacat caccctgacc 9060
gaggtgggcc ggccaggctc gggccagtgt actaattatg ctctcttgaa attggctgga 9120
gatgttgaga gcaacccagg tcccttaatt aagatggtga gcaagggcga ggaggataac 9180
atggccatca tcaaggagtt catgcgcttc aaggtgcaca tggagggctc cgtgaacggc 9240
cacgagttcg agatcgaggg cgagggcgag ggccgcccct acgagggcac ccagaccgcc 9300
aagctgaagg tgaccaaggg tggccccctg cccttcgcct gggacatcct gtcccctcag 9360
ttcatgtacg gctccaaggc ctacgtgaag caccccgccg acatccccga ctacttgaag 9420
ctgtccttcc ccgagggctt caagtgggag cgcgtgatga acttcgagga cggcggcgtg 9480
gtgaccgtga cccaggactc ctccctgcag gacggcgagt tcatctacaa ggtgaagctg 9540
cgcggcacca acttcccctc cgacggcccc gtaatgcaga agaagaccat gggctgggag 9600
gcctcctccg agcggatgta ccccgaggac ggcgccctga agggcgagat caagcagagg 9660
ctgaagctga aggacggcgg ccactacgac gctgaggtca agaccaccta caaggccaag 9720
aagcccgtgc agctgcccgg cgcctacaac gtcaacatca agttggacat cacctcccac 9780
aacgaggact acaccatcgt ggaacagtac gaacgcgccg agggccgcca ctccaccggc 9840
ggcatggacg agctgtacaa gtaattaatt aaaagggcga attcgaccca gctttcttgt 9900
acaaagtggt tgatatccag cacagtggcg gccgctcgag tctagagggc ccgcggttcg 9960
aaggtaagcc tatccctaac cctctcctcg gtctcgattc tacgcgtacc ggttaggggc 10020
ccgtttaaac ccgctgatca gcctcgactg tgccttctag ttgccagcca tctgttgttt 10080
gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat 10140
aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg 10200
tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg 10260
tgggctctat ggctctagaa gtcgacagta ctaagctttg acagaaaagc cccatcctta 10320
ggcctcctcc ttcctagtct cctgatattg ggtctaaccc ccacctcctg ttaggcagat 10380
tccttatctg gtgacacacc cccatttcct ggagccatct ctctccttgc cagaacctct 10440
aaggtttgct tacgatggag ccagagagga tcctgggagg gagagcttgg cagggggtgg 10500
gagggaaggg ggggatgcgt gacctgcccg gttctcagtg gccaccctgc gctaccctct 10560
cccagaacct gagctgctct gacgcggctg tctggtgcgt ttcactgatc ctggtgctgc 10620
agcttcctta cacttcccaa gaggagaagc agtttggaaa aacaaaatca gaataagttg 10680
gtcctgagtt ctaactttgg ctcttcacct ttctagtccc caatttatat tgttcctccg 10740
tgcgtcagtt ttacctgtga gataaggcca gtagccagcc ccgtcctggc agggctgtgg 10800
tgaggagggg ggtgtccgtg tggaaaactc cctttgtgag aatggtgcgt cctaggtgtt 10860
caccaggtcg tggccgcctc tactcccttt ctctttctcc atccttcttt ccttaaagag 10920
tccccagtgc tatctgggac atattcctcc gcccagagca gggtcccgct tccctaaggc 10980
cctgctctgg gcttctgggt ttgagtcctt ggcaagccca ggagaggcgc tcaggcttcc 11040
ctgtccccct tcctcgtcca ccatctcatg cccctggctc tcctgcccct tccctacagg 11100
ggttcctggc tctgctctag cgatcgccaa ttcgccctat agtgagtcgt attacaattc 11160
actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg 11220
ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg 11280
cccttcccaa cagttgcgca gcctgaatgg cgaatgggac gcgccctgta gcggcgcatt 11340
aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 11400
gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 11460
agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 11520
caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 11580
tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 11640
aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc cgatttcggc 11700
ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt 11760
aacgcttaca atttaggtg 11779
<210> 208
<211> 9330
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 208
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccacca tgaaaaagcc tgaactcacc 660
gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga cctgatgcag 720
ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg tggatatgtc 780
ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta tcggcacttt 840
gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaatttag cgagagcctg 900
acctattgcc tttcatacga gaccgagatc ctgactgtcg agtacggatt gcttcctatc 960
ggcaaaatcg tggagaagag gattgaatgt accgtctatt cagtcgataa taatgggaac 1020
atctacacac agcccgtggc tcaatggcac gacagaggag agcaggaagt ttttgaatac 1080
tgtctcgagg acggatccct catccgcgct actaaagatc ataagtttat gaccgtggac 1140
ggccagatgc tgccaattga cgaaattttt gaacgagagc tggatctgat gagagtcgac 1200
aaccttccaa acggtggagg ggggtcaggc tctgcgcagc tggaaaagga gcttcaagcc 1260
ctcgaaaaaa agttggccca gctcgagtgg gagaaccagg ctctggagaa agaactggcc 1320
cagtgattaa ttaagaattc gacccagctt tcttgtacaa agtggttggt aagcctatcc 1380
ctaaccctct cctcggtctc gattctacgt agtaatgagc tagcagtctc gaggttaacg 1440
aattccgccc cccccctaac gttactggcc gaagccgctt ggaataaggc cggtgtgcgc 1500
ttgtctatat gttattttcc accatattgc cgtcttttgg caatgtgagg gcccggaaac 1560
ctggccctgt cttcttgacg agcattccta ggggtctttc ccctctcgcc aaaggaatgc 1620
aaggtctgtt gaatgtcgtg aaggaagcag ttcctctgga agcttcttga agacaaacaa 1680
cgtctgtagc gaccctttgc aggcagcgga accccccacc tggcgacagg tgcccctgcg 1740
gccaaaagcc acgtgtataa gatacacctg caaaggcggc acaaccccag tgccacgttg 1800
tgagttggat agttgtggaa agagtcaaat ggctctcctc aagcgtattc aacaaggggc 1860
tgaaggatgc ccagaaggta ccccattgta tgggatctga tctggggcct cggtgcacat 1920
gctttacatg tgtttagtcg aggttaaaaa aacgtctagg ccccccgaac cacggggacg 1980
tggttttcct ttgaaaaaca cgataatacc atggccatga gcgagctgat taaggagaac 2040
atgcacatga agctgtacat ggagggcacc gtggacaacc atcacttcaa gtgcacatcc 2100
gagggcgaag gcaagcccta cgagggcacc cagaccatga gaatcaaggt ggtcgagggc 2160
ggccctctcc ccttcgcctt cgacatcctg gctactagct tcctctacgg cagcaagacc 2220
ttcatcaacc acacccaggg catccccgac ttcttcaagc agtccttccc tgagggcttc 2280
acatgggaga gagtcaccac atacgaagac gggggcgtgc tgaccgctac ccaggacacc 2340
agcctccagg acggctgcct catctacaac gtcaagatca gaggggtgaa cttcacatcc 2400
aacggccctg tgatgcagaa gaaaacactc ggctgggagg ccttcaccga gacgctgtac 2460
cccgctgacg gcggcctgga aggcagaaac gacatggccc tgaagctcgt gggcgggagc 2520
catctgatcg caaacatcaa gaccacatat agatccaaga aacccgctaa gaacctcaag 2580
atgcctggcg tctactatgt ggactacaga ctggaaagaa tcaaggaggc caacaacgag 2640
acctacgtcg agcagcacga ggtggcagtg gccagatact gcgacctccc tagcaaactg 2700
gggcacaagc ttaattaaca ccggtggcgc gttaagtcga caatcaacct ctggattaca 2760
aaatttgtga aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat 2820
acgctgcttt aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct 2880
ccttgtataa atcctggttg ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac 2940
gtggcgtggt gtgcactgtg tttgctgacg caacccccac tggttggggc attgccacca 3000
cctgtcagct cctttccggg actttcgctt tccccctccc tattgccacg gcggaactca 3060
tcgccgcctg ccttgcccgc tgctggacag gggctcggct gttgggcact gacaattccg 3120
tggtgttgtc ggggaaatca tcgtcctttc cttggctgct cgcctgtgtt gccacctgga 3180
ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct caatccagcg gaccttcctt 3240
cccgcggcct gctgccggct ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga 3300
gtcggatctc cctttgggcc gcctccccgc gtcgacttta agaccaatga cttacaaggc 3360
agctgtagat cttagccact ttttaaaaga aaagggggga ctggaagggc taattcactc 3420
ccaacgaaga caagatctgc tttttgcttg tactgggtct ctctggttag accagatctg 3480
agcctgggag ctctctggct aactagggaa cccactgctt aagcctcaat aaagcttgcc 3540
ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac tctggtaact agagatccct 3600
cagacccttt tagtcagtgt ggaaaatctc tagcagtacg tatagtagtt catgtcatct 3660
tattattcag tatttataac ttgcaaagaa atgaatatca gagagtgaga ggaacttgtt 3720
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 3780
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 3840
ctggctctag ctatcccgcc cctaactccg cccatcccgc ccctaactcc gcccagttcc 3900
gcccattctc cgccccatgg ctgactaatt ttttttattt atgcagaggc cgaggccgcc 3960
tcggcctctg agctattcca gaagtagtga ggaggctttt ttggaggcct agggacgtac 4020
ccaattcgcc ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc 4080
gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg 4140
ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc 4200
tgaatggcga atgggacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta 4260
cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc 4320
cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt 4380
tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg 4440
gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca 4500
cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct 4560
attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga 4620
tttaacaaaa atttaacgcg aattttaaca aaatattaac gcttacaatt taggtggcac 4680
ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat 4740
gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag 4800
tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc 4860
tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc 4920
acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc 4980
cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc 5040
ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt 5100
ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt 5160
atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat 5220
cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct 5280
tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat 5340
gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc 5400
ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg 5460
ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc 5520
tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta 5580
cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc 5640
ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga 5700
tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat 5760
gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat 5820
caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa 5880
accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa 5940
ggtaactggc ttcagcagag cgcagatacc aaatactgtt cttctagtgt agccgtagtt 6000
aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt 6060
accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata 6120
gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt 6180
ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac 6240
gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga 6300
gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg 6360
ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa 6420
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat 6480
gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc 6540
tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga 6600
agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg 6660
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta 6720
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg 6780
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagcg 6840
cgcaattaac cctcactaaa gggaacaaaa gctggagctg caagcttaat gtagtcttat 6900
gcaatactct tgtagtcttg caacatggta acgatgagtt agcaacatgc cttacaagga 6960
gagaaaaagc accgtgcatg ccgattggtg gaagtaaggt ggtacgatcg tgccttatta 7020
ggaaggcaac agacgggtct gacatggatt ggacgaacca ctgaattgcc gcattgcaga 7080
gatattgtat ttaagtgcct agctcgatac ataaacgggt ctctctggtt agaccagatc 7140
tgagcctggg agctctctgg ctaactaggg aacccactgc ttaagcctca ataaagcttg 7200
ccttgagtgc ttcaagtagt gtgtgcccgt ctgttgtgtg actctggtaa ctagagatcc 7260
ctcagaccct tttagtcagt gtggaaaatc tctagcagtg gcgcccgaac agggacttga 7320
aagcgaaagg gaaaccagag gagctctctc gacgcaggac tcggcttgct gaagcgcgca 7380
cggcaagagg cgaggggcgg cgactggtga gtacgccaaa aattttgact agcggaggct 7440
agaaggagag agatgggtgc gagagcgtca gtattaagcg ggggagaatt agatcgcgat 7500
gggaaaaaat tcggttaagg ccagggggaa agaaaaaata taaattaaaa catatagtat 7560
gggcaagcag ggagctagaa cgattcgcag ttaatcctgg cctgttagaa acatcagaag 7620
gctgtagaca aatactggga cagctacaac catcccttca gacaggatca gaagaactta 7680
gatcattata taatacagta gcaaccctct attgtgtgca tcaaaggata gagataaaag 7740
acaccaagga agctttagac aagatagagg aagagcaaaa caaaagtaag accaccgcac 7800
agcaagcggc cgctgatctt cagacctgga ggaggagata tgagggacaa ttggagaagt 7860
gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 7920
aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg 7980
ttcttgggag cagcaggaag cactatgggc gcagcgtcaa tgacgctgac ggtacaggcc 8040
agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg 8100
caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg 8160
gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa 8220
ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag 8280
atttggaatc acacgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta 8340
atacactcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg 8400
gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat 8460
ataaaattat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta 8520
ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc 8580
ccaaccccga ggggaccctt gcgccttttc caaggcagcc ctgggtttgc gcagggacgc 8640
ggctgctctg ggcgtggttc cgggaaacgc agcggcgccg accctgggtc tcgcacattc 8700
ttcacgtccg ttcgcagcgt cacccggatc ttcgccgcta cccttgtggg ccccccggcg 8760
acgcttcctg ctccgcccct aagtcgggaa ggttccttgc ggttcgcggc gtgccggacg 8820
tgacaaacgg aagccgcacg tctcactagt accctcgcag acggacagcg ccagggagca 8880
atggcagcgc gccgaccgcg atgggctgtg gccaatagcg gctgctcagc agggcgcgcc 8940
gagagcagcg gccgggaagg ggcggtgcgg gaggcggggt gtggggcggt agtgtgggcc 9000
ctgttcctgc ccgcgcggtg ttccgcattc tgcaagcctc cggagcgcac gtcggcagtc 9060
ggctccctcg ttgaccgaat caccgacctc tctccccagg gggtacccag ctgtctagag 9120
aattctagat cttgagacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga 9180
ttggggggta cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta 9240
aagaattaca aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca 9300
gagatccact ttggcgccgg ctcgaggggg 9330
<210> 209
<211> 228
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 209
Met Lys Lys Pro Glu Leu Thr Ala Thr Ser Val Glu Lys Phe Leu Ile
1               5                   10                  15
Glu Lys Phe Asp Ser Val Ser Asp Leu Met Gln Leu Ser Glu Gly Glu
            20                  25                  30
Glu Ser Arg Ala Phe Ser Phe Asp Val Gly Gly Arg Gly Tyr Val Leu
        35                  40                  45
Arg Val Asn Ser Cys Ala Asp Gly Phe Tyr Lys Asp Arg Tyr Val Tyr
    50                  55                  60
Arg His Phe Ala Ser Ala Ala Leu Pro Ile Pro Glu Val Leu Asp Ile
65                  70                  75                  80
Gly Glu Phe Ser Glu Ser Leu Thr Tyr Cys Leu Ser Tyr Glu Thr Glu
                85                  90                  95
Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu
            100                 105                 110
Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile
        115                 120                 125
Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val
    130                 135                 140
Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp
145                 150                 155                 160
His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile
                165                 170                 175
Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn Gly
            180                 185                 190
Gly Gly Gly Ser Gly Ser Ala Gln Leu Glu Lys Glu Leu Gln Ala Leu
        195                 200                 205
Glu Lys Lys Leu Ala Gln Leu Glu Trp Glu Asn Gln Ala Leu Glu Lys
    210                 215                 220
Glu Leu Ala Gln
225
<210> 210
<211> 9534
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 210
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtgggcg agccccacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgcatc tcccgccgtg cacagggtgt cacgttgcaa 900
gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 960
atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 1020
ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 1080
tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 1140
atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctgt 1200
atcagtggcg actccctgat ctcactcgca agcactggaa agcgagttag catcaaggac 1260
ttgctggacg aaaaggattt cgaaatttgg gcaatcaatg agcagaccat gaaactggag 1320
tctgcaaagg tgtcccgggt gttttgcacg ggtaagaagc ttgtttatat ccttaaaact 1380
agactgggcc ggacgatcaa agccaccgcg aaccacagat tcttgacaat cgacgggtgg 1440
aaacggctgg acgaactgag cttgaaggag cacatcgccc ttcctcggaa gctcgagtca 1500
tcttccctgc agctgtgatt aattaagaat tcgacccagc tttcttgtac aaagtggttg 1560
gtaagcctat ccctaaccct ctcctcggtc tcgattctac gtagtaatga gctagcagtc 1620
tcgaggttaa cgaattccgc ccccccccta acgttactgg ccgaagccgc ttggaataag 1680
gccggtgtgc gcttgtctat atgttatttt ccaccatatt gccgtctttt ggcaatgtga 1740
gggcccggaa acctggccct gtcttcttga cgagcattcc taggggtctt tcccctctcg 1800
ccaaaggaat gcaaggtctg ttgaatgtcg tgaaggaagc agttcctctg gaagcttctt 1860
gaagacaaac aacgtctgta gcgacccttt gcaggcagcg gaacccccca cctggcgaca 1920
ggtgcccctg cggccaaaag ccacgtgtat aagatacacc tgcaaaggcg gcacaacccc 1980
agtgccacgt tgtgagttgg atagttgtgg aaagagtcaa atggctctcc tcaagcgtat 2040
tcaacaaggg gctgaaggat gcccagaagg taccccattg tatgggatct gatctggggc 2100
ctcggtgcac atgctttaca tgtgtttagt cgaggttaaa aaaacgtcta ggccccccga 2160
accacgggga cgtggttttc ctttgaaaaa cacgataata ccatggtgag caagggcgag 2220
gagctgttca ccggggtggt gcccatcctg gtcgagctgg acggcgacgt aaacggccac 2280
aagttcagcg tgtccggcga gggcgagggc gatgccacct acggcaagct gaccctgaag 2340
ttcatctgca ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc 2400
tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag 2460
tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac 2520
tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg 2580
aagggcatcg acttcaagga ggacggcaac atcctggggc acaagctgga gtacaactac 2640
aacagccaca acgtctatat catggccgac aagcagaaga acggcatcaa ggtgaacttc 2700
aagatccgcc acaacatcga ggacggcagc gtgcagctcg ccgaccacta ccagcagaac 2760
acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc 2820
gccctgagca aagaccccaa cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc 2880
gccgccggga tcactctcgg catggacgag ctgtacaagt aacaccggtg gcgcgttaag 2940
tcgacaatca acctctggat tacaaaattt gtgaaagatt gactggtatt cttaactatg 3000
ttgctccttt tacgctatgt ggatacgctg ctttaatgcc tttgtatcat gctattgctt 3060
cccgtatggc tttcattttc tcctccttgt ataaatcctg gttgctgtct ctttatgagg 3120
agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc 3180
ccactggttg gggcattgcc accacctgtc agctcctttc cgggactttc gctttccccc 3240
tccctattgc cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg acaggggctc 3300
ggctgttggg cactgacaat tccgtggtgt tgtcggggaa atcatcgtcc tttccttggc 3360
tgctcgcctg tgttgccacc tggattctgc gcgggacgtc cttctgctac gtcccttcgg 3420
ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc 3480
gtcttcgcct tcgccctcag acgagtcgga tctccctttg ggccgcctcc ccgcgtcgac 3540
tttaagacca atgacttaca aggcagctgt agatcttagc cactttttaa aagaaaaggg 3600
gggactggaa gggctaattc actcccaacg aagacaagat ctgctttttg cttgtactgg 3660
gtctctctgg ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact 3720
gcttaagcct caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg 3780
tgactctggt aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagcag 3840
tacgtatagt agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat 3900
atcagagagt gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag 3960
catcacaaat ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa 4020
actcatcaat gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccatc 4080
ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt 4140
atttatgcag aggccgaggc cgcctcggcc tctgagctat tccagaagta gtgaggaggc 4200
ttttttggag gcctagggac gtacccaatt cgccctatag tgagtcgtat tacgcgcgct 4260
cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc 4320
gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc 4380
gcccttccca acagttgcgc agcctgaatg gcgaatggga cgcgccctgt agcggcgcat 4440
taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 4500
cgcccgctcc tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc 4560
aagctctaaa tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 4620
ccaaaaaact tgattagggt gatggttcac gtagtgggcc atcgccctga tagacggttt 4680
ttcgcccttt gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa 4740
caacactcaa ccctatctcg gtctattctt ttgatttata agggattttg ccgatttcgg 4800
cctattggtt aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat 4860
taacgcttac aatttaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 4920
atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 4980
tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc 5040
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa 5100
agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg 5160
taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt 5220
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg 5280
catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac 5340
ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc 5400
ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa 5460
catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc 5520
aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc gcaaactatt 5580
aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga 5640
taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa 5700
atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa 5760
gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa 5820
tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt 5880
ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt 5940
gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg 6000
agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt 6060
aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca 6120
agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac 6180
tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac 6240
atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct 6300
taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg 6360
gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga gatacctaca 6420
gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt 6480
aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta 6540
tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc 6600
gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc 6660
cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa 6720
ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag 6780
cgagtcagtg agcgaggaag cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg 6840
ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga 6900
gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 6960
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 7020
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctgga 7080
gctgcaagct taatgtagtc ttatgcaata ctcttgtagt cttgcaacat ggtaacgatg 7140
agttagcaac atgccttaca aggagagaaa aagcaccgtg catgccgatt ggtggaagta 7200
aggtggtacg atcgtgcctt attaggaagg caacagacgg gtctgacatg gattggacga 7260
accactgaat tgccgcattg cagagatatt gtatttaagt gcctagctcg atacataaac 7320
gggtctctct ggttagacca gatctgagcc tgggagctct ctggctaact agggaaccca 7380
ctgcttaagc ctcaataaag cttgccttga gtgcttcaag tagtgtgtgc ccgtctgttg 7440
tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa aatctctagc 7500
agtggcgccc gaacagggac ttgaaagcga aagggaaacc agaggagctc tctcgacgca 7560
ggactcggct tgctgaagcg cgcacggcaa gaggcgaggg gcggcgactg gtgagtacgc 7620
caaaaatttt gactagcgga ggctagaagg agagagatgg gtgcgagagc gtcagtatta 7680
agcgggggag aattagatcg cgatgggaaa aaattcggtt aaggccaggg ggaaagaaaa 7740
aatataaatt aaaacatata gtatgggcaa gcagggagct agaacgattc gcagttaatc 7800
ctggcctgtt agaaacatca gaaggctgta gacaaatact gggacagcta caaccatccc 7860
ttcagacagg atcagaagaa cttagatcat tatataatac agtagcaacc ctctattgtg 7920
tgcatcaaag gatagagata aaagacacca aggaagcttt agacaagata gaggaagagc 7980
aaaacaaaag taagaccacc gcacagcaag cggccgctga tcttcagacc tggaggagga 8040
gatatgaggg acaattggag aagtgaatta tataaatata aagtagtaaa aattgaacca 8100
ttaggagtag cacccaccaa ggcaaagaga agagtggtgc agagagaaaa aagagcagtg 8160
ggaataggag ctttgttcct tgggttcttg ggagcagcag gaagcactat gggcgcagcg 8220
tcaatgacgc tgacggtaca ggccagacaa ttattgtctg gtatagtgca gcagcagaac 8280
aatttgctga gggctattga ggcgcaacag catctgttgc aactcacagt ctggggcatc 8340
aagcagctcc aggcaagaat cctggctgtg gaaagatacc taaaggatca acagctcctg 8400
gggatttggg gttgctctgg aaaactcatt tgcaccactg ctgtgccttg gaatgctagt 8460
tggagtaata aatctctgga acagatttgg aatcacacga cctggatgga gtgggacaga 8520
gaaattaaca attacacaag cttaatacac tccttaattg aagaatcgca aaaccagcaa 8580
gaaaagaatg aacaagaatt attggaatta gataaatggg caagtttgtg gaattggttt 8640
aacataacaa attggctgtg gtatataaaa ttattcataa tgatagtagg aggcttggta 8700
ggtttaagaa tagtttttgc tgtactttct atagtgaata gagttaggca gggatattca 8760
ccattatcgt ttcagaccca cctcccaacc ccgaggggac ccttgcgcct tttccaaggc 8820
agccctgggt ttgcgcaggg acgcggctgc tctgggcgtg gttccgggaa acgcagcggc 8880
gccgaccctg ggtctcgcac attcttcacg tccgttcgca gcgtcacccg gatcttcgcc 8940
gctacccttg tgggcccccc ggcgacgctt cctgctccgc ccctaagtcg ggaaggttcc 9000
ttgcggttcg cggcgtgccg gacgtgacaa acggaagccg cacgtctcac tagtaccctc 9060
gcagacggac agcgccaggg agcaatggca gcgcgccgac cgcgatgggc tgtggccaat 9120
agcggctgct cagcagggcg cgccgagagc agcggccggg aaggggcggt gcgggaggcg 9180
gggtgtgggg cggtagtgtg ggccctgttc ctgcccgcgc ggtgttccgc attctgcaag 9240
cctccggagc gcacgtcggc agtcggctcc ctcgttgacc gaatcaccga cctctctccc 9300
cagggggtac ccagctgtct agagaattct agatcttgag acaaatggca gtattcatcc 9360
acaattttaa aagaaaaggg gggattgggg ggtacagtgc aggggaaaga atagtagaca 9420
taatagcaac agacatacaa actaaagaat tacaaaaaca aattacaaaa attcaaaatt 9480
ttcgggttta ttacagggac agcagagatc cactttggcg ccggctcgag gggg 9534
<210> 211
<211> 291
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 211
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Gly Glu Pro His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Ile Ser Arg Arg Ala
65 70 75 80
Gln Gly Val Thr Leu Gln Asp Leu Pro Glu Thr Glu Leu Pro Ala Val
85 90 95
Leu Gln Pro Val Ala Glu Ala Met Asp Ala Ile Ala Ala Ala Asp Leu
100 105 110
Ser Gln Thr Ser Gly Phe Gly Pro Phe Gly Pro Gln Gly Ile Gly Gln
115 120 125
Tyr Thr Thr Trp Arg Asp Phe Ile Cys Ala Ile Ala Asp Pro His Val
130 135 140
Tyr His Trp Gln Thr Val Met Asp Asp Thr Val Ser Ala Ser Val Ala
145 150 155 160
Gln Ala Leu Asp Glu Leu Met Leu Trp Ala Glu Asp Cys Pro Glu Val
165 170 175
Arg His Leu Val His Ala Asp Phe Gly Cys Ile Ser Gly Asp Ser Leu
180 185 190
Ile Ser Leu Ala Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu
195 200 205
Asp Glu Lys Asp Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys
210 215 220
Leu Glu Ser Ala Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu
225 230 235 240
Val Tyr Ile Leu Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala
245 250 255
Asn His Arg Phe Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu
260 265 270
Ser Leu Lys Glu His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser
275 280 285
Leu Gln Leu
290
<210> 212
<211> 9345
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 212
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatgagtcc cgaaatcgaa 660
aagctctctc agagcgatat atattgggac tccatcgtaa gcataacaga gacgggggtc 720
gaggaggtgt tcgatctgac agttcctggg cctcataatt tcgtagcgaa cgacatcatt 780
gtacataact ccaacaatgt cctgacggac aatggccgca taacagcggt cattgactgg 840
agcgaggcga tgttcgggga ttcccaatac gaggtcgcca acatcttctt ctggaggccg 900
tggttggctt gcctttcata cgagaccgag atcctgactg tcgagtacgg attgcttcct 960
atcggcaaaa tcgtggagaa gaggattgaa tgtaccgtct attcagtcga taataatggg 1020
aacatctaca cacagcccgt ggctcaatgg cacgacagag gagagcagga agtttttgaa 1080
tactgtctcg aggacggatc cctcatccgc gctactaaag atcataagtt tatgaccgtg 1140
gacggccaga tgctgccaat tgacgaaatt tttgaacgag agctggatct gatgagagtc 1200
gacaaccttc caaacggtgg aggggggtca ggctctgcgc agctggaaaa ggagcttcaa 1260
gccctcgaaa aaaagttggc ccagctcgag tgggagaacc aggctctgga gaaagaactg 1320
gcccagtgat taattaagaa ttcgacccag ctttcttgta caaagtggtt ggtaagccta 1380
tccctaaccc tctcctcggt ctcgattcta cgtagtaatg agctagcagt ctcgaggtta 1440
acgaattccg ccccccccct aacgttactg gccgaagccg cttggaataa ggccggtgtg 1500
cgcttgtcta tatgttattt tccaccatat tgccgtcttt tggcaatgtg agggcccgga 1560
aacctggccc tgtcttcttg acgagcattc ctaggggtct ttcccctctc gccaaaggaa 1620
tgcaaggtct gttgaatgtc gtgaaggaag cagttcctct ggaagcttct tgaagacaaa 1680
caacgtctgt agcgaccctt tgcaggcagc ggaacccccc acctggcgac aggtgcccct 1740
gcggccaaaa gccacgtgta taagatacac ctgcaaaggc ggcacaaccc cagtgccacg 1800
ttgtgagttg gatagttgtg gaaagagtca aatggctctc ctcaagcgta ttcaacaagg 1860
ggctgaagga tgcccagaag gtaccccatt gtatgggatc tgatctgggg cctcggtgca 1920
catgctttac atgtgtttag tcgaggttaa aaaaacgtct aggccccccg aaccacgggg 1980
acgtggtttt cctttgaaaa acacgataat accatggtga gcaagggcga ggagctgttc 2040
accggggtgg tgcccatcct ggtcgagctg gacggcgacg taaacggcca caagttcagc 2100
gtgtccggcg agggcgaggg cgatgccacc tacggcaagc tgaccctgaa gttcatctgc 2160
accaccggca agctgcccgt gccctggccc accctcgtga ccaccctgac ctacggcgtg 2220
cagtgcttca gccgctaccc cgaccacatg aagcagcacg acttcttcaa gtccgccatg 2280
cccgaaggct acgtccagga gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc 2340
cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc gcatcgagct gaagggcatc 2400
gacttcaagg aggacggcaa catcctgggg cacaagctgg agtacaacta caacagccac 2460
aacgtctata tcatggccga caagcagaag aacggcatca aggtgaactt caagatccgc 2520
cacaacatcg aggacggcag cgtgcagctc gccgaccact accagcagaa cacccccatc 2580
ggcgacggcc ccgtgctgct gcccgacaac cactacctga gcacccagtc cgccctgagc 2640
aaagacccca acgagaagcg cgatcacatg gtcctgctgg agttcgtgac cgccgccggg 2700
atcactctcg gcatggacga gctgtacaag taacaccggt ggcgcgttaa gtcgacaatc 2760
aacctctgga ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt 2820
ttacgctatg tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg 2880
ctttcatttt ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc 2940
ccgttgtcag gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt 3000
ggggcattgc caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg 3060
ccacggcgga actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg 3120
gcactgacaa ttccgtggtg ttgtcgggga aatcatcgtc ctttccttgg ctgctcgcct 3180
gtgttgccac ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc 3240
cagcggacct tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc 3300
ttcgccctca gacgagtcgg atctcccttt gggccgcctc cccgcgtcga ctttaagacc 3360
aatgacttac aaggcagctg tagatcttag ccacttttta aaagaaaagg ggggactgga 3420
agggctaatt cactcccaac gaagacaaga tctgcttttt gcttgtactg ggtctctctg 3480
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 3540
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 3600
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtacgtatag 3660
tagttcatgt catcttatta ttcagtattt ataacttgca aagaaatgaa tatcagagag 3720
tgagaggaac ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa 3780
tttcacaaat aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa 3840
tgtatcttat catgtctggc tctagctatc ccgcccctaa ctccgcccat cccgccccta 3900
actccgccca gttccgccca ttctccgccc catggctgac taattttttt tatttatgca 3960
gaggccgagg ccgcctcggc ctctgagcta ttccagaagt agtgaggagg cttttttgga 4020
ggcctaggga cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg 4080
tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag 4140
cacatccccc tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc 4200
aacagttgcg cagcctgaat ggcgaatggg acgcgccctg tagcggcgca ttaagcgcgg 4260
cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc 4320
ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa 4380
atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac 4440
ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt tttcgccctt 4500
tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga acaacactca 4560
accctatctc ggtctattct tttgatttat aagggatttt gccgatttcg gcctattggt 4620
taaaaaatga gctgatttaa caaaaattta acgcgaattt taacaaaata ttaacgctta 4680
caatttaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta 4740
aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata 4800
ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc 4860
ggcattttgc cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga 4920
agatcagttg ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct 4980
tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg 5040
tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta 5100
ttctcagaat gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat 5160
gacagtaaga gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt 5220
acttctgaca acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga 5280
tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga 5340
gcgtgacacc acgatgcctg tagcaatggc aacaacgttg cgcaaactat taactggcga 5400
actacttact ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc 5460
aggaccactt ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc 5520
cggtgagcgt gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg 5580
tatcgtagtt atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat 5640
cgctgagata ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata 5700
tatactttag attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct 5760
ttttgataat ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga 5820
ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg 5880
cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc 5940
aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgttcttct 6000
agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc 6060
tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt 6120
ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg 6180
cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct 6240
atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag 6300
ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag 6360
tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg 6420
gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg 6480
gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac 6540
cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt 6600
gagcgaggaa gcggaagagc gcccaatacg caaaccgcct ctccccgcgc gttggccgat 6660
tcattaatgc agctggcacg acaggtttcc cgactggaaa gcgggcagtg agcgcaacgc 6720
aattaatgtg agttagctca ctcattaggc accccaggct ttacacttta tgcttccggc 6780
tcgtatgttg tgtggaattg tgagcggata acaatttcac acaggaaaca gctatgacca 6840
tgattacgcc aagcgcgcaa ttaaccctca ctaaagggaa caaaagctgg agctgcaagc 6900
ttaatgtagt cttatgcaat actcttgtag tcttgcaaca tggtaacgat gagttagcaa 6960
catgccttac aaggagagaa aaagcaccgt gcatgccgat tggtggaagt aaggtggtac 7020
gatcgtgcct tattaggaag gcaacagacg ggtctgacat ggattggacg aaccactgaa 7080
ttgccgcatt gcagagatat tgtatttaag tgcctagctc gatacataaa cgggtctctc 7140
tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc actgcttaag 7200
cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct 7260
ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag cagtggcgcc 7320
cgaacaggga cttgaaagcg aaagggaaac cagaggagct ctctcgacgc aggactcggc 7380
ttgctgaagc gcgcacggca agaggcgagg ggcggcgact ggtgagtacg ccaaaaattt 7440
tgactagcgg aggctagaag gagagagatg ggtgcgagag cgtcagtatt aagcggggga 7500
gaattagatc gcgatgggaa aaaattcggt taaggccagg gggaaagaaa aaatataaat 7560
taaaacatat agtatgggca agcagggagc tagaacgatt cgcagttaat cctggcctgt 7620
tagaaacatc agaaggctgt agacaaatac tgggacagct acaaccatcc cttcagacag 7680
gatcagaaga acttagatca ttatataata cagtagcaac cctctattgt gtgcatcaaa 7740
ggatagagat aaaagacacc aaggaagctt tagacaagat agaggaagag caaaacaaaa 7800
gtaagaccac cgcacagcaa gcggccgctg atcttcagac ctggaggagg agatatgagg 7860
gacaattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 7920
gcacccacca aggcaaagag aagagtggtg cagagagaaa aaagagcagt gggaatagga 7980
gctttgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 8040
ctgacggtac aggccagaca attattgtct ggtatagtgc agcagcagaa caatttgctg 8100
agggctattg aggcgcaaca gcatctgttg caactcacag tctggggcat caagcagctc 8160
caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 8220
ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 8280
aaatctctgg aacagatttg gaatcacacg acctggatgg agtgggacag agaaattaac 8340
aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 8400
gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 8460
aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 8520
atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc accattatcg 8580
tttcagaccc acctcccaac cccgagggga cccttgcgcc ttttccaagg cagccctggg 8640
tttgcgcagg gacgcggctg ctctgggcgt ggttccggga aacgcagcgg cgccgaccct 8700
gggtctcgca cattcttcac gtccgttcgc agcgtcaccc ggatcttcgc cgctaccctt 8760
gtgggccccc cggcgacgct tcctgctccg cccctaagtc gggaaggttc cttgcggttc 8820
gcggcgtgcc ggacgtgaca aacggaagcc gcacgtctca ctagtaccct cgcagacgga 8880
cagcgccagg gagcaatggc agcgcgccga ccgcgatggg ctgtggccaa tagcggctgc 8940
tcagcagggc gcgccgagag cagcggccgg gaaggggcgg tgcgggaggc ggggtgtggg 9000
gcggtagtgt gggccctgtt cctgcccgcg cggtgttccg cattctgcaa gcctccggag 9060
cgcacgtcgg cagtcggctc cctcgttgac cgaatcaccg acctctctcc ccagggggta 9120
cccagctgtc tagagaattc tagatcttga gacaaatggc agtattcatc cacaatttta 9180
aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtagac ataatagcaa 9240
cagacataca aactaaagaa ttacaaaaac aaattacaaa aattcaaaat tttcgggttt 9300
attacaggga cagcagagat ccactttggc gccggctcga ggggg 9345
<210> 213
<211> 228
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 213
Met Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp
1 5 10 15
Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu
20 25 30
Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His
35 40 45
Asn Ser Asn Asn Val Leu Thr Asp Asn Gly Arg Ile Thr Ala Val Ile
50 55 60
Asp Trp Ser Glu Ala Met Phe Gly Asp Ser Gln Tyr Glu Val Ala Asn
65 70 75 80
Ile Phe Phe Trp Arg Pro Trp Leu Ala Cys Leu Ser Tyr Glu Thr Glu
85 90 95
Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu
100 105 110
Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile
115 120 125
Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val
130 135 140
Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp
145 150 155 160
His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile
165 170 175
Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn Gly
180 185 190
Gly Gly Gly Ser Gly Ser Ala Gln Leu Glu Lys Glu Leu Gln Ala Leu
195 200 205
Glu Lys Lys Leu Ala Gln Leu Glu Trp Glu Asn Gln Ala Leu Glu Lys
210 215 220
Glu Leu Ala Gln
225
<210> 214
<211> 9177
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 214
cccggggtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 60
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 120
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 180
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 240
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 300
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 360
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 420
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 480
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 540
gtgtacggtg ggaggtctat ataagcagag ctctctggct aactgtcggg atcaacaagt 600
ttgtacaaaa aagcaggctc cgaattcacc ggtgccgcca ccatggctca actgaagaag 660
aaactccagg ccaataagaa agaactggca cagctgaagt ggaagctgca ggcactgaaa 720
aagaagctgg cacagggcgg aggaggctca ggcagtatga ttaagatcgc tacgcggaag 780
tacctgggga aacagaacgt ctacgacata ggtgtgggcg agccccacaa ctttgctctg 840
aaaaatggat ttatcgccag caactgtatg gagcagcaga cgcgctactt cgagcggagg 900
catccggagc ttgcaggatc gccgcggctc cgggcgtata tgctccgcat tggtcttgac 960
caactctatc agagcttggt tgacggcaat ttcgatgatg cagcttgggc gcagggtcga 1020
tgcgacgcaa tcgtccgatc cggagccggg actgtcgggc gtacacaaat cgcccgcaga 1080
agcgcggccg tctggaccga tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc 1140
cccagcactc gtccgagggc aaaggaatag ttaattaaga attcgaccca gctttcttgt 1200
acaaagtggt tggtaagcct atccctaacc ctctcctcgg tctcgattct acgtagtaat 1260
gagctagcag tctcgaggtt aacgaattcc gccccccccc taacgttact ggccgaagcc 1320
gcttggaata aggccggtgt gcgcttgtct atatgttatt ttccaccata ttgccgtctt 1380
ttggcaatgt gagggcccgg aaacctggcc ctgtcttctt gacgagcatt cctaggggtc 1440
tttcccctct cgccaaagga atgcaaggtc tgttgaatgt cgtgaaggaa gcagttcctc 1500
tggaagcttc ttgaagacaa acaacgtctg tagcgaccct ttgcaggcag cggaaccccc 1560
cacctggcga caggtgcccc tgcggccaaa agccacgtgt ataagataca cctgcaaagg 1620
cggcacaacc ccagtgccac gttgtgagtt ggatagttgt ggaaagagtc aaatggctct 1680
cctcaagcgt attcaacaag gggctgaagg atgcccagaa ggtaccccat tgtatgggat 1740
ctgatctggg gcctcggtgc acatgcttta catgtgttta gtcgaggtta aaaaaacgtc 1800
taggcccccc gaaccacggg gacgtggttt tcctttgaaa aacacgataa taccatggtg 1860
agcaagggcg aggaggataa catggccatc atcaaggagt tcatgcgctt caaggtgcac 1920
atggagggct ccgtgaacgg ccacgagttc gagatcgagg gcgagggcga gggccgcccc 1980
tacgagggca cccagaccgc caagctgaag gtgaccaagg gtggccccct gcccttcgcc 2040
tgggacatcc tgtcccctca gttcatgtac ggctccaagg cctacgtgaa gcaccccgcc 2100
gacatccccg actacttgaa gctgtccttc cccgagggct tcaagtggga gcgcgtgatg 2160
aacttcgagg acggcggcgt ggtgaccgtg acccaggact cctccctgca ggacggcgag 2220
ttcatctaca aggtgaagct gcgcggcacc aacttcccct ccgacggccc cgtaatgcag 2280
aagaagacca tgggctggga ggcctcctcc gagcggatgt accccgagga cggcgccctg 2340
aagggcgaga tcaagcagag gctgaagctg aaggacggcg gccactacga cgctgaggtc 2400
aagaccacct acaaggccaa gaagcccgtg cagctgcccg gcgcctacaa cgtcaacatc 2460
aagttggaca tcacctccca caacgaggac tacaccatcg tggaacagta cgaacgcgcc 2520
gagggccgcc actccaccgg cggcatggac gagctgtaca agtaacaccg gtggcgcgtt 2580
aagtcgacaa tcaacctctg gattacaaaa tttgtgaaag attgactggt attcttaact 2640
atgttgctcc ttttacgcta tgtggatacg ctgctttaat gcctttgtat catgctattg 2700
cttcccgtat ggctttcatt ttctcctcct tgtataaatc ctggttgctg tctctttatg 2760
aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg cactgtgttt gctgacgcaa 2820
cccccactgg ttggggcatt gccaccacct gtcagctcct ttccgggact ttcgctttcc 2880
ccctccctat tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg 2940
ctcggctgtt gggcactgac aattccgtgg tgttgtcggg gaaatcatcg tcctttcctt 3000
ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac gtccttctgc tacgtccctt 3060
cggccctcaa tccagcggac cttccttccc gcggcctgct gccggctctg cggcctcttc 3120
cgcgtcttcg ccttcgccct cagacgagtc ggatctccct ttgggccgcc tccccgcgtc 3180
gactttaaga ccaatgactt acaaggcagc tgtagatctt agccactttt taaaagaaaa 3240
ggggggactg gaagggctaa ttcactccca acgaagacaa gatctgcttt ttgcttgtac 3300
tgggtctctc tggttagacc agatctgagc ctgggagctc tctggctaac tagggaaccc 3360
actgcttaag cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt 3420
gtgtgactct ggtaactaga gatccctcag acccttttag tcagtgtgga aaatctctag 3480
cagtacgtat agtagttcat gtcatcttat tattcagtat ttataacttg caaagaaatg 3540
aatatcagag agtgagagga acttgtttat tgcagcttat aatggttaca aataaagcaa 3600
tagcatcaca aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc 3660
caaactcatc aatgtatctt atcatgtctg gctctagcta tcccgcccct aactccgccc 3720
atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 3780
tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 3840
ggcttttttg gaggcctagg gacgtaccca attcgcccta tagtgagtcg tattacgcgc 3900
gctcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 3960
atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 4020
atcgcccttc ccaacagttg cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg 4080
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 4140
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc 4200
gtcaagctct aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg 4260
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg 4320
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg 4380
gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt 4440
cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa 4500
tattaacgct tacaatttag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 4560
tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 4620
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 4680
tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 4740
aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 4800
cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 4860
agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg 4920
ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 4980
tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 5040
tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 5100
caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 5160
accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact 5220
attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc 5280
ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga 5340
taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg 5400
taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg 5460
aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca 5520
agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta 5580
ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca 5640
ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg 5700
cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga 5760
tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa 5820
tactgttctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc 5880
tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg 5940
tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 6000
ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct 6060
acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc 6120
ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg 6180
gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg 6240
ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct 6300
ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga 6360
taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 6420
cagcgagtca gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc 6480
gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga aagcgggcag 6540
tgagcgcaac gcaattaatg tgagttagct cactcattag gcaccccagg ctttacactt 6600
tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc acacaggaaa 6660
cagctatgac catgattacg ccaagcgcgc aattaaccct cactaaaggg aacaaaagct 6720
ggagctgcaa gcttaatgta gtcttatgca atactcttgt agtcttgcaa catggtaacg 6780
atgagttagc aacatgcctt acaaggagag aaaaagcacc gtgcatgccg attggtggaa 6840
gtaaggtggt acgatcgtgc cttattagga aggcaacaga cgggtctgac atggattgga 6900
cgaaccactg aattgccgca ttgcagagat attgtattta agtgcctagc tcgatacata 6960
aacgggtctc tctggttaga ccagatctga gcctgggagc tctctggcta actagggaac 7020
ccactgctta agcctcaata aagcttgcct tgagtgcttc aagtagtgtg tgcccgtctg 7080
ttgtgtgact ctggtaacta gagatccctc agaccctttt agtcagtgtg gaaaatctct 7140
agcagtggcg cccgaacagg gacttgaaag cgaaagggaa accagaggag ctctctcgac 7200
gcaggactcg gcttgctgaa gcgcgcacgg caagaggcga ggggcggcga ctggtgagta 7260
cgccaaaaat tttgactagc ggaggctaga aggagagaga tgggtgcgag agcgtcagta 7320
ttaagcgggg gagaattaga tcgcgatggg aaaaaattcg gttaaggcca gggggaaaga 7380
aaaaatataa attaaaacat atagtatggg caagcaggga gctagaacga ttcgcagtta 7440
atcctggcct gttagaaaca tcagaaggct gtagacaaat actgggacag ctacaaccat 7500
cccttcagac aggatcagaa gaacttagat cattatataa tacagtagca accctctatt 7560
gtgtgcatca aaggatagag ataaaagaca ccaaggaagc tttagacaag atagaggaag 7620
agcaaaacaa aagtaagacc accgcacagc aagcggccgc tgatcttcag acctggagga 7680
ggagatatga gggacaattg gagaagtgaa ttatataaat ataaagtagt aaaaattgaa 7740
ccattaggag tagcacccac caaggcaaag agaagagtgg tgcagagaga aaaaagagca 7800
gtgggaatag gagctttgtt ccttgggttc ttgggagcag caggaagcac tatgggcgca 7860
gcgtcaatga cgctgacggt acaggccaga caattattgt ctggtatagt gcagcagcag 7920
aacaatttgc tgagggctat tgaggcgcaa cagcatctgt tgcaactcac agtctggggc 7980
atcaagcagc tccaggcaag aatcctggct gtggaaagat acctaaagga tcaacagctc 8040
ctggggattt ggggttgctc tggaaaactc atttgcacca ctgctgtgcc ttggaatgct 8100
agttggagta ataaatctct ggaacagatt tggaatcaca cgacctggat ggagtgggac 8160
agagaaatta acaattacac aagcttaata cactccttaa ttgaagaatc gcaaaaccag 8220
caagaaaaga atgaacaaga attattggaa ttagataaat gggcaagttt gtggaattgg 8280
tttaacataa caaattggct gtggtatata aaattattca taatgatagt aggaggcttg 8340
gtaggtttaa gaatagtttt tgctgtactt tctatagtga atagagttag gcagggatat 8400
tcaccattat cgtttcagac ccacctccca accccgaggg gacccttgcg ccttttccaa 8460
ggcagccctg ggtttgcgca gggacgcggc tgctctgggc gtggttccgg gaaacgcagc 8520
ggcgccgacc ctgggtctcg cacattcttc acgtccgttc gcagcgtcac ccggatcttc 8580
gccgctaccc ttgtgggccc cccggcgacg cttcctgctc cgcccctaag tcgggaaggt 8640
tccttgcggt tcgcggcgtg ccggacgtga caaacggaag ccgcacgtct cactagtacc 8700
ctcgcagacg gacagcgcca gggagcaatg gcagcgcgcc gaccgcgatg ggctgtggcc 8760
aatagcggct gctcagcagg gcgcgccgag agcagcggcc gggaaggggc ggtgcgggag 8820
gcggggtgtg gggcggtagt gtgggccctg ttcctgcccg cgcggtgttc cgcattctgc 8880
aagcctccgg agcgcacgtc ggcagtcggc tccctcgttg accgaatcac cgacctctct 8940
ccccaggggg tacccagctg tctagagaat tctagatctt gagacaaatg gcagtattca 9000
tccacaattt taaaagaaaa ggggggattg gggggtacag tgcaggggaa agaatagtag 9060
acataatagc aacagacata caaactaaag aattacaaaa acaaattaca aaaattcaaa 9120
attttcgggt ttattacagg gacagcagag atccactttg gcgccggctc gaggggg 9177
<210> 215
<211> 175
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Polypeptide
<400> 215
Met Ala Gln Leu Lys Lys Lys Leu Gln Ala Asn Lys Lys Glu Leu Ala
1 5 10 15
Gln Leu Lys Trp Lys Leu Gln Ala Leu Lys Lys Lys Leu Ala Gln Gly
20 25 30
Gly Gly Gly Ser Gly Ser Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu
35 40 45
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Gly Glu Pro His Asn Phe
50 55 60
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Cys Met Glu Gln Gln Thr
65 70 75 80
Arg Tyr Phe Glu Arg Arg His Pro Glu Leu Ala Gly Ser Pro Arg Leu
85 90 95
Arg Ala Tyr Met Leu Arg Ile Gly Leu Asp Gln Leu Tyr Gln Ser Leu
100 105 110
Val Asp Gly Asn Phe Asp Asp Ala Ala Trp Ala Gln Gly Arg Cys Asp
115 120 125
Ala Ile Val Arg Ser Gly Ala Gly Thr Val Gly Arg Thr Gln Ile Ala
130 135 140
Arg Arg Ser Ala Ala Val Trp Thr Asp Gly Cys Val Glu Val Leu Ala
145 150 155 160
Asp Ser Gly Asn Arg Arg Pro Ser Thr Arg Pro Arg Ala Lys Glu
165 170 175
<210> 216
<211> 926
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 216
cgcagcaccg gtgccgccac catgattaag atcgctacgc ggaagtacct ggggaaacag 60
aacgtctacg acataggtgt ggagcgcgat cacaactttg ctctgaaaaa tggatttatc 120
gccagcaact gcttcaatgg cggaggcgga tctcaccacc atcatcacca tggatcaatc 180
gatgatccca agaagaaacg caaggtggat cccaagaaga agaggaaagt agaccccaag 240
aaaaagagga aggtgggatc aactgggtct cgcggaggtg gggggtcagg cggaggtggc 300
tcaggaggcg gaggctcagg gcgcgccgga tccggacgcg ccgacgcgct ggacgatttc 360
gatctcgaca tgctgggttc tgatgccctc gatgactttg acctggatat gttgggaagc 420
gacgcattgg atgactttga tctggacatg ctcggctccg atgctctgga cgatttcgat 480
ctcgatatgt tatatggcgg ccgcggaggc ggcggttccg gaggtggtgg aagtgggggt 540
ggagggtctc cgaagaaaaa gcgcaaggtc gggccggccg gtggaggagg tagcgctgag 600
tactgccttt catacgagac cgagatcctg actgtcgagt acggattgct tcctatcggc 660
aaaatcgtgg agaagaggat tgaatgtacc gtctattcag tcgataataa tgggaacatc 720
tacacacagc ccgtggctca atggcacgac agaggagagc aggaagtttt tgaatactgt 780
ctcgaggacg gatccctcat ccgcgctact aaagatcata agtttatgac cgtggacggc 840
cagatgctgc caattgacga aatttttgaa cgagagctgg atctgatgag agtcgacaac 900
cttccaaact gattaattaa gcgtcg 926
<210> 217
<211> 977
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 217
cgcagcaccg gtgccgccac catgagcact ggaaagcgag ttagcatcaa ggacttgctg 60
gacgaaaagg atttcgaaat ttgggcaatc aatgagcaga ccatgaaact ggagtctgca 120
aaggtgtccc gggtgttttg cacgggtaag aagcttgttt atatccttaa aactagactg 180
ggccggacga tcaaagccac cgcgaaccac agattcttga caatcgacgg gtggaaacgg 240
ctggacgaac tgagcttgaa ggagcacatc gcccttcctc ggaagctcga gtcatcttcc 300
ctgcagctga gtcccgaaat cgaaaagctc tctcagagcg atatatattg ggactccatc 360
gtaagcataa cagagacggg ggtcgaggag gtgttcgatc tgacagttcc tgggcctcat 420
aatttcgtag cgaacgacat cattgtacat aactccattg aaggcggagg cggatctcac 480
caccatcatc accatggatc aatcgatgat cccaagaaga aacgcaaggt ggatcccaag 540
aagaagagga aagtagaccc caagaaaaag aggaaggtgg gatcaactgg gtctcgcgga 600
ggtggggggt caggcggagg tggctcagga ggcggaggct cagggcgcgc cggatccgga 660
cgcgccgacg cgctggacga tttcgatctc gacatgctgg gttctgatgc cctcgatgac 720
tttgacctgg atatgttggg aagcgacgca ttggatgact ttgatctgga catgctcggc 780
tccgatgctc tggacgattt cgatctcgat atgttatatg gcggccgcgg aggcggcggt 840
tccggaggtg gtggaagtgg gggtggaggg tctccgaaga aaaagcgcaa ggtcgggccg 900
gccggtggag gaggtagcga atccggatgt atcagtggcg actccctgat ctcactcgca 960
tgattaatta agcgtag 977
<210> 218
<211> 4304
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Polynucleotide
<400> 218
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgatca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agcaggctcc gaattcaccg gtgccgccac catggggccg gccgcggccg cattaggcac 720
cccaggcttt acactttatg cttccggctc gtataatgtg tggattttga gttaggatcc 780
gtcgagattt tcaggagcta aggaagctaa aatggagaaa aaaatcactg gatataccac 840
cgttgatata tcccaatggc atcgtaaaga acattttgag gcatttcagt cagttgctca 900
atgtacctat aaccagaccg ttcagctgga tattacggcc tttttaaaga ccgtaaagaa 960
aaataagcac aagttttatc cggcctttat tcacattctt gcccgcctga tgaatgctca 1020
tccggaattc cgtatggcaa tgaaagacgg tgagctggtg atatgggata gtgttcaccc 1080
ttgttacacc gttttccatg agcaaactga aaccttttca tcgctctgga gtgaatacca 1140
cgacgatttc cggcagtttc tacacatata ttcgcaagat gtggcgtgtt acggtgaaaa 1200
cctggcctat ttccctaaag ggtttattga gaatatgttt ttcgtctcag ccaatccctg 1260
ggtgagtttc accagttttg atttaaacgt ggccaatatg gacaacttct tcgcccccgt 1320
tttcaccatg ggcaaatatt atacgcaagg cgacaaggtg ctgatgccgc tggcgattca 1380
ggttcatcat gccgtttgtg atggcttcca tgtcggcaga atgcttaatg aattacaaca 1440
gtactgcgat gagtggcagg gcggggcgta aagatctgga tccggcttac taaaagccag 1500
ataacagtat gcgtatttgc gcgctgattt ttgcggtata agaatatata ctgatatgta 1560
tacccgaagt atgtcaaaaa gaggtatgct atgaagcagc gtattacagt gacagttgac 1620
agcgacagct atcagttgct caaggcatat atgatgtcaa tatctccggt ctggtaagca 1680
caaccatgca gaatgaagcc cgtcgtctgc gtgccgaacg ctggaaagcg gaaaatcagg 1740
aagggatggc tgaggtcgcc cggtttattg aaatgaacgg ctcttttgct gacgagaaca 1800
ggggctggtg aaatgcagtt taaggtttac acctataaaa gagagagccg ttatcgtctg 1860
tttgtggatg tacagagtga tattattgac acgcccgggc gacggatggt gatccccctg 1920
gccagtgcac gtctgctgtc agataaagtc tcccgtgaac tttacccggt ggtgcatatc 1980
ggggatgaaa gctggcgcat gatgaccacc gatatggcca gtgtgccggt ctccgttatc 2040
ggggaagaag tggctgatct cagccaccgc gaaaatgaca tcaaaaacgc cattaacctg 2100
atgttctggg gaatataaat gtcaggctcc cttatacaca gccagtctgc aggtcgacaa 2160
cgtttgatta attaagaatt cgacccagct ttcttgtaca aagttggcat tataaaaaat 2220
aattgctcat caatttgttg caacgaacag gtcactatca gtcaaaataa aatcattatt 2280
tgccatccag ctgatatccc ctatagtgag tcgtattaca tggtcatagc tgtttcctgg 2340
cagctctggc ccgtgtctca aaatctctga tgttacattg cacaagataa aaatatatca 2400
tcatgcctcc tctagaccag ccaggacaga aatgcctcga cttcgctgct gcccaaggtt 2460
gccgggtgac gcacaccgtg gaaacggatg aaggcacgaa cccagtggac ataagcctgt 2520
tcggttcgta agctgtaatg caagtagcgt atgcgctcac gcaactggtc cagaaccttg 2580
accgaacgca gcggtggtaa cggcgcagtg gcggttttca tggcttgtta tgactgtttt 2640
tttggggtac agtctatgcc tcgggcatcc aagcagcaag cgcgttacgc cgtgggtcga 2700
tgtttgatgt tatggagcag caacgatgtt acgcagcagg gcagtcgccc taaaacaaag 2760
ttaaacatca tgagggaagc ggtgatcgcc gaagtatcga ctcaactatc agaggtagtt 2820
ggcgtcatcg agcgccatct cgaaccgacg ttgctggccg tacatttgta cggctccgca 2880
gtggatggcg gcctgaagcc acacagtgat attgatttgc tggttacggt gaccgtaagg 2940
cttgatgaaa caacgcggcg agctttgatc aacgaccttt tggaaacttc ggcttcccct 3000
ggagagagcg agattctccg cgctgtagaa gtcaccattg ttgtgcacga cgacatcatt 3060
ccgtggcgtt atccagctaa gcgcgaactg caatttggag aatggcagcg caatgacatt 3120
cttgcaggta tcttcgagcc agccacgatc gacattgatc tggctatctt gctgacaaaa 3180
gcaagagaac atagcgttgc cttggtaggt ccagcggcgg aggaactctt tgatccggtt 3240
cctgaacagg atctatttga ggcgctaaat gaaaccttaa cgctatggaa ctcgccgccc 3300
gactgggctg gcgatgagcg aaatgtagtg cttacgttgt cccgcatttg gtacagcgca 3360
gtaaccggca aaatcgcgcc gaaggatgtc gctgccgact gggcaatgga gcgcctgccg 3420
gcccagtatc agcccgtcat acttgaagct agacaggctt atcttggaca agaagaagat 3480
cgcttggcct cgcgcgcaga tcagttggaa gaatttgtcc actacgtgaa aggcgagatc 3540
accaaggtag tcggcaaata accctcgagc cacccatgac caaaatccct taacgtgagt 3600
tacgcgtcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat 3660
cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 3720
gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga 3780
gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac 3840
tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt 3900
ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag 3960
cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc 4020
gaactgagat acctacagcg tgagcattga gaaagcgcca cgcttcccga agggagaaag 4080
gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca 4140
gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt 4200
cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc 4260
tttttacggt tcctggcctt ttgctggcct tttgctcaca tgtt 4304
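The nucleotide entries above follow a regular layout: space-separated residue groups followed by a running residue count at the end of each line, with the total length declared on the `<211>` line. A minimal sketch of how one such block could be joined and sanity-checked is given below. The function name `parse_listing_block` and the two-line sample are hypothetical and invented for illustration; they are not taken from the listing.

```python
import re


def parse_listing_block(lines):
    """Join the residue groups of one <400> block and check the running totals.

    Each line is assumed to hold space-separated residue groups followed by
    the cumulative residue count, as in the DNA entries of the listing.
    """
    sequence = []
    total = 0
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        *groups, count = tokens
        residues = "".join(groups).lower()
        if not re.fullmatch(r"[acgtn]+", residues):
            raise ValueError(f"unexpected characters in line: {line!r}")
        total += len(residues)
        if total != int(count):
            raise ValueError(f"running total {total} != printed count {count}")
        sequence.append(residues)
    return "".join(sequence)


# Hypothetical two-line sample, not a sequence from this document.
sample = ["acgtacgtac gtacgtacgt 20", "acgta 25"]
seq = parse_listing_block(sample)
declared_length = 25  # the value that would appear on the <211> line
assert len(seq) == declared_length
```

A mismatch between the printed running count and the residues actually present raises `ValueError`, which is one way to catch truncated or merged lines in a scraped listing.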
Claims (21)
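- A method comprising delivering to a eukaryotic cell:
(a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein, located upstream of a nucleotide sequence encoding an N-terminal fragment of an intein, and (ii) a nucleotide sequence encoding a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein, located upstream of a C-terminal fragment of the selectable marker protein, and (ii) a nucleotide sequence encoding a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the N-terminal fragment and the C-terminal fragment of the selectable marker protein to produce a full-length selectable marker protein.
- The method of claim 1, further comprising maintaining the eukaryotic cell under conditions that permit introduction of the first and second vectors into the eukaryotic cell to produce a transgenic eukaryotic cell.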
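- The method of claim 2, further comprising selecting a transgenic eukaryotic cell that comprises the full-length selectable marker protein.
- The method of any one of claims 1 to 3, wherein the eukaryotic cell is a mammalian cell.
- The method of any one of claims 1 to 4, wherein the selectable marker protein is an antibiotic resistance protein.
- The method of claim 5, wherein the antibiotic resistance protein confers resistance to hygromycin, G418, puromycin, phleomycin D1, or blasticidin.
- The method of claim 5 or 6, wherein the antibiotic resistance protein is encoded by a hygB gene, a bsr gene, a pac gene, or a neo gene.
- The method of any one of claims 1 to 4, wherein the selectable marker protein is a fluorescent protein.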
- The method of claim 8, wherein the fluorescent protein is selected from TagBFP, mTagBFP2, Azurite, EBFP2, mKalama1, Sirius, Sapphire, T-Sapphire, ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, mTFP1, EGFP, Emerald, Superfolder GFP, monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, mNeonGreen, EYFP, Citrine, Venus, SYFP2, TagYFP, monomeric Kusabira-Orange, mKOκ, mKO2, mOrange, mOrange2, mRaspberry, mCherry, mStrawberry, mScarlet, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, mRuby2, mPlum, HcRed-Tandem, mKate2, mNeptune, NirFP, TagRFP657, IFP1.4, and iRFP.
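- The method of any one of claims 1 to 9, wherein the intein is a split intein.
- The method of claim 10, wherein the split intein is a naturally split intein, optionally wherein the naturally split intein is selected from DnaE inteins, optionally wherein the DnaE intein is selected from a Synechocystis sp. DnaE (SspDnaE) intein and a Nostoc punctiforme (NpuDnaE) intein.
- The method of claim 10, wherein the split intein is an engineered split intein, optionally wherein the engineered split intein is engineered from a DnaB intein or a GyrB intein, optionally wherein the engineered split intein is an SspDnaB S1 intein or an SspGyrB S11 intein.
- The method of any one of claims 1 to 12, wherein the first and/or second molecule is a protein or a non-coding ribonucleic acid (RNA), optionally wherein the non-coding RNA is a microRNA (miRNA), an antisense RNA, a short-interfering RNA (siRNA), or a short-hairpin RNA (shRNA).
- The method of any one of claims 1 to 13, wherein the first and/or second vector is a plasmid vector or a viral vector.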
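- A eukaryotic cell comprising:
(a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein, located upstream of a nucleotide sequence encoding an N-terminal fragment of an intein, and (ii) a nucleotide sequence encoding a first molecule of interest; and
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the intein, located upstream of a C-terminal fragment of the selectable marker protein, and (ii) a nucleotide sequence encoding a second molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the N-terminal fragment and the C-terminal fragment of the selectable marker protein to produce a full-length antibiotic resistance protein.
- A kit comprising:
(a) a first vector comprising a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein, located upstream of a nucleotide sequence encoding an N-terminal fragment of an intein; and
(b) a second vector comprising a nucleotide sequence encoding a C-terminal fragment of the intein, located upstream of a C-terminal fragment of the selectable marker protein,
wherein the N-terminal fragment and the C-terminal fragment of the intein catalyze joining of the N-terminal fragment and the C-terminal fragment of the selectable marker protein to produce a full-length selectable marker protein.
- A method comprising delivering to a eukaryotic cell:
(a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein, located upstream of a nucleotide sequence encoding an N-terminal fragment of a first intein, and (ii) a nucleotide sequence encoding a first molecule of interest,
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the first intein, located upstream of a nucleotide sequence encoding a central fragment of the selectable marker protein, which is in turn located upstream of a nucleotide sequence encoding an N-terminal fragment of a second intein, and (ii) a nucleotide sequence encoding a second molecule of interest, and
(c) a third vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the second intein, located upstream of a nucleotide sequence encoding a C-terminal fragment of the selectable marker protein, and (ii) a nucleotide sequence encoding a third molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the first intein catalyze joining of the N-terminal fragment of the selectable marker protein to the central fragment of the selectable marker protein, and the N-terminal fragment and the C-terminal fragment of the second intein catalyze joining of the central fragment of the selectable marker protein to the C-terminal fragment of the selectable marker protein, to produce a full-length selectable marker protein.
- The method of claim 19, further comprising maintaining the eukaryotic cell under conditions that permit introduction of the first, second, and third vectors into the eukaryotic cell to produce a transgenic eukaryotic cell.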
- The method of claim 18, further comprising selecting a transgenic eukaryotic cell that comprises the full-length selectable marker protein.
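- A eukaryotic cell comprising:
(a) a first vector comprising (i) a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein, located upstream of a nucleotide sequence encoding an N-terminal fragment of a first intein, and (ii) a nucleotide sequence encoding a first molecule of interest,
(b) a second vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the first intein, located upstream of a nucleotide sequence encoding a central fragment of the selectable marker protein, which is in turn located upstream of a nucleotide sequence encoding an N-terminal fragment of a second intein, and (ii) a nucleotide sequence encoding a second molecule of interest, and
(c) a third vector comprising (i) a nucleotide sequence encoding a C-terminal fragment of the second intein, located upstream of a nucleotide sequence encoding a C-terminal fragment of the selectable marker protein, and (ii) a nucleotide sequence encoding a third molecule of interest,
wherein the N-terminal fragment and the C-terminal fragment of the first intein catalyze joining of the N-terminal fragment of the selectable marker protein to the central fragment of the selectable marker protein, and the N-terminal fragment and the C-terminal fragment of the second intein catalyze joining of the central fragment of the selectable marker protein to the C-terminal fragment of the selectable marker protein, to produce a full-length selectable marker protein.
- A kit comprising:
(a) a first vector comprising a nucleotide sequence encoding an N-terminal fragment of a selectable marker protein, located upstream of a nucleotide sequence encoding an N-terminal fragment of a first intein,
(b) a second vector comprising a nucleotide sequence encoding a C-terminal fragment of the first intein, located upstream of a nucleotide sequence encoding a central fragment of the selectable marker protein, which is in turn located upstream of a nucleotide sequence encoding an N-terminal fragment of a second intein, and
(c) a third vector comprising a nucleotide sequence encoding a C-terminal fragment of the second intein, located upstream of a nucleotide sequence encoding a C-terminal fragment of the selectable marker protein,
wherein the N-terminal fragment and the C-terminal fragment of the first intein catalyze joining of the N-terminal fragment of the selectable marker protein to the central fragment of the selectable marker protein, and the N-terminal fragment and the C-terminal fragment of the second intein catalyze joining of the central fragment of the selectable marker protein to the C-terminal fragment of the selectable marker protein, to produce a full-length selectable marker protein.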
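The selection logic recited in the two-vector and three-vector claims can be illustrated with a small toy model: each vector contributes one marker fragment flanked by intein halves, and a full-length marker forms only when every fragment is present and each N-terminal intein half finds its matching C-terminal half. The sketch below is an assumption-laden illustration, not the applicants' method. The names `Piece`, `splice`, `survives_selection`, `intein1`, and `intein2` are hypothetical, and the model assumes that the two inteins are orthogonal (no cross-reaction) and that splicing is all-or-nothing.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Piece:
    left_intein: Optional[str]   # C-terminal intein half upstream of the marker fragment
    marker: str                  # marker fragment label: "N", "M" (central), or "C"
    right_intein: Optional[str]  # N-terminal intein half downstream of the marker fragment


def splice(pieces):
    """Return the joined marker fragments if the pieces ligate into one chain, else None."""
    starts = [p for p in pieces if p.left_intein is None]
    if len(starts) != 1:
        return None
    chain = [starts[0]]
    remaining = [p for p in pieces if p is not starts[0]]
    # Follow matching intein halves from the N-terminal fragment onward.
    while chain[-1].right_intein is not None:
        partners = [p for p in remaining if p.left_intein == chain[-1].right_intein]
        if len(partners) != 1:
            return None
        chain.append(partners[0])
        remaining.remove(partners[0])
    if remaining:
        return None
    return "".join(p.marker for p in chain)


# Hypothetical designs mirroring the two-vector and three-vector claims.
TWO_VECTOR = {
    "vector1": Piece(None, "N", "intein1"),
    "vector2": Piece("intein1", "C", None),
}
THREE_VECTOR = {
    "vector1": Piece(None, "N", "intein1"),
    "vector2": Piece("intein1", "M", "intein2"),
    "vector3": Piece("intein2", "C", None),
}


def survives_selection(design, delivered):
    """True only if the delivered vectors reconstitute the full-length marker."""
    full_length = "".join(p.marker for p in design.values())
    product = splice([design[name] for name in delivered])
    return product == full_length


print(survives_selection(TWO_VECTOR, ["vector1", "vector2"]))    # both vectors delivered
print(survives_selection(THREE_VECTOR, ["vector1", "vector3"]))  # central fragment missing
```

In this model a cell that takes up only a subset of the vectors never reconstitutes the marker, which is why selecting on a single marker can enrich for cells carrying every vector and, with it, every molecule of interest.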
Applications Claiming Priority (9)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762571672P | 2017-10-12 | 2017-10-12 | |
US62/571,672 | 2017-10-12 | ||
US201762608478P | 2017-12-20 | 2017-12-20 | |
US62/608,478 | 2017-12-20 | ||
US201862616281P | 2018-01-11 | 2018-01-11 | |
US62/616,281 | 2018-01-11 | ||
US201862624629P | 2018-01-31 | 2018-01-31 | |
US62/624,629 | 2018-01-31 | ||
PCT/US2018/055412 WO2019075200A1 (en) | 2017-10-12 | 2018-10-11 | METHODS AND COMPOSITIONS OF TRANSGENIC SELECTION |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20200064129A (ko) | 2020-06-05 |
Family
ID=66101179
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020207013411A KR20200064129A (ko) | 2017-10-12 | 2018-10-11 | Transgenic selection methods and compositions |
Country Status (8)
Country | Link |
---|---|
US (1) | US20200263197A1 (ko) |
EP (1) | EP3694869A4 (ko) |
JP (2) | JP7394752B2 (ko) |
KR (1) | KR20200064129A (ko) |
CN (1) | CN111511759B (ko) |
AU (1) | AU2018347421B2 (ko) |
CA (1) | CA3079017A1 (ko) |
WO (1) | WO2019075200A1 (ko) |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10253316B2 (en) | 2017-06-30 | 2019-04-09 | Inscripta, Inc. | Automated cell processing methods, modules, instruments, and systems |
US11293021B1 (en) | 2016-06-23 | 2022-04-05 | Inscripta, Inc. | Automated cell processing methods, modules, instruments, and systems |
US10011849B1 (en) | 2017-06-23 | 2018-07-03 | Inscripta, Inc. | Nucleic acid-guided nucleases |
US9982279B1 (en) | 2017-06-23 | 2018-05-29 | Inscripta, Inc. | Nucleic acid-guided nucleases |
US10858761B2 (en) | 2018-04-24 | 2020-12-08 | Inscripta, Inc. | Nucleic acid-guided editing of exogenous polynucleotides in heterologous cells |
US10557216B2 (en) | 2018-04-24 | 2020-02-11 | Inscripta, Inc. | Automated instrumentation for production of T-cell receptor peptide libraries |
CA3108767A1 (en) | 2018-06-30 | 2020-01-02 | Inscripta, Inc. | Instruments, modules, and methods for improved detection of edited sequences in live cells |
US11142740B2 (en) | 2018-08-14 | 2021-10-12 | Inscripta, Inc. | Detection of nuclease edited sequences in automated modules and instruments |
US11214781B2 (en) | 2018-10-22 | 2022-01-04 | Inscripta, Inc. | Engineered enzyme |
AU2020247900A1 (en) | 2019-03-25 | 2021-11-04 | Inscripta, Inc. | Simultaneous multiplex genome editing in yeast |
US11001831B2 (en) | 2019-03-25 | 2021-05-11 | Inscripta, Inc. | Simultaneous multiplex genome editing in yeast |
AU2020288623A1 (en) | 2019-06-06 | 2022-01-06 | Inscripta, Inc. | Curing for recursive nucleic acid-guided cell editing |
WO2021102059A1 (en) | 2019-11-19 | 2021-05-27 | Inscripta, Inc. | Methods for increasing observed editing in bacteria |
US11008557B1 (en) | 2019-12-18 | 2021-05-18 | Inscripta, Inc. | Cascade/dCas3 complementation assays for in vivo detection of nucleic acid-guided nuclease edited cells |
CA3157061A1 (en) | 2020-01-27 | 2021-08-05 | Christian SILTANEN | Electroporation modules and instrumentation |
US20210332388A1 (en) | 2020-04-24 | 2021-10-28 | Inscripta, Inc. | Compositions, methods, modules and instruments for automated nucleic acid-guided nuclease editing in mammalian cells |
US11787841B2 (en) | 2020-05-19 | 2023-10-17 | Inscripta, Inc. | Rationally-designed mutations to the thrA gene for enhanced lysine production in E. coli |
EP4214314A4 (en) | 2020-09-15 | 2024-10-16 | Inscripta Inc | CRISPR EDITING TO EMBED NUCLEIC ACID LANDING PADS INTO LIVING CELL GENOMES |
US11512297B2 (en) | 2020-11-09 | 2022-11-29 | Inscripta, Inc. | Affinity tag for recombination protein recruitment |
EP4271802A1 (en) | 2021-01-04 | 2023-11-08 | Inscripta, Inc. | Mad nucleases |
US11332742B1 (en) | 2021-01-07 | 2022-05-17 | Inscripta, Inc. | Mad nucleases |
US11884924B2 (en) | 2021-02-16 | 2024-01-30 | Inscripta, Inc. | Dual strand nucleic acid-guided nickase editing |
US20240124850A1 (en) | 2021-03-03 | 2024-04-18 | Shape Therapeutics Inc. | Auxotrophic Cells for Virus Production and Compositions and Methods of Making |
EP4400585A1 (en) * | 2021-08-27 | 2024-07-17 | National University Corporation Tokyo Medical and Dental University | System for regulating protein translation |
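WO2023027169A1 (ja) * | 2021-08-27 | 2023-03-02 | National University Corporation Tokyo Medical and Dental University | Live cell sorting system |
CN115896147B (zh) * | 2022-10-11 | 2023-10-03 | 态创生物科技(广州)有限公司 | Intein evolution system and method, and corresponding mutant plasmid and reporter plasmid |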
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10113174A (ja) * | 1996-10-08 | 1998-05-06 | Amashiyamu Kk | Method for simultaneous production of human cytochrome P-450 and human cytochrome P-450 reductase |
DE60036942T2 (de) * | 1999-05-24 | 2008-08-07 | New England Biolabs, Inc., Beverly | Method for generating split, non-transferable genes that are able to express an active protein product |
US6858775B1 (en) * | 1999-05-24 | 2005-02-22 | New England Biolabs, Inc. | Method for generating split, non-transferable genes that are able to express an active protein product |
EP2395083B1 (en) * | 2002-01-08 | 2017-04-12 | Agrivida, Inc. | Transgenic plants expressing CIVPS or intein modified proteins and related method |
EP1692156B1 (en) * | 2003-10-24 | 2012-12-26 | Los Alamos National Security, LLC | Self-assembling split-fluorescent protein systems |
EP2468881A3 (en) * | 2005-07-21 | 2012-08-15 | Abbott Laboratories | Multiple gene expression including sORF constructs and methods with polyproteins, pro-proteins, and proteolysis |
US10100080B2 (en) * | 2011-09-28 | 2018-10-16 | Era Biotech, S.A. | Split inteins and uses thereof |
-
2018
- 2018-10-11 KR KR1020207013411A patent/KR20200064129A/ko active IP Right Grant
- 2018-10-11 CA CA3079017A patent/CA3079017A1/en active Pending
- 2018-10-11 JP JP2020520468A patent/JP7394752B2/ja active Active
- 2018-10-11 CN CN201880078542.7A patent/CN111511759B/zh active Active
- 2018-10-11 WO PCT/US2018/055412 patent/WO2019075200A1/en unknown
- 2018-10-11 US US16/755,065 patent/US20200263197A1/en active Pending
- 2018-10-11 EP EP18867279.4A patent/EP3694869A4/en active Pending
- 2018-10-11 AU AU2018347421A patent/AU2018347421B2/en active Active
-
2023
- 2023-11-28 JP JP2023200808A patent/JP2024015079A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
JP2024015079A (ja) | 2024-02-01 |
JP7394752B2 (ja) | 2023-12-08 |
EP3694869A4 (en) | 2021-11-24 |
US20200263197A1 (en) | 2020-08-20 |
JP2020537646A (ja) | 2020-12-24 |
CN111511759B (zh) | 2024-07-30 |
AU2018347421A1 (en) | 2020-05-14 |
CN111511759A (zh) | 2020-08-07 |
WO2019075200A1 (en) | 2019-04-18 |
EP3694869A1 (en) | 2020-08-19 |
AU2018347421B2 (en) | 2024-08-22 |
CA3079017A1 (en) | 2019-04-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR20200064129A (ko) | Transgenic selection methods and compositions | |
CN111344395B (zh) | Methods of producing modified natural killer cells and methods of use | |
AU774643B2 (en) | Compositions and methods for use in recombinational cloning of nucleic acids | |
KR102451510B1 (ko) | PD-1 homing endonuclease variants, compositions, and methods of use | |
KR20210149060A (ko) | RNA-guided DNA integration using Tn7-like transposons | |
KR101982360B1 (ko) | Method for generating compact TALE-nucleases and uses thereof | |
AU2021204620A1 (en) | Central nervous system targeting polynucleotides | |
AU2016343979A1 (en) | Delivery of central nervous system targeting polynucleotides | |
KR102628872B1 (ko) | Tools and methods for using cell division loci to control proliferation of cells | |
DK2768848T3 (en) | METHODS AND PROCEDURES FOR EXPRESSION AND SECRETARY OF PEPTIDES AND PROTEINS | |
US11033638B2 (en) | Single-vector gene construct comprising insulin and glucokinase genes | |
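CN112041334A (zh) | Expression of human FOXP3 in gene-edited T cells | |
KR20220130093A (ko) | Compositions and methods for treating sensorineural hearing loss using otoferlin dual vector systems | |
KR20240004253A (ko) | Methods for treating sensorineural hearing loss using otoferlin dual vector systems | |
CN110785179A (zh) | Therapeutic genome editing in Wiskott-Aldrich syndrome and X-linked thrombocytopenia | |
CN116083398B (zh) | Isolated Cas13 protein and use thereof | |
KR102409420B1 (ko) | Marker composition for selecting transformed organisms, transformed organism, and transformation method | |
KR20240037192A (ko) | Methods and compositions for genome integration | |
CN115768890A (zh) | Thermal control of T-cell immunotherapy through molecular and physical actuation | |
KR20220142502A (ko) | Muscle-specific nucleic acid regulatory elements and methods and uses thereof | |
CN107988259B (zh) | SmartBac baculovirus expression system and use thereof | |
KR20210151785A (ko) | Non-viral DNA vectors and uses thereof for expressing FVIII therapeutics | |
CN116323942A (zh) | Compositions for genome editing and methods of use thereof | |
KR20220115943A (ko) | Lentiviral vectors in hematopoietic stem cells for treating recombination-activating gene 1 (RAG1) severe combined immunodeficiency (SCID) | |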
NL2027815B1 (en) | Genomic integration |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right |