CN1232504A - 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of the order designated Mononegavirales
- Publication number
- CN1232504A (application number CN97198321A)
- Authority
- CN
- China
- Prior art keywords
- virus
- leu
- ile
- ser
- nucleotide
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 210000001691 amnion Anatomy 0.000 description 1
- 229940069428 antacid Drugs 0.000 description 1
- 239000003159 antacid agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 101150036080 at gene Proteins 0.000 description 1
- KUCQYCKVKVOKAY-CTYIDZIISA-N atovaquone Chemical compound C1([C@H]2CC[C@@H](CC2)C2=C(C(C3=CC=CC=C3C2=O)=O)O)=CC=C(Cl)C=C1 KUCQYCKVKVOKAY-CTYIDZIISA-N 0.000 description 1
- 229960003159 atovaquone Drugs 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 206010006451 bronchitis Diseases 0.000 description 1
- 206010006475 bronchopulmonary dysplasia Diseases 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 102220351054 c.122G>A Human genes 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000002490 cerebral effect Effects 0.000 description 1
- 239000002962 chemical mutagen Substances 0.000 description 1
- 238000006757 chemical reactions by type Methods 0.000 description 1
- 210000003837 chick embryo Anatomy 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 208000037976 chronic inflammation Diseases 0.000 description 1
- 230000006020 chronic inflammation Effects 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000037029 cross reaction Effects 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- PXEDJBXQKAGXNJ-QTNFYWBSSA-L disodium L-glutamate Chemical compound [Na+].[Na+].[O-]C(=O)[C@@H](N)CCC([O-])=O PXEDJBXQKAGXNJ-QTNFYWBSSA-L 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 208000017574 dry cough Diseases 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 230000004761 fibrosis Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 208000024386 fungal infectious disease Diseases 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 102000035122 glycosylated proteins Human genes 0.000 description 1
- 108091005608 glycosylated proteins Proteins 0.000 description 1
- 108060003552 hemocyanin Proteins 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 208000029570 hepatitis D virus infection Diseases 0.000 description 1
- 102000057593 human F8 Human genes 0.000 description 1
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940088592 immunologic factor Drugs 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 210000001739 intranuclear inclusion body Anatomy 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 239000000644 isotonic solution Substances 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 230000021633 leukocyte mediated immunity Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- GZQKNULLWNGMCW-PWQABINMSA-N lipid A (E. coli) Chemical compound O1[C@H](CO)[C@@H](OP(O)(O)=O)[C@H](OC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC)[C@@H](NC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCC)[C@@H]1OC[C@@H]1[C@@H](O)[C@H](OC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](NC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](OP(O)(O)=O)O1 GZQKNULLWNGMCW-PWQABINMSA-N 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 235000013923 monosodium glutamate Nutrition 0.000 description 1
- 210000004400 mucous membrane Anatomy 0.000 description 1
- 101150061325 mv gene Proteins 0.000 description 1
- 101150031334 mvl gene Proteins 0.000 description 1
- 210000003928 nasal cavity Anatomy 0.000 description 1
- 230000017074 necrotic cell death Effects 0.000 description 1
- 210000003360 nephrocyte Anatomy 0.000 description 1
- 210000000440 neutrophil Anatomy 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 229960000381 omeprazole Drugs 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 229960005030 other vaccine in atc Drugs 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 230000008782 phagocytosis Effects 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 210000004224 pleura Anatomy 0.000 description 1
- 239000002574 poison Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 229940023143 protein vaccine Drugs 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- 125000000714 pyrimidinyl group Chemical group 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 229940047431 recombinate Drugs 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 210000000664 rectum Anatomy 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001373 regressive effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 201000005404 rubella Diseases 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000003375 selectivity assay Methods 0.000 description 1
- 229940073490 sodium glutamate Drugs 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 210000004406 stave cell Anatomy 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 101150012509 sub gene Proteins 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 239000009871 tenuigenin Substances 0.000 description 1
- 238000013185 thoracic computed tomography Methods 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 239000012646 vaccine adjuvant Substances 0.000 description 1
- 229940124931 vaccine adjuvant Drugs 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 210000001215 vagina Anatomy 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 230000003313 weakening effect Effects 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/18011—Paramyxoviridae
- C12N2760/18411—Morbillivirus, e.g. Measles virus, canine distemper
- C12N2760/18422—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/18011—Paramyxoviridae
- C12N2760/18511—Pneumovirus, e.g. human respiratory syncytial virus
- C12N2760/18522—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/18011—Paramyxoviridae
- C12N2760/18611—Respirovirus, e.g. Bovine, human parainfluenza 1,3
- C12N2760/18622—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Abstract
Isolated, recombinantly generated, attenuated, nonsegmented, negative-sense, single-stranded RNA viruses of the Order Mononegavirales having at least one attenuating mutation in the 3' genomic promoter region and at least one attenuating mutation in the RNA polymerase gene are described. Vaccines are formulated comprising such viruses and a physiologically acceptable carrier. The vaccines are used for immunizing an individual to induce protection against a nonsegmented, negative-sense, single-stranded RNA virus of the Order Mononegavirales.
Description
Field of the Invention
The present invention relates to isolated, recombinantly generated, attenuated, nonsegmented, negative-sense, single-stranded RNA viruses of the Order designated Mononegavirales, having at least one attenuating mutation in the 3' genomic promoter region and at least one attenuating mutation in the RNA polymerase gene. This invention was made with Government support under a grant awarded by the Public Health Service. The Government has certain rights in the invention.
Background of the Invention
Enveloped, negative-sense, single-stranded RNA viruses are uniquely organized and expressed. Within the viral nucleocapsid, the genomic RNA of these viruses serves two template functions: as a template for the synthesis of messenger RNAs (mRNAs) and as a template for the synthesis of the antigenome (+) strand. Negative-sense, single-stranded RNA viruses encode and package their own RNA-dependent RNA polymerase. Messenger RNAs are synthesized only once the virus has been uncoated in the infected cell. Viral replication occurs after synthesis of the mRNAs and requires the continuous synthesis of viral proteins. The newly synthesized antigenome (+) strand serves as the template for generating further copies of the (-) strand genomic RNA.
The polymerase complex initiates and carries out transcription and replication by engaging the cis-acting signals at the 3' end of the genome, in particular the promoter region. Viral genes are then transcribed unidirectionally from the genome template, from its 3' end to its 5' end. Less mRNA is always made from a downstream gene (e.g., the polymerase gene (L)) than from its upstream neighbors (e.g., the nucleoprotein gene (N)). There is therefore usually a gradient of mRNA abundance according to the position of each gene relative to the 3' end of the genome.
Based on the reclassification made in 1993 by the International Committee on the Taxonomy of Viruses, an Order, designated Mononegavirales, has been established. This Order contains three families of enveloped viruses with single-stranded, nonsegmented RNA genomes of minus polarity (negative-sense). These families are the Paramyxoviridae, Rhabdoviridae and Filoviridae. The family Paramyxoviridae has been further divided into two subfamilies, Paramyxovirinae and Pneumovirinae. The subfamily Paramyxovirinae contains three genera: Paramyxovirus, Rubulavirus and Morbillivirus. The subfamily Pneumovirinae contains the genus Pneumovirus.
The new classification is based upon morphological criteria, the organization of the viral genome, biological activities and the sequence relatedness of the proteins. The morphological feature that distinguishes the subfamily Paramyxovirinae among enveloped viruses is the size and shape of its nucleocapsids (diameter 18 nm, length 1 µm, pitch 5.5 nm), which have a left-handed helical symmetry. The biological criteria are: 1) antigenic cross-reactivity between members of a genus, and 2) the presence of neuraminidase activity in the genera Paramyxovirus and Rubulavirus and its absence in the genus Morbillivirus. In addition, variations in the coding potential of the P gene are considered, as is the presence of an extra gene (SH) in rubulaviruses.
Pneumoviruses can be distinguished from Paramyxovirinae morphologically because their nucleocapsids are narrow. In addition, pneumoviruses differ markedly from Paramyxovirinae in the number of protein-encoding cistrons (10 in pneumoviruses versus 6 in Paramyxovirinae) and in having an attachment protein (G) that is very different from that of Paramyxovirinae. Although the paramyxoviruses and pneumoviruses have six proteins that appear to correspond in function (N, P, M, G/H/HN, F and L), only the latter two proteins exhibit significant sequence relatedness between the two subfamilies. Several pneumoviral proteins lack counterparts in most of the paramyxoviruses, namely the nonstructural proteins NS1 and NS2, the small hydrophobic protein SH, and a second protein M2. Some paramyxoviral proteins, namely C and V, lack counterparts in pneumoviruses. However, the basic genomic organization of pneumoviruses and paramyxoviruses is the same. The same is true of rhabdoviruses and filoviruses. Table 1 presents the taxonomical classification of these viruses, together with examples of each genus.
Table 1
Classification of nonsegmented, negative-sense, single-stranded RNA viruses of the Order Mononegavirales
Family Paramyxoviridae
  Subfamily Paramyxovirinae
    Genus Paramyxovirus
      Sendai virus (mouse parainfluenza virus type 1)
      Human parainfluenza virus (PIV) types 1 and 3
      Bovine parainfluenza virus type 3 (BIV)
    Genus Rubulavirus
      SV 41 virus (SV) (canine parainfluenza virus type 2)
      Mumps virus
      Newcastle disease virus (NDV) (avian paramyxovirus 1)
      Human parainfluenza virus types 2, 4a and 4b
    Genus Morbillivirus
      Measles virus (MV)
      Dolphin morbillivirus
      Canine distemper virus (CDV)
      Peste-des-petits-ruminants virus
      Phocine distemper virus
      Rinderpest virus
  Subfamily Pneumovirinae
    Genus Pneumovirus
      Human respiratory syncytial virus (RSV)
      Bovine respiratory syncytial virus
      Pneumonia virus of mice
      Turkey rhinotracheitis virus
Family Rhabdoviridae
  Genus Lyssavirus
    Rabies virus
  Genus Vesiculovirus
    Vesicular stomatitis virus
  Genus Ephemerovirus
    Bovine ephemeral fever virus
Family Filoviridae
  Genus Filovirus
    Marburg virus
For many of the viruses listed above, no vaccines of any kind are available. Thus, there is a need to develop vaccines against such human and animal pathogens. Such vaccines would have to elicit a protective immune response in the recipient. The qualitative and quantitative features of such a favorable response are extrapolated from those seen in survivors of natural virus infection, who, in general, are protected from reinfection by the same or highly related viruses for a significant period thereafter.
A variety of approaches can be considered in seeking to develop such vaccines, including the use of: (1) purified individual viral protein vaccines (subunit vaccines); (2) inactivated whole virus preparations; and (3) live, attenuated viruses.
Subunit vaccines have the desirable features of being pure, definable and relatively easily produced in abundance by various means, including recombinant DNA expression methods. To date, with the notable exception of hepatitis B surface antigen, viral subunit vaccines have generally elicited only short-lived and/or inadequate immunity, particularly in naive recipients.
Formalin-inactivated whole virus preparations of polio (IPV) and hepatitis A have proven safe and efficacious. In contrast, immunization with similarly inactivated whole viruses, such as respiratory syncytial virus and measles virus, elicited unfavorable immune responses and/or response profiles that predisposed vaccinees to exaggerated or aberrant disease when they subsequently encountered the natural or "wild-type" virus.
Early attempts (1966) to vaccinate young children used a parenterally administered, formalin-inactivated RSV vaccine. Unfortunately, several field trials of this vaccine revealed serious adverse reactions: a severe illness with unusual features developed following subsequent natural infection with RSV (Bibliography entries 1, 2). It has been suggested that this formalin-inactivated antigen elicited an abnormal or unbalanced immune response, predisposing the vaccinee to enhanced RSV disease (3, 4).
Thereafter, live attenuated RSV vaccine candidates were generated by cold passage or chemical mutagenesis. These RSV strains were found to have reduced virulence in seropositive adults. Unfortunately, they proved either over-attenuated or under-attenuated when given to seronegative infants; in some cases, they were also found to lack genetic stability (5, 6). Another vaccination approach, parenteral administration of live virus, was discontinued because it lacked efficacy (7). Notably, these live RSV vaccines were never associated with disease enhancement of the kind observed with the formalin-inactivated RSV vaccine described above. Currently, no RSV vaccine is approved for administration to humans, although clinical trials are in progress with cold-passaged, chemically mutagenized strains of RSV designated A2 and B-1.
Appropriately attenuated live derivatives of wild-type viruses offer a distinct advantage as vaccine candidates. As live, replicating agents, they initiate infection in recipients, during which viral gene products are expressed, processed and presented in the context of the vaccinee's specific MHC class I and II molecules. This elicits humoral and cell-mediated immune responses, as well as coordinate cytokine patterns, that parallel the protective immune profile of survivors of natural infection.
This favorable immune response pattern contrasts with the limited responses elicited by inactivated or subunit vaccines, which are typically restricted largely to the humoral arm of the immune system. Further, the immune response elicited by some formalin-inactivated whole virus vaccines (for example, the measles and respiratory syncytial virus vaccines developed in the 1960s) not only failed to provide sustained protection, but in fact predisposed vaccine recipients to aberrant, exaggerated or even fatal illness when they later encountered the wild-type virus.
While live, attenuated viruses have highly desirable characteristics as vaccine candidates, they have proven difficult to develop. The crux of the difficulty lies in the need to isolate a derivative of the wild-type virus that has lost its disease-producing potential (i.e., virulence) while retaining sufficient replication competence to infect the recipient and elicit the desired immune response in adequate abundance.
Historically, this delicate balance between virulence and attenuation has been achieved by serial passage of a wild-type viral isolate through different host tissues or cells under varying growth conditions (such as temperature). This process presumably favors the growth of viral variants (mutants), some of which have the favorable characteristic of attenuation. Occasionally, further attenuation is achieved through chemical mutagenesis as well.
This propagation/passage scheme typically leads to the emergence of virus derivatives that are temperature sensitive, cold-adapted and/or altered in their host range. One or all of these are changes from the wild-type, disease-causing virus, that is, changes that may be associated with attenuation.
Several live virus vaccines have been generated by this approach, including those for the prevention of measles and mumps (which are paramyxoviruses) and for protection against polio and rubella (which are positive-strand RNA viruses), and they provide the mainstay of current childhood immunization regimens throughout the world.
Nevertheless, this means of generating attenuated live virus vaccine candidates is lengthy and, at best, unpredictable, relying largely on the selective outgrowth of those randomly occurring genomic mutants with desirable attenuation characteristics. The resulting viruses may have the desired phenotype in vitro, and may even appear to be attenuated in animal models. However, all too often they remain either under-attenuated or over-attenuated in the human or animal host for whom they are intended as vaccine candidates.
Even where vaccines are in current use, there is still a need for more efficacious vaccines. For example, the current measles vaccines provide reasonably good protection. However, recent measles epidemics suggest deficiencies in the efficacy of current vaccines. Despite maternal immunization, high rates of acute measles infection have occurred in children under age one, reflecting the vaccines' inability to induce anti-measles antibody levels comparable to those induced by wild-type measles infection (8, 9, 10). As a result, vaccine-immunized mothers are less able to provide their infants with sufficient transplacentally derived passive antibodies to protect the newborns beyond the first few months of life.
Acute measles infections in previously immunized adolescents and young adults point to an additional problem. These secondary vaccine failures indicate limitations in the current vaccines' ability to induce and maintain antiviral protection that is both abundant and long-lived (11, 12, 13). Recently, yet another potential problem was revealed. The hemagglutinin protein of wild-type measles viruses isolated over the past 15 years has shown a progressively increasing distance from the vaccine strains (14). This "antigenic drift" raises the concern that the vaccine strains may not contain the ideal antigenic repertoire needed to provide optimal protection. Thus, there is a need for improved vaccines.
Rational vaccine design would be assisted by a better understanding of these viruses, in particular by the identification of the virally encoded determinants of virulence and of those genomic changes that are responsible for attenuation.
Summary of the Invention
Accordingly, it is an object of this invention to identify those regions of the genome of RNA viruses of the Order Mononegavirales in which mutations result in attenuation of the virus.
It is a further object of this invention to recombinantly generate viruses that incorporate such attenuating mutations in their genomes.
It is still a further object of this invention to formulate vaccines containing such attenuated viruses.
These and other objects of the invention, as discussed below, are achieved by the generation and isolation of recombinantly generated, attenuated, nonsegmented, negative-sense, single-stranded RNA viruses of the Order Mononegavirales having at least one attenuating mutation in the 3' genomic promoter region and at least one attenuating mutation in the RNA polymerase gene.
In the case of measles virus, the at least one attenuating mutation in the 3' genomic promoter region is selected from the group consisting of nucleotide 26 (A → T), nucleotide 42 (A → T or A → C) and nucleotide 96 (G → A), where these nucleotides (and all other nucleotides described in this application, unless otherwise noted) are presented in positive strand, antigenomic, that is, message (coding) sense; and the at least one attenuating mutation in the RNA polymerase gene is selected from nucleotide changes that produce a change in an amino acid selected from the group consisting of residue 331 (isoleucine → threonine), residue 1409 (alanine → threonine), residue 1624 (threonine → alanine), residue 1649 (arginine → methionine), residue 1717 (aspartic acid → alanine), residue 1936 (histidine → tyrosine), residue 2074 (glutamine → arginine) and residue 2114 (arginine → lysine).
In the case of human parainfluenza virus type 3 (PIV-3), the at least one attenuating mutation in the 3' genomic promoter region is selected from the group consisting of nucleotide 23 (T → C), nucleotide 24 (C → T), nucleotide 28 (G → T) and nucleotide 45 (T → A); and the at least one attenuating mutation in the RNA polymerase gene is selected from nucleotide changes that produce a change in an amino acid selected from the group consisting of residue 942 (lysine → histidine), residue 992 (leucine → phenylalanine), residue 1292 (leucine → phenylalanine) and residue 1558 (threonine → isoleucine).
In the case of human respiratory syncytial virus (RSV) subgroup B, the at least one attenuating mutation in the 3' genomic promoter region is selected from the group consisting of nucleotide 4 (C → G) and the insertion of an additional A in the stretch of A's at nucleotides 6-11; and the at least one attenuating mutation in the RNA polymerase gene is selected from nucleotide changes that produce a change in an amino acid selected from the group consisting of residue 353 (arginine → lysine), residue 451 (lysine → arginine), residue 1229 (aspartic acid →), residue 2029 (threonine → isoleucine) and residue 2050 (asparagine → aspartic acid).
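The mutation sets listed above for the three example viruses can be collected into a small lookup table. The following Python sketch is illustrative only: the dictionary layout, the names `ATTENUATING_MUTATIONS` and `meets_minimum`, and the string keys are assumptions made for this example and form no part of the disclosure. Positions and changes are copied from the preceding paragraphs; the substituted amino acid at RSV residue 1229 is left unspecified (`None`), as in the text.

```python
# Illustrative lookup of the attenuating mutations listed above (names are hypothetical).
# Promoter entries: antigenomic position -> (wild-type base, set of substitutions).
# Polymerase entries: L protein residue -> (wild-type amino acid, substituted amino acid).
ATTENUATING_MUTATIONS = {
    "measles": {
        "promoter": {26: ("A", {"T"}), 42: ("A", {"T", "C"}), 96: ("G", {"A"})},
        "polymerase": {
            331: ("Ile", "Thr"), 1409: ("Ala", "Thr"), 1624: ("Thr", "Ala"),
            1649: ("Arg", "Met"), 1717: ("Asp", "Ala"), 1936: ("His", "Tyr"),
            2074: ("Gln", "Arg"), 2114: ("Arg", "Lys"),
        },
    },
    "PIV-3": {
        "promoter": {23: ("T", {"C"}), 24: ("C", {"T"}), 28: ("G", {"T"}), 45: ("T", {"A"})},
        "polymerase": {
            942: ("Lys", "His"), 992: ("Leu", "Phe"),
            1292: ("Leu", "Phe"), 1558: ("Thr", "Ile"),
        },
    },
    "RSV-B": {
        # The insertion in the A run at nucleotides 6-11 is recorded as a free-text label.
        "promoter": {4: ("C", {"G"}), "6-11": ("A run", {"one extra A inserted"})},
        "polymerase": {
            353: ("Arg", "Lys"), 451: ("Lys", "Arg"),
            1229: ("Asp", None),  # substituted residue not given in the text
            2029: ("Thr", "Ile"), 2050: ("Asn", "Asp"),
        },
    },
}


def meets_minimum(virus, promoter_positions, polymerase_residues):
    """Return True if at least one listed promoter site and one listed L residue are present."""
    table = ATTENUATING_MUTATIONS[virus]
    has_promoter = any(p in table["promoter"] for p in promoter_positions)
    has_polymerase = any(r in table["polymerase"] for r in polymerase_residues)
    return has_promoter and has_polymerase
```

For instance, `meets_minimum("measles", [26], [331])` is true, whereas a candidate carrying only promoter changes would not satisfy the "at least one of each" condition.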
In another embodiment of this invention, these attenuated viruses are used to prepare vaccines that elicit a protective immune response against the wild-type form of the virus.
In another embodiment of this invention, an isolated, positive strand, antigenomic message sense nucleic acid molecule (or an isolated, negative strand, genomic sense nucleic acid molecule) having the complete viral nucleotide sequence (whether of wild-type virus or of virus attenuated by non-recombinant means) is manipulated by introducing one or more of the attenuating mutations described in this application, so as to generate an isolated, recombinantly generated attenuated virus. This virus is then used to prepare vaccines that elicit a protective immune response against the wild-type form of the virus.
In yet another embodiment of this invention, such a complete wild-type or vaccine viral nucleotide sequence is used: (1) to design PCR primers for use in a PCR assay to detect the presence of the corresponding virus in a sample; or (2) to design and select peptides for use in an ELISA to detect the presence of the corresponding virus in a sample.
Brief Description of the Drawings
Figure 1 depicts the passage history of the Edmonston measles virus (15). The abbreviations in the figure have the following meanings: HK - human kidney; HA - human amnion; CE(am) - chick embryo; CEF - chick embryo fibroblast; DK - dog kidney; WI-38 - human diploid cells; SK - sheep kidney; * - plaque cloning. The number following each abbreviation represents the number of passages.
Figure 2 depicts a map of the measles virus genome showing the putative cis-acting regulatory elements at and near the genome and antigenome termini. Top: the map of the measles virus genome, beginning with the 52-nucleotide leader sequence (l) at the 3' end and ending with the 37-nucleotide trailer sequence (t) at the 5' end. Gene boundaries are denoted by vertical lines; below each gene is the number of cistronic nucleotides. Bottom: an expanded view of the extended promoters at the 3' ends of the genome and antigenome, showing the position and sequence of the two highly conserved regions A and B. The intervening intergenic trinucleotide is also marked. The nascent 5' RNAs containing regions A' and B' are hypothesized to contain the regulatory sequences that initiate encapsidation by the N protein.
Figure 3 depicts the gene maps of the RSV subgroup B wild-type strains designated 2B and 18537 (top), the intergenic sequences of the strains (middle), and the 68-nucleotide overlap between the M2 and L genes (bottom). Compared with strain 18537, the RSV 2B strain has six fewer nucleotides in the G gene and therefore encodes two fewer amino acid residues in the G protein. The 2B strain has 145 nucleotides in the 5' trailer region, compared with 149 nucleotides in strain 18537. Compared with strain 18537, the 2B strain has one more nucleotide in each of the NS-1, NS-2 and N genes, and one fewer nucleotide in each of the M and F genes.
Detailed Description of the Invention
Transcription and replication of negative-sense, single-stranded RNA viral genomes are achieved through the enzymatic activity of a multimeric protein acting on the ribonucleoprotein core (nucleocapsid). Naked genomic RNA cannot serve as a template. Instead, these genomic sequences are recognized only when they are entirely encapsidated by the N protein into the nucleocapsid structure. It is only in that context that the genomic and antigenomic terminal promoter sequences are recognized so as to initiate the transcription or replication pathways.
All paramyxoviruses require two viral proteins, L and P, for these polymerase pathways to proceed. The pneumoviruses, including RSV, also require the transcription elongation factor M2 for the transcription pathway to proceed efficiently. Additional cofactors may also play a role, perhaps including the virus-encoded NS1 and NS2 proteins as well as host cell-encoded proteins.
However, considerable evidence indicates that it is the L protein that performs most, if not all, of the enzymatic processes associated with transcription and replication, including initiation and termination of ribonucleotide polymerization, capping and polyadenylation of mRNA transcripts, methylation and, perhaps, specific phosphorylation of the P protein. The central role of the L protein in genomic transcription and replication is supported by its large size, its sensitivity to mutations, and its catalytic level of abundance in the transcriptionally active viral complex (16).
These considerations led to the proposal that L proteins consist of a linear array of domains whose concatenated structure integrates discrete functions (17). Indeed, three such delimited, discrete elements within the L proteins of negative-sense viruses have been identified on the basis of their relatedness to defined functional domains of other well-characterized proteins. These include: (1) a putative RNA template recognition and/or phosphodiester bond formation domain; (2) an RNA binding element; and (3) an ATP binding domain. All prior studies of the L proteins of nonsegmented, negative-sense, single-stranded RNA viruses have revealed these putative functional elements (17).
Without being bound by the following, it is reasonable to assume that, because these non-protein-coding, promoter and other cis-acting genomic regulatory domains are important determinants of replication efficiency, and because transcription and replication are accomplished by the L protein in measles virus (MV) and the other viruses of the order Mononegavirales, these elements may in turn be determinants of the virulence of these viruses.
In general terms, the invention is considered to comprise a set of coordinate changes in the cis-acting regulatory signals (the 3' genomic promoter region) and in the polymerase gene (L) that result in attenuation of the virus while allowing the virus to retain sufficient ability to replicate. Attenuation is optimized by rational mutation of the 3' genomic promoter region and the polymerase gene so as to provide the desired balance of replication efficiency: the vaccine virus is no longer able to produce disease, yet retains its capacity to infect the cells of the vaccinee, to express gene products in sufficient abundance to elicit the full spectrum and profile of desirable immune responses, and to reproduce and disseminate sufficiently to maximize the immune response elicited.
Without being bound by the following, attenuating mutations in the extended promoter (3' genomic promoter region) and in the polymerase gene are believed to affect the function of the cis-acting signals and the conformation of the polymerase complex that binds to those signals. For example, the promoter RNA is wound into a helical configuration upon encapsidation. Changes in the promoter sequence may affect the positions of the conserved signals relative to one another. Specifically, the 3' genomic promoter region of wild-type measles virus has a pyrimidine (uracil) at each of positions 26 and 42 (the antigenomic message sequence has adenine at these positions). The vaccine strains instead have a purine at these positions (and, correspondingly, a pyrimidine in the antigenomic message sequence; see Table 3 of Example 1 below). The bulkier purine may change the distance and/or the angle between the conserved promoter domains (in measles virus, for example, positions 1-11 and positions 87-98), thereby altering the three-dimensional conformation of the cis-acting signals as they are presented to the polymerase.
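The relationship between the antigenomic (message sense) notation used for the measles promoter changes at nucleotides 26 and 42 (A → T, or A → T or A → C) and the pyrimidine-to-purine change in the genomic strand can be checked by base complementation. The short Python sketch below is offered only as an illustration; the function and variable names are hypothetical, and positions are treated as simple labels rather than mapped onto an actual sequence.

```python
# Complement an antigenomic base (written in DNA letters, as in the text)
# to the corresponding base of the negative-strand genomic RNA.
COMPLEMENT_TO_GENOMIC_RNA = {"A": "U", "T": "A", "U": "A", "G": "C", "C": "G"}
PURINES = {"A", "G"}


def base_class(base):
    """Classify a base as purine or pyrimidine."""
    return "purine" if base in PURINES else "pyrimidine"


def genomic_change(antigenomic_wt, antigenomic_mut):
    """Return (genomic wild-type base, genomic mutant base, wild-type class, mutant class)."""
    wt = COMPLEMENT_TO_GENOMIC_RNA[antigenomic_wt]
    mut = COMPLEMENT_TO_GENOMIC_RNA[antigenomic_mut]
    return wt, mut, base_class(wt), base_class(mut)


# Measles promoter changes at positions 26 and 42, in antigenomic sense.
for position, wt, mut in [(26, "A", "T"), (42, "A", "T"), (42, "A", "C")]:
    print(position, genomic_change(wt, mut))
```

In each case the genomic wild-type base is U (a pyrimidine) and the genomic mutant base is A or G (a purine), in agreement with the description above.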
Animal studies indicate that the reduction in viral replication should be sufficient to avoid disease, yet still adequate to elicit the desired immune response. The reduction in replication may reflect reduced transcription, reduced expression of virus-encoded proteins, and a reduced amount of antisense template, so that fewer new genomes are produced. The resulting attenuated virus is significantly less virulent than the wild-type.
The attenuating mutations described herein are introduced into virus strains by the following two methods:
(1) Conventional means, such as chemical mutagenesis during virus growth in cell cultures to which a chemical mutagen has been added, selection of virus that has been passaged at suboptimal temperature in order to select temperature-sensitive and/or cold-adapted mutations, identification of mutant viruses that produce small plaques in cell culture, and passage through heterologous hosts to select for host range mutations. The viruses are then screened in an animal model for attenuated biological activity. The attenuated viruses are subjected to nucleotide sequencing of their 3' genomic promoter region and polymerase gene in order to locate the sites of the attenuating mutations. Once this has been completed, method (2) is carried out.
(2) A preferred means of introducing attenuating mutations comprises making predetermined mutations using site-directed mutagenesis. These mutations are identified either by method (1) or by reference to a closely related virus whose attenuating mutations are already known. One or more mutations are introduced into each of the 3' genomic promoter region and the polymerase gene. The cumulative effects of different combinations of coding and non-coding changes can also be assessed.
The mutations in the 3' genomic promoter region and the polymerase gene are introduced by standard recombinant DNA methods into a DNA copy of the viral genome. This may be a wild-type or a modified viral genetic background (for example, a virus modified by method (1)), whereby a new virus is generated. Infectious clones or particles containing these attenuating mutations are generated using the cDNA "rescue" system, which has been applied to a variety of viruses, including Sendai virus (18); measles virus (19); respiratory syncytial virus (20); rabies virus (21); vesicular stomatitis virus (VSV) (15); and rinderpest virus (23); these references are hereby incorporated by reference. For the measles virus rescue system, see published International Patent Application WO 97/06270 (24); for the PIV-3 rescue system, see U.S. Provisional Patent Application 60/047575 (25); for the RSV rescue system, see published International Patent Application WO 97/12023 (26); these applications are hereby incorporated by reference.
Briefly, all Mononegavirales rescue methods can be summarized as follows. Each requires a cloned DNA equivalent of the entire viral genome, placed between a suitable DNA-dependent RNA polymerase promoter (for example, the T7 RNA polymerase promoter) and a self-cleaving ribozyme sequence (for example, the hepatitis delta ribozyme), and inserted into a propagatable bacterial plasmid. This transcription vector provides a readily manipulable DNA template from which the RNA polymerase (for example, T7 RNA polymerase) can faithfully transcribe a single-stranded RNA copy of the viral antigenome (or genome) with the precise, or nearly precise, 5' and 3' termini. The orientation of the genomic DNA copy relative to the flanking promoter and ribozyme sequences determines whether antigenomic or genomic RNA equivalents are transcribed. Rescue of new virus progeny also requires the virus-specific trans-acting proteins needed to encapsidate the naked, single-stranded viral antigenome or genome RNA transcripts into functional nucleocapsid templates: the viral nucleocapsid protein (N or NP), the polymerase-associated phosphoprotein (P) and the polymerase (L) protein. These proteins comprise the active viral RNA-dependent RNA polymerase, which must engage the nucleocapsid template in order to achieve transcription and replication.
The trans-acting proteins required for measles virus rescue are the encapsidating protein N and the polymerase complex proteins P and L. For PIV-3, the encapsidating protein is designated NP, and the polymerase complex proteins are likewise designated P and L. For RSV, the virus-specific trans-acting proteins include N, P and L, plus an additional protein, M2, the RSV-encoded transcription elongation factor.
Typically, these viral trans-acting proteins are generated from one or more plasmid expression vectors encoding the required proteins, although some or all of the required trans-acting proteins may be produced within mammalian cells that have been engineered to contain and express these virus-specific genes and gene products as stable transformants.
The typical (although not necessarily exclusive) circumstances for rescue include an appropriate mammalian cell milieu in which T7 polymerase is present to drive transcription of the antigenomic (or genomic) single-stranded RNA from the transcription vector containing the viral genomic cDNA. Either cotranscriptionally or shortly thereafter, this viral antigenome (or genome) RNA transcript is encapsidated into functional templates by the nucleocapsid protein and is engaged by the required polymerase components, which are produced concurrently from co-transfected expression plasmids encoding the required virus-specific trans-acting proteins. These events and processes lead to the prerequisite transcription of viral mRNAs, to the replication and amplification of new genomes and, thereby, to the production of novel viral progeny, that is, to rescue.
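The protein requirements just described lend themselves to a simple completeness check when planning which expression plasmids (or helper cell line products) to supply for a given rescue. The following Python sketch is illustrative only; the dictionary and function names are hypothetical, and the protein sets are taken from the preceding paragraph.

```python
# Trans-acting proteins required for rescue, as listed in the text above.
REQUIRED_TRANS_ACTING = {
    "measles": {"N", "P", "L"},
    "PIV-3": {"NP", "P", "L"},
    "RSV": {"N", "P", "L", "M2"},
}


def missing_proteins(virus, supplied):
    """Return the required trans-acting proteins not covered by the supplied set."""
    return sorted(REQUIRED_TRANS_ACTING[virus] - set(supplied))


# An RSV rescue supplied only with N, P and L lacks the M2 elongation factor.
print(missing_proteins("RSV", ["N", "P", "L"]))
```

An empty result indicates that every listed trans-acting protein is provided, whether from expression plasmids or from a stably transformed helper cell line.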
For the rescue of rabies virus, VSV and Sendai virus, the T7 polymerase is provided by the recombinant vaccinia virus vTF7-3. This system, however, requires that the rescued virus be separated from the vaccinia virus by physical or biochemical means, or by repeated passaging in cells or tissues that are not a good host for poxvirus. For rescue of MV cDNA, this requirement is avoided by creating a cell line that expresses T7 polymerase as well as the viral N and P proteins; rescue is achieved by transfecting the genome expression vector and the L gene expression vector into this helper cell line. The advantages of the vaccinia virus host range mutant MVA-T7, which expresses T7 RNA polymerase but does not replicate in mammalian cells, are exploited to rescue RSV, rinderpest virus and MV. After simultaneous expression of the necessary encapsidating proteins, the synthetic full-length antigenomic viral RNA is encapsidated, replicated and transcribed by the viral polymerase proteins, and the replicated genomes are packaged into infectious virions. In addition to such antigenomes, genome analogs of Sendai virus and PIV-3 have now also been successfully rescued (25,27).
The rescue system thus provides a composition that comprises a transcription vector comprising an isolated nucleic acid molecule encoding a genome or antigenome of a nonsegmented, negative-sense, single-stranded RNA virus of the order Mononegavirales having at least one attenuating mutation in the 3' genomic promoter region and at least one attenuating mutation in the RNA polymerase gene, together with at least one expression vector that comprises at least one isolated nucleic acid molecule encoding the trans-acting proteins necessary for encapsidation, transcription and replication (for example, N, P and L for measles virus; NP, P and L for PIV-3; N, P, L and M2 for RSV). Host cells are then transformed or transfected with the at least two vectors just described, and are cultured under conditions that permit the co-expression of these vectors so as to produce the infectious attenuated virus.
The rescued infectious virus is then tested for its desired phenotype (temperature sensitivity, cold adaptation, plaque morphology, and attenuation of transcription and replication), first by in vitro means. The mutations in the cis-acting 3' genomic promoter region are also tested using the minireplicon system, in which the required trans-acting encapsidation and polymerase activities are provided by wild-type or vaccine helper viruses, or by plasmids expressing the N and P genes and different L genes harboring gene-specific attenuating mutations.
If the rescued virus exhibits the attenuated phenotype, challenge experiments are conducted with an appropriate animal model. Non-human primates provide the preferred animal model for studying the pathogenesis of human disease. These primates are first immunized with the attenuated, recombinantly generated virus and are then challenged with the wild-type form of the virus. Monkeys are infected by various routes, including but not limited to intranasal, intratracheal or subcutaneous inoculation (29). Experimentally infected rhesus and cynomolgus macaques have also served as animal models for studies of vaccine-induced protection against measles (30). Protection is measured by such criteria as disease signs and symptoms, survival, virus shedding and antibody titers. If the desired criteria are met, the attenuated recombinant virus is considered a viable vaccine candidate for testing in humans. The "rescued" virus is considered to be "recombinantly generated", as are the progeny and later generations of the virus, which also incorporate the attenuating mutations.
Even if a "rescued" virus is underattenuated or overattenuated relative to the optimum level for vaccine use, this is still valuable information for developing the optimum vaccine strains.
Optimally, a codon containing an attenuating point mutation is stabilized by introducing a second mutation, or a second plus a third mutation, into the codon without changing the amino acid encoded by the codon bearing only the attenuating point mutation. Infectious virus clones containing both the attenuating and the stabilizing mutations are likewise generated using the cDNA "rescue" system described above.
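The codon stabilization described above amounts to choosing a synonymous codon for the attenuated amino acid that differs from the wild-type codon at two or three positions, so that reversion would require more than one nucleotide change. The Python sketch below is a minimal illustration using the standard genetic code. The function name and the example codons (ATT for isoleucine, ACT for threonine) are hypothetical; the text does not state which codons actually occur at any of the listed residues.

```python
from itertools import product

# Standard genetic code, codons ordered with each position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def stabilized_codons(wild_type_codon, attenuated_codon):
    """List codons that encode the same amino acid as the attenuated codon
    but differ from the wild-type codon at two or three positions."""
    target = CODON_TABLE[attenuated_codon]
    candidates = []
    for codon, amino_acid in CODON_TABLE.items():
        differences = sum(a != b for a, b in zip(codon, wild_type_codon))
        if amino_acid == target and differences >= 2:
            candidates.append(codon)
    return sorted(candidates)


# Hypothetical example: wild-type ATT (Ile) mutated to ACT (Thr) by one point change.
print(stabilized_codons("ATT", "ACT"))
```

In this hypothetical case the single-change codon ACT is excluded, and the remaining threonine codons each differ from ATT at two positions. Codon usage and any overlapping regulatory sequence would also have to be considered in practice; the sketch does not address either.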
Measles virus serves as a useful model for the invention because, as described herein, sequence information is now available both for its pathogenic wild-type virus and for vaccines with a proven record of efficacy against the disease.
Measles virus was first isolated in tissue culture in 1954 (31) from an infected patient named David Edmonston. This Edmonston strain of measles virus became the progenitor of many live-attenuated measles vaccines, including Moraten, which is the current vaccine in the United States (Attenuvax™; Merck Sharp & Dohme, West Point, PA), licensed in 1968 and proven effective.
Aggressive immunization programs instituted in the mid to late 1960s resulted in a precipitous drop in reported measles cases, from nearly 700,000 in 1965 to 1,500 in 1983. In parallel, other vaccine strains were also developed from the Edmonston strain (Figure 1), such as Schwarz (Institut Merieux, Lyon, France), Zagreb (Zagreb, Yugoslavia) and AIK-C (Japan). These other vaccines have also proven effective and are widely used. In earlier years, a reactogenic, underattenuated vaccine strain (Rubeovax™; Merck Sharp & Dohme) caused measles-like illness in children, and its use was therefore discontinued. It was, however, successfully further attenuated to produce the Moraten vaccine strain (see Figure 1) (32). The live measles virus vaccines provide a successful experience in the development of effective vaccines, and serve as a model for understanding the molecular mechanisms of vaccine attenuation in nonsegmented, negative-sense, single-stranded RNA viruses.
Because of its significance as a major cause of human morbidity and mortality, measles virus (MV) has been studied extensively. MV is a large, roughly spherical, enveloped particle composed of two compartments, a lipoprotein membrane and a ribonucleoprotein core, each having distinct biological functions (33). The virion envelope is a host cell-derived plasma membrane modified by three virus-specified proteins. The hemagglutinin (H; approximately 80 kilodaltons (kD)) and fusion (F1,2; approximately 60 kD) glycoproteins project from the virion surface and confer on the viral particle the capacity to attach to and enter host cells (16). Antibodies to H and/or F are considered protective, since they neutralize the ability of the virus to initiate infection (34,35,36). The matrix protein (M; approximately 37 kD) is the amphipathic protein lining the inner surface of the membrane, and is thought to orchestrate virion morphogenesis and thus consummate virus reproduction (37). The virion core contains the 15,894-nucleotide genomic RNA, upon which template activity is conferred by its intimate association with approximately 2,600 molecules of the approximately 60 kD nucleocapsid (N) protein (38,39,40). Loosely associated with this approximately one micron long helical ribonucleoprotein particle are enzymatic levels of the viral RNA-dependent RNA polymerase (L; approximately 240 kD), which, in concert with the polymerase cofactor (P; approximately 70 kD) and perhaps other virus-specified as well as host-encoded proteins, transcribes and replicates the MV genome sequences (41).
To date, the complete nucleotide sequence (limited to the Edmonston B laboratory strain and the AIK-C vaccine strain), the coding potential and the organization of the MV genome have been reported (33). The six virion structural proteins are encoded by six contiguous, non-overlapping genes arranged as follows: 3'-N-P-M-F-H-L-5'. Two additional MV gene products of as yet uncertain function have also been identified. These two nonstructural proteins, known as C (approximately 20 kD) and V (approximately 45 kD), are both encoded by the P gene: the former by a second reading frame within the P mRNA; the latter by a cotranscriptionally edited, P gene-derived mRNA that encodes a hybrid protein having the amino-terminal sequence of P and a novel zinc finger-like, cysteine-rich carboxy-terminal domain (16).
In addition to the sequences encoding the virus-specified proteins, the MV genome contains distinctive non-protein-coding domains that resemble those directing the transcription and replication pathways of related viruses (16,42).
These regulatory signals lie at the 3' and 5' ends of the MV genome and in short internal regions spanning each intercistronic boundary. The former encode the putative promoter and/or regulatory sequence elements that direct genomic transcription, genome and antigenome encapsidation, and replication. The latter signal transcription termination and polyadenylation of each monocistronic viral mRNA, followed by reinitiation of transcription of the next gene. In general, the MV polymerase complex appears to respond to these signals in much the same way as the RNA-dependent RNA polymerases of other nonsegmented negative-strand RNA viruses (16,42,43,44).
Transcription initiates at or near the 3' end of the MV genome and then proceeds in the 5' direction, producing monocistronic mRNAs (40,42,45). As the polymerase traverses the MV genome template, it encounters the putative stop/start signals, which in 3' to 5' order are: a semi-conserved transcription termination/polyadenylation signal (A/G U/G UA A/U NN A4, where A4 denotes four consecutive A residues and N may be any of the four bases), at which each monocistronic RNA is completed; a non-transcribed intergenic trinucleotide punctuation mark (CUU, except at the H:L boundary, where it is CGU); and a semi-conserved start signal required for transcription initiation of the next gene (AGG A/G NN C/A A A/G G A/U, where N may be any of the four bases) (45,46). Because some polymerase complexes fail to reinitiate, the abundance of each MV mRNA decreases as the distance of the encoding gene from the 3' end of the genome increases. This mRNA gradient corresponds directly to the relative abundance of each virus-specific protein, which indicates that MV protein expression is ultimately controlled at the level of transcription (44).
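The semi-conserved signals described above are written in a degenerate notation (alternatives separated by a slash, N for any base) that translates directly into regular expressions. The Python sketch below shows one way of making that translation, for example in order to scan a sequence for candidate gene boundaries. It is illustrative only: the function name and the token lists are this example's own reading of the consensus strings, and the test sequences in the comments are made up to fit the patterns rather than taken from any MV sequence.

```python
import re


def consensus_to_regex(tokens):
    """Convert consensus tokens such as "A/G", "N" or "AGG" into a compiled regex."""
    parts = []
    for token in tokens:
        if token == "N":
            parts.append("[ACGU]")                       # any of the four bases
        elif "/" in token:
            parts.append("[" + token.replace("/", "") + "]")  # one of the alternatives
        else:
            parts.append(token)                          # fixed base(s)
    return re.compile("".join(parts))


# Termination/polyadenylation signal: A/G U/G UA A/U NN A4
STOP_POLYA = consensus_to_regex(["A/G", "U/G", "U", "A", "A/U", "N", "N", "AAAA"])
# Intergenic trinucleotide: CUU, or CGU at the H:L boundary
INTERGENIC = re.compile("CUU|CGU")
# Start signal: AGG A/G NN C/A A A/G G A/U
GENE_START = consensus_to_regex(["AGG", "A/G", "N", "N", "C/A", "A", "A/G", "G", "A/U"])

# Made-up strings that satisfy each pattern:
#   STOP_POLYA.fullmatch("AUUAAUUAAAA"), GENE_START.fullmatch("AGGAUUCAAGA")
```

A scan for a complete boundary could then look for a `STOP_POLYA` match followed immediately by an `INTERGENIC` match and a `GENE_START` match, using sequence written in the same sense as the consensus strings.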
The 3' and 5' termini of the MV genome contain non-protein-coding sequences with distinct similarities to the leader and trailer RNA encoding regions of VSV (42). Nucleotides 1 to 55 define the region between the genomic 3' terminus and the beginning of the N gene, while 37 additional nucleotides are found between the end of the L gene and the 5' terminus of the genome. However, unlike VSV, or even the paramyxoviruses Sendai virus and NDV, MV does not transcribe these terminal regions into short, unmodified (+) or (-) sense leader RNAs (47,48,49). Instead, leader readthrough transcripts are transcribed, including full-length polyadenylated leader:N, leader:N:P and leader:N:P:M, and of course full-length antigenomic MV RNA (48,49). Thus, the short leader transcript, which is the key operational element determining the switch from transcription to replication of the single-stranded, negative polarity VSV genome (50,51,52), appears to be absent in MV. This has led to the consideration and exploration of alternative models for this important reproductive process (42).
Measles virus and all other viruses of the order Mononegavirales, except for the rhabdoviruses, appear to have extended their terminal regulatory domains beyond the confines of their leader and trailer encoding sequences (42). For measles virus, these regions encompass the 107 3' terminal genomic nucleotides (the "3' genomic promoter region", also referred to as the "extended promoter", which comprises 52 nucleotides encoding the leader region, followed by three intergenic nucleotides, and 52 nucleotides encoding the 5' untranslated region of the N mRNA) and the 109 5' terminal nucleotides (69 nucleotides encoding the 3' untranslated region of the L mRNA, the intergenic trinucleotide, and 37 nucleotides encoding the trailer). Within these approximately 100 3' terminal nucleotides of both the genome and the antigenome are two short regions of shared nucleotide sequence: 14 of the 16 nucleotides at the absolute 3' ends of the genome and antigenome are identical, and internal to those termini there is an additional region of 12 nucleotides of absolute sequence identity. Their position at and near the sites at which transcription of the MV genome must initiate and replication of the antigenome must begin suggests that these short, unique sequence domains encompass an extended promoter region.
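The segment lengths quoted for the two terminal regions can be checked against the stated totals of 107 and 109 nucleotides. The short Python tally below is only a consistency check; the variable names are hypothetical, and the 52-nucleotide leader length is taken from the description of Figure 2.

```python
# Segment lengths (in nucleotides) of the MV terminal regulatory regions.
three_prime_region = {
    "leader": 52,                 # as given in the description of Figure 2
    "intergenic trinucleotide": 3,
    "N mRNA 5' untranslated region": 52,
}
five_prime_region = {
    "L mRNA 3' untranslated region": 69,
    "intergenic trinucleotide": 3,
    "trailer": 37,
}

three_prime_total = sum(three_prime_region.values())
five_prime_total = sum(five_prime_region.values())
# Nucleotides preceding the N gene start: leader plus intergenic trinucleotide.
before_n_gene = (
    three_prime_region["leader"] + three_prime_region["intergenic trinucleotide"]
)

print(three_prime_total, five_prime_total, before_n_gene)
```

The totals agree with the 107-nucleotide 3' genomic promoter region and the 109 5' terminal nucleotides, and the leader plus the intergenic trinucleotide accounts for nucleotides 1 to 55 preceding the start of the N gene.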
These discrete sequence elements may dictate alternative sites of transcription initiation: the internal domain mandating transcription initiation at the N gene start site, and the 3' terminal domain directing the production of antigenome (42,48,53). In addition to their regulatory role as cis-acting determinants of transcription and replication, these 3' extended genomic and antigenomic promoter regions encode the nascent 5' ends of the antigenome and genome RNAs, respectively. Within these nascent RNAs reside as yet unidentified signals for N protein nucleation, another key regulatory element required for nucleocapsid template formation and, consequently, for amplification of transcription and replication. Figure 2 shows the position and sequence of these highly conserved, putative cis-acting regulatory domains.
Terminal non-protein-coding regions similar in location, size and spacing are present in the genomes of other members of the Paramyxoviridae, although only 8 to 11 of their absolute terminal nucleotides are shared with MV (42,54). The genomic termini of the Morbillivirus canine distemper virus (CDV) display a greater degree of homology with those of its relative MV: 73% of the nucleotides of the leader and trailer sequences of the two viruses are identical, including 16 of the 18 nucleotides at the absolute 3' terminus and 17 of the 18 nucleotides at the 5' terminus (55). No accessory internal CDV genomic domain homologous to the MV extended promoter has been found. However, there is a stretch of 20 nucleotides, located between nucleotides 85 and 103 and between nucleotides 15,587 and 15,606 of the CDV genome, in which 15 of the 20 nucleotides are complementary (GenBank accession number AF 14953). This indicates that CDV, like MV, contains an additional region within its non-coding 3' genomic and antigenomic termini that may provide important cis-acting promoter and/or regulatory signals (55).
In addition, the precise length of the 3' leader (55 nucleotides) is identical in several members of the Paramyxoviridae (MV, CDV, PIV-3, BPV-3, SV and NDV). Additional evidence for the importance of these extended, non-protein-coding regions comes from the analysis of a large number of distinct copy-back defective interfering (DI) viruses recently isolated from subacute sclerosing panencephalitis (SSPE) brain tissue. No DI with a stem shorter than 95 5' terminal genomic nucleotides was found. This indicates that the minimum signals needed for MV DI RNA replication and encapsidation extend well beyond the 37-nucleotide trailer sequence to encompass the additional putative internal regulatory domain (56).
Using measles virus as an example, the present invention is directed to the concept that important virulence/attenuation determinants reside in the non-protein-coding regulatory regions of the viral genome and in the trans-acting transcription/replication enzyme complex that must interact with these cis-acting elements. Cis-acting domains have been identified at both the 3' and 5' ends of the MV genome, flanking the six contiguous genes that encode the viral structural proteins, and also within the MV genome as short regions that encompass the intergenic boundaries. The former encode putative promoter and/or regulatory sequence elements that direct essential processes such as genome transcription, genome and antigenome encapsidation, and replication. The latter signal transcription termination and polyadenylation of each monocistronic viral mRNA and then reinitiation of transcription at the next gene. The transcription/replication enzyme, i.e. the RNA-dependent RNA polymerase molecule, can regulate the efficiency of transcription and/or replication and thereby determine the abundance of cytopathic viral gene products and/or progeny virions.
The evidence supporting this concept for measles virus was obtained by determining, for the first time, the nucleotide sequences of the non-coding regulatory region (the 3' genomic promoter region) and of the coding region of the L gene (together with the predicted amino acid sequences) of the progenitor Edmonston wild-type MV isolate and of the measles vaccine strains derived from it (see Fig. 1). Other wild-type isolates were also examined independently for comparison.
The nucleotide sequences (in positive-strand, antigenomic, message sense) of four wild-type and five vaccine measles strains, and the deduced amino acid sequences of the RNA polymerases (L proteins) of these measles viruses, are set out below with reference to the corresponding sequence identifiers (SEQ ID NO) used herein:
Virus            Nucleotide sequence    L protein sequence
Wild-type
  Edmonston      SEQ ID NO:1            SEQ ID NO:2
  1977           SEQ ID NO:3            SEQ ID NO:4
  1983           SEQ ID NO:5            SEQ ID NO:6
  Montefiore     SEQ ID NO:7            SEQ ID NO:8
Vaccine
  Rubeovax™      SEQ ID NO:9            SEQ ID NO:10
  Moraten        SEQ ID NO:11           SEQ ID NO:12
  Zagreb         SEQ ID NO:13           SEQ ID NO:14
  AIK-C          SEQ ID NO:15           SEQ ID NO:16
Each measles virus genome listed above is 15,894 nucleotides long. Translation of the L gene starts at the codon at nucleotides 9234-9236; the translation stop codon is at nucleotides 15783-15785. The translated L protein is 2,183 amino acids long.
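The stated coordinates can be cross-checked with simple arithmetic: an open reading frame running from nucleotide 9234 to nucleotide 15785 spans 6,552 nucleotides, i.e. 2,184 codons, the last of which is the stop codon. The following Python sketch makes the check explicit; the function name is illustrative only and is not part of the disclosure.

```python
def protein_length(first_nt_of_start_codon, last_nt_of_stop_codon):
    """Amino acids encoded by an ORF given 1-based, inclusive coordinates."""
    span = last_nt_of_stop_codon - first_nt_of_start_codon + 1
    if span % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    return span // 3 - 1  # the stop codon is not translated

# Measles virus L gene coordinates as stated in the text.
measles_l_length = protein_length(9234, 15785)
print(measles_l_length)  # 2183
```

The same function can be applied to any other start and stop coordinates given in this specification.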
Note that nucleotide 2499 of the 1983 wild-type measles virus is given as "G" in SEQ ID NO:5. In fact, this base is a mixture of "G" and "C". Note also that nucleotide 2143 of the Rubeovax™ vaccine virus is given as "T" in SEQ ID NO:9. Of the 9 clones sequenced, 7 had "T" at this position and 2 had "C"; the base may therefore be either "T" or "C".
In addition, the Schwarz vaccine virus genome is identical to the Moraten vaccine virus genome (SEQ ID NO:11), except that Schwarz has "C" rather than "T" at nucleotides 4917 and 4924.
The nucleotide differences in the 3' genomic promoter region, and the nucleotide and amino acid differences in the L gene and L protein sequences, that distinguish the Edmonston wild-type isolate, the vaccine strains and the other independently isolated wild-type viruses were then compared and aligned (see Tables 3 to 5 of Example 1 below).
As shown in Table 3, there are three changes (in antigenomic, message sense) in the 3' genomic promoter region between the progenitor wild-type MV isolate and the derived vaccine strains: at nucleotide 26, "A" becomes "T"; at nucleotide 42, "A" becomes "C" or "A" becomes "T"; and, in Zagreb only, at nucleotide 96, "G" becomes "A". In addition, the other wild-type isolates tested have "A" rather than "G" at nucleotide 50 and thus differ from both the progenitor wild-type isolate and the vaccine strains.
The predicted amino acid sequences of the L genes of the measles vaccine strains (Rubeovax™, Moraten, Schwarz, AIK-C and Zagreb) and of the wild-type isolates (1977, 1983 and Montefiore) differ from that of the progenitor wild-type isolate (Edmonston) at 49 positions in the 2,183-amino-acid open reading frame (see Tables 4 and 5 of Example 1 below).
These amino acid differences fall into four categories:
(1) Positions at which a single vaccine strain differs from the progenitor and from the other vaccine and wild-type strains, indicating a possible attenuating site.
(2) Specific differences between all wild-type and all vaccine sequences; these may also constitute important attenuating sites.
(3) Residues at which the more recent wild types differ from the older wild types; these may reflect genetic drift.
(4) Positions at which one or more vaccine strains and/or wild-type strains share an amino acid that differs from the other strains; these changes may reflect, respectively, lineage specificity among the vaccine strains, potential attenuating changes, and relatedness among the wild-type isolates.
There are four category (1) changes, in which a single vaccine differs from the other vaccine and wild-type strains. Two are found in Moraten and Schwarz (amino acids 331 and 2114) and two in AIK-C (amino acids 1624 and 2074). These mutations are of particular interest because all of these viruses are good vaccines. These positions are therefore attenuating sites.
Only one position, 1717, falls into category (2): all wild types have aspartic acid at this position and all vaccines have alanine. Interestingly, this position lies in one of two regions in which the L genes of measles virus and canine distemper virus (which are otherwise highly homologous) do not show significant conservation. This difference makes position 1717 a likely key site of attenuating mutation in measles virus.
The more recent wild types (1983 and Montefiore) differ from the older wild types (Edmonston and 1977) at five positions (149, 636, 720, 2017 and 2119); these differences therefore fall into category (3). They suggest genetic drift rather than attenuating mutation sites. In addition, Montefiore (a 1989 isolate) differs from the remaining strains at a further 16 positions (see Table 5). These may represent genetic drift (category (3)) or random variation (category (4)). The remaining 23 positions are category (4) differences, i.e. differences shared by one or more viruses.
Three of these positions (1409, 1649 and 1936) are potentially attenuating category (4) mutations. At these positions, two vaccine strains share a change relative to the progenitor wild-type strain. These changes may be related to the vaccine lineage that gave rise to the Rubeovax™ and Moraten vaccines (Fig. 1).
Applicants found that their nucleotide sequence of the AIK-C vaccine strain differs from the published sequence (33) at 21 positions, including one insertion and one deletion. Several of these differences result in coding changes, including two within the L gene (at amino acids 1477 and 2008).
Thus, as the measles progenitor strain was progressively attenuated to obtain optimal replication for the preparation of live vaccines, the natural accumulation of further changes in the L gene sequence appears to have been suppressed and limited. It is hypothesized that this limited tolerance in the number and position of L gene changes is influenced not only by the need to maintain the multifunctional capacity of the polymerase, but also by the changes that arose first in the 3' promoter, with which the subsequently produced L protein must interact (to achieve transcription and replication). In other words, optimal viral attenuation requires coordinated (i.e. linked) changes in the polymerase protein and in the cis-acting regulatory elements on which it acts.
The 3' leader shows the lowest tolerance of change: during attenuation it admitted only two highly selective changes (in antigenomic, message sense), at nucleotide 26 (in all cases "A" to "T") and at nucleotide 42 ("A" to "C" or "A" to "T"). Only one further change is present, in Zagreb, namely "G" to "A" at position 96, which may be important in combination with L gene-specific changes. The 3' leader appears to have undergone only one genetic drift change since 1954, namely "G" to "A" at position 50 (see Table 3).
During attenuation, the only changes in the 3' genomic promoter region common to all MV vaccine strains are the replacement of two pyrimidines by purines in the genome-sense strand. The co-evolution of the L gene during attenuation is thought to reflect selection for subtle changes that favor viral growth in different host cells. All of the vaccine strains were passaged in chick embryo (CE) or chick embryo fibroblasts (CEF) during their attenuation (Fig. 1). In addition, some vaccine strains were exposed to particular host cells: the Zagreb vaccine was grown in Madin-Darby canine kidney cells and in human diploid cells, and the AIK-C vaccine was adapted to sheep kidney cells. Moraten and Rubeovax™ were grown only in CE and CEF.
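The statement that the common promoter changes replace pyrimidines with purines in the genome-sense strand follows from complementing the message-sense changes given in the text (A → T at nucleotide 26; A → C or A → T at nucleotide 42). The Python sketch below illustrates this conversion; the function and variable names are illustrative assumptions, and the position numbers are simply those used in the text.

```python
COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}
PURINES = {"A", "G"}

def to_genome_sense(message_base):
    """Complement a message-sense (antigenome) base; 'T' is read as 'U'."""
    return COMPLEMENT["U" if message_base == "T" else message_base]

def base_class(base):
    return "purine" if base in PURINES else "pyrimidine"

# Changes common to all vaccine strains, as stated in message sense in the text.
MESSAGE_SENSE_CHANGES = [(26, "A", "T"), (42, "A", "C"), (42, "A", "T")]

genome_sense_changes = [
    (pos, to_genome_sense(old), to_genome_sense(new))
    for pos, old, new in MESSAGE_SENSE_CHANGES
]
for pos, old, new in genome_sense_changes:
    print(pos, f"{old}->{new}", f"{base_class(old)} -> {base_class(new)}")
```

Each listed change is a uracil in the genome-sense strand replaced by adenine or guanine, i.e. a pyrimidine replaced by a purine.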
Certain lineage-specific L gene changes (position 1649 in the Rubeovax™, Moraten and Schwarz vaccines, and the change at position 1717 in all vaccines) represent a subset of L gene adaptations to the changes in the 3' leader that modulate transcription/replication for vaccine attenuation. In addition, the changes specific to individual vaccines (category (1) changes) may provide fine tuning of viral replication/transcription for each vaccine strain.
On the basis of Table 3 and the foregoing discussion, the key attenuating mutations in the MV 3' genomic promoter region are nucleotide 26 (A → T), nucleotide 42 (A → T or A → C) and nucleotide 96 (G → A) (in antigenomic, message sense).
On the basis of Table 4 and the foregoing discussion, the key attenuating mutation sites in the L protein are as follows: amino acid residues 331 (isoleucine → threonine), 1409 (alanine → threonine), 1642 (threonine → alanine), 1649 (arginine → methionine), 1717 (aspartic acid → alanine), 1936 (histidine → tyrosine), 2074 (glutamine → arginine) and 2114 (arginine → lysine). It should be understood that the nucleotide changes responsible for these amino acid changes are not limited to those listed in Table 4 of Example 1 below; all nucleotide changes that alter a codon so that it is translated into the amino acids stated above are within the scope of the invention.
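Because the genetic code is degenerate, a given amino acid change can arise from more than one codon change. The following Python sketch, which assumes the standard genetic code in DNA (message-sense) notation, lists every codon for a target amino acid and those reachable from a starting codon by a single nucleotide change. The starting codon ATT is a hypothetical isoleucine codon chosen for illustration; the actual codons are those given in Table 4.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code; codons are generated in TCAG order to match AMINO_ACIDS.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def codons_for(amino_acid):
    """All codons encoding the given one-letter amino acid, sorted alphabetically."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)

def single_nucleotide_routes(codon, target_amino_acid):
    """Codons for the target residue that differ from `codon` at exactly one position."""
    return [
        c for c in codons_for(target_amino_acid)
        if sum(a != b for a, b in zip(codon, c)) == 1
    ]

print(codons_for("T"))                       # threonine codons
print(single_nucleotide_routes("ATT", "T"))  # hypothetical Ile codon ATT -> Thr
```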
Human parainfluenza virus type 3 (HPIV-3) is another nonsegmented, negative-sense, single-stranded, enveloped RNA virus. HPIV-3 belongs to the family Paramyxoviridae (see Table 1). The HPIV-3 genome is 15,462 nucleotides long and encodes six non-overlapping protein-coding genes (57). Five of these genes each encode one virion structural protein, designated NP (corresponding to the N protein of MV), M, F, HN (hemagglutinin-neuraminidase) and L. The sixth mRNA encodes the P protein and, from an overlapping 5'-proximal open reading frame (ORF), the C protein; it also encodes the D protein through an RNA editing mechanism.
Like MV, HPIV-3 contains a 3' non-protein-coding leader of 55 nucleotides but, unlike measles virus (in which this region is 37 nucleotides), it has a 5' trailer region of 44 nucleotides. The polymerase transcribes the genome in a linear, sequential, start-stop manner directed by transcription signals in the RNA template.
Attempts to develop a live attenuated HPIV-3 vaccine by passaging the wild-type JS strain in cell culture at suboptimal temperature have given promising results (7, 57). Several "cold-passaged" (cp) mutants were isolated from different passage levels of the JS strain. One of these mutants, derived from passage 45, is designated cp45.
This virus exhibits three notable properties: (1) cold adaptation (ca): efficient replication at the suboptimal temperature of 20 °C; (2) temperature sensitivity (ts): inability to replicate in vitro at temperatures of 39 °C or above; and (3) a small-plaque morphology. This mutant appears to be a promising vaccine candidate because (a) its ca, ts and small-plaque phenotypes remain stable after passage in cell culture; (b) its replication is restricted in both the upper and lower respiratory tract of hamsters; and (c) it induces significant protection against challenge with wild-type HPIV-3 in hamsters (58, 59).
Evaluation of this strain in rhesus monkeys showed that the attenuating mutations in cp45 are a combination of ts and non-ts mutations (60). Subsequent evaluation in chimpanzees showed that cp45 is satisfactorily attenuated while still inducing a high level of protection against wild-type virus challenge (61). Preliminary clinical evaluation of cp45 in seronegative infants and children further indicated that this candidate vaccine strain is appropriately infectious and attenuated and is moderately immunogenic (61).
The cp45 strain was grown in fetal rhesus lung (FRhL) cells and in Vero cells as follows. PIV-3 cp45 virus grown in FRhL cells was prepared by inoculating confluent FRhL cell monolayers in tissue culture flasks at an MOI of 0.1-1.0. The infected cells were incubated in EMEM medium at 32 °C. After about 7 days, when maximal cytopathic effect (syncytia) was observed, the cultures were subjected to one freeze-thaw cycle, the virus was harvested, and the pooled virus fluid was stored at -70 °C.
PIV-3 cp45 virus grown in Vero cells was prepared by allowing Vero cells to form confluent monolayers on microcarrier beads in a bioreactor and producing virus on these monolayers while the microcarrier beads were stirred continuously. The infected bioreactor cultures were maintained at 30 °C. After 4 to 5 days, when syncytial cytopathic effect (CPE) was observed, the virus was harvested. The virus-containing culture fluid was stored at -70 °C.
The nucleotide sequences (in positive-strand, antigenomic, message sense) of the HPIV-3 JS wild-type strain (89) and of the cp45 vaccine strain grown in FRhL cells and in Vero cells, and the deduced amino acid sequences of the RNA polymerases (L proteins) of these HPIV-3 viruses, are set out below with reference to the corresponding SEQ ID NOs used herein:
Virus            Nucleotide sequence    L protein sequence
Wild-type
  JS             SEQ ID NO:17           SEQ ID NO:18
Vaccine
  FRhL cp45      SEQ ID NO:19           SEQ ID NO:20
  Vero cp45      SEQ ID NO:21           SEQ ID NO:22
Each PIV-3 viral genome listed above is 15,462 nucleotides long. Translation of the L protein starts at the codon at nucleotides 8646-8648; the translation stop codon is at nucleotides 15345-15347. The translated L protein is 2,233 amino acids long.
As described in detail in Example 2 below and in Table 6 therein, on the basis of the differences between the wild-type JS strain and the cp45 mutant vaccine strain grown in FRhL cells, the key attenuating mutations in the HPIV-3 3' genomic promoter region are nucleotide 23 (T → C), nucleotide 24 (C → T), nucleotide 28 (G → T) and nucleotide 45 (T → A) (in antigenomic, message sense). As also described in detail in Example 2 and Table 6, the key mutation sites in the HPIV-3 L protein include amino acid residues 942 (tyrosine → histidine), 992 (leucine → phenylalanine) and 1558 (threonine → isoleucine).
In addition, the cp45 mutant vaccine strain grown in Vero cells has one further coding change within the L gene: amino acid residue 1292 (leucine → phenylalanine).
It should be understood that the nucleotide changes responsible for the above amino acid changes are not limited to those described in Example 2 below; all nucleotide changes that alter a codon so that it is translated into the above amino acids are within the scope of the invention.
Human respiratory syncytial virus (RSV) is another nonsegmented, negative-sense, single-stranded, enveloped RNA virus. RSV belongs to the genus Pneumovirus of the subfamily Pneumovirinae (see Table 1).
Two major human RSV subgroups, designated A and B, have been identified on the basis of the reactivity of the F and G surface glycoproteins with monoclonal antibodies (62). More recently, the A and B lineages of RSV strains have been confirmed by sequence analysis (63, 64). Bovine, ovine and caprine strains of the virus have also been isolated. The host specificity of the virus is most clearly associated with the G attachment protein, which is highly divergent between human and bovine/ovine strains (65, 66), and may be influenced, at least in part, by receptor binding.
RSV is the major cause of serious viral pneumonia and bronchitis in infants and young children. Serious disease (i.e. lower respiratory tract disease (LRD)) is most prevalent in infants under 6 months of age. It occurs most commonly in non-immune infants on their first exposure to RSV. RSV is also associated with asthma and airway hyperreactivity, and is an important cause of death in "high-risk" pediatric patients with bronchopulmonary dysplasia and congestive heart disease (CHD). It is also one of the common respiratory infections that predispose children to otitis media. In adults, RSV generally presents as an uncomplicated upper respiratory illness; in the elderly, however, it may rival influenza as a predisposing factor in the development of serious LRD, particularly bacterial bronchitis and pneumonia. Disease is usually confined to the respiratory tract, but dissemination to other organs may occur in the severely immunocompromised. The virus is spread by fomites contaminated with virus-containing respiratory secretions, and infection is initiated through the nasal, oral or conjunctival mucosa.
RSV disease is seasonal, and the virus is usually isolated only in the winter months, for example from November to April in northern latitudes. The virus is widely distributed, and about 90% of children have been infected at least once by the age of 2 years. Multiple strains co-circulate. There is no direct evidence of antigenic drift (as seen with influenza A viruses), but sequence studies show an accumulation of amino acid changes in the hypervariable regions of the G and SH proteins, suggesting that immune pressure may drive the evolution of the virus.
In mouse and cotton rat models, both the F and G proteins of RSV elicit neutralizing antibodies, and immunization with these proteins alone provides long-term protection against reinfection (67, 68).
In humans, complete immunity to RSV does not develop, and reinfections occur throughout life (69, 70); there is evidence, however, that immune factors protect against severe disease. A reduction in the severity of disease is associated with one or more prior infections, and there is evidence that children infected with one of the two major RSV subgroups have some degree of protection against reinfection with the homologous subgroup (71). These observations suggest that a live attenuated virus vaccine may provide protection sufficient to prevent serious morbidity and mortality. RSV infection elicits both antibody and cell-mediated immunity. In several studies, serum neutralizing antibodies to the F and G proteins have been found to correlate with protection against LRD, although a reduction in upper respiratory tract disease (URD) has not been demonstrated. High levels of serum antibody in infants are associated with protection against LRD, and intravenous administration of immunoglobulin with high titers of RSV neutralizing antibody has been shown to protect against severe disease in high-risk children (70, 72, 73). The role of local immunity, specifically nasal antibody, is under investigation.
The RSV virion consists of a ribonucleoprotein core enclosed in a lipoprotein envelope. Pneumovirus virions are similar in size and shape to those of all other paramyxoviruses. By negative staining and electron microscopy, the virions are irregular in shape and range in diameter from 150 to 300 nm (74). The nucleocapsid of the virus is a symmetrical helix similar to that of other paramyxoviruses, except that the helical diameter is 12 to 15 nm rather than 18 nm. The envelope consists of a lipid bilayer derived from the host cell membrane and contains virally encoded transmembrane surface glycoproteins. The viral glycoproteins mediate attachment and penetration and are organized separately into virion spikes. All members of the paramyxovirus subfamily have hemagglutinating activity, but this function is not a defining feature of the pneumoviruses, being absent in RSV but present in pneumonia virus of mice (PVM) (75). Neuraminidase activity is present in members of the genera Paramyxovirus and Rubulavirus, but is absent in measles virus and PVM (75).
RSV has two subgroups, A and B. The wild-type RSV genome (strain 2B) is a single strand of negative-sense RNA of 15,218 nucleotides (SEQ ID NO:23) that is transcribed into 10 major subgenomic mRNAs. Each of the 10 mRNAs encodes one major polypeptide chain: three are transmembrane surface proteins (G, F and SH); three are proteins that associate with the genomic RNA to form the viral nucleocapsid (N, P and L); two are nonstructural proteins (NS1 and NS2) that accumulate in infected cells but are also present in trace amounts in virions and may play a role in regulating transcription and replication; one is the nonglycosylated virion matrix protein (M); and the last is M2, another nonglycosylated protein, recently shown to be an RSV-specific transcription elongation factor (see Fig. 3). These 10 viral proteins account for most of the coding capacity of the virus.
The viral genome is encapsidated by the major nucleocapsid protein (N) and is associated with the phosphoprotein (P) and the large (L) polymerase protein. These three proteins have been shown to be necessary and sufficient to direct RNA replication of RSV minigenomes encoded by cDNA (76). Further studies have shown that the M2 protein (ORF1) is required for transcription to proceed fully (74). When M2 is absent, truncated transcripts predominate and rescue of the full-length genome does not occur (74).
The M (matrix) and M2 proteins are both internal virion-associated proteins that are not present in the nucleocapsid structure. By analogy with other nonsegmented negative-strand RNA viruses, the M protein is thought to render the nucleocapsid transcriptionally inactive before packaging and to mediate its association with the viral envelope. Only very small amounts of the NS1 and NS2 proteins have been detected in purified virions; they are currently regarded as nonstructural proteins, and their functions remain uncertain, although they may be regulators of transcription and replication. Three transmembrane surface glycoproteins are present in virions: G, F and SH. G and F (fusion) are envelope glycoproteins known to mediate attachment of the virus to, and penetration of, the host cell (77). The function of the SH protein is unknown, although a recent report suggests that it is involved in the fusion function of the virus (78).
The complete genomic sequences of two wild-type RSV subgroup B strains (2B and 18537) have now been determined (see SEQ ID NOS:23 and 25 below). The genomic RNA is neither capped nor polyadenylated (79). Both in virions and within cells, the genomic RNA is tightly associated with the N protein.
The 3' end of the genomic RNA has an extragenic leader region of 44 nucleotides that is presumed to contain the major viral promoter (Fig. 3). The 3' genomic promoter region is followed by the 10 viral genes in the order 3'-NS1-NS2-N-P-M-SH-G-F-M2-L-5' (Fig. 3). The L gene is followed by an extragenic trailer region of 145 to 149 nucleotides (Fig. 3). Each gene starts with a conserved 9-nucleotide gene start signal, 3'-GGGGCAAAU (except for the 10-nucleotide start signal of the L gene, 3'-GGGACAAAAU, which differs by a G-to-A change at the fourth position and by one additional A). Transcription of each gene begins at the first nucleotide of this signal. Each gene ends with a semi-conserved gene end signal of 12 to 14 nucleotides, 3'-AGU/GU/AANNNU/AA(3-5) (where N may be any of the four bases and (3-5) gives the number of terminal A residues), which directs transcription termination and polyadenylation (Fig. 3). The first nine genes of the RSV B strains are non-overlapping and are separated by intergenic regions of 3 to 56 nucleotides (Fig. 3). The intergenic regions do not contain any conserved motifs or any obvious secondary structural features, and have been shown in a minireplicon system to have no effect on expression of the preceding and following genes (Fig. 3). The last two RSV genes overlap by 68 nucleotides (Fig. 3). The gene start signal of the L gene lies within the M2 gene rather than after it. This 68-nucleotide overlap encodes the last 68 nucleotides of the M2 mRNA (not including the poly(A) tail) and the first 68 nucleotides of the L mRNA.
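The gene end consensus can be written as a regular expression, which also confirms the stated length of 12 to 14 nucleotides. The Python sketch below rests on one reading of the slash notation, namely A, G, (U or G), (U or A), A, three arbitrary bases, (U or A), followed by 3 to 5 A residues; that expansion is an assumption. The test strings are made up for illustration and are not taken from any RSV sequence.

```python
import re

# Assumed expansion of the consensus 3'-AGU/GU/AANNNU/AA(3-5):
# A G [U|G] [U|A] A N N N [U|A] then 3 to 5 A residues (12 to 14 nucleotides).
GENE_END = re.compile(r"AG[UG][UA]A[ACGU]{3}[UA]A{3,5}")

def is_gene_end(signal):
    """True if the whole string matches the assumed gene end consensus."""
    return GENE_END.fullmatch(signal) is not None

# Made-up examples: shortest (12 nt), longest (14 nt) and one that is too long (15 nt).
for candidate in ("AGUUACGAUAAA", "AGGAAUUUAAAAAA", "AGUUACGAUAAAAAA"):
    print(candidate, len(candidate), is_gene_end(candidate))
```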
The products of genome transcription are 10 different subgenomic polyadenylated mRNAs and a number of polycistronic polyadenylated read-through transcripts (74). Transcriptional mapping studies using UV-mediated genome inactivation showed that the RSV genes are transcribed in 3'-to-5' order from a single promoter near the 3' end (80). RSV synthesis therefore appears to follow the single-entry, sequential transcription model proposed for all Mononegavirales (16, 81). According to this model, the polymerase (L) contacts the genomic RNA, in the form of the nucleocapsid, at the 3' genomic promoter region and begins transcription at the first nucleotide. RSV mRNAs are collinear copies of the genes; no mRNA editing or splicing has been found.
Sequence analysis of intracellular RSV mRNAs showed that synthesis of each transcript begins at the first nucleotide of the respective gene start signal (74). The 5' end of each mRNA is capped with the structure m7G(5')ppp(5')Gp (where the second G is the first templated nucleotide of the mRNA), and the 3' end of each mRNA is polyadenylated (82). Both modifications are thought to be carried out co-transcriptionally by the viral polymerase. Three regions of the RSV 3' genomic promoter have been found to be important cis-acting elements (83): the first 10 nucleotides (presumed to act as the promoter), nucleotides 21-25, and the gene start signal at nucleotides 45-53 (83). In contrast to other paramyxoviruses such as measles, Sendai and PIV-3, the remainder of the leader and of the non-coding region of the RSV NS1 gene was found to be highly tolerant of insertions, deletions and substitutions (83).
Furthermore, saturation mutagenesis of the first 12 nucleotides of the 3' genomic promoter region (in which each base was replaced individually by each of the other three bases and the efficiencies of translation and replication were compared) showed that the run of U residues at nucleotides 6-10 is highly intolerant of substitution (83). In contrast, the first 5 nucleotides were relatively tolerant of multiple substitutions, and two of those at position 4 were up-mutations that increased replication and transcription of RSV-CAT RNA 4- to 20-fold. Using a bicistronic minireplicon system, the gene start and gene end motifs were shown to be signals for mRNA synthesis that appear to be self-contained and independent of the nature of the flanking sequences (84).
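Saturation mutagenesis of 12 positions, with each base replaced by each of the other three, yields 36 single-substitution mutants. The Python sketch below enumerates such a set; the 12-mer used is a placeholder for illustration only and is not the RSV leader sequence, and the function name is likewise an assumption.

```python
def saturation_mutants(sequence, alphabet="ACGU"):
    """Every single-base substitution of `sequence`.

    Returns tuples of (1-based position, original base, new base, mutant sequence).
    """
    mutants = []
    for i, old in enumerate(sequence):
        for new in alphabet:
            if new != old:
                mutants.append((i + 1, old, new, sequence[:i] + new + sequence[i + 1:]))
    return mutants

# Placeholder 12-mer, not the actual RSV promoter sequence.
placeholder = "ACGUACGUACGU"
mutants = saturation_mutants(placeholder)
print(len(mutants))  # 12 positions x 3 alternative bases = 36
```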
The L gene start signal lies 68 nucleotides upstream of the M2 gene end signal, which results in the gene overlap (Fig. 3) (74). The M2 gene end signal within the L gene causes frequent premature termination of L gene transcription. Full-length L mRNA is much less abundant and is produced only when the polymerase fails to recognize the M2 gene end motif. This results in much lower transcription of L mRNA. The gene overlap appears to be inconsistent with a linear sequential transcription model. It is not known whether the polymerase jumps back to the L gene start signal after exiting the M2 gene, or whether there is a second, internal promoter for L gene transcription (74). It is also possible that the L gene is accessed by a small fraction of polymerases that fail to initiate transcription at the M2 gene start signal and slide across the M2 gene to the L gene start signal.
The relative abundance of the various RSV mRNAs decreases with the distance of each gene from the promoter, presumably because of polymerase fall-off during sequential transcription (80). The gene overlap is a second mechanism that reduces synthesis of full-length L mRNA. In addition, some mRNAs have features that may reduce translational efficiency. The initiation codon of the SH mRNA is in a suboptimal Kozak sequence context, and the G ORF begins at the second methionine codon in the mRNA.
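The decline in mRNA abundance with distance from the promoter can be illustrated with a simple geometric model in which the polymerase continues across each gene junction with a fixed probability. This model and the probability value of 0.8 are assumptions made purely for illustration; they are not measurements, and the sketch does not account for the additional reduction in L mRNA caused by the gene overlap.

```python
# Gene order from the 3' end of the genome, as given in the text.
GENE_ORDER = ["NS1", "NS2", "N", "P", "M", "SH", "G", "F", "M2", "L"]

def relative_abundance(continue_probability, gene_order=GENE_ORDER):
    """mRNA abundance relative to the first gene, assuming the polymerase
    continues across every gene junction with the same probability."""
    return {gene: continue_probability ** i for i, gene in enumerate(gene_order)}

# 0.8 is an arbitrary illustrative value, not a measured parameter.
gradient = relative_abundance(0.8)
for gene, level in gradient.items():
    print(f"{gene:>3} {level:.3f}")
```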
RSV RNA replication is thought (74) to follow the model proposed for vesicular stomatitis virus and Sendai virus (16, 81). This involves a switch from the stop-start mode of mRNA synthesis to an antiterminator read-through mode. The result is synthesis of a positive-sense replicative-intermediate (RI) RNA that is an exact complementary copy of the genomic RNA. This in turn serves as the template for synthesis of progeny genomes. The mechanism of the switch to the antiterminator mode is thought to involve co-transcriptional encapsidation of the nascent RNA by the N protein (16, 81). RNA replication in RSV, as in other nonsegmented negative-strand RNA viruses, depends on ongoing protein synthesis (85). The predicted RI RNA has been detected for both standard virus and RSV-CAT minigenomes (74, 85). In both the standard virus and minigenome systems, the intracellular abundance of RI RNA is 10- to 20-fold lower than that of progeny genomes.
The nucleotide sequences (in positive-strand, antigenomic, message sense) of various wild-type, vaccine and revertant RSV strains, and the deduced amino acid sequences of the RNA polymerases (L proteins) of these RSV viruses, are set out below with reference to the corresponding sequence identifiers (SEQ ID NO) used herein:
Virus            Nucleotide sequence    L protein sequence
Wild-type
  2B             SEQ ID NO:23           SEQ ID NO:24
  18537          SEQ ID NO:25           SEQ ID NO:26
Vaccine
  2B33F          SEQ ID NO:27           SEQ ID NO:28
  2B20L          SEQ ID NO:29           SEQ ID NO:30
Revertant
  2B33F TS(+)    SEQ ID NO:31           SEQ ID NO:32
  2B20L TS(-)    SEQ ID NO:33           SEQ ID NO:34
The genome of each of these RSV viruses encodes an L protein 2,166 amino acids long. The genome lengths and other nucleotide information are as follows:
Virus | Genome length | L start codon | L stop codon |
Wild-type | | | |
2B | 15218 | 8502-8504 | 15000-15002 |
18537 | 15229 | 8509-8511 | 15007-15009 |
Vaccine | | | |
2B33F | 15219 | 8503-8505 | 15001-15003 |
2B20L | 15219 | 8503-8505 | 15001-15003 |
Revertant | | | |
2B33F TS (+) | 15219 | 8503-8505 | 15001-15003 |
2B20L TS (-) | 15219 | 8503-8505 | 15001-15003 |
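The stated L protein length can be cross-checked against the start and stop codon coordinates listed above. The following is only an illustrative sketch; the names `L_ORF` and `l_protein_length` are hypothetical helpers introduced here and are not part of the disclosure.

```python
# First nt of the L start codon and last nt of the L stop codon,
# copied from the coordinates listed above (antigenomic numbering).
L_ORF = {
    "2B": (8502, 15002),
    "18537": (8509, 15009),
    "2B33F": (8503, 15003),
    "2B20L": (8503, 15003),
}


def l_protein_length(start_first_nt, stop_last_nt):
    """Number of amino acids encoded between the start codon and the stop codon."""
    span = stop_last_nt - start_first_nt + 1  # nucleotides, stop codon included
    if span % 3 != 0:
        raise ValueError("ORF span is not a multiple of three")
    return span // 3 - 1  # the stop codon does not encode a residue


lengths = {virus: l_protein_length(*coords) for virus, coords in L_ORF.items()}
for virus, length in lengths.items():
    print(virus, length)
```

Each ORF spans 6,501 nucleotides, that is, 2,167 codons including the stop codon, which agrees with the 2,166-residue L protein stated above.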
As detailed in Example 3 below (see in particular Tables 7 and 8), the key attenuating mutations in the 3' genomic promoter region of RSV subgroup B are a change at nucleotide 4 (C → G) and the insertion of one additional A in the run of A's at nucleotides 6-11 (antigenomic message sense). Example 3 also details the key potential attenuating sites in the RSV L protein: amino acid residues 353 (arginine → lysine), 451 (lysine → arginine), 1229 (aspartic acid → asparagine), 2029 (threonine → isoleucine) and 2050 (asparagine → aspartic acid). It should be understood that the nucleotide changes responsible for these amino acid changes are not limited to those described in Example 3; every nucleotide change that alters a codon so that it is translated into one of the above amino acids is within the scope of the invention.
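The relationship between an L protein residue number and the nucleotide positions of its codon follows from the L start codon coordinate. A minimal sketch is given below, assuming wild-type 2B numbering with the L start codon at nucleotides 8502-8504; `codon_span` and `REPORTED` are hypothetical names used only for this illustration, and the nucleotide positions are those reported in Tables 7 and 8.

```python
L_START_2B = 8502  # first nt of the L start codon in wild-type 2B


def codon_span(residue, orf_start=L_START_2B):
    """Genome positions (antigenomic numbering) of the codon for an L residue."""
    first = orf_start + 3 * (residue - 1)
    return range(first, first + 3)


# L residue -> mutated nucleotide position (2B numbering) from Tables 7 and 8.
REPORTED = {353: 9559, 451: 9853, 1229: 12186, 2029: 14587, 2050: 14649}

for residue, nt in REPORTED.items():
    span = codon_span(residue)
    print(residue, f"{span.start}-{span.stop - 1}", nt in span)
```

Every reported nucleotide position falls inside the codon computed for the corresponding residue, so the residue numbers and nucleotide positions are mutually consistent.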
Compared with the wild-type virus that infects human and animal hosts, the attenuated viruses of the invention show a significant reduction in virulence. The degree of attenuation is such that most immunized individuals develop no symptoms of infection, while the virus retains sufficient infectivity to replicate in the vaccinee and to elicit the desired immune response.
The attenuated viruses of the invention can be used to prepare vaccines. For this purpose, the attenuated virus is adjusted to a suitable concentration and formulated with a suitable vaccine adjuvant, diluent or carrier. Physiologically acceptable media may be used as carriers. These include, but are not limited to, suitable isotonic solutions, phosphate-buffered saline and the like. Suitable adjuvants include, but are not limited to, MPL™ (3-O-deacylated monophosphoryl lipid A; RIBI ImmunoChem Research, Inc., Hamilton, MT) and IL-12 (Genetics Institute, Cambridge, MA).
In one embodiment of the invention, a formulation comprising the attenuated virus is intended for use as a vaccine. The attenuated virus may be mixed with cryoprotective additives or stabilizers, for example proteins (such as serum albumin or gelatin), sugars (such as sucrose, lactose or sorbitol), amino acids (such as sodium glutamate), salts or other protective agents. The mixture is maintained in the liquid state, or is then desiccated or lyophilized for transport and storage and mixed with water immediately before use.
Formulations comprising the attenuated viruses of the invention can be used to immunize a human or animal subject to induce protection against infection by the wild-type counterpart of the attenuated virus. The invention therefore also provides a method of immunizing an individual to induce protection against infection by an RNA virus of the order Mononegavirales, by administering to the individual an immunologically effective amount of a vaccine formulation incorporating an attenuated variant of the virus as described above.
To elicit an immune response, a sufficient amount of vaccine must be administered to the individual in an appropriate number of doses. Those skilled in the art can readily determine the total amount and the number of doses. Administration may be by any conventional effective route, for example intranasally, parenterally, orally, or topically to a mucosal surface (such as the intranasal, oral, ocular, vaginal or rectal surfaces), for example by aerosol spray. The preferred route is intranasal administration.
In another embodiment of the invention, an isolated nucleic acid molecule having the complete viral nucleotide sequence of a wild-type virus or vaccine virus of the invention is used to generate oligonucleotide probes (from either the positive-strand antigenomic message sense or the complementary negative-strand genomic sense) and to express peptides (from the positive-strand antigenomic message sense only). These are used to detect the presence of the wild-type virus and/or vaccine strain in samples of body fluids or tissues. The nucleotide sequences are used to design highly specific and sensitive diagnostic tests for the presence of virus in a sample.
Polymerase chain reaction (PCR) primers are synthesized with sequences based on the wild-type or vaccine viral sequences of the invention. RNA from the test sample is reverse transcribed, followed by PCR amplification of selected cDNA regions corresponding to the nucleotide sequences described herein (that is, regions containing nucleotides unique to a given virus strain). The amplified PCR products are identified on gels, and their specificity is confirmed by hybridization with specific nucleotide probes.
ELISA tests are used to detect the presence of antigens of the wild-type or vaccine virus strains. Peptides containing one or more unique residues (based on the wild-type or vaccine sequences described herein) are designed and selected. These peptides are then coupled to a hapten (such as keyhole limpet hemocyanin (KLH)) and used to immunize animals (for example rabbits) to produce monospecific polyclonal antibodies. A selection of polyclonal antibodies, or a combination of polyclonal and monoclonal antibodies, is then used in a "capture ELISA" to detect antigens produced by the virus.
Samples of the Moraten measles virus vaccine strain were deposited by the applicant under the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure ("Budapest Treaty") with the American Type Culture Collection (12301 Parklawn Drive, Rockville, Maryland 20852, USA) and were assigned accession number ATCC VR2587. Samples of the Vero-grown cp45 vaccine strain of HPIV-3 were deposited by the applicant under the Budapest Treaty with the American Type Culture Collection (12301 Parklawn Drive, Rockville, Maryland 20852, USA) and were assigned accession number ATCC VR2588. Samples of the 2B wild-type RSV were deposited by the applicant under the Budapest Treaty with the American Type Culture Collection (12301 Parklawn Drive, Rockville, Maryland 20852, USA) and were assigned accession number ATCC VR2586.
Given the three deposited strains above and the sequence information for these and other virus strains, the site-directed mutagenesis and rescue techniques described earlier can be used to introduce the mutations described herein into any of these virus strains (or to restore the wild-type phenotype), and to make further mutations in these strains from the group of mutations listed in Tables 3, 4 and 6 to 8 below.
The following examples are provided so that the invention may be better understood. They are for illustration only and are not to be construed as limiting the scope of the invention.
Examples
Standard molecular biology techniques are used according to the protocols described by Sambrook et al. (86).
Example 1
Measles
Before RNA was prepared for sequence analysis, the Moraten MV vaccine virus was passaged once in Vero cells directly from an Attenuvax™ vaccine vial (lot #0716B), the Schwarz vaccine virus was passaged once (lot #96G04/M179 G41D), and the Zagreb and Rubeovax™ vaccine viruses were each passaged twice. The MV wild-type isolate Montefiore (56) was passaged 5 to 6 times in Vero cells before RNA was extracted; likewise, the MV wild-type isolates 1977 and 1983 (14) were passaged 5 to 7 times before RNA was extracted for analysis. The Edmonston wild-type isolate (see Figure 1), received from Dr. J. Beeler (CBER), was the original Edmonston isolate, which had been passaged 7 times in human kidney cells and 3 times in Vero cells before receipt; it was passaged once more in Vero cells before sequence analysis.
RNA was prepared by infecting Vero cells at a multiplicity of infection (m.o.i.) of 0.1 to 1.0 and harvesting once the cytopathic effect was maximal. Total RNA was extracted from the measles virus-infected cells with Trizol™ reagent (Gibco-BRL).
Total RNA isolated from the Vero-passaged material was amplified by reverse transcriptase-PCR (Perkin-Elmer/Cetus) with measles virus-specific primer pairs (Edmonston B strain (19)) spanning the 3' and 5' promoter regions and the L gene of the viral genome. Table 2 gives the sequences of these primers. Primers SEQ ID NO:35-54, 74, 77 and 78 are in the antigenomic message sense. Primers SEQ ID NO:55-73, 75, 76 and 79 are in the genomic negative sense.
Table 2
PCR and sequencing primers for the MV L gene and genome termini
First nucleotide | Sequence | Last nucleotide | SEQ ID NO | Additional sequence listed |
9047 | CATATCACTC ACTCTGGGAT GGAG | 9070 | 35 | |
9371 | TCAGAACATC AAGCACCGCC | 9390 | 36 | |
9741 | ACAGTCAAGA CTGAGATGAG | 9760 | 37 | |
10001 | AAGAGTCAGA TACATGTGGA | 10020 | 38 | |
10351 | ACATGAATCA GCCTAAAGTC | 10370 | 39 | |
10674 | CCGAAAGAGT TCCTGCGTTA CGACC | 10698 | 40 | |
11083 | CAGTCCACAC AAGTACCAGG | 11102 | 41 | |
11461 | GTCAGAAGCT GTGGACCATC | 11480 | 42 | |
11841 | AATATTGCTA CAACAATGGC | 11860 | 43 | |
12196 | ACTCTTCATT CCTAGACTGG | 12215 | 44 | |
12542 | GTCCAATTAT GACTATGAAC | 12561 | 45 | |
12891 | AGAACAGACA TGAAGCTTGC | 12910 | 46 | |
13232 | CCAACAAGGA ATGCTTCTAG | 13251 | 47 | |
13551 | ACAGCACTAT CTATGATTGA CCTGG | 13575 | 48 | |
13930 | GCAACATGGT TTACACATGC | 13949 | 49 | |
14280 | AGATTGAGAG TTGATCCAGG | 14299 | 50 | |
14629 | AGGAGATACT TAAACTAAGC | 14648 | 51 | |
14981 | TAAGCTTATG CCTTTCAGCG | 15000 | 52 | |
15337 | TTAACGGACC TAAGCTGTGC | 15356 | 53 | |
15671 | GAAACAGATT ATTATGACGG | 15690 | 54 | |
9290 | CGGGCTATCT AGGTGAACTT CAGG | 9267 | 55 | |
9500 | ATTTGGATAT GGAATATGAG | 9481 | 56 | |
9840 | ACTCAACTGA ACTACCAGTG | 9821 | 57 | |
10181 | AAGAACATCA TGTATTTCAG | 10162 | 58 | |
10549 | TTATCAACGC ACTGCTCATG | 10530 | 59 | |
10919 | ATTTTCAGCA ATCACTTGGC ATGCC | 10895 | 60 | |
11280 | GCCTCTGTGC AAACAAGCTG | 11261 | 61 | |
11638 | TCTCTAGTTA CTCTAGCAGC | 11619 | 62 | |
12010 | AGGTCGTTGT TTGTGAGGAG | 11991 | 63 | |
12361 | TCGTCCTCTT CTTTACTGTC | 12342 | 64 | |
12689 | CCGTCCTCGA GCTAGCCTCG | 12670 | 65 | |
13052 | CTCCTCCAGG CTCACATTGG | 13033 | 66 | |
13420 | GGGTTGGTAC ATAGCTCTGC | 13401 | 67 | |
13767 | CACCCATCTG ATATTTCCCT GATGG | 13743 | 68 | |
14099 | TGGTTGACAG TACAAATCTG | 14080 | 69 | |
14460 | CTGAAATGGG AAGATTGTGC | 14441 | 70 | |
14820 | AGCAATCTAC ACTGCCTACC | 14801 | 71 | |
15180 | TCACAGATGA TTCAATTATC | 15161 | 72 | |
15530 | GATCCTAGAT ATAAGTTCTC | 15511 | 73 | |
1 | ACCAAACAAA GTTGGGTAAG G | 21 | 74 | GGGGGATCC |
100 | ATCCCTAATCCTGCTCTTGTCCC | 78 | 75 | |
200 | GATTCCTCTG ATGGCTCCAC | 181 | 76 | |
15221 | TAACAGTCAA GGAGACCAAA G | 15741 | 77 | GGGAAGCTT |
15801 | AACCCTAATCCTGCCCTAGGTGG | 15823 | 78 | |
15894 | ACCAGACAAA GCTGGGAATA GA | 15873 | 79 | |
Overlapping PCR fragments spanning the complete viral genome were sequenced directly, without cloning, on both strands by dideoxy terminator cycle sequencing (ABI PRISM 377 sequencer and ABI PRISM 377 sequencing kit) to obtain a consensus sequence. To determine the sequence of the extreme termini, the linker approach described previously (55) was used.
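The primer coordinates in Table 2 can be sanity-checked by comparing each listed coordinate span with the primer length. The sketch below covers a representative subset of the primers; the names `PRIMERS`, `span_matches_length` and `orientation` are hypothetical and introduced only for this illustration. It assumes that a primer whose first coordinate is smaller than its second is in the antigenomic message sense, which agrees with the strand assignments stated above.

```python
# (SEQ ID NO, first listed coordinate, second listed coordinate, sequence)
# copied from Table 2; spaces within the sequences have been removed.
PRIMERS = [
    (35, 9047, 9070, "CATATCACTCACTCTGGGATGGAG"),
    (40, 10674, 10698, "CCGAAAGAGTTCCTGCGTTACGACC"),
    (55, 9290, 9267, "CGGGCTATCTAGGTGAACTTCAGG"),
    (75, 100, 78, "ATCCCTAATCCTGCTCTTGTCCC"),
    (77, 15221, 15741, "TAACAGTCAAGGAGACCAAAG"),
    (79, 15894, 15873, "ACCAGACAAAGCTGGGAATAGA"),
]


def span_matches_length(first, second, sequence):
    """True if the listed coordinates cover exactly len(sequence) nucleotides."""
    return abs(second - first) + 1 == len(sequence)


def orientation(first, second):
    """Ascending coordinates are taken as antigenomic sense (an assumption)."""
    return "antigenomic" if first < second else "genomic"


inconsistent = [
    seq_id
    for seq_id, first, second, sequence in PRIMERS
    if not span_matches_length(first, second, sequence)
]
print("Coordinate/length mismatches:", inconsistent)
```

Of the primers checked here, only SEQ ID NO:77 fails the test: its 21-nucleotide sequence does not match the listed span of 15221 to 15741. This may reflect a transcription error in one of the two coordinates, but that is a conjecture and the values are reproduced in Table 2 as they appear in the source.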
To test the hypothesis of the invention, the nucleotide sequences of the non-protein-coding regulatory regions and of the L gene were determined for the progenitor Edmonston wild-type MV isolate, for the available vaccine strains derived from that isolate, and for other wild-type strains. The nucleotide (antigenomic message sense) and amino acid differences were then compared and aligned as shown in Tables 3 to 5 below (differences are shown in italics):
Table 3
Nucleotide sequence differences in the MV 3' genomic promoter region
Virus | nt 26 | nt 42 | nt 50 | nt 96 |
Edmonston w-t | A | A | G | G |
Vaccine: | | | | |
Rubeovax™ | T | C | G | G |
Moraten | T | C | G | G |
Schwarz | T | C | G | G |
Zagreb | T | T | G | A |
AIK-C | T | C | G | G |
Wild-type: | | | | |
1977 | A | A | A | G |
1983 | A | A | A | G |
Montefiore | A | A | A | G |
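The promoter differences in Table 3 can be treated as a four-position signature for each strain. The following sketch, in which `PROMOTER_BASES`, `differing_positions` and `strains_matching` are hypothetical names, lists the positions at which a strain differs from Edmonston wild-type and the strains that share a given signature. It illustrates how the table can be read and is not a validated diagnostic procedure.

```python
POSITIONS = (26, 42, 50, 96)  # nucleotide numbers, antigenomic message sense

# Bases at the four positions, copied from Table 3.
PROMOTER_BASES = {
    "Edmonston w-t": "AAGG",
    "Rubeovax": "TCGG",
    "Moraten": "TCGG",
    "Schwarz": "TCGG",
    "Zagreb": "TTGA",
    "AIK-C": "TCGG",
    "1977 w-t": "AAAG",
    "1983 w-t": "AAAG",
    "Montefiore w-t": "AAAG",
}


def differing_positions(reference, strain):
    """Positions at which `strain` differs from `reference` in Table 3."""
    ref, other = PROMOTER_BASES[reference], PROMOTER_BASES[strain]
    return [pos for pos, r, o in zip(POSITIONS, ref, other) if r != o]


def strains_matching(bases):
    """All strains in Table 3 whose four-base signature equals `bases`."""
    return sorted(name for name, sig in PROMOTER_BASES.items() if sig == bases)


for strain in PROMOTER_BASES:
    print(strain, differing_positions("Edmonston w-t", strain))
```

All five vaccine strains differ from the Edmonston wild-type progenitor at nucleotides 26 and 42, whereas the three other wild-type isolates differ from it only at nucleotide 50.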
Table 4
MV L nucleotide and amino acid differences between the Edmonston wild-type and vaccine strains
L residue | 331 | 1409 | 1624 | 1649 | 1717 | 1887 | 1936 | 2074 | 2114 |
Edmonston w-t codon | ATT | GCA | ACC | AGG | GAT | AAC | CAT | CAA | AGA |
Mutant codon | ACT | ACA | GCC | ATG | GCT | GAC | TAT | CGA | AAA |
Edmonston w-t | I | A | T | R | D | N | H | Q | R |
Rubeovax™ vac. | I | A | T | M | A | D | H | Q | R |
Moraten vac. | T | A | T | M | A | D | H | Q | K |
Schwarz vac. | T | A | T | M | A | D | H | Q | K |
Zagreb vac. | I | T | T | R | A | N | H | Q | R |
AIK-C vac. | I | T | A | R | A | N | Y | R | R |
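The amino acid changes in Table 4 follow from the listed codons under the standard genetic code. A short sketch that translates each wild-type and mutant codon pair is shown below; `CODON_TABLE`, `TABLE4_CODONS` and `changes` are hypothetical names, and the codon table is built from the conventional TCAG ordering of the standard code.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code; codons are enumerated TTT, TTC, TTA, TTG, TCT, ...
CODON_TABLE = {
    "".join(codon): AMINO[i] for i, codon in enumerate(product(BASES, repeat=3))
}

# L residue -> (Edmonston wild-type codon, mutant codon), copied from Table 4.
TABLE4_CODONS = {
    331: ("ATT", "ACT"),
    1409: ("GCA", "ACA"),
    1624: ("ACC", "GCC"),
    1649: ("AGG", "ATG"),
    1717: ("GAT", "GCT"),
    1887: ("AAC", "GAC"),
    1936: ("CAT", "TAT"),
    2074: ("CAA", "CGA"),
    2114: ("AGA", "AAA"),
}

changes = {
    residue: f"{CODON_TABLE[wt]}{residue}{CODON_TABLE[mut]}"
    for residue, (wt, mut) in TABLE4_CODONS.items()
}
for residue, change in changes.items():
    print(residue, change)
```

Each translated pair reproduces the single-letter residues shown in the amino acid rows of Table 4, for example Ile → Thr at residue 331 and Arg → Lys at residue 2114.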
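Table 5
MV L nucleotide and amino acid differences among the wild-type strains
L residue | 81 | 122 | 149 | 252 | 331 | 441 | 447 | 500 | 513 | 570 | 613 |
Edmonston w-t codon | GCC | GAT | GTT | ACA | ATT | AAA | AAA | GAT | GTG | AAA | TAC |
Mutant codon | ACC | AAT | ATT | GCA | GTT | AGA | AGA | AAT | ATG | AAT | CAC |
Edmonston w-t | A | D | V | T | I | K | K | D | V | K | Y |
1977 w-t | A | N | V | T | V | K | K | D | M | K | Y |
1983 w-t | T | D | I | T | I | K | K | N | M | N | H |
Montefiore w-t | A | D | I | A | I | R | R | D | M | K | Y |

L residue | 618 | 621 | 623 | 626 | 628 | 632 | 636 | 637 | 641 | 645 | 650 |
Edmonston w-t codon | GTC | AGT | AGG | AGA | GCA | ATA | CAA | GTA | GAC | GAT | ATG |
Mutant codon | GCC | AAT | AAG | AAA | GAA | GTA | CAT | ATA | AAT | AAT | ATA |
Edmonston w-t | V | S | R | R | A | I | Q | V | D | D | M |
1977 w-t | A | S | R | R | A | I | Q | I | D | N | M |
1983 w-t | V | N | K | R | A | I | H | V | D | D | M |
Montefiore w-t | V | S | R | K | E | V | H | V | N | D | I |

L residue | 652 | 720 | 723 | 794 | 914 | 970 | 1044 | 1294 | 1569 | 1705 | 1745 |
Edmonston w-t codon | GCT | ATC | TAT | CGG | CGG | GCC | GGA | AGC | GTT | ATC | AAT |
Mutant codon | ACC | GTC | TGC | TGG | CAG | TCA | AGA | ACC | ATT | GTC | AGT |
Edmonston w-t | A | I | Y | R | R | A | G | S | V | I | N |
1977 w-t | A | I | C | W | Q | A | G | S | V | I | N |
1983 w-t | A | V | C | R | R | S | G | T | I | I | N |
Montefiore w-t | T | V | C | R | R | A | R | S | V | V | S |

L residue | 1860 | 1865 | 1936 | 2007 | 2013 | 2017 | 2030 | 2096 | 2119 | 2165 |
Edmonston w-t codon | GTA | TTC | CAT | GAC | GAT | ACT | AAT | ATA | AAG | GTC |
Mutant codon | ATA | TAC | TAT | GGC | GGT | ATT | AGT | GTA | CGG | ATC |
Edmonston w-t | V | F | H | D | D | T | N | I | K | V |
1977 w-t | V | Y | H | D | D | T | N | I | K | V |
1983 w-t | V | F | Y | D | G | I | N | I | R | I |
Montefiore w-t | I | F | H | G | D | I | S | V | R | V |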
Example 2
PIV-3
Table 6 compares the sequence of the parental wild-type JS strain of PIV-3 with the sequences of the FRhL-grown and Vero-grown cp45 mutant strains (antigenomic message sense). Where a codon change does not alter the amino acid, Table 6 marks it "none", followed by the name of the unchanged amino acid.
Table 6
Sequence comparison of Vero-grown and FRhL-grown cp45 with the JS strain
Gene region | Nucleotide position | JS | FRhL cp45 | Vero cp45 | Codon change | Amino acid change (residue number in L) |
3' leader | 23 | T | C | C | | |
| 24 | C | T | T | | |
| 28 | G | T | T | | |
| 45 | T | A | A | | |
NP UTR | 62 | A | T | T | | |
NP coding region | 397 | T | C | C | GTA → GCC | Val → Ala |
| 1275 | T | G | G | TCT → GCT | Ser → Ala |
P coding region | 2080 | T | C | C | AAT → AAC | none/Asn |
M coding region | 4347 | C | A | A | CCC → ACC | Pro → Thr |
F coding region | 5536 | C | T | T | AAC → AAT | none/Asn |
| 6329 | A | G | G | ATA → GTA | Ile → Val |
| 6419 | G | A | A | GCA → ACA | Ala → Thr |
HN coding region | 6847 | T | C | C | GGT → GGC | none/Gly |
| 7956 | T | C | C | GTT → GCT | Val → Ala |
L coding region | 9323 | T | C | C | TAT → TAC | none/Tyr (226) |
| 9971 | A | G | G | GAA → GAG | none/Glu (442) |
| 11469 | T | C | C | TAC → CAC | Tyr → His (942) |
| 11621 | G | T | T | TTG → TTT | Leu → Phe (992) |
| 12521 | A | A | T* | TTA → TTT | Leu → Phe (1292) |
| 12581 | C | T | T | TTC → TTT | none/Phe (992) |
| 13318 | C | T | T | ACT → ATT | Thr → Ile (1558) |
Number of mutations | | | 20 | 20 | | |
Sequence analysis of the parental wild-type JS strain of PIV-3 and of the FRhL-grown cp45 mutant showed that the latter carries 20 nucleotide changes. Four of these lie in the noncoding 3' leader region: nucleotides 23 (T → C), 24 (C → T), 28 (G → T) and 45 (T → A) (antigenomic message sense). Viewed in the genomic negative sense, the change at position 28 from a smaller pyrimidine ("C") to a larger purine ("A") may alter the size of the region lying between two conserved regions within the 3' genomic promoter region, and may thereby alter the spatial presentation of the cis-acting signals to the polymerase.
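The conversion between the antigenomic message sense used in Table 6 and the genomic negative sense used in the preceding sentence is a simple base complement. A minimal sketch follows; `genome_sense` and `base_class` are hypothetical helper names, and the DNA alphabet (T in place of U) is used because the tables are written that way.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}
PURINES = {"A", "G"}


def genome_sense(change):
    """Convert an (old, new) substitution from antigenomic to genomic sense."""
    old, new = change
    return COMPLEMENT[old], COMPLEMENT[new]


def base_class(base):
    """Classify a base as a purine or a pyrimidine."""
    return "purine" if base in PURINES else "pyrimidine"


# Position 28 of the 3' leader: G -> T in the antigenomic message sense.
old, new = genome_sense(("G", "T"))
print(f"genomic sense: {old} -> {new} ({base_class(old)} -> {base_class(new)})")
```

For position 28 this gives C → A in the genomic sense, that is, a pyrimidine replaced by a purine, as described above.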
Nine changes are coding changes within the NP, M, F, HN and L genes. A further seven are noncoding or silent changes in the NP, P, F, HN and L genes or in the NP untranslated region (UTR). The cp45 mutant has been shown to have very weak transcriptional activity at the nonpermissive temperature, consistent with its ts phenotype (87). This ts phenotype has been mapped to the viral L gene (88). Because cp45 virus has been shown to function normally with respect to the mutations in the HN and F glycoproteins (87), this supports the suggestion that the mutations in the 3' leader and in the L gene contribute to the attenuated phenotype of the virus.
Thus, the four specific changes in the 3' leader region of FRhL-grown cp45, together with three coding changes in the L gene, make a significant contribution to the attenuated phenotype of the cp45 vaccine candidate. The L gene changes produce amino acid substitutions at the following positions: 942 (Tyr → His), 992 (Leu → Phe) and 1558 (Thr → Ile).
In addition, the Vero-grown cp45 vaccine strain carries a further mutation caused by a coding change in the L gene: amino acid residue 1292 (leucine → phenylalanine) (marked with an asterisk in Table 6).
The first two amino acid changes in the L protein (positions 942 and 992) map to one of the highly conserved regions of all paramyxovirus L genes. The third amino acid change (position 1558) maps to a linker region between two conserved blocks and corresponds to the change at amino acid 1717 in the MV vaccine strains.
The published report (89) identified only 18 changes between the antigenomic message sense sequences of JS and FRhL-grown cp45. The applicants found 16 of these.
The published report did not describe four changes found by the applicants: nucleotide 45 in the 3' leader (T → A); nucleotide 62 in the NP UTR (A → T); nucleotide 397 (T → C), which causes an amino acid change in the NP protein (Val → Ala); and nucleotide 1275 (T → G), which causes the amino acid change Ser → Ala (nucleotide changes are given in the antigenomic message sense). Nor did the published report describe a further potential attenuating mutation that the inventors found in the Vero-grown cp45 vaccine strain: nucleotide 12521 (A → T), which causes the change at amino acid 1292 (Leu → Phe).
Example 3
RSV Subgroup B
The temperature-sensitive (ts) phenotype is closely associated with attenuation in vivo; in addition, some non-ts mutations may also be attenuating. The ts and non-ts attenuating mutations were identified by sequence analysis and by evaluating RSV mutant and revertant strains for the ts, cold-adapted (ca) and in vivo growth phenotypes.
The complete genomes of the following five RSV 2B strains were sequenced: the 2B parent, 2B33F, the revertant 2B33F TS (+), 2B20L and the revertant 2B20L TS (+). The 2B33F and 2B20L strains are ts and ca and are described in U.S. Patent Application 08/059,444 (90), which is incorporated herein by reference. After the regions containing mutations in 2B33F and 2B20L had been identified, the same regions were sequenced in the following strains: nine additional 2B33F "revertant" isolates, obtained after in vitro passage at 39 °C or after in vivo passage in African green monkeys or chimpanzees, and nine additional 2B20L "revertant" isolates, obtained after in vitro passage at 39 °C.
Tables 7 to 12 summarize these results.
Table 7
Sequence comparison between the RSV 2B and 2B33F virus strains
Gene/region | Nucleotide position+ (from 3' end of vRNA) | RSV 2B | RSV 2B33F | RSV 2B33F TS (+), 5a revertant | Amino acid change |
Genomic promoter | 4 | C | G | G | Noncoding |
| 6 | - | Extra A | Extra A | Noncoding |
M | 4175 | T | C | C | Noncoding |
| 4199 | T | C | C | Noncoding |
SH | 4329 | T | C | C | Phe-Leu (10) |
| 4409 | T | C | C | none/Ile (36) |
| 4420 | T | C | C | Ile-Thr (40) |
| 4442 | T | C | C | none/His (47) |
| 4454 | T | C | C | none/Cys (51) |
| 4484 | T | C | C | none/Tyr (61) |
| 4497 | T | C | C | Stop-Gln (66) |
| 4505 | T | C | C | none/Ser (68) |
| 4525 | T | C | C | Ile-Thr (75) |
| 4526 | T | C | C | Ile-Thr (75) |
| 4542 | T | C | C | Stop-Gln (81) |
| 4561 | T | C | C | Leu-Pro (87) |
| 4575 | T | C | C | Trp-Arg (92) |
| 4598 | T | C | C | none/Thr (99) |
L | 9559 | G | A | A | Arg-Lys (353) |
| 9853* | A | G | A | Lys-Arg (451)* |
| 12186 | G | A | A | Asp-Asn (1229) |
| 14587 | C | T | T | Thr-Ile (2029) |
| 15071 | A | G | G | Noncoding |
+: The nucleotide position numbers for the M, SH and L genes of 2B33F and 2B33F TS (+) are slightly larger than those of 2B (the inserted A shifts the numbering downstream of the promoter by 1).
*: In the 2B33F TS (+) strain, the Lys → Arg change at nucleotide 9853 has reverted to Lys.
Table 8
Sequence comparison between the RSV 2B and 2B20L virus strains
Gene/region | Nucleotide position+ (from 3' end of vRNA) | RSV 2B | RSV 2B20L | RSV 2B20L TS (+), R1 revertant | Amino acid change |
Genomic promoter | 4 | C | G | G | Noncoding* |
| 6 | - | Extra A | Extra A | Noncoding* |
L | 8963 | C | T | T | none/Thr (154) |
| 13347 | A | A | G | Asn-Asp (1616) |
| 14587 | C | T | T | Thr-Ile (2029)* |
| 14649 | A | G | G | Asn-Asp (2050) |
| 14650 | A | A | T | Asn-Asp-Val (2050)** |
+: The nucleotide position numbers for the L gene of 2B20L and 2B20L TS (+) are slightly larger than those of 2B.
*: Mutation shared by the 2B33F and 2B20L virus strains.
**: The mutation at position 14650 in the 2B20L TS (+) revertant suppresses the ts phenotype.
Table 9
RSV 2B, ts and revertant strains
Sample | Source | In vitro ts phenotype: EOP 39/32 °C (plaque morphology) | In vitro ca phenotype: yield 20/32 °C | In vivo growth*, cotton rat: nose | In vivo growth*, cotton rat: lung | In vivo growth*, AGM: nasal wash | In vivo growth*, AGM: bronchial lavage |
RSV 2B | Wild-type parent strain | 0.7 (WT) | 0.0001 | 5.5 [a]; 3.9 [b] (4/4) | 5.8 [a]; 5.2 [b] (4/4) | 5.8 [e] (4/4) | 4.7 [e] (4/4) |
RSV 2B33F | ca, ts mutant isolated from 2B after 33 cold passages | 0.00007 (sp/int/wt) | 0.04 | ≤1.6 [a]; <1.9 [b] (1/4) | <1.5 [a]; <1.2 [b] (0/4) | 3.0 [e] (4/4) | <0.9 [e] (0/4) |
RSV 2B33F-5a TS(+) | 2B33F passaged in culture; plaque picked at 39 °C | 0.5 (WT) | 0.03 | ≤1.7 [a] (1/4) | 3.5 [a] (4/4) | 4.2 [e] (4/4) | 4.0 [e] (4/4) |
RSV 2B33F-4a TS(+) | 2B33F passaged in culture; plaque picked at 39 °C | 0.7 (WT) | 0.01 | ≤1.7 [a] (3/4) | 3.8 [a] (4/4) | ND | ND |
RSV 2B33F-3b TS(+) | 2B33F passaged in culture; plaque picked at 39 °C | 0.5 (WT) | 0.04 | ≤2.5 [a] (3/4) | 2.9 [a] (4/4) | ND | ND |
AGM pp2 | AGM #A2 infected with 2B33F, day 7 nasal wash; plaque picked at 32 °C | 0.3 (sp, int) | 0.00002 | ≤2.0 [b] (1/4) | 1.6 [b] (4/4) | ND | ND |
AGM pp4 | AGM #A2 infected with 2B33F, day 7 nasal wash; plaque picked at 32 °C | 0.1 (sp, int) | 0.008 | <1.6 [b] (0/4) | 1.2 [b] (4/4) | ND | ND |
AGM pp6 | AGM #A4 infected with 2B33F, day 12 nasal wash; plaque picked at 32 °C | 0.000004 (wt) | ≤0.00005 | ≤1.5 [b] (1/4) | <1.1 [b] (0/4) | ND | ND |
AGM pp7 | AGM #A2 infected with 2B33F, day 12 nasal wash; plaque picked at 32 °C | 0.000004 (sp/int/wt) | 0.007 | ≤1.4 [b] (1/4) | <1.0 [b] (0/4) | ND | ND |
Chimpanzee pp1A | Chimpanzee #1552 infected with 2B33F, day 4 tracheal lavage; plaque picked at 32 °C | 0.5 (WT) | ND | ND | ND | ND | ND |
Chimpanzee pp3A | Chimpanzee #1560 infected with 2B33F, day 6 tracheal lavage; plaque picked at 32 °C | 0.7 (WT) | ND | 2.4 [c] (4/4) | ≤3.0 [c] (3/4) | ND | ND |
Chimpanzee pp5A | Chimpanzee #1563 infected with 2B33F, day 6 nasal swab; plaque picked at 32 °C | 0.7 (WT) | ND | ≤2.3 [c] (3/4) | 3.0 [c] (4/4) | ND | ND |
RSV 2B20L | ca, ts mutant isolated from 2B after 20 cold passages | 0.0002 (int/wt) | 0.02 | <1.9 [d] (0/4) | <1.3 [d] (0/4) | <0.7 [f] (0/2) | <0.7 [f] (0/2) |
RSV 2B20L R1 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | 0.6 (WT) | ND | 2.3 [c] (4/4) | 3.5 [c] (4/4) | ND | ND |
RSV 2B20L R2 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | 0.6 (WT) | ND | ≤2.5 [c] (3/4) | 2.7 [c] (4/4) | ND | ND |
RSV 2B20L R9 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | 0.8 (WT) | ND | ≤2.2 [c] (3/4) | 4.0 [c] (4/4) | ND | ND |
RSV 2B20L R10 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | 0.7 (WT) | ND | 2.6 [c] (4/4) | 3.2 [c] (4/4) | ND | ND |
*: In vivo growth is reported as the log10 mean virus titer (number infected/total number). ND = not done. WT = wild-type plaque size. sp = small plaque. int = intermediate-size plaque.
[a] Dose = 10^6.6 PFU IN
[b] Dose = 10^5.6 PFU IN
[c] Dose = 10^6.3 PFU IN
[d] Dose = 10^5.9 PFU IN
[e] Dose = 10^6.6 PFU IN+IT
[f] Dose = 10^6.0 PFU IN+IT
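The efficiency of plaquing (EOP) values in Table 9 are ratios of the titer at 39 °C to the titer at 32 °C, so differences between strains are most easily compared on a log10 scale. The sketch below expresses selected EOP values from Table 9 as a log10 reduction relative to the wild-type 2B parent. The names `EOP` and `log10_reduction` are hypothetical, and no ts cutoff is implied, since this section does not state one.

```python
import math

# EOP (39/32 °C) values copied from Table 9 for selected strains.
EOP = {
    "RSV 2B": 0.7,
    "RSV 2B33F": 0.00007,
    "RSV 2B33F-5a TS(+)": 0.5,
    "RSV 2B20L": 0.0002,
    "RSV 2B20L R1 TS(+)": 0.6,
}


def log10_reduction(eop, parent_eop=EOP["RSV 2B"]):
    """log10 fold reduction in plaquing efficiency relative to the parent strain."""
    return math.log10(parent_eop / eop)


for strain, eop in EOP.items():
    print(f"{strain}: {log10_reduction(eop):.2f} log10")
```

On this scale 2B33F shows a reduction of about 4 log10 relative to 2B, whereas its 5a TS (+) revertant is within a fraction of a log of the parent, in line with the restoration of growth at 39 °C described for the revertants.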
Table 10
2B33F revertants
Base number+ | TS(+) in vitro: 5a | 4a | 3b | AGM: pp2 | pp4 | pp6 | pp7 | Chimpanzee: 1A | 3A | 5A |
M: 4176, 4200 | S | S | S | S | S | S | S | S | S | S |
SH: 14 bases* | S | S | S | S | S | S | S | S | S | S |
L: 9560 | S | S | S | S | S | S | S | S | S | S |
L: 9854 | 2B | 2B | 2B | 2B | S | S | S | ND | 2B | 2B |
L: 12187 | S | S | S | S | S | S | S | S | S | S |
L: 14588 | S | S | S | S | S | S | S | ND | S | S |
L: 15072 | S | S | S | S | S | S | S | S | S | S |
Phenotype: ts | 2B | 2B | 2B | r | r | S | S | 2B | 2B | 2B |
Phenotype: ca | S | S | S | 2B | S | 2B | S | ND | ND | ND |
Phenotype: attenuation | r | r | r | (r) | (r) | S | S | ND | r | r |
+: The base numbers for the M, SH and L genes of these 2B33F revertants are slightly larger than those of 2B (each is shifted by 1 relative to Table 7).
*: Bases 4330, 4410, 4421, 4443, 4455, 4485, 4498, 4506, 4526, 4527, 4543, 4562, 4576, 4599.
S = same base as 2B33F. 2B = reverted to the 2B base, or complete phenotypic reversion. r = moderate phenotypic reversion. (r) = slight phenotypic reversion. ND = not done.
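The base numbers in Table 10 differ from the 2B numbering of Table 7 because the extra A inserted in the promoter A run (nucleotides 6-11) shifts every downstream position by one. A minimal sketch of this renumbering follows; `to_mutant_numbering` is a hypothetical helper introduced here, and it deliberately refuses positions inside the A run, where the inserted base cannot be placed uniquely.

```python
A_RUN = range(6, 12)  # nucleotides 6-11 of wild-type 2B


def to_mutant_numbering(pos_2b):
    """Map a 2B position to 2B33F/2B20L numbering (one extra A in the A run)."""
    if pos_2b in A_RUN:
        raise ValueError("position lies inside the A run; mapping is ambiguous")
    return pos_2b + 1 if pos_2b > A_RUN[-1] else pos_2b


# 2B positions from Table 7 and the base numbers listed with Table 10.
TABLE7_M_L = [4175, 4199, 9559, 9853, 12186, 14587, 15071]
TABLE10_M_L = [4176, 4200, 9560, 9854, 12187, 14588, 15072]
TABLE7_SH = [4329, 4409, 4420, 4442, 4454, 4484, 4497,
             4505, 4525, 4526, 4542, 4561, 4575, 4598]
TABLE10_SH = [4330, 4410, 4421, 4443, 4455, 4485, 4498,
              4506, 4526, 4527, 4543, 4562, 4576, 4599]

print([to_mutant_numbering(p) for p in TABLE7_M_L] == TABLE10_M_L)
print([to_mutant_numbering(p) for p in TABLE7_SH] == TABLE10_SH)
```

Both comparisons hold, so the positions in Tables 7 and 10 describe the same bases under the two numbering schemes.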
Table 11
2B20L revertants
Base number+ | TS (+) in vitro isolates: R1 | R2 | R3A | R4A | R5A | R6A | R7A | R8A | R9A | R10A |
L: 8964 | S | S | S | S | S | S | S | S | S | S |
L: 13348 | C* | S | ND | S | S | ND | S | S | S | S |
L: 14588 | S | S | S | S | S | S | S | S | S | S |
L: 14650 | S | S | 2B | S | 2B | 2B | S | S | 2B | 2B |
L: 14651 | A* | A* | S | A* | S | S | A* | A* | S | S |
Phenotype: ts | 2B | 2B | ND | ND | ND | ND | ND | ND | 2B | 2B |
Phenotype: attenuation | r | r | ND | ND | ND | ND | ND | ND | r | r |
+: The base numbers for the L gene of these 2B20L revertants are slightly larger than those of 2B.
S = same base as 2B20L. 2B = reverted to the 2B base or phenotype. r = moderate phenotypic reversion. * = sequence change different from both 2B and 2B20L. ND = not done.
Table 12
RSV 2B, ts and revertant strains: summary of phenotypes
Virus isolate | Source | In vitro phenotype: ts | In vitro phenotype: ca | In vivo attenuation: cotton rat | In vivo attenuation: AGM |
RSV 2B | Wild-type parent strain | - | - | - | - |
RSV 2B33F | ca, ts mutant isolated from 2B after 33 cold passages | ++++ | ++ | ++++ | +++ |
RSV 2B33F-5a TS(+) | 2B33F passaged in culture; plaque picked at 39 °C | - | ++ | ++ | + |
RSV 2B33F-4a TS(+) | 2B33F passaged in culture; plaque picked at 39 °C | - | ++ | ++ | ND |
RSV 2B33F-3b TS(+) | 2B33F passaged in culture; plaque picked at 39 °C | - | ++ | ++ | ND |
AGM pp2 | AGM #A2 infected with 2B33F, day 7 nasal wash; plaque picked at 32 °C | + | - | +++ | ND |
AGM pp4 | AGM #A2 infected with 2B33F, day 7 nasal wash; plaque picked at 32 °C | + | ++ | +++ | ND |
AGM pp6 | AGM #A4 infected with 2B33F, day 12 nasal wash; plaque picked at 32 °C | ++++ | - | ++++ | ND |
AGM pp7 | AGM #A2 infected with 2B33F, day 12 nasal wash; plaque picked at 32 °C | ++++ | ++ | ++++ | ND |
Chimpanzee pp1A | Chimpanzee #1552 infected with 2B33F, day 4 tracheal lavage; plaque picked at 32 °C | - | ND | ND | ND |
Chimpanzee pp3A | Chimpanzee #1560 infected with 2B33F, day 6 tracheal lavage; plaque picked at 32 °C | - | ND | ++ | ND |
Chimpanzee pp5A | Chimpanzee #1563 infected with 2B33F, day 10 tracheal lavage; plaque picked at 32 °C | - | ND | ++ | ND |
RSV 2B20L | ca, ts mutant isolated from 2B after 20 cold passages | - | ++ | ++++ | ++++ |
RSV 2B20L R1 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | - | ND | ++ | ND |
RSV 2B20L R2 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | - | ND | ++ | ND |
RSV 2B20L R9 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | - | ND | ++ | ND |
RSV 2B20L R10 TS(+) | 2B20L passaged in culture; plaque picked at 39 °C | - | ND | ++ | ND |
ND = not done. - = wild-type phenotype, that is, not temperature-sensitive, not cold-adapted, not attenuated. + to ++++ = increasing levels of temperature sensitivity, cold adaptation or attenuation.
Can draw several important insight according to above information:
A. shown in table 7 (2B33F) and table 8 (2B20L), the sequence that identifies in these two mutant strains changes quite few: the difference of RSV 2B33F and its parent RSV 2B is that two places in the 3 ' genomic promoter region change, the non-coding of M gene 5 ' two terminal places change, and sexually revise and the non-coding change in 1 place (poly-A primitive) at intragenic 4 places of the L of coding RNA RNA-dependent polysaccharase coding.In addition, through atlas analysis, the SH gene has 14 places separately and changes.RSV 2B20L and its parent RSV 2B are only variant on 7 nucleotide positions, and wherein 3 places and 2B33F virus is total, and this comprises that change of two places and the intragenic place of L coding in the 3 ' genomic promoter region change.The distinctive change of two 2B20L of place virus is through the coding region of atlas analysis at the L gene in addition.Identified the potential attenuation sudden change that is positioned at non-coding 3 ' genomic promoter region and RNA RNA-dependent pol gene.
B. In the attenuated viruses 2B33F and 2B20L, two ts mutations can be identified within the L gene:
(i) In 2B33F, the mutation at nucleotide 9853 (A → G), which changes amino acid 451 of the L protein (Lys → Arg), is clearly associated with the ts and attenuation phenotypes. In the 2B33F TS(+) 5a strain, reversion at this position alone restored full growth at 39 °C (Table 9) and partially reversed attenuation in animals. This association with the ts and attenuation phenotypes is supported by partial sequence analysis of six further "full TS revertant" strains isolated from chimpanzees and from cell culture (4a, 3b, pp2, 3A, 5a, 5A), in which only the nucleotide 9853 mutation had reverted (Tables 10-12). (Note that an AGM (African green monkey) isolate that had reverted at position 9853 showed only partial reversion of its ts phenotype.) The amino acid 451 mutation (Lys → Arg) can be kept stable in a cDNA infectious clone construct by inserting a second mutation that stabilizes this codon, thereby reducing the likelihood of reversion to Lys.
(ii) In 2B20L, the mutation at base 14,649 (A → G), which causes a coding change in the L protein (amino acid 2,050, Asn → Asp), appears to be associated with the ts and attenuation phenotypes. In the TS(+) revertants, the aspartic acid at amino acid 2050 had either reverted (Asp → Asn) or, through a nucleotide substitution at nucleotide 14,650 (A → T), changed to another amino acid (Asp → Val) (Tables 8, 11). This interpretation is based on complete sequence analysis of the TS(+) revertant R1 and on partial sequence analysis of selected regions of several other TS(+) revertants (R2, R4A, R7A, R8A) (Table 11). In the R1 revertant, a second mutation accompanying the above reversion was also found, at nucleotide 13,347 (amino acid 1616, Asn → Asp). The effect of this mutation on the ts phenotype is not known, however; the L genes of the other revertants have not yet been completely sequenced.
C. The 2B33F and 2B20L strains share three sequence changes:
(i) Both 2B33F and 2B20L carry a change at nucleotide 14,587 (C → T) that corresponds to a change at amino acid 2029 (Thr → Ile) (Tables 7, 8). This "T" nucleotide substitution was found in 10% of the progenitor RSV 2B virus population and may have been selected for during attenuation. The wild-type "C" base was not found in either 2B33F or 2B20L virus.
(ii) Two mutations were found in the 3' genomic promoter region of both 2B33F and 2B20L: a change at nucleotide 4 (C → G) and the insertion of one extra A in the run of A's at nucleotides 6 to 11 (in antigenomic message sense). When the sequences of selected TS(+) revertants were analyzed, all 3 mutations were found to be retained in the 2B33F TS(+) 5a (Table 7) and 2B20L TS(+) R1 (Table 8) revertants. These retained non-coding, cis-acting mutations remain associated with partial attenuation of the virus.
Expression studies carried out with the RSV-CAT minireplicon system to analyze these cis-acting changes showed that, when helper L gene function was supplied by progenitor 2B virus, 2B33F or 2B33F TS(+) (the N, P and M2 genes of these viruses are identical), the nucleotide 4 change (C → G) in the 3' genomic promoter is a transcription/replication up-regulating mutation in this in vitro system.
The RSV-CAT minireplicon system was also used for complementation analysis between the 3' genomic promoters of 2B33F and 2B and the helper functions supplied by progenitor 2B virus, RSV 2B33F or 2B33F TS(+) virus. All three strains supported transcription/replication mediated by the 3' genomic promoters of both 2B and 2B33F. However, 2B33F and 2B33F TS(+) viruses showed a preference for their own 2B33F 3' genomic promoter. This analysis clearly indicates that the 3' genomic promoter co-evolved with the RNA-dependent RNA polymerase gene during attenuation of the vaccine strain. Support of RSV-CAT minireplicon transcription/replication at 37 °C clearly demonstrated that reversion of the single amino acid 451 of the L protein (Arg → Lys), identified by sequence analysis, is responsible for reversion of the ts phenotype in the 2B33F TS(+) 5a strain. 2B33F virus did not provide helper function for the RSV-CAT minireplicon (carrying either the 2B or the 2B33F 3' genomic promoter) at 37 °C.
D. The biased SH hypermutation found in 2B33F is present in all 2B33F revertants regardless of phenotype, but is absent from the ts, ca and attenuated 2B20L. There are therefore currently no data associating this mutation with any biological phenotype.
Another wild-type RSV strain, 18537, was also sequenced and compared with the wild-type RSV 2B strain. Again, the two wild-type strains are identical at all of the key residue positions described above. 2B encodes Thr at amino acid 2029 of the L protein with the codon ACA at nucleotides 14586 to 14588, whereas strain 18537 encodes Ile at amino acid 2029 with the codon ATT at nucleotides 14593 to 14595 (the start codon of the L gene is located at nucleotides 8509-8511 in strain 18537, compared with 8502-8504 in 2B).
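The nucleotide and amino acid coordinates quoted above can be cross-checked with simple arithmetic. The following Python sketch assumes that the L open reading frame of RSV 2B begins at nucleotide 8502 (the first base of the start codon, as stated above) and is read without gaps in message sense; the function name `codon_position` is illustrative only and does not come from the specification.

```python
# Sketch: map a genome nucleotide coordinate to an L-protein codon number.
# Assumption: the L ORF of RSV 2B starts at nucleotide 8502 (first base of
# the start codon) and all coordinates are 1-based, in message sense.
L_START_2B = 8502


def codon_position(nucleotide, orf_start=L_START_2B):
    """Return (codon_number, position_in_codon), both 1-based."""
    if nucleotide < orf_start:
        raise ValueError("nucleotide lies upstream of the ORF start")
    offset = nucleotide - orf_start
    return offset // 3 + 1, offset % 3 + 1


if __name__ == "__main__":
    # Mutated positions discussed in the text for 2B33F and 2B20L.
    for nt in (9853, 13347, 14587, 14649, 14650):
        codon, pos = codon_position(nt)
        print(f"nt {nt}: codon {codon}, base {pos} of the codon")
```

Under these assumptions, nucleotides 9853, 13,347, 14,587, 14,649 and 14,650 fall in codons 451, 1616, 2029, 2050 and 2050 respectively, which agrees with the amino acid positions given in the text. Passing `orf_start=8509` applies the same calculation to strain 18537.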
Embodiment 4
PCR assay for detecting measles virus
A 21-year-old patient was admitted to hospital with three weeks of progressive dry cough, shortness of breath and high fever. His symptoms had not improved after 7 days of treatment with clarithromycin, nor after a similar course of atovaquone. He also complained of right upper quadrant abdominal pain, which proved unresponsive to omeprazole and antacids. His relevant past medical history included factor VIII deficiency and HIV infection diagnosed 3-4 before this admission. One year earlier, he had received a booster dose of measles-mumps-rubella vaccine for college entrance.
Bronchoalveolar lavage and transbronchial biopsy performed two days after admission showed reactive hyperplasia, desquamation of alveolar lining cells and mild chronic inflammation. Gram, silver and PAS stains revealed no microorganisms. A chest CT scan showed multiple coalescing nodular lesions at the base of the left lung. Despite empirical antimicrobial therapy directed at the opportunistic bacterial, mycobacterial and fungal pulmonary complications of advanced HIV disease, the patient continued to run a high fever of 39 °C. A left pleural effusion developed; diagnostic thoracentesis showed it to be exudative but was otherwise non-diagnostic. A bronchoalveolar lavage performed 3 weeks later showed only alveolar histiocytes (some of them filled with hemosiderin), some lymphocytes and neutrophils. FITE, AFB and silver stains remained negative.
Two weeks later, a wedge resection of the left lung was performed by CT-guided minithoracotomy. Multiple tissue sections revealed nodular areas of acute and chronic inflammation with regional necrosis and fibrosis. Numerous multinucleated cells were present, some containing both intracytoplasmic and intranuclear inclusions, suggesting measles virus giant cell pneumonia. Special stains for bacteria, fungi, P. carinii and acid-fast microorganisms were again negative. Electron microscopy of this lung biopsy section showed particles whose morphology was consistent with a paramyxovirus such as measles virus. The serum anti-measles IgM titer was negative by a solid-phase HA test, as was the result of a subsequent IgM capture immunoassay.
Two weeks later, rhesus monkey kidney (RhMK) tissue culture cells inoculated with the patient's lung biopsy material showed the cytopathic effect characteristic of measles virus infection. This was confirmed by an immunofluorescence test using a monoclonal antibody against measles virus. On the basis of this diagnosis, oral ribavirin, 1000 mg B.I.D., was given for 14 days. Unfortunately, the patient continued to deteriorate and died 2 months later.
To confirm the nature of the measles virus present in the patient, virus obtained from infected tissue was subjected to reverse transcription and PCR amplification, followed by sequence analysis. The measles virus isolated by inoculating RhMK cells with the patient's lung biopsy was propagated for 2 passages in the continuous Vero (monkey kidney) tissue culture cell line. Total RNA was extracted from the infected second-passage Vero cells with TRIzol reagent (Life Technologies, Grand Island, NY) according to the manufacturer's instructions. Total RNA was likewise extracted from the patient's lung biopsy material. The measles virus vaccine strain currently used in the United States (Moraten), a component of the trivalent MMR vaccine, was obtained in monovalent form (Attenuvax™, Merck, Sharp & Dohme). This virus was passaged once in Vero cells, and total RNA was then extracted from the vaccine-infected cells as described above.
Each of the above RNA preparations was reverse transcribed (RT) into cDNA using random hexamer primers and Moloney murine leukemia virus reverse transcriptase (Perkin-Elmer/Cetus RT-PCR kit, Perkin-Elmer Cetus, Branchburg, NJ). These cDNAs were then amplified by PCR using measles virus-specific oligodeoxynucleotide primer pairs designed from the previously reported Edmonston measles virus sequence. The PCR products comprised a set of overlapping DNA fragments spanning the full-length, 15,894-nucleotide measles virus genome. The consensus genome sequence was determined by direct analysis of each uncloned PCR product using the dideoxy terminator cycle sequencing method established by the manufacturer (ABI PRISM 377 sequencer and ABI PRISM DNA sequencing kit; Perkin-Elmer/Cetus, Foster City, CA). Both strands of the PCR-amplified DNA products were analyzed to eliminate possible sequencing errors.
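The requirement that the PCR products form a set of overlapping fragments spanning the whole 15,894-nucleotide genome can be expressed as a simple interval check. The Python sketch below is illustrative only: the amplicon coordinates are invented for the example and are not the primer positions used in this work, and the function name `covers_genome` is hypothetical.

```python
GENOME_LENGTH = 15894  # measles virus genome length given in the text


def covers_genome(amplicons, genome_length=GENOME_LENGTH):
    """Return True if the 1-based inclusive (start, end) intervals cover
    positions 1..genome_length and each fragment overlaps the region
    already covered by at least one base."""
    ordered = sorted(amplicons)
    if not ordered or ordered[0][0] > 1:
        return False
    reach = ordered[0][1]
    for start, end in ordered[1:]:
        if start > reach:  # gap, or fragments that merely abut
            return False
        reach = max(reach, end)
    return reach >= genome_length


if __name__ == "__main__":
    # Hypothetical amplicon set, for illustration only.
    example = [(1, 2100), (1900, 4200), (4000, 6500), (6300, 8800),
               (8600, 11000), (10800, 13300), (13100, 15894)]
    print(covers_genome(example))
```

A check of this kind does not address the genome termini, where the sequence of an amplicon is dictated by the primer itself rather than by the template.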
The nucleotide sequences of selected regions of the measles virus genome present in the patient's viral isolate and in the diseased lung tissue were compared with those of the Moraten vaccine virus and with the nucleotide sequences of other wild-type measles viruses and vaccine strains. This sequence analysis showed identity with the Moraten vaccine strain and no relationship to past or currently circulating wild-type viruses or to other measles vaccine strains.
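A comparison of this kind amounts to listing the positions at which two aligned sequences differ. A minimal Python sketch follows, assuming the two sequences have already been aligned to the same length and contain no insertions or deletions; the short sequences and the function name `point_differences` are illustrative only and are not data from this study.

```python
def point_differences(reference, query):
    """List (position, reference_base, query_base) for every mismatch.

    Positions are 1-based. Both inputs must already be aligned to the
    same length; insertions and deletions are not handled.
    """
    if len(reference) != len(query):
        raise ValueError("sequences must be aligned to the same length")
    pairs = zip(reference.upper(), query.upper())
    return [(i, r, q) for i, (r, q) in enumerate(pairs, start=1) if r != q]


if __name__ == "__main__":
    # Toy sequences, not taken from any measles virus strain.
    print(point_differences("ACGTACGTAC", "ACGTTCGTAC"))
```

An empty list corresponds to identity over the compared region, which is the result reported here for the patient's isolate and the Moraten vaccine strain.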
ELISA for detecting RSV
An ELISA is used to detect the presence of RSV. Peptides are designed and selected on the basis of their homology to the RSV sequences described herein that are specific either to all subgroup B strains or to an individual wild-type, vaccine or revertant subgroup B RSV strain. These peptides are then coupled to KLH and used to immunize rabbits in order to raise monospecific polyclonal antibodies. These polyclonal antibodies are selected, or polyclonal and monoclonal antibodies are combined, and then used in a "capture ELISA" to detect the presence of RSV antigen.
References
1. Kapikian, A.Z., et al., Am. J. Epidemiol., 89, 405-421 (1969).
2. Chin, J., et al., Am. J. Epidemiol., 89, 449-463 (1969).
3. Fulginiti, V.A., et al., Am. J. Epidemiol., 89, 435-448 (1969).
4. Prince, G.A., et al., J. Virology, 57, 721-728 (1986).
5. Kim, H.W., et al., Pediatrics, 52, 56-63 (1973).
6. Hodes, D.S., et al., Proc. Soc. Exp. Biol. Med., 145, 1158-1164 (1974).
7. Belshe, R.B., and Hissom, F.K., J. Med. Virol., 10, 235-242 (1982).
8. Black, F.L., et al., Am. J. Epidemiol., 124, 442-452 (1986).
9. Lennon, J.L., and Black, F.L., J. Pediatrics, 108, 671-676 (1986).
10. Pabst, H.F., et al., Pediatr. Infect. Dis. J., 11, 525-529 (1992).
11. Centers for Disease Control, MMWR, 40, 369-372 (1991).
12. Centers for Disease Control, MMWR, 41:S6, 1-12 (1992).
13. King, G.E., et al., Pediatr. Infect. Dis. J., 10, 883-887 (1991).
14. Rota, J.S., et al., Virology, 188, 135-142 (1992).
15. Rota, J.S., et al., Virus Res., 31, 317-330 (1994).
16. Lamb, R.A., and Kolakofsky, D., pp. 1177-1204 of Vol. 1, Fields Virology, B.N. Fields, et al., Eds. (3rd ed., Raven Press, 1996).
17. Sidhu, M.S., et al., Virology, 193, 50-65 (1993).
18. Garcin, D., et al., EMBO J., 14, 6087-6094 (1995).
19. Radecke, F., et al., EMBO J., 14, 5773-5783 (1995).
20. Collins, P.L., et al., Proc. Natl. Acad. Sci., USA, 92, 11563-11567 (1995).
21. Published European Patent Application No. 702,085.
22. Published International Application No. WO 96/10400.
23. Baron, M.D., and Barrett, T., J. Virology, 71, 1265-1271 (1997).
24. Published International Application No. WO 97/06270.
25. U.S. Provisional Patent Application 60/047575.
26. Published International Application No. WO 97/12032.
27. Kato, A., et al., Genes to Cells, 1, 569-579 (1996).
28. Sidhu, M.S., et al., Virology, 208, 800-807 (1995).
29. Shaffer, M.F., et al., J. Immunol., 41, 241-256 (1941).
30. Enders, J.F., et al., N. Engl. J. Med., 263, 153-159 (1960).
31. Enders, J.F., and Peebles, M.E., Proc. Soc. Exp. Biol. Med., 86, 227-286 (1954).
32. Schwarz, A.J.F., Am. J. Dis. Child., 103, 216-219 (1962).
33. Griffin, D.E., and Bellini, W.J., pp. 1267-1312 of Vol. 1, Fields Virology, B.N. Fields, et al., Eds. (3rd ed., Raven Press, 1996).
34. Birrer, M.J., et al., Virology, 108, 381-390 (1981).
35. Birrer, M.J., et al., Nature, 293, 67-69 (1981).
36. Norby, E., et al., pp. 481-507, The Paramyxoviruses, D. Kingsbury, Ed. (Plenum Press, 1991).
37. Peebles, M.E., pp. 427-456, The Paramyxoviruses, D. Kingsbury, Ed. (Plenum Press, 1991).
38. Egelman, E.H., et al., J. Virol., 63, 2233-2243 (1989).
39. Udem, S.A., et al., J. Virol. Methods, 8, 123-136 (1984).
40. Udem, S.A., and Cook, K.A., J. Virol., 49, 57-65 (1984).
41. Moyer, S.A., and Horikami, S.M., pp. 249-274, The Paramyxoviruses, D. Kingsbury, Ed. (Plenum Press, 1991).
42. Blumberg, B., et al., pp. 235-247, The Paramyxoviruses, D. Kingsbury, Ed. (Plenum Press, 1991).
43. Barrett, T., et al., pp. 83-102, The Paramyxoviruses, D. Kingsbury, Ed. (Plenum Press, 1991).
44. Tordo, N., et al., Sem. in Virology, 3, 341-357 (1992).
45. Cattaneo, R., et al., EMBO J., 6, 681-688 (1987).
46. Crowley, J.C., et al., Virology, 164, 498-506 (1988).
47. Banerjee, A.K., and Barik, S., et al., Virology, 188, 417-428 (1992).
48. Castaneda, S.J., and Wong, T.C., J. Virol., 63, 2977-2986 (1989).
49. Chan, J., et al., pp. 221-231, Genetics and Pathogenicity of Negative Stranded Viruses, B.W.J. Mahy and D. Kolakofsky, Eds. (Elsevier Biomedical Press, 1989).
50. Blumberg, B., et al., Cell, 23, 837-845 (1981).
51. Blumberg, B., et al., Cell, 32, 559-567 (1983).
52. Kolakofsky, D., and Blumberg, B.M., pp. 203-213, Virus Persistence, B.W.J. Mahy, et al., Eds. (Cambridge University Press, 1982).
53. Castaneda, S.J., and Wong, T.C., J. Virol., 64, 222-230 (1990).
54. Curran, J.A., and Kolakofsky, D., Virology, 182, 168-176 (1991).
55. Sidhu, M.S., et al., Virology, 193, 66-72 (1993).
56. Sidhu, M.S., et al., Virology, 202, 631-641 (1994).
57. Collins, P.L., et al., pp. 1205-1241 of Vol. 1, Fields Virology, B.N. Fields, et al., Eds. (3rd ed., Raven Press, 1996).
58. Crookshanks, F.K., and Belshe, R.B., J. Med. Virol., 13, 243-249 (1984).
59. Crookshanks-Newman, F.K., and Belshe, R.B., J. Med. Virol., 18, 131-137 (1986).
60. Hall, S.L., et al., Virus Res., 22, 173-184 (1992).
61. Karron, R.A., et al., J. Infect. Dis., 172, 1445-1450 (1995).
62. Anderson, L.J., et al., J. Infect. Dis., 151, 626-633 (1985).
63. Collins, P.L., pp. 103-162 of The Paramyxoviruses, D.W. Kingsbury, Ed. (Plenum Press, NY and London, 1991).
64. Sullender, W.M., J. Virology, 65, 5425-5434 (1991).
65. Lerch, R.A., et al., J. Virology, 64, 5559-5569 (1990).
66. Mallipeddi, S.K., and Samal, S.K., J. Gen. Virol., 74, 2787-2791 (1993).
67. Johnson, P.R., et al., J. Virology, 61, 3163-3166 (1987).
68. Stott, E.J., et al., J. Virology, 61, 3855-3861 (1987).
69. Henderson, F.W., et al., N. Engl. J. Med., 300, 530-534 (1979).
70. Hall, S.L., et al., J. Infect. Dis., 163, 693-698 (1991).
71. Mufson, M.A., et al., J. Gen. Virol., 66, 2111-2124 (1985).
72. Glezen, W.P., et al., Am. J. Dis. Child., 140, 543-546 (1986).
73. Hemming, V.G., et al., Clin. Microbiol. Rev., 8, 22-33 (1995).
74. Collins, P.L., et al., pp. 1313-1351 of Vol. 1, Fields Virology, B.N. Fields, et al., Eds. (3rd ed., Raven Press, 1996).
75. Ling, R., and Pringle, C.R., J. Gen. Virol., 70, 1427-1440 (1989).
76. Yu, Q., et al., J. Virology, 69, 2412-2419 (1995).
77. McIntosh, K., and Chanock, R.M., pp. 1045-1072 of Virology, B.N. Fields, et al., Eds. (2nd ed., Raven Press, 1990).
78. Heminway, B.R., et al., p. 167 of Abstracts of the IX International Congress of Virology, P17-2 (1993).
79. Mink, M.A., et al., Virology, 185, 615-624 (1991).
80. Dickens, L.E., et al., J. Virology, 52, 364-369 (1990).
81. Wagner, R.R., and Rose, J.K., pp. 1121-1135 of Vol. 1, Fields Virology, B.N. Fields, et al., Eds. (3rd ed., Raven Press, 1996).
82. Barik, S., J. Gen. Virol., 74, 485-490 (1993).
83. Collins, P.L., et al., pp. 259-264 of Vaccines 93: Modern Approaches to New Vaccines Including Prevention of AIDS, F. Brown et al., Eds. (Cold Spring Harbor Laboratory Press, NY, 1993).
84. Kuo, L., et al., J. Virology, 70, 6892-6901 (1996).
85. Huang, Y.T., and Wertz, G.W., J. Virology, 43, 150-157 (1982).
86. Sambrook, J., et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989).
87. Ray, R., et al., J. Virol., 69, 1959-1963 (1995).
88. Ray, R., et al., J. Virol., 70, 580-584 (1996).
89. Stokes, A., et al., Virus Research, 30, 43-52 (1993).
90. U.S. Patent Application No. 08/059,444.
Sequence Listing
(1) General information:
(i) Applicant: Udem, Stephen A.
Sidhu, Mohinderjit S.
Tatem, Joanne M.
Murphy, Brian R.
Randolph, Valerie B.
(ii) Title of invention: 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of the order designated Mononegavirales
(iii) Number of sequences: 79
(iv) Correspondence address:
(A) Addressee: American Home Products Corporation
(B) Street: One Campus Drive
(C) City: Parsippany
(D) State: New Jersey
(E) Country: United States
(F) Postal code: 07054
(v) Computer-readable form:
(A) Medium type: floppy disk
(B) Computer: IBM PC compatible
(C) Operating system: PC-DOS/MS-DOS
(D) Software: PatentIn Release #1.0, Version #1.30
(vi) Current application data:
(A) Application number: US
(B) Filing date:
(C) Classification:
(viii) Attorney/agent information:
(A) Name: Gordon, Alan M.
(B) Registration number: 30,637
(C) Reference/docket number: 33,294 PCT
(ix) Telecommunication information:
(A) Telephone: 973/683-2157
(B) Telefax: 973/683-4117
(2) Information for SEQ ID NO:1:
(i) Sequence characteristics:
(A) Length: 15894 base pairs
(B) Type: nucleic acid
(C) Strandedness: single
(D) Topology: linear
(ii) Molecule type: RNA (genomic)
(xi) Sequence description: SEQ ID NO:1:
ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTG CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120
TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240
TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420
TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540
GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840
GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020
AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080
GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200
GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260
GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320
AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380
GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440
GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500
CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560
CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620
CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680
ACTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800
GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920
ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980
GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040
CGCGGCCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400
AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460
GGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520
TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640
GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820
CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060
CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120
CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360
CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600
TGCTGGGGGT TGTTGAGGGC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660
CCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AGAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960
GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020
GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AGATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380
AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500
GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560
GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA CACAAGGCCA 4620
CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680
TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740
ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800
GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860
ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920
CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGTTC 4980
CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040
AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160
GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220
ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280
CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340
CTCCTCCTTT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400
CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAGACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940
GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120
CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180
ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300
AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360
GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TCCACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600
TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720
GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780
TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080
ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140
CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200
TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320
TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTGTTTG TCATGTCTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440
CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620
CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA AGTCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980
TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040
AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100
GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160
AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220
CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520
GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640
GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC AACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820
ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880
TGTGGTTTAT TACGTTTACA GCCCAGGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120
ATAGGGCTGC TAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240
CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480
CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600
AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840
TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900
TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080
TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260
TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560
TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800
TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040
GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100
GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160
ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280
TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340
CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400
ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520
GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060
TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180
CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300
TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480
AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660
ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080
CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380
CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680
TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920
CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040
GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100
GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160
ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCCCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460
GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580
ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820
TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880
AGACCTTGCC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940
TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060
TATACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120
AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360
AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480
CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540
TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600
ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780
ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840
TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) Information for SEQ ID NO:2:
...
(i) sequence characteristics:
(A) length: 2183 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:2:
Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu
1               5                   10                  15
Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
            20                  25                  30
Arg Val Pro His Ala Tyr Ser Leu Glu Asp Pro Thr Leu Cys Gln Asn
        35                  40                  45
Ile Lys His Arg Leu Lys Asn Gly Phe Ser Asn Gln Met Ile Ile Asn
    50                  55                  60
Asn Val Glu Val Gly Asn Val Ile Lys Ser Lys Leu Arg Ser Tyr Pro
65                  70                  75                  80
Ala His Ser His Ile Pro Tyr Pro Asn Cys Asn Gln Asp Leu Phe Asn
                85                  90                  95
Ile Glu Asp Lys Glu Ser Thr Arg Lys Ile Arg Glu Leu Leu Lys Lys
            100                 105                 110
Gly Asn Ser Leu Tyr Ser Lys Val Ser Asp Lys Val Phe Gln Cys Leu
        115                 120                 125
Arg Asp Thr Asn Ser Arg Leu Gly Leu Gly Ser Glu Leu Arg Glu Asp
    130                 135                 140
Ile Lys Glu Lys Val Ile Asn Leu Gly Val Tyr Met His Ser Ser Gln
145                 150                 155                 160
Trp Phe Glu Pro Phe Leu Phe Trp Phe Thr Val Lys Thr Glu Met Arg
                165                 170                 175
Ser Val Ile Lys Ser Gln Thr His Thr Cys His Arg Arg Arg His Thr
            180                 185                 190
Pro Val Phe Phe Thr Gly Ser Ser Val Glu Leu Leu Ile Ser Arg Asp
        195                 200                 205
Leu Val Ala Ile Ile Ser Lys Glu Ser Gln His Val Tyr Tyr Leu Thr
    210                 215                 220
Phe Glu Leu Val Leu Met Tyr Cys Asp Val Ile Glu Gly Arg Leu Met
225                 230                 235                 240
Thr Glu Thr Ala Met Thr Ile Asp Ala Arg Tyr Thr Glu Leu Leu Gly
                245                 250                 255
Arg Val Arg Tyr Met Trp Lys Leu Ile Asp Gly Phe Phe Pro Ala Leu
            260                 265                 270
Gly Asn Pro Thr Tyr Gln Ile Val Ala Met Leu Glu Pro Leu Ser Leu
        275                 280                 285
Ala Tyr Leu Gln Leu Arg Asp Ile Thr Val Glu Leu Arg Gly Ala Phe
    290                 295                 300
Leu Asn His Cys Phe Thr Glu Ile His Asp Val Leu Asp Gln Asn Gly
305                 310                 315                 320
Phe Ser Asp Glu Gly Thr Tyr His Glu Leu Ile Glu Ala Leu Asp Tyr
                325                 330                 335
Ile Phe Ile Thr Asp Asp Ile His Leu Thr Gly Glu Ile Phe Ser Phe
            340                 345                 350
Phe Arg Ser Phe Gly His Pro Arg Leu Glu Ala Val Thr Ala Ala Glu
        355                 360                 365
Asn Val Arg Lys Tyr Met Asn Gln Pro Lys Val Ile Val Tyr Glu Thr
    370                 375                 380
Leu Met Lys Gly His Ala Ile Phe Cys Gly Ile Ile Ile Asn Gly Tyr
385                 390                 395                 400
Arg Asp Arg His Gly Gly Ser Trp Pro Pro Leu Thr Leu Pro Leu His
                405                 410                 415
Ala Ala Asp Thr Ile Arg Asn Ala Gln Ala Ser Gly Glu Gly Leu Thr
            420                 425                 430
His Glu Gln Cys Val Asp Asn Trp Lys Ser Phe Ala Gly Val Lys Phe
        435                 440                 445
Gly Cys Phe Met Pro Leu Ser Leu Asp Ser Asp Leu Thr Met Tyr Leu
    450                 455                 460
Lys Asp Lys Ala Leu Ala Ala Leu Gln Arg Glu Trp Asp Ser Val Tyr
465                 470                 475                 480
Pro Lys Glu Phe Leu Arg Tyr Asp Pro Pro Lys Gly Thr Gly Ser Arg
                485                 490                 495
Arg Leu Val Asp Val Phe Leu Asn Asp Ser Ser Phe Asp Pro Tyr Asp
            500                 505                 510
Val Ile Met Tyr Val Val Ser Gly Ala Tyr Leu His Asp Pro Glu Phe
        515                 520                 525
Asn Leu Ser Tyr Ser Leu Lys Glu Lys Glu Ile Lys Glu Thr Gly Arg
    530                 535                 540
Leu Phe Ala Lys Met Thr Tyr Lys Met Arg Ala Cys Gln Val Ile Ala
545                 550                 555                 560
Glu Asn Leu Ile Ser Asn Gly Ile Gly Lys Tyr Phe Lys Asp Asn Gly
                565                 570                 575
Met Ala Lys Asp Glu His Asp Leu Thr Lys Ala Leu His Thr Leu Ala
            580                 585                 590
Val Ser Gly Val Pro Lys Asp Leu Lys Glu Ser His Arg Gly Gly Pro
        595                 600                 605
Val Leu Lys Thr Tyr Ser Arg Ser Pro Val His Thr Ser Thr Arg Asn
    610                 615                 620
Val Arg Ala Ala Lys Gly Phe Ile Gly Phe Pro Gln Val Ile Arg Gln
625                 630                 635                 640
Asp Gln Asp Thr Asp His Pro Glu Asn Met Glu Ala Tyr Glu Thr Val
                645                 650                 655
Ser Ala Phe Ile Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
            660                 665                 670
Tyr Glu Thr Ile Ser Leu Phe Ala Gln Arg Leu Asn Glu Ile Tyr Gly
        675                 680                 685
Leu Pro Ser Phe Phe Gln Trp Leu His Lys Arg Leu Glu Thr Ser Val
    690                 695                 700
Leu Tyr Val Ser Asp Pro His Cys Pro Pro Asp Leu Asp Ala His Ile
705                 710                 715                 720
Pro Leu Tyr Lys Val Pro Asn Asp Gln Ile Phe Ile Lys Tyr Pro Met
                725                 730                 735
Gly Gly Ile Glu Gly Tyr Cys Gln Lys Leu Trp Thr Ile Ser Thr Ile
            740                 745                 750
Pro Tyr Leu Tyr Leu Ala Ala Tyr Glu Ser Gly Val Arg Ile Ala Ser
        755                 760                 765
Leu Val Gln Gly Asp Asn Gln Thr Ile Ala Val Thr Lys Arg Val Pro
    770                 775                 780
Ser Thr Trp Pro Tyr Asn Leu Lys Lys Arg Glu Ala Ala Arg Val Thr
785                 790                 795                 800
Arg Asp Tyr Phe Val Ile Leu Arg Gln Arg Leu His Asp Ile Gly His
                805                 810                 815
His Leu Lys Ala Asn Glu Thr Ile Val Ser Ser His Phe Phe Val Tyr
            820                 825                 830
Ser Lys Gly Ile Tyr Tyr Asp Gly Leu Leu Val Ser Gln Ser Leu Lys
        835                 840                 845
Ser Ile Ala Arg Cys Val Phe Trp Ser Glu Thr Ile Val Asp Glu Thr
    850                 855                 860
Arg Ala Ala Cys Ser Asn Ile Ala Thr Thr Met Ala Lys Ser Ile Glu
865                 870                 875                 880
Arg Gly Tyr Asp Arg Tyr Leu Ala Tyr Ser Leu Asn Val Leu Lys Val
                885                 890                 895
Ile Gln Gln Ile Leu Ile Ser Leu Gly Phe Thr Ile Asn Ser Thr Met
            900                 905                 910
Thr Arg Asp Val Val Ile Pro Leu Leu Thr Asn Asn Asp Leu Leu Ile
        915                 920                 925
Arg Met Ala Leu Leu Pro Ala Pro Ile Gly Gly Met Asn Tyr Leu Asn
    930                 935                 940
Met Ser Arg Leu Phe Val Arg Asn Ile Gly Asp Pro Val Thr Ser Ser
945                 950                 955                 960
Ile Ala Asp Leu Lys Arg Met Ile Leu Ala Ser Leu Met Pro Glu Glu
                965                 970                 975
Thr Leu His Gln Val Met Thr Gln Gln Pro Gly Asp Ser Ser Phe Leu
            980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Ala Asn Leu Val Cys Val Gln Ser
        995                 1000                1005
Ile Thr Arg Leu Leu Lys Asn Ile Thr Ala Arg Phe Val Leu Ile His
    1010                1015                1020
Ser Pro Asn Pro Met Leu Lys Gly Leu Phe His Asp Asp Ser Lys Glu
1025                1030                1035                1040
Glu Asp Glu Gly Leu Ala Ala Phe Leu Met Asp Arg His Ile Ile Val
                1045                1050                1055
Pro Arg Ala Ala His Glu Ile Leu Asp His Ser Val Thr Gly Ala Arg
            1060                1065                1070
Glu Ser Ile Ala Gly Met Leu Asp Thr Thr Lys Gly Leu Ile Arg Ala
        1075                1080                1085
Ser Met Arg Lys Gly Gly Leu Thr Ser Arg Val Ile Thr Arg Leu Ser
    1090                1095                1100
Asn Tyr Asp Tyr Glu Gln Phe Arg Ala Gly Met Val Leu Leu Thr Gly
1105                1110                1115                1120
Arg Lys Arg Asn Val Leu Ile Asp Lys Glu Ser Cys Ser Val Gln Leu
                1125                1130                1135
Ala Arg Ala Leu Arg Ser His Met Trp Ala Arg Leu Ala Arg Gly Arg
            1140                1145                1150
Pro Ile Tyr Gly Leu Glu Val Pro Asp Val Leu Glu Ser Met Arg Gly
        1155                1160                1165
His Leu Ile Arg Arg His Glu Thr Cys Val Ile Cys Glu Cys Gly Ser
    1170                1175                1180
Val Asn Tyr Gly Trp Phe Phe Val Pro Ser Gly Cys Gln Leu Asp Asp
1185                1190                1195                1200
Ile Asp Lys Glu Thr Ser Ser Leu Arg Val Pro Tyr Ile Gly Ser Thr
                1205                1210                1215
Thr Asp Glu Arg Thr Asp Met Lys Leu Ala Phe Val Arg Ala Pro Ser
            1220                1225                1230
Arg Ser Leu Arg Ser Ala Val Arg Ile Ala Thr Val Tyr Ser Trp Ala
        1235                1240                1245
Tyr Gly Asp Asp Asp Ser Ser Trp Asn Glu Ala Trp Leu Leu Ala Arg
    1250                1255                1260
Gln Arg Ala Asn Val Ser Leu Glu Glu Leu Arg Val Ile Thr Pro Ile
1265                1270                1275                1280
Ser Thr Ser Thr Asn Leu Ala His Arg Leu Arg Asp Arg Ser Thr Gln
                1285                1290                1295
Val Lys Tyr Ser Gly Thr Ser Leu Val Arg Val Ala Arg Tyr Thr Thr
            1300                1305                1310
Ile Ser Asn Asp Asn Leu Ser Phe Val Ile Ser Asp Lys Lys Val Asp
        1315                1320                1325
Thr Asn Phe Ile Tyr Gln Gln Gly Met Leu Leu Gly Leu Gly Val Leu
    1330                1335                1340
Glu Thr Leu Phe Arg Leu Glu Lys Asp Thr Gly Ser Ser Asn Thr Val
1345                1350                1355                1360
Leu His Leu His Val Glu Thr Asp Cys Cys Val Ile Pro Met Ile Asp
                1365                1370                1375
His Pro Arg Ile Pro Ser Ser Arg Lys Leu Glu Leu Arg Ala Glu Leu
            1380                1385                1390
Cys Thr Asn Pro Leu Ile Tyr Asp Asn Ala Pro Leu Ile Asp Arg Asp
        1395                1400                1405
Ala Thr Arg Leu Tyr Thr Gln Ser His Arg Arg His Leu Val Glu Phe
    1410                1415                1420
Val Thr Trp Ser Thr Pro Gln Leu Tyr His Ile Leu Ala Lys Ser Thr
1425                1430                1435                1440
Ala Leu Ser Met Ile Asp Leu Val Thr Lys Phe Glu Lys Asp His Met
                1445                1450                1455
Asn Glu Ile Ser Ala Leu Ile Gly Asp Asp Asp Ile Asn Ser Phe Ile
            1460                1465                1470
Thr Glu Phe Leu Leu Ile Glu Pro Arg Leu Phe Thr Ile Tyr Leu Gly
        1475                1480                1485
Gln Cys Ala Ala Ile Asn Trp Ala Phe Asp Val His Tyr His Arg Pro
    1490                1495                1500
Ser Gly Lys Tyr Gln Met Gly Glu Leu Leu Ser Ser Phe Leu Ser Arg
1505                1510                1515                1520
Met Ser Lys Gly Val Phe Lys Val Leu Val Asn Ala Leu Ser His Pro
                1525                1530                1535
Lys Ile Tyr Lys Lys Phe Trp His Cys Gly Ile Ile Glu Pro Ile His
            1540                1545                1550
Gly Pro Ser Leu Asp Ala Gln Asn Leu His Thr Thr Val Cys Asn Met
        1555                1560                1565
Val Tyr Thr Cys Tyr Met Thr Tyr Leu Asp Leu Leu Leu Asn Glu Glu
    1570                1575                1580
Leu Glu Glu Phe Thr Phe Leu Leu Cys Glu Ser Asp Glu Asp Val Val
1585                1590                1595                1600
Pro Asp Arg Phe Asp Asn Ile Gln Ala Lys His Leu Cys Val Leu Ala
                1605                1610                1615
Asp Leu Tyr Cys Gln Pro Gly Thr Cys Pro Pro Ile Arg Gly Leu Arg
            1620                1625                1630
Pro Val Glu Lys Cys Ala Val Leu Thr Asp His Ile Lys Ala Glu Ala
        1635                1640                1645
Arg Leu Ser Pro Ala Gly Ser Ser Trp Asn Ile Asn Pro Ile Ile Val
    1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
                1685                1690                1695
Glu Val Asn Val Ser Gln Pro Lys Ile Gly Ser Asn Asn Ile Ser Asn
            1700                1705                1710
Met Ser Ile Lys Asp Phe Arg Pro Pro His Asp Asp Val Ala Lys Leu
        1715                1720                1725
Leu Lys Asp Ile Asn Thr Ser Lys His Asn Leu Pro Ile Ser Gly Gly
    1730                1735                1740
Asn Leu Ala Asn Tyr Glu Ile His Ala Phe Arg Arg Ile Gly Leu Asn
1745                1750                1755                1760
Ser Ser Ala Cys Tyr Lys Ala Val Glu Ile Ser Thr Leu Ile Arg Arg
                1765                1770                1775
Cys Leu Glu Pro Gly Glu Asp Gly Leu Phe Leu Gly Glu Gly Ser Gly
            1780                1785                1790
Ser Met Leu Ile Thr Tyr Lys Glu Ile Leu Lys Leu Asn Lys Cys Phe
        1795                1800                1805
Tyr Asn Ser Gly Val Ser Ala Asn Ser Arg Ser Gly Gln Arg Glu Leu
    1810                1815                1820
Ala Pro Tyr Pro Ser Glu Val Gly Leu Val Glu His Arg Met Gly Val
1825                1830                1835                1840
Gly Asn Ile Val Lys Val Leu Phe Asn Gly Arg Pro Glu Val Thr Trp
                1845                1850                1855
Val Gly Ser Val Asp Cys Phe Asn Phe Ile Val Ser Asn Ile Pro Thr
            1860                1865                1870
Ser Ser Val Gly Phe Ile His Ser Asp Ile Glu Thr Leu Pro Asn Lys
        1875                1880                1885
Asp Thr Ile Glu Lys Leu Glu Glu Leu Ala Ala Ile Leu Ser Met Ala
    1890                1895                1900
Leu Leu Leu Gly Lys Ile Gly Ser Ile Leu Val Ile Lys Leu Met Pro
1905                1910                1915                1920
Phe Ser Gly Asp Phe Val Gln Gly Phe Ile Ser Tyr Val Gly Ser His
                1925                1930                1935
Tyr Arg Glu Val Asn Leu Val Tyr Pro Arg Tyr Ser Asn Phe Ile Ser
            1940                1945                1950
Thr Glu Ser Tyr Leu Val Met Thr Asp Leu Lys Ala Asn Arg Leu Met
        1955                1960                1965
Asn Pro Glu Lys Ile Lys Gln Gln Ile Ile Glu Ser Ser Val Arg Thr
    1970                1975                1980
Ser Pro Gly Leu Ile Gly His Ile Leu Ser Ile Lys Gln Leu Ser Cys
1985                1990                1995                2000
Ile Gln Ala Ile Val Gly Asp Ala Val Ser Arg Gly Asp Ile Asn Pro
                2005                2010                2015
Thr Leu Lys Lys Leu Thr Pro Ile Glu Gln Val Leu Ile Asn Cys Gly
            2020                2025                2030
Leu Ala Ile Asn Gly Pro Lys Leu Cys Lys Glu Leu Ile His His Asp
        2035                2040                2045
Val Ala Ser Gly Gln Asp Gly Leu Leu Asn Ser Ile Leu Ile Leu Tyr
    2050                2055                2060
Arg Glu Leu Ala Arg Phe Lys Asp Asn Gln Arg Ser Gln Gln Gly Met
2065                2070                2075                2080
Phe His Ala Tyr Pro Val Leu Val Ser Ser Arg Gln Arg Glu Leu Ile
                2085                2090                2095
Ser Arg Ile Thr Arg Lys Phe Trp Gly His Ile Leu Leu Tyr Ser Gly
            2100                2105                2110
Asn Arg Lys Leu Ile Asn Lys Phe Ile Gln Asn Leu Lys Ser Gly Tyr
        2115                2120                2125
Leu Ile Leu Asp Leu His Gln Asn Ile Phe Val Lys Asn Leu Ser Lys
    2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Val Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
                2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
            2180
(2) Information for SEQ ID NO:3:
(i) sequence characteristics:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: DNA (genomic)
(xi) sequence description: SEQ ID NO:3:
ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTA CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTC 120
TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCGGGA GATTCCTCAA 240
TTACCACTCG ATCTAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420
TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CCAATATTTT TCACATGATG ATCCAAGTAG TAGTGATCAA TCCAGGTTCG 540
GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATCCTA GCTCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840
GGAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAT CTTTACCAGC 1020
AAATGGGGGA AACTGCACCA TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080
GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTCGA TCCAGCATAT TTCAGACTAG 1200
GGCAAGAGAT GGTGAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260
GTATCACTGC CGAAGATGCA AGGCTTGTTT CAGAGATCGC AATGCATACT ACAGAGGACA 1320
GGATCAGTAG AGCGGTTGGA CCCAGACAAT CCCAAGTGTC ATTCCTACAC GGTGATCAAA 1380
ATGAAAATGA GCTACCGAGA TGGGGGGGTA AGGAAGATAT GAGGGTCAAA CAGAGTCGGG 1440
GAGAAGCCAG AGAGAGCTAC AGAGAAACCA GGCCCAGCAG AGCAAGTGAC GCGAGAGCTA 1500
CCCATCCTCC AACCGACACA CCCTTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560
CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTCAGGCT GCAAGCCATG GCAGGAATCT 1620
CGGAAGAACA AGGCTCAGAC ACGGACACCC CTAGAGTGTA CAATGACAGA GATCTTCTAG 1680
ACTAGGTGCA AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCACCA ACCATCCACT CCCACGATTG 1800
GGGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920
ATATCAGACA ACCCAGGACA GGAGCGAGCC GCCTGCAAGG AAGAGAAGGC AAGCAGTCCG 1980
GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040
CGCGGTCAGG GATCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCTCAGGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400
AGAGGCAACA ACTTTCCAAA GCTTAGGAAA ACTCTCAATG TTCCCCCGCC CCCGGACCCT 2460
GGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520
TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640
GCCGTACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA TTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCACCAA GCTAGAATCA 2820
CTGCTGTTAT TGAAGGGGGA AGTTGAGTCA ATCAAGAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCTTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCTGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060
ATCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120
CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCGGA CACCGGCCCT 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGA TGACTCTCCT TGATGACATC AAAGGAGCCA ACGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360
CCAGTCGACC TAGCTAATAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGCGACA GAAAAGATGA ATGTTTTATG TACATGTTTC 3600
TGCTGGGGGT TGTTGAGGAC AGCGATCTCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660
CTCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAACCCGA AGAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACTCTC CTCATACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTGGATACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCAGATAA CGGGTATTAC ACCGTTCCTA 3960
GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020
GGATTGACAA GGCGATTGGC CATGGGAAGA TCATCGACAA TGCAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA AAGTGAAGTC TACTCTGCCG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAGAA GACCCTATGT TACCCACTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380
AAGAATTCCG CATTTACGAC GACGTTATCA TAAATGATGA CCAAGGATTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATGCCCGAA GACGACCCTC CTCACAATGA CAGCCAGAAG 4500
GCCCGGAAAA AAAGGCCCCC TCCGAAAGAC TCCACAGACC AAATGAGAGG CCAGCCAGCA 4560
GCTGACGGCA AGCACGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA CATAAGGCCA 4620
CCACCAGCCA TCCCAATCTG CATCCTCCTC GTAGGACCCC CGAGGACCAA CCCCCAAGGT 4680
TGCCCCCCAC CCAAACCACC AACCGCATCC CTACCACCCC CGGGAAAGAA ACCCCCAGCA 4740
ACTGGAAGAG CCCTTCCCCT TTCCCTCAAC ACAAGAACTC CACAACCGAA CCACACAAGC 4800
GACCGAGGTG ACCCAACCGC AGGCACCCGA CTCCCTAGAC AGATCCTCTC CCCCTGGCAA 4860
ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920
CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980
CCCCGGTGCC CACAGGCAGG CACACCAACC CCCGAACAGA CCCAGCACCC AGCCATCGAC 5040
AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CGACAACCAA ACCAGAACCC AGACCACCCT 5160
GGGCCACCAG TTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCTGCGC 5220
ACCCCAGCCC CGATCCGGCG GGCAGCCACC CAACCCTAAC CAGCACCCAA GAGCGATCCC 5280
CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340
CTCTTCCTCT TCTCGAAGGG ACTAAAAGAT CAATCCACCA CATCCGACGA CACTCAACTC 5400
CCCGTCCCTA AAGGAGACAC CGGGAATCCC GGAATTAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGTTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCAG GAGTAGTCCT GGCAGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
CTGAACTCTC AAGCCATCGA CAATCTGAGG GCAAGTCTGG AAACTACTAA TCAGGCAATT 5940
GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCTAGCTTA 6120
CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT CGGAGGAGAT 6180
ATCAATAAGG TGTTAGAAAA GCTCGGATAT AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300
AGTATAGCCT ACCCGACGCT GTCCGAGATC AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360
GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACGACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TCCACCAAGT CCTGTGCTCG TACACTCGTA TCTGGGTCTT TTGGGAACCG GTTCATTTTG 6600
TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720
GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780
TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTGTCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACAGGA 7080
ACATCGAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGGAAC ACAAATGTCC 7140
CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCATCCA GCATCAAGCC CACCTGAAAT 7200
TATCTCCGGC TCCCCTTTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320
TCCCAAGGGA AGTAGGATAG TTATCAACAG AGAACACCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTGTTCG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CAATTGCAGG 7440
CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TTGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620
CATCTCTGAC AAGATTAAAT TCCTTAACCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTGGA CTTGTATTTA AGTCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACTATGACAT CCCAGGGAAT GTACGGGGGA ACTTACCTAG TGGAAAAGCC 7980
TAATCTGAGC AGCAAAGGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040
AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATTTTGA 8100
GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160
AGCCCTTTGT CACGGGGGAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220
CAGCTTTCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CCCCTTCTCA ACGGATGACC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTAT CCCGACAACA AGAACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAGC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGAGTCTTGT CTGTTGATCT 8520
GAGTCTAACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAGTCCAA CCACAACAAT GAGTATTGGC TGACTATCCC 8640
GCCAATGAAG AACCTAGCCC TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC AACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATCCT 8820
ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880
TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAAG GGGATCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATAGCAG 9120
ATAGGGCTGC CAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240
CGCTATCTGT CAACCAGATC TTATACCCCG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ACAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480
CTCATATTCC ATATCCAAAC TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GAAATTCGCT GTACTCTAAA GTCAGTAATA 9600
AGGTTTTCCA ATGCTTGAGG GACACTAATT CACGGCTTGG TCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAATGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840
TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900
TAACATTTGA GCTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CTGCTATGAC CATTGATGCT AGATATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AATTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080
TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACGGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCACGAG TTAGTTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260
TACACCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC CTCAGGTGAA GGATTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560
TTGCTGGAGT GAAATTTGGC TGCTTCATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGACCCC CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGATA TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800
TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTGGCTG TCTCAGGAGT CCCTAAAGAT CTCAAAGAAA 11040
GTCACAGAGG GGGGCCAGTC CTAAAAACCT ACTCCCGAAG CCCAGCCCAC ACAAATACCA 11100
GGAACGTGAG GGCAGCAAAA GGGTTTATAG GGTTCCCTCA GATAATTCGG CAGGACCAAG 11160
ACACTAATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACAACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280
TAAATGAGAT TTACGGATTA CCCTCATTTT TTCAGTGGCT GCATAAGAGG CTTGAGACCT 11340
CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400
GCAAAGTCCC CAATGACCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATTTATA CCTGGCTGCT TATGAGAGCG 11520
GAGTAAGGAT TGCTTCATTA GTGCAAGGGG ACAATCAGAC CATAGCTGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCTTAC AACCTTAAGA AATGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ACATTGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTTTC ATCACATTTT TTTGTTTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAG ATTCTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCAGGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060
TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC GCATCACTGA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180
CACAGCAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTAA 12300
TCCACAGTCC AAACCCAATG TTAAAGGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGGGACTGGC AGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTA GATACCACAA 12480
AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTAACA GGAAGAAAGA 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCC CTAAGAAGCC 12660
ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CAAGGCAAAG GGCTAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080
CGACTAATTT AGCACATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCCG AGTGGCAAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTCCTA GGGTTGGGCG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CAATGATAGA TCATCCCAGG ATACCCAGCT 13380
CTCGCAAGCT AGAGCTGAGG GCAGAGCTGT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTAG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTCT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTTATAGAG CCAAGATTAT 13680
TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCACTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920
CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTTACA TTTCTTTTGT GTGAAAGTGA CGAGGATGTA GTACCGGACA 14040
GATTCGACAA CATCCAGGCA AAACACTTGT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100
GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160
ATATCAAGGC GGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTTCGGCG AGGATCGATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCCCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ATACAAGCAA GCACAATCTT CCCATTTCTG 14460
GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580
ACGGCTTATT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TCTCTGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAGGT GCTCTTTAAC GGGAGGCCCG AAGTCACATG GGTAGGCAGT GTAGATTGCT 14820
TCAATTACAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880
AGACCTTACC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940
TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060
TATACCCCAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120
AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGC 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAA CTGTGCAAAG 15360
AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAGGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480
CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGAATC ACTCGCAAAT 15540
TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600
ATCTCAAGTC CGGTTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAGTCAA GGAGACCAAG GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780
ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCCAGG TGGTTAGGCA TTATTTGTAA 15840
TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
...
(A) LENGTH: 2183 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu
1               5                   10                  15
Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20                  25                  30
Arg Val Pro His Ala Tyr Ser Leu Glu Asp Pro Thr Leu Cys Gln Asn
35                  40                  45
Ile Lys His Arg Leu Lys Asn Gly Phe Ser Asn Gln Met Ile Ile Asn
50                  55                  60
Asn Val Glu Val Gly Asn Val Ile Lys Ser Lys Leu Arg Ser Tyr Pro
65                  70                  75                  80
Ala His Ser His Ile Pro Tyr Pro Asn Cys Asn Gln Asp Leu Phe Asn
85                  90                  95
Ile Glu Asp Lys Glu Ser Thr Arg Lys Ile Arg Glu Leu Leu Lys Lys
100                 105                 110
Gly Asn Ser Leu Tyr Ser Lys Val Ser Asn Lys Val Phe Gln Cys Leu
115                 120                 125
Arg Asp Thr Asn Ser Arg Leu Gly Leu Gly Ser Glu Leu Arg Glu Asp
130                 135                 140
Ile Lys Glu Lys Val Ile Asn Leu Gly Val Tyr Met His Ser Ser Gln
145                 150                 155                 160
Trp Phe Glu Pro Phe Leu Phe Trp Phe Thr Val Lys Thr Glu Met Arg
165                 170                 175
Ser Val Ile Lys Ser Gln Thr His Thr Cys His Arg Arg Arg His Thr
180                 185                 190
Pro Val Phe Phe Thr Gly Ser Ser Val Glu Leu Leu Ile Ser Arg Asp
195                 200                 205
Leu Val Ala Ile Ile Ser Lys Glu Ser Gln His Val Tyr Tyr Leu Thr
210                 215                 220
Phe Glu Leu Val Leu Met Tyr Cys Asp Val Ile Glu Gly Arg Leu Met
225                 230                 235                 240
Thr Glu Thr Ala Met Thr Ile Asp Ala Arg Tyr Thr Glu Leu Leu Gly
245                 250                 255
Arg Val Arg Tyr Met Trp Lys Leu Ile Asp Gly Phe Phe Pro Ala Leu
260                 265                 270
Gly Asn Pro Thr Tyr Gln Ile Val Ala Met Leu Glu Pro Leu Ser Leu
275                 280                 285
Ala Tyr Leu Gln Leu Arg Asp Ile Thr Val Glu Leu Arg Gly Ala Phe
290                 295                 300
Leu Asn His Cys Phe Thr Glu Ile His Asp Val Leu Asp Gln Asn Gly
305                 310                 315                 320
Phe Ser Asp Glu Gly Thr Tyr His Glu Leu Val Glu Ala Leu Asp Tyr
325                 330                 335
Ile Phe Ile Thr Asp Asp Ile His Leu Thr Gly Glu Ile Phe Ser Phe
340                 345                 350
Phe Arg Ser Phe Gly His Pro Arg Leu Glu Ala Val Thr Ala Ala Glu
355                 360                 365
Asn Val Arg Lys Tyr Met Asn Gln Pro Lys Val Ile Val Tyr Glu Thr
370                 375                 380
Leu Met Lys Gly His Ala Ile Phe Cys Gly Ile Ile Ile Asn Gly Tyr
385                 390                 395                 400
Arg Asp Arg His Gly Gly Ser Trp Pro Pro Leu Thr Leu Pro Leu His
405                 410                 415
Ala Ala Asp Thr Ile Arg Asn Ala Gln Ala Ser Gly Glu Gly Leu Thr
420                 425                 430
His Glu Gln Cys Val Asp Asn Trp Lys Ser Phe Ala Gly Val Lys Phe
435                 440                 445
Gly Cys Phe Met Pro Leu Ser Leu Asp Ser Asp Leu Thr Met Tyr Leu
450                 455                 460
Lys Asp Lys Ala Leu Ala Ala Leu Gln Arg Glu Trp Asp Ser Val Tyr
465                 470                 475                 480
Pro Lys Glu Phe Leu Arg Tyr Asp Pro Pro Lys Gly Thr Gly Ser Arg
485                 490                 495
Arg Leu Val Asp Val Phe Leu Asn Asp Ser Ser Phe Asp Pro Tyr Asp
500                 505                 510
Met Ile Met Tyr Val Val Ser Gly Ala Tyr Leu His Asp Pro Glu Phe
515                 520                 525
Asn Leu Ser Tyr Ser Leu Lys Glu Lys Glu Ile Lys Glu Thr Gly Arg
530                 535                 540
Leu Phe Ala Lys Met Thr Tyr Lys Met Arg Ala Cys Gln Val Ile Ala
545                 550                 555                 560
Glu Asn Leu Ile Ser Asn Gly Ile Gly Lys Tyr Phe Lys Asp Asn Gly
565                 570                 575
Met Ala Lys Asp Glu His Asp Leu Thr Lys Ala Leu His Thr Leu Ala
580                 585                 590
Val Ser Gly Val Pro Lys Asp Leu Lys Glu Ser His Arg Gly Gly Pro
595                 600                 605
Val Leu Lys Thr Tyr Ser Arg Ser Pro Ala His Thr Asn Thr Arg Asn
610                 615                 620
Val Arg Ala Ala Lys Gly Phe Ile Gly Phe Pro Gln Ile Ile Arg Gln
625                 630                 635                 640
Asp Gln Asp Thr Asn His Pro Glu Asn Met Glu Ala Tyr Glu Thr Val
645                 650                 655
Ser Ala Phe Ile Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
660                 665                 670
Tyr Glu Thr Ile Ser Leu Phe Ala Gln Arg Leu Asn Glu Ile Tyr Gly
675                 680                 685
Leu Pro Ser Phe Phe Gln Trp Leu His Lys Arg Leu Glu Thr Ser Val
690                 695                 700
Leu Tyr Val Ser Asp Pro His Cys Pro Pro Asp Leu Asp Ala His Ile
705                 710                 715                 720
Pro Leu Cys Lys Val Pro Asn Asp Gln Ile Phe Ile Lys Tyr Pro Met
725                 730                 735
Gly Gly Ile Glu Gly Tyr Cys Gln Lys Leu Trp Thr Ile Ser Thr Ile
740                 745                 750
Pro Tyr Leu Tyr Leu Ala Ala Tyr Glu Ser Gly Val Arg Ile Ala Ser
755                 760                 765
Leu Val Gln Gly Asp Asn Gln Thr Ile Ala Val Thr Lys Arg Val Pro
770                 775                 780
Ser Thr Trp Pro Tyr Asn Leu Lys Lys Trp Glu Ala Ala Arg Val Thr
785                 790                 795                 800
Arg Asp Tyr Phe Val Ile Leu Arg Gln Arg Leu His Asp Ile Gly His
805                 810                 815
His Leu Lys Ala Asn Glu Thr Ile Val Ser Ser His Phe Phe Val Tyr
820                 825                 830
Ser Lys Gly Ile Tyr Tyr Asp Gly Leu Leu Val Ser Gln Ser Leu Lys
835                 840                 845
Ser Ile Ala Arg Cys Val Phe Trp Ser Glu Thr Ile Val Asp Glu Thr
850                 855                 860
Arg Ala Ala Cys Ser Asn Ile Ala Thr Thr Met Ala Lys Ser Ile Glu
865                 870                 875                 880
Arg Gly Tyr Asp Arg Tyr Leu Ala Tyr Ser Leu Asn Val Leu Lys Val
885                 890                 895
Ile Gln Gln Ile Leu Ile Ser Leu Gly Phe Thr Ile Asn Ser Thr Met
900                 905                 910
Thr Gln Asp Val Val Ile Pro Leu Leu Thr Asn Asn Asp Leu Leu Ile
915                 920                 925
Arg Met Ala Leu Leu Pro Ala Pro Ile Gly Gly Met Asn Tyr Leu Asn
930                 935                 940
Met Ser Arg Leu Phe Val Arg Asn Ile Gly Asp Pro Val Thr Ser Ser
945                 950                 955                 960
Ile Ala Asp Leu Lys Arg Met Ile Leu Ala Ser Leu Met Pro Glu Glu
965                 970                 975
Thr Leu His Gln Val Met Thr Gln Gln Pro Gly Asp Ser Ser Phe Leu
980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Ala Asn Leu Val Cys Val Gln Ser
995                 1000                1005
Ile Thr Arg Leu Leu Lys Asn Ile Thr Ala Arg Phe Val Leu Ile His
1010                1015                1020
Ser Pro Asn Pro Met Leu Lys Gly Leu Phe His Asp Asp Ser Lys Glu
1025                1030                1035                1040
Glu Asp Glu Gly Leu Ala Ala Phe Leu Met Asp Arg His Ile Ile Val
1045                1050                1055
Pro Arg Ala Ala His Glu Ile Leu Asp His Ser Val Thr Gly Ala Arg
1060                1065                1070
Glu Ser Ile Ala Gly Met Leu Asp Thr Thr Lys Gly Leu Ile Arg Ala
1075                1080                1085
Ser Met Arg Lys Gly Gly Leu Thr Ser Arg Val Ile Thr Arg Leu Ser
1090                1095                1100
Asn Tyr Asp Tyr Glu Gln Phe Arg Ala Gly Met Val Leu Leu Thr Gly
1105                1110                1115                1120
Arg Lys Arg Asn Val Leu Ile Asp Lys Glu Ser Cys Ser Val Gln Leu
1125                1130                1135
Ala Arg Ala Leu Arg Ser His Met Trp Ala Arg Leu Ala Arg Gly Arg
1140                1145                1150
Pro Ile Tyr Gly Leu Glu Val Pro Asp Val Leu Glu Ser Met Arg Gly
1155                1160                1165
His Leu Ile Arg Arg His Glu Thr Cys Val Ile Cys Glu Cys Gly Ser
1170                1175                1180
Val Asn Tyr Gly Trp Phe Phe Val Pro Ser Gly Cys Gln Leu Asp Asp
1185                1190                1195                1200
Ile Asp Lys Glu Thr Ser Ser Leu Arg Val Pro Tyr Ile Gly Ser Thr
1205                1210                1215
Thr Asp Glu Arg Thr Asp Met Lys Leu Ala Phe Val Arg Ala Pro Ser
1220                1225                1230
Arg Ser Leu Arg Ser Ala Val Arg Ile Ala Thr Val Tyr Ser Trp Ala
1235                1240                1245
Tyr Gly Asp Asp Asp Ser Ser Trp Asn Glu Ala Trp Leu Leu Ala Arg
1250                1255                1260
Gln Arg Ala Asn Val Ser Leu Glu Glu Leu Arg Val Ile Thr Pro Ile
1265                1270                1275                1280
Ser Thr Ser Thr Asn Leu Ala His Arg Leu Arg Asp Arg Ser Thr Gln
1285                1290                1295
Val Lys Tyr Ser Gly Thr Ser Leu Val Arg Val Ala Arg Tyr Thr Thr
1300                1305                1310
Ile Ser Asn Asp Asn Leu Ser Phe Val Ile Ser Asp Lys Lys Val Asp
1315                1320                1325
Thr Asn Phe Ile Tyr Gln Gln Gly Met Leu Leu Gly Leu Gly Val Leu
1330                1335                1340
Glu Thr Leu Phe Arg Leu Glu Lys Asp Thr Gly Ser Ser Asn Thr Val
1345                1350                1355                1360
Leu His Leu His Val Glu Thr Asp Cys Cys Val Ile Pro Met Ile Asp
1365                1370                1375
His Pro Arg Ile Pro Ser Ser Arg Lys Leu Glu Leu Arg Ala Glu Leu
1380                1385                1390
Cys Thr Asn Pro Leu Ile Tyr Asp Asn Ala Pro Leu Ile Asp Arg Asp
1395                1400                1405
Ala Thr Arg Leu Tyr Thr Gln Ser His Arg Arg His Leu Val Glu Phe
1410                1415                1420
Val Thr Trp Ser Thr Pro Gln Leu Tyr His Ile Leu Ala Lys Ser Thr
1425                1430                1435                1440
Ala Leu Ser Met Ile Asp Leu Val Thr Lys Phe Glu Lys Asp His Met
1445                1450                1455
Asn Glu Ile Ser Ala Leu Ile Gly Asp Asp Asp Ile Asn Ser Phe Ile
1460                1465                1470
Thr Glu Phe Leu Leu Ile Glu Pro Arg Leu Phe Thr Ile Tyr Leu Gly
1475                1480                1485
Gln Cys Ala Ala Ile Asn Trp Ala Phe Asp Val His Tyr His Arg Pro
1490                1495                1500
Ser Gly Lys Tyr Gln Met Gly Glu Leu Leu Ser Ser Phe Leu Ser Arg
1505                1510                1515                1520
Met Ser Lys Gly Val Phe Lys Val Leu Val Asn Ala Leu Ser His Pro
1525                1530                1535
Lys Ile Tyr Lys Lys Phe Trp His Cys Gly Ile Ile Glu Pro Ile His
1540                1545                1550
Gly Pro Ser Leu Asp Ala Gln Asn Leu His Thr Thr Val Cys Asn Met
1555                1560                1565
Val Tyr Thr Cys Tyr Met Thr Tyr Leu Asp Leu Leu Leu Asn Glu Glu
1570                1575                1580
Leu Glu Glu Phe Thr Phe Leu Leu Cys Glu Ser Asp Glu Asp Val Val
1585                1590                1595                1600
Pro Asp Arg Phe Asp Asn Ile Gln Ala Lys His Leu Cys Val Leu Ala
1605                1610                1615
Asp Leu Tyr Cys Gln Pro Gly Thr Cys Pro Pro Ile Arg Gly Leu Arg
1620                1625                1630
Pro Val Glu Lys Cys Ala Val Leu Thr Asp His Ile Lys Ala Glu Ala
1635                1640                1645
Arg Leu Ser Pro Ala Gly Ser Ser Trp Asn Ile Asn Pro Ile Ile Val
1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
1685                1690                1695
Glu Val Asn Val Ser Gln Pro Lys Ile Gly Ser Asn Asn Ile Ser Asn
1700                1705                1710
Met Ser Ile Lys Asp Phe Arg Pro Pro His Asp Asp Val Ala Lys Leu
1715                1720                1725
Leu Lys Asp Ile Asn Thr Ser Lys His Asn Leu Pro Ile Ser Gly Gly
1730                1735                1740
Asn Leu Ala Asn Tyr Glu Ile His Ala Phe Arg Arg Ile Gly Leu Asn
1745                1750                1755                1760
Ser Ser Ala Cys Tyr Lys Ala Val Glu Ile Ser Thr Leu Ile Arg Arg
1765                1770                1775
Cys Leu Glu Pro Gly Glu Asp Gly Leu Phe Leu Gly Glu Gly Ser Gly
1780                1785                1790
Ser Met Leu Ile Thr Tyr Lys Glu Ile Leu Lys Leu Asn Lys Cys Phe
1795                1800                1805
Tyr Asn Ser Gly Val Ser Ala Asn Ser Arg Ser Gly Gln Arg Glu Leu
1810                1815                1820
Ala Pro Tyr Pro Ser Glu Val Gly Leu Val Glu His Arg Met Gly Val
1825                1830                1835                1840
Gly Asn Ile Val Lys Val Leu Phe Asn Gly Arg Pro Glu Val Thr Trp
1845                1850                1855
Val Gly Ser Val Asp Cys Phe Asn Tyr Ile Val Ser Asn Ile Pro Thr
1860                1865                1870
Ser Ser Val Gly Phe Ile His Ser Asp Ile Glu Thr Leu Pro Asn Lys
1875                1880                1885
Asp Thr Ile Glu Lys Leu Glu Glu Leu Ala Ala Ile Leu Ser Met Ala
1890                1895                1900
Leu Leu Leu Gly Lys Ile Gly Ser Ile Leu Val Ile Lys Leu Met Pro
1905                1910                1915                1920
Phe Ser Gly Asp Phe Val Gln Gly Phe Ile Ser Tyr Val Gly Ser His
1925                1930                1935
Tyr Arg Glu Val Asn Leu Val Tyr Pro Arg Tyr Ser Asn Phe Ile Ser
1940                1945                1950
Thr Glu Ser Tyr Leu Val Met Thr Asp Leu Lys Ala Asn Arg Leu Met
1955                1960                1965
Asn Pro Glu Lys Ile Lys Gln Gln Ile Ile Glu Ser Ser Val Arg Thr
1970                1975                1980
Ser Pro Gly Leu Ile Gly His Ile Leu Ser Ile Lys Gln Leu Ser Cys
1985                1990                1995                2000
Ile Gln Ala Ile Val Gly Asp Ala Val Ser Arg Gly Asp Ile Asn Pro
2005                2010                2015
Thr Leu Lys Lys Leu Thr Pro Ile Glu Gln Val Leu Ile Asn Cys Gly
2020                2025                2030
Leu Ala Ile Asn Gly Pro Lys Leu Cys Lys Glu Leu Ile His His Asp
2035                2040                2045
Val Ala Ser Gly Gln Asp Gly Leu Leu Asn Ser Ile Leu Ile Leu Tyr
2050                2055                2060
Arg Glu Leu Ala Arg Phe Lys Asp Asn Gln Arg Ser Gln Gln Gly Met
2065                2070                2075                2080
Phe His Ala Tyr Pro Val Leu Val Ser Ser Arg Gln Arg Glu Leu Ile
2085                2090                2095
Ser Arg Ile Thr Arg Lys Phe Trp Gly His Ile Leu Leu Tyr Ser Gly
2100                2105                2110
Asn Arg Lys Leu Ile Asn Lys Phe Ile Gln Asn Leu Lys Ser Gly Tyr
2115                2120                2125
Leu Ile Leu Asp Leu His Gln Asn Ile Phe Val Lys Asn Leu Ser Lys
2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Val Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
2180
(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15894 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTA CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120
TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240
TTACCACTCG ATCCAGACTA CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTGTTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATCAGGCTG TTAGAGGTTG 420
TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAAGTAG TAGTGATCAA TCCAGGTCCG 540
GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATTCTA GCCCAAATTT GGGTCTTGCT CGCGAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TAGTTGGTGA ATTCAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGGACACCCG 840
GGAACAAACC AAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCCTA ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAT CTTTACCAGC 1020
AAATGGGAGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080
GTGCAGGATC ATACCCCCTG CTCTGGAGCT ATGCCATGGG AGTAGGGGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGTCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200
GGCAAGAGAT GGTGAGGAGG TCAGCTGGGA AAGTCAGTTC CACATTAGCA TCTGAACTCG 1260
GTATCACTGC TGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCACACT ACTGAGGACA 1320
GGACCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTGTC ATTTCTACAC GGTGATCAAA 1380
GTGAGAATGA GCTACCAGGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGGG 1440
GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGTCTAGCAG AGCAAGCGAT GCGAGAGCTG 1500
CCCATCTTCC AACCAGCGCA CCCCTAGACA TTGACACTGC ATCGGAGTCA GGCCAAGATC 1560
CGCAGGACAG TCGACGGTCA GCTGACGCCC TGCTCAGGCT GCAAGCCATG GCAGGAATCT 1620
TGGAAGAACA AGGCTCAGAC ACGGACACCC CTAGGGTGTA CAATGACAGA GATCTTCTAG 1680
ACTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCAACCA ACCATCCACT CCTACGACTG 1800
GGGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC GTCGAGGAAG CCATGGCAGC ATGGTCACAA 1920
ATATCAGACA ACCCAGGACA GGACCGAACC ACCCGCAAGG AAGAGGAGGC AGGCAGTTCG 1980
GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCAGTGC ACCTCGCATC 2040
TGCGGTCAGG GATCTGGAGA GAGCGATGAC AACGCTGAAA CTTTGGGAAT CCCCTCAAGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATCATG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTCT CAGGAGGAGA CGATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCT 2400
AGAGGCAACA ACTTCCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGAACCCC 2460
GGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGGA CAGACGCGAG ATTAGCCTCA 2520
TTTGGAGCGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CATCAGGGCC AGGTGCACCT GTGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640
GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA TTATTATGAT GATGAGCTGT TCTCCGATGT CCAAGACATC 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820
CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAAAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAACTCAATC CCGACCTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060
CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAA 3120
CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCC 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ACGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCTCATG 3360
CCAATCGACC TAATTAGTAC AGCCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTCGAT CGCTCCGATA CAACCTACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTACG TACATGTTTC 3600
TGCTGGGGGT TGTTGAGGAC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660
CCCTGCCCTT AGGTGTTGGT AGATCCACAG CAAAACCCGA AGAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTCGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTGGATACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960
GAAGAATGCT AGAATTCAGA TCGGTCAATG CAGTGGCTTT CAACCTGCTG GTGACCCTTA 4020
GGATTGACAA AGCGATTGGC CCTGGGAAGA TCATCGATAA TGCAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCTG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAAAA GACCTTATGT TACCCACTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCCC 4380
AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATACCCGAA AACGACCCCC CTCATAATGA CAGCCAGAAG 4500
GCCCGGACAA AAAAGCCCCC TCCAAAAGAC TCCACGGACC AAGTGAGAGG CCAGCCAGCA 4560
GCTGACGGCA AGCGTGAACA CCAGGCGGCC TGGGCACAGA ACAGCCCCGA CACAAGGCAA 4620
CCACCAGCCA TCCCAATCTG CGTCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGT 4680
CGCCCCCGAC CCAGACCACC AACCGCATCC CCACAGCCCC CGGGAAAGAG ACCCCCAGCA 4740
ACTGGAAGGC CCCTCCCCCT TTCCCTCAAC GCAAGAACTC CACAACCGAA CCGCACAAGC 4800
GATCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC CCCCCGGCAA 4860
ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCGAC AGAACCCAGA CCCCGGCCCA 4920
CGGCGCCGCG CCCCCACCTC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980
CCCCGGTGCC CACAGGCAGG CACACCAACC CTCGAACAGA CCCAGCACCC AGCCATCGAC 5040
AATTCAAGAC GGGGGGCCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CAGGAACCGA ACCAGAATCC AGACCACCCT 5160
GGGCCACCAG TTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220
ACCCCTGCCC TGATCCGGTG GGCGGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280
CGAAGGGCCC CCGAACCGCA AAAGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340
CTCCCCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAATTC 5400
CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGGAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCTG GAGTTGTCCT GGCGGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
TTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940
GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTAGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CACTATTTGG CCCCAGCTTA 6120
CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAT 6180
ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTACTC 6300
AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAAGGG 6360
GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TCCACCAAGT CCTGTGCTCG TACACTTGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600
TCACAAGGGA ATCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAGGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720
GTGGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGCGGTATCC GGACGCTGTG 6780
TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTTTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACAGGA 7080
ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140
CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCATCCA GCATCGAGCC CACCTGAAAT 7200
TGTCTCCGGA TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ACAACCCCCA 7320
TCCTAGGGGA AGTAGGATAG TTATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTATTCG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440
CATAAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
GATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACCGACC TAGTGAAATT 7620
CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAACTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGG CCAGGGTAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTGGA CTTGTATTTA AATCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACTATGACAT CCCAGGGAAT GTACGGGGGA ACTTACCTAG TGGAAAAGCC 7980
TAATCTGAGC AGTAAAGGGT CAGAGTTGTC ACAACTGAGC ATGCACCGAG TGTTTGAAGT 8040
AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATTTTGA 8100
GCAACCAGTC AGTAATGATT TCAGCAACTG CATGGTGGCT TTGGGGGAGC TCAAATTCGC 8160
AGCCCTTTGT CACAGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220
CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CCCCCTATCA ACGGATGATC CAGTGATAGA CAGGCTCTAC CTCTCATCTC ACAGAGGCGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGGACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAGC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTAATCT 8520
GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCAGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640
GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC TACCTCTTCA CTGTTCCAAT TAAGGAAGCA GGCGAGGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820
ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTATGAT ACTTCCAGAG TTGAACATGC 8880
TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAGG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGATATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACTCGG GAAGATGGAA CCAACCGCAG 9120
ATAGGGCTGC CAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240
CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ATAAGATAGT AGCTATCCTG GAGTATGCTC GAGTCCCTCA CGCATACAGC CTGGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGACCCACT 9480
CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CAAGGAAGAT CCGTGAGCTC CTCAAAAAGG GAAATTCGCT GTACTCCAAA GTCAGTGATA 9600
AGGTTTTCCA ATGCCTGAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAATT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAATGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CAGTATTCTT CACTGGTAGT TCAGTTGAGT 9840
TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900
TGACGTTTGA ACTGGTCTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CCGCTATGAC CATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTACCAAATT GTAGCCATGC 10080
TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCATGAG TTAATTGAAG CCCTAGATTA CATTTTCATA ACTGATGACA 10260
TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAAGGTCAT GCCATATTCT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCCCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560
TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGACCCT CCCAAAGGAA CTGGGTCACG GAGGCTTGTA AATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGACA TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800
TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AATTATTTTA AGGACAATGG GATGGCCAAG GACGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040
GTCACAGGGG GGGGCCAGTC TTAAAAACCC ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100
AGAACGTGAG AGCAGCAAAA GGGTTTATAG GATTCCCTCA TGTAATTCGG CAGGACCAAG 11160
ACACTGATCA TCCGGAGAAT ATGGAGGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTATTT GCACAAAGGC 11280
TAAATGAGAT TTACGGATTA CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAAACCT 11340
CTGTCCTCTA TGTAAGTGAC CCTCATTGCC CCCCTGACCT TGACGCCCAT GTCCCGTTAT 11400
GCAAAGTCCC CAATGACCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATTTATA CCTGGCTGCT TATGAGAGCG 11520
GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCTTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ACATAGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTCTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATCCTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGATCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATCGGGGGG ATGAATTATC 12060
TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC TCATCACTAA TGCCTGAAGA GACCCTTCAT CAAGTAATGA 12180
CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGCGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300
TCCATAGTCC AAACCCAATG TTAAAAGGGT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTA GATACCACAA 12480
AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTTAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCTAGAGCC CTAAGAAGCC 12660
ATATGTGGGC AAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GCCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAT GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CAAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080
CGACTAATTT AGCGCATAGG TTGAGGGATC GTACCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCCG AGTGGCAAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAGGG AATGCTTCTA GGGTTGGGTG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380
CCCGCAAGCT AGAGCTTAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTTATAGAG CCAAGATTAT 13680
TCACTATCTA CTTGGGCCAG TGTGCAGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCTTC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTACACA 13920
CAACTGTGTG CAACATGATT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTCACA TTTCTTCTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040
GATTCGACAA TATCCAGGCA AAACACTTGT GTGTTCTAGC AGATTTGTAC TGTCAACCAG 14100
GGACCTGCCC ACCAATTCGA GGTCTACGAC CTGTAGAGAA ATGTGCAGTT CTAACCGATC 14160
ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGGTCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTTG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCTCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460
GGGGTAATCT CGCCAATTAT GAAATCCACG CTTTCCGCAG AATCGGGTTA AACTCATCCG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580
ACGGCTTGTT CTTGGGTGAG GGGTCGGGTT CTATGTTGAT CACTTATAAG GAGATACTAA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGTC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820
TCAATTTCAT AGTCAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880
AGACCTTACC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTAGCAGCC ATCTTATCGA 14940
TGGCTCTGCT CCTTGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTTA TTATAGAGAA GTGAACCTTG 15060
TCTACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTAGTCATG ACAGATCTCA 15120
AAGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGC 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGACGCAGTT AGTAGAGGTG GTATCAACCC TATTCTGAAG AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAA CTGTGCAAAG 15360
AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAACTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCATG 15480
CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540
TTTGGGGGCA TATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATCGG TTTATCCAGA 15600
ATCTCAAGTC CGGTTACCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CTAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTAAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAATCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780
ATTAATTGGT TGGACTCCGG GACCCTAATC CTGCCCTAGG TAGTTAGGCA TTATTTGCAA 15840
TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) Information for SEQ ID NO:6:
...
(i) sequence characteristics:
(A) length: 2183 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:6:
Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu
1               5                   10                  15
Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
            20                  25                  30
Arg Val Pro His Ala Tyr Ser Leu Glu Asp Pro Thr Leu Cys Gln Asn
        35                  40                  45
Ile Lys His Arg Leu Lys Asn Gly Phe Ser Asn Gln Met Ile Ile Asn
    50                  55                  60
Asn Val Glu Val Gly Asn Val Ile Lys Ser Lys Leu Arg Ser Tyr Pro
65                  70                  75                  80
Thr His Ser His Ile Pro Tyr Pro Asn Cys Asn Gln Asp Leu Phe Asn
                85                  90                  95
Ile Glu Asp Lys Glu Ser Thr Arg Lys Ile Arg Glu Leu Leu Lys Lys
            100                 105                 110
Gly Asn Ser Leu Tyr Ser Lys Val Ser Asp Lys Val Phe Gln Cys Leu
        115                 120                 125
Arg Asp Thr Asn Ser Arg Leu Gly Leu Gly Ser Glu Leu Arg Glu Asp
    130                 135                 140
Ile Lys Glu Lys Ile Ile Asn Leu Gly Val Tyr Met His Ser Ser Gln
145                 150                 155                 160
Trp Phe Glu Pro Phe Leu Phe Trp Phe Thr Val Lys Thr Glu Met Arg
                165                 170                 175
Ser Val Ile Lys Ser Gln Thr His Thr Cys His Arg Arg Arg His Thr
            180                 185                 190
Pro Val Phe Phe Thr Gly Ser Ser Val Glu Leu Leu Ile Ser Arg Asp
        195                 200                 205
Leu Val Ala Ile Ile Ser Lys Glu Ser Gln His Val Tyr Tyr Leu Thr
    210                 215                 220
Phe Glu Leu Val Leu Met Tyr Cys Asp Val Ile Glu Gly Arg Leu Met
225                 230                 235                 240
Thr Glu Thr Ala Met Thr Ile Asp Ala Arg Tyr Thr Glu Leu Leu Gly
                245                 250                 255
Arg Val Arg Tyr Met Trp Lys Leu Ile Asp Gly Phe Phe Pro Ala Leu
            260                 265                 270
Gly Asn Pro Thr Tyr Gln Ile Val Ala Met Leu Glu Pro Leu Ser Leu
        275                 280                 285
Ala Tyr Leu Gln Leu Arg Asp Ile Thr Val Glu Leu Arg Gly Ala Phe
    290                 295                 300
Leu Asn His Cys Phe Thr Glu Ile His Asp Val Leu Asp Gln Asn Gly
305                 310                 315                 320
Phe Ser Asp Glu Gly Thr Tyr His Glu Leu Ile Glu Ala Leu Asp Tyr
                325                 330                 335
Ile Phe Ile Thr Asp Asp Ile His Leu Thr Gly Glu Ile Phe Ser Phe
            340                 345                 350
Phe Arg Ser Phe Gly His Pro Arg Leu Glu Ala Val Thr Ala Ala Glu
        355                 360                 365
Asn Val Arg Lys Tyr Met Asn Gln Pro Lys Val Ile Val Tyr Glu Thr
    370                 375                 380
Leu Met Lys Gly His Ala Ile Phe Cys Gly Ile Ile Ile Asn Gly Tyr
385                 390                 395                 400
Arg Asp Arg His Gly Gly Ser Trp Pro Pro Leu Thr Leu Pro Leu His
                405                 410                 415
Ala Ala Asp Thr Ile Arg Asn Ala Gln Ala Ser Gly Glu Gly Leu Thr
            420                 425                 430
His Glu Gln Cys Val Asp Asn Trp Lys Ser Phe Ala Gly Val Lys Phe
        435                 440                 445
Gly Cys Phe Met Pro Leu Ser Leu Asp Ser Asp Leu Thr Met Tyr Leu
    450                 455                 460
Lys Asp Lys Ala Leu Ala Ala Leu Gln Arg Glu Trp Asp Ser Val Tyr
465                 470                 475                 480
Pro Lys Glu Phe Leu Arg Tyr Asp Pro Pro Lys Gly Thr Gly Ser Arg
                485                 490                 495
Arg Leu Val Asn Val Phe Leu Asn Asp Ser Ser Phe Asp Pro Tyr Asp
            500                 505                 510
Met Ile Met Tyr Val Val Ser Gly Ala Tyr Leu His Asp Pro Glu Phe
        515                 520                 525
Asn Leu Ser Tyr Ser Leu Lys Glu Lys Glu Ile Lys Glu Thr Gly Arg
    530                 535                 540
Leu Phe Ala Lys Met Thr Tyr Lys Met Arg Ala Cys Gln Val Ile Ala
545                 550                 555                 560
Glu Asn Leu Ile Ser Asn Gly Ile Gly Asn Tyr Phe Lys Asp Asn Gly
                565                 570                 575
Met Ala Lys Asp Glu His Asp Leu Thr Lys Ala Leu His Thr Leu Ala
            580                 585                 590
Val Ser Gly Val Pro Lys Asp Leu Lys Glu Ser His Arg Gly Gly Pro
        595                 600                 605
Val Leu Lys Thr His Ser Arg Ser Pro Val His Thr Ser Thr Lys Asn
    610                 615                 620
Val Arg Ala Ala Lys Gly Phe Ile Gly Phe Pro His Val Ile Arg Gln
625                 630                 635                 640
Asp Gln Asp Thr Asp His Pro Glu Asn Met Glu Ala Tyr Glu Thr Val
                645                 650                 655
Ser Ala Phe Ile Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
            660                 665                 670
Tyr Glu Thr Ile Ser Leu Phe Ala Gln Arg Leu Asn Glu Ile Tyr Gly
        675                 680                 685
Leu Pro Ser Phe Phe Gln Trp Leu His Lys Arg Leu Glu Thr Ser Val
    690                 695                 700
Leu Tyr Val Ser Asp Pro His Cys Pro Pro Asp Leu Asp Ala His Val
705                 710                 715                 720
Pro Leu Cys Lys Val Pro Asn Asp Gln Ile Phe Ile Lys Tyr Pro Met
                725                 730                 735
Gly Gly Ile Glu Gly Tyr Cys Gln Lys Leu Trp Thr Ile Ser Thr Ile
            740                 745                 750
Pro Tyr Leu Tyr Leu Ala Ala Tyr Glu Ser Gly Val Arg Ile Ala Ser
        755                 760                 765
Leu Val Gln Gly Asp Asn Gln Thr Ile Ala Val Thr Lys Arg Val Pro
    770                 775                 780
Ser Thr Trp Pro Tyr Asn Leu Lys Lys Arg Glu Ala Ala Arg Val Thr
785                 790                 795                 800
Arg Asp Tyr Phe Val Ile Leu Arg Gln Arg Leu His Asp Ile Gly His
                805                 810                 815
His Leu Lys Ala Asn Glu Thr Ile Val Ser Ser His Phe Phe Val Tyr
            820                 825                 830
Ser Lys Gly Ile Tyr Tyr Asp Gly Leu Leu Val Ser Gln Ser Leu Lys
        835                 840                 845
Ser Ile Ala Arg Cys Val Phe Trp Ser Glu Thr Ile Val Asp Glu Thr
    850                 855                 860
Arg Ala Ala Cys Ser Asn Ile Ala Thr Thr Met Ala Lys Ser Ile Glu
865                 870                 875                 880
Arg Gly Tyr Asp Arg Tyr Leu Ala Tyr Ser Leu Asn Val Leu Lys Val
                885                 890                 895
Ile Gln Gln Ile Leu Ile Ser Leu Gly Phe Thr Ile Asn Ser Thr Met
            900                 905                 910
Thr Arg Asp Val Val Ile Pro Leu Leu Thr Asn Asn Asp Leu Leu Ile
        915                 920                 925
Arg Met Ala Leu Leu Pro Ala Pro Ile Gly Gly Met Asn Tyr Leu Asn
    930                 935                 940
Met Ser Arg Leu Phe Val Arg Asn Ile Gly Asp Pro Val Thr Ser Ser
945                 950                 955                 960
Ile Ala Asp Leu Lys Arg Met Ile Leu Ser Ser Leu Met Pro Glu Glu
                965                 970                 975
Thr Leu His Gln Val Met Thr Gln Gln Pro Gly Asp Ser Ser Phe Leu
            980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Ala Asn Leu Val Cys Val Gln Ser
        995                 1000                1005
Ile Thr Arg Leu Leu Lys Asn Ile Thr Ala Arg Phe Val Leu Ile His
    1010                1015                1020
Ser Pro Asn Pro Met Leu Lys Gly Leu Phe His Asp Asp Ser Lys Glu
1025                1030                1035                1040
Glu Asp Glu Gly Leu Ala Ala Phe Leu Met Asp Arg His Ile Ile Val
                1045                1050                1055
Pro Arg Ala Ala His Glu Ile Leu Asp His Ser Val Thr Gly Ala Arg
            1060                1065                1070
Glu Ser Ile Ala Gly Met Leu Asp Thr Thr Lys Gly Leu Ile Arg Ala
        1075                1080                1085
Ser Met Arg Lys Gly Gly Leu Thr Ser Arg Val Ile Thr Arg Leu Ser
    1090                1095                1100
Asn Tyr Asp Tyr Glu Gln Phe Arg Ala Gly Met Val Leu Leu Thr Gly
1105                1110                1115                1120
Arg Lys Arg Asn Val Leu Ile Asp Lys Glu Ser Cys Ser Val Gln Leu
                1125                1130                1135
Ala Arg Ala Leu Arg Ser His Met Trp Ala Arg Leu Ala Arg Gly Arg
            1140                1145                1150
Pro Ile Tyr Gly Leu Glu Val Pro Asp Val Leu Glu Ser Met Arg Gly
        1155                1160                1165
His Leu Ile Arg Arg His Glu Thr Cys Val Ile Cys Glu Cys Gly Ser
    1170                1175                1180
Val Asn Tyr Gly Trp Phe Phe Val Pro Ser Gly Cys Gln Leu Asp Asp
1185                1190                1195                1200
Ile Asp Lys Glu Thr Ser Ser Leu Arg Val Pro Tyr Ile Gly Ser Thr
                1205                1210                1215
Thr Asp Glu Arg Thr Asp Met Lys Leu Ala Phe Val Arg Ala Pro Ser
            1220                1225                1230
Arg Ser Leu Arg Ser Ala Val Arg Ile Ala Thr Val Tyr Ser Trp Ala
        1235                1240                1245
Tyr Gly Asp Asp Asp Ser Ser Trp Asn Glu Ala Trp Leu Leu Ala Arg
    1250                1255                1260
Gln Arg Ala Asn Val Ser Leu Glu Glu Leu Arg Val Ile Thr Pro Ile
1265                1270                1275                1280
Ser Thr Ser Thr Asn Leu Ala His Arg Leu Arg Asp Arg Thr Thr Gln
                1285                1290                1295
Val Lys Tyr Ser Gly Thr Ser Leu Val Arg Val Ala Arg Tyr Thr Thr
            1300                1305                1310
Ile Ser Asn Asp Asn Leu Ser Phe Val Ile Ser Asp Lys Lys Val Asp
        1315                1320                1325
Thr Asn Phe Ile Tyr Gln Gln Gly Met Leu Leu Gly Leu Gly Val Leu
    1330                1335                1340
Glu Thr Leu Phe Arg Leu Glu Lys Asp Thr Gly Ser Ser Asn Thr Val
1345                1350                1355                1360
Leu His Leu His Val Glu Thr Asp Cys Cys Val Ile Pro Met Ile Asp
                1365                1370                1375
His Pro Arg Ile Pro Ser Ser Arg Lys Leu Glu Leu Arg Ala Glu Leu
            1380                1385                1390
Cys Thr Asn Pro Leu Ile Tyr Asp Asn Ala Pro Leu Ile Asp Arg Asp
        1395                1400                1405
Ala Thr Arg Leu Tyr Thr Gln Ser His Arg Arg His Leu Val Glu Phe
    1410                1415                1420
Val Thr Trp Ser Thr Pro Gln Leu Tyr His Ile Leu Ala Lys Ser Thr
1425                1430                1435                1440
Ala Leu Ser Met Ile Asp Leu Val Thr Lys Phe Glu Lys Asp His Met
                1445                1450                1455
Asn Glu Ile Ser Ala Leu Ile Gly Asp Asp Asp Ile Asn Ser Phe Ile
            1460                1465                1470
Thr Glu Phe Leu Leu Ile Glu Pro Arg Leu Phe Thr Ile Tyr Leu Gly
        1475                1480                1485
Gln Cys Ala Ala Ile Asn Trp Ala Phe Asp Val His Tyr His Arg Pro
    1490                1495                1500
Ser Gly Lys Tyr Gln Met Gly Glu Leu Leu Ser Ser Phe Leu Ser Arg
1505                1510                1515                1520
Met Ser Lys Gly Val Phe Lys Val Leu Val Asn Ala Leu Ser His Pro
                1525                1530                1535
Lys Ile Tyr Lys Lys Phe Trp His Cys Gly Ile Ile Glu Pro Ile His
            1540                1545                1550
Gly Pro Ser Leu Asp Ala Gln Asn Leu His Thr Thr Val Cys Asn Met
        1555                1560                1565
Ile Tyr Thr Cys Tyr Met Thr Tyr Leu Asp Leu Leu Leu Asn Glu Glu
    1570                1575                1580
Leu Glu Glu Phe Thr Phe Leu Leu Cys Glu Ser Asp Glu Asp Val Val
1585                1590                1595                1600
Pro Asp Arg Phe Asp Asn Ile Gln Ala Lys His Leu Cys Val Leu Ala
                1605                1610                1615
Asp Leu Tyr Cys Gln Pro Gly Thr Cys Pro Pro Ile Arg Gly Leu Arg
            1620                1625                1630
Pro Val Glu Lys Cys Ala Val Leu Thr Asp His Ile Lys Ala Glu Ala
        1635                1640                1645
Arg Leu Ser Pro Ala Gly Ser Ser Trp Asn Ile Asn Pro Ile Ile Val
    1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
                1685                1690                1695
Glu Val Asn Val Ser Gln Pro Lys Ile Gly Ser Asn Asn Ile Ser Asn
            1700                1705                1710
Met Ser Ile Lys Asp Phe Arg Pro Pro His Asp Asp Val Ala Lys Leu
        1715                1720                1725
Leu Lys Asp Ile Asn Thr Ser Lys His Asn Leu Pro Ile Ser Gly Gly
    1730                1735                1740
Asn Leu Ala Asn Tyr Glu Ile His Ala Phe Arg Arg Ile Gly Leu Asn
1745                1750                1755                1760
Ser Ser Ala Cys Tyr Lys Ala Val Glu Ile Ser Thr Leu Ile Arg Arg
                1765                1770                1775
Cys Leu Glu Pro Gly Glu Asp Gly Leu Phe Leu Gly Glu Gly Ser Gly
            1780                1785                1790
Ser Met Leu Ile Thr Tyr Lys Glu Ile Leu Lys Leu Asn Lys Cys Phe
        1795                1800                1805
Tyr Asn Ser Gly Val Ser Ala Asn Ser Arg Ser Gly Gln Arg Glu Leu
    1810                1815                1820
Ala Pro Tyr Pro Ser Glu Val Gly Leu Val Glu His Arg Met Gly Val
1825                1830                1835                1840
Gly Asn Ile Val Lys Val Leu Phe Asn Gly Arg Pro Glu Val Thr Trp
                1845                1850                1855
Val Gly Ser Val Asp Cys Phe Asn Phe Ile Val Ser Asn Ile Pro Thr
            1860                1865                1870
Ser Ser Val Gly Phe Ile His Ser Asp Ile Glu Thr Leu Pro Asn Lys
        1875                1880                1885
Asp Thr Ile Glu Lys Leu Glu Glu Leu Ala Ala Ile Leu Ser Met Ala
    1890                1895                1900
Leu Leu Leu Gly Lys Ile Gly Ser Ile Leu Val Ile Lys Leu Met Pro
1905                1910                1915                1920
Phe Ser Gly Asp Phe Val Gln Gly Phe Ile Ser Tyr Val Gly Ser Tyr
                1925                1930                1935
Tyr Arg Glu Val Asn Leu Val Tyr Pro Arg Tyr Ser Asn Phe Ile Ser
            1940                1945                1950
Thr Glu Ser Tyr Leu Val Met Thr Asp Leu Lys Ala Asn Arg Leu Met
        1955                1960                1965
Asn Pro Glu Lys Ile Lys Gln Gln Ile Ile Glu Ser Ser Val Arg Thr
    1970                1975                1980
Ser Pro Gly Leu Ile Gly His Ile Leu Ser Ile Lys Gln Leu Ser Cys
1985                1990                1995                2000
Ile Gln Ala Ile Val Gly Asp Ala Val Ser Arg Gly Gly Ile Asn Pro
                2005                2010                2015
Ile Leu Lys Lys Leu Thr Pro Ile Glu Gln Val Leu Ile Asn Cys Gly
            2020                2025                2030
Leu Ala Ile Asn Gly Pro Lys Leu Cys Lys Glu Leu Ile His His Asp
        2035                2040                2045
Val Ala Ser Gly Gln Asp Gly Leu Leu Asn Ser Ile Leu Ile Leu Tyr
    2050                2055                2060
Arg Glu Leu Ala Arg Phe Lys Asp Asn Gln Arg Ser Gln Gln Gly Met
2065                2070                2075                2080
Phe His Ala Tyr Pro Val Leu Val Ser Ser Arg Gln Arg Glu Leu Ile
                2085                2090                2095
Ser Arg Ile Thr Arg Lys Phe Trp Gly His Ile Leu Leu Tyr Ser Gly
            2100                2105                2110
Asn Arg Lys Leu Ile Asn Arg Phe Ile Gln Asn Leu Lys Ser Gly Tyr
        2115                2120                2125
Leu Ile Leu Asp Leu His Gln Asn Ile Phe Val Lys Asn Leu Ser Lys
    2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Ile Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
                2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
            2180
(2) Information for SEQ ID NO:7:
(i) sequence characteristics:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genome)
(xi) sequence description: SEQ ID NO:7:
ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTA CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120
TGAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATTCCTGGA GATTCCTCAA 240
TTACCACTCG ATCCAGACTA CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATCAGGCTG TTAGAGGTTG 420
TTCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAAGCAG TAGTGATCAA TCCAGGTCCG 540
GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGATCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATTCTA GCCCAGATCT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC TTTACGCCGA TTCATGGTGG CTCTAATCCT GGATATCAAG AGGACACCCG 840
GGAACAAACC TAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCTTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAT CTTTACCAGC 1020
AAATGGGAGA AACTGCACCC TACATGGTAA TCCTAGAGAA CTCAATTCAG AACAAGTTCA 1080
GCGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGTCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200
GGCAAGAGAT GGTGAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCCGAACTCG 1260
GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320
GGATCAGTAG AGCGGTCGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380
GTGAGAATGA GCTACCAGGA TTGGGGGGCA AGGAAGACAG GAGGGTCAAA CAGAGTCGGG 1440
GAGAAGCCAG GGAGAGCTAC AGAGAAACCG AGTCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500
CCCATCCTCC AACCAGCATG CCCCTAGACA TTGACACTGC ATCGGAGTCA GGCCAAGATC 1560
CGCAGGACAG TCGAAGGTCA GCTGACGCTC TGCTCAGGCT GCAAGCCATG GCAGGAATCT 1620
TGGAAGAACA AGGCTCAGAC ACGGACACCC CTAGGGTATA CAATGACAGA GATCTTCTAG 1680
ATTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCAACCA ACCATCCACT CCCACGACTG 1800
GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC GTCGAGGAAG CCATGGCAGC ATGGTCAGAA 1920
ATATCAGACA ATCCAGGACA GGACCGAGCC GCCTGCAAGG AAGAGGAGGC AGGCAGTTCG 1980
GGTCTCAGCA AACCATGCTT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040
CGCGGTCAGG GATCTGGAGA AAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCTCAAGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATCATG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTCT CAGGAGGAGA CGATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAA ACTCCAATCC 2400
AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGAACCCC 2460
AGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGGA CAGACGCGAG ATTGGCCTCA 2520
TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CGTCAGGGCC AGATGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640
GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCCGATGT CCAAGACATC 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820
TTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TTGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAACTCAATC CCGACCTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AGCCCGTTGC CAGCCGACAA 3060
CTCCAGGGAA TGACTAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAA 3120
CTAAAGCCGA TCGGGAAAAA GGTGAGCTCA GCCGTCGGGT TTGTCCCTGA CACCGGCCCT 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGT TGACTCTCCT TGATGATATC AAAGGAGCCA ACGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360
CCAGTCGACC TAATTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCTAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTCGAT CGCTCCGATA CAACCTACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGTGATA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600
TGCTGGGGGT TGTTGAGGAC AGAGATCCCC TAGGGCCTCC AATCGGGCGA GCATTCGGGT 3660
CCCTGCCCTT AGGTGTTGGT AGATCCACAG CAAAACCCGA GGAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACCCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAATGCAAA CCAAGTGTGC AATGCGGTTA ATCTAATACC GCTGGACACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCCA 3960
GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTA GTGACCCTCA 4020
GGATTGACAA GGCGATTGGC CCTGGGAAGA TCATCGACAA TGCAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAGAA GACCTTATGT TACCCACTGA TGGATATCAA TGAAGACCTT AATCGGTTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380
AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATACCCGAA AACGACCCCC CTCATAATGA CAGCCAGAAG 4500
GCCCGGACAA AAAAGCCCCC TCCAAAAGAC TTCACGGACC AAGCGAGAGG CCAGCCAGCA 4560
GCCGACAGCA AGTGTGGACA CCAGGCGGCC CAAGCACAGA ACAGCCCCGA CACAAGGCCA 4620
CCACCAGCCA TCCCAATCCG CGTCCTCCTC GTAGGACCCC CGAGGACCAA CCCCCAAGGT 4680
CGCTCCGGAC ACAGACCACC AGCCGCATCC CCACAGCCCT CGGGAAAGGA ACCCCCAGCA 4740
ACTGGAAGGC CCCTTCCCCC CTCCCCCAAC GCAAGAACCC CACAACCGAA CCGCACAAGC 4800
GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGACCCTCCC TCCCCGGCAT 4860
ACTAAACAAA ACTTAGGGCC AAGGAACACA CACACCCGAC AGAACCCAGA CCCCGGCCCG 4920
CGGCACCGCG CCCCCACCCC CCGAAAACCA GAGGGAGCCC CCAACCAATC CCGCCGCCCC 4980
CCCCGGTGCC CACAGGTAGG CACACCAACC CCCGAACAGA CCCAGCACCC AGCCACCGAC 5040
AATCCAAGAC GGGGGGCCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCATCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAGCCC AGACCACCCT 5160
GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGAAAAAA GGAAAGGCCA CAACCCGCGC 5220
ACCCCAGGCC CGATCCGGCG GGAAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280
TGGGGGACCC CCAAACCGCA AAAGACATCA GTATCCCACC GCCTCTCCAA GTCCCCCGGT 5340
CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CATCCGACGA CACTCAATTC 5400
CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAATGTCTT TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGGAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTGG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAAATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTTGTCCT GGCAGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
CTGAACTCTC AAGCCATCGA CAATCTGAGA GCAAGCCTGG AAACTACTAA TCAGGCAATT 5940
GAGGCAATCA GGCAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTAGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120
CGGGACCCCA TATCTGCGGA GATATCCATC CAGGCTTTGA GCTATGCGCT TGGGGGAGAT 6180
ATCAATAAGG TATTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300
AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360
GTCTCGTACA ATATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TCCACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600
TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTCT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720
GTGGTCGAGG TGAACGGTGT GACCATCCAA GTCGGGAGCA GGAGGTATCC GGACGCGGTG 6780
TACCTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAAGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGCTGGAG GATGCCAAGG AATTGCTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTTTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GGGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACAGGG 7080
ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCCCTACAA CTCTTGAAAC ACAGATTTCC 7140
CACAAGTCTC CTCTCCGTCA TCAAGCAACC ACCGCATCCA GCATCAAGGC CACCCGAAAT 7200
TGTCTCCGGC TTCCCTCTGG CCGAACGATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC ACCGAGACCG AATAAATGCC TTCTACAAAG ACAACCCCCA 7320
TCCTAAGGGA AGTAGGATAG TTATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTATTCG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440
CATTAGACTC CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAGAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
GATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620
CATCTCTGAC AAAATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAACTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGG CCAGGGCAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTGGA CTTGTATTTA AGTCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACCATGACAT CCCAGGGAAT GTACGGGGGA ACTTACCTAG TGGGAAAGCC 7980
TAATCTGAGC AGTAAAGGGT CAGAGTTGTC ACAACTGAGC ATGCACCGAG TGTTTGAAGT 8040
AGGGGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATTTTGA 8100
GCAACCAGTC AGTAATGATT TCAGCAACTG CATGGTGGCT TTGGGGGAGC TCAGGTTCGC 8160
AGCCCTCTGT CACAGGGAAG ATTCTGTCAC GGTTCCCTAT CAGGGGTCAG GGAAAGGTGT 8220
CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CCCCCTATCA ACGGATGATC CAGTGATAGA TAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGGACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAGC AGGCGTGTAA GGGTAAAAAC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTAATCT 8520
GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCAGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAAACCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640
GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC AACCTCTTCA CTGTTCCAAT CAAGGAAGCA GGCGAGGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTAATTCT 8820
ACCTGGTCAG GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880
TGTGGTTTAT TATGTTTACA GCCCAGGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAAG GGGGTCCCAA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA TTCAGAATCT GGTGGACATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACTCGG GAAGATGGAA CCAATCGCAG 9120
ATAGGGCTGC CAGTGAACCG ATCACATGAT GTCACTCAGA CACCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTTCCC GTCATGGACT 9240
CGCTATCTGT CAACCAGATC TTGTACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ATAAGATAGT AGCTATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTTGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTCTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480
CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CAAGGAAGAT CCGTGAGCTC CTAAAAAAGG GAAATTCGCT GTACTCCAAA GTCAGTGATA 9600
AGGTTTTCCA ATGCCTGAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAATT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAATGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGC 9840
TGTTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAGGA GTCTCAACAT GTATATTACC 9900
TGACGTTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CCGCTATGAC CATTGATGCT AGGTATGCAG AACTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCTATGC 10080
TGGAGCCACT TTCACTTGCT TACCTGCAAC TGAGGGACAT AACAGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCATGAG TTAATTGAAG CCTTAGATTA CATTTTCATA ACTGATGACA 10260
TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTCAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAGGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCCCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAGATCAT 10560
TTGCTGGAGT GAGATTTGGC TGTTTTATGC CTCTTAGCCT GGACAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGATCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGATA TGATAATGTA TGTCGTAAGT GGAGCCTACC 10800
TCCATGACCC TGAGTTCAAT CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT CGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATC GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AAGTATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTGGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040
GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100
GGAACGTTAA AGCAGAAAAA GGGTTTGTAG GATTCCCTCA TGTAATTCGG CAGAATCAAG 11160
ACACTGATCA TCCGGAGAAT ATAGAAACCT ACGAGACAGT CAGCGCATTT ATCACGACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTATTT GCACAGAGGC 11280
TAAATGAGAT TTACGGATTA CCCTCATTTT TTCAGTGGCT GCATAAGAGG CTTGAAACCT 11340
CTGTCCTCTA TGTAAGTGAT CCTCATTGCC CCCCCGACCT TGACGCCCAT GTCCCGTTAT 11400
GCAAAGTCCC CAATGACCAA ATCTTCATCA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTACTTATA CCTGGCTGCT TATGAGAGCG 11520
GGGTAAGGAT TGCCTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCTTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ACATTGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATTGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA TCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTTTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGAGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGATCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060
TGAACATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC GCATCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180
CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGCGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTAA 12300
TCCATAGTCC AAACCCAATG TTAAAAGGGT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGAGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTA GATACCACAA 12480
AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTTAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCTAGAGCC CTAAGAAGCC 12660
ATATGTGGGC AAGACTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTA GATCCTTGCG ATCTGCCGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CAAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCGACTT 13080
CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCAG AGTGGCAAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA AGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACTGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380
CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTCT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTTATAGAG CCAAGATTAT 13680
TCACCATCTA CTTGGGCCAG TGTGCAGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCTTC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920
CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTTGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTCACA TTTCTTTTGT GTGAAAGCGA TGAGGATGTA GTACCGGACA 14040
GATTCGACAA CATCCAGGCA AAACACTTGT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100
GGACCTGCCC ACCGATTCGA GGTCTAAGGC CGGTAGAGAA ATGTGCAGTT CTAACCGATC 14160
ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGTCG AGGATCTATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTTG ATGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGGTCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCTCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460
GGGGTAGTCT TGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTA AACTCATCTG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580
ACGGCTTGTT CTTGGGTGAG GGGTCGGGTT CTATGTTGAT CACTTATAAG GAGATACTAA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAGGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT ATAGATTGCT 14820
TCAATTTCAT AGTCAGTAAT ATCCCTACCT CTAGTGTGGG ATTTATCCAT TCAGATATAG 14880
AGACCTTACC CAACAAAGAT ACTATAGAGA AGTTAGAGGA ATTGGCAGCC ATCTTATCGA 14940
TGGCTCTACT CCTTGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGCTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060
TCTACCCTAG GTACAGCAAC TTCATATCTA CTGAATCTTA TTTAGTTATG ACAGATCTCA 15120
AAGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGC 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCTATCAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGGCGCAGTT AGTAGAGGTG ATATCAACCC TATTCTGAAA AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AGTTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360
AATTAATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAACTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480
CTTACCCCGT ATTGGTAAGT AGTAGGCAAC GAGAACTTGT ATCTAGGATC ACTCGCAAAT 15540
TTTGGGGGCA TATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATCGG TTTATCCAGA 15600
ATCTCAAGTC CGGTTATCTA ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTAAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGCGCT CTGATTAAGG 15780
ATTAATTGGT TGAACTCCGG AACCCTAATC CTACCCTAGG TAGTTAGGCA TTATTTGCAA 15840
TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) Information for SEQ ID NO: 8:
...
(i) sequence characteristics:
(A) length: 2183 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:8:
Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu
1               5                   10                  15
Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
            20                  25                  30
Arg Val Pro His Ala Tyr Ser Leu Glu Asp Pro Thr Leu Cys Gln Asn
        35                  40                  45
Ile Lys His Arg Leu Lys Asn Gly Phe Ser Asn Gln Met Ile Ile Asn
    50                  55                  60
Asn Val Glu Val Gly Asn Val Ile Lys Ser Lys Leu Arg Ser Tyr Pro
65                  70                  75                  80
Ala His Ser His Ile Pro Tyr Pro Asn Cys Asn Gln Asp Leu Phe Asn
                85                  90                  95
Ile Glu Asp Lys Glu Ser Thr Arg Lys Ile Arg Glu Leu Leu Lys Lys
            100                 105                 110
Gly Asn Ser Leu Tyr Ser Lys Val Ser Asp Lys Val Phe Gln Cys Leu
        115                 120                 125
Arg Asp Thr Asn Ser Arg Leu Gly Leu Gly Ser Glu Leu Arg Glu Asp
    130                 135                 140
Ile Lys Glu Lys Ile Ile Asn Leu Gly Val Tyr Met His Ser Ser Gln
145                 150                 155                 160
Trp Phe Glu Pro Phe Leu Phe Trp Phe Thr Val Lys Thr Glu Met Arg
                165                 170                 175
Ser Val Ile Lys Ser Gln Thr His Thr Cys His Arg Arg Arg His Thr
            180                 185                 190
Pro Val Phe Phe Thr Gly Ser Ser Val Glu Leu Leu Ile Ser Arg Asp
        195                 200                 205
Leu Val Ala Ile Ile Ser Lys Glu Ser Gln His Val Tyr Tyr Leu Thr
    210                 215                 220
Phe Glu Leu Val Leu Met Tyr Cys Asp Val Ile Glu Gly Arg Leu Met
225                 230                 235                 240
Thr Glu Thr Ala Met Thr Ile Asp Ala Arg Tyr Ala Glu Leu Leu Gly
                245                 250                 255
Arg Val Arg Tyr Met Trp Lys Leu Ile Asp Gly Phe Phe Pro Ala Leu
            260                 265                 270
Gly Asn Pro Thr Tyr Gln Ile Val Ala Met Leu Glu Pro Leu Ser Leu
        275                 280                 285
Ala Tyr Leu Gln Leu Arg Asp Ile Thr Val Glu Leu Arg Gly Ala Phe
    290                 295                 300
Leu Asn His Cys Phe Thr Glu Ile His Asp Val Leu Asp Gln Asn Gly
305                 310                 315                 320
Phe Ser Asp Glu Gly Thr Tyr His Glu Leu Ile Glu Ala Leu Asp Tyr
                325                 330                 335
Ile Phe Ile Thr Asp Asp Ile His Leu Thr Gly Glu Ile Phe Ser Phe
            340                 345                 350
Phe Arg Ser Phe Gly His Pro Arg Leu Glu Ala Val Thr Ala Ala Glu
        355                 360                 365
Asn Val Arg Lys Tyr Met Asn Gln Pro Lys Val Ile Val Tyr Glu Thr
    370                 375                 380
Leu Met Lys Gly His Ala Ile Phe Cys Gly Ile Ile Ile Asn Gly Tyr
385                 390                 395                 400
Arg Asp Arg His Gly Gly Ser Trp Pro Pro Leu Thr Leu Pro Leu His
                405                 410                 415
Ala Ala Asp Thr Ile Arg Asn Ala Gln Ala Ser Gly Glu Gly Leu Thr
            420                 425                 430
His Glu Gln Cys Val Asp Asn Trp Arg Ser Phe Ala Gly Val Arg Phe
        435                 440                 445
Gly Cys Phe Met Pro Leu Ser Leu Asp Ser Asp Leu Thr Met Tyr Leu
    450                 455                 460
Lys Asp Lys Ala Leu Ala Ala Leu Gln Arg Glu Trp Asp Ser Val Tyr
465                 470                 475                 480
Pro Lys Glu Phe Leu Arg Tyr Asp Pro Pro Lys Gly Thr Gly Ser Arg
                485                 490                 495
Arg Leu Val Asp Val Phe Leu Asn Asp Ser Ser Phe Asp Pro Tyr Asp
            500                 505                 510
Met Ile Met Tyr Val Val Ser Gly Ala Tyr Leu His Asp Pro Glu Phe
        515                 520                 525
Asn Leu Ser Tyr Ser Leu Lys Glu Lys Glu Ile Lys Glu Thr Gly Arg
    530                 535                 540
Leu Phe Ala Lys Met Thr Tyr Lys Met Arg Ala Cys Gln Val Ile Ala
545                 550                 555                 560
Glu Asn Leu Ile Ser Asn Gly Ile Gly Lys Tyr Phe Lys Asp Asn Gly
                565                 570                 575
Met Ala Lys Asp Glu His Asp Leu Thr Lys Ala Leu His Thr Leu Ala
            580                 585                 590
Val Ser Gly Val Pro Lys Asp Leu Lys Glu Ser His Arg Gly Gly Pro
        595                 600                 605
Val Leu Lys Thr Tyr Ser Arg Ser Pro Val His Thr Ser Thr Arg Asn
    610                 615                 620
Val Lys Ala Glu Lys Gly Phe Val Gly Phe Pro His Val Ile Arg Gln
625                 630                 635                 640
Asn Gln Asp Thr Asp His Pro Glu Asn Ile Glu Thr Tyr Glu Thr Val
                645                 650                 655
Ser Ala Phe Ile Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
            660                 665                 670
Tyr Glu Thr Ile Ser Leu Phe Ala Gln Arg Leu Asn Glu Ile Tyr Gly
        675                 680                 685
Leu Pro Ser Phe Phe Gln Trp Leu His Lys Arg Leu Glu Thr Ser Val
    690                 695                 700
Leu Tyr Val Ser Asp Pro His Cys Pro Pro Asp Leu Asp Ala His Val
705                 710                 715                 720
Pro Leu Cys Lys Val Pro Asn Asp Gln Ile Phe Ile Lys Tyr Pro Met
                725                 730                 735
Gly Gly Ile Glu Gly Tyr Cys Gln Lys Leu Trp Thr Ile Ser Thr Ile
            740                 745                 750
Pro Tyr Leu Tyr Leu Ala Ala Tyr Glu Ser Gly Val Arg Ile Ala Ser
        755                 760                 765
Leu Val Gln Gly Asp Asn Gln Thr Ile Ala Val Thr Lys Arg Val Pro
    770                 775                 780
Ser Thr Trp Pro Tyr Asn Leu Lys Lys Arg Glu Ala Ala Arg Val Thr
785                 790                 795                 800
Arg Asp Tyr Phe Val Ile Leu Arg Gln Arg Leu His Asp Ile Gly His
                805                 810                 815
His Leu Lys Ala Asn Glu Thr Ile Val Ser Ser His Phe Phe Val Tyr
            820                 825                 830
Ser Lys Gly Ile Tyr Tyr Asp Gly Leu Leu Val Ser Gln Ser Leu Lys
        835                 840                 845
Ser Ile Ala Arg Cys Val Phe Trp Ser Glu Thr Ile Val Asp Glu Thr
    850                 855                 860
Arg Ala Ala Cys Ser Asn Ile Ala Thr Thr Met Ala Lys Ser Ile Glu
865                 870                 875                 880
Arg Gly Tyr Asp Arg Tyr Leu Ala Tyr Ser Leu Asn Val Leu Lys Val
                885                 890                 895
Ile Gln Gln Ile Leu Ile Ser Leu Gly Phe Thr Ile Asn Ser Thr Met
            900                 905                 910
Thr Arg Asp Val Val Ile Pro Leu Leu Thr Asn Asn Asp Leu Leu Ile
        915                 920                 925
Arg Met Ala Leu Leu Pro Ala Pro Ile Gly Gly Met Asn Tyr Leu Asn
    930                 935                 940
Met Ser Arg Leu Phe Val Arg Asn Ile Gly Asp Pro Val Thr Ser Ser
945                 950                 955                 960
Ile Ala Asp Leu Lys Arg Met Ile Leu Ala Ser Leu Met Pro Glu Glu
                965                 970                 975
Thr Leu His Gln Val Met Thr Gln Gln Pro Gly Asp Ser Ser Phe Leu
            980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Ala Asn Leu Val Cys Val Gln Ser
        995                 1000                1005
Ile Thr Arg Leu Leu Lys Asn Ile Thr Ala Arg Phe Val Leu Ile His
    1010                1015                1020
Ser Pro Asn Pro Met Leu Lys Gly Leu Phe His Asp Asp Ser Lys Glu
1025                1030                1035                1040
Glu Asp Glu Arg Leu Ala Ala Phe Leu Met Asp Arg His Ile Ile Val
                1045                1050                1055
Pro Arg Ala Ala His Glu Ile Leu Asp His Ser Val Thr Gly Ala Arg
            1060                1065                1070
Glu Ser Ile Ala Gly Met Leu Asp Thr Thr Lys Gly Leu Ile Arg Ala
        1075                1080                1085
Ser Met Arg Lys Gly Gly Leu Thr Ser Arg Val Ile Thr Arg Leu Ser
    1090                1095                1100
Asn Tyr Asp Tyr Glu Gln Phe Arg Ala Gly Met Val Leu Leu Thr Gly
1105                1110                1115                1120
Arg Lys Arg Asn Val Leu Ile Asp Lys Glu Ser Cys Ser Val Gln Leu
                1125                1130                1135
Ala Arg Ala Leu Arg Ser His Met Trp Ala Arg Leu Ala Arg Gly Arg
            1140                1145                1150
Pro Ile Tyr Gly Leu Glu Val Pro Asp Val Leu Glu Ser Met Arg Gly
        1155                1160                1165
His Leu Ile Arg Arg His Glu Thr Cys Val Ile Cys Glu Cys Gly Ser
    1170                1175                1180
Val Asn Tyr Gly Trp Phe Phe Val Pro Ser Gly Cys Gln Leu Asp Asp
1185                1190                1195                1200
Ile Asp Lys Glu Thr Ser Ser Leu Arg Val Pro Tyr Ile Gly Ser Thr
                1205                1210                1215
Thr Asp Glu Arg Thr Asp Met Lys Leu Ala Phe Val Arg Ala Pro Ser
            1220                1225                1230
Arg Ser Leu Arg Ser Ala Val Arg Ile Ala Thr Val Tyr Ser Trp Ala
        1235                1240                1245
Tyr Gly Asp Asp Asp Ser Ser Trp Asn Glu Ala Trp Leu Leu Ala Arg
    1250                1255                1260
Gln Arg Ala Asn Val Ser Leu Glu Glu Leu Arg Val Ile Thr Pro Ile
1265                1270                1275                1280
Ser Thr Ser Thr Asn Leu Ala His Arg Leu Arg Asp Arg Ser Thr Gln
                1285                1290                1295
Val Lys Tyr Ser Gly Thr Ser Leu Val Arg Val Ala Arg Tyr Thr Thr
            1300                1305                1310
Ile Ser Asn Asp Asn Leu Ser Phe Val Ile Ser Asp Lys Lys Val Asp
        1315                1320                1325
Thr Asn Phe Ile Tyr Gln Gln Gly Met Leu Leu Gly Leu Gly Val Leu
    1330                1335                1340
Glu Thr Leu Phe Arg Leu Glu Lys Asp Thr Gly Ser Ser Asn Thr Val
1345                1350                1355                1360
Leu His Leu His Val Glu Thr Asp Cys Cys Val Ile Pro Met Ile Asp
                1365                1370                1375
His Pro Arg Ile Pro Ser Ser Arg Lys Leu Glu Leu Arg Ala Glu Leu
            1380                1385                1390
Cys Thr Asn Pro Leu Ile Tyr Asp Asn Ala Pro Leu Ile Asp Arg Asp
        1395                1400                1405
Ala Thr Arg Leu Tyr Thr Gln Ser His Arg Arg His Leu Val Glu Phe
    1410                1415                1420
Val Thr Trp Ser Thr Pro Gln Leu Tyr His Ile Leu Ala Lys Ser Thr
1425                1430                1435                1440
Ala Leu Ser Met Ile Asp Leu Val Thr Lys Phe Glu Lys Asp His Met
                1445                1450                1455
Asn Glu Ile Ser Ala Leu Ile Gly Asp Asp Asp Ile Asn Ser Phe Ile
            1460                1465                1470
Thr Glu Phe Leu Leu Ile Glu Pro Arg Leu Phe Thr Ile Tyr Leu Gly
        1475                1480                1485
Gln Cys Ala Ala Ile Asn Trp Ala Phe Asp Val His Tyr His Arg Pro
    1490                1495                1500
Ser Gly Lys Tyr Gln Met Gly Glu Leu Leu Ser Ser Phe Leu Ser Arg
1505                1510                1515                1520
Met Ser Lys Gly Val Phe Lys Val Leu Val Asn Ala Leu Ser His Pro
                1525                1530                1535
Lys Ile Tyr Lys Lys Phe Trp His Cys Gly Ile Ile Glu Pro Ile His
            1540                1545                1550
Gly Pro Ser Leu Asp Ala Gln Asn Leu His Thr Thr Val Cys Asn Met
        1555                1560                1565
Val Tyr Thr Cys Tyr Met Thr Tyr Leu Asp Leu Leu Leu Asn Glu Glu
    1570                1575                1580
Leu Glu Glu Phe Thr Phe Leu Leu Cys Glu Ser Asp Glu Asp Val Val
1585                1590                1595                1600
Pro Asp Arg Phe Asp Asn Ile Gln Ala Lys His Leu Cys Val Leu Ala
                1605                1610                1615
Asp Leu Tyr Cys Gln Pro Gly Thr Cys Pro Pro Ile Arg Gly Leu Arg
            1620                1625                1630
Pro Val Glu Lys Cys Ala Val Leu Thr Asp His Ile Lys Ala Glu Ala
        1635                1640                1645
Arg Leu Ser Pro Ala Gly Ser Ser Trp Asn Ile Asn Pro Ile Ile Val
    1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
                1685                1690                1695
Glu Val Asn Val Ser Gln Pro Lys Val Gly Ser Asn Asn Ile Ser Asn
            1700                1705                1710
Met Ser Ile Lys Asp Phe Arg Pro Pro His Asp Asp Val Ala Lys Leu
        1715                1720                1725
Leu Lys Asp Ile Asn Thr Ser Lys His Asn Leu Pro Ile Ser Gly Gly
    1730                1735                1740
Ser Leu Ala Asn Tyr Glu Ile His Ala Phe Arg Arg Ile Gly Leu Asn
1745                1750                1755                1760
Ser Ser Ala Cys Tyr Lys Ala Val Glu Ile Ser Thr Leu Ile Arg Arg
                1765                1770                1775
Cys Leu Glu Pro Gly Glu Asp Gly Leu Phe Leu Gly Glu Gly Ser Gly
            1780                1785                1790
Ser Met Leu Ile Thr Tyr Lys Glu Ile Leu Lys Leu Asn Lys Cys Phe
        1795                1800                1805
Tyr Asn Ser Gly Val Ser Ala Asn Ser Arg Ser Gly Gln Arg Glu Leu
    1810                1815                1820
Ala Pro Tyr Pro Ser Glu Val Gly Leu Val Glu His Arg Met Gly Val
1825                1830                1835                1840
Gly Asn Ile Val Lys Val Leu Phe Asn Gly Arg Pro Glu Val Thr Trp
                1845                1850                1855
Val Gly Ser Ile Asp Cys Phe Asn Phe Ile Val Ser Asn Ile Pro Thr
            1860                1865                1870
Ser Ser Val Gly Phe Ile His Ser Asp Ile Glu Thr Leu Pro Asn Lys
        1875                1880                1885
Asp Thr Ile Glu Lys Leu Glu Glu Leu Ala Ala Ile Leu Ser Met Ala
    1890                1895                1900
Leu Leu Leu Gly Lys Ile Gly Ser Ile Leu Val Ile Lys Leu Met Pro
1905                1910                1915                1920
Phe Ser Gly Asp Phe Val Gln Gly Phe Ile Ser Tyr Val Gly Ser His
                1925                1930                1935
Tyr Arg Glu Val Asn Leu Val Tyr Pro Arg Tyr Ser Asn Phe Ile Ser
            1940                1945                1950
Thr Glu Ser Tyr Leu Val Met Thr Asp Leu Lys Ala Asn Arg Leu Met
        1955                1960                1965
Asn Pro Glu Lys Ile Lys Gln Gln Ile Ile Glu Ser Ser Val Arg Thr
    1970                1975                1980
Ser Pro Gly Leu Ile Gly His Ile Leu Ser Ile Lys Gln Leu Ser Cys
1985                1990                1995                2000
Ile Gln Ala Ile Val Gly Gly Ala Val Ser Arg Gly Asp Ile Asn Pro
                2005                2010                2015
Ile Leu Lys Lys Leu Thr Pro Ile Glu Gln Val Leu Ile Ser Cys Gly
            2020                2025                2030
Leu Ala Ile Asn Gly Pro Lys Leu Cys Lys Glu Leu Ile His His Asp
        2035                2040                2045
Val Ala Ser Gly Gln Asp Gly Leu Leu Asn Ser Ile Leu Ile Leu Tyr
    2050                2055                2060
Arg Glu Leu Ala Arg Phe Lys Asp Asn Gln Arg Ser Gln Gln Gly Met
2065                2070                2075                2080
Phe His Ala Tyr Pro Val Leu Val Ser Ser Arg Gln Arg Glu Leu Val
                2085                2090                2095
Ser Arg Ile Thr Arg Lys Phe Trp Gly His Ile Leu Leu Tyr Ser Gly
            2100                2105                2110
Asn Arg Lys Leu Ile Asn Arg Phe Ile Gln Asn Leu Lys Ser Gly Tyr
        2115                2120                2125
Leu Ile Leu Asp Leu His Gln Asn Ile Phe Val Lys Asn Leu Ser Lys
    2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Val Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
                2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
            2180
(2) Information for SEQ ID NO: 9:
(i) sequence characteristics:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:9:
ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TCTTCTAGTG CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120
TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240
TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420
TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540
GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TGGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840
GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020
AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080
GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200
GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260
GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320
AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380
GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440
GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500
CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC AACGGAGTCC AGCCAAGATC 1560
CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620
CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680
ACTAGGTGCG AGAGGCCGAG GGCCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800
GAGCCAATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920
ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980
GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040
CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTACG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400
AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460
GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520
TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640
GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820
CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060
CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120
CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360
CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCCAAG TTCCACAATG ACAGAGACCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600
TGCTGGGGGT TGTTGAGGAC AGCGATTCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660
CCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960
GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020
GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380
AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500
GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560
GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCCGA CACAAGGCCA 4620
CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680
TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740
ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800
GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860
ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920
CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980
CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040
AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160
GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220
ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280
CGAAGGACCC CCGAACCGCA AAGGACACCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340
CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400
CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940
GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGTTTA 6120
CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180
ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCGGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300
AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360
GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TACACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600
TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720
GTAGTCGAGG TGAACGGCGT GATCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780
TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080
ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140
CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200
TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320
TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440
CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620
CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980
TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040
AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100
GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160
AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220
CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520
GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640
GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC TACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820
ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880
TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120
ATAGGGCTGC TAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240
CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480
CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600
AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840
TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900
TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080
TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260
TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560
TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800
TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040
GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100
GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160
ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280
TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340
CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400
ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520
GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060
TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180
CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300
TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480
AAGGCTTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGC 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660
ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080
CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380
CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680
TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920
CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040
GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100
GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160
ATATCAAGGC AGAGGCTATG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460
GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580
ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820
TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880
AGACCTTGCC TGACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940
TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060
TATACCCTAG ATACAGCAAC TTCATCTCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120
AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360
AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480
CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540
TCTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600
ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780
ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840
TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
...
(A) LENGTH: 2183 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu
1               5                   10                  15
Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20                  25                  30
Arg Val Pro His Ala Tyr Ser Leu Glu Asp Pro Thr Leu Cys Gln Asn
35                  40                  45
Ile Lys His Arg Leu Lys Asn Gly Phe Ser Asn Gln Met Ile Ile Asn
50                  55                  60
Asn Val Glu Val Gly Asn Val Ile Lys Ser Lys Leu Arg Ser Tyr Pro
65                  70                  75                  80
Ala His Ser His Ile Pro Tyr Pro Asn Cys Asn Gln Asp Leu Phe Asn
85                  90                  95
Ile Glu Asp Lys Glu Ser Thr Arg Lys Ile Arg Glu Leu Leu Lys Lys
100                 105                 110
Gly Asn Ser Leu Tyr Ser Lys Val Ser Asp Lys Val Phe Gln Cys Leu
115                 120                 125
Arg Asp Thr Asn Ser Arg Leu Gly Leu Gly Ser Glu Leu Arg Glu Asp
130                 135                 140
Ile Lys Glu Lys Val Ile Asn Leu Gly Val Tyr Met His Ser Ser Gln
145                 150                 155                 160
Trp Phe Glu Pro Phe Leu Phe Trp Phe Thr Val Lys Thr Glu Met Arg
165                 170                 175
Ser Val Ile Lys Ser Gln Thr His Thr Cys His Arg Arg Arg His Thr
180                 185                 190
Pro Val Phe Phe Thr Gly Ser Ser Val Glu Leu Leu Ile Ser Arg Asp
195                 200                 205
Leu Val Ala Ile Ile Ser Lys Glu Ser Gln His Val Tyr Tyr Leu Thr
210                 215                 220
Phe Glu Leu Val Leu Met Tyr Cys Asp Val Ile Glu Gly Arg Leu Met
225                 230                 235                 240
Thr Glu Thr Ala Met Thr Ile Asp Ala Arg Tyr Thr Glu Leu Leu Gly
245                 250                 255
Arg Val Arg Tyr Met Trp Lys Leu Ile Asp Gly Phe Phe Pro Ala Leu
260                 265                 270
Gly Asn Pro Thr Tyr Gln Ile Val Ala Met Leu Glu Pro Leu Ser Leu
275                 280                 285
Ala Tyr Leu Gln Leu Arg Asp Ile Thr Val Glu Leu Arg Gly Ala Phe
290                 295                 300
Leu Asn His Cys Phe Thr Glu Ile His Asp Val Leu Asp Gln Asn Gly
305                 310                 315                 320
Phe Ser Asp Glu Gly Thr Tyr His Glu Leu Ile Glu Ala Leu Asp Tyr
325                 330                 335
Ile Phe Ile Thr Asp Asp Ile His Leu Thr Gly Glu Ile Phe Ser Phe
340                 345                 350
Phe Arg Ser Phe Gly His Pro Arg Leu Glu Ala Val Thr Ala Ala Glu
355                 360                 365
Asn Val Arg Lys Tyr Met Asn Gln Pro Lys Val Ile Val Tyr Glu Thr
370                 375                 380
Leu Met Lys Gly His Ala Ile Phe Cys Gly Ile Ile Ile Asn Gly Tyr
385                 390                 395                 400
Arg Asp Arg His Gly Gly Ser Trp Pro Pro Leu Thr Leu Pro Leu His
405                 410                 415
Ala Ala Asp Thr Ile Arg Asn Ala Gln Ala Ser Gly Glu Gly Leu Thr
420                 425                 430
His Glu Gln Cys Val Asp Asn Trp Lys Ser Phe Ala Gly Val Lys Phe
435                 440                 445
Gly Cys Phe Met Pro Leu Ser Leu Asp Ser Asp Leu Thr Met Tyr Leu
450                 455                 460
Lys Asp Lys Ala Leu Ala Ala Leu Gln Arg Glu Trp Asp Ser Val Tyr
465                 470                 475                 480
Pro Lys Glu Phe Leu Arg Tyr Asp Pro Pro Lys Gly Thr Gly Ser Arg
485                 490                 495
Arg Leu Val Asp Val Phe Leu Asn Asp Ser Ser Phe Asp Pro Tyr Asp
500                 505                 510
Val Ile Met Tyr Val Val Ser Gly Ala Tyr Leu His Asp Pro Glu Phe
515                 520                 525
Asn Leu Ser Tyr Ser Leu Lys Glu Lys Glu Ile Lys Glu Thr Gly Arg
530                 535                 540
Leu Phe Ala Lys Met Thr Tyr Lys Met Arg Ala Cys Gln Val Ile Ala
545                 550                 555                 560
Glu Asn Leu Ile Ser Asn Gly Ile Gly Lys Tyr Phe Lys Asp Asn Gly
565                 570                 575
Met Ala Lys Asp Glu His Asp Leu Thr Lys Ala Leu His Thr Leu Ala
580                 585                 590
Val Ser Gly Val Pro Lys Asp Leu Lys Glu Ser His Arg Gly Gly Pro
595                 600                 605
Val Leu Lys Thr Tyr Ser Arg Ser Pro Val His Thr Ser Thr Arg Asn
610                 615                 620
Val Arg Ala Ala Lys Gly Phe Ile Gly Phe Pro Gln Val Ile Arg Gln
625                 630                 635                 640
Asp Gln Asp Thr Asp His Pro Glu Asn Met Glu Ala Tyr Glu Thr Val
645                 650                 655
Ser Ala Phe Ile Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
660                 665                 670
Tyr Glu Thr Ile Ser Leu Phe Ala Gln Arg Leu Asn Glu Ile Tyr Gly
675                 680                 685
Leu Pro Ser Phe Phe Gln Trp Leu His Lys Arg Leu Glu Thr Ser Val
690                 695                 700
Leu Tyr Val Ser Asp Pro His Cys Pro Pro Asp Leu Asp Ala His Ile
705                 710                 715                 720
Pro Leu Tyr Lys Val Pro Asn Asp Gln Ile Phe Ile Lys Tyr Pro Met
725                 730                 735
Gly Gly Ile Glu Gly Tyr Cys Gln Lys Leu Trp Thr Ile Ser Thr Ile
740                 745                 750
Pro Tyr Leu Tyr Leu Ala Ala Tyr Glu Ser Gly Val Arg Ile Ala Ser
755                 760                 765
Leu Val Gln Gly Asp Asn Gln Thr Ile Ala Val Thr Lys Arg Val Pro
770                 775                 780
Ser Thr Trp Pro Tyr Asn Leu Lys Lys Arg Glu Ala Ala Arg Val Thr
785                 790                 795                 800
Arg Asp Tyr Phe Val Ile Leu Arg Gln Arg Leu His Asp Ile Gly His
805                 810                 815
His Leu Lys Ala Asn Glu Thr Ile Val Ser Ser His Phe Phe Val Tyr
820                 825                 830
Ser Lys Gly Ile Tyr Tyr Asp Gly Leu Leu Val Ser Gln Ser Leu Lys
835                 840                 845
Ser Ile Ala Arg Cys Val Phe Trp Ser Glu Thr Ile Val Asp Glu Thr
850                 855                 860
Arg Ala Ala Cys Ser Asn Ile Ala Thr Thr Met Ala Lys Ser Ile Glu
865                 870                 875                 880
Arg Gly Tyr Asp Arg Tyr Leu Ala Tyr Ser Leu Asn Val Leu Lys Val
885                 890                 895
Ile Gln Gln Ile Leu Ile Ser Leu Gly Phe Thr Ile Asn Ser Thr Met
900                 905                 910
Thr Arg Asp Val Val Ile Pro Leu Leu Thr Asn Asn Asp Leu Leu Ile
915                 920                 925
Arg Met Ala Leu Leu Pro Ala Pro Ile Gly Gly Met Asn Tyr Leu Asn
930                 935                 940
Met Ser Arg Leu Phe Val Arg Asn Ile Gly Asp Pro Val Thr Ser Ser
945                 950                 955                 960
Ile Ala Asp Leu Lys Arg Met Ile Leu Ala Ser Leu Met Pro Glu Glu
965                 970                 975
Thr Leu His Gln Val Met Thr Gln Gln Pro Gly Asp Ser Ser Phe Leu
980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Ala Asn Leu Val Cys Val Gln Ser
995                 1000                1005
Ile Thr Arg Leu Leu Lys Asn Ile Thr Ala Arg Phe Val Leu Ile His
1010                1015                1020
Ser Pro Asn Pro Met Leu Lys Gly Leu Phe His Asp Asp Ser Lys Glu
1025                1030                1035                1040
Glu Asp Glu Gly Leu Ala Ala Phe Leu Met Asp Arg His Ile Ile Val
1045                1050                1055
Pro Arg Ala Ala His Glu Ile Leu Asp His Ser Val Thr Gly Ala Arg
1060                1065                1070
Glu Ser Ile Ala Gly Met Leu Asp Thr Thr Lys Gly Leu Ile Arg Ala
1075                1080                1085
Ser Met Arg Lys Gly Gly Leu Thr Ser Arg Val Ile Thr Arg Leu Ser
1090                1095                1100
Asn Tyr Asp Tyr Glu Gln Phe Arg Ala Gly Met Val Leu Leu Thr Gly
1105                1110                1115                1120
Arg Lys Arg Asn Val Leu Ile Asp Lys Glu Ser Cys Ser Val Gln Leu
1125                1130                1135
Ala Arg Ala Leu Arg Ser His Met Trp Ala Arg Leu Ala Arg Gly Arg
1140                1145                1150
Pro Ile Tyr Gly Leu Glu Val Pro Asp Val Leu Glu Ser Met Arg Gly
1155                1160                1165
His Leu Ile Arg Arg His Glu Thr Cys Val Ile Cys Glu Cys Gly Ser
1170                1175                1180
Val Asn Tyr Gly Trp Phe Phe Val Pro Ser Gly Cys Gln Leu Asp Asp
1185                1190                1195                1200
Ile Asp Lys Glu Thr Ser Ser Leu Arg Val Pro Tyr Ile Gly Ser Thr
1205                1210                1215
Thr Asp Glu Arg Thr Asp Met Lys Leu Ala Phe Val Arg Ala Pro Ser
1220                1225                1230
Arg Ser Leu Arg Ser Ala Val Arg Ile Ala Thr Val Tyr Ser Trp Ala
1235                1240                1245
Tyr Gly Asp Asp Asp Ser Ser Trp Asn Glu Ala Trp Leu Leu Ala Arg
1250                1255                1260
Gln Arg Ala Asn Val Ser Leu Glu Glu Leu Arg Val Ile Thr Pro Ile
1265                1270                1275                1280
Ser Thr Ser Thr Asn Leu Ala His Arg Leu Arg Asp Arg Ser Thr Gln
1285                1290                1295
Val Lys Tyr Ser Gly Thr Ser Leu Val Arg Val Ala Arg Tyr Thr Thr
1300                1305                1310
Ile Ser Asn Asp Asn Leu Ser Phe Val Ile Ser Asp Lys Lys Val Asp
1315                1320                1325
Thr Asn Phe Ile Tyr Gln Gln Gly Met Leu Leu Gly Leu Gly Val Leu
1330                1335                1340
Glu Thr Leu Phe Arg Leu Glu Lys Asp Thr Gly Ser Ser Asn Thr Val
1345                1350                1355                1360
Leu His Leu His Val Glu Thr Asp Cys Cys Val Ile Pro Met Ile Asp
1365                1370                1375
His Pro Arg Ile Pro Ser Ser Arg Lys Leu Glu Leu Arg Ala Glu Leu
1380                1385                1390
Cys Thr Asn Pro Leu Ile Tyr Asp Asn Ala Pro Leu Ile Asp Arg Asp
1395                1400                1405
Ala Thr Arg Leu Tyr Thr Gln Ser His Arg Arg His Leu Val Glu Phe
1410                1415                1420
Val Thr Trp Ser Thr Pro Gln Leu Tyr His Ile Leu Ala Lys Ser Thr
1425                1430                1435                1440
Ala Leu Ser Met Ile Asp Leu Val Thr Lys Phe Glu Lys Asp His Met
1445                1450                1455
Asn Glu Ile Ser Ala Leu Ile Gly Asp Asp Asp Ile Asn Ser Phe Ile
1460                1465                1470
Thr Glu Phe Leu Leu Ile Glu Pro Arg Leu Phe Thr Ile Tyr Leu Gly
1475                1480                1485
Gln Cys Ala Ala Ile Asn Trp Ala Phe Asp Val His Tyr His Arg Pro
1490                1495                1500
Ser Gly Lys Tyr Gln Met Gly Glu Leu Leu Ser Ser Phe Leu Ser Arg
1505                1510                1515                1520
Met Ser Lys Gly Val Phe Lys Val Leu Val Asn Ala Leu Ser His Pro
1525                1530                1535
Lys Ile Tyr Lys Lys Phe Trp His Cys Gly Ile Ile Glu Pro Ile His
1540                1545                1550
Gly Pro Ser Leu Asp Ala Gln Asn Leu His Thr Thr Val Cys Asn Met
1555                1560                1565
Val Tyr Thr Cys Tyr Met Thr Tyr Leu Asp Leu Leu Leu Asn Glu Glu
1570                1575                1580
Leu Glu Glu Phe Thr Phe Leu Leu Cys Glu Ser Asp Glu Asp Val Val
1585                1590                1595                1600
Pro Asp Arg Phe Asp Asn Ile Gln Ala Lys His Leu Cys Val Leu Ala
1605                1610                1615
Asp Leu Tyr Cys Gln Pro Gly Thr Cys Pro Pro Ile Arg Gly Leu Arg
1620                1625                1630
Pro Val Glu Lys Cys Ala Val Leu Thr Asp His Ile Lys Ala Glu Ala
1635                1640                1645
Met Leu Ser Pro Ala Gly Ser Ser Trp Asn Ile Asn Pro Ile Ile Val
1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
1685                1690                1695
Glu Val Asn Val Ser Gln Pro Lys Ile Gly Ser Asn Asn Ile Ser Asn
1700                1705                1710
Met Ser Ile Lys Ala Phe Arg Pro Pro His Asp Asp Val Ala Lys Leu
1715                1720                1725
Leu Lys Asp Ile Asn Thr Ser Lys His Asn Leu Pro Ile Ser Gly Gly
1730                1735                1740
Asn Leu Ala Asn Tyr Glu Ile His Ala Phe Arg Arg Ile Gly Leu Asn
1745                1750                1755                1760
Ser Ser Ala Cys Tyr Lys Ala Val Glu Ile Ser Thr Leu Ile Arg Arg
1765                1770                1775
Cys Leu Glu Pro Gly Glu Asp Gly Leu Phe Leu Gly Glu Gly Ser Gly
1780                1785                1790
Ser Met Leu Ile Thr Tyr Lys Glu Ile Leu Lys Leu Asn Lys Cys Phe
1795                1800                1805
Tyr Asn Ser Gly Val Ser Ala Asn Ser Arg Ser Gly Gln Arg Glu Leu
1810                1815                1820
Ala Pro Tyr Pro Ser Glu Val Gly Leu Val Glu His Arg Met Gly Val
1825                1830                1835                1840
Gly Asn Ile Val Lys Val Leu Phe Asn Gly Arg Pro Glu Val Thr Trp
1845                1850                1855
Val Gly Ser Val Asp Cys Phe Asn Phe Ile Val Ser Asn Ile Pro Thr
1860                1865                1870
Ser Ser Val Gly Phe Ile His Ser Asp Ile Glu Thr Leu Pro Asp Lys
1875                1880                1885
Asp Thr Ile Glu Lys Leu Glu Glu Leu Ala Ala Ile Leu Ser Met Ala
1890                1895                1900
Leu Leu Leu Gly Lys Ile Gly Ser Ile Leu Val Ile Lys Leu Met Pro
1905                1910                1915                1920
Phe Ser Gly Asp Phe Val Gln Gly Phe Ile Ser Tyr Val Gly Ser His
1925                1930                1935
Tyr Arg Glu Val Asn Leu Val Tyr Pro Arg Tyr Ser Asn Phe Ile Ser
1940                1945                1950
Thr Glu Ser Tyr Leu Val Met Thr Asp Leu Lys Ala Asn Arg Leu Met
1955                1960                1965
Asn Pro Glu Lys Ile Lys Gln Gln Ile Ile Glu Ser Ser Val Arg Thr
1970                1975                1980
Ser Pro Gly Leu Ile Gly His Ile Leu Ser Ile Lys Gln Leu Ser Cys
1985                1990                1995                2000
Ile Gln Ala Ile Val Gly Asp Ala Val Ser Arg Gly Asp Ile Asn Pro
2005                2010                2015
Thr Leu Lys Lys Leu Thr Pro Ile Glu Gln Val Leu Ile Asn Cys Gly
2020                2025                2030
Leu Ala Ile Asn Gly Pro Lys Leu Cys Lys Glu Leu Ile His His Asp
2035                2040                2045
Val Ala Ser Gly Gln Asp Gly Leu Leu Asn Ser Ile Leu Ile Leu Tyr
2050                2055                2060
Arg Glu Leu Ala Arg Phe Lys Asp Asn Gln Arg Ser Gln Gln Gly Met
2065                2070                2075                2080
Phe His Ala Tyr Pro Val Leu Val Ser Ser Arg Gln Arg Glu Leu Ile
2085                2090                2095
Ser Arg Ile Thr Arg Lys Phe Trp Gly His Ile Leu Leu Tyr Ser Gly
2100                2105                2110
Asn Arg Lys Leu Ile Asn Lys Phe Ile Gln Asn Leu Lys Ser Gly Tyr
2115                2120                2125
Leu Ile Leu Asp Leu His Gln Asn Ile Phe Val Lys Asn Leu Ser Lys
2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Val Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
2180
(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15894 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: RNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TCTTCTAGTG CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120
TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240
TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTGAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420
TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540
GATGGTTCGG GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840
GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020
AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080
GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200
GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260
GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320
AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380
GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440
GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500
CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC AACGGAGTCC AGCCAAGATC 1560
CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620
CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680
ACTAGGTGCG AGAGGCCGAG GGCCAGAACA ACATCCGCCT ACCATCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800
GAGCCAATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920
ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980
GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040
CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTACG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400
AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460
GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520
TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640
GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820
CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060
CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120
CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360
CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCCAAG GTCCACAATG ACAGAGACCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600
TGCTGGGGGT TGTTGAGGAC AGCGATTCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660
TCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960
GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020
GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380
AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500
GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560
GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA CACAAGGCCA 4620
CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680
TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740
ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800
GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860
ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGTCCA 4920
CGGTGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980
CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040
AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160
GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220
ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280
CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340
CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400
CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940
GAGACAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGTTTA 6120
CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180
ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCGGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300
AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360
GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TACACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600
TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720
GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780
TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080
ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140
CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200
TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATCAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320
TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440
CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620
AATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980
TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040
AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100
GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160
AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220
CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520
GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640
GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC TACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820
ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880
TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120
ATAGGGCTGC TAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240
CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480
CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600
AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840
TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900
TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080
TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCATGAG TTAACTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260
TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560
TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800
TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040
GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100
GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160
ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280
TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340
CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400
ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520
GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060
TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180
CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300
TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480
AAGGCTTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660
ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080
CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380
CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680
TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920
CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040
GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100
GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160
ATATCAAGGC AGAGGCTATG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460
GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580
ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAA GAGATACTTA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820
TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880
AGACCTTGCC TGACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940
TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060
TATACCCTAG ATACAGCAAC TTCATCTCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120
AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360
AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480
CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540
TCTGGGGGCA CATTCTTCTT TACTCCGGGA ACAAAAAGTT GATAAATAAG TTTATCCAGA 15600
ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780
ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840
TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) Information for SEQ ID NO:12:
...
(i) sequence characteristics:
(A) length: 2183 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:12:
Met?Asp?Ser?Leu?Ser?Val?Asn?Gln?Ile?Leu?Tyr?Pro?Glu?Val?His?Leu1???????????????5???????????????????10??????????????????15Asp?Ser?Pro?Ile?Val?Thr?Asn?Lys?Ile?Val?Ala?Ile?Leu?Glu?Tyr?Ala
20??????????????????25??????????????????30Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Thr?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Val?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Tyr?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030???????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090???????????????1095?????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105???????????????1110?????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175???????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195???????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385???????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400????????????????1405Ala?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Met?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
1685                1690                1695
Glu Val Asn Val Ser Gln Pro Lys Ile Gly Ser Asn Asn Ile Ser Asn
1700                1705                1710
Met Ser Ile Lys Ala Phe Arg Pro Pro His Asp Asp Val Ala Lys Leu
1715                1720                1725
Leu Lys Asp Ile Asn Thr Ser Lys His Asn Leu Pro Ile Ser Gly Gly
1730                1735                1740
Asn Leu Ala Asn Tyr Glu Ile His Ala Phe Arg Arg Ile Gly Leu Asn
1745                1750                1755                1760
Ser Ser Ala Cys Tyr Lys Ala Val Glu Ile Ser Thr Leu Ile Arg Arg
1765                1770                1775
Cys Leu Glu Pro Gly Glu Asp Gly Leu Phe Leu Gly Glu Gly Ser Gly
1780                1785                1790
Ser Met Leu Ile Thr Tyr Lys Glu Ile Leu Lys Leu Asn Lys Cys Phe
1795                1800                1805
Tyr Asn Ser Gly Val Ser Ala Asn Ser Arg Ser Gly Gln Arg Glu Leu
1810                1815                1820
Ala Pro Tyr Pro Ser Glu Val Gly Leu Val Glu His Arg Met Gly Val
1825                1830                1835                1840
Gly Asn Ile Val Lys Val Leu Phe Asn Gly Arg Pro Glu Val Thr Trp
1845                1850                1855
Val Gly Ser Val Asp Cys Phe Asn Phe Ile Val Ser Asn Ile Pro Thr
1860                1865                1870
Ser Ser Val Gly Phe Ile His Ser Asp Ile Glu Thr Leu Pro Asp Lys
1875                1880                1885
Asp Thr Ile Glu Lys Leu Glu Glu Leu Ala Ala Ile Leu Ser Met Ala
1890                1895                1900
Leu Leu Leu Gly Lys Ile Gly Ser Ile Leu Val Ile Lys Leu Met Pro
1905                1910                1915                1920
Phe Ser Gly Asp Phe Val Gln Gly Phe Ile Ser Tyr Val Gly Ser His
1925                1930                1935
Tyr Arg Glu Val Asn Leu Val Tyr Pro Arg Tyr Ser Asn Phe Ile Ser
1940                1945                1950
Thr Glu Ser Tyr Leu Val Met Thr Asp Leu Lys Ala Asn Arg Leu Met
1955                1960                1965
Asn Pro Glu Lys Ile Lys Gln Gln Ile Ile Glu Ser Ser Val Arg Thr
1970                1975                1980
Ser Pro Gly Leu Ile Gly His Ile Leu Ser Ile Lys Gln Leu Ser Cys
1985                1990                1995                2000
Ile Gln Ala Ile Val Gly Asp Ala Val Ser Arg Gly Asp Ile Asn Pro
2005                2010                2015
Thr Leu Lys Lys Leu Thr Pro Ile Glu Gln Val Leu Ile Asn Cys Gly
2020                2025                2030
Leu Ala Ile Asn Gly Pro Lys Leu Cys Lys Glu Leu Ile His His Asp
2035                2040                2045
Val Ala Ser Gly Gln Asp Gly Leu Leu Asn Ser Ile Leu Ile Leu Tyr
2050                2055                2060
Arg Glu Leu Ala Arg Phe Lys Asp Asn Gln Arg Ser Gln Gln Gly Met
2065                2070                2075                2080
Phe His Ala Tyr Pro Val Leu Val Ser Ser Arg Gln Arg Glu Leu Ile
2085                2090                2095
Ser Arg Ile Thr Arg Lys Phe Trp Gly His Ile Leu Leu Tyr Ser Gly
2100                2105                2110
Asn Lys Lys Leu Ile Asn Lys Phe Ile Gln Asn Leu Lys Ser Gly Tyr
2115                2120                2125
Leu Ile Leu Asp Leu His Gln Asn Ile Phe Val Lys Asn Leu Ser Lys
2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Val Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
2180
(2) Information for SEQ ID NO:13:
(i) sequence characteristics:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:13:
ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TTTTCTAGTG CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAAGGAT ATCCGAGATG GCCACACTTT 120
TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240
TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420
TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540
GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATCCTA GCTCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840
GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020
AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080
GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200
GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260
GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320
AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380
GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440
GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500
CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560
CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620
CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680
ACTAGGTGCG AGAGGCCGAG GGCCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800
GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920
ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980
GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040
CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400
AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC TCCGGACCCC 2460
GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520
TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640
GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820
CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060
CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120
CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360
CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTTGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600
TGCTGGGGGT TGTTGAGGAC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660
CCCTGCCCTT AGGTGTTGGC AAATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAACGCAAA CCAAGTGTGC AGTGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960
GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020
GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAGAA GACCTTATGT TACCCGCTGA TAGATATCAA TGAAGACCTT AATCGATTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380
AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500
GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560
GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA TACAAGGCCA 4620
CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680
TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740
ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800
GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860
ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920
CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980
CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040
AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160
GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220
ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280
CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340
CTCCTCCCCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400
CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940
GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120
CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180
ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300
AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360
GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TCCACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600
TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720
GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGATGCTGTG 6780
TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080
ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140
CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200
TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320
TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440
CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620
CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980
TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040
AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100
GCAACCAGCC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160
AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220
CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520
GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640
GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC TACCTCTTCA ATGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820
ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880
TGTGGTTTAT TACGTTTACA GCCCAGGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120
ATAGGGCTGC TAGTGAACCA ATCTCATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240
CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480
CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600
AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840
TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900
TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080
TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260
TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560
TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800
TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040
GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100
GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160
ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280
TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340
CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400
ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520
GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060
TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180
CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300
TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480
AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660
ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080
CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380
CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATACA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680
TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920
CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040
GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100
GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160
ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460
GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580
ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820
TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880
AGACCTTGCC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940
TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCCCA TTATAGAGAA GTGAACCTTG 15060
TATACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120
AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360
AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480
CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540
TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600
ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780
ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840
TAGATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) Information for SEQ ID NO:14:
...
(i) sequence characteristics:
(A) length: 2183 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:14:
Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu
1               5                   10                  15
Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20                  25                  30
Arg Val Pro His Ala Tyr Ser Leu Glu Asp Pro Thr Leu Cys Gln Asn
35                  40                  45
Ile Lys His Arg Leu Lys Asn Gly Phe Ser Asn Gln Met Ile Ile Asn
50                  55                  60
Asn Val Glu Val Gly Asn Val Ile Lys Ser Lys Leu Arg Ser Tyr Pro
65                  70                  75                  80
Ala His Ser His Ile Pro Tyr Pro Asn Cys Asn Gln Asp Leu Phe Asn
85                  90                  95
Ile Glu Asp Lys Glu Ser Thr Arg Lys Ile Arg Glu Leu Leu Lys Lys
100                 105                 110
Gly Asn Ser Leu Tyr Ser Lys Val Ser Asp Lys Val Phe Gln Cys Leu
115                 120                 125
Arg Asp Thr Asn Ser Arg Leu Gly Leu Gly Ser Glu Leu Arg Glu Asp
130                 135                 140
Ile Lys Glu Lys Val Ile Asn Leu Gly Val Tyr Met His Ser Ser Gln
145                 150                 155                 160
Trp Phe Glu Pro Phe Leu Phe Trp Phe Thr Val Lys Thr Glu Met Arg
165                 170                 175
Ser Val Ile Lys Ser Gln Thr His Thr Cys His Arg Arg Arg His Thr
180                 185                 190
Pro Val Phe Phe Thr Gly Ser Ser Val Glu Leu Leu Ile Ser Arg Asp
195                 200                 205
Leu Val Ala Ile Ile Ser Lys Glu Ser Gln His Val Tyr Tyr Leu Thr
210                 215                 220
Phe Glu Leu Val Leu Met Tyr Cys Asp Val Ile Glu Gly Arg Leu Met
225                 230                 235                 240
Thr Glu Thr Ala Met Thr Ile Asp Ala Arg Tyr Thr Glu Leu Leu Gly
245                 250                 255
Arg Val Arg Tyr Met Trp Lys Leu Ile Asp Gly Phe Phe Pro Ala Leu
260                 265                 270
Gly Asn Pro Thr Tyr Gln Ile Val Ala Met Leu Glu Pro Leu Ser Leu
275                 280                 285
Ala Tyr Leu Gln Leu Arg Asp Ile Thr Val Glu Leu Arg Gly Ala Phe
290                 295                 300
Leu Asn His Cys Phe Thr Glu Ile His Asp Val Leu Asp Gln Asn Gly
305                 310                 315                 320
Phe Ser Asp Glu Gly Thr Tyr His Glu Leu Ile Glu Ala Leu Asp Tyr
325                 330                 335
Ile Phe Ile Thr Asp Asp Ile His Leu Thr Gly Glu Ile Phe Ser Phe
340                 345                 350
Phe Arg Ser Phe Gly His Pro Arg Leu Glu Ala Val Thr Ala Ala Glu
355                 360                 365
Asn Val Arg Lys Tyr Met Asn Gln Pro Lys Val Ile Val Tyr Glu Thr
370                 375                 380
Leu Met Lys Gly His Ala Ile Phe Cys Gly Ile Ile Ile Asn Gly Tyr
385                 390                 395                 400
Arg Asp Arg His Gly Gly Ser Trp Pro Pro Leu Thr Leu Pro Leu His
405                 410                 415
Ala Ala Asp Thr Ile Arg Asn Ala Gln Ala Ser Gly Glu Gly Leu Thr
420                 425                 430
His Glu Gln Cys Val Asp Asn Trp Lys Ser Phe Ala Gly Val Lys Phe
435                 440                 445
Gly Cys Phe Met Pro Leu Ser Leu Asp Ser Asp Leu Thr Met Tyr Leu
450                 455                 460
Lys Asp Lys Ala Leu Ala Ala Leu Gln Arg Glu Trp Asp Ser Val Tyr
465                 470                 475                 480
Pro Lys Glu Phe Leu Arg Tyr Asp Pro Pro Lys Gly Thr Gly Ser Arg
485                 490                 495
Arg Leu Val Asp Val Phe Leu Asn Asp Ser Ser Phe Asp Pro Tyr Asp
500                 505                 510
Val Ile Met Tyr Val Val Ser Gly Ala Tyr Leu His Asp Pro Glu Phe
515                 520                 525
Asn Leu Ser Tyr Ser Leu Lys Glu Lys Glu Ile Lys Glu Thr Gly Arg
530                 535                 540
Leu Phe Ala Lys Met Thr Tyr Lys Met Arg Ala Cys Gln Val Ile Ala
545                 550                 555                 560
Glu Asn Leu Ile Ser Asn Gly Ile Gly Lys Tyr Phe Lys Asp Asn Gly
565                 570                 575
Met Ala Lys Asp Glu His Asp Leu Thr Lys Ala Leu His Thr Leu Ala
580                 585                 590
Val Ser Gly Val Pro Lys Asp Leu Lys Glu Ser His Arg Gly Gly Pro
595                 600                 605
Val Leu Lys Thr Tyr Ser Arg Ser Pro Val His Thr Ser Thr Arg Asn
610                 615                 620
Val Arg Ala Ala Lys Gly Phe Ile Gly Phe Pro Gln Val Ile Arg Gln
625                 630                 635                 640
Asp Gln Asp Thr Asp His Pro Glu Asn Met Glu Ala Tyr Glu Thr Val
645                 650                 655
Ser Ala Phe Ile Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
660                 665                 670
Tyr Glu Thr Ile Ser Leu Phe Ala Gln Arg Leu Asn Glu Ile Tyr Gly
675                 680                 685
Leu Pro Ser Phe Phe Gln Trp Leu His Lys Arg Leu Glu Thr Ser Val
690                 695                 700
Leu Tyr Val Ser Asp Pro His Cys Pro Pro Asp Leu Asp Ala His Ile
705                 710                 715                 720
Pro Leu Tyr Lys Val Pro Asn Asp Gln Ile Phe Ile Lys Tyr Pro Met
725                 730                 735
Gly Gly Ile Glu Gly Tyr Cys Gln Lys Leu Trp Thr Ile Ser Thr Ile
740                 745                 750
Pro Tyr Leu Tyr Leu Ala Ala Tyr Glu Ser Gly Val Arg Ile Ala Ser
755                 760                 765
Leu Val Gln Gly Asp Asn Gln Thr Ile Ala Val Thr Lys Arg Val Pro
770                 775                 780
Ser Thr Trp Pro Tyr Asn Leu Lys Lys Arg Glu Ala Ala Arg Val Thr
785                 790                 795                 800
Arg Asp Tyr Phe Val Ile Leu Arg Gln Arg Leu His Asp Ile Gly His
805                 810                 815
His Leu Lys Ala Asn Glu Thr Ile Val Ser Ser His Phe Phe Val Tyr
820                 825                 830
Ser Lys Gly Ile Tyr Tyr Asp Gly Leu Leu Val Ser Gln Ser Leu Lys
835                 840                 845
Ser Ile Ala Arg Cys Val Phe Trp Ser Glu Thr Ile Val Asp Glu Thr
850                 855                 860
Arg Ala Ala Cys Ser Asn Ile Ala Thr Thr Met Ala Lys Ser Ile Glu
865                 870                 875                 880
Arg Gly Tyr Asp Arg Tyr Leu Ala Tyr Ser Leu Asn Val Leu Lys Val
885                 890                 895
Ile Gln Gln Ile Leu Ile Ser Leu Gly Phe Thr Ile Asn Ser Thr Met
900                 905                 910
Thr Arg Asp Val Val Ile Pro Leu Leu Thr Asn Asn Asp Leu Leu Ile
915                 920                 925
Arg Met Ala Leu Leu Pro Ala Pro Ile Gly Gly Met Asn Tyr Leu Asn
930                 935                 940
Met Ser Arg Leu Phe Val Arg Asn Ile Gly Asp Pro Val Thr Ser Ser
945                 950                 955                 960
Ile Ala Asp Leu Lys Arg Met Ile Leu Ala Ser Leu Met Pro Glu Glu
965                 970                 975
Thr Leu His Gln Val Met Thr Gln Gln Pro Gly Asp Ser Ser Phe Leu
980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Ala Asn Leu Val Cys Val Gln Ser
995                 1000                1005
Ile Thr Arg Leu Leu Lys Asn Ile Thr Ala Arg Phe Val Leu Ile His
1010                1015                1020
Ser Pro Asn Pro Met Leu Lys Gly Leu Phe His Asp Asp Ser Lys Glu
1025                1030                1035                1040
Glu Asp Glu Gly Leu Ala Ala Phe Leu Met Asp Arg His Ile Ile Val
1045                1050                1055
Pro Arg Ala Ala His Glu Ile Leu Asp His Ser Val Thr Gly Ala Arg
1060                1065                1070
Glu Ser Ile Ala Gly Met Leu Asp Thr Thr Lys Gly Leu Ile Arg Ala
1075                1080                1085
Ser Met Arg Lys Gly Gly Leu Thr Ser Arg Val Ile Thr Arg Leu Ser
1090                1095                1100
Asn Tyr Asp Tyr Glu Gln Phe Arg Ala Gly Met Val Leu Leu Thr Gly
1105                1110                1115                1120
Arg Lys Arg Asn Val Leu Ile Asp Lys Glu Ser Cys Ser Val Gln Leu
1125                1130                1135
Ala Arg Ala Leu Arg Ser His Met Trp Ala Arg Leu Ala Arg Gly Arg
1140                1145                1150
Pro Ile Tyr Gly Leu Glu Val Pro Asp Val Leu Glu Ser Met Arg Gly
1155                1160                1165
His Leu Ile Arg Arg His Glu Thr Cys Val Ile Cys Glu Cys Gly Ser
1170                1175                1180
Val Asn Tyr Gly Trp Phe Phe Val Pro Ser Gly Cys Gln Leu Asp Asp
1185                1190                1195                1200
Ile Asp Lys Glu Thr Ser Ser Leu Arg Val Pro Tyr Ile Gly Ser Thr
1205                1210                1215
Thr Asp Glu Arg Thr Asp Met Lys Leu Ala Phe Val Arg Ala Pro Ser
1220                1225                1230
Arg Ser Leu Arg Ser Ala Val Arg Ile Ala Thr Val Tyr Ser Trp Ala
1235                1240                1245
Tyr Gly Asp Asp Asp Ser Ser Trp Asn Glu Ala Trp Leu Leu Ala Arg
1250                1255                1260
Gln Arg Ala Asn Val Ser Leu Glu Glu Leu Arg Val Ile Thr Pro Ile
1265                1270                1275                1280
Ser Thr Ser Thr Asn Leu Ala His Arg Leu Arg Asp Arg Ser Thr Gln
1285                1290                1295
Val Lys Tyr Ser Gly Thr Ser Leu Val Arg Val Ala Arg Tyr Thr Thr
1300                1305                1310
Ile Ser Asn Asp Asn Leu Ser Phe Val Ile Ser Asp Lys Lys Val Asp
1315                1320                1325
Thr Asn Phe Ile Tyr Gln Gln Gly Met Leu Leu Gly Leu Gly Val Leu
1330                1335                1340
Glu Thr Leu Phe Arg Leu Glu Lys Asp Thr Gly Ser Ser Asn Thr Val
1345                1350                1355                1360
Leu His Leu His Val Glu Thr Asp Cys Cys Val Ile Pro Met Ile Asp
1365                1370                1375
His Pro Arg Ile Pro Ser Ser Arg Lys Leu Glu Leu Arg Ala Glu Leu
1380                1385                1390
Cys Thr Asn Pro Leu Ile Tyr Asp Asn Ala Pro Leu Ile Asp Arg Asp
1395                1400                1405
Thr Thr Arg Leu Tyr Thr Gln Ser His Arg Arg His Leu Val Glu Phe
1410                1415                1420
Val Thr Trp Ser Thr Pro Gln Leu Tyr His Ile Leu Ala Lys Ser Thr
1425                1430                1435                1440
Ala Leu Ser Met Ile Asp Leu Val Thr Lys Phe Glu Lys Asp His Met
1445                1450                1455
Asn Glu Ile Ser Ala Leu Ile Gly Asp Asp Asp Ile Asn Ser Phe Ile
1460                1465                1470
Thr Glu Phe Leu Leu Ile Glu Pro Arg Leu Phe Thr Ile Tyr Leu Gly
1475                1480                1485
Gln Cys Ala Ala Ile Asn Trp Ala Phe Asp Val His Tyr His Arg Pro
1490                1495                1500
Ser Gly Lys Tyr Gln Met Gly Glu Leu Leu Ser Ser Phe Leu Ser Arg
1505                1510                1515                1520
Met Ser Lys Gly Val Phe Lys Val Leu Val Asn Ala Leu Ser His Pro
1525                1530                1535
Lys Ile Tyr Lys Lys Phe Trp His Cys Gly Ile Ile Glu Pro Ile His
1540                1545                1550
Gly Pro Ser Leu Asp Ala Gln Asn Leu His Thr Thr Val Cys Asn Met
1555                1560                1565
Val Tyr Thr Cys Tyr Met Thr Tyr Leu Asp Leu Leu Leu Asn Glu Glu
1570                1575                1580
Leu Glu Glu Phe Thr Phe Leu Leu Cys Glu Ser Asp Glu Asp Val Val
1585                1590                1595                1600
Pro Asp Arg Phe Asp Asn Ile Gln Ala Lys His Leu Cys Val Leu Ala
1605                1610                1615
Asp Leu Tyr Cys Gln Pro Gly Thr Cys Pro Pro Ile Arg Gly Leu Arg
1620                1625                1630
Pro Val Glu Lys Cys Ala Val Leu Thr Asp His Ile Lys Ala Glu Ala
1635                1640                1645
Arg Leu Ser Pro Ala Gly Ser Ser Trp Asn Ile Asn Pro Ile Ile Val
1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Ala?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asn?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?His
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys1985????????????????1990????????????????1995????????????????2000Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015Thr?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Asn?Cys?Gly
2020????????????????2025????????????????2030Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met2065????????????????2070????????????????2075????????????????2080Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085????????????????2090????????????????2095Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110Asn?Arg?Lys?Leu?Ile?Asn?Lys?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
    2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Val Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
                2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
            2180
(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15894 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: RNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TCTTCTAGTG CACTTAGGAT 60
TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120
TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180
GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240
TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300
GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360
GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420
TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480
ATGAGGCGGA CAAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540
GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600
TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660
CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720
TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780
AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840
GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900
GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960
GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020
AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080
GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140
ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200
GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260
GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320
AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380
GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440
GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500
CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560
CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620
CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680
ACTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740
AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800
GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860
CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920
ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980
GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040
CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100
AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160
GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220
AGCACCCTAT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280
GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340
GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400
AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460
GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520
TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580
CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTA TGTGAGCAAT 2640
GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700
AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760
AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820
CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880
AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940
AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000
GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060
CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120
CCAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180
GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240
CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300
CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360
CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420
GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480
AAGGGTCGAT CGCTCCGATA CAACCGACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540
TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTCTC 3600
TGCTGGGGGT TGTTGAGGAC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660
CCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720
CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780
ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840
TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900
TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960
GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020
GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080
CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140
ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200
GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260
GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320
TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380
AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440
TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500
GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACTGACC AAGCGAGAGG CCAGCCAGCA 4560
GCCGACGGCA AGCACGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA TACAAGGCCA 4620
CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680
TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740
ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800
GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860
ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920
CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980
CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCT AACCATCGAC 5040
AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100
GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160
GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220
ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280
CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340
CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400
CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460
GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520
ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580
AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640
ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700
ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760
CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820
GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880
CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940
GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000
ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060
CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120
CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180
ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240
AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300
AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360
GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420
CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480
GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540
TACACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600
TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660
ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TAACTGCCCG 6720
GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780
TACTTGCACA GAATTGACCT CGGTCCTCCC ATATTATTGG AGAGGTTGGA CGTAGGGACA 6840
AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900
CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTTGCA TAGTCTACAT CCTGATTGCA 6960
GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020
AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080
ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140
CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200
TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260
ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320
TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380
TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440
CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500
TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560
AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620
CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680
TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740
GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800
CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860
ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920
ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980
TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040
AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100
GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160
AGCCCTTTGT CACCGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220
CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280
CACCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340
TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400
AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460
CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520
GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580
CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640
ACCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700
GGTTAGTCCC TACCTCTTCA ATGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760
AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820
ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880
TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940
GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000
CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060
TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120
ATAGGGCTGC TAGTGAACTA ATCTCATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180
GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240
CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300
ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360
CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420
TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480
CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540
CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600
AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660
AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720
AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780
CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840
TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900
TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960
CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020
AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080
TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140
CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200
ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260
TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320
CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380
AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440
GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500
ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560
TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620
ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680
AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740
TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800
TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860
GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920
TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980
ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040
GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100
GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160
ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220
ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280
TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340
CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400
ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460
GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520
GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580
TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640
ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700
CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760
TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820
AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880
ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940
CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000
ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060
TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120
ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180
CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240
TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300
TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360
AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420
TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480
AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540
TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600
GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660
ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720
TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780
GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840
AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900
TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960
CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020
CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080
CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140
CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200
CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260
TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320
TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380
CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440
CTTTAATTGA CAGAGATACA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500
AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560
CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620
TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680
TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740
GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800
AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860
GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920
CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980
AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040
GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100
GGGCCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160
ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220
TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280
GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340
CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400
ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460
GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520
CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580
ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640
AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700
AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760
TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820
TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880
AGACCTTGCC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940
TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000
GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTTA TTATAGAGAA GTGAACCTTG 15060
TATACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120
AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180
GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240
CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300
CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360
AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420
TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCGAAGAAG TCAACAAGGG ATGTTCCACG 15480
CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540
TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600
ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660
CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720
TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780
ACTAATTGAT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840
TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894
(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
...
(A) LENGTH: 2183 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu
1               5                   10                  15
Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20??????????????????25??????????????????30Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Ile?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Val?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Tyr?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
                1205                1210                1215
Thr Asp Glu Arg Thr Asp Met Lys Leu Ala Phe Val Arg Ala Pro Ser
            1220                1225                1230
Arg Ser Leu Arg Ser Ala Val Arg Ile Ala Thr Val Tyr Ser Trp Ala
        1235                1240                1245
Tyr Gly Asp Asp Asp Ser Ser Trp Asn Glu Ala Trp Leu Leu Ala Arg
    1250                1255                1260
Gln Arg Ala Asn Val Ser Leu Glu Glu Leu Arg Val Ile Thr Pro Ile
1265                1270                1275                1280
Ser Thr Ser Thr Asn Leu Ala His Arg Leu Arg Asp Arg Ser Thr Gln
                1285                1290                1295
Val Lys Tyr Ser Gly Thr Ser Leu Val Arg Val Ala Arg Tyr Thr Thr
            1300                1305                1310
Ile Ser Asn Asp Asn Leu Ser Phe Val Ile Ser Asp Lys Lys Val Asp
        1315                1320                1325
Thr Asn Phe Ile Tyr Gln Gln Gly Met Leu Leu Gly Leu Gly Val Leu
    1330                1335                1340
Glu Thr Leu Phe Arg Leu Glu Lys Asp Thr Gly Ser Ser Asn Thr Val
1345                1350                1355                1360
Leu His Leu His Val Glu Thr Asp Cys Cys Val Ile Pro Met Ile Asp
                1365                1370                1375
His Pro Arg Ile Pro Ser Ser Arg Lys Leu Glu Leu Arg Ala Glu Leu
            1380                1385                1390
Cys Thr Asn Pro Leu Ile Tyr Asp Asn Ala Pro Leu Ile Asp Arg Asp
        1395                1400                1405
Thr Thr Arg Leu Tyr Thr Gln Ser His Arg Arg His Leu Val Glu Phe
    1410                1415                1420
Val Thr Trp Ser Thr Pro Gln Leu Tyr His Ile Leu Ala Lys Ser Thr
1425                1430                1435                1440
Ala Leu Ser Met Ile Asp Leu Val Thr Lys Phe Glu Lys Asp His Met
                1445                1450                1455
Asn Glu Ile Ser Ala Leu Ile Gly Asp Asp Asp Ile Asn Ser Phe Ile
            1460                1465                1470
Thr Glu Phe Leu Leu Ile Glu Pro Arg Leu Phe Thr Ile Tyr Leu Gly
        1475                1480                1485
Gln Cys Ala Ala Ile Asn Trp Ala Phe Asp Val His Tyr His Arg Pro
    1490                1495                1500
Ser Gly Lys Tyr Gln Met Gly Glu Leu Leu Ser Ser Phe Leu Ser Arg
1505                1510                1515                1520
Met Ser Lys Gly Val Phe Lys Val Leu Val Asn Ala Leu Ser His Pro
                1525                1530                1535
Lys Ile Tyr Lys Lys Phe Trp His Cys Gly Ile Ile Glu Pro Ile His
            1540                1545                1550
Gly Pro Ser Leu Asp Ala Gln Asn Leu His Thr Thr Val Cys Asn Met
        1555                1560                1565
Val Tyr Thr Cys Tyr Met Thr Tyr Leu Asp Leu Leu Leu Asn Glu Glu
    1570                1575                1580
Leu Glu Glu Phe Thr Phe Leu Leu Cys Glu Ser Asp Glu Asp Val Val
1585                1590                1595                1600
Pro Asp Arg Phe Asp Asn Ile Gln Ala Lys His Leu Cys Val Leu Ala
                1605                1610                1615
Asp Leu Tyr Cys Gln Pro Gly Ala Cys Pro Pro Ile Arg Gly Leu Arg
            1620                1625                1630
Pro Val Glu Lys Cys Ala Val Leu Thr Asp His Ile Lys Ala Glu Ala
        1635                1640                1645
Arg Leu Ser Pro Ala Gly Ser Ser Trp Asn Ile Asn Pro Ile Ile Val
    1650                1655                1660
Asp His Tyr Ser Cys Ser Leu Thr Tyr Leu Arg Arg Gly Ser Ile Lys
1665                1670                1675                1680
Gln Ile Arg Leu Arg Val Asp Pro Gly Phe Ile Phe Asp Ala Leu Ala
                1685                1690                1695
Glu Val Asn Val Ser Gln Pro Lys Ile Gly Ser Asn Asn Ile Ser Asn
            1700                1705                1710
Met Ser Ile Lys Ala Phe Arg Pro Pro His Asp Asp Val Ala Lys Leu
        1715                1720                1725
Leu Lys Asp Ile Asn Thr Ser Lys His Asn Leu Pro Ile Ser Gly Gly
    1730                1735                1740
Asn Leu Ala Asn Tyr Glu Ile His Ala Phe Arg Arg Ile Gly Leu Asn
1745                1750                1755                1760
Ser Ser Ala Cys Tyr Lys Ala Val Glu Ile Ser Thr Leu Ile Arg Arg
                1765                1770                1775
Cys Leu Glu Pro Gly Glu Asp Gly Leu Phe Leu Gly Glu Gly Ser Gly
            1780                1785                1790
Ser Met Leu Ile Thr Tyr Lys Glu Ile Leu Lys Leu Asn Lys Cys Phe
        1795                1800                1805
Tyr Asn Ser Gly Val Ser Ala Asn Ser Arg Ser Gly Gln Arg Glu Leu
    1810                1815                1820
Ala Pro Tyr Pro Ser Glu Val Gly Leu Val Glu His Arg Met Gly Val
1825                1830                1835                1840
Gly Asn Ile Val Lys Val Leu Phe Asn Gly Arg Pro Glu Val Thr Trp
                1845                1850                1855
Val Gly Ser Val Asp Cys Phe Asn Phe Ile Val Ser Asn Ile Pro Thr
            1860                1865                1870
Ser Ser Val Gly Phe Ile His Ser Asp Ile Glu Thr Leu Pro Asn Lys
        1875                1880                1885
Asp Thr Ile Glu Lys Leu Glu Glu Leu Ala Ala Ile Leu Ser Met Ala
    1890                1895                1900
Leu Leu Leu Gly Lys Ile Gly Ser Ile Leu Val Ile Lys Leu Met Pro
1905                1910                1915                1920
Phe Ser Gly Asp Phe Val Gln Gly Phe Ile Ser Tyr Val Gly Ser Tyr
                1925                1930                1935
Tyr Arg Glu Val Asn Leu Val Tyr Pro Arg Tyr Ser Asn Phe Ile Ser
            1940                1945                1950
Thr Glu Ser Tyr Leu Val Met Thr Asp Leu Lys Ala Asn Arg Leu Met
        1955                1960                1965
Asn Pro Glu Lys Ile Lys Gln Gln Ile Ile Glu Ser Ser Val Arg Thr
    1970                1975                1980
Ser Pro Gly Leu Ile Gly His Ile Leu Ser Ile Lys Gln Leu Ser Cys
1985                1990                1995                2000
Ile Gln Ala Ile Val Gly Asp Ala Val Ser Arg Gly Asp Ile Asn Pro
                2005                2010                2015
Thr Leu Lys Lys Leu Thr Pro Ile Glu Gln Val Leu Ile Asn Cys Gly
            2020                2025                2030
Leu Ala Ile Asn Gly Pro Lys Leu Cys Lys Glu Leu Ile His His Asp
        2035                2040                2045
Val Ala Ser Gly Gln Asp Gly Leu Leu Asn Ser Ile Leu Ile Leu Tyr
    2050                2055                2060
Arg Glu Leu Ala Arg Phe Lys Asp Asn Arg Arg Ser Gln Gln Gly Met
2065                2070                2075                2080
Phe His Ala Tyr Pro Val Leu Val Ser Ser Arg Gln Arg Glu Leu Ile
                2085                2090                2095
Ser Arg Ile Thr Arg Lys Phe Trp Gly His Ile Leu Leu Tyr Ser Gly
            2100                2105                2110
Asn Arg Lys Leu Ile Asn Lys Phe Ile Gln Asn Leu Lys Ser Gly Tyr
        2115                2120                2125
Leu Ile Leu Asp Leu His Gln Asn Ile Phe Val Lys Asn Leu Ser Lys
    2130                2135                2140
Ser Glu Lys Gln Ile Ile Met Thr Gly Gly Leu Lys Arg Glu Trp Val
2145                2150                2155                2160
Phe Lys Val Thr Val Lys Glu Thr Lys Glu Trp Tyr Lys Leu Val Gly
                2165                2170                2175
Tyr Ser Ala Leu Ile Lys Asp
            2180
(2) Information for SEQ ID NO:17:
(i) sequence characteristics:
(A) length: 15462 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:17:
ACCAAACAAG AGAAGAAACT TGTCTGGGAA TATAAATTTA ACTTTAAATT AACTTAGGAT 60
TAAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120
TATTTGATAC ATTTAATGCA CGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180
TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240
ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGAAAC 300
AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT GCCAATCCAG 360
AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGTCAA GTATGTCATA TACATGATTG 420
AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA GAGATGATAT 480
ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG GAAACTATGT 540
TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT GGGTATCCAT 600
CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT ATCACTAGTA 660
TCTCAGGGTT AAGAAAAGGC TTTTTCACCC GATTGGAAGC TTTCAGACAA GATGGAACAG 720
TGCAGGCAGG GCTGGTATTG AGCGGTGACA CAGTGGATCA GATTGGGTCA ATCATGCGGT 780
CTCAACAGAG CTTGGTAACT CTTATGGTTG AAACATTAAT AACAATGAAT ACCAGCAGAA 840
ATGACCTCAC AACCATAGAA AAGAATATAC AAATTGTTGG CAACTACATA AGAGATGCAG 900
GTCTCGCTTC ATTCTTCAAT ACAATCAGAT ATGGAATTGA GACCAGAATG GCAGCTTTGA 960
CTCTATCCAC TCTCAGACCA GATATCAATA GATTAAAAGC TTTGATGGAA CTGTATTTAT 1020
CAAAGGGACC ACGCGCTCCT TTCATCTGTA TCCTCAGAGA TCCTATACAT GGTGAGTTCG 1080
CACCAGGCAA CTATCCTGCC ATATGGAGCT ATGCAATGGG GGTGGCAGTT GTACAAAATA 1140
GAGCCATGCA ACAGTATGTG ACGGGAAGAT CATATCTAGA CATTGATATG TTCCAGCTAG 1200
GACAAGCAGT AGCACGTGAT GCCGAAGCTC AAATGAGCTC AACACTGGAA GATGAACTTG 1260
GAGTGACACA CGAATCTAAA GAAAGCTTGA AGAGACATAT AAGGAACATA AACAGTTCAG 1320
AGACATCTTT CCACAAACCG ACAGGTGGAT CAGCCATAGA GATGGCAATA GATGAAGAGC 1380
CAGAACAATT CGAACATAGA GCAGATCAAG AACAAAATGG AGAACCTCAA TCATCCATAA 1440
TTCAATATGC CTGGGCAGAA GGAAATAGAA GCGATGATCA GACTGAGCAA GCTACAGAAT 1500
CTGACAATAT CAAGACCGAA CAACAAAACA TCAGAGACAG ACTAAACAAG AGACTCAACG 1560
ACAAGAAGAA ACAAAGCAGT CAACCACCCA CTAATCCCAC AAACAGAACA AACCAGGACG 1620
AAATAGATGA TCTGTTTAAC GCATTTGGAA GCAACTAATC GAATCAACAT TTTAATCTAA 1680
ATCAATAATA AATAAGAAAA ACTTAGGATT AAAGAATCCT ATCATACCGG AATATAGGGT 1740
GGTAAATTTA GAGTCTGCTT GAAACTCAAT CAATAGAGAG TTGATGGAAA GCGATGCTAA 1800
AAACTATCAA ATCATGGATT CTTGGGAAGA GGAATCAAGA GATAAATCAA CTAATATCTC 1860
CTCGGCCCTC AACATCATTG AATTCATACT CAGCACCGAC CCCCAAGAAG ACTTATCGGA 1920
AAACGACACA ATCAACACAA GAACCCAGCA ACTCAGTGCC ACCATCTGTC AACCAGAAAT 1980
CAAACCAACA GAAACAAGTG AGAAAGATAG TGGATCAACT GACAAAAATA GACAGTCCGG 2040
GTCATCACAC GAATGTACAA CAGAAGCAAA AGATAGAAAT ATTGATCAGG AAACTGTACA 2100
GAGAGGACCT GGGAGAAGAA GCAGCTCAGA TAGTAGAGCT GAGACTGTGG TCTCTGGAGG 2160
AATCCCCAGA AGCATCACAG ATTCTAAAAA TGGAACCCAA AACACGGAGG ATATTGATCT 2220
CAATGAAATT AGAAAGATGG ATAAGGACTC TATTGAGGGG AAAATGCGAC AATCTGCAAA 2280
TGTTCCAAGC GAGATATCAG GAAGTGATGA CATATTTACA ACAGAACAAA GTAGAAACAG 2340
TGATCATGGA AGAAGCCTGG AATCTATCAG TACACCTGAT ACAAGATCAA TAAGTGTTGT 2400
TACTGCTGCA ACACCAGATG ATGAAGAAGA AATACTAATG AAAAATAGTA GGACAAAGAA 2460
AAGTTCTTCA ACACATCAAG AAGATGACAA AAGAATTAAA AAAGGGGGAA AAGGGAAAGA 2520
CTGGTTTAAG AAATCAAAAG ATACCGACAA CCAGATACCA ACATCAGACT ACAGATCCAC 2580
ATCAAAAGGG CAGAAGAAAA TCTCAAAGAC AACAACCACC AACACCGACA CAAAGGGGCA 2640
AACAGAAATA CAGACAGAAT CATCAGAAAC ACAATCCTCA TCATGGAATC TCATCATCGA 2700
CAACAACACC GACCGGAACG AACAGACAAG CACAACTCCT CCAACAACAA CTTCCAGATC 2760
AACTTATACA AAAGAATCGA TCCGAACAAA CTCTGAATCC AAACCCAAGA CACAAAAGAC 2820
AAATGGAAAG GAAAGGAAGG ATACAGAAGA GAGCAATCGA TTTACAGAGA GGGCAATTAC 2880
TCTATTGCAG AATCTTGGTG TAATTCAATC CACATCAAAA CTAGATTTAT ATCAAGACAA 2940
ACGAGTTGTA TGTGTAGCAA ATGTACTAAA CAATGTAGAT ACTGCATCAA AGATAGATTT 3000
CCTGGCAGGA TTAGTCATAG GGGTTTCAAT GGACAACGAC ACAAAATTAA CACAGATACA 3060
AAATGAAATG CTAAACCTCA AAGCAGATCT AAAGAAAATG GACGAATCAC ATAGAAGATT 3120
GATAGAAAAT CAAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 3180
TATGACTGAG AGAGGAGGAA AGAAAGACCA AAATGAATCC AATGAGAGAG TATCCATGAT 3240
CAAAACAAAA TTGAAAGAAG AAAAGATCAA GAAGACCAGG TTTGACCCAC TTATGGAGGC 3300
ACAAGGCATT GACAAGAATA TACCCGATCT ATATCGACAT GCAGGAGATA CACTAGAGAA 3360
CGATGTACAA GTTAAATCAG AGATATTAAG TTCATACAAT GAGTCAAATG CAACAAGACT 3420
AATACCCAAA AAAGTGAGCA GTACAATGAG ATCACTAGTT GCAGTCATCA ACAACAGCAA 3480
TCTCTCACAA AGCACAAAAC AATCATACAT AAACGAACTC AAACGTTGCA AAAATGATGA 3540
AGAAGTATCT GAATTAATGG ACATGTTCAA TGAAGATGTC AACAATTGCC AATGATCCAA 3600
CAAAGAAACG ACACCGAACA AACAGACAAG AAACAACAGT AGATCAAAAC CTGTCAACAC 3660
ACACAAAATC AAGCAGAATG AAACAACAGA TATCAATCAA TATACAAATA AGAAAAACTT 3720
AGGATTAAAG AATAAATTAA TCCTTGTCCA AAATGAGTAT AACTAACTCT GCAATATACA 3780
CATTCCCAGA ATCATCATTC TCTGAAAATG GTCATATAGA ACCATTACCA CTCAAAGTCA 3840
ATGAACAGAG GAAAGCAGTA CCCCACATTA GAGTTGCCAA GATCGGAAAT CCACCAAAAC 3900
ACGGATCCCG GTATTTAGAT GTCTTCTTAC TCGGCTTCTT CGAGATGGAA CGAATCAAAG 3960
ACAAATACGG GAGTGTGAAT GATCTCGACA GTGACCCGAG TTACAAAGTT TGTGGCTCTG 4020
GATCATTACC AATCGGATTG GCTAAGTACA CTGGGAATGA CCAGGAATTG TTACAAGCCG 4080
CAACCAAACT GGATATAGAA GTGAGAAGAA CAGTCAAAGC GAAAGAGATG GTTGTTTACA 4140
CGGTACAAAA TATAAAACCA GAACTGTACC CATGGTCCAA TAGACTAAGA AAAGGAATGC 4200
TGTTCGATGC CAACAAAGTT GCTCTTGCTC CTCAATGTCT TCCACTAGAT AGGAGCATAA 4260
AATTTAGAGT AATCTTCGTG AATTGTACGG CAATTGGATC AATAACCTTG TTCAAAATTC 4320
CTAAGTCAAT GGCATCACTA TCTCTACCCA ACACAATATC AATCAATCTG CAGGTACACA 4380
TAAAAACAGG GGTTCAGACT GATTCTAAAG GGATAGTTCA AATTTTGGAT GAGAAAGGCG 4440
AAAAATCACT GAATTTCATG GTCCATCTCG GATTGATCAA AAGAAAAGTA GGCAGAATGT 4500
ACTCTGTTGA ATACTGTAAA CAGAAAATCG AGAAAATGAG ATTGATATTT TCTTTAGGAC 4560
TAGTTGGAGG AATCAGTCTT CATGTCAATG CAACTGGGTC CATATCAAAA ACACTAGCAA 4620
GTCAGCTGGT ATTCAAAAGA GAGATTTGTT ATCCTTTAAT GGATCTAAAT CCGCATCTCA 4680
ATCTAGTTAT CTGGGCTTCA TCAGTAGAGA TTACAAGAGT GGATGCAATT TTCCAACCTT 4740
CTTTACCTGG CGAGTTCAGA TACTATCCTA ATATTATTGC AAAAGGAGTT GGGAAAATCA 4800
AACAATGGAA CTAGTAATCT CTATTTTAGT CCGGACGTAT CTATTAAGCC GAAGCAAATA 4860
AAGGATAATC AAAAACTTAG GACAAAAGAG GTCAATACCA ACAACTATTA GCAGTCACAC 4920
TCGCAAGAAT AAGAGAGAAG GGACCAAAAA AGTCAAATAG GAGAAATCAA AACAAAAGGT 4980
ACAGAACACC AGAACAACAA AATCAAAACA TCCAACTCAC TCAAAACAAA AATTCCAAAA 5040
GAGACCGGCA ACACAACAAG CACTGAACAC AATGCCAACT TCAATACTGC TAATTATTAC 5100
AACCATGATC ATGGCATCTT TCTGCCAAAT AGATATCACA AAACTACAGC ACGTAGGTGT 5160
ATTGGTCAAC AGTCCCAAAG GGATGAAGAT ATCACAAAAC TTTGAAACAA GATATCTAAT 5220
TTTGAGCCTC ATACCAAAAA TAGAAGACTC TAACTCTTGT GGTGACCAAC AGATCAAGCA 5280
ATACAAGAAG TTATTGGATA GACTGATCAT CCCTTTATAT GATGGATTAA GATTACAGAA 5340
AGATGTGATA GTAACCAATC AAGAATCCAA TGAAAACACT GATCCCAGAA CAAAACGATT 5400
CTTTGGAGGG GTAATTGGAA CCATTGCTCT GGGAGTAGCA ACCTCAGCAC AAATTACAGC 5460
GGCAGTTGCT CTGGTTGAAG CCAAGCAGGC AAGATCAGAC ATCGAAAAAC TCAAAGAAGC 5520
AATTAGGGAC ACAAACAAAG CAGTGCAGTC AGTTCAGAGC TCCATAGGAA ATTTAATAGT 5580
AGCAATTAAA TCAGTCCAGG ATTATGTTAA CAAAGAAATC GTGCCATCGA TTGCGAGGCT 5640
AGGTTGTGAA GCAGCAGGAC TTCAATTAGG AATTGCATTA ACACAGCATT ACTCAGAATT 5700
AACAAACATA TTTGGTGATA ACATAGGATC GTTACAAGAA AAAGGAATAA AATTACAAGG 5760
TATAGCATCA TTATACCGCA CAAATATCAC AGAAATATTC ACAACATCAA CAGTTGATAA 5820
ATATGATATC TATGATCTGT TATTTACAGA ATCAATAAAG GTGAGAGTTA TAGATGTTGA 5880
CTTGAATGAT TACTCAATCA CCCTCCAAGT CAGACTCCCT TTATTAACTA GGCTGCTGAA 5940
CACTCAGATC TACAAAGTAG ATTCCATATC ATATAACATC CAAAACAGAG AATGGTATAT 6000
CCCTCTTCCC AGCCATATCA TGACGAAAGG GGCATTTCTA GGTGGAGCAG ACGTCAAAGA 6060
ATGTATAGAA GCATTCAGCA GCTATATATG CCCTTCTGAT CCAGGATTTG TATTAAACCA 6120
TGAAATAGAG AGCTGCTTAT CAGGAAACAT ATCCCAATGT CCAAGAACAA CGGTCACATC 6180
AGACATTGTT CCAAGATATG CATTTGTCAA TGGAGGAGTG GTTGCAAACT GTATAACAAC 6240
CACCTGTACA TGCAACGGAA TTGGTAATAG AATCAATCAA CCACCTGATC AAGGAGTAAA 6300
AATTATAACA CATAAAGAAT GTAGTACAAT AGGTATCAAC GGAATGCTGT TCAATACAAA 6360
TAAAGAAGGA ACTCTTGCAT TCTATACACC AAATGATATA ACACTAAACA ATTCTGTTGC 6420
ACTTGATCCA ATTGACATAT CAATCGAGCT CAACAAGGCC AAATCAGATC TAGAAGAATC 6480
AAAAGAATGG ATAAGAAGGT CAAATCAAAA ACTAGATTCT ATTGGAAATT GGCATCAATC 6540
TAGCACTACA ATCATAATTA TTTTGATAAT GATCATTATA TTGTTTATAA TTAATATAAC 6600
GATAATTACA ATTGCAATTA AGTATTACAG AATTCAAAAG AGAAATCGAG TGGATCAAAA 6660
TGACAAGCCA TATGTACTAA CAAACAAATA ACATATCTAC AGATCATTAG ATATTAAAAT 6720
TATAAAAAAC TTAGGAGTAA AGTTACGCAA TCCAACTCTA CTCATATAAT TGAGGAAGGA 6780
CCCAATAGAC AAATCCAAAT TCGAGATGGA ATACTGGAAG CATACCAATC ACGGAAAGGA 6840
TGCTGGTAAT GAGCTGGAGA CGTCTATGGC TACTCATGGC AACAAGCTCA CTAATAAGAT 6900
AATATACATA TTATGGACAA TAATCCTGGT GTTATTATCA ATAGTCTTCA TCATAGTGCT 6960
AATTAATTCC ATCAAAAGTG AAAAGGCCCA CGAATCATTG CTGCAAGACA TAAATAATGA 7020
GTTTATGGAA ATTACAGAAA AGATCCAAAT GGCATCGGAT AATACCAATG ATCTAATACA 7080
GTCAGGAGTG AATACAAGGC TTCTTACAAT TCAGAGTCAT GTCCAGAATT ACATACCAAT 7140
ATCATTGACA CAACAGATGT CAGATCTTAG GAAATTCATT AGTGAAATTA CAATTAGAAA 7200
TGATAATCAA GAAGTGCTGC CACAAAGAAT AACACATGAT GTAGGTATAA AACCTTTAAA 7260
TCCAGATGAT TTTTGGAGAT GCACGTCTGG TCTTCCATCT TTAATGAAAA CTCCAAAAAT 7320
AAGGTTAATG CCAGGGCCGG GATTATTAGC TATGCCAACG ACTGTTGATG GCTGTGTTAG 7380
AACTCCGTCT TTAGTTATAA ATGATCTGAT TTATGCTTAT ACCTCAAATC TAATTACTCG 7440
AGGTTGTCAG GATATAGGAA AATCATATCA AGTCTTACAG ATAGGGATAA TAACTGTAAA 7500
CTCAGACTTG GTACCTGACT TAAATCCTAG GATCTCTCAT ACCTTTAACA TAAATGACAA 7560
TAGGAAGTCA TGTTCTCTAG CACTCCTAAA TACAGATGTA TATCAACTGT GTTCAACTCC 7620
CAAAGTTGAT GAAAGATCAG ATTATGCATC ATCAGGCATA GAAGATATTG TACTTGATAT 7680
TGTCAATTAT GATGGTTCAA TCTCAACAAC AAGATTTAAG AATAATAACA TAAGCTTTGA 7740
TCAACCATAT GCTGCACTAT ACCCATCTGT TGGACCAGGG ATATACTACA AAGGCAAAAT 7800
AATATTTCTC GGGTATGGAG GTCTTGAACA TCCAATAAAT GAGAATGTAA TCTGCAACAC 7860
AACTGGGTGC CCCGGGAAAA CACAGAGAGA CTGTAATCAA GCGTCTCATA GTCCATGGTT 7920
TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTTGTTGAC AAAGGCTTAA ACTCAATTCC 7980
AAAATTGAAA GTATGGACGA TATCTATGCG ACAAAATTAC TGGGGGTCAG AAGGAAGGTT 8040
ACTTCTACTA GGTAACAAGA TCTATATATA TACAAGATCT ACAAGTTGGC ATAGCAAGTT 8100
ACAATTAGGA ATAATTGATA TTACTGATTA CAGTGATATA AGGATAAAAT GGACATGGCA 8160
TAATGTGCTA TCAAGACCAG GAAACAATGA ATGTCCATGG GGACATTCAT GTCCAGATGG 8220
ATGTATAACA GGAGTATATA CTGATGCATA TCCACTCAAT CCCACAGGGA GCATTGTGTC 8280
ATCTGTCATA TTAGACTCAC AAAAATCGAG AGTGAACCCA GTCATAACTT ACTCAACAGC 8340
AACCGAAAGA GTAAACGAGC TGGCCATCCT AAACAGAACA CTCTCAGCTG GATATACAAC 8400
AACAAGCTGC ATTACACACT ATAACAAAGG ATATTGTTTT CATATAGTAG AAATAAATCA 8460
TAAAAGCTTA AACACATTTC AACCCATGTT GTTCAAAACA GAGATTCCAA AAAGCTGCAG 8520
TTAATCATAA TTAACCATAA TATGCATCAA TCTATCTATA ATACAAGTAT ATGATAAGTA 8580
ATCAGCAATC AGACAATAGA CAAAAGGGAA ATATAAAAAA CTTAGGAGCA AAGCGTGCTC 8640
GGGAAATGGA CACTGAATCT AACAATGGCA CTGTATCTGA CATACTCTAT CCTGAGTGTC 8700
ACCTTAACTC TCCTATCGTT AAAGGTAAAA TAGCACAATT ACACACTATT ATGAGTCTAC 8760
CTCAGCCTTA TGATATGGAT GACGACTCAA TACTAGTTAT CACTAGACAG AAAATAAAAC 8820
TTAATAAATT GGATAAAAGA CAACGATCTA TTAGAAGATT AAAATTAATA TTAACTGAAA 8880
AAGTGAATGA CTTAGGAAAA TACACATTTA TCAGATATCC AGAAATGTCA AAAGAAATGT 8940
TCAAATTATA TATACCTGGT ATTAACAGTA AAGTGACTGA ATTATTACTT AAAGCAGATA 9000
GAACATATAG TCAAATGACT GATGGATTAA GAGATCTATG GATTAATGTG CTATCAAAAT 9060
TAGCCTCAAA AAATGATGGA AGCAATTATG ATCTTAATGA AGAAATTAAT AATATATCGA 9120
AAGTTCACAC AACCTATAAA TCAGATAAAT GGTATAATCC ATTCAAAACA TGGTTTACTA 9180
TCAAGTATGA TATGAGAAGA TTACAAAAAG CTCGAAATGA GATCACTTTT AATGTTGGGA 9240
AGGATTATAA CTTGTTAGAA GACCAGAAGA ATTTCTTATT GATACATCCA GAATTGGTTT 9300
TGATATTAGA TAAACAAAAC TATAATGGTT ATCTAATTAC TCCTGAATTA GTATTGATGT 9360
ATTGTGACGT AGTCGAAGGC CGATGGAATA TAAGTGCATG TGCTAAGTTA GATCCAAAAT 9420
TACAATCTAT GTATCAGAAA GGTAATAACC TGTGGGAAGT GATAGATAAA TTGTTTCCAA 9480
TTATGGGAGA AAAGACATTT GATGTGATAT CGTTATTAGA ACCACTTGCA TTATCCTTAA 9540
TTCAAACTCA TGATCCTGTT AAACAACTAA GAGGAGCTTT TTTAAATCAT GTGTTATCCG 9600
AGATGGAATT AATATTTGAA TCTAGAGAAT CGATTAAGGA ATTTCTGAGT GTAGATTACA 9660
TTGATAAAAT TTTAGATATA TTTAATAAGT CTACAATAGA TGAAATAGCA GAGATTTTCT 9720
CTTTTTTTAG AACATTTGGG CATCCTCCAT TAGAAGCTAG TATTGCAGCA GAAAAGGTTA 9780
GAAAATATAT GTATATTGGA AAACAATTAA AATTTGACAC TATTAATAAA TGTCATGCTA 9840
TCTTCTGTAC AATAATAATT AACGGATATA GAGAGAGGCA TGGTGGACAG TGGCCTCCTG 9900
TGACATTACC TGATCATGCA CACGAATTCA TCATAAATGC TTACGGTTCA AACTCTGCGA 9960
TATCATATGA AAATGCTGTT GATTATTACC AGAGCTTTAT AGGAATAAAA TTCAATAAAT 10020
TCATAGAGCC TCAGTTAGAT GAGGATTTGA CAATTTATAT GAAAGATAAA GCATTATCTC 10080
CAAAAAAATC AAATTGGGAC ACAGTTTATC CTGCATCTAA TTTACTGTAC CGTACTAACG 10140
CATCCAACGA ATCACGAAGA TTAGTTGAAG TATTTATAGC AGATAGTAAA TTTGATCCTC 10200
ATCAGATATT GGATTATGTA GAATCTGGGG ACTGGTTAGA TGATCCAGAA TTTAATATTT 10260
CTTATAGTCT TAAAGAAAAA GAGATCAAAC AGGAAGGTAG ACTCTTTGCA AAAATGACAT 10320
ACAAAATGAG AGCTACACAA GTTTTATCAG AGACACTACT TGCAAATAAC ATAGGAAAAT 10380
TCTTTCAAGA AAATGGGATG GTGAAGGGAG AGATTGAATT ACTTAAGAGA TTAACAACCA 10440
TATCAATATC AGGAGTTCCA CGGTATAATG AAGTGTACAA TAATTCTAAA AGCCATACAG 10500
ATGACCTTAA AACCTACAAT AAAATAAGTA ATCTTAATTT GTCTTCTAAT CAGAAATCAA 10560
AGAAATTTGA ATTCAAGTCA ACGGATATCT ACAATGATGG ATACGAGACT GTGAGCTGTT 10620
TCCTAACAAC AGATCTCAAA AAATACTGTC TTAATTGGAG ATATGAATCA ACAGCTCTAT 10680
TTGGAGAAAC TTGCAACCAA ATATTTGGAT TAAATAAATT GTTTAATTGG TTACACCCTC 10740
GTCTTGAAGG AAGTACAATC TATGTAGGTG ATCCTTACTG TCCTCCATCA GATAAAGAAC 10800
ATATATCATT AGAGGATCAC CCTGATTCTG GTTTTTACGT TCATAACCCA AGAGGGGGTA 10860
TAGAAGGATT TTGTCAAAAA TTATGGACAC TCATATCTAT AAGTGCAATA CATCTAGCAG 10920
CTGTTAGAAT AGGCGTGAGG GTGACTGCAA TGGTTCAAGG AGACAATCAA GCTATAGCTG 10980
TAACCACAAG AGTACCCAAC AATTATGACT ACAGAGTTAA GAAGGAGATA GTTTATAAAG 11040
ATGTAGTGAG ATTTTTTGAT TCATTAAGAG AAGTGATGGA TGATCTAGGT CATGAACTTA 11100
AATTAAATGA AACGATTATA AGTAGCAAGA TGTTCATATA TAGCAAAAGA ATCTATTATG 11160
ATGGGAGAAT TCTTCCTCAA GCTCTAAAAG CATTATCTAG ATGTGTCTTC TGGTCAGAGA 11220
CAGTAATAGA CGAAACAAGA TCAGCATCTT CAAATTTGGC AACATCATTT GCAAAAGCAA 11280
TTGAGAATGG TTATTCACCT GTTCTAGGAT ATGCATGCTC AATTTTTAAG AACATTCAAC 11340
AACTATATAT TGCCCTTGGG ATGAATATCA ATCCAACTAT AACACAGAAT ATCAGAGATC 11400
AGTATTTTAG GAATCCAAAT TGGATGCAAT ATGCCTCTTT AATACCTGCT AGTGTTGGGG 11460
GATTCAATTA CATGGCCATG TCAAGATGTT TTGTAAGGAA TATTGGTGAT CCATCAGTTG 11520
CCGCATTGGC TGATATTAAA AGATTTATTA AGGCGAATCT ATTAGACCGA AGTGTTCTTT 11580
ATAGGATTAT GAATCAAGAA CCAGGTGAGT CATCTTTTTT GGACTGGGCT TCAGATCCAT 11640
ATTCATGCAA TTTACCACAA TCTCAAAATA TAACCACCAT GATAAAAAAT ATAACAGCAA 11700
GGAATGTATT ACAAGATTCA CCAAATCCAT TATTATCTGG ATTATTCACA AATACAATGA 11760
TAGAAGAAGA TGAAGAATTA GCTGAGTTCC TGATGGACAG GAAGGTAATT CTCCCTAGAG 11820
TTGCACATGA TATTCTAGAT AATTCTCTCA CAGGAATTAG AAATGCCATA GCTGGAATGT 11880
TAGATACGAC AAAATCACTA ATTCGGGTTG GCATAAATAG AGGAGGACTG ACATATAGTT 11940
TGTTGAGGAA AATCAGTAAT TACGATCTAG TACAATATGA AACACTAAGT AGGACTTTGC 12000
GACTAATTGT AAGTGATAAA ATCAAGTATG AAGATATGTG TTCGGTAGAC CTTGCCATAG 12060
CATTGCGACA AAAGATGTGG ATTCATTTAT CAGGAGGAAG GATGATAAGT GGACTTGAAA 12120
CGCCTGACCC ATTAGAATTA CTATCTGGGG TAGTAATAAC AGGATCAGAA CATTGTAAAA 12180
TATGTTATTC TTCAGATGGC ACAAACCCAT ATACTTGGAT GTATTTACCC GGTAATATCA 12240
AAATAGGATC AGCAGAAACA GGTATATCGT CATTAAGAGT TCCTTATTTT GGATCAGTCA 12300
CTGATGAAAG ATCTGAAGCA CAATTAGGAT ATATCAAGAA TCTTAGTAAA CCTGCAAAAG 12360
CCGCAATAAG AATAGCAATG ATATATACAT GGGCATTTGG TAATGATGAG ATATCTTGGA 12420
TGGAAGCCTC ACAGATAGCA CAAACACGTG CAAATTTTAC ACTAGATAGT CTCAAAATTT 12480
TAACACCGGT AGCTACATCA ACAAATTTAT CACACAGATT AAAGGATACT GCAACTCAGA 12540
TGAAATTCTC CAGTACATCA TTGATCAGAG TCAGCAGATT CATAACAATG TCCAATGATA 12600
ACATGTCTAT CAAAGAAGCT AATGAAACCA AAGATACTAA TCTTATTTAT CAACAAATAA 12660
TGTTAACAGG ATTAAGTGTT TTCGAATATT TATTTAGATT AAAAGAAACC ACAGGACACA 12720
ACCCTATAGT TATGCATCTG CACATAGAAG ATGAGTGTTG TATTAAAGAA AGTTTTAATG 12780
ATGAACATAT TAATCCAGAG TCTACATTAG AATTAATTCG ATATCCTGAA AGTAATGAAT 12840
TTATTTATGA TAAAGACCCA CTCAAAGATG TGGACTTATC AAAACTTATG GTTATTAAAG 12900
ACCATTCTTA CACAATTGAT ATGAATTATT GGGATGATAC TGACATCATA CATGCAATTT 12960
CAATATGTAC TGCAATTACA ATAGCAGATA CTATGTCACA ATTAGATCGA GATAATTTAA 13020
AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG CTTAATCACT GAATTTTTGA 13080
CTCTTGACAT ACTTGTATTT CTCAAGACAT TTGGTGGATT ATTAGTAAAT CAATTTGCAT 13140
ACACTCTTTA TAGTCTAAAA ATAGAAGGTA GGGATCTCAT TTGGGATTAT ATAATGAGAA 13200
CACTGAGAGA TACTTCCCAT TCAATATTAA AAGTATTATC TAATGCATTA TCTCATCCTA 13260
AAGTATTCAA GAGGTTCTGG GATTGTGGAG TTTTAAACCC TATTTATGGT CCTAATACTG 13320
CTAGTCAAGA CCAGATAAAA CTTGCCCTAT CTATATGTGA ATATTCACTA GATCTATTTA 13380
TGAGAGAATG GTTGAATGGT GTATCACTTG AAATATACAT TTGTGACAGC GATATGGAAG 13440
TTGCAAATGA TAGGAAACAA GCCTTTATTT CTAGACACCT TTCATTTGTT TGTTGTTTAG 13500
CAGAAATTGC ATCTTTCGGA CCTAACCTGT TAAACTTAAC ATACTTGGAG AGACTTGATC 13560
TATTGAAACA ATATCTTGAA TTAAATATTA AAGAAGACCC TACTCTTAAA TATGTACAAA 13620
TATCTGGATT ATTAATTAAA TCGTTCCCAT CAACTGTAAC ATACGTAAGA AAGACTGCAA 13680
TCAAATATCT AAGGATTCGC GGTATTAGTC CACCTGAGGT AATTGATGAT TGGGATCCGG 13740
TAGAAGATGA AAATATGCTG GATAACATTG TCAAAACTAT AAATGATAAC TGTAATAAAG 13800
ATAATAAAGG GAATAAAATT AACAATTTCT GGGGACTAGC ACTTAAGAAC TATCAAGTCC 13860
TTAAAATCAG ATCTATAACA AGTGATTCTG ATGATAATGA TAGACTAGAT GCTAATACAA 13920
GTGGTTTGAC ACTTCCTCAA GGAGGGAATT ATCTATCGCA TCAATTGAGA TTATTCGGAA 13980
TCAACAGCAC TAGTTGTCTG AAAGCTCTTG AGTTATCACA AATTTTAATG AAGGAAGTCA 14040
ATAAAGACAA GGACAGGCTC TTCCTGGGAG AAGGAGCAGG AGCTATGCTA GCATGTTATG 14100
ATGCCACATT AGGACCTGCA GTTAATTATT ATAATTCAGG TTTGAATATA ACAGATGTAA 14160
TTGGTCAACG AGAATTGAAA ATATTTCCTT CAGAGGTATC ATTAGTAGGT AAAAAATTAG 14220
GAAATGTGAC ACAGATTCTT AACAGGGTAA AAGTACTGTT CAATGGGAAT CCTAATTCAA 14280
CATGGATAGG AAATATGGAA TGTGAGAGCT TAATATGGAG TGAATTAAAT GATAAGTCCA 14340
TTGGATTAGT ACATTGTGAT ATGGAAGGAG CTATCGGTAA ATCAGAAGAA ACTGTTCTAC 14400
ATGAACATTA TAGTGTTATA AGAATTACAT ACTTGATTGG GGATGATGAT GTTGTTTTAG 14460
TTTCCAAAAT TATACCTACA ATCACTCCGA ATTGGTCTAG AATACTTTAT CTATATAAAT 14520
TATATTGGAA AGATGTAAGT ATAATATCAC TCAAAACTTC TAATCCTGCA TCAACAGAAT 14580
TATATCTAAT TTCGAAAGAT GCATATTGTA CTATAATGGA ACCTAGTGAA ATTGTTTTAT 14640
CAAAACTTAA AAGATTGTCA CTCTTGGAAG AAAATAATCT ATTAAAATGG ATCATTTTAT 14700
CAAAGAAGAG GAATAATGAA TGGTTACATC ATGAAATCAA AGAAGGAGAA AGAGATTATG 14760
GAATCATGAG ACCATATCAT ATGGCACTAC AAATCTTTGG ATTTCAAATC AATTTAAATC 14820
ATCTGGCGAA AGAATTTTTA TCAACCCCAG ATCTGACTAA TATCAACAAT ATAATCCAAA 14880
GTTTTCAGCG AACAATAAAG GATGTTTTAT TTGAATGGAT TAATATAACT CATGATGATA 14940
AGAGACATAA ATTAGGCGGA AGATATAACA TATTCCCACT GAAAAATAAG GGAAAGTTAA 15000
GACTGCTATC GAGAAGACTA GTATTAAGTT GGATTTCATT ATCATTATCG ACTCGATTAC 15060
TTACAGGTCG CTTTCCTGAT GAAAAATTTG AACATAGAGC ACAGACTGGA TATGTATCAT 15120
TAGCTGATAC TGATTTAGAA TCATTAAAGT TATTGTCGAA AAACATCATT AAGAATTACA 15180
GAGAGTGTAT AGGATCAATA TCATATTGGT TTCTAACCAA AGAAGTTAAA ATACTTATGA 15240
AATTGATTGG TGGTGCTAAA TTATTAGGAA TTCCCAGACA ATATAAAGAA CCCGAAGACC 15300
AGTTATTAGA AAACTACAAT CAACATGATG AATTTGATAT CGATTAAAAC ATAAATACAA 15360
TGAAGATATA TCCTAACCTT TATCTTTAAG CCTAGGAATA GACAAAAAGT AAGAAAAACA 15420
TGTAATATAT ATATACCAAA CAGAGTTCTT CTCTTGTTTG GT 15462
(2) Information for SEQ ID NO:18:
...
(i) sequence characteristics:
(A) length: 2233 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:18:
Met Asp Thr Glu Ser Asn Asn Gly Thr Val Ser Asp Ile Leu Tyr Pro
1               5                   10                  15
Glu Cys His Leu Asn Ser Pro Ile Val Lys Gly Lys Ile Ala Gln Leu
            20                  25                  30
His Thr Ile Met Ser Leu Pro Gln Pro Tyr Asp Met Asp Asp Asp Ser
        35                  40                  45
Ile Leu Val Ile Thr Arg Gln Lys Ile Lys Leu Asn Lys Leu Asp Lys
    50                  55                  60
Arg Gln Arg Ser Ile Arg Arg Leu Lys Leu Ile Leu Thr Glu Lys Val
65                  70                  75                  80
Asn Asp Leu Gly Lys Tyr Thr Phe Ile Arg Tyr Pro Glu Met Ser Lys
                85                  90                  95
Glu Met Phe Lys Leu Tyr Ile Pro Gly Ile Asn Ser Lys Val Thr Glu
            100                 105                 110
Leu Leu Leu Lys Ala Asp Arg Thr Tyr Ser Gln Met Thr Asp Gly Leu
        115                 120                 125
Arg Asp Leu Trp Ile Asn Val Leu Ser Lys Leu Ala Ser Lys Asn Asp
    130                 135                 140
Gly Ser Asn Tyr Asp Leu Asn Glu Glu Ile Asn Asn Ile Ser Lys Val
145                 150                 155                 160
His Thr Thr Tyr Lys Ser Asp Lys Trp Tyr Asn Pro Phe Lys Thr Trp
                165                 170                 175
Phe Thr Ile Lys Tyr Asp Met Arg Arg Leu Gln Lys Ala Arg Asn Glu
            180                 185                 190
Ile Thr Phe Asn Val Gly Lys Asp Tyr Asn Leu Leu Glu Asp Gln Lys
        195                 200                 205
Asn Phe Leu Leu Ile His Pro Glu Leu Val Leu Ile Leu Asp Lys Gln
    210                 215                 220
Asn Tyr Asn Gly Tyr Leu Ile Thr Pro Glu Leu Val Leu Met Tyr Cys
225                 230                 235                 240
Asp Val Val Glu Gly Arg Trp Asn Ile Ser Ala Cys Ala Lys Leu Asp
                245                 250                 255
Pro Lys Leu Gln Ser Met Tyr Gln Lys Gly Asn Asn Leu Trp Glu Val
            260                 265                 270
Ile Asp Lys Leu Phe Pro Ile Met Gly Glu Lys Thr Phe Asp Val Ile
        275                 280                 285
Ser Leu Leu Glu Pro Leu Ala Leu Ser Leu Ile Gln Thr His Asp Pro
    290                 295                 300
Val Lys Gln Leu Arg Gly Ala Phe Leu Asn His Val Leu Ser Glu Met
305                 310                 315                 320
Glu Leu Ile Phe Glu Ser Arg Glu Ser Ile Lys Glu Phe Leu Ser Val
                325                 330                 335
Asp Tyr Ile Asp Lys Ile Leu Asp Ile Phe Asn Lys Ser Thr Ile Asp
            340                 345                 350
Glu Ile Ala Glu Ile Phe Ser Phe Phe Arg Thr Phe Gly His Pro Pro
        355                 360                 365
Leu Glu Ala Ser Ile Ala Ala Glu Lys Val Arg Lys Tyr Met Tyr Ile
    370                 375                 380
Gly Lys Gln Leu Lys Phe Asp Thr Ile Asn Lys Cys His Ala Ile Phe
385                 390                 395                 400
Cys Thr Ile Ile Ile Asn Gly Tyr Arg Glu Arg His Gly Gly Gln Trp
                405                 410                 415
Pro Pro Val Thr Leu Pro Asp His Ala His Glu Phe Ile Ile Asn Ala
            420                 425                 430
Tyr Gly Ser Asn Ser Ala Ile Ser Tyr Glu Asn Ala Val Asp Tyr Tyr
        435                 440                 445
Gln Ser Phe Ile Gly Ile Lys Phe Asn Lys Phe Ile Glu Pro Gln Leu
    450                 455                 460
Asp Glu Asp Leu Thr Ile Tyr Met Lys Asp Lys Ala Leu Ser Pro Lys
465                 470                 475                 480
Lys Ser Asn Trp Asp Thr Val Tyr Pro Ala Ser Asn Leu Leu Tyr Arg
                485                 490                 495
Thr Asn Ala Ser Asn Glu Ser Arg Arg Leu Val Glu Val Phe Ile Ala
            500                 505                 510
Asp Ser Lys Phe Asp Pro His Gln Ile Leu Asp Tyr Val Glu Ser Gly
        515                 520                 525
Asp Trp Leu Asp Asp Pro Glu Phe Asn Ile Ser Tyr Ser Leu Lys Glu
    530                 535                 540
Lys Glu Ile Lys Gln Glu Gly Arg Leu Phe Ala Lys Met Thr Tyr Lys
545                 550                 555                 560
Met Arg Ala Thr Gln Val Leu Ser Glu Thr Leu Leu Ala Asn Asn Ile
                565                 570                 575
Gly Lys Phe Phe Gln Glu Asn Gly Met Val Lys Gly Glu Ile Glu Leu
            580                 585                 590
Leu Lys Arg Leu Thr Thr Ile Ser Ile Ser Gly Val Pro Arg Tyr Asn
        595                 600                 605
Glu Val Tyr Asn Asn Ser Lys Ser His Thr Asp Asp Leu Lys Thr Tyr
    610                 615                 620
Asn Lys Ile Ser Asn Leu Asn Leu Ser Ser Asn Gln Lys Ser Lys Lys
625                 630                 635                 640
Phe Glu Phe Lys Ser Thr Asp Ile Tyr Asn Asp Gly Tyr Glu Thr Val
                645                 650                 655
Ser Cys Phe Leu Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
            660                 665                 670
Tyr Glu Ser Thr Ala Leu Phe Gly Glu Thr Cys Asn Gln Ile Phe Gly
        675                 680                 685
Leu Asn Lys Leu Phe Asn Trp Leu His Pro Arg Leu Glu Gly Ser Thr
    690                 695                 700
Ile Tyr Val Gly Asp Pro Tyr Cys Pro Pro Ser Asp Lys Glu His Ile
705                 710                 715                 720
Ser Leu Glu Asp His Pro Asp Ser Gly Phe Tyr Val His Asn Pro Arg
                725                 730                 735
Gly Gly Ile Glu Gly Phe Cys Gln Lys Leu Trp Thr Leu Ile Ser Ile
            740                 745                 750
Ser Ala Ile His Leu Ala Ala Val Arg Ile Gly Val Arg Val Thr Ala
        755                 760                 765
Met Val Gln Gly Asp Asn Gln Ala Ile Ala Val Thr Thr Arg Val Pro
    770                 775                 780
Asn Asn Tyr Asp Tyr Arg Val Lys Lys Glu Ile Val Tyr Lys Asp Val
785                 790                 795                 800
Val Arg Phe Phe Asp Ser Leu Arg Glu Val Met Asp Asp Leu Gly His
                805                 810                 815
Glu Leu Lys Leu Asn Glu Thr Ile Ile Ser Ser Lys Met Phe Ile Tyr
            820                 825                 830
Ser Lys Arg Ile Tyr Tyr Asp Gly Arg Ile Leu Pro Gln Ala Leu Lys
        835                 840                 845
Ala Leu Ser Arg Cys Val Phe Trp Ser Glu Thr Val Ile Asp Glu Thr
    850                 855                 860
Arg Ser Ala Ser Ser Asn Leu Ala Thr Ser Phe Ala Lys Ala Ile Glu
865                 870                 875                 880
Asn Gly Tyr Ser Pro Val Leu Gly Tyr Ala Cys Ser Ile Phe Lys Asn
                885                 890                 895
Ile Gln Gln Leu Tyr Ile Ala Leu Gly Met Asn Ile Asn Pro Thr Ile
            900                 905                 910
Thr Gln Asn Ile Arg Asp Gln Tyr Phe Arg Asn Pro Asn Trp Met Gln
        915                 920                 925
Tyr Ala Ser Leu Ile Pro Ala Ser Val Gly Gly Phe Asn Tyr Met Ala
    930                 935                 940
Met Ser Arg Cys Phe Val Arg Asn Ile Gly Asp Pro Ser Val Ala Ala
945                 950                 955                 960
Leu Ala Asp Ile Lys Arg Phe Ile Lys Ala Asn Leu Leu Asp Arg Ser
                965                 970                 975
Val Leu Tyr Arg Ile Met Asn Gln Glu Pro Gly Glu Ser Ser Phe Leu
            980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Cys Asn Leu Pro Gln Ser Gln Asn
        995                 1000                1005
Ile Thr Thr Met Ile Lys Asn Ile Thr Ala Arg Asn Val Leu Gln Asp
    1010                1015                1020
Ser Pro Asn Pro Leu Leu Ser Gly Leu Phe Thr Asn Thr Met Ile Glu
1025                1030                1035                1040
Glu Asp Glu Glu Leu Ala Glu Phe Leu Met Asp Arg Lys Val Ile Leu
                1045                1050                1055
Pro Arg Val Ala His Asp Ile Leu Asp Asn Ser Leu Thr Gly Ile Arg
            1060                1065                1070
Asn Ala Ile Ala Gly Met Leu Asp Thr Thr Lys Ser Leu Ile Arg Val
1075????????????????1080????????????????1085Gly?Ile?Asn?Arg?Gly?Gly?Leu?Thr?Tyr?Ser?Leu?Leu?Arg?Lys?Ile?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Leu?Val?Gln?Tyr?Glu?Thr?Leu?Ser?Arg?Thr?Leu?Arg?Leu1105????????????????1110????????????????1115????????????????1120Ile?Val?Ser?Asp?Lys?Ile?Lys?Tyr?Glu?Asp?Met?Cys?Ser?Val?Asp?Leu
                1125                1130                1135
Ala Ile Ala Leu Arg Gln Lys Met Trp Ile His Leu Ser Gly Gly Arg
            1140                1145                1150
Met Ile Ser Gly Leu Glu Thr Pro Asp Pro Leu Glu Leu Leu Ser Gly
        1155                1160                1165
Val Val Ile Thr Gly Ser Glu His Cys Lys Ile Cys Tyr Ser Ser Asp
    1170                1175                1180
Gly Thr Asn Pro Tyr Thr Trp Met Tyr Leu Pro Gly Asn Ile Lys Ile
1185                1190                1195                1200
Gly Ser Ala Glu Thr Gly Ile Ser Ser Leu Arg Val Pro Tyr Phe Gly
                1205                1210                1215
Ser Val Thr Asp Glu Arg Ser Glu Ala Gln Leu Gly Tyr Ile Lys Asn
            1220                1225                1230
Leu Ser Lys Pro Ala Lys Ala Ala Ile Arg Ile Ala Met Ile Tyr Thr
        1235                1240                1245
Trp Ala Phe Gly Asn Asp Glu Ile Ser Trp Met Glu Ala Ser Gln Ile
    1250                1255                1260
Ala Gln Thr Arg Ala Asn Phe Thr Leu Asp Ser Leu Lys Ile Leu Thr
1265                1270                1275                1280
Pro Val Ala Thr Ser Thr Asn Leu Ser His Arg Leu Lys Asp Thr Ala
                1285                1290                1295
Thr Gln Met Lys Phe Ser Ser Thr Ser Leu Ile Arg Val Ser Arg Phe
            1300                1305                1310
Ile Thr Met Ser Asn Asp Asn Met Ser Ile Lys Glu Ala Asn Glu Thr
        1315                1320                1325
Lys Asp Thr Asn Leu Ile Tyr Gln Gln Ile Met Leu Thr Gly Leu Ser
    1330                1335                1340
Val Phe Glu Tyr Leu Phe Arg Leu Lys Glu Thr Thr Gly His Asn Pro
1345                1350                1355                1360
Ile Val Met His Leu His Ile Glu Asp Glu Cys Cys Ile Lys Glu Ser
                1365                1370                1375
Phe Asn Asp Glu His Ile Asn Pro Glu Ser Thr Leu Glu Leu Ile Arg
            1380                1385                1390
Tyr Pro Glu Ser Asn Glu Phe Ile Tyr Asp Lys Asp Pro Leu Lys Asp
        1395                1400                1405
Val Asp Leu Ser Lys Leu Met Val Ile Lys Asp His Ser Tyr Thr Ile
    1410                1415                1420
Asp Met Asn Tyr Trp Asp Asp Thr Asp Ile Ile His Ala Ile Ser Ile
1425                1430                1435                1440
Cys Thr Ala Ile Thr Ile Ala Asp Thr Met Ser Gln Leu Asp Arg Asp
                1445                1450                1455
Asn Leu Lys Glu Ile Ile Val Ile Ala Asn Asp Asp Asp Ile Asn Ser
            1460                1465                1470
Leu Ile Thr Glu Phe Leu Thr Leu Asp Ile Leu Val Phe Leu Lys Thr
        1475                1480                1485
Phe Gly Gly Leu Leu Val Asn Gln Phe Ala Tyr Thr Leu Tyr Ser Leu
    1490                1495                1500
Lys Ile Glu Gly Arg Asp Leu Ile Trp Asp Tyr Ile Met Arg Thr Leu
1505                1510                1515                1520
Arg Asp Thr Ser His Ser Ile Leu Lys Val Leu Ser Asn Ala Leu Ser
                1525                1530                1535
His Pro Lys Val Phe Lys Arg Phe Trp Asp Cys Gly Val Leu Asn Pro
            1540                1545                1550
Ile Tyr Gly Pro Asn Thr Ala Ser Gln Asp Gln Ile Lys Leu Ala Leu
        1555                1560                1565
Ser Ile Cys Glu Tyr Ser Leu Asp Leu Phe Met Arg Glu Trp Leu Asn
    1570                1575                1580
Gly Val Ser Leu Glu Ile Tyr Ile Cys Asp Ser Asp Met Glu Val Ala
1585                1590                1595                1600
Asn Asp Arg Lys Gln Ala Phe Ile Ser Arg His Leu Ser Phe Val Cys
                1605                1610                1615
Cys Leu Ala Glu Ile Ala Ser Phe Gly Pro Asn Leu Leu Asn Leu Thr
            1620                1625                1630
Tyr Leu Glu Arg Leu Asp Leu Leu Lys Gln Tyr Leu Glu Leu Asn Ile
        1635                1640                1645
Lys Glu Asp Pro Thr Leu Lys Tyr Val Gln Ile Ser Gly Leu Leu Ile
    1650                1655                1660
Lys Ser Phe Pro Ser Thr Val Thr Tyr Val Arg Lys Thr Ala Ile Lys
1665                1670                1675                1680
Tyr Leu Arg Ile Arg Gly Ile Ser Pro Pro Glu Val Ile Asp Asp Trp
                1685                1690                1695
Asp Pro Val Glu Asp Glu Asn Met Leu Asp Asn Ile Val Lys Thr Ile
            1700                1705                1710
Asn Asp Asn Cys Asn Lys Asp Asn Lys Gly Asn Lys Ile Asn Asn Phe
        1715                1720                1725
Trp Gly Leu Ala Leu Lys Asn Tyr Gln Val Leu Lys Ile Arg Ser Ile
    1730                1735                1740
Thr Ser Asp Ser Asp Asp Asn Asp Arg Leu Asp Ala Asn Thr Ser Gly
1745                1750                1755                1760
Leu Thr Leu Pro Gln Gly Gly Asn Tyr Leu Ser His Gln Leu Arg Leu
                1765                1770                1775
Phe Gly Ile Asn Ser Thr Ser Cys Leu Lys Ala Leu Glu Leu Ser Gln
            1780                1785                1790
Ile Leu Met Lys Glu Val Asn Lys Asp Lys Asp Arg Leu Phe Leu Gly
        1795                1800                1805
Glu Gly Ala Gly Ala Met Leu Ala Cys Tyr Asp Ala Thr Leu Gly Pro
    1810                1815                1820
Ala Val Asn Tyr Tyr Asn Ser Gly Leu Asn Ile Thr Asp Val Ile Gly
1825                1830                1835                1840
Gln Arg Glu Leu Lys Ile Phe Pro Ser Glu Val Ser Leu Val Gly Lys
                1845                1850                1855
Lys Leu Gly Asn Val Thr Gln Ile Leu Asn Arg Val Lys Val Leu Phe
            1860                1865                1870
Asn Gly Asn Pro Asn Ser Thr Trp Ile Gly Asn Met Glu Cys Glu Ser
        1875                1880                1885
Leu Ile Trp Ser Glu Leu Asn Asp Lys Ser Ile Gly Leu Val His Cys
    1890                1895                1900
Asp Met Glu Gly Ala Ile Gly Lys Ser Glu Glu Thr Val Leu His Glu
1905                1910                1915                1920
His Tyr Ser Val Ile Arg Ile Thr Tyr Leu Ile Gly Asp Asp Asp Val
                1925                1930                1935
Val Leu Val Ser Lys Ile Ile Pro Thr Ile Thr Pro Asn Trp Ser Arg
            1940                1945                1950
Ile Leu Tyr Leu Tyr Lys Leu Tyr Trp Lys Asp Val Ser Ile Ile Ser
        1955                1960                1965
Leu Lys Thr Ser Asn Pro Ala Ser Thr Glu Leu Tyr Leu Ile Ser Lys
    1970                1975                1980
Asp Ala Tyr Cys Thr Ile Met Glu Pro Ser Glu Ile Val Leu Ser Lys
1985                1990                1995                2000
Leu Lys Arg Leu Ser Leu Leu Glu Glu Asn Asn Leu Leu Lys Trp Ile
                2005                2010                2015
Ile Leu Ser Lys Lys Arg Asn Asn Glu Trp Leu His His Glu Ile Lys
            2020                2025                2030
Glu Gly Glu Arg Asp Tyr Gly Ile Met Arg Pro Tyr His Met Ala Leu
        2035                2040                2045
Gln Ile Phe Gly Phe Gln Ile Asn Leu Asn His Leu Ala Lys Glu Phe
    2050                2055                2060
Leu Ser Thr Pro Asp Leu Thr Asn Ile Asn Asn Ile Ile Gln Ser Phe
2065                2070                2075                2080
Gln Arg Thr Ile Lys Asp Val Leu Phe Glu Trp Ile Asn Ile Thr His
                2085                2090                2095
Asp Asp Lys Arg His Lys Leu Gly Gly Arg Tyr Asn Ile Phe Pro Leu
            2100                2105                2110
Lys Asn Lys Gly Lys Leu Arg Leu Leu Ser Arg Arg Leu Val Leu Ser
        2115                2120                2125
Trp Ile Ser Leu Ser Leu Ser Thr Arg Leu Leu Thr Gly Arg Phe Pro
    2130                2135                2140
Asp Glu Lys Phe Glu His Arg Ala Gln Thr Gly Tyr Val Ser Leu Ala
2145                2150                2155                2160
Asp Thr Asp Leu Glu Ser Leu Lys Leu Leu Ser Lys Asn Ile Ile Lys
                2165                2170                2175
Asn Tyr Arg Glu Cys Ile Gly Ser Ile Ser Tyr Trp Phe Leu Thr Lys
            2180                2185                2190
Glu Val Lys Ile Leu Met Lys Leu Ile Gly Gly Ala Lys Leu Leu Gly
        2195                2200                2205
Ile Pro Arg Gln Tyr Lys Glu Pro Glu Asp Gln Leu Leu Glu Asn Tyr
    2210                2215                2220
Asn Gln His Asp Glu Phe Asp Ile Asp
2225                2230
(2) Information for SEQ ID NO:19:
(i) sequence characteristics:
(A) length: 15462 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:19:
ACCAAACAAG AGAAGAAACT TGCTTGGTAA TATAAATTTA ACTTAAAATT AACTTAGGAT 60
TTAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120
TATTTGATAC ATTTAATGCA GGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180
TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240
ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGAAAC 300
AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT GCCAATCCAG 360
AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGCCAA GTATGTCATA TACATGATTG 420
AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA GAGATGATAT 480
ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG GAAACTATGT 540
TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT GGGTATCCAT 600
CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT ATCACTAGTA 660
TCTCAGGGTT AAGAAAAGGC TTTTTCACCC GATTGGAAGC TTTCAGACAA GATGGAACAG 720
TGCAGGCAGG GCTGGTATTG AGCGGTGACA CAGTGGATCA GATTGGGTCA ATCATGCGGT 780
CTCAACAGAG CTTGGTAACT CTTATGGTTG AAACATTAAT AACAATGAAT ACCAGCAGAA 840
ATGACCTCAC AACCATAGAA AAGAATATAC AAATTGTTGG CAACTACATA AGAGATGCAG 900
GTCTCGCTTC ATTCTTCAAT ACAATCAGAT ATGGAATTGA GACCAGAATG GCAGCTTTGA 960
CTCTATCCAC TCTCAGACCA GATATCAATA GATTAAAAGC TTTGATGGAA CTGTATTTAT 1020
CAAAGGGACC ACGCGCTCCT TTCATCTGTA TCCTCAGAGA TCCTATACAT GGTGAGTTCG 1080
CACCAGGCAA CTATCCTGCC ATATGGAGCT ATGCAATGGG GGTGGCAGTT GTACAAAATA 1140
GAGCCATGCA ACAGTATGTG ACGGGAAGAT CATATCTAGA CATTGATATG TTCCAGCTAG 1200
GACAAGCAGT AGCACGTGAT GCCGAAGCTC AAATGAGCTC AACACTGGAA GATGAACTTG 1260
GAGTGACACA CGAAGCTAAA GAAAGCTTGA AGAGACATAT AAGGAACATA AACAGTTCAG 1320
AGACATCTTT CCACAAACCG ACAGGTGGAT CAGCCATAGA GATGGCAATA GATGAAGAGC 1380
CAGAACAATT CGAACATAGA GCAGATCAAG AACAAAATGG AGAACCTCAA TCATCCATAA 1440
TTCAATATGC CTGGGCAGAA GGAAATAGAA GCGATGATCA GACTGAGCAA GCTACAGAAT 1500
CTGACAATAT CAAGACCGAA CAACAAAACA TCAGAGACAG ACTAAACAAG AGACTCAACG 1560
ACAAGAAGAA ACAAAGCAGT CAACCACCCA CTAATCCCAC AAACAGAACA AACCAGGACG 1620
AAATAGATGA TCTGTTTAAC GCATTTGGAA GCAACTAATC GAATCAACAT TTTAATCTAA 1680
ATCAATAATA AATAAGAAAA ACTTAGGATT AAAGAATCCT ATCATACCGG AATATAGGGT 1740
GGTAAATTTA GAGTCTGCTT GAAACTCAAT CAATAGAGAG TTGATGGAAA GCGATGCTAA 1800
AAACTATCAA ATCATGGATT CTTGGGAAGA GGAATCAAGA GATAAATCAA CTAATATCTC 1860
CTCGGCCCTC AACATCATTG AATTCATACT CAGCACCGAC CCCCAAGAAG ACTTATCGGA 1920
AAACGACACA ATCAACACAA GAACCCAGCA ACTCAGTGCC ACCATCTGTC AACCAGAAAT 1980
CAAACCAACA GAAACAAGTG AGAAAGATAG TGGATCAACT GACAAAAATA GACAGTCTGG 2040
GTCATCACAC GAATGTACAA CAGAAGCAAA AGATAGAAAC ATTGATCAGG AAACTGTACA 2100
GAGAGGACCT GGGAGAAGAA GCAGCTCAGA TAGTAGAGCT GAGACTGTGG TCTCTGGAGG 2160
AATCCCCAGA AGCATCACAG ATTCTAAAAA TGGAACCCAA AACACGGAGG ATATTGATCT 2220
CAATGAAATT AGAAAGATGG ATAAGGACTC TATTGAGGGG AAAATGCGAC AATCTGCAAA 2280
TGTTCCAAGC GAGATATCAG GAAGTGATGA CATATTTACA ACAGAACAAA GTAGAAACAG 2340
TGATCATGGA AGAAGCCTGG AATCTATCAG TACACCTGAT ACAAGATCAA TAAGTGTTGT 2400
TACTGCTGCA ACACCAGATG ATGAAGAAGA AATACTAATG AAAAATAGTA GGACAAAGAA 2460
AAGTTCTTCA ACACATCAAG AAGATGACAA AAGAATTAAA AAAGGGGGAA AAGGGAAAGA 2520
CTGGTTTAAG AAATCAAAAG ATACCGACAA CCAGATACCA ACATCAGACT ACAGATCCAC 2580
ATCAAAAGGG CAGAAGAAAA TCTCAAAGAC AACAACCACC AACACCGACA CAAAGGGGCA 2640
AACAGAAATA CAGACAGAAT CATCAGAAAC ACAATCCTCA TCATGGAATC TCATCATCGA 2700
CAACAACACC GACCGGAACG AACAGACAAG CACAACTCCT CCAACAACAA CTTCCAGATC 2760
AACTTATACA AAAGAATCGA TCCGAACAAA CTCTGAATCC AAACCCAAGA CACAAAAGAC 2820
AAATGGAAAG GAAAGGAAGG ATACAGAAGA GAGCAATCGA TTTACAGAGA GGGCAATTAC 2880
TCTATTGCAG AATCTTGGTG TAATTCAATC CACATCAAAA CTAGATTTAT ATCAAGACAA 2940
ACGAGTTGTA TGTGTAGCAA ATGTACTAAA CAATGTAGAT ACTGCATCAA AGATAGATTT 3000
CCTGGCAGGA TTAGTCATAG GGGTTTCAAT GGACAACGAC ACAAAATTAA CACAGATACA 3060
AAATGAAATG CTAAACCTCA AAGCAGATCT AAAGAAAATG GACGAATCAC ATAGAAGATT 3120
GATAGAAAAT CAAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 3180
TATGACTGAG AGAGGAGGAA AGAAAGACCA AAATGAATCC AATGAGAGAG TATCCATGAT 3240
CAAAACAAAA TTGAAAGAAG AAAAGATCAA GAAGACCAGG TTTGACCCAC TTATGGAGGC 3300
ACAAGGCATT GACAAGAATA TACCCGATCT ATATCGACAT GCAGGAGATA CACTAGAGAA 3360
CGATGTACAA GTTAAATCAG AGATATTAAG TTCATACAAT GAGTCAAATG CAACAAGACT 3420
AATACCCAAA AAAGTGAGCA GTACAATGAG ATCACTAGTT GCAGTCATCA ACAACAGCAA 3480
TCTCTCACAA AGCACAAAAC AATCATACAT AAACGAACTC AAACGTTGCA AAAATGATGA 3540
AGAAGTATCT GAATTAATGG ACATGTTCAA TGAAGATGTC AACAATTGCC AATGATCCAA 3600
CAAAGAAACG ACACCGAACA AACAGACAAG AAACAACAGT AGATCAAAAC CTGTCAACAC 3660
ACACAAAATC AAGCAGAATG AAACAACAGA TATCAATCAA TATACAAATA AGAAAAACTT 3720
AGGATTAAAG AATAAATTAA TCCTTGTCCA AAATGAGTAT AACTAACTCT GCAATATACA 3780
CATTCCCAGA ATCATCATTC TCTGAAAATG GTCATATAGA ACCATTACCA CTCAAAGTCA 3840
ATGAACAGAG GAAAGCAGTA CCCCACATTA GAGTTGCCAA GATCGGAAAT CCACCAAAAC 3900
ACGGATCCCG GTATTTAGAT GTCTTCTTAC TCGGCTTCTT CGAGATGGAA CGAATCAAAG 3960
ACAAATACGG GAGTGTGAAT GATCTCGACA GTGACCCGAG TTACAAAGTT TGTGGCTCTG 4020
GATCATTACC AATCGGATTG GCTAAGTACA CTGGGAATGA CCAGGAATTG TTACAAGCCG 4080
CAACCAAACT GGATATAGAA GTGAGAAGAA CAGTCAAAGC GAAAGAGATG GTTGTTTACA 4140
CGGTACAAAA TATAAAACCA GAACTGTACC CATGGTCCAA TAGACTAAGA AAAGGAATGC 4200
TGTTCGATGC CAACAAAGTT GCTCTTGCTC CTCAATGTCT TCCACTAGAT AGGAGCATAA 4260
AATTTAGAGT AATCTTCGTG AATTGTACGG CAATTGGATC AATAACCTTG TTCAAAATTC 4320
CTAAGTCAAT GGCATCACTA TCTCTAACCA ACACAATATC AATCAATCTG CAGGTACACA 4380
TAAAAACAGG GGTTCAGACT GATTCTAAAG GGATAGTTCA AATTTTGGAT GAGAAAGGCG 4440
AAAAATCACT GAATTTCATG GTCCATCTCG GATTGATCAA AAGAAAAGTA GGCAGAATGT 4500
ACTCTGTTGA ATACTGTAAA CAGAAAATCG AGAAAATGAG ATTGATATTT TCTTTAGGAC 4560
TAGTTGGAGG AATCAGTCTT CATGTCAATG CAACTGGGTC CATATCAAAA ACACTAGCAA 4620
GTCAGCTGGT ATTCAAAAGA GAGATTTGTT ATCCTTTAAT GGATCTAAAT CCGCATCTCA 4680
ATCTAGTTAT CTGGGCTTCA TCAGTAGAGA TTACAAGAGT GGATGCAATT TTCCAACCTT 4740
CTTTACCTGG CGAGTTCAGA TACTATCCTA ATATTATTGC AAAAGGAGTT GGGAAAATCA 4800
AACAATGGAA CTAGTAATCT CTATTTTAGT CCGGACGTAT CTATTAAGCC GAAGCAAATA 4860
AAGGATAATC AAAAACTTAG GACAAAAGAG GTCAATACCA ACAACTATTA GCAGTCACAC 4920
TCGCAAGAAT AAGAGAGAAG GGACCAAAAA AGTCAAATAG GAGAAATCAA AACAAAAGGT 4980
ACAGAACACC AGAACAACAA AATCAAAACA TCCAACTCAC TCAAAACAAA AATTCCAAAA 5040
GAGACCGGCA ACACAACAAG CACTGAACAC AATGCCAACT TCAATACTGC TAATTATTAC 5100
AACCATGATC ATGGCATCTT TCTGCCAAAT AGATATCACA AAACTACAGC ACGTAGGTGT 5160
ATTGGTCAAC AGTCCCAAAG GGATGAAGAT ATCACAAAAC TTTGAAACAA GATATCTAAT 5220
TTTGAGCCTC ATACCAAAAA TAGAAGACTC TAACTCTTGT GGTGACCAAC AGATCAAGCA 5280
ATACAAGAAG TTATTGGATA GACTGATCAT CCCTTTATAT GATGGATTAA GATTACAGAA 5340
AGATGTGATA GTAACCAATC AAGAATCCAA TGAAAACACT GATCCCAGAA CAAAACGATT 5400
CTTTGGAGGG GTAATTGGAA CCATTGCTCT GGGAGTAGCA ACCTCAGCAC AAATTACAGC 5460
GGCAGTTGCT CTGGTTGAAG CCAAGCAGGC AAGATCAGAC ATCGAAAAAC TCAAAGAAGC 5520
AATTAGGGAC ACAAATAAAG CAGTGCAGTC AGTTCAGAGC TCCATAGGAA ATTTAATAGT 5580
AGCAATTAAA TCAGTCCAGG ATTATGTTAA CAAAGAAATC GTGCCATCGA TTGCGAGGCT 5640
AGGTTGTGAA GCAGCAGGAC TTCAATTAGG AATTGCATTA ACACAGCATT ACTCAGAATT 5700
AACAAACATA TTTGGTGATA ACATAGGATC GTTACAAGAA AAAGGAATAA AATTACAAGG 5760
TATAGCATCA TTATACCGCA CAAATATCAC AGAAATATTC ACAACATCAA CAGTTGATAA 5820
ATATGATATC TATGATCTGT TATTTACAGA ATCAATAAAG GTGAGAGTTA TAGATGTTGA 5880
CTTGAATGAT TACTCAATCA CCCTCCAAGT CAGACTCCCT TTATTAACTA GGCTGCTGAA 5940
CACTCAGATC TACAAAGTAG ATTCCATATC ATATAACATC CAAAACAGAG AATGGTATAT 6000
CCCTCTTCCC AGCCATATCA TGACGAAAGG GGCATTTCTA GGTGGAGCAG ACGTCAAAGA 6060
ATGTATAGAA GCATTCAGCA GCTATATATG CCCTTCTGAT CCAGGATTTG TATTAAACCA 6120
TGAAATAGAG AGCTGCTTAT CAGGAAACAT ATCCCAATGT CCAAGAACAA CGGTCACATC 6180
AGACATTGTT CCAAGATATG CATTTGTCAA TGGAGGAGTG GTTGCAAACT GTATAACAAC 6240
CACCTGTACA TGCAACGGAA TTGGTAATAG AATCAATCAA CCACCTGATC AAGGAGTAAA 6300
AATTATAACA CATAAAGAAT GTAGTACAGT AGGTATCAAC GGAATGCTGT TCAATACAAA 6360
TAAAGAAGGA ACTCTTGCAT TCTATACACC AAATGATATA ACACTAAACA ATTCTGTTAC 6420
ACTTGATCCA ATTGACATAT CAATCGAGCT CAACAAGGCC AAATCAGATC TAGAAGAATC 6480
AAAAGAATGG ATAAGAAGGT CAAATCAAAA ACTAGATTCT ATTGGAAATT GGCATCAATC 6540
TAGCACTACA ATCATAATTA TTTTGATAAT GATCATTATA TTGTTTATAA TTAATATAAC 6600
GATAATTACA ATTGCAATTA AGTATTACAG AATTCAAAAG AGAAATCGAG TGGATCAAAA 6660
TGACAAGCCA TATGTACTAA CAAACAAATA ACATATCTAC AGATCATTAG ATATTAAAAT 6720
TATAAAAAAC TTAGGAGTAA AGTTACGCAA TCCAACTCTA CTCATATAAT TGAGGAAGGA 6780
CCCAATAGAC AAATCCAAAT TCGAGATGGA ATACTGGAAG CATACCAATC ACGGAAAGGA 6840
TGCTGGCAAT GAGCTGGAGA CGTCTATGGC TACTCATGGC AACAAGCTCA CTAATAAGAT 6900
AATATACATA TTATGGACAA TAATCCTGGT GTTATTATCA ATAGTCTTCA TCATAGTGCT 6960
AATTAATTCC ATCAAAAGTG AAAAGGCCCA CGAATCATTG CTGCAAGACA TAAATAATGA 7020
GTTTATGGAA ATTACAGAAA AGATCCAAAT GGCATCGGAT AATACCAATG ATCTAATACA 7080
GTCAGGAGTG AATACAAGGC TTCTTACAAT TCAGAGTCAT GTCCAGAATT ACATACCAAT 7140
ATCATTGACA CAACAGATGT CAGATCTTAG GAAATTCATT AGTGAAATTA CAATTAGAAA 7200
TGATAATCAA GAAGTGCTGC CACAAAGAAT AACACATGAT GTAGGTATAA AACCTTTAAA 7260
TCCAGATGAT TTTTGGAGAT GCACGTCTGG TCTTCCATCT TTAATGAAAA CTCCAAAAAT 7320
AAGGTTAATG CCAGGGCCGG GATTATTAGC TATGCCAACG ACTGTTGATG GCTGTGTTAG 7380
AACTCCGTCT TTAGTTATAA ATGATCTGAT TTATGCTTAT ACCTCAAATC TAATTACTCG 7440
AGGTTGTCAG GATATAGGAA AATCATATCA AGTCTTACAG ATAGGGATAA TAACTGTAAA 7500
CTCAGACTTG GTACCTGACT TAAATCCTAG GATCTCTCAT ACCTTTAACA TAAATGACAA 7560
TAGGAAGTCA TGTTCTCTAG CACTCCTAAA TACAGATGTA TATCAACTGT GTTCAACTCC 7620
CAAAGTTGAT GAAAGATCAG ATTATGCATC ATCAGGCATA GAAGATATTG TACTTGATAT 7680
TGTCAATTAT GATGGTTCAA TCTCAACAAC AAGATTTAAG AATAATAACA TAAGCTTTGA 7740
TCAACCATAT GCTGCACTAT ACCCATCTGT TGGACCAGGG ATATACTACA AAGGCAAAAT 7800
AATATTTCTC GGGTATGGAG GTCTTGAACA TCCAATAAAT GAGAATGTAA TCTGCAACAC 7860
AACTGGGTGC CCCGGGAAAA CACAGAGAGA CTGTAATCAA GCGTCTCATA GTCCATGGTT 7920
TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTTGCTGAC AAAGGCTTAA ACTCAATTCC 7980
AAAATTGAAA GTATGGACGA TATCTATGCG ACAAAATTAC TGGGGGTCAG AAGGAAGGTT 8040
ACTTCTACTA GGTAACAAGA TCTATATATA TACAAGATCT ACAAGTTGGC ATAGCAAGTT 8100
ACAATTAGGA ATAATTGATA TTACTGATTA CAGTGATATA AGGATAAAAT GGACATGGCA 8160
TAATGTGCTA TCAAGACCAG GAAACAATGA ATGTCCATGG GGACATTCAT GTCCAGATGG 8220
ATGTATAACA GGAGTATATA CTGATGCATA TCCACTCAAT CCCACAGGGA GCATTGTGTC 8280
ATCTGTCATA TTAGACTCAC AAAAATCGAG AGTGAACCCA GTCATAACTT ACTCAACAGC 8340
AACCGAAAGA GTAAACGAGC TGGCCATCCT AAACAGAACA CTCTCAGCTG GATATACAAC 8400
AACAAGCTGC ATTACACACT ATAACAAAGG ATATTGTTTT CATATAGTAG AAATAAATCA 8460
TAAAAGCTTA AACACATTTC AACCCATGTT GTTCAAAACA GAGATTCCAA AAAGCTGCAG 8520
TTAATCATAA TTAACCATAA TATGCATCAA TCTATCTATA ATACAAGTAT ATGATAAGTA 8580
ATCAGCAATC AGACAATAGA CAAAAGGGAA ATATAAAAAA CTTAGGAGCA AAGCGTGCTC 8640
GGGAAATGGA CACTGAATCT AACAATGGCA CTGTATCTGA CATACTCTAT CCTGAGTGTC 8700
ACCTTAACTC TCCTATCGTT AAAGGTAAAA TAGCACAATT ACACACTATT ATGAGTCTAC 8760
CTCAGCCTTA TGATATGGAT GACGACTCAA TACTAGTTAT CACTAGACAG AAAATAAAAC 8820
TTAATAAATT GGATAAAAGA CAACGATCTA TTAGAAGATT AAAATTAATA TTAACTGAAA 8880
AAGTGAATGA CTTAGGAAAA TACACATTTA TCAGATATCC AGAAATGTCA AAAGAAATGT 8940
TCAAATTATA TATACCTGGT ATTAACAGTA AAGTGACTGA ATTATTACTT AAAGCAGATA 9000
GAACATATAG TCAAATGACT GATGGATTAA GAGATCTATG GATTAATGTG CTATCAAAAT 9060
TAGCCTCAAA AAATGATGGA AGCAATTATG ATCTTAATGA AGAAATTAAT AATATATCGA 9120
AAGTTCACAC AACCTATAAA TCAGATAAAT GGTATAATCC ATTCAAAACA TGGTTTACTA 9180
TCAAGTATGA TATGAGAAGA TTACAAAAAG CTCGAAATGA GATCACTTTT AATGTTGGGA 9240
AGGATTATAA CTTGTTAGAA GACCAGAAGA ATTTCTTATT GATACATCCA GAATTGGTTT 9300
TGATATTAGA TAAACAAAAC TACAATGGTT ATCTAATTAC TCCTGAATTA GTATTGATGT 9360
ATTGTGACGT AGTCGAAGGC CGATGGAATA TAAGTGCATG TGCTAAGTTA GATCCAAAAT 9420
TACAATCTAT GTATCAGAAA GGTAATAACC TGTGGGAAGT GATAGATAAA TTGTTTCCAA 9480
TTATGGGAGA AAAGACATTT GATGTGATAT CGTTATTAGA ACCACTTGCA TTATCCTTAA 9540
TTCAAACTCA TGATCCTGTT AAACAACTAA GAGGAGCTTT TTTAAATCAT GTGTTATCCG 9600
AGATGGAATT AATATTTGAA TCTAGAGAAT CGATTAAGGA ATTTCTGAGT GTAGATTACA 9660
TTGATAAAAT TTTAGATATA TTTAATAAGT CTACAATAGA TGAAATAGCA GAGATTTTCT 9720
CTTTTTTTAG AACATTTGGG CATCCTCCAT TAGAAGCTAG TATTGCAGCA GAAAAGGTTA 9780
GAAAATATAT GTATATTGGA AAACAATTAA AATTTGACAC TATTAATAAA TGTCATGCTA 9840
TCTTCTGTAC AATAATAATT AACGGATATA GAGAGAGGCA TGGTGGACAG TGGCCTCCTG 9900
TGACATTACC TGATCATGCA CACGAATTCA TCATAAATGC TTACGGTTCA AACTCTGCGA 9960
TATCATATGA GAATGCTGTT GATTATTACC AGAGCTTTAT AGGAATAAAA TTCAATAAAT 10020
TCATAGAGCC TCAGTTAGAT GAGGATTTGA CAATTTATAT GAAAGATAAA GCATTATCTC 10080
CAAAAAAATC AAATTGGGAC ACAGTTTATC CTGCATCTAA TTTACTGTAC CGTACTAACG 10140
CATCCAACGA ATCACGAAGA TTAGTTGAAG TATTTATAGC AGATAGTAAA TTTGATCCTC 10200
ATCAGATATT GGATTATGTA GAATCTGGGG ACTGGTTAGA TGATCCAGAA TTTAATATTT 10260
CTTATAGTCT TAAAGAAAAA GAGATCAAAC AGGAAGGTAG ACTCTTTGCA AAAATGACAT 10320
ACAAAATGAG AGCTACACAA GTTTTATCAG AGACACTACT TGCAAATAAC ATAGGAAAAT 10380
TCTTTCAAGA AAATGGGATG GTGAAGGGAG AGATTGAATT ACTTAAGAGA TTAACAACCA 10440
TATCAATATC AGGAGTTCCA CGGTATAATG AAGTGTACAA TAATTCTAAA AGCCATACAG 10500
ATGACCTTAA AACCTACAAT AAAATAAGTA ATCTTAATTT GTCTTCTAAT CAGAAATCAA 10560
AGAAATTTGA ATTCAAGTCA ACGGATATCT ACAATGATGG ATACGAGACT GTGAGCTGTT 10620
TCCTAACAAC AGATCTCAAA AAATACTGTC TTAATTGGAG ATATGAATCA ACAGCTCTAT 10680
TTGGAGAAAC TTGCAACCAA ATATTTGGAT TAAATAAATT GTTTAATTGG TTACACCCTC 10740
GTCTTGAAGG AAGTACAATC TATGTAGGTG ATCCTTACTG TCCTCCATCA GATAAAGAAC 10800
ATATATCATT AGAGGATCAC CCTGATTCTG GTTTTTACGT TCATAACCCA AGAGGGGGTA 10860
TAGAAGGATT TTGTCAAAAA TTATGGACAC TCATATCTAT AAGTGCAATA CATCTAGCAG 10920
CTGTTAGAAT AGGCGTGAGG GTGACTGCAA TGGTTCAAGG AGACAATCAA GCTATAGCTG 10980
TAACCACAAG AGTACCCAAC AATTATGACT ACAGAGTTAA GAAGGAGATA GTTTATAAAG 11040
ATGTAGTGAG ATTTTTTGAT TCATTAAGAG AAGTGATGGA TGATCTAGGT CATGAACTTA 11100
AATTAAATGA AACGATTATA AGTAGCAAGA TGTTCATATA TAGCAAAAGA ATCTATTATG 11160
ATGGGAGAAT TCTTCCTCAA GCTCTAAAAG CATTATCTAG ATGTGTCTTC TGGTCAGAGA 11220
CAGTAATAGA CGAAACAAGA TCAGCATCTT CAAATTTGGC AACATCATTT GCAAAAGCAA 11280
TTGAGAATGG TTATTCACCT GTTCTAGGAT ATGCATGCTC AATTTTTAAG AACATTCAAC 11340
AACTATATAT TGCCCTTGGG ATGAATATCA ATCCAACTAT AACACAGAAT ATCAGAGATC 11400
AGTATTTTAG GAATCCAAAT TGGATGCAAT ATGCCTCTTT AATACCTGCT AGTGTTGGGG 11460
GATTCAATCA CATGGCCATG TCAAGATGTT TTGTAAGGAA TATTGGTGAT CCATCAGTTG 11520
CCGCATTGGC TGATATTAAA AGATTTATTA AGGCGAATCT ATTAGACCGA AGTGTTCTTT 11580
ATAGGATTAT GAATCAAGAA CCAGGTGAGT CATCTTTTTT TGACTGGGCT TCAGATCCAT 11640
ATTCATGCAA TTTACCACAA TCTCAAAATA TAACCACCAT GATAAAAAAT ATAACAGCAA 11700
GGAATGTATT ACAAGATTCA CCAAATCCAT TATTATCTGG ATTATTCACA AATACAATGA 11760
TAGAAGAAGA TGAAGAATTA GCTGAGTTCC TGATGGACAG GAAGGTAATT CTCCCTAGAG 11820
TTGCACATGA TATTCTAGAT AATTCTCTCA CAGGAATTAG AAATGCCATA GCTGGAATGT 11880
TAGATACGAC AAAATCACTA ATTCGGGTTG GCATAAATAG AGGAGGACTG ACATATAGTT 11940
TGTTGAGGAA AATCAGTAAT TACGATCTAG TACAATATGA AACACTAAGT AGGACTTTGC 12000
GACTAATTGT AAGTGATAAA ATCAAGTATG AAGATATGTG TTCGGTAGAC CTTGCCATAG 12060
CATTGCGACA AAAGATGTGG ATTCATTTAT CAGGAGGAAG GATGATAAGT GGACTTGAAA 12120
CGCCTGACCC ATTAGAATTA CTATCTGGGG TAGTAATAAC AGGATCAGAA CATTGTAAAA 12180
TATGTTATTC TTCAGATGGC ACAAACCCAT ATACTTGGAT GTATTTACCC GGTAATATCA 12240
AAATAGGATC AGCAGAAACA GGTATATCGT CATTAAGAGT TCCTTATTTT GGATCAGTCA 12300
CTGATGAAAG ATCTGAAGCA CAATTAGGAT ATATCAAGAA TCTTAGTAAA CCTGCAAAAG 12360
CCGCAATAAG AATAGCAATG ATATATACAT GGGCATTTGG TAATGATGAG ATATCTTGGA 12420
TGGAAGCCTC ACAGATAGCA CAAACACGTG CAAATTTTAC ACTAGATAGT CTCAAAATTT 12480
TAACACCGGT AGCTACATCA ACAAATTTAT CACACAGATT AAAGGATACT GCAACTCAGA 12540
TGAAATTCTC CAGTACATCA TTGATCAGAG TCAGCAGATT TATAACAATG TCCAATGATA 12600
ACATGTCTAT CAAAGAAGCT AATGAAACCA AAGATACTAA TCTTATTTAT CAACAAATAA 12660
TGTTAACAGG ATTAAGTGTT TTCGAATATT TATTTAGATT AAAAGAAACC ACAGGACACA 12720
ACCCTATAGT TATGCATCTG CACATAGAAG ATGAGTGTTG TATTAAAGAA AGTTTTAATG 12780
ATGAACATAT TAATCCAGAG TCTACATTAG AATTAATTCG ATATCCTGAA AGTAATGAAT 12840
TTATTTATGA TAAAGACCCA CTCAAAGATG TGGACTTATC AAAACTTATG GTTATTAAAG 12900
ACCATTCTTA CACAATTGAT ATGAATTATT GGGATGATAC TGACATCATA CATGCAATTT 12960
CAATATGTAC TGCAATTACA ATAGCAGATA CTATGTCACA ATTAGATCGA GATAATTTAA 13020
AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG CTTAATCACT GAATTTTTGA 13080
CTCTTGACAT ACTTGTATTT CTCAAGACAT TTGGTGGATT ATTAGTAAAT CAATTTGCAT 13140
ACACTCTTTA TAGTCTAAAA ATAGAAGGTA GGGATCTCAT TTGGGATTAT ATAATGAGAA 13200
CACTGAGAGA TACTTCCCAT TCAATATTAA AAGTATTATC TAATGCATTA TCTCATCCTA 13260
AAGTATTCAA GAGGTTCTGG GATTGTGGAG TTTTAAACCC TATTTATGGT CCTAATATTG 13320
CTAGTCAAGA CCAGATAAAA CTTGCCCTAT CTATATGTGA ATATTCACTA GATCTATTTA 13380
TGAGAGAATG GTTGAATGGT GTATCACTTG AAATATACAT TTGTGACAGC GATATGGAAG 13440
TTGCAAATGA TAGGAAACAA GCCTTTATTT CTAGACACCT TTCATTTGTT TGTTGTTTAG 13500
CAGAAATTGC ATCTTTCGGA CCTAACCTGT TAAACTTAAC ATACTTGGAG AGACTTGATC 13560
TATTGAAACA ATATCTTGAA TTAAATATTA AAGAAGACCC TACTCTTAAA TATGTACAAA 13620
TATCTGGATT ATTAATTAAA TCGTTCCCAT CAACTGTAAC ATACGTAAGA AAGACTGCAA 13680
TCAAATATCT AAGGATTCGC GGTATTAGTC CACCTGAGGT AATTGATGAT TGGGATCCGG 13740
TAGAAGATGA AAATATGCTG GATAACATTG TCAAAACTAT AAATGATAAC TGTAATAAAG 13800
ATAATAAAGG GAATAAAATT AACAATTTCT GGGGACTAGC ACTTAAGAAC TATCAAGTCC 13860
TTAAAATCAG ATCTATAACA AGTGATTCTG ATGATAATGA TAGACTAGAT GCTAATACAA 13920
GTGGTTTGAC ACTTCCTCAA GGAGGGAATT ATCTATCGCA TCAATTGAGA TTATTCGGAA 13980
TCAACAGCAC TAGTTGTCTG AAAGCTCTTG AGTTATCACA AATTTTAATG AAGGAAGTCA 14040
ATAAAGACAA GGACAGGCTC TTCCTGGGAG AAGGAGCAGG AGCTATGCTA GCATGTTATG 14100
ATGCCACATT AGGACCTGCA GTTAATTATT ATAATTCAGG TTTGAATATA ACAGATGTAA 14160
TTGGTCAACG AGAATTGAAA ATATTTCCTT CAGAGGTATC ATTAGTAGGT AAAAAATTAG 14220
GAAATGTGAC ACAGATTCTT AACAGGGTAA AAGTACTGTT CAATGGGAAT CCTAATTCAA 14280
CATGGATAGG AAATATGGAA TGTGAGAGCT TAATATGGAG TGAATTAAAT GATAAGTCCA 14340
TTGGATTAGT ACATTGTGAT ATGGAAGGAG CTATCGGTAA ATCAGAAGAA ACTGTTCTAC 14400
ATGAACATTA TAGTGTTATA AGAATTACAT ACTTGATTGG GGATGATGAT GTTGTTTTAG 14460
TTTCCAAAAT TATACCTACA ATCACTCCGA ATTGGTCTAG AATACTTTAT CTATATAAAT 14520
TATATTGGAA AGATGTAAGT ATAATATCAC TCAAAACTTC TAATCCTGCA TCAACAGAAT 14580
TATATCTAAT TTCGAAAGAT GCATATTGTA CTATAATGGA ACCTAGTGAA ATTGTTTTAT 14640
CAAAACTTAA AAGATTGTCA CTCTTGGAAG AAAATAATCT ATTAAAATGG ATCATTTTAT 14700
CAAAGAAGAG GAATAATGAA TGGTTACATC ATGAAATCAA AGAAGGAGAA AGAGATTATG 14760
GAATCATGAG ACCATATCAT ATGGCACTAC AAATCTTTGG ATTTCAAATC AATTTAAATC 14820
ATCTGGCGAA AGAATTTTTA TCAACCCCAG ATCTGACTAA TATCAACAAT ATAATCCAAA 14880
GTTTTCAGCG AACAATAAAG GATGTTTTAT TTGAATGGAT TAATATAACT CATGATGATA 14940
AGAGACATAA ATTAGGCGGA AGATATAACA TATTCCCACT GAAAAATAAG GGAAAGTTAA 15000
GACTGCTATC GAGAAGACTA GTATTAAGTT GGATTTCATT ATCATTATCG ACTCGATTAC 15060
TTACAGGTCG CTTTCCTGAT GAAAAATTTG AACATAGAGC ACAGACTGGA TATGTATCAT 15120
TAGCTGATAC TGATTTAGAA TCATTAAAGT TATTGTCGAA AAACATCATT AAGAATTACA 15180
GAGAGTGTAT AGGATCAATA TCATATTGGT TTCTAACCAA AGAAGTTAAA ATACTTATGA 15240
AATTGATTGG TGGTGCTAAA TTATTAGGAA TTCCCAGACA ATATAAAGAA CCCGAAGACC 15300
AGTTATTAGA AAACTACAAT CAACATGATG AATTTGATAT CGATTAAAAC ATAAATACAA 15360
TGAAGATATA TCCTAACCTT TATCTTTAAG CCTAGGAATA GACAAAAAGT AAGAAAAACA 15420
TGTAATATAT ATATACCAAA CAGAGTTCTT CTCTTGTTTG GT 15462
(2) Information for SEQ ID NO:20:
...
(i) sequence characteristics:
(A) length: 2233 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:20:
Met Asp Thr Glu Ser Asn Asn Gly Thr Val Ser Asp Ile Leu Tyr Pro
1               5                   10                  15
Glu Cys His Leu Asn Ser Pro Ile Val Lys Gly Lys Ile Ala Gln Leu
            20                  25                  30
His Thr Ile Met Ser Leu Pro Gln Pro Tyr Asp Met Asp Asp Asp Ser
        35                  40                  45
Ile Leu Val Ile Thr Arg Gln Lys Ile Lys Leu Asn Lys Leu Asp Lys
    50                  55                  60
Arg Gln Arg Ser Ile Arg Arg Leu Lys Leu Ile Leu Thr Glu Lys Val
65                  70                  75                  80
Asn Asp Leu Gly Lys Tyr Thr Phe Ile Arg Tyr Pro Glu Met Ser Lys
                85                  90                  95
Glu Met Phe Lys Leu Tyr Ile Pro Gly Ile Asn Ser Lys Val Thr Glu
            100                 105                 110
Leu Leu Leu Lys Ala Asp Arg Thr Tyr Ser Gln Met Thr Asp Gly Leu
        115                 120                 125
Arg Asp Leu Trp Ile Asn Val Leu Ser Lys Leu Ala Ser Lys Asn Asp
    130                 135                 140
Gly Ser Asn Tyr Asp Leu Asn Glu Glu Ile Asn Asn Ile Ser Lys Val
145                 150                 155                 160
His Thr Thr Tyr Lys Ser Asp Lys Trp Tyr Asn Pro Phe Lys Thr Trp
                165                 170                 175
Phe Thr Ile Lys Tyr Asp Met Arg Arg Leu Gln Lys Ala Arg Asn Glu
            180                 185                 190
Ile Thr Phe Asn Val Gly Lys Asp Tyr Asn Leu Leu Glu Asp Gln Lys
        195                 200                 205
Asn Phe Leu Leu Ile His Pro Glu Leu Val Leu Ile Leu Asp Lys Gln
    210                 215                 220
Asn Tyr Asn Gly Tyr Leu Ile Thr Pro Glu Leu Val Leu Met Tyr Cys
225                 230                 235                 240
Asp Val Val Glu Gly Arg Trp Asn Ile Ser Ala Cys Ala Lys Leu Asp
                245                 250                 255
Pro Lys Leu Gln Ser Met Tyr Gln Lys Gly Asn Asn Leu Trp Glu Val
            260                 265                 270
Ile Asp Lys Leu Phe Pro Ile Met Gly Glu Lys Thr Phe Asp Val Ile
        275                 280                 285
Ser Leu Leu Glu Pro Leu Ala Leu Ser Leu Ile Gln Thr His Asp Pro
    290                 295                 300
Val Lys Gln Leu Arg Gly Ala Phe Leu Asn His Val Leu Ser Glu Met
305                 310                 315                 320
Glu Leu Ile Phe Glu Ser Arg Glu Ser Ile Lys Glu Phe Leu Ser Val
                325                 330                 335
Asp Tyr Ile Asp Lys Ile Leu Asp Ile Phe Asn Lys Ser Thr Ile Asp
            340                 345                 350
Glu Ile Ala Glu Ile Phe Ser Phe Phe Arg Thr Phe Gly His Pro Pro
        355                 360                 365
Leu Glu Ala Ser Ile Ala Ala Glu Lys Val Arg Lys Tyr Met Tyr Ile
    370                 375                 380
Gly Lys Gln Leu Lys Phe Asp Thr Ile Asn Lys Cys His Ala Ile Phe
385                 390                 395                 400
Cys Thr Ile Ile Ile Asn Gly Tyr Arg Glu Arg His Gly Gly Gln Trp
                405                 410                 415
Pro Pro Val Thr Leu Pro Asp His Ala His Glu Phe Ile Ile Asn Ala
            420                 425                 430
Tyr Gly Ser Asn Ser Ala Ile Ser Tyr Glu Asn Ala Val Asp Tyr Tyr
        435                 440                 445
Gln Ser Phe Ile Gly Ile Lys Phe Asn Lys Phe Ile Glu Pro Gln Leu
    450                 455                 460
Asp Glu Asp Leu Thr Ile Tyr Met Lys Asp Lys Ala Leu Ser Pro Lys
465                 470                 475                 480
Lys Ser Asn Trp Asp Thr Val Tyr Pro Ala Ser Asn Leu Leu Tyr Arg
                485                 490                 495
Thr Asn Ala Ser Asn Glu Ser Arg Arg Leu Val Glu Val Phe Ile Ala
            500                 505                 510
Asp Ser Lys Phe Asp Pro His Gln Ile Leu Asp Tyr Val Glu Ser Gly
        515                 520                 525
Asp Trp Leu Asp Asp Pro Glu Phe Asn Ile Ser Tyr Ser Leu Lys Glu
    530                 535                 540
Lys Glu Ile Lys Gln Glu Gly Arg Leu Phe Ala Lys Met Thr Tyr Lys
545                 550                 555                 560
Met Arg Ala Thr Gln Val Leu Ser Glu Thr Leu Leu Ala Asn Asn Ile
                565                 570                 575
Gly Lys Phe Phe Gln Glu Asn Gly Met Val Lys Gly Glu Ile Glu Leu
            580                 585                 590
Leu Lys Arg Leu Thr Thr Ile Ser Ile Ser Gly Val Pro Arg Tyr Asn
        595                 600                 605
Glu Val Tyr Asn Asn Ser Lys Ser His Thr Asp Asp Leu Lys Thr Tyr
    610                 615                 620
Asn Lys Ile Ser Asn Leu Asn Leu Ser Ser Asn Gln Lys Ser Lys Lys
625                 630                 635                 640
Phe Glu Phe Lys Ser Thr Asp Ile Tyr Asn Asp Gly Tyr Glu Thr Val
                645                 650                 655
Ser Cys Phe Leu Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
            660                 665                 670
Tyr Glu Ser Thr Ala Leu Phe Gly Glu Thr Cys Asn Gln Ile Phe Gly
        675                 680                 685
Leu Asn Lys Leu Phe Asn Trp Leu His Pro Arg Leu Glu Gly Ser Thr
690?????????????????695?????????????????700Ile?Tyr?Val?Gly?Asp?Pro?Tyr?Cys?Pro?Pro?Ser?Asp?Lys?Glu?His?Ile705?????????????????710?????????????????715?????????????????720Ser?Leu?Glu?Asp?His?Pro?Asp?Ser?Gly?Phe?Tyr?Val?His?Asn?Pro?Arg
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Phe?Cys?Gln?Lys?Leu?Trp?Thr?Leu?Ile?Ser?Ile
740?????????????????745?????????????????750Ser?Ala?Ile?His?Leu?Ala?Ala?Val?Arg?Ile?Gly?Val?Arg?Val?Thr?Ala
755?????????????????760?????????????????765Met?Val?Gln?Gly?Asp?Asn?Gln?Ala?Ile?Ala?Val?Thr?Thr?Arg?Val?Pro
770?????????????????775?????????????????780Asn?Asn?Tyr?Asp?Tyr?Arg?Val?Lys?Lys?Glu?Ile?Val?Tyr?Lys?Asp?Val785?????????????????790?????????????????795?????????????????800Val?Arg?Phe?Phe?Asp?Ser?Leu?Arg?Glu?Val?Met?Asp?Asp?Leu?Gly?His
805                 810                 815
Glu Leu Lys Leu Asn Glu Thr Ile Ile Ser Ser Lys Met Phe Ile Tyr
820                 825                 830
Ser Lys Arg Ile Tyr Tyr Asp Gly Arg Ile Leu Pro Gln Ala Leu Lys
835                 840                 845
Ala Leu Ser Arg Cys Val Phe Trp Ser Glu Thr Val Ile Asp Glu Thr
850                 855                 860
Arg Ser Ala Ser Ser Asn Leu Ala Thr Ser Phe Ala Lys Ala Ile Glu
865                 870                 875                 880
Asn Gly Tyr Ser Pro Val Leu Gly Tyr Ala Cys Ser Ile Phe Lys Asn
885                 890                 895
Ile Gln Gln Leu Tyr Ile Ala Leu Gly Met Asn Ile Asn Pro Thr Ile
900                 905                 910
Thr Gln Asn Ile Arg Asp Gln Tyr Phe Arg Asn Pro Asn Trp Met Gln
915                 920                 925
Tyr Ala Ser Leu Ile Pro Ala Ser Val Gly Gly Phe Asn His Met Ala
930                 935                 940
Met Ser Arg Cys Phe Val Arg Asn Ile Gly Asp Pro Ser Val Ala Ala
945                 950                 955                 960
Leu Ala Asp Ile Lys Arg Phe Ile Lys Ala Asn Leu Leu Asp Arg Ser
965                 970                 975
Val Leu Tyr Arg Ile Met Asn Gln Glu Pro Gly Glu Ser Ser Phe Phe
980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Cys Asn Leu Pro Gln Ser Gln Asn
995                 1000                1005
Ile Thr Thr Met Ile Lys Asn Ile Thr Ala Arg Asn Val Leu Gln Asp
1010                1015                1020
Ser Pro Asn Pro Leu Leu Ser Gly Leu Phe Thr Asn Thr Met Ile Glu
1025                1030                1035                1040
Glu Asp Glu Glu Leu Ala Glu Phe Leu Met Asp Arg Lys Val Ile Leu
1045                1050                1055
Pro Arg Val Ala His Asp Ile Leu Asp Asn Ser Leu Thr Gly Ile Arg
1060                1065                1070
Asn Ala Ile Ala Gly Met Leu Asp Thr Thr Lys Ser Leu Ile Arg Val
1075                1080                1085
Gly Ile Asn Arg Gly Gly Leu Thr Tyr Ser Leu Leu Arg Lys Ile Ser
1090                1095                1100
Asn Tyr Asp Leu Val Gln Tyr Glu Thr Leu Ser Arg Thr Leu Arg Leu
1105                1110                1115                1120
Ile Val Ser Asp Lys Ile Lys Tyr Glu Asp Met Cys Ser Val Asp Leu
1125                1130                1135
Ala Ile Ala Leu Arg Gln Lys Met Trp Ile His Leu Ser Gly Gly Arg
1140                1145                1150
Met Ile Ser Gly Leu Glu Thr Pro Asp Pro Leu Glu Leu Leu Ser Gly
1155                1160                1165
Val Val Ile Thr Gly Ser Glu His Cys Lys Ile Cys Tyr Ser Ser Asp
1170                1175                1180
Gly Thr Asn Pro Tyr Thr Trp Met Tyr Leu Pro Gly Asn Ile Lys Ile
1185                1190                1195                1200
Gly Ser Ala Glu Thr Gly Ile Ser Ser Leu Arg Val Pro Tyr Phe Gly
1205                1210                1215
Ser Val Thr Asp Glu Arg Ser Glu Ala Gln Leu Gly Tyr Ile Lys Asn
1220                1225                1230
Leu Ser Lys Pro Ala Lys Ala Ala Ile Arg Ile Ala Met Ile Tyr Thr
1235                1240                1245
Trp Ala Phe Gly Asn Asp Glu Ile Ser Trp Met Glu Ala Ser Gln Ile
1250                1255                1260
Ala Gln Thr Arg Ala Asn Phe Thr Leu Asp Ser Leu Lys Ile Leu Thr
1265                1270                1275                1280
Pro Val Ala Thr Ser Thr Asn Leu Ser His Arg Leu Lys Asp Thr Ala
1285                1290                1295
Thr Gln Met Lys Phe Ser Ser Thr Ser Leu Ile Arg Val Ser Arg Phe
1300                1305                1310
Ile Thr Met Ser Asn Asp Asn Met Ser Ile Lys Glu Ala Asn Glu Thr
1315                1320                1325
Lys Asp Thr Asn Leu Ile Tyr Gln Gln Ile Met Leu Thr Gly Leu Ser
1330                1335                1340
Val Phe Glu Tyr Leu Phe Arg Leu Lys Glu Thr Thr Gly His Asn Pro
1345                1350                1355                1360
Ile Val Met His Leu His Ile Glu Asp Glu Cys Cys Ile Lys Glu Ser
1365                1370                1375
Phe Asn Asp Glu His Ile Asn Pro Glu Ser Thr Leu Glu Leu Ile Arg
1380                1385                1390
Tyr Pro Glu Ser Asn Glu Phe Ile Tyr Asp Lys Asp Pro Leu Lys Asp
1395                1400                1405
Val Asp Leu Ser Lys Leu Met Val Ile Lys Asp His Ser Tyr Thr Ile
1410                1415                1420
Asp Met Asn Tyr Trp Asp Asp Thr Asp Ile Ile His Ala Ile Ser Ile
1425                1430                1435                1440
Cys Thr Ala Ile Thr Ile Ala Asp Thr Met Ser Gln Leu Asp Arg Asp
1445                1450                1455
Asn Leu Lys Glu Ile Ile Val Ile Ala Asn Asp Asp Asp Ile Asn Ser
1460                1465                1470
Leu Ile Thr Glu Phe Leu Thr Leu Asp Ile Leu Val Phe Leu Lys Thr
1475                1480                1485
Phe Gly Gly Leu Leu Val Asn Gln Phe Ala Tyr Thr Leu Tyr Ser Leu
1490                1495                1500
Lys Ile Glu Gly Arg Asp Leu Ile Trp Asp Tyr Ile Met Arg Thr Leu
1505                1510                1515                1520
Arg Asp Thr Ser His Ser Ile Leu Lys Val Leu Ser Asn Ala Leu Ser
1525                1530                1535
His Pro Lys Val Phe Lys Arg Phe Trp Asp Cys Gly Val Leu Asn Pro
1540                1545                1550
Ile Tyr Gly Pro Asn Ile Ala Ser Gln Asp Gln Ile Lys Leu Ala Leu
1555                1560                1565
Ser Ile Cys Glu Tyr Ser Leu Asp Leu Phe Met Arg Glu Trp Leu Asn
1570                1575                1580
Gly Val Ser Leu Glu Ile Tyr Ile Cys Asp Ser Asp Met Glu Val Ala
1585                1590                1595                1600
Asn Asp Arg Lys Gln Ala Phe Ile Ser Arg His Leu Ser Phe Val Cys
1605                1610                1615
Cys Leu Ala Glu Ile Ala Ser Phe Gly Pro Asn Leu Leu Asn Leu Thr
1620                1625                1630
Tyr Leu Glu Arg Leu Asp Leu Leu Lys Gln Tyr Leu Glu Leu Asn Ile
1635                1640                1645
Lys Glu Asp Pro Thr Leu Lys Tyr Val Gln Ile Ser Gly Leu Leu Ile
1650                1655                1660
Lys Ser Phe Pro Ser Thr Val Thr Tyr Val Arg Lys Thr Ala Ile Lys
1665                1670                1675                1680
Tyr Leu Arg Ile Arg Gly Ile Ser Pro Pro Glu Val Ile Asp Asp Trp
1685                1690                1695
Asp Pro Val Glu Asp Glu Asn Met Leu Asp Asn Ile Val Lys Thr Ile
1700                1705                1710
Asn Asp Asn Cys Asn Lys Asp Asn Lys Gly Asn Lys Ile Asn Asn Phe
1715                1720                1725
Trp Gly Leu Ala Leu Lys Asn Tyr Gln Val Leu Lys Ile Arg Ser Ile
1730                1735                1740
Thr Ser Asp Ser Asp Asp Asn Asp Arg Leu Asp Ala Asn Thr Ser Gly
1745                1750                1755                1760
Leu Thr Leu Pro Gln Gly Gly Asn Tyr Leu Ser His Gln Leu Arg Leu
1765                1770                1775
Phe Gly Ile Asn Ser Thr Ser Cys Leu Lys Ala Leu Glu Leu Ser Gln
1780                1785                1790
Ile Leu Met Lys Glu Val Asn Lys Asp Lys Asp Arg Leu Phe Leu Gly
1795                1800                1805
Glu Gly Ala Gly Ala Met Leu Ala Cys Tyr Asp Ala Thr Leu Gly Pro
1810                1815                1820
Ala Val Asn Tyr Tyr Asn Ser Gly Leu Asn Ile Thr Asp Val Ile Gly
1825                1830                1835                1840
Gln Arg Glu Leu Lys Ile Phe Pro Ser Glu Val Ser Leu Val Gly Lys
1845                1850                1855
Lys Leu Gly Asn Val Thr Gln Ile Leu Asn Arg Val Lys Val Leu Phe
1860                1865                1870
Asn Gly Asn Pro Asn Ser Thr Trp Ile Gly Asn Met Glu Cys Glu Ser
1875                1880                1885
Leu Ile Trp Ser Glu Leu Asn Asp Lys Ser Ile Gly Leu Val His Cys
1890                1895                1900
Asp Met Glu Gly Ala Ile Gly Lys Ser Glu Glu Thr Val Leu His Glu
1905                1910                1915                1920
His Tyr Ser Val Ile Arg Ile Thr Tyr Leu Ile Gly Asp Asp Asp Val
1925                1930                1935
Val Leu Val Ser Lys Ile Ile Pro Thr Ile Thr Pro Asn Trp Ser Arg
1940                1945                1950
Ile Leu Tyr Leu Tyr Lys Leu Tyr Trp Lys Asp Val Ser Ile Ile Ser
1955                1960                1965
Leu Lys Thr Ser Asn Pro Ala Ser Thr Glu Leu Tyr Leu Ile Ser Lys
1970                1975                1980
Asp Ala Tyr Cys Thr Ile Met Glu Pro Ser Glu Ile Val Leu Ser Lys
1985                1990                1995                2000
Leu Lys Arg Leu Ser Leu Leu Glu Glu Asn Asn Leu Leu Lys Trp Ile
2005                2010                2015
Ile Leu Ser Lys Lys Arg Asn Asn Glu Trp Leu His His Glu Ile Lys
2020                2025                2030
Glu Gly Glu Arg Asp Tyr Gly Ile Met Arg Pro Tyr His Met Ala Leu
2035                2040                2045
Gln Ile Phe Gly Phe Gln Ile Asn Leu Asn His Leu Ala Lys Glu Phe
2050                2055                2060
Leu Ser Thr Pro Asp Leu Thr Asn Ile Asn Asn Ile Ile Gln Ser Phe
2065                2070                2075                2080
Gln Arg Thr Ile Lys Asp Val Leu Phe Glu Trp Ile Asn Ile Thr His
2085                2090                2095
Asp Asp Lys Arg His Lys Leu Gly Gly Arg Tyr Asn Ile Phe Pro Leu
2100                2105                2110
Lys Asn Lys Gly Lys Leu Arg Leu Leu Ser Arg Arg Leu Val Leu Ser
2115                2120                2125
Trp Ile Ser Leu Ser Leu Ser Thr Arg Leu Leu Thr Gly Arg Phe Pro
2130                2135                2140
Asp Glu Lys Phe Glu His Arg Ala Gln Thr Gly Tyr Val Ser Leu Ala
2145                2150                2155                2160
Asp Thr Asp Leu Glu Ser Leu Lys Leu Leu Ser Lys Asn Ile Ile Lys
2165                2170                2175
Asn Tyr Arg Glu Cys Ile Gly Ser Ile Ser Tyr Trp Phe Leu Thr Lys
2180                2185                2190
Glu Val Lys Ile Leu Met Lys Leu Ile Gly Gly Ala Lys Leu Leu Gly
2195                2200                2205
Ile Pro Arg Gln Tyr Lys Glu Pro Glu Asp Gln Leu Leu Glu Asn Tyr
2210                2215                2220
Asn Gln His Asp Glu Phe Asp Ile Asp
2225                2230
(2) Information for SEQ ID NO:21:
(i) sequence characteristics:
(A) length: 15462 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:21:
ACCAAACAAG AGAAGAAACT TGCTTGGTAA TATAAATTTA ACTTAAAATT AACTTAGGAT 60
TTAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120
TATTTGATAC ATTTAATGCA CGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180
TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240
ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGAAAC 300
AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT GCCAATCCAG 360
AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGCCAA GTATGTCATA TACATGATTG 420
AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA GAGATGATAT 480
ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG GAAACTATGT 540
TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT GGGTATCCAT 600
CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT ATCACTAGTA 660
TCTCAGGGTT AAGAAAAGGC TTTTTCACCC GATTGGAAGC TTTCAGACAA GATGGAACAG 720
TGCAGGCAGG GCTGGTATTG AGCGGTGACA CAGTGGATCA GATTGGGTCA ATCATGCGGT 780
CTCAACAGAG CTTGGTAACT CTTATGGTTG AAACATTAAT AACAATGAAT ACCAGCAGAA 840
ATGACCTCAC AACCATAGAA AAGAATATAC AAATTGTTGG CAACTACATA AGAGATGCAG 900
GTCTCGCTTC ATTCTTCAAT ACAATCAGAT ATGGAATTGA GACCAGAATG GCAGCTTTGA 960
CTCTATCCAC TCTCAGACCA GATATCAATA GATTAAAAGC TTTGATGGAA CTGTATTTAT 1020
CAAAGGGACC ACGCGCTCCT TTCATCTGTA TCCTCAGAGA TCCTATACAT GGTGAGTTCG 1080
CACCAGGCAA CTATCCTGCC ATATGGAGCT ATGCAATGGG GGTGGCAGTT GTACAAAATA 1140
GAGCCATGCA ACAGTATGTG ACGGGAAGAT CATATCTAGA CATTGATATG TTCCAGCTAG 1200
GACAAGCAGT AGCACGTGAT GCCGAAGCTC AAATGAGCTC AACACTGGAA GATGAACTTG 1260
GAGTGACACA CGAAGCTAAA GAAAGCTTGA AGAGACATAT AAGGAACATA AACAGTTCAG 1320
AGACATCTTT CCACAAACCG ACAGGTGGAT CAGCCATAGA GATGGCAATA GATGAAGAGC 1380
CAGAACAATT CGAACATAGA GCAGATCAAG AACAAAATGG AGAACCTCAA TCATCCATAA 1440
TTCAATATGC CTGGGCAGAA GGAAATAGAA GCGATGATCA GACTGAGCAA GCTACAGAAT 1500
CTGACAATAT CAAGACCGAA CAACAAAACA TCAGAGACAG ACTAAACAAG AGACTCAACG 1560
ACAAGAAGAA ACAAAGCAGT CAACCACCCA CTAATCCCAC AAACAGAACA AACCAGGACG 1620
AAATAGATGA TCTGTTTAAC GCATTTGGAA GCAACTAATC GAATCAACAT TTTAATCTAA 1680
ATCAATAATA AATAAGAAAA ACTTAGGATT AAAGAATCCT ATCATACCGG AATATAGGGT 1740
GGTAAATTTA GAGTCTGCTT GAAACTCAAT CAATAGAGAG TTGATGGAAA GCGATGCTAA 1800
AAACTATCAA ATCATGGATT CTTGGGAAGA GGAATCAAGA GATAAATCAA CTAATATCTC 1860
CTCGGCCCTC AACATCATTG AATTCATACT CAGCACCGAC CCCCAAGAAG ACTTATCGGA 1920
AAACGACACA ATCAACACAA GAACCCAGCA ACTCAGTGCC ACCATCTGTC AACCAGAAAT 1980
CAAACCAACA GAAACAAGTG AGAAAGATAG TGGATCAACT GACAAAAATA GACAGTCTGG 2040
GTCATCACAC GAATGTACAA CAGAAGCAAA AGATAGAAAC ATTGATCAGG AAACTGTACA 2100
GAGAGGACCT GGGAGAAGAA GCAGCTCAGA TAGTAGAGCT GAGACTGTGG TCTCTGGAGG 2160
AATCCCCAGA AGCATCACAG ATTCTAAAAA TGGAACCCAA AACACGGAGG ATATTGATCT 2220
CAATGAAATT AGAAAGATGG ATAAGGACTC TATTGAGGGG AAAATGCGAC AATCTGCAAA 2280
TGTTCCAAGC GAGATATCAG GAAGTGATGA CATATTTACA ACAGAACAAA GTAGAAACAG 2340
TGATCATGGA AGAAGCCTGG AATCTATCAG TACACCTGAT ACAAGATCAA TAAGTGTTGT 2400
TACTGCTGCA ACACCAGATG ATGAAGAAGA AATACTAATG AAAAATAGTA GGACAAAGAA 2460
AAGTTCTTCA ACACATCAAG AAGATGACAA AAGAATTAAA AAAGGGGGAA AAGGGAAAGA 2520
CTGGTTTAAG AAATCAAAAG ATACCGACAA CCAGATACCA ACATCAGACT ACAGATCCAC 2580
ATCAAAAGGG CAGAAGAAAA TCTCAAAGAC AACAACCACC AACACCGACA CAAAGGGGCA 2640
AACAGAAATA CAGACAGAAT CATCAGAAAC ACAATCCTCA TCATGGAATC TCATCATCGA 2700
CAACAACACC GACCGGAACG AACAGACAAG CACAACTCCT CCAACAACAA CTTCCAGATC 2760
AACTTATACA AAAGAATCGA TCCGAACAAA CTCTGAATCC AAACCCAAGA CACAAAAGAC 2820
AAATGGAAAG GAAAGGAAGG ATACAGAAGA GAGCAATCGA TTTACAGAGA GGGCAATTAC 2880
TCTATTGCAG AATCTTGGTG TAATTCAATC CACATCAAAA CTAGATTTAT ATCAAGACAA 2940
ACGAGTTGTA TGTGTAGCAA ATGTACTAAA CAATGTAGAT ACTGCATCAA AGATAGATTT 3000
CCTGGCAGGA TTAGTCATAG GGGTTTCAAT GGACAACGAC ACAAAATTAA CACAGATACA 3060
AAATGAAATG CTAAACCTCA AAGCAGATCT AAAGAAAATG GACGAATCAC ATAGAAGATT 3120
GATAGAAAAT CAAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 3180
TATGACTGAG AGAGGAGGAA AGAAAGACCA AAATGAATCC AATGAGAGAG TATCCATGAT 3240
CAAAACAAAA TTGAAAGAAG AAAAGATCAA GAAGACCAGG TTTGACCCAC TTATGGAGGC 3300
ACAAGGCATT GACAAGAATA TACCCGATCT ATATCGACAT GCAGGAGATA CACTAGAGAA 3360
CGATGTACAA GTTAAATCAG AGATATTAAG TTCATACAAT GAGTCAAATG CAACAAGACT 3420
AATACCCAAA AAAGTGAGCA GTACAATGAG ATCACTAGTT GCAGTCATCA ACAACAGCAA 3480
TCTCTCACAA AGCACAAAAC AATCATACAT AAACGAACTC AAACGTTGCA AAAATGATGA 3540
AGAAGTATCT GAATTAATGG ACATGTTCAA TGAAGATGTC AACAATTGCC AATGATCCAA 3600
CAAAGAAACG ACACCGAACA AACAGACAAG AAACAACAGT AGATCAAAAC CTGTCAACAC 3660
ACACAAAATC AAGCAGAATG AAACAACAGA TATCAATCAA TATACAAATA AGAAAAACTT 3720
AGGATTAAAG AATAAATTAA TCCTTGTCCA AAATGAGTAT AACTAACTCT GCAATATACA 3780
CATTCCCAGA ATCATCATTC TCTGAAAATG GTCATATAGA ACCATTACCA CTCAAAGTCA 3840
ATGAACAGAG GAAAGCAGTA CCCCACATTA GAGTTGCCAA GATCGGAAAT CCACCAAAAC 3900
ACGGATCCCG GTATTTAGAT GTCTTCTTAC TCGGCTTCTT CGAGATGGAA CGAATCAAAG 3960
ACAAATACGG GAGTGTGAAT GATCTCGACA GTGACCCGAG TTACAAAGTT TGTGGCTCTG 4020
GATCATTACC AATCGGATTG GCTAAGTACA CTGGGAATGA CCAGGAATTG TTACAAGCCG 4080
CAACCAAACT GGATATAGAA GTGAGAAGAA CAGTCAAAGC GAAAGAGATG GTTGTTTACA 4140
CGGTACAAAA TATAAAACCA GAACTGTACC CATGGTCCAA TAGACTAAGA AAAGGAATGC 4200
TGTTCGATGC CAACAAAGTT GCTCTTGCTC CTCAATGTCT TCCACTAGAT AGGAGCATAA 4260
AATTTAGAGT AATCTTCGTG AATTGTACGG CAATTGGATC AATAACCTTG TTCAAAATTC 4320
CTAAGTCAAT GGCATCACTA TCTCTAACCA ACACAATATC AATCAATCTG CAGGTACACA 4380
TAAAAACAGG GGTTCAGACT GATTCTAAAG GGATAGTTCA AATTTTGGAT GAGAAAGGCG 4440
AAAAATCACT GAATTTCATG GTCCATCTCG GATTGATCAA AAGAAAAGTA GGCAGAATGT 4500
ACTCTGTTGA ATACTGTAAA CAGAAAATCG AGAAAATGAG ATTGATATTT TCTTTAGGAC 4560
TAGTTGGAGG AATCAGTCTT CATGTCAATG CAACTGGGTC CATATCAAAA ACACTAGCAA 4620
GTCAGCTGGT ATTCAAAAGA GAGATTTGTT ATCCTTTAAT GGATCTAAAT CCGCATCTCA 4680
ATCTAGTTAT CTGGGCTTCA TCAGTAGAGA TTACAAGAGT GGATGCAATT TTCCAACCTT 4740
CTTTACCTGG CGAGTTCAGA TACTATCCTA ATATTATTGC AAAAGGAGTT GGGAAAATCA 4800
AACAATGGAA CTAGTAATCT CTATTTTAGT CCGGACGTAT CTATTAAGCC GAAGCAAATA 4860
AAGGATAATC AAAAACTTAG GACAAAAGAG GTCAATACCA ACAACTATTA GCAGTCACAC 4920
TCGCAAGAAT AAGAGAGAAG GGACCAAAAA AGTCAAATAG GAGAAATCAA AACAAAAGGT 4980
ACAGAACACC AGAACAACAA AATCAAAACA TCCAACTCAC TCAAAACAAA AATTCCAAAA 5040
GAGACCGGCA ACACAACAAG CACTGAACAC AATGCCAACT TCAATACTGC TAATTATTAC 5100
AACCATGATC ATGGCATCTT TCTGCCAAAT AGATATCACA AAACTACAGC ACGTAGGTGT 5160
ATTGGTCAAC AGTCCCAAAG GGATGAAGAT ATCACAAAAC TTTGAAACAA GATATCTAAT 5220
TTTGAGCCTC ATACCAAAAA TAGAAGACTC TAACTCTTGT GGTGACCAAC AGATCAAGCA 5280
ATACAAGAAG TTATTGGATA GACTGATCAT CCCTTTATAT GATGGATTAA GATTACAGAA 5340
AGATGTGATA GTAACCAATC AAGAATCCAA TGAAAACACT GATCCCAGAA CAAAACGATT 5400
CTTTGGAGGG GTAATTGGAA CCATTGCTCT GGGAGTAGCA ACCTCAGCAC AAATTACAGC 5460
GGCAGTTGCT CTGGTTGAAG CCAAGCAGGC AAGATCAGAC ATCGAAAAAC TCAAAGAAGC 5520
AATTAGGGAC ACAAATAAAG CAGTGCAGTC AGTTCAGAGC TCCATAGGAA ATTTAATAGT 5580
AGCAATTAAA TCAGTCCAGG ATTATGTTAA CAAAGAAATC GTGCCATCGA TTGCGAGGCT 5640
AGGTTGTGAA GCAGCAGGAC TTCAATTAGG AATTGCATTA ACACAGCATT ACTCAGAATT 5700
AACAAACATA TTTGGTGATA ACATAGGATC GTTACAAGAA AAAGGAATAA AATTACAAGG 5760
TATAGCATCA TTATACCGCA CAAATATCAC AGAAATATTC ACAACATCAA CAGTTGATAA 5820
ATATGATATC TATGATCTGT TATTTACAGA ATCAATAAAG GTGAGAGTTA TAGATGTTGA 5880
CTTGAATGAT TACTCAATCA CCCTCCAAGT CAGACTCCCT TTATTAACTA GGCTGCTGAA 5940
CACTCAGATC TACAAAGTAG ATTCCATATC ATATAACATC CAAAACAGAG AATGGTATAT 6000
CCCTCTTCCC AGCCATATCA TGACGAAAGG GGCATTTCTA GGTGGAGCAG ACGTCAAAGA 6060
ATGTATAGAA GCATTCAGCA GCTATATATG CCCTTCTGAT CCAGGATTTG TATTAAACCA 6120
TGAAATAGAG AGCTGCTTAT CAGGAAACAT ATCCCAATGT CCAAGAACAA CGGTCACATC 6180
AGACATTGTT CCAAGATATG CATTTGTCAA TGGAGGAGTG GTTGCAAACT GTATAACAAC 6240
CACCTGTACA TGCAACGGAA TTGGTAATAG AATCAATCAA CCACCTGATC AAGGAGTAAA 6300
AATTATAACA CATAAAGAAT GTAGTACAGT AGGTATCAAC GGAATGCTGT TCAATACAAA 6360
TAAAGAAGGA ACTCTTGCAT TCTATACACC AAATGATATA ACACTAAACA ATTCTGTTAC 6420
ACTTGATCCA ATTGACATAT CAATCGAGCT CAACAAGGCC AAATCAGATC TAGAAGAATC 6480
AAAAGAATGG ATAAGAAGGT CAAATCAAAA ACTAGATTCT ATTGGAAATT GGCATCAATC 6540
TAGCACTACA ATCATAATTA TTTTGATAAT GATCATTATA TTGTTTATAA TTAATATAAC 6600
GATAATTACA ATTGCAATTA AGTATTACAG AATTCAAAAG AGAAATCGAG TGGATCAAAA 6660
TGACAAGCCA TATGTACTAA CAAACAAATA ACATATCTAC AGATCATTAG ATATTAAAAT 6720
TATAAAAAAC TTAGGAGTAA AGTTACGCAA TCCAACTCTA CTCATATAAT TGAGGAAGGA 6780
CCCAATAGAC AAATCCAAAT TCGAGATGGA ATACTGGAAG CATACCAATC ACGGAAAGGA 6840
TGCTGGCAAT GAGCTGGAGA CGTCTATGGC TACTCATGGC AACAAGCTCA CTAATAAGAT 6900
AATATACATA TTATGGACAA TAATCCTGGT GTTATTATCA ATAGTCTTCA TCATAGTGCT 6960
AATTAATTCC ATCAAAAGTG AAAAGGCCCA CGAATCATTG CTGCAAGACA TAAATAATGA 7020
GTTTATGGAA ATTACAGAAA AGATCCAAAT GGCATCGGAT AATACCAATG ATCTAATACA 7080
GTCAGGAGTG AATACAAGGC TTCTTACAAT TCAGAGTCAT GTCCAGAATT ACATACCAAT 7140
ATCATTGACA CAACAGATGT CAGATCTTAG GAAATTCATT AGTGAAATTA CAATTAGAAA 7200
TGATAATCAA GAAGTGCTGC CACAAAGAAT AACACATGAT GTAGGTATAA AACCTTTAAA 7260
TCCAGATGAT TTTTGGAGAT GCACGTCTGG TCTTCCATCT TTAATGAAAA CTCCAAAAAT 7320
AAGGTTAATG CCAGGGCCGG GATTATTAGC TATGCCAACG ACTGTTGATG GCTGTGTTAG 7380
AACTCCGTCT TTAGTTATAA ATGATCTGAT TTATGCTTAT ACCTCAAATC TAATTACTCG 7440
AGGTTGTCAG GATATAGGAA AATCATATCA AGTCTTACAG ATAGGGATAA TAACTGTAAA 7500
CTCAGACTTG GTACCTGACT TAAATCCTAG GATCTCTCAT ACCTTTAACA TAAATGACAA 7560
TAGGAAGTCA TGTTCTCTAG CACTCCTAAA TACAGATGTA TATCAACTGT GTTCAACTCC 7620
CAAAGTTGAT GAAAGATCAG ATTATGCATC ATCAGGCATA GAAGATATTG TACTTGATAT 7680
TGTCAATTAT GATGGTTCAA TCTCAACAAC AAGATTTAAG AATAATAACA TAAGCTTTGA 7740
TCAACCATAT GCTGCACTAT ACCCATCTGT TGGACCAGGG ATATACTACA AAGGCAAAAT 7800
AATATTTCTC GGGTATGGAG GTCTTGAACA TCCAATAAAT GAGAATGTAA TCTGCAACAC 7860
AACTGGGTGC CCCGGGAAAA CACAGAGAGA CTGTAATCAA GCGTCTCATA GTCCATGGTT 7920
TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTTGCTGAC AAAGGCTTAA ACTCAATTCC 7980
AAAATTGAAA GTATGGACGA TATCTATGCG ACAAAATTAC TGGGGGTCAG AAGGAAGGTT 8040
ACTTCTACTA GGTAACAAGA TCTATATATA TACAAGATCT ACAAGTTGGC ATAGCAAGTT 8100
ACAATTAGGA ATAATTGATA TTACTGATTA CAGTGATATA AGGATAAAAT GGACATGGCA 8160
TAATGTGCTA TCAAGACCAG GAAACAATGA ATGTCCATGG GGACATTCAT GTCCAGATGG 8220
ATGTATAACA GGAGTATATA CTGATGCATA TCCACTCAAT CCCACAGGGA GCATTGTGTC 8280
ATCTGTCATA TTAGACTCAC AAAAATCGAG AGTGAACCCA GTCATAACTT ACTCAACAGC 8340
AACCGAAAGA GTAAACGAGC TGGCCATCCT AAACAGAACA CTCTCAGCTG GATATACAAC 8400
AACAAGCTGC ATTACACACT ATAACAAAGG ATATTGTTTT CATATAGTAG AAATAAATCA 8460
TAAAAGCTTA AACACATTTC AACCCATGTT GTTCAAAACA GAGATTCCAA AAAGCTGCAG 8520
TTAATCATAA TTAACCATAA TATGCATCAA TCTATCTATA ATACAAGTAT ATGATAAGTA 8580
ATCAGCAATC AGACAATAGA CAAAAGGGAA ATATAAAAAA CTTAGGAGCA AAGCGTGCTC 8640
GGGAAATGGA CACTGAATCT AACAATGGCA CTGTATCTGA CATACTCTAT CCTGAGTGTC 8700
ACCTTAACTC TCCTATCGTT AAAGGTAAAA TAGCACAATT ACACACTATT ATGAGTCTAC 8760
CTCAGCCTTA TGATATGGAT GACGACTCAA TACTAGTTAT CACTAGACAG AAAATAAAAC 8820
TTAATAAATT GGATAAAAGA CAACGATCTA TTAGAAGATT AAAATTAATA TTAACTGAAA 8880
AAGTGAATGA CTTAGGAAAA TACACATTTA TCAGATATCC AGAAATGTCA AAAGAAATGT 8940
TCAAATTATA TATACCTGGT ATTAACAGTA AAGTGACTGA ATTATTACTT AAAGCAGATA 9000
GAACATATAG TCAAATGACT GATGGATTAA GAGATCTATG GATTAATGTG CTATCAAAAT 9060
TAGCCTCAAA AAATGATGGA AGCAATTATG ATCTTAATGA AGAAATTAAT AATATATCGA 9120
AAGTTCACAC AACCTATAAA TCAGATAAAT GGTATAATCC ATTCAAAACA TGGTTTACTA 9180
TCAAGTATGA TATGAGAAGA TTACAAAAAG CTCGAAATGA GATCACTTTT AATGTTGGGA 9240
AGGATTATAA CTTGTTAGAA GACCAGAAGA ATTTCTTATT GATACATCCA GAATTGGTTT 9300
TGATATTAGA TAAACAAAAC TACAATGGTT ATCTAATTAC TCCTGAATTA GTATTGATGT 9360
ATTGTGACGT AGTCGAAGGC CGATGGAATA TAAGTGCATG TGCTAAGTTA GATCCAAAAT 9420
TACAATCTAT GTATCAGAAA GGTAATAACC TGTGGGAAGT GATAGATAAA TTGTTTCCAA 9480
TTATGGGAGA AAAGACATTT GATGTGATAT CGTTATTAGA ACCACTTGCA TTATCCTTAA 9540
TTCAAACTCA TGATCCTGTT AAACAACTAA GAGGAGCTTT TTTAAATCAT GTGTTATCCG 9600
AGATGGAATT AATATTTGAA TCTAGAGAAT CGATTAAGGA ATTTCTGAGT GTAGATTACA 9660
TTGATAAAAT TTTAGATATA TTTAATAAGT CTACAATAGA TGAAATAGCA GAGATTTTCT 9720
CTTTTTTTAG AACATTTGGG CATCCTCCAT TAGAAGCTAG TATTGCAGCA GAAAAGGTTA 9780
GAAAATATAT GTATATTGGA AAACAATTAA AATTTGACAC TATTAATAAA TGTCATGCTA 9840
TCTTCTGTAC AATAATAATT AACGGATATA GAGAGAGGCA TGGTGGACAG TGGCCTCCTG 9900
TGACATTACC TGATCATGCA CACGAATTCA TCATAAATGC TTACGGTTCA AACTCTGCGA 9960
TATCATATGA GAATGCTGTT GATTATTACC AGAGCTTTAT AGGAATAAAA TTCAATAAAT 10020
TCATAGAGCC TCAGTTAGAT GAGGATTTGA CAATTTATAT GAAAGATAAA GCATTATCTC 10080
CAAAAAAATC AAATTGGGAC ACAGTTTATC CTGCATCTAA TTTACTGTAC CGTACTAACG 10140
CATCCAACGA ATCACGAAGA TTAGTTGAAG TATTTATAGC AGATAGTAAA TTTGATCCTC 10200
ATCAGATATT GGATTATGTA GAATCTGGGG ACTGGTTAGA TGATCCAGAA TTTAATATTT 10260
CTTATAGTCT TAAAGAAAAA GAGATCAAAC AGGAAGGTAG ACTCTTTGCA AAAATGACAT 10320
ACAAAATGAG AGCTACACAA GTTTTATCAG AGACACTACT TGCAAATAAC ATAGGAAAAT 10380
TCTTTCAAGA AAATGGGATG GTGAAGGGAG AGATTGAATT ACTTAAGAGA TTAACAACCA 10440
TATCAATATC AGGAGTTCCA CGGTATAATG AAGTGTACAA TAATTCTAAA AGCCATACAG 10500
ATGACCTTAA AACCTACAAT AAAATAAGTA ATCTTAATTT GTCTTCTAAT CAGAAATCAA 10560
AGAAATTTGA ATTCAAGTCA ACGGATATCT ACAATGATGG ATACGAGACT GTGAGCTGTT 10620
TCCTAACAAC AGATCTCAAA AAATACTGTC TTAATTGGAG ATATGAATCA ACAGCTCTAT 10680
TTGGAGAAAC TTGCAACCAA ATATTTGGAT TAAATAAATT GTTTAATTGG TTACACCCTC 10740
GTCTTGAAGG AAGTACAATC TATGTAGGTG ATCCTTACTG TCCTCCATCA GATAAAGAAC 10800
ATATATCATT AGAGGATCAC CCTGATTCTG GTTTTTACGT TCATAACCCA AGAGGGGGTA 10860
TAGAAGGATT TTGTCAAAAA TTATGGACAC TCATATCTAT AAGTGCAATA CATCTAGCAG 10920
CTGTTAGAAT AGGCGTGAGG GTGACTGCAA TGGTTCAAGG AGACAATCAA GCTATAGCTG 10980
TAACCACAAG AGTACCCAAC AATTATGACT ACAGAGTTAA GAAGGAGATA GTTTATAAAG 11040
ATGTAGTGAG ATTTTTTGAT TCATTAAGAG AAGTGATGGA TGATCTAGGT CATGAACTTA 11100
AATTAAATGA AACGATTATA AGTAGCAAGA TGTTCATATA TAGCAAAAGA ATCTATTATG 11160
ATGGGAGAAT TCTTCCTCAA GCTCTAAAAG CATTATCTAG ATGTGTCTTC TGGTCAGAGA 11220
CAGTAATAGA CGAAACAAGA TCAGCATCTT CAAATTTGGC AACATCATTT GCAAAAGCAA 11280
TTGAGAATGG TTATTCACCT GTTCTAGGAT ATGCATGCTC AATTTTTAAG AACATTCAAC 11340
AACTATATAT TGCCCTTGGG ATGAATATCA ATCCAACTAT AACACAGAAT ATCAGAGATC 11400
AGTATTTTAG GAATCCAAAT TGGATGCAAT ATGCCTCTTT AATACCTGCT AGTGTTGGGG 11460
GATTCAATCA CATGGCCATG TCAAGATGTT TTGTAAGGAA TATTGGTGAT CCATCAGTTG 11520
CCGCATTGGC TGATATTAAA AGATTTATTA AGGCGAATCT ATTAGACCGA AGTGTTCTTT 11580
ATAGGATTAT GAATCAAGAA CCAGGTGAGT CATCTTTTTT TGACTGGGCT TCAGATCCAT 11640
ATTCATGCAA TTTACCACAA TCTCAAAATA TAACCACCAT GATAAAAAAT ATAACAGCAA 11700
GGAATGTATT ACAAGATTCA CCAAATCCAT TATTATCTGG ATTATTCACA AATACAATGA 11760
TAGAAGAAGA TGAAGAATTA GCTGAGTTCC TGATGGACAG GAAGGTAATT CTCCCTAGAG 11820
TTGCACATGA TATTCTAGAT AATTCTCTCA CAGGAATTAG AAATGCCATA GCTGGAATGT 11880
TAGATACGAC AAAATCACTA ATTCGGGTTG GCATAAATAG AGGAGGACTG ACATATAGTT 11940
TGTTGAGGAA AATCAGTAAT TACGATCTAG TACAATATGA AACACTAAGT AGGACTTTGC 12000
GACTAATTGT AAGTGATAAA ATCAAGTATG AAGATATGTG TTCGGTAGAC CTTGCCATAG 12060
CATTGCGACA AAAGATGTGG ATTCATTTAT CAGGAGGAAG GATGATAAGT GGACTTGAAA 12120
CGCCTGACCC ATTAGAATTA CTATCTGGGG TAGTAATAAC AGGATCAGAA CATTGTAAAA 12180
TATGTTATTC TTCAGATGGC ACAAACCCAT ATACTTGGAT GTATTTACCC GGTAATATCA 12240
AAATAGGATC AGCAGAAACA GGTATATCGT CATTAAGAGT TCCTTATTTT GGATCAGTCA 12300
CTGATGAAAG ATCTGAAGCA CAATTAGGAT ATATCAAGAA TCTTAGTAAA CCTGCAAAAG 12360
CCGCAATAAG AATAGCAATG ATATATACAT GGGCATTTGG TAATGATGAG ATATCTTGGA 12420
TGGAAGCCTC ACAGATAGCA CAAACACGTG CAAATTTTAC ACTAGATAGT CTCAAAATTT 12480
TAACACCGGT AGCTACATCA ACAAATTTAT CACACAGATT TAAGGATACT GCAACTCAGA 12540
TGAAATTCTC CAGTACATCA TTGATCAGAG TCAGCAGATT TATAACAATG TCCAATGATA 12600
ACATGTCTAT CAAAGAAGCT AATGAAACCA AAGATACTAA TCTTATTTAT CAACAAATAA 12660
TGTTAACAGG ATTAAGTGTT TTCGAATATT TATTTAGATT AAAAGAAACC ACAGGACACA 12720
ACCCTATAGT TATGCATCTG CACATAGAAG ATGAGTGTTG TATTAAAGAA AGTTTTAATG 12780
ATGAACATAT TAATCCAGAG TCTACATTAG AATTAATTCG ATATCCTGAA AGTAATGAAT 12840
TTATTTATGA TAAAGACCCA CTCAAAGATG TGGACTTATC AAAACTTATG GTTATTAAAG 12900
ACCATTCTTA CACAATTGAT ATGAATTATT GGGATGATAC TGACATCATA CATGCAATTT 12960
CAATATGTAC TGCAATTACA ATAGCAGATA CTATGTCACA ATTAGATCGA GATAATTTAA 13020
AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG CTTAATCACT GAATTTTTGA 13080
CTCTTGACAT ACTTGTATTT CTCAAGACAT TTGGTGGATT ATTAGTAAAT CAATTTGCAT 13140
ACACTCTTTA TAGTCTAAAA ATAGAAGGTA GGGATCTCAT TTGGGATTAT ATAATGAGAA 13200
CACTGAGAGA TACTTCCCAT TCAATATTAA AAGTATTATC TAATGCATTA TCTCATCCTA 13260
AAGTATTCAA GAGGTTCTGG GATTGTGGAG TTTTAAACCC TATTTATGGT CCTAATATTG 13320
CTAGTCAAGA CCAGATAAAA CTTGCCCTAT CTATATGTGA ATATTCACTA GATCTATTTA 13380
TGAGAGAATG GTTGAATGGT GTATCACTTG AAATATACAT TTGTGACAGC GATATGGAAG 13440
TTGCAAATGA TAGGAAACAA GCCTTTATTT CTAGACACCT TTCATTTGTT TGTTGTTTAG 13500
CAGAAATTGC ATCTTTCGGA CCTAACCTGT TAAACTTAAC ATACTTGGAG AGACTTGATC 13560
TATTGAAACA ATATCTTGAA TTAAATATTA AAGAAGACCC TACTCTTAAA TATGTACAAA 13620
TATCTGGATT ATTAATTAAA TCGTTCCCAT CAACTGTAAC ATACGTAAGA AAGACTGCAA 13680
TCAAATATCT AAGGATTCGC GGTATTAGTC CACCTGAGGT AATTGATGAT TGGGATCCGG 13740
TAGAAGATGA AAATATGCTG GATAACATTG TCAAAACTAT AAATGATAAC TGTAATAAAG 13800
ATAATAAAGG GAATAAAATT AACAATTTCT GGGGACTAGC ACTTAAGAAC TATCAAGTCC 13860
TTAAAATCAG ATCTATAACA AGTGATTCTG ATGATAATGA TAGACTAGAT GCTAATACAA 13920
GTGGTTTGAC ACTTCCTCAA GGAGGGAATT ATCTATCGCA TCAATTGAGA TTATTCGGAA 13980
TCAACAGCAC TAGTTGTCTG AAAGCTCTTG AGTTATCACA AATTTTAATG AAGGAAGTCA 14040
ATAAAGACAA GGACAGGCTC TTCCTGGGAG AAGGAGCAGG AGCTATGCTA GCATGTTATG 14100
ATGCCACATT AGGACCTGCA GTTAATTATT ATAATTCAGG TTTGAATATA ACAGATGTAA 14160
TTGGTCAACG AGAATTGAAA ATATTTCCTT CAGAGGTATC ATTAGTAGGT AAAAAATTAG 14220
GAAATGTGAC ACAGATTCTT AACAGGGTAA AAGTACTGTT CAATGGGAAT CCTAATTCAA 14280
CATGGATAGG AAATATGGAA TGTGAGAGCT TAATATGGAG TGAATTAAAT GATAAGTCCA 14340
TTGGATTAGT ACATTGTGAT ATGGAAGGAG CTATCGGTAA ATCAGAAGAA ACTGTTCTAC 14400
ATGAACATTA TAGTGTTATA AGAATTACAT ACTTGATTGG GGATGATGAT GTTGTTTTAG 14460
TTTCCAAAAT TATACCTACA ATCACTCCGA ATTGGTCTAG AATACTTTAT CTATATAAAT 14520
TATATTGGAA AGATGTAAGT ATAATATCAC TCAAAACTTC TAATCCTGCA TCAACAGAAT 14580
TATATCTAAT TTCGAAAGAT GCATATTGTA CTATAATGGA ACCTAGTGAA ATTGTTTTAT 14640
CAAAACTTAA AAGATTGTCA CTCTTGGAAG AAAATAATCT ATTAAAATGG ATCATTTTAT 14700
CAAAGAAGAG GAATAATGAA TGGTTACATC ATGAAATCAA AGAAGGAGAA AGAGATTATG 14760
GAATCATGAG ACCATATCAT ATGGCACTAC AAATCTTTGG ATTTCAAATC AATTTAAATC 14820
ATCTGGCGAA AGAATTTTTA TCAACCCCAG ATCTGACTAA TATCAACAAT ATAATCCAAA 14880
GTTTTCAGCG AACAATAAAG GATGTTTTAT TTGAATGGAT TAATATAACT CATGATGATA 14940
AGAGACATAA ATTAGGCGGA AGATATAACA TATTCCCACT GAAAAATAAG GGAAAGTTAA 15000
GACTGCTATC GAGAAGACTA GTATTAAGTT GGATTTCATT ATCATTATCG ACTCGATTAC 15060
TTACAGGTCG CTTTCCTGAT GAAAAATTTG AACATAGAGC ACAGACTGGA TATGTATCAT 15120
TAGCTGATAC TGATTTAGAA TCATTAAAGT TATTGTCGAA AAACATCATT AAGAATTACA 15180
GAGAGTGTAT AGGATCAATA TCATATTGGT TTCTAACCAA AGAAGTTAAA ATACTTATGA 15240
AATTGATTGG TGGTGCTAAA TTATTAGGAA TTCCCAGACA ATATAAAGAA CCCGAAGACC 15300
AGTTATTAGA AAACTACAAT CAACATGATG AATTTGATAT CGATTAAAAC ATAAATACAA 15360
TGAAGATATA TCCTAACCTT TATCTTTAAG CCTAGGAATA GACAAAAAGT AAGAAAAACA 15420
TGTAATATAT ATATACCAAA CAGAGTTCTT CTCTTGTTTG GT 15462
(2) Information for SEQ ID NO:22:
...
(i) sequence characteristics:
(A) length: 2233 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:22:
Met Asp Thr Glu Ser Asn Asn Gly Thr Val Ser Asp Ile Leu Tyr Pro
1               5                   10                  15
Glu Cys His Leu Asn Ser Pro Ile Val Lys Gly Lys Ile Ala Gln Leu
20                  25                  30
His Thr Ile Met Ser Leu Pro Gln Pro Tyr Asp Met Asp Asp Asp Ser
35                  40                  45
Ile Leu Val Ile Thr Arg Gln Lys Ile Lys Leu Asn Lys Leu Asp Lys
50                  55                  60
Arg Gln Arg Ser Ile Arg Arg Leu Lys Leu Ile Leu Thr Glu Lys Val
65                  70                  75                  80
Asn Asp Leu Gly Lys Tyr Thr Phe Ile Arg Tyr Pro Glu Met Ser Lys
85                  90                  95
Glu Met Phe Lys Leu Tyr Ile Pro Gly Ile Asn Ser Lys Val Thr Glu
100                 105                 110
Leu Leu Leu Lys Ala Asp Arg Thr Tyr Ser Gln Met Thr Asp Gly Leu
115                 120                 125
Arg Asp Leu Trp Ile Asn Val Leu Ser Lys Leu Ala Ser Lys Asn Asp
130                 135                 140
Gly Ser Asn Tyr Asp Leu Asn Glu Glu Ile Asn Asn Ile Ser Lys Val
145                 150                 155                 160
His Thr Thr Tyr Lys Ser Asp Lys Trp Tyr Asn Pro Phe Lys Thr Trp
165                 170                 175
Phe Thr Ile Lys Tyr Asp Met Arg Arg Leu Gln Lys Ala Arg Asn Glu
180                 185                 190
Ile Thr Phe Asn Val Gly Lys Asp Tyr Asn Leu Leu Glu Asp Gln Lys
195                 200                 205
Asn Phe Leu Leu Ile His Pro Glu Leu Val Leu Ile Leu Asp Lys Gln
210                 215                 220
Asn Tyr Asn Gly Tyr Leu Ile Thr Pro Glu Leu Val Leu Met Tyr Cys
225                 230                 235                 240
Asp Val Val Glu Gly Arg Trp Asn Ile Ser Ala Cys Ala Lys Leu Asp
245                 250                 255
Pro Lys Leu Gln Ser Met Tyr Gln Lys Gly Asn Asn Leu Trp Glu Val
260                 265                 270
Ile Asp Lys Leu Phe Pro Ile Met Gly Glu Lys Thr Phe Asp Val Ile
275                 280                 285
Ser Leu Leu Glu Pro Leu Ala Leu Ser Leu Ile Gln Thr His Asp Pro
290                 295                 300
Val Lys Gln Leu Arg Gly Ala Phe Leu Asn His Val Leu Ser Glu Met
305                 310                 315                 320
Glu Leu Ile Phe Glu Ser Arg Glu Ser Ile Lys Glu Phe Leu Ser Val
325                 330                 335
Asp Tyr Ile Asp Lys Ile Leu Asp Ile Phe Asn Lys Ser Thr Ile Asp
340                 345                 350
Glu Ile Ala Glu Ile Phe Ser Phe Phe Arg Thr Phe Gly His Pro Pro
355                 360                 365
Leu Glu Ala Ser Ile Ala Ala Glu Lys Val Arg Lys Tyr Met Tyr Ile
370                 375                 380
Gly Lys Gln Leu Lys Phe Asp Thr Ile Asn Lys Cys His Ala Ile Phe
385                 390                 395                 400
Cys Thr Ile Ile Ile Asn Gly Tyr Arg Glu Arg His Gly Gly Gln Trp
405                 410                 415
Pro Pro Val Thr Leu Pro Asp His Ala His Glu Phe Ile Ile Asn Ala
420                 425                 430
Tyr Gly Ser Asn Ser Ala Ile Ser Tyr Glu Asn Ala Val Asp Tyr Tyr
435                 440                 445
Gln Ser Phe Ile Gly Ile Lys Phe Asn Lys Phe Ile Glu Pro Gln Leu
450                 455                 460
Asp Glu Asp Leu Thr Ile Tyr Met Lys Asp Lys Ala Leu Ser Pro Lys
465                 470                 475                 480
Lys Ser Asn Trp Asp Thr Val Tyr Pro Ala Ser Asn Leu Leu Tyr Arg
485?????????????????490?????????????????495Thr?Asn?Ala?Ser?Asn?Glu?Ser?Arg?Arg?Leu?Val?Glu?Val?Phe?Ile?Ala
500?????????????????505?????????????????510Asp?Ser?Lys?Phe?Asp?Pro?His?Gln?Ile?Leu?Asp?Tyr?Val?Glu?Ser?Gly
        515                 520                 525
Asp Trp Leu Asp Asp Pro Glu Phe Asn Ile Ser Tyr Ser Leu Lys Glu
    530                 535                 540
Lys Glu Ile Lys Gln Glu Gly Arg Leu Phe Ala Lys Met Thr Tyr Lys
545                 550                 555                 560
Met Arg Ala Thr Gln Val Leu Ser Glu Thr Leu Leu Ala Asn Asn Ile
                565                 570                 575
Gly Lys Phe Phe Gln Glu Asn Gly Met Val Lys Gly Glu Ile Glu Leu
            580                 585                 590
Leu Lys Arg Leu Thr Thr Ile Ser Ile Ser Gly Val Pro Arg Tyr Asn
        595                 600                 605
Glu Val Tyr Asn Asn Ser Lys Ser His Thr Asp Asp Leu Lys Thr Tyr
    610                 615                 620
Asn Lys Ile Ser Asn Leu Asn Leu Ser Ser Asn Gln Lys Ser Lys Lys
625                 630                 635                 640
Phe Glu Phe Lys Ser Thr Asp Ile Tyr Asn Asp Gly Tyr Glu Thr Val
                645                 650                 655
Ser Cys Phe Leu Thr Thr Asp Leu Lys Lys Tyr Cys Leu Asn Trp Arg
            660                 665                 670
Tyr Glu Ser Thr Ala Leu Phe Gly Glu Thr Cys Asn Gln Ile Phe Gly
        675                 680                 685
Leu Asn Lys Leu Phe Asn Trp Leu His Pro Arg Leu Glu Gly Ser Thr
    690                 695                 700
Ile Tyr Val Gly Asp Pro Tyr Cys Pro Pro Ser Asp Lys Glu His Ile
705                 710                 715                 720
Ser Leu Glu Asp His Pro Asp Ser Gly Phe Tyr Val His Asn Pro Arg
                725                 730                 735
Gly Gly Ile Glu Gly Phe Cys Gln Lys Leu Trp Thr Leu Ile Ser Ile
            740                 745                 750
Ser Ala Ile His Leu Ala Ala Val Arg Ile Gly Val Arg Val Thr Ala
        755                 760                 765
Met Val Gln Gly Asp Asn Gln Ala Ile Ala Val Thr Thr Arg Val Pro
    770                 775                 780
Asn Asn Tyr Asp Tyr Arg Val Lys Lys Glu Ile Val Tyr Lys Asp Val
785                 790                 795                 800
Val Arg Phe Phe Asp Ser Leu Arg Glu Val Met Asp Asp Leu Gly His
                805                 810                 815
Glu Leu Lys Leu Asn Glu Thr Ile Ile Ser Ser Lys Met Phe Ile Tyr
            820                 825                 830
Ser Lys Arg Ile Tyr Tyr Asp Gly Arg Ile Leu Pro Gln Ala Leu Lys
        835                 840                 845
Ala Leu Ser Arg Cys Val Phe Trp Ser Glu Thr Val Ile Asp Glu Thr
    850                 855                 860
Arg Ser Ala Ser Ser Asn Leu Ala Thr Ser Phe Ala Lys Ala Ile Glu
865                 870                 875                 880
Asn Gly Tyr Ser Pro Val Leu Gly Tyr Ala Cys Ser Ile Phe Lys Asn
                885                 890                 895
Ile Gln Gln Leu Tyr Ile Ala Leu Gly Met Asn Ile Asn Pro Thr Ile
            900                 905                 910
Thr Gln Asn Ile Arg Asp Gln Tyr Phe Arg Asn Pro Asn Trp Met Gln
        915                 920                 925
Tyr Ala Ser Leu Ile Pro Ala Ser Val Gly Gly Phe Asn His Met Ala
    930                 935                 940
Met Ser Arg Cys Phe Val Arg Asn Ile Gly Asp Pro Ser Val Ala Ala
945                 950                 955                 960
Leu Ala Asp Ile Lys Arg Phe Ile Lys Ala Asn Leu Leu Asp Arg Ser
                965                 970                 975
Val Leu Tyr Arg Ile Met Asn Gln Glu Pro Gly Glu Ser Ser Phe Phe
            980                 985                 990
Asp Trp Ala Ser Asp Pro Tyr Ser Cys Asn Leu Pro Gln Ser Gln Asn
        995                 1000                1005
Ile Thr Thr Met Ile Lys Asn Ile Thr Ala Arg Asn Val Leu Gln Asp
    1010                1015                1020
Ser Pro Asn Pro Leu Leu Ser Gly Leu Phe Thr Asn Thr Met Ile Glu
1025                1030                1035                1040
Glu Asp Glu Glu Leu Ala Glu Phe Leu Met Asp Arg Lys Val Ile Leu
                1045                1050                1055
Pro Arg Val Ala His Asp Ile Leu Asp Asn Ser Leu Thr Gly Ile Arg
            1060                1065                1070
Asn Ala Ile Ala Gly Met Leu Asp Thr Thr Lys Ser Leu Ile Arg Val
        1075                1080                1085
Gly Ile Asn Arg Gly Gly Leu Thr Tyr Ser Leu Leu Arg Lys Ile Ser
    1090                1095                1100
Asn Tyr Asp Leu Val Gln Tyr Glu Thr Leu Ser Arg Thr Leu Arg Leu
1105                1110                1115                1120
Ile Val Ser Asp Lys Ile Lys Tyr Glu Asp Met Cys Ser Val Asp Leu
                1125                1130                1135
Ala Ile Ala Leu Arg Gln Lys Met Trp Ile His Leu Ser Gly Gly Arg
            1140                1145                1150
Met Ile Ser Gly Leu Glu Thr Pro Asp Pro Leu Glu Leu Leu Ser Gly
        1155                1160                1165
Val Val Ile Thr Gly Ser Glu His Cys Lys Ile Cys Tyr Ser Ser Asp
    1170                1175                1180
Gly Thr Asn Pro Tyr Thr Trp Met Tyr Leu Pro Gly Asn Ile Lys Ile
1185                1190                1195                1200
Gly Ser Ala Glu Thr Gly Ile Ser Ser Leu Arg Val Pro Tyr Phe Gly
                1205                1210                1215
Ser Val Thr Asp Glu Arg Ser Glu Ala Gln Leu Gly Tyr Ile Lys Asn
            1220                1225                1230
Leu Ser Lys Pro Ala Lys Ala Ala Ile Arg Ile Ala Met Ile Tyr Thr
        1235                1240                1245
Trp Ala Phe Gly Asn Asp Glu Ile Ser Trp Met Glu Ala Ser Gln Ile
    1250                1255                1260
Ala Gln Thr Arg Ala Asn Phe Thr Leu Asp Ser Leu Lys Ile Leu Thr
1265                1270                1275                1280
Pro Val Ala Thr Ser Thr Asn Leu Ser His Arg Phe Lys Asp Thr Ala
                1285                1290                1295
Thr Gln Met Lys Phe Ser Ser Thr Ser Leu Ile Arg Val Ser Arg Phe
            1300                1305                1310
Ile Thr Met Ser Asn Asp Asn Met Ser Ile Lys Glu Ala Asn Glu Thr
        1315                1320                1325
Lys Asp Thr Asn Leu Ile Tyr Gln Gln Ile Met Leu Thr Gly Leu Ser
    1330                1335                1340
Val Phe Glu Tyr Leu Phe Arg Leu Lys Glu Thr Thr Gly His Asn Pro
1345                1350                1355                1360
Ile Val Met His Leu His Ile Glu Asp Glu Cys Cys Ile Lys Glu Ser
                1365                1370                1375
Phe Asn Asp Glu His Ile Asn Pro Glu Ser Thr Leu Glu Leu Ile Arg
            1380                1385                1390
Tyr Pro Glu Ser Asn Glu Phe Ile Tyr Asp Lys Asp Pro Leu Lys Asp
        1395                1400                1405
Val Asp Leu Ser Lys Leu Met Val Ile Lys Asp His Ser Tyr Thr Ile
    1410                1415                1420
Asp Met Asn Tyr Trp Asp Asp Thr Asp Ile Ile His Ala Ile Ser Ile
1425                1430                1435                1440
Cys Thr Ala Ile Thr Ile Ala Asp Thr Met Ser Gln Leu Asp Arg Asp
                1445                1450                1455
Asn Leu Lys Glu Ile Ile Val Ile Ala Asn Asp Asp Asp Ile Asn Ser
            1460                1465                1470
Leu Ile Thr Glu Phe Leu Thr Leu Asp Ile Leu Val Phe Leu Lys Thr
        1475                1480                1485
Phe Gly Gly Leu Leu Val Asn Gln Phe Ala Tyr Thr Leu Tyr Ser Leu
    1490                1495                1500
Lys Ile Glu Gly Arg Asp Leu Ile Trp Asp Tyr Ile Met Arg Thr Leu
1505                1510                1515                1520
Arg Asp Thr Ser His Ser Ile Leu Lys Val Leu Ser Asn Ala Leu Ser
                1525                1530                1535
His Pro Lys Val Phe Lys Arg Phe Trp Asp Cys Gly Val Leu Asn Pro
            1540                1545                1550
Ile Tyr Gly Pro Asn Ile Ala Ser Gln Asp Gln Ile Lys Leu Ala Leu
        1555                1560                1565
Ser Ile Cys Glu Tyr Ser Leu Asp Leu Phe Met Arg Glu Trp Leu Asn
    1570                1575                1580
Gly Val Ser Leu Glu Ile Tyr Ile Cys Asp Ser Asp Met Glu Val Ala
1585                1590                1595                1600
Asn Asp Arg Lys Gln Ala Phe Ile Ser Arg His Leu Ser Phe Val Cys
                1605                1610                1615
Cys Leu Ala Glu Ile Ala Ser Phe Gly Pro Asn Leu Leu Asn Leu Thr
            1620                1625                1630
Tyr Leu Glu Arg Leu Asp Leu Leu Lys Gln Tyr Leu Glu Leu Asn Ile
        1635                1640                1645
Lys Glu Asp Pro Thr Leu Lys Tyr Val Gln Ile Ser Gly Leu Leu Ile
    1650                1655                1660
Lys Ser Phe Pro Ser Thr Val Thr Tyr Val Arg Lys Thr Ala Ile Lys
1665                1670                1675                1680
Tyr Leu Arg Ile Arg Gly Ile Ser Pro Pro Glu Val Ile Asp Asp Trp
                1685                1690                1695
Asp Pro Val Glu Asp Glu Asn Met Leu Asp Asn Ile Val Lys Thr Ile
            1700                1705                1710
Asn Asp Asn Cys Asn Lys Asp Asn Lys Gly Asn Lys Ile Asn Asn Phe
        1715                1720                1725
Trp Gly Leu Ala Leu Lys Asn Tyr Gln Val Leu Lys Ile Arg Ser Ile
    1730                1735                1740
Thr Ser Asp Ser Asp Asp Asn Asp Arg Leu Asp Ala Asn Thr Ser Gly
1745                1750                1755                1760
Leu Thr Leu Pro Gln Gly Gly Asn Tyr Leu Ser His Gln Leu Arg Leu
                1765                1770                1775
Phe Gly Ile Asn Ser Thr Ser Cys Leu Lys Ala Leu Glu Leu Ser Gln
            1780                1785                1790
Ile Leu Met Lys Glu Val Asn Lys Asp Lys Asp Arg Leu Phe Leu Gly
        1795                1800                1805
Glu Gly Ala Gly Ala Met Leu Ala Cys Tyr Asp Ala Thr Leu Gly Pro
    1810                1815                1820
Ala Val Asn Tyr Tyr Asn Ser Gly Leu Asn Ile Thr Asp Val Ile Gly
1825                1830                1835                1840
Gln Arg Glu Leu Lys Ile Phe Pro Ser Glu Val Ser Leu Val Gly Lys
                1845                1850                1855
Lys Leu Gly Asn Val Thr Gln Ile Leu Asn Arg Val Lys Val Leu Phe
            1860                1865                1870
Asn Gly Asn Pro Asn Ser Thr Trp Ile Gly Asn Met Glu Cys Glu Ser
        1875                1880                1885
Leu Ile Trp Ser Glu Leu Asn Asp Lys Ser Ile Gly Leu Val His Cys
    1890                1895                1900
Asp Met Glu Gly Ala Ile Gly Lys Ser Glu Glu Thr Val Leu His Glu
1905                1910                1915                1920
His Tyr Ser Val Ile Arg Ile Thr Tyr Leu Ile Gly Asp Asp Asp Val
                1925                1930                1935
Val Leu Val Ser Lys Ile Ile Pro Thr Ile Thr Pro Asn Trp Ser Arg
            1940                1945                1950
Ile Leu Tyr Leu Tyr Lys Leu Tyr Trp Lys Asp Val Ser Ile Ile Ser
        1955                1960                1965
Leu Lys Thr Ser Asn Pro Ala Ser Thr Glu Leu Tyr Leu Ile Ser Lys
    1970                1975                1980
Asp Ala Tyr Cys Thr Ile Met Glu Pro Ser Glu Ile Val Leu Ser Lys
1985                1990                1995                2000
Leu Lys Arg Leu Ser Leu Leu Glu Glu Asn Asn Leu Leu Lys Trp Ile
                2005                2010                2015
Ile Leu Ser Lys Lys Arg Asn Asn Glu Trp Leu His His Glu Ile Lys
            2020                2025                2030
Glu Gly Glu Arg Asp Tyr Gly Ile Met Arg Pro Tyr His Met Ala Leu
        2035                2040                2045
Gln Ile Phe Gly Phe Gln Ile Asn Leu Asn His Leu Ala Lys Glu Phe
    2050                2055                2060
Leu Ser Thr Pro Asp Leu Thr Asn Ile Asn Asn Ile Ile Gln Ser Phe
2065                2070                2075                2080
Gln Arg Thr Ile Lys Asp Val Leu Phe Glu Trp Ile Asn Ile Thr His
                2085                2090                2095
Asp Asp Lys Arg His Lys Leu Gly Gly Arg Tyr Asn Ile Phe Pro Leu
            2100                2105                2110
Lys Asn Lys Gly Lys Leu Arg Leu Leu Ser Arg Arg Leu Val Leu Ser
        2115                2120                2125
Trp Ile Ser Leu Ser Leu Ser Thr Arg Leu Leu Thr Gly Arg Phe Pro
    2130                2135                2140
Asp Glu Lys Phe Glu His Arg Ala Gln Thr Gly Tyr Val Ser Leu Ala
2145                2150                2155                2160
Asp Thr Asp Leu Glu Ser Leu Lys Leu Leu Ser Lys Asn Ile Ile Lys
                2165                2170                2175
Asn Tyr Arg Glu Cys Ile Gly Ser Ile Ser Tyr Trp Phe Leu Thr Lys
            2180                2185                2190
Glu Val Lys Ile Leu Met Lys Leu Ile Gly Gly Ala Lys Leu Leu Gly
        2195                2200                2205
Ile Pro Arg Gln Tyr Lys Glu Pro Glu Asp Gln Leu Leu Glu Asn Tyr
    2210                2215                2220
Asn Gln His Asp Glu Phe Asp Ile Asp
2225                2230
(2) Information for SEQ ID NO: 23:
(i) sequence characteristics:
(A) length: 15218 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO: 23:
ACGCGAAAAA ATGCGTACTA CAAACTTGCA CATTCGAAAA AAATGGGGCA AATAAGAACT 60
TGATAAGTGC TATTTAAGTC TAACCTTTTC AATCAGAAAT GGGGTGCAAT TCACTGAGCA 120
TGATAAAGGT TAGATTACAA AATTTATTTG ACAATGACGA AGTAGCATTG TTAAAAATAA 180
CATGTTATAC TGATAAATTA ATTCTTCTGA CCAATGCATT AGCCAAAGCA GCAATACATA 240
CAATTAAATT AAACGGCATA GTTTTTATAC ATGTTATAAC AAGCAGTGAA GTGTGCCCTG 300
ATAACAATAT TGTAGTGAAA TCTAACTTTA CAACAATGCC AATACTACAA AATGGAGGAT 360
ACATATGGGA ATTGATTGAG TTGACACACT GCTCTCAATT AAACGGTTTA ATGGATGATA 420
ATTGTGAAAT CAAATTTTCT AAAAGACTAA GTGACTCAGT AATGACTAAT TATATGAATC 480
AAATATCTGA CTTACTTGGG CTTGATCTCA ATTCATGAAT TATGTTTAGT CTAATTCAAT 540
AGACATGTGT TTATTACCAT TTTAGTTAAT ATAAAAACTC ATCAAAGGGA AATGGGGCAA 600
ATAAACTCAC CTAATCAATC AAACCATGAG CACTACAAAT GACAACACTA CTATGCAAAG 660
ATTGATGATC ACAGACATGA GACCCCTGTC AATGGATTCA ATAATAACAT CTCTTACCAA 720
AGAAATCATC ACACACAAAT TCATATACTT GATAAACAAT GAATGTATTG TAAGAAAACT 780
TGATGAAAGA CAAGCTACAT TTACATTCTT AGTCAATTAT GAGATGAAGC TACTGCACAA 840
AGTAGGGAGT ACCAAATACA AAAAATACAC TGAATATAAT ACAAAATATG GCACTTTCCC 900
CATGCCTATA TTTATCAATC ACGGCGGGTT TCTAGAATGT ATTGGCATTA AGCCTACAAA 960
ACACACTCCT ATAATATACA AATATGACCT CAACCCGTGA ATTCCAACAA AAAAACCAAC 1020
CCAACCAAAC CAAACTATTC CTCAAACAAC AGTGCTCAAT AGTTAAGAAG GAGCTAATCC 1080
ATTTTAGTAA TTAAAAATAA AAGTAAAGCC AATAACATAA ATTGGGGCAA ATACAAAGAT 1140
GGCTCTTAGC AAAGTCAAGT TGAATGATAC ATTAAATAAG GATCAGCTGC TGTCATCCAG 1200
CAAATACACT ATTCAACGTA GTACAGGAGA TAATATTGAC ACTCCCAATT ATGATGTGCA 1260
AAAACACCTA AACAAACTAT GTGGTATGCT ATTAATCACT GAAGATGCAA ATCATAAATT 1320
CACAGGATTA ATAGGTATGT TATATGCTAT GTCCAGGTTA GGAAGGGAAG ACACTATAAA 1380
GATACTTAAA GATGCTGGAT ATCATGTTAA AGCTAATGGA GTAGATATAA CAACATATCG 1440
TCAAGATATA AATGGAAAGG AAATGAAATT CGAAGTATTA ACATTATCAA GCTTGACATC 1500
AGAAATACAA GTCAATATTG AGATAGAATC TAGAAAGTCC TACAAAAAAA TGCTAAAAGA 1560
GATGGGAGAA GTGGCTCCAG AATATAGGCA TGATTCTCCA GACTGTGGGA TGATAATACT 1620
GTGTATAGCT GCACTTGTGA TAACCAAATT AGCAGCAGGA GACAGATCAG GTCTTACAGC 1680
AGTAATTAGG AGGGCAAACA ATGTCTTAAA AAACGAAATA AAACGATACA AGGGCCTCAT 1740
ACCAAAGGAT ATAGCTAACA GTTTTTATGA AGTGTTTGAA AAACACCCTC ATCTTATAGA 1800
TGTTTTCGTG CACTTTGGCA TTGCACAATC ATCCACAAGA GGGGGTAGTA GAGTTGAAGG 1860
AATCTTTGCA GGATTGTTTA TGAATGCCTA TGGTTCAGGG CAAGTAATGC TAAGATGGGG 1920
AGTTTTAGCC AAATCTGTAA AAAATATCAT GCTAGGACAT GCTAGTGTCC AGGCAGAAAT 1980
GGAGCAAGTT GTGGAAGTCT ATGAGTATGC ACAGAAGTTG GGAGGAGAAG CTGGATTCTA 2040
CCATATATTG AACAATCCAA AAGCATCATT GCTGTCATTA ACTCAATTTC CCAACTTCTC 2100
AAGTGTGGTC CTAGGCAATG CAGCAGGTCT AGGCATAATG GGAGAGTATA GAGGTACACC 2160
AAGAAACCAG GATCTTTATG ATGCAGCTAA AGCATATGCA GAGCAACTCA AAGAAAATGG 2220
AGTAATAAAC TACAGTGTAT TAGACTTAAC AGCAGAAGAA TTGGAAGCCA TAAAGCATCA 2280
ACTCAACCCC AAAGAAGATG ATGTAGAGCT TTAAGTTAAC AAAAAATACG GGGCAAATAA 2340
GTCAACATGG AGAAGTTTGC ACCTGAATTT CATGGAGAAG ATGCAAATAA CAAAGCTACC 2400
AAATTCCTAG AATCAATAAA GGGCAAGTTC GCATCATCCA AAGATCCTAA GAAGAAAGAT 2460
AGCATAATAT CTGTTAACTC AATAGATATA GAAGTAACTA AAGAGAGCCC GATAACATCT 2520
GGCACCAACA TCATCAATCC AACAAGTGAA GCCGACAGTA CCCCAGAAAC AAAAGCCAAC 2580
TACCCAAGAA AACCCCTAGT AAGCTTCAAA GAAGATCTCA CCCCAAGTGA CAACCCTTTT 2640
TCTAAGTTGT ACAAGGAAAC AATAGAAACA TTTGATAACA ATGAAGAAGA ATCTAGCTAC 2700
TCATATGAAG AGATAAATGA TCAAACAAAT GACAACATTA CAGCAAGACT AGATAGAATT 2760
GATGAAAAAT TAAGTGAAAT ATTAGGAATG CTCCATACAT TAGTAGTTGC AAGTGCAGGA 2820
CCCACTTCAG CTCGCGATGG AATAAGAGAT GCTATGGTTG GTCTAAGAGA AGAGATGATA 2880
GAAAAAATAA GAGCGGAAGC ATTAATGACC AATGATAGGT TAGAGGCTAT GGCAAGACTT 2940
AGGAATGAGG AAAGCGAAAA AATGGCAAAA GACACCTCAG ATGAAGTGTC TCTTAATCCA 3000
ACTTCCAAAA AATTGAGTGA CTTGTTGGAA GACAACGATA GTGACAATGA TCTATCACTT 3060
GATGATTTTT GATCAGCGAT CAACTCACTC AGCAATCAAC AACATCAATA AAACAGACAT 3120
CAATCCATTG AATCAACTGC CAGACCGAAC AAACAAACGT CCATCAGTAG AACCACCAAC 3180
CAATCAATCA ACCAATTGAT CAATCAGCAA CCCGACAAAA TTAACAATAT AGTAACAAAA 3240
AAAGAACAAG ATGGGGCAAA TATGGAAACA TACGTGAACA AGCTTCACGA AGGCTCCACA 3300
TACACAGCAG CTGTTCAGTA CAATGTTCTA GAAAAAGATG ATGATCCTGC ATCACTAACA 3360
ATATGGGTGC CTATGTTCCA GTCATCTGTG CCAGCAGACT TGCTCATAAA AGAACTTGCA 3420
AGCATCAATA TACTAGTGAA GCAGATCTCT ACGCCCAAAG GACCTTCACT ACGAGTCACG 3480
ATTAACTCAA GAAGTGCTGT GCTGGCTCAA ATGCCTAGTA ATTTCATCAT AAGCGCAAAT 3540
GTATCATTAG ATGAAAGAAG CAAATTAGCA TATGATGTAA CTACACCTTG TGAAATCAAA 3600
GCATGCAGTC TAACATGCTT AAAAGTAAAA AGTATGTTAA CTACAGTCAA AGATCTTACC 3660
ATGAAGACAT TCAACCCCAC TCATGAGATC ATTGCTCTAT GTGAATTTGA AAATATTATG 3720
ACATCAAAAA GAGTAATAAT ACCAACCTAT CTAAGATCAA TTAGTGTCAA GAACAAGGAT 3780
CTGAACTCAC TAGAAAATAT AGCAACCACC GAATTCAAAA ATGCTATCAC CAATGCAAAA 3840
ATTATTCCTT ATGCAGGATT AGTGTTAGTT ATCACAGTTA CTGACAATAA AGGAGCATTC 3900
AAATATATCA AACCACAGAG TCAATTTATA GTAGATCTTG GTGCCTACCT AGAAAAAGAG 3960
AGCATATATT ATGTGACTAC TAATTGGAAG CATACAGCTA CACGTTTTTC AATCAAACCA 4020
CTAGAGGATT AAACTTAATT ATCAACACTG AATGACAGGT CCACATATAT CCTCAAACTA 4080
CACACTATAT CCAAACATCA TAAACATCTA CACTACACAC TTCATCACAC AAACCAATCC 4140
CACTCAAAAT CCAAAATCAC TACCAGCCAC TATCTGCTAG ACCTAGAGTG CGAATAGGTA 4200
AATAAAACCA AAATATGGGG TAAATAGACA TTAGTTAGAG TTCAATCAAT CTTAACAACC 4260
ATTTATACCG CCAATTCAAC ACATATACTA TAAATCTTAA AATGGGAAAT ACATCCATCA 4320
CAATAGAATT CACAAGCAAA TTTTGGCCCT ATTTTACACT AATACATATG ATCTTAACTC 4380
TAATCTTTTT ACTAATTATA ATCACTATTA TGATTGCAAT ACTAAATAAG CTAAGTGAAC 4440
ATAAAGCATT CTGTAACAAA ACTCTTGAAC TAGGACAGAT GTATCAAATC AACACATAGA 4500
GTTCTACCAT TATGCTGTGT CAAATTATAA TCCTGTATAT ATAAACAAAC AAATCCAATC 4560
TTCTCACAGA GTCATGGTGT CGCAAAACCA CGCTAACTAT CATGGTAGCA TAGAGTAGTT 4620
ATTTAAAAAT TAACATAATG ATGAATTGTT AGTATGAGAT CAAAAACAAC ATTGGGGCAA 4680
ATGCAACCAT GTCCAAACAC AAGAATCAAC GCACTGCCAG GACTCTAGAA AAGACCTGGG 4740
ATACTCTTAA TCATCTAATT GTAATATCCT CTTGTTTATA CAGATTAAAT TTAAAATCTA 4800
TAGCACAAAT AGCACTATCA GTTTTGGCAA TGATAATCTC AACCTCTCTC ATAATTGCAG 4860
CCATAATATT CATCATCTCT GCCAATCACA AAGTTACACT AACAACGGTC ACAGTTCAAA 4920
CAATAAAAAA CCACACTGAA AAAAACATCA CCACCTACCC TACTCAAGTC TCACCAGAAA 4980
GGGTTAGTTC ATCCAAGCAA CCCACAACCA CATCACCAAT CCACACAAGT TCAGCTACAA 5040
CATCACCCAA TACAAAATCA GAAACACACC ATACAACAGC ACAAACCAAA GGCAGAACCA 5100
CCACTTCAAC ACAGACCAAC AAGCCAAGCA CAAAACCACG TCCAAAAAAT CCACCAAAAA 5160
AAGATGATTA CCATTTTGAA GTGTTCAACT TCGTTCCCTG CAGTATATGT GGCAACAATC 5220
AACTTTGCAA ATCCATCTGC AAAACAATAC CAAGCAACAA ACCAAAGAAG AAACCAACCA 5280
TCAAACCCAC AAACAAACCA ACCACCAAAA CCACAAACAA AAGAGACCCA AAAACACCAG 5340
CCAAAACGAC GAAAAAAGAA ACTACCACCA ACCCAACAAA AAAACTAACC CTCAAGACCA 5400
CAGAAAGAGA CACCAGCACC TCACAATCCA CTGCACTCGA CACAACCACA TTAAAACACA 5460
CAGTCCAACA GCAATCCCTC CTCTCAACCA CCCCCGAAAA CACACCCAAC TCCACACAAA 5520
CACCCACAGC ATCCGAGCCC TCCACACCAA ACTCCACCCA AAAAACCCAG CCACATGCTT 5580
AGTTATTCAA AAACTACATC TTAGCAGAGA ACCGTGATCT ATCAAGCAAG AACGAAATTA 5640
AACCTGGGGC AAATAACCAT GGAGTTGATG ATCCACAAGT CAAGTGCAAT CTTCCTAACT 5700
CTTGCTATTA ATGCATTGTA CCTCACCTCA AGTCAGAACA TAACTGAGGA GTTTTACCAA 5760
TCGACATGTA GTGCAGTTAG CAGAGGTTAT TTTAGTGCTT TAAGAACAGG TTGGTATACT 5820
AGTGTCATAA CAATAGAATT AAGTAATATA AAAGAAACCA AATGCAATGG AACTGACACT 5880
AAAGTAAAAC TTATGAAACA AGAATTAGAT AAGTATAAGA ATGCAGTAAC AGAATTACAG 5940
CTACTTATGC AAAACACACC AGCTGTCAAC AACCGGGCCA GAAGAGAAGC ACCACAGTAT 6000
ATGAACTACA CAATCAATAC CACTAAAAAC CTAAATGTAT CAATAAGCAA GAAGAGGAAA 6060
CGAAGATTTC TAGGCTTCTT GTTAGGTGTG GGATCTGCAA TAGCAAGTGG TATAGCTGTA 6120
TCAAAAGTTC TACACCTTGA AGGAGAAGTG AACAAGATCA AAAATGCTTT GTTGTCTACA 6180
AACAAAGCTG TAGTCAGTTT ATCAAATGGG GTCAGTGTTT TAACCAGCAA AGTGTTAGAT 6240
CTCAAGAATT ACATAAATAA CCAATTATTA CCCATAGTAA ATCAACAGAG CTGTCGCATC 6300
TCCAACATTG AAACAGTTAT AGAATTCCAG CAGAAGAACA GCAGATTGTT GGAAATCACC 6360
AGAGAATTTA GTGTCAATGC AGGTGTAACA ACACCTTTAA GCACTTACAT GTTGACAAAC 6420
AGTGAGTTAC TATCATTAAT CAATGATATG CCTATAACAA ATGATCAGAA AAAATTAATG 6480
TCAAGCAATG TTCAGATAGT AAGGCAACAA AGTTATTCCA TCATGTCTAT AATAAAGGAA 6540
GAAGTCCTTG CATATGTTGT ACAGCTGCCT ATCTATGGTG TAATAGATAC ACCTTGCTGG 6600
AAATTGCACA CATCGCCTCT ATGCACTACC AACATCAAAG AAGGATCAAA TATTTGTTTA 6660
ACAAGGACTG ATAGAGGATG GTATTGTGAT AATGCAGGAT CAGTATCCTT CTTTCCACAG 6720
GCTGACACTT GTAAAGTACA GTCCAATCGA GTATTTTGTG ACACTATGAA CAGTTTGACA 6780
TTACCAAGTG AAGTCAGCCT TTGTAACACT GACATATTCA ATTCCAAGTA TGACTGCAAA 6840
ATTATGACAT CAAAAACAGA CATAAGCAGC TCAGTAATTA CTTCTCTTGG AGCTATAGTG 6900
TCATGCTATG GTAAAACTAA ATGCACTGCA TCCAACAAAA ATCGTGGGAT TATAAAGACA 6960
TTTTCTAATG GTTGTGACTA TGTGTCAAAC AAAGGAGTAG ATACTGTGTC AGTGGGCAAC 7020
ACTTTATACT ATGTAAACAA GCTGGAAGGC AAGAACCTTT ATGTAAAAGG GGAACCTATA 7080
ATAAATTACT ATGACCCTCT AGTGTTTCCT TCTGATGAGT TTGATGCATC AATATCTCAA 7140
GTCAATGAAA AAATCAATCA AAGTTTAGCT TTTATTCGTA GATCTGATGA ATTACTACAT 7200
AATGTAAATA CTGGCAAATC TACTACAAAT ATTATGATAA CTACAATTAT TATAGTAATC 7260
ATTGTAGTAT TGTTATCATT AATAGCTATT GGTTTACTGT TGTATTGTAA AGCCAAAAAC 7320
ACACCAGTTA CACTAAGCAA AGACCAACTA AGTGGAATCA ATAATATTGC ATTCAGCAAA 7380
TAGACAAAAA ACCACCTGAT CATGTTTCAA CAACAATCTG CTGACCACCA ATCCCAAATC 7440
AACTTACAAC AAATATTTCA ACATCACAGT ACAGGCTGAA TCATTTCCTC ACATCATGCT 7500
ACCCACATAA CTAAGCTAGA TCCTTAACTT ATAGTTACAT AAAAACCTCA AGTATCACAA 7560
TCAACCACTA AATCAACACA TCATTCACAA AATTAACAGC TGGGGCAAAT ATGTCGCGAA 7620
GAAATCCTTG TAAATTTGAG ATTAGAGGTC ATTGCTTGAA TGGTAGAAGA TGTCACTACA 7680
GTCATAATTA CTTTGAATGG CCTCCTCATG CATTACTAGT GAGGCAAAAC TTCATGTTAA 7740
ACAAGATACT CAAGTCAATG GACAAAAGCA TAGACACTTT GTCTGAAATA AGTGGAGCTG 7800
CTGAACTGGA TAGAACAGAA GAATATGCTC TTGGTATAGT TGGAGTGCTA GAGAGTTACA 7860
TAGGATCTAT AAACAACATA ACAAAACAAT CAGCATGTGT TGCTATGAGT AAACTTCTTA 7920
TTGAGATCAA TAGTGATGAC ATTAAAAAGC TTAGAGATAA TGAAGAACCC AATTCACCTA 7980
AGATAAGAGT GTACAATACT GTTATATCAT ACATTGAGAG CAATAGAAAA AACAACAAGC 8040
AAACCATCCA TCTGCTCAAG AGACTACCAG CAGACGTGCT GAAGAAGACA ATAAAGAACA 8100
CATTAGATAT CCACAAAAGC ATAACCATAA GCAATCCAAA AGAGTCAACT GTGAATGATC 8160
AAAATGACCA AACCAAAAAT AATGATATTA CCGGATAAAT ATCCTTGTAG TATATCATCC 8220
ATATTGATCT CAAGTGAAAG CATGGTTGCT ACATTCAATC ATAAAAACAT ATTACAATTT 8280
AACCATAACT ATTTGGATAA CCACCAGCGT TTATTAAATC ATATATTTGA TGAAATTCAT 8340
TGGACACCTA AAAACTTATT AGATGCCACT CAACAATTTC TCCAACATCT TAACATCCCT 8400
GAAGATATAT ATACAGTATA TATATTAGTG TCATAATGCT TGACCATAAC GACTCTATGT 8460
CATCCAACCA TAAAACTATT TTGATAAGGT TATGGGACAA AATGGATCCC ATTATTAATG 8520
GAAACTCTGC TAATGTGTAT CTAACTGATA GTTATTTAAA AGGTGTTATC TCTTTTTCAG 8580
AGTGTAATGC TTTAGGGAGT TATCTTTTTA ACGGCCCTTA TCTTAAAAAT GATTACACCA 8640
ACTTAATTAG TAGACAAAGC CCACTACTAG AGCATATGAA TCTTAAAAAA CTAACTATAA 8700
CACAGTCATT AATATCTAGA TATCATAAAG GTGAACTGAA ATTAGAAGAA CCAACTTATT 8760
TCCAGTCATT ACTTATGACA TATAAAAGTA TGTCCTCGTC TGAACAAATT GCTACAACTA 8820
ACTTACTTAA AAAAATAATA CGAAGAGCCA TAGAAATAAG TGATGTAAAG GTGTACGCCA 8880
TCTTGAATAA ACTAGGATTA AAGGAAAAGG ACAGAGTTAA GCCCAACAAT AATTCAGGTG 8940
ATGAAAACTC AGTACTTACA ACCATAATTA AAGATGATAT ACTTTCGGCT GTGGAAAACA 9000
ATCAATCATA TACAAATTCA GACAAAAGTC ACTCAGTAAA TCAAAATATC ACTATCAAAA 9060
CAACACTCTT GAAAAAATTG ATGTGTTCAA TGCAACATCC TCCATCATGG TTAATACACT 9120
GGTTCAATTT ATATACAAAA TTAAATAACA TATTAACACA ATATCGATCA AATGAGGTAA 9180
AAAGTCATGG GTTTATATTA ATAGATAATC AAACTTTAAG TGGTTTTCAG TTTATTTTAA 9240
ATCAATATGG TTGTATCGTT TATCATAAAG GACTCAAAAA AATCACAACT ACTACTTACA 9300
ATCAATTTTT GACATGGAAA GACATCAGCC TTAGCAGATT AAATGTTTGC TTAATTACTT 9360
GGATAAGTAA TTGTTTAAAT ACATTAAACA AAAGCTTAGG GCTGAGATGT GGATTCAATA 9420
ATGTTGTGTT ATCACAATTA TTTCTTTATG GAGATTGTAT ACTGAAATTA TTTCATAATG 9480
AAGGCTTCTA CATAATAAAA GAAGTAGAGG GATTTATTAT GTCTTTAATT CTAAACATAA 9540
CAGAAGAAGA TCAATTTAGG AAACGATTTT ATAATAGCAT GCTAAATAAC ATCACAGATG 9600
CAGCTATTAA GGCTCAAAAG GACCTACTAT CAAGAGTATG TCACACTTTA TTAGACAAGA 9660
CAGTGTCTGA TAATATCATA AATGGTAAAT GGATAATCCT ATTAAGTAAA TTTCTTAAAT 9720
TGATTAAGCT TGCAGGTGAT AATAATCTCA ATAACTTGAG TGAGCTATAT TTTCTCTTCA 9780
GAATCTTTGG ACATCCAATG GTCGATGAAA GACAAGCAAT GGATTCTGTA AGAATTAACT 9840
GTAATGAAAC TAAGTTCTAC TTATTAAGTA GTCTAAGTAC ATTAAGAGGT GCTTTCATTT 9900
ATAGAATCAT AAAAGGGTTT GTAAATACCT ACAACAGATG GCCCACCTTA AGGAATGCTA 9960
TTGTCCTACC TCTAAGATGG TTAAACTACT ATAAACTTAA TACTTATCCA TCTCTACTTG 10020
AAATCACAGA AAATGATTTG ATTATTTTAT CAGGATTGCG GTTCTATCGT GAGTTTCATC 10080
TGCCTAAAAA AGTGGATCTT GAAATGATAA TAAATGACAA AGCCATTTCA CCTCCAAAAG 10140
ATCTAATATG GACTAGTTTT CCTAGAAATT ACATGCCATC ACATATACAA AATTATATAG 10200
AACATGAAAA GTTGAAGTTC TCTGAAAGCG ACAGATCGAG AAGAGTACTA GAGTATTACT 10260
TGAGAGATAA TAAATTCAAT GAATGCGATC TATACAATTG TGTAGTCAAT CAAAGCTATC 10320
TCAACAACTC TAATCACGTG GTATCACTAA CTGGTAAAGA AAGAGAGCTC AGTGTAGGTA 10380
GAATGTTTGC TATGCAACCA GGTATGTTTA GGCAAATCCA AATCTTAGCA GAGAAAATGA 10440
TAGCTGAAAA TATTTTACAA TTCTTCCCTG AGAGTTTGAC AAGATATGGT GATCTAGAGC 10500
TTCAAAAGAT ATTAGAATTA AAAGCAGGAA TAAGCAACAA GTCAAATCGT TATAATGATA 10560
ACTACAACAA TTATATCAGT AAATGTTCTA TCATTACAGA TCTTAGCAAA TTCAATCAGG 10620
CATTTAGATA TGAAACATCA TGTATCTGCA GTGATGTATT AGATGAACTG CATGGAGTAC 10680
AATCTCTGTT CTCTTGGTTG CATTTAACAA TACCTCTTGT CACAATAATA TGTACATATA 10740
GACATGCACC TCCTTTCATA AAGGATCATG TTGTTAATCT TAATGAGGTT GATGAACAAA 10800
GTGGATTATA CAGATATCAT ATGGGTGGTA TTGAGGGCTG GTGTCAAAAA CTGTGGACCA 10860
TTGAAGCTAT ATCATTATTA GATCTAATAT CTCTCAAAGG GAAATTCTCT ATCACAGCTC 10920
TGATAAATGG TGATAATCAG TCAATTGATA TAAGCAAACC AGTTAGACTT ATAGAGGGTC 10980
AGACCCATGC ACAAGCAGAT TATTTGTTAG CATTAAATAG CCTTAAATTG TTATATAAAG 11040
AGTATGCAGG TATAGGCCAT AAGCTTAAGG GAACAGAGAC CTATATATCC CGAGATATGC 11100
AGTTCATGAG CAAAACAATC CAGCACAATG GAGTGTACTA TCCAGCCAGT ATCAAAAAAG 11160
TCCTGAGAGT AGGTCCATGG ATAAACACGA TACTTGATGA TTTTAAAGTT AGTTTAGAAT 11220
CTATAGGCAG CTTAACACAG GAGTTAGAAT ACAGAGGAGA AAGCTTATTA TGCAGTTTAA 11280
TATTTAGGAA CATTTGGTTA TACAATCAAA TTGCTTTGCA ACTCCGAAAT CATGCATTAT 11340
GTAACAATAA GCTATATTTA GATATATTGA AAGTATTAAA ACACTTAAAA ACTTTTTTTA 11400
ATCTTGATAG CATTGATATG GCTTTATCAT TGTATATGAA TTTGCCTATG CTGTTTGGTG 11460
GTGGTGATCC TAATTTGTTA TATCGAAGCT TTTATAGGAG AACTCCAGAC TTCCTTACAG 11520
AAGCTATAGT ACATTCAGTG TTTGTGTTGA GCTATTATAC TGGTCACGAT TTACAAGATA 11580
AGCTCCAGGA TCTTCCAGAT GATAGACTGA ACAAATTCTT GACATGTGTC ATCACATTTG 11640
ATAAAAATCC CAATGCCGAG TTTGTAACAT TGATGAGGGA TCCACAGGCT TTAGGGTCTG 11700
AAAGGCAAGC TAAAATTACT AGTGAGATTA ATAGATTAGC AGTAACAGAA GTCTTAAGTA 11760
TAGCCCCAAA CAAAATATTT TCTAAAAGTG CACAACATTA TACTACCACT GAGATTGATC 11820
TAAATGACAT TATGCAAAAT ATAGAACCAA CTTACCCTCA TGGATTAAGA GTTGTTTATG 11880
AAAGTTTACC TTTTTATAAA GCAGAAAAAA TAGTTAATCT TATATCAGGA ACAAAATCCA 11940
TAACTAATAT ACTTGAAAAA ACATCAGCAA TAGATACAAC TGATATTAAT AGGGCTACTG 12000
ATATGATGAG GAAAAATATA ACTTTACTTA TAAGGATACT TCCACTAGAT TGTAACAAAG 12060
ACAAAAGAGA GTTATTAAGT TTAGAAAATC TTAGTATAAC TGAATTAAGC AAGTATGTAA 12120
GAGAAAGATC TTGGTCATTA TCCAATATAG TAGGAGTAAC ATCGCCAAGT ATTATGTTCA 12180
CAATGGACAT TAAATATACA ACTAGCACTA TAGCCAGTGG TATAATAATA GAAAAATATA 12240
ATGTTAATAG TTTAACTCGT GGTGAAAGAG GACCCACCAA GCCATGGGTA GGCTCATCCA 12300
CGCAGGAGAA AAAAACAATG CCAGTGTACA ACAGACAAGT TTTAACCAAA AAGCAAAGAG 12360
ACCAAATAGA TTTATTAGCA AAATTAGACT GGGTATATGC ATCCATAGAC AACAAAGATG 12420
AATTCATGGA AGAACTGAGT ACTGGAACAC TTGGACTGTC ATATGAAAAA GCCAAAAAGT 12480
TGTTTCCACA ATATCTAAGT GTCAATTATT TACACCGTTT AACAGTCAGT AGTAGACCAT 12540
GTGAATTCCC TGCATCAATA CCAGCTTATA GAACAACAAA TTATCATTTT GATACTAGTC 12600
CTATCAATCA TGTATTAACA GAAAAGTATG GAGATGAAGA TATCGACATT GTGTTTCAAA 12660
ATTGCATAAG TTTTGGTCTT AGCCTGATGT CGGTTGTGGA ACAATTCACA AACATATGTC 12720
CTAATAGAAT TATTCTCATA CCGAAGCTGA ATGAGATACA TTTGATGAAA CCTCCTATAT 12780
TTACAGGAGA TGTTGATATC ATCAAGTTGA AGCAAGTGAT ACAAAAGCAG CACATGTTCC 12840
TACCAGATAA AATAAGTTTA ACCCAATATG TAGAATTATT CTTAAGTAAC AAAGCACTTA 12900
AATCTGGATC TCACATCAAC TCTAATTTAA TATTAGTACA TAAAATGTCT GATTATTTTC 12960
ATAATGCTTA TATTTTAAGT ACTAATTTAG CTGGACATTG GATTCTGATT ATTCAACTTA 13020
TGAAAGATTC AAAAGGTATT TTTGAAAAAG ATTGGGGAGA GGGGTACATA ACTGATCATA 13080
TGTTCATTAA TTTGAATGTT TTCTTTAATG CTTATAAGAC TTATTTGCTA TGTTTTCATA 13140
AAGGTTATGG TAAAGCAAAA TTAGAATGTG ATATGAACAC TTCAGATCTT CTTTGTGTTT 13200
TGGAGTTAAT AGACAGTAGC TACTGGAAAT CTATGTCTAA AGTTTTCCTA GAACAAAAAG 13260
TCATAAAATA CATAGTCAAT CAAGACACAA GTTTGCGTAG AATAAAAGGC TGTCACAGTT 13320
TTAAGTTGTG GTTTTTAAAA CGCCTTAATA ATGCTAAATT TACCGTATGC CCTTGGGTTG 13380
TTAACATAGA TTATCACCCA ACACACATGA AAGCTATATT ATCTTACATA GATTTAGTTA 13440
GAATGGGGTT AATAAATGTA GATAAATTAA CCATTAAAAA TAAAAACAAA TTCAATGATG 13500
AATTTTACAC ATCAAATCTC TTTTACATTA GTTATAACTT TTCAGACAAC ACTCATTTGC 13560
TAACAAAACA AATAAGAATT GCTAATTCAG AATTAGAAGA TAATTATAAC AAACTATATC 13620
ACCCAACCCC AGAAACTTTA GAAAATATGT CATTAATTCC TGTTAAAAGT AATAATAGTA 13680
ACAAACCTAA ATTTTGTATA AGTGGAAATA CCGAATCTAT GATGATGTCA ACATTCTCTA 13740
GTAAAATGCA TATTAAATCT TCCACTGTTA CCACAAGATT CAATTATAGC AAACAAGACT 13800
TGTACAATTT ATTTCCAATT GTTGTGATAG ACAAGATTAT AGATCATTCA GGTAATACAG 13860
CAAAATCTAA CCAACTTTAC ACCACCACTT CACATCAGAC ATCTTTAGTA AGGAATAGTG 13920
CATCACTTTA TTGCATGCTT CCTTGGCATC ATGTCAATAG ATTTAACTTT GTATTTAGTT 13980
CCACAGGATG CAAGATCAGT ATAGAGTATA TTTTAAAAGA TCTTAAGATT AAGGACCCCA 14040
GTTGTATAGC ATTCATAGGT GAAGGAGCTG GTAACTTATT ATTACGTACG GTAGTAGAAC 14100
TTCATCCAGA CATAAGATAC ATTTACAGAA GTTTAAAAGA TTGCAATGAT CATAGTTTAC 14160
CTATTGAATT TCTAAGGTTA TACAACGGGC ATATAAACAT AGATTATGGT GAGAATTTAA 14220
CCATTCCTGC TACAGATGCA ACTAATAACA TTCATTGGTC TTATTTACAT ATAAAATTTG 14280
CAGAACCTAT TAGCATCTTT GTCTGCGATG CTGAATTACC TGTTACAGCC AATTGGAGTA 14340
AAATTATAAT TGAATGGAGT AAGCATGTAA GAAAGTGCAA GTACTGTTCT TCTGTAAATA 14400
GATGCATTTT AATTGCAAAA TATCATGCTC AAGATGACAT TGATTTCAAA TTAGATAACA 14460
TTACTATATT AAAAACTTAC GTGTGCCTAG GTAGCAAGTT AAAAGGATCT GAAGTTTACT 14520
TAATCCTTAC AATAGGCCCT GCAAATATAC TTCCTGTTTT TGATGTTGTA CAAAATGCTA 14580
AATTGACACT TTCAAGAACT AAAAATTTCA TTATGCCTAA AAAAACTGAC AAGGAATCTA 14640
TCGATGCAAA TATTAAAAGC TTAATACCTT TCCTTTGTTA CCCTATAACA AAAAAAGGAA 14700
TTAAGACTTC ATTGTCAAAA TTGAAGAGTG TAGTTAATGG AGATATATTA TCATATTCTA 14760
TAGCTGGACG TAATGAAGTA TTCAGCAACA AGCTTATAAA CCACAAGCAT ATGAATATCC 14820
TAAAATGGCT AGATCATGTT TTAAATTTTA GATCAGCTGA ACTTAATTAC AATCATTTAT 14880
ACATGATAGA GTCCACATAT CCTTACTTAA GTGAATTGTT AAATAGTTTA ACAACCAATG 14940
AGCTCAAGAA GCTGATTAAA ATAACAGGTA GTGTGCTATA CAACCTTCCC AACGAACAGT 15000
AGTTTAAAAT ATCATTAACA AGTTTGGTCA AATTTAGATG CTAACACATC ATTATATTAT 15060
AGTTATTAAA AAATATACAA ACTTTTCAAT AATTTAGCAT ATTGATTCCA AAATTATCAT 15120
TTTAGTCTTA AGGGGTTAAA TAAAAGTCTA AAACTAACAA TTATACATGT GCATTCACAA 15180
CACAACGAGA CATTAGTTTT TGACACTTTT TTTCTCGT 15218
(2) Information for SEQ ID NO: 24:
...
(i) sequence characteristics:
(A) length: 2166 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO: 24:
Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp
1               5                   10                  15
Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
            20                  25                  30
Ser Tyr Leu Phe Asn Gly Pro Tyr Leu Lys Asn Asp Tyr Thr Asn Leu
        35                  40                  45
Ile Ser Arg Gln Ser Pro Leu Leu Glu His Met Asn Leu Lys Lys Leu
    50                  55                  60
Thr Ile Thr Gln Ser Leu Ile Ser Arg Tyr His Lys Gly Glu Leu Lys
65                  70                  75                  80
Leu Glu Glu Pro Thr Tyr Phe Gln Ser Leu Leu Met Thr Tyr Lys Ser
                85                  90                  95
Met Ser Ser Ser Glu Gln Ile Ala Thr Thr Asn Leu Leu Lys Lys Ile
            100                 105                 110
Ile Arg Arg Ala Ile Glu Ile Ser Asp Val Lys Val Tyr Ala Ile Leu
115?????????????????120?????????????????125Asn?Lys?Leu?Gly?Leu?Lys?Glu?Lys?Asp?Arg?Val?Lys?Pro?Asn?Asn?Asn
130?????????????????135?????????????????140Ser?Gly?Asp?Glu?Asn?Ser?Val?Leu?Thr?Thr?Ile?Ile?Lys?Asp?Asp?Ile145?????????????????150?????????????????155?????????????????160Leu?Ser?Ala?Val?Glu?Asn?Asn?Gln?Ser?Tyr?Thr?Asn?Ser?Asp?Lys?Ser
165?????????????????170?????????????????175His?Ser?Val?Asn?Gln?Asn?Ile?Thr?Ile?Lys?Thr?Thr?Leu?Leu?Lys?Lys
180?????????????????185?????????????????190Leu?Met?Cys?Ser?Met?Gln?His?Pro?Pro?Ser?Trp?Leu?Ile?His?Trp?Phe
195?????????????????200?????????????????205Asn?Leu?Tyr?Thr?Lys?Leu?Asn?Asn?Ile?Leu?Thr?Gln?Tyr?Arg?Ser?Asn
210?????????????????215?????????????????220Glu?Val?Lys?Ser?His?Gly?Phe?Ile?Leu?Ile?Asp?Asn?Gln?Thr?Leu?Ser225?????????????????230?????????????????235?????????????????240Gly?Phe?Gln?Phe?Ile?Leu?Asn?Gln?Tyr?Gly?Cys?Ile?Val?Tyr?His?Lys
245?????????????????250?????????????????255Gly?Leu?Lys?Lys?Ile?Thr?Thr?Thr?Thr?Tyr?Asn?Gln?Phe?Leu?Thr?Trp
260?????????????????265?????????????????270Lys?Asp?Ile?Ser?Leu?Ser?Arg?Leu?Asn?Val?Cys?Leu?Ile?Thr?Trp?Ile
275?????????????????280?????????????????285Ser?Asn?Cys?Leu?Asn?Thr?Leu?Asn?Lys?Ser?Leu?Gly?Leu?Arg?Cys?Gly
290?????????????????295?????????????????300Phe?Asn?Asn?Val?Val?Leu?Ser?Gln?Leu?Phe?Leu?Tyr?Gly?Asp?Cys?Ile305?????????????????310?????????????????315?????????????????320Leu?Lys?Leu?Phe?His?Asn?Glu?Gly?Phe?Tyr?Ile?Ile?Lys?Glu?Val?Glu
325?????????????????330?????????????????335Gly?Phe?Ile?Met?Ser?Leu?Ile?Leu?Asn?Ile?Thr?Glu?Glu?Asp?Gln?Phe
340?????????????????345?????????????????350Arg?Lys?Arg?Phe?Tyr?Asn?Ser?Met?Leu?Asn?Asn?Ile?Thr?Asp?Ala?Ala
355?????????????????360?????????????????365Ile?Lys?Ala?Gln?Lys?Asp?Leu?Leu?Ser?Arg?Val?Cys?His?Thr?Leu?Leu
370?????????????????375?????????????????380Asp?Lys?Thr?Val?Ser?Asp?Asn?Ile?Ile?Asn?Gly?Lys?Trp?Ile?Ile?Leu385?????????????????390?????????????????395?????????????????400Leu?Ser?Lys?Phe?Leu?Lys?Leu?Ile?Lys?Leu?Ala?Gly?Asp?Asn?Asn?Leu
405?????????????????410?????????????????415Asn?Asn?Leu?Ser?Glu?Leu?Tyr?Phe?Leu?Phe?Arg?Ile?Phe?Gly?His?Pro
420?????????????????425?????????????????430Met?Val?Asp?Glu?Arg?Gln?Ala?Met?Asp?Ser?Val?Arg?Ile?Asn?Cys?Asn
435?????????????????440?????????????????445Glu?Thr?Lys?Phe?Tyr?Leu?Leu?Ser?Ser?Leu?Ser?Thr?Leu?Arg?Gly?Ala
450?????????????????455?????????????????460Phe?Ile?Tyr?Arg?Ile?Ile?Lys?Gly?Phe?Val?Asn?Thr?Tyr?Asn?Arg?Trp465?????????????????470?????????????????475?????????????????480Pro?Thr?Leu?Arg?Asn?Ala?Ile?Val?Leu?Pro?Leu?Arg?Trp?Leu?Asn?Tyr
485?????????????????490?????????????????495Tyr?Lys?Leu?Asn?Thr?Tyr?Pro?Ser?Leu?Leu?Glu?Ile?Thr?Glu?Asn?Asp
500?????????????????505?????????????????510Leu?Ile?Ile?Leu?Ser?Gly?Leu?Arg?Phe?Tyr?Arg?Glu?Phe?His?Leu?Pro
515?????????????????520?????????????????525Lys?Lys?Val?Asp?Leu?Glu?Met?Ile?Ile?Asn?Asp?Lys?Ala?Ile?Ser?Pro
530?????????????????535?????????????????540Pro?Lys?Asp?Leu?Ile?Trp?Thr?Ser?Phe?Pro?Arg?Asn?Tyr?Met?Pro?Ser545?????????????????550?????????????????555?????????????????560His?Ile?Gln?Asn?Tyr?Ile?Glu?His?Glu?Lys?Leu?Lys?Phe?Ser?Glu?Ser
565?????????????????570?????????????????575Asp?Arg?Ser?Arg?Arg?Val?Leu?Glu?Tyr?Tyr?Leu?Arg?Asp?Asn?Lys?Phe
580?????????????????585?????????????????590Asn?Glu?Cys?Asp?Leu?Tyr?Asn?Cys?Val?Val?Asn?Gln?Ser?Tyr?Leu?Asn
595?????????????????600?????????????????605Asn?Ser?Asn?His?Val?Val?Ser?Leu?Thr?Gly?Lys?Glu?Arg?Glu?Leu?Ser
610?????????????????615?????????????????620Val?Gly?Arg?Met?Phe?Ala?Met?Gln?Pro?Gly?Met?Phe?Arg?Gln?Ile?Gln625?????????????????630?????????????????635?????????????????640Ile?Leu?Ala?Glu?Lys?Met?Ile?Ala?Glu?Asn?Ile?Leu?Gln?Phe?Phe?Pro
645?????????????????650?????????????????655Glu?Ser?Leu?Thr?Arg?Tyr?Gly?Asp?Leu?Glu?Leu?Gln?Lys?Ile?Leu?Glu
660?????????????????665?????????????????670Leu?Lys?Ala?Gly?Ile?Ser?Asn?Lys?Ser?Asn?Arg?Tyr?Asn?Asp?Asn?Tyr
675?????????????????680?????????????????685Asn?Asn?Tyr?Ile?Ser?Lys?Cys?Ser?Ile?Ile?Thr?Asp?Leu?Ser?Lys?Phe
690?????????????????695?????????????????700Asn?Gln?Ala?Phe?Arg?Tyr?Glu?Thr?Ser?Cys?Ile?Cys?Ser?Asp?Val?Leu705?????????????????710?????????????????715?????????????????720Asp?Glu?Leu?His?Gly?Val?Gln?Ser?Leu?Phe?Ser?Trp?Leu?His?Leu?Thr
725?????????????????730?????????????????735Ile?Pro?Leu?Val?Thr?Ile?Ile?Cys?Thr?Tyr?Arg?His?Ala?Pro?Pro?Phe
740?????????????????745?????????????????750Ile?Lys?Asp?His?Val?Val?Asn?Leu?Asn?Glu?Val?Asp?Glu?Gln?Ser?Gly
755?????????????????760?????????????????765Leu?Tyr?Arg?Tyr?His?Met?Gly?Gly?Ile?Glu?Gly?Trp?Cys?Gln?Lys?Leu
770?????????????????775?????????????????780Trp?Thr?Ile?Glu?Ala?Ile?Ser?Leu?Leu?Asp?Leu?Ile?Ser?Leu?Lys?Gly785?????????????????790?????????????????795?????????????????800Lys?Phe?Ser?Ile?Thr?Ala?Leu?Ile?Asn?Gly?Asp?Asn?Gln?Ser?Ile?Asp
805?????????????????810?????????????????815Ile?Ser?Lys?Pro?Val?Arg?Leu?Ile?Glu?Gly?Gln?Thr?His?Ala?Gln?Ala
820?????????????????825?????????????????830Asp?Tyr?Leu?Leu?Ala?Leu?Asn?Ser?Leu?Lys?Leu?Leu?Tyr?Lys?Glu?Tyr
835?????????????????840?????????????????845Ala?Gly?Ile?Gly?His?Lys?Leu?Lys?Gly?Thr?Glu?Thr?Tyr?Ile?Ser?Arg
850?????????????????855?????????????????860Asp?Met?Gln?Phe?Met?Ser?Lys?Thr?Ile?Gln?His?Asn?Gly?Val?Tyr?Tyr865?????????????????870?????????????????875?????????????????880Pro?Ala?Ser?Ile?Lys?Lys?Val?Leu?Arg?Val?Gly?Pro?Trp?Ile?Asn?Thr
885?????????????????890?????????????????895Ile?Leu?Asp?Asp?Phe?Lys?Val?Ser?Leu?Glu?Ser?Ile?Gly?Ser?Leu?Thr
900?????????????????905?????????????????910Gln?Glu?Leu?Glu?Tyr?Arg?Gly?Glu?Ser?Leu?Leu?Cys?Ser?Leu?Ile?Phe
915?????????????????920?????????????????925Arg?Asn?Ile?Trp?Leu?Tyr?Asn?Gln?Ile?Ala?Leu?Gln?Leu?Arg?Asn?His
930?????????????????935?????????????????940Ala?Leu?Cys?Asn?Asn?Lys?Leu?Tyr?Leu?Asp?Ile?Leu?Lys?Val?Leu?Lys945?????????????????950?????????????????955?????????????????960His?Leu?Lys?Thr?Phe?Phe?Asn?Leu?Asp?Ser?Ile?Asp?Met?Ala?Leu?Ser
965?????????????????970?????????????????975Leu?Tyr?Met?Asn?Leu?Pro?Met?Leu?Phe?Gly?Gly?Gly?Asp?Pro?Asn?Leu
980?????????????????985?????????????????990Leu?Tyr?Arg?Ser?Phe?Tyr?Arg?Arg?Thr?Pro?Asp?Phe?Leu?Thr?Glu?Ala
995?????????????????1000????????????????1005Ile?Val?His?Ser?Val?Phe?Val?Leu?Ser?Tyr?Tyr?Thr?Gly?His?Asp?Leu
1010????????????????1015????????????????1020Gln?Asp?Lys?Leu?Gln?Asp?Leu?Pro?Asp?Asp?Arg?Leu?Asn?Lys?Phe?Leu1025????????????????1030????????????????1035????????????????1040Thr?Cys?Val?Ile?Thr?Phe?Asp?Lys?Asn?Pro?Asn?Ala?Glu?Phe?Val?Thr
1045????????????????1050????????????????1055Leu?Met?Arg?Asp?Pro?Gln?Ala?Leu?Gly?Ser?Glu?Arg?Gln?Ala?Lys?Ile
1060????????????????1065????????????????1070Thr?Ser?Glu?Ile?Asn?Arg?Leu?Ala?Val?Thr?Glu?Val?Leu?Ser?Ile?Ala
1075????????????????1080????????????????1085Pro?Asn?Lys?Ile?Phe?Ser?Lys?Ser?Ala?Gln?His?Tyr?Thr?Thr?Thr?Glu
1090????????????????1095????????????????1100Ile?Asp?Leu?Asn?Asp?Ile?Met?Gln?Asn?Ile?Glu?Pro?Thr?Tyr?Pro?His1105????????????????1110????????????????1115????????????????1120Gly?Leu?Arg?Val?Val?Tyr?Glu?Ser?Leu?Pro?Phe?Tyr?Lys?Ala?Glu?Lys
1125????????????????1130????????????????1135Ile?Val?Asn?Leu?Ile?Ser?Gly?Thr?Lys?Ser?Ile?Thr?Asn?Ile?Leu?Glu
1140????????????????1145????????????????1150Lys?Thr?Ser?Ala?Ile?Asp?Thr?Thr?Asp?Ile?Asn?Arg?Ala?Thr?Asp?Met
1155????????????????1160????????????????1165Met?Arg?Lys?Asn?Ile?Thr?Leu?Leu?Ile?Arg?Ile?Leu?Pro?Leu?Asp?Cys
1170????????????????1175????????????????1180Asn?Lys?Asp?Lys?Arg?Glu?Leu?Leu?Ser?Leu?Glu?Asn?Leu?Ser?Ile?Thr1185????????????????1190????????????????1195????????????????1200Glu?Leu?Ser?Lys?Tyr?Val?Arg?Glu?Arg?Ser?Trp?Ser?Leu?Ser?Asn?Ile
1205????????????????1210????????????????1215Val?Gly?Val?Thr?Ser?Pro?Ser?Ile?Met?Phe?Thr?Met?Asp?Ile?Lys?Tyr
1220????????????????1225????????????????1230Thr?Thr?Ser?Thr?Ile?Ala?Ser?Gly?Ile?Ile?Ile?Glu?Lys?Tyr?Asn?Val
1235????????????????1240????????????????1245Asn?Ser?Leu?Thr?Arg?Gly?Glu?Arg?Gly?Pro?Thr?Lys?Pro?Trp?Val?Gly
1250????????????????1255????????????????1260Ser?Ser?Thr?Gln?Glu?Lys?Lys?Thr?Met?Pro?Val?Tyr?Asn?Arg?Gln?Val1265????????????????1270????????????????1275????????????????1280Leu?Thr?Lys?Lys?Gln?Arg?Asp?Gln?Ile?Asp?Leu?Leu?Ala?Lys?Leu?Asp
1285????????????????1290????????????????1295Trp?Val?Tyr?Ala?Ser?Ile?Asp?Asn?Lys?Asp?Glu?Phe?Met?Glu?Glu?Leu
1300????????????????1305????????????????1310Ser?Thr?Gly?Thr?Leu?Gly?Leu?Ser?Tyr?Glu?Lys?Ala?Lys?Lys?Leu?Phe
1315????????????????1320????????????????1325Pro?Gln?Tyr?Leu?Ser?Val?Asn?Tyr?Leu?His?Arg?Leu?Thr?Val?Ser?Ser
1330????????????????1335????????????????1340Arg?Pro?Cys?Glu?Phe?Pro?Ala?Ser?Ile?Pro?Ala?Tyr?Arg?Thr?Thr?Asn1345????????????????1350????????????????1355????????????????1360Tyr?His?Phe?Asp?Thr?Ser?Pro?Ile?Asn?His?Val?Leu?Thr?Glu?Lys?Tyr
1365????????????????1370????????????????1375Gly?Asp?Glu?Asp?Ile?Asp?Ile?Val?Phe?Gln?Asn?Cys?Ile?Ser?Phe?Gly
1380????????????????1385????????????????1390Leu?Ser?Leu?Met?Ser?Val?Val?Glu?Gln?Phe?Thr?Asn?Ile?Cys?Pro?Asn
1395????????????????1400????????????????1405Arg?Ile?Ile?Leu?Ile?Pro?Lys?Leu?Asn?Glu?Ile?His?Leu?Met?Lys?Pro
1410????????????????1415????????????????1420Pro?Ile?Phe?Thr?Gly?Asp?Val?Asp?Ile?Ile?Lys?Leu?Lys?Gln?Val?Ile1425????????????????1430????????????????1435????????????????1440Gln?Lys?Gln?His?Met?Phe?Leu?Pro?Asp?Lys?Ile?Ser?Leu?Thr?Gln?Tyr
1445????????????????1450????????????????1455Val?Glu?Leu?Phe?Leu?Ser?Asn?Lys?Ala?Leu?Lys?Ser?Gly?Ser?His?Ile
1460????????????????1465????????????????1470Asn?Ser?Asn?Leu?Ile?Leu?Val?His?Lys?Met?Ser?Asp?Tyr?Phe?His?Asn
1475????????????????1480????????????????1485Ala?Tyr?Ile?Leu?Ser?Thr?Asn?Leu?Ala?Gly?His?Trp?Ile?Leu?Ile?Ile
1490????????????????1495????????????????1500Gln?Leu?Met?Lys?Asp?Ser?Lys?Gly?Ile?Phe?Glu?Lys?Asp?Trp?Gly?Glu1505????????????????1510????????????????1515????????????????1520Gly?Tyr?Ile?Thr?Asp?His?Met?Phe?Ile?Asn?Leu?Asn?Val?Phe?Phe?Asn
1525????????????????1530????????????????1535Ala?Tyr?Lys?Thr?Tyr?Leu?Leu?Cys?Phe?His?Lys?Gly?Tyr?Gly?Lys?Ala
1540????????????????1545????????????????1550Lys?Leu?Glu?Cys?Asp?Met?Asn?Thr?Ser?Asp?Leu?Leu?Cys?Val?Leu?Glu
1555????????????????1560????????????????1565Leu?Ile?Asp?Ser?Ser?Tyr?Trp?Lys?Ser?Met?Ser?Lys?Val?Phe?Leu?Glu
1570????????????????1575????????????????1580Gln?Lys?Val?Ile?Lys?Tyr?Ile?Val?Asn?Gln?Asp?Thr?Ser?Leu?Arg?Arg1585????????????????1590????????????????1595????????????????1600Ile?Lys?Gly?Cys?His?Ser?Phe?Lys?Leu?Trp?Phe?Leu?Lys?Arg?Leu?Asn
1605????????????????1610????????????????1615Asn?Ala?Lys?Phe?Thr?Val?Cys?Pro?Trp?Val?Val?Asn?Ile?Asp?Tyr?His
1620????????????????1625????????????????1630Pro?Thr?His?Met?Lys?Ala?Ile?Leu?Ser?Tyr?Ile?Asp?Leu?Val?Arg?Met
1635????????????????1640????????????????1645Gly?Leu?Ile?Asn?Val?Asp?Lys?Leu?Thr?Ile?Lys?Asn?Lys?Asn?Lys?Phe
1650????????????????1655????????????????1660Asn?Asp?Glu?Phe?Tyr?Thr?Ser?Asn?Leu?Phe?Tyr?Ile?Ser?Tyr?Asn?Phe1665????????????????1670????????????????1675????????????????1680Ser?Asp?Asn?Thr?His?Leu?Leu?Thr?Lys?Gln?Ile?Arg?Ile?Ala?Asn?Ser
1685????????????????1690????????????????1695Glu?Leu?Glu?Asp?Asn?Tyr?Asn?Lys?Leu?Tyr?His?Pro?Thr?Pro?Glu?Thr
1700????????????????1705????????????????1710Leu?Glu?Asn?Met?Ser?Leu?Ile?Pro?Val?Lys?Ser?Asn?Asn?Ser?Asn?Lys
1715????????????????1720????????????????1725Pro?Lys?Phe?Cys?Ile?Ser?Gly?Asn?Thr?Glu?Ser?Met?Met?Met?Ser?Thr
1730????????????????1735????????????????1740Phe?Ser?Ser?Lys?Met?His?Ile?Lys?Ser?Ser?Thr?Val?Thr?Thr?Arg?Phe1745????????????????1750????????????????1755????????????????1760Asn?Tyr?Ser?Lys?Gln?Asp?Leu?Tyr?Asn?Leu?Phe?Pro?Ile?Val?Val?Ile
1765????????????????1770????????????????1775Asp?Lys?Ile?Ile?Asp?His?Ser?Gly?Asn?Thr?Ala?Lys?Ser?Asn?Gln?Leu
1780????????????????1785????????????????1790Tyr?Thr?Thr?Thr?Ser?His?Gln?Thr?Ser?Leu?Val?Arg?Asn?Ser?Ala?Ser
1795????????????????1800????????????????1805Leu?Tyr?Cys?Met?Leu?Pro?Trp?His?His?Val?Asn?Arg?Phe?Asn?Phe?Val
1810????????????????1815????????????????1820Phe?Ser?Ser?Thr?Gly?Cys?Lys?Ile?Ser?Ile?Glu?Tyr?Ile?Leu?Lys?Asp1825????????????????1830????????????????1835????????????????1840Leu?Lys?Ile?Lys?Asp?Pro?Ser?Cys?Ile?Ala?Phe?Ile?Gly?Glu?Gly?Ala
1845????????????????1850????????????????1855Gly?Asn?Leu?Leu?Leu?Arg?Thr?Val?Val?Glu?Leu?His?Pro?Asp?Ile?Arg
1860????????????????1865????????????????1870Tyr?Ile?Tyr?Arg?Ser?Leu?Lys?Asp?Cys?Asn?Asp?His?Ser?Leu?Pro?Ile
1875????????????????1880????????????????1885Glu?Phe?Leu?Arg?Leu?Tyr?Asn?Gly?His?Ile?Asn?Ile?Asp?Tyr?Gly?Glu
1890????????????????1895????????????????1900Asn?Leu?Thr?Ile?Pro?Ala?Thr?Asp?Ala?Thr?Asn?Asn?Ile?His?Trp?Ser1905????????????????1910????????????????1915????????????????1920Tyr?Leu?His?Ile?Lys?Phe?Ala?Glu?Pro?Ile?Ser?Ile?Phe?Val?Cys?Asp
1925????????????????1930????????????????1935Ala?Glu?Leu?Pro?Val?Thr?Ala?Asn?Trp?Ser?Lys?Ile?Ile?Ile?Glu?Trp
1940????????????????1945????????????????1950Ser?Lys?His?Val?Arg?Lys?Cys?Lys?Tyr?Cys?Ser?Ser?Val?Asn?Arg?Cys
1955????????????????1960????????????????1965Ile?Leu?Ile?Ala?Lys?Tyr?His?Ala?Gln?Asp?Asp?Ile?Asp?Phe?Lys?Leu
1970????????????????1975????????????????1980Asp?Asn?Ile?Thr?Ile?Leu?Lys?Thr?Tyr?Val?Cys?Leu?Gly?Ser?Lys?Leu1985????????????????1990????????????????1995????????????????2000Lys?Gly?Ser?Glu?Val?Tyr?Leu?Ile?Leu?Thr?Ile?Gly?Pro?Ala?Asn?Ile
2005????????????????2010????????????????2015Leu?Pro?Val?Phe?Asp?Val?Val?Gln?Asn?Ala?Lys?Leu?Thr?Leu?Ser?Arg
2020????????????????2025????????????????2030Thr?Lys?Asn?Phe?Ile?Met?Pro?Lys?Lys?Thr?Asp?Lys?Glu?Ser?Ile?Asp
2035????????????????2040????????????????2045Ala?Asn?Ile?Lys?Ser?Leu?Ile?Pro?Phe?Leu?Cys?Tyr?Pro?Ile?Thr?Lys
2050????????????????2055????????????????2060Lys?Gly?Ile?Lys?Thr?Ser?Leu?Ser?Lys?Leu?Lys?Ser?Val?Val?Asn?Gly2065????????????????2070????????????????2075????????????????2080Asp?Ile?Leu?Ser?Tyr?Ser?Ile?Ala?Gly?Arg?Asn?Glu?Val?Phe?Ser?Asn
2085????????????????2090????????????????2095Lys?Leu?Ile?Asn?His?Lys?His?Met?Asn?Ile?Leu?Lys?Trp?Leu?Asp?His
2100????????????????2105????????????????2110Val?Leu?Asn?Phe?Arg?Ser?Ala?Glu?Leu?Asn?Tyr?Asn?His?Leu?Tyr?Met
2115????????????????2120????????????????2125Ile?Glu?Ser?Thr?Tyr?Pro?Tyr?Leu?Ser?Glu?Leu?Leu?Asn?Ser?Leu?Thr
2130????????????????2135????????????????2140Thr?Asn?Glu?Leu?Lys?Lys?Leu?Ile?Lys?Ile?Thr?Gly?Ser?Val?Leu?Tyr2145????????????????2150????????????????2155????????????????2160Asn?Leu?Pro?Asn?Glu?Gln
                2165
(2) Information for SEQ ID NO:25:
(i) sequence characteristics:
(A) length: 15229 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:25:
ACGCGAAAAA ATGCGTACTA CAAACTTGCA CATTCGGAAA AAATGGGGCA AATAAGAATT 60
TGATAAGTGC TATTTAAATC TAACCTTTTC AATCAGAAAT GGGGTGCAAT TCACTGAGCA 120
TGATAAAGGT TAGATTACAA AATTTATTTG ACAATGACGA AGTAGCATTG TTAAAAATAA 180
CATGTTATAC TGACAAATTA ATTCTTCTGA CCAATGCATT AGCCAAAGCA GTAATACATA 240
CAATTAAATT AAACGGCATA GTTTTTATAC ATGTTATAAC AAGCAGTGAA GTGTGCCCTG 300
ACAACAATAT TGTAGTGAAA TCTAACTTTA CAACAATGCC AATATTACAA AACGGAGGAT 360
ACATATGGGA ATTGATTGAG TTGACACACT GCTCTCAATC AAATGGTCTA ATGGATGATA 420
ATTGTGAAAT CAAATTTTCT AAAAGACTAA GTGACTCAGT AATGACTAAT TATATGAATC 480
AAATATCTGA TTTACTTGGG CTTGATCTCA ATTCATGAAT TATGTTTAGT CTAATTTAAT 540
AGACATGTGT TTATCACCAT TTTAGTTAAT ATAAAACCTC ATCAAAGGGA AATGGGGCAA 600
ATAAACTCAC CTAATCAGTC AAACCATGAG CACTACAAAT GACAACACTA CTATGCAAAG 660
ATTGATGATC ACAGACATGA GACCCCTGTC GATGGAATCA ATAATAACAT CTCTCACCAA 720
AGAAATCATA ACACACAAAT TCATATACTT GATAAACAAT GAATGTATTG TAAGAAAACT 780
TGATGAAAGA CAAGCTACAT TTACATTCTT AGTCAATTAT GAGATGAAGC TATTGCACAA 840
AGTAGGGAGT ACCAAATACA AGAAATACAC TGAATATAAT ACAAAATATG GCACTTTCCC 900
CATGCCTATA TTTATCAATC ATGACGGGTT TCTAGAATGT ATTGGCATTA AGCCTACAAA 960
ACACACTCCT ATAATATACA AATATGACCT CAACCCGTAA ATTCCAACAA AAAACTAACC 1020
CATCCAAACT AAGCTATTCC TCAAACAACA GTGCTCAACA GTTAAGAAGG AGCTAATCCA 1080
TTTTAGTAAT TAAAAATAAA GGCAGAGCCA ATAACATAAA TTGGGGCAAA TACAAAGATG 1140
GCTCTTAGCA AAGTCAAGTT AAATGATACA TTAAATAAGG ATCAGCTGCT GTCATCCAGC 1200
AAATACACTA TTCAACGTAG TACAGGAGAT AATATTGACA CTCCCAATTA TGATGTGCAA 1260
AAACACCTAA ACAAACTATG TGGTATGCTA TTAATCACTG AAGATGCAAA TCATAAATTC 1320
ACAGGATTAA TAGGTATGTT ATATGCTATG TCCAGGTTAG GAAGGGAAGA CACTATAAAG 1380
ATACTTAAAG ATGCTGGATA TCATGTTAAA GCTAATGGAG TAGATATAAC AACATATCGT 1440
CAAGATATAA ACGGAAAGGA AATGAAATTC GAAGTATTAA CATTATCAAG CTTGACATCA 1500
GAAATACAAG TCAATATTGA GATAGAATCT AGAAAGTCCT ACAAAAAAAT GCTAAAAGAG 1560
ATGGGAGAAG TGGCTCCAGA ATATAGGCAT GATTCTCCAG ACTGTGGGAT GATAATACTG 1620
TGTATAGCTG CACTTGTAAT AACCAAGTTA GCAGCAGGAG ATAGATCAGG TCTTACAGCA 1680
GTAATTAGGA GGGCAAACAA TGTCTTAAAA AACGAAATAA AACGCTACAA GGGCCTCATA 1740
CCAAAGGATA TAGCTAACAG TTTTTATGAA GTGTTTGAAA AACACCCTCA TCTTATAGAT 1800
GTTTTTGTGC ACTTTGGCAT TGCACAATCA TCCACAAGAG GGGGTAGTAG AGTTGAAGGA 1860
ATCTTTGCAG GATTATTTAT GAATGCCTAT GGTTCAGGGC AAGTAATGCT AAGATGGGGA 1920
GTTCTAGCCA AATCTGTAAA AAATATCATG CTAGGACATG CTAGTGTCCA GGCAGAAATG 1980
GAACAAGTTG TGGAAGTTTA TGAGTATGCA CAGAAGTTGG GAGGAGAAGC TGGATTCTAC 2040
CATATATTGA ACAATCCAAA AGCATCATTG CTGTCATTAA CTCAATTTCC TAACTTCTCA 2100
AGTGTGGTCC TAGGCAATGC AGCAGGTCTA GGCATAATGG GAGAGTATAG AGGTACACCA 2160
AGAAACCAAG ATCTATATGA TGCAGCCAAA GCATATGCAG AGCAACTCAA AGAAAATGGA 2220
GTAATAAACT ACAGTGTATT AGACTTAACA GCAGAAGAAT TGGAAGCCAT AAAGCATCAA 2280
CTCAACCCCA AAGAAGATGA TGTAGAGCTT TAAGTTAACA AAAAATACGG GGCAAATAAG 2340
TCAACATGGA GAAGTTTGCA CCTGAATTTC ATGGAGAAGA TGCAAACAAC AAAGCTACCA 2400
AATTCCTAGA ATCAATAAAG GGCAAGTTTG CATCATCCAA AGATCCTAAG AAGAAAGATA 2460
GCATAATATC TGTTAACTCA ATAGATATAG AAGTAACTAA AGAGAGCCCG ATAACATCTG 2520
GCACCAACAT CATCAATCCA ATAAGTGAAG CTGATAGTAC CCCAGAAGCT AAAGCCAACT 2580
ACCCAAGAAA ACCCCTAGTA AGCTTCAAAG AAGATCTCAC CCCAAGTGAC AACCCCTTTT 2640
CTAAGTTGTA CAAAGAAACA ATAGAAACAT TTGATAACAA TGAAGAAGAA TCTAGCTACT 2700
CATATGAAGA AATAAATGAT CAAACAAATG ACAACATTAC AGCAAGACTA GATAGAATTG 2760
ATGAAAAATT AAGTGAAATA TTAGGAATGC TCCATACATT AGTAGTTGCA AGTGCAGGAC 2820
CCACCTCAGC TCGCGATGGA ATAAGAGATG CTATGGTTGG TCTAAGAGAA GAAATGATAG 2880
AAAAAATAAG AGCGGAAGCA TTAATGACCA ATGATAGGTT AGAGGCTATG GCAAGACTTA 2940
GGAATGAGGA AAGCGAAAAA ATGGCAAAAG ACACCTCAGA TGAAGTGTCT CTTAATCCAA 3000
CTTCCAAAAA ATTGAGTAAT TTGTTGGAAG ACAACGATAG TGACAATGAT CTATCACTTG 3060
ATGATTTTTG ATCAGTGATC AACTCACTCA GCAATCAACA ACATCAATGA AACAGACATC 3120
AATCCATTGA ATCAACTGCC AGACTGAACA CACAAACGTC CATCAGCAGA ACTACCAACC 3180
AATCAATCAA CCAATTGATC AATCAGCGAC CTAACAAAAT TAACAATATA GTAACAAAAA 3240
AAGAACAAGA TGGGGCAAAT ATGGAAACAT ACGTGAACAA GCTTCACGAG GGCTCCACAT 3300
ACACAGCAGC TGTTCAGTAC AATGTTCTAG AAAAAGATGA TGATCCTGCA TCACTAACAA 3360
TATGGGTGCC TATGTTCCAG TCATCTGTGC CAGCAGACTT GCTCATAAAA GAACTTGCAA 3420
GCATCAACAT ACTAGTGAAG CAGATCTCCA CGCCCAAAGG ACCTTCACTA CGAGTCACGA 3480
TTAACTCAAG AAGTGCTGTG CTGGCACAAA TGCCTAGTAG TTTTATCATA AGTGCAAATG 3540
TATCATTAGA TGAAAGAAGC AAATTAGCAT ATGATGTAAC TACACCTTGT GAAATCAAAG 3600
CATGCAGTCT AACATGCTTA AAAGTAAAAA GTATGTTAAC TACAGTCAAA GATCTTACCA 3660
TGAAAACATT CAATCCCACT CATGAGATTA TTGCTCTATG TGAATTTGAA AATATTATGA 3720
CATCAAAAAG AGTAATAATA CCAACCTATC TAAGATCAAT TAGTGTCAAA AACAAGGACC 3780
TGAACTCACT AGAAAATATA GCAACCACCG AATTCAAAAA TGCTATCACC AATGCGAAAA 3840
TTATTCCCTA TGCAGGATTA GTATTAGTTA TCACAGTTAC TGACAATAAA GGAGCATTCA 3900
AATATATCAA GCCACAGAGT CAATTTATAG TAGATCTTGG GGCCTACCTA GAAAAAGAGA 3960
GCATATATTA TGTGACTACA AATTGGAAGC ATACAGCTAC ACGTTTTTCA ATCAAACCAC 4020
TAGAGGATTA AACTTAATTA TCAACACTAA ATGACAGGTC CACATATATC TTCAAACTAT 4080
ACATTATATC CAAACATCAT GAGCATTTAC ACTACACACT TTTACCATAT AAATCAATCT 4140
CATTTAAAAT CCAAAATTAC TTCCAGCTAT CATCTGTTAG ACCTAGAGTG CGAATAGGTA 4200
AATAAAACCA AAATATGGGG TAAATAGACA TTAGTTAGAG TTCAATCAAT CTCAACAACC 4260
ATTTATACCG CCAATTCAGT ACATATACTA TAAATCTCAA AATGGGAAAT ACATCCATCA 4320
CAATAGAATT CACAAGCAAA TTTTGGCCTT ATTTTACACT AATACATATG ATCTTAACTC 4380
TAATCTCTTT ACTAATTATA ATCACTATTA TGATTGCAAT ACTAAATAAG CTAAGTGAAC 4440
ATAAAACATT CTGCAACAAA ACTCTTGAAC TAGGACAGAT GTATCAAATC AACACATAGT 4500
GTTCTACCAT TATGCTGTGT CAAATTATAA TCTTGTATAT ATAAACAAAC AAATCCAATC 4560
TTCTCACAGA GTCATGGTGG CGCAAAACCA CGCCAACCAT CATGATAGCA TAGAGTAGTT 4620
ATTTAAAAAT TAACATAATG ATGAATTATT GGTATGAGAT CAGGAACAAC ATTGGGGCAA 4680
ATGCAGCCAT GTCCAAGCAC AAGAATCGGC GCACTGCCGG GACTCTAGAA AGGACCTGGG 4740
ATACTCTTAA TCATCTAATT GTAATATCCT CTTGTTTATA CAGATTAAAT TTAAAATCTA 4800
TAGCACAAAT AGCACTGTCA GTTTTGGCAA TGATAATCTC AACCTCTCTC ATAATTGCAG 4860
CCATAATATT CATCATCTCT GCCAATCACA AAGTTACACT AACAACGGTT ACAGTTCAAA 4920
CAATAAAAAA CCACACTGAA AAAAACATCT CCACCTACCT TACTCAAGTC CCACCAGAAA 4980
GGGTCAACTC ATCCAAACAA CCCACAACCA CATCACCAAT CCACACAAAT TCAGCCACAA 5040
TATCACCAAA TACAAAATCA GAAACACACC ATACAACAGC ACAAACCAAA GGCAGAATCA 5100
CCACTTCAAC ACAGACCAAC AAGCCAAGCA CAAAATCACG TTCAAAAAAT CCACCAAAAA 5160
AACCAAAAGA TGATTACCAT TTTGAAGTGT TCAATTTTGT TCCCTGTAGT ATATGTGGTA 5220
ATAATCAACT CTGCAAATCC ATCTGCAAAA CAATACCAAG CAACAAACCA AAGAAAAAAC 5280
CAACCATCAA ACCCACAAAC AAACCAACCA CCAAAACCAC AAACAAAAGA GACCCCAAAA 5340
CACCAGCCAA AATGCCAAAA AAAGAAATCA TCACCAACCC AGCAAAAAAA CCAACCCTCA 5400
AGACCACAGA AAGAGACACC AGCATTTCAC AATCCACCGT GCTCGACACA ATCACTCCAA 5460
AATACACAAT CCAACAGCAA TCCCTCCACT CAACCACCTC CGAAAACACA CCCAGCTCCA 5520
CACAAATACC CACAGCATCC GAGCCCTCCA CATTAAATCC TAATTAAAAA ACCTAGTCAC 5580
ATGCTTAGTT ATTCAAAAAC TACATCTTAG CAGAGAACCG TGATCTATCA AGCAAGAACA 5640
AAATTAAACC TGGGGCAAAT AACCATGGAG TTGCTGATCC ACAGGTCAAG TGCAATCTTC 5700
CTAACTCTTG CTGTTAATGC ATTGTACCTC ACCTCAAGTC AGAACATAAC TGAGGAGTTT 5760
TACCAATCGA CATGTAGTGC AGTTAGCAGA GGTTATTTTA GTGCTTTAAG AACAGGTTGG 5820
TATACCAGTG TCATAACAAT AGAATTAAGT AATATAAAAG AAACCAAATG CAATGGAACT 5880
GACACTAAAG TAAAACTTAT AAAACAAGAA TTAGATAAGT ATAAGAATGC AGTAACAGAA 5940
TTACAGCTAC TTATGCAAAA CACGCCAGCT GCCAACAACC GGGCCAGAAG AGAAGCACCA 6000
CAGTACATGA ACTACACAAT CAATACCACA AAAAACCTAA ATGTATCAAT AAGCAAGAAA 6060
AGGAAACGAA GATTTCTGGG CTTCTTGTTA GGTGTAGGAT CTGCAATAGC AAGTGGTATA 6120
GCTGTATCCA AAGTTTTACA CCTTGAAGGA GAAGTGAACA AAATCAAAAA TGCTTTGTTG 6180
TCTACAAACA AAGCTGTAGT CAGTCTATCA AATGGGGTCA GTGTTTTAAC CAGCAAAGTG 6240
TTAGATCTCA AGAATTACAT AAATAACCGA ATATTACCCA TAGTAAATCA ACAGAGCTGT 6300
CGCATCTCCA ACATTGAAAC AGTTATAGAA TTCCAGCAGA AGAATAGCAG ATTGTTGGAA 6360
ATCACCAGAG AATTTAGTGT TAATGCAGGT GTAACAACAC CTTTAAGCAC TTACATGTTA 6420
ACAAACAGTG AGTTACTATC ATTGATCAAT GATATGCCTA TAACAAATGA CCAGAAAAAA 6480
TTAATGTCAA GCAATGTTCA GATAGTAAGG CAACAAAGTT ATTCTATCAT GTCTATAATA 6540
AAGGAAGAAG TCCTTGCATA TGTTGTACAG CTACCTATCT ATGGTGTAAT AGATACACCT 6600
TGCTGGAAAT TACACACATC ACCTCTATGC ACCACCAACA TCAAAGAAGG ATCAAATATT 6660
TGTTTAACAA GGACTGATAG AGGATGGTAT TGTGATAATG CAGGATCAGT ATCCTTCTTC 6720
CCACAGGCTG ATACTTGCAA AGTACAGTCC AATCGAGTAT TTTGTGACAC TATGAACAGT 6780
TTAACATTAC CAAGTGAAGT CAGCCTTTGT AACACTGACA TATTCAATTC CAAGTATGAC 6840
TGCAAAATTA TGACATCAAA AACAGACATA AGCAGCTCAG TAATTACTTC TCTTGGAGCT 6900
ATAGTGTCAT GCTATGGAAA AACTAAATGC ACTGCATCCA ATAAAAATCG TGGGATTATA 6960
AAGACATTTT CTAATGGTTG TGACTATGTG TCAAACAAAG GAGTAGATAC TGTGTCAGTG 7020
GGCAACACTT TATACTATGT AAACAAGCTG GAAGGCAAAA ACCTTTATGT AAAAGGGGAA 7080
CCTATAATAA ATTACTATGA TCCTCTAGTG TTTCCTTCTG ATGAGTTTGA TGCATCAATA 7140
TCTCAAGTCA ATGAAAAAAT CAATCAAAGT TTAGCTTTTA TTCGTAGATC TGATGAATTA 7200
CTACATAATG TAAATACTGG CAAATCTACT ACAAATATTA TGATAACTAC AATTATTATA 7260
GTAATCATTG TAGTATTGTT ATCATTAATA GCTATTGGTT TACTGTTGTA TTGCAAAGCC 7320
AAAAACACAC CAGTTACACT AAGCAAAGAC CAACTAAGTG GAATCAATAA TATTGCATTC 7380
AGCAAATAGA CAAAAAACTA CTTAATCATG TTTCAACAAC AATCTGCTGA CCACCAATCC 7440
CAAATCAACT TAACAACAAA TATTTCAACA TCATAGCACA GGCTGAATCA TTTCCTCATA 7500
TCATGCTACC TACACAACTA AGCTAGATCT TCAACTCATA GTTACATAAA AACCCCAAGT 7560
ATCACAATCA AACACTAAAT CGACACATCA TTCACAAAAT TAACAACTGG GGCAAATATG 7620
TCGCGAAGAA ATCCTTGTAA ATTTGAGATT AGAGGTCATT GCTTGAATGG TAGAAGATGT 7680
CACTACAGTC ATAATTATTT TGAATGGCCT CCTCATGCAT TACTAGTGAG GCAAAACTTC 7740
ATGTTAAACA AGATACTTAA GTCAATGGAC AAAAGCATAG ACACTTTGTC GGAAATAAGT 7800
GGAGCTGCTG AACTGGATAG AACAGAAGAA TATGCTCTTG GTATAGTTGG AGTGCTAGAG 7860
AGTTACATAG GATCAATAAA CAACATAACA AAACAATCAG CATGTGTTGC TATGAGTAAA 7920
CTTCTTATTG AGATCAACAG TGATGACATT AAAAAACTGA GAGATAACGA AGAACCCAAT 7980
TCGCCTAAGA TAAGAGTGTA CAATACTGTT ATATCATACA TTGAGAGCAA TAGAAAAAAC 8040
AACAAGCAAA CCATCCATCT GCTCAAAAGA CTACCAGCAG ACGTGCTGAA GAAGACAATA 8100
AAGAACACAT TAGATATCCA CAAAAGCATA ACCATAAGCA ACTCAAAAGA GTCAACCGTG 8160
AATGATCAAA ATGACCAAAC CAAAAATAAT GATATTACCG GATAAATATC CTTGTAGTAT 8220
ATCATCCATA TTGATTTCAA GTGAAAGCAT GATTGCTACA TTCAATCATA AAAACATATT 8280
ACAATTTAAC CATAACCATT TGGATAACCA CCAGTGTTTA TTAAATCATA TATTTGATGA 8340
AATTCATTGG ACACCTAAAA ACTTATTAGA TGCCACTCAA CAATTTCTCC AACATCTTAA 8400
CATCCCTGAA GATATATATA CAGTATATAT ATTAGTGTCA TAATGCTTGA CCATAACAAT 8460
TTTATATCAT TCAACCATAA AACAACCTTA ATAAGGTTAT GGGACAAAAT GGATCCCATT 8520
ATTAATGGAA ACTCTGCCAA TGTGTATCTA ACTGATAGTT ATCTAAAAGG TGTTATCTCT 8580
TTTTCAGAAT GTAATGCTTT AGGGAGTTAC CTTTTTAACG GCCCCTATCT TAAAAATGAT 8640
TACACCAACT TAATTAGTAG ACAAAGCCCA CTACTAGAGC ATATGAATCT AAAAAAACTA 8700
ACTATAACAC AGTCATTAAT ATCTAGATAT CATAAAGGTG AACTGAAGTT AGAAGAACCA 8760
ACTTATTTCC AGTCATTACT TATGACATAT AAAAGTATGT CCTCGTCTGA ACAAATTGCT 8820
ACAACTAATT TACTTAAAAA AATAATACGA AGAGCTATAG AAATAAGTGA TGTAAAGGTG 8880
TACGCCATCT TGAATAAACT GGGACTAAAG GAAAAGGACA GAGTTAAGCC CAACAATAAT 8940
TCAGGTGATG AAAACTCAGT TCTTACAACC ATAATCAAAG ATGATATACT TTCAGCTGTG 9000
GAAAACAATC AATCATATAC AAATTCAGAC AAAAATCATT CAGTAAATCA AAATATCACT 9060
ATCAAAACAA CACTCTTGAA AAAATTGATG TGTTCAATGC AACATCCTCC ATCATGGTTA 9120
ATACACTGGT TCAATTTATA TACAAAATTA AATAACATAT TAACACAATA TCGATCAAAT 9180
GAGGTAAAAA GTCATGGGTT TATATTAATA GATAATCAAA CTTTAAGTGA TTTTCAGTTT 9240
ATTTTAAATC AATATGGTTG TATCGTTTAT CATAAAGGAC TCAAAAAAAT CACAACTACT 9300
ACTTACAATC AATTTTTGAC ATGGAAAGAC ATCAGCCTTA GCAGATTAAA TGTTTGCTTA 9360
ATTACTTGGA TAAGTAATTG TTTAAATACA TTAAATAAAA GCTTAGGGCT GAGATGTGGA 9420
TTCAATAATG TTGTGTTATC ACAACTATTT CTTTATGGAG ATTGTATACT GAAATTATTC 9480
CATAATGAAG GCTTCTACAT AATAAAAGAA GTAGAGGGAT TTATTATGTC TTTAATTCTA 9540
AACATAACAG AAGAAGATCA ATTTAGGAAA CGATTTTATA ATAGCATGCT AAATAACATC 9600
ACAGATGCAG CTATTAAGGC TCAAAAAAAC CTACTATCAA GAGTATGTCA CACTTTATTA 9660
GACAAGACAG TGTCTGATAA TATCATAAAT GGTAAATGGA TAATCCTATT AAGTAAATTT 9720
CTTAAATTGA TTAAGCTTGC AGGTGATAAT AATCTCAATA ACTTGAGTGA GCTTTATTTT 9780
CTCTTCAGAA TCTTTGGACA TCCAATGGTC GATGAAAGAC AAGCAATGGA TGCTGTAAGA 9840
ATTAACTGTA ATGAAACCAA GTTCTACTTA TTAAGTAATC TAAGTACGTT AAGAGGTGCT 9900
TTCATTTATA GAATCATAAA GGGGTTTGTA AATACCTACA ACAGATGGCC CACTTTAAGG 9960
AATGCTATTG TTCTACCTCT AAGATGGTTG AACTATTATA AACTTAATAC TTATCCATCT 10020
CTACTTGAAA TCACAGAGAA AGATTTGATT ATTTTATCAG GATTGCGGTT CTATCGTGAG 10080
TTTCATCTGC CTAAAAAAGT GGATCTTGAA ATGATAATAA ATGACAAAGC CATTTCACCT 10140
CCAAAAGATT TAATATGGAC TAGTTTTCCT AGAAATTACA TGCCATCACA TATACAAAAT 10200
TATATAGAAC ATGAAAAGTT GAAGTTCTCT GAAAGTGACA GATCAAGAAG AGTACTAGAG 10260
TATTACTTGA GAGATAATAA ATTCAATGAA TGCGATCTAT ACAATTGTGT GGTCAATCAA 10320
AGCTATCTCA ACAACTCTAA CCATGTGGTA TCACTAACTG GTAAAGAAAG AGAGCTCAGT 10380
GTAGGTAGAA TGTTTGCTAT GCAACCAGGT ATGTTTAGGC AAATTCAAAT CTTAGCAGAG 10440
AAAATGATAG CCGAAAATAT TTTACAATTC TTCCCTGAGA GTTTGACAAG ATATGGTGAT 10500
CTAGAGCTTC AAAAGATATT AGAATTAAAA GCAGGAATAA GCAACAAGTC AAATCGTTAT 10560
AATGATAACT ACAACAATTA TATCAGTAAA TGTTCTATCA TTACAGACCT TAGCAAATTC 10620
AATCAAGCAT TTAGATATGA AACATCATGT ATCTGCAGTG ATGTATTAGA TGAACTGCAT 10680
GGAGTACAAT CTCTGTTCTC TTGGTTGCAT TTAACAATAC CTCTTGTCAC AATAATATGT 10740
ACATATAGAC ATGCACCTCC TTTTATAAAG GATCATGTTG TTAATCTTAA TAAAGTTGAT 10800
GAACAAAGTG GATTATACAG ATATCATATG GGTGGTATTG AAGGCTGGTG TCAAAAACTG 10860
TGGACCATTG AAGCTATATC ATTATTAGAT CTAATATCTC TCAAAGGGAA ATTCTCTATC 10920
ACAGCTCTAA TAAATGGTGA TAATCAGTCA ATTGATATAA GTAAACCAGT TAGACTTATA 10980
GAGGGTCAGA CCCATGCTCA AGCAGATTAT TTGTTAGCAT TAAATAGCCT TAAATTGCTA 11040
TATAAAGAGT ATGCGGGCAT AGGCCACAAG CTCAAGGGAA CAGAGACCTA TATATCCCGA 11100
GATATGCAAT TCATGAGCAA AACAATCCAG CACAATGGAG TGTACTATCC AGCCAGTATC 11160
AAAAAAGTCC TGAGAGTAGG TCCATGGATA AATACAATAC TTGATGATTT TAAAGTTAGT 11220
TTAGAATCTA TAGGTAGCTT AACACAGGAG TTAGAATATA GAGGAGAGAG CTTATTATGC 11280
AGTTTAATAT TTAGGAACAT TTGGTTATAC AATCAAATTG CTTTGCAACT CCGAAATCAT 11340
GCATTATGTC ACAATAAGCT ATATTTAGAT ATATTGAAAG TATTAAAACA CTTAAAAACT 11400
TTTTTTAATC TTGATAGTAT TGATATGGCT TTAACATTGT ATATGAATTT GCCTATGCTG 11460
TTTGGTGGTG GTGATCCTAA TTTGTTATAT CGAAGCTTTT ATAGGAGAAC TCCAGACTTC 11520
CTTACAGAAG CTATAGTACA TTCAGTGTTT GTGTTGAGCT ATTATACTGG TCACGATTTA 11580
CAAGATAAGC TCCAGGATCT TCCAGATGAT AGACTGAACA AATTCTTGAC ATGTATCATC 11640
ACGTTTGATA AAAATCCCAA TGCCGAGTTT GTAACATTGA TGAGAGATCC ACAGGCTTTA 11700
GGGTCTGAAA GGCAAGCAAA AATTACTAGT GAGATTAATA GATTAGCAGT GACAGAAGTC 11760
TTAAGTATAG CTCCAAACAA AATATTTTCT AAAAGTGCAC AACATTATAC TACCACTGAG 11820
ATTGATCTAA ATGATATTAT GCAAAATATA GAACCAACTT ACCCTCATGG ATTAAGAGTT 11880
GTTTATGAAA GTTTACCTTT TTATAAAGCA GAAAAAATAG TTAATCTTAT ATCAGGAACA 11940
AAATCCATAA CTAATATACT TGAAAAAACA TCAGCAATAG ATTCAACTGA TATTAATAGG 12000
GCTACTGATA TGATGAGGAA AAATATAACT TTACTTATAA GGATACTTCC ACTAGATTGT 12060
AACAAAGACA AAAGAGAGTT ATTAAGTTTA GAAAATCTTA GTATAACTGA ATTAAGCAAG 12120
TATGTAAGAG AAAGATCTTG GTCGTTATCC AATATAGTAG GAGTAACATC GCCAAGTATT 12180
ATGTTCACAA TGGACATTAA ATATACAACT AGCACTATAG CCAGTGGTAT AATTATAGAA 12240
AAATATAATG TTAATAGTTT AACTCGTGGT GAAAGAGGAC CTACTAAGCC ATGGGTAGGT 12300
TCATCTACGC AGGAGAAAAA AACAATGCCA GTGTACAATA GACAAGTTTT AACCAAAAAG 12360
CAAAGAGACC AAATAGATTT ATTAGCAAAA TTAGACTGGG TATATGCATC CATAGACAAC 12420
AAAGATGAAT TCATGGAAGA ACTGAGTACT GGAACACTTG GACTGTCATA TGAGAAAGCC 12480
AAAAAATTGT TTCCACAATA TCTAAGTGTC AATTATTTAC ACCGCTTAAC AGTCAGTAGT 12540
AGACCATGTG AATTCCCTGC ATCAATACCA GCTTATAGAA CAACAAATTA TCATTTCGAT 12600
ACTAGTCCTA TCAACCATGT ATTAACAGAA AAGTATGGAG ATGAAGATAT CGACATTGTG 12660
TTTCAAAATT GCATAAGTTT TGGTCTTAGC TTAATGTCGG TTGTGGAACA ATTCACAAAC 12720
ATATGTCCTA ATAGAATTAT TCTCATACCG AAGCTGAATG AGATACATTT GATGAAACCT 12780
CCTATATTTA CAGGAGATGT TGATATCATC AAGTTGAAGC AAGTGATACA AAAACAGCAC 12840
ATGTTCCTAC CAGATAAAAT AAGTTTAACC CAATATGTAG AATTATTCCT AAGTAACAAA 12900
GCACTTAAAT CTGGATCTCA CATCAACTCT AATTTAATAT TAGTACATAA AATGTCTGAT 12960
TATTTTCATA ATGCTTATAT TTTAAGTACT AATTTAGCTG GACATTGGAT TCTGATTATT 13020
CAACTTATGA AGGATTCAAA AGGTATTTTT GAAAAAGATT GGGGAGAGGG GTATATAACT 13080
GATCATATGT TCATTAATTT GAATGTTTTC TTTAATGCTT ATAAGACTTA TTTGCTATGT 13140
TTTCATAAAG GTTATGGTAA AGCAAAATTA GAATGTGATA TGAACACTTC AGATCTTCTT 13200
TGTGTTTTGG AGCTAATAGA CAGTAGCTAC TGGAAATCTA TGTCTAAAGT TTTCCTAGAA 13260
CAAAAAGTCA TAAAATACAT AATCAATCAA GACACAAGTT TGCATAGAAT AAAAGGTTGT 13320
CATAGTTTTA AGTTATGGTT TTTAAAACGC CTTAATAATG CTAAATTTAC CGTATGCCCT 13380
TGGGTTGTTA ACATAGATTA TCACCCAACA CACATGAAAG CTATATTATC TTACATAGAT 13440
TTAGTTAGAA TGGGGTTAAT AAATGTAGAT AAATTAACCA TTAAAAATAA AAATAAATTC 13500
AATGATGAAT TTTACACATC AAATCTCTTT TACATTAGTT ATAACTTTTC AGATAACACT 13560
CATTTGCTAA CAAAACAAAT AAGAATTGCT AATTCAGAAT TAGAAAATAA TTATAACAAA 13620
CTATATCACC CAACCCCAGA AACTTTAGAA AATATGTCAT TAATTCCTGT CAAAAGTAAT 13680
AATAGTAATA AACCTAAATT TGGTATAAGT GGAAATACCG AATCTATGAT GACGTCAACA 13740
TTCTCCAATA AAACGCATAT TAAATCTTCC GCTGTTATTA CAAGATTCAA TTATAGTAAA 13800
CAAGACTTGT ACAATTTATT TCCAATTGTC GTGATAGACA GGATTATAGA TCATTCAGGT 13860
AATACAGCAA AATCTAACCA ACTCTACACT ACCACTTCAC ATCAGACATC TTTAGTAAGG 13920
AATAGTGCAT CACTTTATTG CATGCTTCCT TGGCATCATG TCAATAGATT TAACTTTGTA 13980
TTTAGTTCCA CAGGATGCAA GATCAGTATA GAGTATATTT TAAAAGATCT TAAGATTAAA 14040
GACCCCAGTT GTATAGCATT CATAGGTGAA GGAGCTGGTA ACTTATTATT ACGTACAGTA 14100
GTAGAACTTC ATCCAGACAT AAGATACATT TACAGAAGTT TAAAAGATTG CAATGATCAT 14160
AGTTTACCTA TTGAATTTCT AAGGTTATAC AACGGGCATA TAAACATAGA TTATGGTGAG 14220
AATTTAACCA TTCCTGCTAC AGATGCAACT AATAACATTC ATTGGTCTTA TTTACATATA 14280
AAATTTGCAG AACCTATTAG CATTTTTGTC TGCGATGCTG AATTACCTGT TACAGCCAAT 14340
TGGAGTAAAA TTATAATTGA ATGGAGTAAG CATGTAAGAA AGTGCAAGTA CTGTTCCTCT 14400
GTAAATAGAT GCATTTTAAT TGCAAAATAT CATGCCCAAG ATGATATTGA TTTCAAATTA 14460
GATAACATTA CTATATTAAA AACTTACGTG TGCCTAGGTA GCAAGTTAAA AGGATCTGAA 14520
GTTTACTTAG TCCTTACAAT AGGCCCTGCA AATATACTTC CTGTTTTTAA TGTTGTGCAA 14580
AATGCTAAAT TGATTCTTTC AAGGACTAAA AATTTCATTA TGCCTAAAAA AACTGACAAA 14640
GAATCTATCG ATGCAAATAT TAAAAGCTTA ATACCTTTCC TTTGTTACCC TATAACAAAA 14700
AAAGGAATTA AGACTTCATT GTCAAAATTG AAGAGTGTAG TTAGTGGAGA TATATTATCA 14760
TATTCTATAG CTGGACGTAA TGAAGTATTC AGCAACAAGC TTATAAACCA CAAGCATATG 14820
AATATCCTAA AATGGCTAGA TCATGTTTTA AACTTTAGAT CAGCTGAACT TAATTACAAT 14880
CATTTATATA TGATAGAGTC CACATATCCT TACTTAAGTG AATTGTTAAA CAGTTTAACA 14940
ACCAATGAGC TCAAGAAGCT GATTAAAATA ACAGGTAGTG TACTATACAA CCTTCCCAAC 15000
GAACAGTAAC TTAAAACATC ATTAACAAGT TTGATCAAAT TTAGATGCTA ACACATCATA 15060
ATATTATAGT TATTAAAAAA TATATATGCA AACTTTTCAA TAATTTAGCA TATTGATTCC 15120
AAAGTTATCA TTTTGGTCTT AAGGGGTTGA ATAAAAATCT AAAACTAACA ATTATACATG 15180
TGCATTTACA ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15229
(2) Information for SEQ ID NO:26:
...
(i) sequence characteristics:
(A) length: 2166 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:26:
Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp
1               5                   10                  15
Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
            20                  25                  30
Ser Tyr Leu Phe Asn Gly Pro Tyr Leu Lys Asn Asp Tyr Thr Asn Leu
        35                  40                  45
Ile Ser Arg Gln Ser Pro Leu Leu Glu His Met Asn Leu Lys Lys Leu
    50                  55                  60
Thr Ile Thr Gln Ser Leu Ile Ser Arg Tyr His Lys Gly Glu Leu Lys
65                  70                  75                  80
Leu Glu Glu Pro Thr Tyr Phe Gln Ser Leu Leu Met Thr Tyr Lys Ser
                85                  90                  95
Met Ser Ser Ser Glu Gln Ile Ala Thr Thr Asn Leu Leu Lys Lys Ile
            100                 105                 110
Ile Arg Arg Ala Ile Glu Ile Ser Asp Val Lys Val Tyr Ala Ile Leu
        115                 120                 125
Asn Lys Leu Gly Leu Lys Glu Lys Asp Arg Val Lys Pro Asn Asn Asn
    130                 135                 140
Ser Gly Asp Glu Asn Ser Val Leu Thr Thr Ile Ile Lys Asp Asp Ile
145                 150                 155                 160
Leu Ser Ala Val Glu Asn Asn Gln Ser Tyr Thr Asn Ser Asp Lys Asn
                165                 170                 175
His Ser Val Asn Gln Asn Ile Thr Ile Lys Thr Thr Leu Leu Lys Lys
            180                 185                 190
Leu Met Cys Ser Met Gln His Pro Pro Ser Trp Leu Ile His Trp Phe
        195                 200                 205
Asn Leu Tyr Thr Lys Leu Asn Asn Ile Leu Thr Gln Tyr Arg Ser Asn
    210                 215                 220
Glu Val Lys Ser His Gly Phe Ile Leu Ile Asp Asn Gln Thr Leu Ser
225                 230                 235                 240
Asp Phe Gln Phe Ile Leu Asn Gln Tyr Gly Cys Ile Val Tyr His Lys
                245                 250                 255
Gly Leu Lys Lys Ile Thr Thr Thr Thr Tyr Asn Gln Phe Leu Thr Trp
            260                 265                 270
Lys Asp Ile Ser Leu Ser Arg Leu Asn Val Cys Leu Ile Thr Trp Ile
        275                 280                 285
Ser Asn Cys Leu Asn Thr Leu Asn Lys Ser Leu Gly Leu Arg Cys Gly
    290                 295                 300
Phe Asn Asn Val Val Leu Ser Gln Leu Phe Leu Tyr Gly Asp Cys Ile
305                 310                 315                 320
Leu Lys Leu Phe His Asn Glu Gly Phe Tyr Ile Ile Lys Glu Val Glu
                325                 330                 335
Gly Phe Ile Met Ser Leu Ile Leu Asn Ile Thr Glu Glu Asp Gln Phe
            340                 345                 350
Arg Lys Arg Phe Tyr Asn Ser Met Leu Asn Asn Ile Thr Asp Ala Ala
        355                 360                 365
Ile Lys Ala Gln Lys Asn Leu Leu Ser Arg Val Cys His Thr Leu Leu
    370                 375                 380
Asp Lys Thr Val Ser Asp Asn Ile Ile Asn Gly Lys Trp Ile Ile Leu
385                 390                 395                 400
Leu Ser Lys Phe Leu Lys Leu Ile Lys Leu Ala Gly Asp Asn Asn Leu
                405                 410                 415
Asn Asn Leu Ser Glu Leu Tyr Phe Leu Phe Arg Ile Phe Gly His Pro
            420                 425                 430
Met Val Asp Glu Arg Gln Ala Met Asp Ala Val Arg Ile Asn Cys Asn
        435                 440                 445
Glu Thr Lys Phe Tyr Leu Leu Ser Asn Leu Ser Thr Leu Arg Gly Ala
    450                 455                 460
Phe Ile Tyr Arg Ile Ile Lys Gly Phe Val Asn Thr Tyr Asn Arg Trp
465                 470                 475                 480
Pro Thr Leu Arg Asn Ala Ile Val Leu Pro Leu Arg Trp Leu Asn Tyr
                485                 490                 495
Tyr Lys Leu Asn Thr Tyr Pro Ser Leu Leu Glu Ile Thr Glu Lys Asp
            500                 505                 510
Leu Ile Ile Leu Ser Gly Leu Arg Phe Tyr Arg Glu Phe His Leu Pro
        515                 520                 525
Lys Lys Val Asp Leu Glu Met Ile Ile Asn Asp Lys Ala Ile Ser Pro
    530                 535                 540
Pro Lys Asp Leu Ile Trp Thr Ser Phe Pro Arg Asn Tyr Met Pro Ser
545                 550                 555                 560
His Ile Gln Asn Tyr Ile Glu His Glu Lys Leu Lys Phe Ser Glu Ser
                565                 570                 575
Asp Arg Ser Arg Arg Val Leu Glu Tyr Tyr Leu Arg Asp Asn Lys Phe
            580                 585                 590
Asn Glu Cys Asp Leu Tyr Asn Cys Val Val Asn Gln Ser Tyr Leu Asn
        595                 600                 605
Asn Ser Asn His Val Val Ser Leu Thr Gly Lys Glu Arg Glu Leu Ser
    610                 615                 620
Val Gly Arg Met Phe Ala Met Gln Pro Gly Met Phe Arg Gln Ile Gln
625                 630                 635                 640
Ile Leu Ala Glu Lys Met Ile Ala Glu Asn Ile Leu Gln Phe Phe Pro
                645                 650                 655
Glu Ser Leu Thr Arg Tyr Gly Asp Leu Glu Leu Gln Lys Ile Leu Glu
            660                 665                 670
Leu Lys Ala Gly Ile Ser Asn Lys Ser Asn Arg Tyr Asn Asp Asn Tyr
        675                 680                 685
Asn Asn Tyr Ile Ser Lys Cys Ser Ile Ile Thr Asp Leu Ser Lys Phe
    690                 695                 700
Asn Gln Ala Phe Arg Tyr Glu Thr Ser Cys Ile Cys Ser Asp Val Leu
705                 710                 715                 720
Asp Glu Leu His Gly Val Gln Ser Leu Phe Ser Trp Leu His Leu Thr
                725                 730                 735
Ile Pro Leu Val Thr Ile Ile Cys Thr Tyr Arg His Ala Pro Pro Phe
            740                 745                 750
Ile Lys Asp His Val Val Asn Leu Asn Lys Val Asp Glu Gln Ser Gly
        755                 760                 765
Leu Tyr Arg Tyr His Met Gly Gly Ile Glu Gly Trp Cys Gln Lys Leu
    770                 775                 780
Trp Thr Ile Glu Ala Ile Ser Leu Leu Asp Leu Ile Ser Leu Lys Gly
785                 790                 795                 800
Lys Phe Ser Ile Thr Ala Leu Ile Asn Gly Asp Asn Gln Ser Ile Asp
                805                 810                 815
Ile Ser Lys Pro Val Arg Leu Ile Glu Gly Gln Thr His Ala Gln Ala
            820                 825                 830
Asp Tyr Leu Leu Ala Leu Asn Ser Leu Lys Leu Leu Tyr Lys Glu Tyr
        835                 840                 845
Ala Gly Ile Gly His Lys Leu Lys Gly Thr Glu Thr Tyr Ile Ser Arg
    850                 855                 860
Asp Met Gln Phe Met Ser Lys Thr Ile Gln His Asn Gly Val Tyr Tyr
865                 870                 875                 880
Pro Ala Ser Ile Lys Lys Val Leu Arg Val Gly Pro Trp Ile Asn Thr
                885                 890                 895
Ile Leu Asp Asp Phe Lys Val Ser Leu Glu Ser Ile Gly Ser Leu Thr
            900                 905                 910
Gln Glu Leu Glu Tyr Arg Gly Glu Ser Leu Leu Cys Ser Leu Ile Phe
        915                 920                 925
Arg Asn Ile Trp Leu Tyr Asn Gln Ile Ala Leu Gln Leu Arg Asn His
    930                 935                 940
Ala Leu Cys His Asn Lys Leu Tyr Leu Asp Ile Leu Lys Val Leu Lys
945                 950                 955                 960
His Leu Lys Thr Phe Phe Asn Leu Asp Ser Ile Asp Met Ala Leu Thr
                965                 970                 975
Leu Tyr Met Asn Leu Pro Met Leu Phe Gly Gly Gly Asp Pro Asn Leu
            980                 985                 990
Leu Tyr Arg Ser Phe Tyr Arg Arg Thr Pro Asp Phe Leu Thr Glu Ala
        995                 1000                1005
Ile Val His Ser Val Phe Val Leu Ser Tyr Tyr Thr Gly His Asp Leu
    1010                1015                1020
Gln Asp Lys Leu Gln Asp Leu Pro Asp Asp Arg Leu Asn Lys Phe Leu
1025                1030                1035                1040
Thr Cys Ile Ile Thr Phe Asp Lys Asn Pro Asn Ala Glu Phe Val Thr
                1045                1050                1055
Leu Met Arg Asp Pro Gln Ala Leu Gly Ser Glu Arg Gln Ala Lys Ile
            1060                1065                1070
Thr Ser Glu Ile Asn Arg Leu Ala Val Thr Glu Val Leu Ser Ile Ala
        1075                1080                1085
Pro Asn Lys Ile Phe Ser Lys Ser Ala Gln His Tyr Thr Thr Thr Glu
    1090                1095                1100
Ile Asp Leu Asn Asp Ile Met Gln Asn Ile Glu Pro Thr Tyr Pro His
1105                1110                1115                1120
Gly Leu Arg Val Val Tyr Glu Ser Leu Pro Phe Tyr Lys Ala Glu Lys
                1125                1130                1135
Ile Val Asn Leu Ile Ser Gly Thr Lys Ser Ile Thr Asn Ile Leu Glu
            1140                1145                1150
Lys Thr Ser Ala Ile Asp Ser Thr Asp Ile Asn Arg Ala Thr Asp Met
        1155                1160                1165
Met Arg Lys Asn Ile Thr Leu Leu Ile Arg Ile Leu Pro Leu Asp Cys
    1170                1175                1180
Asn Lys Asp Lys Arg Glu Leu Leu Ser Leu Glu Asn Leu Ser Ile Thr
1185                1190                1195                1200
Glu Leu Ser Lys Tyr Val Arg Glu Arg Ser Trp Ser Leu Ser Asn Ile
                1205                1210                1215
Val Gly Val Thr Ser Pro Ser Ile Met Phe Thr Met Asp Ile Lys Tyr
            1220                1225                1230
Thr Thr Ser Thr Ile Ala Ser Gly Ile Ile Ile Glu Lys Tyr Asn Val
        1235                1240                1245
Asn Ser Leu Thr Arg Gly Glu Arg Gly Pro Thr Lys Pro Trp Val Gly
    1250                1255                1260
Ser Ser Thr Gln Glu Lys Lys Thr Met Pro Val Tyr Asn Arg Gln Val
1265                1270                1275                1280
Leu Thr Lys Lys Gln Arg Asp Gln Ile Asp Leu Leu Ala Lys Leu Asp
                1285                1290                1295
Trp Val Tyr Ala Ser Ile Asp Asn Lys Asp Glu Phe Met Glu Glu Leu
            1300                1305                1310
Ser Thr Gly Thr Leu Gly Leu Ser Tyr Glu Lys Ala Lys Lys Leu Phe
        1315                1320                1325
Pro Gln Tyr Leu Ser Val Asn Tyr Leu His Arg Leu Thr Val Ser Ser
    1330                1335                1340
Arg Pro Cys Glu Phe Pro Ala Ser Ile Pro Ala Tyr Arg Thr Thr Asn
1345                1350                1355                1360
Tyr His Phe Asp Thr Ser Pro Ile Asn His Val Leu Thr Glu Lys Tyr
                1365                1370                1375
Gly Asp Glu Asp Ile Asp Ile Val Phe Gln Asn Cys Ile Ser Phe Gly
            1380                1385                1390
Leu Ser Leu Met Ser Val Val Glu Gln Phe Thr Asn Ile Cys Pro Asn
        1395                1400                1405
Arg Ile Ile Leu Ile Pro Lys Leu Asn Glu Ile His Leu Met Lys Pro
    1410                1415                1420
Pro Ile Phe Thr Gly Asp Val Asp Ile Ile Lys Leu Lys Gln Val Ile
1425                1430                1435                1440
Gln Lys Gln His Met Phe Leu Pro Asp Lys Ile Ser Leu Thr Gln Tyr
                1445                1450                1455
Val Glu Leu Phe Leu Ser Asn Lys Ala Leu Lys Ser Gly Ser His Ile
            1460                1465                1470
Asn Ser Asn Leu Ile Leu Val His Lys Met Ser Asp Tyr Phe His Asn
        1475                1480                1485
Ala Tyr Ile Leu Ser Thr Asn Leu Ala Gly His Trp Ile Leu Ile Ile
    1490                1495                1500
Gln Leu Met Lys Asp Ser Lys Gly Ile Phe Glu Lys Asp Trp Gly Glu
1505                1510                1515                1520
Gly Tyr Ile Thr Asp His Met Phe Ile Asn Leu Asn Val Phe Phe Asn
                1525                1530                1535
Ala Tyr Lys Thr Tyr Leu Leu Cys Phe His Lys Gly Tyr Gly Lys Ala
            1540                1545                1550
Lys Leu Glu Cys Asp Met Asn Thr Ser Asp Leu Leu Cys Val Leu Glu
        1555                1560                1565
Leu Ile Asp Ser Ser Tyr Trp Lys Ser Met Ser Lys Val Phe Leu Glu
    1570                1575                1580
Gln Lys Val Ile Lys Tyr Ile Ile Asn Gln Asp Thr Ser Leu His Arg
1585                1590                1595                1600
Ile Lys Gly Cys His Ser Phe Lys Leu Trp Phe Leu Lys Arg Leu Asn
                1605                1610                1615
Asn Ala Lys Phe Thr Val Cys Pro Trp Val Val Asn Ile Asp Tyr His
            1620                1625                1630
Pro Thr His Met Lys Ala Ile Leu Ser Tyr Ile Asp Leu Val Arg Met
        1635                1640                1645
Gly Leu Ile Asn Val Asp Lys Leu Thr Ile Lys Asn Lys Asn Lys Phe
    1650                1655                1660
Asn Asp Glu Phe Tyr Thr Ser Asn Leu Phe Tyr Ile Ser Tyr Asn Phe
1665                1670                1675                1680
Ser Asp Asn Thr His Leu Leu Thr Lys Gln Ile Arg Ile Ala Asn Ser
                1685                1690                1695
Glu Leu Glu Asn Asn Tyr Asn Lys Leu Tyr His Pro Thr Pro Glu Thr
            1700                1705                1710
Leu Glu Asn Met Ser Leu Ile Pro Val Lys Ser Asn Asn Ser Asn Lys
        1715                1720                1725
Pro Lys Phe Gly Ile Ser Gly Asn Thr Glu Ser Met Met Thr Ser Thr
    1730                1735                1740
Phe Ser Asn Lys Thr His Ile Lys Ser Ser Ala Val Ile Thr Arg Phe
1745                1750                1755                1760
Asn Tyr Ser Lys Gln Asp Leu Tyr Asn Leu Phe Pro Ile Val Val Ile
                1765                1770                1775
Asp Arg Ile Ile Asp His Ser Gly Asn Thr Ala Lys Ser Asn Gln Leu
            1780                1785                1790
Tyr Thr Thr Thr Ser His Gln Thr Ser Leu Val Arg Asn Ser Ala Ser
        1795                1800                1805
Leu Tyr Cys Met Leu Pro Trp His His Val Asn Arg Phe Asn Phe Val
    1810                1815                1820
Phe Ser Ser Thr Gly Cys Lys Ile Ser Ile Glu Tyr Ile Leu Lys Asp
1825                1830                1835                1840
Leu Lys Ile Lys Asp Pro Ser Cys Ile Ala Phe Ile Gly Glu Gly Ala
                1845                1850                1855
Gly Asn Leu Leu Leu Arg Thr Val Val Glu Leu His Pro Asp Ile Arg
            1860                1865                1870
Tyr Ile Tyr Arg Ser Leu Lys Asp Cys Asn Asp His Ser Leu Pro Ile
        1875                1880                1885
Glu Phe Leu Arg Leu Tyr Asn Gly His Ile Asn Ile Asp Tyr Gly Glu
    1890                1895                1900
Asn Leu Thr Ile Pro Ala Thr Asp Ala Thr Asn Asn Ile His Trp Ser
1905                1910                1915                1920
Tyr Leu His Ile Lys Phe Ala Glu Pro Ile Ser Ile Phe Val Cys Asp
                1925                1930                1935
Ala Glu Leu Pro Val Thr Ala Asn Trp Ser Lys Ile Ile Ile Glu Trp
            1940                1945                1950
Ser Lys His Val Arg Lys Cys Lys Tyr Cys Ser Ser Val Asn Arg Cys
        1955                1960                1965
Ile Leu Ile Ala Lys Tyr His Ala Gln Asp Asp Ile Asp Phe Lys Leu
    1970                1975                1980
Asp Asn Ile Thr Ile Leu Lys Thr Tyr Val Cys Leu Gly Ser Lys Leu
1985                1990                1995                2000
Lys Gly Ser Glu Val Tyr Leu Val Leu Thr Ile Gly Pro Ala Asn Ile
                2005                2010                2015
Leu Pro Val Phe Asn Val Val Gln Asn Ala Lys Leu Ile Leu Ser Arg
            2020                2025                2030
Thr Lys Asn Phe Ile Met Pro Lys Lys Thr Asp Lys Glu Ser Ile Asp
        2035                2040                2045
Ala Asn Ile Lys Ser Leu Ile Pro Phe Leu Cys Tyr Pro Ile Thr Lys
    2050                2055                2060
Lys Gly Ile Lys Thr Ser Leu Ser Lys Leu Lys Ser Val Val Ser Gly
2065                2070                2075                2080
Asp Ile Leu Ser Tyr Ser Ile Ala Gly Arg Asn Glu Val Phe Ser Asn
                2085                2090                2095
Lys Leu Ile Asn His Lys His Met Asn Ile Leu Lys Trp Leu Asp His
            2100                2105                2110
Val Leu Asn Phe Arg Ser Ala Glu Leu Asn Tyr Asn His Leu Tyr Met
        2115                2120                2125
Ile Glu Ser Thr Tyr Pro Tyr Leu Ser Glu Leu Leu Asn Ser Leu Thr
    2130                2135                2140
Thr Asn Glu Leu Lys Lys Leu Ile Lys Ile Thr Gly Ser Val Leu Tyr
2145                2150                2155                2160
Asn Leu Pro Asn Glu Gln
                2165
(2) Information for SEQ ID NO:27:
(i) sequence characteristics:
(A) length: 15219 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:27:
ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60
TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120
ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180
ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240
ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300
GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360
TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420
AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480
CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540
TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600
AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660
GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720
AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780
TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840
AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900
CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960
AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020
CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080
CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140
TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200
GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260
AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320
TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380
AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440
GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500
CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560
AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620
TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680
CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740
TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800
ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860
GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920
GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980
TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040
ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100
CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160
CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220
GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280
AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340
AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400
CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460
TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520
TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580
CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640
TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700
CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760
TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820
ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880
AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940
TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000
AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060
TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120
TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180
CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240
AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300
ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360
AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420
AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480
GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540
TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600
AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660
CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720
GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780
TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840
AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900
CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960
GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020
ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080
ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140
CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCCGCTA GACCTAGAGT GCGAATAGGC 4200
AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260
CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320
ACAATAGAAC TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380
CTAATCTTTT TACTAATTAT AATCACTATC ATGATTGCAA CACTAAATAA GCTAAGTGAA 4440
CACAAAGCAT TCTGCAACAA AACTCTTGAA CTAGGACAGA TGTACCAAAT CAACACACAG 4500
AGTTCCACCA TTATGCTGTG TCAAACCATA ATCCTGTATA TACAAACAAA CAAATCCAAT 4560
CCTCTCACAG AGTCACGGTG TCGCAAAACC ACGCTAACCA TCATGGTAGC ATAGAGTAGT 4620
TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680
AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740
GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800
ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860
GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920
ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980
AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040
ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100
ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160
AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220
CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280
ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340
GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400
ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460
ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520
ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580
TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640
AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700
TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760
ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820
TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880
TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940
GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000
TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060
ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120
ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180
AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240
TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300
CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360
CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420
CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480
GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540
AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600
GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660
AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720
GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780
ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840
AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900
GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960
ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020
CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080
AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140
AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200
TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260
CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320
CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380
ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440
CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500
TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560
ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620
AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680
AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740
AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800
GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860
ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920
ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980
AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040
CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100
ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160
CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220
CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280
TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340
TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400
TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460
TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520
GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580
GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640
AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700
ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760
TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820
AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880
ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940
GATGAAAACT CAGTACTTAC AACCATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000
AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060
ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120
TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180
AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240
AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300
AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360
TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420
AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480
GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540
ACAGAAGAAG ATCAATTTAA GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600
GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660
ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720
TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780
AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840
TGTAATGAAA CTAGGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900
TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960
ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020
GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080
CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140
GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200
GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260
TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320
CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380
AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440
ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500
CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560
AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620
GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680
CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740
AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800
AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860
ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920
CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980
CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040
GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100
CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160
GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220
TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280
ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340
TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400
AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460
GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520
GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580
AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640
GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700
GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760
ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820
CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880
GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940
ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000
GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060
GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120
AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180
ACAATGAACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240
AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300
ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360
GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420
GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480
TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540
TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600
CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660
AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720
CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780
TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840
CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900
AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960
CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020
ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080
ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140
AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200
TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260
GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320
TTTAAGTTGT GGTTTTTAAA ACGCCTTAAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380
GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440
AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500
GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560
CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620
CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680
AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740
AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800
TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860
GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920
GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980
TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040
AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100
CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160
CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220
ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280
GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340
AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400
AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460
ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520
TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580
AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640
ATCGATGCAA ATATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700
ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760
ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820
CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880
TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940
GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000
TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060
TAGTTATTAA AGAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120
TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180
ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219
(2) Information for SEQ ID NO:28:
...
(i) sequence characteristics:
(A) length: 2166 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:28:
Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp
1               5                   10                  15
Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
            20                  25                  30
Ser Tyr Leu Phe Asn Gly Pro Tyr Leu Lys Asn Asp Tyr Thr Asn Leu
        35                  40                  45
Ile Ser Arg Gln Ser Pro Leu Leu Glu His Met Asn Leu Lys Lys Leu
    50                  55                  60
Thr Ile Thr Gln Ser Leu Ile Ser Arg Tyr His Lys Gly Glu Leu Lys
65                  70                  75                  80
Leu Glu Glu Pro Thr Tyr Phe Gln Ser Leu Leu Met Thr Tyr Lys Ser
                85                  90                  95
Met Ser Ser Ser Glu Gln Ile Ala Thr Thr Asn Leu Leu Lys Lys Ile
            100                 105                 110
Ile Arg Arg Ala Ile Glu Ile Ser Asp Val Lys Val Tyr Ala Ile Leu
        115                 120                 125
Asn Lys Leu Gly Leu Lys Glu Lys Asp Arg Val Lys Pro Asn Asn Asn
    130                 135                 140
Ser Gly Asp Glu Asn Ser Val Leu Thr Thr Ile Ile Lys Asp Asp Ile
145                 150                 155                 160
Leu Ser Ala Val Glu Asn Asn Gln Ser Tyr Thr Asn Ser Asp Lys Ser
                165                 170                 175
His Ser Val Asn Gln Asn Ile Thr Ile Lys Thr Thr Leu Leu Lys Lys
            180                 185                 190
Leu Met Cys Ser Met Gln His Pro Pro Ser Trp Leu Ile His Trp Phe
        195                 200                 205
Asn Leu Tyr Thr Lys Leu Asn Asn Ile Leu Thr Gln Tyr Arg Ser Asn
    210                 215                 220
Glu Val Lys Ser His Gly Phe Ile Leu Ile Asp Asn Gln Thr Leu Ser
225                 230                 235                 240
Gly Phe Gln Phe Ile Leu Asn Gln Tyr Gly Cys Ile Val Tyr His Lys
                245                 250                 255
Gly Leu Lys Lys Ile Thr Thr Thr Thr Tyr Asn Gln Phe Leu Thr Trp
            260                 265                 270
Lys Asp Ile Ser Leu Ser Arg Leu Asn Val Cys Leu Ile Thr Trp Ile
        275                 280                 285
Ser Asn Cys Leu Asn Thr Leu Asn Lys Ser Leu Gly Leu Arg Cys Gly
    290                 295                 300
Phe Asn Asn Val Val Leu Ser Gln Leu Phe Leu Tyr Gly Asp Cys Ile
305                 310                 315                 320
Leu Lys Leu Phe His Asn Glu Gly Phe Tyr Ile Ile Lys Glu Val Glu
                325                 330                 335
Gly Phe Ile Met Ser Leu Ile Leu Asn Ile Thr Glu Glu Asp Gln Phe
            340                 345                 350
Lys Lys Arg Phe Tyr Asn Ser Met Leu Asn Asn Ile Thr Asp Ala Ala
        355                 360                 365
Ile Lys Ala Gln Lys Asp Leu Leu Ser Arg Val Cys His Thr Leu Leu
    370                 375                 380
Asp Lys Thr Val Ser Asp Asn Ile Ile Asn Gly Lys Trp Ile Ile Leu
385                 390                 395                 400
Leu Ser Lys Phe Leu Lys Leu Ile Lys Leu Ala Gly Asp Asn Asn Leu
                405                 410                 415
Asn Asn Leu Ser Glu Leu Tyr Phe Leu Phe Arg Ile Phe Gly His Pro
            420                 425                 430
Met Val Asp Glu Arg Gln Ala Met Asp Ser Val Arg Ile Asn Cys Asn
        435                 440                 445
Glu Thr Arg Phe Tyr Leu Leu Ser Ser Leu Ser Thr Leu Arg Gly Ala
    450                 455                 460
Phe Ile Tyr Arg Ile Ile Lys Gly Phe Val Asn Thr Tyr Asn Arg Trp
465                 470                 475                 480
Pro Thr Leu Arg Asn Ala Ile Val Leu Pro Leu Arg Trp Leu Asn Tyr
                485                 490                 495
Tyr Lys Leu Asn Thr Tyr Pro Ser Leu Leu Glu Ile Thr Glu Asn Asp
            500                 505                 510
Leu Ile Ile Leu Ser Gly Leu Arg Phe Tyr Arg Glu Phe His Leu Pro
        515                 520                 525
Lys Lys Val Asp Leu Glu Met Ile Ile Asn Asp Lys Ala Ile Ser Pro
    530                 535                 540
Pro Lys Asp Leu Ile Trp Thr Ser Phe Pro Arg Asn Tyr Met Pro Ser
545                 550                 555                 560
His Ile Gln Asn Tyr Ile Glu His Glu Lys Leu Lys Phe Ser Glu Ser
                565                 570                 575
Asp Arg Ser Arg Arg Val Leu Glu Tyr Tyr Leu Arg Asp Asn Lys Phe
            580                 585                 590
Asn Glu Cys Asp Leu Tyr Asn Cys Val Val Asn Gln Ser Tyr Leu Asn
        595                 600                 605
Asn Ser Asn His Val Val Ser Leu Thr Gly Lys Glu Arg Glu Leu Ser
    610                 615                 620
Val Gly Arg Met Phe Ala Met Gln Pro Gly Met Phe Arg Gln Ile Gln
625                 630                 635                 640
Ile Leu Ala Glu Lys Met Ile Ala Glu Asn Ile Leu Gln Phe Phe Pro
                645                 650                 655
Glu Ser Leu Thr Arg Tyr Gly Asp Leu Glu Leu Gln Lys Ile Leu Glu
            660                 665                 670
Leu Lys Ala Gly Ile Ser Asn Lys Ser Asn Arg Tyr Asn Asp Asn Tyr
        675                 680                 685
Asn Asn Tyr Ile Ser Lys Cys Ser Ile Ile Thr Asp Leu Ser Lys Phe
    690                 695                 700
Asn Gln Ala Phe Arg Tyr Glu Thr Ser Cys Ile Cys Ser Asp Val Leu
705                 710                 715                 720
Asp Glu Leu His Gly Val Gln Ser Leu Phe Ser Trp Leu His Leu Thr
                725                 730                 735
Ile Pro Leu Val Thr Ile Ile Cys Thr Tyr Arg His Ala Pro Pro Phe
            740                 745                 750
Ile Lys Asp His Val Val Asn Leu Asn Glu Val Asp Glu Gln Ser Gly
        755                 760                 765
Leu Tyr Arg Tyr His Met Gly Gly Ile Glu Gly Trp Cys Gln Lys Leu
    770                 775                 780
Trp Thr Ile Glu Ala Ile Ser Leu Leu Asp Leu Ile Ser Leu Lys Gly
785                 790                 795                 800
Lys Phe Ser Ile Thr Ala Leu Ile Asn Gly Asp Asn Gln Ser Ile Asp
                805                 810                 815
Ile Ser Lys Pro Val Arg Leu Ile Glu Gly Gln Thr His Ala Gln Ala
            820                 825                 830
Asp Tyr Leu Leu Ala Leu Asn Ser Leu Lys Leu Leu Tyr Lys Glu Tyr
        835                 840                 845
Ala Gly Ile Gly His Lys Leu Lys Gly Thr Glu Thr Tyr Ile Ser Arg
    850                 855                 860
Asp Met Gln Phe Met Ser Lys Thr Ile Gln His Asn Gly Val Tyr Tyr
865                 870                 875                 880
Pro Ala Ser Ile Lys Lys Val Leu Arg Val Gly Pro Trp Ile Asn Thr
                885                 890                 895
Ile Leu Asp Asp Phe Lys Val Ser Leu Glu Ser Ile Gly Ser Leu Thr
            900                 905                 910
Gln Glu Leu Glu Tyr Arg Gly Glu Ser Leu Leu Cys Ser Leu Ile Phe
        915                 920                 925
Arg Asn Ile Trp Leu Tyr Asn Gln Ile Ala Leu Gln Leu Arg Asn His
    930                 935                 940
Ala Leu Cys Asn Asn Lys Leu Tyr Leu Asp Ile Leu Lys Val Leu Lys
945                 950                 955                 960
His Leu Lys Thr Phe Phe Asn Leu Asp Ser Ile Asp Met Ala Leu Ser
                965                 970                 975
Leu Tyr Met Asn Leu Pro Met Leu Phe Gly Gly Gly Asp Pro Asn Leu
            980                 985                 990
Leu Tyr Arg Ser Phe Tyr Arg Arg Thr Pro Asp Phe Leu Thr Glu Ala
        995                 1000                1005
Ile Val His Ser Val Phe Val Leu Ser Tyr Tyr Thr Gly His Asp Leu
    1010                1015                1020
Gln Asp Lys Leu Gln Asp Leu Pro Asp Asp Arg Leu Asn Lys Phe Leu
1025                1030                1035                1040
Thr Cys Val Ile Thr Phe Asp Lys Asn Pro Asn Ala Glu Phe Val Thr
                1045                1050                1055
Leu Met Arg Asp Pro Gln Ala Leu Gly Ser Glu Arg Gln Ala Lys Ile
            1060                1065                1070
Thr Ser Glu Ile Asn Arg Leu Ala Val Thr Glu Val Leu Ser Ile Ala
        1075                1080                1085
Pro Asn Lys Ile Phe Ser Lys Ser Ala Gln His Tyr Thr Thr Thr Glu
    1090                1095                1100
Ile Asp Leu Asn Asp Ile Met Gln Asn Ile Glu Pro Thr Tyr Pro His
1105                1110                1115                1120
Gly Leu Arg Val Val Tyr Glu Ser Leu Pro Phe Tyr Lys Ala Glu Lys
                1125                1130                1135
Ile Val Asn Leu Ile Ser Gly Thr Lys Ser Ile Thr Asn Ile Leu Glu
            1140                1145                1150
Lys Thr Ser Ala Ile Asp Thr Thr Asp Ile Asn Arg Ala Thr Asp Met
        1155                1160                1165
Met Arg Lys Asn Ile Thr Leu Leu Ile Arg Ile Leu Pro Leu Asp Cys
    1170                1175                1180
Asn Lys Asp Lys Arg Glu Leu Leu Ser Leu Glu Asn Leu Ser Ile Thr
1185                1190                1195                1200
Glu Leu Ser Lys Tyr Val Arg Glu Arg Ser Trp Ser Leu Ser Asn Ile
                1205                1210                1215
Val Gly Val Thr Ser Pro Ser Ile Met Phe Thr Met Asn Ile Lys Tyr
            1220                1225                1230
Thr Thr Ser Thr Ile Ala Ser Gly Ile Ile Ile Glu Lys Tyr Asn Val
        1235                1240                1245
Asn Ser Leu Thr Arg Gly Glu Arg Gly Pro Thr Lys Pro Trp Val Gly
    1250                1255                1260
Ser Ser Thr Gln Glu Lys Lys Thr Met Pro Val Tyr Asn Arg Gln Val
1265                1270                1275                1280
Leu Thr Lys Lys Gln Arg Asp Gln Ile Asp Leu Leu Ala Lys Leu Asp
                1285                1290                1295
Trp Val Tyr Ala Ser Ile Asp Asn Lys Asp Glu Phe Met Glu Glu Leu
            1300                1305                1310
Ser Thr Gly Thr Leu Gly Leu Ser Tyr Glu Lys Ala Lys Lys Leu Phe
        1315                1320                1325
Pro Gln Tyr Leu Ser Val Asn Tyr Leu His Arg Leu Thr Val Ser Ser
    1330                1335                1340
Arg Pro Cys Glu Phe Pro Ala Ser Ile Pro Ala Tyr Arg Thr Thr Asn
1345                1350                1355                1360
Tyr His Phe Asp Thr Ser Pro Ile Asn His Val Leu Thr Glu Lys Tyr
                1365                1370                1375
Gly Asp Glu Asp Ile Asp Ile Val Phe Gln Asn Cys Ile Ser Phe Gly
            1380                1385                1390
Leu Ser Leu Met Ser Val Val Glu Gln Phe Thr Asn Ile Cys Pro Asn
        1395                1400                1405
Arg Ile Ile Leu Ile Pro Lys Leu Asn Glu Ile His Leu Met Lys Pro
    1410                1415                1420
Pro Ile Phe Thr Gly Asp Val Asp Ile Ile Lys Leu Lys Gln Val Ile
1425                1430                1435                1440
Gln Lys Gln His Met Phe Leu Pro Asp Lys Ile Ser Leu Thr Gln Tyr
                1445                1450                1455
Val Glu Leu Phe Leu Ser Asn Lys Ala Leu Lys Ser Gly Ser His Ile
            1460                1465                1470
Asn Ser Asn Leu Ile Leu Val His Lys Met Ser Asp Tyr Phe His Asn
        1475                1480                1485
Ala Tyr Ile Leu Ser Thr Asn Leu Ala Gly His Trp Ile Leu Ile Ile
    1490                1495                1500
Gln Leu Met Lys Asp Ser Lys Gly Ile Phe Glu Lys Asp Trp Gly Glu
1505                1510                1515                1520
Gly Tyr Ile Thr Asp His Met Phe Ile Asn Leu Asn Val Phe Phe Asn
                1525                1530                1535
Ala Tyr Lys Thr Tyr Leu Leu Cys Phe His Lys Gly Tyr Gly Lys Ala
            1540                1545                1550
Lys Leu Glu Cys Asp Met Asn Thr Ser Asp Leu Leu Cys Val Leu Glu
        1555                1560                1565
Leu Ile Asp Ser Ser Tyr Trp Lys Ser Met Ser Lys Val Phe Leu Glu
    1570                1575                1580
Gln Lys Val Ile Lys Tyr Ile Val Asn Gln Asp Thr Ser Leu Arg Arg
1585                1590                1595                1600
Ile Lys Gly Cys His Ser Phe Lys Leu Trp Phe Leu Lys Arg Leu Asn
                1605                1610                1615
Asn Ala Lys Phe Thr Val Cys Pro Trp Val Val Asn Ile Asp Tyr His
            1620                1625                1630
Pro Thr His Met Lys Ala Ile Leu Ser Tyr Ile Asp Leu Val Arg Met
        1635                1640                1645
Gly Leu Ile Asn Val Asp Lys Leu Thr Ile Lys Asn Lys Asn Lys Phe
    1650                1655                1660
Asn Asp Glu Phe Tyr Thr Ser Asn Leu Phe Tyr Ile Ser Tyr Asn Phe
1665                1670                1675                1680
Ser Asp Asn Thr His Leu Leu Thr Lys Gln Ile Arg Ile Ala Asn Ser
                1685                1690                1695
Glu Leu Glu Asp Asn Tyr Asn Lys Leu Tyr His Pro Thr Pro Glu Thr
            1700                1705                1710
Leu Glu Asn Met Ser Leu Ile Pro Val Lys Ser Asn Asn Ser Asn Lys
        1715                1720                1725
Pro Lys Phe Cys Ile Ser Gly Asn Thr Glu Ser Met Met Met Ser Thr
    1730                1735                1740
Phe Ser Ser Lys Met His Ile Lys Ser Ser Thr Val Thr Thr Arg Phe
1745                1750                1755                1760
Asn Tyr Ser Lys Gln Asp Leu Tyr Asn Leu Phe Pro Ile Val Val Ile
                1765                1770                1775
Asp Lys Ile Ile Asp His Ser Gly Asn Thr Ala Lys Ser Asn Gln Leu
            1780                1785                1790
Tyr Thr Thr Thr Ser His Gln Thr Ser Leu Val Arg Asn Ser Ala Ser
        1795                1800                1805
Leu Tyr Cys Met Leu Pro Trp His His Val Asn Arg Phe Asn Phe Val
    1810                1815                1820
Phe Ser Ser Thr Gly Cys Lys Ile Ser Ile Glu Tyr Ile Leu Lys Asp
1825                1830                1835                1840
Leu Lys Ile Lys Asp Pro Ser Cys Ile Ala Phe Ile Gly Glu Gly Ala
                1845                1850                1855
Gly Asn Leu Leu Leu Arg Thr Val Val Glu Leu His Pro Asp Ile Arg
            1860                1865                1870
Tyr Ile Tyr Arg Ser Leu Lys Asp Cys Asn Asp His Ser Leu Pro Ile
        1875                1880                1885
Glu Phe Leu Arg Leu Tyr Asn Gly His Ile Asn Ile Asp Tyr Gly Glu
    1890                1895                1900
Asn Leu Thr Ile Pro Ala Thr Asp Ala Thr Asn Asn Ile His Trp Ser
1905                1910                1915                1920
Tyr Leu His Ile Lys Phe Ala Glu Pro Ile Ser Ile Phe Val Cys Asp
                1925                1930                1935
Ala Glu Leu Pro Val Thr Ala Asn Trp Ser Lys Ile Ile Ile Glu Trp
            1940                1945                1950
Ser Lys His Val Arg Lys Cys Lys Tyr Cys Ser Ser Val Asn Arg Cys
        1955                1960                1965
Ile Leu Ile Ala Lys Tyr His Ala Gln Asp Asp Ile Asp Phe Lys Leu
    1970                1975                1980
Asp Asn Ile Thr Ile Leu Lys Thr Tyr Val Cys Leu Gly Ser Lys Leu
1985                1990                1995                2000
Lys Gly Ser Glu Val Tyr Leu Ile Leu Thr Ile Gly Pro Ala Asn Ile
                2005                2010                2015
Leu Pro Val Phe Asp Val Val Gln Asn Ala Lys Leu Ile Leu Ser Arg
            2020                2025                2030
Thr Lys Asn Phe Ile Met Pro Lys Lys Thr Asp Lys Glu Ser Ile Asp
        2035                2040                2045
Ala Asn Ile Lys Ser Leu Ile Pro Phe Leu Cys Tyr Pro Ile Thr Lys
    2050                2055                2060
Lys Gly Ile Lys Thr Ser Leu Ser Lys Leu Lys Ser Val Val Asn Gly
2065                2070                2075                2080
Asp Ile Leu Ser Tyr Ser Ile Ala Gly Arg Asn Glu Val Phe Ser Asn
                2085                2090                2095
Lys Leu Ile Asn His Lys His Met Asn Ile Leu Lys Trp Leu Asp His
            2100                2105                2110
Val Leu Asn Phe Arg Ser Ala Glu Leu Asn Tyr Asn His Leu Tyr Met
        2115                2120                2125
Ile Glu Ser Thr Tyr Pro Tyr Leu Ser Glu Leu Leu Asn Ser Leu Thr
    2130                2135                2140
Thr Asn Glu Leu Lys Lys Leu Ile Lys Ile Thr Gly Ser Val Leu Tyr
2145                2150                2155                2160
Asn Leu Pro Asn Glu Gln
                2165
(2) Information for SEQ ID NO:29:
(i) sequence characteristics:
(A) length: 15219 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:29:
ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60
TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120
ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180
ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240
ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300
GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360
TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420
AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480
CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540
TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600
AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660
GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720
AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780
TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840
AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900
CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960
AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020
CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080
CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140
TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200
GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260
AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320
TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380
AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440
GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500
CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560
AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620
TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680
CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740
TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800
ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860
GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920
GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980
TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040
ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100
CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160
CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220
GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280
AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340
AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400
CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460
TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520
TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580
CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640
TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700
CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760
TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820
ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880
AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940
TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000
AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060
TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120
TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180
CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240
AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300
ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360
AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420
AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480
GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540
TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600
AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660
CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720
GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780
TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840
AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900
CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960
GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020
ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080
ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140
CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCTGCTA GACCTAGAGT GCGAATAGGT 4200
AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260
CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320
ACAATAGAAT TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380
CTAATCTTTT TACTAATTAT AATCACTATT ATGATTGCAA TACTAAATAA GCTAAGTGAA 4440
CATAAAGCAT TCTGTAACAA AACTCTTGAA CTAGGACAGA TGTATCAAAT CAACACATAG 4500
AGTTCTACCA TTATGCTGTG TCAAATTATA ATCCTGTATA TATAAACAAA CAAATCCAAT 4560
CTTCTCACAG AGTCATGGTG TCGCAAAACC ACGCTAACTA TCATGGTAGC ATAGAGTAGT 4620
TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680
AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740
GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800
ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860
GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920
ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980
AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040
ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100
ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160
AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220
CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280
ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340
GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400
ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460
ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520
ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580
TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640
AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700
TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760
ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820
TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880
TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940
GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000
TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060
ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120
ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180
AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240
TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300
CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360
CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420
CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480
GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540
AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600
GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660
AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720
GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780
ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840
AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900
GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960
ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020
CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080
AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140
AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200
TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260
CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320
CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380
ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440
CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500
TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560
ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620
AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680
AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740
AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800
GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860
ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920
ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980
AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040
CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100
ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160
CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220
CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280
TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340
TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400
TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460
TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520
GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580
GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640
AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700
ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760
TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820
AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880
ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940
GATGAAAACT CAGTACTTAC AACTATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000
AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060
ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120
TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180
AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240
AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300
AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360
TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420
AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480
GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540
ACAGAAGAAG ATCAATTTAG GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600
GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660
ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720
TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780
AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840
TGTAATGAAA CTAAGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900
TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960
ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020
GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080
CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140
GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200
GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260
TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320
CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380
AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440
ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500
CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560
AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620
GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680
CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740
AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800
AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860
ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920
CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980
CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040
GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100
CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160
GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220
TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280
ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340
TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400
AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460
GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520
GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580
AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640
GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700
GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760
ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820
CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880
GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940
ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000
GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060
GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120
AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180
ACAATGGACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240
AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300
ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360
GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420
GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480
TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540
TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600
CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660
AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720
CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780
TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840
CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900
AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960
CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020
ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080
ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140
AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200
TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260
GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320
TTTAAGTTGT GGTTTTTAAA ACGCCTTAAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380
GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440
AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500
GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560
CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620
CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680
AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740
AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800
TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860
GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920
GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980
TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040
AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100
CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160
CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220
ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280
GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340
AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400
AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460
ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520
TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580
AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640
ATCGATGCAG ATATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700
ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760
ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820
CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880
TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940
GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000
TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060
TAGTTATTAA AAAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120
TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180
ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219
(2) INFORMATION FOR SEQ ID NO: 30:
(i) SEQUENCE CHARACTERISTICS:
...
(A) LENGTH: 2166 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp
1               5                   10                  15
Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
            20                  25                  30
Ser Tyr Leu Phe Asn Gly Pro Tyr Leu Lys Asn Asp Tyr Thr Asn Leu
        35                  40                  45
Ile Ser Arg Gln Ser Pro Leu Leu Glu His Met Asn Leu Lys Lys Leu
    50                  55                  60
Thr Ile Thr Gln Ser Leu Ile Ser Arg Tyr His Lys Gly Glu Leu Lys
65                  70                  75                  80
Leu Glu Glu Pro Thr Tyr Phe Gln Ser Leu Leu Met Thr Tyr Lys Ser
                85                  90                  95
Met Ser Ser Ser Glu Gln Ile Ala Thr Thr Asn Leu Leu Lys Lys Ile
            100                 105                 110
Ile Arg Arg Ala Ile Glu Ile Ser Asp Val Lys Val Tyr Ala Ile Leu
        115                 120                 125
Asn Lys Leu Gly Leu Lys Glu Lys Asp Arg Val Lys Pro Asn Asn Asn
    130                 135                 140
Ser Gly Asp Glu Asn Ser Val Leu Thr Thr Ile Ile Lys Asp Asp Ile
145                 150                 155                 160
Leu Ser Ala Val Glu Asn Asn Gln Ser Tyr Thr Asn Ser Asp Lys Ser
                165                 170                 175
His Ser Val Asn Gln Asn Ile Thr Ile Lys Thr Thr Leu Leu Lys Lys
            180                 185                 190
Leu Met Cys Ser Met Gln His Pro Pro Ser Trp Leu Ile His Trp Phe
        195                 200                 205
Asn Leu Tyr Thr Lys Leu Asn Asn Ile Leu Thr Gln Tyr Arg Ser Asn
    210                 215                 220
Glu Val Lys Ser His Gly Phe Ile Leu Ile Asp Asn Gln Thr Leu Ser
225                 230                 235                 240
Gly Phe Gln Phe Ile Leu Asn Gln Tyr Gly Cys Ile Val Tyr His Lys
                245                 250                 255
Gly Leu Lys Lys Ile Thr Thr Thr Thr Tyr Asn Gln Phe Leu Thr Trp
            260                 265                 270
Lys Asp Ile Ser Leu Ser Arg Leu Asn Val Cys Leu Ile Thr Trp Ile
        275                 280                 285
Ser Asn Cys Leu Asn Thr Leu Asn Lys Ser Leu Gly Leu Arg Cys Gly
    290                 295                 300
Phe Asn Asn Val Val Leu Ser Gln Leu Phe Leu Tyr Gly Asp Cys Ile
305                 310                 315                 320
Leu Lys Leu Phe His Asn Glu Gly Phe Tyr Ile Ile Lys Glu Val Glu
                325                 330                 335
Gly Phe Ile Met Ser Leu Ile Leu Asn Ile Thr Glu Glu Asp Gln Phe
            340                 345                 350
Arg Lys Arg Phe Tyr Asn Ser Met Leu Asn Asn Ile Thr Asp Ala Ala
        355                 360                 365
Ile Lys Ala Gln Lys Asp Leu Leu Ser Arg Val Cys His Thr Leu Leu
    370                 375                 380
Asp Lys Thr Val Ser Asp Asn Ile Ile Asn Gly Lys Trp Ile Ile Leu
385                 390                 395                 400
Leu Ser Lys Phe Leu Lys Leu Ile Lys Leu Ala Gly Asp Asn Asn Leu
                405                 410                 415
Asn Asn Leu Ser Glu Leu Tyr Phe Leu Phe Arg Ile Phe Gly His Pro
            420                 425                 430
Met Val Asp Glu Arg Gln Ala Met Asp Ser Val Arg Ile Asn Cys Asn
        435                 440                 445
Glu Thr Lys Phe Tyr Leu Leu Ser Ser Leu Ser Thr Leu Arg Gly Ala
    450                 455                 460
Phe Ile Tyr Arg Ile Ile Lys Gly Phe Val Asn Thr Tyr Asn Arg Trp
465                 470                 475                 480
Pro Thr Leu Arg Asn Ala Ile Val Leu Pro Leu Arg Trp Leu Asn Tyr
                485                 490                 495
Tyr Lys Leu Asn Thr Tyr Pro Ser Leu Leu Glu Ile Thr Glu Asn Asp
            500                 505                 510
Leu Ile Ile Leu Ser Gly Leu Arg Phe Tyr Arg Glu Phe His Leu Pro
        515                 520                 525
Lys Lys Val Asp Leu Glu Met Ile Ile Asn Asp Lys Ala Ile Ser Pro
    530                 535                 540
Pro Lys Asp Leu Ile Trp Thr Ser Phe Pro Arg Asn Tyr Met Pro Ser
545                 550                 555                 560
His Ile Gln Asn Tyr Ile Glu His Glu Lys Leu Lys Phe Ser Glu Ser
                565                 570                 575
Asp Arg Ser Arg Arg Val Leu Glu Tyr Tyr Leu Arg Asp Asn Lys Phe
            580                 585                 590
Asn Glu Cys Asp Leu Tyr Asn Cys Val Val Asn Gln Ser Tyr Leu Asn
        595                 600                 605
Asn Ser Asn His Val Val Ser Leu Thr Gly Lys Glu Arg Glu Leu Ser
    610                 615                 620
Val Gly Arg Met Phe Ala Met Gln Pro Gly Met Phe Arg Gln Ile Gln
625                 630                 635                 640
Ile Leu Ala Glu Lys Met Ile Ala Glu Asn Ile Leu Gln Phe Phe Pro
                645                 650                 655
Glu Ser Leu Thr Arg Tyr Gly Asp Leu Glu Leu Gln Lys Ile Leu Glu
            660                 665                 670
Leu Lys Ala Gly Ile Ser Asn Lys Ser Asn Arg Tyr Asn Asp Asn Tyr
        675                 680                 685
Asn Asn Tyr Ile Ser Lys Cys Ser Ile Ile Thr Asp Leu Ser Lys Phe
    690                 695                 700
Asn Gln Ala Phe Arg Tyr Glu Thr Ser Cys Ile Cys Ser Asp Val Leu
705                 710                 715                 720
Asp Glu Leu His Gly Val Gln Ser Leu Phe Ser Trp Leu His Leu Thr
                725                 730                 735
Ile Pro Leu Val Thr Ile Ile Cys Thr Tyr Arg His Ala Pro Pro Phe
            740                 745                 750
Ile Lys Asp His Val Val Asn Leu Asn Glu Val Asp Glu Gln Ser Gly
        755                 760                 765
Leu Tyr Arg Tyr His Met Gly Gly Ile Glu Gly Trp Cys Gln Lys Leu
    770                 775                 780
Trp Thr Ile Glu Ala Ile Ser Leu Leu Asp Leu Ile Ser Leu Lys Gly
785                 790                 795                 800
Lys Phe Ser Ile Thr Ala Leu Ile Asn Gly Asp Asn Gln Ser Ile Asp
                805                 810                 815
Ile Ser Lys Pro Val Arg Leu Ile Glu Gly Gln Thr His Ala Gln Ala
            820                 825                 830
Asp Tyr Leu Leu Ala Leu Asn Ser Leu Lys Leu Leu Tyr Lys Glu Tyr
        835                 840                 845
Ala Gly Ile Gly His Lys Leu Lys Gly Thr Glu Thr Tyr Ile Ser Arg
    850                 855                 860
Asp Met Gln Phe Met Ser Lys Thr Ile Gln His Asn Gly Val Tyr Tyr
865                 870                 875                 880
Pro Ala Ser Ile Lys Lys Val Leu Arg Val Gly Pro Trp Ile Asn Thr
                885                 890                 895
Ile Leu Asp Asp Phe Lys Val Ser Leu Glu Ser Ile Gly Ser Leu Thr
            900                 905                 910
Gln Glu Leu Glu Tyr Arg Gly Glu Ser Leu Leu Cys Ser Leu Ile Phe
        915                 920                 925
Arg Asn Ile Trp Leu Tyr Asn Gln Ile Ala Leu Gln Leu Arg Asn His
    930                 935                 940
Ala Leu Cys Asn Asn Lys Leu Tyr Leu Asp Ile Leu Lys Val Leu Lys
945                 950                 955                 960
His Leu Lys Thr Phe Phe Asn Leu Asp Ser Ile Asp Met Ala Leu Ser
                965                 970                 975
Leu Tyr Met Asn Leu Pro Met Leu Phe Gly Gly Gly Asp Pro Asn Leu
            980                 985                 990
Leu Tyr Arg Ser Phe Tyr Arg Arg Thr Pro Asp Phe Leu Thr Glu Ala
        995                 1000                1005
Ile Val His Ser Val Phe Val Leu Ser Tyr Tyr Thr Gly His Asp Leu
    1010                1015                1020
Gln Asp Lys Leu Gln Asp Leu Pro Asp Asp Arg Leu Asn Lys Phe Leu
1025                1030                1035                1040
Thr Cys Val Ile Thr Phe Asp Lys Asn Pro Asn Ala Glu Phe Val Thr
                1045                1050                1055
Leu Met Arg Asp Pro Gln Ala Leu Gly Ser Glu Arg Gln Ala Lys Ile
            1060                1065                1070
Thr Ser Glu Ile Asn Arg Leu Ala Val Thr Glu Val Leu Ser Ile Ala
        1075                1080                1085
Pro Asn Lys Ile Phe Ser Lys Ser Ala Gln His Tyr Thr Thr Thr Glu
    1090                1095                1100
Ile Asp Leu Asn Asp Ile Met Gln Asn Ile Glu Pro Thr Tyr Pro His
1105                1110                1115                1120
Gly Leu Arg Val Val Tyr Glu Ser Leu Pro Phe Tyr Lys Ala Glu Lys
                1125                1130                1135
Ile Val Asn Leu Ile Ser Gly Thr Lys Ser Ile Thr Asn Ile Leu Glu
            1140                1145                1150
Lys Thr Ser Ala Ile Asp Thr Thr Asp Ile Asn Arg Ala Thr Asp Met
        1155                1160                1165
Met Arg Lys Asn Ile Thr Leu Leu Ile Arg Ile Leu Pro Leu Asp Cys
    1170                1175                1180
Asn Lys Asp Lys Arg Glu Leu Leu Ser Leu Glu Asn Leu Ser Ile Thr
1185                1190                1195                1200
Glu Leu Ser Lys Tyr Val Arg Glu Arg Ser Trp Ser Leu Ser Asn Ile
                1205                1210                1215
Val Gly Val Thr Ser Pro Ser Ile Met Phe Thr Met Asp Ile Lys Tyr
            1220                1225                1230
Thr Thr Ser Thr Ile Ala Ser Gly Ile Ile Ile Glu Lys Tyr Asn Val
        1235                1240                1245
Asn Ser Leu Thr Arg Gly Glu Arg Gly Pro Thr Lys Pro Trp Val Gly
    1250                1255                1260
Ser Ser Thr Gln Glu Lys Lys Thr Met Pro Val Tyr Asn Arg Gln Val
1265                1270                1275                1280
Leu Thr Lys Lys Gln Arg Asp Gln Ile Asp Leu Leu Ala Lys Leu Asp
                1285                1290                1295
Trp Val Tyr Ala Ser Ile Asp Asn Lys Asp Glu Phe Met Glu Glu Leu
            1300                1305                1310
Ser Thr Gly Thr Leu Gly Leu Ser Tyr Glu Lys Ala Lys Lys Leu Phe
        1315                1320                1325
Pro Gln Tyr Leu Ser Val Asn Tyr Leu His Arg Leu Thr Val Ser Ser
    1330                1335                1340
Arg Pro Cys Glu Phe Pro Ala Ser Ile Pro Ala Tyr Arg Thr Thr Asn
1345                1350                1355                1360
Tyr His Phe Asp Thr Ser Pro Ile Asn His Val Leu Thr Glu Lys Tyr
                1365                1370                1375
Gly Asp Glu Asp Ile Asp Ile Val Phe Gln Asn Cys Ile Ser Phe Gly
            1380                1385                1390
Leu Ser Leu Met Ser Val Val Glu Gln Phe Thr Asn Ile Cys Pro Asn
        1395                1400                1405
Arg Ile Ile Leu Ile Pro Lys Leu Asn Glu Ile His Leu Met Lys Pro
    1410                1415                1420
Pro Ile Phe Thr Gly Asp Val Asp Ile Ile Lys Leu Lys Gln Val Ile
1425                1430                1435                1440
Gln Lys Gln His Met Phe Leu Pro Asp Lys Ile Ser Leu Thr Gln Tyr
                1445                1450                1455
Val Glu Leu Phe Leu Ser Asn Lys Ala Leu Lys Ser Gly Ser His Ile
            1460                1465                1470
Asn Ser Asn Leu Ile Leu Val His Lys Met Ser Asp Tyr Phe His Asn
        1475                1480                1485
Ala Tyr Ile Leu Ser Thr Asn Leu Ala Gly His Trp Ile Leu Ile Ile
    1490                1495                1500
Gln Leu Met Lys Asp Ser Lys Gly Ile Phe Glu Lys Asp Trp Gly Glu
1505                1510                1515                1520
Gly Tyr Ile Thr Asp His Met Phe Ile Asn Leu Asn Val Phe Phe Asn
                1525                1530                1535
Ala Tyr Lys Thr Tyr Leu Leu Cys Phe His Lys Gly Tyr Gly Lys Ala
            1540                1545                1550
Lys Leu Glu Cys Asp Met Asn Thr Ser Asp Leu Leu Cys Val Leu Glu
        1555                1560                1565
Leu Ile Asp Ser Ser Tyr Trp Lys Ser Met Ser Lys Val Phe Leu Glu
    1570                1575                1580
Gln Lys Val Ile Lys Tyr Ile Val Asn Gln Asp Thr Ser Leu Arg Arg
1585                1590                1595                1600
Ile Lys Gly Cys His Ser Phe Lys Leu Trp Phe Leu Lys Arg Leu Asn
                1605                1610                1615
Asn Ala Lys Phe Thr Val Cys Pro Trp Val Val Asn Ile Asp Tyr His
            1620                1625                1630
Pro Thr His Met Lys Ala Ile Leu Ser Tyr Ile Asp Leu Val Arg Met
        1635                1640                1645
Gly Leu Ile Asn Val Asp Lys Leu Thr Ile Lys Asn Lys Asn Lys Phe
    1650                1655                1660
Asn Asp Glu Phe Tyr Thr Ser Asn Leu Phe Tyr Ile Ser Tyr Asn Phe
1665                1670                1675                1680
Ser Asp Asn Thr His Leu Leu Thr Lys Gln Ile Arg Ile Ala Asn Ser
                1685                1690                1695
Glu Leu Glu Asp Asn Tyr Asn Lys Leu Tyr His Pro Thr Pro Glu Thr
            1700                1705                1710
Leu Glu Asn Met Ser Leu Ile Pro Val Lys Ser Asn Asn Ser Asn Lys
        1715                1720                1725
Pro Lys Phe Cys Ile Ser Gly Asn Thr Glu Ser Met Met Met Ser Thr
    1730                1735                1740
Phe Ser Ser Lys Met His Ile Lys Ser Ser Thr Val Thr Thr Arg Phe
1745                1750                1755                1760
Asn Tyr Ser Lys Gln Asp Leu Tyr Asn Leu Phe Pro Ile Val Val Ile
                1765                1770                1775
Asp Lys Ile Ile Asp His Ser Gly Asn Thr Ala Lys Ser Asn Gln Leu
            1780                1785                1790
Tyr Thr Thr Thr Ser His Gln Thr Ser Leu Val Arg Asn Ser Ala Ser
        1795                1800                1805
Leu Tyr Cys Met Leu Pro Trp His His Val Asn Arg Phe Asn Phe Val
    1810                1815                1820
Phe Ser Ser Thr Gly Cys Lys Ile Ser Ile Glu Tyr Ile Leu Lys Asp
1825                1830                1835                1840
Leu Lys Ile Lys Asp Pro Ser Cys Ile Ala Phe Ile Gly Glu Gly Ala
                1845                1850                1855
Gly Asn Leu Leu Leu Arg Thr Val Val Glu Leu His Pro Asp Ile Arg
            1860                1865                1870
Tyr Ile Tyr Arg Ser Leu Lys Asp Cys Asn Asp His Ser Leu Pro Ile
        1875                1880                1885
Glu Phe Leu Arg Leu Tyr Asn Gly His Ile Asn Ile Asp Tyr Gly Glu
    1890                1895                1900
Asn Leu Thr Ile Pro Ala Thr Asp Ala Thr Asn Asn Ile His Trp Ser
1905                1910                1915                1920
Tyr Leu His Ile Lys Phe Ala Glu Pro Ile Ser Ile Phe Val Cys Asp
                1925                1930                1935
Ala Glu Leu Pro Val Thr Ala Asn Trp Ser Lys Ile Ile Ile Glu Trp
            1940                1945                1950
Ser Lys His Val Arg Lys Cys Lys Tyr Cys Ser Ser Val Asn Arg Cys
        1955                1960                1965
Ile Leu Ile Ala Lys Tyr His Ala Gln Asp Asp Ile Asp Phe Lys Leu
    1970                1975                1980
Asp Asn Ile Thr Ile Leu Lys Thr Tyr Val Cys Leu Gly Ser Lys Leu
1985                1990                1995                2000
Lys Gly Ser Glu Val Tyr Leu Ile Leu Thr Ile Gly Pro Ala Asn Ile
                2005                2010                2015
Leu Pro Val Phe Asp Val Val Gln Asn Ala Lys Leu Ile Leu Ser Arg
            2020                2025                2030
Thr Lys Asn Phe Ile Met Pro Lys Lys Thr Asp Lys Glu Ser Ile Asp
        2035                2040                2045
Ala Asp Ile Lys Ser Leu Ile Pro Phe Leu Cys Tyr Pro Ile Thr Lys
    2050                2055                2060
Lys Gly Ile Lys Thr Ser Leu Ser Lys Leu Lys Ser Val Val Asn Gly
2065                2070                2075                2080
Asp Ile Leu Ser Tyr Ser Ile Ala Gly Arg Asn Glu Val Phe Ser Asn
                2085                2090                2095
Lys Leu Ile Asn His Lys His Met Asn Ile Leu Lys Trp Leu Asp His
            2100                2105                2110
Val Leu Asn Phe Arg Ser Ala Glu Leu Asn Tyr Asn His Leu Tyr Met
        2115                2120                2125
Ile Glu Ser Thr Tyr Pro Tyr Leu Ser Glu Leu Leu Asn Ser Leu Thr
    2130                2135                2140
Thr Asn Glu Leu Lys Lys Leu Ile Lys Ile Thr Gly Ser Val Leu Tyr
2145                2150                2155                2160
Asn Leu Pro Asn Glu Gln
                2165
(2) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15219 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: RNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60
TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120
ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180
ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240
ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300
GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360
TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420
AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480
CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540
TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600
AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660
GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720
AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780
TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840
AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900
CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960
AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020
CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080
CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140
TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200
GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260
AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320
TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380
AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440
GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500
CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560
AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620
TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680
CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740
TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800
ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860
GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920
GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980
TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040
ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100
CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160
CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220
GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280
AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340
AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400
CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460
TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520
TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580
CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640
TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700
CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760
TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820
ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880
AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940
TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000
AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060
TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120
TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180
CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240
AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300
ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360
AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420
AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480
GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540
TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600
AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660
CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720
GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780
TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840
AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900
CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960
GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020
ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080
ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140
CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCCGCTA GACCTAGAGT GCGAATAGGC 4200
AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260
CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320
ACAATAGAAC TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380
CTAATCTTTT TACTAATTAT AATCACTATC ATGATTGCAA CACTAAATAA GCTAAGTGAA 4440
CACAAAGCAT TCTGCAACAA AACTCTTGAA CTAGGACAGA TGTACCAAAT CAACACACAG 4500
AGTTCCACCA TTATGCTGTG TCAAACCATA ATCCTGTATA TACAAACAAA CAAATCCAAT 4560
CCTCTCACAG AGTCACGGTG TCGCAAAACC ACGCTAACCA TCATGGTAGC ATAGAGTAGT 4620
TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680
AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740
GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800
ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860
GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920
ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980
AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040
ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100
ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160
AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220
CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280
ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340
GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400
ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460
ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520
ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580
TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640
AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700
TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760
ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820
TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880
TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940
GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000
TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060
ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120
ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180
AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240
TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300
CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360
CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420
CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480
GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540
AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600
GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660
AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720
GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780
ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840
AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900
GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960
ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020
CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080
AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140
AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200
TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260
CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320
CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380
ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440
CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500
TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560
ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620
AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680
AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740
AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800
GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860
ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920
ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980
AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040
CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100
ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160
CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220
CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280
TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340
TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400
TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460
TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520
GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580
GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640
AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700
ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760
TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820
AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880
ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940
GATGAAAACT CAGTACTTAC AACCATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000
AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060
ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120
TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180
AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240
AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300
AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360
TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420
AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480
GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540
ACAGAAGAAG ATCAATTTAA GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600
GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660
ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720
TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780
AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840
TGTAATGAAA CTAAGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900
TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960
ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020
GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080
CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140
GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200
GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260
TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320
CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380
AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440
ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500
CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560
AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620
GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680
CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740
AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800
AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860
ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920
CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980
CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040
GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100
CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160
GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220
TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280
ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340
TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400
AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460
GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520
GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580
AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640
GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700
GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760
ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820
CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880
GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940
ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000
GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060
GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120
AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180
ACAATGAACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240
AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300
ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360
GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420
GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480
TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540
TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600
CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660
AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720
CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780
TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840
CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900
AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960
CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020
ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080
ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140
AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200
TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260
GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320
TTTAAGTTGT GGTTTTTAAA ACGCCTTAAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380
GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440
AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500
GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560
CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620
CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680
AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740
AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800
TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860
GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920
GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980
TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040
AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100
CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160
CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220
ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280
GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340
AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400
AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460
ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520
TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580
AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640
ATCGATGCAA ATATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700
ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760
ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820
CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880
TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940
GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000
TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060
TAGTTATTAA AGAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120
TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180
ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219
(2) Information for SEQ ID NO: 32:
...
(i) Sequence characteristics:
(A) length: 2166 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:32:
Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp
1               5                   10                  15
Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
            20                  25                  30
Ser Tyr Leu Phe Asn Gly Pro Tyr Leu Lys Asn Asp Tyr Thr Asn Leu
        35                  40                  45
Ile Ser Arg Gln Ser Pro Leu Leu Glu His Met Asn Leu Lys Lys Leu
    50                  55                  60
Thr Ile Thr Gln Ser Leu Ile Ser Arg Tyr His Lys Gly Glu Leu Lys
65                  70                  75                  80
Leu Glu Glu Pro Thr Tyr Phe Gln Ser Leu Leu Met Thr Tyr Lys Ser
                85                  90                  95
Met Ser Ser Ser Glu Gln Ile Ala Thr Thr Asn Leu Leu Lys Lys Ile
            100                 105                 110
Ile Arg Arg Ala Ile Glu Ile Ser Asp Val Lys Val Tyr Ala Ile Leu
        115                 120                 125
Asn Lys Leu Gly Leu Lys Glu Lys Asp Arg Val Lys Pro Asn Asn Asn
    130                 135                 140
Ser Gly Asp Glu Asn Ser Val Leu Thr Thr Ile Ile Lys Asp Asp Ile
145                 150                 155                 160
Leu Ser Ala Val Glu Asn Asn Gln Ser Tyr Thr Asn Ser Asp Lys Ser
                165                 170                 175
His Ser Val Asn Gln Asn Ile Thr Ile Lys Thr Thr Leu Leu Lys Lys
            180                 185                 190
Leu Met Cys Ser Met Gln His Pro Pro Ser Trp Leu Ile His Trp Phe
        195                 200                 205
Asn Leu Tyr Thr Lys Leu Asn Asn Ile Leu Thr Gln Tyr Arg Ser Asn
    210                 215                 220
Glu Val Lys Ser His Gly Phe Ile Leu Ile Asp Asn Gln Thr Leu Ser
225                 230                 235                 240
Gly Phe Gln Phe Ile Leu Asn Gln Tyr Gly Cys Ile Val Tyr His Lys
                245                 250                 255
Gly Leu Lys Lys Ile Thr Thr Thr Thr Tyr Asn Gln Phe Leu Thr Trp
            260                 265                 270
Lys Asp Ile Ser Leu Ser Arg Leu Asn Val Cys Leu Ile Thr Trp Ile
        275                 280                 285
Ser Asn Cys Leu Asn Thr Leu Asn Lys Ser Leu Gly Leu Arg Cys Gly
    290                 295                 300
Phe Asn Asn Val Val Leu Ser Gln Leu Phe Leu Tyr Gly Asp Cys Ile
305                 310                 315                 320
Leu Lys Leu Phe His Asn Glu Gly Phe Tyr Ile Ile Lys Glu Val Glu
                325                 330                 335
Gly Phe Ile Met Ser Leu Ile Leu Asn Ile Thr Glu Glu Asp Gln Phe
            340                 345                 350
Lys Lys Arg Phe Tyr Asn Ser Met Leu Asn Asn Ile Thr Asp Ala Ala
        355                 360                 365
Ile Lys Ala Gln Lys Asp Leu Leu Ser Arg Val Cys His Thr Leu Leu
    370                 375                 380
Asp Lys Thr Val Ser Asp Asn Ile Ile Asn Gly Lys Trp Ile Ile Leu
385                 390                 395                 400
Leu Ser Lys Phe Leu Lys Leu Ile Lys Leu Ala Gly Asp Asn Asn Leu
                405                 410                 415
Asn Asn Leu Ser Glu Leu Tyr Phe Leu Phe Arg Ile Phe Gly His Pro
            420                 425                 430
Met Val Asp Glu Arg Gln Ala Met Asp Ser Val Arg Ile Asn Cys Asn
        435                 440                 445
Glu Thr Lys Phe Tyr Leu Leu Ser Ser Leu Ser Thr Leu Arg Gly Ala
    450                 455                 460
Phe Ile Tyr Arg Ile Ile Lys Gly Phe Val Asn Thr Tyr Asn Arg Trp
465                 470                 475                 480
Pro Thr Leu Arg Asn Ala Ile Val Leu Pro Leu Arg Trp Leu Asn Tyr
                485                 490                 495
Tyr Lys Leu Asn Thr Tyr Pro Ser Leu Leu Glu Ile Thr Glu Asn Asp
            500                 505                 510
Leu Ile Ile Leu Ser Gly Leu Arg Phe Tyr Arg Glu Phe His Leu Pro
        515                 520                 525
Lys Lys Val Asp Leu Glu Met Ile Ile Asn Asp Lys Ala Ile Ser Pro
    530                 535                 540
Pro Lys Asp Leu Ile Trp Thr Ser Phe Pro Arg Asn Tyr Met Pro Ser
545                 550                 555                 560
His Ile Gln Asn Tyr Ile Glu His Glu Lys Leu Lys Phe Ser Glu Ser
                565                 570                 575
Asp Arg Ser Arg Arg Val Leu Glu Tyr Tyr Leu Arg Asp Asn Lys Phe
            580                 585                 590
Asn Glu Cys Asp Leu Tyr Asn Cys Val Val Asn Gln Ser Tyr Leu Asn
        595                 600                 605
Asn Ser Asn His Val Val Ser Leu Thr Gly Lys Glu Arg Glu Leu Ser
    610                 615                 620
Val Gly Arg Met Phe Ala Met Gln Pro Gly Met Phe Arg Gln Ile Gln
625                 630                 635                 640
Ile Leu Ala Glu Lys Met Ile Ala Glu Asn Ile Leu Gln Phe Phe Pro
                645                 650                 655
Glu Ser Leu Thr Arg Tyr Gly Asp Leu Glu Leu Gln Lys Ile Leu Glu
            660                 665                 670
Leu Lys Ala Gly Ile Ser Asn Lys Ser Asn Arg Tyr Asn Asp Asn Tyr
        675                 680                 685
Asn Asn Tyr Ile Ser Lys Cys Ser Ile Ile Thr Asp Leu Ser Lys Phe
    690                 695                 700
Asn Gln Ala Phe Arg Tyr Glu Thr Ser Cys Ile Cys Ser Asp Val Leu
705                 710                 715                 720
Asp Glu Leu His Gly Val Gln Ser Leu Phe Ser Trp Leu His Leu Thr
                725                 730                 735
Ile Pro Leu Val Thr Ile Ile Cys Thr Tyr Arg His Ala Pro Pro Phe
            740                 745                 750
Ile Lys Asp His Val Val Asn Leu Asn Glu Val Asp Glu Gln Ser Gly
        755                 760                 765
Leu Tyr Arg Tyr His Met Gly Gly Ile Glu Gly Trp Cys Gln Lys Leu
    770                 775                 780
Trp Thr Ile Glu Ala Ile Ser Leu Leu Asp Leu Ile Ser Leu Lys Gly
785                 790                 795                 800
Lys Phe Ser Ile Thr Ala Leu Ile Asn Gly Asp Asn Gln Ser Ile Asp
                805                 810                 815
Ile Ser Lys Pro Val Arg Leu Ile Glu Gly Gln Thr His Ala Gln Ala
            820                 825                 830
Asp Tyr Leu Leu Ala Leu Asn Ser Leu Lys Leu Leu Tyr Lys Glu Tyr
        835                 840                 845
Ala Gly Ile Gly His Lys Leu Lys Gly Thr Glu Thr Tyr Ile Ser Arg
    850                 855                 860
Asp Met Gln Phe Met Ser Lys Thr Ile Gln His Asn Gly Val Tyr Tyr
865                 870                 875                 880
Pro Ala Ser Ile Lys Lys Val Leu Arg Val Gly Pro Trp Ile Asn Thr
                885                 890                 895
Ile Leu Asp Asp Phe Lys Val Ser Leu Glu Ser Ile Gly Ser Leu Thr
            900                 905                 910
Gln Glu Leu Glu Tyr Arg Gly Glu Ser Leu Leu Cys Ser Leu Ile Phe
        915                 920                 925
Arg Asn Ile Trp Leu Tyr Asn Gln Ile Ala Leu Gln Leu Arg Asn His
    930                 935                 940
Ala Leu Cys Asn Asn Lys Leu Tyr Leu Asp Ile Leu Lys Val Leu Lys
945                 950                 955                 960
His Leu Lys Thr Phe Phe Asn Leu Asp Ser Ile Asp Met Ala Leu Ser
                965                 970                 975
Leu Tyr Met Asn Leu Pro Met Leu Phe Gly Gly Gly Asp Pro Asn Leu
            980                 985                 990
Leu Tyr Arg Ser Phe Tyr Arg Arg Thr Pro Asp Phe Leu Thr Glu Ala
        995                 1000                1005
Ile Val His Ser Val Phe Val Leu Ser Tyr Tyr Thr Gly His Asp Leu
    1010                1015                1020
Gln Asp Lys Leu Gln Asp Leu Pro Asp Asp Arg Leu Asn Lys Phe Leu
1025                1030                1035                1040
Thr Cys Val Ile Thr Phe Asp Lys Asn Pro Asn Ala Glu Phe Val Thr
                1045                1050                1055
Leu Met Arg Asp Pro Gln Ala Leu Gly Ser Glu Arg Gln Ala Lys Ile
            1060                1065                1070
Thr Ser Glu Ile Asn Arg Leu Ala Val Thr Glu Val Leu Ser Ile Ala
        1075                1080                1085
Pro Asn Lys Ile Phe Ser Lys Ser Ala Gln His Tyr Thr Thr Thr Glu
    1090                1095                1100
Ile Asp Leu Asn Asp Ile Met Gln Asn Ile Glu Pro Thr Tyr Pro His
1105                1110                1115                1120
Gly Leu Arg Val Val Tyr Glu Ser Leu Pro Phe Tyr Lys Ala Glu Lys
                1125                1130                1135
Ile Val Asn Leu Ile Ser Gly Thr Lys Ser Ile Thr Asn Ile Leu Glu
            1140                1145                1150
Lys Thr Ser Ala Ile Asp Thr Thr Asp Ile Asn Arg Ala Thr Asp Met
        1155                1160                1165
Met Arg Lys Asn Ile Thr Leu Leu Ile Arg Ile Leu Pro Leu Asp Cys
    1170                1175                1180
Asn Lys Asp Lys Arg Glu Leu Leu Ser Leu Glu Asn Leu Ser Ile Thr
1185                1190                1195                1200
Glu Leu Ser Lys Tyr Val Arg Glu Arg Ser Trp Ser Leu Ser Asn Ile
                1205                1210                1215
Val Gly Val Thr Ser Pro Ser Ile Met Phe Thr Met Asn Ile Lys Tyr
            1220                1225                1230
Thr Thr Ser Thr Ile Ala Ser Gly Ile Ile Ile Glu Lys Tyr Asn Val
        1235                1240                1245
Asn Ser Leu Thr Arg Gly Glu Arg Gly Pro Thr Lys Pro Trp Val Gly
    1250                1255                1260
Ser Ser Thr Gln Glu Lys Lys Thr Met Pro Val Tyr Asn Arg Gln Val
1265                1270                1275                1280
Leu Thr Lys Lys Gln Arg Asp Gln Ile Asp Leu Leu Ala Lys Leu Asp
                1285                1290                1295
Trp Val Tyr Ala Ser Ile Asp Asn Lys Asp Glu Phe Met Glu Glu Leu
            1300                1305                1310
Ser Thr Gly Thr Leu Gly Leu Ser Tyr Glu Lys Ala Lys Lys Leu Phe
        1315                1320                1325
Pro Gln Tyr Leu Ser Val Asn Tyr Leu His Arg Leu Thr Val Ser Ser
    1330                1335                1340
Arg Pro Cys Glu Phe Pro Ala Ser Ile Pro Ala Tyr Arg Thr Thr Asn
1345                1350                1355                1360
Tyr His Phe Asp Thr Ser Pro Ile Asn His Val Leu Thr Glu Lys Tyr
                1365                1370                1375
Gly Asp Glu Asp Ile Asp Ile Val Phe Gln Asn Cys Ile Ser Phe Gly
            1380                1385                1390
Leu Ser Leu Met Ser Val Val Glu Gln Phe Thr Asn Ile Cys Pro Asn
        1395                1400                1405
Arg Ile Ile Leu Ile Pro Lys Leu Asn Glu Ile His Leu Met Lys Pro
    1410                1415                1420
Pro Ile Phe Thr Gly Asp Val Asp Ile Ile Lys Leu Lys Gln Val Ile
1425                1430                1435                1440
Gln Lys Gln His Met Phe Leu Pro Asp Lys Ile Ser Leu Thr Gln Tyr
                1445                1450                1455
Val Glu Leu Phe Leu Ser Asn Lys Ala Leu Lys Ser Gly Ser His Ile
            1460                1465                1470
Asn Ser Asn Leu Ile Leu Val His Lys Met Ser Asp Tyr Phe His Asn
        1475                1480                1485
Ala Tyr Ile Leu Ser Thr Asn Leu Ala Gly His Trp Ile Leu Ile Ile
    1490                1495                1500
Gln Leu Met Lys Asp Ser Lys Gly Ile Phe Glu Lys Asp Trp Gly Glu
1505                1510                1515                1520
Gly Tyr Ile Thr Asp His Met Phe Ile Asn Leu Asn Val Phe Phe Asn
                1525                1530                1535
Ala Tyr Lys Thr Tyr Leu Leu Cys Phe His Lys Gly Tyr Gly Lys Ala
            1540                1545                1550
Lys Leu Glu Cys Asp Met Asn Thr Ser Asp Leu Leu Cys Val Leu Glu
        1555                1560                1565
Leu Ile Asp Ser Ser Tyr Trp Lys Ser Met Ser Lys Val Phe Leu Glu
    1570                1575                1580
Gln Lys Val Ile Lys Tyr Ile Val Asn Gln Asp Thr Ser Leu Arg Arg
1585                1590                1595                1600
Ile Lys Gly Cys His Ser Phe Lys Leu Trp Phe Leu Lys Arg Leu Asn
                1605                1610                1615
Asn Ala Lys Phe Thr Val Cys Pro Trp Val Val Asn Ile Asp Tyr His
            1620                1625                1630
Pro Thr His Met Lys Ala Ile Leu Ser Tyr Ile Asp Leu Val Arg Met
        1635                1640                1645
Gly Leu Ile Asn Val Asp Lys Leu Thr Ile Lys Asn Lys Asn Lys Phe
    1650                1655                1660
Asn Asp Glu Phe Tyr Thr Ser Asn Leu Phe Tyr Ile Ser Tyr Asn Phe
1665                1670                1675                1680
Ser Asp Asn Thr His Leu Leu Thr Lys Gln Ile Arg Ile Ala Asn Ser
                1685                1690                1695
Glu Leu Glu Asp Asn Tyr Asn Lys Leu Tyr His Pro Thr Pro Glu Thr
            1700                1705                1710
Leu Glu Asn Met Ser Leu Ile Pro Val Lys Ser Asn Asn Ser Asn Lys
        1715                1720                1725
Pro Lys Phe Cys Ile Ser Gly Asn Thr Glu Ser Met Met Met Ser Thr
    1730                1735                1740
Phe Ser Ser Lys Met His Ile Lys Ser Ser Thr Val Thr Thr Arg Phe
1745                1750                1755                1760
Asn Tyr Ser Lys Gln Asp Leu Tyr Asn Leu Phe Pro Ile Val Val Ile
                1765                1770                1775
Asp Lys Ile Ile Asp His Ser Gly Asn Thr Ala Lys Ser Asn Gln Leu
            1780                1785                1790
Tyr Thr Thr Thr Ser His Gln Thr Ser Leu Val Arg Asn Ser Ala Ser
        1795                1800                1805
Leu Tyr Cys Met Leu Pro Trp His His Val Asn Arg Phe Asn Phe Val
    1810                1815                1820
Phe Ser Ser Thr Gly Cys Lys Ile Ser Ile Glu Tyr Ile Leu Lys Asp
1825                1830                1835                1840
Leu Lys Ile Lys Asp Pro Ser Cys Ile Ala Phe Ile Gly Glu Gly Ala
                1845                1850                1855
Gly Asn Leu Leu Leu Arg Thr Val Val Glu Leu His Pro Asp Ile Arg
            1860                1865                1870
Tyr Ile Tyr Arg Ser Leu Lys Asp Cys Asn Asp His Ser Leu Pro Ile
        1875                1880                1885
Glu Phe Leu Arg Leu Tyr Asn Gly His Ile Asn Ile Asp Tyr Gly Glu
    1890                1895                1900
Asn Leu Thr Ile Pro Ala Thr Asp Ala Thr Asn Asn Ile His Trp Ser
1905                1910                1915                1920
Tyr Leu His Ile Lys Phe Ala Glu Pro Ile Ser Ile Phe Val Cys Asp
                1925                1930                1935
Ala Glu Leu Pro Val Thr Ala Asn Trp Ser Lys Ile Ile Ile Glu Trp
            1940                1945                1950
Ser Lys His Val Arg Lys Cys Lys Tyr Cys Ser Ser Val Asn Arg Cys
        1955                1960                1965
Ile Leu Ile Ala Lys Tyr His Ala Gln Asp Asp Ile Asp Phe Lys Leu
    1970                1975                1980
Asp Asn Ile Thr Ile Leu Lys Thr Tyr Val Cys Leu Gly Ser Lys Leu
1985                1990                1995                2000
Lys Gly Ser Glu Val Tyr Leu Ile Leu Thr Ile Gly Pro Ala Asn Ile
                2005                2010                2015
Leu Pro Val Phe Asp Val Val Gln Asn Ala Lys Leu Ile Leu Ser Arg
            2020                2025                2030
Thr Lys Asn Phe Ile Met Pro Lys Lys Thr Asp Lys Glu Ser Ile Asp
        2035                2040                2045
Ala Asn Ile Lys Ser Leu Ile Pro Phe Leu Cys Tyr Pro Ile Thr Lys
    2050                2055                2060
Lys Gly Ile Lys Thr Ser Leu Ser Lys Leu Lys Ser Val Val Asn Gly
2065                2070                2075                2080
Asp Ile Leu Ser Tyr Ser Ile Ala Gly Arg Asn Glu Val Phe Ser Asn
                2085                2090                2095
Lys Leu Ile Asn His Lys His Met Asn Ile Leu Lys Trp Leu Asp His
            2100                2105                2110
Val Leu Asn Phe Arg Ser Ala Glu Leu Asn Tyr Asn His Leu Tyr Met
        2115                2120                2125
Ile Glu Ser Thr Tyr Pro Tyr Leu Ser Glu Leu Leu Asn Ser Leu Thr
    2130                2135                2140
Thr Asn Glu Leu Lys Lys Leu Ile Lys Ile Thr Gly Ser Val Leu Tyr
2145                2150                2155                2160
Asn Leu Pro Asn Glu Gln
                2165
(2) Information for SEQ ID NO: 33:
(i) Sequence characteristics:
(A) length: 15219 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO: 33:
ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60
TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120
ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180
ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240
ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300
GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360
TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420
AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480
CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540
TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600
AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660
GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720
AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780
TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840
AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900
CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960
AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020
CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080
CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140
TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200
GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260
AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320
TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380
AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440
GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500
CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560
AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620
TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680
CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740
TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800
ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860
GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920
GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980
TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040
ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100
CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160
CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220
GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280
AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340
AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400
CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460
TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520
TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580
CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640
TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700
CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760
TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820
ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880
AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940
TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000
AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060
TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120
TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180
CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240
AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300
ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360
AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420
AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480
GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540
TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600
AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660
CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720
GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780
TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840
AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900
CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960
GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020
ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080
ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140
CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCTGCTA GACCTAGAGT GCGAATAGGT 4200
AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260
CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320
ACAATAGAAT TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380
CTAATCTTTT TACTAATTAT AATCACTATT ATGATTGCAA TACTAAATAA GCTAAGTGAA 4440
CATAAAGCAT TCTGTAACAA AACTCTTGAA CTAGGACAGA TGTATCAAAT CAACACATAG 4500
AGTTCTACCA TTATGCTGTG TCAAATTATA ATCCTGTATA TATAAACAAA CAAATCCAAT 4560
CTTCTCACAG AGTCATGGTG TCGCAAAACC ACGCTAACTA TCATGGTAGC ATAGAGTAGT 4620
TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680
AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740
GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800
ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860
GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920
ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980
AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040
ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100
ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160
AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220
CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280
ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340
GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400
ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460
ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520
ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580
TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640
AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700
TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760
ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820
TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880
TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940
GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000
TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060
ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120
ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180
AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240
TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300
CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360
CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420
CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480
GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540
AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600
GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660
AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720
GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780
ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840
AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900
GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960
ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020
CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080
AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140
AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200
TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260
CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320
CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380
ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440
CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500
TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560
ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620
AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680
AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740
AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800
GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860
ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920
ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980
AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040
CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100
ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160
CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220
CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280
TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340
TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400
TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460
TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520
GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580
GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640
AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700
ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760
TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820
AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880
ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940
GATGAAAACT CAGTACTTAC AACTATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000
AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060
ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120
TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180
AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240
AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300
AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360
TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420
AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480
GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540
ACAGAAGAAG ATCAATTTAG GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600
GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660
ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720
TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780
AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840
TGTAATGAAA CTAAGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900
TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960
ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020
GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080
CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140
GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200
GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260
TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320
CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380
AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440
ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500
CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560
AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620
GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680
CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740
AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800
AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860
ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920
CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980
CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040
GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100
CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160
GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220
TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280
ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340
TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400
AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460
GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520
GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580
AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640
GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700
GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760
ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820
CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880
GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940
ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000
GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060
GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120
AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180
ACAATGGACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240
AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300
ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360
GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420
GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480
TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540
TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600
CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660
AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720
CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780
TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840
CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900
AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960
CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020
ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080
ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140
AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200
TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260
GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320
TTTAAGTTGT GGTTTTTAAA ACGCCTTGAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380
GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440
AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500
GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560
CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620
CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680
AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740
AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800
TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860
GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920
GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980
TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040
AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100
CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160
CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220
ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280
GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340
AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400
AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460
ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520
TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580
AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640
ATCGATGCAG TTATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700
ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760
ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820
CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880
TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940
GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000
TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060
TAGTTATTAA AAAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120
TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180
ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219
(2) Information for SEQ ID NO:34:
...
(i) sequence characteristics:
(A) length: 2166 amino acids
(B) type: amino acid
(C) strandedness:
(D) topology: linear
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:34:
Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp
1               5                   10                  15
Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
            20                  25                  30
Ser Tyr Leu Phe Asn Gly Pro Tyr Leu Lys Asn Asp Tyr Thr Asn Leu
        35                  40                  45
Ile Ser Arg Gln Ser Pro Leu Leu Glu His Met Asn Leu Lys Lys Leu
    50                  55                  60
Thr Ile Thr Gln Ser Leu Ile Ser Arg Tyr His Lys Gly Glu Leu Lys
65                  70                  75                  80
Leu Glu Glu Pro Thr Tyr Phe Gln Ser Leu Leu Met Thr Tyr Lys Ser
                85                  90                  95
Met Ser Ser Ser Glu Gln Ile Ala Thr Thr Asn Leu Leu Lys Lys Ile
            100                 105                 110
Ile Arg Arg Ala Ile Glu Ile Ser Asp Val Lys Val Tyr Ala Ile Leu
        115                 120                 125
Asn Lys Leu Gly Leu Lys Glu Lys Asp Arg Val Lys Pro Asn Asn Asn
    130                 135                 140
Ser Gly Asp Glu Asn Ser Val Leu Thr Thr Ile Ile Lys Asp Asp Ile
145                 150                 155                 160
Leu Ser Ala Val Glu Asn Asn Gln Ser Tyr Thr Asn Ser Asp Lys Ser
                165                 170                 175
His Ser Val Asn Gln Asn Ile Thr Ile Lys Thr Thr Leu Leu Lys Lys
            180                 185                 190
Leu Met Cys Ser Met Gln His Pro Pro Ser Trp Leu Ile His Trp Phe
        195                 200                 205
Asn Leu Tyr Thr Lys Leu Asn Asn Ile Leu Thr Gln Tyr Arg Ser Asn
    210                 215                 220
Glu Val Lys Ser His Gly Phe Ile Leu Ile Asp Asn Gln Thr Leu Ser
225                 230                 235                 240
Gly Phe Gln Phe Ile Leu Asn Gln Tyr Gly Cys Ile Val Tyr His Lys
                245                 250                 255
Gly Leu Lys Lys Ile Thr Thr Thr Thr Tyr Asn Gln Phe Leu Thr Trp
            260                 265                 270
Lys Asp Ile Ser Leu Ser Arg Leu Asn Val Cys Leu Ile Thr Trp Ile
        275                 280                 285
Ser Asn Cys Leu Asn Thr Leu Asn Lys Ser Leu Gly Leu Arg Cys Gly
    290                 295                 300
Phe Asn Asn Val Val Leu Ser Gln Leu Phe Leu Tyr Gly Asp Cys Ile
305                 310                 315                 320
Leu Lys Leu Phe His Asn Glu Gly Phe Tyr Ile Ile Lys Glu Val Glu
                325                 330                 335
Gly Phe Ile Met Ser Leu Ile Leu Asn Ile Thr Glu Glu Asp Gln Phe
            340                 345                 350
Arg Lys Arg Phe Tyr Asn Ser Met Leu Asn Asn Ile Thr Asp Ala Ala
        355                 360                 365
Ile Lys Ala Gln Lys Asp Leu Leu Ser Arg Val Cys His Thr Leu Leu
    370                 375                 380
Asp Lys Thr Val Ser Asp Asn Ile Ile Asn Gly Lys Trp Ile Ile Leu
385                 390                 395                 400
Leu Ser Lys Phe Leu Lys Leu Ile Lys Leu Ala Gly Asp Asn Asn Leu
                405                 410                 415
Asn Asn Leu Ser Glu Leu Tyr Phe Leu Phe Arg Ile Phe Gly His Pro
            420                 425                 430
Met Val Asp Glu Arg Gln Ala Met Asp Ser Val Arg Ile Asn Cys Asn
        435                 440                 445
Glu Thr Lys Phe Tyr Leu Leu Ser Ser Leu Ser Thr Leu Arg Gly Ala
    450                 455                 460
Phe Ile Tyr Arg Ile Ile Lys Gly Phe Val Asn Thr Tyr Asn Arg Trp
465                 470                 475                 480
Pro Thr Leu Arg Asn Ala Ile Val Leu Pro Leu Arg Trp Leu Asn Tyr
                485                 490                 495
Tyr Lys Leu Asn Thr Tyr Pro Ser Leu Leu Glu Ile Thr Glu Asn Asp
            500                 505                 510
Leu Ile Ile Leu Ser Gly Leu Arg Phe Tyr Arg Glu Phe His Leu Pro
        515                 520                 525
Lys Lys Val Asp Leu Glu Met Ile Ile Asn Asp Lys Ala Ile Ser Pro
    530                 535                 540
Pro Lys Asp Leu Ile Trp Thr Ser Phe Pro Arg Asn Tyr Met Pro Ser
545                 550                 555                 560
His Ile Gln Asn Tyr Ile Glu His Glu Lys Leu Lys Phe Ser Glu Ser
                565                 570                 575
Asp Arg Ser Arg Arg Val Leu Glu Tyr Tyr Leu Arg Asp Asn Lys Phe
            580                 585                 590
Asn Glu Cys Asp Leu Tyr Asn Cys Val Val Asn Gln Ser Tyr Leu Asn
        595                 600                 605
Asn Ser Asn His Val Val Ser Leu Thr Gly Lys Glu Arg Glu Leu Ser
    610                 615                 620
Val Gly Arg Met Phe Ala Met Gln Pro Gly Met Phe Arg Gln Ile Gln
625                 630                 635                 640
Ile Leu Ala Glu Lys Met Ile Ala Glu Asn Ile Leu Gln Phe Phe Pro
                645                 650                 655
Glu Ser Leu Thr Arg Tyr Gly Asp Leu Glu Leu Gln Lys Ile Leu Glu
            660                 665                 670
Leu Lys Ala Gly Ile Ser Asn Lys Ser Asn Arg Tyr Asn Asp Asn Tyr
        675                 680                 685
Asn Asn Tyr Ile Ser Lys Cys Ser Ile Ile Thr Asp Leu Ser Lys Phe
    690                 695                 700
Asn Gln Ala Phe Arg Tyr Glu Thr Ser Cys Ile Cys Ser Asp Val Leu
705                 710                 715                 720
Asp Glu Leu His Gly Val Gln Ser Leu Phe Ser Trp Leu His Leu Thr
                725                 730                 735
Ile Pro Leu Val Thr Ile Ile Cys Thr Tyr Arg His Ala Pro Pro Phe
            740                 745                 750
Ile Lys Asp His Val Val Asn Leu Asn Glu Val Asp Glu Gln Ser Gly
        755                 760                 765
Leu Tyr Arg Tyr His Met Gly Gly Ile Glu Gly Trp Cys Gln Lys Leu
    770                 775                 780
Trp Thr Ile Glu Ala Ile Ser Leu Leu Asp Leu Ile Ser Leu Lys Gly
785                 790                 795                 800
Lys Phe Ser Ile Thr Ala Leu Ile Asn Gly Asp Asn Gln Ser Ile Asp
                805                 810                 815
Ile Ser Lys Pro Val Arg Leu Ile Glu Gly Gln Thr His Ala Gln Ala
            820                 825                 830
Asp Tyr Leu Leu Ala Leu Asn Ser Leu Lys Leu Leu Tyr Lys Glu Tyr
        835                 840                 845
Ala Gly Ile Gly His Lys Leu Lys Gly Thr Glu Thr Tyr Ile Ser Arg
    850                 855                 860
Asp Met Gln Phe Met Ser Lys Thr Ile Gln His Asn Gly Val Tyr Tyr
865                 870                 875                 880
Pro Ala Ser Ile Lys Lys Val Leu Arg Val Gly Pro Trp Ile Asn Thr
                885                 890                 895
Ile Leu Asp Asp Phe Lys Val Ser Leu Glu Ser Ile Gly Ser Leu Thr
            900                 905                 910
Gln Glu Leu Glu Tyr Arg Gly Glu Ser Leu Leu Cys Ser Leu Ile Phe
        915                 920                 925
Arg Asn Ile Trp Leu Tyr Asn Gln Ile Ala Leu Gln Leu Arg Asn His
    930                 935                 940
Ala Leu Cys Asn Asn Lys Leu Tyr Leu Asp Ile Leu Lys Val Leu Lys
945                 950                 955                 960
His Leu Lys Thr Phe Phe Asn Leu Asp Ser Ile Asp Met Ala Leu Ser
                965                 970                 975
Leu Tyr Met Asn Leu Pro Met Leu Phe Gly Gly Gly Asp Pro Asn Leu
            980                 985                 990
Leu Tyr Arg Ser Phe Tyr Arg Arg Thr Pro Asp Phe Leu Thr Glu Ala
        995                 1000                1005
Ile Val His Ser Val Phe Val Leu Ser Tyr Tyr Thr Gly His Asp Leu
    1010                1015                1020
Gln Asp Lys Leu Gln Asp Leu Pro Asp Asp Arg Leu Asn Lys Phe Leu
1025                1030                1035                1040
Thr Cys Val Ile Thr Phe Asp Lys Asn Pro Asn Ala Glu Phe Val Thr
                1045                1050                1055
Leu Met Arg Asp Pro Gln Ala Leu Gly Ser Glu Arg Gln Ala Lys Ile
            1060                1065                1070
Thr Ser Glu Ile Asn Arg Leu Ala Val Thr Glu Val Leu Ser Ile Ala
        1075                1080                1085
Pro Asn Lys Ile Phe Ser Lys Ser Ala Gln His Tyr Thr Thr Thr Glu
    1090                1095                1100
Ile Asp Leu Asn Asp Ile Met Gln Asn Ile Glu Pro Thr Tyr Pro His
1105                1110                1115                1120
Gly Leu Arg Val Val Tyr Glu Ser Leu Pro Phe Tyr Lys Ala Glu Lys
                1125                1130                1135
Ile Val Asn Leu Ile Ser Gly Thr Lys Ser Ile Thr Asn Ile Leu Glu
            1140                1145                1150
Lys Thr Ser Ala Ile Asp Thr Thr Asp Ile Asn Arg Ala Thr Asp Met
        1155                1160                1165
Met Arg Lys Asn Ile Thr Leu Leu Ile Arg Ile Leu Pro Leu Asp Cys
    1170                1175                1180
Asn Lys Asp Lys Arg Glu Leu Leu Ser Leu Glu Asn Leu Ser Ile Thr
1185                1190                1195                1200
Glu Leu Ser Lys Tyr Val Arg Glu Arg Ser Trp Ser Leu Ser Asn Ile
                1205                1210                1215
Val Gly Val Thr Ser Pro Ser Ile Met Phe Thr Met Asp Ile Lys Tyr
            1220                1225                1230
Thr Thr Ser Thr Ile Ala Ser Gly Ile Ile Ile Glu Lys Tyr Asn Val
        1235                1240                1245
Asn Ser Leu Thr Arg Gly Glu Arg Gly Pro Thr Lys Pro Trp Val Gly
    1250                1255                1260
Ser Ser Thr Gln Glu Lys Lys Thr Met Pro Val Tyr Asn Arg Gln Val
1265                1270                1275                1280
Leu Thr Lys Lys Gln Arg Asp Gln Ile Asp Leu Leu Ala Lys Leu Asp
                1285                1290                1295
Trp Val Tyr Ala Ser Ile Asp Asn Lys Asp Glu Phe Met Glu Glu Leu
            1300                1305                1310
Ser Thr Gly Thr Leu Gly Leu Ser Tyr Glu Lys Ala Lys Lys Leu Phe
        1315                1320                1325
Pro Gln Tyr Leu Ser Val Asn Tyr Leu His Arg Leu Thr Val Ser Ser
    1330                1335                1340
Arg Pro Cys Glu Phe Pro Ala Ser Ile Pro Ala Tyr Arg Thr Thr Asn
1345                1350                1355                1360
Tyr His Phe Asp Thr Ser Pro Ile Asn His Val Leu Thr Glu Lys Tyr
                1365                1370                1375
Gly Asp Glu Asp Ile Asp Ile Val Phe Gln Asn Cys Ile Ser Phe Gly
            1380                1385                1390
Leu Ser Leu Met Ser Val Val Glu Gln Phe Thr Asn Ile Cys Pro Asn
        1395                1400                1405
Arg Ile Ile Leu Ile Pro Lys Leu Asn Glu Ile His Leu Met Lys Pro
    1410                1415                1420
Pro Ile Phe Thr Gly Asp Val Asp Ile Ile Lys Leu Lys Gln Val Ile
1425                1430                1435                1440
Gln Lys Gln His Met Phe Leu Pro Asp Lys Ile Ser Leu Thr Gln Tyr
                1445                1450                1455
Val Glu Leu Phe Leu Ser Asn Lys Ala Leu Lys Ser Gly Ser His Ile
            1460                1465                1470
Asn Ser Asn Leu Ile Leu Val His Lys Met Ser Asp Tyr Phe His Asn
        1475                1480                1485
Ala Tyr Ile Leu Ser Thr Asn Leu Ala Gly His Trp Ile Leu Ile Ile
    1490                1495                1500
Gln Leu Met Lys Asp Ser Lys Gly Ile Phe Glu Lys Asp Trp Gly Glu
1505                1510                1515                1520
Gly Tyr Ile Thr Asp His Met Phe Ile Asn Leu Asn Val Phe Phe Asn
                1525                1530                1535
Ala Tyr Lys Thr Tyr Leu Leu Cys Phe His Lys Gly Tyr Gly Lys Ala
            1540                1545                1550
Lys Leu Glu Cys Asp Met Asn Thr Ser Asp Leu Leu Cys Val Leu Glu
        1555                1560                1565
Leu Ile Asp Ser Ser Tyr Trp Lys Ser Met Ser Lys Val Phe Leu Glu
    1570                1575                1580
Gln Lys Val Ile Lys Tyr Ile Val Asn Gln Asp Thr Ser Leu Arg Arg
1585                1590                1595                1600
Ile Lys Gly Cys His Ser Phe Lys Leu Trp Phe Leu Lys Arg Leu Asp
                1605                1610                1615
Asn Ala Lys Phe Thr Val Cys Pro Trp Val Val Asn Ile Asp Tyr His
            1620                1625                1630
Pro Thr His Met Lys Ala Ile Leu Ser Tyr Ile Asp Leu Val Arg Met
        1635                1640                1645
Gly Leu Ile Asn Val Asp Lys Leu Thr Ile Lys Asn Lys Asn Lys Phe
    1650                1655                1660
Asn Asp Glu Phe Tyr Thr Ser Asn Leu Phe Tyr Ile Ser Tyr Asn Phe
1665                1670                1675                1680
Ser Asp Asn Thr His Leu Leu Thr Lys Gln Ile Arg Ile Ala Asn Ser
                1685                1690                1695
Glu Leu Glu Asp Asn Tyr Asn Lys Leu Tyr His Pro Thr Pro Glu Thr
            1700                1705                1710
Leu Glu Asn Met Ser Leu Ile Pro Val Lys Ser Asn Asn Ser Asn Lys
        1715                1720                1725
Pro Lys Phe Cys Ile Ser Gly Asn Thr Glu Ser Met Met Met Ser Thr
    1730                1735                1740
Phe Ser Ser Lys Met His Ile Lys Ser Ser Thr Val Thr Thr Arg Phe
1745                1750                1755                1760
Asn Tyr Ser Lys Gln Asp Leu Tyr Asn Leu Phe Pro Ile Val Val Ile
                1765                1770                1775
Asp Lys Ile Ile Asp His Ser Gly Asn Thr Ala Lys Ser Asn Gln Leu
            1780                1785                1790
Tyr Thr Thr Thr Ser His Gln Thr Ser Leu Val Arg Asn Ser Ala Ser
        1795                1800                1805
Leu Tyr Cys Met Leu Pro Trp His His Val Asn Arg Phe Asn Phe Val
    1810                1815                1820
Phe Ser Ser Thr Gly Cys Lys Ile Ser Ile Glu Tyr Ile Leu Lys Asp
1825                1830                1835                1840
Leu Lys Ile Lys Asp Pro Ser Cys Ile Ala Phe Ile Gly Glu Gly Ala
                1845                1850                1855
Gly Asn Leu Leu Leu Arg Thr Val Val Glu Leu His Pro Asp Ile Arg
            1860                1865                1870
Tyr Ile Tyr Arg Ser Leu Lys Asp Cys Asn Asp His Ser Leu Pro Ile
        1875                1880                1885
Glu Phe Leu Arg Leu Tyr Asn Gly His Ile Asn Ile Asp Tyr Gly Glu
    1890                1895                1900
Asn Leu Thr Ile Pro Ala Thr Asp Ala Thr Asn Asn Ile His Trp Ser
1905                1910                1915                1920
Tyr Leu His Ile Lys Phe Ala Glu Pro Ile Ser Ile Phe Val Cys Asp
                1925                1930                1935
Ala Glu Leu Pro Val Thr Ala Asn Trp Ser Lys Ile Ile Ile Glu Trp
            1940                1945                1950
Ser Lys His Val Arg Lys Cys Lys Tyr Cys Ser Ser Val Asn Arg Cys
        1955                1960                1965
Ile Leu Ile Ala Lys Tyr His Ala Gln Asp Asp Ile Asp Phe Lys Leu
    1970                1975                1980
Asp Asn Ile Thr Ile Leu Lys Thr Tyr Val Cys Leu Gly Ser Lys Leu
1985                1990                1995                2000
Lys Gly Ser Glu Val Tyr Leu Ile Leu Thr Ile Gly Pro Ala Asn Ile
                2005                2010                2015
Leu Pro Val Phe Asp Val Val Gln Asn Ala Lys Leu Ile Leu Ser Arg
            2020                2025                2030
Thr Lys Asn Phe Ile Met Pro Lys Lys Thr Asp Lys Glu Ser Ile Asp
        2035                2040                2045
Ala Val Ile Lys Ser Leu Ile Pro Phe Leu Cys Tyr Pro Ile Thr Lys
    2050                2055                2060
Lys Gly Ile Lys Thr Ser Leu Ser Lys Leu Lys Ser Val Val Asn Gly
2065                2070                2075                2080
Asp Ile Leu Ser Tyr Ser Ile Ala Gly Arg Asn Glu Val Phe Ser Asn
                2085                2090                2095
Lys Leu Ile Asn His Lys His Met Asn Ile Leu Lys Trp Leu Asp His
            2100                2105                2110
Val Leu Asn Phe Arg Ser Ala Glu Leu Asn Tyr Asn His Leu Tyr Met
        2115                2120                2125
Ile Glu Ser Thr Tyr Pro Tyr Leu Ser Glu Leu Leu Asn Ser Leu Thr
    2130                2135                2140
Thr Asn Glu Leu Lys Lys Leu Ile Lys Ile Thr Gly Ser Val Leu Tyr
2145                2150                2155                2160
Asn Leu Pro Asn Glu Gln
                2165
(2) Information for SEQ ID NO:35:
(i) sequence characteristics:
(A) length: 24 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:35:
CATATCACTC ACTCTGGGAT GGAG 24
(2) Information for SEQ ID NO:36:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:36:
TCAGAACATC AAGCACCGCC 20
(2) Information for SEQ ID NO:37:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:37:
ACAGTCAAGA CTGAGATGAG 20
(2) Information for SEQ ID NO:38:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:38:
AAGAGTCAGA TACATGTGGA 20
(2) Information for SEQ ID NO:39:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:39:
ACATGAATCA GCCTAAAGTC 20
(2) Information for SEQ ID NO:40:
(i) sequence characteristics:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:40:
CCGAAAGAGT TCCTGCGTTA CGACC 25
(2) Information for SEQ ID NO:41:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:41:
CAGTCCACAC AAGTACCAGG 20
(2) Information for SEQ ID NO:42:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:42:
GTCAGAAGCT GTGGACCATC 20
(2) Information for SEQ ID NO:43:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:43:
AATATTGCTA CAACAATGGC 20
(2) Information for SEQ ID NO:44:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:44:
ACTCTTCATT CCTAGACTGG 20
(2) Information for SEQ ID NO:45:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:45:
GTCCAATTAT GACTATGAAC 20
(2) Information for SEQ ID NO:46:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:46:
AGAACAGACA TGAAGCTTGC 20
(2) Information for SEQ ID NO:47:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:47:
CCAACAAGGA ATGCTTCTAG 20
(2) Information for SEQ ID NO:48:
(i) sequence characteristics:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:48:
ACAGCACTAT CTATGATTGA CCTGG 25
(2) Information for SEQ ID NO:49:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:49:
GCAACATGGT TTACACATGC 20
(2) Information for SEQ ID NO:50:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:50:
AGATTGAGAG TTGATCCAGG 20
(2) Information for SEQ ID NO:51:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:51:
AGGAGATACT TAAACTAAGC 20
(2) Information for SEQ ID NO:52:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:52:
TAAGCTTATG CCTTTCAGCG 20
(2) Information for SEQ ID NO:53:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:53:
TTAACGGACC TAAGCTGTGC 20
(2) Information for SEQ ID NO:54:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:54:
GAAACAGATT ATTATGACGG 20
(2) Information for SEQ ID NO:55:
(i) sequence characteristics:
(A) length: 24 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:55:
CGGGCTATCT AGGTGAACTT CAGG 24
(2) Information for SEQ ID NO:56:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:56:
ATTTGGATAT GGAATATGAG 20
(2) Information for SEQ ID NO:57:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:57:
ACTCAACTGA ACTACCAGTG 20
(2) Information for SEQ ID NO:58:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:58:
AAGAACATCA TGTATTTCAG 20
(2) Information for SEQ ID NO:59:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:59:
TTATCAACGC ACTGCTCATG 20
(2) Information for SEQ ID NO:60:
(i) sequence characteristics:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:60:
ATTTTCAGCA ATCACTTGGC ATGCC 25
(2) Information for SEQ ID NO:61:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:61:
GCCTCTGTGC AAACAAGCTG 20
(2) Information for SEQ ID NO:62:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:62:
TCTCTAGTTA CTCTAGCAGC 20
(2) Information for SEQ ID NO:63:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:63:
AGGTCGTTGT TTGTGAGGAG 20
(2) Information for SEQ ID NO:64:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:64:
TCGTCCTCTT CTTTACTGTC 20
(2) Information for SEQ ID NO:65:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:65:
CCGTCCTCGA GCTAGCCTCG 20
(2) Information for SEQ ID NO:66:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:66:
CTCCTCCAGG CTCACATTGG 20
(2) Information for SEQ ID NO:67:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:67:
GGGTTGGTAC ATAGCTCTGC 20
(2) Information for SEQ ID NO:68:
(i) sequence characteristics:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:68:
CACCCATCTG ATATTTCCCT GATGG 25
(2) Information for SEQ ID NO:69:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:69:
TGGTTGACAG TACAAATCTG 20
(2) Information for SEQ ID NO:70:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:70:
CTGAAATGGG AAGATTGTGC 20
(2) information for SEQ ID NO:71:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:71:
AGCAATCTAC ACTGCCTACC 20
(2) information for SEQ ID NO:72:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:72:
TCACAGATGA TTCAATTATC 20
(2) information for SEQ ID NO:73:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:73:
GATCCTAGAT ATAAGTTCTC 20
(2) information for SEQ ID NO:74:
(i) sequence characteristics:
(A) length: 21 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:74:
ACCAAACAAA GTTGGGTAAG G 21
(2) information for SEQ ID NO:75:
(i) sequence characteristics:
(A) length: 32 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:75:
GGGGGATCCA TCCCTAATCC TGCTCTTGTC CC 32
(2) information for SEQ ID NO:76:
(i) sequence characteristics:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:76:
GATTCCTCTG ATGGCTCCAC 20
(2) information for SEQ ID NO:77:
(i) sequence characteristics:
(A) length: 21 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:77:
TAACAGTCAA GGAGACCAAA G 21
(2) information for SEQ ID NO:78:
(i) sequence characteristics:
(A) length: 32 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:78:
GGGAAGCTTA ACCCTAATCC TGCCCTAGGT GG 32
(2) information for SEQ ID NO:79:
(i) sequence characteristics:
(A) length: 22 base pairs
(B) type: nucleic acid
(C) strandedness: single
(D) topology: linear
(ii) molecule type: RNA (genomic)
(xi) sequence description: SEQ ID NO:79:
ACCAGACAAA GCTGGGAATA GA 22
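Each entry in the sequence listing above declares a length and then prints the sequence in blocks of ten bases. As a minimal sketch of how those declarations could be cross-checked, the Python below compares the declared length with the number of bases actually printed for a few of the entries. The names `PRIMERS` and `check_listing` are hypothetical and are not part of the patent; the sequences and lengths are copied from the listing.

```python
# A few entries copied from the sequence listing above:
# SEQ ID NO -> (declared length, sequence as printed in blocks of ten).
PRIMERS = {
    63: (20, "AGGTCGTTGT TTGTGAGGAG"),
    68: (25, "CACCCATCTG ATATTTCCCT GATGG"),
    74: (21, "ACCAAACAAA GTTGGGTAAG G"),
    75: (32, "GGGGGATCCA TCCCTAATCC TGCTCTTGTC CC"),
    79: (22, "ACCAGACAAA GCTGGGAATA GA"),
}


def check_listing(entries):
    """Return {seq_id: (declared, actual)} for entries that fail the check.

    An entry fails if the number of printed bases differs from the declared
    length or if it contains characters other than A, C, G, T or U.
    """
    problems = {}
    for seq_id, (declared, printed) in entries.items():
        bases = printed.replace(" ", "").upper()
        if len(bases) != declared or set(bases) - set("ACGTU"):
            problems[seq_id] = (declared, len(bases))
    return problems


mismatches = check_listing(PRIMERS)
print(mismatches if mismatches else "all declared lengths match")
```

The same function can be applied to the remaining entries by extending the dictionary; an empty result means every declared length agrees with the printed sequence.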
Claims (46)
1. An isolated, recombinantly generated, attenuated, nonsegmented, negative-sense, single-stranded RNA virus of the order Mononegavirales, having at least one attenuating mutation in the 3' genomic promoter region and at least one attenuating mutation in the RNA polymerase gene.
2. The virus according to claim 1, wherein the virus is from the family Paramyxoviridae.
3. The virus according to claim 2, wherein the virus is from the subfamily Paramyxovirinae.
4. The virus according to claim 3, wherein the virus is from the genus Morbillivirus.
5. The virus according to claim 4, wherein the virus is measles virus.
6. The measles virus according to claim 5, wherein:
(a) the at least one attenuating mutation in the 3' genomic promoter region is selected from the group consisting of nucleotide 26 (A → T), nucleotide 42 (A → T or A → C) and nucleotide 96 (G → A), where these nucleotides are presented in positive-strand, antigenomic, message sense; and
(b) the at least one attenuating mutation in the RNA polymerase gene is selected from the group consisting of nucleotide changes that produce the following amino acid changes: residues 331 (isoleucine → threonine), 1409 (alanine → threonine), 1624 (threonine → alanine), 1649 (arginine → methionine), 1717 (aspartic acid → alanine), 1936 (histidine → tyrosine), 2074 (glutamine → arginine) and 2114 (arginine → lysine).
7. The virus according to claim 3, wherein the virus is from the genus Paramyxovirus.
8. The virus according to claim 7, wherein the virus is human parainfluenza virus type 3 (PIV-3).
9. The PIV-3 virus according to claim 8, wherein:
(a) the at least one attenuating mutation in the 3' genomic promoter region is selected from the group consisting of nucleotide 23 (T → C), nucleotide 24 (C → T), nucleotide 28 (G → T) and nucleotide 45 (T → A), where these nucleotides are presented in positive-strand, antigenomic, message sense; and
(b) the at least one attenuating mutation in the RNA polymerase gene is selected from the group consisting of nucleotide changes that produce the following amino acid changes: residues 942 (tyrosine → histidine), 992 (leucine → phenylalanine), 1292 (leucine → phenylalanine) and 1558 (threonine → isoleucine).
10. The virus according to claim 3, wherein the virus is from the genus Rubulavirus.
11. The virus according to claim 2, wherein the virus is from the subfamily Pneumovirinae.
12. The virus according to claim 11, wherein the virus is from the genus Pneumovirus.
13. The virus according to claim 12, wherein the virus is human respiratory syncytial virus (RSV) subgroup B.
14. The virus according to claim 13, wherein:
(a) the at least one attenuating mutation in the 3' genomic promoter region is selected from the group consisting of nucleotide 4 (C → G) and the insertion of an additional A in the stretch of A's at nucleotides 6 to 11, where these nucleotides are presented in positive-strand, antigenomic, message sense; and
(b) the at least one attenuating mutation in the RNA polymerase gene is selected from the group consisting of nucleotide changes that produce the following amino acid changes: residues 353 (arginine → lysine), 451 (lysine → arginine), 1229 (aspartic acid → asparagine), 2029 (threonine → isoleucine) and 2050 (asparagine → aspartic acid).
15. The virus according to claim 1, wherein the virus is from the family Rhabdoviridae.
16. The virus according to claim 1, wherein the virus is from the family Filoviridae.
17. A vaccine comprising the isolated, recombinantly generated, attenuated, nonsegmented, negative-sense, single-stranded RNA virus of the order Mononegavirales according to claim 1 and a physiologically acceptable carrier.
18. The vaccine according to claim 17, which comprises the measles virus of claim 5 and a physiologically acceptable carrier.
19. The vaccine according to claim 18, which comprises the measles virus of claim 6 and a physiologically acceptable carrier.
20. The vaccine according to claim 17, which comprises the PIV-3 of claim 8 and a physiologically acceptable carrier.
21. The vaccine according to claim 20, which comprises the PIV-3 of claim 9 and a physiologically acceptable carrier.
22. The vaccine according to claim 17, which comprises the RSV subgroup B of claim 13 and a physiologically acceptable carrier.
23. The vaccine according to claim 22, which comprises the RSV subgroup B of claim 14 and a physiologically acceptable carrier.
24. A method for immunizing an individual to induce protection against a nonsegmented, negative-sense, single-stranded RNA virus of the order Mononegavirales, which comprises administering to the individual the vaccine of claim 17.
25. The method according to claim 24, wherein the vaccine is the vaccine of claim 18.
26. The method according to claim 25, wherein the vaccine is the vaccine of claim 19.
27. The method according to claim 24, wherein the vaccine is the vaccine of claim 20.
28. The method according to claim 27, wherein the vaccine is the vaccine of claim 21.
29. The method according to claim 24, wherein the vaccine is the vaccine of claim 22.
30. The method according to claim 29, wherein the vaccine is the vaccine of claim 23.
31. An isolated nucleic acid molecule comprising a measles virus sequence in positive-strand, antigenomic, message sense selected from the group consisting of: 1977 wild-type strain (SEQ ID NO:3); 1983 wild-type strain (SEQ ID NO:5), wherein nucleotide 2499 is G or C; Montefiore wild-type strain (SEQ ID NO:7); Rubeovax™ vaccine strain (SEQ ID NO:9), wherein nucleotide 2143 is T or C; Moraten vaccine strain (SEQ ID NO:11); Schwarz vaccine strain (SEQ ID NO:11), wherein nucleotide 4917 is C and nucleotide 4924 is C; and Zagreb vaccine strain (SEQ ID NO:13); and the complementary genomic sequences thereof.
32. An isolated nucleic acid molecule comprising a PIV-3 sequence in positive-strand, antigenomic, message sense selected from the group consisting of: cp45 vaccine strain grown in fetal rhesus lung cells (SEQ ID NO:19) and cp45 vaccine strain grown in Vero cells (SEQ ID NO:21); and the complementary genomic sequences thereof.
33. A composition comprising a transcription vector that comprises an isolated nucleic acid molecule encoding a nonsegmented, negative-sense, single-stranded RNA virus of the order Mononegavirales, the nucleic acid molecule having at least one attenuating mutation in the 3' genomic promoter region and at least one attenuating mutation in the RNA polymerase gene, together with at least one expression vector that comprises at least one isolated nucleic acid molecule encoding the trans-acting proteins necessary for encapsidation, transcription and replication, whereby upon expression an infectious attenuated virus is produced.
34. The composition according to claim 33, wherein the transcription vector comprises an isolated nucleic acid molecule encoding the measles virus of claim 5, and the at least one expression vector comprises at least one isolated nucleic acid molecule encoding the trans-acting proteins N, P and L.
35. The composition according to claim 34, wherein the transcription vector comprises an isolated nucleic acid molecule encoding the measles virus of claim 6.
36. The composition according to claim 33, wherein the transcription vector comprises an isolated nucleic acid molecule encoding the PIV-3 of claim 8, and the at least one expression vector comprises at least one isolated nucleic acid molecule encoding the trans-acting proteins NP, P and L.
37. The composition according to claim 36, wherein the transcription vector comprises an isolated nucleic acid molecule encoding the PIV-3 of claim 9.
38. The composition according to claim 33, wherein the transcription vector comprises an isolated nucleic acid molecule encoding the RSV subgroup B of claim 13, and the at least one expression vector comprises at least one isolated nucleic acid molecule encoding the trans-acting proteins N, P, L and M2.
39. The composition according to claim 38, wherein the transcription vector comprises an isolated nucleic acid molecule encoding the RSV subgroup B virus of claim 14.
40. A method for producing an infectious, attenuated, nonsegmented, negative-sense, single-stranded RNA virus of the order Mononegavirales, which comprises transforming or transfecting host cells with the at least two vectors of claim 33 and culturing the host cells under conditions that permit the co-expression of these vectors, so as to produce the infectious attenuated virus.
41. The method according to claim 40, wherein the virus is the measles virus of claim 5.
42. The method according to claim 41, wherein the virus is the measles virus of claim 6.
43. The method according to claim 40, wherein the virus is the PIV-3 virus of claim 8.
44. The method according to claim 43, wherein the virus is the PIV-3 virus of claim 9.
45. The method according to claim 40, wherein the virus is the RSV subgroup B virus of claim 13.
46. The method according to claim 45, wherein the virus is the RSV subgroup B virus of claim 14.
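The residue-level changes recited in claim 9(b) lend themselves to a simple tabular encoding. The following Python is a minimal sketch and not part of the claims. It assumes one-letter amino acid codes and 1-based residue numbering in the polymerase protein. The names `PIV3_L_SUBSTITUTIONS`, `attenuating_substitutions_present` and `make_toy_protein` are hypothetical, and the toy sequences are alanine placeholders rather than real polymerase sequences.

```python
# Substitutions listed in claim 9(b): (residue number, wild-type, mutant).
PIV3_L_SUBSTITUTIONS = [
    (942, "Y", "H"),   # tyrosine -> histidine
    (992, "L", "F"),   # leucine -> phenylalanine
    (1292, "L", "F"),  # leucine -> phenylalanine
    (1558, "T", "I"),  # threonine -> isoleucine
]


def attenuating_substitutions_present(protein, substitutions=PIV3_L_SUBSTITUTIONS):
    """Return the residue numbers at which `protein` carries the mutant residue."""
    found = []
    for position, _wild_type, mutant in substitutions:
        if position <= len(protein) and protein[position - 1] == mutant:
            found.append(position)
    return found


def make_toy_protein(length, residues):
    """Build a placeholder alanine sequence with given residues at 1-based positions."""
    seq = ["A"] * length
    for position, amino_acid in residues.items():
        seq[position - 1] = amino_acid
    return "".join(seq)


# Placeholder sequences for illustration only (not real polymerase sequences).
wild_type_toy = make_toy_protein(1600, {942: "Y", 992: "L", 1292: "L", 1558: "T"})
mutant_toy = make_toy_protein(1600, {942: "H", 992: "L", 1292: "L", 1558: "I"})

print(attenuating_substitutions_present(wild_type_toy))
print(attenuating_substitutions_present(mutant_toy))
```

A sequence satisfies the "at least one" condition of claim 9(b) when the returned list is non-empty; the same pattern could be reused for the residues listed in claims 6(b) and 14(b) by swapping in a different table.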
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US2682396P | 1996-09-27 | 1996-09-27 | |
US60/026,823 | 1996-09-27 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1232504A (en) | 1999-10-20 |
Family
ID=21833976
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN97198321A (Pending), published as CN1232504A (en) | 1996-09-27 | 1997-09-19 | 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of order designated mononegavirales |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP0932684A2 (en) |
JP (1) | JP2000517194A (en) |
KR (1) | KR20000048628A (en) |
CN (1) | CN1232504A (en) |
AU (1) | AU4427897A (en) |
BR (1) | BR9712138A (en) |
CA (1) | CA2265554A1 (en) |
WO (1) | WO1998013501A2 (en) |
Families Citing this family (68)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6887699B1 (en) | 1990-05-22 | 2005-05-03 | Medimmune Vaccines, Inc. | Recombinant negative strand RNA virus expression systems and vaccines |
US6410023B1 (en) | 1997-05-23 | 2002-06-25 | United States Of America | Recombinant parainfluenza virus vaccines attenuated by deletion or ablation of a non-essential gene |
BR9812232A (en) * | 1997-09-19 | 2000-07-18 | American Cyanamid Co | Isolated, recombinantly generated, attenuated human respiratory syncytial virus (RSV) subgroup B, vaccine, process to immunize an individual to induce protection against RSV subgroup B, composition, process to produce attenuated infectious RSV subgroup B, and isolated nucleic acid molecule |
CA2323927A1 (en) * | 1998-03-26 | 1999-09-30 | American Cyanamid Company | Mutations responsible for attenuation in measles virus or human respiratory syncytial virus subgroup b |
DK1090108T3 (en) * | 1998-06-03 | 2011-04-26 | Wyeth Corp | New methods for salvaging RNA viruses |
AU748416B2 (en) | 1998-06-12 | 2002-06-06 | Andrej Egorov | Interferon inducing genetically engineered attenuated viruses |
CA2334895C (en) | 1998-06-12 | 2016-01-19 | Mount Sinai School Of Medicine | Attenuated negative strand viruses with altered interferon antagonist activity for use as vaccines and pharmaceuticals |
US6544785B1 (en) | 1998-09-14 | 2003-04-08 | Mount Sinai School Of Medicine Of New York University | Helper-free rescue of recombinant negative strand RNA viruses |
US6146642A (en) | 1998-09-14 | 2000-11-14 | Mount Sinai School Of Medicine, Of The City University Of New York | Recombinant new castle disease virus RNA expression systems and vaccines |
US6764685B1 (en) | 2000-03-21 | 2004-07-20 | Medimmune Vaccines, Inc. | Recombinant parainfluenza virus expression systems and vaccines |
WO2001077394A1 (en) | 2000-04-10 | 2001-10-18 | Mount Sinai School Of Medicine Of New York University | Screening methods for identifying viral proteins with interferon antagonizing functions and potential antiviral agents |
PT1292615E (en) * | 2000-06-23 | 2007-01-31 | Wyeth Corp | Modified morbillivirus v proteins |
US7361496B1 (en) | 2000-08-02 | 2008-04-22 | Wyeth | Rescue of mumps virus from cDNA |
CN101921732A (en) | 2001-01-19 | 2010-12-22 | ViroNovative | A virus causing respiratory tract illness in susceptible mammals |
US8715922B2 (en) | 2001-01-19 | 2014-05-06 | ViroNovative | Virus causing respiratory tract illness in susceptible mammals |
AU2003219839B2 (en) | 2002-02-21 | 2008-02-21 | Medimmune, Llc | Recombinant parainfluenza virus expression systems and vaccines comprising heterologous antigens derived from metapneumovirus |
US7465456B2 (en) | 2002-04-26 | 2008-12-16 | Medimmune, Llc | Multi plasmid system for the production of influenza virus |
CN103540568A (en) | 2002-04-26 | 2014-01-29 | MedImmune, LLC | Multi plasmid system for the production of influenza virus |
DE60233038D1 (en) * | 2002-06-20 | 2009-09-03 | Pasteur Institut | Infectious cDNA of an approved measles virus vaccine strain. Use in immunogenic compositions |
EP1375670B1 (en) | 2002-06-20 | 2013-06-12 | Institut Pasteur | Recombinant measles viruses expressing epitopes of antigens of RNA viruses and use of the recombinant viruses for the preparation of vaccine compositions |
CA2432738A1 (en) * | 2003-02-26 | 2004-08-26 | Philippe Despres | New dengue and west nile viruses proteins and genes coding the foregoing, and their use in vaccinal, therapeutic and diagnostic applications |
US7572904B2 (en) | 2003-03-28 | 2009-08-11 | Medimmune, Llc | Nucleic acids encoding respiratory syncytial virus subgroup B strain 9320 |
EP2494986A1 (en) | 2003-04-25 | 2012-09-05 | MedImmune Vaccines, Inc. | Metapneumovirus strains and their use in vaccine formulations and as vectors for expression of antigenic sequences and methods for propagating virus |
US7566458B2 (en) | 2003-06-16 | 2009-07-28 | Medimmune, Llc | Influenza hemagglutinin and neuraminidase variants |
WO2005062820A2 (en) | 2003-12-23 | 2005-07-14 | Medimmune Vaccines, Inc | Multi plasmid system for the production of influenza virus |
JP4980895B2 (en) | 2004-05-25 | 2012-07-18 | MedImmune, LLC | Influenza hemagglutinin and neuraminidase variants |
CN102727880A (en) | 2004-06-01 | 2012-10-17 | Mount Sinai School of Medicine | Genetically engineered swine influenza virus and uses thereof |
EP1855713B1 (en) | 2005-02-15 | 2016-04-27 | Mount Sinai School of Medicine | Genetically engineered equine influenza virus and uses thereof |
CA2600730C (en) | 2005-03-08 | 2014-11-25 | Medimmune, Inc. | Influenza hemagglutinin and neuraminidase variants |
AU2006262380A1 (en) | 2005-06-21 | 2007-01-04 | Medimmune, Llc | Methods and compositions for expressing a heterologous protease |
US7790434B2 (en) | 2005-06-21 | 2010-09-07 | Medimmune, Llc | Methods and compositions for expressing negative-sense viral RNA in canine cells |
KR101492643B1 (en) | 2005-12-02 | 2015-02-12 | Icahn School of Medicine at Mount Sinai | Chimeric viruses presenting non-native surface proteins and uses thereof |
MX2008013388A (en) | 2006-04-19 | 2009-03-02 | Medimmune Llc | Methods and compositions for expressing negative-sense viral rna in canine cells. |
WO2008133701A1 (en) | 2006-07-21 | 2008-11-06 | Medimmune, Llc. | Methods and compositions for increasing replication capacity of an influenza virus |
CN101983069B (en) | 2006-08-09 | 2014-07-16 | MedImmune, LLC | Influenza hemagglutinin and neuraminidase variants |
EP2099903A4 (en) * | 2006-12-22 | 2010-07-28 | Penn State Res Found | Modified polymerases and attenuated viruses and methods of use thereof |
JP5666905B2 (en) | 2007-06-18 | 2015-02-12 | MedImmune, LLC | Influenza B virus having alterations in hemagglutinin polypeptide |
CA2730408A1 (en) | 2008-07-11 | 2010-01-14 | Chin-Fen Yang | Influenza hemagglutinin and neuraminidase variants |
WO2010053986A1 (en) | 2008-11-05 | 2010-05-14 | Wyeth | Multicomponent immunogenic composition for the prevention of beta-hemolytic streptococcal (bhs) disease |
ES2550179T3 (en) | 2009-02-05 | 2015-11-05 | Icahn School Of Medicine At Mount Sinai | Chimeric Newcastle disease viruses and uses thereof |
CN102361649A (en) | 2009-02-12 | 2012-02-22 | MedImmune, LLC | Influenza hemagglutinin and neuraminidase variants |
JP2012521786A (en) | 2009-03-30 | 2012-09-20 | Mount Sinai School of Medicine | Influenza virus vaccine and use thereof |
WO2011014504A1 (en) | 2009-07-27 | 2011-02-03 | Mount Sinai School Of Medicine Of New York University | Recombinant influenza virus vectors and uses thereof |
EP2459585A1 (en) | 2009-07-30 | 2012-06-06 | Mount Sinai School of Medicine | Influenza viruses and uses thereof |
US9708373B2 (en) | 2010-03-30 | 2017-07-18 | Icahn School Of Medicine At Mount Sinai | Influenza virus vaccine and uses thereof |
EP2420242A1 (en) | 2010-08-20 | 2012-02-22 | Lauer, Ulrich M. | Oncolytic measles virus |
CN103906843B (en) * | 2011-06-08 | 2016-12-07 | Vishwas Joshi | Two plasmid mammalian expression system |
US9441205B2 (en) | 2011-06-08 | 2016-09-13 | Vishwas Joshi | Two plasmid mammalian expression system |
BR112014006694A2 (en) | 2011-09-20 | 2020-11-17 | Mount Sinai School Of Medicine | influenza vaccines and uses of these |
US20150224181A1 (en) | 2012-09-14 | 2015-08-13 | The United States Of America As Represented By The Secretary Department Of Health And Human Se | Brachyury protein, non-poxvirus non-yeast vectors encoding brachyury protein, and their use |
CN105263516A (en) | 2012-12-18 | 2016-01-20 | Icahn School of Medicine at Mount Sinai | Influenza virus vaccines and uses thereof |
US20160015760A1 (en) | 2013-03-14 | 2016-01-21 | Icahn School Of Medicine At Mount Sinai | Newcastle disease viruses and uses thereof |
WO2014159960A1 (en) | 2013-03-14 | 2014-10-02 | Icahn School Of Medicine At Mount Sinai | Antibodies against influenza virus hemagglutinin and uses thereof |
US20170000832A1 (en) | 2014-02-27 | 2017-01-05 | Viralytics Limited | Combination method for treatment of cancer |
CA2974699A1 (en) | 2015-01-23 | 2016-07-28 | Icahn School Of Medicine At Mount Sinai | Influenza virus vaccination regimens |
WO2016137929A1 (en) | 2015-02-26 | 2016-09-01 | Boehringer Ingelheim Vetmedica Gmbh | Bivalent swine influenza virus vaccine |
WO2017024000A1 (en) | 2015-08-03 | 2017-02-09 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Brachyury deletion mutants, non-yeast vectors encoding brachyury deletion mutants, and their use |
WO2017031408A1 (en) | 2015-08-20 | 2017-02-23 | University Of Rochester | Single-cycle virus for the development of canine influenza vaccines |
US10973903B2 (en) | 2015-08-20 | 2021-04-13 | University Of Rochester | NS1 truncated virus for the development of canine influenza vaccines |
AU2016308917A1 (en) | 2015-08-20 | 2018-03-15 | Cornell University | Live-attenuated vaccine having mutations in viral polymerase for the treatment and prevention of canine influenza virus |
EP3463439B1 (en) | 2016-06-03 | 2022-08-03 | University of Rochester | Equine influenza virus live-attenuated vaccines |
CA3023143A1 (en) | 2016-06-15 | 2017-12-21 | Icahn School Of Medicine At Mount Sinai | Influenza virus hemagglutinin proteins and uses thereof |
US11254733B2 (en) | 2017-04-07 | 2022-02-22 | Icahn School Of Medicine At Mount Sinai | Anti-influenza B virus neuraminidase antibodies and uses thereof |
JOP20190256A1 (en) | 2017-05-12 | 2019-10-28 | Icahn School Med Mount Sinai | Newcastle disease viruses and uses thereof |
CN111989116A (en) | 2018-02-27 | 2020-11-24 | University of Rochester | Multivalent attenuated live influenza vaccine for prevention and control of Equine Influenza Virus (EIV) |
US11166996B2 (en) | 2018-12-12 | 2021-11-09 | Flagship Pioneering Innovations V, Inc. | Anellovirus compositions and methods of use |
WO2020176709A1 (en) | 2019-02-27 | 2020-09-03 | University Of Rochester | Multivalent live-attenuated influenza vaccine for prevention and control of equine influenza virus (eiv) in horses |
US11103576B1 (en) | 2020-06-15 | 2021-08-31 | University Of Pittsburgh - Of The Commonwealth System Of Higher Education | Measles virus vaccine expressing SARS-COV-2 protein(s) |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU7007491A (en) * | 1990-02-02 | 1991-08-08 | Schweiz. Serum- & Impfinstitut Bern | Cdna corresponding to the genome of negative-strand rna viruses, and process for the production of infectious negative-strand rna viruses |
JP3045581B2 (en) * | 1991-10-14 | 2000-05-29 | The Kitasato Institute | Measles vaccine virus strain identification method |
EP0636172B1 (en) * | 1992-04-14 | 2005-08-17 | The Mount Sinai School of Medicine of the City University of New York | Genetically engineered attenuated viruses |
IL105456A (en) * | 1992-04-21 | 1996-12-05 | American Home Prod | Attenuated respiratory syncytial virus vaccine compositions |
TW275632B (en) * | 1992-04-21 | 1996-05-11 | American Cyanamid Co | |
ES2210273T5 (en) * | 1994-07-18 | 2010-03-29 | Conzelmann, Karl-Klaus, Prof. Dr. | VIRUS WITH NEGATIVE CHAIN NON-SEGMENTED RECOMBINANT INFECTIVE. |
-
1997
- 1997-09-19 CA CA002265554A patent/CA2265554A1/en not_active Abandoned
- 1997-09-19 EP EP97942613A patent/EP0932684A2/en not_active Withdrawn
- 1997-09-19 BR BR9712138-0A patent/BR9712138A/en not_active Application Discontinuation
- 1997-09-19 KR KR1019990702569A patent/KR20000048628A/en not_active Application Discontinuation
- 1997-09-19 WO PCT/US1997/016718 patent/WO1998013501A2/en not_active Application Discontinuation
- 1997-09-19 JP JP10515749A patent/JP2000517194A/en not_active Ceased
- 1997-09-19 CN CN97198321A patent/CN1232504A/en active Pending
- 1997-09-19 AU AU44278/97A patent/AU4427897A/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
JP2000517194A (en) | 2000-12-26 |
BR9712138A (en) | 2000-01-18 |
WO1998013501A2 (en) | 1998-04-02 |
EP0932684A2 (en) | 1999-08-04 |
WO1998013501A3 (en) | 1998-08-13 |
AU4427897A (en) | 1998-04-17 |
CA2265554A1 (en) | 1998-04-02 |
KR20000048628A (en) | 2000-07-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1232504A (en) | 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of order designated mononegavirales | |
CN1273603A (en) | Attenuated respiratory syncytial viruses | |
CN1250725C (en) | Prodn of attenuated parainfluenza virus vaccines from cloned nucleotide sequences | |
CN101012454B (en) | Production of attenuated chimeric respiratory syncytial virus vaccines from cloned nucleotide sequences | |
US6664066B2 (en) | Modified Morbillivirus V proteins | |
AU2020203460B2 (en) | Attenuation of human respiratory syncytial virus by genome scale codon-pair deoptimization | |
US7192593B2 (en) | Use of recombinant parainfluenza viruses (PIVs) as vectors to protect against infection and disease caused by PIV and other human pathogens | |
CN1347453A (en) | Use of recombinant parainfluenza viruses (PIVs) as vectors to protect against infection and disease caused by PIV and other human pathogens | |
CN1347458A (en) | Production of attenuated negative stranded RNA virus vaccines from cloned nucleotide sequences | |
AU2001267014A1 (en) | Modified morbillivirus V proteins | |
WO1999015631A1 (en) | Recombinant rsv virus expression systems and vaccines | |
CN1364195A (en) | Production of attenuated chimeric respiratory syncytial virus vaccines from cloned nucleotide sequences | |
CN1402792A (en) | Production of attenuated, human-bovine chimeric respiratory syncytial virus vaccines | |
KR20110063863A (en) | Live, attenuated respiratory syncytial virus | |
CN1370237A (en) | Recombinant parainfluenza virus vaccines attenuated by deletion or ablation of non-essential gene | |
CN113293149A (en) | Construction of F gene replaced chimeric measles attenuated strain | |
CN1177927C (en) | Mutations responsible for attenuation in measles virus or human respiratory syncytial virus subgroup B | |
CN1369011A (en) | Attenuated human-bovine chimeric parainfluenza virus (PIV) vaccines | |
WO2013154728A1 (en) | Genetically stable live attenuated respiratory syncytial virus vaccine and its production | |
CN1224462A (en) | Production of attenuated respiratory syncytial virus vaccines from cloned nucleotide sequences | |
CN1468301A (en) | Respiratory syncytial virus vaccines expressing protective antigens from promotor-proximal genes | |
AU8933001A (en) | 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of the order designated mononegavirales | |
MXPA00009256A (en) | Mutations responsible for attenuation in measles virus or human respiratory syncytial virus subgroup b | |
Takeuchi et al. | Toward understanding the pathogenicity of wild-type measles virus by reverse genetics | |
AU5592201A (en) | Production of attenuated respiratory syncytial virus vaccines from cloned nucleotide sequences |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |