CN1318103A - Nucleic acids and proteins from streptococcus pneumoniae - Google Patents
- Publication number
- CN1318103A CN99810978A
- Authority
- CN
- China
- Prior art keywords
- sequence
- protein
- polypeptide
- streptococcus pneumoniae
- dna
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/315—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci
- C07K14/3156—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci from Streptococcus pneumoniae (Pneumococcus)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P11/00—Drugs for disorders of the respiratory system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Veterinary Medicine (AREA)
- Pulmonology (AREA)
- General Chemical & Material Sciences (AREA)
- Pharmacology & Pharmacy (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Animal Behavior & Ethology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Public Health (AREA)
- Oncology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Communicable Diseases (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
Abstract
Novel proteins from Streptococcus pneumoniae are described, together with nucleic acid sequences encoding them. Their use in vaccines and in screening methods is also described.
Description
The present invention relates to proteins derived from Streptococcus pneumoniae, to nucleic acid molecules encoding such proteins, to the use of the nucleic acids and/or proteins as antigens/immunogens and in detection/diagnosis, and to methods for screening the proteins/nucleic acid sequences as potential antimicrobial targets.
Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an important pathogenic organism. The continuing significance of Streptococcus pneumoniae infections in human disease, in both developing and developed countries, has been authoritatively reviewed (Fiber, G.R., Science, 265:1385-1387 (1994)). On a global scale this organism is believed to be the most common bacterial cause of acute respiratory infections, and it is estimated to cause 1 million childhood deaths each year, mostly in developing countries (Stansfield, S.K., Pediatr. Infect. Dis., 6:622 (1987)). In the USA it has been suggested (Breiman et al., Arch. Intern. Med., 150:1401 (1990)) that the pneumococcus is still the most common cause of bacterial pneumonia, and that disease rates are particularly high in young children, in the elderly, and in patients with predisposing conditions such as asplenia, heart, lung and kidney disease, diabetes or alcoholism, or with immunosuppressive disorders such as AIDS. These groups are at higher risk of pneumococcal septicaemia and hence of meningitis, and therefore have a greater risk of dying from pneumococcal infection. The pneumococcus is also the leading cause of otitis media and sinusitis, which remain prevalent infections among children in developed countries and which incur substantial costs.
The need for effective preventative strategies against pneumococcal infection has been made more urgent by the recent emergence of penicillin-resistant pneumococci. It has been reported that, in 13 hospitals in 12 US states, 6.6% of pneumococcal isolates were resistant to penicillin, and that some isolates were also resistant to other antibiotics, including third-generation cephalosporins (Schappert, S.M., Vital and Health Statistics of the Centers for Disease Control/National Center for Health Statistics, 214:1 (1992)). In some hospitals the rate of penicillin resistance is higher (up to 20%) (Breiman et al., J. Am. Med. Assoc., 271:1831 (1994)). Because penicillin resistance in pneumococci has emerged only recently, after decades in which penicillin remained effective, these findings are alarming.
For the reasons given above, there is a clear need for improved methods of preventing, controlling, diagnosing or treating pneumococcal disease.
Several approaches have been taken to provide vaccines for the prevention of pneumococcal infection. Difficulties arise because the pneumococcus can be subdivided into many serotypes (at least 90) according to the structure of the polysaccharide capsule surrounding the organism. Vaccines against individual serotypes are not effective against other serotypes, which means that a vaccine must include polysaccharide antigens from a whole range of serotypes in order to be effective in the majority of cases. A further problem is that capsular polysaccharides (each of which determines the serotype and is the major protective antigen), when purified and used as a vaccine, do not reliably induce protective antibody responses in children under two years of age, which is precisely the age group with the highest incidence of invasive pneumococcal infection and meningitis.
A modification of the capsular antigen approach relies on conjugating the polysaccharide to a protein in order to obtain an enhanced immune response, in particular by giving the response T-cell-dependent character. This approach has been used, for example, in the development of a vaccine against Haemophilus influenzae. The cost of both multi-polysaccharide vaccines and conjugate-based vaccines nevertheless remains a matter of concern.
A third approach is to look for other antigenic components that have the potential to be vaccine candidates, and this forms the basis of the present invention. Using a specially developed bacterial expression system, we have been able to identify a group of pneumococcal protein antigens that are either associated with the bacterial envelope or secreted.
Thus, in a first aspect, the present invention provides a Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in Table 1.
In a second aspect, the present invention provides a Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in Table 2.
A protein or polypeptide of the present invention may be provided in substantially pure form. For example, it may be provided in a form that is substantially free of other proteins.
As discussed herein, the proteins and polypeptides of the invention are useful as antigenic material. Such material can be "antigenic" and/or "immunogenic". Generally, "antigenic" means that the protein or polypeptide can be used to raise antibodies, or indeed can induce an antibody response in a subject. "Immunogenic" means that the protein or polypeptide can elicit a protective immune response in a subject. Thus, in the latter case, the protein or polypeptide may generate not only an antibody response but also immune responses that are not antibody-based.
The skilled person will appreciate that homologues or derivatives of the proteins or polypeptides of the invention will also find use in the context of the present invention, i.e. as antigenic/immunogenic material. Thus, for instance, proteins or polypeptides that include one or more additions, deletions, substitutions or the like are encompassed by the present invention. In addition, it may be possible to replace one amino acid with another of a similar "type", for instance replacing one hydrophobic amino acid with another.
A program such as CLUSTAL can be used to compare amino acid sequences. This program compares amino acid sequences and finds the optimal alignment by inserting spaces in either sequence as appropriate. Amino acid identity or similarity (identity plus conservation of amino acid type) can then be calculated for the optimal alignment. A program such as BLASTx will align the longest stretch of similar sequences and assign a value to the fit. It is thus possible to obtain a comparison in which several regions of similarity are found, each having a different score. Both types of identity analysis are contemplated in the present invention.
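As a rough illustration of the identity and similarity calculation described above, the following Python sketch scores two sequences that have already been aligned (for example by CLUSTAL). The grouping of amino acids into types, the use of "-" for an inserted space, and the choice of alignment columns as the denominator are assumptions made for this example; they are not taken from this specification.

```python
# Illustrative amino acid "types" (an assumption; other groupings are possible).
AMINO_ACID_TYPES = (
    frozenset("AVLIMFWC"),  # hydrophobic
    frozenset("STNQYGP"),   # polar or small
    frozenset("KRH"),       # basic
    frozenset("DE"),        # acidic
)


def same_type(a, b):
    """Return True if both residues fall in the same illustrative group."""
    return any(a in group and b in group for group in AMINO_ACID_TYPES)


def identity_and_similarity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, percent similarity) for two aligned sequences.

    Similarity counts identical residues plus residues of the same type.
    Columns where only one sequence has a gap count towards the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identical = similar = columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column carries no information
        columns += 1
        if a == gap or b == gap:
            continue  # a gap is neither identical nor similar
        if a == b:
            identical += 1
            similar += 1
        elif same_type(a, b):
            similar += 1
    if columns == 0:
        raise ValueError("no alignable columns")
    return 100.0 * identical / columns, 100.0 * similar / columns
```

For the toy alignment `identity_and_similarity("MKV-LD", "MRVALE")`, three of the six columns are identical and five of the six are similar under the grouping assumed here.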
In the case of homologues and derivatives, the degree of identity with a protein or polypeptide described herein is less important than the requirement that the homologue or derivative retain the antigenicity or immunogenicity of the original protein or polypeptide. Suitably, however, homologues or derivatives having at least 60% similarity (as discussed above) with the proteins or polypeptides described herein are provided. Preferably, homologues or derivatives having at least 70% similarity, more preferably at least 80% similarity, are provided. Most preferably, homologues or derivatives having at least 90% or even 95% similarity are provided.
In an alternative approach, the homologues or derivatives may be fusion proteins that incorporate a moiety which makes purification easier, for example by effectively tagging the desired protein or polypeptide. It may be necessary to remove the "tag"; alternatively, the fusion protein itself may retain sufficient antigenicity to be useful.
In a further aspect of the invention, there are provided antigenic/immunogenic fragments of the proteins or polypeptides of the invention, or of homologues or derivatives thereof.
For fragments of the proteins or polypeptides described herein, or of homologues or derivatives thereof, the situation is slightly different. It is well known that antigenic proteins or polypeptides can be screened to identify epitopic regions, i.e. the regions responsible for the antigenicity or immunogenicity of the protein or polypeptide. Methods for carrying out such screening are well known in the art. The fragments of the present invention should therefore include one or more such epitopic regions, or be sufficiently similar to such regions to retain their antigenic/immunogenic properties. For fragments according to the present invention the degree of identity is therefore perhaps irrelevant, since they may be 100% identical to a particular part of a protein or polypeptide, homologue or derivative described herein. Again, the key issue is that the fragment retains the antigenic/immunogenic properties.
Thus, what is important for homologues, derivatives and fragments is that they possess at least some of the antigenicity/immunogenicity of the protein or polypeptide from which they are derived.
Gene cloning techniques may be used to provide a protein of the invention in substantially pure form. Such techniques are disclosed, for example, in Sambrook et al., Molecular Cloning, 2nd Edition, Cold Spring Harbor Laboratory Press (1989).
Thus, in a third aspect, the present invention provides a nucleic acid molecule comprising or consisting of a sequence which is:
(i) any of the DNA sequences shown in Table 1 or their RNA equivalents;
(ii) a sequence which is complementary to any of the sequences of (i);
(iii) a sequence which codes for the same protein or polypeptide as the sequences of (i) or (ii);
(iv) a sequence which is substantially identical to any of the sequences of (i), (ii) and (iii);
(v) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 1.
In a fourth aspect, the present invention provides a nucleic acid molecule comprising or consisting of a sequence which is:
(i) any of the DNA sequences shown in Table 2 or their RNA equivalents;
(ii) a sequence which is complementary to any of the sequences of (i);
(iii) a sequence which codes for the same protein or polypeptide as the sequences of (i) or (ii);
(iv) a sequence which is substantially identical to any of the sequences of (i), (ii) and (iii);
(v) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 2.
The nucleic acid molecules of the invention may include a plurality of such sequences and/or fragments. The skilled person will appreciate that the present invention can include novel variants of the particular novel nucleic acid molecules exemplified herein. Such variants are encompassed by the present invention. They may occur in nature, for example as a result of strain variation, and include, for example, additions, substitutions and/or deletions. In addition, and particularly when microbial expression systems are used, one may wish to engineer the nucleic acid sequence to make use of the known preferred codon usage of the particular organism used for expression. Synthetic or non-naturally occurring variants are therefore also included within the scope of the invention.
The term "RNA equivalent" as used above indicates that a given RNA molecule has a sequence which is complementary to that of a given DNA molecule (allowing for the fact that in RNA "U" replaces "T" in the genetic code).
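The definition above can be sketched in Python as follows. The function name and the convention of writing the resulting RNA in the 5' to 3' direction are assumptions made for illustration only.

```python
# Base pairing used to build an RNA strand complementary to a DNA strand;
# "U" takes the place of "T" in the RNA.
DNA_TO_COMPLEMENTARY_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}


def rna_equivalent(dna):
    """Return the RNA complementary to a DNA sequence, written 5' to 3'.

    The input is read 5' to 3'; reversing it before pairing keeps the
    output in the same orientation (an assumption of this sketch).
    """
    try:
        return "".join(DNA_TO_COMPLEMENTARY_RNA[base] for base in reversed(dna.upper()))
    except KeyError as exc:
        raise ValueError(f"unexpected base in DNA sequence: {exc.args[0]!r}") from None
```

For example, `rna_equivalent("ATGC")` returns `"GCAU"`.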
When nucleic acid sequences are compared to determine the degree of homology or identity, programs such as BESTFIT and GAP (both from the Wisconsin Genetics Computer Group (GCG) software package) can be used. BESTFIT, for example, compares two sequences and produces an optimal alignment of the most similar segments. GAP enables sequences to be aligned along their whole length and finds the optimal alignment by inserting spaces in either sequence as appropriate. In the context of the present invention, when the identity of nucleic acid sequences is discussed, the comparison is suitably made by aligning the sequences along their whole length.
Preferably, sequences which are substantially identical have at least 50% sequence identity with said sequences, more preferably at least 75% sequence identity, and still more preferably at least 90% or at least 95% sequence identity. In some cases the sequence identity may be 99% or above.
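The graded preferences above amount to a simple threshold lookup. The Python sketch below returns the highest stated threshold that a given percentage identity meets; the function and constant names are hypothetical, and the percentage itself is assumed to come from a full-length alignment produced elsewhere.

```python
# Identity thresholds stated in the text, highest first.
IDENTITY_TIERS = (99.0, 95.0, 90.0, 75.0, 50.0)


def highest_identity_tier(percent_identity):
    """Return the highest threshold met, or None if below 50%."""
    if not 0.0 <= percent_identity <= 100.0:
        raise ValueError("percent identity must lie between 0 and 100")
    for tier in IDENTITY_TIERS:
        if percent_identity >= tier:
            return tier
    return None
```

A sequence with 92.5% identity would therefore fall in the "at least 90%" tier, while one with 40% identity meets none of the stated thresholds.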
Desirably, the term "substantially identical" indicates that the sequence in question has a greater degree of identity with any of the sequences described herein than with prior art nucleic acid sequences.
It should be noted, however, that where a nucleic acid sequence of the present invention codes for at least part of a novel gene product, the present invention includes within its scope all possible sequences coding for that gene product or for a novel part thereof.
The nucleic acid molecule may be in isolated or recombinant form. It may be incorporated into a vector, and the vector may be incorporated into a host. Such vectors and suitable hosts form further aspects of the present invention.
Thus, for example, genes of Streptococcus pneumoniae can be identified by using probes based on the nucleic acid sequences provided herein. These genes can be excised using restriction enzymes and cloned into a vector, and the vector can then be introduced into a suitable host for expression.
Nucleic acid molecules of the present invention may be obtained from Streptococcus pneumoniae by the use of appropriate probes complementary to part of the sequence of the nucleic acid molecule. Restriction enzymes or sonication techniques can be used to obtain fragments of a suitable size for use as probes.
Alternatively, PCR techniques may be used to amplify a desired nucleic acid sequence. The sequence data provided herein can thus be used to design two primers for use in PCR, so that a desired sequence, including a whole gene or a fragment thereof, can be targeted and then amplified to a high degree.
Primers will typically be at least 15 to 25 nucleotides long.
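As a minimal sketch of how a primer pair flanking a target might be derived from sequence data, the Python code below takes the forward primer from the start of the target and the reverse primer as the reverse complement of its end. The function names, the default length of 20 and the strict 15 to 25 nucleotide check are assumptions for illustration; practical primer design would also consider melting temperature, secondary structure and specificity, none of which is addressed here.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5' to 3')."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def design_primer_pair(target, length=20):
    """Return (forward, reverse) primers flanking the target sequence.

    The forward primer matches the first `length` bases of the target;
    the reverse primer is the reverse complement of the last `length` bases.
    """
    if not 15 <= length <= 25:
        raise ValueError("primer length expected to be 15 to 25 nucleotides")
    if len(target) < 2 * length:
        raise ValueError("target too short for two non-overlapping primers")
    target = target.upper()
    forward = target[:length]
    reverse = reverse_complement(target[-length:])
    return forward, reverse
```

Both primers are returned 5' to 3', which is how primer sequences are normally written when ordered for synthesis.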
As a further alternative, chemical synthesis may be used, and this may be automated. Relatively short sequences may be chemically synthesised and ligated together to form a longer sequence.
A further group of proteins from Streptococcus pneumoniae has been identified using the bacterial expression system described herein. These are known Streptococcus pneumoniae proteins which have not previously been identified as antigenic proteins. The amino acid sequences of this group of proteins, together with the DNA sequences encoding them, are shown in Table 3. These proteins, or homologues, derivatives and/or fragments thereof, also find use as antigens/immunogens. Thus, in another aspect, the present invention provides the use as an immunogen/antigen of a protein or polypeptide having a sequence selected from those shown in Tables 1-3, or of a homologue, derivative and/or fragment thereof.
In a further aspect, the present invention provides an immunogenic/antigenic composition comprising one or more proteins or polypeptides having a sequence selected from those shown in Tables 1-3, or homologues or derivatives thereof, and/or fragments of any of these. In preferred embodiments, the immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay.
In the case of vaccines, suitable additional excipients, diluents, adjuvants or the like may be included. Numerous examples of these are well known in the art.
The nucleic acid sequences shown in Tables 1-3 can also be used to prepare so-called DNA vaccines. The present invention therefore also provides a vaccine composition comprising one or more nucleic acid sequences as defined herein. DNA vaccines have been described in the art (see, for instance, Donnelly et al., Ann. Rev. Immunol., 15:617-648 (1997)), and the skilled person can use such art-described techniques to produce and use DNA vaccines according to the present invention.
As already discussed, the proteins or polypeptides described herein, their homologues or derivatives, and/or fragments of any of these, can be used in methods of detecting/diagnosing Streptococcus pneumoniae. Such methods can be based on the detection of antibodies against such proteins that may be present in a subject. The present invention therefore provides a method for the detection/diagnosis of Streptococcus pneumoniae which comprises the step of bringing a sample to be tested into contact with at least one protein as described herein, or a homologue, derivative or fragment thereof. Suitably, the sample is a biological sample, such as a tissue sample, a blood sample or a saliva sample obtained from the subject to be tested.
In an alternative approach, the proteins described herein, or homologues, derivatives and/or fragments thereof, can be used to raise antibodies, which in turn can be used to detect the antigens and hence Streptococcus pneumoniae. Such antibodies form another aspect of the invention. Antibodies within the scope of the present invention may be monoclonal or polyclonal.
Polyclonal antibodies can be raised by stimulating their production in a suitable animal host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as described herein, or a homologue, derivative or fragment thereof, is injected into the animal. If necessary, an adjuvant may be administered together with the protein. Well-known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium hydroxide. The antibodies can then be purified by virtue of their binding to a protein as described herein.
Monoclonal antibodies can be produced from hybridomas. These can be formed by fusing myeloma cells with spleen cells that produce the desired antibody, in order to form an immortal cell line. The well-known Kohler & Milstein technique (Nature, 256 (1975)), or subsequent variations upon this technique, can therefore be used.
Techniques for producing monoclonal and polyclonal antibodies that bind to a particular polypeptide are now well developed in the art. They are discussed in standard immunology textbooks, for example Roitt et al., Immunology, 2nd Edition (1989), Churchill Livingstone, London.
In addition to whole antibodies, the present invention includes derivatives thereof which are capable of binding to proteins and the like as described herein. Thus, the present invention includes antibody fragments and synthetic constructs. Examples of antibody fragments and synthetic constructs are given by Dougall et al., Tibtech, 12:372-379 (September 1994).
Antibody fragments include, for example, Fab, F(ab')2 and Fv fragments. Fab fragments are discussed in Roitt et al. (supra). Fv fragments can be modified to produce a synthetic construct known as a single-chain Fv (scFv) molecule. This includes a peptide linker covalently joining the Vh and Vl regions, which contributes to the stability of the molecule. Other synthetic constructs that can be used include CDR peptides, which are synthetic peptides comprising antigen-binding determinants. Peptide mimetics may also be used. These molecules are usually conformationally restricted organic rings that mimic the structure of a CDR loop and include side chains that interact with the antigen.
Synthetic constructs include chimeric molecules. Thus, for example, humanised (or primatised) antibodies or derivatives thereof are within the scope of the present invention. An example of a humanised antibody is an antibody having human framework regions but rodent hypervariable regions. Methods of producing chimeric antibodies are discussed, for example, by Morrison et al., PNAS, 81, 6851-6855 (1984) and by Takeda et al., Nature, 314, 452-454 (1985).
Synthetic constructs also include molecules comprising an additional moiety that provides the molecule with some desirable property in addition to antigen binding. For example, the moiety may be a label (e.g. a fluorescent or radioactive label). Alternatively, it may be a pharmaceutically active agent.
Antibodies, or derivatives thereof, find use in the detection/diagnosis of Streptococcus pneumoniae. Thus, in another aspect, the present invention provides a method for the detection/diagnosis of Streptococcus pneumoniae which comprises the step of bringing a sample to be tested into contact with antibodies capable of binding to one or more proteins described herein, or to homologues, derivatives and/or fragments thereof.
In addition, so-called "Affibodies" may be used. These are binding proteins selected from combinatorial libraries of an alpha-helical bacterial receptor domain (Nord et al.). Small protein domains capable of specific binding to different target proteins can thus be selected using combinatorial library approaches.
It will also be apparent that the nucleic acid sequences described herein may be used to detect/diagnose Streptococcus pneumoniae. Thus, in yet a further aspect, the present invention provides a method for the detection/diagnosis of Streptococcus pneumoniae which comprises the step of bringing a sample to be tested into contact with at least one nucleic acid sequence as described herein. Suitably, the sample is a biological sample, such as a tissue sample, a blood sample or a saliva sample obtained from the subject to be tested. Such samples may be pre-treated before being used in the methods of the invention. For example, a sample may be treated to extract DNA. DNA probes based on the nucleic acid sequences described herein (typically fragments of such sequences) may then be used to detect nucleic acid from Streptococcus pneumoniae.
In further aspects, the present invention provides:
(a) a method of vaccinating a subject against Streptococcus pneumoniae, which comprises the step of administering to the subject a protein or polypeptide of the invention, or a derivative, homologue or fragment thereof, or an immunogenic composition of the invention;
(b) a method of vaccinating a subject against Streptococcus pneumoniae, which comprises the step of administering to the subject a nucleic acid molecule as defined herein;
(c) a method for the prophylaxis or treatment of Streptococcus pneumoniae infection, which comprises the step of administering to the subject a protein or polypeptide of the invention, or a derivative, homologue or fragment thereof, or an immunogenic composition of the invention;
(d) a method for the prophylaxis or treatment of Streptococcus pneumoniae infection, which comprises the step of administering to the subject a nucleic acid molecule as defined herein;
(e) a kit for use in detecting/diagnosing Streptococcus pneumoniae infection, comprising one or more proteins or polypeptides of the invention, or homologues, derivatives or fragments thereof, or an antigenic composition of the invention; and
(f) a kit for use in detecting/diagnosing Streptococcus pneumoniae infection, comprising one or more nucleic acid molecules as defined herein.
Given that we have identified a group of important proteins, these proteins are potential targets for antimicrobial therapy. It is necessary, however, to determine whether each individual protein is essential for the viability of the organism. The present invention therefore also provides a method of determining whether a protein or polypeptide as described herein represents a potential antimicrobial target, which comprises antagonising, inhibiting or otherwise interfering with the function or expression of said protein and determining whether Streptococcus pneumoniae is still viable.
A suitable method for inactivating the protein is to effect selected gene knockouts, i.e. to prevent expression of the protein and determine whether this results in a lethal change. Suitable methods for carrying out such gene knockouts are described in Li et al., P.N.A.S., 94:13251-13256 (1997) and Kolkman et al., 178:3736-3741 (1996).
In a final aspect, the present invention provides the use of an agent capable of antagonising, inhibiting or otherwise interfering with the function or expression of a protein or polypeptide of the invention in the manufacture of a medicament for use in the treatment or prophylaxis of Streptococcus pneumoniae infection.
As mentioned above, we have used a bacterial expression system as a means of identifying those proteins which are surface-associated, secreted or exported and which can therefore be used as antigens.
The genetic information necessary for the secretion/export of proteins has been extensively studied in bacteria. In the majority of cases, protein export requires a signal peptide at the N-terminus of the precursor protein, which directs it to the translocation machinery on the cytoplasmic membrane. During or after translocation, the signal peptide is removed by a membrane-associated signal peptidase. Ultimately, the localisation of the protein (i.e. whether it is secreted, is an integral membrane protein or is attached to the cell wall) is determined by sequences other than the leader peptide itself.
We are particularly interested in surface-located or exported proteins, since these are likely to be antigenic and therefore of use in vaccines, as diagnostic reagents or as targets for therapy with novel chemical entities. We have therefore developed a screening vector system in Lactococcus lactis that permits genes encoding exported proteins to be identified and isolated. A representative example is provided below, showing how given novel surface-associated proteins from Streptococcus pneumoniae have been identified and characterised. The screening vector incorporates, as a secretion reporter, the staphylococcal nuclease gene nuc lacking its own export signal. Staphylococcal nuclease is a naturally secreted, heat-stable, monomeric enzyme which has been efficiently expressed and secreted in Gram-positive bacteria (Shortle, Gene, 22:181-189 (1983); Kovacevic et al., J. Bacteriol., 162:521-528 (1985); Miller et al., J. Bacteriol., 169:3508-3514 (1987); Liebl et al., J. Bacteriol., 174:1854-1861 (1992); Le Loir et al., J. Bacteriol., 176:5135-5139 (1994); Poquet et al., J. Bacteriol., 180:1904-1912 (1998)).
Recently, Poquet et al. ((1998), supra) described a screening vector which incorporates the nuc gene lacking its own signal leader as a reporter to identify exported proteins in Gram-positive bacteria, and applied it to L. lactis. In addition to the ColE1 replicon, which promotes replication in Escherichia coli and certain other Gram-negative bacteria, this vector (pFUN) contains the pAMβ1 replicon, which functions in a broad host range of Gram-positive bacteria. Unique cloning sites present in the vector can be used to generate transcriptional and translational fusions between cloned genomic DNA fragments and the open reading frame of the truncated nuc gene, which is devoid of its own signal secretion leader. The nuc gene makes an ideal reporter gene because secretion of the nuclease can readily be detected using a simple and sensitive plate test: recombinant colonies secreting the nuclease develop a pink halo, whereas control colonies remain white (Shortle, 1983, supra; Le Loir et al., 1994, supra).
The invention will therefore now be described with reference to representative examples, which describe in detail how the proteins, polypeptides and nucleic acid sequences described herein were identified as antigenic targets.
We describe herein the construction of three reporter vectors and their use in L. lactis to identify and isolate genomic DNA fragments from Streptococcus pneumoniae encoding secreted or surface-associated proteins.
The invention will now be described with reference to the examples, which should not be construed as limiting the invention in any way. The examples refer to the figures, in which:
Fig. 1 shows the results of a number of DNA vaccine trials; and
Fig. 2 shows the results of further DNA vaccine trials.
Example 1
(i) Construction of the pTREP1-nuc series of reporter gene vectors
(a) Construction of expression plasmid pTREP1
The pTREP1 plasmid is a high-copy-number (40-80 copies per cell), theta-replicating Gram-positive plasmid. It is a derivative of the pTREX plasmid, which is itself a derivative of the previously published pIL253 plasmid. pIL253 incorporates the broad Gram-positive host range replicon of pAMβ1 (Simon and Chopin, Biochimie, 70:559-567 (1988)) and cannot be mobilised by the L. lactis sex factor. pIL253 also lacks the tra function, which is necessary for transfer or efficient mobilisation by conjugative parent plasmids such as pIL501. The enterococcal pAMβ1 replicon has previously been transferred to various species, including Streptococcus, Lactobacillus and Bacillus species as well as Clostridium acetobutylicum (Oultram and Klaenhammer, FEMS Microbiology Letters, 27:129-134 (1985); Gibson et al., (1979); LeBlanc et al., Proceedings of the National Academy of Sciences USA, 75:3484-3487 (1978)), indicating its potential broad host range utility. The pTREP1 plasmid is a constitutive transcription vector.
The pTREX vector was constructed as follows. An artificial DNA fragment containing a putative RNA stabilising sequence, a translation initiation region (TIR), a multiple cloning site for insertion of target genes and a transcription terminator was created by annealing two complementary oligonucleotides and extending with Tfl DNA polymerase. The sense and antisense oligonucleotides contained the recognition sites for NheI and BamHI respectively at their 5' ends to facilitate cloning. This fragment was cloned between the XbaI and BamHI sites of pUC19NT7, a derivative of pUC19 which contains the T7 expression cassette from pLET1 (Wells et al., J. Appl. Bacteriol., 74:629-636 (1993)) cloned between the EcoRI and HindIII sites. The resulting construct was designated pUCLEX. The complete expression cassette of pUCLEX was then removed by cutting with HindIII and blunting, followed by cutting with EcoRI, and was cloned into the EcoRI and SacI (blunted) sites of pIL253 to generate the vector pTREX (Wells and Schofield, in Current Advances in Metabolism, Genetics and Applications, NATO ASI Series, H 98:37-62 (1996)). The putative RNA stabilising sequence and TIR are derived from the Escherichia coli T7 bacteriophage sequence and were modified at one nucleotide position to enhance the complementarity of the Shine-Dalgarno (SD) motif to the ribosomal 16S RNA of Lactococcus lactis (Schofield et al., personal communication, Department of Pathology, University of Cambridge).
A Lactococcus lactis MG1363 chromosomal DNA fragment exhibiting promoter activity, subsequently designated P7, was cloned between the EcoRI and BglII sites present in the expression cassette, creating pTREX7. This active promoter region had previously been isolated using the promoter probe vector pSB292 (Waterfield et al., Gene, 165:9-15 (1995)). The promoter fragment was amplified by PCR using Vent DNA polymerase according to the manufacturer's instructions.
The pTREP1 vector was then constructed as follows. An artificial DNA fragment containing a transcription terminator, the forward pUC sequencing primer, a promoter multiple cloning site region and a universal translation stop sequence was created by annealing two overlapping, partially complementary synthetic oligonucleotides and extending with Sequenase according to the manufacturer's instructions. The sense and antisense oligonucleotides (pTREPF and pTREPR) contained the recognition sites for EcoRV and BamHI respectively at their 5' ends to facilitate cloning into pTREX7. The transcription terminator is that of the Bacillus penicillinase gene, which has been shown to be effective in Lactococcus as well (Jos et al., Applied and Environmental Microbiology, 50:540-542 (1985)). This terminator was considered necessary because expression of target genes in the pTREX vectors was observed to be leaky, which is thought to result from cryptic promoter activity in the origin region (Schofield et al., personal communication, Department of Pathology, University of Cambridge). The forward pUC sequencing primer was included to enable direct sequencing of cloned DNA fragments. The translation stop sequence, which encodes a stop codon in each of three different frames, was included to prevent translational fusions between vector genes and cloned DNA fragments. The pTREX7 vector was first digested with EcoRI and blunted using the 5'-3' polymerase activity of T4 DNA polymerase (NEB) according to the manufacturer's instructions. The EcoRI-digested and blunt-ended pTREX7 vector was then digested with BglII to remove the P7 promoter. The artificial DNA fragment derived from the annealed synthetic oligonucleotides was digested with EcoRV and BamHI and cloned into the EcoRI (blunted)-BglII digested pTREX7 vector to generate pTREP. A Lactococcus lactis MG1363 chromosomal promoter designated P1 was then cloned between the EcoRI and BglII sites present in the pTREP expression cassette, forming pTREP1. This promoter was also isolated using the promoter probe vector pSB292 and was characterised by Waterfield et al. ((1995), supra). The P1 promoter fragment was originally amplified by PCR using Vent DNA polymerase according to the manufacturer's instructions and cloned into pTREX as an EcoRI-BglII DNA fragment. The EcoRI-BglII fragment containing the P1 promoter was removed from pTREX1 by restriction enzyme digestion and used for cloning into pTREP (Schofield et al., personal communication, Department of Pathology, University of Cambridge).
(b) PCR amplification of the Staphylococcus aureus (S. aureus) nuc gene
The nucleotide sequence of the S. aureus nuc gene (EMBL database accession number V01281) was used to design synthetic oligonucleotide primers for PCR amplification. The primers were designed to amplify the mature form of the nuc gene, designated nucA, which is generated by proteolytic cleavage of the N-terminal 19 to 21 amino acids of the secreted propeptide, designated SNase B (Shortle (1983), supra). Three sense primers (nucS1, nucS2 and nucS3, Appendix I) were designed, each with a blunt-end restriction endonuclease cleavage site for EcoRV or Sma I in a different reading frame with respect to the nuc gene. In addition, Bgl II and BamH I sites were incorporated at the 5′ ends of the sense and antisense primers, respectively, to facilitate cloning into BamH I- and Bgl II-cut pTREP1. The sequences of all primers are given in Appendix I. Three nuc gene DNA fragments encoding the mature form of the nuclease (NucA) were amplified by PCR using each of the sense primers in combination with the antisense primer described above. The nuc gene fragments were amplified using S. aureus genomic DNA as template, Vent DNA polymerase (NEB) and the conditions recommended by the manufacturer. An initial denaturation at 93°C for 2 minutes was followed by 30 cycles of denaturation at 93°C for 45 seconds, annealing at 50°C for 45 seconds and extension at 73°C for 1 minute, and then by a final extension at 73°C for 5 minutes. The PCR products were purified using a Wizard clean-up column (Promega) to remove unincorporated nucleotides and primers.
(c) Construction of the pTREP1-nuc vectors
The purified nuc gene fragments described in section (b) were digested with Bgl II and BamH I under standard conditions and ligated into pTREP1 that had been cut with BamH I and Bgl II and dephosphorylated, producing the pTREP1-nuc1, pTREP1-nuc2 and pTREP1-nuc3 series of reporter vectors. General molecular biology techniques were carried out using the reagents and buffers supplied by the manufacturers or under standard conditions (Sambrook and Maniatis (1989), supra). In each pTREP1-nuc vector the expression cassette comprises a transcription terminator, the lactococcal promoter P1 and unique cloning sites (Bgl II, EcoRV or Sma I), followed by the mature form of the nuc gene and a second transcription terminator. Note that the sequences required for translation and secretion of the nuc gene were deliberately excluded from this construct. Such elements can only be provided by appropriately digested foreign DNA fragments (representing the target bacterium) cloned into the unique restriction sites immediately upstream of the nuc gene.
In possessing a promoter, the pTREP1-nuc vectors differ from the pFUN vector described by Poquet et al. (1998), supra, which was used to identify L. lactis exported proteins by screening directly for Nuc activity in L. lactis. Because the pFUN vector does not contain a promoter upstream of the nuc open reading frame, the cloned genomic DNA fragment must supply the signals for transcription in addition to the elements required for translation initiation and secretion of Nuc. This limitation can prevent the isolation of genes that are distant from a promoter, for example genes within polycistronic operons. Furthermore, there is no guarantee that promoters derived from other bacterial species will be recognised and functional in L. lactis. Some promoters may be under strict regulation in the natural host but not in L. lactis. In contrast, the presence of the P1 promoter in the pTREP1-nuc series of vectors ensures that promoterless DNA fragments (or DNA fragments containing promoter sequences that are inactive in L. lactis) will still be transcribed.
(d) Screening for secreted proteins in Streptococcus pneumoniae
Genomic DNA isolated from S. pneumoniae was digested with the restriction enzyme Tru9 I. This enzyme, which recognises the sequence 5′-TTAA-3′, was used because it cuts A/T-rich genomes efficiently and can generate random genomic DNA fragments within the preferred size range (usually averaging 0.5-1.0 kb). This size range was preferred because it increases the likelihood that the P1 promoter can be used to transcribe a novel gene sequence. However, the P1 promoter may not be necessary in all cases, since many streptococcal promoters are recognised in L. lactis. DNA fragments of different size ranges were purified from partial Tru9 I digests of S. pneumoniae genomic DNA. Because Tru9 I generates staggered ends, the DNA fragments had to be made blunt-ended before ligation to the EcoRV- or Sma I-cut pTREP1-nuc vectors. This was achieved by a partial fill-in reaction using the 5′-3′ polymerase activity of Klenow enzyme. Briefly, Tru9 I-digested DNA was dissolved in a solution (usually 10-20 μl in total) supplemented with T4 DNA ligase buffer (New England Biolabs; NEB) (1X) and 33 μM of each of the required dNTPs, in this case dATP and dTTP. Klenow enzyme was added (1 unit of Klenow enzyme (NEB) per μg of DNA) and the reaction was incubated at 25°C for 15 minutes. The reaction was stopped by incubating the mixture at 75°C for 20 minutes. EcoRV- or Sma I-digested pTREP-nuc plasmid DNA (usually 200-400 ng) was then added. The mixture was supplemented with 400 units of T4 DNA ligase (NEB) and T4 DNA ligase buffer (1X) and incubated overnight at 16°C. The ligation mixture was precipitated directly in 100% ethanol and 1/10 volume of 3 M sodium acetate (pH 5.2) and used to transform L. lactis MG1363 (Gasson, 1983). Alternatively, the gene cloning site of the pTREP-nuc vectors also contains a Bgl II site, which can be used to clone, for example, Sau3A I-digested genomic DNA fragments.
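The Tru9 I digestion step can be illustrated in silico. The following is only a minimal sketch, not part of the described method: the function name `tru9i_fragments` is hypothetical, the input is assumed to be a linear sequence digested to completion (the text uses partial digests), and the cut position T^TAA is assumed from the general behaviour of TTAA-recognising enzymes of this type.

```python
def tru9i_fragments(seq: str, site: str = "TTAA", cut_offset: int = 1) -> list[str]:
    """Cut a linear sequence at every occurrence of `site`.

    cut_offset is the number of bases of the site left on the upstream
    fragment (1 for T^TAA, an assumption stated in the lead-in).
    """
    seq = seq.upper()
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])  # remainder after the last cut
    return fragments


# Toy example with two TTAA sites.
for fragment in tru9i_fragments("GGTTAACCTTAAGG"):
    print(len(fragment), fragment)
```

Applied to a genomic sequence, the list of fragment lengths gives a rough idea of how many fragments would fall within the preferred 0.5-1.0 kb range.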
L. lactis transformant colonies were grown on brain heart infusion agar, and nuclease-secreting (Nuc+) clones were detected with a toluidine blue-DNA-agar overlay (0.05 M Tris pH 9.0, 10 g agar per litre, 10 g NaCl per litre, 0.1 mM CaCl2, 0.03% wt/vol salmon sperm DNA and 90 mg toluidine blue dye), essentially as described by Shortle, 1983, supra, and Le Loir et al., 1994, supra. The plates were then incubated at 37°C for 2 hours, after which nuclease-secreting clones showed an easily identifiable pink halo. Plasmid DNA was isolated from the Nuc+ recombinant L. lactis clones, and one strand of each DNA insert was sequenced using the NucSeq sequencing primer described in Appendix I, which sequences directly through the DNA insert.
Isolation of genes encoding exported proteins from S. pneumoniae
The nuclease screening system was used to identify a large number of gene sequences putatively encoding exported proteins of S. pneumoniae. These were analysed further to remove artefacts. The sequences identified with the screening system were analysed using a number of parameters.
1. All putative surface proteins were analysed for leader/signal peptide sequences using the software programs Sequencher (Gene Codes Corporation) and DNA Strider (Marck, Nucleic Acids Research, 16:1829-1836 (1988)). Bacterial signal peptide sequences share a common design, characterised by a short positively charged N-terminus (n-region) immediately followed by a stretch of hydrophobic residues (the central portion, or h-region) and then by a more polar C-terminal portion containing the cleavage site (c-region). Computer software is available that plots the hydropathy profile of a putative protein, from which the very distinctive hydrophobic portion (h-region) typical of leader peptide sequences can readily be identified. In addition, the sequences were checked for a potential ribosome binding site (Shine-Dalgarno motif), which is required for translation initiation of the putative nuc reporter fusion protein.
2. All putative surface protein sequences were matched against all protein/DNA sequences in publicly available databases [OWL-proteins, which includes SwissProt and GenBank translations]. This allowed sequences similar to known genes, or to homologues of genes with an assigned function, to be identified. The function of some of the genes identified using the LEEP system could therefore be predicted, confirming unequivocally that the system can be used to identify and isolate gene sequences of surface-associated proteins. It should also be possible to confirm that these proteins are indeed surface-associated and not artefacts. The LEEP system has been used to identify novel gene targets for vaccines and therapy.
3. Some of the genes identified encode proteins that have no typical leader peptide sequence and show no homology with any DNA/protein sequence in the databases. These proteins in fact demonstrate the principal advantage of the screening method of the invention, namely the isolation of atypical surface-associated proteins, which would be missed by all previously described screening methods and by methods based on sequence homology searches.
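The hydropathy-profile analysis mentioned in item 1 can be sketched as follows. This is an illustration only: the text does not state which hydropathy scale or window size the cited programs used, so the Kyte-Doolittle scale and the 9-residue window are assumptions, and the function names are hypothetical. The example input is the N-terminal region of the ID4 protein from the sequence listing.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not named in the text).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list[float]:
    """Mean hydropathy of every window of `window` consecutive residues."""
    scores = [KD[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def most_hydrophobic_window(protein: str, window: int = 9) -> tuple[int, float]:
    """Start index (0-based) and score of the most hydrophobic window.

    Raises ValueError if the protein is shorter than the window.
    """
    profile = hydropathy_profile(protein, window)
    start = max(range(len(profile)), key=profile.__getitem__)
    return start, profile[start]


# N-terminal region of the ID4 protein from the sequence listing.
n_terminus = "MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSMAKNNK"
start, score = most_hydrophobic_window(n_terminus)
print(f"most hydrophobic window starts at residue {start + 1}, mean score {score:.2f}")
```

A pronounced positive peak close to the N-terminus, preceded by positively charged residues, would be consistent with the n-region/h-region layout described above; the threshold for calling a peak an h-region is left to the analyst.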
In all cases only a partial gene sequence was obtained initially, and in all cases the full-length gene could be obtained by reference to the TIGR S. pneumoniae database (www@tigr.org). Full-length gene sequences were thus identified by matching the partial sequences originally obtained against the database. As described herein, three groups of genes were clearly identified by this approach: one group encoding previously unidentified S. pneumoniae proteins, a second group encoding proteins that show some homology to known proteins from a variety of sources, and a third group encoding S. pneumoniae proteins that are known but are not known antigens.
Example 2: Vaccine trials
pcDNA3.1+ as a DNA vaccine vector
pcDNA3.1+
The vector chosen for use as a DNA vaccine vector was pcDNA3.1 (Invitrogen) (actually pcDNA3.1+; the forward orientation was used in all cases, but the vector is referred to herein as pcDNA3.1). This vector has been used widely and successfully in the literature as a host vector to test vaccine candidate genes for their ability to elicit protection against pathogens (Zhang et al., Kurar and Splitter, Anderson et al.). The vector was designed for high-level stable and non-replicative transient expression in mammalian cells. pcDNA3.1 contains the ColE1 origin of replication, which allows high-copy-number replication and growth in E. coli. This in turn allows many genes to be cloned and tested rapidly and efficiently. The pcDNA3.1 vector has a large number of cloning sites and also contains the gene encoding ampicillin resistance, which aids clone selection, and the human cytomegalovirus (CMV) immediate-early promoter/enhancer, which permits efficient high-level expression of the recombinant protein. The CMV promoter is a strong viral promoter in a wide range of cell types, including both muscle cells and immune (antigen-presenting) cells. This is important for an optimal immune response, since it is not yet known which cell type is most important in generating a protective response in vivo. A T7 promoter upstream of the multiple cloning site allows efficient expression of the modified insert of interest and allows in vitro transcription of the cloned gene in the sense orientation.
Zhang, D., Yang, X., Berry, J., Shen, C., McClarty, G. and Brunham, R.C. (1997), "DNA vaccination with the major outer-membrane protein gene induces acquired immunity to Chlamydia trachomatis (mouse pneumonitis) infection", Infection and Immunity, 176, 1035-40.
Kurar, E. and Splitter, G.A. (1997), "Nucleic acid vaccination of Brucella abortus ribosomal L7/L12 gene elicits immune response", Vaccine, 15, 1851-57.
Anderson, R., Gao, X.M., Papakonstantinopoulou, A., Roberts, M. and Dougan, G. (1996), "Immune response in mice following immunization with DNA encoding fragment C of tetanus toxin", Infection and Immunity, 64, 3168-3173.
Preparation of DNA vaccines
Oligonucleotide primers were designed for each individual gene of interest obtained using the LEEP system. Each gene was examined thoroughly and, where possible, primers were designed to target only the portion of the gene believed to encode the mature portion of the gene product. It was hoped that expressing only those sequences encoding the mature portion of the target protein would facilitate its correct folding when expressed in mammalian cells. For example, in the majority of cases the primers were designed so that the final amplification product cloned into the pcDNA3.1 expression vector would not contain the putative N-terminal signal peptide sequence. The signal peptide directs the polypeptide precursor to the cell membrane via the protein export pathway, where it is normally cleaved off by signal peptidase I (or signal peptidase II in the case of a lipoprotein). The signal peptide therefore forms no part of the mature protein, whether that protein is displayed on the bacterial surface or secreted. Where an N-terminal leader peptide sequence was not immediately obvious, primers were designed to target the whole gene sequence for cloning and eventual expression in pcDNA3.1.
However, other features of the proteins may also affect the expression and presentation of a soluble protein. DNA sequences encoding such features were excluded from the genes encoding the proteins of interest during the design of the oligonucleotides. These features included:
1. LPXTG cell wall anchoring motifs.
2. LXXC lipoprotein attachment sites.
3. Hydrophobic C-terminal domains.
4. Where no N-terminal signal peptide or LXXC was present, the stop codon was removed.
5. Where no hydrophobic C-terminal domain or LPXTG motif was present, the stop codon was removed.
Appropriate PCR primers were designed for each gene of interest, and any and all regions encoding the above features were removed from the gene when the primers were designed. The primers were designed with an appropriate restriction enzyme site followed immediately by a conserved Kozak nucleotide sequence (in most cases GCCACC was used, except on rare occasions, for example ID59; the Kozak sequence facilitates recognition of the initiator sequence by eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of interest. For example, a forward primer using the BamH I site would begin GCGGGATCCGCCACCATG, followed by a short section of the 5′ end of the gene of interest. The reverse primer was designed to be compatible with the forward primer and, in most cases, carried a Not I restriction site at its 5′ end (this site is TTGCGGCCGC; note that on rare occasions, for example ID59, an Xho I site was used in place of Not I).
PCR primers
The following PCR primers were designed to amplify the required truncated genes.
ID5
Forward primer 5′ CGGATCCGCCACCATGGGTCTAATTGAAGACTTAAAAAATCAA 3′
Reverse primer 5′ TTGCGGCCGCCAATGCTAGACTAAACACAAGACTCA 3′
ID59
Forward primer 5′ CGCGGATCCATGAAAAAAATCTATTCATTTTTAGCA 3′
Reverse primer 5′ CCCTCGAGGGCTACTTCCGATACATTTTAAACTGTAGG 3′
ID51
Forward primer 5′ CGGATCCGCCACCATGAGTCATGTCGCTGCAAATG 3′
Reverse primer 5′ TTGCGGCCGCATACCAAACGCTGACATCTACG 3′
ID29
Forward primer 5′ CGGATCCGCCACCATGCAAAAAGAGCGGTATGGTTATG 3′
Reverse primer 5′ TTGCGGCCGCACCCCCATTCTTAATCCCTT 3′
ID50
Forward primer 5′ CGGATCCGCCACCATGGAGGTATGTGAAATGTCACGTAAA 3′
Reverse primer 5′ TTGCGGCCGCTTTTACAAAGTCAAGCAAAGCC 3′
Cloning
Genomic DNA was isolated from S. pneumoniae type 4 strain 11886, obtained from the National Collection of Type Cultures, and used as template for PCR amplification of the inserts together with the flanking features described above. The PCR products were cut with the appropriate restriction enzymes and cloned into the multiple cloning site of pcDNA3.1 using conventional molecular biology techniques. Suitably mapped clones of the genes of interest were cultured and the plasmids were isolated on a large scale (>1.5 mg) using Plasmid Mega kits (Qiagen). Successful cloning and maintenance of the genes were further confirmed by restriction mapping and by sequencing about 700 base pairs through the 5′ cloning junction of each large-scale preparation of each construct.
Strain validation
A type 4 strain, the S. pneumoniae strain whose genome has been sequenced, was used for cloning and challenge. A freeze-dried ampoule of a homogeneous laboratory stock of S. pneumoniae type 4 strain 11886 was obtained from the National Collection of Type Cultures. The ampoule was opened and the culture resuspended in 0.5 ml of Tryptone Soya Broth (0.5% glucose, 5% blood). The suspension was subcultured into 10 ml of Tryptone Soya Broth (0.5% glucose, 5% blood) and incubated statically overnight at 37°C. This culture was streaked onto 5% blood agar plates to check for contaminants and confirm viability, and onto blood agar slopes, and the remainder of the culture was used to prepare 20% glycerol stocks. The slope cultures were sent to the Public Health Laboratory Service, where the serotype was confirmed as type 4.
A glycerol stock of NCTC11886 was streaked onto a 5% blood agar plate and incubated overnight at 37°C in a CO2 gas jar. Fresh streaks were prepared and optochin sensitivity was confirmed.
Pneumococcal challenge
A standard inoculum of frozen S. pneumoniae type 4 culture was prepared by passaging a pneumococcal culture once through mice, harvesting the pneumococci from the blood of the infected animals, growing them in broth to a predetermined viable count of about 10^9 cfu/ml and then freezing. The preparation is summarised in the following flow chart:
Streak the pneumococcal culture and confirm its identity
↓
Grow an overnight culture from 4-5 colonies on the plate above
↓
Animal passage of the pneumococcal culture (intraperitoneal injection; harvest by cardiac bleed)
↓
Grow an overnight culture from the animal-passaged pneumococci
↓
Grow a day culture from the overnight animal-passaged culture (to a predetermined optical density) and freeze at -70°C; this is the standard inoculum
↓
Thaw one aliquot of the standard inoculum for a viable count
↓
Use the standard inoculum to determine the effective dose (termed the virulence test)
↓
Use the standard inoculum at the effective dose for all subsequent challenges
An aliquot of the standard inoculum was diluted 500-fold in PBS and used to inoculate the mice.
The mice were lightly anaesthetised with halothane, and a dose of 1.4 × 10^5 cfu of pneumococci was then applied to the nose of each mouse. Uptake was facilitated by the normal breathing of the mouse, which was left to recover on its back.
Pneumococcal vaccine trials
Vaccine trials in mice were carried out by administering DNA to 6-week-old CBA/ca mice (Harlan UK). Mice to be vaccinated were divided into groups of six, and each group was immunised with recombinant pcDNA3.1+ plasmid DNA containing a specific target gene sequence of interest. A total of 100 μg of DNA in Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior muscle of both legs (50 μl per leg). A boost was given 4 weeks later using the same procedure. For comparison, control groups were included in all vaccine trials. These control groups were either unimmunised animals or animals given non-recombinant pcDNA3.1+ DNA only (sham immunised), following the same time course as described above. Three weeks after the second immunisation, all groups of mice were challenged intranasally with a lethal dose of S. pneumoniae serotype 4 (strain NCTC11886). The number of bacteria administered was monitored by plating serial dilutions of the inoculum on 5% blood agar plates. One problem with intranasal inoculation is that in some mice the inoculum bubbles back out of the nostrils; this has been noted in the results tables and taken into account in the calculations. A less obvious problem is that each mouse may swallow a proportion of the inoculum. It is assumed that this proportion is the same for each mouse and averages out over the course of the inoculations. However, the sample sizes used are small, and this problem could have a significant effect in some experiments. All mice still surviving were killed 3 or 4 days after infection. During the infection, the challenged mice were monitored for the development of symptoms associated with the onset of S. pneumoniae-induced disease. Typical symptoms, in order of appearance, included piloerection, an increasingly hunched posture, discharge from the eyes, increased lethargy and reluctance to move. The later symptoms usually coincided with the development of a moribund state, at which stage the mice were culled to prevent further suffering. These mice were considered to be very close to death, and the time of culling was used as the survival time for statistical analysis. Where mice were found dead, the survival time was taken as the last time point at which the mouse was monitored and known to be alive.
Interpretation of results
A positive result was recorded if any DNA sequence that was cloned and used in the challenge experiments described above gave protection against the challenge. DNA sequences were considered protective if they gave statistically significant protection (at the 95% confidence level, p<0.05), marginal or near-significant protection as determined using the Mann-Whitney test, or showed some protective features, for example one or more outlying mice or a lengthened time to first death. Given that some results may have been blurred by the problems associated with intranasal infection, it is acceptable to treat near-significant or non-significant results as potential positive results.
Results
Trials 1-6 (see Figure 1)
* - a small amount of the dose bubbled back out when this mouse was dosed, so it may not have received the full inoculum.
T - terminated at the end of the experiment with no symptoms of infection.
Numbers in brackets - survival times disregarded because an incomplete dose was taken.
P value 1 refers to the significance test against the unimmunised control.
P value 2 refers to the significance test against the pcDNA3.1+-immunised control.
Statistical analysis
Mean survival times (hours)
| Mouse number | Unimmunised control (1) | pcDNA3.1+ (1) | ID5 (1) | Unimmunised control (2) | ID59 (2) | Unimmunised control (5) | ID59 (5) | Unimmunised control (6) | ID51 (6) |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 47.5 | 61.0 | 61.0 | 49.0 | 55.0 | 58.0 | 55.3 | 71.6 * | 50.0 |
| 2 | 57.0 | 47.5 | 61.0 | 51.0 | 55.0 | 75.0 | 98.0 | 60.7 | 99.9T |
| 3 | 47.5 | 50.5 | 57.0 | 49.0 | 55.0 | 48.0 | 58.5 | 98.5 | 53.6 |
| 4 | 47.5 | 50.5 | 72.0 | 55.0 | 69.5 | 46.7 | 55.3 | (101.2) *T | 99.9 |
| 5 | 77.0 | 72.0 | 47.5 | 49.0 | 74.0 | 58.0 | 53.5 | 60.7 | 59.4 |
| 6 | 57.0 | 50.5 | Dead mouse | 49.0 | Dead mouse | 75.0 | 98.0 | 50.8 | 50.0 * |
| Mean | 55.6 | 55.3 | 59.7 | 50.3 | 61.7 | 60.1 | 69.8 | 68.4 | 68.8 |
| sd | 11.5 | 9.4 | 8.8 | 2.4 | 9.3 | 12.5 | 21.9 | 18.3 | 24.4 |
| P value 1 | - | - | 0.1722 | - | 0.0064 | - | 0.2862 | - | <36.0 |
| P value 2 | - | - | 0.2565 | - | - | - | - | - | - |
Trial 1 - None of the groups had a significantly longer survival time than the controls. The survival times of the unimmunised and pcDNA3.1 control groups were not significantly different. The ID5 group contained one outlying mouse, and the mean survival time of the ID5 group was extended, but not significantly.
Trial 2 - The ID59-immunised group survived significantly longer than the unimmunised control group.
Trial 5 - The ID59-immunised group survived on average about 10 hours longer than the control group, but this result was not statistically significant.
Trial 6 - The ID51-immunised group did not survive significantly longer than the unimmunised control group (p=<36.0); however, there were 2 outlying mice in the immunised group.
Vaccine trials 7 and 8 (see Figure 2)
Mean survival times (hours)
| Mouse number | Unimmunised control (7) | ID29 (7) | Unimmunised control (8) | ID50 (8) |
|---|---|---|---|---|
| 1 | 59.6 | 73.1 | 45.1 | 60.6 |
| 2 | 47.2 | 54.8 | 50.8 | 60.6 |
| 3 | 59.6 | 59.3 | 60.4 | 51.1 |
| 4 | 70.9 | 54.8 * | 55.2 | 60.6 |
| 5 | 68.6 * | 59.3 | 45.1 | 60.6 |
| 6 | 76.0 | 54.8 | 45.1 | 60.6 |
| Mean | 63.6 | 59.35 | 50.2 | 59.1 |
| sd | 10.3 | 7.1 | 6.4 | 3.9 |
| P value 1 | - | <39.0 | - | 0.0048 |
Trial 7 - The ID29-immunised group showed a prolonged time to first death.
Trial 8 - The ID50-immunised group survived significantly longer than the unimmunised control group.
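The Mann-Whitney comparison used to interpret these trials can be sketched as follows, using the trial 8 survival times from the table above. This is a minimal illustration, not the software used for the reported analysis: the function name `mann_whitney_u` is hypothetical, and only the U statistic is computed. Converting U into a p-value requires Mann-Whitney tables or a statistics package, and no attempt is made here to reproduce the p-values in the table.

```python
def mann_whitney_u(sample_a, sample_b):
    """U statistic for sample_a versus sample_b.

    Counts the (a, b) pairs in which a exceeds b; tied pairs count 0.5.
    """
    u = 0.0
    for a in sample_a:
        for b in sample_b:
            if a > b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u


# Survival times (hours), trial 8.
control_8 = [45.1, 50.8, 60.4, 55.2, 45.1, 45.1]
id50_8 = [60.6, 60.6, 51.1, 60.6, 60.6, 60.6]

u_id50 = mann_whitney_u(id50_8, control_8)
u_control = mann_whitney_u(control_8, id50_8)

# The two U values always sum to len(sample_a) * len(sample_b).
print(u_id50, u_control, u_id50 + u_control)
```

The smaller of the two U values is the one normally compared against the critical value for two groups of six; a very small value indicates that nearly every immunised mouse outlived nearly every control mouse.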
Appendix I - Oligonucleotide primers
nucS1 (Bgl II, EcoRV)
5′-cgagatctgatatctcacaaacagataacggcgtaaatag-3′
nucS2 (Bgl II, Sma I)
5′-gaagatcttccccgggatcacaaacagataacggcgtaaatag-3′
nucS3 (Bgl II, EcoRV)
5′-cgagatctgatatccatcacaaacagataacggcgtaaatag-3′
nucR (BamH I)
5′-cgggatccttatggacctgaatcagcgttgtc-3′
NucSeq
5′-ggatgctttgtttcaggtgtatc-3′
pTREPF
5′-catgatatcggtacctcaagctcatatcattgtccggcaatggtgtgggctttttttgttttagcggataagttatccgcta-3′
pTREPR
5′-gcggatcccccgggcttaattaatgtttaaacactagtcgaagatctcgcgaattctcctgtgtgaaattgttatccgcta-3′
pUCF
5′-cgccagggttttcccagtcacgac-3′
VR
5′-tcaggggggcggagcctatg-3′
V1
5′-tcgtatgttgtgtggaattgtg-3′
V2
5′-tccggctcgtatgttgtgtggaattg-3′
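The statement in section (b) that the three sense primers place a blunt-end site in three different reading frames relative to nuc can be checked directly on the sequences above. The sketch below is illustrative only: the function name `frame_offset` is hypothetical, the blunt cut is taken to fall in the middle of the EcoRV (gat^atc) and Sma I (ccc^ggg) sites, and the gene-specific region is assumed to start at the 3′ stretch shared by all three primers.

```python
# Sense primers from Appendix I with their blunt-end recognition sites.
PRIMERS = {
    "nucS1": ("cgagatctgatatctcacaaacagataacggcgtaaatag", "gatatc"),        # EcoRV
    "nucS2": ("gaagatcttccccgggatcacaaacagataacggcgtaaatag", "cccggg"),     # Sma I
    "nucS3": ("cgagatctgatatccatcacaaacagataacggcgtaaatag", "gatatc"),      # EcoRV
}

# 3' stretch common to all three primers (assumed gene-specific part).
SHARED = "tcacaaacagataacggcgtaaatag"


def frame_offset(primer: str, blunt_site: str, shared: str = SHARED) -> int:
    """Bases between the blunt cut and the shared region, modulo 3."""
    cut = primer.index(blunt_site) + len(blunt_site) // 2
    gene_start = primer.index(shared)
    return (gene_start - cut) % 3


offsets = {name: frame_offset(seq, site) for name, (seq, site) in PRIMERS.items()}
print(offsets)
```

Three distinct offsets mean that a genomic fragment ligated at the blunt site can be read through into nuc in one of the three vectors, whichever frame its own coding sequence ends in.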
ID4 1200bp
ATGAGAAATATGTGGGTTGTAATCAAGGAAACCTATCTTCGACATGTCGAGTCATGGAGTTTCTTCTTTATGGTGA
TTTCGCCGTTCCTCTTTTTAGGAATCTCTGTAGGAATTGGGCATCTCCAAGGTTCTTCTATGGCTAAAAATAATAA
AGTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAACTTCGACTATAA
AGACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAAGAAGATA
GTGTTCTAAAGGCAGTTTATCATGGCGAAACATCGCTTGAAAATGGGAATTAAATTTGAGGTTACAGGTACACTCA
ATGAACTGCAAAATCAGCTTAATCGTTCAACTGCTTCCTTGTCTCAAGAGCAGGAAAAACGCTTAGCGCAGACAA
TTCAATTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTGCCTTAG
GATTCTTTCTTTATATGATTCTGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAA
AATTATGGAAGTCGTTTTTTCTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATGATGGCTCTGTTTCTAGTG
ATTTTAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCTGCCGTTTTGCTCTTTAAAGATTTGCCATTCTTGGC
TCAGTCTGGTATTTTGGATCACTTGGGAGATGCTATCTCACTGAATACCTTGCTCTTTATTTTGATCAGTCTTTTCA
TGTACGTAGTCTTGGCAGCCTTCCTAGGATCTATGGTTTCTCGTCCTGAGGACTCAGGGAAAGCCTTGTCGCCTTT
GATGATTTTGATTATGGGTGGTTTTTTTGGAGTGACAGCTCTAGGTGCAGCTGGTGACAATCTCCTCTTGAAGATT
GGTTCTTATATTCCCTTTATTTCGACCTTCTTTATGCCGTTTCGAACGATTAATGACTATGCGGGGGGAGCAGAAG
CATGGATTTCACTTGCTATTACAGTGATTTTTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCTAGTCT
CGTTCTTCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCTTATCTTATAAATAG
MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSMAKNNKVAVVTTVPSVAEGLKNVNGVNFDYKD
EASAKEAIKEEKLKGYLTIDQEDSVLKAVYHGETSLENGIKFEVTGTLNELQNQLNRSTASLSQEQEKRLAQTIQFTEKI
DEAKENKKFIQTIAAGALGFFLYMILITYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGIYVVG
GLAAVLLFKDLPFLAQSGILDHLGDAISLNTLLFILISLFMYVVLAAFLGSMVSRPEDSGKALSPLMILIMGGFFGVTALG
AAGDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWISLATTVIFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYK
Z
ID5 1125bp
CCTGGGAAAGTCTTGAAAATTATGATAGAATGGTGGAAGGAAAAATTCAGGAGAGTAGTAGTGACTCAAAATGTT
GAAAGTCTTCTCGTATCCATTGTAATCAGTGCATACAATGAAGAAAAATATCTGCCTGGTCTAATTGAAGACTTAA
AAAATCAAACCTATCCTAAAGAGGATATTGAAATTCTATTTATAAATGCTATGTCCACAGATGGGACCACAGCTA
TCATTCAGCAATTTATAAAGGAAGATACAGAGTTTAACTCAATTAGATTGTATAACAATCCTAAGAAAAATCAAG
CTAGTGGTTTTAACCTGGGAGTTAAACATTCTGTAGGGGACCTTATTTTAAAAATTGATGCTCATTCAAAAGTTAC
TGAGACTTTTGTAATGAACAATGTGGCTATTATTCAACAAGGTGAATTTGTCTGTGGGGGGCCTAGACCGACGATT
GTCGAAGGAAAAGGAAAATGGGCAGAGACCTTGCATCTTGTTGAGGAAAATATGTTTGGCAGTAGCATTGCCAAT
TATCGAAATAGTTCTGAGGATAGATATGTTTCTTCTATTTTTCATGGAATGTATAAACGAGAGGTTTTCCAGAAGG
TTGGTTTAGTAAATGAGCAACTTGGCCGAACTGAAGATAATGATATTCATTATAGAATTCGAGAATATGGTTATAA
AATCCGCTATAGCCCAAGTATTCTATCTTATCAGTATATTCGACCAACATTCAAGAAAATGCTGCATCAAAAGTAT
TCAAATGGTTTGTGGATTGGCTTGACAAGTCATGTTCAGTTTAAGTGTTTATCATTATTTCACTATGTTCCTTGTTT
ATTTGTTTTGAGTCTTGTGTTTAGTCTAGCATTGTTACCGATCACATTCGTATTCATAACTTTACTATTAGGTGCCT
ATTTTCTACTTTTGTCATTACTCACTTTGCTGACTTTATTAAAACATAAAAATGGATTTCTAATTGTGATGCCCTTT
ATTTTATTTTCCATTCACTTTGCTTATGGCCTTGGGACGATTGTAGGTTTAATTAGAGGATTTAAATGGAAGAAGG
AGTACAAGAGAACAATAATTTATTTGGATAAAATAAGCCAAATAAATCAAAATATGCTATAA
PGKVLKIMIEWWKEKFRRVVVTQNVESLLVSIVISAYNEEKYLPGLIEDLKNQTYPKEDIEILFINAMSTDGTTAIIQQFIK
EDTEFNSIRLYNNPKKNQASGFNLGVKHSVGDLILKIDAHSKVTETFVMNNVAIIQQGEFVCGGPRPTIVEGKGKWAET
LHLVEENMFGSSIANYRNSSEDRYVSSIFHGMYKREVFQKVGLVNEQLGRTEDNDIHYRIREYGYKIRYSPSILSYQYIRP
TFKKMLHQKYSNGLWIGLTSHVQFKCLSLFHYVPCLFVLSLVFSLALLPTTFVFITLLLGAYFLLLSLLTLLTLLKHKNGF
LIVMPFILFSIHFAYGLGTIVGLIRGFKWKKEYKRTIIYLDKISQINQNMLZ
ID11 696bp
ATGATGAAAGAACAAAATACGATAGAAATCGATGTATTTCAATTAGTTAAAAGCTTGTGGAAACGCAAGCTAATG
ATTTTAATAGTGGCACTTGTGACAGGTGCGGGGGCTTTTGCATATAGCACTTTTATTGTTAAGCCAGAATATACGA
GTACCACGCGAATTTACGTAGTGAATCGCAATCAAGGAGACAAGCCGGGGTTGACAAATCAGGATTTGCAGGCAG
GAACTTATCTGGTAAAAGACTACCGTGAGATTATCCTTTCGCAGGATGTTTTGGAGGAAGTTGTTTCTGATTTGAA
ACTAGATTTGACGCCAAAAGGTTTGGCTAATAAAATTAAAGTGACAGTACCAGTTGATACCCGTATTGTCTCTATT
TCAGTTAATGATCGAGTTCCTGAAGAGGCAAGCCGTATCGCTAACTCTTTGAGAGAAGTAGCTGCTCAAAAAATT
ATCAGTATTACTCGTGTTTCTGACGTGACAACACTGGAGGAGGCAAGGCCGGCGATATCCCCGTCTTCGCCAAAT
ATTAAACGCAATACACTAATTGGTTTTTTGGCAGGGGTGATTGGAACTAGTGTTATAGTTCTTCATCTTGAACTTTT
GGATACTCGTGTGAAACGTCCGGAAGATATCGAAAATACATTGCAGATGACACTTTTGGGAGTTGTGCCAAACTT
GGGTAAGTTGAAATAG
MMKEQNTIEIDVFQLVKSLWKRTKLMTLIVALVTGAGAFAYSTFIVKPEYTSTTRIYVVNRNQGDKPGLTNQDLQAGTYL
VKDYREIILSQDVLEEVVSDLKLDLTPKGLANKIKVTVPVDTRIVSISVNDRVPEEASRIANSLREVAAQKIISITRVSDVT
TLEEARPAISPSSPNIKRNTLIGFLAGVIGTSVIVLHLELLDTRVKRPEDIENTLQMTLLGVVPNLGKLKZ
ID19 555bp
ATGGTAAAAGTAGCAGTTATATTAGCTCAGGGCTTTGAAGAAATTGAAGCCTTGACAGTTGTAGATGTCTTGCGTC
GAGCCAATATCACATGTGATATGGTTGGTTTTGAAGAGCAAGTAACGGGTTCGCATGCAATCCAAGTAAGAGCAG
ATCATGTCTTTGATGGAGATTTATCAGACTATGATATGATTGTTCTTCCTGGAGGTATGCCTGGTTCTGCACATTTA
CGTGATAATCAGACCTTGATTCAAGAATTGCAAAGCTTCGAGCAAGAAGGGAAGAAACTAGCAGCCATTTGTGCG
GCACCAATTGCCCTCAATCAAGCAGAGATATTGAAAAATAAGCGATACACTTGTTATGACGGCGTTCAAGAGCAA
ATCCTTGATGGTCACTACGTCAAGGAAACAGTAGTGGTAGATGGTCAGTTGACAACCAGTCGGGGTCCTTCAACA
GCCCTTGCCTTTGCCTACGAGTTGGTGGAGCAACTAGGAGGGGACGCAGAGAGTTTACGAACAGGAATGCTCTAT
CGAGATGTCTTTGGTAAAAATCAGTAA
MVKVAVILAQGFEEIEALTVVDVLRRANTTCDMVGFEEQVTGSHAIQVRADHVFDGDLSDYDMIVLPGGMPGSAHLR
DNQTLIQELQSFEQEGKKLAAICAAPIALNQAEILKNKRYTCYDGVQEQILDGHYVKETVVVDGQLTTSRGPSTALAFA
YELVEQLGGDAESLRTGMLYRDVFGKNQZ
ID27 306bp
GTGGTAGGGATGGTAGAACCAAACCTAGAAAGCCTTATAAAAGATCTTTACAATCATGCTCGACATGATTTGAGT
GAAGATTTAGTTGCTGCTCTCCTAGAGACTACTAAAAAACTGCCTACTACAAATGAGCAATTGCAGGCAGTTCGTC
TCTCAGGCCTGGTCAATCGTGAATTGCTCCTAAATCCCAAACATCCAGCACCTGAGTTGCTCAACTTGGCTCGCTT
TGTCAAAAGAGAAGAAGCCAAGTACAGAGGAACTGCGACTTCTGCGCTTATGTATGAGGAACTCTTTAAAATGCT
TTGA
MVGMVEPNLESLIKDLYNHARHDLSEDLVAALLETTKKLPTTNEQLQAVRLSGLVNRELLLNPKHPAPELLNLARFVK
REEAKYRGTATSALMYEELFKMLZ
ID29 945bp
TTGTTCTTAAAAAAGGAAAGAGAGGTAATCAGCATGCGTAAATGGACAAAAGGATTTCTCATCTTTGGTGTGGTG
ACTACCGTTATCGGCTTTATCCTGCTTTTTGTAGGTATCCAATCTGACGGGATTAAGAGCCTACTTTCCATGTCCAA
AGAACCTGTCTATGATAGCCGTACGGAAAAGCTAACCTTTGGCAAGGAAGTCGAAAACCTAGAAATTACTCTCCA
CCAACACACGCTCACCATCACAGACTCTTTCGATGATCAAATCCACATTTCTTACCATCCATCTCTTTCTGCTCAC
CATGATCTTATCACCAATCAGAACGATAGAACTCTGAGTCTCACTGATAAGAAACTGTCTGAAACTCCGTTTCTCT
CTCTGGAATTGGTGGGATTCTTCATATCGCAAGTAGCTACTAGTCGTTTTGAAGAAGTTATTCTCCGACTACC
AAAAGGGAGAACTCTAAAAGGGATCAACATCTCAGCCAATCGCGGACAAACCACCATCATAAATGCTAGCCTTGA
AAATGCGACCCTCAATACAAACAGCTATATCCTCCGAATTGAAGGAAGTCGTATCAAAAACAGTAAACTCACAAC
GCCCAATATCGTTAATATCTTTGATACAGTTCTTACAGATAGTCAGCTAGAGTCAACAGAGAATCACTTCCACGCT
GAAAATATCCAAGTCCATGGCAAGGTTGAACTGACTGCCAAAGATTATCTCAGAATCATCCTAGACCAGAAAGAA
AGCCAACGAATTAACTGGGACATCTCAAGCAACTATGGTTCTATCTTCCAATTCACAAGAGAAAAGCCTGAATCA
AGAGGTACGGAATTAAGCAACCCTTACAAAACTGAAAAAACCGATGTCAAGGATCAACTCATTGCGAGATCTGAT
GATAATATTGATCTAATATCCACACCAAGCAGACGTTGA
MFLKKEREVISMRKWTKGFLIFGVVTTVIGFILLFVGIQSDGIKSLLSMSKEPVYDSRTEKLTFGKEVENLEITLHQHTLTI
TDSFDDQIHISYHPSLSAHHDLITNQNDRTLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVILRLPKGRTLKGINISANR
GQTTIINASLENATLNTNSYILRIEGSRIKNSKLTTPNIVNIFDTVLTDSQLESTENHFHAENIQVHGKVELTAKDYLRIILD
QKESQRINWDISSNYGSIFQFTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTPSRRZ
ID30 879bp
ATGAAACAAGAATGGTTTGAAAGTAATGATTTTGTAAAAACAACAAGCAAGAACAAGCCTGAAGAGCAAGCTCA
AGAGGTTGCAGACAAGGCTGAAGAAACGATAGCCGATCTCGATACACCAATTGAAAAAAATACTCAGTTAGAGG
AGGAAGTCCCTCAAGCTGAAGTCGAATTGGAAAGCCAGCAAGAAGAGAAAATTGAAGCTCCTGAAGACAGTGAA
GCGAGAACAGAAATAGAAGAAAAGAAGGCATCTAATTCTACTGAAGAAGAGCCAGACCTTTCTAAAGAAACAGA
AAAAGTCACTATAGCTGAAGAGAGCCAAGAAGCTCTTCCTCAGCAAAAAGCAACCACGAAAGAGCCACTTCTTAT
CAGTAAATCTTTAGAAAGTCCTTATATCCCCGACCAAGCTCCAAAATCTAGGGATAAATGGAAAGAGCAAGTGCT
TGATTTTTGCGTCTTTGGCTAGTGGAAGCGATCAAATCTCCACAAGTAAGTTGGAAACAAGTATCACACACAGTTAC
ACAGCCTTTCTCTTGCTCATTCTGTTTTCTGCATCTTCCTTTTTCTTTAGTATCTATCACATCAAACATGCTTACTAT
GGACATATAGCAAGCATTAACAGTCGCTCCCTGAGCAGCTAGCTCCTTTAACTCTTTTTTCTATCATCTCTATCCT
AGTAGCGACAACACTCTTCTTCTTTTCATTCCTCTTGGGTAGTTTCGTTGTGAGACGATTTATCCACCAGGAAAAG
GACTGGACGCTAGACAAGGTTCTCCAACAATATAGTCAACTCTTTGGCAATTCCAATCTCCTCACTGCTATTGCTAG
TTTCTTTGCTTTCTTTGATAGCCTACGATTTACAGCCCTCTTGTGTGTGA
MKQEWFESNDFVKTTSKNKPEEQAQEVADKAEETIADLDTPIEKNTQLEEEVPQAEVELESQQEEKIEAPEDSEARTEIE
EKKASNSTEEEPDLSKETEKVTIAEESQEALPQQKATTKEPLLISKSLESPYIPDQAPKSRDKWKEQVLDFWSWLVEAIKS
PTSKLETSITHSYTAFLLLILFSASSFFFSIYHIKHAYYGHIASINSRFPEQLAPLTLFSIISILVATTLFFFSFLLGSFVVRRFIH
QEKDWTLDKVLQQYSQLLAIPISSLLLLVSLLSLIAYDLQPSCVZ
ID105 990bp
ATGCAACTCGCTTCTTCGGTCTACTCATTGTTCGTCTGGTACAATTTGTTCTTAAAAAAGGAAAGAGAGGTAATCA
GCATGCGTAAATGGACAAAAGGATTTCTCATCTTTGGTGTGGTGACTACCGTTATCGGCTTTATCCTGCTTTTTGTA
GGTATCCAATCTGACGGGATTAAGAGCCTATTTCCATGTCCAAAGAACCTGTCTATGATAGCCGTACGGAAAAG
CTAACCTTTGGCAAGGAAGTCGAAAACCTAGAAATTACTCTCCACCAACACACGCTCACCATCACAGACTCTTTC
GATGATCAAATCCACATTTCTTACCATCCATCTCTTTCTGCTCACCATGATCTTATCACCAATCAGAACGATAGAA
CTCTGAGTCTCACTGATAAGAAACTGTCTGAAACTCCGTTTCTCTCTTCTGGAATTGGTGGGATTCTTCATATCGC
AAGTAGCTACTCTAGTCGGTTTTGAAGAAGTTATTCTCCGACTACCAAAAGGGAGAACTCTAAAAGGGATCAACAT
CTCAGCCAATCGCGGACAAACCACCATCATAAATGCTAGCCTTGAAAATGCGACCCTCAATACAAACAGCTATAT
CCTCCGAATTGAAGGAAGTCGTATCAAAAACAGTAAACTCACAACGCCCAATATCGTTAATATCTTTGATACAGTT
CTTACAGATAGTCAGCTAGAGTCAACAGAGAATCACTTCCACGCTGAAAATATCCAAGTCCATGGCAAGGTTGAA
CTGACTGCCAAAGATTATCTCAGAATCATCCTAGACCAGAAAGAAAGCCAACGAATTAACTGGGGACATCTCAAGC
AACTATGGTTCTATCTTCCAATTCACAAGAGAAAAGCCTGAATCAAGAGGTACGGAATTAAGCAACCCTTACAAA
ACTGAAAAAACCGATGTCAAGGATCAACTCATTGCGAGATCTGATGATAATATTGATCTAATATCCACACCAAGC
AGACGTTGA
MQLASSVYSLFVWYNLFLKKEREVISMRKWTKGFLIFGVVTTVIGFILLFVGIQSDGIKSLLSMSKEPVYDSRTEKLTFG
KEVENLEITLHQHTLTITDSFDDQIHISYHPSLSAHHDLITNQNDRTLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVIL
RLPKGRTLKGINISANRGQTTIINASLENATLNTNSYILRIEGSRIKNSKLTTPNIVNIFDTVLTDSQLESTENHFHAENIQV
HGKVELTAKDYLRIILDQKESQRINWDISSNYGSIFQFTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTPSRR
Z
ID107 78bp
ATGATATGTAAAATGAAGCAGGGAGGGAGCAGGGCGTGCTGGGGATGGAGAGTGGGGGAGGGACGCTGCTATTT
TAATC
MICKMKQGGSRACWGWRVGEGRCYFN
ID109 714bp
CGATAAAGAGGCCTTGAGTAATCTCAATTTGCAGATTGAAAATGGAGAGATTATGGGCTTGATTGGTCATAATGG
GGCTGGAAAATCGACCACTATAAAATCCCTAGTCAGTATCATTTCACCCAGCAGTGGTCGTATTTTGGTAGACGGT
CAGGAGTTATCGGAAAATCGCTTGGCTATTAAACGAAAGATTGGCTACGTAGCAGACTCGCCTGACTTATTTTTAC
GCTTAACGGCCAATGAATTTTGGGAATTGATCGCCTCATCCTATGATCTGAGTAGATCTGACTTGGAGGCTAGTCT
AGCTAGGCTATTGAACGTTTTTGATTTTGCTGAAAATCGCTATCAGGTTATTGAAACTCTTTCTCACGGAATGCGT
CAGAAAGTCTTTGTCATCGGAGCACTCTTGTCTGATCCCGATATTTGGGTTTTGGACGAACCCTTGACTGGTTTGG
ATCCCCAGGCTGCCTTTGATTTGAAACAGATGATGAAGGAACATGCACAAAAAGGGAAGACAGTCTTGTTTTCAA
CTCATGTCCTAGAGGTGGCAGAGCAAGTCTGTGATCGGATTGCCATTTTGAAAAAGGGGCATTTGATTTATTGTGG
TAAGGTAGAGGACTTGAGGAAAGACCACCCAGACCAGTCTTTGGAAAGTATCTACCTTTAGTCTTGCTGGTAGAAA
AGAGGAGGTTGCGGATGCGTCTCAAGGTCATTAA
DKEALSNLNLQIENGEIMGLIGHNGAGKSTTIKSLVSIISPSSGRILVDGQELSENRLAIKRKIGYVADSPDLFLRLTANEF
WELIASSYDLSRSDLEASLARLLNVFDFAENRYQVIETLSHGMRQKVFVIGALLSDPDIWVLDEPLTGLDPQAAFDLKQ
MMKEHAQKGKTVLFSTHVLEVAEQVCDRIAILKKGHLIYCGKVEDLRKDHPDQSLESIYLSLAGRKEEVADASQGHZ
ID112 360bp
ATGGCTTTGTTTTCAGAGAGAGGAGCAGTACGGAAGACACCAATGGCAAGTCCAATAATGAGACCTATGATGGTT
CCGACGATAGAGATTAAAAGAGTGATACCAGCACCACGCAAGAGTTGTTGCCAGTTTTCAGAAAGAATTTTAGCA
ACTTGGCTAAAGAAACTACTGCTAGTCTCTTCAGTTGTTGTAGCTTCGGCAGGTTGTTCCTTGATCATACGATCCA
TCAAGGCAACTTGGTCATTTTTGAAATGGTTTCAATGCTGGCATTGATTTGGCTAATACGATTGTCATTTTTACGA
AGCCCGATAGCGATAGCTGTATCTTCTTCCCCAGTTTTGAAACCAGGTTCTACTTGA
MALFSERGAVRKTPMASPIMRPMMVPTIEIKRVIPAPRKSCCQFSERILATWLKKLLLVSSVVVASAGCSLIIRSIKATWSS
FEMVSMLALIWLIRLSFLPRSPIAIAVSSSPVLKPGSTZ
ID128-3.43
ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCTGTTATCGTATC
CTTGAGTCTATGTGCCTATGCACTAAACCAGCATCGTTCGCAGGAAAATA
AGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTCAAGTCAGAAA
AGTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGC
TGAGCAAATTGTAATCAAAATTACAGATCAGGGCTATGTAACGTCACACG
GTGACCACTATCATTACTATAATGGGAAAGTTCCTTATGATGCCCTCTTT
AGTGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGA
TATTGTCAATGAAGTCAAGGGTGGTTATATCATCAAGGTCGATGGAAAAT
ATTATGTCTACCTGAAAGATGCAGCTCATGCTGATAATGTTCGAACTAAA
GATGAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGT
TAACTCTAATGTTGCTGTAGCAAGGTCTCAGGGACGATATACGACAAATG
ATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCT
TATATCGTTCCTCATGGAGGTCACTATCACTACATTCCCAAAAGCGATTT
ATCTGCTAGTGAATTAGCAGCAGCTAAAGCACATCTGGCTGGAAAAAATA
TGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTGACAATAACACG
CAATCTGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAA
TCTCCAGAGTCTTTTGAAGGAACTCTATGATTCACCTAGCGCCCAACGTT
ACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATTATCAGTCGT
ACACCAAATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCC
TTACAGCAAGCTTTCTGCCTTAGAAGAAAAGATTGCCAGAATGGTGCCTA
TCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAAACCTAATGAAGTA
GTGTCTAGTCTAGGCAGTCTTTCAAGCAATCCTTCTTCTTTAACGACAAG
TAAGGAGCTCTCTTCAGCATCTGATGGTTATATTTTTAATCCAAAAGATA
TCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTC
CATTACATTCCAAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAA
TAGTCTAGCAACACCTTCTCCATCTCTTCCAATCAATCCAGGAACTTCAC
ATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATC
GCTGAAGATGAATCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTA
TTTCTTCAAGAAGGACTTGACAGAAGAGCAAATTAAGGTGCGCAAAAACA
TTTAG
MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQK
SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF
SEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTK
DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA
YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT
QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR
TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV
VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF
HYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII
AEDESGFVMSHGDHNHYFFKKDLTEEQIKVRKNI
*
2
ID2 840bp
ATGGGAATTGCTCTAGAAAATGTGAATTTTACATATCAAGAAGGTACTCCCTTAGCTTCAGCAGCTTTGTCGGATG
TTTCTTTGACGATTGAAGATGGCTCTTATACAGCCTTTAAATTGGGCACACAGGTAGTGGTAAATCAACTATTTTACA
ACTCTTAAATGGTTTATTGGTGCCAAGTCAAGGGAGTGTGAGGGTTTTTGATACCTTAATCACCTCGACTTCTAAA
AATAAAGATATTCGTCAAATTAGAAAACAGGTTGGCTTGGTATTTCAGTTTGCTGAAAATCAGATTTTTGAAGAAA
CGGTTTTGAAGGACGTTGCTTTTGGACCGCAAAATTTTGGAGTTTCTGAAGAAGATGCTGTGAAGACTGCGCGTGA
GAAACTGGCTCTGGTTGGAATTGATGAATCACTTTTTGATCGTAGTCCGTTTGAGCTGTCAGGGGGACAAATGAGA
CGTGTTGCCATTGCAGGCATACTYGCCATGGAGCCAGCTATATTAGTCTTAGATGAGCCAACAGCTGGTCTAGATC
CTCTAGGGAGAAAAGAGTTGATGACCCTGTTCAAAAAACTCCACCAGTCAGGGATGACCATCGTCTTGGTAACGC
ATTTGATGGATGATGTTGCTGAATATGCGAATCAAGTCTATGTAATGGAAAAGGGACGTTTAGTAAAGGGGGGCA
AACCAAGTGATGTCTTTCAAGACGTTGTTTTTATGGAAGAAGTTCAGTTGGGAGTACCTAAAATTACGGCCTTTTG
TAAACGATTGGCTGATAGAGGCGTGTCATTTAAACGATTACCGATTAAGATAGAGGAGTTCAAGGAGTCGCTAAA
TGGATAG
MGIALENVNFTYQEGTPLASAALSDVSLTIEDGSYTALIGHTGSGKSTILQLLNGLLVPSQGSVRVFDTLITSTSKNKDIR
QIRKQVGLVFQFAENQIFEETVLKDVAFGPQNFGVSEEDAVKTAREKLALVGIDESLFDRSPFELSGGQMRRVAIAGILA
MEPAILVLDEPTAGLDPLGRKELMTLFKKLHQSGMTIVLVTHLMDDVAEYANQVYVMEKGRLVKGGKPSDVFQDVV
FMEEVQLGVPKITAFCKRLADRGVSFKRLPIKIEEFKESLNGZ
ID3 6360bp
TACCCGGTAGTCTTAGCAGACACATCTAGCTCTGAAGATGCTTTAAACATCTCTGATAAAGAAAAAGTAGCAGAA
AATAAAGAGAAACATGAAAATATCCATAGTGCTATGGAAACTTCACAGGATTTTAAAGAGAAGAAAACAGCAGTC
ATTTAAGGAAAAAGAAGTTGTTAGTAAAAATCCTGTGATAGACAATAACACTAGCAATGAAGAAGCAAAAATCAA
AGAAGAAAATTCCAATAAATCCCAAGGAGATTATACGGACTCATTTGTGAATAAAAACACAGAAAATCCCAAAAA
AGAAGATAAAGTTGTCTATATTGCTGAATTTAAAGATAAAGAATCTGGAGAAAAAGCAATCAAGGAACTATCCAG
TCTTAAGAATACAAAAGTTTTATATACTTATGATAGAATTTTTAACGGTAGTGCCATAGAAACAACTCCAGATAAC
TTGGACAAAATTAAACAAATAGAAGGTATTTCATCGGTTGAAAGGGCACAAAAAGTCCAACCCATGATGAATCAT
GCCAGAAAGGAAATTGGAGTTGAGGAAGCTATTGATTACCTAAAGTCTATCAATGCTCCGTTTGGGAAAAATTTT
GATGGTAGAGGTATGGTCATTTCAAATATCGATACTGGAACAGATTATAGACATAAGGCTATGAGAATCGATGAT
GATGCCAAAGCCTCAATGAGATTTAAAAAAGAAGACTTAAAAGGCACTGATAAAAATTATTGGTTGAGTGATAAA
ATCCCTCATGCGTTCAATTATTATAATGGTGGCAAAATCACTGTAGAAAAATATGATGATGGAAGGGATTATTTTG
ACCCACATGGGATGCATATTGCAGGGATTCTTGCTGGAAATGATACTGAACAAGACATCAAAAACTTTAACGGCA
TAGATGGAATTGCACCTAATGCACAAATTTTCTCTTACAAAATGTATTCTGACGCAGGATCTGGGTTTGCGGGTGA
TGAAACAATGTTTCATGCTATTGAAGATTCTATCAAACACAACGTTGATGTTGTTTCGGTATCATCTGGTTTTACA
GGAACAGGTCTTGTAGGTGAGAAATATTGGCAAGCTATTCGGGCATTAAGAAAAGCAGGCATTCCAATGGTTGTC
GCTACGGGTAACTATGCGACTTCTGCTTCAAGTTCTTCATGGGATTTAGTAGCAAATAATCATCTGAAAATGACCG
ACACTGGAAATGTAACACGAACTGCAGCACATGAAGATGCGATAGCGGTCGCTTCTGCTAAAAATCAAACAGTTG
AGTTTGATAAAGTTAACATAGGTGGAGAAAGTTTTAAATACAGAAATATAGGGGCCTTTTTCGATAAGAGTAAAA
TCACAACAAATGAAGATGGAACAAAAGCTCCTAGTAAATTAAAATTTGTATATATAGGCAAGGGGCAAGACCAAG
ATTTGATAGGTTTGGATCTTAGGGGCAAAATTGCAGTAATGGATAGAATTTATACAAAGGATTTAAAAAATGCTTT
TAAAAAAGCTATGGATAAGGGTGCACGCGCCATTATGGTTGTAAATACTGTAAATTACTACAATAGAGATAATTG
GACAGAGCTTCCAGCTATGGGATATGAAGCGGATGAAGGTACTAAAAGTCAAGTGTTTTCAATTTCAGGAGATGA
TGGTGTAAAGCTATGGAACATGATTAATCCTGATAAAAAAACTGAAGTCAAAAGAAATAATAAAGAAGATTTTAA
AGATAAATTGGAGCAATACTATCCAATTGATATGGAAAGTTTTAATTCCAACAAACCGAATGTAGGTGACGAAAA
AGAGATTGACTTTAAGTTTGCACCTGACACAGACAAAGAACTCTATAAAGAAGATATCATCGTTCCAGCAGGATC
TACATCTTGGGGGCCAAGAATAGATTTACTTTTAAAACCCGATGTTTCAGCACCTGGTAAAAATATTAAATCCACG
CTTAATGTTATTAATGGCAAATCAACTTATGGCTATATGTCAGGAACTAGTATGGCGACTCCAATCGTGGCAGCTT
CTACTGTTTTGATTAGACCGAAATTAAAGGAAATGCTTGAAAGACCTGTATTGAAAAATCTTAAGGGAGATGACA
AAATAGATCTTACAAGTCTTACAAAAATTGCCCTACAAAATACTGCGCGACCTATGATGGATGCAACTTCTTGGA
AAGAAAAAAGTCAATACTTTGCATCACCTAGACAACAGGGAGCAGGCCTAATTAATGTGGCCAATGCTTTGAGAA
ATGAAGTTGTAGCAACTTTCAAAAACACTGATTCTAAAGGTTTGGTAAACTCATATGGTTCCATTTCTCTTAAAGA
AATAAAAGGTGATAAAAAATACTTTACAATCAAGCTTCACAATACATCAAACAGACCTTTGACTTTTAAAGTTTCA
GCATCAGCGATAACTACAGATTCTCTAACTGACAGATTAAAACTTGATGAAACATATAAAGATGAAAAATCTCCA
GATGGTAAGCAAATTGTTCCAGAAATTCACCCAGAAAAAGTCAAAGGAGCAAATATCACATTTGAGCATGATACT
TTCACTATAGGCGCAAATTCTAGCTTTGATTTGAATGCGGTTATAAATGTTGGAGAGGCCAAAAACAAAAATAAA
TTTGTAGAATCATTTATTCATTTTGAGTCAGTGGAAGCGATGGAAGCTCTAAACTCCAGCGGGAAGAAAATAAAC
TTCCAACCTTCTTTGTCGATGCCTCTAATGGGATTTGCTGGGAATTGGAACCACGAACCAATCCTTGATAAATGGG
CTTGGGAAGAAGGGTCAAGATCAAAAACACTGGGAGGTTATGATGATGATGGTAAACCGAAAATTCCAGGAACCT
TAAATAAGGGAATTGGTGGAGAACATGGTATAGATAAATTTAATCCAGCAGGAGTTATACAAAATAGAAAAGATA
AAAATACAACATCCCTGGATCAAAATCCAGAATTATTTGCTTTCAATAACGAAGGGATCAACGCTCCATCATCAA
GTGGTTCTAAGATTGCTAACATTTATCCTTTAGATTCAAATGGAAATCCTCAAGATGCTCAACTTGAAAGAGGATT
AACACCTTCTCCACTTGTATTAAGAAGTGCAGAAGAAGGATTGATTTCAATAGTAAATACAAATAAAGAGGGAGA
AAATCAAAGAGACTTAAAAGTCATTTCGAGAGAACACTTTATTAGAGGAATTTTAAATTCTAAAAGCAATGATGC
AAAGGGAATCAAATCATCTAAACTAAAAGTTTGGGGTGACTTGAAGTGGGATGGACTCATCTATAATCCTAGAGG
TAGAGAAGAAAATGCACCAGAAAGTAAGGATAATCAAGATCCTGCTACTAAGATAAGAGGTCAATTTGAACCGAT
TGCGGAAGGTCAATATTTCTATAAATTTAAATATAGATTAACTAAAGATTACCCATGGCAGGTTTCCTATATTCCT
GTAAAAATTGATAACACCGCCCCTAAGATTGTTTCGGTTGATTTTTCAAATCCTGAAAAAATTAAGTTGATTACAA
AGGATACTTATCATAAGGTAAAAGATCAGTATAAGAATGAAACGCTATTTGCGAGAGATCAAAAAGAACATCCTG
AAAAATTTGACGAGATTGCGAACGAAGTTTGGTATGCTGGCGCCGCTCTTGTTAATGAAGATGGAGAGGTTGAAA
AAAATCTTGAAGTAACTTACGCAGGTGAGGGTCAAGGAAGAAATAGAAAACTTGATAAAGACGGAAATACCATTT
ATGAAATTAAAGGTGCGGGAGATTTAAGGGGAAAAATCATTGAAGTCATTGCATTAGATGGTTCTAGCAATTTCA
CAAAGATTCATAGAATTAAATTTGCTAATCAGGCTGATGAAAAGGGGATGATTTCCTATTATCTAGTAGATCCTGA
TCAAGATTCATCTAAATATCAAAAGCTTGGCGAGATTGCAGAATCTAAATTTAAAAATTTAGGAAATGGAAAAGA
GGGTAGTCTAAAAAAAGATACAACTGGGGTAGAACATCATCATCAAGAAAATGAAGAGTCTATTAAAGAAAAAT
CTAGTTTTACTATTGATAGAAATATTTCAACAATTAGAGACTTTGAAAATAAAGACTTAAAGAAACTCATTAAAAA
GAAATTTAGAGAAGTTGATGATTTTACAAGTGAAACTGGTAAGAGAATGGAGGAATACGATTATAAATACGATGA
TAAAGGAAATATAATAGCCTACGATGATGGGACTGATCTAGAATATGAAACTGAGAAACTTGACGAAATCAAATC
AAAAATTTATGGTGTTCTAAGTCCGTCTAAAGATGGACATTTGAAATTCTTGGAAAGATAAGTAATGTTTCTAAA
AATGCCAAGGTATATTATGGGAATAACTATAAATCTATAGAAATCAAAGCGACCAAGTATGATTTCCACTCAAAA
ACGATGACATTTGATCTATACGCTAATATTAATGATATTGTGGATGGATTAGCTTTTGCAGGAGATATGAGATTAT
TTGTTAAAGATAATGATCAGAAAAAAGCTGAAATTAAAATTAGAATGCCTGAAAAAATTAAGGAAACTAAATCAG
AATATCCCTATGTATCAAGTTATGGGAATGTCATAGAATTAGGGGAAGGAGATCTTTCAAAAAACAAACCAGACA
ATTTAACTAAAATGGAATCTGGTAAAATCTATTCTGATTCAGAAAAACAACAATATCTGTTAAAGGATAATATCAT
TCTAAGAAAAGGCTATGCACTAAAAGTGACTACCTATAATCCTGGAAAAACGGATATGTTAGAAGGAAATGGAGT
CTATAGCAAGGAAGATATAGCAAAAATACAAAAGGCCAATCCTAATCTAAGAGCCCTTTCAGAAACAACAATTTA
TGCTGATAGTAGAAATGTTGAAGATGGAAGAAGTACCCAATCTGTATTAATGTCGGCTTTGGACGGCTTTAACATT
ATAAGGTATCAAGTGTTTACATTTAAAATGAACGATAAAGGGGAAGCTATCGATAAAGACGGAAATCTTGTGACA
GATTCTTCTAAACTTGTATTATTTGGTAAGGATGATAAAGAATACACTGGAGAGGATAAGTTCAATGTAGAAGCTA
TAAAAGAAGATGGCTCCATGTTATTTATTGATACCAAACCAGTAAACCTTTCAATGGATAAGAACTACTTTAATCC
ATCTAAATCTAATAAAATTTATGTACGAAATCCAGAATTTTATTTAAGAGGTAAGATTTCTGATAAGGGTGGTTTT
AACTGGGAATTGAGAGTTAATGAATCGGTTGTAGATAATTATTTAATCTACGGAGATTTACACATTGATAACACTA
GAGATTTTAATATTAAGCTGAATGTTAAAGACGGTGACATCATGGACTGGGGAATGAAAGACTATAAAGCAAACG
GATTTCCAGATAAGGTAACAGATATGGATGGAAATGTTTATCTTCAAACTGGCTATAGCGATTTGAATGCTAAAGC
AGTTGGAGTCCACTATCAGTTTTTATATGATAATGTTAAACCCGAAGTAAACATTGATCCTAAGGGAAATACTAGT
ATCGAATATGCTGATGGAAAATCTGTAGTCTTTAACATCAATGATAAAAGAAATAATGGATTCGATGGTGAGATT
CAAGAACAACATATTTATATAAATGGAAAAGAATATACATCATTTAATGATATTAAACAAATAATAGACAAGACA
CTAAACATTAAGATTGTTGTAAAAGATTTTGCAAGAAATACAACCGTAAAAGAATTCATTTTAAATAAAGATACG
GGAGAGGTAAGTGAATTAAAACCTCATAGGGTAACTGTGACCATTCAAAATGGAAAAGAAATGAGTTCAACGATA
GTGTCGGAAGAAGATTTTATTTTACCTGTTTATAAGGGTGAATTAGAAAAAGGATACCAATTTGATGGTTGGGAAA
TTTCTGGTTTCGAAGGTAAAAAAGACGCTGGCTATGTTATTAATCTATCAAAAGATACCTTTATAAAACCTGTATT
CAAGAAAATAGAGGAGAAAAAGGAGGAAGAAAATAAACCTACTTTTGATGTATCGAAAAAGAAAGATAACCCAC
AAGTAAACCATAGTCAATTAAATGAAAGTCACAGAAAAGAGGATTTACAAAGAGAAGAGCATTCACAAAAATCT
GATTCAACTAAGGATGTTACAGCTACAGTTCTTGATAAAAACAATATCAGTAGTAAATCAACTACTAACAATCCT
AATAAGTTGCCAAAAACTGGAACAGCAAGCGGAGCCCAGACACTATTAGCTGCCGGAATAATGTTTATAGTAGGA
ATTTTCTTGGATTGAAGAAAAAAAATCAAGATTAA
YPVVLADTSSSEDALNISDKEKVAENKEKHENIHSAMETSQDFKEKKTAVIKEKEVVSKNPVIDNNTSNEEAKIKEENSN
KSQGDYTDSFVNKNTENPKKEDKVVYIAEFKDKESGEKAIKELSSLKNTKVLYTYDRIFNGSAIETTPDNLDKIKQIEGIS
SVERAQKVQPMMNHARKEIGVEEAIDYLKSINAPFGKNFDGRGMVISNIDTGTDYRHKAMRIDDDAKASMRFKKEDL
KGTDKNYWLSDKIPHAFNYYNGGKITVEKYDDGRDYFDPHGMHIAGILAGNDTEQDIKNFNGIDGIAPNAQIFSYKMY
SDAGSGFAGDETMFHAIEDSIKHNVDVVSVSSGFTGTGLVGEKYWQAIRALRKAGIPMVVATGNYATSASSSSWDLVA
NNHLKMTDTGNVTRTAAHEDAIAVASAKNQTVEFDKVNIGGESFKYRNIGAFFDKSKITTNEDGTKAPSKLKFVYIGK
GQDQDLIGLDLRGKIAVMDRIYTKDLKNAFKKAMDKGARAIMVVNTVNYYNRDNWTELPAMGYEADEGTKSQVFSI
SGDDGVKLWNMINPDKKTEVKRNNKEDFKDKLEQYYPIDMESFNSNKPNVGDEKEIDFKFAPDTDKELYKEDIIVPAG
STSWGPRIDLLLKPDVSAPGKNIKSTLNVINGKSTYGYMSGTSMATPIVAASTVLIRPKLKEMLERPVLKNLKGDDKIDL
TSLTKIALQNTARPMMDATSWKEKSQYFASPRQQGAGLINVANALRNEVVATFKNTDSKGLVNSYGSISLKEIKGDKK
YFTIKLHNTSNRPLTFKVSASAITTDSLTDRLKLDETYKDEKSPDGKQIVPEIHPEKVKGANITFEHDTFTIGANSSFDLN
AVINVGEAKNKNKFVESFIHFESVEAMEALNSSGKKINFQPSLSMPLMGFAGNWNHEPILDKWAWEEGSRSKTLGGYD
DDGKPKIPGTLNKGIGGEHGIDKFNPAGVIQNRKDKNTTSLDQNPELFAFNNEGINAPSSSGSKIANIYPLDSNGNPQDA
QLERGLTPSPLVLRSAEEGLISIVNTNKEGENQRDLKVISREHFIRGILNSKSNDAKGIKSSKLKVWGDLKWDGLIYNPRG
REENAPESKDNQDPATKIRGQFEPIAEGQYFYKFKYRLTKDYPWQVSYIPVKIDNTAPKIVSVDFSNPEKIKLITKDTYHK
VKDQYKNETLFARDQKEHPEKFDEIANEVWYAGAALVNEDGEVEKNLEVTYAGEGQGRNRKLDKDGNTIYEIKGAG
DLRGKIIEVIALDGSSNFTKIHRIKFANQADEKGMISYYLVDPDQDSSKYQKLGEIAESKFKNLGNGKEGSLKKDTTGVE
HHHQENEESIKEKSSFTIDRNISTIRDFENKDLKKLIKKKFREVDDFTSETGKRMEEYDYKYDDKGNIIAYDDGTDLEYE
TEKLDEIKSKIYGVLSPSKDGHFEILGKISNVSKNAKVYYGNNYKSIEIKATKYDFHSKTMTFDLYANINDIVDGLAFAG
DMRLFVKDNDQKKAEIKIRMPEKIKETKSEYPYVSSYGNVIELGEGDLSKNKPDNLTKMESGKIYSDSEKQQYLLKDNII
LRKGYALKVTTYNPGKTDMLEGNGVYSKEDIAKIQKANPNLRALSETTIYADSRNVEDGRSTQSVLMSALDGFNIIRYQ
VFTFKMNDKGEAIDKDGNLVTDSSKLVLFGKDDKEYTGEDKFNVEAIKEDGSMLFIDTKPVNLSMDKNYFNPSKSNKI
YVRNPEFYLRGKISDKGGFNWELRVNESVVDNYLIYGDLHIDNTRDFNIKLNVKDGDIMDWGMKDYKANGFPDKVTD
MDGNVYLQTGYSDLNAKAVGVHYQFLYDNVKPEVNIDPKGNTSIEYADGKSVVFNINDKRNNGFDGEIQEQHIYINGK
EYTSFNDIKQIIDKTLNIKIVVKDFARNTTVKEFILNKDTGEVSELKPHRVTVTIQNGKEMSSTIVSEEDFILPVYKGELEK
GYQFDGWEISGFEGKKDAGYVINLSKDTFIKPVFKKIEEKKEEENKPTFDVSKKKDNPQVNHSQLNESHRKEDLQREEH
SQKSDSTKDVTATVLDKNNISSKSTTNNPNKLPKTGTASGAQTLLAAGIMFIVGIFLGLKKKNQDZ
ID6 597bp
CTTGAATTAAATAAAAAACGTCATGCGACTAAGCATTTTACTGATAAGCTTGTTGATCCCAAAGATGTGCGTACGG
CTATCGAAATTGCAACCTTAGCGCCAAGCGCCCACAACAGCCAGCCTTGGAAATTTGTGGTGGTACGTGAGAAAA
ATGCTGAACTGGCAAAGTTAGCTTATGGTTCCAATTTTGAACAGGTATCATCAGCGCCTTGTAACCATTGCCTTGTT
TACAGATACGGACTTAGCCAAACGTGCTCGTAAGATTGCCCGTGTTGGTGGTGCTAATAACTTTTCTGAAGAGCAA
CTTCAATATTTTATGAAAAATCTGCCAGCTGAGTTTGCCCGTTACAGTGAGCAACAAGTCAGCGACTACCTAGCTC
TCAATGCAGGTTTGGTTGCCATGAACTTGGTTCTTGCATTGACAGACCAAGGAATTGGTTCTAACATTATTCTTGG
TTTTGACAAATCAAAAGTTAATGAAGTTTTGGAAATCGAAGACCGTTTCCGCCCAGAACTCTTGATCACAGTGGGT
TATACAGACGAAAAATTGGAACCAAGCTACCGCTTGCCAGTAGATGAAATCATCGAGAAAAGATAG
LELNKKRHATKHFTDKLVDPKDVRTAIEIATLAPSAHNSQPWKFVVVREKNAELAKLAYGSNFEQVSSAPVTIALFTDT
DLAKRARKIARVGGANNFSEEQLQYFMKNLPAEFARYSEQQVSDYLALNAGLVAMNLVLALTDQGIGSNIILGFDKSK
VNEVLEIEDRFRPELLITVGYTDEKLEPSYRLPVDEIIEKRZ
ID7 1401bp
ATGACAGCAATTGATTTTACAGCAGAAGTAGAAAAACGCAAAGAAGACCTCTTGGCTGACTTGTTTAGCCTTTTG
GAAATCAATTCAGAACGTGATGACAGCAAGGCTGATGCCCAGCATCCATTTGGGCCTGGTCCAGTAAAAGCCTTG
GAGAAATTCCTTGAAATCGCAGACCGCGATGGCTACCCAACTAAGAATGTTGATAACTATGCAGGACATTTTGAG
TTTGGTGATGGAGAAGAAGTTCTCGGAATCTTTGCCCATATGGATGTGGTGCCTGCTGGTAGCGGTTGGGACACAG
ACCCTTACACACCAACTATCAAAGATGGTCGCCTTTATGCGCGCGGGGCTTCGGACGATAAGGGTCCTACAACAG
CTTGTTACTATGGTTTGAAAATCATCAAAGAATTGGGTCTTCCAACTTCTAAGAAAGTTCGCTTCATCGTTGGAAC
AGACGAAGAATCAGGCTGGGCAGACATGGACTACTACTTTGAGCACGTAGGACTTGCCAAACCAGATTTCGGTTT
CTCACCAGATGCTGAATTTCCAATCATCAATGGTGAAAAAGGAAATATCACGGAATACCTCCACTTTGCAGGAGA
AAATACAGGTGTTGCCCGTCTTCACAGCTTTACAGGTGGTTTACGTGAAAATATGGTACCAGAATCAGCAACAGC
AGTCGTTTCAGGTGACTTGGCTGACTTGCAAGCTAAACTAGATGCCTTTGTTGCAGAACACAAACTTAGAGGAGA
ACTCCAAGAAGAAGCTGGCAAATACAAGGTGACGATCATTGGTAAATCAGCCCACGGTGCTATGCCTGCTTCAGG
TGTCAATGGCGCAACTTACCTTGCCCTCTTCCTCAGCCAGTTTGGCTTTGCTGGTCCAGCCAAAGACTACCTTGAC
ATCGCAGGTAAAATTCTCTTGAACGATCATGAGGGTGAAAATCTTAAGATTGCTCATGTGGATGAAAAGATGGGT
GCTCTTTCTATGAATGCCGGCGTCTTCCACTTCGATGAAACAAGTGCTGATAATACCATTGCCCTCAACATCCGCT
ATCCAAAAGGAACAAGTCCAGAACAAATCAAGTCAATCCTTGAAAACTTGCCAGTTGTTTCTGTTAGCCTGTCTGA
ACACGGTCACACGCCTCACTATGTGCCAATGGAAGATCCACTTGTGCAAACCTTGTTGAATATCTATGAAAAACA
AACTGGCTTTAAAGGTCATGAACAAGTCATCGGTGGTGGAACCTTTGGTCGCTTGCTAGAACGCGGAGTTGCCTA
CGGTGCTATGTTCCCAGACTCGATTGATACCATGCACCAAGCCAATGAATTTATCGCCTTGGATGATCTTTTCCGA
GCAGCAGCAATTTATGCCGAAGCTATTTACGAATTGATCAAATAA
MTAIDFTAEVEKRKEDLLADLFSLLEINSERDDSKADAQHPFGPGPVKALEKFLEIADRDGYPTKNVDNYAGHFEFGD
GEEVLGIFAHMDVVPAGSGWDTDPYTPTIKDGRLYARGASDDKGPTTACYYGLKIIKELGLPTSKKVRFIVGTDEESGW
ADMDYYFEHVGLAKPDFGFSPDAEFPIINGEKGNITEYLHFAGENTGVARLHSFTGGLRENMVPESATAVVSGDLADL
QAKLDAFVAEHKLRGELQEEAGKYKVTIIGKSAHGAMPASGVNGATYLALFLSQFGFAGPAKDYLDIAGKILLNDHEG
ENLKIAHVDEKMGALSMNAGVFHFDETSADNTIALNIRYPKGTSPEQIKSILENLPVVSVSLSEHGHTPHYVPMEDPLVQ
TLLNIYEKQTGFKGHEQVIGGGTFGRLLERGVAYGAMFPDSIDTMHQANEFIALDDLFRAAAIYAEAIYELIKZ
ID8 1617bp
GTGTATACTATTATAAAATCAAATATAAAAAAATTTAGTTTATTAACGATATTTATTGTTGCTGGTCAATTATTGCT
AATTTATGCAGCAACTATTAATGCTCTGGTGTTGAATGAATTAATTGCGATGAATTTAGAGCGGTTTTTGAAATTG
TCAATCTACCAAATGATTGTCTGGTGTGGGATAATATTCCTTGACTGGGTAGTGAAAAATTATCAGGTTGAAGTGA
TCCAAGAGTTTAATCTAGAGATTCGAAATAGAGTTGCCACAGACATCTCTAACTCTACCTATCAAGAATTTCATAG
TAAATCATCAGGAACATATCTTTCGTGGCTAAATAATGATGTTCAGACTTTAAATGATCAGGCGTTTAAACAACTT
TTTTTAGTAATAAAAGGAATTTCTGGTACTATATTTGCAGTTGTGACTCTTAATCACTATCATGGTCATTGACTGT
AGCCACCTTGTTTTCATTAATGATTATGCTACTTGTACCAAAAATCTTTGCATCGAAAATGCGAGAAGTTAGTCTA
AATTTAACTAACCAAAATGAAGCTTTTTTAAAATCTAGTGAGACTATATTGAATGGATTTGATGTGTTAGCGTCCT
TGAATCTTTTATATGTATTGCCTAAGAAAATTAAAGAAGCAGGAATTTTATTAAAGATGGTTATACAAAGAAAGA
CAACTGTAGAAACGTTAGCAGGCGCTATTAGCTTCTTTCTCAATATTTTTTTTCAGATATCTCTCGTTTTTTTAACA
GGCTATCTTGCAATAAAAGGAATAGTGAAAATTGGTACTATTGAAGCAATAGGAGCACTAACAGGTGTTATTTTT
ACAGCGCTAGGTGAATTAGGAGGTCAATTATCCTCTATTATTGGTACGAAGCCTATTTTTTTAAAATTGTATTCAA
TTAATCCAATTGAGTCAAATAAAATGAATGATATCGAACCAAATGAGGTGAATAGAGATTTTCCGTTATATGAAG
CAAAAAATATTTGCTATAAGTATGGAGATAAAGAAATATTAAAAAACTTAAATTTTTGTTTTCAACGTAATGAAAA
GTATTTAATTTTAGGTGAAAGTGGAAGCGGGAAATCTACATTATTAAAATTATTGAATGGCTTTTTGAGAGATTAT
AGTGGAGAATTGCGATTCTGCGGGGATGATATAAAAAAAACCTCCTATTTAAATATGGTTTCGAATGTTCTATATG
TAGATCAAAAAGCTTATTTGTTTGAAGGTACGATTAGAGATAATATTTTATTGGAAGAAAATTATACTGATGAAGA
AATACTACAGTCTTTAGAGCAAGTTGGTTTGAGTGTAAAAGATTTTCCTAATAACATTTTAGATTATTATGTTGGT
GATGATGGGAGATTACTGTCAGGAGGGCAGAAACAAAAAATTACTTTAGCTAGAGGGCTAATTAGAAATAAGAA
AATAGTATTAATTGACGAGGGAACTTCTGCTATCGATAGGAGAACTTCGTTAGCGATTGAACGTAAGATATTAGA
TAGAGAGGATTTGACTGTCATTATTGTTACCCATGCTCCGCATCCGGAACTTAAACAATATTTTACTAAGATATAT
CAATTTCCAAAGGATTTTATTTAA
MYTIIKSNIKKFSLLTIFIVAGQLLLIYAATINALVLNELIAMNLERFLKLSIYQMIVWCGIIFLDWVVKNYQVEVIQEFNL
EIRNRVATDISNSTYQEFHSKSSGTYLSWLNNDVQTLNDQAFKQLFLVIKGISGTIFAVVTLNHYHWSLTVATLFSLMIM
LLVPKIFASKMREVSLNLTNQNEAFLKSSETILNGFDVLASLNLLYVLPKKIKEAGILLKMVIQRKTTVETLAGAISFFLNI
FFQISLVFLTGYLAIKGIVKIGTIEAIGALTGVIFTALGELGGQLSSIIGTKPIFLKLYSINPIESNKMNDIEPNEVNRDFPLYE
AKNICYKYGDKEILKNLNFCFQRNEKYLILGESGSGKSTLLKLLNGFLRDYSGELRFCGDDIKKTSYLNMVSNVLYVDQ
KAYLFEGTIRDNILLEENYTDEEILQSLEQVGLSVKDFPNNILDYYVGDDGRLLSGGQKQKITLARGLIRNKKIVLIDEGT
SAIDRRTSLAIERKILDREDLTVIIVTHAPHPELKQYFTKIYQFPKDFIZ
ID9 705bp
ATAACAGTTAAACAGATTATGGACGAAATAGCCGTTTCAGATATGACTGCAAGGCGCTATTTACAGGAATTAGCT
GATAAAGATTTGCTGATTCGTGTGCATGGTGGAGCTGAAAAACTTCGAACCAACTCCCTTTTGACTAATGAGCGAT
CAAATATTGAAAAACAAGCCCTCCAAACGGCAGAAAAACAAGAAATAGCCCATTTTGCAGGCAGTCTAGTAGAA
GAAAGAGAAACTATTTTCATTGGACCAGGAACAACATTAGAGTTTTTTGCGCGTGAGTTGCCTATTGACAATATCC
GCGTCGTAACCAACAGTCTACCTGTTTTTCTGKTTTTAAGCGAACGAAAATTAACAGATTTGATTTTAATAGGTGG
AAATTATCGCGATATTACAGGTGCTTTTGTTGGTACATTGACCCTACAAAATCTCTCTAATCTCCAATTTTCTAAA
GCTTTCGTTAGCTGTAATGGTATTCAAAACGGAGCTCTAGCTACTTTTAGCGAGGAAGAGGGAGAGGCTCAACGC
ATCGCTTTAAATAATTCTAATAAAAAATATTTACTCGCAGATCATAGCAAGTTCAATAAGTTTGATTTTTATACTTT
TTATAATGTATCAAATCTTGATACTATTGTTTCAGATTCTAAACTAAGTGATTCAATCCTTTTTAAGCTATCTAAAC
ACATTAAAGTCATCAAGCCTTAA
ITVKQIMDEIAVSDMTARRYLQELADKDLLIRVHGGAEKLRTNSLLTNERSNIEKQALQTAEKQEIAHFAGSLVEERETI
FIGPGTTLEFFARELPIDNIRVVTNSLPVFLILSERKLTDLILIGGNYRDITGAFVGTLTLQNLSNLQFSKAFVSCNGIQNGA
LATFSEEEGEAQRIALNNSNKKYLLADHSKFNKFDFYTFYNVSNLDTIVSDSKLSDSILFKLSKHIKVIKPZ
ID10 483bp
ATGACTGAGTTTTCGTTAGATCTTCTTCTAGAAGCCATTAAACTAGCTCGTTGGACCTACTACTATCACTTGAAAC
AGCTAGACAAAACAGATAAAGACCAAGAGCTTAAAACTGAAATTCAATCCATCTTTATCGAACACAAGGGAAATT
ATGCTTATCGCCGGGTTCATTTAGAACTAAGAAATCGTGGTTATCTGGTAAATCATAAAAGAGTTCAAGGCTTGAT
GAAAGTACTCAATTTACAAGCTAAAATGCGAAAGAAACGAAAATATTCTTCTCATAAAGGAGACGTTGGTAAGAA
GGCAGAGAATCTCATTCAAGCCCAATTTGAAGGCTCTAAAACAATGGAAAAGTGCTACACAGATGTGACTGAATT
TGCCATTCCAGCAAGTACTCAAAAGCTTTACTTATCACCAGTTTTAGATGGCTTTAACAGCGAAATTATTGCTTTT
AATCTTTCTTGTTCGCCTAATTTAGAATAA
MTEFSLDLLLEAIKLARWTYYYHLKQLDKTDKDQELKTEIQSIFIEHKGNYAYRRVHLELRNRGYLVNHKRVQGLMK
VLNLQAKMRKKRKYSSHKGDVGKKAENLIQAQFEGSKTMEKCYTDVTEFAIPASTQKLYLSPVLDGFNSEIIAFNLSCS
PNLEZ
ID14 1266bp
CCAGGATTTGGTACCGTTGCAAGTGGTGTGCCTTTCCTCCTAAAGGAAAATGGAGGAAAAATCAATCAATCAGCA
CATTCAGATATCAAAGTTGCTAAGGTATTGGTCAAGGATGAAGATGAAAAAAATCGCTTGCTTGCAGCAGGGAAT
GACTTTAACTTTGTAACCAATGTGGATGATATTTTATCAGACCAGGATATTACTATCGTAGTGGAATTGATGGGGC
GTATTGAGCCTGCTAAAACCCTTTATCACTCGTGCCTTGGAAGCTGGAAAACACGTTGTTACTGCTAACAAGGACCT
TTTAGCTGTCCATGGCGCAGAATTGCTAGAAATCGCTCAAGCTAACAAGGTAGCACTTTACTACGAAGCAGCAGT
TGCTGGTGGGATTCCAATTCTTCGTACTTTAGCAAATTCCTTGGCTTCTGATAAAATTACGCGCGTGCTTGGAGTA
GTGAACGGAACTTCCAACTTCATGGTGACCAAGATGGTGGAAGAAGGCTGGTCTTACGATGATGCTCTTGCGGAA
GCACAACGTCTAGGATTTGCAGAAAGCGATCCGACGAATGACGTAGATGGGATTGATGCAGCCTACAAGATGGTT
ATTTTGAGCCAATTTGCCTTTGGCATGAAGATTGCCTTTGATGATGTAGCCCACAAGGGAATCCGCAATATCACAC
CAGAAGACGTAGCTGTAGCTCAAGAGCTTGGTTACGTAGTGAAATTGGTTGGTTCTATTGAGGAAACTTCTTCAGG
TATTGCTGCAGAAGTGACTCCAACCTTCCTACCTAAAGCGCACCCACTTGCTAGTGTGAATGGCGTAATGAACGCT
GTCTTTGTAGAATCTATCGGTATTGGTGAGTCTATGTACTACGGACCAGGTGCGGGTCAAAAACCAACTGCAACA
AGTGTTGTAGCTGATATTGTCCGTATCGTTCGTCGTTTGAATGATGGTACTATTGGCAAAGACTTCAACGAATATA
GCCGTGACTTGGTCTTGGCAAATCCTGAAGATGTCAAAGCAAACTACTATTTCTCAATCTTGGCTCTAGACTCAAA
AGGTCAGGTCTTGAAGTTGGCTGAAATCTTCAATGCTCAAGATATTTCCTTTAAGCAAATCCTTCAAGATGGCAAA
GAGGGTGACAAGGCGCGTGTCGTTATCATCACACACAAGATTAATAAAGCCCAGCTTGAAAATGTCTCAGCTGAA
TTGAAGAAGGTTTCAGAATTCGACCTCTTGAATACCTTCAAGGTGCTAGGAGAATAA
PGFGTVASGVPFLLKENGGKINQSAHSDIKVAKVLVKDEDEKNRLLAAGNDFNFVTNVDDILSDQDITIVVELMGRIEP
AKTFITRALEAGKHVVTANKDLLAVHGAELLEIAQANKVALYYEAAVAGGIPILRTLANSLASDKITRVLGVVNGTSNF
MVTKMVEEGWSYDDALAEAQRLGFAESDPTNDVDGIDAAYKMVILSQFAFGMKIAFDDVAHKGIRNITPEDVAVAQE
LGYVVKLVGSIEETSSGIAAEVTPTFLPKAHPLASVNGVMNAVFVESIGIGESMYYGPGAGQKPTATSVVADIVRIVRRL
NDGTIGKDFNEYSRDLVLANPEDVKANYYFSILALDSKGQVLKLAEIFNAQDISFKQILQDGKEGDKARVVIITHKINKA
QLENVSAELKKVSEFDLLNTFKVLGEZ
ID16 1725bp
ATGAAACACCTATTATCTTACTTCAAACCCTACATCAAGGAATCAATTTTAGCCCCCTTGTTCAAGCTGTTAGAAG
CTGTTTTTGAGCTCTTGGTTCCCATGGTGATTGCTGGGATTGTTGACCAATCTTTACCTCAGGGAGATCAAGGTCA
TCTCTGGATGCAGATTGGCCTGCTCCTTATCTTTGCAGTAATTGGCGTTTTAGTGGCCTTGATAGCTCAATTTTACT
CAGCAAAGGCAGCAGTAGGTTCTGCTAAGGAATTGACAAACGATCTTTATCGTCATATTCTTTCCTTGCCCAAGGA
CAGCAGAGACCGTCTGACAACTTCTAGTTTGGTCACTCGCTTGACTTCGGATACCTACCAGATTCAGACTGGTATC
AATCAATTCCTGCGTCTCTTTTTACGAGCGCCCATTATCGTTTTTGGTGCCATTTTTATGGCTTATCGAATCTCAGC
TGAGTTGACTTTCTGGTTCTTAGTCTTGGTTGCCATTTTTGACCATTGTCATTGTAGGGTTATCTCGATTGGTCAATC
CTTTCTACAGTAGTCTCAGAAAGAAAACGGACCAACTGGTTCAGGAAACGCGCCAGCAATTGCAAGGGATGCGGG
TTATTCGTGCTTTTGGTCAAGAAAAACGAGAGTTACAGATTTTTCAAACCCTTAACCAAGTTTATGCTAGATTACA
AGAAAAGACAGGTTTCTGGTCTAGTTTATTAACACCTCTGACCTATCTGATTGTCAATGGAACTCTTCTCGTTATT
ATCTGGCAAGGCTATATTCAATTCAAGGAGGAGTGCTCAGTCAAGGTGCTCTCATTGCTCTTATCAATTACCTCT
TACAGATTTTGGTGGAATTGGTCAAGCTAGCCATGTTGATCAATTCCCTCAACCAGTCCTATATCTCAGTCAAGCG
AATCGAGGAAGTCTTTGTTGAGGCTCCAGAGGATATCCATTCAGAGTTAGAACAAAAGCAAGCTACCAGAGATAA
GGTTTTACAAGTCCAAGAATTGACCTTTACCTATCCTGATGCGGCCCAGCCTTCTCTGAGATACATTTCCTTTGAT
ATGACTCAAGGACAAATTCTAGGTATCATCGGGGGAACTGGTTCTGGTAAATCAAGCTTGGTGCAACTCTTACTTG
GACTTTATCCAGTAGACAAGGGGAACATTGACCTTTATCAAAATGGACGTAGTCCTCTTAATTTGGAGCAGTGGC
GGTCTTGGATTGCCTATGTACCTCAAAAGGTCGAACTCTTTAAAGGAACCATTCGTTCCAACTTGACTCTAGGTTT
CAATCAAGAAGTATCTGACCAGGAACTCTGGCAGGCCTTGGAGATTGCGCAAGCTAAGGATTTTGTCAGTGAAAA
GGAAGGACTCTTGGATGCTCTAGTTGAGGCAGGGGGGCGAAATTTCTCAGGTGGACAAAAACAAAGATTGTCTAT
CGCCCGAGCAGTCTTGCGCCAGGCTCCGTTTCTCATCCTAGATGATGCAACCTCGGCACTGGATACCATTACAGAG
TCCAAGCTCTTGAAAGCTATTAGAGAAAATTTTCCAAACACGAGCTTAATTTTGATCTCTCAACGAACCTCAACTT
TACAGATGGCGGACCAGATTCTCCTCTTGGAAAAAGGTGAGTTGCTAGCTGTTGGCAAGCACGATGACTTGATGA
AATCCAGCCAAGTCTATTGTGAAATCAATGCATCCCAACATGGAAAGGAGGACTAG
MKHLLSYFKPYIKESILAPLFKLLEAVFELLVPMVIAGIVDQSLPQGDQGHLWMQIGLLLIFAVIGVLVALIAQFYSAKA
AVGSAKELTNDLYRHILSLPKDSRDRLTTSSLVTRLTSDTYQIQTGINQFLRLFLRAPIIVFGAIFMAYRISAELTFWFLVL
VAILTIVIVGLSRLVNPFYSSLRKKTDQLVQETRQQLQGMRVIRAFGQEKRELQIFQTLNQVYARLQEKTGFWSSLLTPL
TYLIVNGTLLVIIWQGYISIQGGVLSQGALIALINYLLQILVELVKLAMLINSLNQSYISVKRIEEVFVEAPEDIHSELEQKQ
ATRDKVLQVQELTFTYPDAAQPSLRYISFDMTQGQILGIIGGTGSGKSSLVQLLLGLYPVDKGNIDLYQNGRSPLNLEQ
WRSWIAYVPQKVELFKGTIRSNLTLGFNQEVSDQELWQALEIAQAKDFVSEKEGLLDALVEAGGRNFSGGQKQRLSIA
RAVLRQAPFLILDDATSALDTITESKLLKAIRENFPNTSLILISQRTSTLQMADQILLLEKGELLAVGKHDDLMKSSQVYC
EINASQHGKEDZ
ID18 1224bp
ATGAAACGTTCTCTCGACTCAAGAGTCGATTACAGTTTGCTCTTGCCAGTATTTTTTCTACTGGTCATCGGTGTGGT
GGCTATCTATATAGCCGTTAGTCATGATTATCCCAATAATATTCTGCCCATTTTAGGGCAGCAGGTCGCCTGGATT
GCCTTGGGGCTTGTGATTGGTTTTGTGGTCATGCTCTTTAATACAGAATTTCTTTGGAAGGTGACCCCCTTTCTATA
TATTTTAGGCTTGGGACTTATGATCTTGCCGATTGTATTTTATAATCCAAGCTTAGTTGCATCAACGGGTGCCAAA
AACTGGGTATCAATAAATGGAATTACCCTATTCCAACCGTCAGAATTTATGAAGATATCCTATATCCTCATGTTGG
CTCGTGTCATTGTCCAATTTACAAAGAAACATAAGGAATGGAGACGCACGGTTCCGCTGGACTTTTTGTTAATTTT
CTGGATGATTCTCTTTACCATTCCAGTCCTAGTTCTTTTAGCACTTCAAAGTGACTTGGGGACGGCTTTGGTTTTTG
TAGCCATTTTCTCAGGAATCGTTTTATTATCAGGGGTTTCTTGGAAAATTATTATCCCAGTATTTGTGACTGCTGTA
ACAGGAGTTGCTGGTTTCTTAGCTATCTTTATTAGCAAGGACGGACGAGCTTTTCTTCACCAGATTGGAATGCCGA
CCTACCAAATTAATCGGATTTTGGCTTGGCTCAATCCCTTTGAGTTTGCCCAAACAACGACTTACCAGCAGGCTCA
AGGGCAGATTGCCATTGGGAGTGGTGGCTTATTTGGTCAGGGATTTAATGCTTCGAATCTGCTTATCCCAGTTCGA
GAGTCAGATATGATTTTTACGGTTATTGCAGAAGATTTTGGCTTTATTGGCTCTGTCCTGGTTATTGCCCTCTATCT
CATGTTGATTTACCGTATGTTGAAGATTACTCTTAAATCAAATAACCAGTTCTACACTTATATTTCCACAGGTTTGA
TTATGATGTTGCTCTTCCACATCTTTGAGAATATCGGTGCTGTGACTGGACTACTTCCTTTGACGGGGATTCCCTTG
CCTTTCATTTCGCAAGGGGGATCAGCTATTATCAGTAATCTGATTGGTGTTGGTTTGCTTTTATCGATGAGTTACCA
GACTAATCTAGCTGAAGAAAAGAGCGGAAAAGTCCCATTCAAACGGAAAAAGGTTGTATTAAAACAAATTAAATA
A
MKRSLDSRVDYSLLLPVFFLLVIGVVAIYIAVSHDYPNNILPILGQQVAWIALGLVIGFVVMLFNTEFLWKVTPFLYILGL
GLMILPIVFYNPSLVASTGAKNWVSINGITLFQPSEFMKISYILMLARVIVQFTKKHKEWRRTVPLDFLLIFWMILFTIPVL
VLLALQSDLGTALVFVAIFSGIVLLSGVSWKIIIPVFVTAVTGVAGFLAIFISKDGRAFLHQIGMPTYQINRILAWLNPFEF
AQTTTYQQAQGQIAIGSGGLFGQGFNASNLLIPVRESDMIFTVIAEDFGFIGSVLVIALYLMLIYRMLKITLKSNNQFYTY
ISTGLIMMLLFHIFENIGAVTGLLPLTGIPLPFISQGGSAIISNLIGVGLLLSMSYQTNLAEEKSGKVPFKRKKVVLKQIKZ
ID22 987bp
ATGGTGGCTAAGAAAAAAATCTTATTTTTTATGTGGTCTTTTTCTCTTGGAGGTGGTGCAGAGAAGATTCTATCAA
CCATTGTTTCAAATCTGGATCCAGAAAAGTATGATATTGATATTCTTGAAATGGAGCACTTTGACAAGGGATATGA
ATCTGTTCCAAAGCATGTACGCATTTTAAAATCCCTTCAAGATTATCGCCAAACCAGATGGTTACGAGCTTTTTTG
TGGAGAATGAGAATTTATTTTCCAAGACTGACTCGTCGTTTGCTTGTAAAAGATGATTATGATGTTGAAGTTTCTT
TTACCATTATGAATCCACCACTGTTGTTCTCTAAAAGAAGAGAAGTCAAGAAGATATCTTGGATTCATGGAAGTAT
TGAAGAACTTCTTAAGGATAGCTCTAAAAGAGAATCACATAGAAGCCAGTTGGATGCTGCGAATACAATTGTAGG
GATTTCAAAAAAGACCAGCAATTCTATCAAGGAAGTTTATCCAGATTATACTTCTAAATTACAGACAATCTACAAT
GGATATGATTTTCAGACTATTCTAGAAAAATCTCAAGAGAAGATCGATATCGAGATTGCTCCTCAAAGTATCTGTA
CTATCGGACGGATTGAGGAAAATAAGGGTTCTGACCGTGTAGTGGAAGTGATACGATTATTACACCAAGAGGGAA
AAAACTATCATCTCTATTTTATCGGGGCTGGTGATATGGAAGAGGAACTGAAAAAACGAGTCAAAGAGTATGGGA
TTGAGGACTATGTACATTTCCTTGGTTATCAAAAAAATCCTTATCAGTATCTATCTCAGACGAAAGTTCTTTTGTCT
ATGTCTAAACAAGAAGGTTTTCCTGGAGTGTATGTGGAGGCCTTGAGTCTGGGACTCCCTTTTATCTCTACGGACG
TTGGAGGGGCTGAGGAATTATCCCAAGAAGGACGATTTGGACAAATCATTGAGAGCAATCAAGAGGCAGCTCAG
GCGAYTACTAATTACATGACTTCTGCCTCAAACTTTGATGTCGATGAGGCTAGCCAATTCATTCAACAATTTACAA
TTACAAAACAAATCGAACAAGTAGAAAAACTATTAGAGGAGTAG
MVAKKKILFFMWSFSLGGGAEKILSTIVSNLDPEKYDIDILEMEHFDKGYESVPKHVRILKSLQDYRQTRWLRAFLWRM
RIYFPRLTRRLLVKDDYDVEVSFTIMNPPLLFSKRREVKKISWIHGSIEELLKDSSKRESHRSQLDAANTIVGISKKTSNSIK
EVYPDYTSKLQTIYNGYDFQTILEKSQEKIDIEIAPQSICTIGRIEENKGSDRVVEVIRLLHQEGKNYHLYFIGAGDMEEEL
KKRVKEYGIEDYVHFLGYQKNPYQYLSQTKVLLSMSKQEGFPGVYVEALSLGLPFISTDVGGAEELSQEGRFGQIIESNQ
EAAQAITNYMTSASNFDVDEASQFIQQFTITKQIEQVEKLLEEZ
ID23 1434bp
ATGGAAACTGCATTAATTAGTGTGATTGTGCCAGTCTATAATGTGGCGCAGTACCTAGAAAAATCGATAGCTTCCA
TTCAGAAGCAGACCTATCAAAATCTGGAAATTATTCTTGTTGATGATGGTGCAACAGATGAAAGTGGTCGCTTGTG
TGATTCAATCGCTGAACAAGATGACAGGGTGTCAGTGCTTCATAAAAAGAACGAAGGATTGTCGCAAGCACGAAA
TGATGGGATGAAGCAGGCTCACGGGGATTATCTGATTTTTATTGACTCAGATGATTATATCCATCCAGAAATGATT
CAGAGCTTATATGAGCAATTAGTTCAAGAAGATGCGGATGTTTCGAGCTGTGGTGTCATGAATGTCTATGCTAATG
ATGAAAGCCCACAGTCAGCCAATCAGGATGACTATTTTGTCTGTGATTCTCAAACATTTCTAAAGGAATACCTCAT
AGGTGAAAAAATACCTGGGACGATTTGCAATAAGCTAATCAAGAGACAGATTGCAACTGCCCTATCCTTTCCTAA
GGGGTTGATTTACGAAGATGCCTATTACCATTTTGATTTAATCAAGTTGGCCAAGAAGTATGTGGTTAATACTAAA
CCCTATTATTACTATTTCCATAGAGGGGATAGTATTACGACCAAACCCTATGCAGAGAAGGATTTAGCCTATATTG
ATATCTACCAAAAGTTTTATAATGAAGTTGTGAAAAACTATCCTGACTTGAAAGAGGTCGCTTTTTTCAGATTGGC
CTATGCCCACTTCTTTATTCTGGATAAGATGTTGCTAGATGATCAGTATAAACAGTTTGAAGCCTATTCTCAGATT
CATCGTTTTTTAAAAGGCCATGCCTTTGCTATTTCTAGGAATCCAATTTTCCGTAAGGGGAGAAGAATTAGTGCTT
TGGCCCTATTCATAAATATTTCCTTATATCGATTCTTATTACTGAAAAATATTGAAAAATCTAAAAAATTACATTA
G
METALISVIVPVYNVAQYLEKSIASIQKQTYQNLEIILVDDGATDESGRLCDSIAEQDDRVSVLHKKNEGLSQARNDGM
KQAHGDYLIFIDSDDYIHPEMIQSLYEQLVQEDADVSSCGVMNVYANDESPQSANQDDYFVCDSQTFLKEYLIGEKIPG
TICNKLIKRQIATALSFPKGLIYEDAYYHFDLIKLAKKYVVNTKPYYYYFHRGDSITTKPYAEKDLAYIDIYQKFYNEVV
KNYPDLKEVAFFRLAYAHFFILDKMLLDDQYKQFEAYSQIHRFLKGHAFAISRNPIFRKGRRISALALFINISLYRFLLLK
NIEKSKKLHZ
ID24 735bp
ATGAGAATCAAAGAGAAAACCAATAATATTAATGGAGGAATAAAAAATGTAAGTAAGCATTATGGTCATTCAATC
ATTCTCAAAGATATAAATTTTGCACTTAACAAGGGTGAAATTGTTGGTCTAGCAGGGAGAAATGGAGTTGGTAAG
AGTACGTTGATGAAAATTCTTGTTCAGAATAATCAACCGACTTCAGGTAATATTATAAGCAGTGATAATGTTGGGT
ATTTAATCGAAGAACCAAAATTATTTTTATCTAAAACAGGTTTAGAGAATTTAAAATATTTGTCAAATTTATATGG
TGTTGACTACAATCAAGAAAGATTTAGATGTTTGATCCAAGAGTTAGATTTGACTCAGTCTATTAATAAAAAAGTA
AAGACCTATTCTTTGGGTACAAAACAAAAATTAGCTTTGCTTCTAACTCTCGTTACGGAACCTGATATATTGATTT
TAGATGAACCGACTAATGGTTTAGATATTGAATCATCACAAATAGTTTTAGCGGTTCTAAAAAAATTAGCTTTACA
TGAAAATGTGGGAATTTTAATATCGAGTCATAAATTAGAAGACATTGAAGAAATTTGTGAGAGAGTTCTTTTCTTG
GAGAACGGGTTTTGACATTTCAAAAAGTAGGAAAAGATAGTCATAATTTCTTGTTTGAGATAGCTTTTTCATCAG
CTACAGATAGAGACATTTTCATTACCAAACAAGAATTTTGGGATATTGTTTAG
MRIKEKTNNINGGIKNVSKHYGHSIILKDINFALNKGEIVGLAGRNGVGKSTLMKILVQNNQPTSGNIISSDNVGYLIEEP
KLFLSKTGLENLKYLSNLYGVDYNQERFRCLIQELDLTQSINKKVKTYSLGTKQKLALLLTLVTEPDILILDEPTNGLDIE
SSQIVLAVLKKLALHENVGILISSHKLEDIEEICERVLFLENGLLTFQKVGKDSHNFLFEIAFSSATDRDIFITKQEFWDIVZ
ID25 1704bp
ATGACTGAATTAGATAAACGTCACCGCAGTAGCATTTATGACAGCATGGTTAAATCACCTAACCGTGCTATGCTTC
GTGCGACTGGTATGACAGATAAGGACTTTGAAACATCGATTGTGGGAGTGATTTCGACTTGGGCGGAAAATACAC
CATGTAACATTCACTTGCATGATTTCGGGAAACTGGCTAAAGAAGGTGTCAAATCTGCAGGCGCTTGGCCTGTAC
AGTTTGGAACCATTACCGTAGCGGACGGGATCGCTATGGGAACGCCTGGTATGCGTTTCTCTCTAACATCTCGTGA
CATCATCGCGGACTCCATCGAGGCGGCTATGAGTGGTCACAACGTGGATGCCTTCGTCGCTATCGGTGGCTGTGA
CAAGAACATGCCTGGATCTATGATTGCTATTGCTAATATGGATATCCCAGCTATTTTCGCCTATGGTGGAACTATT
GCACCGGGAAATCTTGATGGTAAAGATATCGACTTGGTTTCTGTCTTTGAAGGTATCGGAAAATGGAACCACGGT
GACATGACAGCTGAGGACGTGAAACGTCTTGAATGTAATGCCTGCCCTGGCCCTGGTGGTTGTGGTGGTATGTAT
ACTGCTAATACCATGGCAACTGCTATCGAAGTTCTAGGGATGAGTTTGCCAGGGTCATCCTCTCACCCAGCTGAAT
CAGCTGATAAGAAAGAAGATATCGAAGCAGCAGGACGTGCTGTTGTTAAGATGTTGGAACTTGGTCTCAAACCAT
CAGATATCTTGACTCGTGAAGCCTTTGAAGATGCTATCACTGTAACGATGGCTCTCGGTGGTTCTACAAACGCCAC
TCTTCACTTGCTCGCCATTGCCCATGCCGCAAATGTTGACTTGTCACTTGAGGACTTCAATACGATTCAAGAACGT
GTGCCTCACTTGGCCGACTTGAAACCATCTGGTCAGTATGTCTTCCAAGACCTCTACGAAGTCGGTGGTGTCCCTG
CGGTTATGAAGTATTTGTTGGCAAATGGTTTCCTTCACGGAGATCGCATCACATGTACTGGTAAGACTGTAGCTGA
AAACTTGGCTGACTTTGCAGACTTGACTCCAGGCCAAAAAGTTATCATGCCACTTGAAAATCCAAAACGTGCGGA
TGGTCCGCTTATCATCTTGAACGGGAACCTTGCTCCTGACGGTGCAGTTGCCAAGGTATCAGGTGTTAAAGTGCGT
CGTCACGTTGGGCCAGCTAAGGTCTTTGACTCAGAAGAAGATGCGATTCAGGCCGTTCTGACAGATGAAATCGTT
GATGGCGATGTAGTCGTTGTTCGTTTTGTTGGACCTAAAGGTGGTCCTGGTATGCCTGAGATGCTATCACTTTCTTC
AATGATTGTTGGTAAAGGTCAGGGAGATAAGGTGGCCCTCTTGACGGACGGACGTTTCTCTGGTGGTACTTATGGT
CTGGTTGTTGGACATATCGCTCCTGAAGCTCAGGATGGTGGACCAATTGCCTATCTCCGTACCGGCGATATCGTTA
CGGTTGACCAAGATACCAAAGAAATTTCTATGGCCGTATCCGAAGAAGAACTTGAAAAACGCAAGGCAGAAACA
ACCTTGCCACCACTTTACAGCCGTGGTGTCCTCGGTAAATATGCCCACATCGTATCATCTGCTTCACGCGGAGCCG
TGACAGACTTCTGGAATATGGACAAGTCAGGTAAAAAATAA
MTELDKRHRSSIYDSMVKSPNRAMLRATGMTDKDFETSIVGVISTWAENTPCNIHLHDFGKLAKEGVKSAGAWPVQFG
TITVADGIAMGTPGMRFSLTSRDIIADSIEAAMSGHNVDAFVAIGGCDKNMPGSMIAIANMDIPAIFAYGGTIAPGNLDG
KDIDLVSVFEGIGKWNHGDMTAEDVKRLECNACPGPGGCGGMYTANTMATAIEVLGMSLPGSSSHPAESADKKEDIE
AAGRAVVKMLELGLKPSDILTREAFEDAITVTMALGGSTNATLHLLAIAHAANVDLSLEDFNTIQERVPHLADLKPSGQ
YVFQDLYEVGGVPAVMKYLLANGFLHGDRITCTGKTVAENLADFADLTPGQKVIMPLENPKRADGPLIILNGNLAPDG
AVAKVSGVKVRRHVGPAKVFDSEEDAIQAVLTDEIVDGDVVVVRFVGPKGGPGMPEMLSLSSMIVGKGQGDKVALLT
DGRFSGGTYGLVVGHIAPEAQDGGPIAYLRTGDIVTVDQDTKEISMAVSEEELEKRKAETTLPPLYSRGVLGKYAHIVSS
ASRGAVTDFWNMDKSGKKZ
ID26 274bp
ATGTTATAATAAAAATAAAGAATTTAAGGAGAAATACAATATGTCAATTTTTATTGGAGGAGCATGGCCATATGC
AAACGGTTCGTTACATATTGGTCACGCGGCAGCGCTTTTACCGGGGGATATTCTTGCAAGATACTATCGTCAGAA
GGGAGAGGAAGTTTTATATGTTTCTGGAAGTGATTGTAATGGAACCCCTATTTCTATCAGAGCTAAAAAAGAAAA
TAAGTCTGTGAAAGAAATTGCTGATTTTTATCATAAGGAATTTAATCCA
CYNKNKEFKEKYNMSIFIGGAWPYANGSLHIGHAAALLPGDILARYYRQKGEEVLYVSGSDCNGTPISIRAKKENKSVK
EIADFYHKEFNP
ID28 1065bp
ATGACAACATTATTTTCAAAAATTAAAGAAGTAACAGAACTTGCTGCAGTCTCAGGTCATGAAGCGCCTGTCCGT
GCTTATCTTCGTGAAAAGTTGACACCGCATGTGGATGAAGTGGTGACAGATGGCTTGGGTGGTATTTTTGGTATCA
AACATTCAGAAGCTGTGGATGCACCGCGCGTCTTGGTCGCTTCTCATATGGACGAAGTTGGTTTTATGGTCAGCGA
AATCAAGCCAGATGGTACCTTCCGTGTCGTAGAAATCGGTGGCTGGAACCCCATGGTGGTTAGCAGCCAACGTTT
CAAACTCTTGACTCGTGATGGTCATGAAATTCCTGTGATTTCAGGTTCTGTTCCTCCGCATTTGACTCGTGGAAAG
GGGGGACCAACCATGCCAGCCATTGCCGATATCGTTTTTGATGGTGGTTTTGCGGACAAGGCTGAGGCAGAAAGT
TTTGGCATCCGTCCTGGTGATACCATTGTACCAGATAGTTCTGCAATTTTGACAGCCAATGAAAAAAATATCATCT
CAAAAGCTTGGGATAACCGCTACGGTGTCCTCATGGTAAGCGAGCTAGCTGAAGCTTTATCGGGTCAAAAACTCG
GCAATGAACTCTATCTGGGTTCTAACGTCCAAGAAGAAGTTGGTCTGCGTGGCGCTCATACCTCTACAACCAAGTT
TGACCCAGAAGTCTTCCTCGCAGTTGATTGCTCACCAGCAGGTGATGTCTACGGTGGTCAAGGCAAGATTGGAGA
TGGAACCTTGATTCGTTTCTATGATCCAGGTCACTTGCTTCTCCCAGGGATGAAGGATTTCCTTTTGACAACGGCT
GAAGAAGCTGGTATCAAGTACCAATACTACTGTGGTAAAGGCGGAACAGATGCAGGTGCAGCTCATCTGAAAAAT
GGTGGTGTCCCATCAACAACTATCGGTGTCTGCGCTCGTTATATCCATTCTCACCAAACCCTCTATGCAATGGATG
ACTTCCTAGAAGCGCAAGCTTTCTTACAAGCCTTGGTGAAGAAATTGGATCGTTCAACGGTTGATTTGATTAAACA
TTATTAA
MTTLFSKIKEVTELAAVSGHEAPVRAYLREKLTPHVDEVVTDGLGGIFGIKHSEAVDAPRVLVASHMDEVGFMVSEIKP
DGTFRVVEIGGWNPMVVSSQRFKLLTRDGHEIPVISGSVPPHLTRGKGGPTMPAIADIVFDGGFADKAEAESFGIRPGDT
IVPDSSAILTANEKNIISKAWDNRYGVLMVSELAEALSGQKLGNELYLGSNVQEEVGLRGAHTSTTKFDPEVFLAVDCS
PAGDVYGGQGKIGDGTLIRFYDPGHLLLPGMKDFLLTTAEEAGIKYQYYCGKGGTDAGAAHLKNGGVPSTTIGVCARY
IHSHQTLYAMDDFLEAQAFLQALVKKLDRSTVDLIKHYZ
ID31 1182bp
ATGGAATTTTCTATGAAATCAGTCAAAGGACTACTCTTTATCATAGCTAGTTTTATCTTGACTCTTTTGACTTGGAT
GAACACTTCTCCCCAATTCATGATTCCAGGACTAGCTTTAACAAGCCTATCTCTGACTTTTATCCTAGCCACTCGT
CTCCCACTACTAGAAAGCTGGTTTCACAGTTTGGAGAAGGTCTACACCGTCCACAAATTCACAGCCTTTCTCTCAA
TCATCCTACTAATCTTTCATAACTTTAGTATGGGCGGTTTGTGGGGCTCTCGCTTAGCTGCTCAGTTTGGCAATCTT
GCCATCTATATCTTTGCCAGCATCATCCTTGTCGCCTATTTAGGCAAATACATCCAATACGAAGCTTGGCGATGGA
TTCACCGCCTGGTTTACCTAGCCTATATTTTAGGACTCTTTCACATCTACATGATAATGGGCAATCGTCTCCTTACA
TTTAATCTTCTAAGTTTTCTTGTTGGTAGCTATGCCCTTTTAGGCTTACTAGCTGGTTTTTATATCATTTTTCTATAT
CAAAAGATTTCCTTCCCCTATCTAGGGAAAATTACCCATCTCAAACGCTTAAATCACGATACTAGAGAAATTCAA
ATCCATCTTAGCAGACCTTTCAACTATCAATCAGGACAATTTGCCTTTCTAAAGATTTTCCAAGAAGGCTTTGAAA
GTGCTCCGCATCCCTTTTCTATCTCAGGAGGTCATGGTCAAACTCTTTACTTTACTGTTAAAACTTCAGGCGACCA
TACCAAGAATATCTATGATAATCTTCAAGCCGGCAGCAAAGTAACCCTAGACAGAGCTTACGGACACATGATCAT
AGAAGAAGGACGAGAAAATCAGGTTTGGATTGCTGGAGGTATTGGGATCACCCCCTTCATCTCTTACATCCGTGA
ACATCCTATTTTAGATAAACAGGTTCACTTCTACTATAGCTTCCGTGGAGATGAAAATGCAGTCTACCTAGATTTA
CTCCGTAACTATGCTCAGAAAAATCCTAATTTTGAACTCCATCTAATCGACAGTACGAAAGACGGCTATCTTAATT
TTGAACAAAAAGAAGTGCCCGAACATGCAACCGTCTATATGTGTGGTCCTATTTCTATGATGAAGGCACTTGCCA
AACAGATTAAGAAACAAAATCCAAAAACAGAGCATATTTAC
MEFSMKSVKGLLFIIASFILTLLTWMNTSPQFMIPGLALTSLSLTFILATRLPLLESWFHSLEKVYTVHKFTAFLSIILLIFH
NFSMGGLWGSRLAAQFGNLAIYIFASIILVAYLGKYIQYEAWRWIHRLVYLAYILGLFHIYMIMGNRLLTFNLLSFLVGS
YALLGLLAGFYIIFLYQKISFPYLGKITHLKRLNHDTREIQIHLSRPFNYQSGQFAFLKIFQEGFESAPHPFSISGGHGQTLY
FTVKTSGDHTKNIYDNLQAGSKVTLDRAYGHMIIEEGRENQVWIAGGIGITPFISYIREHPILDKQVHFYYSFRGDENAV
YLDLLRNYAQKNPNFELHLIDSTKDGYLNFEQKEVPEHATVYMCGPISMMKALAKQIKKQNPKTEHIY
ID32 900bp
ATGACTTTTAAATCAGGCTTTGTAGCCATTTTAGGACGTCCCAATGTTGGGAAGTCAACCTTTTTAAATCACGTTA
TGGGGCAAAAGATTGCCATCATGAGTGACAAGGCGCAGACAACGCGCAATAAAATCATGGGAATTTACACGACTG
ATAAGGAGCAAATTGTCTTTATCGACACACCAGGGATTCACAAGCCTAAAACAGCTCTCGGAGATTTCATGGTTG
AGTCTGCCTACAGTACCCTTCGCGAAGTGGACACTGTTCTTTTCATGGTGCCTGCTGATGAAGCGCGTGGTAAGGG
GGACGATATGATTATCGAGCGTCTCAAGGCTGCCAAGGTTCCTGTGATTTTGGTGGTGAATAAAATCGATAAGGTC
CATCCAGACCAGCTCTTGTCTCAGATTGATGACTTCCGTAATCAAATGGACTTTAAGGAAATTGTTCCAATCTCAG
CCCTTCAGGGAAATAACGTGTCTCGTCTAGTGGATATTTTGAGTGAAAATCTGGATGAAGGTTTCCAATATTTCCC
GTCTGATCAAATCACAGACCATCCAGAACGTTTCTTGGTTTCAGAAATGGTTCGCGAGAAAGTCTTGCACCTAACT
CGTGAAGAGATTCCGCATTCTGTAGCAGTAGTTGTTGACTCTATGAAACGAGACGAAGAGACAGACAAGGTTCAC
ATCCGTGCAACCATCATGGTCGAGCGCGATAGCCAAAAAGGGATTATCATCGGTAAAGGTGGCGCTATGCTTAAG
AAAATCGGTAGCATGGCCCGTCGTGATATCGAACTCATGCTAGGAGACAAGGTCTTCCTAGAAACCTGGGTCAAG
GTCAAGAAAAACTGGCGCGATAAAAAGCTAGATTTGGCTGACTTTGGCTATAATGAAAGAGAATACTAA
MTFKSGFVAILGRPNVGKSTFLNHVMGQKIAIMSDKAQTTRNKIMGIYTTTDKEQIVFIDTPGIHKPKTALGDFMVESAYS
TLREVDTVLFMVPADEARGKGDDMIIERLKAAKVPVILVVNKIDKVHPDQLLSQIDDFRNQMDFKEIVPISALQGNNVS
RLVDILSENLDEGFQYFPSDQITDHPERFLVSEMVREKVLHLTREEIPHSVAVVVDSMKRDEETDKVHIRATIMVERDSQ
KGIIIGKGGAMLKKIGSMARRDIELMLGDKVFLETWVKVKKNWRDKKLDLADFGYNEREYZ
ID33 855bp
CTGCTTCTTTGTTTTTACAGAAGGAGGACTTATGCCTGAATTACCTGAGGTTGAAACCGTTTGTCGTGGCTTAGAAA
AATTGATTATAGGAAAGAAGATTTCGAGTATAGAAATTCGCTACCCCAAGATGATTAAGACGGATTTGGAAGAGT
TTCAAAGGGAATTGCCTAGTCAGATTATCGAGTCAATGGGACGTCGTGGAAAATATTTGCTTTTTTATCTGACAGA
CAAGGTCTTGATTTCCCATTTGCGGATGGAGGGCAAGTATTTTTACTATCCAGACCAAGGACCTGAACGCAAGCAT
GCCCATGTTTTCTTTCATTTTGAAGATGGTGGCACGCTTGTTTATGAGGATGTTCGCAAGTTTGGAACCATGGAAC
TCTTGGTGCCTGACCTTTTAGACGTCTACTTTATTTCTAAAAAATTAGGTCCTGAACCAAGCGAACAAGACTTTGA
TTTACAGGTCTTTCAATCTGCCCTTGCCAAGTCCAAAAAGCCTATCAAATCCCATCTCCTAGACCAGACCTTGGTA
GCTGGACTTGGCAATATCTATGTGGATGAGGTTCTCTGGCGAGCTCAGGTTCATCCAGCTAGACCTTCCCAGACTT
TGACAGCAGAAGAAGCGACTGCCATTCATGACCAGACCATTGCTGTTTTGGGCCAGGCTGTTGAAAAAGGTGGCT
CCACCATTCGGACTTATACCAATGCCTTTGGGGAAGATGGAAGCATGCAGGACTTTCATCAGGTCTATGATAAGA
CTGGTCAAGAATGTGTACGCTGTGGTACCATCATTGAGAAAATTCAACTAGGCGGACGTGGAACCCACTTTTGTCC
AAACTGTCAAAGGAGGGACTGA
MLLVFTEGGLMPELPEVETVCRGLEKLIIGKKISSIEIRYPKMIKTDLEEFQRELPSQIIESMGRRGKYLLFYLTDKVISHL
RMEGKYFYYPDQGPERKHAHVFFHFEDGGTLVYEDVRKFGTMELLVPDLLDVYFISKKLGPEPSEQDFDLQVFQSALA
KSKKPIKSHLLDQTLVAGLGNIYVDEVLWRAQVHPARPSQTLTAEEATAIHDQTIAVLGQAVEKGGSTIRTYTNAFGED
GSMQDFHQVYDKTGQECVRCGTIIEKIQLGGRGTHFCPNCQRRDZ
ID34 633bp
TTGTCCAAACTGTCAAAGGAGGGACTGATGGGAAAAATCATCGGAATCACTGGGGGAATTGCCTCTGGTAAGTCA
ACTGTGACAAATTTTCTAAGACAGCAAGGCTTTCAAGTAGTGGATGCCGACGCAGTCGTCCACCAACTACAGAAA
CCTGGTGGTCGTCTGTTTGAGGCTCTAGTACAGCACTTTGGGCAAGAAATCATTCTTGAAAACGGAGAACTCAATC
GCCCTCTCCTAGCTAGTCTCATCTTTTCAAATCCTGATGAACGAGAATGGTCTAAGCAAATTCAAGGGGAGATTAT
CCGTGAGGAACTGGCTACTTTGAGAGAACAGTTGGCTCAGACAGAAGAGATTTTCTTCATGGATATTCCCCTACTT
TTTGAGCAGGACTACAGCGATTGGTTTGCTGAGACTTGGTTGGTCTATGTGGACCGAGATGCCCAAGTGGAACGC
TTAATGAAAAGGGACCAGTTGTCCAAAGATGAAGCTGAGTCTCGTCTGGCAGCCCAGTGGCCTTTAGAAAAAAAG
AAAGATTTGGCCAGCCAGGTTCTTGATAATAATGGCAATCAGAACCAGCTTCTTAATCAAGTGCATATCCTTCTTG
AGGGAGGTAGGCAAGATGACAGAGATTAA
MSTCLSKEGLMGKIIGITGGIASGKSTVTNFLRQQGFQVVDADAVVHQLQKPGGRLFEALVQHFGQEIILENGELNRPLL
ASLIFSNPDEREWSKQIQGEIIREELATLREQLAQTEEIFFMDIPLLFEQDYSDWFAETWLVYVDRDAQVERLMKRDQLS
KDEAESRLAAQWPLEKKKDLASQVLDNNGNQNQLLNQVHILLEGGRQDDRDZ
ID35 1269bp
TTGATAATAATGGCAATCAGAACCAGCTTCTTAATCAAGTGCATATCCTTCTTGAGGGAGGTAGGCAAGATGACA
GAGATTAACTGGAAGGATAATCTGCGCATTGCCTGGTTTGGTAATTTTCTGACAGGAGCCAGTATTTCTTTGGTTG
TACCTTTTATGCCCATCTTCGTGGAAAATCTAGGTGTAGGGAGTCAGCAAGTCGCTTTTTATGCAGGCTTAGCAAT
TTCTGTCTCTGCTATTTCCGCGGCGCTCTTTTCTCCTATTTGGGGTATTCTTGCTGACAAATACGGCCGAAAACCCA
TGATGATTCGGGCAGGTCTTGCTATGACTATCACTATGGGAGGCTTGGCTTTGTCCCAAATATCTATTGGTTAAT
CTTTCTTCGTTTACTAAACGGTGTATTTGCAGGTTTTGTTCCTAATGCAACGGCACTGATAGCCAGTCAGGTTCCA
AAGGAGAAATCAGGCTCTGCCTTAGGTACTTTGTCTACAGGCGTAGTTGCAGGTACTCTAACTGGTCCCTTTATTG
GTGGCTTTATCGCAGAATTATTTGGCATTCGTACAGTTTTCTTACTGGTTGGTAGTTTTCTATTTTTAGCTGCTATTT
TGACTATTTGCTTTATCAAGGAAGATTTTCAACCAGTAGCCAAGGAAAAGGCTATTCCAACAAAGGAATTATTTAC
CTCGGTTAAATATCCCTATCTTTTGCTCAATCTCTTTTTAACCAGTTTTGTCATCCAATTTTCAGCTCAATCGATTG
GCCCTATTTTGGCTCTTTATGTACGCGACTTAGGGCAGACAGAGAATCTTCTTTTTGTCTCTGGTTTGATTGTGTCC
AGTATGGGCTTTTCCAGCATGATGAGTGCAGGAGTCATGGGCAAGCTAGGTGACAAGGTGGGCAATCATCGTCTC
TTGGTTGTCGCCCAGTTTTATTCAGTCATCATCTATCTCCTCTGTGCCAATGCCTCTAGCCCCCTTCAACTAGGACT
CTATCGTTTCCTCTTTGGATTGGGAACCGGTGCCTTGATTCCCGGGGTTAATGCCCTACTCAGCAAAATGACTCCC
AAAGCCGGCATTTCGAGGGTCTTTGCCTTCAATCAGGTATTCTTTTATCTGGGAGGTGTTGTTGGTCCCATGGCAG
GTTCTGCAGTAGCAGGTCAATTTGGCTACCATGCTGTCTTTTATGCGACAAGCCTTTGTGTTGCCTTTAGTTGTCTC
TTTAACCTGATTCAATTTCGAACATTATTAAAAGTAAAGGAAATCTAG
MIIMAIRTSFLIKCISFLREVGKMTEINWKDNLRIAWFGNFLTGASISLVVPFMPIFVENLGVGSQQVAFYAGLAISVSAIS
AALFSPIWGILADKYGRKPMMIRAGLAMTITMGGLAFVPNIYWLIFLRLLNGVFAGFVPNATALIASQVPKEKSGSALG
TLSTGVVAGTLTGPFIGGFIAELFGIRTVFLLVGSFLFLAAILTICFIKEDFQPVAKEKAIPTKELFTSVKYPYLLLNLFLTS
FVTQFSAQSIGPILALYVRDLGQTENLLFVSGLIVSSMGFSSMMSAGVMGKLGDKVGNHRLLVVAQFYSVIIYLLCANAS
SPLQLGLYRFLFGLGTGALIPGVNALLSKMTPKAGISRVFAFNQVFFYLGGVVGPMAGSAVAGQFGYHAVFYATSLCV
AFSCLFNLIQFRTLLKVKEIZ
ID36 1311bp
ATGGCCCTACCAACTATTGCCATTGTAGGACGTCCCAATGTTGGGAAATCAACCCTATTTAATCGGATCGCTGGTG
AGCGAATCTCCATTGTAGAAGATGTCGAAGGAGTGACACGTGACCGTATTTATGCAACGGGTGAGTGGCTCAATC
GTTCTTTTAGCATGATTGATACAGGAGGAATTGATGATGTCGATGCTCCTTTCATGGAACAAATCAAGCACCAGGC
AGAAATTGCCATGGAAGAAGCAGATGTTATCGTTTTTGTCGTGTCTGGTAAGGAAGGAATTACTGATGCAGACGA
ATACGTAGCTCGTAAGCTTTATAAGACCCACAAACCAGTTATCCTCGCAGTCAACAAGGTGGACAACCCTGAGAT
GAGAAATGATATATATGATTTCTATGCTCTCGGTTTGGGTGAACCATTGCCTATCTCATCTGTCCATGGAATCGGT
ACAGGGGATGTGCTAGATGCGATCGTAGAAAATCTTCCAAATGAATATGAGGAAGAAAATCCAGATGTCATTAAG
TTTAGCTTGATTGGTCGTCCTAACGTTGGAAAATCAAGCTTGATCAATGCTATCTTGGGAGAAGACCGTGTTATTG
CTAGTCCTGTTGCTGGAACAACTCGTGATGCCATTGATACCCACTTTACAGATACAGATGGTCAAGAGTTTACCAT
GATTGATACGGCTGGTATGCGTAAGTCTGGTAAGGTTTATGAAAATACTGAGAAATACTCTGTTATGCGTGCCATG
CGTGCTATTGACCGTTCAGATGTGGTCTTGATGGTCATCAATGCGGAAGAAGGCATTCGTGAGTACGACAAGCGT
ATCGCAGGATTTGCCCATGAAGCTGGTAAAGGGATGATTATCGTGGTCAACAAGTGGGATACGCTTGAAAAAGAT
AACCACACTATGAAAAACTGGGAAGAAGATATCCGTGAGCAGTTCCAATACCTGCCTTACGCACCGATTATCTTT
GTATCAGCTTTAACCAAGCAACGTCTCCACAAACTTCCTGAGATGATTAAGCAAATCAGCGAAAGTCAAAATACA
CGTATTCCATCAGCTGTCTTGAACGATGTCATCATGGATGCCATTGCCATCAACCCAACACCGACAGACAAAGGA
AAACGTCTCAAGATTTTCTATGCGACCCAAGTGGCAACCAAACCACCAACCTTTGTTCATCTTTGTCAATGAAGAAG
AACTCATGCACTTTTCTTACCTGCGTTTCTTGGAAAATCAAATCCGCAAGGCCTTTGTTTTTGAGGGAACACCGAT
TCATCTCATCGCAAGAAAACGCAAATAA
MALPTIAIVGRPNVGKSTLFNRIAGERISIVEDVEGVTRDRIYATGEWLNRSFSMIDTGGIDDVDAPFMEQIKHTQAEIAM
EEADVIVFVVSGKEGITDADEYVARKLYKTHKPVILAVNKVDNPEMRNDIYDFYALGLGEPLPISSVHGIGTGDVLDAI
VENLPNEYEEENPDVIKFSLIGRPNVGKSSLINAILGEDRVIASPVAGTTRDAIDTHFTDTDGQEFTMIDTAGMRKSGKV
YENTEKYSVMRAMRAIDRSDVVLMVINAEEGIREYDKRIAGFAHEAGKGMIIVVNKWDTLEKDNHTMKNWEEDIREQ
FQYLPYAPIIFVSALTKQRLHKLPEMIKQISESQNTRIPSAVLNDVIMDAIAINPTPTDKGKRLKIFYATQVATKPPTFVIFV
NEEELMHFSYLRFLENQIRKAFVFEGTPIHLIARKRKZ
ID37 714bp
ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAGGTTTAAAGAGCAAGAAATTCCCCAAGTA
GACTTAAATGAGATTTTGACAGCAGCCCAGATGGCATCATCTTGGAAGAATTTCCAATCCTACTCTGTGATTGTGG
TACGAAGTCAAGAGAAGAAAGATGCCTTGTATGAATTGGTACCTCAAGAAGCCATTCGCCAGTCTGCTGTTTTCCT
TCTCTTTGTCGGAGATTTGAACCGAGCAGAAAAGGGAGCCCGACTTCATACCGACACCTTCCAACCCCAAGGTGT
GGAAGGTCTCTTGATTAGTTCGGTCGATGCAGCTCTTGCTGGACAAAACGCCTTGTTGGCAGCTGAAAGCTTGGGC
TATGGTGGTGTGATTATCGGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCTTTAACCTACCTGACTACA
CCTATTCTGTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCCACTAGAGA
ATGTTGTCTTTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGACCGTGTTCAGGCTGACT
ATGCTGGGGCGCGTGCGACCACAAGCTGGAGTCAGCGCCTAGCAGAACAGTTTGGTCAAGCTGAACCAAGCTCAA
CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG
MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVIVVRSQEKKDALYELVPQEAIRQSAVFLLFV
GDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFNLPDYTYSVFG
MALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK
KLLZ
ID38 729bp
ATGACAGAAATTAGACTAGAGCACGTCAGTTATGCCTATGGTCAGGAGAGGATTTTAGAGGATATCAACCTACAG
GTGACTTCAGGCGAAGTGGTTTCCATCCTAGGCCCAAGTGGTGTTGGAAAGACCACCCTCTTTAATCTAATCGCTG
GGATTTTAGAAGTTCAGTCAGGGAGAATTGTCCTTGATGGTGAAGAAAATCCCAAGGGGCGCGTGAGTTATATGT
TGCAAAAGGATCTGCTCTTGGAGCACAAGACGGTGCTTGGAAATATCATTCTGCCCCTCTTGATTCAAAAGGTGG
ATAAGGCAGAAGCTATTTCCCGAGCGGATAAAATTCTTGCGACCTTCCAGCTGACAGCTGTAAGAGACAAGTATC
CTCATGAACTTAGCGGTGGGATGCGCCAGCGTGTAGCCTTACTCCGGACCTACCTTTTTGGGCACAAGCTCTTTCT
CTTAGATGAGGCCTTTAGCGCCTTGGATGAGATGACAAAGATGGAACTCCACGCTTGGTATCTTGAGATTCACAA
GCAGTTGCAGCTAACAACCCTGATCATCACGCATAGTATTGAGGAGGCCCTCAATCTCAGCGACCGTATCTATATC
TTGAAAAATCGCCCTGGGCAGATTGTTTCAGAAATTAAACTAGATTGGTCTGAAGATGAGGACAAGGAAGTCCAA
AAGATTGCCTACAAACGTCAAATTTTGGCGGAATTAGGCTTAGATAAGTAG
MTEIRLEHVSYAYGQERILEDINLQVTSGEVVSILGPSGVGKTTLFNLIAGILEVQSGRIVLDGEENPKGRVSYMLQKDLL
LEIHKTVLGNIILPLLIQKVDKAEAISRADKILATFQLTAVRDKYPHELSGGMRQRVALLRTYLFGHKLFLLDEAFSALDE
MTKMELHAWYLEIHKQLQLTTLIITHSIEEALNLSDRIYILKNRPGQIVSEIKLDWSEDEDKEVQKIAYKRQILAELGLDK
Z
ID39 2433bp
ATGAACTATTCAAAAGCATTGAATGAATGTATCGAAAGTGCCTACATGGTTGCTGGACATTTTGGAGCTCGTTATC
TAGAGTCGTGGCACTTGTTGATTGCCATGTCTAATCACAGTTATAGTGTAGCAGGGGCAACTTTAAATGATTATCC
GTATGAGATGGACCGTTTAGAAGAGGTGGCTTTGGAACTGACTGAAACGGACTATAGCCAGGATGAAACCTTTAC
GGAATTGCCGTTCTCCCGTCGTTTGCAGGTTCTTTTTGATGAAGCAGAGTATGTAGCGTCAGTGGTCCATGCTAAG
GTACTAGGGACAGAGCACGTCCTCTATGCGATTTTGCATGATAGCAATGCCTTGGCGACTCGTATCTTGGAGAGG
GCTGGTTTTTCTTATGAAGACAAGAAAGATCAGGTCAAGATTGCTGCTCTTCGTCGAAATTTAGAAGAACGGGCA
GGCTGGACTCGTGAAGATCTCAAGGCTTTACGCCAACGCCATCGTACAGTAGCTGACAAGCAAAATTCTATGGCC
AATATGATGGGCATGCCGCAGACTCCTAGTGGTGGTCTCGAGGATTATACGCATGATTTGACAGAGCAAGCGCGT
TCTGGCAAGTTAGAACCAGTCATCGGTCGGGACAAGGAAATCTCACGTATGATTCAAATCTTGAGCCGAAGACT
AAGAACAACCCTGTCTTGGTTGGGGATGCTGGTGTCGGGAAAACAGCTCTGGCGCTTGGTCTTGCCCAGCGTATTG
CTAGTGGTGACGTGCCTGCGGAAATGGCTAAGATGCGCGTGTTAGAACTTGATTTGATGAATGTCGTTGCAGGGA
CACGCTTCCGTGGTGACTTTGAAGAACGCATGAATAATATCATCAAGGATATTGAAGAAGATGGCCAAGTCATCC
TCTTTATCGATGAACTCCACACCATCATGGGTTCTGGTAGCGGGATTGATTCGACTCTGGATGCGGCCAATATCTT
GAAACCAGCCTTGGCGCGTGGAACTTTGAGAACGGTTGGTGCCACTACTCAGGAAGAATATCAAAAACATATCGA
AAAAGATGCGGCACTTTCTCGTCGTTTCGCTAAAGTGACGATTGAAGAACCAAGTGTGGCAGATAGTATGACTAT
TTTACAAGGTTTGAAGGCGACTTATGAGAAACATCACCGTGTACAAATCACAGATGAAGCGGTTGAAACAGCGGT
TAAGATGGCTCATCGTTATTTAACCAGTCGTCACTTGCCAGACTCTGCTATCGATCTCTTGGATGAGGCGGCAGCA
ACAGTGCAAAATAAGGCAAAGCATGTAAAAGCAGACGATTCAGATTTGAGTCCAGCTGACAAGGCCCTGATGGAT
GGCAAGTGGAAACAGGCAGCCCAGCTAATCGCAAAAGAAGAGGAAGTACCTGTCTACAAAGACTTGGTGACAGA
GTCTGATATTTTGACCACCTTGAGTCGCTTGTCAGGAATCCCAGTTCAAAAACTGACTCAAACGGATGCTAAGAAG
TATTTAAATCTTGAAGCAGAACTCCATAAACGGGTTATCGGTCAAGATCAAGCTGTTTCAAGCATTAGCCGTGCCA
TTCGCCGCAACCAGTCAGGGATTCGCAGTCATAAGCGTCCGATTGGTTCCTTTATGTTCCTAGGGCCTACAGGTGT
CGGGAAAACTGAATTAGCCAAGGCTCTGGCAGAAGTTCTTTTTGACGACGAATCAGCCCTTATCCGCTTTGATATG
AGTGAGTATATGGAGAAATTTGCAGCTAGTCGTCTCAACGGAGCTCCTCCAGGCTATGTAGGATATGAAGAAGGT
GGGGAGTTGACAGAGAAGGTTCGCAATAAACCCTATTCCGTTCTCCTCTTTGATGAGGTAGAGAAGGCCCACCCA
GATATCTTTAATGTTCTCTTGCAGGTTCTGGATGACGGTGTCTTGACAGATAGCAAGGGACGCAAGGTCGATTTTT
CAAATACCATTATCATTATGACATCGAATCTAGGTGCGACTGCCCTTCGTGATGATAAGACTGTTGGTTTTGGGGC
TAAGGATATTCGTTTTGACCAGGAAAATATGGAAAAACGCATGTTTGAAGAACTGAAAAAAGCTTATAGACCGGA
ATTCATCAACCGTATTGATGAGAAGGTGGTCTTCCATAGCCTATCTAGTGATCATATGCAGGAAGTGGTGAAGATT
ATGGTCAAGCCTTTAGTGGCAAGTTTGACTGAAAAAGGCATTGACTTGAAATTACAAGCTTCAGCTCTGAAATTGT
TAGCAAATCAAGGATATGACCCAGAGATGGGAGCTCGCCCACTTCGCAGAACCCTGCAAACAGAAGTGGAGGAC
AAGTTGGCAGAACTTCTTCTCAAGGGAGATTTAGTGGCAGGCAGCACACTTAAGATTGGTGTCAAAGCAGGCCAG
TTAAAATTTGATATTGCATAA
MNYSKALNTECIESAYMVAGHFGARYLESWHLLIAMSNHSYSVAGATLNDYPYEMDRLEEVALELTETDYSQDETFTE
LPFSRRLQVLFDEAEYTVASVVHAKVLGTEHVLYAILHDSNALATRILERAGFSYEDKKDQVKIAALRRRNLEERAGWTR
EDLKALRQRHRTVADKQNSMANMMGMPQTPSGGLEDYTHDLTEQARSGKLEPVTGRDKEISRMIQILSRKTKNNPVLV
GDAGVGKTALALGLAQRIASGDVPAEMAKMRVLELDLMNVVAGTRFRGDFEERMNNIKDIEEDGQVILFIDELHTIM
GSGSGIDSTLDAANILKPALARGTLRTVGATTQEEYQKHIEKDAALSRRFAKVTIEEPSVADSMTILQGLKATYEKHHRV
QITDEAVETAVKMAHRYLTSRHLPDSAIDLLDEAAATVQNKAKHVKADDSDLSPADKALMDGKWKQAAQLIAKEEEV
PVYKDLVTESDILTTLSRLSGIPVQKLTQTDAKKYLNLEAELHKRVIGQDQAVSSISRAIRRNQSGIRSHKRPIGSFMFLGP
TGVGKTELAKALAEVLFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHP
DIFNVLLQVLDDGVLTDSKGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAKDIRFDQENMEKRMFEELKKAYRPEFIN
RIDEKVVFHSLSSDHMQEVVKIMVKPLVASLTEKGIDLKLQASALKLLANQGYDPEMGARPLRRTLQTEVEDKLAELL
LKGDLVAGSTLKIGVKAGQLKFDIAZ
ID40 1008bp
ATGAAGAAAACATGGAAAGTGTTTTTAACGCTTGTAACAGCTCTTGTAGCTGTTGTGCTTGTGGCCTGTGGTCAAG
GAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTGGACACCAAATACCA
ACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGTGGATGTTGATTTGAAATTGC
CACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATTTGCAGTGTATTTCCAAGACTACATGGC
TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC
TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA
ACTTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAAAAGTACCAAATAACGA
CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACTGCTTGGATTTACTACGGTTGGGATGGTATCCTT
GCTAAATCTCAAGGTGTAGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTACTATTCACCAG
TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAA
GGCTACCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGAA
AAACGTGACTTTGTCATCGAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAAAATGGGGTCAA
TTTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGAAGACTTGACAGAC
AAAGGCTTCACCAACGAATTTGTGAAATAA
MKKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAELKKVDFILDWTPNTNHTGLYTVAKEKGYFKEAGVDVDLKLP
PEESSSDLVINGKAPFAVYFQDYMAKKLEKGAGITAVAAIVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAML
KTLVESQGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDGILAKSQGVDANFMYIKDYVKEFDYYSPVIIANND
YLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKWGQFDAARWNAFY
KWDKENGILKEDLTDKGFTNEFVKZ
ID41 762bp
TTGATGAGAAACTTGAGAAGTATACTGAGACGACACATTAGTCTATTGGGCTTTCTCGGAGTATTGTCAATCTGGC
AGTTAGCAGGTTTTCTTAAACTTCTCCCCAAGTTTATCCTGCCGACACCTCTTGAAATTCTCCAGCCCTTTGTTCGT
GACAGAGAATTTCTCTGGCACCATAGCTGGGCGACCTTGAGAGTGGCTTTACTGGGGCTGATTTTGGGAGTTTTGA
TTGCCTGTCTTATGGCTGTGCTCATGGATAGTTTGACTTGGCTCAATGACCTGATTTACCCTATGATGGTGGTCATT
CAGACCATTCCGACCATTGCCATAGCTCCTATCCTGGTCTTGTGGCTAGGTTATGGGATTTTGCCCAAGATTGTCT
TGATTATCTTAACGACAACCTTTCCCATCATCGTTAGTATTTTGGACGGTTTTAGGCATTGCGACAAGGATATGCT
GACCTTGTTTAGTCTGATGCGGGCCAAGCCTTGGCAAATCCTGTGGCATTTTAAAATCCCAGTTAGCCTGCCTTAC
TTTTATGCAGGTCTGAGGGTCAGTGTCTCCTACGCCTTTATCACAACTGTGGTATCTGAGTGGTTGGGAGGTTTTG
AAGGTCTTGGTGTTTATATGATTCAGTCTAAAAAACTGTTTCAGTATGATACCATGTTTGCCATTATTATTCTGGTG
TCGATTATCAGTCTTTTGGGTATGAAGCTGGTCGATATCAGTGAAAAATATGTGATTAAATGGAAACGTTCGTAG
MMRNLRSILRRHISLLGFLGVLSIWQLAGFLKLLPKFILPTPLEILQPFVRDREFLWHHSWATLRVALLGLILGVLIACLM
AVLMDSLTWLNDLIYPMMVVIQTIPTIAIAPILVLWLGYGILPKIVLIILTTTFPIIVSILDGFRHCDKDMLTLFSLMRAKP
WQILWHFKIPVSLPYFYAGLRVSVSYAFTTTVVSEWLGGFEGLGVYMIQSKKLFQYDTMFAIIILVSIISLLGMKLVDISE
KYVIKWKRSZ
ID42 372bp
TTGATTTTTAATCCTATTTGCTGTATGATAAGGGAAAAGAAAGGGGACAGAGATATGGCTTTTACCAATACCCACA
TGCGATCTGCTAGTTTTGGTATTGTTACCAGCTTGCCTGATGACATCATTGACTCTTTTTGGTATATCATCGACCAT
TTCTTAAAAAATGTCTTTGAATTGGAAGAAGAACTCGAGTTTCAATTGCTTAATAACCAAGGAAAGATTACCTTCC
ACTTTTCAAGTCAACACCTCCCTACAGCCATTGATTTTGACTTTAACCATCCTTTCGACCCTCGTTATCCCCCAAGA
GTACTGGTTTTAGACATGGACGGTAGAGAAACTATCCTCCTCCCAGAAGAAAATGACCTATTTTAA
MIFNPICCMIREKKGDRDMAFTNTHMRSASFGIVTSLPDDIIDSFWYIIDHFLKNVFELEEELEFQLLNNQGKITFHFSSQ
HLPTAIDFDFNHPFDPRYPPRVLVLDMDGRETILLPEENDLFZ
ID43 1569bp
ACAGCGGTGTCATTCTATCTATTTTAAGAAAAGTAATAATCAATTGTTAAAAATAGTAAAAAAATTGGAGGTTCTG
ATGAAATATTTTGTTCCTAATGAGGTATTCAGTATTCGTAAATTAAAGGTGGGGACTTGCTCGGTACTATTGGCAA
TTTCAATTTTGGGAAGCCAAGGTATTTTATCGGATGAAGTTGTTACTAGTTCTTCACCGATGGCTACAAAAGAGTC
TTCTAATGCAATTACTAATGATTTAGATAATTCACCAACTGTTAATCAGAATCGTTCTGCTGAAATGATTGCCTCT
AATTCAACCACTAATGGTTTAGATAATTCGTTAAGTGTTAATAGCATCAGCTCTAATGGTACTATTCGTTCCAATT
CACAATTAGACAACAGAACAGTTGAATCTACAGTAACATCTACTAATGAAAATAAGAGTTATAAGGAAGATGTTA
TAAGTGACAGAATTATCAAAAAAGAATTTGAAGATACTGCTTTAAGTGTAAAAGATTATGGTGCAGTAGGTGATG
GGATTCATGATGATCGACAAGCAATTCAAGATGCAATAGATGCTGCAGCTCAAGGGCTAGGTGGAGGAAATGTAT
ATTTTCCTGAAGGAACTTATTTAGTAAAAGAAATTGTTTTTTAAAAAGTCATACACACTTAGAATTGAATGAGAA
AGCTACAATTCTAAATGGTATAAATATTAAGAATCACCCTTCCATTGTTTTTATGACAGGTTTATTTACGGATGAT
GGTGCGCAAGTAGAATGGGGCCCAACAGAAGATATTAGTTATTCTGGTGGTACGATTGATATGAACGGTGTTTG
AATGAAGAAGGAACTAAAGCAAAAAATCTACCACTTATAAATTCTTCAGGTGCATTTGCTATTGGGAATTCAAAT
AACGTAACTATAAAAAATGTAACATTCAAGGATAGTTATCAAGGGCATGCTATTCAAATTGCAGGTTCGAAAAAT
GTATTAGTTGATAATTCTCGTTTTCTTGGGCAAGCCTTACCCAAAACGATGAAGGATGGGCAAATCATAAGTAAGG
AGAGCATTCAGATTGAACCATTAACTAGAAAAGGTTTTCCTTATGCCTTGAATGATGATGGGAAAAAATCTGAAA
ATGTGACTATTCAAAATTCCTATTTTGGCAAAAGTGATAAATCTGGGGAATTAGTAACAGCAATTGGCACACACTA
TCAAACATTGTCGACACAGAACCCCTCTAATATTAAAATTCAAAATAATCATTTTGATAACATGATGTATGCAGGT
GTACGTTTTACAGGATTCACTGATGTATTAATCAAAGGAAATCGCTTTGATAAGAAAGTTAAAGGAGAGAGTGTA
CATTATCGAGAAAGCGGAGCAGCTTTAGTAAATGCTTATAGCTATAAAAACACTAAAGACCTATTAGATTTAAAT
AAACAGGTGGTTATCGCCGAAAATATATTTAATATTGCCGATCCTAAAACAAAAGCGATACGAGTTGCAAAAGAT
AGTGCAGAATGTTTAGGAAAAGTATCAGATATTACTGTAACAAAAAATGTAATTAATAATAATTCTAAGGAAACA
GAACAACCAAATATTGAATTATTACGAGTTAGTGATAATTTAGTAGTCTCAGAGAATAGT
QRCHSIYFKKSNNQLLKIVKKLEVLMKYFVPNEVFSIRKLKTVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITN
DLDNSPTVNQNRSAEMIASNSTTNGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENKSYKEDVISDRIIKKEFEDT
ALSVKDYGAVGDGIHDDRQAIQDAIDAAAQGLGGGNVYFPEGTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVF
MTGLFTDDGAQVEWGPTEDISYSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIA
GSKNVLVDNSRFLGQALPKTMKDGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSYFGKSDKSGELVTAIGTHY
QTLSTQNPSNIKIQNNHFDNMMYAGVRFTGFTDVLIKGNRFDKKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQ
VVIAENIFNIADPKTKAIRVAKDSAECLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENS
ID44 324bp
GTGATGAAAGAAACTCAGCTATTAAAAGGTGTTCTTGAAGGTTGTGTCTTGGATATGATTGGTCAAAAAGAGCGG
TATGGTTATGAGTTGGTTCAGACTTTGCGAGAGGCTGGATTTGATACTATCGTTCCAGGAACTATTTATCCTTTGTT
GCAAAAGTTAGAAAAAAATCAATGGATAAGAGGCGACATGCGCCCGTCGCCAGATGGTCCAGATCGGAAGTATTT
TTCATTAATGAAAGAAGGAGAAGAGCGTGTCTCAGTCTTTTGGCAACAATGGGACGATTTGAGTCAAAAAGTAGA
AGGGATTAAGAATGGGGGTTAA
MMKETQLLKGVLEGCVLDMIGQKERYGYELVQTLREAGFDTIVPGTIYPLLQKLEKNQWIRGDMRPSPDGPDRKYFSL
MKEGEERVSVFWQQWDDLSQKVEGIKNGGZ
ID45 816bp
ATGAAGAAAATGAAGTATTACGAAGAAACAAGCGCTTTGCTACATGAGTTTTCTGAGGAGAATCAAAAGTATTTT
GAGGAGTTGTGGGAAAGTTTTAATCTTGCTGGATTTCTCTATGATGAAGACTATCTCAGAGAGCAGATCTATTTGA
TGATGCTAGATTTCTCAGAAGCAGAACGAGATGGCATGAGTGCAGAGGATTATCTAGGTAAGAATCCTAAAAAAA
TAATGAAAGAGATTCTCAAGGGAGCACCTCGCAGTTCTATCAAAGAGTCCCTTTTGACGCCAATTCTTGTCCTGGC
GGTATTACGTTATTATCAACTACTAAGTGATTTTTCTAAAGGTCCTCTCTTAACAGTCAATTTGCTCACATTTTTAG
GGCAACTTCTTATTTTTCTGATTGGATTTGGACTTGTGGCCACAATTTTACGAAGAAGTTTAGTCCAAGATTCTCCT
AAAATGAAAATTGGCACTTACATTGTTGTTGGGACTATAGTTCTTCTAGTTGTTTTAGGATATGTAGGAATGGCAA
GCTTCATACAAGAAGGAGCCTTTTATATTCCGGCTCCCTGGGATAGTTTGTCTGTCTTTACGATTTCGCTAGTTATC
GGTATTTGGAATTGGAAAGAAGCGGTCTTTCGTCCATTTGTCAGTATGATTATTGCCCATCTTGTGGTGGGTTCTCT
GCTCCGTTATTATGAGTGGATGGGAATTTCAAATGTTTTCCTTACAAAAGTTATTCCTTTAGCTGTCCTCTTTATTG
GAATCTTTGTCTTGTTCCGTGGGTTTAAGAAGATAAAATGGAGTGAAGTATAG
MKKMKYYEETSALLHEFSEENQKYFEELWESFNLAGFLYDEDYLREQIYLMMLDFSEAERDGMSAEDYLGKNPKKIM
KEILKGAPRSSIKESLLTPILVLAVLRYYQLLSDFSKGPLLTVNLLTFLGQLLIFLIGFGLVATILRRSLVQDSPKMKIGTYI
VVGTIVLLVVLGYVGMASFIQEGAFYIPAPWDSLSVFTISLVIGIWNWKEAVFRPFVSMIIAHLVVGSLLRYYTEWMGISN
VFLTKVIPLAVLFIGIFVLFRGFKKIKWSEVZ
ID46 348bp
CTGTTTTTTTATTTATACTCAATGAAAATCAAAGAGCAAACTAGGAAGCTAGCCGCAGGTTGCTCAAAACACTGTT
TTGAGGTTGTAGACGAAACTGACGAAGTCAGCTCAAAACATGTTTTTGAGGTTGTAGATGAAACTGACGAAGTCA
GCTCAAAACACTGTTTTGAGGTTGTAGATGAAACTGACGAAGTCAGCTCAAAACACTGTTTTGAGGTTGTAGATG
AAACTGACGAAGTCAGCTCAAAACATGTTTTTGAGGTTGTAGATGAAACTGACGAAGTCAGTAACCATACATACG
GTAGGGCGACGCTGACGTGGTTTTGAAGAGATTTTCGAAGAGTATTAA
MFFYLYSMKIKEQTRKLAAGCSKHCFEVVDETDEVSSKHVFEVVDETDEVSSKHCFEVVDETDEVSSKHCFEVVDETD
EVSSKHVFEVVDETDEVSNHTYGRATLTWFEEIFEEYZ
ID47 1260bp
ATGCAGAATCTGAAATTTGCCTTTTCATCTATCATGGCTCACAAGATGCGTTCTTTGCTTACTATGATTGGGATTAT
TATCGGTGTTTCATCAGTTGTTGTGATTATGGCTTTGGGTGATTCCCTATCTCGTCAAGTCAATAAAGATATGACTA
AATCTCAGAAAAATATTAGCGTCTTTTTCTCTCCTAAAAAAAGTAAAGACGGGTCTTTTACTCAGAAACAATCAGC
TTTTACGGTTTCTGGAAAGGAAGAGGAAGTTCCTGTTGAACCGCCAAAACCGCAAGAATCCTGGGTCCAAGAGGC
AGCTAAACTGAAGGGAGTGGATAGTTACTATGTAACCAATTCAACGAATGCCATCTTGACCTATCAAGATAAAAA
GGTTGAGAATGCTAATTTGACAGGTGGAAACAGAACTTACATGGACGCTGTTAAGAATGAAATTATTGCAGGTCG
TAGTCTGAGAGAGCAAGATTTCAAAGAGTTTGCAAGTGTCATTTTGCTAGATGAGGAATTGTCCATTAGTTTATTT
GAATCTCCTCAAGAGGCTATTAACAAGGTTGTAGAAGTCAATGGATTTAGTTACCGGGTCATTGGGGTTTATACTA
GTCCGGAGGCTAAAAGATCAAAAATATATGGGTTTGGTGGCTTGCCTATTACTACCAATATCTCCCTTGCTGCGAA
TTTTAATGTAGATGAAATAGCTAATATTGTCTTTCGAGTGAATGATACCAGTTTAACCCCAACTCTGGGTCCAGAA
CTGGCACGAAAAATGACAGAGCTTGCAGGCTTACAACAGGGAGAATACCAGGTGGCAGATGAGTCCGTTGTATTT
GCAGAAATTCAACAATCGTTTATTTTATGACGACGATTATTAGTTCCATCGCAGGGATTTCTCTCTTTGTTGGAG
GAACTGGTGTCATGAACATCATGCTGGTTTCGGTGACAGAGCGCACTCGTGAGATTGGTCTTCGTAAGGCTTTGGG
TGCAACACGTGCCAATATTTTAATTCAGTTTTTGATTGAATCCATGATTTTGACCTTGTTAGGTGGCTTAATTGGCT
TGACAATTGCAAGTGGTTTAACTGCCTTAGCAGGTTTGTTACTGCAAGGTTTAATAGAAGGTATAGAAGTTGGAGT
ATCAATCCCAGTCGCCCTATTTAGTCTTGCAGTTTCGGCTAGTGTTGGTATGATTTTTGGAGTCTTGCCAGCCAAC
AAGGCATCGAAACTTGATCCAATTGAAGCCCTTCGTTATGAATGA
MQNLKFAFSSTMAHKMRSLLTMIGIIIGVSSVVVIMALGDSLSRQVNKDMTKSQKNISVFFSPKSKDGSFTQKQSAFTVS
GKEEEVPVEPPKPQESWVQEAAKLKGVDSYYVTNSTNAILTYQDKKTVENANLTGGNRTYMDAVKNEIIAGRSLREQDF
KEFASVILLDEELSISLFESPQEAINKVVEVNGFSYRVIGVYTSPEAKRSKIYGFGGLPITTNISLAANFNVDEIANIVFRVN
DTSLTPTLGPELARKMTELAGLQQGEYQVADESVVFAEIQQSFSFMTTIISSIAGISLFVGGTGVMNIMLVSVTERTREIG
LRKALGATRANILIQFLIESMILTLLGGLIGLTIASGLTALAGLLLQGLIEGIEVGVSIPVALFSLAVSASVGMIFGVLPANK
ASKLDPIEALRYEZ
ID48 705bp
CTGATGAAGCAACTAATTAGTCTAAAAAATATCTTCAGAAGTTACCGTAATGGTGACCAAGAACTGCAGGTTCTC
AAAAATATCAATCTAGAAGTGAATGAGGGTGAATTTGTAGCCATCATGGGACCATCTGGGTCTGGTAAGTCCACT
CTGATGAATACGATTGGCATGTTGGATACACCAACCAGTGGAGAATATTATCTTGAAGGTCAAGAAGTGGCTGGG
CTTGGTGAAAAACAACTAGCTAAGGTCCGTAACCAACAAATCGGTTTTGTCTTTCAGCAGTTCTTTCTTCTATCGA
AGCTCAATGCTCTGCAAAATGTAGAATTGCCCTTGATTTACGCAGGAGTTTCGTCTTCAAAACGTCGCAAGTTGGC
TGAGGAATATTTAGACAAGGTTGAATTGACAGAACGTAGTCACCATTTACCTTCAGAATTATCTGGTGGTCAAAA
GCAACGTGTAGCCATTGCGCGTGCCTTGGTAAACAATCCTTCTATTATCCTAGCGGATGAACCGACAGGAGCCTTG
GATACCAAAACAGGTAACCAAATTATGCAATTATTGGTTGATTTGAATAAAGAAGGAAAAACCATTATCATGGTA
ACGCATGAGCCTGAGATTGCTGCCTATGCCAAACGTCAGATTGTCATTCGGGATGGGGTCATTTCGTCTGACAGTG
CTCAGTTAGGAAAGGAGGAAAACTAA
MMKQLISLKNIFRSYRNGDQELQVLKNINLEVNEGEFVAIMGPSGSGKSTLMNTIGMLDTPTSGEYYLEGQEVAGLGEK
QLAKVRNQQIGFVFQQFFLLSKLNALQNVELPLIYAGVSSSKRRKLAEEYLDKVELTERSHHLPSELSGGQKQRVAIARA
LVNNPSIILADEPTGALDTKTGNQIMQLLVDLNKEGKTIIMVTHEPEIAAYAKRQIVIRDGVISSDSAQLGKEENZ
ID49 1200bp
ATGAAGAAAAAGAATGGTAAAGCTAAAAAGTGGCAACTGTATGCAGCAATCGGTGCTGCGAGTGTAGTTGTATTG
GGTGCTGGGGGGATTTTACTCTTTAGACAACCTTCTCAGACTGCTCTAAAAGATGAGCCTACTCATCTTGTTGTTG
CCAAGGAAGGAAGCGTGGCCTCCTCTGTTTTATTGTCAGGGACAGTAACAGCAAAAAATGAACAATATGTTTATT
TTGATGCTAGTAAGGGTGATTTAGATGAAATCCTTGTTTCTGTGGGCGATAAGGTCAGCGAAGGGCAGGCTTTAGT
CAAGTACAGTAGTTCAGAAGCGCAGGCGGCCTATGATTCAGCTAGTCGAGCAGTAGCTAGGGCAGATCGTCATAT
CAATGAACTCAATCAAGCACGAAATGAAGCCGCTTCAGCTCCGGCTCCACAGTTACCAGCGCCAGTAGGAGGAGA
AGATGCAACGGTGCAAAGCCCAACTCCAGTGGCTGGAAATTCTGTTGCTTCTATTGACGCTCAATTGGGTGATGCC
CGTGATGCGCGTGCAGATGCTGCGGCGCAATTAAGCAAGGCTCAAAGTCAATTGGATGCAACAACTGTTCTCAGT
ACCCTAGAGGGAACTGTGGTCGAAGTCAATAGCAATGTTTCTAAATCTCCAACAGGGGCGAGTCAAGTTATGGTT
CATATTGTCAGCAATGAAAATTTACAAGTCAAGGGAGAATTGTCTGAGTACAATCTAGCCAACCTTTCTGTAGGTC
AAGAAGTAAGCTTTACTTCTAAAGTGTATCCTGATAAAAAATGGACTGGGAAATTAAGCTATATTTCTGACTATCC
TAAAAACAATGGTGAAGCAGCTAGTCCAGCAGCCGGGAATAATACAGGTTCTAAATACCCTTATACTATTGATGT
GACAGGCGAGGTTGGTGATTTGAAACAAGGTTTTTCTGTCAACATTGAGGTTAAAAGCAAAACTAAGGCTATTCTT
GTTCCTGTTAGCAGTCTAGTAATGGATGATAGTAAAAATTATGTCTGGATTGTGGATGAACAACAAAAGGCTAAA
AAAGTTGAGGTTTCATTGGGAAATGCTGACGCAGAAAATCAAGAAATCACTTCTGGTTTAACGAACGGTGCTAAG
GTCATCAGTAATCCAACATCTTCCTTGGAAGAAGGAAAAGAGGTGAAGGCTGATGAAGCAACTAATTAG
MKKKNGKAKKWQLYAAIGAASVVVLGAGGILLFRQPSQTALKDEPTHLVVAKEGSVASSVLLSGTVTAKNEQYVYFD
ASKGDLDEILVSVGDKVSEGQALVKYSSSEAQAAYDSASRAVARADRHINELNQARNEAASAPAPQLPKPVGGEDATV
QSPTPVAGNSVASIDAQLGDARDARADAAAQLSKAQSQLDATTVLSTLEGTVVEVNSNVSKSPTGASQVMVHIVSNEN
LQVKGELSEYNLANLSVGQEVSFTSKVYPDKKWTGKLSYISDYPKNNGEAASPAAGNNTGSKYPYTIDVTGEVGDLKQ
GFSVNIEVKSKTKAILVPVSSLVMDDSKNYVWIVDEQQKAKKVEVSLGNADAENQETTSGLTNGAKVISNPTSSLEEGKE
VKADEATNZ
ID50 759bp
ATGTCACGTAAACCATTTATCGCTGGTAACTGGAAAATGAACAAAAATCCAGAAGAAGCTAAAGCATTCGTTGAA
GCAGTTGCATCAAAACTTCCTTCATCAGATCTTGTTGAAGCAGGTATCGCTGCTCCAGCTCTTGATTTGACAACTG
TTCTTGCTGTTGCAAAAGGCTCAAACCTTAAAGTTGCTGCTCAAAACTGCTACTTTGAAAATGCAGGTGCTTTCAC
TGGTGAAACTAGCCCACAAGTTTTGAAAGAAATCGGTACTGACTACGTTGTTATCGGTCACTCAGAACGCCGTGA
CTACTTCCATGAAACTGATGAAGATATCAACAAAAAAGCAAAAGCAATCTTTGCGAACGGTATGCTTCCAATCAT
CTGTTGTGGTGAATCACTTGAAACTTACGAAGCTGGTAAAGCTGCTGAATTCGTAGGTGCTCAAGTATCTGCTGCA
TTGGCTGGATTGACTGCTGAACAAGTTGCTGCCTCAGTTATCGCTTATGAGCCAATCTGGGCTATCGGTACTGGTA
AATCAGCTTCACAAGACGATGCACAAAAAATGTGTAAAGTTGTTCGTGACGTTGTAGCTGCTGACTTTGGTCAAG
AAGTCGCAGACAAAGTTCGTGTTCAATACGGTGGTTCTGTTAAACCTGAAAATGTTGCTTCATACATGGCTTGCCC
AGACGTTGACGGTGCCCTTGTAGGTGGTGCGTCACTTGAAGCTGAAAGCTTCTTGGCTTTGCTTGACTTTGTAAAA
TAA
MSRKPFIAGNWKMNKNPEEAKAFVEAVASKLPSSDLVEAGIAAPALDLTTVLAVAKGSNLKVAAQNCYFENAGAFTG
ETSPQVLKEIGTDYVVIGHSERRDYFHETDEDINKKAKAIFANGMLPIICCGESLETYEAGKAAEFVGAQVSAALAGLTA
EQVAASVIAYEPIWAIGTGKSASQDDAQKMCKVVRDVVAADFGQEVADKVRVQYGGSVKPENVASYMACPDVDGAL
VGGASLEAESFLALLDFVKZ
ID51 1473bp
TTGAAAACAAAAATTGGATTAGCAAGTATCTGTTTACTAGGCTTGGCAACTAGTCATGTCGCTGCAAATGAAACTG
AAGTAGCAAAAACTTCGCAGGATACAACGACAGCTTCAAGTAGTTCAGAGCAAAATCAGTCTTCTAATAAAACGC
AAACGAGCGCAGAAGTACAGACTAATGCTGCTGCCCACTGGGATGGGGATTATTATGTAAAGGATGATGGTTCTA
AAGCTCAAAGTGAATGGATTTTTGACAACTACTATAAGGCTTGGTTTTATATTAATTCAGATGGTCGTTACTCGCA
GAATGAATGGCATGGAAATTACTACCTGAAATCAGGTGGATATATGGCCCAAAACGAGTGGATCTATGACAGTAA
TTACAAGAGTTGGTTTTATCTCAAGTCAGATGGGGCTTATGCTCATCAAGAATGGCAATTGATTGGAAATAAGTGG
TACTACTTCAAGAAGTGGGGTTACATGGCTAAAAGCCAATGGCAAGGAAGTTATTTCTTGAATGGTCAAGGAGCT
ATGATGCAAAATGAATGGCTCTATGATCCAGCCTATTCTGCTTATTTTTATCTAAAATCCGATGGAACTTATGCTA
ACCAAGAGTGGCAAAAAGTGGGCGGCAAATGGTACTATTTCAAGAAGTGGGGCTATATGGCTCGGAATGAGTGGC
AAGGCAACTACTATTTGACTGGAAGTGGTGCCATGGCGACTGACGAAGTGATTATGGATGGTACTCGCTATATCTT
TGCGGCCTCTGGTGAGCTCAAAGAAAAAAAAGATTTGAATGTCGGCTGGGTTCACAGAGATGGTAAGCGCTATTT
CTTTAATAATAGAGAAGAACAAGTGGGAACCGAACATGCTAAGAAAGTCATTGATATTAGTGAGCACAATGGTCG
TATCAATGATTGGAAAAAGGTTATTGATGAGAACGAAGTGGATGGTGTCATTGTTCGTCTAGGTTATAGCGGTAA
AGAAGACAAGGAATTGGCGCATAACATTAAGGAGTTAAACCGTCTGGGAATTCCTTATGGTGTCTATCTCTATAC
CTATGCTGAAAATGAGACCGTGCTGAGAGTGACGCTAAACAGACCATTGAACTTATAAAGAAATACAATATGAAC
CTGTCTTACCCTATCTATTATGATGTTGAGAATTGGGAATATGTAAATAAGAGCAAGAGAGCTCCAAGTGATACA
GGCACTTGGGTTAAAATCATCAACAAGTACATGGACACGATGAAGCAGGCGGGTTATCAAAATGTGTATGTCTAT
AGCTATCGTAGTTTATTACAGACGCGTTTAAAACACCCAGATATTTTAAAACATGTAAACTGGGTAGCGGCCTATA
CGAATGCTTTAGAATGGGAAAACCCTCATTATTCAGGAAAAAAAGGTTGGCAATATACCTCTTCTGAATACATGA
AAGGAATCCAAGGGCGCGTAGATGTCAGCGTTTGGTATTAA
MKTKIGLASICLLGLATSHVAANETEVAKTSQDTTTASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQ
SEWIFDNYYKAWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYY
FKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVGGKWYYFKKWGYMARNE
WQGNYYLTGSGAMATDEVIMDGTRYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVIDISEHNG
RINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYLYTYAENETDAESDAKQTIELIKKYNMNLSY
PIYYDVENWEYVNKSKRAPSDTGTWVKIINKYMDTMKQAGYQNVYVYSYRSLLQTRLKHPDILKHVNWVAAYTNAL
EWENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWYZ
ID 52 774bp
ATGAAAAAATTTGCCAACCTTTATCTGGGACTGGTCTTTCTGGTCCTCTACCTGCCTATCTTTTACTTGATTGGCTA
TGCCTTTAATGCTGGTGATGATATGAATAGCTTTACAGGTTTTAGCTGGACTCACTTTGAAACCATGTTTGGAGAT
GGGAGACTCATGCTGATTTTGGCTCAGACATTTTTCTTGGCCTTCCTATCAGCCTTGATAGCGACCATTATCGGGA
CTTTTGGTGCCATTTACATCTACCAGTCTCGTAAGAAATACCAAGAAGCCTTTCTATCACTCAATAATATCCTCAT
GGTTGCGCCTGACGTTATGATTGGTGCTAGCTTCTTGATTCTCTTTACCCAACTCAAGTTTTCACTTGGCTTTTTGA
CCGTTCTATCTAGTCACGTGGCCTTCTCCATTCCTATCGTGGTCTTGATGGTCTTGCCTCGACTCAAGGAAATGAA
TGGCGACATGATTCATGCGGCCTATGACTTGGGAGCTAGTCAATTTCAGATGTTCAAGGAAATCATGCTTCCTTAC
CTGACTCCGTCTATCATTACTGGTTATTTCATGGCCTTCACCTATTCGTTAGATGACTTTGCCGTGACCTTCTTTGT
AACAGGAAATGGCTTTTCAACCCTATCAGTCGAGATTTACTCTCGTGCTCGCAAGGGGATTTCCTTAGAAATCAAT
GCCCTGTCTGCTCTAGTCTTTCTCTTTAGTATTATCCTAGTTGTAGGTTATTACTTTATCTCTCGTGAGAAGGAGGA
GCAAGCATGA
MKKFANLYLGLVFLVLYLPIFYLIGYAFNAGDDMNSFTGFSWTHFEFMFGDGRLMLILAQTFFLAFLSALIATIIGTFGA
IYIYQSRKKYQEAFLSLNNILMVAPDVMIGASFLILFTQLKFSLGFLTVLSSHVAFSIPIVVLMVLPRLKEMNGDMIHAAY
DLGASQFQMFKEIMLPYLTPSIITGYFMAFTYSLDDFAVTFFVTGNGFSTLSVEIYSRARKGISLEINALSALVFLFSIILVV
GYYFISREKEEQAZ
ID 59 1071bp
ATGAAAAAAATCTATTCATTTTTAGCAGGAATTGCAGCGATTATCCTTGTCTTGTGGGGAATTGCGACTCATTTAG
ATAGTAAAATCAATAGTCGAGATAGTCAAAAATTGGTTATCTATAACTGGGGAGACTATATCGATCCTGAACTCTT
GACTCAGTTTACAGAAGAAACAGGAATTCAAGTTCAGTACGAGACTTTTGACTCCAACGAAGCCATGTACACTAA
GATAAAGCAGGGTGGAACGACCTACGATATTGCCATTCCAAGTGAATACATGATTAACAAGATGAAGGACGAAG
ACCTCTTGGTTCCGCTTGATTATTCAAAAATTGAAGGAATCGAAAATATCGGACCAGAGTTTCTCAACCAGTCCTT
TGACCCAGGTAATAAATTCTCCATCCCTTACTTCTGGGGAACCTTAGGAATTGTCTACAACGAAACCATGGTAGAT
GAAGCGCCTGAGCATTGGGATGACCTTTGGAAGCCGGAGTATAAGAATTCTATCATGCTCTTTGATGGGGCGCGT
GAGGTGCTGGGACTAGGACTCAATTCCCTCGGCTACAGCCTCAACTCCAAGGATCTGCAGCAGTTGGAAGAGACA
GTGGATAAGCTCTACAAACTGACTCCAAATATCAAGGCTATCGTTGCGGACGAGATGAAGGGCTATATGATTCAG
AATAATGTTGCAATCGGCGTGACCTTCTCTGGTGAAGCCAGCCAAATGTTAGAAAAAAATGAAAATCTACGTTAT
GTGGTACCGACAGAGGCCAGCAATCTTTGGTTTGACAATATGGTCATTCCCAAAACAGTTAAAAACCAAAACTCA
GCCTATGCCTTTATCAACTTTATGTTGAAACCTGAAAATGCTCTCCAAAATGCGGAGTATGTCGGCTATTCAACAC
CAAACCTACCAGCGAAGGAATTGCTCCCAGAGGAAACAAAGGAAGATAAGGCCTTCTATCCCGATGTTGAAACCA
TGAAACACCTAGAAGTTTATGAGAAATTTGACCATAAATGGACAGGGAAATATAGCGACCTCTTCCTACAGTTTA
AAATGTATCGGAAGTAG
MKKIYSFLAGIAAIILVLWGIATHLDSKINSRDSQKLVIYNWGDYIDPELLTQFTEETGIQVQYETFDSNEAMYTKIKQGG
TTYDIAIPSEYMINKMKDEDLLVPLDYSKIEGIENIGPEFLNQSFDPGNKFSIPYFWGTLGIVYNETMVDEAPEHWDDLW
KPEYKNSIMLFDGAREVLGLGLNSLGYSLNSKDLQQLEETVDKLYKLTPNIKAIVADEMKGYMIQNNVAIGVTFSGEAS
QMLEKNENLRYVVPTEASNLWFDNMVIPKTVKNQNSAYAFINFMLKPENALQNAEYVGYSTPNLPAKELLPEETKED
KAFYPDVETMKHLEVYEKFDHKWTGKYSDLFLQFKMYRKZ
ID 61 1851bp
ATGAATAAAAAACTAACAGATTATGTGATTGATCTGGTGGAAATTTTAAATAAACAACAAAAGCAGGTTTTCTGG
GGAATATTTGATATTTTCAGTATGGTGGTCCATCATTGTATCTTATATTTTATTTTATGGGCTGATTAATCCAGC
ACCTGTTGACTACATTATCTATACGAGTTTGGCCTTCCTGTTCTATCAATTGATGATTGGTTTTTGGGGGTTGAACG
CGAGCATTAGTCGTTACAGCAAGATTACGGATTTCATGAAAATCTTTTTTGGTGTGACTGCTAGCAGTGTCTTGTC
ATATAGTATCTGTTATGCCTTCTTGCCACTCTTCTCCATCCGTTTCATCATTCTCTTTATCTTGTTGAGTACCTTCTT
GATTTTATTGCCACGGATTACTTGGCAGTTAATCTACTCCAGACGCAAAAAAGGTAGTGGTGATGGAGAACACCG
TCGGACCTTCTTGATTGGTGCCGGTGATGGTGGGGCTCTTTTTATGGATAGTTACCAACATCCAACCAGTGAATTA
GAACTGGTCGGTATTTTGGATAAGGATTCTAAGAAAAAGGGTCAAAAACTTGGTGGTATTCCTGTTTTGGGCTCTT
ATGACAATCTGCCTGAATTAGCCAAACGCCATCAAATCGAGCGTGTCATCGTTGCGATTCCGTCGCTGGATCCGTC
AGAATATGAGCGTATCTTGCAGATGTGTAATAAGCTGGGTGTCAAATGTTACAAGATGCCTAAGGTTGAAACTGT
TGTTCAGGGCCTTCACCAAGCAGGTACTGGCTTCCAAAAAATTGATATTACGGACCTTTTGGGTCGTCAGGAAATC
CGTCTTGACGAATCGCGTCTGGGTGCAGAACTGACAGGTAAGACCATCTTAGTCACAGGAGCTGGAGGTTCAATC
GGTTCTGAAATCTGTCGTCAAGTTAGTCGCTTCAATCCTGAACGCATTGTCTTGCTCGGTCATGGGGAAAACTCAA
TCTACCTTGTTTATCATGAATTGATTCGTAAGTTCCAAGGGATTGATTATGTACCTGTGATTGCGGACATTCAAGA
CTATGATCGTTTGTTGCAAGTCTTTGAGCAGTACAAACCTGCTATTGTTTATCATGCGGCAGCCCACAAGCATGTT
CCTATGATGGAGCGCAATCCAAAAGAAGCCTTCAAAAACAATATCCGTGGAACTTACAATGTTGCTAAGGCTGTT
GATGAAGCTAAAGTGTCTAAGATGGTTATGATTTCGACAGATAAGGCAGTCAATCCACCAAATGTTATGGGAGCA
ACCAAGCGCGTGGCGGAGTTGATTGTCACTGGCTTTAACCAACGTAGCCAATCAACCTACTGTGCAGTTCGTTTTG
GGAATGTTCTTGGTAGCCGTGGTAGTGTCATTCCAGTCTTTGAACGTCAGATTGCTGAAGGTGGGCCTGTAACGGT
GACAGACTTCCGTATGACCCGTTACTTTATGACCATTCCAGAAGCTAGCCGTCTGGTTATCCATGCTGGTGCTTAT
GCCAAAGATGGGGAAGTCTTTATCCTTGATATGGGCAAACCAGTCAAGATTTATGACTTGGCCAAGAAGATGGTG
CTTCTAAGTGGCCACACTGAAAGTGAAATTCCAATCGTTGAAGTTGGAATCCGCCCAGGTGAAAAACTCTACGAA
GAACTCTTGGTATCAACCGAACTCGTTGATAATCAAGTTATGGATAAGATTTTCGTTGGTAAGGTTAATGTCATGC
CTTTAGAATCCATCAATCAAAAGATTGGAGAGTTCCGCACTCTCAGTGGAGATGAGTTGAAGCAAGCTATTATCG
CCTTTGCTAATCAAACAACCCACATTGAATAA
MNKKLTDYVIDLVEILNKQQKQVFWGIFDIFSMVVSIIVSYILFYGLINPAPVDYIIYTSLAFLFYQLMIGFWGLNASISRY
SKITDFMKIFFGVTASSVLSYSICYAFLPLFSIRFIILFILLSTFLILLPRITWQLIYSRRKKGSGDGEHRRTFLIGAGDGGALF
MDSYQHPTSELELVGILDKDSKKKGQKLGGIPVLGSYDNLPELAKRHQIERVIVAIPSLDPSEYERILQMCNKLGVKCYK
MPKVETVVQGLHQAGTGFQKIDITDLLGRQEIRLDESRLGAELTGKTILVTGAGGSIGSEICRQVSRFNPERIVLLGHGEN
SIYLVYHELIRKFQGIDYVPVIADIQDYDRLLQVFEQYKPAIVYHAAAHKHVPMMERNPKEAFKNNIRGTYNVAKAVD
EAKVSKMVISTDKAVNPPNVMGATKRVAELIVTGFNQRSQSTYCAVRFGNVLGSRGSVIPVFERQIAEGGPVTVTDFR
MTRYFMTIPEASRLVIHAGAYAKDGEVFILDMGKPVKIYDLAKKMVLLSGHTESEIPIVEVGIRPGEKLYEELLVSTELV
DNQVMDKIFVGKVNVMPLESINQKIGEFRTLSGDELKQAIIAFANQTTHIEZ
ID 101 1338bp
ATGATTGAACTTTATGATAGTTACAGTCAAGAAAGTCGAGATTTACATGAAAGTCTAGTCGCTACTGGTCTTTCTC
AACTTGGAGTGGTCATCGATGCAGATGGTTTTCTGCCTGATGGTCTGCTTTCTCCTTTTACCTATTATCTAGGTTAC
GAGGATGGAAAACCTCTCTATTTTAATCAAGTTCCCGTTTCAGATTTTTGGGAAATTTTAGGAGATAATCAGTCTG
CTTGTATTGAAGATGTGACGCAGGAGAGGGCTGTCATTCATTATGCTGATGGAATGCAGGCTCGCTTGGTTAAACA
GGTAGACTGGAAAGACCTAGAAGGTCGAGTACGTCAGGTTGACCACTACAATCGCTTCGGAGCTTTGTTTTGCTAC
AACGACTTATAGCGCAGATAGCGAGCCGATTATGACAGTTTACCAAGATGTCAATGGTCAACAAGTTTTACTGGA
AAACCATGTGACGGGTGATATCTTATTGACTTTGCCAGGTCAGTCCATGCGTTACTTTGCAAATAAAGTTGAATTT
ATCACCTTCTTTTTGCAAGATTTGGAAATAGATACCAGTCAGCTTATCTTTAATACTCTAGCGACTCCTTTCTTGGT
TTCCTTCCATCATCCAGATAAATCTGGCTCGGATGTCTTGGTATGGCAGGAACCTCTCTATGATGCCATTCCAGGT
AATATGCAGTTGATTTTGGAAAGTGATAATGTGCGTACTAAGAAGATCATCATTCCAAATAAGGCGACTTATGAG
CGCGCTTTAGAGTTAACTGACGAGAAATACCATGATCAGTTTGTGCACTTGGGTTATCATTACCAGTTCAAACGTG
ATAATTTCCTAAGACGAGATGCCTTAATCTTGACCAATTCAGATCAGATTGAGCAAGTAGAAGCAATCGCAGGAG
CCTTGCCTGATGTCACTTTCCGTATTGCAGCGGTGACAGAGATGTCTTCTAAGCTCTTAGACATGCTTTGCTATCCT
AATGTGGCCCTTTACCAGAACGCTAGTCCACAGAAGATTCAGGAGCTGTATCAACTGTCGGATATTTACTTGGATA
TAAACCACAGTAATGAGTTGCTACAGGCAGTGCGTCAGGCCTTTGAGCACAATCTCTTGATTCTTGGCTTTAATCA
GACGGTGCACAATAGACTTTATATCGCTCCAGACCATCTATTTGAAAGTAGTGAAGTTGCTGCTTTGGTTGAGACC
ATTAAATTGGCCTTTCAGATGTTGATCAAATGCGTCAGGCACTTGGCAAACAAGGCCAACATGCAAATTATGTTG
ACTTGGTGAGATATCAGGAAACCATGCAAACTGTTTTAGGAGGCTAA
MIELYDSYSQESRDLHESLVATGLSQLGVVIDADGFLPDGLLSPFTYYLGYEDGKPLYFNQVPVSDFWEILGDNQSACIE
DVTQERAVIHYADGMQARLVKQVDWKDLEGRVRQVDHYNRFGACFATTTYSADSEPIMTVYQDVNGQQVLLENHV
TGDILLTLPGQSMRYFANKVEFTTFFLQDLEIDTSQLIFNTLATPFLVSFHHPDKSGSDVLVWQEPLYDAIPGNMQLILES
DNVRTKKIIIPNKATYERALELTDEKYHDQFVHLGYHYQFKRDNFLRRDALILTNSDQIEQVEATAGALPDVTFRIAAVT
EMSSKLLDMLCYPNVALYQNASPQKIQELYQLSDIYLDINHSNELLQAVRQAFEHNLLILGFNQTVHNRLYIAPDHLFE
SSEVAALVETIKLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGGZ
ID 102 1512bp
ATGACAATTTACAATATAAATTTAGGAATTGGTTGGGCTAGTAGCGGTGTTGAATACGCTCAAGCCTATCGTGCTG
GTGTTTTTCGGAAATTAAATCTGTCCTCTAAGTTTATCTTTACAGATATGATTTTAGCCGATAATATTCAGCACTTA
ACAGCCAATATTGGTTTTGATGATAATCAGGTTATCTGGCTTTATAATCATTTCACAGATATCAAAATTGCACCTA
CTAGCGTGACAGTGGATGATGTCTTGGCTTACTTTGGTGGTGAAGAAAGTCACAGAGAAAAAAATGGCAAGGTTT
TACGTGTATTTCTTTTTTGACCAAGATAAGTTTGTAACCTGTTATTTGGTTGATGAGAACAAGGACTTGGTTCAACA
TGCCGAGTATGTTTTTAAGGGAAACCTGATTCGGAAGGATTACTTTTCTTATACGCGTTATTGTAGCGAGTATTTT
GCTCCCAAGGACAATGTTGCAGTCTTATACCAACGAACTTTTTATAATGAAGACGGGACTCCAGTCTATGATATCT
TGATGAATCAAGGGAAGGAAGAAGTTTATCATTTCAAGGATAAGATTTTCTATGGAAAGCAAGCTTTTGTGCGTG
CCTTTATGAAATCTTTGAATTTGAATAAGTCTGATTTGGTCATTCTCGATAGGGAGACAGGTATTGGACAGGTTGT
GTTTGAGGAAGCACAGACAGCACATCTAGCGGTAGTTGTTCATGCGGAGCATTATAGTGAAAATGCTACAAATGA
GGACTATATCCTTTGGAATAACTATTATGACTATCAGTTTACCAATGCAGATAAGGTTGACTTCTTTATCGTGTCT
ACTGATAGACAAAATGAAGTTCTACAAGAGCAATTTGCCAAATATACTCAGCATCAGCCAAAGATTGTTACCATT
CCTGTAGGCAGTATTGATTCCTTGACAGATTCAAGTCAAGGGCGCAAACCATTTTCATTGATTACGGCTTCACGTC
TTGCCAAAGAAAAGCACATTGATTGGCTTGTGAAAGCTGTGATTGAAGCTCATAAGGAGTTACCGGAACTAACCT
TTGATATCTATGGTAGTGGTGGAGAAGATTCTCTGCTTAGAGAAATTATTGCAAATCATCAGGCAGAGGACTATAT
CCAACTCAAGGGGCATGCGGAACTTTCGCAGATTTATAGCCAGTATGAGGTCTACTTAACGGCTTCTACCAGCGA
AGGATTTGGTCTGACCTTGATGGAAGCTATTGGTTCAGGTCTACCTCTAATTGGTTTTGATGTGCCTTATGGTAATC
AGACCTTTATAGAGGATGGGCAAAATGGTTATTTGATTCCAAGTTCATCTGACCATGTAGAAGACCAAATCAAGC
AAGCTTATGCCGCTAAGATTTGTCAATTGTATCAAGAAAATCGTTTGGAAGCTATGCGTGCCTATTCTTACCAAAT
TGCAGAAGGCTTCTTGACCAAAGAAATTTTAGAAAAGTGGAAGAAAACAGTAGAGGAGGTGCTCCATGATTGA
MTIYNINLGIGWASSGVEYAQAYRAGVFRKLNLSSKFIFTDMILADNIQHLTANIGFDDNQVIWLYNHFTDIKIAPTSVT
VDDVLAYFGGEESHREKNGKVLRVFFFDQDKFVTCYLVDENKDLVQHAEYVFKGNLIRKDYFSYTRYCSEYFAPKDN
VAVLYQRTFYNEDGTPVYDILMNQGKEEVYHFKDKIFYGKQAFVRAFMKSLNLNKSDLVILDRETGIGQVVFEEAQTA
HLAVVVHAEHYSENATNEDYILWNNYYDYQFTNADKVDFFIVSTDRQNEVLQEQFAKYTQHQPKIVTIPVGSIDSLTDS
SQGRKPFSLITASRLAKEKHIDWLVKAVIEAHKELPELTFDIYGSGGEDSLLREIIANHQAEDYIQLKGHAELSQIYSQYE
VYLTASTSEGFGLTLMEAIGSGLPLIGFDVPYGNQTFIEDGQNGYLIPSSSDHVEDQIKQAYAAKICQLYQENRLEAMRA
YSYQIAEGFLTKEILEKWKKTVEEVLHDZ
ID 103 2292bp
ATGTCCTCTCTTTCGGATCAAGAATTAGTAGCTAAAACAGTAGAGTTTCGTCAGCGTCTTTCCGAGGGAGAAAGTC
TAGACGATATTTTGGTTGAAGCTTTTGCTGTGGTGCGTGAAGCAGATAAGCGGATTTTAGGGATGTTTCCTTATGA
TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT
GACAGCTACCATGCCTGTCTATTTGAACGCTTTTTCAGGAGAAGGAGTGATGGTTGTGACTCCTAATGAGTATTTA
TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCGTTTTCTAGGATTGACCATTGGTGTACCATTTACGGAAG
ATCCAAAGAAGGAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCTTCGGATATCATCTACACAACCAATAGTA
ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAGTTTTTACGACCGTTTAACTATGT
GATTATTGATGAAATTGATGATATCTTGCTTGATAGTGCACAAACTCCTCTGATTATTGCGGGTTCTCCTCGTGTTC
AGTCTAATTACTATGCGATCATTGATACACTTGTAACAACCTTGGTCGAAGGAGAGGATTATATCTTTAAAGAGGA
GAAAGAGGAGGTTTGGCTCACTACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGATAATTTATACAA
GGAAGAGCATGCGTCTTTTGCTCGTCATTTGGTTTATGCGATTCGAGCTCATAAGCTCTTTACTAAAGATAAGGAC
TATATCATTCGTGGAAATGAGATGGTACTGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAAACTTCAA
GGAGGTCTCCATCAGGCTATTGAAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCGATC
ACCTATCAGAGTCTTTTTAAGATGTTTAATAAGATATCTGGTATGACAGGGACAGGTAAGGTCGCGGAAAAAGAG
TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGATTGACTATCCAGATA
ATCTATATATCACTTTACCTGAAAAAGTGTATGCATCCTTGGAGTACATCAAGCAATACCATGCTAAGGGAAATCC
TTTACTCGTTTTTGTAGGCTCAGTTGAAATGTCTCAACTCTATTCGTCTCTCTTGTTTCGTGAAGGGATTGCCCATA
ATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGATGGGGGCTGTGACAG
TGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGTCGCAGAGCTTGGGGGCTTGATTG
TTATTGGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATTCGTGGCCGTTCTGGTCGTCAGGGAGATC
CTGGTATGAGTAAATTTTTTGTATCCTTAGAGGATGATGTTATCAAGAAATTTGGTCCATCTTGGGTGCATAAAAA
GTACAAAGACTATCAGGTTCAAGATATGACTCAACCGGAAGTATTGAAAGGTCGTAAATACCGGAAACTAGTCGA
AAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTCTGGAGTATGCTGAAAGTATGAA
TATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCTCGTGACTTAGAGGATGTTGTTGTG
GATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTATTGTTTCACTTTATTG
TGACCAATATTAGTTTTCATGTTAAAGAGGGTTCCAGATTATATAGATGTAACTGACAAAACTGCAGTTCGTAGCTT
TATGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAGAAAGAATTACTTAATCAACATGACTTATATGAACAGTT
TTTACGACTTTTCACTGCTTAAAGCCATTGATGACAACTGGGTAGAGCAGGTAGACTATCTACAACAGCTATCCATG
GCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTACGCGGGCTTTGAA
GCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGGGCTGGTTGAGGTCACTCCAAAAGGT
GAAATCGTGACTCATTTTCCATAA
MSSLSDQELVAKTVEFRQRLSEGESLDDILVEAFAVVREADKRILGMFPYDVQVMGAIVMHYGNVAEMNTGEGKTLT
ATMPVYLNAFSGEGVMVVTPNEYLSKRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEKKLIYASDIIYTTNSNLGF
DYLNDNLASNEEGKFLRPFNYVIIDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGEDYIFKEEKEEVWLTTK
GAKSAENFLGIDNLYKEEHASFARHLVYAIRAHKLFTKDKDYIIRGNEMVLVDKGTGRLMEMTKLQGGLHQAIEAKEH
VKLSPETRAMASITYQSLFKMFNKISGMTGTGKVAEKEFIETYNMSVVRIPTNRPRQRIDYPDNLYITLPEKVYASLEYIK
QYHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGKGVAE
LGGLIVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRK
LVEKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFIVT
NISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLYEQFLRLSLLKAIDDNWVEQVDYLQQLSMAIGG
QSASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ
ID 104 879bp
ATGAAACAAGAATGGTTTGAAAGTAATGATTTTGTAAAAACAACAAGCAAGAACAAGCCTGAAGAGCAAGCTCA
AGAGGTTGCAGACAAGGCTGAAGAAAGGATACCCGATCTCGATACACCAATTGAAAAAAATACTCAGTTAGAGG
AGGAAGTCTCTCAAGCTGAAGTCGAATTGGAAAGCCAGCAAGAAGAGAAAATTGAAGCTCCTGAAGACAGTGAA
GCGAGAACAGAAATAGAAGAAAAGAAGGCATCTAATTCTACTGAAGAAGAGCCAGACCTTTCTAAAGAAACAGA
AAAAGTCACTATAGCTGAAGAGAGCCAAGAAGCTCTTCCTCAGCAAAAAGCAACCACGAAAGAGCCACTTCTTAT
CAGTAAATCTTTAGAAAGTCCTTATATCCCCGACCAAGCTCCAAAATCTAGGGATAAATGGAAAGAGCAAGTGCT
TGATTTTTGGTCTTGGCTAGTGGAAGCGATCAAATCTCCTACAAGTAAGTTGGAAACAAGTATCACACACAGTTAC
ACAGCCTTTCTCTTGCTCATTCTGTTTTCTGCATCTTCCTTTTTCTTTAGTATCTATCACATCAAACATGCTTACTAT
GGACATATAGCAAGCATTAACAGTCGCTTCCCTGAGCAGCTAGCTCCTTTAACTCTTTTTTCTATCATCTCTATCCT
AGTAGCGACAACACTCTTCTTCTTTTCATTCCTCTTGGGTAGTTTCGTTGTGAGACGATTTATCCACCAGGAAAAG
GACTGGACGCTAGACAAGGTTCTCCAACAATATAGTCAACTCTTGGCAATTCCAATCTCCTCACTGCTATTGCTAG
TTTCTTTGCTTTCTTTGATAGCCTACGATTTACAGCCCTCTTGTGTGTGA
MKQEWFESNDFVKCTTSKNKPEEQAQEVADKAEERIPDLDTPIEKNTQLEEEVSQAEVELESQQEEKIEAPEDSEARTEIE
EKKASNSTEEEPDLSKETEKVTIAEESQEALPQQKATTKEPLLISKSLESPYIPDQAPKSRDKWKEQVLDFWSWLVEAIKS
PTSKLETSITHSYTAFLLLILFSASSFFFSIYHIKHAYYGHIASINSRFPEQLAPLTLFSIISILVATTLFFFSFLLGSFVVRRFIH
QEKDWTLDKVLQQYSQLLAIPISSLLLLVSLLSLIAYDLQPSCVZ
ID 106 327bp
ATGTACTTTCCAACATCCTCTGCCTTGATTGAATTTCTCATCTTGGCTGTACTGGAGCAGGGTGATTCTTATGGTTA
TGAGATTAGCCAAACCATTAAGCTGATCGCTAATATCAAAGAATCCACACTCTATCCCATTCTCAAAAAATTGGA
AGGCAATAGCTTTCTGACAACCTATTCTAGAGAGTTCCAAGGTCGCATGCGCAAATACTACTCCTTGACAAACGG
TGGTATAGAGCAGCTCTTGACCCTAAAAGATGAATGGGCACTCTATACAGACACCATCAATGGCATCATAGAAGG
GAGTATCCGCCATGACAAGAACTGA
MYFPTSSALIEFLILAVLEQGDSYGYEISQTIKLIANIKESTLYPILKKLEGNSFLTTYSREFQGRMRKYYSLTNGGIEQLLT
LKDEWALYTTDTINGIIEGSIRHDKNZ
ID 108 954bp
ATGGATTTTGAAAAAATTGAACAAGCTTATATCTATTTACTAGAGAATGTCCAAGTCATCCAAAGTGATTTGGCGA
CCAACTTTTATGACGCCTTGGTGGAGCAAAATAGCATCTATCTGGATGGTGAAACTGAGCTAAACCAGGTCAAAG
ACAACAATCAGGCCCTTAAGCGTTTAGCACTACGCAAAGAAGAATGGCTCAAGACCTACCAGTTTCTCTTGATGA
AGGCTGGGCAAACAGAACCCTTGCAGGCCAATCACCAGTTTACACCGGATGCTATTGCTTTGCTTTTGGTGTTTAT
TGTGGAAGAGTTGTTTAAAGAGGAGGAAATTACTATCCTCGAAATGGGTTCTGGGATGGGAATTCTAGGCGCTAT
TTTCTTGACCTCGCTTACTAAAAAGGTGGATTACTTGGGAATGGAAGTGGATGATTTGCTGATTGATCTGGCAGCT
AGCATGGCAGATGTAATTGGTTTGCAGGCTGGCTTTGTCCAAGGAGATGCCGTTCGCCCACAAATGCTCAAAGAA
AGCGATGTGGTCATCAGTGACTTGCCTGTCGGCTATTATCCTGATGATGCCGTTGCGTCGCGCCATCAAGTTGCTT
CTAGCCAAGAACATACTTACGCCCATCACTTGCTCATGGAACAAGGGCTTAAGTACCTCAAGTCAGACGGATACG
CTATTTTTCTAGCTCCGAGTGATTTGTTGACCAGTCCTCAAAGTGATTTGTTAAAAGAATGGCTGAAAGAAGAGGC
GAGTCTGGTTGCTATGATTAGTCTGCCTGAAAATCTCTTTGCTAATGCCAAACAATCTAAGACTATTTTTATCTTAC
AGAAGAAAAATGAAATAGCAGTAGAGCCTTTTGTTTATCCACTTGCTAGCTTGCAAGATGCAAGTGTTTTAATGAA
ATTTAAAGAAAATTTTCAAAAATGGACTCAAGGTACTGAAATATAA
MDFEKIEQAYIYLLENVQVIQSDLATNFYDALVEQNSIYLDGETELNQVKDNNQALKRLALRKEEWLKTYQFLLMKA
GQTEPLQANHQFTPDAIALLLVFIVEELFKEEEITILEMGSGMGILGAIFLTSLTKKVDYLGMEVDDLLIDLAASMADVI
GLQAGFVQGDAVRPQMLKESDVVISDLPVGYYPDDAVASRHQVASSQEHTYAHHLLMEQGLKYLKSDGYAIFLAPSD
LLTSPQSDLLKEWLKEEASLVAMISLPENLFANAKQSKTIFILQKKNEIAVEPFVYPLASLQDASVLMKFKENFQKWTQG
TEIZ
ID 110 1902bp
ATGATTATTTTACAAGCTAATAAAATTGAACGTTCTTTTGCAGGAGAGGTTCTTTTCGATAATATCAACCTGCAGG
TTGATGAACGAGATCGGATTGCTCTTGTTGGGAAAAATGGTGCAGGTAAGTCTACTCTTTTGAAGATTTTAGTTGG
AGAAGAGGAGCCAACTAGCGGAGAAATCAATAAGAAAAAAGATATTTCTCTGTCTTACCTAGCCCAAGATAGCCG
TTTTGAGTCTGAAAATACCATCTACGATGAAATGCTTCATGTCTTTAATGATTTGCGTCGGACGGAGAGACAACTG
CGTCAGATGGAGCTGGAGATGGGTGAAAAGTCTGGTGAGGATTTGGATAAACTGATGTCAGATTATGACCGCTTA
TCTGAGAATTTTCGCCAAGCAGGTGGCTTTACCTATGAAGCTGATATTCGAGCGATTTTGAATGGATTCAAGTTTG
ACGAGTCTATGTGGCAGATGAAAATTGCTGAGCTTTCTGGTGGTCAAAATACTCGTTTGGCACTTGCCAAAATGCT
CCTTGAAAAGCCCAATCTCTTGGTCTTGGACGAGCCAACTAACCACTTGGATATTGAAACCATCGCCTGGCTAGA
GAATTACTTGGTAAACTATAGCGGTGCCCTCATTATCGTCAGCCACGACCGTTATTTCTTGGACAAGGTTGCGACA
ATTACGCTAGATTTGACCAAGCATTCCTTGGATCGCTATGTGGGGAATTACTCTCGTTTTGTCGAATTGAAGGAGC
AAAAGCTAGTTACTGAGGCAAAAAACTATGAAAAGCAACAGAAGGAAATCGCTGCTCTGGAAGACTTTGTCAATC
GCAATCTAGTTCGTGCTTCAACGACTAAACGTGCTCAATCTCGCCGTAAACAACTAGAAAAAATGGAGCGTTTGG
ACAAGCCTGAAGCTGGCAAGAAAGCAGCCAACATGACCTTCCAGTCTGAAAAAACGTCGGGCAATGTTGTTTTGA
CTGTTGAAAATGCAGCTGTTGGCTATGACGGGGAAGTCTTGTCACAACCTATCAACCTAGATCTTCGTAAGATGAA
TGCTGTCGCTATCGTTGGTCCAAATGGTATCGGCAAGTCAACCTTTATCAAGTCTATTGTGGACCAGATTCCTTTT
ATCAAGGGAGAAAAGCGCTTTGGCGCTAATGTTGAGGTTGGTTACTATGACCAAACCCAAAGCAAGCTGACACCA
AGTAATACGGTGCTGGATGAACTCTGGAATGATTTCAAACTGACACCAGAAGTTGAAATCCGCAACCGTCTTGGA
GCCTTCCTTTTCTCAGGAGATGATGTTAAAAAATCAGTCGGCATGCTATCTGGTGGCGAAAAAGCTCGTTTGCTTT
TAGCTAAATTGTCTATGGAAAACAATAACTTTTTGATTCTGGATGAGCCGACCAACCACTTGGATATTGATAGTAA
GGAAGTGCTAGAAAATGCCTTGATTGACTTTGATGGAACCTTGCTGTTTGTCAGTCATGATCGTTACTTTATCAAT
CGTGTGGCAACTCATGTTTTGGAATTGTCTGAGAATGGTTCAACTCTCTACCTTGGAGATTACGACTACTATGTTG
AGAAGAAAGCAACAGCAGAAATGAGTCAGACTGAGGAAGCTTCAACTAGCAATCAAGCAAAGGAAGCAAGTCCA
GTCAATGACTATCAGGCCCAGAAAGAAAGTCAAAAAGAAGTTCGCAAACTCATGCGACAAATCGAAAGTCTAGA
AGCTGAAATTGAAGAGCTAGAAAGTCAAAGCCAAGCCATTTCTGAACAAATGTTGGAAACAAACGATGCCGACA
AACTCATGGAATTACAGGCTGAGCTGGACAAAATCAGCCATCGTCAGGAAGAAGCTATGCTTGAGTGGGAAGAAT
TATCAGAGCAGGTGTAA
MIILQANKIERSFAGEVLFDNINLQVDERDRIALVGKNGAGKSTLLKILVGEEEPTSGEINKKKDISLSYLAQDSRFESENT
IYDEMLHVFNDLRRTERQLRQMELEMGEKSGEDLDKLMSDYDRLSENFRQAGGFTYEADIRAILNGFKFDESMWQMK
IAELSGGQNTRLALAKMLLEKPNLLVLDEPTNHLDIETIAWLENYLVNYSGALITVSHDRYFLDKVATITLDLTKHSLDR
YVGNYSRFVELKEQKLVTEAKNYEKQQKEIAALEDFVNRNLVRASTTKRAQSRRKQLEKMERLDKPEAGKKAANMTE
QSEKTSGNVVLTVENAAVGYDGEVLSQPINLDLRKMNAVAIVGPNGIGKSTFIKSIVDQIPFIKGEKRFGANVEVGYYDQ
TQSKLTPSNTVLDELWNDFKLTPEVEIRNRLGAFLFSGDDVKKSVGMLSGGEKARLLLAKLSMENNNFLILDEPTNHL
DIDSKEVLENALIDFDGTLLFVSHDRYFINRVATHVLELSENGSTLYLGDYDYYVEKKATAEMSQTEEASTSNQAKEAS
PVNDYQAQKESQKEVRKLMRQIESLEAEIEELESQSQAISEQMLETNDADKLMELQAELDKISHRQEEAMLEWEELSEQ
VZ
ID 111 1179bp
ATGAATCGCTATGCAGTGCAGTTGATTAGCCGTGGGGCTATCAATAAAATGGGAAATATGCTCTATGATTATGGA
AATAGTGTCTGGTTGGCTTCTATGGGGACTATAGGACAGACAGTTTTAGGAATGTATCAGATTTCTGAGCTCGTCA
CATCTATTCTCGTCAATCCTTTGGCGGAGTTATTTCAGACCGTTTTTCTCGTCGTAAGATTTTAATGACGGCAGAT
CTTGTTTGTGGGATTCTTTGTCTGGCTATTTCTTTCATAAGGAATGATAGCTGGATGATGGCGCTTTGATTGTTGC
TAACATTGTGCAGGCTATTGCTTTTGCCTTTTCTCGCACAGCCAATAAAGCTATCATAACTGAAGTGGTGGAGAAA
GATGAGATTGTGATCTATAATTCTCGCTTAGAGCTGGTTTTGCAGGTTGTAGGTGTTAGCTCTCCTGTTCTTTCCTT
CCTTGTTTTACAGTTTGCAAGTCTCCATATGACGCTACTGCTAGACTCGCTGACTTTTTCATTGCTTTTGTTCTAG
TGGCTTTCCTTCCAAAAGAGGAAGCAAAAGTTCAAGAGAAAAAGGCTTTTACTGGGAGAGATATTTTTGTAGATA
TCAAGGATGGGTTACACTATATCTGGCATCAGCAAGAAATTTTCTTCCTTTTGCTGGTAGCTTCCAGCGTTAATTT
CTTTTTTGCAGCTTTTGAATTTCTACTTCCTTTTCGAATCAGCTTTACGGGTCAGAAGGAGCCTATGCAAGTATTT
TAACTATGGGGGCTATTGGTTCCATCATTGGGGCTCTTCTAGCTAGTAAAATTAAAGCTAATATTTATAATCTTTT
GATTTTACTGGCTTTGACAGGTGTCGGAGTTTTTATGATGGGATTACCACTTCCAACTTTTCTTTCCTTTTCTGGAA
ATTTAGTTTGTGAATTGTTTATGACGATTTTTAATATTCACTTTTTTACTCAAGTACAAACCAAGGTTGAGAGCGAA
TTTCTTGGAAGAGTACTGAGTACAATTTTTACCTTAGCTATTCTATTTATGCCTATTGCAAAAGGATTTATGACAGT
CTTGCCAAGTGTCCATCTTTATTCTTTCTTGATTATTGGACTTGGAGTTGTAGCCTTATATTTCTTAGCTCTTCGGAT
ATGTTCGAACTCATTTTGAAAAATTGATATAA
MNRYAVQLISRGAINKMGNMLYDYGNSVWLASMGTIGQTVLGMYQISELVTSILVNPFGGVISDRFSRRKILMTADLV
CGILCLAISFIRNDSWMIGALIVANIVQAIAFAFSRTANKAIITEVVEKDEIVIYNSRLELVLQVVGVSSPVLSFLVLQFASL
HMTLLLDSLTFFIAFVLVAFLPKEEAKVQEKKAFTGRDIFVDIKDGLHYIWHQQEIFFLLLVASSVNFFFAAFEFLLPFSN
QLYGSEGAYASILTMGAIGSIIGALLASKIKANIYNLLILLALTGVGVFMMGLPLPTFLSFSGNLVCELFMTIFNIHFFTQV
QTKVESEFLGRVLSTIFTLAILFMPIAKGFMTVLPSVHLYSFLIIGLGVVALYFLALGYVRTHFEKLIZ
ID 113 2466bp
ATGCAAAATCAATTAAATGAATTAAAACGAAAAATGCTGGAATTTTTCCAGCAAAAACAAAAAAATAAAAAAATCA
GCTAGACCTGGCAAGAAAGGTTCAAGTACCAAAAAATCTAAAACCTTAGATAAGTCAGCCATTTTCCCAGCTATT
TTACTGAGTATAAAAGCCTTATTTAACTTACTCTTTGTACTCGGTTTTCTAGGAGGAATGTTGGGAGCTGGGATTG
CTTTGGGATACGGAGTGGCCTTATTTGACAAGGTTCGGGTGCCTCAGACAGAAGAATTGGTGAATCAGGTCAAGG
ACATCTCTTCTATTTCAGAGATTACCTATTCGGACGGGACGGTGATTGCTTCCATAGAGAGTGATTTGTTGCGCAC
TTCTATCTCATCTGAGCAAATTTCGGAAAATCTGAAGAAGGCTATCATTGCGACAGAAGATGAACACTTTAAAGA
ACATAAGGGTGTAGTACCCAAGGCGGTGATTCGTGCGACCTTGGGGAAATTTGTAGGTTTGGGTTCCTCTAGTGGG
GGTTCAACCTTGACCCAGCAACTAATTAAACAGCAGGTGGTTGGGGATGCGCCGACCTTGGCTCGTAAGGCGGCA
GAGATTGTGGATGCTCTTGCCTTGGAACGCGCCATGAATAAAGATGAGATTTTAACGACCTATCTCAATGTGGCTC
CCTTTGGCCGAAATAATAAGGGACAGAATATTGCAGGGGCTCGGCAAGCAGCTGAGGGAATTTTCGGTGTAGATG
CCAGTCAGTTGACTGTTCCTCAAGCAGCAATTTTTAGCAGGACTTCCACAGAGTCCCATTACTTACTCTCCTTATGA
AAATACTGGGGAGTTGAAGAGTGATGAAGACCTAGAAATTGGCTTAAGACGGGCTAAGGCAGTTCTTTACAGTAT
GTATCGTACAGGTGCATTAAGCAAAGACGAGTATTCTCAGTACAAGGATTATGACCTTAAACAGGACTTTTTACC
ATCGGGCACGGTTACAGGAATTTCACGAGACTATTTATACTTTACAACTTTGGCAGAAGCTCAAGAACGTATGTAT
GACTATCTAGCTCAGAGAGACAATGTCTCCGCTAAGGAGTTGAAAAATGAGGCAACTCAGAAGTTTTATCGAGAT
TTGGCAGCCAAGGAAATTGAAAATGGTGGTTATAAGATTACTACTACCATAGATCAGAAAATTCATTCTGCCATG
CAAAGTGCGGTTGCTGATTATGGCTATCTTTTAGACGATGGAACAGGTCGTGTAGAAGTAGGGAATGTCTTGATG
GATAACCAAACAGGTGCTATTCTAGGCTTTGTAGGTGGTCGTAATTATCAAGAAAATCAAAATAATCATGCCTTTG
ATACCAAACGTTCGCCAGCTTCTACTACCAAGCCCTTGCTGGCCTACGGTATTGCTATTGACCAGGGCTTGATGGG
AAGTGAAACGATTCTATCTAACTATCCAACAAACTTTGCTAATGGCAATCCGATTATGTATGCTAATAGCAAGGG
AACAGGAATGATGACCTTGGGAGAAGCTCTGAACTATTCATGGAATATCCCTGCTTACTGGACCTATCGTATGCTC
CGTGAAAAGGGTGTTGATGTCAAGGGTTATATGGAAAAGATGGGTTACGAGATTCCTGAGTACGGTATTGAGAGC
TTGCCAATGGGTGGTGGTATTGAAGTCACAGTTGCCCAGCATACCAATGGCTATCAGACCTTAGCTAATAATGGA
GTTTATCATCAGAAGCATGTGATTTCAAAGATTGAAGCAGCAGATGGTAGAGTGGTGTATGAGTATCAGGATAAA
CCGGTTCAAGTCTATTCAAAAGCTACTGCGACGATTATGCAGGGATTGCTACGAGAAGTTCTATCCTCTCGTGTGA
CAACAACCTTCAAGTCTAACCTGACTTCTTTAAATCCTACTCTGGCTAATGCAGATTGGATTGGGAAGACTGGTAC
AACCAACCAAGACGAAAATATGTGGCTCATGCTTTCGACACCTAGATTAACCCTAGGTGGCTGGATTGGGCATGA
TGATAATCATTCATTGTCACGTAGAGCAGGTTATTCTAATAACTCTAATTACATGGCTCATCTGGTAAATGCGATT
CAGCAAGCTTCCCCAAGCATTTGGGGGAACGAGCGCTTTGCTTTAGATCCTAGTGTAGTGAAATCGGAAGTCTTG
AAATCAACAGGTCAAAAACCAGAGAAGGTTTCTGTTGAAGGAAAAGAAGTAGAGGTCACAGGTTCGACTGTTACC
AGCTATTGGGCTAATAAGTCAGGAGCGCCAGCGACAAGTTATCGCTTTGCTATTGGCGGAAGTGATGCGGATTAT
CAGAATGCTTGGTCTAGTATTGTGGGGAGTCTACCAACTCCATCCAGCTCCAGCAGTTCAAGTAGTAGTTCTAGCG
ATAGCAGTAACTCAAGTACTACACGACCTTCTTCTTCAAGGGCGAGACGATAA
MQNQLNELKRKMLEFFQQKQKNKKSARPGKKGSSTKKSKTLDKSAIFPAILLSIKALFNLLFVLGFLGGMLGAGIALGY
GVALFDKVRVPQTEELVNQVKDISSISEITYSDGTVIASIESDLLRTSISSEQISENLKKAIIATEDEHFKEHKGVVPKAVIR
ATLGKFVGLGSSSGGSTLTQQLIKQQVVGDAPTLARKAAEIVDALALERAMNKDEILTTYLNVAPFGRNNKGQNIAGA
RQAAEGIFGVDASQLTVPQAAFLAGLPQSPITYSPYENTGELKSDEDLEIGLRRAKAVLYSMYRTGALSKDEYSQYTKDY
DLKQDFLPSGTVTGISRDYLYFTTLAEAQERMYDYLAQRDNVSAKELKNEATQKFYRDLAAKEIENGGYKITTTIDQKI
HSAMQSAVADYGYLLDDGTGRVEVGNVLMDNQTGAILGFVGGRNYQENQNNHAFDTKRSPASTTKPLLAYGIAIDQG
LMGSETILSNYPTNFANGNPIMYANSKGTGMMTLGEALNYSWNIPAYWTYRMLREKGVDVKGYMEKMGYEIPEYGIE
SLPMGGGIEVTVAQHTNGYQTLANNGVYHQKHVISKIEAADGRVVYEYQDKPVQVYSKATATIMQGLLAEVLSSRVTT
TFKSNLTSLNPTLANADWIGKTGTTNQDENMWLMLSTPRLTLGGWIGHDDNHSLSRRAGYSNNSNYMAHLVNAIQQA
SPSIWGNERFALDPSVVKSEVLKSTGQKPEKVSVEGKEVEVTGSTVTSYWANKSGAPATSYRFAIGGSDADYQNAWSS
VGSLPTPSSSSSSSSSSSDSSNSSTTRPSSSRARRRZ
ID 114 1974bp
ATGAAAAAATTTTATGTAAGTCCAATTTTTCCTATTCTAGTAGGATTGATTGCGTTTGGAGTCTTATCCACTTTCAT
TATTTTTGTTAATAATAATCTGTTGACGGTTTTAATTTTGTTTCTTTTGTAGGAGGCTATGTTTTTTTATTTAAGAA
ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGAC
AGCTCTATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCTGGAGAGGTTGAGTGGTTTAATCCC
TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCTGTTCAAACGATTATCAAGGCTTCAG
TAGGAAATCCGTCTACTTATGCCAAGCTTGGTGAGAAGCGTTATGCTGTTCATATGGATGCTTCTTCCGGTGTTTT
GTATTTTGTAGATGTATCCAGGGAACAAGCCATAACAGATGAATTGGTAACAAGTAGACCAGTGATTGGGATTGT
CTCTGTGGATAATTATGATGATTTGGAGGATGAAACTTCTGAGTCAGATATTAGTCAAATCAATAGTTTTGTAGCT
AATTTTATATCAGAGTTTTCAGAAAAACACATGATGTTTTCTCGTCGGGTAAGTATGGATCGATTTTATCTATTTAC
TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTTCTGTTATTGATGCTTTCAGAGAAGAGTCGAAACAG
AGACAGTTGCCCTTGACCTTAAGTATGGGATTTTCTTATGGCGATGGAAATCATGATGAGATAGGGAAAGTTGCTT
TGCTCAATTTGAACTTCGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGACGAACGAAAAATC
CAGTTTATTTTGGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCTATGATGACAGCTAT
TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTGTAGTCGGTCACAAAAATTTAGACATGGATGCTTTGGGCTCT
GCTGTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGCTCTTTATGATGAAGAACAAATGTCTC
CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGTTAAGGATGCAATGG
GGATGGTGACCAATCGTTCTTTGTTGATTCTTGTAGACCATTCAAAGACAGCCTTAACATTATCAAAGAATTTTA
TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGGATTTTCCAGATAATGCGGTTATTACT
TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTGGTAACGGAATTGATTCAGTTCCAGAATTCTAAGAAAAAT
CGTTTGAGTCGTATGCAAGCAAGTGTCTTGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGAGTAA
CTAGTCGGACATTTGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGAAATCGCTGC
GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATGTACTAATA
GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA
GGTATTGAAGCGAGTTTTGTTCTTGCGAAGAATACACAAGGATTTATCTCTATCTCAGCTCGAAGTTCGTAGTAAAC
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACTTTAATTTGGCAGCAGCTCAAATTAAAGATG
TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAGAAAGAAGAA
TGA
MKKFYVSPIFPILVGLIAFGVLSTFIIFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLE
QMPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVS
REQAITDELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMN
DKFSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYFGGGSAASIK
RTRTRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGV
TKLLSVKDAMGMVTNRSLLILVDHSKTALTLSKEFYDLFTQTIVIDHHRRDQDFPDNAVITYIESGASSASELVTELIQFQ
NSKKNRLSRMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQEIAATDFEEYREVNELILQGRKLGSDV
LIAEAKDMKCYDTVVISKAADAMLAMSGIEASFVLAKNTQGFISTSARSRSKLNVQRIMEELGGGGHFNLAAAQIKDVT
LSEAGEKLTEIVLNEMKEKEKEEZ
ID 115 663bp
ATGAAGTGCTTGTTATGTGGGCAGACTATGAAGACTGTTTTAACTTTTAGTAGTCTCTTACTTCTGAGGAATGATG
ACTCTTGTCTTTGTTCAGACTGTGATTCTACTTTTGAAAGAATTGGGGAAGAGAACTGTCCAAATTGTATGAAAAC
AGAGTTGTCAACAAAGTGTCAAGATTGTCAACTTTGGTGTAAAGAGGGAGTTGAAGTCAGTCATAGAGCGATTTT
TACTTACAATCAAGCTATGAAGGATTTTTTCAGTCGGTATAAGTTTGATGGAGACTTCCTGTTAAGAAAAGTTTTC
GCTTCATTTTTAAGTGAGGAGTTGAAAAAGTACAAAGAGTATCAATTTGTTGTAATTCCCCTAAGTCCTGATAGAT
ATGCTAATAGAGGATTTAATCAGGTTGAGGGCTTGGTAGAGGCAGCAGGCTTTGAGTATCTGGATTTATTAGAGA
AAAGAGAAGAGAGAGCCAGTTCTTCTAAAAATCGTTCAGAGCGCTTGGGGACAGAACTTCCTTTCTTTATTAAAA
GTGGAGTCACTATTCCTAAAAAAATCCTACTTATAGATGATATCTATACTACAGGAGCAACTATAAATCGTGTTAA
GAAACTGTTGGAAGAAGCTGGTGCTAAGGATGTAAAAACATTTTCCCTTGTAAGATGA
MKCLLCGQTMKTVLTFSSLLLLRNDDSCLCSDCDSTFERTGEENCPNCMKTELSTKCQDCQLWCKEGVEVSHRAIFTY
NQAMKDFFSRYKFDGDFLLRKVFASFLSEELKKYKEYQFVVIPLSPDRYANRGFNQVEGLVEAAGFEYLDLLEKREER
ASSSKNRSERLGTELPFFIKSGVTIPKKILLIDDIYTTGATINRVKKLLEEAGAKDVKTFSLVRZ
ID 116 1299bp
ATGAAAGTAAATTTAGATTATCTCGGTCGTTTATTTACTGAGAATGAATTAACAGAAGAAGAACGTCAGTTGGCG
GAGAAACTTCCAGCAATGAGAAAGGAGAAGGGGAAACTTTTCTGTCAACGCTGTAATAGTACTATTCTAGAAGAA
TGGTATTTGCCCATCGGTGCTTACTATTGTCGAGAGTGCTTGCTGATGAAGCGAGTCAGAAGTGATCAAACTTTAT
ACTATTTTCCGCAGGAGGATTTTCCAAAGCAAGATGTTCTCAAATGGCGCGGCCAATTAACTCCTTTTCAAGAGAA
GGTGTCAGAGGGATTGCTTCAAGTAGTAGACAAGCAAAAGCCAACCTTAGTTCATGCGGTAACAGGAGCTGGAAA
GACAGAAATGATTTATCAAGTAGTGGCTAAAGTGATCAATGCGGGTGGTGCAGTGTGTTTGGCTAGTCCTCGCAT
AGATGTTTGTTTGGAGCTGTACAAGCGCCTGCAACAGGATTTTTCTTGCGGGATAGCTTTGCTACATGGAGAATCG
GAACCTTATTTTCGAACACCACTAGTTGTTGCAACAACCCATCAGTTATTGAAGTTTTATCAAGCTTTTGATTTGCT
GATAGTGGATGAAGTAGATGCTTTTCCTTATGTTGATAATCCCATGCTTTACCACGCTGTCAAGAATAGTGTAAAG
GAGAATGGATTGAGAATCTTTTTAACAGCGACTTCGACCAATGAGTTAGATAAAAAGGTCCGTTTAGGAGAACTA
AAAAGACTGAATTTACCGAGACGGTTTCATGGAAATCCGTTGATTATTCCAAAACCAATTTGGTTATCGGATTTTA
ATCGCTACTTAGACAAGAATCGTTTGTCACCAAAGTTAAAGTCCTATATTGAGAAGCAGAGAAAGACAGCTTATC
CGTTACTCATTTTTGCTTCAGAAATTAAGAAAGGGGAGCAGTTAGCAGAAATCTTACAGGAGCAATTTCCAAATG
AGAAAATTGGCTTTGTATCTTCTGTAACAGAGGATCGATTAGAGCAAGTACAAGCTTTTCGAGATGGAGAACTGA
CAATACTTATCAGTACGACAATCTTGGAGCGCGGAGTTACCTTCCCTTGTGTGGATGTTTTCGTAGTAGAGGCCAA
TCATCGTTTGTTTACCAAGTCTAGTTTGATTCAGATTGGTGGACGAGTTGGACGAAGCATGGATAGACCGACAGGA
GATTTGCTTTTCTTCCATGATGGGTTAAATGCTTCAATCAAGAAGGCGATTAAGGAAATTCAGATGATGAATAAGG
AGGCTGGTCTATGA
MKVNLDYLGRLFTENELTEEERQLAEKLPAMRKEKGKLFCQRCNSTILEEWYLPIGAYYCRECLLMKRVRSDQTLYYF
PQEDFPKQDVLKWRGQLTPFQEKVSEGLLQVVDKQKPTLVHAVTGAGKTEMIYQVVAKVINAGGAVCLASPRIDVCL
ELYKRLQQDFSCGIALLHGESEPYFRTPLVVATTHQLLKFYQAFDLLIVDEVDAFPYVDNPMLYHAVKNSVKENGLRIF
LTKTSTNELDKKVRLGELKRLNLPRRFHGNPLIIPKPIWLSDFNRYLDKNRLSPKLKSYIEKQRKTAYPLLIFASEIKKGE
QLAEILQEQFPNEKIGFVSSVTEDRLEQVQAFRDGELTILISTTILERGVTFPCVDVFVVEANHRLFTKSSLIQIGGRVGRS
MDRPTGDLLFFHDGLNASIKKKIKEIQMMNKEAGLZ
ID 117 870bp
ATGCAAATTCAAAAAAGTTTTAAGGGGCAGTCTCCCTATGGCAAGCTGTATCTAGTGGCAACGCCGATTGGCAAT
CTAGATGATATGACTTTTCGTGCTATCCAGACCTTGAAAGAAGTGGACTGGATTGCTGCTGAGGATACGCGCAAT
ACAGGGCTTTTGCTCAAGCATTTTGACATTTCCACCAAGCAGATCAGTTTTCATGAGCACAATGCCAAGGAAAAA
ATTCCTGATTTGATTGGTTTCTTGAAAGCAGGGCAAAGTATTGCTCAGGTCTCTGATGCCGGTTTGCCTAGCATTT
CAGACCCTGGTCATGATTTAGTTAAGGCAGCTATTGAGGAAGAAATTGCAGTTGTGACAGTTCCAGGTGCCTCTGC
AGGAATTTCTGCCTTGATTGCCAGTGGTTTAGCGCCACAGCCACATATCTTTTACGGTTTTTTACCGAGAAAATCA
GGTCAGCAGAAGCAATTTTTTGGCTTGAAAAAAGATTATCCTGAAACACAGATTTTTTATGAATCACCTCATCGTG
TAGCAGACACGTTGGAAAATATGTTAGAAGTCTACGGTGACCGCTCCGTTGTCTTGGTCAGGGAATTGACCAAAA
TCTATGAAGAATACCAACGAGGTACTATCTCTGAGTTATTAGAAAGCATTGCTGAAACGCCACTCAAGGGCGAAT
GTCTTCTCATTGTTGAGGGTGCCAGTCAGGGTGTGGAGGAAAAGGACGAGGAAGACTTGTTCGTAGAAATTCAAA
CCCGCATCCAGCAAGGTGTGAAGAAAAACCAAGCTATCAAGGAAGTCGCTAAGATTTACCAGTGGAATAAAAGTC
AGCTCTACGCTGCCTACCACGACTGGGAAGAAAAACAATAA
MQIQKSFKGQSPYGKLYLVATPIGNLDDMTFRAIQTLKEVDWIAAEDTRNTGLLLKHFDISTKQISFHEHNAKEKIPDLI
GFLKAGQSIAQVSDAGLPSISDPGHDLVKAAIEEEIAVVTVPGASAGISALIASGLAPQPHIFYGFLPRKSGQQKQFFGLKK
DYPETQIFYESPHRVADTLENMLEVYGDRSVVLVRELTKIYEEYQRGTISELLESIAETPLKGECLLIVEGASQGVEEKDE
EDLFVEIQTRIQQGVKKNQAIKEVAKIYQWNKSQLYAAYHDWEEKQZ
ID 118 345bp
ATGATAAAGAAAGGAAAGGGCTGTTTTATGGACAAAAAAGAATTATTTGACGCGCTGGATGATTTTTCCCAACAA
TTATTGGTAACCTTAGCCGATGTGGAAGCCATCAAGAAAAATCTCAAGAGCCTGGTAGAGGAAAATACAGCTCTT
CGCTTGGAAAATAGTAAGTTGCGAGAACGCTTGGGTGAGGTGGAAGCAGATGCTCCTGTCAAGGCCAAGCATGTT
CGCGAAAGTGTCCGTCGTATTTACCGTGATGGATTTCACGTATGTAATGATTTTTATGGACAACGTCGAGAGCAGG
ACGAAGAATGTATGTTTTGTGACGAGTTGTTATACAGGGAGTAA
MIKKGKGCFMDKKELFDALDDFSQQLLVTLADVEAIKKNLKSLVEENTALRLENSKLRERLGEVEADAPVKAKHVRES
VRRIYRDGFHVCNDFYGQRREQDEECMFCDELLYREZ
ID 119 639bp
ATGTCAAAAGGATTTTTAGTCTCTCTTGAGGGACCAGAGGGAGCAGGCAAGACCAGTGTTTTAGAGGCTCTGCTA
CCAATTTTAGAGGAAAAAGGAGTAGAGGTGTTGACGACCCGTGAACCTGGCGGAGTCTTGATTGGGGAGAAGATT
CGGGAAGTGATTTTGGATCCAAGTCAATACTCAGATGGATGCTAAAACAGAGCTACTTCTCTATATTGCCAGTCGCA
GACAGCATTTGGTGGAAAAAGTTCTTCCAGCCCTTGAAGCTGGCAAGTTGGTCATCATGGATCGTTTTATCGATAG
TTCTGTTGCCTATCAGGGATTTGGTCGTGGCTTAGATATTGAAGCCATTGACTGGCTCAATCAGTTTGCGACAGAT
GGCCTCAAACCCGATTTGACACTCTATTTTGACATCGAGGTGGAAGAAGGGCTGGCTCGTATTGCTGCTAATAGTG
ACCGCGAGGTTAATCGTTTGGATTTGGAAGGGTTGGACTTGCATAAAAAAGTTCGTCAAGGCTACCTTTCTCTTCT
GGATAAAGAGGGAAATCGCATTGTCAAGATTGATGCTAGTCTCCCTTTGGAGCAAGTTGTGGAAACTACCAAGGC
TGTCTTGTTTGACGGAATGGGCTTGGCCAAATGA
MSKGFLVSLEGPEGAGKTSVLEALLPILEEKGVEVLTTREPGGVLIGEKIREVILDPSHTQMDAKTELLLYIASRRQHLVE
KVLPALEAGKLVIMDRFIDSSVAYQGFGRGLDIEAIDWLNQFATDGLKPDLTLYFDIEVEEGLARIAANSDREVNRLDL
EGLDLHKKVRQGYLSLLDKEGNRIVKIDASLPLEQVVETTKAVLFDGMGLAKZ
ID 120 408bp
ATGGTAGAACAAAGAAAATCAATTACCATGAAAGATGTTGCTTTAGAAGCAGGAGTTAGTGTTGGAACTGTTTCA
CGTGTAATTAATAAAGAAAAAGGCATTAAAGAAGTAACTTTGAAAAAAGTGGAACAAGCGATTAAAACTTTGAAT
TACATTCCAGATTACTACGCTAGAGGAATGAAAAAAAATCGAACAGAAACGATTGCAATCATTGTACCAAGTATC
TGGCATCCCTTCTTTTCAGAATTTGCTATGCATGTGGAAAATGAAGTCTATAAGAGAAATAACAAATTACTCTTAT
GTTCTATCAATGGTACAAATAGAGAGCAAGACTATCTGGAGATGTTGCGTCATAATAAAGTTGATGGAGTGGTTG
CCATTACCTATAGGCCAATTGAACATTACTTGACGTCAGGAATTCCCTTTGTTAGTATTGACCGCACATACTCAGA
GATTGCCATTCCTTGTGTTTCA
MVEQRKSITMKDVALEAGVSVGTVSRVINKEKGIKEVTLKKVEQAIKTLNYIPDYYARGMKKNRTETIAIIVPSIWHPFF
SEFAMHVENEVYKRNNKLLLCSINGTNREQDYLEMLRHNKVDGVVAITYRPIEHYLTSGIPFVSIDRTYSEIAIPCVS
ID121 285bp
ATGAATATATTTAGAACAAAGAATGTTAGTTTAGATAAAACAGAGATGCATAGGCATTTGAAGTTATGGGATTTG
ATTTTGCTGGGTATCGGAGCCATGGTAGGGACAGGCGTCTTTACAATCACAGGTACTGCAGCTGCAACACTTGCTG
GCCCAGCCCTAGTGATTTCAATCGTTATTTCTGCCTTGTGTGTGGGATTATCAGCCCTCTTTTTTGCAGAATTTGCC
TCGCGAGTACCCGCTACAGGAGGTGCCTATAGTTACCTCTATGCTATCTTAGGAGAATTCCCTGCCTGGTTGGCTG
GTTGGTTAACCATGATGGAGTTCATGACAGCCATATCAGGCGTAGCTTCGGGTTGGGCAGCTTATTTTAA
MNIFRTKNVSLDKTEMHRHLKLWDLILLGIGAMVGTGVFTTTGTAAATLAGPALVISIVISALCVGLSALFFAEFASRVP
ATGGAYSYLYAILGEFPAWLAGWLTMMEFMTAISGVASGWAAYF
ID124 1311bp
ATGAAATCAAGAGTAAAGGAAACGAGTATGGATAAAATTGTGGTTCAAGGTGGCGATAATCGTCTGGTAGGAAGC
GTGACGATCGAGGGAGCAAAAAATGCAGTCTTACCCTTGTTGGCAGCGACTATTCTAGCAAGTGAAGGAAAGACC
GTCTTGCAGAATGTTCCGATTTTGTCGGATGTCTTTATTATGAATCAGGTAGTTGGTGGTTTGAATGCCAAGGTTG
ACTTTGATGAGGAAGCTCATCTTGTCAAGGTGGATGCTACTGGCGACATCACTGAGGAAGCCCCTTACAAGTATG
TCAGCAAGATGCGCGCCTCCATCGTTGTATTAGGGCCAATCCTTGCCCGTGTGGGTCATGCCAAGGTATCCATGCC
AGGTGGTTGTACGATTGGTAGCCGTCCTATTGATCTTCATTTGAAAGGTCTGGAAGCTATGGGGGTTAAGATTAGT
CAGACAGCTGGTTACATCGAAGCCAAGGCAGAACGCTTGCATGGTGCTCATATCTATATGGACTTTCCAAGTGTTG
GTGCAACGCAGAACTTGATGATGGCAGCGACTCTGGCTGATGGGGTGACAGTGATTGAGAATGCTGCGCGTGAGC
CTGAGATTGTTGACTTAGCCATTCTCCTTAATGAAATGGGAGCCAAGGTCAAAGGTGCTGGTACAGAGACTATAA
CCATTACTGGTGTTGAGAAACTTCATGGTACGACTCACAATGTAGTCCAAGACCGTATCGAAGCAGGAACCTTTAT
GGTAGCTGCTGCCATGACTGGTGGTGATGTCTTGATTCGAGACGCTGTCTGGGAGCACAACCGTCCCTTGATTGCC
AAGTTACTTGAAATGGGTGTTGAAGTAATTGAAGAAGACGAAGGAATTCGTGTTCGTTCTCAACTAGAAAATCTA
AAAGCTGTTCATGTGAAAACCTTGCCCCACCCAGGATTTCCAACAGATATGCAGGCTCAATTTACAGCCTTGATGA
CAGTTGCAAAAGGCGAATCAACCATGGTGGAGACAGTTTTCGAAAATCGTTTCCAAACCTAGAAGAGATGCGCCG
CATGGGCTTGCATTCTGAGATTATCCGTGATACAGCTCGTATTGTTGGTGGACAGCCTTTGCAGGGAGCAGAAGTT
CTTTCAACTGACCTTCGTGCCAGTGCGGCCTTGATTTTGACAGGTTTGGTAGCACAGGGAGAAACTGTGGTCGGTA
AATTGGTTCACTTGGATAGAGGTTACTACGGTTTCCATGAGAAGTTGGCGCAGCTAGGTGCTAAGATTCAGCGGAT
TGAGGCAAGTGATGAAGATGAATAA
MKSRVKETSMDKIVVQGGDNRLVGSVTIEGAKNAVLPLLAATILASEGKTVLQNVPILSDVFIMNQVVGGLNAKVDFD
EEAHLVKVDATGDITEEAPYKYVSKMRASIVVLGPILARVGHAKVSMPGGCTIGSRPIDLHLKGLEAMGVKISQTAGYIE
AKAERLHGAHIYMDFPSVGATQNLMMAATLADGVTVIENAAREPEIVDLAILLNEMGAKVKGAGTETITITGVEKLHG
TTHNVVQDRIEAGTFMVAAAMTGGDVLIRDAVWEHNRPLIAKLLEMGVEVIEEDEGIRVRSQLENLKAVHVKTLPHP
GFPTDMQAQFTALMTVAKGESTMVETVFENRFQHLEEMRRMGLHSEIIRDTARIVGGQPLQGAEVLSTDLRASAALIL
TGLVAQGETVVGKLVHLDRGYYGFHEKLAQLGAKIQRIEASDEDEZ
ID125 1101bp
ATGTTATTAGCGTCAACAGTAGCCTTGTCATTTGCCCCAGTATTGGCAACTCAAGCAGAAGAAGTTCTTTGGACTG
CACGTAGTGTTGAGCAAATCCAAAACGATTTGACTAAAACGGACAACAAAACAAGTTATACCGTACAGTATGGTG
ATACTTTGAGCACCATTGCAGAAGCCTTGGGTGTAGATGTCACAGTGCTTGCGAATCTGAACAAAATCACTAATAT
GGACTTGATTTTCCCAGAAACTGTTTTGACAACGACTGTCAATGAAGCAGAAGAAGTAACAGAAGTTGAAATCCA
AACACCTCAAGCAGACTCTAGTGAAGAAGTGACAACTGCGACAGCAGATTTGACCACTAATCAAGTGACCGTTGA
TGATCAAACTGTTCAGGTTGCAGACTTTCTCAACCAATTGCAGAAGTTACAAAGACAGTGATTGCTTCTGAAGAA
GTGGCACCATCTACGGGCACTTCTGTCCCAGAGGAGCAAACGACCGAAACAACTCGCCCAGTTGCAGAAGAAGCT
CCTCAGGAAACGACTCCAGCTGAGAAGCAGGAAACACAAACAAGCCCTCAAGCTGCATCAGCAGTGGAAGCAAC
TACAACAAGTTCAGAAGCAAAAGAAGTAGCATCATCAAATGGAGCTACAGCAGCAGTTTCTACTTATCAACCAGA
AGAAACGAAAGTAATTTCAACAACTTACGAGGCTCCAGCTGCGCCCGATTATGCTGGACTTGCAGTAGCAAAATC
TGAAAATGCAGGTCTTCAACCACAAACAGCTGCCTTTAAWGAAGAAATTGCTAACTTGTTTGGCATTACATCCTTT
AGTGGTTATCGTCCAGGAGACAGTGGAGATCACGGAAAAGGTTTGGCTATCGACTTTATGGTACCAGAACGTTCA
GAATTAGGGGATAAGATTGCGGAATATGCTATTCAAAATATGGCCAGCCGTGGCATTAGTTACATCATCTGGAAA
CAACGTTTCTATGCTCCATTCGATAGCAAATATGGGCCAGCTAACACTTGGAACCCAATGCCAGACCGTGGTAGT
GTGACAGAAAATCACTATGATCACGTTCACGTTTCAATGAATGGATAA
MLLASTVALSFAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYTVQYGDTLSTIAEALGVDVTVLANLNKTTNMDL
IFPETVLTTTVNEAEEVTEVEIQTPQADSSEEVTTATADLTTNQVTVDDQTVQVADLSQPIAEVTKTVIASEEVAPSTGTS
VPEEQTTETTRPVAEEAPQETTPAEKQETQTSPQAASAVEATTTSSEAKEVASSNGATAAVSTYQPEETKVISTTYEAPA
APDYAGLAVAKSENAGLQPQTAAFKKKLLTCLALHPLVVIVQETVEITEKVWLSTLWYQNVQNZGIRLRNMLFKIWPA
VALVTSSGNNVSMLHSIANMGQLTLGTQCQTVVVZQKITMITFTFQZMD
ID126 1281bp
TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATGGGTGAAAACTTTTTGCAGATGCTAATGG
GAATGGTGGACAGTTATTTGGTTGCTCATTTAGGATTGATAGCTATTTCAGGGGTTTCAGTAGCTGGTAATATTAT
CACCATTTATCAGGCGATTTTCATCGCTCTGGGAGCTGCTATTTCCAGTGTTATTTCAAAAAGCATAGGGCAGAAA
GACCAGTCGAAGTTGGCCTATCATGTGACTGAGGCGTTGAAGATTACCTTACTATTAAGTTTCCTTTTAGGATTTT
TGTCCATCTTCGCTGGGAAAGAGATGATAGGACTTTTGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGT
ATCTATCTTTGGTAGGCGGATCGATTGTTCTCTTAGGTTTAATGACTAGTCTAGGAGCCTTGATTCGTGCAACGCA
TAATCCACGTCTGCCTCTCTATGTTAGTTTTTTATCCAATGCCTTGAATATTCTTTTTTCAAGTCTAGCTATTTTTGT
TCTGGATATGGGGATAGCTGGTGTTGCTTGGGGGACAATTGTGTCTCGTTTGGTTGGTCTTGTGATTTTGTGGTCAC
AATTAAAACTGCCTTATGGGAAGCCAACTTTTGGTTTAGATAAGGAACTGTTGACCTTGGCTTTACCAGCAGCTGG
AGAGCGACTTATGATGAGGGCTGGAGATGTAGTGATCATTGCCTTGGTCGTTTCTTTTGGGACGGAGGCAGTTGCT
GGGAATGCAATCGGAGAAGTCTTGACCCAGTTTAACTATATGCCTGCCTTTGGCGTCGCTACGGCAACGGTCATG
CTGTTGGCCCGAGCAGTTGGAGAGGATGATTGGAAAAGAGTTGCTAGTTTGAGTAAACAAACCTTTTGGCTTTCTC
TGTTCCTCATGTTGCCCCTGTCCTTTAGTATATATGTCTTGGGTGTACCATTAACTCATCTCTATACGACTGATTCT
CTAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTTTTCACTACTTGGGACCCCTATGACGACAGGAACAGTCATCT
ATACGGCAGTCTGGCAGGGATTAGGAAATGCACGCCTCCCTTTTTATGCGACAAGTATAGGAATGTGGTGTATCC
GCATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGCTTGCCTGGTATTTGGGCAGGGTCTCTCTTGGA
TAATGGTTTTCGCTGGTTATTTCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG
LFKKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGLIAISGVSVAGNIITIYQAIFIALGAAISSVISKSIGQKDQSKLA
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGALIRATHNPRLPLYVSFL
SNALNILFSSLAIFVLDMGIAGVAWGTIVSRLVGLVILWSQLKLPYGKPTFGLDKELLTLALPAAGERLMMRAGDVVIIA
LVVSFGTEAVAGNAIGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASLSKQTFWLSLFLMLPLSFSIYVLGVP
LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIRIGTGYLMGIVLGWGLPGIW
AGSLLDNGFRWLFLRYRYQRYMSLKGZ
ID127 894bp
GTGGGAAGAATTATCAGAGCAGGTGTAAAGATGGAACATCTTGGAAAAGTATTTCGTGAATTTCGAACAAGTGGA
AATTATTCTTTAAAGGAAGCAGCAGGCGAATCCTGCTCTACCTCTCAGTTATCTCGCTTTGAGCTTGGGGAGTCTG
ACCTGGCAGTCTCCCGTTTCTTTGAGATTTTGGATAACATTCATGTAACAATCGAAAATTTCATGGATAAGGCAAG
GAATTTTCATAATCATGAACATGTGTCTATGATGGCACAGATTATCCCACTTTACTATTCAAACGATATTGCAGGT
TTTCAAAAGCTTCAAAGAGAACAACTTGAAAAGTCTAAGAGTTCGACGACTCCCTTTATTTTGAGCTGAACTGGA
TTTTGCTACAAGGTCTGATTTGTCAAAGAGATGCGAGTTATGATATGAAGCAGGATGATTTGGGTAAGGTAGCAG
ATTATCTCTTCAAAACAGAAGAATGGACCATGTATGAGTTGATTCTTTTCGGTAACCTCTATAGTTTCTACGATGT
AGACTATGTCACTCGGATTGGTAGAGAAGTTATGGAGAGGGAGGAATTTTACCAAGAGATTAGTCGCCATAAGAG
ATTAGTGTTGATTTTGGCCCTCAATTGTTACCAGCATTGTTTAGAGCATTCTTTTTATAATGCCAACTATTTTG
AGGCTTATACAGAGAAGATTATTGACAAAGGTATTAAGCTTTATGAGCGTAATGTTTTCCATTATTTAAAAGGTTT
TGCCTTATATCAAAAAGGACAGTGTAAAGAAGGCTGTAAGCAGATGCAAGAGGCCATGCATATTTTTGATGTGTT
AGGTCTTCCAGAGCAAGTAGCCTATTATCAGGAACACTACGAAAAATTTGTCAAAAGTTAA
VGRIIRAGVKMEHLGKVFREFRTSGNYSLKEAAGESCSTSQLSRFELGESDLAVSRFFEILDNIHVTIENFMDKARNFHN
HEHVSMMAQIIPLYYSNDIAGFQKLQREQLEKSKSSTTPLYFELNWILLQGLICQRDASYDMKQDDLGKVADYLFKTEE
WTMYELILFGNLYSFYDVDYVTRIGREVMEREEFYQEISRHKRLVLILALNCYQHCLEHSSFYNANYFEAYTEKIIDKGI
KLYERNVFHYLKGFALYQKGQCKEGCKQMQEAMHIFDVLGLPEQVAYYQEHYEKFVKSZ
Table 3
ID1 1068bp
ATGTCTAACATTCAAAACATGTCCCTGGAGGACATCATGGGAGAGCGCTTTGGTCGCTACTCCAAGTACATTATTC
AAGACCGGGCTTTGCCAGATATTCGTGATGGGTTGAAGCCGGTTCAGCGCCGTATTCTTTATTCTATGAATAAGGA
TAGCAATACTTTTGACAAGAGCTACCGTAAGTCGGCCAAGTCAGTCGGGAACATCATGGGGAATTTCCACCCACA
CGGGGATTCTTCTATCTATGATGCCATGGTTCGTATGTCACAGAACTGGAAAAATCGTGAGATTCTAGTTGAAATG
CACGGTAATAACGGTTCTATGGACGGAGATCCTCCTGCGGCTATGCGTTATACTGAGGCACGTTTGTCTGAAATTG
CAGGCTACCTTCTTCAGGATATCGAGAAAAAGACAGTTCCTTTTGCATGGAACTTTGACGATACGGAGAAAGAAC
CAACGGTCTTGCCAGCAGCCTTTCCAAACCTCTTGGTCAATGGTTCGACTGGGATTTCGGCTGGTTATGCCACAGA
CATTCCTCCCCATAATTTAGCTGAGGTCATAGATGCTGCAGTTTACATGATTGACCACCCAACTGCAAAGATTGAT
AAACTCATGGAATTCTTGCCTGGACCAGACTTCCCTACAGGGGCTATTATTCAGGGTCGTGATGAAATCAAGAAA
GCTTATGAGACTGGGAAAGGGCGCGTGGTTGTTCGTTCCAAGACTGAAATTGAAAAGCTAAAAGGTGGTAAGGAA
CAAATCGTTATTATTGAGATTCCTTATGAAATCAATAAGGCCAATCTAGTCAAGAAAATCGATGATGTTCGTGTTA
ATAACAAGGTAGCTGGGATTGCTGAGGTTCGTGATGAGTCTGACCGTGATGGTCTTCGTATCGCTATCGAACTTAA
GAAAGACGCTAATACTGAGCTTGTTCTCAACTACTTATTTAAGTACACCGACCTACAAATCAACTACAACTTTAAT
ATGGTGGCGATTGACAATTTCACACCTCGTCAGGTTGGATTGTTCCAATCCTGTCTAGCTATATCGCTCACCGTCG
AGAAGTGA
MSNIQNMSLEDIMGERFGRYSKYIIQDRALPDIRDGLKPVQRRILYSMNKDSNTFDKSYRKSAKSVGNIMGNFHPHGDS
SIYDAMVRMSQNWKNREILVEMHGNNGSMDGDPPAAMRYTEARLSEIAGYLLQDIEKKTVPFAWNFDDTEKEPTVLP
AAFPNLLVNGSTGISAGYATDIPPHNLAEVIDAAVYMIDHPTAKIDKLMEFLPGPDFPTGAIIQGRDEIKKAYETGKGRV
VVRSKTEIEKLKGGKEQIVIIEIPYEINKANLVKKIDDVRVNNKVAGIAEVRDESDRDGLRIAIELKKDANTELVLNYLFK
YTDLQINYNFNMVAIDNFTPRQVGLFQSCLAISLTVEKZ
ID12 684bp
ATGCCGACATTAGAAATAGCACAAAAAAAACTGGAGTTCATTAAGAAGGCAGAAGAATATTACAATGCCTTGTGT
ACAAATATACAGTTGAGCGGAGATAAACTAAAAGTAATTTCCGTTACTTCTGTTAACCCTGGGGAAGGAAAAACA
ACTACTTCCATAAATATAGCATGGTCGTTTGCGCGTGCAGGCTATAAAACTCTTTTGATCGATGGCGATACTCGAA
ATTCAGTTATGTTAGGAGTTTTTAAATCTCGTGAAAAAATTACAGGGCTAACAGAATTTTTATCTGGGACAGCTGA
TTTATCTCACGGTTTATGTGATACAAATATTGAAAATTTATTTGTAGTTCAATCGGGATCTGTATCACCAAACCCT
ACAGCCTTGTTACAAAGTAAAAATTTTAATGATATGATTGAAACATTGCGTAAATATTTTGATTATATCATTATTG
ATACACCGCCTATTGGAATTGTTATTGATGCGGCAATTATCACTCAAAAGTGTGATGCGTCCATCTTGGTAACAGC
AACAGGTGAGGCGAATAAACGTGATATCCAAAAAGCGAAACAACAATTAAAACAAACAGGGAAACTGTTCCTAG
GAGTTGTTTTAAATAAATTGGATATCTCGGTTAATAAGTATGGAGTTTACGGTTCCTATGGAAATTATGGTAAAAA
ATAA
MPTLEIAQKKLEFIKKAEEYYNALCTNIQLSGDKLKVISVTSVNPGEGKTTTSINIAWSFARAGYKTLLIDGDTRNSVML
GVFKSREKITGLTEFLSGTADLSHGLCDTNIENLFVVQSGSVSPNPTALLQSKNFNDMIETLRKYFDYIIIDTPPIGIVIDAA
IITQKCDASILVTATGEANKRDIQKAKQQLKQTGKLFLGVVLNKLDISVNKYGVYGSYGNYGKKZ
ID13 1182bp
ATGGAGGCAAATATGAAACATCTAAAAACATTTTACAAAAAATGGTTTCAATTATTAGTCGTTATCGTCATTAGCT
TTTTTAGTGGAGCCTTGGGTAGTTTTTCAATAACTCAACTAACTCAAAAAAGTAGTGTAAACAACTCTAACAACAA
TAGTACTATTACACAAACTGCCTATAAGAACGAAAATTCAACAACACAGGCTGTTAACAAAGTAAAAGATGCTGT
TGTTTCTGTTATTACTTATTCGGCAAACAGACAAAATAGCGTATTTGGCAATGATGATACTGACACAGATTCTCAG
CGAATCTCTAGTGAAGGATCTGGAGTTATTTATAAAAAGAATGATAAAGAAGCTTACATCGTCACCAACAATCAC
GTTATTAATGGCGCCAGCAAAGTAGATATTCGATTGTCAGATGGGACTAAAGTACCTGGAGAAATTGTCGGAGCT
GACACTTTCTCTGATATTGCTGTCGTCAAAATCTCTTCAGAAAAAGTGACAACAGTAGCTGAGTTTGGTGATTCTA
GTAAGTTAACTGTAGGAGAAACTGCTATTGCCATCGGTAGCCCGTTAGGTTCTGAATATGCAAATACTGTCACTCA
AGGTATCGTATCCAGTCTCAATAGAAATGTATCCTTAAAATCGGAAGATGGACAAGCTATTTCTACAAAAGCCAT
CCAAACTGATACTGCTATTAACCCAGGTAACTCTGGCGGCCCACTGATCAATATTCAAGGGCAGGTTATCGGAAT
TACCTCAAGTAAAATTGCTACAAATGGAGGAACATCTGTAGAAGGTCTTGGTTTCGCAATTCCTGCAAATGATGCT
ATCAATATTATTGAACAGTTAGAAAAAAACGGAAAAGTGACGCGTCCAGCTTTGGGAATCCAGATGGTTAATTTA
TCTAATGTGAGTACAAGCGACATCAGAAGACTCAATATTCCAAGTAATGTTACATCTGGTGTAATTGTTCGTTCGG
TACAAAGTAATATGCCTGCCAATGGTCACCTTGAAAAATACGATGTAATTACAAAAGTAGATGACAAAGAGATTG
CTTCATCAACAGACTTACAAAGTGCTCTTTACAACCATTCTATCGGAGACACCATTAAGATAACCTACTATCGTAA
CGGGAAAGAAGAAACTACCTCTATCAAACTTAACAAGAGTTCAGGTGATTTAGAATCTTAA
MEANMKHLKTFYTKKWFQLLVVIVISFFSGALGSFSITQLTQKSSVNNSNNNSTITQTAYKNENSTTQAVNKVKDAVVSV
ITYSANRQNSVFGNDDTDTDSQRISSEGSGVIYKKNDKEAYIVTNNHVINGASKVDIRLSDGTKVPGEIVGADTFSDIAV
VKISSEKVTTVAEFGDSSKLTVGETAIAIGSPLGSEYANTVTQGIVSSLNRNVSLKSEDGQAISTKAIQTDTAINPGNSGGP
LINIQGQVIGITSSKIATNGGTSVEGLGFAIPANDAINIIEQLEKNGKVTRPALGIQMVNLSNVSTSDIRRLNIPSNVTSGVIV
RSVQSNMPANGHLEKYDVITKVDDKEIASSTDLQSALYNHSIGDTIKITYYRNGKEETTSIKLNKSSGDLESZ
ID15 939bp
ATGGCAGAAATTTATCTAGCAGGTGGTTGTTTTTGGGGCCTAGAGGAATATTTTTCACGCATTTCTGGAGTGCTAG
AAACCAGTGTTGGCTACGCTAATGGTCAAGTCGAAACGACCAATTACCAGTTGCTCAAGGAAACAGACCATGCAG
AAACGGTCCAAGTGATTTACGATGAGAAGGAAGTGTCACTCAGAGAGATTTTACTTTATTATTTCCGAGTTATCGA
TCCTCTATCTATCAATCAACAAGGGAATGACCGTGGTCGCCAATATCGAACTGGGATTTATTATCAGGATGAAGC
AGATTTGCCAGCTATCTACACAGTGGTGCAGGAGCAGGAACGCATGCTGGGTCGAAAGATTGCAGTAGAAGTGGA
GCAATTACGCCACTACATTCTGGCTGAAGACTACCACCAAGACTATCTCAGGAAGAATCCTTCAGGTTACTGTCAT
ATCGATGTGACCGATGCTGATAAGCCATTGATTGATGCAGCAAACTATGAAAAGCCTAGTCAAGAGGTGTTGAAG
GCCAGTCTATCTGAAGAGTCTTATCGTGTCACACAAGAAGCTGCTACAGAGGCTCCATTTACCAATGCCTATGACC
AAACCTTTGAAGAGGGGATTTATGTAGATATTACGACAGGTGAGCCACTCTTTTTTGCCAAGGATAAGTTTGCTTC
AGGTTGTGGTTGGCCAAGTTTTAGCCGTCCGATTTCCAAAGAGTTGATTCATTATTACAAGGATCTGAGCCATGGA
ATGGAGCGAATTGAAGTTCGTTCTCGTTCAGGCAGTGCTCACTTGGGTCATGTTTTCACAGATGGACCGCGGGAGT
TAGGCGGCCTCCGTTACTGTATCAATTCTGCTTCTTTACGCTTTGTGGCCAAGGATGAGATGGAAAAAGCAGGATA
TGGCTATCTATTGCCTTACTTAAACAAATAA
MAEIYLAGGCFWGLEEYFSRISGVLETSVGYANGQVETTNYQLLKETDHAETVQVIYDEKEVSLREILLYYFRVIDPLSI
NQQGNDRGRQYRTGIYYQDEADLPAIYTVVQEQERMLGRKIAVEVEQLRHYILAEDYHQDYLRKNPSGYCHIDVTDA
DKPLIDAANYEKPSQEVLKASLSEESYRVTQEAATEAPFTNAYDQTFEEGIYVDITTGEPLFFAKDKFASGCGWPSFSRPI
SKELIHYYKDLSHGMERIEVRSRSGSAHLGHVFTDGPRELGGLRYCINSASLRFVAKDEMEKAGYGYLLPYLNKZ
ID17 870bp
ATGAAGATTATTGTACCTGCAACCAGTGCCAATATCGGGCCAGGTTTTGACTCGGTCGGTGTAGCTGTAACCAAGT
ATCTTCAAATTGAGGTCTGCGAAGAACGAGATGAGTGGCTGATTGAACACCAGATTGGCAAATGGATTCCACATG
ACGAGCGTAATCTCTTGCTCAAAATCGCTTTGCAAATTGTACCAGACTTGCAACCAAGACGCTTGAAAATGACCA
GTGATGTCCCTTTGGCGCGCGGTTTGGGTTCTTCCAGCTCGGTTATCGTTGCTGGGATTGAACTAGCCAACCAACT
GGGTCAACTCAACTTATCAGACCATGAAAAATTGCAGTTAGCGACCAAGATTGAAGGGCATCCTGACAATGTGGC
TCCAGCCATTTATGGTAATCTCGTTATTGCAAGTTCTGTTGAAGGGCAAGTCTCTGCTATCGTAGCAGACTTTCCA
GAGTGTGATTTTCTAGCTTACATTCCAAACTATGAATTACGTACTCGCGACAGCCGTAGTGTCTTGCCTAAAAAAT
TGTCTTATAAGGAAGCTGTTGCTGCAAGTTCTATCGCCAATGTAGCGGTTGCTGCCTTGTTGGCAGGAGACATGGT
GACCGCTGGGCAAGCAATCGAGGGAGACCTCTTCCATGAGCGCTATCGTCAGGACTTGGTAAGAGAATTTGCGAT
GATTAAGCAAGTGACCAAAGAAAATGGGGCCTATGCAACCTACCTTTCTGGTGCTGGGCCGACAGTTATGGTTCT
GCCTTCTCATGACAAGATGCCAACAATTAAGGCAGAATTGGAAAAGCAACCTTTCAAAGGAAAACTGCATGACTT
GAGAGTTGATACCCAAGGTGTCCGTGTAGAAGCAAAATAA
MKIIVPATSANIGPGFDSVGVAVTKYLQIEVCEERDEWLIEHQIGKWIPHDERNLLLKIALQIVPDLQPRRLKMTSDVPLA
RGLGSSSSVIVAGIELANQLGQLNLSDHEKLQLATKIEGHPDNVAPAIYGNLVIASSVEGQVSAIVADFPECDFLAYIPNY
ELRTRDSRSVLPKKLSYKEAVAASSIANVAYAALLAGDMVTAGQAIEGDLFHERYRQDLVREFAMIKQVTKENGAYAT
YLSGAGPTVMVLASHDKMPTIKAELEKQPFKGKLHDLRVDTQGVRVEAKZ
ID20 564bp
ATGAAATATCACGATTACATCTGGGATTTAGGTGGAACTTTACTGGATAATTATGAAACTTCAACAGCTGCATTTG
TTGAAACATTGGCACTGTATGGTATCACACAAGACCATGACAGTGTCTATCAAGCTTTAAAGGTTTCTACTCCTTT
TGCGATTGAGACATTCGCTCCCAATTTAGAGAATTTTTAGAAAAGTACAAGGAAAATGAAGCCAGAGAGCTTGA
ACACCCGATTTTATTTGAAGGAGTTTCTGACCTATTGGAAGACATTTCAAATCAAGGTGGCCGTCATTTTTTGGTC
TCTCATCGAAATGATCAGGTTTTGGAAATTTTAGAAAAAACCTCTATAGCAGCTTATTTTACAGAAGTGGTGACTT
CTAGCTCAGGCTTTAAGAGAAAGCCAAATCCCGAATCCATGCTTTATTTAAGAGAAAAGTATCAGATTAGCTCTG
GTCTTGTCATTGGTGATCGGCCGATTGATATCGAAGCAGGTCAAGCTGCAGGACTTGATACCCACTTGTTTACCAG
TATCGTGAATTTAAGACAAGTATTAGACATATAA
MKYHDYIWDLGGTLLDNYETSTAAFVETLALYGITQDHDSVYQALKVSTPFAIETFAPNLENFLEKYKENEARELEHPI
LFEGVSDLLEDISNQGGRHFLVSHRNDQVLEILEKTSIAAYFTEVVTSSSGFKRKPNPESMLYLREKYQISSGLVIGDRPID
IEAGQAAGLDTHLFTSIVNLRQVLDIZ
ID21 1875bp
ATGACAGAAGAAATCAAAAATCTGCAGGCACAGGATTATGATGCCAGTCAAATTCAAGTTTTAGAGGGCTTAGAG
GCTGTTCGTATGCGTCCAGGGATGTACATTGGATCAACCTCAAAAGAAGGTCTTCACCATCTAGTCTGGGAAATTG
TTGATAACTCAATTGACGAGGCCTTGGCAGGATTTGCCAGCCATATTCAAGTTTTTATTGAGCCAGATGATTCGAT
TACTGTTGTGGATGATGGGCGTGGTATCCCAGTCGATATTCAGGAAAAAACAGGCCGTCCTGCTGTTGAGACCGT
CTTTACAGTCCTTCACGCTGGAGGAAAGTTCGGCGGTGGTGGATACAAGGTTTCAGGTGGTCTTCACGGGGTGGG
GTCGTCAGTAGTTAATGCCCTTTCCACTCAATTAGACGTTCATGTTCACAAAAATGGTAAGATTCATTACCAAGAA
TACCGTCGTGGTCATGTTGTCGCAGATCTTGAAATAGTTGGAGATACGGATAAAACAGGAACAACTGTTCACTTC
ACACCGGACCCAAAAATCTTCACTGAAACAACAATCTTTGATTTTGATAAATTAAATAAACGGATTCAAGAGTTG
GCCTTTCTAAATCGCGGTCTTCAAATTTCAATTACAGATAAGCGCCAAGGTTTGGAACAAACCAAGCATTATCATT
ATGAAGGTGGGATTGCTAGTTACGTTGAATATATCAACGAGAACAAGGATGTAATCTTTGATACACCAATCTATA
CAGACGGTGAGATGGATGATATCACAGTTGAGGTAGCCATGCAGTACACAACTGGTTACCATGAAAATGTCATGA
GTTTCGCCAATAATATTCATACCCATGAAGGTGGAACACATGAACAAGGTTTCCGTACAGCCTTGACACGTGTTAT
CAACGATTATGCTCGTAAAAATAAGTTACTGAAAGACAATGAAGATAATTTAACAGGGGAAGATGTTCGCGAAGG
CTTAACTGCAGTTATCTCAGTTAAACACCCAAATCCACAGTTTGAAGGACAAACCAAGACCAAATTGGGAAATAG
CGAAGTGGTCAAGATTACCAATCGCCTCTTCAGTGAAGCTTTCTCCGATTTCCTCATGGAAAATCCACAGATTGCC
AAACGTATCGTAGAAAAAGGAATTTTGGCTGCCAAGGCTCGTGTGGCTGCCAAGCGTGCGCGTGAAGTCACACGT
AAAAAATCTGGTTTGGAAATTTCCAACCTTCCAGGGAAACTAGCAGACTGTTCTTCTAATAACCCTGCTGAAACAG
AACTCTTCATCGTCGAAGGAGACTCAGCTGGTGGATCAGCCAAATCTGGTCGTAACCGTGAGTTTCAGGCTATCCT
TCCAATTCGCGGTAAGATTTTGAACGTTGAAAAAGCAAGTATGGATAAGATTCTAGCCAACGAAGAAATTCGTAG
TCTTTTCACAGCCATGGGAACAGGATTTGGCGCAGAATTTGATGTTTCGAAAGCCCGTTACCAAAAACTCGTTTTG
ATGACCGATGCCGATGTCGATGGAGCCCACATTCGTACCCTTCTTTTAACCTTGATTTATCGTTATATGAAACCAA
TCCTAGAAGCTGGTTATGTTTATATTGCCCAACCACCAATCTATGGTGTCAAGGTTGGAAGCGAGATTAAAGAATA
TATCCAGCCGGGTGCAGATCAAGAAATCAAACTCCAAGAAGCTTTAGCCCGTTATAGTGAAGGTCGTACCAAACC
GACTATTCAGCGTTATAAGGGGCTAGGTGAAATGGACGATCATCAGCTGTGGGAAACAACCATGGATCCCGAACA
TCGCTTGATGGCTAGAGTTTCTGTAGATGATGTGCAGAAGCAGATAAAATCTTTGATATGTTGA
MTEEIKNLQAQDYDASQIQVLEGLEAVRMRPGMYIGSTSKEGLHHLVWEIVDNSIDEALAGFASHIQVFIEPDDSITVVD
DGRGIPVDIQEKTGRPAVETVFTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSTQLDVHVHKNGKIHYQEYRRGHV
VADLEIVGDTDKTGTTVHFTPDPKIFTETTIFDFDKLNKRIQELAFLNRGLQISITDKRQGLEQTKHYHYEGGIASYVEYI
NENKDVIFDTPIYTDGEMDDITVEVAMQYTTGYHENVMSFANNIHTHEGGTHEQGFRTALTRVINDYARKNKLLKDN
EDNLTGEDVREGLTAVISVKHPNPQFEGQTKTKLGNSEVVKTTNRLFSEAFSDFLMENPQIAKRIVEKGILAAKARVAAK
RAREVTRKKSGLEISNLPGKLADCSSNNPAETELFIVEGDSAGGSAKSGRNREFQAILPIRGKILNVEKASMDKILANEEI
RSLFTAMGTGFGAEFDVSKARYQKLVLMTDADVDGAHIRTLLLTLIYRYMKPILEAGYVYIAQPPIYGVKVGSEIKEYI
QPGADQEIKLQEALARYSEGRTKPTIQRYKGLGEMDDHQLWETTMDPEHRLMARVSVDDVQKQIKSLICZ
ID54 1446bp
ATGAGTAGACGTTTTAAAAAATCACGTTCACAGAAAGTGAAGCGAAGTGTTAATATAGTTTTGCTGACTATTTATT
TATTGTTAGTTTGTTTTTTATTGTTCTTAATCTTTAAGTACAATATCCTTGCTTTTAGATATCTTAATCTAGTGGTAA
CTGCGTTAGTCCTACTAGTTGCCTTGGTAGGGCTACTCTTGATTATCTATAAAAAAGCTGAAAAGTTTACTATTTTT
CTGTTGGTGTTCTCTATCCTTGTCAGCTCTGTGTCGCTCTTTGCAGTACAGCAGTTTGTTGGACTGACCAATCGTTT
AAATGCGACTTCTAATTACTCAGAATATTCAATCAGTGTCGCTGTTTTAGCAGATAGTGAGATCGAAAATGTTACG
CAACTGACGAGTGTGACAGCACCGACTGGGACTAATAATGAAAATATTCAGAAATTACTAGCTGATATCAAGTCA
AGTCAGAATACCGATTTGACGGTCAACCAGAGTTCGTCTTACTTGGCAGCTTACAAGAGTTTGATTGCAGGGGAG
ACTAAGGCCATTGTCCTAAATAGTGTCTTTGAAAACATCATCGAGTCAGAGTATCCAGACTACGCATCGAAGATA
AAAAAGATTTATACTAAGGGATTCACTAAAAAAGTAGAAGCTCCTAAGACGTCTAAGAGTCAGTCTTTCAATATC
TATGTTAGTGGAATTGACACCTATGGTCCTATTAGTTCGGTGTCGCGATCAGATGTCAACATCCTGATGACTGTCA
ATCGAGATACCAAGAAAATCCTCTTGACCACAACGCCACGTGATGCCTATGTACCAATCGCAGATGGTGGAAATA
ATCAAAAAGATAAATTGACTCATGCGGGCATTTATGGAGTTGATTCGTCCATTCACACCTTAGAAAATCTCTATGG
AGTGGATATCAATTACTATGTGCGATTGAACTTCACTTCGTTTTTGAAATTGATTGATTTGTTGGGTGGAATTGATG
TTTATAATGATCAAGAATTTACTGCCCATACGAATGGAAAGTATTACCCTGCAGGCAATGTTCATCTTGATTCAGA
ACAGGCTCTCGGTTTTGTTCGTGAGCGCTACTCCCTAGCAGATGGCGATCGTGACCGCGGGCGCCATCAACAAAA
GGTGATTGTGGCTATCCTTCAAAAATTAACGTCAACCGAAGTGCTGAAAAATTATAGTACGATCATTAATAGCTTG
CAAGATTCTATCCAAACAAATATGCCACTTGAGACCATGATAAATTTGGTCAATGCTCAGTTAGAAAGTGGAGGG
AATTATAAAGTAAATTCTCAAGATTTAAAAGGGACAGGTCGGATGGATCTTCCTTCTTATGCAATGCCAGACAGTA
ACCTCTATGTGATGGAAATAGATGATAGTAGTTTAGCTGTAGTTAAAGCAGCTATACAGGATGTGATGGAGGGTA
GATGA
MSRRFKKSRSQKVKRSVNIVLLTIYLLLVCFLLFLIFKYNILAFRYLNLVVTALVLLVALVGLLLHYKKAEKFTIFLLVFS
ILVSSVSLFAVQQFVGLTNRLNATSNYSEYSISVAVLADSEIENVTQLTSVTAPTGTNNENIQKLLADIKSSQNTDLTVNQ
SSSYLAAYKSLIAGETKAIVLNSVFENIIESEYPDYASKIKKIYTKGFTKKVEAPKTSKSQSFNIYVSGIDTYGPISSVSRSDV
NILMTVNRDTKKILLTTTPRDAYVPIADGGNNQKDKLTHAGIYGVDSSIHTLENLYGVDINYYVRLNFTSFLKLIDLLGG
IDVYNDQEFTAHTNGKYYPAGNVHLDSEQALGFVRERYSLADGDRDRGRHQQKVIVAILQKLTSTEVLKNYSTIINSLQ
DSIQTNMPLETMINLVNAQLESGGNYKVNSQDLKGTGRMDLPSYAMPDSNLYVMEIDDSSLAVVKAAIQDVMEGRZ
ID55 732bp
ATGATAGACATCCATTCGCATATCGTTTTTGATGTAGATGACGGTCCCAAGTCAAGAGAGGAAAGCAAGGCTCTC
TTGGCAGAATCCTACAGACAGGGGGTGCGAACCATTGTTTCTACCTCTCACCGTCGCAAGGGCATGTTTGAAACTC
CGGAAGAGAAGATAGCAGAAAACTTTCTTCAGGTTCGGGAAATAGCTAAGGAAGTGGCGAGTGACTTGGTCATTG
CTTACGGGGCTGAAATTTATTACACACCAGATGTTCTGGATAAGCTGGAAAAAAAGCGGATTCCGACCCTCAATG
ATAGTCGTTATGCCTTGATAGAGTTTAGTATGAACACTCCTTATCGCGATATTCATAGCGCCTTGAGCAAGATCTT
GATGTTGGGAATTACTCCAGTCATTGCCCACATTGAGCGCTATGATGCTCTTGAAAATAATGAAAAACGCGTTCGA
GAACTGATCGATATGGGCTGTTACACGCAAGTAAATAGTTCACATGTCCTCAAACCCAAACTTTTTGGCGAACGTT
ATAAATTCATGAAAAAAAGAGCTCAGTATTTTTTAGAGCAGGATTTGGTTCATGTCATTGCAAGTGATATGCACAA
TCTAGACGGTAGACCTCCTCATATGGCAGAAGCATATGACCTTGTTACCCAAAAATACGGAGAAGCGAAGGCTCA
GGAACTTTTTATAGACAATCCTCGAAAAATTGTAATGGATCAACTAATTTAG
MIDIHSHIVFDVDDGPKSREESKALLAESYRQGVRTIVSTSHRRKGMFETPEEKIAENFLQVREIAKEVASDLVIAYGAEI
YYTPDVLDKLEKKRIPTLNDSRYALIEFSMNTPYRDIHSALSKCILMLGITPVIAHIERYDALENNEKRVRELIDMGCYTQV
NSSHVLKPKLFGERYKFMKKRAQYFLEQDLVHVIASDMHNLDGRPPHMAEAYDLVTQKYGEAKAQELFIDNPRKIVM
DQLIZ
ID58 3990bp
TTGATTTATATAATCGCTATCAATATAACAATGCAATCAGGAGGTTTTGCAATGAAACATGAAAAACAACAGCGT
TTTTCTATTCGTAAATACGCTGTAGGAGCAGCTTCTGTTCTAATTGGATTTGCCTTCCAAGCACAGACTGTTGCAG
CCGATGGAGTTACTCCTACTACTACAGAAAACCAACCGACCATCCATACGGTTTCTGATTCCCCTCAATCATCCGA
AAATCGGACTGAGGAAACACCTAAAGCAGTGCTTCAACCAGAAGCTCCAAAAACTGTAGAAACAGAAACTCCAG
CTACTGATAAGGTAGCTAGTCTTCCAAAAACAGAAGAAAAACCACAAGAGGAAGTTAGTTCAACTCCTAGTGATA
AAGCAGAAGTGGTAACTCCAACTTCTGCTGAAAAAGAAACTGCTAATAAAAAGGCAGAAGAAGCTAGCCCTAAA
AAGGAAGAAGCGAAAGAGGTTGATTCTAAAGAGTCAAATACAGACAAGACTGACAAGGATAAACCAGCTAAAAA
AGATGAAGCGAAAGCAGAGGCTGACAAACCGGCAACAGAGGCAGGAAAGGAACGTGCTGCAACTGTAAATGAAA
AACTAGCGAAAAAGAAAATTGTTTCTATTGATGCTGGACGTAAATATTTCTCACCAGAACAGCTCAAGGAAATCA
TCGATAAAGCGAAACATTATGGCTACACTGATTTACACCTATTAGTCGGAAATGATGGACTCCGTTTCATGTTGGA
CGATATGAGCATCACAGCTAACGGCAAGACCTATGCCAGTGACGATGTCAAACGCGCCATTGAAAAAGGTACAAA
TGATTATTACAACGATCCAAACGGCAATCACTTAACAGAAAGTCAAATGACAGATCTGATTAACTATGCCAAAGA
TAAAGGTATCGGTCTCATTCCGACAGTAAATAGTCCTGGACACATGGATGCGATTCTCAATGCCATGAAAGAATT
GGGAATCCAAAACCCTAACTTTAGCTATTTTGGGAAGAAATCAGCCCGTACTGTCGATCTTGACAACGAACAAGC
TGTCGCTTTTACAAAAGCCCTTATCGACAAGTATGCTGCTTATTTCGCGAAAAAGACTGAAATCTTCAACATCGGA
CTTGATGAATATGCCAATGATGCGACAGATGCTAAAGGTTGGAGTGTGCTTCAAGCTGATAAATACTATCCAAAC
GAAGGCTACCCTGTAAAAGGCTATGAAAAATTTATTGCCTACGCCAATGACCTCGCTCGTATTGTAAAATCGCAC
GGTCTCAAACCAATGGCTTTTAACGACGGTATCTACTACAATAGCGACACAAGCTTTGGTAGTTTTGACAAAGAC
ATCATCGTTTCTATGTGGACTGGTGGTTGGGGAGGCTACGATGTCGCTTCTTCTAAACTACTAGCTGAAAAAGGTC
ACCAAATCCTTAATACCAATGATGCTTGGTACTACGTTCTTGGACGAAACGCTGATGGCCAAGGCTGGTACAATCT
CGATCAGGGGCTCAATGGTATTAAAAACACACCAATCACTTCTGTACCAAAAACAGAAGGAGCTGATATCCCAAT
CATCGGTGGTATGGTAGCTGCTTGGGCTGACACTCCATCTGCACGTTATTCACCATCACGCCTCTTCAAACTCATG
CGTCATTTTGCAAATGCCAACGCTGAATACTTCGCAGCTGATTATGAATCTGCAGAGCAAGCACTTAACGAGGTA
CCAAAAGACCTGAACCGTTATACTGCAGAAAGCGTCACGGCCGTAAAAGAAGCTGAAAAAGCTATTCGCTCTCTC
GATAGCAACCTTAGCCGTGCCCAACAAGATACGATTGATCAAGCCATTGCTAAACTTCAAGAAACTGTCAACAAC
TTGACCCTCACGCCTGAAGCTCAAAAAGAAGAAGAAGCTAAACGTGAGGTTGAAAAACTTGCCAAAAACAAGGT
AATCTCAATCGATGCTGGACGCAAATACTTTACTCTGAACCAGCTCAAACGCATCGTAGACAAGGCCAGTGAGCT
CGGATATTCTGATGTCCATCTCCTTCTAGGAAATGACGGACTTCGCTTTCTACTCGATGATATGACCATTACTGCC
AACGGAAAAACCTATGCTAGTGATGACGTTAAAAAAGCTATTATCGAAGGAACTAAAGCTTACTACGACGATCCA
AACGGTACTGCACTAACACAGGCAGAAGTAACAGAGCTAATTGAATACGCTAAATCTAAGGACATCGGTCTCATC
CCAGCTATTAACAGTCCAGGTCACATGGATGCTATGCTGGTTGCCATGGAAAAATTAGGTATTAAAAATCCTCAA
GCCCACTTTGATAAAGTTTCAAAAACAACTATGGACTTGAAAAACGAAGAAGCGATGAACTTTGTAAAAGCCCTC
ATCGGTAAATACATGGACTTCTTTGCAGGTAAAACAAAGATTTTCAACTTTGGTACTGACGAATACGCCAACGAT
GCGACTAGTGCCCAAGGCTGGTACTACCTCAAGTGGTATCAACTCTATGGCAAATTTGCCGAATATGCCAACACC
CTCGCAGCTATGGCCAAAGAAAGAGGGCTTCAACCAATGGCCTTCAACGATGGCTTCTACTATGAAGACAAGGAC
GATGTTCAGTTTGACAAAGATGTCTTGATTTCTTACTGGTCTAAAGGCTGGTGGGGATATAACCTCGCATCACCTC
AATACCTAGCAAGCAAAGGCTATAAATTCTTGAATACCAACGGTGACTGGTACTACATTCTTGGTCAAAAACCAG
AAGATGGTGGTGGTTTCCTCAAGAAAGCTATTGAGAATACTGGAAAAACACCATTCAATCAACTAGCTTCTACCA
AATATCCTGAAGTAGATCTTCCAACAGTCGGAAGTATGCTTTCAATCTGGGCAGATAGACCAAGCGCTGAATACA
AGGAAGAGGAAATCTTTGAACTCATGACTGCCTTTGCAGACCACAACAAAGACTACTTTCGTGCTAATTATAATG
CTCTCCGCGAAGAATTAGCTAAAATTCCTACAAACTTAGAAGGATATAGTAAAGAAAGTCTTGAGGCCCTTGACG
CAGCTAAAACAGCTCTAAATTACAACCTCAACCGTAATAAACAAGCTGAGCTTGACACGCTTGTAGCCAACCTAA
AAGCCGCTCTTCAAGGCCTCAAACCAGCTGTAACTCATTCAGGAAGCCTAGATGAAAATGAAGTGGCTGCCAATG
TTGAAACCAGACCAGAACTCATCACAAGAACTGAAGAAATTCCATTTGAAGTTATCAAGAAAGAAAATCCTAACC
TCCCAGCCGGTCAGGAAAATATTATCACAGCAGGAGTCAAAGGTGAACGAACTCATTACATCTCTGTACTCACTG
AAAATGGAAAAACAACAGAAACAGTCCTTGATAGCCAGGTAACCAAAGAAGTTATAAACCAAGTGGTTGAAGTT
GGCGCTCCTGTAACTCACAAGGGTGATGAAAGTGGTCTTGCACCAACTACTGAGGTAAAACCTAGACTGGATATC
CAAGAAGAAGAAATTCCATTTACCACAGTGACTTGTGAAAATCCACTCTTACTCAAAGGAAAAACACAAGTCATT
ACTAAGGGCGTCAATGGACATCGTAGCAACTTCTACTCTGTGAGCACTTCTGCCGATGGTAAGGAAGTGAAAACA
CTTGTAAATAGTGTCGTAGCACAGGAAGCCGTTACTCAAATAGTCGAAGTCGGAACTATGGTAACACATGTAGGC
GATGAAAACGGACAAGCCGCTATTGCTGAAGAAAAACCAAAACTAGAAATCCCAAGCCAACCAGCTCCATCAAC
TGCTCCTGCTGAGGAAAGCAAAGTTCTTCCTCAAGATCCAGCTCCTGTGGTAACAGAGAAAAAACTTCCTGAAAC
AGGAACTCACGATTCTGCAGGACTAGTAGTCGCAGGACTCATGTCCACACTAGCAGCCTATGGACTCACTAAAAG
AAAAGAAGACTAA
MIYIIAINITMQSGGFAMKHEKQQRFSIRKYAVGAASVLIGFAFQAQTVAADGVTPTTTENQPTIHTVSDSPQSSENRTEE
TPKAVLQPEAPKTVETETPATDKVASLPKTEEKPQEEVSSTPSDKAEVVTPTSAEKETANKKAEEASPKKEEAKEVDSKE
SNTDKTDKDKPAKKDEAKAEADKPATEAGKERAATVNEKLAKKKIVSIDAGRKYFSPEQLKEIIDKAKHYGYTDLHLL
VGNDGLRFMLDDMSITANGKTYASDDVKRAIEKGTNDYYNDPNGNHLTESQMTDLINYAKDKGIGLIPTVNSPGHMD
AILNAMKELGIQNPNFSYFGKKSARTVDLDNEQAVAFTKALIDKYAAYFAKKTEIFNIGLDEYANDATDAKGWSVLQA
DKYYPNEGYPVKGYEKFIAYANDLARIVKSHGLKPMAFNDGIYYNSDTSFGSFDKDIIVSMWTGGWGGYDVASSKLLA
EKGHQILNTNDAWYYVLGRNADGQGWYNLDQGLNGIKNTPITSVPKTEGADIPIIGGMVAAWADTPSARYSPSRLFKL
MRHFANANAEYFAADYESAEQALNEVPKDLNRYTAESVTAVKEAEKAIRSLDSNLSRAQQDTIDQAIAKLQETVNNLT
LTPEAQKEEEAKREVEKLAKNKVISIDAGRKYFTLNQLKRIVDKASELGYSDVHLLLGNDGLRFLLDDMTITANGKTYA
SDDVKKAIIEGTKAYYDDPNGTALTQAEVTELIEYAKSKDIGLIPAINSPGHMDAMLVAMEKLGIKNPQAHFDKVSKTT
MDLKNEEAMNFVKALIGKYMDFFAGKTKIFNFGTDEYANDATSAQGWYYLKWYQLYGKFAEYANTLAAMAKERGL
QPMAFNDGFYYEDKDDVQFDKDVLISYWSKGWWGYNLASPQYLASKGYKFLNTNGDWYYILGQKPEDGGGFLKKAI
ENTGKTPFNQLASTKYPEVDLPTVGSMLSIWADRPSAEYKEEEIFELMTAFADHNKDYFRANYNALREELAKIPTNLEG
YSKESLEALDAAKTALNYNLNRNKQAELDTLVANLKAALQGLKPAVTHSGSLDENEVAANVETRPELTTRTEEIPFEVI
KKENPNLPAGQENIITAGVKGERTHYISVLTENGKTTETVLDSQVTKEVINQVVEVGAPVTHKGDESGLAPTTEVKPRL
DIQEEEIPFTTVTCENPLLLKGKTQVITKGVNGHRSNFYSVSTSADGKEVKTLVNSVVAQEAVTQIVEVGTMVTHVGDE
NGQAAIAEEKPKLEIPSQPAPSTAPAEESKVLPQDPAPVVTEKKLPETGTHDSAGLVVAGLMSTLAAYGLTKRKEDZ
ID122 825bp
ATGAACAAAAAAACAAGACAGACACTAATCGGACTGCTAGTGTTATTGCTTTTGTCTACAGGGAGCTATTATATC
AAGCAGATGCCGTCGGCACCTAATAGTCCCAAAACCAATCTTAGTCAGAAAAAACAAGCGTCTGAAGCTCCTAGT
CAAGCATTGGCAGAGAGTGTCTTAACAGACGCAGTCAAGAGTCAAATAAAGGGGAGTCTGGAGTGGAATGGCTC
AGGTGCTTTTATCGTCAATGGTAATAAAACAAATCTAGATGCCAAGGTTTCAAGTAAGCCCTACGCTGACAATAA
AACAAAGACAGTGGGCAAGGAAACTGTTCCAACCGTAGCTAATGCCCTCTTGTCTAAGGCCACTCGTCAGTACAA
GAATCGTAAAGAAACTGGGAATGGTTCAACTTCTTGGACTCCTCCAGGTTGGCATCAGGTCAAGAATCTAAAGGG
CTCTTATACCCATGCAGTCGATAGAGGTCATTTGTTAGGCTATGCCTTAATCGGTGGTTTGGATGGTTTTGATGCCT
CAACAAGCAATCCTAAAAACATTGCTGTTCAGACAGCCTGGGCAAATCAGGCACAAGCCGAGTATTCGACTGGTC
AAAACTACTATGAAAGCAAGGTGCGTAAAGCCTTGGACCAAAACAAGCGTGTCCGTTACCGTGTAACCCTTTACT
ACGCTTCAAACGAGGATTTAGTTCCCTCAGCTTCACAGATTGAAGCCAAGTCTTCGGATGGAGAATTGGAATTCA
ATGTTCTAGTTCCCAATGTTCAAAAGGGACTTCAACTGGATTACCGAACTGGAGAAGTAACTGTAACTCAGTAA
MNKKTRQTLIGLLVLLLLSTGSYYIKQMPSAPNSPKTNLSQKKQASEAPSQALAESVLTDAVKSQIKGSLEWNGSGAFIV
NGNKTNLDAKVSSKPYADNKTKTVGKETVPTVANALLSKATRQYKNRKETGNGSTSWTPPGWHQVKNLKGSYTHAV
DRGHLLGYALIGGLDGFDASTSNPKNIAVQTAWANQAQAEYSTGQNYYESKVRKALDQNKRVRYRVTLYYASNEDL
VPSASQIEAKSSDGELEFNVLVPNVQKGLQLDYRTGEVTVTQZ
ID123 225bp
GTGCTAAGATTCAGCGGATTGAGGCAAGTGATGAAGATGAATAAGAAATCAAGCTACGTAGTCAAGCGTTTACTT
TTAGTCATCATAGTACTGATTTTAGGTACTCTGGCTCTAGGAATCGGTTTAATGGTAGGTTATGGAATCTTGGGCA
AGGGTCAAGATCCATGGGCTATCCTGTCTCCAGCAAAATGGCAGGAATTGATTCATAAATTTACAGGAAATTAG
VLRFSGLRQVMKMNKKSSYVVKRLLLVIIYLILGTLALGIGLMVGYGILGKGQDPWAILSPAKWQELIHKFTGNZ
Claims (20)
1. A Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in Table 1.
2. A Streptococcus pneumoniae protein or polypeptide having a sequence selected from those shown in Table 2.
3. A protein or polypeptide as claimed in claim 1 or claim 2, provided in substantially pure form.
4. A protein or polypeptide which is substantially identical to one defined in any one of claims 1 to 3.
5. A homologue or derivative of a protein or polypeptide as defined in any one of claims 1 to 4.
6. An antigenic and/or immunogenic fragment of a protein or polypeptide as defined in Tables 1-3.
7. A nucleic acid molecule comprising or consisting of a sequence which is:
(ⅰ) any of the DNA sequences shown in Table 1 or their RNA equivalents;
(ⅱ) a sequence which is complementary to any of the sequences of (ⅰ);
(ⅲ) a sequence which codes for the same protein or polypeptide as the sequences of (ⅰ) or (ⅱ);
(ⅳ) a sequence which is substantially identical to any of the sequences of (ⅰ), (ⅱ) and (ⅲ);
(ⅴ) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 1.
8. A nucleic acid molecule comprising or consisting of a sequence which is:
(ⅰ) any of the DNA sequences shown in Table 2 or their RNA equivalents;
(ⅱ) a sequence which is complementary to any of the sequences of (ⅰ);
(ⅲ) a sequence which codes for the same protein or polypeptide as the sequences of (ⅰ) or (ⅱ);
(ⅳ) a sequence which is substantially identical to any of the sequences of (ⅰ), (ⅱ) and (ⅲ);
(ⅴ) a sequence which codes for a homologue, derivative or fragment of a protein as defined in Table 2.
9. The use of a protein or polypeptide having a sequence selected from those shown in Tables 1-3, or a homologue, derivative and/or fragment thereof, as an immunogen and/or antigen.
10. An immunogenic and/or antigenic composition comprising one or more proteins or polypeptides having a sequence selected from those shown in Tables 1-3, or homologues or derivatives thereof, and/or fragments of any of these.
11. An immunogenic and/or antigenic composition as claimed in claim 10, which is a vaccine or is for use in a diagnostic assay.
12. A vaccine as claimed in claim 11, which comprises one or more additional components selected from excipients, diluents, adjuvants and the like.
13. A vaccine composition comprising one or more nucleic acid sequences as defined in Tables 1-3.
14. A method for the detection/diagnosis of Streptococcus pneumoniae, which comprises the step of bringing a sample to be tested into contact with at least one protein or polypeptide as defined in Tables 1-3, or a homologue, derivative or fragment thereof.
15. An antibody capable of binding to a protein or polypeptide as defined in Tables 1-3, or to a homologue, derivative or fragment thereof.
16. An antibody as defined in claim 15, which is a monoclonal antibody.
17. A method for the detection/diagnosis of Streptococcus pneumoniae, which comprises the step of bringing a sample to be tested into contact with at least one antibody as defined in claim 15 or claim 16.
18. A method for the detection/diagnosis of Streptococcus pneumoniae, which comprises the step of bringing a sample to be tested into contact with at least one nucleic acid sequence as defined in claim 7 or claim 8.
19. A method of determining whether a protein or polypeptide as defined in Tables 1-3 represents a potential antimicrobial target, which comprises inactivating said protein or polypeptide and determining whether Streptococcus pneumoniae is still viable.
20. The use of an agent capable of antagonising, inhibiting or otherwise interfering with the function or expression of a protein or polypeptide as defined in Tables 1-3 in the manufacture of a medicament for use in the treatment or prophylaxis of Streptococcus pneumoniae infection.
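The sequence relationships recited in items (ⅰ) to (ⅲ) of claims 7 and 8 (RNA equivalent, complementary sequence, and a different sequence coding for the same polypeptide) can be illustrated with a minimal Python sketch. It is not part of the patent. It assumes the standard genetic code and writes a stop codon as `Z`, as the sequence tables appear to do. The function names and the `synonymous` example are hypothetical, and the 21-nucleotide fragment is the start of the ID118 DNA sequence.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def rna_equivalent(dna: str) -> str:
    """Item (i): the RNA equivalent replaces T with U."""
    return dna.upper().replace("T", "U")


def complementary_strand(dna: str) -> str:
    """Item (ii): the complementary strand, written 5'->3' (reverse complement)."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def translate(dna: str, stop_symbol: str = "Z") -> str:
    """Translate codon by codon; stop codons are written as stop_symbol.

    Start codons such as GTG or TTG are translated literally (V, L).
    IUPAC ambiguity codes (for example W) are not handled and raise KeyError.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE[c] for c in codons).replace("*", stop_symbol)


fragment = "ATGATAAAGAAAGGAAAGGGC"    # first 21 nt of the ID118 DNA sequence
synonymous = "ATGATCAAAAAGGGTAAAGGA"  # hypothetical: different codons, same peptide

print(translate(fragment))             # polypeptide encoded by the fragment
print(rna_equivalent(fragment))        # item (i)
print(complementary_strand(fragment))  # item (ii)
print(translate(synonymous) == translate(fragment))  # item (iii)
```

Because the genetic code is degenerate, many distinct DNA sequences satisfy item (ⅲ) for any one polypeptide. The sketch checks one hand-written alternative rather than enumerating them.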
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB9816336.3A GB9816336D0 (en) | 1998-07-27 | 1998-07-27 | Proteins |
GB9816336.3 | 1998-07-27 | ||
US12532999P | 1999-03-19 | 1999-03-19 | |
US60/125,329 | 1999-03-19 |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007101283898A Division CN101108877A (en) | 1998-07-27 | 1999-07-27 | Nucleic acids and proteins from streptococcus pneumoniae |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1318103A (en) | 2001-10-17 |
Family
ID=26314124
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN99810978A Pending CN1318103A (en) | 1998-07-27 | 1999-07-27 | Nucleic acids and proteins from streptococcus pneumoniae |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1144640A3 (en) |
JP (2) | JP2002521058A (en) |
CN (1) | CN1318103A (en) |
WO (1) | WO2000006738A2 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101155833B (en) * | 2005-04-08 | 2011-04-20 | Wyeth | Separation of contaminants from Streptococcus pneumoniae polysaccharide by pH manipulation |
CN103834667A (en) * | 2013-12-31 | 2014-06-04 | Li Yuexi | Chemosynthetic extracellular region gene fragment of Streptococcus pneumoniae PspA protein, and expression and application thereof |
Families Citing this family (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6800744B1 (en) | 1997-07-02 | 2004-10-05 | Genome Therapeutics Corporation | Nucleic acid and amino acid sequences relating to Streptococcus pneumoniae for diagnostics and therapeutics |
ES2512496T3 (en) | 1998-07-22 | 2014-10-24 | Stichting Dienst Landbouwkundig Onderzoek | Vaccines and diagnostic tests for Streptococcus suis |
US7128918B1 (en) | 1998-12-23 | 2006-10-31 | Id Biomedical Corporation | Streptococcus antigens |
EP1950302B1 (en) * | 1998-12-23 | 2012-12-05 | ID Biomedical Corporation of Quebec | Streptococcus antigens |
TR200200633T2 (en) * | 1998-12-23 | 2002-06-21 | Shire Biochem Inc. | New streptococcus antigens |
US6887480B1 (en) | 1999-06-10 | 2005-05-03 | Medimmune, Inc. | Streptococcus pneumoniae proteins and vaccines |
AU777856B2 (en) * | 1999-06-10 | 2004-11-04 | Human Genome Sciences, Inc. | Streptococcus pneumoniae proteins and vaccines |
DE10012370A1 (en) * | 2000-03-14 | 2001-09-27 | Chiron Behring Gmbh & Co | Use of oil-in-water emulsion as vaccine adjuvant, particularly for influenza and pneumococcal vaccines, administered at different site from the vaccine |
US7074415B2 (en) | 2000-06-20 | 2006-07-11 | Id Biomedical Corporation | Streptococcus antigens |
EP1205552A1 (en) * | 2000-11-09 | 2002-05-15 | ID-Lelystad, Instituut voor Dierhouderij en Diergezondheid B.V. | Virulence of streptococci, involving ORFs from Streptococcus suis |
GB0107661D0 (en) | 2001-03-27 | 2001-05-16 | Chiron Spa | Staphylococcus aureus |
GB0107658D0 (en) * | 2001-03-27 | 2001-05-16 | Chiron Spa | Streptococcus pneumoniae |
GB0108079D0 (en) * | 2001-03-30 | 2001-05-23 | Microbial Technics Ltd | Protein |
US7262024B2 (en) | 2001-12-20 | 2007-08-28 | Id Biomedical Corporation | Streptococcus antigens |
AU2003286056A1 (en) * | 2002-11-26 | 2004-06-18 | Id Biomedical Corporation | Streptococcus pneumoniae surface polypeptides |
EP2311989A1 (en) * | 2003-04-15 | 2011-04-20 | Intercell AG | S. pneumoniae antigens |
GB0714963D0 (en) * | 2007-08-01 | 2007-09-12 | Novartis Ag | Compositions comprising antigens |
MY150481A (en) | 2008-03-03 | 2014-01-30 | Irm Llc | Compounds and compositions as tlr activity modulators |
US8241643B2 (en) | 2008-03-17 | 2012-08-14 | Intercell Ag | Peptides protective against S. pneumoniae and compositions, methods and uses relating thereto |
ATE523204T1 (en) | 2009-02-16 | 2011-09-15 | Karlsruher Inst Technologie | CD44V6 PEPTIDES AS BACTERIAL INFECTION INHIBITORS |
US9517263B2 (en) | 2009-06-10 | 2016-12-13 | Glaxosmithkline Biologicals Sa | Benzonaphthyridine-containing vaccines |
KR101748453B1 (en) | 2009-06-29 | 2017-06-16 | 제노시아 바이오사이언스, 인크. | Vaccines and Compositions Against Streptococcus Pneumoniae |
TWI445708B (en) | 2009-09-02 | 2014-07-21 | Irm Llc | Compounds and compositions as tlr activity modulators |
ES2443952T3 (en) | 2009-09-02 | 2014-02-21 | Novartis Ag | Immunogenic compositions that include modulators of TLR activity |
EP2475385A1 (en) | 2009-09-10 | 2012-07-18 | Novartis AG | Combination vaccines against respiratory tract diseases |
WO2011057148A1 (en) | 2009-11-05 | 2011-05-12 | Irm Llc | Compounds and compositions as tlr-7 activity modulators |
SG181712A1 (en) | 2009-12-15 | 2012-07-30 | Novartis Ag | Homogeneous suspension of immunopotentiating compounds and uses thereof |
US20130039947A1 (en) | 2010-03-12 | 2013-02-14 | Children's Medical Center Corporation | Novel immunogens and methods for discovery and screening thereof |
CA2792938C (en) | 2010-03-23 | 2018-07-31 | Irm Llc | Compounds (cystein based lipopeptides) and compositions as tlr2 agonists used for treating infections, inflammations, respiratory diseases etc. |
WO2012072769A1 (en) | 2010-12-01 | 2012-06-07 | Novartis Ag | Pneumococcal rrgb epitopes and clade combinations |
JP6126993B2 (en) | 2011-01-20 | 2017-05-10 | ジェノセア バイオサイエンシーズ, インコーポレイテッド | Vaccines and compositions against Streptococcus pneumoniae |
US20150132339A1 (en) | 2012-03-07 | 2015-05-14 | Novartis Ag | Adjuvanted formulations of streptococcus pneumoniae antigens |
CN105188747A (en) | 2013-02-01 | 2015-12-23 | 葛兰素史密斯克莱生物公司 | Intradermal delivery of immunological compositions comprising TOLL-like receptor agonists |
ES2959258T3 (en) * | 2013-02-07 | 2024-02-22 | Childrens Medical Center | Protein antigens that provide protection against pneumococcal colonization and/or disease |
BR112020016314A2 (en) | 2018-02-12 | 2020-12-15 | Inimmune Corporation | PHARMACEUTICALLY ACCEPTABLE COMPOUNDS OR SALTS, PHARMACEUTICAL COMPOSITION, KIT, AND, METHODS FOR ELICITATING, INTENSIFYING OR MODIFYING AN IMMUNOLOGICAL RESPONSE, TO TREAT, PREVENT OR REDUCE THE SUSCETIBILITY TO CANCER, TO REDUCE, UNDERSTAND TREAT, PREVENT OR REDUCE SUSCEPTIBILITY TO AN ALLERGY, TO TREAT, PREVENT OR REDUCE SUSCETIBILITY TO AUTOIMMUNE AFFECTION, TO TREAT, PREVENT OR REDUCE SUSCETIBILITY IN A SUBJECT TO BACTERIAL INFECTION, ALTERNATE, VENEER, NAVAL, NAVARI TREAT, PREVENT OR REDUCE SUSCEPTIBILITY TO AUTOIMMUNITY, ALLERGY, ISCHEMIA OR SEPSIS REPERFUSION, TO TREAT, PREVENT OR REDUCE THE GRAVITY OF EPILETIC ATTACKS AND TO TREAT, PREVENT OR REDUCE THE MACANTIC HERITAGE OF HERITAGE, |
WO2020056202A1 (en) | 2018-09-12 | 2020-03-19 | Affinivax, Inc. | Multivalent pneumococcal vaccines |
SG11202101973YA (en) * | 2018-09-12 | 2021-03-30 | Childrens Medical Ct Corp | Pneumococcal fusion protein vaccines |
CA3198876A1 (en) | 2020-11-04 | 2022-05-12 | Eligo Bioscience | Cutibacterium acnes recombinant phages, method of production and uses thereof |
CA3231684A1 (en) | 2021-09-09 | 2023-03-16 | Affinivax, Inc. | Multivalent pneumococcal vaccines |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5928900A (en) * | 1993-09-01 | 1999-07-27 | The Rockefeller University | Bacterial exported proteins and acellular vaccines based thereon |
EP0907738A1 (en) * | 1996-04-02 | 1999-04-14 | Smithkline Beecham Corporation | Novel compounds |
US6294661B1 (en) * | 1996-05-14 | 2001-09-25 | Smithkline Beecham Corporation | Compounds |
DE69737125T3 (en) * | 1996-10-31 | 2015-02-26 | Human Genome Sciences, Inc. | Streptococcus pneumoniae antigens and vaccines |
US6074847A (en) * | 1996-12-13 | 2000-06-13 | Eli Lilly And Company | Streptococcus pneumoniae gene sequence HI1146 |
GB9700939D0 (en) * | 1997-01-17 | 1997-03-05 | Microbial Technics Limited | Therapy |
- 1999
  - 1999-07-27 EP EP99934990A patent/EP1144640A3/en not_active Withdrawn
  - 1999-07-27 JP JP2000562520A patent/JP2002521058A/en active Pending
  - 1999-07-27 CN CN99810978A patent/CN1318103A/en active Pending
  - 1999-07-27 WO PCT/GB1999/002452 patent/WO2000006738A2/en not_active Application Discontinuation
- 2007
  - 2007-08-28 JP JP2007221409A patent/JP2008022856A/en active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101155833B (en) * | 2005-04-08 | 2011-04-20 | 惠氏公司 | Separation of contaminants from streptococcus pneumoniae polysaccharide by ph manipulation |
CN103834667A (en) * | 2013-12-31 | 2014-06-04 | 李越希 | Chemosynthetic extracellular region gene fragment of streptococcus pneumonia PspA protein, and expression and application thereof |
CN103834667B (en) * | 2013-12-31 | 2016-08-17 | 李越希 | Chemosynthetic extracellular region gene fragment of streptococcus pneumonia PspA protein, and expression and application thereof |
Also Published As
Publication number | Publication date |
---|---|
WO2000006738A3 (en) | 2001-08-23 |
JP2008022856A (en) | 2008-02-07 |
JP2002521058A (en) | 2002-07-16 |
EP1144640A2 (en) | 2001-10-17 |
EP1144640A3 (en) | 2001-11-28 |
WO2000006738A2 (en) | 2000-02-10 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN1318103A (en) | Nucleic acids and proteins from streptococcus pneumoniae | |
US8101187B2 (en) | Secreted Streptococcus pneumoniae proteins | |
AU2005203189B2 (en) | Anti-bacterial vaccine compositions | |
JP2008263970A (en) | Streptococcus pneumoniae protein and nucleic acid molecule | |
JP6262728B2 (en) | Attenuated Streptococcus swiss vaccine and its production method and use | |
US20220127628A1 (en) | A genetically modified lactobacillus and uses thereof | |
CN110582296B (en) | Attenuation of bacterial virulence by attenuation of bacterial folate transport | |
JP2009077713A (en) | NUCLEIC ACIDS ENCODING RECOMBINANT 56 AND 82 kDa ANTIGENS FROM GAMETOCYTES OF EIMERIA MAXIMA AND THEIR USE | |
DK2450054T3 (en) | New virulence factors of Streptococcus pneumoniae | |
EP1368456B1 (en) | Anti-bacterial vaccine compositions | |
EP1100920A2 (en) | Nucleic acids and proteins from group b streptococcus | |
US8632784B2 (en) | Nucleic acids and proteins from Streptococcus pneumoniae | |
Fatehi et al. | Oral vaccination with novel Lactococcus lactis mucosal live vector-secreting Brucella lumazine synthase (BLS) protein induces humoral and cellular immune protection against Brucella abortus | |
CN1215729A (en) | Clostridium perfringens vaccine | |
US9493519B2 (en) | Toxin in type A Clostridium perfringens | |
CN1367833A (en) | Streptococcus pneumoniae proteins and nucleic acid molecules | |
KR101845571B1 (en) | Marker vaccine for classical swine fever | |
CN101108877A (en) | Nucleic acids and proteins from streptococcus pneumoniae | |
KR102542601B1 (en) | Novel porcine epidemic diarrhea virus isolate and use thereof | |
EP1624064A2 (en) | Nucleic acids and proteins from streptococcus pneumoniae | |
EP1801218A2 (en) | Nucleic acids and proteins from streptococcus pneumoniae | |
AU2013201267B2 (en) | Anti-bacterial vaccine compositions | |
Camus | Pathobiology of Streptococcus iniae infections in cultured tilapia | |
Yung | Characterization of virulence factors in an M50 group A streptococcus strain virulent for mice |
Legal Events
Code | Title | Date | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |