CN101848929A - 肺炎链球菌菌毛抗原 - Google Patents
肺炎链球菌菌毛抗原 Download PDFInfo
- Publication number
- CN101848929A CN101848929A CN200880100325A CN200880100325A CN101848929A CN 101848929 A CN101848929 A CN 101848929A CN 200880100325 A CN200880100325 A CN 200880100325A CN 200880100325 A CN200880100325 A CN 200880100325A CN 101848929 A CN101848929 A CN 101848929A
- Authority
- CN
- China
- Prior art keywords
- seq
- pili
- streptococcus pneumoniae
- sequence
- antibody
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 241000193998 Streptococcus pneumoniae Species 0.000 title claims abstract description 315
- 229940031000 streptococcus pneumoniae Drugs 0.000 title claims abstract description 301
- 239000000427 antigen Substances 0.000 title description 167
- 108091007433 antigens Proteins 0.000 title description 153
- 102000036639 antigens Human genes 0.000 title description 153
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 280
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 236
- 229920001184 polypeptide Polymers 0.000 claims abstract description 234
- 239000000203 mixture Substances 0.000 claims abstract description 201
- 238000000034 method Methods 0.000 claims abstract description 193
- 230000002163 immunogen Effects 0.000 claims abstract description 68
- 239000012634 fragment Substances 0.000 claims abstract description 48
- 208000015181 infectious disease Diseases 0.000 claims abstract description 41
- 239000003112 inhibitor Substances 0.000 claims abstract description 8
- 210000004027 cell Anatomy 0.000 claims description 94
- 108090000623 proteins and genes Proteins 0.000 claims description 86
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 77
- 239000000523 sample Substances 0.000 claims description 64
- 102000004169 proteins and genes Human genes 0.000 claims description 59
- 108020004707 nucleic acids Proteins 0.000 claims description 57
- 102000039446 nucleic acids Human genes 0.000 claims description 57
- 150000007523 nucleic acids Chemical class 0.000 claims description 57
- 235000018102 proteins Nutrition 0.000 claims description 56
- 150000001875 compounds Chemical class 0.000 claims description 53
- 238000012360 testing method Methods 0.000 claims description 46
- 102000004190 Enzymes Human genes 0.000 claims description 41
- 108090000790 Enzymes Proteins 0.000 claims description 41
- 235000001014 amino acid Nutrition 0.000 claims description 41
- 210000002421 cell wall Anatomy 0.000 claims description 40
- 150000001413 amino acids Chemical class 0.000 claims description 39
- 230000001580 bacterial effect Effects 0.000 claims description 38
- 230000004044 response Effects 0.000 claims description 35
- 238000011282 treatment Methods 0.000 claims description 30
- 108010000916 Fimbriae Proteins Proteins 0.000 claims description 28
- 238000009396 hybridization Methods 0.000 claims description 27
- 239000003153 chemical reaction reagent Substances 0.000 claims description 25
- 239000003795 chemical substances by application Substances 0.000 claims description 21
- 238000006384 oligomerization reaction Methods 0.000 claims description 17
- 238000000926 separation method Methods 0.000 claims description 15
- 239000011159 matrix material Substances 0.000 claims description 14
- 230000008034 disappearance Effects 0.000 claims description 13
- 238000001976 enzyme digestion Methods 0.000 claims description 13
- 238000010008 shearing Methods 0.000 claims description 13
- 239000003053 toxin Substances 0.000 claims description 13
- 231100000765 toxin Toxicity 0.000 claims description 13
- 229920000642 polymer Polymers 0.000 claims description 12
- 210000002919 epithelial cell Anatomy 0.000 claims description 11
- 108091033319 polynucleotide Proteins 0.000 claims description 11
- 102000040430 polynucleotide Human genes 0.000 claims description 11
- 239000002157 polynucleotide Substances 0.000 claims description 11
- 230000008569 process Effects 0.000 claims description 11
- 239000007790 solid phase Substances 0.000 claims description 11
- 206010035664 Pneumonia Diseases 0.000 claims description 10
- 238000004458 analytical method Methods 0.000 claims description 10
- 241000194017 Streptococcus Species 0.000 claims description 9
- 230000001413 cellular effect Effects 0.000 claims description 9
- 238000003780 insertion Methods 0.000 claims description 9
- 230000037431 insertion Effects 0.000 claims description 9
- 239000002773 nucleotide Substances 0.000 claims description 9
- 125000003729 nucleotide group Chemical group 0.000 claims description 9
- 238000005204 segregation Methods 0.000 claims description 9
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 claims description 7
- 108010013639 Peptidoglycan Proteins 0.000 claims description 7
- 230000003750 conditioning effect Effects 0.000 claims description 7
- 230000005764 inhibitory process Effects 0.000 claims description 7
- 108010009719 mutanolysin Proteins 0.000 claims description 7
- 125000000539 amino acid group Chemical group 0.000 claims description 6
- 230000000295 complement effect Effects 0.000 claims description 6
- 238000000432 density-gradient centrifugation Methods 0.000 claims description 6
- 101000925662 Enterobacteria phage PRD1 Endolysin Proteins 0.000 claims description 5
- 230000002685 pulmonary effect Effects 0.000 claims description 5
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims description 4
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims description 4
- 230000000903 blocking effect Effects 0.000 claims description 4
- 230000009467 reduction Effects 0.000 claims description 4
- 210000002966 serum Anatomy 0.000 claims description 4
- 241000894007 species Species 0.000 claims description 4
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 claims description 3
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 claims description 3
- 239000002872 contrast media Substances 0.000 claims description 3
- 241000683224 Streptococcus pneumoniae TIGR4 Species 0.000 claims description 2
- 230000015556 catabolic process Effects 0.000 claims description 2
- 238000006731 degradation reaction Methods 0.000 claims description 2
- 238000001641 gel filtration chromatography Methods 0.000 claims description 2
- 210000004072 lung Anatomy 0.000 claims description 2
- 231100000167 toxic agent Toxicity 0.000 claims 2
- 239000003440 toxic substance Substances 0.000 claims 2
- 101710163270 Nuclease Proteins 0.000 claims 1
- 238000011091 antibody purification Methods 0.000 claims 1
- 230000003053 immunization Effects 0.000 abstract description 10
- 238000002649 immunization Methods 0.000 abstract description 10
- 230000003308 immunostimulating effect Effects 0.000 abstract description 9
- 230000001225 therapeutic effect Effects 0.000 abstract description 8
- 229960001438 immunostimulant agent Drugs 0.000 abstract description 3
- 239000003022 immunostimulating agent Substances 0.000 abstract description 3
- 230000002401 inhibitory effect Effects 0.000 abstract description 2
- 238000004519 manufacturing process Methods 0.000 abstract description 2
- 230000000069 prophylactic effect Effects 0.000 abstract 1
- 229960005486 vaccine Drugs 0.000 description 71
- 239000002671 adjuvant Substances 0.000 description 61
- 238000002360 preparation method Methods 0.000 description 41
- 241000894006 Bacteria Species 0.000 description 37
- 230000008878 coupling Effects 0.000 description 37
- 238000010168 coupling process Methods 0.000 description 37
- 238000005859 coupling reaction Methods 0.000 description 37
- 229940088598 enzyme Drugs 0.000 description 36
- 229940024606 amino acid Drugs 0.000 description 33
- 108020004414 DNA Proteins 0.000 description 25
- 239000011521 glass Substances 0.000 description 24
- 239000002245 particle Substances 0.000 description 23
- 239000000126 substance Substances 0.000 description 22
- 230000000694 effects Effects 0.000 description 21
- 239000000758 substrate Substances 0.000 description 21
- 230000004927 fusion Effects 0.000 description 20
- 239000000243 solution Substances 0.000 description 20
- 201000005010 Streptococcus pneumonia Diseases 0.000 description 19
- 230000036039 immunity Effects 0.000 description 19
- 244000005700 microbiome Species 0.000 description 19
- 241000700605 Viruses Species 0.000 description 18
- 239000000463 material Substances 0.000 description 18
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 18
- 229930182490 saponin Natural products 0.000 description 18
- 150000007949 saponins Chemical class 0.000 description 18
- 239000002158 endotoxin Substances 0.000 description 17
- 229920006008 lipopolysaccharide Polymers 0.000 description 17
- 241000588724 Escherichia coli Species 0.000 description 16
- -1 Xie Ansuan Chemical compound 0.000 description 15
- 238000005516 engineering process Methods 0.000 description 15
- 108091034117 Oligonucleotide Proteins 0.000 description 14
- 239000003814 drug Substances 0.000 description 14
- 239000012530 fluid Substances 0.000 description 13
- 239000012528 membrane Substances 0.000 description 13
- 210000004379 membrane Anatomy 0.000 description 13
- 241000701806 Human papillomavirus Species 0.000 description 12
- 239000002253 acid Substances 0.000 description 12
- 241000282414 Homo sapiens Species 0.000 description 11
- 230000000890 antigenic effect Effects 0.000 description 11
- 239000007764 o/w emulsion Substances 0.000 description 11
- 239000000047 product Substances 0.000 description 11
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 10
- 241000233866 Fungi Species 0.000 description 10
- 241000699666 Mus <mouse, genus> Species 0.000 description 10
- 241000223238 Trichophyton Species 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 238000001514 detection method Methods 0.000 description 10
- 239000000839 emulsion Substances 0.000 description 10
- 229940023064 escherichia coli Drugs 0.000 description 10
- 150000004676 glycans Chemical class 0.000 description 10
- 229920001282 polysaccharide Polymers 0.000 description 10
- 239000005017 polysaccharide Substances 0.000 description 10
- 241000725303 Human immunodeficiency virus Species 0.000 description 9
- 208000019802 Sexually transmitted disease Diseases 0.000 description 9
- 238000013461 design Methods 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 229940079593 drug Drugs 0.000 description 9
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 9
- 230000001717 pathogenic effect Effects 0.000 description 9
- 102000004127 Cytokines Human genes 0.000 description 8
- 108090000695 Cytokines Proteins 0.000 description 8
- 101710116435 Outer membrane protein Proteins 0.000 description 8
- 230000004087 circulation Effects 0.000 description 8
- 238000001784 detoxification Methods 0.000 description 8
- 201000010099 disease Diseases 0.000 description 8
- 230000002538 fungal effect Effects 0.000 description 8
- 230000005730 ADP ribosylation Effects 0.000 description 7
- 241000193738 Bacillus anthracis Species 0.000 description 7
- 238000011156 evaluation Methods 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 229960003971 influenza vaccine Drugs 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 7
- 238000002493 microarray Methods 0.000 description 7
- 238000012544 monitoring process Methods 0.000 description 7
- 231100000252 nontoxic Toxicity 0.000 description 7
- 230000003000 nontoxic effect Effects 0.000 description 7
- 102000005962 receptors Human genes 0.000 description 7
- 108020003175 receptors Proteins 0.000 description 7
- 210000002345 respiratory system Anatomy 0.000 description 7
- 241000251730 Chondrichthyes Species 0.000 description 6
- 241000194032 Enterococcus faecalis Species 0.000 description 6
- 241000192125 Firmicutes Species 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- GOWLTLODGKPXMN-MEKRSRHXSA-N OM-174 Chemical compound O1[C@H](OP(O)(O)=O)[C@H](NC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](O)[C@H](O)[C@H]1CO[C@H]1[C@H](NC(=O)C[C@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCC)[C@@H](O)[C@H](OP(O)(O)=O)[C@@H](CO)O1 GOWLTLODGKPXMN-MEKRSRHXSA-N 0.000 description 6
- 241000193985 Streptococcus agalactiae Species 0.000 description 6
- 241000193996 Streptococcus pyogenes Species 0.000 description 6
- 150000001336 alkenes Chemical class 0.000 description 6
- 238000013459 approach Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 6
- 238000005336 cracking Methods 0.000 description 6
- 238000013016 damping Methods 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 229910052500 inorganic mineral Inorganic materials 0.000 description 6
- 230000000968 intestinal effect Effects 0.000 description 6
- 238000002372 labelling Methods 0.000 description 6
- 229930182817 methionine Natural products 0.000 description 6
- 239000011707 mineral Substances 0.000 description 6
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 6
- 229920000053 polysorbate 80 Polymers 0.000 description 6
- 241000712461 unidentified influenza virus Species 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- 241000193830 Bacillus <bacterium> Species 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- 101710146739 Enterotoxin Proteins 0.000 description 5
- 101710154606 Hemagglutinin Proteins 0.000 description 5
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 5
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 5
- 101710176177 Protein A56 Proteins 0.000 description 5
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 5
- 241000725643 Respiratory syncytial virus Species 0.000 description 5
- 241001147691 Staphylococcus saprophyticus Species 0.000 description 5
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 5
- 239000004473 Threonine Substances 0.000 description 5
- 208000002474 Tinea Diseases 0.000 description 5
- 241000893966 Trichophyton verrucosum Species 0.000 description 5
- 159000000013 aluminium salts Chemical class 0.000 description 5
- 229910000329 aluminium sulfate Inorganic materials 0.000 description 5
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 5
- 229940032049 enterococcus faecalis Drugs 0.000 description 5
- 239000000147 enterotoxin Substances 0.000 description 5
- 231100000655 enterotoxin Toxicity 0.000 description 5
- 238000009472 formulation Methods 0.000 description 5
- 239000000185 hemagglutinin Substances 0.000 description 5
- 239000002955 immunomodulating agent Substances 0.000 description 5
- 229940121354 immunomodulator Drugs 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 231100000956 nontoxicity Toxicity 0.000 description 5
- 210000001331 nose Anatomy 0.000 description 5
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 5
- 230000003389 potentiating effect Effects 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 230000008521 reorganization Effects 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 230000004936 stimulating effect Effects 0.000 description 5
- 229940030998 streptococcus agalactiae Drugs 0.000 description 5
- 238000005406 washing Methods 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- 241000186361 Actinobacteria <class> Species 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 241001647372 Chlamydia pneumoniae Species 0.000 description 4
- 241000193468 Clostridium perfringens Species 0.000 description 4
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 4
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 4
- 108090001005 Interleukin-6 Proteins 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- 241000712079 Measles morbillivirus Species 0.000 description 4
- 241001460074 Microsporum distortum Species 0.000 description 4
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 4
- 241000588650 Neisseria meningitidis Species 0.000 description 4
- 108010006232 Neuraminidase Proteins 0.000 description 4
- 102000005348 Neuraminidase Human genes 0.000 description 4
- KDLHZDBZIXYQEI-UHFFFAOYSA-N Palladium Chemical compound [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 4
- 201000005702 Pertussis Diseases 0.000 description 4
- 241000191940 Staphylococcus Species 0.000 description 4
- 241000191967 Staphylococcus aureus Species 0.000 description 4
- 244000057717 Streptococcus lactis Species 0.000 description 4
- 206010043376 Tetanus Diseases 0.000 description 4
- 241000893962 Trichophyton equinum Species 0.000 description 4
- 241000607626 Vibrio cholerae Species 0.000 description 4
- 241000607479 Yersinia pestis Species 0.000 description 4
- 229940061720 alpha hydroxy acid Drugs 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 239000000227 bioadhesive Substances 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 4
- HOPZBJPSUKPLDT-UHFFFAOYSA-N imidazo[4,5-h]quinolin-2-one Chemical class C1=CN=C2C3=NC(=O)N=C3C=CC2=C1 HOPZBJPSUKPLDT-UHFFFAOYSA-N 0.000 description 4
- 230000002584 immunomodulator Effects 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 230000016379 mucosal immune response Effects 0.000 description 4
- 210000004400 mucous membrane Anatomy 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 239000003960 organic solvent Substances 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 210000003491 skin Anatomy 0.000 description 4
- PRXRUNOAOLTIEF-WUOFIQDXSA-N sorbitan trioleate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(OC(=O)CCCCCCC\C=C\CCCCCCCC)C1OCC(O)C1OC(=O)CCCCCCC\C=C\CCCCCCCC PRXRUNOAOLTIEF-WUOFIQDXSA-N 0.000 description 4
- 229940118696 vibrio cholerae Drugs 0.000 description 4
- 239000000277 virosome Substances 0.000 description 4
- RKDVKSZUMVYZHH-UHFFFAOYSA-N 1,4-dioxane-2,5-dione Chemical compound O=C1COC(=O)CO1 RKDVKSZUMVYZHH-UHFFFAOYSA-N 0.000 description 3
- 241000186045 Actinomyces naeslundii Species 0.000 description 3
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 3
- 241000222122 Candida albicans Species 0.000 description 3
- 108010039939 Cell Wall Skeleton Proteins 0.000 description 3
- 241000606153 Chlamydia trachomatis Species 0.000 description 3
- 102000008186 Collagen Human genes 0.000 description 3
- 108010035532 Collagen Proteins 0.000 description 3
- 208000035473 Communicable disease Diseases 0.000 description 3
- 241000186216 Corynebacterium Species 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 108010049003 Fibrinogen Proteins 0.000 description 3
- 102000008946 Fibrinogen Human genes 0.000 description 3
- 108010067306 Fibronectins Proteins 0.000 description 3
- 102000016359 Fibronectins Human genes 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 208000037262 Hepatitis delta Diseases 0.000 description 3
- 241000724709 Hepatitis delta virus Species 0.000 description 3
- 241000701085 Human alphaherpesvirus 3 Species 0.000 description 3
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 3
- 108090000978 Interleukin-4 Proteins 0.000 description 3
- 108010002616 Interleukin-5 Proteins 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- 208000007764 Legionnaires' Disease Diseases 0.000 description 3
- 241000186779 Listeria monocytogenes Species 0.000 description 3
- 208000016604 Lyme disease Diseases 0.000 description 3
- 102000018697 Membrane Proteins Human genes 0.000 description 3
- 108010052285 Membrane Proteins Proteins 0.000 description 3
- 241001480037 Microsporum Species 0.000 description 3
- 241001529936 Murinae Species 0.000 description 3
- 241000186359 Mycobacterium Species 0.000 description 3
- 108010064696 N,O-diacetylmuramidase Proteins 0.000 description 3
- 241000893976 Nannizzia gypsea Species 0.000 description 3
- 101710203389 Outer membrane porin F Proteins 0.000 description 3
- 101710160102 Outer membrane protein B Proteins 0.000 description 3
- 108010067902 Peptide Library Proteins 0.000 description 3
- 108090000279 Peptidyltransferases Proteins 0.000 description 3
- 241000235645 Pichia kudriavzevii Species 0.000 description 3
- 229920002732 Polyanhydride Polymers 0.000 description 3
- 229920000331 Polyhydroxybutyrate Polymers 0.000 description 3
- 241000605862 Porphyromonas gingivalis Species 0.000 description 3
- 241000288906 Primates Species 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- 241000293871 Salmonella enterica subsp. enterica serovar Typhi Species 0.000 description 3
- 241000219287 Saponaria Species 0.000 description 3
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 3
- 241000191963 Staphylococcus epidermidis Species 0.000 description 3
- 229930182558 Sterol Natural products 0.000 description 3
- 230000024932 T cell mediated immunity Effects 0.000 description 3
- 108020005038 Terminator Codon Proteins 0.000 description 3
- 210000000447 Th1 cell Anatomy 0.000 description 3
- 108010060818 Toll-Like Receptor 9 Proteins 0.000 description 3
- 102100033117 Toll-like receptor 9 Human genes 0.000 description 3
- 241001045770 Trichophyton mentagrophytes Species 0.000 description 3
- 108010067390 Viral Proteins Proteins 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 230000000692 anti-sense effect Effects 0.000 description 3
- 238000003705 background correction Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 229920002988 biodegradable polymer Polymers 0.000 description 3
- 239000004621 biodegradable polymer Substances 0.000 description 3
- 210000004520 cell wall skeleton Anatomy 0.000 description 3
- 230000036755 cellular response Effects 0.000 description 3
- 229940038705 chlamydia trachomatis Drugs 0.000 description 3
- 235000012000 cholesterol Nutrition 0.000 description 3
- 229920001436 collagen Polymers 0.000 description 3
- 229920001577 copolymer Polymers 0.000 description 3
- 239000006059 cover glass Substances 0.000 description 3
- 210000004443 dendritic cell Anatomy 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- 206010013023 diphtheria Diseases 0.000 description 3
- 229960003983 diphtheria toxoid Drugs 0.000 description 3
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 3
- 150000002148 esters Chemical class 0.000 description 3
- 229940012952 fibrinogen Drugs 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 230000012447 hatching Effects 0.000 description 3
- 230000036541 health Effects 0.000 description 3
- 208000006454 hepatitis Diseases 0.000 description 3
- 231100000283 hepatitis Toxicity 0.000 description 3
- 201000010284 hepatitis E Diseases 0.000 description 3
- DOUYETYNHWVLEO-UHFFFAOYSA-N imiquimod Chemical compound C1=CC=CC2=C3N(CC(C)C)C=NC3=C(N)N=C21 DOUYETYNHWVLEO-UHFFFAOYSA-N 0.000 description 3
- 229960002751 imiquimod Drugs 0.000 description 3
- 230000002519 immonomodulatory effect Effects 0.000 description 3
- 206010022000 influenza Diseases 0.000 description 3
- JJTUDXZGHPGLLC-UHFFFAOYSA-N lactide Chemical compound CC1OC(=O)C(C)OC1=O JJTUDXZGHPGLLC-UHFFFAOYSA-N 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- GZQKNULLWNGMCW-PWQABINMSA-N lipid A (E. coli) Chemical class O1[C@H](CO)[C@@H](OP(O)(O)=O)[C@H](OC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC)[C@@H](NC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCC)[C@@H]1OC[C@@H]1[C@@H](O)[C@H](OC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](NC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](OP(O)(O)=O)O1 GZQKNULLWNGMCW-PWQABINMSA-N 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 238000011275 oncology therapy Methods 0.000 description 3
- 244000052769 pathogen Species 0.000 description 3
- 150000003016 phosphoric acids Chemical class 0.000 description 3
- 229910052698 phosphorus Inorganic materials 0.000 description 3
- 235000013550 pizza Nutrition 0.000 description 3
- 239000005015 poly(hydroxybutyrate) Substances 0.000 description 3
- 229920001610 polycaprolactone Polymers 0.000 description 3
- 239000004632 polycaprolactone Substances 0.000 description 3
- 230000002265 prevention Effects 0.000 description 3
- 230000003449 preventive effect Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000000241 respiratory effect Effects 0.000 description 3
- 238000012552 review Methods 0.000 description 3
- 235000003702 sterols Nutrition 0.000 description 3
- 239000004094 surface-active agent Substances 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 229940055035 trichophyton verrucosum Drugs 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 2
- JYCQQPHGFMYQCF-UHFFFAOYSA-N 4-tert-Octylphenol monoethoxylate Chemical compound CC(C)(C)CC(C)(C)C1=CC=C(OCCO)C=C1 JYCQQPHGFMYQCF-UHFFFAOYSA-N 0.000 description 2
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 2
- 241000186046 Actinomyces Species 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- 241000223600 Alternaria Species 0.000 description 2
- 241000186063 Arthrobacter Species 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241001225321 Aspergillus fumigatus Species 0.000 description 2
- 241000351920 Aspergillus nidulans Species 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- 108090001008 Avidin Proteins 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 241000186000 Bifidobacterium Species 0.000 description 2
- 241001608472 Bifidobacterium longum Species 0.000 description 2
- 241000588832 Bordetella pertussis Species 0.000 description 2
- 241000714198 Caliciviridae Species 0.000 description 2
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 2
- 241000222173 Candida parapsilosis Species 0.000 description 2
- 241000222178 Candida tropicalis Species 0.000 description 2
- 101710132601 Capsid protein Proteins 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 206010008631 Cholera Diseases 0.000 description 2
- 101710117490 Circumsporozoite protein Proteins 0.000 description 2
- 241001668502 Cladophialophora carrionii Species 0.000 description 2
- 241000222290 Cladosporium Species 0.000 description 2
- 241001508813 Clavispora lusitaniae Species 0.000 description 2
- 241000193163 Clostridioides difficile Species 0.000 description 2
- 241000193155 Clostridium botulinum Species 0.000 description 2
- 241000193449 Clostridium tetani Species 0.000 description 2
- 241001478240 Coccus Species 0.000 description 2
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 2
- 201000007336 Cryptococcosis Diseases 0.000 description 2
- 241000221204 Cryptococcus neoformans Species 0.000 description 2
- 241000371644 Curvularia ravenelii Species 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 2
- 208000001490 Dengue Diseases 0.000 description 2
- 206010012310 Dengue fever Diseases 0.000 description 2
- 206010012504 Dermatophytosis Diseases 0.000 description 2
- 241000255925 Diptera Species 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000194033 Enterococcus Species 0.000 description 2
- 241000194031 Enterococcus faecium Species 0.000 description 2
- 241000991587 Enterovirus C Species 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 101710154643 Filamentous hemagglutinin Proteins 0.000 description 2
- 241000187809 Frankia Species 0.000 description 2
- 241000223218 Fusarium Species 0.000 description 2
- 241000531123 GB virus C Species 0.000 description 2
- 241000287828 Gallus gallus Species 0.000 description 2
- 108010024636 Glutathione Proteins 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 208000031886 HIV Infections Diseases 0.000 description 2
- 102100029100 Hematopoietic prostaglandin D synthase Human genes 0.000 description 2
- 241000700721 Hepatitis B virus Species 0.000 description 2
- 241000228404 Histoplasma capsulatum Species 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- 241000713340 Human immunodeficiency virus 2 Species 0.000 description 2
- 108010058683 Immobilized Proteins Proteins 0.000 description 2
- 102000008070 Interferon-gamma Human genes 0.000 description 2
- 108010074328 Interferon-gamma Proteins 0.000 description 2
- 108090000174 Interleukin-10 Proteins 0.000 description 2
- 108010065805 Interleukin-12 Proteins 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- 241000588748 Klebsiella Species 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 241000194036 Lactococcus Species 0.000 description 2
- 241000589248 Legionella Species 0.000 description 2
- 241000192132 Leuconostoc Species 0.000 description 2
- 241000192130 Leuconostoc mesenteroides Species 0.000 description 2
- 241000186781 Listeria Species 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- 108090000988 Lysostaphin Proteins 0.000 description 2
- 201000009906 Meningitis Diseases 0.000 description 2
- 241000192041 Micrococcus Species 0.000 description 2
- 241000893980 Microsporum canis Species 0.000 description 2
- 241000588655 Moraxella catarrhalis Species 0.000 description 2
- 241000235575 Mortierella Species 0.000 description 2
- 241000235395 Mucor Species 0.000 description 2
- 241001467553 Mycobacterium africanum Species 0.000 description 2
- 241000186366 Mycobacterium bovis Species 0.000 description 2
- 241000186362 Mycobacterium leprae Species 0.000 description 2
- 241000187919 Mycobacterium microti Species 0.000 description 2
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical compound ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 2
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 2
- 241000588654 Neisseria cinerea Species 0.000 description 2
- 241000588652 Neisseria gonorrhoeae Species 0.000 description 2
- 241000588649 Neisseria lactamica Species 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 206010021888 Nervous system infections Diseases 0.000 description 2
- 241000187654 Nocardia Species 0.000 description 2
- 241000187678 Nocardia asteroides Species 0.000 description 2
- 241001597008 Nomeidae Species 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 101710160104 Outer membrane protein F Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 description 2
- 241001631646 Papillomaviridae Species 0.000 description 2
- 241000526686 Paracoccidioides brasiliensis Species 0.000 description 2
- 241000606012 Pectinatus Species 0.000 description 2
- 241000192001 Pediococcus Species 0.000 description 2
- 229930182555 Penicillin Natural products 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 241000709664 Picornaviridae Species 0.000 description 2
- 241000224017 Plasmodium berghei Species 0.000 description 2
- 241000233872 Pneumocystis carinii Species 0.000 description 2
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 2
- 108010076039 Polyproteins Proteins 0.000 description 2
- 101710194807 Protective antigen Proteins 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 206010057190 Respiratory tract infections Diseases 0.000 description 2
- 241001113283 Respirovirus Species 0.000 description 2
- 241001361634 Rhizoctonia Species 0.000 description 2
- 241000702670 Rotavirus Species 0.000 description 2
- 241000192031 Ruminococcus Species 0.000 description 2
- 241000192029 Ruminococcus albus Species 0.000 description 2
- 241000315672 SARS coronavirus Species 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 241000713311 Simian immunodeficiency virus Species 0.000 description 2
- 240000002493 Smilax officinalis Species 0.000 description 2
- 235000008981 Smilax officinalis Nutrition 0.000 description 2
- 241001149962 Sporothrix Species 0.000 description 2
- 241001149963 Sporothrix schenckii Species 0.000 description 2
- 241001147695 Staphylococcus caprae Species 0.000 description 2
- 241000192099 Staphylococcus schleiferi Species 0.000 description 2
- 108010090804 Streptavidin Proteins 0.000 description 2
- 206010061372 Streptococcal infection Diseases 0.000 description 2
- 241000194048 Streptococcus equi Species 0.000 description 2
- 241000120569 Streptococcus equi subsp. zooepidemicus Species 0.000 description 2
- 241000194026 Streptococcus gordonii Species 0.000 description 2
- 235000014897 Streptococcus lactis Nutrition 0.000 description 2
- 241000194019 Streptococcus mutans Species 0.000 description 2
- 241000194021 Streptococcus suis Species 0.000 description 2
- 241001312524 Streptococcus viridans Species 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- 241000187432 Streptomyces coelicolor Species 0.000 description 2
- 241001312310 Streptomyces somaliensis Species 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 241001523006 Talaromyces marneffei Species 0.000 description 2
- 108090000190 Thrombin Proteins 0.000 description 2
- 241000893963 Trichophyton concentricum Species 0.000 description 2
- 241001480050 Trichophyton violaceum Species 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- 239000006035 Tryptophane Substances 0.000 description 2
- 208000025865 Ulcer Diseases 0.000 description 2
- 229930003756 Vitamin B7 Natural products 0.000 description 2
- 241000607447 Yersinia enterocolitica Species 0.000 description 2
- 241000607477 Yersinia pseudotuberculosis Species 0.000 description 2
- 241000222126 [Candida] glabrata Species 0.000 description 2
- 229940124832 acellular pertussis vaccine Drugs 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 230000001154 acute effect Effects 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 230000000996 additive effect Effects 0.000 description 2
- 238000012382 advanced drug delivery Methods 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 230000008485 antagonism Effects 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 229940091771 aspergillus fumigatus Drugs 0.000 description 2
- 229940065181 bacillus anthracis Drugs 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 210000004900 c-terminal fragment Anatomy 0.000 description 2
- 229940095731 candida albicans Drugs 0.000 description 2
- 208000032343 candida glabrata infection Diseases 0.000 description 2
- 229940055022 candida parapsilosis Drugs 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000007621 cluster analysis Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 239000007822 coupling agent Substances 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 2
- 239000002254 cytotoxic agent Substances 0.000 description 2
- 229940127089 cytotoxic agent Drugs 0.000 description 2
- 231100000599 cytotoxic agent Toxicity 0.000 description 2
- 208000025729 dengue disease Diseases 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 238000007598 dipping method Methods 0.000 description 2
- 239000006196 drop Substances 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 206010014599 encephalitis Diseases 0.000 description 2
- 210000002615 epidermis Anatomy 0.000 description 2
- 239000000945 filler Substances 0.000 description 2
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 2
- 230000000799 fusogenic effect Effects 0.000 description 2
- 229940044627 gamma-interferon Drugs 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000003209 gene knockout Methods 0.000 description 2
- 229960003180 glutathione Drugs 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 229940029575 guanosine Drugs 0.000 description 2
- 210000003128 head Anatomy 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 239000000017 hydrogel Substances 0.000 description 2
- FHBSGPWHCCIQPG-UHFFFAOYSA-N hydroxy-methyl-oxo-sulfanylidene-$l^{6}-sulfane Chemical class CS(S)(=O)=O FHBSGPWHCCIQPG-UHFFFAOYSA-N 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 229910017053 inorganic salt Inorganic materials 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- HWYHZTIRURJOHG-UHFFFAOYSA-N luminol Chemical compound O=C1NNC(=O)C2=C1C(N)=CC=C2 HWYHZTIRURJOHG-UHFFFAOYSA-N 0.000 description 2
- 238000002595 magnetic resonance imaging Methods 0.000 description 2
- 201000004792 malaria Diseases 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 230000010534 mechanism of action Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 210000001806 memory b lymphocyte Anatomy 0.000 description 2
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Natural products C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 239000004005 microsphere Substances 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 239000011259 mixed solution Substances 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 229950004053 octoxinol Drugs 0.000 description 2
- 239000002674 ointment Substances 0.000 description 2
- 229920001542 oligosaccharide Polymers 0.000 description 2
- 150000002482 oligosaccharides Chemical class 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 229910052763 palladium Inorganic materials 0.000 description 2
- UNEIHNMKASENIG-UHFFFAOYSA-N para-chlorophenylpiperazine Chemical compound C1=CC(Cl)=CC=C1N1CCNCC1 UNEIHNMKASENIG-UHFFFAOYSA-N 0.000 description 2
- 230000003071 parasitic effect Effects 0.000 description 2
- 229940049954 penicillin Drugs 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 239000011574 phosphorus Substances 0.000 description 2
- 210000005134 plasmacytoid dendritic cell Anatomy 0.000 description 2
- 229920000747 poly(lactic acid) Polymers 0.000 description 2
- 108010094020 polyglycine Proteins 0.000 description 2
- 229920000232 polyglycine polymer Polymers 0.000 description 2
- 229920000136 polysorbate Polymers 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 238000007639 printing Methods 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 238000011321 prophylaxis Methods 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 239000012857 radioactive material Substances 0.000 description 2
- 229950010550 resiquimod Drugs 0.000 description 2
- BXNMTOQRYBFHNZ-UHFFFAOYSA-N resiquimod Chemical compound C1=CC=CC2=C(N(C(COCC)=N3)CC(C)(C)O)C3=C(N)N=C21 BXNMTOQRYBFHNZ-UHFFFAOYSA-N 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 201000009890 sinusitis Diseases 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000007921 spray Substances 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 150000003432 sterols Chemical class 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 208000006379 syphilis Diseases 0.000 description 2
- 229960004072 thrombin Drugs 0.000 description 2
- XETCRXVKJHBPMK-MJSODCSWSA-N trehalose 6,6'-dimycolate Chemical compound C([C@@H]1[C@H]([C@H](O)[C@@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](COC(=O)C(CCCCCCCCCCC3C(C3)CCCCCCCCCCCCCCCCCC)C(O)CCCCCCCCCCCCCCCCCCCCCCCCC)O2)O)O1)O)OC(=O)C(C(O)CCCCCCCCCCCCCCCCCCCCCCCCC)CCCCCCCCCCC1CC1CCCCCCCCCCCCCCCCCC XETCRXVKJHBPMK-MJSODCSWSA-N 0.000 description 2
- 229960004799 tryptophan Drugs 0.000 description 2
- 201000008827 tuberculosis Diseases 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 231100000397 ulcer Toxicity 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 230000001018 virulence Effects 0.000 description 2
- 239000011735 vitamin B7 Substances 0.000 description 2
- 235000011912 vitamin B7 Nutrition 0.000 description 2
- 229940098232 yersinia enterocolitica Drugs 0.000 description 2
- KIUKXJAPPMFGSW-DNGZLQJQSA-N (2S,3S,4S,5R,6R)-6-[(2S,3R,4R,5S,6R)-3-Acetamido-2-[(2S,3S,4R,5R,6R)-6-[(2R,3R,4R,5S,6R)-3-acetamido-2,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-2-carboxy-4,5-dihydroxyoxan-3-yl]oxy-5-hydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@H](O3)C(O)=O)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](C(O)=O)O1 KIUKXJAPPMFGSW-DNGZLQJQSA-N 0.000 description 1
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- MMRINLZOZVAPDZ-LSGRDSQZSA-N (6r,7r)-7-[[(2z)-2-(2-amino-1,3-thiazol-4-yl)-2-methoxyiminoacetyl]amino]-3-[(1-methylpyrrolidin-1-ium-1-yl)methyl]-8-oxo-5-thia-1-azabicyclo[4.2.0]oct-2-ene-2-carboxylic acid;chloride Chemical compound Cl.S([C@@H]1[C@@H](C(N1C=1C([O-])=O)=O)NC(=O)\C(=N/OC)C=2N=C(N)SC=2)CC=1C[N+]1(C)CCCC1 MMRINLZOZVAPDZ-LSGRDSQZSA-N 0.000 description 1
- HLHSUNWAPXINQU-GQCTYLIASA-N (E)-3-(3,4-dihydroxyphenyl)-N-prop-2-ynylprop-2-enamide Chemical compound OC=1C=C(C=CC=1O)/C=C/C(=O)NCC#C HLHSUNWAPXINQU-GQCTYLIASA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- IHDBZCJYSHDCKF-UHFFFAOYSA-N 4,6-dichlorotriazine Chemical compound ClC1=CC(Cl)=NN=N1 IHDBZCJYSHDCKF-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 101710166488 6 kDa early secretory antigenic target Proteins 0.000 description 1
- CJIJXIFQYOPWTF-UHFFFAOYSA-N 7-hydroxycoumarin Natural products O1C(=O)C=CC2=CC(O)=CC=C21 CJIJXIFQYOPWTF-UHFFFAOYSA-N 0.000 description 1
- 241000235389 Absidia Species 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 101100230376 Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) celI gene Proteins 0.000 description 1
- 241000589220 Acetobacter Species 0.000 description 1
- 241001468161 Acetobacterium Species 0.000 description 1
- 108010022752 Acetylcholinesterase Proteins 0.000 description 1
- 102100033639 Acetylcholinesterase Human genes 0.000 description 1
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 1
- 241000203716 Actinomycetaceae Species 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 241000700587 Alphaherpesvirinae Species 0.000 description 1
- 206010059313 Anogenital warts Diseases 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241000228218 Aspergillus amstelodami Species 0.000 description 1
- 241000228197 Aspergillus flavus Species 0.000 description 1
- 241000132177 Aspergillus glaucus Species 0.000 description 1
- 241001277988 Aspergillus sydowii Species 0.000 description 1
- 241001465318 Aspergillus terreus Species 0.000 description 1
- MQVRGDZCYDEQML-UHFFFAOYSA-N Astragalin Natural products C1=CC(OC)=CC=C1C1=C(OC2C(C(O)C(O)C(CO)O2)O)C(=O)C2=C(O)C=C(O)C=C2O1 MQVRGDZCYDEQML-UHFFFAOYSA-N 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000235579 Basidiobolus Species 0.000 description 1
- VKJGBAJNNALVAV-UHFFFAOYSA-M Berberine chloride (TN) Chemical compound [Cl-].C1=C2CC[N+]3=CC4=C(OC)C(OC)=CC=C4C=C3C2=CC2=C1OCO2 VKJGBAJNNALVAV-UHFFFAOYSA-M 0.000 description 1
- 241000335423 Blastomyces Species 0.000 description 1
- 241000228405 Blastomyces dermatitidis Species 0.000 description 1
- 241001118702 Border disease virus Species 0.000 description 1
- 241000589968 Borrelia Species 0.000 description 1
- 241000589969 Borreliella burgdorferi Species 0.000 description 1
- 101100453077 Botryococcus braunii HDR gene Proteins 0.000 description 1
- 241000710780 Bovine viral diarrhea virus 1 Species 0.000 description 1
- QOFRNSMLZCPQKL-KTIIIUPTSA-N CCN(CC)CC.CCCCCCCCCCCCCC(=O)O[C@H](CCCCCCCCCCC)CC(=O)NCCO[C@@H]1O[C@H](CO)[C@@H](OP(O)(O)=O)[C@H](OC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC)[C@H]1NC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC Chemical compound CCN(CC)CC.CCCCCCCCCCCCCC(=O)O[C@H](CCCCCCCCCCC)CC(=O)NCCO[C@@H]1O[C@H](CO)[C@@H](OP(O)(O)=O)[C@H](OC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC)[C@H]1NC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC QOFRNSMLZCPQKL-KTIIIUPTSA-N 0.000 description 1
- 101100039010 Caenorhabditis elegans dis-3 gene Proteins 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 1
- 102100035882 Catalase Human genes 0.000 description 1
- 108010053835 Catalase Proteins 0.000 description 1
- 240000001817 Cereus hexagonus Species 0.000 description 1
- 102100023321 Ceruloplasmin Human genes 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- 229920001661 Chitosan Polymers 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 108010065152 Coagulase Proteins 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 241000223205 Coccidioides immitis Species 0.000 description 1
- 208000000907 Condylomata Acuminata Diseases 0.000 description 1
- 241001480517 Conidiobolus Species 0.000 description 1
- 108010060123 Conjugate Vaccines Proteins 0.000 description 1
- FBUKMFOXMZRGRB-UHFFFAOYSA-N Coronaric acid Natural products CCCCCC=CCC1OC1CCCCCCCC(O)=O FBUKMFOXMZRGRB-UHFFFAOYSA-N 0.000 description 1
- 241000711573 Coronaviridae Species 0.000 description 1
- 241001644925 Corynebacterium efficiens Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 241000235555 Cunninghamella Species 0.000 description 1
- 241000223208 Curvularia Species 0.000 description 1
- DYDCUQKUCUHJBH-UWTATZPHSA-N D-Cycloserine Chemical compound N[C@@H]1CONC1=O DYDCUQKUCUHJBH-UWTATZPHSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- YVGGHNCTFXOJCH-UHFFFAOYSA-N DDT Chemical compound C1=CC(Cl)=CC=C1C(C(Cl)(Cl)Cl)C1=CC=C(Cl)C=C1 YVGGHNCTFXOJCH-UHFFFAOYSA-N 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 208000001840 Dandruff Diseases 0.000 description 1
- XPDXVDYUQZHFPV-UHFFFAOYSA-N Dansyl Chloride Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(Cl)(=O)=O XPDXVDYUQZHFPV-UHFFFAOYSA-N 0.000 description 1
- 206010011968 Decreased immune responsiveness Diseases 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 102100027723 Endogenous retrovirus group K member 6 Rec protein Human genes 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 241001480035 Epidermophyton Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000223682 Exophiala Species 0.000 description 1
- 101710082714 Exotoxin A Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 241000710781 Flaviviridae Species 0.000 description 1
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 1
- 241000122862 Fonsecaea Species 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- 241000605909 Fusobacterium Species 0.000 description 1
- 208000005577 Gastroenteritis Diseases 0.000 description 1
- 208000018522 Gastrointestinal disease Diseases 0.000 description 1
- 241000178292 Geotrichum clavatum Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 102000009127 Glutaminase Human genes 0.000 description 1
- 108010073324 Glutaminase Proteins 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- 108010015899 Glycopeptides Proteins 0.000 description 1
- 102000002068 Glycopeptides Human genes 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 206010018612 Gonorrhoea Diseases 0.000 description 1
- 101150029115 HOPX gene Proteins 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241000590002 Helicobacter pylori Species 0.000 description 1
- 241000207155 Heliobacterium Species 0.000 description 1
- 108010006464 Hemolysin Proteins Proteins 0.000 description 1
- 241000711549 Hepacivirus C Species 0.000 description 1
- 206010019799 Hepatitis viral Diseases 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 208000001688 Herpes Genitalis Diseases 0.000 description 1
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 1
- 241000701074 Human alphaherpesvirus 2 Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 241000701027 Human herpesvirus 6 Species 0.000 description 1
- 241000702617 Human parvovirus B19 Species 0.000 description 1
- 206010061217 Infestation Diseases 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102000006992 Interferon-alpha Human genes 0.000 description 1
- 108010047761 Interferon-alpha Proteins 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 1
- 241000710842 Japanese encephalitis virus Species 0.000 description 1
- 244000285963 Kluyveromyces fragilis Species 0.000 description 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 229920002884 Laureth 4 Polymers 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 208000004023 Legionellosis Diseases 0.000 description 1
- 208000035353 Legionnaires disease Diseases 0.000 description 1
- 108700005090 Lethal Genes Proteins 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 101710170970 Leukotoxin Proteins 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 241000186805 Listeria innocua Species 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 101710122154 MPT51 antigen Proteins 0.000 description 1
- 102000007651 Macrophage Colony-Stimulating Factor Human genes 0.000 description 1
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 241001539803 Magnusiomyces capitatus Species 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 241000555676 Malassezia Species 0.000 description 1
- 241000555688 Malassezia furfur Species 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 241000235048 Meyerozyma guilliermondii Species 0.000 description 1
- 241000187708 Micromonospora Species 0.000 description 1
- 241001260008 Microsporum equinum Species 0.000 description 1
- 241001092142 Molina Species 0.000 description 1
- 241001363490 Monilia Species 0.000 description 1
- WPNJAUFVNXKLIM-UHFFFAOYSA-N Moxonidine Chemical compound COC1=NC(C)=NC(Cl)=C1NC1=NCCN1 WPNJAUFVNXKLIM-UHFFFAOYSA-N 0.000 description 1
- 241000711386 Mumps virus Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 125000003047 N-acetyl group Chemical group 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 241000264375 Nannizzia nana Species 0.000 description 1
- 241000158526 Nasalis Species 0.000 description 1
- 241001644525 Nastus productus Species 0.000 description 1
- 241000231286 Neottia Species 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 208000035823 Non-specific autoimmune cerebellar ataxia without characteristic antibodies Diseases 0.000 description 1
- 101710138767 Non-structural glycoprotein 4 Proteins 0.000 description 1
- 241000714209 Norwalk virus Species 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 241001195348 Nusa Species 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- NUAQIRUAZSJTAI-YRPZDAAMSA-N O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=C2C=C3C=C1OC1=CC=C(C=C1Cl)[C@@H](O)[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@H]3C(=O)N[C@H]1C(=O)N[C@H](C(N[C@@H](C3=CC(O)=CC(O)=C3C=3C(O)=CC=C1C=3)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)O2)=O)NC(=O)[C@H](N)CC(C)C)[C@H]1C[C@](C)(N)[C@H](O)[C@H](C)O1 Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=C2C=C3C=C1OC1=CC=C(C=C1Cl)[C@@H](O)[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@H]3C(=O)N[C@H]1C(=O)N[C@H](C(N[C@@H](C3=CC(O)=CC(O)=C3C=3C(O)=CC=C1C=3)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)O2)=O)NC(=O)[C@H](N)CC(C)C)[C@H]1C[C@](C)(N)[C@H](O)[C@H](C)O1 NUAQIRUAZSJTAI-YRPZDAAMSA-N 0.000 description 1
- 108010079246 OMPA outer membrane proteins Proteins 0.000 description 1
- 206010030113 Oedema Diseases 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 206010033078 Otitis media Diseases 0.000 description 1
- 101150026476 PAO1 gene Proteins 0.000 description 1
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 1
- 241000594335 Pavetta Species 0.000 description 1
- 241000228143 Penicillium Species 0.000 description 1
- 241000228150 Penicillium chrysogenum Species 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 241000710778 Pestivirus Species 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- 101710099976 Photosystem I P700 chlorophyll a apoprotein A1 Proteins 0.000 description 1
- 108010004729 Phycoerythrin Proteins 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 241000305299 Pithomyces Species 0.000 description 1
- 206010035148 Plague Diseases 0.000 description 1
- 101710183389 Pneumolysin Proteins 0.000 description 1
- 206010035718 Pneumonia legionella Diseases 0.000 description 1
- 229920002730 Poly(butyl cyanoacrylate) Polymers 0.000 description 1
- 229920000954 Polyglycolide Polymers 0.000 description 1
- 229920001214 Polysorbate 60 Polymers 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- 108010013381 Porins Proteins 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000186429 Propionibacterium Species 0.000 description 1
- 102000006010 Protein Disulfide-Isomerase Human genes 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 108010075142 Protollin Proteins 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 240000001987 Pyrus communis Species 0.000 description 1
- 229940124875 RabAvert Drugs 0.000 description 1
- 241000711798 Rabies lyssavirus Species 0.000 description 1
- KGZHFKDNSAEOJX-WIFQYKSHSA-N Ramoplanin Chemical compound C([C@H]1C(=O)N[C@H](CCCN)C(=O)N[C@H](C(=O)N[C@@H](C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C)C(=O)N[C@H](C(=O)O[C@@H]([C@@H](C(N[C@@H](C(=O)N[C@H](CCCN)C(=O)N[C@@H](C(=O)N[C@H](C(=O)N[C@@H](C(=O)N[C@H](C(=O)N1)[C@H](C)O)C=1C=CC(O)=CC=1)C=1C=CC(O)=CC=1)[C@@H](C)O)C=1C=CC(O)=CC=1)=O)NC(=O)[C@H](CC(N)=O)NC(=O)\C=C/C=C/CC(C)C)C(N)=O)C=1C=C(Cl)C(O)=CC=1)C=1C=CC(O)=CC=1)[C@@H](C)O)C=1C=CC(O[C@@H]2[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O[C@@H]2[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)=CC=1)C1=CC=CC=C1 KGZHFKDNSAEOJX-WIFQYKSHSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000017442 Retinal disease Diseases 0.000 description 1
- 241000235527 Rhizopus Species 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 241000710799 Rubella virus Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241000293026 Saksenaea Species 0.000 description 1
- 241000293025 Saksenaea vasiformis Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 240000003946 Saponaria officinalis Species 0.000 description 1
- 241000852049 Scedosporium apiospermum Species 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- 206010040047 Sepsis Diseases 0.000 description 1
- 102100031056 Serine protease 57 Human genes 0.000 description 1
- 101710197596 Serine protease 57 Proteins 0.000 description 1
- 101710084578 Short neurotoxin 1 Proteins 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 241000710960 Sindbis virus Species 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 244000261559 Smilax china Species 0.000 description 1
- 235000000485 Smilax china Nutrition 0.000 description 1
- 240000009022 Smilax rotundifolia Species 0.000 description 1
- 235000003205 Smilax rotundifolia Nutrition 0.000 description 1
- 239000004147 Sorbitan trioleate Substances 0.000 description 1
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 241000605008 Spirillum Species 0.000 description 1
- 241000204388 Sporomusa Species 0.000 description 1
- 241001134656 Staphylococcus lugdunensis Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101100269620 Streptococcus pneumoniae (strain ATCC BAA-255 / R6) aliB gene Proteins 0.000 description 1
- 101100289918 Streptococcus pneumoniae (strain ATCC BAA-255 / R6) lytB gene Proteins 0.000 description 1
- 101100239936 Streptococcus pneumoniae (strain ATCC BAA-255 / R6) nanA gene Proteins 0.000 description 1
- 101100137826 Streptococcus pneumoniae (strain ATCC BAA-255 / R6) prsA gene Proteins 0.000 description 1
- 241001607520 Streptococcus pneumoniae 23F Species 0.000 description 1
- 241000058959 Streptococcus pneumoniae INV200 Species 0.000 description 1
- 241000058954 Streptococcus pneumoniae OXC141 Species 0.000 description 1
- 241001468227 Streptomyces avermitilis Species 0.000 description 1
- 241001655322 Streptomycetales Species 0.000 description 1
- 101800001271 Surface protein Proteins 0.000 description 1
- 230000005867 T cell response Effects 0.000 description 1
- 108010053950 Teicoplanin Proteins 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- 210000004241 Th2 cell Anatomy 0.000 description 1
- 102100036407 Thioredoxin Human genes 0.000 description 1
- 241001664469 Tibicina haematodes Species 0.000 description 1
- 241000710771 Tick-borne encephalitis virus Species 0.000 description 1
- 241000710924 Togaviridae Species 0.000 description 1
- 241000710914 Totivirus Species 0.000 description 1
- 101710182532 Toxin a Proteins 0.000 description 1
- 108010031133 Transferrin-Binding Protein A Proteins 0.000 description 1
- 108010031127 Transferrin-Binding Protein B Proteins 0.000 description 1
- 101800001690 Transmembrane protein gp41 Proteins 0.000 description 1
- 241000589884 Treponema pallidum Species 0.000 description 1
- 241001609979 Trichophyton quinckeanum Species 0.000 description 1
- 241000223229 Trichophyton rubrum Species 0.000 description 1
- 241001480048 Trichophyton tonsurans Species 0.000 description 1
- 206010067409 Trichophytosis Diseases 0.000 description 1
- 241000223230 Trichosporon Species 0.000 description 1
- 241000223231 Trichosporon beigelii Species 0.000 description 1
- 108050007852 Tumour necrosis factor Proteins 0.000 description 1
- 102000018594 Tumour necrosis factor Human genes 0.000 description 1
- 208000037386 Typhoid Diseases 0.000 description 1
- 229920013701 VORANOL™ Polymers 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 241000710886 West Nile virus Species 0.000 description 1
- 206010052428 Wound Diseases 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 241000710772 Yellow fever virus Species 0.000 description 1
- 241000606834 [Haemophilus] ducreyi Species 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000010306 acid treatment Methods 0.000 description 1
- 239000011149 active material Substances 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 230000010062 adhesion mechanism Effects 0.000 description 1
- 230000000240 adjuvant effect Effects 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 150000001280 alpha hydroxy acids Chemical class 0.000 description 1
- 239000004411 aluminium Substances 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 229910000323 aluminium silicate Inorganic materials 0.000 description 1
- 210000002821 alveolar epithelial cell Anatomy 0.000 description 1
- 125000003368 amide group Chemical group 0.000 description 1
- 238000005576 amination reaction Methods 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- RXDALBZNGVATNY-CWLIKTDRSA-N ampicillin trihydrate Chemical compound O.O.O.C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 RXDALBZNGVATNY-CWLIKTDRSA-N 0.000 description 1
- 230000033115 angiogenesis Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 208000025009 anogenital human papillomavirus infection Diseases 0.000 description 1
- 201000004201 anogenital venereal wart Diseases 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000001857 anti-mycotic effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 230000027645 antigenic variation Effects 0.000 description 1
- 239000002543 antimycotic Substances 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 101150107204 asd gene Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 230000000680 avirulence Effects 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 229940009291 bifidobacterium longum Drugs 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- SNCZNSNPXMPCGN-UHFFFAOYSA-N butanediamide Chemical compound NC(=O)CCC(N)=O SNCZNSNPXMPCGN-UHFFFAOYSA-N 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 159000000007 calcium salts Chemical class 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 1
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 1
- 235000021466 carotenoid Nutrition 0.000 description 1
- 150000001747 carotenoids Chemical class 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 229960002100 cefepime Drugs 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000008614 cellular interaction Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229940047526 cephalexin monohydrate Drugs 0.000 description 1
- DDTDNCYHLGRFBM-YZEKDTGTSA-N chembl2367892 Chemical compound CC(=O)N[C@H]1[C@@H](O)[C@H](O)[C@H](CO)O[C@H]1O[C@@H]([C@H]1C(N[C@@H](C2=CC(O)=CC(O[C@@H]3[C@H]([C@H](O)[C@H](O)[C@@H](CO)O3)O)=C2C=2C(O)=CC=C(C=2)[C@@H](NC(=O)[C@@H]2NC(=O)[C@@H]3C=4C=C(O)C=C(C=4)OC=4C(O)=CC=C(C=4)[C@@H](N)C(=O)N[C@H](CC=4C=C(Cl)C(O5)=CC=4)C(=O)N3)C(=O)N1)C(O)=O)=O)C(C=C1Cl)=CC=C1OC1=C(O[C@H]3[C@H]([C@@H](O)[C@H](O)[C@H](CO)O3)NC(C)=O)C5=CC2=C1 DDTDNCYHLGRFBM-YZEKDTGTSA-N 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 229960001231 choline Drugs 0.000 description 1
- OEYIOHPDSNJKLS-UHFFFAOYSA-N choline Chemical compound C[N+](C)(C)CCO OEYIOHPDSNJKLS-UHFFFAOYSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000012504 chromatography matrix Substances 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 210000004081 cilia Anatomy 0.000 description 1
- 239000004927 clay Substances 0.000 description 1
- 229960002227 clindamycin Drugs 0.000 description 1
- KDLRVYVGXIQJDK-AWPVFWJPSA-N clindamycin Chemical compound CN1C[C@H](CCC)C[C@H]1C(=O)N[C@H]([C@H](C)Cl)[C@@H]1[C@H](O)[C@H](O)[C@@H](O)[C@@H](SC)O1 KDLRVYVGXIQJDK-AWPVFWJPSA-N 0.000 description 1
- 238000005354 coacervation Methods 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000006854 communication Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 208000035196 congenital hypomyelinating 2 neuropathy Diseases 0.000 description 1
- 229940031670 conjugate vaccine Drugs 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001808 coupling effect Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000002242 deionisation method Methods 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001066 destructive effect Effects 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000001085 differential centrifugation Methods 0.000 description 1
- 230000009699 differential effect Effects 0.000 description 1
- 208000010643 digestive system disease Diseases 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- HNPSIPDUKPIQMN-UHFFFAOYSA-N dioxosilane;oxo(oxoalumanyloxy)alumane Chemical compound O=[Si]=O.O=[Al]O[Al]=O HNPSIPDUKPIQMN-UHFFFAOYSA-N 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- AFOSIXZFDONLBT-UHFFFAOYSA-N divinyl sulfone Chemical compound C=CS(=O)(=O)C=C AFOSIXZFDONLBT-UHFFFAOYSA-N 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 238000004043 dyeing Methods 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000000369 enteropathogenic effect Effects 0.000 description 1
- 230000000688 enterotoxigenic effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 230000032050 esterification Effects 0.000 description 1
- 238000005886 esterification reaction Methods 0.000 description 1
- UJTPZISIAWDGFF-UHFFFAOYSA-N ethenylsulfonylbenzene Chemical compound C=CS(=O)(=O)C1=CC=CC=C1 UJTPZISIAWDGFF-UHFFFAOYSA-N 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 239000002095 exotoxin Substances 0.000 description 1
- 231100000776 exotoxin Toxicity 0.000 description 1
- 239000003889 eye drop Substances 0.000 description 1
- 229940012356 eye drops Drugs 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000011194 food seasoning agent Nutrition 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 1
- 238000011207 functional examination Methods 0.000 description 1
- 244000053095 fungal pathogen Species 0.000 description 1
- 208000018685 gastrointestinal system disease Diseases 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 201000004946 genital herpes Diseases 0.000 description 1
- 229930182478 glucoside Natural products 0.000 description 1
- 229960002989 glutamic acid Drugs 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000011187 glycerol Nutrition 0.000 description 1
- 125000003147 glycosyl group Chemical group 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 239000010440 gypsum Substances 0.000 description 1
- 229910052602 gypsum Inorganic materials 0.000 description 1
- 229940047650 haemophilus influenzae Drugs 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 229940037467 helicobacter pylori Drugs 0.000 description 1
- 239000003228 hemolysin Substances 0.000 description 1
- 108010037896 heparin-binding hemagglutinin Proteins 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 229930186900 holotoxin Natural products 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 229920002674 hyaluronan Polymers 0.000 description 1
- 229960003160 hyaluronic acid Drugs 0.000 description 1
- 230000000642 iatrogenic effect Effects 0.000 description 1
- 238000010191 image analysis Methods 0.000 description 1
- 239000012216 imaging agent Substances 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 201000001371 inclusion conjunctivitis Diseases 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 208000037799 influenza C Diseases 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000008611 intercellular interaction Effects 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 230000009878 intermolecular interaction Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- JPUKWEQWGBDDQB-QSOFNFLRSA-N kaempferol 3-O-beta-D-glucoside Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=C(C=2C=CC(O)=CC=2)OC2=CC(O)=CC(O)=C2C1=O JPUKWEQWGBDDQB-QSOFNFLRSA-N 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 229940062711 laureth-9 Drugs 0.000 description 1
- 229940115932 legionella pneumophila Drugs 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 230000021633 leukocyte mediated immunity Effects 0.000 description 1
- 125000005647 linker group Chemical group 0.000 description 1
- ULXTYUPMJXVUHQ-OVTFQNCVSA-N lipid II Chemical compound OC(=O)[C@@H](C)NC(=O)[C@@H](C)NC(=O)[C@H](CCCCN)NC(=O)CC[C@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@@H]1[C@@H](NC(C)=O)[C@@H](OP(O)(=O)OP(O)(=O)OC\C=C(\C)CC\C=C(\C)CC\C=C(\C)CC\C=C(\C)CC\C=C(\C)CC\C=C(\C)CC\C=C(\C)CC\C=C(\C)CC\C=C(/C)CC\C=C(/C)CCC=C(C)C)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 ULXTYUPMJXVUHQ-OVTFQNCVSA-N 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 208000001581 lymphogranuloma venereum Diseases 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 230000008774 maternal effect Effects 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- GMASOWLHOOQBOY-UHFFFAOYSA-M mercury(1+);benzoate;hydrate Chemical compound O.[Hg+].[O-]C(=O)C1=CC=CC=C1 GMASOWLHOOQBOY-UHFFFAOYSA-M 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229910044991 metal oxide Inorganic materials 0.000 description 1
- 150000004706 metal oxides Chemical class 0.000 description 1
- 229940050176 methyl chloride Drugs 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 238000002625 monoclonal antibody therapy Methods 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 230000001459 mortal effect Effects 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 210000004877 mucosa Anatomy 0.000 description 1
- BSOQXXWZTUDTEL-ZUYCGGNHSA-N muramyl dipeptide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)O[C@@H](O)[C@@H]1NC(C)=O BSOQXXWZTUDTEL-ZUYCGGNHSA-N 0.000 description 1
- YGZIWEZFFBPCLN-UHFFFAOYSA-N n,3-bis(2-chloroethyl)-4-hydroperoxy-2-oxo-1,3,2$l^{5}-oxazaphosphinan-2-amine Chemical compound OOC1CCOP(=O)(NCCCl)N1CCCl YGZIWEZFFBPCLN-UHFFFAOYSA-N 0.000 description 1
- JXTPJDDICSTXJX-UHFFFAOYSA-N n-Triacontane Natural products CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC JXTPJDDICSTXJX-UHFFFAOYSA-N 0.000 description 1
- OHDXDNUPVVYWOV-UHFFFAOYSA-N n-methyl-1-(2-naphthalen-1-ylsulfanylphenyl)methanamine Chemical compound CNCC1=CC=CC=C1SC1=CC=CC2=CC=CC=C12 OHDXDNUPVVYWOV-UHFFFAOYSA-N 0.000 description 1
- 210000003928 nasal cavity Anatomy 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 229910000480 nickel oxide Inorganic materials 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 108700014375 norvancomycin Proteins 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 229940126578 oral vaccine Drugs 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- TWNQGVIAIRXVLR-UHFFFAOYSA-N oxo(oxoalumanyloxy)alumane Chemical compound O=[Al]O[Al]=O TWNQGVIAIRXVLR-UHFFFAOYSA-N 0.000 description 1
- GNRSAWUEBMWBQH-UHFFFAOYSA-N oxonickel Chemical compound [Ni]=O GNRSAWUEBMWBQH-UHFFFAOYSA-N 0.000 description 1
- 239000006174 pH buffer Substances 0.000 description 1
- 230000020477 pH reduction Effects 0.000 description 1
- 230000007918 pathogenicity Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- MPNNOLHYOHFJKL-UHFFFAOYSA-N peroxyphosphoric acid Chemical compound OOP(O)(O)=O MPNNOLHYOHFJKL-UHFFFAOYSA-N 0.000 description 1
- 108010021711 pertactin Proteins 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 210000001539 phagocyte Anatomy 0.000 description 1
- 210000003800 pharynx Anatomy 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-N phosphoric acid Substances OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 1
- 235000011007 phosphoric acid Nutrition 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- ONJQDTZCDSESIW-UHFFFAOYSA-N polidocanol Chemical compound CCCCCCCCCCCCOCCOCCOCCOCCOCCOCCOCCOCCOCCO ONJQDTZCDSESIW-UHFFFAOYSA-N 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920000771 poly (alkylcyanoacrylate) Polymers 0.000 description 1
- 229920000724 poly(L-arginine) polymer Polymers 0.000 description 1
- 229920001606 poly(lactic acid-co-glycolic acid) Polymers 0.000 description 1
- 229920002627 poly(phosphazenes) Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 108010011110 polyarginine Proteins 0.000 description 1
- 229920002721 polycyanoacrylate Polymers 0.000 description 1
- 239000004633 polyglycolic acid Substances 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 102000007739 porin activity proteins Human genes 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000003498 protein array Methods 0.000 description 1
- 108020003519 protein disulfide isomerase Proteins 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 230000003161 proteinsynthetic effect Effects 0.000 description 1
- 208000008666 pythiosis Diseases 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 239000002516 radical scavenger Substances 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 229950003551 ramoplanin Drugs 0.000 description 1
- 108010076689 ramoplanin Proteins 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 210000000664 rectum Anatomy 0.000 description 1
- 238000006722 reduction reaction Methods 0.000 description 1
- 230000002787 reinforcement Effects 0.000 description 1
- 125000006853 reporter group Chemical group 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000009666 routine test Methods 0.000 description 1
- 229960002181 saccharomyces boulardii Drugs 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 238000003345 scintillation counting Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000000565 sealant Substances 0.000 description 1
- 102000012498 secondary active transmembrane transporter activity proteins Human genes 0.000 description 1
- 108040003878 secondary active transmembrane transporter activity proteins Proteins 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 208000013223 septicemia Diseases 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 229940099992 seromycin Drugs 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 229960001866 silicon dioxide Drugs 0.000 description 1
- 235000012239 silicon dioxide Nutrition 0.000 description 1
- 208000017520 skin disease Diseases 0.000 description 1
- 235000019337 sorbitan trioleate Nutrition 0.000 description 1
- 229960000391 sorbitan trioleate Drugs 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 229940032094 squalane Drugs 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 229940031626 subunit vaccine Drugs 0.000 description 1
- 238000000856 sucrose gradient centrifugation Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000000829 suppository Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 229960001608 teicoplanin Drugs 0.000 description 1
- 238000004809 thin layer chromatography Methods 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 239000012049 topical pharmaceutical composition Substances 0.000 description 1
- 206010044325 trachoma Diseases 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- STCOOQWBFONSKY-UHFFFAOYSA-N tributyl phosphate Chemical compound CCCCOP(=O)(OCCCC)OCCCC STCOOQWBFONSKY-UHFFFAOYSA-N 0.000 description 1
- 150000004684 trihydrates Chemical class 0.000 description 1
- 238000005829 trimerization reaction Methods 0.000 description 1
- DZWJBKUVHJSHAR-UHFFFAOYSA-M trimethyl(2-methylsulfonylsulfanylethyl)azanium;bromide Chemical compound [Br-].C[N+](C)(C)CCSS(C)(=O)=O DZWJBKUVHJSHAR-UHFFFAOYSA-M 0.000 description 1
- 150000004043 trisaccharides Chemical group 0.000 description 1
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 description 1
- 201000008297 typhoid fever Diseases 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- ORHBXUUXSCNDEV-UHFFFAOYSA-N umbelliferone Chemical compound C1=CC(=O)OC2=CC(O)=CC=C21 ORHBXUUXSCNDEV-UHFFFAOYSA-N 0.000 description 1
- HFTAFOQKODTIJY-UHFFFAOYSA-N umbelliferone Natural products Cc1cc2C=CC(=O)Oc2cc1OCC=CC(C)(C)O HFTAFOQKODTIJY-UHFFFAOYSA-N 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 208000019206 urinary tract infection Diseases 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 210000001215 vagina Anatomy 0.000 description 1
- 229910052720 vanadium Inorganic materials 0.000 description 1
- 201000001862 viral hepatitis Diseases 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000000304 virulence factor Substances 0.000 description 1
- 230000007923 virulence factor Effects 0.000 description 1
- 230000004304 visual acuity Effects 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 229940051021 yellow-fever virus Drugs 0.000 description 1
- 229940118695 yersinia pestis Drugs 0.000 description 1
- 229910052727 yttrium Inorganic materials 0.000 description 1
- PAPBSGBWRJIAAV-UHFFFAOYSA-N ε-Caprolactone Chemical compound O=C1CCCCCO1 PAPBSGBWRJIAAV-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/315—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci
- C07K14/3156—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci from Streptococcus pneumoniae (Pneumococcus)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
- A61P37/04—Immunostimulants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/52—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from bacteria or Archaea
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56911—Bacteria
- G01N33/56944—Streptococcus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/505—Medicinal preparations containing antigens or antibodies comprising antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/195—Assays involving biological materials from specific organisms or of a specific nature from bacteria
- G01N2333/315—Assays involving biological materials from specific organisms or of a specific nature from bacteria from Streptococcus (G), e.g. Enterococci
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2469/00—Immunoassays for the detection of microorganisms
- G01N2469/10—Detection of antigens from microorganism in sample from host
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Immunology (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Urology & Nephrology (AREA)
- Microbiology (AREA)
- Hematology (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Pulmonology (AREA)
- Pharmacology & Pharmacy (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Food Science & Technology (AREA)
- Cell Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Physics & Mathematics (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Virology (AREA)
- Tropical Medicine & Parasitology (AREA)
- General Engineering & Computer Science (AREA)
- Communicable Diseases (AREA)
Abstract
描述了来自肺炎链球菌的多肽。在一些方面,所述多肽包括在肺炎链球菌分离物INV104中鉴定的来自第二菌毛岛(菌毛II岛(INV104B))的菌毛多肽。在另一些方面,所述多肽包括肺炎链球菌分离物INV104中没有的来自肺炎链球菌菌株23F、INV200和OXC141的菌毛多肽和非菌毛多肽。免疫原性组合物中可使用这些多肽及其片段和变体,以便预防性或治疗性免疫对抗肺炎链球菌。也可以在用于产生抗体的组合物和免疫刺激物中使用这些多肽。还提供了抑制肺炎链球菌的方法、治疗肺炎链球菌感染的方法、鉴定肺炎链球菌抑制剂的方法以及诊断/检测肺炎链球菌感染的方法。
Description
技术领域
本发明涉及多肽,包括来自肺炎链球菌(Streptococcus pneumoniae,S.pneumoniae)的菌毛蛋白质,包括其片段和变体,及其在肺炎链球菌感染的治疗和免疫接种中的使用方法。
背景技术
革兰氏阳性菌肺炎链球菌(也称为Spn或肺炎球菌)是全世界范围发病和死亡的主要原因,与HIV、疟疾和结核病一起并称为四大主要的感染性疾病杀手(Bruyn,G.A.W.和van Furth,R.(1991)Eur.J.Clin.Microbiol.Infect.Dis.10,897-910;Ryan,M.W.和Antonelli,P.J.(2000)Laryngoscope 110,961-964;Cutts,F.T.,Zaman,S.M.,Enwere,G.,Jaffar,S.,Levine,O.S.,Okoko,C.Oluwalana,A.,Vaughan,S.,Obaro,A.,Leach,A.等,(2005)Lancet 365,1139-1 146;Swiatlo,E.,Champlin,F.R.,Holman,S.C.,Wilson,W.W.&Watt,J.M.(2002)Infect.Immun.70,412-415;Sandgren,A.,Albiger,B.,Orihuela,C.,Tuomanen,E.,Normark,S.和Henriques-Normark,B.(2005)J.Infect.Dis.192,791-800)。肺炎链球菌不仅是导致呼吸道感染如中耳炎、鼻窦炎和社群获得性肺炎的主要原因,而且是侵入性疾病如败血症和脑膜炎中的重要病原体。虽然肺炎球菌是一种破坏性病原体,但它也大量无害地定居在日间护理中心的健康儿童中(HenriquesNormark,B.,Christensson,B.,Sandgren,A.,Noreen,B.,Sylvan,S.,Burman,L.G.和Olsson-Liljequist,B.(2003)Microb.Drug Resist.9,337-344;Nunes,S.,R.,J.,Alves,C.R.,Mato,R.,A.B.,Saldanha,J.,Almeida,J.S.,Sanches,I.S.和de Lencastre,H.(2005)J.Clin.Microbiol.43,1285-1293)。肺炎球菌疾病的重要毒力因素在于多糖荚膜,肺炎球菌通过多糖荚膜分成至少90种不同的血清型(Henrichsen,J.(1995)J.Clin.Microbiol.33,2759-2762)。已描述了其他对毒力具有重要作用的遗传因子,例如CbpA(胆碱-结合蛋白A)和肺炎链球菌溶血素(Lau,G.W.,Haataja,S.,Lonetto,M.,Kensit,S.E.,Marra,A.,Bryant,A.P.,McDevitt,D.,Morrison,D.A.和Holden,D.W.(2001)Mol.Microbiol.40,555-571;Rosenow,C.,Ryan,P.,Weiser,J.N.,Johnson,S.,Fontan,P.,Ortqvist,A.和Masure,H.R.(1997)Mol.Microbiol.25,819-829;Tuomanen,E.(1999)Current Opin.Biol.2,35-39)。
肺炎链球菌感染可通过初始在鼻咽部建群而引发的侵入性疾病,但粘附机制尚不清楚。有荚膜肺炎球菌的体外粘附比无荚膜的无毒力衍生物要低得多(Swiatlo,E.,Champlin,F.R.,Holman,S.C.,Wilson,W.W.&Watt,J.M.(2002)Infect.Immun.70,412-415),但荚膜表达是在上气道成功建群的必需因素。这些发现提示,肺炎球菌在体内即使存在厚荚膜也具有粘附性(Sandgren,A.,Albiger,B.,Orihuela,C.,Tuomanen,E.,Normark,S.和Henriques-Normark,B.(2005)J.Infect.Dis.192,791-800)。
在其他革兰氏阳性菌,例如白喉棒状杆菌(Corynebacteriumdiphtheriae)(Ton-That,H.,Marraffini,L.A.和Schneewind,O.(2004)Mol.Microbiol.53,251-261;Ton-That,H.和Schneewind,O.(2003)Mol.Microbiol.50,1429-1438),放线菌(Actinomyces,spp.)(Kelstrup,J.,Theilade,J.和Fejerskov,O.(1979)Scand.J.Dent.Res.87,415-423)和近来发现的A型链球菌(GAS)和B型链球菌(GBS)(Mora,M.,Bensi,G.,Capo,S.,Falugi,F.,Zingaretti,C.,Manetti,A.G.O.,Maggi,T.,Taddei,A.R.,Grandi,G.和Telford,J.L.(2005)Proc.Natl.Acad.Sci.USA 102,15641-15646;Lauer,P.,Rinaudo,C.D.,Soriani,M.,Margarit,I.,Mainone,D.,Rosini,R.,Taddei,A.R.,Mora,M.,Rappuoli,R.,Grandi,G.和Telford,J.L.(2005)Science 309,105)中,电镜鉴定到菌毛样表面结构并通过遗传以及生物化学方法进行表征(Ton-That,H.,Marraffini,L.A.和Schneewind,O.(2004)Mol.Microbiol.53,251-261;Ton-That,H.和Schneewind,O.(2003)Mol.Microbiol.50,1429-1438;Mora,M.,Bensi,G.,Capo,S.,Falugi,F.,Zingaretti,C.,Manetti,A.G.O.,Maggi,T.,Taddei,A.R.,Grandi,G.和Telford,J.L.(2005)Proc.Natl.Acad.Sci.USA 102,15641-15646;Lauer,P.,Rinaudo,C.D.,Soriani,M.,Margarit,I.,Mainone,D.,Rosini,R.,Taddei,A.R.,Mora,M.,Rappuoli,R.,Grandi,G.和Telford,J.L.(2005)Science 309,105)。在放线菌中,1型菌毛基因介导牙齿和粘膜表面粘附(Li,T.,Khah,M.K.,Slavnic,S.,Johansson,I.和N.(2001)Infect Immun.69,7224-7233)。然而,仍然需要关于致病性链球菌的菌毛和其他抗原在感染性疾病中的生理学作用和功能的关系数据。
革兰氏阳性菌毛是由转肽酶反应形成的延长的聚合物,所述转肽酶反应涉及由特定的分选酶(sortase)进行组装的含有特定氨基酸基序的亚基蛋白质的共价键交联。这种分选酶也与菌毛和肽聚糖细胞壁的共价粘附有关。
发明内容
本发明描述了来自肺炎链球菌的多肽。在一些方面,本文所述的多肽包括来自肺炎链球菌的菌毛肽。在其他方面,描述了来自肺炎链球菌的其他多肽。本文所述的肺炎链球菌多肽可用于针对肺炎链球菌感染的治疗和免疫接种的方法中。
在一些方面,本发明描述了来自肺炎链球菌INV104B第二菌毛岛(菌毛II岛(INV104B))的菌毛多肽。在其他方面,本发明描述了在肺炎链球菌23F、INV200和OXC141中鉴定的菌毛多肽。认为该菌毛在肺炎链球菌的致病过程中起作用。
在其他方面,本发明描述了由肺炎链球菌菌毛II岛(INV104B)编码的分离的菌毛。在一些实施方式中,菌毛包括分选酶。在一些实施方式中,菌毛包括LPXTG细胞壁锚定蛋白,如具有SEQ ID NO:2,4和/或6氨基酸序列的多肽,或其加工形式。
在一些实施方式中,通过酶消化(例如,用一种或多种酶,如肽聚糖水解酶(如,变溶菌素、溶葡萄球菌素和溶菌酶))从细胞分离菌毛。在一些实施方式中,通过机械剪切(例如,通过超声处理)从细胞分离菌毛。在一些实施方式中,菌毛基本不含细菌细胞。在一些实施方式中,本发明提供了产生菌毛(例如,肺炎链球菌菌毛)的方法,所述方法包括使产生菌毛的细胞经历酶消化或机械剪切以及从细胞分离菌毛。
在其他方面,本发明提供了免疫原性组合物,其包含一种或多种分离的菌毛(例如肺炎链球菌菌毛)。
在其他方面,本发明提供了一种分离的肺炎链球菌分选酶,所述分选酶是SEQ ID NO:282,SEQ ID NO:1386,SEQ ID NO:676或SEQ ID NO:1123中的一种。
在其他方面,本发明提供了分离的肺炎链球菌LPXTG细胞壁锚定蛋白,所述LPXTG细胞壁锚定蛋白是SEQ ID NO:2,SEQ ID NO:4,SEQ ID NO:6,SEQ IDNO:7,SEQ ID NO:8或SEQ ID NO:9中的一种。
在其他方面,本发明提供了分离肺炎链球菌菌毛II岛(INV104B)编码菌毛的方法,所述方法包括使产生肺炎链球菌菌毛II岛(INV104B)编码菌毛的细菌细胞经历酶消化(如,变溶菌素)或或机械剪切(如,超声处理),以及从细胞分离菌毛。在一些实施方式中,分离包括密度梯度离心。在一些实施方式中,分离包括使用例如凝胶过滤色谱,根据大小分离组分以降低多分散性。
在其他方面,本发明提供了特异性结合肺炎链球菌菌毛II岛(INV104B)编码的菌毛的抗体。在一些实施方式中,所述抗体是单克隆抗体、多克隆抗体、嵌合抗体、人抗体、人源化抗体、单链抗体或Fab片段。
在其他方面,本发明提供了一种免疫原性组合物,其包含低聚物形式的纯化的肺炎链球菌菌毛II岛(INV104B)多肽。在一些实施方式中,所述多肽是超寡聚体(hyperoligomer)。在其它实施方式中,所述多肽是肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白的片段。
在其他方面,本发明提供了诱导针对肺炎链球菌的免疫应答的方法。在一些实施方式中,所述方法包括给予对象有效量的肺炎链球菌菌毛II岛(INV104B)编码的菌毛。
在其他方面,本发明提供了检测对象是否感染肺炎链球菌的方法。在一些实施方式中,所述方法包括分析来自对象的样品中是否存在肺炎链球菌菌毛II岛(INV104B)编码菌毛的抗体。
在其他方面,本发明提供了检测对象是否感染肺炎链球菌的方法。在一些实施方式中,所述方法包括使样品与抗体相接触以及检测所述抗体与样品组分的结合。在一些实施方式中,所述抗体结合菌毛组分。在其它实施方式中,所述抗体结合菌毛复合物。
在其他方面,本发明提供了治疗肺炎链球菌感染对象的方法。在一些实施方式中,所述方法包括给予对象有效量的特异性结合肺炎链球菌菌毛II岛(INV104B)编码菌毛的试剂。在一些实施方式中,所述试剂是抗体。在一些实施方式中,所述抗体阻断肺炎链球菌附着于细胞。在一些实施方式中,所述抗体特异性结合肺炎链球菌菌毛II岛(INV104B)编码的一种或多种LPXTG细胞壁锚定蛋白。
在其他方面,本发明提供了确定肺炎链球菌感染对象的疗程的方法。在一些实施方式中,所述方法包括分析来自对象的样品中是否存在针对肺炎链球菌菌毛II岛(INV104B)编码菌毛的抗体,以及如果检测到存在这种抗体则给予对象抗炎药。在其它实施方式中,所述方法包括分析来自对象的样品中是否存在针对肺炎链球菌菌毛II岛(INV104B)编码菌毛的抗体,以及如果没有检测到这种抗体则给予对象抗生素。
在其他方面,本发明提供了分离的菌毛和菌毛样多聚体,其包含最多具有30个氨基酸取代、插入或缺失的肺炎链球菌菌毛II岛(INV104B)编码的菌毛蛋白的氨基酸序列。在一些实施方式中,该氨基酸序列最多具有20个氨基酸取代、插入或缺失。在其它实施方式中,该氨基酸序列最多具有10个氨基酸取代、插入或缺失。在其它实施方式中,该氨基酸序列最多具有5个氨基酸取代、插入或缺失。
在其他方面,本发明提供了具有肺炎链球菌菌毛II岛(INV104B)编码的一种或多种LPXTG细胞壁锚定蛋白的氨基酸序列的多肽。在其他方面,本发明提供了一种或多种肺炎链球菌菌毛II岛(INV104B)编码的一种或多种LPXTG细胞壁锚定蛋白的免疫原性片段。在其他方面,本发明提供了编码以下多肽的多核苷酸,所述多肽具有肺炎链球菌菌毛II岛(INV104B)编码的一种或多种LPXTG细胞壁锚定蛋白的氨基酸序列。
在其他方面,本发明提供了具有氨基酸序列SEQ ID NO:2、4、6、7、8和9的纯化的多肽。在其他方面,本发明提供了具有SEQ ID NO:2、4、6、7、8和9的10个连续残基的纯化的多肽。在其他方面,本发明提供了纯化的多肽,其具有与SEQ ID NO:2、4、6、7、8和9至少85%相同的氨基酸序列。
在其他方面,本发明提供了纯化的多肽,其与选自SEQ ID NO:29到SEQ IDNO:1742的序列或其免疫原性片段至少具有85%的序列相同性。在其他方面,本发明提供了纯化的多肽,其具有选自SEQ ID NO:29到SEQ ID NO:1742的氨基酸序列或其免疫原性片段。
在一些实施方式中,本发明提供了纯化的OCX141多肽,其与选自下组的序列具有至少85%的序列相同性:SEQ ID NO:53、SEQ ID NO:65、SEQ ID NO:70、SEQ ID NO:99、SEQ ID NO:104、SEQ ID NO:117、SEQ ID NO:135、SEQ ID NO:177、SEQ ID NO:178、SEQ ID NO:198、SEQ ID NO:235、SEQID NO:236、SEQ ID NO:237、SEQ ID NO:242、SEQ ID NO:247、SEQ ID NO:248、SEQ ID NO:250、SEQ ID NO:251、SEQ ID NO:252、SEQ ID NO:253、SEQ ID NO:433、SEQ ID NO:439、SEQ ID NO:444、SEQ ID NO:538、SEQID NO:539、SEQ ID NO:540、SEQ ID NO:541、SEQ ID NO:542、SEQ ID NO:543、SEQ ID NO:544、SEQ ID NO:545、SEQ ID NO:581或SEQ ID NO:593、或其免疫原性片段。
在其它实施方式中,本发明提供了纯化的INV200多肽,其与选自下组的序列具有至少85%的序列相同性:SEQ ID NO:626、SEQ ID NO:628、SEQ ID NO:629、SEQ ID NO:630、SEQ ID NO:631、SEQ ID NO:632、SEQ ID NO:639、SEQ ID NO:645、SEQ ID NO:747、SEQ ID NO:751、SEQ ID NO:752、SEQID NO:783、SEQ ID NO:786、SEQ ID NO:787、SEQ ID NO:810、SEQ IDNO:812、SEQ ID NO:813、SEQ ID NO:824、SEQ ID NO:831、SEQ ID NO:842、SEQ ID NO:847、SEQ ID NO:875、SEQ ID NO:876、SEQ ID NO:879、SEQ ID NO:880、SEQ ID NO:882、SEQ ID NO:913、SEQ ID NO:914、SEQID NO:925、SEQ ID NO:926、SEQ ID NO:947、SEQ ID NO:948、SEQ ID NO:968、SEQ ID NO:987、SEQ ID NO:988、SEQ ID NO:990、SEQ ID NO:992、SEQ ID NO:1003、SEQ ID NO:1007、SEQ ID NO:1008、SEQ ID NO:1036、SEQ ID NO:1082、SEQ ID NO:1120、或SEQ ID NO:1123、或其免疫原性片段。
在其他实施方式中,本发明提供了纯化的23F多肽,其与选自下组的序列具有至少85%的序列相同性:SEQ ID NO:1297、SEQ ID NO:1309、SEQ ID NO:1311、SEQ ID NO:1343、SEQ ID NO:1362、SEQ ID NO:1364、SEQ ID NO:1434、SEQ ID NO:1451、SEQ ID NO:1455、SEQ ID NO:1466、SEQ ID NO:14678、SEQ ID NO:1470、SEQ ID NO:1474、SEQ ID NO:1484、SEQ ID NO:1485、SEQ ID NO:1486、SEQ ID NO:1487、或SEQ ID NO:1491、或其免疫原性片段。
在其他方面,本发明提供了肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白的免疫原性片段。
在其他方面,本发明提供了具有多核苷酸序列SEQ ID NO:1、3和5的分离的核酸。在其他方面,本发明提供了分离的核酸,其在严谨条件下与杂交探针杂交,其中所述探针具有多核苷酸序列SEQ ID NO:1、3和5或SEQ ID NO:1、3和5的互补序列。在其他方面,本发明提供了分离的核酸,其具有编码与SEQ IDNO:2、4、6、7、8和9至少85%相同的氨基酸序列的序列。
在其他方面,本发明提供了分离的核酸,其具有编码与选自SEQ ID NO:29到SEQ ID NO:1742的序列至少85%相同的氨基酸序列的序列。在其他方面,本发明提供了分离的核酸,其具有编码选自SEQ ID NO:29到SEQ ID NO:1742的氨基酸序列的序列。
在其他方面,本发明提供了诱导针对肺炎链球菌的免疫应答的方法。在一些实施方式中,所述方法包括给予对象有效量的肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白的免疫原性片段。在一些实施方式中,所述对象是人。
在其他方面,本发明提供了针对细胞中肺炎链球菌菌毛II岛(INV104B)编码的菌毛蛋白的抗体。在一些实施方式中,所述方法包括在细胞中表达编码肺炎链球菌菌毛II岛(INV104B)编码菌毛蛋白的抗体的核酸。在一些实施方式中,所述菌毛蛋白是LPXTG细胞壁锚定蛋白。
在其他方面,本发明提供了从包含肺炎链球菌的样品纯化肺炎链球菌的方法。所述方法包括:提供抗体结合于固相载体的亲和基质;使所述样品与该亲和基质接触以形成亲和基质-肺炎链球菌复合物;使该亲和基质-肺炎链球菌复合物与样品其余部分分离;和由该亲和基质释放肺炎链球菌。
在其他方面,本发明提供了将细胞毒剂或诊断试剂递送至肺炎链球菌的方法。所述方法包括:提供偶联于抗体或其片段的细胞毒剂或诊断试剂;和使肺炎链球菌接触所述抗体-试剂或片段-试剂偶联物。
在其他方面,本发明提供了鉴定肺炎链球菌菌毛II岛(INV104B)编码的菌毛的结合调节剂的方法。所述方法包括:使肺炎链球菌菌毛易于结合的动物细胞与候选化合物和具有肺炎链球菌菌毛II岛(INV104B)编码的菌毛的细菌细胞相接触;以及测定细菌细胞与动物细胞的结合是否受到抑制。在一些实施方式中,结合活性的抑制预示该化合物是肺炎链球菌菌毛II岛(INV104B)编码的菌毛的结合抑制剂。
在其他方面,本发明提供了鉴定肺炎链球菌菌毛II岛(INV104B)编码的菌毛活性的结合调节剂的方法。所述方法包括:使肺炎链球菌菌毛易于结合的细胞与候选化合物和肺炎链球菌菌毛II岛(INV104B)编码的菌毛相接触;以及测定菌毛与细胞的结合是否受到抑制。在一些实施方式中,结合活性的抑制预示该化合物是肺炎链球菌菌毛II岛(INV104B)编码的菌毛的结合抑制剂。
在其他方面,本发明提供了鉴定肺炎链球菌菌毛II岛(INV104B)编码的菌毛的结合调节剂的方法。所述方法包括:使肺炎链球菌菌毛易于结合的细胞与候选化合物和肺炎链球菌菌毛II岛(INV104B)编码的菌毛蛋白或其细胞结合片段相接触;以及测定菌毛蛋白或其细胞结合片段与细胞的结合是否受到抑制。在一些实施方式中,结合活性的抑制预示该化合物是肺炎链球菌菌毛II岛(INV104B)编码的菌毛的结合抑制剂。
在其他方面,本发明提供了分离肺炎链球菌菌毛II岛(INV104B)编码的菌毛的方法。所述方法包括:使产生肺炎链球菌菌毛II岛(INV104B)编码的菌毛的肺炎链球菌细胞接受超声处理或分解酶消化;密度梯度离心分离非细胞组分;和分离肺炎链球菌菌毛II岛(INV104B)编码的菌毛。在一些实施方式中,所述分解酶是变溶菌素。在其它实施方式中,产生肺炎链球菌菌毛II岛(INV104B)编码的菌毛的肺炎链球菌细胞是肺炎链球菌TIGR4细胞。
除非另外定义,本文中所使用的所有技术和科学术语具有本领域普通技术人员通常所理解的含义。虽然在本发明的实施或测试中可以采用类似于或等同于本文所述的那些方法和材料,但是,下面描述了合适的方法和材料。在出现矛盾的情况下,以本说明书的定义为准。此外,材料、方法和实施例都只是说明性的,并不构成限制。
在附图和下述描述中详细描述了本发明的一种或多种实施方式。通过以下说明书和附图以及附加的实施方式,本发明的其他特征、目的和优点将是显而易见的。
附图简要说明
图1是肺炎链球菌菌毛II岛(INV104B)的示意图。
发明详述
申请人鉴定了来自肺炎链球菌(也称为肺炎球菌)的新的多肽序列,包括新的菌毛多肽和其他可用作抗原的多肽。在由基因研究机构(The Institute for GenomicResearch)(参见网站“tigr.org”)测序的肺炎链球菌分离物INV104的未完成基因组序列中鉴定到新的菌毛多肽序列。这些菌毛多肽序列由本文中称为菌毛II岛(INV104B)的致病性岛编码,它出现在一些但并非全部临床肺炎球菌分离物中。菌毛对于肺炎球菌附着于肺上皮细胞以及建群是重要的。此外,还分析了肺炎链球菌菌株23F、INV200和OXC141的Sanger不完全基因组(参见网站“sanger.ac.uk/Projects/Microbes/”)以鉴定INV104中不存在的多肽的编码基因,其结果包括额外的菌毛多肽以及大量其他非-INV104多肽。因此,本发明提供了肺炎链球菌纤毛、菌毛和其他多肽组合物以及所述物质在治疗、诊断、免疫接种对抗肺炎链球菌感染中的应用。本文所用术语“肺炎链球菌多肽”表示包括肺炎链球菌菌毛II岛(INV104B)菌毛多肽,来自23F、INV200和OXC141的菌毛多肽以及其他来自肺炎链球菌的多肽。并且,本文所用术语“肺炎链球菌菌毛多肽”表示肺炎链球菌菌毛II岛(INV104B)菌毛多肽以及来自23F、INV200和OXC141的菌毛多肽。
INV104B菌毛II岛的肺炎链球菌菌毛
本文描述了对应于菌株TIGR4的肺炎链球菌分离物INV104B(ST227,血清型1)的sp1008和sp1009的基因之间6.5kb插入物编码的肺炎球菌菌毛(如图1所示)。肺炎链球菌分离物INV104B的这个区域在这里称为“肺炎链球菌菌毛II岛(INV104B)”。肺炎链球菌菌毛II岛(INV104B)编码三种LPXTG细胞壁锚定蛋白(在这里称为LPXTG-1、LPXTG-1A和LPXTG-2)、LepA肽酶(orf01289;SEQ IDNO:673)和两种预测的分选酶(在这里称为分选-1(SEQ ID NO:676)和分选-2(SEQID NO:1123))。这些LepA和分选酶序列参见实施例4。
这里提供了LPXTG-1(orf01290-长)的示例性核酸序列:
ATGAACGTTCAATATGATTTTAAGAAGATTCAATATTTTACCAGTAGTTTAGTTATCTTTCTCGCTATTCTTTTTTTGTGTGCACCAATTAATTCTTTACGTGCAGATTCAATAACTGAACCTCAGACAACTCTGCACAAAACGATTACTCCGATATCAGGGCAAAAAGACCAGTATGAGTTGTCACTGGATATCACATCTAAACTGGGAACGGAGACCCAGTCAGAACCCTTGGATGTAGTCTTGGTTGCCGATCTTTCAGGGAGTATGGAAGAGCGAGATGTGTGGTCTTACTCTAGTAGACGATACATTAGTAGGATTGAAGCACTAAAACATACACTGAAAGGTGTGAATGGTCGTCAGGGGCTCATTGATACAATTCTTTCTAATTCCCAAAACCGTCTGTCTATAGTTGGTTTTGCCGGAAAGATTGATAATCAGTATAATGACCGTTATTATAATGAATATTATCTGAGTTATCAATATGGAACTTGGCCAAATGCTGGTTGGTATTCAAATATCTCTTCATATGATGATGCTAAAACTTTAGTATCTTGGAGCACGGATTCTAATAGCTCAAAAAATATTGTTAGTTCGTTAACAATTGCTGACTCTAGTCGTTCTTATGGTATGGACGCGGGCATTGGCACTGGGACAAATATAAATGCTGGGTTAACTGAAGCTCAAAGATTGTTGCAAAGTGCAAGGGCTGGGGCAAAAAAAGTAGTTATTCTGCTGTCAGATGGCGAAGCTAATATGTATTACGAGTCTAATAGTGGGAGAACAATATATAACTATTATTCTAATCCAAATGTGGGACGTATGATTGATACTCCATATTGGTTTACCTCTGGTTTAGAGAGAGGAATGCTGAATATATCTAGTTTAATAGCTCCAAAAATAGATGGCTTTTATTCAATCAAATTCAGATATATAGGTTCAAACGATAGTATCACATCTCTTAAAGGATATATCAGTGGTTATAATTCTGGAATCCCCAACGAAATATTTTCTGCCAATAATGAAAATGACTTGCAACAAAAATTCAAAGAAATCACAGATAAAATTCTACCTCTAGGCGTACACCATGTAACTATATCAGATGTCTTGTCCAAGTACGTGCAGCTGTTACCTGGTGATGCTTCACACCTTCGTGTCGTCAAAATCAAGGATGGTAACGAGCAAGAACTGAATGACAATCAAGTTACGATTGAAACTAAGAAGAACGAACAGGGATTAGTGGAAGTAACAGCCAAGTTTAATCCGAGTTACACTTTGGAGGATGACGCCAAGTACGTTCTCAAGTTTACTGTCACCTCTAGCCAAGAGGCATTTGATGCGATTGCGGGTGATAAGACACTTACTAGTGATGATGCCGAAGAAGCCGATGCTACTAAACTCTACTCCAACAAGGGGGCAAAAGTTGCCTATTCCTATGGTATTGGGACCTCACGTACCAAAATAAAAGACTATTCTGAGAAGCCCACTTTCAAGCCGTCAGATCCATTGACGGTTCCTGTAGAGATTGAGTGGAAAGGTGTGGATGGAAAATCAAATCCATCAGCAAATCGTCCACCTAGTGTCGAATTAAACTTAAACCAAAAGAAAGATGGAAGTATAAAGGATTCCTATCGAAAGGTCACTAGTCCAGTTCAAACGAATAGTTTTACTGAAAATACTAGTTTTGCAAAGGTAGCTAAGGGATATGACTACGAACTGAAAGCACCAGACGCTCCGGGATACACAGTCGAAGTTCAAAAGACAGGTACGAAAGAGAAACCATCCTTCAAAGTTATTTACCGACAGCTTCCAAGTCTCACCGTAAAGAAAATCCTAGAAGGTGAACAATCACCTAATAAATCTTTCACAATTAATGTTACCTTTTCAGATAAGGATGGCAAGCCGATTAACGGCAAGTTTGGGAATACAACAGTGACTAACGGGAAAGCACAGATTTCTCTCAAAAATAGTCAGGAAACTGCCCTCAGTTATCTGCCTCGTGATACCCACTATAAGGTGGAAGAAGTAGAGAACTCTAGAACGGGATATCATGTCACCTATGAAAAACAAGAGGGGACTTTGTCAGAGGATGTTCAAACAATCGTCACCAACCACAGACTTCCGACACTTTCAGTCACAAAAAAAGTTACAGGTGCTTTTGCTAATCTTCTGCAATCCTTTAAGATTACCATTAACGTAAAGGATGCGCAAAATAAACCATTGAATGGATCGTATAGTGCAATAGTAAATAATCAAAAAACAACGCTACAATTCACCAATGGTAAGGCGACAGTTGATCTAAAGAAAGATAAAACCATCAAGATTCTCGACCTTCCTCTAAATGCTCGTTATAGTATCGAAGAAGAAGCAAGTTCGTCTCGTGGGTATCAGGTGTCCTATGATAAAAAAGAAGGAACTCTTGATGCAAATAAGTCTGCGACAGTCACGAATAATAAAAACAGCGTACCTGAAACGGGAATTGACTTCTTGAGTAGCACTCTCGTGCTTGGAGTCGTTCTTCCTCTAGGAGGGATCTTCTTTATCATCTTACTTGGTCACCTTGTGGTGAATAGGAGGAA(SEQ ID NO:1)
这里提供了LPXTG-1的示例性氨基酸序列:
MNVQYDFKKIQYFTSSLVIFLAILFLCAPINSLRADSITEPQTTLHKTITPISGQKDQYELSLDITSKLGTETQSEPLDVVLVADLSGSMEERDVWSYSSRRYISRIEALKHTLKGVNGRQGLIDTILSNSQNRLSIVGFAGKIDNQYNDRYYNEYYLSYQYGTWPN*AGWYSNISSYDDAKTLVSWSTDSNSSKNIVSSLTIADSSRSYGMDAGIGTGTNINAGLTEAQRLLQSARAGAKKVVILLSDGEANMYYESNSGRTIYNYYSNPNVGRMIDTPYWFTSGLERGMLNISSLIAPKIDGFYSIKFRYIGSNDSITSLKGYISGYNSGIPNEIFSANNENDLQQKFKEITDKILPLGVHHVTISDVLSKYVQLLPGDASHLRVVKIKDGNEQELNDNQVTIETKKNEQGLVEVTAKFNPSYTLEDDAKYVLKFTVTSSQEAFDAIAGDKTLTSDDAEEADATKLYSNKGAKVAYSYGIGTSRTKIKDYSEKPTFKPSDPLTVPVEIEWKGVDGKSNPSANRPPSVELNLNQKKDGSIKDSYRKVTSPVQTNSFTENTSFAKVAKGYDYELKAPDAPGYTVEVQKTGTKEKPSFKVIYRQLPSLTVKKILEGEQSPNKSFTINVTFSDKDGKPINGKFGNTTVTNGKAQISLKNSQETALSYLPRDTHYKVEEVENSRTGYHVTYEKQEGTLSEDVQTIVTNHRLPTLSVTKKVTGAFANLLQSFKITINVKDAQNKPLNGSYSAIVNNQKTTLQFTNGKATVDLKKDKTIKILDLPLNARYSIEEEASSSRGYQVSYDKKEGTLDANKSATVTNNKNSVPE TGIDFLSSTLVLGVVLPLGGIFFIILLGHLVVNRRK(SEQ ID NO:769和2)
LPXTG-1包含分选酶底物基序VPXTG(SEQ ID NO:16),如上述SEQ IDNO:2中下划线所示。
orf01290-长的序列(SEQ ID NO:1)具有中间终止密码子,在SEQ ID NO:1中以粗体下划线表示。在该终止密码子处终止的示例性核酸序列(orf01290-短)具有以下转录基因序列(LPXTG-1A):
ATGGACGCGGGCATTGGCACTGGGACAAATATAAATGCTGGGTTAACTGAAGCTCAAAGATTGTTGCAAAGTGCAAGGGCTGGGGCAAAAAAAGTAGTTATTCTGCTGTCAGATGGCGAAGCTAATATGTATTACGAGTCTAATAGTGGGAGAACAATATATAACTATTATTCTAATCCAAATGTGGGACGTATGATTGATACTCCATATTGGTTTACCTCTGGTTTAGAGAGAGGAATGCTGAATATATCTAGTTTAATAGCTCCAAAAATAGATGGCTTTTATTCAATCAAATTCAGATATATAGGTTCAAACGATAGTATCACATCTCTTAAAGGATATATCAGTGGTTATAATTCTGGAATCCCCAACGAAATATTTTCTGCCAATAATGAAAATGACTTGCAACAAAAATTCAAAGAAATCACAGATAAAATTCTACCTCTAGGCGTACACCATGTAACTATATCAGATGTCTTGTCCAAGTACGTGCAGCTGTTACCTGGTGATGCTTCACACCTTCGTGTCGTCAAAATCAAGGATGGTAACGAGCAAGAACTGAATGACAATCAAGTTACGATTGAAACTAAGAAGAACGAACAGGGATTAGTGGAAGTAACAGCCAAGTTTAATCCGAGTTACACTTTGGAGGATGACGCCAAGTACGTTCTCAAGTTTACTGTCACCTCTAGCCAAGAGGCATTTGATGCGATTGCGGGTGATAAGACACTTACTAGTGATGATGCCGAAGAAGCCGATGCTACTAAACTCTACTCCAACAAGGGGGCAAAAGTTGCCTATTCCTATGGTATTGGGACCTCACGTACCAAAATAAAAGACTATTCTGAGAAGCCCACTTTCAAGCCGTCAGATCCATTGACGGTTCCTGTAGAGATTGAGTGGAAAGGTGTGGATGGAAAATCAAATCCATCAGCAAATCGTCCACCTAGTGTCGAATTAAACTTAAACCAAAAGAAAGATGGAAGTATAAAGGATTCCTATCGAAAGGTCACTAGTCCAGTTCAAACGAATAGTTTTACTGAAAATACTAGTTTTGCAAAGGTAGCTAAGGGATATGACTACGAACTGAAAGCACCAGACGCTCCGGGATACACAGTCGAAGTTCAAAAGACAGGTACGAAAGAGAAACCATCCTTCAAAGTTATTTACCGACAGCTTCCAAGTCTCACCGTAAAGAAAATCCTAGAAGGTGAACAATCACCTAATAAATCTTTCACAATTAATGTTACCTTTTCAGATAAGGATGGCAAGCCGATTAACGGCAAGTTTGGGAATACAACAGTGACTAACGGGAAAGCACAGATTTCTCTCAAAAATAGTCAGGAAACTGCCCTCAGTTATCTGCCTCGTGATACCCACTATAAGGTGGAAGAAGTAGAGAACTCTAGAACGGGATATCATGTCACCTATGAAAAACAAGAGGGGACTTTGTCAGAGGATGTTCAAACAATCGTCACCAACCACAGACTTCCGACACTTTCAGTCACAAAAAAAGTTACAGGTGCTTTTGCTAATCTTCTGCAATCCTTTAAGATTACCATTAACGTAAAGGATGCGCAAAATAAACCATTGAATGGATCGTATAGTGCAATAGTAAATAATCAAAAAACAACGCTACAATTCACCAATGGTAAGGCGACAGTTGATCTAAAGAAAGATAAAACCATCAAGATTCTCGACCTTCCTCTAAATGCTCGTTATAGTATCGAAGAAGAAGCAAGTTCGTCTCGTGGGTATCAGGTGTCCTATGATAAAAAAGAAGGAACTCTTGATGCAAATAAGTCTGCGACAGTCACGAATAATAAAAACAGCGTACCTGAAACGGGAATTGACTTCTTGAGTAGCACTCTCGTGCTTGGAGTCGTTCTTCCTCTAGGAGGGATCTTCTTTATCATCTTACTTGGTCACCTTGTGGTGAATAGGAGGAA(SEQ ID NO:3)
下面提供了LPXTG-1A(来自orf01290-短)的示例性氨基酸序列:
MDAGIGTGTNINAGLTEAQRLLQSARAGAKKVVILLSDGEANMYYESNSGRTIYNYYSNPNVGRMIDTPYWFTSGLERGMLNISSLIAPKIDGFYSIKFRYIGSNDSITSLKGYISGYNSGIPNEIFSANNENDLQQKFKEITDKILPLGVHHVTISDVLSKYVQLLPGDASHLRVVKIKDGNEQELNDNQVTIETKKNEQGLVEVTAKFNPSYTLEDDAKYVLKFTVTSSQEAFDAIAGDKTLTSDDAEEADATKLYSNKGAKVAYSYGIGTSRTKIKDYSEKPTFKPSDPLTVPVEIEWKGVDGKSNPSANRPPSVELNLNQKKDGSIKDSYRKVTSPVQTNSFTENTSFAKVAKGYDYELKAPDAPGYTVEVQKTGTKEKPSFKVIYRQLPSLTVKKILEGEQSPNKSFTINVTFSDKDGKPINGKFGNTTVTNGKAQISLKNSQETALSYLPRDTHYKVEEVENSRTGYHVTYEKQEGTLSEDVQTIVTNHRLPTLSVTKKVTGAFANLLQSFKITINVKDAQNKPLNGSYSAIVNNQKTTLQFTNGKATVDLKKDKTIKILDLPLNARYSIEEEASSSRGYQVSYDKKEGTLDANKSATVTNNKNSVPETGIDFLSSTLVLGVVLPLGGIFFIILLGHLVVNRRK(SEQ ID NO:4)
LPXTG-1A包含分选酶底物基序VPXTG(SEQ ID NO:16),如上文SEQ IDNO:4中下划线所示。
下面提供了LPXTG-2(orf01287)的示例性核酸序列:
TTGATGATCATAATGAAAAAAGAAAATAAAAAAACAAAAGAAATAATCATGAAAAAAACATTCTTTAAAAAGCTATTCACTGCAAGCATTGCAGCTATAACCGCTTTGTCCGTATTCAGAGGTGTCCCGACTTTTGCGGATGATAATTCAGCAATAACCAAAGCAAATGGTGAAAATAATGCTGTTGTGAAGATTAATAAAACGTTGAATATTGCAGAGGGAATAACAACACCAACAGCGACATTTACATTTAAGTTTACAGAAAAAACAGGACAATCTTCTAACGGTGCGCCATATCAAACCGGAGTTGCAATTCCAGATAGAAATGTAGAATACAATAAAAATGATCACCCAACTGCTGATAAGATTCAAAAAGCAACAGAAGACATTTTTTCGGGAGTTGCTTATGGCCATGCTGGTGAATACGTTTATGATGTAGCGGAAGCAAAAACTGGATGGCAGGCGATTACCAAAAATGGTAAAACAATTGATGCCATGAGATACGACAAACGTACATATGAAATGCACGTTATTGTTAAGAATAAAGTAAATGGTGGTGTCTATATTTCATCAGTATACTTTAAGGAAAATAATAAATCTAACGCCCCTAAAGTAGAACCAAGTGAACAAGGCGTTTATAATTTATTTGATAACACATATACCAAAGACGCAAGTAAGGAGCCTAATCCTGATGATCCGAGTCAAGTAGACCCCAATGCGAAAGCATTAACAATTACTAAAAAAGTTGATGGAGCTTCAGGGGATAAAACAAGAGATTTCCAATTCCATATCAAGATTCAACTTCCAAGTACAAATAAAACAGCAGAAACCCCTGTTACGAATATTATAGTAAAACATGGATCTAAGTCAGAGGTGTTGGCAGTAGTGACCCCAGCAGATACAGTTGAGTACAATTTTACTCTTAAAGATGGTGAAACATTTACAGTTGAACAACTACCAGCAGGTTCTAAATATACAGTAACTGAAACTGGAGTAGCAGGTTATACAGATTCATCAATTTATACTACAAATGGTGCAGAACAAACATCTCAAGGACAAAAAAATGTAGATTTTACATTAACAGATATCCTCATAGGTGAAAAGAAAAACGACAACAAAGTTACTAACAAAATCGACGACGTTACTCCTACTGGTCTCTTGATTGATAACCTTCCATTCATTTTGATGATTGGTCTTGGTTTGGCTGGATTTGTTGTCTTGTCTAAAAAACGTAGAGAAGCCTA(SEQ ID NO:5)
下面提供了LPXTG-2的示例性氨基酸序列:
MIIMKKENKKTKEIIMKKTFFKKLFTASIAAITALSVFRGVPTFADDNSAITKANGENNAVVKINKTLNIAEGITTPTATFTFKFTEKTGQSSNGAPYQTGVAIPDRNVEYNKNDHPTADKIQKATEDIFSGVAYGHAGEYVYDVAEAKTGWQAITKNGKTIDAMRYDKRTYEMHVIVKNKVNGGVYISSVYFKENNKSNAPKVEPSEQGVYNLFDNTYTKDASKEPNPDDPSQVDPNAKALTITKKVDGASGDKTRDFQFHIKIQLPSTNKTAETPVTNIIVKHGSKSEVLAVVTPADTVEYNFTLKDGETFTVEQLPAGSKYTVTETGVAGYTDSSIYTTNGAEQTSQGQKNVDFTLTDILIGEKKNDNKVTNKIDDVTPTGLLIDNLPFILMIGLGLAGFVVLSKKRREA(SEQ ID NO:6)
LPXTG-2包含分选酶底物基序VTXTG(SEQ ID NO:21),如上文SEQ IDNO:6中下划线所示。
在23F、INV200和OXC141肺炎链球菌菌株中鉴定的多肽序列
肺炎链球菌菌株23F、INV200和OXC141的Sanger不完全基因组不包含肺炎链球菌菌毛II岛(INV104B)区域。然而,23F、INV200和OXC141编码分选酶和LPXTG细胞壁锚定蛋白,如本文所述。例如,23F和OXC141各自编码至少一种非肺炎链球菌菌株INV104B编码的分选酶(分选-23F(orf01917;SEQ IDNO:1386)和分选-OXC141(orf01672;SEQ ID NO:282)),INV200编码至少三种非肺炎链球菌菌株INV104B编码的细胞壁锚定蛋白(锚定-1、锚定-2和锚定-3)。
下面提供了锚定-1(INV200-orf00426)的示例性氨基酸序列:
MRVSSDTNIYEYRALSPQQKAALEMIRADLYKFTVPYENLEYRFYKPDWVFGLGYQALATVRWKIEPATITVTKKWENVKEGAKKPDVWIQLLKDGKPEGERKRIESDKGQTTFEIPNKDEINKYSVKEVDKEGRDWKHKDFTAGQPVNKGNGHFEITNTKKEKPKIKVTFKKIAGDTNKDLAGAHLVLKKIFDDGNGLLIKQWDTIGQPVDIDLDAGSYTLTEEKAPDGYMLAAPVSFYVEEDGQIILPKGEDLEAQNDKTITMVDEKIKEKPTKPSGKLATTVEVDGTKADAQKELELSVATDKVTKTVKDTVVYENLLAGETYKLTGQLMKITADKEEEVATKETTFVADASGNGTTSLEFEDVSLEAGVKYVVYETAESEKEIDFKEGKEKHKVEHKDKDDKAQTVVVTKEKPTKPSGKLATTVEVDGTKADAQKELELSVATDKVTKTVKDTVVYENLLAGETYKLTGQLMKITADKEEEVATKETTFVADASGNGTTSLEFEDVSLEAGVKYVVYETAESEKEIDFKEGKEKHKVEHKDKDDKAQTVVVSKIKPEPGAQEVHFSKVNVGGEEIAGAEIHIKQGDTVVASWVSEAGKTHTLKLKPGHYIFHEAVAPGGYLAVTDIHFSVDETGQVTVTDVNGNTAVAEGNKLTVTDQTKPVTPPSPEEPGAQEVHFSKVNVGGEEIAGAEIHIKQGDTVVASWVSEAGKTHTLKLKPGHYIFHEAVAPGGYLAVTDIHFSVDETGQVTVTDVNGNTAVAEGNKLTVTDQTKPVTPPSPEEPGAQEVHFSKVNVGGEEIAGAEIHIKQGDTVVASWVSEAGKTHTLKLKPGHYIFHEAVAPGGYLAVTDIHFSVDETGQVTVTDVNGNTAVAEGNKLTVTDQSADKDKQDKLPNTGETTGTYLSILGMITAVFASLLYRSKKK(SEQ ID NO:7)
锚定-1包含分选酶底物基序LPNTG(SEQ ID NO:10),如上文SEQ ID NO:7中下划线所示。
下面提供了锚定-2(INV200-orf00441)的示例性氨基酸序列:
MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLPADLATALATAKENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKPAEEKPKEDKPAAAKPETPKTVTPEWQTVEKKEQKGTVTIREEKGVRYNQLSSTAQNDNAGKPALFEKKGLTVVANGNATVDLTFKDDSEKGKSRFGVFLKFKDTNNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYGNDRTVVSVKTDNQEGVKADDTPAQKETGPVVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKIRDDAHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATMSNNTHVSGDDHIDVTNPMKDLAKGYMYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQWEKAYKGIVFPEYTKELPSAKVVITEDANADKKVDWQDGAIAYRSIMNNPQGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLKGYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASETYPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLARWEDLKKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHGGEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWENGTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDGSAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSDWAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYVLYRSKQTNPEMSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVYVGVDNRSNAKASITVNTGEKEVTTYTNKSLALNYVKAYAHNTRRDNATVDDTSYFQNMYAFFTTGADVSNVTLTLSREAGDQATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLSEKHNPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLEMHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNANFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMTNYTKESMDALKEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFASLTAPAQAQEGLANAFDGNVSSLWHTSWNGGDVGKPATMVLKEPTEITGLRYVPRGSGSNGNLRDVKLVVTDESGKEHTFTATDWPDNNKPKDIDFGKTIKAKKIVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALAKAQKLTDKDNQEEVASVQASMKYATDNHLLTERMVEYFADYLNQLKDSATKPDAPTVEKPEFKLSSLVSEQGKTPDYKQEIARPETPEQILPATGESQSDTSLFLASVSLALSALFVVKTKKD(SEQ ID NO:8)
锚定-2包含分选酶底物基序LPATG(SEQ ID NO:10),如上文SEQ ID NO:8中下划线所示。
下面提供了锚定-3(INV200-orf03448)的示例性氨基酸序列:
MRVSSDTNIYEYRALSPQQKAALEMIRADLYKFTVPYENLEYRFYKPDWVFGLGYQALATVRWKIEPATITVTKKWENVKEGAKKPDVWIQLLKDGKPEGERKRIESDKGQTTFEIPNKDEINKYSVKEVDKEGRDWKHKDFTAGQPVNKGNGHFEITNTKKEKPKIKVTFKKIAGDTNKDLAGAHLVLKKIFDDGNGLLIKQWDTIGQPVDIDLDAGSYTLTEEKAPDGYMLAAPVSFYVEEDGQIILPKGEDLEAQNDKTITMVDEKIKEKPTKPSGKLATTVEVDGTKADAQKELELSVATDKVTKTVKDTVVYENLLAGETYKLTGQLMKITADKEEEVATKETTFVADASGNGTTSLEFEDVSLEAGVKYVVYETAESEKEIDFKEGKEKHKVEHKDKDDKAQTVVVTKEKPTKPSGKLATTVEVDGTKADAQKELELSVATDKVTKTVKDTVVYENLLAGETYKLTGQLMKITADKEEEVATKETTFVADASGNGTTSLEFEDVSLEAGVKYVVYETAESEKEIDFKEGKEKHKVEHKDKDDKAQTVVVTKEKPTKPSGKLATTVEVDGTKADAQKELELSVATDKVTKTVKDTVVYENLLAGETYKLTGQLMKITADKEEEVATKETTFVADASGNGTTSLEFEDVSLEAGVKYVVYETAESEKEIDFKEGKEKHKVEHKDKDDKAQTVVVTKEKPTKPSGKLATTVEVDGTKADAQKELELSVATDKVTKTVKDTVVYENLLAGETYKLTGQLMKITADKEEEVATKETTFVADASGNGTTSLEFEDVSLEAGVKYVVYETAESEKEIDFKEGKEKHKVEHKDKDDKAQTVVVSKIKPEPGAQEVHFSKVNVGGEEIAGAEIHIKQGDTVVASWVSEAGKTHTLKLKPGHYIFHEAVAPGGYLAVTDIHFSVDETGQVTVTDVNGNTAVAEGNKLTVTDQTKPVTPPSPEEPGAQEVHFSKVNVGGEEIAGAEIHIKQGDTVVASWVSEAGKTHTLKLKPGHYIFHEAVAPGGYLAVTDIHFSVDETGQVTVTDVNGNTAVAEGNKLTVTDQTKPVTPPSPEEPGAQEVHFSKVNVGGEEIAGAEIHIKQGDTVVASWVSEAGKTHTLKLKPGHYIFHEAVAPGGYLAVTDIHFSVDETGQVTVTDVNGNTAVAEGNKLTVTDQTKPVTPPSPEEPGAQEVHFSKVNVGGEEIAGAEIHIKQGDTVVASWVSEAGKTHTLKLKPGHYIFHEAVAPGGYLAVTDIHFSVDETGQVTVTDVNGNTAVAEGNKLTVTDQSADKDKQDKLPNTGETTGTYLSILGMITAVFASLLYRSKKK(SEQ ID NO:9)
锚定-3包含分选酶底物基序LPNTG(SEQ ID NO:10),如上文SEQ ID NO:9中下划线所示。
23F、INV200和OXC141还编码非肺炎链球菌菌株INV104B编码的额外的多肽序列。这些额外的多肽序列在本文实施例2中描述,作为本文所述方法可用序列的具体例子,以及作为在对象中产生抗体和/或刺激免疫应答的免疫原性组合物中的抗原。
其他细菌菌毛多肽和来自肺炎链球菌的多肽
本文所述方法和组合物可用于菌毛多肽或来自任何革兰氏阳性菌的其他多肽,所述革兰氏阳性菌包括例如,肺炎链球菌。在GAS(如酿脓链球菌(Streptococcus pyogenes))(Mora等,2005,Proc.Natl.Acad.Sci.USA,102:15641-6),GBS(如,无乳链球菌(Streptococcus agalactiae))(Lauer等,2005,Science,309:105;WO 2006/078318),内氏放线菌(Actinomycetes naeslundii)(Yeung等,1998,Infect.Immun.,66:1482-91),白喉棒状杆菌(Ton-That等,2003,Mol.Microbiol.,50:1429-38;Ton-That和Schneewind,2004,Trends.Microbiol.,12:228-34),产气荚膜梭菌(Clostridium perfringens)和粪肠球菌(Enterococcus faecalis)中已经鉴定到已知和推定的菌毛蛋白。革兰氏阳性菌的例子包括但不限于:硬壁菌科,例如链球菌属(Streptococcus)(如肺炎链球菌(S.pneumoniae)、无乳链球菌(S.agalactiae)、酿脓链球菌(S.pyogenes)、猪链球菌(S.suis)、兽疫链球菌(S.zooepidemicus)、草绿色链球菌(S.viridans)、变形链球菌(S.mutans)、格氏链球菌(S.gordonii)、马链球菌(S.equi))、杆菌属(Bacillus)(如炭疽杆菌(B.anthracis)、蜡样芽孢杆菌(B.cereus)、枯草杆菌(B.subtilis))、李斯特菌属(Listeria)(如无害李斯特菌(L.innocua)、单核细胞增生李斯特菌(L.monocytogenes))、葡萄球菌属(Staphylococcus)(如金黄色葡萄球菌(S.aureus)、表皮葡萄球菌(S.epidermidis)、山羊葡萄球菌(S.caprae)、腐生葡萄球菌(S.saprophyticus)、路邓葡萄球菌(S.lugdunensis)、施氏葡萄球菌(S.schleiferi))、肠球菌属(Enterococcus)(如粪肠球菌(E.faecalis)、屎肠球菌(E.faecium))、乳酸菌属(Lactobacillus)、乳球菌属(Lactococcus)(如乳酸乳球菌(L.lactis))、明串珠菌属(Leuconostoc)(如肠膜明串珠菌(L.mesenteroides))、梳状菌属(pectinatus)、片球菌属(Pediococcus)、醋酸杆菌属(Acetobacterium)、梭菌属(Clostridium)(如肉毒梭菌(C.botulinum)、艰难梭菌(C.difficile)、产气荚膜梭菌(C.perfringens)、破伤风梭菌(C.tetani))、瘤胃球菌属(Ruminococcus)(如白色瘤胃球菌(R.albus))、螺旋杆菌属(Heliobacterium)、草螺菌属(Heliospirillum)和鼠孢菌属(Sporomusa);和放线菌科,例如放线菌属(Actinomycetes)(如内氏放线菌(A.naeslundii))、棒状杆菌属(Corynebacterium)(如白喉棒状杆菌(C.diphtheriae)、效力棒状杆菌(C.efficiens))、节杆菌属(Arthrobacter)、双岐杆菌属(Bifidobacterium)(如长双歧杆菌(B.longum))、弗兰克菌属(Frankia)、微球菌属(Micrococcus)、单孢丝菌属(Micromonospora)、分枝杆菌属(Mycobacterium)(如结核分枝杆菌(M.tuberculosis)、麻风分枝杆菌(M.leprae)、牛分枝杆菌(M.bovis)、非洲分枝杆菌(M.africanum)、田鼠分枝杆菌(M.microti))、诺卡氏菌属(Nocardia)(如星形诺卡氏菌(N.asteroides))、丙酸杆菌属(Propionibacteriun)和链霉菌属(Streptomyces)(如索马里链霉菌(S.somaliensis)、除虫链霉菌(S.avermitilis)、天蓝色链霉菌(S.coelicolor))。
分离的菌毛多肽和其他来自肺炎链球菌的多肽
分离的肺炎链球菌多肽可用于本文所述的方法中,并用作在对象中产生抗体和/或刺激免疫应答的免疫原性组合物中的抗原。有用的肺炎链球菌LPXTG细胞壁锚定菌毛多肽的例子包括:LPXTG-1、LPXTG-1A、LPXTG-2、锚定-1、锚定-2和锚定-3(即SEQ ID NO:2、4、6、7、8和9)。有用的肺炎链球菌分选酶多肽的例子包括:分选-1、分选-2、分选-23F和分选-OXC141(即SEQ ID NO:676、1123、1386和282)。肺炎链球菌多肽的变体也可用于本文所述的方法中,作为在对象中产生抗体和/或刺激免疫应答的免疫原性组合物中的抗原。例如,新方法中也可使用与肺炎链球菌多肽序列具有至少80%序列相同性,例如至少85%、至少90%、至少95%、至少98%或至少99%相同性的肺炎链球菌多肽。而且,本文所述的组合物和方法中可使用最多具有50个,例如1、3、5、10、15、20、25、30或40个氨基酸插入、删除或取代,例如保守氨基酸取代的肺炎链球菌多肽序列。
确定两个氨基酸序列之间的相同性百分比可采用BLAST 2.0程序来完成,公众可以从ncbi.nlm.nih.gov/BLAST获得该程序。采用无缺口比对和默认参数(BLOSUM 62矩阵,存在缺口损耗为11,每个残基缺口损耗为1,λ比率0.85)进行序列比较。BLAST程序中采用的数学算法参见Altschul等,1997,NucleicAcids Research,25:3389-3402。
如本文所用,“保守氨基酸取代”表示多肽中的一个氨基酸家族内的氨基酸取代。氨基酸家族是本领域所明白的,基于氨基酸侧链的物理和化学性质进行分类。家族包括:具有碱性侧链的氨基酸(例如,赖氨酸、精氨酸和组氨酸);具有酸性侧链的氨基酸(例如,天冬氨酸和谷氨酸);具有不带电极性侧链的氨基酸(例如,甘氨酸、天冬酰胺、谷胺酰胺、丝氨酸、苏氨酸、酪氨酸和半胱氨酸);具有非极性侧链的氨基酸(例如,丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、苯丙氨酸、甲硫氨酸和色氨酸);具有分枝侧链的氨基酸(例如,苏氨酸、缬氨酸和异亮氨酸);和具有芳族侧链的氨基酸(例如,酪氨酸、苯丙氨酸、色氨酸和组氨酸)。氨基酸可属于一个以上的家族。
肺炎链球菌多肽的片段,如免疫原性片段也可用于本文所述的方法和组合物中。通常,该片段具有至少8、10、15、20、50、100、200或500个毗连的肺炎链球菌多肽的氨基酸残基。具有有用片段的肺炎链球菌LPXTG多肽的例子包括:LPXTG-1、LPXTG-1A、LPXTG-2、锚定-1、锚定-2或锚定-3(例如,SEQ ID NO:2、4、6、7、8或9)。具有有用片段的肺炎链球菌分选酶多肽的非限制性例子包括:分选-1、分选-2、分选-23F和分选-OXC141(即,SEQ ID NO:676、1123、1386和282)。在一些实施方式中,这些片段保留至少一种全长蛋白质的生物学活性,例如与肽聚糖细胞壁共价结合或能够通过LPXTG基序与另一片段或蛋白质交联。
在一些实施方式中,本文所述的免疫原性组合物包含一种或多种可以寡聚体(菌毛)形式配制或纯化的肺炎链球菌菌毛多肽。在一些实施方式中,所述寡聚体形式是超寡聚体。在一些实施方式中,本文所述的免疫原性组合物包含一种或多种以寡聚体(菌毛)形式分离的肺炎链球菌菌毛多肽。所述包含肺炎链球菌菌毛多肽的寡聚体或超寡聚体菌毛结构可纯化或以其他方式配制,以便用于免疫原性组合物中。
一种或多种来自肺炎链球菌开放阅读框多核苷酸序列的肺炎链球菌多肽可以被编码替代ORF的片段的多核苷酸序列所替代。在一些实施方式中,一种或多种来自肺炎链球菌开放阅读框的肺炎链球菌多肽可以被与替代ORF具有序列同源性的序列所替代。
一种或多种肺炎链球菌菌毛多肽序列通常包含LPXTG基序(如LPXTG(SEQID NO:10))或其他分选酶底物基序。通常,肺炎链球菌菌毛蛋白的LPXTG分选酶底物基序可以表示为通式X1X2X3X4G(SEQ ID NO:1746),其中X1是L、V、E、Y、I、Q;如果X1是L,则X2是P;如果X1是E或Q则X2是V;如果X1是V,则X2是V或P;X3是任何氨基酸残基;如果X1是V、E或Q,则X4是T;如果X1是L,则X4是T、S或A。LPXTG基序的非限制性例子包括:YPXTG(SEQ IDNO:11)、IPXTG(SEQ ID NO:12)、LPXSG(SEQ ID NO:13)、VVXTG(SEQ IDNO:14)、EVXTG(SEQ ID NO:15)、VPXTG(SEQ ID NO:16)、QVXTG(SEQ IDNO:17)、LPXAG(SEQ ID NO:18)、QVPTG(SEQ ID NO:19)、FPXTG(SEQ IDNO:20)和VTXTG(SEQ ID NO:21)。
本文所述的肺炎链球菌菌毛多肽可实现肺炎链球菌附着于上皮细胞并侵入上皮细胞的功能。这些菌毛多肽也可能影响肺炎链球菌易位通过上皮细胞层的能力。在一些实施方式中,一种或多种来自肺炎链球菌的肺炎链球菌菌毛多肽能够结合或以其他方式联接于上皮细胞表面或其合成模型。在一些实施方式中,一种或多种肺炎链球菌菌毛多肽结合或联接于纤维蛋白原、纤连蛋白或胶原中的一种或多种。
预测肺炎链球菌分选酶蛋白与含LPXTG的表面蛋白的分泌与锚定有关。本文所述的肺炎链球菌菌毛II岛(INV104B)分选酶蛋白由对应于TIGR4的sp1008和sp1009的基因之间相同6.5kb插入物中所见的基因(分选-1和分选-2)编码,如上文LPXTG-1和LPXTG-2基因所述。23F和OXC141也编码分选酶:分选-23F(SEQ ID NO:1386)和分选-OXC141(SEQ ID NO:282)。本文所述方法中使用的分选酶蛋白和分选酶蛋白的变体可由革兰氏阳性菌获得。
肺炎链球菌菌毛多肽可通过膜结合的转肽酶(如分选酶)共价连接于细菌细胞壁。分选酶可用于切割表面蛋白,优选在LPXTG基序的苏氨酸和甘氨酸残基之间。然后,分选酶有助于在苏氨酸羧基与细胞壁前体如脂质II之间形成酰胺键。然后,该前体通过细菌壁合成的转糖基和转肽反应结合到肽聚糖中。参见Comfort等,Infection和Immunity(2004)72(5):2710-2722。
在一些实施方式中,本文所述的组合物包含寡聚菌毛样结构,所述寡聚菌毛样结构包含肺炎链球菌菌毛多肽如LPXTG-1、LPXTG-1A、LPXTG-2、锚定-1、锚定-2或锚定-3(如SEQ ID NO:2、4、6、7、8或9)。寡聚菌毛样结构可包含许多菌毛多肽单元。在一些实施方式中,寡聚菌毛样结构包含两种或更多种菌毛蛋白。在一些实施方式中,寡聚菌毛样结构包含超-寡聚菌毛样结构,该结构包含至少两个(例如、2、3、4、5、6、7、8、9、10、11、12、13、14、15、20、25、30、35、40、45、50、60、70、80、90、100、120、140、150、200或更多个)寡聚亚基,其中每个亚基包含菌毛蛋白或其片段。寡聚亚基可通过菌毛蛋白基序内的保守赖氨酸共价结合。寡聚亚基可通过LPXTG基序共价结合,在一些实施方式中,可通过苏氨酸或丝氨酸的氨基酸残基共价结合。
肺炎链球菌菌毛多肽或其片段可结合到本文所述的寡聚菌毛样结构中,在一些实施方式中,包含菌毛蛋白基序。寡聚菌毛样结构可以单独使用或联用。在一些实施方式中,本文所述的组合物包含寡聚形式的肺炎链球菌菌毛II岛(INV104B)菌毛。并且,在一些实施方式中,肺炎链球菌菌毛II岛(INV104B)菌毛可以超寡聚形式构造。
菌毛纯化方法
肺炎链球菌编码的菌毛可以从表达肺炎链球菌菌毛或菌毛样结构的细胞,例如细菌细胞纯化,该过程包括例如通过机械剪切或酶消化从细胞分离菌毛,以及离析经分离的菌毛。
用于纯化菌毛的合适的细菌细胞包括:表达肺炎链球菌菌毛多肽的有菌毛的革兰氏阳性菌菌株,用一种或多种革兰氏阳性菌毛蛋白如肺炎链球菌LPXTG-1、LPXTG-1A、LPXTG-2、锚定-1、锚定-2和锚定-3(如SEQ ID NO:2、4、6、7、8和9)转化的无菌毛的革兰氏阳性菌,以及用一种或多种革兰氏阳性菌菌毛蛋白如肺炎链球菌LPXTG-1、LPXTG-1A、LPXTG-2、锚定-1、锚定-2和锚定-3(如、SEQID NO:2、4、6、7、8和9)转化的革兰氏阴性或其他细胞。通常,用于纯化菌毛的细胞仅产生所需的菌毛类型,例如内源性或异源性菌毛。为产生异源性菌毛,通过例如突变或重组DNA方法改变细胞,使其不产生内源性菌毛。通常,可用于纯化的产生菌毛的革兰氏阳性菌细胞将表达一种或多种相容的分选酶,使得菌毛在细胞表面表达。可进行纯化的肺炎链球菌菌毛II岛(INV104B)、23F、INV200和OXC141LPXTG细胞壁锚定多肽的例子包括:LPXTG-1、LPXTG-1A、LPXTG-2、锚定-1、锚定-2和锚定-3(如、SEQ ID NO:2、4、6、7、8和9)。可进行纯化的肺炎链球菌菌毛II岛(INV104B)、23F、INV200和OXC141分选酶的例子包括:分选-1、分选-2、分选-23F和分选-OXC141(如SEQ ID NO:676、1123、1386和282)。
从革兰氏阳性菌细胞分离菌毛通常通过机械剪切、酶消化、降低或抑制分选酶活性,或用干扰细胞壁完整性的化合物进行处理来实现。机械剪切可从细胞物理去除菌毛,而其他方法可消除菌毛连接点(例如,通过降解细胞壁或菌毛组分)。从细胞分离菌毛之后,菌毛和细胞可通过离心分离。
机械剪切方法的非限制性例子包括:超声处理、玻璃珠剪切和混合。超声处理的方法例如在Yamaguchi等,2004,Current Microbiol.,49:59-65中讨论。玻璃珠剪切方法例如在Levesque等,2001,J.Bacteriol.,183:2724-32中讨论。机械剪切的一般方法例如在Wolfgang等,1998,Mol.Microbiol.,29:321-30;Trachtenberg等,2005,J.Mol.Biol.,346:665-676;Parge等,1990,J.Biol.Chem.,265:2278-85;Isaacson等,1981,J.Bacteriol.,146:784-9;Korhonen等,1980,Infect.Immun.,27:569-75;Hahn等,2002,J.Mol.Biol.,323:845-57;St.Geme等,1996,Proc.Natl.Acad.Sci.USA,93:11913-18;Weber等,2005,J.Bacteriol.,187:2458-68;和Mu等,2002,J.Bacteriol.,184:4868-74中讨论。
适用于酶消化的酶的非限制性例子包括细胞壁降解酶,例如变溶菌素、溶葡萄球菌素和溶菌酶。酶消化的方法例如在Bender等,2003,J.Bacteriol.,185:6057-66;Ton-That等,2004,Mol.Microbiol.,53:251-61;和Ton-That等,2003,Mol.Microbiol.,50:1429-38中讨论。对于下游给予对象,可采用多种酶以去除可能造成不希望的宿主反应的细胞壁组分。
抑制或降低分选酶活性的方法的非限制性例子包括:通过引入SrtA失能等位基因、删除内源性SrtA基因、表达降低SrtA表达的核酸(例如,反义或miRNA)以及用抑制SrtA活性的化合物处理细胞来降低SrtA活性(参见例如,Marrafini等,Microbiol.Mol.Biol.Rev.,70:192-221,2006)。
示例性的分选酶A抑制剂包括:甲烷-硫代磺酸酯(如,MTSET和(2-磺酸基乙基)甲烷-硫代磺酸酯)(Ton-That和Schneewind,J.Biol.Chem.,274:24316-24320,1999),对羟基汞苯甲酸,葡糖基甾醇β-谷固醇-3-O-吡喃葡糖醇(Kim等,Biosci.Biotechnol.Biochem.,67:2477-79,2003),氯化黄连素(Kim等,Biosci.Biotechnol.Biochem.,68:421-24,2004),肽基-重氮甲烷(LPAT-CHN2)(Scott等,Biochem.J.,366:953-58,2002),肽基-氯甲烷(LPAT-CH2Cl),肽基-乙烯砜[LPAT-SO2(Ph)](Conolly等,J.Biol.Chem.,278:34061-65,2003),乙烯砜(如,二-、乙基-、甲基-和苯基乙烯砜)(Frankel等,J.Am.Chem.Soc.,126:3404-3405,2004),苏氨酸残基被膦酸酯基团取代的LPXTG基序肽(如,LPEΨ{PO2H-CH2}G)(Kruger等,Bioorg.Med.Chem.,12:3723-29,2004),取代的(Z)-二芳基丙烯腈(Oh等,J.Med.Chem.,47:2418-21,2004)和各种药用植物的提取物(Kim等,Biosci.Biotechnol.Biochem.,66:2751-54,2002)。
干扰细胞壁完整性的化合物的非限制性例子包括甘氨酸和抗生素,例如青霉素(如甲氧西林、阿莫西林、氨苄青霉素)、头孢菌素(如头孢氨苄、头孢丙烯、头孢吡肟)、糖肽(如去甲万古霉素、替考拉宁、雷莫拉宁)和环丝氨酸。
分离的菌毛可通过密度,例如采用密度梯度离心而与其他组分分离。例如,菌毛可通过蔗糖梯度离心而分离。
通常,由于菌毛中存在不同数量的菌毛蛋白亚基,含革兰氏阳性菌毛的样品将包含不同分子量的菌毛寡聚体。为了降低多分散性,可根据大小分离含革兰氏阳性菌毛的样品。例如,可采用凝胶过滤柱或大小排阻柱。也可使用超滤膜来降低多分散性。
可采用亲和方法如亲和色谱法分离革兰氏阳性菌毛。例如,将特异性结合革兰氏阳性菌毛的蛋白质,如特异性结合菌毛组分的抗体或优先结合菌毛的抗体固定到固相基质(例如色谱基质)上,然后使含革兰氏阳性菌毛的样品曝露于固定的结合蛋白质。这种亲和分离方法也可用于分离、纯化或富集表达革兰氏阳性菌毛的细胞制品。
革兰氏阳性菌毛也可采用本领域已知的任何其他蛋白纯化方法,例如沉淀、柱色谱方法和样品浓缩方法进行分离。其他方法例如参见Ruffolo等,1997,Infect.Immun.,65:339-43。蛋白纯化方法详见于Scopes,R.K.,《蛋白质纯化:原理与实践》(Protein Purification:Principles and Practice),第3版,1994,Springer,NY。
纯化期间组分中出现革兰氏阳性菌毛后可进行电泳(如聚丙烯酰胺电泳),测定特异性结合革兰氏阳性菌毛的试剂(例如,针对菌毛蛋白的抗体或优先结合菌毛的抗体)的结合,和/或测定菌毛活性,例如蛋白质或细胞结合活性。
抗体
本文所述的肺炎链球菌多肽也可用于制备针对这些肺炎链球菌多肽的特异性抗体。在一些实施方式中,抗体对寡聚体或超寡聚体形式的肺炎链球菌菌毛多肽是特异性的。本文所述的组合物也可包含肺炎链球菌菌毛多肽与选择的其他肺炎链球菌多肽的特异性抗体的组合,以提供针对范围扩大的血清型和菌株分离物的保护作用。例如,这种组合可包含第一和第二抗体,其中所述第一抗体是对第一肺炎链球菌多肽特异性的,第二抗体是对第二肺炎链球菌多肽或非-肺炎链球菌多肽特异性的。
本文所述的特异性肺炎链球菌多肽抗体包括一个或多个能够通过化学或物理方式结合于或联接于肺炎链球菌多肽的表位的生物学部分。本文所述抗体包括特异性结合肺炎链球菌多肽的抗体。本文所述组合物包含由多克隆和单克隆制剂获得的抗体,以及以下物质:杂交(嵌合)抗体分子(参见例如,Winter等,(1991)Nature 349:293-299;和美国专利4,816,567;F(ab’)2和F(ab)片段;Fv分子(非共价异二聚体,参见例如,Inbar等,(1972)Proc Natl Acad Sci USA 69:2659-2662;和Ehrlich等,(1980)Biochem 19:4091-4096);单链Fv分子(sFv)(参见例如,Huston等,(1988)Proc Natl Acad Sci USA 85:5897-5883);二聚和三聚抗体片段构建物;小抗体(参见例如,Pack等,(1992)Biochem 31:1579-1584;Cumber等,(1992)JImmunology 149B:120-126);人源化抗体分子(参见例如,Riechmann等,(1988)Nature 332:323-327;Verhoeyan等,(1988)Science 239:1534-1536;和英国专利公开GB 2,276,169,1994年9月21日出版);以及从这些分子获得的任何功能性片段,其中所述片段保留母体抗体分子的免疫结合性质。本文所述的组合物还包含通过非常规方法,例如噬菌体展示获得的抗体。
本文所述的抗体可以是多克隆、单克隆、重组(例如嵌合或人源化)的完全人、非人(例如鼠)或单链抗体。这些抗体的制备方法是已知的。在一些情况下,抗体具有效应功能并能固定补体(fix complement)。抗体也可偶联毒素、报道基团或成像试剂。
在一些实施方式中,本文所述的特异性肺炎链球菌抗体是单克隆抗体。这些单克隆抗体包括具有均一抗体群的抗体组合物。本文所述的单克隆抗体可以从鼠杂交瘤获得,采用人而非鼠杂交瘤获得人单克隆抗体。例如参见Cote等,单克隆抗体和癌症治疗(Monoclonal Antibodies and Cancer Therapy),Alan R.Liss,1985,第77页。
嵌合、人源化(例如完全人)的抗体适合需要重复给予人对象的情况,例如治疗性应用(以及一些诊断应用)。
本文所述的抗体也可用于肺炎链球菌感染的预防或治疗应用。该抗体可阻断肺炎链球菌附着于宿主细胞或对宿主细胞的一些其他活性。此外,抗体还可用于将毒素或治疗剂(如抗生素)递送至肺炎链球菌细胞。
本文所述的抗体可用于诊断应用,例如,用于检测生物样品中是否存在肺炎链球菌多肽。来自肺炎链球菌抗体的抗-纤丝、菌毛多肽或其他多肽可用于诊断目的,以监测组织中的蛋白质水平作为临床试验过程的一部分,例如以测定给定治疗方案的效力。将抗体偶联(例如物理连接)于可检测物质(即抗体标记)以便于检测。可检测物质的例子包括:各种酶、辅基、荧光材料、造影剂、发光材料、生物发光材料和放射性材料。合适的酶的例子包括辣根过氧化物酶、碱性磷酸酶、β-半乳糖苷酶或乙酰基胆碱酯酶;合适的辅基复合物的例子包括链霉亲和素/生物素和亲和素/生物素;合适的荧光材料的例子包括伞形酮、荧光素、荧光素异硫氰酸酯、罗丹明、二氯三嗪胺荧光素、丹酰氯或藻红蛋白;造影剂的例子包括适合电镜镜检的电子致密材料如金颗粒、或适合磁共振成象的磁活性材料如超磁性(supermagnetic)铁颗粒;发光材料的例子包括发光氨(luminol);生物发光材料的例子包括萤光素酶、萤光素和水母素;合适的放射性材料的例子包括125I、131I、35S或3H。检测受感染患者,例如测定来自患者的样品中是否存在含肺炎链球菌菌毛II岛(INV104B)菌毛的有菌毛肺炎链球菌的方法中可使用这些诊断抗体。然后基于肺炎链球菌的存在情况选择疗程。例如,被无菌毛的肺炎链球菌感染的患者可用抗生素进行治疗,而被含有肺炎链球菌菌毛多肽的有菌毛肺炎链球菌感染的患者可用菌毛结合化合物,如抗体和/或抗炎药(如,IL-6或抗-TNF试剂如抗TNF抗体)进行治疗。
筛选试验
在一些方面,本文所述方法(在这里也称为“筛选试验”)可用于鉴定调节剂,即从一种或多种测试化合物(例如,抗体、蛋白质、肽、肽模拟物、类肽、无机小分子、非核酸类有机小分子、核酸(如反义核酸、siRNA、寡核苷酸或合成寡核苷酸)、或其他药物)鉴定的抑制肺炎链球菌菌毛多肽的活性(如结合活性)的候选化合物或试剂。这样鉴定的化合物可用于调节含肺炎链球菌菌毛多肽的肺炎链球菌的结合活性,或者在治疗方案中这种肺炎链球菌的粘附,以调整肺炎链球菌的生物学功能。
在一些实施方式中,提供筛选测试化合物以鉴定结合肺炎链球菌菌毛多肽或其一部分的那些化合物的试验。可测定结合肺炎链球菌菌毛多肽的化合物调节肺炎链球菌菌毛相关活性,例如粘附、感染或者炎症反应的能力。
本文所述方法中使用的测试化合物可利用本领域已知的许多组合文库方法中的任一种获得,包括生物学文库;类肽文库(具有肽功能,但具有新型非肽主链的文库,其不发生酶降解,但仍然保持生物活性;参见例如,Zuckermann等,1994,J.Med.Chem.,37:2678-2685);空间可寻址的平行固相或溶液相文库;要求解卷积的合成文库方法;“一珠一化合物”的文库方法;和利用亲和色谱选择的合成文库方法。生物学文库和类肽文库方法限于肽文库,而其他四种方法适用于肽、非肽寡聚体或小分子化合物的文库(Lam,1997,Anticancer Drug Des.,12:145)。
本领域中分子文库合成方法的例子可参见例如:DeWitt等,(1993,Proc.Natl.Acad.Sci.USA,90:6909;Erb等,1994,Proc.Natl.Acad.Sci.USA,91:11422;Zuckermann等,1994,J.Med.Chem.,37:2678;Cho等,1993,Science,261:1303;Carrell等,1994,Angew.Chem.Int.Ed.Engl.,33:2059;Carell等,1994,Angew.Chem.Int.Ed.Engl.,33:2061;和in Gallop等,1994,J.Med.Chem.,37:1233)。
化合物文库可以存在于:溶液中(如,Houghten,1992,Biotechniques,13:412-421),或珠上(Lam,1991,Nature,354:82-84),芯片上(Fodor,1993,Nature,364:555-556),细菌中(Ladner,美国专利5,223,409),孢子中(Ladner,美国专利5,223,409),质粒中(Cull等,1992,Proc.Natl.Acad.Sci.USA,89:1865-1869),或噬菌体上(Scott和Smith,1990,Science,249:386-390;Devlin,1990,Science,249:404-406;Cwirla等,1990,Proc.Natl.Acad.Sci.USA,87:6378-6382;Felici,1991,J.Mol.Biol.,222:301-310;和Ladner同上)。
在一些实施方式中,试验是基于细胞的试验,其中表达肺炎链球菌多肽或其生物活性部分的细胞(例如细菌细胞)与测试化合物相接触,例如通过监测细胞结合,测定测试化合物调节肺炎链球菌活性的能力。例如,细胞可以是哺乳动物来源,例如鼠、大鼠、或人来源。细胞可以是上皮细胞,例如A549肺上皮细胞。
例如,通过在化合物(例如底物)上偶联放射性同位素或酶标记,使得可通过检测复合物中的标记化合物(例如底物)来确定化合物(例如底物)与肺炎链球菌菌毛II岛(INV104B)菌毛或菌毛蛋白的结合,从而评价测试化合物调节肺炎链球菌菌毛多肽与配体或底物,例如细胞或蛋白质如纤维蛋白原、纤连蛋白或胶原结合的能力。或者,可将肺炎链球菌多肽偶联于放射性同位素或酶标记,以监测测试化合物调节肺炎链球菌多肽与底物结合形成复合物的能力。例如,用125I、35S、14C或3H直接或间接标记化合物(如,肺炎链球菌多肽结合伴侣),并通过放射发射直接计数或通过闪烁计数检测放射性同位素。或者,可用辣根过氧化物酶、碱性磷酸酶或荧光素酶酶法标记化合物,并通过测定合适底物向产物的转化检测酶标记。
可评价化合物与具有或不具有任何相互反应物标记的肺炎链球菌多肽发生相互作用的能力。例如,可利用显微生理机能检测仪来检测化合物与肺炎链球菌多肽的相互作用,化合物或肺炎链球菌多肽均未标记(McConnell等,1992,Science257:1906-1912)。在本文中,“显微生理机能检测仪”(如,)是利用光可寻址的电位测定传感器(LAPS)测量细胞酸化其环境的速率的分析仪器。这种酸化速率的改变可用作化合物与肺炎链球菌多肽相互作用的指标。
在一些实施方式中,提供一种无细胞试验,其中肺炎链球菌多肽或其生物活性部分与测试化合物相接触,并评价测试化合物与肺炎链球菌多肽或其生物活性部分结合的能力。通常,新型试验中使用的肺炎链球菌多肽的生物活性部分包括参与与其他肺炎链球菌多肽分子相互作用的片段。
无细胞试验包括在足以使两种组分相互作用和结合的条件和时间下制备靶基因蛋白和测试化合物的反应混合物,从而形成可以去除和/或检测的复合物。
可采用荧光能量转移(FET)来检测两种分子间的相互作用(参见例如,Lakowicz等,美国专利5,631,169和Stavrianopoulos等,美国专利4,868,103)。选择第一“供体”分子上的荧光团标记,使其发射的荧光能量被第二“接受体”分子上的荧光标记吸收,所述第二“接受体”分子由于吸收能量而发出荧光。或者,“接受体”蛋白分子可简单地利用色氨酸残基的天然荧光能量。选择发射不同波长光的标记,使得“接受体”分子标记区别于“供体”分子标记。由于标记间的能量转移效率与分子间隔距离有关,所以可评价分子间的空间关系。在分子间发生结合的情况下,试验中“接受体”分子标记的荧光发射量最大。采用本领域公知的标准荧光检测技术(例如,采用荧光计)可方便地测定FET结合事件。
在一些实施方式中,采用实时生物分子相互作用试验(BIA)确定肺炎链球菌多肽结合靶分子(例如纤维蛋白原、纤连蛋白、或胶原多肽或其片段)的能力(如,Sjolander等,1991,Anal.Chem.,63:2338-2345和Szabo等,1995,Curr.Opin.Struct.Biol.,5:699-705)。“表面等离振子共振”或“BIA”实时检测生物特异性相互作用,无需标记任何相互作用物(例如,BIAcore)。结合表面的质量变化(表明发生结合)导致该表面附近光折射率的改变(表面等离振子共振(SPR)的光学现象),产生可用于指示生物分子之间实时反应的可检测信号。
在一些实施方式中,将靶基因产物或测试物质锚定到固相上。锚定到固相上的靶基因产物/测试化合物复合物可以在反应结束时检测。靶基因产物可锚定到固体表面上,而不发生锚定的测试化合物可用本文所述可检测标记进行直接或间接标记。
多种靶基因产物可采用蛋白微阵列技术锚定到固相上,这种技术也称为蛋白芯片技术或固相蛋白阵列技术。蛋白微阵列技术是本领域普通技术人员所熟知的,可包括但不限于,获得在固定基板上包含已鉴定肽或蛋白质的阵列,使靶分子或生物组分与肽结合,并评价这种结合。例如参见,G.MacBeath和S.L.Schreiber,将蛋白质印刷成微阵列以进行高通量功能测定(“Printing Proteins asMicroarrays for High-Throughput Function Determination”),Science289(5485):1760-1763,2000。微阵列基板包括但不限于玻璃、二氧化硅、铝硅酸盐、硼硅酸盐、金属氧化物如氧化铝和氧化镍、各种粘土、硝基纤维素或尼龙。可以用化合物包被微阵列基板以促进基板上探针(例如肽)的合成。基板上的偶联剂或基团可用于将第一氨基酸共价连接到基板。本领域技术人员已知许多偶联剂或基团。可以在基板上预定的栅格中直接合成肽探针。或者,可以在基板上点标肽探针,在这种情况下可用化合物包被基板以提高探针与基板的结合。在这些实施方式中,将预先合成的探针以精确的预定体积和栅格图案施加到基板上,优选利用计算机控制的机器设备以接触-印刷方式或以非接触方式(例如喷墨或压电递送)将探针施加到基板上。探针可共价连接于基板。在一些实施方式中,可将一种或多种对照肽或蛋白分子连接到基板。对照肽或蛋白分子能够确定诸如肽或蛋白质质量和结合特性、反应试剂质量和有效性、杂交成功率以及分析阈值和成功率等方面的因素。
在一些实施方式中,希望固定肺炎链球菌多肽、抗-菌毛抗体或菌毛蛋白抗体,或肺炎链球菌多肽结合蛋白以促进上述一种或两种蛋白质复合形式与非复合形式的分离,并且适应分析自动化。测试化合物与肺炎链球菌多肽的结合,或者在有或没有候选化合物的情况下肺炎链球菌多肽与靶分子的相互作用可以在适合包含这些反应试剂的任何容器内完成。这些容器的例子包括:微量滴定板、试管和微量离心管。在一个实施方式中,可提供增加使得上述一种或两种蛋白与基质结合的结构域的融合蛋白。例如,谷胱甘肽-S-转移酶/菌毛II岛(INV104B)菌毛蛋白融合蛋白或谷胱甘肽-S-转移酶/靶标融合蛋白可吸附到谷胱甘肽琼脂糖(Sepharose)TM珠上(密苏里州圣路易斯的西格玛化学品公司(Sigma Chemical,St.Louis,MO)或谷胱甘肽衍生的微量滴定板上,然后混合测试化合物,或者混合测试化合物与未吸附的靶蛋白或肺炎链球菌菌毛II岛(INV104B)菌毛或菌毛蛋白,在诱导复合物形成的条件下孵育该混合物(例如,在盐和pH的生理学条件下)。孵育后,洗涤珠或微量滴定板以去除未结合的组分,对于珠的情况,例如如上所述直接或间接测定基质固定的复合物。或者,复合物可与基质解离,并用标准技术测定肺炎链球菌多肽结合或活性水平。
将肺炎链球菌多肽或结合靶标固定到基质上的其他技术包括利用生物素和亲和素的偶联。可利用本领域已知的技术由生物素-NHS(N-羟基-琥珀酰亚胺)制备生物素化的肺炎链球菌多肽或靶分子(例如,伊利诺斯州罗克福德的皮尔斯化学品公司(Pierce Chemicals,Rockford,IL)的生物素化试剂盒),并固定到链霉亲和素包被的96孔板的各孔中(皮尔斯化学品公司(Piece Chemical))。
为进行试验,将非固定的组分加到含有锚定组分的包被表面上。反应完全后,在任何形成的复合物保持固定在固相表面的条件下去除未反应的组分(例如通过洗涤)。锚定在固相表面上的复合物的检测可采用多种方式进行。如果先前所述非固定的组分是预先标记的,则检测到固定在固相表面上的标记表明形成复合物。如果先前所述非固定的组分未经预先标记,可利用间接标记来检测锚定在表面上的复合物;例如利用标记的固定组分的特异性抗体(该抗体又可用标记的抗Ig抗体直接标记或间接标记)。
在一些实施方式中,试验采用能够特异性结合肺炎链球菌多肽或结合靶标,但不会干扰肺炎链球菌多肽与其靶标的结合的抗体。可将这些抗体引入培养板各孔中,未结合的靶标或肺炎链球菌多肽通过抗体偶联作用而被俘获在各孔中。检测这些复合物的方法除了上述GST固定复合物方法之外,还包括利用与肺炎链球菌多肽或靶分子反应的抗体免疫检测复合物,以及依赖于检测与肺炎链球菌多肽或靶分子相关的酶活性的酶联免疫分析。
在一些实施方式中,在液相中进行无细胞试验。在这种试验中,通过许多标准技术将反应产物与未反应的组分分离,这些技术包括但不限于:差速离心(例如,Rivas等,1993,Trends Biochem.Sci.,18:284-287);色谱(凝胶过滤色谱,离子交换色谱);电泳(如,Ausubel等编,1999,《新编分子生物学方案》(Current Protocols in Molecular Biology),J.Wiley:纽约);和免疫沉淀(例如,Ausubel等编,1999,《新编分子生物学方案》,J.Wiley:纽约)。这些树脂和色谱技术是本领域技术人员已知的(例如,Heegaard,1998,J.Mol.Recognit.,11:141-148和Hage等,1997,J.Chromatogr.B.Biomed.Sci.Appl.,699:499-525)。而且,如本文所述,也可方便地采用荧光能量转移技术来检测结合,而无需从溶液进一步纯化复合物。
在一些实施方式中,该试验包括使肺炎链球菌菌毛多肽或其生物活性部分与结合肺炎链球菌菌毛多肽的已知细胞或化合物(如,蛋白质)相接触,以形成试验混合物,使该试验混合物与测试化合物相接触,并测定测试化合物影响肺炎链球菌菌毛多肽与该细胞或化合物结合的能力。
测定与表达肺炎链球菌菌毛II岛(INV104B)菌毛的细菌细胞结合的试验通常包括将表达肺炎链球菌菌毛II岛(INV104B)菌毛的细菌细胞与A549肺上皮细胞一起孵育,洗涤以去除未贴附的细菌细胞,并检测贴附的细菌细胞。细菌贴附可通过本领域的任何方式进行测定,例如检测抗体与贴附的细菌细胞的结合,或者裂解上皮细胞并对相结合的细菌细胞进行计数。表达肺炎链球菌菌毛II岛(INV104B)菌毛的细菌细胞的结合试验中也可使用HEP2细胞、CHO细胞或HeLa细胞。
免疫原性组合物
本文所述的免疫原性组合物除肺炎链球菌多肽外还可包含一种或多种抗原性物质。示例性的抗原包括下面列出的那些。此外,本文所述组合物可用于治疗或预防任何下述微生物导致的感染。免疫原性组合物中使用的抗原包括但不限于一种或多种以下抗原,或者由一种或多种以下抗原衍生的抗原:
细菌抗原
脑膜炎奈瑟菌(N.meningitides):来自脑膜炎萘瑟菌血清型A、C、W135、Y和/或B的蛋白质抗原(Bruyn,G.A.W.和van Furth,R.(1991)Eur.J.Clin.Microbiol.Infect.Dis.10,897-910;Ryan,M.W.和Antonelli,P.J.(2000)Laryngoscope 110,961-964;Cutts,F.T.,Zaman,S.M.,Enwere,G.,Jaffar,S.,Levine,O.S.,Okoko,C.Oluwalana,A.,Vaughan,S.,Obaro,A.,Leach,A.等,(2005)Lancet365,1139-1146;Swiatlo,E.,Champlin,F.R.,Holman,S.C.,Wilson,W.W.&Watt,J.M.(2002)Infect.Immun.70,412-415;Sandgren,A.,Albiger,B.,Orihuela,C.,Tuomanen,E.,Normark,S.和Henriques-Normark,B.(2005)J.Infect.Dis.192,791-800;Henriques Normark,B.,Christensson,B.,Sandgren,A.,Noreen,B.,Sylvan,S.,Burman,L.G.和Olsson-Liljequist,B.(2003)Microb.Drug Resist.9,337-344;Nunes,S.,R.,J.,Alves,C.R.,Mato,R.,A.B.,Saldanha,J.,Almeida,J.S.,Sanches,I.S.和de Lencastre,H.(2005)J.Clin.Microbiol.43,1285-1293);来自脑膜炎萘瑟菌血清型B的外膜囊泡(OMV)制剂(Henrichsen,J.(1995)J.Clin.Microbiol.33,2759-2762;Lau,G.W.,Haataja,S.,Lonetto,M.,Kensit,S.E.,Marra,A.,Bryant,A.P.,McDevitt,D.,Morrison,D.A.和Holden,D.W.(2001)Mol.Microbiol.40,555-571;Rosenow,C.,Ryan,P.,Weiser,J.N.,Johnson,S.,Fontan,P.,Ortqvist,A.和Masure,H.R.(1997)Mol.Microbiol.25,819-829;Tuomanen,E.(1999)Current Opin.Biol.2,35-39);来自脑膜炎萘瑟菌血清型A、B、CW135和/或Y的糖抗原,包括LPS,如来自血清型C的寡糖(参见PCT/US99/09346;PCT IB98/01665;和PCT IB99/00103);
肺炎链球菌:来自肺炎链球菌的糖或蛋白质抗原,尤其是糖,或者PhtD(BVH-11-2,SP 1003,spr0907)(Adamou等,Infect.Immun.,69:949-53,2001;Hamel等,Infect.Immun.,72:2659-70,2004);PhtE(BVH-3,SP 1004,spr0908)(Adamou等,Infect.Immun.,69:949-53,2001;Hamel等,Infect.Immun.,72:2659-70,2004);PhtB(PhpA,BVH-11,SP1174,spr1060)(Adamou等,Infect.Immun.,69:949-53,2001;Zhang等,Infect.Immun.,69:3827-36,2001;Hamel等,Infect.Immun.,72:2659-70,2004);PhtA(BVH-11-3,SP1175,spr1061)(Adamou等,Infect.Immun.,69:949-53,2001;Wizemann等,Infect.Immun.,69:1593-98,2001;Zhang等,Infect.Immun.,69:3827-36,2001;Hamel等,Infect.Immun.,72:2659-70,2004);NanA(SP1693,spr1536)(Tong等,Infect.Immun.,73:7775-78,2005);SP1872(spr1687)(Brown等,Infect.Immun.,69:6702-06,2001);PspC(CbpA,SP2190,spr1995)(Ogunniyi等,Infect.Immun.,69:5997-6003,2001);PspA(SP0177,spr0121,spr1274)(Briles等,疫苗,19:S87-S95,2001);SP0498(spr0440);LytB(SP0965,spr0867)(Wizemann等,Infect.Immun.,69:1593-98,2001);AliB(SP1527,spr1382);PpmA(SP0981,spr0884)(Overweg等,Infect.Immun.,68:4180-4188,2000);LytC(SP1573,spr1431)(Wizemann等,Infect.Immun.,69:1593-98,2001);PsaA(Briles等,Vaccine,19:S87-S95,2001);PdB(Ogunniyi等,Infect.Immun.,69:5997-6003,2001);RPhp(Zhang等,Infect.Immun.,69:3827-36,2001);PiuA(Jomaa等,Vaccine,24:5133-39,2006);PiaA(Jomaa等,Vaccine,24:5133-39,2006);6PGD(Daniely等,Clin.Exp.Immunol.,144:254-263,2006);或PppA(Green等,Infect.Immun.,73:981-89,2005)的蛋白质或抗原性肽;
无乳链球菌(Streptococcus agalactiae):如B型链球菌抗原;
酿脓链球菌(Streptococcus pyogenes):如A型链球菌抗原;
粪肠球菌(Enterococcus faecalis)或屎肠球菌(Enterococcus faecium):如美国专利6,756,361中提供的三糖重复单元或其他肠球菌衍生抗原;
幽门螺旋杆菌(Helicobacter pylori):包括Cag、Vac、Nap、HopX、HopY和/或脲酶抗原;
百日咳博德特菌(Bordetella pertussis):如百日咳博德特菌的百日咳全毒素(PT)和丝状血凝素(FHA),任选也可以与百日咳杆菌黏附素(pertactin)和/或凝集原2和3抗原联用;
金黄色葡萄球菌(Staphylococcus aureus):包括金黄色葡萄球菌5型和8型荚膜多糖,任选地偶联有无毒性重组绿脓假单胞菌(Pseudomonas aeruginosa)外毒素A如StaphVAXTM、或源自表面蛋白的抗原,侵袭素(杀白细胞素、激酶、透明质酸酶)、抑制吞噬细胞吞噬的表面因子(荚膜、蛋白质A)、类胡萝卜素、产生过氧化氢酶、蛋白质A、凝固酶、凝血因子、和/或任选地脱毒的裂解真核细胞膜的膜破坏性毒素(溶血素、白细胞毒素、杀白细胞素);
表皮葡萄球菌(Staphylococcus epidermis):尤其是表皮葡萄球菌粘菌相关抗原(SAA);
腐生性葡萄球菌(Staphylococcus saprophyticus):(导致尿路感染)尤其是腐生性葡萄球菌抗原的160kDa血凝素;
绿脓假单胞菌(Pseudomonas aeruginosa):尤其是内毒素A,Wzz蛋白,绿脓假单胞菌LPS,更具体是从PAO1(O5血清型)分离的LPS,和/或外膜蛋白,包括外膜蛋白F(OprF)(InfectImmun.2001年5月;69(5):3510-3515);
炭疽杆菌(Bacillus anthracis)(炭疽):例如来自A-组分(致死因子(LF)和水肿因子(EF))的炭疽杆菌抗原(任选地脱毒),它们具有相同的称为保护性抗原(PA)的B-组分;
粘膜炎莫拉菌(Moraxella catarrhalis):(呼吸道)包括外膜蛋白抗原(HMW-OMP),C-抗原,和/或LPS;
鼠疫耶尔森菌(Yersiniapestis)(鼠疫):如F 1荚膜抗原(Infect Immun.2003年1月;71(1)):374-383,LPS(Infect Immun.1999年10月;67(10):5395),鼠疫耶尔森菌V抗原(Infect Immun.1997年11月;65(11):4476-4482);
小肠结肠炎耶尔森菌(Yersinia enterocolitica)(胃肠病原体):尤其是LPS(InfectImmun.2002年8月;70(8):4414);
假结核耶尔森菌(Yersiniapseudotuberculosis):胃肠病原体抗原;
结核分枝杆菌(Mycobacterium tuberculosis):例如任选地制备成阳离子脂质囊泡的脂蛋白、LPS、BCG抗原、抗原85B(Ag85B)的融合蛋白和/或ESAT-6(Infect Immun.2004年10月;72(10):6148),结核分枝杆菌(Mtb)异柠檬酸脱氢酶相关抗原(Proc.NatlAcadSci U.S.A.2004年8月24日;101(34):12652),和/或MPT51抗原(InfectImmun.2004年7月;72(7):3829);
嗜肺军团菌(Legionella pneumophila)(军团病):嗜肺军团菌抗原-任选地源自asd基因被破坏的细胞系(Infect Immun.1998年5月;66(5):1898);
立克次氏体(Rickettsia):包括外膜蛋白,包括外膜蛋白A和/或B(OmpB)(Biochim Biophys Acta.2004年11月1;1702(2):145),LPS和表面蛋白抗原(SPA)(J Autoimmun.1989年6月;2增补:81);
大肠杆菌(E.coli):包括来自肠毒性大肠杆菌(ETEC)、肠聚集性大肠杆菌(EAggEC)、弥散粘附性大肠杆菌(DAEC)、肠道致病性大肠杆菌(EPEC)和/或肠出血性大肠杆菌(EHEC)的抗原;
霍乱弧菌(Vibrio cholerae):包括蛋白酶抗原、LPS,尤其是霍乱弧菌II的脂多糖、O1Inaba O-特异性多糖、霍乱弧菌O139、IEM108疫苗的抗原(Infect Immun.2003年10月;71(10):5498-504),和/或闭锁小带毒素(Zot);
伤寒沙门菌(Salmonella typhi)(伤寒):包括荚膜多糖,优选偶联物(Vi,即vax-TyVi);
鼠伤寒沙门菌(Salmonellatyphimurium)(胃肠炎):考虑将由其衍生的抗原用于微生物和癌症治疗,包括抑制血管新生和调节llk;
单核细胞增多性李斯特菌(Listeria monocytogenes)(免疫缺陷或老年患者中发生的全身感染,胎儿感染):源自单核细胞增多性李斯特菌昀抗原优选用作本文所述偶联物/相关组合物胞质内递送的载体;
牙龈卟啉单胞菌(Porphyromonas gingivalis):例如牙龈卟啉单胞菌外膜蛋白(OMP);
破伤风(Tetanus):如破伤风类毒素(TT)抗原,例如用作与本文所述组合物偶联的载体蛋白;
白喉(Diphtheria):如白喉类毒素或白喉类毒素突变体,例如CRM197,考虑能够调节、抑制ADP核糖基化或与其相关的其他抗原与本文所述组合物联用/共同给药/偶联,白喉类毒素可用作载体蛋白;
伯氏疏螺旋体(Borrelia burgdorferi)(莱姆病):如P39和P13的相关抗原(一种整体膜蛋白,Infect Immun.2001年5月;69(5):3323-3334),VlsE抗原变异蛋白(J.Clin Microbiol.1999年12月;37(12):3997);
乙型流感嗜血杆菌(Haemophilus influenzae B):例如由其衍生的糖抗原;
克雷伯杆菌(Klebsiella):如OMP,包括OMP A,或任选地与破伤风类毒素偶联的多糖;
淋病奈瑟菌(Neiserria gonorrhoeae):包括Por(或孔蛋白)蛋白,如PorB(参见Zhu等,Vaccine(2004)22:660-669),转移结合蛋白,如TbpA和TbpB(参见Price等,Infection andImmunity(2004)71(1):277-283),不透明性蛋白(如Opa),还原可修饰蛋白(Rmp)和外膜囊泡(OMV)制剂(参见Plante等,JInfectious Disease(2000)182:848-855),也可参见例如,WO99/24578,WO99/36544,WO99/57280,WO02/079243);
肺炎衣原体(Chlamydiapneumoniae):尤其是肺炎衣原体蛋白质抗原;
沙眼衣原体(Chlamydia trachomatis):包括源自血清型A、B、Ba和C的抗原(沙眼病原体,失明原因之一),源自血清型L1、L2和L3(与性病淋巴肉芽肿相关)的抗原和源自血清型D-K的抗原;
苍白密螺旋体(Treponema pallidum)(梅毒):尤其是TmpA抗原;和
杜氏嗜血杆菌(Haemophilus ducreyi)(引起软下疳):包括外膜蛋白(DsrA)。
虽然未具体提及,但本文所述的其它细菌抗原可以是上述任何一种细菌的荚膜抗原、多糖抗原或蛋白质抗原。其它细菌抗原也可包括外膜囊泡(OMV)制剂。此外,抗原包括活、减毒、裂解(对于有包膜的病毒而言)和/或纯化的任何上述细菌。本文所述细菌或微生物来源的抗原可以是革兰氏阴性菌或革兰氏阳性菌,以及需氧或厌氧菌。
此外,任何上述细菌来源的糖(多糖、LPS、LOS或寡糖)可偶联于另一种试剂或抗原,例如载体蛋白(例如CRM197)。这种偶联可以是糖上羰基部分与蛋白质氨基基团还原胺化实现的直接偶联,如美国专利5,360,897和Can J Biochem CellBiol.1984年5月;62(5):270-5所述。或者,糖可以通过连接基团偶联,例如采用《生物偶联技术》(Bioconjugate Techniques),1996和CRC,《蛋白质偶联与交联化学》(Chemistry of Protein Conjugation and Cross-Linking),1993中提供的琥珀酰胺或其他连接键进行偶联。
病毒抗原
流感病毒:包括全病毒颗粒(减毒)、裂解部分或含有血凝素(HA)和/或神经氨酸酶(NA)表面蛋白的亚单位,流感抗原可源自鸡胚或基于细胞培养繁殖,和/或流感抗原可源自甲、乙和/或丙型流感及其他流感;
呼吸道合胞病毒(RSV):包括RSV A2毒株的F蛋白(J Gen Virol.2004年11月;85(Pt 11):3229)和/或G糖蛋白;
副流感病毒(PIV):包括1、2和3型PIV,优选含有血凝素、神经氨酸酶和/或融合糖蛋白;
脊髓灰质炎病毒:包括源自小核糖核酸病毒科(picornaviridae)的抗原,优选脊髓灰质炎病毒抗原如OPV,或优选IPV;
麻疹病毒:包括任选地与Protollin组合的裂解麻疹病毒(MV)抗原,和/或MMR疫苗中包含的抗原;
腮腺炎病毒:包括MMR疫苗中包含的抗原;
风疹病毒:包括MMR疫苗中包含的抗原以及来自披盖病毒科(Togaviridae)的其他抗原,包括登革热病毒;
狂犬病病毒:例如冻干的灭活病毒(RabAvertTM);
黄病毒科(Flaviridae)病毒:例如黄热病病毒、日本脑炎病毒、登革热病毒(1、2、3或4型)、蜱媒脑炎病毒和西尼罗病毒(以及由其衍生的抗原);
杯状病毒科(Calici viridae):由其衍生的抗原;
HIV:包括HIV-1或HIV-2毒株抗原,例如gag(p24gag和p55gag)、env(gp160和gp41)、pol、tat、nef、rev vpu、微小蛋白(miniprotein)(优选p55gag和gp140v缺失)和来自分离物HIVIIIb、HIVSF2、HIVLAV、HIVLAI、HIVMN、HIV-1CM235、HIV-1US4、HIV-2的抗原;猴免疫缺陷病毒(SIV)及其他;
轮状病毒:包括VP4、VP5、VP6、VP7、VP8蛋白(Protein Expr Purif.2004年12月;38(2):205)和/或NSP4;
瘟病毒:例如来自经典的猪热病毒、牛病毒性腹泻病毒和/或边界病病毒的抗原。
细小病毒:例如细小病毒B19;
冠状病毒:包括SARS病毒抗原,尤其是突起蛋白或由其衍生的蛋白酶,以及WO 04/92360中包括的抗原;
甲型肝炎病毒:例如灭活病毒;
乙型肝炎病毒:例如表面和/或核心抗原(sAg),以及前表面序列前-S1和前-S2(过去称为前-S),以及上述物质的组合,例如sAg/前-S1、sAg/前-S2、sAg/前-S1/前-S2和前-S1/前-S2(例如参见AHBV疫苗:人用疫苗与疫苗接种(AHBV Vaccines-Human Vaccines and Vaccination),第159-176页;和美国专利4,722,840、5,098,704、5,324,513;Beames等,J.Virol.(1995)69:6833-6838,Birnbaum等,J.Virol.(1990)64:3319-3330;和Zhou等,J.Virol.(1991)65:5457-5464);
丙型肝炎病毒:例如E1、E2、E1/E2(参见Houghton等,Hepatology(1991)14:381)、NS345多聚蛋白、NS 345-核心多聚蛋白、核心和/或来自非结构区的肽(国际公开WO 89/04669、WO 90/11089和WO 90/14436);
丁型肝炎病毒(HDV):由其衍生的抗原,尤其是源自HDV的δ-抗原(参见例如,美国专利5,378,814);
戊型肝炎病霉(HRV):由其衍生的抗原;
庚型肝炎病毒(HGV):由其衍生的抗原;
水痘-带状疱疹病毒:由水痘-带状疱疹病毒衍生的抗原(VZV)(J.Gen.Virol.(1986)67:1759);
EB病毒:由EBV衍生的抗原(Baer等,Nature(1984)310:207);
巨细胞病毒:CMV抗原,包括gB和gH(巨细胞病毒(J.K.McDougall编,Springer-Verlag 1990)第125-169页);
单纯疱疹病毒:包括由HSV-1或HSV-2毒株衍生的抗原,以及糖蛋白gB、gD和gH(McGeoch等,J.Gen.Virol.(1988)69:1531和美国专利5,171,568);
人疱疹病毒:由其他人疱疹病毒如HHV6和HHV7衍生的抗原;和
HPV:包括与人乳头状瘤病毒(HPV)相关或由其衍生的抗原,例如E1-E7、L1、L2以及它们的融合形式中的一种或多种,具体说本文所述组合物可包含含有L1主要衣壳蛋白的病毒样颗粒(VLP),更具体说HPV抗原是针对HPV血清型6、11、16和/或18中的一种或多种具有保护性的。
还提供了在《病毒》(Vaccines),第4版(Plotkin和Orenstein编,2004);《医学微生物学》(Medical Microbiology)第4版(Murray等编,2002);《病毒学》(Virology),第3版(W.K.Joklik编,1988);《基础病毒学》(Fundamental Virology),第2版(B.N.Fields和D.M.Knipe,编,1991)中包含的抗原、组合物、方法和微生物,考虑将它们与本文所述组合物联用。
此外,抗原包括活、减毒、裂解和/或纯化的任何上述病毒。
真菌抗原
与疫苗联合使用的真菌抗原包括:美国专利4,229,434和4,368,191中所述的抗原,用于预防和治疗须癣毛癣菌(Trichophyton mentagrophyte)导致的毛癣菌病(trichopytosis);美国专利5,277,904和5,284,652中所述的用于预防动物皮肤癣菌感染的广谱皮肤癣菌疫苗的抗原,所述动物例如是豚鼠、猫、兔、马和羊,这些抗原包含有效量的杀伤性马毛癣菌(T.equinum)、须癣毛癣菌(T.mentagrophytes)(颗粒变种)、犬小孢子菌(M.canis)和/或石膏样小孢子菌(M.gypseum)的悬浮液,任选地含有佐剂;美国专利5,453,273和6,132,733中所述的用于癣菌病疫苗的抗原,其在载体中包含有效量的经匀浆化处理的甲醛杀伤的真菌,即犬小孢子菌培养物;以及美国专利5,948,413中所述的抗原,涉及腐皮病(pythiosis)的胞外和胞内蛋白质。抗真菌疫苗中鉴定的其他抗原包括Ringvac bovisLTF-130和Bioveta。
并且,这里所用的真菌抗原可衍生自皮肤癣菌,包括:絮状表皮霉菌(Epidermophyton floccusum)、奥杜安氏小孢子菌(Microsporum audouini)、犬小孢子菌(Microsporum canis)、扭曲小孢子菌(Microsporum distortum)、马小孢子菌(Microsporum equinum)、石膏样小孢子菌(Microsporum gypsum)、矮小小孢子菌(Microsporum nanum)、同心性毛癣菌(Trichophyton concentricum)、马毛癣菌(Trichophyton equinum)、鸡毛癣菌(Trichophyton gallinae)、石膏样毛癣菌(Trichophyton gypseum)、蒙氏毛癣菌(Trichophyton megnini)、须癣毛癣菌(Trichophyton mentagrophytes)、发癣毛癣菌(Trichophyton quinckeanum)、红色毛癣菌(Trichophyton rubrum)、许兰毛癣菌(Trichophyton schoenleini)、断发毛癣菌(Trichophyton tonsurans)、疣状毛癣菌(Trichophyton verrucosum),疣状毛癣菌(T.verrucosum)白色变种(album)、盘状变种(discoides)、赭色变种(ochraceum),紫色毛癣菌(Trichophyton violaceum)和/或蜜块状毛癣菌(Trichophyton faviforme)。
与本文所述组合物联用的用作抗原或衍生抗原的真菌病原体包括:烟曲霉(Aspergillus fumigatus)、黄曲霉(Aspergillusflavus)、黑曲霉(Aspergillusniger)、构巢曲霉(Aspergillus nidulans)、土曲霉(Aspergillus terreus)、聚多曲霉(Aspergillussydowi)、黄曲霉(Aspergillus flavatus)、灰绿曲霉(Aspergillus glaucus)、头状芽裂殖菌(Blastoschizomyces capitatus)、白假丝酵母(Candida albicans)、(Candida enolase)、热带假丝酵母(Candida tropicalis)、光滑假丝酵母(Candida glabrata)、克鲁斯假丝酵母(Candida krusei)、近平滑假丝酵母(Candida parapsilosis)、类星形假丝酵母(Candidastellatoidea)、克鲁斯假丝酵母(Candida kusei)、(Candidaparakwsei)、葡萄牙假丝酵母(Candida lusitaniae)、伪热带假丝酵母(Candidapseudotropicalis)、季也蒙假丝酵母(Candida guilliermondi)、卡氏枝孢霉(Cladosporium carrionii)、粗球孢子菌(Coccidioides immitis)、皮炎芽生菌(Blastomyces dermatidis)、新型隐球菌(Cryptococcus neoformans)、棒地霉(Geotrichum clavatum)、荚膜组织胞浆菌(Histoplasma capsulatum)、巴西副球孢子菌(Paracoccidioides brasiliensis)、卡氏肺孢子虫(Pneumocystis carinii)、诡谲腐霉(Pythiumn insidiosum)、皮屑芽胞菌(Pityrosporum ovale)、酿酒酵母(Sacharomyces cerevisae)、布拉酵母(Saccharomycesboulardii)、粟酒酵母(Saccharomyces pombe)、尖端赛多孢子菌(Scedosporiumapiosperum)、申克孢子丝菌(Sporothrix schenckii)、白吉利丝孢酵母(Trichosporonbeigelii)、马尔尼菲青霉菌(Penicillium marneffei)、马拉色菌(Malassezia spp.)、着色真菌(Fonsecaea spp.)、王氏霉菌(Wangiella spp.)、孢子丝菌(Sporothrix spp.)、蛙粪霉(Basidiobolus spp.)、耳霉(Conidiobolus spp.)、根霉(Rhizopus spp.)、毛霉(Mucor spp.)、犁头霉(Absidia spp.)、被孢霉(Mortierella spp.)、小克银汉霉(Cunninghamella spp.)和瓶霉(Saksenaea spp.)。
衍生得到抗原的其他真菌包括:支链孢属(Alternaria spp)、弯孢属(Curvulariaspp)、长蠕孢属(Helminthosporium spp)、镰孢菌属(Fusarium spp)、曲霉(Aspergillusspp)、青霉菌(Penicillium spp)、单线属(Monolinia spp)、丝核菌属(Rhizoctonia spp)、拟青霉属(Paecilomyces spp)、半知菌纲霉菌(Pithomyces spp)和分支孢子菌属(Cladosporium spp)。
本领域熟知产生真菌抗原的方法(参见美国专利6,333,164)。在一些方法中,从已基本除去或至少部分除去细胞壁的真菌细胞的不溶性组分中提取和分离溶解组分,其特征在于所述方法包括:获得活真菌细胞;获得已基本除去或至少部分除去细胞壁的真菌细胞;破碎已基本除去或至少部分除去细胞壁的真菌细胞;获得不溶性组分;以及从不溶性组分中提取和分离溶解组分。
STD抗原
在一些实施方式中,本发明组合物和方法能够对抗的微生物(细菌、病毒和/或真菌)包括导致性传播疾病(STD)的微生物和/或在其表面上展示可能成为本文所述靶点或抗原组合物的抗原的那些微生物。在一些实施方式中,组合物与源自病毒性或细菌性STD的抗原组合。源自细菌或病毒的抗原可以与本文所述组合物联合给予,以提供针对至少一种以下STD的保护作用:衣原体、生殖器疱疹、肝炎(特别是HCV)、生殖器疣、淋病、梅毒和/或软下疳(参见WO00/15255)。
在一些实施方式中,本文所述组合物与用于预防或治疗STD的一种或多种抗原共同给予。
优选源自以下STD相关病毒的抗原与本文所述组合物共同给予:肝炎(特别是HCV)、HPV、HIV或HSV(详述见上文)。
此外,优选源自以下STD相关细菌的抗原与本文所述组合物共同给予,它们是淋病奈瑟菌、肺炎衣原体、沙眼衣原体、苍白密螺旋体或杜氏嗜血杆菌(详述见上文)。
呼吸道抗原
肺炎链球菌抗原可以是呼吸道抗原,可进一步用于预防和/或治疗呼吸道病原体感染的方法所用的免疫原性组合物中,所述病原体包括病毒、细菌或真菌,例如呼吸道合胞病毒(RSV)、PIV、SARS病毒、流感病毒、炭疽杆菌,具体是通过减轻或防止感染和/或呼吸道病毒感染的一种或多种症状。包含本文所述抗原,例如源自呼吸道病毒、细菌或真菌的抗原的组合物可与本文所述组合物联用,给予处于接触特定呼吸道微生物的风险中、已接触呼吸道微生物或者受到呼吸道病毒、细菌或真菌感染的个体。本文所述组合物可以与一种或多种呼吸道病原体的抗原同时或在同一制剂中并行给予。给予组合物导致呼吸道感染的一种或多种症状的发生率和/或严重性降低。
儿科/老年病学抗原
在一些实施方式中,本文所述组合物与用于治疗儿科群体的一种或多种抗原如儿科抗原联用。在一些实施方式中,儿科群体中的对象年龄小于约3岁,或小于约2岁,或小于约1岁。在一些实施方式中,儿科抗原(与本文所述组合物联用)在至少1、2或3年内分多次给予。
在一些实施方式中,本文所述组合物与用于治疗老年病学群体的一种或多种抗原如老年病学抗原联用。在一些实施方式中,老年病学群体中的对象年龄大于约50岁,大于约55岁,大于约60岁,大于约65岁,大于约70岁,大于约75岁,大于约80岁,或大于约85岁。在一些实施方式中,老年病学抗原(与本文所述组合物联用)在至少1、2或3年内分多次给予。
其他抗原
可与本发明组合物联用的其他抗原包括医院获得性(医源性)相关抗原。
在一些实施方式中,考虑将寄生性抗原与本文所述组合物联用。寄生性抗原的例子包括那些源自导致疟疾和/或莱姆病的生物体的抗原。
在一些实施方式中,与本文所述组合物联用的抗原与蚊媒病有关或能有效对抗蚊媒病。在一些实施方式中,与本文所述组合物联用的抗原与脑炎有关或能有效对抗脑炎。在一些实施方式中,与本文所述组合物联用的抗原与神经系统感染有关或能有效对抗神经系统感染。
在一些实施方式中,与本文所述组合物联用的抗原是可通过血液或体液传播的抗原。
抗原制剂
在一些方面,提供了吸附抗原的微粒的制备方法。该方法包括:(a)通过分散含有(i)水、(ii)去污剂、(iii)有机溶剂和(iv)可生物降解聚合物的混合物提供乳液,所述可生物降解聚合物选自:聚(α-羟酸)、聚羟基丁酸、聚己内酯、聚原酸酯、聚酐和聚腈基丙烯酸酯。相对于有机溶剂,该聚合物在混合物中的浓度一般约为1%-30%,而混合物中去污剂-聚合物的重量比一般约为0.00001∶1-0.1∶1(更一般约为0.0001∶1-0.1∶1,约为0.001∶1-0.1∶1,或约为0.005∶1-0.1∶1);(b)除去乳液中的有机溶剂;和(c)使抗原吸附于微粒表面上。在一些实施方式中,相对于有机溶剂,可生物降解聚合物的浓度约为3%-10%。
在一些实施方式中,这里使用的微粒由可灭菌、无毒并且可生物降解的材料制成。这些材料包括但不限于:聚(α-羟酸)、聚羟基丁酸、聚己内酯、聚原酸酯、聚酐、PACA和聚氰基丙烯酸酯。在一些实施方式中,本文所述方法中使用的微粒衍生自聚(α-羟酸),具体说,衍生自聚(丙交酯)(″PLA″)或D,L-丙交酯和乙交酯或乙醇酸的共聚物,如聚(D,L-丙交酯-乙交酯共聚物)(″PLG″或″PLGA″),或D,L-丙交酯和己内酯的共聚物。该微粒可衍生自具有各种分子量的各种聚合物原料,在共聚物如各种丙交酯∶乙交酯比例的PLG情况下,选择丙交酯∶乙交酯比例是主要问题,部分取决于共同给予的大分子。下面更全面地讨论这些参数。
其它细菌抗原也可包括外膜囊泡(OMV)制剂。
抗原也可吸附在各种革兰氏阳性菌的肽聚糖上,形成革兰氏阳性增强子基质(GEM)颗粒,如Bosma等Appl.Env.Microbiol.,72:880-889,2006所述,其全部内容被纳入本文作为参考。该方法依赖于LysM基序(Buist等,J.Bact.,177:1554-63,1995;Bateman和Bycroft,J.Mol.Biol.,299:1113-19,2000)与酸处理细胞的细胞壁肽聚糖的非共价结合。简言之,将连接于一个或多个LysM基序(例如,非共价或共价连接(例如,以融合蛋白形式或者通过偶联连接))的多肽抗原加入酸处理的革兰氏阳性菌中。抗原肽以高亲和力结合,可用于免疫原性组合物中。这些方法中使用的酸的例子包括三氯乙酸(例如,0.1%-10%)、乙酸(例如5.6M)、HCl(例如0.01M)、乳酸(例如0.72M)和甲酸(例如0.56M)。
美国专利6,884,435中提供了其它制备方法和抗原(尤其是肿瘤抗原)。
抗原参考文献
以下参考文献包括与本文所述组合物联用的抗原,它们各自的内容具体纳入本文作为参考:
国际专利申请WO99/24578
国际专利申请WO99/36544
国际专利申请WO99/57280
国际专利申请WO00/22430
Tettelin等(2000)Science 287:1809-1815
国际专利申请WO96/29412
Pizza等(2000)Science 287:1816-1820.
PCT WO 01/52885.
Bjune等(1991)Lancet 338(8775).
Fuskasawa等(1999)Vaccine 17:2951-2958.
Rosenqist等(1998)Dev.Biol.Strand 92:323-333.
Costantino等(1992)Vaccine 10:691-698.
Costantino等(1999)Vaccine 17:1251-1263.
Watson(2000)Pediatr Infect Dis J 19:331-332.
Rubin(20000)Pediatr Clin North Am 47:269-285,v.
Jedrzejas(2001)Microbiol Mol Biol Rev 65:187-207.
要求GB0016363.4;WO 02/02606;PCT IB/01/00166的优先权的2001年7月3日提交的国际专利申请
Kalman等(1999)Nature Genetics 21:385-389.
Read等(2000)Nucleic Acids Res 28:1397-406.
Shirai等(2000)J.Infect.Dis 181(增补3):S524-S527.
国际专利申请WO99/27105.
国际专利申请WO00/27994.
国际专利申请WO00/37494.
国际专利申请WO99/28475.
Bell(2000)Pediatr Infect Dis J19:1187-1188.
Iwarson(1995)APMIS 103:321-326.
Gerlich等(1990)Vaccine 8增补:S63-68和79-80.
Hsu等(1999)Clin Liver Dis 3:901-915.
Gastofsson等(1996)N.Engl.J.Med.334-:349-355.
Rappuoli等(1991)TIBTECH 9:232-238.
《疫苗》(Vaccines)(1988)编.Plotkin和Mortimer.ISBN 0-7216-1946-0.
Del Guidice等(1998)Molecular Aspects of Medicine 19:1-70.
国际专利申请WO93/018150.
国际专利申请WO99/53310.
国际专利申请WO98/04702.
Ross等(2001)Vaccine 19:135-142.
Sutter等(2000)Pediatr Clin North Am 47:287-308.
Zimmerman和Spann(1999)Am Fan Physician 59:113-118,125-126.
Dreensen(1997)Vaccine 15增补”S2-6.
MMWR Morb Mortal Wkly rep 1998年1月16日:47(1):12,9.
McMichael(2000)Vaccine19增补1:S101-107.
Schuchat(1999)Lancet 353(9146):51-6.
GB专利申请0026333.5,0028727.6和0105640.7.
Dale(1999)Infect Disclin North Am 13:227-43,viii.
Ferretti等(2001)PNAS USA 98:4658-4663.
Kuroda等(2001)Lancet 357(9264):1225-1240;也可参见1218-1219.
Ramsay等(2001)Lancet 357(9251):195-196.
Lindberg(1999)Vaccine 17增补2:S28-36.
Buttery和Moxon(2000)J R Coil Physicians Long 34:163-168.
Ahmad和Chapnick(1999)Infect Dis Clin North Am 13:113-133,vii.
Goldblatt(1998)J.Med.Microbiol.47:663-567.
欧洲专利0477508.
美国专利5,306,492.
国际专利申请WO98/42721.
《偶联疫苗》(Conjugate Vaccines)(Cruse等编)ISBN 3805549326,具体是第10卷:48-114.
Hermanson(1996)《生物偶联技术》(Bioconjugate Techniques)ISBN:012323368和012342335X.
欧洲专利申请0372501.
欧洲专利申请0378881.
欧洲专利申请0427347.
国际专利申请WO93/17712.
国际专利申请WO98/58668.
欧洲专利申请0471177.
国际专利申请WO00/56360.
国际专利申请WO00/67161.
融合蛋白
本文所述组合物中使用的肺炎链球菌多肽可以作为独立多肽存在于组合物中,但在一些实施方式中,至少两种(即2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17或18种)抗原可以表达成一条多肽链(“杂交”或“融合”多肽)。这种融合多肽具有以下两方面的主要优点:首先,本身不稳定或者表达较差的多肽可以通过加入能够克服该问题的合适的融合伴侣得到改善,其次,商业生产得到简化,因为只需利用一次表达和纯化即可生产可作抗原应用的两种多肽。
融合多肽可包含一种或多种本文所述肺炎链球菌多肽编码的多肽序列。因此,本文所述组合物可包含一种融合肽,所述融合肽具有第一氨基酸序列和第二氨基酸序列,所述第一和第二氨基酸序列选自由肺炎链球菌菌毛II岛(INV104B)或其片段所编码的蛋白质。在一些实施方式中,所述融合多肽中的第一和第二氨基酸序列包含不同的表位。
在一些实施方式中,优选由本文所述肺炎链球菌多肽序列编码的两种、三种、四种、五种、六种、七种、八种、九种或十种抗原的氨基酸序列构成的杂交体(或融合体)。在一些实施方式中,优选由本文所述肺炎链球菌多肽序列编码的两种、三种、四种或五种抗原的氨基酸序列构成的杂交体。
不同的杂交多肽可以混合在单一制剂中。在这种组合内,本文所述肺炎链球菌多肽序列所编码的序列可以在一种以上的杂交多肽中和/或作为非杂交多肽存在。然而,在一些实施方式中,抗原以杂交形式或者非杂交形式存在,但这两种形式不同时存在。
杂交多肽可以通式NH2-A-{-X-L-}n-B-COOH表示,其中:X是本文所述肺炎链球菌多肽或其片段的氨基酸序列;L是任选的接头氨基酸序列;A是任选的N-末端氨基酸序列;B是任选的C-末端氨基酸序列;n是2、3、4、5、6、7、8、9、10、11、12、13、14或15。
如果野生型形式中-X-部分具有前导肽序列,在杂交蛋白中可以包含或者略去其前导肽。在一些实施方式中,前导肽可缺失,除非-X-部分位于杂交蛋白的N-末端,即保留X1的前导肽,但略去X2...Xn的前导肽。这相当于删除所有前导肽并使用X1的前导肽作为-A-部分。
在{-X-L-}的各个n值的情况下,接头氨基酸序列-L-可存在或不存在。例如,当n=2时,杂交体可以是NH2-X1-L1-X2-L2-COOH、NH2-X1-X2-COOH、NH2-X1-L1-X2-COOH、NH2-X1-X2-L2-COOH等。接头氨基酸序列-L-一般较短(如20个或更少的氨基酸,即19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2、1个氨基酸)。例子包括有利于克隆的短肽序列,多聚甘氨酸接头(即,包括Glyn,其中n=2、3、4、5、6、7、8、9、10或更大)。本领域技术人员显然了解其它合适的接头氨基酸序列。有用的接头是GSGGGG,Gly-Ser二肽由BamHI限制位点形成,因而有助于克隆和操作,(Gly)4四肽是常用的多聚甘氨酸接头。
在上述杂交多肽通式中,-A-是任选的N-末端氨基酸序列。它一般较短(例如,40个或更少的氨基酸,即39、38、37、36、35、34、33、32、31、30、29、28、27、26、25、24、23、22、21、20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2、1个氨基酸)。例子包括指导蛋白质运输的前导序列,或有利于克隆或纯化的短肽序列。本领域技术人员显然了解其它合适的N-末端氨基酸序列。如果X1缺少其自身的N-末端甲硫氨酸,-A-优选是提供N-末端甲硫氨酸的寡肽(例如,具有1、2、3、4、5、6、7或8个氨基酸)。
在上述杂交多肽通式中,-B-是任选的C末端氨基酸序列。它一般较短(例如,40个或更少的氨基酸,即39、38、37、36、35、34、33、32、31、30、29、28、27、26、25、24、23、22、21、20、19、18、17、16、15、14、13、12、11、10、9、8、7、6、5、4、3、2、1个氨基酸)。例子包括指导蛋白质运输的序列,有利于克隆或纯化的短肽序列,或提高蛋白质稳定性的序列。本领域技术人员显然了解其它合适的C-末端氨基酸序列。
在上述杂交多肽通式中,最优选n为2或3。
核酸
本说明书提供了编码肺炎链球菌多肽序列和/或杂交融合多肽的核酸。本说明书还提供了编码肺炎链球菌多肽抗原和/或本文所述杂交融合多肽的核酸。并且,本说明书提供了能够与这些核酸杂交,优选在“高度严谨”条件下(例如,65℃,在0.1x SSC,0.5%SDS溶液中)杂交的核酸。
本文所述多肽可通过多种方式(例如,重组表达、从细胞培养物纯化、化学合成等)并以多种形式(例如,天然、融合、非糖基化、脂化等形式)进行制备。它们优选制备成基本纯形式(即基本上不含其他宿主细胞蛋白)。
本文所述核酸可以许多方式(例如,通过化学合成,来自基因组或cDNA文库,来自生物体本身等)并以各种形式(例如,单链、双链、载体、探针等)进行制备。它们优选以合适的纯化形式制备(即基本上不含其他宿主细胞核酸)。
本文所用术语“核酸”包括DNA和RNA,以及它们的类似物,例如含有经修饰的主链(例如硫代磷酸酯等)的类似物,以及肽核酸(PNA)等等。本文所述组合物包含含有上述序列的互补序列的核酸(例如出于反义和探针目的)。
本说明书还揭示了一种多肽制备方法,该方法包括以下步骤:在诱导多肽表达的条件下培养用本文所述核酸转染的宿主细胞。
本说明书还揭示了一种多肽制备方法,该方法包括通过化学方式合成至少一部分多肽的步骤。
本说明书还揭示了一种核酸制备方法,该方法包括用基于引物的扩增方法(例如PCR)扩增核酸的步骤。
本说明书还揭示了一种核酸制备方法,该方法包括通过化学方式合成至少一部分的核酸的步骤。
纯化和重组表达
本文所述的肺炎链球菌多肽序列可从天然肺炎链球菌分离,或者它们可以(例如)在异源宿主中重组制备。例如,本文所述的肺炎链球菌菌毛II岛(INV104B)抗原可以从含有肺炎链球菌菌毛II岛(INV104B)的肺炎链球菌分离,或者它们可以(例如)在异源宿主中重组制备。
异源宿主可以是原核(例如细菌)或真核生物。优选大肠杆菌,但也可采用其他合适的宿主,包括枯草杆菌(Bacillus subtilis)、霍乱弧菌、伤寒沙门菌、鼠伤寒沙门菌、乳糖奈瑟菌(Neisseria lactamica)、灰色奈瑟菌(Neisseria cinerea)、分枝杆菌(Mycobacteria)(例如结核分枝杆菌)、格氏链球菌(S.gordonii)、乳酸乳球菌(L.lactis)、酵母等。
在需要表达的本文所述的肺炎链球菌多肽序列中加入标签蛋白,形成含有标签蛋白和所述多肽的融合蛋白,以便于多肽的重组制备。例如,在需要表达的肺炎链球菌菌毛II岛(INV104B)菌毛抗原中加入标签蛋白,形成含有标签蛋白和肺炎链球菌菌毛II岛(INV104B)菌毛抗原的融合蛋白,以便于多肽的重组制备。这种标签蛋白有利于表达的蛋白质的纯化、检测和稳定。适用于本文所述组合物的标签蛋白包括:聚精氨酸标签(Arg-标签)、FLAG-标签、Strep-标签、c-myc-标签、S-标签、钙调蛋白结合肽、纤维素结合结构域、SBP-标签,、壳多糖结合结构域、谷胱甘肽S-转移酶-标签(GST)、麦芽糖结合蛋白、转录终止抗-终止因子(NusA)、大肠杆菌硫氧还蛋白(TrxA)和蛋白质二硫键异构酶I(DsbA)。优选的标签蛋白包括GST。关于标签蛋白的应用的全面讨论可参见Terpe等,标签蛋白融合的综述:从分子和生化基本原理到商业系统(“Overview of tag protein fusions:frommolecular and biochemical fundamentals to commercial systems)”,Appl.Microbiol.Biotechnol.(2003)60:523-533。
纯化后,可任选地从表达的融合蛋白上去除标签蛋白,即通过本领域已知的特别定制的酶法处理。常用的蛋白酶包括肠激酶、烟草蚀纹病毒(TEV)、凝血酶和凝血因子Xa。
免疫原性组合物和药物
本文所述组合物优选是免疫原性组合物,更优选是疫苗组合物。在一些实施方式中,组合物的pH为6-8,优选约为7。可使用缓冲液来维持pH。该组合物可以是无菌和/或无热原的。组合物与人体等张。
本发明疫苗可以是预防性(即预防感染)或治疗性(即治疗感染)疫苗,但一般是预防性疫苗。因此,提供了在易受肺炎链球菌感染的动物中治疗性或预防性治疗肺炎链球菌感染的方法,该方法包括给予所述动物治疗或预防量的本文所述免疫原性组合物。例如,提供了在易受链球菌感染的动物中治疗性或预防性治疗肺炎链球菌感染的方法,该方法包括给予所述动物治疗或预防量的本文所述免疫原性组合物。
本文所述组合物也可用作药物。在一些实施方式中,药物优选能够引起哺乳动物的免疫应答(即它是一种免疫原性组合物),并且在一些实施方式中,它是疫苗。本文所述组合物也可以用于引起哺乳动物免疫应答的药物的制备过程中。在一些实施方式中,所述药物是疫苗。
本文所述组合物也可以在含有一个或多个组合物容器的试剂盒中使用。这些组合物可以是液体形式或是冻干形式,各个抗原也是这样。组合物合适的容器包括例如瓶、小瓶、注射器和试管。容器可由多种材料,包括玻璃和塑料制成。容器可具有无菌进入口(例如,容器可以是静脉内输液袋或者具有皮下注射针可刺穿塞子的小瓶)。这些组合物可包含含有一种或多种肺炎链球菌菌毛多肽的第一组分。优选地,肺炎链球菌菌毛多肽是寡聚或高寡聚形式。
试剂盒还可包括含有药学上可接受的缓冲剂,例如磷酸盐缓冲盐水、林格溶液或右旋糖溶液的第二容器。试剂盒也可包括适用于最终使用者的其他材料,包括其他缓冲剂、稀释剂、填充剂、针和注射器。试剂盒也可包括另一活性剂(例如抗生素)的第二或第三容器。
试剂盒也可包括包装说明书,说明书上写明诱导针对肺炎链球菌的免疫或者用于治疗肺炎链球菌感染的方法的指导说明。包装说明书可以是未经批准的说明书草图,或者可以是经食品药品管理局(FDA)或其他管理机构批准的包装说明书。
也可使用预先填充有本文所述免疫原性组合物的递送装置。
诱导哺乳动物免疫应答的方法包括给予有效量的本文所述组合物的步骤。在一些实施方式中,免疫应答是保护性的,优选涉及抗体和/或细胞介导的免疫。在一些实施方式中,免疫应答将诱导长效(例如中和)抗体以及接触一种或多种肺炎链球菌抗原后快速应答的细胞介导的免疫。在一些实施方式中,该方法引起加强的应答。
哺乳动物中中和肺炎链球菌感染的方法可包括:给予哺乳动物有效量的本文所述的免疫原性组合物,本文所述的疫苗,或者识别本文所述免疫原性组合物的抗体。
在一些实施方式中,所述哺乳动物是人。如果疫苗用作预防用途,人可以是男性或女性(不论是生育期或者青春期)。或者,所述人可以是老人(例如,超过50、55、60、65、70、75、80或85岁),可能患有潜在的疾病如糖尿病或癌症。在一些实施方式中,如果疫苗用作治疗用途,所述人可以是孕妇或老人。
在一些实施方式中,本文所述的用途和方法是用于防止和/或治疗肺炎链球菌导致的疾病。该组合物也可有效对抗其他链球菌。该组合物也可有效对抗其他革兰氏阳性菌。
检测治疗性治疗的功效的一些方法涉及在给予本发明组合物后监测肺炎链球菌感染。检测预防性治疗的功效的一种非限制性方式涉及在给予组合物之后监测针对本文所述组合物中肺炎链球菌抗原的免疫应答。
评价本文所述免疫原性组合物的组分蛋白的免疫原性的非限制性方式是重组表达该蛋白并通过免疫印迹方法筛选患者血清或粘膜分泌物。蛋白质和患者血清之间的阳性反应表明该患者已对所研究蛋白质产生免疫应答,即该蛋白是免疫原。也可利用该方法鉴定优势免疫蛋白和/或表位。
检测治疗性治疗的功效的另一种方式涉及在给予本发明组合物后监测肺炎链球菌感染。检测预防性治疗的功效的一种方式涉及在给予组合物后监测针对本文所述组合物中肺炎链球菌抗原的全身免疫应答(例如监测IgG1和IgG2a产生水平)和粘膜免疫应答(例如监测IgA产生水平)。通常,肺炎链球菌血清特异性抗体应答在免疫后、刺激前进行测定,而粘膜肺炎链球菌特异性抗体应答在免疫后、刺激后测定。
在一些实施方式中,本文所述疫苗组合物给予宿主(例如人)之前可以在体外和体内动物模型中进行评价。
本文所述免疫原性组合物的功效也可以通过用免疫原性组合物刺激肺炎链球菌感染的动物模型(例如豚鼠或小鼠)进行体内测定。免疫原性组合物可源自与刺激血清型相同或不同的血清型。在一些实施方式中,免疫原性组合物可源自与刺激血清型相同的血清型。在一些实施方式中,免疫原性组合物和/或刺激血清型可源自肺炎链球菌的血清型。
体内功效模型包括但不限于:(i)使用人肺炎链球菌血清型的鼠感染模型;(ii)鼠疾病模型,这是一种采用鼠适应性肺炎链球菌菌株,例如在小鼠中特别有毒力的菌株的鼠模型;和(iii)使用人肺炎链球菌分离物的灵长类模型。
免疫应答可以是TH1免疫应答和TH2免疫应答之一或两者。免疫应答可以是改善的、或提高的、或改变的免疫应答。免疫应答可以是全身免疫应答和粘膜免疫应答之一或两者。在一些实施方式中,免疫应答是提高的全身应答和/或粘膜应答。提高的全身免疫和/或粘膜免疫表现为提高的TH1和/或TH2免疫应答。在一些实施方式中,提高的免疫应答包括IgG1和/或IgG2a和/或IgA产生增加。在一些实施方式中,粘膜免疫应答是TH2免疫应答。在一些实施方式中,粘膜免疫应答包括IgA产生增加。
活化的TH2细胞提高抗体产生,因而在应对胞外感染中有价值。活化的TH2细胞可分泌IL-4、IL-5、IL-6和IL-10中的一种或多种。TH2免疫应答可导致产生IgG1、IgE、IgA和记忆B细胞,用于将来的保护作用。
TH2免疫应答可包括一种或多种与TH2免疫应答相关的细胞因子(例如IL-4、IL-5、IL-6和IL-10)中的一种或多种增加,或者IgG1、IgE、IgA和记忆B细胞产生增加。在一些实施方式中,提高的TH2免疫应答将包括IgG1产生的增加。
TH1免疫应答可包括CTL中的一种或多种增加,与TH1免疫应答相关的细胞因子(例如IL-2、IFNγ和TNFβ)中的一种或多种增加,活化的巨噬细胞增加,NK活性增加,或者IgG2a产生增加。在一些实施方式中,提高的TH1免疫应答将包括IgG2a产生的增加。
本文所述免疫原性组合物,特别是包含一种或多种肺炎链球菌多肽抗原的免疫原性组合物,可单独使用或与其他抗原以及任选地与能够引发Th1和/或Th2应答的免疫调节剂联用。
本文所述组合物通常直接给予患者。在一些实施方式中,对于某些组合物宜采用某些途径,以便产生更有效的免疫应答,CMI应答,或引起副作用可能性较低,或者更容易给予。可通过胃肠道外注射(如皮下、腹膜内、皮内、静脉内、肌内或给予组织间隙),或通过直肠、口服(例如片剂、喷雾剂)、阴道、局部、透皮(例如参见WO 99/27961)或经皮(例如参见WO 02/074244和WO 02/064162)、鼻内(例如参见WO03/028760)、眼部、耳内、经肺或其它粘膜给药途径进行直接递送。
可采用本文所述组合物引发全身和/或粘膜免疫。
在一些实施方式中,免疫原性组合物包含一种或多种可引发中和性抗体应答的肺炎链球菌多肽抗原以及一种或多种可引发细胞介导免疫应答的肺炎链球菌多肽抗原。这样,中和性抗体应答可防止或抑制初始肺炎链球菌感染,而能够引发提高的Th1细胞应答的细胞介导免疫应答可进一步防止肺炎链球菌广泛的扩散。免疫原性组合物可包含一种或多种肺炎链球菌多肽抗原;一种或多种肺炎链球菌菌毛或其他肺炎链球菌抗原;以及一种或多种非菌毛肺炎链球菌抗原,例如胞质抗原。在一些实施方式中,免疫原性组合物包含一种或多种肺炎链球菌表面抗原等以及一种或多种其他抗原,例如能够引发Th1细胞应答的胞质抗原。
可以通过单剂量方案或多剂量方案进行剂量治疗。多剂量可用于初免方案和/或加强免疫方案。在多剂量方案中,可通过相同或不同途径给予各种剂量,例如初免采用胃肠道外途径而加强免疫采用粘膜途径,或者初免采用粘膜途径而加强免疫采用胃肠道外途径等。
本文所述组合物可以制备成多种形式。例如,可将该组合物制备成液体溶液或悬浮液形式的注射剂。也可制备适合在注射前溶解或悬浮于液体运载体的固体形式(如冻干组合物)。该组合物可制备成局部制剂,如油膏剂、乳膏剂或粉末剂。该组合物可制备成口服给药制剂,如片剂或胶囊,喷雾剂,或糖浆剂(任选调味)。该组合物可制备成使用细粉或喷雾的肺部给药制剂,如吸入剂。该组合物可制备成栓剂或子宫托。该组合物可制成鼻内、耳内或眼内给药,例如滴剂。该组合物可以是药盒形式,设计成临给予患者之前重建合并的组合物。这种药盒可包含一种或多种液体形式的抗原以及一种或多种冻干抗原。
用作疫苗的免疫原性组合物包含免疫有效量的抗原,以及需要的任何其它组分,例如抗生素。“免疫有效量”指以一次剂量或一系列剂量的一部分,将某剂量给予个体能有效治疗或预防,或者提高可恒量的免疫应答,或者防止或减轻临床症状。此量取决于所治疗个体的健康和身体状况、年龄、所治疗个体的分类地位(如非人灵长动物、灵长动物等)、个体的免疫系统合成抗体的能力、所需的保护程度、疫苗配方、治疗医生对医学情况的评估和其它相关因素。预计该量将落入可通过常规试验测定的相对较宽的范围内。
组合物的其他组分
除上述组分外,本文所述组合物还可包含一种或多种“药学上可接受的载体”,其包括本身不诱导产生对接受该组合物的个体有害的抗体的任何载体。合适的载体一般是代谢慢的大分子,如蛋白质、多糖、聚乳酸、聚乙醇酸、聚氨基酸、氨基酸共聚物和脂质聚集体(如油滴或脂质体)。本领域技术人员熟知这类载体。疫苗也可含有稀释剂,如水、盐水、甘油等。此外,也可存在辅助剂,如湿润剂或乳化剂、pH缓冲剂等。药学上可接受的赋形剂的充分讨论参见Gennaro(2000)《雷明登:药物科学与实践》(Remington:The Science and Practice ofPharmacy),第20版,ISBN:0683306472.。
佐剂
本文所述疫苗可与其他免疫调节剂联合给予。在一些实施方式中,组合物还包含一种或多种佐剂。用于本文所述疫苗的佐剂包括但不限于以下所述一种或多种:
含矿物质的组合物
本发明中适合用作佐剂的含有矿物质的组合物包括矿物盐,例如铝盐和钙盐。本发明包括矿物盐,例如氢氧化物(如羟基氧化物)、磷酸盐(如羟基磷酸盐、正磷酸盐)、硫酸盐等(例如参见《疫苗设计》(Vaccine Design)(1995),Powell和Newman编,ISBN:030644867X.Plenum的第8和9章),或不同矿物质化合物的混合物(例如,磷酸盐和氢氧化物佐剂的混合物,任选地磷酸盐过量),这些化合物可采取任何合适的形式(如凝胶、晶体、无定形等),优选吸附于盐。含有矿物质的组合物也可配制为金属盐颗粒(WO 00/23105)。
本文所述疫苗中可包含铝盐,使Al3+剂量在每剂0.2到1.0毫克之间。
油乳剂
适合用作本发明佐剂的油乳剂组合物包括角鲨烯-水乳剂,例如MF59(利用微流化床配制成亚微米颗粒的5%角鲨烯、0.5%吐温80和0.5%司盘85)。参见WO90/14837。也可参见Podda,含新型佐剂的含佐剂流感疫苗:含MF-59佐剂疫苗的经历(“The adjuvanted influenza vaccines with novel adjuvants:experiencewith the MF59-adjuvanted vaccine”),Vaccine(2001)19:2673-2680;Frey等,在除老人以外的成人对象中含MF-59佐剂的流感疫苗与不含佐剂的流感疫苗的安全性、耐受性和免疫原性的比较(“Comparison of the safety,tolerability andimmunogenicity of a MF59-adjuvanted influenza vaccine and a non-adjuvantedinfluenza vaccine in non-elderly adults”),Vaccine(2003)21:4234-4237。MF59用作FLUADTM流感病毒三价亚基疫苗的佐剂。
在一些实施方式中,用于该组合物中的佐剂为亚微米水包油乳剂。本发明采用的亚微米水包油乳剂优选为鲨烯/水乳剂,任选地包含不同量的MTP-PE,如含有4-5%(重量/体积)鲨烯、0.25-1.0%(重量/体积)吐温80TM(聚氧乙烯脱水山梨糖醇单油酸酯)、和/或0.25-1.0%的司盘85TM(脱水山梨糖醇三油酸酯)以及任选的N-乙酰胞壁酰基-L-丙氨酰基-D-异谷氨酰氨酰基-L-丙氨酸-2-(1’-2’-二棕榈酰基-sn-甘油-3-羟基磷酰氧基)-乙胺(MTP-PE)的亚微米水包油乳剂,例如,称为“MF59”的亚微乳水包油乳剂(国际公开WO 90/14837;美国专利6,299,884和6,451,325,其内容被纳入本文作为参考;和Ott等,MF-59:人用疫苗佐剂的设计与安全性和效力的评价(“MF59--Design and Evaluation of a Safe and Potent Adjuvant forHuman Vaccines”,《疫苗设计:亚基与佐剂方法》(Vaccine Design:The Subunitand Adjuvant Approach)(Powell,M.F.和Newman,M.J.编)Plenum Press,纽约,1995,第277-296页)。MF59包含4-5%(重量/体积)鲨烯(例如4.3%)、0.25-0.5%(重量/体积)吐温80TM和0.5%(重量/体积)司盘85TM,以及任选地含有各种含量的MTP-PE,利用微流化床如110Y型微流化床(马萨诸塞州牛顿市的微流化床公司(Microfluidics,Newton,MA)配制成亚微米水包油乳剂。例如,MTP-PE的用量约为0-500微克/剂量,更优选0-250微克/剂量,最优选0-100微克/剂量。如本文所用,术语“MF59-0”表示不含MTP-PE的上述亚微米水包油乳剂,而术语MF59-MTP表示含MTP-PE的制剂。例如,“MF59-100”包含100微克MTP-PE/剂量,等等。本文所用的另一种亚微米水包油乳剂MF69包含4.3%(重量/体积)鲨烯、0.25%(重量/体积)吐温80TM和0.75%(重量/体积)司盘85TM以及任选的MTP-PE。另一种亚微米水包油乳剂是MF75,也称为SAF,包含10%鲨烯,0.4%吐温80TM,5%普流罗尼-嵌段聚合物L121和thr-MDP,也可微流化形成亚微米乳剂。MF75-MTP表示包含MTP的MF75,例如每剂量包含100-400微克MTP-PE。
国际公开WO 90/14837和美国专利6,299,884和6,451,325(纳入本文作为参考)中详细描述了用于组合物的亚微米水包油乳剂、其制备方法和免疫刺激剂如胞壁酰肽。
还可将完全弗氏佐剂(CFA)和不完全弗氏佐剂(IFA)用作本发明组合物的佐剂。
皂苷制剂
皂苷制剂也可用作本文所述组合物的佐剂。皂苷是在许多种类植物的树皮、叶、茎干、根甚至花中发现的甾醇糖苷和三萜糖苷的异质群体。已广泛研究了作为佐剂的来自皂树(Quillaia saponaria)Molina树皮的皂苷。皂苷也可获自丽花菝葜(Smilax ornata)(墨西哥菝葜)、满天星(Gypsophilla paniculata)(婚纱花)和肥皂草(Saponaria officianalis)(皂根)。皂苷佐剂制剂包括纯化制剂如QS21以及脂质制剂如ISCOM。
已采用高效薄层色谱(HP-TLC)和反相高效液相色谱(RP-HPLC)对皂苷组合物进行纯化。已鉴定了用这些技术纯化的特定组分,包括QS7、QS17、QS18、QS21、QH-A、QH-B和QH-C。制备QS21的方法参见美国专利5,057,540。皂苷制剂也可包含甾醇,如胆固醇(参见WO96/33739)。
皂苷和胆固醇的组合可用于形成称为免疫刺激复合物(ISCOM)的独特颗粒。ISCOM通常也含有磷脂如磷脂酰乙醇胺或磷脂酰胆碱。ISCOM中可采用任何已知的皂苷。ISCOM优选包含QuilA、QHA和QHC中的一种或多种。ISCOM可进一步参见EP0109942、WO 96/11711和WO 96/33739。任选地,ISCOM可不含其它去污剂。参见WO 00/07621。
开发基于皂苷的佐剂的综述可参见Barr等,ISCOM和其他基于皂苷的佐剂(“ISCOMs and other saponin based adjuvants”),Advanced Drug Delivery Reviews(1998)32:247-271。也可参见Sjolander等,口服递送的皂苷和ISCOM疫苗的摄取和佐剂活性(“Uptake and adjuvant activity of orally delivered saponin and ISCOMvaccines”),Advanced Drug Delivery Reviews(1998)32:321-338。
病毒体和病毒样颗粒(VLP)
病毒体和病毒样颗粒(VLP)也可用作本发明组合物的佐剂。这些结构通常包含一种或多种任选地与磷脂组合或一起配制的病毒蛋白。它们通常无病原性、不能复制,通常不含任何天然病毒基因组。可用重组方法产生或从全病毒分离得到这种病毒蛋白。这些适用于病毒体或VLP中的病毒蛋白包括来源于流感病毒(例如HA或NA)、乙肝病毒(例如核心蛋白或包膜蛋白)、戊肝病毒、麻疹病毒、辛德比斯病毒、轮状病毒、口蹄疫病毒、逆转录病毒、诺瓦克病毒、人乳头状瘤病毒、HIV、RNA-噬菌体、Qβ-噬菌体(例如外壳蛋白)、GA-噬菌体、fr-噬菌体、AP205噬菌体和Ty(例如反转录转座子Ty蛋白p1)的蛋白。VLP在WO 03/024480、WO 03/024481和Niikura等,“嵌合重组戊型肝炎病毒样颗粒作为呈递外来表位的口服疫苗载体”(“Chimeric Recombinant Hepatitis E Virus-Like Particles as anOral Vaccine Vehicle Presenting Foreign Epitopes”),Virology(2002)293:273-280;Lenz等,“乳头状瘤病毒样颗粒诱导树突状细胞的急性活化”(“Papillomarivurs-Like Particles Induce Acute Activation of Dendritic Cells”),Journalof Immunology(2001)5246-5355;Pinto等,“用重组HPV-16L1病毒样颗粒免疫的健康志愿者对乳头状瘤病毒(HPV)-16L1的细胞免疫应答”(“Cellular ImmuneResponses to Human Papillomavirus(HPV)-16L 1Healthy Volunteers Immunizedwith Recombinant HPV-16L 1Virus-Like Particles”),Journal of Infectious Diseases(2003)188:327-338;和Gerber等,“与大肠杆菌热不稳定肠毒素突变体R192G或CpG共同给予时人乳头状瘤病毒病毒样颗粒是有效的口服免疫原”(“HumanPapillomavirus Virus-Like Particles Are Efficient Oral Immunogens whenCoadministered with Escherichia coli Heat-Labile Enterotoxin Mutant R192G orCpG”),Journal of Virology(2001)75(10):4752-4760中进一步讨论。病毒体在例如Gluck等,“未来疫苗开发的新技术平台”(“New Technology Platforms in theDevelopment of Vaccines for the Future”),Vaccine(2002)20:B10-B16中进一步讨论。免疫增强的重建流感病毒体(IRIV)可用作鼻内三价INFLEXALTM产品{Mischler和Metcalfe(2002)Vaccine 20增补5:B17-23}和INFLUVAC PLUSTM产品的亚单位抗原递送系统。
细菌或微生物衍生物
适用于本发明组合物的佐剂包括细菌或微生物衍生物,例如:
(a)肠细菌脂多糖(LPS)的无毒性衍生物
这些衍生物包括单磷酰脂质A(MPL)和3-O-脱酰基MPL(3dMPL)。3dMPL是具有4、5或6酰化链的3脱-O-酰基单磷酰脂质A的混合物。3脱-O-酰基单磷酰脂质A的优选“小颗粒”形式见EP 0 689 454中所述。3dMPL的这种“小颗粒”小到足以在过滤除菌时通过0.22μm膜(参见EP 0689454)。其它无毒LPS衍生物包括单磷酰脂质A模拟物,例如氨基烷基氨基葡萄糖苷磷酸盐衍生物如RC-529。参见Johnson等(1999)Bioorg Med Chem Lett 9:2273-2278。
(b)脂质A衍生物
脂质A衍生物包括大肠杆菌(Escherichia coli)的脂质A衍生物,如OM-174。OM-174参见例如Meraldi等,“OM-174,一种具有人用价值的新型佐剂,与由柏格鼠疟原虫的环孢子蛋白合成的C-末端片段一起给予可诱导保护性应答”(“OM-174,a New Adjuvant with a Potential for Human Use,Induces a ProtectiveResponse with Administered with the Synthetic C-Terminal Fragment 242-310 fromthe circumsporozoite protein of Plasmodium berghei”),Vaccine(2003)21:2485-2491;和Pajak等,“佐剂OM-174可诱导体内鼠树突状细胞的迁移和成熟”(“TheAdiuvant OM-174 induces both the migration and maturation of murine dendritic cellsin vivo”),Vaccine(2003)21:836-842。
(c)免疫刺激性寡核苷酸
适合用作本发明组合物的佐剂的免疫刺激性寡核苷酸包括含CpG基序的核苷酸序列(含有通过磷酸酯键连接的非甲基化胞嘧啶和鸟嘌呤核苷的序列)。含有回文或聚(dG)序列的细菌双链RNA或寡核苷酸也显示出免疫刺激性。
CpG可包含核苷酸修饰/类似物,如硫代磷酸酯修饰,可以是双链或单链。任选地,鸟苷可被类似物如2’-脱氧-7-脱氮鸟苷取代。可能的类似替代物的例子参见Kandimalla等,“各种合成的核苷酸基序识别模式:具有独特细胞因子诱导特性的强效免疫调节性寡脱氧核苷酸试剂的设计与开发”(“Divergent syntheticnucleotide motif recognition pattern:design and development of potentimmunomodulatory oligodeoxyribonucleotide agents with distinct cytokine inductionprofiles”),Nucleic Acids Research(2003)31(9):2393-2400;WO02/26757和WO99/62923。Krieg,“CpG基序:细菌提取物中的活性成分?”(“CpG motifs:theactive ingredient in bacterial extracts?”),Nature Medicine(2003)9(7):831-835;McCluskie等,“小鼠中用乙肝表面抗原和CpG DNA的胃肠外和粘膜初免-加强免疫方案”(“Parenteral and mucosal prime-boost immunization strategies in mice withhepatitis B surface antigen and CpG DNA”),FEMS Immunology and MedicalMicrobiology(2002)32:179-185;WO98/40100;美国专利6,207,646;美国专利6,239,116和美国专利6,429,199.中进一步讨论了CpG寡核苷酸的佐剂作用。
CpG序列可能导向TLR9,例如基序GTCGTT(SEQ ID NO:22)或TTCGTT(SEQ ID NO:23)。参见Kandimalla等,“Toll样受体9:新型合成CpG DNA调节识别和细胞因子诱导”(“Toll-like receptor 9:modulation of recognition and cytokineinduction by novel synthetic CpG DNAs”),Biochemical Society Transactions(2003)31(第3部分):654-658。CpG序列可特异性诱导Th1免疫应答,例如CpG-A ODN,或更特异地诱导B细胞应答,例如CpG-B ODN。CpG-A和CpG-B ODN在Blackwell等,“CpG-A-诱导的单核细胞IFN-γ-可诱导的蛋白-10产生受类浆细胞树突状细胞衍生的IFN-α的调控”(“CpG-A-Induced MonocyteIFN-gamma-Inducible Protein-10Production is Regulated by Plasmacytoid DendriticCell Derived IFN-alpha”),J.Immunol.(2003)170(8):4061-4068;Krieg,“CpG的A到Z”(“From A to Z on CpG”),TRENDS in Immunology(2002)23(2):64-65和WO01/95935中讨论。CpG优选是CpG-A ODN。
在一些实施方式中,构建CpG寡核苷酸时使其5’端可为受体所识别。任选将两个CpG寡核苷酸序列的3’端相连接形成“免疫聚体”。例如参见Kandimalla等,“CpG寡核苷酸的二级结构影响免疫刺激活性”(“Secondary structures in CpGoligonucleotides affect immunostimulatory activity”),BBRC(2003)306:948-953;Kandimalla等,“Toll-样受体9:新型合成的GpG DNA调节识别和细胞因子诱导”(“Toll-like receptor 9:modulation of recognition and cytokine induction by novelsynthetic GpG DNAs”),Biochemical Society Transactions(2003)31(第3部分):664-658;Bhagat等,“CpG五-和六脱氧核糖核苷酸作为强效免疫调节剂”(“CpG penta-and hexadeoxyribonucleotides as potent immunomodulatory agents”)BBRC(2003)300:853-861和WO 03/035836。
(d)ADP-核糖基化毒素及其脱毒的衍生物
细菌ADP-核糖基化毒素及其脱毒衍生物可用作本发明组合物的佐剂。在一些实施方式中,该蛋白获自大肠杆菌(大肠杆菌不耐热肠毒素“LT”)、霍乱(“CT”)菌或百日咳(“PT”)菌。WO95/17211中描述了将脱毒的ADP-核糖基化毒素用作粘膜佐剂,WO98/42375中描述了将其用作胃肠道外佐剂。在一些实施方式中,佐剂是脱毒的LT突变体如LT-K63、LT-R72和LT-192G。以下参考文献中描述了将ADP-核糖基化毒素及其脱毒衍生物,尤其是LT-K63和LT-R72用作佐剂,这些文献各自的内容被具体纳入本文作为参考:Beignon等,“大肠杆菌不耐热肠毒素的LTR72突变体增强肽抗原引发CD4+T细胞和在辅助施加到裸露皮肤上之后分泌γ-干扰素的能力”(“The LTR72 Mutant of Heat-Labile Enterotoxin ofEscherichia coli Enhances the Ability of Peptide Antigens to Elicit CD4+T Cells andSecrete Gamma Interferon after Coapplication onto Bare Skin”),Infection andImmunity(2002)70(6):3012-3019;Pizza等,“粘膜疫苗:LT和CT的无毒衍生物作为粘膜佐剂”(“Mucosal vaccines:non toxic derivatives of LT and CT as mucosaladjuvants)”,Vaccine(2001)19:2534-2541;Pizza等,“LTK63和LTR72,两种适合临床试验的粘膜佐剂”(“LTK63 and LTR72,two mucosal adjuvants ready forclinical trials”),Int.J.Med.Microbiol(2000)290(4-5):455-461;Scharton-Kersten等,“用细菌ADP-核糖基化外毒素、亚单位和无关佐剂进行经皮免疫”(“Transcutaneous Immunization with Bacterial ADP-Ribosylating Exotoxins,Subunitsand Unrelated Adjuvants”),Infection and Immunity(2000)68(9):5306-5313;Ryan等,“大肠杆菌不耐热肠毒素的突变体有效用作鼻腔递送无细胞百日咳疫苗的粘膜佐剂:无毒AB复合物和酶活性对Th1和Th2细胞的差别作用”(“Mutants ofEscherichia coli Heat-Labile Toxin Act as Effective Mucosal Adjuvants for NasalDelivery of an Acellular Pertussis Vaccine:Differential Effects of the Nontoxic ABComplex and Enzyme Activity on Th 1 and Th2Cells”),Infection and Immunity(1999)67(12):6270-6280;Partidos等,“大肠杆菌的不耐热肠毒素及其位点导向突变体LTK63提高对鼻内共同免疫合成肽的增殖性和细胞毒性T细胞应答”(“Heat-labileenterotoxin of Escherichia coli and its site-directed mutant LTK63enhance theproliferative and cytotoxic T-cell responses to intranasally co-immunized syntheticpeptides”),Immunol.Lett.(1999)67(3):209-216;Peppoloni等,“大肠杆菌不耐热肠毒素的突变体用作鼻内递送疫苗的安全强效佐剂”(“Mutants of the Escherichiacoli heat-labile enterotoxin as safe and strong adjuvants for intranasal delivery ofvaccines”),Vaccines(2003)2(2):285-293;和Pine等,(2002)“用流感疫苗和大肠杆菌不耐热肠毒素的脱毒突变体进行鼻内免疫”(“Intranasal immunization withinfluenza vaccine and a detoxified mutant of heat labile enterotoxin from Escherichiacoli(LTK63)”,J.Control Release(2002)85(1-3):263-270。优选根据Domenighini等,Mol.Microbiol(1995)15(6):1165-1167中提出的ADP-核糖基化毒素的A和B亚单位的排列对比对氨基酸取代基编号,该参考文献的全部内容特别纳入本文作为参考。
生物粘附剂和粘膜粘附剂
生物粘附剂和粘膜粘附剂也可用作本发明的佐剂。合适的生物粘附剂包括酯化透明质酸微球(Singh等(2001)J.Cont.Rele.70:267-276),或粘膜粘附剂如聚(丙烯酸)、聚乙烯醇、聚乙烯吡咯烷酮、多糖和羧甲基纤维素的交联衍生物。壳聚糖及其衍生物也可用作本发明的佐剂。例如参见WO 99/27960。
微粒
微粒也可用作本文所述组合物的佐剂。微粒(即粒径约100nm-150μm,更优选约200nm-30μm,最优选约500nm-10μm)由生物可降解的无毒材料(例如聚(α-羟酸)、聚羟基丁酸、聚原酸酯、聚酐、聚己内酯等),优选聚(丙交酯-乙交酯共聚物)形成,并任选经处理而具有带负电荷表面(例如用SDS处理)或带正电荷表面(例如用阳离子去污剂如CTAB处理)。
脂质体
适合用作佐剂的脂质体制剂的例子在美国专利6,090,406、美国专利5,916,588和EP 0 626 169中描述。
聚氧乙烯醚和聚氧乙烯酯制剂
适合用于本文所述组合物的佐剂包括聚氧乙烯醚和聚氧乙烯酯。参见WO99/52549。这种制剂还包括聚氧乙烯去水山梨糖醇酯表面活性剂和辛苯聚醇(WO01/21207)以及聚氧乙烯烷基醚或酯表面活性剂和至少一种其它非离子型表面活性剂如辛苯聚醇(WO 01/21152)的混合物。
在一些实施方式中,聚氧乙烯醚选自下组:聚氧乙烯-9-月桂醚(月桂醇聚醚9)、聚氧乙烯-9-硬脂酰基(steoryl)醚、聚氧乙烯-8-硬脂酰基醚、聚氧乙烯-4-月桂醚、聚氧乙烯-35-月桂醚和聚氧乙烯-23-月桂醚。
聚磷腈(PCPP)
PCPP制剂例如在Andrianov等,“通过聚磷腈水溶液的凝聚来制备水凝胶微球”(“Preparation of hydrogel microspheres by coacervation of aqueouspolyphophazene solutions”),Biomaterials(1998)19(1-3):109-115和Payne等,“从聚磷腈基质释放蛋白”(“Protein Release from Polyphosphazene Matrices”),Adv.Drug.Delivery Review(1998)31(3):185-196中描述。
胞壁酰肽
适合用作本发明佐剂的胞壁酰肽的例子包括N-乙酰基-胞壁酰-L-苏氨酰-D-异谷酰胺(thr-MDP)、N-乙酰基-正胞壁酰-L-丙氨酰-D-异谷酰胺(正-MDP)和N-乙酰胞壁酰-L-丙氨酰-D-异谷氨酰胺酰-L-丙氨酸-2-(1’-2’-二棕榈酰-sn-甘油-3-羟基磷酰氧基)-乙胺MTP-PE)。
咪唑并喹诺酮化合物。
适合用作本文所述组合物的佐剂的咪唑并喹诺酮化合物的例子包括:咪喹莫特(Imiquamod)及其类似物,在Stanley,“咪喹莫特和咪唑并喹诺酮:作用机制与治疗效果”(“Imiquimod and the imidazoquinolones:mechanism of action andtherapeutic potential”),Clin Exp Dermatol(2002)27(7):571-577和Jones,“瑞喹莫德3M”(“Resiquimod 3M”),Curr Opin Investig Drugs(2003)4(2):214-218中进一步描述。
本文所述组合物也可包含上述佐剂的组合。例如,以下佐剂组合物可用于本文所述的组合物:
(1)皂苷和水包油乳剂(WO 99/11241);
(2)皂苷(例如QS21)+无毒性LPS衍生物(例如3dMPL)(参见WO94/00153);
(3)皂苷(例如QS21)+无毒性LPS衍生物(例如3dMPL)+胆固醇;
(4)皂苷(例如QS21)+3dMPL+IL-12(任选地+甾醇)(WO 98/57659);
(5)3dMPL与例如QS21和/或水包油乳剂的组合(参见欧洲专利申请0835318、0735898和0761231);
(6)SAF,含10%角鲨烷、0.4%吐温80、5%普流罗尼嵌段聚合物L121和thr-MDP,微流化形成亚微米乳剂或涡旋形成较大粒径的乳剂;
(7)RibiTM佐剂体系(RAS)(Ribi Immunochem),含有2%鲨烯、0.2%吐温80和一种或多种细菌细胞壁组分,所述组分选自单磷酰脂质A(MPL)、海藻糖二霉菌酸酯(TDM)或细胞壁骨架(CWS),优选MPL+CWS(DetoxTM);
(8)一种或多种无机盐(例如铝盐)+无毒性LPS衍生物(例如3dMPL)。
(9)一种或多种无机盐(例如铝盐)+免疫调节性寡核苷酸(例如包含CpG基序的核苷酸序列)。组合(9)是一种优选的佐剂组合。
人免疫调节剂
适合用作本文所述组合物的佐剂的人免疫调节剂包括:细胞因子,如白介素(如IL-1、IL-2、IL-4、IL-5、IL-6、IL-7、IL-12等)、干扰素(如干扰素-γ)、巨噬细胞集落刺激因子和肿瘤坏死因子。
铝盐和MF59是适合可注射流感疫苗的优选佐剂。细菌毒素和生物粘附剂是适合粘膜递送疫苗,例如鼻腔疫苗的优选佐剂。
本文所述的免疫原性组合物可与抗生素治疗方案联合给予。在一些实施方式中,在给予本文所述的抗原或者包含一种或多种本文所述抗原的组合物之前给予抗生素。
在一些实施方式中,在给予一种或多种本文所述的抗原或者包含一种或多种本文所述抗原的组合物之后给予抗生素。适用于治疗链球菌感染的抗生素的例子包括但不限于:青霉素或其衍生物或克林霉素等。
本文所述的专利、专利申请、登录号和出版物的内容各自被纳入本文作为参考。
根据上述内容,本领域技术人员可以作出除本文所述内容之外的本发明的各种改进形式。这些改进形式也落在所附实施方式的范围内。下面的实施例将进一步阐述本发明,这些实施例是为了说明的目的,而不是为了限制本发明的范围。
实施例
材料和方法
肺炎链球菌染色体DNA提取
在200mL液体培养基(THYE培养基)中培养表1所列肺炎链球菌,直到OD600达到0.25-0.5。然后,在600RPM将样品离心15-20分钟(此时将一些团块保存于-20℃)。然后在pH 8.0用2.7毫升50mM EDTA(最终体积3毫升)重悬这些团块,并转移至15毫升FalconTM管(马萨诸塞州贝德福德的BD生物科技公司(BD Biosciences;Bedford,MA))中。将0.55mL新鲜制备的溶菌酶(12mg/mL西格玛(Sigma)L-6876,在50mM EDTA中pH 8.0(密苏里州圣路易斯的西格玛-奥德里奇公司(Sigma-Aldrich Co.;St.Louis,MO))和50μL的5000U/mL变溶菌素(西格玛M-9901)的水溶液相继加入该Falcon管中。这些样品在37℃孵育1小时。孵育后,加入3.6毫升细胞核裂解液(基因组DNA纯化试剂盒(威斯康星州麦德森的普罗麦格公司(Promega Corp.;Madison,WI)),翻转六次混合样品。然后将样品在80℃孵育5分钟,再冷却至室温。然后加入18μLRNA酶溶液(基因组DNA纯化试剂盒)并翻转10次混合样品。然后,这些样品在37℃孵育30分钟。
将每种样品分入6个1.2毫升的Eppendorf管。在每个Eppendorf管中加入0.2毫升蛋白沉淀液(基因组DNA纯化试剂盒)并翻转10次进行混合。翻转10次后立即将样品在室温下在Eppendorf离心机中全速离心30分钟。然后将Eppendorf管的上清液汇集到15毫升Falcon管(通常回收约8毫升)中。然后,加入0.6体积的异丙醇,翻转10次混合样品。
将在火焰中加热密封并弯曲的巴斯德吸管插入Falcon管并钩住团块来回收沉淀的DNA。然后用70%乙醇洗涤沉淀的DNA团块。最后,将沉淀的DNA团块溶解到3毫升TE(10mM Tris HCl,1mM EDTA,pH 8.0)中。
PCR试验
PCR的详细内容是本领域技术人员所熟知的。(参见Mark A.Valasek,M.A.,和Repa,J.J.(2005)Advan.Physiol.Educ.29,151-159;Ausubel,F.M.,Brent,R.,Kingston,R.E.,Moore,D.D.,Seidman,J.G.,Smith,J.A.和Struhl,K,《新编分子生物学方案》(Current Protocols in Molecular Biology),Hoboken,NJ:Wiley,2005.)。本文所用具体参数如下(扩增的最终体积为50μL):
基因组DNA的PCR扩增
试剂
5μL缓冲液(10x浓缩液)
0.5μL BSA(0.5mg/ml)
1μL dNTP(10mM浓缩液)
1.5μL寡核苷酸@10pmol/μL浓度(15pmol)
1.5μL寡核苷酸@10pmol/μL浓度(15pmol)
0.25μL Taq聚合酶(瑞士巴塞尔的豪夫-迈罗氏有限公司(F.Hoffmann-LaRoche Ltd;Basel,Switzerland)。
50ng基因组DNA
加水至50μL
循环参数:
94℃保持3分钟(一个循环)
94℃保持30秒,52℃保持30秒,72℃保持1分钟20秒(6个循环)
94℃保持30秒,58℃保持30秒,72℃保持1分钟20秒(30个循环)
72℃保持8分钟(一个循环)
细菌(克隆)DNA的PCR扩增
试剂
5μL缓冲液(10x浓缩液)
0.5μL BSA(0.5mg/ml)
1μLdNTP(10mM浓缩液)
1.5μL寡核苷酸@10pmol/μL浓度(15pmol)
1.5μL寡核苷酸@10pmol/μL浓度(15pmol)
0.25μL Taq聚合酶(瑞士巴塞尔的豪夫-迈罗氏有限公司(F.Hoffmann-LaRoche Ltd;Basel,Switzerland)。
40.25μL水
细菌
循环参数
94℃保持8分钟(一个循环)
94℃保持30秒,52℃保持30秒,72℃保持1分钟20秒(6个循环)
94℃保持30秒,58℃保持30秒,72℃保持1分钟20秒(30个循环)
72℃保持8分钟(一个循环)
DNA探针制备与杂交
首先用单独的容器平行制备DNA样品(对于步骤1和2)进行Cy3和Cy5标记。
(1)引物预先退火至DNA
将1μg基因组DNA、2.6μL随机九聚物(英国白金汉郡的GE保健公司(GEHealthcare;Buckinghamshire,UK)(前身是安法玛西亚制药公司(AmershamPharmacia))和1.5μL掺入物(对照DNA)组合,加水至最终体积29.5μL。将该混合物加热至70℃保持5分钟,然后冷却至室温保持5分钟。最后,短暂离心该混合物。
(2)反应
将4mL NEB2缓冲液、2mL核苷酸混合物(2mM dATP,dGTP,dTTP和1mMdCTP)(各自获自英国白金汉郡的GE保健公司)和2mL dCTP Cy3(或Cy5)(英国白金汉郡的GE保健公司(GE Healthcare;Buckinghamshire,UK)(前身是安玛西亚制药公司(Amersham Pharmacia))加入步骤(1)的混合物中。轻柔混合该混合物并加入2.5mL DNA聚合酶I,大(克列诺)片段(马萨诸塞州伊维池的新英格兰生物试验室(New England Biolabs,Inc.;Ipswich,MA))。进一步轻柔混合该组合物并在37℃孵育2.5小时。最后,短暂离心该混合物。
然后合并两种标记反应液,使该溶液通过Qiaquick PCR净化离心柱(加利福尼亚州巴伦西亚的凯杰公司(Qiagen,Inc.;Valencia,CA)以去除未掺合的核苷酸。用两次30μL的EB缓冲液从柱洗脱样品。通过高速真空装置(speed vacuum)将体积降至7.5μL。
(3)预杂交
铝载玻片在5X SSC、50%甲酰胺、0.2%SDS溶液中,42℃孵育2小时。然后在一个培养皿中将载玻片浸入水中三次,在另一培养皿中再浸渍两次。用氮气吹干这样制备的载玻片。
(4)杂交
将步骤2的DNA样品在95℃加热变性2分钟。然后快速离心试管。制备包含1体积缓冲液II和2体积100%甲酰胺(去离子化)的杂交混合液,将22.5μL该混合液加入各个样品中。然后混合样品,短暂离心并在42℃孵育1小时。最后,在步骤3制备的载玻片上加载样品并在42℃孵育过夜。
(5)杂交后洗涤
将步骤4制备的载玻片浸没在55℃的1X SSC/0.2%SDS溶液中(去除盖玻片之后)。然后取出载玻片并置于含相同溶液的第二托盘的支架内。载玻片在轨道式振荡器上在第二托盘中室温孵育5分钟。然后,将支架和载玻片转移到55℃的含有0.1X SSC/0.2%SDS溶液的第三托盘内。将支架浸入第三托盘的溶液中5分钟,然后在室温下孵育10分钟。然后将支架转移至含有与第三托盘相同溶液的第四托盘中,载玻片在第四托盘中同样浸渍5分钟并在室温下孵育5分钟。然后将支架和载玻片转移至55℃的含有0.1X SSC溶液的第五托盘,支架和载玻片在第五托盘中浸渍5分钟。然后将支架转移至含有与第五托盘相同溶液的第六托盘中,载玻片在第六托盘中同样浸渍5分钟。最后,支架和载玻片转移至含水的第七托盘,支架和载玻片在第七托盘中快速浸渍两次,然后用氮气干燥支架。
肺炎链球菌菌株基因组DNA的比较基因组杂交的微阵列数据分析方案
载玻片获取
用微阵列扫描仪ScanArray(康涅狄格州舍而顿的帕金埃尔默生命分析科学公司(PerkinElmer Life and Analytical Sciences;Shelton,CT))以10μm的分辨率获取两个杂交载玻片的彩色图像。调节获取参数(光电倍增管增益和激光器功率)以维持两个荧光团通道间相当的荧光水平(测定特定对照点的强度水平以及所有点的平均强度)。
图像分析
用软件Genepix 5.x(加利福尼亚州日照谷的分子设备公司(Molecular DevicesCorp.;Sunnyvale,CA))分析图像。采用无益品质点的自动和手动专业标记程序(curated flagging procedure)。根据获取的图像,Genepix软件提取微阵列载玻片上每个特征点的强度水平并计算默认统计参数。
数据处理与质量检查
通过本地安装BASE(生物阵列软件公司(BioArray Software Environment),rel1.2.10,卢德大学(Lund University))进行数据处理。数据分析包括两个阶段:
(a)背景校正和标准化:在该步骤中,对每个点的强度减去局部背景强度。然后用所有点的平均或中位强度标准化该强度。根据以下参数,在BASE中采用背景校正和标准化程序((插件rel.1.4):
实验标准化: 实验中位值
实验背景: 局部背景
校正阈值(bkg标准差):1
新值(bkg标准差): 1
系统校正(CH1=CH2): 有
(b)数据合并-合并单个载玻片的重复点或重复载玻片的强度值,计算分离的两个颜色通道的平均强度。滤除标记为低品质水平或背景校正的点。该算法也评价具有两个颜色强度平均值之间不同强度水平的可能性(采用T检验,插件rel.1.7)。所得数据集合储存在BASE应用中,可输出用于其他分析。
聚类分析
将两个颜色强度水平的比率转化为log2数值。
计算的比率在数据集合表中列出,该表的列表示每个载玻片或合并的载玻片组(实验),行表示每个点或组合并的点(基因)。将该表上传到软件TMeV(rel,3.1,TIGR)中进行聚类分析。
采用三种主要的聚类算法。
(a)分级聚类-该算法适合在不同实验中发现具有类似存在/缺失图案的基因类群。具体说,该算法采用以下参数设置:
树状选择: 基因树和/或样品树
距离米制选择:欧几里德距离,皮尔森非中心
连锁方法选择:平均连锁聚类
构建相似性树(基于算术平均算法的未加权对群方法,UPGMA)和相似性矩阵(基于距离米制选择)来代表实验之间和基因之间的图形和数值距离。
(b)巴氏模板匹配(Pavlidis template matching)-该算法适合发现存在/缺失图案类似于模板图案的基因。巴氏模板匹配适合在不同实验中发现图案与参比基因相关或不相关的基因。该算法适合在所得匹配和非匹配的组中采用阈值参数绝对值R(选择与模板相关或不相关的图案),阈值p值0.05,皮尔森默认距离米制和各元素分级群聚。
(c)微阵列重要性分析-该算法适合发现具有能够将实验分成单独的组的存在/缺失图案的重要基因(定义如上文所述)。该算法适合发现给定实验组的特异性基因。该算法采用以下参数:
统计学检验: 不成对的两组
排列数目 100
S0: Tusher等的方法
Q值 无
归咎引擎(Imputation K-最邻近归咎
engine):
所得匹配和非匹配组中各元素的分级聚类
实施例1:检测基因组DNA中是否存在菌毛II岛(INV104B)
在纯化的基因组DNA或直接在细菌上(集落PCR)测定是否存在表1所示肺炎链球菌菌毛II岛(INV104B)。表2列出了用于扩增的四种寡核苷酸。
表2 PCR寡核苷酸
标识 | 方向 | 序列 |
1008正向 | 5’→3’ | GCTGGATCGAGTTTGAAACCAGAA (SEQ IDNO:24) |
1009反向 | 3’→5’ | TAAGGATCACCAAAGTCCAAGGCA (SEQ IDNO:25) |
Int-反向 | 3’→5’ | TTTCAGTGTATGTTTTAGTGCTTCA (SEQ IDNO:26) |
Int-正向 | 5’→3’ | ATGGCTTCAGGGGCTATGTTCGGTG (SEQ IDNO:27) |
采用以下PCR寡核苷酸组合,对表1所列各个菌株进行三次诊断PCR:1008正向-1009反向;1008正向-int反向;和int正向-1009反向。1008正向-int反向和int正向-1009反向成功扩增菌毛II岛(INV104B)的阳性克隆。1008正向-1009反向成功扩增菌毛II岛(INV104B)的阴性菌株。
在表1所示肺炎链球菌中检测是否存在菌毛岛,即rlrA岛(相对于菌毛II岛(INV104B))。类似于菌毛II岛(INV104B),设计特异性引物,用于扩增rlrA岛的指定区域。rlrA岛检测方法的例子参见2007年2月16日提交的题为“细菌抗原的纯化”(“Purification of Bacterial Antigen”)的美国申请第11/707,433号;和2006年2月17日提交的题为“细菌抗原的纯化”的美国临时申请第60/774,450号,这些参考文献的内容被纳入本文作为参考。
表1指明许多肺炎链球菌菌株含有菌毛和菌毛II岛(INV104B)中的一个或二者。
实施例2:23F、INV200和OXC141的序列
1.序列下载与组装
从Sanger网站下载四种肺炎链球菌菌株的初始序列(参见万维网“sanger.ac.uk/Projects/Microbes/”)。Sanger序列由不同数量的非重叠毗连群组成。下载序列的详细情况在表3中列出:
表3.下载序列
菌株 | 血清型 | ST | 毗连群数量 | bp总量 |
23F | 23F | 81 | 21 | 2225211 |
INV104B | 1 | 227 | 68 | 1986609 |
NV200 | 14 | 9 | 167 | 2022487 |
菌株 | 血清型 | ST | 毗连群数量 | bp总量 |
OXC141 | 3 | 180 | 120 | 1962139 |
为鉴定毗连群的可能顺序,采用MUMmer3.19将该序列与TIGR4完整序列进行比对,以形成单个假分子。为分离两个随后的毗连群,插入以下序列:
NNNNNCATTCCATTCATTAATTAATTAATGAATGAATGNNNNN (SEQID NO:28)
SEQ ID NO:28设计成(i)在所有六个读码框内产生终止密码子,从而预测所有基因均未跨过接合处,和(ii)在所有读码框中提供起始位点,指向毗连群以预测在其末端的不完整基因。将没有对齐TIGR4基因组的毗连群设置在假染色体的末端。
2.基因预测
用glimmer3.02套件预测基因。用TIGR4预测基因作为训练集来训练隐马尔可夫模型(Hidden Markov Model)。
3.蛋白质集合比较
采用Fasta将每个基因组的所有蛋白质与其他基因组的所有蛋白质进行比较。如果其氨基酸序列在其长度至少50%上的相同性百分数至少为90%,则认为该蛋白质是保守的。
不符合该标准的OXC141、INV200和23F的氨基酸序列如下所示。这些序列可提供适合本发明组合物或治疗方法、用于诊断肺炎链球菌感染和免疫对抗肺炎链球菌感染的多肽和/或蛋白质。
4.起始密码子
本文所示氨基酸序列的N末端残基表示对应核苷酸序列中第一个密码子编码的氨基酸。在第一个密码子不是ATG的序列中,应理解当某密码子是启动密码子时它应被翻译为甲硫氨酸,但如果该序列位于融合伴侣的C末端,则该密码子可被翻译成所示的非-Met氨基酸。所示序列特别公开和包括用N-末端甲硫氨酸残基(如甲酰基-甲硫氨酸残基)替代任何所示非-Met残基的所列各氨基酸序列。
5由OXC141鉴定的序列
>orf00007
MEMSFIAQDFDKLNIITVLESRTQAIIRNPMNTRLSSDTESSFNKIVRN (SEQID NO:29)
>orf00009
MELAETSIVKKNHQIPCIINQKIAQKLIKKTSMTDIDHQLSISTSTVIRKINDFHFEHDFSRLPEIMS(SEQID NO:30)
>orf00013
LFKIGRVYYRQLQEDLLTCCNKYPKLFFFIIISLNSTQSGGVF (SEQ ID NO:31)
>orf00015
MKYNKTKYPNIYYYETAKGKRYYVRRSFFFRGKKREKSKSGFTTLPQARAALVELEQQIQEQELGINTNLTLDQYWDIYSEKRLSTGRWNDTSYYLNDNLYNNHIKAKFGSILLKNLDRNEYELFIAEKLQNHTRYTVQTLNSSFMALLNDAVKNGNLLSNRLKGVFIGQSDIPAANKKVTLKEFKTWIAKAEEIMPKQFYALTYLTIFGLRRGEVFGLRPMDITQNDSGRAILHLRDSRSNQTLKGKGGLKTKDSERYVCLDDIGTDLIYYLIAEASKIKRKLGIIKEQHKDYITINEKGGLINPNQLNRNFNLVNEATGLHVTPHMMRHFFTTQSIIAGVPLEQLSQALGHTKVYMTDRYNQVEDELAEATTDLFLSHIR(SEQ ID NO:32)
>orf00016
MSYSYVALDVETANDFRGSVCSIGLVKFKDGNIVDTFYTLINPEEEFDDFNIFIHGITPEDVLDSPTFPEVRKSIVDFIGLDIVVAHFAQFDMGALKDVYQKYELDFDNIEYICSYRLAKVALPGQLNYKLKRLAKNLNIELDHHNALSDARASGLILEYLLSTNSFSDLTAFLKEYRYNKTGLLGQYGFKRKKGYQYKENLIYQPTEEEKAAMNPDHYFYGLYFCFTGKLERMTRKEANKAAALVGGIPEKGVTKHTNILVVGEQDWRVVGTDGLSSKMKKAQTLLEKGQDIEIMTENDFIRLLEE (SEQ ID NO:33)
>orf00017
MSLLSLHQCKVFLFYHDFISHGFKIVVGHEFEVIKLCGVIQAF (SEQ ID NO:34)
>orf00019
MKNKKPNAERLQEIADYFNVSTDYLLGRTDNPAIAGDSKEYTWQGKTLNVEEMASNVMMFGGRELTDEKKKIIQSIIEGYLKEAGD (SEQ ID NO:35)
>orf00020
MSKLTKEDVLQVSQEIINDAIPVIKDMLDEVFKEYPIDMEIRKAILNSVLVAHKLSTETTVSLLTELVNAQEN (SEQ ID NO:36)
>orf00021
MSKELKIIKAKIKTRLIELDMTQAELAKQVSVASSVISELLKYGKGSESVKEKVADVLGIENPWENS (SEQ ID NO:37)
>orf00022
MDYQILIQPAISVILAIISGLWSYIASKANNKAEIEKQAKEHSHIVEKLEKEFHYQIDTLKQQHTLELEKVKQAHELRLQELEKVSQIDTETDKAMKMNDLIYKTFTGEVDLDKALKLADKANNHKQKLNKKFIQKTSKKS (SEQ ID NO:38)
>orf00023
MNEIFNFHGQEVRTLTIDDEPWFVGKDVADILGYSKARNAITLHVDEEDALKQGIPTSGGTQDMLIINESGLYSLILSSKLPQAKEFKRWVTSEVLPAIRKQGGFIREDLDEDAFIALFTGQKKLREQQATMLEDIDYLKSEQPIHPSYAQSLLKKRKARVVACLGGIDSPAYADKVFAQSVFRQAEIDFKDHFNISRYDLLPKKHADAALAYWMTWEPSTNTKMKIMKLNSFDDV (SEQ ID NO:39)
>orf00025
MFEPPILDQLMGVGALLLGFAGACRHIKLQEQRKEEERREEQEFASMIIQGYNHAYERGREAERQEIRKNIRRPFKGFTYDNEPPQGLRPEPLALPEPKQSAIRLL (SEQ ID NO:40)
>orf00026
MEELIESLDNLIMIVKELEGRESTSRHFITIWENDYKNLLLVKEYLTDYEKLAKDYRYVTLKNKLLKIEKMELEGRHIYEDMRMKYRANRRKWGARYV (SEQ ID NO:41)
>orf00027
VLGMSEIKWIKITTDIFDDEKICLIDALPDPDAILVIWFKILTLAGKHNSNGLLMMTDKVHYTDEMLATIFRRPLNTVRMAIGVFEQFGMIEIIDGIISLPNWEKHQNVDGMEKIKEQTRNRVAKYRKKQKNLALGNVTGNVTVTDGNALEEDKDKNKNRLDKDKNKKRITTTSSGSEENILELFQSEFRRLLSGFEIEEINHLLNENDVDLVKEALKTAINSGKPNIKYIGGILRNWQMNNVTTVEQVRQSEKKNKDKKEEQEAKDEWGY (SEQ ID NO:42)
>orf00029
MRRSASMVDNVFEEIALSYRRNTEQQEEFCEKHNIPLIKILRTESVVCRMCESERIHEENQERVNELANAENERERKYYLEKFSLYDEVLKNATLDNFETPTEKEAEKLAFAKRICREWSEGARNNIVLQGEAGTGKSHLAFAMVKVLSEYTKEIAIFINVTDLLMKIKADFSQEEFLVNKIASAKFLVLDDLGMEKDSEWSFTILYNILNKRSNTIITTNLISADIQKRYGRPFMSRLMKGVDKDHLMVFNDLTNKRKQYF (SEQ ID NO:43)
>orf00030
MLELYFVYNGHCKFFLGRFDNVDDLIEQMEDHQWAFSAITHPRFQKHIGQRTTRFDYGSKDCYYLATFSGGE (SEQ ID NO:44)
>orf00031
MVGVTYQEIHLFVEFLKEQYGQGRPDYIEALNDLDGLVEVSYREAIERFLEDEVR (SEQ ID NO:45)
>orf00032
MMEELKKKVNAVYNWTVEDGKPQPPQQDLPQAVKDRVDYFWEMAEDGMTFMGAMECIFADEKPTDYELGATKGWLPKSKEFDDWIGYSPSMAQVVIAVYLIYGGN (SEQ ID NO:46)
>orf00033
MEETKMNKQELIKKLEERRTIIGNFQGYAVSYFWIYWIVEK (SEQ ID NO:47)
>orf00035
MEFVSPIKDNDDIQAMKDYLREWNEMYYMLFITGLNTGLRVGDILTLKVKDVQGWHIKLRERKTGKQITRRMTKELKKEMRRYVEGKPFHHFLFKSRQGQNKAITRERAYQIIHEAAEELGIDNVGTHTMRKTFGYKYYNKTKDVGTLQKMFNHSSPAITLRYIGIEQAELDDALRNFVI(SEQ ID NO:48)
>orf00036
MYNKPVRPSLKSKKWEKFRDRIMRKHDYLCQESLRYGISVQAEMVHHIFPVSEYPELEFVEWNCLPLTNKKHNTFHDRVNDRVINQGLYWQKKRKKEFLNFFKNEK (SEQ ID NO:49)
>orf00037
LAKPITAKSIKSKVVKQMKDLGTYRKEFEMIIDIFAGMLYQYQKLAQDYANLGYPVTDTYVNKAGAENERKVPILTAMEILRKDILSYSNQLMMNPKSLGEVVEQEGDSVLTEVLKFKNEIKKKRVSGNG (SEQ ID NO:50)
>orf00040
MGNLDKAKEYAQHVLTHREEHCEENILAAERFFRDLENPAFEMDEDMVDFVIHFIENVIVHQQGDDMFAVSIRNKPLLLQPWQHFVVVNLFGFYYKGTNERRFKEALIMLARKNGKTSFTAAIALAYQILDTDSGSKCYIVANSVKQAMEAFGFLRFNVERWNDKNIRIKDNNQEHSITANFGDEGSFFIQALANDESRLDSLNGNVIILDEAHTMRNSKKHGLMKKTMSAYRNSMLFVISTAGDIPTGFLANRLKYCQKVLKQLVTDDSFFIFICKANQSADGDVVNYLDENILKMANPSWGVTVSLKALKEEAEQAMNDPQTRNEFFNKTLNIFTNSMNAYFNPDEFIASDSCYDWSLEELARLPIRWYGGADLSRLHDLTAAALYGVYHDGEKDVDICITHAFFPRINAQKKANDDGIPLFGWQSDGWLTMSNTPTVLYDDIVKWFIKMREKGFKIAAVGMDRKFGREFLTKMKQARFKMIDQPQLFYLKSEGFRRIEFKVKNKEFYYLHSDAYEYCVSNVRAIEKVDDAVQYEKLDGDGGTARIDLFDASVFACIQALANLGKGGDVMRFFD (SEQ ID NO:51)
>orf00043
MNEIVLSEHEINLLINKGRVKVILNGEVVTIRQRHMKNLMAETVKWEKQVIDVSQNIVRNKHFDSLFQNTFR (SEQ ID NO:52)
>orf00045
MGIFEKFWKRNKPSKPINMLSHSDLGLSNLMDSYVPLARNPDVVTAVNKIADLVSNMTIHLMENTDKGDIRIRDGLARKIDINPCKHMTRKSWIFKIVRDLLLYGDGNSVLHVEYEPVTDYISNLRPFPMREVSFQTDKDSYVISFRGEEYSPDEVVHFVINPDPDILYIGTGFRVTLTDVVQSLNMATKTKKSFMNGKNIPSLIVKVDSSSAELDSEQGRERIAEKYLSTSRVGAPWIVPEALLDIQQVKPLSLTDIALNESVELDKRTVAGLLGVPAFILGVGEFNKTEYNNFVNTTVMSIATTITQTLTRDLLLSSNRYFKLNPRSLFSYNITELSAVAQQMANSAAMRRNEWRDWLGMAPDPEMEELIVLENFLPQEKLGDQNKLKGGEEENAKAK (SEQ ID NO:53)
>orf00047
MQKRNSYRATQFQTREEESGDLVLSGYFIKFDEETELWRGYHEVIKRAGVEKAVTDADIRALFNHDDSLVLGRTGNGTLTLGVDDVGLFGDIIINKDDPQAVGAYARVKRGDVIGCSFGFIPVKIETEEREDGSYLDTVLELEIFEVSPCTFPAYPQTEIAARQKDFESQKRANREALDKRKKEIKEKFKL (SEQ ID NO:54)
>orf00048
MNKALIFGARMRAKATKVVELEETIEELNKRSVVELEKLDRAKNDEEVLAVEKTVDGLQREIEEKEAEKVQLENEIDELDKQIKEQNRKAPTPGKMEERGGKTLGQREAFNHYLRTKEARADGFKSAEGEAIIPVELMTPKEAKQDKTDLTSLVNIVNVKNASGKWSVVKLTDQAMNTVEELEENPELAKPTFTKVNYEIKTRRGHLPVSQEFIDDADYDVMGLVAKQTKNQERITKNKEIAKVLKTATSKSAAGLDGLKDILNVELKTYYNATIVCTQSMFAALDKIKDKDGRYMLQTDITSPTGYKFAGRVIVVYPDDIIGESKGDLKAFIGDVGEFATLFDRAQTTVKWQDDKIYGQYLATANRFDVVKVDGDAGFYVTYTDAVL (SEQ ID NO:55)
>orf00050
LMNVEDKDMSKDDTLTTEVEKEETKVVDGKPEEGDEE (SEQ ID NO:56)
>orf00051
MDNVQLLELLKLKLGIATKLRDKPLEKIIEAVKTELEDNLGVLLDLDSSEDQMFVVDFAAFRYEGGVDMPRHLQWRLHNLQIASKKKVKNVES (SEQ ID NO:57)
>orf00052
MWNHEIKLISKKITGKDKLLQPISEDVEVTLLCRKKNVTRSEFYQANQAGLKPSLVVEIRNFEYENQEFAKFEGKQYRILKTYPIDSEILELTLTEVLK (SEQ ID NO:58)
>orf00054
MSNDLADLIAKELAAYSDEVTEEVDKIAEQVTDETVDELKETSPKRYGKYRRSWKKKKLANGSFVVFNAVASLTHILENGHLSRNGGRVAGIVHIKPAEEKAIQNFEKRIKEIGK (SEQ ID NO:59)
>orf00055
MKLSDFAAILEQANLPVTYRAFKIGNAPDLPYLVYYESSPVINSADNTVNHQIKSVTVELAFESKDEDLEERLEELWANHKLFFEVQEETFIETERLYVKSYTVYLY (SEQ ID NO:60)
>orf00057
MTQENKVTFGLENVHIAPIKTLAADGVITYGDVFRFPGAMELILDTKGETTPIKADNKNYHFMNSNEGYEGKLKIPHIIDEFATKILGEIKDPQTGVMTEKADASLTEFAIMFQFEGDKNKTRYVMYYCFASRPSLGSKTKNGTSTNERELSFKASPRPLDTVVKRSITSADDKDAYDNWFKKVYEPTAVAA (SEQ ID NO:61)
>orf00058
MRKIVLVGDQEYELGTNGYTPIAYKQQFGKDYFQDLFSMLKNQSFMNELNKLETDKELTATNIDISMLSDFDMTFFNRLFWTFAKSANPQIKPYEQFFMEMEIFPIQEVGPVLMEMLNASMTTKKHQMNQNQLAKKSSQ (SEQ ID NO:62)
>orf00060
MAGNIKGIKIEIDGDTQPLQKALKAINKESVNTTNELKQIDKALKFDTGNVILLTQKQEVLQKQIGITRDKLETLRQAQSKVDEEFKKGNIGSEQYRAFQREVEVTQNVLKGYEGKLASLTQALEGNGDAAKNNQAQLKELQNEQKLLASESEKVVSSFKLQESQMGANASEADKLALAEKKIGAQSEIVTRQIENLEKQLSLTKEQYGENSAEANKMEAELNQAKTAYANLNQELGKLGSTAKSNQTQLKELQNEQSQLASEMSKVTSSFKLQESALGSNASEAERNALAQKKIGAQSEIVSKQISNLEQQLEITKKEFGENSTQANKMESELNQAKTAFNHLNDEMKGTKSAADSTQESLSEISRNLRAELLQQFSEKLSAISEKLVEVGKEALEAAAQMQASNAQFTTVFGDMETQAREALNAIGQEMDIVPERLQGSFTQMASFAKTSGLDTAEALDLTSRATRAAADGAAFYDKSIESVTESLQSFLKGNFANDAALGISATETTRNAAANKLYGKSFKDLSEAQKQLTLLQMVEDGNKLSGALGQAARESDGLENVMGNLKQAGTNALSAIGQPLLEMMIPVFQTLATIVKGVAELFSSLPAPVKDFVVILGTVVTAVGVIAPIFLSLQALAEFLKISIGEMIIAALPIIGTAIAIAAAVAAIVAIVKYLWETNEGFRDAVTTVWNAILEVINAVVSEISNFVMSIFGTVVTWWTENQELIRTSAETVWNAIYTVISTILDILGPLLQAGWDNIQLIITTTWEIIKIVVETAINVVLGVIQAVMQIITGDWSGAWETIKGVFSTVWQAIQSIVQTIFSAIQSYISNILNGISGTVSNIWNSIKDTVSNVLNAISSTVSSVWEGIKSTISSAINGARDAVSSAIEAIKGLFNFNISWPHIPLPHFYVSGSANPLDWLSQGVPSIGIEWYAKGGIMTKPTIFGMNGNNIMVGGEAGNEAVLPLNDKTLGAIGRGIAQTMGGTSPTINITITGNTVREEADISRIADEVAQRIADELQRKTQLRGGFT (SEQ ID NO:63)
>orf00064
MIKHNELVIDGVRTSSFPFKVIVHDSPSIALGESKTALLEHGGISGAIVQTNKHRELVKKTYTIYLVKPTEEQMNQFMSLFIREKFWLESERVKTTRLWCYKVNVSDLEEVQPGLYMTKATFTCHPTKHFKVTDTQRLTRSGTLNVQGSALAFPKITIVGQSASETSFTIAGQVIRLERLTESLVMVNNPDNPSFKTTTGKPVKWSGDFITVDPAKVKNIGVVLGPGIQSLEIETVWGWA(SEQ ID NO:64)
>orf00068
LLYLLNEDVRTVRWNGESLHEATSAIVKETMNGDFTLTVKYPISDSGIYQLIQEDMLIKAPTPVLGAQLFRIKKPVEHNDHLEITAYHISDDVMQRSITQMSVTSQSCGMALSRMVQNTKTALGDFSFNSDIQDRRTFNTTETETLYSVLLDGKHSIVGTWEGELVRDNFAMTVKKSRGENRGVVITTHKNLKDYQRTKNSQNVVTRIHAKSTFKPEGAEKETTIRVTVDSPLINSYPYINEKEYENNNAKTVEELQKWAQSKFSNEGIDKVSDAIKIEAYELDGQVVHMGDTVNLKSWKHNVDAFKKAIAYEFDALKEEYISLTFDDKAGIGGSRASGGLSSAADAILGVTESAQEIALEKALQNADLDFDHKAGLLRQEISDDIELAKARAEEVKRELSDTINQRFNSFDNGPLKETKRKAEEALRNAGASSSLAQESKRIGLDSVARLEAFKSQTTSAQTALSGDLDALKRTIANDIRPKQAQAEAEIAKQVEALSRTKNELAGASTLLAQEAKRIELDSVARLEAFKSQTTSAQTALSGDLDALKRTIANDIRPKQAQAEAEIAKQVEALSRTKNELAGASTLLAQEAKRIELDSVARLEAFKSQTTSAQTALSGDLDALKRTIVNDIRPKQAQAEAEIAKQVEALSRTKNELAGVKSAQATYKETTTRRLSELTNLANGKASKSELTQTAEELSSKIASVQASGRNLFLNSLFKQDIPKTGIWTTSTYTATIDSESKYLGHKALKIIGLNPSGRDGGNPKVTYPALGQFGKVIPGSTTNQDVTISFYAKANKNGIMLRSRLGNIGYKTGNVTLSTEIKRYAVHIPKGWTNESKRTTNEWLFNFNQEGTVWIWMPKFEISDVDTSYSEAPEDIEGQISTVESTFKQRANSLEAGVSRLTEGLRTKADISSLNVTAENIRQSVKSLETDTQNKLNQKLSQAEFEVRAGSIRQEILNATKDKADKTLVVSEAGKLREEFSKMKVGGRNLWIKSKTVGAVIEKLPENHVTGQKECYRLENNSTLTFNLEPDFSSRLYQKVTFSAWIKYENVVQGRNFWNVFNCFKHYLFRKNSETGVQSGPDYDTLGRYKGSADWKYITFTYDYSEKTNFDQLKTLLRFNLEGATSGTAWVTGIKVEIGSVATDWSPAPEDADGLITEAKATFERTAQGLRTDLSAITEGLRTKVDISALNVTAENIRQSVKSLETDTQNKLNQKLSQAEFEVRAGSIRQEILNATKDKADKTLVVSEAGKLREEFSKMKVGGRNLWIKSKTVGAVIEKLPENHVTGQKECYRLENNSTLTFNLEPDFSSRLYQKVTFSAWIKYENVVQGRNFWNVFNCFKHYLFRKNSETGVQSGPDYDTLGRYKGSADWKYITFTYDYSEKTNFDQLKTSLRFNLEGATSGTAWVTGIKVEIGSVATDWSPAPEDADGLITEAKATFERTAQGLRTDLSAIQEYVNKDGQRQEALQRYTREESTRQATAVRELVNRDFVGKATYQEDVKGINQRIEAVKTSANKDIASQIASYRQSVDGKFTDISSQITTYKQDVGGQISGLSNRLTSSEQGTTTQISNLSNRINSNKQGTDNQISNLKTQVATNKDNAERQMGRISDQVSANKANADSQFANVTNQLARKVETTDFQRVKETSKLYERILGNTENGIADKVARMALTNQLFQVEVAKNASNGQNLLKGTKDFSGGWKNKGANWKKHAEKYKGVDVLFKNNSWNGVGQEIDAKIGEVYTFSLWMKSDWKNDTVNFYVNRNGSVEKGWGVPSETSVAITSEWKRYSFAFKITVDGFIFPRVERLNQNTNLYIAGLKLEKGSYATPYTEAPEDTDEAIRSVQSQLTGSWAVQNINSAGDIISGINLGANGHNRFVGKLTHITGETLIDRAVIKSAMVDKLKTANFEAGSVTTTILDAEAVTADKVRFDAAFIRKMTANDAFIDQLTSKRIFSTKVESVISSSTFLEAYQGRIGGFTIGRFAQGRGRWISGINQFSVGMGNGEGGSYNGENTAFWANWGHSWNSPGPNAWYVTTSGNMYCRNGADFHGKVDFSNSSRANFYGNTTFSRSPVFSNGIELGSKDVLGDGWNPKGGRNAVVWWNQVGSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFIENKKHEEIGLIAQEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTIA (SEQ ID NO:65)
>orf00069
MNTEQLNQALQMTIREMSTTSTNSMITSNILSIQLNEQREENQRLQARVDELEALLDEQTKPADKGE (SEQ ID NO:66)
>orf00071
MAETIQNTDNLLDLTKITEPFDLASALRYMKENGEFIRCKNVSDDFYMYRDVQKRPVIVNGRRQFKDIETVWAFNQWGGTITTINVAVLLNHEFYIMKFDAEGNPDWTNPTVEPKE (SEQ ID NO:67)
>orf00072
MQIEFFNFLRSVVQTEDGLVLYALALIVSMEIIDFVTGH (SEQ ID NO:68)
>orf00073
MILIPASVLLPEKTGFVFLHSIYLGYIAFTFQSLIENYRKLKGNVTLFQPIVKVFQRLLEKDDDTKKGE (SEQ ID NO:69)
>orf00074
MQQITEIITNGAISILVILAGIAVKAVKEYLVKKGGEKTIKIVEILAKNAVNAVEQVAAETGYKGDEKLAQARAKVRAELTKYNISMTDKDLDTFVESAVKQMNDAWKGR (SEQ ID NO:70)
>orf00075
MAFNQFNRCVTLSIPTAPNIPTSVVHRTYLHDTAVSDNM (SEQ ID NO:71)
>orf00079
MKNREEEWQGIIAKNAILLIIAPFYFLIIVKNGVLLKIKTVTEITAF (SEQ ID NO:72)
>orf00085
VEEVEVAEVKNARVSLTGEKTKPMKLAEVTSINVNRTKTEMEEFTRVLGGGVVPGSLVLIGGDPGIGKSTLLLQVSTQLSQVGTVLYVSGEESAQQIKLRAERLGDIDSEFYLYAETNMQSVRAEVERIQPDFLIIDSIQTIMSPEISGVQGSVSQVREVTAELMQLAKTNNIAIFIVGHVTKEGTLAGPRMLEHMVDTVLYFEGERHHTFRILRAVKNRFGSTNEIGIFEMQSGGLVEVLNPSQVFLEERLDGATGSSIVVTMEGTRPILAEVQALVTPTMFGNAKRTTTGLDFNRASLIMAVLEKRAGLLLQNQDAYLKSAGGVKLDEPAIDLAVAVAIASIYKDKPTNPQECFVGELGLTGEIRRVNRIEQRINEAAKLGFTKIYVPKNSLTGITLPKEIQVIGVTTIQEVLKKVFA(SEQ ID NO:73)
>orf00088
MGVSIFLALFYMIPALYFLFHIGKKWELPKKVLILSLLGAICSFTSLLLFGIYNHRRKSSKV (SEQ ID NO:74)
>orf00094
VFVALVSITFSLTNFFKILINLTAQVSPQVIDEKILMMDLNLNNYLSTVIQLRQDVYTGIKILH (SEQ ID NO:75)
>orf00095
VNIASLQNGHIFCWQVQHIANKLTSDFWIAEYFLSNQIISWADARMSENPPIPPSIVS (SEQ ID NO:76)
>orf00103
MKIKEQTRKLAAGCSKHCFKVVNGTDEVSSKHCFEVVDGTDEVSSKHCFEVVDRTDEVSNHIYGKATLTWFEEIFEEYKSLHNKTHITKVV (SEQ ID NO:77)
>orf00109
MNDDDSRCIHIERDGKTIEFGYLNISSTDRNTSHADGLVGIFNSNFSGVRVRGIAVFLNGPDNLDTTLVGNFQTIWNFRIICIHSQNTCNKG (SEQ ID NO:78)
>orf00110
LEFNFCRSIIKNGRDNLPNTNSTSGMATRWANHNWSDDIKDRLKTK (SEQ ID NO:79)
>orf00114
MRYDFGKVYKEIRESKGLTQEEVCGNVISRTSLSKIESGKATPKYENMEFLLRQINMSF(SEQ ID NO:80)
>orf00116
MNYNLKYLLSGIFVLVFIGFLVWMRYFNQRKEEEHSDVSFEARLDSEVKTLYKQLGLGEEPHYFLAYRYLHPWFDLAILPPTVEIFLTKIALVMVTPDELLIRNLGNGLTFTSEHHRDLRQGLIRIPKSEMKEFEIRNWKKFFVFGDFLTIKTSQHSYYLQVRDDGLQKGSLSTKHFSDLKSQDFLGLLTDKRTF (SEQ ID NO:81)
>orf00124
LSLLDLRGSLCLRIYLHEPLITTVSQDFTSLSDISHF (SEQ ID NO:82)
>orf00125
MDFKSFIIGLVVGIFGPYMDDLIRKKFLKSSEKKTEKSVKK (SEQ ID NO:83)
>orf00175
MGRFILFENLFKPGQLHLAVDMVTDFVSSIHNLKTVF (SEQ ID NO:84)
>orf00177
METSISMADFYGKYQNENLELIDVREAHEFQAGHAPGAKNLPLSTLEQGYKELKPDHEYYVICQGGVRSASTCQFLSSQGLTVTNVEGGMNVWPGQVE (SEQ ID NO:85)
>orf00179
LGGKSCLLEDRLCDIAAQTTVAADDVGLFFVQFISFLLDTLSVFDTIVQN (SEQ ID NO:86)
>orf00183
MKIKDQTRKLAAGCSKHCFEVVDRTDEVSSKHCFEVADRTDEVSNIYTARRR (SEQ ID NO:87)
>orf00184
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKGHGGVVNRGLAEASGRYFKVVDSDDWWILVPT (SEQ ID NO:88)
>orf00185
LKILETLQELESKGQEMDVFVTNFVYEKEGQSRKKSMSYESVLPVRQIFGWDQVGNFSKGQYIMMHSLIYRTDLLRASQF (SEQ ID NO:89)
>orf00186
MYYLPVDFYRYLIGREDQSVNEQVMIKCIDQQLKVNRLLVDQLDLSQVSHPKMREYLLNHIEITTVISSTLLNRSETAEHLAKKRQLWTYIQQENPEVFQAIRKTMLSRLTKHSVLPDRKLSNVVYQITKSVYGFN (SEQ ID NO:90)
>orf00194
MSLQIKLKKLAKELSKLLKDSNLETVDKDVLENSQEELQKAVLFLADEKGSEHTAAELIDNLKEVIAKLKANA (SEQ ID NO:91)
>orf00202
MKIKEQTRKLAAGCSKHCFEVVDKTDEVSYIYLRQGEADAV (SEQ ID NO:92)
>orf00213
MSKEKVILAYSGGLDTSVAITWLKKDYDVVSVCMDVGEGKDLDFIHDKALKVGAVESYVIDVKDEFATDYVLVALQSHAYYEQKYPLVSALSRPLISKKLVEIAHQTGATTIAHGCTGKGNDQVEYQIAVAKKANEAKK (SEQ ID NO:93)
>orf00215
VLDSLVFMGFSMKLIHDLDTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:94)
>orf00218
MGQLHFITKLLDIKDTNTQII DVVNRDSHKEIIAKLDYEAPSCPECRSQMKKYYFQKPSKIPYLETTGMPTRILLRKRRFKCYHCSKMMVAETSIVKKNHQIPRIINQKIAQKLIEKISMTDIAHQLSISTSTVIRKLNDFHFECNFRNLPKIMSWDVETVRGVTVSIGRWR (SEQ ID NO:95)
>orf00220
MRYDFGKVYKEIRESKGLTQEEVCGGVLSRTSLSKIESGKTTPKYENMEFLLRQINMSFEEFEYICHLYQPSQRTEIMQTYLNMTSIIGSNSLVHFFETCQDYLKTHHDLPIEEIRDMLEVVIYIRQHGAGELSDHAEQVVKKLWRKIEKQDTWYESDLKILNTILFSFPIEYLHLITGKILQRLEVYKNYQHLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKNKKSYDRLAICYVRIGICTDNAKLIQKGFSLLELTEETSMLSHLKKEVETHYQPKKL (SEQ ID NO:96)
>orf00221
MNSKELSISMLKKYPCTMQHDQSDCAAAVVSTVLLSYKKELSIMKIREIIGTDMYGTTVSGIVSGLNKLNFTVKAVRVALEDLTPKLTFPAILQVKNDLGQNHFVVLHSIKRNSKFYVADPASGIRKMSSDELGEIYQGITLFMVPNSDFERGKLKGKGLLDLFGRLIFNQKGLISTVILASFVLSIIGILSSLFSKVIMDEVIPYALKNSLYMFLIVFGIVSFLQTLLSAFRQHVLLFLSRKIDIPVLMGYYDHIIHLPYSFFGSRRVGDVLTRFQDAMTIKNVFTSVSISLVMDITLSVISAVVLWTINQSLFLILVFMVIVNIILIYCFKKPYKKINHEQMEANGLLNSQLIESIRNIDTIKSQHDEEQRLNKIEEKFVHTLEIGYKEGVLQNIQSTISSMTSTMGGLLFMGVGALFIIDGKMTIGDLLVFQTLSQYFTEPIQNLVGLQLTFQEVQVAVSRLQELMEVDREDIALDYSIRDFTLCDDIEFKDVTFAYGSRPPVIKDFNLRIKQGEKIAFVGESGAGKSTLVRLLLRFINPSEGKIRIGENDLSDLDYGKLRKKISYIPQTIELFTGTIIDNLKIGNPSVTYEDMVRVCRIVGIHDTIQRLQNRYGSFVEEGGQNFSGGEKQRLAIARALLSKADLYIFDEATSNLDSFSEQIIQDLIFNKIMDKTTIVVAHRLSTILRCDKICFLENGTIVEYGTHEELMAKNGKYARMVGLQSVQVNQQIQSQAVLDTEEVTYG (SEQ ID NO:97)
>orf00222
MAKLEVKDNKKLVLKSVICKKLHDTKVEDVDQEINKFHQHLQLLKAQIFGPLIVKSCGTTIHDDGLITTDFEFYIQAHNAQQYSNIYDVQDSISVPYCLYVRFEDSPEYLQYAYSKLDLYVYENDIQTDGIVYTVYVNSSPEKMVVDIFRPIVSL (SEQ ID NO:98)
>orf00223
MKLYNKSELRYSRIFFDKRPPAFAFILIISTAIILSGALVGAAYIPKNYIVKANGNSVITGTEFLSAIGSGKVVTLHKSEGDMVNAGDVIISLSSGQEGLQASSLNKQLEKLRAKEAIFQKFEQSLNEKYNHLSNSGEEQEYYGKVEYYLSQLNSENYNNGTQYSKIQDEYTKLNKITAERNQLDADLQTLQNELIQLQQQGDSSSLSDTTSDDDKAKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLISELGTARSNNNKSITELEANLGVATGQDKAHSILASNEGTLHYLVPLKQGMSIQQGQTIAEVSGKEKGYYVEAFVLASDISRVSKGAKVDVAITGVNSQKYGTLKGQVRQIDSGTISQETKEGNISLYKVMIELETLTLKHGSETVILQKDMPVEVRIVYDKETYLDWILEMLSFKQ (SEQ ID NO:99)
>orf00224
MELVLPNNYVVIDEEEMMYLDGGAIYIPRWAITGAITGAAYAALAAAGGGGLQLVLASYGLRSALVAGIVKGLGVLGIHIGNAFANTVIRSIASAGIGAGADWIFTNIIDGWDGRRDNQLRIG (SEQ ID NO:100)
>orf00226
MKDDQKYLLAGLYSLLVAIFYFPLIESKGIFVSILMAVLLLYLIYFIATVIHIVIIKFIRKKSFKYLVLYPFTYDGSWRFQPINLLYFPEMVRDVIPINLVQEYCQGQPYGLLKKMLKRIRLSREISLLLATIIVYFFTHRILPLSVFTFIFSYILLFAQSYLGGNTVWIGNRRLIIDDEFEKILLSKSYIKEISSARYSEYLTCEYKNLTPIILLAIFENLLDSYLIQNQSKVDLDIFYKVLPLLYKEKYTMGFNYFVSLNYLLYKVGFLGIIYDNEALRDLSKQYLNKNISELQDGSFEDGIQDAVASKQIVVINEFIACLNSRCVPSQYDRFFYKDRPYIFSRKSPIKG (SEQ ID NO:101)
>orf00227
MKNKRYFFDTILIILLLISTIFCVSPVFIKLDILGTPSHAILTFVLAIPLFYILSQCLHTLLLLVSSIFCKLRPIYFYFIFVIIIGARKYYRILFHQLMGFSPGVAVFYKESQTTKNLFKFYYFLYFTTLISYYFFFTFVYDKPLLLPLIPFSIIIALVQKLYRIENQQLFLLKSKVLTILESKKDCEFNLQDYHEIWKLQSKSELPCVALSYISLIKPYLSESVREQIDLLEVKRFKKINHPISLYGMLDVIKLNLYLRHYNEKNKYESMLKKILEVRPDFVLIEQNIDDSLNSSQPLSLSLAISEIQLLLEVYMGIKHVSIRR (SEQ ID NO:102)
>orf00228
MIRKPIIFLLMLPIWGLWIELHLLVSNLQLNLEIPFDFVVSTSLTFFVLILSKIVLDILYALKDLYKKEALITIFPFIFIRRKKVNVRFSPYFSFHRKSLSPDDLRSRIIWSFILEIAIILVFILKIPFAIIMLTTIFFWTIMDINHLVFNKTEFLFNQNKWQKEDSFESDLTKTLKDKIQKSELSYSDLMSLQLYDAMNQSTFLTDSELFEDILKKIEDSHNTLLCTGLVELLLYEMSISNNNNWQEKVDKIRIQLIRINQLDFFYYTSWLRQNFDFCMNREYHKMKSRKLLLSNKKIV(SEQ ID NO:103)
>orf00229
MELVLPNNYVVIDEEEMMYLDGGAYLSKRACQGICVALAMSPGTFIALTGAAVLTKKLINYIKVGGLGGWLIGAAAGVLAGAAGRIAYCIGYGALNRGCDISGNPYPWDGFISATVR (SEQ ID NO:104)
>orf00230
LGWIHICDSKMSNVDKIRKIHIIVCWVYIFLSFRAIINDTEYFLLIFLAFIYSIVSLPLYSVKNKIVSICLVINSILLMSFPILINKFFPESFSTYIVLISVFITELIIFHLIGKDFDTKLTNEYKKISQFRSKVSQSPWIKYLEISSFILTIFPSILYGTVDNHVLTLIFLIKICVDTTIKFLFIRLFDTSTLMKRRIFFLFALDVIAYLFLGYLLVIQKAGYLFSVLLLFSNFSVPFIKAKEYELFKNSK (SEQ ID NO:105)
>orf00232
MNKKKMILTSLASVAILGAGFVASSPTFVRAEEAPQVVEKSSLEKKYEEAKAKYDAAKKDYDEAKKKAAEAQKKYEEDQKKTEEKAKKEKEAAKEVDDASLAVQKAHVEYRKVLDSRNSYRNPSDYAKKLAEADKKITEETTKLTNAQTKFQSIRTTIVVPGQSELAETKKKAEEAKAEEKVAKRKYDYATLKVALAKKEVEAKELEIEKLQDEISTLEQEVATAQHQVDNLKKLLAGVDPDDTEAIEAKLKKGEAELNAKQAELAKKQTELEKLLDSLDPEGKTQDELDKEAAEAELNKKVESLQNKVADLEKEISNLEILLGGADSEDDTAALQNKLATKKAELAKKQTELEKLLDSLDPEGKTQDELDKEAAEAELDKKVESLQNKVADLEKEISNLEILLGGADSEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEEETPAPAPQPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQPEKPATPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYLNSNGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYLNSNGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYLNSNGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGLGALAVNTTVDGYRVNANGEWV (SEQ ID NO:106)
>orf00233
MKIRRRYTHIIRIICILTISFKKQFLSSSLSSLTKRVIMNTAQATFNREAHTTFNRE(SEQ ID NO:107)
>orf00252
LKKRMNRWQFLLNQSKEMVGILLLKVKEQELIEFVVNL (SEQ ID NO:108)
>orf00253
LIKVIKRKAFGFRNFNNFKKRILMTLNIKKESTNFVLSRL (SEQ ID NO:109)
>orf00257
MTYNEKRLTNSLERGHMEQLKNTTDLLGLEDKNIKILSVLKYQTHLVVQAKLDSPAPPCPHCQGKMIKYDFQKASKIPLLDCQGLPTVLHLKKRRFQCKNCLKVVVSQTSIVKKNCQISNMVRQKIAQLLLEKQSMTEIAHRLAVSTSTVIRKLREFKFETDWTKLPKVMSWDEYSFKKSKMSFIAQDFESKSILAILDGRTHAVIRNHFQRYQREVRELVEVITMDMYSPYYRLAKQLFPKAKIVLDRFHIVQHLSRAMNRVRIQIMNQFDRKSLEYRALKRFWNPRFFVSRLGLNQSTGLIYYTRIASSSVRNDSISPRFECT (SEQ ID NO:110)
>orf00258
MGYSLKKSRTYCEQDPEKVNRFLKELNHLSYLTPIYIYETGVETYFYLEYDRALSRQLVSLEEDIII (SEQ ID NO:111)
>orf00265
LREGCSIYDNLYPSRIVVGDETVEGRKIAELFLSISTHSTANIKNVMLVSPTEAEAIKLFSNTFLALRVAFFNELDFFAERRSLNAEVVIKGVCLDPRIGNFYNNPSFEFGGYCLPKDTKQLKKEFIEINAPVIEAIDISNTNRKQFIVKQILERKPKIVGIYKLGMKYNSDNYKESAILSIINELLIVGIKILVYEPNLNVSIDNVIFEKNFELFTKQSDLIVANRWDRGLEAYKDKVYTRGIWIRD (SEQ ID NO:112)
>orf00268
MLNLQFAETMELTEAELETVYGGEFGNNAVIPAGAWGGLGTSWSITNFWKKYFNHDSSTVNRRHY (SEQ ID NO:113)
>orf00272
MKIKEQTRKLAAGCSKHRFEVADRTDEVSSKHCFEVVDRTDEVSNHTYGKVKLTWFEESFEEYK (SEQ ID NO:114)
>orf00338
LNTSYSFGKKDQFALEHCFCIKLSIFARAVTLFVSCIN (SEQ ID NO:115)
>orf00359
MLIGEGYRTFPVLIYTQFISEVGGNSAFAIMAIIIALAIFLIQKHIANRYSFSMNLLHPIEPKKTTKGKMAAIYATVYGIIFISVLPQIYLIYTSFLKTSGMVFVKGYSPNSYKVAFNRMGSAIFNTIRIPLIALVLVVLFTTFISYLAVRKRNLFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFILIMSLSVRRLPYTIRSSVASLQQIAPSIEEAAESLGSSRLNTFAKITTPMMLSGIISGAILSWVTMISKLSTSILLYNVKTRTMTVAIYTEVLRGNYGVAAALSTILTVLTVGSLLLFMKISKSNSITL (SEQ ID NO:116)
>orf00360
LIIIASMSTPFVGAYSWILLLGRNEVITKFLTNALYLPAIDIYGFKGIILVFTLQLFPLVFLYVAGTMNSIDNSLLEAAESMGSFGFKPIVTVVLPLLVPTLLAAPCLYL (SEQ ID NO:117)
>orf00362
LLSTTEFIGLSIRILSNLHESKILVGLLNQFFFWNLLLHKTKSNVVSDSQMWENSVVLENQPDIAFAGFHIIDFCIIEVKFSIFDTVETCNHTKKGRFPTS (SEQ ID NO:118)
>orf00380
VTAPTSITPLLVNTHERKSSQSLTSCLVYVVKTVLTSQHLIYSKLKLK (SEQ ID NO:119)
>orf00382
MITIKKQEIVKLEDVLHLYQAVGWTNYTHQPEMLEQALSHSLVIYLALDGDAVVGLIRLVGDGFSSVLVQDLIVLPIYQRQGIGSALMKEALEDYKDAYQVQLVTEQTERTLGFYRSMGFEILSTYNCIGMTWMNRKK (SEQ ID NO:120)
>orf00441
MGAFVLFLFQLTINRYKKKSFYWYKEVIESNGETLDN (SEQ ID NO:121)
>orf00465
VAIDKIAGITSEKDSRAHQIFRISPTCSRCFCNDELVKWVARTIFLQFTKRCCLRSGNITRSNSVTLDIGSTVFRRNVAGQHFQAPFSSSISANCFTSQFAHHRTNIDNLSMPFLYHRRNNCL (SEQ ID NO:122)
>orf00466
LFDLLDHGLDTVLVCHVTDISMGFDANFTISFNPFIDQILIDIVKDNSSAGFSVGFGNSKSNSIRSAGDESNFSF (SEQID NO:123)
>orf00478
MKSLARLLNIHVFISIFLFFALISGAVSHTVLLLLLLFLPALNKGLEKIQSKRIPVLNAALFFLLISFPQLLTNPVQWKFSIFLVVTIISSLAYFYNFYQVVKEVDQKQLI (SEQ ID NO:124)
>orf00480
LEAAGEIETEFQGWIVLVVFNHIDSLSRDTDILGEFELGNTQFLAKFFHTIHLVSFLIYVVYI (SEQ ID NO:125)
>orf00485
MVDRTDEVSSKHGFEVVDKEKLMWFEEVFEECKKILVS (SEQ ID NO:126)
>orf00492
MEGVNHVDIIKVSCGSFISQVNWMMKGKIPNREGFKFSVARLDAIDLVVVHIGHTRCQFSRTGSRSGYDNQVATDFDVVVFAHAFWGNDVIHIRRISFDWIMKIRINSVFLKLVAEGICSGLTSVLCNDNGTNKNP (SEQ ID NO:127)
>orf00493
MFNVASINGNHNLNLLFQFLQELDFVVRFITRKDTSSVEIF (SEQ ID NO:128)
>orf00495
LFFHFLPLDSIIIKNWKLGNYGAKKEIKKIKQTLAQNFKKCYHIL (SEQ ID NO:129)
>orf00498
MQLTSVTAPTGTDNENIQKLLADIKSEYRFDGRPEFVLLGCLQESDCR (SEQ ID NO:130)
>orf00501
MIDIHSHIVFDVDDGPKSIEDSKALLREAYNQGVRMIVSTSHRRKGMFETPEEKIVTNFIKVREIAKEVADDLVIAYGAEIYYTLDALEKLEKKKFLPLMIVVML (SEQ ID NO:131)
>orf00502
MHTSYREIHTRLSNILMLGITPVIAHIERYDALENNEKRVRELIDMGCYTQIDSYHVSKPKFFGEKYKFMKKRARYFLERDLVHVVASDMHNLDSRPPYMQQAYDIIAKKYRAKKAKELFVDNPRKIIMDQLI (SEQ ID NO:132)
>orf00503
MKEQNTLEIDVLQLFRALWKRKLVILLVAIITSSVAFAYSTFVIKPEFTSTTRIYVVNRNQGEKSGLTNQDLQAGSYLVKDYREIILSQDVLEEVISDLKLDLTPKGLANKIKVTVPVDTRIVSVSVNDRVPEEASRIANSLREVAAQKIISITRVSDVATLEEARPAISPSSPNIKRNTLIGFLAGGIGTSVIVLLLELLDTHVKRPEDIEDTLQMTLLGVVPNLGKLK (SEQ ID NO:133)
>orf00505
MAKGHHIKLKVKDKIANVLTCNTISINSITSHSNSTQFMPFGMVLTQPLNIRRHPVFSNFNLSSFYILTERTIQ (SEQ ID NO:134)
>orf00506
MKIAIAGSGYVGLSLAVLLAQHHEVKVIDVIKDKVESINNRKSPIKDEAIEKYLVEKELNLEASLDPAHVYKDVEYAIIATPTNYDVDLNQFDTSSVEAAIKTCMEYNDTCTIVIKSTIPEGYTKEVREKFNTDRIIFSPEFLRESKALYDNLYPSRIVVGTDLDDSELTKRAWQFADLLKGGAIKEEVPILVVAFNEAEVAKLFSNTYLATRVAYFNEIDTYSEVKGLNPKTIIDIVCYDPRIGSYYNNPSFGYGGYCLPKDTKQLKASFRDVPENLITAVVQSNKTRKDYIAGAILAKQPSVVGIYRLIMKSDSDNFRSSAVKGVMERLDNYGKEIVIYEPTIECDTFMGYRVIKSLDEFKNISDIVVANRMHDDLRDIQEKLYTRDLFGRE (SEQ ID NO:135)
>orf00507
MYTFILMLLDFFQNHDFHFFMLFFVFILIRWAVIYFHAVRYKSYSCSVSDEKLFSSVIIPVVDEPLNLFESVLNRISRHKPSEIIVVINGPKNERLVKLCHDFNEKLENNMTPIQCYYTPVPGKRNAIRVGLEHVDSQSDITVLVDSDTVWTPRTLSELLKPFVCDKKIGGVTTRQKILDPERNLVTMFANLLEEIRAEGTMKAMSVTGKVGCLPGRTIAFRTEILRECIHEFMNETFMGFHKEVSDDRSLTNLTLKKGYKTVMQDTSVVYTDAPTSWKKFIRQQLRWAEGSQYNNLKMTPWMIRNAPLMFFIYFTDMILPMLLISFGVNIFLLKILNITTIVYTASWWEIILYVLLGMIFSFGGRNFKAMSRMKWYYVFLIPVFIIVLSIIMCPIRLLGLMRCSDDLGWGTRNLTE (SEQ ID NO:136)
>orf00508
LIFEKEKCFLMLRKNLKYQIMTRAGTILAILFFIILGIIVEVLF (SEQ ID NO:137)
>orf00509
MKKVKKAVIPAAGLGTRFLPATKALAKEMLPIVDRPTIHFVIEEALRSGIEDILVVTGKSKRSIEDYFDSTFELEYSLRKQGKMELLKSVNESTDIKVHFVRQSSPRGLGDAVLQAKSFVGDDPFVVMLGDDLMDITDSTAVPLTRQLMDDYNATQASTIAVMPVRYEDVSSYGVISPRLESSNGLYSVDAFVEKPKPEEAPSNLAIIGRYLLTPEIFSILETQKPGAGNEIQLTDAIDTLNKTQSVFAREFVGKRYDVGDKFNFMKTSIDYALQHPQIKESLKNYVIALGKQLEKLDDCSSSGHL (SEQ ID NO:138)
>orf00510
MNCIESYQKWLNVPDLPAYLKDELLSMDDKTKEDAFYTNLEFGTAGMRGYIGAGTNRINIYVVRQATEGLAKLVESKGETAKKAGVAIAYDSRHFSPEFAFESAQVLAAHGIKSYVFESLRPTPELSFAVRHLGAFAGIMVTASHNPAPFNGYKVYGSDGGQMLPADADALTDYIRAIDNPFAVALADLEEAKSTGLIEVIGETLDSAYLEEVKSVNINQDLIDQYGRDMKIVYTPLHGTGEMLARRALAQAGFESVQVVEAQAKPAPDFSTVASPNPESQAAFALAEELGRQVDADVLVATDPDADRLGVEIRQADGSYWNLSGNQIGALIAKYILEAHKQAGTLPKNAALAKSIVSTELVTKIAESYGATMFNVLTGFKFIAEKIQEFEEKHNHTYMFGFEEG (SEQ ID NO:139)
>orf00520
MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLPADLATALATAKENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKPAEEKPKEDKPAAAKPETPKTVTPEWQTVEKKEQQGTVTIREEKGVRYNQLSSTAQNDNAGKPALFEKKGLTVDANGNATVDLTFKEDSEKGKSRFGVFLKFKDTNNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVSVKTDNQERVKTEDTPAQKETGPVVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYTLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDDAHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATMSNNTHVSGDDHIDVTNPMKDLAKGYMYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQWEKAYKGIVFPEYTKELPSAKVVITEDANADKKVDWQDGAIAYRSIMNNPQGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLKGYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASETYPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLARWEDLKKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHGGEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWENGTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDGSAYLTPWNWDANGKKLSTEKEKMYYFNTQAGATTWTLPSDWAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYVLYRSKQTNPEMSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVYVGVDNRSNAKASITVNTGEKEVTTYTNKSLALNYVKAYAHNTRRDNATVNDTSYFQNMYAFFTTGSDVSNVTLTLSREAGDQATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLSEKHYPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEVGSDNTYAFVVGKGEFQSGRRGTQASNLEMHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNANFRGYNDFMMDNLQIEEITLTGKILTENALKNYLPTVAMTNYTKESMDALKEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTSLVADDFASLTAPAQAQEGLANAFDGNLSSLWHTSWGGGDVGKPATMVLKEPTEITGLRYVPRGSGSNGNLRDVKLVVTDESGKEHTFTTTDWPDNNKSKDIDFGKTIKAKKIVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALAKAQKLTDKDNQEEVASVQASMKYATDNHLLTERMVEYFADYLNQLKDSATKPDAPTVEKPEFKLSSLASDQGKTPDYKQEIDRPETPEQILPATGESQSDTALFLAGVSLALSALFVVKTKKD (SEQ ID NO:140)
>orf00523
LQIAQESSQDTDGINPPVVEEAMVFDRNDCLNQICGNIISLGIDAAFRTQVSNELIFIVVDFTRSCCN (SEQ ID NO:141)
>orf00525
MLNLMWMKIFHRNRTFLFCFLDFKVDVISIINARIVRR (SEQ ID NO:142)
>orf00526
MYNSQALRQIVVVGSIDHLFKRHSSICEIFGLRKRCLSFLW (SEQ ID NO:143)
>orf00537
MKLLKKTMQAGLTVIFFGLLATNTVFADNSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDSLSGEMVVGWQYIPFPSKGSTIGPYPNGMRLEGFPNSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSEDKEEKRYYTNYYFNQNHSLETGWLYDQSNWYYLAKTEINGENYLGGERRAGWINDDLTWYYLDPTTGIMQTGWQYLGNKWYYLRSSGAMATGWYQEGTTWYYLDQPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQEGTTWYYLDQPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQDGSTWYYLNAGNGDMKTGWFQVNGNWYYAYSSGALAVNTTVDGYSVNYNGEWVR (SEQ ID NO:144)
>orf00541
MTTGWFQVNGRWYYAYSSGALAVNTTVDGYFVNYNGEWVQ (SEQ ID NO:145)
>orf00551
MSLADLLEELEAAKDSKKARSMEAYMRHQFSFLGIAVPERNKLYKNIFQKRKKQRLSIGILQTLAGKRILENTNMWLLTI (SEQ ID NO:146)
>orf00552
MEKILLHNLNQTEFFINKAIGWTLRDYSKTNPTWVTCFIEKNKERMAELSIKEASKYL(SEQ ID NO:147)
>orf00555
VLEILKEYLLLEEGDYIFTNNGSPLMITCFNYFLKNSFRKSEIKKDDFVLTAHVFRYSHISLLAELEVPITAIMDRVDYTNETKILSVYTHVTEKMKSNITEKLDKLVLEND(SEQ ID NO:148)
>orf00580
LVIFKNCSHCTSNCKSRTVQGMKEFHLAICFITVTDLSTTGLEIFXXXFHSLIN (SEQ ID NO:149)
>orf00581
MNXXARGFSFVCKDFEVAAHFASSDIGSNLIDMAPRNLDKLFDIDIWVKFQSLISDKEFPNFPSDREVGSACRTEEYCFVK (SEQ ID NO:150)
>orf00587
LQKPLFQILQKLVKISQAFIHDSNFFLAHAINNFXXHSFIN (SEQ ID NO:151)
>orf00590
VGFSLLLLLIFCLFCLLFNVSNQVTNCFQVIFCFDLDIKSIFDF (SEQ ID NO:152)
>orf00614
VNIDSSEFYISHITDGIFDSFLDSNRYLRNFYSVLKVEIDICCEFFVHVFKINATAE(SEQ ID NO:153)
>orf00615
VNTLYLCSSNSNDFFKYTWGDNDFAKLFFNSHRMTSF (SEQ ID NO:154)
>orf00625
LFSPRPGGDNGKGTTFLNFFLNIHKFSFFDSLFFLFQ (SEQ ID NO:155)
>orf00627
VKEEKKAIVLGADNAYMDKVETTLKSLCVHHYNLKFYVFNDDLPREWFQLMEKRLETLNSEIVNV (SEQ ID NO:156)
>orf00643
LAQISILHFDFLSIDKHSHTVFNTLRKSLQTTLALSATSKQCFEQLAASFLVCSLIFIEYKV (SEQ ID NO:157)
>orf00652
MSLITHRRFISSKVTRQKFVDNQIDLKYYIWRYSHALCGIDVAKNKHDVTALNVSGKTVLKPLTFSNNKAGFELLDLSLRQLNQDYLIALEDTGHYAFNLLNFLHEQGYKVYTYNPLLIKKFAKSLLLRKTKTDKKDAHGIALKLLSDPNREQFQHDNRQVELKILARHIHRLKKKQSDWKVQYTRCLDIIFPELDKIVGKHSEYTYQLLTCYPNPQKRLEAGFDKLIEIKRLTASKIQDILSVAPRSIGTTSPAREFEIIENIKHYKRLIDKAETCVNDLMAEFNSVITTVTGIGNRLGAVILAEIQNIHAFDNPAQLQAFAGLDSSIYQSGQIDLAGRMVKRGSPHLR (SEQ ID NO:158)
>orf00654
MDTKSSCLITTGRNDSPSTCLPRVASNNDRFSSEFRIIPDFHCSKKGIHVNMDDFS (SEQ ID NO:159)
>orf00657
MDKMKPVFQALNKELIQENLTLTIICVGGYVLEYHGLRATQDVDAFYDQNQKINEIIARVGKQFNLNTHEELWLNNHVANMNKQPPLSLCESLYSFENLTVLVVPIEYVLGMKMMSIREQDLKDIGAIIKYKNFHSPFDTFKYLKDMGFDTIDLSVLLEGFSYAYGMDWLEKFFKENQDKLREFY (SEQ ID NO:160)
>orf00663
MIPLYRTDNDITKFFTKIRNGHLAKTAGGLDDKFHEANASTSKAFDRQGVGEVNDIRDSAGSQELRINDKRKTENILFLEIRVRIFRVPHPNDSFFSSHFLG (SEQ ID NO:161)
>orf00664
VLSQGDKDITILDAGLLKNGKIGPVTKDTNDIKATDNMIENSFVLLNQQNIMLFCNQGATEGKTNFSPSDKDNFHNKTYFFMM (SEQ ID NO:162)
>orf00669
MKIKEQTRKLAAGCSKQCFEIVDRTDEVSSKHGFEVVDETDEVSSKHGFEIVDETDEVSNHTYGKAKLTWFEEIFEEYKMMGKAGQLVFFDVYRLVRQVS (SEQ ID NO:163)
>orf00698
MEKFNFKNNIGQENKLLQIEIYKFTNFCKLQNYTSVNIFSKDIFEAIVN (SEQ ID NO:164)
>orf00701
VRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIPLPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYGNIPMNWVVIKIKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQLWK(SEQ ID NO:165)
>orf00720
MIRKVNHNIFKHRSVVIFTLTNSYFCKFFINDISISFHSHQHFCWIIIQIK (SEQ ID NO:166)
>orf00724
LDNIHIVLDSLNAVSGIQDFICDGLAIFCNQITSGCSSCK (SEQ ID NO:167)
>orf00735
MNIAWILLYALVINGLEIVIFFKVDGIGLTFDRIFKAFLLKFLLGIIFTTFQFLAVSKYLSYFIEPLFGIGLSFLLLRGLPKKILIFYGLFPMILVELFYRGVSYFVLPFLGQGIVDGDGNPIFLLIMIFVCFIVLVFLKWLDYDFTRLRREFLDTGFQKSLTKINWAMGAYYLVMQSLSYLEYEQGIQSTTVRHLILVFYLLFFMGGIKKLDTYLKEKLQEELNQEQTLRYRDMERYSRHIEELYKEIRSFRHDYTNLLTSLRLGIEEEDMEQIKEIYDSVLRDSSQKLQDNKYDLGRLVNIRDRALKSLLAGKFIKAREKNIVFNVEVPEEIQVEGMSLLDFLTIVSILCDNAIEASAEASQPHVSIAFLKNGAQETFIIENSIKEEGIDISEIFSFGASSKGEERGVGLYTVMKIVESHPNTNLNTTCQNQVFRQVLTVIHAE (SEQ ID NO:168)
>orf00737
MISQEDILKACEVAEIRQDIERMPMGYQTQLSDGAGLSGGQKQRIALARALLTKSPVLILDEATSGLDVLTEKKVIDNLMSLTDKTILFVAHRLSIAERTNRVIVLDQGKIIEVGIHQELMQAQGFYHHLFNK (SEQ ID NO:169)
>orf00738
MSSKISIGQLITFNTLLSYFTTPMENIINLQTKLQSAKVANNRLNEVYLVESEFQVQENPVHSHFLMGDIEFDDLSYKYGFGRDTLTDINLTIKQGDKVSLVGVSGSGKTTLAKMIVNFFEPYKGHISINHQDIKNIDKKSLAPSY (SEQ ID NO:170)
>orf00740
MKSTLGIISVGLVITYILQQVMSFSRDYLLTVLSQRLSIDVILSYIRHIFELPMSFFATRRTGEIISRFTDANSIIDALASTILSLFLDVSILILVEGVLLAQNPNLFLLSLISIPIYMFIIFSFMKPFEKMNHDVMQSNSMVSSAIIEDINGIETIKSLTSEENRYQNIDSEFVDYLEKSFKLSKYSILQTSLKQGNKISSEYPYPMVWRSISHVE (SEQ ID NO:171)
>orf00742
MTSYKRTFVPQIDARDCGVAALASIAKFYGSDFSLAHLRELAKTNKEGTTALGIVKAADEMGFETRPVQADKTLFDMSDVPYPFIVHVNKEGKLQHYYVVYQTKKDYLIIGDPDPSVKITKMSKERFFYEWTGVAIFLATKPSYQPHKDKKNGLLSKLPSSDFQTKISHCLHCSLKLIGHYYQYRWFLLSPRNLG (SEQ ID NO:172)
>orf00767
MGLIKTLAKIYGNYFLTVQGVKVMKTIKKDDHVVVGLGKLFIADKLMDTARWLIKPEDKK(SEQ ID NO:173)
>orf00768
MKFFWGLLAIIFIKPIIGIVKFFWMIISFAVQLLFYKIVFKILDWLFKLI (SEQ ID NO:174)
>orf00776
MHSQTFQFLLMTDKTSLLHRKHRSFIRNIHSKFLILFDLLCGILSRNDSNHNPIS (SEQ ID NO:175)
>orf00779
MARTELPDKIETERLVLRVRTVADAEDIFDYASLPEIAYPAGFPPVKTLEDEIYYLEHILPERNQKENLPAGYGIVVKGTDKIVGSVDFNHRHEDDVLEIGYTLHPDYWGRGYVPEAARALIDLAFKDLGLHKIELTCFGYNLQSKRVAEKLGFTLETRIRDRKDVQGNRCDSLIYGLLKSEWEE (SEQ ID NO:176)
>orf00781
MSDVKEEVSSLSEKQLRQIDVEYAELNDSDIIERLAYLEINNNEKRIVISDIEPTKEIMSVSDQIFEIQKNFQKIKNMFELFISDVSDFLSIKNKLESKELEIEEADVNRFMIHLLSSGKLFVDFNENQIKQKYSKDSEEFDCIHGFASYQYDINFTYRFCHSLRNYSQHTDLPINEVKAVSPDDETVIIDFYIDLDYLLNSNFKWKKLKGELIKLNQETSKIDAIALVKEYFNALTELYGNYNKLFLKLNHNTLVDIKSKLESLKLKHSRYYISKISKYDLKYNPGNYTMSPLAAFAEIEEIYIELSKIGLVKIVNKSN (SEQ ID NO:177)
>orf00785
MSKHPHYELLNLIGYGLAKFDKLFIKEFQCSSKSEFYRYVVSLGIAETTGVVKNRMDLFDPYFDNNRKGWWQKAEVYRFRKDLIDMMFGNEDVHSYAEIVKMLLASEGKKTGITIVEKPIVRTKFKRLQETGMEAENYFILHFDKEEKFQGGQLTDARLYGDGYDFQVDVQEYSYLAEVKGIRKSKGRVRLTAKEFEKVKEFQSDFILSLVTNLDDIPKLVLIDNPLKHFEFKKNIIKNEIIEYRSVEDLY (SEQ ID NO:178)
>orf00787
LSTCWNGKFCHICVALFHCFRAFKLALNEILCLLTNVSFIFVSVAF (SEQ ID NO:179)
>orf00788
LLRKQEREYLRAENAILKKLRELRLKEEKEKEERQKLFKN (SEQ ID NO:180)
>orf00809
LKHLFCHFNPLWIDEIIRLAYKDQDTKDVKSKVKIGN (SEQ ID NO:181)
>orf00856
MKEIAFDAFYQLYQNDQLSLVDVREVDEFAALHLEGAHNLPLSQLADSYD (SEQ ID NO:182)
>orf00859
MVSNHKIACFQLFDKVGIFSLLDELNSLANKAHINLLNCF (SEQ ID NO:183)
>orf00871
MDFFFMNEVKEQVLFRDNHSEHIFWIEGVSDFMIKVNTALW (SEQ ID NO:184)
>orf00878
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSNHTHGKATLTWFEEIF (SEQ ID NO:185)
>orf00885
MKIKEQTRKLAAGCSKHSFEVVDKTDEVSSKHGFEVVDETDEVSSKHGFEVVDETDEVSNHIYGKATLTKFELDFRRV (SEQ ID NO:186)
>orf00890
MEELVTLDCLFIDGTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEEITPLLIKYAMFDKKQKRGYKQSAKNLANWHYNDKEDSYIHPDGWCYRFHHIKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQDRQIFAQRKIDVEPVFGQIKACLGYKRCNLRGKRQVRIDMGLVLMANNLLKHSEMK (SEQ ID NO:187)
>orf00892
MHIHYNTNQTTLPLEISSFLPQDHLVFTIEKVVNTLEDCHFHAFYHAFDRPSYHLKMLVSTLLFAYSQGIFSGRKIEKWKS (SEQ ID NO:188)
>orf00894
LRLWVIFVIMKVMKSYNTLNDYYRKLFGEKTFKVPIDAGFDCPNRDGTVAHGGSTFCTVSGSGDAIVAPDAPIREQFYKEIDFMHRKWPDVQKYLVYFQNFTNTHEKVEVIRERYEQAINEPGVVGINIGTRPDCLPDETIEYLAELSECMHVTVELGLQTTYEATSDLINRVHSYEL(SEQ ID NO:767)
>orf00896
VETVKRLRKYPKIEIVSHLINGLPGETHEMMVENVRRCVTDNDIQGIKLHLLHLMTNTRMQRDYHEGRLQLMSQDEYVRVICDQLEIIPKHIVIHRITGDAPRDMLIGPMWSLKKWEVLNSIEMEMRRRGSVQGCKAVKQEFENEKTT (SEQ ID NO:189)
>orf00908
VQVCVFTNFCFFHCFSSLANCRLFNLRGICLPCISYQ (SEQ ID NO:190)
>orf00915
VFKKDRFSIRKIKGVVGSVFLGSLLMAPSVVDAATYHYVNKEIISQEAKDLIQTGKPDRNEVVYGLVYQKDQLPQTGTEASVLTAFGLLTVGSLLLIYKRKKIASVFLVGAMGLVVLPSAGAVDPVATLALASREGVVEMDGYRYVGYLSGDILKTLGLDTVLEETSAKPGEVTVVEVETPQSTTNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPSPVEEVGGEVESKPEEKVAVKPESQPSDKPAEESKVEPPVEQAKVPEQPVQPTQAEQPSTPKESSQQENPKEDRGAEETPKQEDEQPAEAPEIKVEEPVESKEETVNQPVEQPKVETPAVEKQTEPTEEPKVEVTSIPQTTRYEEDLTKEHGTREVVKEGKNGSRTVTTPYILNATDGTTTEGTSTTDEAEMEKEVVRVGTKPKEKLAPVLSLTSVTDNAMLRSARLTYHLENTDSVDVKKIHAEIKNGDKVVKTIDLSKERLSDAVDGLELYKDYKIVTSMTYDRGNGEETSTLEETPLRLDLKKVELKNIGSTNLVKVNEDGTEVASDFLTSKPVDVQNYYLKVTSRDNKVFRLTVEKIEEVTEEGQPLYKVTAKAPNLIQHTDATKMQDEYVYYIEKTRATDGDIYYNFNDLVNAMNKNKTGTFKLGADLNATGVPTPAKSYVTGDFRGTLTSVDGEHYTIHNTSRPLFNNIIGGTIKDINLGNVNIHMPWANNVASLANIIKGGTTIENVKVTGNVLGKDWVSGFIDKIDSGGTLRNVAFIGNVTSVGTGGSFLTGIVGENWKGLVEQAYVDANIRGKKAKAAGIAYWSQNGGDNYAVGRYGAIKKSVVKGSIDVEKPIEVGGAVGSLNYLGYIEDTVAMMKVKNGEIFYGSHDIDTDPYYTGERVNRNFIVDGVSEGKSSYKYSKQQNRIKSVSQEEADKKIKELAITADKYAITEPIVNKLNALTTRDNEYRTTQDYKADRELAYRNIEKLQPFYNKEWIVDQGNKVPSNSKLLTTEVLSVTGMKDGQFVTDLSEIDKIMIHYADGTKEEMNVTAVADSKVKQVREYDVTDLGVVYTPNMVDKNRDQLIADVKAKLSSVELISPEVRALMDKRGKAEENTEGRQNGYIRDLFLEESFAEVKAGLGKLVKALVENEDYQLNSDEAAMRALIKKVEDNKAKIMMGLAYLNQYYSFKYAELSIKDIMMFKPDFYGKNVNVLDFLIKIGSSERNVKGDRTLEAYRETIGGTIGINELNGFLHYNMKLFTNHIDINDWFKKAIEKNAYVVEQPSTNPAFANKKYRLYEGINNGQHGRMILPLLNLKNAHLFMISTYNTISFSSFEKYGKDTDEKREKFKSEINKRAKEQVNYLDFWSRLATDNVRDKLLKSQNVVPTPVWDNHNSPNGWASRHGHIDGKPDYAPIREFFGRINKYHGYKYGYGAYAYIFAAPQPMDAVYFVMTDLISDFGTSAFTHETTHINDRMAYYGGHWHREGTDLEAFAQGMLQTPSVSNPNGEYGALGLNMAYERQNDGNQWYNPNPNKLKSRAEIDHYMKNYNEALMMLDYLEAESVLPKLKGNNDRWFKKMDKQMRKDGQPHQFDKIRDLNNEEKKIQLASIEDLVDNNFMTKHGAPGNGTYNPSDFSSAYVNMNMMTGVYGGNSSDGAPGAASFKHNTFRMWGYFGYENGFIGYASNKYKAEANKAGQTLSDKYIINKVSGGTFNTLEAWKKEWFKQIKTKAQKGFTAIEIDGKTIDSYEKLKDLFDKAVEEDLKGTGTDKTVKLKEKVYKQLLKNTDGFSGDLFTAPQA (SEQ ID NO:191)
>orf00933
VGDRIFIAFLQKLGLLDNLTGIREKLHPITGQGDSLGIADKDLNAHFIFQISHCIGETWLSDKELLGCLIHGASFDDFDNIM (SEQ ID NO:192)
>orf00941
VSRWDGHSDKGEAPAGKTSYAWIWTKWGEQVAFYCDYD (SEQ ID NO:193)
>orf00955
MKLFKPLLTVLALAFALIFITACSSGGNAGPSSGKTTAKARTIDENKKSGELRIAVFGDKKPFGYVDNDGSYQGYDIELGNQLAQDLGVKVKYISVDAANRAEYLISNKVDITLANFTVTDERKKQVDFALPYMKVSLGVVSPKTGLITDVKQLEGKTLIVTKGTTAETYFEKNHPEIKLQKYDQYSDSYQALLDGRGDAFSTDNTEVLAWALENKGFEVGITSLGDPDTIAAAVQKGNQELLDFINKDIEKLGKENFFHKAYEKTLHPTYGDAAKADDLVVEGGKVD (SEQ ID NO:194)
>orf00988
MDTPDENGYVADDYRITYLEAHIKAMRDAIYQDGVDLLGYTTWGCIDPVSAGTGEMNKRYGFIYVDRDNVGNGALKRSKKKSFYWYKDVIDSNGASIG (SEQ ID NO:195)
>orf01015
MTEPDFWNDNIAAQKTSQELNELKNTYNTFHKMEELQDEVEILLDFLAEDESVHDELVAQLAELDKIMTSYEMTLLLSEPYDHNNAILEIHPGSGGTEAQDWGDMLLRMYTRYGNAKGFKVEVLDYQAGDEAGIKSVTLSFEGPNAYGLLKSEMGVHRLVRISPFDSAKRRHTSFTSVEVMPELDDTIEVEIREDDIKMDTFRSGGAGGQNVNKVSTGVRLTHIPTGIVVQSTVDRTQYGNRDRAMKMLQAKLYQMEQEKKAAEVDSLKGEKKEITWGSQIRSYVFTPYTMVKDHRTSFEVAQVDKVMDGDLDGFIDAYLKWRIS (SEQ ID NO:196)
>orf01045
MQVIKRNGEIAEFNPDKIYQAILKAAQTVYVLTDDLRQNLAQVTKKVVLDLQEAKVERATISMIQSMVEHRLLGAGYITIAEHYISYRLQRDLERSGYGDHIAVHLHFEQIR (SEQ ID NO:197)
>orf01068
MELFKTWKKNMVLYGLKSQIGTVYRNNDRTTSFYDVGNFLYLAGELDSRFWEDFVRKYGLDYKIIISENTNWQDFLHRKVGLNSFTRYSFKDKANFQVEFLNNLVTHLEEGYNIVPIDNHIYNCFSTEEWSQDLQGDFESYQDFVLKGGFGFVILKNNELIAGISSGLVYRKAVEVEVATRPNEQGNGFAKKLGAAMILESLNRDMFPLWDAHNEASKKVAEFLGYELSEPYEAFELEEILI (SEQ ID NO:198)
>orf01090
LGSDRKALHKPVYLFWCESFDILFCTWSSQFSVLKAFI (SEQ ID NO:199)
>orf01110
MKVINQTLLEKVIIERSRSSHKGDYGRLLLLGGTYPYGGAIIMAALAAVKSGAGLVTVGTDRENIPALHSHLPEPMAFSLQDQQLLKEQLEKAEVILLGPGLRDDASGENLVKQVFVNLSQNQILIVDGGALTILARTSLSFPSSQLILTPHQKEWEKLSGITIEKQKEDATASVLTSFPQGTILVEKGPATRIWEVGQSDYYQLQVGGPYQATGGMGDTLAGMIAGFVGQFRQASLYERVAVATHLHSAIAQELSQENYVVLPTEISRYLPKIMKIICQQDRVSKDKLV (SEQ ID NO:200)
>orf01114
VLASNRKFIFFFFRIGILILKNIKSFNQFLALFHKIPSHDGSRKSLSWSDGKSLKXXFIH(SEQ ID NO:201)
>orf01121
VLDSKEELKESENDAPKLETPLREEPRLAPQTLLEASEVLENKREESKVGITEPAQDSPILAPVEETKEEAVTEKPTNTRSLTAEDLVKISKGELHLENDLIDESFYGEKALDLEGDDYQDGIKNKDGKDYLGYNSHPLLADSDGDGLADGEDDNKKEWYVTDRDSLLFMELAYRDDDYIEKILDHKNLFPSLYLDRQEHKLMHNELAPFWKMKKAYYTDSGLDAFLFETKSDLPYLKDGTVHMLAIRGTRVNDAKDLSADFVLLGGNKLAQADDIRKVVGELAKDISITKLYMTGHSLGGYLAQIAAVEDYQKYPDFYNHVLRKVTTFSAPKVITSRTVWDAKNGF (SEQ ID NO:202)
>orf01148
MGRNPKTRPEERTELERLQSENEYLRAENAILKKLRELRLKEEKEKEERQKLFKN (SEQ ID NO:203)
>orf01150
LSTCWNGKFCHICVALFHCFRAFKLALNEILCLLTNVSFIFVSVAF (SEQ ID NO:204)
>orf01165
VVLSTSAILVACGKTDKEADAPTTFSYVYAVDPASLGYSIATRTSRTDVIGNVIDGLMENDKYGNVAPSQKDYDLNSTGWAPSYQDPASYLNIMDPKSGSAMKHLGITKGKDKDVVAKPGLDKYKKLLEDAVSETTDLEKRYEKYAKAQAWSTDSSLLMPTASSGGSPVVSNVVPFSKPYSQVGIKGEPYIFKGMKLQKDIVTTKEYNEVFKKWQKEKLESNSKYQKELEKYIK (SEQ ID NO:205)
>orf01169
LNFDFFIFLAHFIPLFTFSILQENPKTSKKKLYIRLL (SEQ ID NO:206)
>orf01176
MGFSMKLIHDLDTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:207)
>orf01191
LIRIIRNIYRSGEGNTSVFQSFIDQINSNQFCYGSNFDRLRCILLIENFTSICLNSNRMFSGNGKILSNSSRSTP (SEQ ID NO:208)
>orf01202
MIYFDNSATTKPYPEALETYMQVASKILGNPSSLHRLGDQATRILDASRHQIADLIGKKTDEIFLTSGGTEGDNRVLQGVAFEKAQFGKHIIVSPLPHSAVLE (SEQ ID NO:209)
>orf01223
MKIKEQTRKLAAGCSKHCFEVVDKTDEVSSKHGFEVVDETDEVSNHTYGKATLTRIEEIFEEYKSS (SEQ ID NO:210)
>orf01233
LVYAPFSFNILLDYITFDFKILLFSVFLAINRFHNDFIQFLL (SEQ ID NO:211)
>orf01236
MSGYSGLSFFEVALAEFLDIVSAVYLEDADGIIVNLWGILDK (SEQ ID NO:212)
>orf01246
MEGVAKGRIGRKKNNGIDNRCCHKKRNGRVTWNLFFQKTIDDGDDSTFTRREKHTDEGPKKDSPPTISREKMINLVRCDINFNQP (SEQ ID NO:213)
>orf01249
VDKTDEVSSKHCFEVVDKTDEVSSKHCFEVVDKTDEVSSKHCFEVVDRTDEVSSKHCFEVVDRTDEVSNRTTVRRS (SEQ ID NO:214)
>orf01251
MDYGLYHPCPIVTPSQSSSIVANPASKLISASEELIPDAIAVDFVEVNCY (SEQ ID NO:215)
>orf01254
MKKLFQEKFSKKPSHKEIERVQLGCAMMQATFHLMGY (SEQ ID NO:216)
>orf01261
VVIGVASATTNIWIIFLSGFAAILAGAFSMAGGEYVSVSTPKDTEEAAVSREKLLLDQDRGLAKKSLYAAYIQNGECKTSAQLLTNKIFLKNPLKALVEEKYGIEYEEFTNPWHAAISSFVAFFLRSLPPMLSVTIFPSEYRIPATVLIVGVALLLTGYTSARLGKDPTRTAMIRNLAIGLLTMGVTFLLEQLFSI (SEQ ID NO:217)
>orf01269
MLYVGIDVAKNKHDVTALNVPGKLFLNHSLFQIIKLVLNS (SEQ ID NO:218)
>orf01296
MDFEYFYNREAERFNFLKVPEILVDREEFRGLSAEAIILYSILLKQTGMSFKNNWIDKEGRVFIYFTVEEIMKRRNISKPTAIKTLDELDVKKGIGLIERVRLGLGKPNIIYVKDFMSIFQVKENDLQKSKNLTSEVKDFNLRSKENELQEVKNLDSNYIENNKSKYSKREYSFGENGLGTFQNVFLAAEDISDLQIIMNSQLENYIRLPAKLES (SEQ ID NO:219)
>orf01304
LLHIRVCKTFDRIPYCMLALFLSKSIGLTILLHKVKTVIFIDDQSNDKTCKICIHISFFRIKLSQQCQLSFSVYF (SEQ ID NO:220)
>orf01309
MTQEDALIVISHIKVLSIVPNRCLKPLDKTFSLYNWIFLSQKYILLQANFLKISRVQLQ(SEQ ID NO:221)
>orf01313
VTTHDEPVYEKHGVLHYAVANIPGAVARTSTIALTNVTLPYIEALAGKGFAQAISEDEGLRQGVTTYQGYLTSLPVAQGLNRDYTDINDLV (SEQ ID NO:222)
>orf01314
VFFIDGFIVRCHTVSCFDNATLVNSNVNDTEPGRICLTISSVTNSGAFAPGMRMAPITTSASLILASMLNELDIRV (SEQ ID NO:223)
>orf01316
MLIGIPKEIKNNENRVALTPAGVHSLVSRGHRVLIETNAGLGSGFTDADYQKQGAEIVATAGEAWAAELVVKVKEPLSSEYGYLRDDLLLFTYLHMAAAPELADAMLAAKTTGIAYETVRDNQGQLPLLVPMSEVAGRMAV (SEQ ID NO:224)
>orf01334
LIRILGNSFISIDKAHEQAEEYGHSFEREMGFLAVHGLFTY (SEQ ID NO:225)
>orf01383
VILFFIRFRVSWLTYFIMSLIFIRFTILVCLIFTNMVASVTKAIILMADVRLDVRHST(SEQ ID NO:226)
>orf01387
MIAMRSYITLICNLNNNLFCLNSFFLTNLVWSQIFSLLSVFITVYI (SEQ ID NO:227)
>orf01388
MFLFLAIDFIFYSYIFCMSLIFKVNIILSIYLNNISSLNLTDDILIFRLIVGGFH (SEQ ID NO:228)
>orf01390
VIPRGDHNHYIKVQTKGYEAALKNKIPSLQSNYQPGTFDEKAVLAKVDQLLADSRSIYKDKPIEQRQIELALGQFTESLKKIKVS (SEQ ID NO:229)
>orf01419
MARLEPAKIAKIVLGILLYIIDLIKSSFVLPIPKAAKKSLILISFVPSFNDKNIVIRRPRQITKIMPRFICFLFRIFACIS (SEQ ID NO:230)
>orf01436
MELSAIYHRPESEYAYLYKDKKLHIRIRTKKGDIESINLHYGDPFIFMEEFYQDTKEMVKITSGTLFDHWQVEVSVDFARIQYLFELRDTEGQNILYGDKGCVENSLENLHAIGNGFKLPYLHEIDACKVPDWVSNTVWYQIFPERFANGNALLNPEGTLDWDSSVTPKSDDFFGGDLQGIIDHMDYLQDLGITGLYLCPIFESTSNHKYNTTDYFEIDRHFGDKETFRELVDQAHHRGMKVMLDAVFNHIGSQSLQWKNVVKNGEQSAYKDWFHIQQFPVTTEKLVNKRDLPYHVFGFEDYMPKLNTANPEVKNYLLKVATYWIEEFNIDAWRLDVANEIDHQFWKDFRKAVLAKNPDLYILGEVWHTSQPWLNGDEFHAVMNYPLSDSIKDYFLRGIKKTDQFIDEINGEFMYYKQQISEVMFNLLDSHDTERILWTANEDVQLVKSALAFLFLQKGTPCIYYGTELALTGGPDPDCRRCMPWERVSSDNDMLNFMKRLIKIRKYASVIISHGKYSLQEIKSDLVALEWKYEGRILKAIFNQSTEDYLLEKEAVALASNCQELENQLVISPDGFVIF (SEQ ID NO:231)
>orf01442
MGQEIKLIRKQFRSTRQEEKQIKEMMREQKVDSFSEFLRQNLLKKNYQDRIFESWFSLWQSQKFEQISRDVYEVLVVARENHQVTQEHVSILLTCVQELIAEVNQVQPLSREFREKYMG(SEQ ID NO:232)
>orf01443
MVYRYRTNLKKVFLTDPELHQLNERIAKSNCQNFSVYARKVLLNPNMSFVTINTDTYDQLVFELRRIGNNINQIARAINQSHLISQDQLQELSKGVGELIKEVDKEFQVEVKRLKEFHGSY (SEQ ID NO:233)
>orf01444
MVVTKHFATHGKKYRRRLIKYILNLDKTDNLKLVSDFGMSNYLDFPSHAEMVEMYNVNFTNNDKLYESRNDRQEKHQQTIHAHHIIQSFSPEDNLTPEEINRIGYETMMELTGGRFKFIVATHTDKDHVHNHILINAIDRNSDKKLIWNYALERNLRMISDRISKMTGAKIIEKRYSYRDYKKYKESSHKFELKQRLYFLLQQSKSFDDFLEKAKQLHVQIDFSQKHSRFLMTDRTMIKPIRGRQLSKRDLYDEEFFRTHFAKQEIESRLEFLLNRVNSLEDLITKAKELNLTIDLKQKNVTFILEENNQKISLGHQKISDKKLYDVKFFQDYFKNKEVVASEGLENLQEQYHAFQEERD
KDKVSTEEIEEAFKTFKEK (SEQ ID NO:234)
>orf01446
VELAENQIEKLVDKGVYIKVSFGVKQSGLIFIPNYQLDIMEEENHKKYKVYIRETSSYFVYNKENMDNNCFIKGRTLIRQLSNDSQKLPYRRPTLKSLQEKISEINLMIELSNTNKQYQEIKDELVLEIAEIDMKLEETQEKIATLNKMAEVLINLKSEDHETRKLARYDFSKLNLTESTSLENVNEEIRVLQENLDYYLYEFEKRAIRLEIFVSTLNMEKDVFVIDKF (SEQ ID NO:235)
>orf01447
MAIIKKIVVLYGGLSEEREVSENSAKEISKSLLTLGYDVISIDLSQDCSYEIGEIEKSERGLNKQKVEIGSGIIDVCRKVDIVFLATHGGIGENGKLQAIFDVEKIDYTGNSFLSTAISMDKKLSKIVASSVGIKCASNLMVDRIQANDFPIVIKPIRSGSSKGIKIFQNKKQFDIYYKEHPELGSVFVEKYIKGREFSVGILGETVLPVIEIKVKNGFYDYNNKYTVGAAEEVVPAKISEKLTHTLQKSAYKIHKALGFNVYSRTDFIVDEKGDVYFIESNSLPGMTKTSLLPQEAKAAGIDFPNLCERIIELSREIRSQ (SEQ ID NO:236)
>orf01449
MKIINGICRYIFDSKGFATIEVEIFLDSGDTGIGAAPRGSTTGHYDIQYNEYYPRGNNFSPIPDGNIEFFNENILPRIINREVEDIEDITELDKHLFDIPEIENYGNVAIACSYAVWEAFSKNKKSPLWKLFFEPGSASKGKVKHLVNIIDGKPDSLLAGFEFLLVSEKEITFQSLLEISNIKNELMIKFKNQGFYTSISNQGALIINTDDFYIILDSLLETLKLYKNRYDIGLDLAMTDRYDSSLGIYKVPWCVSQQQTVTEIMDTYCDWGVKYPLVYLEDPFSDEDLDSWRKFQLIKPLKLQVFGDDFYATNLERISQFKDCADGIVIKPNQVGSVSKTLEVMEYAEKSGISMAFSQRTAETENNIISHLAMSVTSSYLKAGGLDRLDRIAKYNEVLRNG (SEQ ID NO:237)
>orf01450
MDNGKISTDGALVNNKNIEVNISSASIRYGISVFEAPKMFLLGSDIYIFRFQDYFNRLKTSCKFLNIELPLDYNKLLYDIKKFIEFVPWQESYALRINVFCPYESELLGEVECALTLSYLDIGIRSKSGVSSKVIKRSNLIRTSNNNLYMIKSPSHYINARKELYSDIEFDDILYVNEKENICELSRSNIFLIKDRTVYTPDLGSGILDGITRKLIIDLCQEHNILIEIKELNYSKLSDFDSAFCVGTTNGITPIRNIDKNIHFDTNNLLLAEIVNYYKEVFSKKGVEKYKEWFVKL (SEQ ID NO:238)
>orf01451
MIKDVNYFNERAQIIRRTINDMIFGAESGHFGASLSSVDFINVIYENYVFPENAEFILSKGHAAPALYAKLIESGVLDKDFIYGFREYRSLLTGHPNHRIPTLKFGLGSLGQGPSIGVGMAWVNKRKKSDKKIFVMLGDGELNEGQVWEAFYTCRNLNLQNLVFIIDRNFLQLDGKCEDVANFPNLAQKISSFLGTNPIEVNGNSYDEILNVLDNIDYSQTNVIISNTTKGKGIEFMEGKTEFHSYNILSSEKEVLYKRAMECLGGDKMA (SEQ ID NO:239)
>orf01452
MRKTFIDELINKNIEEQNIVVLTGDVGRSTYASKFKEIFPENYLNLGICEANIVGISAGIAGTLEFVPFVMLFSKHLILRALEQINDSILMNKKKVILVGGYSGYSASKEGETHQLLNDISILSSFPDISIYCPYDQSSIQTAINESINNDYSSYIRINKNKILNDVVSRYISQGNKSIIISMGYLGSLLSKKYMEDSDIREFADFILVSKIKPIDWEYWENFLRKYETVYIIEENTLTGGLGEQFKNYFFGLGINIISFGIKPHFGETGDYETLLTNEGLSPDKIFEKIRRINHVQSSEKSIWR (SEQ ID NO:240)
>orf01453
MYNLVKRVFGDDNFVKSDGIYLYTDDNKEIIDSCSNSMNVNLGYGVLEIEKVIHEQIRKINFMHTGKGTTYESLMLANRLHDIIKPYGNYKVYFATSGSDCIEAALRISVLYQQKNKRNQESNTRFATFEGSYHGSTLGALSVSGHRKFQKLYNGYIGNCLTIPTRWNLEWEMDFTDITAFVLDSMITNPIGSEMIDRDYLNYLVQICKESGVITIIDEIATSIGRLGCFFGFEDSGISPDIVCISKGLGVGYANIGAVIVKSNIIDSINSADILGHTYNASPIDCKIAMTVLDYIEEHGIFEHVNNLSNYIEYRLKVLKNKLPCISKVSGKGFMWSIHFRNNVNASKVFKQCYNNGLLLLYLEYEEYNHMTFCPPLITSKNQIDSMLDILEKSITEVLHGY (SEQ ID NO:241)
>orf01455
MDIKKLGFIIITNETFAFNLVEEVVEKLSILDVNRAIIKLLPKVDETTLIKHYREHLINWNGKEGKSQVPSLGWPAVRNRFSSSVVLILLEGNKQSLSDEILQIKGNTYPERCRQNQIRIKGFNPTYNLMHSSDSAEEALKEAELYFSKKELESFFCGEIGITFEEILNFILELDSIQKGGIRSNSDLENKVNKMCYWKQRIFDLFSSKTKEESRKISEELNFIKSNRSPEDVILYLENESILWSKLERDLYLSHLIIEEISGE (SEQ ID NO:242)
>orf01456
MVKIYEETLRDGIQSSELSMLNTEDKKKVIKNFDNAGIEGCCIGFINSSKKENDEINELFQFIITNKLKIVPAVLCRLLIEDFKEIKNIVQCNLFNQLKVYTYIAVSPIRMKVENWSEEIIIEKIIDFLNYCQMENAKITIAFEDASRADKSFLKKLIEIINNYPLVKSVVIADTVGTCNYNSTRDLVYFFRKNLFLSKTIEWHGHNDLGLAVSNALSAIESGADIVHTCTLGIGERCGNTSTEQLIINLIVSECGRRATYKLRYLYDVAKILENKSQFPISPKTPCFGMDSFFTCAGTHASSLYKSYLKNDNNFSIIFSPYDSNIFGREVTVGINSQSGRSSIEYISRKYGLELSQENITSILGLVKQNDLFFREEEFVDYVKQKRVERLDYK (SEQ ID NO:243)
>orf01457
MLNKKEWKDLTISKSENDSIISCLYNWGFELPKSKDYTFSLTWYGYNNVSYKYCEKECLTLGELSLLFPQLNDLFWENQKDNIEGRRNFYKNCKNLLGKFNIYSVRDILEISDRDLQFALTNRVGSKEFENSIVSLMYINKLSDDNKAENTLEKKLYFSINNEDSIIPYYENTLGMTAGNYYFTASHRTFFSSTFPDFFKIDVPYKISNSVRNLTRNELRIRASEIIKSELANSENPILKIFDDYAYVRIGKRNMVLRNKFFNGFIAPLFAFFQQRIFDNGKSFFTHFYDLRKNTSQNIPDYLLLNILFPLIDCFEFFLNVSIKYKRLIFPYDFHMQNIMISIYDNQIGFIFQDFDMSKELTVTSFYEQLDFFFEKFVVTNILESINELKLDEKKYIMREMKKIINTKIIIKAESYFGFPSPSEVIFPDFMKQYLRENQNVKERTIKYRSYLR (SEQ ID NO:244)
>orf01458
LFIDILKKKKMKIIEAHDALSAYIIDQTKYSADKGDIFEYDGIWISSLCDSIIRGKPDMEVVNITDRLQTIDDILCVTEKHIIVDIDSGGTLAQTASLIEKITLRKIAGIVIEDKIGQKYNSLFGEKNTQLQDSTESFERKIRVAIKSRKNSKTAIIARIESLVLGKSLEDLLYRGNCYIKSGAEALVIHALNSRLDDLYQALKYFKRYFPEIPLIVIPTDFPYVCAEELFESGADLIIYANHLLRSIIKPMEYIAKSILIDGYSNGVENKLLPISEILNYIPLLEEIDEK (SEQ ID NO:245)
>orf01459
MKSKDLISSLKLNGVTRFFGVPDTFLYPITSQLANDELIIVPNEGNAVSMAFGYSITTSRMPVIFLQNSGLGNILDPVQSLVGQAVYNHPMLYIIGFRGGTDDAPQHSECGKVTKKLLEISQFDIYEKDYFTNALDTDIRRIIDNIKKKNKSAAILVDKEFFSDIVNIKNYQISDYNDILNKICAVLEKEKDSIVLTSTGYITRHMENVKDTYKKNFVHIPMPGSLGQTISFGAGICMGTSINNKKKKIYIIDGDGSLFMHMGSLALFDYYNLDVTYILLNNYKHLSVGGENTLASSCNFFKLAESMNFDNIYSSDNLEVVDFEEKLNQQGKNFIEIICSNEVTYSLLERMDFNFEEIKQFNMEN (SEQ ID NO:246)
>orf01460
MPDRYVEYSQNDYDNIMKLDTLFISQLDSKYKVEYHKLPSLISGEVLKRCGYFSTMPNQLSKVDIIDVSQLESLGNNRELEEIKYSSSENYFLTPAACLHFYPMLEHKEVKECIYSSLVDVFRYENGNFTMGTRQWEFTVREFLAIGNPKFVEEFLEDLKEKFLTIALRFDSTAKIQNACDNFYPTNLNKVKQKFQKYNNLKFELIVHISEKEVSVASFNYHNNHFSKEFNFDSNDTIVTGCVGLGIDRWISLIKESEKDYKL (SEQ ID NO:247)
>orf01461
MRKQIISLFHAYLNENTQDWVEIDENFLLYQHLDMFYYYLCKKNNIEPPLMEKFEVRKKVIESRNRQYIKVARDLNAIFEKQSIQVAFLKGIQTSEKYYEEPWIRYYSDLDILVAREMIPGVEKLFYQLGYVFGHLKDNGEIHHATREEILYQKLFTHEIYNLVKKENDNVFINVDINFLFSWKGLSDSEIEFNDIKNDIIYDSKIKINTLDKVMNFIHICCHLYNEARYFALNRSFLGGDPREIQLSRVFEIALILKDLKQEEFNSVVYSSRKLNCDNKVFASIGVTKQLLELNLSILDNYTEELKVKDEFNSYIDKDQKVKYWPISIEDRVFDLKLKVKTCDKIFIN (SEQ ID NO:248)
>orf01462
LKYDSEIKNVIKDYIDFEFDINEIDNNVALTDYGVDSLKYISIILALEDYFKIEIPDNYLVFSKSNTIKKLNSIIEELL (SEQ ID NO:249)
>orf01463
MKYINSSLKPVDNYYSLEEVTCFDRLLGIFLNSINNSYCDLFYMINNFYRCYKVDDSKNEYDFEREMHILRNFFCIESEKINFSDEYVLTEFIVKNINKYGSVFVPINLKEIYYSAYYKEQDWPHLFLINGYDKEKELFYVIDATQIYSDILTEQNFCITFEILERAYKSYFHQTILNKEKEYIFVVTTVVNELTETEIFHETFKYLFNQSSISSRELEIVYKILTNRDFTLLANLKNITKKKKLFFTIFFEKLRVYELISNKELLYLSRTVETILEEWTIFINRCIKNILKNDTKLVNYDFFVLEEKKIFDFFSKQASRYKERLFEIISSNSNRYNEVFECIQNSGKIITITEDNSRKFRFSFIGDKIYNSWFNDDSPKVRINQDLNYLIVKINVIHKEKDSKFVAGLYCFIDDNLYYFGLDSNYFINLDLMGKTPEIFRKRLETSEVYLKISRQEDSCKFSYSVDGITYFYATSLDIRSCHFSCGIGCKTYSKPTPLCIDFEDLLIG (SEQ ID NO:250)
>orf01464
MAMIEVSCLRKDFVKIIKEPGLKGAIRSFIHPEKQIFEAVKDLSFEVPKGQILGFIGANGAGKSTTIKMLTGILKPTSGFCRINGKIPQENRQDYVKDIGVVFGQRTQLWWDLALQETYSVLKEIYDVPDAVFQKRMDFLNDVLDLKEFIKDPVRTLSLGQRMRADIAASLLHNPKVLFLDEPTIGLDVSVKDNIRRAITQINQEEETTILLTTHDLSDIEQLCDRIFMIDKGQEIFDGTVNQLKETFGKMKTLTFELRPGQNHVVSQFVGISDIHVTRKELLLNIQYDSSRYQTADIIQKTLSDFAIRDLKMTDVNIEDIIRRFYRKEL (SEQ ID NO:251)
>orf01466
MVKLWKRYKLFISAGMQELITYRVNFFLYRIGDVMGAFVAFYLWRAVFSSSHQSLIRGFSLSDMTFYIIMSFVTNLLTKSDSSFMIGEEVKDGSIIMRLLRPVHFAASYLFTEIGFRWLVFVSVGLPFLIIIIGLKLLSGQPILQISLMTLTYLLSLILAFLINFFFNICFGFSAFVFKNLWGSNLLKNSLVAFMSGSLIPLSFFPKIIADILYFLPFSSLIYTPVMIIVGKYNISQMIQAVLLQLFWLLVMIALSQIIWKRVQSHITIQGG (SEQ ID NO:252)
>orf01467
LLVGISLLSATVTSLTWTWTKVFIFLISIPFATLIYTSLKIATASIAFWTKQSGAVIYIFYMFNDFAKYPVVIYNSFLRWLISFIIPFAFTAYYPASYFLKDKDGLFNIGGLILISLIFFTISLKLWNKDLDAYESSGS (SEQ ID NO:253)
>orf01468
MDEVFKFYCTNIIRVIFIKWLFCLKSMNHTNLTLVTTTFKQFNSTSLGNHHAMV (SEQ ID NO:254)
>orf01469
MQAFDGSTEFIHNGKGEKDSSKYQINGAGRCEVPVYVPKTDWDWIIYPQGIYD (SEQ ID NO:255)
>orf01471
MRVKRDYPNYKIYITENGLGYKDEFVDNTVYDDGRIDYDYVKKHLEVI (SEQ ID NO:256)
>orf01472
MEWIINIIRGIRYIGREAGVTFVEPSQTSIKSFDITIL (SEQ ID NO:257)
>orf01473
VRHYKDYISQRSDWELAGIYADEGISGTQVGKRQDFQRLINDCVNGEIDYIVTKAIARFARNTLDTLKYVRMLNDMQIGVYFEEENIDILTMDGELLLTILSSVAQQEVENTSAHVKKGLKMKMERGELVGFQGCLGYDYDVETKQISINKKESKIVRYIFERYLEGIGGKVIARELDELGYKSPRGLEHWNDTTVLGIIKNEKYKGDILIGKTFTVDPISKRRLSNFGEEDKYYIKDNHEPIISK (SEQ ID NO:258)
>orf01474
MYAFSSMLECGFCGSILSRRSCHCRSDYRKVVWHCVTSIKKGKKFCKHSKGLEKLAIEGAFMEAYRQLYHSNENLMTDLLETIESELNDNSLNKELKRITNKLRTLLKKEENLVNLRLEGKISDTIYDEKYNEISSEKEFLAEEKVNIETTLKSEIDVKKRLTEFKHLLSSQKMLTEFDRAVFESIVEKIIVGGVNSNGEIDPAMLTIIFKTGETQNKDGKQFKSKRKNAKLETDKLCPQNSDEDKKLYSQGTDNTRGVCSVAGSILVSQ (SEQ ID NO:259)
>orf01475
MGGNPPIKKYSIVDKIVLSTKIKRIIIFTVFRENWEPYMKKYTEVFQSQFPNLNIDYLLLDTEQIDLDSYLDADLIIIGGGNTEKYIATYVNQEFKNYIDHMLNKGAKVIGFSAGALLLGEKVYVSPNDNSDHQIKIKDGLGLFSQFLISVYYDSWNDKANKDRAEELVNVPIIPLNDHSCLVLDKLGNIIEKID (SEQ ID NO:260)
>orf01476
MDFSSKIAINTGWSDDKKYCVTDQNQQKYFLRVSDKEKLDSKKFEFDMMGKVASLGVPMCKPISIELCDDEVHSLHEWIDGRDAIDSILTYSENQQYTYGVEAGKILRKIHTIPATEVCEDWEIFFNLKIDDKISNEMIW (SEQ ID NO:261)
>orf01514
LCSALKNSYDIELIKVLSNKAHLYLPIETVTPQTVSTS (SEQ ID NO:262)
>orf01537
LVKNPFIEIERIERTWLTAHLYRKFDKYFHKTRPPIKVFEHLIGGLFIMKTFT (SEQ ID NO:263) >orf01538
MIKIYFTKFSENHNPFCKIFEIIFTSLIFQSILNKNKKNPLHQGETNVV (SEQ ID NO:264)
>orf01545
MSQVKGLCVLDVDGTLILEEVIDLLGREAGHEAEISQITSRAMRGELVFESSLRKRVSLLEGLPILVFDNVFNSIHLSLNVPEFISILQKNGILVGLVSGGFTPIVGEISKIPWYCLFHCQPA (SEQ ID NO:265)
>orf01546
MLKSAELGIAFCSKEMLKKEIPHHVDKRDFLEVLPLIDCLE (SEQ ID NO:266)
>orf01551
MFGNWFFKAFVCSLERLAQDRTMNWFSCIGNKNTVAFVPILIGCFA (SEQ ID NO:267)
>orf01565
MEKYFGEKQERFSFRKLSVGLVSATISSLFFMSVLASSSVDAQETAGVHYKYVADSELSSEEKKQLVYDIPTYVENDDETYYLVYKLNSQNQLAELPNTGSKNERQALVAGASLAALGILIFAVSKKKVKNKTVLHLVLVAGIGNGVLVSVHALENHLLLNYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSDFEVSNQEKSAATPTKQQKVDYNVTPNFVDHPSTVQAIQEQTPVSSTKPTEVQVVEKPFSTKLINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGVNTLNPQDEVLSGQLNKPELLYREETIETKIDFQEEIQENPDLAEGTVRVKQEGKLGKKVEIVRIFSVNKEEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQPETGVEHKDVQSGAIVEPAIQPELPEAVVSDKGVPEVQPALSEAVVTDKGETEVQPESSDTVVSDKGEPKQVAPLPEYKGNIEQVKPETPVEKTKEQGPEKTEEVPVKPTEETPVNPNEGTTEGTSIQGAENPVQPAEESTTNSEKVSPDTSSENTGEVSNKPSDSKPPVEESNQPENSGNTTSENGQTEPEPSNGNSTENVSTKSNTSNSNGNEEIKQENELDPDKKVEDPEKTLELRNVSDLELYSLSNGTYKQHISLEQVPSNPNSYFVKVKSSSFKDVYLPVASISEERKNDKILYKITAKVEKLQQEIESRYKDNFTFYLAKKGTEETTNFTSFSNLVKAINQNLSGTYHLGASLNANEVELSTDDKSYIKGTFTGQLIGEKDGKHYAIYNLKKPLFESLRGATIEKLSLKNVSISGKDDIGSLANEAQNNTKIKQVHVDGVLAGERGIGGLLAKADQSSITESSFKGRIVNTYETTASYNIGGLVGHLTGSKASLTKSKATVVISSNTNSSDQTVGGIAGLVDKDAHIQNSYSEGDINNSQRFGKVAGIAGNLWDRESNSENHAGRLTNVLSDVNVTNGNAISGYHYNGMKITDAFSNKANKVFNVTLEKDEVVSKESFEERGTMLDASQIASKKAEINLLTPPIVKPLSTSGKKDSDFSKIAHYQANRALVYKNIEKLLPFYNKATIVKYGNLVKENSILYQKELLSAVMMKDDQVITDIISNKQTANKLLLHYKDHSSEKFDLRYQADFAKLAEYSLGDTGLLYTPNQFLYDQDSIINQVLPELQQVAYDSEAIRKTLGISPEVKQTELYMEDQFTKTKQDLANSLKKLLSADAGLAGDNPVTRGYLVDKIKNNKEALLLGLTYLERWYNFSYGQVNVKNLVMYHLDFFGKGNTSPLDTLIELGKSGFNNLLAKNNVDTYAISLASHHGTTDLFSTLENYRKVFLPDKTNNDWFKSQTKAYIVEEKSNIEEVKTKQGLVGTKYSIGVYDRITSDSWKYRNMVLPLLTLPERSVFVISTISSLGFGAYDRYRNKEHQANGDLNSFVEKSAHETAERQRDHYDYWYRILDEKGREKLYRNILLYDAYKFGTDHTEGKATEVANFDNPNPAMKHFFGPVGNKVGHNGHGAYATGDAVYYMGYRMLDKDGAITYTHEMTHDSDQDIYLGGYGRRSGLGPEFFAKGLLQAPDQPSDATITINSILKHKTSDSTEGQRLQVLDPTTRFNDAADLQNYVHNMFDVVYMLEYLEGQSIVKQLDAYQKMTALRKIENKYVKDPADGNDVYATNVVKNLTEDEAKKLTSFDSLIDNNILSAREYKAGTYERNGYFTIKLFAPIFSALSSEKGTPGDLMGRRIAYELLAAKGFKDGMVPYISNQYEEDAKQQGQTINLYGKERGLVTDELVLKKVFDGKYKTWAEFKTAMYQERVDQFGNLKQVTFKDPTKRWPSYGTKTINNVDELQKLMDEAVLQDATGTRWSNYN PEIDSAVHKLKRAIFKAYLAQTNDFRSSIFENKK(SEQ ID NO:268)
>orf01580
VLGGRANSVTSCTTNSHWNLTFTTKHVTCFSSLVDDIVHGNNREVHEGHIDDWTKSCHGCSCCSSRDGSFRNRTVTDTFWTKFFKHSNRSTEVSSEDTDVFSHQEHIFIATHFLRHSKDNGVTEGHCFCFHFISFSLVCVNIFKG (SEQ ID NO:269)
>orf01581
MDMFYIGHFLDIRRDTVTVVNAIENDWQVPDRSHVHCFVENTFIGRTISKEADNDFTGILHLLTEGCTDSDPHTTTYDTIGTKVPSIKVSDMHRSTFPFTGSSVFTKDFSHHSVEVNPFSNSLPVSTVV (SEQ ID NO:270)
>orf01602
MKINKKYLVGSAAALILSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIFSEELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVRTKEEINRQKQEHSQHREGGTSTNDGAVAFARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRENLSNLRTYRRQNSDNTPRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSLQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVSSDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEKLREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQDNNTIMAEAEKLLALLKESK (SEQ ID NO:271)
>orf01625
MDESFDDIAHEQFTSNLTTKTDNVGVQLFFSIKGCCHITNQGRANTWNFIYSVVDTNTSTTDTYSKISLAASYSFPYFFTKDWVVSPCMVICTKVNDFISF (SEQ ID NO:272)
>orf01629
MLDFQDRSPWLEGQKEIDLSYDLFSTDAVTLDELQSRTIALRSLKHDKGLKVHFAEFPNLIIWSTLNKGPFITFEPWSGLSTFLEEGDHLEDKKNVCLLEANQVEELGFEIEVL (SEQ ID NO:273)
>orf01634
MKIKEQTRKLAAGCSKHSFEVVDETDEVSNHTYGKATLTWFEEIFEEYKN (SEQ ID NO:274)
>orf01643
LIESQVFSSLQVCCLNLCHLKFQHFDTCLVFLLVFLDFQNLLAHFPIGIKTRLIGFFQVPKSGITKFIQHLDMQLGTH (SEQ ID NO:275)
>orf01644
MVMLTMNIYKMLPNSSQNRQINHLTIYTADTTTILQDFPTDDNFIT (SEQ ID NO:276)
>orf01645
MTNNICRRTSSQHHIHGINDNRLPCTRFTSQDSHPLFKIEGNSLNNGKVFYRNFK (SEQ ID NO:277)
>orf01656
MPHTRDNWQTRFKNSSYHNFFVKGPEILNRTTSTTNNEQIQIVPLISTRNISSNFLRSPFTLNLGRIKKDVNTWESPADGRDNISNNGSTTAGYYTNSLRKLGQSLLEAFLKQAFFCQFFLKLFKLNRKRPNPIRLSFFNDDGVATTWFIDLYTPNHIDLHSFFQVKP (SEQ ID NO:278)
>orf01660
LAIIRNRTCSLKLINGHLTFWTLHFLTSTRILIELATINLNCRIHRGNLGNRPSQASNRFINKLFIQGRQNRGFCDHFPTSILSRRGIAQSNFPLIDLTLVLHKLDHACRLANRNRQNTHHIRIQGSTMTNFLGSQNLTQFKNRIMRGHSCFFF (SEQ ID NO:279)
>orf01667
MFALRKPGNIYTRITNPTTAALEGGVEALATASGMTAVTYTILAIAHAGDHVVAASTIYGGTFNLLKEPLPRYGITTTFVDIDNLEEVEAAIKDNTKLVLIETLGNPLINIPDLEKLAEIAHKHQIPLVSDNTFATPYLINVFSHGVDIAIHSATKFIGGHGTTIGGIIVDSGSFDWTASGKFPQFVDEGPSCHNLSYTRDVGAAAFIIAVRVQLLRDTGAALSPFNAFLLLQRLETLSLRVERHVQNAETIVDFLVNHPKVEKVNYPKLADSPYHALAEKYLPKGVGSIFTFHVKGGEAEARKVIDNLEIFSDLANVADAKSLVVHPATTTHGQLSDKDLEAAGVTPNQIRLSIGLENVEDLIEDLRLALEKI (SEQ ID NO:280)
>orf01668
MTRDFKFETLQLHAVQVVAPATKSRAVPIYQTTFFVFDDT (SEQ ID NO:281)
>orf01672
MSQKKNNKKKNKRKNLLTNILAGFLILLSLALIFNTQIRNIFIVWNTNKYQVSQVSKEKLEENQDTEGNFDFDSVKAISSEAVLTSQWDAQKLPVIGGIAIPELEMNLPIFKGLDNVNLFYGAGTMKREQVMGEGNYSLASHHIFGVDNANKMLFSPLDNAKNGMKIYLTDKNKVYTYEIREVKRVTPDRVDEVDDRDGVNEITLVTCEDLAATERIIVKGDLKETKDYSQTSNEILTAFNQPYKQFY (SEQ ID NO:282)
>orf01679
MRWNIGCHPNRDTSCSINQKVWKTRWQDQGFPFIGIIVINEINCIFVDITKHFQSNLAHTCLGITLSGSTISIHGTKIPMTIYKHVTVAPPLSHTDHGFINRGIPVWVIFTHDIPCNTSRFFMGFVWGHTQFIHSVENATVNRF (SEQ ID NO:283)
>orf01683
LKKKWFFADYYDTTIILLALISVILVLLGFAEMIDLDNPPYSIIDLVIWGVFVIDYSWRFFITKRKWRFILENIFDLLAILPLNAIFTVFRLGRIFRLVKLTKLLKLTRLLRIIGLTGKLERKISRFLRTNGLIYILYVNIFIILVGSSILSVVEEKSFSDSLWWALVTVTTVGYGDIVPVSLLGKWLAVLLMLVGIGTIGMLTSALTNFFVKDNPDEQIKLDKLKDELSS (SEQ ID NO:284)
>orf01693
VVDFKQTRQDPHDITIYSWLRQVKSNTGNGSCCVRSNPFQAGNSFIGIWKLATKVSHNLLGCSLHIANSRIITQALPSFQ (SEQ ID NO:285)
>orf01698
LINSQLIPLVQVVVNQGRKGIVGSCNSMHISSKVEVDVFHWQNLCIPTTSSTTLDPHDWTKRRFADSNHGFLANLVQGIRKTNGKRRLSFTCRCWVDGSNQDQFTDWIALNCTNFIKAEFSLVLSVQLQIVVRNTKFLYNINNWLQLNTLCDFNICFHSKFL (SEQ ID NO:286)
>orf01711
MLFIIGHLNFPTAGSFIDSTLHRLGNRVCIHDDMAFTVTSSTSNSLDESTFVAKETFLVSIENSYEAHFRNVNSFTEQVNSDQDIKDTQAQVTDNLRPFQGLDIRVHVLDLDTHFLEVVGQILCHFLGQSCDKGTLIFFNAGIDFTQEVINLSHSRTDFHLWIQESRWTNDLLNHCLGLFIFIVTRCR(SEQID NO:287)
>orf01712
MNVTLKLLPTERTIVQSRRQTETIINQHFFTRTVSIVHALDLPYGHMTLVNHNQEIIWEEVEKRIRRLSFAPSIHVARIIFNPIGIAHLTQHFDIILCPLFQTLGFKQFTFLFKDS (SEQ ID NO:288)
>orf01713
MIHFSQHLTCQSLNFTNTVNFVSKKFYSKGMFISGSWENLYHIPTNAKSSALEINIIAFKLNIDQVIQEFITRNL (SEQ ID NO:289)
>orf01714
VAKLVNLVIDRTILLNIGIARRDIGLWLVIIIVGYEILNCIFREKFLKLPIELTSQSFIVGNNQSWFIDFRNDLTHSIRLPCSSRPHQNLSFFSPLNVIHQLLDSLGLIS (SEQ ID NO:290)
>orf01734
MNITQTDFLAVNLVFAISTTIDMAFHPDFLTCILDKSVMIIQSHNYRSIIERFATFGSSKDDIRHLAPTETLDTRLPQSPTQTFCNIGLSRSIGSNNCRHTLVKNDLGLISKRLEPLNFDFL (SEQ ID NO:291)
>orf01736
MGFIVCNHLKLACFNLRNHDLIDKFLDLGHILIQKKGTKKGFKGITKNGVTIATTRFFFPFTQLDKLVKLAITRKTS (SEQ ID NO:292)
>orf01748
LFTCFSKLDNKTASTTYISHKFFTAIPVCFEFFKGFWFPRKDTTKKNIFIPMFLVECFNFWVELR(SEQ ID NO:293)
>orf01753
VVRELTGEIYFGDHILEERKARDINDYSYEEVERIIRKAFEIARNRRKIVTSIDKQNVLATSKLWRKVAEEVAQDFSDVTLEHQLVDSAAMLMITNPAKFDVIVTENLFGDILSDESSVLSGTLGVMPSASHSENGPSLYEPIHGSAPDIAGQGIANPISMILSVVMMLRDSFGRYEDTERIKRAVETSLAAGILTRDIGGQASTKEMMEAIIARL (SEQ ID NO:294)
>orf01754
MAKKIVALVGDGIGPEIMEAGLEVLEALAEKTGFDYEIDRRPFGGADIDAAGPPLPDETLKASREADAILLVAIGSPQYDGVAVRPEQGLMALRKNSIFTLIFVL (SEQ ID NO:295)
>orf01758
LANIESHCNFFQSSIFSSLPNTIDSLFNTSCTILDSSKAICHCHSEVIMTVRRIDDLTIRLDILNQVFEDGTTFLWCGKSYIF (SEQ ID NO:296)
>orf01768
MPRNRFSFTVRVTREKNFISFFSFFFQVIDKRAFSSDIDILRFIIIFNIDGHTGFLQITDMPDTG (SEQ ID NO:297)
>orf01772
MKIKAQTRKLATGCSKHCFEVVDKTDEVSSKYCFEVADGS (SEQ ID NO:298)
>orf01781
VHAHTDKLCNGCNRIFNSIISHHTIFRERNKLSHKAIKSTRQEMGPCHVVFIEFFITLHRRLIGNHDNFLTNLVGSGRVRNDGST (SEQ ID NO:299)
>orf01782
VNHCHWKLFIQNLGITFSLIVTLIRMTDSHVVGTDKDMILLVNSLFLIFDIDKLRLS(SEQ ID NO:300)
>orf01784
VGNNDILWSKRTISINGFNDFLNTCIAVSTTLCNDDTFLIKRKIFIYKIFCMRNPVSMNTNYNFFNTWLQDKFFNCMNQNRSIT (SEQ ID NO:301)
>orf01794
LVAPVASSTRFFKNNDSLTSWNNGFIIITINTIISYQRISKGQDLSIIRLVCNGFLVAGHPCIKDDFACYINICSEGLAFKNCAIF (SEQ ID NO:302)
>orf01796
VVCYFYITIDWSWVHEDCCFFQTIVTFLSQAMLGMVVFF (SEQ ID NO:303)
>orf01797
MAFVLHTEKHHDINLINDFINGYKLSIVCKLLTSPFLRSREKEFSSQAFQNLHIGFGNA(SEQ ID NO:304)
>orf01798
VVQVTCNSNFKTLKVAKFLINGHQIKQALARVLARTISTIDDGSRNRWTSNQFSIVVDLWMANHTDIHS (SEQ ID NO:305)
>orf01799
MCPCRILKEEIGNNRMVFIGNLGSIFKLNSSLDQFHYLINSEVFHGHHMVQCLLIF (SEQID NO:306)
>orf01824
VQFHLIIFQNLFCSLDIVIDSLTTNTKLLSYLSKTVIISVVKLYIIHLLICQKRRIKFKERIHTIGFFDSKLTIXXXFHSLIN (SEQ ID NO:307)
>orf01826
MHFHIIKLVNHFQLLIKLNRISHPNLHIKSSFLSLVLLFYQKEQDFAIMVI (SEQ ID NO:308)
>orf01840
MTGKKGFLFLNCHICMVTTTTCFLKERVESELLIFFYISPNRCLITV (SEQ ID NO:309)
>orf01843
LNRSILDNITLKHEVTSQKIEEVCKAVQIYDEIMAMPMKFNTIISEMGSNISGGQRQRIALARALINNPSIVILDEATSALDTINEERITKYIKSQGCTQIIVAHRLSTIKDADIIVVMKGGKIVESGNHKYLMDLGGEYYSLYTKRK (SEQ ID NO:310)
>orf01845
LFKGGVTISRTPLSSEDTVMIDATEVKINRPKKKTISE (SEQ ID NO:311)
>orf01848
MAGKKDFLFLNCHICMVTTTTCFLKERVESELLIFFYILLNRCLITV (SEQ ID NO:312)
>orf01850
MYQDEAGFGRISKLGSCWSPIGVGPHVHSHYIREFHYCYGAVDAHTGESFFLIAGGCNTEWMNSFLEELSQAYPDDYLLLVMDNAIWHKSSTLKIPTNIGFTFIPPYTPEMNPLNKCGKRFVNVDLRIRPFELWKMS (SEQ ID NO:313)
>orf01852
LLQSPYAIDTINLKKDFLEKPIDIEKFKAFLEKEEIPLAIAWQGDSLHFYTKDRSILDNHLDHLLEKMVNDPEKLSDFSMDKSLDDTIDEAKSQITFK (SEQ ID NO:314)
>orf01853
MKVVNLYDLKQMGNKGGCTIQLIHHFPFGMGLGHLKKDYIEFKRVGIFDGKAVEVTLREPYSRDILQVVKSIKQRQKLIAYRYKEGKLLFVKKEV (SEQ ID NO:315)
>orf01876
MKPSGQEFPHPIFYSLFCFYTDTLGINSQVMQFSFERLLFQFCLNCWHLVMRSQNHNCRPSTRKVGCIGPIFFGHLLNHRKFSYQVLTIALMEEISLDCLPSGHHVSCQQGSNRYIGDRTCSNSFLIRQFFRQDTTAVAST (SEQ ID NO:316)
>orf01878
LQVWYNLQSDFEQEITLIMWNPFANLVFNQPLISFFADLNLKILGYSYTDRVKTWPDIGTGCRYNNLHLILLAP (SEQ ID NO:317)
>orf01909
LRQNRCYNCFHDHSCSWKSSRITSLHGCLVRFVGFDIHTHKRFIKSRNGFHDPTNNDGLPISHTTFKTT (SEQ ID NO:318)
>orf01910
LAAFTITSLKAKTKFHPFKGIDRDNSLSQSCIQFSIPLDIGTKTNWNASDDCLHNPTDGITITFDLVNIVLDFLFSFLVDNRNFRLGSSLLNFSDCQIFRNIDFLATKDHDMVGNLHIQLSQETFGYCTNCHPHGGFTS (SEQ ID NO:319)
>orf01911
MTWARMSNFPLAFKAVFNVLRRHDVQPFLVVLIDDIHSNRRPCRLPVANARSKDNLVILNLHTTTTTVATLTASKVLIDILSCQWKSSWNSLNNSC (SEQ ID NO:320)
>orf01913
MHDLAITGSRFDGMANSVAKIEVKTNTIVQLIFNHHLALHLTRMFNQGLCMFQNTLNRTIQSRQESPQFWILNQAILDNFTHPFNQLSFSEGFKNKWINQNPIWLGKGPHHIFSKWCVNTCLSTDRRINLSCQTSRNLNKVNTPHIGRGYKSSQVPNNATTKSNDSIATSQTLLD (SEQ ID NO:321)
>orf01915
MVICHNDYLLRLPEFSQPLTSLGHTTFFNLNIIRMMRNIDSDFHRRVSLSLLVFFC (SEQ ID NO:322)
>orf01920
LIEGHLVFADKPAQALVLLRKVGSPKKVSFLTLHLYFLILKIDILKITGF (SEQ ID NO:323)
>orf01951
MAVTKSQVFSRQGFDFSILGQDLTRLQDVSNLATIGTRIHKDSTANASWNTTSKLKAS(SEQ ID NO:324)
>orf01952
MTEGNASCFNQVSPSFCFNGIAINRNVIELVTQDDKSTNPTITNDDIACIAKNHPRDIFLVGKFHNASQLKTISWKDQIISLSTYFCITIAMQGFLKTDINSF (SEQ ID NO:325)
>orf01965
MRLRDLRRVDFPDPDGPIKAVISLGWKDRETLFKAFFLL (SEQ ID NO:326)
>orf01968
LKNHSNVFTHFINVDFWTVDINSTIENLPSYFSNINGIIHAIETA (SEQ ID NO:327)
>orf01969
LHINPLNGFIFTIVNMDILSRKGYFFFRKGKDMLLIPVIC (SEQ ID NO:328)
>orf01981
MTIHIQVVKTNMVILADRLFQGFILRSTDKFFIKIRLVRSHNLRFNSMDFSTVAVHENKGRHHMDELLPRFIINSKATVAKKSIVAQGFRFDGNFFRKTRQTNHLNIIFCDNPDQIIVFQNGLITNSQFNRLHP (SEQ ID NO:329)
>orf01988
LGHSKAEEHETICSHPFDDHTTETIPNQVKGRDMTSSETLPFPSKNQNQGKAKQNP (SEQ ID NO:330)
>orf01991
LLLSCRKVIVCFIFSSTWNKNFFNLAFSWNFDNCIRGFFSINSNLFGNMTSLWINIVGPCRGYIAILSVNCNRIFTTVFCFIFFKTNSRT (SEQ ID NO:331)
>orf01992
MVKRRIRRGTREPEKVVVPEQSSIPSYPVSVTSNQGTDVAVEPAKAVAPTTGWKQENGMWYFYNTDGSMATGWVQVNGSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVR (SEQ ID NO:332)
>orf02006
MRFIVGRFTSFSLGIEFSPTSKLDDLLFKIAFLMILATWIKARKTKGAT (SEQ ID NO:333)
>orf02010
MANDNKSHYLIYRVLGISFEEGENIDLYQNKGRFLYKYAGSFLEEAAVLSFNEKFGTENT(SEQ ID NO:334)
>orf02019
LVNCKPLEAYRQLEEAELVGCWVHVRRKFFEATPKQADKSSLGAKGLAYCDQLFALERDWETLSADERLQKRQEELQPLMEDFFAWCRRQSVLSGSKLGRAIEYSLKYEETFKTILKDGHLVLSNNLAERAIKSLVMGRSKRVQWTLLA (SEQ ID NO:335)
>orf02022
MFPVETEEITYKRKKSKGKCQALLAQFDSEEVHHQVEESICSDCQGDLKEIGATLQRQELVFIPAQLKRIDHIQHAYKCQACSDKNPSDKIVKAPIPKAPL (SEQ ID NO:336)
>orf02023
LKIIQQQSATIDSLTNELALLREQVAYLTQKLYGKSSEKSVCPSGQLNLFEEESPSEEDGDVPS (SEQ ID NO:337)
>orf02024
LTIPVKDFKAVFTSTTKEKDLTGERIQFKVGFNQISQ (SEQ ID NO:338)
>orf02037
LVCQTIKYWHKFHLHIGRCKLLIGLIPVLNFFIRADIDCLLVLLSLIDRQNGKQFNLCQWIIASNGLNDSFEIIESLIHRNILSDIICPNQKKNFIYCSTI (SEQ ID NO:339)
>orf02042
MIFSCSDSCFSIILLDGDIHENTTFSPLSILFISHRFNSLIGNEVPH (SEQ ID NO:340)
>orf02043
MPSKDNIRSPIDHLVIKSFLFFSWFQSILNTHLRHDNGDICFLLCPFNFSLHLIFV (SEQ ID NO:341)
>orf02045
MVSSKSPIAKSACLVHLLEPRSHILKIFMKVIGTVFFFS (SEQ ID NO:342)
>orf02047
VVEQIPVGHNSGSFFLFLLLRLLLSPLLRNSISFLTSQGIPWKLSNNKTKPIDKPTASKSIATNPLLLHLR (SEQ ID NO:343)
>orf02054
MSCTKIGVPFKKLTKKACKGYKILTRLSSQWQKEAPNQSRSAAIXXHSIH (SEQ ID NO:344)
>orf02075
MNQIQIQIIQAQFFYRFFQSLTSLLIGLLTIPKLGSYEHFFTWNSAIFDCLTNTFFILINRRRINMTITSLQGF (SEQ ID NO:345)
>orf02080
MLLLISLTQLIIFLFFERFNLLLKTFLLVDLKSNKSA (SEQ ID NO:346)
>orf02095
LIFIEYKIADKTITIIGLSRVNVGRRIALLIAYFIAK (SEQ ID NO:347)
>orf02098
VIPRYVTKHQGWDHNPHTITNSDDDPATLVTFRTFKFNVGNCTIPKNDQNGSSQKFSGILQCPCEIHLLDSP (SEQ ID NO:348)
>orf02100
VVPKTATSTETKTITRIIHYVDKVTNQNVKEDVVQPVTLSRTKTENKVTGVVTYGEWTTGNWDEVISGKIDKYKDPDIPTVESQEVTSDSSDKEITVRYDRLSTPDKPTPEKPEIPSPQEPGTPGEPTPEKPIPQPNPEHPSVPTPNPELPNQETPTPDKPTPEPGTPKTETPVNPDPEVPTYETGKREELPNTGTEANATLASAGIMTLLAGLGLGFFKKKEDEK (SEQ ID NO:349)
>orf02105
MAYSTDFKQRALDSIKEGHSHVEAAKFFGVGVRTLFTWEKKDVNKDT (SEQ ID NO:350)
>orf02143
MPILQAKIDSFRPLFDGLTSRFKVGAGLFLKRFSFSRVKQILLALPQFSMMNPIVNG (SEQ ID NO:351)
>orf02158
MAVQANWSFDITHDSSFFFSNQKRGLNFSQMCFKDRRRNGFFDRKIFKFKFNNPIQIF(SEQ ID NO:352)
>orf02160
LFLQIKGIKPNHTIALASYIEQSFFFINKTVHFKIGKQLIRTLQTNPFVIQLNCHLFQGCKKKCSQALSLMVRLDD (SEQ ID NO:353)
>orf02168
MEIVLVSFSISFQHFIIAYCLDFSSAGFRNSQNFSNFC (SEQ ID NO:354)
>orf02180
LNFNHVYFGYDENRPVLKDITCSIFKGQKIAFVGPSGSGKSTIVRLLERFYKPLSGDILMEQSSIYDFNLKEWRSKIAWVSQNNAVLSGSIRDNLCLGLNRLVTDDELMKVLDLVSLGDEIRSMKEGLDTEVGERGRFLSGGQSQRLQIARTYLKDAEILIFDEATANLDADSEYAIISSLYSVLKEKTVVIIAHSLSTVKDVDCIFFLEEGKITGSGTHKELLENHERYARFVQEQMIE(SEQ ID NO:355)
>orf02181
LIYAEKSFFDKSQSGELTSAIVNDMSVIREFLITTFPNIILSLVMVLGSIVVLFSLDWNLSLLLFITLPCMMFIILPLSNISEKYSRRLQKEIGFLTGQLTEKIQEHELIKTNQAEKSVQNVLDNCIERVQNNSLKSDRVTSFETPFALLFIFATIAVMLTYGSYRVSAGYISVGTLVSFLIYLFQLLNPISNIANFVTVYSRSKGS (SEQ ID NO:356)
>orf02183
MKLKLLRVDTKVIMGSFFLVLSSLLALLLPLILKGLIDGSSIENIGSKVFQSFLIFIGQALFSSIGYYLFSQSGEKKIAKIRKKVI (SEQ ID NO:357)
>orf02196
LIRYLDQYEDVILREIKAQFPDVAVDKLMEEYIKAGLILRENKRYYLNFPTLESLDSLELDQEIFVREASPVYQALLEQSFETELRNQINAAILVEKTDFARIKMTLSNYFYKVKQQYPLTEKQQELYDILGDVNPEYALKYMTAFLLKFLKKDHLMQKCRDIFVDSLVVLGYIVQNEDRKYELAIDFDKERLTFYLA (SEQ ID NO:358)
>orf02197
MIGLKEVCRFLTDNTSLSTSMINHPIQINGNMAIVTCGSLDGLSHV (SEQ ID NO:359)
>orf02206
MLINDLTFFIFDISPIQSYKKVRLEITNLWNNTKNATSRGDSC (SEQ ID NO:360)
>orf02208
MRERVRLSGSLFTSLKTREHIKSTMELFHKYVFFLIQEIKIKMINFLKIGDLPTL (SEQ ID NO:361)
>orf02219
MIDHFEIKVKDLQISEGFYRSFLAPLDYKLTFKTSSLISFLSPNSPHPGGDFWLTQGTQDPVHFAFLAENKEEVQACYEAGLEAGGRDNGVPGYRSEHPIYYAAFMIDLDGNNIEVVCHKE (SEQ ID NO:362)
>orf02221
VIVFLSRNKDGNAFCHLDLISIANPVWGWDDDFITWIDHSHKEGIERIFGSRSDCHLI(SEQ ID NO:363)
>orf02224
LSNQFYFSLQTKPILKVKQFLLFQSQMTRVSEILQFSNKL (SEQ ID NO:364)
>orf02227
LFRLGQLISLNVAVHKPIKKFQGWIVLSSLPFQSLDILKFFRRFLSRYLVETLQLTGRIESQGIKHGLTFWF (SEQ ID NO:365)
>orf02229
MEDKEMGFYLMVASMLLGLLALKIGFSQFKEKKDKFLSILTSLAGTALVLVAVWLGWPK(SEQ ID NO:366)
>orf02250
MLDSDIGCSRKNLLGLFWIRRRRNIHIVDRAMEKGISNRAPNKISLKACFFNFF (SEQ ID NO:367)
>orf02278
LLHPFTRNITCDRHILTLLGNLVNFIHIDNATLCTFDVKVSNLQEFEEDIFHVLTHITSLRQSCRIRNSKRYIQALSQGLGKESFP (SEQ ID NO:368)
>orf02279
VEIDAFVVVINRHCQGTLGTILTNYIVVQDMEEFNWFWHLRQVCQDFLNQFFSNDFLSQLHAFITNKSIVASNYFLYFFLVFATK (SEQ ID NO:369)
>orf02285
MNXXGKGEEGEDLVVGVFKWLYSERLRFDTERVGGGGKGK (SEQ ID NO:370)
>orf02291
MHKNFVVVVTNFFTAVQFIQFNKEGTTCHNTTKFFNHLDSCLNSSTCRQKVIYNKNTLTWLNGIRVHSQGIDTVLFFIVSRNNFAWQFTWLTNRRKTNSQLKGNWTTHDKSTSFRSHDHVDFLVSSILNDFTNSVAISISISHQRTNITEGNAFLWIIFNCCNVIF (SEQ ID NO:371)
>orf02299
MLNRQVCFCFVNHISPLNVVIWENLSLEELLYAICICFITHKIAKQTSLTIDNAGIAMNNIR (SEQ ID NO:372)
>orf02304
LDSRFFCTDFFKGRQAKGCSFSCTSLSLTDNILAFKGQRNSLFLDRTSFYKTSFFNFC(SEQ ID NO:373)
>orf02315
LLRKQEREYLRAENAILKKLRELRLKEEKEKEERQKLFKN (SEQ ID NO:374)
>orf02320
MRFLADQDRIQHHRYSWALFDKVQGLLSHTDSREKTNLNSPKFHITQAI (SEQ ID NO:375)
>orf02321
MLKNGIISWKDFKSFFCQGCQTSHCYKPMQVVQGIGSQISRQSTTTKNIISRKC (SEQ ID NO:376)
>orf02324
VTAHRIFGTSSIHSKLIGLAMLGITAMKIICHKLNRNHINIFRRLGIQGKTEFLLIHLIRQVKMNDLSQGMNPTICPTSTVNSNDLPFI (SEQ ID NO:377)
>orf02328
MLARSKNCFMKSLSIFLLIFYFFDSYQISKKRRSLIGL (SEQ ID NO:378)
>orf02339
LEVCIHHHHQISCRILQACIKGCFFAKISRERNIMDCRILLPIGL (SEQ ID NO:379)
>orf02345
LTGNVICHPKLPDKISVKCLYSSKIQFKPRQRRLAMGMVTDFVSSIHNLKAAL(SEQ ID NO:380)
>orf02356
MGRKPKKRPEERTELERLQAENEYLRAENAILKKLRELRLKEEKEKEERQKLFKN (SEQ ID NO:381)
>orf02360MDEIKNFRQWGSKTPGHPEVTHTSGVDATSGPLGQGISTAVGFAQAERFLAAKYNKDGFPIFDHYTYVIAGDGDFMEGVSAEAASYAGHQALDKLIVLYDSNDICLDGETKDTFSENVRARYDAYGWHTVLVEDGTDLAAISTAIETAKFSGKPSLIEVKTVIGYGSPNKSGTNAVHGAPLGAEETGATRKFLGWDYDPFEVPEEVYSDFKTNVADRGQEAYDAWASLVSDYKVAYPEVASEIDAIVAGKSPVTITEKDFPVYENGFSQATRNSSQDAINTAAVLPTFLGGSADLAHSNMTYIKADGLQDKYNPLNRNIQFGVREFVMGTILNGMALHGGLRVYGGTFFVFSDYVKAAIRLSAIQELPVTYVFTHDSIAVGEDGPTHEPVEHLAGLRSMPNLTVIRPADARETQATWHHALTSTTTPTVIVLTRQNLVVEEGTDFGKVAKGAYVVYDTPGFDTIIIATGSEVNLAIKAAKELVLQGGKVRVVSMPSTELFDAQDATYKEDILPSKTRRRVAIEMAATQSWYKYVGLDGAVIGIDIFGASAPAQTVIDNYGFTVENIVAQVKSL (SEQ ID NO:382)
>orf02362
LNFLNEPRRQGNIGNKMAIHNIDMIRCYFIIQKSNLLFEFVQIHGHQRW (SEQ ID NO:383)
>orf02367
VIYSYDYPRLLHSRTLAMGNSNIIPNTGLSFCLSLIKTFDKLVSIRHITRLNQ (SEQ ID NO:384)
>orf02371
VSRKQEQMETLLLLLRDSKDYISAKVLGEKLNCSDKTVYRLVKGINKDCPVEAFILSEKGRGFKLNPRSSLVDVDGNFTEAFDPEVRREKLLERLLLTAPKPHSIYDLGEEFYVSESVVLKDRQILQESLAIYGLDLKMRQRKLFIDGDEAQIRSAILNLLPMFNQLDLEQITQNKVQPLDGELAHFCLGLLITLERELGVNIPYPYNINIFSHLYIFISRNRRSTSIHVVAPSKPTIVDEKIYSVCQKIIQEIEQYFRMKVDAVEIDYLYQYVVSSRLQKPFSSGKLPFSQRVLDVTHYYFSRMCMDNREIETTDPDFVDLASHISPLLRRLDNRVQIKNSLLSQILLTYPNLVKELTTISKEVSLVFGFASLSLDEIGFLVLYFARFQEKRARPLKTVVMCTSGVGTSELLRARLEKQFSELDIIDVVAYHQLDELINLYPDLDFIVTTVALQEPASVPFVLVSAFLTEGDKQRLQAKIQEINYE (SEQ ID NO:385)
>orf02390
MMSMVDPIDQTFIVNLKIRKSQVFSQLQFSCHIVVYPSEVHIYQALVIKLQNHILGPQVLP (SEQ ID NO:386)
>orf02391
LANRTRIDNQLPTSPVTKQLLVNMSINSNITGRMSHQAVKLLLFASMNQLSPPVLIRQMMANSHRQIPKLTMNLKRLIVEHFNFF (SEQ ID NO:387)
>orf02395
MDSIEFFHDKTFLFYLTSFYSKRMGVTTKMIKIAEGRFS (SEQ ID NO:388)
>orf02406
LTRFEEIFEEYKNPQDTFFYPLVYKENTYKKTAISIFALLMLGVCCLFLFSQQSYKKLVQYYANDQNLPSRITYSEYSDK (SEQ ID NO:389)
>orf02414
MIAEFIDGLQKFHFLQNALITAIVVGIVAGAVGCFIILRGMSLMGDAISHAVLPGVALSFILGLDFFIGAIVFGLLAAIIITYIKGNSIIKSDTAIGITFSSFLALGIILIGVAKSSTDLFHILFGNILAVQDTDMFITMGVGAAILLLIWIFFKQLLITSFDELLAKAMGMPVNFYHYLLMVLLTLVSVTAMQSVGTILIVAMLITPAATAYLYANSLKSMIFLSSTFGATASVLGLFIGYSFNVAAGSSIVLTAASFFLISFFIAPKQRYLKLKNKHLLK (SEQ ID NO:390)
>orf02424
LNQEIIWKTRKSFTFKSRSLTDIRSRRKCLTNIQLSLIVRHLRQSRTLLAELNVNIPKRQIGPILTRFWPN (SEQ ID NO:391)
>orf02437
MQGEMRFSLVQFLTTLIKFCIFPFLLPNWLFGRKACTHWEFSFPKIKGVFEFHGISFINNNKLKTTTSKKDENRGTTFIRKKI (SEQ ID NO:392)
>orf02438
MYEEPEVAPVHPTGPTPATETVDSIPGFEAPQESVTIL (SEQ ID NO:393)
>orf02468
MRLSWHFMRFKKLPLLINQTILDVGSTDINCNVICHKYLLLD (SEQ ID NO:394)
>orf02470
MGNNGQFTFGYRHDFFQNQLAIFNALVDTFTRRTIDIKTLNTFINEVLNQGTRTFWAYFSLIIITCVEGWNDTFVFFQI (SEQ ID NO:395)
>orf02497
LSTRNKYCKNLIIFESTFNILDIVKKDLKLNSKLEKDLKY (SEQ ID NO:396)
>orf02499
MNRSVQERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGESSTLTDTEKSQPSSETELSGNKQEQERKDKQEEKIPRDYYARDLENVETVIEKEDVETNASNGQRVDLSSELDKLKKLENATVHMEFKPDAKAPAFYNLFSVSSATKKDEYFTMAVYNNTATLEGRGSDGKQFYNNYNDAPLKVKPGQWNSVTFTVEKPTAELPKGRVRLYVNGVLSRTSLRSGNFIKDMPDVTHVQIGATKRANNTVWGSNLQIRNLTVYNRALTPEEVQKRSQLFKRSDLEKKLPEGAALTEKTDIFESGRNGKPNKDGIKSYRIPALLKTDKGTLIAGADERRLHSSDWGDIGMVIRRSEDNGKTWGDRVTITNLRDNPKASDPSIGSPVNIDMVLVQDPETKRIFSIYDMFPEGKGIFGMSSQKEEAYKKIDGKTYQILYREGEKGAYTIRENGTVYTPDGKATDYRVVVDPVKPAYSDKGDLYKGNQLLGNIYFTTNKTSPFRIAKDSYLWMSYSDDDGKTWSAPQDITPMVKADWMKFLGVGPGTGIVLRNGPHKGRILIPVYTTNNVSHLNGSQSSRIIYSDDHGKTWHAGEAVNDNRQVDGQKIHSSTMNNRRAQNTESTVVQLNNGDVKLFMRGLTGDLQVATSKDGGVTWEKDIKRYPQVKDVYVQMSAIHTMHEGKEYIILSNAGGPKRENGMVHLARVEENGELTWLKHNPIQKGEFAYNSLQELGNGEYGILYEHTEKGQNAYTLSFRKFNWDFLSKDLISPTEAKVKRTREMGKGVIGLEFDSEVLVNKAPTLQLANGKTARFMTQYDTKTLLFTVDSEDMGQKVTGLAEGAIESMHNLPVSVAGTKLSNGMNGSEAAVHEVPEYTGPLGTSGEEPAPTVEKPEYTGPLGTSGEEPAPTVEKPEYTGPLGTAGEEAAPTVEKPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKEDYTYKAPLAQQALPETGNKESDLLASLGLTAFFLGLFTLGKKREQ(SEQ ID NO:397)
>orf02501
VDRTDEVSSKHCFEVVDRTDEVSSKHRFEVADRTDEVSNIYTARQS (SEQ ID NO:398)
>orf02512
MSCNCAFYRSQFFDVNSVSNYHSHQKELRFPNSILFTYFVKVT (SEQ ID NO:399)
>orf02535
VGLIKLTSYVFVCISNSFLTRHDKNDNICFFHGNFCLVLDLFHERSIDIINSSCINHAKRTIEPLTRCINTVTCHSFDIFYNGDSLTSDPIK (SEQ ID NO:400)
>orf02537
LSSKSCIDRTNQETFHTLGLEGVGMKSGSLFCSVQISDKEKENSRLANGFLRYQFIQGIFLLLTSYHNHRVGLEILPR (SEQ ID NO:401)
>orf02554
MKSKEQTRKLAVGCSKYSFEVADKTDEVSSKHCFEVVDRTDEVSNYIYGKAKLTWFEEIFEEY (SEQ ID NO:402)
>orf02556
LSNSFFLIKFSSSKISGKKRIVSDNIFIRNKFICHFKKE (SEQ ID NO:403)
>orf02564
MDYSKVAAEVIEAVGKDNLVAAAHCATRLRLVLKDEAKVNQAALDNNADVKGTFSTNGQYQIIIGPGDVNFVYAEIIKKTGLKEVSTDDLKEIANKDKKFNPLMDLIKLLSDIFVPIIPALVAGGLLMALRNFLTSPDLFGPQSIEDMYPAIKGFSAMIQLMSAAPFMFLPVLVGISAAKRFGANQFLGAAIGMIMTTPDLGGKEAFWDILGFHVTQTNYAYQVIPVLVAVWLLANLEKFFHKKLPSAVDFTFTPLLSVMITGFLTFTVIGPVMLVVSDAITNAIVWLYNTTGAFGMGLFGGTYSLIVMTGLHQSFPAIETQLLSAYNNNGTGFGDYIFVVASMANVAQGAATLAVYFLTKNAKTKGLSSSAAVSAFLGITEPALFGVNLKYKFPFFCALAGSAIGAFVAGLTHVIAVSLGAAGFIGFLSIKAGSIPMYIIAEIMSFVAAFAFTYFYGKTKAASVFADEAATATAETVTEPTVEAPVVEETDTLQNETLVTPIVGDVVALADVNDPVFSSGAMGQGIAVKPSQGVVYAPADAEVSIAFPTGHAFGLKTRNGAEVLIHVGIDTVSMNGDGFEAKVAQGNKVKAGDVLGTFDSNKIAAAGLDDTTMVIVTNTADYASVAPVATGSVAKGNAVIEVKI (SEQ ID NO:404)
>orf02570
MTESYTWVEADRATLSRYRHGQGHLTDQFFSFKVQRPAAKTLIASISTGKGMGPSFDGTPVITSGNQNRINTIKNSFIMSSSSVRISLRKLTSQRNFLRNLSSLILLAAQVAKGDATACSHQRIGRVVGQDSHETLSLTEFF (SEQ ID NO:405)
>orf02571
MNINNEKVWFAFYLLDMQITRPTSTFNDRRIGLIGKLQELRFLAGNLLLR (SEQ ID NO:406)
>orf02572
LIKGYLPNHLSLMDLCSKTTCTLDDFAGIAGRRNHRGFFCHIGNGVFLTVDKYLWNQ(SEQ ID NO:407)
>orf02578
LIKLTGRNFSDILIKCLVKCFTNLLSNQLMLLPSTLKL (SEQ ID NO:408)
>orf02582
MIQSENHCSASHSNRDYQSQHDNQGRTCQCFIIVPCHKKGSCSVGEITWNQRCQNGQDKDHSRCLIKNT (SEQ ID NO:409)
>orf02589
MIARQLMVFFSTNQADTRITNMSIDSLIINNSKDFQSSSHASVSFILTKLVNLLIFNF(SEQ ID NO:410)
>orf02590
MGEPFTHFIDCIDLGINPSYTQVCHRHFTSDIPCAMTSHPIS (SEQ ID NO:411)
>orf02597
LSSDSHFIGIKAFVILILGKSNSIVLRIVGLYQDLTCLFSTTCSTCHLSQELEGSLRRTEIRQIQGRIRI (SEQ ID NO:412)
>orf02598
MAVHSLGIHMQGQRNIAVGTSIHRPTLPTHDKARITTAIEHENHLLFFNQTVLDSL(SEQ ID NO:413)
>orf02599
MVTGIAVLLISRFMLFINNHDTQIFQRSKDSRSGTNNNLGIATLHLAPFIILFTIG(SEQ ID NO:414)
>orf02600
VKNGYLVPKTCYKTLGHLRSQGNLRYQQNSCLALIQGTLDNLQVNLGLPTSCNPLK(SEQ ID NO:415)
>orf02601
MVNLIPRLGLDLLLIDCLIFQTKQAFSSQTHHFSLLGKV (SEQ ID NO:416)
>orf02602
LGLQTKNNPLNQAITLTKRHMNPHPNFQHSLKFLRNPVTIGLVRLHQGHIYDNLS (SEQ ID NO:417)
>orf02605
LGNHFCTICSTTYQAFLQFIQIWWCQEDKDSIWNLFLDLKSTLNFNFKENIDSLVQGFIDIGQRSSIVVADIFCVFQHLSLTNQLFKFFTSTEEIVNTVHFSRTLCACRHRYRILKLVFRTLKNLSSNRSFSNP (SEQ ID NO:418)
>orf02618
VLLPNVLIKFINRFCWNVIPIKGCDSTFWNNKLIAIFKGNLDRGIHTIFCLHTT (SEQ ID NO:419)
>orf02628
VGCSYICHELVTNHDHFLFVIVEFLHSTVNTKCEGLQGPVNVINPKFLNCSLNAFFGVI(SEQ ID NO:420)
>orf02629
LLHLWRSIRVVPSNEGIIQIDQNSLDSLRLQAGDCQIIDCFHSKIWYIIFNRHSGSFS(SEQID NO:421)
>orf02631
LFGSCRQINHTSLQIRQGKEVFLGSSLAQEVIDLIPTLVHLLNDRIVGIADDFQTGKEKLINRKRVMALQITSHLFNDIGVLGITNGNQATMLDNKGHGKSLIVGXXHSIH (SEQ ID NO:422)
>orf02633
VKGLLLATKLCRTNSHTDNLTRYSNRSICQNDLISHIQLTFKEDEKAIDDIRQKALGSHTNRYPSNTSSSQQTRNWQT (SEQ ID NO:423)
>orf02635
LWGILGLTLPNLSGIGLLGDLFVGGLKAVAPILVFALVANALSQHQKGQDSNMKTVVFLYIL (SEQ ID NO:424)
>orf02637
MIGTFAAALVAVLASFIVPIEITLNSANTEIAPPDGIGQVLSNLLLKLVDNPVNALLTANYIRILSWAVIFGIAMREASKNSKELLKTIADVTSKIVEWIINLAPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVWNCLRVSGVTAFFTRSSTTNIPVNMKLCHDLGLNPDTYSVSIPLGSTINMAGVAITINLLTLVTVNTLGIPVDFATAFVLSVVAAISACGASGGASGIAGGSLLLIPVAGSLFGISNDIAIQIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKKASLLMSCLLRFYSNLLGNSYVY (SEQ ID NO:425)
>orf02639
MIAHAIIILLFKNAGKLCFLVVFFFTVHTFGKNQLLLG (SEQ ID NO:426)
>orf02645
VFEVVDKTDEVSSKHCFEVADRTDEVSNHTYGKVKLTWFEEIFEEYHTKKPCSSR (SEQ ID NO:427)
>orf02651
MKNRIIDVFEVVNRILVITVENPDFEDLRVNQFVKIGDKKYRVRSGPMIHSTPPQSVLDRDTFTIDYTDDELLDKEAVFTTH (SEQ ID NO:428)
>orf02657
MAYIEYKQRGKKRLWSFSIRERSKSLLHKSGFKTKREAKIEAEKVLHKLNTGSVLSSSMTLSELYNEWLDLKILPSNRSVVTKKKYLMRKKVIERLFGNKPVSQIKPSEYQKIMNEYGETVSRNFLGRLNSSIQASIQMAIADKVIIEDFTAYVELFSSKSGQKVEEKYLHTESDYQKVLVYLKNKFDYQKSIVPYVIYFLFKTGMRFSELIALTWDEVDELNEQLKTYRRYNTAIHKFTPPKNNTSIRLVPITSDMLSLLKTLKILQLKTNKELNIDNENNLIFQHFGYVYDVPDIATVNKAIKVMLKELQIFPLITTKGARHTYGSYLWHNNIDLGVIAKILGHKDISMLIDVYGHTLEEKISEEFTAVKSLL (SEQ ID NO:429)
>orf02658
MKDTISNKDLISMGYRPSTANAIIHQVRELLVSRGYTFYNRKRLMVVPKSVVKELLGMEL(SEQ ID NO:430)
>orf02659
MNKEVLNRAPSNSPITIHQMSNKSYSKFQEEVSLKYGFIGLKLDKLSLTAEVSEEFHSEILSGNFTLYDYFGVVEPNLNRNGELASYKGQFFNDEKENWYIEYTPVASIKMNKRPLKIEFTPSKVSKKNFLIVFHRMFPYMFNIAISTFHLAYDFERDLSALRVNWPKVMYRPIYKGMKLETMDFGAPKGNYHLTAYNKLKERMDSGDMAEIEIYQQYDNLWRIEYKFYNEGNIKKELKNGLPFLAKIPVYIENFKGLEFNNLGVNEKIYLFALRNKPELFAESDKRTVAKYKKLAESISEVNLNVFFQNALDFVEIFNQEPCLINFFEFMNSMLQGDIPKLTINQE (SEQ ID NO:431)
>orf02660
MHFDKSKFGAVFSAPGLYEVEVINNASFGQNAQYEVIQSRKLGTFAELIEMAKIK (SEQ ID NO:432)
>orf02661
MFKIKQTASLSEHYFLNTSKLRSIRWFTIGFLSILSLYSCILFKGWFLQMFTLSVGLIVTLYFERKIKGCFHQIEPLLIVRENLLFMLRRNSFLFTATKDGAILRSAKFNYQLNDVSIVIQALKSGDEFTREMDDLDVLLSSVLGISLSYKEIYATHVEYVFVYRQPERLHITSLPLEEDNSLKIKIYDDFIIDLRKNFSMLISGASGAGKSFFTYYYLTRFISQTVNGRHAKIYVIDPKLSDIYKLSKFSGLPVENYGTTNEDAFRIVRHYINEMNRRMEIYNKSDLFDSIGIDLGLPPLLLVIEEYSSLVASMDSKAKKDFENMVAIVAQKARSLSMGVCIVMQQPRSDSLSTNIREQLVNAIFLGAPTRESSQMMFGTTDVPKVKKDKGVGLYSTDREPPKEFHSPMFDRDVFEVILPVWEWAAKDYMKDEDEDV (SEQ ID NO:433)
>orf02662
MKQKQPIVSRTKQHTFEELIQDQKLERLANLSPDLVGRYGFTASCASSFANLIKEAYGGKNLNVVYASRMLALWNIACSCYHKADGYSLADALFSDKKICLDYFYYHNNTSDIITLDMIEDVKKNYLQLVTTATSDNMSVIEFEMEKESDLYYFIKATLGSSFSRMHYSVLVKALAGALAKNI (SEQ ID NO:434)
>orf02664
MGWKGTPPCLHPSNQDTTILIVQQCLRRIEVLAMINFLN (SEQ ID NO:435)
>orf02665
VFGSYYGVIASIFFKEFWITEISSNQLIWQVCSSYNWILGNLFKVNPVI (SEQ ID NO:436)
>orf02666
VLPSHQVLTFSMSPVHRPPNTIIWIELIKEMVFSTKIDKSIWIIDPTNLS (SEQ ID NO:437)
>orf02689
LIVSLKTKSRKAKDMAESIQGWLAQFLVNLFKSITFDCGKEFSKWKDISNHHDSESFFANLGCPRQRCLNEHSNRLLRCHDLPKQTDFNEVSQEF (SEQ ID NO:438)
>orf02690
VVEIIYFLIIIIASGLGSISGMGGGIIIKPLMDSFGYHSVSDIAFYSSFSVFIMAIISTTKRFSQSKEIKWRLIFTVSFSSVLGGFLGHLIFQVLLSQLSVRLVSIVQMILLFVMLLVSFVLTDFKKTYQFDKIGFYMICGLLLGLISSFLGIGGGPLNVSLLMVFFSISIKEATMYSLAIIFFSQLSHLATIVVVTGLNQYHLAPVPVIFLASICGGVLGTVVSKVLPENWVRYCFKGMLFFVMGMTLYNLFHIL (SEQ ID NO:439)
>orf02691
MMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLAERQEWSFVFILFVPEAEREGYPHPAGAPWWLQVSNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSEYPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCRNPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE (SEQ ID NO:440)
>orf02694
MVQTKQPNIILIVVDQMRADALSLNSKDKLVSTPTLDMMASVGYNFENAYSPVPSCVPARAALLTGLDQDKSGRVGYQDEVPWNFTNTLPKVFKDMGYQTECIGKMHVFPSRQRLGFDHVLLHDGYLHVDRKYDKTYGSQFDYASDYLAFLKGKVGYDVDLIDDGMDCNSWEARPWDKDEKLHPTNWVVSESISFLQRRDPTVPFFLKMSFEKPHAPLNPPKYYFDMYMERLPQFLDLHIGNWEVLEKQIPSIYALRGKLKEDDQRRMVAAYFGLITHIDHQISRFLTALKEFRHDKDTIIWFVSDHGDQLGEHYLFRKGYPYQGSIHIPSFIYDPAGLIAGNRGTIKQLVKIQDIFPSLVDLAGGTTTDELDGRSVKNLLFGQYEGWRTEFHGEHALGKDSSQYILTDQWKFIWFPVLNHYQLFDMKKDPHEMNDLYPSEKYQPIVRQMKKKLVDFLRYREEGFVVDEELVPVELSKITPTLTKTGDSQS (SEQ ID NO:441)
>orf02696
MNTMLDKMQEKLSPIAMKVGNQKFLVALRDSFVGTMPVIMTGSIALLLNAFLVDLPQQFHLESITKTFQWLVDINNLVFKGSIPIVSLLFIYCLGVNIAKIYKVDTVSAGLVSLASFVISIGSTVTKSFPLANVGDVKLDQILQGIDNLAFDGKNLMVTIGNVIPGNHINARGYFTAMMIGFLASIIFCKVMKKNWVIKLPDSVPPAIAKPFTSIIPGFMAMYIVAILTYVFHLLSNDLLIDWVYKVLQTPLLGLSQSFFAVILMIFLNKLFWFFGLHGGNVLAPIMEGLFGVAMLANLDAFQKGEPIPYIWTSGSFGAFVWFGGLGLVLAILIFSRNSHYRKVAKLGLAPVLFNIGEPVNYGLPVVLNPLLFIPFVLSPVFMATVAYWATSWGLVSPVTQNVTWVMPPILYGFFSTAFDWRAIILSVVCLIISVLTYFPFVKMADKTELS (SEQ ID NO:442)
>orf02697
MDESNLESVMGLIMYGGEAKSNAMEAIQAAKKGDFSKANRRLADANAALLQAHKAQTEMLTREAQGEETSISLLMVHAQDHLMTSLTFVDLAKEVVEVYERFEKN (SEQ ID NO:443)
>orf02698
MAKVTIMLACAAGMSTSLLVTKMQKAAEDKGLDAEIFAVPAPEAEEIVATKEVNVLLLGPQVRYLLGDFQEKLKDRQIPVAVIPMTDYGMMNGSKVLDLAESLLD (SEQ ID NO:444)
>orf02699
MKRLISANPSEILQMNAEELKQSILASEGRVVLSENVVTRETFVGDITNSEIARAFGADMILLNCVDVFEPKIYALDSSGDDVIHRLHQLVACPIGVNLEPIDPSAKMLEETQEIVAGRVASVETLKRIEELGFDFVCLTGNPGTGVSNREIIKAVQTAKENFSGLIIAGKMHGAGVNEPVAELSVAEQLLEAGADVILVPAVGTVPAFHDQELREVVDLVHSKGGLVLSAIGTSQETSDTDTIKEIALRNKICGVDIQHIGDAGYGGLATVDNIYALSKAIRGVRHTVSRLARSVNR(SEQ ID NO:445)
>orf02700
MEKLLQEKLLPVAARLGNNKALVSIRDGITLTIPLLLIGSLLMVIASFPIPGWEKYLGDIGVADYLWKGVDSSFGLLGLVASFGIAYFMARQYKVDGIPAGIVSLSSFITVTPFITGEAGAGMPTAFMASKGLFVAMILGLINGYIYQWFINHNIQIKMPDGVPPAVSKSFSAIIPGAVTIVGWLIVYATLDKLSLPNLHEIAQVALGGPLGLLGNNVIGLLILIFLNSSFWFVGLHGGNVVNAVMKPLWLANLDANKVAYQTGETLPNIFTSVFMDNFVFIGGGGATIGLVLALGYLAHKKKASKQLKTLAPITVIPGLFNINEPAMFGVPIVLNILLLVPFILAPMFNLLVAWGAMASGLVPLTYTDPGWTMPPVISGLLATGSISGSLLQIVLIVLDVLLYLPFVIAIEKRFKLLED(SEQ ID NO:446)
>orf02701
MTLSKKQLQLRAKILETVYTLGPISRIEIATKTGITPATTSSITNDLIKENILLELGEDEHDTSVGRKKILLDIQAKRFYYIGCELSEKHFTFALGDNLGNILKEEKEIVTKQLIQEKGNQLINQTLKQFLNNCSDYEIEAIGIALPGRYLDDYKITTNNPLWQHIDLEMIQSHFDKPLFFSNNVNCMAIGKRLFSRQQNDTNFAYFHFARGMHCSYIYDGNIYGKGNLMIGEIGHTVVSSEGEECSCGRKGCLQTFAGEAWLIKKSKILYHQSPYSLLPSLVKNADDIDIQVILTAYQLGDTGIITLIHQALLYLSQTILNISMMIDSQKIYLHSPLLTNQHIIQKLYSEMNYKPKLLYNRLPEVIIEPYNDFTAAHSAIALCLYHTILHS (SEQ ID NO:447)
>orf02719
MPFKENLICQHRNHHCSVFFISLGLLHNIHIEIDISQTRASFLDLSDYLQAVLMILQKFCQAIGLAQRLDLLQLHLLHLTRLLL (SEQ ID NO:448)
>orf02728
MYLLLLVVKDHIALIDKEMHVWRPNCILRDLTNFFIKRNHIVTNKTNGSPTKREV (SEQ ID NO:449)
>orf02729
MVLALMNHFIKEIQGIPINRLTILIENSIFKLNLKNWIIG (SEQ ID NO:450)
>orf02731
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSSKHCFEVADRTDEVSSKHCFEVADRTDEVSNHTYGKATLTRFEEIFEEYKGVPR (SEQ ID NO:451)
>orf02743
LVEQLTFNQWVTGSSPVRVIYAGLAELADAPDLGSGA (SEQ ID NO:452)
>orf02745
VCQRMDARTCKTTIIAVHNVLTALQQTWIAVQLYQTK (SEQ ID NO:453)
>orf02746
LHLGKSILSLPVKGKDLEFLVHLFVINHWIGFPSRTSTFCRCKVLNGME (SEQ ID NO:454)
>orf02747
LEQTVIIANNPCELYWDNHLSFLSDSLLKQVIVHLKRICLDIHHDRGCSHVRNDTT (SEQ ID NO:455)
>orf02749
LTDDGVLILVVDAGWRGNSCLQEQGCHHFRAILLCITWHFRSCTDKGHLTFKDIDQLRQFVQTDTSDEISNLGNTAIVSRSHQTSFFIRIRHHGTKLPNLEPTVVLGHTLLLVNHWPLAIQLDPNAQDEKDGRGQNQ (SEQ ID NO:456)
>orf02750
MLKMRKMGEVRTSKIKAKTQSKQRLKISEPFLLETSW (SEQ ID NO:457)
>orf02765
MDYNAVIPEFLVSNIEQSRSFYCGLLGFRIEYQRPEENFLFLLKSVN (SEQ ID NO:458)
>orf02766
MLEEGTKDQLAELTYPFGRGVNLSFGIKDVSKLYQKVMEANYPIYRPLTKRKFRVSDPYIYPHKFAVLDPDGYFLRFSE (SEQ ID NO:459)
>orf02771
LACDKHGKGCFTDISALLIGDIHIHHTGCTTLMDTFSFNGQYIVEFSCLKVVDRGLECHPIKSQRDNHQTTDLVT (SEQ ID NO:460)
>orf02773
MGIAIVVERRVHYFGRHHNVTISHFFNFVIFKGRYSVKMKVFHRFLLIFQTTL (SEQ ID NO:461)
>orf02784
MIACRHDICKSQKGLEHPFCIVRRLTRDFNQRPVCIVEANIFCLKITPQIIANMIVARTVKSSKTGITLTTSMCKRDNHKITWFHRRNGFPSFFNNPNRFVSTIFVSNFRFWITVPP (SEQ ID NO:462)
>orf02799
MNMNKDQIAILNGADNLNLTLWMTLKEICKEGCKSFFPVRNTCRMLDIGIPYRLGLSLSNSSVLNGMDV (SEQ ID NO:463)
>orf02814
MDGQLHHRAIFDWVHFENFINSWLGFLTINLKTTGQGAEIVLIGHTFNTRDINLVNLITRVNHLVCKVPIIGQNQDTRCIPVQTTHRVNTFFDIGQEVDNRLATLVICYTGNDTAWFVKQIIDLFFVVDRLTFNFDLVA (SEQ ID NO:464)
>orf02820
MHKLRIFVNQLYRRFGIILGPFLVLGFQVLTQELELAIFFDLREEVLLQVIPQVCHFCYLRKEFTTLNQHELTSHDHVLTRHFQTHGLQG (SEQ ID NO:465)
>orf02831
MLHMNLFFQPFFTNLCKTLATGCCVKTVMEWSSIATTIDFKIIE (SEQ ID NO:466)
>orf02832
LDNRAKEWIMSTAQNQAIHLSNQGTQGFIDHLLGNTG (SEQ ID NO:467)
>orf02843
LSNFCFKTVTVHRYSVNTNVNQNFSTISCFQTKSVPCWKGN (SEQ ID NO:468)
>orf02853
MKIKEQTRKLAAGCSKHCFEVVDETDEVSNHTYGKATLTWFEEIFE (SEQ ID NO:469)
>orf02858
MSIVKSHSFSISLGIFNSFWNNIHTSECFYFLCKGKSNRSNSTISVNQMVFFINIQRFYCFAIEDFCLLRI (SEQ ID NO:470)
>orf02859
LNTLLPPDNLCLFTIYLTGFSCICINSYCHNFWEIFNQLFYQLS (SEQ ID NO:471)
>orf0286S
MIDKVIEQYKSHDNLDIRVELHKKYSKNKLGFNNWIFSNYQITDEVKVLELGCGTGELWKSNSDSIDKMKQLIVTDFSKDMVKSTKSVIGNRNNVNYEIMDIQKISFENETFDIVIASMLLHHVNDIPKALSEVNRVLKTGGIFYCATFGENGVVNYLASLFKDEVNQDLENRTFTLQNGKRYLSRYFNSVDTLLYDDELQVTSIDDLVKYIQSFKGISEIGSLEEEIIRKRLESEFNNGMLIIPKEYGMFIARKES (SEQ ID NO:472)
>orf02867
MDLGFDYFGSALTISPHKNSQTINSIGIDVQKIYTPHYLPNDFKKNQGYKRSVEMCEEYDIYRQCYCGCVYAAQAQNIDLV (SEQ ID NO:473)
>orf02868
LTKYADVTIYFANSNIHPKAEYHKRVYVTKKFVSDFNERTGNTVQYLEAPYEPN (SEQ ID NO:474)
>orf02876
MYNKVILIGRLTSTPELHKTNNDKSVARATIAVNRRYKDQNGEREADFVNMVLWGRLAETLASYATKGSLISVDGELRTRRFEKNGQMNYVTEVLVTGFQLLESRAQRAMRENNAGQDLADLVLEEEELPF (SEQ ID NO:475)
>orf02880
MQFTRTAHHTKTLFTTKFTWENEIPFWHHSSRKRDNGFQPHTRIGSSCNDLYSLITCDCNLADVEVVTIWMGYHLNNFTDNKLRFLIINNFFCKTFRLQLLVQTSDLLICQKDLTALCNFK (SEQ ID NO:476)
>orf02885
MSYSVDDVVSNAFKKRMILDSFFAFNCSGTMKVSTWVYDKGEWYYVSSSGAMIANDWVKDNGK (SEQ ID NO:477)
>orf02895
LTSFCFKANMFNNCLRICRIAEGHILKLDLTFEVFISQLHLNRVLDRRMQI (SEQ ID NO:478)
>orf02897
LINPLSRDHSSGKNDEECSHEKEAHDNLHSIRHENNHVTKERQTRYRSSVVNHIGPNPVNRHTQTT (SEQ ID NO:479)
>orf02900
LHLRTCFVRQTNKLSPLINRTRLQFHQTILHYTLNQITSNRLGNIEFLLDIFNQDQVLVFLAIIQKMHNLTLRPTHKFNAATFGFLLHHQVNLMTKTLKD (SEQ ID NO:480)
>orf02916
MLEIWKYRPFVSEFWNDFKNNHDKQFVDPISLYLTLKDDDDPRIEEESEALENMILQYLGEDDAS (SEQ ID NO:481)
>orf02918
LTLFFFELLICLLNSEFDLSKFIFVYFDEYFHEDSLKMNLHQFSFSF (SEQ ID NO:482)
>orf02924
MAFNQFNRCIGLSIPTAPNVPGTIINRSYLHDATVPNNVREKT (SEQ ID NO:483)
>orf02940
LRLQIELTWFEEIFEEYKFEIMKIRQTGGCFVSHLTERDGLRVT (SEQ ID NO:484)
>orf02944
MKKNRGIQKLAILVLLGVFMFSNTIPYQQFIQKNRQLEIRVQSQKKSNGLDVGKAD (SEQ ID NO:485)
>orf02946
MKKLFILISNLLASLFFVWVLTIWTDTYVSHYYPNVVVRDSSPETTFQHVATRLEKLAEETDSFIAIQHQDPNSEGTTVFSYTTFGDGKLPDGLQEKNLEDAQSSSVETNYFVFDGHLDIHLLREELSQLGLTNMHLTIPSKLSTLMAIFSNGFQLISLLIFILTFVALTLISQISQLRSSGIRLISGEKRWSIFLRPVGEDLKAIAVGFSLAGVLAILMQKILSLPTQSLMTIGEGLLSYNLILLSISLFFAQLFAVGIKKIHLMQIIKGQVPVRGIISLILIGQLLAIIIVTLGIGSSLKYSQAWQQHRIGQEIWSQERQLITLSISREGTSPGFDEQAQRKLRTWYQLMDLAVSEQKAFLSRHQLIDRTLQNGMASSKNLITSTEWHDYNPNGNVLIVTPQYLERQNIPVDTTIEQKMNHLNVGEFVLLLPEHLRSEEEHYKSVFEDDLTSRISSQDERQQMTATVGYLESGQDRFVYNTTPISYQQFLKDPIIIVITPQSTGPQSILFWIDAVQNYVLFNQLSDAQELIQRQGIENWVSEMQTGYHNYITLLDNIQRERWVMLAGAVLGIATSILLFNTMNRLYFEEFRRAIFIKRIAGLRFLEIHRTYLFAQLGVFLLGFVASVFLQVEIGVAFLVLLLFTGLSLLQLHVQMQKENKMSILVLKGG (SEQ ID NO:486)
>orf02955
VLKWCILRINHHISRKVDNFLEGTRAHIKGQAHTAWNPLEVPDVRYRSFQFDMSHTLTTNFRTRYFNPTAVTNNSSVTNTFVLTTSTFPVFCRTKDHFIKESFTFWFQGTIIDCFRFFDFSIRP (SEQ ID NO:487)
>orf02962
LVDPLVTSHDNLLSKGSIFIQTRVSLSYSIFIFFISCQPNNFRS (SEQ ID NO:488)
>orf02966
MPWKELCHKLAPKVFKVIRIYSRENKKSPSNWAFCSFET (SEQID NO:489)
>orf02970
VSVLFFCSYFSLSLEKGWFSSLISCKFMNQFLPFCWRQDSPWILTLAQDSITYH (SEQ ID NO:490)
>orf02978
VTDENTRKVRSLVAFFSIVIGYILSSFFISLYHLWQEALRGLL (SEQ ID NO:491)
>orf02979
MRLLFFFANRVIRSKENSSTCPSRSYNLLINTSNVSHITIAVNGTCTGNNTTITKIWVSYLSIDS (SEQ ID NO:492)
>orf02983
VDSLFLSLGEEGNQEINLQESFSSTDCNPTLISPETTVAQGLCQDIIYRPFT (SEQ ID NO:493)
>orf02985
VNPKSLGSFFLQDSKGFKELVLGHAKLSLPRIVHNVCPQFKNASRIITTRDDFWNACYSLQMFNIFKGIQVNGRTQFTCIGVFLVWRVVGREHNLRTQKVQFMAHQKLYITRAVHTTTFFLENFQNSWSWSSLNCKIFLKALVPRKSLVDGSCLLTNPLLIIQVKGSRELGNNRF (SEQ ID NO:494)
>orf02989
MTTIRSLLLNFISISYSIFIQKIKKQTRKLVAGGSKHCF (SEQ ID NO:495)
>orf03003
MDALVLQKNQETIQQIAVKIRFLDGHDYYSLIDIDNRRTNQTVFPFVNFEDIAF (SEQ ID NO:496)
>orf03004
MAFFTEIPTRACLINLAITLHIVETCQGFNDLSLHLRVLAL (SEQ ID NO:497)
>orf03008
MLLPLPFNTSKIKQIAMHSDLNQKEMIGHIFHDEDIF (SEQ ID NO:498)
>orf03013
MKQTVKKLALVASIAATLGGSVAVASAAVQYPEGGVWTYGSGNGGAYSNYYHPSKYHSSTVVSRKTGSSDKGYAGAGGTSRAWIRTSWGEKVAFYYNV (SEQ ID NO:499)
>orf03018
MLNRRFIKTNNIHLCHTHLSSQGNFFCLTTCKFFYIQVCMCIKNHLF (SEQ ID NO:500)
>orf03029
MNTIERTRRLVKGCATHCFEVVDRTDEVSSKHCFEVADETDEVSSKHCFEVADETDEVSSKHVFEVVDETDEVSSKHVFEVVDETDEVSSKHVFEVVDETDEVSNHTYGKATLTWFAEIFEEY (SEQ ID NO:501)
>orf03033
MLERLKSIHYMFWASLIFMLFPILPVVIGELPAWHLLVDILFVVTYLGVLITKNQRLSWLFWGLMLVYVAGNTAFVAGNYIWFFFFLSNLLIYHFGVRSLKSLHVWTFLLAQVLVVGRLLIFQRIEVEFLVYMLVILTFVDLMTLGSVRIRLVEDLKEAQVEQNTQINLLLAENERNRIGQDLHDSLGHTFAMLSVKTDLALQLFQMQAYPQVEKELREIQQISKESMCEVRTIVENLKSRTLTSELETVKKMLEIAGIEVETDNQLDTASLTQELDSMASMILLELVTNIIKHAKASKAYLKLERTEKELILTVSDDGCGFAFLKGDELHTVRDRVFPFSGEVSVISQKHPTEVQVRLPYKERN (SEQ ID NO:502)
>orf03036
MTVVKVEKLSKKIKDKEILRNISFEINDGECVALIGPNGAGKTTLLDCLLGDKLVTSGQVSIQGLPVTSSKLDYTRAYLPQENVIVQKLKVKELIAFFQRIYPNPLSNQEIDQLLQFVKQQKEQLAEKLSGGQKRLFSFVLTLIGRPKIVFLDEPTASMDTSTRQRFWEIVQELKAQGVTILYSSHYIEEVEHTADRILLLNKGELIRDTTPLAMRSEEIEKHFILPIAYKEVVEQSNLVENWTLKQDSLQVVTREADAFWELLAQAGCRMQEIEVNNRSLLNTIFEETQKGDN (SEQ ID NO:503)
>orf03039
MGEEEMRNKMIIAMSLVVTGVMTYLMFSGLDEDFCHFPWKVFAGFGIMS (SEQ ID NO:504)
>orf03040
LVEQLTFNQWVTGSSPVRVIYAGLAELADAPDLGSGA (SEQ ID NO:505)
>orf03063
MPSLRSLKKTDGSCDELHHFDLPVNFFKNTVLGKQTCSGVIREVCQDCFNMLWR (SEQ ID NO:506)
>orf03091
LHTSFRSSVGHSHTWHQDIVRPILFSRFNDSIVILWQNCPTFNQGIYCYLDCFFPIVSL(SEQ ID NO:507)
>orf03096
MWSQTLGLIHPLTSLLELPFWMACLKGFGQFCKSLSGLLSFVAECQHLLSLCSRFIRITVLQTSKV (SEQ ID NO:508)
>orf03112
MLEQARLKVEQQAIKNIQFLEQDLPKNPLEKEFDCLAVSRVLHHMPDLDAALSLFHQHLKEDGKLIIADFTKTEANHHGFDLAELENKLIEHGFSSVHSQILYSAEDLFQGNHSEFFLIVAQKSLA(SEQ ID NO:509)
>orf03113
MKHDFNHKAETFDFPKNIFLANLVCQAAEKQIDLLSDKEILDFGGGTGLLALPLTPSQAG(SEQ ID NO:510)
>orf03117
MVDLQSFFTRKYLNLNSVDAYLILPRLQGHLSYPQDFFLLQDFCFLLPIFLNLSQKEGRNAGKDS (SEQ ID NO:511)
>orf03133
MRIRNSPFDHILQTIFEFEDRTCQVTCRFEACSSICNDNWEFSQHIISVFQSPSCHTVCDKSDVFCSFLFDKNFASLWIYVVTITDQLCXXIPFIN (SEQ ID NO:512)
>orf03142
MHKTCLNIWKFLFYQIESLIHQMAADKSPCRIGNRGR (SEQ ID NO:513)
>orf03144
LNHRFNRQTTKVGRSTIWANGTVNRLIIFVIRSTCIVLINGHSFRCQTSSSTSLPNTKDKVRLITIHLFFQYLSRFVKNCRHL (SEQ ID NO:514)
>orf03147
MTLHQTFRFQNFEMPCQSSLINFQTLLNRHLVTRRMLQQKQ (SEQ ID NO:515)
>orf03151
MKKRMLLASTVALSFAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYTVQYGDTLSTIAEALGVDVTVLANLNKITNMDLIFPETVLTTTVNEAEEVTEVEIQTPQADSSEEVTTATADLTTNQVTVDDQTVQVADLSQPIAEAPKEVASSSEVTKTVIASEEVAPSTGTSVPEEQTAETTRPVEEATPQETTPAEKQETQASPQAALAVEATTTSSEAKEVASSNGATAAVSTYQSEETKVISTTYEAPAAPDYAGLAVAKSENAGLQPQTAAFKEEIANLFGITSFSGYRPGDSGDHGKGLAIDFMVPERSELGDKIAEYAIQNMASRGISYIIWKQRFYAPFDSKYGPANTWNPMPDRGSVTENHYDHVHVSMNG (SEQ ID NO:516)
>orf03156
MSNQITVHHSHEHLQKVFTHSWQCNIKNVFIFLKQSLLLMKRNSVGFPTEFTPSILKS(SEQ ID NO:517)
>orf03171
MAELNSVITTVTGIENRLGAVILAEIRNIHAFDNPAQLQAFAGLDSSIYQSGQIDLAGRMVKRGSPHLR(SEQ ID NO:518)
>orf03178
MEXXXDIRKGRHAVVEKVMGAQTYIPNTIQMAEDTSIQLITGPNMSGKSTYMRQLAMTAVMAQLGSYVPAESAHLPIFDAIFTRIGAADDLVSGQSTFMVEMMEANNAISHATKNSLILFDELGRGTATYDGMALAQSIIEYIHEHIGAKTLFATHYHELTSLESSLQHLVNVHVATLEQDGQVTFLHKIEPGPADKSYGIHVAKIAGLPADLLARADKILTQLENQGTESPPPMRQTSAVTEQISLFDRAEEHPILAELAKLDVYNMTPMQVMNVLVELKQKL (SEQ ID NO:519)
>orf03191
LTNLSSVDSEELFQFYRERGNAENFIKERKAGFFGDKTDSSTMIKNEIRMMMGCLAYNLYLFLKQLAGDEVKALTIKRFRRLFLHIAGKYVSTARRHILKFSSLYAYSKQFQALFDTICQINLILPVPYRARGQGKTCLTE (SEQ ID NO:520)
>orf03203
LFDDRQAINICPPTNGSLRLTSLQVDQNPCPPSTNLNKILARSQFLNHIQQISLSLELLQANLWNLV (SEQ ID NO:521)
>orf03207
LSVHFCSSHRCLLVRYNDTYSTKKGLKFETFLSVFRYDFLGM (SEQ ID NO:522)
>orf03237
MDFFNYLLWMICHNHGLHTLLLSKDCVCHTARDKDGNHRIKSVFPTKGQTCYQHDSSIYQERNTTDILTRFLANSQADDIRPTTGDIVSKSKTNPQTHNNTPKKGIDNGILRQGCHRDKLDKEGTHRYRDKGKDGELMANLIPS (SEQ ID NO:523)
>orf03245
MKIKEQTRKLAAGCSKHCFEVVDETDEVSSKHGFEVVDETDEVSSKHGFEVVDETDEVSNHTYGKATLTWFEEIFEEH (SEQ ID NO:524)
>orf03253
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSSKHGFEVVDETDEVSNHTYGKVKLTWFEEILEEY (SEQ ID NO:525)
>orf03254
LFFKDEKQALYTKPKTKSSSFRASKVSNQTIVATTRTDCQVIALNLCDKLENGVVVVVQTTHHIGIDDVIYSKIFQHLTHSIKMSLAFFIKKVQDRRRILYCHLVFFFLRVQDTKRIFLQATLAILRQGLLERCQIVNQGLAVGCTALRISKSVEVQFDTLNTDFLQKMGCHNDCFHIGSWIARTKTLNTNLVELAQAPCLWTLITEHRSHVVELAWLLHFWGEEFIFHIGTDNGRSSFWTEGNMTVTLVIKIVHFLGYDIRCISDRATDNLVMLKNRRAHFCIVIALENLTGKALNVLPLSRFSR (SEQ ID NO:526)
>orf03260
MEQIGKVFRQLRESRNISLRQATGGQFSPSMLSRFETGQSELSVEKFLFALENISASVEEILFLARGFQYDTDSELRKEITDVLEPKNIAPLEDLYRREYQKHAHSHNKQKHILNAIMIKSYMKSMDERVELTAEEGKVLHDYLFSTEIWGIYELNLFSVSSAFLSVSLFTRYVREMVRKSDFLMEMSGNRNFFHTILLNGFLASIECEEFTNAYYFKRVIEEHFYKENETYFRIVYLWAEGLLDSKQGRVKEGQKKMEDAVRIFEMLGCNKSAEYYRNTTEC (SEQ ID NO:527)
>orf03261
LIPYFLHFIIFFRKFIKNLPNCQNYEKIEDIYHVEGLL (SEQ ID NO:528)
>orf03263
MKIKGQTRKLAAGCSKHCFEVMDRTDEVSSKHCFEVVDRTDEVSNHTYGKATLT (SEQ ID NO:529)
>orf03266
VVPFSDTFKDRNQVDIFTIKISRCNSSTIGENSWDIHISNSNHRSRHVLVTATDSDEGIHVVTTHSRLDGVRDDVTRC (SEQ ID NO:530)
>orf03275
MKKFSYPTRQTGEGVKYQSQMVRQWFLIRIFRLFSVA (SEQ ID NO:531)
>orf03297
VVLDHQNQLFEARFLEHTNPLTRIQLTRVKALRILLSSPPFLVIKGIRTEVDKSC (SEQ ID NO:532)
>orf03305
VQKLKKAIYKAHLKDSDDFRPETSTPNLFESCLKLCPCFLSS (SEQ ID NO:533)
>orf03306
MGALGYYEGFVPYVSNQYKNQAEEEDKPLSDKYIFEKILGKTYAAFKKDQINERVEKLGKLKPITINYNGKSEVIDSKEKLQELMNKAVKDEVAQI (SEQ ID NO:534)
>orf03307
MMGDGMKEFQFERKQRFSLRKYAIGACSVLLGTSLFFAGMGAQPVQDTETSSALISSHYLDEQDLSEKLKSELQWFELENKLLNLWEH (SEQ ID NO:535)
>orf03308
MKIKEQTRKLATGCSKHRFEVVDKTDEVSSKHCFEVADRTDEVSNIYTARRR(SEQ ID NO:536)
>orf03312VNITKTSIIKAHTTKEDGIDHTFTRFNIMSIFYSTRKIFLDKLNSTNRQFLGYIISTRCYQSFNSVSQSIHTSSSSQAFRFGKHEFRVINRDKSKAILVNHYHLNLAFFISNHIVNSDFCRSSCRCIDSHNWQAFFSRLMKPFIILWFSTICSHDRNTTSCILWRTPAKTDDKVTAMFLQSSYPICDIFTSRVWLYIAKDDIFDSFCIQWF (SEQ ID NO:537)
>orf03318
MSQDEKLIREQICDVCHKMWQLGWVAANDGNVSVRLDEDTILATPTGISKSFITPEKLVKLNLKGEILEAEGDYCPSSEIKMHIRCYEEREDVRSVVHAHPPIATGFALAHIPLDTYSLIESAIVVGAIPITPFGVPSTMEVPEAITPYLPDHDVMLLENHGALTVGSDVITAYYRMETLELVAKTTFHGRMLLSTKGIEEQEIARPTLERLFSMRENYKVTGRHPGYRKYNGDGSMKETEK (SEQ ID NO:538)
>orf03320
MESKKIAKQILIATAVLTSFLGSNLVYADVVQSNSNNRASTETARVTGNNLEKLITKDKEIDKEMTYLSDMDWSSATHGDIDKTKTVQKDAPFTTGNKGEHTKISLLTSDDKVKYFDKGIGTVADSPSVISYDISGQGFEKFETYIGIDQSANSSRSDHAVVDRIEIEIDGKVVYSSSVTNPEGFRYNTQAQFISVTIPQNAKKISLKSFAGEHTWGDEVVFADAKLIKTVSTQTITPDLLNKGINGGVYLSDLEWVDATHGDDDKSKTVQKDKPFTPGNNGSNNKIKLLIDGKEVEFNKGLGTVASNPSSIKYDVSGANVTRFISYVGIDRSANHLNSDYADIQKFEVVADGKVIYSSDSKYPKGIKYDTSAFLVDVEIPKDTQTIELKSYSGKHTWADELVLGGALFMANGKFKNPNDWSEVDKRREINNEHPLLMMPLYANGEEFNQGKYTFWGGDTLTGKWENIPDDLKPYTVIQLHPDDLPKRDGAARDFYEHMLEEAAKYVNPKTGKNEPIPVILTVYTAGNMPYYTSAHWLSTSWIDKMYQKYPNLHGIFSTENYWIWANDIENKAADYLKVSAKNGGYFIWAEQNNGSAIEKAFGKNGKIAFQKSVDKYWKNLIFMFKNTPAAEGNDSTTESYMKGLWLSNHTYQWGGLMDTWKWYETGKWKLFASGNIGKSQGDRQWLTEPESMLGEEALGVYLNGGVVYNFEHPAYTYGVNNKESLLFSEVIKEFFRYVIAHPAPSKEKVLEDTKVFIHGDYSNKGNGKFFVNVNTDREQTPLYMTGRYNVIPAIPGVLKTDKLKESVSSSRIQIKEITSPEFSSTQARKEYLNKLYPMNYEGDIFAQKLDNRWFVYNYKVNENVKQTGKLKFNSLEMNVEFEPHTYGIFERISNGLKVNLNNFRTNKDSLWSNAQDANQAKKLPQLTKKGAIKWIEEHYIKDTQFGEKRVTKIVLRGIDKLPTIHSLSGTNNSYDQPSLNFDQKNHMVTITINSNGNLEFELHF (SEQ ID NO:539)
>orf03322
MTIYINKDETVFHLAMKDSSYIFRILENGELQHLHFGKRIHVKENYNQLMAYKKRGFEVSFSEEFEDIQQSMIQNEYSSYGKGDFRHPAFQVQGMNGSRITTLKYQGFELEKGKNRLNSLPSTFDDIGQCAETLTIILTDSILDLTVRLNYTIFPEYNVLVRNTEFLNNSNNKLTLLKAMSLQLDLPDSQYDFIQFSGAWLRERQLYRTSLRPGIQAIDSLRYSSSPQQNPFFMLSRRETTEHSGEVYGFNFIYSGNFQNMIEVDHFDTARVTVGINPVEFRFLLNPAESFVTPEAIVIYSDQGMNQMSQQLSDFYRHHLVNPNFSQASRPIILNSWETFYFDLGTEKILDLAKAAKDLGIELFVLDDGWFGHRKDDKSSLGDWVTDRSRLPEGIGFLADEIHKIGLQFGLWFEPEMISIDSDLYKNHADWTIHLLDREKSVGRNQYVLDLTRQEVVDYLFDSISKIIIKTNLDYIKWDMNRHITDIYSIELDSEQQMEFGHRYILGLYQLLDRLITKFPSVLFESCSSGGGRFDLGLMYYAPQAWTSDDTDPIERLKIQHGTSYGYSPSMMTAHVSISPNEQSGRQTSLDTRTNVAYFSSFGYELDVTRLSVEEKEQVREQIQFYKKYRSLFQYGDFYRINSPFSCDSASWQVVSKDKCQSILLYAQLNSKLNPGYTRVYFSGLDKDKCYSVSRFDEFFYGDELMNAGIKVSLSNLALCVPEYLTKLFVIEEVVCKY (SEQ ID NO:540)
>orf03323
MKIENKNVRRNFFWGEGRFYTTDIVNKRAGVMIKNVSKEEFTITLENGIKLSSTHFSAIVREEGDTRIQVSFVCPSIRLRLIFESRDDVLSKQLVLESSTEVIKSVEVESFEFETEDNIFYPKRQDCIKEMANFSGYYVELGQPVYANSLFLGMEFPMSENKVDGRHYVSRYYLGTVVNQEKSLWSCIIGGACSYKKEEIQEAFFEYVEGIAQPSYFRKQYNSWYDHMTDITEEGILKSFSEIRDGFENHGVHLDAYVVDDGWTNYQSVWEFNHKFPNGLRNIKYLVNGFGSSLGLWIGPRGGYNGTEIIMSDWLEAHPELNIGSKNLISNDVNVADFNYLNQMKKKMLEYQKEFDISYWKIDGWLLQPDKPDKSGPHGMYTMTAVYEFLIQLLIDLRKERGGKDCWLNLTSYVNPS PWFLQWVNSLWIQISQDVGFTENAGNDINRMITYRDSQYQEFLEKREIQLPMWSLYNHEPIYAVSANTWYMDHQMFASIPDFEAYLLFISTRGNAFWEFHYSFDMFDEERWKANARAVKWIEENYQTLKYSKKIGGSPEKFEIYGYKCHNQKTSTEILSLRNPAQIKQKIKIENLSIENFTRVIGDFTIQEDEIELAPYSIVILKK (SEQ ID NO:541)
>orf03324
MKHTLETINSRIQWFREARFGMFIHWGLYSIPGKGEWIRSHQKLSIEDYEPYFRAFDPKEYNPREWAKQAKAAGMKYMVLTAKHHDGFCLFDSKFTDYKATNTPAGRDLVKEFVDAVRAEGLKVGLYFSLIDWHHPDFPKYADLNHPMRGNEVYRDEKINFDSYLEYLHNQVKEIVTGYGQIDILWFDYSYEDMVGEKWGASKLIDMVRHYQPNVIVDNRLETSGEGFGSIVTDEITSYAGDFVSPEQIVPHEGIRNFKGEPVPWELCLTMNNNWAYNPTDYLYKSSQTLIRKLVECVSKNGNMILNVGPDALGRINDSSKKILDNFHRWMSRNGEAIYGCSGDENLPKPDWGYYTRNGNTVYAHVFEQPIGPLALLGISKENVKRMSFLHDGSEVKISESWTTNAYKGICFAQFGEVPHFTYPLPDLIDSVIKIELRE (SEQ ID NO:542)
>orf03325
MNTHINGISKKGKVLIYGYMLLTILISIFPIAWIFLSSLKADPMKNPGISLPTDFTLEGYINVFTKLHVFTYFWNSFKVVSISVIISIVMISMSSYVIARMEFRGKKLVTSMLYSTLFIPATAMTFPVYRLVNELGIYNTPVALILVYSCSGIAMSFFIIKNYFEIIPKELEEAAEIDGATYAQTFWKVMLPIARPGILTAAVLAFINNWNEYYWASMLVIDKNELTVPALLGQFTTSFNTNYNGLFSAIVVIVLPPIILFAFTSKYFIEALGGGAVKG (SEQ ID NO:543)
>orf03326
MAQKIMSLQNRKNQKRRFIFLFLLPTLICFFLFYFYSVVTIFLTSFAKWDYTNLNTPEFLGFDKLFENYRYVFKEYPFFTEALINSVRWAVIGVIIQVPLAVSVAITLSKKLKGWKISRNLYIVPSIISSAAMGLIFLQIYNPNYGVVNQIIHLFNPSFKDSVLLTPGLNIVAMTGAYIFFAGASTIMILGQIFAIPEEVQEAAILDNITGWRKEWYITIPMIKGTIKTVSIMAATSGFLLYNEVFFLTNGAAGTKSISFVIRELAVASSRTQYARANTIGVIQILGGMLIIVCINILFRERKRLKGEK (SEQID NO:544)
>orf03327
MNKKSLLKCAVIGLVATFGLAACGTSKDASGGSSSGKEVLEFYHGYHHSEDEWPVAKTMRDLYDKFAEEHKDSGVEFKPTPVNGDLKDIMNNKVASGEFPDVIDLAGNAVSLAAIEQKLVLDLKPYIDSNKLEKNVGLNYKQNQKDGKIYTVHEQLFTMGLWYNKDIFAKAGAKTPDQWNTWDDFTQAMASIRKQDGVYAFGAGEPSIRLFNTVLGTTENGRKLLDKPLTKEGIESKEFADALKMVMKEIQANGSKNAGGDANAYSKDFQEGKSAVFFNGVWASGEMSKNPSLAPGIYPAGVAISSSGGGITISSKMSEAKQKLALEFLKYMTSDDVQKVIFEKVGANPSNENVNVKELSEKSSEATTKILGQAITQVKNAKAVVPTVSDVWGGDVHTAIINALTESAAENVDVDQKVKSTQDVLKSLIG (SEQ ID NO:545)
>orf03337
MNIAIRIILNFFRVMGNHQNSLAMMMGAVVHEFVKFIFTSCIHPRCRLV (SEQ ID NO:546)
>orf03338
MLLIMSIQTTEPAFSRIATRLDKFIDRTWKTSIKTGNLLRKIGYSQFLTLRICL (SEQ ID NO:547)
>orf03339
LQNSKTGLDERRLSRSIFPSQGNKFPTINTIIDMFKNRLLIIIEGQILYRNISHYLISPTKAVKNR (SEQ ID NO:548)
>orf03344
LTSLIPRLMFQKTSQLVSIKILLEGCWIIAIFTEPLR (SEQ ID NO:549)
>orf03352
MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAILVGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAAQLIGWGLGGLLFATIGLLPTTFIILVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESYWGYSNTAYSIGIIISGLIAFRLSEKFLAAKWESILFPLVAMAIVTLTILYFPNAQMFLLFSALVGMLSQLKEVPESVFLQETVEENHLVNVYSVLEVISTLAFSVFVLLMSYITDFGYQPFV (SEQ ID NO:550)
>orf03353
MSKLLDKILSRENMLEAYNQVKSNKGSAGIDGMTIEEMDNYLRQNWRLTKELIKQRKYKPQPVLRVEIPKPDGGIRQLGIPTVMDRMIQQAIVQVISPICEPHFSDTSYGFRPNRSCEKAIMKFLEYLNDGYEWIVDIDLEKFFDTVPQDRLMSLVHNIIEDGDTESLIRKYLHSGIIINGQRHKTLVGTPQGGNLSPLLSNVMLNELDKELEKRGLRFVRYADDCVITVGSEAAAKRVMYSASRFIEKRLGLKVNMTKAKITRPGELKYLGFGFWKSSDGWKSRPHQDSVRRFKLKLKKLTQRKWSIDLTRRIEQLNLSIRGWINYFSLGNMKRIVASIDERLRTRLRVIIWKQWKKKSRRLWGLLKLGVPKWIADKVSGWGDHYQLVAQKSVLKRAISKPVLEKRGLVSCLDYYLERHALKVS (SEQ ID NO:551)
>orf03357
MFLRCATFKLADSRLNIFTCFFFGEIRFNSRNQVVKAFITDGTVISTIIVRGTVPCNQWTKTCPAAFDIINGDVGFWKAVVDNAK (SEQ ID NO:552)
>orf03358
MLQITCVVCISCTKVSLVFTWENKDHTTVTQTCVKVNWL (SEQ ID NO:553)
>orf03359LRSLIRQITYFITPRTCCINNQTGLDFKYLVCQEITSYNTCNLATFVKEEAFCLHVVGNEGTVLVGTFDVFNHETRIVVTEVKIHSTSYQAFLLQVWLAFQDLILAQNLVRSWCVAHTC(SEQ ID NO:554)
>orf03360
LHFNQTSLKTASCRLQGYTSSCDSSTDNQEVQGAFLHFFN (SEQ ID NO:555)
>orf03375
LIHSKMFVKSIRLRKKRVWDKKILILGILYYKFLKSID (SEQ ID NO:556)
>orf03379
MFASKSERKVHYSIRKFSIGVASVAVASLVMGSVVHATENEGSTQAATFSNMANKSQTEQGEINIERDKAKTAVSEYKEKKVSEIYTKLERDRHKDTVDLVNKLQEIKNEYLNKIVESTSKIEIQGLITTSRSKLDEAVSKYKKAPSSSSSSGSSTKPETPQPETSKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPEPETPKPEVKPDNSKPQADDKKPSTPNNLSKDKQSSNQASTNENKKQGPATNKPKKSLPSTGSISNLALEIAGLLTLAGATILAKKRMK (SEQ ID NO:557)
>orf03384
MDREILKFFQDLLSILSHNDMITLFCQKCCNSFSNHFLVICN (SEQ ID NO:558)
>orf03386
MFITLRRICLRACVVEKEQSYLKFLFFQKRPVSFLHVKSVLAGI (SEQ ID NO:559)
>orf03387
MVKTTDRLEAIGFSFILFENLFKPCQLYLQPQNSVLSNLQLAA (SEQ ID NO:560)
>orf03393
MTRKLNPSYTNVASATTLTFNQVASTFRKACLDHVVNLTRNNLKGICQLTPLQLHDTRLI(SEQ ID NO:561)
>orf03421
VKAPIPKAPLAHSFGSASIIAHTIHQKFNLKVPNYRQEEDWTKMGLPITRKEISNWHIKTSQYYLEPLYNLLRERLLTQPLLHADETSYRVLESDSQLTYYWTFLSGKAEKQGITLYHHVLIDLFISYFNPL (SEQ ID NO:562)
>orf03430
LVSVFYSLLQVDNVDSVTFSKDVLSHLRIPATSLVTKVYTSLKKLFH (SEQ ID NO:563)
>orf03440
LIVWILKNHTDLTTYIPNIFLSQTLAINYNLSGFCFQ (SEQ ID NO:564)
>orf03441
MPYNRKPFSTFHVKRNILHIVVVLIFFIAKRKIFYINY (SEQ ID NO:565)
>orf03445
MFKKMSNSSRILFYISVNFCDKRIYRTKLYSDTPVNLFKFLFRQKSNCQSVGQTSSINHFFYSWIVFFFKNNLCHSIPSIK (SEQ ID NO:566)
>orf03458
LADGSGKLAEGGTKLTSGLEDLQTGLASLGQGLGNASDQLKSVSTESKNAEILSNPLNLSKTDNDQVPVNGIAIAPYMISVALFFAAISTNMIFAKLPSGRHPESRWAWLKS (SEQ ID NO:567)
>orf03464
MKNTVKLEQFVALKEKDLQKIKGGEMRLSKFFRDFILQRKK (SEQ ID NO:568)
>orf03483
LVEQLTFNQWVTGSSPVRVIYAGLAELADAPDLGSGA (SEQ ID NO:569)
>orf03487
LWVNIDHIRHISCENACLQYFTSIWYIDDFDLDLRVFLLKITSDFFQGFGHFFFLVEILDGHCVIRRFTIVGASAEPKQS (SEQ ID NO:570)
>orf03496
MKIKEQTRKLAAGCSKHCFEVVDETDEVSNHTYGKATLTWFEEIFE (SEQ ID NO:571)
>orf03504
LIDVLFINSFIGRICFYCYRRIHATCLFLQLFSIVILNVAHTLKHSIFIVITFISRCRNFIIVRILLENQFSRNQGIDNRVGQSRY (SEQ ID NO:572)
>orf03505
MINVNQVSIEVKNTFKNWNFTSSIELTTFSKFSQSPTMT (SEQ ID NO:573)
>orf03516
MGFSMKLIHDLDMHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:574)
>orf03521
MYQDLLRKIAEEKPNYNQEEIQWLLDHLGDPSPEIRDDLVFTSFARGIQEELFTQEQFHFIAEGVSSDGGLDKEIDKIGLPTLERSFRALIYATLLSDDANQQSVFYQELNAGFRNDLSNQGLHYLSKEKDTTGFSSQYGWVHSFAHGADLLTEVVCHPEFPKNRVHEVFDILGQLFKRISIRFTDDEDWRLARVIYEPILQGKLEQEQVASWIKTVDFPIEAREDFYKFSNFRSCLGKSXXIHSLIN (SEQ ID NO:575)
>orf03524
MGFKVSHFKIPSSHLSINVLRTIENFTEIGQGLLHISP (SEQ ID NO:576)
>orf03525
VGFFDFGLTNSCRQVRQFTQTVQDFLVCYHQGIVKEGQGYAGICFKFHPSLGNIGKFVIAIVRRLRHKSIVANMAHLNVDLFQFRKGLLEILKSVKIALVITAKLVDVFTSFLDCTQEILTVLV (SEQ ID NO:577)
>orf03537
LKKAQWGFSNQGPDGLFLVRPTSNRDEIPPRDLAHPQNQLFLSFSQVEKLAWHAPSPLSEFALNISLPLSSPPYLWPFIQSPVLYCASHFSIHFNSFSSHLRLVITISPHLLSSTSLLLHTLCAQHTIHSTDLHHLRTPPPSGLFPLALYTRLAAPTLTYHTLSNIQSLKQQLVXXFIH(SEQ ID NO:578)
>orf03548
MANDNKSHYLIYRVLGISFEEGENIDLYQNKGRFLYKYAGSFLEEAAVLSFNEKFGTENT(SEQ ID NO:579)
>orf03553
MVNAMHFSFSILIEGNSRKVGICLLNRTHTRFKLSQAIYL (SEQ ID NO:580)
>orf03562
MSQQLSDFYRHHLVNPNFSQASRPIILNSWETFYFDLGTEKILDLAKAAKDLGIELFVLDDGWFGHRKDDKSSLGDWVTDRSRLPEGIGFLADEIHKIGLQFGLWFEPEMISIDSDLYKNHADWTIHLLDREKSVGRNQYVLDLTRQEVVDYLFDSISKIIIKTNLDYIKWDMNRHITDIYSIELDSEQQMEFGHRYILGLYQLLDRLITKFPSY (SEQ ID NO:581)
>orf03572
VKEEKKAIVLGADNAYMDKVETTLKSLCVHHYNLKFYVFNDDLPREWFQLMEKRLETLNSEIVNV (SEQ ID NO:582)
>orf03574
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSSKHRFEVVDRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSNIYTARRR (SEQ ID NO:583)
>orf03576
MEXXXFKAGAKFFWAKLGLESLEAKEILRDGGWDDIVKNRCIKREQPRLIEEA(SEQ ID NO:584)
>orf03588
MILLQNKSFCYSISQKETSLCHANVVTLNLHEALLNQISDNFSIKSSKRLSKFFLKSRHGNPGFLAESQEHFFFHLLLLTQVIFCNTSIAFLNRTEIMTNGTMIFIAYSIDITRCPCTNTIVFSIVPVHEIMTTFKAGFGEIRNLIMLKTRSLQLCNDVLKHLRF (SEQ ID NO:585)
>orf03612
MDREILKFFQDLLSILSHNDMITLFCQKCCNSFSNHFLVICN (SEQ ID NO:586)
>orf03614
MEDHLLINAVDEFRSISFFQFFKHAFFHIFLVKTNILSPKTNSLIITKGSRTTCFFFACQIRSSWYLKNQTNIQ (SEQ ID NO:587)
>orf03620
LFIRKKHTWINLTPKQTIFSQMNAQFFINFTRRALKRTFITFTSPTWQFPHIGPGNACLIIT (SEQ ID NO:588)
>orf03637
LNIAYTDNPAHIFGKIFHDNLLALLIIFDDVTCKACLRQDKVNSGLFLDSLFNSGCKGVGFSLVRLVISI (SEQ ID NO:589)
>orf03641
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDLLVAATLLAQNLFTHGYALGKL (SEQ ID NO:590)
>orf03644
MXXTKSSCLITTGRNDSPSTCLPRVASNNDRFSSEFRIIPDFHCSKKGIHVNMDDFS(SEQ ID NO:591)
>orf03650
MHFHIIKLVNHFQLLIKLNRISHPNLHIKSSFLSLVLLFYQKEQDFAIMVI (SEQ ID NO:592)
>orf03660
MDRGESLSDCVCMAGYEPANSSRLSIEGTYENKLYKLISSKYHTTGNDIMVCVPCGYTKYKETPGPHACTSCPGRTHAASTTNTNQDQCNRCPPGYYETNDPSYPCDVCSPNHICVGSDPMDPALMLYSGKRIKCDKNSVTLVPFEENVHLASCLCDKGYMARTRTGIVKCEAVPKNTYKDVVGNVGPTNCPPGSYTLKIGATDVSECVCKRGMFFDKDNKRCTVCPVGMYCLGGRLPNGEHMLPMMCTDGNAVTKDGGATSPGECLCKPGFYLRQDGPGGCVECPENTYKSFISNENCSPCPRIL (SEQ ID NO:593)
>orf03667
MXXAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESLRPEERFPMMSTFKVLLCGAVLSRIDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLATIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPVAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW (SEQ ID NO:594)
>orf03668
MARFIRSQTLTLLEKLNELDADEQADICESLHDHADELYRSCLARFGDDGENL(SEQ ID NO:595)
>orf03669
MTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPVAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGSLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW(SEQ ID NO:596)
>orf03673
MKIHKTVNPVAYENTYYLEGDKHLIVVDPGSHWEAIRQTIEKINKPICAILLTPAHYDHIMSLDLVRETFGNPPVYIAESETQLAPKPLPNNPLGPPXHSFIN (SEQ ID NO:597)
在一些实施方式中,优选的OXC141抗原选自以下多肽或其免疫原性片段:orf00045(SEQ ID NO:53)、orf00068(SEQ ID NO:65)、orf00074(SEQ ID NO:70)、orf00223(SEQID NO:99)、orf00229(SEQ ID NO:104)、orf00360(SEQ ID NO:117)、orf00506(SEQ IDNO:135)、orf00781(SEQ ID NO:177)、orf00785(SEQ ID NO:178)、orf01068(SEQ ID NO:198)、orf01446(SEQ ID NO:235)、orf01447(SEQ ID NO:236)、orf01449(SEQ ID NO:237)、orf01455(SEQ ID NO:242)、orf01460(SEQ ID NO:247)、orf01461(SEQ ID NO:248)、orf01463(SEQ ID NO:250)、orf01464(SEQ ID NO:251)、orf01466(SEQ ID NO:252)、orf01467(SEQ ID NO:253)、orf02661(SEQ ID NO:433)、orf02690(SEQ ID NO:439)、orf02698(SEQ ID NO:444)、orf03318(SEQ ID NO:538)、orf03320(SEQ ID NO:539)、orf03322(SEQ ID NO:540)、orf03323(SEQ ID NO:541)、orf03324(SEQ ID NO:542)、orf03325(SEQ ID NO:543)、orf03326(SEQ ID NO:544)、orf03327(SEQ ID NO:545)、orf03562(SEQ ID NO:581)、orf03660(SEQ ID NO:593)。
6由INV200鉴定的序列
>orf00004
LKGVDDFLFIFEEGFKQGGKARADRDYSGVSSLRNSSKVYLEFLY (SEQ ID NO:598)
>orf00005
LCSALKNSYDIELIKVLSNKAHLYLPIETVTPQTVSTS (SEQ ID NO:599)
>orf00006
MRVAETSIVKKNHQIPCIINQKIAQKLIKKTSMTDIDHQLSISTSTVIRKINDFHFEHDFSRLPEIMS (SEQ ID NO:600)
>orf00010
MFKSNLSLSQSLPHKDFFFFKRIIHLFSLFLLIDFIIIS (SEQ ID NO:601)
>orf00015
VEEVEVAEVKNARVSLTGEKTKPMKLAEVTSINVNRTKTEMEEFNRVLGGGVVPGSLVLIGGDPGIGKSTLLLQVSTQLSQVGTVLYISGEESAQQIKLRAERLGDIDSEFYLYAETNMQSVRAEVERIQPDFLIIDSIQTIMSPEISGVQGSVSQVREVTAELMQLAKTNNIAIFIVGHVTKEGTLAGPRMLEHMVDTVLYFEGERHHTFRILRAVKNRFGSTNEIGIFEMQSGGLVEVLNPSQVFLEERLDGATGSSIVVTMEGTRPILAEVQALVTPTMFGNAKRTTTGLDFNRASLIMAVLEKRAGLLLQNQDAYLKSAGGVKLDEPAIDLAVAVAIASSYKDKPTNPQECFVGELGLTGEIRRVNRIEQRINEAAKLGFTKIYVPKNSLTGITLPKEIQVIGVTTIQEVLKKVFA(SEQ ID NO:602)
>orf00018
MGVSIFLALFYMIPALYFLFRIGKKWELPKKVLILSLLGGMFLSGWLSSFANTYIHDFMNCTPKVRQKKSNFWGVLL (SEQ ID NO:603)
>orf00019
MKLSYEDKVQIYELRKQGQSFKQLSKRFGVDVSGLKYMVKLIDRYGIEIVKKGKNRHYSSKLKQEMMDKALLEGCSQRSISLDYALPNQGMLSFWPAQYKKNGYTIVEKTRGRPAKMGRKRKKTWEEMTELERLQEENERLRTEVAYLKKLKELEERDEALERERQRQLEKWFQEDFD(SEQ ID NO:604)
>orf00020
MVSGGFRLDFLLETARLARSTYYYQLKQLDGVDKDKEIKTEIQGIDNEHKGNYGYRRIHLELRNRGFVVNHKKVQRLMRILGLTARIRRKRKYSSYQGEIGKKAENLIQRQFEASRPMEKCYTDVTEFAIPNSTQKLYLSPVLDGFNSEIIAYHLSTSPNLEQVKSMLEQAFTEKYYENTILHSDQGWQYQHDSYHRFLESKGIQASMSRKGNSPDNSMMESFFGILKSEMFYGYEKNFRSLENLEQAIVDYIDYYNNKRIKVKLKGLSSVQYRTKSFG (SEQ ID NO:605)
>orf00024
VNIATLQNGHILGWQIQHIANKLTSNFWIAKDFLSYQVIGWANARMTYSHISSLFIISQF(SEQ ID NO:606)
>orf00026
VSITFSLTNFFKILINLTAQVSPQVIDEKILMMDLNLNNYLSTVIQLRQDVYTGIKILHRVRHGE (SEQ ID NO:607)
>orf00027
MSRYSYSLDSRKIVFEISCFKEKKASLTLFFHLFESSIMKLATQPSYSSFYSELK(SEQ ID NO:608)
>orf00033
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSNIYLRQGDVDVVEEIFEEY (SEQ ID NO:609)
>orf00034
MKPGAEGWKDERSQDIEEKDNGDGLGYFLFLSMDNRRCRCNGRTPTDRRTYSNQGSQFCIQGKEALEEVGNNQGY (SEQ ID NO:610)
>orf00035
LPNCEDLRDIETKTKQDNGILEQFLGTKGQSNIQLFIRREKGM (SEQ ID NO:611)
>orf00044
LSLLDLRGSLCLRIYLHEPLITTVSQDFTSLSDISHF (SEQ ID NO:612)
>orf00047
MDFKSFIIGLVVGIFGPYMDDLIRKKFLKSSEKKTEKSVKK (SEQ ID NO:613)
>orf00093
MTYEYKSHIYLAETVLNVKDLASQTTFYQQVIGLEILSQTETESILGLGGKVLVQLIQAQESGEVREHXXXFHSLIN (SEQ ID NO:614)
>orf00103
MVSNLVFIGNCNFHNTVI FHLLNRLNQGPLQILSQNHDKGRRLSWIFKSRLGQLNASKNWMGRKEQAMALAIAADLQDQLLFKKAD (SEQ ID NO:615)
>orf00113
MKFNPNQRYTRWSIRRLSVGVASVVVASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKPEGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKENTKDSIDVPAAYLEKAEGKGPFTAGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYEVDLNGNTVGKQGQALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVEKAVKDNVKDSIDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGEAKNPALSPLGENVKTKGQYFYQVALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKDSIDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQLALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKDSIDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQLALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKDSIDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGEAKNPALSPLGENVKTKGQYFYQVALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTIKINVKETSDTANGSLSPSNSGSGVTPMNHNHATGTTDSMPADTMTSSTNTMAGENMAASANKMSDTMMSEDKAMLPNTGETQTSMASIGFLGLALAGLLGGLGLKNKKEEN(SEQ ID NO:616)
>orf00118
MSLQIKLKKLAKELSKLLKDSNLETVDKDVLENSQKELQKAVLFLADEKGSEHTEAEVIDNLKEVIAKLKANA (SEQ ID NO:617)
>orf00129
VGRFFGSSQTSDEFFFSFDSSIVKELSEIVHGFDTVSFR (SEQ ID NO:618)
>orf00140
MKIKEQTRKLAAGCSKHCFEVADRTDEVSSKHCFKVVDGTDEVSSKHCFEVVDRTDEVSSKHCFEVVDRTDEVSNHIRQGDVDVV (SEQ ID NO:619)
>orf00146
MNDDDSRCIHIERDGKTIEFGYLNISSTDRNTSHADGLVGIFNSNFSGVRVRGIAVFLNGPDNLDTTLVGNFQTIWNFRIICIHS (SEQ ID NO:620)
>orf00147
LEFNFCRSIIKNGRDNLPNTNSTSGMATRWANHNWSDDIKDRLKTK (SEQ ID NO:621)
>orf00152
MSCNCAFYRSQFFDVNSVSNYHSHQKELRFPNSILFTYFVKVA (SEQ ID NO:622)
>orf00156
MSKEKVILAYSGGLDTSVAITWLKKDYDVVAVCMDVGEGKDLDFIHDKALKVGAVESYVIDVKDEFATDYVLVALQSHAYYEQKYPLVSALSRPLISKKLVEIAHQIGATTIAHGCTGKGNDQVEYQIAVAKKANEAKK (SEQ ID NO:623)
>orf00157
MRYDFGKVYKEIRESKGLTQEEVCGGVLSRTSLSKIESGKTTPKYENMEFLLRQINMSFEEFEYICQLYQPSQRTEIMQTYLNMRSIIGTSDLVNLFQKCQDYLKTHHDLPIEEIRDMLEVVIYIRQHGAGELSDHAEQVVKKLWRKIEKQDTWYESDLKILNTILFSFPIEYLHLITGKILQRLEVYKNYQHLYDLRIAILLNLSTLYLYNQDKNMCKQICYTLLEDAKNKKSYDRLAICYVRIGICTDDSKLIQKGFSLLELTEETSMLSHLKKEVEIYYQAKER(SEQ ID NO:624)
>orf00158
MKIREIIGTDMYGTTVSGIVSGLNKLNFTVKAVRVALEDLTPKLTFPAILQVKNDLGQNHFVVLHSIKEKINGTRITK (SEQ ID NO:625)
>orf00159
MELVLPNNYVVIDEEEMMYLDGGAIYIPRWAITGAITGAAYAALAAAGGGGLQLVLASYGLRSALVAGIVKGLGVLGIHIGNAFANTVIRSIASAGIGAGADWIFTNIIDGWDGRRDNQLRIG (SEQ ID NO:626)
>orf00161
MATITNALNIAATVAEVFSLGGAIAYGLDIVDGKFDGYLWA (SEQID NO:627)
>orf00162
MKDDQKYLLAGLYSLLVAIFYFPLIESKGIFVSILMAVLLLYLIYFIATVIHIVIIKFIRKKSFKYLVLYPFTYDGSWRFQPINLLYFPEMVRDVIPINLVQEYCQGQPYGLLKKMLKRIRLSREIALLLATIIVYFFTHRILPLSVFTFMFSYILLFVQSYLGSNTAWIGNRRLIIDDEFEKILLSKSYIKEISSARYSEYLTCEYKNPTPIILIAIFENLLDSYLLQNQSEVDLDIFYKVLPLLYKEKYTMGFNYFVSLNYLLYKVGFLGIIYDNEALRDLSKQYLNKNISELQDGSFEGGIQDAVASKQIVVINEFIACLNSKCMPSQYDRFFYKDRPYIFSRKSSIKG(SEQ ID NO:628)
>orf00163MKNKRYFFDTILIILLLISTIFCVSPVFIKLDILGTPSHAILTFVLAIPLFYILSQCLHTLLLLVSSIFCKLRPIYFYFIFVIIIGARKYYRILFHQLMGFSPGIAVFYKESQTTKNLFKFYYFLYFTTLISYYFFFTFVYDKSLLLPLIPFSIIIALVQKLYRIENQQLFLLKSKVLTILESKRDCEFNLQDYHEIWKLQSKSELPCVALSYISLIKPYLSESVREQIDLLEVKRFKKINHPISLYGMLDVIKLNLYLRHYNEKNKYESMLNKILEVRPDFVLIEQNIDDSLNSSQPLSLSLAISEIQLLLEVYIGIKHVSIRR (SEQ ID NO:629)
>orf00164
MIRKPIIFLLMLPIWGLWIELHLLVSNLQLNLEIPFDFVVSTSLTFFVLILSKIVLDILYALKDLYKKEALITIFPFIFIGRKKVNVRFSPYFSFHRKSLSPDDLRSRIIWSFILEIAIILVFILKIPFAIIMLTTIFFWTIMDINHLVFNKTEFLFNQNKWQKEDSFESDLTKTLKDKIQKSELSYSDLMSLQLYDAMNQSTFLTDSELFEDILKKIEDSHNTLLCTGLVELLLYEISISNNNNWQEKVDKIRIQLIRINQLDFFYYTSWLRQNFDFCMNREYHKMKSRKLLLSNKKIV(SEQ ID NO:630)
>orf00165
MELVLPNNYVVIDEEEMMYFDGGAYLSKRACQGICAALAMSSGTFIALAGAAVLTKKLINYIKVGGLGGWLIGAAAGKIAYYIGYGVLNRGCDINGNPYPWDGFISATVR(SEQ ID NO:631)
>orf00166
MSNVDKIRKIHIIVCWVYIFLSFRAIINDTEYFLLIFLAFIYSIVSLPLYSVKNKIVSICLAINSILLMSFPILINKFFPESFLTYTVLISVFITELIIFHLIGKDFDIKLTNEYKKISQFRSKVSQSPWIKYLEISSFILTIFPSILYGTVDNHVLTLIFLIKICVDTTIKFLFIRLFDTSTLMKRRIFFLFALDVIAYLFLGYLLVIQKAGYLFSVLLLFSNFSVPFIKEKEYELFKNSK (SEQ ID NO:632)
>orf00167
LFNEIKKTSSLIGNVFIGMKEDDAMFKKRIEKGKSSVFIFLE (SEQ ID NO:633)
>orf00168
MNKKKMILTSLASVAILGAGFVTSQPTFVRAEEAPVASQSKAEKDYDTAKRDAENAKKALEEAKRAQKKYEDDQKKTEEKAKEEKQASEAEQKANLQYQLKLREYIQKTGDRSKIQKEMEEAEKKHKNAKAEFDKVRGKVIPSAEELKETRRKAEEAKAKEAELTKKVEEAEKKVTEAKQKLDAERAKEVALQAKIAELENQVHRLETELKEIDESDSEDYVKEGLRVPLQSELDVKQAKLSKLEELSDKIDELDAEIAKLEKDVEDFKNSDGEYSALYLEAAEKDLVAKKAELEKTEADLKKAVNEPEKPAEEPENPAPAPKPAPAPQPEKPAPAPAPKPEKSADQQAEEDYARRSEEEYNRLTQQQPPKAEKPAPAPVPKPEQPAPAPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGAMATGWLQYNGSWYYLNASGAMATGWAKVNGSWYYLNANGSMATGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGLGALAVNTTVDGYEVNANGEWV (SEQ ID NO:634)
>orf00190
LKKRMNRWQFLLNQSKEMVGILLLKMKEQELIEFVVNL (SEQ ID NO:635)
>orf00191
LIKVIKRKAFGFRNFNNFKKRILMTLNIKKESTNFVLSRL (SEQ ID NO:636)
>orf00195
MTYNEKRLTNSLERVHMEQLKNTTDLLGLEDKNIKILSVLKYQTHLVVQAKLDSPAPPCPHCQGKMIKYDFQKASKIPLLDCQGLPTVLHLKKRRFQCKNCLKVVVSQTSIVKKNCQISNMVRQKIAQLLLEKQSMTEIAHRLAVSTSTVIRKLREFKFETDWTKLPKVMSWDEYSFKKSKMSFIAQDFESKSILAILDGRTHAVIRNHFQRYQREVRELVEVITMDMYSPYYRLAKQLFPKAKIVLDRFHIV (SEQ ID NO:637)
>orf00196
MGYSLKKSRTYCEQDPEKVNRFLKELNHLSYLTPIYIYETGVETYFYLEYDRALSRQLVSLEEDIII (SEQID NO:638)
>orf00201
MRFYCEAYEVITEEKKVYFNDIELEVKYSSVEELFIICEKLLDKKKVSFFYVDEKPLRYLLFDYIFLLVLAKKNIPILDGVSNKQVDPTLLHHFSLEIEKNFIDFCYKNMDLILKTQSISLCHREELIIVDVSDDSKFGVFYKRFRTLVDKNNGKYVCVYNFSRVSDILQNWKNYCNRKFSVTFTESQFELFKLLYNQKNFKTISLLFGKKIVAGGIIYYSDLTNIEYFCIFWWDSMFGKDSIGKYVYVEEISRCHFLDRNYSFCYGLQDYKSKLIKYFLE(SEQ ID NO:639)
>orf00202
METRNLISYSLTDIFETDKIRIELLGEIYYKNIKLELHEFAGLYKIYGISLIKNITGMFLIIIFDTKTKELKIFQDITTSYFNLYYTVYGGVFYYSTSLKKVMKLSHVPVTLNNKKIQEFMRNGFILDSNTLVTEINKLEYFSYISVNNTLRICGIDYNDSNNFTKEQVLKNWDSMLRESILRVYSEAGEANITLSSGFDSNYILYTLANYTNSSINAFCIGGEKGINEIPEVTKIAKFYGINLLVDTVMSQDLEYFPDIVWRLEGSLFECGVILQYYLGRLLFQKGNTSILCGESADEIMTFKYHSVNYNQFCNDKQKSVYFSYSDYPFYVTNSIVLKKNSLLLHSFGIHPRYPYKMSEIVEMSKKISDLNDKKEFHKKNCELRFQDSVLDNINSVPGTTHLFSCLNIQTLVKIILYIFRYNSCMKIFNFRDKELIFFNKIVNGLIQNIEENLEDDIERILKYLYICLFNEIFIIKNKVNFFDDVEFNQTLSEFLDKL (SEQ ID NO:640)
>orf00204
MKFFCENNLNITFFDTFSEIKNNIDYYIIALPTDYDEKIGSFNTYEIEQTVSKILRVKPNGKIILKSTVPIGFSNKLKRLFDTKNIIFVPEFLREGCSIYDNLYPSRIVVGDETVEGRKIAELFLSISTHSTANIKNVMLVSPTEAEAIKLFSNTFLALRVAFFNELDSFAERRSLNAEVVIKGVCLDPRIGNFYNNPSFGFGGYCLPKDTKQLKKEFIEINAPVIEAIDISNTNRKQFIVKQILERKPKIVGIYKLGMKYNSDNYKESAILSIINELLIVGIKILVYEPNLNVSIDNVIFEKNFELFTKQSDLIVANRWDRGLEAYKDKVYTRGIWIRD(SEQ ID NO:641)
>orf00206
LEEESFIMENTEFSLELDVTEVATEQDYVSSGVTSTGCCKN (SEQ ID NO:642)
>orf00207
MFKIKDNYIYRQCVNDSILIEKLNENNLEIFFDSKIFQEMLMVANPRFFNELTKEKIYQNSTFRNYAKRSLTRATPFGLFSSVGVGSFSKVSYPQQIRENYSKKVSVSGEWISSLCMMLENEDSVLLQLHLQWNQKVLELSDKYQLNNINYWGVSEQSRDILIKKTALLEFIKKLTYKSEVSVLDLVQEIQTKSPNLETQKIIDYLRNLIISEFLFTNLRKVVINHNCLDNLIYILSSINEQTKLTTDLLQLKSCIEKYSKSELGEGILQYAEICEKMSHIFNEEKQRYLKVDLVNSYDSLLPKDLKKTLEDFVNFISRINLGKDYRNKELISYTEKFVEKYGEYVEVPIKQLLDSKLGLGIPKQNLEPYSILSSVAEQTFLSYLSKEIFKAVKNNKKEIDISNIPPELLYPNLDRFAVNQFELYCEMKNFGEQPVISIVPNTGSDMIGKSIGRFASYFLNSNIELDSRVDNVELIEFPSDNKNLNVMSSHHGHSKKLLLSYEDDFDIDSLELDFLVVGVERVNEHYKLYFRDLRTDLIVNFVTTSMLNHKSIGVFSHLARFLLTVSLEWQDNPFSLFRVIENLDFLPYIPRIKYKNIILSEEKWILSDVDKKDMSTISQWKKFFDVPSLLYFHKDDERLLIDLKNSLDVQWILKQNVDKLHFTRFDKIDGKNCEFIFGFENPRNSVYPHSVSEKTVRRIENDFYKDYVKTFSSDWIYFKLYGINSSTMPELRENLLIFTDELLAEKLVSDFHFVNYNDGGDGSIRLRFKIMNEDDFEKLRYRIIHWIDFLLNHYFCKDVSFNLYEREVERYGGIGFLTVCERIFSIDSYLVLKLFSKKVLKVDDYLSVLHSIFIYIRLLGISPKQLLKLMKDTFTQNIYRKSFKKVFPNNAKVIKEFKQYFEDQSKFDIFNEVFKSFSPIEKHFEYKNDMIHSLLHMHMNRIGIFSLNEKEYLYFVRYILEVLNNYEKYN (SEQ ID NO:643)
>orf00208
MRNIIKKYDEIINKVDSLVLDNNIIDLLQRSCYTENRSYLSEYPSIIIYLSYRLANCDDNEHSKLLYNRVNYYLHELLKSIKLNSRNNISMCYGFSGYVYALKLLPKRSKEYSKLLETLETILVSLTRDRLSEIKKSNKVKEEYIDVIQGVSSVGKYFLSKDKLTSNQELLLKGVLNYLAGVINNKPTIYPEYMPNEKLKRKFPNGYINLGVAHGILGPLYVLALGFKKFNMPEYLISLKKGLSYYEKTFQTNKIGKIIGWNGRVSAEVESEKFEYNLSWCYGSLGMARVLYNISKIIDIPKLQELATDVFHSSIYYLNSSEILNNAICHGRSGIMLLFNLMYLDTGESQFKAISDNLFKEIVNKATDSEYIFVERDIYFRGVNYDEVIEYIDFCLLNGVSGIVLALMAQRTGNASPLAEMFFMQ (SEQ ID NO:644)
>orf00209
MKKILNNKLYMKVLVSDLISNFGDTLYFIALMTYVTEIKSSNLAISIVNISETIPILFTIFFGIIADRTLNKVGMIIKTLWIRTILYLLVAVVMNFKESILVVILASIVNLISDTLGQFENGLFYPISNRIVKKSDREETMAFRQTATSTMNIVNQSLGAFLITFLSFFHLALINSLTFAISLLITLAIKSQINNFYIDKTPSTKVSKVDFKATFSDIISNLKLSLKHLFSLTNMKTVLLVIPILNGSLAIIIPLAVVNLSKSSALTIISSATTISVLGISTVSGGILGGTLILISKKFKNLSIENLLKMNLMTILLSFIAFYYQNIYFIVLTLFLSSVFVSALNPKIGAIIFNNLDETKLATIFGGMVTYFQLGDVVSRLLFSTLVIYLSYTYIAVIYMILVLIVAIYTFRRVQTTS (SEQ ID NO:645)
>orf00213
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSSKHCFEVVDRTDEVSNHTYGKVKLTWFEESFEEYK (SEQ ID NO:646)
>orf00217
LKSSILSKMGDFSVRYCNLVGTVLFGVVLIAILRLVF (SEQ ID NO:647)
>orf00246
MGLDVGSKTVGVAISDPLGFTAQGLEIIQINEEQGQFGFDRVKELVDTYKVERFVVGLPKNMNNTSGPRVEASQAYGAKLEEFFGLPVDYQDERLTTVAAERMLIEQADISRNKRKXXIPFIN (SEQ ID NO:648)
>orf00270
LNTSYSFGKKDQFALEHCFCIKLSIFARAVTLFVSCIN (SEQ ID NO:649)
>orf00291
MLIGEGYRTFPVLIYTQFISEVGGNSAFAIMAIIIALAIFLIQKHIANRYSFSMNLLHPIEPKKTTKGKMAAIYATVYGIIFISVLPQIYLIYTSFLKTSGMVFVKGYSPNSYKVAFNRMGSAIFNTIRIPLIALVLVVLFATFISYLAVRKRNLFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFILIMSLSVRRLPYTIRSSVASLQQIAPSIEEAAESLGSSRLNIFAKITTPMMLSGIISGAILSWVTMISKLSTSILLYNVKTRTMTVAIYTEILRGNYGVAAALSTILTVLTVGSLLLFMKISKSNSITL (SEQ ID NO:650)
>orf00292
LIIIASMSAPFVGAYSWVLLLGRNEVITKFLTNALYLPAIDIY(SEQ ID NO:651)
>orf00293
MERKKLNIWTVSSFFLFLTYPIFLVYPIVTVLKQALIHEGQFSLANFVTFFSKAY(SEQ ID NO:652)
>orf00295
LLSTTEFIGLSIRILSNLHEFKILVGLLNQFFFWNLLLHKTKSNVVSDSQMWENSVVLENHPDIAFAGFHIIDFCIIEVKFSTFDTVETCNHTKKGRFPTS(SEQ ID NO:653)
>orf00314
MITIKKQEIVKLEDVLHLYQAVGWTNYTHQPEMLEQALSHSLVIYLALDGDAVVGLIRLVGDGFSSVLVQDLIVLPIYQRQGIGSALMKEALEDYKDAYQVQLVTEETERTLGFYRSMGFEILSTYNCIGMTWMNRKK (SEQ ID NO:654)
>orf00325
MKIKEQTRKLAAGCSKQCFEVVDRTNEVSNHTYGKATLTWFEEIFEEYNTNLEYKQPICSQEKA(SEQ ID NO:655)
>orf00359
MVDNIPKRVNDVIRQAGNNAKTSRPHVGIGKSHISVSFLFPYHTANRIKNQEKVIF(SEQ ID NO:656)
>orf00375
LFDLLDHGLDTVLVCHVTDISMGLDANFTISFNPFIDQILIDIVKDNSSAGFSVGFGNSKSNSIRSAGDESNFSF (SEQ ID NO:657)
>orf00387
MKSLARLLIIHVFISIFLFFALTSGAISHTVLLLLLLFLPALNKGLEKIQSKRIPVLNAALFFLLISFPQLLTNPVQWKFSIFLVVTIISSLAYFYNFYQVVKEVDQKQLI (SEQ ID NO:658)
>orf00390
LEAAGEIETEFQGWIVLVVFNHIDSLSRDTDILGEFELGNTQFLAKFFHTIHLISFLIYVVYI (SEQ ID NO:659)
>orf00403
MEGVNHVDIIKVSCCSFISQVNWMMKGKIPNREGFKFSVARFDAIDLVVVHIGHTRCQFSRTGSRSGYDNQVATGFDVVVFAHAFWGNDVIHIRRISFDWIMKIRINSVFLKLVAEGICSGLASVLCNDNGTNKNP (SEQ ID NO:660)
>orf00404
MFNVASINGNHNLNLLFQFLQELDFVVRFITRKDTSSVEIF (SEQ ID NO:661)
>orf00409
MIDIHSHIVFDVDDGPKSREESKALLIESYRQGVRTIVSTSHRRKGMFETPEEKIAENFLQVREIAKEVADDLVIAYGAEIYYTLDALEKLEKKEIPTLNDSRYALIEFSMHTSYRQIHTGLSNILMLGITPVIAHIERYDALENNEKRVRELIDMGCYTQINSYHVSKPKFFGEKYKFMKKRARYFLERDLVHVVASDMHNLDSRPPYMQQAYDIIAKKYGAKKAKELFVDNPRKIIMDQLI(SEQ ID NO:662)
>orf00410
MKEQNTLEIDVLQLSRALWKRKLVILLVAIITSSVAFAYSTFVIKPEFTSTTRIYVVNRDQGEKSGLTNQDLQAGSYLVKDYREIILSQDVLEEVVSDLKLDLTPKGLANKIKVTVPVDTRIVSVSVNDRVPEEASRIANSLREVAAQKIISITRVSDVTTLEEARPAISPSSPNIKRNTLIGFLAGVSGTSVIVFLLEFLNTRVKRPEDIENTLQMTLLGVVPNLSKLK(SEQ ID NO:663)
>orf00413
MDKKGLEIFLAVLQSIIVILLVYFLSFVRETELERSSMVILYLLHFFVFYFSSYGNKFFKRGYLVEFNSTIRYIFFFAIAISVLNFFIAERFSISRRGMVYFLTLEGISLYLLNFLVKKYWKHVFFNPKNSKKILLLTVTENIEKVLDKLLESDELSWKLVAVSVLDKSDFQHDKIPVIEKEKIIEFATHEVVDEVFVDLPGESYDIGEIISKFETMGIDVTVNLNAFNKNLGRNKQIHEIVGLNVVTFSTNFYKTSHVISKRILDICGATIGLILFAIASLVLVPLIRKDGGPAI FAQTRIGKNGRHFTFYKFRSMRIDAEAIKEQLMDQNTMRGGMFKMDNDPRVTKIGRFIRKTSLDELPQFWNVFIGDMSLVGTRPPTVDEYDQYTPEQKRRLSFKPGITGLWQVSGRSKITDFDDVVKLDVAYIDNWTIWKDIEILLKTVKVVFMRNGAK (SEQ ID NO:664)
>orf00414
VTFDKEDARSILENEIFYPCYYPTNRNLKNLIKNTILAFKILRKERPDIIVSSGAAVAVPFFYLGKIFGAKTVYIEVFDRIDAPTMTGKLVYPVTDRFIVQWEEMKKVYPKAINLGGIF(SEQ ID NO:665)
>orf00415
MIFVTVGTHEQQFNRLIKEVDRLKGEGFIQDDVFIQTGYSNYVPKFCKWEKVISYEKMNQLIKESDIIITHGGPATFMAVIAKGKNPIIVPRLKKFGEHVNDHQMQFVKITKEIYNLIVIDDISDLHLILHNFKDKHFETYLNNERFNVRFNVEISNLFKGNKINEN(SEQ ID NO:666)
>orf00416
MKIRIEPQYFLYKYLWFIILLPKQFMQLILFFLIALTLLPTYIKEKQVFKIDTPSFCMVLWTIIYSISIIFNSLIDGLAVQVIFSDLSKAFNWLIAVFFYNYYLKMPINIDRIKRYMYYNFTILVVFVGLFYIQRGSNVILFGRSLLDWDGFTLATSYGVRYTGFLEYATLNGQLILFLLPLIRLFRFRFFTQTIIFAFLLEVLVLSKSRIAIVAMLIYIAFAVVNEINSNNKWLIGIFCPIIPFMLFYNFEKIKQIFFQMFSSRSGSNATRFRVYEESLKAINGMEMLLGAGVRIPSTVDILLGSHSMYISFIYRTGVLGSIIITVMFYYLFSKFLKCDSSERLRSIGYILALSVFWLFEELDPHYWCLILFFSTISIFINNRKEEIVG (SEQ ID NO:667)
>orf00417
MIEVSIIIPIYNAEKTIKNCVDSALKQNLESLEVILVNDGSNDSTSKILEQYGDNPQVMIFHQVNMGVSAARNVGLSYASGEYVFFLDSDDILDEGMLSKMYQFAKSNKIDLLSCWHKEPSTTQYGGNDNSSASFIARTKEEIGNHFVDIFPRSACAKLFLRRRIEENNIAFSTEMSLGEDMSFVCQYLMVSRSIAVIDGLYYTIQNVNPQSLSKRYVSNIENSLLMQNQLWDQLLEVYPKIEENYYKQHMDFRFYLASLYVNNLFKFDSPYSSKEKWDNIAQQLKKYRPFLDEKVSKEKKPKNMNEMVIFYLLKSKIPALIYSFYSFKEWWKKKRLKN(SEQ ID NO:668)
>orf00418
MEDLVSIVVPVYNVEKYLKKSIESILNQTYDNLEVLLVDDGSTDSSGEICDSFIKVDSRIRVFHKENGGLSDARNFGIEHMKGQYVSFIDGDDYISKDYVWKLYHSLKNNNSEVSICSFSLVDETGEKIKDELLDSGEVSLSGQQILEKALTADGYRYVVAWNKLYRSTLFEKLKFKKGMLYEDEFLNYPLFWDCKRVSIVEEPLYLYVQRKGSIIQSNMTLEKIKMKDKMHTSRIEFYAEKKNSFLHQRSCQQYCNWIVTITVSHYNVLNVAFLKYLQHQFRRIVKYTQNDDKKLIIQNILGYINIRLAAYVKSKVM (SEQ ID NO:669)
>orf00419
MFPIYIISNQNIAFQQEIDIAYRKMKRQFSHISLTESEQKNDMNISNKVWICWFQGEERPPELIRTCIQSMRTHFLGREIIVLTEENISDYIDIPDYITDKYKKGSISRAHYSDILRVELLCRYGGLWVDVTVLNTGGDFSNLELPLFVYKSLDLSRKDSQAIVASSWLISSYSNHPILLYARKLLWEYWRRKNSLCNYFLFHIFFTIATELYPIEWSAVLTFNNHSPHMFNFELNNQFSEKRWEQLKQISVFHKLNHHIDYSIGVNNFYKFIVFSKVEKNE(SEQ ID NO:670)
>orf00420
MSNKISKNLAYNIGYQLIGIAFPLITSPYLSRILGAENLGIHSFTISVALYFMMFMLLGIANYGNRTIATVKREGKEILSKTFWNIYYVQLLMSVLVTIAYLIYLYFWVSSYKFIAILQLFLLLSNAVDITWLFYGLEDFKQIVFRNTLVKLLGLFLIFSFVHESSDLWKYTLINGGVTLVGQLLLWGQLKGRLSWVKIQKKDLLSHIKPILVLFIPVLAISIFSNMDKYMLGLMVGVKQVGFYDNANRIIDIPKALIAALEAVMLPRTSYLLAEGQEEKSNYYIEVTILYAMMISSVLIFGIISVSDIFSLVFWGEEFLESGRLIAAMAPVFVFSVPGNIIRTQYLIPRAKDKDYVLSLIIGALVNILLNCFLIKPFGAMGATISTVLAEFVLYGVQFWTVRRDLDFKKYLKNGFIFYLFGMIMYLAIIAVKAHLQYNIINLVLLIVLGGIVYTGFCCFYILISRNVHFEILREKIKRKIGYENIL (SEQ ID NO:671)
>orf00422
MFVADIMISDYSSAPIDFLLLNRVVFLYLPDFKEYQSDKNPFFEVFKVSKTKGIALDPFDEIIGRFQFGVRIV (SEQ ID NO:672)
>orf00428
MGFSMKLIHDLNTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:674)
>orf00431
MEQLHFITKLLDIKDTNTQIIDVVNRDSHKEIIAKLDYDAPSCPECGSQMKKYDFQKPSKIPYLETTGMPTRILLRKRRFKCYHCSKMMVAETPLVKKNHQIPRIINQKIAQKLIEKISMTDIAHQLSISTSTVIRKLNDFHFECNFRNLPKIMSWDVETVRGVTVSIGRWR(SEQ ID NO:675)
>orf00444
LQIAQESSQDTDGINPPVVEEAMVFDRNDCLNQICGNIISLGIDAAFRTQVSNELIFIVVDFTRSCCN (SEQ ID NO:677)
>orf00446
MLNLMWMKIFHRNRTFLFCFLDFKVDVISIINARIVRR (SEQ ID NO:678)
>orf00447
MYNSQALRQIVVVGSIDHLFKRHSSICEIFGLRKRCLSFL (SEQ ID NO:679)
>orf00472
MSLADLLEELEAAKDSKKARSMEAYMRHQFSFLGIAVPERNKLYKNIFQKRKKQRLSIGILQTLAGKRSLENTNMWLLTI (SEQ ID NO:680)
>orf00473
MEKILLHNLNQTEFFINKAIGWTLRDYSKTNPTWVTCFIEKNKERMAELSIKEASKYL(SEQ ID NO:681)
>orf00477
LSTCWNGKFCHICVALFHCFRAFKLALNEILCLLTNVSFIFVSVAF(SEQ ID NO:682)
>orf00487
LLGSFFSWTTKELMGIIFFNNFPTVHKNNMMGYISSKTYLIKLIKNSI(SEQ ID NO:683)
>orf00509
LKNVFSVGCHFFQFFVRFFWFGKFDHFNLVELVQTDQATRITTGRTSLRTE(SEQ ID NO:684)
>orf00535
LIDIKHFFLCLPLSKKMIIDIIVNKNPDRFCMIEKVKKTMAENR(SEQ ID NO:685)
>orf00539
VNIDSSEFYISHITDGIFDSFLDSNRYLRNFYSVLKVEIDICCEFFVHVFKINATAE(SEQ ID NO:686)
>orf00540
VNTLYLCSSDSNDFFKYTWGDNDFAKLFFNSHRMTSF (SEQ ID NO:687)
>orf00550
VKEEKKAIVLGADNAYMDKVETTIKSLCVHHYNLKFYVFNDDLPREWFQLMEKRLETLNSEIVNV (SEQ ID NO:688)
>orf00551
VSNEIKIIALKLSIFWGHNHFRLTGNWKIFYLCLKSGLA (SEQ ID NO:689)
>orf00552
MKRIQLNMNETKKYLVIKAIAQGKKTKKRACVELNLSERQINRLLLAYQQKGKEAFRHGHGNRNRKPKHAIPDEIKERVLKKYLSYETYKPNVLHFCELLAEEEGIKLSDTTVRKILYKKNILSPKSHRKTKKRVRKQAKLNLNQPLDNPILPTAKDFLEDPKKVHPSRPRKKFAGELIQMDASPHAWFGPETTNLHLAIDDASGNILGAYFDKQETLNAYYHVLEQILANHGIPLQMKTDKRTVFTYQASNSKKMEDDTYTQFGYACHQLGILLETTSIPQAKGRVERLNQTLQSRLPIELERNNIHTLEDANTFLLSYIQTFNEQFGNKTKLSVFEEAPNPSERNLILARLAERVVDSGHHIRFQNRYYMPVEQGKEVYFIRKTKALVLKAFDGDIYLNIADKIYHTKELLDHELYSKNFEQEPEQKKKDASISLHKPIRGNSHLSNNTFIKIKRIMKSLLVRSFILLNYKYNLFFAKWEAFP (SEQ ID NO:690)
>orf00556
LVEQLTFNQWVTGSSPVRVIYAGLAELADAPDLGSGA (SEQ ID NO:691)
>orf00557
MGEEEMRNKMIIAVSLVVAGVMTYLMFSGLDEDFYHFP (SEQ ID NO:692)
>orf00567
MNTIERTRRLVKGCATHCFEVVDRTDEVSSKHVFEVVDETNEVSSKHVFEVVDETDEVSNHTYGKAT (SEQ ID NO:693)
>orf00581
MLSNDFIQLRKDDIKTTSVLYFPIRLFSLETMNMSSQYF (SEQ ID NO:694)
>orf00582
LTCYPNPQKRLEAGFDKLIEIKRLTASKIQDILSVAPRSIGTTSPAREFEIIENIKHYKRLIDKAKKCVNDLMAEFNSVITTVTGIENRLGAVILAEIRNIHAFDNPAQLQAFAGLDSSIYQSGQIDLAGRMVKRGSPHLR (SEQ ID NO:695)
>orf00595
MKRIQLNMNETKKYLVIKAIAQGKKTKKRACVELNLSERQINRLLLAYQQKGKEAFRHGNRNRKPKHAI PDEIKERILKKYLSYETYKPNVLHFCELLAEEEGIKLSDTTVRKILYKKNILSPKSHRKTKKRVRKQAKLNLNQPLDNPILPTAKDFLEDPKKVHPSRPRKKFSGELIQMDASPHAWFGPETTNLHLAIDDASGNILGAYFDKQETLNAYYHVLEQILANHGIPLQMKTDKRTVFTYQASNSKKMEDDTYTQFGYACHQLGILLETTSIPQAKGRVERLNQTLQSRLPIELERNNIHTLEDANTFLLSYIQTFNEQFGNKTKLSVFEEAPNPSERNLILARLAERVVDSGHHIRFQNRYYMPVEQGKEVYFIRKTKALVLKAFDGDIYLNIADKIYHTKELLDHELYSKNFEQEPEQKKERRKYIPPQTHPWKLTSFKQYLHKNKKDYEEFTSEEIHSPQLQV(SEQ ID NO:696)
>orf00601
MDTKSSCLITTGRNDSPSTCLPRVASNNDRFSSEFRIIPDFHCSKKGIHVNMDDFS(SEQ ID NO:697)
>orf00604
MMSIREQDLKDIGAIIKYKNFHSPFDTFKYLKDMGFDTIDLSVLLEGFSYAYGMDWLEKFFKENQDKLREFY (SEQ ID NO:698)
>orf00610
MIPLYRTDNDITKFFTKIRNGHLAKTAGGLDDKFHEANASTSKAFDRQGVGEVNDIRDSAGSQELRINDKRKTENILFLEIRVRIFRVPHPNDSFFSSHFLG(SEQ ID NO:699)
>orf00611
VLSQGDKDITILDAGLLKNGKIGPVTKDTNDIKATDNMIENSFVLLNQQNIMLFCNQGATEGKTNFSPSDKDNFHNKTYFFMM (SEQ ID NO:700)
>orf00616
MKIKEQTRKLAAGCSKQCFEIVDRTDEVSSKHGFEVVDETDEVSNHTYGKAKLTWFEEIFEEYKMMGKAGQLVFFDVYRLVRQVS (SEQ ID NO:701)
>orf00645
LVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVADSHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDALI (SEQ ID NO:702)
>orf00657
MTPEQLKASILQRAMEGKLVPQNPNDEPASELLKRIKAEKEKLISEGKIKRDKKETEIFRGDDGKHYGKFADGSTQEIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYGNIPMNWVVIKIKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSIEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQLWK(SEQ ID NO:703)
>orf00669
MCKANSRNDIFILQDSFCFEIFSRKKFKIVKEVLPNSTCKFRVVQ(SEQ ID NO:704)
>orf00673
VDRTDEVSSKHCFEVVDRTDEVSNHTHDKPTLTWFEEIFEEYHSPFHN(SEQ ID NO:705)
>orf00674
LDNIHIVLDSLNAVSGIQDFICDGLAIFCNQITSGCSSCK(SEQ ID NO:706)
>orf00683
MGPLLMHLCQQLVWLAKYLKRAGSDMMFLQEFLNRRFNPSLLGKIIL(SEQ ID NO:707)
>orf00684
LVAKGQGHKLRVSRHKDNQGIGVLFPNLSSHFQPLHLLISNLNIQKE(SEQ ID NO:708)
>orf00692
MTSYKRTFVPQIDARDCGVAALASIAKFYGSDFSLAHLRELAKTNKEGTTALGIVKAADEMGFETRPVQADKTLFDMSDVPYPFIVHVNKEGKLQHYYVVYQTKKDYLIIGDPDPSVKITKMSKERFFSEWTGVAIFLAPKPSYQPHKDKKNGLLSFLPLIFKQKSLIAYIVLSSLLVTIINIGGSYYLQGILDEYIPNQMKSTLGIISVGLVITYILQQVMSFSRDYLLTVLSQRLSIDVILSYIRHIFELPMSFFATRRTGEIISRFTDANSIIDALASTILSLFLDVSILILVGGVLLAQNPNLFLLSLLSIPIYMFIIFSFMKPFEKMNHDVMQSNSMVSSAIIEDINGIETIKSLTSEENRYQNIDSEFVDYLEKSFKLSKYSILQTSLKQGTKLVLNILILWFGAQLVMSSKISIGQLITFNTLFSYFTTPMENIINLQTKLQSAKVANNRLNEVYLVESEFQAPENPVHSHFLMGDIEFDDLSYKYGFGRDTLTDINLTIKQGDKVSLVGVSGSGKTTLAKMIVNFFEPYKGHISINHQDIKNIDKKVLRRHINYLPQQAYIFNGSILENLTLGGNHMISQEDILKACELAEIRQDIERMPMGYQTQLSDGAGLSGGQKQRIALARALLTKAPVLILDEATSGLDVLTEKKVIDNLISLTDKTILFVAHRLSIAERTNRVIVLDQGKIIEVGSHQELMQAQGFYHHLFNK(SEQ ID NO:709)
>orf00699
LRIYLHEPLITTVSQDFSSLSDISATHFEQLHIVAIVHSDIQRNNSPLTCDNRLSLHSVKFLFTRIIG (SEQ ID NO:710)
>orf00723
MGLIKTLAKIYGNYFLTVQGVKVMKTIKKDDNAVVGLGKLFIADKLMDTARWLIKPEDKK(SEQ ID NO:711)
>orf00724
MKFFWGLLAILFIKPIIGIVKFFWMIISFAVQLLFYKILDWFFKLI(SEQ ID NO:712)
>orf00725
MKIKEQTRKLAADCSKQCFEVVDRTDEVSSKHRFEVVDRTDEVSNHTYSKVKLTWFEEIFEEYKMILLLILYHMERD (SEQ ID NO:713)
>orf00733
MHSQTFQFLLMTDKTSLLHRKHRSFIRNIHSKFLILFDLLCGILSRNDSNHNPIS(SEQ ID NO:714)
>orf00736
MARTELPDKIETERLVLRVRTVADAEDIFDYASLPEVAYPAGFPPVKTLEDEIYYLEYIFPERNQKENLPAGYGIVVKGTDKIVGSVDFNHRHEDDVLEIGYTLHPDYWGRGYVPEAARALIDLAFKDLGLHKIELTCFGYNLQSKRVAEKLDFTLEARIRDRKDAQGNCCDDLRYALLKSEWEVI (SEQ ID NO:715)
>orf00741
MGKIVAIDLFNGAGGTTSGLKKSGIDVQVAVEIDSVAVKTYKLNNPEVSVIDME(SEQ ID NO:716)
>orf00746
LLRKQEGEYLRAENAILKKLRELRLKEEKEKEERQKLFKN(SEQ ID NO:717)
>orf00768
LKHLFCHFNPLWIDEIIRLAYKDQDTKDVKSKVKIGN (SEQ ID NO:718)
>orf00792
LCCNRHIANLDLEFISYYLGQVGFDTRISTGLGIFVTKIGNVLFDTDNQFASFLNVCDTSISLDWFGSSKAEKANQ (SEQ ID NO:719)
>orf00817
MKTKEQTRKLASGCSKHCFEVVDGTDVVSSKHCFEVVDRTDEVSNHTHGKATLTWFEEIFEEY (SEQID NO:720)
>orf00819
MDFFFMNEVKEQVLFRDNHSEHIFWIEGVSDFMIKVNTALW (SEQ ID NO:721)
>orf00839
MEELVTLDCLFIDGTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEEITPLLIKYAMFDKKQKRGYKQSAKNLANWHYNDKEDSYIHPDGWCYRFHHIKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQDRQIFAQRKIDVEPVFGQIKACLGYKRCNLRGKRQVRIDMGLVLMANNLLKHSEMK (SEQ ID NO:722)
>orf00840
MHIHYNTNQTTLPLEISSFLPQDHLVFTIEKVVNTLEERHFYAFYHAFGRPSYHPKMLVSTLLFAYSQGIFSGRKIEKWKS (SEQ ID NO:723)
>orf00843
LRLWVIFVIMKVIKSYNTLNDYYRKLFGEKTFKVPIDAGFDCPNRDGTVAHGGCTFCTVSGSGDTIVAPDPPIREQFYKEIDFMHRKWPDVQKYLVYFQNFTNTHEKVEVIRERYEQAINEPGVVGINIGTRPDCLPDETIEYLAELSECMHVTFELGLQTTYEATSDLINRAHSYEL(SEQ ID NO:724)
>orf00845
VETVKRLRKYPKIEIVSHLINGLPGETHEMMVENVRRCVTDNDIQGIKLHLLHLMTNTRMQRDYHEGRLQLMSQDEYVRVICDQLEIIPKHIVIHRITGDAPRDMLLGPMWSLKKWEVLNSIEMEMRRRGSVQG (SEQ ID NO:725)
>orf00853
MKIKEQTRKLAAGCSKHCFEVVDKTDEVSHIHTVRRR (SEQ ID NO:726)
>orf00859
VQVCVFTNFCFFHCFSSLANCRLFNLRGICLPCISYQ (SEQ ID NO:727)
>orf00868
VFKKDRFSIRKIKGVVGSVFLGSLLMAPSVVDAATYHYVNKEIISQEAKDLIQTGKPDRNEVVYGLVYQKDQLPQTGTEASVLTAFGLLTVGSLLLIYKRKKIASVFLVGAMGLVVLPSAEAVDPVATLALASREGVVEMDGYRYVGYLSGDILKTLGLDTVLEETSAKPGEVTVVEVETPQSTTNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPAPVEEVGGEVESKSEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEETPKQEDTQPEVVETKDEAANQPVEEPKVETPAVEKQTEPTEEPKVEQVGEPVEPREDEKAPVSPEKQPEAPEEEKTAEETPKQEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDYSTASYNALGPVLETAKGVYASEPVKQPEVNSETKAEKVAANTDAKQSEVNSETASLKTAISGLNTDKVELENQLKIAQGKTETDFSMESWTVLSTAKNKAQEVKDNGTATQEQINEAEKSLKTALADLSVDKTALGSAIDTATKKNKENYTNQTWAELETVLTAAKSVNTNESKQSEVNEAVEKLTATIEKLVELSEKPRLTLSIEKRDIDRKVTVTYTLENPANTQIKSITATLKKGEEVVKDFVLTEENLKTNHLTALFEKLDYYKEYTLSTDMVYNRGNDDETESISEELIQLNLKKLELKDIQTVSLMKFENGQESQVTHLSDKPTDLSKLYLKVTSSTSKDAVLAVSSIEEEIVENKKIFKIHADTPELVVRKKDGSLSKGFDYYMERVIPHDGDIYYDFKDLISAMTSNPTGTFILGRDISSRNVKPDGNGKSYIKGEFKGKLLGTNDNVRHSIFDLEYPLFDTIKSGVVKDIDFKHVNMVFPDSNQGDNVATIARVIKDKTKIENVNVEGYLEGRDHVAGLVNNLEGNSEIENISFTGKIKSKGGNSITAGIAGRNILSRVKRAYVNANIEVLGSTNSSMLVAVNGTTLNASGGWGAWGRLTESVAKGTLEIKRSGQAGGVTATVWPYGAIDKVVSYAKVTKGKELFGSDGDLNNNWFMQKINNIFGVQGISSGDSGNDSKFKRISEEEAKQKVASYNITAPNLMSDSSLLVDRLNESWKNTDQFESIQDYQSQNQLIYQNLTKFTPYYNKEFIVHEGNALTPEQEILKTKKIKSIVGLKGTEFVVDGSDIDTIMLHFEDGSQKRYKVTSTGKFSITNLPEYQVEDLNVVYTSEHIVHPLDSSLINNLVEELKKVELYTESTYQVLGIDKDNANKLNRTKRLFLDESLDAVKTQLPTFVKTMFENEWLHINGESSGAVAALRQKIMDNKTAILLALTYINRYYDVKFSDYNIKKLMLFKPTFHGEKIDLLDRLIRLGSSGENRLKGSENAETFKQLFASETKQKDLVTYLDYNRSLLTNYQTTGEWFKETTKDYIQFEERPSLVEEIKDAKYRVYDNLTAPYYQGYILPLLTLKNTHLAILSNYSTMTFVSREKRPNWKNEDFDKWVKYVATAHRNHVDTWYKILPDNIKGKMVKENVTAVWEGLSIPGSEWVDQNAVDRKGRDYAPAREFFNLVGGPMGGWYAYHGYGAHAGGRNRVNYEVFDVLSEYGISVFTHELTHVNDTWIYLGGYGRRENMGPEAYAQGLFQSPVPGQPGWGALGLNMAFERKNDGDLIYNASPTQFENRKELDSYMKNYNDTLMMVDYLEGDAVISKGKEAITKWFKKVEPKVVSQTAQYDTVRQLTAEEKEKLSVSSVDDLVDQGLMSDRAVGNNTYNPADFETSYIAIDYMTGIYGGGKNSVGSPGALMFKHNTFRMWGYYGFEEGVLGYASNKFKQASRDEGHAGLSDNFIISKISKGEFLTMEAFKKGYFKKVVEELKTKGIRPVTINQKTYSTFEELQEGFKQAVERDLKKNQLDERETRNFKFQVFRQLLQQTDSFKTSIFR(SEQ ID NO:728)
>orf00883
VGNRIFIAFLQKLGLLDNLTGIREKLHPITGQGNTLGIADKDFNAHFIFQISHCIGETGLSDKELLGCLIHRASFDDFDNIM (SEQ ID NO:729)
>orf00892
MKTKKHRLLALALISSFTLLGAASAAVQYPDGGVWTYGEGSGGGWAFSNYYHGKKYHYSSLVSRWNSHSDKGEASAGKTSYAWIWTKWGEQVAFYCDYD (SEQ ID NO:730)
>orf00903
MSMIEVSHLSKSFGDKIALNDISFTVKEGQIFGFLGPSGSGKTTTINILTGQLLADKGQSIILGQKSQNLTSGELKRIGLVSDTSGFYEKMSLYNNLLFYSKFYNISKLRVDNLLKRVGLYDSCKMVAGKLSTGMRQRMLLARALINKPAVLFLDEPTSGLDPTTSRTIHELILELKTAGTTIFLTTHDMNEATLLCDYVALLNKGKLVEQGAPSELIQRYNKDKKIKVTDYNGNQITFDFTSLEQVSQADLENIFSIHSCEPTLEDIFITLTGGKLNA (SEQ ID NO:731)
>orf00911LISNKVDITLANFTVTDERKKQVDFALPYMKVSLGVVSPKTGLITDVKQLEGKTLIVTKGTTAETYFEKNHPEIKLQKYDQYSDSYQALLDGRGDAFSTDNTEVLAWALENKGFEVGITSLGDPDTIAAAVQKGNQELLDFINKDIEKLGKENFFHKAYEKTLHPTYGDAAKADDLVVEGGH (SEQ ID NO:732)
>orf00912
MKLFKPLLTVLALAFALIFITACSSGGNAGSSSGKTTAKARTIDEIKKSGELRIAVFGDKKPFGYVDNDGSYQGYATILN (SEQ ID NO:733)
>orf00946
MTGKKGFLFLNCHICMVTTTTCFLKERVESELLIFFTFC (SEQ ID NO:734)
>orf00948
MDTPDENGYVADDYRITYLEAHIKAMRDAIYQDGVDLLGYTTWSCIDPVSAGTGEMNKRYGFIYVDRDNVGNGTLKRSKKKSFYWYMSFIAMV (SEQ ID NO:735)
>orf00953
LSCQIAFCLIDRLDYPIMFSKVCQENHFQVFTPFSKKLKNFLKNA (SEQ ID NO:736)
>orf00966
MFLGMIGNISIILQFFGITIIVKIDNQARAIDFFKHDKSSF (SEQ ID NO:737)
>orf00968
MFSLNFFDDNVFLSIKIAHKGCFQLLDMTNPNFFNKFFLAQASDQLLHFLSWNIEL(SEQ ID NO:738)
>orf00978
MTEPDFWNDNIAAQKMSQELNELKNTYNTFHKMEELQDEVEILLDFLAEDESVHDELVAQLAELDKIMTSYEMTLLLSEPYDHNNAILEIHPGSGGTEAQDWGDMLLRMYTRYGNAKGFKVEVLDYQAGDEAGIKSVTLSFEGPNAYGLLKSEMGVHRLVRISPFDSAKRRHTSFTSVEVMPELDDTIEVEIREDDIKMDTFRSGGAGGQNVNKVSTGVRLTYIPTGIVVQSTVDRTQYGNRDRAMKMLQAKLYQMEQEKKAAEVDSLKGEKKEITWGSQIRSYVFTPYTMVKDHRTSFEVAQVDKVMDGDLDGFIDAYLKWRIS (SEQ ID NO:739)
>orf01011
MQVIKRNGEIAEFNPDKIYQAILKAAQTVYVLTDDLRQNLAQVTKKVVLDLQEAKVERATISMIQSMVEHRLLGAGYITIAEHYISYRLQRDLERSGYGDHIAVHLHFEQIR(SEQ ID NO:740)
>orf01015
MKIKEQTRKLAAGCSKHCFEVVDETDEVSSKHCFEVVDETDEVSNHTYGKAKLMRFEEIFEEY (SEQ ID NO:741)
>orf01068
MKVINQTLLEKVIIERSRSSHKGDYGRLLLLGGTYPYGGAIIMAALAAVKSGAGLVTVGTDRENIPALHSHLPEPMAFSLQDQQLLKEQLEKAEVVLLGPGLRDDASGENLVKQVFVNLSQNQILIVDGGALTILARTSLSFPSSQLILAPHQKEWEKLSGITIEKQKEDATASVLTSFPQGTILVEKGPATRIWEVGQSDYYQLQVGGPYQATGGMGDTLAGMIAGFVGQFRQASLYERVAVATHLHSAIAQELSQENYVVLPTEISRYLPKIMKIICQQERGSKDKLV(SEQ ID NO:742)
>orf01077
VLDSKEELKESENDAPKLETPLREEPRLAPQTLPEASEVLENKREESKVEI IEPAQADDIRKVVGELAKDISITKLYMTGHSLGCYLAQIAAVEAYQKYPDFYNHVLRKVTTFSAPKVITSRTVWDAKNGF (SEQ ID NO:743)
>orf01091
LSYSILICLCNSTINESLRAFYCWQKFITFNQVTGNARGKGTTCTSIGPDN(SEQ ID NO:744)
>orf01094
MGRKPRTRPEERTELERLQAENEYLRAENAILKKLRELRLKEEKEKEERQKLFKN(SEQ ID NO:745)
>orf01096
LSTCWNGKFCHICVALFHCFRAFKLALNEILCLLTNVSFIFVSVAF(SEQ ID NO:746)
>orf01109
VVLSTSAILVACGKTDKEADAPTTFSYVYAVDPASLGYSIATRTSRTDVIGNVIDGLMENDKYGNVAPSQKDYDLNSTGWAPSYQDPASYLNIMDPKSGSAMKHLGITKGKDKDVVAKPGLDKYKKLLEDAVSEITDLEKRYEKYAKAQAWSTDSSLLMPTASSGGFPVVSNVVPFSKPYSQVGIKGEPYIFKGMKLQKDIVTTKEYNEVFKKWQKEKLESNSKYQKELEKYIK(SEQ ID NO:747)
>orf01113
LNFDFFIFLAHFIPLFTFSILQENPKTSKKKLYIRLL (SEQ ID NO:748)
>orf01119
MGFSMKLIHDLNTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:749)
>orf01134
LIRIIRNIYRSGEGNTSVFQSFIDQINSNQFCYGSNFDRLRCILLIENFTSICLNSNRMFSGNGKILSNSSRSTP (SEQ ID NO:750)
>orf01137
MNATDIKNTYLKYIKENAVFNDVTDTHTEVITPFIDPLGEAIGFSIKSNGKHLTVTDDGYTIWNLSINNIDVTKKGRRQDIFNSLLHFNGFDLHDGAIERTTGKEHLGQVIHDMTQLLMNVYDFIQLTPNNIKSQFLDDVKSYFMKNEHYTVFPAFSIAGKSRLEHRFNFVFMSKGISKIARVHNNITKQQVDTILASWLDTSEYRRKEYGDTEQLYIIVSDEGYNNIKDDHQIALQEYGINILNFSDKEQLEIQLGK (SEQ ID NO:751)
>orf01138
MSKVVKVTGAEVVISHNEEYLKVNPSELNFVPKLGDEVEVHKVDGEIIVIKVKDKKDDKININIVNENNAMQNQSQVVHTQEIATGVHYVNKWVYVILALFLGGLGIHHFYAGYNGKGFLFLILSLTGIPAIIALFQGIIALFKKPDVYGRIAV (SEQ ID NO:752)
>orf01149
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSSKHGFEVVDETDEVSSKHGFEVVDETDEVSNRTTVRRR (SEQ ID NO:753)
>orf01156
LQNDKNHKLFDNYTCQKEKDVLQCKQVKRKEERSYDVGTRIYTIYYFLLF(SEQ ID NO:754)
>orf01158
VDRTDEVSSKHGFEVVDETDEVSNHTYGKVKLTWFEEIFEEY (SEQ ID NO:755)
>orf01159
LFFKDEKQALYTKPKTKSSSFRASKVSNQTIVATTRTDCQVIALNLCDKLENGVVVVVQTTHHIGIDDVIYSKIFQHLTHSIKMSLAFFIKKVQDRRRILYCHLVFFFLRVQDTKRIFLQATLAILRQGLLERCQIVNQGLAVGCTALRISKSVEVQFDTLNTDFLQKMGCHSDCFHIGSWIARAKTLNTNLVELAQAPCLWTLITEHRSHIVELAWLLHFWGEEFIFHIGTDNGRSSFWTEGNMTVTLVIKIVHFLGYDIRCISDRATDNLVMLKNRRAHFCIVIALENLTGKALNVLPLSRFSR (SEQ ID NO:756)
>orf01164
MEQIGKVFRQLRESRNISLRQATGGQFSPSMLSRFETGQSELSVEKFLFALENISASVEEILFLARGFQYDTDSELRKEITDVLEPKNIAPLEDLYRREYQKHAHSHNKQKHILNAIMIKSYMKSIDERVELTAEEGKVLHDYLFSTEIWGIYELNLFSVSSPFLSVSLFTRYVREMVRKSDFLMEMSGNRNLFHTILLNGFLASIECEEFTNAYYFKRVIEEHFYKENETYFRIVYLWAEGLLDSKQGRVKEGQKKMEDAVCIFEMLGCNKSAEYYRNTTEC (SEQ ID NO:757)
>orf01165
LIPYFLHFIIFFRKFIKNLPNCQNYEKIEDIYHVEGLL (SEQ ID NO:758)
>orf01167
MKIKGQTRKLAAGCSKHCFEVMDRTDEVSSKYCFEVVDRTDEVSNHTYGKATLT(SEQ ID NO:759)
>orf01170
VVPFSDTFKDRNQVDIFTIKISRCNSSTIGENSWDIHISNSNHRSRHVLVTATDSDEGIHVVTTHSRLDGVRDDVTRC (SEQ ID NO:760)
>orf01190MKKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEALPDDILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTGATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNLLVKSRFNEMFEEYPDSVFLDTYIKELRAGKSLAGEENNKNKVLKTGAVSYDYFNSSEVKNLPIDYIPLDEHKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPDRLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLALQNEFADFVALVDKSQLAIQKSLEELETLKKSLMQEYFG (SEQ ID NO:761)
>orf01199
MKIKEQTRKLAAGCSKHCFEVVDKTDEVSSKHGFEVVDETDEVSNHTYGKATLTRIEEIFEEYKSS (SEQ ID NO:762)
>orf01226
MRLSIQLIHDLNTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:763)
>orf01237
LVYAPFSFNILLDYITFDFKILLFSVFLAINRFHNDFIQFLL (SEQ ID NO:764)
>orf01240
LGTKIGISKNTIGNYEKRVKSTKKNTIFDLAKVFSSLIDALFPPVQKDSPSDIQSIYDQRAPPRQGKVLTYA (SEQ ID NO:765)
>orf01246
LFVFILIFLKSSIYIGIFWFIDFGKAVDFQGWKVLFEFFMVVIDQFSFGCNPGVVFILPGIALGQQSIDTRICDTVDNAESEQKLTIGMTGVVIDKACKLDCLALKFIWIVVDSLHDFHIVFISDLNTILG (SEQ ID NO:766)
>orf01247
MTVGFDLLHPDIQLNNCQDKGKHHGDIGQIGIVHVDVLV (SEQ ID NO:777)
>orf01249
MEGVAKGRIGRKKNNGIDNRCCHKKRNGRVTWNLFFQKTIDDGDDSTFTRREKYTDKGPKKDSPPTISREKMINLVRCDINFNQP (SEQ ID NO:778)
>orf01262
VVIGVASATTNIWIIFLSGFTAILAGAFSMAGGEYVSVSTPKDTEEAAVSREKLLLDQDRELAKKSLYAAYIQNGEFKTSAQLLTNKIFLKNPLKALVEEKYGIEYEEFTNPWHAAISSFVAFFLRSLPPMLSVTIFPSDYRIPATVLIVGVALLLTGYTSARLGKAPTKTAMIRNLAIGLLTMGVTFLLGQLFSI (SEQ ID NO:779)
>orf01277
MILMTKNINLTNEELELIQGGADPYGKEPNGYYPWKMEPVLTLLVHGFCPRDTDDLGYIGGGNHLCKGSAARF (SEQ ID NO:780)
>orf01282
LQVGQANEIGDPGAHFTQWNLLDDIGFDQLIQPNQKQYNDCHYCSFFHDFLF(SEQ ID NO:781)
>orf01301
LLHICIGETFDCIPYCMLTFFLSKSIGLTILLHKVKTVVFIDDQSNDKTCKICIHISFFRIKLSQQCQLSFSVYF (SEQ ID NO:782)
>orf01309
MTTIFLTRTSCSNCGKQSTFERFDRVYAAKTPEIISAILDWDFFKFTCHNCNHKVLIDYPTVVVDEEQKTIIQYCADGNVDVLSMQICSLISEGVNLSEYRIRVVSDIESFVEKVQIVSVGYDDRAIELMKYMNSPLEDGDIQFNYEHMVFTKVGHENYQFMFINNQIAVASLDFSQEQYEYYLADVEDLETNTYYIDSRWAESFFRTSLA (SEQ ID NO:783)
>orf01310
MVRRNSKITRQQKKIRDAFVSERTVEIIPAKREFTDVKTKKLRVAAYCRVSTFDESQSGSFELQKQTYTERINSNPDWIMAGIYADQGASGTSIKRREQFQQMLHDCRCGKIDLIIVKSVSRFARNQLDFISIYRELKALSPPVGIYIEDINLNTLDTNSEFILGIMAIVAQGESEQKSASITWSVIERFKRGIPMIPTHNLLGYTKDQYGRVVIDETEAKIVRLIYDSYIEGMTASEIASTLMTNHIPTVTGLERWTSLAVYNILRNEKYKGEIIMQKTYTVDCFSHKTRKNNGEKPKYRLKNGIPSIIPESRWDLVQELLKQPRRKSKSTSEIFVPKLYIKKLKSGKLRDFVVLDPSWKSEDIHEVFK (SEQ ID NO:784)
>orf01311
MTVNKNFVTFSKGIVQALEYPAHVLVAFNKDTKVMGIQVCRAKTRGAFSFSKPVGEQKGIVQVGHKTLKETLLTIMSEWKSDKRYRVEGIHIPEDKAFVFELKDFDELSDFRKNDNR(SEQ ID NO:785)
>orf01313
MEKYNNWKLKFYTIWAGQAVSLITSAILQMAIIFYLTEKTGSAMVLSMASLLGFLPYAVFGPAIGVLVDRHDRKKIMIGADLIIAAAGSVLTIVAFYMELPVWMVMIVLFIRSIGTAFHTPALNAVTPLLVPEEQLTKCAGYSQSLQSISYIVSPAVAALLYSVWELNAIIAIDVLGAVIASITVAIVRIPKLGDRVQSLDPNFIREMQEGMAVLRQNKGLFALLLVGTLYMFVYMPINALFPLISMDYFNGTPVHISITEISFASGMLIGGLLLGLFGNYQKRILLITASIFMMGISLTISGLLPQSGFFIFVVCCAIMGLSVPFYSGVQTALFQEKIKPEYLGRVFSLTGSIMSLAMPIGLILSALFADRIGVNHWFLLSGTLIICIAIVCPMINEIRKLDLK(SEQ ID NO:786)
>orf01315
MELILKAKDISVEFKGHDVLDINELEVYDYDRIGLVGANGAGKSTLFKVLLGELIPPGCKMNHLGELAYIPQLDEVTLQEEKDFALVGKLGVEQLNIQTMSGGEETRLKIAQALSAQVHGILADEPTSHLDREGIDFLIGQLKYFTGALLVISHDRYFLDEIVDKIWELKDGKITEYWGNYSDYLRQKEEERKRQAAEYEQFIAERARLERAAEEKRKQARKIEQKAKGSSKKKSTEGGGRLAHQKSIGSKEKKMHNAAKSLENRIAALGKVEAPEGIRRIRFRQSKALELHNPYPIVGAEINKVFGDKALFENASFQIPLGAKVALTGGNGTGKTTLIQMILNHEEGISISPKAKIGYFAQNGYKYNSNQNVMEFMQKDCDYNISEIRSVLASMGFKQNDIGKSLSVLSGGEIIKLLLAKMLMGRYNILIMDEPSNFLDI PSLEALEILMKEYTGTIVFITHDKRLLENVADVVYEIRDKKIKLKH (SEQ ID NO:787)
>orf01316
MNQLEFQRNHLQMDYYSESYQDFERDFYRYSNMNIPLTFLTDDILKTMATSRKNYFVLNKEKSRDNRDHFFIFEVRTLEENPLIYHYTYKKTTTYLAEK (SEQ ID NO:788)
>orf01317
MQKWMGFFLSEHTSALTDDANKVTYMSDLSLEKKLLLLSQVYAGQLNTRIHVVKKNNQVSYTGTIPSLTKDFILIKTTTGHINLKLKDIVSIELVEEVLYESA (SEQ ID NO:789)
>orf01318
MKKSINAQKKIDPANLPKTMVGHVLELFRKKYTSGAVRQIGVSYGGFVDENFTLLSLFDDVEQIEKENRLQTAIDVVREQFGFLAIQKGTVLTEGSRNIERSKLIGGHSAGGLEGLK(SEQ ID NO:790)
>orf01343
MNXXFISTKDKHTLIQVSAVRFRDGREIDAYDSYVHTSVPLKSFINEFDRGLQLRP(SEQ ID NO:791)
>orf01363
MIAMRSYITLTCNLNNNLFCLNSFFLTNLVWSQIFSLLSVFITVYI (SEQ ID NO:792)
>orf01364
MVFDANRIISEDSEGFVIPHGDHNHYIKVQTKGYEAALKNKIPSLQSNYQPGTFDEKAVLAKVDQLLADSRSIYKDRLS (SEQ ID NO:793)
>orf01390
MARLEPAKIAKIVLGILLYIIDLIKSSFVLPIPKAAKKSLILISFVPSFNDKNIVIRRPRQITKIMPRFICFLFRIFACIS (SEQ ID NO:794)
>orf01396
MASKRLSIEEQIEKKEESIKQLQNQKRQLKKKLNEQERKARNKRLIEKGAVFESIFEESIDLTKDEFYKLIKTLNDEEIRLNIMEILEERIDDNVEKSSKDEIT (SEQ ID NO:795)
>orf01397
MADSFHFSVNIISRGKGKSAVASAAYISGEKIKNEWDGVTHDYTRKEKILVKNIILPDHIPKEFNDRSTLWNKVEMAEKNSNAQLARQFIIGLPKELSLSENKNLVERYIKENLTSQGMIVDYAIHDESQDKNGNIHCHIMTIMRPINEKGEFLAKSKKEYILDEKGEKVLNKNGKPKTRKVELTTWNDTGNVEKWRENFSDLCNKYLERAGAEKRVDHRVLKDKIQIIYRQSI(SEQ ID NO:796)
>orf01398
MERKGIETDKGNYNREIRKYNQLVKTIKEEIKTLKGWIGNLLDNLSTAYEKFKDIERDKVIDNPKLFNLTNYLLTYSEIQKEKSKYLKGYAKTNKEKYDFKKLTSAYSYLRKNNIETIGQLQTKIETLKSNSYRLNKKAKTIHKEMEDVEKKILYYEIYKAKKEVYEEYQKKNIFTKEAFYNKHKKDIDQYKVVSGKLKKLLSDKEKLSPKKWNEEKILLMSNLEEINKEKDKIKDEYQEINHIKYSVDFVNKELGIDLSIEIDKLIKQGEKPSVIAQIKKFQDQVNKDNEYREMMKNKKMDQER (SEQ ID NO:797)
>orf01407
MELSAIYHRPESEYAYLYKDKKLHIRIRTKKGDIESINLHYGDPFIFMEEFYQDTKEMVKITSGTLFDHWQVEVSVDFARIQYLFELRDTEGQNILYGDKGCVENSLENLHAIGNGFKLPYLHEIDACKVPDWVSNTVWYQIFPERFANGNALLNPEGTLDWDSSVTPKSDDFFGGDLQGIIDHMDYLQDLGITGLYLCPIFESTSNHKYNTTDYFEIDRHFGDKETFRELVDQAHHRGMKVMLDAVFNHIGSQSLQWKNVVKNGEQSAYKDWFHIQQFPVTTEKLVNKRDLPYHVFGFEDYMPKLNTANPEVKNYLLKVATYWIEEFNIDAWRLDVANEIDHQFWKDFRKAVLAKNPDLYILGEVWHTSQHWLNGDEFHAVMNYPLSDSIKDYFLRGIKKTDQFIDEIN(SEQ ID NO:798)
>orf01408
MFNLLDSHDTERILWTANEDVQLVKSALAFFFLQKGTPCIYYGTELALTGGPDPDCRRCMPWERVSSDNDMLNFMKRLIKIRKYASVIISHGKYSLQEIKSDLVALEWKYEGRILKVIFNQSTEDYLLEKEAVALASNCQELENQLVISPDGFVIF (SEQ ID NO:799)
>orf01414
MSEQYRDIRKEVNLTADELKQIEKMMEVDNYRHFSPFVRDKILMTDDKQLAAKEWFSLWQSQKFEQISRDVHLVLIIARENHQVTQEHVSILLTCVQELIAEVNQVQSLSRGFREKYMR(SEQ ID NO:800)
>orf01415
MVYRYRTNLKKVFLTDSELHQLNERIAKSNCQNFSVYARKVLLNPNMSFVTINTDTYDQLVFELRRIGNNINQIARAINQSRLISQEQLQELSKGVGELIKEVDKEFQVEVKRLKEFHGSH (SEQ ID NO:801)
>orf01417
MVVTKHFATHGKKYRRRLIKYILNPDKTDNLKLVSDFGMSNYLDFPSYEEMVEMYNVNFTNNDKLYEYRNDRQEKHQQNIHAHHLIQSFSPEDNLTPEEINRIGYETIMELTGGRFRFIVATHTDKDHIHNHILINAIDCNSDKKLIWNYALERNLRMISDRISKMAGAKIIEKRFSYRDYQKYRATSHKFELKQRLYFLMQQSKSFDDFLEKAEQLHVHIDFSQKHSRFMMTDRAMTKPIRGRQLSKRDLYDEDFFRMHFTKQEIASRLEFLLNCVNSLEGLLTKSKELNLTIDLKQKNVIFILEENGKQFSLSHKKISDEKLYDVNFFQDYFKNKEVGVSEGIENLQAQYRAFQEERDKEKVSTEEIEEAFETFKEKRDAVHEFEVKLTEHQIEKLVDEGIYIKVSFGINQSGLIFIPNYQLDIMEEENQKKYKVYIRETTSYFVYNKEHSDKNQYIKGRTLIRQLTNDSRVIPYRRPTVERLQEKISEISLLIELTETDKKYQDIKDNLVSEIAELDIKLTQTNEKIATLNKMAEVLINSKSEGSGSQKLARHEFSKLNMTESTTLEQVNEELLKLQQEFGNVLDEYEKTIRKLGQLFKVFDECINKEIMNEI (SEQ ID NO:802)
>orf01419
MVCLIIDVSPYSTLCDIVVPKTHFLRQLMELCDFSFIYDELEKNYQPDFGCRSYSLLIMMFKYLLLKDIYKLSDVDVVERSFSGMTFKYFLGLAPVIEPSSLTKFRKLRNKDERLLDLLIAKSVQIAIELGLIKSNILIVDATHTKVHYNHKKPQEVLRERSKALRKTIYQYSEYIKAEFPSKPQEDTLVAELRYTQEVISVLEKHDELTGIPAISQNSITLKKL (SEQ ID NO:803)
>orf01420
LESSVKEEARIGHKSADSSFYGYKEHFAMTDERIITACVVTSGEKSDGPVLEELYHKSKDNGVTIEAIVGDRAYSGKDNMQFTKKERVH (SEQ ID NO:804)
>orf01421
MSVFKFRIFGFYLVAMFGLFFKIGRFLKPLLENMFIALKGYQISLRLSPFFITAHF(SEQ ID NO:805)
>orf01425
MQEHYTPKGKHLTIDNRRLIERWKNENKSNREIAGLLGKAPQTIHNEVKRGTTLQQVRKGLYKKVYSADYAQTVYQFNRKRSVKKLILTKEIREKILHYHKQKFSPEMMVNKKQVKVGISTIYYWFHNGHLGLTKADMLYPRKRKGVKKQASPNFKPAGKSIEERPDVINLRLENGHYEIDTVLLTKIKNYCLLVLTDRRSRHQIIRLIPNKTAESVNQALTLLLGEHRILSITADNGSEFKRLSEVFPEEHIYYAHAYSSWERGSNENHNRLIRRWLPKGTKKTTPKEVAFIENWINNYPKKCLDYKSPSEFLLGG (SEQ ID NO:806)
>orf01426
MLHPIFIIRRSWDGIFHLSEWKRNEEFDFFNKMDVPYLSMSSRIEVTQAINFHKKHSISLYAIISWCVMSAINSIPELLMDTDGKIVWQYNQRGCSFTTLTSEDKLNFSSFTMGDNLIEFVSAFNINKQKAEEGQKPNIDKNNIAYLSCVPWIDFLHVSTPMNLSKIDTVPRITWGKVIQENQRYFCTVNLQINHGMGDGLHVSNFFVLLQRFVNKINEYFQKK(SEQ ID NO:807)
>orf01428
MSIFIGGAWPYANGSLHIGHAAALLPGDILARYYRQKGEEVLYVSGSDCNGTPISIRAKKENKSVKEIADFYHKEFKETFEKLGFTYDLYSRTDSPLHHEIVQELFLQLYEKKFLYTKKIKQLYCTFDNQFLPDRFVEGKCPNCGTHSRGDQCDNCSAILDPIDLVDKRCSICSNEPEVRETEHFYYVFSEFQNLLETYLNDAEETVRWRKNAINLTKRYLREGLPDRAVTRDLPNGIPVPIDGFRDKKIYVWFEAVAGYYTASVDWAQKLQNNITDFWNNRTKSYYVHGKDNIPFHTIIWPAILSGLEIEPLPEYIISSEYLTLENKKISTSNNWAIWLNDIIKKYDADSIRYFLTINAPEMKDANFSWREFIYSHNSELLGSYGNFINRTLKFIEKYFESEIPTKYLEGEILYNLKELYTTVGNLVESGHMKQALEEIFEYIRSANKFYDDMKPWALRESDIEKCKEVLATCVIIILNLGQMLNPFIPFSGKKIEDMFKTKLNTWNYISNLPNKLSDVSMLFDRIDLKKIDEEVLELQQTSSR(SEQ ID NO:808)
>orf01429
LNNLTLLKEYNFRDLGNHLTQTGQKIKPKTLFRSSKLFGISKIDVDLLQSYGITKVIDFRSANEIKKAPDPDIKNIKNIVIPIFYNDDSELTEFPIEFFNKSDAGFQHMIKTYDQMINQKQSKLGYKKFFKLLLSHPKDESLLFHCSMGKDRTGIASLFLLYILGVDMNDIFHDYLLSNKYLINVRKENIEYVNNHSGNVILMHNLLSLSSAKEEYINRVLNVLDKEYGGILRYINTELGISSQEIEELKDRYLF (SEQ ID NO:809)
>orf01431
MDFLNEVLDLKEFIQDPVRTLSLGQRMRADIAASLLHNPKVLFLDEPTIGLDVSVKDNIRRAITQINQEEETTILLTTHDLGDIEQLCDRIFMIDKGREIFDGTVNQLKKTFGKMKTLSFELHPGQDYIVSHFEGLSDIYVTRQELSLDIQYDSSQYQTADIIQQTLSDFTIRDLKMTDANIEDIIRRFYRKEL (SEQ ID NO:810)
>orf01432
MTKLWKRYKPFVSAGIQELITYRVNFFLYRIGDVMGAFVAFYLWKAVFDSSHQSLIQGFTLSDMTLYIIMSFVTNLLTKSDSSFMIGWEVKDGSIIMRLLRPVHFAMSYLFTEIGSRWLVFVSVGLPFVILIAGLKLLSGESFLQIVLITTVYLLSLILAFLINFFSIFALVFQLLCLKTYGDQIF (SEQ ID NO:811)
>orf01433
MKKYQRMHLIFIRQYLKQIMEYKADFLVGVVGVFLTQGLNMLFLNILFQHIPLLDGWSFHQVAFIYGFSLIPKGIDHLFFDNLWALGQHLIRKGEFDKYLTRPISPLFHILVETFQIDALGELLVGVLLLLMTITSLTWTWAKVFLFLISIPFATLIYTSLKIVTASIAFWTKQSGAIIYIFYMFNDFAKYPIAIYHSFLRWLISFIIPFAFTAYYPASYFLKDKDGLFNIGGLILISLIFFTLSLKLWNKGLDAYESAGS (SEQ ID NO:812)
>orf01434
MIELAEPLPEYEILLSIPGIAETTATSIIGELETFVAFSLPTKSMPLSVLTSDTMNLAIS(SEQ ID NO:813)
>orf01435
MLGWKDGHEVPILFPCRSREKVLYFWKGNLKHLVQAILSPNDVFCQILIESTEVTQILIDFFLNISWFAVKDEL (SEQ ID NO:814)
>orf01437
MMRTVFRMDVSKASSEVAILVNGEKVHGYTMPNDAIGFSRLLEDLK(SEQ ID NO:815)
>orf01438
MKKNNVEIIKADSLVRRRGDNVERHLKRVAAYCRVSSDSEDQKNSYDSQVRHYKEYISQRSDWELADIYADEGISGTQVGKRQDFQRLINDCANGEIDYIVTKAIARFARNTLDTLKYVRMLKDMQIGVYFEEENIDTLTMDGELLLTILSSVAQQEVENTSAHVKKGLKMKMQRGELVGFQGCLGYDYDVETKQISINKKEAKIVRYIFERYLEGIGGKVIARELDELGYKSPRGLEHWNDTTVLGIIKNEKYKGDILMGKTFTVDPISKRRLSNFGEEDKYYIKDNHEPIISKEDFEKAQEIRLRRAGNKKTAANVNGKRERYSKMYAFSSMLECGFCGSILSRRSWHCRSDYRKVVWHCVTSIKKGKKFCKHSKGLEEIAIEGAFLEAYRQVYHSNENLMTDLLETIESELNDNSLNKELKRITNKLRILLKKEENLVNLRLEGKVSDSIYNEKYNEISSEKEFLAEEKVNIETTLKSEIDVKKRLTEFKHLLSSQKMLTEFDRAVFESIVEKIIVGGVNSNGEIDPAMLTIIFKTGEIQNKDGKQFKSKRKNAKLETDKLCPQNSDEDKKLYSQGTDNTRGVCSVAGSILASQ(SEQ ID NO:816)
>orf01439
MGGNPPMKKYSIVDKIVLSTKIKRIIIFTVFRENWEPYMKKYTEVFQSQFPNLNIDYLLLDTEQIDLDSYLDADIIIIGGGNTEKYIATYVNQEFKSYIDHMLNKEAKIIGFSAGALLLGEKVYVSPNDNSDHQIKIKNGLGLFSQFLISVHYDSWNDKANKDRAEELVNVPIIPLNDHSCLVLDKLGNIIEKID (SEQ ID NO:817)
>orf01440
MDDEASKQLSDSRFKILVGVQRTTFEEMLAVLKTAYQRKRAKGGRKTKLSLDDLLMVTIQYMRE (SEQ ID NO:818)
>orf01457
MGFSMKLIHDLDTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:819)
>orf01492
LILNEYEKRIFHEKTHNIECFDTCYYAFFIIFAPFLAFVIDKHCSSSLFLER(SEQ ID NO:820)
>orf01520
MSQVKGLCVLDVDGTLILEEVIDLLGREAGHEAEISQITSRAMRGELVFESSLRKRVSLLEGLPILVFDNVFNSIHLSLNVPEFISILQKNGILVGLVPGGFTPIVGEISKIPWYCLFHCQPA(SEQ ID NO:821)
>orf01521
MLKSAELGIAFCSKEMLKKEIPHHVDKRDFLEVLPLIDCLE (SEQ ID NO:822)
>orf01527
MGFSMKLIHDLNTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:823)
>orf01537
MDKLIIFIEKGKPFFEKLSRNIYLRAIKDGFISSMPAVLFSSIFILIAAVPNIFGFKWSDEQLAFILKPYNYSMGILALLVAGTTAKSLTDSVNTRSMEKTNQINYMSTFLAAVVGLLILAADPIEGGFANGLLGTRGLLTAFLAAFITVNIYKVCIKNNVTIRLPEEVPPNIAQVFKDVIPFALSVLSIYGLDLIVRNIFGTNVAESVGKILAPLFSATDGYIGLAIVFGAYAFFWFVGIHGPSVVEPLIVAISYANIEANVQLVQAGMHADKILNPVTQTFVVTMGGTGATLVVPFMFMWLCKSKRNRIVGRASVVPTFFGVNEPILFGAPIVLNPIFFIPFVTAPIINVWIMKFFVDVLQMNSFSIILPWTTPAPIGIVMGTALAPLSFVLAITLIIIDTLIYYPFVKVYDHQILEEERKGNSSSELKEKVAANFNTVKADAILEKAGVDAAQNTITEETNVLVLCAGGGTSGLLANALNKAAAEYNVPVKAAAGGYGAHREMLPEFDLVILAPQVASNFEDMKAETDKLGIKLAKTEGAQYIKLTRDGKGALAFVQEQFD (SEQ ID NO:824)
>orf01552
LFKTRSNSSALGSSYISNRNIFSYFTNQFNNTFCNVFGM (SEQ ID NO:825)
>orf01557
MALTQRQFVELFQETINVITLTCLTVSVAVVACVSICSS (SEQ ID NO:826)
>orf01558
MLTMFLFLPIDFFFCTDIIRMSCILKVNIVFSIYLNHITTLDFTDNILVL(SEQ ID NO:827)
>orf01560
LVCYFDDDLFGIDSFTLANLIRSQILRFLRRLFSIYIGNTIISLTVLA(SEQ ID NO:828)
>orf01570
MGFSMKLIHDLNTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:829)
>orf01585
MEKYFGEKQERFSFRKLSVGLVSATISSLFFMSVLASSSVDAQETAGVHYKYVADSELSSEEKKQLVYDIPTYVENDDETYYLVYKLNSQNQLAELPNTGSKNERQALVAGASLAALGILIFAVSKKKVKNKTVLHLVLVAGIGNGVLVSVHALENHLLLNYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSDFEVSNQEKSAATPTKQQKVDYNVTPNFVDHPSTVQAIQEQTPVSSTKPTEVQVVEKPFSTKLINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGVNTLNPQDEVLSGQLNKPELLYREETIETKIDFQEETQENPDLAEGTVRVKQEGKLGKKVEIVRIFSVNKEEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQPETGVEHKDVQSGAIVEPAIQPELPEAVVSDKGVPEVQPALSEAVVTDKGEPAVQPELSEAVVTDKGEPAVQPELSEAVVTDKGEPAVQPELPEAVVSDKGEPAVQPELPEAVVTDKGETEVQPESPDTVVSDKGEPKQVAPLPEYTGPQASAIVEPEQVAPLPEYTGVQAGSIVEPEKVEAPKEYTGKIEQPSAEDTKPENEASSTNGESERPKDKIKEEKQVDKKLELRNVSNVELYTVENNKYRHITAVDGALDSSLKYFMKVKSENFKDIMLPVTKIESTTKNNKEVYKIVAHAENLIQHENNVISNDYTYYLPKTQQSETGVYTSFKNLVDAMNSDPNGTFHLGATMDAREVELPDDQESYVKNEFYGKLIGENNGKYYAIYNLKKPLFKTLNTATIQNLSIKEANVSSKEDAATISKEAKYNTLIDNVHSDGIIAGERGIGGLVSKVDNSRISNSSFTGRITNTYDTTAGYEIGGLVGKLSGSLASIEKSIASIDIASNAKSGDQIVGGIAGVVEKSATIKYSYVEGNVNNVRHFGKVGGVAGNLWDRDSQDVSKSGKLSYVLSDVNVTNGNAIAGYNFNGIKTIETYSNKNNKVVNVVQEDDEVVTKDSDVQRGTVLDADKVKEKKVELVSKHSTKVEDFDFTSRYNTNYNEVTGYQQSREQVYKNIEKLLPFYNRETIVKYGNLVEDNSDLFTKKLLSVVPMKNNEVITDINKNKQEINKLLLHFEGNKSRVLNIAYKNDFSKVAEYDIANTKLMYTPNTMLHDYNNIVKTILNDLKSVQYSSADVRKVLDISGNIKLTELYLGEQFEKTKANIEDSLSKLLTADAAIVENNNKVIDNYVIEKIKNNKEALLLGLTYLERWYNFNYGETNAKDLIMYHLDFFGKSNSSALDNVIELGKSGFNNLLAKNNVITYNVLLAKNYGTESLFKALEGYRKVFLPTISNNEWFKKQTKAYIVEEKSTIEEGREKQGKEGTKYSIGVYDRLTNPSWKYQSMVLPLLTLPEEKTVFMIANISTIGFGAYDRYRSSEYPKGEKLNKFVEDNAKEAAKRFRDHYDYWYKILDNDNKEKLYRSILVYDAFKFGTDKDKDKVTHQATFETDHPAIKYFFGPAGNNVVHNGHGAYATGDAFYYMAYRMLDKDGAVTYTHEMTHNSDREIYLGGYGRRSGLGPEFYAKGLLQAPDHPYDPTITINSVLKYEDSENSTRLQVADPTQRFNSAEDLHNYMHNMFDVIYMLEYLEGKAVANLETNQKYELLRKIENKFDLDQDGNNVYATNVVRRLTMDEVNKLNSFDSLIENDIITSRGYKDQEYKRNGYYTIDLFSPIYSALSGEKGTPGDLMGRRIAFELLAAKGYKEGMVPYISNQYEKDAKAAGSKINSYGKEVGLVTDELVLEKVFNGQYKTWTQFKKDMYKEREKQFSKLNRVNFINPNNPLSRQRNVSVTDIGVLERMIVEAVRDDAQDDVAKFYPETNSRVLKLKKAIYKAYLDQTNDFRSSIFENKK(SEQ ID NO:830)
>orf01588
LSLLKKDKFSIRKIKGIVGSVFLGSLLFAPSVVGASTYHYLDYSSLTQTERDQLKQGRPDESKESYALDYEKDALPNTGSSQSIMTALGLLAIGSLIVIITKDNRNKKIATFLIVGATGLVTLSTASALNLNANIHESGRDGVLQISGYRYVGYLELDDKTVSSVSPASTVSPVEQPKVVTEKGEPEVQPALPEAVVTDKGEPEVQPTLPEAVVTDKGEPEVHEKPDYTQPIGANLVEPEVHEKLAYTESVGTTGMDENGNLIEPPVSDIPEYTESVGTTGVDENGNLIEPPVSDIPEYTESVGTTGVDENGNLIEPPVNDIPEYTEPISTVSEVASEREELPSLHTDIRTETIPKTTIEESDPSKFIGDDSVKEVGEDGERQIVTSYEELHGKKISEPVETVTILKEMKPKILVKGTKENPKEKTVPVLTLTKVTEDAMNRSANLNYELDNKDNAEISSIIAEIKDGDTVVKKVDLSKEKLTDAVQNLDLFKDYKIATTMIYDRGQGSETSKLDEKTLRLELKKVEIKNISSTNLVKVNDDGTEIPSDFMSEKPSDEDVKKMYLKITSRDNKVTRLAVDKIELVTEKEKELYKITASAQDLIQHVDPSKTRNEYIHYIEKPVPKVNNVYYNFNELVRDMQEHPNDEFKLGADLNATNVSAFGKSYVTKDFKGKLLSDGDNHYTIHNLSRPLFGNVIGGTIKNINLGNVDINMPWANQVAAVANIIKGGTTIENVKVKGNIVGKDWVSGFIDKIDNQGTLRNVAFIGNVTSVGDGGQFLTGIVGENWKGLVERAYVNANLIGKKAKAAGIAYWTQNEGNNNTVRQEGAIKKSIAKGTIQVTEAIESGGVVGSMKHHGSVEDSVSMMKVPNGEIFYGSSDIDYDDGYWTGDNVRRNYVVIGVSDGHSSYQRSKDKNRIRPISEEEAKSKIEATGITADKYEINEPVVNRLNRLTRREDEYKSTQDYKVDRDLAYRNIEKLQPFYNKEWIVNQGNKLAEDSNLAKKEVLSVTGMKDGQFVTDLSDIDHVMIHYADKTKEIKAVHQKESKVAQVREYSIDGLDDIVYTPNMVDKNRDQLIKDIKDRLATVELISPEVRALMDKRDTSRDPNANSDERKNGYIRDLYFEESFSETKANLDKLVKSLIENADHQLNSDEAAMKALVKKVDENKAKIVMALTYLNRYYDIKYGDMTIKNLMMFKPDFYGKSVDLLDFLIRIGSSERNIKGDRTLDAYRDMIGGTIGKSELHGFLDYNMRLFTNDTDLNDWFIHAAKNVYIVEPKTTNPDFVNKRHRAFDGLNNGVHNRMILPLLTLKNAHMFLISTYNTMAYSSFEKYGKYTEAEREAFKDKIKEVAHAQQTYLDFWSRLALPSVRDQLLKSQNRVPTPVWDNQNYHNVEGVNRMGYDKNNKPIAPIRELYGPTWRYHTTNWYMGAMASIFQDPNNNDQVYFMGTNMISPFGISAFTHETTHVNDRMLYFGGHRHRQGTDVEAYAQGMLQTPDKSGNGEYGALGLNMAYHRENDGDQWYNYDPDKLKTREDIDRYMRNYNDALMMLDHLEADAVIPKLHGNISRWFKKMDRQYRKNGELHQFDKVRELTEDEKKKIVINNIDDLVNNNLMTKHGAPSDRTYNPEDFDSAYVNINMMTGIYGGNTSQGAPGAASFKHNTFRMWGYFGYENGFISYASSKYQGEADKTNKKLLGDDFIIKKVSKDKFNNLEEWKKQYFKDVKSKAEKGFTAIEIDGRQITNYAQLKTLFAEAVQKDIDGMSDPKIKDHFKNTVDLKSKVFKALLKNTDGFFNKLFKEDI (SEQ ID NO:831)
>orf01603
VLGGRANSVTSCTTNSHWNLTFTTKHVTCFSSLVDDIVHGNNREVHEGHIDDWTKSCHGCSCCCSRDGSFRNRTVTDTFWTKFFKHSNRSTEVSSEDTDVFSHQEHIFIATHFLRHSKDNGVTEGHCFCFHFISFSLVCVNIFKG (SEQ ID NO:832)
>orf01604
MDMFYIGHFLDIRRDTVTVVNAIENDWQVPDRSHVHCFVENTFIGRTISKEADNDFTGILHLLTEGCTDSDPHTTTYDTIGTKVPSIKVSDMHRSTFPFTGSSVFTKDFSHHSVEVNPFSNSLPVSTVV (SEQ ID NO:833)
>orf01606
MNXXDFIGHCDKIKRNIFEKSHKVFSGLFGLHPKDFLNLIFSNQIPLPFSECNPLTNYNHLFSLIISDKRDIVIHWI (SEQ ID NO:834)
>orf01622
LIEIQVFSSLQVCCLNLCHLKFQHFDTCLVFLLVFLDFQNLLAHFPIGIKTRLIGFFQVPKSGITKFIQHLDMQLGTH (SEQ ID NO:835)
>orf01623
MVMLTMNIYKMLPNSSQNRQINHLTIYTADTTTILQDFPTDDNFIT(SEQ ID NO:836)
>orf01624
MTNNICRRTSSQHHIHGINDNRLPCTRFTSQDSHPLFKIEGNSLNNGKVFYRNFK(SEQ ID NO:837)
>orf01634
LFVIRNPSSQTLFQTQLQLVQALQITVIQALRLSKDNRLTAFFQSLLFLR(SEQ ID NO:838)
>orf01636
MPHTRDNWQTRFKNSSYHNFFVKGPEILNRTTSTTNNEQIQIVPLISTRNISSNFLRSPFTLNLGRIKKDVNTWESPADGRDNISNNGSTTAGYYPNSLRKLGQSLLEAFLKQAFFCQFFLKLFKLNRKRPNPIRLNFFNDDGVATTWFIDLYTPNHIDLHSFFQVKP(SEQ ID NO:839)
>orf01637
VTLADFVADRRAATPTAAAELATPVTKVGCISSFAKSGKTDGNGSPKCSI(SEQ ID NO:840)
>orf01640
LAIIRNRTCSLKLINDHLTFWTLRFLTSTRILIELATINLNCRIHRGNLSNRPSQASNRFINKLFIQGRQNRGFCDHFPTSILSRRGIAQSDFPLIDLTLVLHKLDHACRLANRNRQNTHHIRIQGSTMTNFLGSQNLTQFKNRIMRGHSCFLF (SEQ ID NO:841)
>orf01642
MEDDLNYENLMDDVTEAIKKFNLVIFIGAGVSIAQGYPNWNNYIEHLIKYWQGQVLSVSGEKRLGREHHVVFDLISKSSISNKRKVDLVNYELKKVFGEDFEKRRLDFEKGYFKNLLPYSIVNQTVESLASLNAIFITSNYDYEIENHIKRLKNAVVTINDLNEFTKNKNGKLQFGDVLHIHGTPDCDVKYFVSSSADYSKTYLKNRENFENLVTWFKETKPTVLFIGAGLEEDEILSLLCKDSKNYALMKSENTGNQRVDEHYRGVVEGFFSSENHTQIIWYGDEFEKLPLFVKKLVADINEKLGTHDFYNQWNNLLNPSINQEEYNKNLDSISNDFKYLSSVLDKVIENDNNQLDQLMLNALLRSETLTVIKKNFVLVFWKFIVKNIEKLSDNEWDVIYKIIYEGSQNYFIDDVFFVYNYAIDNKISSFTNNNKLNELREIISKDGYIVNSNFNKDKTLLGYWLVSAFEQQNRDLYIKEDSEVEVNLNYECVNKLMSILNNPEFLSYNYYSIEHQLKEYDVVKFLYELVKSKKLFIEEEKFLESDSEDLISTILIQKLLVQLDNEINLDLEFIKRLIDKIDFSNIHFGEELNTFIKEHRSIIREKNIEIPKKPYRNWISSLEGGFVSQFSYLTQENLVEYDESRVLEILVNAEKEQRGSSFLEEKTINETENFFITVLKESNEISKKVSDLLKNHIDDLYPKYKRLYVKIISFPEIEENLRKIVREKYLKRFNKESFDSNDRKFFEYHIKQQNTDIDIFEKLLSINVNELSTPKGDNKQLDILHFINSEMGSYFQCLISLFINHSSYRDVIIQIINSVTDTDYREFAQGILLNEYNPNRINVTYNTFLGFAYYHSTITIEAADVFTDVVRDILNKKIEDNQILNKVYLVALERVDPTIESFSLSKNNYSQMINIIFTGDYEFRYSKEWLGALFKFDSSANYLVTIFYLLYNENLKKNRFALFIEELSDYLTTYNQKLSLRGMNYKLNHEELNNFDLLKKMFLKLMETDKIENDIFYLDGIKSILPLLSLDDRRNVLQHIQKQNNCPPPEIEELQRIIVN(SEQ ID NO:842)
>orf01645
MFMSNLCQFFQVWNINQGVTQGFNQDKLGIVFDSCFYFLQIINIDKGCCDTITRKEFFQKIEGSTVNSRSSHYMVTSMGKRQNRISHCSHT (SEQ ID NO:843)
>orf01646
LINVFSHGVDIAIHSATKFIGGHGTTIGGIIVDSGRFDWMASGKFPQFVDEGSSCHNLSYTRDVGAVAFIIAVRVQLLRDTGAALSPFNAFLLLQRLETLSLRVERHVQNAETIVDFLVNHPKVEKVNYPKLADSPYYALAEKYLPKSVGSIFTFHVKGGEEEARKVIDNLEIFSDLANAADAKSLVVHPATTTHGQLSEKDLEAAGVTPN (SEQ ID NO:844)
>orf01647
MTCDFKFETLQLHAGQVVAPATKSRAVPIYQTTFFVFDDT (SEQ ID NO:845)
>orf01651
MAWLLVGNVGVRQVLEHLNAELKKVMQLSGTQNIENVKPFNSVTSIKPTLPNDPPDLKFIDKKNAPPKCGVFLCYGKKFVLKNKKNEIRIGRIVQDSQIDF(SEQ ID NO:846)
>orf01656
LKKKWFFADYYDTTIILLALISVILVLLGFAEMIDLDNPPYSIIDLVIWGVFVIDYSWRFFITKRKWRFILENVFDLLAILPLNAIFTVFRLGRIFRLARLTKLLKLTRLLRIIGLTGKLERKISRFLRTNGLIYILYVNIFIVLVGSSILSVVEEKSFSDSLWWALVTVTTVGYGDIVPVSLFGKTNYRAKEY (SEQ ID NO:847)
>orf01666
VVDFKQTRQDPHDITIYSWLRQVKSNTGNGSCCVRSNPFQAGNSFIGIWKLATKVSHNLLGCSLHIANSRIITQALPSFQ (SEQ ID NO:848)
>orf01671
MAERTVVQVHNAFPEDTTLINSQLIPLVQVVVNQGRKGIVGSCNSMHISSKVEVDVFHWQNLCIPTTSSTTLDPHDWTKRRFADSNHGFLANLVQGIRKTNGKRRLSFTCRCWVDGSNQDQFTDWIALNCTNFIKAEFSLVLSVQLQIVVRNTKFLYNINNWLQLNTLCDFNICFHSKFL(SEQ ID NO:849)
>orf01684
MLFIIGHLNFPTAGSFIDSTLHRLGNRVCIHDDMAFTVTSSTSNSLDESTFVAKETFLVSIENSYEAHFRNVNSFTEQVNSDQDIKDTQAQVTDNLRPFQGLDIRVHVLDLDTHFLEVVGQILCHFLGQSCDKGTLIFFNAGIDFTQEVINLSHSRTDFHLWIQESRWTNDLLNHCLGLFIFIVTRCR (SEQ ID NO:850)
>orf01685
MNVTLKLLPTERTIVQSRRQTETIINQHFFTRTVSIVHALDLPYGHMTLVNHNQEIIWEEVEKRIRRLSFAPSIHVARIIFNPIGIAHLTQHFDIILCPLFQTLGFKQFTFLFKDS(SEQ ID NO:851)
>orf01686
MIHFSQHLTCQSLNFTNTVNFVSKKFYSKGMFISGSWENLYHIPTNAKSSALEINIITFKLNIDQVIQEFITRNL (SEQ ID NO:852)
>orf01687
VAKLVNLVIDRTILLNIGIARRDIGLWLVIIIVGYEILNCIFREKFLKLPIELTSQSFIVGNNQSWFIDFRNDLTHSIGLPCSSRPHQNLSFFSPLNVIHQLLDSLGLIS(SEQ ID NO:853)
>orf01705
MNITQTDFLAVNLVFAISTTIDMAFHPDFLTCILDKSVMIIQSHNYRSIIERFATFCSSKDDIRHLAPTETLDTRLPQSPTQTFCNIGLSRSIGSNNCRHTLVKNDLGLISKRLEPLNFDFL (SEQ ID NO:854)
>orf01707
MSFIVCNHLKFACFNLRNHDLIDKFLDLGHILIQKKGTKKGFKGITKNGVTIATTRFFFPFTQLDKLVKLAITRKTS (SEQ ID NO:855)
>orf01719
LFTCFSKLDNKTASTTYI SHKFFTAIPVCFEFFKGFWFPRKDTTKKNIFIPMFLVECFNFWVELR (SEQ ID NO:856)
>orf01726
MTKKIVALAGDGIGPEIMEAGLEVLEALAKKTGFVYEIDRRPFGGAGIDAAGHPLPDETLKACREADAILLAAIGSPQYDGAVVRPEQGLLALRKELNLYANIRPVKIFESLKHLSPLKSERIAGVDFVIVRELTGGIYFGDHILEERKARDINDYSYEEVERIIRKAFEIARNRRKIVTSIDKQNVLATSKLWRKVAEEVAQDFPDVTLEHQLVDSAAMLMITNPAKFDVIVTENLFGDILSDESSVLSGTLGVMPSASHSENGPSLYEPIHGSVPDIAGQGIANPISMILSVSMMLRDSFGGYEDAERIKRAVETSLAAGILTRDIGGQASTKEMTEAIIARL(SEQ ID NO:857)
>orf01727
VVRNTASHLTCILFLNKGISVYIGNSRSLKHIKIKPCCIKDGFCISVFNSDQNPILGIDSICYRIDSVGHQTNRLVKELIDSIKDCFNGTLPCRIKFDFLTIHIG(SEQ ID NO:858)
>orf01729
VEAFWIFNHGSSYQSSNICICDFLLLIGQCLELSKEWFDILFCKI(SEQ ID NO:859)
>orf01732
LANIESHCNFFQSSISSSLPNTIDSPFNTSCTILDSSKAICHCHSEVIMTVRRIDDLTIRLDILNQVFEDGTIFL (SEQ ID NO:860)
>orf01741
MQEHYTPKGKHLTIDNRRLIERWKNENKSNREIAGLLGKAPQTIHNEVKRGTTLQQVRKGLYKKVYSADYAQTVYQFNRKRSVKKLILTKEIREKILHYHKQKFSPEMMVNKKQVKVGISTIYYWFHNGHLGLTKADMLYPRKRKGVKKQASPNFKPAGKSIEERPDVINLRLENGHYEIDTVLLTKIKNYCLLVLTDRRSRHQIIRLIPNKTAESVNQALTLLLGEHRILSITADNGSEFKRLSEVFPEEHIYYAHAYSSWERGSNENHNRLIRRWLPKGTKKTTPKEVAFIENWINNYPKKCLDYKSPSEFLLGG (SEQ ID NO:861)
>orf01752
VHAHTDKLCNGCNRIFNSIISHHTIFRERNKLSHKAIKSTRQEMGPCHVVFIEFFITLHRRLIGNHDNFLTNLVGSGRVRNDGST (SEQ ID NO:862)
>orf01753
VNHCHWKLFIQNLGITFSLIVTLIRMTDSHVVGTDKDMIFLVNSLFLIFDIDKLRLS(SEQ ID NO:863)
>orf01755
VGNNDILWSKRTISINGFNDFLNTCIAVSTTLCNDDTFLIKRKIFIYKIFCMRNPVSMNTNYNFFNTWLQDKFFNCMNQNRSIT (SEQ ID NO:864)
>orf01765
LVAPVASSTRFFKNNDSLTSWNNGFIIITINTIISYQRISKGQDLSIIRLVCNGFLVAGHPCIKDDFACYINICSEGLAFKNCAIF (SEQ ID NO:865)
>orf01767
VVCYFYITIDWSWVHEDCCFFQTIVTFLSQAMLGMVVFF (SEQ ID NO:866)
>orf01768
MAFVLHTEKHHDINLINDFINGYKLSIVCKLLTSPFLRSSEKEFRSQAFQNLHIGFGNA(SEQ ID NO:867)
>orf01769
VVQVTCNSNFKTLKVAKFLINGHQIKQALARVLARTISTIDDGSRNRWTSNQFSIVVDLWMANHTDIHS (SEQ ID NO:868)
>orf01770
MCPCRILKEEIGNNRMVFIGKLGSIFKLNSSLDQFHYLIDSEVFHGHHMVQCLLIF(SEQ ID NO:869)
>orf01776
MSINCKGWNPKSYTHDNIGCLATNTCQTLQFFTCLRDLTIKIV(SEQ ID NO:870)
>orf01790
MPDCTLTNFLDKVLYNRQGNVGLEQGQANFFGCLLDIRFRDFSFFT(SEQ ID NO:871)
>orf01793
VQFHLIIFQNLFCSLDIVIDSLTTDTELLGNFSKAVIISVVELDIIHLLICQKRRIKFKERIHTIGFFDFHNFYYTKN (SEQ ID NO:872)
>orf01796
MKFNHYFFLFLIIEKQVAIISFFMHFHIIKLVNHFQLLIKLNCISHPNLHIRPSFLSLVLLFYQKEQDFAIMVI (SEQ ID NO:873)
>orf01799
LANNRKTETLGVSYLSTFIDKHELLQSYFESNDKTPVWDGEIHVLKSPSEKKDEILGKVPVQIKTTRQKKDVLKSFSLDTRDLELYKPNGGVVLFVVWLNEDNGLRDIYYKSLPPLSIKNLLKKSKLKNKSTNRKKLSIEIFKLDEKKMYPMLVDFINNSQKQYSFINVEGISVEDIPDDKTLKFYFYGQEKEEIFNYQEEHDLFIYYLDPITGIEIPLENTIKIVETEEETDLIIKIGDYVFQDVKRHRFPDGSVQLHFGESFTMSFDIKKKQFKFNYTRPDLLSKAIKCTQVFQELGKIGYFTLNGNKIELDERSIKDISSLDLEADIKGLLKISNFMKKMGIQKDVDLSCFDKQSQRNLNILYSGLVLKKKVALNYNESKLLHLNIANIHIITLYSFLSDKNGTMIDIFTETPWCREGETEDEDYLDISIFEVFEPNDWLKIDNCKIDSVIASYQRLVDNKLKYEGADRTILKIVIAADMAEDKTKRELLLNWAQCLSDWNLKYSKNCEMAIINDLQIKSRVRKLNSKETETLTNILVNSNDNYELCFGSSVLLKSKPQADLFWNKLDNETKERYKDFPIYTLYMKLS(SEQ ID NO:874)
>orf01800
MKVSKKITLFSLSFAGFVLLTLPQAGKAFELKEDWAFKGGIRYENGKVSKINNGYEVNIKVLDLPSTSAIEWTVRLNGEKQNTNFLAEERTVSKTEDKGRFLHFYIPYGYRGDIVVEAKSGNEVKTWSTKVVDDVYSDSAKSGYFILDGEQILESSWDSVNESYIATLPTVTSGKTVVAWREKGTLNLIKPGRIARQYNSSGSYVELSPIFETASWLKSNQNWYYQKQGQLVQNSWIKDQGSWYFMDDEGVMFNQTWLHQGGSWYAFKSSGAMISADWLYDNGSWYYLKDSGSMVTGWLKNGGSWYYLNKSGSMATGWIKDSGTWYYLKNSGSMATGWVKDSGSWYYLKNSGSMATGWVKDNGKWYYLASSGNMLRNTRTPDGYYVDGSGAWK (SEQ ID NO:875)
>orf01801
MKKILLSTVALLSLVASLLANNPVSAQESSSQATYSKSSGSWIKSGNRWWYKHSDGSYTTNGWEKINGTWYYFDSEGWMKTGWIKEYGKWYYLDDSGAMKTGWCLVSGSWYYLNSSGVMQTGLQTINGKQYYLAAGGAMQTGWHNIGDDTYFFANSGENQNINRRALVLGETSTRAVPIADVNAMEKVFNNQNFSEVVRFPDRTKSEIIAKMQELFESSSEGDVNYLYFTCHGGRDGRIYIGSDGLAFSGWELASVLKQYKGKFVVMLDCCHAGTIISKDNTGEGNEGASTEYFDLDEFVSGFSNMDGNEKSGEMIDSKFLVLCSSRGAEYSSGGSLSLATKYWSLGSGWNPLQNSQAYLAADQNNNRRITLNELYTYSREQVLKQNSNQHIEVYPDNSQFVLFKK(SEQ ID NO:876)
>orf01802
MENFGAVLKDIRISKNFRLKDLSCNEISESTISRFENGITKLSINHFYILLNRLGISFSEFEELVHCYYSKKECLFEELEHAVNSSDIFLLQELVDKIELKQKQEKSLCNYHIKLIAEQQINRLANLPYNSSKCNELIKYLLSVDTWMEYELKLFYNSVFFMNTRTISLLYRIVIKKTRYFLKTNTGTHRIIPLYLFNLKLLLKNNLLGSAQFFIDDLENLLTRQGYYFEKNYLLFLKGIYLIKTNQIELGKKECFKAMRIFKEYNDSDTINELNQKFKLDLTI(SEQ ID NO:877)
>orf01803
MSSIYSSAKKDFLYWNVLIFIMELPNDVKVQFYELRKKVQSFNQLSKRFGMDVSG(SEQ ID NO:878)
>orf01810
LSFLILSPAGAQESLSFFFVKITDASKTVKNGGQTETQKLVTKMASDFERVENKDSEVGKIVKEKLALSGDITEAKLTEISSALLAFEKEQNPVDLDAEKEKLVNRLSPRFETLEQAIASKDLEKVREAFKKMNSTWTINESVVRDNSTAHYGRVETAISFLPSSMETEPTDESGT(SEQ ID NO:879)
>orf01812
MQKNIYFVVLDLHTTDRDKIIQLFKDWTDYSAKLVEGELVKKDGQNALFPPSDTGETVGLNPHRLTLTFGVSASFLKRMNLENKRPRLFRDLPLFPKEQLREKYTGGDIVIHACADDEQIAFHAIRNLIRKGRNAVPLRWSQSGFAAIGDRMETPWNLFGFKDGTANPTKEQDFDRVIWADSKDWMENGSYMAVRRIQMFLETWDRTSLEEQENTFGRYKESGAPFGKKNEFDEVDLSLLPDDSHVCLAKEVDKPLLRRSYSYSDGIDEKTGQFDTGLLFISFQKDPDNFVKVQTNLGATDKMNEYITHIGSGLFTCFGGVEKGGYIGQKLLEG (SEQ ID NO:880)
>orf01815
MTGKKGFLFLNCHICMVTTTTCFLKERVESELLIFFYISLNRCLITV(SEQ ID NO:881)
>orf01818
MSLRNKIEQHIKELEGGKFQKLGDAYLSRKYNFNIVSLGSQEGTDKTTKGIPDSYAVENGKYVYIMYGTHKSVISKLEGDIQSVKKKILEENIAEDKVGRLICCHTSSNITIKQKEDLEKMAEPYHLELIGINEIANDLTKIDFQYLAKEYLSISESTEQVWSINDFIRIHDESKTNAPISNDYIGDVSEIINTIKSSEKRIFLISAKPGTGKTRLAIEICSLLDRNKYNIICVKSNNQDIYQDVKRNLNLHKENIVFIDDVNTTQNYISTLGLLNTTSNIRFILTVRDYAKKDVINNIKVYGYNNIEPELIKDDNFKELLNQFSRNDFTNQEIEHIKTISKSNPRIAVIAAKLSSSQDLTNFNDEIDILKDYYEEILNKNNIIYAEQKTLFILSYLKKIRLESLEENQEFNKLLKITDITNTDFKSAVEKLHERELCNIYNDKIVKIADQSLDDYIVIKFLINKKISILEILHELYPVNDQRVVQILNQCSNFIRKESDLEGVSDAVKSYYYNESNFESDELKEKFLIQFGVLLPLEAISHVKNKIDNIESQVYTKTNFINQKDKKGSIEDSVLNIVFVTTRTKYCSQILQLLLKYFDKNPNKISEVYSILEANYGLVTEREYIDYTLAENTISELANLDLTKSYNQELIVTILKQFLKIEIERTEAHEEKFTFGRYKVPDSEKLKQYHRSILKLLANLYNIGSCETRFYIEKMLYDYRRKILTYSESHRNTIFGDLRNIRKLFFNDIKNLSMIGEKIVYALHKAEVKENLPIVFDDYIISDRQKIYNNLTNPNHAWFYDASEIKLQQIANSYSNVWLKIFNFANQFKHSLFMNDNNIELVLFNMFLLSKNDKKIKFLNYMFKSNYHFVNMNPISFLENIEESSMQSVIVSSPESEKYEWQLAYLTQLENVKNEDLQTLKSILEANSLPCYFTILNFERLILKDPSLKELLIQKAGNTNFVISDFIREEEVPKLINLIGVKELKFWYLINLENCQNHSYNLFQKLGEKDVDFSVEVLKKIDELRIGHSNLGYMVLHSISEFRDKKEIYKKFIRFAINRPYYYYNNMIDDIIKNDSQIILEILEETNNEQSAIRLVNLGVEFLENNNQKLILFNLLRAKGFGKKSFQEIHFTPYSHFYTGSHVPVLELEKELLERIKKIFETGIDYINLLLYLNKLIDCKRKAIERELEKEF(SEQ ID NO:882)
>orf01822
MNESLDDITHKQFTSNLTTKADNVSVQLFFSIKGCCHITNQGRTYTWNFIYSVVDTNTSSTDTYLKISLAASYSFPYFFTKDWVESPCMVICTKVNDFISF (SEQ ID NO:883)
>orf01824
MAISQMKRISLLFSKSSLDDVLKTIQELESVQFRDLKVQDNWSEALEKDEVVFPTIQISHTSNSNHGVIEGNDALIYLMNQQQYLEATVEKLQEYLPKENTFKLVRQPPITTSYKELEKLVKLMLPRVFLKK(SEQ ID NO:884)
>orf01826
MNRACIIQPSLVEIQIWLLNHVCKCSDFLSHRMRSLLDRKLDLISLLIKLFPKKNWKN(SEQ ID NO:885)
>orf01828
LNGGEFLETEFGHSVLAIQSVVWFSFFCLKSNASSLAHGI (SEQ ID NO:886)
>orf01829
LLAGILELENWGKTTELRPTLLSGPVQNKIEALKRAKI (SEQ ID NO:887)
>orf01834
MRSQNHNCRPSTRKVGCIGPIFFGHLLNHRKFSYQVLTITLMEEVSLDCLPSGHHVSCQQGSNRYIGDRTCSNSFLIRQFFRQDTTAVAST (SEQ ID NO:888)
>orf01861
LRQNRCYNCFHDHSCSWKSSRITSLHGCLVRFVGFDIHTHKRFIKSRNGFHDPTNNDGLPISHTTFKTT (SEQ ID NO:889)
>orf01862
LAAFTITSLKAKTKFHPFKGIDRDNSLSQSCIQFSIPLDIGTKTNWNASDDCLHNPTDGITTTFDLVNIVLDFLFSFLVDNRNFRLGSSLLNFSDCQIFRNIYFLTTKDHDMVGNLHIQLSQEAFGYCTNCHPHGGFTS (SEQ ID NO:890)
>orf01863
MTWARMSNFPLAFKAVFNVLRRHDVQPFLVVLIDDIHSNRRPCRLPVANARSKDNLVTLNLHTTTTTVATLTASKVLIDILSCQWKSSWNSLNNSC (SEQ ID NO:891)
>orf01865
MHDLAITGSRFDGMANSVAKIEVKTNTIVQLIFNHHLALHLTRMFNQGLCMFQNTLNRTIQSRQESPQFWILNQAILDNFTHPFNQLSFSEGFKNKWINQNPIWLGKGPHHIFSKWCVNACLSTDRRINLSCQTSRNLNKVNTPHIGRGYKASQVPNNATTKSNDSIATSQTLLD(SEQ ID NO:892)
>orf01867
MVICHNDYLLRLPEFSQPLTSLGHTTFFNLNIIRMMRNIDSDFHRRVSLSLLVFFC(SEQ ID NO:893)
>orf01872
LIEGHLVFADKPAQALVLLRKVGSPKKVSFLTLHLYFLILKIDILKITGF(SEQ ID NO:894)
>orf01882
MSTTTKFNRVVTDSHDTNFLAVFLTKEGHSSHFFSSVNIRFHCLNFKSFPDFFVDLLFNRTQFFSSYRLGSG (SEQ ID NO:895)
>orf01887
VTWIHSHFNPAVVRIFIVWIVGHVKFFSREIKPFRACQKLISPSDSFVTEVIPDREVPQHFKHGMVTRSLPYVFDVVGTDSLLGIGNTWIFRDNGPVKVFLKRLLPQS(SEQ ID NO:896)
>orf01906
MNGHFLLLFCLFNIFFHLVNIELSKQVLTVLDWETLVQXXIPFIN (SEQ ID NO:897)
>orf01911
MRLRDLRRVDFPDPDGPIKAVISLGWKDRETLFKAFFLL (SEQ ID NO:898)
>orf01914
LKNHSNVFTHFINVDFWTVDINSTIENLPSYFSNINSIIHAIETA (SEQ ID NO:899)
>orf01915
LHINPLNGFIFTIVNMDILSRKGYFFFRKGKDMLLIPVIC (SEQ ID NO:900)
>orf01920
LSPFQHCHSSGSIFFNLHFLNRNILQSFDNPFLGLIRENKIEKFCSQLIGLPQCIHMLIRPQGPIIATYIFWT (SEQ ID NO:901)
>orf01921
MNXXNSRCNHPTWSNFLDILEVDFLGNIVGQKIRSHDLKNPVQVFTVIDMTIHIQVVKTNMVILADRLFQGFILRSTDKFFIKIRLVRSHNLRFNNMDFSTVAVHENKGRHHVDELLPRFIINSKATVAKKSIVAQGFRFDGNFFRKTRQTNHLNIIFCDNPDQIIVFQNGLITNSQFNRLHP(SEQ ID NO:902)
>orf01930
MWYFYNTDGSMATGWVQVNGSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVR (SEQ ID NO:903)
>orf01931
LTFIKSWAIEIFCFDWNFLDKNLGLGSFFNNSCLRVFFLT (SEQ ID NO:904)
>orf01932
LLLSCRKVIVCFIFSSKWNKNFFNLAFSWNFDNCIRGFFSINSNLFGNITSLWINIVGPCRSYIAILSINCNRIFTTVFCFIFFITNSRT (SEQ ID NO:905)
>orf01949
MRFIVGRFTSFSLGIEFSPTSKLDDLLFKIAFLMILATWIKARKTKGAT(SEQ ID NO:906)
>orf01961
LVNCEPLEAYRQLEEAELVGCWAHVRRKFFEATPKQADKSSLGAKGLAYRDQLFALERDWEALPADERLQKRPAPNGRLLCLVPPSVSFSRFKTRKGN(SEQ ID NO:907)
>orf01963
LKRNKIWKKTLTYPVEREEITYKRKKAKGKRQAILAQFDSEEVHHRLENCICPDCQGELKEIGASLQRQELVFILAQLKRVNHIQHAYKCQTCSKNNPSDKIVKAPIPKAPLAHSLGSASIIAHTIHQKFILKVPNYR (SEQ ID NO:908)
>orf01964
LKIIQQQSATIDSLTNELALLREQVAYLTQKLYGKSSEKSVCPSGQLSLFEEEQNMEEDSDLPS (SEQ ID NO:909)
>orf01972
LICQTIKYWHKFHLHIGRCKLLIGLIPILNFFIRADIDCLLVLLSLIDRQNGKQFNLCQWIIASNGLNDSFEIIESLIHRNILSDIICPNQKKNFIYCSTI(SEQ ID NO:910)
>orf01978
MSFSCSDSCFSILLLDGDIHENTTFSPLSILFISHRFNSLIGNEVPHLIDNELLISIFFHRFRWFNNVRMPSKDNIRSPIDHLVIKSFLFFSWFQSILNTHLKHDNGDICFLLCPFNFSLHLIFV (SEQ ID NO:911)
>orf01981
VVEQIPVGHNSGSFFLFLLLRLLLSPLLRNSISFLTSQGIPWKLSNNKTKPIDKSTASKSIATNPLLLHLR (SEQID NO:912)
>orf01988
MLKLIIYQFQYSKRQWLGTIPLLFVSSLIVGTSLFGIASSIKTANINASQLFQMLIIFGGTTLFFLISNNIRLLIDIFKKDYQLWTILGASRTQLSLLVSGQFYLMAVIVSSIGTILSFIMADSYYKFLQNLLGRDELPDLVITANIQSILLSIFIVPTIVGIGAYFYSSRILKISSILKPKKKKRKVTVTGFVNISVRLFLWLLCIGSIVSAGFIRNKEIIEKQSSIILFLLIIHILIIQSLSPSIQMFLIKFLMRIFPTENYVINTGFWNLLSNPSYLKSIQTSMSMGVTLISGFILYTQNMYSFMNTANGVNEARASFIAYMSAPIILIITSSISLTILSSNKDIEDIKQLKTLGVSRLQLFKIRIGEAIIHSVLILLVSVIFNLIILILVSIIGQFLGRSLVDISGFWQPSLIVISLLVIFYSITKGFYLFQDR (SEQ ID NO:913)
>orf01989
MVNNVAVKVSNLSKEFLLGQDKTVSILKDISLSVNYGEFVSILGVSGSGKSTLLSCLSSLSEPTSGEVVINGVNPYTLKEGKLAKFRRQDIAIIFQNYNLVPALPVLENVTLPLRLSGKSVDSNKVKKMLDSLNFKAELSSLVATLSGGEQQKVAITRAIIADSKIIFADEPTGALDSVSRKLIFETLRNLASQGKCVLMVTHDIELASKTDRALILKDGKISRQIIKPSADELYQALESSKD (SEQ ID NO:914)
>orf01994
LALVRKFIDYFFGVLVPFPDLYVFKSCFKFAGSFTDFDTFDWWLDWCRSCSENRFFV(SEQ ID NO:915)
>orf02006
MPTILLLKKFYERLITNFFRLKFLFCKEILATNIFNHPLFEPDIRVITIKII(SEQ ID NO:916)
>orf02009
MXXGAFGQGELLLQQSRNSSITEIVSDSWAGAGRRILPLPKSVTPLVSS(SEQ ID NO:917)
>orf02013
MLLLISLTQLIIFLFFERFNLLLKTFLLVDLKSNKSA (SEQ ID NO:918)
>orf02022
MAGKKGFLFLNCHICMVTTTTCFLKERVESELLIFFYISPNRCLITVYSVLNL(SEQ ID NO:919)
>orf02029
VIPRYVTKHQGWDHNPHTITNSDDDPATLVTFRTFKFNVGNCTIPKNDQNGSSQKFSGILQCPCEIHLLDSP (SEQ ID NO:920)
>orf02034
LLVRKFNIQTFFIQVFILNDFGYTVNGLIVYRLLLTSSILSFNDYSIGSFRTVIVI(SEQ ID NO:921)
>orf02040
MRLSIQLIHDLNTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:922)
>orf02048
MTAQLPSDALQMALWRRKRPRNVIVHTDRGGQYCSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA (SEQ ID NO:923)
>orf02093
MEIVLVSFSISFQHFIIAYCLDFSSAGFRNSQNFSNFC (SEQ ID NO:924)
>orf02105
LTEKIQEHELIKTNQAEKSVQDVLDNCIERVQNNSLKSDRVTSFETPFALLFIFATIAVMLTYGGYRVSAGYISVGTLVSFLIYLFQLLNPISNIANFVTVYSRSKGSSVALENLLAVPKEKFEGGKSVSGRGLNFNHVYFGYDENRPVLKDITCSIFKGQKIAFVGPSGSGKSTIVRLLEQFYKPLSGDILMEQSSIYDFNLKEWRSKIAWVSQNNAVLSGSIRDNLCLGLNRLVTDDELMKVLDLVSLGDEIRSMKEGLDTEVGERGRFLSGGQSQRLQIARTYLKDAEILIFDESTANLDADSEYAIISILYSALKEKTVVIIAHRLSTVKDVDCIFFLEERKITGSGTHKELLENHERYARFVQEQMIE (SEQ ID NO:925)
>orf02106
MKLKLLRVDTKVIMGSFLLVLSSLLALLLPLILKGLIDGSSIENIGSKVFQSFLIFIGQALFSSIGYYLFSQSGEKKIAKIRKKVIEGLIYAEKSFFDKSQSGELTSAIVNDTSVIREFLITTFPNIILSLVMVLGSIVVLFSLDWNLSLLLFITLPCMMFIILPLSNISEKYSRRLQEEI (SEQ ID NO:926)
>orf02115
MXXALIPRVSIGSVGLLLLTENYVKGDLKAASRLVQDSLTLLFMFLLPATVGVVMVGEPLYTVFYGKPDSLALGLFVFAVLQSIILGLYMVLSPMLQAMFRNRKAVLYFIYGSIAKLVLQLPTIALFHSYGPLISTTIALIIPNVLMYRDICKVTGVKRKVILKRTILISLLTLVKVSVNRNHPVAVRIFLPTKWTFVELPLCSSCRCHGGWTLYGYESAYLFIR(SEQ ID NO:927)
>orf02125
MRERVRLSGSLFTSLKTREHIKSTMELFHKYVFFLIQEIKIKMINFLKIGDLPTL(SEQ ID NO:928)
>orf02134
MANHLYIVPIQVNHKSSIVNRMLTSITRNPIVSPTCLYASLITSLNFFLIFC(SEQ ID NO:929)
>orf02137
VIVFLSRNKDGNAFCHLDLISIANPVWGWDDDFITWIDHSHKEGIERIFGSRSDCHLI(SEQ ID NO:930)
>orf02140
LSNQFYFSLQTKPILKVKQFLLFQSQMTRVSEILQFSNKL (SEQ ID NO:931)
>orf02141
MSKKVLFIVGSVRQGSFNHQMALEAEKTLAGKAEVSYLDYSDLPLFSQDLEVPTHPAVAAAREAVLVEDAIWIFSHSLQLLYPRYSEKLA (SEQ ID NO:932)
>orf02144
MEDKEMGFYLMVASMLLGLLALKIGFSQFKEKKDKFLSILTSLAGTALVLVAVWLGWPK(SEQ ID NO:933)
>orf02166
MLDSDIGCSRKNLLGLFWIRRRRNIHIVDRAMEKGISNRAPNKISLKACFFNFF(SEQ ID NO:934)
>orf02193
LLHPFTRNITCDRHILTLLGNLVNFIHIDNATLCTFDVKVSNLQEFEEYIFHILTHITSLRQSCRIRNSKRYIQALSQGLGKESFP (SEQ ID NO:935)
>orf02194
VEIDAFVVVINRHCQGTLGTILTNYIVVQDMEEFNWFWHLRQVCQDFLNQFFSNDFLS(SEQ ID NO:936)
>orf02198
MEXXXTELAGRGFLVWHPKMDEYMEALDGHLDEISERLITLGGSPFSTLTEFLQNSEIEEEAGEYRNVEESLERVLAIYRYLITLFQKALDVTDEEGDDVTNDIFVGAKAELEKTVWMLAAELGQAPGL (SEQ ID NO:937)
>orf02199
LITRLLHRVHVLVDDFNPSICITWSSLALIYIFPVLNIIIDRVRQVHIVFLYKSHGLFSVILSIGLIFSIGIEIDTIRNSQNG (SEQ ID NO:938)
>orf02200
MXXTGSLSANFAGSTTASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYKAWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYYFKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVGGKWYYLKKWGYMARNEWQGNYYLTGSGAMATDEVIMDGARYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKIIDISEHNGRINDWKKGIDEKRVDGVICRLGYSGKEDKEWRIH (SEQ ID NO:939)
>orf02202
MHKNFVVVVTNFFTAVQFIQFNKEGTTCHNTTKFFNHLDSCLNSSTCRQKVIYNKNTLTWLNGIRVHSQGIDTVLFFIVSRNNFAWQFTWLTNRRKTNSQLKGNWTTHDKSTSFRSHDHVDFLVSSILNDFTNSVAISISISHQRTNITEGNAFLWIIFNCCNVIF(SEQ ID NO:940)
>orf02209
MNRCNSRQAIWKIISTLNRENTHIMLNRQVCFCFVNHISPLNVVIWENLSLEELLYAICICFITHKIAKQTSLTIDNAGIAMNNIR (SEQ ID NO:941)
>orf02214
LDSRFFCTDFFKGRQAKGCSFSCTSLSLTDNILAFKGQRNSLFLDRTSFYKTSFFNFC(SEQ ID NO:942)
>orf02225
MGRKPRTRPEERTELERLQAENEYLRAENAILKKLRELRLKEEKEKEERQKLFKN(SEQ ID NO:943)
>orf02246
MLSKEEYIEEIGLIEKQNYVEVELYPLVADIINPTLKNSLSKRYVFGRRKSNMGQIYYGLSNFPDIVILDKNYQNKARKSIEIEEWKKLRGCVEIKSLKHDLITEEKIKSTISNSFEHITGEMGQLIGDLLWYKKVIYTNGIEWRFLSLDDKEEIDNTIVQVVNKRIETEEAGNSFDWWKNIKDLSFNYTDIYLSKDCIQEWDEFVKKVKEIEW (SEQ ID NO:944)
>orf02248
LEVCIHHHHQISCRILQACIKGCFFAKISRERNIMDCRILLPIGL(SEQ ID NO:945)
>orf02255
VDRTDEVSSKHCFEVVDRTDEVSNHTHGKATLTWFELDFRRV (SEQ ID NO:946)
>orf02263
MTPIKDKVRRVKTPMMVNPDTDLTISSVQQDYFSLALIGFSLLTGDFLSFSKGDQKTGLSAFIKICHLIKIARLDNKITKQQEYWLYDLLMMSQGEKIQKIKQLKQVTSDILLNTPDFSSYFEKYNFKEEAENIKSYLLAKSMDKSGRLFPSNEFGEFVSPVSFQHGFGGVLFFMNKYYVEEDENTVKEWLTKLENYEAANFLHGYSLLFGKAGFLFGILDRYEKTKERYLIDISKRLVDHLMRVYDNISNLDFALGKSGILLSLMKYCTIFDDKKLANFIKNNINDAYSLLESEDNGDIYSNNFAHGRSGAAYVLKAYTDIFGDSRYQNHLQKFSDGISELLEEKLSSFSKLDNLGLSWCDGVSGLILYLCLIDKERYSEIIYKSQLEMVQQYEAMGTSFCHGLSSLLQTTIYNKNQKVEQFIKKILLTRSYRNNDRLLQFQGEDGINSYFDFGVGNLGIYWTLLGYTFPFELSKGD(SEQ ID NO:947)
>orf02264
MHIFLKNRAFRQLTVNEWISSFGDTIFYLAFINYVSSYAFAPLAIFLISLSETIPQVLQLFTGVIADFQKNRISKYISILFIKVLLYSGVTLLLTSTDFSLFSVFFICSMNLISDTIGFLAGYMLTPIYIRLINDDMTEAMGFRQSTSSIVRLIGNLSGGVFLGLFSISTLAFVNVLTFLFAFLGSLLIRNRLKKEEEKIEVPPYVGMSSFFQHLKESMKLLMTMEDVMVLLWILSISQAVLMMVEPVSAILLIHHPFMGLSTGQSLAILIMISLLHVILGGLLSGFLSKKISIRLNIYWSLLMESLIVIDFLRGSFLLILLGSAGDAFSAGVLSPRLQAMIFGIIPEELMGSVQSSINVINLLIPAVLSLALVFLATSAGLEVVAFALIILLLIAAYLVHQMKNLPNQEEV(SEQ ID NO:948)
>orf02266
MTVPDRLPARMRMDGRSIFQQIKNYFDTENKEYFKHPNTYDGISMHLEPNILTSMEHFDLTGFHCECKDFQNQGVCKHWVAMDLYFRSLPQAIQERIGKASHKPSFASQLIPTLPSEELQDELVEEKATAPSLALHGQVEIRNHSLAWTLKLQVEQAPRAYVIKDIAHFIFLIFQKEDYFVSQKIGTIRLSLNQFNQASQNLLLYIKKYFIDRNEHSYFNFSYGINPRDYGRYLETPVSYLNDLVPLFQALDVFQYVTSKAEYPLIFLDDSPFIPEEEIFKVVKSNNHYEIINTAYFGFIIQEKLWIRHNHFHIIKEEHRYFLDKLATWIYHYQENSPLIFSKENKAELMQVCNIISNYVPISIPDELQIHDFIPTFAFSKTRNEIALNMVWSFGEKQVHSKQDLLTLPYTYQASKARKIYHQLLSAGFKEEFHSLSKIKIVDFFLKELPRFRTLGQVQLDESLEKLLVEDPAVIDIFDDESFLSVQFDFSMISEDEVEKAIQALWNQESHYQTKQGKVLVFDDESLKVAQSLQDLRAKFSDGKIKMHKSRAFSLSETFKDNEHVNFSRDFKKMAYDLTHPEEFDIKPYEVKAKLRSYQKEGVKWLSMLDHYHFGGILADDMGLGKTLQTITLLEANLKPDQKALILAPASLLYNWKEEFRKFVPHKQVEVAYGSKTERIKQIEKSATITITSYPSFRSDLEHYQKQSYDYLILDEAQMIKNSQTKTAQALREFDVKTCYALSGTPIENRLEEIWSIFQIVLPGLLPSKKEFSKLSPQLVAKLIQPFVLRRKKDEVLTELPELSEHLYSNELSSSQKTLYLAQLRRMQEMVSGASAYEIKRHKIEILAGLTRLRQICNTPALFLEDYKGDSGKMDSLFELLDTIREKGSRPLIFSQFTSMLDLIEQELEKKEMSHFKITGQTPSDKRQEMVNLFNQGEKDCFLISLKAGGTGLNLTGADTVILCDLWWNPAVEMQAIGRSHRLGQTKQVDVYRLITLGTIEEKIQELQESKKELFNTVLEGQESRSNLSVDDIKEILGVE (SEQ ID NO:949)
>orf02283
MMSMVDPIDQTFIVNLKIGKSQVFSQLQFSCHIVVYPSEVHIYQAFVIKLQNHILGPQVLP (SEQ ID NO:950)
>orf02284
LPNRTRIDNQLPTSPVTKQLLVNMSINSNITGRMSHQAVKLLLFASMNQLSPPVLIRQMMANSHRQIPKLTMNLKRLIVEHFNFF (SEQ ID NO:951)
>orf02285
LIQQVQNPSTPCPWHENISQKPVFIHSYLPSICQNSLQGGGISMNI(SEQ ID NO:952)
>orf02308
MIDKVVRNLLLTFLFCKMTKIINFLTTILVKKKKMCYNVSKLREKKKGAMMWVLGFILFIIFFYSNNSKKIKKLRE (SEQ ID NO:953)
>orf02309
VDRTDEVSSKHGFEVVDETDEVSNHTYGKVKLTWFEEIFEEY (SEQ ID NO:954)
>orf02314
MIAEFIDGLQKFHFLQNALITAIVVGIVAGAVGCFIILRGMSLMGDAISHAVLPGVALSFILGLDFFIGAIVFGLLAAIIITYIKGNSIIKSDTAIGITFSSFLALGIILIGVAKSSTDLFHILFGNILAVQDTDMFITMGVGAAILLLIWIFFKQLLITSFDELLAKAMGMPVNFYHYLLMVLLTLVSVTAMQSVGTILIVAMLITPAATAYLYANSLKSMIFLSSTFGATASVLGLFIGYSFNVAAGSSIVLTAASFFLISFFIAPKQRYLKLKNKHLLK(SEQ ID NO:955)
>orf02336
MYEEPEVAPVHPTGPTPATETVDSAPGFEAPQESVTIL (SEQ ID NO:956)
>orf02363
MGNNGQFTFGYRHDFFQNQLAIFNALVDTFTRRTIDIKTLNTFINEVLNQGTRTLWTYFSLLIITCVEGWNDTFVFFQI (SEQ ID NO:957)
>orf02368
LHEVVIPSIDEGKDCKGCKPWFHNREGYTPEGTNLTTTVDFS (SEQ ID NO:958)
>orf02369
LFHEEDTEWPSNQRQDNCPESIVDSHEVDDTYQWYKDNLFWKRHSSDKDSK(SEQ ID NO:959)
>orf02393
MSYFRNRDIDIERISMNRSVQERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGESSTLTDTEKSQPSSETELSGNKQEQERKDKQEEKIPRDYYARDLENVETVIEKEDVETNASNGQRVDLSSELDKLKKLENATVHMEFKPDAKAPAFYNLFSVSSATKKDEYFTMAVYNNTATLEGRGSDGQQFYGNYNDAPLKVKPGQWNSVTFTVEKPTAELPKGRVRLYVNGVLSRTSLKSGNFIKDMPDVTHVQIGATKRANNTVWGSNLQIRNLTVYNRALTPEEVQKRSQLFKRSDLEKKLPEGAVLTEKTDIFESGRNGKPNKDGIKSYRIPALLKTDKGTLIAGADERRLHSSDWGDIGMVIRRSEDNGKTWGDKVVISNLRDNPEAKDPAAPSPLNIDMVLVQDPTTKRIFSIYDMFPEGRAVFGMPKTPEKAYEKIGDKTYQILYKQGESGHYTVRENGEVYNAQNQKTDYRVVVNPTEPGYRDKGNLYKGQELIGNIYFAHSTKNPFRVANTSYLWMSYSDDDGKTWSAPRDITPGLRKDWMKFLGTGPGTGIVLRNGPHKGRILIPVYTTNNVSHLNGSQSSRVIYSDDHGKTWHAGEAVNDNRQVDGQKIHSSTMNNERAQNTESTVVQLNNGDVKLFMRGLTGDLQVATSKDGGVTWEKDIKRYPQVKDVYVQMSAIHTMHEGKEYIILSNAGGPKRENGMVHLARVEENGELTWLKHNPIQKGEFAYNSLQELGNGEYGILYEHTEKGQNAYTLSFRKFNWDFLSKDLISPTEAKVKRTREMGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDIGQEIIGIAKGSIESMHNLPVNLAGARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKEDYTYKAPLAQQALPETGNKESDLLASLGLTAFFLGLFTLGKKREQ (SEQ ID NO:960)
>orf02395
VADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSNIYTARRS (SEQ ID NO:961) >orf02399
MKIKEQTRKLTAGCSKHCFEVVDETDEVSSKHCFEVADRTDEVSSKHCFEVADRTDEVSNIYTVRRR (SEQ ID NO:962)
>orf02407
MSCNCAFYRSQFFDVNSVSNYHSHQKELRFPNSILFTYFVKVA (SEQ ID NO:963)
>orf02428
VGLIKLTSYVFVCISNSFLTRHDKNDNICFFHGNFCLVLDLFHERSIDIINSSCINHAKRTIEPLTRCINTVTCHSFDIFYNGDSLTSDPIK (SEQ ID NO:964)
>orf02430
LSSKSCIDRTNQETFHTLGLEGVGMKSGSLFCSVQISDKEKENSRLANGFLRYQFIQGIFLLLTSYHNHRVGLEILPR (SEQ ID NO:965)
>orf02448
MKSKEQTRKLAVGCSKYSFEVADKTDEVSSKHCFEVVDRTDEVSNYIYGKAKLTWFEEIFEEY (SEQ ID NO:966)
>orf02450
LSNSFFLIKFSSSKTSGKKRIVSDNIFIRNKFICHFKKE (SEQ ID NO:967)
>orf02459
MDYSKVAAEVIEAVGKDNLVAAAHCATRLRLVLKDEAKVNQAALDNNADVKGTFSTNGQYQIIIGPGDVNFVYAEIIKKTGLKEVSTDDLKEIANKDKKFNPLMDLIKLLSDIFVPIIPALVAGGLLMALRNFLTSPDLFGPQSIEDMYPAIKGFSAMIQLMSAAPFMFLPVLVGISAAKRFGANQFLGAAIGMIMTTPDLGGKEAFWDILGFHVTQTNYAYQVIPVLVAVWLLANLEKFFHKKLPSAVDFTFTPLLSVMITGFLTFTVIGPVMLVVSDAITNAIVWLYNTTGAFGMGLFGGTYSLIVMTGLHQSFPAIETQLLSAYNNNGTGFGDYIFVVASMANVAQGAATLAVYFLTKNAKTKGLSSSAAVSAFLGITEPALFGVNLKYKFPFFCALAGSAIGAFVAGLTHVIAVSLGAAGFIGFLSIKAGSIPMYIIAEIMSFVAAFAFTYFYGKTKAASVFADEAATATAETVTEPTVEAPVVEETDTLQNETLVTPIVGDVVALADVNDPVFSSGAMGQGIAVKPSQGVVYAPADAEVSIAFPTGHAFGLKTRNGAEVLIHVGIDTVSMNGEGFEAKVAQGDKVKAGDVLGTFDSNKIAAAGLDDTTMVIVTNTADYASVAPVATGSVAKGDAVIEVKI(SEQ ID NO:968)
>orf02466
MTESYTWVEADRATLSRYRHGQGHLTDQFFSFKVQRPAAKTLIASISTGKGMGPSFDGTPVITSGNQNRINTIKNSFIMSSSSVRISLRKLTSQRNFLRNLSSLILLAAQVAKGDATACSHQRISRVVGQDSHETLSLTEFF (SEQ ID NO:969)
>orf02467
MNINNEKVWFAFYLLDMQITRPTPTFNDRRIGLIGKLQELRFLAGNLLLR(SEQ ID NO:970)
>orf02468
LIKGYLPNHLALMDLCSKTTCTLDDFAGIAGRRNHRGFFCHIGNGVFLTVDKYLRNQRIRQRKSSHHILTQLVCHSHTHLFILLQTSLSLRTKERLSF (SEQ ID NO:971)
>orf02474
LIKLTGRNFSDILIKCLVKCFTNLLSNQLMLLPSTLKL (SEQ ID NO:972)
>orf02479
MIESENHCSASHSNRDYQSQHDNQGRTCQCFIIVPCHKKGSCSVGEITWNQRCQNGQDKDHSRCLIKNT (SEQ ID NO:973)
>orf02486
MIARQLMVFFSTNQADTRITNMSIDSLIINNSKDFQSSSHASVSFILTKLVNLLIFNF(SEQ ID NO:974)
>orf02487
MGEPFTHFIDCIDLGINPSYTQVCDRHFTSDIPCTMTSHPIS (SEQ ID NO:975)
>orf02494
LSSDSHFIGIKAFVILILGKSNSIVLRIVGLYQDLTCFFSPTCSTCHLSQELEGSLRRTEIRQIQGRIRI (SEQ ID NO:976)
>orf02495
MAVHSLGIHMQGQRNIAVGTSIHRPTLPTHDKARITTAIEHENHLLFFNQTVLDSL(SEQ ID NO:977)
>orf02496
MVTGIAVLLISHFMLFINNHDTQIFQRSKDSRSGTNNNLGIATLHLAPFIILFTIG(SEQ ID NO:978)
>orf02497
VKNGYLVPKTCYKTLGHLRSQGNLRYQQNSCLALIQGTLDNLQVNLGLPTSCNPLK(SEQ ID NO:979)
>orf02498
MVNLIPRLGLDLLLIDCLIFQTKQAFSSQTHHFSLLGKV (SEQ ID NO:980)
>orf02499
LGLQTKNNPLNQAIPLTKRHMNPHPNFQHSLKFLRNPVTIGLVRLHQGHIYDNLS(SEQ ID NO:981)
>orf02502
LGNHFCTICSTTYQAILQFIQIWWCQEDKDSIWNLFLDLKSTLNFNFKENIDSLVQGFIDIGQRSSIVVADIFCVFQHLSLTNQLFKFFTSTEEIVNTVHFSRTLCACRHRYRILKLVFRTLKNLSSNRSFSNP (SEQ ID NO:982)
>orf02527
VGCSYICHELVTNHDHFLFVIVEFLHSTVNTKCEGLQGPVNVINPKFLNCSLNAFFGVI(SEQ ID NO:983)
>orf02528
LLHLWRSIRVVPSNGGIIQIDQNSLDSLRLQAWDCQIIDCFHSKIWYIIFNRHSGSFC(SEQ ID NO:984)
>orf02530
MIGTFAAALVAVLASFIVPIEITLNSANTEIAPPDGIGQVLSNLLLKLVDSPVNALLTANYIGILSWAVIFGIAMREASKNSKELLKTIADVTSKIVEWIINLTPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPMVNPLIAFFFMRRNPYPLVWNCLRVSGVTAFFTRSSAANIPVNMKLCHDLGLNPDTYSVSIPLGSTINMAGVAITINLLTLVTVNTLGIPVDFATAFVLSVVAAISACGASGIAGGSLLLIPVACSLFGISNDIAIQIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKK (SEQ ID NO:985)
>orf02537
MWNKNRQLRKVKKILNQINRRKEEMALLTDEELAAKTQEFKRRLTAGETLDDILVEAFAVVREADKRILGMFPYDVQVMGGIVIHQGNVAEMNTGEGKTLTATLPIYLNALSGQGVILVTTNSYLAKRDAEEMGKVYEFLGLTIRLPFADDEEEKITPKEKKEIYSADIVYTTNSGLGFDYLIDNLASSEEQKYMPEFNFVLVDEIDSVLLDSAQTPLVISGSPRVQSNFYGIIDTLMTTLVDGEDYIFKEEKKEVWLTNKGAKIAEKFLGIDNLYAEENNVLARHLVFALRAHTLFKRDKDYIIRKGEKDQELVLLDQGTGRLMEMTKLQGGLHQAIEAKEHVKLSPETRAMASITYQSLFKMFNKISGMTGTGKVAEKEFIETYNMSVVRIPTNRPRQRIDYPDNLYITLPEKVYASLEYIKQYHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGKGVAELGGLIVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKLVEKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFIVTNISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLYEQFLRLSLLKAIDDNWVEQVDYLQQLSMAIGGQSASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFP (SEQ ID NO:986)
>orf02538
VSNKLHILQIGNRNWSHYYEIPENIEWHFFWPGSTTAIKKVMKMEGIRTFSGVVIENPDYLPDLLPLINILTPYTIFYSDICASYSPLVEEFLKKTCAQVTDFSNPRELLRILSKALFKGQYGDKLTPIDMVVNPYFAGSIRYNGYENLELVGSYGEDFRPLISWKYNIRASEWNPIELWLEYEKDLSCDIRIVVRNIQDGSTADFIKERIFTTDDMEAAILLDDDFSSFISVSLEAKGNGRLKIGALHQRLTRYQFGKFVLGGNIIRSKNREEINYFFYPGDFKPPLNVYFSGYRRAEGFEGFGMMKSLGSPFLLFQDPRIDGGAFYLGDDDFENAVRRVIQHHLDLLGFSNKELILSGISMGTYGALYYSSDFEPKAVIVSKPLTNLGLIAERGRLEAPGLFPTAFDILRHHSQGKADIDSINILNARFWERFRGADFNQTIFGLSYMKEEDYDPIAYDSLVDSLYSTGARIMVKGTSGRHNDDNDSTILWFVNFYKMVLEQEFGRKY (SEQ ID NO:987)
>orf02539
MYYFIPFLESMNQSWQVDIVPWYQTTHRLEFDDVLHQIRIFKREGIKSKIVLLPYHPHMRYLLHRQDLLEVEAFSVFDAIQDIENEEIYPLQLKDLAWDEDCDFIYTPFLIAVKQKGELHAHIEFGTEGFISYITYFKDNQVDFICYFDDRGFLSSLVKYQDNQAVSRYYYNSNAEWQIKEYLQGIHTKVEVNPRFSHRFRKSTYQSMDEVVWEFFEKFLTAEYKEGESFVLAAQTKYQNQLLKHLPEHADKILTFFIERNQEDDLNLHHQAVKQAKILISDRQDFLERLKQHYPQFTYKMHHLPSFDTRLKLGVSQRVKESKLYVQLDLNTPLNSEALYEVLNFVSQNPLTEIVFATFNAEGYQIEALQKHLFTLISERLNFRDLLKESIISGAENKLEENKEENYRFQIVNLNDEIGLIRELEYTRLIVDLNPIANIYTQIAGISAGIPQINLSESEYVTHLQNGYILSDLSEFSKAGHYFLDTLEHWNQALIHSIDKIRQNTGNQFVQKWERWLEEAKSEQ (SEQ ID NO:988)
>orf02540
LKKKLKSVVVKRVMWTICFIFVYILGSRLTLPFVNVNDTSFLGGNAAFLAFSTAMTGGNLRSLSLFSVGLSPWMSAMILWQMFSMSKKLGLGNLPLEIQERRKMILTFIISFIQTLAITLNLPIQEGVNHDLVLILNILLLISGTFFLVWLSDLNSLLGVGGSVVILMSSMIVSVPENIVRSIIDLHVNLLFIISLLLISIAFLYIAVRVQKARYRILVNKIMIHNRFKRYSYFDIMLNPAGGMPFMYAISLVSIPQYLLMLLHIFVPKTRWVDNWIAEFTIGRPVWVYTYIIVLFLLGIAFAFVNMNGEQIADKMKKSGEYIYDIYPGEDTALYINRLVLRFAVIGSIYILLMAGIPMLIILYEPRYMOLSMLPGLFLMFNGMIFNVKEEINALTLNESYRPLVERK(SEQ ID NO:989)
>orf02541
MKINITNIYGMSGQSTALIAQNETVKIAKKLDFHELSFYFYNIYSDSEGELNSRLDGVLAKLGYGDIVVYQSPTWNGREYDQAFIRKCKILNTRIITFIHDVPPLMFPSNYYLMSEYIEMYNQSDLVVVPSEKMKERLIQEGLTVQKIIIQGMWDHVHNYPLKQPSFQKKLSFAGSVERFGHLSNWSYSTPLDI FSESNYENSNPRVSFKGWKTDPELLFALSEGGFGLVWGTNENPADEEDYYRLNISHKVSTYLAAGIPVVVPSYLSNASFIKEKGLGYVVDSLEEANRLVEETTVEVYQQMVENVSKVSYALKEGYFTKKLLTDAIMQLLDI (SEQ ID NO:990)
>orf02543
MKAIVLAGDKNYLTPILTTIKSILYYNQNVKIYILHQDIPSDWLQELKIQVEKLGSVVEGIYIGDAIDSEWKTQAHISPIAYARYLISRLITEDRVVYLDSDIIVNGDLSPLFELSLGDYSLAAVRDVDGNGFNSGMLVIDCQKWREKDVTSMLFDKTVEYMSYLDHTDTDGFNGDQTIFNLVFQNHWLELDKRFNFQVGHDIIAFYSHWDSHFELDEEPLIIHYTTYRKPWTTLMGYCYRDLWWSFHDVTFDQISDHYQGRFAVKRVYDFHDINLFTFTDSQDLLYIDELAQSLLDIAFHIGAYTDMGDILLALDKYPNVYLYPSMVGAVIDEMIEKSDAYLDIHKGSSMDFIVNRYTSAGRPVLTFDVTNKNQLEEIVVPSQSPLEMIKVIKKLKSDKMETKAIVFGANYQYADKVLTTIKSICCHNRGLRFYLINSDFPTEWFYNLNRKLTKLDCEIVNARVNSSHISQYKTNIHYAVFLRYFISDFVEEAKVLYLDCDLVVTRDLSPLFDLELGDYPLAAVKDLGGQIYFGEHIFNSGVMLINNRLWKQEEVRKQLIEMTNELHDKVAQSDQSILNLLFKDRWLALDFKYNCITLHTHFSDYRPESGTYPPIIHYLTERKPWGLYECSIYRDVWWYYNAQDWSDMSQVTPSLTKDQVSQYTGVQYSALVYTFSSDLRNMGYLIENLPDVKFYVAAPVMVADSITDLLAYPNVSVLSDIAGQPPLIDSLVEGCDFLLDINADIEVDGIIERFRQAGKPVFAFESVVHGEQGQFLYDQAHPEEMVLAIEAYCQNGELPVKKFQSYPKVLDIQQSLDYILEHHTSVIRYGDGEMDIMMGHGIPYQDYDETLAEQLRSMIQLESSPELLVCLSDVFEGLERYNSESVNFWKKHLEHYKDAYQQYCTASFYGSTFISRPYMDLKDKTASVAHFEKLKQLWDERDILIVEGENSRSGVGNDLFDNAQSVERIICPSRNAYSKVQVIQEAVEKYADGKLVFLMLGPTAKVLAYHLSQKGIQAIDLGHVDSEYEWFKMGATSKVKFSHKHTAEHNFDQEIQFVEDEIYNKQVVVRI(SEQ ID NO:991)
>orf02545
MTDKASKKAIVLGADSNYMDKVETTIKSVCSHNRDIRFYIFNSDFPTEWFQLMNKRLSVLNSEIINIKITDDTISHFHLPTPHLSSATYLRYFIPNFVFEKKVLYLDSDIVVTSSLTALFDIDLDGYPLGVVPDIPTTDEEFNSGVLLIDTNRWREEDIYRQLFELTIAHHEHVYGDQGIFNILFKDRWKRLDITYNLQVGVDAHRYYMGDYDWYELFEGVPCIIHYTTENKPWKHFRFNRFRDVWWFYYGLNWNDILLRTHVLKETFLELISPISKHVSIFTNTGDIESIGYLLEKLPDVQFHIVAPTYFSPNVIELQRYSNCYIYPCADPKMKQDIIDKTDICLDINYGPAMDQMLQEMVRRGKTIYSFDCTNHFFNGESTVFTVDEIDELIRSVKELKM(SEQ ID NO:992)
>orf02546
LKDLVSVVIPVYNVENYLEECIQSVLNQTYTNLDIVLVNDGSTDASAEICARFAEIDGRVRVFHTENRGAALSKNFGVTQALGEYVLFVDSDDIAEKRMVETLYRQVEETGADIVIGNYFLYDENDGQYKLYVLERDFCIEELSAQELIDRQAGKWHLNASAFIMPVFKLFKKDLLLQVPLLMVGALMMKQRFIDCS (SEQ ID NO:993)
>orf02550
MRGGVDTTQVMTETVEDKVSHSITGLDILKGIAAVGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNGTVVLGTISTSNSASSTSLSASESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDYVASVTNVNLQSYAKRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINASLNIAKSETKVYTGTGKDSFYNIPIYYQLTVINDGSKLTFTYTVTYVNPKTKTLGNISKKMSNGYSIYNTGTSIQTMLTLGSGLGKPSGVKNSITDKNGKQVRPYNTSTMTMWRSGYTWANGAQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAAKTDRIGTNYFNGTRKVVESSTTSQSLSQSKSLSVSASQSASASASTSASASASTSTSASTLASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSTSTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSTSTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASASASTSASTSASASASTSASTSASASASTSASTSASTSASTSASTSASTSASTSASASASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASTSASASTSASASASTSASASESASTSASASASTSASVSTSESTSTSGSTNSRPQETITNSRQELPNTGEKQSVKSGLLGLILGLTGLGLVAKRRKRDDED (SEQ ID NO:994)
>orf02551
LKAFVEAHPDAFLREIAARFDCALPSVWAVLKQIKVILKKTTSFKEQKPEKVSEFLDILDNLKDLPVLYIDETGINRYLYRPYAGAPRGEKVYDKISGRRFERTNEVEQKLNGSFLIRYIDSQIRE (SEQ ID NO:995)
>orf02552
MAYSTDFKQGALDSIKEGHRHVEAAKVFDVGVRTLFTWEKKDVNKGT(SEQ ID NO:996)
>orf02559
MKIKEQTRKLAAGCSKQCFEVVDETDEVSLKHCFEVADRTDEVSNHTYDKVKLTWFEEIFEEYHTKKPCSSR (SEQ ID NO:997)
>orf02561
MKKSDVLDLIKYHYEGRETEFRNQSIAIARNFNKHGDTQIAQYIMGLMSQSDRFMPQIENPSEYLTPAKLDIGPLPLPLSIMNDLKGIINAVNHHIGINKFLFVGSPGTGKTESVKQVARLIGKELLVVDFSHLVDSKLGQTVKNLATLFNEINNLPFKQNYIILFDEIDSIVLDRVNQNDLREMGRVTSAFLKELDRLSPEIVLIATTNLFENLDKAVTRRFDAIIDFDRYTDEDKVEVATIILNELLKQFKNVARDLKLFKKIINSANVIPNPGDLRNSIRTSLAFSDPSDPHDYQKRLLRSLHNGRNLSISKLSKLGFTVREIEILTGISKSSVSRELSED(SEQ ID NO:998)
>orf02597
MRFIVGIFISFSPEIEFLSSLKFLTDFVEICLLWQVMTP (SEQ ID NO:999)
>orf02598
VVEIIYFLIIIIASGLGSISGMGGGIIIKPLMDSFGYHSVSDIAFYSSFSVFIMAIISTTKRFSQSKEIKWRLIFTVSFSSVLGGFLGHLIFQVLLSQLSVRLVSIVQMILLFVMLLVSFVLTDFKKTYQFDKIGFYMICGLLLGLISSFLGIGGGPLNVSLLMVFFSISIKEATMYSLAIIFFSQLSHLATIVVVTGLNQYHLAPVPVIFLASICGGVLGTVVSKVLPENWVRYCFKGMLFFVMGMTLYNLFHIL (SEQ ID NO:1000)
>orf02599
MMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLAERQEWSFVFILFVPEAEREGYPHPAGAPWWLQVSNACWKHPYGENSNLVGLEDHPVVYVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSEYPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCRNPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE (SEQ ID NO:1001)
>orf02602
MVQTKQPNIILIVVDQMRADALSLNSKDKLVSTPTLDMMASVGYNFENAYSPVPSCVPARAALLTGLDQDKSGRVGYQDEVPWNFTNTLPKVFKDMGYQTECIGKMHVFPSRQRLGFDHVLLHDGYLHVDRKYDKAYGSQFDYASDYLAFLKGKVGYDVDLIDDGMDCNSWEARPWDKDEKLHPTNWVVSESISFLQRRDPTVPFFLKMSFEKPHAPLNPPKYYFDIYMERLPQFLDLHIGNWEVLEKQIPSIYALRGKLKEDDQRRMVAAYFGLITHIDHQISRFLTALKEFRHDKDTIIWFVSDHGDQLGEHYLFRKGYPYQGSIHIPSFIYDPAGLIAGNRGTIKQLVKIQDIFPSLVDLAGGTTTDELDGRSVKNLLFGQYEGWRTEFHGEHALGKDSSQYILTDQWKFIWFPVLNHYQLFDMKKDPHEMNDLYPSEKYQPIVRQMKKKLVDFLRYREEGFVVDEELVPVELSKITPTLTKTGDSQS (SEQ ID NO:1002)
>orf02604
MNTMLDKMQEKLSPIAMKVENQKFLVALRDSFVGTMPVIMTGSIALLLNAFLVDLPQQFHLESITKTFQWLVDINNLVFKGSIPIVSLLFIYCLGVNIAKIYKVDTVSAGLVSLASFVISIGSTVTKSFPLANVGDVKLDQILQGIDNLAFDGKNLMVTIGNVIPGNHINARGYFTAMMIGFLASIIFCKVMKKNWVIKLPDSVPPAIAKPFTSIIPGFMAMYIVAILTYVFHLLSNDLLIDWVYKVLQTPLLGLSQSFFAVILMIFLNKLFWFFGLHGGNVLAPIMEGLFGVAMLANLDAFQKGEPIPYIWTSGSFGAFVWFGGLGLVLAILIFSRNSHYRKVAKLGLAPVLFNIGEPVNYGLPVVLNPLLFIPFVLSPVFMATVAYWATSWGLVSPVTQNVTWVMPPILYGFFSTAFDWRAIILSVVCLIISVLTYFPFVKMADKTELS (SEQ ID NO:1003)
>orf02605
MNESNLESVMGLIMYGGEAKSNAMEAIQAAKKGDFSKANRRLADANAALLQAHKAQTEMLTREAQGEKTSISLLMVHAQDHLMTSLTFVDLAKEVVEVYERFEKN(SEQ ID NO:1004)
>orf02606
MAKVTIMLACAAGMSTSLLVTKMQKAAEDKGLDAEIFAVPAPEAEEIVATKEVNVLLLGPQVRYLLGDFQEKLKDRQIPVAVIPMTDYGMMNGSKVLDLAESLLD(SEQ ID NO:1005)
>orf02607
MKRLISANPSEILQMNAEELKQSILASEGRVVLSENVVTRETFVGDITNSEIARAFGADMILLNCVDVFEPKIYALDSSGDDVIHRLHQLVACPIGVNLEPIDPSAKMLEETQEIVAGRVASVETLNRIEELGFDFVCLTGNPGTGVSNREIIKAVQTAKENFSGLIIAGKMHGAGVNEPVAELSVAEQLLEAGADVILVPAVGTVPAFHDQELREVVDLVHSKGGLVLSAIGTSQETSDTDTIKEIALRNKICGVDIQHIGDAGYGGLATVDNIYALSKAIRGVRHTVSRLARSVNR(SEQ ID NO:1006)
>orf02608
MEKLLQEKLLPVAARLGNNKALVSIRDGITLTIPLLLIGSLLMVIASFPIPGWEKYLGDIGVADYLWKGVDSSFGLLGLVASFGIAYFMARQYKVDGIPAGIVSLSSFITVTPFIRGEAGAGMPTAFMASKGLFVAMILGLINGYIYQWFINHNIQIKMPDGVPPAVSKSFSAIIPGAVTIVGWLIVYATLDKLSLPNLHEIAQVALGGPLGLLGNNVIGLLILIFLNSSFWFVGLHGGNVVNAVMKPLWLANLDANKVAYQTGETLPNIFTSVFMDNFVFIGGGGATIGLVLALGYLAHKKKASKQLKTLAPITVIPGLFNINEPAMFGVPIVLNILLLVPFILAPMFNLLVAWGAMASGLVPLTYTDPGWTMPPVISGLLATGSISGSLLQIVLIVLDVLLYLPFVIAIEKRFKLLED(SEQ ID NO:1007)
>orf02609
MTLSKKQLQLRAKILETVYTLGPISRIEIATKTGITPATTSSITNDLIKENILLELGEDEHDTSVGRKKILLDIQAKRFYYIGCELSEKHFTFALGDNLGNILKEEKEIVTKQLIQEKGNQLINQTLKQFLNNCSDYEIEAIGIALPGRYLDDYKITTNNPLWQHIDLEMIQSHFDKPLFFSNNVNCMAIGKRLFSRQQNDPNFAYFHFARGMHCSYIYDGNIYGKGNLMIGEIGHTVVSSEGEECSCGRKGCLQTFAGESWLIKKSKILYHQSPYSLLPSLVKNADDIDIQVILTAYQLGDTGIITLIHQALLYLSQTILNISMMIDSQKIYLHSPLLTNQHIIQKLYSEMNYKPKLLYNRLPEVIIEPYNDFTAAHSAIALCLYHTILHS (SEQ ID NO:1008)
>orf02628
MPFKENLICQHRNHHCSVFFISLGLLHNIHIEIDISQTRASFLDLSDYLQAVLMILQKFCQAIGLAQRLDLLQLHLLHLTRLLL (SEQ ID NO:1009)
>orf02636
MYLLLLVVKDHIALIDKEMHVWRPNCILRDLTNFFIKRNHIVTHKTNGSTTKR(SEQ ID NO:1010)
>orf02637
VLTLMNHFIKEIQGISINHLTILIKNSIFKLNLKNRIIG (SEQ ID NO:1011)
>orf02641
MKIKEQTRKLAAGCSKPCFEVVDRTDEVSSKYCFEVVDRTDEVSSKHCFEVADRTDEVSNHTYGKATLTRFEEIFEEYKGVPR (SEQ ID NO:1012)
>orf02655
VCQRMDARTCKTTIIAVHNVLTALQQTWIAVQLYQTK (SEQ ID NO:1013)
>orf02656
LHLGKSILSLPVKGKDLEFLVHLFVINHWIGSPSRTSTFCRCKVLNGME(SEQ ID NO:1014)
>orf02657
LEQTVIIANNPCELYWDNHLSFLSDSLLKQVIVHLKRICLDIHHDRGCSHVRNDTT(SEQ ID NO:1015)
>orf02673
MLEEGTKDQLAELTYPFGRGVNLSFGIKDVPKLYQKVMEANYPIYRLLTKRKFRVSDPYIYPHKFAVLDPDGYFLRFSE (SEQ ID NO:1016)
>orf02689
MIACRHDICKSQKGLEHPFCIIRRLTRDFNQRPVCIVEANIFCLKITPQIITNMIVARTVKSSKTGITLTTSMCKRDNHKITWFHRRNGFPSFFNNPNRFVSTIFMSSFRFWITVPP(SEQ ID NO:1017)
>orf02705
MNMNKDQIAILNGADNLNLTLWITLKEICKEGCKSFFPVRNTCRMLDIGIPYRLGLSLSNSSVLNGMDV (SEQ ID NO:1018)
>orf02725
MHKLRIFVNQLCRRFGIILGPFLVLGFQVLTQELELAIFFDLREEVLLQVIPQVCHFCYLRKEFTTLNQHELTSHDHVLTRHFQTHGLQG (SEQ ID NO:1019)
>orf02732
MQVTIEADSGLFLLSPVVQLLKVEIDVKQVTMSLDNLGRTKLSHQTFWVAGVEVHVPFNNADALPKWRIRT (SEQ ID NO:1020)
>orf02734
MKNGIDFAHIAITDFFHNQAIAMGIAHYNDGLLCHDGNTSKSFLTAKAR(SEQ ID NO:1021)
>orf02758
MSIVKSHSFSISLGIFNSFWNNIHTSECFYFLCKGKSNRSNSTISVNQMVFFINIQRFYCFAIEDFCLLRI (SEQ ID NO:1022)
>orf02759
LNTLLPPDNLCLFTIYLTGFSCICINSYCHNFWEIFNQLFYQLS(SEQ ID NO:1023)
>orf02778
MYNKVIMIGRLTSTPELHKTNNDKSVARATIAVNRRYKDQNGEREADFVNMVLWGRLAETLASYATKGSLISVDGELRTRRFEKNGQMNYVTEVLVTGFQLLESRAQRAMRENNAGLKGQIWXXIHSLIN(SEQ ID NO:1024)
>orf02782
MQFTRTTHHPKTLFTTKFAWENEIPFWHHGTRKSHNGFQPNTRIRCSCNDLHYLVTCDCNLADVEVVTVWMSYHLDNFTNNKLRFLIINNFFCKTFRL (SEQ ID NO:1025)
>orf02784
LTALCNFKQARNLKNVPSYCFPIFFEENTGYLAFAFHIQFGISAIFHDDHKDITDMFFVIRKNHCFLFLLFQSC (SEQ ID NO:1026)
>orf02788
MKIKDQTRKLAAGCSKHCFEVVDRTDEVSSKHCFEVVDEADVV (SEQ ID NO:1027)
>orf02789
MYYSVDDVVSNAFKKRMILDSFFAFNCSGTMKVSTWVYDKGEWYYVSSSGSMIANDWVKDNGK(SEQ ID NO:1028)
>orf02793
LTWILTKIARKDSLQLFELEANLISFLLVMSVDLAPFCFKEENF (SEQ ID NO:1029)
>orf02803
MEDIDEDELLIFEKVLGQLQANIKGIGGENKEISQKN (SEQ ID NO:1030)
>orf02804
LHLRTCFVRQTNKLSPLINRTRLQFHQTILHYTLNQITSNRLGNIEFLLDIFNQDQVLVFLAIIQKTHNLTLRPTHKFNAATFGFLLHHQVNLMTKTLKD (SEQ ID NO:1031)
>orf02821
MLEIWKYRPFVSEFWNDFKNNHDKQFVDPISLYLTLKDDDDPRIEEESEALENMILQYLGEDDAS (SEQ ID NO:1032)
>orf02823
MQDLLFHFYSYRLNLTFFFFELLICLLNSEFDLSKFIFVYFDEYFHEDSLKMNLHQFSFSF (SEQ ID NO:1033)
>orf02829
MAFNQFNRCIGLSIPTAPNVPGTIINRSYLHDATVPNNVREKT (SEQ ID NO:1034)
>orf02845
LTDFHDFKFIFFENLFKSRQLYLQSQNSVLSNLWLAT (SEQ ID NO:1035)
>orf02850
MKKLFILISNLLASLFFVWVFTIWTDTYVSHYYPNVVVHDSSPETTFQHVATRLEKLAEETDSFIAIQHQDPNSEGTTVFSYTTFGDGKLPDGLQEKNLEDAQSSSVETNYFVFDGHLDIHLLREELSQLGLTNMHLTIPSKLSTLMAIFSNGFQLISLLIFILTFVALTLISQISQLRSSGIRLISGEKRWSIFLRPVGEDLKAIAVGFSLAGVLAILMQKILSLPTQSLMTIGAGLLSYNLILLSISLFFAQLFAVGIKKIHLMQIIKGQVPVRGIISLILIGQLLAIIIVTLGIGSSLKYSQAWQQHRIGQEIWSQERQLITLSISREGTSPGFDEQAQRKLRTWYQLMDLAVSEQKAFLSRHQLIDRTLQNGMASSKNLITSTEWHDYNPNGNVLIVTPQYLERQNIPVDTTIEQKMNHLNVGEFVLLLPEHLRSEEEHYKSVFEDDLTSRMSSQDERQQMTATVGYLESGQDRFVYNTTPISYQQFLKDPIIIVITPQSTGPQSILFWIDAVQNYVLFNQLSDAQELIQRQGIENWVSEMQTGYHNYITLLDNIQRERWVMLAGAVLGIATSILLFNTMNRLYFEEFRRAIFIKRIAGLRFLEIHRTYLFAQLGVFLLGFVASVFLQVEIGVAFLVLLLFTGLSLLQLHVQMQKENKMSMLVLKGG (SEQ ID NO:1036)
>orf02859
VLKWCILRINHHISRKVDNFLEGTRAHIKGQAHTAWNPLEVPDVRYRSFQFDMSHTLTTNFRTRYFNPTAVTNNSSVTNAFVLTTSTFPVFCRTKDHFIKESFTFWFQGTIINCFRFFDFSIRP (SEQ ID NO:1037)
>orf02869
MPWKELCHKLAPKVFKVIRIYSRENKKSPSNWAFCSFET (SEQ ID NO:1038)
>orf02877
VDSLFLSLGEESNQEINLQESFSSTDCNPTLISPETTVAQGLCQDIIYRPFT(SEQ ID NO:1039)
>orf02880
VNPKSLGSFFLQDSKGFKELVLGHAKLSLPRIVHNVCPQFKNASRIITTRDDFWNACYSLQMFNIFKGIQVNGRTQFTCIGVFLVWRVVGREHNLRTQKVQFMAHQKLYITRAVHTTTFFLENFQNSWSWSSLNCKIFLKALVPRKSLVDGSCLLTNPLLIIQVKGSRELGNNRF(SEQ ID NO:1040)
>orf02884
MDNLCFHNAWTDWASILKQAVVTEDDMTKQNDFFLGIIDAEFHNCLGNFAINESDMSKKITSHCVLCLVWPRQLDDLS (SEQ ID NO:1041)
>orf02885
MQHNPRIEQALIELRINFANSVCQTHHGRRMIGQARFKGMVVGLGSWIGVEFLIILGVEISDNPLPDRIFNFENHLRHVVTNFLDINW (SEQ ID NO:1042)
>orf02886
LIDLRGIVINFSASFHVDNLTCGKGLNVMRLGIPELPINLATIILEGKG(SEQ ID NO:1043)
>orf02899
MDALVLQKNQETIQQIAVKIRFLDGHDYYSLIDIDNRRTNQTVFPFVNF (SEQ ID NO:1044)
>orf02900
MAFFTEIPTRACLINLAITLHIVETCQGFNDLSLHLRVLAL (SEQ ID NO:1045)
>orf02904
MLLPLPFNTSKIKQIAMHSDLNQKEMIGHIFHDEDIF (SEQ ID NO:1046)
>orf02909
MKQTVKKLALVASIAATLGGGVSVASAAVQYPEGGVWTYGSGNGGAYSNYYHPSKYHSSTVVSRKTGSSDKGYAGAGGTSRAWIRTSWGEKVAFYYNV (SEQ ID NO:1047)
>orf02919
MNQENLFLLQEIKDKLLIIIDTVHIAVDFWEDIEPRLGFNGRQTWNILNGIIDEISLLVDSSTRKKQFID (SEQ ID NO:1048)
>orf02920
LLGQNVRAKAHIGQHIEPFDIALNMSLRARQDHPTHTETCYAVGF (SEQ ID NO:1049)
>orf02921
MSVHYHAVIDFIRKDNQIVLTGNLNNLQQEFLRIKGSSWVIWIDKDDCLGIGSDF(SEQ ID NO:1050)
>orf02923
VTYNRIRQTSHPLCNQKCQQQGDAYNPDSLVNKNSFDITILITNCLHDTYFLGTLHDIDVNNDTNHNRCYHNSL (SEQ ID NO:1051)
>orf02924
LIRQHETVAVLHVIFIIDYTYNLRLKLSNLTSFGSTFYNAG (SEQ ID NO:1052)
>orf02954
LHTSFRSSVGHSHTWHQDIVRPILFSRFNDSIVILWQNCPTFN (SEQ ID NO:1053)
>orf02968
LRLAKLVPSLKIALKSFSLFTRKFFFKTQLHLLNYYYYTIFFKKANHSKVLDNFIRKRSWKNNFPNSLY (SEQ ID NO:1054)
>orf02973
LKAEQQAIKNIQFLEQDLPKNPLEKEFDCLAVSRVLHHMPDLDADLSLFHQHLKEDGKLIIADFTKTEANHHGFDLAELENKLIEHGFSSVHSQILYSAEDLFQGNHSEFFLTVSQKSLA(SEQ ID NO:1055)
>orf02974
MKHDFNHKAETFDSPKNIFLANLVCQAVEKQIDILSDKVILDFGGGTGLLALPLAKQAKSVTLVDISEKMLE (SEQ ID NO:1056)
>orf02978
MVDLQSFFTRKYLNLNSVDAYLILPRLQGHLSYPQDFFLLQDFCFLLPIFLNLSQKEGRNAGKDS (SEQ ID NO:1057)
>orf02991
VTENPAPFVFTVSINSFFTVVAFTTGTDARNQDLVTFFEVGNSFPNFFNNPNPFVAKNGTTLASRNIPFDNMEVCSTNSGFYNTHNSICWLANNWFVYINKRSKSWFNIRXXIPFIN(SEQ ID NO:1058)
>orf02993
MRIRNSPFDHILQTIFEFEDRTCQVTCRFEACSSICNDNWEFSQHIISVFQSPSCHTVCDKSDVFCSFLFDKNFASLWIYVVTITDQLCIGMWQLVHGSNHTQFTVSQPTHSIVSMHPNTRSSIDCFFGFIKSRV (SEQ ID NO:1059)
>orf02994
MSKSNRHTFARNCTNKVFHPITFWCKGNFIKQAICRFLPRMKLLNTRVSHISWILCPLKSFCEIWTFIINPTNLSTCCFFIMVSKIFSDCKQLLISGC (SEQ ID NO:1060)
>orf02996
MQCTFNVVVHHIYTCISMNMSIHKSWGNAITCIVNHLSPFRNLLYMFPKLAVHKFQVTTSTNSVWVEKLIRFNIVRHNVNLLKRLILQFIMSITL (SEQ ID NO:1061)
>orf03011
MTLHQTFRFQNLEMPCQSSLINFQTLLNRHLVTRRMLQQKQ (SEQ ID NO:1062)
>orf03023
LLSSFQDAVKFFAVVFFRKVQPSQEVAPNASSFTDQFMGG (SEQ ID NO:1063)
>orf03025
MVVTTFNDFSIFKDNNLICIENGFQAVGNDETSSTCYNHLHGMLNLAFRHRIYV(SEQ ID NO:1064)
>orf03031
MAEFNSVITTVTGIGDRLGAVILAEIRNIHAFDNPAQLQAFAGLDSSIYQSDQIDLAGRMVKRSSPHLR (SEQ ID NO:1065)
>orf03041
MQQYVDIKKQYPDAFLLFRMGDFYELFYEDAVNAAQILEISLTSRNKNADNPIPMAGVPYHSAQQYIDVLIEQGYKVAIAEQMEDPKQAVGVVKREVVQVITPGTVVDSSKPDSQNNFLVSIDREGNQFGLAYMDLVTGDFYVTGLLDFTLVCGEIRNLKAREVVLGYDLSEEEEQILSRQMNLVLSYEKESFEDLHLLDLRLATVEQTASSKLLQYVHRTQMRELNHLKPVIRYEIKDFLQMDYATKASLDLVENARSGKKQGSLFWLLDETKTAMGMRLLRSWIHRPLIDKERIVQRQEVVQVFLDHFFERSDLTDSLKGVYDIERLASRVSFGKTNPKDLLQLATTLSSVPRIRAILEGMEQPTLAYLIAQLDAIPELESLISAAIAPEAPHVITDGGIIRTGFDETLDKYRCVLREGTSWIAEIEAKERENSGISTLKIDYNKKDGYYFHVTNSQLGNVPAHFFRKATLKNSERFGTEELARIEGDMLEAREKSANLEYEIFMRIREEVGKYIQRLQALAQGIATVDVLQSLAVVAETQHLIRPEFGDDSQIDIRKGRHAVVEKVMGAQTYIPNTIQMAEDTSIQLVTGPNMSGKSTYMRQLAMTAVMAQLGSYVPAESAHLPIFDAIFTRIGAADDLVSGQSTFMVEMMEANNAISHATKNSLILFDELGRGTATYDGMALAQSIIEYIHEHIGAKTLFATHYHELTSLESSLQHLVNVHVATLEQDGQVTFLHKIEPGPADKSYGIHVAKIAGLPADLLARADKILTQLENQGTESPPPMRQTSAVTEQISLFDRAEEHPILAELAKLDVYNMTPMQVMNVLVELKQKL(SEQ ID NO:1066)
>orf03051
LTNLSSVDSEELFQFYRERGNAENFIKERKAGFFGDKTDSSTMIKNEVRMMMGCLAYNLYLFLKQLAGDEVKALTIKRFRRLFLHIAGKYVSTARRHILKFSSLYAYSKQFQALFDTICQINLILPVPYRARGQGKTCLTE (SEQ ID NO:1067)
>orf03061
LFDDRQAINICPPTNGSLRLTSLQVDQNPCPPSTNLNKILARSQFLNHIQQISLSLELLQANLWNLV (SEQ ID NO:1068)
>orf03092
MKIKVQTRKLAAGCSKHCFEVVDRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSNIYTVRRR (SEQ ID NO:1069)
>orf03093
MDFFNYLLWMICHNHGLHTLLLSKDCVCHTARDKDGNHRIKSVFPTKGQTCYQHDSSIYQERNTTDILTRFLTNSQADDVRPTTGDIVSKSKTNPQTHNNTPKKGIDNGILCQGCHRDKLDKEGTHRYRDKGKDGELMANLIPS (SEQ ID NO:1070)
>orf03104
MIKQIKAHLNKSIQSIIGQKVEFVKQDEQAFTRKRRLSLETMIRTILGMGGKSLSKELLDARLTVSNSALVQRRYQIKPEAFYALFKEFTAPIPLNTDFPIFAADGSDICIPRNPMDTETSIQTQTDVKSYNLIHINALYDLTTGVYRDVSIQDKHAQHERLALIQMMEASPFRESSCYHG (SEQ ID NO:1071)
>orf03108
MAHFQEKGWLYIIRIRDGKQSMPSSFNLPNTECFDQKVSLKLSRKQTNQLKKLYRDFPNDYHFIPHNSIFDFLPETSRKQDPVTLYELPFRMVRLKVEEGKYETLVTNTDYSVQELKNLYASRWGIETSFRDLKYSIGLVNFHAKKKEGILQEIFARFTNFNFCRWVTSQVAIDSSHKKQRYKVCFSDAAYACRLFFNGSLSSHQLKNYLKKQLSIIRPNRKYSRKIKAQSVVDFICRVT(SEQ ID NO:1072)
>orf03110
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWVSRVREKVQYAP (SEQ ID NO:1073)
>orf03139
LSIQVETLELRVIFKEIKEIVKQFHQLHTMAFKRQVPLTIPVTM(SEQ ID NO:1074)
>orf03154
MGALGYYEGFVPYVSNQYKNQAEEEGKPLSDKYIFEKILGKTYAAFKKDQINERVEKLGKLKPITINYNGKSEVIDSKEKLQELMNKAVKDEVAQIS (SEQ ID NO:1075)
>orf03155
MMGDGMKEFQFERKQRFSLRKYAIGACSVLLGTSLFFAGMGAQPVQDTETSSALISSHYLDEQDLSEKLKSELQWFELENKLLNLWEH (SEQ ID NO:1076)
>orf03159
VNITKTSIIKAHTTKEDGIDHTFTRFNIMSIFYSTRKIFLDKLNSTNRQFLGYIISTRCYQSFNSVSQSIHTSSSSQAFRFGKHEFRVINRDKSKAILVNHYHLNLAFFISNHIVNSDFCRSSCRCIDSHNWQAFFSRLMKPFIILWFSTICSHDRNTTSCILWRTPAKTDDKVTAMFLQSSYPICDIFTSRVWLYIAKDDIFDSFCIQWF (SEQ ID NO:1077)
>orf03175
MSMDNCIDIITSLILNQMHIPFARWQAFSLYNISINIHHYNIGFFDFKEINTRRGNCHQLFFTIENAEIPTCSFRQICFY (SEQ ID NO:1078)
>orf03182
MNIAIRIILNFFRVMGNHQNSLAMMMGAVVHEFVKFIFTSCIHPRCRLV(SEQ ID NO:1079)
>orf03183
MLLIMSIQTTEPAFSRIATRLDKFIDRTWKTSIKTGNLLRKIGYSQFLTLRICL(SEQ ID NO:1080)
>orf03184
LQNSKTSLDERRLSRSIFPSQGNKFPTINTIIDMFKNRLLIIIEGQILYRNISHYLISPTKAVKNR (SEQ ID NO:1081)
>orf03197
MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAILVGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQLIGWGLGGLLFATIGLLPTTFIILVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESYWGYSNTAYSIGIIISGLIAFRLSEKFLAAKWESILFPLVAMAIVTLTILYFPNAQMFLLFSALVGMLSQLKEVPESVFLQETVEENHLVNVYSVLEVISTLAFSVFVLLMSYITESFGISISFWLSAICLMIEAILIYIRRDYFK (SEQ ID NO:1082)
>orf03198
MSKLLDKILSRENMLEAYNQVKSNKGSAGIDGMTIEEMDNYLRQNWRLTKELIKQRKYKPQPVLKVEIPKPDGGIRQLGIPTVMDRMIQQAIVQVMSPICEPHFSDTSYDFRPNRSCEKAIMKLLEYLNDGYEWIVDIDLEKFFDTVPQDRLMSLVHNIIEDGDTESLIRKYLHSGVIINGQRYKTLVGTPQGGNLSPLLSNIMLNELDKELEKRGLRFVRYADDCVITVGSEAASKRVMYSVSRFIEKRLGLKVNMTKRVEISRFWVLEIIRWLEKPSTSR (SEQ ID NO:1083)
>orf03202
MFLRCATFKLADSRLNIFTCFFFGEIRFNSRNQVVKAFITDGTVISTIIVRGTVPCNQWTKTCPAAFDIINGDVGFWKAVVDNAK (SEQ ID NO:1084)
>orf03203
MLQITCVVCISCTKVSLVFTWENKDHTTVTQTCVKVNWL (SEQ ID NO:1085)
>orf03204
LRSLIRQITYFITPRTCCINNQTGLDFKHLVCQEITSYNTCNLATFVKEEAFCLHVVGNEGTVLVGTFDVFNHETRIVVTEVKIHSTSYQAFLLQVWLAFQDLILAQNLVRSWCVAHTC(SEQ ID NO:1086)
>orf03205
LHFNQTSLKTASCRLQGYTSSCDSSTDNQEVQGAFLHFFN (SEQ ID NO:1087)
>orf03216
LKIDHTQLSPSNLLNTFVTPFIFYLKHSINLTNAEIICFSFYFHADFLVHYPENQ(SEQ ID NO:1088)
>orf03224
LKKVQHTQNVDFNKKLSRIKTKYLYGLKEKSEAELTLKTKETKEELTAAFEQFKKDTLKSGKKVAEAEKKAKAQKEEDRRNYPTNTYKTIELEIAEAEVGVAKAELELEFAQAQVQIPQDTEKINAAKSKVEAAKSNVKKLEKIKSDIEKTYLYKLDNSTKETPKSRVRRNSPQVGDSRELKETIDKAKETLSTYMVTRLTKLDPSVFWFADLLMDAKKVVEEYKTKLEDASDKKSVEDLRKEAEGKIESLIVTHQNREKENQPAPQPGGQAGGSMVVPPVTQTPPSTSQSPGQKATEAEKKKLQDLIRQFQEALNKLDDETKTVPDGAKLTGEAGKAYNETRTYAKEVVDKSKKLLSQTAVTMDELAMQLTKLNDAMSKLKEAKAKLVPEVKPQPENPEPKPQPEGEKPSVPDINQEKEKAKLAIATYMSKILDDIKKHHLKKEKHHQIVALIKDLDKLKKQALSEIDNVNTKVEIENTVHKVFADMDTVVTKFQKGLIQNTPQVPEAPKSPEVPKVSDTPKAPDTPQVPEAPKSPEVPKVPEAPKAPDTPQVPEAPKSPEVPKVPDTPKAPDTPQVPEAPKSPEVPKVPDTPKAPDTPQVPEAPKAPDTPQIPEAPAPETPKTGWKQENGMWYFYNTDGSMATGWLEYNGSWYYLNANGAMATGWLEYNGSWYYLNTNGAMETGWLEYNGSWYYLNTNGAMETGWLEYNGSWYYLNTNGAMETGWLEYNGSWYYLNTNGAMETGWLEYNGSWYYLNTNGAMETGWLEYNGSWYYLNTNGAMETGWLEYNGSWYYLNANGSMATGWLKDGDTWYYLEASGAMKESQWFKVSDKWYYVNGSGALAVNTTVGGYRVNANGKWVN (SEQ ID NO:1089)
>orf03230
MDREILKFFQDLLSILSHNDMITLFCQKCCNSFSNHFLVICN (SEQ ID NO:1090)
>orf03232
MFITLRRICLRACVVEKEQSYLKFLFFQKRPVSFLHVKSVLAGI(SEQ ID NO:1091)
>orf03233
MVKTTNRLEAIGFSFILFENLFKPRQLYLQPQTSVLSNLRLAA(SEQ ID NO:1092)
>orf03239
MTRKLNPSYTNVASATTLTFNQVASTFRKACLDHVVNLTRNNLKGICQLTPLQLHDTRLI(SEQ ID NO:1093)
>orf03270
MRTFFLYSSAFKKHSSPSPINDGLYHLLLQSLYNILELIHDIFQSLKGFILKSTFTNLFPHLFNGVHLWCVWRNKCKANISRNL (SEQ ID NO:1094)
>orf03277
LVSVFYSLLQVDNVDSVTFSKDVLSHLRIPATSLVTKVYTSLKKLFH(SEQ ID NO:1095)
>orf03286
LIVWILKNHTDLTTYIPNIFLSQTLAINYNLSRFCFQ (SEQ ID NO:1096)
>orf03287
MPYNRKPFSTFHVKRNILHIVVVLIFFITKRKIFYINY (SEQ ID NO:1097)
>orf03291
MFKKMSNSSRILFYISVNFCDKRIYRTKLYSDTPVNLFKFLFRQKSNCQSIGQTSSINLFFYSWIVFFFKNNLCHSIPSIK (SEQ ID NO:1098)
>orf03304
LADGSGKLAEGGTKLTSGLEDLQTGLASLGQGLGNASDQLKSVSTESKNAEILSNPLNLSKTDNDQVPVNGIAIAPYMISVALFFAAISTNMIFAKLPSGRHPESRWAWLKS(SEQ ID NO:1099)
>orf03310
MKNTVKLEQFVALKEKDLQKIKGGEMRLSKFFRDFILQRKK (SEQ ID NO:1100)
>orf03330
LVEQLTFNQWVTGSSPVRVIYAGLAELADAPDLGSGA (SEQ ID NO:1101)
>orf03344
MKIKEQTRKLAAGCSKHCFEVVDETDEVSNHTYGKVKLTWFEEIFE (SEQ ID NO:1102)
>orf03352
LIDVLFINSFIGRICFYCYRRIHATCLFLQLFSIVILNVAHTLKHSIFIVITFISRCRNFIIVRILLENQFSRNQGIDNRVGQSRY (SEQ ID NO:1103)
>orf03353
MVNVNQVSIEVKNTFKNWNFTSSIELTTFSKFSQSPTMT (SEQ ID NO:1104)
>orf03364
MGFSMKLIHDLDMHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:1105)
>orf03368
LKSVGSRVEDKRYQETIVATFSDILKRSREMQDQNIKXXFIH (SEQ ID NO:1106)
>orf03372
MLTIEPTKAPLVNVCDTANTARIDDEPYSPGDFXXHSIH (SEQ ID NO:1107)
>orf03373
MRWNIGCHPNRDTSCSINQKVWKTRWQDQGFPFIGIIVINEINCIFVDITKHFQSNLAHTCLGITLSGSTISIHGTKIPMTIYKHVTVAPPLSHTDHGFINRGIPVWVIFTHDIPCNTSRFFMGFVWGHTQFIHSVKNATVNRF (SEQ ID NO:1108)
>orf03380
MTDFNTFLQLSEGWSLFRSDFLLCIKHFLNSFSSSKGQLKASPTRCNLDNRLVVLL(SEQ ID NO:1109)
>orf03390
MGGNPPMKKYSIVDKIVLSTKIKRIIIFTVFRENWEPYMKKYTEVFQSQFPNLNIDYLLLDTEQIDLDSYLDADIIIIGGGNTEKYIATYVNQEFKNYIDHMLNKGAKVIGFSAGALLLGEKVYVSPNDNSDHQIKIKDGLGLFSQFLISVYYDSWNDKANKDRAEELVNVPIIPLNDHSCLVLDKLGNIIEKID (SEQ ID NO:1110)
>orf03393
MYGGEAKSNAMEAIQAAKKGDFSKANRRLADANAALLQAHKAQTEMLTREAQGEKTSISLLMVHAQDHLMTSLTFVDLAKEVVEVYERFEKN (SEQ ID NO:1111)
>orf03396
MLARSKNCFMKSLSIFLLIFYFFDSYQISKKRRSLIGL (SEQ ID NO:1112)
>orf03399
VTAHRIFGTSSIHSKLIGLAMLGITAMKIICHKLNRNHINIFRRLGIQGKTEFLLIHLIRQVKMNHLSQGMNPTICPTSTVNSNGLPFI (SEQ ID NO:1113)
>orf03402
MLKNGIISWKDFKSFFCQGCQTSHCYKPMQAVQGIGSQIS (SEQ ID NO:1114)
>orf03403
MRFLADQDRIQHHRYSWALFDKVQGLLSHADSREKTNLNSPKFHITQAI(SEQ ID NO:1115)
>orf03405
MSYGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVADSHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDRIEQFFAHIDALI(SEQ ID NO:1116)
>orf03424
LAQISILHFDFLSIDKHSHTVFNTLRKSLQTTLALSATSKQCFEQLAASFLVCSLIFIEYKV (SEQ ID NO:1117)
>orf03430
MGFKVSHFKIPSSHLSINVLRTVENFTEIGQGLLHISP (SEQ ID NO:1118)
>orf03431
VGFFDFGLTNSCRQVRQFTQTVQDFLVCCHQGIVKEGQGYAGICFKFHPSLGNIGKFVIAIVRRLRHKSIVANMAHLNVDLFQFRKGLLEILKSVKIALVITAKLVDVFASFLDCTQEILTVLV (SEQ ID NO:1119)
>orf03439
MNITYIVGNGLDLQYGLKTRYRDFYEFQNKVYISRTENEEKYSNFIYESLFSDKVNDYENWSDFELSIGKLTKDNDLISSSIEIKEKFIDDFSEVVDDLREYLRIQQEKNLEKGNAIDFISTLDDMRTSLPVINQPAIDKKYNENPHQDDIVNIVTLNYTHVIDKLYNGSAKSFRNQLRANLYNFYIEPPIHAHGTVDVCTVLGVSDEIQISNSFDEEQKESLIKNLVLKNYRENMDVKNSDIIKNSDIIILYGVSLGETDGYIWNQIAEQSIRSSVPVIIYHYVPHFDAGNPTRVKRLYRNVEDKFIQNSGIDLELEKKLRDNLIVVIGKTIFNLMER(SEQ ID NO:1120)
>orf03440
VGAKFNDEKTKHIVTHYISRDALNKTITVLSKIIEVFEEHFDRAITCEMFSDSSTFASINFSEYGISKSKFQQYLRDSCFIENFGVEHTTVSDIQNSIVTFYDVHTDIFRLLNKLNIDISEANIMNQTTVLLDEKNIELLLSKAPYLVSMIVEDFSKLSVDDFSLDNNDLKINLPSPMNEPVVGVIDTLFDKRVYFNEWVEYHDFVSPDISKDSQDYKHGTAVTSLIVDGANLNPNLDDGCGNFRVRHFGVSLQSGFNSFTIIKQIKEIVSQNADIKVWNLSLGSNDEIRENFISAEGALLDEIQFENDVIFIIAGTNASVINGKRKRIGAPADSLNSIIVNSVDFNNQSVSYSREGIVLSFFVKPDVSYYGGGNGDFINVCEPLGLGRVAGTSFAAPFIARKMAYLIHIMGLSREEAKALLIDAAIPWNDKKTFTDLSLIGNGIVPIKMDDILSTPDDEIKFIVSDISRAYDTYNYDFPVPISSESYPYVAKATMCYFPNCSRKQGVDYTNTEMQLTFGRLKSDGIKSINKDNQHAEDTPGYVRENAARNIFRKWDNVKHIGESFTSRKRAKAILNPSNPQWGMSIKTIERLKSGDGQGVRFGVVVTLKELNGVNRIEDFIQQAELRGWLVNRLQVEAQVDLFNSLNEEIEFE(SEQ ID NO:1121)
>orf03442
MFVADIMISDYSSAPIDFLLLNRVVFLYLPDFKEYQSDKNPFFEVFKVSKTKGIALDPFDEIIGRFQFGVRIV (SEQ ID NO:1122)
>orf03450
MGFSMKLIHDLNTHTTHSTAKMLYNVKAIKNDFSIRE (SEQ ID NO:1124)
>orf03453
MEQLHFITKLLDIKDTNTQIIDVVNRDSHKEIIAKLDYDAPSCPECGSQMKKYDFQKPSKIPYLETTGMPTRILLRKRRFKCYHCSKMMVAETPLVKKNHQIPRIINQKIAQKLIEKISMTDIAHQLSISTSTVIRKLNDFHFECNFRNLPKIMSWDVETVRGVTVSIGRWR(SEQ ID NO:1125)
>orf03462
LDPWDGNSQKPRFQGLWKFIQRDSRKWRERRFYGPTFGKHLTNKKVFDKVFELFTRPGNIIIIFINFCGFTSEIRNRGKFFGLIEDNLKQVHPIFQTVFKTFLKDKEKIINALQLHYSNAKPEATNNLIKLIKRNAFGFRNFENFKKRIFIALNIKKERTKFVLSRA(SEQ ID NO:1126)
>orf03466
MVLYFWKVFQRVPNKLWKNMGENFQVRSLQDKIIQNLTNKGFSYFDAKMPIDEWDSQVDEETTQELISRDLISNILSMPESMKDTN (SEQ ID NO:1127)
>orf03469
MSKSHSFSISLGISNSFWNNIHTSECWYFLAEGKSNRSNSTISVNQMVFFINIQRFYCFAIEDFCLLRI (SEQ ID NO:1128)
>orf03470
LNTLLPPDNLCLFTIYLTGFSCICINSYCHNFWEIFNQLFYQLS(SEQ ID NO:1129)
>orf03475
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLAEASGRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSKGQYTMMHSLIYRTDLLRASQF(SEQ ID NO:1130)
>orf03476
MYYLPVDFYRYLIGREDQSVNEQVMIKCIDQQLKVNRLLIDQLDLSQVSHPKMREYLLNHIEITTVISSTLLNRSGTAEHLAKKRQLWTYIQQKNPEVFQAIRKTMLSRLTKHSVLPDRKLSNVVYQITKSVYGFN (SEQ ID NO:1131)
>orf03484
MKRIQLNMNETKKYLVIKAIAQGKKTKKRACVELNLSERQINRLLLAYQQKGKEAFRHGHGNRNRKPKHAIPDEIKERVLKKYLSYETYKPNVLHFCELLAEEEGIKLSDTTVRKILYKKNILSPKSHRKTKKRVRKQAKLNLNQPLDNPILPTAKDFLEDPKKVHPSRPRKKFAGELIQMDASPHAWFGPETTNLHLAIDDASGNILGAYFDKQETLNAYYHVLEQILANHGIPLQMKTDKRTVFTYQALQL (SEQ ID NO:1132)
在一些实施方式中,优选的INV200抗原选自以下多肽或其免疫原性片段:orf00159(SEQID NO:626)、orf00162(SEQ ID NO:628)、orf00163(SEQ ID NO:629)、orf00164(SEQ IDNO:630)、orf00165(SEQ ID NO:631)、orf00166(SEQ ID NO:632)、orf00201(SEQ ID NO:639)、orf00209(SEQ ID NO:645)、orf01109(SEQ ID NO:747)、orf01137(SEQ ID NO:751)、orf01138(SEQ ID NO:752)、orf01309(SEQ ID NO:783)、orf01313(SEQ ID NO:786)、orf01315(SEQ ID NO:787)、orf01431(SEQ ID NO:810)、orf01433(SEQ ID NO:812)、orf01434(SEQ ID NO:813)、orf01537(SEQ ID NO:824)、orf01588(SEQ ID NO:831)、orf01642(SEQ ID NO:842)、orf01656(SEQ ID NO:847)、orf01800(SEQ ID NO:875)、orf01801(SEQ ID NO:876)、orf01810(SEQ ID NO:879)、orf01812(SEQ ID NO:880)、orf01818(SEQ ID NO:882)、orf01988(SEQ ID NO:913)、orf01989(SEQ ID NO:914)、orf02105(SEQ ID NO:925)、orf02106(SEQ ID NO:926)、orf02263(SEQ ID NO:947)、orf02264(SEQ ID NO:948)、orf02459(SEQ ID NO:968)、orf02538(SEQ ID NO:987)、orf02539(SEQ ID NO:988)、orf02541(SEQ ID NO:990)、orf02545(SEQ ID NO:992)、orf02604(SEQ ID NO:1003)、orf02608(SEQ ID NO:1007)、orf02609(SEQ ID NO:1008)、orf02850(SEQ ID NO:1036)、orf03197(SEQ ID NO:1082)、orf03439(SEQ ID NO:1120)、orf03448(SEQ ID NO:1123)。
7由23F鉴定的序列
>orf00010
LPVTFDFLIEGSTKGNIDKLDTATDCHNWFILTKRFLQECQFKFVTDQIVIIAFDGLFLSIKLRMNILASCQNEFVNHFHIITYN (SEQ ID NO:1133)
>orf00017
MSLITHKRFISCNENIKHYKRLIDKAKKCVNDLMAEFNSVITTVTGIENRLGAVILAEIRNIHAFDNPAQLQAFAGLDSSIYQSGQIDLAGRMVKRGSPHLR (SEQ ID NO:1134)
>orf00027
MQQYVDIKKQYPDAFLLFRMGDFYELFYEDAVNAAQILEISLTSRNKNADNPIPMAGVPYHSAQQYIDVLIEQGYKVAIAEQMEDPKQAVGVVKREVVQVITPGTVVDSSKPDSQNNFLVSIDREGNQFGLAYMDLVTGDFYVTGLLDFTLVCGEIRNLKAREVVLGYDLSEEEEQILSRQMNLVLSYEKESFEDLHLLDLRLATVEQTASSKLLQYVHRTQMRELNHLKPVIRYEIKDFLQMDYATKASLDLVENARSGKKQGSLFWLLDETKTAMGMRLLRSWIHRPLIDKERIVQRQEVVQVFLDHFFERSDLTDSLKGVYDIERLASRVSFGKTNPKDLLQLATTLSSVPRIRAILEGMEQPTLAYLIAQLDAIPELESLISAAIAPEAPHVITDGGIIRTGFDETLDKYRCVLREGTSWIAEIEAKERENSGISTLKIDYNKKDGYYFHVTNSQLGNVPAHFFRKATLKNSERFGTEELARIEGDMLEAREKSANLEYEIFMRIREEVGKYIQRLQALAQGIATVDVLQSLAVVAETQHLIRPEFGDDSQIDIRKGRHAVVEKVMGAQTYIPNTIQMAEDTSIQLVTGPNMSGKSTYMRQLAMTAVMAQLGSYVPAESAHLPIFDAIFTRIGAADDLVSGQSTFMVEMMEANNAISHATKNSLILFDELGRGTATYDGMALAQSIIEYIHEHIGAKTLFATHYHELTSLESSLQHLVNVHVATLEQDGQVTFLHKIEPGPADKSYGIHVAKIAGLPADLLARADKILTQLENQGTESPPPMRQTSAVTEQISLFDRAEEHPILAELAKLDVYNMTPMQVMNVLVELKQKL(SEQ ID NO:1135)
>orf00033
MRRKYKSIALKKELANDSGKKKFHAMKAQAIVTSQGRIVSIAMI(SEQ ID NO:1136)
>orf00042
LTNLSSVDSEELFQFYRERGNAENFIKERKAGFFGDKTDSSTMIKNEVRMMMGCLAYNLYLFLKQLAGDEVKALTIKRFRRLFLHIAGKYVSTARRHILKFSSLYAYSKQFQALFDTICQINLILPVPYRARGQGKTA (SEQ ID NO:1137)
>orf00051
LFDDRQAINICPPTNGSLRLTSLQVDQNPCPPSTNLNKILARSQFLNHIQQISLSLELLQANLWNLV (SEQ ID NO:1138)
>orf00055
LSVHFCSSHRCLLVRYNDTYSTKKGLKFETFLSVFRYDFLGM(SEQ ID NO:1139)
>orf00086
VDRTDEVSSKHGFEVVDETDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSSKHGFEVADRTDEVSNIYTAR(SEQ ID NO:1140)
>orf00088
MDFFNYLLWMICHNHGLHTLLLSKDCVCHTARDKDGNHRIKSVFPTKGQTCYQHDSSIYQERNTTDILTRFLANSQADDIRPTTGDIVSKSKTNPQTHNNTPKKGIDNGILRQGCHRDKLDKEGTHRYRDKGKDGELMANLIPS (SEQ ID NO:1141)
>orf00096
MKIKEQTRKLAAGCSKHCFEVVDETDKVSSKHGFEVVDETDEVSSKHGFEVVDETDEVSNHTYGKATLTWFEEIFEEY (SEQ ID NO:1142)
>orf00103
LQNDKNHKLFDNYTCQKEKDVLRCKQVKRKEERSYDVGTRIYTIYDFLLF(SEQ ID NO:1143)
>orf00105
MKIKEQTRKLAAGCSKHCFEVMDRTDEVSSKHGFEVVDETDEVSNHTYGEVKLTWFEEIFEEY (SEQ ID NO:1144)
>orf00106
LFFKDEKQALYTKPKTKSSSFRASKVSNQTIVATTRTDCQVIALNLCDKLENGVVVVVQTTHHIGIDDVIYSKIFQHLTHSIKMSLAFFIKKVQDRRRILYCHLVFFFLRVQDTKRIFLQATLAILRQGLLERCQIVNQGLAVGCTALRISKSVEVQFDTLNTDFLQKMGCHSDCFHIGSWIARAKTLNTNLVELAQAPCLWTLITEHRSHVVELAWLLHFWGEEFIFHIGTDNGRSSFWTEGNMAVTLVIEIVHFLGYDIGRISDRAADNLVMLKNGRAHFCVVVALENFTGKALNVLPFGRFSR (SEQ ID NO:1145)
>orf00114
MEQIGKVFRQLRESRNISLRQATGGQFSPSMLSRFETGQSELSVEKFLFALENISASVEEILFLARGFQYDTDSELRKEITDVLEPKNVAPLEDLYRREYQKHAHSHNKQKHILNAIMIKSYMKSMDERVELTAEEGKVLHDYLFSTEIWGIYELNLFSVSSPFLSVSLFTRYVREMVRKSDFLMEMSGNRNLFYTILLNGFLASIECEEFTNAYYFKRVIEEHFYKENETYFRIVYLWAEGLLDSKQGRVKEGQKKMEDAVRIFEMLGCNKSAEYYRNTTEC(SEQ ID NO:1146)
>orf00118
MQEHYTPKGKHLTIDNRRLIERWKNENKSNREIAGLLGKAPQTIHTEVKRGTTLQQVRKGLYKKVYSADYAQTVYQFNRKRSVKKLILTKEIREKILHYHKQKFSPEMMVNKKQVKVGISTIYYWFHNGHLGLTKADMLYPRKRKGVKKQASPNFKPAGKSIEERPDVINLRLENGHYEIDTVLLTKIKNYCLLVLTDRRSRHQIIRLIPNKTAESVNQALTLLLGEHHILSITADNGSEFKRLSEVFPEEHIYYAHAYSSWERGSNENHNRLIRRWLPKGTKKTTPKEVAFIENWINNYPKKCLDYKSPSEFLLGG (SEQ ID NO:1147)
>orf00121
MKIKGQTRKLAAGCSKHCFEVVDRTDEVSNHTYGKATLT (SEQ ID NO:1148)
>orf00124
VVPFSDTFKDRNQVDIFTIKISRCNSSTIGENSWDIHISNSNHRSRHVLVTATDSDEGIHVVTTHSRLDGVRDDVTRC (SEQ ID NO:1149)
>orf00139
MDLKFEGVDLEYKKAKNNLPESFWETYSAFANTNGGKIILGIDEKNIDTYQRVNRLPAKL(SEQ ID NO:1150)
>orf00156
LSIQVETLELRVIFKEIKEIVKQFHQLHTMAFKRQVPLTVPVTM (SEQ ID NO:1151)
>orf00171
VQKLKKAIYKAHLKDSDDFRPETSTPNLFESCLKLCPCFLSS (SEQ ID NO:1152)
>orf00172
MGALGYYEGFVPYVSNQYKNQAEEEGKPLSDKYIFEKILGKTYAAFKKDQINERVEKLGKLKPITINYNGKSEVIDSKEKLQELMNKAVKDEVAQI (SEQ ID NO:1153)
>orf00173
MMGDGMKEFQFERKQRFSLRKYAIGACSVLLGTSLFFAGMGDQPVQDTETSSALISSHYLDEQDLSEKLKSELQWFELENKLLNLWEH (SEQ ID NO:1154)
>orf00177
VNIAKTSIIKAHTTKEDGIDHTFTRFNIMSIFYSTRKIFLDKLNSTNRQFLGYIISTRCYQSFNSVSQSIHTSSSSQAFRFGKHEFRVINRDKSKAILVNHYHLNLAFFISNHIVNGNFC(SEQ ID NO:1155)
>orf00178
MKPFIILWSSTICSHDRNTTSCILWRTPAKTDDKVTAMFLQSSYPICDIFTSRVWLYIAKDDIFDSFCIQWF (SEQ ID NO:1156)
>orf00194
MHIPFARWQAFSLYNISINIHHYNIGFFDFKEINTRRGNCHQLFFTIENTEIPTCSFRQICFY (SEQ ID NO:1157)
>orf00205
MNIAIRIILNFFRVMGNHQNSLAMMMGAVVHEFVKFIFTSCIHPRCRLV(SEQ ID NO:1158)
>orf00206
MLLIMSIQTTEPAFSRIATRLDKFIDRTWKTSIKTGNLLRKIGYSQFLTLRICL(SEQ ID NO:1159)
>orf00207
LQNSKTSLDERRLSRSIFPSQGNKFPTINTIIDMFKNRLLIIIEGQILYRNISHYLISPTKAVKNR (SEQ ID NO:1160)
>orf00220
MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAILVGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQLIGWGLGGLLFATIGLLPTTFIILVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESYWGYSNTAYSIGIIISGLIAFRLSEKFLAAKWESILFPLVAMAIVTLTILYFPNAQMFLLFSALVGMLSQLKEVPESVFLQETVEENHLVNVYSVLEVISTLAFSVFVLLMSYITESFGISISFWLSAICLMIEAILIYIRRDYFK (SEQ ID NO:1161)
>orf00221
MSLVHNIIEDGDTESLIRKYLHSGVIINGQRYKTLVGTPQGGNLSPLLSNIMLNELDKELEKRGLRFVRYADDCVITVGSEAASKRVMYSVSRFIEKRLGLKVNMTKTKITRPRELKYLGFGFWKSSDGWKSRPHQDSVRRFKLKLKKLTHRKWSIDLTRRIEQLNLSIRGWISYFSLGNMKV (SEQ ID NO:1162)
>orf00222
MSKLLDKILSRENMLEAYNQVKSNKGSAGIDGMTIEEMDNYLRQNWRLTKELIKQRKYKPQPVLKVEIPKPDGGIRQLGIPTVMDRMIQQAIVQVMSPICEPHFSDTSYGFRPNRSCEKAIMKLLEYLNDGYEWIVD (SEQ ID NO:1163)
>orf00229
LHFNQTSLKTASCRLQGYTSSCDSSTDNQEVQGAFLHFFN (SEQ ID NO:1164)
>orf00247
MFCLTFICLIRRSGYLGSYLLLCRMNHTSHKKTGNSYTSYSNTKFTN(SEQ ID NO:1165)
>orf00248
LPSEIKAKLDAAFEQFKKDTLPTEPGKKVAEAEKKVEEAKKKAEDQKEKDLRNYPTNTYKTLELDIAESDVEVKKAELELVKEEAKESRDEKKINQAKAKVENKKAEATRLKNIKTDREKAEEAKRRADAKLQEANVATSEQDKSKRRAKREVLGELATPDKKENDAKSSDSSVGEETLTSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKESRDEKKINQAKAKVENKKAEATRLKNIKTDREKAEEAKRRADAKLQEANVATSEQDKSKRRAKREVLGELATPDKKENDAKSSDSSVGEETLTSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKESRNEEKIKQVKAKVESKKAEATRLENIKTDRKKAEEEEAKRRAAEEDKVKEKPAEQPQPAPAPQPEKPTEEPENPAPAPAPKPENPAEKPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKAEKPAQPSTPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGSMATGWLQYNGSWYYLNANGDMATGWLQNNGSWYYLNANGDMATGWLQNNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMETGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN(SEQ ID NO:1166)
>orf00254
MDREILKFFQDLLSILSHNDMITLFCQKCCNSFSNHFLVICN (SEQ ID NO:1167)
>orf00261
MTRKLNPSYTNVASATTLTFNQVASTFRKACLDHVVNLTRNNLKGICQLTPLQLHDTRLI(SEQ ID NO:1168)
>orf00300
LVSVFYSLLQVDNVDSVTFSKDVLSHLRIPATSLVTKVYTSLKKLFH(SEQ ID NO:1169)
>orf00309
LIVWILKNHTDLTTYIPNIFLSQTLAINYNLSGFCFQ (SEQ ID NO:1170)
>orf00310
MPYNRKPFSTFHVKRNILHIVVVLIFFIAKRKIFYINY (SEQ ID NO:1171)
>orf00314
MFKKMSNSSRILFYISVNFCDKRIYRTKLYSDTPVNLFKFLFRQKSNCQSVGQTSSINLFFYSWIVFFFKNNLCHSIPSIK (SEQ ID NO:1172)
>orf00327
LADGSRKLAEGGTKLTSGLEDLQTGLASLGQGLGNASDQLKSVSTESKNAEILSNPLNLSKTDNDQVPVNGIAIAPYMISVALFLQQYQQI (SEQ ID NO:1173)
>orf00356
MEMSFIAQDFDKLNIITVLESRTQAIIRNPMNTRLSSATGSSFNKIVRN(SEQ ID NO:1174)
>orf00358
MELAETSIVKKNHQIPCIINQKIAQKLIEKTSMTDIDHQLSISTSTVIRKINNFHFEHDFSRLPEIMS (SEQ ID NO:1175)
>orf00364
MNYIDTNEMLFVETPRKVITSDELRKKNTKYLDQKEFKLFIQNLKDEALCDYRITKYIRIAKVLFLTGMRYGELAALNYKEDIDFSKKTIHIKHTYDFRQKERTTPKTIKSDRVITAPQKVLDIIKEQIIENATNGFDTDFIFINTLGEPITNARVICALKRHGQKIGIEKNITTHTFRHSHISLLAELGIPLTAIMDRVGHSDSKTTLEIYSHVTQKMVSDISSKLDKIKF(SEQ ID NO:1176)
>orf00365
MWMEELPNGKYKFFERYKDPYTEKLKKVSVTMEKKTPQARNQAAILLQEKIKQKLGEKQHSVSNITFEKLYEEFEENWKHGVKNSTVYASKNVKKEILKQIEGDYLVRNLIDVYYKK(SEQ ID NO:1177)
>orf00367
MEIDKVKADLKQVGKRVADLSQSITNEEQTKNAFIMPFFQALGYDIFNPLEFVPEFTADVGIKKGEKVDYAIILDGEPQILIECKSITENLTKHDSQLFRYFVTTKSKFGILTNGREYKFFTDLDEPNKMDTTPFLTIDVTDIKENQFTEIIKFHKENFDIDNIVSSASELKYLNNLKAFLTENITTPSDSFLRYLTSEIYEGRVTQNILTTFSPIIVKGFNQFITERVNEKLSAALNTSVETKVTTDIPKVEAEAEEIVEVTDEIITTPAELEVYTVVKMLARDVVSPERVFYRDNRSYFNVLVDDNIKKWVLRYRSNSKKSTIEIRDKGIFPVSTPLEVANYANEILEVIKKFS(SEQ ID NO:1178)
>orf00368
MTLAKLCEEYQVELCLFDGSNWHNSGFYNPDTNVLAIDHNLTPEQQIQVALHELGHKDHTRSEHQNARLRCENEADRNMIHHLVKDALENLDDPTEFDYLKFMSYYNLKTMTNEIMVKEEYLALVN (SEQ ID NO:1179)
>orf00369
MYRLDIDKKALKQLKKLDTPTRKQILSWLAKNIENTTNPRQHGKALKANLAGYWRYRVENYRIICDIQDDKLVVLAVEIAHRRDVYK (SEQ ID NO:1180)
>orf00370
MTITINFTEKNSYITDYLNKHGIDTTTMDFDDFMALMEDIEDARAADQAYMEYLADPATYTMDEVLDELGLTREDIA (SEQ ID NO:1181)
>orf00371
MFETFEKIKELAKKRGKALGQVEEDLGYGRNTLYKIKNSTPNAERIAEIANYFNVSTDYLLGRTDNPAIAGSDEFAQVNGQIIDLRKAAANTMLFDGKPLNEDDIDFITSVLSAHFKSKGER (SEQ ID NO:1182)
>orf00372
MVSILKNLEQEKDHLEKVIKVVSAGGKFLRLPYQKSHARLVRI (SEQ ID NO:1183)
>orf00373
MPDIANGRERVIAFLKEKGIKKATLAVAYGFKRQEVTNILSGTTKGPRANSFILQVIEDYGIE(SEQ ID NO:1184)
>orf00374
MRPKRYPYSGQKESTFVKADPELVEKLLRNTSFLECLQKKPINFQIDSEEFKRLSYEAIHDTSQVTQ (SEQ ID NO:1185)
>orf00377
LKNREEEWQGIIAKNAILLIIAPFYFLIIVKNGVLSKIKTVTEITAYQL(SEQ ID NO:1186)
>orf00378
MREVIQELLDSSMSTSAISQGAGVPWTTVSDLRKGKTSMDKMALLTAEKLYEFATTDKQ(SEQ ID NO:1187)
>orf00382
VEEVEVAEVKNARVSLTGEKTKPMKLAEVTSINVNRTKTEMEEFNRVLGGGVVPGSLVLIGGDPGIGKSTLLLQVSTQLSQVGTVLYVSGEESAQQIKLRAERLGDIDSEFYLYAETNMQSVRAEVERIQPDFLIIDSIQTIMSPEISGVQGSVSQVREVTAELMQLAKTNNIAIFIVGHVTKEGTLAGPRMLEHMVDTVLYFEGERHHTFRILRAVKNRFGSTNEIGIFEMQSGGLVEVLNPSQVFLEERLDGATGSSIVVTMEGTRPILAEVQALVTPTMFGNAKRTTTGLDFNRASLIMAVLEKRAGLLLQNQDAYLKSAGGVKLDEPAIDLAVAVAIASSYKDKPTNPQECFVGELGLTGEIRRVNRIEQRINEAAKLGFTKIYVPQNSLTGITLPKEIQVIGVTTIQEVLKKVFA(SEQ ID NO:1188)
>orf00389
VNIATLQNGHILGWQIQHIANKLTSNFWIAKDFLSYQVIGWANARMTYSHISSLFIISQF(SEQ ID NO:1189)
>orf00391
VSITFSLTNFFKILINLTAQVSPQVIDEKILMMDLNLNNYLSTVIQLRQDVYTGIKILHRVRHGE (SEQ ID NO:1190)
>orf00392
MSRYSYSLDSRKIVFEISCFKEKKASLTLFFHLFESSIMKLATQPSFSSFYSELK(SEQ ID NO:1191)
>orf00396
MKIKEQTRKLAAGCSKHCFEVVDRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSNIYLRQGDVDVV (SEQ ID NO:1192)
>orf00408
LSLLDLRGSLCLRIYLHEPLITTVSQDFTSLSDISHF (SEQ ID NO:1193)
>orf00411
MDFKSFIIGLVVGIFGPYMDDLIRKKFLKSSEKKTEKSVKK (SEQ ID NO:1194)
>orf00434
MFEKIKGINIKSGIFEDETKLELFEGNFEGTNPVQNDRASLLFGRNGSGKSTIARGINQLKNGEIGTDRVSFIDKNNNNIVLSDTERKSIFVFNEHYVDQKVKIAQEGLDTIVILGEQVDIDEELDRLRTQLSESQIESQDYYAEYEEYLDEKNEKSPDFWKKEMTDSLKGVGNWAERDREIKGNRAASPVHNNTFQNFVDLQPILDKNELEVEFNNKKARYFSIRDSAVTINNELSLPDINFDSNELSTLLSEKIEEPELNSRDKYLLTLLSDSTKGERHLREVKDFFEDEHQKKCPFCTQSVSEDVKVELTNGITKLLSRAVEEHQSALRGKKIDEINQDFSGYEQIDPILIQSYQNSINALNAKFNEINSIIDKKIDNPYNIVELPNISFSQELSQAEHDIEKINQAIIKHNSEISGIQKLKVDLLQINNELAFYEIQDAYKKFQEKTNKKAICENNYNNSSKRVKDYEKQISDLEDKKLNIDIAVDEINKSLNYIFFSKNRLAIQNQNGKYYLLSRGKSVVPSRVSVGERNALALCYFFTEIIQQRELADAYSHEYFIVIDDPISSFDMENKVGITSYLKYCLTRFFKGNSNTRVLLMTHDKQTIYDFDIFLKEIMESCKEEEGGQKSKYKKLELVSGKLQEFKTSTHDYTELLEIVFGYALGNSTPTSESFVGNAMRKILEAYGSFNYKKGIAELTTDPLIVEKIDKEYRTYFENLMYRLVLNGESHFKDPVKTLSIDFFDTISDEERKKTARDLLVLLYLLDDLHVLKHLEGVSNAENRLEQWKCEILE (SEQ ID NO:1195)
>orf00458
MIEGDRDCADIVTQLTAVKSSVERVIEMIITENLTECINQPLDDSEAQKERLEKAIRYLIKRK(SEQ ID NO:1196)
>orf00460
METSISMADFYGKYQNENLELIDVREAHEFQAGHAPGAKNLPLSTLEQGYKELKPDHEYYVICQGGVRSASTCQFLSSQGLTVTNVEGGMNAWPGQVE (SEQ ID NO:1197)
>orf00462
LGGKSCLLEDRLCDIAAQTTVAADDVGLFFVQFISFLLDTLSVFDTIVQN(SEQ ID NO:1198)
>orf00466
MKIKDQTRKLAAGCSKHCFEVVDRTDEVSSKHCFEVADRTDEVSNIYTARRR(SEQ ID NO:1199)
>orf00467
MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKGHGGAVNRGLAEASGRYFKVVDSDDWVDPRAYLKILETLQEFESKGQEVDVFVTNFVYEKEGQSCKKSMSYDSVLPVRQIFGWDQVGNFSKGQYIMMHSLIYRTDLLRASQF(SEQ ID NO:1200)
>orf00468
MYYLPVDFYRYLIGREDQSVNEQVMIKCIDQQLKVNRLLVDQLDLSQVSHPKMREYLLNHIEITTVISSTLLNRSGTAEHLAKKRQLWTYIQQKNPEVFQAIRKTMLSRLTKHSVLPDRKLSNVVYQITKSVYGFN (SEQ ID NO:1201)
>orf00476
MSLQIKLKKLAKELSKLLKDSNLETVDKDVLENSQKELQKAVLFLADEKGSEHTEAEVIDNLKEVIAKLKANA (SEQ ID NO:1202)
>orf00483
MKIKEQTRKLAAGCSKHCFEVVDKTDEVSYIYLRQGEADAV (SEQ ID NO:1203)
>orf00503
MKIKEQTRKLAAGSSKHCFKVVDGTDEVSSKHCFKVVDGTDEVSSKHCFEVVDRTDEVSNHIRQGDVDVV (SEQ ID NO:1204)
>orf00509
MNDDDSRCIHIERDGKTIEFGYLNISSTDRNTSHADGLVGIFNSNFSGVRVRGIAVFLNGPDNLDTTLVGNFQTIWNFRIICIHS (SEQ ID NO:1205)
>orf00510
LEFNFCRSIIKNGRDNLPNTNSTSGMATRWANHNWSDDIKDRLKTK (SEQ ID NO:1206)
>orf00515
MSNVDKIRKIHIIVCWMYIFLSFRAIINDTEYFLLIFLAFIYSIVSLPLYSVKNKIVSICLVINSILLMSFPILINKFFPESFLTYIVLISVFITELIIFHLIGKDFDTKLTNEYKKISQFRSKVSQSPWIKYLEISSFILTIFPSILYGTVDNHVLTLIFLIKICVDTTIKFLFIRLFDTSTLMKRRIFFLFALDVIAYLFLGYLLVIQKAGYLFSVLLLFSNFSVPFI KEKEYELFKNSK (SEQ ID NO:1207)
>orf00516
MNKKKMILTSLASVAILGAGFVASSPTVVRAEDAPQVVEKSSLEKKYEEAKTKADTAKKDYETAKKKAEDAQKKYDEDQKKTEEKAKKEKEAAKKVDDASLAVQKAYVEYRKVQESRSNYRNRSDYNKKLAEAQVKIDEANKKLTAANNEFKTVRAVVVPEPNALAETKKKAEEAKAEK(SEQ ID NO:1208)
>orf00518
LEQEVATAQHQVDNLKKLLAGVDPDDTEAIEAKLKKGEAELNAKQAELAKKQTGLEKLLDSLDPEGKTQDELDKEAAEAELNKKVESLQNKVADLEKEISNLEILLGGADSEDDTAALQNKLAAKKAELAKKQTELEKLLDSLDPEGKTQDELDKEAAEAELDKKADELQNKVADLEKEISNLEILLGGADPEDDTAALQNKLATTKAELEKTQKELDAALNELGPDGDEEETPAPAPQPEQPAPAPAPKPEQPAPAPKPEKSADQQAEEDYARRSEEEYNRLTQQQPPKAEKPAPAPAPKPEQPAPAPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGLGALAVNTTVDGYEVNANGEWV (SEQ ID NO:1209)
>orf00519
LTISFKKQFLSSSLSSLTKRVIMNTAQATFNREAHTTFNRE (SEQ ID NO:1210)
>orf00525
MKIKEQTRKLAVGCSKHCFEVVDRTDEVSSKHRFEVVDRTDEVSNIYTARRS(SEQ ID NO:1211)
>orf00539
LKKRMNRWQFLLNQSKEMVGILLLKMKEQELIEFVVNL (SEQ ID NO:1212)
>orf00540
LIKVIKRKAFGFRNFNNFKKRILMTLNIKKESTNFVLSRL (SEQ ID NO:1213)
>orf00544
MTYNEKRLTNSLERVHMEQLKNTTDLLGLKDKNIKILSVLKYQTHLVVQAKLDSPAPPCPHCQGKMIKYDFQKASKIPLLDCQGLPTVLHLKKRRFQCKNCLKVVVSQTSIVKKNCQISNMVRQKIAQLLLEKQSMTEIAHRLAVSTSTVIRKLREFKFETDWTKLPKVMSWDEYSFKKSKMSFIAQDFESKSILAILDGRTHAVIRNHFQRYQREVRELVEVITMDMYSPYYRLAKQLFPKAKIVLDRFHIVQHLSRAMNRVRIQIMNQFDRKSLEYRALKRFWNPRFFVSRLGLNQSTGLIYYTRIASSSVRNDSISPRFECT (SEQ ID NO:1214)
>orf00545
MGYSLKKSCTYCEQDPEKVNRFLKELNHLSYLTPIYIYETGVETYFYLEYDRALSRQLVSLEEDIII (SEQ ID NO:1215)
>orf00552
MNIAVIGLGHVGLAYALLFASKYKVVAYDIDSVKINNLKKGILPSKNEELMKFFCENNLNITFFDTFSEIKNNIDYYIIALPTDYDEKIGSFNTYEIEQTVSKILRVKPNGKIILKSTVPFGFSNKLKRLFDTKNIIFVPEFLREGCSIYDNLYPSRIVVGDETVEGRKIAELFLSISTHSTANIKNVMLVSPTEAEAIKLFSNTFLALRVAFFNELDSFAERRSLNAEVVIKGVCLDPRIGNFYNNLSFGFGGYCLPKDTKQLKKEFIEINAPVIEAIDISNTNRKQFIVKQILERKPKIVGIYKLGMKYNSDNYKESAILSIINELLIVGIKILVYEPNLNVSIDNVIFEKNFELFTKQSDLIVANRWDRGLEAYKDKVYTRGIWIRD (SEQ ID NO:1216)
>orf00554
MLNLQFAETMELTEAELEIVYGGEFGNNAVIPAGAWGGFGTPWSITNFWKKNFNDRPDFDSDRRRY (SEQ ID NO:1217)
>orf00599
MGLDVGSKTVGVAISDPLGFTAQGLEIIQINEEQGQFGFDRVKELVDTYKVERFVVGLPKNMNNTSGPRVEASQAYGAKLEEFFGLPVDYQDERLTTVAAERMLIEQADISRNKRKKVIDKLAAQLILQNYLDRKF (SEQ ID NO:1218)
>orf00635
LNPSYSFGKKDQFALEHCFCIKLSIFARAVTLFVSCIN (SEQ ID NO:1219)
>orf00656
MITGTAFILIMSLSARKLPYTIRSSVASLQQIAPSIEEAAESLGSSRLNIFAKITTPMMLSDIISGAILSWVTLISELSTSILLYNVKTRTMTVAIYTEVLRGNYGVAAALSTILTVLTVGSLLLFMKISKSNSITL (SEQ ID NO:1220)
>orf00657
MLIGEGYRTFPVLIYTQFISEVGGNSAFAIMAIIIALAIFLIQKHIANRYSFSMNLLHPIEPKKTTKGKMAAIYATVYGIIFISVLPQIYLIYTSFLKTSGMVFVKGYSPNSYKLAFNRMGSAIFNTIRIPLIALVLVVLFTTFISYLAVRKRNLFTNLIDSLSMVPYIVPGTVLGIAFISLVYLEVDFL (SEQ ID NO:1221)
>orf00658
MECKKLNIWTASSFFLFLTYLVFLVYPIVTVLKQALIHEGQFSLANFVTFFSKAYYSETL
VNSFRVSITATVTSLVVGTLLAYLFSMYDFKGKKFLQILIIIASMSAPFVGAYSWILLLGRNEVITKFLTNALYLPAIDIYGFKGIVLVFTLQLFPLVFLYVAGTMNSIDNSLLEAAESMGSFGFKPIVTVVLPLLVPTLLAAPCLYL (SEQ ID NO:1222)
>orf00660
LLSTTEFIGLSIRILSNLHEFKILVGLLNQFFFWNLLLHKTKSNVVSDSQMWENSVVLENQPDIAFAGFHIIDFCIIEVKFSIFDTVETCNHTKKGRFPTS (SEQ ID NO:1223)
>orf00679
MITIKKQEIVKLEDVLHLYQAVGWTNYTHQPEMLEQALSHSLVIYLALDGDAVVGLIRLVGDGFSSVLVQDLIVLPIYQRQGIGSALMKEALEDYKDAYQVQLVTEETERTLGFYRSMGFEILSTYNCIGMTWMNRKK (SEQ ID NO:1224)
>orf00710
VLKIRYHKQFKKDFKLAMKRGLNAELLEEVLKIWFKKKNFLLDIVIIN(SEQ ID NO:1225)
>orf00714
MLGSMFVGLLVGFLAGTLTNRGEHMGCFGKMFLGWIGAFIGHLLFGTWGPIIAGTAIIPAVLGSMIVLAIFWRRGS (SEQ ID NO:1226)
>orf00741
MIDDIPKRVNDVIGQAGNNAKTSRPHVGIGKSHISVPFLFPYHTANRIKNQEKVIF(SEQ ID NO:1227)
>orf00755
VAIDKIAGITSEKDSRAHQIFRISPTCSRCFCNDELVKWVARTIFLQLTKRCCLRSGNITRSNSVTLDIGSTVFRRNVAGQHFQAPFSSSISANCFTSQFAHHRTNIDNLSMPFLYHRRNNCL (SEQ ID NO:1228)
>orf00756
LFDLLDHGLDTVLVCHVTDISMGFDANFTISFNPFIDQILIDIVKDNSSAGFSVGFGNSKSNSIRSAGDESNFSF (SEQ ID NO:1229)
>orf00768
MKSLARLLIIHVFISIFLFFALISGAVSHTVLLLLLLFLPALNKGLEKIQSKRIPVLNAALFFLLISFPQLLTNPVQWKFSIFLVVTIISSLAYFYNFYQVVKEVDQKQLI(SEQ ID NO:1230)
>orf00769
LEAASEIETEFQSWIVLVVFNHIDGLSRDTDILGELELGNAQFLAKFFHTIHLVSFLICVVYI (SEQ ID NO:1231)
>orf00774
MKWTKRVIRYATKNRKSPAENRRRVGKSLSLLSVFVFAIFLVNFAVIIGTGTRFGTDLAK
EAKKVHQTTRTVPAKRGTIYDRNGVPIAEDATSYNVYAVIDENYKSATGKILYVEKTQFNKVAEVFHKYLDMEESYVREQLSQPNLKQVSFGSKGNGITYANMMSIKKELETAEVKGIDFTTSPNRSYPNGQFASSFIGLAQLHENEDGSKSLLGTSGMESSLNSILAGTDGIITYEKDRVGNIVPGTELVSQQTVDGKDVYTTLSSPLQSFMETQMDAFLEKVKGKYMTATLVSAKTGEILATTQRPTFNADTKEGITEDFVWRDILYQSNYEPGSAMKVMTLASSIDNNTFPSGEYFNSSEFKIADATTRDWDVNDGLTTGGMMTFLQGFAHSSNVGMSLLEQKMGDATWLDYLKRFKFFGVPTRFGLTDEYAGQLPADNIVSIAQSSFGQGISVTQTQMLRAFTAIANDGVMLEPKFISAIYDTNNQSVRKSQKEIVGNPVSKEAASTTRNHMILVGTDPLYGTMYNHYTGKPIITVPGQNVAVKSGTAQIADEKNGGYLVGSTNYIFSVVTMNPAENPDFILYVTVQQPEHYSGIQLGEFATPILERASAMKESLNLQSPAKNLDKVTTESSYAMPSIKDISPGELAEALRRNIVQPIVVGTGTKIKETSVEEGTNLAPNQQVLLLSDKVEEIPDMYGWKKETAETFAKWLDIELEFEGSGSVVQKQDVRTNTAIKNIKKIKLTLGD (SEQ ID NO:1232)
>orf00776
MVDRTDEVSSKHGFEVVDKEKLMWFEEVFEECKKILVS (SEQ ID NO:1233)
>orf00783
MEGVNHVDIIKVSCCSFISQVNWMMKGKIPNREGFKFSVARFDAIDLVVVHIGHTRCQFSRTGSRSGYDNQVATGFDVVVFAHAFWGNDVIHIRRISFDWIMKIRINSVFLKLVAEGICSGLASVLCNDNGTNKNP (SEQ ID NO:1234)
>orf00784
MFNVASINGNHNLNLLFQFLQELDFVVRFITRKDTSSVEIF (SEQ ID NO:1235)
>orf00790
LTNQDLQAGTYLVKDYREIILSQDALEKVATNLKLDMPAKTLASKVQVAVPADTRIVSISVKDKQPEEASRIANSLREVAAEKIVAVTRVSDVTTLEEARPATTPSSPNVRRNSLFGFLGGAVVTVIAVLLIELLDTRVKRPEDVEDVLKIPLLGLVPDFDKIK(SEQ ID NO:1236)
>orf00791
MPTLEISQAKLDSVKKAEEYYNALCTNLQLSGDGLKVFSITSVKIGEGKSTTSANIAWAFARAGYKTLLIDGDIRNSVMLGVFKARNKITGLTEFLSGTTDLSQGLCDTNIENLFVIQAGSVSPNPTALLQSKNFTTMLETLRKYFDYIIVDTAPVGVVIDAAIITRNCDASILVTEAGEINRRDIQKAKEQLEHTGKPFLGIVLNKFDTSVDKYGSYGNYGNYGKNKK(SEQ ID NO:1237)
>orf00792
MNEKILRSSLAIIQSFLVILLTYLLSAVRETEIVSTTAIALYILHYFVFYISDYGQDFFKRRYLIELVQTLKYILFFALAIGISNFFLEDRFSISRRGMIYFLTLHALLVYVLNLFIKWYWKRAYPNFKGSKKILLLTATSRVEKVLDRLIESNEVVGKLVAVSVLDKPDFQHDCLKVVAEGEIVNFATHEVVDEVFINLPGEKYNIGELVSQFETMGIDVIVNLNAFDRSLARNKQIREMAGLNVVTFSTTFYKTSHVIAKRIIDIVGALVGLILCGLVSIVLVPLIRKDGGSAIFAQTRIGKNGRQFTFYKFRSMCVDAEAKKRELMEQNTMQGGMFKVDDDPRITKIGCFIRKTSLDELPQFYNVLKGDMSLVGTRPPTVDEYEHYTPEQKRRLSFKPGITGLWQVSGRSEIKNFDEVVKLDVAYIDGWTIWKDIEILLKTVKVVFMRDGAK (SEQ ID NO:1238)
>orf00793
MKKSVYIIGSKGIPAKYGGFETFVEKLTAFQQDKAIQYYVACMRENSAKSGTTEDVFEHNGAICYNVDVPNIGPARAIAYDIAAINRAIEIAKENKDEDPIFYILACRIGPFIHGIKKKIQEIGGTLLVNPDGHEWLRAKWSAPVRRYWKISEGLMVKHADLLVCDSKNIEKYIQEDYKQYQPKTTYIAYGTDTTRSVLKSSDEKVRSWFKEKNVSENEYYLVVGRFVPENNYESMIRGFLASNSKKDFVLITNVEQNKFYNQLLAKTGFDKDPRVKFVGTVYEQELLKYIRENAFAYFHGHEVGGTNPSLLEALASTKLNLLLDVGFNREVAEDGAIYWKKDNLHEIIETSEQKTQKEIDEKDILSIKQVTERFSWELIVNEYEKLFLCEK (SEQ ID NO:1239)
>orf00794
VTIKINNLFFVCLSFFGIVLSSSQVIVNLGLSSIIQYISYFMLMLCVFLTLIKNTLNVFANRIIYFLIISFLFIIGINLQNLPLSRKIYLSFSMLIISSLSTLPIKLINNLSDLRRISYYLLHSIFLSVFLGLVFKISLVTVAVEGIGFSYGFNGGLTHKNFYAITILVSYILLYVSRKYDAKHQIDSFVLWLDLFLLLISNTRTVYIILVVFWIIINRNFINNIKKEHRLVVTATTIVISLLALTFFFKHIINNSESYSHRVLGVVNFFKYYESDRFHLFFGDAELAFGNTTKGYGHNIRSVLGWDGTVEMPLLSVMIKNGYVGLVGYIIVLFKFISSIISVKNSTKKNIGLSIFIPLLLSATVENYIVNISFVFMPVCFCILCSIKNIKLVNNRK (SEQ ID NO:1240)
>orf00796
MEKLVSIILPVYNVEQYIKNCLESIQQQTYSNLEVIIVNDGSTDKSVEYCEQICKIDSRFSITHKENGGLSDARNVGIDKSKGDYLIFVDSDDFVSQDMVSYLVSCMENNEADIAICDPVHYYSDRQNNDLNIFSPASNVKVYETTEALCEMFYQKSFLVSAWAKIFKRELFDDIRFPVGKLFEDSAIMYLLFEKCETIAYSDAELYAYVHRDNSITTKKFSDRDLDILEITNTIINHYGDNLRVYTAAVSYKVSACFRILLNSPSGEKYKKVQKECLSYILQNWRNILFNNNVRLKNKLALISITIFNPFVKFIYSKVNRWE (SEQ ID NO:1241)
>orf00797
MNKYEERYQENLSKNDFYKLINKSYLSDKELQVQQVKAGIVLPPKAFETKLSNKLGLQKSLHGKGGVVDSNGNYIELSAQKAVGMRNRVYGPYKINYDNLPIRNEKVIYLNYFIKQWGHFLLDVVGRLWYPLLQDNDTKLVYTCYAGTETKIEGNYLEFLKLLGIDQSRLIMINCPTQFSEVIIPESSILPGGYYTKEYKQLFSSVVENIKLDKYDVNAKMIYCSRSKLGIAKSKEFGEDGIEGIFKQNGYTSVYMETMSLEEQIKTLLSAKTIVLTSGSLAHNLLFVNKDIDVFILNKTYRVNLHQFLINEISDATVRFVDIYRSPLPILYGYGPFLMDLTKPLANFLDDNEFVYEKGTVLSKKDYFKYYLKWLWSYRFFLFRLNGIKEGNSEFEKSFKIIRRYYKTGR(SEQ ID NO:1242)
>orf00798
MSKYKELAKNTGIFALANFSSKILIFLLVPIYTRVLTTTEYGFYDLVYTTIQLFVPILTLNISEAVMRFLMKDGVSKKSVFSIAVLDIFIGSIAFALLLLVNNLFSLSDLISQYSIYIFVIFVFYTLNNFLIQFSKGIDKIGVTAISGVISTAVMLAMNVILLVVFDWGLLGFFIANVCGYVIPCIYIVSRLRLWELFEIKIDKKLQWEMVYYALPLVLNILSWWVNNTSDRYIVTAIVGIQASAIISVAYKIPQILSTISAIFIQSWQISAIKIQEDKSDTTFVSNMLLYYNALLLIIASGIILFVKPISNILFGISFYSAWELVPFLIISSLFNAISGCIGAIMGAKMDTHNIAKSALVGMIANIILNIVLTFLMGPQGITISTLIASFLIFYMRKDSVKEINSETYRAIYLSWILLVVEACLLIYMDFIIGALIAMVINLFLLKDVIKPLYLKIFKRN(SEQ ID NO:1243)
>orf00799
MIVLQYFKILARFVFMFLISAVLLPFKIKPNKIVFINFNGKGYGDNPKSICEYLRTTYPDLDLVWLARDNEGFPDGVRVVKYGTFQAFYEQASSKVWVYNVRAFARILKKRGQIYIQTWHGASSFKLIEKQADLPINYVLEAKYDARVTDIMISDSRKQTEEFQKYFWYSGEIFEVGMPRNDALFHYKEDYDKLNNIRKELSIHSDDYVILYAPTFRDDGDASYLDINFERLLQCVEHGIKKKCKFLIRLHPNHSHLCNNISFNKNIINATFYSDMQELTLLADVLVTDYSSSIFDFMLLNKPYVRYVNDLEKYAELRGVSDTYYELPDSIIKTAEELYDLLPKKIENFDYDSIKKYRNEILCPIFNGTASENVGGRIIQEL (SEQ ID NO:1244)
>orf00800
LKNNDLKIGSGAIHQISATLSQNSISGKILYCADPVVDDLYGSIVRSQIEEIGRVKEESCNYNTIAYAMNIAERAIATDIDCIVGMGGGRVLDVCKYASFISKRPYLSIPTTAANDGIASPVAVLKRQDDRPKSLGAAIPSMTLIDIDVIASGPIQNIKAGIGDTISNYTALKDWELAVERGKDEMHGFAYLMSQNSLDALMKTKYNSITPDFIEVLVNSLVLSGIAMDFAGSSRPVSGSEHLFSHALDYYGSTRNLHGIQVALGTVAVLKLIENSVDTVVDYLQRFEVHINPKLLGIDEELFIYCMQHATKMRSNRYTYLHEVDLSTDRLKQIYKELISEL(SEQ ID NO:1245)
>orf00801
MKALILAAGLGTRLAPITNEVPKSLVPVNGKPILMKQIENLYQNNITDITIIAGYKSSVLTDAVTEKYPEINIIDNVDFKTTNNMYSAYLGKAAMGDSDFLMMNADVFYDASVIKSLLLHKAPNAIVTDLGIYIEESMKVVEKNGRLVEISKQISPEETLGASIDVYKFSYEAGARFFEKCKEFIEDKRELQMWSEVALNAILSEVEFVACPLEGRWLEIDNHEDLVAAEKLFA(SEQ ID NO:1246)
>orf00802
MKLTNRVDYFGADISELQNKKLFLFDMDGTIYEEDRLFEGTLELLDYIHNIGGEYIFITNNSSKSVVDYVEKVNRLGIKAERDNFFTSAQATIVYIKENYPKSKVYCQGTKSLIKELSDAGIDVTEQVSADIDVVLVGFDTELTSDKIRNTCEILSTKDVPFIATNPDIRCPVSFGFIPDCGSICDMISKSVDRKPVYIGKPEPTMVDIVRKKLNYSLFETVVIGDRLYTDIMTGINAGVTSVCVLTGEATVNDIQQDSIKPTYTFKNVKEMWKGIV (SEQ ID NO:1247)
>orf00804
MKGIILAGGSGTRLYPLTRAASKQLMPVYDKPMIYYPLSTLMLAGIRDILIISTPQDLPRFKELLQDGSEFGIKLSYAEQPSPDGLAQAFIIGEEFIGDDSVALILGDNIYHGPGLSTMLQKAAKKEKGATVFGYHVKDPERFGVVEFDENMNAISIEEKPEYPRSNYAVTGLYFYDNDVVEIAKSIKPSPRGELEITDVNKAYLDRGDLSVELMGRGFAWLDTGTHESLLEASQYIETVQRMQNVQVANLEEIAYRMGYISREDVLALAQSLKKNEYGQYLLRLIGEA(SEQ ID NO:1248)
>orf00806
MTDNFFGKTLAARKVEAIPCMLEFDIPVHGDNRGWFKENFQKEKMLPLGFPESFFAEGKLQNNVSFSRKNVLRGLHAEPWDKYISVADGGKVLGSWVDLREGETFGNTYQTVIDASKGIFVPRGVANGFQVLSDTVSYSYLVNDYWALELKPKYAFVNYADPSLGIEWENIAEAEVSEADKNHPLLKDVKPLKKEDL (SEQ ID NO:1249)
>orf00810
MTEYKNIIVTGGAGFIGSNFVHYVYENFPDVHVTVLDKLTYAGNRANIEEILGNRVELVVGDIADAELVDKLAAQADAIVHYAAESHNDNSLNDPSPFIHTNFIGTYTLLEAARKYDIRFHHVSTDEVYGDLPLREDLPGHGEGPGEKFTAETKYNPSSPYSSTKAASDLIVKAWVRSFGVKATISNCSNNYGPYQHIEKFIPRQITNILSGIKPKLYGEGKNVRDWIHTNDHSSGVWTILTKGQIGETYLIGADGEKNNKEVLELILKEMGQAADAYDHVTDRAGHDLRYAIDASKLRDELGWKPEFTNFEAGLKATIKWYTDNQEWWKAEKEAVEANYAKTQEIITV(SEQ ID NO:1250)
>orf00813
MILITGANGQLGTELRYLLDERNEEYVAVDVAEMDITNEEMVEKVFEEVKPTLVYHCAAYTAVDAAEDEGKELDFAINVTGTKNVARASEKHGATLVYISTDYVFDGKKPVGQEWEVDDRPDPQTEYGRTKRMGEELVEKHVSNFYIIRTAWVFGNYGKNFVFTMQNLAKTHKTLTVVNDQYGRPTWTRTLAEFMTYLAENRKEFGYYHLSNDATEDTTWYDFAVEILKDTDVEVKPVDSSQFPAKAKRPLNSTMSLAKAKATGFVIPTWQDALQEFYKQEVR(SEQ ID NO:1251)
>orf00814
LVNCEPLEAYRQLEEAELVGCWAHVRRKFFEATPKQADKSSLGAKGLAYCNQLFSLERDWEALPADERLQKRQEELQPLMEDFFAWCRRQSVLSGSKLGRAIEYSLKYKETFKTILKDGHLVLSNNLAERAIKSLVMGRSKRVQWTLLA (SEQ ID NO:1252)
>orf00823
MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLPADLATALATAKENDGRDFEAPKVGEDQGSPEVTDGPKTEEELLALEKEKPAEEKPKEDKHAAAKPETLKTVTPEWQTVEKKEQQGTVTIREEKGVRYNQLSSTAQNDNAGKPALFEKKGLTVDANGNATVDLTFKEDSEKGKSRFGVFLKFKDTNNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVSVKTDNQEGVKTEDTPAEKETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDDAHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATMSNNTHVSGDDHIDVTNPMKDLAKGYMYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQWEKAYKGIVFPEYTKELPSAKVVITEDANADKNVDWQDGAIAYRSIMNNPQGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHTDGLGQGVLLKGYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASETYPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLARWEDLKKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHGGEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWENGTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDGSAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSDWAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYVLYRSKQTNPEMSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVYVGVDNRSNAKASITVNTGEKEVTTYTNKSLALNYVKAYAHNTRRNNATVDDTSYFQNMYAFFTTGSDVSNVTLTLSREAGDEATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLSEKHDPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLEMHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNANFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMTNYTKESMDALKEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFESLDAPAQPDEGLENAFDGNVSSLWHTSWNGGDVGKPATMVLKEATEITGLRYIPRGSGSNGNLRDVKLVVTDESGKEHTFAATDWPDNNKPKDIDFGKTIKAKKIVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALAKAQKLTDKDNQEEVASVQASMKYATDNHLLTERMVEYFADYLNQLKDSATKSDAPTVEKPEFKLSSLASEQGKTPDYKQEIDRPETPEQILPATGESQSDTALFLAGVSLALSALFVVKTKKD (SEQ ID NO:1253)
>orf00824
LQIAQESSQDTDGINPPVVEEAMVFDRNDCLNQICGNIISLGIDAAFRTQVSNELIFIVVDFTRSCCN (SEQ ID NO:1254)
>orf00826
MLNLMWMKIFHRNRTFLFCFLGFKVDVISIINARIVRR (SEQ ID NO:1255)
>orf00827
VYNSQALRQIVVVGSIDHLFKRHSSICEIFGLRKRWLSFL (SEQ ID NO:1256)
>orf00830
MTSIIFSAKDIFEQEFGREVRGYSKVEVDEFLDDVIKDYETYATLVKSLRQEIADLKEELTRKPQVSSAPSPSHPDPIDVAASSSMTNFDILKRLNRLEKEVFGKQILDNTDL(SEQ ID NO:1257)
>orf00854
LISIKHFFWLPLSKKMIIDIIVNKNPDRFCMIEKVKKTMAENR (SEQ ID NO:1258)
>orf00858
VNIDSSEFYISHITDGIFDSFLDSNRYLRNFYSVLKVEIDICCEFFVHVFKINATAE(SEQ ID NO:1259)
>orf00859
VNPLYLCSSDSNDFFKYTWGDNDFAKLFFNSHRMTSF (SEQ ID NO:1260)
>orf00887
LAQISILHFDFLSIDKHSHTVFNTLRKSLQTTLALSATSKQCFEQLAASFLVCSLIFIEYKV(SEQ ID NO:1261)
>orf00897
MLYVGIDIAKNKHDVTALNVPGKTVLKPLTFSNNKAGFELLDLSLRQLNQDCLIALKLLSDPNREQFQHDNRQVDLKILARHIHRLKKKQSDWKVQYTRCLDIIFPELDKIVGKHSEYTYQLLTRYPNPQKRIEAGFDKLIEIKRLTASKIQDILSVAPRSIETTSPAREFEIIEIIKHYKRLIDKAETCVNDLMAEFNSVITTVTGIGGRLGAVILAEIRNIHAFDNPAQLQAFAGLDSSIYQSGQIDLAGRMIKRGSPHLRWALIQAAKACARFSPAFKAYLKTKLEQGKHYNVAIIHLAKKLIRTLFYILKKSCHLTNKK (SEQ ID NO:1262)
>orf00900
MDTKSSCLITTGRNDSPSTCLPRVASNNDRFSSEFRIIPDFHCSKKGIHVNMDDFS(SEQ ID NO:1263)
>orf00903
MMSIREQDLKDIGAIIKYKNFHSPFDTFKYLKDMGFDTIDLSVLLEGFSYAYGMDWLEKFFKENQDKLREFY (SEQ ID NO:1264)
>orf00909
MIPLYRTDNDITKFFTKIRNGHLAKTAGGLDDKFHEANASTSKAFDRQGVGEVNDIRDSAGSQELRINDKRKTENILFLEIRVRIFRVPHPNDSFFSSHFLG(SEQ ID NO:1265)
>orf00910
VLSQGDKDITILDAGLLKNGKIGPVTKDTNDIKATDNMIENSFVLLNQQNIMLFCNQGATEGKTNFSPSDKDNFHNKTYFFMM (SEQ ID NO:1266)
>orf00915
MKIKEQTRKLAAGCSKQCFEIVDRTDEVSSKHGFEVVDETDEVSNHTYGKAKLTWFEEIFEEYKMMGKAGQLVFFDVYRLVRQVS (SEQ ID NO:1267)
>orf00942
LVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYGNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQLWK (SEQ ID NO:1268)
>orf00963
VDRTDEVSSKHCFEVVDTTDEVSSKHCFEVVDRTDEVSNHTHDKPTLTWFEEIFEEYHSPFHN (SEQ ID NO:1269)
>orf00964
LDNIHIVLDSLNAVSGIQDFICDGLAIFCDQITSGCSSCK (SEQ ID NO:1270)
>orf00979MKSTLGIISVGLVITYILQQVMSFSRDYLLTVLSQRLSIDVILSYIRHIFELPMSFFATRRTGEIISRFTDANSIIDALASTILSLFLDVSILILVGGVLLAQNPNLFLLSLISIPIYMFIIFSFMKPFEKMNHDVMQSNSMVSSAIIEDINGIETIKSLTSEENRYQNIDSEFVDYLEKSFKLSKYSILQTSLKQGTKLVLNILILWFGAQLVMSSKISIGQLITFNTLFSYFTTPMENIINLQTKLQSAKVANNRLNEVYLVESEFQVQENPVHSHFLMGDIEFDDLSYKYGFGRDTLTDINLTIKQGDKVSLVGVSGSGKTTLAKMIVNFFEPYKGHISINHQDIKNIDKKVLRRHINYLPQQAYIFNGSILENLTLGGNHMISQEDILRACELAEIRQDIERMPMGYQTQLSDGAGLSGGQKQRIALARALLTKAPVLILDEATSGLDVLTEKKVIDNLMSLTDKTILFVAHRLSIAERTNRVIVLDQGKIIEVGSHQELMQAQGFYHHLFNK(SEQ ID NO:1271)
>orf00981
MTSYKRTFVPQIDARDCGVAALASIAKFYGSDFSLAHLRELAKTNKEGTTALGIVKAADEMGFETRPVQADKTLFDMSDVPYPFIVHVNKEGKLQHYYVVYQTKKDYLIIGDPDPSVKITKMSKERFFYEWTGVAIFLATKPSYQPHKDKKNGLLSKLPSSDFQTKISHCLHCSLKLIGHYYQYRWFLLSPRNLG (SEQ ID NO:1272)
>orf00984
MDTKMMSQFSVMDTEMLACVEGGGCNWGDFAKAGVGGAAVVAALGCAAGGVKYGKILGPWGAAIGGIGGAVVCGYLAYTATS (SEQ ID NO:1273)
>orf00988
MKKKILIIFVLYLIMSIFLYPLRESIWYNLFYTIAYMIAVMIYFSLIKKKEKK(SEQ ID NO:1274)
>orf01008
LNCKGNDHPKEFHNPNNRFDKKNSKKTKKNFILSPLA (SEQ ID NO:1275)
>orf01009
MKIKEQTRKLAAGCSKQCFEVVDRTDEVSSKHRFEVVDRTDEVSSKHRFEVVDRTDEVSSKHRFEVVDRTDEVSNIYTAR (SEQ ID NO:1276)
>orf01017
MHSQTFQFLLMTDKTSLLHRKHRSFIRNIHSKFLILFDLLCGILSRNDSNHNPIS(SEQ ID NO:1277)
>orf01021
MSDVKEEVSSLSEKQLRQIDVEYAELNDSDIIERLAYLEINNNEKRIVISDIEPTKEIMSVSDQIFEIQKNFQKIKNMFELFISDVSDFLSIKNKLESKELEIEEADVNRFMIHLLSSGKLFVDFNENQIKQKYSKDSEEFDCIHGFASYQYDTNFAYRFCHSLRNYSQHTDLPINEVKAVSPDDETVIIDFYIDLDYLLNSNFKWKKLKGELIKLNQETSKIDAIALVKEYFNALTELYGNYNKLFLKLNHNTLVDIKSKLESLKLKHSRYYISKISKYDLKYNPGNYTMSPLAAFAEIEEIYIELSKIGLVKIVNKSN (SEQ ID NO:1278)
>orf01025MSKHPHYELLNLIGYGLAKFDKLFIKEFQCFSKSEFYRYVVSLGIAETTGVVKNRMDLFDPYFDNNRKGWWQKAEVYRFRKDLIDMMFGNEDVHSYAEIVKMLLASEGKKTGITIVEKPIVRTKFKRLQETGMEAENYFILHFDKEEKFQGGQLTDARLYGDGYDFQVDVQEYSYLAEVKGIRKSKGRVRLTAKEFEKVKEFQSDFILSLVTNLDDIPKLVLIDNPLKHFEFKKNIIKNEIIEYRSVEDLY (SEQ ID NO:1279)
>orf01027
MFIAEFTAILLNEFPVALDSLVFMGFSMKLIHDLDTHTTHSTAKMLYNMKAIKNDFSIRE(SEQ ID NO:1280)
>orf01049
LKHLFCHFNPLWIDEIIRLAYKDQDTKDVKSKVKIGN (SEQ ID NO:1281)
>orf01077
LCCNRHIANLDLEFISYHLGQVGFDTRISTGLGIFVTKIGNVLFDTDNQFASFLNVCDTCISLDWFGSSKAEKANQ (SEQ ID NO:1282)
>orf01095
MKEIAFDAFYQLYQNDQLSLVDVREVDEFAALHLECAHNLPLSQLADSYD(SEQ ID NO:1283)
>orf01098
MCLICQRIELIKAGQNPYFVKELETGYVVIGDYQYFKGYTLFLAKDHVTELHHMETSVKLRFLEEMSLVQEAVAKAFEAEKMNIELLGNGDAHAHWHLFPRRAGDMKSHGLNGRGPVWWVPWEEMAAEDCQVQSPELEEMIKILSHELEKYLA (SEQ ID NO:1284)
>orf01099
MKKRYVILSGLLALTLAACSQEKTKVEENTQKTEQSSQPEGTVGSKSQASSQKKAEVSNKGSYYSIQGKYDEIILANKRYPLSKDYNPGENPTAKAELLKLIAAMQAEGYPISDQYSGFRSYETQAKLYQDYVNQDGKEAADRYSARPGYSEHQTGLAFDLIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLEEYYGFEGGDYVD(SEQ ID NO:1285)
>orf01104
MKTKEQTRKLAAGCSKHCFEVVDRTDEVSNHTHGKATLTWFEEIFKEY(SEQ ID NO:1286)
>orf01105
MDFFFMNEVKEQVLFRDNHSEHIFWIEGVSDFMIKVNTALW (SEQ ID NO:1287)
>orf01109
VCFLGFQTILANPSKPQRQLPFLIFILDFFNYKHHKFLS (SEQ ID NO:1288)
>orf01124
MEELVTLDCLFIDGTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEEITPLLIKYAMFDKEQKRGYKESAKNLANWHYNDKEDSYTHPDGWYYRFHHTKHQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQGRQIFAQRKIDVEPVFGQIKASLGYKRCNLRGK(SEQ ID NO:1289)
>orf01126
MHIHYNTNQTTLPLEISSFLPQDHLVFTIEKVVNTLEDCHFHAFYHAFDRLSYHLKMLVSTLLFAYSQGIFSGRKIEKWKS (SEQ ID NO:1290)
>orf01129
LRLWVIFVIMKVIKSYDTLNDYYRKLFGEKTFKVPIDAGFDCPNRDGTVAHGGCTFCTVSGSGDAIVAPDPPIREQFYKEIDFIHRKWPDVQKYLVYFQNFTNTHEKVEVIRERYEQAINEPGVVGINIGTRPDCLPDETIEYLAELSECMHVTVELGLQTTYEATSDLINRAHSYEL(SEQ ID NO:1291)
>orf01131
VETVKRLRKYPKIEIVSHLINGLPGETHEMMVENVRRCVTDNDIQGIKLHLLHLMTNTRMQRDYHEGRLQLMSQDEYVRVICDQLEIIPKHIVIHRITGDAPRDMLIGPMWSLNKWEVLNSIEMEMRRRGSVQGCKAVKQEFENEKTT (SEQ ID NO:1292)
>orf01143
VQVCVFTNFCFFHCFSSLANCRLFNLRGICLPCISYQ (SEQ ID NO:1293)
>orf01152
VFKKDRFSIRKIKGVVGSVFLGSLLMAPSVVDAATYHYVNKEIISQEAKDLIQTGKPDRNEVVYGLVYQKDQLPQTGTEASVLTAFGYLSGDILKTLGLDTVLEETSAKPGEVTVVEVETPQSTTNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEIKPTDDTLPKVEEGKEDSAEPAPVEEVGGEVESKPEEKVAVKPESQPSDKPTEEPKVEQVGEPVEPSEDEQAPTAPVEPEKQPEAPEEEKAVEETPKPEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDYSTASYNDLGPVLETAKGVYASEPVKQPEVNSETNKLKTAIDALNVDKTELNNTIADAKTKVKEHYSDRSWQNLQTGVTEAEKVAANTDAKQSEVNSETASLKTAISRLNTDKVELENQLKIAQGKTETDFSMESWTVLSTAKNKAQEVKDNGTATQEQINEAEKSLKTALADLSVDKTALGSAIDTATKKNKENYTNQTWAELETALTAAKSVNTNESKQSDVNEAAEKLTATMEKLVELSEKPRLTLSIEKRDIDRKATVTYTLENPANTQIKSITATLKKGEEVVKDFVLTEENLKTNHLTALFEKLDYYKEYTLSTDMVYNRGNDDETESISEELIQLNLKKLELKDIQTVSLMKFENGQESQVTHLSDKPTDLSKLYLKVTSSTSKDAVLAVSSIEEEIIENKKIFKIHADTPELVVRKKDGSLSKGFDYYMERVIPHDGDIYYDFKDLISAMTSNPTGTFILGRDISSRNVKPDGNGKSYIKGEFKGKLLGTNDNVRHSIFDLEYPLFDTIKSGVVKDIDFKHVNMVFPDSNQGDNVATIARVIKDKTKIENVNVEGYLEGRDHVAGLVNNLEGNSEIENVSFTGKIKSKGGNSITAGIAGRNILSRVKRAYVNADIEVHRSSNSSMLVAVNGINADASGGWGTWGRLTESVAKGTLETKQGGQAGGASSTVWPYGAIDNVVSYAKVTKGKELFGSDGDLNYDWFMKKISNIFGVQGISSGDSGSDSKFTRISEEEANQKVASYNITAPNLMSDSSLLVDRLNESWKNTDQFESIQDYQAQNQLIYQNLTKFTPYYNKEFIVHEGNALTPEQEILKTKKIKSIVGLKGTEFVVDGSDIDTIMLHFEDGSQKRYKVTSTGKFSITNLPEYQVEDLNVVYTSEHIVHPLDSSLINNLVEELKKVELYTESTYQVLGIDKDNANKLNRTKRLFLDESLDAVKTQLPTFVKTMFENEWLHINGESSGAVAALRQKIMDNKTAILLALTYINRYYDVKFSDYNIKKLMLFKPTFHGEKIDLLDRLIRLGSSGENRLKGSENAETFKQLFASETKQKDLVTYLDYNRSLLTNYQTTGEWFKETTKDYIQFEERPSLVEEIKDAKYRVYDNLTAPYYQGYILPLLTLKNTHLAILSNYSTMTFVSREKRPNWKNEDFDKWVKYVATAHRNHVDTWYKILPDNIKGKMVKENVTAVWEGLSIPGSEWVDQNAVDRKGRDYAPAREFFNLVGGPMGGWYAYHGYGAHAGGRNRVNYEVFDVLSEYGISVFTHELTHVNDTWIYLGGYGRRENMGPEAYAQGLFQSPVPGQPGWGALGLNMAFERKNDGDLIYNASPTQFENRKELDSYMKNYNDTLMMVDYLEGDAVISKGKEAITKWFKKVEPKVVSQTAQYDTVRQLTAEEKEKLSVSSVDDLVDQGLMSDRAVGNNTYNPADFETSYIAIDYMTGIYGGGKNSVGSPGALMFKHNTFRMWGYYGFEEGVLGYASNKFKQASRDEGHAGLSDNFIISKISKGEFLTMEAFKKGYFKKVVEELKTKGIRPVTINQKTYSTFEELQEGFKQAVERDLKKNQLDERETRNFKFQVFRQLLQQTDSFKTSIFR (SEQ ID NO:1294)
>orf01156
MKSKIVLGASLAIATLSLVSLVEIEGLSPFLIENVSANTHSANKVINHKVSIYLENADEGKGLTVNFSTDSVSPNLFDEFEKKSGITITTMLVNAKTGEVVEKRLTPSVFLRSNDLTSGTISSFIFSEYPDGEYKYVVSKGDFIDPKTQFKHQYRGESPVFRIRNRKYVELGTTDKKLDERRDNSVYKDGVVEHKVNLSLTSYQGGNGVTAIFSTDSVNSNLLNSFGEKAKKVLIRSKLINVKTGEVIDETFSPKVSLTSKILKSGSTAVFYFIDLTDGEYKYVAYESQQYTDPQTTLTHQYRGESPIFSIKDGKFSGLVSASKPDENPKPTPKPDEKPKPSAPQQEKTKPTVQSGWVGSSYYQNGKKVTSKWIFDKKYNSYFYLDASGNYVQNAWVGNYYLKSGGYMAKSEWIYDKNYGSYYYLTSEGSYARNTWSGNYYLKSNGKMAKSEWVYDSNYKSYYYLTSEGSYARNTWVGNYYLKSNGKMAVNERTPDGYRVDGSGKWVK (SEQ ID NO:1295)
>orf01157
MSACTVCAEKGRTPDLSIVDNVPIVENAKAHENNFFYSSDITYYPIF(SEQ ID NO:1296)
>orf01158
MAKYYIILPKDAEIYKTWRGTVNIPIIDATKTTPELSYFKEDHRNYIANENKSGANYIEWKGTVEEFKEAIKKLTDKKSTTATPKKDEKPTPKPDEKPKPTPTVQSGWVGSSYYQDGKKVISKWIFDKKYNSYFYLDASGNYVQNAWVGNYYLKSGGYMAKGEWVYDATYQAWYYLTSDGSYAYSTWQGNYYLKSDGKMAVNEWVDGGRYYVGADGVWKEGQASTASSSNDSNSEYSAALGKAKSYNSLFHMSKKNVCIDN (SEQ ID NO:1297)
>orf01179
VSRWDGHSDKGEAPAGKTSYAWIWTKWGEQVAFYCDYD (SEQ ID NO:1298)
>orf01193
MKSKKGGELRIAVFGDKKPFGYVDNDGSYQGYDIELGNQLAKDLGVKVKYISVDAANRAEYLISNKVDITLANFTVTDERKKQVDFALPYMKVSLGVVSPKTGLITDVKQLEGKTLIVTKGTTAETYFEKNHPEIKLQKYDQYSDSYQALLDGRGDAFSTDNTEVLAWALENKGFEVGITSLGDPDTIAAAVQKGNQELLDFINKDIEKLGKENFFHKAYEKTLHPTYGDAAKADDLVVEGGH (SEQ ID NO:1299)
>orf01194
MKLFKPLLTVLALAFALIFITACSSGGNAGSSSGKTTAKARTIDEIKKRR(SEQ ID NO:1300)
>orf01231
MYQDEAGFGRISKLGSCWAPIGVGPHVHSHYIREFHYCYGAVDAHTGESFFLIAGGCNTEWMNAFLEELSQAYPDDYLLLVMDNAIWHKSSTLKIPTNIGFTFIPPYTPEMNPLNKCGKRFVNVDLRIRPFELWKMS (SEQ ID NO:1301)
>orf01233
MVTATTCFLKERVEFELLIFFYISPNRCLITVYSVLNL (SEQ ID NO:1302)
>orf01234
MDTPDENGYVADDYRITYLEAHIKAMRDAIYKDGVDLLGYTTWGCIDSVSAGTGEMNKRYGFIYVDRDNVGNGTLKCSKKKSFYWYMSFIAMV (SEQ ID NO:1303)
>orf01255
MFLGMIGNISIILQFFGITIIVKIDNQARAIDFFKHDKSSF (SEQ ID NO:1304)
>orf01257
MFSLNFFDDSVFLSIKIAHKGCFQLLDMTNPNFFNKFFLAQASDQLLHFLSWNIEL(SEQ ID NO:1305)
>orf01266
MTEPDFWNDNIAAQKTSQELNELKNTYNTFHKMEELQDEVEILLDFLAEDESVHDELVAQLAELDKIMTSYEMTLLLSEPYDHNNAILEIHPGSGGTEAQDWGDMLLRMYTRYGNAKGFKVEVLDYQAGDEAGIKSVTLSFEGPNAYGLLKSEMGVHRLVRISPFDSAKRRHTSFTSVEVMPELDDTIEVEIREDDIKMDTFRSGGAGGQNVNKVSTGVRLTHIPTGIVVQSTVDRTQYGNRDRAMKMLQAKLYQMEQDKKAAEVDSLKGEKKEITWGSQIRSYVFTPYTMVKDHRTSFEVAQVDKVMDGDLDGFIDAYLKWRIS (SEQ ID NO:1306)
>orf01267
MLQSNQVQNFHHSSFDITAIFPDYFHSVSNIFIDSFLW (SEQ ID NO:1307)
>orf01299
MQVIKRNGEIAEFNPDKIYQAILKAAQTVYVLTDDLRQNLAQVTKKVVLDLQEAKVERATISMIQSMVEHRLLGAGYITIAEHYISYRLQRDLERSGYGDHIAVHLHFEQIR(SEQ ID NO:1308)
>orf01305
MKLKLCIIGFFFCLIATIGLVTISDTEIPIPLPIDGAFSIQGKSNLSNNEIYEMVRDLSKTEKVTIYKPIVQSSGQLKYVNFDDVNNEQLKSAPIVGMYYTLGKMDVDSLKPLTMTGLQTVYMAYPWYIGGILQFTGTLRILLMGSIYLTLLVVLFVVRTRQIKEGVIRRSLGLPIYDLRREYGISLIFELIMMALLMISYSSFLGNGFFTYSSKLFFSLLLTNFILFQIIDLITFVLFWLTIQIEKPIEIIKNKAKNKLIFVVWLAIISIIILVSGVFLQETKSSQSSINIQIQNLVPWDTVKDWRRIEFLGIESNSTKNREVNDSDGQYLQIVAALKNLDFLYIERSSAYVPDFMKTSHVIENFSKQLENDGITNPEINKELIYINQTGANLQNKVNGTNYHLLDNKIATIYIPEKWKENQKSIENTVVAEQFIGTNYTREQLAVQIIPDGEKIFYFNEDADNNLKMKDILPLANVADSKDNIVVVLDTDKMMENNKFSLASNILYKSLFSPEAVKKINEMTVLLNFSMNPVDVYQIVKLKIQSLEHQILLSQILQKIIYSIVFILIYQYVQLFITLKQNEYVKKIILGLSKTYIAISSLKYFMMTITMVILFTFLMTGQIELLYIGAASLLVLMLSIIMSFRKLSESYTKILKGDES(SEQ ID NO:1309)
>orf01306
MTIDLLNVSKSFGSKKIFTDLNLIFESGKSYALIGGSGSGKSTLLNIIGRLEKIDSGNVLVDKQDIWKIKERTFFKNTVGYVFQNYSLIDNKTVYDNLSLITKDKKTITDVLEKVGLSSDYLHQKIYELSGGQAQRVAIARMLMKPRKIILADEPTGALDGEIGKEIIRLLLNETAEDKYVIIATHDPAVYNEVDVIIDMKDIGYKV (SEQ ID NO:1310)
>orf01307
MKKKIYIALIFVTGVLAIFFFGKQMITKENINKPTVELTIYTLSSSDTEKWNKVRQVETEEAIYFITVKEVSSSEEVFSNIIANGAATGFGVREEEVKKFNNGLGDTIEDSKHNKLIEIEFFTFSDDGAGFVVANFDYGKEELNSQKKDIKELYKKIYESFKEKNK(SEQ ID NO:1311)
>orf01317
MKIKEQTRKLAAGCSKHSFEVVDETDEVSSKHSFEVVDETDEVSNHT(SEQ ID NO:1312)
>orf01324
MELFKTWKKNMVLYGLKSQIGTVYRNNDRTTSFYDVGNFLYLAGELDSRFWEDFVRKYGLDYKIIISENTNWQDFLHRKVGLNSFTRYSFKDKANFQVEFLNNLVTHLEEGYNIVPIDNHIYNCFSTEEWSQDLQGDFESYQDFVLKGGFGFVILKNNELIAGISSGLVYRKAVEVEVATRPNEQGNGFAKKLGAAMILESLNRDMFPLWDAHNEASKKVAEFLGYELSEPYEAFELEEILI (SEQ ID NO:1313)
>orf01369
MKVIDQALLEKVIIERSRTSHKGDYGRLLFLGGTYPYGGAIIMAALAAVKSGAGLVTVGTDRENIPALHSHLPEAMAFSLQDQQLLKEQLEKAEVVLLGPGLRDDAFGEDLVKQVFAGLRQNQILIVDGGALTILARTSLSFSSSQLILTPHQKEWEKLSGITIEKQKEDATASALTSFPQGTILVEKGSATRIWQAGQSDYYQLQVGGPYQATGGMGDTLAGMIAGFAGQFKQASLYERVAVATHLHSAIAQELAQEQYVVLPTEISNCLPKVMKRYV (SEQ ID NO:1314)
>orf01376
VLDSKEELKESENDAPKLETPLREEPRLAPQTLPEASEVLENKREESKVEITEPAQADDIRKVVGELAKDISITKLYMTGHSLGGYLAQIAAVEDYQKYPDFYNHVLRKVTTFSAPKVITSRTVWDAKNGF(SEQ ID NO:1315)
>orf01404
MGRKPKKRPEERTELEHLQAENEYLRAENAILKKLRELRLKEEKEKEERQKLFKN(SEQ ID NO:1316)
>orf01417
VVLSTSAILVACGKTDKEADAPTTFSYVYAVDPASLGYSIATRTSRTDVIGNVIDGLMENDKYGNVAPSQKDYDLNSTGWAPSYQDPASYLNIMDPKSGSAMKHLGITKGKDKDVVAKPGLDKYKKLLEDAVSETTDLEKRYEKYAKAQAWSTDSSLLMPTASSGGSPVVSNVVPFSKPYSQVGIKGEPYIFKGMKLQKDIVTTKEYNEVFKKWQKEKLESNSKYQKELEKYIK(SEQ ID NO:1317)
>orf01421
LNFDFFIFLAHFIPLFTFSILQENPKTSKKKLYIRLL (SEQ ID NO:1318)
>orf01428
MRLSMKLIHDLDMHTTHSTAKMLYNMKAIKNDFSIRE (SEQ ID NO:1319)
>orf01442
LIRIIRNIYRSGEGNTSVFQSFIDQINSNQFCYGSNFDRLRCILLIENFTSICLNSNRMFSGNGKILSNSSRSTP (SEQ ID NO:1320)
>orf01453
MSNYRRTSKPKTEHIKKGFTVFQKTITTIGSILGLITAGITIMNALDNNNKNTKKEPTTSQTTTFVKEIQKESPQENTTPNKENNTSQEKTQQEETPKSSVKEEKKEDQKTATQDSTTPATSKPATENEKQPNTPTSENNTQ (SEQ ID NO:1321)
>orf01457
MNQSYFYLKMKEHKLKVPYTGKERRVRILLPKDYEKDTDRSYPVVYFHDGQNVFNSKESFIGHSWKIIPAIKRNPDISRMIVVAIDNDGMGRMNEYAAWKFQESPIPEQQFGGKGVEYAEFVMEVVKPFIKHKTGWFDGMMTTGCSMGAYHALNFFLQHPDVFTKVIALSGVYDARFFVGDYYNDDAIYQNSPVDYIWNQNDGWFIDRYRQAEIVLCTGLGAWEQDGLPSFYKLKEAFDQKQIPAWFAEWGHDVAHDWEWWRKQMPYFLGNLYL (SEQ ID NO:1322)
>orf01466
MSSIHTKNSSLKSKSRFNEMFGDPLNNNKKFAVKTGQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAWKSRKYLIDNPTIIIGRVGAYCGNVRTTHGKVWISDNAIYIKEFKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQKPLENQKYILPPLALQNEFADFVALVDKSQFACEIAIKVWRNSLKFSII (SEQ ID NO:1323)
>orf01476
LVYAPFSFNILLDYITFDFKILLFSVFLAINRFHNDFIQFLL (SEQ ID NO:1324)
>orf01479
MSEYSGLSFFEVALAEFLDIVSAVYLEDADGIIVNLWGILDK (SEQ ID NO:1325)
>orf01490
MEGVAKGRIGRKKNNGIDNRCCHKKRNGRVTWNLFFQKTIDDGDDSTFTRREKYTDKGPKKDSPPTISREKMINLVRCDINFNQP (SEQ ID NO:1326)
>orf01493
VDKTDEVSSKHCFEVVDRTDEVSSKHCFEVVDRTDEVSSKHCFEVVDRTDEVSNRTTVRRS (SEQ ID NO:1327)
>orf01495
MTEFMSDNFPKNLHTQFLINLGIKIQMPIFGEKSPTCRT (SEQ ID NO:1328)
>orf01503
VVIGVASATTNIWIIFLSGFTAILAGAFSMAGGEYVSVSTPKDTEEAAVSREKLLLDQDRELAKKSLYAAYIQNGEFKTSAQLLTNKIFLKNPLKALVEEKYGIEYEEFTNPWHAAISSFVAFFLRSLPPMLSVTIFPSDYRIPATVLIVGVALLLTGYTSARLGKAPTKTAMIRNLAIGLLTMGVTFLLEQLFSI (SEQ ID NO:1329)
>orf01535
MSFKNNWIDKEGRVFIYFTVEEIMKRRNISKPTAIKTLDELDIKKGIGLIERVRLGLGKPNIIYVKDFMSIFQVKENDLQKSKNLTSEVKDFNLRSKENELQEVKNLDSNYIENNKSKYSKREYSFGENGLGTFQNVFLAAEDI (SEQ ID NO:1330)
>orf01543
LLHIRVCKTFDRIPYCMLALFLSKSIGLTILLHKVKTVIFIDDQSNDKTCKICIHISFFRIKLSQQCQLSFSVYF (SEQ ID NO:1331)
>orf01547
MTQEDALIVISHIKVLSIVPNRCLKPLDKTFSLYNWIFLSQKYILLQANFLKISRVQLQ(SEQ ID NO:1332)
>orf01552
VTTHDEPVYEKHGVLHYAVANIPGAVARTSTIALTNVTLPYIEALAGKGFAQAISEDEGLRQGVTTYQGYLTSLPVAQGLNRDYTDINDLV (SEQ ID NO:1333)
>orf01553
VFFIDGFIVRCHTVSCFDNATLVNSNVNDTEPGRICLTISSVTNSGAFAPGMRMAPITTSASLTLASMLNELDIRV (SEQ ID NO:1334)
>orf01555
LSTKTKGDAGSMCTDDTITDDSYFSFGTPGTPPGRTPEPPACLVRK(SEQ ID NO:1335)
>orf01556
MLIGIPKEIKNNENRVALTPAGVHSLVSRGHRVLIETNAGLGSGFTDADYQKQGAEIVATAGEAWAAELVVKVKEPLNSEYGYLRDDLLLFTYLHMAAAPELADAMLAAKTTGIAYETVRDNQGQLPLLVPMSEVAGRMAV (SEQ ID NO:1336)
>orf01576
MASRNVLSMEPKFLLAGHFKGQFLILKIVSSDIDDGFAIAC (SEQ ID NO:1337)
>orf01577
LSSADKTCLNQFFTDFSDFFQSSLVEDGFYTFQIENSGFGFFNQISQVFDSFFEFLIPFKIALGILVGSQSLIKRNHDRLVGIVVV (SEQ ID NO:1338)
>orf01578
VSVFLKFIFNTTNQFTGLLFDAVALSLILIVGVQQIRKICKRLSHLICKRNWTEGSLSQAWLGFLIEKISESGKFFTNQYPFQFICSIASQTLKEALKIFCC(SEQ ID NO:1339)
>orf01579
MVSNLVFIGNCNFHNTVIFHLLNRLNQGPLQILSQNHDKGRRLSWIFKSRLGQLNASKNWMGRKEQAMALAIAADLQDQLLFKRLIDFLDATIH (SEQ ID NO:1340)
>orf01599
LLYNPVEKTRVHIKKGIGKLQYLFTRLFYLIFVSTDYISYGSSSEG (SEQ ID NO:1341)
>orf01630
MRSYITLICNLNNNLFCLNSFFLTNLVWSQIFSLLSVFITVYI (SEQ ID NO:1342)
>orf01631
LLEITLKSPYQFAHILFQSTIVPHGGHYHFIPESDLSAGELAATYVFNPNDIVRDTGDAYIVRHGDHYHYIPKSSLNNPPSHSNTEEVGSSSSSVLSNPSLHVHHEEEDGHGFDANRIISEDSEGFVIPHGDHNHYIKVQTKGYEAALKNKIPSLQSNYQPGTFDEKAVLAKVDQLLADSRSIYKDRLS (SEQ ID NO:1343)
>orf01664
MARLEPAKIAKIVLGILLYIIDLIKSSFVLPIPKAAKKSLILISFVPSFNDKNIVIRRPRQITKIMPRFICFLFRIFACIS (SEQ ID NO:1344)
>orf01680
MELSAIYHGPESEYAYLYKDKKLHIRIRTKKGDIESINLHYGDPFIFMEEFYQDTKEMVKITSGTLFDHWQVEVSVDFARIQYLFELRDTEGQNILYGDKGCVENSLENLHAIGNGFKLPYLHEIDACKVPDWVSNTVWYQIFPERFANGNALLNPEGTLDWDSSVTPKSDDFFGGDLQGIIDHMDYLQDLGITGLYLCPIFESTSNHKYNTTDYFEIDRHFGDKETFRELVDQAHHRGMKVMLDAVFNHIASQSLQWKNVVKNGEQSAYKDWFHIQQFPVTTEKLVNKRDLPYHVFGFEDYMPKLNTANPEVKNYLLKVATYWIEEFNIDAWRLDVANEIDHQFWKDFRKAVLAKNPDLYILGEVWHTSQPWLNGDEFHAVMNYPLSDSIKDYFLRGIKKTDQFIDEINGESMYYKQQISEVMFNLLDSHDTERILWTANEDVQLVKSALAFLFLQKGTPCIYYGTELALTGGPDPDCRRCMPWERVSSDNDMLNFMKRLIKIRKYASVIISHGKYSLQEINSDLVALEWKYEGRILKAIFNQSTEDYLLEKEAVALASNCQELDNQLVISPDGFMIF (SEQ ID NO:1345)
>orf01688
MGQEIKLIRKQFRITRQEEKQIKEMMREQKVDSFSEFLRQNLLKKNYQDRIFESWFSLWQSQKFEQISRDVYEVLVVARENHQVTQEHVSILLTCVQELIAEVNQVQPLSREFREKYMG(SEQ ID NO:1346)
>orf01689
MVYRYRTNLKKVFLTDPELHQLNERIAKSNCQNFSVYARKVLLNPNMSFVTINTDTYDQLVFELRRIGNNINQIARAINQSHLISQDQLQELSKGVGELIKEVDKEFQV(SEQ ID NO:1347)
>orf01690
MVVTKHFATHGKKYRRRLIKYILNPDKTDNLKLVSDFGMSNYLDFPSHTEMVEMYNVNFTNNDKLYESRNDRQEKHQQTIHAHHLIQSFSPEDNLTPEEINRIGYETMMELTGGRFKFIVATHTDKDHVHNHILINAIDRNSDKKLIWNYALERNLRMISDRISKMAGAKIIEKRYSYRGYKKYRESSHKFELKQRLYFLMQQSK (SEQ ID NO:1348)
>orf01691
MMTDRAMTKPIRGRQLSKRDLYDEEFFRTHFAKQEIESRLEFLLNRVNSLEELITKAKELNLTIDLKQKNVTFILKENNQKISLGHQKISDKKLYDVKFFQDYFKNKEVIASEGLENLQEQYHAFQEERDKDKVSTEEIEEAFKTFKKPLRHLRKNEIPFVNLKWNLQRTK(SEQ ID NO:1359)
>orf01692
MEEENHKKYKVYIRETSSYFVYNKENMDNNCFIKGRTLIRQLSNDSQKLPYRRPTLKSLQEKISEINLMIELSNTNKQYQEIKDELVLEIAEIDMKLEETQEKIATLNKMAEVFINLKSEDEIGRKLAKYDFDQMNMTESIMLDRLNTDILKLQQELGNEINKYEEIARRLDLFVKIINTNKFTVLKFHENALLE (SEQ ID NO:1350)
>orf01693
MKSKLGITLRKVRKGKQISLCSVADEHLSKSQISRFERGESEISCIRLINILDKLHITLDEFLVLHNDDYTSSESFANLVQYIRKQYSSQSINNIACLLSDTSDYTLNSFEKTMVKSILHTMDSNIIPSDEELLHLTDYLFKIEKWGYYEIILLGNCVRTINYNSYFLLTKEMLNNYIYSSLNKTNKQIVSQLAINCFILSIDKEEFSNCSYLISKIKTLLDNELNFYEQTVFLYATGYYEFKRQLSSGIETMKQAIQVLDILGEDKLKLHYTSHFDKLVNNK(SEQ ID NO:1351)
>orf01694
MSLSYYYEINPSTDILKCIEELLYKEDKCFNNILKNWKDIRRNHNDSFPNFWCYGAPGILLARKEIFDKTNIGNNDLSIIKNVLTNVEKIRELNLCHGSVGTISCLDAILKDEENLLIKESIDFYFDNVVSQVIKPELSTDLNTMNTFSFMLGVSGVVYEISRKQDDRLLNVLLLELRGHDD (SEQ ID NO:1352)
>orf01695 MMTRKVPNIEQMSQIECGLCCCLSILHFYKSKETLLDLRRDIEKGRDGYSIGDLKQLLNKRNFDTGSYQVKDVNKISELPLPLIAFWDNQHYVVIYKVKKNKVYIMDPSKGYINYEFKEFSKHFSNIVLLSFPNENYQSLKSQFPSPWIRVFSSFSKVKGRLILTLLFSIISYLIILSVPVMTSKFINSALGNTFSFQTSFLILFSLLCLYLISILARSMGILFSNIFFSRDIESFTFKHLLKLPYSFFELRAKGDILYRISSLSGFRELFTNQVVGGVVDIGTILSVVIYMFLSSKTLSIIALILSLINFLFLFSTRKIMYDTVNRELQEQSLIYSVETEALNTISSIKISGLEDEIYENWSKYLKNVLTKYKKRSIVHILYNSATNVFQLFAPIIILIFGLDNVLNGKILLGEVVAFQTMASILFSSEISIFNAYTQYILAAGYLNRVNDIWLENEENVENGLKKCSLEGRIDIKDLSFSYSKDSAPVIENLNLTIEPGQRIALVGQSGSGKSTLSKILSGLYKIDTGKILFDGVNINQIDKKILSQNLGVVPQDSFLLNRSILDNITLKNEVTSQKIEEVCKAVQIYDEIMAMPMKFNTIISEMGSNISGGQRQRIALARALINNPSIVILDEATSALDTINEERITKYIQSQGCTQIIIAHRLSTIKDADIIFVMKGGKIVESGNHKYLMDLGGEYYSLYTKRK(SEQ ID NO:1353)
>orf01696
MAIVEIINLTKSFKDIEVIHNTSFYLNKGKVYGFVGPNGAGKTTIIKMILGILKPDSGKITIFNQTVEQNSENILSRIGLVLGPSFYGHLDAYKNLKLIANMKGLSLDTERLNEYLSMVGLKDVKKKKVKNFSMGMKQRLSIAASLLGSPEILIWDEPINGLDPQGVIEIRSLIRFLQEKKGITFLISSHILSELDKVISDIIIINYGKVEFFGSCHYLLQKYNCRNLEEAYLACLAGGEYD (SEQ ID NO:1354)
>orf01697
MIKLEFLKQKKSILWFVLIFPIILNVLLYIDLTFRYRGYLLVHQNELALSNWQLIFKEQTIFYFSELFYLVLSLIIYEVFAVEFKNDAWLTVISLPFRNKYTINSKLLITVVYTFTFWLSDYISLYVIGKAIDNSLEIGLIFFLKTFTIQLISSLMIMLLYFLTLVLIRKISGIIPIGIIMMILTISIYYNDYNFKIYLPFTYLSHAFRVTESQFYMILLSNIIIIVLFYILIRKLNERSFEMKL(SEQ ID NO:1355)
>orf01698
MKLLKNELIKSKIFLFIIVDICIQILVILAIKTYILDISALYSELDYNKYWYILHTLIYMLMIFPIQILYQNLREALIEDNNNGWNIMVINTNNLVKIIYIKVTINIVRCFICYFVYTIFSLIQLGGMGTDMLLTNIIFPNIMSFLLFLPIAIFMQICCIRFDSILAKALPNILLILIVLITFQSDWNIFIPATYYYTEIQSTTNLGIKLLVCIWIMGFEFFLLPKLIKLKEQNLV(SEQ ID NO:1356)
>orf01725
MRFSAFKIFSNSVCKRIITKGLGFRALLLYTISKVKLREDILVSQSIVPVEIPQYCRFDSKKRNGILFNVRIANLKFTFFRLDFLRNKIWYSSSMNDEASKQLTDARFKRLVGVQRTTFEEMLAVLKTAYQLKHAKGGRKPKLSLEDLLMATLQYVREYRTYEEIAAVFGIHESNLIRRS(SEQ ID NO:1357)
>orf01753
MHTKSRTIKSLITQFTAILLYELPLALDSLVFMGFSMKLIHDLNTHTTHSTAKMLHNVKAIKNDFSIRE (SEQ ID NO:1358)
>orf01776
MIKIYFTKFSENHNPFCKIFEIIFTNLIFQSILNKNKKNPLRQGEANVV(SEQ ID NO:1359)
>orf01783
MSQVKGLCVLDVDGTLILEEVIDLLGREAGHEAEISQITSRAMRGELVFESSLRKRVSLLEGLPILVFDNVFNSIHLSLNVPEFISILQKNGILVGLVSGGFTPIVGEISKIPWYCLFHCQPA(SEQ ID NO:1360)
>orf01784
MLKSAELGIAFCSKEMLKKEIPHHVDKRDFLEVLPLIDCLE (SEQ ID NO:1361)
>orf01789
MFGNWFFKAFVCSLERLAQDRTMNWFSCIGNKNTVAFVPILIGCFA(SEQ ID NO:1362)
>orf01804
MEKYFGEKQERFSFRKLSVGLVSATISSLFFMSVLASSSVDAQETAGVHYKYVADSELSSEEKKQLVYDIPTYVENDDETYYLVYKLNSQNQLAELPNTGSKNERQALVAGASLAALGILIFAVSKKKVKNKTVLHLVLVAGMGNGVLVSVHALENHLLLNYNTDYELTSGEKLPIPKEISGYTYIGYIKEGKTTSDFEVSNQEKSAATPTKQQKVDYNVTPNFVDHPSTVQAIQEQTPVSSTKPTEVQVVEKPFSTELINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGVNTLNPQDEVLSGQLNKPELLYRDETIETKIDFQEEIQENPDLAEGTVRVKQEGALGKKVEIVRIFSVNNEEVSREIISTSTTAPVSRIVEKGTKKAQVIKEQAETGAEHKEVQSGAIVEPAIQPELPAAVLTDKGESAVQPELPEAVVSDKGVPEVQPALPEAVMTDKGDPEQVEPLPEYTGVQAGAIVEPEKVEPEYAGVQAGAIVEPEQVAPLPEYTGVQAGAIVEPEKVEAPKEYTGVQAGAIVEPEKVEAPKEYTGVQAGAIVEPEKVEPSKEYTGVQAGAIVEPEQIAPLPEYTGVQAGAIVEPEKVEALKEYTGKIEQPSAEDTKPNNENTNTPEEMSIQKKSSALINMNFITDSSKVTGVGSATFIAPNVLLTVAHNFINNSTDNTTGEFRGDKSKNVYEWVTPDGQKGTFTANNIHFYNKKDYPKGFIYDLAVIKLPETTGREHVELVKNYSKVNLNDKLNVHGYPAGKYTHLKDATVEMEQEYANNTYGVQYQGGNPGMSGGGIFNANGEVIGVHQNGAQNRSGGLILSPTQLAWIKSIIAGNEIPPVYDELYRHKDEKKDDAKDEKEVIKKLELRNISSVELYSKDGNKYRHVTSLASLPSNAENYFMKVKSENFKDVMLPVTSITNDTKDNRDVYKIVASANSLIQHENNNVLENYTYYLPKTQQSETGVYTSFKNLVDAMNSNPNGTFRLGATMDAREVELPDGQESYVNNVFHGILVGTNNEKYYAIYNLKKPLFGELNGATVEKLSLKDVNISAKDDTATLAKEANNNTHIDNVHADGAIAGERSIGGLVSQVNNSTISNSSYTGRITNTYKTVASYQIGGLVGKLSGPRGLIDKSFASIDLSSNATQGDQSIGGIVGAVENSALISNSYAEGNLNNVQRFANVGGVVGNLWDPVGGLEKSGRLSNVLSDVNVTNGNAIAGYNFNGIKANGTYSNKNNKVVNVVQEDDEILTKDSTVQRGEVLEDAQIKEKKATFVSKNTIKTEDFNFSSRYVTDYKNLENADSSKEKVYKNIEKLLPFYNRETIVKYGNLVETSSNLYNKELLSVVPMKDKEVISDINKNKSSINKLLLYYADNTSETLNVNYQTDFSNVAEYRIGGTNLIYTPNTLLRNYQNILDEVLPALNSVEYKSEAIRKVLDVSKDVSLTELYLEEQFNTTKTNLKDSLTKLLTADAAIAENNNKVIDNYVIEKIKNNKEALLLGLTYLERWYDFKYRDTKAKDLVMYHLDFFGKSNSSALDNVIELGKSGYNNLLAKNNVITYNVLLAKNYKTNNLFDALEKYRKAFVPDKTNNEWFKEQTKAYIVEEKSTIKEVSDKQSIAGSPYSIGVYDRLTSPSWKYPSMVLPLLTLPEKSVFIIANISTIGFGAYDRYRSKEHPAGTDLNDYVEKKAKEAAVRFRDHYDYWYRILDDKNKEKLYRSVLVYDAFRFGTDEKEDKDTYQATFETNHPAIKHFFGPAGNNVVHNSNGAYATGDAFYYMAYRMLDKDGAVTYTHEMTHNSDREIYLGGYGRRNGLGPEFYAKGLLQAPDHPNDPTVTINSILKYDQSEESTRLQVADPTQRFGSVDDLNKYMHNMFDVIYMLEILEGKAVAKLDTNQKYDLLRKIENEYKPDPDGNSVYATNVVRRLKPEELTKLTTFNSLIEHDIITRRGYVDEATYKRNGYYTINLFSPIYSALSSKIGTPGDLMGRRIAFELLAAKGYKDGMVPYISNQYEKEAKAQGKVITSYGKQIGLVTDEIVLSKVFNNQYNSWIDFKKDMYKEREDKFGKLNKVSFIDPNGSWARQQKVTIDNINRLEKMIEDAVKFDAEDEVAKLYPETNSRVLKLKKAIFKAYLDQTGDFRSSIFENKK(SEQ ID NO:1362)
>orf01807
MTALGLLAIGSLIVIITKDNRNKKIATFLIVGATGLVTLSTASALNLNANIHESGRDGVLQISGYRYVGYLELDDKTVSSVSPASTVSPVEQPKVVTEKGEPEVHEKPDYTQPIGANLVEPEVHEKLAYTEPVGTTGVDENGNLIEPPVNDIPEYTEPVGTTGVDENGNLIEPPVSDIPEYTEPISTVSEVASEREELPSLHTDIRTETIPKTTIEESDPSKFIGDDSVRQVGEDGERQIVTSYEELHGKKISDPVETVTILKEMKPKILVKGTKEKPKEKTAPVLTLDRTNTNVLNRSATLSYHLVNTDGVTINKITATIKDGNEIVKTVDLTSEQLDKQVEDLKFYKDYKIETTMTYDRGKGEETATLEEKPLRLDLKKVEIKNIASTNLVKVNDDGTETPSDFMTEKPSDEDVKKMYLKITSRDNKVTRLAVDKIEEVTEEGKKLYKITAEAQDLIQHTDPTKVRNKYVHYIEKPVPKVDDVYYNFKELVDAMNADKNGTFKIGADLNATNVPTPNKQYVPGTFKGHLSSVDGKQYTIHNIARPLFDRVENGSVKNINLGNVDINMPWADGIAPVANMVKNATVEDVKVTGNVVANNNIAGIVNKIDSGGQLTNVAFIGNLTGVGDKGQYMAGIAGEIWRGNLAKAYVEADIVANRARIGGLVAKTDNGNDSMGIGKYGSIRKSVTKGTIKTKVLFETGGFINSNLPFGKLEDNISMMRVENGEEFFGSSDLDYDGGYFTNGWLERNFVVKGVSSGKHSYKRSRDKIKEISQDEANKRIANFGLTADKYEINEPVVNRLNRLTRREDEYKSTQDYKSERDLAYRNIEKLQPFYNKEWIVNQGNKLAEDSNLAKKEVLSVTGMKDGQFVTDLSDIDKIMVHYADGTKEEMDVTKNTDSKVQQVREYSVSGLGDVVYTPNMVVKNRDKLIADVKSQLSSVELISQEVRDLMSRRDKPAENTDERKNGYIKDLYLEESFAEVKQNLDKLVKSLVENEDHQLNGDEAAIKSLLKKVETNKAKIMMALTYLNRYYDIKYGDISIKNIMMFKPDFYGKTPSVIDRLINIGSSEKNLKGDRTQDAYREIIAGNTGKSNLRNFLEYNMRLFTEDKDINDWFIHSAKNVYVSEPKTTNTELKDKRHRVFDGLDNGVHGRMILPLLTLKDAHMFLISTYNTMAYSSFEKYGKHTEEARNEFKTKIDEVAHAQQTYLDFWSRLALPNVRDRLLKSQNMVPTPVWDNQTYNGSPVGRRGFDSKGNPIAPIRELYGPTWRHHDRDWRMGAMASIFPNPNNDDKVLFMVTDMISPFGISAFTHETTHVNDRMLYFGGHKHRQGTDVEAYAQGMLQTPDSSTTNGEYGALGINMAYHRPNDGNQWYNPDPDKLKTRDDIDRYMRNYNEAMMLLDHVEADAVLPKIKGDNSKWFKKIDKEMRSKIQYNDLLGPNQWDSIRDLKDEEKVMTLSSVNDLVDNNFMTKHGNPGNGRYRPEDFTPNSAYVNVNMMAGIYGGNTSQGAPGSLSFKHNAFRMWGYYGYENGFISYVSNKYKAEADKNNHGLLSDKLIINKVSKGNFNTLEEWKRHWYGEVLAKAKKGFEAIDIDGVHISNYDELRPLFDKAVEEDLKKPDDFSHTVALKSKVFKALLKNTDGFFNKLFKEDI (SEQ ID NO:1364)
>orf01818
VFHKSLNNCKRKKVCYSSLLPSCFHDWLKNLLTKSQHFSHINFIVEGEGWRSQVRFNHALGNNLTHWCHWNTLDFTIWCYVIRDFFHFFNLSRRFDAIVFDIFRKQGQNILLHDFTTMTGSLDFLPSNVMFEGNSFCKWRNANHVCVFISFHVFFVDTTVCT(SEQ ID NO:1365)
>orf01822
VLGGRANSVTSCTTNSHWNLTFTTKHVTCFSSLVDDIVHGNNREVHEGHIDDWTKSCHGCSCCCSRDGSFRNRTVTDTFWTKFFKHSNRSTEVSSEDTDIFSHQEHIFIATHFLRHSKDNGVTEGHCFCFHFISFSLVCVNIFKG (SEQ ID NO:1366)
>orf01823
MDMFYIGHFLDIRRDTVTVVNAIENDWQVPDRSHVHCFVENTFIGRTISKEANNDFTGILHLLTEGCTDSDPHTTTYDTIGTKVPSIKVSDMHRSTFPFTGSSVFTKDFSHHSVEVNPFSNSLPVSTVV (SEQ ID NO:1367)
>orf01841
MISVWHCNTSSCSTCDLRWVENKAIRFHMALTQRQFVELFQETINVITLTCLTVSVAVVACVSICSSWIAYRRYPVCS (SEQ ID NO:1368)
>orf01842
MISMRNDISITSILYDIRSIKDITIICSIASLRTCQGNSSIVSWSPSFTILTMFLFLSIDFLFCTDVIRVGSILKVNIVFSIYLDNISTLDLINNILIF (SEQID NO:1369)
>orf01843
LVCYLDDDLLSIDSFTLANLIRSQILRFLRRLFSIYIGNTIIFLNRSSLIQSQLVRTNT(SEQ ID NO:1370)
>orf01859
MDKLIIFIEKGKPFFEKLSRNIYLRAIKDGFISSMPAVLFSSIFILIAAVPNIFGFKWSDEQLAFILKPYNYSMGILALLVAGTTAKSLTDSVNTRSMEKTNQINYMSTFLAAVVGLLILAADPIEGGFANGLLGTRGLLTAFLAAFITVNIYKVCIKNNVTIRLPEEVPPNIAQVFKDVIPFALSVLSIYGLDLIVRNIFGTNVAESVGKILAPLFSATDGYIGLAIVFGAYAFFWFVGIHGPSVVEPLIVAISYANIEANVQLVQAGMHADKILNPVTQTFVVTMGGTGATLVVPFMFMWLCKSKRNRIVGRASVVPTFFGVNEPILFGAPIVLNPIFFIPFVTAPIINVWIMKFFVDVLQMNSFSIILPWTTPAPIGIVMGTALAPLSFVLAITLIIIDTLIYYPFVKVYDHQILEEERKGNSSSELKEKVAANFNTAKADAILEKAGVDAAQNTITEETNVLVLCAGGGTSGLLANALNKAAAEYNVPVKAAAGGYGAHREMLPEFNLVILAPQVASNFEDMKAETDKLGIKLAKTEGAQYIKLTRDGKGALAFVQEQFD (SEQ ID NO:1371)
>orf01861
MIFSNQIPLLLSECNPLTNYNHLFSLIISDKRDIVIHWI (SEQ ID NO:1372)
>orf01868
MDGFIVTVKIIGHLLVVVFSAIPFFKEFCKEVCVCLLSIVTFKVFNFRNQFLVFFRWFVFSMNESFDDITHKQFTSNLTTKADNVSVQLFFSIKGCCHITNQGRTNTWNFIYSVVDTNTSTTDTYPKISLAASYSFPYFFTKDWVVSPCMVICTKVNDFISF (SEQID NO:1373)
>orf01871
MLDFQDRSPWLEGQKEIDLSYDLFSTDAVTLDELQSRTIALRSLKHDKGLKVHFAEFPNLIIWSTLNKGPFITFEPWSGLSTFLEEGDHLEDKKNVCLLEANQVEELGFEIEVL(SEQ ID NO:1374)
>orf01872
MKLFKMSCRNIGQAGKILADSGYQGLMKIYPQAQTPRKSSKLKPLTAEDKACNHALSKGEARLRTSLPK (SEQ ID NO:1375)
>orf01874
MRRKYKSIALKKELANDSGKKKCHAMKAQAIVTSQGRIVSLDIAVNYLL(SEQ ID NO:1376)
>orf01878
MKIKEQTRKLAAGYSKHNFEVVDETDEVSNHTYSKATLTWFEEI FEEYKN(SEQ ID NO:1377)
>orf01886
LIESQVFSSLQVCCLNLCHLKFQHFDTCLVFLLVFLDFQNLLAHFPIGIKTRLIGFFQVPKSGITKFIQHLDMQLGTH (SEQ ID NO:1378)
>orf01887
MVMLTMNIYKMLPNSSQNRQINHLTIYTADTTTILQDFPTDDNFIT(SEQ ID NO:1379)
>orf01888
MTNNICRRTSSQHHIHGINDNRLPCTRFTSQDSHPLFKIEGNSLNNGKVFYRNFK(SEQ ID NO:1380)
>orf01899
MPHTRDNWQTRFKNSSYHNFFVKGPEILNRTTSTTNNEQIQIVPLISTRNISSNFLRSPFTLNLGRIKKDVNTWESPADGRDNISNNGSTTAGYYPNSLRKLG(SEQ ID NO:1381)
>orf01900
LLEAFLKQAFFCQFFLKLFKLNRKRPNPIRLSFFNDDGVATTWFIDLYTPNHIDLHSFFQVKP (SEQ ID NO:1382)
>orf01911
MFMSNLCQFFQVWNINQGVTQGFNQDKLGIVFDSCFYFLQIINIDKGCCDTITRKGFFQKIEGSTVNSRSSHYMVTSMGKRQNRISHCSHT (SEQ ID NO:1383)
>orf01912
LINVFSHGVDIAIHSATKFIGGHGTTIGGIIVDSGRFDWMASGKFPQFVDEGSSCHNLSYTRDVGAVAFIIAVRVQLLRDTGAALSPFNAFLLLQRLETLSLRVERHVQNAETIVDFLVNHPKVEKVNYPKLADSPYYALAEKYLPKGVGSIFTFHVKGDEEEARKVIDNLEIFSDLANAADAKSLVVHPATITHGQLSEKDLEAAGVTPNQIHLSIGLENVEDLIEDLRLALEKI(SEQ ID NO:1384)
>orf01913
MTRDFKFETLQLHAGQVVTPATKSRAVPIYQTTSFVFDDT (SEQ ID NO:1385)
>orf01917
MSQKNNKKKNKRKNLLTNILAGFLILLSLALIFNTQIRNIFIVWNTNKYQVSQVSKEKLEENQDTEGNFDFDSVKAISSEAVLTSQWDAQKLPVIGGIAIPELEMNLPIFKGLDNVNLFYGAGTMKREQVMGEGNYSLASHHIFGVDNANKMLFSPLDNAKNGMKIYLTDKNKVYTYEIREVKRVTPDRVDEVDDRDGVNEITLVTCEDLAATERIIVKGDLKETKDYSQTSDEILTAFNQPYKQFY (SEQ ID NO:1386)
>orf01924
MRWNIGCHPNRDTSCSINQKVWKTRWQDQGFPFIGIIVINEINCIFVDITKHFQSNLAHTCLGITLSGSTISIHGTKIPMTIYKHVTVAPPLSHTDHGFINRGIPVWVIFTHDIPCNTSRFFMGFVWGHTQFIHSVENATVNRF (SEQ ID NO:1387)
>orf01928
LKKKWFFVDYYDTTIILLALISVILVLLGFAEMIDLDNPPYSIIDLVIWGVFVIDYSWRFFITKRKWRFILENIFDLLAILPLNAIFTVFRLGRIFRLAKLTKLLKLTRLLRIIGLTGKLERKISRFLRTNGLIYILYVNIFIVLVGSSILSVVEEKSFSDSLWWALVTVTTVGYGDIVPASIFGKWLAVLLMLVGIGTIGMLTSALTNFFVKDNPDEQIKLDKLQDELSSQRILLEKQSKKIEELHKMIQDLIEKT (SEQ ID NO:1388)
>orf01938
VVDFKQTRQDPHDITIYSWLRQVKSNTGNGSCCVRSNPFQAGNSFIGIWKLATKVSHNLLGCSLHIANSRIIAQALPSFQ (SEQ ID NO:1389)
>orf01943
MAERTVVQVHNAFPEDTTLINSQLIPLVQVVVNQGRKGIVGSCNSMHISSKVEVDVFHWQNLCIPTTSSTTLDPHDWTKRRFADSNHGFLANLVQGIRKTNGKRRLSFTCRCWVDGSNQDQFTDWIALNCTNFIKAEFSLVLSVQLQIVVRNTKFLYNINNWLQLNTLCDFNICFHSKFL(SEQ ID NO:1390)
>orf01950
LRILDSQPCFFVDFTNDRLRKSLIIFYMTSRKGITRPAIVFRGAILHHHALSFEVFNQTNIG (SEQ ID NO:1391)
>orf01957
MLFIIGHLNFPTAGSFIDSTLHRLGNRVCIHDDMAFTVTSSTSNSLDESTFVAKETFLVSIENSYEAHFRNVNSFTEQVNSDQDIKDTQAQVTDNLRPFQGLDIRVHVLDLDTHFLEVVGQILCHFLGQSCDKGTLIFFNAGIDFTQEVINLSHSRTDFHLWIQESRWTNDLLNHCLGLFIFIVTRCR (SEQ ID NO:1392)
>orf01958
MNVTLKLLPTERTIVQSRRQTETIINQHFFTRTVSIVHALDLPYGHMTLVNHNQEIIWEEVEKRIRRLSFAPSIHVARIIFNPIGIAHLTQHFDIILCPLFQTLGFKQFTFLFKDS(SEQ ID NO:1393)
>orf01959
MIHFSQHLTCQSLNFTNTVNFVSKKFYSKGMFISGSWENLYHIPTNAKSSALEINIITFKLNIDQVIQEFITRNL (SEQ ID NO:1394)
>orf01960
VAKLVNLVIDRTILLNIGIARRDIGLWLVIIIVGYEILNCIFREKFLKLPIELTSQSFIVGNNQSWFIDFRNDLTHSIGLPCSSRPHQNLSFFSPLNVIHQLLDSLGLIS(SEQ ID NO:1395)
>orf01979
MNITKTNFLAVNFVFTIPTTIDMAFYSDFLTCILDKSIMIIQSHNYRSIIKRFTTFCSSKDDIRHLAPTETLDTRLSQGPSQTFCNIRLSRSIGSNDCRHTLVKDDLGLISKRLESLNFDFL (SEQ ID NO:1396)
>orf01981
MGFIVCNHLKFACFNLRNHDLIDKFLDLGHILVQKKGTKKGFKGITKNGITIAPTRFFFPLTQLDKLVKLAITRKASQTLLTDNHSTEF (SEQ ID NO:1397)
>orf01989
MRITDNQHKIAKEDFVAEYPKLSQALLDRTLDNLSREDNIFIFPNDLTHTPDLDKDQKIFETVNQKIKTGNVIGFLGYGQERLTISSRFSDESNDHFLHYLLNKVLHINLTSLDVALSREERLYQLLMYLFPKYLQAAIRKGLYKEYHRFSHNDSHVKGVI DVRNHLKKNLPFTGNIAYTTREFTYDNPLMQLVRHTIECIKNQKSIGQGVLDNLSTSRENVSEIVRVTPSYKLADRAKIIRMNKIKLIRHAYFREYRKLQELCLVILSREKHGLGPQAQRVHGILFDVAWLWEEYVYTLLPKGFVHPRNKDKTDGISVFSVGKRKVYPDFYDRERKIVLDAKYKKLELTEKGINREDLFQLISYSYILKAEKAGLVFPSKDKVIDNEIGNLAGYGLFESLRMPHSIVHFVK(SEQ ID NO:1398)
>orf01995
LDEDILLGCILPWKPEAFEKLKAYGNGREELMTDVRGTSCFVIKFGKAGEQLAAKLWEEGKMVYASSASMTKRLKLAMSKV (SEQ ID NO:1399)
>orf02000
MAKKIVALVGDGIGPEIMEAGLEVLEALAEKTGFDYEIDRRPFGGADIDAAGPPLPDETLKASREADAILLAAIGSPQYDGAAVRPEQGLMALRKELNLYANIRPVKIFDSLKYLSPLKPERISGVDFVVVRELTGEIYFGDHILEERKARDINDYSYEEVERIIRKAFEIARNRRKIVTSIDKQNVLATSKLWRKVAEEVAQDFPDVTLEHQLVDSAAMLMITNPAKFDVIVTENLFGDILSDESSVLSGTLEVMPSASHSENGPSLYEPIHGSAPDIAGQGIANPTSMILSVAMMLRDSFGRYEDAERIKHAVETSLAAGILTRDIGGQASTKEMTEAIIARL(SEQ ID NO:1400)
>orf02004
LANIESHCNFFQSSIFSSLPNTIDSPFNTSCTILDSSKAICHCHSEVIMTVRRTDDLTIRLDILNQVFEDGTIFL (SEQ ID NO:1401)
>orf02011MTAIWEIATSVEFTKTTKFNDHWTATHFTVKSSWFILNLDFFHFFFSLGNFF(SEQ ID NO:1402)
>orf02016
MPRNRFSFTVRVTREKNFISFFSFFFQVIDKRAFSSDINILRFIIIFNIDGHTGFLQITDMPDTG (SEQ ID NO:1403)
>orf02020
MKIKAQTRKLATGCSKHCFEVVDKTDEVSSKYCFEVADGS (SEQ ID NO:1404)
>orf02029
VHAHTDKLCNGCNRIFNSIISHHTIFRERNKLSHKAIKSTRQEMGPCHVVFIEFFITLHRRLIGNHDNFLTNLVGSGRVRNDGST (SEQ ID NO:1405)
>orf02030
VNHCHWKLFIQNLGITFSLIVTLIRMTDSHVVGTDKDMILLVNSLFLIFDIDKLRLS(SEQ ID NO:1406)
>orf02032
VGNNDILWSKRTISINGFNDFLNTCIAVSTTLCNDDTFLIKRKIFIYKIFCMRNPVSMNTNYNFFNTWLQDKFFNCMNQNRSIT (SEQ ID N0:1407)
>orf02042
LVAPVASSTRFFKNNDSLTSWNNGFIIITINTIISYQRISKGQDLSIIRLVCNGFLVAGHPCIKDDFACYINICSEGLAFKNCAIF (SEQ ID NO:1408)
>orf02044
VVCYFYITIDWSWVHEDCCFFQTIVTFLSQAMLGMVVFF (SEQ ID NO:1409)
>orf02045
MAFVLHTEKHHDINLINDFINGYKLSIVCKLLTSPFLRSSEKEFSSQAFQNLHIGFGNA(SEQ ID NO:1410)
>orf02046
VIQVTCNSNFKTLKVAKFLINGHQIKQALARVLARTISTIDDGSRNRWTSNQFSIVVDLWMANHTDIHS (SEQ ID NO:1411)
>orf02047
MCPCRILKEEIGNNRMVFIGKLGSIFKLNSSLDQFHYLIDSEVFHGHHMVQCLLIF(SEQ ID NO:1412)
>orf02059
MQEHYTPKGKHLTIDNRRLIERWKNENKSNREIAGLLGKAPQTIHTEVKRGTTLQQVRKGLYKKVYSADYAQTVYQFNRKRSVKKLILTKEIREKILHYHKQKFSPEMMVNKKQVKVGISTIYYWFHNGHLGLTKADMLYPRKRKGVKKQASPNFKPAGKSIEERPDVINLRLENGHYEIDTVLLTKIKNYCLLVLTDRRSRHQIIRLIPNKTAESVNQALTLLLGEHHILSITADNGSEFKRLSEVFPEEHIYYAHAYSSWERGSNENHNRLIRRWLPKGTKKTTPKEVAFIENWINNYPKKCLDYKSPSEFLLGG (SEQ ID NO:1413)
>orf02076
VQFHLIIFQNLFCSLDIVIDSLTTDTELLGNFSKAVIISVVELDIIHLLICQKRRIKFKERIHTIGFFDFHNFYYTKN (SEQ ID NO:1414)
>orf02079
MKFNHYFFLFLIIEKQVAIISFFMHFHIIKLVNHFQLLIKLNCISHPNLHIRPSFLSLVLLFYQKEQDFAIMVI (SEQ ID NO:1415)
>orf02085
MKKEQFYPLGIFLAAMLGGLVRYLVSTWLPASPDFPWGTLFVNYLGIFCLIYLVKGYLVYKGTSKGLILALGTGFCGGLTTFSSLMLDTVKLLDTGRYPSLKPELAFEYRWRPAFSLLFGEEEMVIVYLAIACGLGALVRYFFSRYNQASKLPLGTLIANLLGCFLIGVFYNHVESKEVYAILATGFCGGLTTFSTLNDELQRLLSDKKVFYSYLTLTYIGGLVAIFLGILL(SEQ ID NO:1416)
>orf02097
LNRSILDNITLKHEVTSQKIEEVCKAVQIYDEIMAMPMKFNTIISEMGSNISGGQRQRIALARALINNPSIVILDEATSALDTINEERITKYIQSQGCTQIIVAHRLSTIKDADVIFVMKGGKIVESGNHKYLITLGGEYYSLYTKRK (SEQ ID NO:1417)
>orf02100
MSLLETAKRHQLNSEKYLSYLLECLPNEETLVNKEVLEAYLPWTKVVQEKCK(SEQ ID NO:1418)
>orf02101
LKRPPKQADKSSLGAKGLAYCDQLFSLERDWEALPADERLQKRQEHLQPLMEDFFA(SEQ ID NO:1419)
>orf02102
VISIEMRTFFLYSSAFKKHSSPSPINDGLYHLLLQSLYNILELIHDIFQSLKGFILKSTFTNLFPHLFNGVHLWCVWRNKCKANISRNL (SEQ ID NO:1420)
>orf02129
METKKIKNLKGQIIVSCQALEGEPLYTPNGGVMPLLAKAAFQAGAKGIRANSVRDISEIKEEVDLPIIGIIKRDYDGFEPFISATMKEIDELVSEGVDILALDCTNRSRPGYDNITDFIHDIKVKYPNQLLMADISTFEEGKVAAESGVDFVGTTLSGYTPYSPKKDNPDFELVERLVKELDVPVIAEGRISTPEQARKMLDLGAYAVVVGGAITRPKEIAQRFINVIK(SEQ ID NO:1421)
>orf02134
MTKDILELESQKMSSDTFIDEIKNNYLSIVESTRKLIDGRQIELAIKLIREANQILMIGVGSSGNAAREFESSLLRIGIISKTVIDTHFQLMHTALLKDNDLIIAFSLSGSTKEVEETLLNAKRKNVKIISITNYSSRNIAKLSDCVLLTSKKESYLEGGSLMAKASQLFIIDVICTRLSLINYEDTICKKEEIASLLSNKVE (SEQ ID NO:1422)
>orf02135
MQIKFIDKVSNLIMLNLLYVASVVTVIAIGSGESALIATLIKIVRHEESYPYRDFANSFFKDYWKNLGAALISNLPILILLFSLFFLPYIPLPIYIISILRHIGVIYIILHLIATTFLIPLIGRYNNTLKNSLHNSIMLAYKHFFIAVLIRIIEIIPVLLFFILQNQLLVWITLMIFILPSITKYANAFLYNFIFSKYEKLN (SEQ ID NO:1423)
>orf02136
MVSGGFRLDFLLETARLARSTYYYQLKQLDGVDKDKEIKTEIQGIDNEHKGNYGYRRIHLELRNRGFVVNHKKVQRLMRILGLTARIRRKRKYSSYQGEIGKKAENLIQRQFEASRPMEKCYTDVTEFAIPNSTQKLYLSPVLDGFNSEIIAYHLSTSPNLEQVKSMLEQAFTEKYYENTILHSDQGWQYQHDSYHRFLESKEIQASMSRKGNSPDNGMMESFFGILKSEMFYGYEKNFRSLENLEQAIVDYIDYYNNKRIKVKLKGLSSVQYRTKSFG (SEQ ID NO:1424)
>orf02137
MKLSYEDKVQIYELRKQGQSFKQLSKRFGVDVSGLKYMVKLIDRYGIEIVKKGKNRHYSSKLKQEMMDKALLEGCSQRSISLDYALPNQGMLS FWPAQYKKNGYTIVEKTRGRPAKMGRKRKKTWEEMTELERLQEENERLRTEVAYLKKLKELEERDEALERERQRQLEKWFQEDFD(SEQ ID NO:1425)
>orf02152
MVISKTKKYKGVYKDSKGKIYFQIELGVDPITGKRIQKKGRKNQQGLPFNSFKEAYEEILRLKHEFVNSTINNSFLTFREFMEEIYLKYYQQKVQFVTYQTALPHHQLFIKQFGSKKLSDISTIDCERFRLAIIDKYSSNYAKNMWSRFKACLGYAERLGYIDRVPFKGLDNPRGKHPDTKFWTFDEFKKIINSFDISEYEGLHNYMTIWLYFMTGLRVSEGIALKWEDIDFERKWIHVHSTIEKDKNGVWYAKQQTKTVAGNRKIDLDDFTITILKKWREVQIKNDDKDYVISRFGAPLCKSTISRIIKRHAKITGVPEITGKGLRHSHASYLINVLHKDTLYVSYRLGHADKSTTLNTYSHWYYSGDSTISEEITNSLDNLGLSIYLPNSCQS (SEQ ID NO:1426)
>orf02153
METVNYKDLVAIGFPEHTSRNIIRQAKKIAVKKFEEARKNDKNAVQLGCSPFDNKRLGIAPKNIVENLIGISFSDIEGEKNGYIKDKEI (SEQ ID NO:1427)
>orf02154
MLKRIRDLREDDDLTQEYIAKIVLNCTRSSYSKMEAGSRLISINDLIKLADFYKVSLDYLVGRVDNKEDHYSKK (SEQ ID NO:1428)
>orf02155
MVITKHFAIHGKNYRSKLVKYILNPSKTKNLALVSDFGMRNYLDFPSYKELVKMYNDNFLSNDGLYEFRHDRQEVNQRRIYSHHIIQSFSPDDHLTPEQINRIGYETVKELTGGRFRFIVATHVDKGHIHNHIILNSIDQNSDKKFLWNYKSERNLRMVSDRLSKIAGAKIIENRYSHRQYEVYRKTNYKYEIKQRVYFLIENSKNFEDFRKKAKALHLIIDFRHKHVTFFMTDSNMKQVVRDDKLNRKQPYNETYFKQKFVQREIINILEFLLPKMKNMNELIQQAEFFDLKIIPKEKHVLFEFNGIKLSEQELGKMNQYSVSYFQDYFNNKNETFVLDNNNLIELYNKEKLIKEKELPTEEVVWKSYQDFKRNRDAVHELEVELNLNQIEAVVDDGIYIKVQFGIRQEGLIFVPNIQINMEEEKVKVFLRETSSYYVYHKDSADKNRFMKGKTLIRQFTLQHEPQHMYRRIPLSKIKEKIEQLDFLISAENSPNDFEDITNDFIAQISYLENMIEQVQNKIDDLTNLEEVLLNNTTNSSSNLENSIQGKSSVDTIEKDLYIYKGKIETLKEQHGEAINLFEMFNKTIKKYKKKQNMKSIEENEIHLE (SEQ ID NO:1429)
>orf02156
MKRDIRSIRKQFRLTETEEKQILDLMREKGEDNFSDFLRKSLLLSDGQKQMEKWFNLWKKQKLEQISRDVHEILIIAKINHQVTQEHVSILLTCIQELIKEVEKTSPLSENFRNKYMR(SEQ ID NO:1430)
>orf02157
MEYVEAVNQFIERHYKEKDIGHIEIDFWGNKNHPHSLYIYKRSKKIEYDYFFFDSIDYYEEPDFLEFKYIVHLENITYIFWQED (SEQ ID NO:1431)
>orf02162
MTTLDFKTLFKEEYDKLNKQQKKAVDTVEGPVMVIAGPGTGKTQILSRRVANILTNYHTSSEEIVCLTYTEAGASEMLDRLEKLIGEEGRKVRVSTIHAFCSELILRNSEIFGGQPKIISTAAKYEILKEIMDEYVIEGNPLYKNSGKRYSAKDQLLELFYKMKRENLNKEDFEKEIDEYFKMIDLSIPGDDLYSKFKYARNSKSKDKKVGDYKDKAINELKENTQKLLAGVEIIEKYSSDISNHNYFDFDDMILWTIEKLEENEGFQRSVSDTIRYLFVDEFQDTSVVQNKLVDLLVKGKDNPNIFVVGDDDQSIYRFQGVSANNIRDFDKKYKPTKIVLDENYRSSQAIIDASRQLISHNPREEKLLIAAGANKDYDYQLPILKSYENAKAEMFGVLTEIKELIDSGVSPNEIGVIYGRNSYGEEFAKILRDKGIFVQMKENKDLFSEPFFKKIVAILKYLCKPSRDVRELRKIVYFDFFEVVLSQIVMIRNLKKDEKISIPTIAEIDQKLEIIRKKVNQSSKYLSPMYVLSDVLKSLSIDEYIMKSKEKYHLVSVLNELYKLMLMECHIHPKLTVKGFLNQLSALEEMGISLPIEDISGSPSNCVQLMTAHGSKGLEFDHVFIMKCNDGKKKSEAWPGGENNSGRFSYPPSLNGKDENESQLKEEENRRLFYVAMTRAKKVLHLSYANDSTKTHLINEFEEFIDEVDVTESFEDCQSVDKVVMPKFSNNVINEIFDELSLSVSTLNSFLKCPLSFYFNKGLKLPSETNEAMVFGSIIHEVLEKIYISVDGSQSSELTAKTVLSLEEALKLFETVFEEKSYQLTSNKIKKDDYARGKKIIENLYKKSGYLKDGVVAVEVPIQGIRLGDILNTTVDLSEVSNIEINGKIDKIECDGNIVCLVDYKTGNFENAKKKLVAPSEKEPLGGDYWRQAVFYYILFKNAGIDISDKEILVKYVLVENSTNEDGFSETEDIRITQKEVDIVLNQIKESIMKIKQGDFNCGCGVLKKDRDNYPCDYCLQVSANTTPKFDNTEALEVATYQQTRGNYKSLSVSKLNRYLRCPKSIYFEDVLQLSQAAGLSAGAKEKSTKITINHAPTGPVFGTAIHETMEKIYKEDLQLEDAIEFYDSSLYSHQEEIIDTMSVEELKEYGHNLLTNLFEHFIPNSLKGEHVSLEKELRVKLGDNYSINGIIDKLEFDNDLIRVVDYKTGSAQRGVEELEVGHDYWRQAVFYNLLLENSSEIDTTDKRIETQYIFLDDNSTESGYSIHTIQVTKEDLDLVTSQIQNFWSHMNTADFTGSCGKNDCDYCRLAEFVDFELLKETIESGKESNLVN (SEQ ID NO:1432)
>orf02163
MSGRLTRQNYYLLGKLIDEFHAVKAAMRVIETKRNDFNI (SEQ ID NO:1433)
>orf02164
MKPQERLLTIFFRLQAGERLSKAQLSDEYEIDYRTVQRYMSTLKNFLQEQRISNTEIKFDTSDNTYRLIAKTTFNKKDILVISKILLENRALNKSELYSLLEDLLSLLSSEEQKEIDAIIGSERFNYKSLTNDKDRIDTIWILSEAIRREQMLEIEYKAPLKDIKSHIIFPVSLYYDAHYFYLVAYHLKHENYTTYRVDRMESLSESHVKKPEISYGRKYRDGEVRNQKVDAFEGRKIDVTLIYKGNTEIVLDQFPEREILSENHDEIKVKIKTQDTPGLKRWILGQGDAVTLLSPSKLIEEIQESLENTLRNYKK (SEQ ID NO:1434)
>orf02165
MSVQKTKNTLNEPLKTLLDEYHDKVGKINNSSELFDIYSPWNNFNIEKMIESFNKALQSNSNNFSWLDIEEDLPKSTDVDIKYGLPNHIKGNIDEATLFLCLVNPNIDEVKTEKKDVGIHTYYKKAREMESGDDSLNILNDKGKLRIDPKVYIKEHILDVRETSSILYNELQIVKQTRSYKDTYYLGHYLPHFIKEFLNKKGSFKNVIHNLTDEWDELEKMSKKIANLEAFPFRSQNPNYTYKSNKRATNFTNLLIESDSKVNLLSARVIIWRIVKHLESSQHKPAFILRRFNTFWLPTISKVLEQDLNFTKEEINQIINALDEEYFFTVRKKDYNGQSGYFGRNFCKNNERISNSSFKHLVQETLGEYVKK (SEQ ID NO:1435)
>orf02166
MSGSFSDSPTHDDKFSIENYINGLSNFIIECETPLTVAIQGDWGTGKTSIMYQVEKRLNPEKQDKKIQTIFFNTWQYSQFDMGNNLAVALITDLISELNVEDSKKKQFFKKAKGALSKGLEYVNLDFGILNGEKLTEKFQDLIIGFGERTDDIKHLKENLQDIINDAIKENKSDRIVIFIDDLDRLVPEKAIELLEVLKLFLDCEHCVFVLAIDYNVVVRGAKSKYGKDLDDEKGKAFFEKIIQVPFTVPVANYDLQNFIESSLKKLDFCFDKNNKERNQLETITQLIRYSIGNNPRSINRLFNSVSLLMYINNGDKVDHDEKLMILAMVCFQLRFEEAYNYLLTAYNNSPEDSDDIESYLIDLLENSFELLDDEVYYNSLVSLLGKFTFKDKKDRDDFTNFYRTLKELLGYNEQGLTMEQFNKLIEKMTFSNAVSIGNTDTITADKKKQNHAPNEDVQFVIRKLFNTLVGDENYFDLKKPELFGKETREKREAPLSEEFISIPNEFDRIRLTRGKGQGLNIYSSHNKSNFIYISGDTHGRMLNDGMAIVVNNNLVEKIKDNILASDLRSEELYHEFEMNFRDNLNKLLSKASKILNN(SEQ ID NO:1436)
>orf02167
MEFIRAANQFIENYYPREDLDRIESIEIGIRDSENYSRYFLEIQKQSEEFECDFFNFDNIDYYVVDDSVHFKQIINLENSSYVFWKDY (SEQ ID NO:1437)
>orf02168
MNKPIAAIFDIDGTIFRDSLLLKHMEKCVSYDVFPNSVNSEIKFHKNAWENRELDYDDYLYIAATLYTKYIADKDILDIDFVAKKVIEKESKKLYRYTRDRIKWHKEQGHQIIFISGSPDFLVSKMAEKLGADIWYASNYLQLDSKYTGEVIPMWDSTSKLQVLKKLFIDFEKSYAYGDTTGDFTMLQSVGFPTAINPNKKLLDKITMEKLDCKIIIERKDVIYKLDEVTHGIY(SEQ ID NO:1438)
>orf02169
MTNAKEFALTAHKNQTRKGKITPYSFHLFLVNNILETLTEDPHIIATGWLHDTVEDTDVSLEDIKQEFNDEIYSYMSLESEDKSIKDWQTRKELQLAKFREAAEDESLRKVLLVTFSDKLANLMELYQDYLIIGGLLWDRFNSKDPKKQRWYFNEFYKIFKDNQDLFSKNKDILNNYKEILKLLFYNN (SEQ ID NO:1439)
>orf02170
LKAIVKNPKRLFELLRLYFVPVKGRKVVHVPAYAYKEDENEKIYLHNNELHLSKKMFEFLVNQGLDLVECLPEE (SEQ ID NO:1440)
>orf02172
MANSSEAHGRVYIKASNLKTIEDFLLIQEERNKYVYYPTDIIDSQSNISDIVSSRTTQENGYYICNMWFTAEGRWCFENNIDDFFDCTLFQDTDDVLIRQMKEYVCSQDIQIKFEYVDAEASQNFVKEQEATITYNSKTKDISIDVKTIKDLPYTAENLIVYGYYECDEIVSVQFLLDYYDDYLRGNEFYLKHKDGIVPILERQQ (SEQ ID NO:1441)
>orf02173
MAESALINLINFSKENEELTNLVSGHASKREKATISKDGLIQARSIENFIDNYALSDFDFSTIKEKCVFIKINNSFQADDTTEDIYHNVRGVWNISESRKKDLEYALALYRGVCVGVYKIQGWKKAYEHSSEYPFPTRKEGGKIETSEETIVKYSNIEDLKKDYPELYKRSFSNSEFPQKSLDKWRNRSFFYGNWDGSDVPQHLAQCLNKRIINIPKFTKSVKEFKSIDNQASVIYNDLK(SEQ ID NO:1442)
>orf02174
MDLVEDCNTFLSFVADKTLEKQKLYKANSCKNRFCPVCAWRKARKDALGLSLMMQYIKQQEKKEFIFLTLTTPNVMSDELENEIKRYNNSFRKLIKRKKVGSVIKGYVRKLEITYNKKRDDYNPHFHVLIAVNKSYFTDKRYYISQQEWLDLWRDVTGISEITQVQVQKIRQNNNKELYEMAKYSGKDSDYLINQKVFDAFYKSLKGKQVLVYSGLFKEAKKKLKNGDLDYLKEIDPTEYIYQIFYIWKQKEYLASELYDLTEQEKREINHKMIDEIEEEQ (SEQ ID NO:1443)
>orf02175
MNFNKIDLDNWKRKEIFNHYLNQQTTFSITTEIDISVLYRNIKQEGYKFYPAFIFLVTRVINSNTAFRTGYNSDGELGYWDKLEPLYTIFDGVSKTFSGIWTPVKNDFKEFYDLYLSDVEKYNGSGKLFPKTPIPENAFSLSIIPWTSFTGFNLNINNNSNYLLPIITAGKFINKGNSIYLPLSLQVHHSVCDGYHAGLFMNSIQELSDRPNDWLL (SEQ ID NO:1444)
>orf02176
MNYQKLNDITGATKNEKDKYYVYGLYEEGKQLPFYIGKGEGTRLISHIDEALTEINQEENIQISKKIQIIRKHKGKIIPVII (SEQ ID NO:1445)
>orf02177
MTTTKNPWNQLSNVDINGEQAILATEDVELIKKYTSTKHYKNLKDDIYRLQLGFCPQQFVGDIQNADIIVLSKNPGYTPEFKTLYDHDKNYQKTLLNNLQLKGNLYFHAFDLDTNEFGYWAKKFKVWFDDVDNLQDLKEKLPWFSKHVALAEYFPYYSTKYDDKLNDFISKEGYFPTQKFLFNLIRERVLDDNDPVIIIITRSYNKWYDAIPQLKEYKNCYETSNPSNPSLKPENLLKVKRYSAKKEVEKVLEDSLKKLEHK (SEQ ID NO:1446)
>orf02181
MFTKLFKKNQDNSDVFKKLIHRLSDMSIQDLKKIDRLLDIIFTPDQGSEQLKTEATYREETLDDTLKEAKNQLHMEQLEKNLERFRKNSQ (SEQ ID NO:1447)
>orf02183
MTKDWNFNQPLESKSENQEDPDKIAALFGNHQGGNEVNYEAAFQKRKQAPVTESNSSSKPKVTEVRTGKETDITTSYQQHLKRLIADNNSDIQSSQKKIEELHTLIDTKNKDNKKLQSIYDAISELH (SEQ ID NO:1448)
>orf02185
MTIIERLEEKVTRQESKVARETEKLAAYKEQLETAMFATFKRRQSISHMSFEEALDHAFGKERQFDDSEFRKDEMSE (SEQ ID NO:1449)
>orf02187
MEIEECKKISILDVANRLGISFKQVSSSVYEHPEHDSFRIFSTTNTFKWFSRDIQGDVIDFVRLVKGISFKEALAFLSEEPFQKEAVQEKRERPFYYPLKRIEDSNCSLARYYLTECRGISEEIIQKMIQQGLMSQASWKTNETVEPVIVFKSFDHRHKLQAASLQGIYKNHSLPRERLKTILKGSHGHVGISFDIGKPNRLVFCESFIDLMSYYELHQQNLFDVRLVSMEGLKRSVVAYQTLRLIAEENQKLEFLDTVIPSKLLPLINTIRDTTSYFDNHPDLLTLAVDCDDAGKDFSDKLSRSGFPVFLDLPDNESGKEKRDWNDILREKKSDLQLMIENAKETLRNQPVRQTSQCLEL (SEQ ID NO:1450)
>orf02189
MLNKVKTKALISVGAVAATSFILMVGYTIGQHSTAKQSRKEIELAATKLVEDKQAEDKASILSSDTVKEFLTQYYTKEKLGENNTRIQPYMTESAYSQELSSQNDAMNQVYKDYILDYHFEKADIFVNQTTNQAIAMVSYNVTYVSDLKNANQSKTNQTETRTVKLSYSKLPGKLLVNQVQVWKSGLDDLDKVTPKTLEESSSIPSLPNTTTK (SEQ ID NO:1451)
>orf02190
LKKVKFIQQLSETSCGLACMAMILDYYGHEANLYELCCDFENSRDGLSIREIKDIASYFGLDSKATEILNIKKFLGNKFVEPYIALTQNAHYIVVEKHNEHSVVFIDPERGRITEDISNFANNISGIVIFFSPNRKFTKKKKHSNFFKILKIGKIDIKRLCYISLISILVQSLTLLLPLLTRFIIDNVISKGEMYRLGMLFSIFSLITFYALFSFIRTKLIITVEKRYIFTLKDKIVGKIFTLPMKFFDSRSSGEIVTRINNLDSLEKIISSGISSLLIDLSTIIIAFIAMAMISLYFTLIITCFAIFLFIVLYFLLKKLEEKNSNFISSKELTQGYLMEIFSNLLFLKVSAKGDVSYTKWKEMFSNELKFDVERENYLNIFQTFISIYRLVPNLLILILGGLEVQQHMMSLGSLMSFLSLVSLLLSPITLIVQNCFQFQFCFTILDRVFDIIYTPEEKNQFSISKLPPFEMMTFRNLSFAYSSACKVIYDISLTIKKGERIAIVGRTGCGKTTLIKLLLRLYDVEKNTILYNGNDINLFDLNSYRKSFGVVLQNDVMFNDTVISNIDLTHSHSMEDVISAASLAELDIEINNMPMGYYTSIGDNGNNLSGGQRQRLAIARAILQNPEIIIFDEGTGQLDTITEKKIMDNLKHKGITQIFITHRLSNAQEYDNIIVMDNGKIIDSGKHDELCGKEGMYKRLFETSYN(SEQ ID NO:1452)
>orf02191
MNRKLNITDFKDASYFSERKISSHDLGDGLKSRKRKEYWDNFIGEGSNEFWKYLQEKNDFSKHDLDIFLEEHTYDVKDIDSFSHMDEFIEFFFKRNDYTLPEFYIVDSEGKRKPIFSNFVIPFLKFAAYKLEDNLSSQNLKVTRDVLNSLLIALFQQILNISYRTLILELQVLKEQNMLKGETGEKRFKYFSEIYLSDHFWDILKEYPVMFRLIIENIQNWVTNNVEFLTNLKEDKALLQEHFSINGELTKIESGVSDFHNHGKSVYLLWFGTNKLVYKPRDLILDVKFQNLLSWYNLKFNKNLYVTNILNRGNYGWVEYIEHLPCTYESDFIQFYTHLGYLLFLLFAMRGNDIHFENIIAKGNRPVLVDIETLFHNTTEYRKEYETADKLIFSLLEKSVKRVGILPNIVWGKDGNSGVDISGLSSSAGEMIPIERASIMHSMTDEMKIGYEQSALQSKDNQPFIQSGKDVDLNSYKNYVSAGFKEAYEIISKDPSSIEEFLVEIEKFNNAYSRQIMRPTQFYSNLIQTSYHPSFLRSGLDREMLFSKVWKIVFEDKKVQRIASSEFESLLLGDIPLFQTKISDRFLCSELTKYQNFFNISGMELAIQQIKDFCQKDMEFQLNLIETTLNYDPSYNLVQDSFVPLKPSVKVIRNLNQKEIQETKSRIVPLTEKIADYLSKISYSGTSGDICWVDMNILGEKTNDWNMVPIGCDLYNGISGIMLFYLFLYLETKKKDYLIYLKKCYRSLKYYLDSRKQFATHSQVLFGGFSGETPIIYVLLLLKTRMPEHFNSDELDVYIYDIIDDLKKGYRYDENFDVLVGSAGAIIHLLNVFEVTRDEELLILAHDLFSHLEKNSTKITVEGQDGRAWKGTMASNPLAGFAHGVSGIVWALSKLSRYFPEDKKLKTIIKQGIIFENSMFDTEKSNWSDYRETESGIKYKDIVENIPVSWCHGAPGILISRLELYKNNTLNVEFRRTMKSDMDVAIDTTIKYGFGKSHCLCHGDLGNLNILFYVAKKMSSEHLYNVVYSYLNTILDDLESENWKCGLPYKNSPSLMSGIAGIGLGLLTLNNLSIPSVINLEIW (SEQ ID NO:1453)
>orf02192
MLMLEFTKLRRRKILYMIPFVAILLFLLEFMIGHQIYQGHSYGSVNGWYVENGFFFLNYYFLLPFASMILVDLIRIEQVSKTISNLRLIPVDLKQLLQAKFMLALLINLLVSEFTFLAMLVLELMDGDFAFSSLAMLSWGVVYGAIAFAYTLSAGIIVLFLGKYRKELILALPLAFLLSFAGLFALTTVVGRYYLANLLLIIMEQFTALTVSVYAIWLVTLVACLLYLLADKRMMNIIFAYK (SEQ ID NO:1454)
>orf02194
LLDLVKIEFLKQRHQKLNFVVYGVVSLYLALICYYVNDARGLFDSFPFVYKFSLSYLNFLILPLYCVSYTIQAFGLEYRYRIMNNLKLASANLMKTFWAKILYIEINALCIMLFTYISVSLFALLSRFSSTVSLSLLLRFFYLCMSSGILIPMGVFPLVALVMIKVRGKEIVGNLVGVMYVLVSFFLARTSPNISPVTSANSLIWEGNREGVVLQQPAILSIVVLGLSLLVLSFLSIKSWLRKVDE (SEQ ID NO:1455)
>orf02195
MRNPVIQTFNLSKSYDGKIVLDRIDFTLRQGEIYGLLGRNGTGKTTFIKAILGLTAMDSGEVNILSEKLLGEFSKDLLSQIGVVLDSASFYPNLTGPENLSIFARLRGISLKQVEQALQVVGLDGENKKLFKQYSLGMKQRLAIANAIMHQPKILILDEPTNGLDPIGILEMRRYLKELSTNHGISILISSHIISELEKLVDRVGILHDAHLMAEKTMKELIDGADKRKIHLIVSDAPQAKEVLCRINLQEQISILSDIELELQGESPTFDIAVVSNSLKDNGIVLKEYSYKNNESLEDYFKRITGGEGIA (SEQ ID NO:1456)
>orf02196
MTILSDKLKAKRKEKGFSQKTLSEGICEQSQISKIERGNYMPAADLLYKLANRLQVPLDYFFDEQIEMTSNITPFKKLAEKLLEDRNYEDLEYLLNLEKEKSQYLSTEDEFYLLWIQSIILFYLHSSKDEAIASLENALPKLSVSSSVYLKLLNTLSNFYFSVGRDAEYEENNSLLISLYQEKDLNHQEYLFGYIRVKHNFAYYLHSKGKELEAVQEALETIDFCKQKETSYQLAPLLTIVANAGKDFLKHDEILDYYLQARDICKIYEHKLMMAKIDHFLKDKDR(SEQ ID NO:1457)
>orf02197
MNDLLLIPVIFLAVGGILILLWRLFLIASGLFLIGFVSFLIFVEVYGIYLLFTETELYTADLAQNGLFGFTTFFIIFNLVLLALACWAGYKWKRGY (SEQ ID NO:1458)
>orf02199
MEDTYYQLEEALVQGFQTPEEYQAYKELKEHYEEVTGDYSFSKRELTSQLEIALQNHRGVDFEEYEKKDYLELVQKLEEFDSSLATHYRQLID (SEQ ID NO:1459)
>orf02200
MVRRWVLSLQRNGRIIRQGNENKEVDIYHYITKGSFDNYLWATQENKLRYIKQIMTSKEPIRAAEDIDEQTMTASDFKALATGNPYLKYKMELENDLTLLENQRRAFQRSKDHYRHTISYCEENMPILEKRLSKYEGDIQQSEISKDQAFSMRVGKQSFDQRAEAGESLHRLIRHNQADSKEFRTLASYRGFDIKMLSLPTNQPLPETFSVKIVGENQYSVSLDLYSPLGTIQRLQHTIDHIKEDQVKTQNLLDELKDKWNTAKVEIEKNFPKEEDYQTKKAEYDVLAPLIETETDLDTIDQALRQFHEKGKEKQEQLSFELD (SEQ ID NO:1460)
>orf02202
MRNPQNVLNNLTKHSKDKNYQFERLYRLLYNKEMYLVAYQTIYANPGHLTPGVDELTIDGMSIARIDQLIDSLKDESYQPHPSRRTYIPKKNGKLRPLGIPSFDDKLLQQVIKMILEAIYEGQFEPSSHGFRPNKSCHTALTQIQKTYTGTKWFIEGDIKSFFDNINHDVMIHILRERITDERFLRLIRKFLNAGYVEDWKFYKTYSGTPQGGIISPILANIYLDKFDKYMTDYVKNFCQGKYRKRTPEYRQNEIALGKARRALECVSTENQRQEVIQRIRQLEKERVLIPHSDPMDSSFKRLTYTRYADDFICGVIGSKEDAHRIKADIKDYLEAVLKLELSVEKTLITNARDKAKFLGYHLYIRQSNLAKRDSAGRLVRNYTGRLVLEVSIETIRDRLLSYGAMKMTYHRGYEVWKPTARYFMKDCDDLEILERYNAEIRGFYNYYCIANNSSILHRFKYIMEYSMYKTYATKYRTTKSHIIRKYKKDGQFSVQYIGRKGDTMTRYLYNGGFKRQKKSFLENDNLPNTAKYFSRTNLIDRLKASRCEYCQATDSSLEIHHVRKLKDLKGKTFWERLMISRQRKTIALCKDCHKKLHHGKLD(SEQ ID NO:1461)
>orf02208
MEVMKLLAMFRGTIPKDREKMDLFLRYQAQHFDEKWQDLVESFLAEEGKIEEIPHVYSFHQDIISFLEASSENNDQDLESYTRNFGQAGLSKLSQLSNFEKNLVLEVATYNLSTRFYIQSEKEKLEPLSELVCLQNQDVNLVNVYRVANNLSDRISRDIEEFLLMVDSKELTKEVLEIHFEEKEGDVLAYLGSELMATLDTVTDLVHHEENYTQLPLTQKLKIITHFDDVKARSEKSNQVEEVLSPSSDIEQETEETNSFSNVDKIVEEALREYPIGSQVSYKGQVFQLVSIENAQLNDLIRLELFNDSNQLFEENPILYLNSLEEIEQVLSLVELEKEDSEIEIDSSSESQEIDLFSYLEEEKENEKDKETETLIVGIEETDVPVQDFVFPDDLEDFYPKTNREKIETNIAAIELVKRLEKEGRQANPEEQELLAKYVGWGGLANEFFDELNPKYETERLTLKSLVSKSEYSTMKQSSLTAYYTDPMIIRQIWQKLLDDGFEGGRILDPSMGTGNFFAAMPRSIRDKSELYGVELDSVTGAIAKKLHPNTHIEVRGFEEVPYQNNSFDLVLTNVPFGNFRIADKNYDKPYMIHDYFVKHSLDLVRDGGQVSIISSIGTMDKRTDNVLQEIKTNTHFLGGVRLPDTAFKSIAGTRVTTDILFFQKDQAKNLNEEELVFSGSIPFEEDKRVWINPYFDGKYNTQVLGEYEVRNFNGGTLNVKGVSETLSTDIMKALENVEAPKQIDNFLKAPVFIQEEVDNSLPSRIREDLALYSFGYERNQIYYRDTHGIRKSSKVDEISYYVDEKGDFKAWDSSLSEHKIDRFVQLHLTDEEALDVYKSEEASKRGKYKGLFKKTVFYESPLSDKDISRIKGMVDLRETYQSLIEIQRHQDYSRTDFQVLLSKLNHNYDRFVSQFGYLNASVNRNLFDSDDKYSLLASLEDEYIDSKDQKVKYKKSLAFEKALVRPERVIARVSTALDALNSSLSDGRGVDLDYMVSIYPEHSQAAILDELGDQILIDPESYLRGERKYLSKNQFLSGDILNKIEVVQLLVEENNQEYDWNHALDLLESVRPPRIHLADIEFKIGSRWIPQSVYGKFAFECFTNREFELSSPDVEQVIEANPVDGQVHLRTSFAYRYPSAKDSSLGVSGSRYDTGRKIFENLLNSNQPTITMTVTEGEKKKTITDLEKTSVLRAKEQHLQELFQDFVSRYPEVQQVIEESYNRLYNRTVSREYDGSHLVIDGLAQNISLRPHQENAIQRIVEEKRALLAHEVGSGKTLTMLGAGFKLKELGMVHKPLYVVPSSLSAQFGQEIMKFFPTKKVFVTTKKDFVKARRKQFVSRIITGDYDAIVIGDSQFEKIPVSKERQMNYIEDKLNELREIKTHSENKYTVKEAEQSISGLEKQLEELQRFNRDSFIDFENLGIDFLFVDEAHHFKNIRPITGLGNVAGITNTTSKKNVDMEMKVRQIQEEHDFKNIVFATGTPVSNSISELYTMMNYIQPDILKRYQVDYFDSWVGAFGEIQNSMELAPTGDKYQPKKRFKKFVNLPELMKIYKETADIQTQDMLDLPVPEAHIIPIESELTENQKLYLEELVMRSDMVKCGTVDPSQDNMLKITGEARKLAIDMRLLDSSYSLADNHKLLQVVDNVERIYREGMENKATQMIFSDIGTPKKKDNGFDVYSEIKALLVDRGIPSKEIAFVHDANSDEKKNSLSRKVNAGEVRILLASTEKGGTGLNVQSKMKAVHHLDVPWRPSDIQRASVKAV (SEQ ID NO:1462)
>orf02209
MEGIYQRDSDQDGLTDAQELALGTNPLSADSDGDGRSDLVEVEEGTNPLEKDLQDIDQTSITEPSSVFMEMKQKISDMMESHYKEFIQALISIETGIENEQDLEDLYTYYMRTDSISLLSSDLETSPQKVEMEIEL (SEQ ID NO:1463)
>orf02210
VADTRTKSDSNLGRKGQAVADTRINRLTRWLTDSVNHLFCEENMANSRDYRNPNYTEKIKLQRFFTQLQIAASFFKEHFVGKIMYYETEIESVELHFSPTNFMHLCGVDYRKGAGSFFDDCLNRHVIIDELKIKKDGTTMQKLQVLGSIEELLGKHVHLTGSGRYLYLEFDYALRTRKQILALTLKETSRKIVPQSLLDLKRKTVFPKGQKVISIYSKHLQTSELFYYLKD(SEQ ID NO:1464)
>orf02217
MKDKREIIRARKAFRRSLKDEKKFLKKGKKEVKKQKKDSAVLDEKAWKKEIKKKLEEMREASKARVKQANEDYNHILQNSPPSLLNRKELRDRRLPHARKRLKIAKKQYREAKVEAKEERKESRKERKTNQKFLYGQESKAKSNFFFQGKSLEELKVKKEVKTAKENLKSTKQAYKSKKVSRKAKTFLYVLGREGGELASENEDLDGYRTLQETIRKGKRYSRLSYNLGKASVKTGQATCRFTKKRLTNTKERYHHFKDGKGWKLAKDKPSSFKNRFRKLKKQGLTSVRNIYQKLKAAFSFFTFAAGNPVTWIVGGIVFLLLLIMSFFLGFSSASLIQQDEFELTKAYTHLTWEDAEHTRTNDKGITYYTKVDDVMGYMNFKFHDYELHKPVHLFSSETYKDYLSTLWHDLNDGEDLKSMQDLYETPKYKLSKDDQEEMKELKEEGIYASMQELDNPFEGKSNEDSLTMTYRYGYYDLDGKPTLQEYILLEAKAHQTIVAPMDGVVSLDGDNVILTNGKGENESRLTLYSIHNGRAIEGTRVLTGDIIGETPDDTGLKVSYQKYKNKKEKLVYVNPQFYFPKVIQLQTTILPAIGQFGGDEFERAKHIYEFLKSQGASPQAIAAILGNWSVESSINPKRAEGDYLSPPVGATDSSWDNETWLAIGGPAIYSGAYPNILHRGLGLGQWTDTADGSTRHTALLNYAHSKNKKWYDLDLQLDFMLHGDSPYYQSWLKDFFGNTGSAANLAQLFLTYWEGNSGDKLLERQTRATEWYYQIEKGFSQTNGGQAKSDPQSLEGVRGDLYDHSVPGGGDGMAYAYGQCTWGVAARMNQLGLKLKGRNGEKISIINTMGNGQDWVATASSLGGETGSTPKAGAIVSFVGGTHGTPAIYGHVAFVEKVYDDGSFLVSETNYGGNPNYTFRKISQADSAISFAYTTK(SEQ ID NO:1465)
>orf02219
MTYKKEEVKGKKEEVLPSTANTISYQALYQNGLMQVKEDYFSQSYLLGDVNYQTVGLEDKGAIIEKYSDLINSLDDQTNFQLTIFNKRLNLEKFRQSVLYEEKEDGYDTYRKELNRMMNQNLDSGENNFSAVKLISFGRKDSNPKQAYRSLSQIGEYFKSGFSEIDARFESLAGEERVNLLADMLRGEHHLPFSYCDLTRSGQTTRHFIAPNLLDFKNKNYLQINDRLLQIVYVRDYGMELGDQFIRDLMQGDLELIVSLHAQSSTKADAMKKLRTKKTLMESQKIGEQQKLARTGIYLEKVGHVLESNIDEAEELLKTMTETGDKLFQTVFLIGVFGQDEEELKQALDTIQQVAGSNDLMIDKLPYMQEAAFNCLLPFGCDFLEGVSRSLLTSNIAVNSPWTSVDLQDRSGKYYGINQISSNIITIDRSLLNTPSGLILGTSGAGKGMATKHEIITTKIKESGENTEIIIVDPEAEYSVIGRAFGGEMIDIAPDSQTYLNVLDLSEENMDEDPVKVKSEFLLSFIGKLLDRKMDGREKSIIDRVTRLTYQSFKEPSLEEWVFVLSQQPEEEAQNLALDMELYVEGSLDIFSHKTNIQTGSNFLIYNVKKLGDELKQIALMVVFDQIWNRVVRNQKLGKKTWIYFDEMQLLLLDKYASDFFFKLWSRVRKYGASPTGITQNVETLLLDPNGRRIIANSEFMILLKQAKNDREELVQLLGLSKELEKYLVNPEKGAGLIKAGSVVVPFKNKIPKGTQLFDIMSTDPDKMASN(SEQ ID NO:1466)
>orf02220
MNTRVFKDISKYQHRAWLGFTTRQIIFVLPAFIVTIIVLGLNLFFWQFGDWFVYGFVFAFTIPLMLFGVYKPNDLYFEHYLKYRLHFELTVPLRTITGKKGHEHEKKIKYIKETKSFNDL(SEQ ID NO:1467)
>orf02221
MNLSLVSPFVYLASEKISAENLFEGFSVDLQSTVDLIKSLSSYNPTVWTYMSSITKSVMQPLGVAILSVVLILEFSKMAKKIANSGGAMTFEALAPMLISYIMVAVVITNTTVIVEAIIGIASHAIEQVASIVAHGGAKYDTISGLKGSGFIGRMIVGFFALLIWLVRIVSAAMVNLLVSIRFIQLYLMIPFAPLTIPTFLSDEWKSIGIGYLKNIMVYAVQGVLIFLIVSLVPLFESAGKIAVSNGAGVLQSLAIMFGSLVQAILLIIALVGSQRTARSILGM(SEQ ID NO:1468)
>orf02223
MITHFKGFVYGVDASAMFAQAMSLLQKGLIAVGAFLVVVGIVNLATNIKDGGPGVRNAILEIVGGVMVGAAGAFVTQISI (SEQ ID NO:1469)
>orf02224
MMYSGKKFLLFSLLGILLGYLFHRLTLLYDSYTGNSLDKWTHLLMEGQDEVLQSPWNVSFTGKSSAFFLLGFVMMLLVYLYLETGKKQYREGIEYGSAHFGTLKEKKLFYGKEFSHDTILAQDVRLTLLDKKPPQYDRNKNIAVIGGSGSGKTFRFVKPNLIQMNSSNIVVDPKDHLAEKTGKLFLEHGYQVKVLDLVNMKNSDGFNPFRYIETENDLNRMLTVYFNNTKGSGSRSDPFWDEASMTLVRALASYLVDFYNPPKTREQLIEESRLSQKEHQNLLKRQKKEVEERKKRGRYPSFAEISKLIKHLSKGENQEKSVLEILFENYAKKYGTENFTMRNWADFQNYKDKTLDSVIAVTTAKFALFNIQSVMDLTKRDTLDMKTWGKEKSMVYLVIPDNDSTFRFLSALFFSTVFQTLTRQADIDFKGQLPLHVRVYLDEFANIGEIPDFAEQTSTVRSRNMSLVPILQNIAQLQGLYKEKEAWKTILGNCDSLVYLGGNDEDTFKFMSGLLGKQTIDVRNTSRSFGQTGSGSLSHQKIARDLMTPDEVGNMKRHECLVRIANMPVFKSKKYNSTKHPNWKYLANQETDERWWDYQINPLNQSQENHLEGLRIRDLTFESSLK (SEQ ID NO:1470)
>orf02225
MSSEQQERMAVQYAERSLLFTVKSLLKILEWSRRQALAQDSAYKIGVQKLEELLQSPYSIDTINLKKDFLDKPIDIEKFKAFLEKEEIPLAIAWQGDSLHFYTKDRSILDNHLDQLLEKMVNDPEKLADFTMDKSLDDAIDEAKSQITFRQEGAVKQKEMVR(SEQ ID NO:1471)
>orf02226
MKVVNLYDLKQMGNKGGCTIQLIHHFPFGMGLGHLKKDYIEFKRVGIVDGKAVEVTLREPYSRDLLQVVKSIKQRQKLIAYRYKEGKLLFVKEEASDVL (SEQ ID NO:1472)
>orf02227
MFSNANSFKAKIKNISKDKGIPAQQVQQHYLIEQVLKLISTSSYRDSFIVKGGYLIGQMIGLDKRTTMDLDVTLKGTEMSRENLIHIFEEILCSKTDGFSFSVDKLEPIRQDDEYGGFSLKLNATFDTLKEVVFIDITTGDKITPREITYSMTSIFTNESIKIWTYNLETVLAEKLETIISRGLASTRPRDRYDLFTLYKLRKEEINLEVLKNALENTAEKRKSKDTIYNWEEQVRGIEISDYQKELWIRYQRQFKYAKDISFDNSVQVIREIMQQIF (SEQ ID NO:1473)
>orf02228
MVDKREKLMNSFNQYGFLTFKQVIDENLHYKTLLKMVAEGKIDAEEKGLYRLPDIYLDEWFVLQYRFPKGIFSLETALWLHGLSLTIPFNMTMSFPYGTNTKNIKEADICPIILRSHYSEGIIEIERLPGQFIKVYEVERVLVECLRPVHQVDLQIIAPAFKKYFQQNKIHLHKLFYYAQLFKVTDKLQSYTEVLS (SEQ ID NO:1474)
>orf02229
MRCLFFYPILKGSELMKTKNQESKGRSPLFKTIKHSFSQ (SEQ ID NO:1475)
>orf02230
MELKFVIPNMEKTFGNLEFAGEDKVVQRRINGRLTVLSRSYNLYSDVQRADDIVVVLPAEAGEKHFGFEERVKLVNPRITAEGYKIGTRGFTNYLLHADDMIKE (SEQ ID NO:1476)
>orf02231
MRLANGIVLDKDTTFGELKFSALRREVRIQNEDGSVSDEIKERTYDLKSKGQGRMIQVSIPASVPLKEFDYNARVELINPIADTVATATYQGADVDWYIKADDIVLTKDSSSFKAQPQAKKEPTQDK (SEQ ID NO:1477)
>orf02233
MKQRGKRIRPSGKDLVFHFTIASLLPVFLLVVGLFHVKTIQQINWQDFNLSQADKIDIPYLIISFSVAILICLLVAFVFKRVRYDTVKQLYHRQKLAKMILENKWYESEQVKTEGFFKDSAGRTKEKITYFPKMYYRLKNGLIQIRVEITLGKYQDQLLHLEKKLESGLYCELTDKELKDSYVEYTLLYDTIASRISIDEVEAKDGKLRLMKNVWWEYDKLPHMLIAGGTGGGKTYFILTLIEALLHTDSKLYILDPKNADLADLGSVMANVYYRKEDLLSCIETFYEEMMKRSEEMKQMKNYKTGKNYAYLGLPAHFLIFDEYVAFMEMLGTKENTAVMNKLKQIVMLGRQAGFFLILACQRPDAKYLGDGIRDQFNFRVALGRMSEMGYGMMFGSDVQKDFFLKRIKGRGYVDVGTSVISEFYTPLVPKGYDFLEEIKKLSNSRQSTQATCEAEVAGVD (SEQ ID NO:1478)
>orf02234
LAYGLSQNRLAVATGITRQYLSDIETGKVKPSEDLQQSLWEALERFNPDAPLEMLFDYVRIRFPTTDVQQVVENILQLKLSYFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGRGCRQFESYLLAQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECISVFRSFKSYRSGELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKNDIPIEDAEVKNRFEIRLKNERAYYAVRDLLVYDNPEHTAFKIINRYIRFVDKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQVAPTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTKK (SEQ ID NO:1479)
>orf02235
MNFGQNLYNWFLSNAQSLVLLAIVVIGLYLGFKREFSKLIGFLIIAIIAVGLVFNAAGVKDILLELFNRIIGA (SEQ ID NO:1480)
>orf02236
MNGVFLIFIIQADFLFDFLKVNGKGKPRTDQFALIVFA (SEQ ID NO:1481)
>orf02237
MYDVARYYIEETGALGEVPASLQNYIDYQAYGRDLDLSGTFISTNHGIFEIVY(SEQ ID NO:1482)
>orf02239
MYLIGYAIKFTPNCCNGFLWLVSAKRYFFSCIGQLWSQCISNDRLQKSIRLFTIKSFCRHKPCESHRNPKVFEKCSLYHGKRGQVAKYHHCKE (SEQ ID NO:1483)
>orf02242
MAYPIKYIENNLVWNKDGECYAYYELVPYNYSFLSPEQKIQVHDSFRQLIAQNRDGKIHALQISTESSIRSAQERSKNEVTGKLKAVAYDKIDQQTDALISMIGENQVNYRFFIGFKLLLNDQEFSMKSLTVEAKNALSDFVYDVNHKLMGDFVSMSNDEILRFQKMEKLLENKISRRFKIRRLDKDDFGYLIEHLYGQTGTAYEEYEYHLSKKKLDNETLIKYYDLIKPTRCLVEEKQRYLKIQQEDETVYVAYFTINSIVGELDFPSSEIFYYQQQQFTFPIDTSMNVEIVANRKALSTVRNKKKELKDLDNHAWQSDNETSSNVAEALESVNELETNLDQSKESMYKLSYVVRVSANDLDELKRRCNEVKDFYDDLSVKLVRPFGDMLGLHEEFLPASKRYMNDYIQYVTSDFLAGLGFGATQMLGENEGIYVGYSLDTGRNVYLKPALASQGVKGSVTNALASAFVGSLGGGKSFANNLIVYYAVLYGAQAVIVDPKAERGRWKETLPEISHEINIVTLTSDEKNKGLLDPYVIMKNPKDSESLAIDILTFLTGISSRDGERFPILRKAIRAVTNSEVRGLMKVIEELRVENTPLSTSIADHIESFTDYDFAHLLFSNGYVEQSISLEKQLNIIQVADLVLPDKETSFEEYTTMELLSVAMLIVISTFALDFIHTDRSIFKIVDLDEAWSFLQVAQGKTLSMKLVRAGRAMNAGVYFVTQNTDDLLDEKLKNNLGLKFAFRSTDLNEIKKTLAFFGVDPEDENNQKRLRDLENGQCLISDLYGRVGVIQFHPVFEELLHAFDTRPPVRKEV (SEQ ID NO:1484)
>orf02244
VKPSIVNRIKSNWTLKRLGKVAMTVAFTLVIAIFLLAMLGTVVQAAGLVDDTVNVANEYSRYPLENYQLDFYVDNSWGWLPWNWSDGIGKQVMYGLYAITNFIWTISLYVSNATGYLVQEAYSLDFISATADSIGKNMQTLAGVSANGFSTEGFYVGFLLLLILVLGVYVAYTGLIKRETTKAIHAIMNFVLVFILSASFIAYAPDYIKKINDFSSDISNASLSLGTKIVMPHSDSQGKDSVDLIRDSLFSIQVQQPWLLLQYNSSDIESIGIDRVESLLSTSPDSNNGEDREKIVAEEIEDRSNTNLTITKTINRLGTVFFLFVFNIGISIFVFLLTGIMIFSQVLFIIYAMFLPVSFILSMIPSFDGMSKRAITKLFNTILTRAGITLIITTAFSISTMLYTLSAGYPFFLIAFLQIVTFAGIYFKLGDLMSMFSLQSNDSQSVGSRVMRKPRMLMHAHMHRLQRKLGRSMTTLGAGSAIVTGKKGQSGSGSSARTQADHSRPDGKEKSTLGKRIGQTIGTVADTKDRMVDTASGLKEQVKDLPTNARYAVYQGKSKVKENVRDLTSSISQTKADRASGRKEQQEQRRKTIAKRRSEMKQVKQKKQPASSVHERPTTRQEQYHDEQTSKQSNIQTSYKESQQAKQERPAVKSDFSSPKVERQGNTVQEKTVQKPATSTTTADRTSQRPI TKERPSTVQRVPLQNTRTTNQNRHH(SEQ ID NO:1485)
>orf02246
MKLKTLVIGGSGLFLMVFSLLLFVAILFSDEQDSGISNIHYGGVNVSAEVLAHKPMVEKYAKEYGVEEYVNILLAIIQVESGGTAEDVMQSSESLGLPPNSLSTEESIKQGVKYFSELLASSERLSVDLESVIQSYNYGGGFLGYVANRGNKYTFELAQSFSKEYSGGEKVSYPNPIAIPINGGWRYNYGNMFYVQLVTQYLVTTEFDDDTVQAIMDEALKYEGWRYVYGGASPTTSFDCSGLTQWTYGKAGINLPRTAQQQYDVTQHIPLSEAQAGDLVFFHSTYNAGSYITHVGIYLGNNRMFHAGDPIGYADLTSPYWQQHLVGAGRIKQ (SEQ ID NO:1486)
>orf02247
MMKFRKNQNKEKQIPKEKKPRVYKVNPHKKVVIALWVLLGLSFSFAIFKHFTAIDTHTIHETTIIEKEYVDTHHVENFVENFAKVYYSWEQSDKSIDNRMESLKGYLTDELQALNVDTVRKDIPVSSSVRGFQIWTVEPTGDNEFNVTYSVDQLITEGENTKTVHSAYIVSVYVDGSGNMVLVKNPTITNIPKKSSYKPKAIESEGTVDSITTNEINEFLTTFFKLYPTATASELSYYVNDGILKPIGKEYIFQELVNPIHNRKDNQVTVSLTVEYIDQQTKATQVSQFDLVLEKNGSNWKIIE (SEQ ID NO:1487)
>orf02249
MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFQNCSLFPLYHGSAKSNIGIDNLIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVPPNPFWASIGLSVAQLPLGSGVQYESSVSLGYLNQSFQNAVMEGIRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYNDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNKIT (SEQ ID NO:1488)
>orf02250
MKPSSFQTTIENQFDYICKRAMEDERKNYMLYLSRIAKREVSFSDVGDYLVSQFATTDNYSTDFQIFTLNGLSVGVENDLLSEALRELPDKKREILLLFYFMDMSDSEIADLLKLNRSTVYRHRTSGLALIKKFMEEFEE (SEQ ID NO:1489)
>orf02251
LATLDCVQCIYNFFKLFSFNLNTIAIHNQPICIFIFLCQAS (SEQ ID NO:1490)
>orf02252
MSEKRRDNKGRILKTGESQRKDGRYLYKYIDS FGEPQFVYSWKLVATDRVPAGKRDCISLREKIAELQKDIHDGIDVVGKKMTLCQLYAKQNAQRPKVRKNTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSENGYAYQTINNYKRSLKASFYIAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLLAFAKADKTYSKNYDEILILLKTGLRISEFGGLTLPDLDFENRLVNIDHQLLRDTEIGYYIETPKTKSGERQVPMVEEAYQAFKRVLANRKNDKRVEIDGYSDFLFLNRKNYPKVQVITTA (SEQ ID NO:1491)
>orf02253
MGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQERLVA (SEQ ID NO:1492)
>orf02254
MKRIIPVYIFQQVNVLLVSLYLLKLLCISELTILQILYCASLISFLWMYGQRKQVVKVNMKTRMKWLGIGFVSLLIINLCFSLIHAQGTTNQANLIGLQHQVPWFSFLLLLINASMVEEFLYREILWNLVRKLDIRVALTSILFVLAHHPGTILAWCLYVSLGMFLGLVRYKSDLWGSMGLHLVWNLSVYVLFFL (SEQ ID NO:1493)
>orf02259
MKRITANQYQTSERYYKLPKILFEDEKYMDMKLEVKVAYSILKDRLELSLSRGWIDEEGAVYLVFSNSKLMKLLGCSKSKLLSIKKILKEYDLIDEVQQSSSEKGRLANKIYLGELSSTPVASSNRPSVKKKIGQVENETAPVSHSAPSETEVSETKYSETDSLFSEDEEERYTQPILKRKVEKVTKYDQDYIWGLVQDQFRREGFSETASEIAMTDFERIYQYALDNVRFVRRAEVLAEFVFNGLYSVWNNRVRKGGG (SEQ ID NO:1494)
>orf02260
MTKELQSSRYIVISFLVREMGIDIVEAISLMAELEKSGLVRLESSGDLILKELGGAL(SEQ ID NO:1495)
>orf02261
MIVILLSFFLQKIKKGEQYSTVLQNIFIKKKNPAKLIFGRVFGRKLN(SEQ ID NO:1496)
>orf02276
LQVWYNLQSDFEQEITLIMWNPFANLVFNQPLISFFADLNLKILGYSYTDRVKTWPDIGTGCRYNNLHLILLAP (SEQ ID NO:1497)
>orf02314
MVICHNDYLLWLPEFSQPLTSLSQTTFFNLNIIRMMRNIDSDFHRRISFSLLVFFC(SEQ ID NO:1498)
>orf02318
LIEGHLVFADKPAQALVLLRKVGSPKKVSFLTLHLYFLILKIDILKITGF(SEQ ID NO:1499)
>orf02324
MFLHHLLQIKGGLGIQTSQGFVQNPNIRSRQECPNDKDFLTHSVRKSFNNLIAVFSKIKNVQ (SEQ ID NO:1500)
>orf02326
MTNQAHNLSIFNLQIEITQGLFITIQLTNILKFNHGTTPIFICIIHRYTSTLIQKNQYFILSS (SEQ ID NO:1501)
>orf02348
MAVTKSQVFSRQGFDFSILGQDLTRLQDVSNLATIGTRIHKDSTANASWNTTSKLKAS(SEQ ID NO:1502)
>orf02349
MTEGNASCFNQVSPSFCFNGIAINRNVIELVTQDDKSTNPTITNDDIACIAKNHPRDIFLVGKFHNASQLKTISWKDQIISLSTYFCITIAMQGFLKTDINSF (SEQ ID NO:1503)
>orf02361
MRLRDLRRVDFPDPDGPIKAVISLGWKDRETLFKAFFLL (SEQ ID NO:1504)
>orf02364
LKNHSNVFTHFINVDFWTVDINSTIENLPSYFSNINGIIHAIETA (SEQ ID NO:1505)
>orf02365
LHINPLNGFIFTIVNMDILSRKGYFFFRKGKDMLLIPVIC (SEQ ID NO:1506)
>orf02387
MTIHIQVVKTNMVILADRFFQGLILRSTDKFFIKIRLVRSHNLRFNNMDFSTVAVHENKGRHHVDELLLRFIINSKATVAKKSIVAQGFRFDGNFFRKTRQTNHLNIIFYDNPDQIIFFQNGLITNSQFNRLHP (SEQ ID NO:1507)
>orf02404
MVKRRIRRGTREPEKVVVPEQSSIPSYPVSVTSNQGTDVAVEPAKAVAPTTGWKQENGMWYFYNTDGSMATGWVQVNGSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVR (SEQ ID NO:1508)
>orf02420
MRFIVGRFTSFSLGIEFSPTSKLDDLLFKIAFLMILATWIKARKTKEAT(SEQ ID NO:1509)
>orf02424
MANDNKSHYLIYRVLGISFEEGENIDLYQNKGRFLYKYAGSFLEEAAVLSFNEKFGTENT(SEQ ID NO:1510)
>orf02433
LHYRTTPTLIMVVQRDCLILSFPRQKGPIVGQMAIIL (SEQ ID NO:1511)
>orf02435
VFALLDNSTFLRKSLHLRKMEMFPVETEEITYKRKKSKGKRQAILAQFDSEEVHHQVEESICPDCQDDLKEIGASLQRQELVFIPAQLKRVDHIQHAYKCQTCSKNNPSDKIVKAPIPKAPL (SEQ ID NO:1512)
>orf02451
LVCQTIKYWHKFHLHIGRCKLLIGLIPVLNFFIRADIDCLLVLLSLIDRQNGKQFNLCQWIIASNGLNDSFEIIESLIHRNILSDIICPNQKKNFIYCSTI (SEQ ID NO:1513)
>orf02459
VGHNSGSFFLFLLLRLLLSPLLRNSISFLTSQGIPWKLSNNKTKPIDKPTASKSIATNPLLLHLR (SEQ ID NO:1514)
>orf02480
LDFLNHLWVAHAGNSSSCTNISRNSFQGHHSCRTSSFCDTSLFRILHIHNNTTLEHLCQVFI (SEQ ID NO:1515)
>orf02505
MLLLISLTQLIIFLFFERFNLLLKTFLLVDLKSNKSA (SEQ ID NO:1516)
>orf02517
MGFSMKLIHDLNTHTTHSTAKMLHNVKAIKNDFSIRE (SEQ ID NO:1517)
>orf02524
MQEHYTPKGKHLTIDNRRLIERWKNENKSNREIAGLLGKAPQTIHTEVKRGTTLQQVRKGLYKKVYSADYAQTVYQFNRKRSVKKLILTKEIREKILHYHKQKFSPEMMVNKKQVKVGISTIYYWFHNGHLGLTKADMLYPRKRKGVKKQASPNFKPAGKSIEERPDVINLRLENGHYEIDTVLLTKIKNYCLLVLTDRRSRHQIIRLIPNKTAESVNQALTLLLGEHRILSITADNGSEFKRLSEVFPEEHIYYAHAYSSWERGSNENHNRLIRRWLPKGTKKTTPKEVAFIENWINNYPKKCLDYKSPSEFLLGG (SEQ ID NO:1518)
>orf02527
VVPRYVTKHQGWDHNSHTITNSDDDPATLVTFRTFKFNVGNCTIPKNDQNGSSQKFSGIL(SEQ ID NO:1519)
>orf02534
MAYSTDFKQRALDYIKEGHSHVEAAKFFGVGVRTLFTWEKKDVNKDT(SEQ ID NO:1520)
>orf02585
MAVQANWSFDITHDSSFFFSNQKRGLNFSQMCFKDRRRNGFFDRKIFKFKFNNPIQIF(SEQ ID NO:1521)
>orf02595
MEIVLVSFSISFQHFIIAYCLDFSSAGFRNSQNFSNFC (SEQ ID NO:1522)
>orf02608
MKLKLLRVDTKVIMGSFLLVLSSLLALLLPLILKGLIDGSSIENIGSKVFQSFLIFIGQALFSSIGYYLFSQSGEKKIAKIRKKVIEGLIYAEKSFFDKSQSGELTSAIVNDMSVIREFLITTFPNIILSLVMVLGSIVVLFSLDWNLSLLLFITLPCMMFIILPLSNISEKYSRRLQEEIGFLTGQLTEKIQEHELIKTNQAEKSVQDVLDNCIERVQNNSLKSDRVTSFETPFALLFIFATIVVMLTYGGYRVSAGYISVGTLVSFLIYLFQLLNPISNIANFVTIYSSSKGSSVALENLLAVPKEKFEGGKSVSGQGLNFNHVYFGYDENRPVLKDITCSIFKGQKIAFVGPSGSGKSTIVRLLERFYKPLSGDILMEQSSIYDFNLKEWRSKIAWVSQNNAVLSGSIRDNLCLGLNRLVTDDELMKVLDLVSLGDEIRSMKEGLDTEVGERGRLLSGGRSQRLQIARAYLKDAEILIFDEATANLDADSEYAIISSLYSVLKEKTVVIIAHRLSTVKDVDCIFFLEEGKITGSGTHKELLENHERYARFVQEQMIE (SEQ ID NO:1523)
>orf02621
LIRYLDQYEDVILREIKAQFPDVAVDKLMEEYIKAGLILRENKRYYLNFPTLESLDSLELDQEIFVREASPVYQALLEQSFETELRNQINAAILVEKTDFARIKMTLSNYFYKVKQQYPLTEKQQELYDILGDVNPEYALKYMTAFLLKFLKKDQLMQKCRDIFVDSLVVLGYIVQNEDGKYELAIDFDKERLTFYLA (SEQ ID NO:1524)
>orf02622
MIGLKEVCRFLTDNTSLSTSMINHPIQINGNMAIVTCGSLDGLSHV(SEQ ID NO:1525)
>orf02633
MRERVRLSGSLFTSLKTREHIKSTMELFHKYVFFLIQEIKIKMINFLKIGDLPTL(SEQ ID NO:1526)
>orf02643
MIDHFEIKVKDLQISEGFYRSFLAPLDYKLAFKTSSLISFLSPNSPHPGGDFWLTQGTQDPVHFAFLAENKEEVQACYEAGLEAGGRDNGAPGYRSEHPIYYAAFMIDLDGNNIEVVCHKE (SEQ ID NO:1527)
>orf02645
VIVFLSRNKDGNAFCHLDLISIANPVWGWDDDFITWIDHSHKEGIERIFGSRSDCHLI(SEQ ID NO:1528)
>orf02648
LSNQFYFSLQTKPILKVKQFLLFQSQMIRVSEILQFSNKL (SEQ ID NO:1529)
>orf02652
MKHSHKKSFDWYSMQQRYSIRKYYFGAVSVLLGTALVLGAAASVQTVQAEENKQETTNSISVGRGEAATKPAEVSASNKEKTYAAPTVANPVETTPVKTGEVTKPAEKVEEAKDKKEEVTHQDAIDKSKLLTALSRAKKLESKLYTEASAANLQTSIQAGQSLLGKADASEAELSAAESSIQSSIIGLELRSNSNKGTVSETPVAKKANIVEAKEETKPAVTTTERSAVDSAILPISTAAKVETTSAPASTNEILKPSLSLSDARQNPAIRKEDVDRGYSGFRTAGSGFRAAGSGFRAAGPENKPILNPNNTIAFSDISQGLHSFRGIGHSRGGREIHYDVTTVRRGNRLNFTIKYSGPGEFVNNNFILDKGDGFGNPSNATITSSNPRVREQSKSISQGANYVSHSGYSMTSATSTNTEQTIRFSLPIINPNGDLSVRLKPVTFNVDQGGGGAATSNDPYSNSNYYHRANPLLLDANPYGGTNNKTVSEDIDFQTVYLPTSKLPEGQTRLVREGEKGQRQITYKVHRFGNETLLGLPISNRVTKEAKPRIMQIGVAKELIDTVKPRVDQNKVGDTNNLTFYLDNDGNGVYTEGVDELVQKIAIKDGAKGEKGDQGERGLTGAQGTKGEKGDQGERGLTGAQGAKGEKGEQGFQGRDGEQGPKGEDGKTPTVKVTDGQDGTHTITINDGKGGITTTVVRDGFDGASPLVSTHRNEADKTTTVIFYYDLNDNNQFDEGDTKLKEVVIADGKQGPKGDKGDNGKDGFTPEVTVTDNNNGTHTITITQPDNRPSLTTIVKNGEDGKTPKVKAERDDAKKQTTLTFYIDKDGDGSYTAGKDELVQTTVVKDGQDGAAGASGRDGKEVLNGKADPTTEGKDGDTFVNTQTGDVFVKKGNTWEPAGNIKGPKGDKGADGAKGEKGDRGERGLTGAQGTKGEKGDQGERGLTGAQGTKGEKGDRGERGLTGAQGQAGRDAVTPTVTVKDNKNDGTHTITINDGRGNVASTVVRDGFDGASPLVATQRNEADKTTTVIFYYDQNGNNELDASDKKLKEVIIADGAKGEQGLQGRDGQDGAQGQAGRDGKDVLNGKANPEVNQGKDGDKYVNTETGDVFVKNNGNWDKEGNIKGPKGDKGERGEDGKTPEVTVTPGKDGHSTDITFTVPGKDPVTFTVKDGKNGKDGRAPKIKVEDITSPSRIRRDTDAAATPTRNGIRVTVYDDVNDNGVYDEGVDKVLNSKDIYNGIDGRDGSAPTITTKDNGDGTHTITVQNPDSSESTTVVKDGKDGKTANITTTENPDGSHTITVTNPDGSTKETVVKNGKDGKTPKVEVTDNNDGTHTVKVTDGDGNVTNAIIKDGKDGKAATATTTENPDGSHTVTITNPDGTKNEFVVKNGRDGVDGRTPTASVRDNGDGSHTIVITNPEGVTTETTVRDGKSPKVTITDEQNGTHKISVLNGDGTTTETIIKDGKSPVATVRDNQDGTYTIRVENGNGTVSETTVRDGKSPTAKVVDNGDGTHTITVVNSDGTTTTTTVRDGREPKLEVIDNNDGSHTIKVTGADGKGTTTTIFDGKSPKANIVDNGDGTHTLTIVDSDGREYKSIIKDGKDGKDSVSPTVTVKNNNDGTHVVTITNPDGSKTEMVIKDGKDGKCGCQDKPVTPSNDKPVPPTPNVPTPEVPVKPVPAQPTPNVPTPEVPVQPTPAVPTPEVPVKPVPAVPEQPVVPTPAQPATPVNANPVAPTTGKENRGDKLPETGSQSDYISVLLGSGILLSLYVGRRKED (SEQ ID NO:1530)
>orf02654
MRNLLSTKVQRQLRLMETLIQNRNWMKLHELAEKLGCTERILKSDLNELRIAFPSINIQSSVNGIMIDLEVNTSVEDIYQYFLANSQSFQLLEYMFFNEGLPIYRTIENLYFSSANLYRLGRNITKVLSSQFQIELSFTPSEIRGNEIDIRYFFAQYFSERYYFLDWPFPDLPEEDLTEFADFFYKITNYPMRFSIYRMYKLMIAISIHRVKNGHFIDLPNHFYKEYYPLLKSIPNFQETLAYFSKHFGLEMTPDTIAQIFISFLQNDIFLDPQEFFNSLEDNSQARYSYQLLSQILERLSKQYKITFTNHDELIWHLHNTAFFERQEIFSTPILFEQKALTIKKFEVYFPDFMGSARQELAQYRQAIGQHDHPEQLEHLMYTILTHAENLSTQLLENRPPIKVLIISNFDHAISLTFVDMLSYYCNNRFTFDIWDELKTSPEILNQTDYDIIVSNFYISGITKKFICRNHLSIMNLVNHLNTLSNEIHLSNTL (SEQ ID NO:1531)
>orf02655
MIFKIGLFYLGQFVSLDMTVHKPIKKLQGWVVLSSLPFQSLDILTFFRSLLS(SEQ ID NO:1532)
>orf02657
MGFYLMVASMLLGLLALKIGFSQFKENKDKFLSILTSLAGLALVLVAVWLGWPK(SEQ ID NO:1533)
>orf02677
MLDSDIGCSRKNLLGLFWIRRRRNIHIVDRAMEKGISNRAPNKISLKACFFNFF(SEQ ID NO:1534)
>orf02696
MAFNQFNRCITLSIPTAPNIPTSVVHRTYLHDATVPNNVREKT (SEQ ID NO:1535)
>orf02698
MQQITEIIIAFATSFLTVAVGGIVKAVKDYLLRKGGEKAVIIAEILAKNAVHAVEQVASETGYKGEEKLEQARAKVRAELTKYNISMTDKDLDTFVESAVKQMNDAWKGR(SEQ ID NO:1536)
>orf02699
MKIEFFNFLRSVIQTEDGLVLYALALIVSMEIIDFVTGTIAAIINPDIEYKSKIGINGLLRKISGVLLLMILIPASVLLPEKTGFAFLYSICLGYIAFTFQSLIENYRKLKGNVTLFQPIVKVFQRLLEKDDDTKKGE (SEQ ID NO:1537)
>orf02700
MLKVTKTRQLVTEFFAQDGDQQKLVKTTVINTDNKAVSTISETLHDPDLYANNRISMRKHEQELREMRYKIEDAILAELEADSEHKE (SEQ ID NO:1538)
>orf02702
MTKFINSSGSLHLNIYIEQVSQDIANNSSRVSWKATVDRDGAYRTYTYGNISNLSVWLNGSSVHSSHPNFDTSGQEFTLASGEVTIPHSGDGTKTFAVWASFDPNNGVHGNITVSANYTLSSIPRSSSVSDNALSGNRRLGSPHTLTIDRKSSSFTHQVWYRVFGSNWIDLGKNHATGVSFVPNIDLARYNTKAKSGTMDICVRTYNGTTQVGNDVYSNGWYFEIPESVKPTFSGITLTDMNTVARQLLSGNNFLQIISDIQVNFNNPSGAYGSTITGYRAEIVNKNQVTTVNGGRLGMMNFNGSATIRASVVDSRGRQSDTRDITINVIEYFAPAFSFTAFRTRETPNIIQVVRNAKIAPITLSGSQKNVMTLSFKVARLGSTTFTADHGRASGIWTTQHTLNNSAANMAGNYVATKSFVVIGTLSDKFTSTEFTATVATESVVMSYDKDGRVGIGKVAEQGGAGSLDVLGDIYARNKPIQQYQLTDNNGCGKLIKQDFNTMKETGTWWINGSSQNNPFSGTWGMLEVFRPNPGSHERIQRFTTSTGYMAVRENGYDNNWRPWRYLVQQSKSTNNSDYVALLKSESTPTPWQNAILQNGWNHHRDYGGVQFSKTFDGVVCFKGTCKGGKIARESIILTLPEHFRPSTTLFKTALNNDYGSAVIGIYPNGNVVVKSNVDATWLNFDNVFFKI (SEQ ID NO:1539)
>orf02709
MLLTIHDANLQKVAFIDNEKQGTLNYYDDTWTRSLATGSSTFEFTVFKKAVKSDLPLAKAYHHLNEHAFVSFKYKGKSFVFNIIIVEENEQTIKCYCENLNLELINELANPYKSNKAMTFKEYCEAMDLLNYTHLSIGINEISDYKRTLEWEGQETKLARLLSLAKRFDAEIEFDTQLNADSTIKKFSVNVYHENDDNHQGVGRVRNDVIVKYGKNIHSITRKVDKTGIFNTIRPTGKMPTVEEELSGDKGSKSETVKNADGSTTKTTISTASDGTKSKTIVHTKVTKLADKTRITTTTTTRSDGSIEQTVTTSKKGGASTSETKVLKKPNPKEKTNTTEDVLTIEGLDEWEVKNEKGIVEFYQRGQALYAPISMQLYPSTFTHSTGELDQWTRKDFHFETDEPNELRRLGYLKLKKYCYPAITYEVDGFVDADIGDTVKVHDDGFAPLLMIQARVTDQKISFTNPVRNKTIFDNFKALENKLSADIQSAFERLFEAAKPYTIKLSTDNGVIFKNQIGQSLVTPTLYKGGKPVVVGVTWRWALDGEVTTGMTYLVRGSNVTDTVTLTVAAYIGNKEVAVDEISLVNVADGKLGTPGTPGRDGRTPYVHTAWANNATGTDGFSLDSSINKLYIGIYTDFEPNDSTDPKKYKWAKVKGEKGEKGDKGEPGQRGLDGLQGARGEQGLPGRNGADGRTQYTHIAYSNSADGTKDFSVSASDRAYIGMYVDFNRADSNTPSDYNWTLVKGSDGANGVAGKAGTDGRTPYLHIAYATSNNGSQGFSTTDSTNKTYIGTYTDYTQADSTDYRVYKWTLIKGADGTGISNVTNYYLATTVSTGITRTSAGWTTTPQPITSDKRYLWNYRVELYTNGTSKTTEPTVIGVHGEKGERGLQGLQGLQGARGEQGIPGPRGADGRTQYTHMAYADNATGGGFSQTNTDKAFVGVYIDFNPTDSRNPADYRWTRWKGRDGANGVAGRAGADGRTPYLHIAYATSNNGSQGFSTTDSTNKTYIGTYTDYTQADSTDPKKYKWAKVKGDKGEKGDKGERGLQGLQGLQGARGEQGIPGPRGADGRTQYTHMAYADNATGGGFSQTNTDKAFVGVYIDFNPTDSRNPADYRWTRWKGRDGANGVAGRAGADGRTPYLHIAYATSNNGSQGFSTTDSTNKTYIGTYTDYTQADSTDPKKYKWAKVKGDKGEKGDKGERGLQGLQGLQGARGEQGIPGPRGADGRTQYTHMAYADNATGGGFSQTNTDKAFVGVYIDFNPTDSRNPADYRWTRWKGRDGANGVAGRAGADGRTPYVHFAYSENADGSGLTMTDNGQRYFGHYSDYEKPDSSDKTKYKWADRWAKVDGGYVNIYALSKNRSIGKSYHVSEFNMDVLSGNITLKAIGSDPYIGAVSSHPGIFIKQQGMKIPVIQGRSICITITNPLFRKNYISFFNSLGKTVKTYKHYNTNKFLISSADLVGVEFIALRYGAGSSNIQIGTVLETKVKVEYGTVHSDWSPAPEDIESNINSKADQGLTQEQLNALNEKSQILEAEMKAKASMEAFSELEKAYNAFVKSNADSRKKSESDLVEAGRRIDLLTTQFGGLAELKTFIDTYMKSTNEGLIIGKNDASSTIKVSSDRISMFSAGKEVMYISQGVINIDNGIFTASIQIGRFRTEQYHLNKDVNVIRYIGG(SEQ ID NO:1540)
>orf02711
MTKIMTFNGVDMSKFFRITDIIRPIGNKRSVSTDNAPLLGVNIQQVKIGEKEHIIKFDIKTTNAIEMEQLKHDLAGILNVLEPVKITYGDEPDKYYMGLPVDEITPENLTRWFQRSELKIIIPDGVAHSTTLKNFDIDTNETSAPDRIVFNLTNTGTEPAYPIIRIKHNSENGYIGVVNNRAAFELGNREEADTEKYRDSETLIDYRGTNILKGFQNGTKGVAVTNDNKERLVGTLSTTSMWGRNHIELSNRGTVEKNRNNAQSLTWAIPVDSSGEVGSLNDYLLWRQVFMAAVANQYGFIKVTVSDTDGNFLYGVETYKRYQTLDCEYSFFTTDGKGGYKFIKWWYFTGTGAQVGKLDPFSAEKGWSELKRNDDRVQVFFDGSHYDFIIPEIKDKKSAKIHITLGALRDWPLVSHMYVDEFMYRKDFVTKSRDIPNRYPIGSNVVINSEDDSVYIDGISKVSEVVDGSHWPAIPPGKSQLELYFSRFVKKKPTVTIEFEERWI (SEQ ID NO:1541)
>orf02714
MADGKVTIVVDVDGNKVKVLNDELDKAAQKGDRGSDSLKKFAIGGAAFKLASKAVDLLTDSLGGAIQRFDTLESYPRVMQAMGHSTEDVTRSTKKLAAGIEGLPTTLNEVVGTAQRLTSITGDINKSTDLTLALNNAFLASGSSSADASRGLQQFSQMLSAGKVDMQSWKTLQETMPYALQKTADSFGFAGQSAQNDFYSALKEGRITFNQFSKKLVELNGGVGGFAELAKSNSKGIQTSFGNLKNAVVKGVANTIKALDDLTKAATGKTIAENFDALKVIINAAFGVIVNVIKASTPVFQTLFSILGTGASVISSLTPVIISLVSALVAMRAANEAITATKNLINSWQTFKTTATGAIQIINLMTAAQATCGSVTKAQLVANLANNGALTASNLLYGVLTGSISLQTAATIAATAATTAFKAALTALTGPIGLVVAGVGLAVGALVGLWQWLTAESEETKRLKSEQEELVKSTDQLTDSVKQSAKERQKNLESVKGNTESYQKLADEIVQLSQKTNKTAADKKNLKKKIDALNASVSGLNLVYDKNTDSLSHNNDQIKARISAMEAESTWETSQKNLLDIEQKRAEIGEQLKQIAEQRKKWNEESNVSDSVRKERLQELNDKETELKNTQTELQTEYEKTSQVQQAASEAMAAAAENGSNRQVISYEGMSKAQQKAVDDMRSKYNELLETTTNMFDQIQMKSAISVDEMIANLQKNQEAVNNWATNLNTLAERGVNEGILAKLQAMGPQGGLYVQELVNASDEKLATLNEVFTQGGESAMNGLTAGMDTGALGITDKIKGIVQSQVSSLQEEIAAADFPEKGKNIPEGVGDGIKAGAEIASEASKNMANDIKESFTSEMDINSPSRVFNEYGGFITTGLAEGVDKGTNQPVSSVTNLANQIKKPFDSLQSDFTYIGEMAMSGLNAGLWSGSGSVMATANSIAERVKATIKSALDIHSPSRAMRDEVGRFIPQGIAVGIEADAGVVEKSMLRLKESMMIDTRPEIALGLNKKLGAQVTVKQSSKQTIAEKIKVTMDKSSELLEKALDVAETAVRRPNEMYLNDGTLVARTGDKFAKYQSEQLRRDNRMKGVLS (SEQ ID NO:1542)
>orf02715
MSMKLNDALITNFSIADKEYDIDLSFNKVLDVFEILKEDEMTRLEQAQLIVHLLTGQELYDIKEVVDCWIYIKEHFLGIEKETVQYDLLGNPMPKAKGEEEQEKLIDFEQDAEYIYASFLQAYGINLLKVQNELTWTEFKALLNALPDNTIMQQIIEIRAWKPEYGGDKNKMRKLQAKYSLGKEGEDNG (SEQ ID NO:1543)
>orf02717
MTDIQIELKRTGFPVKIGEVELWFDTSQESLMCFYDMEEELKRRLVQYELDVVSANINNKIERDGVTKEVVAGAIELEKKQLEIQYDLIFGDGTFDKLYSIYPDYNALNNALEQTAIMLHDKLEEVAEQHKTVVKERASHYLNKGKVTPIKNNKKQKKNKKK (SEQ ID NO:1544)
>orf02718
MTRQKNALRGHFVAPYNGGTEPSTEDTWLELAKWISDVSDDTDEKTDDQAYYDGDGVEETTVVSVKGAYTFEGTYDPDDKAQALIAGMKYKTGDDRKLWHKVVSSDRKKQWVGAATATEIKAGSGAASDYEAFGCKLSYNSTPKETGIG (SEQ ID NO:1545)
>orf02719
MRENDFQNVLLKHIKTLNLPVEPRFDYFEDDKDDLVINQIPGGKVDREYMDGTQEVSLPFEIAVKAKKNSVANDTIWLVTSELAKIDLVLPSDNNSYEYMGMEVSRPAMKGKDEQGYYYYTIEIVAKIVIERNKQ (SEQ ID NO:1546)
>orf02720
MNIAIKVDLQKAKQKLSNESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVGQGDAVVYGTVYARAQFYGSNGIVTFRRYTTPGTGKRWDQVATSKHAEEWARAFVKGMGL(SEQ ID NO:1547)
>orf02721
MTYLTQEEFDELDFDEVTDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAMAFQIAYLDASGIMSADDKQLANSVSIGRTSISYSTSQSTLAGQRFNLSMDAENALRQAGFSLVVGVAYDR (SEQ ID NO:1548)
>orf02722
MALYKATKNLFFEQLNMDVIVDDIIELDEDYAKEVNKKLKNAFPDVKNVLELVDKNGTLEPEDAPSVDDASQATVED (SEQ ID NO:1549)
>orf02723
MPSNQNNAVRRYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFGDNSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDNATKTEALADFTDDKVKALFNKLSAFYTNNEVTAPITVYLRSEFYNAIVDMASVTSAKGATISLDENGLPKYKGFTLEETPAQYFETGVIAIFSPNGIIIPFVGISTARVIEAENFDGVKLQAAAKGGTYTLDDNKKAIYKVTGTIV(SEQ ID NO:1550)
>orf02724
MAFTTEELLNLGLTEEQAKSVFALRGKELNEDKSALETIKQERDSLKSQLQKAEEQVEHLKSLENISAEQKDAIDKLQAEYDKYKNEAAAELAQTKKVSAISLALKDTNAFNPDKLMKFIDVDAIQLDDNGKPQIDEVINGLKESDPYLFKAEESKPSPNILPQGNPAGEGTSDVDPFQAIIDGYGK (SEQ ID NO:1551)
>orf02727
MKKKRKQITFNDQQFPLQMQGVGDIYEKLQIDIFDRMIKRLKERGSIDLMRNPYIWQLEKLNDMHMLNEQNLKLISERTGIAERLLRDVIENEGLKVYKDTKQQLEEDLNKIPEGEISNGVTDSLEAYSRQAVSDLNLINTTLPKSLQVAYKSIVEETVAQVVAGTKTSDVALHDTIMKWQKNAFTGFVDKGGRHWKADSYARAIIKSTTYKVYNEMRTRPAEELGVDTFYYSMKAMARPACSPLQGQIVTKGTGREIDGITIYSLLDYGYGTAAGCLGIHCGHYLTPFIVGVHELPNLPDYLKNLTPEQAEENARIEAGQRGLERLIKTHKERLHYAHTLQDDKMIQAERLKVRGYQTKIRNLINQHDFLTRDYRREKLYIS (SEQ ID NO:1552)
>orf02728
MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG(SEQ ID NO:1553)
>orf02730
MTFNVQKNINPHFKSVWISSLPYNVLKGGRNSFKSSVIVLKLAYMMIRYIIAGEAANIVVIRKVANTIRDSVFNKVWWALNLFGIAEQFTKTVSPFKIVHKTTGSTFYFYGQDDFQKLKSNDIGNIIAVWYEEAAEFNDQEDFDQSNVTFMRQKHPRAKFVQFFWSYNPPRNPYSWINEWFESIKTNKNYLAHSSTYLDDELGFVTEQMLEDIERIKENDYDYYRYLYLGEAVGLGNNVYNMSMFHAIDALPSDDKLIGISFALDGGHQQSATACCAFGITAKGKVILLDTWYYSPAGQVVKKAPSQLSKEIYAYMRSVIEKYRVQALQYTIDSAEGALRNQMFLDFGLKWHPVAKLRKVTMIDSFQSLLAQGRFYYLNTENNKIFIEEHKMYRWDEKTIKSDNPSVIKEDDHTCDTTQYFVLDNAKLLGLRVGNV (SEQ ID NO:1554)
>orf02731
LPRDGTKNLKPVTERTKDEARAISSKGGKASGIARRKKADLKKAFETLLSLDVTDSKIKKQLEEMGMAGNNEALLAFATFQQAVKGNQKATENIIKLTNTKDKYDIQEQKERIKALKYENRERAEAEKGSSETIEIVDAWAEDVRGATDDL (SEQ ID NO:1555)
>orf02732
VAQKLTKLKDIFKHVSSIDLGKEILFEDLELYNKETETSKQYQSIEEAENDLYLMEKVNKINFTLGGGRGANFEKGKDGKYPGFRGAGGARDSGSSKALHPASLNNQGRFSSVEGAIQGFIKKHGGSRTEYSTAVDSQGFAHNYVHGGKNSVQILPISGGFTAIHNHPNGSNFSSTDLHSFAALKGMNTLVATNSSKAYRITKGANFDAKGFDKAVSKSRFTTKDYNKGADLWLKKNAKKYGYTYSYE (SEQ ID NO:1556)
>orf02735
MLQIEYVDIKSIKPYHKNARHNDGEATEKVAASIKAFGFQQPILVDDNNIIITGHTRLKAALSLGIDTIPIAHAVNLTDEQIKAYRLADNRVAEYSTWDSELLNIELSQFETIDMAQFGFELSVTGFNFGNEEEQQEETENEEEDAEDFHRDTTINQYNLFNYDDTRVEGFYNMPKIEGVDHIPKDFQGFNYVLNKPDYSSCVHFFLDDYQFERIWQRPDFYIEKLLEFDSALAPDFSLYLDMPIAMQVWNIYRSRLIGQIMQDYGLTVIPTVSWASEESFDFCFDGLPKNSTLAISTIGVKQNKEQFEVWKNGVTEMIKRLTPKRIVVYGGKVEYDYKDIEVVYFENATTERMKESGTKTN (SEQ ID NO:1557)
>orf02736
MNVIGACQKILFYSPTQAYVLLNAWFNDYFRATYTELLENAILDK(SEQ ID NO:1558)
>orf02738
MNPEIIDNINKPSHYQGANGLEAIDVVHNFVGSLSGASAFFWGNAIKYMLRFQKKNGLEDLKKARKNLDWLIEEMDKEGR (SEQ ID NO:1559)
>orf02739
MFFQDSEIEEFELNDTLRNDYITAYPDEIELMQSTGLKDKNGKEIFEGDIVRTTRFLGRADEIGGFYEYEKDYVGVVKVLEGSWVIDTGSVAVRLWSEIDESEVLGNIYENLEFLEVNE(SEQ ID NO:1560)
>orf02740
MEDEQNILETQLILGKQVLEIVLDLLKDDSKIGVVLPLNINDREFTITVEKEVTDRD(SEQ ID NO:1561)
>orf02741
MTQTLEEGMKNQSKCIKIPMEIRPFDVGYRIVNKHGQALALKNGASIFALPSLAEKAIKKEFGKNDPDFDIEKHFVEEVAIVNLSKFHSYFEEVE (SEQ ID NO:1562)
>orf02742
MNIKALIKKYEELWNEHSPFYEPVPYTSMVELFLKELKQLDEPEKVKVPRFVAEWIEEARKACKDVVELFEFDFTNDEVRKWFMQERPFDLVARAWLDGYEVEEEKRYLVTLKNRQPLVKSQSGSTLYFSQDITARNYKGTQKELEDANFGWVFDCEGIEIEEVE(SEQ ID NO:1563)
>orf02743
MMEELKQKVNEVYNWTVEDGKPQPPKQDLPQAVKERVDYFWEMAEDGMTFMGAMECIFADEKPTDYDLGATKDWLPKSKEFDDWIGYAPSMAQVVIAVYLIYRGN(SEQ ID NO:1564)
>orf02744
VIEVNIKFDNFEAHGFYQDDTKLGKIRDAIISQMNNGHVVVLGEDRGILLNPKVIKSVQFKVVEDNQI (SEQ ID NO:1565)
>orf02745
LYPTVKAIIDGMTDAGIWTDDNHKVIKKLSFVYGGLSEEKGHYRLEFDIEEV(SEQ ID NO:1566)
>orf02746
MTTENLKSALEYAVELNEHGLEILTAADGTEYYDANKFNLKELDPKRYPKTLELSTLTSLVDYLKTDLNNLKNQRLIVAVEKNDEVCVWSENDEIEHRTLLVDVKARIPELSFGRFLSLEQFNIMLQSNFIDDNDRGTLLEFASALKIENGAEIEDNGVSQVATVKTGVASLAKGKAPNPVTLRPYRTFSEVEQPASLFVFRIDKQANMALFEADGKRWVADAVGNIASYLKEQLADQKHITVLA(SEQ ID NO:1567)
>orf02747
MDKKLIGLDLTHIADGGLQEKLDKELEKVFDNILDLNTDAKAKRKVTITLTMSANEERTVVDTTMEVKSKFAPQNGVATTILIGRDFDTGQVHANELKSTVPGQMYFDENGEILTDIGQPVAEIEQQAETKSDIIDFNKKKVGN (SEQ ID NO:1568)
>orf02749
MESAGHECIGFCEIDKFARASYKAIHDTKGEIELHDITTVSDDTIRGIGHVDVICGGFPCQAFSIAGNRRGFEDTRGTLFFEIARFASILKPRYLFLENVKGLLNHDRGNTFEVILSALDELGYDVEWQVLNSKNFGVPQNRERVFIIGHLRGGSGRKVFPLSGDGAAITCEQPKINKVGNTRKKGKSQSGDVVSIDSLAPTLCSTTTQKDPLKVLIENEIKQFGVLQPNYNQSGVVYEIDGISPTIRAYQGGNLEPKIRVKEATKQGYQEAEIGDSVNLSHPNSKTRRGRVGKQIANTLLTGESQGVVEPDFRIRKLTPRECWRLQGFPDWAFDKAQEVNSNSQLYKQAGNSVTVNVISAIAQGLGGN (SEQ ID NO:1569)
>orf02751
MINNVVLVGRLTRDAELRYTQSNIAVATFTLAVNRPFKNEAGEREADFINCVIWRQLAENLANWAKKGSLIGVTGVIQTRSYDNQQGQRVYVTEVVASNFQLLESRNSQQNNQGHQDHHGGYQQQGYSNQGSSFQNGNSYGQQGSFVEGNTTNLVPDFTRDNNPFGRPTNPLDISDDDLPF (SEQ ID NO:1570)
>orf02752
MRCFYVSGKIADLDLGSEINAENSFMAAIEFVKRYTDLLKFGSNEIKVSEVEEVQNDK(SEQ ID NO:1571)
>orf02754
MVKDVTNSLTEIKVDFQPAVINVDREAIEAQVAAAIAQYSGREVTVDNYKEVYEERTRFNKLIGGLDTQRKDFNRQINEPAKDFDKWVKEKVIKPIEAVTDAMSAGLNAIDEHERLMRVDVVRATFEDKCMVAGIEKSTFADKYDEYSLKKYFKTGKYELKKTTLDEMDGLVLSEFDALEEYKANKQAIQEQAQEYDLPADSYIRHLEDGKSLVDILKMMKTDRDAEIARKEQKEIQEKAKAERLEEIAQSAKKNANANIKAYDAETGEILEQGTITPEPQNNAREVAKFEPSEPLVKLVRLELHGGLEQWENTQEYFEDNFIGFETLED (SEQ ID NO:1572)
>orf02756
MADLTFAELQRKMQIEKQTKQGVKYPFRTAEDINNKFKSLDSGWSVSFPEDDIIQKGDKLYYKAVAVVKRESDGTIEKAIGWAREEDVPIFHTQKGDVKQMQDPQWTGAVGSYARKYALQGLFAIGGEDVDEYPVEESQEQGQNNQQQKPNNQQAQGQNQVRYIDNTQYQEINDLINDIAKIKGMPFDTLANYVLSEKLKGLQDFHRVQVGDYEVLKNYLTEQLAKAKAKAKRGN(SEQ ID NO:1573)
>orf02757
MPNWAEGTLKLRGRRENVASALKEMLLGNKGATLEEEYDGTLLI FKNEYDYFYINGTRRAFISSKDIEIWLDDDFVIIELEDFEQAWAALADNYTEISSKFDVDIKIFTFEMGMEFTQEIEISKGEIIKNIVNENFTNYSWDVPFSRLGG (SEQ ID NO:1574)
>orf02758
MKKIATAMNVSVSDLFTQDTPIKKNRHSTPVNPKIYKEFIDNVNQYQRLTGATYEKISNIIGKSNSYIYDVIDKQRKSTLSIKSNASLTKGSMILRQEIEKIESGKNRIPSKLTYQTIDRDSEVVFRFNGLIKSVNELSDKELQIIVSIFDALKIPAKISKIEIRETNVFGGGK(SEQ ID NO:1575)
>orf02760
MKKLPSQQKYLRNDGQLVTIKGFDAYLQYRGSQSWKKEMAKTVKMTR(SEQ ID NO:1576)
>orf02761
MPDITNGREKVNDFLKDKGIKKTSLAIAYGFKRQEVTNILSGTTKGPRANSFILQVIEDYGIE (SEQ ID NO:1577)
>orf02762
MFETFEKIKSLAKKQGISLNTLEDRVGLGKNYIYSLKNKKTPSAEHISKIADYFNVSTDYLLGRTDNPTIANKKEQFFFEGKEVDVEELASTAMRFNGKPLTEEDKKAIQNIIEIYLRKQ(SEQ ID NO:1578)
>orf02763
MTEKEFSQNLGIDIEIFEDGLFPDEAFYIPALKTMFLSDAISDEKRVQVALHEIGHRNHAPDTYQLFREKCELEANRNMIHHLMKAELDIAEDATTFNYLVFMEKYNLKTIADEIMVKEEYLALLN (SEQ ID NO:1579)
>orf02764
MNIIAIIIIVIFVGGVIGAVIDNQKKSPEQRERELETFRANQEKKKQEKKQNIITCPNCKSKDVTFLQQDKKAFSVGKAVGGAVLTGGVGALAGFAGKKGNKQWHCQNCGNFFETK(SEQ ID NO:1580)
>orf02765
MWMEELSNGKYKFFERYKDPYTEKLKKVSVTMEKKTPQARNQAAILLQEKINKKLSTKQVESITFEEIYNLFYKSWAQTVKESTKHNCKSVDKKMKEVIPSDTILANLDRRFLQEAIEKIIESNGYITAKKVRHRLRGIFNYAVQYSYIENNEVDYTTIPQKPKTLEELEKKRNNFLTMQEIKALVDVLNRREYHQKYADMVLVLTLTGMRYGELTALQLKNIDFENNKIEITGNFDSVNKIKTLPKTTNSIRTIKVSESVIEAIQRQIVRLSERFQPLSSDDYIFCFEKWNQPTTIACFIQILKKYGKQAKIEKNLSSHIFRHSHISFLAESGLPIKSIMDRVGHSNAKMTLEIYSHTTEDMEDKLVNKLDTIF (SEQ ID NO:1581)
>orf02777
LHPFTRNITCDRYILALFSNLVNFIHVDNPTFCTLNVKVSSLQEFEEDIFHILTYITSLRQSCRIRNRKRYIQALSQGLSKESFP (SEQ ID NO:1582)
>orf02778
MIINRHCQGTLGTILTDYIVVQDMEEFDWFWYLRQVCQDFLNQFFSNNFLS(SEQ ID NO:1583)
>orf02786
LKTKIGLASICLLGLATSHVAANETEVAKTSQDTTTASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYKAWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYYFKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVGGKWYYFKKWGYMARNEWQGNYYLTGSGAMATDEVIMDGARYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKIIDISEHNGRINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYLYTYAENETDAENDAKQTIELIKKYNMNLSYPIYYDVENWEYVNKSKRAPSDTDTWVKIINKYMDTMKQAGYQNVYVYSYRSLLQTRLKHPDILKHVNWVAAYTNALEWENPYYSGEKGWQYTSSEYMKGIQGRVDVSVWY (SEQ ID NO:1584)
>orf02791
MHKNFVVVVTDFFTAVQFIQFNKEGTTCHNTTKFFNHLDSCLNSSTCRQKVIYNKNTLTWLNGIRVHSQGIDTVLFFIVSRNNFAWQFTWLTNRRKTNSQLKGNWTTHDKSTSFRSHDHVDFLVSSILNDFTNSVAISISISHQRTNITEGNAFLWIIFNCCNVIF(SEQ ID NO:1585)
>orf02795
MEIKEQTRKLAVSYSKYSFEVADKTDEVSNHTYGKATLTWFEEIFEEYKEHHNIDV(SEQ ID NO:1586)
>orf02801
LHKTLENIGEFEEDNLYYSSMTKAETRISFPIFSLILHYI (SEQ ID NO:1587)
>orf02803
MLNRQVCFCFVNHISPLNVVIWENLSLEELLYAICICFITHKIAKQTSLTIDNAGIAMNNIR (SEQ ID NO:1588)
>orf02808
LNSRFFYTDFFKGRQAKGCSFSCTSLSLTDNILAFKGQRNSLFLDRTSFYKTSFFNFC(SEQ ID NO:1589)
>orf02821
MRFLADQDRIQHHRYSWALFDKVQGLLSHTDSREKTNLNSPKFHIT (SEQ ID NO:1590)
>orf02822
MLKNGIISWKDFKSFFCQGCQTSHCYKPMQAVQGIGSQIS (SEQ ID NO:1591)
>orf02825
VTAHRIFGTSSIHNKLIGLAMFGITAMKIICHKLNRNHINIFRRLGIQGKTEFLLIHLIRQVKMNDLSQGMNPTICPTSTVNSNGLPFI (SEQ ID NO:1592)
>orf02829
MLARSKNCFMKSLSIFLLIFYFFDSYQISKKRRSLIGL (SEQ ID NO:1593)
>orf02840
LEVCIHHHHQISCRILQACIKGCFFAKISRERNIMDCRILLPIGL(SEQ ID NO:1594)
>orf02847
VDRTDEVSSKHCFEVVDRTDEVSNHTHGKATLTWFELDFRRV (SEQ ID NO:1595)
>orf02893
MIAEFIDGLQKFHFLQNALITAIVVGIVAGAVGCFIILRGMSLMGDAISHAVLPGVALSFILGLDFFIGAIVFGLLAAIIITYIKGNSIIKSDTAIGITFSSFLALGIILIGVAKSSTDLFHILFGNILAVQDTDMFITMGVGAAILLLIWI FFKQLLITSFDELLAKAMGMPVNFYHYLLMVLLTLVSVTAMQSVGTILIVAMLITPAATAYLYANSLKSMIFLSSTFGATASVLGLFIGYSFNVAAGSSIVLTAASFFLISFFIAPKQRYLKLKNKHLLK (SEQ ID NO:1596)
>orf02913
MYEEPEVAPVHPTGPTPATETVDSAPGFEAPQESVTIL (SEQ ID NO:1597)
>orf02945
MGNNGQFTFGYRHDFFQNQLAIFNALVDTFTRRTIDIKTLNTFINEVLNQGTRTLWTYFSLLIITCVEGWNDTFVFFQI (SEQ ID NO:1598)
>orf02948
LTKIFGWILRIAVLAADVYGNFANNIAVAWDAHDKIPNNGRINF(SEQ ID NO:1599)
>orf02974
LSTRNKYCKNLIIFESTFNILDIVKKDLKLNSKLEKDLKY (SEQ ID NO:1600)
>orf02976
MSYFRNRDIDIERISMNRSVQERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGESSTLTDTEKSQPSSETELSGNKQEQERKDKQEEKIPRDYYARDLENVETVIEKEDVETNASNGQRVDLSSELDKLKKLENATVHMEFKPDAKAPAFYNLFSVSSATKKDEYFTMAVYNNTATLEGRGSDGKQFYGNYNDAPLKVKPGQWNSVTFTVEKPTPELPKGRVRLYVNGVLSRTSLKSGNFIKDMPDVTHVQIGATKRANNTVWGSNLQIRNLTVYNRALTPEEVQKRSQLFKRSDLEKKLPEGAVLTEKTDIFESGRNGKPNKDGIKSYRIPALLKTDKGTLIAGADERRLHSSDWGDIGMVIRRSEDNGKTWGDRVTITNLRDNPKAFDPSIGSPVNIDMVLVQDPETKRIFSIYDMFPEGKGIFGMSSQKEEAYKKIDGKTYQILYREGEKGAYTIRENGTVYTPDGKATDYRVVVDPVKPAYSDKGDLYKGNQLLGNIYFTTNKTSPFRIAKDSYLWMSYSDDDGKTWSAPQDITPMVKADWMKFLGVGPGTGIVLRNGPHKGRILIPVYTTNNVSHLNGSQSSRVIYSDDHGKTWHAGEAVNDNRQVDGQKIHSSTMNNKRAQNTESTVVQLNNGDVKLFMRGLTGDLQVATSKDGGVTWEKDIKRYPQVKDVYVQMSAIHTMHNGKEYIILSNAGGPNRENGMVHLARVEENGELTWLKHNPIQKGEFAYNSLQELGNGEYGILYEHTEKGQNAYTLSFRKFNWEFLSKDLISPTEAKVKRTREMGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDIGQEIIGIAKGSIESMHNLPVNLAGARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKEDYTYKAPLAQQALPETGNKESDLLASLGLTAFFLGLFTLGKKREQ (SEQ ID NO:1601)
>orf02978
VDKTDKVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSSKHRFEVADRTDEVSNIYTARRS(SEQ ID NO:1602)
>orf02989
MSCNCAFYRSQFFDVNSVSNYHSHQKELRFPNSILFTYFVKVT (SEQ ID NO:1603)
>orf03007
MTVKHRIVNSNGEARSGESLTTCGTRSTHGNWATVGYVYAADMARQNWWDLSAAISGSWSPN (SEQ ID NO:1604)
>orf03009
MRWDYGQIFKEIRKSKGLTQQDVCGQVIHRTTLTNIEHGKVIPSFENMVFLLEQIDMSLAEFKYICNEYHPSKRRDIIVESQNPSTFQDTRKMVELTEKCQKYLKTHHDVPIQNIYRHTKIVTELRTKGFKNNHVLKDLYEEIWDYLEPMDTWYISDLKLLGTILFFFPSENLPLLIDRIMKTIEKYKYFRETKAFLSSFLANLSTVYFQHHLFKECETITLQLLVLAEELKIYDILGFSQVRLGILQHNSDLIDKGITLLRLTKEEALVKILEKEINDFSNL(SEQ ID NO:1605)
>orf03014
VGLIKLTSYVFVCISNSFLTRHDKNDNICFFHGNFCLVLDLFHERSIDIINSSCINHAKRTIEPLTRCINTVTCHSFDIFYNGDSLTSDPIK (SEQ ID NO:1606)
>orf03016
LSSKSCIDRTNQETFHTLGLEGVGMKSGSLFCSVQISDKEKENSRLANGFLRYQFIQGIFLLLTSYHNHRVGLEILPR (SEQ ID NO:1607)
>orf03049
MTESYTWVEADRATLSRYRHGQGHLTDQFFSFKVQRPAAKTLIASISTGKGMGPSFDGTPVITSGNQNRINTIKNSFIMSSSSVRISLRKLTSQRNFLRNLSSLILLAAQVAKGDATACSHQRISRVVGQDSHETLSLTEFF (SEQ ID NO:1608)
>orf03050
MNINNEKVWFAFYLLDMQITRPTPTFNDRRIGLIGKLQELRFLAGNLLLR(SEQ ID NO:1609)
>orf03051
LIKGYLPNHLALMDLCSKTTCTLDDFAGIAGRRNHRGFFCHIGNGVFLTVDKYLRNQRIRQRKSSHHILTQLVCHSHTHLFILLQTSLSLRTKERLSF (SEQ ID NO:1610)
>orf03057
LIKLTDRNFSDILIKCLIKCFTNLLSNQLMLLPSTLKL (SEQ ID NO:1611)
>orf03060
LFKGGVTISRTPLSSEDTVMIDATEVQINCPKKTISE (SEQ ID NO:1612)
>orf03062
MIQSENHCSASHSNRDYQSQHDNQGRTCQCFIIVPCHKKGSCSVGEITWNQRCQNGQDKDHSRCLIKNT (SEQ ID NO:1613)
>orf03069
MIARQLMVFFSTNQADTRITNMSIDSLIINNSKDFQSSSHASVSFILTKLVNLLIFNF(SEQ ID NO:1614)
>orf03070
MGEPFTHFIDCIDLGINPSYTQVCDRHFTSDIPCTMTSHPIS (SEQ ID NO:1615)
>orf03077
LSSDSHFIGIKAFVILILGKSNSIVLRIVGLYQDLTCFFSPTCSTCHLSQELEGSLRRTEIRQIQGRIRI (SEQ ID NO:1616)
>orf03078
MAVHSLGIHMQGQRNIAVGTSIHRPTLPTHDKARITTAIEHENHLLFFNQTVLDSL(SEQ ID NO:1617)
>orf03079
MVTGIAVLLISHFMLFINNHDTQIFQRSKDSRSGTNNNLGIATLHLAPFIILFTIG(SEQ ID NO:1618)
>orf03080
VKNGYLVPKTCYKTLGHLRSQGNLRYQQNSCLALIQGTLDNLQVNLGLPTSCNPLK(SEQ ID NO:1619)
>orf03081
MVNLIPRLGLDLLLIDCLIFQTKQAFSSQTHHFSLLGKV (SEQ ID NO:1620)
>orf03082
LGLQTKNNPLNQAIPLTKRHMNPHPNFQHSLKFLRNPVTIGLVRLHQGHIYDNLS(SEQ ID NO:1621)
>orf03085
LGNHFCTICSTTYQAILQFIQIWWCQEDKDSIWNLFLDLKSTLNFNFKENIDSLVQGFIDIGQRSSIVVADIFCVFQHLSLTNQLFKFFTSTEEIVNTVHFSRTLCACRHRYRILKLVFRTLKNLSSNRSFSNP (SEQ ID NO:1622)
>orf03092
MKFKNYLFDLYPYFPKGFSLDNREDPDVCSKALYDDLCKMFFDDDSKEKLKITSVCNKCQNYGNRDYYTLFIDKDKYLLSSDYIGASIYWAQEAGLNDRIILDHLSISRTIGGHILFPRGGKLETVNQARGGEKGYYDRFDLTLYAIKEWFVENKNTKIGYAIENYHEWFELFSGDDNCKNGFENFVEFFKLEGFIYEQNKIIDLIKSDLENNQVVFLDKEDILIASTEEEYIRYMKNLNIIILERTKKILL (SEQ ID NO:1623)
>orf03093
MELSIQLIHDLNTHTTHSTAKMLHNVKAIKNDFSIRE (SEQ ID NO:1624)
>orf03096
LKAKLFSQVIVRAKHIHSNNFDATSDSSIVAKKVISNDSLFSSLKNSDDIEIVKILRNEAHLSLCKSILIPRHNLRKTRKIMFKVKIKEQTRKLAAGCT (SEQ ID NO:1625)
>orf03113
VGCSYICHELVANHDHFLFVIVEFLHGTVNTKCEGLQGPVNVINPKFLNCSLNAFFGVI(SEQ ID NO:1626)
>orf03114
LLHLWRSIRVVPSNEGIIQIDQNSLDSLRLQAWDCQIIDCFHSKIWYIIFNRHSGSFS(SEQ ID NO:1627)
>orf03118
VKGLLLATKLCRTNSHTDNLTRYSNRSICQNDLISHIQLTFKEDEKAIDDIRQKALGSHTNRYPSNTSSSQQTRNWQT (SEQ ID NO:1628)
>orf03120
LWGILGLTLPNLSGIGLLGDLFVGGLKAVAPILVFALVANALSQHQKGQDSNMKTVVFLYIL (SEQ ID NO:1629)
>orf03121
MIGTFAAALVAVLASFIVPIEITLNSANTEIAPPDGIGQVLSNLLLKLVDSPVNALLTANYIGILSWAVIFGIAMREASKNSKELLKTIADVTSKIVEWIINLTPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVWNCLRVSGVTAFFTRSSAANIPVNMKLCHDLGLNPDTYSVSIPLGSTINMAGVAITINLLTLVTVNTLGIPVDFATAFVLSVVAAISACGASGIAGGSLLLIPVACSLFGISNDIAIQIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKK (SEQ ID NO:1630)
>orf03124
MKIKEQTRKLAAGCSKHGFEVVDRTDEVSSKHRFEVVDRTDEVSSKHRFEVVDRTDEVSSKHCFEVVDRTDEVSSKHCFEVVDRTDEVSSKHGFEVVDRTDEVSSKHGFEVVDRTDEVSSKHSFEVVDRTDEVSSKHGFEVVDRTDEVSSKHGFEVVDRTDEVSSKHGFEVVDRTDEVSSKHSFEVVDRTDEVSNIYTAR (SEQ ID NO:1631)
>orf03145
MKDLISVIVPVYNVEPFISSCLDSLSKQIYQNFEVLLVNDGSTDNSGAICREYADRDSRFHYFEKENAGVADARNFGIERSKGDYITFVDSDDWVTEEYLSILIETLKEQHSEIVVSTYSTYNESDGLFYIHVFDSDYYVKNYNSKLLMEELPLLERYDMSFLTSWGILFKRELFQEVQFPFGRVCEYIGTNYKLFMQVEKVTYINKVLYWYRVGKEGLSNSYSPKMMRDDCDFRLERIAVLALKGYDVSKYLDQMKFYLKYRHDIAIQRELKENVETRHLEMLDYLLNGNKYN(SEQ ID NO:1632)
>orf03148
LNVRGGAYITFVDSDDWLEHDALDRLYGALKKENADISIGRYNSYDETRYVYMTYVTDPDDSLEVIEGKAIMDREGVEEVRNGNWTVAVLKLFKRELLQDLPFPIGKIAEDTYWTWKVLLRASRIVYLNRCVYWYRVGLSDTLSNTWSEKRMYDEIGAREEKIAILASSDYDLTNHILIYKNRLQRVIAKLEEQNMQFTEIYRRMMEKLSLLP (SEQ ID NO:1633)
>orf03167
VLAKAEAEALVDADSDADVLADTEAEALVDAEAEALVEADAEALVLAEAEALVDAEADALVEAEAEALVDADADALVDADSEALVDADSDALVLAEAEALVDADSEALVLADSDALVLAEAEALVDAEADALVDAEADALVLAEAEALVDADSEALVDAETEALVDAEAEALVDAEADALVDADSDALVDADSDAEVLAEADALVDAETEALVEADSDAEVLAEADALVLAEAEALVDAEADALVDAETEALVEADSDAEVLAEADALVEADSEAEVLAEAEALVDADSDAEVLAEADALVDADSEALVDAEAEALVDAEADALVLAEAEALVDAEAEALVDAEAEALVLAEADALVDAEADALVDAEAEALVDAEAEALVDADSDALVDADSDAEVLAEADALVDADSEALVDAEAEALVDAEADALVLAEADVLALVDADSEADVLAEADALVDAEADALVDAEAEADVDADSDAEVLAEADALVEAEALVLAEAEALVDAETDALVDAEAEALVDAEADALVLAEADVLALVDADSEADVLAEADALVDAEADALVDAEADALVDAEADALVLAEADALVLAEADALVLAEADALVLAEADALVDADSEADVLAEADALVDAEADALVDAEAEALVLAEAEALVLAEADALVDAEADALVLAEADALVDAEAEALVLAEAEALVLAEADALVDAEADALVLADSDALVDAEADALVDAEADALVLAEADALVDADSEALVDAEADALVDAEAEALVLAEAEALVLAEAEALVDAEADALVDADSDALVDAEAEALVLAEALVLAEADALVDAEADALVLAEADALVDAEADALVLAEAEADVDADSEADVLAEAEALVDAEAEALVLAEAEALVNAEADVLAEADALVDADSEALVLAEADALVLAEAEALVDAEAEALVDAEADALVLAEADALVLAEAEADVDADSDALVLAEAEALVDAETEALVDAEAEALVLAEAEALVLAEAEALVDAEADALVLAEADALVDADSEALVDAETEALVDAEAEALVLAEAEALVLAEADALVDAEADALVDAEAEALVDAEADALVLAEADALVDAEAEALVDAEADALVLAEAEALVDADSDALVDAEADALVDAEAEALVDADSDADVLADTEAEALVDAEADALVLVDADVLALVDADVLADVLALVDADVLAEAEALVLAEAEALVDAEAEALVDADSDAEVLAEADALVLAEADALVDAEAEALVDAEAEALVDAEAEALVDADSDADVLAEADALVDAEADALVLAEADALVLAEADALVDADSEALVDAEAEALVDAEAEALVDAEAEALVDAEAEALVDAEAEADVDAEAEALVDADAEALVLAEADALVDADSDADVLAEAEALVDAEADALIDADSEADVLAEAEALVDAEADALVDAEAEALVLADAEALVDAEADALVDAEADALVDAEADALVDAEAEALVDAEADALVLAEAEALVDAEADALVDADSDAEVLVLAEAEALVDAEADALVDADSDAEILAEADALVDAEAEALVLADSDALVNAEADVLAEADALVDADSEALVLAEADALVLAEAEALVDAEAEALVDAEADALVLAEADALVLAEAEADVDADSDALVLAEAEALVDAETEALVDAEAEALVLAEAEALVLAEAEALVDAEADALVLAEADALVDADSEALVDAETEALVDAEAEALVLAEADALVDAEAEALVDADSDADVLADTEAEALVDAEADALVLVDADVLALVDADVLADVLALVDADVLAEAEALVLAEAEALVDAEAEALVDADSDAEVLAEADALVLAEADALVDAEAEALVDAEAEALVDAEAEALVDADSDADVLAEADALVDAEADALVLAEADALVLAEADALVDADSEALVDAEAEALVDAEAEALVDAEAEALVDAEAEALVDAEAEADVDAEAEALVDADAEALVLAEADALVDADSDADVLAEAEALVDAEADALIDADSEADVLAEAEALVDAEADALVDAEAEALVLADAEALVDAEADALVDAEADALVDAEADALVDAEAEALVDAEADALVLAEAEALVDAEADALVDADSDAEVLVLAEAEALVDAEADALVDADSDAEILAEADALVDAEAEALVLADSDALVNAEADVLAEADALVDADSEALVLAEADALVLAEAEALVDAEAEALVDAEADALVLAEADALVLAEAEADVDADSDALVLAEAEALVDAEADALVLAEADALVDADSEALVDAETEALVDAEAEALVLAEAEALVLAEADALVDAEADALVDAEAEALVDAEADALVLAEADALVDAEAEALVDAEADALVLAEAEALVDADSDALVDAEADALVDAEAEALVDADSDADVLADTEAEALVDAEADALVLVDADVLALVDADVLADVLALVDADVLAEAEALVLAEAEALVDAEAEALVDADSDAEVLAEADALVLAEADALVDAEAEALVDAEAEALVDAEAEALVDADSDADVLAEADALVDAEADALVLAEADALVLAEADALVDADSEALVDAEAEALVDAEAEALVDAEAEALVDAEAEALVDAEAEADVDAEAEALVDADAEALVLAEADALVDADSDADVLAEAEALVDAEADALIDADSEADVLAEAEALVDAEADALVDAEAEALVLADAEALVDAEADALVDAEADALVDAEADALVDAEAEALVDAEADALVLAEAEALVDAEADALVDADSDAEVLVLAEAEALVDAEADALVDADSDAEILAEADALVDAEAEALVLADSDALVDADSEALVDAETEALVDAEADALVLAEAEALVDAEAEALVLAEAEALVLAEADALVDAEAEALVDAEADALVLAEAEALVDADSDALVDAEADALVDAEAEALVDADSDADVLADTEAEALVDAEADALVLVDADVLALVDADVLADVLALVDADVLAEAEALVLAEAEALVDAEAEALVDADSDAEVLAEADALVLAEADALVDAEAEALVDAEAEALVDAEAEALVDADSDADVLAEADALVDAEADALVLAEADALVLAEADALVDADSEALVDAEAEALVDAEAEALVDAEAEALVDAEAEALVDAEAEADVDAEAEALVDADAEALVLAEADALVDADSDADVLAEAEALVDAEADALIDADSEADVLAEAEALVDAEADALVDAEAEALVLADAEALVDAEADALVDAEADALVDAEADALVDAEAEALVDAEADALVLAEAEALVDAEADALVDADSDAEVLVLAEAEALVDAEADALVDADSDAEILAEADALVDAEAEALVLADSDALVDADSEALVDAETEALVDAEADALVLAEAEALVDAEADALVDAEAEALVDADSDAEILAEADALVDAEAEALVDADSDAEILAEAEALVLAEVDALVEADSDADVLALVDADVLALVDADVLADVLVLVDADVLAEAEALVLAEADALVDAEAEALVDADSDAEVLAEADADALVDAEAEALVLADAEALVDAEAEALVDAEADALVDAEADALVDAEAEALVLADAEALVDAEAEALVLAEADALVDADSDALVEADSDAEVLAEAEALVDAEADALVDAEAEALVLAEAEADVDADSEADVLAEADALVDAEAEALVLAEADALVDAEADALVDAEADALVDAEAEALVLAEAEALVLAEADALVDAEADALVLADSDALVDAEADALVDAEADALVLAEADALVDADSEALVDAEADVLVDAEAEALVLAEAEALVLAEAEALVDAEADALVDADSDALVDAEAEALVLAEALVLAEADALVDAEADALVLAEAEALVDADSDAEVLAEAEALVDAEADALVDAEADALVLAEAEALVDAEADALVLADSDALVDAEAEALVDAEADALVLAETDALVDADSDAEVLAEAEALVDADVLADVLALVDADVLADVLALVDADVLADVLALVDADVLADVLALVDADVLAEVDALVDADVLADVLAEADALVDADSDAEAL (SEQ ID NO:1634)
>orf03175
VFEVVDKTDEVSSKHCFEVADRTDEVSLKHCFEVADRTDEVSLKHCFEVADRTDEVSNHTYDKVKLTWFEEIFEEYHTKKPCSSR (SEQ ID NO:1635)
>orf03178
MYQDLLRKIAEEKPNYNQEEIQWLLDHLGDPSPEIRDDLVFTSFARGIQEELFTQEQFHFIAEGVSSDGGLDKEIDKIGLPTLERSFRALIYATLLSDDANQQSIFYQRLKAEIRNVLLNQGLHYLSKEKDTTGFSSQYGWVHSFAHGADLLKEVVCHPDFPKNRVHEVFDILGQLFKRMSIRFTDDEDWRLARVIYEPILQGKLEQEQVASWIKTVDFPIEEREDFSKFSNFRSCLVEVYVQLDQRNSLQDELKEAIQSFQY (SEQ ID NO:1636)
>orf03181
MGFKVSHFKIPSSHLSINVLRTIENFTEIGQGLLHISP (SEQ ID NO:1637)
>orf03182
VGFFDFGLTNSCRQVRQFTQTVQDFLVCYHQGIVKEGQGYAGICFKFHPSLGNIGKFVIAIVRRLRHKSIVANMAHLNVDLFQFRKGLLEILKSVKIALVITAKLVDVFTSFLDCTQEILTVLV(SEQ ID NO:1638)
>orf03190
MNITYIVGNGLDLQYGLKTRYRDFYEFQNKVYISRTENEEKYSNFIYESLFSDKVNDYENWSDFELSIGKLTKDNDLISSSIEIKEKFIDDFSEVVDDLREYLRIQQEKNLEKGNAIDFISTLDDMRTSLPVINQPAIDKKYNENPHQDDIVNIVTLNYTHVIDKLYNGSAKSFRNQLRANLYNFYIEPPIHAHGTVDVCTVLGVSDEIQISNSLEEQKESLIKSLVLKNYRENMDVKNSDIIKNSDIIILYGVSLGETDRYIWSQIAERSISGSVPVIIYHYVPHFDPGNPIRAKRLYRNVEDKFIQNSGIDLELEKKLRDNLIVVIGKTIFDLIER (SEQ ID NO:1639) >orf03191
MNTLLTLRGKSFTQKSRNNGMGPITIPKKTIITLEHLKYLHFSLEETKTYWEKNNIIDGILISIYYNRIVAKSNRINGYFNVGGGNPFPNDTIVGAKFNDEKTKHIVTHYISRDALNKTITVLSKIIEVFEEHFDRAITCEMFSDSSTFASINFSEYGISKSKFQQYLRDSCFIENFGVEHTTVSDIQNSIVTFYDVHTDIFRLLNKLNIDISEANIMNQTTVLLDEKNIELLLSKAPYLVSMIVEDFSKLSVDDFSLDNNDLKINLPSPMNEPVVGVIDTLFDKRVYFNEWVEYHDFVSPDISKDSQDYKHGTAVTSLIVDGANLNPNLDDGCGNFRVRHFGVSLQSGFNSFTIIKQIKEIVSQNADIKVWNLSLGSNDEIRENFISAEGALLDEIQFENDVIFIIAGTNASVINGKRKRIGAPADSLNSIIVNSVDFNNQSVSYSREGIVLSFFVKPDVSYYGGGNGDFINVCEPLGLGRVAGTSFAAPFIARKMAYLIHIMGLSREEAKALLIDAAIPWNDKKTFTDLSLIGNGIVPIKMDDILSTPDDEIKFIVSDISRAYDTYNYDFPVPISSESYPYVAKATMCYFPNCSRKQGVDYTNTEMQLTFGRLKSDGIKSINKDNQHAEDTPGYVRENAARNIFRKWDNVKHIGESFTSRKRAKAILNPSNPQWGMSIKTIERLKSGDGQGVRFGVVVTLKELNGVNRIEDFIQQAELRGWLVNRLQVEAQVDLFNSLNEEIEFE (SEQ ID NO:1640)
>orf03192
MKKSDVLDLIKYHYEGRETEFRNQSIAIARNFNKHGDTQIAQYIMGLMSQSDRFMPQIENPSEYLTPAKLDIGPLPLPLSIMNDLKGIINAVNHHIGINKFLFVGSPGTGKTESVKQVARLIGKELLVVDFSHLVDSKLGQTVKNLATLFNEINNLPFKQNYIILFDEIDSIVLDRVNQNDLREMGRVTSAFLKELDRLSPEIVLIATTNLFENLDKAVTRRFDAIIDFDRYTDEDKVEVATIILNELLKQFKNVARDLKLFKKIINSANVIPNPGDLRNSIRTSLAFSDPSDPHDYQKRLLRSLHNGRNLSISKLSKLGFTVREIEILTGISKSSVSRELSED(SEQ ID NO:1641)
>orf03200
MINSQVFEIRIFNSYYKDAIYYFKNINYSIFHIFSTHY (SEQ ID NO:1642)
>orf03205
VGHRFDPCRGHLNTTGKALEPRFFCLNKIFFKFFRKLA (SEQ ID NO:1643)
>orf03206
MGWKGTPPCLHPSNQDTTILIVQQCLRRIEVLAMINFLN (SEQ ID NO:1644)
>orf03207
VFGSYYRVIASIFFKEFWITEISSNQLIWQVCSSYNWILGNLFKVNPVI(SEQ ID NO:1645)
>orf03208
VLPSHQVLTFSMSPVHRSPNTIIWIELIKEMVFSTKINKSIWIIDPTNLS(SEQ ID NO:1646)
>orf03219
LLRFAIHSNLLFLKRFIFSIKDSAWRCLFISFFETFHFCFKNKSFYLNSFYH(SEQ ID NO:1647)
>orf03230
LIVSLKTKSRKAKDMAESIQGWLAQFLVNLFKSITFDCGKEFSKWKDISNHHDSESFFANLECPRQRCLNEHSNRLLRCHDLPKQTDFNEVSQEF (SEQ ID NO:1648)
>orf03231
VVEIIYFLIIIIASGLGSISGMGGGIIIKPLMDSFGYHSVSDIAFYSSFSVFIMAIISTTKRFSQSKEIKWRLIFTVSFSSVLGGFLGHLIFQVLLSQLSVRLVSIVQMILLFVMLLVSFVLTDFKKTYQFDKIGFYMICGLLLGLISSFLGIGGGPLNVSLLMVFFSISIKEATMYSLAIIFFSQLSHLATIVVVTGLNQYHLAPVPVIFLASICGGVLGTVVSKVLPENWVRYCFKGMLFFVMGMTLYNLFHIL (SEQ ID NO:1649)
>orf03232
MMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLAERQEWSFVFILFVPEAEREGYPHPAGAPWWLQVSNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSEYPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCRNPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE (SEQ ID NO:165)
>orf03235
MVQTKQPNIILIVVDQMRADALSLNSKDKLVSTPTLDMMASVGYNFENAYSPVPSCVPARAALLTGLDQDKSGRVGYQDEVPWNFTNTLPKVFKDMGYQTECIGKMHVFPSRQRLGFDHVLLHDGYLHVDRKYDKAYGSQFDYASDYLAFLKGKVGYDVDLI DDGMDCNSWEARPWDKDEKLHPTNWVVSESISFLQRRDPTVPFFLKMSFEKPHAPLNPPKYYFDIYMERLPQFLDLHIGNWEVLEKQIPSIYALRGKLKEDDQRRMVAAYFGLITHIDHQISRFLTALKEFRHDKDTIIWFVSDHGDQLGEHYLFRKGYPYQGSIHIPSFIYDPAGLIAGNRGTIKQLVKIQDIFPSLVDLAGGTTTDELDGRSVKNLLFGQYEGWRTEFHGEHALGKDSSQYILTDQWKFIWFPVLNHYQLFDMKKDPHEMNDLYPSEKYQPIVRQMKKKLVDFLRYREEGFVVDEELVPVELSKITPTLTKTGDSQS (SEQ ID NO:1651)
>orf03237
MNTMLDKMQEKLSPIAMKVGNQKFLVALRDSFVGTMPVIMTGSIALLLNAFLVDLPQQFHLESITKTFQWLVDINNLVFKGSIPIVSLLFIYCLGVNIAKIYKVDTVSAGLVSLASFVISIGSTVTKSFPLANVGDVKLDQILQGIDNLAFDGKNLMVTIGNVIPGNHINARGYFTAMMIGFLASIIFCKVMKKNWVIKLPDSVPPAIAKPFTSIIPGFMAMYIVAILTYVFHLLSNDLLIDWVYKVLQTPLLGLSQSFFAVILMIFLNKLFWFFGLHGGNVLAPIMEGLFGVAMLANLDAFQKGEPIPYIWTSGSFGAFVWFGGLGLVLAILIFSRNSHYRKVAKLGLAPVLFNIGEPVNYGLPVVLNPLLFIPFVLSPVFMATVAYWATSWGLVSPVTQNVTWVMPPILYGFFSTAFDWRAIILSVVCLIISVLTYFPFVKMADKTELS (SEQ ID NO:1652)
>orf03238
MNESNLESAMGLIMYGGEAKSNAMEAIQAAKKGDFSKANRRLADANAALLQAHKAQTEMLTREAQGEETSISLLMVHAQDHLMTSLTFVDLAKEVVEVYERFEKN(SEQ ID NO:1653)
>orf03239
MAKVTIMLACAAGMSTSLLVTKMQKAAEDKGLDAEIFAVPAPEAEEIVATKEVNVLLLGPQVRYLLGDFQEKLKDRQIPVAVIPMTDYGMMNGSKVLDLAESLLD(SEQ ID NO:1654)
>orf03240
MKRLISANPSEILQMNAEELKQSILASEGRVVLSENVVTRETFVGDITNSEIARAFGADMILLNCVDVFEPKIYALDSSGDDVIHRLHQLVACPIGVNLEPIDPSAKMLEETQEIVAGRVASVETLKRIEELGFDFVCLTGNPGTGVSNREIIKAVQTAKENFSGLIIAGKMHGAGVNEPVAELSVAEQLLEAGADVILVPAVGTVPAFHDQELREVVDLVHSKGGLVLSAIGTSQETSDTDTIKEIALRNKICGVDIQHIGDAGYGGLATVDNIYALSKAIRGVRHTVSRLARSVNR(SEQ ID NO:1655)
>orf03241
MEKLLQEKLLPVAARLGNNKALVSIRDGITLTIPLLLIGSLLMVIASFPIPGWEQYLGDIGVADYLWKGVDSSFGLLGLVASFGIAYFMARQYKVDGIPAGIVSLSSFITVTPFIRGEAGAGMPTAFMASKGLFVAMILGLINGYIYQWFINHNIQIKMPDGVPPAVSKSFSAIIPGAVTIVGWLIVYATLDKLSLPNLHEIAQVALGGPLGLLGNNVIGLLILIFLNSSFWFVGLHGGNVVNAVMKPLWLANLDANKVAYQTGETLPNIFTSVFMDNFVFIGGGGATIGLVLALGYLAHKKKASKQLKTLAPITVIPGLFNINEPAMFGVPIVLNILLLVPFILAPMFNLLVAWGAMASGLVPLTYTDPGWTMPPVISGLLATGSISGSLLQIVLIVLDVLLYLPFVIAIEKRFKLLED(SEQ ID NO:1656)
>orf03242
MTLSKKQLQLRAKILETVYTLGPISRIEIATKTGITPATTSSITNDLIKENILLELGEDEHDTSVGRKKILLDIQAKRFYYIGCELSEKHFTFALGDNLGNILKEEKEIVTKQLIQEKGNQLINQTLKQFLNNCSDYEIEAIGIALPGRYLDDYKITTNNPLWQHIDLEMIQSHFDKPLFFSNNVNCMAIGKRLFSRQQNDPNFAYFHFARGMHCSYIYDGNIYGKGNLMIGEIGHTVVSSEGEECSCGRKGCLQTFAGESWLIKKSKILYHQSPYSLLPSLVKNADDIDIQVILTAYQLGDTGIITLIHQALLYLSQTILNISMMIDSQKIYLHSPLLTNQHIIQKLYSEMNYKPKLLYNRLPEVIIEPYNDFTAAHSAIALCLYHTILHS (SEQ ID NO:1657)
>orf03243
MTIRFEEKVSTENAQFVCQWSNSLGKVFQEQWIGPRIPFPLTIQVFQDLEGILSIFEGQEFVGLIQKIRLEDSNLHIGRFFINPQKQGQGLGSQALRKFVSLAFENRDIDSISLNVFEANQRAQNLYQKEGFEIV (SEQ ID NO:1658)
>orf03261
MPFKENLICQHRNHHCSVFFISLGLLHNIHIEIDISQTRASFLDLSDYLQAVLMILQKFCQAIGLAQRLDLLQLHLLHLTRLLL (SEQ ID NO:1659)
>orf03271
MYLLLLVVKDHIALIDKEMHVWRPNCILRDLTNFFIKRNHIVTHKTNGSTTKR(SEQ ID NO:1660)
>orf03272
VLTLMNHFIKEIQGIPINHLTILIENSIFKLNLKNWIIG (SEQ ID NO:1661)
>orf03274
VDRTDEVSSKHCFEVADRTDEVSNHTYDKATLTRFEEFFEEYKGVPR(SEQ ID NO:1662)
>orf03293
VCQRMDARTCKTTIIAVHNVLTALQQTWIAVQLYQTK (SEQ ID NO:1663)
>orf03294
LHLGKSILSLPVKGKDLEFLVHLFVINHWIGFPSRTSTFCRCKVLNSME(SEQ ID NO:1664)
>orf03295
LEQTVIIANNPCELYWDNHLSFLSDSLLKQVIVHLKRICLDIHHDRGCSHVRNDTT(SEQ ID NO:1665)
>orf03297
LTDDGVLILVVDAGWRGNSCLQEQGCHHFRAILLCITWHFRSCTDKGHLTFKDIDQLRQFVQTDTSDEISNLGNTAIVSRSHQTSFFIRIRHHGTELPNLEPTVVLGHTLLLVNHWPLAIQLDPNAQDEKDGRS (SEQ ID NO:1666)
>orf03298
MLKMRKMGEVRTSKTKAKTQSKQRLKISEPFLLETSW (SEQ ID NO:1667)
>orf03313
MLEEGTKDQLAELTYPFGRGVNLSFGIKDVPKLYQKVMEANYPIYRLLTKRKFRVSDPYIYPHKFAVLDPDGYFLRFSE (SEQ ID NO:1668)
>orf03316
MDQNLFNYNDEDIDSVIEYSHKLLNRKFSDVMEEYNRSLYKSYDDYNDRVVSEVQDKAISMKSKGQYGNYIEKYFYGYQPNSDSEADFEKIGVELKVTPFKINKNGTLSAKERLVLTILNYMEENLEDFYSTHLWKKCAKILLLFYNGLIPNQTMKDYVIEKIFLYEWFEEDMAVILEDYQKITDKIKNGKAHELSESDGNYLSTCTKGAGKGKDLRQQPFSHELAKQRAWELKSSYMTYLINHKIFNQSDQESVLANFRGEKKSFTEVIAEKILSYKGFSEQELYDRFEVNSKAKGKNSTLIRKILGLTGDLDKTKEFQKANMNLRVIRVDKNNLPKEDSPFKTYCFKELAATDSWESSHVYNEIYNKRFLFVIFKEIEPKLFVLDSIKFWGFQDRQLEEIQRVWQETRQIISDGVKLTQNGNKVSTNFPQSKINKILFTKLHATNTYYEIDKGKFVGKGSLSDTDELPDGRRITKHSFWMPKKFIKEILDGNWD(SEQ ID NO:1669)
>orf03317
MKVLELFAGVGGFRIGLENADKQLFKTKWANQWEPSRKSQDAFEVYDYHFPNSKNINISISDITDEQFSKMDADMIVGGFPCQDYSVARSKKNEKGIEGKKGVLFWEIIRATEIIKPKYLILENVDRLLKAPSKQRGRDFAIMLTAFNNLGYSVEWRVINAAEYGRSQRRRRVFFFVYRNDTVFAQKIDNLYEKNEEIFEDNRYDDYIFNQGLFAKQFPIKPIAVKNRHVFYELPNDIVEVSDTFTGTVWNTGIMRRGKYYSIDTEPNYNGNPITLGEILQDESEVPEKYFLTDQSKLEKFQYLRGPKKIERTSSDGHQYIYSEGGMSPYDDLNLPGRTMLTSEGTVNRSTHLLFVNNKYRLITPIEAERLQDFPDDWTAKKKLSDDSIVEVSDKMRMFFMGNALVTEIVKEIAKFIKEID (SEQ ID NO:1670)
>orf03318
MDTFSFNGQYIVEFSCLKVVDRGLECHPIKSQRDNHQTTDLVT (SEQ ID NO:1671)
>orf03320
MGIAIVVERRVHYFGRHHNVTISHFFNFVIFKGRYSVKMKVFHRFLLIFQTTL(SEQ ID NO:1672)
>orf03333
MIACRHDICKSQKGLEHPFCIIRRLTRDFNQRPVCIVEANIFCLKITPQIITNMIVARTVKSSKTGITLTTSMCKRDNHKITWFHRRNGFPSFFNNPNRFVSTIFMSSFRFWITVPP(SEQ ID NO:1673)
>orf03341
MLRQFRLGFFDVRMTECHLKWKERENFHDFLKFYCKDS (SEQ ID NO:1674)
>orf03350
MNMNKDQIAILNGADNLNLTLWMTLKEICKEGCKSFFPVRNTCRMLDIGIPYRLGLSLSNSSVLNGMDV (SEQ ID NO:1675)
>orf03370
MHKLRIFVNQLCRRFGIILGPFLVLGFQVLTQELELAIFFDLREEVLLQVIPQVCHFCYLRKEFTTLNQHELTSHDHVLTRHFQTHGLQG (SEQ ID NO:1676)
>orf03382
MLHMNLFFQPFFTNLCKTLATGCCVKTVMEWSSIATTIDFKIIE(SEQ ID NO:1677)
>orf03383
LDNRAKEWIMSTAQNQAI HLSNQGTQGFIDHLLGNTG (SEQ ID NO:1678)
>orf03385
LSLDFFPDDRSRSVTSNDNHFDILGQEKVDQLPSIFTNLLSRTGAIGRPRRISNIDDFFMGKLAHELAHNGQAPDTRIQKTNWSIIHTVFFLVFFLIDRSL (SEQ ID NO:1679)
>orf03394
VSYGSHIFFASNCLKQIFGFLFKFSHLILLILVKASLI (SEQ ID NO:1680)
>orf03397
MTSLLTLENIHKTFEAGTVNENHVLKGLDLEVEEGDFISVIGGNGAGKSTLMNILAGNLSVDEGDLLLAGKSIKNLSVRKRAKDIARVFQDPKMGTASRLTIEENMAIALRRGQKRGLGWGVKEKDRIQFQEALKELNIGLENRLKVDTQYLSGGQRQALTLVMTDLMKPKLLLLDEHTAALDPKTSQMVMDLTQKIVEHHQWTTLMITHDMNHAIEYGNRLIMLYQGKIVVDVKGEEKKHLTVEDLMHLFQKNSGQSLVSDELVLG (SEQ ID NO:1681)
>orf03398
MNFVLSSLSEGLLWSIVAIGDYLTFRILDIADMTAEGAFPLGAAVVVSQIQAGANPWLATLLALLAGMVAGLVSGMLHTKMKIPALLTGIVTLTGLYSINIKIMGSVPNLSLGDSATVFKQLASLGLTNEGAVFSLSLVCFLLVCLVLTLLMKTEIGLVLRSTGDNIPMSEANGVNVDTMKIVGYMISNGLIALCGSLFAQNDGFSDVTSGTGTIVVGLSSVIIAEVLIHDLTIGGRLLSIGIGAIVYRLIILNIYEIPNLDQNLVRLFNAILLALVLFAPELQKRLKIRGLKLRNE(SEQ ID NO:1682)
>orf03399
MAEVDMVFVPTDNIILSTMETVKQVSIKHKVPVFGGSTEMIAVGGLYNYGTNYEELGRQTARMLIRVLKGEEPENIAVELPEKLELHTNQEMADALGIDISKLEGKE(SEQ ID NO:1683)
>orf03400
VDELAKQGYVEGENIEIDLQNAQGEQRNLKTISQQLAESSDVVLAIARPSAQSLANTTQTTPVIFSAVTDPVSAKLVESREHPGGNVTGTSDQSSDAISTQINLIKKVLLKAKTIGILYTQSEPNSVV (SEQ ID NO:1684)
>orf03401
MQTDQRSQEEPHYQEGASDFRTTFIMKLLLRKDKTKNRLDTI (SEQ ID NO:1685)
>orf03402
MLPILSPFSSPVNNISEFFKIFRKFFQEAFKVFQISPTKKVL (SEQ ID NO:1686)
>orf03412
MINVNQVSIEVKNTFKNWNFTSSIELTTFSKFSQSPTMT (SEQ ID NO:1687)
>orf03414
LIDVLFINSFIGRICFYCYRRIHATCLFLQLFSIVILNVAHTLKHSIFIVITFISRCRNFIIVRILLENQFSRNQGIDNRVGQSRY (SEQ ID NO:1688)
>orf03423
MKIKEQTRKLAAGCSKHCFEVVDETDEVSNHTYGKVKLTWFEEIFEEYKKSSWNL(SEQ ID NO:1689)
>orf03442
LVEQLTFNQWVTGSSPVRVIYAGLAELADAPDLGSGA (SEQ ID NO:1690)
>orf03443
MSIVKSHSFSISLGIFNSFWNNIHTSECFNFLCKGKSNRSNSTISVNQMVFFINIQRFYCFAIEDFCLLRI (SEQ ID NO:1691)
>orf03444
LNTLLPPDNLCLFTIYLTGFSCICINSYCHNFWEIFNQLFYQLS(SEQ ID NO:1692)
>orf03451
MGFSMKLIHDLNTHTTHSTAKMLHNVKAIKNDFSIRE (SEQ ID NO:1693)
>orf03460
MYNKVILIGRLTSTPELHKTNNDKSVARATIAVNRRYKDQNGEREVDFVNMVLWGRLAETLASYATKGSLISVDGELRTRRFEKNGQMNYVTEVLVTGFQLLESRAQRAMRENNAGQDLADLVLEEEELPF (SEQ ID NO:1694)
>orf03464
MQFTRTAHHTKTLFTTKFTWENEIPFWHHSSRKRDNGFQPHTRIGSSCNDLYSLITCDCNLADVEVVTIWMGYHLNNFTDNKLRFLIINNFFCKTFRLQLLVQTSDLLICQKDLTALCSFK (SEQ ID NO:1695)
>orf03469
MILDSFFAFNCSGTMKVSTWVYDKGEWYYVSSSGSMIANDWVKDNGK(SEQ ID NO:1696)
>orf03476
MCTSINKKKKAVKPSFDLFFCFIFCKLTIVQVSVEATLRHQFLIVALLDDISIFHDQDQVCISDG (SEQ ID NO:1697)
>orf03484
LHLRTCFVRQTNKLSPLINRTRLQFHQTILHYTLNQITSNRLGNIEFLIDIFNQDQVLVFLAIIQKMHNLTLRPTHKFNAATFGFLLHHQVNLMTKTLKD (SEQ ID NO:1698)
>orf03499
MLEIWKYRPFVSEFWNDFKNNHDKQFVDSISLYLTLKDDDDPRIEEESEALENMILQYLGEDDAS (SEQ ID NO:1699)
>orf03507
MAFNQFNRCIGLSIPTAPNVPGTIINRSYLHDATVPNNVREKT (SEQ ID NO:1700)
>orf03523
LTDFHDFKFI FFENLFKSRQLYLQSQNTVLSNLWLAT (SEQ ID NO:1701)
>orf03533
MKIMKKKYWTLAILFFCLFNNSVTAQEIPKNLDGNITHTQTSESFSESDEKQVDYSNKNQEEVDQNKFRIQIDKTELFVTTDKHLEKNCCKLELEPQINNDIVNSESNNLLGEDNLDNKIKENVSHLDNRGGNIEHDKDNLESSIVRKYEWDIDKVTGGGESYKLYSKSNSKVSIAILDSGVDLQNTGLLKNLSNHSKNYVPNKGYLGKEEGEEGIISDIQDRLGHGTAVVAQIVGDDNINGVNPHVNINVYRIFGKSSASPDWIVKAIFDAVDDGNDIINLSTGQYLMIDGEYEDGTNDFETFLKYKKAIDYANQKGVIIVAALGNDSLNVSNQSDLLKLISSRKKVRKPGLVVDVPSYFSSTISVGGIDRLGNLSDFSNKGDSDAIYAPAGSTLSLSELGLNNFINAEKYKEDWIFSATLGGYTYLYGNSFAAPKVSGAIAMIIDKYKLKDQPYNYMFVKKILEETLPVKNGIKVLNIPNVLRYDLNMLQLEYKNEQSWDSFIDNVNLIELEERIQTTIGIKQINTHNIITIAREGYSQNYLPNTSENTYNSLQVSLVGVLLLFISMVNILWAKKSK (SEQ ID NO:1702)
>orf03543
VLKWCILRINHHISRKVDNFLEGTRAHIKGQAHTAWNPLEVPDVRYRSFQFDMSHTLTTNFRTRYFNPTAVTNNSSVTNAFVLTTSTFPVFCRTKDHFIKESFTFWFQGTIIDCFRFFDFSIRP (SEQ ID NO:1703)
>orf03553
MPWKELCHKLAPKVFKVIRIYSRENKKSPSNWAFCSFET (SEQ ID NO:1704)
>orf03559
VSVLFFCSYFSLSLEKGWFSSLISCKFMNQFLPFCWRQDSPWILTLAQDSITYH(SEQ ID NO:1705)
>orf03564
VTDENTRKVRLLVAFFSIVIGYILSSFFISLYHLWQEALRGLL(SEQ ID NO:1706)
>orf03566
LHVELIDSHKFNIGRTTCSLLSTTNICKRCQPSINHMS (SEQ ID NO:1707)
>orf03567
MLNTNRNELIGTSFLIFCVIYFKDLANIFRTTWNLYIIRQGHYKCQESHNQGRNDV(SEQ ID NO:1708)
>orf03570
VNKPILSDIDCHLTNSINLFLPDTQTGNLFWKFNGLIRLAHNHNIFRKKLSLSHFFNICYDLFLGIGRV (SEQ ID NO:1709)
>orf03571
LKFSNFLGHLDIFSHDGLSLTVSLHQNSTGHATRYCFDR (SEQ ID NO:1710)
>orf03584
VDSLFLSLGEESNQEINLQESFSSTDCNPTLISPETTVAQGLCQDIIYRPFT(SEQ ID NO:1711)
>orf03586
VNPKSLGSFFLQDSKGFKELVLGHAKLSLPRIVHNVCPQFKNASRIITTRDDFWNACYSLQMFNIFKGIQVNGRTQFTCIGVFLVWRVVGREHNLRTQKVQFMAHQKLYITRAVHTTTFFLENFQNSWSWSSLNCKIFLKALVPRKSLVDGSCLLTNPLLIIQVKGSRELGNNRF(SEQ ID NO:1712)
>orf03590
MDNLCLHNTWTDWTSIFKQAVVTEDDMTKQNDFFLGIIDAEFHNCLGNFAINESDMSKKITSHCVLCLVWPRQLDDLSQVMQHNPRIEQALIELRINFANSVCQTHHGRRMIGQARFKGMVVGLGSWIGVEFLIILGVEISDNPLPDRIFNFENHLRHVVTNFLDINW(SEQ ID NO:768)
>orf03591
LIDLRGIVINFSASFHVDNLTCGKGLNVMRLGIPELPINLATIILEGKG(SEQID NO:1713)
>orf03604
MDALVLQKNQETIQQIAVKIRFLDGHDYYSLIDIDNRRTNQTVFPFVNFEDIAF(SEQ ID NO:1714)
>orf03605
MAFFTEIPTRACLINLAITLHIVETCQGFNDLSLHLRVLAL (SEQ ID NO:1715)
>orf03609
MLLPLPFNTSKIKQIAMHSDLNQKEMIGHIFHDEDIF (SEQ ID NO:1716)
>orf03614
MKQTVKKLALVASIAATLGGGVSVASAAVQYPEGGVWTYGSGNGGAYSNYYHPSKYHSSTVVSRKTGSSDKGYAGAGGTSRAWIRTSWGEKVAFYYNV (SEQ ID NO:1717)
>orf03643
LVEQLTFNQWVTGSSPVRVIYAGLAELADAPDLGSGA (SEQ ID NO:1718)
>orf03675
MKELLNKAFFNKNKASLSKEVLLELQGKRLPVNLFLSKSLFQASL(SEQ ID NO:1719)
>orf03690
MMTKIKLTIDIIMPCRHITNIWIYKEEGRVNLFFYSQIFFDAIDERIIHNFNSKYHLSFFSPVTGFSQIFDKTLACLG (SEQ ID NO:1720)
>orf03695
LHTSFRSSVGHSHTWHQDIVRPILFSRFNDSIVILWQNCPTFN (SEQ ID NO:1721)
>orf03713
MLEQARLKVEQQAIKNIQFLEQDLPKNPLEKEFDCLAVSRVLHHMPDLDAALSLFHQHLKEDGKLIIADFTRTEANHHGFDLAELENKLIEHGFSSVHSQILYSAEDLFQGNHSEFFLIVAQKSLA(SEQ ID NO:1722)
>orf03714
MKHDFNHKAETFDSPKNIFLANLVCQAAEKQIDLLSDKEILDFGGGTGLLALPLTPSQAG(SEQ ID NO:1723)
>orf03716
VWKKKKVKAGVLLYAVTIAAIFSLLLQFYLNRQIAHYQDYALNKEKLVAFAMAKRTKDKAEQESGEQVFNLGQVSYQNKKTSLVTTVRTSKSQYEFLFPSVKIKEEKRDKKEEVATDSSEKAEKKNQKRSLKRKRIPSQFNYNALNPE (SEQ ID NO:1724)
>orf03718
MVDLQSFFTRKYLNLNSVDAYLILPRLQGHLSYPQDFFLLQDFCFLLPIFLNLSQKEGRNAGKDS (SEQ ID NO:1725)
>orf03733
MRIRNSPFDHILQTIFEFEDRTCQVTCRFEACSSICNDNWEFSQHIISVFQSPSCHTVCDKSDVFCSFLFDKNFASLWIYVVTITDQLCIGMWQLVHGSNHTQFTVSQPTHSIVGMHPNTRSSIDCFFGFIKSRV (SEQ ID NO:1726)
>orf03734
MSKSNRHTFARNCTNKVFHPITFWCKGNFIKQAICRFLPRMKLLNTRVSHISWILCPLKSFCEIWTFIINPTNLSTCCFFIMVSKIFSDCKQLLISGC (SEQ ID NO:1727)
>orf03736
MQCTFNVVVHHIYTCISMNMSIHKTWGNAITCIVNHLSPFRNLLYMFPKLAVHKFQVTTSTNSVWVEKLIRFNIVRHNVNLLKRLILQFIMSITL (SEQ ID NO:1728)
>orf03750
MKKRMLLASTVALSFAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYTVQYGDTLSTIAEALGVDVTVLANLNKITNMDLIFPETVLTTTVNEAEEVTEVEIQTPQADSSEEVTTATADLTTNQVTVDDQTVQVADLSQPIAEVTKTVIASEEVAPSTGTSVPEEQTTETTRPVEEATPQETTPAEKQETQASPQAASAVEVTTTSSEAKEVASSNGATAAVSTYQPEETKIISTTYEAPAAPDYAGLAVAKSENAGLQPQTAAFKEEIANLFGITSFSGYRPGDSGDHGKGLAIDFMVPERSELGDKIAEYAIQNMASRGISYIIWKQRFYAPFDSKYGPANTWNPMPDRGSVTENHYDHVHVSMNG (SEQ ID NO:1729)
>orf03763
VLAEADALVDAEAEALVDAEADALVLAEAEALVDADS DALVDAEADALVDAEAEALVDADSDADVLADTEAEALVDAEADALVLVDADVLALVDADVLADVLALVDADVLAEAEALVLAEAEALVDAEAEALVDADS DAEVLAEADALVLAEADALVDAEAEALVDAEAEALVDAXXHS FIN (SEQ ID NO:1730)
>orf03764
VRRLQHRQVLQLQRQPVRRLQRQPVRQPQQAPVLRLQQVLAPQPQHQQVLRSQRQPVPLNPHQPVHRLQQVLAPQLQHQRVLQLSMNQCVGIRINQCIGFSKY (SEQ ID NO:1731)
>orf03766
VRRNPHQPVHRLQQVLVHQLQHQRVLRLQQAPVRLNPHQRLPQPQQVPVRQLQQVLVHQLPHQQVLQLQRQPAPQPQQVPVRQLQQAQAPLSQRQPVRQLQXXXFHSLIN(SEQ ID NO:1732)
>orf03772
VTDENTRKVRLLVAFFSIVIGYILSSFFISLYHLWQEALRGLL (SEQ ID NO:1733)
>orf03774
MNXXALVDAEAEADVDAEAEALVDADAEALVLAEADALVDADSDADVLAEAEALVDAEADALIDADSEADVLAEAEALVDAEADALVDAEAEALVLADAEALVDAEADALVDAEADALVDAEADALVDAEAEALVDAEADALVLAEAEALVDAEADALVDADS DAEVLVLAEAEALVDAEADALVDADSDAEILAEADALVDAEAEALVLADSDALVNAEADVLAEADALVDADSEALVLAEADALVLAEAEALVDAEAEGTGMRKLIH (SEQ ID NO:1734)
>orf03777
VLALVDADVLADVLALVDADVLAEAEALVLAEAEALVDAEAEALVDADS DAEVLAEADALVLAEADALVDAEAEALVDAEAEALVDAEAEALVDADS DADVLAEADALVDAEADALVLAEADALVLAEADALVDADSEALVDAEAEALVDAEAEALVDAEAEALVDAEAEALVDAEAEADVDAEAEALVDADAEALVLAEADALVDADSNADVLAEAEALVDAEXXIPFIN(SEQ ID NO:1735)
>orf03794
MKIKEQTRKLAVGCLKQCFEVVDRTDEVSSKYCFEVADGS (SEQ ID NO:1736)
>orf03804
LLGSFFSWTTKELMGIIFFNNFPTVHKNNMIGYISSKTYLIKLIKNSI(SEQ ID NO:1737)
>orf03818
MEKILLHNLNQTEFFINKAIGWTLRDYSKTNPTWVTCFIEKNKERMAELSIKEASKYL(SEQ ID NO:1738)
>orf03819
MSLADLLEELEAAKDSKKARSMEAYMRHQFSFLGIAVPERNKLYKNIFQKRKKQRLSIGILQTLAGKRILENTNMWLLTI (SEQ ID NO:1739)
>orf03829
MTTGWFQVNGRWYYAYSSGALAVNTTVDGYSVNYNGEWAQ (SEQ ID NO:1740)
>orf03851
MAFTTEELLNLGLTEEQAKSVFALRGKELNEDKSALETIKQERDSLKSQLQKAEEQVEHLKSLENISAEQKDAIDKLQAEYDKYKNEAAAELAQTKKVSAISLALKDTNAFNPDKLMKFIDVDAIQIDDNGKPQIDEVINGIKESDPHLFQAEESKPSPNIFPLR(SEQ ID NO:1741)
>orf03853
MGSSGEMRTRPAEELGVDTFYYSMKAMARPACSPLQGQIVTKGTGREIDGITIYSLLDYGYGTAAGCLGIHCGHYLTPFIVGVHELPNLPDYLKNLTPEQAEENARIEAGQRGLERLIKTHKERLHYAHTLQDDKMIQAERLKVRGYQTKIRNLINQHDFLTRDYRREKLYIS(SEQ ID NO:1742)
在一些实施方式中,优选的23F抗原选自以下多肽或其免疫原性片段:orf01158(SEQ ID NO:1297)、orf01305(SEQ ID NO:1309)、orf01307(SEQ ID NO:1311)、orf01631(SEQ ID NO:1343)、orf01804(SEQ ID NO:1362)、orf01807(SEQID NO:1364)、orf02164(SEQ ID NO:1434)、orf02189(SEQ ID NO:1451)、orf02194(SEQ ID NO:1455)、orf02219(SEQ ID NO:1466)、orf02221(SEQ ID NO:1467)、orf02224(SEQ ID NO:1470)、orf02228(SEQ ID NO:1474)、orf02242(SEQ ID NO:1484)、orf02244(SEQ ID NO:1485)、orf02246(SEQ ID NO:1486)、orf02247(SEQID NO:1487)、orf02652(SEQ ID NO:1491)。
实施例3:菌毛粘附
菌毛2介导对肺泡上皮细胞A459的粘附。双标记染色显示,菌毛2-阳性菌株粘附于A549细胞表面,菌毛(用标记的抗-01287抗体观察)接触细胞。
并且,菌毛的等基因敲除突变体在宿主细胞相互作用方面显著受损。图2显示,与菌毛阳性菌株(PN110)相反,菌毛阴性菌株(D39)没能结合A549细胞。PN110中敲除菌毛消除了其结合能力。
在盖玻片上用01287纯化蛋白培养A459细胞,通过共聚焦显微镜观察显示结合水平低。该观察结果通过以下试验得到证实:将蛋白质与悬浮培养细胞一起培育,通过FACS分析对粘附水平进行定量(图4)。使用肺炎球菌菌毛1RrgA亚单位作为阳性对照,绿色荧光蛋白(GFP)用作阴性对照。
通过共聚焦显微镜对纯化的菌毛进行成像。显示粘附于在盖玻片上生长的A549细胞。并且,当加入表达菌毛II的菌株时,纯化菌毛对呼吸道细胞的粘附似乎有所增加。这种作用可能是因为纯化菌毛与细菌和A549细胞间的相互作用。纯化菌毛不能增加菌毛2的等基因敲除突变体对呼吸道细胞的粘附(图3)。
实施例4:INV104B的其他序列
下面提供了LepA肽酶(orf01289)的示例性核酸序列:
ATGCTGCTTAAAAAGAAACATAAGAAACCAGTAACACAAGTCAATCGGGATAAGTCTCCGCCGAGTGTCTGGGGAGATATCCTTTACTTAGTCAGTAAACTTCTGATGGTTGGATTTGTACTAGCCATCCTTTACTTTTTCGTCTTTGGATTATTAAGATACAATGACGATGGCATGAAGCCCGCCTTAAAAGATGGCGACTTGGTCGTCTATTATAGGTTGGATAAACGCTATTCGATTGGTGATTTGCTAGTCTATAGTTATAAAGGTAAGGAAAGAGTGGCGCGTGTCATAGCAACCGAAGGAAGTACAATCGATATAAACGAAAATGGTCTCATCATCAACGGTTCTCCTCAACAAGAGCAAGATATCTACAAAGAAACGCTGCTCTATAAGGAAGGGGCAACCTTCCCGATGAAAGTCCCAGCAGGACAACTTTTTGTCCTCGGGGACAATCGAACAACGGCTGTAGACAGTCGTGCTTTTGGAACCATCCCTATACAGGATACTCAAGGCAAAGTTGTAACAGTCATTAGAAGACGAGGCTTT (SEQ ID NO:1743)
下面提供了LepA肽酶的示例性氨基酸序列:
MLLKKKHKKPVTQVNRDKSPPSVWGDILYLVSKLLMVGFVLAILYFFVFGLLRYNDDGMKPALKDGDLVVYYRLDKRYSIGDLLVYSYKGKERVARVIATEGSTIDINENGLIINGSPQQEQDIYKETLLYKEGATFPMKVPAGQLFVLGDNRTTAVDSRAFGTIPIQDTQGKVVTVIRRRGF (SEQ ID NO:673)
下面提供了分选-1(orf01285)的示例性核酸序列:
ATGATGAAAACCAAGCGTGAGAAACCAAAAAAGAGTCTGTCTAGGCGTCTCGTTCTTGCTGTGGATGGGGTGATCAATCACTTGCTGCTCATTTTTGCAGCTTTGATCTTTCTCTTTGGTTTCTACGCCCTTTGGGATTCCAACCAAGTCTACTCCTTAGCTTCGTCAAGTGAGTACGAAGCTTATCGACCTGTCACGACGCAACAGGATGAGCTGGCCAGTTTTTCAGGCTTCAGCAAACTCCAAGAACTCAATCCCGAAGTCCTCGGTTGGATCAATGTCTATGGCACCAATATCGACTATCCCTTAGTCCAAGCCAAGGACAATGAAAAGTATCTCAACAAGGACTCCAAAGGTGAGTTTGCAGCGACAGGCGCTATCTTTCTCGATGCACGAAATAATCCTAAGTTCGAAGACTTTAATACCATTATCTACGGGCACCACGTAGAAAATGGGGTCATGTTTGGTGATGTGGCTAAGTTTGCTGATCAGGAATTTTTTGACCAGCATCGTTACGGTAGTATATACTACAATGGTGTGGAAAAAGGGCTCGAGATCTTTGAGATGTTGGAGGTTGATGCCTATGACTTTAACATCTATGATCCAGGAATACAGGGTGAGGACCGCCAGCAGGCCTATCTAGACCACCTGCTCTCAGTCGCCATGCACAAGCGGGATATCTCACTCTCACCGAGTGATCGTATCATCCTACTCAGTACCTGTTTTCTCGATGTGACCAATGGTCGTCATATCGTAGTCGCAAAGATTACAGACACCGTCCCTAAAAATACTTTCCATACAAAAAAATCAAAACCATTTCCATACAGTGTCTTTGATGACTCGTCTCTTGGACGTTTCCTCTCATCAATCCCACTATGGATTTGGTACCTTATCTTGTTTGTATTGTTCTTGCTCTTGATTTTCTTACTCCTTGTCCTCTACTTGATCCTACGTCGTAGAAGAGAGAGTAAAAAAAAATGCAAGAAGCAGACCCTTTTACTGACTAAGGGTGAATAGAAA (SEQ IDNO:1744)
下面提供了分选-1的示例性氨基酸序列:
MMKTKREKPKKSLSRRLVLAVDGVINHLLLIFAALIFLFGFYALWDSNQVYSLASSSEYEAYRPVTTQQDELASFSGFSKLQELNPEVLGWINVYGTNIDYPLVQAKDNEKYLNKDSKGEFAATGAIFLDARNNPKFEDFNTIIYGHHVENGVMFGDVAKFADQEFFDQHRYGSIYYNGVEKGLEIFEMLEVDAYDFNIYDPGIQGEDRQQAYLDHLLSVAMHKRDISLSPSDRIILLSTCFLDVTNGRHIVVAKITDTVPKNTFHTKKSKPFPYSVFDDSSLGRFLSSIPLWIWYLILFVLFLLLIFLLLVLYLILRRRRESKKNARSRPFY (SEQ IDNO:676)
下面提供了分选-2(orf01282)的示例性核酸序列:
ATGACGGTTCAAAAAAGAGCGCGATTTAAAAACGTATTTCTGGTATTCTTCTGTGTTTTTGTAGCTCTTTTTAGTTGGCAGAGAGTAGTAGAAGCAAGTGACTATGATCACTATAATCCTATTGAAAAGGATGCTTCGAGCACAGGTTTTGAAACCCTACAGCACTTGAACAAAGATGTTTGCGGTTGGATTAGCCTTGATGGGACCAAGGTAGACTATCCGCTTCTACAAAGTCAGGATAATGTCAAATACCTTGACCGCAATGCCTTTGGCGATTATACGATAATGGGATCAATTTTTCTCGACTATCGCTTTAATCCCAACTTTACTGATTTTAATACGATCATCTACGGACACTCTATGGCTTCAGGGGCTATGTTCGGTGAGATTAAGAAATTTGCTGATAAGGAATTCTTCGACCAGCATCGCTACGGTTCTATCTACTACAATGGTCGAGAACGTGGTCTTGAAATTTTTGGGATTTTAGAAGTGGATGCCTATGACACGGAGATTTATCGAACCTTGAGTTCCAAGGATGAGGAACACCAGGCTTACTATCAATATCTGCTAAGTAAAGCCAAGTACAAGCGAGATGTTTCCTTAACA (SEQ ID NO:1745)
下面提供了分选-2的示例性氨基酸序列:
MTVQKRARFKNVFLVFFCVFVALFSWQRVVEASDYDHYNPIEKDASSTGFETLQHLNKDVCGWISLDGTKVDYPLLQSQDNVKYLDRNAFGDYTIMGSIFLDYRFNPNFTDFNTIIYGHSMASGAMFGEIKKFADKEFFDQHRYGSIYYNGRERGLEIFGILEVDAYDTEIYRTLSSKDEEHQAYYQYLLSKAKYKRDVSLT (SEQ ID NO:1123)
描述了许多本发明方法和组合物的实施方式。然而,应理解,可进行各种改进而不背离本发明的精神和范围。
Claims (158)
1.一种由肺炎链球菌菌毛II岛(INV104B)编码的分离菌毛。
2.如权利要求1所述的菌毛,其特征在于,所述菌毛包含分选酶。
3.如权利要求1所述的菌毛,其特征在于,所述菌毛包含LPXTG细胞壁锚定蛋白。
4.如权利要求1所述的菌毛,其特征在于,所述菌毛通过酶消化或机械剪切从细胞分离。
5.如权利要求1所述的菌毛,其特征在于,所述机械剪切包括超声处理。
6.如权利要求1所述的菌毛,其特征在于,所述菌毛基本上不含细菌细胞。
7.一种免疫原性组合物,其包含一种或多种如权利要求1所述的菌毛。
8.一种制备如权利要求1所述菌毛的方法,所述方法包括使产生菌毛的细菌细胞经受酶消化或机械剪切和从所述细胞分离所述菌毛。
9.一种分离的肺炎链球菌分选酶,其特征在于,所述分选酶是SEQ IDNO:282、SEQ ID NO:1386、SEQ ID NO:676或SEQ ID NO:1123之一。
10.一种分离的肺炎链球菌LPXTG细胞壁锚定蛋白,其特征在于,所述LPXTG细胞壁锚定蛋白是SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6、SEQ IDNO:7、SEQ ID NO:8或SEQ ID NO:9之一。
11.一种分离肺炎链球菌菌毛II岛(INV104B)编码菌毛的方法,所述方法包括:
使产生肺炎链球菌菌毛II岛(INV104B)编码菌毛的细菌细胞经受酶消化或机械剪切;和
从所述细胞分离所述菌毛。
12.如权利要求11所述的方法,其特征在于,所述机械剪切包括超声处理。
13.如权利要求11所述的方法,其特征在于,所述酶消化采用变溶菌素进行。
14.如权利要求11所述的方法,其特征在于,所述分离包括一次或多次密度梯度离心。
15.如权利要求11所述的方法,其特征在于,所述分离包括降低多分散性。
16.如权利要求15所述的方法,其特征在于,根据大小分离组分来降低多分散性。
17.一种抗体,所述抗体特异性结合肺炎链球菌菌毛II岛(INV104B)编码的菌毛。
18.如权利要求17所述的抗体,其特征在于,所述抗体是单克隆抗体、多克隆抗体、嵌合抗体、人抗体、人源化抗体、单链抗体或Fab片段。
19.如权利要求17所述的抗体,其特征在于,所述抗体是标记的。
20.如权利要求19所述的抗体,其特征在于,所述标记是酶、放射性同位素、造影剂、毒素或荧光团。
21.如权利要求17所述的抗体,其特征在于,所述抗体特异性结合一种或多种由肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白。
22.如权利要求17所述的抗体,其特征在于,所述抗体特异性结合一种或多种选自下组的LPXTG细胞壁锚定蛋白:SEQ ID NO:2、SEQ ID NO:4、SEQID NO:6、SEQ ID NO:7、SEQ ID NO:8或SEQ ID NO:9。
23.如权利要求17所述的抗体,其特征在于,所述抗体特异性结合一种或多种选自下组的分选酶:SEQ ID NO:282、SEQ ID NO:1386、SEQ ID NO:676或SEQ ID NO:1123。
24.一种免疫原性组合物,其包含寡聚形式的纯化的肺炎链球菌菌毛II岛(INV104B)多肽。
25.如权利要求24所述的免疫原性组合物,其特征在于,所述寡聚形式是高寡聚体。
26.如权利要求24所述的免疫原性组合物,其特征在于,所述多肽是由肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白的片段。
27.如权利要求26所述的免疫原性组合物,其特征在于,所述片段的长度是至少100个毗连氨基酸残基。
28.如权利要求26所述的免疫原性组合物,其特征在于,所述片段的长度是至少50个毗连氨基酸残基。
29.如权利要求26所述的免疫原性组合物,其特征在于,所述片段的长度是至少20个毗连氨基酸残基。
30.如权利要求26所述的免疫原性组合物,其特征在于,所述片段保留共价结合肽聚糖细胞壁的能力。
31.如权利要求26所述的免疫原性组合物,其特征在于,所述片段保留通过LPXTG基序与另一片段或蛋白质交联的能力。
32.如权利要求24所述的免疫原性组合物,其特征在于,所述多肽包含两个或多个肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白的片段。
33.一种诱导针对肺炎链球菌的免疫应答的方法,所述方法包括给予对象有效量的肺炎链球菌菌毛II岛(INV104B)编码菌毛。
34.如权利要求33所述的方法,其特征在于,所述菌毛是分离的。
35.如权利要求33所述的方法,其特征在于,所述对象是人。
36.一种检测对象肺炎链球菌感染的方法,所述方法包括分析来自所述对象的样品中是否存在针对肺炎链球菌菌毛II岛(INV104B)编码菌毛的抗体。
37.如权利要求36所述的方法,其特征在于,与菌毛组分相比,所述抗体优先结合菌毛复合物。
38.如权利要求36所述的方法,其特征在于,所述样品是血清。
39.如权利要求36所述的方法,其特征在于,所述对象是人。
40.一种检测对象肺炎链球菌感染的方法,所述方法包括使样品与权利要求17所述的抗体相接触并检测抗体与样品组分的结合。
41.如权利要求40所述的方法,其特征在于,与菌毛组分相比,所述抗体优先结合菌毛复合物。
42.如权利要求40所述的方法,其特征在于,所述样品是血清。
43.如权利要求40所述的方法,其特征在于,所述对象是人。
44.一种治疗患有肺炎链球菌感染的对象的方法,所述方法包括给予所述对象有效量的能够特异性结合肺炎链球菌菌毛II岛(INV104B)编码菌毛的试剂。
45.如权利要求44所述的方法,其特征在于,所述试剂是抗体。
46.如权利要求45所述的方法,其特征在于,所述抗体是单克隆抗体、多克隆抗体、嵌合抗体、人抗体、人源化抗体、单链抗体或Fab片段。
47.如权利要求45所述的方法,其特征在于,所述抗体阻断肺炎链球菌与细胞的结合。
48.如权利要求47所述的方法,其特征在于,所述细胞是上皮细胞。
49.如权利要求48所述的方法,其特征在于,所述上皮细胞是肺或鼻咽上皮细胞。
50.如权利要求45所述的方法,其特征在于,所述抗体特异性结合一种或多种由肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白。
51.如权利要求47所述的方法,其特征在于,在测定肺炎链球菌与A549肺上皮细胞结合的试验中测得,与对照相比,所述抗体阻断至少50%肺炎链球菌与所述细胞的结合。
52.如权利要求44所述的方法,其特征在于,所述对象是人。
53.一种确定肺炎链球菌感染患者的疗程的方法,所述方法包括:
分析来自所述对象的样品中是否存在针对肺炎链球菌菌毛II岛(INV104B)编码菌毛的抗体;和
如果检测到存在所述抗体则给予所述对象抗炎药。
54.如权利要求53所述的方法,其特征在于,所述对象是人。
55.一种检测肺炎链球菌感染患者的疗程的方法,所述方法包括:
分析来自所述对象的样品中是否存在针对肺炎链球菌菌毛II岛(INV104B)编码菌毛的抗体;和
如果没有检测到所述抗体的存在则给予所述对象抗生素。
56.如权利要求55所述的方法,其特征在于,所述对象是人。
57.一种分离的菌毛或菌毛样多聚体,其包含多肽,所述多肽包含具有最多30个氨基酸取代、插入或缺失的肺炎链球菌菌毛II岛(INV104B)编码菌毛蛋白的氨基酸序列。
58.如权利要求57所述的菌毛或菌毛样多聚体,其特征在于,最多具有20个氨基酸取代、插入或缺失。
59.如权利要求57所述的菌毛或菌毛样多聚体,其特征在于,最多具有10个氨基酸取代、插入或缺失。
60.如权利要求57所述的菌毛或菌毛样多聚体,其特征在于,最多具有5个氨基酸取代、插入或缺失。
61.如权利要求57-60中任一项所述的多肽,其特征在于,所述氨基酸取代、插入或缺失是氨基酸取代。
62.如权利要求61所述的多肽,其特征在于,所述氨基酸取代是保守性氨基酸取代。
63.如权利要求57所述的多肽,其特征在于,所述蛋白质是LPXTG细胞壁锚定蛋白。
64.一种多肽,其包含肺炎链球菌菌毛II岛(INV104B)编码的一种或多种LPXTG细胞壁锚定蛋白的氨基酸序列或其免疫原性片段。
65.如权利要求64所述的多肽,其特征在于,所述多肽包含两种或多种LPXTG细胞壁锚定蛋白的氨基酸序列或其免疫原性片段。
66.一种纯化多肽,其氨基酸序列由SEQ ID NO:2构成。
67.一种纯化多肽,其包含SEQ ID NO:2的至少10个连续残基。
68.一种纯化多肽,其氨基酸序列包含与SEQ ID NO:2至少85%相同的序列。
69.如权利要求68所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:2至少90%相同。
70.如权利要求68所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:2至少95%相同。
71.一种纯化多肽,其氨基酸序列由SEQ ID NO:4构成。
72.一种纯化多肽,其包含SEQ ID NO:4的至少10个连续残基。
73.一种纯化多肽,其氨基酸序列包含与SEQ ID NO:4至少85%相同的序列。
74.如权利要求73所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:4至少90%相同。
75.如权利要求73所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:4至少95%相同。
76.一种纯化多肽,其氨基酸序列由SEQ ID NO:6构成。
77.一种纯化多肽,其包含SEQ ID NO:6的至少10个连续残基。
78.一种纯化多肽,其氨基酸序列包含与SEQ ID NO:6至少85%相同的序列。
79.如权利要求78所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:6至少90%相同。
80.如权利要求78所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:6至少95%相同。
81.一种纯化多肽,其氨基酸序列由SEQ ID NO:7构成。
82.一种纯化多肽,其包含SEQ ID NO:7的至少10个连续残基。
83.一种纯化多肽,其氨基酸序列包含与SEQ ID NO:7至少85%相同的序列。
84.如权利要求83所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:7至少90%相同。
85.如权利要求83所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:7至少95%相同。
86.一种纯化多肽,其氨基酸序列由SEQ ID NO:8构成。
87.一种纯化多肽,其包含SEQ ID NO:8的至少10个连续残基。
88.一种纯化多肽,其氨基酸序列包含与SEQ ID NO:8至少85%相同的序列。
89.如权利要求88所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:8至少90%相同。
90.如权利要求88所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:8至少95%相同。
91.一种纯化多肽,其氨基酸序列由SEQ ID NO:9构成。
92.一种纯化多肽,其包含SEQ ID NO:9的至少10个连续残基。
93.一种纯化多肽,其氨基酸序列包含与SEQ ID NO:9至少85%相同的序列。
94.如权利要求93所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:9至少90%相同。
95.如权利要求93所述的纯化多肽,其特征在于,所述序列与SEQ IDNO:9至少95%相同。
96.一种纯化多肽,其氨基酸序列包含与选自SEQ ID NO:29到SEQ IDNO:1742或其免疫原性片段的序列至少85%相同。
97.如权利要求96所述的纯化多肽,其特征在于,所述序列至少90%相同。
98.如权利要求96所述的纯化多肽,其特征在于,所述序列至少95%相同。
99.如权利要求96所述的纯化多肽,其特征在于,所述序列选自:SEQ IDNO:29到SEQ ID NO:1742。
100.一种纯化多肽,其氨基酸序列与选自下组的序列至少85%相同:SEQID NO:53、SEQ ID NO:65、SEQ ID NO:70、SEQ ID NO:99、SEQ ID NO:104、SEQ ID NO:117、SEQ ID NO:135、SEQ ID NO:177、SEQ ID NO:178、SEQ IDNO:198、SEQ ID NO:235、SEQ ID NO:236、SEQ ID NO:237、SEQ ID NO:242、SEQ ID NO:247、SEQ ID NO:248、SEQ ID NO:250、SEQ ID NO:251、SEQ IDNO:252、SEQ ID NO:253、SEQ ID NO:433、SEQ ID NO:439、SEQ ID NO:444、SEQ ID NO:538、SEQ ID NO:539、SEQ ID NO:540、SEQ ID NO:541、SEQ IDNO:542、SEQ ID NO:543、SEQ ID NO:544、SEQ ID NO:545、SEQ ID NO:581和SEQ ID NO:593,或它们的免疫原性片段。
101.如权利要求100所述的纯化多肽,其特征在于,所述序列至少90%相同。
102.如权利要求100所述的纯化多肽,其特征在于,所述序列至少95%相同。
103.一种纯化多肽,其氨基酸序列与选自下组的序列至少85%相同:SEQID NO:626、SEQ ID NO:628、SEQ ID NO:629、SEQ ID NO:630、SEQ ID NO:631、SEQ ID NO:632、SEQ ID NO:639、SEQ ID NO:645、SEQ ID NO:747、SEQ ID NO:751、SEQ ID NO:752、SEQ ID NO:783、SEQ ID NO:786、SEQID NO:787、SEQ ID NO:810、SEQ ID NO:812、SEQ ID NO:813、SEQ ID NO:824、SEQ ID NO:831、SEQ ID NO:842、SEQ ID NO:847、SEQ ID NO:875、SEQ ID NO:876、SEQ ID NO:879、SEQ ID NO:880、SEQ ID NO:882、SEQ IDNO:913、SEQ ID NO:914、SEQ ID NO:925、SEQ ID NO:926、SEQ ID NO:947、SEQ ID NO:948、SEQ ID NO:968、SEQ ID NO:987、SEQ ID NO:988、SEQ IDNO:990、SEQ ID NO:992、SEQ ID NO:1003、SEQ ID NO:1007、SEQ ID NO:1008、SEQ ID NO:1036、SEQ ID NO:1082、SEQ ID NO:1120和SEQ ID NO:1123,或它们的免疫原性片段。
104.如权利要求103所述的纯化多肽,其特征在于,所述序列至少90%相同。
105.如权利要求103所述的纯化多肽,其特征在于,所述序列至少95%相同。
106.一种纯化多肽,其氨基酸序列与选自下组的序列至少85%相同:SEQID NO:1297、SEQ ID NO:1309、SEQ ID NO:1311、SEQ ID NO:1343、SEQ IDNO:1362、SEQ ID NO:1364、SEQ ID NO:1434、SEQ ID NO:1451、SEQ ID NO:1455、SEQ ID NO:1466、SEQ ID NO:14678、SEQ ID NO:1470、SEQ ID NO:1474、SEQ ID NO:1484、SEQ ID NO:1485、SEQ ID NO:1486、SEQ ID NO:1487和SEQ ID NO:1491,或它们的免疫原性片段。
107.如权利要求106所述的纯化多肽,其特征在于,所述序列至少90%相同。
108.如权利要求106所述的纯化多肽,其特征在于,所述序列至少95%相同。
109.一种编码权利要求64所述多肽或其免疫原性片段的多核苷酸。
110.一种纯化抗体,所述抗体通过用如权利要求64所述的多肽免疫对象而获得。
111.一种诱导针对肺炎链球菌的免疫应答的方法,所述方法包括给予对象有效量的如权利要求64所述的多肽。
112.一种肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白的免疫原性片段。
113.一种分离多核苷酸,所述多核苷酸编码如权利要求57-65或112中任一项所述的多肽或片段。
114.一种分离核酸,其多核苷酸序列由SEQ ID NO:1或SEQ ID NO:1的简并变体构成。
115.一种分离核酸,其包含在严谨条件下与杂交探针杂交的序列,所述探针的核苷酸序列由SEQ ID NO:1或SEQ ID NO:1的互补序列构成。
116.一种分离核酸,其包含编码氨基酸序列与SEQ ID NO:2至少85%相同的多肽的序列。
117.如权利要求116所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:2至少90%相同。
118.如权利要求116所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:2至少95%相同。
119.一种分离核酸,其多核苷酸序列由SEQ ID NO:3或SEQ ID NO:3的简并变体构成。
120.一种分离核酸,其包含在严谨条件下与杂交探针杂交的序列,所述探针的核苷酸序列由SEQ ID NO:3或SEQ ID NO:3的互补序列构成。
121.一种分离核酸,其包含编码氨基酸序列与SEQ ID NO:4至少85%相同的多肽的序列。
122.如权利要求121所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:4至少90%相同。
123.如权利要求121所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:4至少95%相同。
124.一种分离核酸,其多核苷酸序列由SEQ ID NO:5或SEQ ID NO:5的简并变体构成。
125.一种分离核酸,其包含在严谨条件下与杂交探针杂交的序列,所述探针的核苷酸序列由SEQ ID NO:5或SEQ ID NO:5的互补序列构成。
126.一种分离核酸,其包含编码氨基酸序列与SEQ ID NO:6至少85%相同的多肽的序列。
127.如权利要求121所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:6至少90%相同。
128.如权利要求121所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:6至少95%相同。
129.一种分离核酸,其包含编码氨基酸序列与SEQ ID NO:7至少85%相同的多肽的序列。
130.如权利要求129所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:7至少90%相同。
131.如权利要求129所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:7至少95%相同。
132.一种分离核酸,其包含编码氨基酸序列与SEQ ID NO:8至少85%相同的多肽的序列。
133.如权利要求132所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:8至少90%相同。
134.如权利要求132所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:8至少95%相同。
135.一种分离核酸,其包含编码氨基酸序列与SEQ ID NO:9至少85%相同的多肽的序列。
136.如权利要求135所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:9至少90%相同。
137.如权利要求135所述的分离核酸,其特征在于,所述氨基酸序列与SEQ ID NO:9至少95%相同。
138.一种分离核酸,其包含编码氨基酸序列与选自下组的序列至少85%相同的多肽的序列:SEQ ID NO:29到SEQ ID NO:1742。
139.如权利要求138所述的分离核酸,其特征在于,所述序列至少90%相同。
140.如权利要求138所述的分离核酸,其特征在于,所述序列至少95%相同。
141.如权利要求138所述的分离核酸,其特征在于,所述序列选自:SEQID NO:29到SEQ ID NO:1742。
142.一种诱导针对肺炎链球菌的免疫应答的方法,所述方法包括给予对象有效量的肺炎链球菌菌毛II岛(INV104B)编码的LPXTG细胞壁锚定蛋白的免疫原性片段。
143.如权利要求142所述的方法,其特征在于,所述对象是人。
144.一种在细胞中表达针对肺炎链球菌菌毛II岛(INV104B)编码菌毛蛋白的抗体的方法,所述方法包括在所述细胞中表达编码针对肺炎链球菌菌毛II岛(INV104B)编码菌毛蛋白的抗体的核酸。
145.如权利要求144所述的方法,其特征在于,所述菌毛蛋白是LPXTG细胞壁锚定蛋白。
146.一种诱导针对肺炎链球菌的免疫应答的方法,所述方法包括给予对象有效量的编码如权利要求64所述多肽的核酸。
147.一种从包含肺炎链球菌的样品纯化肺炎链球菌的方法,所述方法包括:
a)提供包含结合有权利要求17所述抗体的固相载体的亲和基质;
b)使所述样品与所述亲和基质相接触,以形成亲和基质-肺炎链球菌复合物;
c)将所述亲和基质-肺炎链球菌复合物与其余样品分离;和
d)从所述亲和基质释放肺炎链球菌。
148.一种将细胞毒试剂或诊断试剂递送至肺炎链球菌的方法,所述方法包括:
a)提供与权利要求17所述抗体或其片段偶联的细胞毒试剂或诊断试剂;和
b)使所述肺炎链球菌接触所述抗体-试剂或片段-试剂偶联物。
149.一种鉴定肺炎链球菌菌毛II岛(INV104B)编码的菌毛的结合调节剂的方法,所述方法包括使肺炎链球菌菌毛易于结合的动物细胞与候选化合物和具有肺炎链球菌菌毛II岛(INV104B)编码菌毛的细菌细胞相接触,和测定所述细菌细胞与所述动物细胞的结合是否受到抑制,其中,所述结合活性的抑制表明是肺炎链球菌菌毛II岛(INV104B)编码菌毛的结合抑制剂。
150.一种鉴定肺炎链球菌菌毛II岛(INV104B)编码的菌毛活性的结合调节剂的方法,所述方法包括:使肺炎链球菌菌毛易于结合的细胞与候选化合物和肺炎链球菌菌毛II岛(INV104B)编码的菌毛相接触;和测定所述菌毛与所述细胞的结合是否受到抑制,其中,所述结合活性的抑制表明是肺炎链球菌菌毛II岛(INV104B)编码菌毛的结合抑制剂。
151.一种鉴定肺炎链球菌菌毛II岛(INV104B)编码菌毛的结合调节剂的方法,所述方法包括:使肺炎链球菌菌毛易于结合的细胞与候选化合物和肺炎链球菌菌毛II岛(INV104B)编码的菌毛蛋白或其细胞结合片段相接触;和测定所述菌毛蛋白或其细胞结合片段与所述细胞的结合是否受到抑制,其中,所述结合活性的抑制表明是肺炎链球菌菌毛II岛(INV104B)编码菌毛的结合抑制剂。
152.如权利要求151所述的方法,其特征在于,所述动物细胞是分离或培养的。
153.一种分离肺炎链球菌菌毛II岛(INV104B)编码菌毛的方法,所述方法包括:
使产生肺炎链球菌菌毛II岛(INV104B)编码菌毛的肺炎链球菌细胞接受超声处理或分解酶消化;
通过密度梯度离心分离非细胞组分;和
分离肺炎链球菌菌毛II岛(INV104B)编码菌毛。
154.如权利要求153所述的方法,其特征在于,所述分解酶是变溶菌素。
155.如权利要求153所述的方法,其特征在于,采用密度梯度离心分离所述非细胞组分。
156.如权利要求153所述的方法,其特征在于,产生肺炎链球菌菌毛II岛(INV104B)编码菌毛的肺炎链球菌细胞是肺炎链球菌TIGR4细胞。
157.如权利要求153所述的方法,其特征在于,所述方法还包括用核酸酶降解核酸。
158.如权利要求153所述的方法,其特征在于,所述方法还包括通过以下方式降低多分散性:采用凝胶过滤色谱,根据大小分离肺炎链球菌菌毛II岛(INV104B)编码菌毛。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US94016707P | 2007-05-25 | 2007-05-25 | |
US60/940,167 | 2007-05-25 | ||
PCT/IB2008/002108 WO2008146164A2 (en) | 2007-05-25 | 2008-05-23 | Streptococcus pneumoniae pilus antigens |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101848929A true CN101848929A (zh) | 2010-09-29 |
Family
ID=40075602
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200880100325A Pending CN101848929A (zh) | 2007-05-25 | 2008-05-23 | 肺炎链球菌菌毛抗原 |
Country Status (10)
Country | Link |
---|---|
US (1) | US20110097360A1 (zh) |
EP (1) | EP2167531B1 (zh) |
CN (1) | CN101848929A (zh) |
AT (1) | ATE550347T1 (zh) |
AU (1) | AU2008256340A1 (zh) |
BR (1) | BRPI0811908A2 (zh) |
CA (1) | CA2688284A1 (zh) |
ES (1) | ES2381613T3 (zh) |
MX (1) | MX2009012777A (zh) |
WO (1) | WO2008146164A2 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102690319A (zh) * | 2011-12-08 | 2012-09-26 | 河南科技大学 | 一种生物被膜状态下猪链球菌总蛋白的提取方法 |
CN107164493A (zh) * | 2017-06-08 | 2017-09-15 | 杭州遂真生物技术有限公司 | 一种gbs核酸检测试剂盒 |
CN116199751A (zh) * | 2023-03-13 | 2023-06-02 | 华中农业大学 | 一种细菌全基因组水平高通量筛选免疫原性抗原蛋白的方法 |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011008548A1 (en) | 2009-06-29 | 2011-01-20 | Genocea Biosciences, Inc. | Vaccines and compositions against streptococcus pneumoniae |
US9290550B2 (en) | 2010-02-23 | 2016-03-22 | Stitchting Katholieke Universiteit | Combination vaccine for Streptococcus |
MY163871A (en) | 2011-01-20 | 2017-10-31 | Genocea Biosciences Inc | Vaccines and compositions against streptococcus pneumoniae |
US9107906B1 (en) | 2014-10-28 | 2015-08-18 | Adma Biologics, Inc. | Compositions and methods for the treatment of immunodeficiency |
GB201509782D0 (en) * | 2015-06-05 | 2015-07-22 | Isis Innovation | Methods and products for fusion protein synthesis |
US10259865B2 (en) | 2017-03-15 | 2019-04-16 | Adma Biologics, Inc. | Anti-pneumococcal hyperimmune globulin for the treatment and prevention of pneumococcal infection |
JP2020527051A (ja) * | 2017-05-31 | 2020-09-03 | ハー マジェスティー ザ クイーン イン ライト オブ カナダ アズ リプレゼンティッド バイ ザ ミニスター オブ アグリカルチャー アンド アグリ−フードHer Majesty The Queen In Right Of Canada As Represented By The Minister Of Agriculture And Agri−Food | 家禽における壊疽性腸炎に対するワクチン |
CN110407909B (zh) * | 2018-11-23 | 2023-06-02 | 西南医科大学 | 一种多肽序列及其自组装材料的制备方法及应用 |
CN111763261B (zh) * | 2019-04-02 | 2022-08-09 | 杭州尚健生物技术有限公司 | 一种融合蛋白及其用途 |
CN111166749A (zh) * | 2020-03-03 | 2020-05-19 | 陕西科技大学 | 血根碱在抑制肺炎链球菌生长中的应用 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE8205892D0 (sv) | 1982-10-18 | 1982-10-18 | Bror Morein | Immunogent membranproteinkomplex, sett for framstellning och anvendning derav som immunstimulerande medel och sasom vaccin |
JPS59128338A (ja) * | 1982-12-08 | 1984-07-24 | Kitasato Inst:The | 歯周炎予防用ワクチン及びその製法 |
US5173418A (en) * | 1985-05-10 | 1992-12-22 | Benzon Pharma, A/S | Production in Escherichia coli of extracellular Serratia spp. hydrolases |
US5057540A (en) | 1987-05-29 | 1991-10-15 | Cambridge Biotech Corporation | Saponin adjuvant |
JPH0832638B2 (ja) | 1989-05-25 | 1996-03-29 | カイロン コーポレイション | サブミクロン油滴乳剤を含んで成るアジュバント製剤 |
AUPM873294A0 (en) | 1994-10-12 | 1994-11-03 | Csl Limited | Saponin preparations and use thereof in iscoms |
UA56132C2 (uk) | 1995-04-25 | 2003-05-15 | Смітклайн Бічем Байолоджікалс С.А. | Композиція вакцини (варіанти), спосіб стабілізації qs21 відносно гідролізу (варіанти), спосіб приготування композиції вакцини |
GB9817052D0 (en) | 1998-08-05 | 1998-09-30 | Smithkline Beecham Biolog | Vaccine |
AU2004230244B2 (en) * | 2003-04-15 | 2011-09-22 | Intercell Ag | S. pneumoniae antigens |
-
2008
- 2008-05-23 EP EP08789058A patent/EP2167531B1/en not_active Not-in-force
- 2008-05-23 MX MX2009012777A patent/MX2009012777A/es not_active Application Discontinuation
- 2008-05-23 CA CA002688284A patent/CA2688284A1/en not_active Abandoned
- 2008-05-23 CN CN200880100325A patent/CN101848929A/zh active Pending
- 2008-05-23 AT AT08789058T patent/ATE550347T1/de active
- 2008-05-23 ES ES08789058T patent/ES2381613T3/es active Active
- 2008-05-23 AU AU2008256340A patent/AU2008256340A1/en not_active Abandoned
- 2008-05-23 WO PCT/IB2008/002108 patent/WO2008146164A2/en active Application Filing
- 2008-05-23 US US12/451,736 patent/US20110097360A1/en not_active Abandoned
- 2008-05-23 BR BRPI0811908-2A2A patent/BRPI0811908A2/pt not_active Application Discontinuation
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102690319A (zh) * | 2011-12-08 | 2012-09-26 | 河南科技大学 | 一种生物被膜状态下猪链球菌总蛋白的提取方法 |
CN102690319B (zh) * | 2011-12-08 | 2015-05-13 | 河南科技大学 | 一种生物被膜状态下猪链球菌总蛋白的提取方法 |
CN107164493A (zh) * | 2017-06-08 | 2017-09-15 | 杭州遂真生物技术有限公司 | 一种gbs核酸检测试剂盒 |
CN116199751A (zh) * | 2023-03-13 | 2023-06-02 | 华中农业大学 | 一种细菌全基因组水平高通量筛选免疫原性抗原蛋白的方法 |
CN116199751B (zh) * | 2023-03-13 | 2024-03-19 | 华中农业大学 | 一种细菌全基因组水平高通量筛选免疫原性抗原蛋白的方法 |
Also Published As
Publication number | Publication date |
---|---|
US20110097360A1 (en) | 2011-04-28 |
BRPI0811908A2 (pt) | 2014-11-18 |
EP2167531B1 (en) | 2012-03-21 |
ES2381613T3 (es) | 2012-05-29 |
WO2008146164A3 (en) | 2009-05-28 |
ATE550347T1 (de) | 2012-04-15 |
MX2009012777A (es) | 2009-12-16 |
EP2167531A2 (en) | 2010-03-31 |
WO2008146164A2 (en) | 2008-12-04 |
CA2688284A1 (en) | 2008-12-04 |
AU2008256340A1 (en) | 2008-12-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101848929A (zh) | 肺炎链球菌菌毛抗原 | |
CN101484464A (zh) | 纯化细菌抗原 | |
CN101977926B (zh) | 链球菌溶血素o的突变形式 | |
CA2583803C (en) | Immunogenic and therapeutic compositions for streptococcus pyogenes | |
CN102481349B (zh) | 抗革兰氏阳性细菌疫苗中的Cna_B结构域 | |
JP2008508320A5 (zh) | ||
CN101969992B (zh) | Gas57突变型抗原和gas57抗体 | |
US20150273042A1 (en) | Pilus proteins and compositions | |
JP2010538634A5 (zh) | ||
CN106177934B (zh) | 一种副猪嗜血杆菌亚单位疫苗及其制备方法 | |
CN107737334A (zh) | 一种预防a型链球菌感染的广谱多联亚单位疫苗 | |
US20100068214A1 (en) | Identification of Candidate Vaccine Antigens from Dichelobacter Nodosus | |
AU2013203022A1 (en) | GAS57 mutant antigens and GAS57 antibodies |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20100929 |