CA3231249A1 - Coronavirus rapid diagnostics - Google Patents
Coronavirus rapid diagnostics Download PDFInfo
- Publication number
- CA3231249A1 CA3231249A1 CA3231249A CA3231249A CA3231249A1 CA 3231249 A1 CA3231249 A1 CA 3231249A1 CA 3231249 A CA3231249 A CA 3231249A CA 3231249 A CA3231249 A CA 3231249A CA 3231249 A1 CA3231249 A1 CA 3231249A1
- Authority
- CA
- Canada
- Prior art keywords
- strand
- probe
- quencher
- fluorophore
- rna
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 241000004176 Alphacoronavirus Species 0.000 title 1
- 239000000523 sample Substances 0.000 claims abstract description 111
- 238000000034 method Methods 0.000 claims abstract description 91
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 44
- 239000003391 RNA probe Substances 0.000 claims abstract description 35
- 108020004518 RNA Probes Proteins 0.000 claims abstract description 23
- 241000711573 Coronaviridae Species 0.000 claims abstract description 22
- 239000000203 mixture Substances 0.000 claims description 57
- 230000003321 amplification Effects 0.000 claims description 54
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 54
- 102000040430 polynucleotide Human genes 0.000 claims description 47
- 108091033319 polynucleotide Proteins 0.000 claims description 47
- 239000002157 polynucleotide Substances 0.000 claims description 47
- 238000007397 LAMP assay Methods 0.000 claims description 46
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 43
- 108090000623 proteins and genes Proteins 0.000 claims description 42
- 239000003153 chemical reaction reagent Substances 0.000 claims description 39
- 102000004169 proteins and genes Human genes 0.000 claims description 39
- 102000039446 nucleic acids Human genes 0.000 claims description 35
- 108020004707 nucleic acids Proteins 0.000 claims description 35
- 238000006243 chemical reaction Methods 0.000 claims description 30
- 239000000243 solution Substances 0.000 claims description 26
- 108020004414 DNA Proteins 0.000 claims description 25
- 108091093088 Amplicon Proteins 0.000 claims description 23
- 230000000694 effects Effects 0.000 claims description 22
- 102000053602 DNA Human genes 0.000 claims description 19
- 238000000605 extraction Methods 0.000 claims description 17
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 16
- 230000027455 binding Effects 0.000 claims description 15
- 238000011901 isothermal amplification Methods 0.000 claims description 15
- 239000003298 DNA probe Substances 0.000 claims description 12
- 239000000654 additive Substances 0.000 claims description 12
- 241000700605 Viruses Species 0.000 claims description 11
- 239000011324 bead Substances 0.000 claims description 11
- 210000004027 cell Anatomy 0.000 claims description 10
- 238000006073 displacement reaction Methods 0.000 claims description 10
- 241001678559 COVID-19 virus Species 0.000 claims description 8
- 102000004190 Enzymes Human genes 0.000 claims description 8
- 108090000790 Enzymes Proteins 0.000 claims description 8
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 8
- 239000002245 particle Substances 0.000 claims description 7
- 238000003752 polymerase chain reaction Methods 0.000 claims description 7
- 229920001184 polypeptide Polymers 0.000 claims description 7
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 7
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 7
- 238000007400 DNA extraction Methods 0.000 claims description 6
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 6
- 238000013518 transcription Methods 0.000 claims description 6
- 230000035897 transcription Effects 0.000 claims description 6
- 101710163270 Nuclease Proteins 0.000 claims description 5
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 4
- DLFVBJFMPXGRIB-UHFFFAOYSA-N Acetamide Chemical compound CC(N)=O DLFVBJFMPXGRIB-UHFFFAOYSA-N 0.000 claims description 4
- 108010067770 Endopeptidase K Proteins 0.000 claims description 4
- 239000004471 Glycine Substances 0.000 claims description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 claims description 4
- -1 b-alanine Chemical compound 0.000 claims description 4
- 238000009472 formulation Methods 0.000 claims description 4
- 229960002885 histidine Drugs 0.000 claims description 4
- XOAAWQZATWQOTB-UHFFFAOYSA-N taurine Chemical compound NCCS(O)(=O)=O XOAAWQZATWQOTB-UHFFFAOYSA-N 0.000 claims description 4
- 241000894006 Bacteria Species 0.000 claims description 3
- 241000233866 Fungi Species 0.000 claims description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims description 3
- 108060004795 Methyltransferase Proteins 0.000 claims description 3
- 239000002202 Polyethylene glycol Substances 0.000 claims description 3
- 102000018120 Recombinases Human genes 0.000 claims description 3
- 108010091086 Recombinases Proteins 0.000 claims description 3
- 239000000872 buffer Substances 0.000 claims description 3
- 230000001419 dependent effect Effects 0.000 claims description 3
- 238000002955 isolation Methods 0.000 claims description 3
- 230000001404 mediated effect Effects 0.000 claims description 3
- 229920001223 polyethylene glycol Polymers 0.000 claims description 3
- 229960002429 proline Drugs 0.000 claims description 3
- 239000011535 reaction buffer Substances 0.000 claims description 3
- 238000005096 rolling process Methods 0.000 claims description 3
- 210000003296 saliva Anatomy 0.000 claims description 3
- 125000006850 spacer group Chemical group 0.000 claims description 3
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 claims description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 claims description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 claims description 2
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 claims description 2
- 241000709661 Enterovirus Species 0.000 claims description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 2
- 241000725303 Human immunodeficiency virus Species 0.000 claims description 2
- 241000701806 Human papillomavirus Species 0.000 claims description 2
- 229930182821 L-proline Natural products 0.000 claims description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 claims description 2
- 241000588652 Neisseria gonorrhoeae Species 0.000 claims description 2
- 108091005804 Peptidases Proteins 0.000 claims description 2
- 241000224016 Plasmodium Species 0.000 claims description 2
- 239000004365 Protease Substances 0.000 claims description 2
- 241000725643 Respiratory syncytial virus Species 0.000 claims description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 2
- 101150010882 S gene Proteins 0.000 claims description 2
- 241000193996 Streptococcus pyogenes Species 0.000 claims description 2
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 claims description 2
- 230000000996 additive effect Effects 0.000 claims description 2
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 claims description 2
- HFHDHCJBZVLPGP-RWMJIURBSA-N alpha-cyclodextrin Chemical compound OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CO)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO HFHDHCJBZVLPGP-RWMJIURBSA-N 0.000 claims description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 claims description 2
- WHGYBXFWUBPSRW-FOUAGVGXSA-N beta-cyclodextrin Chemical compound OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CO)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO WHGYBXFWUBPSRW-FOUAGVGXSA-N 0.000 claims description 2
- 239000004202 carbamide Substances 0.000 claims description 2
- 239000003599 detergent Substances 0.000 claims description 2
- 239000008103 glucose Substances 0.000 claims description 2
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical group [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 claims description 2
- 208000006454 hepatitis Diseases 0.000 claims description 2
- 231100000283 hepatitis Toxicity 0.000 claims description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 2
- 238000011534 incubation Methods 0.000 claims description 2
- 239000003112 inhibitor Substances 0.000 claims description 2
- 239000008101 lactose Substances 0.000 claims description 2
- 244000052769 pathogen Species 0.000 claims description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 claims description 2
- 239000001267 polyvinylpyrrolidone Substances 0.000 claims description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 claims description 2
- 235000019419 proteases Nutrition 0.000 claims description 2
- 229960001153 serine Drugs 0.000 claims description 2
- 239000000600 sorbitol Substances 0.000 claims description 2
- 229960003080 taurine Drugs 0.000 claims description 2
- 241000712461 unidentified influenza virus Species 0.000 claims description 2
- 238000005406 washing Methods 0.000 claims description 2
- 230000007018 DNA scission Effects 0.000 claims 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 claims 2
- 230000007022 RNA scission Effects 0.000 claims 2
- 230000004913 activation Effects 0.000 claims 2
- 238000013519 translation Methods 0.000 claims 2
- 108010077544 Chromatin Proteins 0.000 claims 1
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 claims 1
- 108010033040 Histones Proteins 0.000 claims 1
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 claims 1
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 claims 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 claims 1
- 102000008579 Transposases Human genes 0.000 claims 1
- 108010020764 Transposases Proteins 0.000 claims 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 claims 1
- 239000002253 acid Substances 0.000 claims 1
- 229940024606 amino acid Drugs 0.000 claims 1
- 150000001413 amino acids Chemical class 0.000 claims 1
- 150000001768 cations Chemical class 0.000 claims 1
- 210000003483 chromatin Anatomy 0.000 claims 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 claims 1
- 238000012986 modification Methods 0.000 claims 1
- 230000004048 modification Effects 0.000 claims 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 claims 1
- 238000007634 remodeling Methods 0.000 claims 1
- 108091069025 single-strand RNA Proteins 0.000 claims 1
- 241001529453 unidentified herpesvirus Species 0.000 claims 1
- 238000001514 detection method Methods 0.000 abstract description 46
- 102100034343 Integrase Human genes 0.000 abstract description 25
- 101710203526 Integrase Proteins 0.000 abstract description 24
- 108091033409 CRISPR Proteins 0.000 abstract description 14
- 238000010354 CRISPR gene editing Methods 0.000 abstract description 7
- 239000012636 effector Substances 0.000 abstract description 5
- 239000003795 chemical substances by application Substances 0.000 description 47
- 238000013459 approach Methods 0.000 description 13
- 239000003708 ampul Substances 0.000 description 10
- 230000009089 cytolysis Effects 0.000 description 10
- 102100039819 Actin, alpha cardiac muscle 1 Human genes 0.000 description 9
- 101000959247 Homo sapiens Actin, alpha cardiac muscle 1 Proteins 0.000 description 9
- 108091034117 Oligonucleotide Proteins 0.000 description 9
- VUFNLQXQSDUXKB-DOFZRALJSA-N 2-[4-[4-[bis(2-chloroethyl)amino]phenyl]butanoyloxy]ethyl (5z,8z,11z,14z)-icosa-5,8,11,14-tetraenoate Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(=O)OCCOC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 VUFNLQXQSDUXKB-DOFZRALJSA-N 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 102100029812 Protein S100-A12 Human genes 0.000 description 7
- 101710110949 Protein S100-A12 Proteins 0.000 description 7
- 102000006382 Ribonucleases Human genes 0.000 description 7
- 108010083644 Ribonucleases Proteins 0.000 description 7
- 101150075675 tatC gene Proteins 0.000 description 7
- ZDSRFXVZVHSYMA-CMOCDZPBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-carboxybutanoyl]amino]pentanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 ZDSRFXVZVHSYMA-CMOCDZPBSA-N 0.000 description 6
- 108010052418 (N-(2-((4-((2-((4-(9-acridinylamino)phenyl)amino)-2-oxoethyl)amino)-4-oxobutyl)amino)-1-(1H-imidazol-4-ylmethyl)-1-oxoethyl)-6-(((-2-aminoethyl)amino)methyl)-2-pyridinecarboxamidato) iron(1+) Proteins 0.000 description 6
- YRIZYWQGELRKNT-UHFFFAOYSA-N 1,3,5-trichloro-1,3,5-triazinane-2,4,6-trione Chemical compound ClN1C(=O)N(Cl)C(=O)N(Cl)C1=O YRIZYWQGELRKNT-UHFFFAOYSA-N 0.000 description 6
- 241000023308 Acca Species 0.000 description 6
- 241001123946 Gaga Species 0.000 description 6
- FSNCEEGOMTYXKY-JTQLQIEISA-N Lycoperodine 1 Natural products N1C2=CC=CC=C2C2=C1CN[C@H](C(=O)O)C2 FSNCEEGOMTYXKY-JTQLQIEISA-N 0.000 description 6
- 201000008754 Tenosynovial giant cell tumor Diseases 0.000 description 6
- YRKCREAYFQTBPV-UHFFFAOYSA-N acetylacetone Chemical compound CC(=O)CC(C)=O YRKCREAYFQTBPV-UHFFFAOYSA-N 0.000 description 6
- 238000013461 design Methods 0.000 description 6
- 208000035647 diffuse type tenosynovial giant cell tumor Diseases 0.000 description 6
- 230000035945 sensitivity Effects 0.000 description 6
- 208000002918 testicular germ cell tumor Diseases 0.000 description 6
- 108010068794 tyrosyl-tyrosyl-glutamyl-glutamic acid Proteins 0.000 description 6
- 108020005004 Guide RNA Proteins 0.000 description 5
- PKFBJSDMCRJYDC-GEZSXCAASA-N N-acetyl-s-geranylgeranyl-l-cysteine Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CSC[C@@H](C(O)=O)NC(C)=O PKFBJSDMCRJYDC-GEZSXCAASA-N 0.000 description 5
- 210000001124 body fluid Anatomy 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- CJWXCNXHAIFFMH-AVZHFPDBSA-N n-[(2s,3r,4s,5s,6r)-2-[(2r,3r,4s,5r)-2-acetamido-4,5,6-trihydroxy-1-oxohexan-3-yl]oxy-3,5-dihydroxy-6-methyloxan-4-yl]acetamide Chemical compound C[C@H]1O[C@@H](O[C@@H]([C@@H](O)[C@H](O)CO)[C@@H](NC(C)=O)C=O)[C@H](O)[C@@H](NC(C)=O)[C@@H]1O CJWXCNXHAIFFMH-AVZHFPDBSA-N 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 230000007306 turnover Effects 0.000 description 5
- 108010032276 tyrosyl-glutamyl-tyrosyl-glutamic acid Proteins 0.000 description 5
- BAAVRTJSLCSMNM-CMOCDZPBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-4-carboxybutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]pentanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 BAAVRTJSLCSMNM-CMOCDZPBSA-N 0.000 description 4
- JEOQACOXAOEPLX-WCCKRBBISA-N (2s)-2-amino-5-(diaminomethylideneamino)pentanoic acid;1,3-thiazolidine-4-carboxylic acid Chemical compound OC(=O)C1CSCN1.OC(=O)[C@@H](N)CCCN=C(N)N JEOQACOXAOEPLX-WCCKRBBISA-N 0.000 description 4
- SVDVJBWDBYSQLO-UHFFFAOYSA-N 5-(4-hydroxy-3-methoxyphenyl)-5-phenylimidazolidine-2,4-dione Chemical compound C1=C(O)C(OC)=CC(C2(C(NC(=O)N2)=O)C=2C=CC=CC=2)=C1 SVDVJBWDBYSQLO-UHFFFAOYSA-N 0.000 description 4
- 102100040004 Gamma-glutamylcyclotransferase Human genes 0.000 description 4
- 102100022662 Guanylyl cyclase C Human genes 0.000 description 4
- 101710198293 Guanylyl cyclase C Proteins 0.000 description 4
- 101000886680 Homo sapiens Gamma-glutamylcyclotransferase Proteins 0.000 description 4
- 101000937642 Homo sapiens Malonyl-CoA-acyl carrier protein transacylase, mitochondrial Proteins 0.000 description 4
- 102100027329 Malonyl-CoA-acyl carrier protein transacylase, mitochondrial Human genes 0.000 description 4
- 102100039217 3-ketoacyl-CoA thiolase, peroxisomal Human genes 0.000 description 3
- 241001063273 Alicyclobacillus acidiphilus Species 0.000 description 3
- 102100022524 Alpha-1-antichymotrypsin Human genes 0.000 description 3
- 102100039339 Atrial natriuretic peptide receptor 1 Human genes 0.000 description 3
- 101710102163 Atrial natriuretic peptide receptor 1 Proteins 0.000 description 3
- 241000219357 Cactaceae Species 0.000 description 3
- 108091006146 Channels Proteins 0.000 description 3
- 101100153048 Homo sapiens ACAA1 gene Proteins 0.000 description 3
- 101000678026 Homo sapiens Alpha-1-antichymotrypsin Proteins 0.000 description 3
- 101000829958 Homo sapiens N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Proteins 0.000 description 3
- 101001128634 Homo sapiens NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, mitochondrial Proteins 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 102100023315 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Human genes 0.000 description 3
- 102100032194 NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, mitochondrial Human genes 0.000 description 3
- 239000011230 binding agent Substances 0.000 description 3
- 239000012472 biological sample Substances 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000011068 loading method Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- BCOSEZGCLGPUSL-UHFFFAOYSA-N 2,3,3-trichloroprop-2-enoyl chloride Chemical compound ClC(Cl)=C(Cl)C(Cl)=O BCOSEZGCLGPUSL-UHFFFAOYSA-N 0.000 description 2
- JEPVUMTVFPQKQE-AAKCMJRZSA-N 2-[(1s,2s,3r,4s)-1,2,3,4,5-pentahydroxypentyl]-1,3-thiazolidine-4-carboxylic acid Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C1NC(C(O)=O)CS1 JEPVUMTVFPQKQE-AAKCMJRZSA-N 0.000 description 2
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 2
- 241000458359 Brevibacillus sp. Species 0.000 description 2
- 102100025279 C-X-C motif chemokine 11 Human genes 0.000 description 2
- 102100036263 Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Human genes 0.000 description 2
- 102100035081 Homeobox protein TGIF1 Human genes 0.000 description 2
- 101100222381 Homo sapiens CXCL11 gene Proteins 0.000 description 2
- 101001001786 Homo sapiens Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Proteins 0.000 description 2
- 101000596925 Homo sapiens Homeobox protein TGIF1 Proteins 0.000 description 2
- 101000856513 Homo sapiens Inactive N-acetyllactosaminide alpha-1,3-galactosyltransferase Proteins 0.000 description 2
- 101001033233 Homo sapiens Interleukin-10 Proteins 0.000 description 2
- 101000804764 Homo sapiens Lymphotactin Proteins 0.000 description 2
- 101000869690 Homo sapiens Protein S100-A8 Proteins 0.000 description 2
- 102100025509 Inactive N-acetyllactosaminide alpha-1,3-galactosyltransferase Human genes 0.000 description 2
- 102100035304 Lymphotactin Human genes 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 102100032442 Protein S100-A8 Human genes 0.000 description 2
- 101001009851 Rattus norvegicus Guanylate cyclase 2G Proteins 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical group O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 102100036049 T-complex protein 1 subunit gamma Human genes 0.000 description 2
- WCDYMMVGBZNUGB-ORPFKJIMSA-N [(2r,3r,4s,5r,6r)-6-[[(1r,3r,4r,5r,6r)-4,5-dihydroxy-2,7-dioxabicyclo[4.2.0]octan-3-yl]oxy]-3,4,5-trihydroxyoxan-2-yl]methyl 3-hydroxy-2-tetradecyloctadecanoate Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](COC(=O)C(CCCCCCCCCCCCCC)C(O)CCCCCCCCCCCCCCC)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H]2OC[C@H]2O1 WCDYMMVGBZNUGB-ORPFKJIMSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 238000003149 assay kit Methods 0.000 description 2
- 101150062912 cct3 gene Proteins 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 210000002939 cerumen Anatomy 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 239000010445 mica Substances 0.000 description 2
- 229910052618 mica group Inorganic materials 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000005580 one pot reaction Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- YNJBWRMUSHSURL-UHFFFAOYSA-N trichloroacetic acid Chemical compound OC(=O)C(Cl)(Cl)Cl YNJBWRMUSHSURL-UHFFFAOYSA-N 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- YZOUYRAONFXZSI-SBHWVFSVSA-N (1S,3R,5R,6R,8R,10R,11R,13R,15R,16R,18R,20R,21R,23R,25R,26R,28R,30R,31S,33R,35R,36R,37S,38R,39S,40R,41S,42R,43S,44R,45S,46R,47S,48R,49S)-5,10,15,20,25,30,35-heptakis(hydroxymethyl)-37,39,40,41,42,43,44,45,46,47,48,49-dodecamethoxy-2,4,7,9,12,14,17,19,22,24,27,29,32,34-tetradecaoxaoctacyclo[31.2.2.23,6.28,11.213,16.218,21.223,26.228,31]nonatetracontane-36,38-diol Chemical compound O([C@@H]([C@H]([C@@H]1OC)OC)O[C@H]2[C@@H](O)[C@@H]([C@@H](O[C@@H]3[C@@H](CO)O[C@@H]([C@H]([C@@H]3O)OC)O[C@@H]3[C@@H](CO)O[C@@H]([C@H]([C@@H]3OC)OC)O[C@@H]3[C@@H](CO)O[C@@H]([C@H]([C@@H]3OC)OC)O[C@@H]3[C@@H](CO)O[C@@H]([C@H]([C@@H]3OC)OC)O3)O[C@@H]2CO)OC)[C@H](CO)[C@H]1O[C@@H]1[C@@H](OC)[C@H](OC)[C@H]3[C@@H](CO)O1 YZOUYRAONFXZSI-SBHWVFSVSA-N 0.000 description 1
- KZFMOINJHMONLW-FOCLMDBBSA-N (2e)-4,7-dichloro-2-(4,7-dichloro-3-oxo-1-benzothiophen-2-ylidene)-1-benzothiophen-3-one Chemical compound S\1C(C(=CC=C2Cl)Cl)=C2C(=O)C/1=C1/C(=O)C(C(Cl)=CC=C2Cl)=C2S1 KZFMOINJHMONLW-FOCLMDBBSA-N 0.000 description 1
- BZSALXKCVOJCJJ-IPEMHBBOSA-N (4s)-4-[[(2s)-2-acetamido-3-methylbutanoyl]amino]-5-[[(2s)-1-[[(2s)-1-[[(2s,3r)-1-[[(2s)-1-[[(2s)-1-[[2-[[(2s)-1-amino-1-oxo-3-phenylpropan-2-yl]amino]-2-oxoethyl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-hydroxy Chemical compound CC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCC)C(=O)N[C@@H](CCCC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@H](C(N)=O)CC1=CC=CC=C1 BZSALXKCVOJCJJ-IPEMHBBOSA-N 0.000 description 1
- XIYOPDCBBDCGOE-IWVLMIASSA-N (4s,4ar,5s,5ar,12ar)-4-(dimethylamino)-1,5,10,11,12a-pentahydroxy-6-methylidene-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide Chemical compound C=C1C2=CC=CC(O)=C2C(O)=C2[C@@H]1[C@H](O)[C@H]1[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]1(O)C2=O XIYOPDCBBDCGOE-IWVLMIASSA-N 0.000 description 1
- FMKJUUQOYOHLTF-OWOJBTEDSA-N (e)-4-azaniumylbut-2-enoate Chemical compound NC\C=C\C(O)=O FMKJUUQOYOHLTF-OWOJBTEDSA-N 0.000 description 1
- HTFVKMHFUBCIMH-UHFFFAOYSA-N 1,3,5-triiodo-1,3,5-triazinane-2,4,6-trione Chemical compound IN1C(=O)N(I)C(=O)N(I)C1=O HTFVKMHFUBCIMH-UHFFFAOYSA-N 0.000 description 1
- JTTIOYHBNXDJOD-UHFFFAOYSA-N 2,4,6-triaminopyrimidine Chemical compound NC1=CC(N)=NC(N)=N1 JTTIOYHBNXDJOD-UHFFFAOYSA-N 0.000 description 1
- IPYJWACEFWLBMV-UHFFFAOYSA-N 2-(1h-indol-3-ylmethyl)-1,3-thiazolidine-4-carboxylic acid Chemical compound N1C(C(=O)O)CSC1CC1=CNC2=CC=CC=C12 IPYJWACEFWLBMV-UHFFFAOYSA-N 0.000 description 1
- PIGCSKVALLVWKU-UHFFFAOYSA-N 2-Aminoacridone Chemical compound C1=CC=C2C(=O)C3=CC(N)=CC=C3NC2=C1 PIGCSKVALLVWKU-UHFFFAOYSA-N 0.000 description 1
- LVXLCZPTUBQNHH-UHFFFAOYSA-N 2-amino-5-[[1-(carboxymethylamino)-3-(2-chloro-1,1,2-trifluoroethyl)sulfanyl-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound OC(=O)C(N)CCC(=O)NC(CSC(F)(F)C(F)Cl)C(=O)NCC(O)=O LVXLCZPTUBQNHH-UHFFFAOYSA-N 0.000 description 1
- GHCVXTFBVDVFGE-UHFFFAOYSA-N 4-amino-6-chloro-1,3,5-triazin-2-ol Chemical compound NC1=NC(O)=NC(Cl)=N1 GHCVXTFBVDVFGE-UHFFFAOYSA-N 0.000 description 1
- FVFVNNKYKYZTJU-UHFFFAOYSA-N 6-chloro-1,3,5-triazine-2,4-diamine Chemical compound NC1=NC(N)=NC(Cl)=N1 FVFVNNKYKYZTJU-UHFFFAOYSA-N 0.000 description 1
- 101150026261 ACT7 gene Proteins 0.000 description 1
- OPVPGKGADVGKTG-BQBZGAKWSA-N Ac-Asp-Glu Chemical compound CC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OPVPGKGADVGKTG-BQBZGAKWSA-N 0.000 description 1
- 241000193412 Alicyclobacillus acidoterrestris Species 0.000 description 1
- 101100393868 Arabidopsis thaliana GT11 gene Proteins 0.000 description 1
- 101000651036 Arabidopsis thaliana Galactolipid galactosyltransferase SFR2, chloroplastic Proteins 0.000 description 1
- 102000008682 Argonaute Proteins Human genes 0.000 description 1
- 108010088141 Argonaute Proteins Proteins 0.000 description 1
- 241000316816 Attagis Species 0.000 description 1
- 101100164168 Bombyx mori ATFC gene Proteins 0.000 description 1
- 108010014064 CCCTC-Binding Factor Proteins 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 101100536577 Caenorhabditis elegans cct-4 gene Proteins 0.000 description 1
- 101100439297 Caenorhabditis elegans cgt-1 gene Proteins 0.000 description 1
- 101100069049 Caenorhabditis elegans goa-1 gene Proteins 0.000 description 1
- 101100480622 Caenorhabditis elegans tat-5 gene Proteins 0.000 description 1
- 101100371648 Caenorhabditis elegans usp-14 gene Proteins 0.000 description 1
- 102100025570 Cancer/testis antigen 1 Human genes 0.000 description 1
- 241000321538 Candidia Species 0.000 description 1
- 206010050337 Cerumen impaction Diseases 0.000 description 1
- 101150011616 Ctcf gene Proteins 0.000 description 1
- FCKYPQBAHLOOJQ-UHFFFAOYSA-N Cyclohexane-1,2-diaminetetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)C1CCCCC1N(CC(O)=O)CC(O)=O FCKYPQBAHLOOJQ-UHFFFAOYSA-N 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 101000760663 Hololena curta Mu-agatoxin-Hc1a Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000856237 Homo sapiens Cancer/testis antigen 1 Proteins 0.000 description 1
- 101001017512 Homo sapiens Minor histocompatibility protein HMSD variant form Proteins 0.000 description 1
- 101000957437 Homo sapiens Mitochondrial carnitine/acylcarnitine carrier protein Proteins 0.000 description 1
- 101000724418 Homo sapiens Neutral amino acid transporter B(0) Proteins 0.000 description 1
- 101000799554 Homo sapiens Protein AATF Proteins 0.000 description 1
- 101001017510 Homo sapiens Serpin-like protein HMSD Proteins 0.000 description 1
- 101000654381 Homo sapiens Sodium channel protein type 8 subunit alpha Proteins 0.000 description 1
- 101000637792 Homo sapiens Solute carrier family 35 member G5 Proteins 0.000 description 1
- 101000847024 Homo sapiens Tetratricopeptide repeat protein 1 Proteins 0.000 description 1
- 108010007666 IMP cyclohydrolase Proteins 0.000 description 1
- 206010062717 Increased upper airway secretion Diseases 0.000 description 1
- 102100020796 Inosine 5'-monophosphate cyclohydrolase Human genes 0.000 description 1
- 235000000421 Lepidium meyenii Nutrition 0.000 description 1
- 241000029590 Leptotrichia wadei Species 0.000 description 1
- 102100038738 Mitochondrial carnitine/acylcarnitine carrier protein Human genes 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101100494762 Mus musculus Nedd9 gene Proteins 0.000 description 1
- 108700010674 N-acetylVal-Nle(7,8)- allatotropin (5-13) Proteins 0.000 description 1
- 101100449516 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) grg-1 gene Proteins 0.000 description 1
- 102100028267 Neutral amino acid transporter B(0) Human genes 0.000 description 1
- 101100481019 Nicotiana tabacum TGA1A gene Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 208000005228 Pericardial Effusion Diseases 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 102100034180 Protein AATF Human genes 0.000 description 1
- 241000219061 Rheum Species 0.000 description 1
- 108700008242 S-(2-chloro-1,1,2-trifluoroethyl)glutathione Proteins 0.000 description 1
- 101150050559 SOAT1 gene Proteins 0.000 description 1
- 102100033983 Serpin-like protein HMSD Human genes 0.000 description 1
- 102100031371 Sodium channel protein type 8 subunit alpha Human genes 0.000 description 1
- 102100032019 Solute carrier family 35 member G5 Human genes 0.000 description 1
- 102100021993 Sterol O-acyltransferase 1 Human genes 0.000 description 1
- 101150092207 TGA1 gene Proteins 0.000 description 1
- 108700023707 TUG1 Proteins 0.000 description 1
- 102100032841 Tetratricopeptide repeat protein 1 Human genes 0.000 description 1
- 102100027671 Transcriptional repressor CTCF Human genes 0.000 description 1
- 101000609457 Trichosanthes kirilowii Trypsin inhibitor 1 Proteins 0.000 description 1
- 101000870345 Vasconcellea cundinamarcensis Cysteine proteinase 1 Proteins 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 210000004381 amniotic fluid Anatomy 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 210000003567 ascitic fluid Anatomy 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 210000003756 cervix mucus Anatomy 0.000 description 1
- WOWHHFRSBJGXCM-UHFFFAOYSA-M cetyltrimethylammonium chloride Chemical compound [Cl-].CCCCCCCCCCCCCCCC[N+](C)(C)C WOWHHFRSBJGXCM-UHFFFAOYSA-M 0.000 description 1
- 210000001268 chyle Anatomy 0.000 description 1
- 210000004913 chyme Anatomy 0.000 description 1
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- DFWFIQKMSFGDCQ-UHFFFAOYSA-N deethylatrazine Chemical compound CC(C)NC1=NC(N)=NC(Cl)=N1 DFWFIQKMSFGDCQ-UHFFFAOYSA-N 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 210000003060 endolymph Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 239000011536 extraction buffer Substances 0.000 description 1
- 210000000416 exudates and transudate Anatomy 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 238000002073 fluorescence micrograph Methods 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- ZZUFCTLCJUWOSV-UHFFFAOYSA-N furosemide Chemical compound C1=C(Cl)C(S(=O)(=O)N)=CC(C(O)=O)=C1NCC1=CC=CO1 ZZUFCTLCJUWOSV-UHFFFAOYSA-N 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 210000004211 gastric acid Anatomy 0.000 description 1
- 210000004051 gastric juice Anatomy 0.000 description 1
- 229960002449 glycine Drugs 0.000 description 1
- 210000004251 human milk Anatomy 0.000 description 1
- 235000020256 human milk Nutrition 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000009830 intercalation Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 108010076560 isospaglumic acid Proteins 0.000 description 1
- 235000012902 lepidium meyenii Nutrition 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 210000004912 pericardial fluid Anatomy 0.000 description 1
- 210000004049 perilymph Anatomy 0.000 description 1
- 208000026435 phlegm Diseases 0.000 description 1
- 210000004910 pleural fluid Anatomy 0.000 description 1
- 229920001451 polypropylene glycol Polymers 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 239000013615 primer Substances 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 238000011092 protein amplification Methods 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 210000004915 pus Anatomy 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 210000002374 sebum Anatomy 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000011896 sensitive detection Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 210000004243 sweat Anatomy 0.000 description 1
- 210000001179 synovial fluid Anatomy 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- NBFQPSCRGQGZEP-YBKRDZSWSA-N tat 14 Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(N)=N)NC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)[C@@H](C)O)CC1=CC=CC=C1 NBFQPSCRGQGZEP-YBKRDZSWSA-N 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 101150090882 tgt-1 gene Proteins 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 210000004916 vomit Anatomy 0.000 description 1
- 230000008673 vomiting Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/70—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Immunology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Virology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Systems and methods for rapid diagnostics related to the use of combinations of CRISPR effector systems with optimized guide sequences, OSD probes, RNA probes and/or RNase H for detection of nucleic acid sequences, such as sequences from coronavirus, as well as multiplex lateral flow diagnostic devices and methods of use, are provided.
Description
DEMANDE OU BREVET VOLUMINEUX
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVET COMPREND
PLUS D'UN TOME.
NOTE : Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets JUMBO APPLICATIONS/PATENTS
THIS SECTION OF THE APPLICATION/PATENT CONTAINS MORE THAN ONE
VOLUME
NOTE: For additional volumes, please contact the Canadian Patent Office NOM DU FICHIER / FILE NAME:
NOTE POUR LE TOME / VOLUME NOTE:
CORONA VIRUS RAPID DIAGNOSTICS
TECHNICAL FIELD
[0001] The subject matter disclosed herein is generally directed to rapid single-reaction coronavirus diagnostics including the use of combinations of (1) nucleic acid amplification; (2) TnpB effector systems;
(3) oligonucleotide strand displacement (OSD); and/or fluorescently labeled RNA probes with RNase to release fluorescence. TnpB effector systems are described in U.S. Application Serial No. 16/326,132, filed August 16,2017, published as US 2021/0166783 on June 3,2021, which application is a U.S. national stage application under 35 U.S.C. 371 of PCT/US2017/047193, filed August 16,2017 and published as WO
2018/035250 on February 22, 2018, in Malcarova etal. (2020) Nature Reviews 18:
67-83 (and supplementary materials at https://doi.org/10.1038/s41579-019-0299-x), and H. Altae-Tran etal., Science 10.1126/
science.abj6856 (2021), all of which are incorporated herein by reference in their entireties for all purposes.
BACKGROUND
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVET COMPREND
PLUS D'UN TOME.
NOTE : Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets JUMBO APPLICATIONS/PATENTS
THIS SECTION OF THE APPLICATION/PATENT CONTAINS MORE THAN ONE
VOLUME
NOTE: For additional volumes, please contact the Canadian Patent Office NOM DU FICHIER / FILE NAME:
NOTE POUR LE TOME / VOLUME NOTE:
CORONA VIRUS RAPID DIAGNOSTICS
TECHNICAL FIELD
[0001] The subject matter disclosed herein is generally directed to rapid single-reaction coronavirus diagnostics including the use of combinations of (1) nucleic acid amplification; (2) TnpB effector systems;
(3) oligonucleotide strand displacement (OSD); and/or fluorescently labeled RNA probes with RNase to release fluorescence. TnpB effector systems are described in U.S. Application Serial No. 16/326,132, filed August 16,2017, published as US 2021/0166783 on June 3,2021, which application is a U.S. national stage application under 35 U.S.C. 371 of PCT/US2017/047193, filed August 16,2017 and published as WO
2018/035250 on February 22, 2018, in Malcarova etal. (2020) Nature Reviews 18:
67-83 (and supplementary materials at https://doi.org/10.1038/s41579-019-0299-x), and H. Altae-Tran etal., Science 10.1126/
science.abj6856 (2021), all of which are incorporated herein by reference in their entireties for all purposes.
BACKGROUND
[0002] Nucleic acids are a universal signature of biological information. The ability torapidly detect nucleic acids with high sensitivity and single-base specificity on a portable platform has the potential to revolutionize diagnosis and monitoring for many diseases, provide valuable epidemiological information, and serve as a generalizable scientific tool. Although many methods have been developed for detecting nucleic acids (Du et al., 2017; Green et al., 2014; Kumar etal., 2014; Pardee et al., 2014; Pardee et al., 2016; Urdea et al., 2006), they inevitably suffer from trade- offs among sensitivity, specificity, simplicity, and speed..
[0003] Sensitive and rapid detection of nucleic acids is important for clinical diagnostics and biotechnological applications. Particularly when responding to outbreaks, such as the novel coronavirus, which has been referred to as 2019-nCoV and SARS-CoV-2, which causes COVID 2019, time is of the essence. Sabeti, Early Detection Is Key to Combating the Spread of Coronavirus, Time (February 6, 2020). The 2019-nCoV has killed over a million people, including well over 200,000 people in the U.S. alone, and response to the escalating outbreak, particularly where there are indications that both symptomatic and asymptomatic patients with 2019-nCov may transmit the disease. Wang, et al., A precision medicine approach to managing Wuhan Coronavirus pneumonia, Prec. Clin. Med. doi:10.1093/pcmedi/pbaa002. Early coronavirus testing kits sent to states and other countries did not work properly, according to the U.S. Centers for Disease Control and Prevention. Grady, "Coronavirus Test Kits Sent to States, 30 Countries Are Flawed, C.D.C. Says,"
New York Times, February 12, 2020. Moreover, early tests provide results in four hours from initial sample processing to results.
cdc.govhnedia/releases/2020/p0206-coronavirus-diagnostic-test-kits.
Highly accurate test results at better processing speds, particularly that are field-depoloyable would aid in addressing the outbreak. Currently, the novel coronavirus SARS-CoV-2 has resulted in an international public health emergency, spreading to over 180 countries and infecting millions of individuals. Testing for the presence of the virus is of utmost importance to both reduce the basic reproductive rate of the virus (RO) and inform best clinical practices for affected patients. However, understanding the full extent of the virus outbreak has remained challenging due to bottlenecks in the diagnosis of infection, spurred by requirements for low-supply reagents and kits, complex instrumentation, and difficult logistics with central lab facilities. A
platform that is simpler with common reagents and can be done in any setting would solve many of the world's testing issues.
New York Times, February 12, 2020. Moreover, early tests provide results in four hours from initial sample processing to results.
cdc.govhnedia/releases/2020/p0206-coronavirus-diagnostic-test-kits.
Highly accurate test results at better processing speds, particularly that are field-depoloyable would aid in addressing the outbreak. Currently, the novel coronavirus SARS-CoV-2 has resulted in an international public health emergency, spreading to over 180 countries and infecting millions of individuals. Testing for the presence of the virus is of utmost importance to both reduce the basic reproductive rate of the virus (RO) and inform best clinical practices for affected patients. However, understanding the full extent of the virus outbreak has remained challenging due to bottlenecks in the diagnosis of infection, spurred by requirements for low-supply reagents and kits, complex instrumentation, and difficult logistics with central lab facilities. A
platform that is simpler with common reagents and can be done in any setting would solve many of the world's testing issues.
[0004] Previously, Applicants developed a platform for nucleic acid detection using CRISPR
enzymes called SHERLOCK (specific High Sensitivity Enzymatic Reporter unLOCKing)(Gootenberg, 2018;Gootenberg, 2017 ), which combines pre-amplification with the RNA-guided RNase CRISPR-Cas13(Abudayyeh, 2016; East-Seletsky, 2016; Shmakov, 2015;Smargon, 201; Shmakov, 2017) and DNase CRISPR-Cas12(Zetsche, 2015 599;
Chen, 2018) for sensing of nucleic acids via fluorescence or portable lateral flow. In addition, Applicants have also developed a platform termed "STOPCovid". (N Engi J Med 2020; 383:1492-1494).
enzymes called SHERLOCK (specific High Sensitivity Enzymatic Reporter unLOCKing)(Gootenberg, 2018;Gootenberg, 2017 ), which combines pre-amplification with the RNA-guided RNase CRISPR-Cas13(Abudayyeh, 2016; East-Seletsky, 2016; Shmakov, 2015;Smargon, 201; Shmakov, 2017) and DNase CRISPR-Cas12(Zetsche, 2015 599;
Chen, 2018) for sensing of nucleic acids via fluorescence or portable lateral flow. In addition, Applicants have also developed a platform termed "STOPCovid". (N Engi J Med 2020; 383:1492-1494).
[0005] ). These and related technologies have been described in the following applications, which are hereby incorporated by reference in their entireties: U.S. Provisional Application Serial Nos.
62/818,702, filed March 14, 2019, 62/890,555, filed August 22, 2019, 62/970,125, filed February 12, 2020,62/993,494, filed March 23, 2020, 63/018,487, filed April 30, 2020, 63/019,406, filed May 3, 2020,63/032,470, filed May 29,2020, PCT/US2020/022795, filed March 13,2020, U.S. Application Serial Nos. 16/894,664, 16/894,678, and 16/894,670, all filed filed June 5, 2020.
SUMMARY
62/818,702, filed March 14, 2019, 62/890,555, filed August 22, 2019, 62/970,125, filed February 12, 2020,62/993,494, filed March 23, 2020, 63/018,487, filed April 30, 2020, 63/019,406, filed May 3, 2020,63/032,470, filed May 29,2020, PCT/US2020/022795, filed March 13,2020, U.S. Application Serial Nos. 16/894,664, 16/894,678, and 16/894,670, all filed filed June 5, 2020.
SUMMARY
[0006] In certain example embodiments, compositions for detecting the presence of a target polynucleotide in a sample, comprising isothermal amplification reagents for amplifying the target polynucleotide, and an extraction-free solution for isolating polynuckotides from a cell or virus particle. In one embodiment, the amplification is isothermal. The isothermal amplification reagents may comprise loop-mediated isothermal amplification (LAMP) reagents comprising F3, B3, FIP, BIP, Loop Forward and Loop Reverse primers. In an aspect, the LAMP reagents may further comprise "oligonucleotide strand displacement" or "one step toehold displacement" (OSD) probes. LAMP
amplification is described in U.S. Patent No. 7,175,985, which is incorporated herein by reference in its entirety, as well as in numerous publications (Notomi et al., Nucleic Acids Research 2000, 28;
Nagamine et al., Biochemical and Biophysical Research Communications 2002, 290, (4), 1195-1198;
Tomita et al., Nature Protocols 2008,3, (5), 877-882; Nagamine et al., Molecular and Cellular Probes 2002, 16, (3), 223-229; Mair et al., Bmc Veterinary Research 2013, 9; Tanner et al., Biotechniques 2012, 53, (2), 81-Suwancharoen et al., Journal of Veterinary Medical Science 2016, 78, (8), 1299-1302; Wang et al., Molecules 2016, 21. (1); Abdulmawjood et al.. Molecular and Cellular Probes 2016,30, (4), 205-210; Song 2016 Analytical Chemistry 2016, 88, (14), 7289-7294; Kong Scientific Reports 2016,6 and Mang et al., Scientific Reports 2016,6) all of which are incorporated herein by reference in their entireties.
100071 In one embodiment, the system includes a CR1SPR-Cas system that generates R-loops on a target nucleic acid. Such a system for the detection of coronavirus is provided. A system for detecting the presence of a coronavirus in a sample, comprising: a Cas protein; at least one guide polynucleotides comprising a guide sequence capable of binding a coronoavirus target sequence and designed to form a complex with the Cas protein; and a detection construct comprising a polynucleotide component. The detection agent may be an OSD probe, which upon binding a single stranded region in a target nucleic acid (generated, e.g., using the LAMP
and/or the aforementioned CRISPR-Cas system). In some embodiments, more than one guide polynucleotide may be used to increase signal and sensitivity. In another embodiment, the detection agent may be a labeled RNA
probe, which, upon binding the single stranded region, can be digested with RNase to release a detectable label. In embodiments, this label is a fluorophore. In embodiments, the coronavirus is 2019-nCov. In an aspect, the at least one guide polynucleotide is a highly active guide polynucleotide.
The guide polynucleotide of the system can, an an embodiment, bind to a coronavirus sequence encoding a polypeptide that is immunostimulatory to a host immune system, and/or binds to at least one target sequence that is a unique coronavirus genomic sequence.
[0008] The systems and methods may utilize one or more Cas proteins. In embodiments, the Cas proteins are a Type V or Type VI Cas protein, and may be Cas 12 proteins, Cas13 proteins, or a combination thereof. In an aspect, the one or more Cas proteins comprise a Cas13, which may be a thermostable Cas13 or Leptotrichiawadei Cas13. In an aspect, the one or more Cas proteins comprise a Cas12, which may be a thermostable Cas12b or Alicyclobacilluys acidiphilus Cas12b. The systems may further comprise amplification reagents for amplification of the coronavirus target sequence. In an aspect the amplification reagents are reagents for LAMP, polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplification reaction (NEAR), transcription mediated amplification (TMA), recombinase polyrnerase amplification (RPA) or rolling circle amplification (RCA).
[0009] A lateral flow device comprising a substrate comprising a first end and a second end, are also provided, the first end comprising a sample loading portion, a first region comprising a detectable ligand, two or more systems of the claims provided herein, and one or more first capture regions, each comprising a first binding agent; the substrate comprising two or more second capture regions between the first region of the first end and the second end, each second capture region comprising a different binding agent. In an aspect, the first end comprises two detection constructs, wherein each of the two detection constructs comprises an RNA or DNA
oligonucleotide, comprising a first molecule on a first end and a second molecule on a second end. In an aspect, the first end comprises three detection constructs, wherein each of the three detection constructs comprises an RNA or DNA oligonucleotide, comprising a first molecule on a first end and a second molecule on a second end. The lateral flow device may comprise a polynucleotide encoding a Cas13 and/or Cas12 and the one or more guide RNAs are provided as a multiplexing polynucleotide, the multiplexing polynucleotide configured to comprise two or more guide sequences. The lateral flow device may also comprise of OSD probes designed to separate binding agents upon detection of sequences.
[00010] Methods for detecting a target nucleic acid in a sample are also provided, comprising contacting a sample with the first end of the lateral flow device comprising the sample loading portion, wherein the sample flows from the sample loading portion of the substrate towards the first and second capture regions and generates a detectable signal. Methods may utilize a lateral flow device capable of detecting two different target nucleic acid sequences. In an aspect, the target nucleic acid sequences are absent from the sample, a detectable signal is generated at each capture region, the detectable signal appears at the first and second capture regions.
The lateral flow device can be designed such that when the target nucleic acid sequences are present in the sample, a detectable signal is generated at each capture region, and wherein when the sample contains one or more target nucleic acid sequences, a detectable signal is present at the capture region for the corresponding target nucleic acid sequence.
[00011] Methods for detection may also comprise detecting coronavirus in a sample by contacting the sample with the systems disclosed herein. The step of contacting the sample with the system can comprise amplifying the one or more target sequences in the sample and incubating the sample under conditions sufficient to allow binding of a labeled, but quenched probe to one or more target molecules. In the case of a double-stranded OSD probe, a labeled strand will bind to the target sequences and the complementary strand containing the quencher will diffuse away, allowing for a fluorescent signal to be detected (Fig. 2). The single stranded region of the amplified target sequence exists either as a portion of the amplified target sequence (e.g., as in the case of a LAMP-amplified target sequence), or can be generated by incubating with a CRISPR-Cas system, which can generate a single-stranded R-loop in the target sequence. Alternatively, in the case of a single stranded RNA
probe with both a fluorophore and quencher attached, the RNA probe will bind the single-stranded target sequences (LAMP loops or Cas-generated), and in the presence of RNaseH, the RNA strand of the resulting RNA-DNA duplex is digested, thus releasing detectable fluorescence (Fig. 3). In another embodiment, one strand of an unlabeled OSD probe binds to the target sequence, and a short RNA probe containing both a fluorophore and a quencher binds to the liberated strand of the OSD
probe, and RNase acts to release the quencher from the RNA strand, resulting in detectable fluorescence (Fig. 4). In another embodiment, one strand of a double-stranded RNA probe containing a fluorophore and a quencher on the same strand binds to the target sequences, and RNase degrades the RNA strand of the RNA-DNA hybrid, thus releasing the quencher and resulting in detectable fluorescence (Fig. 5). In yet another embodiment, an OSD probe containing a fluorophore on one strand binds to a target sequence, thus releasing the complementary strand containing the quencher, thus resulting in detectable fluorescence (Fig. 6). The addition of RNase H to OSD is advantageous because the OSD probes can re-bind when separated thus limiting the overall signal generated. RNase H, however, will cleave the fluorophore from quencher driving the reaction to complete liberation of fluorophore, generating more signal amplification in the reaction.
[00012] The step of contacting the sample with the system can further comprise incubating the sample at about 55 C to about 65 C, about 59 C to 61 C or about 60 C for 50 to 70 minutes and detecting the presence of a positive signal. The steps of extracting, amplifying incubating, activating and detecting are all performed in the same individual discrete volume.
[00013] Methods of detection can further comprise the step of treating the sample with a DNA
extraction solution prior to contacting the sample with the systems disclosed herein.
[00014] In an aspect, the DNA extraction solution is mixed with a sample at a concentration of about 1:2 to 2:1 sample:extraction solution. In an aspect, the method may further comprise incubating the sample and the DNA extraction solution, which may be performed at a temperature of about 20 C to 60 C for about 60 minutes, or 95 C for about 5 to 10 minutes.
Extraction may also comprise the addition of beads capable of concentrating targets of interest of the sample. In an aspect, the beads are magnetic.
[00015] A cartridge for detection assays in accordance with methods disclosed herein is provided comprising a sample receiver, at least a first, second, and third ampoule, and at least a first and second chamber, and a lateral flow strip, wherein the first ampoule is communicatively coupled to the first chamber comprising a heat source, the first chamber is communicatively coupled to the second ampoule, the second ampoule communicatively coupled to the second chamber, the third ampoule communicatively coupled to the lateral flow strip.
[00016] A cartridge can be provided comprising at least a first and second ampoule, a lysis chamber, an amplification chamber and a sample receiving chamber, the first ampoule fluidically connected to the sample receiving chamber, the sample receiving chamber further connected to the lysis chamber, the lysis chamber connected via a metering channel to the second ampoule and the amplification chamber.
[00017] The cartridge may be configured to fit in a system comprising a heating means, an optic means, a means for releasing reagents on the cartridge, and a means for readout of assay result.
The cartridge can comprise a first ampoule that comprises lysis buffer, and/or the second ampoule that comprises a CRISPR system, the CRISPR-Cas system comprising one or more Cas proteins and at least one guide polynucleotide. Alternatively, the cartridge may be as described in U.S.
Application Serial No. 16/894,670, which is incorporated herein by reference in its entirety.
[00018] The cartridge may further comprise amplification reagents. The amplification reagents comprise LAMP F3, B3, FIP, BIP, Loop Forward and Loop Reverse primers.
[00019] The cartridge can comprise a CRISPR system that includes a guide RNA designed to bind to a target nucleic acid that are diagnostic for a disease state.
Alternatively, the cartridge can comprise OSD and/or RNA probes, one of which is detectably labeled as discussed above. The disease state can be an infection, which may be caused by a microbe, the microbe selected from a virus, bacterium, a fungus, a protozoa, or a parasite. The guide RNA may be specific for a microbe that is viral, bacterial, or fungal.
[00020] The cartridge can further comprise a detection construct comprising a polynucleotide component, which may be fluorescent. In an aspect, the cartridge comprises reagents that are lyophilized. In an aspect, the Cas protein of the CRISPR system is a Type V or Type VI Cas protein.
In an aspect, the Cas protein is a Cas12 or Cas13 protein. The cartridge cancomprise a thermostable protein, for example, the thermostable Cas protein is Alicyclobacillus acidiphilus Cas12b (Aap). In an aspect, the guide comprises a sequence derived from Alicyclobacillus acidoterrestris (Aac). The cartridge can comprise a lysis buffer that comprises a DNA extraction buffer.
[00021]
A system designed to receive the detection cartridge as disclosed herein is provided, the system designed to receive the cartridge and conduct an assay comprising isothermal amplification of nucleic acids and detection of target nucleic acids on the cartridge. In one embodiment, the amplification may be isothermal. In embodiments, the system can comprise one or more heating means for extraction, amplification and/or detection, a means for releasing reagents for extraction, amplification, and/or detection, a means for mixing reagents for extraction, amplification, and/or detection, and/or a means for reading the results of the assay. In an aspect, the means of reading the results of the assay is an optic means. The system can further comprise a user interface for programming the device and/or readout of the results of the assay.
[00022]
These and other aspects, objects, features, and advantages of the example embodiments will become apparent to those having ordinary skill in the art upon consideration of the following detailed description of illustrated example embodiments.
BRIEF DESCRIPTION OF THE DRAWINGS
1000231 An understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention may be utilized, and the accompanying drawings. Note that according to the present invention, TnpB effector systems are substituted for CRISPR/Cas effector systems, and should be considered as alternatives in the specification and figures herein.
[000241 Figure I. Cas9 or Cas12 generates exposed ssDNA region via an R-loop. Cas9 and Cas12 can generate ssDNA via R-loop formation. Cas9 blocks about 9 nucleotides of the ssDNA region nearest to the PAM, leaving about 11 nucleotides exposed. This exposed region can be used for generating ssDNA that can trigger one-step strand displacement (OSD) probes and release fluorescence and bind to ssRNA probes with a fluorophore and quencher that can be consequently cleaved by .. RN ase H, release fluorescence Image from: httnts://www.frontiersio.orei articles/10.3389/fmich..2018.00257/full [00025]
Figure 2. OSD probes triggered by R-loops. With normal probes, the OSD probe binds the loop-mediated isothermal amplification (LAMP) amplicon loop, causing stand displacement and unquenching a fluorescence. With CRISPR OSD, the OSD probe binds the R-
amplification is described in U.S. Patent No. 7,175,985, which is incorporated herein by reference in its entirety, as well as in numerous publications (Notomi et al., Nucleic Acids Research 2000, 28;
Nagamine et al., Biochemical and Biophysical Research Communications 2002, 290, (4), 1195-1198;
Tomita et al., Nature Protocols 2008,3, (5), 877-882; Nagamine et al., Molecular and Cellular Probes 2002, 16, (3), 223-229; Mair et al., Bmc Veterinary Research 2013, 9; Tanner et al., Biotechniques 2012, 53, (2), 81-Suwancharoen et al., Journal of Veterinary Medical Science 2016, 78, (8), 1299-1302; Wang et al., Molecules 2016, 21. (1); Abdulmawjood et al.. Molecular and Cellular Probes 2016,30, (4), 205-210; Song 2016 Analytical Chemistry 2016, 88, (14), 7289-7294; Kong Scientific Reports 2016,6 and Mang et al., Scientific Reports 2016,6) all of which are incorporated herein by reference in their entireties.
100071 In one embodiment, the system includes a CR1SPR-Cas system that generates R-loops on a target nucleic acid. Such a system for the detection of coronavirus is provided. A system for detecting the presence of a coronavirus in a sample, comprising: a Cas protein; at least one guide polynucleotides comprising a guide sequence capable of binding a coronoavirus target sequence and designed to form a complex with the Cas protein; and a detection construct comprising a polynucleotide component. The detection agent may be an OSD probe, which upon binding a single stranded region in a target nucleic acid (generated, e.g., using the LAMP
and/or the aforementioned CRISPR-Cas system). In some embodiments, more than one guide polynucleotide may be used to increase signal and sensitivity. In another embodiment, the detection agent may be a labeled RNA
probe, which, upon binding the single stranded region, can be digested with RNase to release a detectable label. In embodiments, this label is a fluorophore. In embodiments, the coronavirus is 2019-nCov. In an aspect, the at least one guide polynucleotide is a highly active guide polynucleotide.
The guide polynucleotide of the system can, an an embodiment, bind to a coronavirus sequence encoding a polypeptide that is immunostimulatory to a host immune system, and/or binds to at least one target sequence that is a unique coronavirus genomic sequence.
[0008] The systems and methods may utilize one or more Cas proteins. In embodiments, the Cas proteins are a Type V or Type VI Cas protein, and may be Cas 12 proteins, Cas13 proteins, or a combination thereof. In an aspect, the one or more Cas proteins comprise a Cas13, which may be a thermostable Cas13 or Leptotrichiawadei Cas13. In an aspect, the one or more Cas proteins comprise a Cas12, which may be a thermostable Cas12b or Alicyclobacilluys acidiphilus Cas12b. The systems may further comprise amplification reagents for amplification of the coronavirus target sequence. In an aspect the amplification reagents are reagents for LAMP, polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplification reaction (NEAR), transcription mediated amplification (TMA), recombinase polyrnerase amplification (RPA) or rolling circle amplification (RCA).
[0009] A lateral flow device comprising a substrate comprising a first end and a second end, are also provided, the first end comprising a sample loading portion, a first region comprising a detectable ligand, two or more systems of the claims provided herein, and one or more first capture regions, each comprising a first binding agent; the substrate comprising two or more second capture regions between the first region of the first end and the second end, each second capture region comprising a different binding agent. In an aspect, the first end comprises two detection constructs, wherein each of the two detection constructs comprises an RNA or DNA
oligonucleotide, comprising a first molecule on a first end and a second molecule on a second end. In an aspect, the first end comprises three detection constructs, wherein each of the three detection constructs comprises an RNA or DNA oligonucleotide, comprising a first molecule on a first end and a second molecule on a second end. The lateral flow device may comprise a polynucleotide encoding a Cas13 and/or Cas12 and the one or more guide RNAs are provided as a multiplexing polynucleotide, the multiplexing polynucleotide configured to comprise two or more guide sequences. The lateral flow device may also comprise of OSD probes designed to separate binding agents upon detection of sequences.
[00010] Methods for detecting a target nucleic acid in a sample are also provided, comprising contacting a sample with the first end of the lateral flow device comprising the sample loading portion, wherein the sample flows from the sample loading portion of the substrate towards the first and second capture regions and generates a detectable signal. Methods may utilize a lateral flow device capable of detecting two different target nucleic acid sequences. In an aspect, the target nucleic acid sequences are absent from the sample, a detectable signal is generated at each capture region, the detectable signal appears at the first and second capture regions.
The lateral flow device can be designed such that when the target nucleic acid sequences are present in the sample, a detectable signal is generated at each capture region, and wherein when the sample contains one or more target nucleic acid sequences, a detectable signal is present at the capture region for the corresponding target nucleic acid sequence.
[00011] Methods for detection may also comprise detecting coronavirus in a sample by contacting the sample with the systems disclosed herein. The step of contacting the sample with the system can comprise amplifying the one or more target sequences in the sample and incubating the sample under conditions sufficient to allow binding of a labeled, but quenched probe to one or more target molecules. In the case of a double-stranded OSD probe, a labeled strand will bind to the target sequences and the complementary strand containing the quencher will diffuse away, allowing for a fluorescent signal to be detected (Fig. 2). The single stranded region of the amplified target sequence exists either as a portion of the amplified target sequence (e.g., as in the case of a LAMP-amplified target sequence), or can be generated by incubating with a CRISPR-Cas system, which can generate a single-stranded R-loop in the target sequence. Alternatively, in the case of a single stranded RNA
probe with both a fluorophore and quencher attached, the RNA probe will bind the single-stranded target sequences (LAMP loops or Cas-generated), and in the presence of RNaseH, the RNA strand of the resulting RNA-DNA duplex is digested, thus releasing detectable fluorescence (Fig. 3). In another embodiment, one strand of an unlabeled OSD probe binds to the target sequence, and a short RNA probe containing both a fluorophore and a quencher binds to the liberated strand of the OSD
probe, and RNase acts to release the quencher from the RNA strand, resulting in detectable fluorescence (Fig. 4). In another embodiment, one strand of a double-stranded RNA probe containing a fluorophore and a quencher on the same strand binds to the target sequences, and RNase degrades the RNA strand of the RNA-DNA hybrid, thus releasing the quencher and resulting in detectable fluorescence (Fig. 5). In yet another embodiment, an OSD probe containing a fluorophore on one strand binds to a target sequence, thus releasing the complementary strand containing the quencher, thus resulting in detectable fluorescence (Fig. 6). The addition of RNase H to OSD is advantageous because the OSD probes can re-bind when separated thus limiting the overall signal generated. RNase H, however, will cleave the fluorophore from quencher driving the reaction to complete liberation of fluorophore, generating more signal amplification in the reaction.
[00012] The step of contacting the sample with the system can further comprise incubating the sample at about 55 C to about 65 C, about 59 C to 61 C or about 60 C for 50 to 70 minutes and detecting the presence of a positive signal. The steps of extracting, amplifying incubating, activating and detecting are all performed in the same individual discrete volume.
[00013] Methods of detection can further comprise the step of treating the sample with a DNA
extraction solution prior to contacting the sample with the systems disclosed herein.
[00014] In an aspect, the DNA extraction solution is mixed with a sample at a concentration of about 1:2 to 2:1 sample:extraction solution. In an aspect, the method may further comprise incubating the sample and the DNA extraction solution, which may be performed at a temperature of about 20 C to 60 C for about 60 minutes, or 95 C for about 5 to 10 minutes.
Extraction may also comprise the addition of beads capable of concentrating targets of interest of the sample. In an aspect, the beads are magnetic.
[00015] A cartridge for detection assays in accordance with methods disclosed herein is provided comprising a sample receiver, at least a first, second, and third ampoule, and at least a first and second chamber, and a lateral flow strip, wherein the first ampoule is communicatively coupled to the first chamber comprising a heat source, the first chamber is communicatively coupled to the second ampoule, the second ampoule communicatively coupled to the second chamber, the third ampoule communicatively coupled to the lateral flow strip.
[00016] A cartridge can be provided comprising at least a first and second ampoule, a lysis chamber, an amplification chamber and a sample receiving chamber, the first ampoule fluidically connected to the sample receiving chamber, the sample receiving chamber further connected to the lysis chamber, the lysis chamber connected via a metering channel to the second ampoule and the amplification chamber.
[00017] The cartridge may be configured to fit in a system comprising a heating means, an optic means, a means for releasing reagents on the cartridge, and a means for readout of assay result.
The cartridge can comprise a first ampoule that comprises lysis buffer, and/or the second ampoule that comprises a CRISPR system, the CRISPR-Cas system comprising one or more Cas proteins and at least one guide polynucleotide. Alternatively, the cartridge may be as described in U.S.
Application Serial No. 16/894,670, which is incorporated herein by reference in its entirety.
[00018] The cartridge may further comprise amplification reagents. The amplification reagents comprise LAMP F3, B3, FIP, BIP, Loop Forward and Loop Reverse primers.
[00019] The cartridge can comprise a CRISPR system that includes a guide RNA designed to bind to a target nucleic acid that are diagnostic for a disease state.
Alternatively, the cartridge can comprise OSD and/or RNA probes, one of which is detectably labeled as discussed above. The disease state can be an infection, which may be caused by a microbe, the microbe selected from a virus, bacterium, a fungus, a protozoa, or a parasite. The guide RNA may be specific for a microbe that is viral, bacterial, or fungal.
[00020] The cartridge can further comprise a detection construct comprising a polynucleotide component, which may be fluorescent. In an aspect, the cartridge comprises reagents that are lyophilized. In an aspect, the Cas protein of the CRISPR system is a Type V or Type VI Cas protein.
In an aspect, the Cas protein is a Cas12 or Cas13 protein. The cartridge cancomprise a thermostable protein, for example, the thermostable Cas protein is Alicyclobacillus acidiphilus Cas12b (Aap). In an aspect, the guide comprises a sequence derived from Alicyclobacillus acidoterrestris (Aac). The cartridge can comprise a lysis buffer that comprises a DNA extraction buffer.
[00021]
A system designed to receive the detection cartridge as disclosed herein is provided, the system designed to receive the cartridge and conduct an assay comprising isothermal amplification of nucleic acids and detection of target nucleic acids on the cartridge. In one embodiment, the amplification may be isothermal. In embodiments, the system can comprise one or more heating means for extraction, amplification and/or detection, a means for releasing reagents for extraction, amplification, and/or detection, a means for mixing reagents for extraction, amplification, and/or detection, and/or a means for reading the results of the assay. In an aspect, the means of reading the results of the assay is an optic means. The system can further comprise a user interface for programming the device and/or readout of the results of the assay.
[00022]
These and other aspects, objects, features, and advantages of the example embodiments will become apparent to those having ordinary skill in the art upon consideration of the following detailed description of illustrated example embodiments.
BRIEF DESCRIPTION OF THE DRAWINGS
1000231 An understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention may be utilized, and the accompanying drawings. Note that according to the present invention, TnpB effector systems are substituted for CRISPR/Cas effector systems, and should be considered as alternatives in the specification and figures herein.
[000241 Figure I. Cas9 or Cas12 generates exposed ssDNA region via an R-loop. Cas9 and Cas12 can generate ssDNA via R-loop formation. Cas9 blocks about 9 nucleotides of the ssDNA region nearest to the PAM, leaving about 11 nucleotides exposed. This exposed region can be used for generating ssDNA that can trigger one-step strand displacement (OSD) probes and release fluorescence and bind to ssRNA probes with a fluorophore and quencher that can be consequently cleaved by .. RN ase H, release fluorescence Image from: httnts://www.frontiersio.orei articles/10.3389/fmich..2018.00257/full [00025]
Figure 2. OSD probes triggered by R-loops. With normal probes, the OSD probe binds the loop-mediated isothermal amplification (LAMP) amplicon loop, causing stand displacement and unquenching a fluorescence. With CRISPR OSD, the OSD probe binds the R-
7 loop formed in the LAMP amplicon, causing strand displacement and unquenching fluorescence.
CRISPR OSD therefore gives more flexibility in targeting and more signal than the normal OSD
approach because the CRISPR-Cas9 complex can generate many ssDNA regions in the LAMP
amplicon using multiple guide RNAs.
1000261 Figure 3. R-loop triggering RNase H-based release of fluorescence.
An RNA
probe of 5-10 nucleotides with a fluorophore at one end and a quencher at the other end binds to the ssDNA region in the R-loop, and RNase H cleaves the RNA strand of the RNA-DNA
duplex, which releases the fluorescence. RNase Fill will be active in the hot temperature range of LAMP (55 C-65 C). CRISPR RNase H probes will have multiple turnover for a given R-loop, yielding multiple fluorescent events, which yields better signal to noise compared to the ()SD
approach.
[00027] Figure 4. Combination of OSD and RNase H: Approach 1. A double-stranded OSD probe without a fluorophore or quencher is added to the LAMP amplicon, and one strand of the OSD probe binds to the LAMP amplicon loop, thus liberating the other strand of the ()SD probe.
That liberated OSD ss probe can now bind to an RNA probe of 5-10 nucleotides that has a fluorophore on one end and a quencher on the other end. RNase H can then cleave the RNA strand of the RNA-DNA duplex, thus releasing fluorescence. The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H, and also has increased specificity.
[00028] Figure 5. Combination of OSD and RNase H: Approach 2. A double stranded RNA OSD probe with a fluorophore at one end of one strand and a quencher at the other end of the same strand is incubated with the LAMP amplicon. The labeled strand binds to the LAMP amplicon loop, and the RNA strand of the RNA-DNA duplex is cleaved with RNase H, releasing fluorescence.
The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H and also has increased specificity.
[000291 Figure 6. Combination of OSD and RNase H: Approach 3. A double stranded RNA OSD probe with a fluorophore at one end of one strand and a quencher at the end of the other strand is incubated with the LAMP amplicon. One strand binds to the LAMP
amplicon, releasing the quenching of the fluorescence. The RNA strand of the RNA-DNA duplex in the LAMP amplicon loop is cleaved with RNase H, allowing for multiple turnover of the OSD probe and increased fluorescence. The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H and also has increased specificity.
CRISPR OSD therefore gives more flexibility in targeting and more signal than the normal OSD
approach because the CRISPR-Cas9 complex can generate many ssDNA regions in the LAMP
amplicon using multiple guide RNAs.
1000261 Figure 3. R-loop triggering RNase H-based release of fluorescence.
An RNA
probe of 5-10 nucleotides with a fluorophore at one end and a quencher at the other end binds to the ssDNA region in the R-loop, and RNase H cleaves the RNA strand of the RNA-DNA
duplex, which releases the fluorescence. RNase Fill will be active in the hot temperature range of LAMP (55 C-65 C). CRISPR RNase H probes will have multiple turnover for a given R-loop, yielding multiple fluorescent events, which yields better signal to noise compared to the ()SD
approach.
[00027] Figure 4. Combination of OSD and RNase H: Approach 1. A double-stranded OSD probe without a fluorophore or quencher is added to the LAMP amplicon, and one strand of the OSD probe binds to the LAMP amplicon loop, thus liberating the other strand of the ()SD probe.
That liberated OSD ss probe can now bind to an RNA probe of 5-10 nucleotides that has a fluorophore on one end and a quencher on the other end. RNase H can then cleave the RNA strand of the RNA-DNA duplex, thus releasing fluorescence. The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H, and also has increased specificity.
[00028] Figure 5. Combination of OSD and RNase H: Approach 2. A double stranded RNA OSD probe with a fluorophore at one end of one strand and a quencher at the other end of the same strand is incubated with the LAMP amplicon. The labeled strand binds to the LAMP amplicon loop, and the RNA strand of the RNA-DNA duplex is cleaved with RNase H, releasing fluorescence.
The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H and also has increased specificity.
[000291 Figure 6. Combination of OSD and RNase H: Approach 3. A double stranded RNA OSD probe with a fluorophore at one end of one strand and a quencher at the end of the other strand is incubated with the LAMP amplicon. One strand binds to the LAMP
amplicon, releasing the quenching of the fluorescence. The RNA strand of the RNA-DNA duplex in the LAMP amplicon loop is cleaved with RNase H, allowing for multiple turnover of the OSD probe and increased fluorescence. The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H and also has increased specificity.
8 000301 Figure 7. LAMP OSD preliminary limit of detection (LOD) with extraction beads. Detection of either pure RNA or mock viral particles ("SeraCare") using a combination of LAMP and OSD at various input concentrations. All samples were pre-concentrated using a bead-based extraction method.
[00031] Figure 8. LAMP OSD reaction and readout on alpha device without extraction bead workflow. Amplification and detection of RNA on device, with fluorescence labeled as arbitrary units (AU). FAM fluorescent channel refers to OSD detection; Syto64 refers to inline control detection of LAMP amplification.
[00032] Figure 9. LAMP OSD reaction and readout on alpha device with extraction bead workflow Amplification and detection of RNA on device, with fluorescence labeled as arbitrary units (AU). FAM fluorescent channel refers to OSD detection; Syto64 refers to inline control detection of LAMP amplification.
[00033] Figure 10. LAMP OSD reaction optimization for different buffer combinations.
Relative fluorescence in LAMP OSD for reactions containing 6 mM additional MgSO4 and either KCI or GuIIC1 additives. GuHC1 additives increase the speed of the LAMP OSD
reaction.
[000341 Figure Ii. RNase H-based detection of oligonucleotides in LAMP
reaction conditions shows that amplification can be achieved with small amounts of RNase H in the LAMP reaction.
[000351 Figure 12. RNase H-based detection of oligonucleotides in LAMP
reactions.
Shows that RNase H cleavage can also be achieved in the same range of RNase H
amounts within the LAMP reaction. Certain RNase H enzymes work better than others.
SUMMARY OF THE INVENTION
[00036] The following brief summary is not intended to include all features and aspects of the present invention, nor does it imply that the invention must include all features and aspects discussed in this summary.
The present invention provides the following:
1. A method of detecting a target nucleic acid in a sample comprising:
(a) distributing a sample or set of samples into one or more individual discrete volumes each individual discrete volume comprising isothermal amplification reagents for
[00031] Figure 8. LAMP OSD reaction and readout on alpha device without extraction bead workflow. Amplification and detection of RNA on device, with fluorescence labeled as arbitrary units (AU). FAM fluorescent channel refers to OSD detection; Syto64 refers to inline control detection of LAMP amplification.
[00032] Figure 9. LAMP OSD reaction and readout on alpha device with extraction bead workflow Amplification and detection of RNA on device, with fluorescence labeled as arbitrary units (AU). FAM fluorescent channel refers to OSD detection; Syto64 refers to inline control detection of LAMP amplification.
[00033] Figure 10. LAMP OSD reaction optimization for different buffer combinations.
Relative fluorescence in LAMP OSD for reactions containing 6 mM additional MgSO4 and either KCI or GuIIC1 additives. GuHC1 additives increase the speed of the LAMP OSD
reaction.
[000341 Figure Ii. RNase H-based detection of oligonucleotides in LAMP
reaction conditions shows that amplification can be achieved with small amounts of RNase H in the LAMP reaction.
[000351 Figure 12. RNase H-based detection of oligonucleotides in LAMP
reactions.
Shows that RNase H cleavage can also be achieved in the same range of RNase H
amounts within the LAMP reaction. Certain RNase H enzymes work better than others.
SUMMARY OF THE INVENTION
[00036] The following brief summary is not intended to include all features and aspects of the present invention, nor does it imply that the invention must include all features and aspects discussed in this summary.
The present invention provides the following:
1. A method of detecting a target nucleic acid in a sample comprising:
(a) distributing a sample or set of samples into one or more individual discrete volumes each individual discrete volume comprising isothermal amplification reagents for
9 amplifying the target polynucleotide, and a solution for rapidly isolating polynucleotides from a cell or virus particle.;
(b) incubating the sample or set of samples at conditions sufficient to allow extraction of target polynucleotides from the sample, (c) generating amplicons of target polynucleotides, wherein isolation of polynucleotides is not required between the extraction or amplification step;
(d) further incubating the sample with a probe that binds one or more single stranded regions of said amplicons, wherein said probe is:
(1) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
(2) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(3) a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(4) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or (5) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand;
(e) adding an enzyme capable of cleaving RNA in an RNA:DNA duplex to increase fluorescence to be detected; and (0 detecting the one or more amplicons, thereby indicating the presence of one or more target polynucleotides in the sample.
2. The method of item 1, wherein the enzyme capable of cleaving RNA is RNaseH.
3. The method of item 1, wherein the fluorescence detected is greater than fluorescence detected by unwinding of the RNA:DNA duplex alone.
4. The method of item 1, which does not include a washing step.
5. The method of item 1, wherein the solution for isolating polynucleotides is protease-based, detergent-based, or chaotrope-based.
6. The method of item 1, wherein the solution contains proteinase K.
7. The method of item 6, wherein reaction buffer contains a proteinase K
inhibitor.
8. The method of item 1, wherein the solution for isolating polynucleotides is Lucigen Quick Extract Plant DNA Extraction Solution.
9. The method of item 1, wherein the amplicons are generated using loop-mediated isothermal amplification (LAMP), polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplification reaction (NEAR), transcription mediated amplification (TMA), recombinase polymerase amplification (RPA) or rolling circle amplification (RCA).
(b) incubating the sample or set of samples at conditions sufficient to allow extraction of target polynucleotides from the sample, (c) generating amplicons of target polynucleotides, wherein isolation of polynucleotides is not required between the extraction or amplification step;
(d) further incubating the sample with a probe that binds one or more single stranded regions of said amplicons, wherein said probe is:
(1) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
(2) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(3) a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(4) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or (5) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand;
(e) adding an enzyme capable of cleaving RNA in an RNA:DNA duplex to increase fluorescence to be detected; and (0 detecting the one or more amplicons, thereby indicating the presence of one or more target polynucleotides in the sample.
2. The method of item 1, wherein the enzyme capable of cleaving RNA is RNaseH.
3. The method of item 1, wherein the fluorescence detected is greater than fluorescence detected by unwinding of the RNA:DNA duplex alone.
4. The method of item 1, which does not include a washing step.
5. The method of item 1, wherein the solution for isolating polynucleotides is protease-based, detergent-based, or chaotrope-based.
6. The method of item 1, wherein the solution contains proteinase K.
7. The method of item 6, wherein reaction buffer contains a proteinase K
inhibitor.
8. The method of item 1, wherein the solution for isolating polynucleotides is Lucigen Quick Extract Plant DNA Extraction Solution.
9. The method of item 1, wherein the amplicons are generated using loop-mediated isothermal amplification (LAMP), polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplification reaction (NEAR), transcription mediated amplification (TMA), recombinase polymerase amplification (RPA) or rolling circle amplification (RCA).
10. The method of item 1, wherein the isothermal incubation temperature is between 55 C and 75 C.
11. The method of item 1, wherein the single stranded region is a LAMP
amplicon loop.
amplicon loop.
12. The method of item 1, wherein the single stranded region is an R-loop generated when a guide polynucleotide binds to one strand of the amplicon.
13. The method of item 12, wherein a Cas molecule or an argonaute protein enables the guide polynucleotide to bind to strand of the amplicon.
14. The method of item 13, wherein the Cas molecule is Cas9, Cas12 or Cas14.
15. The method of item 14, wherein the Cas molecule is Cas12(b).
16. The method of item 1, wherein the enzyme is a nuclease or nickase.
17. The method of item 1, wherein the probe is a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand.
18. The method of item 1, wherein the probe is a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
19. The method of item 1, wherein the probe is a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
20. The method of item 1, wherein the probe is a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
21. The method of item 1, wherein the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand;
or
or
22. The method of item 1, where the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
23. The method of item 1, wherein the target nucleic acid is from a virus, bacterium, protozoa, fungus, or other pathogenic organism.
24. The method of item 23, wherein the target nucleic acid is from human papillomavirus, hepatitis, adenovirus, Candidia, coronavirus, hemesvirus, human immunodeficiency virus, influenza virus. Plasmodium, rhinovirus, Neisseria gonorrhoeae, Respiratory syncytial virus, coronavirus, or Streptococcus pyogenes.
25. The method of item 25, wherein the coronavirus SARS-CoV2.
26. The method of item 1, wherein an extraction-free solution is mixed with a sample at a concentration of about 1:2 to 2:1 sample:extraction solution.
27. The method of item 26, wherein the sample is from a nasal swab or saliva.
28. The method of item 1, wherein the incubating step is performed at a temperature of about 20 C to 60 C for about 30 minutes,.
29. The method of item 1, wherein the amplifying and detecting steps are performed at about 55 C to about 65 C, about 59 C to 61 C or about 60 C for 50 to 70 minutes.
30. The method of item 1, wherein the target polynucleotide is detected in one hour or less.
31. The method of item 1, wherein the steps of incubating and detecting are all performed in the same individual discrete volume.
32. A composition for detecting the presence of a target polynucleotide in a sample, comprising reagents for amplifying the target polynucleotkle, an extraction-free solution for isolating polynucleotides from a cell or virus particle, and one or more of the following probes:
(a) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
(b) a single stranded RNA probe with a fiuorophore on one end of the strand and a quencher on the other end of the strand;
(c) a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(d) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or (e) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
(a) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
(b) a single stranded RNA probe with a fiuorophore on one end of the strand and a quencher on the other end of the strand;
(c) a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(d) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or (e) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
33. The composition of item 32, wherein the amplification reagents are LAMP
reagents comprising F3, B3, F1P, :BIP, Loop Forward and Loop Reverse primers.
reagents comprising F3, B3, F1P, :BIP, Loop Forward and Loop Reverse primers.
34. The compositions of item 32, wherein the probes are selected from Table 9.
35. The composition of item 32, wherein the probes are provided at a concentration of 50 nM to 175 n.M, preferably 75 nM to 150r1M.
36. The composition of item 33, wherein LAMP primers are selected from Table 1A, Table 1B, Table 5 or Table 8.
37. The composition of item 33, wherein the F3 primer is selected from Table 5 or Table 8.
38. The composition of item 32, wherein the composition is lyophilized.
39. The composition of item 38, wherein the composition is lyophilized as a complete formulation.
40. The composition of item 39, wherein the composition is lyophilized as an incomplete formulation and additional components are added later in resuspension buffer.
41. The composition of item 40, comprising one of more of lactose, trehalose, sorbitol, glucose, raffmose, glycine or histidine.
42. The composition of item 32, further comprising one or more additives, wherein the additive is guanidinium chloride (GuHC1), L-proline, L-histidine, b-alanine, L-serine, urea, acetamide, 4-aminobutyric acid, polyethylene glycol, polypropylene glycol, polyvinylpyrrolidone K, 6-0-a-D-maltosyl-b- cyclodextrin, (2-hydroxypropy1)-b-cyclodextrin, a- cyclodextrin, b-cyclodextrin, methyl-b- cyclodextrin, glycine, proline, taurine, or a combination thereof.
43. The composition of item 32, further comprising polynucleotide binding beads for the capture of nucleic acids in a sample.
44. The composition of item 43, wherein the beads are carboxylated.
45. The composition of item 43, wherein there is polyethylene glycol in binding solution.
46. The composition of item 44, wherein there is silica in binding solution.
47. The composition a item 32, further comprising one or more Cas proteins and at least one guide polynucleotide designed to form a complex with the one or more Cas proteins.
48. The composition of item 47, wherein the one or more Cas proteins is a Type II Cas, Type V
Cas, Type VI Cas, or a combination thereof.
Cas, Type VI Cas, or a combination thereof.
49. The composition of item 47, wherein the one or more Cas proteins is thermostable exhibiting nuclease activity at temperature of at least 50 C.
50. The composition of item 47, wherein the Cas is a Cas12b.
51. The composition of item 50, wherein the Cas12b is selected from Table 2A or Table 2B.
52. The composition of item 49, wherein the thermostable Cas protein is Brevibacillus sp. SYSU
G02855 (Br) Cas12b or Alicyclobacillus acidiphilus (Aap) Cas 12b.
G02855 (Br) Cas12b or Alicyclobacillus acidiphilus (Aap) Cas 12b.
53. The composition of item 52, wherein the Cas protein is Aap Cas12b and the guide is derived from Alicyclobacilus acidoterrestris.
54. The composition of item 47, wherein the guide polynucleotide comprises a sequence selected from Aac guide types 1 to 5 (SEQ ID NOs: XX-XX).
55. The composition of item 47, wherein the Cas12b is BrCas12b and the guide sequence comprises a crRNA design 1 to 3 (SEQ ID NO:XX-XX).
56. The composition of item 47, wherein the guide polynucleotide comprises a spacer specific for the genome of SARS-CoV-2 .
57. The composition of item 56, wherein the guide polynucleotide comprises a spacer specific for the N gene or S gene of SARS-CoV-2.
58. The composition of item 32, further comprising one or more additives to increase reaction specificity or kinetics.
59. The composition of item 32, further comprising a polynucleotide binding beads.
60. The method of item 1, wherein the sample is subjected to in-sample multiplexing using intercalating dyes.
[00037] The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular description of preferred embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.
[00038]
DETAILED DESCRIPTION OF THE EXAMPLE EMBODIMENTS
General Definitions [00039] Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains.
Definitions of common terms and techniques in molecular biology may be found in Molecular Cloning: A Laboratory Manual, 2nd edition (1989) (Sambrook, Fritsch, and Maniatis); Molecular Cloning: A Laboratory Manual, 4th edition (2012) (Green and Sambrook); Current Protocols in Molecular Biology (1987) (F.M. Ausubel et al. eds.); the series Methods in Enzymology (Academic Press, Inc.): PCR 2: A Practical Approach (1995) (M.J. MacPherson, B.D.
Ham.es, and G.R. Taylor eds.): Antibodies, A Laboratory Manual (1988) (Harlow and Lane, eds.):
Antibodies A Laboraotry Manual, 2nd edition 2013 (E.A. Greenfield ed.); Animal Cell Culture (1987) (RI. Freshney, ed.);
Benjamin Lewin, Genes IX, published by Jones and Bartlet, 2008 (ISBN
0763752223); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN
0632021829); Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 9780471185710);
Singleton etal., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, N.Y. 1994), March, Advanced Organic Chemistry Reactions, Mechanisms and Structure 4th ed., John Wiley &
Sons (New York, N.Y. 1992); and Marten H. Hofker and Jan van Deursen, Transgenic Mouse Methods and Protocols, 2nd edition (2011).
[00040] As used herein, the singular forms "a", "an", and "the" include both singular and plural referents unless the context clearly dictates otherwise.
[00041] The term "optional" or "optionally" means that the subsequent described event, circumstance or substituent may or may not occur, and that the description includes instances where the event or circumstance occurs and instances where it does not.
[00042] The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.
[00043] The terms "about" or "approximately" as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, are meant to encompass variations of and from the specified value, such as variations of +/-10% or less, +/-5% or less, +/-1% or less, and +/-0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier "about" or "approximately" refers is itself also specifically, and preferably, disclosed.
[00044] As used herein, a "biological sample" may contain whole cells and/or live cells and/or cell debris. The biological sample may contain (or be derived from) a "bodily fluid". The present invention encompasses embodiments wherein the bodily fluid is selected from amniotic fluid, aqueous humour, vitreous humour, bile, blood serum, breast milk, cerebrospinal fluid, cerumen (earwax), chyle, chyme, endolymph, perilymph, exudates, feces, female ejaculate, gastric acid, gastric juice, lymph, mucus (including nasal drainage and phlegm), pericardial fluid, peritoneal fluid, pleural fluid, pus, rheum, saliva, sebum (skin oil), semen, sputum, synovial fluid, sweat, tears, urine, vaginal secretion, vomit and mixtures of one or more thereof. Biological samples include cell cultures, bodily fluids, cell cultures from bodily fluids. Bodily fluids may be obtained from a mammal organism, for example by puncture, or other collecting or sampling procedures.
[00045] The terms "subject," "individual," and "patient" are used interchangeably herein to refer to a vertebrate, preferably a mammal, more preferably a human.
Mammals include, but are not limited to, murines, simians, humans, farm animals, sport animals, and pets. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed.
[00046] Various embodiments are described hereinafter. It should be noted that the specific embodiments are not intended as an exhaustive description or as a limitation to the broader aspects discussed herein. One aspect described in conjunction with a particular embodiment is notnecessarily limited to that embodiment and can be practiced with any other embodiment(s).
Reference throughout this specification to "one embodiment", "an embodiment,"
"an example embodiment," means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases "in one embodiment," "in an embodiment," or "an example embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment, but may.
Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention. For example, in the appended claims, any of the claimed embodiments can be used in any combination.
[00047] All publications, published patent documents, and patent applications cited herein are hereby incorporated by reference to the same extent as though each individual publication, published patent document, or patent application was specifically and individually indicated as being incorporated by reference.
OVERVIEW
[00048] Embodiments herein are directed to systems and methods of detecting the presence of a target nucleic acid in a sample. In certain example embodiments, the systems and methods provide for single reaction (one-pot) detection of target nucleic acids. In certain example embodiments, extraction, amplification, and detection may take place under a single set of reaction buffer and reagent conditions. In certain example embodiments, detection is achieved using isothermal amplification (e.g. LAMP) only. In other example embodiments, detection of nucleic acids can utilize Cas proteins to provide improved reaction sensitivity and/or specificity. In certain other example embodiments, isotheiinal amplification may be utilized with a thermostable CRISPR-Cas protein, with the combination of thermostable protein and isothermal amplification utilized to further improve reaction conditions and times for detection and diagnostics.
In certain other example embodiments, detection of nucleic acids produced by isothermal amplifcation may be accomplished using OSD probes. These probes may bind accessible loops, or bind open areas of ssDNA generated by binding of CRISPR proteins such as Cas9 and Cas12. Binding of OSD probes may release fluorescence by separation of a quencher and fluorophore, either through displacement of a oligonucleotide strand, or through triggered degradation of an oligonucleotide via endonucleases such as RNase H. Advantageous quick extraction approaches for the extraction of nucleic acids from a sample are also provided. Design of reaction conditions and reagents are provided for the identification of primers and reaction conditions, including concentration and content of reagents and additives, that enhance the detection systems and methods disclosed herein. Advantageously, the systems and methods can be provided in lateral flow or self-contained cartridge devices for rapid, point- of-care diagnostics. In certain embodiments, the detection assay can be provided on a cartridge or chip. A device system can be configured to receive the cartridge and conduct an assay.
[00049] In certain example embodiments, the Cas protein may be a Type V
CRISPR-Cas, a Type VI CRISPR-Cas, or combination thereof. In certain example embodiments, the Type V or Type VI Cas protein is a thermostable case protein with a nuclease activity above at least 50 C. In certain example embodiments, the Cas protein is a Cas12b protein. In certain other example embodiments, the Cas12b is Alicyclobacillus acidiphilus (AapCas12b). In certain other example embodiments, the Casl 2b protein is Brevibacillus sp. SYSU G02855 (BrCas12b). In certain example embodiments, the Cas protein, may be paired with the novel guide designs disclosed herein.
[00050] Systems and method disclosed herein include approaches to isothermal amplification for detection of target nucleic acids. In certain example embodiments, isothermal amplification approach is loop-mediated isothermal amplification (LAMP). Design of optimal systems, including primers, reagents and additives to be used with isothermal amplification approaches are also provided. Optionally, CRISPR-Cas systems as disclosed herein can be used with isothermal amplification approaches, including LAMP, that can enhance sensitivity and/or specificity.
[00051] Methods of designing optimal reaction conditions are also provided. In an aspect, methods can comprise identifying the type of amplification reaction and designing optimal primers in accordance with the methods disclosed herein. Methods may also comprise identifying optimum CRISPR-Cas systems, including identification of the Cas protein for the reaction conditions. For example, the Cas protein may be identified based on its thermostability, cutting preferences, or other desired characteristics. Preferred guide molecules may similarly be identified. Once one or more primers and/or guides are identified, salt concentrations and other additives can be titrated and selected for further investigation. Additional reaction conditions, additives and reagents can be identified to optimize the use of one-pot methodology, lyophilization of reagents, and use in the devices disclosed herein.
[00052] In certain example embodiments, the system comprises a Type VI
CRISPR-Cas system, one or more guide polynucleotides comprising a guide sequence capable of binding a target sequence and designed to form a complex with the Type VI Cas protein, and a detection construct comprising a polynucleotide component. The Type VI Cas proteins of the present systems and methods exhibit collateral RNase activity, cleaving the polynucleotide component of the detection construct once activated by the target sequence, which can generate a detectable signal.
[00053] Embodiments disclosed herein provide systems utilized in multiplex lateral flow devices and methods of use. In certain preferred embodiments, the guides utilized are designed to be highly active guide molecules, allowing for rapid and highly sensitive detection of coronavirus. In certain example embodiments, the systems can utilize general capture of antibody that was not bound by intact reporter RNAs as described in Gootenberg, et cd., Science 360, 439-444 (2018). In other embodiments, the presently disclosed system can be designed for detecting two or more targets.
When utilized with a lateral flow approach, two or more separate detection lines consisting of deposited materials that capture detection construct and a molecule specific to the deposited material, allows visualization of detectable signal (e.g. gain or loss) at detection lines due to collateral activity and cleavage of corresponding reporter oligonucleotide. Utilizing guide design that allows for design of highly active guide RNAs for use with the specific Cas protein of the systems for target sequences, for example, coronavirus is also provided. In certain embodiments, the time from processing of a sample in the current methods and using the presently claimed systems, from receipt of sample to detectable signal is less than 120 minutes, 110 minutes, 100 minutes, 90 minutes, 75 minutes, 60 minutes, 45 minutes, or 30 minutes.
OS1) PROBES
1000541 The use of OSD probes is described in U.S. Patent Publication 2016/0076083, published on March 17, 2016 and in U.S. Patent Publication 2020/0255891, published on Agust 13, 2020, both of which are incorporated herein in their entireties.
SINGLE LYSIS REACTION COMPOSITIONS
[00055] In certain aspects, embodiments disclosed herein are directed to compositions and kits that consolidate lysis and amplification of target nucleic acids into a single reaction volume. In certain example embodiments, extraction-free lysis reagents can be used to extract nucleic acids from cells and/or viral particles. In contrast to existing protocols, extraction-free lysis solution does not require isolation of the nucleic acid prior to further amplification. The extraction-free lysis reagents may be mixed with amplification reagents such as standard RT- PCR amplification reactions. An example of an extraction-free lysis solution is described in U.S. Provisional Application Serial Nos. 62/970,125, 62/993,494, 63/018,487, 63/019,406 and 63/032,470, noted above.
[00056] In certain example embodiments, the extraction-free lysis solution is combined with amplification reagents into a single volume. In certain example embodiments, the amplification reagents are isothermal amplification reagents. In certain other example embodiments, the isothermal amplification reagents are LAMP isothermal amplification reagents. In certain example embodiments, the LAMP isothermal amplification reagents may include primers for the target nucleic acids discussed in further detail below. In certain example embodiments, the LAMP
amplification reagents include primer sets selected from those listed in Table 1 A and Table 1B.
Table 1A. LA NIP Primers i O t ACCIGITCC.ACAACTAC:TACAACA1ACCTGAAGAG6A -- GiCthraCTCTGT1TIGGZu..1 sG-_ CiGAGC TTC1 G
_ 'FAGG! GAAGAGGAGGAGC _ ¨ 1 - TGCUSCAUTGIITUaCIATTTACTAACTOMATITA 1TAGCACTiMiGGITCAA' ToGiGICATAitTATAAG TGCAGTAGCTITATACTIT ACT-A-ACTEATCATiTAA' GT7 AGTTFGGAG CAAFAATTGGCFC rrrc:Aci TGGAG
¨.._...
2 GCAGAGIAGCATCATCTAGAAAACCIGTATETTleAT-- CTIGT7 GGCTGITC:AfciaAfACT-TCG-AAii.AliCAC TAATi AGATTITTGAAGG IGTATCTTICATGAATAGG
GAATAGGCAAA GGGTG GCAGT CAM
3 TGGC:GAGCATAATAATATAGCACAFTGCTAAAACAGT GGTTIGCAACCCCTICCTAGAAATGIATAGCTICITT
GACTATTGAATCTITATGA TGCFAAAACAGTGGAAGC
4 CiGIGGGGAAACCiGACiTCGCGCACTCCACIATITCCT
CTCCCACCCAGC;CGGCATACCGGIGGFGCTACTAGC ATAMIAGCTC'AC_ACAGG
. CCC G CC
GC,ACTCCACTA7TTC.C1TX:C .
CIGCTGGAGAAGTGGGAAGCGGCTICACICTICAAG AGGAGAACCAACGFCCAC ACCAGAGAACICICOCAG
A
(IAA TCTTGC C A
S '1GAGCMAITTIGAAA1OCTACAATGCiGGAAGAGG
AAAGCAGMCIAAGTAGTGTGTCTITA I AATGAGT CAACTGIGTTIAGATGGO
AAGATGT CCAAATGTCCAT C
TGGGGAAGAGGAAGATGT
ATACACTITATACAACCCG CCTAATGTAAATGTGTTIG
CGGACATTAAGTAGACCAAC.ATTCATCKTAGCCTAC CGCGCAATTTTATTTGGCT
T TTAATCGAATT AT
CGCCTTTACGGATGACAT
AGAIGCAlTAGCTGAGTCfGCTGTC.AAAAGACICAT C1'GCACAMT1 AT TAIGA
AGGCTGGAGAAATAGAAA
AGAAATGC TATATGGATCT TGTGTC TGC
GGAITTCCIGTIAATCCAATGCCIATGTGGAGGTIGG TTGITCAATAAATAIGCTGACGTCiG 7 GTAATCCFCA
CCTGAAAAGGA.ACG1TTG
AAGGTA TCC.IGTTGAG G
TGTGGAGGTTGGAAGGTA
TGGTGA CTATTAAAT C.CAG
AGCGGGTATTGATCiGTGA
CAGATACCTTTTATCTAAA
AAATGTCCTCCJ'AGATGCTG
GCTGTGGACAGTGTGGTCAGTTTGGCCCCTTTTTAG CTATGTGAGCAATAATC. T
CATGGGGICAATATITTIG
TATTITIGCC CA GACT CC
14 AATGCCTGC.AATAAGCACGCTGTATTGITCAACAACA ACTGAMAATGAGAGGTMTGCTCCAGGCAGTACT
GGATTGTGCAGAAAMAT TGTATTGTTCAACAACAGC
GCTG AATTCTTCAAGA TGAC TG
GGACAACAAAGAGAAAGTCCITCGMGACATTTGCT ATCCTGTGCAAGAAACCGIGGCTACTIGTACCITCT
C.AGAGCAAGIGTTAGATT AAGACATTTGCTATTAGAC
ATTAGAC. TTTTGG AAACTATCA CAC TTTTGG
AGTCCAGAAGITGCAGTTGAGAGCTT ATTTT CATATCTAACCTAATAGAT
CATFG CACAGCCT GATGGA
GGAAATICCCTGGCATTG
GCATAGTAGATTACAGTG ATGCATTACCATITGATAA
13 GCGTGTCTTGGAGGTCGAGGAACCCCAGAAGACTCC AGGAGAGTCCTGCTAGGAGACGTGTCTC.AGGTGAT
ACTACAAGGGACTCCTCC
AACCCCAGAAGACTCCGTG
AGAGGAGGAAGGGAATT
CTCCAG TITAAGC GG
GArfIGTCCCGICTCCAG
CTCCAGGATTTACTCTACTAGGAGTGGTCTCGGGATT GACCC.ACAATTCCTGTTGACACTGAATGACTGCAGA
GTTAAGTAGTGTGATATA
GGTACT GCCAG TCTTGGT
GGICTCGGGATTGGTACT
21 ACCIGTMGATGTCCACTAGCAGGTTGTAACAAC.ACCT TTAGCAATGCAGAAGAAGGIGTAGTGCTAAACGTT
GGTCCTATAGMACTCCTT AGGTTGTAACAACAC.CTGA
GAF C TCGGITIG CIA FAG IC
CTGAATTTGAAATTGAAGCCCCFCCATCTCTGAAGC ClICAGATATCAA %TM
GGIGAC GGGTGAC TGTGGA
TGCACATTFTACAGGTGAC
GGACGTCCTAGATACAGTGAAACCGAGTTTGCAT TGACCCTGATGTAACAATG
ATGA TGTTCCAC , GATTTGAAAATCCCGCCTT A
GTATOTACCCACAACAAA GTACTCCTCTAGAAC.CTTA
ACCTTATGC CCAGTIGG ATGT TGC
CCGAATAGGMAGTATGGGTGACCTTCCATATACTIC AGITCCTC.ATGICTCAGGAAGTGCGATTCGGATCTG
ACGGATGAATATGTTAAA
CACGCTG GTAG GGAAC
ITCCATATACTTCC.ACGCTG
CTGGACATCCTTTGTATAA TAGGAGACACAGAAAATC
AATC:CC CI ATA MCC CAAAC CC:
27 GGIGTICCAAAGTACACATE.TGATAATAGGAGCTCA TAG
TE.GTTCTCTAGTTTCCAGTGATTGITAGTCCCCT CCATTIGAACCAACATCTG TAATAGGAGCTCAAGACA
AGAC:AATCA GAGCA A ATCA
ACCA MCAT CCCTTACGGGGCGACACr CTOMCTGCGAGGAACCA
29 GACTCCGCCAACCATCTGCCACCA.AAAGAAACACCG GGCCCIAGATTGGGTGTGCGTTTGGGGATACGTTG
TGCACCATGAGCACGAAT ACCAAAAGAAACACCGTCC
TCCG GCZ C G
TCGACA ACC GCACCGCMGCGAAA A
i 31 CACTGG1TCC.CAACACCTeCTAAACAGACMCGCCAT CCAMTGGCGGTCCGCTATAGGTCAATATGCGTCC
AAC.AGACAACGCCATCCTA
MAC GC , TCACCAATGACTGOCCCA C
CCATTGAC.AAGGCCGGCGCACGTCACCGGCCACA C TICGATCTACCCGGGCC
ACGTCACCGGCCACA
33 CCTCTCGGGACAGCCACTAGAAATGAL 0. I i I GCAGA
C.AGTGACTTEGACC.AGGGGTGGAATAGGGICGCTG ATGACTCMGCAGACAGG
CAGGG GTCCG ACATCP.ACCGGACCGCTT G
CCC.AGGGIGCCCAGAGTAACACGCATGITTCCTGA GCTCATGG ATGAACACAAC
ACAACG AGCAGT CGCAAGGCAGTIGGTTTG G
35 GGG6ACAT5TCC7C.CCTCiTCT1GCAATTGGACGCGG TGCTCCATTCCACTACGGAGCTGTAGATAAGGCCGG
ACATAGGCTCGATGCAGC
G AAGCGG G
TGCAATTGGACGCGGG
TGATGATGCTTATGGCCCA
CCAG GCCAAG GATCTGCGCCTGCCMG G
TTAGTTACCTIGGGACCGC AGCCCITATTCTCTGGIGG
AMTC.CCGTAGGCCGCCGGC6C1GGACCGGCAC1T AGTG GCCGCGCMCTCACTAT
CGCTGGACCGGCACTT
CTACCCTTG TCGTGAGCCT
40 ACAAGAC.AGGCCCTCCCGAGTACAGCGTCCCTAC.TGT 1 GCCCCCAMGTCATTGCGTTGCTGCTITGGCTACAC TCTIAGGAGGAAGGGTOA
CC 1 (XiC CCi TACAGCSTCCCTACTGTCC
TAACACCTGAACAACCCCCG GCTTAATCC.ATCAGTCGai CGCGG 1 IC , GCCAGGGCTACAAAGTGC 6 AAATGTGATGAGCTIGCCGGGACCCCTATAGTAAG
GAA ' CAACGGC CCACCCCAACATCGAGGA
CTTGGCAACGATGGGGAA
43 GCGAGACAGAATMGCGGGACGACTTCTCCTTGGA GCTCTC.AGAGACGTGGGAGGACGCCACTGGACACG
CCCCA TATC GTGGCGGTTACCCAGACT
GAMCICCTIGGACCCCA
TGGAAGAGTGCTCCAAAGCTCCTGAACATTCCGCCG TACTGGCAGTGTTGTTGTG
TGG ATGG CGTGGCCGCATA MCAT G
AGTCAACATOCTGGGTGGGIGGCCGAGACCACGA A TAC.CTAGCTGGGCTATCCA
ACTGCCAGGTAATCCAACT
. AACTG AGTCG C 6 TGAGGATITCCACGGGAGG GGTCCCTAGCACAGAGGA CIGGCAAACCTCTTACCTG
CCIGC (X; AA T C
47 ICCCTGGAAGGCrGGGAAGTACTGCATGG I CC TATC
GGCATCATGAACACCAAGTGCCCGCATCGATCCGTI
COACT CTTGAC GAGGIGIGGGACIGGGIT TGCATG
GTCCTATCCGACT
45 .ACACGGGCACTTGAGGTCCTTCGTCACAATGACCAGC TCCTICCCCCGAATTCTTCAGCTGGGTTCAGGGGCG
CGGAGTACGTCGAGATCC TCGTCACAATGACCAGCAT
All MSC r T
A ICCACCACCCAGGCOCiPAGGTGCCGTGGAAACCGC CCC
CGAGIACGCACCICCCCA
SO TGGGCGCAAGACiAT CTGACT I CC I GCCCACTCAGCCA CT
AC.AGGACATGACCACOCCGAGGCAGGATITACG MICA GTAGAGGAAGC 16 GAT CAGAAGA CT
CTGCCCACTCAGCCAGAT
51 CCICTCAGI GCGGAT6TCITGTGGC 11CICAT A rCiAC
TA1CAGIGCT6C:AACCICGACKCCACGTAC:AAC.CIC GICCAAACGCACRX.CAT GGCFTCICATA
IGACACCC
. ACCCG TCGGT A 6 6 .
52 CGAGAGCAAGGGAG ItGAACGGGAGTACCIGGTC
TGACGIGCTACAICAAAG
GICATCGCCGCAGACCAGGAMGCGGCTGCAGCAG ATTGCCG CC
AAAGCGGCTGC.AGCAG
CTGGCCGGCTCGATCflICG IGATCCACG TCCGCCT 1GCGCTUCTALTIC TICA
TTCAA C GGGCAGGGCTCGCATA A
54 GC1GA17TICCTCC-T6TI ATICiCT GAT TECTAAAAAAT
AAGAATGAACATCTTCCCCTAA1GTTAAGCACCTCA TCiCiCAC.ATGAAATACTGA AT CM
TAAAAAATG TEGCT
GTIOCTCCTG TAACGATGT TTG MG
55 AGTC.TCCAGCTGMTCATTTGCTICTAGMGAAAAA TGITGriCiAGTGGAGITAATGAACTTCTICAACCATT
GCGACATGCTGAATAAAG CTTCTAGTITGAAAAAATC.
ATCAGAAGG TCCATT C AGAAGG
56 GTGICIATTGIGTAGCC TGITOCCAACA 11 CCCAT ACA.
CGIGATIAGPACACACGAGIACICCACATCCTGIAA A TACAGGCAGCAATTICA.
CCGG CATCAGAAA AC
AACATTCCCATACACCGG
57 CATCiGCA11CTG VGACCCr GC Cr TGGATACi AATGGA I GACAAAT
IGACICAG GGGAGACATATIGTIGIGTTC AATTGGA TIGTGT ICTCiG CTT MCAT AGAATGGA TGA
GAAGAAC AGTGCC AG AGAAC
55 CG1TTCCACCTACGGGTAAAC.AAACTTGGCIAAAAAT AGAAGGCCAAACTA 1CPAAIGCA
TACTGATCCCI-CC TCAGAGGATTIGI ATTAGI AAACTIGGCTAAAAATATC
59 CCCATCCCATTGTIGATCATATRITCiAGICGCMGAG GCACAAAC AGCCA7 ACAA
ITATICAATCfCCCCIGT GGATT TCri A-Cc:wan-re;
TGAATG GGCATTT CAATGG
60 TTCTGGAATATGCAGGTITCTCAAAAGAGATGGICrA
ACMCATAATGGACTZTGAGTACACAMTGTCCIAC GCTATGGGAAAA CACTAA AGAGATGGTCTATTAGTAG
TTAGTAGCAG . APAGGGATTT AGGA CAG
[00037] The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular description of preferred embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.
[00038]
DETAILED DESCRIPTION OF THE EXAMPLE EMBODIMENTS
General Definitions [00039] Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains.
Definitions of common terms and techniques in molecular biology may be found in Molecular Cloning: A Laboratory Manual, 2nd edition (1989) (Sambrook, Fritsch, and Maniatis); Molecular Cloning: A Laboratory Manual, 4th edition (2012) (Green and Sambrook); Current Protocols in Molecular Biology (1987) (F.M. Ausubel et al. eds.); the series Methods in Enzymology (Academic Press, Inc.): PCR 2: A Practical Approach (1995) (M.J. MacPherson, B.D.
Ham.es, and G.R. Taylor eds.): Antibodies, A Laboratory Manual (1988) (Harlow and Lane, eds.):
Antibodies A Laboraotry Manual, 2nd edition 2013 (E.A. Greenfield ed.); Animal Cell Culture (1987) (RI. Freshney, ed.);
Benjamin Lewin, Genes IX, published by Jones and Bartlet, 2008 (ISBN
0763752223); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN
0632021829); Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 9780471185710);
Singleton etal., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, N.Y. 1994), March, Advanced Organic Chemistry Reactions, Mechanisms and Structure 4th ed., John Wiley &
Sons (New York, N.Y. 1992); and Marten H. Hofker and Jan van Deursen, Transgenic Mouse Methods and Protocols, 2nd edition (2011).
[00040] As used herein, the singular forms "a", "an", and "the" include both singular and plural referents unless the context clearly dictates otherwise.
[00041] The term "optional" or "optionally" means that the subsequent described event, circumstance or substituent may or may not occur, and that the description includes instances where the event or circumstance occurs and instances where it does not.
[00042] The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.
[00043] The terms "about" or "approximately" as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, are meant to encompass variations of and from the specified value, such as variations of +/-10% or less, +/-5% or less, +/-1% or less, and +/-0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier "about" or "approximately" refers is itself also specifically, and preferably, disclosed.
[00044] As used herein, a "biological sample" may contain whole cells and/or live cells and/or cell debris. The biological sample may contain (or be derived from) a "bodily fluid". The present invention encompasses embodiments wherein the bodily fluid is selected from amniotic fluid, aqueous humour, vitreous humour, bile, blood serum, breast milk, cerebrospinal fluid, cerumen (earwax), chyle, chyme, endolymph, perilymph, exudates, feces, female ejaculate, gastric acid, gastric juice, lymph, mucus (including nasal drainage and phlegm), pericardial fluid, peritoneal fluid, pleural fluid, pus, rheum, saliva, sebum (skin oil), semen, sputum, synovial fluid, sweat, tears, urine, vaginal secretion, vomit and mixtures of one or more thereof. Biological samples include cell cultures, bodily fluids, cell cultures from bodily fluids. Bodily fluids may be obtained from a mammal organism, for example by puncture, or other collecting or sampling procedures.
[00045] The terms "subject," "individual," and "patient" are used interchangeably herein to refer to a vertebrate, preferably a mammal, more preferably a human.
Mammals include, but are not limited to, murines, simians, humans, farm animals, sport animals, and pets. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed.
[00046] Various embodiments are described hereinafter. It should be noted that the specific embodiments are not intended as an exhaustive description or as a limitation to the broader aspects discussed herein. One aspect described in conjunction with a particular embodiment is notnecessarily limited to that embodiment and can be practiced with any other embodiment(s).
Reference throughout this specification to "one embodiment", "an embodiment,"
"an example embodiment," means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases "in one embodiment," "in an embodiment," or "an example embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment, but may.
Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention. For example, in the appended claims, any of the claimed embodiments can be used in any combination.
[00047] All publications, published patent documents, and patent applications cited herein are hereby incorporated by reference to the same extent as though each individual publication, published patent document, or patent application was specifically and individually indicated as being incorporated by reference.
OVERVIEW
[00048] Embodiments herein are directed to systems and methods of detecting the presence of a target nucleic acid in a sample. In certain example embodiments, the systems and methods provide for single reaction (one-pot) detection of target nucleic acids. In certain example embodiments, extraction, amplification, and detection may take place under a single set of reaction buffer and reagent conditions. In certain example embodiments, detection is achieved using isothermal amplification (e.g. LAMP) only. In other example embodiments, detection of nucleic acids can utilize Cas proteins to provide improved reaction sensitivity and/or specificity. In certain other example embodiments, isotheiinal amplification may be utilized with a thermostable CRISPR-Cas protein, with the combination of thermostable protein and isothermal amplification utilized to further improve reaction conditions and times for detection and diagnostics.
In certain other example embodiments, detection of nucleic acids produced by isothermal amplifcation may be accomplished using OSD probes. These probes may bind accessible loops, or bind open areas of ssDNA generated by binding of CRISPR proteins such as Cas9 and Cas12. Binding of OSD probes may release fluorescence by separation of a quencher and fluorophore, either through displacement of a oligonucleotide strand, or through triggered degradation of an oligonucleotide via endonucleases such as RNase H. Advantageous quick extraction approaches for the extraction of nucleic acids from a sample are also provided. Design of reaction conditions and reagents are provided for the identification of primers and reaction conditions, including concentration and content of reagents and additives, that enhance the detection systems and methods disclosed herein. Advantageously, the systems and methods can be provided in lateral flow or self-contained cartridge devices for rapid, point- of-care diagnostics. In certain embodiments, the detection assay can be provided on a cartridge or chip. A device system can be configured to receive the cartridge and conduct an assay.
[00049] In certain example embodiments, the Cas protein may be a Type V
CRISPR-Cas, a Type VI CRISPR-Cas, or combination thereof. In certain example embodiments, the Type V or Type VI Cas protein is a thermostable case protein with a nuclease activity above at least 50 C. In certain example embodiments, the Cas protein is a Cas12b protein. In certain other example embodiments, the Cas12b is Alicyclobacillus acidiphilus (AapCas12b). In certain other example embodiments, the Casl 2b protein is Brevibacillus sp. SYSU G02855 (BrCas12b). In certain example embodiments, the Cas protein, may be paired with the novel guide designs disclosed herein.
[00050] Systems and method disclosed herein include approaches to isothermal amplification for detection of target nucleic acids. In certain example embodiments, isothermal amplification approach is loop-mediated isothermal amplification (LAMP). Design of optimal systems, including primers, reagents and additives to be used with isothermal amplification approaches are also provided. Optionally, CRISPR-Cas systems as disclosed herein can be used with isothermal amplification approaches, including LAMP, that can enhance sensitivity and/or specificity.
[00051] Methods of designing optimal reaction conditions are also provided. In an aspect, methods can comprise identifying the type of amplification reaction and designing optimal primers in accordance with the methods disclosed herein. Methods may also comprise identifying optimum CRISPR-Cas systems, including identification of the Cas protein for the reaction conditions. For example, the Cas protein may be identified based on its thermostability, cutting preferences, or other desired characteristics. Preferred guide molecules may similarly be identified. Once one or more primers and/or guides are identified, salt concentrations and other additives can be titrated and selected for further investigation. Additional reaction conditions, additives and reagents can be identified to optimize the use of one-pot methodology, lyophilization of reagents, and use in the devices disclosed herein.
[00052] In certain example embodiments, the system comprises a Type VI
CRISPR-Cas system, one or more guide polynucleotides comprising a guide sequence capable of binding a target sequence and designed to form a complex with the Type VI Cas protein, and a detection construct comprising a polynucleotide component. The Type VI Cas proteins of the present systems and methods exhibit collateral RNase activity, cleaving the polynucleotide component of the detection construct once activated by the target sequence, which can generate a detectable signal.
[00053] Embodiments disclosed herein provide systems utilized in multiplex lateral flow devices and methods of use. In certain preferred embodiments, the guides utilized are designed to be highly active guide molecules, allowing for rapid and highly sensitive detection of coronavirus. In certain example embodiments, the systems can utilize general capture of antibody that was not bound by intact reporter RNAs as described in Gootenberg, et cd., Science 360, 439-444 (2018). In other embodiments, the presently disclosed system can be designed for detecting two or more targets.
When utilized with a lateral flow approach, two or more separate detection lines consisting of deposited materials that capture detection construct and a molecule specific to the deposited material, allows visualization of detectable signal (e.g. gain or loss) at detection lines due to collateral activity and cleavage of corresponding reporter oligonucleotide. Utilizing guide design that allows for design of highly active guide RNAs for use with the specific Cas protein of the systems for target sequences, for example, coronavirus is also provided. In certain embodiments, the time from processing of a sample in the current methods and using the presently claimed systems, from receipt of sample to detectable signal is less than 120 minutes, 110 minutes, 100 minutes, 90 minutes, 75 minutes, 60 minutes, 45 minutes, or 30 minutes.
OS1) PROBES
1000541 The use of OSD probes is described in U.S. Patent Publication 2016/0076083, published on March 17, 2016 and in U.S. Patent Publication 2020/0255891, published on Agust 13, 2020, both of which are incorporated herein in their entireties.
SINGLE LYSIS REACTION COMPOSITIONS
[00055] In certain aspects, embodiments disclosed herein are directed to compositions and kits that consolidate lysis and amplification of target nucleic acids into a single reaction volume. In certain example embodiments, extraction-free lysis reagents can be used to extract nucleic acids from cells and/or viral particles. In contrast to existing protocols, extraction-free lysis solution does not require isolation of the nucleic acid prior to further amplification. The extraction-free lysis reagents may be mixed with amplification reagents such as standard RT- PCR amplification reactions. An example of an extraction-free lysis solution is described in U.S. Provisional Application Serial Nos. 62/970,125, 62/993,494, 63/018,487, 63/019,406 and 63/032,470, noted above.
[00056] In certain example embodiments, the extraction-free lysis solution is combined with amplification reagents into a single volume. In certain example embodiments, the amplification reagents are isothermal amplification reagents. In certain other example embodiments, the isothermal amplification reagents are LAMP isothermal amplification reagents. In certain example embodiments, the LAMP isothermal amplification reagents may include primers for the target nucleic acids discussed in further detail below. In certain example embodiments, the LAMP
amplification reagents include primer sets selected from those listed in Table 1 A and Table 1B.
Table 1A. LA NIP Primers i O t ACCIGITCC.ACAACTAC:TACAACA1ACCTGAAGAG6A -- GiCthraCTCTGT1TIGGZu..1 sG-_ CiGAGC TTC1 G
_ 'FAGG! GAAGAGGAGGAGC _ ¨ 1 - TGCUSCAUTGIITUaCIATTTACTAACTOMATITA 1TAGCACTiMiGGITCAA' ToGiGICATAitTATAAG TGCAGTAGCTITATACTIT ACT-A-ACTEATCATiTAA' GT7 AGTTFGGAG CAAFAATTGGCFC rrrc:Aci TGGAG
¨.._...
2 GCAGAGIAGCATCATCTAGAAAACCIGTATETTleAT-- CTIGT7 GGCTGITC:AfciaAfACT-TCG-AAii.AliCAC TAATi AGATTITTGAAGG IGTATCTTICATGAATAGG
GAATAGGCAAA GGGTG GCAGT CAM
3 TGGC:GAGCATAATAATATAGCACAFTGCTAAAACAGT GGTTIGCAACCCCTICCTAGAAATGIATAGCTICITT
GACTATTGAATCTITATGA TGCFAAAACAGTGGAAGC
4 CiGIGGGGAAACCiGACiTCGCGCACTCCACIATITCCT
CTCCCACCCAGC;CGGCATACCGGIGGFGCTACTAGC ATAMIAGCTC'AC_ACAGG
. CCC G CC
GC,ACTCCACTA7TTC.C1TX:C .
CIGCTGGAGAAGTGGGAAGCGGCTICACICTICAAG AGGAGAACCAACGFCCAC ACCAGAGAACICICOCAG
A
(IAA TCTTGC C A
S '1GAGCMAITTIGAAA1OCTACAATGCiGGAAGAGG
AAAGCAGMCIAAGTAGTGTGTCTITA I AATGAGT CAACTGIGTTIAGATGGO
AAGATGT CCAAATGTCCAT C
TGGGGAAGAGGAAGATGT
ATACACTITATACAACCCG CCTAATGTAAATGTGTTIG
CGGACATTAAGTAGACCAAC.ATTCATCKTAGCCTAC CGCGCAATTTTATTTGGCT
T TTAATCGAATT AT
CGCCTTTACGGATGACAT
AGAIGCAlTAGCTGAGTCfGCTGTC.AAAAGACICAT C1'GCACAMT1 AT TAIGA
AGGCTGGAGAAATAGAAA
AGAAATGC TATATGGATCT TGTGTC TGC
GGAITTCCIGTIAATCCAATGCCIATGTGGAGGTIGG TTGITCAATAAATAIGCTGACGTCiG 7 GTAATCCFCA
CCTGAAAAGGA.ACG1TTG
AAGGTA TCC.IGTTGAG G
TGTGGAGGTTGGAAGGTA
TGGTGA CTATTAAAT C.CAG
AGCGGGTATTGATCiGTGA
CAGATACCTTTTATCTAAA
AAATGTCCTCCJ'AGATGCTG
GCTGTGGACAGTGTGGTCAGTTTGGCCCCTTTTTAG CTATGTGAGCAATAATC. T
CATGGGGICAATATITTIG
TATTITIGCC CA GACT CC
14 AATGCCTGC.AATAAGCACGCTGTATTGITCAACAACA ACTGAMAATGAGAGGTMTGCTCCAGGCAGTACT
GGATTGTGCAGAAAMAT TGTATTGTTCAACAACAGC
GCTG AATTCTTCAAGA TGAC TG
GGACAACAAAGAGAAAGTCCITCGMGACATTTGCT ATCCTGTGCAAGAAACCGIGGCTACTIGTACCITCT
C.AGAGCAAGIGTTAGATT AAGACATTTGCTATTAGAC
ATTAGAC. TTTTGG AAACTATCA CAC TTTTGG
AGTCCAGAAGITGCAGTTGAGAGCTT ATTTT CATATCTAACCTAATAGAT
CATFG CACAGCCT GATGGA
GGAAATICCCTGGCATTG
GCATAGTAGATTACAGTG ATGCATTACCATITGATAA
13 GCGTGTCTTGGAGGTCGAGGAACCCCAGAAGACTCC AGGAGAGTCCTGCTAGGAGACGTGTCTC.AGGTGAT
ACTACAAGGGACTCCTCC
AACCCCAGAAGACTCCGTG
AGAGGAGGAAGGGAATT
CTCCAG TITAAGC GG
GArfIGTCCCGICTCCAG
CTCCAGGATTTACTCTACTAGGAGTGGTCTCGGGATT GACCC.ACAATTCCTGTTGACACTGAATGACTGCAGA
GTTAAGTAGTGTGATATA
GGTACT GCCAG TCTTGGT
GGICTCGGGATTGGTACT
21 ACCIGTMGATGTCCACTAGCAGGTTGTAACAAC.ACCT TTAGCAATGCAGAAGAAGGIGTAGTGCTAAACGTT
GGTCCTATAGMACTCCTT AGGTTGTAACAACAC.CTGA
GAF C TCGGITIG CIA FAG IC
CTGAATTTGAAATTGAAGCCCCFCCATCTCTGAAGC ClICAGATATCAA %TM
GGIGAC GGGTGAC TGTGGA
TGCACATTFTACAGGTGAC
GGACGTCCTAGATACAGTGAAACCGAGTTTGCAT TGACCCTGATGTAACAATG
ATGA TGTTCCAC , GATTTGAAAATCCCGCCTT A
GTATOTACCCACAACAAA GTACTCCTCTAGAAC.CTTA
ACCTTATGC CCAGTIGG ATGT TGC
CCGAATAGGMAGTATGGGTGACCTTCCATATACTIC AGITCCTC.ATGICTCAGGAAGTGCGATTCGGATCTG
ACGGATGAATATGTTAAA
CACGCTG GTAG GGAAC
ITCCATATACTTCC.ACGCTG
CTGGACATCCTTTGTATAA TAGGAGACACAGAAAATC
AATC:CC CI ATA MCC CAAAC CC:
27 GGIGTICCAAAGTACACATE.TGATAATAGGAGCTCA TAG
TE.GTTCTCTAGTTTCCAGTGATTGITAGTCCCCT CCATTIGAACCAACATCTG TAATAGGAGCTCAAGACA
AGAC:AATCA GAGCA A ATCA
ACCA MCAT CCCTTACGGGGCGACACr CTOMCTGCGAGGAACCA
29 GACTCCGCCAACCATCTGCCACCA.AAAGAAACACCG GGCCCIAGATTGGGTGTGCGTTTGGGGATACGTTG
TGCACCATGAGCACGAAT ACCAAAAGAAACACCGTCC
TCCG GCZ C G
TCGACA ACC GCACCGCMGCGAAA A
i 31 CACTGG1TCC.CAACACCTeCTAAACAGACMCGCCAT CCAMTGGCGGTCCGCTATAGGTCAATATGCGTCC
AAC.AGACAACGCCATCCTA
MAC GC , TCACCAATGACTGOCCCA C
CCATTGAC.AAGGCCGGCGCACGTCACCGGCCACA C TICGATCTACCCGGGCC
ACGTCACCGGCCACA
33 CCTCTCGGGACAGCCACTAGAAATGAL 0. I i I GCAGA
C.AGTGACTTEGACC.AGGGGTGGAATAGGGICGCTG ATGACTCMGCAGACAGG
CAGGG GTCCG ACATCP.ACCGGACCGCTT G
CCC.AGGGIGCCCAGAGTAACACGCATGITTCCTGA GCTCATGG ATGAACACAAC
ACAACG AGCAGT CGCAAGGCAGTIGGTTTG G
35 GGG6ACAT5TCC7C.CCTCiTCT1GCAATTGGACGCGG TGCTCCATTCCACTACGGAGCTGTAGATAAGGCCGG
ACATAGGCTCGATGCAGC
G AAGCGG G
TGCAATTGGACGCGGG
TGATGATGCTTATGGCCCA
CCAG GCCAAG GATCTGCGCCTGCCMG G
TTAGTTACCTIGGGACCGC AGCCCITATTCTCTGGIGG
AMTC.CCGTAGGCCGCCGGC6C1GGACCGGCAC1T AGTG GCCGCGCMCTCACTAT
CGCTGGACCGGCACTT
CTACCCTTG TCGTGAGCCT
40 ACAAGAC.AGGCCCTCCCGAGTACAGCGTCCCTAC.TGT 1 GCCCCCAMGTCATTGCGTTGCTGCTITGGCTACAC TCTIAGGAGGAAGGGTOA
CC 1 (XiC CCi TACAGCSTCCCTACTGTCC
TAACACCTGAACAACCCCCG GCTTAATCC.ATCAGTCGai CGCGG 1 IC , GCCAGGGCTACAAAGTGC 6 AAATGTGATGAGCTIGCCGGGACCCCTATAGTAAG
GAA ' CAACGGC CCACCCCAACATCGAGGA
CTTGGCAACGATGGGGAA
43 GCGAGACAGAATMGCGGGACGACTTCTCCTTGGA GCTCTC.AGAGACGTGGGAGGACGCCACTGGACACG
CCCCA TATC GTGGCGGTTACCCAGACT
GAMCICCTIGGACCCCA
TGGAAGAGTGCTCCAAAGCTCCTGAACATTCCGCCG TACTGGCAGTGTTGTTGTG
TGG ATGG CGTGGCCGCATA MCAT G
AGTCAACATOCTGGGTGGGIGGCCGAGACCACGA A TAC.CTAGCTGGGCTATCCA
ACTGCCAGGTAATCCAACT
. AACTG AGTCG C 6 TGAGGATITCCACGGGAGG GGTCCCTAGCACAGAGGA CIGGCAAACCTCTTACCTG
CCIGC (X; AA T C
47 ICCCTGGAAGGCrGGGAAGTACTGCATGG I CC TATC
GGCATCATGAACACCAAGTGCCCGCATCGATCCGTI
COACT CTTGAC GAGGIGIGGGACIGGGIT TGCATG
GTCCTATCCGACT
45 .ACACGGGCACTTGAGGTCCTTCGTCACAATGACCAGC TCCTICCCCCGAATTCTTCAGCTGGGTTCAGGGGCG
CGGAGTACGTCGAGATCC TCGTCACAATGACCAGCAT
All MSC r T
A ICCACCACCCAGGCOCiPAGGTGCCGTGGAAACCGC CCC
CGAGIACGCACCICCCCA
SO TGGGCGCAAGACiAT CTGACT I CC I GCCCACTCAGCCA CT
AC.AGGACATGACCACOCCGAGGCAGGATITACG MICA GTAGAGGAAGC 16 GAT CAGAAGA CT
CTGCCCACTCAGCCAGAT
51 CCICTCAGI GCGGAT6TCITGTGGC 11CICAT A rCiAC
TA1CAGIGCT6C:AACCICGACKCCACGTAC:AAC.CIC GICCAAACGCACRX.CAT GGCFTCICATA
IGACACCC
. ACCCG TCGGT A 6 6 .
52 CGAGAGCAAGGGAG ItGAACGGGAGTACCIGGTC
TGACGIGCTACAICAAAG
GICATCGCCGCAGACCAGGAMGCGGCTGCAGCAG ATTGCCG CC
AAAGCGGCTGC.AGCAG
CTGGCCGGCTCGATCflICG IGATCCACG TCCGCCT 1GCGCTUCTALTIC TICA
TTCAA C GGGCAGGGCTCGCATA A
54 GC1GA17TICCTCC-T6TI ATICiCT GAT TECTAAAAAAT
AAGAATGAACATCTTCCCCTAA1GTTAAGCACCTCA TCiCiCAC.ATGAAATACTGA AT CM
TAAAAAATG TEGCT
GTIOCTCCTG TAACGATGT TTG MG
55 AGTC.TCCAGCTGMTCATTTGCTICTAGMGAAAAA TGITGriCiAGTGGAGITAATGAACTTCTICAACCATT
GCGACATGCTGAATAAAG CTTCTAGTITGAAAAAATC.
ATCAGAAGG TCCATT C AGAAGG
56 GTGICIATTGIGTAGCC TGITOCCAACA 11 CCCAT ACA.
CGIGATIAGPACACACGAGIACICCACATCCTGIAA A TACAGGCAGCAATTICA.
CCGG CATCAGAAA AC
AACATTCCCATACACCGG
57 CATCiGCA11CTG VGACCCr GC Cr TGGATACi AATGGA I GACAAAT
IGACICAG GGGAGACATATIGTIGIGTTC AATTGGA TIGTGT ICTCiG CTT MCAT AGAATGGA TGA
GAAGAAC AGTGCC AG AGAAC
55 CG1TTCCACCTACGGGTAAAC.AAACTTGGCIAAAAAT AGAAGGCCAAACTA 1CPAAIGCA
TACTGATCCCI-CC TCAGAGGATTIGI ATTAGI AAACTIGGCTAAAAATATC
59 CCCATCCCATTGTIGATCATATRITCiAGICGCMGAG GCACAAAC AGCCA7 ACAA
ITATICAATCfCCCCIGT GGATT TCri A-Cc:wan-re;
TGAATG GGCATTT CAATGG
60 TTCTGGAATATGCAGGTITCTCAAAAGAGATGGICrA
ACMCATAATGGACTZTGAGTACACAMTGTCCIAC GCTATGGGAAAA CACTAA AGAGATGGTCTATTAGTAG
TTAGTAGCAG . APAGGGATTT AGGA CAG
61 AAGGTTGCAAC.ACTTAGCGTAGAAAGGAA CA GATCT
TTTGAGGCTTCICTTTAACAGTGCGTGGGCCATAGCT GGAACTCATAGTTGGAGA
APAGGAACAGATCTATACT
ATACTAAACAC TCAAG ACC AAACAC
TTTGAGGCTTCICTTTAACAGTGCGTGGGCCATAGCT GGAACTCATAGTTGGAGA
APAGGAACAGATCTATACT
ATACTAAACAC TCAAG ACC AAACAC
62 GCATATCTGGTCCECCCICACCJAAAGCATAACAATG ATTGGATACC.ATGCCAATAATTCCACACAGTGACAT
CAGGGGTTATACCATAGA CC.A.AAAGC.ATAACAATGGC
GCC TCCGCTC CM C
CAGGGGTTATACCATAGA CC.A.AAAGC.ATAACAATGGC
GCC TCCGCTC CM C
63 ACTGTATCTCGGG TMCiTTCTCCCCAGAA TGTGAT A
TTGTGTTACCCAGGCAGCTTATGMCACGCTGCTG CCAGAATGTGATAGGCTTC
GGCTTCT AG GGATGGCTCCTTGGAAAT T
TTGTGTTACCCAGGCAGCTTATGMCACGCTGCTG CCAGAATGTGATAGGCTTC
GGCTTCT AG GGATGGCTCCTTGGAAAT T
64 CGCITGTATTGTTGTACGATCCTGTCTGGC:TGACACG AAATGCSAATMTTIGGGGAGTGCATCTGGTACAAT
GTAAACCATCATTCTITAG
TAM GCTC.TTTGT GM
GTCTGGCTGACACGTAAA
GTAAACCATCATTCTITAG
TAM GCTC.TTTGT GM
GTCTGGCTGACACGTAAA
65 ATCCCTGAACTACCTCTTTTCGATGGGACACCATAAA CATGAAGACAGAAGGAACACTTGAGTGTAMATTG
GAATGGAATTCTCTTGGA GGGACACCATAAATTTTGA
MTGA GA CTCrrA AAGGA CCC GA
GAATGGAATTCTCTTGGA GGGACACCATAAATTTTGA
MTGA GA CTCrrA AAGGA CCC GA
66 IGCCATCCTCCTICTA TAAAACCAGTTCCCCAGATTGA
AGGAATGGFTGATGGTTGGTATGCTGCTGCATACCC ATCGGAGAAATPGGTCTT GTICCCCAGATTGAATCAA
ATCAAGA TGATC AGC GA
AGGAATGGFTGATGGTTGGTATGCTGCTGCATACCC ATCGGAGAAATPGGTCTT GTICCCCAGATTGAATCAA
ATCAAGA TGATC AGC GA
67 TITGT6GTGIGGIGITATTGTGGCAAATAAATCAATT GACTGATGATCACAGACATGAGACCTGIGTGTTATG
MACTTGACAGAAGATAA GCAAATAAATCAATTCAGC
C.AGCC.G ATGICTaGG AAATGGG OG
ACACICAACAAAGATCAACITC7GTACTATC1GC1GT CATAGTICAAATGGAGa: TTAAGGAGAGACATAAGA
AAGATGAAAG GCTCCG TG TGAAAG
69 TGGGTAGTMGCCTITATAACGTITGGATAGATCIGG GC.CAACAGCTICTATGAAG lb I I
GTGCTATACC.AAA TATAGCGGCATTAGTAAT GGATAGATUGGTOTACA
TCTTACAGC ATGAACAA , AACCA GC
70 CCCAITATGCCTAGGCCAGCAGCATCATTATTG us. I II
ACAGAGGTACACCAAGGAATCAATTGAGTTGTTC.A TTCFACCATATATTGAACA
AGCATCATTATTGTCTITG
GACT ., GCATATGC ACCCAA ACT
ACACATTAGTAGTAGCGAG
AGCGAGTG ' ATTCCTGA ATGCTTC TG
72 GITTGITTGTTGGTC1TCTGTTGGTTGA1ITC.IGATCA
CAACTCACCCATCC.AACCAMCTITTTCCGGGTGGC TGATAGTGACMTGATCF TGATTICTGATCAGTTACC
GTTACCAATC TAGT ATCACTT AATC
73 TaiGTGTGGATATTTGTTTCACTAGCCAGC.AGATTIA AAACTCAAGAAGTGCAGTGC:TAGAAGGACACAITG
CCAGCAGATTTACTTATAA
CTTATAAAAGAAC GCACAT GGTGCCCATGTTCC.AATC AAGAAC
CATCAGTGTCAGAAATAA ACACACTTGAAAATATAAC
ATATAACAACC GAO I I ca I GGCFT AGATC7G AACC
CCATCATAATCACCAACCC TGTGGITC.AACTAATCAM
AATCAAACA CAGATATTGTGA IC CA
TTAAGACTAACAATAACG
TCCAAAAACAAGGACCAAC
CAACAACAACCAAAATAC
AAACAAC TTGGTAT AACC
AGCCCACGACAAAACAAC
78 TTCCCITTTGACITGIGTGITCTATCAGA/sCTACACTG 1 CCCTCCACTCAACCTCCTCCTGTGATAGGTACTCGG TCAAC.ACCACCAMAC.AA ATCAGAACT.
ACACTGCTCA
i TGITGITF CGTATTATTGCAAAAAGC CAAATCAACCAGAATCAAA
A ICAAAATCAAC 1 GA , CATG ATCAAC
AGTGTCTIAACCAGCAAAGTGTITCACAATAGGTAA CTGTATCTAAGGTC.CTGCA
CTAGAAGGGGAAGTGAAC
AGTGAACAA ' CAACTGTTT C AA
CTATGTACAACCAACACA
TGTTT TCACAAA AAGG
AAGGGICCAACATCTGM
TGITATCATTAATTGCCGT
CRAG TAGATGGGITG TGG
ACTGCTCCIATACIGCAAG
83 CACTCCAACTACACCGAGGGAAATAAGTGGAGCTGC GCCATGAGCAAACTCCTCACTCTTCATTGTCCCTC.AG
TAAGTCTATGGATAAAAG AAATAAGTGGAGCTGCAG
. AGA TIT CATCGA A
GAATGTGATTTATACAACT ATCAAAGTTATCTTAACAA
ACAACCCT TaCIGCTAA GTCT TAG GCCI
GAACCTACATATCCTCACGGGCITCTCTGCITTA TAA TCTGAGAGACAACiCIAAA CAATAGACTGGCAGTTACT
G rr ACTGAG AAGGG TM AT TAC GAG
GGTGTATGCATCTATAGA CTTAGCATAGGAATICTFG
AATICTIGGG AACITGAI AATTFG TAACAAG (31.3 ATCACTCCCCTGTGAGGAA
CiGAAC CG GGCGACACTC:CACCATGA C
88 ACCiATC IGACCCiCCACCCGACC:AAACCi TAACACCAAC CAGGGGCCCTAGATIGGGTCi TGTGGGGA TAGGCCG TGCACCATGACiCACGAAT ACCAAACGTAACAOCAACC
CG ACGTCTA C G
89 16GCACIC:GCCTCCAACACiCiCATCGATACCC71ACG 1 CGCAIGGCGTCCGGGTFCIAGAAAGAGCAACCACiCi CGTAGGTMCGC:AATTIG CATai ATACCCITACGTC/C
, CiC.C. AAGG 6 6 .
CCIAGTAITGTGIACGAGG
TC.GAGGClITTACCCTCGCGGCCGATGC.CATCCFGC C3 CC.
GCCGATGCCATC-CTGC
91 'FIG 1 GGGATC:CGGAGCAGCTTGAIGATGAACTGGT C TCAIGGACATGATCGCIGG
TGCCC.ACOVTGGAGAA ATAACGGGTCATCGCATG TGATGATGAACIGGICCW
92 TGGCiCAAAATCGGTAACCiCGTTICAACTCTTCAGGCT
TCAGTTATGCCAACGGAAGCGGGICTTGGAGGGTA 'I-KAM:ICI TCAGGCTGTC
GTCC GITiCCA GCTGGTTAGCAGGGCTCT C
TCCTTAACAACACCAGGCC
GGCCA GCAG TACCTACAGC1. tGGGTGC A
GTACGIGGGAGGGGTOCi CG ACAC A
GC.ACAGGCTGGAAGC6 95 ACAAGP.CCGTGCGTCCCGGCGGC1116GAGAACCFC CC
ICGTGITCTICRICITTGCGGAGGCCACATCCCCiF ACTCATATMCAAGCGGA
OGGCTTIGGAGAACCTCG
ACCC CIA C.GCGATGCCGTCATCTTAC
TCITGTAGTACACCCGACCC
97 CGGGAGAAGACGACTGGlICCAAGACTGGGCGCAC GGAGACCAAGCTCATCACGTGOAAGCCGITGATGA
AACCATCTCACCCCTCITC
AACG TGTCACC G
AGACTGGGCGCACAACG
98 CCCGTGGTAGACAGTCC.AGCATAACTGC:TACCCAAAC
GGAACGAGGAC:CATCGC.ATCACAC.AAGGTCTTGGT GGTGAGGTCCAGATCGTCi AACTGCTACCCAAACCTTC
MCC . CCACAT T C
99 N \ AGTCCACCGCTITAGCCACTCACGCCGITIGGCCTA
TCCCTGTGGAGAACCTAGGCIACGGTGGAGAGGACI
T TTGTCCG GGGGGTCCGCTGTTGT
CACGCCGTGGGCCTAT
AGGAACTTGCCGTAGGIGGAGTCCTMTATCAGGAC
CGGGTGCTCAGGAGGTGCTTAATG TGGCATCCGTG C-CTAATATCAGGACCGGG
0 CGGGGT GAGT AAGGCCOµTGGGGTTGAT GT
10 CGAGCTCGTCGCACTTCTTCTGGTGATCAAGGGGGG GAAGCTGGICCIC.ATTGGGCAGACAGACACGICAAG
GGTGATCAAGGGGGGAAG
CAGGACIGGCAGGGGGAA
CCATCGTGGGACCAGATGT
TGCTTGAACTGCTCAGCGAGCGATGGAAGAGTGCTC GTCCCGCCATGCAGAGGTTATAGACCTCGAGTTTCT
TGACAGGGAGGTTCTCTA GA IGGAAGAG FGCTCTCA
4 'TCAGC GCCAG CC GC
TCAACATATTGGGGGGGT
5 (31(16 G ACTGGCCAAACCCITXT. CT 66 6 TGCATCGCTCTCCGGCACGCTAATAGCCTTCGCCTCC CGCCA:GCCITCACTGCCATATGCAGTCGCCTCAGGAG
CGAGGGGGCAGTGCAA GCTAATAGCCTTCGOCTfr.
10 GTCCIAGGACCGACCATCXTCATTATGCACACTCCiCT CTGCAGGAACATGTGGAGTGGGCAGGAAGGGGAG
7 GCC TACAGGG , TATAGGGGGGTCTGGCGA
TTATGCACACTCGCTGCC
TTCACAGAATTGGACGGG
GCGCCTACACAGGITTGC
10 AGCGCATCGAAGGAGTCC.AGAACAGGAGATGGGCG
TGTGGCAGAGGAGGATGAGC.GGCGAATCTCCGAG AGGC1. AACCTC.CTGTGGA
CAGGAGATGGGCGGCA
11 GCCGGAAGTTGAGGAGCTGCCCTATCIACTGCCTTG ACGACAACATCCFCTGAGCCCGGGGGC.ATGGAAGA
GTGGTCCTCACCGAATCA CCTATCTACTGCCTIGGCC
O GCCG ATAGGAC AC G
11 CTITGGCAAGCACTGCGTGAACTC.AACTCGTTGCTACG
AGACTGC.AAGTTCTGGACAGCCITTICAC.GCCGCTG AAAAACTGCCCATCAACG
CAACTCGTTGCTACGCCA
GACATCCGTACGGAGGAG ACCAATGTTGTGACCTGGA
ACATCATGCTCCTCCAACG
11 TGCGGCCACCCTATTGATTTC.ATTCAAAGACTCCATG TGCCTCAGA/VACTTGGGGTCCTGGACAGAAGCCT
GCCTGC.TACTCCATAGAAC TTCAAAGACTCCATGGCCT
S GCCfC AGCGC C C
11 CCCATATTCTITICATTGCCTrTTCrACTTTGTGCGAC.A 1 AGATCCGAAAATCGAAACGAACACATGAAACAGAC AGGTACTGATCCAAAATG
6 ATGC1 1 rt CTAAGI GC GAAG
GIATGACTACAAGGAGAACCGATITTAGTATGTGTG TTGAAATAATCGAAGGGA
7 CIGACACT 1 AACTI Ca: LC , GM
GAACAATGGCCIGGACAG
AAGAACATGAGGAAAACAAGCCACTACTITCTCTGG ATGAAAAAGGCATAAACC
TT TGGA ' TGCCATA CCA
AlTACCTCCTGGCTTGGA
GAATTCAACAAGGCATGC TTGACAGATTCAAGTTGGA
9 TGGATTG TATTCAGTAGC G n-G
AAGGAAGACGGAMACT TTATAAAAGGAAGATCCCA
O AGATCCCATTTGAG TCTC AACC TTTGAG
1 . CCT TCA ACCTTC
GCGATACCTAACTGACAT TATTGAGGAATTGTCAAGG
2 GGGA CGCITIC (SAC GA
TTGAGGATGICAAAAATG CTCATCGGAGGACTTGAAT
CATGATTTATAAAATAAG CGAAACCGGTACATATAAA
TACATTGCTGGAAGCCAT
S MCI 1 11616(1 A
AIGCATGGAAATG1 COM' 12 GCTICCACAAAAAACCATCCATTACCATC/GaiGATAA I AA 1 TCii AGAACGIAAAACAGGGGACACTGTCCTCC
CATGGCGGATAACC:AAGG
GGGGCTATATCCCTAGGACCitiCITGGGCCTFATCAA AGATGTAGAGGC.AGTGCA GC1CAC7CCGTA
TOTAAGTC
7 . AGTCC 1, ACAGCC AC C
=
12 AGGATTGCCATCTAATGCATIGCTOCTACAACACAAT TGTGC1T1'GATAGAAAACA 1 TGCTACAACACAATGCTGG
12 Ci MT TGCTAGGG ICTCTATTCFGTCAMITTAGCGCT CCAGGAAAGGCTICTAGACC
TCCAATGTICT AITTG GGAAATCATICTITAAAA
IGTITGICCA
CACITGCTGTGTCJVAGC GCACAAGGCCATTGAACT
13 CCAC.ATAATACACTTTCiTC1TTCTGCACICiTGTGCAGTA
AAAAGGAAGCATCTAACiTATC1CIGGAGAAAAAATG ATGGGGAGGGATGGACT AGGTGTGCAGTAATATAG
13 GGGIT ACCCTCATATACAGGA 1 TGICAGGACCMCC AGAIGT i 2 TTfTTAC aiGGAT CGTTACAGCAAGTGCCTG
CAGGAOCCTGCC.TITITAC
ATITCCACAAACACCCATIGCTATAAGIATAACTAG ATACACCTATACCCACA It 3 ATCC.1 GATGCAGGTAA TAACTGCCTCATCGCCAT CT
13 GAGGTAGCTACATCAAGGGGTAGGCTATGGATTITA ATCCTGA 17 ACCTTAA.A.ATGGCT
4 AAACCCT GCCTTA GATFTGG Cr 13 CCCG Tr GTAACCAGT ATGGCTTATAACTCCTAG iGGC
CCCAGGGTCAAAACAAIGGTATTGGGIGGTATCTA CITGAC.AGITCIAT T1 ATG
AACTCCIAGTGGCTCTATG
6 CTATATTCATG . CCTG CAGC CATG
13 ACTAC.CiTCCGCCGTCTAGGTGACICGCCGGACAAAG
7 TCAGGAGGCACCACAGGCGCCGC.CGO.CTTACCTGA G GGCTCGCATCTCTCCTTCA
CCGCCGCCTTACCTGA
13 ACGAGCCCCCAACTGTGTACTACAACTC.TGCTGGCTC
CGACTGACTGCCGGCTT CAACTCTGCTGGCTCGG
13 CrGGGCTTGGGTTTGGATGAGTCCCTCC. TAGCCAGCC
CGCCGTCATCCTCCACCCAAACATAGGGAGGGGGG AGAAACACCGGTCTGGAT
CCCTCCTAGCCAGCCTAC
O TGC GGGAAG GC
GGAAGCCCOCAGTTTATGC
14 GTC.TfTGGCACTCCCTGGCAAAGGCGAGAATACCAG CCTTCCTGGGCCTCTATCCTCCGAGGCGTTCTACGA
TGICCAAGCCAAC.AATCC AAGGCGAGAATACCAGCA
CTAGGAGATAMTTGCGG
OCTGGACMCCAAAGAC.A
TACCAAAAACAACTGGGC GGTCGTGATGCCTIACAAC
GGAGGCAGMATATCGA
CCCACAGCAGTGTAACTA
GCCT TCCA , CG , CCGGCACTAGATACGCCT , AGGCATACTGATOCCCCIAGACCCGCAGCTGCACTAA TATGATGCAGTCCCCAAG
CTGGGCGCTACCTGAACT
14 IGGGGTAGGGGGGCATGTAGCC.ACTOGGTGTICCA GGGGCAGCTACITGCCTCAGTGGMGGCAGAGTA
CCACTOGGIGTTCC.AGTC
GT CATTACCTTCGGACCCI. TG
AAGGCAGGTCCTCTCAGG
14 AC.ACATAITGAGGAAGTCFTGGGAATAGGCAAGCCT GCTATTC.GCC.ATACTACCCATOCATAGGCTITTAA
GCTATTICCTCMGCTCC TAGGCAAGCCTAGCTACAT
9 AGC.TACATAA GAATGCCATT A AA
GCGCTTACCGGGATGAGAGCATGGAAAGGACCACA AGTTCTGCCOMTGGATCCCGTCCGACGGOCITGG
TCCTGGTCTTAATAGCCGC TGGAAAGGACCACAGGAG
O GGAGG G C G
15 GGGCAGGATTGCAGGGTfTAGACATCCTCGGTGATT GTTC.ATGGACCCICGACCTGCATTAGGGCAGGGIG
CATCCTCGGTGATTACAGC
GCGGAGGCTATTATTCAG CTCTTATTCAGACCCITGTT
15 GCGGGATCCTAGCGTGGGAC.ATAACTCCCTCATCCTG AGCTATAGTCTCCMCCCC
CATAACTCCCTCATCCTGCC
15 GGCMGAGGCCATGIGGITGAAACCCTGAGTC,ATCC 1 ATCCTCCTTCTCCTGC.AGCAACTTGGAAGGCGAGGT AAACCCIGAGICATCCCTG
t 1.5 AGAGCGGGAGAAAGAGGAGGAATGCGCCGTGCCT1 1 5 CF 1 CG , GCGAAGICAGCGGCC T
*FGCGCCGTGCCI TO' TATCAGCCCACTTCCC.AGGGTTCCAAACACGTAGAC AGCACAGTTTCCTCCTCCT
CCAAGGATAATAGCCCGTC
IS GGAGGGGAGTCGAGGGATAAGGGAGCATCAGATCA ACCCTTAAGGTCCTTACCCCGCGAGGAAGGAGGGT
GAGCATCAGATCACCTGG
TTTGGACATTTACCGATGG
8 GATGGC AAGGAAGA crfCCG 17CCACTCAACC C
GAAGGTCAGGGCCCAGA
9 . CtC CATCMG A 'FAAGGCTCT
GACGTCFCCC
ATAAAAGCGTGGGGACAG
O ACAGII GC CAUGGCACGCATACGGCT
TF
TATGGAAAGAATAAAAGA
1 C:TC Alf:CACTI ACTAC:GG
AAGFCGAAAGGTTAAAACAIGGAACGGCGTATM GAGTAAAATGAGTGATGC GOCAGATCGAGTAATGG
2 AATGGT GACITGOTI CC: T
TGGTMCATACATGTTAGAGAGAGATTGTFCCACCA GTAATCATOSAAGTTG ITT
GGATACTAACGICGCAATC
16 CTAGTGGA I CTGCTGATACTGCTC:AAAGTCTAAT TAT A I
TGGAGAIGTGCCACACiCACTC1CCIAAGAATOCC AAGTC1AGGAA TGATGATG CAAAG it-f AATIATTGCAG
4 TGCAGCC.A ACC TTGAT CCA
16 GCACCCITATITTCAATGTTIGGAGACitaiGICATCA GGAAAAGGGCAACAGC:TATACTCAT TGACTG
5 . ATCAAGA CTCITCC GA A
=
AATTGGGGAATTGAACATATCGACACTIGGAGTCAT GCA MAI AAAAGCAGTIA GACCF
GAATTFCGTTAAIA
ACAGGGGACAGAGAAACFGACICA.GGGCCATTAAT GAGAGGGIAGIGGIAAG
ITGACCGOTITTGAGAGT
16 CCM:ATI-Mr GGAATAGAGICCf TCTITACi TTCCTA TGIACF
niCiCiACATITGATACC.ACTCF AC !TICCIT TC1CiAAITICiAGCCATT MA
TCTTIAGITCCIAAGGCCA
16 TCCGGATGTGCCTICATCTGAACAATTC.TCGGAAAGG GTGGAGTCCGCTGITCTGAGTGATGCTTAATGCTGG
CAACAAGACC.ACTAAGAG AACAATMTC:GGAAAGGAT
TATCAGAATGGACCAGGCGATTCACA.C.-MAAGTICG C.T.11-FACCi AACC.XIACATG
crrGAGGAAATGICAAGG
O GGGA CITY A GA
17 ICCATTCAAGICCMCGATGAGC11CCAGGACATACr AACACAGT
MGAGICTCMAAACTCCCATMICATIA GAAATTICACCAF iGarr CTTCCAGGACATACTGCTG
17 TCCATOGCTGIATGGAGGAICICCAGCGCAAAATGC AC:AGGAACAGGAI
ACACCATGGCCACITMCCf TIT CCGACICTACTIITCCIAA
AGC.GC.AAAATGCCATA
17 ACf MCAITCHTTCAAGGC:AIGAATTAIGGCf ITCCTI
CTAACTCAAGGTCGCCAGACTIATGGICfTGGCCAA CAGACTGTGITCF AGAGG
"FATGGCTITCCITGAAGAA
GATAACAACCCACTTIKAA AAGGAGAGTAAGAGMAA
4 AGACAACAT . CTTTTTAACT AGA CAT
17 TCTCACAACATTTGCCAGITTCIGGCGAAAAGC.TTGAA
CTITCACAATCACTGGGGACAACCATCGCCAGGAAC TCGTATACTITGITGAAAC
GCGAAAAGCTfGAACAGT
TGITCTCAAACAAAATGGC
17 AGCAAAATCGTCGGATGATTGGC.AMAGA AATACAC
CTCATAGTGAATGCACCAAACCAAGGTCCTGTAGAA AGICTCGATACTGAATCTT CAAAAGAAATACACCAAG
7 CAA.GACAAT TCTGTCC GGA ACAAT
TCTGGAGTAAATGAATCAG
8 GAATCAGCT TATAT CTACCCAGCTTIGGAGTG a GCTGGTGCATGTGGGGA
18 GCC.ICGAGCTTGGGCTTCTCCTrTrACTCGGAGGCCA TATTGCCACGCATCCGGATCACTGGGCT
GGGGTGG
O CGG TT r 18 AGTrTCGGGGATTGTGAACCGTCAGATCACACGCTCr CAGATCACACGCTCTTTCC
CATGCTACGGCTCGGC
18 AGCAGCGTGGCGGCGAAAGIAGAC.GGGGIAGGGAG
TCCAATAAACCCATCOCCICGCCACGGATTCCTGAG CCCAAGGATAAACGTCa: GIAGACGGGGIAGGG AGA
18 GCAGTACCACTACCGCCAOCC.AACGGATAACGCGGT GGGACGGAGGAGAGACGATGCAGGTCGGACCATC
CCCAGGACCAGCCTACGA
AACGGATAACGCGGTGGC
ATACACCACATCG17GGATAATACGGACGCGTTA AT ACACTACGTC1TATAC.AGC
4 CTCTCC CGAACTTTC Tr C.AACTCTTACTGCTCTCTCC
18 CAAGMGCGACCGCAGTGGGICATAAAGCT. GTTGAC
C.GTTATCATCTGCTC.CCGCTGCCGGCCGCTGAAA AA TCATAAAGCTGITGACCCIG
CGGC AGAGT CGACGTCCCTGATTCCCT C
18 GCGTGCTGCATAGCTTGGCGTACGGGAGATACAAGG CGGATTCC.AAGTCAGAGCGGCGGGTCGGTCCAGGT
TCGAGGAGGACGAGTGG TACGGGAGATACAAGGAG
GCAGCCGGC.AGAGGAT
8 GG TCGT TTACACAACGACtCCAGC TG1TACA
TCGCCGGCGG
CCGACGTGCACGAOCT CACTCGGACCGCTCCT
O
CCGGCGGCTGCAGITGTTGCGAACTGTGCGTGCC GTA TGCGAGGAGGACGAGC GCGAACTGTGCGTGCC
19 CGCCTCGiiiii..CrC11CGGIGCCTGCTGIGCTACTGG
GGAGGAAACTGCCGCGGGATCTTCCTCAGAGACGG
CCTGCTGTGCTACTGGTG
19 GCGCCICTCTCATGGATCCACTGGTGAGCTGGGCAA GC.ACGCTGC,ATGGC:TAAATACCC-ATGACGCGTTCG TGAGTCCTTGAGTGACCsG
GGTGAGCTGGGGAACAGG
'FICCACGCTG1TTIC6AAA
3 AC GAIN: CGCAGCAAGTGACCGC
GCTTAACCAACGACGAGG
CAGGTAGCGCATGCAGTG
TGCGCTICAGATCGTGGA
6 GCA flG GATGAGTrCGCGCTCCTC
GGGCTCCAGAAACAGGCA
GOCGICAACAGCGT GCCT ITTGGATGGGC1CCGGGI Cr ATICTa:CCCTCCCICTCG
TGGATGGGCTCC:GG GT
8 MT ACATGCGCCCiGGCACTGATCGAIGGGICCGCTICA AGIT
CGGCTICTTGCTCG AACtCTGcfGCrCAcr6T
CAGGACGIGGIGACCICG
O CGACTGOCGCATGGT
ITCCICTCACCAAGGGCGCCICT GTG GIGO GCAACCGCICA TC:ACCAAGGGCGGGCT
1 CiGC CiccI TCCITCIGICACX:GMI
CTCGGACAAGA TCACGGC
20 CCCTTGAACATCC:CGCAGCACCICiCeGGGAAACGAG CSGCTGGICGGTTICIGACTGGCGGIGT1ICCAGCA
CCTGCGGGGAAACGAGA
20 GCGCTCCAC:GATCICGGCAIGTGCTAGACCGTACCG
3 CiCGCGCITCATTCTGGCCAATCTGGCCGATGGC7GTG AAT CAAAGGCGATGGGTTCCG
TaGGC.CGATGGCTGTG
4 GCTCGCAGTAGTGGCGCACGTGTGCGCGCTIIT (CA CSGCAGACGGCTGIGTC
CGICTGCGCGCTCTr 20 AGIGai ATACGGGGCTGGGAGGGCIGUCCAACI7 GGCCTIGGIGGACTICCAGGCGGATAATCTCCTGG CfGCLCGAGATGTIGACC
TG GCCA C
AGGGCTGCTCCAACTTTG
20 AGAGGGCTCATCCGACTC1GC.CCACCICTGATGTCC7 6 AAGCAGCCGCGTCTTAAGCGATTGCCIGGACC.TGCGC celoG GCAACAGCCGCGACGT
ATTGCGGGACCRIC.00 GGTGC.ACAGCGACAGTCi 20 TCGCGCATAGIGIGGAAGTGGICGCCCACC.ATGAA
8 TACCAGCGGCCCATGGTGCCGCCGCGGTAGAGCAT GGATTT C.CGACAGGLICTCCACCiT
OGCCGCGGTAGAGCAT
CCCCMCGTAITCTIGICGCCICCACCAGaGGAC TA GGAATTIICCCMGCGC A
GCCGCTCAGATTCCAGGA
21 TGCTOGGCGCCAGGAAAAACITCATGACOCCCAACi TGGCGCAGTTCTGGCACGGC.CACCGGGAACAAAGC.G TGCi ACCiTGGGAGCGC-ACXT
21 AGACGCGC.01 ACTCGGAAACACTACA TCAAAGGG C
ACCTGACGGIGAACCACCCAAATCGCCCCICACGTC: ACTACAT CAAAGGGICCIG
ACGCGIGTTGTTACCCACT ATCAGCAACGTGTTATCGG
21 CGTCGGGCACAGACCC.ATGGCCIACCGGGATCACGA
GCGACGGCGAACTAACCA
GCAGTACGGGGGTCGA
GAACTC.GCGACCCACG
GGTGCCGCCTGGAACA
CGAAGCCCACACTGACG
GGCGTCCGAATGGTGGGTAGCACGCCGATGTGCAGC GC CCGTGTGCAAGCCAGC CACGCC-CAIGTGCAGC
CGCAAAAACACGC.MC CGGGCFCTCGFCCIGT
AAGCTACAGCGCTCGCGGCACTGGCAGACrTTGGT OCAGMCAGCTITTGIATG
O CAG CG TGCACGAACAGCTGGACC
GCTTCCTGCGAC-ACCAG
22 CGCAGATTCTCGGCCAAC.AGCTACCCACCACGCGTTT
C.GGCTTGAACGAGC.GCCTIGTCGCTGCTGATCTGAC AGGTCAATACGGACTACG
CCGACACCGATCACGAAG
TCTCGCGGCCCTICAA GGCACCACCGAGACGT
TGATCCGCACAGCTCCG CGGAATGTCACAACGCCG
22 GGCTGCAGCGCTrECAGAATTGACATTCACGACCTAG GTIGGACGACAACGCCAACCGCAACTCTAGCATITT
GAC.ATTCACGACCTAGACG
ACGC GGCG GGCCTCATCACCAGCATC C
n ITCGGGTAGCTGCAGGCCGGGCCTTGGTACAGGAGC GTGCAGCAGCGCCTACAGAGCGTTTCCTCTTCCTCT
GGCCTIGGTACAGGAGCA
CGAGATGCTCTCCACCATC
OGCACCAACCACTGGAGG
8 CAGTCCGCCGCTCATTTTGTGTGAGCACGGCGCAGA T CAAC.GCGACGAGCAGC
TGAGCACGGCGCAGA
GCTGGCTTITCTGGAGGAA
O
GAAAGCGAAGACGGCGCCCACGTGGCAGGCGGICT ACC CCTGCTGCACAATGTGGC ACGTGGCAGGCGGICT
23 GTCA TGGCCAATCGTCGTCfCCAGCCTGGTGGAACTC
CGCCACAGCAGAGCCAGCACTITGCGACTCIGGGC
AGC:CTGGICIGAAC VCATCA
GGCTITCTIGGCGTACAGC
23 CGTGCAGGATGACGTGGATTCTTCCGCGTCCfCGTCG TCGTITTCCTCCTCTCCGTCGTGAATGICCTGGGCTA
TCCGCGTCCTCGTCGA
GTCCACGCCGCCTCAT
'FAGTTGCACACGGCCGA
CAAAC TGGCCILT-CTGGTACCIA
7 CATCTCTACCGCCGCCGTGCAACiGGCGGCGACA FCC GACGGI
GGAC:GGCGC.AGATTGC AAGGGCMCGACATCT
8 MG 1 GCCICAC:G ITCAC:CTCCI
CGC3GTGACCill6TG641G
9 CACCGACACACiCAGC:ACCAGGTCCGICGaXICIGTV CAG
TGTTGATGATGCGACGCC CCGTCG MCC-CCITT
TCTCCTCACGCACGCGGATTGCGACCGCGA
O GACCi AGM CTCCTGCAACGCGGCA
AACITGTCCATCAGCGACG
24 CGICITITCGGCGCTC:AACAAGAGGCGCACCVCCGAA GCCACAAAG
TCGCTITTGCCGCTCATCGACCCCCITG
ACCiCGCACCTCCGAAA
24 CCTCGCACAGCCGATGCACC:ATACGCAAGGCATCGT
2 ACGCCCGCCAGAAGTACGC.TCGGTAGGCACGC.CIAT C36 GCACAACGCCGCCAG A TCGGTAGGC.ACGCGAT
24 'ICTGTIGGCGGGGCTCACGCG TCGTAACAAAGCACA
GCGATCACCACGCTCGAGGAACCiCAGGAGATTCIG CGICGTAACAAAGCACAG
24 AGAGGCTACC:CGAa:GTAGCACAACCGCAACCiATIX: CAGCCAGACCGCTAGCCGAAAGCCG
CCCGAACA I
ACAACCGCAACGATCCC
24 GACGGTC.GTGGTCTCCTCGC.C.TGGAGGGGICCGGAA ACCZGGCGTGTCGACTCGTAGGGAAAAAAAGGIGG
S CA GGCC ACGCGTITTCCACCCITT
CTGGAGGGGTCCGGAACA
GCGCATTACrACCGAGTC GTCITCITGCGGTACCGG
24 C:ATCCTCiCTGTCACACGACGAACTGTT GACGGA TGTG GCaTCCFCCAGCCAATCG
7 CAA CGACCGCCICGGCTGATGATACCITCTGGCAC.CCTACC A
ACTGTTGACGGATGTGCAA
24 AAGCCiCGTGCCCACCCAAG I CGTCGTCATCGAAAGG
TGAAGAAATAGACCGGGIGCCGICTGGATGACCGT
8 C ATGGGC GCTCGTCC.CGGTAATCG A
GTCGTCGTCATCGAAAGGC
9 GC GAGGC.CACGGCGCAGAAGAACCGCTGOCCGTCIAC
TCGGTC.AGGGCACTCGT CGAGAAACGGCTGTCGG
O
TGGCTTACGACGCTGACGCGGCGGCCCAGAAAACC C CAGCTCGGTGGAAAGCC GGCGGCCCAGAAAACC
25 GGTC.GAACCTCATAACGGTGGGC.ACGCGTCCTATGA AGCGGCTGCTGGAAGAGGGCGGGTGACGGCGACA
CACGCCITCCTATGATGAGT
25 CITGCGCTTACCGATCCGCTTAGATTCGACGGGGAAC CACCTGGACATCTACCGTCGCC.AC.AGGCGGTTGAG
GATICGACGGGGAACGCT
3 GCTCACTCAAC.ACCGCCGTGAC.ACGCGOTIGGGAG TGCCTCAGCTG1TACCGCGGCCITCCCAAGCGGCA
TGTCTGCTAGTCGCCTACG GACACGCGCTTTGGGAG
GCGACTCGCCCGATCT ACGCTACTACATGCGCCT
S
GAATTTGCCCGTGCGGA
G
ACGACCAGTCFCCGGAC
7 AC.ATGGACGCCGCCTCTGCCGGITCTTCMGCGCA G CTTOCAGCGTCGCCGT
OCGTTC1TCOC:GCGGA
25 ATGCAAAGACGGCCiCGGOACGTGGAI CCCCATTATC
ACGIGGATCCCCATTATCC
8 CC CCGTCTTATCCCGTGC.CGCATACC:GMCGGAGAGTC
GGCACGTCACTTTGC:GG
25 C.GTAAGGAAACCGOGGCGGGCCIGGGAAACTCA A
TGGTCACCGTGGTCAGT
26 ACAGTCCCICiCTTGGGTGTCC.ATGITTGCCGGC17C6 O
CCCGGGCACTGATCCTGACCAGCGTGAGAGCCCCGT AGG GTGCCACGAGGGGATGA AGCGTGAGAGCCCCGT
26 C.ACGCTGACCGGCC ATCAGGCGGAAAAAGTGGATG
CGCGGAGGGTGATTCGCACTCCCGGGCGTCTCCAT TGAGAGTCCAAAACCCAC OGGAAAAAGTGGATGCGG
26 AGTTGTTGAGGTCC.ACCAGCAGTAAACATCAAGGG
GCAGCGCAAAGGCCA
TCGTGCGGAGATTTGITCT AAAACAGCAGAGAACrGA
4 G G .C7TGCCCGACGCGTGAAGGCCGAGCGCAAACAC
AGACGTCGGGCTCMa: ACTGGITCTGC7GACGCG
GTTCAGCATCTTGAGCGG AGCGCGTACACATAGATC
CG TGCA
AAGCTTACAGTCTTGCGG
GGACGCCAGTCGCATG
AGATAC.GTAACGTGTGCG
26 CGCTGTGCTGGCATCG/sGTTACGGC.TGGTGAGTCCA 1 CGTGTCGTACGCGCACCGTTGATACCGCGGAATCA
TACGGCTGGTGAGTCCAC
9 ACC: GACCGCC CCAGC(XiCATCAGCAG 'FCC
IGGCACTIGACGAGC
AAGTAATGGTCTGCAGCTG
O TGC CCCAT AGGTGGCGTCGCCAA
2 AACGCGCTGCACGACCACTTCCATGTTACGCGGCAG AGC GGCCACTACCIGTACGCC.
TFCCATGITACGCGGCAG
4 A OC:GACCAGTTCfCCATCGCGTGCTITCCCCGAGCTCC
GGCCICGAACATC1CCItG CX:GGCCiCiGGICTIGA
(31(3 TGG TTGCCiGCECTCFACCG
GCCTGTCCGATACACGTG
CACGCAIGTGCTACGCGGATTAGAGGCTCGGCAAT
GACGTIGTTGlIGCGGATC
ACCGTCAGAATCGACGGT
27 CTGACGCCACCiCCGCCATTCTI GT TCC ICATACiGCCC
GGTAC:GGGGAGAGATGG
TCTCCCOCGATTCCAGCGTGTCAACGGCGGTCGGA GG C
GTC.AACGGCGGICGGA
CIA
9 C TGCTOCTCTGC.ACGCTCTGACACCAACCAGGCGGC
CGCGGCCCATGAGGTA
O
CACGGCGTGICCTIGCTACCCACC.TGC.GAGTTGAGGC GIG CATGTTGGC-CGTCTTOGG
CACCTGCGAGTTGAGGC
28 CGTCGTGGAGACAAGCAACACCAGITT TCiG GGGCG
1 T1CACZTCC6GGTGGCTGCCGTGTTCGAGCACCTGAC. GAGG CTIGGAACIGGICGCGTC
CGTGTTCGAGCACCTGAC
GTGCTCAGCGGCGIGTACAGCGGGATCACGTCGG CC TAGCCACCTGITGGAAC
3 AG GGTAGCGCTTGCGGCACACCAC.C1TX:16CATCTCCA
GTTGTAGTCGCGGTGGC CCGCAGAAGGCCATGAG
28 GCAGCGGCGAGAGGAGAGGAITCACKi TGGA1 CCT
4 GCCTAACCTOCGCTCGCACTGGCGGCCGAATCFCC Cr.GTC CGTAGCGCTCCTGGATCT
TGGCGGCCGAATCTCC
28 CiCGGAGGAAGA TTCCi TCTGAGCCGTCTTCGTC:GICC TCCICICACi TT
ICCC:CGTCFCCACGTCGAAACCCCGG
CTAGT C GGCGCCGTCACTCGTT
CGTC1TCGTCGTCCC:TAGT
6 66CCGAC6CTGG1TCTCCTCGTGATGCGTGACGGAG GT TTTICCGCATGC.GTTCGA
CGTGATGCGTGACGGAG
CCC GCACGCGCGTTICCOCGTAAGCGCCII AC.AACTCCG
ACTGCTCGCACXGTCT GCGCGAATGTTACCACCC
CATGGGTACGGAGGCGT
GC.ATAATC1GCGAGGAGG
9 GAGGAC C.GGAACGTGACGGTITSCCGTTTTCCCCGCCGGTGG
TTC.GGCGCACCACCA AC
GTCTCTACGCTCCSACGAC
O A AGC CTCTGGGTCGCCAGGT A
1 AAC CGC 1TTGGAGGCCGC.ACC6 GCACTGGGCGTGGAAAC
CCTACGATTTGGCCATGTC
ATTGGCGCAAACTTITTGG
3 TGGC Gil CGGACTGIGGICGACTGT C
TCCACGCATGAAATAACG
CGCGAAGTGCAGCAGG
29 GACTITACOGTGC.GCGGC.GATGGAAACCAGCAGCCC GAGCACTIGCGGOCCATGAGCTGGAGGTCCATAC
G GGATT C. TGCGGTACTCGGCCA
TGGAAACCAGCAGCCCZ
GCCCGTCGTAGCGCAGAATAAGCTGCTGTTCACY.AC
7 11TCCCTACCGCGCGCTGGCGCTCCACACCAACAG TCAC , GCTCTGGCGCAACGAG GCGCTCCACACCAACAG
TCCAGCGCCTGCAGAT
29 CC.CCCAGGGCGTTGTAAAAGTCCITATGCTGCGCTAC GCGACriTACCGCCAACGCCGGICTGCGTGGGCGA
alTATGCTGCGCTACATG
O
CCAGAGCGTGCCGGTA
TGTCAGATCAGCTCGCAG
GGCTCCTTCGTGGGCA
GCCICCGGATCACATGGT
S
CGCTGGATGACGGTGATGCCIGTACACGGCC.GGOGA COG CGGCACAGGTCG TCCA
TGTACACGGCCGGCGA
CCAGGTGAGAAAGAGAAGCCGCCTCTCATCGTGCC
GTAGGAGCGACGCTGACG
i CGGCAGCGGTGGTACTGGTATGCTTACTGTGTGAA TGACGGTGITACTCGIGG 'FGAAGTGACGTIAGGGGA
7 GOGAGG 1 GCGG , I GO
CCACATCTGGTATACACCC
8 CO ' GTCCAGAACGCGG1TGGCAACGCCACAGGCCGTAG
CTTTCTTGGTGGCGCCC G
TACTITTCACTCCCGGGIG
TGGGCGATGCACAACTCITTCTGCTACGGCTGCTG A ACGACAGGATCAGAC.AGA
GCGTACAGGAGTCCTAGG
O AGGT TACGA CO T
1 . AG TGIGCCGCACAAGGCCGIGCAAGMCATTCCCCG
GGIGGIGGIGGGCATCGTGCGC1CGGCAGCTCCTT TTCCATCIGCGTCAGCCIG '16CAAGCCCATTCCCCG
31 CCCCTGGTCTrCAAGCACACTTACGCAGCCCAGCCTA
GTCGCCFCGGTAGCTCAGTAGTCTACGGACCGICIG A TACTGTCCCAGTCGCGA
ACGCACICCCAGCCIAG
31 GAGATGCCAACAGIGTIGCTACAGGAAGCTf ACAGG
CGATAGGCGGCTATAAGATAGAGATTGICTCICITT TACCG TGATGTAAAAACA
3 (T Aicrnrr TUG ACA
ACAGGAACCITACAGCiGT
CTTTITCCTTGTITGACGG ATATGTGCGGTATGATTTT
CCGGCGTGCAGCAAGGTTCGCTTCGTAATICTGACG
S CCAGG 1 ca CGGIACATCCCGCACITCG "fGATICCGTAGACGCX:AGG
31 GAGCAAGTTACCATCOCCTGCMGCGaiAATGGAAA = TCCGICGGCGACCAACACCAAGATG i GGGCCGACiA a; IGGGAACGACAGAAAC
TGC.GC.GAATGGAAACCAT
31 AGGGITGAGGCCiCiCCGTTCGGATCrACGTCCCAGTC
GTAGCGGGAGATACGGCGT ICAAGC:CGCCGITCAA GGA ICTAC:G TCCCAGICIC
7 . TCT GC GGIGACCGTCCTCTGTC.0 I .
CGACCITCGTACCIAGTCGC
9 CO Go TT
CGAGAGATTCTCCAGCCCG
32 TGCGCGCTGCTGC; TGITITATIGCAACACiCITCGACC3 CCiACGCGA TCACCICi TTGCAACACICTTCGACCIG
32 ACGAACiGTCCA.GCGGC000CGGATGCTC1CITCGTT
GCGTTGTCTCGCCTATCAGOGTIAACXITGGGITTCA
GCGGATGCTGGTCGTTG
32 CGCGTGACTCTCTGITCGAIGGCAGGICCAAGCGCCG TACGAGACCCCCiAGGIACGCACTGCAGACCGTACCi CAGCTCCAAGCGCC.GT
32 CCC AACGGTCACACGT CCCiGAAAGCCCi TCCCGGICA
3 Tr CGGAACGGCGTTTCACTGCCTTCTGCACCGCCGACG
AGGGGAGCAACAACCOT GAAAGCCGTCCCGGTCATT
32 GCGTCICICAGGGCTGGAACIAACCCAGCACTCCATC:G
4 CAAC.GC.CitiC.ACTTCCAACAGTGTMITGCACGGCGT AC TCGCGCCGCAGAGTT
TCITGGTGCACGGCGT
S TTCG CACACCCAGCCMCGGATC.GGICGTCGCGT CO
GGCZGTTGGAGAATIGGT GGATCGGICGTCGCGT
6 CCGACGTCCCGCATCCAACTGGCGTGCGAGGAAGA . ACATGGGCCATGTGTGGTGGTGTGCACTGCCGCGA
A TGGCGTGCGAGGAAGA
32 GAACACGCGCAACAGGGCCCiATGTCGGGCGTCCACC CTCCGGGTICCTGCCITITCCAAGGCCGTGGAAAAC
GATGTCGGGCGTCCACCTA
32 TATCGTCGGGTCGGCTGTCCAAAGCCTGGTTGGATCT AACAACGCAGACTGCTGAAGGCCAGGC.ATGGGTCT
AAAGCCTGGTTGGATCTCG
AGCTTCMCCCGACTCCGCGAGACIGGC.AGGGAC A AATTTGGGTCTTTTC.ACGC
TAGCCTCCTCCACCACGG
33 GGCCC.ACITTCGGGTGGAGEICTTGAAGTAC.AGCATG
CGATACCACCACATTCACIGGGCCITIGCCGTCCTCIT CITGAAGTACAGC.ATGCCC
O CCCA CCTG
GTTGCCCGACATCC. FCCC A
33 GGTGCCGAAATCACCGTGTGAAAAACCGCGACTTTC C.ITTCCGTACCGGGATTTC
AAACCGCGACTTICCACAC
AACGTCAAGAGICACGTC
2 ATGTCGCGACAOCCGGGFCGCCCACCCCCATTACACG occre; AG
GC:CCACCCCCATTACACG
33 AGAACATATATAATCGCCGTITC.GTACGGATCAAGAG
CAACAATACCCiCCCTACGTIGGTICAGTGTTGGATC CACATCAACAGAAAACCG
ACGGATCAAGAGGICCAT
4 AGC.ACGGGGTACTCCITGGCGCTCTCGGGGCCGTAT GGA TCGTTACCCCAGC=TGCC
GCTCTCGGGGCCGTAT
i 33 GAA6GAAACCGACCCCGMACTAGC.G1TCATAACAA GGCGACGACAGITCCGIGGGAAGGCACCGCGACA
CGGAATCGTATACCGGGC AGCG11CATAAC.AATC.CCG
TCCCGG G , A G
33 CTC.AACCTGTGCGCCACAGCGTTGCAGCaCCACGAAC
GTTCCCAAGCCACGCC GTTGCAGGCC.ACGAACA
33 CC.CTC.ACTGGGCTACCAGTGCACCGGIGTGCAGCTA 1 7 AGC ' GPAACC CGCGCCCAGAACCAAC
ACCGGIGTGCAGCTAAGC
33 CCGTGCCATCGGIGGGTACCAGAGAACGC.AGCGCA
GACCCAAGCCAGACTGC
33 TCAGCGAAACCGGCACCCCIGGCCGGCGAGTAATAGC GCGiGAAZEGGICGCICATAGCTCCTCC.TGGTGCCT
GGCCGGCGAGTAATAGCA
O
GGCTGICATGGGICTGCGCAGCCGCCGAAACACGA CT TGITGCCGCTGICGAAC AGCCGCCGAAACACGA
CtGAAAGCGTCGIGAGCA
TAGGGGACTACCTCCTCGA
CCAATCGGCGTCGCTTAG
GGGACCTGITTCAGCGIG
t GCAGTGCTACCGTGGTFCCCTACGATCCICCGGCGC GCGFAAATCF GCGGGATA
5 'MCC 1 I , GGCGCiCCAGAGAACGT GC
6 TG1TCGACAGCGGCGTAGACTCGCCTCGACAGAGCC ' ACG TTTAGCATTTCGGCGCGG
TCGCCTCGACAGAGCC
ATGGCGTCCAGGTGTATG
AAGTTCACGTCGCGTCC
GICCAGCAGTIGGICGCG TCCGGCATGGTGCTGC
AAATCCCGCAACACTTGTC ATAACGCCATGGGATCCTC
9 . CA C G A
35 AACC.AGATGGCLACGGACGTCCTGAACAGACCGCT A
CCCTGCTGTIGGCTACCTCTCTITCCTGTGC.ACCGIC CGTGAAGAGCCCTCATCC
O ACC ATC A CM
GAACAGACCGC TAACC
CGCATCTGCTCGCCGTAGATTGGATGATTITGCGCG
CfGATGCIGTTGICGTGCA
2 ATCCCGGCAAGTTGC:GCCiC:CACCACCGCATTAACCG CCCi CCFTCFCGCGGGCCTA
CCACtACCGCNITAACCCi TTCFACGGGTATCTGCAGC
3 T GGC.3 7 GGACCITCIGGGACCAGT
35 GCMCCAGACIGCACGGCCA TAGGCAGCCiACGTG I
ATAGGCAGCCACCi TGIAT
4 ATGG CGTGGAAAACCACCC.GGCAGTCCTCCGGGITCCTGC
35 CGGGGGGGTGAAATTIGGAGTGCAGCAC:ACGCAA
5 . CAGAMACAC-CC.CCCCGC.CCGGACGCCCGACATCCA CTACITC
GCGCACCCTCCTAGGT GCSACGCCCGArATCCA .
GCATACCAATGAAA I GGAA AGGTACT GATC.T.AAAATG
6 TGCTT GICTGAATAC.AT GAAG
CITTGTGCCACAGTGCTT
35 'FGTAGFCGIATAGAT OCC.iCAAGAAATGGACAGTGG T AGGAAAACCGATICACTGAAATTGG
ICGGATITT AT AATTGAAGGAAGAGACXX; IGGACAGTGGTGAATAGC
7 GAATAGCA CTTGTTAGC. TIT AG A
35 TICGCCTCFCICGGACTGACTAAGACACKiAAATGGCA AAICACAGGCACCA
FGCGTAAAGGC:IGGAGAAGYF AAGAATAAAAACCAGACT TAAGACAGGAAAIGGCAA
CCCAAAAACAAAGAACAT MACAAGCCAATTAATCiTG
36 CAAIGTGLICAATIGGGGC.AGATF CGAGCFGGATAG CATTGCAGGGCCACTGAGF
ACCATCCATGGCFGCAC CAACAAGGCATGTGAATT GATTCGAGCTGGATAGAA
O AACT AAG GAC CT
TTCTCCCITACFGACCCAAGGCCfACTTCAAGAACAC AAGGAACiACAAACCTGIA GITCA F
TATAAAAGGAAGG
AAACAGATCGGAGACAIGGCCAAGGITCTGCACAC GGGGATGGAAATGAGAC
CTGCCTICTTC.AATCICTIC
AGAATCiGITAGCGGCATIGGGCAPTCAGCCFCCCT GAT CCIATGAACAGATGGA
AAAGGACCCTAAGAAAAC AGGTCCAATTTACAGGAGA
4 AGGAGAA . ATGGG TGG A
36 CCATTTTCGCCTCTCC.AGAAATTC.ATGGAACTGATTC
AGAAGGAC.AAGAATTGCATATCAGAGTGCTGCTGT GGGATAGGGACAATGGT ATGGAACTGATFCGGATG
36 TTGAAGCAATTTGAACCCC. TCTATTAGAGGGACAAGA
ATGAGAACATGGAAGCAATGGACTTCTTATAGC.CCA ACCTTAGGGTC.TCAAGM ATTAGAGGGACAAGAATG
36 GCATITCAGTMCGCGC.CTAAAGAAATGATG t I 1 i I 1 GGAAGIGMAATIGATGACGGCGAGGCTCAGAGIT TTTAGACGCCAMTACAC GAAATGATG GGGC
36 TCCAATATGTCGTCAGCTICC.TGGGCGCCATTATGAG TFTGGTGGACAACTTTITTMCGAGTCGTAMGAGA
GGCGCCATTATGAGAACT
TATTGGGGGCGAATGTAT
CACAACTGCACCCTGAAGA
O Ci AAAGICCATT
CCTCACCCCCGAAACT AG GAAGIGCGGTICCAGAAG
GGAGGITCACCCTATIT ACC GGTGITGCTITGAATAGTI TCITAGTAATGGAGITGGA
37 GCGCCAGCGAGTAAATGCAAACAC.AIGTCAATGCAG CTGCCGACGTIGCAAGAGGATITACITCTCCACCAC
GCGAGAAATar.ACCTGA ACATGTCAATGCAGCTGFG
3 TTGGCCT ATCTAAAAGCA G , GCTITTGCTAAATIGGCCT , AACICITCTGTAAAAAAGT GIGTACITGCTTTAATTGT
TGATGATACTCGACATCCITGCTAGCACAATGGGAC. GTAACITCAGCCACTCAAA
ACTC.AAAC ' GAAG GCGGGGTATG11TAGACC C
CCAGGTTACCATATTGGG AGGATCTAC.AGTGGACGG
GCATTAGGCCATCTTEGGGATCACTAACCCGGAACA TGATACT1b1 I I 1 I I CTGA
GTAAGCTCTTGACCACCG
CGAGGTGACATGATATGA TGATTGTATTTTGAGATTG
AGATGGGGGAATCATGTC ACAGTTTCTGGAGCAGGT
CCATAGGCATGTGGTCCA
O AAGC cATT G
GCTFCAGCAGTIGCAAGC
GGCAAGGTAGCATITTAA
CTCTGAC.ACAMAGCTCTAGTCCAAACGATCGGTAC ACAAAAAGACTLITCAGTG TCCGTATACAGAT1TA A
GG
2 TAAGGGGT 1 CCITCC It GGT
i ACGIGTCAGITTCTATAMCGATGAIGATCAAATGG CACAAMTCTGGTGGTG
3 IC 1 AAGAAAGCCT A , G CT Tai KT
TCATCGCIT TC
TGCGCTCGTCAGATAGCAAGCCGGTGAAGTAGTTA TTGCGGCAATATTGAGTTG
4 GTTGC ' CCAGCG TTTGTCCCTATCGGCCCG C
AGTAGGATAGCACCTGCCGCCGCGCCTTCACATGA TA ATTITCCAGCGCCACCT
GCCCGCTCATAGGGGTTA
CTGATTTACCGCCMAA GAAAGGC CTCL I ItI III tCCATAGT AGGTATFCCICCTTATCCTT
GATTGGGAAGGGCAAAA AAATAATATTTTGCCAGCT
7 . CFC TF 6 TAGGACACAAT GI CTTG
AAGATCAGTAGCTAAGGA TGTAAGOGGMCTATAGG
TACTGCITTCFGGIGCGA CACCAGTTGGAAGAAIGG AAGTAGAAATLTCGTCGGC
9 CGCiCG GC; CI G
O TIT (.3 cm TCGTCCICTTCCTMCIGC TGCATTOCCACTACCGITT
GGITCGGLifCCIGCGGGAAICCCCTGCCAGAGCG T TGAGCCCGGGGACTA ACC ACGGITCCACGGIGCA
39 CA C:TA ITGAACGTGCGGGACATG1TA ICTI GGTAGGC . GLIGGAAGGICAITAGA
TATCCGGGAACG AAATT G CCIGTCTIAAAATCTCC:FG GTIATC TT GGTAGGCAA TT
39 CGTCGCGGCGAAA TGAT I GAAAGIGGGGGITGTCfT GCTCGACC:GTITTCACG ICC;
TGCMCGCCICAT T IC TT ICT TCCTCGACCACAGG
3 , C:TTCG CCAT I
GTGGGGGTIGTCITC:TICG .
1TGAAGIAAGCTGTITTAAGCCCAGCCTGCGAATIC. CITTC:AAGMAGGCATCT
4 T AAC.AAG AGC CA
GTCACAGTCACAAGGT
AGCTCCTGCATTAATGGIGTAGACACAGAAACTG GITGTANT AA AGCTAAAT
5 GM TACAGCC. CTGCC CA
AGCACTGCCTGTTGM
39 GCTGCC:GGCATCACATGGAT CT I CCAACAIGAGGCG
TGITGCTCCiCATAAATICGCGGCGIGGCAACIATIG
CAACATGAGGCGGTG
AGCAC.AAGTGAGGCGCAAGACTCCTCTCC AAAGTTGGTAAGCGACTC CCGTGGT1TGGTGGAAAA
AGCGTCACCCA
TGIACATAATCCAGCAAGTAAACCCCATIGTC KT AC AAAAAGCT ITAATAACCAT
GCACACTAATAGCGCGAG
TGCTGACGTCACTOGCGATGACGCCGGTCGCTCiCA CAT AGCT TACAAAGGGCG
O GA TCT Cr GACGGGAGCAGGAACAGA
CATCCGCAAAGATGAGGCMCIGTIATAGCGTGCC:
CAGTAGTar.C.TCTCGTA
CAAACGTCCAGGC.AGGGITTCTTGCAMTTITTAA A GIGGICAGGAAGATTATA
2 AATGGTT . AGCCTGAG CAAAC
GCCTTGCGACTAATGGTT
CCCGCTCCTGGATTTIACACTGACTGICCGTAACATC
3 AATCGC ATC.0 GTCCTTCTACGCTTCCAA
TCGAGCTTTCATTAATCGC
40 TTTTGCGTCGCTGC.CAACGTGAATGGGAGGAAAGAC
CACATGCGGGGGATGAAGGAAGAGCAAATGGGTT A ACATATGCACAAGAGCG GAATGGGAGGAAAGACAG
GTAGTGCAAAATAATGAC TGGCACGCAAACTATTAAC
GTAGGAAGACAGCATGG TGCIGGAAAGTGACATTG
AGGATCAAAACACTGTAA CCITAAATAAGGCTGCTAG
TGTCCATITIGGTITCGCC
AACAACACGGGCTGGG
40 GCTGC.GOCCGCAACAGCTATCGCGCACT-CAGTATA
9 GCTGCCCGCATCATAGCTOGCGGCCAGAGCG1TGGT CAT TCGCCGGCGTAGACC.A
OGGCCAGAGCZTTGGT
41 AACICGACGCCGTMCIGACAAGC.GCGATATCAAG CGCCCAGACGTAGGGTAC.AGTCCGGGAGAAAAAAC
AIGGCCCNICGGATAAa: CAAGCGCGATATCAAGCAT
O CATG CACTGT TT G
CGGAAGAAAAGCGTGGA TTAAAAGGGAAGMATGG
1 TGGA GTACTTTTAC , A , A
.
42 TGATGGACCTCGTCTAC.GGCTAGGGTACCGTGGGAC CGCCGTAGACGTCGACGCATTGTACAACGCGACGT
TACATCCTTCCATTGTGCC
GGGTACCGTGGGACAACT
41 AACGGATCCGGGCACAGTITGCCCGACGACiGTAATG 1 C.CAACCAA
AGGAGAAAAGCGGCGG1TCTGTIGTGT ATTGCCAAC.CGTITAGAG
3 GT ' GCGTTAC CC
GCCCGACGAGGTAATGGT
41 GCCATC.TTGAAAGCGCTGGTGCCTATGTTAGCGCCA CACICCCTCGATGATGCCGCACTCCGAGGCATCCTG
TCCCATCTGTGTATACGCC CCTATGITAGCGCCAGAGG
ACrIACTCCTACAAGGCTC
GGG GAGCGC G
CITTACGCTGGCTGTGGG
GACAATCCAAGGATGACA
TGCAAGACAGAAATACAG CAGTTAATGCTGGATGCTT
TAATTTGCAGGCTAATTTG GAAGCTTCTTGTACTCAAA
8 .ACTC:AAATGT AGGTATCCAC TGG TGT
GCCATCAGAAATTTGTTGC
9 Cf TAC AAAGT T
CCCGGTTCCTACACTTAC
42 AGGTGCCGTCTAAATAGGGAATGCTGGCiTTCAGGGT 1 O TIGAC : AGCAAAC: AGAAAC
CIGGGTFCAGGGITTGAC
s GGATACCTGGGCCCCACIATCACAGCIGTITGGCCI CTITAGAAACTFTCAGCCC
1 ITGTGGA 1 ATA , Al GAG
TAGGCAAGTIGTai A
42 GGGTCCACAGCCTAGATCTCGACCCTC.CCAGACTGCT 1 TTTCCGGGTfTIGTGTCTCGCGGTGTACGCCCCCAG CGGGTAACGCTACCACCT
2 CIA ' rrsc A
CCCICCCAGACTGCIGTA
TACCCTAGAMIAGTCAACC
42 GCTCCCCCCCTAGGCTTGTTATGGTTGGTGGGCAGG CCACAGCTGGCGCACAA.GTIGGCCCTAATTGCGACT
GCTA1TTGAGCCTTCATTCCTICAG(3AATAA1TCTTT
5 . CG CTCCGGAAA TGTTFAAACAITCGCAGC:G
C.ACTGGCATAAGGAGACG
TCC.ACCAAATGCTCT1TGC CACCATCTCCATTCCATGC
6 ATGCA GAM; (3 A
TICCATTAGCATAG
7 G rr GT TGITCGCAAF CCGGCGCII CCATGCCTITTICC GCAGG CCAT GO
TCCATGCCITTITCC
8 CGC GATG I GCTACGGCT(5TCAI6GAG C
AAATCCFCGCCITAACTIC
9 GCC:AGA AATTITA.CIG 7 AGITTITGCAGTGCCAGA
43 GCMCCiGGICTMCAAGITTTGAGCGCAGIGATGA A I
GGAAAGCCGGTIGIAAGC:GACGCTOTTGIAGTI CCGGCAGIAAATCTICCG TGAGCGCACIF
GATGAATC
43 AAGCATIGCTGCCAAACACCCTGCATTrACCT1TICCT AACCITCGAGAACTAAACAAACTGTIGTCCGITCGT
I . CT CAAAACC CAGAGA1TACGTGC4IAGA
CTGC.ATFTACCTITTCX.TGT .
43 AITGCTGTCAGACTGCGOICACIGAACrAlTAAGE GA
43 'ITGCGGCTITCGGGCiACATGITCCATCACCITIGIGT
CGCAGCAACGCTCAAAGAAACACTGCCTGACTCT CC
ITC.CATC-ACCITTGIGTGCT
43 CA I GAGCACi TAGCCCCGACGIGGGACACCATTGAAA
TGTAGCCTGCTTICAGCACTCiCGAATCGCCACICCA AA1CTCAGAAAGCCAACC
TGGGACACCATTGAAACCA
CCGC.AGCTGTCTCTACCTCAACiTAITTATTGAACAG GAAAACGCATTFITCCCAC
AGCAGCAACAAGGAAAGT T
rrAACCCTCCCIC1IGIR:CAGCCGAGCiCAAAAGTAC ATGAAITGGTMAGCGCC
CCOCACATGATTTCCAGGG
43 CGGGAACACiGACAAATGCAATAGCTAITCiGT GGAAA
TGAACiATTMTGCTGTCATITGIGTMCITC,AGGCG GGAMTGATACACACC(C
GCTATIGGIGGAAAAAGA
43 AAA I GCTIGCGTAC.AGGGIGAT TICAACCTATTAACC ITGlIGCTCCAG
ITAITGACAAACGAAGGAGAAACT AAACIACACAAACIAAAAC ATITCAACCTATFAA(X:IG
O TGCiGA GCCTCG ATCAC Giii, 43 AGCCAGAAGGIAAAGAAAAGAGAAACtAACTTWTT Gf ATGCF
CAAACAGCCGAAAGA.AGGGTrACATIAT ICGCAGGi CTTATTITArf CIAACTIGT 17CTAGIGCA
44 CCGTAGTACCATCGAC:TGACATTATTGTCACATTACA
ATACGGTAAGGGCAGCCCAAAGCTITTAGTAATAC:C AATGCATCATUCTIGAAA TIGTCACATTACAATIAACT
O ATTAACTGCA . TGAN \ CA GTACC
GCA
1 ACCT. CTTATGC CTGC CIG TATGC
44 TTGGATATTCGGGAAGATC.ATCAGTC(CCICTGTAGT CCCCCAGGATGATTTACCCCTAACAACAGAAGGAG
CTGGGTTTATGGAATTAT CfCCTCTGTAGMT.TITAC
TCAACTGTGCTACACGCTGCAATAGGTACCGAGCCC GAM". AATTGATTTACAAA AAAAGGAC GCCTG
44 fiTTTGC.AAGTCTGTCAGGITTGACCATGGTCTCCGTG
AAATGGAAGGTCAGAT(CTTCTTCCCATACTGGGTT TTACCOGAAATCTGA AAA
CATGGTCTCCGTGTACTC
TTAC.AC.TCAAGCTAGGGG CGGAATAAAACTTAATGCC
S AATGCCCA TGTGAGGG A CA
FLAGG TCTCACCACFGAAACTCCC TGTAAATTCTAGCGGTGCC
6 ci icca: AACCGTT T C
44 GCTCGCAICAAAGGGATAKITAACAGGGTCATGAAA CIGTCCTTAAGAC.GGGGATTGGAGITGITAAGITCC
GTAACAAGAATAGGGGAT GGGTCATGAAAGTAAACG
44 GGTTGATGTGATIAATC7TaCTGTCCTAAATATCCAA CTGGITCCCCAAGCT1CG1G1AACAG1A11GGTAGA
9 CTACACC AAAGGGA , GG CC
CGCCCAAGCaATTCTCTCGTTTAATTATGCATCATCGA TGATGGAAACGAATCCC:f O GCATTC GCAATT C
GTCACTCAAGCGGCATTC
45 GATGCTITTGGGACGCTGGTACAC.CITTTGITAATTA 1 C.CAGCTGACATACACTGTAGTGATACATGGTCATGA GTTAACAACCTCCCGATAC
ACACCITITGITAATTMG
1 AGGCG ' GCGGT C GCG
45 AAAGCTACGTTGAGTACTTCGGATGTAAAGTCAGACT TTCCTCACAGGGC.GGTAGTTCAGACGGTATCGCTAC
ATGTAAAGTCAGACTATTA
OCACTTACAGGAGGAGGA CAAGGTAGAGGC.ATAAAA
AAAAGTCTGGATCTTCAA AGATITACATIAGGCTCCA
CCTCAAAATCTAAGCCATG
GG TTCG C
GITCCGTAAAGACTCCGG
AGGAATATCAACTGCTAT CCATGTGGAGGTATAACAA
6 ACA.AAATF GGC GTGC AATF
AACGGTCAAGCACTTCTG CCGGTACCCTTGTATACGC
45 AGGCAGACATCAAGGCIGGGTTMCI. CAAGACCCG 1 GTTGAGGCTTGIGGGITITCCGCACAGCGTC.TTGGG ATAAGGATGCGGCAAGCT ITTCTCAAG ACCCGTCCAA
8 TCC:AAG I mil. C 6 9 GOA 1 CITA , CC
GCCATGGGGAGAICTGCiA
TCCATTGCTCCTATCAAATC
O ATCAG ' ccrG
ACAAACCCTACCAATCACC AG
GCCTTC.AATCATCATGICA
ATAAACTTCCATCAAATCTAAC GACATTGATTGAAACGAG AATGAATGATAACACAAAT
GCTAAGCCAAAACATACAA
3 . CAACA ATGT17TGGC:IC GAISGAGCCGAGAACAGT CA
46 CCTATGGITCCATGTACAC.ACAACGGTGAAGGACCAT GATGGTGACTGCAGGTGGAGTGAGCTGTAGMCT
CCCAAAACATATTCAATAT
GGTGAAGGACCAIGFTCYC
TGACTGGCTGTAAT GTG TCCGACT TAGCAGACAAGGTGACFC TGGTTGAAAAAGIITACIG
5 TACTG AG T 'MCC A AGT
CACCTGATCCCAAATACIT
6 CA.GGA IC/ACA TGA TGGCI
ACAACC:AACAGGA
46 CC.CCGGAGITGAATCATCTCCAGTGTGTGGAGIACA
canGTAGATCITOTAGTGCCICCC.ATC.CCTCTITCI GCCAGGAAAAGAAAAACA AGTGTGICGAGTACAATCT
46 TCCAC:AICAT TITCATCFGTGAC:AAAGTAGAGGACAC A
1 AACGAT T ITGIGGCTAGATCCiAAGGIAGTCTGGT AGTACi AGGACACAATTC16 8 AATTCTGAT ATGAATCTICT AATACa:ACACATGCCC.A AT
46 cioncrrrrrCACCCIGGGAATAIAAAGCCTAICAACA
CGCIGTGlIGAMCAGAAACiCITGGACAAAACAGC CAAGGTACAAGA IGITGG sr AAAGCCTATCAACACACC
9 . CACCA TGTT IC A
.
47 AAAC:ACCCTICTGAAGITAGCTGAGGGAAAACAAGA GTAAICTCFATGCF
GGCTICCATGATCACACCCT ATA AGCTCCTGCAAAAATAAA AGGGAAAACAAGAGCTAF
O GCTATCG GCTGAAC AGC CG
ATGGAITGACAATAACACCTGCIGCATTATCCCACT CfGAAAA IIGTAGCA LAT 'IGATGIGA
I:MCI TCATAC
1 TACCC. TGACTTTC GGTGA CC
47 GTCG TCCAAGGT TGCAGG TCACAGTGCA ITICTACTA TTC: TGCAGACCATICGACAT
TIAGATCGACAG IATG CGGAAGAGATAC:ATATAA ACAGIGCAT TT CTACTAAA
2 AATCC AGAACAA AAACIGG TC.0 47 CTCCTCCTCTGTAATC.TCATACTCAAAATGACTTAGTG
TCGGTGGTTCCATrTAGAATAGACAGCAGCATACAA CCACCGTTGCAGATCTTA AAATGACTTAGIGTIACCA
ATAGATACGITGGG TTAGAAAIGT GAAAGCTGACGITITGCT AAGAAGTCITA ACAGAAAT
47 GGACiT I ATTF TCACAGCCTGCAACCGAAAGIATAACA
TGAAAGACACAGCAAAAGCiAGATTCiAATITTCAGC GAGGATTGTGACANIGCA
CCAAAGTATAACAAAAGTC
CIGMITGGAC:AAACACCGGGTTGCTGCTTGATGA AAACTGTACAAAAG 1113A
ATAFCAAATGITATCT GAT
6 TGTGATCCTCC C.ITAC TGTGT CC:TCC
I.
7 ACAATGG CTGATATTTFCCK. TAG GC
47 AC.AGTCCAGTTCTGCTAAATCTTTGCGGGAGAAAGA GGACTGTGCATITTAAAACCCAACTGICGAAGGAG
TATGATCAICTTATACTTTA CGGGAGAAAGAGCTTATT
8 GCTTATTITAC . GAGGAGTA CAGACC TTAC
GTCAGGAGAAAC.C.COCCGCATTCGGTCTCTCCTCCT GGAGGAAACCGAGGAAA
9 At. CCT TCC
CCACCACGTCCCCTGAAA
TATTGrfAAAGGOCACAGC
O GGCACAGCA TGCAATA
AGCTAGGGATCCGCCTAT A
GTTCCAAATCITTATGCAA
GCAATTGCCTAGCTGAIGT
ITCTGTIGTGCCATTAGAGGATCTC:TC1TGTATCTCC CTAGTGGTACITTAGTAA
TACAGTGCCTGTGGAAAGT
CAATCATATCTGGCGAGG
TAACGCCATTGCAGTGTT
TAGAAAGCCCTCCAAAAA CACCACGTGACGTMAAA
4 ITTFAAAT Ain- CTAG I
AATTGGGCAGGCGGTICATTGATAATICAATAGCA f ATGCTGG C.AGCAGTA CTATAGGGCGTCCAAGGT
TITCAGAGAC.AGATGCTGG
ATGATTCTGATCATGAACGCCIAGGTAGTACCTA I A CT GAAAGATGTACAGG AA
CACAAAGTATTAGTTCCTA
ATTGCCACAGTTTGTAAAT
7 TGTAAATGG TCTCCCAT , TAGTGCTCCITTGGATGIC GG
GATGCCCTACACTCAACAACCCAAA TCTAACTGCAGATGITATG TCGTAATATITTGGATAATT
48 CC.AAATGACTCAGAGAGACTGGATGTGAACTATACT GTOCGCACCCCAATAATTATITGCGGAAATTTGAAG
TGCAAACGTGTATGTATCT TGTGAACTATAGTGAAATA
49 CiTTCGC.AGCTGACGCCGGTTGICCACGACCTACTCAC
ACAGACCAGGATCCCCACACCACTGCTGCCGTGCGA
O C C ATTACCCAGGCAGCCGA
TGICCACGACCTACTCACC
1 Gilt ATCGAAAT AATCC TC
TGCAATATGCTAGAAGAC AACCTGATAAATACTCATG
49 GCACATACATTGCTCCGTCCCfCGCGCCTTCAGACCI ATCACATGAGAACGGCGCGACGTCGCGCGAATTCG
GGATACGTACGCAACAGA
OGCGCCTICAGACCTGA
CAGAAGCAGAGGATTATG TC.AACCCATATTTGATGTIT
GGTAAAACCAACAGGAAA TTGATATCAGTTCCTTGTTG
5 Cf TGlIGG AAG7CT CG G
CAGGATGAACAGTGATCAGTTTGCTGTTGGCATTCA TCAAAAATTGTTGAAACT GTGAGAAATTG AAAG
AAA
6 AAAGAAAGCG 1 rt CCA ill. GTAGC GCG
AACAGCTTATCACCTTCTACTGCTCGCTCTAAATTCT GIATGITCMGATGGAG
'TGTCCAATCTIGCTAIGGA
7 1 ATGGAA 1 TGGAGAC , AATTIG A
AAGCGTGTTTGAATCAGCGGITTCTTCTATGGCCTC TfCAGAACTAGAGC.AAAC
ACCCTATTGAATACAGATA
8 ACAGATATGAGA ' TCC ACA TGAGA
CGAAAACGGTTGTATATA GCCCTAAAATTTAGCAAAC
SO GTIGTTGCATATCCAGCATAATCAAGIGCTGCCT AGA
AACAAGACATCTTAGACGTGCTAATTTTCTACTTCAC GTGCTGCCTAGAATTTCAT
O ATTICA% ACAGCGG
GCGGCTATCCATATGCAG G
ATGGAAGACATGTTACCC AGGATATTGTATTAGACCT
1 . AGACCTGCA GAAATGTICIT TAA CiCA
50 GCACTATAGCTICTACC.ATAAACCAGGACGATTCAGG CCCAACAGGTACAGAAA
TATCAGACCATGTCATACC AAGACCTAACAACGATGG GGACGATTCAGGTACAGA
TCCACGATIGGACGCCATTATGGTTTGAAACAGCCG GAAAG MITT AGGIAGTCC
3 cuvrAIGI IC CGACTGIGCAGGACCTAA AT ATI;
I
50 TGCTCCCGTACACTGTTTGTGAAGGGACATAGAGGG CAGGCACAGCAGGAATATTGGT.ACCAAGTAATGCT
GTCAGGAAAAGGACACA
4 GGA GCTX: GC
AAGGGACATACiAGGGGGA
AACAGTTATTGAACACGG GGCAGACAGTCAGMAAA
SO CITGCTGGICCICAGAATIZIGGGAAAIGTITTITIGA A I GGAAGCAAT AGCCAAGCG
TCAGTACTG TITTC IT AGTG i ATGAACTGICAAA GGAAATG MM.' GAAAG
50 COGITCCP.TACTATACTC.ACi ICGITGTCCGAAGCAAA AACAAGTTATGAAA I
GTGOCAAACAITCEACAGIT f GIAGTGCCACCATIAAAG
7 . AGCAC.A TGCCCCG CT
GTCCGAAGCAAAAGGACA .
SO GTITCCACT GICCACGCiGTCGAGICCAAC:AGICCEST
ACCAGCACCAAAGACGGAACACTITAAACAATTGG
GAGTCCAACAGTCCCCTI
SO 'IGGTAGGGGGTATITITACAACAI CAAACATGCCATT CTGGGAIT I
ATGICACRICACCT AAIGTACAGTIGT AAACATGCC:AITGIAACTG
9 GTAACTGT ATTACACC.AAT TCCTCAAAGGCACCACAT I
51 GGTIG ITAATAGCAGCCACAATAGIATGGATAICTGA MCKIM:TM TCS GT TACT GTCCCCATCAT
TGC IGI GCATTTGIIGTArcirrrrc ATGGA IATCTGAG ITTA IT
GTOTAC
Si CTATATAGGCCCACACGAGGCTCATCTAGTGATAGCG
CCACKITGCAGGTTACAGACC.TCCCCTICATATAC.AG ATTCCTTTAGATACTTTTG
TCATCTAGTGATAGC:GGTC
Si GGAAAAGITAI ATCAGGGCCAGATIGGGGI AACACC
TACTGCACCIAIGGCiAACACC.AAACAGGOCCIGIA CACiATACATAITTAACITC
GGGGTAACACCACAGTTC
Si GCAIGATAAAA TAIGTI GG IGCGAGCCfCCTCCI AAC
CG.GCAGITCTAGACTICTTGCAGCAGITITGITAGC GCGACAGCACAG TAT ATG
OCTCCTCC.TAACCCTGTA
SI GGAACATCI GATTTA ITGG TCTGCAACAGGAT GGCG TCCAGAT TAIT I ACAAATGGC
TGCA.ACATTIGTFCCF CCGCCCTI AGAACTIAT TA
ACAGGATGGCGATATGGT
Si ACAA I GGTA Ill GTIGGCiGIAATCAATGITGGIACT
S
51 TAAGGCAATACCGCACCCTGCC.AAGTTGCTTGCC.AAG
CTTTTATGCACTGTAGCCAACTCTAACAGTAAGACC
6 T . AAAAAAATGTG CAGGTACACATTGCCCTG
CCAAGTTGCTTGCCAAGT
7 CAGG CC AAGCGTCTAGC.CATGGCG
GTGTTGTACAGCCT. CCAGG
Si CTGCGCGGCAACAACTAAACTCAATGGACG1TAACT CCCAGA1TGGGTGTGCGCGTCTTCCACGAGGTTGC
CAAACGTAACACCAACCG AATGGACGITAAGTTCCCG
51 CTGACACC.ATGTGCCAGGGCCGCCGACCTCATGGGA GGCTTTGGAGGACGGGATCAACGAAAGMGTGCCA
CATCGATACCCTAACCTGC
CGCCGACCTCATGGGATA
52 TCGCAGCGCCATACATCGGCC.CCACCATCAAATCCA
CACATCATGCACCTTCCAC
O
CGGTAGGAGTAAGGGCCACCCTTGCGTGCCCTGCGT CAT G TTGCGTGCCCTGCGT
TGAACTGGAGTCCAACAA
ACOTAGITCTCGCCCAGG
52 GccrrGGCCGTAGCIGTCAAGACAGCTCAGGGICIT CCCACTCGGGGTCGCTAACAATCCCGCAIGGCCGA
ACTGGGTTCTTGGCTAGCT
ITCCGA
TGGGGGGAGAACGAGAC TITTGAACTCGACCAGACC
52 C.AGGGGTGATAGCTCTACCCGAGGCATGCAACIGGA CCACTACAGCGTGGCAGATCCATCAAGCCGGIGGA
GGC.ATGCAACTGGACCAG
52 CAGGAGAAAACAGGGOCACAGCTCTGCATTGICTCiG lICTGATGCTGCCTGAGAGGGCGACAACCATGGCG
TCTGCATIGTCTGGC.ATGT
CATGTC CCG , GGTGCCCAAGGCTTCTGG C
52 AATCACTGAGTCGCGAGGCCTATAGCrAGGACCGAG 1 TCTTGCAGTCCTGGTCTGTCCAGCCCTAAGATGGCC GGGCATTGIGGIGGATC.C.
TATACICTAGGACCGAGGC
6 GCTG i AGAAGA A TG
Si CC.CTTATGATGICTC.CGCACGC.GTGTICACGCCCATG
C.CTGITTCGGCC.AGGTIGGGGICTCCACCCCTTTGA
7 GAG TGTT GTGGCCCIAGAGCC.AGTT
GTGTTCACGCCCATGGAG
AGAATTGTGGCGAAGTGC
52 ¨ ETCTGACAACTCAACACCCCCAGGTGCGTGTAGGTG
9 ACGAAGTCTACCGCCTTGGCGCGCCGGC.ATATTCCG CG GGAGGTCCGCTGCTGT
GCGCCGGC.ATATTCCG
ACATGTCCAAGGCATATGG
O TGGC CCG GGCCACACTGGGTTTTGG
C
TCCAACATAGAGGAGGTC
GCCAACAACGGGGGAAAT
53 CCTACCAGTGCGGCCTCTCCGATTACAACCGTTCCCC GGGAGGTTGGGCACATACCGGGCAGTGICAAAC.AT
GACITCAGCTIGGACCCC GATTACMCCGTFCCCCAG
TTCCTATCCCAGACCAAGC ATCGGGTGAGAATTTCCCG
AACACCTCCC.ACTAGTCGAGCACCTAAGGCCrTCTG CTGTCTTTCAGTGGGCAG
GTGATTGTTGGGAGAGTC
t 53 AGGGrITGTTGGGTGGICAGAGCAITGCITC.CCTCAT 1 GATTCGCGACIMGAIGCTTCTTITTGCCAAGGCCC ATC.TCGCTGGACTGICTAC
O GTCCT 1 ACAC , I CAT
TGCT-mccrcA 1 GICCI
TIGGTAAATITGCTCCCGG
6 TGGCGGCGCAAAATCGCCATICTATCGCCCGGAGC ' C C
CATTCTATCGCCCGGAGC
TGCTGAGTGACTTCAAGAC
TICCATTATGICACCGGGG ACAACATCAAGTTCCCCTG
GAGATTGGCTCGAGGGTC
9 . GT GITAG A
CGACCCICGCTAGCTAGI
CAOCTCTTACCGAGACGT AAGCAGCAGGACTACCAA
O AGC AAACAAC GG GC
54 TT GCGGGA.AIGGCTCCGAACCCCCACTCAGCCAGATC
TAACCACATCAGCTCCGIGTGGTGGCCATGAT TOT ITACCACAGAGGAAGCTT
CCCCACTCAGCCAGATC:A
54 GGAGICAAAGC.AGCGGGTATCAAGCGGGTGGAATTI I ACCGAAAAGGACATCAGGGTCGATGACTITGCGGG
ATGGCTTCCAATACTCCCC AGCGGGIGGAATTICTTCT
3 circris 1 CI CC A G
4 C.TACGCCGTCGCICTCAGCGCGGCGCTGAGAGACT AGGAGGACAACCGAGCZCTAGCGTC.ACCTGGGGGA CC
GCGGCGCTGAGAGACT
54 MCCAW"! I GGCAGC:GGCAGTACCTCITTAACIGGGC
ATCGGETTGGITTACGGTAGGCGGTCGGGCATGACi CTIATCGCCCAGGGAGGT
*FACCTCTITAACTGGGCGG
5 . CiGT ACATG A I
.
TCGTAATGAGCAATTCCGG
I AACACACAA TCAA IGGACCCATCTAAGT AGAACCAGTTAAAGAICTC
TATGGGTGIGTICGGGCAAATAC7GTGIAACC1GA T CrTCTUCTGIAGGITITA TIGGATAGTAAGACA
ICC
54 ACCAATAAACAGG1TTGTCCCAAAAGTTACCTCATAC GATGGGGTCITCAAAACTACTC.TTCGTGATCCAGAC
SS CCFGATGICCATGGTATTGICAITAGOAATGLIAGGI
ACrGATCCAGAI ACATACACCAG IGAAGTAT AAGA AGCAA ICiCTAGGTACTCAT
O ACTCATG GAAGTTTGATACCA
TCCACAGGACAGGAGAGA G
SS ACC1AAGCMCAGTGACi CGCTIAGGC TGATGAAAG
GTIGAGAAAAC:GAAACAGACGGTGGGACT MT& GIGCATG ICCAGATTITAA ITAGGCTGATGAAAGATAC
55 TGCCATGIGT AATCGCCCACAAGCCAIGTAIGTI CCA AACrr GCITCAAACCCCAGTG I
AAGCCATGTATG1TCC.ACA
SS GIAGGGTAGrGCICTGGGIGTIATCACAGGGCAAAG
ACA1CAATAGGGCGCACAAATCCITTCCTCr TC11 AA CAAAACICTIG TCAAGATC
3 Cr TTACTGG AGA
TTATCACAGGGCAAAGCT
55 IGGAATAGTAAACACCTGATGTACGTAGICTCAAC.AG
ATTACCCC.ATAAITTGCGAAAAGCACTTGGGTAATA CCIATCCAAATAGAGATIT TAGTCTCAAC.AGGAGG
AC
4 GAGGACA . AGGGCTTC AGCAA A
55 TGTCGTATGTCAGCAAAACAAAC.ATATGCATACATGG GTTGGAGTGTATCGCAGAGGAAGTGAACCCGACAC
ATGCATACATGGTCCCATT
TITCAGCTTTAGTGATCAT AGAAATCATGATGACTTGG
SS CGTCCCACAATGGATGITGTTMAGTGTGTGTATTM TGC.AATTGCAGAACACTTCAATTCCATCAAAGTGCr AMAGTAAACAAAGAACT AGTGTGTGTATTAATCCAT
55 TGIGTTAGAATCTGIGGAGGCTAGACCAATGGCTAG CTAAGCCCCCCAAC.AATCITGTGCAAACATATATCTA
SS GGGGGIGCTACTCAAATTAATAAGTACTTTACCNICA ATGATTATGCAACAAAAACTGGGC.ACCAAAGATCTT
AC1TTACC.AACACTATC1TA
AACCIGCTGTATTGAGIGACAATGCCCTTGTACTIA TAGAGAAACAAGGCCAAG GGAGTTTAACATAAATCCA
O AAAT (CAUCA GAGAATAATGA TT TAATAG
GTCA
56 GGITGCZTAGT1TC.A1ICTCATGTTAAAGICCGAT1A
ACAAAGCATTCGATCAAAMCCCGCACATCAGGATC TAAGG AI GAAT TAAGAAG
AAAGTCCGATTAGGGAAA
CTTACGGIGATGATTIGAT
2 GC TC.ATGGGC.ATAA TGT
TGGATCSACAAGIGTTGG
56 AACAGCCCATGGGTCCAGCITGCAAAGCCGCTATGTC TCAlTGGTCGAGAGAACC.ACGCTCGACCGAAATCCA
AAGTCAATGGCTGAGTCA
3 C CGAAAG , CC
TGCAAAGCCGCTATGTOC
56 ACTGAGCCC.TCCTGCGACAAACCACAAAAGATGCTC 1 TTGCAGATGCAGCAAGTGTGGAGCATAAAAGCGCT GCCACCCGAACATTCAAC
ACCACAAAAGATGCTC.GAC
4 GACT i CGTGGT C T
CGACCTCAATCGTCAAGCACTG
CG CGTAC GACGTCCTCGAAGCCTGA
ACGCGTCTCGGAAAAACG
56 CGATGAGAGCGGAATGICCCGAAACC.AAATICITCC CCGGGGTTGAGTGTGACTGTTCACTGCCAAGCTGG
AAACCAAATTCTTCCGCGC
56 ACC.AGGATACCCCGTGGATTGICACCATGTCCACCCC AAGTGAAAACAACGC.GGCGCGAACGCCTGCAGATC
TGATCCAAAGTGGGCTGT
CACCATGTCCACCCCATG
ATGGAGACTCTCTTCCCTA
TACTGCGACTACTTCAACA
TATTGTGTGAAGAATCATT CCAGAGGAGGAGTTACTG
9 "TTALTGI AGTCG GICA I
TGGAAGCAATTAGTCTTTC
O CTFTCTCC TGCGICC
TGCAGCACTTAGTCCAAG TCC
CGGCTGCGCTATATTMA
57 ACAATCTAGTAATGGCTGAATCC.AAAGAGGTAAAGT 1 AGTAGGATTFTTGGATGAIGCCACTCC.ATCTAATGC GGGAAAGTCGTAMCTG AGAGGTAAAGTAGTATCAT
2 ACTA TCATACAIGA 1 AlITCGC rrr ACATGA
i ACCFACACCAAGACCCCGCAAGTGCTTGCCTCCTCC ATCTTCCCITCTGTTGCCA
3 CG 1 1T.3 , G GI
CTICACCTGICTC:CCCG.
CCAGTAAGAGAAGAAGA AGAACGTCATATCCTCTCT
4 TCFG ' GC.AGGCCC AGAGG G
TGGGAGAGTATTGGTAGC
S CAACGC CACA TTT
TGCAAACAGTAATCAACGC
CGITGTTGAAGGIGTTTG ACTTGGGTATTGGTACGG
TTGTAAATCCAGCTTTTAA ACATTGGTGATCCAGAATA
7 . CAGAATACAT CTCGCG CC TGGT CAT
57 CTGTCTAGAGGAGTTGTCAGCTGGTGATTTGACAGA TATTGCATTTACAGAACAGCC.ACGTAAGCGGCTTAC
GGTGAMGACAGACATAT
8 CA TATTCAM ACCiA AT TTTACAAMCCCGCCITTG TCAAT
ATCAGATGITCATTGTIGGITGTGCCACAIGGTF TA AAACTGGGTGACACFGAA
9 C:ACTGAAA GCTAAATCCC CC ITGGCACIACTGGICA A
GTTTCTTTTATGGCAAGAGGGTACTCCATCTTT GGACATGTGCGATATTGG TTTAGTACTTTTTCAGCGA
O AGCGAGTC ACAAAAC.ATGTG G (SIC
TTTAGAACCTGATAACAAT CCTAAAAATGGATTAGGG
CCATATAAGGGCTIAAATI
2 TTAAATTITTGG GTATAGGTAT TACCIA.U..1C1GAAAAGC TITGG
ATAGCACAAAGTAAAGTGT CTAGGTAAG TACAGTA.AC GG ITTC:FATITGAAGTTFG
3 , AAGITTGTACAC GCCAAAT CGTTAT TACAC
=
CICTAGCAGCGGCCAACT
AAGTGCGT/TCA TGOC TIGAAA MG IGCCC AACCCIAATGTGGTC.T.ACC
S C.CACCG GC A G
58 ACACiGACCICAGGCCiCiGACiGC:GAGGCTA FGT T CCCiC A TGTCACCATCTGA MI CG
CCAGGCTATGTTCCGCCA
SS AGGGCGTACICiTGATGGCTCCGCCATTGGCFGCC.ACT
TGTTCCTTACCCCCGGTCTACCTGGAAATAGGGAGG
7 TT GGGTAC CCTCTCGTCAlTGAGCGG
GCCAITGGCTGCCACTFT
rrACCATMCCACCAGCGGTACITCTGAGC.ATGCT GCTACAAGGITACIGTG6 8 GGGC CX-.C.G TTGCTGCTCCCGCCTAAT GC
58 AAAACCAACACCCGTGGGICA TACGC1CACi itTAGG GGACTGCGATT CGTAAGGCGGGGT7 GTAGGAAACA
TACGC.TCAGTGTAGGCGC
59 CTGCCUCCAACCGCTITACCTITACTGTGCiCGOTCA
GAGGCCIATGAGGGGICT
O
TGCCAGGGACCAC.GTAGGAATGTCGACCCCGCTGAA GC G ATCiTCGACCCCGCTGAA
CTGGGCTGGAGGTGCGCTACGGGGGCAAAAACCG AGACTAA TGGCrCAGAGC "TITTGATGCCAGICAGAGC
2 CA . 671 TGGTGAGCCGGCTCCT
CCGGCCATAACCCACCA
GACTTGCTCCCGCCTCGGTAGC.ACCTCGTATATGC.0 ATACATTAACCCCCCGGCC
59 GCC.AGATTTGCTGTCCGTGCAACCAATAGGCCGACC. F
CTTGACTCAGCCACAGACGTCGACTGCACAACGCCG
CCAATAGGCCGACCTGCC
S
AATGACAACCCGGCGCCCCGGCTTCGCTGCTTTCA G TTGCGTAATGCCTGGCG CGGCTTCGCTGCTTICA
59 GGAGGACCCGACTAGTGGTCTGCTGCGGATGTATGT GGTC.GTTGTICTGGGGTGAGCGTTCiCCGCCCITACI
CTGCGGATGTATGTGAGCT
7 TG rrGcc CCGAGCTCATGCCATTGT
GCGCCACACTGAGAAGTG
TTGTACGAGCTAGTGGAG
8 AGGC AAT GCCATUGGCCACGTACAG GC:
TIGTAIGCAGGTGITGTG
OCTTGGCGCGCTICCT
O TGGGCCTGGTCACGCCAAGGGGCTGGACCTCGTGT GCCGCTGCCICACGICGTAAGCGACC.GOGGTTAGC
CCITCGCCCCCGATGT GGGCTGGACCTCGTGT
60 GGGAGACCCGCGTETTGGTA i LI i IGCCCTCGAACIT
1TACTCCAGCACTGCCCGTCATGGTGAGCTCGGCAG CTITGl.u_ ILE, AACITGAG
1 GAGT TC , CCCTCGGGCTGTTGGA T
TGTGCTITGGC.ITTCTCTCA
AGGTACTACTAAAGCCGG ACTGCTAGTGAC.CAACTGC
60 ATTGGCTGCATATCGACGACAACAATATACCAAAACG ACTTGACGTTCGTAATGGGC1TAAAAAATC:CGTTGG
AACAATATACCAAAACGGC
CATCGCCGGMTACAAA
TACCGAAAGACACAGTGTT
CCTTAGCGTGTGTAAATAT GGGTTGITTATTAAGTTTA
60 AAACTCTTTCATATGCCGTTCCTATGATTCGGGCTATA ACAAACAATGGTAGACCGCGC.ATACGAAACTGTGC
TACATTACCAAGCGAAGG GATTCGGGCTATAGAGTAT
60 ACAAC.ATCCAACGCCGTTTTAGTGGAAACGAAAAAC
GTTGCAAAAACAAAGACAG7TTGAAGGAATACGTT CCCGAA 1 II ili : I ATTGC
GTGGAAACGAAAAACCGT
8 CGTC CAATTAA.AGCTC CT C
TOTTACTGCTATTGGACG
CGATGAACACCGTITTGCACT/sGGCC.AACGMTCC AGTTCGACATACAGTAGA
GGCTATGTTACCTGCACT
s 1 I AT TACiCGC 1 CC , ACT GC
CTAGCTCCTGCAAACTTTCTGGAAGCCAGTTTTTCAT GCCGACTAACAATTATACT
TTTTTCATCGTCGTTACTTC
61 CCTCTAIGGTTGAC.AGCACATITCTTGCTGGGTTATTT
3 AAC_ATCC GATCG A CC
TGTGTAAAGGTAGACAAC CAGAAACTATCCGAC.AGA
TCACCTAAAAACCTGACTC
5 . C GCACGC A
CGITICCTCCTICiCAGAC
ACGAACKGIGT TCCGICG
CACGGGGCGACAACACCGTGIGTACCC
7 CT GC CfCG ICCACAAACi ICGGC
8 I CC; GCCGCICTCACAGCCI CA
TCCTC:CCAGTGACCCT
TGGGCGGGTATCATCGGICTICGGACAGCGTTCCC
"FCTAGGTITGCCKCCCG
AGCTAGAACGATICGCAGITANTCCAGTAITIGTCT CGAGAGCG TCAGT ATI AA
ATIAGATCGATGGGAAAA
CI GAAAAAATTCG ACAGCCTTCT GC AATTCG
TCAGGICAGCCAAAATGCCIGATGTACCATT ATAAAAXT ACAC:CAAGGAA *FTTAGACAAGATAGAGGA
1 . GAAGAGC TGC GC AGAGC
.
62 ATCTGGCCTGC3TOCAATAGGCCAT CAA] GAGGAAGC
CCAICAATGAGGAAGCTG
GGACCAGCGGCTACACIAGAAGCCAAAACfcrr GC TAGACCGG TICTA IAAAA
GAGCAAGCTTCACAGGAG
62 TGCiAATATCGCT GGTGAICC1TTCCCACCAGGGATTA
GIAGCATGACAAAAATCITAGACiCCIGTATTGATAG ACCATACCIAGTAIAAACA
CAC:CAGGC1AITAGATA ICA
TTCCTTTGGATGGGTTATG CCATCCTGATAAATGGACA
S TGGACAGT TAATTGCCTTAC AAC CT
ACAAAAGGAAACAACTCAGGAATC GAGCCATTTAAAAATCTG ATM GCAAGAATGAGGGG
62 CCA lIGCTCTCCAKITACTGICiATAICAGGAPACi ;AC
CTAGIGATITIAACCTGCCACCICTGACAMATCAC GAACAAGTAGATAAAT TA 'FCAGGAAAGTACT
AlliTT
ACTGACP.A1GGCAGCAATTATTCCTGCTT G TATOCIGG TAGCAGTTCAT TACIAAGCAGAAG1TATI CC
9 ATTCCACiC ATTC:CCG CT AGC
TICGTAATAACAAAA TGCCAGI GIAAAACACCATATGI AT
9 GGATGGT CTCITTCTCC G1n:C/4 GGAAAGCTAGGGGAIGGT
63 GCCAAGTATTGTAGAGATCCTACCTATTAGGACACAT AACACCAAAAAAGATAAAGCCAC:CTTICTGGGGCTT
ATTACTITGACTGMTTC ATTAGGACACATAGITAGC
O AGTTAGCCC . GTTCCAT AGACTC CC
63 GCAGAATTC:TTATTATGGCTTCCAC.CTIAGGGCAACA
ACAACTGCTGT1TATCCATTITCAGTCT6TCGAGTAA TMCC:TAGGAITTGGCTC CTTAGGGCAACATATCTAT
TGTAGTGCTAC.AGAAAAAT
CTGGTMGCGATTCTAAA TGGAACAGGACCATGTAC
TGGIGGTAATAGC.AACAATGAGTCCCACTICTCCAA TAPACATEITGGCAGAAAG
CAATGTATGCCCCTCCCA
CAATGACGMACGGTAC
S GTACA GCCAGG
GCAGGAAGCACTATGGGC A
AAGAIGGGIGGCAAGTG TGGATGGCCTACTGTAAG
6 AGGG CTCa GF GG
63 AGCAAGCTCGATGTCAGCAGMAGCATTTCATCACG AGGGACT11CCGCTGCiGGAC1GCAGGAIC1GAGGG
GAGTGGAGGTITEGACAGC TACiCATITCATCACGTGGC
63 CGTCAGAGATTCCCAC.CICGGTCAACGACATGCAGIG ACGACCITCACCGCGACTCCGCCTCGAACATCTCCG
8 Ga TGTC ACGCCTGGGGGATGCT
CAACGACATGCAGTGGCT
GTTTAGGCGCAGGMCCG
O CCCCCTTCAGCACCTACGTGGAGCCGGTCAGCAGCT CGCTCTCAAACAaTCCOCGAGGGCCITTCTGGGGC
GCAGGGCCACGGACAT GAGCCGGTCAGCAGCT
CTTCGAGGGCGTAGTGGC
CGCGACAGGAACCGGTA .. GCGAACTCCACCGAGGT
64 CGCGTACATGGCCAAGCTCCAACACAACAAACTCCC.0 TCCITGTGGGACGAGAACAGCCAAGTTAC.ATC.ACCA GTGAGCACGGGGAGGGT ACACAACAAACTCCCCCTC
TAACGTACATGTICTGCGC
G GATCGA C
AGTACATCCCCGCGTACG
64 GCTCCGEGGACAGTTCTCC.AGGACCTCCGGGACTTC
GGCTOCCITGTTTCGCC GGACCTCCGGGACTTCGA
TCCTGAACACGCTAATGTG
64 CGATACACGGCAACCCCCGGC.TCTCCTCCAC.CCAAAC 1 CTCTCCICCACCCAAACGA
9 CIGGIGTTCGAACTGGGCCACACCGCGTCCICCCiACA CGAGG GICGGCCACGCGOTA
ACCGCGTCCTCCGACA
O CCA TAC .. CGGCCATGGTGCTGCA
CAGACGGAACAGCTCCCA
65 GCAAGGGGGAGCTGCTGATCTAATGGGC.GTAAAGA GAGAGTCCGTACCCGCCCCACATGAACTACGGGCC
ATGGGOGTAAAGACGGGC
CTAAGGCCCACCGICACG
3 GaGCTOCCGACACAACGTaGGGCGGICAGCGAGA CCGAGAACGCGAGGCCCATCGAAACAGCCGCCIGG C
CGCTAACCAGCAGC TCCA
CCACGACATCCGCGGCTITACACGA laGGCGCTGC G1GC TGTCCi TACGCGGGGAA CACGATaGGCGCTGC
6 CG CI ACA AGGTCGCTC.GCTG ICC --CATCATCICTAAGCGCGC:G
65 ACGAAGTGAACCAACTGCCGAATACAGCGTC.LIGAA
7 CACGT GCGCCiACITCGCCCAGTTCCGTAGTaCCGMAGA
CCCAACGCAACGCCTAC "f ACAGCGTCCIGAACACGI
ACGATCGTGCTGCZCGAGCGGGTGGACCACGICA CAC GCTGGITTGGCTCGTCC
CGGGTGGACCACGTCA
65 Cia GACGCGTGC ITTIGITCCGCCAGCGCGTG TAIGA
GGACAAGGCCGGGTCCCGTATCAGCCTCGCAGACC CGCACCAAAACiC:AGACiG
GCCAGCGCGTGTATGAGT
66 AGAGOACGCGTACCICIGCCCGGAAAGACCACTAG aCCCGGAAGGTATTGCTCC.;CAGCTICAIGGAGGG
ATGGTAAACGCAAAGCCI CGGAAAGACC:ACIAGCCC
O CCCG C.AGCC .. CC
1 C.CACCAACGTGCCCTACCCCCAGGGGCAGGACGAA AGG GCCGGGITGAACAGCC
CCAGGGGCAGGACGAA
66 TIGTCiCGCCiAGGGGCTTGGCTIGGGGGATTCGCAG
GAGCGGCCATTGGGTTC CITGGGGGATTCGCAGGC.
66 CCCCGCGGGATCGGATACAAGTAGGTGATAAACAGC CAAGGGGGIGTGGGTCiACGAAGCCCCGGATCCTCG
TAGGTGATAAACAGCGGG
CAGGGCGTTGTAGTGCG
TCAAAGAACCi ICC TGTTG
GGICCATGITCGAGGGCGGCGCCGGCCAATICTICC C GG
56 TcGAAGGCGGAMAGTCGCGCGTCCTG TCrCTGCAG GACCITCA1 GACCGCGCT
6 T C.G T I GTGCGCCGTGCTGTGICTG ATCGCGATCGGGGGAG C
GTCCTGTCTCTGCAGTCGT
AACACCATTACGGCCCIG
TGCAGGCGGTTGTCGAGGCTGCTTCGGCGGCTCCT C C
TGCTTCGGCGGCTCCT
66 CGTCAGCACCTICATCGAC.CTGGTGTAGACCTCCAGG GTGACGTCGGCGCGACTC.AGGGCTACGTGTACTTC
8 GGC ! GAGG CCTTGATCTCGTGGCGC
GTGTAGACCTCCAGGGGC
66 GCCGCACCTCAAGGAGG AAC:TCCAATCAGCGCCCGA
9 AC AGGAGCCCCACGTTCTCGAACAGOTCGCCC.AGCC
TTGAAACTCGCTGACGGC CCAATCAGCGCCCGAAC
ACGCGATGGTCTCTATGTC
O CC CC .. TCGCGCCGTAGTCCAG
TTTGCCCCCCACAACCC
TGGTAC.CGCCTCAAGCC
AAGTGGIGGICCATGACG
AGACGAACTCGAAGGCGG
ACGTCCIGCGGCT CATGAACGACIGGGCCG
G GCTTISTGCGCATGCCCCGTGCGAACGGCGTCCTC
GGACTGTCCCTCGGAGAC ACGAMTCAAGGGGCG
67 CTIC.GCCCACCGATICCTGICCAACAATCTGTCGACA
AGGTGGGTGCTT
GATGCGACGCCTGTGC
GGCGTTCAACGACGCC
67 CLIGCATGCCAATMOCATGGAAACCCACGMCCAGT CACACGGCGTGCCTGAAGAG1TC.AGGTCCCGGAAC
ATGATCGACAGCACGC.TA
ACCCACGCCCCAGTG
MACTTGACAGAAGATAA GCAAATAAATCAATTCAGC
C.AGCC.G ATGICTaGG AAATGGG OG
ACACICAACAAAGATCAACITC7GTACTATC1GC1GT CATAGTICAAATGGAGa: TTAAGGAGAGACATAAGA
AAGATGAAAG GCTCCG TG TGAAAG
69 TGGGTAGTMGCCTITATAACGTITGGATAGATCIGG GC.CAACAGCTICTATGAAG lb I I
GTGCTATACC.AAA TATAGCGGCATTAGTAAT GGATAGATUGGTOTACA
TCTTACAGC ATGAACAA , AACCA GC
70 CCCAITATGCCTAGGCCAGCAGCATCATTATTG us. I II
ACAGAGGTACACCAAGGAATCAATTGAGTTGTTC.A TTCFACCATATATTGAACA
AGCATCATTATTGTCTITG
GACT ., GCATATGC ACCCAA ACT
ACACATTAGTAGTAGCGAG
AGCGAGTG ' ATTCCTGA ATGCTTC TG
72 GITTGITTGTTGGTC1TCTGTTGGTTGA1ITC.IGATCA
CAACTCACCCATCC.AACCAMCTITTTCCGGGTGGC TGATAGTGACMTGATCF TGATTICTGATCAGTTACC
GTTACCAATC TAGT ATCACTT AATC
73 TaiGTGTGGATATTTGTTTCACTAGCCAGC.AGATTIA AAACTCAAGAAGTGCAGTGC:TAGAAGGACACAITG
CCAGCAGATTTACTTATAA
CTTATAAAAGAAC GCACAT GGTGCCCATGTTCC.AATC AAGAAC
CATCAGTGTCAGAAATAA ACACACTTGAAAATATAAC
ATATAACAACC GAO I I ca I GGCFT AGATC7G AACC
CCATCATAATCACCAACCC TGTGGITC.AACTAATCAM
AATCAAACA CAGATATTGTGA IC CA
TTAAGACTAACAATAACG
TCCAAAAACAAGGACCAAC
CAACAACAACCAAAATAC
AAACAAC TTGGTAT AACC
AGCCCACGACAAAACAAC
78 TTCCCITTTGACITGIGTGITCTATCAGA/sCTACACTG 1 CCCTCCACTCAACCTCCTCCTGTGATAGGTACTCGG TCAAC.ACCACCAMAC.AA ATCAGAACT.
ACACTGCTCA
i TGITGITF CGTATTATTGCAAAAAGC CAAATCAACCAGAATCAAA
A ICAAAATCAAC 1 GA , CATG ATCAAC
AGTGTCTIAACCAGCAAAGTGTITCACAATAGGTAA CTGTATCTAAGGTC.CTGCA
CTAGAAGGGGAAGTGAAC
AGTGAACAA ' CAACTGTTT C AA
CTATGTACAACCAACACA
TGTTT TCACAAA AAGG
AAGGGICCAACATCTGM
TGITATCATTAATTGCCGT
CRAG TAGATGGGITG TGG
ACTGCTCCIATACIGCAAG
83 CACTCCAACTACACCGAGGGAAATAAGTGGAGCTGC GCCATGAGCAAACTCCTCACTCTTCATTGTCCCTC.AG
TAAGTCTATGGATAAAAG AAATAAGTGGAGCTGCAG
. AGA TIT CATCGA A
GAATGTGATTTATACAACT ATCAAAGTTATCTTAACAA
ACAACCCT TaCIGCTAA GTCT TAG GCCI
GAACCTACATATCCTCACGGGCITCTCTGCITTA TAA TCTGAGAGACAACiCIAAA CAATAGACTGGCAGTTACT
G rr ACTGAG AAGGG TM AT TAC GAG
GGTGTATGCATCTATAGA CTTAGCATAGGAATICTFG
AATICTIGGG AACITGAI AATTFG TAACAAG (31.3 ATCACTCCCCTGTGAGGAA
CiGAAC CG GGCGACACTC:CACCATGA C
88 ACCiATC IGACCCiCCACCCGACC:AAACCi TAACACCAAC CAGGGGCCCTAGATIGGGTCi TGTGGGGA TAGGCCG TGCACCATGACiCACGAAT ACCAAACGTAACAOCAACC
CG ACGTCTA C G
89 16GCACIC:GCCTCCAACACiCiCATCGATACCC71ACG 1 CGCAIGGCGTCCGGGTFCIAGAAAGAGCAACCACiCi CGTAGGTMCGC:AATTIG CATai ATACCCITACGTC/C
, CiC.C. AAGG 6 6 .
CCIAGTAITGTGIACGAGG
TC.GAGGClITTACCCTCGCGGCCGATGC.CATCCFGC C3 CC.
GCCGATGCCATC-CTGC
91 'FIG 1 GGGATC:CGGAGCAGCTTGAIGATGAACTGGT C TCAIGGACATGATCGCIGG
TGCCC.ACOVTGGAGAA ATAACGGGTCATCGCATG TGATGATGAACIGGICCW
92 TGGCiCAAAATCGGTAACCiCGTTICAACTCTTCAGGCT
TCAGTTATGCCAACGGAAGCGGGICTTGGAGGGTA 'I-KAM:ICI TCAGGCTGTC
GTCC GITiCCA GCTGGTTAGCAGGGCTCT C
TCCTTAACAACACCAGGCC
GGCCA GCAG TACCTACAGC1. tGGGTGC A
GTACGIGGGAGGGGTOCi CG ACAC A
GC.ACAGGCTGGAAGC6 95 ACAAGP.CCGTGCGTCCCGGCGGC1116GAGAACCFC CC
ICGTGITCTICRICITTGCGGAGGCCACATCCCCiF ACTCATATMCAAGCGGA
OGGCTTIGGAGAACCTCG
ACCC CIA C.GCGATGCCGTCATCTTAC
TCITGTAGTACACCCGACCC
97 CGGGAGAAGACGACTGGlICCAAGACTGGGCGCAC GGAGACCAAGCTCATCACGTGOAAGCCGITGATGA
AACCATCTCACCCCTCITC
AACG TGTCACC G
AGACTGGGCGCACAACG
98 CCCGTGGTAGACAGTCC.AGCATAACTGC:TACCCAAAC
GGAACGAGGAC:CATCGC.ATCACAC.AAGGTCTTGGT GGTGAGGTCCAGATCGTCi AACTGCTACCCAAACCTTC
MCC . CCACAT T C
99 N \ AGTCCACCGCTITAGCCACTCACGCCGITIGGCCTA
TCCCTGTGGAGAACCTAGGCIACGGTGGAGAGGACI
T TTGTCCG GGGGGTCCGCTGTTGT
CACGCCGTGGGCCTAT
AGGAACTTGCCGTAGGIGGAGTCCTMTATCAGGAC
CGGGTGCTCAGGAGGTGCTTAATG TGGCATCCGTG C-CTAATATCAGGACCGGG
0 CGGGGT GAGT AAGGCCOµTGGGGTTGAT GT
10 CGAGCTCGTCGCACTTCTTCTGGTGATCAAGGGGGG GAAGCTGGICCIC.ATTGGGCAGACAGACACGICAAG
GGTGATCAAGGGGGGAAG
CAGGACIGGCAGGGGGAA
CCATCGTGGGACCAGATGT
TGCTTGAACTGCTCAGCGAGCGATGGAAGAGTGCTC GTCCCGCCATGCAGAGGTTATAGACCTCGAGTTTCT
TGACAGGGAGGTTCTCTA GA IGGAAGAG FGCTCTCA
4 'TCAGC GCCAG CC GC
TCAACATATTGGGGGGGT
5 (31(16 G ACTGGCCAAACCCITXT. CT 66 6 TGCATCGCTCTCCGGCACGCTAATAGCCTTCGCCTCC CGCCA:GCCITCACTGCCATATGCAGTCGCCTCAGGAG
CGAGGGGGCAGTGCAA GCTAATAGCCTTCGOCTfr.
10 GTCCIAGGACCGACCATCXTCATTATGCACACTCCiCT CTGCAGGAACATGTGGAGTGGGCAGGAAGGGGAG
7 GCC TACAGGG , TATAGGGGGGTCTGGCGA
TTATGCACACTCGCTGCC
TTCACAGAATTGGACGGG
GCGCCTACACAGGITTGC
10 AGCGCATCGAAGGAGTCC.AGAACAGGAGATGGGCG
TGTGGCAGAGGAGGATGAGC.GGCGAATCTCCGAG AGGC1. AACCTC.CTGTGGA
CAGGAGATGGGCGGCA
11 GCCGGAAGTTGAGGAGCTGCCCTATCIACTGCCTTG ACGACAACATCCFCTGAGCCCGGGGGC.ATGGAAGA
GTGGTCCTCACCGAATCA CCTATCTACTGCCTIGGCC
O GCCG ATAGGAC AC G
11 CTITGGCAAGCACTGCGTGAACTC.AACTCGTTGCTACG
AGACTGC.AAGTTCTGGACAGCCITTICAC.GCCGCTG AAAAACTGCCCATCAACG
CAACTCGTTGCTACGCCA
GACATCCGTACGGAGGAG ACCAATGTTGTGACCTGGA
ACATCATGCTCCTCCAACG
11 TGCGGCCACCCTATTGATTTC.ATTCAAAGACTCCATG TGCCTCAGA/VACTTGGGGTCCTGGACAGAAGCCT
GCCTGC.TACTCCATAGAAC TTCAAAGACTCCATGGCCT
S GCCfC AGCGC C C
11 CCCATATTCTITICATTGCCTrTTCrACTTTGTGCGAC.A 1 AGATCCGAAAATCGAAACGAACACATGAAACAGAC AGGTACTGATCCAAAATG
6 ATGC1 1 rt CTAAGI GC GAAG
GIATGACTACAAGGAGAACCGATITTAGTATGTGTG TTGAAATAATCGAAGGGA
7 CIGACACT 1 AACTI Ca: LC , GM
GAACAATGGCCIGGACAG
AAGAACATGAGGAAAACAAGCCACTACTITCTCTGG ATGAAAAAGGCATAAACC
TT TGGA ' TGCCATA CCA
AlTACCTCCTGGCTTGGA
GAATTCAACAAGGCATGC TTGACAGATTCAAGTTGGA
9 TGGATTG TATTCAGTAGC G n-G
AAGGAAGACGGAMACT TTATAAAAGGAAGATCCCA
O AGATCCCATTTGAG TCTC AACC TTTGAG
1 . CCT TCA ACCTTC
GCGATACCTAACTGACAT TATTGAGGAATTGTCAAGG
2 GGGA CGCITIC (SAC GA
TTGAGGATGICAAAAATG CTCATCGGAGGACTTGAAT
CATGATTTATAAAATAAG CGAAACCGGTACATATAAA
TACATTGCTGGAAGCCAT
S MCI 1 11616(1 A
AIGCATGGAAATG1 COM' 12 GCTICCACAAAAAACCATCCATTACCATC/GaiGATAA I AA 1 TCii AGAACGIAAAACAGGGGACACTGTCCTCC
CATGGCGGATAACC:AAGG
GGGGCTATATCCCTAGGACCitiCITGGGCCTFATCAA AGATGTAGAGGC.AGTGCA GC1CAC7CCGTA
TOTAAGTC
7 . AGTCC 1, ACAGCC AC C
=
12 AGGATTGCCATCTAATGCATIGCTOCTACAACACAAT TGTGC1T1'GATAGAAAACA 1 TGCTACAACACAATGCTGG
12 Ci MT TGCTAGGG ICTCTATTCFGTCAMITTAGCGCT CCAGGAAAGGCTICTAGACC
TCCAATGTICT AITTG GGAAATCATICTITAAAA
IGTITGICCA
CACITGCTGTGTCJVAGC GCACAAGGCCATTGAACT
13 CCAC.ATAATACACTTTCiTC1TTCTGCACICiTGTGCAGTA
AAAAGGAAGCATCTAACiTATC1CIGGAGAAAAAATG ATGGGGAGGGATGGACT AGGTGTGCAGTAATATAG
13 GGGIT ACCCTCATATACAGGA 1 TGICAGGACCMCC AGAIGT i 2 TTfTTAC aiGGAT CGTTACAGCAAGTGCCTG
CAGGAOCCTGCC.TITITAC
ATITCCACAAACACCCATIGCTATAAGIATAACTAG ATACACCTATACCCACA It 3 ATCC.1 GATGCAGGTAA TAACTGCCTCATCGCCAT CT
13 GAGGTAGCTACATCAAGGGGTAGGCTATGGATTITA ATCCTGA 17 ACCTTAA.A.ATGGCT
4 AAACCCT GCCTTA GATFTGG Cr 13 CCCG Tr GTAACCAGT ATGGCTTATAACTCCTAG iGGC
CCCAGGGTCAAAACAAIGGTATTGGGIGGTATCTA CITGAC.AGITCIAT T1 ATG
AACTCCIAGTGGCTCTATG
6 CTATATTCATG . CCTG CAGC CATG
13 ACTAC.CiTCCGCCGTCTAGGTGACICGCCGGACAAAG
7 TCAGGAGGCACCACAGGCGCCGC.CGO.CTTACCTGA G GGCTCGCATCTCTCCTTCA
CCGCCGCCTTACCTGA
13 ACGAGCCCCCAACTGTGTACTACAACTC.TGCTGGCTC
CGACTGACTGCCGGCTT CAACTCTGCTGGCTCGG
13 CrGGGCTTGGGTTTGGATGAGTCCCTCC. TAGCCAGCC
CGCCGTCATCCTCCACCCAAACATAGGGAGGGGGG AGAAACACCGGTCTGGAT
CCCTCCTAGCCAGCCTAC
O TGC GGGAAG GC
GGAAGCCCOCAGTTTATGC
14 GTC.TfTGGCACTCCCTGGCAAAGGCGAGAATACCAG CCTTCCTGGGCCTCTATCCTCCGAGGCGTTCTACGA
TGICCAAGCCAAC.AATCC AAGGCGAGAATACCAGCA
CTAGGAGATAMTTGCGG
OCTGGACMCCAAAGAC.A
TACCAAAAACAACTGGGC GGTCGTGATGCCTIACAAC
GGAGGCAGMATATCGA
CCCACAGCAGTGTAACTA
GCCT TCCA , CG , CCGGCACTAGATACGCCT , AGGCATACTGATOCCCCIAGACCCGCAGCTGCACTAA TATGATGCAGTCCCCAAG
CTGGGCGCTACCTGAACT
14 IGGGGTAGGGGGGCATGTAGCC.ACTOGGTGTICCA GGGGCAGCTACITGCCTCAGTGGMGGCAGAGTA
CCACTOGGIGTTCC.AGTC
GT CATTACCTTCGGACCCI. TG
AAGGCAGGTCCTCTCAGG
14 AC.ACATAITGAGGAAGTCFTGGGAATAGGCAAGCCT GCTATTC.GCC.ATACTACCCATOCATAGGCTITTAA
GCTATTICCTCMGCTCC TAGGCAAGCCTAGCTACAT
9 AGC.TACATAA GAATGCCATT A AA
GCGCTTACCGGGATGAGAGCATGGAAAGGACCACA AGTTCTGCCOMTGGATCCCGTCCGACGGOCITGG
TCCTGGTCTTAATAGCCGC TGGAAAGGACCACAGGAG
O GGAGG G C G
15 GGGCAGGATTGCAGGGTfTAGACATCCTCGGTGATT GTTC.ATGGACCCICGACCTGCATTAGGGCAGGGIG
CATCCTCGGTGATTACAGC
GCGGAGGCTATTATTCAG CTCTTATTCAGACCCITGTT
15 GCGGGATCCTAGCGTGGGAC.ATAACTCCCTCATCCTG AGCTATAGTCTCCMCCCC
CATAACTCCCTCATCCTGCC
15 GGCMGAGGCCATGIGGITGAAACCCTGAGTC,ATCC 1 ATCCTCCTTCTCCTGC.AGCAACTTGGAAGGCGAGGT AAACCCIGAGICATCCCTG
t 1.5 AGAGCGGGAGAAAGAGGAGGAATGCGCCGTGCCT1 1 5 CF 1 CG , GCGAAGICAGCGGCC T
*FGCGCCGTGCCI TO' TATCAGCCCACTTCCC.AGGGTTCCAAACACGTAGAC AGCACAGTTTCCTCCTCCT
CCAAGGATAATAGCCCGTC
IS GGAGGGGAGTCGAGGGATAAGGGAGCATCAGATCA ACCCTTAAGGTCCTTACCCCGCGAGGAAGGAGGGT
GAGCATCAGATCACCTGG
TTTGGACATTTACCGATGG
8 GATGGC AAGGAAGA crfCCG 17CCACTCAACC C
GAAGGTCAGGGCCCAGA
9 . CtC CATCMG A 'FAAGGCTCT
GACGTCFCCC
ATAAAAGCGTGGGGACAG
O ACAGII GC CAUGGCACGCATACGGCT
TF
TATGGAAAGAATAAAAGA
1 C:TC Alf:CACTI ACTAC:GG
AAGFCGAAAGGTTAAAACAIGGAACGGCGTATM GAGTAAAATGAGTGATGC GOCAGATCGAGTAATGG
2 AATGGT GACITGOTI CC: T
TGGTMCATACATGTTAGAGAGAGATTGTFCCACCA GTAATCATOSAAGTTG ITT
GGATACTAACGICGCAATC
16 CTAGTGGA I CTGCTGATACTGCTC:AAAGTCTAAT TAT A I
TGGAGAIGTGCCACACiCACTC1CCIAAGAATOCC AAGTC1AGGAA TGATGATG CAAAG it-f AATIATTGCAG
4 TGCAGCC.A ACC TTGAT CCA
16 GCACCCITATITTCAATGTTIGGAGACitaiGICATCA GGAAAAGGGCAACAGC:TATACTCAT TGACTG
5 . ATCAAGA CTCITCC GA A
=
AATTGGGGAATTGAACATATCGACACTIGGAGTCAT GCA MAI AAAAGCAGTIA GACCF
GAATTFCGTTAAIA
ACAGGGGACAGAGAAACFGACICA.GGGCCATTAAT GAGAGGGIAGIGGIAAG
ITGACCGOTITTGAGAGT
16 CCM:ATI-Mr GGAATAGAGICCf TCTITACi TTCCTA TGIACF
niCiCiACATITGATACC.ACTCF AC !TICCIT TC1CiAAITICiAGCCATT MA
TCTTIAGITCCIAAGGCCA
16 TCCGGATGTGCCTICATCTGAACAATTC.TCGGAAAGG GTGGAGTCCGCTGITCTGAGTGATGCTTAATGCTGG
CAACAAGACC.ACTAAGAG AACAATMTC:GGAAAGGAT
TATCAGAATGGACCAGGCGATTCACA.C.-MAAGTICG C.T.11-FACCi AACC.XIACATG
crrGAGGAAATGICAAGG
O GGGA CITY A GA
17 ICCATTCAAGICCMCGATGAGC11CCAGGACATACr AACACAGT
MGAGICTCMAAACTCCCATMICATIA GAAATTICACCAF iGarr CTTCCAGGACATACTGCTG
17 TCCATOGCTGIATGGAGGAICICCAGCGCAAAATGC AC:AGGAACAGGAI
ACACCATGGCCACITMCCf TIT CCGACICTACTIITCCIAA
AGC.GC.AAAATGCCATA
17 ACf MCAITCHTTCAAGGC:AIGAATTAIGGCf ITCCTI
CTAACTCAAGGTCGCCAGACTIATGGICfTGGCCAA CAGACTGTGITCF AGAGG
"FATGGCTITCCITGAAGAA
GATAACAACCCACTTIKAA AAGGAGAGTAAGAGMAA
4 AGACAACAT . CTTTTTAACT AGA CAT
17 TCTCACAACATTTGCCAGITTCIGGCGAAAAGC.TTGAA
CTITCACAATCACTGGGGACAACCATCGCCAGGAAC TCGTATACTITGITGAAAC
GCGAAAAGCTfGAACAGT
TGITCTCAAACAAAATGGC
17 AGCAAAATCGTCGGATGATTGGC.AMAGA AATACAC
CTCATAGTGAATGCACCAAACCAAGGTCCTGTAGAA AGICTCGATACTGAATCTT CAAAAGAAATACACCAAG
7 CAA.GACAAT TCTGTCC GGA ACAAT
TCTGGAGTAAATGAATCAG
8 GAATCAGCT TATAT CTACCCAGCTTIGGAGTG a GCTGGTGCATGTGGGGA
18 GCC.ICGAGCTTGGGCTTCTCCTrTrACTCGGAGGCCA TATTGCCACGCATCCGGATCACTGGGCT
GGGGTGG
O CGG TT r 18 AGTrTCGGGGATTGTGAACCGTCAGATCACACGCTCr CAGATCACACGCTCTTTCC
CATGCTACGGCTCGGC
18 AGCAGCGTGGCGGCGAAAGIAGAC.GGGGIAGGGAG
TCCAATAAACCCATCOCCICGCCACGGATTCCTGAG CCCAAGGATAAACGTCa: GIAGACGGGGIAGGG AGA
18 GCAGTACCACTACCGCCAOCC.AACGGATAACGCGGT GGGACGGAGGAGAGACGATGCAGGTCGGACCATC
CCCAGGACCAGCCTACGA
AACGGATAACGCGGTGGC
ATACACCACATCG17GGATAATACGGACGCGTTA AT ACACTACGTC1TATAC.AGC
4 CTCTCC CGAACTTTC Tr C.AACTCTTACTGCTCTCTCC
18 CAAGMGCGACCGCAGTGGGICATAAAGCT. GTTGAC
C.GTTATCATCTGCTC.CCGCTGCCGGCCGCTGAAA AA TCATAAAGCTGITGACCCIG
CGGC AGAGT CGACGTCCCTGATTCCCT C
18 GCGTGCTGCATAGCTTGGCGTACGGGAGATACAAGG CGGATTCC.AAGTCAGAGCGGCGGGTCGGTCCAGGT
TCGAGGAGGACGAGTGG TACGGGAGATACAAGGAG
GCAGCCGGC.AGAGGAT
8 GG TCGT TTACACAACGACtCCAGC TG1TACA
TCGCCGGCGG
CCGACGTGCACGAOCT CACTCGGACCGCTCCT
O
CCGGCGGCTGCAGITGTTGCGAACTGTGCGTGCC GTA TGCGAGGAGGACGAGC GCGAACTGTGCGTGCC
19 CGCCTCGiiiii..CrC11CGGIGCCTGCTGIGCTACTGG
GGAGGAAACTGCCGCGGGATCTTCCTCAGAGACGG
CCTGCTGTGCTACTGGTG
19 GCGCCICTCTCATGGATCCACTGGTGAGCTGGGCAA GC.ACGCTGC,ATGGC:TAAATACCC-ATGACGCGTTCG TGAGTCCTTGAGTGACCsG
GGTGAGCTGGGGAACAGG
'FICCACGCTG1TTIC6AAA
3 AC GAIN: CGCAGCAAGTGACCGC
GCTTAACCAACGACGAGG
CAGGTAGCGCATGCAGTG
TGCGCTICAGATCGTGGA
6 GCA flG GATGAGTrCGCGCTCCTC
GGGCTCCAGAAACAGGCA
GOCGICAACAGCGT GCCT ITTGGATGGGC1CCGGGI Cr ATICTa:CCCTCCCICTCG
TGGATGGGCTCC:GG GT
8 MT ACATGCGCCCiGGCACTGATCGAIGGGICCGCTICA AGIT
CGGCTICTTGCTCG AACtCTGcfGCrCAcr6T
CAGGACGIGGIGACCICG
O CGACTGOCGCATGGT
ITCCICTCACCAAGGGCGCCICT GTG GIGO GCAACCGCICA TC:ACCAAGGGCGGGCT
1 CiGC CiccI TCCITCIGICACX:GMI
CTCGGACAAGA TCACGGC
20 CCCTTGAACATCC:CGCAGCACCICiCeGGGAAACGAG CSGCTGGICGGTTICIGACTGGCGGIGT1ICCAGCA
CCTGCGGGGAAACGAGA
20 GCGCTCCAC:GATCICGGCAIGTGCTAGACCGTACCG
3 CiCGCGCITCATTCTGGCCAATCTGGCCGATGGC7GTG AAT CAAAGGCGATGGGTTCCG
TaGGC.CGATGGCTGTG
4 GCTCGCAGTAGTGGCGCACGTGTGCGCGCTIIT (CA CSGCAGACGGCTGIGTC
CGICTGCGCGCTCTr 20 AGIGai ATACGGGGCTGGGAGGGCIGUCCAACI7 GGCCTIGGIGGACTICCAGGCGGATAATCTCCTGG CfGCLCGAGATGTIGACC
TG GCCA C
AGGGCTGCTCCAACTTTG
20 AGAGGGCTCATCCGACTC1GC.CCACCICTGATGTCC7 6 AAGCAGCCGCGTCTTAAGCGATTGCCIGGACC.TGCGC celoG GCAACAGCCGCGACGT
ATTGCGGGACCRIC.00 GGTGC.ACAGCGACAGTCi 20 TCGCGCATAGIGIGGAAGTGGICGCCCACC.ATGAA
8 TACCAGCGGCCCATGGTGCCGCCGCGGTAGAGCAT GGATTT C.CGACAGGLICTCCACCiT
OGCCGCGGTAGAGCAT
CCCCMCGTAITCTIGICGCCICCACCAGaGGAC TA GGAATTIICCCMGCGC A
GCCGCTCAGATTCCAGGA
21 TGCTOGGCGCCAGGAAAAACITCATGACOCCCAACi TGGCGCAGTTCTGGCACGGC.CACCGGGAACAAAGC.G TGCi ACCiTGGGAGCGC-ACXT
21 AGACGCGC.01 ACTCGGAAACACTACA TCAAAGGG C
ACCTGACGGIGAACCACCCAAATCGCCCCICACGTC: ACTACAT CAAAGGGICCIG
ACGCGIGTTGTTACCCACT ATCAGCAACGTGTTATCGG
21 CGTCGGGCACAGACCC.ATGGCCIACCGGGATCACGA
GCGACGGCGAACTAACCA
GCAGTACGGGGGTCGA
GAACTC.GCGACCCACG
GGTGCCGCCTGGAACA
CGAAGCCCACACTGACG
GGCGTCCGAATGGTGGGTAGCACGCCGATGTGCAGC GC CCGTGTGCAAGCCAGC CACGCC-CAIGTGCAGC
CGCAAAAACACGC.MC CGGGCFCTCGFCCIGT
AAGCTACAGCGCTCGCGGCACTGGCAGACrTTGGT OCAGMCAGCTITTGIATG
O CAG CG TGCACGAACAGCTGGACC
GCTTCCTGCGAC-ACCAG
22 CGCAGATTCTCGGCCAAC.AGCTACCCACCACGCGTTT
C.GGCTTGAACGAGC.GCCTIGTCGCTGCTGATCTGAC AGGTCAATACGGACTACG
CCGACACCGATCACGAAG
TCTCGCGGCCCTICAA GGCACCACCGAGACGT
TGATCCGCACAGCTCCG CGGAATGTCACAACGCCG
22 GGCTGCAGCGCTrECAGAATTGACATTCACGACCTAG GTIGGACGACAACGCCAACCGCAACTCTAGCATITT
GAC.ATTCACGACCTAGACG
ACGC GGCG GGCCTCATCACCAGCATC C
n ITCGGGTAGCTGCAGGCCGGGCCTTGGTACAGGAGC GTGCAGCAGCGCCTACAGAGCGTTTCCTCTTCCTCT
GGCCTIGGTACAGGAGCA
CGAGATGCTCTCCACCATC
OGCACCAACCACTGGAGG
8 CAGTCCGCCGCTCATTTTGTGTGAGCACGGCGCAGA T CAAC.GCGACGAGCAGC
TGAGCACGGCGCAGA
GCTGGCTTITCTGGAGGAA
O
GAAAGCGAAGACGGCGCCCACGTGGCAGGCGGICT ACC CCTGCTGCACAATGTGGC ACGTGGCAGGCGGICT
23 GTCA TGGCCAATCGTCGTCfCCAGCCTGGTGGAACTC
CGCCACAGCAGAGCCAGCACTITGCGACTCIGGGC
AGC:CTGGICIGAAC VCATCA
GGCTITCTIGGCGTACAGC
23 CGTGCAGGATGACGTGGATTCTTCCGCGTCCfCGTCG TCGTITTCCTCCTCTCCGTCGTGAATGICCTGGGCTA
TCCGCGTCCTCGTCGA
GTCCACGCCGCCTCAT
'FAGTTGCACACGGCCGA
CAAAC TGGCCILT-CTGGTACCIA
7 CATCTCTACCGCCGCCGTGCAACiGGCGGCGACA FCC GACGGI
GGAC:GGCGC.AGATTGC AAGGGCMCGACATCT
8 MG 1 GCCICAC:G ITCAC:CTCCI
CGC3GTGACCill6TG641G
9 CACCGACACACiCAGC:ACCAGGTCCGICGaXICIGTV CAG
TGTTGATGATGCGACGCC CCGTCG MCC-CCITT
TCTCCTCACGCACGCGGATTGCGACCGCGA
O GACCi AGM CTCCTGCAACGCGGCA
AACITGTCCATCAGCGACG
24 CGICITITCGGCGCTC:AACAAGAGGCGCACCVCCGAA GCCACAAAG
TCGCTITTGCCGCTCATCGACCCCCITG
ACCiCGCACCTCCGAAA
24 CCTCGCACAGCCGATGCACC:ATACGCAAGGCATCGT
2 ACGCCCGCCAGAAGTACGC.TCGGTAGGCACGC.CIAT C36 GCACAACGCCGCCAG A TCGGTAGGC.ACGCGAT
24 'ICTGTIGGCGGGGCTCACGCG TCGTAACAAAGCACA
GCGATCACCACGCTCGAGGAACCiCAGGAGATTCIG CGICGTAACAAAGCACAG
24 AGAGGCTACC:CGAa:GTAGCACAACCGCAACCiATIX: CAGCCAGACCGCTAGCCGAAAGCCG
CCCGAACA I
ACAACCGCAACGATCCC
24 GACGGTC.GTGGTCTCCTCGC.C.TGGAGGGGICCGGAA ACCZGGCGTGTCGACTCGTAGGGAAAAAAAGGIGG
S CA GGCC ACGCGTITTCCACCCITT
CTGGAGGGGTCCGGAACA
GCGCATTACrACCGAGTC GTCITCITGCGGTACCGG
24 C:ATCCTCiCTGTCACACGACGAACTGTT GACGGA TGTG GCaTCCFCCAGCCAATCG
7 CAA CGACCGCCICGGCTGATGATACCITCTGGCAC.CCTACC A
ACTGTTGACGGATGTGCAA
24 AAGCCiCGTGCCCACCCAAG I CGTCGTCATCGAAAGG
TGAAGAAATAGACCGGGIGCCGICTGGATGACCGT
8 C ATGGGC GCTCGTCC.CGGTAATCG A
GTCGTCGTCATCGAAAGGC
9 GC GAGGC.CACGGCGCAGAAGAACCGCTGOCCGTCIAC
TCGGTC.AGGGCACTCGT CGAGAAACGGCTGTCGG
O
TGGCTTACGACGCTGACGCGGCGGCCCAGAAAACC C CAGCTCGGTGGAAAGCC GGCGGCCCAGAAAACC
25 GGTC.GAACCTCATAACGGTGGGC.ACGCGTCCTATGA AGCGGCTGCTGGAAGAGGGCGGGTGACGGCGACA
CACGCCITCCTATGATGAGT
25 CITGCGCTTACCGATCCGCTTAGATTCGACGGGGAAC CACCTGGACATCTACCGTCGCC.AC.AGGCGGTTGAG
GATICGACGGGGAACGCT
3 GCTCACTCAAC.ACCGCCGTGAC.ACGCGOTIGGGAG TGCCTCAGCTG1TACCGCGGCCITCCCAAGCGGCA
TGTCTGCTAGTCGCCTACG GACACGCGCTTTGGGAG
GCGACTCGCCCGATCT ACGCTACTACATGCGCCT
S
GAATTTGCCCGTGCGGA
G
ACGACCAGTCFCCGGAC
7 AC.ATGGACGCCGCCTCTGCCGGITCTTCMGCGCA G CTTOCAGCGTCGCCGT
OCGTTC1TCOC:GCGGA
25 ATGCAAAGACGGCCiCGGOACGTGGAI CCCCATTATC
ACGIGGATCCCCATTATCC
8 CC CCGTCTTATCCCGTGC.CGCATACC:GMCGGAGAGTC
GGCACGTCACTTTGC:GG
25 C.GTAAGGAAACCGOGGCGGGCCIGGGAAACTCA A
TGGTCACCGTGGTCAGT
26 ACAGTCCCICiCTTGGGTGTCC.ATGITTGCCGGC17C6 O
CCCGGGCACTGATCCTGACCAGCGTGAGAGCCCCGT AGG GTGCCACGAGGGGATGA AGCGTGAGAGCCCCGT
26 C.ACGCTGACCGGCC ATCAGGCGGAAAAAGTGGATG
CGCGGAGGGTGATTCGCACTCCCGGGCGTCTCCAT TGAGAGTCCAAAACCCAC OGGAAAAAGTGGATGCGG
26 AGTTGTTGAGGTCC.ACCAGCAGTAAACATCAAGGG
GCAGCGCAAAGGCCA
TCGTGCGGAGATTTGITCT AAAACAGCAGAGAACrGA
4 G G .C7TGCCCGACGCGTGAAGGCCGAGCGCAAACAC
AGACGTCGGGCTCMa: ACTGGITCTGC7GACGCG
GTTCAGCATCTTGAGCGG AGCGCGTACACATAGATC
CG TGCA
AAGCTTACAGTCTTGCGG
GGACGCCAGTCGCATG
AGATAC.GTAACGTGTGCG
26 CGCTGTGCTGGCATCG/sGTTACGGC.TGGTGAGTCCA 1 CGTGTCGTACGCGCACCGTTGATACCGCGGAATCA
TACGGCTGGTGAGTCCAC
9 ACC: GACCGCC CCAGC(XiCATCAGCAG 'FCC
IGGCACTIGACGAGC
AAGTAATGGTCTGCAGCTG
O TGC CCCAT AGGTGGCGTCGCCAA
2 AACGCGCTGCACGACCACTTCCATGTTACGCGGCAG AGC GGCCACTACCIGTACGCC.
TFCCATGITACGCGGCAG
4 A OC:GACCAGTTCfCCATCGCGTGCTITCCCCGAGCTCC
GGCCICGAACATC1CCItG CX:GGCCiCiGGICTIGA
(31(3 TGG TTGCCiGCECTCFACCG
GCCTGTCCGATACACGTG
CACGCAIGTGCTACGCGGATTAGAGGCTCGGCAAT
GACGTIGTTGlIGCGGATC
ACCGTCAGAATCGACGGT
27 CTGACGCCACCiCCGCCATTCTI GT TCC ICATACiGCCC
GGTAC:GGGGAGAGATGG
TCTCCCOCGATTCCAGCGTGTCAACGGCGGTCGGA GG C
GTC.AACGGCGGICGGA
CIA
9 C TGCTOCTCTGC.ACGCTCTGACACCAACCAGGCGGC
CGCGGCCCATGAGGTA
O
CACGGCGTGICCTIGCTACCCACC.TGC.GAGTTGAGGC GIG CATGTTGGC-CGTCTTOGG
CACCTGCGAGTTGAGGC
28 CGTCGTGGAGACAAGCAACACCAGITT TCiG GGGCG
1 T1CACZTCC6GGTGGCTGCCGTGTTCGAGCACCTGAC. GAGG CTIGGAACIGGICGCGTC
CGTGTTCGAGCACCTGAC
GTGCTCAGCGGCGIGTACAGCGGGATCACGTCGG CC TAGCCACCTGITGGAAC
3 AG GGTAGCGCTTGCGGCACACCAC.C1TX:16CATCTCCA
GTTGTAGTCGCGGTGGC CCGCAGAAGGCCATGAG
28 GCAGCGGCGAGAGGAGAGGAITCACKi TGGA1 CCT
4 GCCTAACCTOCGCTCGCACTGGCGGCCGAATCFCC Cr.GTC CGTAGCGCTCCTGGATCT
TGGCGGCCGAATCTCC
28 CiCGGAGGAAGA TTCCi TCTGAGCCGTCTTCGTC:GICC TCCICICACi TT
ICCC:CGTCFCCACGTCGAAACCCCGG
CTAGT C GGCGCCGTCACTCGTT
CGTC1TCGTCGTCCC:TAGT
6 66CCGAC6CTGG1TCTCCTCGTGATGCGTGACGGAG GT TTTICCGCATGC.GTTCGA
CGTGATGCGTGACGGAG
CCC GCACGCGCGTTICCOCGTAAGCGCCII AC.AACTCCG
ACTGCTCGCACXGTCT GCGCGAATGTTACCACCC
CATGGGTACGGAGGCGT
GC.ATAATC1GCGAGGAGG
9 GAGGAC C.GGAACGTGACGGTITSCCGTTTTCCCCGCCGGTGG
TTC.GGCGCACCACCA AC
GTCTCTACGCTCCSACGAC
O A AGC CTCTGGGTCGCCAGGT A
1 AAC CGC 1TTGGAGGCCGC.ACC6 GCACTGGGCGTGGAAAC
CCTACGATTTGGCCATGTC
ATTGGCGCAAACTTITTGG
3 TGGC Gil CGGACTGIGGICGACTGT C
TCCACGCATGAAATAACG
CGCGAAGTGCAGCAGG
29 GACTITACOGTGC.GCGGC.GATGGAAACCAGCAGCCC GAGCACTIGCGGOCCATGAGCTGGAGGTCCATAC
G GGATT C. TGCGGTACTCGGCCA
TGGAAACCAGCAGCCCZ
GCCCGTCGTAGCGCAGAATAAGCTGCTGTTCACY.AC
7 11TCCCTACCGCGCGCTGGCGCTCCACACCAACAG TCAC , GCTCTGGCGCAACGAG GCGCTCCACACCAACAG
TCCAGCGCCTGCAGAT
29 CC.CCCAGGGCGTTGTAAAAGTCCITATGCTGCGCTAC GCGACriTACCGCCAACGCCGGICTGCGTGGGCGA
alTATGCTGCGCTACATG
O
CCAGAGCGTGCCGGTA
TGTCAGATCAGCTCGCAG
GGCTCCTTCGTGGGCA
GCCICCGGATCACATGGT
S
CGCTGGATGACGGTGATGCCIGTACACGGCC.GGOGA COG CGGCACAGGTCG TCCA
TGTACACGGCCGGCGA
CCAGGTGAGAAAGAGAAGCCGCCTCTCATCGTGCC
GTAGGAGCGACGCTGACG
i CGGCAGCGGTGGTACTGGTATGCTTACTGTGTGAA TGACGGTGITACTCGIGG 'FGAAGTGACGTIAGGGGA
7 GOGAGG 1 GCGG , I GO
CCACATCTGGTATACACCC
8 CO ' GTCCAGAACGCGG1TGGCAACGCCACAGGCCGTAG
CTTTCTTGGTGGCGCCC G
TACTITTCACTCCCGGGIG
TGGGCGATGCACAACTCITTCTGCTACGGCTGCTG A ACGACAGGATCAGAC.AGA
GCGTACAGGAGTCCTAGG
O AGGT TACGA CO T
1 . AG TGIGCCGCACAAGGCCGIGCAAGMCATTCCCCG
GGIGGIGGIGGGCATCGTGCGC1CGGCAGCTCCTT TTCCATCIGCGTCAGCCIG '16CAAGCCCATTCCCCG
31 CCCCTGGTCTrCAAGCACACTTACGCAGCCCAGCCTA
GTCGCCFCGGTAGCTCAGTAGTCTACGGACCGICIG A TACTGTCCCAGTCGCGA
ACGCACICCCAGCCIAG
31 GAGATGCCAACAGIGTIGCTACAGGAAGCTf ACAGG
CGATAGGCGGCTATAAGATAGAGATTGICTCICITT TACCG TGATGTAAAAACA
3 (T Aicrnrr TUG ACA
ACAGGAACCITACAGCiGT
CTTTITCCTTGTITGACGG ATATGTGCGGTATGATTTT
CCGGCGTGCAGCAAGGTTCGCTTCGTAATICTGACG
S CCAGG 1 ca CGGIACATCCCGCACITCG "fGATICCGTAGACGCX:AGG
31 GAGCAAGTTACCATCOCCTGCMGCGaiAATGGAAA = TCCGICGGCGACCAACACCAAGATG i GGGCCGACiA a; IGGGAACGACAGAAAC
TGC.GC.GAATGGAAACCAT
31 AGGGITGAGGCCiCiCCGTTCGGATCrACGTCCCAGTC
GTAGCGGGAGATACGGCGT ICAAGC:CGCCGITCAA GGA ICTAC:G TCCCAGICIC
7 . TCT GC GGIGACCGTCCTCTGTC.0 I .
CGACCITCGTACCIAGTCGC
9 CO Go TT
CGAGAGATTCTCCAGCCCG
32 TGCGCGCTGCTGC; TGITITATIGCAACACiCITCGACC3 CCiACGCGA TCACCICi TTGCAACACICTTCGACCIG
32 ACGAACiGTCCA.GCGGC000CGGATGCTC1CITCGTT
GCGTTGTCTCGCCTATCAGOGTIAACXITGGGITTCA
GCGGATGCTGGTCGTTG
32 CGCGTGACTCTCTGITCGAIGGCAGGICCAAGCGCCG TACGAGACCCCCiAGGIACGCACTGCAGACCGTACCi CAGCTCCAAGCGCC.GT
32 CCC AACGGTCACACGT CCCiGAAAGCCCi TCCCGGICA
3 Tr CGGAACGGCGTTTCACTGCCTTCTGCACCGCCGACG
AGGGGAGCAACAACCOT GAAAGCCGTCCCGGTCATT
32 GCGTCICICAGGGCTGGAACIAACCCAGCACTCCATC:G
4 CAAC.GC.CitiC.ACTTCCAACAGTGTMITGCACGGCGT AC TCGCGCCGCAGAGTT
TCITGGTGCACGGCGT
S TTCG CACACCCAGCCMCGGATC.GGICGTCGCGT CO
GGCZGTTGGAGAATIGGT GGATCGGICGTCGCGT
6 CCGACGTCCCGCATCCAACTGGCGTGCGAGGAAGA . ACATGGGCCATGTGTGGTGGTGTGCACTGCCGCGA
A TGGCGTGCGAGGAAGA
32 GAACACGCGCAACAGGGCCCiATGTCGGGCGTCCACC CTCCGGGTICCTGCCITITCCAAGGCCGTGGAAAAC
GATGTCGGGCGTCCACCTA
32 TATCGTCGGGTCGGCTGTCCAAAGCCTGGTTGGATCT AACAACGCAGACTGCTGAAGGCCAGGC.ATGGGTCT
AAAGCCTGGTTGGATCTCG
AGCTTCMCCCGACTCCGCGAGACIGGC.AGGGAC A AATTTGGGTCTTTTC.ACGC
TAGCCTCCTCCACCACGG
33 GGCCC.ACITTCGGGTGGAGEICTTGAAGTAC.AGCATG
CGATACCACCACATTCACIGGGCCITIGCCGTCCTCIT CITGAAGTACAGC.ATGCCC
O CCCA CCTG
GTTGCCCGACATCC. FCCC A
33 GGTGCCGAAATCACCGTGTGAAAAACCGCGACTTTC C.ITTCCGTACCGGGATTTC
AAACCGCGACTTICCACAC
AACGTCAAGAGICACGTC
2 ATGTCGCGACAOCCGGGFCGCCCACCCCCATTACACG occre; AG
GC:CCACCCCCATTACACG
33 AGAACATATATAATCGCCGTITC.GTACGGATCAAGAG
CAACAATACCCiCCCTACGTIGGTICAGTGTTGGATC CACATCAACAGAAAACCG
ACGGATCAAGAGGICCAT
4 AGC.ACGGGGTACTCCITGGCGCTCTCGGGGCCGTAT GGA TCGTTACCCCAGC=TGCC
GCTCTCGGGGCCGTAT
i 33 GAA6GAAACCGACCCCGMACTAGC.G1TCATAACAA GGCGACGACAGITCCGIGGGAAGGCACCGCGACA
CGGAATCGTATACCGGGC AGCG11CATAAC.AATC.CCG
TCCCGG G , A G
33 CTC.AACCTGTGCGCCACAGCGTTGCAGCaCCACGAAC
GTTCCCAAGCCACGCC GTTGCAGGCC.ACGAACA
33 CC.CTC.ACTGGGCTACCAGTGCACCGGIGTGCAGCTA 1 7 AGC ' GPAACC CGCGCCCAGAACCAAC
ACCGGIGTGCAGCTAAGC
33 CCGTGCCATCGGIGGGTACCAGAGAACGC.AGCGCA
GACCCAAGCCAGACTGC
33 TCAGCGAAACCGGCACCCCIGGCCGGCGAGTAATAGC GCGiGAAZEGGICGCICATAGCTCCTCC.TGGTGCCT
GGCCGGCGAGTAATAGCA
O
GGCTGICATGGGICTGCGCAGCCGCCGAAACACGA CT TGITGCCGCTGICGAAC AGCCGCCGAAACACGA
CtGAAAGCGTCGIGAGCA
TAGGGGACTACCTCCTCGA
CCAATCGGCGTCGCTTAG
GGGACCTGITTCAGCGIG
t GCAGTGCTACCGTGGTFCCCTACGATCCICCGGCGC GCGFAAATCF GCGGGATA
5 'MCC 1 I , GGCGCiCCAGAGAACGT GC
6 TG1TCGACAGCGGCGTAGACTCGCCTCGACAGAGCC ' ACG TTTAGCATTTCGGCGCGG
TCGCCTCGACAGAGCC
ATGGCGTCCAGGTGTATG
AAGTTCACGTCGCGTCC
GICCAGCAGTIGGICGCG TCCGGCATGGTGCTGC
AAATCCCGCAACACTTGTC ATAACGCCATGGGATCCTC
9 . CA C G A
35 AACC.AGATGGCLACGGACGTCCTGAACAGACCGCT A
CCCTGCTGTIGGCTACCTCTCTITCCTGTGC.ACCGIC CGTGAAGAGCCCTCATCC
O ACC ATC A CM
GAACAGACCGC TAACC
CGCATCTGCTCGCCGTAGATTGGATGATTITGCGCG
CfGATGCIGTTGICGTGCA
2 ATCCCGGCAAGTTGC:GCCiC:CACCACCGCATTAACCG CCCi CCFTCFCGCGGGCCTA
CCACtACCGCNITAACCCi TTCFACGGGTATCTGCAGC
3 T GGC.3 7 GGACCITCIGGGACCAGT
35 GCMCCAGACIGCACGGCCA TAGGCAGCCiACGTG I
ATAGGCAGCCACCi TGIAT
4 ATGG CGTGGAAAACCACCC.GGCAGTCCTCCGGGITCCTGC
35 CGGGGGGGTGAAATTIGGAGTGCAGCAC:ACGCAA
5 . CAGAMACAC-CC.CCCCGC.CCGGACGCCCGACATCCA CTACITC
GCGCACCCTCCTAGGT GCSACGCCCGArATCCA .
GCATACCAATGAAA I GGAA AGGTACT GATC.T.AAAATG
6 TGCTT GICTGAATAC.AT GAAG
CITTGTGCCACAGTGCTT
35 'FGTAGFCGIATAGAT OCC.iCAAGAAATGGACAGTGG T AGGAAAACCGATICACTGAAATTGG
ICGGATITT AT AATTGAAGGAAGAGACXX; IGGACAGTGGTGAATAGC
7 GAATAGCA CTTGTTAGC. TIT AG A
35 TICGCCTCFCICGGACTGACTAAGACACKiAAATGGCA AAICACAGGCACCA
FGCGTAAAGGC:IGGAGAAGYF AAGAATAAAAACCAGACT TAAGACAGGAAAIGGCAA
CCCAAAAACAAAGAACAT MACAAGCCAATTAATCiTG
36 CAAIGTGLICAATIGGGGC.AGATF CGAGCFGGATAG CATTGCAGGGCCACTGAGF
ACCATCCATGGCFGCAC CAACAAGGCATGTGAATT GATTCGAGCTGGATAGAA
O AACT AAG GAC CT
TTCTCCCITACFGACCCAAGGCCfACTTCAAGAACAC AAGGAACiACAAACCTGIA GITCA F
TATAAAAGGAAGG
AAACAGATCGGAGACAIGGCCAAGGITCTGCACAC GGGGATGGAAATGAGAC
CTGCCTICTTC.AATCICTIC
AGAATCiGITAGCGGCATIGGGCAPTCAGCCFCCCT GAT CCIATGAACAGATGGA
AAAGGACCCTAAGAAAAC AGGTCCAATTTACAGGAGA
4 AGGAGAA . ATGGG TGG A
36 CCATTTTCGCCTCTCC.AGAAATTC.ATGGAACTGATTC
AGAAGGAC.AAGAATTGCATATCAGAGTGCTGCTGT GGGATAGGGACAATGGT ATGGAACTGATFCGGATG
36 TTGAAGCAATTTGAACCCC. TCTATTAGAGGGACAAGA
ATGAGAACATGGAAGCAATGGACTTCTTATAGC.CCA ACCTTAGGGTC.TCAAGM ATTAGAGGGACAAGAATG
36 GCATITCAGTMCGCGC.CTAAAGAAATGATG t I 1 i I 1 GGAAGIGMAATIGATGACGGCGAGGCTCAGAGIT TTTAGACGCCAMTACAC GAAATGATG GGGC
36 TCCAATATGTCGTCAGCTICC.TGGGCGCCATTATGAG TFTGGTGGACAACTTTITTMCGAGTCGTAMGAGA
GGCGCCATTATGAGAACT
TATTGGGGGCGAATGTAT
CACAACTGCACCCTGAAGA
O Ci AAAGICCATT
CCTCACCCCCGAAACT AG GAAGIGCGGTICCAGAAG
GGAGGITCACCCTATIT ACC GGTGITGCTITGAATAGTI TCITAGTAATGGAGITGGA
37 GCGCCAGCGAGTAAATGCAAACAC.AIGTCAATGCAG CTGCCGACGTIGCAAGAGGATITACITCTCCACCAC
GCGAGAAATar.ACCTGA ACATGTCAATGCAGCTGFG
3 TTGGCCT ATCTAAAAGCA G , GCTITTGCTAAATIGGCCT , AACICITCTGTAAAAAAGT GIGTACITGCTTTAATTGT
TGATGATACTCGACATCCITGCTAGCACAATGGGAC. GTAACITCAGCCACTCAAA
ACTC.AAAC ' GAAG GCGGGGTATG11TAGACC C
CCAGGTTACCATATTGGG AGGATCTAC.AGTGGACGG
GCATTAGGCCATCTTEGGGATCACTAACCCGGAACA TGATACT1b1 I I 1 I I CTGA
GTAAGCTCTTGACCACCG
CGAGGTGACATGATATGA TGATTGTATTTTGAGATTG
AGATGGGGGAATCATGTC ACAGTTTCTGGAGCAGGT
CCATAGGCATGTGGTCCA
O AAGC cATT G
GCTFCAGCAGTIGCAAGC
GGCAAGGTAGCATITTAA
CTCTGAC.ACAMAGCTCTAGTCCAAACGATCGGTAC ACAAAAAGACTLITCAGTG TCCGTATACAGAT1TA A
GG
2 TAAGGGGT 1 CCITCC It GGT
i ACGIGTCAGITTCTATAMCGATGAIGATCAAATGG CACAAMTCTGGTGGTG
3 IC 1 AAGAAAGCCT A , G CT Tai KT
TCATCGCIT TC
TGCGCTCGTCAGATAGCAAGCCGGTGAAGTAGTTA TTGCGGCAATATTGAGTTG
4 GTTGC ' CCAGCG TTTGTCCCTATCGGCCCG C
AGTAGGATAGCACCTGCCGCCGCGCCTTCACATGA TA ATTITCCAGCGCCACCT
GCCCGCTCATAGGGGTTA
CTGATTTACCGCCMAA GAAAGGC CTCL I ItI III tCCATAGT AGGTATFCCICCTTATCCTT
GATTGGGAAGGGCAAAA AAATAATATTTTGCCAGCT
7 . CFC TF 6 TAGGACACAAT GI CTTG
AAGATCAGTAGCTAAGGA TGTAAGOGGMCTATAGG
TACTGCITTCFGGIGCGA CACCAGTTGGAAGAAIGG AAGTAGAAATLTCGTCGGC
9 CGCiCG GC; CI G
O TIT (.3 cm TCGTCCICTTCCTMCIGC TGCATTOCCACTACCGITT
GGITCGGLifCCIGCGGGAAICCCCTGCCAGAGCG T TGAGCCCGGGGACTA ACC ACGGITCCACGGIGCA
39 CA C:TA ITGAACGTGCGGGACATG1TA ICTI GGTAGGC . GLIGGAAGGICAITAGA
TATCCGGGAACG AAATT G CCIGTCTIAAAATCTCC:FG GTIATC TT GGTAGGCAA TT
39 CGTCGCGGCGAAA TGAT I GAAAGIGGGGGITGTCfT GCTCGACC:GTITTCACG ICC;
TGCMCGCCICAT T IC TT ICT TCCTCGACCACAGG
3 , C:TTCG CCAT I
GTGGGGGTIGTCITC:TICG .
1TGAAGIAAGCTGTITTAAGCCCAGCCTGCGAATIC. CITTC:AAGMAGGCATCT
4 T AAC.AAG AGC CA
GTCACAGTCACAAGGT
AGCTCCTGCATTAATGGIGTAGACACAGAAACTG GITGTANT AA AGCTAAAT
5 GM TACAGCC. CTGCC CA
AGCACTGCCTGTTGM
39 GCTGCC:GGCATCACATGGAT CT I CCAACAIGAGGCG
TGITGCTCCiCATAAATICGCGGCGIGGCAACIATIG
CAACATGAGGCGGTG
AGCAC.AAGTGAGGCGCAAGACTCCTCTCC AAAGTTGGTAAGCGACTC CCGTGGT1TGGTGGAAAA
AGCGTCACCCA
TGIACATAATCCAGCAAGTAAACCCCATIGTC KT AC AAAAAGCT ITAATAACCAT
GCACACTAATAGCGCGAG
TGCTGACGTCACTOGCGATGACGCCGGTCGCTCiCA CAT AGCT TACAAAGGGCG
O GA TCT Cr GACGGGAGCAGGAACAGA
CATCCGCAAAGATGAGGCMCIGTIATAGCGTGCC:
CAGTAGTar.C.TCTCGTA
CAAACGTCCAGGC.AGGGITTCTTGCAMTTITTAA A GIGGICAGGAAGATTATA
2 AATGGTT . AGCCTGAG CAAAC
GCCTTGCGACTAATGGTT
CCCGCTCCTGGATTTIACACTGACTGICCGTAACATC
3 AATCGC ATC.0 GTCCTTCTACGCTTCCAA
TCGAGCTTTCATTAATCGC
40 TTTTGCGTCGCTGC.CAACGTGAATGGGAGGAAAGAC
CACATGCGGGGGATGAAGGAAGAGCAAATGGGTT A ACATATGCACAAGAGCG GAATGGGAGGAAAGACAG
GTAGTGCAAAATAATGAC TGGCACGCAAACTATTAAC
GTAGGAAGACAGCATGG TGCIGGAAAGTGACATTG
AGGATCAAAACACTGTAA CCITAAATAAGGCTGCTAG
TGTCCATITIGGTITCGCC
AACAACACGGGCTGGG
40 GCTGC.GOCCGCAACAGCTATCGCGCACT-CAGTATA
9 GCTGCCCGCATCATAGCTOGCGGCCAGAGCG1TGGT CAT TCGCCGGCGTAGACC.A
OGGCCAGAGCZTTGGT
41 AACICGACGCCGTMCIGACAAGC.GCGATATCAAG CGCCCAGACGTAGGGTAC.AGTCCGGGAGAAAAAAC
AIGGCCCNICGGATAAa: CAAGCGCGATATCAAGCAT
O CATG CACTGT TT G
CGGAAGAAAAGCGTGGA TTAAAAGGGAAGMATGG
1 TGGA GTACTTTTAC , A , A
.
42 TGATGGACCTCGTCTAC.GGCTAGGGTACCGTGGGAC CGCCGTAGACGTCGACGCATTGTACAACGCGACGT
TACATCCTTCCATTGTGCC
GGGTACCGTGGGACAACT
41 AACGGATCCGGGCACAGTITGCCCGACGACiGTAATG 1 C.CAACCAA
AGGAGAAAAGCGGCGG1TCTGTIGTGT ATTGCCAAC.CGTITAGAG
3 GT ' GCGTTAC CC
GCCCGACGAGGTAATGGT
41 GCCATC.TTGAAAGCGCTGGTGCCTATGTTAGCGCCA CACICCCTCGATGATGCCGCACTCCGAGGCATCCTG
TCCCATCTGTGTATACGCC CCTATGITAGCGCCAGAGG
ACrIACTCCTACAAGGCTC
GGG GAGCGC G
CITTACGCTGGCTGTGGG
GACAATCCAAGGATGACA
TGCAAGACAGAAATACAG CAGTTAATGCTGGATGCTT
TAATTTGCAGGCTAATTTG GAAGCTTCTTGTACTCAAA
8 .ACTC:AAATGT AGGTATCCAC TGG TGT
GCCATCAGAAATTTGTTGC
9 Cf TAC AAAGT T
CCCGGTTCCTACACTTAC
42 AGGTGCCGTCTAAATAGGGAATGCTGGCiTTCAGGGT 1 O TIGAC : AGCAAAC: AGAAAC
CIGGGTFCAGGGITTGAC
s GGATACCTGGGCCCCACIATCACAGCIGTITGGCCI CTITAGAAACTFTCAGCCC
1 ITGTGGA 1 ATA , Al GAG
TAGGCAAGTIGTai A
42 GGGTCCACAGCCTAGATCTCGACCCTC.CCAGACTGCT 1 TTTCCGGGTfTIGTGTCTCGCGGTGTACGCCCCCAG CGGGTAACGCTACCACCT
2 CIA ' rrsc A
CCCICCCAGACTGCIGTA
TACCCTAGAMIAGTCAACC
42 GCTCCCCCCCTAGGCTTGTTATGGTTGGTGGGCAGG CCACAGCTGGCGCACAA.GTIGGCCCTAATTGCGACT
GCTA1TTGAGCCTTCATTCCTICAG(3AATAA1TCTTT
5 . CG CTCCGGAAA TGTTFAAACAITCGCAGC:G
C.ACTGGCATAAGGAGACG
TCC.ACCAAATGCTCT1TGC CACCATCTCCATTCCATGC
6 ATGCA GAM; (3 A
TICCATTAGCATAG
7 G rr GT TGITCGCAAF CCGGCGCII CCATGCCTITTICC GCAGG CCAT GO
TCCATGCCITTITCC
8 CGC GATG I GCTACGGCT(5TCAI6GAG C
AAATCCFCGCCITAACTIC
9 GCC:AGA AATTITA.CIG 7 AGITTITGCAGTGCCAGA
43 GCMCCiGGICTMCAAGITTTGAGCGCAGIGATGA A I
GGAAAGCCGGTIGIAAGC:GACGCTOTTGIAGTI CCGGCAGIAAATCTICCG TGAGCGCACIF
GATGAATC
43 AAGCATIGCTGCCAAACACCCTGCATTrACCT1TICCT AACCITCGAGAACTAAACAAACTGTIGTCCGITCGT
I . CT CAAAACC CAGAGA1TACGTGC4IAGA
CTGC.ATFTACCTITTCX.TGT .
43 AITGCTGTCAGACTGCGOICACIGAACrAlTAAGE GA
43 'ITGCGGCTITCGGGCiACATGITCCATCACCITIGIGT
CGCAGCAACGCTCAAAGAAACACTGCCTGACTCT CC
ITC.CATC-ACCITTGIGTGCT
43 CA I GAGCACi TAGCCCCGACGIGGGACACCATTGAAA
TGTAGCCTGCTTICAGCACTCiCGAATCGCCACICCA AA1CTCAGAAAGCCAACC
TGGGACACCATTGAAACCA
CCGC.AGCTGTCTCTACCTCAACiTAITTATTGAACAG GAAAACGCATTFITCCCAC
AGCAGCAACAAGGAAAGT T
rrAACCCTCCCIC1IGIR:CAGCCGAGCiCAAAAGTAC ATGAAITGGTMAGCGCC
CCOCACATGATTTCCAGGG
43 CGGGAACACiGACAAATGCAATAGCTAITCiGT GGAAA
TGAACiATTMTGCTGTCATITGIGTMCITC,AGGCG GGAMTGATACACACC(C
GCTATIGGIGGAAAAAGA
43 AAA I GCTIGCGTAC.AGGGIGAT TICAACCTATTAACC ITGlIGCTCCAG
ITAITGACAAACGAAGGAGAAACT AAACIACACAAACIAAAAC ATITCAACCTATFAA(X:IG
O TGCiGA GCCTCG ATCAC Giii, 43 AGCCAGAAGGIAAAGAAAAGAGAAACtAACTTWTT Gf ATGCF
CAAACAGCCGAAAGA.AGGGTrACATIAT ICGCAGGi CTTATTITArf CIAACTIGT 17CTAGIGCA
44 CCGTAGTACCATCGAC:TGACATTATTGTCACATTACA
ATACGGTAAGGGCAGCCCAAAGCTITTAGTAATAC:C AATGCATCATUCTIGAAA TIGTCACATTACAATIAACT
O ATTAACTGCA . TGAN \ CA GTACC
GCA
1 ACCT. CTTATGC CTGC CIG TATGC
44 TTGGATATTCGGGAAGATC.ATCAGTC(CCICTGTAGT CCCCCAGGATGATTTACCCCTAACAACAGAAGGAG
CTGGGTTTATGGAATTAT CfCCTCTGTAGMT.TITAC
TCAACTGTGCTACACGCTGCAATAGGTACCGAGCCC GAM". AATTGATTTACAAA AAAAGGAC GCCTG
44 fiTTTGC.AAGTCTGTCAGGITTGACCATGGTCTCCGTG
AAATGGAAGGTCAGAT(CTTCTTCCCATACTGGGTT TTACCOGAAATCTGA AAA
CATGGTCTCCGTGTACTC
TTAC.AC.TCAAGCTAGGGG CGGAATAAAACTTAATGCC
S AATGCCCA TGTGAGGG A CA
FLAGG TCTCACCACFGAAACTCCC TGTAAATTCTAGCGGTGCC
6 ci icca: AACCGTT T C
44 GCTCGCAICAAAGGGATAKITAACAGGGTCATGAAA CIGTCCTTAAGAC.GGGGATTGGAGITGITAAGITCC
GTAACAAGAATAGGGGAT GGGTCATGAAAGTAAACG
44 GGTTGATGTGATIAATC7TaCTGTCCTAAATATCCAA CTGGITCCCCAAGCT1CG1G1AACAG1A11GGTAGA
9 CTACACC AAAGGGA , GG CC
CGCCCAAGCaATTCTCTCGTTTAATTATGCATCATCGA TGATGGAAACGAATCCC:f O GCATTC GCAATT C
GTCACTCAAGCGGCATTC
45 GATGCTITTGGGACGCTGGTACAC.CITTTGITAATTA 1 C.CAGCTGACATACACTGTAGTGATACATGGTCATGA GTTAACAACCTCCCGATAC
ACACCITITGITAATTMG
1 AGGCG ' GCGGT C GCG
45 AAAGCTACGTTGAGTACTTCGGATGTAAAGTCAGACT TTCCTCACAGGGC.GGTAGTTCAGACGGTATCGCTAC
ATGTAAAGTCAGACTATTA
OCACTTACAGGAGGAGGA CAAGGTAGAGGC.ATAAAA
AAAAGTCTGGATCTTCAA AGATITACATIAGGCTCCA
CCTCAAAATCTAAGCCATG
GG TTCG C
GITCCGTAAAGACTCCGG
AGGAATATCAACTGCTAT CCATGTGGAGGTATAACAA
6 ACA.AAATF GGC GTGC AATF
AACGGTCAAGCACTTCTG CCGGTACCCTTGTATACGC
45 AGGCAGACATCAAGGCIGGGTTMCI. CAAGACCCG 1 GTTGAGGCTTGIGGGITITCCGCACAGCGTC.TTGGG ATAAGGATGCGGCAAGCT ITTCTCAAG ACCCGTCCAA
8 TCC:AAG I mil. C 6 9 GOA 1 CITA , CC
GCCATGGGGAGAICTGCiA
TCCATTGCTCCTATCAAATC
O ATCAG ' ccrG
ACAAACCCTACCAATCACC AG
GCCTTC.AATCATCATGICA
ATAAACTTCCATCAAATCTAAC GACATTGATTGAAACGAG AATGAATGATAACACAAAT
GCTAAGCCAAAACATACAA
3 . CAACA ATGT17TGGC:IC GAISGAGCCGAGAACAGT CA
46 CCTATGGITCCATGTACAC.ACAACGGTGAAGGACCAT GATGGTGACTGCAGGTGGAGTGAGCTGTAGMCT
CCCAAAACATATTCAATAT
GGTGAAGGACCAIGFTCYC
TGACTGGCTGTAAT GTG TCCGACT TAGCAGACAAGGTGACFC TGGTTGAAAAAGIITACIG
5 TACTG AG T 'MCC A AGT
CACCTGATCCCAAATACIT
6 CA.GGA IC/ACA TGA TGGCI
ACAACC:AACAGGA
46 CC.CCGGAGITGAATCATCTCCAGTGTGTGGAGIACA
canGTAGATCITOTAGTGCCICCC.ATC.CCTCTITCI GCCAGGAAAAGAAAAACA AGTGTGICGAGTACAATCT
46 TCCAC:AICAT TITCATCFGTGAC:AAAGTAGAGGACAC A
1 AACGAT T ITGIGGCTAGATCCiAAGGIAGTCTGGT AGTACi AGGACACAATTC16 8 AATTCTGAT ATGAATCTICT AATACa:ACACATGCCC.A AT
46 cioncrrrrrCACCCIGGGAATAIAAAGCCTAICAACA
CGCIGTGlIGAMCAGAAACiCITGGACAAAACAGC CAAGGTACAAGA IGITGG sr AAAGCCTATCAACACACC
9 . CACCA TGTT IC A
.
47 AAAC:ACCCTICTGAAGITAGCTGAGGGAAAACAAGA GTAAICTCFATGCF
GGCTICCATGATCACACCCT ATA AGCTCCTGCAAAAATAAA AGGGAAAACAAGAGCTAF
O GCTATCG GCTGAAC AGC CG
ATGGAITGACAATAACACCTGCIGCATTATCCCACT CfGAAAA IIGTAGCA LAT 'IGATGIGA
I:MCI TCATAC
1 TACCC. TGACTTTC GGTGA CC
47 GTCG TCCAAGGT TGCAGG TCACAGTGCA ITICTACTA TTC: TGCAGACCATICGACAT
TIAGATCGACAG IATG CGGAAGAGATAC:ATATAA ACAGIGCAT TT CTACTAAA
2 AATCC AGAACAA AAACIGG TC.0 47 CTCCTCCTCTGTAATC.TCATACTCAAAATGACTTAGTG
TCGGTGGTTCCATrTAGAATAGACAGCAGCATACAA CCACCGTTGCAGATCTTA AAATGACTTAGIGTIACCA
ATAGATACGITGGG TTAGAAAIGT GAAAGCTGACGITITGCT AAGAAGTCITA ACAGAAAT
47 GGACiT I ATTF TCACAGCCTGCAACCGAAAGIATAACA
TGAAAGACACAGCAAAAGCiAGATTCiAATITTCAGC GAGGATTGTGACANIGCA
CCAAAGTATAACAAAAGTC
CIGMITGGAC:AAACACCGGGTTGCTGCTTGATGA AAACTGTACAAAAG 1113A
ATAFCAAATGITATCT GAT
6 TGTGATCCTCC C.ITAC TGTGT CC:TCC
I.
7 ACAATGG CTGATATTTFCCK. TAG GC
47 AC.AGTCCAGTTCTGCTAAATCTTTGCGGGAGAAAGA GGACTGTGCATITTAAAACCCAACTGICGAAGGAG
TATGATCAICTTATACTTTA CGGGAGAAAGAGCTTATT
8 GCTTATTITAC . GAGGAGTA CAGACC TTAC
GTCAGGAGAAAC.C.COCCGCATTCGGTCTCTCCTCCT GGAGGAAACCGAGGAAA
9 At. CCT TCC
CCACCACGTCCCCTGAAA
TATTGrfAAAGGOCACAGC
O GGCACAGCA TGCAATA
AGCTAGGGATCCGCCTAT A
GTTCCAAATCITTATGCAA
GCAATTGCCTAGCTGAIGT
ITCTGTIGTGCCATTAGAGGATCTC:TC1TGTATCTCC CTAGTGGTACITTAGTAA
TACAGTGCCTGTGGAAAGT
CAATCATATCTGGCGAGG
TAACGCCATTGCAGTGTT
TAGAAAGCCCTCCAAAAA CACCACGTGACGTMAAA
4 ITTFAAAT Ain- CTAG I
AATTGGGCAGGCGGTICATTGATAATICAATAGCA f ATGCTGG C.AGCAGTA CTATAGGGCGTCCAAGGT
TITCAGAGAC.AGATGCTGG
ATGATTCTGATCATGAACGCCIAGGTAGTACCTA I A CT GAAAGATGTACAGG AA
CACAAAGTATTAGTTCCTA
ATTGCCACAGTTTGTAAAT
7 TGTAAATGG TCTCCCAT , TAGTGCTCCITTGGATGIC GG
GATGCCCTACACTCAACAACCCAAA TCTAACTGCAGATGITATG TCGTAATATITTGGATAATT
48 CC.AAATGACTCAGAGAGACTGGATGTGAACTATACT GTOCGCACCCCAATAATTATITGCGGAAATTTGAAG
TGCAAACGTGTATGTATCT TGTGAACTATAGTGAAATA
49 CiTTCGC.AGCTGACGCCGGTTGICCACGACCTACTCAC
ACAGACCAGGATCCCCACACCACTGCTGCCGTGCGA
O C C ATTACCCAGGCAGCCGA
TGICCACGACCTACTCACC
1 Gilt ATCGAAAT AATCC TC
TGCAATATGCTAGAAGAC AACCTGATAAATACTCATG
49 GCACATACATTGCTCCGTCCCfCGCGCCTTCAGACCI ATCACATGAGAACGGCGCGACGTCGCGCGAATTCG
GGATACGTACGCAACAGA
OGCGCCTICAGACCTGA
CAGAAGCAGAGGATTATG TC.AACCCATATTTGATGTIT
GGTAAAACCAACAGGAAA TTGATATCAGTTCCTTGTTG
5 Cf TGlIGG AAG7CT CG G
CAGGATGAACAGTGATCAGTTTGCTGTTGGCATTCA TCAAAAATTGTTGAAACT GTGAGAAATTG AAAG
AAA
6 AAAGAAAGCG 1 rt CCA ill. GTAGC GCG
AACAGCTTATCACCTTCTACTGCTCGCTCTAAATTCT GIATGITCMGATGGAG
'TGTCCAATCTIGCTAIGGA
7 1 ATGGAA 1 TGGAGAC , AATTIG A
AAGCGTGTTTGAATCAGCGGITTCTTCTATGGCCTC TfCAGAACTAGAGC.AAAC
ACCCTATTGAATACAGATA
8 ACAGATATGAGA ' TCC ACA TGAGA
CGAAAACGGTTGTATATA GCCCTAAAATTTAGCAAAC
SO GTIGTTGCATATCCAGCATAATCAAGIGCTGCCT AGA
AACAAGACATCTTAGACGTGCTAATTTTCTACTTCAC GTGCTGCCTAGAATTTCAT
O ATTICA% ACAGCGG
GCGGCTATCCATATGCAG G
ATGGAAGACATGTTACCC AGGATATTGTATTAGACCT
1 . AGACCTGCA GAAATGTICIT TAA CiCA
50 GCACTATAGCTICTACC.ATAAACCAGGACGATTCAGG CCCAACAGGTACAGAAA
TATCAGACCATGTCATACC AAGACCTAACAACGATGG GGACGATTCAGGTACAGA
TCCACGATIGGACGCCATTATGGTTTGAAACAGCCG GAAAG MITT AGGIAGTCC
3 cuvrAIGI IC CGACTGIGCAGGACCTAA AT ATI;
I
50 TGCTCCCGTACACTGTTTGTGAAGGGACATAGAGGG CAGGCACAGCAGGAATATTGGT.ACCAAGTAATGCT
GTCAGGAAAAGGACACA
4 GGA GCTX: GC
AAGGGACATACiAGGGGGA
AACAGTTATTGAACACGG GGCAGACAGTCAGMAAA
SO CITGCTGGICCICAGAATIZIGGGAAAIGTITTITIGA A I GGAAGCAAT AGCCAAGCG
TCAGTACTG TITTC IT AGTG i ATGAACTGICAAA GGAAATG MM.' GAAAG
50 COGITCCP.TACTATACTC.ACi ICGITGTCCGAAGCAAA AACAAGTTATGAAA I
GTGOCAAACAITCEACAGIT f GIAGTGCCACCATIAAAG
7 . AGCAC.A TGCCCCG CT
GTCCGAAGCAAAAGGACA .
SO GTITCCACT GICCACGCiGTCGAGICCAAC:AGICCEST
ACCAGCACCAAAGACGGAACACTITAAACAATTGG
GAGTCCAACAGTCCCCTI
SO 'IGGTAGGGGGTATITITACAACAI CAAACATGCCATT CTGGGAIT I
ATGICACRICACCT AAIGTACAGTIGT AAACATGCC:AITGIAACTG
9 GTAACTGT ATTACACC.AAT TCCTCAAAGGCACCACAT I
51 GGTIG ITAATAGCAGCCACAATAGIATGGATAICTGA MCKIM:TM TCS GT TACT GTCCCCATCAT
TGC IGI GCATTTGIIGTArcirrrrc ATGGA IATCTGAG ITTA IT
GTOTAC
Si CTATATAGGCCCACACGAGGCTCATCTAGTGATAGCG
CCACKITGCAGGTTACAGACC.TCCCCTICATATAC.AG ATTCCTTTAGATACTTTTG
TCATCTAGTGATAGC:GGTC
Si GGAAAAGITAI ATCAGGGCCAGATIGGGGI AACACC
TACTGCACCIAIGGCiAACACC.AAACAGGOCCIGIA CACiATACATAITTAACITC
GGGGTAACACCACAGTTC
Si GCAIGATAAAA TAIGTI GG IGCGAGCCfCCTCCI AAC
CG.GCAGITCTAGACTICTTGCAGCAGITITGITAGC GCGACAGCACAG TAT ATG
OCTCCTCC.TAACCCTGTA
SI GGAACATCI GATTTA ITGG TCTGCAACAGGAT GGCG TCCAGAT TAIT I ACAAATGGC
TGCA.ACATTIGTFCCF CCGCCCTI AGAACTIAT TA
ACAGGATGGCGATATGGT
Si ACAA I GGTA Ill GTIGGCiGIAATCAATGITGGIACT
S
51 TAAGGCAATACCGCACCCTGCC.AAGTTGCTTGCC.AAG
CTTTTATGCACTGTAGCCAACTCTAACAGTAAGACC
6 T . AAAAAAATGTG CAGGTACACATTGCCCTG
CCAAGTTGCTTGCCAAGT
7 CAGG CC AAGCGTCTAGC.CATGGCG
GTGTTGTACAGCCT. CCAGG
Si CTGCGCGGCAACAACTAAACTCAATGGACG1TAACT CCCAGA1TGGGTGTGCGCGTCTTCCACGAGGTTGC
CAAACGTAACACCAACCG AATGGACGITAAGTTCCCG
51 CTGACACC.ATGTGCCAGGGCCGCCGACCTCATGGGA GGCTTTGGAGGACGGGATCAACGAAAGMGTGCCA
CATCGATACCCTAACCTGC
CGCCGACCTCATGGGATA
52 TCGCAGCGCCATACATCGGCC.CCACCATCAAATCCA
CACATCATGCACCTTCCAC
O
CGGTAGGAGTAAGGGCCACCCTTGCGTGCCCTGCGT CAT G TTGCGTGCCCTGCGT
TGAACTGGAGTCCAACAA
ACOTAGITCTCGCCCAGG
52 GccrrGGCCGTAGCIGTCAAGACAGCTCAGGGICIT CCCACTCGGGGTCGCTAACAATCCCGCAIGGCCGA
ACTGGGTTCTTGGCTAGCT
ITCCGA
TGGGGGGAGAACGAGAC TITTGAACTCGACCAGACC
52 C.AGGGGTGATAGCTCTACCCGAGGCATGCAACIGGA CCACTACAGCGTGGCAGATCCATCAAGCCGGIGGA
GGC.ATGCAACTGGACCAG
52 CAGGAGAAAACAGGGOCACAGCTCTGCATTGICTCiG lICTGATGCTGCCTGAGAGGGCGACAACCATGGCG
TCTGCATIGTCTGGC.ATGT
CATGTC CCG , GGTGCCCAAGGCTTCTGG C
52 AATCACTGAGTCGCGAGGCCTATAGCrAGGACCGAG 1 TCTTGCAGTCCTGGTCTGTCCAGCCCTAAGATGGCC GGGCATTGIGGIGGATC.C.
TATACICTAGGACCGAGGC
6 GCTG i AGAAGA A TG
Si CC.CTTATGATGICTC.CGCACGC.GTGTICACGCCCATG
C.CTGITTCGGCC.AGGTIGGGGICTCCACCCCTTTGA
7 GAG TGTT GTGGCCCIAGAGCC.AGTT
GTGTTCACGCCCATGGAG
AGAATTGTGGCGAAGTGC
52 ¨ ETCTGACAACTCAACACCCCCAGGTGCGTGTAGGTG
9 ACGAAGTCTACCGCCTTGGCGCGCCGGC.ATATTCCG CG GGAGGTCCGCTGCTGT
GCGCCGGC.ATATTCCG
ACATGTCCAAGGCATATGG
O TGGC CCG GGCCACACTGGGTTTTGG
C
TCCAACATAGAGGAGGTC
GCCAACAACGGGGGAAAT
53 CCTACCAGTGCGGCCTCTCCGATTACAACCGTTCCCC GGGAGGTTGGGCACATACCGGGCAGTGICAAAC.AT
GACITCAGCTIGGACCCC GATTACMCCGTFCCCCAG
TTCCTATCCCAGACCAAGC ATCGGGTGAGAATTTCCCG
AACACCTCCC.ACTAGTCGAGCACCTAAGGCCrTCTG CTGTCTTTCAGTGGGCAG
GTGATTGTTGGGAGAGTC
t 53 AGGGrITGTTGGGTGGICAGAGCAITGCITC.CCTCAT 1 GATTCGCGACIMGAIGCTTCTTITTGCCAAGGCCC ATC.TCGCTGGACTGICTAC
O GTCCT 1 ACAC , I CAT
TGCT-mccrcA 1 GICCI
TIGGTAAATITGCTCCCGG
6 TGGCGGCGCAAAATCGCCATICTATCGCCCGGAGC ' C C
CATTCTATCGCCCGGAGC
TGCTGAGTGACTTCAAGAC
TICCATTATGICACCGGGG ACAACATCAAGTTCCCCTG
GAGATTGGCTCGAGGGTC
9 . GT GITAG A
CGACCCICGCTAGCTAGI
CAOCTCTTACCGAGACGT AAGCAGCAGGACTACCAA
O AGC AAACAAC GG GC
54 TT GCGGGA.AIGGCTCCGAACCCCCACTCAGCCAGATC
TAACCACATCAGCTCCGIGTGGTGGCCATGAT TOT ITACCACAGAGGAAGCTT
CCCCACTCAGCCAGATC:A
54 GGAGICAAAGC.AGCGGGTATCAAGCGGGTGGAATTI I ACCGAAAAGGACATCAGGGTCGATGACTITGCGGG
ATGGCTTCCAATACTCCCC AGCGGGIGGAATTICTTCT
3 circris 1 CI CC A G
4 C.TACGCCGTCGCICTCAGCGCGGCGCTGAGAGACT AGGAGGACAACCGAGCZCTAGCGTC.ACCTGGGGGA CC
GCGGCGCTGAGAGACT
54 MCCAW"! I GGCAGC:GGCAGTACCTCITTAACIGGGC
ATCGGETTGGITTACGGTAGGCGGTCGGGCATGACi CTIATCGCCCAGGGAGGT
*FACCTCTITAACTGGGCGG
5 . CiGT ACATG A I
.
TCGTAATGAGCAATTCCGG
I AACACACAA TCAA IGGACCCATCTAAGT AGAACCAGTTAAAGAICTC
TATGGGTGIGTICGGGCAAATAC7GTGIAACC1GA T CrTCTUCTGIAGGITITA TIGGATAGTAAGACA
ICC
54 ACCAATAAACAGG1TTGTCCCAAAAGTTACCTCATAC GATGGGGTCITCAAAACTACTC.TTCGTGATCCAGAC
SS CCFGATGICCATGGTATTGICAITAGOAATGLIAGGI
ACrGATCCAGAI ACATACACCAG IGAAGTAT AAGA AGCAA ICiCTAGGTACTCAT
O ACTCATG GAAGTTTGATACCA
TCCACAGGACAGGAGAGA G
SS ACC1AAGCMCAGTGACi CGCTIAGGC TGATGAAAG
GTIGAGAAAAC:GAAACAGACGGTGGGACT MT& GIGCATG ICCAGATTITAA ITAGGCTGATGAAAGATAC
55 TGCCATGIGT AATCGCCCACAAGCCAIGTAIGTI CCA AACrr GCITCAAACCCCAGTG I
AAGCCATGTATG1TCC.ACA
SS GIAGGGTAGrGCICTGGGIGTIATCACAGGGCAAAG
ACA1CAATAGGGCGCACAAATCCITTCCTCr TC11 AA CAAAACICTIG TCAAGATC
3 Cr TTACTGG AGA
TTATCACAGGGCAAAGCT
55 IGGAATAGTAAACACCTGATGTACGTAGICTCAAC.AG
ATTACCCC.ATAAITTGCGAAAAGCACTTGGGTAATA CCIATCCAAATAGAGATIT TAGTCTCAAC.AGGAGG
AC
4 GAGGACA . AGGGCTTC AGCAA A
55 TGTCGTATGTCAGCAAAACAAAC.ATATGCATACATGG GTTGGAGTGTATCGCAGAGGAAGTGAACCCGACAC
ATGCATACATGGTCCCATT
TITCAGCTTTAGTGATCAT AGAAATCATGATGACTTGG
SS CGTCCCACAATGGATGITGTTMAGTGTGTGTATTM TGC.AATTGCAGAACACTTCAATTCCATCAAAGTGCr AMAGTAAACAAAGAACT AGTGTGTGTATTAATCCAT
55 TGIGTTAGAATCTGIGGAGGCTAGACCAATGGCTAG CTAAGCCCCCCAAC.AATCITGTGCAAACATATATCTA
SS GGGGGIGCTACTCAAATTAATAAGTACTTTACCNICA ATGATTATGCAACAAAAACTGGGC.ACCAAAGATCTT
AC1TTACC.AACACTATC1TA
AACCIGCTGTATTGAGIGACAATGCCCTTGTACTIA TAGAGAAACAAGGCCAAG GGAGTTTAACATAAATCCA
O AAAT (CAUCA GAGAATAATGA TT TAATAG
GTCA
56 GGITGCZTAGT1TC.A1ICTCATGTTAAAGICCGAT1A
ACAAAGCATTCGATCAAAMCCCGCACATCAGGATC TAAGG AI GAAT TAAGAAG
AAAGTCCGATTAGGGAAA
CTTACGGIGATGATTIGAT
2 GC TC.ATGGGC.ATAA TGT
TGGATCSACAAGIGTTGG
56 AACAGCCCATGGGTCCAGCITGCAAAGCCGCTATGTC TCAlTGGTCGAGAGAACC.ACGCTCGACCGAAATCCA
AAGTCAATGGCTGAGTCA
3 C CGAAAG , CC
TGCAAAGCCGCTATGTOC
56 ACTGAGCCC.TCCTGCGACAAACCACAAAAGATGCTC 1 TTGCAGATGCAGCAAGTGTGGAGCATAAAAGCGCT GCCACCCGAACATTCAAC
ACCACAAAAGATGCTC.GAC
4 GACT i CGTGGT C T
CGACCTCAATCGTCAAGCACTG
CG CGTAC GACGTCCTCGAAGCCTGA
ACGCGTCTCGGAAAAACG
56 CGATGAGAGCGGAATGICCCGAAACC.AAATICITCC CCGGGGTTGAGTGTGACTGTTCACTGCCAAGCTGG
AAACCAAATTCTTCCGCGC
56 ACC.AGGATACCCCGTGGATTGICACCATGTCCACCCC AAGTGAAAACAACGC.GGCGCGAACGCCTGCAGATC
TGATCCAAAGTGGGCTGT
CACCATGTCCACCCCATG
ATGGAGACTCTCTTCCCTA
TACTGCGACTACTTCAACA
TATTGTGTGAAGAATCATT CCAGAGGAGGAGTTACTG
9 "TTALTGI AGTCG GICA I
TGGAAGCAATTAGTCTTTC
O CTFTCTCC TGCGICC
TGCAGCACTTAGTCCAAG TCC
CGGCTGCGCTATATTMA
57 ACAATCTAGTAATGGCTGAATCC.AAAGAGGTAAAGT 1 AGTAGGATTFTTGGATGAIGCCACTCC.ATCTAATGC GGGAAAGTCGTAMCTG AGAGGTAAAGTAGTATCAT
2 ACTA TCATACAIGA 1 AlITCGC rrr ACATGA
i ACCFACACCAAGACCCCGCAAGTGCTTGCCTCCTCC ATCTTCCCITCTGTTGCCA
3 CG 1 1T.3 , G GI
CTICACCTGICTC:CCCG.
CCAGTAAGAGAAGAAGA AGAACGTCATATCCTCTCT
4 TCFG ' GC.AGGCCC AGAGG G
TGGGAGAGTATTGGTAGC
S CAACGC CACA TTT
TGCAAACAGTAATCAACGC
CGITGTTGAAGGIGTTTG ACTTGGGTATTGGTACGG
TTGTAAATCCAGCTTTTAA ACATTGGTGATCCAGAATA
7 . CAGAATACAT CTCGCG CC TGGT CAT
57 CTGTCTAGAGGAGTTGTCAGCTGGTGATTTGACAGA TATTGCATTTACAGAACAGCC.ACGTAAGCGGCTTAC
GGTGAMGACAGACATAT
8 CA TATTCAM ACCiA AT TTTACAAMCCCGCCITTG TCAAT
ATCAGATGITCATTGTIGGITGTGCCACAIGGTF TA AAACTGGGTGACACFGAA
9 C:ACTGAAA GCTAAATCCC CC ITGGCACIACTGGICA A
GTTTCTTTTATGGCAAGAGGGTACTCCATCTTT GGACATGTGCGATATTGG TTTAGTACTTTTTCAGCGA
O AGCGAGTC ACAAAAC.ATGTG G (SIC
TTTAGAACCTGATAACAAT CCTAAAAATGGATTAGGG
CCATATAAGGGCTIAAATI
2 TTAAATTITTGG GTATAGGTAT TACCIA.U..1C1GAAAAGC TITGG
ATAGCACAAAGTAAAGTGT CTAGGTAAG TACAGTA.AC GG ITTC:FATITGAAGTTFG
3 , AAGITTGTACAC GCCAAAT CGTTAT TACAC
=
CICTAGCAGCGGCCAACT
AAGTGCGT/TCA TGOC TIGAAA MG IGCCC AACCCIAATGTGGTC.T.ACC
S C.CACCG GC A G
58 ACACiGACCICAGGCCiCiGACiGC:GAGGCTA FGT T CCCiC A TGTCACCATCTGA MI CG
CCAGGCTATGTTCCGCCA
SS AGGGCGTACICiTGATGGCTCCGCCATTGGCFGCC.ACT
TGTTCCTTACCCCCGGTCTACCTGGAAATAGGGAGG
7 TT GGGTAC CCTCTCGTCAlTGAGCGG
GCCAITGGCTGCCACTFT
rrACCATMCCACCAGCGGTACITCTGAGC.ATGCT GCTACAAGGITACIGTG6 8 GGGC CX-.C.G TTGCTGCTCCCGCCTAAT GC
58 AAAACCAACACCCGTGGGICA TACGC1CACi itTAGG GGACTGCGATT CGTAAGGCGGGGT7 GTAGGAAACA
TACGC.TCAGTGTAGGCGC
59 CTGCCUCCAACCGCTITACCTITACTGTGCiCGOTCA
GAGGCCIATGAGGGGICT
O
TGCCAGGGACCAC.GTAGGAATGTCGACCCCGCTGAA GC G ATCiTCGACCCCGCTGAA
CTGGGCTGGAGGTGCGCTACGGGGGCAAAAACCG AGACTAA TGGCrCAGAGC "TITTGATGCCAGICAGAGC
2 CA . 671 TGGTGAGCCGGCTCCT
CCGGCCATAACCCACCA
GACTTGCTCCCGCCTCGGTAGC.ACCTCGTATATGC.0 ATACATTAACCCCCCGGCC
59 GCC.AGATTTGCTGTCCGTGCAACCAATAGGCCGACC. F
CTTGACTCAGCCACAGACGTCGACTGCACAACGCCG
CCAATAGGCCGACCTGCC
S
AATGACAACCCGGCGCCCCGGCTTCGCTGCTTTCA G TTGCGTAATGCCTGGCG CGGCTTCGCTGCTTICA
59 GGAGGACCCGACTAGTGGTCTGCTGCGGATGTATGT GGTC.GTTGTICTGGGGTGAGCGTTCiCCGCCCITACI
CTGCGGATGTATGTGAGCT
7 TG rrGcc CCGAGCTCATGCCATTGT
GCGCCACACTGAGAAGTG
TTGTACGAGCTAGTGGAG
8 AGGC AAT GCCATUGGCCACGTACAG GC:
TIGTAIGCAGGTGITGTG
OCTTGGCGCGCTICCT
O TGGGCCTGGTCACGCCAAGGGGCTGGACCTCGTGT GCCGCTGCCICACGICGTAAGCGACC.GOGGTTAGC
CCITCGCCCCCGATGT GGGCTGGACCTCGTGT
60 GGGAGACCCGCGTETTGGTA i LI i IGCCCTCGAACIT
1TACTCCAGCACTGCCCGTCATGGTGAGCTCGGCAG CTITGl.u_ ILE, AACITGAG
1 GAGT TC , CCCTCGGGCTGTTGGA T
TGTGCTITGGC.ITTCTCTCA
AGGTACTACTAAAGCCGG ACTGCTAGTGAC.CAACTGC
60 ATTGGCTGCATATCGACGACAACAATATACCAAAACG ACTTGACGTTCGTAATGGGC1TAAAAAATC:CGTTGG
AACAATATACCAAAACGGC
CATCGCCGGMTACAAA
TACCGAAAGACACAGTGTT
CCTTAGCGTGTGTAAATAT GGGTTGITTATTAAGTTTA
60 AAACTCTTTCATATGCCGTTCCTATGATTCGGGCTATA ACAAACAATGGTAGACCGCGC.ATACGAAACTGTGC
TACATTACCAAGCGAAGG GATTCGGGCTATAGAGTAT
60 ACAAC.ATCCAACGCCGTTTTAGTGGAAACGAAAAAC
GTTGCAAAAACAAAGACAG7TTGAAGGAATACGTT CCCGAA 1 II ili : I ATTGC
GTGGAAACGAAAAACCGT
8 CGTC CAATTAA.AGCTC CT C
TOTTACTGCTATTGGACG
CGATGAACACCGTITTGCACT/sGGCC.AACGMTCC AGTTCGACATACAGTAGA
GGCTATGTTACCTGCACT
s 1 I AT TACiCGC 1 CC , ACT GC
CTAGCTCCTGCAAACTTTCTGGAAGCCAGTTTTTCAT GCCGACTAACAATTATACT
TTTTTCATCGTCGTTACTTC
61 CCTCTAIGGTTGAC.AGCACATITCTTGCTGGGTTATTT
3 AAC_ATCC GATCG A CC
TGTGTAAAGGTAGACAAC CAGAAACTATCCGAC.AGA
TCACCTAAAAACCTGACTC
5 . C GCACGC A
CGITICCTCCTICiCAGAC
ACGAACKGIGT TCCGICG
CACGGGGCGACAACACCGTGIGTACCC
7 CT GC CfCG ICCACAAACi ICGGC
8 I CC; GCCGCICTCACAGCCI CA
TCCTC:CCAGTGACCCT
TGGGCGGGTATCATCGGICTICGGACAGCGTTCCC
"FCTAGGTITGCCKCCCG
AGCTAGAACGATICGCAGITANTCCAGTAITIGTCT CGAGAGCG TCAGT ATI AA
ATIAGATCGATGGGAAAA
CI GAAAAAATTCG ACAGCCTTCT GC AATTCG
TCAGGICAGCCAAAATGCCIGATGTACCATT ATAAAAXT ACAC:CAAGGAA *FTTAGACAAGATAGAGGA
1 . GAAGAGC TGC GC AGAGC
.
62 ATCTGGCCTGC3TOCAATAGGCCAT CAA] GAGGAAGC
CCAICAATGAGGAAGCTG
GGACCAGCGGCTACACIAGAAGCCAAAACfcrr GC TAGACCGG TICTA IAAAA
GAGCAAGCTTCACAGGAG
62 TGCiAATATCGCT GGTGAICC1TTCCCACCAGGGATTA
GIAGCATGACAAAAATCITAGACiCCIGTATTGATAG ACCATACCIAGTAIAAACA
CAC:CAGGC1AITAGATA ICA
TTCCTTTGGATGGGTTATG CCATCCTGATAAATGGACA
S TGGACAGT TAATTGCCTTAC AAC CT
ACAAAAGGAAACAACTCAGGAATC GAGCCATTTAAAAATCTG ATM GCAAGAATGAGGGG
62 CCA lIGCTCTCCAKITACTGICiATAICAGGAPACi ;AC
CTAGIGATITIAACCTGCCACCICTGACAMATCAC GAACAAGTAGATAAAT TA 'FCAGGAAAGTACT
AlliTT
ACTGACP.A1GGCAGCAATTATTCCTGCTT G TATOCIGG TAGCAGTTCAT TACIAAGCAGAAG1TATI CC
9 ATTCCACiC ATTC:CCG CT AGC
TICGTAATAACAAAA TGCCAGI GIAAAACACCATATGI AT
9 GGATGGT CTCITTCTCC G1n:C/4 GGAAAGCTAGGGGAIGGT
63 GCCAAGTATTGTAGAGATCCTACCTATTAGGACACAT AACACCAAAAAAGATAAAGCCAC:CTTICTGGGGCTT
ATTACTITGACTGMTTC ATTAGGACACATAGITAGC
O AGTTAGCCC . GTTCCAT AGACTC CC
63 GCAGAATTC:TTATTATGGCTTCCAC.CTIAGGGCAACA
ACAACTGCTGT1TATCCATTITCAGTCT6TCGAGTAA TMCC:TAGGAITTGGCTC CTTAGGGCAACATATCTAT
TGTAGTGCTAC.AGAAAAAT
CTGGTMGCGATTCTAAA TGGAACAGGACCATGTAC
TGGIGGTAATAGC.AACAATGAGTCCCACTICTCCAA TAPACATEITGGCAGAAAG
CAATGTATGCCCCTCCCA
CAATGACGMACGGTAC
S GTACA GCCAGG
GCAGGAAGCACTATGGGC A
AAGAIGGGIGGCAAGTG TGGATGGCCTACTGTAAG
6 AGGG CTCa GF GG
63 AGCAAGCTCGATGTCAGCAGMAGCATTTCATCACG AGGGACT11CCGCTGCiGGAC1GCAGGAIC1GAGGG
GAGTGGAGGTITEGACAGC TACiCATITCATCACGTGGC
63 CGTCAGAGATTCCCAC.CICGGTCAACGACATGCAGIG ACGACCITCACCGCGACTCCGCCTCGAACATCTCCG
8 Ga TGTC ACGCCTGGGGGATGCT
CAACGACATGCAGTGGCT
GTTTAGGCGCAGGMCCG
O CCCCCTTCAGCACCTACGTGGAGCCGGTCAGCAGCT CGCTCTCAAACAaTCCOCGAGGGCCITTCTGGGGC
GCAGGGCCACGGACAT GAGCCGGTCAGCAGCT
CTTCGAGGGCGTAGTGGC
CGCGACAGGAACCGGTA .. GCGAACTCCACCGAGGT
64 CGCGTACATGGCCAAGCTCCAACACAACAAACTCCC.0 TCCITGTGGGACGAGAACAGCCAAGTTAC.ATC.ACCA GTGAGCACGGGGAGGGT ACACAACAAACTCCCCCTC
TAACGTACATGTICTGCGC
G GATCGA C
AGTACATCCCCGCGTACG
64 GCTCCGEGGACAGTTCTCC.AGGACCTCCGGGACTTC
GGCTOCCITGTTTCGCC GGACCTCCGGGACTTCGA
TCCTGAACACGCTAATGTG
64 CGATACACGGCAACCCCCGGC.TCTCCTCCAC.CCAAAC 1 CTCTCCICCACCCAAACGA
9 CIGGIGTTCGAACTGGGCCACACCGCGTCCICCCiACA CGAGG GICGGCCACGCGOTA
ACCGCGTCCTCCGACA
O CCA TAC .. CGGCCATGGTGCTGCA
CAGACGGAACAGCTCCCA
65 GCAAGGGGGAGCTGCTGATCTAATGGGC.GTAAAGA GAGAGTCCGTACCCGCCCCACATGAACTACGGGCC
ATGGGOGTAAAGACGGGC
CTAAGGCCCACCGICACG
3 GaGCTOCCGACACAACGTaGGGCGGICAGCGAGA CCGAGAACGCGAGGCCCATCGAAACAGCCGCCIGG C
CGCTAACCAGCAGC TCCA
CCACGACATCCGCGGCTITACACGA laGGCGCTGC G1GC TGTCCi TACGCGGGGAA CACGATaGGCGCTGC
6 CG CI ACA AGGTCGCTC.GCTG ICC --CATCATCICTAAGCGCGC:G
65 ACGAAGTGAACCAACTGCCGAATACAGCGTC.LIGAA
7 CACGT GCGCCiACITCGCCCAGTTCCGTAGTaCCGMAGA
CCCAACGCAACGCCTAC "f ACAGCGTCCIGAACACGI
ACGATCGTGCTGCZCGAGCGGGTGGACCACGICA CAC GCTGGITTGGCTCGTCC
CGGGTGGACCACGTCA
65 Cia GACGCGTGC ITTIGITCCGCCAGCGCGTG TAIGA
GGACAAGGCCGGGTCCCGTATCAGCCTCGCAGACC CGCACCAAAACiC:AGACiG
GCCAGCGCGTGTATGAGT
66 AGAGOACGCGTACCICIGCCCGGAAAGACCACTAG aCCCGGAAGGTATTGCTCC.;CAGCTICAIGGAGGG
ATGGTAAACGCAAAGCCI CGGAAAGACC:ACIAGCCC
O CCCG C.AGCC .. CC
1 C.CACCAACGTGCCCTACCCCCAGGGGCAGGACGAA AGG GCCGGGITGAACAGCC
CCAGGGGCAGGACGAA
66 TIGTCiCGCCiAGGGGCTTGGCTIGGGGGATTCGCAG
GAGCGGCCATTGGGTTC CITGGGGGATTCGCAGGC.
66 CCCCGCGGGATCGGATACAAGTAGGTGATAAACAGC CAAGGGGGIGTGGGTCiACGAAGCCCCGGATCCTCG
TAGGTGATAAACAGCGGG
CAGGGCGTTGTAGTGCG
TCAAAGAACCi ICC TGTTG
GGICCATGITCGAGGGCGGCGCCGGCCAATICTICC C GG
56 TcGAAGGCGGAMAGTCGCGCGTCCTG TCrCTGCAG GACCITCA1 GACCGCGCT
6 T C.G T I GTGCGCCGTGCTGTGICTG ATCGCGATCGGGGGAG C
GTCCTGTCTCTGCAGTCGT
AACACCATTACGGCCCIG
TGCAGGCGGTTGTCGAGGCTGCTTCGGCGGCTCCT C C
TGCTTCGGCGGCTCCT
66 CGTCAGCACCTICATCGAC.CTGGTGTAGACCTCCAGG GTGACGTCGGCGCGACTC.AGGGCTACGTGTACTTC
8 GGC ! GAGG CCTTGATCTCGTGGCGC
GTGTAGACCTCCAGGGGC
66 GCCGCACCTCAAGGAGG AAC:TCCAATCAGCGCCCGA
9 AC AGGAGCCCCACGTTCTCGAACAGOTCGCCC.AGCC
TTGAAACTCGCTGACGGC CCAATCAGCGCCCGAAC
ACGCGATGGTCTCTATGTC
O CC CC .. TCGCGCCGTAGTCCAG
TTTGCCCCCCACAACCC
TGGTAC.CGCCTCAAGCC
AAGTGGIGGICCATGACG
AGACGAACTCGAAGGCGG
ACGTCCIGCGGCT CATGAACGACIGGGCCG
G GCTTISTGCGCATGCCCCGTGCGAACGGCGTCCTC
GGACTGTCCCTCGGAGAC ACGAMTCAAGGGGCG
67 CTIC.GCCCACCGATICCTGICCAACAATCTGTCGACA
AGGTGGGTGCTT
GATGCGACGCCTGTGC
GGCGTTCAACGACGCC
67 CLIGCATGCCAATMOCATGGAAACCCACGMCCAGT CACACGGCGTGCCTGAAGAG1TC.AGGTCCCGGAAC
ATGATCGACAGCACGC.TA
ACCCACGCCCCAGTG
68 CGGCGGGATCCATGGCGATATACAGCGCTCCGATCG GCGAGCACCGACMCCIAGATGGGGCACACCGGAA
O G TC CTGCGCGCTGTAAGCA
ACAGCGCTCCGATCGG
CAACCACCTTCGCTCCC
TTGACGTATCTGTGCTCCA
CAGCTCGGCGATGGTCA
OCTIGICGCGGTGGTTC
CGATGGCGTCCACCAGA
S CC CAC GCCCAGGGCGATCCIT
TCCAGACGGACAGCACC
TGCGTTATCACCTCCTCGCTCGATACAACGGLIMG
6 C :CXC GGIGGATGGGGACGGAA
ACTGCGATCACGAAGGGC
7 IGCACAAMI TACGGGGCCICi TCCMCGCCGACAAGA CGCCII:C.CTGACCATGCGT
CGGTCCAAC:AGCACCG C(3cAcrc:cGrcGTc3Tcir FCC ISCGCCGAC.AAGA
C MC TTCCALITTCACGCCCCG
OCCATGTTTTCCGCGGC
GAACCAAAGCCTCTCGTG
CTGCGCTTCAAGCACGG
O G TC CTGCGCGCTGTAAGCA
ACAGCGCTCCGATCGG
CAACCACCTTCGCTCCC
TTGACGTATCTGTGCTCCA
CAGCTCGGCGATGGTCA
OCTIGICGCGGTGGTTC
CGATGGCGTCCACCAGA
S CC CAC GCCCAGGGCGATCCIT
TCCAGACGGACAGCACC
TGCGTTATCACCTCCTCGCTCGATACAACGGLIMG
6 C :CXC GGIGGATGGGGACGGAA
ACTGCGATCACGAAGGGC
7 IGCACAAMI TACGGGGCCICi TCCMCGCCGACAAGA CGCCII:C.CTGACCATGCGT
CGGTCCAAC:AGCACCG C(3cAcrc:cGrcGTc3Tcir FCC ISCGCCGAC.AAGA
C MC TTCCALITTCACGCCCCG
OCCATGTTTTCCGCGGC
GAACCAAAGCCTCTCGTG
CTGCGCTTCAAGCACGG
69 ACGACGATGCATGTTCGGCGCAAAAGATAGGTCGC
o TGGTGAAGGGCGACGAACGGACACCCAGATGCGCCF CGGG TGCFTGGAGGCCCTICTG
GACACCCAGATGCGCCT
CGTGTTCTTGCAATACCCC
ACAMAAAATCACCOMC
69 CGTGGC.ATTCAGAC:AGTACGGGTCGAACCTGT1TTIA
TCGAAMTGTITTTACGGG
2 MiGGC TGCTCAGGCOGAAAACGCCATAALICCGCGCGGCC
CATGACAAOGACGMCCT C
CATMCCAGCCCCCAA
4 AAGTCGACACACCGCMG GCOGAAGGAAACCiGGC
GCATGCMCGCAAAACGCCCCAC:CACAACAMGC GGGGIGGIGGTAGTGGT 434:GGAAGGAAACGGGC
69 CC.CGTGCATGAAGACCT GGATCCACCGGGGTGTTGC
ATGCAGAAGGGGTGCAGGCAACGGACGAGATCGC
CGGICACCiCCCACTAI CA CCACCGGGGTGIT GCA
69 CiGGGCGGGG ITTGITGTGAGAAMTCAGC1 GTCTIC
GGGCTCMCGSGAATC TC
6 GCATCATCCCGMCGCATCCGTCGCC.CCATGCACCIT GC C
GTCGCCCCATGCACGT
ACGCGTGGTAGGICGCTGGGMCCGGGACICGCTG CTCCCCAGAGCC.TGCTGGTTGGOTCGTGCCATCCGA
CCGGGTTTCGTGGCCT GMCCGGGAGCGCTG
69 TGGGATCMCCATMAAGCGATGGGACTCCGCGTC 1TTCGCGGGAAGAACCTIMGOAACACGLI1'CACCC
Il GT CCCS CCTGCTAGTISTCGOGG
TGGGACTCCGCGTCGT
CCATCGGAGGCCCCCCAGA IIGTACGTCT GCTITCG CCCGATCACTG TG TACCAC
9 TACG GGC TTCCAGCCCCC.CAGCA
o TGGTGAAGGGCGACGAACGGACACCCAGATGCGCCF CGGG TGCFTGGAGGCCCTICTG
GACACCCAGATGCGCCT
CGTGTTCTTGCAATACCCC
ACAMAAAATCACCOMC
69 CGTGGC.ATTCAGAC:AGTACGGGTCGAACCTGT1TTIA
TCGAAMTGTITTTACGGG
2 MiGGC TGCTCAGGCOGAAAACGCCATAALICCGCGCGGCC
CATGACAAOGACGMCCT C
CATMCCAGCCCCCAA
4 AAGTCGACACACCGCMG GCOGAAGGAAACCiGGC
GCATGCMCGCAAAACGCCCCAC:CACAACAMGC GGGGIGGIGGTAGTGGT 434:GGAAGGAAACGGGC
69 CC.CGTGCATGAAGACCT GGATCCACCGGGGTGTTGC
ATGCAGAAGGGGTGCAGGCAACGGACGAGATCGC
CGGICACCiCCCACTAI CA CCACCGGGGTGIT GCA
69 CiGGGCGGGG ITTGITGTGAGAAMTCAGC1 GTCTIC
GGGCTCMCGSGAATC TC
6 GCATCATCCCGMCGCATCCGTCGCC.CCATGCACCIT GC C
GTCGCCCCATGCACGT
ACGCGTGGTAGGICGCTGGGMCCGGGACICGCTG CTCCCCAGAGCC.TGCTGGTTGGOTCGTGCCATCCGA
CCGGGTTTCGTGGCCT GMCCGGGAGCGCTG
69 TGGGATCMCCATMAAGCGATGGGACTCCGCGTC 1TTCGCGGGAAGAACCTIMGOAACACGLI1'CACCC
Il GT CCCS CCTGCTAGTISTCGOGG
TGGGACTCCGCGTCGT
CCATCGGAGGCCCCCCAGA IIGTACGTCT GCTITCG CCCGATCACTG TG TACCAC
9 TACG GGC TTCCAGCCCCC.CAGCA
70 CCAAAACAAACCMiCIGGCGICMTCf GTATGICCI CSC.
TGGCGCTCTCFGCCAATCGCAGGAGCCGT MST
GCG CC GCAAMCGCGACTATGC
1 GGTA ACAG GGICACAC.CCTAAGCGC
GCTGGTTGGGTGGGGGTA
ACGAAGTCITCTISTGCAGCCAAAGGTCGCFGAGGC TITACGCf TTCGACCAGAG
ACTIATCCIGGGGAGAAG
70 ACAGCAATACAATACAGCAAACAGTGA itIGGAGGG AATCGTGC:Ci I/TAM-FM
FATACCCCAAT TCT TGCA TC Tat GCATAGAMIGGC
CATCTGGAGGGGAACTIS
70 CCGIATCCCAGIGT1 GCTIGCTICAGIGTCCATIT !SC
CAGGCACTCCTGGTCAACAGTCGCCA TCTGCT CGA TC:AGISTCCAITTISCCAG
4 CAGES CA CA G TCC.C.CACACC.CAGGAGAA G
S TCACAGCA TCATTGT
ACCAGGCCATTCAAAAGC A
70 GCAC.IGGTTTACAATTATTTCTGGCTGCTGCTTTATTC
GIGGISTCTAATCAGCAGACAGACAGATCGITGTC:C TGLISMTATTCTTCTATC
70 GAC.ACCGCTIATTCACATACTGTAGACACTACAAGAC TGGACCATGMIGAAGCTGATTCATACCTCACIAAAA
GTGAAGAATTGTAGCATA
AGCG TGCACA ATOM' AGACACTACAAGACAGC.G
CAAATCATATMGCCAAT C.T. TAACAAATTTTTGGATG
TGAATGGTATAAAGC:TAGAGGTGGCACAGACTCGT CGTC.TCGGACATTTAAGA
GGCAGCCAGTTACCCTAA
TGGCGCTCTCFGCCAATCGCAGGAGCCGT MST
GCG CC GCAAMCGCGACTATGC
1 GGTA ACAG GGICACAC.CCTAAGCGC
GCTGGTTGGGTGGGGGTA
ACGAAGTCITCTISTGCAGCCAAAGGTCGCFGAGGC TITACGCf TTCGACCAGAG
ACTIATCCIGGGGAGAAG
70 ACAGCAATACAATACAGCAAACAGTGA itIGGAGGG AATCGTGC:Ci I/TAM-FM
FATACCCCAAT TCT TGCA TC Tat GCATAGAMIGGC
CATCTGGAGGGGAACTIS
70 CCGIATCCCAGIGT1 GCTIGCTICAGIGTCCATIT !SC
CAGGCACTCCTGGTCAACAGTCGCCA TCTGCT CGA TC:AGISTCCAITTISCCAG
4 CAGES CA CA G TCC.C.CACACC.CAGGAGAA G
S TCACAGCA TCATTGT
ACCAGGCCATTCAAAAGC A
70 GCAC.IGGTTTACAATTATTTCTGGCTGCTGCTTTATTC
GIGGISTCTAATCAGCAGACAGACAGATCGITGTC:C TGLISMTATTCTTCTATC
70 GAC.ACCGCTIATTCACATACTGTAGACACTACAAGAC TGGACCATGMIGAAGCTGATTCATACCTCACIAAAA
GTGAAGAATTGTAGCATA
AGCG TGCACA ATOM' AGACACTACAAGACAGC.G
CAAATCATATMGCCAAT C.T. TAACAAATTTTTGGATG
TGAATGGTATAAAGC:TAGAGGTGGCACAGACTCGT CGTC.TCGGACATTTAAGA
GGCAGCCAGTTACCCTAA
71 GGGGITC.TCTC.CCTCGTGGACAGCGCCCTGGTAAAT AAGGTACIACCIACCGCAMIGGTTTAGGGGCTMIG
CTGITACCTACCGOCTAG
O CT GCGA C
CAGCGCMTGGTAAATGT
CTGITACCTACCGOCTAG
O CT GCGA C
CAGCGCMTGGTAAATGT
72 TGCCTAGCMGCATTGITTAATGATCAAGATTCCTAA AAGGGTTAAACGCGCTAATCCAAC.ATCGGGIGGAC
AGAGAAAAGITCCTCAAA CAAGATTCCTAAAAACATT
2 CiAC TGG G GCiGCA
ITGGCACAGGAC
71 GTCAATTGTGTCCarOGGGCCGTAGGTGACAATGCT GGCGAGGAGGAGACTGCCITATCGCTCCIGTATAG
.. OCG TAGGIGACAATGCIG
GTTAGTCGCCTGGGGCAACGAGCCTACCTGAACAC TTGAGGATGA TCTGCAGC
CGTAGTGACACAGACCTGC
CTATTGAGCCAATTGATGC GAACTAGATGTACTGGGT
GTGA ATCTATTTCCTGCA , AATT GA
AGATTTTACTTAC.C.CCCTC ATAC.AACGOACATTGAACA
71 AGGCiATTTTCAGCATC.ATCAAACTTGATTGAGGTTTC ACTGAC.CAAGGTICAGACITAAGCAGCTGTGMGC
71 ACTCATGTCTGC.ATCCTCAATGTAAAGCAGATGACAA
CATAGGCCTGGGAAACTTGAATCAC.AATATCTAATG AAGCAGATGACAAATGTCC
71 GGTGIGCETTATiKAACTCAAKTiA-WE'ETAACTTTACCA
GAGGAGTTTCAGC:TTGC.CTITAAATTCTCAGGGGTA CTAACTITACCATCAGTCiTT
GTACATACTGTCCATIGCA AAATGTCCCTCTAAGGATG
O AGGATGC TICCTGCCAA AC C
CCAACATIGGCAAACGTCT
CCTACTTTAGGTGACATAA TATGCCAATTAATTTACTGT
AGAGGAGTGTAACAGAG CGAAAGTATGTGATAAGTC
72 AGGAGCTCTCTCATACAGATTC.AT7TGGAAACACAAC 1 TTGCTTTCTCAAATTGAACATTGGACTTTTC.TAGCAT CTGATACCAGAAGAAGAC
4 AGACAC I : AG1A1 AAAGTCAC CA
TGGAAACACAACAGACACT
t ITTGGGTATTGGTACGCAGCAGGIG ATAGAGCTAG TAGAAATA TATAGGAATIGTCAAA TAT
5 AAATATCFGGTG 1 TAATAGG ITT , AACGAG CTGG-TG
AAGAACACGTGACTTATA TATTCAGCAAGTTAAAACA
6 CAACCA ' GGTGCA CAGAC ACCA
TMGGGAACTATTCATACC
72 GGTTTCCTTC.ACATATTCATCCGTAGAATCTGGAAAA TGTIGGACATCCATACTTTGAAGTGCTAACTTTAGG
AATC.CAAATCAGTATCCTCC.AGAGGATCATGTGCTA CCAGATCCAAATAAGTTT ATCCTAATAGAGAACGTTT
9 . TITAGICTG C43ICC:4TC CCITT AGICTO
AGAGAAAAGITCCTCAAA CAAGATTCCTAAAAACATT
2 CiAC TGG G GCiGCA
ITGGCACAGGAC
71 GTCAATTGTGTCCarOGGGCCGTAGGTGACAATGCT GGCGAGGAGGAGACTGCCITATCGCTCCIGTATAG
.. OCG TAGGIGACAATGCIG
GTTAGTCGCCTGGGGCAACGAGCCTACCTGAACAC TTGAGGATGA TCTGCAGC
CGTAGTGACACAGACCTGC
CTATTGAGCCAATTGATGC GAACTAGATGTACTGGGT
GTGA ATCTATTTCCTGCA , AATT GA
AGATTTTACTTAC.C.CCCTC ATAC.AACGOACATTGAACA
71 AGGCiATTTTCAGCATC.ATCAAACTTGATTGAGGTTTC ACTGAC.CAAGGTICAGACITAAGCAGCTGTGMGC
71 ACTCATGTCTGC.ATCCTCAATGTAAAGCAGATGACAA
CATAGGCCTGGGAAACTTGAATCAC.AATATCTAATG AAGCAGATGACAAATGTCC
71 GGTGIGCETTATiKAACTCAAKTiA-WE'ETAACTTTACCA
GAGGAGTTTCAGC:TTGC.CTITAAATTCTCAGGGGTA CTAACTITACCATCAGTCiTT
GTACATACTGTCCATIGCA AAATGTCCCTCTAAGGATG
O AGGATGC TICCTGCCAA AC C
CCAACATIGGCAAACGTCT
CCTACTTTAGGTGACATAA TATGCCAATTAATTTACTGT
AGAGGAGTGTAACAGAG CGAAAGTATGTGATAAGTC
72 AGGAGCTCTCTCATACAGATTC.AT7TGGAAACACAAC 1 TTGCTTTCTCAAATTGAACATTGGACTTTTC.TAGCAT CTGATACCAGAAGAAGAC
4 AGACAC I : AG1A1 AAAGTCAC CA
TGGAAACACAACAGACACT
t ITTGGGTATTGGTACGCAGCAGGIG ATAGAGCTAG TAGAAATA TATAGGAATIGTCAAA TAT
5 AAATATCFGGTG 1 TAATAGG ITT , AACGAG CTGG-TG
AAGAACACGTGACTTATA TATTCAGCAAGTTAAAACA
6 CAACCA ' GGTGCA CAGAC ACCA
TMGGGAACTATTCATACC
72 GGTTTCCTTC.ACATATTCATCCGTAGAATCTGGAAAA TGTIGGACATCCATACTTTGAAGTGCTAACTTTAGG
AATC.CAAATCAGTATCCTCC.AGAGGATCATGTGCTA CCAGATCCAAATAAGTTT ATCCTAATAGAGAACGTTT
9 . TITAGICTG C43ICC:4TC CCITT AGICTO
73 TTTCTGTAAAAATC.AGCAAGCTGTTCTCTTTACAATGT
GGICTTAGCCTGGTTIGGAGAAGCTCAAACTC.AAAC ATIGTAAATACTITCGTGT CTUTTACAATGIATATITT
O ATATITiGIGGC CGTGC GTC1 GTGGC
73 CCGTTGCGCACTAAATAGAAGTGATGGCAATG !TAG
CIGGAAAGGITIGTGIAGACAGITGATATCACGCA TAATICGTTGCTATGGITC.
AIGGCAATGTFAGATCTIG
1 Arcrrerr AAGTAGGT C IT
73 GAAGAGTACGAACAGCAGTAGCACACCTGTTGTCAC TGCTGITTGCAGA.ACTTGGTGATCIGCCATGGCGAT
CCTGAAGAGGAGGAGCA
2 Acrr AG A
ACACCIGTIGICACACIT
TGGTCTTCTGTAACCTG TATGTIGTGACTGAAGCT ATGCGTAGATITTAATTCT
73 ATAGCGACT ACCIGAGAAGIACT1 TT AAACITICAGCi . AGAAACATCTG
VGGIGGAAAACCAC TACACCGTAA GTGAAGGAAAGAGCAAA TT AAACTITCAGGACAG IC:
73 IGCTGAAGCTGITCGATGATCTAATA I GGTGAG ?TIC ACM I
GGGCATITGACAATGACA.CATCMCAGCT AA TAAACG TICAATG ICTAAI ATATGGICi AG
TITCCATCA
5 . CATCATGG AGCATAAT GTAGC TGG
.
GGCGTCGAAACGC7C1 f3AAAC:ATTGGTGTATTTG TETCCAAATAAATT ACCTGI
6 TACCTGTC.ACTG GTCAGA CGTA CAM
CMGCTACACCAAACGGA GCiGAACCCACCAAAAGGA
CCiCGACTICAACGACITCAAGCAGMGCAGGACCE AGACGATCAGTTICCCCEG
GCCAACTGCTGTCCCAAC
73 AGCGCGOTTGATCTATACATTMAAACCICCAAAAC AAACCiGCAC.ACTGCTGAAAATATCAGGAGCICiCATG
AGTTCTGTTAATCAAAGA TTAAACCTCCAAAACACAC
GGICTTAGCCTGGTTIGGAGAAGCTCAAACTC.AAAC ATIGTAAATACTITCGTGT CTUTTACAATGIATATITT
O ATATITiGIGGC CGTGC GTC1 GTGGC
73 CCGTTGCGCACTAAATAGAAGTGATGGCAATG !TAG
CIGGAAAGGITIGTGIAGACAGITGATATCACGCA TAATICGTTGCTATGGITC.
AIGGCAATGTFAGATCTIG
1 Arcrrerr AAGTAGGT C IT
73 GAAGAGTACGAACAGCAGTAGCACACCTGTTGTCAC TGCTGITTGCAGA.ACTTGGTGATCIGCCATGGCGAT
CCTGAAGAGGAGGAGCA
2 Acrr AG A
ACACCIGTIGICACACIT
TGGTCTTCTGTAACCTG TATGTIGTGACTGAAGCT ATGCGTAGATITTAATTCT
73 ATAGCGACT ACCIGAGAAGIACT1 TT AAACITICAGCi . AGAAACATCTG
VGGIGGAAAACCAC TACACCGTAA GTGAAGGAAAGAGCAAA TT AAACTITCAGGACAG IC:
73 IGCTGAAGCTGITCGATGATCTAATA I GGTGAG ?TIC ACM I
GGGCATITGACAATGACA.CATCMCAGCT AA TAAACG TICAATG ICTAAI ATATGGICi AG
TITCCATCA
5 . CATCATGG AGCATAAT GTAGC TGG
.
GGCGTCGAAACGC7C1 f3AAAC:ATTGGTGTATTTG TETCCAAATAAATT ACCTGI
6 TACCTGTC.ACTG GTCAGA CGTA CAM
CMGCTACACCAAACGGA GCiGAACCCACCAAAAGGA
CCiCGACTICAACGACITCAAGCAGMGCAGGACCE AGACGATCAGTTICCCCEG
GCCAACTGCTGTCCCAAC
73 AGCGCGOTTGATCTATACATTMAAACCICCAAAAC AAACCiGCAC.ACTGCTGAAAATATCAGGAGCICiCATG
AGTTCTGTTAATCAAAGA TTAAACCTCCAAAACACAC
74 TGG1ACAG 7 CTCCGGTAAAGGTA1 AGGTCCGGAAGA GTGGTACAACCCACTGITAAACCI
GTCIAGAGGGAC AGTAITATATACTIAGGTG
O GGATCAG ACCAAAT GGTTGG
GGTCCGGAAGAGGATCAG
74 1CCGCTGCATCTGTA 1 ATA Cr i CATCTI CCI GATCCAG
AACTCIAOGGGCTACAGGAGGAAGGTG IAATITCT
1 CACTI AATAAGGCT GGAACCCGCTATTCiTACC
CTTCCTGATCCAGCACTT
TCTITGCAMGAACIAGCACACCCCATAAAGTIcra ACITTCATAAGGACTMTG TCCGGACTTFAATGTAITT
TGC1CCAGAICCCGCCITFACGTATAI11'CCGCIGGG "ITTGANITTGAAAAICCT-G
74 CCTCCCCC:TAATCCATCTACTATAGGACCCTGATGTOT
TTGATGTTTTTGGTGACTTC.AGTGACTTCACGAAAC TATGATTTAAGCCCCATCO
4 CTGATG . ACATCATCC C
GACCCTGATGTGTCTGATG
C:AGGGTAAGAGGTATCCA
TGATTGTCCTCCAATTCAG
74 TCACAGTTGAACATTCACAGTACATACTGTGAACTAG TGCETTAATTCTACTGACTC.ATGGGAGCTGACAACA
74 TCTGAAC:TTGCACTTACCTITACAACC:TAGGGCTACTT
ACIACTGACTCAMTC.TGGCAGCACTCCCGGTACATG
CCTAGGGCTACTTTCCGA
TGICTTTTCTATTGACAAT AAGGAATACAAAAGACTA
GTCIAGAGGGAC AGTAITATATACTIAGGTG
O GGATCAG ACCAAAT GGTTGG
GGTCCGGAAGAGGATCAG
74 1CCGCTGCATCTGTA 1 ATA Cr i CATCTI CCI GATCCAG
AACTCIAOGGGCTACAGGAGGAAGGTG IAATITCT
1 CACTI AATAAGGCT GGAACCCGCTATTCiTACC
CTTCCTGATCCAGCACTT
TCTITGCAMGAACIAGCACACCCCATAAAGTIcra ACITTCATAAGGACTMTG TCCGGACTTFAATGTAITT
TGC1CCAGAICCCGCCITFACGTATAI11'CCGCIGGG "ITTGANITTGAAAAICCT-G
74 CCTCCCCC:TAATCCATCTACTATAGGACCCTGATGTOT
TTGATGTTTTTGGTGACTTC.AGTGACTTCACGAAAC TATGATTTAAGCCCCATCO
4 CTGATG . ACATCATCC C
GACCCTGATGTGTCTGATG
C:AGGGTAAGAGGTATCCA
TGATTGTCCTCCAATTCAG
74 TCACAGTTGAACATTCACAGTACATACTGTGAACTAG TGCETTAATTCTACTGACTC.ATGGGAGCTGACAACA
74 TCTGAAC:TTGCACTTACCTITACAACC:TAGGGCTACTT
ACIACTGACTCAMTC.TGGCAGCACTCCCGGTACATG
CCTAGGGCTACTTTCCGA
TGICTTTTCTATTGACAAT AAGGAATACAAAAGACTA
75 GTACTGCTCTC ACACCTTGGATATGGCCCATGCTACT
TACIGGAGTGAGCAAGAAGAATTCTICCITTCTGCT GAGGACCATTAGGAAMA
O CAAGA GAATAAAGC CTC
GGCCCATGCTACTCAACIA
75 AC.C111113TCACTAGGAAGGTC1ATCTCTGATAATC1 lICAGGATGTAAGCMCAACATOCITGITATGGCC
CAAAAGAAAAAATTC:ACC ITICTGACIAATCTTGAAGC
GAAGGGITTAGTGTTTGTGAAAACGCCACATCAATT CAATGTATTITcrtGGAGI GI CATGGAATAGGGirf AA
CTAGGCMCTGCTAGGTA
3 CTAGGTAT TCAAGCAATGA , CGTGTTGIGTCCCTTGIT T
AATGATCTAGMATAAACIC
75 GACTCAACTCAGCAACAGTCTGTAAGCGCCC.TAMAC 1 C.AAGGCTICAAGCAGTTTCAATTCIGTCATCGAATA CAGAGGATTGTAACACAG
GAA ' AGCGTCT CIA TAAGCGCCCIAAMCGAA
75 GCAGAAATATGATITCCCAGTGTCACTSTOTACAGG AATACtailittitGGGGGAAAAGT711TAGTATCTT
ATCATTTCTCATTGCATTG CMCTTACAGGGTGTTCC
75 GGATCTTAAATCAGTITCTCC.CCTTCACAGGAAACTCT
ATTGGGAGCTGriTTAGAAAAGAGCCCAAGTCTATT CAGITAAACCTGGICTIAG CACAGGAAACTCTGACAG
ACTACAAGAGGCGGTCAG
CGGCCCACAATCTTCGATC
GAATTTGTATAGACACTGT
TACIGGAGTGAGCAAGAAGAATTCTICCITTCTGCT GAGGACCATTAGGAAMA
O CAAGA GAATAAAGC CTC
GGCCCATGCTACTCAACIA
75 AC.C111113TCACTAGGAAGGTC1ATCTCTGATAATC1 lICAGGATGTAAGCMCAACATOCITGITATGGCC
CAAAAGAAAAAATTC:ACC ITICTGACIAATCTTGAAGC
GAAGGGITTAGTGTTTGTGAAAACGCCACATCAATT CAATGTATTITcrtGGAGI GI CATGGAATAGGGirf AA
CTAGGCMCTGCTAGGTA
3 CTAGGTAT TCAAGCAATGA , CGTGTTGIGTCCCTTGIT T
AATGATCTAGMATAAACIC
75 GACTCAACTCAGCAACAGTCTGTAAGCGCCC.TAMAC 1 C.AAGGCTICAAGCAGTTTCAATTCIGTCATCGAATA CAGAGGATTGTAACACAG
GAA ' AGCGTCT CIA TAAGCGCCCIAAMCGAA
75 GCAGAAATATGATITCCCAGTGTCACTSTOTACAGG AATACtailittitGGGGGAAAAGT711TAGTATCTT
ATCATTTCTCATTGCATTG CMCTTACAGGGTGTTCC
75 GGATCTTAAATCAGTITCTCC.CCTTCACAGGAAACTCT
ATTGGGAGCTGriTTAGAAAAGAGCCCAAGTCTATT CAGITAAACCTGGICTIAG CACAGGAAACTCTGACAG
ACTACAAGAGGCGGTCAG
CGGCCCACAATCTTCGATC
GAATTTGTATAGACACTGT
76 TGCTICTGCATGCACACTTGCAGTCCCTATGAACCTCC AGAGAGCTCACTACCTACAGAATCCAATITCACCTG
AATGAAACAGCAATCTTA
O T TAATATTAGCATC GACAT
CAGTCCCTATGAACCTCCT
76 GGC.GGGATITICAAATTCAAATTGAGTTACATTCAAC.
CTGGTGATGTAAGTCTTGAATTTGCAATCTGGATCA GTTACATTC.AACAAATCAG
TGGGAACAAAAGGAGCG
TTAGTACCAGAAGCGGTAC
i 'TIGGAACTCAGTTACATTITCATGCITTCATAGTAGG 'TGTITTGATG TM GCAGA
3 GAM 1 GATGGICIAC , GCAAACGCAP.ACGCAATA IG
CCAATATCGAGTGTTTAGG
4 GTTIAGGC ' ATAAAGGATGACC TCCTAAGGTGTCTGCATG C
TATTGGGATGTTGCTAAAC
CAGACTTCCTTAAAATGG GATGITTAIGGCGATMAT
GCTCACTTAAATGTTATGA GAATGGCAATTAGCTITTG
7 . Gal-1167G ACTAACCAAA ATa: .16 76 GGCTATGTTCCAAACAACITACCITGCTGGAC.AGGTA TCTCTCGAACGGAAGCGGTACGCCAAAATACCAGA
ACTCGACCITATAAGTGTG GCTGGACAGGTAGTATTGT
GIGCAGAAATIGTAAGTCATGAAGGCAAATTAACA GITGCTGIACATG ITTIGC
9 'FGAA GGiCAATACCAAAT T
TCGTIGGAIGATGCTGAA
AATGAAACAGCAATCTTA
O T TAATATTAGCATC GACAT
CAGTCCCTATGAACCTCCT
76 GGC.GGGATITICAAATTCAAATTGAGTTACATTCAAC.
CTGGTGATGTAAGTCTTGAATTTGCAATCTGGATCA GTTACATTC.AACAAATCAG
TGGGAACAAAAGGAGCG
TTAGTACCAGAAGCGGTAC
i 'TIGGAACTCAGTTACATTITCATGCITTCATAGTAGG 'TGTITTGATG TM GCAGA
3 GAM 1 GATGGICIAC , GCAAACGCAP.ACGCAATA IG
CCAATATCGAGTGTTTAGG
4 GTTIAGGC ' ATAAAGGATGACC TCCTAAGGTGTCTGCATG C
TATTGGGATGTTGCTAAAC
CAGACTTCCTTAAAATGG GATGITTAIGGCGATMAT
GCTCACTTAAATGTTATGA GAATGGCAATTAGCTITTG
7 . Gal-1167G ACTAACCAAA ATa: .16 76 GGCTATGTTCCAAACAACITACCITGCTGGAC.AGGTA TCTCTCGAACGGAAGCGGTACGCCAAAATACCAGA
ACTCGACCITATAAGTGTG GCTGGACAGGTAGTATTGT
GIGCAGAAATIGTAAGTCATGAAGGCAAATTAACA GITGCTGIACATG ITTIGC
9 'FGAA GGiCAATACCAAAT T
TCGTIGGAIGATGCTGAA
77 GCAACAAAAAAAAGCCTTACACCTCTGGAGCCAGCA AACTTCTGCTTCAATTCGCACAAGACAGATCACAGG
O AATC:CTT CAATT TCGCCTGA
TATTGAACICA TGGAGMAGCAAATCCIT
ACAACAAGTCCCCAATA GAAGAAGCAGACTGTGTG AGITTAACAGCMAGATG
77 ATICATTAT GGCCIGTICCCATGG 1 GGAATt GTAGAAG CSCATAACACTGCG
77 GAT GG1 GTTICGGAACTGATCGGG ICCrrICCACI KT TCAAGAGGAAGCT
AGGGATCCACACATIAMICTAT
3 , CTC TCCICCAACA GAGAGACCCAGAGCCAAA
GGTCCTCTCCAGTCTCTC .
77 TT GCTOTTCCACC-AATIGGT1TA1TTGC.;CAATITAC.iGA
GCAAAGGTCCTGAAATAALIGTGAAAGGGTAIGAC ACAAATATITGGGAGI AT MGGCAAMAGGAATAG
CAATCAAGGGCAACGAG TCTGGAT TIC CAGGTGATGITGIGGGIT AGAAGAAATTGAATTAGG
TOCCCTGGA ITATAAA CAGCCAACTAA FATATATT
6 CAGTOG CITTGCiGATCT MC-CG
GATCGCCITTTAACAGTGG
ATGGAGCCIAAGCAAAATCAATTCITCCCAATCiTTCT TCC.ATTAGGAATAGGTATT
AATTAGITGATGTAGAAAA
GTIAAATGGGAGCGCAGATITITF GGAAACCCIAAATCTGAA TCCITATAAAGACCIAAAG
8 CC.TAAAGITTTGG AGAAATTGAAGT AAAGC MTGG
AT1TGGCCAGTATTAATACCCACACAGGT TGTAAAG Tr: ITGTACITGAATAI GTE.
ATTICCATCAAGCAATACT
9 ACTGAC CGTCC.AG TTAGGC GAC
O AATC:CTT CAATT TCGCCTGA
TATTGAACICA TGGAGMAGCAAATCCIT
ACAACAAGTCCCCAATA GAAGAAGCAGACTGTGTG AGITTAACAGCMAGATG
77 ATICATTAT GGCCIGTICCCATGG 1 GGAATt GTAGAAG CSCATAACACTGCG
77 GAT GG1 GTTICGGAACTGATCGGG ICCrrICCACI KT TCAAGAGGAAGCT
AGGGATCCACACATIAMICTAT
3 , CTC TCCICCAACA GAGAGACCCAGAGCCAAA
GGTCCTCTCCAGTCTCTC .
77 TT GCTOTTCCACC-AATIGGT1TA1TTGC.;CAATITAC.iGA
GCAAAGGTCCTGAAATAALIGTGAAAGGGTAIGAC ACAAATATITGGGAGI AT MGGCAAMAGGAATAG
CAATCAAGGGCAACGAG TCTGGAT TIC CAGGTGATGITGIGGGIT AGAAGAAATTGAATTAGG
TOCCCTGGA ITATAAA CAGCCAACTAA FATATATT
6 CAGTOG CITTGCiGATCT MC-CG
GATCGCCITTTAACAGTGG
ATGGAGCCIAAGCAAAATCAATTCITCCCAATCiTTCT TCC.ATTAGGAATAGGTATT
AATTAGITGATGTAGAAAA
GTIAAATGGGAGCGCAGATITITF GGAAACCCIAAATCTGAA TCCITATAAAGACCIAAAG
8 CC.TAAAGITTTGG AGAAATTGAAGT AAAGC MTGG
AT1TGGCCAGTATTAATACCCACACAGGT TGTAAAG Tr: ITGTACITGAATAI GTE.
ATTICCATCAAGCAATACT
9 ACTGAC CGTCC.AG TTAGGC GAC
78 TGCATGTGTCIAAATCTICTMCTCGACANIACAGT AAAACGTCAACAAARGCCAGGIATCTTCCCATGITC
AGGGGTACITTTCCAAAA CGACAAAACAGTCAAATCA
O CAAATCAAC TAGGTG TGT AC
All CC7 AACiTTGATGGAAACGTMGICAAAGETG CGTCCGTAGAAATGAAGG
ATTCGTTATGGCTACCGATGAAATAATAGGGATCG A GCATTTGITTTGTATTGCA
AAGAAGATCIGTGGATAC.0 2 GGATACCTC . GCTGGG GAA It 78 GCAGGCTAGGACACGGIC.AC:CCCTCCACTAGTTTGGT
GGCGGCCAACCCAGCTTATGCATGCGAGTCTTCACA GCGTTCGATCAGGTGGAT CCCTCCACTAGITTGCITCG
73 GTGAATTCAGTGCTGG1GCCTCCATCTAAGTTCACAG CGAATG1TGAAGC.ATGTGG1TACTATGGTGGAATT
CTGGACAATCTTTTTCAAT TCCATCTAAGTTCACAGAG
TTACACATTAGATAGCAA
AAAGCTACTTCCAAAGGCT
78 TGTTCiTAGATGGCGTCGCTGATCCAGGAGATCGAGG TGGACGGAACKTMTAGGCGTCCTCAAAITTATAA
GTTTCAGTTAAATACAAGT ATCCAGGAGATCGAGGTA
TCCAATAGCTCCACTTAAT
ACCCACTGGATCTTCTCC
AGCTCAGTACTACATCCIGIGTACA AACACAGGAACAAATGAT CAAATTACCTCATACCACT
8 TACCACT GC TGAGTGAAA AATGTC GC:
GTGGTCCAGAAGATAGAA
AGGGGTACITTTCCAAAA CGACAAAACAGTCAAATCA
O CAAATCAAC TAGGTG TGT AC
All CC7 AACiTTGATGGAAACGTMGICAAAGETG CGTCCGTAGAAATGAAGG
ATTCGTTATGGCTACCGATGAAATAATAGGGATCG A GCATTTGITTTGTATTGCA
AAGAAGATCIGTGGATAC.0 2 GGATACCTC . GCTGGG GAA It 78 GCAGGCTAGGACACGGIC.AC:CCCTCCACTAGTTTGGT
GGCGGCCAACCCAGCTTATGCATGCGAGTCTTCACA GCGTTCGATCAGGTGGAT CCCTCCACTAGITTGCITCG
73 GTGAATTCAGTGCTGG1GCCTCCATCTAAGTTCACAG CGAATG1TGAAGC.ATGTGG1TACTATGGTGGAATT
CTGGACAATCTTTTTCAAT TCCATCTAAGTTCACAGAG
TTACACATTAGATAGCAA
AAAGCTACTTCCAAAGGCT
78 TGTTCiTAGATGGCGTCGCTGATCCAGGAGATCGAGG TGGACGGAACKTMTAGGCGTCCTCAAAITTATAA
GTTTCAGTTAAATACAAGT ATCCAGGAGATCGAGGTA
TCCAATAGCTCCACTTAAT
ACCCACTGGATCTTCTCC
AGCTCAGTACTACATCCIGIGTACA AACACAGGAACAAATGAT CAAATTACCTCATACCACT
8 TACCACT GC TGAGTGAAA AATGTC GC:
GTGGTCCAGAAGATAGAA
79 1 GCGTICITGTITITGAITTCTGIGGAIGTAGAGAGGT GCTGGCCTOGATAA
ICAC:AGGAGACTIGAAAGGIT GCATTICAATGGTICAGA
O TOCTCG AATMCCAATC AACA
GATGTAGAGAGCTICCTCG
1 CCM CTGG , TCC
AGGCAATGTAIGTGCCTC
79 TGCTICC.ACATGUTGGCTCTGATGTTCATACCACGAT 1 TGGAITCCAAGAGCACCTAGAGTCTGGGGTAGTrA CATTCAGAGTGGTTAATG
TGATCITICATACCACGATT
2 TGT i GTGCG AACA GT
79 GGAGTAGTAAACACCGGATGTGCCTACTGGAGC.ACA ACAGGAAGTTCTATCCCATCATITGAGTACGAGCTA
TTAGTAGCC.CCTTACCCAA
3 TGGAG CCCTC.AA A
CTACTGGAGCACATGGAG
79 GCAAAAC.AGACATAACCGCCICTAGAT GTATACATG
79 CGACITCATGTATGGATGCTATITCiTAAGCCAGGGAA AATCCCACTCAAGAGAAAAGAGAGOTGTGACATTT
ACTCATAGAATGGATCAA
AAGCTTG GTTCTAACCAC AAACAAG
AAGCC.AGGGAAAAGCTTG
AATTACATGCAGTTTAAG AAAGAACTGAGCCAGTGT
TGGCTAGTTTAGACAATA GTTGMACTAGTAACTTT
CTIACTCAGGAAAAATAT
CACCACAGAGAAAGGGGA
CATATC.AGAGGATTTGGA CrACITTAGTAGTGCACTC
ICAC:AGGAGACTIGAAAGGIT GCATTICAATGGTICAGA
O TOCTCG AATMCCAATC AACA
GATGTAGAGAGCTICCTCG
1 CCM CTGG , TCC
AGGCAATGTAIGTGCCTC
79 TGCTICC.ACATGUTGGCTCTGATGTTCATACCACGAT 1 TGGAITCCAAGAGCACCTAGAGTCTGGGGTAGTrA CATTCAGAGTGGTTAATG
TGATCITICATACCACGATT
2 TGT i GTGCG AACA GT
79 GGAGTAGTAAACACCGGATGTGCCTACTGGAGC.ACA ACAGGAAGTTCTATCCCATCATITGAGTACGAGCTA
TTAGTAGCC.CCTTACCCAA
3 TGGAG CCCTC.AA A
CTACTGGAGCACATGGAG
79 GCAAAAC.AGACATAACCGCCICTAGAT GTATACATG
79 CGACITCATGTATGGATGCTATITCiTAAGCCAGGGAA AATCCCACTCAAGAGAAAAGAGAGOTGTGACATTT
ACTCATAGAATGGATCAA
AAGCTTG GTTCTAACCAC AAACAAG
AAGCC.AGGGAAAAGCTTG
AATTACATGCAGTTTAAG AAAGAACTGAGCCAGTGT
TGGCTAGTTTAGACAATA GTTGMACTAGTAACTTT
CTIACTCAGGAAAAATAT
CACCACAGAGAAAGGGGA
CATATC.AGAGGATTTGGA CrACITTAGTAGTGCACTC
80 CTIGGTTITGGTGGGIGTCiTTTAATIGTATCTAAACA 1 ATTGTATCTAAACAAAAGG
O AAAGGTGAGAG 1 riGICATTGA AACA
TGAGAG
AAATCGAAACAAACAAATTIGCAGCGAAATCTGAA T AGGTACTGATCCAAAATG
I I GCT 1 ACATGAAGCA LAC , GAAG All CCTGITCTCAGCGGTCCAAATGGGTCCTCAATGCTT AAGCTGTCTCAAATGTCCA
GAAGTAAATGCTAGAATTG
2 AATTGFACC ' AAT AA AACC
GCTGGATAGAGCTTGATG
TGTATGGTITCATC.ATMA ATCCCACTIAAGGAATGAC
AATGGGGAATGGAGATG
5 . ICA CTTIGGG AG CGITGTCfCCT
CCAG ICA
AAAGAACTAAGAGATTTG
80 CATGITrowcaTTCGACCIITTCCAAGCACAG TCCA
GGCCCCGTTCACTICAGAAACTAAGATCTGCATGGC AACAGAAATGGACCAAOG CAAGCACAGTCCAT TRIGG
7 'ETA ICCAA CC A AA
ACATG1TGGAGAGAGAACTGGTCGATATACACGCT ACATCAGAGICACAGTf AA
AGAAATGATGATGITGAT GCTGCTAGAAATATTGTIA
9 Ci ITAGAAGA ITC; TC CAGAG GAAGA
O AAAGGTGAGAG 1 riGICATTGA AACA
TGAGAG
AAATCGAAACAAACAAATTIGCAGCGAAATCTGAA T AGGTACTGATCCAAAATG
I I GCT 1 ACATGAAGCA LAC , GAAG All CCTGITCTCAGCGGTCCAAATGGGTCCTCAATGCTT AAGCTGTCTCAAATGTCCA
GAAGTAAATGCTAGAATTG
2 AATTGFACC ' AAT AA AACC
GCTGGATAGAGCTTGATG
TGTATGGTITCATC.ATMA ATCCCACTIAAGGAATGAC
AATGGGGAATGGAGATG
5 . ICA CTTIGGG AG CGITGTCfCCT
CCAG ICA
AAAGAACTAAGAGATTTG
80 CATGITrowcaTTCGACCIITTCCAAGCACAG TCCA
GGCCCCGTTCACTICAGAAACTAAGATCTGCATGGC AACAGAAATGGACCAAOG CAAGCACAGTCCAT TRIGG
7 'ETA ICCAA CC A AA
ACATG1TGGAGAGAGAACTGGTCGATATACACGCT ACATCAGAGICACAGTf AA
AGAAATGATGATGITGAT GCTGCTAGAAATATTGTIA
9 Ci ITAGAAGA ITC; TC CAGAG GAAGA
81 CCCGACC:ATIGTGAA1TCCICATAGGCAACCTCCAAA CGAAGAGCAACAGCCA I
TCIAACCCACTCACTAT CA AAAGGGAAGAAGAAGTG
O CATTG GTTGG
CTC AGGCAAC.CICCAAACATTG
GITICAAAACIGGGGAATIGAACCIGTC:AGGCAA TA TTIGAACTTGITCAACAGA
I , CC TTCCGATC GC
AAACCAGCGGCTAAATCX: .
81 CAGCTITTCTGTICCCTGIGITCAGAGGGGAAACGIA A TTCATCG 1 CCAIGATGI GGGAATr GATAIGTGITA
CAGAGGGGAAACGTACTC
81 CATCCTACTOGC1ICGGCGCGTACTGGGAAC:AT1-1G
CICTICTCIAACIGIGAATGIGAGGACAGGAGAAIT TA TICCA.ACIAG AT GCGTG
CGTACTGGGAACAITTGAC
81 GCTACCAAATCAAAITICTC ITGCAACC:AT TGCAI CAA AG IGAGAGGGTAITGCiAGAGGCAAT
rGT TACICTA CTITGCAIGATCITTIGGCi ACCATTGCATCAAATITAC
81 CGAGACTGTAGAATCACAATCACTTATTCiTATGGATG
GACGATGAGAATTTICAAGAACAGGACICCTCTGCA CATGGTITATTGTAAATG A
ATTGTATGGATGACATAGA
S ACATAGAGACT CACTGTA AGCAG GACT
TCIAACCCACTCACTAT CA AAAGGGAAGAAGAAGTG
O CATTG GTTGG
CTC AGGCAAC.CICCAAACATTG
GITICAAAACIGGGGAATIGAACCIGTC:AGGCAA TA TTIGAACTTGITCAACAGA
I , CC TTCCGATC GC
AAACCAGCGGCTAAATCX: .
81 CAGCTITTCTGTICCCTGIGITCAGAGGGGAAACGIA A TTCATCG 1 CCAIGATGI GGGAATr GATAIGTGITA
CAGAGGGGAAACGTACTC
81 CATCCTACTOGC1ICGGCGCGTACTGGGAAC:AT1-1G
CICTICTCIAACIGIGAATGIGAGGACAGGAGAAIT TA TICCA.ACIAG AT GCGTG
CGTACTGGGAACAITTGAC
81 GCTACCAAATCAAAITICTC ITGCAACC:AT TGCAI CAA AG IGAGAGGGTAITGCiAGAGGCAAT
rGT TACICTA CTITGCAIGATCITTIGGCi ACCATTGCATCAAATITAC
81 CGAGACTGTAGAATCACAATCACTTATTCiTATGGATG
GACGATGAGAATTTICAAGAACAGGACICCTCTGCA CATGGTITATTGTAAATG A
ATTGTATGGATGACATAGA
S ACATAGAGACT CACTGTA AGCAG GACT
82 GT ICAGCTTC:ATC: ITCGCCIAAAAGAAAGACAAAGTA
TrICTGAACAGGTACiAAGTGCAAAACTATIACATGC GGAAGCTGITAAA.ATTTC AAGAAAGAC.AAAG
TAAAA
81 CiCTCAGCGACACiAGITGCi TGGTAACAGCAACGGICG
GCiCiCC:CfCCAGAICCACCGACGOTTGCGICTGIGC AACAAATaCCCTICCIGC
7 GACCi TT C
AACAGCAACGGTCGGAGG
82 GAAG MG.:10,AG ICTCGATAGACCIGCTGAGGAAG GGAGGCTCGCiGA 1 CCG 1 ATACCAAC:ACIT TAATGTA
TGCTGAGGAAGITGGAAG
TIGCAGGCCTIA CTCITGCAGATAITITATE "fTGGAAGT ATICITTATITG
TGGTGAACATCCTTCCGTTACATTGTACTTGAACATC TGC.IGTA ATAGATGITCCT
O CTACAG . TATCTGAG GO
GTCCTGGTTTGCCTACA.G
CTACTATAATAGATAACG CAGCAATAITTTACCTCTTA
82 CGTCATTCAAAACGTCAAAGTAAGGICITTATTITCAT TAAAGCAATAGAGG1TCC.A
82 GGGC.ATMACATTAGGATTITCCGTCCTCTAGGCATA CTGCAGAGGTAGATGACAGTAGAATAAAAAGCTGA
TAC.AGATTGACAGAGGTG
CCTCTAGGCATAGGAACG
TOKTCCCCTGTGAGGAAC
TGCAACATGAGCACACTT AAAGAAACACCAACCGTC
S CGC G CC GC
ITCCACAIGTITGCGG
TAGTGUGGAAGC.GGAGG
C.GGTCACIGGGGTGICCTGAACCTICAACC:AATIGC ATGATGAIGAACTGGICGC
7 TCGC.0 CTGA CACAGAATGGCGTGGGAC C
82 GITCCAAGGGCTIGCATGC.GACCGCAGCAATTITAAC TGGGGCCCTATAACCCACAAGTGCATAGTGCCAGC
CAAACTGGGTITCITAGC ACCGCAGCAATITCAACTC
82 AGAG1TCATCCACGTGCACCCTCATGCTAGAGAGCCT GC1.
TCACCAAGACCTGCZGAGGGGCACAGGAGCTG ACAACTGGGGAGAAAAC
9 GCG GTTA , GAG
TCATGCTAGAGAGCCTGCG
TrICTGAACAGGTACiAAGTGCAAAACTATIACATGC GGAAGCTGITAAA.ATTTC AAGAAAGAC.AAAG
TAAAA
81 CiCTCAGCGACACiAGITGCi TGGTAACAGCAACGGICG
GCiCiCC:CfCCAGAICCACCGACGOTTGCGICTGIGC AACAAATaCCCTICCIGC
7 GACCi TT C
AACAGCAACGGTCGGAGG
82 GAAG MG.:10,AG ICTCGATAGACCIGCTGAGGAAG GGAGGCTCGCiGA 1 CCG 1 ATACCAAC:ACIT TAATGTA
TGCTGAGGAAGITGGAAG
TIGCAGGCCTIA CTCITGCAGATAITITATE "fTGGAAGT ATICITTATITG
TGGTGAACATCCTTCCGTTACATTGTACTTGAACATC TGC.IGTA ATAGATGITCCT
O CTACAG . TATCTGAG GO
GTCCTGGTTTGCCTACA.G
CTACTATAATAGATAACG CAGCAATAITTTACCTCTTA
82 CGTCATTCAAAACGTCAAAGTAAGGICITTATTITCAT TAAAGCAATAGAGG1TCC.A
82 GGGC.ATMACATTAGGATTITCCGTCCTCTAGGCATA CTGCAGAGGTAGATGACAGTAGAATAAAAAGCTGA
TAC.AGATTGACAGAGGTG
CCTCTAGGCATAGGAACG
TOKTCCCCTGTGAGGAAC
TGCAACATGAGCACACTT AAAGAAACACCAACCGTC
S CGC G CC GC
ITCCACAIGTITGCGG
TAGTGUGGAAGC.GGAGG
C.GGTCACIGGGGTGICCTGAACCTICAACC:AATIGC ATGATGAIGAACTGGICGC
7 TCGC.0 CTGA CACAGAATGGCGTGGGAC C
82 GITCCAAGGGCTIGCATGC.GACCGCAGCAATTITAAC TGGGGCCCTATAACCCACAAGTGCATAGTGCCAGC
CAAACTGGGTITCITAGC ACCGCAGCAATITCAACTC
82 AGAG1TCATCCACGTGCACCCTCATGCTAGAGAGCCT GC1.
TCACCAAGACCTGCZGAGGGGCACAGGAGCTG ACAACTGGGGAGAAAAC
9 GCG GTTA , GAG
TCATGCTAGAGAGCCTGCG
83 GGGCATGGTGGTAAAGCRICACTCCCCTCCTCTTTTC CCTCTACGGAGTCTCCTCGAGCATGAGGACGACATA
GACCGTTGCGACTTGTAT
O AACC CTCOCA GA
GTCCCCTCCTCTTTTCAACC
83 1 TGGACCCTC.ATCITCCTGTGGTAGGTGCATGCGGGC
1 ACACCAACCGGCGGTACCCCGGCAAGGTCGAAGCAG ' A TITGG1TGATGCTGCTCGT
CGGCAAGGTCGAAGCAG
83 GIGGCCACATCITGAGGGGICTGTCOCAGC.AGCAC
CCTCTCAGTGAGTGGGCG
2 ACGCAGCGGTGICTGCACCGCCGTAGCTGTTGAGC.0 CAAGTC A
GCCGTAGCTGTTGAGCC
83 AGCCAGCAC.CGTGATAAACAGTAACAGCCACTCAGA AAGGGGITaiATdikAGATGTAAGTGACCTAGCC.0 GAGGGGGAAGTGCAGGT AACAGCCACTCAGAEfiii¨.
CCATCAGGGCACGTAGTA
CAGGACCATCACCACAGG ATATGGGAAGTTTCTGGCC
.IGGCCG GTGA G G
83 CGCCTGAGGTGGGGATCAC.ATTAGGTCTCAATGCTG GACGGACGCCCITATGACCGTCTACCACCTGCGTCA
AGAAATGCGACGAACTGT TTAGGTCTCAATGCTGTCG
83 AGC1TC.ACAGAGCACGACGCAAAGCCGGGGGTG7AT CACGGGGTGCGCATGGTATGGGAGTGTTTAGGTAG
AAAGCCGGGGGTGTATAG
GGGACCTGATGTGGAAAT
'TGCAGAAGAACTGAAACCCGCTCCACATGIGTITCC GGICCIG TACCAGCAGITT
GCTCTAAGCACATCCCCTA
9 CCTAC 1 TCCAGA , G C
GACCGTTGCGACTTGTAT
O AACC CTCOCA GA
GTCCCCTCCTCTTTTCAACC
83 1 TGGACCCTC.ATCITCCTGTGGTAGGTGCATGCGGGC
1 ACACCAACCGGCGGTACCCCGGCAAGGTCGAAGCAG ' A TITGG1TGATGCTGCTCGT
CGGCAAGGTCGAAGCAG
83 GIGGCCACATCITGAGGGGICTGTCOCAGC.AGCAC
CCTCTCAGTGAGTGGGCG
2 ACGCAGCGGTGICTGCACCGCCGTAGCTGTTGAGC.0 CAAGTC A
GCCGTAGCTGTTGAGCC
83 AGCCAGCAC.CGTGATAAACAGTAACAGCCACTCAGA AAGGGGITaiATdikAGATGTAAGTGACCTAGCC.0 GAGGGGGAAGTGCAGGT AACAGCCACTCAGAEfiii¨.
CCATCAGGGCACGTAGTA
CAGGACCATCACCACAGG ATATGGGAAGTTTCTGGCC
.IGGCCG GTGA G G
83 CGCCTGAGGTGGGGATCAC.ATTAGGTCTCAATGCTG GACGGACGCCCITATGACCGTCTACCACCTGCGTCA
AGAAATGCGACGAACTGT TTAGGTCTCAATGCTGTCG
83 AGC1TC.ACAGAGCACGACGCAAAGCCGGGGGTG7AT CACGGGGTGCGCATGGTATGGGAGTGTTTAGGTAG
AAAGCCGGGGGTGTATAG
GGGACCTGATGTGGAAAT
'TGCAGAAGAACTGAAACCCGCTCCACATGIGTITCC GGICCIG TACCAGCAGITT
GCTCTAAGCACATCCCCTA
9 CCTAC 1 TCCAGA , G C
84 AAGCTGIGGACGCGGAGGGICAACATATTGGGCGG 1 TCGTCGTGAGCGGCTTAGCCCGGCAAGGATGTCAA
O CTG ' CAA
CCTCCCAGACCCTGITGC TCAACATATTGGGCGGCTG
CTCTGAAGGCGCTAACCA
ATAGCCITTGCCICCAGGG
ATAGGGGAGTTIGGGCTG
ICACACCACTTGCT
AGGATACATCGTTATGCGC
3 . GCGCC TGACT TCACIGAGGIGGATGGCC C
MG CAACATCACCCGGGITGA
84 GCCGGCTATTGACCCfCCGACAACAGCTGGCTGATAA CATCTCCTGACCCGACAACAGCGGGGCATFGAGCTA
CGAGICTGCTGTATCGCA CAACAGCTGGLICATAAG
84 .ACCCGCACGGATAAGTCTGGATAGICTICTGTGTAGA
TGCGAGAAGAIGGCACTCTACGTGCAGATCCCATCA ATGCCACCCCAATACCT AC
AGTCITCTGTGTAGACCCG
TTGCCAACTAGACCCTACC
7 T 'MAC G
CCCTCACAGAACGCCICT
84 TGG I ACAG I CACGGCT GAGGTAAACATCAIGCTCG f GCGAGCI
GCTIGGGAGACAGTGAIGA IGITGCCCA AACAICATGCICGTCCAAC
8 CC.AACG ACC.AC CCACAGCCCACATACGAC G
TAAGAGCTIACCITGGGCAATCiACT *FGGTATGGCACCITTITCC
9 . CC TTGG CACTCCACTCGATCTCCCT
C .
O CTG ' CAA
CCTCCCAGACCCTGITGC TCAACATATTGGGCGGCTG
CTCTGAAGGCGCTAACCA
ATAGCCITTGCCICCAGGG
ATAGGGGAGTTIGGGCTG
ICACACCACTTGCT
AGGATACATCGTTATGCGC
3 . GCGCC TGACT TCACIGAGGIGGATGGCC C
MG CAACATCACCCGGGITGA
84 GCCGGCTATTGACCCfCCGACAACAGCTGGCTGATAA CATCTCCTGACCCGACAACAGCGGGGCATFGAGCTA
CGAGICTGCTGTATCGCA CAACAGCTGGLICATAAG
84 .ACCCGCACGGATAAGTCTGGATAGICTICTGTGTAGA
TGCGAGAAGAIGGCACTCTACGTGCAGATCCCATCA ATGCCACCCCAATACCT AC
AGTCITCTGTGTAGACCCG
TTGCCAACTAGACCCTACC
7 T 'MAC G
CCCTCACAGAACGCCICT
84 TGG I ACAG I CACGGCT GAGGTAAACATCAIGCTCG f GCGAGCI
GCTIGGGAGACAGTGAIGA IGITGCCCA AACAICATGCICGTCCAAC
8 CC.AACG ACC.AC CCACAGCCCACATACGAC G
TAAGAGCTIACCITGGGCAATCiACT *FGGTATGGCACCITTITCC
9 . CC TTGG CACTCCACTCGATCTCCCT
C .
85 GCAATG TAGGCTTCAATGITCTCAIGICAAAACAGAA
GIAAGGAAGTOGGCCTIAATGGCTITTRAATIGCT TGAT TTFCAAAAA TM CTG
CITCAAAACAGAAAGACGC
O AGACGCC ATGCCAT1T AC.AG C
85 CCA T GTGCT TITCCAT T ATCCA ITT( CTGCTGAAT TG T RAI niCACT
TCITATAGCTGACGCCTCCRITITGCC GCAGCAAGAAGAAACGG
2 TCACTCT ITCTCTC I TC.TGCT.
GAATTCTICACTCT
AAAGCCITAGATAACCAACAAGATGAGAGTAAGTC TTAAGAGC.ACTAGCA AGA
GTAAAAGCAAACTACAATG
ACCCICITAAGTIGAGAAACACiGAGAAAGCACT CAATAGAGGACAAGGCTC GTA ITAATGA IGGCATGCT
RS IGGTTTITTCiGG I CTITT CACIC ITACACCCITIC1711 TA
ICAGCAGATIGGAAGAAGAAGG1CATCTICTIG AATA A A TT TCAGCi ICTGG
ACACCCITICITTTTCGAAT
TATGCCiCTG ITGAAA A TGGA TCCTCAGITTCCTA
AAC.GTITGGGGTATCGTT
85 ACACCTGTICCACAAGIGIGAGAGGAGGACiCAGCAA
GAGGAGGAGCAGCAAGAA
as TCTTCTAGGATTCCACTGTCATCAAACTTTATCATCTA
CCATAGCGGTTCAGAAAATGGCCAGTTGCTCGCCAA AC. ITT ATCATCTAGAAAGC
8 GAAAGCAAACTC . TTAG TGTTGCCGCTTTAAGTCC
AAACTC
AATGTATGATTATATCTGA AAANITCGTAGIGTACCTG
9 GTACCTGTTG GCAGC ACC. ICC. rre
GIAAGGAAGTOGGCCTIAATGGCTITTRAATIGCT TGAT TTFCAAAAA TM CTG
CITCAAAACAGAAAGACGC
O AGACGCC ATGCCAT1T AC.AG C
85 CCA T GTGCT TITCCAT T ATCCA ITT( CTGCTGAAT TG T RAI niCACT
TCITATAGCTGACGCCTCCRITITGCC GCAGCAAGAAGAAACGG
2 TCACTCT ITCTCTC I TC.TGCT.
GAATTCTICACTCT
AAAGCCITAGATAACCAACAAGATGAGAGTAAGTC TTAAGAGC.ACTAGCA AGA
GTAAAAGCAAACTACAATG
ACCCICITAAGTIGAGAAACACiGAGAAAGCACT CAATAGAGGACAAGGCTC GTA ITAATGA IGGCATGCT
RS IGGTTTITTCiGG I CTITT CACIC ITACACCCITIC1711 TA
ICAGCAGATIGGAAGAAGAAGG1CATCTICTIG AATA A A TT TCAGCi ICTGG
ACACCCITICITTTTCGAAT
TATGCCiCTG ITGAAA A TGGA TCCTCAGITTCCTA
AAC.GTITGGGGTATCGTT
85 ACACCTGTICCACAAGIGIGAGAGGAGGACiCAGCAA
GAGGAGGAGCAGCAAGAA
as TCTTCTAGGATTCCACTGTCATCAAACTTTATCATCTA
CCATAGCGGTTCAGAAAATGGCCAGTTGCTCGCCAA AC. ITT ATCATCTAGAAAGC
8 GAAAGCAAACTC . TTAG TGTTGCCGCTTTAAGTCC
AAACTC
AATGTATGATTATATCTGA AAANITCGTAGIGTACCTG
9 GTACCTGTTG GCAGC ACC. ICC. rre
86 GAAGGTICTGTTGGCACATCTATATAGATAGTAAACA AGGGTAGTATGCTTTAAATTCCCAAAACATATACAC
AGATAGTAAACATAGACA
O TAGACAGCCT AGGCTCCC
GAGAGCAGCTCTAGATGG GCCT
AACAAGCAAMCCATAAGTCTGACATGMCTITTC GGGAAACTATTAGAMAG
CACGTCAGCAAGGCATTA
86 GCACACITCCATICTCCAGITITACTTAC.AGAGCCATCA
AAAACCIACACTATCTCCCCTCX:CGTGCTICTAGAGT AAGTG GAC.TATAATGG GC
TACAGAGCCATCAGGGGA
CCAAAGACATAGATCAGT TCTGTCTAGACTTAGAGCG
4 ' F 'MG CCICTGG TGGGIATIGGiAACCGGAA 6 TATAGGTTCTMGAAGAG AATCAAAGGGAGGAGM
GAMMA AGCCCGA ATACC CA
ATTAATGCCACTTGGACA
G1TGATC.AGCTTGCCGAA
86 GACCAAC.ACCTTAGATTGATCTACATITATTCTGCTAC
AGITTCAGGCAATCAGITTAGAGTGATCTATTAAAG GAGTITACAATACGGATG TTCATTCTGCTACTGATAG
7 TGATAGACTC CAAACTTGTTAGG , AATATGT ACTC
GTGCACCACTGGATATCA ATGGCCTGATTTACTTAAA
86 alGTAGTGGAATITTACAAAGCTGAACATTATCTCA 1 ATCAATGCTATGAATCCAGGCATTGTCCTCAATACC ACCAGGATTATAAGTATA
AACATTATCTCAGACATGT
9 GACATGTAGAAGA ' GGATGG AAGCTAC AGAAGA
AGATAGTAAACATAGACA
O TAGACAGCCT AGGCTCCC
GAGAGCAGCTCTAGATGG GCCT
AACAAGCAAMCCATAAGTCTGACATGMCTITTC GGGAAACTATTAGAMAG
CACGTCAGCAAGGCATTA
86 GCACACITCCATICTCCAGITITACTTAC.AGAGCCATCA
AAAACCIACACTATCTCCCCTCX:CGTGCTICTAGAGT AAGTG GAC.TATAATGG GC
TACAGAGCCATCAGGGGA
CCAAAGACATAGATCAGT TCTGTCTAGACTTAGAGCG
4 ' F 'MG CCICTGG TGGGIATIGGiAACCGGAA 6 TATAGGTTCTMGAAGAG AATCAAAGGGAGGAGM
GAMMA AGCCCGA ATACC CA
ATTAATGCCACTTGGACA
G1TGATC.AGCTTGCCGAA
86 GACCAAC.ACCTTAGATTGATCTACATITATTCTGCTAC
AGITTCAGGCAATCAGITTAGAGTGATCTATTAAAG GAGTITACAATACGGATG TTCATTCTGCTACTGATAG
7 TGATAGACTC CAAACTTGTTAGG , AATATGT ACTC
GTGCACCACTGGATATCA ATGGCCTGATTTACTTAAA
86 alGTAGTGGAATITTACAAAGCTGAACATTATCTCA 1 ATCAATGCTATGAATCCAGGCATTGTCCTCAATACC ACCAGGATTATAAGTATA
AACATTATCTCAGACATGT
9 GACATGTAGAAGA ' GGATGG AAGCTAC AGAAGA
87 ACTCAAGCTGAATTGTOCAAATTGATTACTACTGTGA CACTIGTCCATITGTGTCATTCGTGTITGTTTMACA
TGGAACATAGTACTGTGA CATTACTACTGTGAAACTA
O AACTACTCTG GCCG ACA CTCTG
87 ACGTTATCTCCCTCTTAAGTTTCCTC.CTCAATGGGAAT TCCATGGGGCCAAAGAAATAGCTATGAGGCCCATG
GACGCTITGICCAAAATG CCTCAATGGGAATGGAGA
GCAGATGCAACGATTCAA
2 ATCATTG TCCTTC.0 G
TTGCCGCGAGTATCATTG
AGGTAOCAATGGGGAAG GGTACGGGATGTAATGGA
87 CTGCTGAGTTTCCACTTCAGTATAAAAAGGAGATTAT TACAGGTAGAAGGGCGCCATCCCCACTTCCAC.CACT
TAGAAAAACAMGTAGAG AAAAAGGAGATTAMGAA
S CAAGGAA GCACATTCTAGGC G
GAGGACGAGGACAAGGAA
CTC.AC.AATATAGTAATGA ACATTACAAGACGTTAGOC
6 TTAGCCT TCITCAC.AAA AAAG MC; r AATCCCACTMACTGACCCAAATGCCCTCCAGTITC GIGGAAGAAACTAGMT AACATCIGTACCTICCATTC
7 Traivircr rcs , AT TGA1 G C
TACGAAAACGACGTAAAC TTTTTCAGATGTCTCTTTGG
TACATTTACCTGACCCCAA AAGTITGGTTTTCCTGACA
TGGAACATAGTACTGTGA CATTACTACTGTGAAACTA
O AACTACTCTG GCCG ACA CTCTG
87 ACGTTATCTCCCTCTTAAGTTTCCTC.CTCAATGGGAAT TCCATGGGGCCAAAGAAATAGCTATGAGGCCCATG
GACGCTITGICCAAAATG CCTCAATGGGAATGGAGA
GCAGATGCAACGATTCAA
2 ATCATTG TCCTTC.0 G
TTGCCGCGAGTATCATTG
AGGTAOCAATGGGGAAG GGTACGGGATGTAATGGA
87 CTGCTGAGTTTCCACTTCAGTATAAAAAGGAGATTAT TACAGGTAGAAGGGCGCCATCCCCACTTCCAC.CACT
TAGAAAAACAMGTAGAG AAAAAGGAGATTAMGAA
S CAAGGAA GCACATTCTAGGC G
GAGGACGAGGACAAGGAA
CTC.AC.AATATAGTAATGA ACATTACAAGACGTTAGOC
6 TTAGCCT TCITCAC.AAA AAAG MC; r AATCCCACTMACTGACCCAAATGCCCTCCAGTITC GIGGAAGAAACTAGMT AACATCIGTACCTICCATTC
7 Traivircr rcs , AT TGA1 G C
TACGAAAACGACGTAAAC TTTTTCAGATGTCTCTTTGG
TACATTTACCTGACCCCAA AAGTITGGTTTTCCTGACA
88 TCACCATCCTGAATAACTGTGITTAAGGATCCCCATG TATGGITGATACTGGCTTTGGTGAACTTCACTTTTGT
CCACCTATAGGGGAACAC
AAGGATCCCCATGIACCA
ACGMGGGAACAAATGIT TTAGACATTTATTTAATAG
1 . ITAATAGGGCMG AACCAAT TG CiGCMG
CTCCAGCACCTAAAGAAG CCCCTTAAAAAATACACITT
MCCAAATCCCIGTTTAAAAAACAATGGAAT TITTGTAGCTICAACCGAA
3 'CITITGG 661161.3 IT CGGITC1CA
TGCTITTIGG
ATAAA=AGCAGACATITTAT CAAAAGAGAACMCAATG
TICCATAATATAAGGGGMCGTGGCGTGTICTTGAT ACCGTTGIGTGATTIGTTA GIGTATTAACTGICAAAAG
S CAAAAGCCA GM:MCA ATTAG CCA
CAACCAGAGACAACTGAI AT:MITA] GAGCAATTAAA
MCACACAMAGAAGTCTGC.AGIMAIMGCC GAAGACTIT GMCGACAG GCT TCAATCCAA TGATCGT
7 , ATCGTC TCGTIC I C
.
CATTGAAA ITGGCITICTCTAGG GGTGAATAGI ATCTGC.AAC
88 'ICITCAIGT7 ClITC I CCITGGAAGGI GCTAGCAGAA AACAAGCCAA ITAAAGIGGGCA
1CCFCA AAGTCC:AC ATCTCCTGACITGGAAGC
GGTGCTAGCAGAACITCA
CCACCTATAGGGGAACAC
AAGGATCCCCATGIACCA
ACGMGGGAACAAATGIT TTAGACATTTATTTAATAG
1 . ITAATAGGGCMG AACCAAT TG CiGCMG
CTCCAGCACCTAAAGAAG CCCCTTAAAAAATACACITT
MCCAAATCCCIGTTTAAAAAACAATGGAAT TITTGTAGCTICAACCGAA
3 'CITITGG 661161.3 IT CGGITC1CA
TGCTITTIGG
ATAAA=AGCAGACATITTAT CAAAAGAGAACMCAATG
TICCATAATATAAGGGGMCGTGGCGTGTICTTGAT ACCGTTGIGTGATTIGTTA GIGTATTAACTGICAAAAG
S CAAAAGCCA GM:MCA ATTAG CCA
CAACCAGAGACAACTGAI AT:MITA] GAGCAATTAAA
MCACACAMAGAAGTCTGC.AGIMAIMGCC GAAGACTIT GMCGACAG GCT TCAATCCAA TGATCGT
7 , ATCGTC TCGTIC I C
.
CATTGAAA ITGGCITICTCTAGG GGTGAATAGI ATCTGC.AAC
88 'ICITCAIGT7 ClITC I CCITGGAAGGI GCTAGCAGAA AACAAGCCAA ITAAAGIGGGCA
1CCFCA AAGTCC:AC ATCTCCTGACITGGAAGC
GGTGCTAGCAGAACITCA
89 CCGTCTCATGCT MCAATGTAGAACITGATGAAATAG TT
O GGGA CTCCCTTCA
GACTGACTCAAGCTGGGT GGA
GITGGAACCACATAAATGGGAAAAGCTTGGCCTAC GGATTCATTATAAAAGGA ATTIGAGGAATGATACal A
AAICAGAAACATGOCCA.A iTGGAAAGGITCTGFAC CAGAIMAGAGCATGATT
89 ITCICIGTGIGACCAITITaTriGriATTACAACACAT I
ACMAGACTGAACAAAAGGAGCTACGCA MCITTCi 1 GATTCAAIGGATAAGGAA GAT
TACAACACATITCCAG
MAGGAAGAAAITG TGAAGG TFCTIMTAITTMTAGAAG
OCAGGAGCATCTGTGAGA
89 IGTG TCCGTAACTICATACICITACIGTMTCAAAC.AA
AGTACCAGCGGAAATGCFCGCMICTIGTIGATITG GAAAMTCITAA.GCA 116 S GATGGC TTGAAGT CTC
TGTTCTCAAACAAGATGGC
89 TCTGAGGATTGGAGTCCGTCATCMGGGC.AGAAGA A TGACTTCGCCCTCAT
AGTGAAACCTATCTACTCCTGC AGTCTTAGGAGTFTCAATT
6 GT . CG CTGA
ATCTCGGGCAGAAGAAGT
TTCGTAGCTAACTTCAGTA AGTGTCX:GOGATTAATGA
89 ATCAATTC.CCATITCAAGCAGACCTAGGGCCAAACCT
GGATGAAGACTACCALAGGCAGGATTTCCITATGACT CAGGACrGTTGGTTTCACI
AGGGCCAAACCTGTACAA
AGATGGACCTGACATCTGTACATICICAATAAATGG GTTAACAACTATCAGGCG
GTTCIAACACGAACCITTA
9 CG11TACG GATACGGAGAG All CG
O GGGA CTCCCTTCA
GACTGACTCAAGCTGGGT GGA
GITGGAACCACATAAATGGGAAAAGCTTGGCCTAC GGATTCATTATAAAAGGA ATTIGAGGAATGATACal A
AAICAGAAACATGOCCA.A iTGGAAAGGITCTGFAC CAGAIMAGAGCATGATT
89 ITCICIGTGIGACCAITITaTriGriATTACAACACAT I
ACMAGACTGAACAAAAGGAGCTACGCA MCITTCi 1 GATTCAAIGGATAAGGAA GAT
TACAACACATITCCAG
MAGGAAGAAAITG TGAAGG TFCTIMTAITTMTAGAAG
OCAGGAGCATCTGTGAGA
89 IGTG TCCGTAACTICATACICITACIGTMTCAAAC.AA
AGTACCAGCGGAAATGCFCGCMICTIGTIGATITG GAAAMTCITAA.GCA 116 S GATGGC TTGAAGT CTC
TGTTCTCAAACAAGATGGC
89 TCTGAGGATTGGAGTCCGTCATCMGGGC.AGAAGA A TGACTTCGCCCTCAT
AGTGAAACCTATCTACTCCTGC AGTCTTAGGAGTFTCAATT
6 GT . CG CTGA
ATCTCGGGCAGAAGAAGT
TTCGTAGCTAACTTCAGTA AGTGTCX:GOGATTAATGA
89 ATCAATTC.CCATITCAAGCAGACCTAGGGCCAAACCT
GGATGAAGACTACCALAGGCAGGATTTCCITATGACT CAGGACrGTTGGTTTCACI
AGGGCCAAACCTGTACAA
AGATGGACCTGACATCTGTACATICICAATAAATGG GTTAACAACTATCAGGCG
GTTCIAACACGAACCITTA
9 CG11TACG GATACGGAGAG All CG
90 TCAGATACCTCGAAAGACTCCTGATITGCTIGTTGTC ATTGGITTAAGGTOTGITACATGCMGAAACAACAG
CTGGCGAGATAATTTCGT
TTGC.ITGTTGTCAGTGCT
AGGAGGAGICTGAATATCCATATGCGCCTIAAC.AGC GCAGACTCTATATAATTAA AGAGCAGCCAAATATAGG
CTGGCGAGATAATTTCGT
TTGC.ITGTTGTCAGTGCT
AGGAGGAGICTGAATATCCATATGCGCCTIAAC.AGC GCAGACTCTATATAATTAA AGAGCAGCCAAATATAGG
91 PCT/US2022/076140 90 TGCCTCATCCACAATATACC.AATC.ACTGTGTGCCATIC
CGGGATGATGATGAGAGCGAATTATCTGATATGTC
2 GACAC ACMCAGATC CfGCTGTTAGGCTCACTG
CIGTGIGCGATTCGACAC
CAGAACGAGCAGGACZTA
GTACAGTCTCTGGGAGGT
90 AAGTCTAACTGGCTCCAAAACCTTGCATTITCCTTAAC AAGTGACC.AAGAAGACGAGGAGCTTGTAGCCAGTC
ATITGATGAAAATGGTCA GCATTITCCTIAACAGACA
AGACATCA TAAGCG , AC.CG TCA
CCGAAAAAAGGGAATAAT
AGGACAATCCACCCAAACT
TTTAGATAAGGTGGTGGT ITCTGTTATTTTAGGGGC.A
90 TGGGMCGTGTGATTGTAC.GTGATCTAGTTCAGCTGT
TCCITTATTCAGGATCACAGC:TAGTAATATTATCAGA ATCTATITTCAGCTEITCTIG
CGGGATGATGATGAGAGCGAATTATCTGATATGTC
2 GACAC ACMCAGATC CfGCTGTTAGGCTCACTG
CIGTGIGCGATTCGACAC
CAGAACGAGCAGGACZTA
GTACAGTCTCTGGGAGGT
90 AAGTCTAACTGGCTCCAAAACCTTGCATTITCCTTAAC AAGTGACC.AAGAAGACGAGGAGCTTGTAGCCAGTC
ATITGATGAAAATGGTCA GCATTITCCTIAACAGACA
AGACATCA TAAGCG , AC.CG TCA
CCGAAAAAAGGGAATAAT
AGGACAATCCACCCAAACT
TTTAGATAAGGTGGTGGT ITCTGTTATTTTAGGGGC.A
90 TGGGMCGTGTGATTGTAC.GTGATCTAGTTCAGCTGT
TCCITTATTCAGGATCACAGC:TAGTAATATTATCAGA ATCTATITTCAGCTEITCTIG
92 AGAAACGTGTAGGCCTTTCCTAACT6AG1TCACAACT TAGACGCTATTATGAACAAGTGCCGTAACTAAGGA
CTACTGAAGGTGAGCTTG AACTGAGTTCACAACTAGT
O AGTACA AGCAGGCC A ACA
CAGAGCCACTATAAGTAC GAAGTGGTGTTACTATAG
ATCCTACTGATCTTGACAT TTTAGATAATTCAACAGGG
CCGTGCAAATGATAACACT
TGTGCAGGAACACAGTTTCAGCCTCAATAACTGTGT TAATAATAC-ACATGCAACT
CAMATGTTGCTITTGATG
5 AAT 1 AT TCAGGCC , CGGAGCAGCTTGA ! CAAT
*FTCCIT I AGGCCGCAAAT
GGTTACACACCTGTACAG TTGTTAATAAACGTITATG
6 GTTIATGCTGC ' GCG T CTOC
91 C.AGATGCTCTGATTICTGIGGCGATCATATGAACAAA TCGGAAGAATGATTGGIGGAATCGCATAATCACTG
GATCATATGAACAMTGGA
GAACAATAGCAATGGAGT
TGAGGGGATCAGTTGCAC
9 . CACA TTITGGA cATrrTCCIGGCACC3C3TCA A
GCTGCATTTGAAGATTTAA CAAGMCATAAGAGGAAA
O AAAGAAAGT CiTCICAG GAGT GAAAGT
TGCAGGACCITIMCGCCATTIGAATGC TGGGIGGGCTATATACAG AAGACAATAGCATAAGAA
1 ATAAGAATIGG r CTCITCA C ITTIGT
92 CGITGTATTTTAATACkGCCACTGCCATGGGCTGGCT ITGAGGACACAAGAGTCTGAATGTCCATCAGTCATT
%Via GGCTAACAATC
TCAATAGAGITGAATGCA AATTCTCACIATGAGGAAT
3 CiAAI CFTC AATCCAGG TIT CCT CFTC
92 GATCce &AATICKICTCAAACCCAITGGIGTTIGGATA AA( GGATGGACAGAGACTGATAGTCANICAG ICA V AGGGATIITCATATAGGT TGGTG ITTGGA
TACiGA AG
92 GAG ITT TCAAAGATCCrfGGGTGCA.GATIGTGTA ITG GIG TCTIGAAACGA
TGGAAGTTGGTCATAGOTTIG CAGATTG TWAT TGGA AGC:
5 . GAACiC.A GCGACC CGAGTG43GTATGCAC.A.A.A
A .
ATGICGTGAGGAAGAGTAATTGIAAA TGCAAATCAGAGGATTCG TACIT TGICGAAACACIAG
6 TAGCti AGAGAGCTCTGTA TG CC
ATGITCGAAAGTAAGAGCATGAAGCTCAAGTCAAT CAAGGAACCAACCTGAAT GITTACiAAATGTC17AAGC
92 GTCCGICCCACCAGTATG TGAGT CTIAGGACTI TTCAA TCCAATC.CTCTGATGA I
8 TCCT TCATG GTAC er GCTTCTTCTACCIICTATGG AGCCAACTTCAGTATGGAG
CTACTGAAGGTGAGCTTG AACTGAGTTCACAACTAGT
O AGTACA AGCAGGCC A ACA
CAGAGCCACTATAAGTAC GAAGTGGTGTTACTATAG
ATCCTACTGATCTTGACAT TTTAGATAATTCAACAGGG
CCGTGCAAATGATAACACT
TGTGCAGGAACACAGTTTCAGCCTCAATAACTGTGT TAATAATAC-ACATGCAACT
CAMATGTTGCTITTGATG
5 AAT 1 AT TCAGGCC , CGGAGCAGCTTGA ! CAAT
*FTCCIT I AGGCCGCAAAT
GGTTACACACCTGTACAG TTGTTAATAAACGTITATG
6 GTTIATGCTGC ' GCG T CTOC
91 C.AGATGCTCTGATTICTGIGGCGATCATATGAACAAA TCGGAAGAATGATTGGIGGAATCGCATAATCACTG
GATCATATGAACAMTGGA
GAACAATAGCAATGGAGT
TGAGGGGATCAGTTGCAC
9 . CACA TTITGGA cATrrTCCIGGCACC3C3TCA A
GCTGCATTTGAAGATTTAA CAAGMCATAAGAGGAAA
O AAAGAAAGT CiTCICAG GAGT GAAAGT
TGCAGGACCITIMCGCCATTIGAATGC TGGGIGGGCTATATACAG AAGACAATAGCATAAGAA
1 ATAAGAATIGG r CTCITCA C ITTIGT
92 CGITGTATTTTAATACkGCCACTGCCATGGGCTGGCT ITGAGGACACAAGAGTCTGAATGTCCATCAGTCATT
%Via GGCTAACAATC
TCAATAGAGITGAATGCA AATTCTCACIATGAGGAAT
3 CiAAI CFTC AATCCAGG TIT CCT CFTC
92 GATCce &AATICKICTCAAACCCAITGGIGTTIGGATA AA( GGATGGACAGAGACTGATAGTCANICAG ICA V AGGGATIITCATATAGGT TGGTG ITTGGA
TACiGA AG
92 GAG ITT TCAAAGATCCrfGGGTGCA.GATIGTGTA ITG GIG TCTIGAAACGA
TGGAAGTTGGTCATAGOTTIG CAGATTG TWAT TGGA AGC:
5 . GAACiC.A GCGACC CGAGTG43GTATGCAC.A.A.A
A .
ATGICGTGAGGAAGAGTAATTGIAAA TGCAAATCAGAGGATTCG TACIT TGICGAAACACIAG
6 TAGCti AGAGAGCTCTGTA TG CC
ATGITCGAAAGTAAGAGCATGAAGCTCAAGTCAAT CAAGGAACCAACCTGAAT GITTACiAAATGTC17AAGC
92 GTCCGICCCACCAGTATG TGAGT CTIAGGACTI TTCAA TCCAATC.CTCTGATGA I
8 TCCT TCATG GTAC er GCTTCTTCTACCIICTATGG AGCCAACTTCAGTATGGAG
93 ATCCTCAAAGCGCGCCATAGCGAAAACGGTGIATAT
CAACACGGCGACCCIAGNAGTMGCAGTGAAGIGT TAGIATATAAAMAGGGA CGAA.AACG GTGI ATATAAA
O AAAAGATGT TCAG GTAACCG
AGATGT
TTGGAAAAACTAACIAACACTGGGITICAACCiGIT T GGTGTATAGAGACAGIAT
TGCTGCATGCCATAAATGT
TGACCTTCfATGTCCATCIATTTCATCCi AATAITAAGTAIGCATGGA
93 GC:ATCATIGIGGACCTCCTGAACATTTTaf GAACAGG ACGMAGITT
AACATTITGTGAACAGGC.A
ATATCAGATAGTGGCTAT
4 A . CACITGTACC GGC
TGTTCTGAAGTGGAAGCA
93 ACATCTAAATCGTATCCACTGTGACGTGCC.ACAATGT AGATGAAGGGGGAGATTGOAGAGGCTCCTAAMA
GTGCCACAATGTGCAAAC
93 ATGTTCCCITGCTGCAAAGAATGAC.ATAGAC.AGCCAA GGCATACAGACATTAAACCAMAGTTCAATAGCTTT
ATAGACCACTATGAAAAT GACATAGAC.AGCCAAATAC
93 CCATGITCCTGC.ATC.AGICATATGGCAACAAAGACAA
GACAAAACCGCTACCTGTGTAACGTGTTGTACCCTT TTAAAMAGGIGGMAAA GGCAACAAAGACAATTGT
93 GGIGCTGCSAATACGGTGAGGGCGETTATCCGC.TACTC GGGCACCGCAMGACCTACCIGAGICCACAGTGICC
ATGIGCAGTACC.AGTGAC
CGGTATCCGCTACTCAGCT
CTACGCCTATAATACATTT ACAGTTTAAAATGTTTACG
CAACACGGCGACCCIAGNAGTMGCAGTGAAGIGT TAGIATATAAAMAGGGA CGAA.AACG GTGI ATATAAA
O AAAAGATGT TCAG GTAACCG
AGATGT
TTGGAAAAACTAACIAACACTGGGITICAACCiGIT T GGTGTATAGAGACAGIAT
TGCTGCATGCCATAAATGT
TGACCTTCfATGTCCATCIATTTCATCCi AATAITAAGTAIGCATGGA
93 GC:ATCATIGIGGACCTCCTGAACATTTTaf GAACAGG ACGMAGITT
AACATTITGTGAACAGGC.A
ATATCAGATAGTGGCTAT
4 A . CACITGTACC GGC
TGTTCTGAAGTGGAAGCA
93 ACATCTAAATCGTATCCACTGTGACGTGCC.ACAATGT AGATGAAGGGGGAGATTGOAGAGGCTCCTAAMA
GTGCCACAATGTGCAAAC
93 ATGTTCCCITGCTGCAAAGAATGAC.ATAGAC.AGCCAA GGCATACAGACATTAAACCAMAGTTCAATAGCTTT
ATAGACCACTATGAAAAT GACATAGAC.AGCCAAATAC
93 CCATGITCCTGC.ATC.AGICATATGGCAACAAAGACAA
GACAAAACCGCTACCTGTGTAACGTGTTGTACCCTT TTAAAMAGGIGGMAAA GGCAACAAAGACAATTGT
93 GGIGCTGCSAATACGGTGAGGGCGETTATCCGC.TACTC GGGCACCGCAMGACCTACCIGAGICCACAGTGICC
ATGIGCAGTACC.AGTGAC
CGGTATCCGCTACTCAGCT
CTACGCCTATAATACATTT ACAGTTTAAAATGTTTACG
94 ATACCAATACCCATGCATACGCT ATGIGIGCTGCCA 1 ATTGTGGTAATAACGTCCCCTGCGCAATAGTAACA T TAMTGCIT TIGTGTATG
O Ci TC GGGCAAT A CATG
TATGTGTGCTGCCATGIC
TGCAGCCITTAGTATCTGC
94 C.ACAGCCC.AAAATACATAACTGTGTGAGGACGTTAG ATTGGGGAAC.4CTGGGCTAATAAG1TCTAAAGGGG
3 GGACAA GGCA , CGCCACGTCTAATGITTC
TGAGGACGITAGGGACAA
ATCCTGATTATTTACAAAT GGGAITCCATGITTITITG
94 GAGACTGIGTAGAAGCACATATTGITGMGCTGGCA CTGTACCTGGGCAATATGATGCATCATATrCCTCAA
TATTCTiTTACATAAGGCA TGTITCOTGGCATAATCAA
TAATCAAT CATGICIG CAGG I
CGCCCIAGTGAGTAACAA ATTTGIGTTTGTGGTATGG
94 CGGTATAAGGGAAAGTTGTGCITATGGATCiTCAATC GAGACCC.TCCTTACAGCCATGACTGATGIGTCCIGT
GGATGTCAATCCGACCTT
AAATAAGGGGGTTTGTAT
GGCAAGGA
O Ci TC GGGCAAT A CATG
TATGTGTGCTGCCATGIC
TGCAGCCITTAGTATCTGC
94 C.ACAGCCC.AAAATACATAACTGTGTGAGGACGTTAG ATTGGGGAAC.4CTGGGCTAATAAG1TCTAAAGGGG
3 GGACAA GGCA , CGCCACGTCTAATGITTC
TGAGGACGITAGGGACAA
ATCCTGATTATTTACAAAT GGGAITCCATGITTITITG
94 GAGACTGIGTAGAAGCACATATTGITGMGCTGGCA CTGTACCTGGGCAATATGATGCATCATATrCCTCAA
TATTCTiTTACATAAGGCA TGTITCOTGGCATAATCAA
TAATCAAT CATGICIG CAGG I
CGCCCIAGTGAGTAACAA ATTTGIGTTTGTGGTATGG
94 CGGTATAAGGGAAAGTTGTGCITATGGATCiTCAATC GAGACCC.TCCTTACAGCCATGACTGATGIGTCCIGT
GGATGTCAATCCGACCTT
AAATAAGGGGGTTTGTAT
GGCAAGGA
95 CCATCCCACCAGTAAGTAGICTTGTTAGGCGTCTMA TCTTCAATCCTCTGACGATTTTGCCGGCTTGAATCCC
CATGTTCAATATGITAAGC
O TCCIG TTCATG ACIGTA
TIAGGCGICTCCATCCIG
ACIAGTTTITIOTATCGTT
TTIGTTGCCAATTTCAGCA
ACAGAAGACCAGTCGGGATATCCGCATCAATTCGG TGAATAC.AAGICAAAGAG
s 3 ACAGGA ICCACC:GCCTlICGCGCCMACCACC1CAA 1 TCAC , GG 1 GGAAGGCCCGGAA
CGCMCACCACCICAA
95 GGAGCCCACAACAATAGTGGGCTC.GGGGACAGCTAT 1 GAGTTCCTCTAGGGATGGCCGATGCCGCCTGTAGC
TCOGGGACAGCTATGCG
TGCTGUACTCTCTACAGAC
IGGAGTTGCTACGCGGC
CCGAGAGGGAACTGTGTG
7 . I GIGGA A CTGTGGCO TGACCGIG GA
GGGGIGGAAAGCAAGGT AAGGCCAAAATACCTGGG
GIGTAATIGGCCAACAGT
9 AAR; AACCA GC
CATGTTCAATATGITAAGC
O TCCIG TTCATG ACIGTA
TIAGGCGICTCCATCCIG
ACIAGTTTITIOTATCGTT
TTIGTTGCCAATTTCAGCA
ACAGAAGACCAGTCGGGATATCCGCATCAATTCGG TGAATAC.AAGICAAAGAG
s 3 ACAGGA ICCACC:GCCTlICGCGCCMACCACC1CAA 1 TCAC , GG 1 GGAAGGCCCGGAA
CGCMCACCACCICAA
95 GGAGCCCACAACAATAGTGGGCTC.GGGGACAGCTAT 1 GAGTTCCTCTAGGGATGGCCGATGCCGCCTGTAGC
TCOGGGACAGCTATGCG
TGCTGUACTCTCTACAGAC
IGGAGTTGCTACGCGGC
CCGAGAGGGAACTGTGTG
7 . I GIGGA A CTGTGGCO TGACCGIG GA
GGGGIGGAAAGCAAGGT AAGGCCAAAATACCTGGG
GIGTAATIGGCCAACAGT
9 AAR; AACCA GC
96 GGCGGCGTTACTGACIGAATCACCCCACIGTC.AAGCC TGCTTCGC.IGGTTCCGACITGTGGCGCCAGAAAGG
O MG GC A TCATGGGGITGGACCG I
CCCCACTGICAAGCCTITG
96 AGC¨ATCIGGCAGTGCCGAGAACTGGGGITCCTAGCA CGTGCTGGIOGMATGCACTGCCAACACTAGCATG
GCACAGGAGICTIGGITC
1 Ci1Cf 6 ACT G "f GGGGITCCIAGCAGTGTG
CTGGIGCGTGTTCTACCAA
TGAGCACGCTGGACAI
3 . C:FGGCTICACAGCGACCIGGCAGCGGCAGAGCGGTT CTCFGA 6 CAGCGGCAGAGCGGTT .
CCATIGCCAAIGGGGGCTACCITMCACGACCCAAA CCATGGGCACTTGIGTGA TCATGGCACTAACGGGAG
4 AGG a-A: AT G
96 GACOCGATTGCCACGAIGCAA tClIGICITGACGAGG CTCCC 1 AGGGAGGCTCAGACCAAMATAGCCGGIT
S GCC TTACCCG
GGTCGTCTGGCTCTCCGA ICITGIGTGACGAGGGCC
96 CA 1 GCCCAGAAC:ACTAACGCCGOGCCATGACGTGGT GIGCMG61 TGCTGGCCAACiGG1CGCACTIGGCA AA A TCCCAGAAAGTACCTCC GISCCA. IGACGT
GGTGATCS
CTGGC1TGTGTCGGCACITGACCACATCACCCiCiC1GG TTATGGACACAGGTTOC.0 TTGCAAGATACGCCACTGG
GGCAGGTACCGIGCGCMITACCACGTGACACCAG
TGAGGATGC.AACGCAGGG
9 Al GGT GGTGTCCAGCGGCAGA
TGGGCCCCCAGCTGAT
O MG GC A TCATGGGGITGGACCG I
CCCCACTGICAAGCCTITG
96 AGC¨ATCIGGCAGTGCCGAGAACTGGGGITCCTAGCA CGTGCTGGIOGMATGCACTGCCAACACTAGCATG
GCACAGGAGICTIGGITC
1 Ci1Cf 6 ACT G "f GGGGITCCIAGCAGTGTG
CTGGIGCGTGTTCTACCAA
TGAGCACGCTGGACAI
3 . C:FGGCTICACAGCGACCIGGCAGCGGCAGAGCGGTT CTCFGA 6 CAGCGGCAGAGCGGTT .
CCATIGCCAAIGGGGGCTACCITMCACGACCCAAA CCATGGGCACTTGIGTGA TCATGGCACTAACGGGAG
4 AGG a-A: AT G
96 GACOCGATTGCCACGAIGCAA tClIGICITGACGAGG CTCCC 1 AGGGAGGCTCAGACCAAMATAGCCGGIT
S GCC TTACCCG
GGTCGTCTGGCTCTCCGA ICITGIGTGACGAGGGCC
96 CA 1 GCCCAGAAC:ACTAACGCCGOGCCATGACGTGGT GIGCMG61 TGCTGGCCAACiGG1CGCACTIGGCA AA A TCCCAGAAAGTACCTCC GISCCA. IGACGT
GGTGATCS
CTGGC1TGTGTCGGCACITGACCACATCACCCiCiC1GG TTATGGACACAGGTTOC.0 TTGCAAGATACGCCACTGG
GGCAGGTACCGIGCGCMITACCACGTGACACCAG
TGAGGATGC.AACGCAGGG
9 Al GGT GGTGTCCAGCGGCAGA
TGGGCCCCCAGCTGAT
97 TGTGAAGCCGGTGGCGC1AAGCGTAGCTACC:AGMA
O
TCACAGGGGTAGCGCTCC.CCGGCGAGGAGTCCCAA TGGG CAACCCMT61TCITACCAG
OGGCGAGGAGTCCCAA
97 CCACCCCGAGAGGIGGATIOCITITC:ACCAAIGGTCG CGGCCIC1ITCCTGCTGMAAGGAGCAGAGCAGCC
1 CTCG G CCAGAOCiTGGGAGGCCAT
MCACCAATGGICGCTGG
ITGACCTCCTITCTGGCAA
2 CAAG . GTCG TAACGCGGCCTCTCTCAC G
97 AGAGGGATITTGAGGCGCGGGCCTTGOTCGCTAC.CT GATMIGGTGCAGCAMGGATGGATCCACAAGTACA
GTACTGTGGCACCTCCIT
GCCTTGGTC.GCTACCTGT
CCAGCTTGGTTGCGCAG
97 CATCCTCCAGAGCCTGCAAGGTTGAGGCTATAGGGT CTGCTGTGICCCTACTGCCCTCTC.GGCAAGAACAGT
ITGAGGCTATAGGGITTGC
S CTGCA CGT GCTGAGCTTCAGTGTGCG
A
ACICTGGIOTGCAGGAGAGAAGACC.CTC.CCACAACC TATGACC.C.AGAAGACCAC ATCCGTGATGAGGTGAGG
6 AGGG TAAGGG Cl G
CCAAACCAGCGTGTAAAA
97 GCC.GAGCAACACTCAGCTGTAITTAAGGCTGCTITGG CTTGGCTGAATG
TGTTGCTGGGGGCITCCGCATATC CATO ATGAGAACGACGG TTAAGGCTGCTTTGGCTGA
8 CfGAC CGTAC CA C
GAGITGGIGATGTGCCAGGFC.CGCGAGGCAAGAGC GCTCfCGGGACCATCTTG
ACCCAGTGGTGAGATACO
O
TCACAGGGGTAGCGCTCC.CCGGCGAGGAGTCCCAA TGGG CAACCCMT61TCITACCAG
OGGCGAGGAGTCCCAA
97 CCACCCCGAGAGGIGGATIOCITITC:ACCAAIGGTCG CGGCCIC1ITCCTGCTGMAAGGAGCAGAGCAGCC
1 CTCG G CCAGAOCiTGGGAGGCCAT
MCACCAATGGICGCTGG
ITGACCTCCTITCTGGCAA
2 CAAG . GTCG TAACGCGGCCTCTCTCAC G
97 AGAGGGATITTGAGGCGCGGGCCTTGOTCGCTAC.CT GATMIGGTGCAGCAMGGATGGATCCACAAGTACA
GTACTGTGGCACCTCCIT
GCCTTGGTC.GCTACCTGT
CCAGCTTGGTTGCGCAG
97 CATCCTCCAGAGCCTGCAAGGTTGAGGCTATAGGGT CTGCTGTGICCCTACTGCCCTCTC.GGCAAGAACAGT
ITGAGGCTATAGGGITTGC
S CTGCA CGT GCTGAGCTTCAGTGTGCG
A
ACICTGGIOTGCAGGAGAGAAGACC.CTC.CCACAACC TATGACC.C.AGAAGACCAC ATCCGTGATGAGGTGAGG
6 AGGG TAAGGG Cl G
CCAAACCAGCGTGTAAAA
97 GCC.GAGCAACACTCAGCTGTAITTAAGGCTGCTITGG CTTGGCTGAATG
TGTTGCTGGGGGCITCCGCATATC CATO ATGAGAACGACGG TTAAGGCTGCTTTGGCTGA
8 CfGAC CGTAC CA C
GAGITGGIGATGTGCCAGGFC.CGCGAGGCAAGAGC GCTCfCGGGACCATCTTG
ACCCAGTGGTGAGATACO
98 IGCCACAGAAATTGCAAGGTATTATACCITTAAGTAI ATTIGGAA
GCTTGTGAATTCGACTAGCACGCAAACA CACCAACAGAAACTGACA "FACCETTAAGF ATTAGAGA
98 ATTC.CMCCAGGCGTTCCTEAGAAAAGTTAGATTGC AGGCAGTGTAAGCATMTATCATGAGCTCCAGAAT
TATTGATATCAGGIGICAA TAGAAAAGTTAGATTGCTG
1 TGTGG AATATCTTGCA , AMC TGG
98 TGCTCCACITC.AGTATCATCTAACAACAATFATTTGAG
GGCAATTCCCTGCSAACTATTICAAGTITTTGCA ATM; TGTACITGATGTAGAAAAT
ACAATTATTTGAGAGAGAT
2 AGAGATACAGAC C. FCCTC GATFTGG ACAGAC
98 GCAGTGTAATGTATATCTACGTCCCGTGGAGGTACC TGCGTICTAGCAAC.AAAAAAGCAACCTACTCCAAAC
GCTGAAGATGTTACTCCT
GTGGAGGTACCGGCTATT
GIGGGATAGGTGCAATGTCATITATGAACTGICCCG GTTGTCTCTGTATATGCAG CA TGATGATCTATTTGA AA
AGGCTTGCTCCCGAAGACCAAATITGGCCTGGTT GCCCAACAAACTATATTA
GGCAAGTACITITGATTIT
GATTTTTCAG MT GGTC TCAG
6 TGGTCCTC CAATAGAGCTAT ACTCA at TTTGAATTFCCAAATCCIT TATGAAAGCAGATAATACA
7 TAATACACCTC GCTGAG TTCC CtTC
ACTGTGTITGCTCCTGTCA
8 GTCGCGGGGGTGGTGTITGCAGCTCCACGCCF CCA GM' C
CAGCTCCACGCCFCCA
98 CTCCCCTTCTGCAGGTGGTTGTCGGGCCCTTACAAGC GTGGAAGGTC.ACCCAGGCGGTGTGGTGCAGGAGG
TCGGGCCCTTAC-AAGCA
GCTTGTGAATTCGACTAGCACGCAAACA CACCAACAGAAACTGACA "FACCETTAAGF ATTAGAGA
98 ATTC.CMCCAGGCGTTCCTEAGAAAAGTTAGATTGC AGGCAGTGTAAGCATMTATCATGAGCTCCAGAAT
TATTGATATCAGGIGICAA TAGAAAAGTTAGATTGCTG
1 TGTGG AATATCTTGCA , AMC TGG
98 TGCTCCACITC.AGTATCATCTAACAACAATFATTTGAG
GGCAATTCCCTGCSAACTATTICAAGTITTTGCA ATM; TGTACITGATGTAGAAAAT
ACAATTATTTGAGAGAGAT
2 AGAGATACAGAC C. FCCTC GATFTGG ACAGAC
98 GCAGTGTAATGTATATCTACGTCCCGTGGAGGTACC TGCGTICTAGCAAC.AAAAAAGCAACCTACTCCAAAC
GCTGAAGATGTTACTCCT
GTGGAGGTACCGGCTATT
GIGGGATAGGTGCAATGTCATITATGAACTGICCCG GTTGTCTCTGTATATGCAG CA TGATGATCTATTTGA AA
AGGCTTGCTCCCGAAGACCAAATITGGCCTGGTT GCCCAACAAACTATATTA
GGCAAGTACITITGATTIT
GATTTTTCAG MT GGTC TCAG
6 TGGTCCTC CAATAGAGCTAT ACTCA at TTTGAATTFCCAAATCCIT TATGAAAGCAGATAATACA
7 TAATACACCTC GCTGAG TTCC CtTC
ACTGTGTITGCTCCTGTCA
8 GTCGCGGGGGTGGTGTITGCAGCTCCACGCCF CCA GM' C
CAGCTCCACGCCFCCA
98 CTCCCCTTCTGCAGGTGGTTGTCGGGCCCTTACAAGC GTGGAAGGTC.ACCCAGGCGGTGTGGTGCAGGAGG
TCGGGCCCTTAC-AAGCA
99 AGCTTCTTCC.AGTAATCTTCCAAGGAGGGICACTICG 1 GCCACCCC.CCAGTAATCATITTTAGCTCTGTFGCGG
GAGGGICACTICGATCAG
t ATGCTGCMIGCTACTAACATACTCGCCATAGCAA CATTAGCTTITCTUXIATA AGGAGAGATTTTGATGAA
1 CAAGCCi 1 ATACA AAA F GT , CfCAA GCG
CCACTAGCACAGGTTCCAGTGCTGACCCGTGTTGGA AGTTAACCCCGTGGAACC TCTAACTGAGTCCACAGGC
2 AGGCG ' GGAAT T G
CAATCCATTGTITCTAACT TTCGTTTTGCATTTGATAAT
GGITTGAAACTACAAGAA CATACTATAC.ACAAGACAC
99 GGATC.AAAAGATGTATCCTGTCTGTAAAGATACAGA CCAAATGMATTGTAGGATGC.ACATCATTTTCTGCA
AAAGATACAGAAAACAGT
5 . AAACAGTAATGC CATGGAAC TACGGAGTACIGGICACC AATGC
TCCAGIGATGCTCAATTGITTAACCCATFATFATGA CAAAAATCAGTMACATT
6 TITACA. TICCAGG CX:FTGGGCT GATGACAFTCCACGTGCA.
CCAGG
CAAACGTAAAAATTGAGGTCTGACCITCCTIGAATA MAGATCAATATTCCTTA
7 CAACA CfCiTGIAAAAGT GCiC.AGA
CCAAGCTGCiGTTACAACA
TCGAACAATCAAGCAGTGCCACTATCTCTGTTATGT GGCTOGTTICTATAACTA GACAATATGTTT
ACTCATG
8 ACTCATGCAGA TCCT17C rrGc CAGA
ATCGTGCGGICTIGGTAAATGTITGITGTTCAAAGT CTTICTCCAAGGACAACTC
9 AC:AACTCA 1 GGITCC GGICAAAACCGCAATGTC A
FTCTGAG
ATAGCATGGICCAGCTrA
10 ATACCGGGGGTAAC:AGGAGCCACATIGGTC:CACTGT 1 CCAGA AGITAGATG TOTE
GC.AGAACAT !TA 1ATAT AT TCAT TAGAGAAGGGAA
01 . CA 1 AGCACGGGICTA AATTGTC
CACATTGGICCACTGTCA .
10 AGCGCGAATCITICHCiAlTGTGGIGGGCCITTGACA CIGGTIATGAGACITICAGGOTCGTGACTIGCCFAT
TAATAACGAGAGAGGOG
GGTGGGCCTITGACAATG
10 GGTGITTIATCGATIGITCCGGIATCiCITCGAGCACT AAGAACTGACICCCGGAACCAGITCGT1IGTI6C11 GCITCGAGC:ACTAATAAAA
10 GICX:ACCITICACCATC:ATCTGAGAGATAAAGAAGA (IAA
ITCAAAATCiCACAAGCSIGTGGF TCAGTMCIT CCAAAAGAAACAAACCCC AGAGATAAACiAAGAGCGT
04 GCGTCTAC GICATCACITTG AAC C.TAC
10 TGGGATTGACTTTGITTTTGTCTGATGATTCAATCiCIC
TGAGATTCCAATTAAGCAGACCATCTCATAATC.CTCF GTGATGCAGATGTCAAAG TGATTCAATGGCAAAGAAA
OS AAAGAAAACC GCTGTGTC GA ACC
10 CCGAAAGTGG TGCATAGT 'ICI TAAACCTITAAGT n TCTCTCCTATCATFGTF
GGCA.ACITATAGTAACCGGI CACiGTGACi TACAAGT TC.A AAACCFT1'AAGIT11 GGCG
06 TGGCXiC. GCTGGIT AAC C
10 CTACAGCAA 1 TICTCCAA TGACAACAAT ;GATT errea ACMGTCGATITGAAAATCiAGAGGTGGIAGAAC I AAACGM GGACT1TA TAG AAT TG ATTU TOTTA
ma TCAGTACC1T I GGAGCAAGTAC:CTTAT AA
CTIGTGGAGC.AGGTGTAC
TGAAACGGAGGATCATAACTCFGCGCFGAGGAGAG 1 AGATGA ACT AT Tr GAAG ACTGATGGA
TCTGATG ref AGC.ATTGCCTACAGATATGCCAATGCFTCAGAAACG TTACCAG ATTGGA TAA AA
10 CTGCTA . CAGC AGACAGA
GTCATGAGTCAGCTGCTA
10 CACCCAGGATAGGTTACATC.ATCTAACAGATGTAGTA CGAGGCGCATTAGATGGTA
ATCTTC.AATTGA GTAG CATGGCAAACITTATITCTA ACAGATGTAGTACATTTTG
11 CATTTTGGTT GTGCTCTA TGATGA Gil TACTAATAAAACCTGGACA
AGCACCTCACAAGACACC CTCCOGATCTCTICAAGGG
TCTCCAGAAAATGGGGTTGCAATTTAATAGTGTCA T ATTGTAAC.ACTCAC:TGAAA
ACCAGATACAATAGTGACA
14 AGTGAC.AACT GTGCAAACA CAG ACT
TCCTTTTGCAACAGTGGAA TAACATTTGGTGAGGAAAT
AGGAAATACCA AAACGTCTA A ACCA
IGCGTITCAGTACACTAGGATGTATTACCTCTAATAC TTICAGATGTCfTGGACAAAGTCAGCACCITTGCAA
TCTAGTAATTCTGATATTC TACCTCTAATACCTGGAAG
16 CFGGAAGATCC CAGGCTT Cf TCAG ATCC
10 TO IT IGGATC1GGAAATTGTAGTCTTTACC.GATGAGT
TFACCGATGAGICTAAAGT
17 CTAAAGTAGC AC.FAATC.GC TGGATG AGC
"TAATAAGTAIGTGGATGCT
18 TGAGA GTGGT TAGGAGCAAC.AGGCCATC GAGA
10 AGGCC.AAATACTGATTICATTCGTATGGAAATICAAA AAATGGGCAAGGATATTTATGGTGATCTTGCATAAA
TGGTGATATGTGTGAAAT TGGAAATGCAAACTITGAG
19 CTTTGAGC GCTGTTCCTT , AGGATT C
CTCC ATTTCGCGTATT TACAC
AGTGGCTC.ACTIGTCTCC
AACATGCCCTCAAACTITGGTTITTGTGCTACGTTTG CCATACGAACAGTATACA
TAAACTTGAGTGAAAAGTT
21 AAAAGTTAACTGG ' CAGT 11TTGG AACTGG
10 GGGAATGIGGTGCTTATGGCAGATGTC.AATCCIIACT GGAGATCCTCCATACAGCCATTGIGTTCTGTTGACT
CATGTCAATCCGACTCTAC
ACAATGGAGGCCTIGGGTT TAATGAGCCAAGTGGATA
ACAAACAGACTGTGTCCTG
ATC.AGGAAGGCTAATAGA AGGATGTGATGGAATCAA
10 ATTACCCCC.AACCGGAAGTCCGTTGAAACTTTAGCTA CCAAACTGGCAAATGTTGTGAGTGAAAGAt=AGCTC
CAAATTAGAGGGTTCGTG CGTFGAAACTTTAGCTAGA
10 TCTCTTACTCTCGAACATGTATCCTCATCGCACCAATA TGAAGCTCCGAACACAAATACCCATTGAAATACTfC
TGGTTCAGAAACATCCTG CATCGCACCAATAATGTTC
ACATGCTAAGTACGGTTTT AGTCTCGGTACTGAATCTT
CCC.AGATGGCTCTCCAATTGTTCCTCTATGGCACCTA TGGAGTGICIGGAATAAA
28 AATAAACG 1 TATGT CATGGAGOTCCCAGT11 a;
i 29 C:AGGACT A 1 AAGTCTFC , AC A
CCCTAGTAGTTCATATAGGAGACCGATTCTGGCATC GAACCGCTCTATTCTAAAC
TAGCCAAAGGGGAATTCTT
CTFG ' AATCCGG AC G
AGTTGCCAAC.AATTATTAG
AGACGTGACAACCGACTA
CCTGATATTGTITTAGAAG
10 ATGCACCGCTGAAAGTCTCGAAGT1TTATGCCACCCC AACACCAGAAAGACAATCCAAAAGGT1TTC.AGCT1C
GACCTGGTTACAGTAAAA
33 , G ATCITCAATCC CGA
AAGITTFATGCCACCCCG
ACTGGTGATTCAGTATACT
34 GCCACT G AAG rr AAAC TT CAACCT
GATGCCACIG
10 GCAATTTAGCATCACCCCTG TATCGTFGTAGCTCTGC CCM TAAAAA ICC!
GAACAAAGGCAATGTCCCATA CAAAAATATE.CAGATCPCT
ATGGA GAGTACGTAG ATGAGT GITGTACCICTGCA FGGA
10 TCCAATTCCCAAATTGCCAAAGTATAATTTGAAGGCA CAGGAAGAGG.AACTGGGGGTTGAGTIGGCCTTCCA
AATFTGAAGGCAACACAAT
AAGGC
10 AAAC:OTICCAAAGGIATC1CTTCAT MT TAATG TAT IA TCAAACCT iTGACAT
11TAGAGCCAACAAACCTAI TA TITGAAACACAGAITAATC TG 1TAATCi TAT TAG 1TGAC
TACTCTTATTGTAC AGATTITTIGAGICAACCI 'WI TCAGTT !GAM ATACA
39 , AGAAATCCC CTTCTGAG TCC AATCCC
.
TG CCAG CC.ACCATCT ACC. AAGTACTCCFGTTGCC.AG
10 CATGACC:AGTAGATCCAACACCTAITGGAAACTAG I A
GITGGAGACAC:AGAAA.ATCCTAGCFGTTIGTTIAGG CTCAGATACAGAACGTIT TIGGAAAC TACT
AGGIATA
41 GGTATAGAAGT ATC.TATAGACAAGT GGT GAAGT
10 AGGAAAGITIGCATTACCAAA ICCIAAGTG TCCICCA AACTACAACAGCiACAGAGCACiG
ICAGOCCATAAGC GACAAAAACAGCTTAAAT AAGTG TCCTCCAATACAAC
10 CGTTACAAAGCAGATTAAC.AGGCAAAGGTTGATATT AGGATTICITCAC-AAGAGGCGGCAACTGAAACAATA ACTC.AGGATGCATGGTAA AGGTTGAT ATTAAGCiACAT
GGACTAGGAAATGAAGCTGAAGATMCATIGCTAT 1AT1TTAGCCCIGCGAACi AGATG TAGATTTGAG
MCC;
CGTCGACGCCTAGAAGACGACG1TGlICICK:CG11 GACGAAGACCCAGATCATC
as TCC GGGC AACCGAAATCGCACCACA C
GCAAACTAAAATACAGGCGCMGCCITC:AGCAACC TCTCTCAAGACTTGGACG
CTACAAGAAGAGGCTCGG
PGGTGCCAATAGACGCCCGCACTACIATITCAG
GGAGGTGGTGGAAGAGTC
10 TATAGGGGC.TGCTGGITGC.ACCCGCTGITTCTAC.AAC
CCTAGACGGGTAGCGACTAGCTGTAGTGGCCACA A AlTATAGATGTAACTGGTC
48 T . ATGT CTAGA
CCCGCTGTTTCTACAACT
10 GCTCTACTAAAAGCTCTIGGTGTAGGAATTTAATATT TGTACAATAGAAGAGTCAGAC.AGGTAATITA
ACTG GGAC.ACTTITAATGAGCC AGGAATTTAATN/TGAAGA
10 C.A7GTGAGTTACTGAAATCCTCTGTGAAAACAGTG AT
TCGTCCTTAATACTGGCTCfCCACCTITAATGCTGCT CAGAAAGTACATTTGTAG
SO GCACCTT CCAC ATGCA
GAAAACAGTGATGCACCTT
10 GGCAGC.CAATATGACATCTGTAAAGTATTACATCCCA TGCTTCFGGTAAGGIGTATMCCAATC.ATCTGTACT
GTAGGACCAAGTATTTTA GTATTACATCCCAGTCATA
10 ACCCTATTGCAACACGTTTACCAGACGGTITG1TTAT ACTICTAAGTCTfC.CAAACGCAAAGCAAGTAGAACA
52 CAAATTGG AAGTTCAC.AT TTAGGC TTGG
AATAAAGGICTGATGAAT TTITCCAATCTTCTCATCTG
GCGACAAAAGGTACAAGGTAGTCTACTACTCCTGTA TTCAGGATCITGCATCGITTGAACCATAACAAAATG
ATGGACTTGCAGCAATCA CTACTCCTGTAACTATTACC
54 ACTA T1ACCCiA GICCMCCA C GA
ACCiAAGICTIGTCACCAGA
10 CCACIGTTCTCAAACAATCGICTITCiCAGACATAGAll CAGTGCAGAAAAIGAAGMAAGCATTGITATTITC
GCAGACATAGAMGAGTC
10 TGTAGCATTTCCAGTTGTACACTTATATTCAAATTATG GCATAATGCC.AGAGTGGATAACAATGACTCACTAG
TGITCAATACTAACTGICA TATTCAAATTATGIOTGAC
AATACTGGAAAATC.CTACT GTATTCACTAATACGTMC
10 GCTTCTTGCACAGCATCGAACTC.ACCAGAAGAAGAG
AAATGCTGCAAAAGACATTGATACAAC.ATAATAAC.A CICACCAGAAGAAGAGCA
TGTATTATATAGATGACTA TTGATGAAAAAGCACAAAC
10 CAA TTGICITTGTGCAGTGTOTTAATGGGTAGATTA ..
GACACTGTCACACTACCAMAGGAATTAACCCMT ATTTGAACRZFEATCTAC ATGGGTAGATTATGATAGT
TGGTACGACCTACAATAC
TGGATGCMGGGACCTAC
1TAACATCTAATACACGCA ATC.ACAATGCCAACTATTC
64 ACTATTCCT A.ATATTAGAAGT CTC CT
CTGTTGGTCATCCATATTT
ACAAGGITGTGGTGCCTA
10 TCCCCATAGGATTCaTTGICATTTGIGTACCTTTGGA 1 GIGTACCITTGGACATAGT
66 CA TAGI GG reic:r CAGG OG
TGGAAGAAATAGAGMG
67 CTGATG I ACAGG ACC!
*FTCCICIGGAACCTGATCi TGITTGACTCATGGGGTCAGTCGATGGACAGTCTAA TACCTCAGTGTTACATGTT
ACATGITITGAGCCICTOC
AAGTACAGACAAACTTTG
ACAGTCITTGGCTGCAAC
CCATCTCGGGGGTCTCTAG
71 CAC:GCAATGGACCCCiAGGGATC.ICICTGGGCCACACA AGGITC CCCIGGCCIGGIGTIT ..
ATGCTCTGGGGCACACA
73 T AC4.3 GAGCi IGOIGCGGGACA
GCCACGTACGCOCIGT
GACTCGACAAACTCGCTG
TG111.3GTGTTGACGAC6G
10 GCGAGTAGTTGGTGATGCGGCGCCCGGCCTGACGA ..
GTAAGTGACGTCGTTGCG
10 CAC:OCC TACCTGAAGGTGACCCCGTAAGCACGGGGA GAGCTIGGCCATCi TACGCGCAACCUCCOCCiGCIGG AACICCTIGACCGACACG
CGTAAGCACGGGGAGGGT
77 AGACGGCGTCCCTGGAGTCTC.TGGCGGACOGAACA GTCGGCGAGCCCACACACAACCACACCC.CCCAGTAC
CGTGAGCACCTIT.TCGC TaGGCGGACCiGAACA
78 CCCA AGAACTGCCCC.COAAGOCACGAGCCACGCCAGAAC
CGITCTGCTGCACCGG ITCTCCCTG1TCAACC.CCA
10 .1 CG ICGTCGAACGGT1 TI ACG ICAGTIGCCACTGGG I ..
GTICGTCCTGCCCCACTGGACAGGGCCGAGAGAAC
79 C.CT C GITCGC.TGCCT1TTCCTCC
CAGTTGCCACTGGGTCCT
80 A GICiGTCCGTCTCGCTAAC.GGACICCGTCAAGCGCGT
CCGGGAAACGTCTOCG TGCITITGGCCATCTGCA
TACGTTACAGTGGa:CAG
TCGAACACCAGCTGCATG
GTGGICACGTC.CCCGA
ACACTCGTGTCCGCAGAG
10 GGACG AAACCAGGTGGGCCAGCAGAT caccTrccr .. TGCACIT TGCGTG I CTGGIGTGA T
ACGTCCCGCIT:A
84 6116 GGA CGTCTGCTCTTCiGTEGCT
GCAGATCACCGTCCTGTTG
as CTCGOCGAGCTATGGGCCTCGITGMCGGCACCAGC ATC CGTCGTCGGTGATGAGGA
CGTTGMCGGCAGCAGC
10 GA ATCIGGAGCTCGGGTCC.ACAACCITCCGTCCCCIG
87 GCCCGTTAACCCCCCACGTGC.CCGGGGCTITTCGT C TGAGGCGTCAGAAAGTGC
GCCCGGGGCTTTTCGT
10 ATCGCCTTGIGICITGTGC.IGGCCCAAGATTCGGCGC
ACACAGGCGGGAC.ACC
10 TCAACCCCGCCCTACACTACACC.AGACCCCCCGAAGC GGCGAACACGGGGCTGCATTCCCCCTCGCACATCCT
GTTGGACGTCACCGTATC
CAGACCCCCCGAAGCT
TCGCTGCTTCCTCGAGT
GCTGTTCGGIGGITGGG 'FCGGGGG1TTCCTCMG
10 C1TCCTAACOCAGACCC.CGGGGGGCGCGTCAGATAC GGCCACCTGACACAGAGGCGAGCGGCTCAAGATCT
GGGCGCGTCAGATACAGA
94 CCCACCCCCGAACCATGAACCCGTGGC.CGAGATCGT TGTGGGTGTGTAGGCGATGCTACGCGCGCCAAACC C
CCGTGGCCGAGATCGT
10 TC.AGAACGGGCCGGICGTCGGCCGATTCCITCATGC AAGTCTGCGGGGGAGCGGTGAGGCCGCSGTTGGT
GGCCGATTCCTTC.ATGCA
10 ATGGTCGCC.GTCATTATGGCCGCG/VaCiTGTMGG
ATAACCTCACCGAAACCG
TCCOCCAACACTGACGT
97 GC GG GAGCC.AGGGCCAAGGT
GTCTCCTAGTTGGCCCGC
CGGGGTTTCTGGGGCT
99 GCGCCTf3ATOGTGGAGAOGGCTGTACGTCGCTGGOG TTA CCGGGGGGCGCTTAAA Cf GTACGTCGCTGGCG
TGGAGCTGGCCCAGGA
GTCACGACGTACGAGACC
GlICACGAACGOCGCG
TGCATCGGCAACAACAAA
11 CTCGTGCGCTITCTGGAGCTAGCTC.CGGAAACTTGGT ACAGGGTGTTGCAATACGACCCATGCAAACAGCCT
AGCTCC.GGANICTTGGTAC
GGIGGGCGGCAGCATT
Os TAAGCfCCATCGCCIGGCGGACCCACGCCCACATCC 1 AC TGTCGGTGT7CCCCCAT
ACCCACGCCCACATCC
GGCGTTGTAGTGTGCCC
GC.AGACCGCGCCGM
GCAAGCAGCCCATAAACG
08 TCGACCGCCTGGCCAAACGCGAATCGCGGCCAGCA flG C
IXAATCGCGGCCAGCA
CGTGITGATGGCAGGGGT ACGAAGCCATACGCGC
11 GGAGGGGGAAGGAACGAAACACICTfC1GCGTGCCC CCCCGCGICAGACAAACCCTGAGTCTTCGGACCICG
TCITCTGCGTGMCGT
GACGICTGGGAACACAGG
11 ACAGGC: 6 TGCCCATTTGACGCTCIG C
ACCATCACGGACITICCCC
12 crcr TGGC CGCAC.TATC:CAGGACCGC T
CTICITGGCCTTGTGITCC
13 GAGGCCAACCIAGCMIAGGCTGMCfCCAIGGCAGA CC C
fGCGCTCCATGGCAGA
CGGCTCAGCTGGTGGGAGTCACCTTCGG teGGGGC GIGTACACCICCAGGGGG CAAACICGTGA
SCCTCCAG
IS ACCCCAACGCCATCCiCCTCACGTCGCCGAGCATCC TCAGCTTGCGGGCCTCGTTOCC.ATCGC.GTGGTGC
CACGTGGAGACGGCCATC ACGTCGCX.C.ACiCATCC
CIGAGAAGGGGCTGG TAC
16 ATI (3 CGATCAGAAAGCCM:C.ATT
'IGGATATGGCGTCGGAAG
11 GARIGCCGMTGCAACTMAAMXiCCi IGGAC:ATCC
18 TTCCC.CAACGGCAAGCMGCCGTCCAGAACCACT6 I GCTGGITGCGTTGGAGG
GCMTCCAGAACZACTG
CGACCGTCAGCGITTTG6 CCGACAGAAACC.CG1TGT
AGTACGTGGACCAGGCGGTGCCTCGCGGITGGTGA GCC.ICGCGGTTGGTG A
11 ACAAGGIAAMATAGGC(iGGGCRIAACAGCT(iCAAC CCGAGCCTCCCAG(i TGCAGAACCGAGGGCITCAAG GACACGGCTAAAATCCGG
TGAACAGCTGCAACGGG
22 CACCATCAAGGTCMCC.CCGTTCGGACGGACACOCG GCG GCCAGAAMTCGATGCC
TTCGGACGGACACMG
11 CGGTAAAACAGAGCGGGC.iCGITCGAGGCGGAGGIG CGTCIACGICCOAAGCGGGATGGCCGGGCAGAAGI
TTCGAGGCGGAGGIGG
11 CGCGAGCGGATCTGCTTTCGAGAGCCTCCTCAGCATC GATTCCCCAGAGCAGCCCCaTTGATGGCCIGCCTG
24 C Cr CAAGGCTCACGTGCGAG
AGAGCCTCCTCAGCATCC
ACTC.C.ATCTITGTGC:TGTG
GC cr TCATCTACGGGGACACGG C
11 GGTCCGGGIAAAACAACAGCCGAGLeifemCGTC
AGCATCCGGTMATGAGC
26 CCACTTACGGGGGCCAC_ATGTAGTGCAGGTGGGCGG CACACG C
TAGTGCAGGTGGGCGG
GGGCGGCACACCTATCA
CMGGCGACCTGGACA
TAATGTCGCGGATGCTGC
CCCTACIGGGGCCAATGGT
11 TC.GTGGICACC.GGIGCTCGGCATGCACGATACCGAC GATOCCCCCCGCGTTCCATGCAGGGCACATATGATC
AGAAAGGACAGCGACGA
GCATGCACGATACCGACC
11 GGTAGGC:CGCGCTACACGICCTACGITCTGGCCCIG
GGAGATAGCCCAGC.CCA
I
11 CGCTICITGGCCCTGGTGAGTICTATGCGC:IGGAGGT
GAGGCCCTCTTGC.ACGAACGGAACCTIACC.ACCCCG
33 GC GC , AGTGAGGGTCGCGTCG , TCTATGCGCTGGAGGTGC , 11 ACCCCACACICCAAACGCGGTGTGTATACGGACGCGC CCCAAATGGCCCTTTA &AC
TGIGTATACGGACGCGCX
35 T ' AGC AGGAACACACCCCCGTG
ACTCAGGACATCGGTGTGT
36 TAGCCCGATGCCCCCGTTGACAAGGCGACCCTGCG ... T CCCCAGGCCACCACAA
ACAAGGCGACCCTGCG
TGTTCAAAGACGCGGTGA
11 ATCGCCGAC.AGGTITCTGGAGTCCTIGTAGAACGCG ATCAGGGGCCGTGATATGCCGAGGACATCCGCGAC
CCAGAATTTGGCCAGGAC
TAGAACGCGGGI
AGAGGIGGGICTGGAGTC
CGCTTCTGGTTATGGGCG
40 C.CAT AGAG A
CCGGGGCATCCTTATCCAT
41 TGGGCGTGGCACTATCGGCTGACGAGGCMCAGCT GIG GAACCC.GACGTTCAGT
TGACGAGGCCGCAGCT
11 TTCCGGAA1TTATACCCGGGCCGGTCiTGTGATGATI7 1 GGGGACACGGGC:1ACCCTCATGTGCG11CGATGCG GGTGTGTGATGATITCGC.0 TCTGCGT
43 A 1 GIG , AGCCCCAC:GCGGIGAT
GCIAGGGTCAGCCGTTCA
44 TGGGCACGTACACCCCCCTCGTACAGGGGCTGGGT ' G ACTIGGCGGGGG TGGT
CGTACAGOGGCTGGGT
45 1: GA GCCATCGCCACGTCCT
CGCCGTCTAAGTGGAGCT
GAAAACCCCCAAACGCGT
GGACCGGACGGACCTT
47 . A ACTCGITGGCGCGCTGAATCACCACCATCCGCGIG
CTCGAGGICGCTCCIGT CGCAGAACGCCCICGA
TGCGCGGACAATTAGGC
49 AGAACGAC1GGCGCGCCACT a: rem GGGCCGCCA. A
CAGGCGCCGCATCTTG :FCC TGATGGGCCGCCA
ACGAIGCGGGGGGTGGCCTCCACAAAATOGGG
CGACAACIATCGGACTGCG
AGCCTGICGTGTCIGCG ITAAGCACGCICCGGGC
52 ACCACACGAGCACGAGGCCTCCCTGCAGC.ACCTCIC CCGTCTTCGGTGCCAGTCCTGTTGGTGCCGGTGGG
CATGCTCGCCS7COGT TC.CCTGCAGCACCTC.IC
11 TGIF TCIGCGTCCi IGAGTC:CCGCCTGCGTAGT TCRACi AGACCCIATGGTACACAC
53 , ATCGGCGTTGGTGGAGGGC.GTGCGTCTGGTGGTC.GT AGG GG
GTGCGTCTGGTGGTCGT .
11 GGACAGCAGCGGGGAC.TTGTICCICTCCGTGGGGG T
TAIGCTAATTGACCFCGGC
54 ATICCAGGTCGTCGCGGCGTGGACCICTCC.GAC.AGC CT C
TGGACCICICCGACAGC
SS TTTCGGCCTGCCAGG TGGCCIGGCr.CCGGACATA A
CGGCCATGCACACCAGC.AGGCGCGGAC:CAGGTAA CGACCC:CCCTCACCAA CGGGCCCCGGACATAA
GACCGTAGGACI GC
56 TM. ACCGAACAGCCOTCCGCGCGCCCGACITITTGC TG CC:
CGCMCCCGACITTITGC
11 TCTGGACACCCCC.ACGGACCA TTGGCACCCIACAACA
ATAACGACAAACGGCCCCTCGTTGCTGATCCCCCGC
ATTGGCACCiGACAACAGG
11 TGGGGAGIAGGGCCCGTOCATGGAIGCGCCCCAAA TCCAAOCCAGCC.AAGTIAACGGCAAAATCCGCCGG
GGATGOGCCCCAAAGC
ICGTICAACAAAGATTGGGGAGAAGCA
59 C GGTGTC 1TC7CCC.CCCCCCCTT
CTICACa:CCAGTACCC7C
ITCGACCGAGTCTGGGGA
60 AAACCGC.CCCCC.AAGCCTAGGATGAAGCC.CCCCG GTC: C
AGGATGAAGCCCCCCCI
MICA
61 GAGG TTCTTGCGGACCACGGCCCGCGTGTATGGGC.ATGCC
GGGGGCTAAAGGGIGGI GG
11 TCCTCCGC.AAACAGGCCCGA TCGTGCGCACTAGGTC
62 CAGCCCCTTGGAGAGCACCCGGTGCAGCAGTCGGA . C GGGGCTGGGITGGTCT
CGGTGCAGCAGTCGGA
TGGCCGGACGAACGAC
GGGGAGGGGAGGGTGAT
TACGGGGGGGTAGGTCA
11 CCCCGGAGACCCCCAAACCTTGACilCAGGCGCTCG
GCCGTCCCGGGTGITT
CCCCCCGGTATACGACGA
TCGGAGGGGTGTGICTIF
TCTGGTCCTCCCCAAGTAC
GC.T.GGTCTGGTGATCTTC
69 TCGC.ACGGGCOCCTITTGGACTGCCGTCCACAACGC AMAC.TC Ci ACTGCCGTCCACAACGC
70 C T CCAACTCCAGACC.ACCGG
TCGTCCCTC.GCATGAAGC
71 CGCGCATGCTTCATGGGTCCCGGGGCGGTCATTGGA TG , f. r i i r e GGGGTGTGGCGG
, CGGGGCGGTCATTGGA , 11 GGGTGEICGC3CAAGAAC.AGCGCGCAGTCTGEICATCT 1 GGGACCTGCGGCCAACACACTGGGGTGAGGGGAC
GCGCAGTCIGGCATCTG
11 GIGTTGTTGGGTGOCCTa:GCCCCCCAAACCATGICC C.GGGCTAACCAGGAAATCCGTGTCACACGGCCGGG
TGGTATAAATCACCGGTG
CCCCCCAAACCATGTCCG
GAGGTCCCCCACAAAGC
CCAGCCTGGTTGTCCGT
11 C.C.CCAGCCTGT1TGTCCTGGGAGCCGTTGTACGCCA
GCAACGCGGGACTATGC
GCGGCCGTGGTFAACC
TCAGCGCGATCCGACA
11 GCTCCGCTAAAAGACCGC.ATCGGTGATGGGGGGGA
GATCGCCIGTCFCCTCGT
11 TGITCCCAATTTGTAACATCAAGCTATCAGAAGATAA TTCAATTCAGAC.AGGGAATCAACACTGATITACCCA
AAAGCAGGAGATTAAAAT ATCAGAAGATAATAACCAT
CTCCCTMCCATATAACTC
80 CT 1 11611t A
AGGITIGAGICTGTTGCT
GCTGGCGAAATCACATGTGTCCAAATITTGATTGAA GCAGGCCTCATATAAGAT GAAAAAGGGAAAGTAGIT
83 1 AGTTAAATCAG 1 AGATACCX: , CT T AAATCAG
82 TTFAAATACGG ' CFGTC CFAACGGGGCATAIGGAG ACGG
CGGACCTGCATGACTACT
84 ATFIAAACCCT MC II CCMITACCGCTG1TACC a:CT
85 . CIG GICG TCTCTTFAC.GCGGACFCCC CCM
TGIGCCTICTCAT CTG
ITAGACTCTCCTGAGCATT
86 ATTG ocrGAAcr r TCAGCTCiGTATCGGGAA. G
CACACICTATGGAAGGCG GAAACAACACATAGCGCCT
87 GCCIC GCiATTG G C
GIGGACCATATGGCCATAATTAAGATCCTAAGTGAC CAGGICAATTATATICAGT AATAAAAGAACTACGGAA
88 GGAACCFCi GGGTTCTT A ICiGAA CCIG
TTACCCAAAAGATGITIT AACCT GGAGTAAAATGAGTGATG GATCAGATCGAGTGATGG
89 'MGT ITCGACITi GT CT G I
11 AGATGC TAGIGGATCTGCTGATCIAATTAFTGCGGCC A I TGGAGAIGTGCCACACiCACTGFC
TAAGAA IGICC AA TGACGATGT TCiAC:CAA
ATGACICIAGITCACAAIGGIGGGATCTICIGGITOCT TGGTGGGTTI ACA T !TAW,.
91 . CITCAA TTICTGA AGAAC
AAGCGGGTCATCACITCAA .
11 CI ITTCACTGACUCCICAGGAGi GATCGGT11 TIGAG AT ICA ICGTCGATGAIGTGGGAA
IGATCCATIGATA 'IGATCCGTITTIGAGAGIT
93 AGM: GGTATTGAC GAGGGTGGIGGITAGCAT C
11 CAACiTACG1CTCTCATITGTFGGAACC:AAGGCCATTA GGACATTIGACACCACCCAG
TGCTFTGCITEGGIGG TGAACCATITCAA TCfliA
AAGCC
11 GAAACCCTCTC-AAGACCMCGGAAAAGATGCC.GGCAC AGGTAAGGAAGACAGAAGATACGGC I II i bCAAGG CCACJAAAAGACTAACAA
GAAAAGATGCCGGCACTT
11 CACACAAAT CACOACGACAGAAATATTI TGA1GT ;AC
GAAAGCGICIGAGMAGTATGGAAGTGCAACACA GAACCICTIGCTTCCAGTT TAITTf GATGTUACTOCTGA
11 CGCGATAATATCANTCTICTCC:TCAAAGAGTTAAATG
TOGGTTCTAAATITCAACTGGIGCGCAICTTAGAA AGTAMTACTCAAACT CC AAGAGTFAAAIGTGAGGT
97 TGAGGTGTC GCATTCGC AlTGC. GTC
11 GCAACAAC:AACGACACGAATTACTAGGAGCATCCAG ACTGAAACTGaATTAGAGCATITFTGGGGCAAAG
GAGTTIGTCAC:CAGATTCA AGGAGCATC:CAGTATATAA
GTGAIGAGGTGGATCAGGGGATITGICACAATCCT ACTITGTAACIGAAGCAG GI
GGAIGATATACHATGTA
99 CAATGTA1TGATG CC(X.I AC TTGATG
00 GACAGTGG . AGTTCTTTCCA GACGC G
12 TGAGGCCITCTTITCTCCCATAATTGGACTCCCJWµTA CCCITTACCTGTAGCAGCAGTt. t I 1 t i IAATGACTGCA TCTIGAATCTAATTGAGCA TGGACTCCCAAATAAATTA
TGTAATAAACAAAGTAGA
CGACTCTTGCTGATCGATT
AGAAGCTGGAATTCCTGA CAACATTAATTGAATCAGG
CAATTTGATTTTGAAAATC
CITTAGCACTGACGTCACT
CIGACGCTGC
CAGAAIGGIAAACTGTAT
CAACIACC.AATCGCCAGAG
ATCCTCTGGCICAAACAGATGAAGCTGTGITIGTIT TCTITAACATGGATAGAG
TTAAGAGGTAITGAAATAG
12 CACCAAAGCCAATGICACACATATTCCACCTGTGCAA CTTGC.AATTTCAGAACATITCAGCAGAAAATCTGGA
TCCACCTGTGCAATTAATT
09 TTAATTAATTC TATTTACAGGTGC , ACCTCAGGCTGGAGAATG , AATTC .
ATCAATITCCACTAGGTAGACGGTGCTCTITTCTAAC TGAGGATCAATATAGATT
TGCTGC ACGTFTGT ITTGCAG
CTAAGTGTCCTACTGCTGC
12 AGTACATTC.AGAGACCATTCACAGTATTGTGATAAAA 1 TTTGTCTGAATAAATTGCTGATGCGTGMAAATATT CCTC-AAAATAAGCAATGT
ATTGTGATAAAATACTGTG
11 TACTGTGAATCCC ' TGCCAGCCC ATCCAA AATCCC
CTGACAAGCACGAATCTG
GAGGGTCGATACTGCCAA
12 GCTTAGATGCTCTCTCAGGCGAC.AGATCTCAGAGCTA TAGTGGAGGIGTTGAAGATGAAGCTGTCTTI1TGCA
AGAAAGTTAAATGTTACT ACAGATCTCAGAGCTAAGT
GAGCGACGACGACGAGA CCGAGAGAATCTGGAACC
IS AACCAC CGTGICT A AC
12 ATCCCCCAGACCc1TTTCCATTTGGAAG1TTGGTATAT TTGGATATAGACCATTAGGATCCGCGTCTAGTCACT
TGAAAATTCTACTATTGCA TTTGGAAGTTTGGTATATT
GGGCCTGATATATCTTTTA
12 TCTCTTCAAATATGITGTCCCCAATGIGATACTAATGT 1 TTAAAMC.AGTMCCGCGGGATGAGCTGGIGTTC
i ITTGAGITTGAAAATCCCG
19 AFCCC:CiC I GAGA , CCTICTCGAITGGTGCAG C
AAGTCAGMTCCTTGGGTCGAGCTCGTITACCATIT CAAATCCATGGCTACTAA
CCAGAAG ' AACAAAC ATGC
CCTACTGCTGAACCAGAAG
12 C.AGGAGTFTATAGGTCAGMCCCATGAATATTTTACC GGTCAGTCATFIGGTCAGCAGATCTCACMAIGGAC
ACAAACTGTGATCAATAA TGAATATTITACCGTGGIG
TGCAAGGGATGTTTFACA AGGAGGAGACATAGAATC
CCATACTIGGICAAGGCA
23 . T AGAACC GC;
CFGAGGCTGCAGGGACIT
CaGGICAITCGGTGTTCA TCIGTTAGGACCCTTCTCG
CCMAGACAGGTAGAAG
2$ crrci ceicrorr ATM T
ecGAGAAAcrcceacrre AGIGGCAGAAACFGGAAC ACAAGTAGACCAACAGCA
26 CA.CC: CI CCA IC CC
AGGGCITITTGGTCCCIG C:AAGAGGGTCTGACATAG
27 AC:ATAGC 1 ITCIATGICI TTACCAGCAGGACAGCTC C
12 CGAG TGTCCCTCCTFTCCACACiGGACCAAGCCC TATC: i GTGCCGAGCACCTAGAAGACAGITIGCCATGAIGT GCTGAACiCCCIAAAAGACi 28 (:.CA GTCCTGG GC
GGACCAAGCZCTATCCrA
ITACCAGTTGCCAAGATAGAGCC:GCCTIGGFCCATC AGIACTAAATAAAAGAGT ACCATAAT
GACAGGAGAT
29 , GAGATACC 1 TTTCC AAGAGCC ACC
.
AGAIGTAGGGGAIGCCIATITITCCCTGC.TAGGGTA f3AGGATGCTGATAGATTI AAATAAGG
ICACTCAAGA I.
AAGATTTCAC AATGCAGTA TAGAG AA ITCAC
12 CCC I AGGCCATTI AGAAG ITC( TIGACAGGACAGGTI TCI AC ICCAGATGAGAAGT
12 Gri Ca GGCICAAGATAAT TTIGTICAGAAGAAGF GC
32 AGTGGA CC:766767A ACTCA
C.AGAAGAAGTCiCAGTGGA
12 GCCTGGTATAGGATC.TCC.TACTAGGGAGTGGGACTT GCAGAGACCTTCTACACAGATGGCATATCCTGCTTT
AACTACTGGCAAGTGACA GAGTGGGACTTTGTATCFA
IICCIGGAAAAGATAGAGCCCGGGG1TAGlICITT TAGAAGAAAIGATAAAGA
AGIT:TATG1TGCATGGGTC
12 'MCl/ TTA TIGGCCACCTACTGGGAAGCAGAAGT CA I
GCACACAGACAATGGCC:CCAACCIACCCACCA TGCC GCAGTACA KT TC3CAAGT
GAAGCAGAAGTCATCCCAC
CCCACA AC GC. A
CCAGACTGIGCAGACATCCICITCACCTGCCGTAAA GGAAAAAGT CATCIAGAA GAACCTAACACCAGAAAA
CCATAGAAGCCTTAAACAG
38 AC.AGGG . ATACTFTGTGTAG TTCGAGTGGCTAGAGAGG GG
GGTAATACiAAACTCTGAG AGGAAGAAGCCTTAAGAC
39 TTAAGACAT 'IT GGA AT
ATATCATATGAGCGAAAG
CCA AAGC.ACTTG GGC
AGACGAAGAAGGACTCCA
12 GCTTGACACATGGITITATTGATCITTGCAATAATAC.A ACACCTCTATGIGTGGCMTGAATTGAGGITGTGGA
TGGAATAATACAGTGACA
GATAAGAMAGGAGGTAT C.ATGGTATTFAGAAGATGT
42 AGATGTGGT TAGGGCAAAGC AAGGAC.A GGT
CAAGTATTATAATCTCACA GTAAGAGACCAGGAAATA
44 Ci ACAAGAA TGTCAATG C GAA
GACTCTITATCGCCTGACTGGTGATATTTGCCIC GTOGGGGAACGAAAAAC
AAGAGTC.ACTGCTATCGAG
AGACTTAAGGCGGCCCAACTGCCTCGCGAATGCTT ACTA TCCA AGAACTCCCCG
47 CAGCCC CAATTTATAGGTC , ACAGCAGTATCAGCAGGG
AGAGTACATGAACAGCCC
GGATTGCCTG7TATTCACTA AAC.TGCITAGTACACCCA AACAAGCAGACATGATGA
48 GATGA i TATGGTATCC G TGA
AAAIGTACCCGCTTC.ITGC
GAGAGGCTGGCAGATCGA
12 TCCCAGTGC.AGATICGATCTGAAGGC.AATAATTGTAC
AACATCGTCAAACTCACCTCATGATCAC.ACCAGT CA CGTTGC.ATTITCTAATATC
CAAGGCAATAATTGIACTA
SO TACTCAT CATTGA CAC CICAT
12 GCAGGTTTGACTTCATGGAGTATTGCCTAGFCAGAC ¨ ACATCTGGATGCMCCTATAATGCCICTGAGAAGA
51 C.AAAATT3C TTAGGTAGTIGTC AC
CCTAGGCAGACCAAAATGC
ATGGGAACGGCTICITCA
CCAAATCAAACAGMGAC
MAGGCCATAGGAAATTG CCAATATGGGTGAAAACAC
AGCTCACAAATAGAGCTTGCAGCTCATCTTCACT TTAAGTGAGCTAGAAGTA
SS TCaAGCT GITTATTATCCC AAAAACC
CGGAGCAATGAATGAGCT
12 GC,ATTAAAGGTGCCAGCAGCTGCTITGAAACCAAAC 1 TGCTGCATCTITAMTGATGATGGCCTAGAAGCAGC
CTGCTGTAGAAATAGGGA
56 ACA t AGIT:CAGY A TGG TGCTT
TGAAACCAAACACA
i TATGAGACCCA CTITGTCCAGAMGCCTIA :FGGAGATCCAAACAATATG
57 C:AAT ATCTGA. 1 1 TACAACT G , AA GA U
CGGAAGCCATGGAGGTTGCTCCCAATTGTCCTCATT GACAGATGGCAACTACCA TGAGAACAGAATGGTGCT
58 CTGG ' Gccr CC GG
12 C.ACAATATCAAGTGCAGTATCCCAAGCAGCGATTCAA
TCTTGATCGTCTTTTCTTCAAATGCGAAGGCCCTCTT
GC.AGCGATTCAAGTGATC
GGCCATTITCATCATGGGAGATCTCCTCCATAACA GAGTAGACTCCCTTTGTIC
ATTTGATGAAAGTACAGA CTAACGTTTCCCAATTAAT
61 . AATTAATAGACG TCGGACTAC.A !TOM ACTACG
AAAATIACTGATGCTGAAT
GAGGAGGTTACAGGACCATATCT GGCA TAAAGCCCTTCCATATCCT TGGGATTGGATATACTATC
63 C:TATCAGG GCATCIC AAC ACiCi CTGTAAATGTTCCTAAAG TGACAGITTATAATGTATA
GCACTGGCTATAGACCTA GAGGTACAACAAGGCCTA
65 COAT I AGATGGATC:AGG TAG 1:
TCTICAACTACACATCCIGACCCITTCACCTTGATAA GGGTAATCCTACAGITAT AMA
TACTGCAGCIATAIT
12 CGTICAP.CTAMGGOTGGGAGTIGGCTGAATTIGAA Mal- T GAACACi G
TTCCTACACGAA T ATI CMACTG CCTITAACAACGITITCTCT sUGGCTGAA.TFTGAAA TAM
67 . ATAATC1 GA CArAGCG AM GGA
=
CICGTGCATCTATAACIACCAGAAGGACTCAATAGC CO GATA GTG AITTCAGA All ACATACTGCCTAC MT
12 ACATCTGAGC:GITT GCGMCATCAGATGACTA MT C
TITIGC:AGATCiACTGICTCiGATGAAG TACTCTGGC I CCTAGC:ACAGACAATACA CATCAGA
IGACTA ITT Ta GGCCAACACCTACi TAGGACATCCA rrmrcc GAACACAGICTOTAGI ACT
70 GTAGTAGTACC AGGATTTCZ Tarr ACC
12 GTGCAACC.TACTATMACACCIGATGATGAAAATCTAT
TCCTC:CATCiGGCTTGAATATTCTGCCTCiCiCTTCAAGGG TACT AAAATC.CCACTCTTTA
GATGAAAATGATCACAGAT
1l7GGGCGTAAGGAGCAGCATCCCCAACAMCCT A TAACAGAACAMMGC
GTACCCCCACCACAACi AAGACAAA I A TGAT A ITGAAATI GTAATCi ITIGCAAAGTMTCIGGA
AACiCGCAAAAGGICIAAGTAATTTCGCATAC:AGTAA rr-rac:c:r CAGAGC11TCFC
CGCCGCFTICTATAICAAA
74 ATCAAAC ACATTTC.ACAGTA A C
12 CiACAAACAGCTITCGCAACAGAAACAGCTGAGTCIG TTAACGGCC.AAATA TGAATCAGAGAAA I
AGCCCAT AAGAATTAGCATCCITTIT
AAACAGCTGAGTCTGGTT
TGGAAGATCTAGTGATGC
76 AA . CACAGTCT C
AGCCCATCTATTGTGTGAA
TTGGAAGCAATAAGAATTT
CTGGGAAATCATATATTTA
12 AGICAITAATTGCTCTTGTAGTGC.MCCCGAAGATM AATIGGGACCCACCGATTIATCCAGCATAATACTCC
CTGC.AAAAGCAITTAGAT ACCCGAAGATAATGAATCA
ACAACCGCAGACGACAAGCCCTIGTCCTCCiTCGTCG CTCCCTCTCITTACCACICIC
GTC:ICTATCCGACICCCC.T
GAAAAATTGTAATTATGT TAAACGAATAAAGCGAGC
12 GCATCTGGAGTGATTGGTAGTACTTCCCCAAGACCTr AGAAGAAGGAATTCCAGATGTIGGGIGITTC1GCTC
ACAAACACCAAATACAGG CCCCAAGACCITCTATTCC
82 CIATICCA. CTAAACCAG AAC A
TTAGAGAGGAGTITGATA
AGGGCCaTTMCGAGTA
TTITTC.TCAAACTCATITG TATAGGAGGGITCAGGGA
12 GCAAACTGGITAGGATCTGGTAATTAATAATGTCCAA TCCTGAAAGAGAACGTTTAGTTTGGTAC.CGACTCCA
12 CCCGCCGTCCATAAAAAAATAATTTATTAATTCrAcAT
ATGCCAGACATTICTITTGTAAGGAGCTC.ATCTGGA ATTAATTCTACATCTAMT
12 CCAGCTGC.CAGTTTIGTAATATTGACCMAGAGGCT TCCTCCTCAAGGATTGGAGGATACCITTAGGACACA
AGTAGAAATAACAATGC.A
CaT1AGAGGC1GATG7GT
12 GGICGGGIATCAA.ATACAGCAGMACATATCAGTC.G AACAACCGGTICIGGTCGTFAAGACACAGGCACCG
TATTGACTCATGAGTTIGT TTTACATATCAGTCGTTCA
12 CCTGTCTTCCTGATGTGTACTTCTTC.ACTCGCGAGATA
AGAAGAATC.CCGCTCTTAGAATGATTTGTCAGaGT AACiAACTAAGMATITGA
CACTCGCGAGATACTGAC
12 GGTCCATTTCTGTTCCACCACGCTGGATCAGACAGGG AGTACAGTCCATTATCCAAAGGThiliLLGTGTTITA
TaTTGGAGCAAAACAAA
GCTGGATCAGACAGGGTA
CAGATGTAGACACC.AGGA GGGAGGTAAGMATGATG
12 GTCCAAC.CATTGTGAATTCCTCATAAAGAGAGGAAG GAAGAGCAACAGCCATTCTAAGACCCTTCCGCTGAC
GAGGGTTCACTTTCAAAA AAGAGAGGAAGAAGTGCT
TGAGAC.AC1FTCAAAAGGATGCAAC.ATTGICCATG6 CCGAGGTGATTTGAATITC
i CAGGGAATGGAAAAGITGACGATTCTGGCCCGTTA
95 GTCCGA 1 ATCTCC , GIGGTCGTGAGCAFTGAC CGTTICTIAACCiGICCGA
GCCAGAAGCCAATATAGT
96 GTGG ' CGGTGG GTACC GG
CAACAAGACAACTAAGAG TTACAATACTTGGGAAGGA
AMTCCTGGTATTCGCTCT GCGATCATTCCAACAAATG
98 ATGC CCiTFCCACTG G C
12 TaTCATTCACGAATTTCCCAGGATCCGATTTAATTAT TGAGGCAAATTCTC.AGAGAATCAGGCCATTAGTTCT
GTGACCAATTCCTAGAATT CCGAMAATTATTGAGAG
99 . IGAGAGGCG TATICCACIGTA TTCAG CiCG
13 CCAGITTGITTCC.ACTCCCATATGGGATCCATCATTCC ATCAACAATC-GGGATa:AICATFCCGTAI
13 ITGTCCaCCACTATGATAGCATCCAGAGIGGAGTAC CTIGCCATTICAGAACATAGATAGOCTCCTITGCTTA
TCCACiAGTGGAGTACAGG
AGATTCCAAAAGGAAGAG
TFIGGIGCIATAGCGGGT
CAGAAGTGTGGTCATAC.A CTGAACTCTIGGIAGCAAT
03 CAME; AMGCAACCAGT ATG G
13 ACITGC:CAVCTGCCTrAACAAGGGT1TAACCGAAAA
ACTTaGCCTCATCACAGCCCiCAGATFTG TAAAT1CC GAATIATAAAAAAGTACCi GaTFAACOGAAAACGGT
13 GaTITGACATAIAGCACAGCGP.ATAITTIGGCATAC TATCGCCAAGTGAGAAAGATC:ATCGFCCACCIATIT
GACACTACGACAGATCAG ATTITGGCATACAGFAGAA
OS . AGTAGAACAA AMATGAATCTGA C CAA
.
13 TT CTG1AACGITGT ATG TCOGTITAATTTGACACat GTGIGGIIGTACACAGIGTGGATGCAGGIGTIGTAT GGTIGACCTGTAITGTF AT AATFTGACACCTCAGATGA
13 1 T FCCi GATMCCTGCACCICACGGGAGT ATGGGTAA
AGGGIGGIGGAATIGCTTAAGIGACCAAATGAGSA CAGGAACAGGTAGAAAA.T
07 CC TCCAAACAG GC.
ACGGGAGTATGGGTAACG
ACCACAGTGICIAPAGGAATGTGGCTaATIAACAG GCAGAGGCGTIATATACA CA TACAATGGTIAACCIG1 08 CC:TGTACC ITGATTAGCA CA ACC
13 CAGCACGTTGGGCATATTC.AGCATATGATAATGATTI
ACAC.AGATAGTAATGCAGCTGCTCITCTACACATAAT TTAACAGAAATGGTGCAA GC.ATATGATAATGATITAA
13 ACCiCATCFMACGITIGGCCGAAMCAAGAATCAGA AAC.ATGIGCAGCACTGGAAGFCCATTIGACGAGCCT
AAGGTIGIGGICCAGOT GAATTICAAGAATCAGAG
GGACG TAMT A GACG
13 GCCiTCCCTGIT11T1TAAACACITTCCATGGACATIGC ACIGIGGAGGITGTATTCGATGGCG
FATATAAATGT AGICCACATTIGGTACAG CCATGCiACAITGCAAGAG
11 AAGAGA CCATGaGTA AA A
13 GG11.13GGGIGGGGICTITGTAICACCTACACCX:ACA CCCCOCCGCGAAAACGATACTIGTGACGCCGTAGT
CCACCGACCTATACAAC:AC CACGTACACCCACAACCAT
TGIGITGIGTGIGTGGATfAGTTAGOAGCAAGACA AIGTACAAACATATATAG
"FTACIGIGAGTMACTAll 13 CACTATTGTG GGAGG GGGAAC.A GIG
13 aAAAAACCACAGCAGCAGCCriGTACATACGTGATA TIAAACTACTGCAGACATACGTOTGTACAAACAATA
14 ATATGGTAC . CAAAAACCCAAC TGTGTGGTGTGCATGGTA TGGTAC
13 TGCAGAGGACGTAATATTAAACCCAATTGATrCTAGC GGACATTTCTCCTCCCAC:CAATACGTAAAGTAAAAG
TATTGGAAGAGTCTGCAG ATTGATECTAGCATACC-AG
ATACCAGTGC GATCACTGTA TT TGC
13 TGATGGCCTAGTGATTGGGCTGGGCTCGTATTGGGT AGGCTATAGAATTGCAACaTTAGGTCATTAATTGC
GGGCTCGTATTGGGTCTA
13 GAGGAGGCAGATAAACCTIGITGTACCATATTMTT CTCCTCATTCC.AAGGIGGTCAACTGCTGGCATGATA
TACCATATTTTTFTGCAGAT
13 ACCATCCTGAATAACACTGITITGTCTGCTTGCTC.TGC
TATCGCAGATGTAGGGITTGGAGCACCTCAGCTITT
13 CaCTAGTATGICAGGATTCATGGTTGCAAAATTACA ATTGGAATGTIGGTGTAGCTCCATATGGCCTGAGAC
ATGCAGAGGAATATGATA TGCAAAATTACATTATCTG
20 'FAGTACATGC ATGC1IGTGC TTGT 'FGC
ATIACAGGAAAAGCATIT
riTGIGTTACCTGAATGGGT
AAATGITGIAGGGAAIGTGATGGFCCOGATACTFA CITSCGTICTIAAACAGTA A
ACAGGTCAAGTITGTAAAA
22 CiTAMAGAC ACTA AATTAGCAA TTC GAC
13 CCTCCGACCGTCTCTITAGTCTOGAATCCCAGAGCA GATTCCTCAAGAAGCGGCAGGATCGTC.GTCG1TGTC
1TCCTCCCTCTGITGOTAG CTGAATa:CAGAGCMCG
23 ACIGG CTCTT , C G
13 AGACCTGCTGCTGGTACTGACTGC.TGACATTACTGC.r CA AGAGTTGAGCGACTTAA
ACICAGCTCCAACATITC CTGAAGGACGTGGAGGA
CTGCTGACATTACTGCTGA
TGTAGGTGAGCATCCCAATGTAGCTGCACATCGATT GGTIAATTGACACTCC.TG
25 ATGTTG ' TGAG GT
GCCCAGGICTTGATUTTG
GGAGATATGTGCGATATA ACTITIC.ACICA111TCAGA
26 TTC-AGAAAGT CTCC.ATCT GGTC AAGT
GAC.AATGCTACACCTCCA
13 GGGCTAGCACATCAGC.ITCCITTGAAGTGGAGCTTGT ATTAATGCAATGGACTCTCGCACCTGAGGAGGTGA
AlTATATGAGACATACCG
MGAAGTGGAGCTIGTGA
TACAAGGATTTAACATM ATMCGAGATAAATTIAC
29 AATTTACCICC GACGCTT GGAAGG CfCC
TTACCCTCCTATTATTTGC.CCACAAACGTTTGTGTGC TGTACTGTGAAAATAAGC AACATATGGTGCTATGTAT
30 GTATCCTG GATGTT TGTG ocm AAGCTGGC.AAGAACAACACCCAGAAGATGACTCAA GCTTTCTAGTITA AACC.AC
TTGAACTTAGAC.CGAATIT
13 GC.CGTAGTGC.1. GGCTAGTACC.AGGCAGATGGCGACT I
ATGGCTGGATCAAGTGAGCAGGCCATTTGCCTAGC
GGCAGATGGCGACTACCA
i 13 CCACAATATCAAG %CAA TATCCCAGCAAATGCAGA 1 TaTCAMTGCATITATCGICGCTAGACTCAGGCACF ACCAGAAACGGATGGGA GCAAATGCAGAGATTCAA
33 GAITCAAGT I CCTFC , G SI
ACCTAACAACAAATGGAAGTAATGCGTTTTAGATCT GCTCTR. I i I III TATCTCA
TGGATAATGAGAAACAAC
34 AACATGC ' TTCTCGATCATGT TTCAC ATGC
ATCAGGAAACTATGTTGC TCAACAAMAAAACCM
ATATGTTCCAGCTAGGACAAGCACCAAGTTCATCTT CTGTATCCTCAGAGATCCC CA
ACTATCCTGCCATATGG
13 TaTCCCAAGAATCCATGATTTGATMAGAGTTTGCT ATCAACTAATATCTCCTCGGCCCTAGGTCTICTIGGG
ATCCTATTATACCAGAATA TTTAGAGTTTGCFTGCAAC
37 . 1 GCAACT CCM TAGSGT 1 38 GGACC GIG 'ICI TGAT CAG CC
13 GISITCCIGATGATICTG laGTATGAAAATCTCAAA AATCCTCATCATGGAA1 CICACCAAGTTGIGTITGIC CCACA TCAAAAGGGCAGA GAAAATCTCAAAAACAACA
TCCAATGAGAGAGTATCC GAAAAGATCAAGAAAACT
40 GAAAACTAG 0 17 TG MAT'S-TAIGA ATGA AGGTTFG
GAACAGAGAAAAGCAGTACCCCATCTAAATACCGA TCCAAAATGAGTATAACT ATACACA/TCCCAGAATCA
41 CiAAICATCA 1 GAICCAIGIT AACTCFG "FCA
ACAAAGMCICTIGaCaCCGIACAAlICACGAA AAAGAGA1GGTIGTGIA1 ATAAAACCAGAAC TAT ACC
42 CCC.ATG GATTACTC ACAGT CATG
13 CraCAACTC:CTITTGCGATAATCAAGAGIGGAIGCA 1 AATCICTACFTCAA ICTGGACCICACI
43 . ATT11CC 1 GATTATCCC.1TA CT CC
.
CAGGAMCCCCCCICCAAAG TAATT1TGAGCCTCATAFX: TACiAAGACFCTAACTCTIG
13 'FCTGAGTGlICAGC.AGCC I AGTGA ITACICAATCACC AGAAIGG
rATATTCCICTICCCAGCACATCTGCTCCA TTUACAGAATCAAI AAAG 'FGATTACTCAA TCACCOCC
13 Cta: ITC:ATTACK; IGGITCiATIGATTGC:AAACTGTA ? AG I ACANT
AGGTAICAACGGAAIGCTGIGTAGAA r TrGTCAAS GGAGCi AG MG TaAAACTGTA I AACAACC
AAGCATACCAATC.ACGGAAACICIGACICTTGTTGTTAT AAAAACTTAGGAGTAAAG
AATCCAACTCTACTCATAC
TAACACATGAIGIAGGIAT
'FATGCFCCACTAIACCCAT
13 TCiACIGGG1TCACTCTCGA1TC:ACAGGGAGCNITG 1 G
ACTTACTCAACAGC:AACTGAAAGACCAGCTGAGAG TGAIGCGTATCCACTAAM
CAC.AGGGAGCAlTGTGTC
13 GC:AGAGT;TGAACCGTAAGCATITCiCCTCCTGIGACA AAACGCF GTIGATIATF
ATTACC
GACGAAACAAGATCAGCATOTCTGAATAACCAT TGAAACGATTATAAGTAG TAGCAAAAGAATCTATTAT
52 CTATTATGATGGG . TCTCAATTGCTT CAAGA GATGGG
13 GCAGGTATTAAAGAGGCATATTGCAATCAATCCAACT CAGTGTTGGaIGATTCAATTACAATGCACICAACTG
CAACTATATATTGCCCITG ATFAATCCAACTATAACAC
13 CCTTCCTCCTGATAAATGAATCCAC.ATCAGATATGA A
AAGIGGAC1TGAAACACCTGACCITCTGATCaGTT A ACACTAAGT AGGAC.1TT
ATCAGATATGAAGATATGT
CCGCAATAAGAATAGCAA ATGGGCATTIGGTAATGAT
AITTGGGTTGCAACAGTTMGAAATCTCGTGACGCs A ATCTACAGATAGAAAAGT
GACTITGTGTCTACITITCT
CCGTAGCAAGTGATTCTG AATTTCTIGC.AAATGGCTGT
ST GTT TATGT A T
GATGGCAAACCTGTTATG TTATIGGCAGITTGTTGACC
58 Ci3TGACC.A ATICAATCTCT I* AGT A
TGCTCTIGACAAACTITAC TaCCTGTTAIGACAAATG
TGAAGGIGTCICTGITTG AGAGIGATTGCFCTICAGA
13 ACAGC.ACAGTCAAAGACAGITTGATGCCTGAGAAGT TGAAACTTGTACTATTGCCGGTAAAAGCGCATTATC
GTGGTACTATACAAATTCT TGCCTGAGAAGTTTTTGAA
61 TTTTGAATG AAGCAAA , CGC , TG
.
13 AGCAGTGCTACGTTCAACACAAAGTTAAGTATGCAAC ACTTACAATCGCCAACAArfATTCCACATCGCCAATT
MAGTTAAGTATGCAACACT
13 ICCICAAACCAAGACTCGITAC.TACTTIATGTGAGA G 1 TGATTACAGGGCTTTTATCAGTGTTTTAGACTCTGCA AAITGTCAAACCTAATATC ACM AMTGAGAGACTAT
63 ACTATGTTG ' GCTTTCAC AGCC GTTG
13 AGMTCTAAMACGCATTATAGGCCGAACATGTGCA GGITTCTAC.IGTTAAAATTGGTGGCACAATGTAAGG
GAACATGTGCAAAGCGTT
13 GTCAAAATGTGTAGGGTCTGCAATTCAGGATTAAGTC AGATGCAGAGTTTGTZTaCCAGTTC-ITATAAAAGA ATGGATTGAGT. I I I I MCA CAGGATTAAGTCTTTTAGT
CCITTTGAMTGCCATTTG TACTCAAACAATTGGATAA
ACTGGITTTATGGTTATGG AMTGTAGTTGTACCAGTG
GTATTAAGGATGCAGATT AAGTTGAAATATCTGTTAC
69 Cf GTTACACCA ACAACAGTA ATAATGC ACCA
AACC.AGTGTCCGTAATTAAAGTrGCAC.ACACAACAA ATACAGACAC.AGAGGITT AAGGT1A
AGGATITTGTGT
i ATACCAGCTGACTACATCTIGCCCCTACAMAGAAA CCTICTGGTAATCTIATTA
GITCIGITGITGICTACAT
72 ACATGTG 1 CAT TACAAGTCA , AGT TM GIG
13 CGACCCAATC.AACATCTATTGCITTTAAGGACAAAGA 1 AGATGCTGTTACTTTTGCMCAGTCCATTAACAACA MOTC.ATTTGAACAACAA TAAGGACAAAGAMITTCT
72 TCITTCTGGT ' GCACTTTCA GTTG GGT
AAAACTCTGTTGCATCTAT
IGTGCTAGTGICAAACGT
GGTTATGCTAAGTCAMC TCAAGGATGATTACTGTGA
75 . ACTGTGATGG TGCATAT 111.6 IGG
AAGGTTATTTCTTTTGTCA
76 TGTGAA ACC:ACCATF T GAGA TGIGOTiTTGG
TFGTGAA
GATGTACTACGTAATAGC
'FATG1CITTAGCCGAGTGC
CGCAAGTATATTGTAAAAA
78 AAAAACTAGC CACA CAC; MT TrCTCICT GAAGGI
CTAGC
TTTTTGAAAACITTGAGGA GGCITCACACCTITAAACA
79 ITAAACAAG ACTCC:AAA CrCIG AG
FGAATCCAATGCAGGTGTAACGT AACCAITGG TIAAGTTGCCAGAAGITAI CAAGGCTITGG TIT TAG AA
13 GCAATTAAATAMCACTCAC.ACCAAAA IT GTATF TC IT T
GCTACT A
81 , CiCTACTAGAAGC ACAAAC ATGC GAAGC
.
AGCAGACACGGAC TrAGCAAAAGCAA TWIG CTITICGCGTGATC:ATAAT
CTTAGCMTI CACACCA TTGGIGTIGTAGGTGCTA 'IGCATG GAGTAACICIT AA
83 ACTC. TTAAAATT AMACACC C MIT
13 GCAACCACATTAACTGTTAACATCFGGTTITGAAGAC TTATGCTOCIA TAT TGAATOCi TTITCMAITA 1 AATGC CITTGATGGIGTTAMIAT GGITITGAAGAFX:AACCTA
84 CAAC.CTAAT TCCACAAACA GGT AT
13 ACCC.AAATACFMGMCITATAAACAAAATCCATAACICT TAACCCTGGITTMTTACTCCCiTAMTGTAAGACAG
CCTGCAAAGTGGTAAAAC AATCCATAAGCTIATTIGC
13 AOCAAT GTGTACMCGCMIT TAAT T TGI CT11 TT GIT GCACTIATITGITCMACTC.A 1 TGCACCAGCAAACTA CI GIT AIGCAAATGGACA. ATITGTCITMG TIGCACT
13 CiGCTGGACT AACACAGAA ATCA TRAGCTOTA 1 GTACT TA
IGGTTOCTAATGGTITGAATGCAAGITTAAAACiA GI TIT ACA IGTMT TAGGC GCT GT A
TGTACTAT GGI Tr 88 TAATGGG CAACO AAGTGC. G
13 GCFTGriCACSC.CA ITCTGITAAITTITGAAGAAGGCT
TGCMCAGCTATGrACAAAGAAGTGCATGGC:ACTA ITOCTCACCACAAA TAAM n-GRAGAACX3CTAIGAAT
TACCAGATCATGATTCATT AAATGATGGTAGATGGTTT
90 TGGTITTGTG . CACAAGTC TGTG TGTG
GAAGGTAATGCMTATAC
GGTGGACGTGCATTCATG
13 AC-ATCTITACAC.AATTGCCAACAGGCAGCTOCAGCCT GAC.
TAATGGTTCTGGTAGCGGTICCTGCGTAGTGTT CATTGTICTTTTGC.TGTTG
CAGCTGCAGCCTATCTTG
TGTAAAGTITOTGGITGTT TAATCATGGCTGTACATGT
93 ACA.TGTGAC TAATC GG GAC
GGAGATATTGGTTCTTACTG671111TCTATGGGGTC GCATGATTTC71TAC.TIOG TATGGTAATG1TAGTAGAC
94 GTAGAC.AGGA MACCAATT CAT AGGA
GlITTGACCTTAGACAACC TCTTAATGGGAAITTCTAT
GCATTGITCAAATTITAAC
TCTTCGCAACCACAATTCC
97 AGGATTAACATC ALM i 1 i G CATC
TGICTGAAATCCATAGIAGCTACCAGOCATAAAACT TCATTAACAAAGCGTAAT TGACTCAGTTAAA
TCTTAA
13 CAATGrAACCAAGTCCAGCATAAGTGC.AAAAGCATTT AGCCAl. ti i o i AlTATCAGAATGGTAGATCTTCCTCA TTGATGAATCTTITGTGGA GCAAAAGCATTTTTC. TATG
99 TTCTATGATGA GTCCAACA , TGAC ATGA
CATCACTAAGACTGATGCT
TGGAGGMGAATTATTAC ACCACAAACC.ACATCT11C
14 ACCGTAATCTACiTTCTCAAAC.ACACATGCITC01AAT
ATACGGTTACTTACAAATCCACTGCAGCAACATTAT CCACCGTTAAACCGTAATT CATGC.7.
TCCAAMTACAAA
14 TCGCTCCAGGGTAATACACACTEEETGo74-GTGGA CGCTCTGTGCAAAACICTGICAACACTCAACTCTGGC
GCAAACAGCGTATAACCA
TCCTCCTGGTAGTGGAAA
14 TGTTGAGGATCACCAACATATACAATCTATGTGC.ACT
14 CCTGGCAGCTACATAATTTTGACTAAGCGATTTATAC TTTAGGCTrACAAACTCAGACAGTGGCGAATATAAC
CTCCAGTATAAATAGGCG AGCGAMATACATAAAAA
05 ATAAAAACTOCAC ATAGTCATATICAC TCAA CfCCAC
14 AGACAATGACAAATAAGTAGTGGCAG7TTAAGGATT AAGACTAGTGGTGACTTGGCTCAC.ATGTTCATAGGT
GTTIAAGGATTGTGCACGT
14 TGCCAACATTGTCACCTGTGATCATGTCAG.AGGTTGG TGTACCTCTACAAGTTGGITMCCGTTAGAACACAA
CTAGAGACTTTGCCATGC TCATGTCAGAGGTTGGTTA
14 GCTCC.AATCTTAACAAAATAACGCAGGEOTTTGTACT I
ACATTGCCAATGTGGTACICITGCAATGCATGTTTA
08 TIGGGC : AAGCAACAA CIGGCTCATCTGATGTACT
GGIGMGTACITTGGGC
s 09 GITAAGAATO 1 TACAA , TGACIAGA EGTTIGGCIGT AATG
TCAT6CATTTCAC.ACTCCTGCAACGGTGCTGGTTTCA GAATTGTAATGTGGATAT
TGAATTCTCAATTGTTTGC
TGCAGG ' AT GTACCC AGO
14 TCTGTGAATGTCTGCCACAAATTATAACA1TTITACTC AGAGAACATTGaTTTAACGTTGTICTAATGGCTAC
TCTCTATAGAGCATATGTT ATAACATTITTACTCAAGCT
ACTGTCAAATCAGAAGAT TTAACTGGTITGTTTACGT
13 . TITACGTACC5 TCA GGT ACG
14 CAAC.AGTCAAATCCAAAGACTICAATCCTAGTTCTAA CCAAGGITC.ATGAGGTCATAATTGAGCATTATCM
CTGACATAACACTCAAAT TCCIAGTTCTAAGACTGTTT
14 GCACCCAAGTG TAACACTOCATATTGTGICAATATTT TGGCTCTGATTATGGIOTTGCAAAc-'t AACAGTACCAC CGTG A 'FACCAC
ACTICTGTTAATACGTCAT GTTGTCGGAATTAACTATC
AAATGMGTMGCTIGT
"mccrrGTTGCATATTOCT
14 ACG I AGMCAT IGCCAA I AGTAG TGATOCTCACATA CRICITITGTGCiG I
GCACTAAATOICCIGTOCGTGA CAACAACACTI TAG TT TCA GATGCUCACATACCATITG
14 AOCGAI:ACAATAGACACAGG FAAGAIGGTITTFATIC
ACGMATTGIGTTGIACCITTCIAMACAGTT AAAG CAGTIG TCCIT TGATG LAC AGATGGITITTA
FTCIACA
19 . TACAAGCC CACITGCC C AGM
.
GGCACCAICIACTCTATCACICCCACiCITGCTGATAA GITCCTITAIGAAIGTTAC AATAITTATGATGTAT
CTCi GTATCTOGTGTGG ACAACAA AlTGG 61G160 GITTIAACITATTCIAGTIT
AGCGICATACCIAGCFTGCCCTICTATCICICACTGCG TIAAGAAATAGCOCCAGG C.16CAGATGT TACT
GAGAT
CGAATGGCCATGTATACA TTCTITAATTGGTGGAATT
CTGCTITGAATGTATTCGTITCTCATTGITGTGC.AAG CTATCICIAGETCCATICA. TC1 ATGACAG
ACT TGACAC
24 TTGACACTAT C.TGTCT GGC TAT
14 7 GCCITCITT GTAAAGAGCAACiATTGG ITGIOCGTI G
TCACATCTCGC.ATAATGTTICIAACCGIGACATTGCA ATAAGGATC11 CAAGCGT
GGITGTGCCITGATGGTA
14 GTGCTAATTICACTGGTCAAATTCATACAAATTGCCA AC:ACTGITC:AAAAA
ITGC:AAACTCTACiCCACTT TAA TATRA TG ITAATAAGAC TACAAATTGC:CAAATTACA
26 AATTACACTG GTC:GACTAA GCTGC CfG
GGCITCITTAGTTGTITTGC
14 AAAATTACAGCAATCAAAGTTGCGTTTCTAATCCTTri MTCCATAC.AGCTTGGCACTGGTGTGGTTACATTG
28 CGTTTATGTGC . AAGACAA GTT GTGC
14 ATCACTATAAGCACCACACACCAAGCTAGTGGATGAT AAGCTrTGTTTCACTTGCCATATGTATATGATTGGTA
AATTAGAAATCCAC:TAAG
29 CATGC AATGTG6TACA ATG1TC.0 AGCTAGTGGATGATCATGC
CTGTG TAC.GGC TOOT G
TITTCT ACCTACCTGAATACATGAC TAAMATAGTAGAGTCGG
14 TiCTGAAACGTMTGAACATICCATACCCATCPACAA GAAAGGGCAAACEIGGTGGATGGICCIGTGCCAAG
GGTCATACCTCGTAATTTG TACCCATCAACAAGAAAGA
GTGAATCCAAACCTCAATC GGAATCCTICAAGTGACAG
14 GGCCAAAACATIGIGTGACAlTAGAAAATGCAAAAG CCAGAGACCTTGACCACAACTTTTAGCTTTAACACC
GAAACAAAAGAACAAAAG
AAATGCAAAAGCCACGGT
AACTTCACCIGCAACTGCTGAGCATGMAGITTACT AAGMCITGAGGAGTIA ATICACTAGAGAAATGCAA
TGATCFATGTAFGAIGGTA "TCAGAGCTOC7MAATTA
14 C.CAGGTCiTGGCAAATGGACACTCATGTCCIGTITCTG
CTTTTAGTATTTGGCTTCCGTCCTTAAGTAGCCAAAT ACTGTITGGC.ATGMACA
37 TGT ACCTAGGAC , GT
CTCATGTCCTGTTTCTGTGT
14 AGAAACATCTGTCCCATCGCTGCGATGAGTGTCTAGA GGAAATCCATATGCACTGTTC.ACACTTAGCAGCTGA
TACAATTTATAGAGACTG
33 CTC a AAGCC
GCGATGAGTGTCTAGACTC
AAAAGACGGCTGITTGAG
39 GAGGG GGCACT ATC.G1TGIGTGTGICGCC GG
TTGCACCAT A TGTAGAAGAGTATCAAAT
CCGCGGCATAAAAGTGTT
14 GGCGCATITCCITATCTITTATACWiesCATTGCTAAAA AGGACAATGGATAAAAAAGTGIGGGATTAATTAGC
GCAGATGTAGAACCTAAT GCATGGCTAAAAAGTAAC
CATGGGGACTAACAGATA AGTCATGAACTGTATACAA
AGAAAGAAAACATAGATC
ATATATATAGACATTGTGT
44 TTGTGTGGGGT A.ACCC CGCAAAAGGGACACTGTA GGGGT
CTTAGGCCAAGTGTGGTG
CAGCCGCAGAGGTTATTATTGGTATCC.AAAGGTATA
46 CTAAACG 1 GA MC:ACC-AM CAGTGCTGTCCIAGAGGT
CACTTCCACGCCTAAACG
i 47 CAC:C 1 GIGTA , CCCCACiCiGCCITCCAAAG
*FCCAAAGGGCAAGGGAa:
ATCTCCTGCCTTGGAAACTGTGTTTACCACCAGTAG GAGCCITTACTNVµTGCTG
TTAGAAGATACTTATGCTG
48 CTGAGGA ' AGGTTA AC AGGA
CTGATACATTITATTTGCA
ClIGGTACGTAAACGCAAA
AAAACTGTTGTTATTCCTA ATCTGGGAATCAGTACAGA
SO AGAG TAACG AGGT G
ATGTCTAATGATCCATATG
51 . TTGGT GAATAATATATC5T GC.AG
'FGCCAMITTITCTITGGT
CTCTTACTGCTGATGIGTT TCICATAITAATGTAATGG
53 't GCCCA ICTCCACGCGAAGGICCGCGGA AA TCCAGC ACTGAC GGCAAGGCCT
TCCACA TG alCCGCGGAAATCCAGC
54 CGTC TCCTGTCACACCX:CACCCill GACCCIGICOGTGTGC GA
GCA.TGGAC:ATCTCTCCG It 14 GCGTTC.AGCAGCGCCTC.AGAGTGTCG1TGGATCTGA AGCATGTGCCCCAATCTTGGAGATCCAGGIGTAGAT
SS GC.IG GCAGO AC GG
14 CCACCGGCATAGICATGGICACGC:CTGGTGACCTGC TCCGACTACrTCCCGGAGA I
GCGCTCCTCCACCTCA
GCCIGGTGACCTGCCT
14 CiCCGTGCTCiCACCTCAAAC.ATTAACATGTACCCGGCA
CCAGAGACGGTGITGCCiCCGGCAAAGGATGACCAG
57 . CiTCC GM AGACCCIOCGCTCCAT6 AACATGTACCCGGCAGTC.C. .
ATGC.;CATC
14 CTACCTGAACTAAGACTC.iCiG i GTACCATCCTIT IGCC ACACCCAG GM I
GCAGIGCCATGATAGAATTCCAGG CACAAAAC:AAACTA TC1CC
ACCATCCTMGCCAATCA
GIAGGIGCLICCAGAGCC.CCIAAGT ITGGGGGCGC CCAGGGGGGACT17ATG r 61 C.GGGTTGGICCAGGGCCTIGTCGCTGACTCCGCCA TAG CGGGGCCACCTTCATC.A
GTCGCTGACTCCGCIA
14 CFCGTCTGGGTICTIGGCCCCAGCTCCIAAGAACiGC.A
TCCGTAGAAGGGTCCTCGTCCTACCCC1GAAGGTCiA
CAGC:TCCTAAGAAGGCACC
63 AAGAGGCCCCTCTCC.GCGAGACCCTAGGCGTCCCCT TGGGCCCTCAAGTCC.AGOCAGAGAAGGGCCGGTGG
CCCAGGATGTCCCCCAGA ACCCTAGGCGTCCCCT
14 CCICIGGTAGGACT6GGCGACCCACAC.ACCCACCCGT
GGGCCA.ACAACC:C.AGACGAGTACCACCTCCICTICT
64 CT TGCT AC.ACCCAGGCACACACTA
CACACACCCACCCGTCT
AAGCC:AGACAGCAGCCAAITG7CCACMATACCAG GGCTTG TITGTGAC:FTCAC
AAAGGTCAGGGCCCAAGG
14 GCCTFC7TAGGAGCTGTCC:GAGACACCCAGGCACAC ACCGGTCGCCCAGTCCTACGGACCCITCTACOGACT
66 ACT . CG CTAAGCCCAACACTCCACC
ACACCCAGGCACACACT
14 GAGTIGAGCTGCCTACCATGCTCCAAAATTGCiTGCCT CCCGTOTACTGCCCAGCCTGGAAATGCATAGGAGT
CAACACGACCCCAAGGAA CCAAAATTGGTGCCITGCT
67 TGCT. C GCGA G C
14 CCTCGICITCGGICTCAGCTTCAC.ACGTTAGGGGGCG CGAGTTACGCTCCTTGGAGGGCCTTCTTCCACGGCG
ACACGTTAGGGGGCGC
ACTCGTCCGCCTACTGG TTGTCGGAGGTGGAGAGG
CACAAAGCTGTGTGACCICi1CCC.ATCGAGCTCGCTG AAACTCTAGGCCTOTGGA
GCTCCAGGGAATCMGGG
ATCTGCTGACTGAAACCC
TCMCGCTGACCAGTC
14 TCGTGGIGGAGGCTGCKAGGCAGt. I tJ CC TTGAGCT TCTI
GAGGIGGCGCGGGTAACGCTGCAGGAICTGG
GCAGOTGICCITGAGGAG
14 GCCAGAGCCAACCITAGCTC.AAGTCGAGGGCATGG
TCGGGAGAGGCGCCTT
14 GCACCACCATCGTCCACGAGCCGCTATCiCTATTGGG
AGAGGGCCGGTCTCGA
14 GACCGGACGGCCACGTTGTGAGAGTATGGGGCC.CC
AGGAGAGATAATCGGAG
GCCGAGGCCACCTTGT
14 AAAGCTCrGAAGCGCCTCCTATCCCTGTTGICTGGCCA C.GTCGTCCAGGACCAAGGGGAAGGAGGAGAACCC
GICCTCCTCATCTACCCCA
CCTGTTGTCTGGCCACG
14 GCCCTCCTITGGGGATGATGCATCTAGGICAGACAG CCGAAGGGAGGTGATGGGGAAAAACC.ATCCCCCG
CGGACATGGATAGGAAA TCTAGGTCAGACAGGTAG
14 raCGIGGGAGAAACGCCCGAGAATGGCCGCGAGIT GACIGGGGAATCGTCGGTIAGGGTITGCTCGCACGG
GAGAATGGCCGCGAGTTT
14 AACGCTCTrTTCCTICACCGCTCAGATCCCTCTGGCG
CIGGATGCC.CTCCACGAC
AGCTGCCCCCGTCAAG
AGCTCGTTGGAGAGGACC
GAGAACCACGAGAGGTGC
as C:CCiCAGGCAGAIG ITGGCCGTCiCIGCAGCCCICGA
CGCTGCCIGGTGAATGCGCFCATCCCCCIOCGIGTC CAACCUGIGCGCCGAG GIGCTCCAGCCCTCGA
GCCITCACTGGCCCIGT
GAGGGCCAGATGCAGGAGCTTGATAGAGGGACAT ACGTGACATGGAGAAACT
89 AATCCGC:ATGGAGC:CCCMCCAGGC:CCGCATGAGT TTCAGCTGCTGGACACGCAGCGCAC.CTACTTCACCC
TCCACACTGAGOCC:GG CC.AGGCCCGCATGAGI
GGCACATGMATCCICTMCGG CAGCGGCICTG TGAGGT CAGCCGCTTCITGGGC
14 CCTICAGATATGCCACCCCCCAGGGGTACIGGGGGT GCC.AGGIGGATITTGAGCTCCGCCTCCAGAATCAGC
91 GMT GGCC AGAGACGICIAC Ga:CA
GGGGTACIGGGGCi TGGTT
TAGCCGGGATCGATGATG CAACATGAGACGTGACCG
92 C(36 TCCTGCT
CACCIGGCITCIGACCGG
14 CAGCAGCGTGTTCACAAACTTATAGCCTCAGCCAT Cr ACiAIGGGGATAITIAAAAGGGGCAGGT11AACGAG TATAGCCTCAGCCATCI AC
14 GCCIGTGCTACCGGACACGGAGCAGCTC.AGGGAAIG ACCFACGCCCTTGCC:CCCF
CCGICGCTAAMACACCE
AGCAGCTCAGOGAATGOC
GCATTAGACGCGCGCG
GCCAGGATGCCACCGAGGITGAACAGGCLACICGG 'ITIGI CAA TCCATGGCAGG
14 GGACATIGI ca:cccacAer GTA TGGG 1TACCGCAC: ACCCGGTCCTIGICiACMCITACGTT
GTIGCATCiC:C GTCATCTIGCAGATCCIGG
GTATGGGTTACCGCACGC
CAACTTAGCAGTTCGGCAC
15 CAIGGICGGGCTCGGGAGAGAGTCCGGACAGF T it 00 GIG AAGC.CCGGTGCCIAAACGAGICACTGCTGCTGG ATGC CTCGGTGTCACTGTTGOC
OGAGICACTGCTGCTGG
15 AGCCAGATGTTCAGGAAC:CAAAGCATC:GCTTAAGTA
GGCTGCATTAPCIAAGCCIATGAGCTTTCCICCAGAA AT TAGGCGACTCTGCATC GCATCCiC17AAGTA
MAGI
15 GGGAAC:AGCAGGGGAGGTC:CAGGCTAA TGAAAGGIC
CCAGTITGCCCCATCGTITCCCCiC:AAGGATCCCA IG TGCTCTCAGTTAAC:GAGC
AGGGAATGAAAGCTCACG
IS GCTCTCGCT GTAGICAGACTCGTGAGGGCAGIGATA
CGAGGAIGAAGCAACCCCCGGGTAGAIGGCGAGA CCAGTGACGAAGACCCAT
TGAGGGCAGTGATAGCGA
15 GC:TCCGCCACGTATTCCCCGATCTACGCMTAGCCA
GCAGTTCGCCCAGCTC
is TC.TCTCAGGACCTCAAAGGCGGGCGAACCAAGOCC
CCACCTGCSGTTACTAATGT
OS CATGCACCAGGGGCAGCTGCGTFCCAGCTTCGTCC AGG GC
GCGTTCCAGCTTCGTCC
CCAGTATTIGGCCAGGIC
GCCGCTGGAGTTCCT
CCCOGCCATCITTGCCGGATICTGGTAGAGGCGCIG TGACCATCGAGTACC. ITC.0 AGGAGCGACGAGTATGTG
TCTAC-ATCGAGGACCTCG
CGAAAATACTGCCCCGCG
TTCTACAATGCCAGGGGG
IS GCTCTACGCCTICCTCCGCAGGOTGGTGCCICTGTG
Ci GCCCCCGGCCATACTCCAATaTGAGCCGGCTGGG ASITCGaiGTCCAGGAGG
GCTIGGTGCCICTGIGG
GGACCAACAAMCCCACCTGTGGCATGICTGITCCCC GCCCATC.ATCCOCC.GAGCATCITCCAGCXTCTCTTTC
AGAGTGCCTCTGAGCATG
GGCATGTCTGTTCCOCCT
is GTOTAGI1GAGCATICACCITTIAACTOGALT.C1T1A CATCGCAATCATGAAGTCCTCCACAAGCGTMGAC
TUTTTGAGATCTGAGGAG ACTGGACLaTTAAAACAT
13 TITCC GCC , CCATGGCGCGGTGTCA C
AATCCCATGICAGGGITGGGGTTGGTATCATGCACC AGTTC.AGGITCCACATCTG
14 AGC i TCGCT C
GACAGCACCTCCAACAGC
AATGCCTCTATGTTGGCAC
CCTGGTGAGGTGTGCCA A
GCACCGAGCTGATGGGT
15 CACCGACMCGTICTGCGCCAATCTGGTTGTC.GGCCT AAACACCCCOCCCACTGGCCTCAAGGGGGICAGTAC
AATCTGGTTGTCGGCCTCC
TCCAGAAGCCAGGGAGG CAGACGGATGTCAGGTCG
CTCAAGTGTAAGCAGCCCG
AC AA CTGGAGATCCCCGTGACT
GTGCTCTCATCCCTGCAAC
AGGAAAAACATAACAATC AACCAGAAAGATACCCAG
15 CGGCCGITTCAAATGCTGTGGACACGCATCCCTGTCT 1 CCT.
GaCCAGCGTTGCCTCCCGATTTGACCTCACAC
22 Ta C I GAGA TGTAGGCGGGGAGATGC
CACGCATCCCTGTCTICTC
t IS 1 'ITGGTATAGGGCAAGGTIGGGCGTGGTGCTACGCG
23 1 GCCCITCCACCGCCXIGTGGCTAG CtCaXiCAAA 1 AOTTG , AGCCCCICTaCIGT CC GaTAGCCI:C.C6CAAA
15 CGGGGTTTACAGTGGCCTC.ATTGAAMGGGGGTGGC 1 CCTTGGGGTAGCCGACGGTGGGTCACCTGACTACT
24 GGT ' GAGA ACGGTATGCAGGOGCTG
GAAAAGGGGGTGOCGGT
ATGCTCTCCITCCTCTGAG
TCC TGGCC C
GGTGTCTTGGGCATCATCC
GCCCCAAGCCITCGCA
GAC.AAGGCAGTAGAGGAC
27 . GGACAC CAG TCGTCTGCACCCCAGTG AC
CAGGCACCGTCTGTATACG
26 CGT CT A1'AGCGGCICACAATGC3TG T
TAATGATGAGGAGCACCG
GC:CACCACAGCACAAGTG
GCAGCACACGCCCCCATTCCITCCCCGGGAATGTGT GGCT TGGCG ITTACCGGACACA
CCITCCCCGGGAATGTG T
is AAGGCTGGCAAAGATCCCC-AGTCTCCIAGGAATGCC CTAGATITAGCGATCCCCCGGTACGAGCGGGATAG
31 CiAGC CAGG CTCAAGGCGAGGCTCCT
"ICTCCTAGGAATGCCGAGC
GAGCTIGCGGCTGAGCTCaCCGIGAAGGCAGGG I
GTTGCGCTCGACXIAACT
IS GCCGAGAAGACCTICTCCICCVITTCAGCACCCCCAC CTCCAGGGAGA I
GGGGGCCATTGCCGATUCCIGG
33 . A C CATCAGGGTC7111M.GTC
CGTC.A GCACCCCCACA .
15 GCCATCAAGTGCACGTGCAACGGGGAAAGAGGCCG CAGCTGLIGCTIGTCGAGGATOAAGAAGaGCTGG
34 TTGGC alcc CGGCGATGGAGAGGCA
GGGAAAGAGGGCGITG GC
is GCCGGGACAAGCTC:AGCC TCCCCAGTAGGTGCCIGG ACACGGCGTACATG TCGATGGG I
CTIGCGGGCCAA
Cr CAM CACATCCTCCTTCITGGCC
CCCAGTAGGTGCCTGGCT
15 AGGAGGCCGGGCGCTATACiGACCCCCTCIATGICIT
CtCCIGTCCiACCAGAGCiACGACCIGCCACCAACGAG A TCiai AACT CGTACIACCI
ACC.CCCTCTATCiTCTTCGC
15 GGGACGTGGICTACTGGGAGCGAAGGCCCGICTGTA GCAGGC-ACAGGGTCTCCiCiTGCCAAGCTGCAGTCCT
GAAGOCCCGICTGTAGAG
37 GAGG G TCCC.AGGCIACCCGAGA G
15 CC3aGIGIGGGCGAGC7 GATCi TCCCCICIGCCOTGG
CC.AGCAGCiGCACAGAGGTGACGCTITCGGGGCGTA GCCAGGTGG ITACAGGAG
3$ AA AG A
GTCCCCTCTGCXXITGGAA
IS AGCTACiAGAAGGAGCCCCATGGGCCGCATCCACGTC TICCTICAGCGTCIATGCCCAGACTGGCAGCCGAAC
39 CT CAGAA MACTGGACC.ACTTCGGC
GCCGCATCCACGTC.Cr 15 TCIGGCTGCGTTACACCGATAIGCAAAGATCT GCGT ai TAGAGGACGGAA ITGG
GCAAAGATCTGCGTGGAC
GGACA CGGGCCTCTGGACCAGTCACACCGGCCAGCCTCAACT TG A
TAGCCCiCACGGACCCTGCACACIACTAGCOA
ACCTGTGGCCCGCGTA
15 TCAAGGCCTC:CCTGAGGAAGGACGATGTCGGCTGTC TGAGCCGGC.AGAGGACCAGCGGTGTTC.ACCCGGGA
42 CTGC . TG TACGTGCGTGTCTTTGCC
CGATGTCGGCTGTCCTGC
43 CCICCCCTACGGTTACCCC.ACCGGGAGGCCGTGC71T TGC GAGGTCGTTGGCGGCA
C.GGGAGGCCGTGCITT
CAACAATAAITTCC.TGTGC GAGCGGTCTGTATAAAAAC
CTIAGTGAAGAGTOTTGTCACCACTCTATAAGGGC TTGGAGAAACTGTTGCAG AGACAAATATGATTGGAA
ATTGGAAATAGACC TATGCTCC A ATAGACC
15 CGTITGTAC:TGGAGAACCAAC.TAACGTTACIGCAAG
CCCGCCIATTGCAAGCTGTATAAACAATCGTCTTTTG CCCTGGCATTGTTTAATGA GTTAGTGCAAGATTCCAAT
15 TGAAAGAAGAAAATGGC.AACTGGTGACTGTACTGAA 1TCCTTIMG1TCGGIGATTATCCAGCTGACTCATGA
CAAAAGCTGTTITGTAACA ACTGTACTGAATTCCAAAT
is GGTCTATITGTGTGGCGAGITTTCAGTITATGAAAT A
48 ACTAATGCC.ACC AACTG AIGAGG GCCACC
lICKACTTCICTIAGTF CTCAAAGGGGAGGMCTGA TATTITGAGGAIGCAAA00 is CCAAAACGG7CGTGTAGGICTTAACAAITGCACC.GA TGCAAGCAGGCCCATAGATGGGGGTAATTICTGAT
15 AGGTTGGGG i it 1 t CAATTTCAAATTGGAGAAGAA AT
AC.ACCTTTAGACAGAGTTC.AGAGAGCTGAATCiTGA TAGG11TTGCAGMAATA
TGGAGAAGAAATTCCITTA
51 TCC1TTAGATCC CG CCCATA , TTGTG GATCC
IS TGCCTCCATGAAAGTAATATCCAGTATCTTCCACCTCA
CAGAACGGTTGCTTATGGTTGGITTC.ATCGTCTACA GCAGACCACTGGAN \ ACT
ATCTTCCACCTCAAAAGCC
15 GCATITCCAMTCCAAATTCACACAGAGA1TGICCTC 1GAGTC.117TC.AACAAGACAGAGCTGGCCAAAGAG
TTAC-AAAAC.CATGTAAAG GAGATTGTCCTCCTATTGA
15 TGCAGGAC.ATITTGTAGCTAATGATGGGAACTTGCCT
GATGTTATTCCAGCAGAAAAGCCTGTC.ATATCTATT GCTCATATTAATGTAATGA
GGGAACTTGCCTITGTACC
15 GGTATGTIGGCMGCMATAC.TCCACGTATATTITA CTGAAGCCATTTGCACTTATC17CTAAACGGT1GGT
CTCTACAAATAAATCTGCA CACGTATATITTATTITTCA
AGCAGGGGAAAATAAAA CTACTGGTCCTGTTATGTG
IS GICGATGAAATC7CCTGGATMCATCTGOTCCAGTG GCfGAGGGAGCAATTGAGCTTTCTTTGGGAAATATT
CIGCTFCCAG TGAGA ICA
15 .ACTICTITCCUTTCTIGTTCACATATTTGCTATGGCTG
ATTIGCTATGGCTGACGG
is AGCATACCITGGTGCTATTAGATTITTGCTAAAACCC TTCGCACTGAGTAGAGGCTTTGTTACACTCATGCAT
GAGGATGAACTATTACTG
TTGCTAAAACCCGGAGAC
15 AACCGGCAATGGaCC.AAATGITAC.AGGACTAAGGA 1 TTGAAGGGGGATGGACTGGACCCTGTTCATTCTGAT GAGIGCCAAATTGAGGAT GITACAGGACTAAGGMC
60 ACA ITC 1 GA TGA G Arf C
i CTAICAGATTCTGGCGATCTACKAGMACTGATIG GCATGGAAAGTGTAAGAA 'FGATTATCCCAAATATICA
62 I ATICAGAAGAG 1 CCCCCAG , ATGG GAAGACi AGGTGGTTATTTGATCGTTGGGAGCGGTCATATGA
62 GAG ' AAATGTGTA CAACCGCCAATITCACTG
CAAGGTTCGTMGCAGAC
is ACACCAAATCCTCCTCTGICCTGAGCTACITAATTITG
GGAAATATTTGTATTAGG TAAGTTATTAGACTTAGTG
64 TTAGTGGAGAAG AGC.I. 7 TG PCGC GAGAAG
CGTGCTTGCTACAAATTTA
65 . AATTIAGG GAICIAAIT CGGTGCAAGACTICGT11 CiG
AGMAGATGGTGAITTG ATTIGACGAAGGGACAGA
66 AGAC Gar GAM. C
IS CTGITGTTGGIGGATTGGCCAAATGAAGCTGAAGAT CTCAGGGAACTAAGGGCITGGACAGCTITCACATTG
ACAGCGGACTAGAATGTT AAATGAAGCTGAAGATCTT
GGTAATTGCTGIGTATGCT TGAATGATGACTTAATTGA
IS AGICAAATTGIGTAGC-ATITGCAGTCCTGAATGGATA
TTGATGAAGCTACCATTGCATATCAGTAAAAAAGCC AGTATGA ACCCTAATGTAT
ICCIGAATGGATACITACA
69 CITAC:AC.A CTGGCA 37 A ACGC CA
GACCAAAAGIACAAA TAT CAGCAGACi rf AAATCCITI
70 ATC.CTTIGC CATCCTC TTGCA GC
IS CCCCAATC:GCATAACTCCATTIAGACI7 TMGACAAG
TACCAACCTGTACCTOCCITTGGCAATAGIAAAACCA ACiACT TT AAGACAAGAGC
73 , AGCAAAT TGCCAAT GICAGATIGACCACTGGC MAT
.
15 ACCCCGACCTGCiAAAGGGTGTCTTCGCC:ACCAACIG CGCCAACAACATCGCGACTCGGG/CfGATGICOGG
ICATTGCCCCTCIFIACIAG
TCTTCGCCACCAACTGGA
is CiCAGGGGGTAGTGGAGICTGCAAAGGGGAGGGGA CACAGGCGAAGGGGAAGGGGAGGAGTCiGGAGCG
CAAGGGTCGAACAGAAAC
AAAGGGGAGGGGAGCG
15 GCACTTAAGCA.CATTCiGCGTCCACGA I TACTGGCTGA AG I
ACIACGTCiGICAIGGGTAGAAICA I TCGCGCTC
C.ACGATTACTGGCTGAGG
CGTACTAAACGTGCCTCTGTTACCiTGGACATGTACC CCTAC.AGTAGATTGGICTT ATGATC.TGTAAGC1.
ITAC.T
15 CCIGGCCCTICCCCTAAIGGAA TAAGTACAGGCCG 1 G AG 1TCCiCCC
IGGGGTGA TACCGGGICAMTGGTCT ATAAGI ACAGGCCGTG GC
76 GCA GAC.IGT TGTTTTTCGGGGGCTTGG A
IS ACCTICTGTIACAACIGGAG TATCTAGAA ItAATF GC GGCTCGIC I
GMAT T ITAGAGGATTCiAGITCGTGCA ACiAAICAA ITGCTGAGA TA
77 TGAGATACAC ACAC GTTGACC.TTITACCTGEFTG CAC
15 GGCGTTG TACATCFAAAAATIGCCTIGAGCi TCACACA ACAGAAACACCTCAGGGATATGTTACGI
GTGCCiGA 11CAGTITGATAAT CCAGC TGAGGTCACACA.AA TAT TT
IS ACTiTCAACTCiGGCCTTGTACTGAAAIGCAAFTAITG ACCTGATGf.all GGAGGTGGGAA
FTAAAATCCACA ACCArf AACACAGAAGAA GAAAIGCAATi ATTGGGG
15 GCAAACGGTC.ACTMTTGCATGATACAGTGGCGAGA
GIGGGTC.ATCCATATTATGATGTC.CAGACACTITAG CC.AGCAAGTGGTAAGGTA CAGTGGC.G
80 GTTCAA AG . GGACTTCAAT 7 G
15 ACCTGATGTTCCAACCCCTMAGTTIGGGCC:IGTAGA GGITAGAGACACAGAMACTCTAGCGATGT51717 TTATAATCCTGATAAGGA
GTTTGGGCCTGTAGAGGT
GTGATCAACAACMCGAACAATAGMTACTAATGA AGAGAAC.AATGTTATGCC TTATGrfAGAGGAGG 17CA
82 AGGTTC.AGTT CCCACTTACTGTA AGG 617 AMATGCGGACGTAAGAG AGATCACTGAAAGCCAGA
GaTTOCAAGACAATCAA CITTCTTCTAGCATOGACC.
84 CC.A GTCTC GG A
CTCAAGGGCGTTATATTGG
is GMAATGCCACTGTATCGTITCAAGCAAGAAGGITG AGTCTGCTCCTAAGCCTGAAAAGAMCITCTIACCC
86 ISGAIG TTGGG CATCC.GTTCTTCACAGGA
GCAAGAAGCT 1G IGGA it 15 ACACGITAACAGTACTCGATICCIGGCACiGAGAAATC CITCCCTGAAGGTIGAGACAATTTGAATGCTCAAAT
87 CCATC C.AAAGGCG GCG
GGCAGGAGAAATCCCATC
15 CGTI1TGAITTAGC.C1CA1ICTTCAIGGGGGT TAAAG
AAAICCATCTACGTGCATITCCTTACTCTGAAAIGGA ATCAGAGGATAI AGO-IC
RS GGAGA TTCTTGATG CAAA
ATGGGGGITAAAGGGAGA
15 GTGAAATGGaiAACTGTTIGITCGCATGGIATCAACA AACGGTTITAATAAAAAGCTGTGCTAGACATGAAGT
GCATGGTATCAACAGTGG
89 GTGGG ATGCAGAGAC , CCCCTTGAGGCATTTGTG G
GTACAGCTCACAGTTGGCGTAAGC.GATACGAC:TATC AACATACTGCAGCTIGTG
90 CTGGTG i TAGC G
GTGTAGCTGCTTCTGGTG
15 ACTCGGAGACAAGTGATATAATGGACAGTTAGTCTG C.AAGGCAGCGTGGAATTATACAAC.CATATTCAAAGC
ATTCGTGGAACITTCAGTC CAGTTAGTCIGGAOGGAT
is CCATCTITACATGAGAAGGCATTCCCTACCTTCCITTA GATCITTATGCTCTGGGGGTTGL G
iii i CTGGATCA ATAGGGATGMTGATCG CTACCITCCITTAGTAAAG
CTGTGGCTCCNOTACAG
is CGCACCTTGACAAATCCTITAGAAAITCTCAAAAGGG GCTGTIGCCGAGGGAAGAAACGATTCTITTCTCGIT
TTCTCAAAAGGGATTGACG
CTCTGGAAGCAAAGGTTT TGAAGCTAACAGAACATTG
is AGACAAAGAGTATGCAGCGGTCATCGCAATCGTAAG TCAAACAGCATGATGICCGATAGCAGGCGTTATOG
CATCGCAATCGTAAGTATC
ATACCCTGTCAGAAATACACGAGCACAAAACAATC.A GAGCAAGAATTTGTAAGA
CCTCTITGTGAACAGTGTC
is ATCACCCTCTACAAAAAGCTCCTATACAGGGTGTIAT 1 GCAGACITTCTGGGGGAGGAAAGAGAGCTCAGATC TAGACTAGITC. TGCGCCTT
TACAGGGIGTTATGTITTG
t 99 CIGCLTIC 1 GGC:AAC , TCTCGAAGCCGTAGAA.GC IC
is TTTGCAGGCCTGTAGTCCCTGAAGAAGATAAAACTAA 1 TGCTTGCGAAAGGTGTGTACCCATTATTCCACCCCC AGTICATTCGTTTTGTATC GAAGAAGATAAAACTAAG
00 GAATCAGG ' ATC GT AATCAGG
AAAGGAGAACCTCGAGCCITGTTGGTAAAGGATGT Ca. i la r z I i CTTCTCGGAA
GGCTCTTTTGTTGAGTTIG
CTATTCAACTAAATGGAG CAATGTTCTGTTATCTCCA
TATCGATGTTTCTCGCTTAACAGAACTCTAGATCCTT
03 . GGC AAATAGAGCAM CTAGAMCJW.JATITCIG AT
GCGGITITAAAGATGGC
CCGCTTTCCTGTGTGTAATCATCTCTGCTATCGAAAA GGTATCGTAACGAATAGC TCTGATTGAGGGACTTCTA
GTITCICGGAAATCACAAA
OS 'TCACAAACT GITTCTCI CCAACACT TCCGCGAT TT CT
TGGCAGATICTITTICGTG C.IGAATAGGAATTCCTCGA
06 CTCGACC GTGGA.T AA CC
APAGCCTITTAGITATTCC
07 'MAGI GGC111 AA
CTTCGCCAAGCTITGAGT
16 CGGAG7 AGAGAAAT CAGC:AGACTGCTICGCA 1 TCCIA
GAAGCTGAMCGCCGA I C17 ACITCICC:CAGACT G cnCGCAlICTIATIACCAA
08 TTACCAATG rraT ATGACAGAGACACGACGA ICI
16 CATGCACC.AATAGTfACCiGTGT1T1ACAGTfTGF GGA
CGTAICC.TTGCCGGTAITCACTCGAT A FTACGGAGA AGCAATAGTCCTITCCA IC
09 . CitIGTT CAAAGA 6 TTACAGITTGTGGAGGGTT .
16 Gal GIGIGTCITITGICITMCATAAACAATAT GO-CA.ATAC
CAATACGGG TAGGGAT GA GGG
TAG TGCA.GAAGCTAAGIT TAT
CMCGGNsCAACAACTGC
16 CA 1 CCTGT TCAGC IT TACGAACiCiAAAGCCIGTACAGA TCTAGGGAT
16 GGAAGCAGATAATITAGAGGAAGCTCITGCCaTIAT CAAGTAGCGAAAAATCCAACAGCCCACAAACCAAA
CCAACTCCIGTAA TACATG
CTIGCCXITTAIGTGGITT
16 CCTAAGATACiCGATCAAAGGTIGATCCTACTTAACCI
CTCiCTCAOCTCITGCGGTTATAACCGCGAATCCCAT ICCTACITAACC TAT ITTAC
16 GCAGGTITAGATGTIGIGGGITTIGGCTAIGGGITTI 11TGTCACGAAT Al AGCCACTGAT
TNIGGT TCTICTG ATITCCTTCATTACCiAC:AG
TGGCTATGGCTTITATGGG
16 AATGTICTAAAGGGGAIGIGGATITAAGCACT Ca CC AGATCCTCTAAAT
TiGAACCTCCGCATACTAAGTTGI GAICTCGCATACCIGAAG
AAGCACTCCICCIATC.ACA
TCC.AGACATAGCAACAATGACAGAAACTTC.AATAAA CCCCAATTTITCATGAAGA
CCAACAACATTATACTCCC
18 CCCCT . CGAAAGCG GC CT
TGATGGTTACTICC.TCAMACAAGAACGGTAAN 'µ GA
CTGTTGAAGTACGTCCCTT
TAAGACGTTGCTCTTATCA AGGCTGATAAATGMTCG
TAATTTCGCCACAAGACCA
TTGTCGCGAATAGTCACA
TCTCCAGCAGCAAAAACTAAAATTCTGCTTCGCiTTT 11TGAMTCTCTACTAAGT
CAATTGCACITCAAACTCA
GCACGAGTTACTGGGGITGAAA AACTGTGTATGAGGTTTA
24 AMC Te3TTC ACACT
GTITGGCAAGC:AAACAAC
16 TGLTITCTIGCOCAAGATCTCACCICTIACAAACCGCA CAAGTAG1CAT1GCTATGaCGAA1AGAG11GTACA
ATCACAAACCATAACAAT
25 TIT AC3CGATTIG At-AA
CCACTTACAAACCGCATTT
16 1TCACAGA TCCTCC.GTAAAGAAIGATTGCTICTCTCTT
ATTGCITCICTCITTICCAA
16 AGGTTATGGAATCGCGGTTATCAGGAGCCTGTGCAG Gres- i 1 u. I Go 1 I
AGGGAAAMACGTTITATTTITAT
27 AATGTG AGCGGACTTCT , TAGCTTCAACACGOCTCT
GAGCCTGTGCAGAATGTG
16 TGGAATTTCTCGAGGCGCTCTAAAGTAAC.AGGACTAT
CATTCTGTGGCTTCTGCATTAGATA/kGGATGGCCGA TAAAGTAACAGGACTATCC
23 CCCCT TTCGT CTCCTCTTCCTGCTGTTG C.CT
AGOGCTCTTAAACAATCTC
29 GCAAC.AC AAATAGTCGA A
ACCGGTACTTTGCAACAC
CAAGCATCACAAGATTTAT GMACAAGAAGTTCTTITT
16 TGGTCAAAGACGCAAGCTGTGACCTTATCTITGGGAT AATGCCGCTCGAGICAATGTAGTTCACGAGC.AGITT
GACCTTATCTTTGGGATAC
AGTTTGAAACTCCAATACT
AGAAAACAGGCAAGCCAT
16 AGCTAGCTTC.ATGGAGTATGTCTACCAACATAAAACT
TCAGGAGACGAGAAATCA ATATCATTCTCTGTAACAA
CGGTCTTAATTCCATTAAT
GTAGGTAAAACCTCGTITGGICAACATCTGTATCAA TTTCCATATTTCGTGATTG
36 CCAA : AAAAAGGCACIC CA
GCAGACACTACATGCCAA
s ICACTACTITITIATGGTMGT GIGGAAAAGCTTFGTTAG
37 1 CCGTF I CfC3TGC: , AGG 'FM
TIGTGTCCTTCCGTT
GGAACTGACCCTATAAGGAATGC.GTAACGAC.ATCA
311 CCA ' TTGCATGA GCTO1TGTG11CAAA3CT
IXTAGCAAAAATCTCCCCA
16 GCTCAGTATTTAGCAGGGCAGTCTTCCCAAAAGAC.TT GACAACCATTTTTGCTTTGCTGTGGGATTTGGCGAA
TCTICCCAAAAGACTTCAG
AACTGAC.AAAAGCTGCCG
41 . GGAIT AAGCATAC GCCCC3GAATIGGNITTGG
CCGTAGICIGTFCGGGATT
16 TCAGAAAGGGCTGAAGGCAGAAACGGTGC.AACTCG ATCTGIGCAGCTTAAAACCIAATGTCTTTAGAGAAA
CITTGGATAGAGAAGCTG
42 ATC GTC:ATAAATACGGA Cl' AAACGGTGCAACTCGA TC
GGCTICTGTCACCITGCTCAGCTITAGTTGIGGCTIC CAA.AAATAGGATT AGAAG
43 AGAAGAACITCC AAC CfCAAGCGTACGTCCITC AACTTCC
ACAAGAAGCGGTGTGAA GCTGGATTCACAATAGTAA
TTCTACATTATGTGCALCGATC if 3II i GG CAGAACACATCACTITAGA
16 TGGCTCAGICGGTICITCAGTAATCCACIACTCTICCfC
GAAGCTGGITCTACAACAGAAACIACAGITICTCCA TAATCCAGACI CTICCTC:AT
ICCAAAA.P.MCCICATCTIGGAGCATC17 TTAAATCCCATCCCATICA AAGGC'AGAAGCFAICTIGI
47 . G71 GTGGA GG T
=
16 CITGAAtTICGAAAGAGGATGTFGCCITGCITCAGIT GCCF GACCOCAGAAACCATTICTICTI
CATF AAAATC GACICTGAAGGTAA TATC
GGTCGAGGACAA I A GAATTTAATCCCICITCGA CCTITAATTACAGCTGGGA
16 ACACIGATTCTTGCTITGOCAGAATITCAACATIGCCIA TTAGGICCIGTAGC:ITI
AAICACAATGCGCTCCCAITTAGTGC:ATGAGAATCG AAAGGAGTCGGACCATGA
TGACTGCGCTATGAGGAT
16 G TAGGGGCTCAAGGACGAACTCAAATCAAI cavrnt AGAAGCAAGGATGAGGITTICT
GTGIIGGAGCACA CCAGCTTTCIGTAAAAAG TCAAA ICAATGATITCCAC
53 CACTI-CA CATC.ACG ACAA ITCA
16 GATTCHAGAGEGGGATCAGCAAGTIGGACAATAAAA GeICKTAAAACCCGACAAAGAGAACGTC:CCTANIAT
AGAAAATFGCGGAGGAAC
TTGGACMTAAAATGCGCG
16 CCTACCTGGITGAITAIGGAGAAAITACCGAA 7 CTCf AAAAGCAICIAGGAGC7CCCCAAAGGACIGGA1'AT -1ACCGAA TCICIAATAATG
SS AATAATGGATC.A GAAGTGG GCATCTTCAGGAACA:GTA GATCA
ATCTACTAAGCCCTATATT
56 AAAGCTCCTITITCGTGCCCGTGCTGCTATCCCCGATA . GGTG AAAGTC
GTGCTGCTATCCCCGATA
16 TCC.CATTCTCTICACCIAGGTTTATCCACCATCCC.TGCT
ACCACTCTCITAAGrAAATACTGACAGGTTACTTCCG CAAAACX.GATACAGAACA
ATCCACCATCCCTGCTTC
16 TGITC.AATCITTMTITGGAGCAGAAATTCAC.TGCTITA
CGAAGCGGTGAATGC.ATGAAGGATGAGATGTTGAA AAATTCAC.TGCTTTAGCCT
AAAACGACGAGGACCCTA
TCTGCCATCAAATX.ATTTCGAATGGGAATGACGAGT T/kAMAGTGAGCAACGAA GTOCTGCTTGAGANITCTT
TTCCGACCGAGCAATCTCAGTITGGGAGTAAAAGTT TAMTCCATAAGTAGTTFC GACGCTGTAGCATAATCAA
CAACACFTITTTAAGGCATCTTCCAGTAGTGTTGTTG AGAACGCAAAG Al-TUC-AC
16 AGAAGAAAACTCf:AATIAAAAC.GIGGTGTGOCCAAT GCAAAGCGTTCATATCTCGGACTAAGAAAAGGTCCr GIGTGCCCAATCATITCAG
TTCTCTL. G1TGATA ATATGCTGATCAAATITTG
65 TGGCT GCCT , CGA GCT
AAACCGGC6TA11TCTTCCTGAAAACAATTCCCCIGT GCTTACATTCAAGATGAG AGAATCC.C.CTTAG
AAGITA
16 TCCCCAGGCAATTTCTITGCAAGGATTAACCAGCAAA 1 AGACCCTC.
67 TACAC ' ATAGAGGATT CA AC
16 GGGAGGTAAATCAAGACTCCGATTTCCTGC.TrICATC TCTTTGICATCGATTTGGTCAGGGAGGGGAAAGCAT
CCTICTGACTITATGATCA
ATCATGGG
16 ATCGTAGAGTCTIGTATCACCTGAAANCITGGGATAG C.GTCCACTA&AAFATETTGAT'A-CGCCAATCAAAAT
AAAATTGGGATAGCGCTCT
16 ITACGCTAAAAACGGGCCTATMGATTAACCAAAAG TAGCTCTGCAGATTTTGTAGCACC.CTCCGGGATAGT
TAGCMCGATGCTCAACC
TAGAAGCTATGGCCAAAA ATCGCAGAAGAAAAATTCC
ACCTATGAAGAAGCTCGTC
CTGAAGGAATCGCTC.AAGGACGTGTTCCAAITTGGT AGACTITAGGAGATCGTGT
TGGAGGGGCCATTCCGTAATTGTIGITCGAGATCCT 1TA TCCITTATCTAACAG It 75 c:Acreracr 1 USG , ATCGATGGAGGIGIAGGC 'Farr AACCTGGGAGAC.AAGATGCACCATTGGAGATAAAA Gil ft I I fACTATTAGGAA
76 GGAATTAACG ' GCGC TAACGCACGGGAAAAAGT TTAACG
MCGAAACCATACCATTA TCAATGACCTTCCTATGTCT
TCTTIATCAGGGAGAGGG
78 TTCCC.TCC TGCGG T C CA
ICGACAAGATICCCTCC
GAAACGCAAACATGGGGT
79 . 1 GCC C711 A
AAAGCAGTGAAAACTIGCC
GAGCTTGTCCCAATTGCTCCGTCAAAAAGAAACGGA TTTGTAATTGCTGG 1lilt CITTCAATCTTCGGGACAA
TGAGCGATGCAGITICTACAGCAGAAGATACCCA ATGGCACGTCTTAAAAGA
'TCGTGCAAGTATGITCAAC
TGTTCCCCGTACAAGCAAG7TTCTGCCC.CATATCTGC ACGTTTAGAAATFCTAGAA
82 CTAGAACiAl GC A ACAAGNICAGIGGCAAGC GATGC
CTAAAAATTCGTCIAACTG
16 TGC:GAGTGlIATATGAGCAOGAACRCIACAATAGCG
CSCCAGGCAAAGCAAAAAGCAAGCTC:ACTGITGCT AA TACTICCACTAIATCCC AACTICTACAATAGCG
MC
16 IGTC:CATITATTISGTCTTGGAGI OCT TGAACGCAAC
GGTAACIGTTGTAITCGCACCMGC:GATATATCAATG CATA TCCTIGITGATGITC
8S , GAGAA AGGTAGGAC TTGA
TCTIGAACGCAACGAGAA .
16 'ICCAAGAATIAGACAMAGATCGCACCAMG1TCGC TlITCCIG !GT
CCATTTGTTCGCGAGTCA
EGGGACCCTCAAACTCAA
88 TTTTGC. CAAGAACT ACGCSGATCACCITITCTCA
TTGTCTCGCACGTTTTGC
16 CGATCITAGTAMTGCTCTTGCCTTACATTTCTCGC.AT
TTCGAGCCCTITTGAAGAGTGATCGC.AATCTAATGA ACATTICTC.GC.ATATGAAC
90 TACGTTT MATTI- rrc TT
16 IACC:GICG1 ATG TCGGCTGACCAT MCACCA1T TCAC
AGGGAAA1CAGAAAATCCCGCP.AAACC1TCCTCI TA GGGACAATAATGAAGGCC CCATMCACCAMCACAG
AGCGTACATCC:ATGA TTIATIGIGTIATCGCGGCTA TAT CACAGATACATCAAGA
92 AAGAGCT TCGTCTT C.GGAC.G1TATGGACTAGT GCT
16 GlIGTICTMGCSGTACAACAATCTCCAAAGCGACGA CGCGC1CCGITAAGATGA
TRAAGGCCCIATIGAC:A
CCAAAGCGACGAAGAACT
CAGGAGCMTAAAATTAC CAAGAGATGCAACCAGAA
94 AAG . CGGAAAG AGG G
TCGGACAATAAACTCGTAA
16 ACCTGCGGAAGAAGCTGTTGAGCTICATCTTGAGCTT CAGGACGC.ACCGGAGATAAAATTAAAGATGCTGTT
AAAACCCCCTCTITAGAAG
AAACTTTGGAAGATGC:TG GTCATTTTCIGTTCATTTCG
GAAAGGGATTIGTICTCiT
CGCTATGGTCAACAGCAT
16 GCCC1TTCCCAAC.TCCTTGAG11TAGTGCACCG1TACG
CTTAATCTTTCTGAATGGGGCGTTCCAGTAGGAGCA
99 A TAC.AGC CTITGCCGGAAATTGCTC
MAGTGCACCGTTACGA
TAATGGGAGTGGAAATGG TCGTCFATTAATCCITCIAA
01 AC.ATGIC ATCGATTC CTITTGCGTATGGCTACC C
17 CsIGGGAGCGTICGAGACATGATTAGTTGCTAIGTCT
CAACGAAATACGCTTGATACCCATTGGAAGTTGCAA TAGTGACAATAGTAGGAG ATTAGTIGCTA
TGICTACCT
17 GGACAATTCCGCATAATITCGGAAGCTC. rGTTTGGIT
AATGTCGAGGCTITAAACGTTTICATATCGC.AAACG CGTGATATAGCTATMCG
03 cm CCTTGT , GA
AGCTCTGTTTGGTTCCTT
17 CAATGCAa:CCTTC1TTCTGATCCAAGAGGTTCC1TCT 1 CGTCTGC.GTAAGATGCATCTTAACGTCTITC.CTAATC GTGGAGAAGTAGACTTTG
04 CAA i CCA AGTT
CCAAGAGGITCCITCTCAA
17 CCACCITC.AATC.ACAGCACTTAAGGAATTCGATACTT
GCCITAATGGTGTGATGC.CTGCTGCTTGCGITTC.TAAA CTTTTACTGCTAAGATTGG
AAGGAAITCGATACITGGA
OS GGATTC CTCT ATC.G TIC
17 CGGGATGATTGAAGTACAGTCCATGAL t 1 i ii i CAAG
TCCGTAACTITTGATTC.AGAATTGACCaCCCATGAA ATGAL CAAGCTAA
i is- I t CGCGA TGTTGTTTCGGAGAGAAG TCTTAGATAGTC.GCTTATT
AACTTGCTICTCAGTGAG
GTTGACAATGGCTTGAGC
17 CGGCTTrGCGCATAAAAAACATTTTCAGGTTGAGTAT TTGCCTTAGGAGAAAAGGGAAGGACAATCACACAT
CATCTCTCTCGCTATCTGA TTTTCAGGTTGAGTATATG
TAGAAACTATTGATCAAAG
TCAAAGTATCCG TTTA CGAATMCCCTIATTGGT TATCCG
AGAGATACAAAGTGTCTT AGAGTTAGTGGTCATAC.AA
GAGATC.C.ITMGCAAGA CAAAAAAATTATCAGGACG
s CIMACAAATCAGCCGATCCCIATCGATCAGATITAT ATAACGATTACTf GCTTGI
CCATCACACACAGCTITTA
13 'STAG 1 GCAGAT , GC C
TGCTCGCTGCAAATTCCGAAAGTITTGCGTTAGTAG GACTAC.AGTCTCTCCTGG
ACTGGAATATACACTTCTT
14 TTCTTCCG ' AAGCT 6 MG
ITCGAAGA TGCTTAGT GGGT AGA
AGAAATCCCMCCAGAAA
17 CTGTTCGAAACOCATGGCAGCMATGGATCGMAG CAAGAGCACAATGGTATCCGAGCTAGCATTGC.ATCA
TTGGCTTGGATTACGAAT
17 . ACC GGACTA GT
CCCATGGATCGTTIAGAGC
17 TCCCAACCAACTGC1TAAACATCTAGCT1TGATC.AATT
TGGAAGAGGTTCTITT ATGAATCGTTAGCACTGCAT CIAGCTTTGATCAATICCG
GTTGCTGTTTGCTAAGAAACTA
19 AGGA TC TM MG CAACIaTTICCGCAGCTI
'ITIGGCAGICATT GAGGA
TCET APS AGTFCGTTGCTATCCACAT TT
17 ACTC I i I i tAGACTCTGTGCGTGICAGCAGAGCAATA I
21 CafTGG 1 CGAA C
AGC:AGAGCAATACC1 MG
17 GCCGCAATCGAAAA1CTC1CCTAITAI1CCCCCTTAGG I CrAAGGGAACAGAAGGCITGA I
AATCITC:AAGCAA
TATTCCOCCTTAGGITTGG
TGCFAAAACiG1CGC6CTA6AACICATGA 1 Gra:617G A1GC11TTITGG-TTATGCA.
ACCITSTCTICTITCACi ITC
7.3 . AGTTCTG 1 GAG 16 113 =
GTOGAAATICGGGGGAGTCTMCTAAGGACAAAA AAAATITCCAGGGGATAG
TATTCATGCTTCGATCCCA
ICC( A
ITTCTCGCAAGGGAGTTGT
17 TCCTGGITITCCTCTCITAATAGCICP.MACA TTCIG
GMGTITTIGGCCTTATC
26 GCTCGCAATAGCAGTITTCCTTETTGGGCTICTCTGCG AAATCCAAAT rrre TrITTGGGCTTCTCTOCG
GTGAAAGCACAGGTTCAT
17 GTCACGAA I ATITTCCTGAGAAGGAA i GACAA I GCTC ACIAGCTAAGC:AGITAGGCCTIT
I GACTAAACATCC AGATGCCAAAATGACGAT ATGACAATGCTCTTIATCG
MTh TCGCCCACCCTTra:TGGA AGAATTA 17AAGCAAGGAT
TGACGGATGTTGITAAAGGITCTOCACANIGACATC AGGCTITTIGGGAGITAC
TIGCAAC.CGATAGATTCIG
TCTGA TTCTTGGC C A
17 MCI CCCGCAGT:CATA ICACCCGGAAGIGGCTACTAC:
TTCTCAGAAGAGGATGTGCATGAGAAA I CGGATCA CITITIATTGCIATAGCIA
CCGGAAGTGGCTACTACC
17 GTGTCTCGAAAAGCTGCTTCTAAATGTITaITCC1TTC
TCGCAGAATTGATGATCATATGCC:TGCGTTATCGAG ACAATTGTTAAGCGCAAT AATGTITGITCCETTCGATC
32 GATCG . CGTATG GG G
ATATGCTGAGGATITAGAACAAGCAGTATCCCCTTC CCAATATTGAAAAGGAGC
TTATGCCTTTGTTGCTGC
CAATCTIGG1TTTGTTGAT TC.AAAGAGC.ATC-ATACGAA
TGTCGTTCGGAAAATCTCC
17 ACAAGCTTGCTGTTAAACTCCTCCTTATGGCTC.AGCTT
CTCCTTCTITGACAAAGTCAGAGATCTGICATATTTC
36 CA TGAATTGTCT AAGACTAATCTTCC.AGCGT
CCTTATGGCTCAGCTTCA
17 GCGCATTGAAACTATGC.AGATCAATATTIGTTTTCCTC GGAGCATATITAGAAACAGTTCCMCATATCCTTA
ACGCTCGATAATTTGACG
ATITGTTITCCTCCGGGA
TTAGCACGTATTCCCCAGCTCCTAAAC CAGCAGCAGAATATCTAG TTGGAGATTGAAAGACAC
AGAGATAGGGAAAAGAG GTGTCAGICTITTTTCTGG
GATCATTGACTGTGTTAGC GTGCTrTGG1TAAAATTGC
41 TrGCTAC AA11TTCC , Tr TAC
GATCCCAGCCATCAATGTGATAGCTTTACTGTATCG
42 CATA i GTGTC.AAA ATTCCCGTAGTTAGTCCGG
ACCACAGGATACGGCATA
TATMTATCGCGAATAGA AGCTTAGTCTATTTGTAAA
TATGCCGC.ATATTTTAACGCeirACAACAGAATCCCT GCTTTATTGGGACTGGGT
TAGTTTGGTTGGTGAGTGA
17 GCCCTGATAGATGGAGCTATGGTG/WFTGGCGTTAT TCCACAGrrACAGAAATCGATGACiTAGACCGATGA
GTACTACGAGAGAAGTTC
TGAAATGGCGTTATGTTCC
AACCATCAAAAGCTICTAA GATAAGACTATCATGATTA
TTCGTCTACAATTGAGAG GCCCATTAAAGATAATCAT
17 CCAGC.AGCTACATTGCTACTAMGAACTCTGGATCAG GAAAGGGGGAGCTATTTATGCCAAGAATTGTACAG
AACTCTGGATCAGTTTCCT
48 TTFCCFT GGCCAC CAAGC.AGCAGGATCCAAT T
GCGAGTAGCTTCTFIGGT
ATACACAITTGCAGAGGAGAGCCCAATCTCTCCAAC AACAAGCTITATGIGGAT
50 CGT 1 CAGAC CV( ATITGTFCGGAGATGCGT
GCATCGACACCATCACCATCFCCGGACTGATTAATC GATGGAACGACTGTFTCTT
52 AAG 1 GIGAC , TAAAT GCT
TCCGCiACTATCGAAG
52 TTAAACCC ' GTCFCT A CCC
TTGCTCTGAAACACTTCTGTCGITTC.ATTITTITCTCA AATGCAAGAAGATATAATC
CACC-AAACGGTCTTATGA AGATGGAAACTGATGGGG
GGAAGGATCCTAAGAAAA AGGACCCATATACAAGAG
55 . AACiAGAGT ATTCAAATTCiGA CTGG AGT
56 GAGTTG ATGlIGCAC G G
AAGGACCAGAAGTGGAGGAAAGTACACTGAMGA ACTAGAGGAGTACAAATT CAAATGAAAACAT GGATAC
57 'FACTATGG CCTGC GCTT 'I/FIG(3 rrTGICATAAGGGAACTFGAGTCAAGA
SS GGA AGAAGGTICTG TFAGCGGGCAATTCCTC:F
AACAGAC.ACTATCAAGAGTIGGAGAACACATGCAC TCAAGAMGAGTCAGIC
59 CiAIGG 1 ATTCAGACTC GC CAAGTGCTIGICA
TGAIGG
17 CACTAGAAICACiCiAl AACAGGAGCAGAAAAGGGAA = CACA1GIGTMGCAGGGA1 17 AAACCGIT I CT( GAAC I AATGCT1IGGAGTAAAAGGG TGAGA !GAT
TTGGGAICCOAACGCGATATCITGCrr CAGTA.TCGICIAATGGAG GGAGTAAAAGGG TITTCAT
61 . ITTICATTCA TATTGAGAAGT CA TCA
.
TCTACGGAAGGAGTACCTGAGTTATCGT CAGCATCC ATCATTGGGATCrTGCACT TAITGTGGATICTT
GATOG
62 ATCGTC ACAG Tr TC
17 'FIG I GTAGGCTGAAGCFACiGTAGAGGCTGAAAAAA TGGCITCGGAGGAAATITG I ATG
TCAAGACAGAGG GIT1TICICAll ACTCTTGA AGAGGCF GAAAAAATAAA
17 TCCACATCCITGGTAAAGGTAGTCFTCMGTCIGGT TGAAGGAGTFAAC r GGAGTCCACMAGTTIGATAT CAAAAACTGCTGAT AAGT
64 GGATCCT CACiTCCAATF 7TCGT
TCTTTGTCTGCTGGATCCT
17 AAGCTGTTGTTGCAATTTITTGCCTICGTCiAATGCCTT
AAATCCAGCATTGTACACACAAGAGCAAGATTITC.T TTCACAATC.AC.ATTGC.ATG
C1TGGTGAATGCC1. TGGA
17 CC-Mr ATCrAGTCCTCCAACAACITGTGACAACIACr A I CACCTTACACAGGGAAT 1 CTGCiGITI CAGCTGGA AAGAAGTCi TACAATAAAA TG IGACAACT ACT TICAAG
17 CCTGACAGCAACTCCC1CATCATACAT1GGAGAAGC r ACIGGACTATGAAGCTAGAIGCACCi TAAAAGGGCT CATACA TrGGAGAAGCTGA
68 AATCCCAG AlT6CAAC CCTTGGCTGCAAAGGAAG AG
17 AGCT MAGA i CI-MCA illiCAATICiAATGACAGAAT
AGGAGAATTGGGAATAATAAGAGCCGTICKCATAA CCTAAGAT CTi CIACAAAT GAM
GACAGAATTICTCAT
17 CAGATCAATTITIGIGTCAAGAGGGTA1TTTCAAGrr CAATCAGATCC11TTTAC.TGGGGAGCC.AATGAAAT A GGGTAATTGACAATAACA
TATTTTCAAGTTCAACGAA
70 CAACGAAACTG . GCAGCAG TIMM ACTG
AACATGGATAGAGCAGTT ATACAACAAGCTCAAAAG
17 TAGCAACCTCCATGGCCTCCGTAGC.ACTACGGCAAAG TCATCCTAGCTCCAGTGCTGGICTTCTGGTAGGCCT
GAAAACAGAATGGTGCTG TAGCACTACGGC.AAAGGC
GCCATAGAAAAACACGTA
17 TCAATGGGCTGGACJAGGTCGGAAACAGGC.TACTAT
CCTGC.ATTGTGACCAAGACTTGTC.TATTCTrTGAGG TAGGTTTTGTACAGAGAT GGAAAC.AGGCTACTATACC
1TTGACAAAGACACAGACTCAGATMCCCAATTCT AAGAAATAGCGTTGCT. GT GCAAAAATGGCAGATAAA
RAGTATITTAGTCCTAAAG
AAATGTTGGGGATGCACA
CACTGCAACIACAAGTTAATGACATITGTTCCARTGTT AATCTITITTI CAAAGGCT
GGCAATTAGATCTAAGTGA
78 CiTGACC CAATTTGATCTT TTGCi CC
17 GTGCZTCTTGAGCAAATCTGATAITATCATGriTGCLT AGTGAAACTGGAGAATGGGAAGTGTTGGTGGAGC
AMAGACTGAGGGACATT TATCATGCTGCCTATITTAT
79 ATTTTATGGAT TGGTAAC , TGG GGAT
17 1TTGGATC.CCCCIGGAGGCGTCTAAAGGAAGGTACG 1 GACGACCGAAAGCGTCACCGTGTGATCCCCIATCTC GTACCCACTCAACGGAAG TCTAAAGGAAGGTACGGG
80 GGCG i GATCC C CG
17 C.CTCCAGAGGCCGTAGAGGGOCC.ACTIGTGCCAAC
OGGGAGCAGGAGAAGGA
AT IC AATACTCTTAAGTGTTATC
83 GTA GTCAAT C.ACTGGATATGGGC.CATT
GGTGAAGGGACGAGTGTA
CTCCIOATAGACAATTITT ACAAAAACTTGGTAGGCCT
TCCAGAAGITATTCCTACA GATAATACCCCAACAGTAA
17 TCCCTGACACCTTAGGCACTAAGTTGGACACCCTTAT ATCAGITTAGAGCTTTTCGACTG/kACAGACATGTCA
AGGTATTGGACACTGAAA CAATACTGGAACAAAGGA
87 CAAAGGATGA CATCC ATM. TGA
ATGAAMTAAGGCCITAT ITGGATATAGTAGACTCCA
s CC.AGTGACCAGICTAGAA AAATTGCCAGITCIATCTA
89 CIATCTATITCC 1 TCA TrACICC , GI *FTICC
GTTAACAGCMCAGCCAGGGCATTGTGACCTATGG APACTITATAAAAACATCC
90 ACATCCACTCCC ' GCG CCGCTTCACTGATCTCTF
ACTCCC
AACAGACGGAATTTGTCC AAAATCATGTGATGGTGG
ATCAGTTATATATGATTGA
17 TGCAATCTCATAACATAACGTGGATTCCTACTTGGAG GAGTGCTGCTACTATACGTAGTGTAACCATCAC.ACA
TAAGACATTGCAAGAAAT
93 . I CAGGAA AGIT:TWA GAAGG 'FCCIACTTGGACi TCAGGAA
17 AATCCAACCTITAAARTCRCACGAGGGITTTGTTAIG TGGIAACATGATAGATGGIMGCITGCC.ATCAAAT
ATATGACAGITTGOTTTAT OTITTAGGTTGCAAGGCTA
95 AACiGT3 AT CACCICTA ACTGGA T
ACGTAGAAGAATTIGTCITICAGGCGATGAATGTAA GTTAAAAACATFCCICGIT
GCACAACKATTICAGAGTG
TGACAAATATTACCCIGTIGTGGIGGAACICTCCAT TTGTTACAACTTCALITAC
ATTCTGAACCACCTAAAGT
97 I AAAGTTG GCITCIA I ACC 1.6 TTAACTGTCACAAGIGCIGGACAAGACTICAGAAGA AAA TIGTATIGTGCCT TTA TCCTGAAGATCIA
TGACTTI
TCi I ACTTCCACAAACTGCCAATCAC AT AAAAA If TGCAGGA ICI
99 . AGGATCTTIGG AT.AAGGTTA TTIGGCAGTGATGCAGAA TTGG
.
AAACTGIGCTTAAIGITGITGGACCGCTCTAACAAT f3ACAGCAG ITTGITAAAG CCGA
TATGGTTAAGICTAA
18 'faGTTICGT AAACAATGGAACCICGGGCACTAAGA ATC:AATG ACGTIGCATT TG I
GGGC.ACTAAGAAATTGGC
18 AACAGGCACAAACC I ATT AGTGAAIVAAGCCITGIT IT AGGAAATGIGITITG IGATTIGACi ATTGCTTGCCTATTACAAC
GCAGGITGTIOTTAATGGT
18 Tar ACCAAAA TGCATAACAGCG TATG I AAATGIGGT
rrGAGICGTGAAGATUTGAGATTGATGAATIAM GCAATATGTGA TIT TGAA ATGIAAATGTCiGTG
TAAA
TAGGACACACCGTCTGCIT I AGAGCIA GAAAATTGAG TAT AAACC AATATIAT ICI
IGACGGAGG
OS CGGAGGTAAG TCAAAACCCAAC TGAC.TTG TAAG
GGTGACAGMAGAGAGTGGIGGTITTITAACACC TIAAGCCATGAGAAAGCT TAAFAGACCITCATIGGIT
06 TTGGITGATG ACTTAAC.1 TCC GATG
"fGCCGiTATCACIATTCAG
18 GCGGCCTC.A.ACAGTAATiNAAAGTATACTGGCTITTGT
CTAAGGAATTGAAACGGCCCATTCITAACATCAGTA AlTACGATGITATGGCTAA ACTGGCTITTGTTCAAAAC
08 TC.AAAACATC . ACCGTATGA 7667 ATC
18 ACTGATT AAAAGCATCCACAGAC.CAGCATGTGCAGG
AACTGGTTTGAAACTGAACCTTACTGGIGTACITTA A GIGTTCTGATTCAAAATIC
GCATGTGCAGGGTAATGTT
GCTCTAATAGATC.AGGATT
ACTCGTICTATGTCGTATTGCAGTAAAGCATATACC CGTGCGGTATAATC.I. 'MCI
ATGCTAAAGGIITTATCOG
AAGCTTAAACCITGCTITTGGTGACACC.ACACAATCA TATCAGCTATTTAAAGGIT
GCCTGTGGATITTTTGGC
TAGGTATTACAGCGGTAA
TGCATATAGGGAGGCTGC
GIGGITAAACCTGGTGAG ITTAGCTGCTIATAACGGC
14 'F AACGGCA TTGATGCATAT A A
AGATGCTTTAGCTFCTATGACIGGTACCATTOTAAG TIGTAAITGla i F i G i ACM GI-cirri:1-c TGTGIACAAGCATACAMAGAGGCGTGTACTCAAC AIGTITAGTAITACGMT "FTGTG TIATAAGITTGGCC
18 CCATGTGTAAGTACCAAAAAGGGAAAAGGGTTCTAA GCAAAGGTTATTGCTAAGTGGGTTAGCAC.AAAAGC
ATTTC:FGTMCKTTFGTG AAGGGTTCTAACTTAGAG
17 CTTAGAGGAAG ACTATCTTAAT , GTAC GAAG
TGACTGATGTCAAATGTGCTAATGTACTTAGAATTA ATATATGAATGCTAATGG
CCCTCCTAAGAATAGTTIT
18 TGAAG i GAAGCAACATGC ATTGCG GAAG
AAGTGCCTGACTAGTATT CGATGATTACGCAAACCiA
19 AAAGGAC I. I G I I I IAACT GAAG C
GTTGTGFACCATTGAATGC
TGGCAGCAAATACTCTGAA
18 CAGC:TTGCAATCTAACTGTAGAAGATITTGTAMAGG GAACTGCTACTGAATAT6C1TCCACTTAGGATCTAC
71ACTGI1CAAGATGC1AA TITTGTAAAAGGITCiTAAC
GGATICATATGGIGGTGC TCTGTTTGTATATATTGCCG
AAGGGCATTTGATATTTAC
TITTGGTGACTATGTTATTGC.AGC.CCAGCATAGGCA TAATGTGTATAAAMGCT
CCCTAGTTAGCGCTACIG
ATCCAGCTTTGC.ATGTAGCTICACACTAAAACAGCA GCTACCATTATAAAGAACT
ATTGTGATGAATATGGATG
18 AATGGATAGCCAGCACICTTATC.ATAGTATTTTGAGA 1 TTGGAAAAGCCAGGCTCTATTATGATAGTATAAGCG GGAC.ATTAAGCAGITGTE
ACGCCGCCCAACCCATAAGTA ACTA TGACTGGCAGAATG AAATGTITGAAAAGTA TAG
27 AGI ATAGCA.GC 1 CAG6 , TT CAGC
GTTGATTCAACCTTTGTCACAGAATCGATCATCACTC TICAGMTTAACATATGT GCCAATGTATGTGCCTTAA
28 GCCTFAAT ' AAAA TCATCA CAAGC T
AGATGGATGGTGACGATGICTACAAAACATCCAGC CGTTTTTATGTCAGAATCC
GITGGGTTGAACATGACAT
TACAATGACCTGGGTAAT ATCTTGGATAGCTACAGTG
31 . AGTIGTAMIGG TCTC;CITAAA TCTIGTACAGGAICTCCGI AAATG6 18 ACGTCTC.AAGCACACTATAAACACTAGCTAATTTAAG
ACACATIGGTATGAAACGTTACTGCGCAAGATGTGA TGTAGGAGATGTMTG TT AGCTAATTTAAGTGCTCCT
32 MCI CCTACG CiTCCCIG TFAACC ACG
18 ACACAT Arrr ACGAGIGGIGICKITTATTGTTCCGGC ACCATAA A
TGCATTACCTGAGAIGGTGG TAAGCA TA ITGAAFATAAATGATIGCA
33 CAAGGF CTAAC:FTCATCT CTCG 'FAT MT
TCC:GCi CCAAGGT
AAAGTFCTAGTGCTGTAA TAATAAGTTITTGAAGGCT
ATCACCCAGCTCATGCTCCTAAATCGCCAGTTGCCTT AGCATTACAGTTTACTACA CCTTAGATAAAGTGCCACA
35 CiCC:ACAGG 1 A TTGA GG
18 CACAACAAAATIAATICCIGTGGAAGCGIGATAGCAT .
GAAGCCACIGGTTISTITGCTTC:ACCAGGAGGAGC:f 36 TGGGAC nc CCIGGGTTGGCITTGATG
GCGTGATAGCATTGGGAC
18 ICAC:GCCCTACITITGCAAAGTAGITACATGGGCAGC TA AACGTGCCACAG I TTACAA
37 . CM ACATGTAACACT C
AGTTACATGGGCAGCCAA .
TAAGGACICIFTI ACI TAT CCACATI ACITTGIGITATGAT ATTG
38 ATATTGGCAA ACAGTTCCAAA CITAAAGC.16CCATGC.TCT GC-AA
18 ATCCATA FACACACAAGGCOTATCTGCiCAGCCITTGA GCATOGAIGC I AAGCAGGI
TGACCTAAAT TGCATCT AATAAACAFGCATICCACA
GGCAGCCTITGAGCATTF
IGTGGCCITAAGCTCTC.iGG TACAAAGCTIGGAGAAFG TC:AAGACTGGTCAT TA I AC
40 TTATACAGG TGCiTGT 7I6 AGG
GGTGICTGC.ATGTATACA AATTGAATGFCCTTMGAT
18 CCACCAATAATCTICIGGITGAAGTTATICAGAAGTA TGCATITG ?TAM AGGCI
TGIACCGMACAAACICC A TACCITi AGATCAAG AT TAITCAGAAGTATGGITIG
TICTCCAAIGGAAAGAGITAGTCTCITCATCATACA AGATTITCAGYFTA FGCIT GI
AACGATGAGAP.AGTIAT
18 TCTGCATTC:CAAGAAAACTCTGITAACATAC:ATTTGTC RAT GGGGTAT
TITGCNCTITOGACGCCANI TAAAAA GAGTACAATGTGAGTA AA ACA TACATTIGTC:ATATGA
44 ATATGATTCGAG TCCTTCACTAC; GATGG TTCCiAG
CTGGACATACAGCCIGAAGACICCATCACCCCAATG ATMCCACITACCCA TIT T
AGGITTACAATITCAAATE
45 nrscis CATAT GAG CTCG
18 TGATGAGGGCiTTGATGGTGATITGATGCAGCACTGT CCGGCTAGACTTGAAATAGTTAAGCCCAGTGACTAA
46 CC.ATT . CTATGGTTTC.A GATTGGCCATTGCACCAT
GATGCAGCACTGTCCATT
18 MGGCGTAMATTAACACCC.ICATAGGAGTITTCACT
TC.ATGCCTTTAAATGCAAC.CGTAACAAGCCTTTATIC TATCATCTAAAGCTGGCA
AGGAGITTICAC.TTTACCG
18 ACAGATTGCMCGTAGGAACAGTTCCC.TCTGGTAAT GTAAGGAMTACGCCTGTACAGGGITATCAGACTG
C.ATTACCACTGGITTTGAT C.CCTCTGGTAATTAITTAG
TACCAAGGTTATTAAAGA GTGAGTTCCCTGCTATAAC
18 GCTTACTATTACAACITC.AGAGGCATGTITAATGTFTA
ATTGGGITAC.ACCTCTCACTTCTAGACTCATACAATC 7TATTICATATITTACAGAC
GTITAATCITTAITTAGGC.
CGTAAACCTAATCTTCCCA
SI TCCCAATT GCAGCATCAAT CTISTICAGCCAATCGCAG Alt 18 TCGAGGCTTAAAAACAGAATCTTCf CTGCTAATGITT
AG
52 CTGTTAGCA GCiACCITTGA CA CA
18 CAAGACCOGAACAGTGCfC:AACCIATAAGTGCCCCCA AAGIGATIATTGTOGAGGCAATTCTICTGCAGACCA
TCCTGATCCTAITACATIT
ACTTATAAGTGCCCOC.AAA
18 AGAGITAGCGTGAAAGGCCGC:AAACAGAACTITTAT OCGAACCAGCATTGCTATTTCGC.AGCIGTCGTGTAA
CTITACGGTTCTAGAGACT CAAACAGAACTITTAIGAT
54 CAI TUG 1 GACf ACATAA TCGT
18 GACAAATGCAGC.ACAATCAATAGTACZTTCAGAGTTT TGTGGTGATTATGCAGCATGTAAGCATTAATGTTAT
55 ACTATAGGTAA CACAGAAACTAC , GA GGTAA
TAAAGTAAAGTTATCTGAT GTTGAGGCTTATAATAATT
18 GCGGTAAGACCACCATTAATAAGTCC.AAGAAAITCT TGCITATGITTCTCAACAGCTTAGTCCTICTCCATAG
CAAC.AACTCTCTAATAGAT CAAGAAATTCTATCTAGAC
CACTTTAGTTATGTOCCTA
58 AGGGTT ACTCATA CTA.AG
TATGTCACAGCGAGGGTT
59 GACCIACA AAGCATAGCTAC.AC GCACCAGATTTGTCACTTG CA
GTTAAAGTTCTTAAGGCC AGACCTGAGAAGAAATAT
60 AAATATCTCC GAGTGAATGT ACG CfCC
61 AGTITTf GGA ATACCfTGAATGT TTGT GGA
CCAAAAGGGTICTGGCAT
GAACAGGACCGCATGCTA
18 CAGCCATCiTCAGGTGTTAC.ACCGCGTAGTAGAGCCA
64 An TCiCIGAGG GAGCCICTAGTGCAGGAT
CGCGTAGTAGAGCCAATT
18 AACCIAATIGCGCCGTTATAGCrATCTGGGAATCCTG 'IGACAGTACACMCAGGTTTTGAGIGATAGGCATT
65 AC:GA CAAAT ICITAI , GI TGGCCAAAG 1GCAGAA sf Ara GGGAMCCIGACGA
18 GGTTACCACCAGATGCCGAC.ATACTGAAGAC.ACCTCA
GCAGAAAAGTCGAGATAAGGC.ACCTATAGTCTGCT TICTTAAGAAGATGGATG ATACTGAAGACACCTCAGA
18 ACT7CTGCTTGGGCATTAGCAAACTGACATGACTAT GGAAGGACCTCTTTGCATCt=GATTCGCTTTCAACAT
AACTGACATGACTATTGAG
CTACAGAGATTCGCTTGGAGAAGCTICTGTITTGG A GAAATCICACC.ATTGCOT
TTTTCCAGGACATACTATT
TTCCAACACTGTGTCAAGC
69 . 1 GMT GICCAG I
CTGCTITOTTGGGAT Gra:
GAAAGTGACAGGGCCCCTTITTCAATA TGATGGITT GCTACCTAACTGAAATGAC
18 ICCATTCAAGTCCTCCGATGAGCTICCAGGACATACT AACACAGTTCGAGICTCTGAAACfCCCATICTCATCA
GAAAICICACCATIACCIT
71 GAGE crc3a. CC
CTICCAGGACATACTGACG
GAGGCGACACTCCACCAT GATCACTCCCCTGTGAGGA
72 AGGAA rraxi G A
TGCAACATGAGCACACTT AAAGAAAGACCATCCGTCG
GTCCATCCTGGGGCCCAA GGAGGICC.CGCAATTIGG
18 IGGGGTC:CAGCACGIAGATGTAC:ATTCTGCACACACC
ACACCTACAGTGGCAGTCACiGICGCCiCCTACTAAT A GIGTA.TGAGGCCXAT GAT
75 . CGG GGIVA CT
C.ATTCTGCACACACCCGG .
18 CCACCACCATACCCACAGaiGGACATCGAATGGCTT GCAIGICCTCiC:GITTACCCCAGCCAAGAIGCCCCAA
TGIACMAGGCTATCTITC GGACATCGAAIGGCTTGG
18 GC:TS:YWCA f CAGAAGGACCGGAGCTGCAAGCCCA1 TACMC TGGCACT
ACGCACCGGCCGCAGACACTT GA ACTGGATGTCCTCAAAGG
AGCTGCAAGCCCATCACT
18 GCCCiCTGGGACITCAGCAGGGCGIGCCAACCIACAC CFCCCAGTCiCiTCGGTOCi ITT
GCGCAC:GTC 1TGAGA A GGCACTACTGATCiCCAGG
GCGTGCCAACCTACACC
18 CGATCCTC.GATATCGCAGCGCTGAACATCGATITACC TGACCGCACTGAGCAACATCC.AGCAAGGCAGTATA
AAGGTGAGGATGITTGTG TGAACATCGATTTACCGCC
18 CAGCLIGACC.AGGTi CTCCAACCCMCiGCTGA TGCT CGCTGC1GCTGGGACACAIGT
TACCCCGCACGTACC
BO GAT AC GMTOCTCACTTCTGGCGG
CCCITTGGCTGATGC.TGAT
18 G TGGCCACTCiCTAGGICTITGAAOCIATACGACCACC AGGICATCACCIGGGG
fGCGCIAATCGTGCAGAGA TACTGAGCATTGGCAGAT ACCTA f ACGACCACCTAGC
CCCAACGAGGICC IIGTGACTACCITGACT G GAGACAAGAACGTCiGTGA
18 GAAGGGCACAT AACGGCACCTCGCTCAGTCCTAGGC TGlIGCGGGGA
TCTITACiGGCTTGCTAAGGGIT 7 CC
AGTCCTAGGCT.T. CTC
TACTAAATCCATCGGIGG
84 CTC . CCGTAG CG
CCACATTAGGCTTCGGCTC
18 TCTAC.CfCGACCCGTTCGOCACCTITICCATTGAGACC ACGTACCGATATGTCGCCTCCGACAGAGGACGACC
AGATGCCCACTITCTGTCA AGACTAAGC.AGCAGGGAC
86 ACT GCfT C T
18 GTCTGGAACGAGTGCTGGCTTTCGGCTGCGTTGTGA ATGGAGGAGTGCTCACAAGCTGC.CTTGAACTGGTG
TCGGCTGCGTTGTGATTG
18 TTTGGTTGGTCGTCAGGGGACTITGCCTGOCAACCCT TACTCGGGGGGTGGGTTGCCAAGCCGC.TTACCACG
TTGCCTGGC.AACCCTGC
GAGGACATGGTCAACCTG
GCCCGCCATACTATCTCCG
90 CiGC GCMG GACTGGGTTIGCTCGGTG
GMAACAGCTACCITTGAG
91 AU GG Ci GGGTGGGGGACITCCATT
18 GATGAGGCCtATGLGettuGCATCCGCCAGCCAACT CAGACGCTGAGCTAGIGGACGTGFAATGITGCTGC
CrCCATCAGAGGCAAGCT
ATCCOCCAGCCAACTATCA
18 GITC:TTTTMCCGAGGGGGATGTACCAOCAACTGTC TCAGCTGGACGGCTCTAATOGTCCTGTGGTTTCGA
TGTACC.ACCAACTGTCCAT
93 CATG GGAGG , CTGTTGGACCGCTGGAAA G
ACCGTTAGTGACAGCGAG
18 CC.CTGCTGGACAAGGAGCGAATCACTCTGCCCGATC 1 1TC.CAACC.ACCATCATGGCGAATGCGGCCCCOTTA CTCACCATCGAGGAAGCG
95 GAA ' GC T
TCACTCTGCCCGATCGM
18 GCCrCCGGTTCAAGGTTACAGCCACTGAACAGGACA CTCCC.TCACGGAGCGGCTITACGATAACCAC.ACTGG
GGGITCTCATATGAC.ACCC CACTGAAC.AGGAC.ATCAG
18 TCAGCCACCACGACCAAATCATCAAAGGCCGCAAAC TAGAGCAGC.CCTGAGAGCCITTGGCCTGTGGAGCA
TGTTACATCAAGGCCACA
CAAAGGCCGCAAACCTCC
CGAAAGC.AGGTCAATTAT AAAGAACTAAGAAATCTAA
18 GTC.ATTGGTCCATrCCTATTCCACGGATCAGACCGAG AGTCGAAAGGCTAAAGCATGGAMCGTATrTTGAC
TTATGGAGTAAAATGAAT
99 TGATG TTGGTrTCT GATGCC
GGATCAGACCGAGTGATG
19 TrGGTTATCGTTAGTTGCGATTCCAAGTTGTTTTCCCT AGAGAAGAAAGAAGAACTCCAGGATCTCTCCAACA
00 AACGA.AG TGTATGCAAC GG AG
CCAGGAGGGGAAGTGAA AATGATGATGTTGATCAAA
19 CCTCATGC.ACTCTTATCTICAATGIACAAGCGGATCAT 1 TGAAGAGTTCACAATGGTTGGGATC.AATC.FCCTGGT TTGGIGGATTC.ACAITTAA
02 CAGTC 1 rCiCIT I GAGA
ACAAGCGGATCATCAGTC
i AAAGTGOTTITCAAAATTGGGGAGCAA TATCCCAA TIGTATGATAAAAGCAGI
GATCTGAATTTCGTCAATA
03 Ci TCAATAGGG 1 TCATTCCC , TAGAGG GGG
19 TTCACTGACCTCCTCGGGAGATTGACCGGTrCTTGAG 1 ACACAGGGAACAGAGAAACTGATIGACCAACACTG
GAGAGGGTAGTGGTGAG
04 AG ' ATTCAGG C
19 CCTCATTTGTTGGAACAGAGTTCTrAGTACCTAAGGC GATGTGCTTGGGACATTTGATACCACTTTGCTTTGG
TGGAATTTGAACCATTTCA TAGTACCTAAGGCCATTAG
OS CATTAGAGG TGGAGC CT AGG
06 ACCC CGCAAG Cr C
19 GCCAACCTCTCAGGACAGCCTATGAAAGCCITAACAC CTGCCGACGCCTTAC.CGATTGCTTCCGTTGGCATAA
TCAATAGCACGGCCTTGA ATGAAAGCCTTAACACCGG
07 . CGGC CTGA AC C
09 'FGAGCTCGGACCIGTCCCTGGCCIGCAACTGGACGC GAAC GCACAGGCTGGAAGCG
GCCTGCAACTGGACGC
TACTCAATGCAGCATCCCT
TCCCTG ACATCC AAC1CGGAG6C6GC11' G
1GTGT3GTACACCC6.ACTC
11 ACTCT CIA CGCGATGCCG ICATMAC: :1 IATCTCCTGGCCCCTAC
12 GT G GAGACTGGGC.GCACAAC
CTGC:GAGATCTGGCCGT
AAAGGTCCGAGGAGC CTGCTACCCAAACCTICCr CAACGTGCA ICAATC1C1GG
13 , GOT C13 G T
.
19 ACACCGGGGATCTCATC3GlIGTGGIGTGCACCCGIG TCACCiCIACAACICCICTCCA6CGAGCA1 GCAGGTGG
GGTGTGC.AOCCGTGGA
19 'IAAGCACCTCCTGAGCACCCGGCAGCCCCATCACGTA TlIGTC1ACGAGTGCCAC
TCCACCGTCAAGGACAG I GGACCGGGGi IGAGAACA
1S Cr GCCGA A
GCAGCCCCATCACGTACT
19 AGTAGGCCACGGCAT(GATGCTCAAAGAAGAAG ICC ACCGCGGICTIGACGTGICTGAGCA
TCGGTCGACAC TCAAGGGGGGAAGACATC TC:AAAGAAGAAGMCGAC
19 ACAGAGGACGGACGAGTCGAATCTAC.AGATTTGTGG GAGTGCTATGACGCGGGCTGAGCCTAACTGTAGTC
TCTACAGATTTGTGGCACC
ACCIGGAGGICGTCACCAGMCiACAGGCAMACG TGTGGAAGT GT11 GATCC
18 AT aici GC
GGCCAACACCCC.TGCTAT
19 GMAT AACCTCTGCTIGGCGGAAGGG A TGATGCTC:
19 arTGCAAGAATGFCCACGAGGTGGGCGCTGOCTTA AGATCA
TGAGCGGTGAGGICCCAGAGGP.IGGCGG
TGC1GCGCTGGCTTAGC.
19 TATGGCAGTGACGCGGGCGGGAACCATGOICCCCC CTCAGCAGCCrCACTGIAACCCACACrCCGAGCTIA
19 CrCCAC.ATGTTCCTGCAGGTCCTGGACATGTCAAAAA GGGACGTTCCCCATTAACGCCTAGTTCGGCGCAGG
TGGACATGTCAAAAACGG
22 CGGGA . AAGG CTGCCACTGTGGAGCTGA GA
TGCTGCGGGAGGAGGT
19 CGAATCTCCGAGACTTCCGCAGTCGATCCGCTTGIGG CCGGACTACAACCCCCCGCTCCAC.AGGTGGTTCGTA
AGTGGTGATTC.IGGACTC
24 CA GTC Cr TCGATCCGCTTGIGGC.A
A TCTTCCATGCCCCCCCTGGCCATGACCCGTCGCTGA A
GGCATTACGGGCGACAA
19 GCTGCAAGCTTCCTCTACGGATATrACCAGGACGTGC CTGACGCCCCCACATTCAGCCATGGCAACGGACGTC.
TGAC.AGACTGCAAGTTCT AlTACCAGGACGTCiCITAA
CCTGAGAAGGGGGGTCGT
TTATIGITGGGGGCCCTCTT
AACMCATGCTOCTCCAAC
GCGCTGGAAAGAGGGTCT
19 1677.7CCAAGC1CGCAAGGGIGAAA1CAATAGGGTG GICCGCSICIAGGCTICIGTCMCCCAGTICAAGAGG
TGAAATCAMAGGGIGGC
30 G C.CG 1 TACT GCCTCAGOGC.ATITTCAC CG
19 TGCACCCATGGAC.ATAGCTTCGAATGGGGTACATATA
ACCCACIAAACATATTGCTITTGICITTGTFTTGTGCAA ATCTAGACAATAATAAAT CGAATGGGGTACATATAA
31 AAACAAG AATATACAGCA , AGGGAGG AACAAG
TACCIACGCCTAAAATATT TATTCATATTATGTAGAAG
19 TGCMCCACCGCATCTGTATGGATATCCCTC=TATGCA GCACAACAAGCGTATAGGGIGTGCACTCTACC.ACCA
GTAAAAGATATCGAATTG GGATATCCCTGTATGCAAT
GAAAGITTGCACCCAGTC
AlTTCAGCCCCGTGTGTG
¨....
19 CATTGGCATGGTAGCACTTGTATGCATAAAFACCCTA ¨
GGGAATGACAGTGCTTATGCTGTTTAGCCACTGTTT GCATAAAGACCCTAATACA
GTGCCTTTAAAACATTCCT AGGAGTACCAAAAAAAAG
C.AGCTCTTCACATTFAAAA TGACAACACAAGGAGAAC
19 .ACCCTCGTATGAGACTTITCCTGTTATGTGCAGAACT ATGTGAATTTCAAGGATGACGCTCCCACATGTACCT
TGTGCTATGTACAATGGA TTATGTGCAGAACTATAGT
AAAGCGAC.AGCGACTCGA
19 CCAAAACAAAC.ACAGCAGCAGTGGATTMACTTGTG 1 GCTGAOGTGCTATCFGGCATTGTAAATATAATGCTA
AGATCTATAGGGTGATTG TGGATTITTACTTGTGTCG
40 TCCiACT 1 TAAGGCCC TACAT TC ACV
TGGCACIAICGCATITTGCAGIGGITCCAATGCCTAG
41 CICCIC 1 GCCC , CATMGCCACACACiCITT
ATAGGACCTGCAAGGCCTC
CCCCAGGIGTGTTGGACATCGCAGGA1TTGTAAAGT AGGACTCC.AGCATFATTA
42 CTAC ' TTGTACT ATFC
AGGGTOCACCATACCTAC
TACTTGCTCCAGCTGCCFCTTCACCATCATCAACATC CCCGGGTACATIACTATCA
GCGCTCTAGTGACAACIT
19 AGGC.ACGTCAGACF7ATTTAATFOTCTA17CAGGATG
ATCCAGATTATTIGGGAATGGCCCTGCGC.AAGTAAA CCCCTGGAGCTTATAACIT CTATTCAGGATGGTGATAT
45 . C;TGATATCIGT AGAACAT C CiGT
AAGAATATTTGAGGCACG AGGAATATGAITTGCAGIT
19 CAGI.GGAACAGCAAFTACACAAATITGTIATGAGCAT
GIAAGGITGlICTAGGTCAGGAGAAATTGGTGGTG GITIGIAATGICIATG TAT TGITAIGAGCATGTATGTG
47 G TATG7CiTG TGCACAC G161731 TG
48 AT TCT TG 176T GITIGTCGC:CTGATICTGA
CTGAGGAGGAGGAGCAAT
CACTCTGGAGGATTTGTFT
49 TTICIFTTG AC.AGTCX1CTG ATGTGIGGOTAGGITGGA G
19 GTCCCAGTTICIATAAGGAAALACAITTCAAAAAACA TATCAGGATTCACAFGAACAGTGGCAAAGTAC:ATIC
CACACACAGCCAAAAAAC TFTCAAAAAACATGGCTAC
50 TGGC.TACAC CATCATAATCCAC TG AC
19 CiGGGIAGAGGGGTA fGCGGITCT AGTTCTACCAGGC:
ACCGCAAATCGC.ACICIGGAGTCiAGACAGTGCCTCT *FCTACi TICIACCAGGCAGC
51 . AGCC CTCC T1TTAC.CCMCGCC7CTGT
C .
19 GTFCCCACF GCTICAGCiAGC7GACGAACCACCGAGA
AGACTICAAGAGGAGGC7CGGGTAAGCFGFTfCCC GAATCGCCAACCAAAACIA
52 CAGA GGACC. T CC.
ACGAACCACCGAGACAGA
ICTAITACACGCCCTITGAGIGGGATA FTAA AT TACAAATII TTGG MGT
ATTTGGGCAACTTGGGTA
19 TCCTGGA FCACiCAGTGA I HAM-RAU-Ft ITCAAT I ACATCATTAGGIGGIGCiGCiAGICA
I ATCiCAGGATT ccr AC TGATAT TAT TGACX:
GAGCTCTTCAATTGTFCCIT
19 TCATCATCAAACICiCGGGATTITCATITTGTAAGGCAA
CATCACITTTGONGTTTCACiCAGTGAAAGTCf ATATC AlTACAAACGAGAAACCC
SS CCGTC TGGTGCAG AG
ATTITGTAAGGCAAC.CGTC
19 CAI CAGG FACTATAGG ItTG 7 CAGIACATIGGCCFCI A i CACCTACITACCCITACACACCGGGTGCAAGAAA GAAAATGAGCAACTTf TT ACA
TIGGCCTCTAA.TAITC
56 AATA1TCCT AAATCATAAT AlTCCC CT
19 CCC ITO ACA (ATKA ICCCi IACTCAATCGTCiGAAAGT
ITATGCFGGATCTGAAAGGCTICTITCCAGGATCHA AATCGICGAAAGFIATATC
19 GGGATGACCTGATGTACTAATACC:FGTTGGTFTGGAA
ACAGAAAATCCTAACACGTAICCTGAGACACGTCCA AAA ItTCI GT ATAAFCCAG GTIGG
ITTGGAAAITGATA
SS ATTGATAGGATF TTCTATTGTC ACAGG GGATT
ACTCCr GAT CACCAG VG GGAGAACA11 AAAGGITTGCGAAAA1GTC
19 CCTGCTGAGCACCATAAATAAAtailt.!ACGCGAGGC
ACAAACCTCAMACAATTIAGGACC.TTGAGTATAAA TGT7t. FFFFFF GGAAGAAA
60 ACTTITA . GAGCCACTAGG GGAA
ITTACGCGAGGCACITTTA
ACTTGC ATACTCGGAACATACTATT
ATGCTGCCTTTTAAGCCA
19 GGTCTACCGTIGICTGTC7C.AACTAAAGCAGCTGITA
AAGAATGAAGTGGGCCATGTGTTGCCATATCAC.CCT GACGTEGGCTAAAATTGA ACTAAAGCAGCTGTFAAGG
TCAGAAAGATGAGACTTGACAATGCTTACTC. ICTCA TAGGAACTAAAGGACAAA
TTCAATAGCAGCAGTIACC
AACTCCCATTGTACTGGCATACATACTGGCAGGAAC AATA1TATTC.CCTAAAGAA
AATACCAAGAGAATCTACT
ACTAACTGAATCTAGATCT
ATGATTGTAGCTTGCAGGA
19 Gt. i i i i i i TCAATATCCCCCTGCATATGGGACGGAGA TTCAGCCAAAAAGGAGGACATGGIGTCTTGAGAAA
CGGAACAATACAAAAGAT
TATGGGACGGAGAAGAGG
TGAGGAATCACCTAAAGCAAGTGAGAATGTATAGT GGAGAGATAAATITTCTT
GCCAACTITTATMCCCA
CCAAGATUGGCAAATGGGAAC.ACTAAATITITAA TCTTAGTTIAATAAAAAGG GAAGITATAAIGGGAGCC
68 AGC.CAATG 1 GC.ACCCATTG ACTCiGG AATG
19 GGTCAGTITTCATAAC:CICTIGGICGGTCAGTACAGT AAITUTTGCCTITt R, CGCCAAAAATTGATAA TAATCCCTCAGAAGATGG GGTCAGTACAGTGGAMG
69 GGATTTGC GGCTCTCCATT , C C
19 GOTITTGAAATCTC.CAAGATCAGGGATERIGGAATGC TGAGGAACTTGAAAGACMAAACCGACGGGCMC
TGAGGATGAAGAAAGAA
ATGGGGAATGCAGTACTG
GAGTTTGAGCGATTGACG GTC.TATCAGCTTACGATCT
71 ACGATCTCTT ' GGGAG T CTT
19 AGCGACAATCTATCTTCACATGATTGTTATGCC.CCTCT TCTTGGACAAGAATGGCATGTGAACATGTAGATaT
GAAGTCAGATGAGGGTG
GTTATGCCCCTCTACTGG
19 CACAATGGACAAGAAAAACCATCCGATAATTGTGATT CTAGCGAAGITAAAGCC.CAATCATCATCATGGTTAA
GTGGTITTGITATGCAGTA GATAATTGTGATTTITATG
73 TTTATGGTTGGG C.AGTATCAGTAC TGG GTTGGG
GTTAAAGAAACTGCTOTA ITATGTTCAATGIAAGTGT
TTTGICTITTAGATCAGGCTTGGCCAACAGGITTCTC AGAAGATGATGTTATTGA TGAGTATTGTCAACCACCT
19 GCAACATAAGACAAGCTACATTCACCGGTCCATTM CAGIGGC.AGGAAGCATGGTAGCTAAAACAAGAGCA
TTAACAGTATGTAAATGG CGGTCCATTITTFTCTTTEG
CTCCTCTGAGTAAGGATTTACCTGAGCCTAC-ACCTA TTAAACAAGAAAGTCGTG
ITGGTACATTAGC.AAAGAC
78 GACTG 1 CAOCCATA rrGe TG
i 19 TGAGCAC:AAATATCATGTCCAACT All AAACCAMG 1 TGITGAGIACAAAGIAACAGTCTGGACATATAAATC TACAAAACCTATTATAAAG
ITAAACCATITGCTAAAGT
79 CFAAAGTIOAC 1 ATCACi ATGC:GAA , GCTCACi *FGAC
ACGAAATTGTAGTGTTCGTGTTAAGCCACCATTAGC AlTTGTAGCTAATATGTTA
CGGITTTATATAGTTGTTA
80 GTTG1TACTGC ' AGTAATATCATAAT CCTGC CTOC
19 CAACATCTGAATCAATGGAACAACAGGGIOTGCAAT CAGTAGCTGCTGGTTTGGAATC.ACTCTITAAATATG
TITTGTTAACATTGCTCAT GGGTGTGCAATTAGAARA
AATGATTITTATGCTAGTG CTITGIGTACTATGITTAAA
19 AC.AAGTIOACAGCCCTGCAITAATGTCTGGTCGGATG TGGTAATGTTAAACCTGGTGAAACTGAAATGCCCCT
ATTCTGCCTTATTGTGTAG
83 . AG TGTGGTC AGIT
'FAATUCTGGTCGGATGAG
19 GCAATAAC.ATTARCAGTCTGGACGTTTATGGICCATA
ATTGTGCTIOGITIGTAC.AAAATGATITGCCATAGC CACACTGGCACTGATMA 1TATGGTCCATATAGAGAT
84 TAGAGATGO CCAAACA C (SCI
AAAGCATGTAATATAGCT TGTATGAACGIGATAAAGC
8$ 't AAAGCTG CIAAAAACiC AAGTC TG 'FG
AATTGTGTTAAAATGCTCT ATCATGCTGGTACTGGTAT
AATTAGGCCCTATITTTAA TACTGTCATTITTGCAGAC
19 GGIGTG TG ICAACA TaAAGTICATCCGITIOTTGM
CAGATCCTGCIATGCACGTTGCACTAAAACAACAAG TIGGICCCa MT TAGACA CCG IT TO n GT
ricrArr GG
SS C.TATTGGT TTCGTAAA A T
19 GIOCiGCAGAAC.ATTACGITTAGTATARGCC:AGACTr! TGOATCAGTGCTAAGAATAGAGCI GGCC I
GT(ATA TGATAAAAG TOCTGGT TA AAGCCAGACTITATT ATGA
89 . AITATGAGGC GTACTAAGAAT TCCAT GGC
.
19 CaGGCCAAAACIAAACTACIAACAACCIGT TGIF ATG
ACATGAATTTTGTTGTTCACATGGTRAAAACTTGA CCITCATOTATAAAGGAI
CaGTTCTTATGGGITGGG
19 GaCTAT CAAAAGAACACIGTCAGTIOCAG A ICCTIC
AAGICTAGCTATAGATGCTIACCOACACGAAAGAC ACTA IGTTAGITAAGATA
TTGaGCA.T GGTAGCAGAA TrATGAA TOG TATG611.31 AATCTTGC.ACGGGTTCAC
19 ACACC.AAIGGAACACTATAAACACTGITiC.ACCTACAC.
ATGITOCTAATTATC.AGC.ACATTGOAGACITTCCCG GTTITAACATCACATTCTG
93 TTGIC.C. TACCAGG TAGC
GTGCACCTACACTIGTC.C.
19 TGITCCAATOTMCGCGAGTAGIGTI AGAGGTIGGO (TT
TCCACTGCAAATAGGITITICAAAGTATAACAAT TACTAAACiATGAAGCCAT
94 rrGe. C.TCTCTCAGCAA TAAACG
TGITAGAGGTTGGGTIOG
19 AAACCIT TAGGATIOCCIATG ITAIGGITATTGC.AGC TGATGCTITTCCIOTAGCCAAGA KT
;TA IGCACATC GI ATCCAATAATITO AAT
9S GIOTCA ATAGAC.AT GAGGTC
19 GCGCTICTIAAAGGAACGTAATCAACCITTMCIATT A 1 C.ACACGGTG
TAATCTAGGIOGGGIAGTIACAAT GAACIGITITTGAAAATCT CCITTITICTATTATTC.AGA
96 ATICAGATAChIC. ATTCITCAGC.ATG TAAGCC TACGC
TCGCCIOTCrGTICGGC1 1 AGAAAAGCAAGAAATGG ACT AGGTT AI CAATGATTA
19 CGC.ATATTAACAGGAAC.AGCTAATGACAGGCTGTAT MTACATTTAGGTOC.AGGGICTGCCAGATGGTAAC
ATGGTAAACCTATTAATTT ACAGGCTGTATGATGAAT
98 GATGAATGT . CACTGTC GCCT GT
19 ACCAATAGGACAGGAAGCAGAAACCTICMTGCTTC ATCGTTCTTGTGAGAGTACTACTUCAGGTAAAC.AA
AATACTTITTGTCCTTGTG
AACCTTCTTTTGCTTCAAGT
20 CGAAGAGGAAGAAGAAGGTGAGTTACTGATTATTCT TCGTAAACGTAGAAGTAMCTGCTAMCTGAC.ATT
TGATAGTTATCTIOGITGC CTCATTATTCTGITTCTTCT
TCTCTCAACAGCTCCTTCTCC.ATAGC AATTIGGTGCAATTAGTTC ITACAAGAAATTITATCTC
TACAGGGTGTCAAACTTGGCATACAGTAACATAT AC CCAGAGACCAATAATCTT
ATTGATATCiAAAGGCAAG
AAACTATTAGGATGICTTA
03 ACT TrAGGTTG TACTCC
CGGTCATTATGCTGGAAGT
20 GCMTGGCGAGG I I I Ra I I MAGTAAAGATTCTAAA
TCAACAGTGTMGGTAAAAGAGGACCAAGCTTTAA CGCTAATCTTGITITAGCC GTAAAGATTCTAAACCTCA
04 OCTCAGCA CATTIC:AGCM AAG GCA
20 AGAGTCAAACTGTICIGGTAATTGICAGAACACTGAI a GGTACTCAGCACAITTCAAAT
GGGATCATCAAGA CITGAAGAGAATITAAAT CAGAACACTGATTCTGATI
TGCAGCAATATGCAOCCACTTOGCCITG1TC.ATTGAT AGGTACTGA ITCGAAATG
AGATTTIGTGCGACAATGC
20 OCTCTGTTGGCCATTICTIGTGAAAGCAGGGCTAGGA C.GMAGAGGCGAAGAAACAATTGATTIOGGTGGGA
, AGACTACACTCTCGACGA , GAAAGCAGGGCTAGGATT , CC.TAATOCACCICCTIGTTATCAGTTCAATGCTCA AT ATGTCCAAAGAAGTGAAT
08 GAATGC i TTCAAAGC GAGGGCAAGCMCTCAA GC
AAATICAAATTACCIGCTG TGGAAGCAAGTATTGTCA
20 aCCITATGCTTGC.AATGTACTGATAGAGCTCGATGA GAATTATTTCACAGCAGAGGTGTCOCAGTATTAATG
TGCGAGCT/ACTGATTCA GATAGAGCTCGATGAAATT
AATTGG TATACCCCCTTCA AT GO
ATGIGGCTCAAGTCTCOGGICAGNGATOTGGTAAAC AMTGGGAGAMTACTGTOTCOTGCCITGAAATIT
ATATGGATTCATCATAAA CACAGATGTGGTAAACTTT
TCGAGAGCATGATTGAAG
AAACGATCCTATGAACAG
13 ACGC TR.TTGGIC MG
GAAACTGGTGGAGAACGC
20 TCTCCAGAAG1TCCGGTCATTA660ACAATGGTAAT GGCGATAATGGACGAAGAAC.ATTCCaTTGAGGAT
TGCAGCAATAAAGGGAGT GGGACAATGGTAATGGAA
ATGATGGATCAGGTGCGA AGCAGAAATCCTGGGAAT
20 GGGATC.ACTC.FIGITCCTCTAATGACATGCCATTCTOC. 1 AAGAGGACAACTATCCACTAGAGGAGAATCCMTG ATCXAGC.ACATAAAAGTC
TGCCATTCTGCAGCAT
i AAGCITTCAGCATCTATTACTGACACTGTATATTICT GAGGAGGAGCGCTATAA
GATACTTATTGCFTTGATT
17 1TGCTC 1 CCIACiCACAC , MIT GC( C
AAGAGACGGAGAGAGACCGAGGITC.AATGTTGGA TTCTGCCTCTGTGTCTAGC
18 AGA ' CGAGGAGG T
TCTCAAGGTCCGTTCCAGA
CCAACTCCTGAAGAAGTG
GGGCAGGAACTAGATCAG
GCGACTGTATGTGAAAGA CITTACCAAAAGGCACATC
20 TTACCAMAGGACGCCCAGICTGGTGGTICATTAGG ACACCTGTfAAACCAAATACAGCTGITGCAGATGAC
GTATTGGTACAGGMGGG CTGGIGGITCATTAGGATA
21 . ATATAGG TCAGGAG GI 'FAGG
TGCAGCCIATACCIOAAAGGCCIGGGCIOTTGCATC CACAATAACAGACACCAC GGTGATTCAATAGGGGGG
GIGTATGAAGATATTCCTT GGACICAGGATITGGICA
23 'MGT CAG GGTACCTGT TGGA G
CCAATGAAGATMGCTGA
24 CTGACA I/WAG CNICAGTA.GAGGCTGC:AC CA
20 GaGGTGCAC-ATCCTACAMAAACAGAACAGCGGAT I TACAGGTGAATATTGGGATGTAGCAGMACTAATT
ATTAGAGATACAGAAAAT
IS CiGA TT 1 GI ATTGCAGGACAA CCCA A It CAGAACAGCGGATMATT
TGGAAGACGGCiAGC:AATCACCCATAGICCC TIT RiCACia GCT AA ITTC GCAGGAAGACAGATC1 AG
20 AAAAGCTAATIGCCATTGATCAAGTAIGTAAGGTAAA 1 GIACCICCCWICAGACFCCGGIAAATCi AC:AGC:GG ACTAGACATACTGAAGM AMU. AGGTAAACICT TGAA.
27 . CiaTGAACC 1 GTT TATGAAC CC
=
GCAGAGCATGCAGIA1TGTAAAC AACGGTFTTCf MACAU'.
28 AGATITAAGC AAC.ATTCS TTGGACTGTTGATCTOCA AAGC
AAGICiACAAA TGCCCGGCT ITAAAGTGCA 'IGTCATAATACACIAT AAA
20 GAAACAAACITCCAAAIGTGIGCATAAAGGCAATGA A ITCGCiATTRXATT(CA
ItGACCiATAGIGCATICG CGACAMGCT ICAATCCAA AAAGGCAATGAAAGAATA
20 ACCGGTTCTCMOTAATCATAC.A ATGOTGAACAGTA
TC.ATTGAAATIGGAGTAACACGGATATITTGTTGOC TGGTGAACAGTATATGTAA
20 TCGGACTGACGAAAGGAMCCalITCACIATAAGAC AACiAGGCGAAGAGACAAT I
GAAGGT(CGGTGGGA CAGGGCAAGAMCAAAAC CTITTCACTAI AAGACAAG
AGACITTOATGACTGCAAAGATGTITAGAGATaGG TACAGGACMIGAAAATG ACiATCCCAAGGACAAAGA
GAATACATAATGAAGGGAAAGTCATCC: CI GATTC:AAGCTGGATAG TONfGAAAT AGGAGAAGA
TGCATAGGAACA I ATITGAGAAATGATACIG "FGAACTITGTA AG TATGGA
3S GAGITC GGOCCTCG ATGTGG Gilt TICTIKAGTCTCTICAGCA TTGAGAGCATGATTGAGG
36 TTGAGGC . GTTGAA G C
TGACAAAGACATAATGGA
CGATAACCTCGTTTCAGGT
20 CCTAGGCATCAGC.ATGTACCAGGTATATCACCGATAT CAGAAAATAACTGGAGGCCTTATGGACTGAGMAT
GGTATATCAC.CCATATGAG
ACAGAGATTCGCTTGGGGAACTGTITTOGAGGGAG TTACAGAGG A TGTCAAAAA
20 CACTCTItTaTTCTCTGGAAATGTOTAATOCAATC.AA
ACICiCACAACATGACC.AAGAAAATAGCTCTTMTGT GAAGGTTAATAGATITCCT GTAATGGAATCAATGC
ATA
TGGATAAGGAA CAGCTTT C.AAGG AGGAA
41 MGT TfaCACAAC ACCCGGAATGCAAATCAG
AGGATTCGIGCACTTTGT
20 GCCCATCCCACCAG1ATGTGT1AGGAGTCTCAATC11 TCCAATCCICTGATGAMCGCTCCTGCTfGTATTCC
GCATGTTTAATATGCTAAG TTAGGAGTCFCAATCTFAA
42 AAATC116 Cf CMG TACGG ATCT1 G
20 TCATGTCAGCC.GATICATIAATTCCTAGGCAACITCAG
AACAACGACCTIGGACCAGCTGIAGTCTTIAATGAA TTTTCTACCGCTATGGGIT
43 CATGG C.AGCTGA TG
TAGCCAACTTCAGCATGG
20 IGGTACAT1TGTFCATCCICAAGAAATT(EC:AAGAGA TMCCCTAGCAGTFCMATCGGATCAATTCGGGCC
TTGCAACTACACATICATG
ATTCCCAAGAGAAATCGCT
20 ATTGCC.AAATTCACsC.CATCCTATGAAATCGGTGCATA
CCAGCTACCOGACCCTACAACTTGCAGTGATGIGTC ATFACTGAAAAGGGTGTA GAAATCGGTGCATATATAT
45 TATATAACCC CA , ACC AACCC
20 ACACAMGTC.ATCTAGAGTAGGTTFCCGCAGAGAAA TGGAAOCATATAATGAAGTGCAAGACCTCFGAGTFG
CCGCAGAGAAACACAAGT
20 alGTGCATGTAATAGCGCTTGCTAGTAATACAAATT ACATGCMAAAACGAAAGTATGGACTIGCTGTATC
CAGATACAGGTTCAGACT CTAGTAATACAAATTATAT
20 CTC.AGGTGTGTCTCCTGTTACTGCACTATATTGGTATA
TGACAGTGTATTTGACCTGTCAGATCATATGCTATG GCACTATATTGGTATAGGA
48 GGACAGG TCACTTICATCT 1TGAGCC.ACCAAAACTGC CAGG
20 ACACTGGATTGCC.ATTGCTATCGTAGGGTAACAGTAT ¨
CACGTGGTCCAGATTAGATTTGCAACGTEGGATA ATACAAATCCTGCAACAG GTAGGGTAACAGTATTTAC
GATCAAATAGACTATTGG GTCTGGAAAATGCAATATA
SO GCAATATATTATGC AAGACTCTC AAGTGTG TTATGC
ATTGAATGTGTAAATACCT TACAATGTTCTGAATCTAT
Si TATGTACAG TTGG GIGA GT ACAG
GTGGCAGACCTAATACTGT
AAGTAGCACTAGTTTTACC
TGCATTTACCGACCCTTC
TGCCTTAACATCCAGACGTGGGACATAGTTAGMT TFACCAATCCTGCATATGA ATAGACACT.
ACATTAACCT
54 TFAACC TITTCAC 1 rtACCC:AATC GC TFTCAC
i TCCIGAAACACAACGMAGTATGGIAACACCTAAA TGGACATCCATACTATAAA
55 A FTCCT 1 GGCTGCC , GI TAC:C CGCAAGCAAGACA
TFC.Cr 56 CTFCTGT ' CACA AAATT
CCCATGTTGCTAC.TTCTGT
TTTACGACGCGAACAGTTATTTGCCTGCATCTCCTAT GTACTTTGCAGGATAATA AGGTTCCCTTAGACATTTG
AAAGGGCTCTGTACTATA GGCACATATATGGGAAAG
GTGGTCAAGTACTAGTTGC
59 . ACITIGCC GCCCAT CAGGGIGGAATGCATFGG C
GGAAAATATTGAAATGGG ACTGGAACTGCTAAGCATA
60 TAI CTAGIGTGTG It AGTCf T
CTCGAGGTCTGGATACGAGATGGGCMGATCTA IC CMATCCAGGTAATAATA GAGICAAGGEATTCTCATA
20 GCCATATGGPXACAGTGGTCAGATCTAATGTCGCA AAAAGTACACATCAGG.AAGGCAAATTGCCATCATCC
ATGGAGAGAATAAAAGA
AGATCFAATGICGCAGICC
AAAAGGICGAAAGGTTGAAAC.ATGCCTCCTTATTTT
63 CiCCGIA 1 AACTTGAITTCTG CTGGATCAGACCGAGTGA GI
ATCACCICF GGCCGTA
20 GTACGCCACCATCAAGGGAGM AACAAAAGAGAAG . AI
GCTAGAAAGAGAMTGGICCGTIAAACACTGOCT ATAACAAAAGAGAAGAAA
20 TGCTAATCiGGTCTGCTGACACAGTITGA FTATCGCT G
TCFCICTIGGAAATGIGC(ACAGICTAAGGAIGTCC AGAAATGATGATGFTGAC AGITTGAITATCGCTGCTA
65 , CTAGA ACGAT CAA GA
.
GAAGAGCAACAGCTATTCTCACiAAACFCTCCCGCTF AAGAAGAAGAAGTGCTAA
66 GCTAACGG ACTATCAAC GCGGATCATCAGTC.AAGA C.GG
20 CCCACATCAT TGATGACGAATAAGIGAACG (AC TAT T
AGATCAATGGCCCIGAGICACAATTTCCCAG ITCCT TTAAGGGTFAGAGATCAA GAACGT ACTA.' TG
TCICIT:
OGGTAC.AGTGGATTC:GIA
68 CAAA CAC A GGACAC1.
GTTCCAGCAAA
ATTAAGCAC.ATTAGCCTTC TGCATTGACTGAAGATCC
69 TCTGGG TataTT A
GATGAAGGCACATCTGGG
1TAIGGATGGC.AAAAC CAACAATCA TCTGACTAGC
AMGTGACC:GGGAGAGA
20 AGCGICi 1CAACRTAAATAATTGCTFTAAGCAATGAA GGITAGATFAAGGG1T1G 101 ACGTAAGAGAACiCA AGAATCTI TGGTCi 1TACCF ITAAGCAA TGAATC FCTGT
GIGTA CGAAAGTFTATAGCTAGC
GGATC.GCTCATTTGCATiA
20 ACT7 ClICAGGAGAIGGCACACACAACAAGGAGAAC
GGGAAGTGGAICTAGAACGMCATTAAIGGCGGA ACAACAAGGAGAACGTIC
20 CAGMCACCTAATGGTC.TGTAACTITGGGAATTGGA GTGGATCACGTTCTGGTAGACCCATCTACTGTTACA
GCAGMAGMAMTGO
74 TCTGGG . GATGGTCTT TGG
ITTGGGAATTGGATCTGGG
20 GGTAGCAGTAGAGICAGTGATITiTATTTAGATTTIGT ACTGGCGGTGTCACTCAATCCTAGAACTGCAAAACT
TTTAGAITTTGTAGCFCCTG
20 GCTAMCGSCCAAGTCCCGAATTIGAAAATGCCGCC AGCAGCACC.AGACAATGCATTTTICTGTTAGTATAG
AATTTGAAAATGCCGCCTA
AGACCCTATAAATGCAACAGATGTTGCMCTGC.AT AGTGAAATACCACCAA TA AGAATTGCAGACTATAGCT
CACAACGTITAGTTIGGA TATCAGGTGTAGAAATTAA
TAATAGCCCTATFCAGGAT
CATTICAATGICTTTATCT TTCATTAACCTGCTATGTAT
20 GTTFCTANISCCCTCCTC.OGTGGTGGTGACAGATF GC CIGTCi TGGAAACCIGGATATIGTGTAAGATCITACC1 GAGCAGGAAGAGCAGGA GTGGTGACACiATIGCPAG
81 AAGT TTAGTCiTCTTTGT T T
GIGCCItiGATATTTFACACACCAGITCCTFATATCCC ACAGTGGCTIAGGACATG
82 TAGCGT ACTCC.AAA A
AGGIGGAMAGGTAGCGT
TTATCCITGTITGAGITTT ATGTAGAGATACTGTACAA
83 TGTACAAAAACTG ACAAAACTTG , CAAG , AAACTG
.
20 MAGCACTTCTCATATACACATC.CACCACTAACAGAT AGATGGAAACGAGGTGICTGTGAGGTAATTTAATT
GCTAATAGTAAAAGTCAG CCACTAACAGATGCTAAAT
20 GTTIGTATGCCCTGCCTTCFGCCAAATACAACACTGG C.ATTGGCAGCATCTGAMGCACCATATTGGGAMC
C.CAAATACAAC.ACTGGGAT
20 ATGTCTATAAAGTAC.AGGCCATCCGATATATTATCAA ATG
TGCATITTACCMGGAAGCATTCCAGTCC.TCAC CGAGGAAGAAAAAGCTAT ATATATTATCAAGATGCAG
171TACATGGGTGOCGGATGCCITrGTGATGTATTA CC TCAATATTATTAGTTF
CATTGAAATGTFGGAGGTA
20 TGGGAAC.ATAGCCAGAGGCCCACTTTGGGGGTCTGG CAGGTGGGGGICGGGTAACACAGGAGGCCCAATA
TGGATTGCCAGTGIGGIT
ACTITGGGGGICIGGG TAT
ACAGACCCIGTTACACAATCTGCCAGATCCTICTGTT
AAAGGGICTCIGTAGGCA
GTAGGCCTACTFTATCTGA CTGAAGGCCACATTAGACT
CCTTCTGTGGTAGTAGATT ACTACATTTTATTTACATCC
TTGGGATGCTGCAAAGCCTAAAGGAGGACATGC AGIT ATGTTAAAACAGGA GACAAAGTGTC.FCTAIGGA
92 CTAIGGACC 1 ACC AAGGA (X
t TGATAACA
93 AACACACA 1 TTGCAT AAT , TGT C.ACA
TCTACCTCCTCTAGAAAGCGGTCTTAAACCCCCCGTC ATCTTFTTGGAAGGTGGA
TACAGAACGTTTGTCCTCT
94 GTCCFCTG ' TAM TCT G
CCAACTTTCAGGTATGTAC TCAACAACAAGATTACTCA
TGTAAAMGGAGGAGTT MACCTGTCAGTCTTTTAG
20 TGCCCCTCATTTAATTCAAGAGAATCTGTTTGAGGTG TAGCGAATGGTATGTTGTGAC.AGTAGCTCTTCCATT
GCTTCAAGAAGTGTTCCT
97 . CICCAG GIATCCAAA C
CTGTITGAGGTGCTCCAG
GCTATMAGCGCTAAAAC GTTAACTAAAACACCMG
TGGACFATAATGGACTTFA ATGGAAATAGAGCATATIT
CTCCAGAGAGTCGCCCCGTAACTCTGTCGGTCGACT CGACTACTACCGACGGAG ICTCTAGCTCTACCACGIC
AGACCCTATAAOTICAGGriCTGAATCFGCAGC:ACT GCAAG TGAAAITGATATA
CAGATGTGGTCGGTGTTG
21 AGGIGTACTGGTACGIGGAGAGGCCCIATI AA TCAG GGGGAGGGATITCPACCi ICITATCAATArruca GT AGITITGGAGAAGAAATA
03 . CCT TGGCTGTI CACTI
ACCiCCCTATTAATCAGCCT .
21 CTGCAGATGITTCTGAAAACCOAGGCCAGACCAGCiA GAACAATACGAGTCAGCAGAITAGGC1TICTGACCA
CAACiATTTGF:AAGAGGIG
04 TITTGC. ATAGTTAGTCCA G
GCCAGACCAGGAMTGC
21 'FGT. I GCAAAT ATC:AGCTGIGCATTAAATACCTGACAA TGAAGGAGAATCF ATGATCAT
GCCCCTTGCAGCF A T TTAACIGAAACTACAAAT AATACCTGACAATA TACTA
OS TATACTAGAGGA TTCFGGAAT GAAGGC GAGGA
21 Ca IT TGCGTCGCCTAAGAAGA TATGGCAGF AGA TGT RAI GMTACAGA I
GGCFCTITGGItiAGIAC.ACGAG A TIMM TCCT AA TG ITC TAT GGCAGTAGATG TF TA I
OS TTATGATGA CAACAGG CAGATG GATGA
21 GCTCTGICTATCATTGCAAATTTGTGTAGTGGAAATC TGICTTTAACCCAGAAAGAGAGAGACCACCTC.TACC
GTGGITCAAATAATATTAC GTAGTGGAAATCAGTTTAG
CC.AGCCACTGGIGAACACTGGGATCCTITIGCTGGA A TATGGAGATACAGAAAA CCGCAFATOCITI
AAAGCA
OS AAAGCAG GC TCCTG G
21 AGIGTCiCCAAGIGATICMCITAGGGGTCTACATIT
CTCGGATITAACAAGCA ITGCFGT TGCCI GTITCCCC GCiGGICTACA ITTITGICI
21 CTGCCCATCTCCACGCGAAGCCGCGGAAATC(AGCA GCAGITIGGCITCGAGGIGC.AITGATGGCCTGGAA
AT CFGAC GGCAAGGCCI. TCCACATG
CCCiC.C.GAAATCCAGCAAT
21 GCCGIGCTGCACCICAAACATTAACAIGTACCCGGCA CCAGACiACGGTGITGCCiCCGGCAAAGGAGGACCA
AACATGTACCCGGCAGTC.C.
21 CTGAACTAAGAC.TGGGTGTCCATCCACTAACCATCCT
GTAGACACCCAGTTATGCAGTGTGATAGAATTCCAG CCCFACAACACAAAAC AA
12 TTTGC . GAGGTTT ACT
TCCACT.AACCATCCTITTGC
GOGGGGACITTATGTGAC
13 TGACCC C ACCCC.AGTCCCGTCCA CC
21 CCGGTATCGGGCC.AGAGGTGGIGGAGTGITGGGCT
GTCGCTGACTCCGCCA
CTTCTAC.CICTCCCTAGCC
CGCAGGGCAAGACGAGGGATCCAAGGACTCGGGCC GG C TCCAAGGACTCGGGCC
21 CACGCTGGAGGCACATTGTC.CGCCACTICCIGGGIC GTGGCTGCCMGOGATGCAGCAGGCAAGGCGAGC
GCCACTTCCTGGGTCATG
CAATCCAACACGAGGCAA
18 TGGGCCGCCIAATGCATCCTATGCGCCGAGGCCTU ATGCGGGATCUGCCGGICGAAACCCGATGGCCCCCi GCiACACCGACTICACGC TATGIXTCCOAGGCCIT
21 TCCT1C17C.CACGGCGCCGCGAG1TACGCTCCTIGGA TCAGAAGCTGAGACCGAA
19 G AC. TCGTGAGGCAMOCATC.CTGGCTOCCGCGCATAC GA
C.GAGTTACGCTOCTTGGAG
20 C ACGA A CGGGCCTGGAC. MAGA AAGCT
iAGTcG
21 ACTTGTAGGCCX:GGGGACCCCGATCGTAAGOCGCAA
GTTTCAGATCCCACZC.CTOC.AAAAACACGGCCAAGG
CGATCGTAAGCCGCAATGT
AGAATGGCCATAGGCITGG CCTGAGAGGAGGCTTGTG
CfCGTTTCGGCCCCGA
GAGICCGACAGGAGGICT
AAAGTIGTGGTAGACGGG
TGTAGTTGAGCGGCTCCT
TTCGTCGCGTGACCTGG
GGGTGTTCCAGTCATCGG
AGOKTTGACCAGGTAGA
ACGCAGTTGCTGACGC
30 AGACCCACCACCTCGTGCCCiGGCGTTIGTCCCACG TCA GGAGGATGCGGAATGGC
GGGCGTTIOTCCCACG
GCGGGGGGAGGATGAAA
21 GGGGTGGTGAGGATGC.AGCTCCTTTATGCGCTTTGC CGACCTCAGCTCCGAGTGGGAAGAGGTAGCGCCCC
OCITTATGCGCTTIGCCG
TTCTCTATCGTCAACTGCG
CGGGTCTCCATCAAGICCC C
GGGAGGAACAGAATGAG
TCCCGCFGAGCAGA
36 TGCGGOGGAGGCTAAGC:GCCCGTTGAGGCCCACGT AG AAGGCICAGGACGIGGG
CCGTIGAGGCCCACXTT
CCTACTTGGGAGAGTCCG
37 Cl C C
CAAGOCCAGAGACACCCT
38 TCFCGTCCCACCTGGC:GGAAC:CTIGTCTGGGAGGCG AGGGGICAGMGCTOTCAMX:GGICTTCrTCIGCCA
AGAGGGCCCIGOTGAGA ACGTOICTGGGAGGCG
AAAAGCCTCOGICIGGICC
21 ACC:GCGTT AACA TaCGGCCAGAACTGGGCCTCGTG
ACiATGOGGCGCAGACAGCCACAGGGGCiATGCCAG TAAGCAGCCGTGACTAGC
GAACTGGGCCTCGTGGGA
21 MC ITCIGCCTGTITTG11T0TACGAGAGAAC:CCGCC
CTCGTCAGGCCGCGAGAGGATGGCCTCAAACACT G CAGGTGAAGCTGCAG ITC
ACGAGAGAACCC.GCCAC
21 CaGAGITGATGAAAGGGGCCGGATCTCGGCCGITGT
GTGCCTGGGCGCAAGA GATCTCGGCCGTTGICTC
ItIGGACATC.ATCGATCCCGGC
43 GA TACA GACCCTIGTCCTCC. TCCA
GGGGGGIGGCATATCTGA
21 TCTG ITCCTTCTGCTLYAGTGCFCATTACCTCICCiCT
44 AGACGAGGCACAGGC.ACCCCTGGCTCGCCTGGAGT GTCGA
TGGAAGACATGGCCGCC C. TGOCTCGCCTGGAGT
21 ACCGGTGACACCAAGTCCATCTGICATGICGGGGGC GATGCGCCTGACGTIGTGCGGCCCCACTCiTATCCAC
45 CT C TGCTGAGGTGGGGCTCi GTCATCiTCGGGGGCCT.
IGGATGIGCATATGGTIT ITG I TGCiCCAGICTI AATTIA TT
46 CGGA C.TGATGGC GTC
ATTGGTTTCAGCAATCGG A
21 CGCCAACCAACACCICCCItIACACiGCAGCGTCTTCA
lIGGCTTCTGACATCTCCCAGCTGTGCGTAGTGITCi TIC; TGCTTCOF GATGGCA
CAGGCAGCGTC1TrAGGA
21 GGAGTCCCiGACAGTITCATGCTCOGGCTKAGGGGC AG TAGCGGGCCTCGG TGlOCA
TGGACGAGGAGCT
CGGGCTTCAGGGGCAT
21 GGCAGAC:AICCGCCA ITACCiTTG MCC' GCACITACT GCAGIGGCTA
IGCCTCCATGCCAAGAGCTGGIGGG
TGICGCTGCAGTACTACGA
GAGATGGICiTCCGGAGACCCCACACCGTGGGCCCC ATAACCATGGACGAGGAC GGAAGAGGACGAGGACG
GGGGGGGAAGTCGICTITTCACCACGT GGTCGAGGAGGCAGTGG
C.GGGGTCGAGGAGGTAGT
21 CGTGTGC. TGCCTGGAACAC.AATGAGGTTCAGGG/sCT
AAGCCTGCCTCATCCTTGACCAAAGCAGCAGCCTCA CATAGTC.AC.GGATGCTGC
GAGGTTCAGGGACTTGTCC
CCACGATCACATTGGIGG
CCATAGCTGCCCCTGGT
21 ACCGTCTATAGCGCC.TTIGGGTCCATGACTGCTGACC TGGGCAGTAGGCTCGGAGTGICACGGGAGCCACAA
CCATGACTGCTGACCCGC
GAC.CGGGGGAGATCATG
CATCCGGCAAGCACCAT
ITCGIGCCAAACCAAAAAG
TCCTCTGAAGGATGGGCG
ATCAACAGGTGGGCHITTG
58 TGT CAG CATCTCGGACCai AAGCC T
TGGIGTATAAACCGCAGT GTAGCAGAATAGGGCCCC
21 TGCTGTATGCC.TCGCAGCGC.TATCCTGC1GCAGGGC
CAGTGCCAGITCC-AGACA
TGCCCATACCTGOSGG
TGGCCAGGTGGACGCA
GCAGCAAACACGCGGC AGACCCTCGTGAGACCCG
21 ACAGGCAACTACATGGGGCCGTGTCGGATCTCGGTC AAGGGCGCACACCACTCCTGTGATGGATCCAGC.GA
63 C.AG CT CGCTGGTCCTGTGTGTCT
TGTCGGATCTCGGTCCAG
64 CAACCCGGCAGAGCTCCAGGTAGCCTGAGCCGCACT G TCCTCGCT(SAGCCAGG
GTAGCCIGAGCCGCACT
CCTAACAGGGICATCGTCC
TCGCCGCCGTGACTCA
21 GGTCCTAACTGGTC.AGGGGCAAGCCGGTCMCACGA TGACGGATGTCTTTAACGGCGCAGGATGTGTTGGC
AGCCGGTCAACACGACAT
CTCGGAGAAGGCAAAGG
C.ACCAGCCCGITCACCA
21 CCTACACGACCGCC.AAGGGGCAGAACGTTGICGCTG 1 ACTGACCCCCTTGAGCACGCGAAGCCGTCTCTCTG A GGAAGG TCAAGAGCTGG
TIGTCGGTGA
GTGICCTGGIITAAGGCCGCAGTGCTGC. TCCAATGT AGCCTAG TCAGAGAGAAC
TCTTITGGAC
CGGGACAGATIGTCTTCCA
GCCGTGCACCTGCCAT
21 CCCGGACTCGTITTACGGACTCCCCCUTTTCCGCATC TCGATGAGGGAGCAAACACCCGCAC.AGICACGGGG
AGACTCCTGCCTGAATCA
CCCCCTTITOCGCATCAG
21 CAACGGGACTGTCATGGAAATTATTGCCAAAACAAC GTGGTC.AGCAGCAGATAGTGAATC1TGAGCTCCAA
CAAAAGCAATAAAGTACA
ATIGCCAAAACAACGT GT
73 GOCKi CT GAACGTGACTACCCGGCGGIGGACAGCAGT CGTAGCACCACGTGGCTGCTCCAGCCCCTGCA
TACC CTCFAGGGCCGACCACA GGCGGTGOACAGC:AGI
74 CT MCI CTCCGGGCACGIC Tr AGT TCGGGGCCIGTGTC GAGTi TCCAGCCCTGGGC CA AAA Ci A CCCTCCCCACCT
75 'FGGAGGGGGGCCAAAGAGGTACCACCCCCACCCACA GC CCATGGACGCICACACC
'FACCACCCCCACCCACA
TCCGGAGTCACAGACTTGGCCTTG
76 crcr GG GGAGCAGCCGGGAACT
CGGGAGGACACCAACCCT
CTGGATGAGGGAGCGCCAAITACAAGGGIGGGCTA GTGGCATCCTAAGGAGGG
"FTCIGGAGIGCCTiTCGC
21 TGGCCAACATGACITC:CGTCAAAGGAAGGCGCTGIA
CCAGGGICCCCACAGTGGAIGGCTCTGACCAGCAA AGGAAGGCGCTGTAGCi A
21 CAGATCCACCACCGC:ATCCAG T TITCTCGCCCCCT ICA
GGGCAGITCCSCGTTGC:GCACCCACi A IGCCTCAC:Cf TTTCTCGCC:CCCITCACT
21 CAGACGCCCCC:AOCCCIAAC:AGGCACA CrACAGCA A TM CTAI
GGGGATGAGG
SO AGGCGACCCCATGATGCGGGGTGATGCGGACX:TTGG TCA CC
GGTGATGCGGAC.CTTGG
AGGGGTGGIGGATGT
CCTTGCCGTGCCCTCT
21 TGGCTCCiCrGC7 GTGACCCCIGGCCA TATCAGGCAA CA CUT TGOCCiG
IGGGGCAACCITCATCACGGGGC CICCiCC.AT A TCAGGCi AAG
21 CGA GGGTGCCGCCITTGAGA CAGGGICTC1CiCiTAA AC
CAGGGTCTGGGTAAACAG
GCAGGTGCTCCGAGCT CT
84 TGGCGGTGCCGGIGAATC-CCCAGCCT. GCOGGACTT CXACG
TCAGCCTCCAACAGGTGC OCAGCCTGCCGG ACTT
21 CGTACCTCACCGCCAGC ICCGGACTCCiGGAGCCITA
GTCGGCAAACAGGGGC
AACGATGGAAIGGCCA CCA OG GGCAA TAACCGAG CA CXTCCITGTC:GT AGAGC
86 AGCG GCC1TC.A GCACGC.CCTCTGGGAA
21 CGCTCCT ITC; TGGCATCACCGCCTOIGCAGAGCCT t G
GCACCGTC:AGOCACCIGTGOCITAGGGAGGIGGC
37 AC CAC AGTTGGGGTCGGGCCT CC.
TGTGCAGAGCXTTGAC
CACGTCACGGGGAACTG
21 CGTACGTGCGTGTCTTTGCCCA GGAAGC.TGTACGCG
TGTGTAGCACAGCAa:AC
CTGAGCCGCGACCAGT
21 GCTACCTCATGTTCAGGGCCATGCAAAGGCAGGTOT TGG/s CTCAGCCACCTIGTCGTTGAGGACCMGGTCG GC-AA AGGC AGGTCTTTCTC
GTGACGAAGGGCCCCA GGCTGACCCCGGCAAA
21 TGCCAATATCTAAGTIGCC:TTAGTTGTTITCCGTTTGC
ITTCAGGCTITGCAAACACAGTGTATTAGTIGCTCT TGAAAGTTGTCITGGAGAA
GITTTCCGTTTGCTGCAT
A AAAGACATIGTCCTGCAGC
TGITAAACTCCAGTAATCG
TGGGGAGIGGITATATTA GTGTGCTAAAAACAGAGA
95 C.AGAGATACC TCCC ATGC TACC
GIAGACATTATAAACGAGCAGAGCACACTTCTAAAT ACAGATGAAAGTATATTG 'TGGCTGATGTAGATAGTAA
96 ATAGTAATGC 1 GATATC.C.ATTGTGA GCATA TGC
21 AACCZTAATAGGCTCATGCAAATGITTAGTATTTTAT MGGGIGGGGTIGTC.ATTICATTTTGCATCAGCTA
TAAAGGGTATACCAAAGA TGTITAGTATTITATGGGC
97 GGGCCTG AC.GG , AAAGC CTG
AGAACTGTATGA/NAAGA
GCTGGAAAAAAAAAGGCTACACTGACACATTGTCTT TAAAGACATTACTTAACA
TATAGCAACGAACGATGG
99 GA ' TTCCICTTTG GTCCC A
22 GGTGTCCTGGGTGCTAGACACAGTACATGTGGGTTC AAACX:TACTGACC.ACGGACCCGGGTTTCGTTGGTGC
ATATGGGGAC.ACAGGCAT GTACATGTGGGITCAAGG
CAACATTTGTAACATTGTG ATGATAGTGIGGAACAGC
01 C.AGC.G GTTCCGC GC G
TTITTGTTACTGTGITTCTT GTTTCTAGCACTATATATA
GCACACAAAATGTTACAG
GGAGCAGATGITCCAGIG
ATGCTGGTAGTICTAGATT
22 GCATTTTCAGTGTCATTAAGCTTGTCGGACAGCCTTf GCTGTTTGATGTTAATCCTGGTGAGCAACCAATAAT
OS AGGAAT ACATAGTEGTGT AAGGGTCTTGAGGTAGGC
CGGACAGCCTTTAGGAAT
22 CCAAGGGAACGTCAGACCTATC.ACAGG1TTTGGGGC 1 TATCAGAITATTTCGAATGGCTGCATTAAAACATTG ATACAGGATGGTGATATG
06 AAT 1 cicraia: GTAG
CACAGGITTIGGGGCAAT
t ACTGGCTAC:AACGIGCACAGGITACAAAAAGCAGA
07 ACACCT I TTGCCC , TC54.11CCCCTACA.AGC It *TCTITA.TGCATCCACACCT
GGTGATACCTACCGCTTITTAC.ATITTCTGC.AGGAG MAACAGCTGAAATTATG
C.ATGGATGCTACATTACTG
08 TTACTGG ' GAGGA GCC G
TGCGTGTCATGTATGTGTG
CATATCCMCGRACACAG
ATTIGG GCCAAA GT
TGCCTGGCAGTTATTIGG
GCCACAAGCTACATTAGA GTTTCTGGTATTGTTAGAA
11 . TIAGAAAGATGG AAGCAC CC AGATCG
GAAAGCTIATTGTAGAAA GCCTACTGCTGAAAATTTG
AAAACGAAAGTATCTGACAAGCCCTACAGCTICCAG GIATCAGAITTAATTGATG
ACGATGCTICACAGGGAA
CAAAACATTACTACAAAC ACTGTAGGATTATATGCTT
ACTGAAAATGCAAACGCA C:AAAGCAATAATCAAGCTA
is AAGCIAAACA TO GITTCCAATC G AACATCI
22 TGAAACAGTITTIGGCi ITGGG TAT ICGTAIGCAGCAG AAAAGCTGGCTA
16 AAA= GGCTTTAGC TIGGIC
CGTATGCAGCAGAAACCT
AGAAGAJTC:CAGGTCCCCICAGCATCi T CCIGTICCCA GCCGAAGAATCAC:ACAGG
17 . ATC CCTCC A
CGCCACACCTTCCACATC .
TTCCITIGGAAACITTACKTOCAAGGCCIAACGACT ACICITGCAGATATITTAC
22 CiCTATGTCTTCTTGGCCATGIAGAAATGTA ITIGAGG CIGACCCGAITCCTCCCAGATTI
CAA i AACTACAGCA ACTA IGAGATATTGACAA AIGTA 11TGAGGACTIGTC
22 TGCH AAIGCACCF TITGTGICAAAGTITGGTACGTTIT ACiCCCAATTICGACIATFGTT
GAAAGCATATCTGAA CIAGATCGCCCAATATATA GTTIGGTACGTITTAGCAG
AGCAGA GICiTCATGT GTGAAA A
AATTGCCITTCAGATGACACTCiTCTACTGGCTTTGCA AACTACTACATATCCAGAT CTAACTCTATACCGTACAC
TAACCICATCACTG CCATCCGTIGTTF HATAAA
GCCZATACAGAAAATCOCA
22 CGCTCTGTTAAATCTACGYTCCAAAACCiACiATGTCCT ITTCACiCTGAA T TCi ICCCAGTITTTAGCAGCCCTGTI CTCAGAG TATTGAGGATA
ACGAGATGTCCTCCAGAA
22 AAC:ACGCAGG !TA TOTTCAGGATCCTCCTATIGTGCA 11GCGGAACAGCA
ITAACAGAAC:AACTTGTAGCTCC ATCITCCIATIGTGCAAAG
24 AAGTC TTGT GGAGACTACTGC.CCAAGA IC
GCITGC:CTGCAGTGITGGAAACCGCATCAGGCTGAT GC7ACCT67GCCATAAACC CTGTGCAG IGAGGA.AAAG
22 TCTCCACCACCAGCCITACTGTMACAGAACCAGCA CTC:CAGGAGCTTCTGTTGGGCCTTCACCGTCGGCCA
26 CAGC . TAG GAGGACGGTGAGCAACCT
CCTACAGAACCAGCACAGC
22 TCC.ATCC:CCGGGATGCCTACTATAGTGATGAGGACG ACAGACAGCAGGAGGC.ACAAGGGCCAACGOACTA
TAGTGATGAGGACGAGGA
22 GICGCGTONTATCCACT. TGCGACGTAGACTGTTTGA
GC.ACAGGGGGGGTACAGGAAGTGCIAGTAACTGC.A GGCAGAAGCTCACAAAAA
ACGTAGACTGTITGAGCTG
28 GCTGC CCC.G GC C
22 TGACACCCTCTGCAACTGTGAAAAGTAATAGGACC.AC CAGCAACACIGICAATATGCACAATCTCACTAGCAT
TGGAGCTAGCAAGACAGT AAAAGTAATAGGACCACG
CAAC.ATATGCTCATAGAA CCJNAATAAGACATCCAGC
ATCCAGCG GCATTCTTCT CCTC G
22 TGCTCTGCCCGITTGTAATGGCCAAGCAAAATATGTA GGC.ACAAATGACAATGTC.AGAATGACCGTACTATG
GCCAAGCAAAATATGTAAA
GCCGCAGCCATTATGTGATGCGGTATCTATATAGCT ATATATGGACCTGCAGAC AT ACTTTTGTATGAGCCTG
32 GCCIGTIG CCAGCAC AC 'FIG
ATGACAACAAATACCAAT TACiACTGAAGACAGGTGGA.
33 C.AGGTGGAA TCGTC a:CT A
ACGTTAAGTTTGATGGCGA
TACTGACCCCCCATACACC
35 CCGAGTCGCTGACIGCTTATGCAACCCGACGCCACCA C.ACTGGT , G
, AACCCGACGCCACCA .
22 GCCAGGTGGAGGITACITTGTFAAGGTATAGAITGC GATaITCAACGTGCAGTAACAATCATGCCACAGGG
GGTGATCCAAACAAATTA ITAACIGTATAGATTGCACA
TGGCG1TACTGOIIILT:ATTGIGTATGTATATGAT GGAACOTTAAGGGACATC AGTGIGGAGACCITTATTT
37 CaTTATTTGC CAGGAGGCC TG GC
TGCTGC.TCAATIGTATATA
CITGTGTTGTAGTGTGCAA
22 GGGATGTATCCAGTGCGCCC.ATTCTTTGGGGGTTTG
TGGGGTAGCTTGGGTGTC TTCTITGGGGGITTGGGC
CCAACAAGTACATGTACAG
22 CACTATAGGbililtiCCTGAITGCTTCCCGACTGITA GTTCCCAAGGIGTCTGGATATCAACITATTGGGATC
CAAACATATATTATTATGC
TTAACTGIT
TGACACTGAAAACACCAAT
CGTTGCAAACCAATAAGT GATGTGCCTTTAGATATFG
22 GC.AGATACC.ATTGTTAGGGCCATCCCAGITGMAAC 1 ACTACACGTAGCACCAATATGACTGCATGTACTCTT ATCCZAGITGITTAAC.AAG
44 AAGC 1 TAAAATTCiGAAG CCTAGTGGGTCCATGGIT C
t GIACTGFIGTGTATGIATGGGTGGCACACAAACATG CCTCTACAAAACGTAAAC
45 C:ATG 1 CATAAGGAAA , GGT
CCAAACCITICCIAAGC-ATG
CCGGTTTCGGTCGTGCACAAGCGCGAAACAGCTAA GTGGTGTCCTGTATGTGAC
46 GACC ' CAGT GGTTOCGCCTTGTGAGTC C
CCCTAATAGGGGCGACAC
GCCATGAATCACTCCCCTG
GCACCATGAGCACAAATC
48 GTCGC GCG Cl AAGAAACACCAACCGTCGC
22 GTACCCCATGAGGTCGGC.AAAGTCGCGCAACGTGGG COCCAGAGCTCTCGCGCATCGGGTAAGTTCCCTGTT
CAATGACCCCCGGCATAG
49 . T GCA G 'FCGCGCAAail GGGI
GCATTACCTGGCAGCTCC
50 GT C A GCTGa GTCCi CCACCT
CGTCATGITCGGCTIGGCCTGGCCAACAGAAGGAT ATGATGATGAACTGGTCG CACGGCTACCATGATMG
22 TCGGGACATCCTG1'CGAGTTGACC1TGCACACCGGCT GGATGGGGCGCCTFGCAATATGCCAGCAATAGGGT
GCCCTGAACTGCAATGAC
52 TIA MCAT r CCITGCACACCGGCTFT A
AGCACCGCCGCATTTGAGGTAA ATTGAACAGCACTCGACC
GITCGGCTGCAC
22 CGIGGIGGAGTGCAACAAAGGAT ICACTCGTGGGCiA CST TGCTCTIACTCGGACCI
GCACGATGTTTI GGIG
54 TCGT GAGGTG CTCACGGCTGC.ATGCAAT
ITCACTOGIGGGGATCGT
22 CCCITGA IGTACCAAGCAGCCACAGCTAGCTGC:AATG ATICaTCACMGCCI
GIGGTGCACAGATGCGTCA t CAGCACIAGACiAAGGT GC
55 . CiCT AAGCA IC
CAGCTAGCTGCAATGGCT .
22 GTAAGCAGGCCCAAGCACCGCCGICGCCATAITCIAC a AAAAGGT GCTITGACGCGCGGCACA
TCCICAGTA CCM GATGGCATC:ATATG
CCGTCGCCATATTCTACCC
FOCI GCTGGGC GICATIGICIGGGGAGai AGACAGCTGCTIGTGGGG
22 CGTGAGCCGGCCAGAGTC111C ICGGGGGTE TT GTG TCACGCAGATG TACTCCACi I
GCCGTGCACGCiaCCA
TCTC.CIGGGGTITTGTGGA
TTICATCCCCGTTGAGACA
ACCACCAICMCiGCATCGGGC.ACAG1TAG ICTG ATM GGCAAAT TCCTCGCC
60 CGCCG ACOCCA CGCCCATC.ACErFACTCCA G
22 CGACCACTACGICTCC:CTGAG I TCCiGGG TATGCiCiCIT
CAOCGACGCCCTCATGACAGTAC:GTIGCAGTC:GATC AAAAAGTGTGACGAGCIC
TCGGGGTATGGGCTTGAA
22 GGTCTCC:GATGGTGIGAGCT CAGTGTAGTGCTCTGT
CCGTCAGGCTC:AGGGCGTAICTGCCTCCC:AAAACI C TAIGTTICCACIGGIGAGC GTG TAG"!
GCTCTGIGAGTG
62 GAGTGC AAGA Ci C
22 CACGGGAIGTGTCAGGGIGACCACACICGIGGGCCC ATACATCGCCACCTGCATGCAACCTGCCAAGACCC:A
IGTITGACTCCiACICAAGC
ACACTCGTGGGCCCCA
22 GCCTGCACAGTGGGTTGTATGTGCCGAGATGCTGAA TTC:TGGGCCAAACACATGTGGATGTTGATAGTCCTG
64 GTCC . CGAGGT GGGCGGCTCTCATTGAAG
GCCGAGATGCTGAAGTCC
22 CGACGAGAGaCCCGAAATGCAGGTAAGGTGCTAGT TGTCGTCAACTIGCTGCCTGGACAGAATGGCCGCGC
CAGIGGC.CIAGTGGGAGC AGGTAAGGTGCTAGTGGA
22 C.ACTGGCCCTCCGTGTAACAATCATGAGAATCACGG CGMACC.CGCGCCAMCITTACC.
TCCGCGTAC.TCTG
CATGAGAATCACGGGGCC
TCGTTCTGCGTTGGGCTTA CAGCTTCCTTGCGACCC
22 CCGGTAAACICTGGIGGGAACCCGACCTTGAGCCTTC CCTGGGCACGGCC:TG11TACCGG1TaITAATCTGGC
ACCCAATEIGTCGAAGAAA CGACCTTGAGCCITCGATA
CCTGATGAGTTGGCCCITT
22 CC:ATCACCGMTGAGGAAGITITITACCCTGACCTCG
TGGATTCCAGIATICC.COCGCTGGTCCTIGTITTC.CG CAAGOGGGGCAAGAAAG
71 (ICG OC:C C
MACCCTGACCTCGGCG
GGTGCTCACCACTAGCATG
rccc CCCCAGACCGGAGTATGA TCCTCAAATGTGTCTGTGG
73 GTGGC AGCCA , TC , C
' AAGGTTACATGGGCTTGA CTCTGCACACATACACTCC
22 CAGGCTGCATAGGCAAGGCGGTTGAGGGCAGCC.AG 1 CGCGATTGATGCGGAMGCCGTTCCAGCACCGTTT CTCAAOGGTCTGCTGATG
75 TM ' TGC C
22 GCACGGGC.AACTACCACCAAGCGGTAATGAGGCCGA CGAGGTGGGCGTAACGCTTGCAAAGTCCACGCCAA
GGOGGIGATTTGIMITC GCGGTAATGAGGCCGAAG
22 ACGTCAAAGGCTGCCACCAGTC:TTCGTCAACGGICAA TCATGCCCGGGAAGAGTTTGCCGGCTTCGTCTTCCT
TCTTCGICAAtik-TCAAGT
AGAGGATGCGGTTTTGTTC
78 GTTCG AC CC.AGCAGGCTC.AGTFCG G
CGAAAAGAAAATCGCCCTC
GATTCCAAACGCGGTGCA C
ATGTCGCCCAAGITTKAG
22 CGCGC-AGTTTGCCGTTGAC1TGACGAAGCCGCCC.TG 1 GCAAAATCACCGTCGCCGCCTCCITTGGCGAGTGCG
ACGAAGCCGCCCTGGT
i GCTCCCAAGCCGAAACGTGATATCGCAAGCCGCITT
83 .ST 1 CG , CCAAGTCGGCIACGCAA I:
GCICITCGA IGCCCII GT T
84 T ' GTGATGCOGCCGCCGACTTCTATCGCGCCGCCAAC
GTTCGGGTACGCTGTCG ,XGCTGCCGTCTIGGT
CGATTAAATCGTGCTCGC
CCGAACAGGCGGTAITTGC
TTGTTGCGATGTGATGACG
22 CGCGCGGCAGTTGATACAGGCGGGTTCTGCCTTTTC.A CCGCCCGCGACAATGTAGTTGGCACGAACGCTTGG
AATAAAACTACTGCGCCG CGGGTTCTGCCTTTTCAAA
87 . AAC AAG CA C
88 GC GG GCGATT GCGTGCCCiA
AGGCGGACACCITGri GC
GGCAGGGAATACCGCCCCiTTACG TOCGCCAGTF CG ACAATTICAGCCACAACTG
TCATATCGGFCAGCGGTA
CACACCAGCGCACCGA
GCAAGATAAACCACGTCGC
TCACCGCCACCAAC:GCGCAAGGACIGGCACC
92 CA G CTTGGTCG1TTCC.ACGCT
TTCCAATGCCTGCGTTTCA
22 ITCGTMCAAGCCGGAC:ACCAAAAACCATGTOTCGT
93 , CGCCAGTTGACCGAATCGGGAACTCTTGCCGCGTTCC GGC CGGCAGGCTTTCCTCGA
AACTC:TTGCCGC.GTTCC .
22 CCICAAAGACTACCGCAGCAACGTCAGC6GTGC:AGGC CCOCGMCGGOCAACT CGGAAT
GGITTGCCOAAC
TC.AGCGGTGCAGGCGA
GTAACGCGCAAACCOCG
22 GCATCCIGCTCGACAACATCCTAGGITTIG IGGAC:Al (..tGCTITCIC:CGTC/GAACAGTICGTCAGCCIGTCi ICC AGGTITTGIGGACATAGGC
22 CGAGGCTTTGGGCITGGGTCCTTITGGGCCICICAGGA TGC.AGGTGGAGGTAGCGCGCAAACCGAAGATGCC
TTITGGGCGGCAGGACG
GTICATGTICGTGCCITCC
98 TCCG TAGAGC.AGTTCGGCZTGIGCMCTCGCGCTCGCCC
GTGAGGCTGGGCGGAT G
22 AGACATTGCC:CTC:CCC:GAGGGTTTTCCAGTCiCGAACG GCCGCCCE ACCGAC TTGATAAF
GCGAAGIGTMA 1' GITTTCCACITGCGAACGC
23 CGCGTG TCGCCCAAAA.TGACTIGCCICCGCCGAACT
00 ACGGC.GGCGAAGAGAAAGOTITCTTGGGCGCMGC C GTTGCCGAGCCAGOGA
TITCTIGGGCGCGGC
cma:Gcr GGC.ATCGGACGGCAAAG
23 GGCAATTTGTTCGGCAATGGCGCGAAGCTGC.G1TGG TTATGCCGICTGTCGAACGGCGGCAGTCATCGTCGC
02 GC . TIC AGTCGTCGAAGCCGTGC
CGAAGCTGOGTIGGGC
GCCAGATTTGTTCGGTGG
CATACCGGGTTCGCCGA
GCACGCCTGCCGCTAT
OS CGTGTGCATCGGTCCTGCCTTCGGCGGCGGCAAT ARC CGTCCGCGCA.GGTTAC
TTCGGCGGCGGCAAT
23 TGCCGCTCGCCAAAGTCGGCGCC:TTGTTOTTCAGGG CGCCGGTGGCT1TGGAGACGTG1IGGAAGTCIAACC
GGGGACGACTICTTITTCC
CGC.CTTGTTCTTTCAGGGA
GGGTTTGACCGCCGTAAT
CGCTTMCAAGGGIGGCA
CCGCACCIGTCAGAATCG
ATCCALAGAIGGAIGTGC
ACiAAAATCACCGCCATCAG
GC GGCGITGarAGITGCAGGT6TCCGGACTCGCCAC TGT1T-GGGCGTTCGICT
C
23 GCGCAGGTGGGGCAGGTAATCGCCTTAAGCCTICiCC CCTGCCJUVGIGCTGGTCGGATTCCGCTGTTGCCGGT
CGCCTTAAGCCTTGCCG
GAACACCTGCCCCGGTAT
13 CCGATCATACCGGTCGCGCCATTGCGCCGCTTCC.AAC GACCCCAGCTTCAAAGCCGCCCTACCCAGCCGGCAA
CCTGCTGCCCGTATTGGC ATTGCGCCGCTTCCAAC
GITCCGACCAGTGGGGCAACITTTIGGTTCAGGCG TACCG AG TICGCCTATTCC
TGCTGCAAGGCTACGACT
23 ACGGCAGGGCGGGGMTGTCGTITTCGAACCC.CAA ATCAGCCCCCTGCOTACCAMACC.CAGGCGGCATCG
TCCATTICGAACCCC.AACC
CCC A GATTGCCCGTCGTGGC
AATCGAACACAGCTACGCC
AAAAAGCCGAAGCCGAAC
TGCCTGCCTCAACATCGG
19 Cl GTGC AGCACTTGGTCGGCTTTG
AGGCAGTCGGCGCAGT
ACCGCGCCATGCICITGGCGCGITGGCGGCTITTIGG GCA AAACAGCCCGCCCTGA GCGTIGCCGGLI
t it, GG
CGAACTICACCAAACCCAC
23 C:ACG 1 UGC GCGCAGGCCCAA AAAC T
CCITTIGGGCOAGCCG
TMIGTTGGGGAACGGCA
23 CATT C GCGTTACGCCGCTGTCA n-GGCGTTGACGAGCAGG TCGCGGATIGCCIGCT
23 GGCTGAAAAACGCCGTCGAACTTC.AGCGTCGGGTGC COCCACGGCGTGTTGTCGACTATCAAGGCGGCTGC
AACGCGATITCGTCGTOG
GTTGGACAGCAGGACTIT AAACGGCATCGGCTTCTTC
TGAGGCGGCATTGCCGTACCGTCC IGTGCATAIMAGCGGCA
27 CGCAG G GG AGGC:CAGGCAGAACAATCi GC
CGATCGGACGCGTT ITGC
AAACAGCAAAAAGGCGG
CAAGGITTGCCAGCGCG
23 GGGACTGACGGIGTCGGIATTGCGCCATAGTATTCG GCCGCCGICTCCATCAGCA TGaGTai IGCi TAT TT C
CGGAC CC AAGTGICTTIGTCGGCGG
CGGCATAGTATTCGCGGAC
23 CGGCAC:GCAAAACCTGIACGC:GGCAGCACCITITC
GCiCGCAAAGT MGM AA
GACGTTGAGCCGCIGGA
GCATTGAAGGGCGAACaiAACTC:AAAATO TCAATCTGITCGGGTGTCA
23 CGGCGACGGGGCAATTTGCAAGGCAAACiGCGIGC
33 C.GTGCGCGCCTACCAGATTCCTTGCGAGGCTTTGCG G TCGCCAGCGAGGITTGA
CCTTGCGAGGCITTGCG
23 TAIGGCTGCACCiGCCAGGCCCAGITG ITCGGACAAA
CCiCCGTAACCCAMCCGCTT CGCCiCATGGC1 CAAC CCAGITGTICGGACAAACG
34 CGC CTC TGCGGCGCiAGGTTITGC C
23 GGCACATTTTTC.ACGCTTGCCATGOTTTGICTECTTC CAAC.ACGCTGTTGGGTATGGGACCGGTAAAGCCTG
GCCC CTACG GGCGATCATGTGGCAGTG
23 CCCAGCAG TT TCGCC:GIT ITGTCTACGAGCTITT GGG GGTCAGGGGCATAGGOCi 36 CGAG ITTGCC.GTATCCGGGCGGTMCGCTTCGGGCAGIGC A
C.TACGAGCTTTICICiGCGAG
23 GGIGGAACGGAAGCTCiCTGGA CAC:GCCGCACATC
37 TGACCGCCATGTACAAGCCGCTGCCC.GCCTGC.ATGA C GGCGCGGITTTGCCT
CTGCCCGCCTGCATGA
23 CGC:CGT1TATGACGCCOCC:AAGGCGCCICACCA AACC CCGCTCTFCGGCT TCAMATG6f GATGCTCCACCAC:A
38 A GCCTG C.GGCGGCATTGCGTT
GGCGCGCACCATAACCA
23 GCTCGCGCGAMACTIAAACT-ACGMICAGGCCiGCC CGTAGCA TACGGITTGT-G
CGTMCAGGCGGCCr.
GTC.AGGTAGTCGATGICG
T ACG GA
CTGGAGCTGITCGTCGGT
23 GTCCCGAAAGCCGCTGCACCIAA ATCCGTCiTCGATGC
GCGTGCCGC.AGATTITCATCGMGCiCCIGTAGAG GT
C.GMATCCGTGTCGATGCA
CGCMCACGACACGCT
23 CGATGACGGIGGTGGCAACC.AAAAGGCCGAAGTCAT AAGTCGGCGTAGATGTGCCCAAAGCCCATGCGCTC
AAAGGCCGAAGTC.ATGGC
GCCAAGCTCGAAGAAGAC
ACACCCGAAACCCGCAGCGCGCCTGACGGATGCTG C ACCC-ACACATCGGATTGC
CGCCTGACGGATGCTG
OGGTGGCGTGC.ATACC GCGTATCATCGGCGCG
23 AAAGGICGATGACGC.GCACGGGCATCGCCATGAAA
47 CC.GCCGACTTCCGCATACGGCGTGGGICATGCCGT GAM C.GTACACGGCTTGGCAAA
GCGTGGGTCATGCCGT
23 AATCGGCAACIGCGG1TCCC.CCAGATG1CGAAGTC
AGACGATGTCGCATTG Tf AAACCTGCCTCGTCGGG
AAGOCTITC.AGTGAAGACT
TCTCGGCAAAGAACGTAC
ATGAACGAAATGACCGCT
MCGGCGGCACAGAAG
23 GGTAGGATGCGGAMCC.1GCTITTGGAAC1GIGGCC C.ATGTTGCGCGGGATTICTGGICCATTGCTGCTGCC
CGCAAGTITACGGGNOT MGGAACTGTGGCCAAAC
23 GCCCACGTCGITTATCalCAAGTGITTGAGCCAGATT
TGTTIGAGCCAGATITC.GG
Si TCGGA GGGCATGACCGCCGCCATATCGAAGCCGCCGACCT
CGATATCGCGGCGCAA A
23 GCAAEITCCTTATGCC.CTGACGAGACCCAATGCGAG AACGTGGICGATGTGGrrGCAGCCCATCTCCGACAC
CCAACAGCGCGGAMTG AGACCCAATGCGAGGTAG
ATACACGGGATAATCAGC
CCCXGCCAT/ACIGCG GC
GTAGGTAACGAGCAATCC CAAGCGTGGCGATAATCG
AATCGCCGTCTGCCCA
23 GCAAATATACAGCCGCAGCGGTTCATGGCAACGTCA TTGATGCCGACGACCTGTCCATC.AATCCGGGCAATT
TTCATGGCAACGTCAATCG
57 ATCGG CCG G1TTFTC.AGCTGTICGGCG G
23 MCCGGTTGCCGATACGCCTGTATTCCATC.ACGCCG 1 GCTTCGCCGTCGCMGMAAATCGCCGITCTGOT
TGTATTOCATCACGCCGTC
'ITCGCGCCC.AACCGTTATTIGGTCGCGACGGCAAGC ACCTG TCGTT/TATIGGGC1 59 GM 1 AG TGCTUCTGCC:GCGCA
23 CGGTATTCCTGAGCGTCGGATCCGTTMGCGCAGGTT AAAACCCTGCACCGCAGTGATCCGGC.AATCTGCACG
CGTTIGGCGCAGGTIGG
TCATCGA
a..GCAAAAAGCCGCCG
23 AGCCATAAGCTGC:CCGMCGGCMGCCGACCACC ACAGGTG1TCG
TCAACGT
TCCGCCATCGAC:GC1 GCCGAGGCiCCiAAIGTG
66 GCAGGICGTCGICGCTGIACCTGCTGCGCGGGTTCA TGC:FCCFCCTCGCCCACGATAGAGCOTITGGGCGCC
CGAAGCCGTCGTTCCCT CICCTGC.GCGGGTICA
CCTATCAGGCTCATACGGC
67 CG AGACGGAI GCGAGGGCG1 ACGACCIGCTCCT MCC:
CGGCGIGGCTCAMGC
GGGTAOC1TCCACGACGCGTAAAICCCCGAATTCGC AGT TGCIGT ItAAACCACG
69 CCACGTIGGCACGCCTGTGGCATCCCITGCX.GTCCA CTGC.GGGCAGTTCGTTGGTITCGTCCGCAGCCTCG
GGCGACGATGTTGCTGT GCATCC.CTTGCOGTCCA
23 CCICAAGAACACAGGCAAAGCCCGGCACTCTGC:ATTI 1TTGGGCGICITCGTG
ItTTCCACCIGCGGACAATACA TITGTGITCCAGCi TAT GCG
C.GGCACTCTGCATTMGC
GCAAGGTGGCGGCIGICGACAACGGACAAACCAC1 GCiA TGACGGICAGTGTG1 72 AAATCCCGCCCTGCTCGCCITGGACGGCGTTATGGGT GCCGCIGCMGGGC1, ii, 73 CTGACCGGC.ACACCCACACGCGGGC.AAGCTGACGT CCTCA AGGATGGTC-AGGAOGGC
GCGGGCAAGCTGACGT
23 A TGAGGAAGTIGCAGI GICGGGCCTC.ACGACAGA A
TGTCGGACAGGATGICGA
74 ACAICZGACGACC1CGAAGC1C6GGACIsGGGG TGGA ACOGAC
TCGGGACAGGGGTGGA
23 CCAAAACCGCCGTCCTACACCAGCiATTGCGCTCiAACA
TGICCAGGCTGTGICCGAACiCGCATCCCGGCATCA CGTATGGCGGGTAAATTG
TS GCCT GC GC
GGATTGCGCTGAACAGCCT
23 CAACAGCATCTCCGTCAGCGACITTGACGGAIGCCTG GGTGC0116TTCIGTTCGCCACICTIGGCAC:GCATC
76 GC AATG GTCAGTTCGAGC.GGCATG
23 GCCATATICGCAGGATTGCMCCGGGGGAAAAGGAG GGAGGCGGTCAGTA'FGCCGAAGAGGCTICAGACG
CGGGGGAAAAGGAGTIAA
TATAGCTGACTTCGACGG
TGTACGACAGGTGCGGC
CTGCAACCGGGGATGC
GGGAAGCGGCCGATGTG1TTEGAACACCT. TCTTCCG C.GGG1ITT11GACGGTTCA
AATGCGITTCAGGCAIGTA
TCGGCAATGGCATGCATG
23 GGTCCAGOCCIGGGTGGAAATTGCGO.iteiti ICC
82 ACCCCGGGGCAAACCATCCGCGCAATGCCTGCTCGA GCA GCC.AAATACCGGCGCG
GCGCAATGCCTGCTCGA
TTGCTGAAAAAAGACAGT
GACGTTACMGCGCCAA
ITGACC:GAIGCCACGACC
OGGGTCGGOETGCsATT
86 ra: CG GCGCCGCGTITTTGC
ATTICGACCGCCZATTCC
23 CCGACGGICAGCGGGATGACACCATCCTGC.ACATC
87 ACGCACCGTTGCCGACATTGATGCTGAtitiGiCCGC ATCG GTAACGGCGGCGGTGT
TGATGCTGACTTTGTCCGC
88 AGC.AGGCGCGATTACCGGCTGCTGCGCTTGGGCA ACGCCATGTTITTCGGCGGCTCCCGTCATCGCCTCC T
TGCTGCGCTTGGGCA
TGGIGAACACGATGICTT
CCGCGAGACAGGTCGT
CGCACCGTCCACGAC.A AAACCTTCCAATACGCCCG
23 CGACGCACTGGTCGGCAAAGGATGCCTFCCATGACG AACCACFFITCGC:RTTGCCITTCCAAGCCGTTGAC
AGATMGTCGGCATC.GG GATGOCITCGATGACGAG
23 CGAC.AGCGCATCGAAGCCATGGGATAGCGTCCGGCA CTCTTGCGCCGTCCTGCCTACCGITTACCGCAATATC
GTTGTCCGCCCGAAAGTT
GGATAGCGTCCGGCAAGT
CGTCCGATGTGTATTCCCA
GTAAACGACGACGCGCG
C.AATTCGGC.GGTGAITTC CGCTGAATCGGATAGGTC
23 AAACCCAAGCC.CGAAGACGCGGGGTGAGGATGTAG 1 TTCGACAATGCCGGTAATGCGTTCATACGGCATCGT CAGGCCGGTTAAAAGATC
GGGGTGAGGATGTAGCGT
CTGCCCAGITCCAAAATCG
97 I CCIACCGCTGGTICGGCIT GCCFGT TGCiACCIGGGA 1 A C
GCCTGTFGGACGGCi GA.
GACGCAAGGAGTAGGCG
CGGATTCAAACGCGCCA
TGGAAAACCCGCCCAGC
TCGGCTTTGAATAACTGCG
CGGAAAGGGIAACGGICG
GCCGCGG1TGACGATG T CCG TCAGCCA.GAGGGT
24 ACCGAC1ACG1CGTCGCCGCAACCCArTGGCTTrT1C TTITGGACACGCTGCCGGAACCAAGCCCAAGCCCIA
CTICGCTGAGGACGGAAA CAAGGCATTGGCTITTICC
24 GCTCCATA.AGCCCTACCGCCTGTCGCGTTCGATFTCG GAACGGATTCGGITTGOTGGCTCGGGCATCAGCA
04 CTG GGAC CGACGCCITCCiCCITCF
TCGCGITCCi ATITCGCf G
TGCTGGGCGCGTCGATCACCGACCTCCICAAC
OS CC C CTCITCGCCiCAGCITGAG
CTCGGGATGGMCGTi CC
24 CCGCAC 1 OCACGCAT TCFGCGG TCGGCGAAACCAAA =
AAGCCACITC.GATAAAGGCGGCCATTCGCACAAAG GGICCiGCGAAACCAAATG
24 TCGGCAATCTGGA.A A IGGTCGTIT TCCAACCGCTGC:
CGTTGC.GGC.TGACTGC
08 TICG C.ACTTCTCCC.GCCCCCGA.AATCATGCGGC.GACATGG
TGOCAACAGCGMCGAG CGAACGGTATTGCGCTTCG
CCCTGA ICGTGCM CGTGCAA ICAGGAANIGG 'ICCAAAGTGICTATCAG GC
24 CCGATAC:GCGCCA TACTIGGTITGCGCC4:CCAT
T ACAGGCGCOCGAGAAAGACAAACGCCGCGCACAC AACATTGGGTGCAGGGC
GGTITGCGCCCCGATT
ATGTATFGAATGCCGCGAG
11 CGAGC AAACAGGTGCGCCGAC.GGCAGCAACGCCCTGCCT AGGCTACGGGCGCAA
24 CGCGCAGTI GT TCCAGCGATCCiCIA TGCCGAAATGA
CCCTAIGCCCAAAICAGC
CGCCGGCTTCCIGC.AA
24 C:ACCCTG TACGAAACCMGCTGAATTCCGC IT TATC
TGAATICCGCTITATCCGC
13 GGCG CGCCGTGC.GGITGCCGATACTCCTGCA TACGCGCG
GTCGTCGTTGCCGCGT
24 GC4:A.TAATGCGCGCMG TACiGCC:AACCTCAAACCGC
CGC.GCCGTC.ATCATGC CCAACCTCAAACCGCC AG
IS A AGCCGCCCGCCICGMTGGIGITGaIGGGTC.ACG
TTATCGGGCGCGTGATGA AACCCGMTGGCGCAA
ITTCMCCTGCCCGTCC
17 TCCATCGC.GTCCGCCAGTATGGGCGGGCGAGTTGAA GCGATGTTTGCCGCCGTCGC.GCACCACCTTTICCG
TTCAAACCGACCACCGC
CGATAAAGGCGGCGACTT TGGCGGGGRI:miGC
24 CGCACCAAATC.CTATGCCCITCGCAAATCCIAGGCAA
TCGCTGTCCICTTICCGAACCTIGCCITCCGTGATCCT CAAATCGTAGGCAAGCGC
GCGCA TAC GGCCGGATATCGCGTTC A
CGGCGGCAGAAGGCIT
GTCTGTCGTCGAGGCATTCCACGGCGCATACCAGCT
AGCGICCGGGATITTCAAC
OCGCGCC.GA1TACTFCG
24 CGGCGCGARITTAAACGGAAATIGGACGACCTGATA C(.1 24 ATCCCC.AATATGCCGTCAGC.ATTGGCCGTAATAAAAA
C.GACTACCC3TTGCTA1C1CGTTOCAACGTACCGATGA CTACATCAITATGACCGTT
ATTGGCCGTAATAAAAAGT
25 GTATGG 1TG , GT ATGG
26 TGGGACAGCTCGGGGCAAGGCAACTGGACGGGGA AATFCCGCGCGCGGGCATGCAACCACC.ATCAGCCC
GGACTGGTGGTGC.ATCCG GGCAACTGGACGGGGA
24 CTCGCGC.ATATGGACGCGGTACATCGAATGCTCGCTC.
27 G GCCAACCATCCGCTTGCCGGTGTCGCCGCAC.GGAT
ACGCTATCTTGCCCACAGC TAC.ATCGAATGCTCGCTCG
24 GCGTCTGCCTCGATACCCAAACGGCGGCTFIGCCC.CA GCACTGTITGCCGAGCGCGGCTTGTCGAMCCTGA
28 A CGT TGTGGGAAGCC.AAACCGG
GGCGGCTTTGCCCCAA
GGGCATGCCGTCAATATC
CGCAAAO:GCCGTTGC
ATTCGGCTTGCCGCTG
ICGGCAGATACGCCGT
OCTTCCTGAATCAGGTCGG
24 GCCTGCTICCGGCCATICTGATCGGITTGTTGGGC.GG GTTTGCCCGAACGTGCGGTCCGCCTITCTCGGTGTC
CGGTITGITGGGCGGT
24 1 GCAGGGC.AGTGTGTATCCAGTC.GCCTACCTGITCGG
CACAGTGC.CGGATAAAGG
AAGGTAGGCGGGCAGC
35 CIGTGIACGGAATCGICGCCGGGCGCATCACiCAGCAT 1 GACGC , AGGCCGAGG
ACGCATGATAITTGGCTGGCGGGGCAACCACTITAA CTGTCGCTGCTGATTCAG
AACGACGATICCTGTGATG
36 TGG ' ACGCC G G
TCGCCGAAACGGGTAAAC
CCAGCCGTTGCGTGCA
AAGCAACTCTATCCT
ITATCGGCC
24 TTGACGGCCTTCCATTTTGGAAAAGAAGATTTGGCTG ATGGTGATGATGATTGCGCCC.AATCGGAGTAAGCG
AAAAGAAGATTTGGCTGA
39 . AAGT GAAA GCAACTFITGGAACG MT AGT
CGGTAAGGATTGGGAACA AAAAAGCGGTAATAAAGG
40 AAAGGGAA AAAACCif A (3 (AA
TCATCGICGAAGRAAOCGGCTCAATGCGTTGCGTCC
CCGAACAACTi COT TTCCI
TAGCTTTCGCCTGAAACG ACGCAGTTGITCGGAAAAC
GGGCAGGATGIGTTCGAT AGGITGATGCTGTCGAACT
43 ACTG AIN'S G G
TCGC1GCTCATC:AGGCUAGGICGICTI6CCA FAGG
CATGGCGGCGACAAATGC
24 GlIGATITGAAAAAAATGCCGTCTGAAACAGITICTC GFTCITTACGITITTGTGGGGCTFACCAGGCAFCCA
AAACAGTTICTC:GAACGGI
45 . GAACGGTA AACG TTCiCATGAGGACAGGTIG A
.
24 ACACCCGACCACCAAAATIOGGCGATTGCOGCATCiCi AGGCGIGGATITCCGTIACCOTTIGCCGCATTGCCC
46 GC (AC 1TACCCO:GAGGACGTGC
CGATTGCGGCATGGGC
TCGGIATCGGTGIAACGGCGTMCGGIA
47 GCCAC TATATGCAAA CTGC.CTICTTATGGOGAAC
GOTATCICTCTACGCC.AC
24 CACAACCGCCGA.CATCAAGCTITCFMTCGGCGACie GGCGRiGCGA TGAT
TGCCGCGAATCCCCAAACCGC
TITCMITCCIGCGAGCGT
TTCAAACCCTTGCCCAAAC
24 ACOICTGGCTITCCATICGGGTCAATfCCCGAAGCT
TGC.CGGAAGAAGG TITIG
50 TCCTCAGGCTCCGCCAC.ACCCGCTCCTTCGCATGG aiC C
OCGCTCCITCGCATGO
24 CATI GGCAAAAATAGCAGCACAMAITAAA FCC:CAAA
ccrAcc:arrrrlaCiaTGCMGCAC:AAACAGTICif ITAAA FCCCAAAACAGAAA
51 ACAGAAATGAC CG CAGCTCAAAATGTfGCTGT TGAC
24 GCCiTATIGTCIGTGCGCGGATTCICCIGCAAGGCAT
52 CGCACCTTCGCIGCACiACTGOGCAACGACAGCGAGG TCAT GCGGGC.GTGTTTGCA
GGCAACCACAGCGAGG
53 ITGCCGGITCGGAAGCCGCGGTACGC.GCATGACGG CTATGG TTAAATGCCGTCCGCGC
GGTACGCGCATGACGG
24 CGC.ATGAAGATTGGCTICCGCMCTGCTGGCGGAAT GCGGCGAAGATGAATGCCAACTCGAAGGCGTAACG
54 CG . AAGTTG ACAGCAGCITTCCCGCTA
CTTCTGCTGGF:GGAATCG
24 CGC.TTCAAGGTAGCCTTTTGCCGATGTATCTCCGCCG TGCACTGAAGCCGAGTA11CCGAC13TGCGGT1TCG
GATGTATCTCGGCCGGCT
GGCTACGGACACGGCA
24 GCGAAAGITTCCGTNVµAATATCGGTCGGATTAAITT AAGCCCAATGGGAGAAATCGTGGAITTGGGCTTTC
AATCGGGAATAGTTGGAT TCGGATTAATTTGTTCAAT
24 CCGCCGCCCAAGAITGGAATCITCAGGAAAGCMGC. ACGCTTCGACCGTCCAATCCCACCGGTCGGATTGCG
C11CAGGAAAGC1TrGCCG
CCAATCCGTGCAAAAACAG
60 C GACCGTITCGCGGCTGTCF CGCCGACAACCOCCT!
GAAATCCGTGCOGGCC GCATTGTCCGCCACGC
24 AC.GGITTCGTCGACGGCACGTCGGGGAIGATGGCGA CCAGC(CGCGATCCTGATACACGACAGCATATGCGT
TCGGGGATGATGGCGAC
AGGAGTGGTAACCATGCCGACAATCCAACGCGGCC
62 AATC TGAC ACCGITGTAATC.GGGCGG
ATTGIGGCTGCCGGTAATC
GTTAGGCGGCGTGGC.A
24 GCAMCCAATC.CCGACACC.GTGCCACGCCGAAGACG GGCGGITTTGATTTTGGAGCCGGCGCTGITGATTTT
GCCACGCCGAAGACGA
GGTCGGCATTAAAGACCT GCTICMGATTACCCTGAA
TTTGITTGATGCCGTGICC
ATGTCGGTGGCACGGC
24 TGCTCGACAAC.GGCGITCCCGTTCGGCGCAATAAAG
67 C GGTGTTGITCGGCGGC.AAGGACGAAACCGCCGC.AC
AGGCGGCTGACGGGAT CGTTCGGCGCAATAAAGC
24 CGTCGCCGCCATAAAACGCTCGAGGTGTATGGGCAG CGGCTGCCGTGTCGGTAAATCACAGGCCTAGGTrA
68 G ca CGGTTGAGGICGAAGGTG
TCGAGGIGTATGGGCAGG
CTITC.AGACGGCAITGGIC
CAAAAGGATGTTGCTGCC
70 (.3 TGC C
CCGCCCAGCCATAAAACAG
GCGGGCGTIATGACGGA TTGGGCGGATATITCGGC
24 GCCCGAACCGATGAAGCAATOC.AITTIGCGCCTGATT 1 GITCITCGTTICCCCCGAAGCCGGCCITCGACGTTFIT
ATITTGCGCCTGAITCCGA
73 GCCGAITITGCCITTGCCGCGCCITACCACA ICGCGC GCATCGGCC.Cf GCCTACGAAGCAAATCGG(XiGCG CCTGCCCGCTGATCCT GCCTIACCACA ICGCGC
ITGACCACGCCTTGAATCA
CGTCAGCGTAAOCATGPM
ITGCCGGCAACAACGAC
,XCATCCCCITCCITTATGC
24 AGTC.CGCTACACGCAAGGCCGCTCGTTGATGTAGCG TGGTGGAAGGCTTCGAACATCGCACCGTTGCCGAA
CGC7C0 ITGATGTAGC:GG
24 GGCTTCCAACGCGTC.ATCCAGGCCAAGCCTTTGCCTG GGCCGGGTTTCGACCGTGTGCGTTGCCGAAAACAC
CAAAACGTCCAAAGGCTC
78 C CC GCCAACiCCM
GCCTGC
GGAAAAGGGCAGCGGATTGACGGCGA ACAATTACCCGCAAACACG
CCACAACAACCGCATCC
TCAGACGACTITGGIGGC TATCGCGACGACTTCCATC
24 GACGAGTICCACCi ICGTTCGCCACGCTI T11 TG IG 17 CGTTGGAACAGGCGCA
24 CGAAGCOCCGGAATT IGCCIGGCAAAT CMGS:GAGA GGATG
ITGGGCCiCGGGCAT TACCGACGCGGCGAA GGCAAATGTGGCGACA IA
=
AACTCAAACAGGTTGCGG
24 CGGAAT ICCGCGCATITC3C:AGAGGICMAC ItGAAG GICTG IGCCCACA
ICGGICIGACCGCGCCCCIGAAC
ITS GCGG TT CTGC:CGGC.GCGCATAT
GGTCAAACTCGAAGGCGG
24 CIGCCCITICIGCAAAGCCGTICAGTGCiCiC:GAACOG
CCiCiGAACGGCICTCiCiC:AACCGCiCGACTICGGGOG
86 GT AlIT CCIAACGCGCACGGCTT
CAGTGGGCGAACGOGT
24 GAACAGGIATTC.CGGCGCGGACCGCAGGGATTACGC
ACGCTrAGGGIGTCTGATCGAAACCATACGCCCA ATGAIGGGAAGAAGGAC
SS CTG AGAC
GGCAGGATTITATGIGCTG
ICGTCCAAAAGCAGGAAACGCCAGCGCAT
TGTGATOAATGICGCGCA
24 AACGTIGTGAAAC:GGT1IGCGCGAAGGAATIGTCG
90 GCGTGCTGCTGCCGGAAACGCZAAC.ATCATGCCCG GGCGAAT TATTC.CCCCGTTGCCCG
CDOCGAAC.ATCATGCCCG
GGAGITGCCGCGTIGGAAGITGGCIGGTGAAACCC CGAIGTAGTAGGCGI GITC
24 ACCGTCATATGCCITATTGTCGTGGGTAAGAAGGAA TTC:TGCGGATGITTTICTCCGTAATTAATCCGCITAT
GGGTAAGAAGGAACAGGG
24 GCCCGGTCAMAGCGCAAAAAGTTCCGATACC.TTTGC GGACAGCCGGITCCGGTCACAGGAACGCGCCGTC.A
TTCCGATACCTTTGCGCC
CGTIGAGGAGGCCGTAAT C.AATCGGGGICTGACAGG
ATAAGCGGTTTCGGGATG
GATCCITAGAGACCGTGCG
CGGTTGCCAAGCGTCC
GGACGGTCGCAGAAAAGC
24 CC.GATACGGGGIAAGGCAGGATGCCTGIGGACGCA ACZGITATTGGCGGATITGGCGCCTTATACGCCGCC
99 AOC GC CCGATATTGGGC.GCOG
GCCTGTGGACGCAACC
00 GTTTGCGCCGCCG1C:17TGGGCGOGGTTGCCGTAT AGGCAA CGCGATIGCSATOMAA
GGCGCGGITGCCGTAT
CATGGGCGAAGGTGGAT
CCGCAACAAGAGGCCG
GTCCTTGACCGCGAACC
ITTTGOGTAGGGITTAGCA AAAACGCTAATC.ATAAGAG
25 GCTTCATCTGC:TTTGTGCGGGICAGGCATCGGGAGG GAAGC.GAGGICAGGCGGC.ATCCGGGATTCGAGCGT
CAGGCATCGGGAGGGAT
25 CGACCACATCGGACAATOCCCiC.GCGGCAGGGGITGT AGCCGATAATCAGGCGTGTTGCCAGGCATTACGCG
CGC.GGCAGGGGTTGTT
TGTGCAGGAATTGGCAGG CCGCCTGAAAGAATCCCAT
CGAGATGCACGGCTCAAA
25 GGCGGITAll. I I I tiGCCTGCCGTICCGCCCGAAAAT
GTGATTCCGGCTGTTCGCGGTCAGAACCGGCCGCA
GTTCCGCCCGAAAATCTGC
GCAGGAGAATCCGAACCT
GCCTCCTGGCCGCCGATTAACACGATGCGGICTTTG GACGATTICITCCTC.AGCC
TAAGGCTTCGCTGCGCCAGATATCGCGGCGG TT TCA ICGATACGAACGICCGTTA
12 AC G TGTCCi ITTCGCiOCGAAG C
TCGGCAAC.ATGAACGAAG
CCCTGCAGATTICAGCCG CGCCCGATTACAGGCTTG
GCTGCATGAGGACGCTCTATGACGCCCGTTTACCGT TGACGTGAAATCCGACATC
TGCTGAAGCACAATCTGA
TCAAACCCCCCTTGC6CCCCTGAcCCCJCG1TrFCG AA CG CCTGACGGCF
GGITTFCG
16 ATCGOGGCIGTCCIGCTCUGGGCGI GGAAATT ItGG GC (ST
TGGGCGTGGAAATITCGG
2$ AAGCA TCAACGGCTIGGAGGCCCGCCACITTGCCGT
CGGGCAGCGTCAACGA CGTCGAGGCAGTCCGA
18 ACCG (SAC GGCACOCCCGAAAGCA TI ATM
TGAAIGCCGACCG
ZS, CGGCAAGAAGCGGCGGTATGTTGCCTTGTGTTGCCG CGCCGCCCTGCC1TGTGAAGCTTACAACCGCGC-AAC
"fTGCCITGTGlIGCCGT
25 CGGACAGGAGGGOCATTCiAIGICiGGGTCGAAGGCA
ACATCGGGCATITCF TC:CCAGCTG7CCGACCTMCiG GGGGICGAAGGCAAGATG
20 AGATGG CGG AlTTCCGTACC.GGC:GGC
25 ATACGGCTCiCATCGCGGTCTGCC:GACAACCiAAGAA
GGACTGGTATCGCGCCA
22 ACAC.CGCCGCACCGTATCCCCGGCGCGTGGCAATA CZGAACGCCTCAAACCOGCCGGCGGACCATTTGCG
TGTCGGTITGCGCCGT CCGGCGCGTGGCAATA
as GTCAAATCGTGGTT TGACGGGCACAACGACGGCCA
23 17CCCACCGCITC:CCCGACGATGTGATIGGGGCGG GCC GATGTITTCGGCGGCGG
CGATGTGATTCGGGCGG
25 AAAAGCCGCiGCGCAAA ICAGTCGGGTCITIGGCCG
24 TCCGGGCGGTTTIGGCGTGTTCACGAAGCCGATGCC C GAACCIGGCX.CCCGATA
TTCACGAAGC.CGATGCC
25 GCCTTGAGCTTGTCGATTGCCEAC.GCATCGCACTTGC CCAAATGGGCCAATTGGGGCGCGCGITTGCCC.TCG
GCCTATGACGCATTGCAG
ACGCATCGCACITGC.0 25 CGCCCATCAACCCCATCACTGCCCGAT iCTIGCCGIT GCGCATATCCCATATG1CG
CX.OGATTCTTGCX:GITGC
27 GACCi GCGCGGCACTICGACCAATOCCCC.TGCCCGCiTTT C
GCGGCAGGCATATTGAGG
25 GAACAC:CAACGCGTA AA!
CX1GC.GGGAAAATGCGC
25 Gc.ciGGCAcGCGAcirrrIcTcMGcAcrGccATcAcc ATIGTCCGACCAT
ACGCCCGTGCTCAGCACGGGGA
CAAGCACTGCCATCACCG
GAGITCCTGITGTC.GC.TATTIGCATTITCAAATGGTA TTATCGTIGGTGAAGGAA CTAGATCGTCAAAGTTTAG
AGCA GGGAAGG GA CA
25 T17TCGCAAATC1TCCGCCCCGGCGAGATTCTGAACT GCGAAGAGCGGAAAATCAAACACC.TTGATTTCTTTG
CTTIGCCTATATCGTGAAA
C.GGCGAGATTCTGAACTT
ACAATACCGCCAAGCCG
GATGACCGCATCCAAAAC
CGCCGACGCOCGATTICACGCAGCATCGCGGTGGA C TCGACCCCGCCCTGTT
GCAGCATCGCGGTGGA
ZS CGGAAGACAAATCCATGCCGCCATCTGTGGCGCATC
ATCTGTGGCGC.ATCAAAAC
36 AAAACC CCLIGCCCCGATCCCITCIATGCFCGGGOGGTATC:C
GGICGATCFGCAGGCCA C
37 ATCAGCGC.GGCA. s IT.CCZCACAAGGIGCGGCAGT
ATTTGGCCGATGCGCTGCAAACTGTTCCCCGCC.GC CCCTC.AATGCGCGGCA GCACAAGGTGC6GCAGT
ITGGIGTCCGACGTGTATCA
38 ATCG GGGC TCZTTGGIGGATTGGC. TG
GCGTTGACGTGATCCATCG
25 GCTGGCCGGAAACCATATCTCGAAAACGCTGCAAAA C.C.CATTAATGCCGMATGCCAGAGACCGACAAAAA
AACGC.00TGAAGTTAAAC GAAAACGCTGCAAAATGA
25 TCC.ACCCGCACCCGACCTTATACACCCAAAGCCACTT TGGTTTTGCCGATGTCTGCCGGATCGGTGAAGTCTG
ATACACCCAAAGCCACTTC
41 CCGTACGACGCTGAACGGCTCGGCCIGGATGIGGC GC.AGTACCGCCCAGCAGGACGAC1TGGCTTGGGCG GG
TCGGCCTGGATGTGGC
25 GGCATCAGCAGCTCCACGCTAAAGICGAAAAaiTCG GGIGGIGCAGCCTGCCGAACTSCAGCAGTTOTTACC
25 CTCCGATCGGCAACTGGCMGAAGAGGICAGCGCGA TGIGGACGAZGTANCCGCGCATGC.AGCACGACAGC
TGTTTTCAAGCACATCGCG
GAAGAGGTCAGCGCGATG
GCCGCGATTTGTTCATCG
ZS CGCCGCGTTCGTACTGCATGGACACCATCGAGCAGC ATTTGGACTTCAAACGCGGCAGTTCGGCGGGGTAC
TGCTGTCCGTAAAAGAAG
GACACCATCGAGCAGCG
AGTGTCCCICGICGCC
TTGACCGAATGCTGCATCA
25 CGGTGAAACACGGCATTTGGTTGACGGC.ATCGAC.GT 1 TTGCCCATGCCGACCATAATCGGATTGCC.TGCCCGT
GACGGCATCGACGTGG
ZS 1 TCITCGGGATIGGATTCGCCGG TATCCiAAAGCGGC
49 GOCGOGCGGC.A.AAGTCi 1TCGCGGTAC:ACCACGCT 1 AOCGG , TTIGGCTGCCGAGCTGC CGCGGIACACCACGCC
25 GCAGOµGCAATGTCGGCATTACAGAAAGCCGGTGTG 1 AATCGTTGCTGCCGGCATCGTTCAGAGGACCGTTCG
50 GA ' GC GTCAAACCGTTACCGTTGC
ACAGAAAGCCGGTGTGGA
2$ CCCGCCCATAAAGCCATCGGCGCCGGAGAGTGCAAT AGCCGCAGTGGATTAAGAAGGGTGTCGAGCTGGGT
CAGACCGTTGACCAGATA
CGCCGGAGAGTGCAATC
TTTGTAGGCAATMGCGCG
25 ACATAAAGCCGGCGGCAAAC.AACTCCGTCGGCGAA
53 . 1 GITIGCCGCGCTG ATCCGCLTATCGGCAOCAG TIT TGG
GAISACAGGGCCGGTGA CCGGCGGCAGCAGTTI
25 CGAC.AGGCAGGGAGTCGGCATACCTGACGGICTT CA
GAGCAGGCGGTATGGTCGGCGATTTCCCCAACAGC
CCTGACGGICTICAGCCC
ZS CGG'16TCAACGATAAGGCACGGGAACATC1TTGACC TCCAGCATCACATACTCGAAAGICAIGCCAALGCTA
ACGGGATTAAATTGAATT
5$ AG I TCTGG TGCA
GOGAACATCT11GACCAG1 .
CCGACITTAACTTCAATTT TTGAGCGTTITTGAATICA
56 TICAGC GC.37 CC GC
25 GCAGCGCAACCTC:AAACCGAAACCTGCGCGAGTG1T I GCAAGGGCGGGITCGGTAAGCAAGAAACCGTCCGT
CCITICGCGAGIGITGC
25 CGAC:ATCTCGACGGCAACCGGITCAGGTIGTIGAGG 1 G6TIITGGCGGCAGGCGTATAACTACCTCAAC:GCC
Sil CGT G CA TGC:GTTGACAAGGTTGCC
59 . CGCCCTTGCCCAACGCTATAGCGOTATITTACGC1COG 1, TTCG C
GCGGTATTGAC:GGCGC1 .
CGACGCCAT TCGGIGTAGAGCGTGAACT
C.GACCTGCATC.ATTTCGCC
25 CGACCACGCTTC i GGTCGAGGGGAAATCGGCGAAAC AAGGCTGGCGACTIT CAC
GGGAAATCGGCGAAACGG
25 CCiCTITCAF6CGCCTGACCGGCTITAI6TGCCTC1TTT
62 TCGCCCTGCACAAGCTCCITTCGGCGGCTITGCG am CGOTATCGCGCGCITT
TTCGGCGGCMGCG
25 TCCGCCCiCGCCCICITTTAC.AGGATTTGGTGTCGGAG
GTTITCAGCCAMACGTG
CCCCGCCAAAAACGGC
CCGAAGATGAAG ?MCC
64 CG GGCCCGGGGCGCATGAMTCAACC.TGCGCCAACG GA G
25 IACAGCACCCGCCACCTCiCTTCTTT TCGGT AATGCTG
ITGTETTCGGIAATGCTGC
65 CG TTGTTCGACC.GCCGCGTCAGCAACCGCCTGAC.0 GTGTTCGCCGTGGTGG G
25 CGA 1 CTCATCCGTATCGCCCCCK:GITGGAGGC1CGAC
GGITCGAIGGCTTCGGAIGCCGMGATGCGCGTIT
66 GA GC C.GACCAGITTGGCGGIT
COTTGGAGGGCGACGA
25 CiGTAICGCCMCGGIATGGCT.T.111GATGGTGTCGGI GGACGAGCSGITGACCiAGCCGAACACGAGCMCIT
G1CCAA.AAGAC:GCAGACA CGITGATGGIGICGGTGA
25 GCAACGACGAACTGGCGCACTCTICGGTGGTCATGC CAAGCGCGCGCAGGAC.GTACCGTATCCAGCGGATC
68 GG . CG GCCGCGAATTGCACTICT
TCTTCGGTGGTCATGCGG
C.GCCGAGTTTCAGCGTG
25 CGGGC.ATCGTTGC.TTTIC.ATTCATAGTAACGGATGTG
AATTTGCAGGTTATCACTATGTGCAGGCAGAATGAA CAATCCATCAAAAGGCTCT CATAGTAACGGATGTGGA
GGCGAAAAAGGCGCTGGCGTTTCGGGTGGCGCAG AGGGGAGC1. :GGAAGAAT
GCAAACTGCCCACGCC
GAACAACICCTGTMCC:TG
ACAAATICACAOGGGCGG
2$ CGGCGCATTGGCAAAAGGTCGCTTTATCAGGCGATG ACITTGGIGGCGACGACTTCGTCCCGMCGGCGTG
GOTTATCAGGOGATGGA
74 CiAGG C CCGGCGGCTIITCGATT GC;
GTATCGGCATC.GITACCAG
75 ACCAGT GCGCCCGAAATCC:CCGTCAGAAACTCGCTGCOCGC
CCCGTCCGTCCGCAAT
2.5 CGCACCACCGGITCITCGITAACATTGICIGICGCGG
CGCCATCGAAGCCTGCACCiCGTGAAAI CGGMCGT
GATTGTCTGTCGCGGCCI
25 GGCGGGAATGGCGGCAAATATCATCGGC.ITC. TGCST
ACCTGAAAGACCTGTICGGCAC.ACCAGGCCAGCAG
ATCATCGGCTTCTGCGTG
25 CCAGACCGATATGCCGTCGCCGCCATATCCGACGCTT GCATTGGCC.GTCATACCGCTAGOTCGAGTGCTITGA
CGCCATATCCGACGCTTAC
GICGMGATACCGATGC.G
79 GG1TTGCGCCCGCATGIGGGOG1TfACCGGITCGAC GGG AT
GGCGTTTACCGGTTCGAC
CGITICAAACGGCGCGG
25 GCAGCGAP.CACGATACCCGTATGAGCAGGGCGCATC CGATACGITCGGGGIGGATGITITGCCAACCTCAM
GAGCAGGGCGCATCGA
GGAAAAAATCCGATGCCG
GCACGATGGAAGCAGGAA
ZS GCGC.AGGCGATGGTGGTGTCAACGCCGAAATCGTC
ITGGCGGCCIGCAGT
84 TAAACGG TATTCCC P.ATTGCCGGGAATGGAAA
TCCGTATCIGGITAAACGG
TAATTTIGTCGGCGTGTTG
AGGGTCAGICCGATITCGT
25 CCAGCITCGCCGAC.ATCITCTTICTICGGTCAACACGA 1 GGGCGATAATGGCGCGGAAGTCITC-ACGCGAAC.AT GMAAGCCGGTCGACTTG
TCITCGGICAACACGATGG
ZS AATCCCGCCGAAGAAACCGATATATAITCGAGTGCG
GCACTITGCCGCCGATGITTTCGAACOGCAAGTTAC AT ATATICGAGTGCGCGCT
BB TCCACCCACGGCGCATTCGTCGCCATCGTGTGTFCG G TCGCGGTAATGCACTGGG
TCGCCATCGTGTGTICO
ZS AGCGCGCCAAATCGACCGITTGGCAATCCTITTACGC TATTGCAGGAGTTCGGCACCITATGCCGAC.AGTTIG
TTIGGCAATCCTMACGC
TTTGGCTGCTTCTATTGGG
CGGCAGGCGTITTCFACC
CACCAAATCCGCCTOTACG
ACAGCCTCGCCACCAACGCACAGCAGCATCGCGAT
CX:GGGTCGGI ATGGACA
2$ CGTIGGCGCGGATTITGGCGAACACGGCGGAAACA
93 'FTGICITGCCCACGCCCGGCGGIGGAAGGCGGGAA OCiC
GCGAACCiCAAGCCGCT CGGIGGAAGGCGCiG AA
25 GGATTTGCGCGCCCAAGCFCGC.GCCGTCTGAAAATC GCCGTTTCGACAAAGGCGGCCACGGATTIGTGCAG
94 C GGC rrACCT3CCCCGACA
CGCGCC:GI CTGAA AA ICC
AAACGCCAGCGACGAAGTGCITCCTFCATCAGCTTG
9$ 1 CIITGACGAAGCGGCGCA TIGCCGCCGCCAAATCCA CMG GCAAGCCGTGCF GT ACCF
GCCGCCGCCAAAICCA
25 CCAAAGCCCITGATCGCGICCGGTGTGGIGCACiGAG GCAGCGACCAGGAATGGCGACGAAC:CCITCAGCGT
96 GA GATG GGGATGCGCC.GTCTTC
GGTGTG GTGCAGGAGGA
97 GATITX.CGCCGCCGTCATGCATACGCGCCATCGT AGCCCGTTCATTTGCGCCACSGCGTACTCGTCCGT C
GC,ATACMCCICCATCGT
f3GCACICAAATICACAGT
ATC:GATTCAAAACACTGGC
TGCiCAGCTG ITCGITGCITIGG CAATCATGTTCGCATICiG
AACCCAAAC.TCAAAGCCA
26 GCCATTIOCCGAIGATTGOGGCCC.CCGGCAGGACiC:A TGTACGGCAAAGGAGACGGCTG
TAATCGCCGAAAC
00 TT GGTCT CCGCAGCATGAGGCP.AC
C.CCCGGCAGGAGCATT
26 TGTTGTCGTTCiTGGGCGGGCGGCTTTGTGCAGATTG CGGCGGCAAGCGTCTGAATGTTGGACTACTATCCG
CGGCTTTGTGCAGATTGC
26 CCGCiGIT TCACATCGCCGTAIGAGCGACGCTITACCG
ACCGTATTCATIGCAGGIGCCGATACAGTIGGGCG GAAGAAATCGCCCTGCFG
TGAGCGACGCTTTACCGA
AAAGCCAGTTGGACTICGCCCCCiCAACACCOGCAT T ACiAGCiGCITTGAGCA
26 ACITCTCGGACGTI CCM TCGGGCTI CGACICCG 'TA
GCACIGCTGGTC:AIACTG
04 GCGCATCGATITGGGCGCGGAAGCC.GGCGMGAGT CCG C
GGAAGCC.GGCGAAGAGT
OS CGTATGTCCGGCGCGTCAGGTCAGGGC. TTC-CiGGCGT
GTCGGTCACCAGCGCCACTACTGACCICGCCTITGG GCGATGTTGCCGCCCA TCAGGGCTTCGGGCGT
26 CAGCGCGAAGTGTTGGCGGCGAC.GCAGATAMATCG ACACGATGGCGGGTGCTTCGGIGGTTAAAGIGICA
CGACGCAGATAMATCGC
CACTCTTGGGGTCC.TGAAG
eg GT GAAC-AG CCCGCTGACGATGTGTAC
GCGCTCAAACCIITCGGT
CACCCCCATCGACCTGAT
GCACIGATTGGCGGGCCATGC.A ATCCGACCGAA ACC TITC.C.GATAGATTCCTGC.C.
CGGTCGGLITTITCCIGC
TGAACAGGATGTTGCGGA
CGACACGCTCGTCCGT
CAAGGATGCGGGCGATG
TTGCTGACGCMCGGIC
26 AAGTCATGCTOCATGCCGTCAAGGTGICAGTGCGTTG TGICGCGGACAAAACGATGOTtCGACITTCGCCAAA
TCGAGGAAAAGGACGAA
AGGIGTCAGTGCGITGTG
GCAGCCCGCATGTTITTCC
26 GCATCGCCAACCGCCTGTTGACGCCGTTliffiiiCAe-CGGCTGCGTTIGGCGACTACAACTOITGCAGCTCGA
ACGCCGTIbilitiCACGT
26 GCGCACCTTATCGCCGAACAAACGAAACGfelfeiGC ITTICCCCGAGCGCGGCATTCGAAGCCGCCAATCAG
CGAAACGTTMTGCAGCG
20 AGCGG TTII.1 CCTACGCGCCC G
CAGACGCGTTCGGCAAC
GCTFCGGCGACAACAGC
AGCCGTATTGTTTCATCAT
CCTACATGACTACTTGGGC
CGCTICGATTACGGCGIT
ATTFICTOMIGCCTGCG
TTGGACMCGCGCMIGG
CTGTCTCAGTTTGATTTTT
GTAGGGGAGGITOTTCTG
ATGCTGAAATGGACTCCT TATCGGATAACATCTCCTT
GTAAAGTTACCCGAGTTA CAAATGTGGGTGACAATC
GAAGATAAGGCTITITTFA
AriGGGIATCTAGGCCACT
TTGAAACCACTTCTGAM
ATATCAACCAACTTATTGA TCCTGATTCATTTTTCTCFG
32 Torrai GCAGTIG GATCC
AAACAGTTITTTGCTGGIT
CTCCAAAGTAATCGGrfCG
CACGCCAAMCGCAAGGT
GCCATGAGCTGCCCTACG CCTCCGCGCTGATGCA
26 CITTGGCAGTGGCGATGlIGCCGTGCCITGGGICAA CCGCTITGGATTIGOTTOCCGTCGAACCCGGACCGA
ATAGCCAAGGCGTGATCG
CGTGCCTIGGGTCAAAGA
CGTGGGIGTGCCGCAA CTGACCGCCATTCACAACG
TACCAAGGCCGCTCGTAC
26 GTTCGGTCACCGTCGGGCAACACTTCCiGCGGTCAG
CGTCCGCCAAGATGCC
CC.TGATCGGCAAATGGGC GGACGCGATACCGACGA
26 CCGATGMaITGCCCGC1TCGCCTGAACATCCCCACC COGGCAAACATCCAACOCGACCGCCGATFCGATCA
GCCTGAACATCCCCACCA
26 GrCIGTCX:CTGCCCGGCAAACAATICTTIGCGGCCT
GACCTGCGGCTCGTW
GICITATTCGGCAACATCG
44 ATCCTCGCCGTCCTCGAGCCGGGCGGACTTCATCGTG . GCCGACACGTTGGCGCAATTCGCCCACCACCCGA
TTGAGGCTTGCGGGCA GGGCGGACTTCATCGTG
GGAAACGCCGAAATTTTCC
GCGTTTGCGGTAGGCG
TCCGAACCGTCITTCTGTG
26 TTGIGCGGACGCGTGGICGATGICCAACAAGAAATli ATGICCAACAAGAAATGCG
TCCAAATCCGCCTGCGA
ATCGTCAATACAGTTCCG ACATCGGATGCTCTTCCAG
TGTCCAACTCGAGGGCG
CCTTGCACGACGTGIATC GGTCTGACCGTCTITGAAG
CGACAAACGTGCCTTCCG
TITTGCCGAATTGATGGC
TCCAAATCAAACZCGCCC
26 GGCGGCATCZAAGGCITCGCGTCTGAACACCCITS.0 CTCGAAACGCGCGTATGC CGTCTGAACACCCTCCCG
CATCGCTTTTGCCGCCG
SS T TGCC TTGTATTTCCTGCCGCTGC
GCACGGACAGGGATCGT
CGACCAACCGACCGATIC
26 ITITCGCAATCGCATTGACCGCTCG/V\GCGGGTrTT
GGCGCAACGGOCTTAC
GCTTIGGCACTGAAAAGC
GTTACTTCCGATGCGGCG G
CAGITCTICTICGGTTTGC
ICGCCTOGGGAITGAACA
ACACTTTCGACGACAACC
CTATCTGAAGTCCGAGCGC
TTCATTGCAAAAACCGACC AAAACTGTACAGCAGGCG
TGCCCITCATTGCCGTTGGGGAGGGCGGTITCGTCA
TTITCGICATCCGTTACCGC
26 GCCIaiCTGCCITCGAGAITCCIACIGGIACATCGAC GCAGAAAGICGGCGAATACCGGGITCGGCAGGCA
TITCAACCGCAAAGACAA CCTACTGGTACATCGACAC
63 ACG I CCITT , GC
OCCGAACACCTGCAACAA
AAAAACGCCATCCCGAAC AAAACCGTTCGTTACCTCC
GCGACAAATACCCCGITT
TTGAGCGGCAAAATCCAC TCGTTTATGCTGATCCCTC
26 CGGGTAATCTITCCGAAACCGTTTCAATATAGCCAAG GTCAGACATCGGGAAATGCTITITICAAAAGCJkGAT
TTCAATATAGCCAAGGGGA
CATACCAACGTTICCCCCA
GCCACTGCGTGTCCATPC
TCGICAACAGCGICGICG
CACGCGCTGAAGGAGGT
ATTGCCGGGAAACACGC
ACTGCCCAAAAGCGTACC
GCCCICGATAAAACCTIG
GITCITTGGCGGCTTCGT
GCGGCGATTATCATCTGCA
AAAATCGGTGCGGACGTG
ATGAGTCCGAGGCTGTGG
TCCGAGTGCTTGAAGACC
TTGCCGATGITGCCGC
OGAGTATGGTGGCGCG
IGGIGTTCCAAAGCAGTAC
ACTGGIGGAAAACGGIAT TGTATTCCTGATITCCGGC
82 GGCC . TGTG GC
26 GGTACTGCTTCCTTTGCCGGCTTTGTTATCGACTTCAA ACAGGCTGGTTACITTGutttlIGCCATCGTGACCA
CITTGTTATCGACTTCAATT
GCGCCGCATAACCTCTT
AGCAGTAGGGICATCAGG
GACAGGLAGCGCATCG
TCGCCCTGATAGCCGTC
SS C ATG CC GAF
ATGGGCGGCGAGICC
GGGGCGAACAACAOCG
90 GCMAGAGITTCGC=GGCZGCTGCTCTGCGGCGGAC.A ATATGTGCCGTGCCGCGCTGCCGGTCGTTGAGTTCG
ACCGCCCTGCCGTTCA TGCTCTGCGGCGGACA
26 CGGATATTTCTGCCTGCCCGTTGAACCCGTATACCAA MAGGGGCGAAGMAC.A3GCITCGCGTTG1TGTA
91 CC GGTTT , TCT
TTGAACCCGTATACCAACC
GCCGATCGTTACTTGTTCG CGGTAGCTAAGACGACGT
AATCGACCAAATCCTGCG
CATTTCGCGCGCACCC
CATGTCCGACAACGCTTTG ACCTCTTCGCGACGCT
26 AAATCZEGT-Cr..CGCACGCGACGAGCAAATGAAACTG
C.GTGGGCTTACCTCiCCGCTCGATGTAAACCTCGGGC ACCAGCAAATGAAACTCiG
GCTTGCCAAAGAAGGATT CGGAATCGTACAGGAAGA
96 GACG ClIG GG CC
CGCGTGGCAGGAACAC
AAAAGGCCGCGCTCG
26 ATGGITTTGCCCGGCGAGTCGAAACTGGTCGAATGC ACCAACAACAGC.ACCTTCCTCGCGTTCTAGGITCGG
ACGCCCTTGAATATCGTGT GAAACTGGTCGAATGCAA
27 GGGCAIAGGAATAGTCGATGACGGATGCGCGMai 1 ATCTGGCGAAGGAMTGC.ACCiTGCGAAATACTGTC
ACATTCATGCGCCITTGAA
00 GCA TA 1 GcaiA I C
GATGCGCGTTTCGGCATA
t GAGMTGCCA
02 CfCGC 1 GITCC , TGCTGCAPAGCGIGGC
CGIATC:CGCACTACCTCCiC
27 GGTCAGCAGTGCCAMACCCAAGGCTGCTGCTGTCC 1 CCGCCCGCCCGATTGTCAAAAGGC.AGGGCGATCAG
02 CT ' C CCGCCATTCCTGCCGA
GGCTGCTGCTGTCCCT
CCTTGGCAGGGATAGATG
MCGCACCCTCTTCCGT
MAAGGCFTCGATGAAGA
OS . GGTVIGGCGGCGGAAACai TCAGACCGCAGGCACC GACAGA CAACCOC:GOCAGAGC1 'FCAGAMGCAGGCAGC
27 GCATCGTATTGACTGCCITTCAACCGATTICCACAAA AGCCCAGTATGAC:AATTTGCTGTGCTGATTATCTGC
TTAAACATACCGAGTGAA CCGATTTGCACAAATTAAC
06 TTAACC CGTAAA TGCi C
TGCCCAAGCACCCAAATCATATTICTITTGCCGTGAT AAAACGACTTITACCAAA
ATTICGGATITAATTGACC
27 .AACCGAAGTCAAACCGCCGGGCGATGAAGCGTGGG CACGCCAATGACGCGCAATCMCGGATTCMCGC
ACCGCAITTITITGACGGA
OS ATT CAC T G G4:GATGAAGCCi TGGGA II
AAGCATGATTTICAGGCTG CTACCATTGATATTGTGIC
09 1GTCTTGCT 1 CCT ATTCTGAATACCGICCG Tf CI
IIC I GTGC CC:GCATTCAAATCATGGG AA ITTGGGCAAAGACCCC
CCCCG GCTCG C G
IGCGTGATTATTIACGCGGC sUCTITGATIGATAATCGTG
11 . TGAMCG 1 GC. CTGACCGAAAAGCACCET AAACG
=
27 TCGGGCTITTGGICATTITCAGAAACAATTACTGGCO GCAAAC:AACAACACTCGGOCCTGC=TTG
AC.AATTACTGGCGTGTC
27 CCGTCT IGGITTTCCAGTGC.AAAAGGGA 7 ACGCAACC CGAA A ITT TICATGGI
AAACGAGCGGATACCGCAAT GCAAAAATTCCAAAGGCA
AAAGGGATACGCAACCAT
27 TCGAATF GCACCGIGAAIGCGCCCiCCGTACTOCIGG GTCCGAAGACGGCGGC
ITCACAACAGCCGAAGCCG
CCICCGTACTGCTGGCT
27 GGTACGCAACZGAACACiCTCTGTTGGCTATGACTGC
AAGCGAAATTCACGGTCG
TCGGIGGTCC.GCTCGGAAGACACC.GACGCTGACCT CCG A ACACCGACGCTGACCT
ICGCICTACATGATTAGACCC:A TAAAACGTGGCAGATACA
16 C ACATGAGiTTG (SAC
AATATTTCCGTTGCCGTa:
27 CTCAAAAAA TACCAOCCCCiAGAGTA TCTGCCAGTFT T CTCiGCiCAAACIGGAAAAACGGATA
7 ATI ATCiCTGA
TATCTGCCAGTITTCCTIGT
27 GCGCGITGAGATAGACOGCGAGCCGAAATGCG (ACC CI GCACGACGGCAAAATC
18 GGC CGOCAAACiGCGTGCACGTAATGCCTGCGTCGCG Ci CCCiAMTGCGTACCGGC
CACGCMITGTGCGTT GGCTCGCCC.GAATCCA
27 ACGCCGTCTGAAAAAACCTTITTGITTTGCAGTAAAT AMCGCiTITTICITTGGCTICGGGGATAMACCGCC
ITGITTTGCAGTAAATCGA
CGAGA . TACGC TTGCAAGCCTTCATCTICG GA
27 GCC,ATGCGCCCAAACAGATCTTGICCITTGTAAGCGG GCGGCTGCCGTACAATCAAAATGATATGTCGGGCG
TTGTCCTTTGTAAGCGGCC
AAGCAGTCGMAATCAGG
CCGGCAATGATTGAGCGT CG
CGCCGAGCAGGTATIGAG AAGGTCGAAATCGCCAAA
TCCAGGCGTCCCCACiTTGTCATCACGCCCAACCTGA
TCGGGGAATCAGAAGCGG
TGACGAAAGAGGCGGAAC
A GG GACTGGCC-CCACGACA A
DEMANDE OU BREVET VOLUMINEUX
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVET COMPREND
PLUS D'UN TOME.
NOTE : Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets JUMBO APPLICATIONS/PATENTS
THIS SECTION OF THE APPLICATION/PATENT CONTAINS MORE THAN ONE
VOLUME
NOTE: For additional volumes, please contact the Canadian Patent Office NOM DU FICHIER / FILE NAME:
NOTE POUR LE TOME / VOLUME NOTE:
GAGGGICACTICGATCAG
t ATGCTGCMIGCTACTAACATACTCGCCATAGCAA CATTAGCTTITCTUXIATA AGGAGAGATTTTGATGAA
1 CAAGCCi 1 ATACA AAA F GT , CfCAA GCG
CCACTAGCACAGGTTCCAGTGCTGACCCGTGTTGGA AGTTAACCCCGTGGAACC TCTAACTGAGTCCACAGGC
2 AGGCG ' GGAAT T G
CAATCCATTGTITCTAACT TTCGTTTTGCATTTGATAAT
GGITTGAAACTACAAGAA CATACTATAC.ACAAGACAC
99 GGATC.AAAAGATGTATCCTGTCTGTAAAGATACAGA CCAAATGMATTGTAGGATGC.ACATCATTTTCTGCA
AAAGATACAGAAAACAGT
5 . AAACAGTAATGC CATGGAAC TACGGAGTACIGGICACC AATGC
TCCAGIGATGCTCAATTGITTAACCCATFATFATGA CAAAAATCAGTMACATT
6 TITACA. TICCAGG CX:FTGGGCT GATGACAFTCCACGTGCA.
CCAGG
CAAACGTAAAAATTGAGGTCTGACCITCCTIGAATA MAGATCAATATTCCTTA
7 CAACA CfCiTGIAAAAGT GCiC.AGA
CCAAGCTGCiGTTACAACA
TCGAACAATCAAGCAGTGCCACTATCTCTGTTATGT GGCTOGTTICTATAACTA GACAATATGTTT
ACTCATG
8 ACTCATGCAGA TCCT17C rrGc CAGA
ATCGTGCGGICTIGGTAAATGTITGITGTTCAAAGT CTTICTCCAAGGACAACTC
9 AC:AACTCA 1 GGITCC GGICAAAACCGCAATGTC A
FTCTGAG
ATAGCATGGICCAGCTrA
10 ATACCGGGGGTAAC:AGGAGCCACATIGGTC:CACTGT 1 CCAGA AGITAGATG TOTE
GC.AGAACAT !TA 1ATAT AT TCAT TAGAGAAGGGAA
01 . CA 1 AGCACGGGICTA AATTGTC
CACATTGGICCACTGTCA .
10 AGCGCGAATCITICHCiAlTGTGGIGGGCCITTGACA CIGGTIATGAGACITICAGGOTCGTGACTIGCCFAT
TAATAACGAGAGAGGOG
GGTGGGCCTITGACAATG
10 GGTGITTIATCGATIGITCCGGIATCiCITCGAGCACT AAGAACTGACICCCGGAACCAGITCGT1IGTI6C11 GCITCGAGC:ACTAATAAAA
10 GICX:ACCITICACCATC:ATCTGAGAGATAAAGAAGA (IAA
ITCAAAATCiCACAAGCSIGTGGF TCAGTMCIT CCAAAAGAAACAAACCCC AGAGATAAACiAAGAGCGT
04 GCGTCTAC GICATCACITTG AAC C.TAC
10 TGGGATTGACTTTGITTTTGTCTGATGATTCAATCiCIC
TGAGATTCCAATTAAGCAGACCATCTCATAATC.CTCF GTGATGCAGATGTCAAAG TGATTCAATGGCAAAGAAA
OS AAAGAAAACC GCTGTGTC GA ACC
10 CCGAAAGTGG TGCATAGT 'ICI TAAACCTITAAGT n TCTCTCCTATCATFGTF
GGCA.ACITATAGTAACCGGI CACiGTGACi TACAAGT TC.A AAACCFT1'AAGIT11 GGCG
06 TGGCXiC. GCTGGIT AAC C
10 CTACAGCAA 1 TICTCCAA TGACAACAAT ;GATT errea ACMGTCGATITGAAAATCiAGAGGTGGIAGAAC I AAACGM GGACT1TA TAG AAT TG ATTU TOTTA
ma TCAGTACC1T I GGAGCAAGTAC:CTTAT AA
CTIGTGGAGC.AGGTGTAC
TGAAACGGAGGATCATAACTCFGCGCFGAGGAGAG 1 AGATGA ACT AT Tr GAAG ACTGATGGA
TCTGATG ref AGC.ATTGCCTACAGATATGCCAATGCFTCAGAAACG TTACCAG ATTGGA TAA AA
10 CTGCTA . CAGC AGACAGA
GTCATGAGTCAGCTGCTA
10 CACCCAGGATAGGTTACATC.ATCTAACAGATGTAGTA CGAGGCGCATTAGATGGTA
ATCTTC.AATTGA GTAG CATGGCAAACITTATITCTA ACAGATGTAGTACATTTTG
11 CATTTTGGTT GTGCTCTA TGATGA Gil TACTAATAAAACCTGGACA
AGCACCTCACAAGACACC CTCCOGATCTCTICAAGGG
TCTCCAGAAAATGGGGTTGCAATTTAATAGTGTCA T ATTGTAAC.ACTCAC:TGAAA
ACCAGATACAATAGTGACA
14 AGTGAC.AACT GTGCAAACA CAG ACT
TCCTTTTGCAACAGTGGAA TAACATTTGGTGAGGAAAT
AGGAAATACCA AAACGTCTA A ACCA
IGCGTITCAGTACACTAGGATGTATTACCTCTAATAC TTICAGATGTCfTGGACAAAGTCAGCACCITTGCAA
TCTAGTAATTCTGATATTC TACCTCTAATACCTGGAAG
16 CFGGAAGATCC CAGGCTT Cf TCAG ATCC
10 TO IT IGGATC1GGAAATTGTAGTCTTTACC.GATGAGT
TFACCGATGAGICTAAAGT
17 CTAAAGTAGC AC.FAATC.GC TGGATG AGC
"TAATAAGTAIGTGGATGCT
18 TGAGA GTGGT TAGGAGCAAC.AGGCCATC GAGA
10 AGGCC.AAATACTGATTICATTCGTATGGAAATICAAA AAATGGGCAAGGATATTTATGGTGATCTTGCATAAA
TGGTGATATGTGTGAAAT TGGAAATGCAAACTITGAG
19 CTTTGAGC GCTGTTCCTT , AGGATT C
CTCC ATTTCGCGTATT TACAC
AGTGGCTC.ACTIGTCTCC
AACATGCCCTCAAACTITGGTTITTGTGCTACGTTTG CCATACGAACAGTATACA
TAAACTTGAGTGAAAAGTT
21 AAAAGTTAACTGG ' CAGT 11TTGG AACTGG
10 GGGAATGIGGTGCTTATGGCAGATGTC.AATCCIIACT GGAGATCCTCCATACAGCCATTGIGTTCTGTTGACT
CATGTCAATCCGACTCTAC
ACAATGGAGGCCTIGGGTT TAATGAGCCAAGTGGATA
ACAAACAGACTGTGTCCTG
ATC.AGGAAGGCTAATAGA AGGATGTGATGGAATCAA
10 ATTACCCCC.AACCGGAAGTCCGTTGAAACTTTAGCTA CCAAACTGGCAAATGTTGTGAGTGAAAGAt=AGCTC
CAAATTAGAGGGTTCGTG CGTFGAAACTTTAGCTAGA
10 TCTCTTACTCTCGAACATGTATCCTCATCGCACCAATA TGAAGCTCCGAACACAAATACCCATTGAAATACTfC
TGGTTCAGAAACATCCTG CATCGCACCAATAATGTTC
ACATGCTAAGTACGGTTTT AGTCTCGGTACTGAATCTT
CCC.AGATGGCTCTCCAATTGTTCCTCTATGGCACCTA TGGAGTGICIGGAATAAA
28 AATAAACG 1 TATGT CATGGAGOTCCCAGT11 a;
i 29 C:AGGACT A 1 AAGTCTFC , AC A
CCCTAGTAGTTCATATAGGAGACCGATTCTGGCATC GAACCGCTCTATTCTAAAC
TAGCCAAAGGGGAATTCTT
CTFG ' AATCCGG AC G
AGTTGCCAAC.AATTATTAG
AGACGTGACAACCGACTA
CCTGATATTGTITTAGAAG
10 ATGCACCGCTGAAAGTCTCGAAGT1TTATGCCACCCC AACACCAGAAAGACAATCCAAAAGGT1TTC.AGCT1C
GACCTGGTTACAGTAAAA
33 , G ATCITCAATCC CGA
AAGITTFATGCCACCCCG
ACTGGTGATTCAGTATACT
34 GCCACT G AAG rr AAAC TT CAACCT
GATGCCACIG
10 GCAATTTAGCATCACCCCTG TATCGTFGTAGCTCTGC CCM TAAAAA ICC!
GAACAAAGGCAATGTCCCATA CAAAAATATE.CAGATCPCT
ATGGA GAGTACGTAG ATGAGT GITGTACCICTGCA FGGA
10 TCCAATTCCCAAATTGCCAAAGTATAATTTGAAGGCA CAGGAAGAGG.AACTGGGGGTTGAGTIGGCCTTCCA
AATFTGAAGGCAACACAAT
AAGGC
10 AAAC:OTICCAAAGGIATC1CTTCAT MT TAATG TAT IA TCAAACCT iTGACAT
11TAGAGCCAACAAACCTAI TA TITGAAACACAGAITAATC TG 1TAATCi TAT TAG 1TGAC
TACTCTTATTGTAC AGATTITTIGAGICAACCI 'WI TCAGTT !GAM ATACA
39 , AGAAATCCC CTTCTGAG TCC AATCCC
.
TG CCAG CC.ACCATCT ACC. AAGTACTCCFGTTGCC.AG
10 CATGACC:AGTAGATCCAACACCTAITGGAAACTAG I A
GITGGAGACAC:AGAAA.ATCCTAGCFGTTIGTTIAGG CTCAGATACAGAACGTIT TIGGAAAC TACT
AGGIATA
41 GGTATAGAAGT ATC.TATAGACAAGT GGT GAAGT
10 AGGAAAGITIGCATTACCAAA ICCIAAGTG TCCICCA AACTACAACAGCiACAGAGCACiG
ICAGOCCATAAGC GACAAAAACAGCTTAAAT AAGTG TCCTCCAATACAAC
10 CGTTACAAAGCAGATTAAC.AGGCAAAGGTTGATATT AGGATTICITCAC-AAGAGGCGGCAACTGAAACAATA ACTC.AGGATGCATGGTAA AGGTTGAT ATTAAGCiACAT
GGACTAGGAAATGAAGCTGAAGATMCATIGCTAT 1AT1TTAGCCCIGCGAACi AGATG TAGATTTGAG
MCC;
CGTCGACGCCTAGAAGACGACG1TGlICICK:CG11 GACGAAGACCCAGATCATC
as TCC GGGC AACCGAAATCGCACCACA C
GCAAACTAAAATACAGGCGCMGCCITC:AGCAACC TCTCTCAAGACTTGGACG
CTACAAGAAGAGGCTCGG
PGGTGCCAATAGACGCCCGCACTACIATITCAG
GGAGGTGGTGGAAGAGTC
10 TATAGGGGC.TGCTGGITGC.ACCCGCTGITTCTAC.AAC
CCTAGACGGGTAGCGACTAGCTGTAGTGGCCACA A AlTATAGATGTAACTGGTC
48 T . ATGT CTAGA
CCCGCTGTTTCTACAACT
10 GCTCTACTAAAAGCTCTIGGTGTAGGAATTTAATATT TGTACAATAGAAGAGTCAGAC.AGGTAATITA
ACTG GGAC.ACTTITAATGAGCC AGGAATTTAATN/TGAAGA
10 C.A7GTGAGTTACTGAAATCCTCTGTGAAAACAGTG AT
TCGTCCTTAATACTGGCTCfCCACCTITAATGCTGCT CAGAAAGTACATTTGTAG
SO GCACCTT CCAC ATGCA
GAAAACAGTGATGCACCTT
10 GGCAGC.CAATATGACATCTGTAAAGTATTACATCCCA TGCTTCFGGTAAGGIGTATMCCAATC.ATCTGTACT
GTAGGACCAAGTATTTTA GTATTACATCCCAGTCATA
10 ACCCTATTGCAACACGTTTACCAGACGGTITG1TTAT ACTICTAAGTCTfC.CAAACGCAAAGCAAGTAGAACA
52 CAAATTGG AAGTTCAC.AT TTAGGC TTGG
AATAAAGGICTGATGAAT TTITCCAATCTTCTCATCTG
GCGACAAAAGGTACAAGGTAGTCTACTACTCCTGTA TTCAGGATCITGCATCGITTGAACCATAACAAAATG
ATGGACTTGCAGCAATCA CTACTCCTGTAACTATTACC
54 ACTA T1ACCCiA GICCMCCA C GA
ACCiAAGICTIGTCACCAGA
10 CCACIGTTCTCAAACAATCGICTITCiCAGACATAGAll CAGTGCAGAAAAIGAAGMAAGCATTGITATTITC
GCAGACATAGAMGAGTC
10 TGTAGCATTTCCAGTTGTACACTTATATTCAAATTATG GCATAATGCC.AGAGTGGATAACAATGACTCACTAG
TGITCAATACTAACTGICA TATTCAAATTATGIOTGAC
AATACTGGAAAATC.CTACT GTATTCACTAATACGTMC
10 GCTTCTTGCACAGCATCGAACTC.ACCAGAAGAAGAG
AAATGCTGCAAAAGACATTGATACAAC.ATAATAAC.A CICACCAGAAGAAGAGCA
TGTATTATATAGATGACTA TTGATGAAAAAGCACAAAC
10 CAA TTGICITTGTGCAGTGTOTTAATGGGTAGATTA ..
GACACTGTCACACTACCAMAGGAATTAACCCMT ATTTGAACRZFEATCTAC ATGGGTAGATTATGATAGT
TGGTACGACCTACAATAC
TGGATGCMGGGACCTAC
1TAACATCTAATACACGCA ATC.ACAATGCCAACTATTC
64 ACTATTCCT A.ATATTAGAAGT CTC CT
CTGTTGGTCATCCATATTT
ACAAGGITGTGGTGCCTA
10 TCCCCATAGGATTCaTTGICATTTGIGTACCTTTGGA 1 GIGTACCITTGGACATAGT
66 CA TAGI GG reic:r CAGG OG
TGGAAGAAATAGAGMG
67 CTGATG I ACAGG ACC!
*FTCCICIGGAACCTGATCi TGITTGACTCATGGGGTCAGTCGATGGACAGTCTAA TACCTCAGTGTTACATGTT
ACATGITITGAGCCICTOC
AAGTACAGACAAACTTTG
ACAGTCITTGGCTGCAAC
CCATCTCGGGGGTCTCTAG
71 CAC:GCAATGGACCCCiAGGGATC.ICICTGGGCCACACA AGGITC CCCIGGCCIGGIGTIT ..
ATGCTCTGGGGCACACA
73 T AC4.3 GAGCi IGOIGCGGGACA
GCCACGTACGCOCIGT
GACTCGACAAACTCGCTG
TG111.3GTGTTGACGAC6G
10 GCGAGTAGTTGGTGATGCGGCGCCCGGCCTGACGA ..
GTAAGTGACGTCGTTGCG
10 CAC:OCC TACCTGAAGGTGACCCCGTAAGCACGGGGA GAGCTIGGCCATCi TACGCGCAACCUCCOCCiGCIGG AACICCTIGACCGACACG
CGTAAGCACGGGGAGGGT
77 AGACGGCGTCCCTGGAGTCTC.TGGCGGACOGAACA GTCGGCGAGCCCACACACAACCACACCC.CCCAGTAC
CGTGAGCACCTIT.TCGC TaGGCGGACCiGAACA
78 CCCA AGAACTGCCCC.COAAGOCACGAGCCACGCCAGAAC
CGITCTGCTGCACCGG ITCTCCCTG1TCAACC.CCA
10 .1 CG ICGTCGAACGGT1 TI ACG ICAGTIGCCACTGGG I ..
GTICGTCCTGCCCCACTGGACAGGGCCGAGAGAAC
79 C.CT C GITCGC.TGCCT1TTCCTCC
CAGTTGCCACTGGGTCCT
80 A GICiGTCCGTCTCGCTAAC.GGACICCGTCAAGCGCGT
CCGGGAAACGTCTOCG TGCITITGGCCATCTGCA
TACGTTACAGTGGa:CAG
TCGAACACCAGCTGCATG
GTGGICACGTC.CCCGA
ACACTCGTGTCCGCAGAG
10 GGACG AAACCAGGTGGGCCAGCAGAT caccTrccr .. TGCACIT TGCGTG I CTGGIGTGA T
ACGTCCCGCIT:A
84 6116 GGA CGTCTGCTCTTCiGTEGCT
GCAGATCACCGTCCTGTTG
as CTCGOCGAGCTATGGGCCTCGITGMCGGCACCAGC ATC CGTCGTCGGTGATGAGGA
CGTTGMCGGCAGCAGC
10 GA ATCIGGAGCTCGGGTCC.ACAACCITCCGTCCCCIG
87 GCCCGTTAACCCCCCACGTGC.CCGGGGCTITTCGT C TGAGGCGTCAGAAAGTGC
GCCCGGGGCTTTTCGT
10 ATCGCCTTGIGICITGTGC.IGGCCCAAGATTCGGCGC
ACACAGGCGGGAC.ACC
10 TCAACCCCGCCCTACACTACACC.AGACCCCCCGAAGC GGCGAACACGGGGCTGCATTCCCCCTCGCACATCCT
GTTGGACGTCACCGTATC
CAGACCCCCCGAAGCT
TCGCTGCTTCCTCGAGT
GCTGTTCGGIGGITGGG 'FCGGGGG1TTCCTCMG
10 C1TCCTAACOCAGACCC.CGGGGGGCGCGTCAGATAC GGCCACCTGACACAGAGGCGAGCGGCTCAAGATCT
GGGCGCGTCAGATACAGA
94 CCCACCCCCGAACCATGAACCCGTGGC.CGAGATCGT TGTGGGTGTGTAGGCGATGCTACGCGCGCCAAACC C
CCGTGGCCGAGATCGT
10 TC.AGAACGGGCCGGICGTCGGCCGATTCCITCATGC AAGTCTGCGGGGGAGCGGTGAGGCCGCSGTTGGT
GGCCGATTCCTTC.ATGCA
10 ATGGTCGCC.GTCATTATGGCCGCG/VaCiTGTMGG
ATAACCTCACCGAAACCG
TCCOCCAACACTGACGT
97 GC GG GAGCC.AGGGCCAAGGT
GTCTCCTAGTTGGCCCGC
CGGGGTTTCTGGGGCT
99 GCGCCTf3ATOGTGGAGAOGGCTGTACGTCGCTGGOG TTA CCGGGGGGCGCTTAAA Cf GTACGTCGCTGGCG
TGGAGCTGGCCCAGGA
GTCACGACGTACGAGACC
GlICACGAACGOCGCG
TGCATCGGCAACAACAAA
11 CTCGTGCGCTITCTGGAGCTAGCTC.CGGAAACTTGGT ACAGGGTGTTGCAATACGACCCATGCAAACAGCCT
AGCTCC.GGANICTTGGTAC
GGIGGGCGGCAGCATT
Os TAAGCfCCATCGCCIGGCGGACCCACGCCCACATCC 1 AC TGTCGGTGT7CCCCCAT
ACCCACGCCCACATCC
GGCGTTGTAGTGTGCCC
GC.AGACCGCGCCGM
GCAAGCAGCCCATAAACG
08 TCGACCGCCTGGCCAAACGCGAATCGCGGCCAGCA flG C
IXAATCGCGGCCAGCA
CGTGITGATGGCAGGGGT ACGAAGCCATACGCGC
11 GGAGGGGGAAGGAACGAAACACICTfC1GCGTGCCC CCCCGCGICAGACAAACCCTGAGTCTTCGGACCICG
TCITCTGCGTGMCGT
GACGICTGGGAACACAGG
11 ACAGGC: 6 TGCCCATTTGACGCTCIG C
ACCATCACGGACITICCCC
12 crcr TGGC CGCAC.TATC:CAGGACCGC T
CTICITGGCCTTGTGITCC
13 GAGGCCAACCIAGCMIAGGCTGMCfCCAIGGCAGA CC C
fGCGCTCCATGGCAGA
CGGCTCAGCTGGTGGGAGTCACCTTCGG teGGGGC GIGTACACCICCAGGGGG CAAACICGTGA
SCCTCCAG
IS ACCCCAACGCCATCCiCCTCACGTCGCCGAGCATCC TCAGCTTGCGGGCCTCGTTOCC.ATCGC.GTGGTGC
CACGTGGAGACGGCCATC ACGTCGCX.C.ACiCATCC
CIGAGAAGGGGCTGG TAC
16 ATI (3 CGATCAGAAAGCCM:C.ATT
'IGGATATGGCGTCGGAAG
11 GARIGCCGMTGCAACTMAAMXiCCi IGGAC:ATCC
18 TTCCC.CAACGGCAAGCMGCCGTCCAGAACCACT6 I GCTGGITGCGTTGGAGG
GCMTCCAGAACZACTG
CGACCGTCAGCGITTTG6 CCGACAGAAACC.CG1TGT
AGTACGTGGACCAGGCGGTGCCTCGCGGITGGTGA GCC.ICGCGGTTGGTG A
11 ACAAGGIAAMATAGGC(iGGGCRIAACAGCT(iCAAC CCGAGCCTCCCAG(i TGCAGAACCGAGGGCITCAAG GACACGGCTAAAATCCGG
TGAACAGCTGCAACGGG
22 CACCATCAAGGTCMCC.CCGTTCGGACGGACACOCG GCG GCCAGAAMTCGATGCC
TTCGGACGGACACMG
11 CGGTAAAACAGAGCGGGC.iCGITCGAGGCGGAGGIG CGTCIACGICCOAAGCGGGATGGCCGGGCAGAAGI
TTCGAGGCGGAGGIGG
11 CGCGAGCGGATCTGCTTTCGAGAGCCTCCTCAGCATC GATTCCCCAGAGCAGCCCCaTTGATGGCCIGCCTG
24 C Cr CAAGGCTCACGTGCGAG
AGAGCCTCCTCAGCATCC
ACTC.C.ATCTITGTGC:TGTG
GC cr TCATCTACGGGGACACGG C
11 GGTCCGGGIAAAACAACAGCCGAGLeifemCGTC
AGCATCCGGTMATGAGC
26 CCACTTACGGGGGCCAC_ATGTAGTGCAGGTGGGCGG CACACG C
TAGTGCAGGTGGGCGG
GGGCGGCACACCTATCA
CMGGCGACCTGGACA
TAATGTCGCGGATGCTGC
CCCTACIGGGGCCAATGGT
11 TC.GTGGICACC.GGIGCTCGGCATGCACGATACCGAC GATOCCCCCCGCGTTCCATGCAGGGCACATATGATC
AGAAAGGACAGCGACGA
GCATGCACGATACCGACC
11 GGTAGGC:CGCGCTACACGICCTACGITCTGGCCCIG
GGAGATAGCCCAGC.CCA
I
11 CGCTICITGGCCCTGGTGAGTICTATGCGC:IGGAGGT
GAGGCCCTCTTGC.ACGAACGGAACCTIACC.ACCCCG
33 GC GC , AGTGAGGGTCGCGTCG , TCTATGCGCTGGAGGTGC , 11 ACCCCACACICCAAACGCGGTGTGTATACGGACGCGC CCCAAATGGCCCTTTA &AC
TGIGTATACGGACGCGCX
35 T ' AGC AGGAACACACCCCCGTG
ACTCAGGACATCGGTGTGT
36 TAGCCCGATGCCCCCGTTGACAAGGCGACCCTGCG ... T CCCCAGGCCACCACAA
ACAAGGCGACCCTGCG
TGTTCAAAGACGCGGTGA
11 ATCGCCGAC.AGGTITCTGGAGTCCTIGTAGAACGCG ATCAGGGGCCGTGATATGCCGAGGACATCCGCGAC
CCAGAATTTGGCCAGGAC
TAGAACGCGGGI
AGAGGIGGGICTGGAGTC
CGCTTCTGGTTATGGGCG
40 C.CAT AGAG A
CCGGGGCATCCTTATCCAT
41 TGGGCGTGGCACTATCGGCTGACGAGGCMCAGCT GIG GAACCC.GACGTTCAGT
TGACGAGGCCGCAGCT
11 TTCCGGAA1TTATACCCGGGCCGGTCiTGTGATGATI7 1 GGGGACACGGGC:1ACCCTCATGTGCG11CGATGCG GGTGTGTGATGATITCGC.0 TCTGCGT
43 A 1 GIG , AGCCCCAC:GCGGIGAT
GCIAGGGTCAGCCGTTCA
44 TGGGCACGTACACCCCCCTCGTACAGGGGCTGGGT ' G ACTIGGCGGGGG TGGT
CGTACAGOGGCTGGGT
45 1: GA GCCATCGCCACGTCCT
CGCCGTCTAAGTGGAGCT
GAAAACCCCCAAACGCGT
GGACCGGACGGACCTT
47 . A ACTCGITGGCGCGCTGAATCACCACCATCCGCGIG
CTCGAGGICGCTCCIGT CGCAGAACGCCCICGA
TGCGCGGACAATTAGGC
49 AGAACGAC1GGCGCGCCACT a: rem GGGCCGCCA. A
CAGGCGCCGCATCTTG :FCC TGATGGGCCGCCA
ACGAIGCGGGGGGTGGCCTCCACAAAATOGGG
CGACAACIATCGGACTGCG
AGCCTGICGTGTCIGCG ITAAGCACGCICCGGGC
52 ACCACACGAGCACGAGGCCTCCCTGCAGC.ACCTCIC CCGTCTTCGGTGCCAGTCCTGTTGGTGCCGGTGGG
CATGCTCGCCS7COGT TC.CCTGCAGCACCTC.IC
11 TGIF TCIGCGTCCi IGAGTC:CCGCCTGCGTAGT TCRACi AGACCCIATGGTACACAC
53 , ATCGGCGTTGGTGGAGGGC.GTGCGTCTGGTGGTC.GT AGG GG
GTGCGTCTGGTGGTCGT .
11 GGACAGCAGCGGGGAC.TTGTICCICTCCGTGGGGG T
TAIGCTAATTGACCFCGGC
54 ATICCAGGTCGTCGCGGCGTGGACCICTCC.GAC.AGC CT C
TGGACCICICCGACAGC
SS TTTCGGCCTGCCAGG TGGCCIGGCr.CCGGACATA A
CGGCCATGCACACCAGC.AGGCGCGGAC:CAGGTAA CGACCC:CCCTCACCAA CGGGCCCCGGACATAA
GACCGTAGGACI GC
56 TM. ACCGAACAGCCOTCCGCGCGCCCGACITITTGC TG CC:
CGCMCCCGACITTITGC
11 TCTGGACACCCCC.ACGGACCA TTGGCACCCIACAACA
ATAACGACAAACGGCCCCTCGTTGCTGATCCCCCGC
ATTGGCACCiGACAACAGG
11 TGGGGAGIAGGGCCCGTOCATGGAIGCGCCCCAAA TCCAAOCCAGCC.AAGTIAACGGCAAAATCCGCCGG
GGATGOGCCCCAAAGC
ICGTICAACAAAGATTGGGGAGAAGCA
59 C GGTGTC 1TC7CCC.CCCCCCCTT
CTICACa:CCAGTACCC7C
ITCGACCGAGTCTGGGGA
60 AAACCGC.CCCCC.AAGCCTAGGATGAAGCC.CCCCG GTC: C
AGGATGAAGCCCCCCCI
MICA
61 GAGG TTCTTGCGGACCACGGCCCGCGTGTATGGGC.ATGCC
GGGGGCTAAAGGGIGGI GG
11 TCCTCCGC.AAACAGGCCCGA TCGTGCGCACTAGGTC
62 CAGCCCCTTGGAGAGCACCCGGTGCAGCAGTCGGA . C GGGGCTGGGITGGTCT
CGGTGCAGCAGTCGGA
TGGCCGGACGAACGAC
GGGGAGGGGAGGGTGAT
TACGGGGGGGTAGGTCA
11 CCCCGGAGACCCCCAAACCTTGACilCAGGCGCTCG
GCCGTCCCGGGTGITT
CCCCCCGGTATACGACGA
TCGGAGGGGTGTGICTIF
TCTGGTCCTCCCCAAGTAC
GC.T.GGTCTGGTGATCTTC
69 TCGC.ACGGGCOCCTITTGGACTGCCGTCCACAACGC AMAC.TC Ci ACTGCCGTCCACAACGC
70 C T CCAACTCCAGACC.ACCGG
TCGTCCCTC.GCATGAAGC
71 CGCGCATGCTTCATGGGTCCCGGGGCGGTCATTGGA TG , f. r i i r e GGGGTGTGGCGG
, CGGGGCGGTCATTGGA , 11 GGGTGEICGC3CAAGAAC.AGCGCGCAGTCTGEICATCT 1 GGGACCTGCGGCCAACACACTGGGGTGAGGGGAC
GCGCAGTCIGGCATCTG
11 GIGTTGTTGGGTGOCCTa:GCCCCCCAAACCATGICC C.GGGCTAACCAGGAAATCCGTGTCACACGGCCGGG
TGGTATAAATCACCGGTG
CCCCCCAAACCATGTCCG
GAGGTCCCCCACAAAGC
CCAGCCTGGTTGTCCGT
11 C.C.CCAGCCTGT1TGTCCTGGGAGCCGTTGTACGCCA
GCAACGCGGGACTATGC
GCGGCCGTGGTFAACC
TCAGCGCGATCCGACA
11 GCTCCGCTAAAAGACCGC.ATCGGTGATGGGGGGGA
GATCGCCIGTCFCCTCGT
11 TGITCCCAATTTGTAACATCAAGCTATCAGAAGATAA TTCAATTCAGAC.AGGGAATCAACACTGATITACCCA
AAAGCAGGAGATTAAAAT ATCAGAAGATAATAACCAT
CTCCCTMCCATATAACTC
80 CT 1 11611t A
AGGITIGAGICTGTTGCT
GCTGGCGAAATCACATGTGTCCAAATITTGATTGAA GCAGGCCTCATATAAGAT GAAAAAGGGAAAGTAGIT
83 1 AGTTAAATCAG 1 AGATACCX: , CT T AAATCAG
82 TTFAAATACGG ' CFGTC CFAACGGGGCATAIGGAG ACGG
CGGACCTGCATGACTACT
84 ATFIAAACCCT MC II CCMITACCGCTG1TACC a:CT
85 . CIG GICG TCTCTTFAC.GCGGACFCCC CCM
TGIGCCTICTCAT CTG
ITAGACTCTCCTGAGCATT
86 ATTG ocrGAAcr r TCAGCTCiGTATCGGGAA. G
CACACICTATGGAAGGCG GAAACAACACATAGCGCCT
87 GCCIC GCiATTG G C
GIGGACCATATGGCCATAATTAAGATCCTAAGTGAC CAGGICAATTATATICAGT AATAAAAGAACTACGGAA
88 GGAACCFCi GGGTTCTT A ICiGAA CCIG
TTACCCAAAAGATGITIT AACCT GGAGTAAAATGAGTGATG GATCAGATCGAGTGATGG
89 'MGT ITCGACITi GT CT G I
11 AGATGC TAGIGGATCTGCTGATCIAATTAFTGCGGCC A I TGGAGAIGTGCCACACiCACTGFC
TAAGAA IGICC AA TGACGATGT TCiAC:CAA
ATGACICIAGITCACAAIGGIGGGATCTICIGGITOCT TGGTGGGTTI ACA T !TAW,.
91 . CITCAA TTICTGA AGAAC
AAGCGGGTCATCACITCAA .
11 CI ITTCACTGACUCCICAGGAGi GATCGGT11 TIGAG AT ICA ICGTCGATGAIGTGGGAA
IGATCCATIGATA 'IGATCCGTITTIGAGAGIT
93 AGM: GGTATTGAC GAGGGTGGIGGITAGCAT C
11 CAACiTACG1CTCTCATITGTFGGAACC:AAGGCCATTA GGACATTIGACACCACCCAG
TGCTFTGCITEGGIGG TGAACCATITCAA TCfliA
AAGCC
11 GAAACCCTCTC-AAGACCMCGGAAAAGATGCC.GGCAC AGGTAAGGAAGACAGAAGATACGGC I II i bCAAGG CCACJAAAAGACTAACAA
GAAAAGATGCCGGCACTT
11 CACACAAAT CACOACGACAGAAATATTI TGA1GT ;AC
GAAAGCGICIGAGMAGTATGGAAGTGCAACACA GAACCICTIGCTTCCAGTT TAITTf GATGTUACTOCTGA
11 CGCGATAATATCANTCTICTCC:TCAAAGAGTTAAATG
TOGGTTCTAAATITCAACTGGIGCGCAICTTAGAA AGTAMTACTCAAACT CC AAGAGTFAAAIGTGAGGT
97 TGAGGTGTC GCATTCGC AlTGC. GTC
11 GCAACAAC:AACGACACGAATTACTAGGAGCATCCAG ACTGAAACTGaATTAGAGCATITFTGGGGCAAAG
GAGTTIGTCAC:CAGATTCA AGGAGCATC:CAGTATATAA
GTGAIGAGGTGGATCAGGGGATITGICACAATCCT ACTITGTAACIGAAGCAG GI
GGAIGATATACHATGTA
99 CAATGTA1TGATG CC(X.I AC TTGATG
00 GACAGTGG . AGTTCTTTCCA GACGC G
12 TGAGGCCITCTTITCTCCCATAATTGGACTCCCJWµTA CCCITTACCTGTAGCAGCAGTt. t I 1 t i IAATGACTGCA TCTIGAATCTAATTGAGCA TGGACTCCCAAATAAATTA
TGTAATAAACAAAGTAGA
CGACTCTTGCTGATCGATT
AGAAGCTGGAATTCCTGA CAACATTAATTGAATCAGG
CAATTTGATTTTGAAAATC
CITTAGCACTGACGTCACT
CIGACGCTGC
CAGAAIGGIAAACTGTAT
CAACIACC.AATCGCCAGAG
ATCCTCTGGCICAAACAGATGAAGCTGTGITIGTIT TCTITAACATGGATAGAG
TTAAGAGGTAITGAAATAG
12 CACCAAAGCCAATGICACACATATTCCACCTGTGCAA CTTGC.AATTTCAGAACATITCAGCAGAAAATCTGGA
TCCACCTGTGCAATTAATT
09 TTAATTAATTC TATTTACAGGTGC , ACCTCAGGCTGGAGAATG , AATTC .
ATCAATITCCACTAGGTAGACGGTGCTCTITTCTAAC TGAGGATCAATATAGATT
TGCTGC ACGTFTGT ITTGCAG
CTAAGTGTCCTACTGCTGC
12 AGTACATTC.AGAGACCATTCACAGTATTGTGATAAAA 1 TTTGTCTGAATAAATTGCTGATGCGTGMAAATATT CCTC-AAAATAAGCAATGT
ATTGTGATAAAATACTGTG
11 TACTGTGAATCCC ' TGCCAGCCC ATCCAA AATCCC
CTGACAAGCACGAATCTG
GAGGGTCGATACTGCCAA
12 GCTTAGATGCTCTCTCAGGCGAC.AGATCTCAGAGCTA TAGTGGAGGIGTTGAAGATGAAGCTGTCTTI1TGCA
AGAAAGTTAAATGTTACT ACAGATCTCAGAGCTAAGT
GAGCGACGACGACGAGA CCGAGAGAATCTGGAACC
IS AACCAC CGTGICT A AC
12 ATCCCCCAGACCc1TTTCCATTTGGAAG1TTGGTATAT TTGGATATAGACCATTAGGATCCGCGTCTAGTCACT
TGAAAATTCTACTATTGCA TTTGGAAGTTTGGTATATT
GGGCCTGATATATCTTTTA
12 TCTCTTCAAATATGITGTCCCCAATGIGATACTAATGT 1 TTAAAMC.AGTMCCGCGGGATGAGCTGGIGTTC
i ITTGAGITTGAAAATCCCG
19 AFCCC:CiC I GAGA , CCTICTCGAITGGTGCAG C
AAGTCAGMTCCTTGGGTCGAGCTCGTITACCATIT CAAATCCATGGCTACTAA
CCAGAAG ' AACAAAC ATGC
CCTACTGCTGAACCAGAAG
12 C.AGGAGTFTATAGGTCAGMCCCATGAATATTTTACC GGTCAGTCATFIGGTCAGCAGATCTCACMAIGGAC
ACAAACTGTGATCAATAA TGAATATTITACCGTGGIG
TGCAAGGGATGTTTFACA AGGAGGAGACATAGAATC
CCATACTIGGICAAGGCA
23 . T AGAACC GC;
CFGAGGCTGCAGGGACIT
CaGGICAITCGGTGTTCA TCIGTTAGGACCCTTCTCG
CCMAGACAGGTAGAAG
2$ crrci ceicrorr ATM T
ecGAGAAAcrcceacrre AGIGGCAGAAACFGGAAC ACAAGTAGACCAACAGCA
26 CA.CC: CI CCA IC CC
AGGGCITITTGGTCCCIG C:AAGAGGGTCTGACATAG
27 AC:ATAGC 1 ITCIATGICI TTACCAGCAGGACAGCTC C
12 CGAG TGTCCCTCCTFTCCACACiGGACCAAGCCC TATC: i GTGCCGAGCACCTAGAAGACAGITIGCCATGAIGT GCTGAACiCCCIAAAAGACi 28 (:.CA GTCCTGG GC
GGACCAAGCZCTATCCrA
ITACCAGTTGCCAAGATAGAGCC:GCCTIGGFCCATC AGIACTAAATAAAAGAGT ACCATAAT
GACAGGAGAT
29 , GAGATACC 1 TTTCC AAGAGCC ACC
.
AGAIGTAGGGGAIGCCIATITITCCCTGC.TAGGGTA f3AGGATGCTGATAGATTI AAATAAGG
ICACTCAAGA I.
AAGATTTCAC AATGCAGTA TAGAG AA ITCAC
12 CCC I AGGCCATTI AGAAG ITC( TIGACAGGACAGGTI TCI AC ICCAGATGAGAAGT
12 Gri Ca GGCICAAGATAAT TTIGTICAGAAGAAGF GC
32 AGTGGA CC:766767A ACTCA
C.AGAAGAAGTCiCAGTGGA
12 GCCTGGTATAGGATC.TCC.TACTAGGGAGTGGGACTT GCAGAGACCTTCTACACAGATGGCATATCCTGCTTT
AACTACTGGCAAGTGACA GAGTGGGACTTTGTATCFA
IICCIGGAAAAGATAGAGCCCGGGG1TAGlICITT TAGAAGAAAIGATAAAGA
AGIT:TATG1TGCATGGGTC
12 'MCl/ TTA TIGGCCACCTACTGGGAAGCAGAAGT CA I
GCACACAGACAATGGCC:CCAACCIACCCACCA TGCC GCAGTACA KT TC3CAAGT
GAAGCAGAAGTCATCCCAC
CCCACA AC GC. A
CCAGACTGIGCAGACATCCICITCACCTGCCGTAAA GGAAAAAGT CATCIAGAA GAACCTAACACCAGAAAA
CCATAGAAGCCTTAAACAG
38 AC.AGGG . ATACTFTGTGTAG TTCGAGTGGCTAGAGAGG GG
GGTAATACiAAACTCTGAG AGGAAGAAGCCTTAAGAC
39 TTAAGACAT 'IT GGA AT
ATATCATATGAGCGAAAG
CCA AAGC.ACTTG GGC
AGACGAAGAAGGACTCCA
12 GCTTGACACATGGITITATTGATCITTGCAATAATAC.A ACACCTCTATGIGTGGCMTGAATTGAGGITGTGGA
TGGAATAATACAGTGACA
GATAAGAMAGGAGGTAT C.ATGGTATTFAGAAGATGT
42 AGATGTGGT TAGGGCAAAGC AAGGAC.A GGT
CAAGTATTATAATCTCACA GTAAGAGACCAGGAAATA
44 Ci ACAAGAA TGTCAATG C GAA
GACTCTITATCGCCTGACTGGTGATATTTGCCIC GTOGGGGAACGAAAAAC
AAGAGTC.ACTGCTATCGAG
AGACTTAAGGCGGCCCAACTGCCTCGCGAATGCTT ACTA TCCA AGAACTCCCCG
47 CAGCCC CAATTTATAGGTC , ACAGCAGTATCAGCAGGG
AGAGTACATGAACAGCCC
GGATTGCCTG7TATTCACTA AAC.TGCITAGTACACCCA AACAAGCAGACATGATGA
48 GATGA i TATGGTATCC G TGA
AAAIGTACCCGCTTC.ITGC
GAGAGGCTGGCAGATCGA
12 TCCCAGTGC.AGATICGATCTGAAGGC.AATAATTGTAC
AACATCGTCAAACTCACCTCATGATCAC.ACCAGT CA CGTTGC.ATTITCTAATATC
CAAGGCAATAATTGIACTA
SO TACTCAT CATTGA CAC CICAT
12 GCAGGTTTGACTTCATGGAGTATTGCCTAGFCAGAC ¨ ACATCTGGATGCMCCTATAATGCCICTGAGAAGA
51 C.AAAATT3C TTAGGTAGTIGTC AC
CCTAGGCAGACCAAAATGC
ATGGGAACGGCTICITCA
CCAAATCAAACAGMGAC
MAGGCCATAGGAAATTG CCAATATGGGTGAAAACAC
AGCTCACAAATAGAGCTTGCAGCTCATCTTCACT TTAAGTGAGCTAGAAGTA
SS TCaAGCT GITTATTATCCC AAAAACC
CGGAGCAATGAATGAGCT
12 GC,ATTAAAGGTGCCAGCAGCTGCTITGAAACCAAAC 1 TGCTGCATCTITAMTGATGATGGCCTAGAAGCAGC
CTGCTGTAGAAATAGGGA
56 ACA t AGIT:CAGY A TGG TGCTT
TGAAACCAAACACA
i TATGAGACCCA CTITGTCCAGAMGCCTIA :FGGAGATCCAAACAATATG
57 C:AAT ATCTGA. 1 1 TACAACT G , AA GA U
CGGAAGCCATGGAGGTTGCTCCCAATTGTCCTCATT GACAGATGGCAACTACCA TGAGAACAGAATGGTGCT
58 CTGG ' Gccr CC GG
12 C.ACAATATCAAGTGCAGTATCCCAAGCAGCGATTCAA
TCTTGATCGTCTTTTCTTCAAATGCGAAGGCCCTCTT
GC.AGCGATTCAAGTGATC
GGCCATTITCATCATGGGAGATCTCCTCCATAACA GAGTAGACTCCCTTTGTIC
ATTTGATGAAAGTACAGA CTAACGTTTCCCAATTAAT
61 . AATTAATAGACG TCGGACTAC.A !TOM ACTACG
AAAATIACTGATGCTGAAT
GAGGAGGTTACAGGACCATATCT GGCA TAAAGCCCTTCCATATCCT TGGGATTGGATATACTATC
63 C:TATCAGG GCATCIC AAC ACiCi CTGTAAATGTTCCTAAAG TGACAGITTATAATGTATA
GCACTGGCTATAGACCTA GAGGTACAACAAGGCCTA
65 COAT I AGATGGATC:AGG TAG 1:
TCTICAACTACACATCCIGACCCITTCACCTTGATAA GGGTAATCCTACAGITAT AMA
TACTGCAGCIATAIT
12 CGTICAP.CTAMGGOTGGGAGTIGGCTGAATTIGAA Mal- T GAACACi G
TTCCTACACGAA T ATI CMACTG CCTITAACAACGITITCTCT sUGGCTGAA.TFTGAAA TAM
67 . ATAATC1 GA CArAGCG AM GGA
=
CICGTGCATCTATAACIACCAGAAGGACTCAATAGC CO GATA GTG AITTCAGA All ACATACTGCCTAC MT
12 ACATCTGAGC:GITT GCGMCATCAGATGACTA MT C
TITIGC:AGATCiACTGICTCiGATGAAG TACTCTGGC I CCTAGC:ACAGACAATACA CATCAGA
IGACTA ITT Ta GGCCAACACCTACi TAGGACATCCA rrmrcc GAACACAGICTOTAGI ACT
70 GTAGTAGTACC AGGATTTCZ Tarr ACC
12 GTGCAACC.TACTATMACACCIGATGATGAAAATCTAT
TCCTC:CATCiGGCTTGAATATTCTGCCTCiCiCTTCAAGGG TACT AAAATC.CCACTCTTTA
GATGAAAATGATCACAGAT
1l7GGGCGTAAGGAGCAGCATCCCCAACAMCCT A TAACAGAACAMMGC
GTACCCCCACCACAACi AAGACAAA I A TGAT A ITGAAATI GTAATCi ITIGCAAAGTMTCIGGA
AACiCGCAAAAGGICIAAGTAATTTCGCATAC:AGTAA rr-rac:c:r CAGAGC11TCFC
CGCCGCFTICTATAICAAA
74 ATCAAAC ACATTTC.ACAGTA A C
12 CiACAAACAGCTITCGCAACAGAAACAGCTGAGTCIG TTAACGGCC.AAATA TGAATCAGAGAAA I
AGCCCAT AAGAATTAGCATCCITTIT
AAACAGCTGAGTCTGGTT
TGGAAGATCTAGTGATGC
76 AA . CACAGTCT C
AGCCCATCTATTGTGTGAA
TTGGAAGCAATAAGAATTT
CTGGGAAATCATATATTTA
12 AGICAITAATTGCTCTTGTAGTGC.MCCCGAAGATM AATIGGGACCCACCGATTIATCCAGCATAATACTCC
CTGC.AAAAGCAITTAGAT ACCCGAAGATAATGAATCA
ACAACCGCAGACGACAAGCCCTIGTCCTCCiTCGTCG CTCCCTCTCITTACCACICIC
GTC:ICTATCCGACICCCC.T
GAAAAATTGTAATTATGT TAAACGAATAAAGCGAGC
12 GCATCTGGAGTGATTGGTAGTACTTCCCCAAGACCTr AGAAGAAGGAATTCCAGATGTIGGGIGITTC1GCTC
ACAAACACCAAATACAGG CCCCAAGACCITCTATTCC
82 CIATICCA. CTAAACCAG AAC A
TTAGAGAGGAGTITGATA
AGGGCCaTTMCGAGTA
TTITTC.TCAAACTCATITG TATAGGAGGGITCAGGGA
12 GCAAACTGGITAGGATCTGGTAATTAATAATGTCCAA TCCTGAAAGAGAACGTTTAGTTTGGTAC.CGACTCCA
12 CCCGCCGTCCATAAAAAAATAATTTATTAATTCrAcAT
ATGCCAGACATTICTITTGTAAGGAGCTC.ATCTGGA ATTAATTCTACATCTAMT
12 CCAGCTGC.CAGTTTIGTAATATTGACCMAGAGGCT TCCTCCTCAAGGATTGGAGGATACCITTAGGACACA
AGTAGAAATAACAATGC.A
CaT1AGAGGC1GATG7GT
12 GGICGGGIATCAA.ATACAGCAGMACATATCAGTC.G AACAACCGGTICIGGTCGTFAAGACACAGGCACCG
TATTGACTCATGAGTTIGT TTTACATATCAGTCGTTCA
12 CCTGTCTTCCTGATGTGTACTTCTTC.ACTCGCGAGATA
AGAAGAATC.CCGCTCTTAGAATGATTTGTCAGaGT AACiAACTAAGMATITGA
CACTCGCGAGATACTGAC
12 GGTCCATTTCTGTTCCACCACGCTGGATCAGACAGGG AGTACAGTCCATTATCCAAAGGThiliLLGTGTTITA
TaTTGGAGCAAAACAAA
GCTGGATCAGACAGGGTA
CAGATGTAGACACC.AGGA GGGAGGTAAGMATGATG
12 GTCCAAC.CATTGTGAATTCCTCATAAAGAGAGGAAG GAAGAGCAACAGCCATTCTAAGACCCTTCCGCTGAC
GAGGGTTCACTTTCAAAA AAGAGAGGAAGAAGTGCT
TGAGAC.AC1FTCAAAAGGATGCAAC.ATTGICCATG6 CCGAGGTGATTTGAATITC
i CAGGGAATGGAAAAGITGACGATTCTGGCCCGTTA
95 GTCCGA 1 ATCTCC , GIGGTCGTGAGCAFTGAC CGTTICTIAACCiGICCGA
GCCAGAAGCCAATATAGT
96 GTGG ' CGGTGG GTACC GG
CAACAAGACAACTAAGAG TTACAATACTTGGGAAGGA
AMTCCTGGTATTCGCTCT GCGATCATTCCAACAAATG
98 ATGC CCiTFCCACTG G C
12 TaTCATTCACGAATTTCCCAGGATCCGATTTAATTAT TGAGGCAAATTCTC.AGAGAATCAGGCCATTAGTTCT
GTGACCAATTCCTAGAATT CCGAMAATTATTGAGAG
99 . IGAGAGGCG TATICCACIGTA TTCAG CiCG
13 CCAGITTGITTCC.ACTCCCATATGGGATCCATCATTCC ATCAACAATC-GGGATa:AICATFCCGTAI
13 ITGTCCaCCACTATGATAGCATCCAGAGIGGAGTAC CTIGCCATTICAGAACATAGATAGOCTCCTITGCTTA
TCCACiAGTGGAGTACAGG
AGATTCCAAAAGGAAGAG
TFIGGIGCIATAGCGGGT
CAGAAGTGTGGTCATAC.A CTGAACTCTIGGIAGCAAT
03 CAME; AMGCAACCAGT ATG G
13 ACITGC:CAVCTGCCTrAACAAGGGT1TAACCGAAAA
ACTTaGCCTCATCACAGCCCiCAGATFTG TAAAT1CC GAATIATAAAAAAGTACCi GaTFAACOGAAAACGGT
13 GaTITGACATAIAGCACAGCGP.ATAITTIGGCATAC TATCGCCAAGTGAGAAAGATC:ATCGFCCACCIATIT
GACACTACGACAGATCAG ATTITGGCATACAGFAGAA
OS . AGTAGAACAA AMATGAATCTGA C CAA
.
13 TT CTG1AACGITGT ATG TCOGTITAATTTGACACat GTGIGGIIGTACACAGIGTGGATGCAGGIGTIGTAT GGTIGACCTGTAITGTF AT AATFTGACACCTCAGATGA
13 1 T FCCi GATMCCTGCACCICACGGGAGT ATGGGTAA
AGGGIGGIGGAATIGCTTAAGIGACCAAATGAGSA CAGGAACAGGTAGAAAA.T
07 CC TCCAAACAG GC.
ACGGGAGTATGGGTAACG
ACCACAGTGICIAPAGGAATGTGGCTaATIAACAG GCAGAGGCGTIATATACA CA TACAATGGTIAACCIG1 08 CC:TGTACC ITGATTAGCA CA ACC
13 CAGCACGTTGGGCATATTC.AGCATATGATAATGATTI
ACAC.AGATAGTAATGCAGCTGCTCITCTACACATAAT TTAACAGAAATGGTGCAA GC.ATATGATAATGATITAA
13 ACCiCATCFMACGITIGGCCGAAMCAAGAATCAGA AAC.ATGIGCAGCACTGGAAGFCCATTIGACGAGCCT
AAGGTIGIGGICCAGOT GAATTICAAGAATCAGAG
GGACG TAMT A GACG
13 GCCiTCCCTGIT11T1TAAACACITTCCATGGACATIGC ACIGIGGAGGITGTATTCGATGGCG
FATATAAATGT AGICCACATTIGGTACAG CCATGCiACAITGCAAGAG
11 AAGAGA CCATGaGTA AA A
13 GG11.13GGGIGGGGICTITGTAICACCTACACCX:ACA CCCCOCCGCGAAAACGATACTIGTGACGCCGTAGT
CCACCGACCTATACAAC:AC CACGTACACCCACAACCAT
TGIGITGIGTGIGTGGATfAGTTAGOAGCAAGACA AIGTACAAACATATATAG
"FTACIGIGAGTMACTAll 13 CACTATTGTG GGAGG GGGAAC.A GIG
13 aAAAAACCACAGCAGCAGCCriGTACATACGTGATA TIAAACTACTGCAGACATACGTOTGTACAAACAATA
14 ATATGGTAC . CAAAAACCCAAC TGTGTGGTGTGCATGGTA TGGTAC
13 TGCAGAGGACGTAATATTAAACCCAATTGATrCTAGC GGACATTTCTCCTCCCAC:CAATACGTAAAGTAAAAG
TATTGGAAGAGTCTGCAG ATTGATECTAGCATACC-AG
ATACCAGTGC GATCACTGTA TT TGC
13 TGATGGCCTAGTGATTGGGCTGGGCTCGTATTGGGT AGGCTATAGAATTGCAACaTTAGGTCATTAATTGC
GGGCTCGTATTGGGTCTA
13 GAGGAGGCAGATAAACCTIGITGTACCATATTMTT CTCCTCATTCC.AAGGIGGTCAACTGCTGGCATGATA
TACCATATTTTTFTGCAGAT
13 ACCATCCTGAATAACACTGITITGTCTGCTTGCTC.TGC
TATCGCAGATGTAGGGITTGGAGCACCTCAGCTITT
13 CaCTAGTATGICAGGATTCATGGTTGCAAAATTACA ATTGGAATGTIGGTGTAGCTCCATATGGCCTGAGAC
ATGCAGAGGAATATGATA TGCAAAATTACATTATCTG
20 'FAGTACATGC ATGC1IGTGC TTGT 'FGC
ATIACAGGAAAAGCATIT
riTGIGTTACCTGAATGGGT
AAATGITGIAGGGAAIGTGATGGFCCOGATACTFA CITSCGTICTIAAACAGTA A
ACAGGTCAAGTITGTAAAA
22 CiTAMAGAC ACTA AATTAGCAA TTC GAC
13 CCTCCGACCGTCTCTITAGTCTOGAATCCCAGAGCA GATTCCTCAAGAAGCGGCAGGATCGTC.GTCG1TGTC
1TCCTCCCTCTGITGOTAG CTGAATa:CAGAGCMCG
23 ACIGG CTCTT , C G
13 AGACCTGCTGCTGGTACTGACTGC.TGACATTACTGC.r CA AGAGTTGAGCGACTTAA
ACICAGCTCCAACATITC CTGAAGGACGTGGAGGA
CTGCTGACATTACTGCTGA
TGTAGGTGAGCATCCCAATGTAGCTGCACATCGATT GGTIAATTGACACTCC.TG
25 ATGTTG ' TGAG GT
GCCCAGGICTTGATUTTG
GGAGATATGTGCGATATA ACTITIC.ACICA111TCAGA
26 TTC-AGAAAGT CTCC.ATCT GGTC AAGT
GAC.AATGCTACACCTCCA
13 GGGCTAGCACATCAGC.ITCCITTGAAGTGGAGCTTGT ATTAATGCAATGGACTCTCGCACCTGAGGAGGTGA
AlTATATGAGACATACCG
MGAAGTGGAGCTIGTGA
TACAAGGATTTAACATM ATMCGAGATAAATTIAC
29 AATTTACCICC GACGCTT GGAAGG CfCC
TTACCCTCCTATTATTTGC.CCACAAACGTTTGTGTGC TGTACTGTGAAAATAAGC AACATATGGTGCTATGTAT
30 GTATCCTG GATGTT TGTG ocm AAGCTGGC.AAGAACAACACCCAGAAGATGACTCAA GCTTTCTAGTITA AACC.AC
TTGAACTTAGAC.CGAATIT
13 GC.CGTAGTGC.1. GGCTAGTACC.AGGCAGATGGCGACT I
ATGGCTGGATCAAGTGAGCAGGCCATTTGCCTAGC
GGCAGATGGCGACTACCA
i 13 CCACAATATCAAG %CAA TATCCCAGCAAATGCAGA 1 TaTCAMTGCATITATCGICGCTAGACTCAGGCACF ACCAGAAACGGATGGGA GCAAATGCAGAGATTCAA
33 GAITCAAGT I CCTFC , G SI
ACCTAACAACAAATGGAAGTAATGCGTTTTAGATCT GCTCTR. I i I III TATCTCA
TGGATAATGAGAAACAAC
34 AACATGC ' TTCTCGATCATGT TTCAC ATGC
ATCAGGAAACTATGTTGC TCAACAAMAAAACCM
ATATGTTCCAGCTAGGACAAGCACCAAGTTCATCTT CTGTATCCTCAGAGATCCC CA
ACTATCCTGCCATATGG
13 TaTCCCAAGAATCCATGATTTGATMAGAGTTTGCT ATCAACTAATATCTCCTCGGCCCTAGGTCTICTIGGG
ATCCTATTATACCAGAATA TTTAGAGTTTGCFTGCAAC
37 . 1 GCAACT CCM TAGSGT 1 38 GGACC GIG 'ICI TGAT CAG CC
13 GISITCCIGATGATICTG laGTATGAAAATCTCAAA AATCCTCATCATGGAA1 CICACCAAGTTGIGTITGIC CCACA TCAAAAGGGCAGA GAAAATCTCAAAAACAACA
TCCAATGAGAGAGTATCC GAAAAGATCAAGAAAACT
40 GAAAACTAG 0 17 TG MAT'S-TAIGA ATGA AGGTTFG
GAACAGAGAAAAGCAGTACCCCATCTAAATACCGA TCCAAAATGAGTATAACT ATACACA/TCCCAGAATCA
41 CiAAICATCA 1 GAICCAIGIT AACTCFG "FCA
ACAAAGMCICTIGaCaCCGIACAAlICACGAA AAAGAGA1GGTIGTGIA1 ATAAAACCAGAAC TAT ACC
42 CCC.ATG GATTACTC ACAGT CATG
13 CraCAACTC:CTITTGCGATAATCAAGAGIGGAIGCA 1 AATCICTACFTCAA ICTGGACCICACI
43 . ATT11CC 1 GATTATCCC.1TA CT CC
.
CAGGAMCCCCCCICCAAAG TAATT1TGAGCCTCATAFX: TACiAAGACFCTAACTCTIG
13 'FCTGAGTGlICAGC.AGCC I AGTGA ITACICAATCACC AGAAIGG
rATATTCCICTICCCAGCACATCTGCTCCA TTUACAGAATCAAI AAAG 'FGATTACTCAA TCACCOCC
13 Cta: ITC:ATTACK; IGGITCiATIGATTGC:AAACTGTA ? AG I ACANT
AGGTAICAACGGAAIGCTGIGTAGAA r TrGTCAAS GGAGCi AG MG TaAAACTGTA I AACAACC
AAGCATACCAATC.ACGGAAACICIGACICTTGTTGTTAT AAAAACTTAGGAGTAAAG
AATCCAACTCTACTCATAC
TAACACATGAIGIAGGIAT
'FATGCFCCACTAIACCCAT
13 TCiACIGGG1TCACTCTCGA1TC:ACAGGGAGCNITG 1 G
ACTTACTCAACAGC:AACTGAAAGACCAGCTGAGAG TGAIGCGTATCCACTAAM
CAC.AGGGAGCAlTGTGTC
13 GC:AGAGT;TGAACCGTAAGCATITCiCCTCCTGIGACA AAACGCF GTIGATIATF
ATTACC
GACGAAACAAGATCAGCATOTCTGAATAACCAT TGAAACGATTATAAGTAG TAGCAAAAGAATCTATTAT
52 CTATTATGATGGG . TCTCAATTGCTT CAAGA GATGGG
13 GCAGGTATTAAAGAGGCATATTGCAATCAATCCAACT CAGTGTTGGaIGATTCAATTACAATGCACICAACTG
CAACTATATATTGCCCITG ATFAATCCAACTATAACAC
13 CCTTCCTCCTGATAAATGAATCCAC.ATCAGATATGA A
AAGIGGAC1TGAAACACCTGACCITCTGATCaGTT A ACACTAAGT AGGAC.1TT
ATCAGATATGAAGATATGT
CCGCAATAAGAATAGCAA ATGGGCATTIGGTAATGAT
AITTGGGTTGCAACAGTTMGAAATCTCGTGACGCs A ATCTACAGATAGAAAAGT
GACTITGTGTCTACITITCT
CCGTAGCAAGTGATTCTG AATTTCTIGC.AAATGGCTGT
ST GTT TATGT A T
GATGGCAAACCTGTTATG TTATIGGCAGITTGTTGACC
58 Ci3TGACC.A ATICAATCTCT I* AGT A
TGCTCTIGACAAACTITAC TaCCTGTTAIGACAAATG
TGAAGGIGTCICTGITTG AGAGIGATTGCFCTICAGA
13 ACAGC.ACAGTCAAAGACAGITTGATGCCTGAGAAGT TGAAACTTGTACTATTGCCGGTAAAAGCGCATTATC
GTGGTACTATACAAATTCT TGCCTGAGAAGTTTTTGAA
61 TTTTGAATG AAGCAAA , CGC , TG
.
13 AGCAGTGCTACGTTCAACACAAAGTTAAGTATGCAAC ACTTACAATCGCCAACAArfATTCCACATCGCCAATT
MAGTTAAGTATGCAACACT
13 ICCICAAACCAAGACTCGITAC.TACTTIATGTGAGA G 1 TGATTACAGGGCTTTTATCAGTGTTTTAGACTCTGCA AAITGTCAAACCTAATATC ACM AMTGAGAGACTAT
63 ACTATGTTG ' GCTTTCAC AGCC GTTG
13 AGMTCTAAMACGCATTATAGGCCGAACATGTGCA GGITTCTAC.IGTTAAAATTGGTGGCACAATGTAAGG
GAACATGTGCAAAGCGTT
13 GTCAAAATGTGTAGGGTCTGCAATTCAGGATTAAGTC AGATGCAGAGTTTGTZTaCCAGTTC-ITATAAAAGA ATGGATTGAGT. I I I I MCA CAGGATTAAGTCTTTTAGT
CCITTTGAMTGCCATTTG TACTCAAACAATTGGATAA
ACTGGITTTATGGTTATGG AMTGTAGTTGTACCAGTG
GTATTAAGGATGCAGATT AAGTTGAAATATCTGTTAC
69 Cf GTTACACCA ACAACAGTA ATAATGC ACCA
AACC.AGTGTCCGTAATTAAAGTrGCAC.ACACAACAA ATACAGACAC.AGAGGITT AAGGT1A
AGGATITTGTGT
i ATACCAGCTGACTACATCTIGCCCCTACAMAGAAA CCTICTGGTAATCTIATTA
GITCIGITGITGICTACAT
72 ACATGTG 1 CAT TACAAGTCA , AGT TM GIG
13 CGACCCAATC.AACATCTATTGCITTTAAGGACAAAGA 1 AGATGCTGTTACTTTTGCMCAGTCCATTAACAACA MOTC.ATTTGAACAACAA TAAGGACAAAGAMITTCT
72 TCITTCTGGT ' GCACTTTCA GTTG GGT
AAAACTCTGTTGCATCTAT
IGTGCTAGTGICAAACGT
GGTTATGCTAAGTCAMC TCAAGGATGATTACTGTGA
75 . ACTGTGATGG TGCATAT 111.6 IGG
AAGGTTATTTCTTTTGTCA
76 TGTGAA ACC:ACCATF T GAGA TGIGOTiTTGG
TFGTGAA
GATGTACTACGTAATAGC
'FATG1CITTAGCCGAGTGC
CGCAAGTATATTGTAAAAA
78 AAAAACTAGC CACA CAC; MT TrCTCICT GAAGGI
CTAGC
TTTTTGAAAACITTGAGGA GGCITCACACCTITAAACA
79 ITAAACAAG ACTCC:AAA CrCIG AG
FGAATCCAATGCAGGTGTAACGT AACCAITGG TIAAGTTGCCAGAAGITAI CAAGGCTITGG TIT TAG AA
13 GCAATTAAATAMCACTCAC.ACCAAAA IT GTATF TC IT T
GCTACT A
81 , CiCTACTAGAAGC ACAAAC ATGC GAAGC
.
AGCAGACACGGAC TrAGCAAAAGCAA TWIG CTITICGCGTGATC:ATAAT
CTTAGCMTI CACACCA TTGGIGTIGTAGGTGCTA 'IGCATG GAGTAACICIT AA
83 ACTC. TTAAAATT AMACACC C MIT
13 GCAACCACATTAACTGTTAACATCFGGTTITGAAGAC TTATGCTOCIA TAT TGAATOCi TTITCMAITA 1 AATGC CITTGATGGIGTTAMIAT GGITITGAAGAFX:AACCTA
84 CAAC.CTAAT TCCACAAACA GGT AT
13 ACCC.AAATACFMGMCITATAAACAAAATCCATAACICT TAACCCTGGITTMTTACTCCCiTAMTGTAAGACAG
CCTGCAAAGTGGTAAAAC AATCCATAAGCTIATTIGC
13 AOCAAT GTGTACMCGCMIT TAAT T TGI CT11 TT GIT GCACTIATITGITCMACTC.A 1 TGCACCAGCAAACTA CI GIT AIGCAAATGGACA. ATITGTCITMG TIGCACT
13 CiGCTGGACT AACACAGAA ATCA TRAGCTOTA 1 GTACT TA
IGGTTOCTAATGGTITGAATGCAAGITTAAAACiA GI TIT ACA IGTMT TAGGC GCT GT A
TGTACTAT GGI Tr 88 TAATGGG CAACO AAGTGC. G
13 GCFTGriCACSC.CA ITCTGITAAITTITGAAGAAGGCT
TGCMCAGCTATGrACAAAGAAGTGCATGGC:ACTA ITOCTCACCACAAA TAAM n-GRAGAACX3CTAIGAAT
TACCAGATCATGATTCATT AAATGATGGTAGATGGTTT
90 TGGTITTGTG . CACAAGTC TGTG TGTG
GAAGGTAATGCMTATAC
GGTGGACGTGCATTCATG
13 AC-ATCTITACAC.AATTGCCAACAGGCAGCTOCAGCCT GAC.
TAATGGTTCTGGTAGCGGTICCTGCGTAGTGTT CATTGTICTTTTGC.TGTTG
CAGCTGCAGCCTATCTTG
TGTAAAGTITOTGGITGTT TAATCATGGCTGTACATGT
93 ACA.TGTGAC TAATC GG GAC
GGAGATATTGGTTCTTACTG671111TCTATGGGGTC GCATGATTTC71TAC.TIOG TATGGTAATG1TAGTAGAC
94 GTAGAC.AGGA MACCAATT CAT AGGA
GlITTGACCTTAGACAACC TCTTAATGGGAAITTCTAT
GCATTGITCAAATTITAAC
TCTTCGCAACCACAATTCC
97 AGGATTAACATC ALM i 1 i G CATC
TGICTGAAATCCATAGIAGCTACCAGOCATAAAACT TCATTAACAAAGCGTAAT TGACTCAGTTAAA
TCTTAA
13 CAATGrAACCAAGTCCAGCATAAGTGC.AAAAGCATTT AGCCAl. ti i o i AlTATCAGAATGGTAGATCTTCCTCA TTGATGAATCTTITGTGGA GCAAAAGCATTTTTC. TATG
99 TTCTATGATGA GTCCAACA , TGAC ATGA
CATCACTAAGACTGATGCT
TGGAGGMGAATTATTAC ACCACAAACC.ACATCT11C
14 ACCGTAATCTACiTTCTCAAAC.ACACATGCITC01AAT
ATACGGTTACTTACAAATCCACTGCAGCAACATTAT CCACCGTTAAACCGTAATT CATGC.7.
TCCAAMTACAAA
14 TCGCTCCAGGGTAATACACACTEEETGo74-GTGGA CGCTCTGTGCAAAACICTGICAACACTCAACTCTGGC
GCAAACAGCGTATAACCA
TCCTCCTGGTAGTGGAAA
14 TGTTGAGGATCACCAACATATACAATCTATGTGC.ACT
14 CCTGGCAGCTACATAATTTTGACTAAGCGATTTATAC TTTAGGCTrACAAACTCAGACAGTGGCGAATATAAC
CTCCAGTATAAATAGGCG AGCGAMATACATAAAAA
05 ATAAAAACTOCAC ATAGTCATATICAC TCAA CfCCAC
14 AGACAATGACAAATAAGTAGTGGCAG7TTAAGGATT AAGACTAGTGGTGACTTGGCTCAC.ATGTTCATAGGT
GTTIAAGGATTGTGCACGT
14 TGCCAACATTGTCACCTGTGATCATGTCAG.AGGTTGG TGTACCTCTACAAGTTGGITMCCGTTAGAACACAA
CTAGAGACTTTGCCATGC TCATGTCAGAGGTTGGTTA
14 GCTCC.AATCTTAACAAAATAACGCAGGEOTTTGTACT I
ACATTGCCAATGTGGTACICITGCAATGCATGTTTA
08 TIGGGC : AAGCAACAA CIGGCTCATCTGATGTACT
GGIGMGTACITTGGGC
s 09 GITAAGAATO 1 TACAA , TGACIAGA EGTTIGGCIGT AATG
TCAT6CATTTCAC.ACTCCTGCAACGGTGCTGGTTTCA GAATTGTAATGTGGATAT
TGAATTCTCAATTGTTTGC
TGCAGG ' AT GTACCC AGO
14 TCTGTGAATGTCTGCCACAAATTATAACA1TTITACTC AGAGAACATTGaTTTAACGTTGTICTAATGGCTAC
TCTCTATAGAGCATATGTT ATAACATTITTACTCAAGCT
ACTGTCAAATCAGAAGAT TTAACTGGTITGTTTACGT
13 . TITACGTACC5 TCA GGT ACG
14 CAAC.AGTCAAATCCAAAGACTICAATCCTAGTTCTAA CCAAGGITC.ATGAGGTCATAATTGAGCATTATCM
CTGACATAACACTCAAAT TCCIAGTTCTAAGACTGTTT
14 GCACCCAAGTG TAACACTOCATATTGTGICAATATTT TGGCTCTGATTATGGIOTTGCAAAc-'t AACAGTACCAC CGTG A 'FACCAC
ACTICTGTTAATACGTCAT GTTGTCGGAATTAACTATC
AAATGMGTMGCTIGT
"mccrrGTTGCATATTOCT
14 ACG I AGMCAT IGCCAA I AGTAG TGATOCTCACATA CRICITITGTGCiG I
GCACTAAATOICCIGTOCGTGA CAACAACACTI TAG TT TCA GATGCUCACATACCATITG
14 AOCGAI:ACAATAGACACAGG FAAGAIGGTITTFATIC
ACGMATTGIGTTGIACCITTCIAMACAGTT AAAG CAGTIG TCCIT TGATG LAC AGATGGITITTA
FTCIACA
19 . TACAAGCC CACITGCC C AGM
.
GGCACCAICIACTCTATCACICCCACiCITGCTGATAA GITCCTITAIGAAIGTTAC AATAITTATGATGTAT
CTCi GTATCTOGTGTGG ACAACAA AlTGG 61G160 GITTIAACITATTCIAGTIT
AGCGICATACCIAGCFTGCCCTICTATCICICACTGCG TIAAGAAATAGCOCCAGG C.16CAGATGT TACT
GAGAT
CGAATGGCCATGTATACA TTCTITAATTGGTGGAATT
CTGCTITGAATGTATTCGTITCTCATTGITGTGC.AAG CTATCICIAGETCCATICA. TC1 ATGACAG
ACT TGACAC
24 TTGACACTAT C.TGTCT GGC TAT
14 7 GCCITCITT GTAAAGAGCAACiATTGG ITGIOCGTI G
TCACATCTCGC.ATAATGTTICIAACCGIGACATTGCA ATAAGGATC11 CAAGCGT
GGITGTGCCITGATGGTA
14 GTGCTAATTICACTGGTCAAATTCATACAAATTGCCA AC:ACTGITC:AAAAA
ITGC:AAACTCTACiCCACTT TAA TATRA TG ITAATAAGAC TACAAATTGC:CAAATTACA
26 AATTACACTG GTC:GACTAA GCTGC CfG
GGCITCITTAGTTGTITTGC
14 AAAATTACAGCAATCAAAGTTGCGTTTCTAATCCTTri MTCCATAC.AGCTTGGCACTGGTGTGGTTACATTG
28 CGTTTATGTGC . AAGACAA GTT GTGC
14 ATCACTATAAGCACCACACACCAAGCTAGTGGATGAT AAGCTrTGTTTCACTTGCCATATGTATATGATTGGTA
AATTAGAAATCCAC:TAAG
29 CATGC AATGTG6TACA ATG1TC.0 AGCTAGTGGATGATCATGC
CTGTG TAC.GGC TOOT G
TITTCT ACCTACCTGAATACATGAC TAAMATAGTAGAGTCGG
14 TiCTGAAACGTMTGAACATICCATACCCATCPACAA GAAAGGGCAAACEIGGTGGATGGICCIGTGCCAAG
GGTCATACCTCGTAATTTG TACCCATCAACAAGAAAGA
GTGAATCCAAACCTCAATC GGAATCCTICAAGTGACAG
14 GGCCAAAACATIGIGTGACAlTAGAAAATGCAAAAG CCAGAGACCTTGACCACAACTTTTAGCTTTAACACC
GAAACAAAAGAACAAAAG
AAATGCAAAAGCCACGGT
AACTTCACCIGCAACTGCTGAGCATGMAGITTACT AAGMCITGAGGAGTIA ATICACTAGAGAAATGCAA
TGATCFATGTAFGAIGGTA "TCAGAGCTOC7MAATTA
14 C.CAGGTCiTGGCAAATGGACACTCATGTCCIGTITCTG
CTTTTAGTATTTGGCTTCCGTCCTTAAGTAGCCAAAT ACTGTITGGC.ATGMACA
37 TGT ACCTAGGAC , GT
CTCATGTCCTGTTTCTGTGT
14 AGAAACATCTGTCCCATCGCTGCGATGAGTGTCTAGA GGAAATCCATATGCACTGTTC.ACACTTAGCAGCTGA
TACAATTTATAGAGACTG
33 CTC a AAGCC
GCGATGAGTGTCTAGACTC
AAAAGACGGCTGITTGAG
39 GAGGG GGCACT ATC.G1TGIGTGTGICGCC GG
TTGCACCAT A TGTAGAAGAGTATCAAAT
CCGCGGCATAAAAGTGTT
14 GGCGCATITCCITATCTITTATACWiesCATTGCTAAAA AGGACAATGGATAAAAAAGTGIGGGATTAATTAGC
GCAGATGTAGAACCTAAT GCATGGCTAAAAAGTAAC
CATGGGGACTAACAGATA AGTCATGAACTGTATACAA
AGAAAGAAAACATAGATC
ATATATATAGACATTGTGT
44 TTGTGTGGGGT A.ACCC CGCAAAAGGGACACTGTA GGGGT
CTTAGGCCAAGTGTGGTG
CAGCCGCAGAGGTTATTATTGGTATCC.AAAGGTATA
46 CTAAACG 1 GA MC:ACC-AM CAGTGCTGTCCIAGAGGT
CACTTCCACGCCTAAACG
i 47 CAC:C 1 GIGTA , CCCCACiCiGCCITCCAAAG
*FCCAAAGGGCAAGGGAa:
ATCTCCTGCCTTGGAAACTGTGTTTACCACCAGTAG GAGCCITTACTNVµTGCTG
TTAGAAGATACTTATGCTG
48 CTGAGGA ' AGGTTA AC AGGA
CTGATACATTITATTTGCA
ClIGGTACGTAAACGCAAA
AAAACTGTTGTTATTCCTA ATCTGGGAATCAGTACAGA
SO AGAG TAACG AGGT G
ATGTCTAATGATCCATATG
51 . TTGGT GAATAATATATC5T GC.AG
'FGCCAMITTITCTITGGT
CTCTTACTGCTGATGIGTT TCICATAITAATGTAATGG
53 't GCCCA ICTCCACGCGAAGGICCGCGGA AA TCCAGC ACTGAC GGCAAGGCCT
TCCACA TG alCCGCGGAAATCCAGC
54 CGTC TCCTGTCACACCX:CACCCill GACCCIGICOGTGTGC GA
GCA.TGGAC:ATCTCTCCG It 14 GCGTTC.AGCAGCGCCTC.AGAGTGTCG1TGGATCTGA AGCATGTGCCCCAATCTTGGAGATCCAGGIGTAGAT
SS GC.IG GCAGO AC GG
14 CCACCGGCATAGICATGGICACGC:CTGGTGACCTGC TCCGACTACrTCCCGGAGA I
GCGCTCCTCCACCTCA
GCCIGGTGACCTGCCT
14 CiCCGTGCTCiCACCTCAAAC.ATTAACATGTACCCGGCA
CCAGAGACGGTGITGCCiCCGGCAAAGGATGACCAG
57 . CiTCC GM AGACCCIOCGCTCCAT6 AACATGTACCCGGCAGTC.C. .
ATGC.;CATC
14 CTACCTGAACTAAGACTC.iCiG i GTACCATCCTIT IGCC ACACCCAG GM I
GCAGIGCCATGATAGAATTCCAGG CACAAAAC:AAACTA TC1CC
ACCATCCTMGCCAATCA
GIAGGIGCLICCAGAGCC.CCIAAGT ITGGGGGCGC CCAGGGGGGACT17ATG r 61 C.GGGTTGGICCAGGGCCTIGTCGCTGACTCCGCCA TAG CGGGGCCACCTTCATC.A
GTCGCTGACTCCGCIA
14 CFCGTCTGGGTICTIGGCCCCAGCTCCIAAGAACiGC.A
TCCGTAGAAGGGTCCTCGTCCTACCCC1GAAGGTCiA
CAGC:TCCTAAGAAGGCACC
63 AAGAGGCCCCTCTCC.GCGAGACCCTAGGCGTCCCCT TGGGCCCTCAAGTCC.AGOCAGAGAAGGGCCGGTGG
CCCAGGATGTCCCCCAGA ACCCTAGGCGTCCCCT
14 CCICIGGTAGGACT6GGCGACCCACAC.ACCCACCCGT
GGGCCA.ACAACC:C.AGACGAGTACCACCTCCICTICT
64 CT TGCT AC.ACCCAGGCACACACTA
CACACACCCACCCGTCT
AAGCC:AGACAGCAGCCAAITG7CCACMATACCAG GGCTTG TITGTGAC:FTCAC
AAAGGTCAGGGCCCAAGG
14 GCCTFC7TAGGAGCTGTCC:GAGACACCCAGGCACAC ACCGGTCGCCCAGTCCTACGGACCCITCTACOGACT
66 ACT . CG CTAAGCCCAACACTCCACC
ACACCCAGGCACACACT
14 GAGTIGAGCTGCCTACCATGCTCCAAAATTGCiTGCCT CCCGTOTACTGCCCAGCCTGGAAATGCATAGGAGT
CAACACGACCCCAAGGAA CCAAAATTGGTGCCITGCT
67 TGCT. C GCGA G C
14 CCTCGICITCGGICTCAGCTTCAC.ACGTTAGGGGGCG CGAGTTACGCTCCTTGGAGGGCCTTCTTCCACGGCG
ACACGTTAGGGGGCGC
ACTCGTCCGCCTACTGG TTGTCGGAGGTGGAGAGG
CACAAAGCTGTGTGACCICi1CCC.ATCGAGCTCGCTG AAACTCTAGGCCTOTGGA
GCTCCAGGGAATCMGGG
ATCTGCTGACTGAAACCC
TCMCGCTGACCAGTC
14 TCGTGGIGGAGGCTGCKAGGCAGt. I tJ CC TTGAGCT TCTI
GAGGIGGCGCGGGTAACGCTGCAGGAICTGG
GCAGOTGICCITGAGGAG
14 GCCAGAGCCAACCITAGCTC.AAGTCGAGGGCATGG
TCGGGAGAGGCGCCTT
14 GCACCACCATCGTCCACGAGCCGCTATCiCTATTGGG
AGAGGGCCGGTCTCGA
14 GACCGGACGGCCACGTTGTGAGAGTATGGGGCC.CC
AGGAGAGATAATCGGAG
GCCGAGGCCACCTTGT
14 AAAGCTCrGAAGCGCCTCCTATCCCTGTTGICTGGCCA C.GTCGTCCAGGACCAAGGGGAAGGAGGAGAACCC
GICCTCCTCATCTACCCCA
CCTGTTGTCTGGCCACG
14 GCCCTCCTITGGGGATGATGCATCTAGGICAGACAG CCGAAGGGAGGTGATGGGGAAAAACC.ATCCCCCG
CGGACATGGATAGGAAA TCTAGGTCAGACAGGTAG
14 raCGIGGGAGAAACGCCCGAGAATGGCCGCGAGIT GACIGGGGAATCGTCGGTIAGGGTITGCTCGCACGG
GAGAATGGCCGCGAGTTT
14 AACGCTCTrTTCCTICACCGCTCAGATCCCTCTGGCG
CIGGATGCC.CTCCACGAC
AGCTGCCCCCGTCAAG
AGCTCGTTGGAGAGGACC
GAGAACCACGAGAGGTGC
as C:CCiCAGGCAGAIG ITGGCCGTCiCIGCAGCCCICGA
CGCTGCCIGGTGAATGCGCFCATCCCCCIOCGIGTC CAACCUGIGCGCCGAG GIGCTCCAGCCCTCGA
GCCITCACTGGCCCIGT
GAGGGCCAGATGCAGGAGCTTGATAGAGGGACAT ACGTGACATGGAGAAACT
89 AATCCGC:ATGGAGC:CCCMCCAGGC:CCGCATGAGT TTCAGCTGCTGGACACGCAGCGCAC.CTACTTCACCC
TCCACACTGAGOCC:GG CC.AGGCCCGCATGAGI
GGCACATGMATCCICTMCGG CAGCGGCICTG TGAGGT CAGCCGCTTCITGGGC
14 CCTICAGATATGCCACCCCCCAGGGGTACIGGGGGT GCC.AGGIGGATITTGAGCTCCGCCTCCAGAATCAGC
91 GMT GGCC AGAGACGICIAC Ga:CA
GGGGTACIGGGGCi TGGTT
TAGCCGGGATCGATGATG CAACATGAGACGTGACCG
92 C(36 TCCTGCT
CACCIGGCITCIGACCGG
14 CAGCAGCGTGTTCACAAACTTATAGCCTCAGCCAT Cr ACiAIGGGGATAITIAAAAGGGGCAGGT11AACGAG TATAGCCTCAGCCATCI AC
14 GCCIGTGCTACCGGACACGGAGCAGCTC.AGGGAAIG ACCFACGCCCTTGCC:CCCF
CCGICGCTAAMACACCE
AGCAGCTCAGOGAATGOC
GCATTAGACGCGCGCG
GCCAGGATGCCACCGAGGITGAACAGGCLACICGG 'ITIGI CAA TCCATGGCAGG
14 GGACATIGI ca:cccacAer GTA TGGG 1TACCGCAC: ACCCGGTCCTIGICiACMCITACGTT
GTIGCATCiC:C GTCATCTIGCAGATCCIGG
GTATGGGTTACCGCACGC
CAACTTAGCAGTTCGGCAC
15 CAIGGICGGGCTCGGGAGAGAGTCCGGACAGF T it 00 GIG AAGC.CCGGTGCCIAAACGAGICACTGCTGCTGG ATGC CTCGGTGTCACTGTTGOC
OGAGICACTGCTGCTGG
15 AGCCAGATGTTCAGGAAC:CAAAGCATC:GCTTAAGTA
GGCTGCATTAPCIAAGCCIATGAGCTTTCCICCAGAA AT TAGGCGACTCTGCATC GCATCCiC17AAGTA
MAGI
15 GGGAAC:AGCAGGGGAGGTC:CAGGCTAA TGAAAGGIC
CCAGTITGCCCCATCGTITCCCCiC:AAGGATCCCA IG TGCTCTCAGTTAAC:GAGC
AGGGAATGAAAGCTCACG
IS GCTCTCGCT GTAGICAGACTCGTGAGGGCAGIGATA
CGAGGAIGAAGCAACCCCCGGGTAGAIGGCGAGA CCAGTGACGAAGACCCAT
TGAGGGCAGTGATAGCGA
15 GC:TCCGCCACGTATTCCCCGATCTACGCMTAGCCA
GCAGTTCGCCCAGCTC
is TC.TCTCAGGACCTCAAAGGCGGGCGAACCAAGOCC
CCACCTGCSGTTACTAATGT
OS CATGCACCAGGGGCAGCTGCGTFCCAGCTTCGTCC AGG GC
GCGTTCCAGCTTCGTCC
CCAGTATTIGGCCAGGIC
GCCGCTGGAGTTCCT
CCCOGCCATCITTGCCGGATICTGGTAGAGGCGCIG TGACCATCGAGTACC. ITC.0 AGGAGCGACGAGTATGTG
TCTAC-ATCGAGGACCTCG
CGAAAATACTGCCCCGCG
TTCTACAATGCCAGGGGG
IS GCTCTACGCCTICCTCCGCAGGOTGGTGCCICTGTG
Ci GCCCCCGGCCATACTCCAATaTGAGCCGGCTGGG ASITCGaiGTCCAGGAGG
GCTIGGTGCCICTGIGG
GGACCAACAAMCCCACCTGTGGCATGICTGITCCCC GCCCATC.ATCCOCC.GAGCATCITCCAGCXTCTCTTTC
AGAGTGCCTCTGAGCATG
GGCATGTCTGTTCCOCCT
is GTOTAGI1GAGCATICACCITTIAACTOGALT.C1T1A CATCGCAATCATGAAGTCCTCCACAAGCGTMGAC
TUTTTGAGATCTGAGGAG ACTGGACLaTTAAAACAT
13 TITCC GCC , CCATGGCGCGGTGTCA C
AATCCCATGICAGGGITGGGGTTGGTATCATGCACC AGTTC.AGGITCCACATCTG
14 AGC i TCGCT C
GACAGCACCTCCAACAGC
AATGCCTCTATGTTGGCAC
CCTGGTGAGGTGTGCCA A
GCACCGAGCTGATGGGT
15 CACCGACMCGTICTGCGCCAATCTGGTTGTC.GGCCT AAACACCCCOCCCACTGGCCTCAAGGGGGICAGTAC
AATCTGGTTGTCGGCCTCC
TCCAGAAGCCAGGGAGG CAGACGGATGTCAGGTCG
CTCAAGTGTAAGCAGCCCG
AC AA CTGGAGATCCCCGTGACT
GTGCTCTCATCCCTGCAAC
AGGAAAAACATAACAATC AACCAGAAAGATACCCAG
15 CGGCCGITTCAAATGCTGTGGACACGCATCCCTGTCT 1 CCT.
GaCCAGCGTTGCCTCCCGATTTGACCTCACAC
22 Ta C I GAGA TGTAGGCGGGGAGATGC
CACGCATCCCTGTCTICTC
t IS 1 'ITGGTATAGGGCAAGGTIGGGCGTGGTGCTACGCG
23 1 GCCCITCCACCGCCXIGTGGCTAG CtCaXiCAAA 1 AOTTG , AGCCCCICTaCIGT CC GaTAGCCI:C.C6CAAA
15 CGGGGTTTACAGTGGCCTC.ATTGAAMGGGGGTGGC 1 CCTTGGGGTAGCCGACGGTGGGTCACCTGACTACT
24 GGT ' GAGA ACGGTATGCAGGOGCTG
GAAAAGGGGGTGOCGGT
ATGCTCTCCITCCTCTGAG
TCC TGGCC C
GGTGTCTTGGGCATCATCC
GCCCCAAGCCITCGCA
GAC.AAGGCAGTAGAGGAC
27 . GGACAC CAG TCGTCTGCACCCCAGTG AC
CAGGCACCGTCTGTATACG
26 CGT CT A1'AGCGGCICACAATGC3TG T
TAATGATGAGGAGCACCG
GC:CACCACAGCACAAGTG
GCAGCACACGCCCCCATTCCITCCCCGGGAATGTGT GGCT TGGCG ITTACCGGACACA
CCITCCCCGGGAATGTG T
is AAGGCTGGCAAAGATCCCC-AGTCTCCIAGGAATGCC CTAGATITAGCGATCCCCCGGTACGAGCGGGATAG
31 CiAGC CAGG CTCAAGGCGAGGCTCCT
"ICTCCTAGGAATGCCGAGC
GAGCTIGCGGCTGAGCTCaCCGIGAAGGCAGGG I
GTTGCGCTCGACXIAACT
IS GCCGAGAAGACCTICTCCICCVITTCAGCACCCCCAC CTCCAGGGAGA I
GGGGGCCATTGCCGATUCCIGG
33 . A C CATCAGGGTC7111M.GTC
CGTC.A GCACCCCCACA .
15 GCCATCAAGTGCACGTGCAACGGGGAAAGAGGCCG CAGCTGLIGCTIGTCGAGGATOAAGAAGaGCTGG
34 TTGGC alcc CGGCGATGGAGAGGCA
GGGAAAGAGGGCGITG GC
is GCCGGGACAAGCTC:AGCC TCCCCAGTAGGTGCCIGG ACACGGCGTACATG TCGATGGG I
CTIGCGGGCCAA
Cr CAM CACATCCTCCTTCITGGCC
CCCAGTAGGTGCCTGGCT
15 AGGAGGCCGGGCGCTATACiGACCCCCTCIATGICIT
CtCCIGTCCiACCAGAGCiACGACCIGCCACCAACGAG A TCiai AACT CGTACIACCI
ACC.CCCTCTATCiTCTTCGC
15 GGGACGTGGICTACTGGGAGCGAAGGCCCGICTGTA GCAGGC-ACAGGGTCTCCiCiTGCCAAGCTGCAGTCCT
GAAGOCCCGICTGTAGAG
37 GAGG G TCCC.AGGCIACCCGAGA G
15 CC3aGIGIGGGCGAGC7 GATCi TCCCCICIGCCOTGG
CC.AGCAGCiGCACAGAGGTGACGCTITCGGGGCGTA GCCAGGTGG ITACAGGAG
3$ AA AG A
GTCCCCTCTGCXXITGGAA
IS AGCTACiAGAAGGAGCCCCATGGGCCGCATCCACGTC TICCTICAGCGTCIATGCCCAGACTGGCAGCCGAAC
39 CT CAGAA MACTGGACC.ACTTCGGC
GCCGCATCCACGTC.Cr 15 TCIGGCTGCGTTACACCGATAIGCAAAGATCT GCGT ai TAGAGGACGGAA ITGG
GCAAAGATCTGCGTGGAC
GGACA CGGGCCTCTGGACCAGTCACACCGGCCAGCCTCAACT TG A
TAGCCCiCACGGACCCTGCACACIACTAGCOA
ACCTGTGGCCCGCGTA
15 TCAAGGCCTC:CCTGAGGAAGGACGATGTCGGCTGTC TGAGCCGGC.AGAGGACCAGCGGTGTTC.ACCCGGGA
42 CTGC . TG TACGTGCGTGTCTTTGCC
CGATGTCGGCTGTCCTGC
43 CCICCCCTACGGTTACCCC.ACCGGGAGGCCGTGC71T TGC GAGGTCGTTGGCGGCA
C.GGGAGGCCGTGCITT
CAACAATAAITTCC.TGTGC GAGCGGTCTGTATAAAAAC
CTIAGTGAAGAGTOTTGTCACCACTCTATAAGGGC TTGGAGAAACTGTTGCAG AGACAAATATGATTGGAA
ATTGGAAATAGACC TATGCTCC A ATAGACC
15 CGTITGTAC:TGGAGAACCAAC.TAACGTTACIGCAAG
CCCGCCIATTGCAAGCTGTATAAACAATCGTCTTTTG CCCTGGCATTGTTTAATGA GTTAGTGCAAGATTCCAAT
15 TGAAAGAAGAAAATGGC.AACTGGTGACTGTACTGAA 1TCCTTIMG1TCGGIGATTATCCAGCTGACTCATGA
CAAAAGCTGTTITGTAACA ACTGTACTGAATTCCAAAT
is GGTCTATITGTGTGGCGAGITTTCAGTITATGAAAT A
48 ACTAATGCC.ACC AACTG AIGAGG GCCACC
lICKACTTCICTIAGTF CTCAAAGGGGAGGMCTGA TATTITGAGGAIGCAAA00 is CCAAAACGG7CGTGTAGGICTTAACAAITGCACC.GA TGCAAGCAGGCCCATAGATGGGGGTAATTICTGAT
15 AGGTTGGGG i it 1 t CAATTTCAAATTGGAGAAGAA AT
AC.ACCTTTAGACAGAGTTC.AGAGAGCTGAATCiTGA TAGG11TTGCAGMAATA
TGGAGAAGAAATTCCITTA
51 TCC1TTAGATCC CG CCCATA , TTGTG GATCC
IS TGCCTCCATGAAAGTAATATCCAGTATCTTCCACCTCA
CAGAACGGTTGCTTATGGTTGGITTC.ATCGTCTACA GCAGACCACTGGAN \ ACT
ATCTTCCACCTCAAAAGCC
15 GCATITCCAMTCCAAATTCACACAGAGA1TGICCTC 1GAGTC.117TC.AACAAGACAGAGCTGGCCAAAGAG
TTAC-AAAAC.CATGTAAAG GAGATTGTCCTCCTATTGA
15 TGCAGGAC.ATITTGTAGCTAATGATGGGAACTTGCCT
GATGTTATTCCAGCAGAAAAGCCTGTC.ATATCTATT GCTCATATTAATGTAATGA
GGGAACTTGCCTITGTACC
15 GGTATGTIGGCMGCMATAC.TCCACGTATATTITA CTGAAGCCATTTGCACTTATC17CTAAACGGT1GGT
CTCTACAAATAAATCTGCA CACGTATATITTATTITTCA
AGCAGGGGAAAATAAAA CTACTGGTCCTGTTATGTG
IS GICGATGAAATC7CCTGGATMCATCTGOTCCAGTG GCfGAGGGAGCAATTGAGCTTTCTTTGGGAAATATT
CIGCTFCCAG TGAGA ICA
15 .ACTICTITCCUTTCTIGTTCACATATTTGCTATGGCTG
ATTIGCTATGGCTGACGG
is AGCATACCITGGTGCTATTAGATTITTGCTAAAACCC TTCGCACTGAGTAGAGGCTTTGTTACACTCATGCAT
GAGGATGAACTATTACTG
TTGCTAAAACCCGGAGAC
15 AACCGGCAATGGaCC.AAATGITAC.AGGACTAAGGA 1 TTGAAGGGGGATGGACTGGACCCTGTTCATTCTGAT GAGIGCCAAATTGAGGAT GITACAGGACTAAGGMC
60 ACA ITC 1 GA TGA G Arf C
i CTAICAGATTCTGGCGATCTACKAGMACTGATIG GCATGGAAAGTGTAAGAA 'FGATTATCCCAAATATICA
62 I ATICAGAAGAG 1 CCCCCAG , ATGG GAAGACi AGGTGGTTATTTGATCGTTGGGAGCGGTCATATGA
62 GAG ' AAATGTGTA CAACCGCCAATITCACTG
CAAGGTTCGTMGCAGAC
is ACACCAAATCCTCCTCTGICCTGAGCTACITAATTITG
GGAAATATTTGTATTAGG TAAGTTATTAGACTTAGTG
64 TTAGTGGAGAAG AGC.I. 7 TG PCGC GAGAAG
CGTGCTTGCTACAAATTTA
65 . AATTIAGG GAICIAAIT CGGTGCAAGACTICGT11 CiG
AGMAGATGGTGAITTG ATTIGACGAAGGGACAGA
66 AGAC Gar GAM. C
IS CTGITGTTGGIGGATTGGCCAAATGAAGCTGAAGAT CTCAGGGAACTAAGGGCITGGACAGCTITCACATTG
ACAGCGGACTAGAATGTT AAATGAAGCTGAAGATCTT
GGTAATTGCTGIGTATGCT TGAATGATGACTTAATTGA
IS AGICAAATTGIGTAGC-ATITGCAGTCCTGAATGGATA
TTGATGAAGCTACCATTGCATATCAGTAAAAAAGCC AGTATGA ACCCTAATGTAT
ICCIGAATGGATACITACA
69 CITAC:AC.A CTGGCA 37 A ACGC CA
GACCAAAAGIACAAA TAT CAGCAGACi rf AAATCCITI
70 ATC.CTTIGC CATCCTC TTGCA GC
IS CCCCAATC:GCATAACTCCATTIAGACI7 TMGACAAG
TACCAACCTGTACCTOCCITTGGCAATAGIAAAACCA ACiACT TT AAGACAAGAGC
73 , AGCAAAT TGCCAAT GICAGATIGACCACTGGC MAT
.
15 ACCCCGACCTGCiAAAGGGTGTCTTCGCC:ACCAACIG CGCCAACAACATCGCGACTCGGG/CfGATGICOGG
ICATTGCCCCTCIFIACIAG
TCTTCGCCACCAACTGGA
is CiCAGGGGGTAGTGGAGICTGCAAAGGGGAGGGGA CACAGGCGAAGGGGAAGGGGAGGAGTCiGGAGCG
CAAGGGTCGAACAGAAAC
AAAGGGGAGGGGAGCG
15 GCACTTAAGCA.CATTCiGCGTCCACGA I TACTGGCTGA AG I
ACIACGTCiGICAIGGGTAGAAICA I TCGCGCTC
C.ACGATTACTGGCTGAGG
CGTACTAAACGTGCCTCTGTTACCiTGGACATGTACC CCTAC.AGTAGATTGGICTT ATGATC.TGTAAGC1.
ITAC.T
15 CCIGGCCCTICCCCTAAIGGAA TAAGTACAGGCCG 1 G AG 1TCCiCCC
IGGGGTGA TACCGGGICAMTGGTCT ATAAGI ACAGGCCGTG GC
76 GCA GAC.IGT TGTTTTTCGGGGGCTTGG A
IS ACCTICTGTIACAACIGGAG TATCTAGAA ItAATF GC GGCTCGIC I
GMAT T ITAGAGGATTCiAGITCGTGCA ACiAAICAA ITGCTGAGA TA
77 TGAGATACAC ACAC GTTGACC.TTITACCTGEFTG CAC
15 GGCGTTG TACATCFAAAAATIGCCTIGAGCi TCACACA ACAGAAACACCTCAGGGATATGTTACGI
GTGCCiGA 11CAGTITGATAAT CCAGC TGAGGTCACACA.AA TAT TT
IS ACTiTCAACTCiGGCCTTGTACTGAAAIGCAAFTAITG ACCTGATGf.all GGAGGTGGGAA
FTAAAATCCACA ACCArf AACACAGAAGAA GAAAIGCAATi ATTGGGG
15 GCAAACGGTC.ACTMTTGCATGATACAGTGGCGAGA
GIGGGTC.ATCCATATTATGATGTC.CAGACACTITAG CC.AGCAAGTGGTAAGGTA CAGTGGC.G
80 GTTCAA AG . GGACTTCAAT 7 G
15 ACCTGATGTTCCAACCCCTMAGTTIGGGCC:IGTAGA GGITAGAGACACAGAMACTCTAGCGATGT51717 TTATAATCCTGATAAGGA
GTTTGGGCCTGTAGAGGT
GTGATCAACAACMCGAACAATAGMTACTAATGA AGAGAAC.AATGTTATGCC TTATGrfAGAGGAGG 17CA
82 AGGTTC.AGTT CCCACTTACTGTA AGG 617 AMATGCGGACGTAAGAG AGATCACTGAAAGCCAGA
GaTTOCAAGACAATCAA CITTCTTCTAGCATOGACC.
84 CC.A GTCTC GG A
CTCAAGGGCGTTATATTGG
is GMAATGCCACTGTATCGTITCAAGCAAGAAGGITG AGTCTGCTCCTAAGCCTGAAAAGAMCITCTIACCC
86 ISGAIG TTGGG CATCC.GTTCTTCACAGGA
GCAAGAAGCT 1G IGGA it 15 ACACGITAACAGTACTCGATICCIGGCACiGAGAAATC CITCCCTGAAGGTIGAGACAATTTGAATGCTCAAAT
87 CCATC C.AAAGGCG GCG
GGCAGGAGAAATCCCATC
15 CGTI1TGAITTAGC.C1CA1ICTTCAIGGGGGT TAAAG
AAAICCATCTACGTGCATITCCTTACTCTGAAAIGGA ATCAGAGGATAI AGO-IC
RS GGAGA TTCTTGATG CAAA
ATGGGGGITAAAGGGAGA
15 GTGAAATGGaiAACTGTTIGITCGCATGGIATCAACA AACGGTTITAATAAAAAGCTGTGCTAGACATGAAGT
GCATGGTATCAACAGTGG
89 GTGGG ATGCAGAGAC , CCCCTTGAGGCATTTGTG G
GTACAGCTCACAGTTGGCGTAAGC.GATACGAC:TATC AACATACTGCAGCTIGTG
90 CTGGTG i TAGC G
GTGTAGCTGCTTCTGGTG
15 ACTCGGAGACAAGTGATATAATGGACAGTTAGTCTG C.AAGGCAGCGTGGAATTATACAAC.CATATTCAAAGC
ATTCGTGGAACITTCAGTC CAGTTAGTCIGGAOGGAT
is CCATCTITACATGAGAAGGCATTCCCTACCTTCCITTA GATCITTATGCTCTGGGGGTTGL G
iii i CTGGATCA ATAGGGATGMTGATCG CTACCITCCITTAGTAAAG
CTGTGGCTCCNOTACAG
is CGCACCTTGACAAATCCTITAGAAAITCTCAAAAGGG GCTGTIGCCGAGGGAAGAAACGATTCTITTCTCGIT
TTCTCAAAAGGGATTGACG
CTCTGGAAGCAAAGGTTT TGAAGCTAACAGAACATTG
is AGACAAAGAGTATGCAGCGGTCATCGCAATCGTAAG TCAAACAGCATGATGICCGATAGCAGGCGTTATOG
CATCGCAATCGTAAGTATC
ATACCCTGTCAGAAATACACGAGCACAAAACAATC.A GAGCAAGAATTTGTAAGA
CCTCTITGTGAACAGTGTC
is ATCACCCTCTACAAAAAGCTCCTATACAGGGTGTIAT 1 GCAGACITTCTGGGGGAGGAAAGAGAGCTCAGATC TAGACTAGITC. TGCGCCTT
TACAGGGIGTTATGTITTG
t 99 CIGCLTIC 1 GGC:AAC , TCTCGAAGCCGTAGAA.GC IC
is TTTGCAGGCCTGTAGTCCCTGAAGAAGATAAAACTAA 1 TGCTTGCGAAAGGTGTGTACCCATTATTCCACCCCC AGTICATTCGTTTTGTATC GAAGAAGATAAAACTAAG
00 GAATCAGG ' ATC GT AATCAGG
AAAGGAGAACCTCGAGCCITGTTGGTAAAGGATGT Ca. i la r z I i CTTCTCGGAA
GGCTCTTTTGTTGAGTTIG
CTATTCAACTAAATGGAG CAATGTTCTGTTATCTCCA
TATCGATGTTTCTCGCTTAACAGAACTCTAGATCCTT
03 . GGC AAATAGAGCAM CTAGAMCJW.JATITCIG AT
GCGGITITAAAGATGGC
CCGCTTTCCTGTGTGTAATCATCTCTGCTATCGAAAA GGTATCGTAACGAATAGC TCTGATTGAGGGACTTCTA
GTITCICGGAAATCACAAA
OS 'TCACAAACT GITTCTCI CCAACACT TCCGCGAT TT CT
TGGCAGATICTITTICGTG C.IGAATAGGAATTCCTCGA
06 CTCGACC GTGGA.T AA CC
APAGCCTITTAGITATTCC
07 'MAGI GGC111 AA
CTTCGCCAAGCTITGAGT
16 CGGAG7 AGAGAAAT CAGC:AGACTGCTICGCA 1 TCCIA
GAAGCTGAMCGCCGA I C17 ACITCICC:CAGACT G cnCGCAlICTIATIACCAA
08 TTACCAATG rraT ATGACAGAGACACGACGA ICI
16 CATGCACC.AATAGTfACCiGTGT1T1ACAGTfTGF GGA
CGTAICC.TTGCCGGTAITCACTCGAT A FTACGGAGA AGCAATAGTCCTITCCA IC
09 . CitIGTT CAAAGA 6 TTACAGITTGTGGAGGGTT .
16 Gal GIGIGTCITITGICITMCATAAACAATAT GO-CA.ATAC
CAATACGGG TAGGGAT GA GGG
TAG TGCA.GAAGCTAAGIT TAT
CMCGGNsCAACAACTGC
16 CA 1 CCTGT TCAGC IT TACGAACiCiAAAGCCIGTACAGA TCTAGGGAT
16 GGAAGCAGATAATITAGAGGAAGCTCITGCCaTIAT CAAGTAGCGAAAAATCCAACAGCCCACAAACCAAA
CCAACTCCIGTAA TACATG
CTIGCCXITTAIGTGGITT
16 CCTAAGATACiCGATCAAAGGTIGATCCTACTTAACCI
CTCiCTCAOCTCITGCGGTTATAACCGCGAATCCCAT ICCTACITAACC TAT ITTAC
16 GCAGGTITAGATGTIGIGGGITTIGGCTAIGGGITTI 11TGTCACGAAT Al AGCCACTGAT
TNIGGT TCTICTG ATITCCTTCATTACCiAC:AG
TGGCTATGGCTTITATGGG
16 AATGTICTAAAGGGGAIGIGGATITAAGCACT Ca CC AGATCCTCTAAAT
TiGAACCTCCGCATACTAAGTTGI GAICTCGCATACCIGAAG
AAGCACTCCICCIATC.ACA
TCC.AGACATAGCAACAATGACAGAAACTTC.AATAAA CCCCAATTTITCATGAAGA
CCAACAACATTATACTCCC
18 CCCCT . CGAAAGCG GC CT
TGATGGTTACTICC.TCAMACAAGAACGGTAAN 'µ GA
CTGTTGAAGTACGTCCCTT
TAAGACGTTGCTCTTATCA AGGCTGATAAATGMTCG
TAATTTCGCCACAAGACCA
TTGTCGCGAATAGTCACA
TCTCCAGCAGCAAAAACTAAAATTCTGCTTCGCiTTT 11TGAMTCTCTACTAAGT
CAATTGCACITCAAACTCA
GCACGAGTTACTGGGGITGAAA AACTGTGTATGAGGTTTA
24 AMC Te3TTC ACACT
GTITGGCAAGC:AAACAAC
16 TGLTITCTIGCOCAAGATCTCACCICTIACAAACCGCA CAAGTAG1CAT1GCTATGaCGAA1AGAG11GTACA
ATCACAAACCATAACAAT
25 TIT AC3CGATTIG At-AA
CCACTTACAAACCGCATTT
16 1TCACAGA TCCTCC.GTAAAGAAIGATTGCTICTCTCTT
ATTGCITCICTCITTICCAA
16 AGGTTATGGAATCGCGGTTATCAGGAGCCTGTGCAG Gres- i 1 u. I Go 1 I
AGGGAAAMACGTTITATTTITAT
27 AATGTG AGCGGACTTCT , TAGCTTCAACACGOCTCT
GAGCCTGTGCAGAATGTG
16 TGGAATTTCTCGAGGCGCTCTAAAGTAAC.AGGACTAT
CATTCTGTGGCTTCTGCATTAGATA/kGGATGGCCGA TAAAGTAACAGGACTATCC
23 CCCCT TTCGT CTCCTCTTCCTGCTGTTG C.CT
AGOGCTCTTAAACAATCTC
29 GCAAC.AC AAATAGTCGA A
ACCGGTACTTTGCAACAC
CAAGCATCACAAGATTTAT GMACAAGAAGTTCTTITT
16 TGGTCAAAGACGCAAGCTGTGACCTTATCTITGGGAT AATGCCGCTCGAGICAATGTAGTTCACGAGC.AGITT
GACCTTATCTTTGGGATAC
AGTTTGAAACTCCAATACT
AGAAAACAGGCAAGCCAT
16 AGCTAGCTTC.ATGGAGTATGTCTACCAACATAAAACT
TCAGGAGACGAGAAATCA ATATCATTCTCTGTAACAA
CGGTCTTAATTCCATTAAT
GTAGGTAAAACCTCGTITGGICAACATCTGTATCAA TTTCCATATTTCGTGATTG
36 CCAA : AAAAAGGCACIC CA
GCAGACACTACATGCCAA
s ICACTACTITITIATGGTMGT GIGGAAAAGCTTFGTTAG
37 1 CCGTF I CfC3TGC: , AGG 'FM
TIGTGTCCTTCCGTT
GGAACTGACCCTATAAGGAATGC.GTAACGAC.ATCA
311 CCA ' TTGCATGA GCTO1TGTG11CAAA3CT
IXTAGCAAAAATCTCCCCA
16 GCTCAGTATTTAGCAGGGCAGTCTTCCCAAAAGAC.TT GACAACCATTTTTGCTTTGCTGTGGGATTTGGCGAA
TCTICCCAAAAGACTTCAG
AACTGAC.AAAAGCTGCCG
41 . GGAIT AAGCATAC GCCCC3GAATIGGNITTGG
CCGTAGICIGTFCGGGATT
16 TCAGAAAGGGCTGAAGGCAGAAACGGTGC.AACTCG ATCTGIGCAGCTTAAAACCIAATGTCTTTAGAGAAA
CITTGGATAGAGAAGCTG
42 ATC GTC:ATAAATACGGA Cl' AAACGGTGCAACTCGA TC
GGCTICTGTCACCITGCTCAGCTITAGTTGIGGCTIC CAA.AAATAGGATT AGAAG
43 AGAAGAACITCC AAC CfCAAGCGTACGTCCITC AACTTCC
ACAAGAAGCGGTGTGAA GCTGGATTCACAATAGTAA
TTCTACATTATGTGCALCGATC if 3II i GG CAGAACACATCACTITAGA
16 TGGCTCAGICGGTICITCAGTAATCCACIACTCTICCfC
GAAGCTGGITCTACAACAGAAACIACAGITICTCCA TAATCCAGACI CTICCTC:AT
ICCAAAA.P.MCCICATCTIGGAGCATC17 TTAAATCCCATCCCATICA AAGGC'AGAAGCFAICTIGI
47 . G71 GTGGA GG T
=
16 CITGAAtTICGAAAGAGGATGTFGCCITGCITCAGIT GCCF GACCOCAGAAACCATTICTICTI
CATF AAAATC GACICTGAAGGTAA TATC
GGTCGAGGACAA I A GAATTTAATCCCICITCGA CCTITAATTACAGCTGGGA
16 ACACIGATTCTTGCTITGOCAGAATITCAACATIGCCIA TTAGGICCIGTAGC:ITI
AAICACAATGCGCTCCCAITTAGTGC:ATGAGAATCG AAAGGAGTCGGACCATGA
TGACTGCGCTATGAGGAT
16 G TAGGGGCTCAAGGACGAACTCAAATCAAI cavrnt AGAAGCAAGGATGAGGITTICT
GTGIIGGAGCACA CCAGCTTTCIGTAAAAAG TCAAA ICAATGATITCCAC
53 CACTI-CA CATC.ACG ACAA ITCA
16 GATTCHAGAGEGGGATCAGCAAGTIGGACAATAAAA GeICKTAAAACCCGACAAAGAGAACGTC:CCTANIAT
AGAAAATFGCGGAGGAAC
TTGGACMTAAAATGCGCG
16 CCTACCTGGITGAITAIGGAGAAAITACCGAA 7 CTCf AAAAGCAICIAGGAGC7CCCCAAAGGACIGGA1'AT -1ACCGAA TCICIAATAATG
SS AATAATGGATC.A GAAGTGG GCATCTTCAGGAACA:GTA GATCA
ATCTACTAAGCCCTATATT
56 AAAGCTCCTITITCGTGCCCGTGCTGCTATCCCCGATA . GGTG AAAGTC
GTGCTGCTATCCCCGATA
16 TCC.CATTCTCTICACCIAGGTTTATCCACCATCCC.TGCT
ACCACTCTCITAAGrAAATACTGACAGGTTACTTCCG CAAAACX.GATACAGAACA
ATCCACCATCCCTGCTTC
16 TGITC.AATCITTMTITGGAGCAGAAATTCAC.TGCTITA
CGAAGCGGTGAATGC.ATGAAGGATGAGATGTTGAA AAATTCAC.TGCTTTAGCCT
AAAACGACGAGGACCCTA
TCTGCCATCAAATX.ATTTCGAATGGGAATGACGAGT T/kAMAGTGAGCAACGAA GTOCTGCTTGAGANITCTT
TTCCGACCGAGCAATCTCAGTITGGGAGTAAAAGTT TAMTCCATAAGTAGTTFC GACGCTGTAGCATAATCAA
CAACACFTITTTAAGGCATCTTCCAGTAGTGTTGTTG AGAACGCAAAG Al-TUC-AC
16 AGAAGAAAACTCf:AATIAAAAC.GIGGTGTGOCCAAT GCAAAGCGTTCATATCTCGGACTAAGAAAAGGTCCr GIGTGCCCAATCATITCAG
TTCTCTL. G1TGATA ATATGCTGATCAAATITTG
65 TGGCT GCCT , CGA GCT
AAACCGGC6TA11TCTTCCTGAAAACAATTCCCCIGT GCTTACATTCAAGATGAG AGAATCC.C.CTTAG
AAGITA
16 TCCCCAGGCAATTTCTITGCAAGGATTAACCAGCAAA 1 AGACCCTC.
67 TACAC ' ATAGAGGATT CA AC
16 GGGAGGTAAATCAAGACTCCGATTTCCTGC.TrICATC TCTTTGICATCGATTTGGTCAGGGAGGGGAAAGCAT
CCTICTGACTITATGATCA
ATCATGGG
16 ATCGTAGAGTCTIGTATCACCTGAAANCITGGGATAG C.GTCCACTA&AAFATETTGAT'A-CGCCAATCAAAAT
AAAATTGGGATAGCGCTCT
16 ITACGCTAAAAACGGGCCTATMGATTAACCAAAAG TAGCTCTGCAGATTTTGTAGCACC.CTCCGGGATAGT
TAGCMCGATGCTCAACC
TAGAAGCTATGGCCAAAA ATCGCAGAAGAAAAATTCC
ACCTATGAAGAAGCTCGTC
CTGAAGGAATCGCTC.AAGGACGTGTTCCAAITTGGT AGACTITAGGAGATCGTGT
TGGAGGGGCCATTCCGTAATTGTIGITCGAGATCCT 1TA TCCITTATCTAACAG It 75 c:Acreracr 1 USG , ATCGATGGAGGIGIAGGC 'Farr AACCTGGGAGAC.AAGATGCACCATTGGAGATAAAA Gil ft I I fACTATTAGGAA
76 GGAATTAACG ' GCGC TAACGCACGGGAAAAAGT TTAACG
MCGAAACCATACCATTA TCAATGACCTTCCTATGTCT
TCTTIATCAGGGAGAGGG
78 TTCCC.TCC TGCGG T C CA
ICGACAAGATICCCTCC
GAAACGCAAACATGGGGT
79 . 1 GCC C711 A
AAAGCAGTGAAAACTIGCC
GAGCTTGTCCCAATTGCTCCGTCAAAAAGAAACGGA TTTGTAATTGCTGG 1lilt CITTCAATCTTCGGGACAA
TGAGCGATGCAGITICTACAGCAGAAGATACCCA ATGGCACGTCTTAAAAGA
'TCGTGCAAGTATGITCAAC
TGTTCCCCGTACAAGCAAG7TTCTGCCC.CATATCTGC ACGTTTAGAAATFCTAGAA
82 CTAGAACiAl GC A ACAAGNICAGIGGCAAGC GATGC
CTAAAAATTCGTCIAACTG
16 TGC:GAGTGlIATATGAGCAOGAACRCIACAATAGCG
CSCCAGGCAAAGCAAAAAGCAAGCTC:ACTGITGCT AA TACTICCACTAIATCCC AACTICTACAATAGCG
MC
16 IGTC:CATITATTISGTCTTGGAGI OCT TGAACGCAAC
GGTAACIGTTGTAITCGCACCMGC:GATATATCAATG CATA TCCTIGITGATGITC
8S , GAGAA AGGTAGGAC TTGA
TCTIGAACGCAACGAGAA .
16 'ICCAAGAATIAGACAMAGATCGCACCAMG1TCGC TlITCCIG !GT
CCATTTGTTCGCGAGTCA
EGGGACCCTCAAACTCAA
88 TTTTGC. CAAGAACT ACGCSGATCACCITITCTCA
TTGTCTCGCACGTTTTGC
16 CGATCITAGTAMTGCTCTTGCCTTACATTTCTCGC.AT
TTCGAGCCCTITTGAAGAGTGATCGC.AATCTAATGA ACATTICTC.GC.ATATGAAC
90 TACGTTT MATTI- rrc TT
16 IACC:GICG1 ATG TCGGCTGACCAT MCACCA1T TCAC
AGGGAAA1CAGAAAATCCCGCP.AAACC1TCCTCI TA GGGACAATAATGAAGGCC CCATMCACCAMCACAG
AGCGTACATCC:ATGA TTIATIGIGTIATCGCGGCTA TAT CACAGATACATCAAGA
92 AAGAGCT TCGTCTT C.GGAC.G1TATGGACTAGT GCT
16 GlIGTICTMGCSGTACAACAATCTCCAAAGCGACGA CGCGC1CCGITAAGATGA
TRAAGGCCCIATIGAC:A
CCAAAGCGACGAAGAACT
CAGGAGCMTAAAATTAC CAAGAGATGCAACCAGAA
94 AAG . CGGAAAG AGG G
TCGGACAATAAACTCGTAA
16 ACCTGCGGAAGAAGCTGTTGAGCTICATCTTGAGCTT CAGGACGC.ACCGGAGATAAAATTAAAGATGCTGTT
AAAACCCCCTCTITAGAAG
AAACTTTGGAAGATGC:TG GTCATTTTCIGTTCATTTCG
GAAAGGGATTIGTICTCiT
CGCTATGGTCAACAGCAT
16 GCCC1TTCCCAAC.TCCTTGAG11TAGTGCACCG1TACG
CTTAATCTTTCTGAATGGGGCGTTCCAGTAGGAGCA
99 A TAC.AGC CTITGCCGGAAATTGCTC
MAGTGCACCGTTACGA
TAATGGGAGTGGAAATGG TCGTCFATTAATCCITCIAA
01 AC.ATGIC ATCGATTC CTITTGCGTATGGCTACC C
17 CsIGGGAGCGTICGAGACATGATTAGTTGCTAIGTCT
CAACGAAATACGCTTGATACCCATTGGAAGTTGCAA TAGTGACAATAGTAGGAG ATTAGTIGCTA
TGICTACCT
17 GGACAATTCCGCATAATITCGGAAGCTC. rGTTTGGIT
AATGTCGAGGCTITAAACGTTTICATATCGC.AAACG CGTGATATAGCTATMCG
03 cm CCTTGT , GA
AGCTCTGTTTGGTTCCTT
17 CAATGCAa:CCTTC1TTCTGATCCAAGAGGTTCC1TCT 1 CGTCTGC.GTAAGATGCATCTTAACGTCTITC.CTAATC GTGGAGAAGTAGACTTTG
04 CAA i CCA AGTT
CCAAGAGGITCCITCTCAA
17 CCACCITC.AATC.ACAGCACTTAAGGAATTCGATACTT
GCCITAATGGTGTGATGC.CTGCTGCTTGCGITTC.TAAA CTTTTACTGCTAAGATTGG
AAGGAAITCGATACITGGA
OS GGATTC CTCT ATC.G TIC
17 CGGGATGATTGAAGTACAGTCCATGAL t 1 i ii i CAAG
TCCGTAACTITTGATTC.AGAATTGACCaCCCATGAA ATGAL CAAGCTAA
i is- I t CGCGA TGTTGTTTCGGAGAGAAG TCTTAGATAGTC.GCTTATT
AACTTGCTICTCAGTGAG
GTTGACAATGGCTTGAGC
17 CGGCTTrGCGCATAAAAAACATTTTCAGGTTGAGTAT TTGCCTTAGGAGAAAAGGGAAGGACAATCACACAT
CATCTCTCTCGCTATCTGA TTTTCAGGTTGAGTATATG
TAGAAACTATTGATCAAAG
TCAAAGTATCCG TTTA CGAATMCCCTIATTGGT TATCCG
AGAGATACAAAGTGTCTT AGAGTTAGTGGTCATAC.AA
GAGATC.C.ITMGCAAGA CAAAAAAATTATCAGGACG
s CIMACAAATCAGCCGATCCCIATCGATCAGATITAT ATAACGATTACTf GCTTGI
CCATCACACACAGCTITTA
13 'STAG 1 GCAGAT , GC C
TGCTCGCTGCAAATTCCGAAAGTITTGCGTTAGTAG GACTAC.AGTCTCTCCTGG
ACTGGAATATACACTTCTT
14 TTCTTCCG ' AAGCT 6 MG
ITCGAAGA TGCTTAGT GGGT AGA
AGAAATCCCMCCAGAAA
17 CTGTTCGAAACOCATGGCAGCMATGGATCGMAG CAAGAGCACAATGGTATCCGAGCTAGCATTGC.ATCA
TTGGCTTGGATTACGAAT
17 . ACC GGACTA GT
CCCATGGATCGTTIAGAGC
17 TCCCAACCAACTGC1TAAACATCTAGCT1TGATC.AATT
TGGAAGAGGTTCTITT ATGAATCGTTAGCACTGCAT CIAGCTTTGATCAATICCG
GTTGCTGTTTGCTAAGAAACTA
19 AGGA TC TM MG CAACIaTTICCGCAGCTI
'ITIGGCAGICATT GAGGA
TCET APS AGTFCGTTGCTATCCACAT TT
17 ACTC I i I i tAGACTCTGTGCGTGICAGCAGAGCAATA I
21 CafTGG 1 CGAA C
AGC:AGAGCAATACC1 MG
17 GCCGCAATCGAAAA1CTC1CCTAITAI1CCCCCTTAGG I CrAAGGGAACAGAAGGCITGA I
AATCITC:AAGCAA
TATTCCOCCTTAGGITTGG
TGCFAAAACiG1CGC6CTA6AACICATGA 1 Gra:617G A1GC11TTITGG-TTATGCA.
ACCITSTCTICTITCACi ITC
7.3 . AGTTCTG 1 GAG 16 113 =
GTOGAAATICGGGGGAGTCTMCTAAGGACAAAA AAAATITCCAGGGGATAG
TATTCATGCTTCGATCCCA
ICC( A
ITTCTCGCAAGGGAGTTGT
17 TCCTGGITITCCTCTCITAATAGCICP.MACA TTCIG
GMGTITTIGGCCTTATC
26 GCTCGCAATAGCAGTITTCCTTETTGGGCTICTCTGCG AAATCCAAAT rrre TrITTGGGCTTCTCTOCG
GTGAAAGCACAGGTTCAT
17 GTCACGAA I ATITTCCTGAGAAGGAA i GACAA I GCTC ACIAGCTAAGC:AGITAGGCCTIT
I GACTAAACATCC AGATGCCAAAATGACGAT ATGACAATGCTCTTIATCG
MTh TCGCCCACCCTTra:TGGA AGAATTA 17AAGCAAGGAT
TGACGGATGTTGITAAAGGITCTOCACANIGACATC AGGCTITTIGGGAGITAC
TIGCAAC.CGATAGATTCIG
TCTGA TTCTTGGC C A
17 MCI CCCGCAGT:CATA ICACCCGGAAGIGGCTACTAC:
TTCTCAGAAGAGGATGTGCATGAGAAA I CGGATCA CITITIATTGCIATAGCIA
CCGGAAGTGGCTACTACC
17 GTGTCTCGAAAAGCTGCTTCTAAATGTITaITCC1TTC
TCGCAGAATTGATGATCATATGCC:TGCGTTATCGAG ACAATTGTTAAGCGCAAT AATGTITGITCCETTCGATC
32 GATCG . CGTATG GG G
ATATGCTGAGGATITAGAACAAGCAGTATCCCCTTC CCAATATTGAAAAGGAGC
TTATGCCTTTGTTGCTGC
CAATCTIGG1TTTGTTGAT TC.AAAGAGC.ATC-ATACGAA
TGTCGTTCGGAAAATCTCC
17 ACAAGCTTGCTGTTAAACTCCTCCTTATGGCTC.AGCTT
CTCCTTCTITGACAAAGTCAGAGATCTGICATATTTC
36 CA TGAATTGTCT AAGACTAATCTTCC.AGCGT
CCTTATGGCTCAGCTTCA
17 GCGCATTGAAACTATGC.AGATCAATATTIGTTTTCCTC GGAGCATATITAGAAACAGTTCCMCATATCCTTA
ACGCTCGATAATTTGACG
ATITGTTITCCTCCGGGA
TTAGCACGTATTCCCCAGCTCCTAAAC CAGCAGCAGAATATCTAG TTGGAGATTGAAAGACAC
AGAGATAGGGAAAAGAG GTGTCAGICTITTTTCTGG
GATCATTGACTGTGTTAGC GTGCTrTGG1TAAAATTGC
41 TrGCTAC AA11TTCC , Tr TAC
GATCCCAGCCATCAATGTGATAGCTTTACTGTATCG
42 CATA i GTGTC.AAA ATTCCCGTAGTTAGTCCGG
ACCACAGGATACGGCATA
TATMTATCGCGAATAGA AGCTTAGTCTATTTGTAAA
TATGCCGC.ATATTTTAACGCeirACAACAGAATCCCT GCTTTATTGGGACTGGGT
TAGTTTGGTTGGTGAGTGA
17 GCCCTGATAGATGGAGCTATGGTG/WFTGGCGTTAT TCCACAGrrACAGAAATCGATGACiTAGACCGATGA
GTACTACGAGAGAAGTTC
TGAAATGGCGTTATGTTCC
AACCATCAAAAGCTICTAA GATAAGACTATCATGATTA
TTCGTCTACAATTGAGAG GCCCATTAAAGATAATCAT
17 CCAGC.AGCTACATTGCTACTAMGAACTCTGGATCAG GAAAGGGGGAGCTATTTATGCCAAGAATTGTACAG
AACTCTGGATCAGTTTCCT
48 TTFCCFT GGCCAC CAAGC.AGCAGGATCCAAT T
GCGAGTAGCTTCTFIGGT
ATACACAITTGCAGAGGAGAGCCCAATCTCTCCAAC AACAAGCTITATGIGGAT
50 CGT 1 CAGAC CV( ATITGTFCGGAGATGCGT
GCATCGACACCATCACCATCFCCGGACTGATTAATC GATGGAACGACTGTFTCTT
52 AAG 1 GIGAC , TAAAT GCT
TCCGCiACTATCGAAG
52 TTAAACCC ' GTCFCT A CCC
TTGCTCTGAAACACTTCTGTCGITTC.ATTITTITCTCA AATGCAAGAAGATATAATC
CACC-AAACGGTCTTATGA AGATGGAAACTGATGGGG
GGAAGGATCCTAAGAAAA AGGACCCATATACAAGAG
55 . AACiAGAGT ATTCAAATTCiGA CTGG AGT
56 GAGTTG ATGlIGCAC G G
AAGGACCAGAAGTGGAGGAAAGTACACTGAMGA ACTAGAGGAGTACAAATT CAAATGAAAACAT GGATAC
57 'FACTATGG CCTGC GCTT 'I/FIG(3 rrTGICATAAGGGAACTFGAGTCAAGA
SS GGA AGAAGGTICTG TFAGCGGGCAATTCCTC:F
AACAGAC.ACTATCAAGAGTIGGAGAACACATGCAC TCAAGAMGAGTCAGIC
59 CiAIGG 1 ATTCAGACTC GC CAAGTGCTIGICA
TGAIGG
17 CACTAGAAICACiCiAl AACAGGAGCAGAAAAGGGAA = CACA1GIGTMGCAGGGA1 17 AAACCGIT I CT( GAAC I AATGCT1IGGAGTAAAAGGG TGAGA !GAT
TTGGGAICCOAACGCGATATCITGCrr CAGTA.TCGICIAATGGAG GGAGTAAAAGGG TITTCAT
61 . ITTICATTCA TATTGAGAAGT CA TCA
.
TCTACGGAAGGAGTACCTGAGTTATCGT CAGCATCC ATCATTGGGATCrTGCACT TAITGTGGATICTT
GATOG
62 ATCGTC ACAG Tr TC
17 'FIG I GTAGGCTGAAGCFACiGTAGAGGCTGAAAAAA TGGCITCGGAGGAAATITG I ATG
TCAAGACAGAGG GIT1TICICAll ACTCTTGA AGAGGCF GAAAAAATAAA
17 TCCACATCCITGGTAAAGGTAGTCFTCMGTCIGGT TGAAGGAGTFAAC r GGAGTCCACMAGTTIGATAT CAAAAACTGCTGAT AAGT
64 GGATCCT CACiTCCAATF 7TCGT
TCTTTGTCTGCTGGATCCT
17 AAGCTGTTGTTGCAATTTITTGCCTICGTCiAATGCCTT
AAATCCAGCATTGTACACACAAGAGCAAGATTITC.T TTCACAATC.AC.ATTGC.ATG
C1TGGTGAATGCC1. TGGA
17 CC-Mr ATCrAGTCCTCCAACAACITGTGACAACIACr A I CACCTTACACAGGGAAT 1 CTGCiGITI CAGCTGGA AAGAAGTCi TACAATAAAA TG IGACAACT ACT TICAAG
17 CCTGACAGCAACTCCC1CATCATACAT1GGAGAAGC r ACIGGACTATGAAGCTAGAIGCACCi TAAAAGGGCT CATACA TrGGAGAAGCTGA
68 AATCCCAG AlT6CAAC CCTTGGCTGCAAAGGAAG AG
17 AGCT MAGA i CI-MCA illiCAATICiAATGACAGAAT
AGGAGAATTGGGAATAATAAGAGCCGTICKCATAA CCTAAGAT CTi CIACAAAT GAM
GACAGAATTICTCAT
17 CAGATCAATTITIGIGTCAAGAGGGTA1TTTCAAGrr CAATCAGATCC11TTTAC.TGGGGAGCC.AATGAAAT A GGGTAATTGACAATAACA
TATTTTCAAGTTCAACGAA
70 CAACGAAACTG . GCAGCAG TIMM ACTG
AACATGGATAGAGCAGTT ATACAACAAGCTCAAAAG
17 TAGCAACCTCCATGGCCTCCGTAGC.ACTACGGCAAAG TCATCCTAGCTCCAGTGCTGGICTTCTGGTAGGCCT
GAAAACAGAATGGTGCTG TAGCACTACGGC.AAAGGC
GCCATAGAAAAACACGTA
17 TCAATGGGCTGGACJAGGTCGGAAACAGGC.TACTAT
CCTGC.ATTGTGACCAAGACTTGTC.TATTCTrTGAGG TAGGTTTTGTACAGAGAT GGAAAC.AGGCTACTATACC
1TTGACAAAGACACAGACTCAGATMCCCAATTCT AAGAAATAGCGTTGCT. GT GCAAAAATGGCAGATAAA
RAGTATITTAGTCCTAAAG
AAATGTTGGGGATGCACA
CACTGCAACIACAAGTTAATGACATITGTTCCARTGTT AATCTITITTI CAAAGGCT
GGCAATTAGATCTAAGTGA
78 CiTGACC CAATTTGATCTT TTGCi CC
17 GTGCZTCTTGAGCAAATCTGATAITATCATGriTGCLT AGTGAAACTGGAGAATGGGAAGTGTTGGTGGAGC
AMAGACTGAGGGACATT TATCATGCTGCCTATITTAT
79 ATTTTATGGAT TGGTAAC , TGG GGAT
17 1TTGGATC.CCCCIGGAGGCGTCTAAAGGAAGGTACG 1 GACGACCGAAAGCGTCACCGTGTGATCCCCIATCTC GTACCCACTCAACGGAAG TCTAAAGGAAGGTACGGG
80 GGCG i GATCC C CG
17 C.CTCCAGAGGCCGTAGAGGGOCC.ACTIGTGCCAAC
OGGGAGCAGGAGAAGGA
AT IC AATACTCTTAAGTGTTATC
83 GTA GTCAAT C.ACTGGATATGGGC.CATT
GGTGAAGGGACGAGTGTA
CTCCIOATAGACAATTITT ACAAAAACTTGGTAGGCCT
TCCAGAAGITATTCCTACA GATAATACCCCAACAGTAA
17 TCCCTGACACCTTAGGCACTAAGTTGGACACCCTTAT ATCAGITTAGAGCTTTTCGACTG/kACAGACATGTCA
AGGTATTGGACACTGAAA CAATACTGGAACAAAGGA
87 CAAAGGATGA CATCC ATM. TGA
ATGAAMTAAGGCCITAT ITGGATATAGTAGACTCCA
s CC.AGTGACCAGICTAGAA AAATTGCCAGITCIATCTA
89 CIATCTATITCC 1 TCA TrACICC , GI *FTICC
GTTAACAGCMCAGCCAGGGCATTGTGACCTATGG APACTITATAAAAACATCC
90 ACATCCACTCCC ' GCG CCGCTTCACTGATCTCTF
ACTCCC
AACAGACGGAATTTGTCC AAAATCATGTGATGGTGG
ATCAGTTATATATGATTGA
17 TGCAATCTCATAACATAACGTGGATTCCTACTTGGAG GAGTGCTGCTACTATACGTAGTGTAACCATCAC.ACA
TAAGACATTGCAAGAAAT
93 . I CAGGAA AGIT:TWA GAAGG 'FCCIACTTGGACi TCAGGAA
17 AATCCAACCTITAAARTCRCACGAGGGITTTGTTAIG TGGIAACATGATAGATGGIMGCITGCC.ATCAAAT
ATATGACAGITTGOTTTAT OTITTAGGTTGCAAGGCTA
95 AACiGT3 AT CACCICTA ACTGGA T
ACGTAGAAGAATTIGTCITICAGGCGATGAATGTAA GTTAAAAACATFCCICGIT
GCACAACKATTICAGAGTG
TGACAAATATTACCCIGTIGTGGIGGAACICTCCAT TTGTTACAACTTCALITAC
ATTCTGAACCACCTAAAGT
97 I AAAGTTG GCITCIA I ACC 1.6 TTAACTGTCACAAGIGCIGGACAAGACTICAGAAGA AAA TIGTATIGTGCCT TTA TCCTGAAGATCIA
TGACTTI
TCi I ACTTCCACAAACTGCCAATCAC AT AAAAA If TGCAGGA ICI
99 . AGGATCTTIGG AT.AAGGTTA TTIGGCAGTGATGCAGAA TTGG
.
AAACTGIGCTTAAIGITGITGGACCGCTCTAACAAT f3ACAGCAG ITTGITAAAG CCGA
TATGGTTAAGICTAA
18 'faGTTICGT AAACAATGGAACCICGGGCACTAAGA ATC:AATG ACGTIGCATT TG I
GGGC.ACTAAGAAATTGGC
18 AACAGGCACAAACC I ATT AGTGAAIVAAGCCITGIT IT AGGAAATGIGITITG IGATTIGACi ATTGCTTGCCTATTACAAC
GCAGGITGTIOTTAATGGT
18 Tar ACCAAAA TGCATAACAGCG TATG I AAATGIGGT
rrGAGICGTGAAGATUTGAGATTGATGAATIAM GCAATATGTGA TIT TGAA ATGIAAATGTCiGTG
TAAA
TAGGACACACCGTCTGCIT I AGAGCIA GAAAATTGAG TAT AAACC AATATIAT ICI
IGACGGAGG
OS CGGAGGTAAG TCAAAACCCAAC TGAC.TTG TAAG
GGTGACAGMAGAGAGTGGIGGTITTITAACACC TIAAGCCATGAGAAAGCT TAAFAGACCITCATIGGIT
06 TTGGITGATG ACTTAAC.1 TCC GATG
"fGCCGiTATCACIATTCAG
18 GCGGCCTC.A.ACAGTAATiNAAAGTATACTGGCTITTGT
CTAAGGAATTGAAACGGCCCATTCITAACATCAGTA AlTACGATGITATGGCTAA ACTGGCTITTGTTCAAAAC
08 TC.AAAACATC . ACCGTATGA 7667 ATC
18 ACTGATT AAAAGCATCCACAGAC.CAGCATGTGCAGG
AACTGGTTTGAAACTGAACCTTACTGGIGTACITTA A GIGTTCTGATTCAAAATIC
GCATGTGCAGGGTAATGTT
GCTCTAATAGATC.AGGATT
ACTCGTICTATGTCGTATTGCAGTAAAGCATATACC CGTGCGGTATAATC.I. 'MCI
ATGCTAAAGGIITTATCOG
AAGCTTAAACCITGCTITTGGTGACACC.ACACAATCA TATCAGCTATTTAAAGGIT
GCCTGTGGATITTTTGGC
TAGGTATTACAGCGGTAA
TGCATATAGGGAGGCTGC
GIGGITAAACCTGGTGAG ITTAGCTGCTIATAACGGC
14 'F AACGGCA TTGATGCATAT A A
AGATGCTTTAGCTFCTATGACIGGTACCATTOTAAG TIGTAAITGla i F i G i ACM GI-cirri:1-c TGTGIACAAGCATACAMAGAGGCGTGTACTCAAC AIGTITAGTAITACGMT "FTGTG TIATAAGITTGGCC
18 CCATGTGTAAGTACCAAAAAGGGAAAAGGGTTCTAA GCAAAGGTTATTGCTAAGTGGGTTAGCAC.AAAAGC
ATTTC:FGTMCKTTFGTG AAGGGTTCTAACTTAGAG
17 CTTAGAGGAAG ACTATCTTAAT , GTAC GAAG
TGACTGATGTCAAATGTGCTAATGTACTTAGAATTA ATATATGAATGCTAATGG
CCCTCCTAAGAATAGTTIT
18 TGAAG i GAAGCAACATGC ATTGCG GAAG
AAGTGCCTGACTAGTATT CGATGATTACGCAAACCiA
19 AAAGGAC I. I G I I I IAACT GAAG C
GTTGTGFACCATTGAATGC
TGGCAGCAAATACTCTGAA
18 CAGC:TTGCAATCTAACTGTAGAAGATITTGTAMAGG GAACTGCTACTGAATAT6C1TCCACTTAGGATCTAC
71ACTGI1CAAGATGC1AA TITTGTAAAAGGITCiTAAC
GGATICATATGGIGGTGC TCTGTTTGTATATATTGCCG
AAGGGCATTTGATATTTAC
TITTGGTGACTATGTTATTGC.AGC.CCAGCATAGGCA TAATGTGTATAAAMGCT
CCCTAGTTAGCGCTACIG
ATCCAGCTTTGC.ATGTAGCTICACACTAAAACAGCA GCTACCATTATAAAGAACT
ATTGTGATGAATATGGATG
18 AATGGATAGCCAGCACICTTATC.ATAGTATTTTGAGA 1 TTGGAAAAGCCAGGCTCTATTATGATAGTATAAGCG GGAC.ATTAAGCAGITGTE
ACGCCGCCCAACCCATAAGTA ACTA TGACTGGCAGAATG AAATGTITGAAAAGTA TAG
27 AGI ATAGCA.GC 1 CAG6 , TT CAGC
GTTGATTCAACCTTTGTCACAGAATCGATCATCACTC TICAGMTTAACATATGT GCCAATGTATGTGCCTTAA
28 GCCTFAAT ' AAAA TCATCA CAAGC T
AGATGGATGGTGACGATGICTACAAAACATCCAGC CGTTTTTATGTCAGAATCC
GITGGGTTGAACATGACAT
TACAATGACCTGGGTAAT ATCTTGGATAGCTACAGTG
31 . AGTIGTAMIGG TCTC;CITAAA TCTIGTACAGGAICTCCGI AAATG6 18 ACGTCTC.AAGCACACTATAAACACTAGCTAATTTAAG
ACACATIGGTATGAAACGTTACTGCGCAAGATGTGA TGTAGGAGATGTMTG TT AGCTAATTTAAGTGCTCCT
32 MCI CCTACG CiTCCCIG TFAACC ACG
18 ACACAT Arrr ACGAGIGGIGICKITTATTGTTCCGGC ACCATAA A
TGCATTACCTGAGAIGGTGG TAAGCA TA ITGAAFATAAATGATIGCA
33 CAAGGF CTAAC:FTCATCT CTCG 'FAT MT
TCC:GCi CCAAGGT
AAAGTFCTAGTGCTGTAA TAATAAGTTITTGAAGGCT
ATCACCCAGCTCATGCTCCTAAATCGCCAGTTGCCTT AGCATTACAGTTTACTACA CCTTAGATAAAGTGCCACA
35 CiCC:ACAGG 1 A TTGA GG
18 CACAACAAAATIAATICCIGTGGAAGCGIGATAGCAT .
GAAGCCACIGGTTISTITGCTTC:ACCAGGAGGAGC:f 36 TGGGAC nc CCIGGGTTGGCITTGATG
GCGTGATAGCATTGGGAC
18 ICAC:GCCCTACITITGCAAAGTAGITACATGGGCAGC TA AACGTGCCACAG I TTACAA
37 . CM ACATGTAACACT C
AGTTACATGGGCAGCCAA .
TAAGGACICIFTI ACI TAT CCACATI ACITTGIGITATGAT ATTG
38 ATATTGGCAA ACAGTTCCAAA CITAAAGC.16CCATGC.TCT GC-AA
18 ATCCATA FACACACAAGGCOTATCTGCiCAGCCITTGA GCATOGAIGC I AAGCAGGI
TGACCTAAAT TGCATCT AATAAACAFGCATICCACA
GGCAGCCTITGAGCATTF
IGTGGCCITAAGCTCTC.iGG TACAAAGCTIGGAGAAFG TC:AAGACTGGTCAT TA I AC
40 TTATACAGG TGCiTGT 7I6 AGG
GGTGICTGC.ATGTATACA AATTGAATGFCCTTMGAT
18 CCACCAATAATCTICIGGITGAAGTTATICAGAAGTA TGCATITG ?TAM AGGCI
TGIACCGMACAAACICC A TACCITi AGATCAAG AT TAITCAGAAGTATGGITIG
TICTCCAAIGGAAAGAGITAGTCTCITCATCATACA AGATTITCAGYFTA FGCIT GI
AACGATGAGAP.AGTIAT
18 TCTGCATTC:CAAGAAAACTCTGITAACATAC:ATTTGTC RAT GGGGTAT
TITGCNCTITOGACGCCANI TAAAAA GAGTACAATGTGAGTA AA ACA TACATTIGTC:ATATGA
44 ATATGATTCGAG TCCTTCACTAC; GATGG TTCCiAG
CTGGACATACAGCCIGAAGACICCATCACCCCAATG ATMCCACITACCCA TIT T
AGGITTACAATITCAAATE
45 nrscis CATAT GAG CTCG
18 TGATGAGGGCiTTGATGGTGATITGATGCAGCACTGT CCGGCTAGACTTGAAATAGTTAAGCCCAGTGACTAA
46 CC.ATT . CTATGGTTTC.A GATTGGCCATTGCACCAT
GATGCAGCACTGTCCATT
18 MGGCGTAMATTAACACCC.ICATAGGAGTITTCACT
TC.ATGCCTTTAAATGCAAC.CGTAACAAGCCTTTATIC TATCATCTAAAGCTGGCA
AGGAGITTICAC.TTTACCG
18 ACAGATTGCMCGTAGGAACAGTTCCC.TCTGGTAAT GTAAGGAMTACGCCTGTACAGGGITATCAGACTG
C.ATTACCACTGGITTTGAT C.CCTCTGGTAATTAITTAG
TACCAAGGTTATTAAAGA GTGAGTTCCCTGCTATAAC
18 GCTTACTATTACAACITC.AGAGGCATGTITAATGTFTA
ATTGGGITAC.ACCTCTCACTTCTAGACTCATACAATC 7TATTICATATITTACAGAC
GTITAATCITTAITTAGGC.
CGTAAACCTAATCTTCCCA
SI TCCCAATT GCAGCATCAAT CTISTICAGCCAATCGCAG Alt 18 TCGAGGCTTAAAAACAGAATCTTCf CTGCTAATGITT
AG
52 CTGTTAGCA GCiACCITTGA CA CA
18 CAAGACCOGAACAGTGCfC:AACCIATAAGTGCCCCCA AAGIGATIATTGTOGAGGCAATTCTICTGCAGACCA
TCCTGATCCTAITACATIT
ACTTATAAGTGCCCOC.AAA
18 AGAGITAGCGTGAAAGGCCGC:AAACAGAACTITTAT OCGAACCAGCATTGCTATTTCGC.AGCIGTCGTGTAA
CTITACGGTTCTAGAGACT CAAACAGAACTITTAIGAT
54 CAI TUG 1 GACf ACATAA TCGT
18 GACAAATGCAGC.ACAATCAATAGTACZTTCAGAGTTT TGTGGTGATTATGCAGCATGTAAGCATTAATGTTAT
55 ACTATAGGTAA CACAGAAACTAC , GA GGTAA
TAAAGTAAAGTTATCTGAT GTTGAGGCTTATAATAATT
18 GCGGTAAGACCACCATTAATAAGTCC.AAGAAAITCT TGCITATGITTCTCAACAGCTTAGTCCTICTCCATAG
CAAC.AACTCTCTAATAGAT CAAGAAATTCTATCTAGAC
CACTTTAGTTATGTOCCTA
58 AGGGTT ACTCATA CTA.AG
TATGTCACAGCGAGGGTT
59 GACCIACA AAGCATAGCTAC.AC GCACCAGATTTGTCACTTG CA
GTTAAAGTTCTTAAGGCC AGACCTGAGAAGAAATAT
60 AAATATCTCC GAGTGAATGT ACG CfCC
61 AGTITTf GGA ATACCfTGAATGT TTGT GGA
CCAAAAGGGTICTGGCAT
GAACAGGACCGCATGCTA
18 CAGCCATCiTCAGGTGTTAC.ACCGCGTAGTAGAGCCA
64 An TCiCIGAGG GAGCCICTAGTGCAGGAT
CGCGTAGTAGAGCCAATT
18 AACCIAATIGCGCCGTTATAGCrATCTGGGAATCCTG 'IGACAGTACACMCAGGTTTTGAGIGATAGGCATT
65 AC:GA CAAAT ICITAI , GI TGGCCAAAG 1GCAGAA sf Ara GGGAMCCIGACGA
18 GGTTACCACCAGATGCCGAC.ATACTGAAGAC.ACCTCA
GCAGAAAAGTCGAGATAAGGC.ACCTATAGTCTGCT TICTTAAGAAGATGGATG ATACTGAAGACACCTCAGA
18 ACT7CTGCTTGGGCATTAGCAAACTGACATGACTAT GGAAGGACCTCTTTGCATCt=GATTCGCTTTCAACAT
AACTGACATGACTATTGAG
CTACAGAGATTCGCTTGGAGAAGCTICTGTITTGG A GAAATCICACC.ATTGCOT
TTTTCCAGGACATACTATT
TTCCAACACTGTGTCAAGC
69 . 1 GMT GICCAG I
CTGCTITOTTGGGAT Gra:
GAAAGTGACAGGGCCCCTTITTCAATA TGATGGITT GCTACCTAACTGAAATGAC
18 ICCATTCAAGTCCTCCGATGAGCTICCAGGACATACT AACACAGTTCGAGICTCTGAAACfCCCATICTCATCA
GAAAICICACCATIACCIT
71 GAGE crc3a. CC
CTICCAGGACATACTGACG
GAGGCGACACTCCACCAT GATCACTCCCCTGTGAGGA
72 AGGAA rraxi G A
TGCAACATGAGCACACTT AAAGAAAGACCATCCGTCG
GTCCATCCTGGGGCCCAA GGAGGICC.CGCAATTIGG
18 IGGGGTC:CAGCACGIAGATGTAC:ATTCTGCACACACC
ACACCTACAGTGGCAGTCACiGICGCCiCCTACTAAT A GIGTA.TGAGGCCXAT GAT
75 . CGG GGIVA CT
C.ATTCTGCACACACCCGG .
18 CCACCACCATACCCACAGaiGGACATCGAATGGCTT GCAIGICCTCiC:GITTACCCCAGCCAAGAIGCCCCAA
TGIACMAGGCTATCTITC GGACATCGAAIGGCTTGG
18 GC:TS:YWCA f CAGAAGGACCGGAGCTGCAAGCCCA1 TACMC TGGCACT
ACGCACCGGCCGCAGACACTT GA ACTGGATGTCCTCAAAGG
AGCTGCAAGCCCATCACT
18 GCCCiCTGGGACITCAGCAGGGCGIGCCAACCIACAC CFCCCAGTCiCiTCGGTOCi ITT
GCGCAC:GTC 1TGAGA A GGCACTACTGATCiCCAGG
GCGTGCCAACCTACACC
18 CGATCCTC.GATATCGCAGCGCTGAACATCGATITACC TGACCGCACTGAGCAACATCC.AGCAAGGCAGTATA
AAGGTGAGGATGITTGTG TGAACATCGATTTACCGCC
18 CAGCLIGACC.AGGTi CTCCAACCCMCiGCTGA TGCT CGCTGC1GCTGGGACACAIGT
TACCCCGCACGTACC
BO GAT AC GMTOCTCACTTCTGGCGG
CCCITTGGCTGATGC.TGAT
18 G TGGCCACTCiCTAGGICTITGAAOCIATACGACCACC AGGICATCACCIGGGG
fGCGCIAATCGTGCAGAGA TACTGAGCATTGGCAGAT ACCTA f ACGACCACCTAGC
CCCAACGAGGICC IIGTGACTACCITGACT G GAGACAAGAACGTCiGTGA
18 GAAGGGCACAT AACGGCACCTCGCTCAGTCCTAGGC TGlIGCGGGGA
TCTITACiGGCTTGCTAAGGGIT 7 CC
AGTCCTAGGCT.T. CTC
TACTAAATCCATCGGIGG
84 CTC . CCGTAG CG
CCACATTAGGCTTCGGCTC
18 TCTAC.CfCGACCCGTTCGOCACCTITICCATTGAGACC ACGTACCGATATGTCGCCTCCGACAGAGGACGACC
AGATGCCCACTITCTGTCA AGACTAAGC.AGCAGGGAC
86 ACT GCfT C T
18 GTCTGGAACGAGTGCTGGCTTTCGGCTGCGTTGTGA ATGGAGGAGTGCTCACAAGCTGC.CTTGAACTGGTG
TCGGCTGCGTTGTGATTG
18 TTTGGTTGGTCGTCAGGGGACTITGCCTGOCAACCCT TACTCGGGGGGTGGGTTGCCAAGCCGC.TTACCACG
TTGCCTGGC.AACCCTGC
GAGGACATGGTCAACCTG
GCCCGCCATACTATCTCCG
90 CiGC GCMG GACTGGGTTIGCTCGGTG
GMAACAGCTACCITTGAG
91 AU GG Ci GGGTGGGGGACITCCATT
18 GATGAGGCCtATGLGettuGCATCCGCCAGCCAACT CAGACGCTGAGCTAGIGGACGTGFAATGITGCTGC
CrCCATCAGAGGCAAGCT
ATCCOCCAGCCAACTATCA
18 GITC:TTTTMCCGAGGGGGATGTACCAOCAACTGTC TCAGCTGGACGGCTCTAATOGTCCTGTGGTTTCGA
TGTACC.ACCAACTGTCCAT
93 CATG GGAGG , CTGTTGGACCGCTGGAAA G
ACCGTTAGTGACAGCGAG
18 CC.CTGCTGGACAAGGAGCGAATCACTCTGCCCGATC 1 1TC.CAACC.ACCATCATGGCGAATGCGGCCCCOTTA CTCACCATCGAGGAAGCG
95 GAA ' GC T
TCACTCTGCCCGATCGM
18 GCCrCCGGTTCAAGGTTACAGCCACTGAACAGGACA CTCCC.TCACGGAGCGGCTITACGATAACCAC.ACTGG
GGGITCTCATATGAC.ACCC CACTGAAC.AGGAC.ATCAG
18 TCAGCCACCACGACCAAATCATCAAAGGCCGCAAAC TAGAGCAGC.CCTGAGAGCCITTGGCCTGTGGAGCA
TGTTACATCAAGGCCACA
CAAAGGCCGCAAACCTCC
CGAAAGC.AGGTCAATTAT AAAGAACTAAGAAATCTAA
18 GTC.ATTGGTCCATrCCTATTCCACGGATCAGACCGAG AGTCGAAAGGCTAAAGCATGGAMCGTATrTTGAC
TTATGGAGTAAAATGAAT
99 TGATG TTGGTrTCT GATGCC
GGATCAGACCGAGTGATG
19 TrGGTTATCGTTAGTTGCGATTCCAAGTTGTTTTCCCT AGAGAAGAAAGAAGAACTCCAGGATCTCTCCAACA
00 AACGA.AG TGTATGCAAC GG AG
CCAGGAGGGGAAGTGAA AATGATGATGTTGATCAAA
19 CCTCATGC.ACTCTTATCTICAATGIACAAGCGGATCAT 1 TGAAGAGTTCACAATGGTTGGGATC.AATC.FCCTGGT TTGGIGGATTC.ACAITTAA
02 CAGTC 1 rCiCIT I GAGA
ACAAGCGGATCATCAGTC
i AAAGTGOTTITCAAAATTGGGGAGCAA TATCCCAA TIGTATGATAAAAGCAGI
GATCTGAATTTCGTCAATA
03 Ci TCAATAGGG 1 TCATTCCC , TAGAGG GGG
19 TTCACTGACCTCCTCGGGAGATTGACCGGTrCTTGAG 1 ACACAGGGAACAGAGAAACTGATIGACCAACACTG
GAGAGGGTAGTGGTGAG
04 AG ' ATTCAGG C
19 CCTCATTTGTTGGAACAGAGTTCTrAGTACCTAAGGC GATGTGCTTGGGACATTTGATACCACTTTGCTTTGG
TGGAATTTGAACCATTTCA TAGTACCTAAGGCCATTAG
OS CATTAGAGG TGGAGC CT AGG
06 ACCC CGCAAG Cr C
19 GCCAACCTCTCAGGACAGCCTATGAAAGCCITAACAC CTGCCGACGCCTTAC.CGATTGCTTCCGTTGGCATAA
TCAATAGCACGGCCTTGA ATGAAAGCCTTAACACCGG
07 . CGGC CTGA AC C
09 'FGAGCTCGGACCIGTCCCTGGCCIGCAACTGGACGC GAAC GCACAGGCTGGAAGCG
GCCTGCAACTGGACGC
TACTCAATGCAGCATCCCT
TCCCTG ACATCC AAC1CGGAG6C6GC11' G
1GTGT3GTACACCC6.ACTC
11 ACTCT CIA CGCGATGCCG ICATMAC: :1 IATCTCCTGGCCCCTAC
12 GT G GAGACTGGGC.GCACAAC
CTGC:GAGATCTGGCCGT
AAAGGTCCGAGGAGC CTGCTACCCAAACCTICCr CAACGTGCA ICAATC1C1GG
13 , GOT C13 G T
.
19 ACACCGGGGATCTCATC3GlIGTGGIGTGCACCCGIG TCACCiCIACAACICCICTCCA6CGAGCA1 GCAGGTGG
GGTGTGC.AOCCGTGGA
19 'IAAGCACCTCCTGAGCACCCGGCAGCCCCATCACGTA TlIGTC1ACGAGTGCCAC
TCCACCGTCAAGGACAG I GGACCGGGGi IGAGAACA
1S Cr GCCGA A
GCAGCCCCATCACGTACT
19 AGTAGGCCACGGCAT(GATGCTCAAAGAAGAAG ICC ACCGCGGICTIGACGTGICTGAGCA
TCGGTCGACAC TCAAGGGGGGAAGACATC TC:AAAGAAGAAGMCGAC
19 ACAGAGGACGGACGAGTCGAATCTAC.AGATTTGTGG GAGTGCTATGACGCGGGCTGAGCCTAACTGTAGTC
TCTACAGATTTGTGGCACC
ACCIGGAGGICGTCACCAGMCiACAGGCAMACG TGTGGAAGT GT11 GATCC
18 AT aici GC
GGCCAACACCCC.TGCTAT
19 GMAT AACCTCTGCTIGGCGGAAGGG A TGATGCTC:
19 arTGCAAGAATGFCCACGAGGTGGGCGCTGOCTTA AGATCA
TGAGCGGTGAGGICCCAGAGGP.IGGCGG
TGC1GCGCTGGCTTAGC.
19 TATGGCAGTGACGCGGGCGGGAACCATGOICCCCC CTCAGCAGCCrCACTGIAACCCACACrCCGAGCTIA
19 CrCCAC.ATGTTCCTGCAGGTCCTGGACATGTCAAAAA GGGACGTTCCCCATTAACGCCTAGTTCGGCGCAGG
TGGACATGTCAAAAACGG
22 CGGGA . AAGG CTGCCACTGTGGAGCTGA GA
TGCTGCGGGAGGAGGT
19 CGAATCTCCGAGACTTCCGCAGTCGATCCGCTTGIGG CCGGACTACAACCCCCCGCTCCAC.AGGTGGTTCGTA
AGTGGTGATTC.IGGACTC
24 CA GTC Cr TCGATCCGCTTGIGGC.A
A TCTTCCATGCCCCCCCTGGCCATGACCCGTCGCTGA A
GGCATTACGGGCGACAA
19 GCTGCAAGCTTCCTCTACGGATATrACCAGGACGTGC CTGACGCCCCCACATTCAGCCATGGCAACGGACGTC.
TGAC.AGACTGCAAGTTCT AlTACCAGGACGTCiCITAA
CCTGAGAAGGGGGGTCGT
TTATIGITGGGGGCCCTCTT
AACMCATGCTOCTCCAAC
GCGCTGGAAAGAGGGTCT
19 1677.7CCAAGC1CGCAAGGGIGAAA1CAATAGGGTG GICCGCSICIAGGCTICIGTCMCCCAGTICAAGAGG
TGAAATCAMAGGGIGGC
30 G C.CG 1 TACT GCCTCAGOGC.ATITTCAC CG
19 TGCACCCATGGAC.ATAGCTTCGAATGGGGTACATATA
ACCCACIAAACATATTGCTITTGICITTGTFTTGTGCAA ATCTAGACAATAATAAAT CGAATGGGGTACATATAA
31 AAACAAG AATATACAGCA , AGGGAGG AACAAG
TACCIACGCCTAAAATATT TATTCATATTATGTAGAAG
19 TGCMCCACCGCATCTGTATGGATATCCCTC=TATGCA GCACAACAAGCGTATAGGGIGTGCACTCTACC.ACCA
GTAAAAGATATCGAATTG GGATATCCCTGTATGCAAT
GAAAGITTGCACCCAGTC
AlTTCAGCCCCGTGTGTG
¨....
19 CATTGGCATGGTAGCACTTGTATGCATAAAFACCCTA ¨
GGGAATGACAGTGCTTATGCTGTTTAGCCACTGTTT GCATAAAGACCCTAATACA
GTGCCTTTAAAACATTCCT AGGAGTACCAAAAAAAAG
C.AGCTCTTCACATTFAAAA TGACAACACAAGGAGAAC
19 .ACCCTCGTATGAGACTTITCCTGTTATGTGCAGAACT ATGTGAATTTCAAGGATGACGCTCCCACATGTACCT
TGTGCTATGTACAATGGA TTATGTGCAGAACTATAGT
AAAGCGAC.AGCGACTCGA
19 CCAAAACAAAC.ACAGCAGCAGTGGATTMACTTGTG 1 GCTGAOGTGCTATCFGGCATTGTAAATATAATGCTA
AGATCTATAGGGTGATTG TGGATTITTACTTGTGTCG
40 TCCiACT 1 TAAGGCCC TACAT TC ACV
TGGCACIAICGCATITTGCAGIGGITCCAATGCCTAG
41 CICCIC 1 GCCC , CATMGCCACACACiCITT
ATAGGACCTGCAAGGCCTC
CCCCAGGIGTGTTGGACATCGCAGGA1TTGTAAAGT AGGACTCC.AGCATFATTA
42 CTAC ' TTGTACT ATFC
AGGGTOCACCATACCTAC
TACTTGCTCCAGCTGCCFCTTCACCATCATCAACATC CCCGGGTACATIACTATCA
GCGCTCTAGTGACAACIT
19 AGGC.ACGTCAGACF7ATTTAATFOTCTA17CAGGATG
ATCCAGATTATTIGGGAATGGCCCTGCGC.AAGTAAA CCCCTGGAGCTTATAACIT CTATTCAGGATGGTGATAT
45 . C;TGATATCIGT AGAACAT C CiGT
AAGAATATTTGAGGCACG AGGAATATGAITTGCAGIT
19 CAGI.GGAACAGCAAFTACACAAATITGTIATGAGCAT
GIAAGGITGlICTAGGTCAGGAGAAATTGGTGGTG GITIGIAATGICIATG TAT TGITAIGAGCATGTATGTG
47 G TATG7CiTG TGCACAC G161731 TG
48 AT TCT TG 176T GITIGTCGC:CTGATICTGA
CTGAGGAGGAGGAGCAAT
CACTCTGGAGGATTTGTFT
49 TTICIFTTG AC.AGTCX1CTG ATGTGIGGOTAGGITGGA G
19 GTCCCAGTTICIATAAGGAAALACAITTCAAAAAACA TATCAGGATTCACAFGAACAGTGGCAAAGTAC:ATIC
CACACACAGCCAAAAAAC TFTCAAAAAACATGGCTAC
50 TGGC.TACAC CATCATAATCCAC TG AC
19 CiGGGIAGAGGGGTA fGCGGITCT AGTTCTACCAGGC:
ACCGCAAATCGC.ACICIGGAGTCiAGACAGTGCCTCT *FCTACi TICIACCAGGCAGC
51 . AGCC CTCC T1TTAC.CCMCGCC7CTGT
C .
19 GTFCCCACF GCTICAGCiAGC7GACGAACCACCGAGA
AGACTICAAGAGGAGGC7CGGGTAAGCFGFTfCCC GAATCGCCAACCAAAACIA
52 CAGA GGACC. T CC.
ACGAACCACCGAGACAGA
ICTAITACACGCCCTITGAGIGGGATA FTAA AT TACAAATII TTGG MGT
ATTTGGGCAACTTGGGTA
19 TCCTGGA FCACiCAGTGA I HAM-RAU-Ft ITCAAT I ACATCATTAGGIGGIGCiGCiAGICA
I ATCiCAGGATT ccr AC TGATAT TAT TGACX:
GAGCTCTTCAATTGTFCCIT
19 TCATCATCAAACICiCGGGATTITCATITTGTAAGGCAA
CATCACITTTGONGTTTCACiCAGTGAAAGTCf ATATC AlTACAAACGAGAAACCC
SS CCGTC TGGTGCAG AG
ATTITGTAAGGCAAC.CGTC
19 CAI CAGG FACTATAGG ItTG 7 CAGIACATIGGCCFCI A i CACCTACITACCCITACACACCGGGTGCAAGAAA GAAAATGAGCAACTTf TT ACA
TIGGCCTCTAA.TAITC
56 AATA1TCCT AAATCATAAT AlTCCC CT
19 CCC ITO ACA (ATKA ICCCi IACTCAATCGTCiGAAAGT
ITATGCFGGATCTGAAAGGCTICTITCCAGGATCHA AATCGICGAAAGFIATATC
19 GGGATGACCTGATGTACTAATACC:FGTTGGTFTGGAA
ACAGAAAATCCTAACACGTAICCTGAGACACGTCCA AAA ItTCI GT ATAAFCCAG GTIGG
ITTGGAAAITGATA
SS ATTGATAGGATF TTCTATTGTC ACAGG GGATT
ACTCCr GAT CACCAG VG GGAGAACA11 AAAGGITTGCGAAAA1GTC
19 CCTGCTGAGCACCATAAATAAAtailt.!ACGCGAGGC
ACAAACCTCAMACAATTIAGGACC.TTGAGTATAAA TGT7t. FFFFFF GGAAGAAA
60 ACTTITA . GAGCCACTAGG GGAA
ITTACGCGAGGCACITTTA
ACTTGC ATACTCGGAACATACTATT
ATGCTGCCTTTTAAGCCA
19 GGTCTACCGTIGICTGTC7C.AACTAAAGCAGCTGITA
AAGAATGAAGTGGGCCATGTGTTGCCATATCAC.CCT GACGTEGGCTAAAATTGA ACTAAAGCAGCTGTFAAGG
TCAGAAAGATGAGACTTGACAATGCTTACTC. ICTCA TAGGAACTAAAGGACAAA
TTCAATAGCAGCAGTIACC
AACTCCCATTGTACTGGCATACATACTGGCAGGAAC AATA1TATTC.CCTAAAGAA
AATACCAAGAGAATCTACT
ACTAACTGAATCTAGATCT
ATGATTGTAGCTTGCAGGA
19 Gt. i i i i i i TCAATATCCCCCTGCATATGGGACGGAGA TTCAGCCAAAAAGGAGGACATGGIGTCTTGAGAAA
CGGAACAATACAAAAGAT
TATGGGACGGAGAAGAGG
TGAGGAATCACCTAAAGCAAGTGAGAATGTATAGT GGAGAGATAAATITTCTT
GCCAACTITTATMCCCA
CCAAGATUGGCAAATGGGAAC.ACTAAATITITAA TCTTAGTTIAATAAAAAGG GAAGITATAAIGGGAGCC
68 AGC.CAATG 1 GC.ACCCATTG ACTCiGG AATG
19 GGTCAGTITTCATAAC:CICTIGGICGGTCAGTACAGT AAITUTTGCCTITt R, CGCCAAAAATTGATAA TAATCCCTCAGAAGATGG GGTCAGTACAGTGGAMG
69 GGATTTGC GGCTCTCCATT , C C
19 GOTITTGAAATCTC.CAAGATCAGGGATERIGGAATGC TGAGGAACTTGAAAGACMAAACCGACGGGCMC
TGAGGATGAAGAAAGAA
ATGGGGAATGCAGTACTG
GAGTTTGAGCGATTGACG GTC.TATCAGCTTACGATCT
71 ACGATCTCTT ' GGGAG T CTT
19 AGCGACAATCTATCTTCACATGATTGTTATGCC.CCTCT TCTTGGACAAGAATGGCATGTGAACATGTAGATaT
GAAGTCAGATGAGGGTG
GTTATGCCCCTCTACTGG
19 CACAATGGACAAGAAAAACCATCCGATAATTGTGATT CTAGCGAAGITAAAGCC.CAATCATCATCATGGTTAA
GTGGTITTGITATGCAGTA GATAATTGTGATTTITATG
73 TTTATGGTTGGG C.AGTATCAGTAC TGG GTTGGG
GTTAAAGAAACTGCTOTA ITATGTTCAATGIAAGTGT
TTTGICTITTAGATCAGGCTTGGCCAACAGGITTCTC AGAAGATGATGTTATTGA TGAGTATTGTCAACCACCT
19 GCAACATAAGACAAGCTACATTCACCGGTCCATTM CAGIGGC.AGGAAGCATGGTAGCTAAAACAAGAGCA
TTAACAGTATGTAAATGG CGGTCCATTITTFTCTTTEG
CTCCTCTGAGTAAGGATTTACCTGAGCCTAC-ACCTA TTAAACAAGAAAGTCGTG
ITGGTACATTAGC.AAAGAC
78 GACTG 1 CAOCCATA rrGe TG
i 19 TGAGCAC:AAATATCATGTCCAACT All AAACCAMG 1 TGITGAGIACAAAGIAACAGTCTGGACATATAAATC TACAAAACCTATTATAAAG
ITAAACCATITGCTAAAGT
79 CFAAAGTIOAC 1 ATCACi ATGC:GAA , GCTCACi *FGAC
ACGAAATTGTAGTGTTCGTGTTAAGCCACCATTAGC AlTTGTAGCTAATATGTTA
CGGITTTATATAGTTGTTA
80 GTTG1TACTGC ' AGTAATATCATAAT CCTGC CTOC
19 CAACATCTGAATCAATGGAACAACAGGGIOTGCAAT CAGTAGCTGCTGGTTTGGAATC.ACTCTITAAATATG
TITTGTTAACATTGCTCAT GGGTGTGCAATTAGAARA
AATGATTITTATGCTAGTG CTITGIGTACTATGITTAAA
19 AC.AAGTIOACAGCCCTGCAITAATGTCTGGTCGGATG TGGTAATGTTAAACCTGGTGAAACTGAAATGCCCCT
ATTCTGCCTTATTGTGTAG
83 . AG TGTGGTC AGIT
'FAATUCTGGTCGGATGAG
19 GCAATAAC.ATTARCAGTCTGGACGTTTATGGICCATA
ATTGTGCTIOGITIGTAC.AAAATGATITGCCATAGC CACACTGGCACTGATMA 1TATGGTCCATATAGAGAT
84 TAGAGATGO CCAAACA C (SCI
AAAGCATGTAATATAGCT TGTATGAACGIGATAAAGC
8$ 't AAAGCTG CIAAAAACiC AAGTC TG 'FG
AATTGTGTTAAAATGCTCT ATCATGCTGGTACTGGTAT
AATTAGGCCCTATITTTAA TACTGTCATTITTGCAGAC
19 GGIGTG TG ICAACA TaAAGTICATCCGITIOTTGM
CAGATCCTGCIATGCACGTTGCACTAAAACAACAAG TIGGICCCa MT TAGACA CCG IT TO n GT
ricrArr GG
SS C.TATTGGT TTCGTAAA A T
19 GIOCiGCAGAAC.ATTACGITTAGTATARGCC:AGACTr! TGOATCAGTGCTAAGAATAGAGCI GGCC I
GT(ATA TGATAAAAG TOCTGGT TA AAGCCAGACTITATT ATGA
89 . AITATGAGGC GTACTAAGAAT TCCAT GGC
.
19 CaGGCCAAAACIAAACTACIAACAACCIGT TGIF ATG
ACATGAATTTTGTTGTTCACATGGTRAAAACTTGA CCITCATOTATAAAGGAI
CaGTTCTTATGGGITGGG
19 GaCTAT CAAAAGAACACIGTCAGTIOCAG A ICCTIC
AAGICTAGCTATAGATGCTIACCOACACGAAAGAC ACTA IGTTAGITAAGATA
TTGaGCA.T GGTAGCAGAA TrATGAA TOG TATG611.31 AATCTTGC.ACGGGTTCAC
19 ACACC.AAIGGAACACTATAAACACTGITiC.ACCTACAC.
ATGITOCTAATTATC.AGC.ACATTGOAGACITTCCCG GTTITAACATCACATTCTG
93 TTGIC.C. TACCAGG TAGC
GTGCACCTACACTIGTC.C.
19 TGITCCAATOTMCGCGAGTAGIGTI AGAGGTIGGO (TT
TCCACTGCAAATAGGITITICAAAGTATAACAAT TACTAAACiATGAAGCCAT
94 rrGe. C.TCTCTCAGCAA TAAACG
TGITAGAGGTTGGGTIOG
19 AAACCIT TAGGATIOCCIATG ITAIGGITATTGC.AGC TGATGCTITTCCIOTAGCCAAGA KT
;TA IGCACATC GI ATCCAATAATITO AAT
9S GIOTCA ATAGAC.AT GAGGTC
19 GCGCTICTIAAAGGAACGTAATCAACCITTMCIATT A 1 C.ACACGGTG
TAATCTAGGIOGGGIAGTIACAAT GAACIGITITTGAAAATCT CCITTITICTATTATTC.AGA
96 ATICAGATAChIC. ATTCITCAGC.ATG TAAGCC TACGC
TCGCCIOTCrGTICGGC1 1 AGAAAAGCAAGAAATGG ACT AGGTT AI CAATGATTA
19 CGC.ATATTAACAGGAAC.AGCTAATGACAGGCTGTAT MTACATTTAGGTOC.AGGGICTGCCAGATGGTAAC
ATGGTAAACCTATTAATTT ACAGGCTGTATGATGAAT
98 GATGAATGT . CACTGTC GCCT GT
19 ACCAATAGGACAGGAAGCAGAAACCTICMTGCTTC ATCGTTCTTGTGAGAGTACTACTUCAGGTAAAC.AA
AATACTTITTGTCCTTGTG
AACCTTCTTTTGCTTCAAGT
20 CGAAGAGGAAGAAGAAGGTGAGTTACTGATTATTCT TCGTAAACGTAGAAGTAMCTGCTAMCTGAC.ATT
TGATAGTTATCTIOGITGC CTCATTATTCTGITTCTTCT
TCTCTCAACAGCTCCTTCTCC.ATAGC AATTIGGTGCAATTAGTTC ITACAAGAAATTITATCTC
TACAGGGTGTCAAACTTGGCATACAGTAACATAT AC CCAGAGACCAATAATCTT
ATTGATATCiAAAGGCAAG
AAACTATTAGGATGICTTA
03 ACT TrAGGTTG TACTCC
CGGTCATTATGCTGGAAGT
20 GCMTGGCGAGG I I I Ra I I MAGTAAAGATTCTAAA
TCAACAGTGTMGGTAAAAGAGGACCAAGCTTTAA CGCTAATCTTGITITAGCC GTAAAGATTCTAAACCTCA
04 OCTCAGCA CATTIC:AGCM AAG GCA
20 AGAGTCAAACTGTICIGGTAATTGICAGAACACTGAI a GGTACTCAGCACAITTCAAAT
GGGATCATCAAGA CITGAAGAGAATITAAAT CAGAACACTGATTCTGATI
TGCAGCAATATGCAOCCACTTOGCCITG1TC.ATTGAT AGGTACTGA ITCGAAATG
AGATTTIGTGCGACAATGC
20 OCTCTGTTGGCCATTICTIGTGAAAGCAGGGCTAGGA C.GMAGAGGCGAAGAAACAATTGATTIOGGTGGGA
, AGACTACACTCTCGACGA , GAAAGCAGGGCTAGGATT , CC.TAATOCACCICCTIGTTATCAGTTCAATGCTCA AT ATGTCCAAAGAAGTGAAT
08 GAATGC i TTCAAAGC GAGGGCAAGCMCTCAA GC
AAATICAAATTACCIGCTG TGGAAGCAAGTATTGTCA
20 aCCITATGCTTGC.AATGTACTGATAGAGCTCGATGA GAATTATTTCACAGCAGAGGTGTCOCAGTATTAATG
TGCGAGCT/ACTGATTCA GATAGAGCTCGATGAAATT
AATTGG TATACCCCCTTCA AT GO
ATGIGGCTCAAGTCTCOGGICAGNGATOTGGTAAAC AMTGGGAGAMTACTGTOTCOTGCCITGAAATIT
ATATGGATTCATCATAAA CACAGATGTGGTAAACTTT
TCGAGAGCATGATTGAAG
AAACGATCCTATGAACAG
13 ACGC TR.TTGGIC MG
GAAACTGGTGGAGAACGC
20 TCTCCAGAAG1TCCGGTCATTA660ACAATGGTAAT GGCGATAATGGACGAAGAAC.ATTCCaTTGAGGAT
TGCAGCAATAAAGGGAGT GGGACAATGGTAATGGAA
ATGATGGATCAGGTGCGA AGCAGAAATCCTGGGAAT
20 GGGATC.ACTC.FIGITCCTCTAATGACATGCCATTCTOC. 1 AAGAGGACAACTATCCACTAGAGGAGAATCCMTG ATCXAGC.ACATAAAAGTC
TGCCATTCTGCAGCAT
i AAGCITTCAGCATCTATTACTGACACTGTATATTICT GAGGAGGAGCGCTATAA
GATACTTATTGCFTTGATT
17 1TGCTC 1 CCIACiCACAC , MIT GC( C
AAGAGACGGAGAGAGACCGAGGITC.AATGTTGGA TTCTGCCTCTGTGTCTAGC
18 AGA ' CGAGGAGG T
TCTCAAGGTCCGTTCCAGA
CCAACTCCTGAAGAAGTG
GGGCAGGAACTAGATCAG
GCGACTGTATGTGAAAGA CITTACCAAAAGGCACATC
20 TTACCAMAGGACGCCCAGICTGGTGGTICATTAGG ACACCTGTfAAACCAAATACAGCTGITGCAGATGAC
GTATTGGTACAGGMGGG CTGGIGGITCATTAGGATA
21 . ATATAGG TCAGGAG GI 'FAGG
TGCAGCCIATACCIOAAAGGCCIGGGCIOTTGCATC CACAATAACAGACACCAC GGTGATTCAATAGGGGGG
GIGTATGAAGATATTCCTT GGACICAGGATITGGICA
23 'MGT CAG GGTACCTGT TGGA G
CCAATGAAGATMGCTGA
24 CTGACA I/WAG CNICAGTA.GAGGCTGC:AC CA
20 GaGGTGCAC-ATCCTACAMAAACAGAACAGCGGAT I TACAGGTGAATATTGGGATGTAGCAGMACTAATT
ATTAGAGATACAGAAAAT
IS CiGA TT 1 GI ATTGCAGGACAA CCCA A It CAGAACAGCGGATMATT
TGGAAGACGGCiAGC:AATCACCCATAGICCC TIT RiCACia GCT AA ITTC GCAGGAAGACAGATC1 AG
20 AAAAGCTAATIGCCATTGATCAAGTAIGTAAGGTAAA 1 GIACCICCCWICAGACFCCGGIAAATCi AC:AGC:GG ACTAGACATACTGAAGM AMU. AGGTAAACICT TGAA.
27 . CiaTGAACC 1 GTT TATGAAC CC
=
GCAGAGCATGCAGIA1TGTAAAC AACGGTFTTCf MACAU'.
28 AGATITAAGC AAC.ATTCS TTGGACTGTTGATCTOCA AAGC
AAGICiACAAA TGCCCGGCT ITAAAGTGCA 'IGTCATAATACACIAT AAA
20 GAAACAAACITCCAAAIGTGIGCATAAAGGCAATGA A ITCGCiATTRXATT(CA
ItGACCiATAGIGCATICG CGACAMGCT ICAATCCAA AAAGGCAATGAAAGAATA
20 ACCGGTTCTCMOTAATCATAC.A ATGOTGAACAGTA
TC.ATTGAAATIGGAGTAACACGGATATITTGTTGOC TGGTGAACAGTATATGTAA
20 TCGGACTGACGAAAGGAMCCalITCACIATAAGAC AACiAGGCGAAGAGACAAT I
GAAGGT(CGGTGGGA CAGGGCAAGAMCAAAAC CTITTCACTAI AAGACAAG
AGACITTOATGACTGCAAAGATGTITAGAGATaGG TACAGGACMIGAAAATG ACiATCCCAAGGACAAAGA
GAATACATAATGAAGGGAAAGTCATCC: CI GATTC:AAGCTGGATAG TONfGAAAT AGGAGAAGA
TGCATAGGAACA I ATITGAGAAATGATACIG "FGAACTITGTA AG TATGGA
3S GAGITC GGOCCTCG ATGTGG Gilt TICTIKAGTCTCTICAGCA TTGAGAGCATGATTGAGG
36 TTGAGGC . GTTGAA G C
TGACAAAGACATAATGGA
CGATAACCTCGTTTCAGGT
20 CCTAGGCATCAGC.ATGTACCAGGTATATCACCGATAT CAGAAAATAACTGGAGGCCTTATGGACTGAGMAT
GGTATATCAC.CCATATGAG
ACAGAGATTCGCTTGGGGAACTGTITTOGAGGGAG TTACAGAGG A TGTCAAAAA
20 CACTCTItTaTTCTCTGGAAATGTOTAATOCAATC.AA
ACICiCACAACATGACC.AAGAAAATAGCTCTTMTGT GAAGGTTAATAGATITCCT GTAATGGAATCAATGC
ATA
TGGATAAGGAA CAGCTTT C.AAGG AGGAA
41 MGT TfaCACAAC ACCCGGAATGCAAATCAG
AGGATTCGIGCACTTTGT
20 GCCCATCCCACCAG1ATGTGT1AGGAGTCTCAATC11 TCCAATCCICTGATGAMCGCTCCTGCTfGTATTCC
GCATGTTTAATATGCTAAG TTAGGAGTCFCAATCTFAA
42 AAATC116 Cf CMG TACGG ATCT1 G
20 TCATGTCAGCC.GATICATIAATTCCTAGGCAACITCAG
AACAACGACCTIGGACCAGCTGIAGTCTTIAATGAA TTTTCTACCGCTATGGGIT
43 CATGG C.AGCTGA TG
TAGCCAACTTCAGCATGG
20 IGGTACAT1TGTFCATCCICAAGAAATT(EC:AAGAGA TMCCCTAGCAGTFCMATCGGATCAATTCGGGCC
TTGCAACTACACATICATG
ATTCCCAAGAGAAATCGCT
20 ATTGCC.AAATTCACsC.CATCCTATGAAATCGGTGCATA
CCAGCTACCOGACCCTACAACTTGCAGTGATGIGTC ATFACTGAAAAGGGTGTA GAAATCGGTGCATATATAT
45 TATATAACCC CA , ACC AACCC
20 ACACAMGTC.ATCTAGAGTAGGTTFCCGCAGAGAAA TGGAAOCATATAATGAAGTGCAAGACCTCFGAGTFG
CCGCAGAGAAACACAAGT
20 alGTGCATGTAATAGCGCTTGCTAGTAATACAAATT ACATGCMAAAACGAAAGTATGGACTIGCTGTATC
CAGATACAGGTTCAGACT CTAGTAATACAAATTATAT
20 CTC.AGGTGTGTCTCCTGTTACTGCACTATATTGGTATA
TGACAGTGTATTTGACCTGTCAGATCATATGCTATG GCACTATATTGGTATAGGA
48 GGACAGG TCACTTICATCT 1TGAGCC.ACCAAAACTGC CAGG
20 ACACTGGATTGCC.ATTGCTATCGTAGGGTAACAGTAT ¨
CACGTGGTCCAGATTAGATTTGCAACGTEGGATA ATACAAATCCTGCAACAG GTAGGGTAACAGTATTTAC
GATCAAATAGACTATTGG GTCTGGAAAATGCAATATA
SO GCAATATATTATGC AAGACTCTC AAGTGTG TTATGC
ATTGAATGTGTAAATACCT TACAATGTTCTGAATCTAT
Si TATGTACAG TTGG GIGA GT ACAG
GTGGCAGACCTAATACTGT
AAGTAGCACTAGTTTTACC
TGCATTTACCGACCCTTC
TGCCTTAACATCCAGACGTGGGACATAGTTAGMT TFACCAATCCTGCATATGA ATAGACACT.
ACATTAACCT
54 TFAACC TITTCAC 1 rtACCC:AATC GC TFTCAC
i TCCIGAAACACAACGMAGTATGGIAACACCTAAA TGGACATCCATACTATAAA
55 A FTCCT 1 GGCTGCC , GI TAC:C CGCAAGCAAGACA
TFC.Cr 56 CTFCTGT ' CACA AAATT
CCCATGTTGCTAC.TTCTGT
TTTACGACGCGAACAGTTATTTGCCTGCATCTCCTAT GTACTTTGCAGGATAATA AGGTTCCCTTAGACATTTG
AAAGGGCTCTGTACTATA GGCACATATATGGGAAAG
GTGGTCAAGTACTAGTTGC
59 . ACITIGCC GCCCAT CAGGGIGGAATGCATFGG C
GGAAAATATTGAAATGGG ACTGGAACTGCTAAGCATA
60 TAI CTAGIGTGTG It AGTCf T
CTCGAGGTCTGGATACGAGATGGGCMGATCTA IC CMATCCAGGTAATAATA GAGICAAGGEATTCTCATA
20 GCCATATGGPXACAGTGGTCAGATCTAATGTCGCA AAAAGTACACATCAGG.AAGGCAAATTGCCATCATCC
ATGGAGAGAATAAAAGA
AGATCFAATGICGCAGICC
AAAAGGICGAAAGGTTGAAAC.ATGCCTCCTTATTTT
63 CiCCGIA 1 AACTTGAITTCTG CTGGATCAGACCGAGTGA GI
ATCACCICF GGCCGTA
20 GTACGCCACCATCAAGGGAGM AACAAAAGAGAAG . AI
GCTAGAAAGAGAMTGGICCGTIAAACACTGOCT ATAACAAAAGAGAAGAAA
20 TGCTAATCiGGTCTGCTGACACAGTITGA FTATCGCT G
TCFCICTIGGAAATGIGC(ACAGICTAAGGAIGTCC AGAAATGATGATGFTGAC AGITTGAITATCGCTGCTA
65 , CTAGA ACGAT CAA GA
.
GAAGAGCAACAGCTATTCTCACiAAACFCTCCCGCTF AAGAAGAAGAAGTGCTAA
66 GCTAACGG ACTATCAAC GCGGATCATCAGTC.AAGA C.GG
20 CCCACATCAT TGATGACGAATAAGIGAACG (AC TAT T
AGATCAATGGCCCIGAGICACAATTTCCCAG ITCCT TTAAGGGTFAGAGATCAA GAACGT ACTA.' TG
TCICIT:
OGGTAC.AGTGGATTC:GIA
68 CAAA CAC A GGACAC1.
GTTCCAGCAAA
ATTAAGCAC.ATTAGCCTTC TGCATTGACTGAAGATCC
69 TCTGGG TataTT A
GATGAAGGCACATCTGGG
1TAIGGATGGC.AAAAC CAACAATCA TCTGACTAGC
AMGTGACC:GGGAGAGA
20 AGCGICi 1CAACRTAAATAATTGCTFTAAGCAATGAA GGITAGATFAAGGG1T1G 101 ACGTAAGAGAACiCA AGAATCTI TGGTCi 1TACCF ITAAGCAA TGAATC FCTGT
GIGTA CGAAAGTFTATAGCTAGC
GGATC.GCTCATTTGCATiA
20 ACT7 ClICAGGAGAIGGCACACACAACAAGGAGAAC
GGGAAGTGGAICTAGAACGMCATTAAIGGCGGA ACAACAAGGAGAACGTIC
20 CAGMCACCTAATGGTC.TGTAACTITGGGAATTGGA GTGGATCACGTTCTGGTAGACCCATCTACTGTTACA
GCAGMAGMAMTGO
74 TCTGGG . GATGGTCTT TGG
ITTGGGAATTGGATCTGGG
20 GGTAGCAGTAGAGICAGTGATITiTATTTAGATTTIGT ACTGGCGGTGTCACTCAATCCTAGAACTGCAAAACT
TTTAGAITTTGTAGCFCCTG
20 GCTAMCGSCCAAGTCCCGAATTIGAAAATGCCGCC AGCAGCACC.AGACAATGCATTTTICTGTTAGTATAG
AATTTGAAAATGCCGCCTA
AGACCCTATAAATGCAACAGATGTTGCMCTGC.AT AGTGAAATACCACCAA TA AGAATTGCAGACTATAGCT
CACAACGTITAGTTIGGA TATCAGGTGTAGAAATTAA
TAATAGCCCTATFCAGGAT
CATTICAATGICTTTATCT TTCATTAACCTGCTATGTAT
20 GTTFCTANISCCCTCCTC.OGTGGTGGTGACAGATF GC CIGTCi TGGAAACCIGGATATIGTGTAAGATCITACC1 GAGCAGGAAGAGCAGGA GTGGTGACACiATIGCPAG
81 AAGT TTAGTCiTCTTTGT T T
GIGCCItiGATATTTFACACACCAGITCCTFATATCCC ACAGTGGCTIAGGACATG
82 TAGCGT ACTCC.AAA A
AGGIGGAMAGGTAGCGT
TTATCCITGTITGAGITTT ATGTAGAGATACTGTACAA
83 TGTACAAAAACTG ACAAAACTTG , CAAG , AAACTG
.
20 MAGCACTTCTCATATACACATC.CACCACTAACAGAT AGATGGAAACGAGGTGICTGTGAGGTAATTTAATT
GCTAATAGTAAAAGTCAG CCACTAACAGATGCTAAAT
20 GTTIGTATGCCCTGCCTTCFGCCAAATACAACACTGG C.ATTGGCAGCATCTGAMGCACCATATTGGGAMC
C.CAAATACAAC.ACTGGGAT
20 ATGTCTATAAAGTAC.AGGCCATCCGATATATTATCAA ATG
TGCATITTACCMGGAAGCATTCCAGTCC.TCAC CGAGGAAGAAAAAGCTAT ATATATTATCAAGATGCAG
171TACATGGGTGOCGGATGCCITrGTGATGTATTA CC TCAATATTATTAGTTF
CATTGAAATGTFGGAGGTA
20 TGGGAAC.ATAGCCAGAGGCCCACTTTGGGGGTCTGG CAGGTGGGGGICGGGTAACACAGGAGGCCCAATA
TGGATTGCCAGTGIGGIT
ACTITGGGGGICIGGG TAT
ACAGACCCIGTTACACAATCTGCCAGATCCTICTGTT
AAAGGGICTCIGTAGGCA
GTAGGCCTACTFTATCTGA CTGAAGGCCACATTAGACT
CCTTCTGTGGTAGTAGATT ACTACATTTTATTTACATCC
TTGGGATGCTGCAAAGCCTAAAGGAGGACATGC AGIT ATGTTAAAACAGGA GACAAAGTGTC.FCTAIGGA
92 CTAIGGACC 1 ACC AAGGA (X
t TGATAACA
93 AACACACA 1 TTGCAT AAT , TGT C.ACA
TCTACCTCCTCTAGAAAGCGGTCTTAAACCCCCCGTC ATCTTFTTGGAAGGTGGA
TACAGAACGTTTGTCCTCT
94 GTCCFCTG ' TAM TCT G
CCAACTTTCAGGTATGTAC TCAACAACAAGATTACTCA
TGTAAAMGGAGGAGTT MACCTGTCAGTCTTTTAG
20 TGCCCCTCATTTAATTCAAGAGAATCTGTTTGAGGTG TAGCGAATGGTATGTTGTGAC.AGTAGCTCTTCCATT
GCTTCAAGAAGTGTTCCT
97 . CICCAG GIATCCAAA C
CTGTITGAGGTGCTCCAG
GCTATMAGCGCTAAAAC GTTAACTAAAACACCMG
TGGACFATAATGGACTTFA ATGGAAATAGAGCATATIT
CTCCAGAGAGTCGCCCCGTAACTCTGTCGGTCGACT CGACTACTACCGACGGAG ICTCTAGCTCTACCACGIC
AGACCCTATAAOTICAGGriCTGAATCFGCAGC:ACT GCAAG TGAAAITGATATA
CAGATGTGGTCGGTGTTG
21 AGGIGTACTGGTACGIGGAGAGGCCCIATI AA TCAG GGGGAGGGATITCPACCi ICITATCAATArruca GT AGITITGGAGAAGAAATA
03 . CCT TGGCTGTI CACTI
ACCiCCCTATTAATCAGCCT .
21 CTGCAGATGITTCTGAAAACCOAGGCCAGACCAGCiA GAACAATACGAGTCAGCAGAITAGGC1TICTGACCA
CAACiATTTGF:AAGAGGIG
04 TITTGC. ATAGTTAGTCCA G
GCCAGACCAGGAMTGC
21 'FGT. I GCAAAT ATC:AGCTGIGCATTAAATACCTGACAA TGAAGGAGAATCF ATGATCAT
GCCCCTTGCAGCF A T TTAACIGAAACTACAAAT AATACCTGACAATA TACTA
OS TATACTAGAGGA TTCFGGAAT GAAGGC GAGGA
21 Ca IT TGCGTCGCCTAAGAAGA TATGGCAGF AGA TGT RAI GMTACAGA I
GGCFCTITGGItiAGIAC.ACGAG A TIMM TCCT AA TG ITC TAT GGCAGTAGATG TF TA I
OS TTATGATGA CAACAGG CAGATG GATGA
21 GCTCTGICTATCATTGCAAATTTGTGTAGTGGAAATC TGICTTTAACCCAGAAAGAGAGAGACCACCTC.TACC
GTGGITCAAATAATATTAC GTAGTGGAAATCAGTTTAG
CC.AGCCACTGGIGAACACTGGGATCCTITIGCTGGA A TATGGAGATACAGAAAA CCGCAFATOCITI
AAAGCA
OS AAAGCAG GC TCCTG G
21 AGIGTCiCCAAGIGATICMCITAGGGGTCTACATIT
CTCGGATITAACAAGCA ITGCFGT TGCCI GTITCCCC GCiGGICTACA ITTITGICI
21 CTGCCCATCTCCACGCGAAGCCGCGGAAATC(AGCA GCAGITIGGCITCGAGGIGC.AITGATGGCCTGGAA
AT CFGAC GGCAAGGCCI. TCCACATG
CCCiC.C.GAAATCCAGCAAT
21 GCCGIGCTGCACCICAAACATTAACAIGTACCCGGCA CCAGACiACGGTGITGCCiCCGGCAAAGGAGGACCA
AACATGTACCCGGCAGTC.C.
21 CTGAACTAAGAC.TGGGTGTCCATCCACTAACCATCCT
GTAGACACCCAGTTATGCAGTGTGATAGAATTCCAG CCCFACAACACAAAAC AA
12 TTTGC . GAGGTTT ACT
TCCACT.AACCATCCTITTGC
GOGGGGACITTATGTGAC
13 TGACCC C ACCCC.AGTCCCGTCCA CC
21 CCGGTATCGGGCC.AGAGGTGGIGGAGTGITGGGCT
GTCGCTGACTCCGCCA
CTTCTAC.CICTCCCTAGCC
CGCAGGGCAAGACGAGGGATCCAAGGACTCGGGCC GG C TCCAAGGACTCGGGCC
21 CACGCTGGAGGCACATTGTC.CGCCACTICCIGGGIC GTGGCTGCCMGOGATGCAGCAGGCAAGGCGAGC
GCCACTTCCTGGGTCATG
CAATCCAACACGAGGCAA
18 TGGGCCGCCIAATGCATCCTATGCGCCGAGGCCTU ATGCGGGATCUGCCGGICGAAACCCGATGGCCCCCi GCiACACCGACTICACGC TATGIXTCCOAGGCCIT
21 TCCT1C17C.CACGGCGCCGCGAG1TACGCTCCTIGGA TCAGAAGCTGAGACCGAA
19 G AC. TCGTGAGGCAMOCATC.CTGGCTOCCGCGCATAC GA
C.GAGTTACGCTOCTTGGAG
20 C ACGA A CGGGCCTGGAC. MAGA AAGCT
iAGTcG
21 ACTTGTAGGCCX:GGGGACCCCGATCGTAAGOCGCAA
GTTTCAGATCCCACZC.CTOC.AAAAACACGGCCAAGG
CGATCGTAAGCCGCAATGT
AGAATGGCCATAGGCITGG CCTGAGAGGAGGCTTGTG
CfCGTTTCGGCCCCGA
GAGICCGACAGGAGGICT
AAAGTIGTGGTAGACGGG
TGTAGTTGAGCGGCTCCT
TTCGTCGCGTGACCTGG
GGGTGTTCCAGTCATCGG
AGOKTTGACCAGGTAGA
ACGCAGTTGCTGACGC
30 AGACCCACCACCTCGTGCCCiGGCGTTIGTCCCACG TCA GGAGGATGCGGAATGGC
GGGCGTTIOTCCCACG
GCGGGGGGAGGATGAAA
21 GGGGTGGTGAGGATGC.AGCTCCTTTATGCGCTTTGC CGACCTCAGCTCCGAGTGGGAAGAGGTAGCGCCCC
OCITTATGCGCTTIGCCG
TTCTCTATCGTCAACTGCG
CGGGTCTCCATCAAGICCC C
GGGAGGAACAGAATGAG
TCCCGCFGAGCAGA
36 TGCGGOGGAGGCTAAGC:GCCCGTTGAGGCCCACGT AG AAGGCICAGGACGIGGG
CCGTIGAGGCCCACXTT
CCTACTTGGGAGAGTCCG
37 Cl C C
CAAGOCCAGAGACACCCT
38 TCFCGTCCCACCTGGC:GGAAC:CTIGTCTGGGAGGCG AGGGGICAGMGCTOTCAMX:GGICTTCrTCIGCCA
AGAGGGCCCIGOTGAGA ACGTOICTGGGAGGCG
AAAAGCCTCOGICIGGICC
21 ACC:GCGTT AACA TaCGGCCAGAACTGGGCCTCGTG
ACiATGOGGCGCAGACAGCCACAGGGGCiATGCCAG TAAGCAGCCGTGACTAGC
GAACTGGGCCTCGTGGGA
21 MC ITCIGCCTGTITTG11T0TACGAGAGAAC:CCGCC
CTCGTCAGGCCGCGAGAGGATGGCCTCAAACACT G CAGGTGAAGCTGCAG ITC
ACGAGAGAACCC.GCCAC
21 CaGAGITGATGAAAGGGGCCGGATCTCGGCCGITGT
GTGCCTGGGCGCAAGA GATCTCGGCCGTTGICTC
ItIGGACATC.ATCGATCCCGGC
43 GA TACA GACCCTIGTCCTCC. TCCA
GGGGGGIGGCATATCTGA
21 TCTG ITCCTTCTGCTLYAGTGCFCATTACCTCICCiCT
44 AGACGAGGCACAGGC.ACCCCTGGCTCGCCTGGAGT GTCGA
TGGAAGACATGGCCGCC C. TGOCTCGCCTGGAGT
21 ACCGGTGACACCAAGTCCATCTGICATGICGGGGGC GATGCGCCTGACGTIGTGCGGCCCCACTCiTATCCAC
45 CT C TGCTGAGGTGGGGCTCi GTCATCiTCGGGGGCCT.
IGGATGIGCATATGGTIT ITG I TGCiCCAGICTI AATTIA TT
46 CGGA C.TGATGGC GTC
ATTGGTTTCAGCAATCGG A
21 CGCCAACCAACACCICCCItIACACiGCAGCGTCTTCA
lIGGCTTCTGACATCTCCCAGCTGTGCGTAGTGITCi TIC; TGCTTCOF GATGGCA
CAGGCAGCGTC1TrAGGA
21 GGAGTCCCiGACAGTITCATGCTCOGGCTKAGGGGC AG TAGCGGGCCTCGG TGlOCA
TGGACGAGGAGCT
CGGGCTTCAGGGGCAT
21 GGCAGAC:AICCGCCA ITACCiTTG MCC' GCACITACT GCAGIGGCTA
IGCCTCCATGCCAAGAGCTGGIGGG
TGICGCTGCAGTACTACGA
GAGATGGICiTCCGGAGACCCCACACCGTGGGCCCC ATAACCATGGACGAGGAC GGAAGAGGACGAGGACG
GGGGGGGAAGTCGICTITTCACCACGT GGTCGAGGAGGCAGTGG
C.GGGGTCGAGGAGGTAGT
21 CGTGTGC. TGCCTGGAACAC.AATGAGGTTCAGGG/sCT
AAGCCTGCCTCATCCTTGACCAAAGCAGCAGCCTCA CATAGTC.AC.GGATGCTGC
GAGGTTCAGGGACTTGTCC
CCACGATCACATTGGIGG
CCATAGCTGCCCCTGGT
21 ACCGTCTATAGCGCC.TTIGGGTCCATGACTGCTGACC TGGGCAGTAGGCTCGGAGTGICACGGGAGCCACAA
CCATGACTGCTGACCCGC
GAC.CGGGGGAGATCATG
CATCCGGCAAGCACCAT
ITCGIGCCAAACCAAAAAG
TCCTCTGAAGGATGGGCG
ATCAACAGGTGGGCHITTG
58 TGT CAG CATCTCGGACCai AAGCC T
TGGIGTATAAACCGCAGT GTAGCAGAATAGGGCCCC
21 TGCTGTATGCC.TCGCAGCGC.TATCCTGC1GCAGGGC
CAGTGCCAGITCC-AGACA
TGCCCATACCTGOSGG
TGGCCAGGTGGACGCA
GCAGCAAACACGCGGC AGACCCTCGTGAGACCCG
21 ACAGGCAACTACATGGGGCCGTGTCGGATCTCGGTC AAGGGCGCACACCACTCCTGTGATGGATCCAGC.GA
63 C.AG CT CGCTGGTCCTGTGTGTCT
TGTCGGATCTCGGTCCAG
64 CAACCCGGCAGAGCTCCAGGTAGCCTGAGCCGCACT G TCCTCGCT(SAGCCAGG
GTAGCCIGAGCCGCACT
CCTAACAGGGICATCGTCC
TCGCCGCCGTGACTCA
21 GGTCCTAACTGGTC.AGGGGCAAGCCGGTCMCACGA TGACGGATGTCTTTAACGGCGCAGGATGTGTTGGC
AGCCGGTCAACACGACAT
CTCGGAGAAGGCAAAGG
C.ACCAGCCCGITCACCA
21 CCTACACGACCGCC.AAGGGGCAGAACGTTGICGCTG 1 ACTGACCCCCTTGAGCACGCGAAGCCGTCTCTCTG A GGAAGG TCAAGAGCTGG
TIGTCGGTGA
GTGICCTGGIITAAGGCCGCAGTGCTGC. TCCAATGT AGCCTAG TCAGAGAGAAC
TCTTITGGAC
CGGGACAGATIGTCTTCCA
GCCGTGCACCTGCCAT
21 CCCGGACTCGTITTACGGACTCCCCCUTTTCCGCATC TCGATGAGGGAGCAAACACCCGCAC.AGICACGGGG
AGACTCCTGCCTGAATCA
CCCCCTTITOCGCATCAG
21 CAACGGGACTGTCATGGAAATTATTGCCAAAACAAC GTGGTC.AGCAGCAGATAGTGAATC1TGAGCTCCAA
CAAAAGCAATAAAGTACA
ATIGCCAAAACAACGT GT
73 GOCKi CT GAACGTGACTACCCGGCGGIGGACAGCAGT CGTAGCACCACGTGGCTGCTCCAGCCCCTGCA
TACC CTCFAGGGCCGACCACA GGCGGTGOACAGC:AGI
74 CT MCI CTCCGGGCACGIC Tr AGT TCGGGGCCIGTGTC GAGTi TCCAGCCCTGGGC CA AAA Ci A CCCTCCCCACCT
75 'FGGAGGGGGGCCAAAGAGGTACCACCCCCACCCACA GC CCATGGACGCICACACC
'FACCACCCCCACCCACA
TCCGGAGTCACAGACTTGGCCTTG
76 crcr GG GGAGCAGCCGGGAACT
CGGGAGGACACCAACCCT
CTGGATGAGGGAGCGCCAAITACAAGGGIGGGCTA GTGGCATCCTAAGGAGGG
"FTCIGGAGIGCCTiTCGC
21 TGGCCAACATGACITC:CGTCAAAGGAAGGCGCTGIA
CCAGGGICCCCACAGTGGAIGGCTCTGACCAGCAA AGGAAGGCGCTGTAGCi A
21 CAGATCCACCACCGC:ATCCAG T TITCTCGCCCCCT ICA
GGGCAGITCCSCGTTGC:GCACCCACi A IGCCTCAC:Cf TTTCTCGCC:CCCITCACT
21 CAGACGCCCCC:AOCCCIAAC:AGGCACA CrACAGCA A TM CTAI
GGGGATGAGG
SO AGGCGACCCCATGATGCGGGGTGATGCGGACX:TTGG TCA CC
GGTGATGCGGAC.CTTGG
AGGGGTGGIGGATGT
CCTTGCCGTGCCCTCT
21 TGGCTCCiCrGC7 GTGACCCCIGGCCA TATCAGGCAA CA CUT TGOCCiG
IGGGGCAACCITCATCACGGGGC CICCiCC.AT A TCAGGCi AAG
21 CGA GGGTGCCGCCITTGAGA CAGGGICTC1CiCiTAA AC
CAGGGTCTGGGTAAACAG
GCAGGTGCTCCGAGCT CT
84 TGGCGGTGCCGGIGAATC-CCCAGCCT. GCOGGACTT CXACG
TCAGCCTCCAACAGGTGC OCAGCCTGCCGG ACTT
21 CGTACCTCACCGCCAGC ICCGGACTCCiGGAGCCITA
GTCGGCAAACAGGGGC
AACGATGGAAIGGCCA CCA OG GGCAA TAACCGAG CA CXTCCITGTC:GT AGAGC
86 AGCG GCC1TC.A GCACGC.CCTCTGGGAA
21 CGCTCCT ITC; TGGCATCACCGCCTOIGCAGAGCCT t G
GCACCGTC:AGOCACCIGTGOCITAGGGAGGIGGC
37 AC CAC AGTTGGGGTCGGGCCT CC.
TGTGCAGAGCXTTGAC
CACGTCACGGGGAACTG
21 CGTACGTGCGTGTCTTTGCCCA GGAAGC.TGTACGCG
TGTGTAGCACAGCAa:AC
CTGAGCCGCGACCAGT
21 GCTACCTCATGTTCAGGGCCATGCAAAGGCAGGTOT TGG/s CTCAGCCACCTIGTCGTTGAGGACCMGGTCG GC-AA AGGC AGGTCTTTCTC
GTGACGAAGGGCCCCA GGCTGACCCCGGCAAA
21 TGCCAATATCTAAGTIGCC:TTAGTTGTTITCCGTTTGC
ITTCAGGCTITGCAAACACAGTGTATTAGTIGCTCT TGAAAGTTGTCITGGAGAA
GITTTCCGTTTGCTGCAT
A AAAGACATIGTCCTGCAGC
TGITAAACTCCAGTAATCG
TGGGGAGIGGITATATTA GTGTGCTAAAAACAGAGA
95 C.AGAGATACC TCCC ATGC TACC
GIAGACATTATAAACGAGCAGAGCACACTTCTAAAT ACAGATGAAAGTATATTG 'TGGCTGATGTAGATAGTAA
96 ATAGTAATGC 1 GATATC.C.ATTGTGA GCATA TGC
21 AACCZTAATAGGCTCATGCAAATGITTAGTATTTTAT MGGGIGGGGTIGTC.ATTICATTTTGCATCAGCTA
TAAAGGGTATACCAAAGA TGTITAGTATTITATGGGC
97 GGGCCTG AC.GG , AAAGC CTG
AGAACTGTATGA/NAAGA
GCTGGAAAAAAAAAGGCTACACTGACACATTGTCTT TAAAGACATTACTTAACA
TATAGCAACGAACGATGG
99 GA ' TTCCICTTTG GTCCC A
22 GGTGTCCTGGGTGCTAGACACAGTACATGTGGGTTC AAACX:TACTGACC.ACGGACCCGGGTTTCGTTGGTGC
ATATGGGGAC.ACAGGCAT GTACATGTGGGITCAAGG
CAACATTTGTAACATTGTG ATGATAGTGIGGAACAGC
01 C.AGC.G GTTCCGC GC G
TTITTGTTACTGTGITTCTT GTTTCTAGCACTATATATA
GCACACAAAATGTTACAG
GGAGCAGATGITCCAGIG
ATGCTGGTAGTICTAGATT
22 GCATTTTCAGTGTCATTAAGCTTGTCGGACAGCCTTf GCTGTTTGATGTTAATCCTGGTGAGCAACCAATAAT
OS AGGAAT ACATAGTEGTGT AAGGGTCTTGAGGTAGGC
CGGACAGCCTTTAGGAAT
22 CCAAGGGAACGTCAGACCTATC.ACAGG1TTTGGGGC 1 TATCAGAITATTTCGAATGGCTGCATTAAAACATTG ATACAGGATGGTGATATG
06 AAT 1 cicraia: GTAG
CACAGGITTIGGGGCAAT
t ACTGGCTAC:AACGIGCACAGGITACAAAAAGCAGA
07 ACACCT I TTGCCC , TC54.11CCCCTACA.AGC It *TCTITA.TGCATCCACACCT
GGTGATACCTACCGCTTITTAC.ATITTCTGC.AGGAG MAACAGCTGAAATTATG
C.ATGGATGCTACATTACTG
08 TTACTGG ' GAGGA GCC G
TGCGTGTCATGTATGTGTG
CATATCCMCGRACACAG
ATTIGG GCCAAA GT
TGCCTGGCAGTTATTIGG
GCCACAAGCTACATTAGA GTTTCTGGTATTGTTAGAA
11 . TIAGAAAGATGG AAGCAC CC AGATCG
GAAAGCTIATTGTAGAAA GCCTACTGCTGAAAATTTG
AAAACGAAAGTATCTGACAAGCCCTACAGCTICCAG GIATCAGAITTAATTGATG
ACGATGCTICACAGGGAA
CAAAACATTACTACAAAC ACTGTAGGATTATATGCTT
ACTGAAAATGCAAACGCA C:AAAGCAATAATCAAGCTA
is AAGCIAAACA TO GITTCCAATC G AACATCI
22 TGAAACAGTITTIGGCi ITGGG TAT ICGTAIGCAGCAG AAAAGCTGGCTA
16 AAA= GGCTTTAGC TIGGIC
CGTATGCAGCAGAAACCT
AGAAGAJTC:CAGGTCCCCICAGCATCi T CCIGTICCCA GCCGAAGAATCAC:ACAGG
17 . ATC CCTCC A
CGCCACACCTTCCACATC .
TTCCITIGGAAACITTACKTOCAAGGCCIAACGACT ACICITGCAGATATITTAC
22 CiCTATGTCTTCTTGGCCATGIAGAAATGTA ITIGAGG CIGACCCGAITCCTCCCAGATTI
CAA i AACTACAGCA ACTA IGAGATATTGACAA AIGTA 11TGAGGACTIGTC
22 TGCH AAIGCACCF TITGTGICAAAGTITGGTACGTTIT ACiCCCAATTICGACIATFGTT
GAAAGCATATCTGAA CIAGATCGCCCAATATATA GTTIGGTACGTITTAGCAG
AGCAGA GICiTCATGT GTGAAA A
AATTGCCITTCAGATGACACTCiTCTACTGGCTTTGCA AACTACTACATATCCAGAT CTAACTCTATACCGTACAC
TAACCICATCACTG CCATCCGTIGTTF HATAAA
GCCZATACAGAAAATCOCA
22 CGCTCTGTTAAATCTACGYTCCAAAACCiACiATGTCCT ITTCACiCTGAA T TCi ICCCAGTITTTAGCAGCCCTGTI CTCAGAG TATTGAGGATA
ACGAGATGTCCTCCAGAA
22 AAC:ACGCAGG !TA TOTTCAGGATCCTCCTATIGTGCA 11GCGGAACAGCA
ITAACAGAAC:AACTTGTAGCTCC ATCITCCIATIGTGCAAAG
24 AAGTC TTGT GGAGACTACTGC.CCAAGA IC
GCITGC:CTGCAGTGITGGAAACCGCATCAGGCTGAT GC7ACCT67GCCATAAACC CTGTGCAG IGAGGA.AAAG
22 TCTCCACCACCAGCCITACTGTMACAGAACCAGCA CTC:CAGGAGCTTCTGTTGGGCCTTCACCGTCGGCCA
26 CAGC . TAG GAGGACGGTGAGCAACCT
CCTACAGAACCAGCACAGC
22 TCC.ATCC:CCGGGATGCCTACTATAGTGATGAGGACG ACAGACAGCAGGAGGC.ACAAGGGCCAACGOACTA
TAGTGATGAGGACGAGGA
22 GICGCGTONTATCCACT. TGCGACGTAGACTGTTTGA
GC.ACAGGGGGGGTACAGGAAGTGCIAGTAACTGC.A GGCAGAAGCTCACAAAAA
ACGTAGACTGTITGAGCTG
28 GCTGC CCC.G GC C
22 TGACACCCTCTGCAACTGTGAAAAGTAATAGGACC.AC CAGCAACACIGICAATATGCACAATCTCACTAGCAT
TGGAGCTAGCAAGACAGT AAAAGTAATAGGACCACG
CAAC.ATATGCTCATAGAA CCJNAATAAGACATCCAGC
ATCCAGCG GCATTCTTCT CCTC G
22 TGCTCTGCCCGITTGTAATGGCCAAGCAAAATATGTA GGC.ACAAATGACAATGTC.AGAATGACCGTACTATG
GCCAAGCAAAATATGTAAA
GCCGCAGCCATTATGTGATGCGGTATCTATATAGCT ATATATGGACCTGCAGAC AT ACTTTTGTATGAGCCTG
32 GCCIGTIG CCAGCAC AC 'FIG
ATGACAACAAATACCAAT TACiACTGAAGACAGGTGGA.
33 C.AGGTGGAA TCGTC a:CT A
ACGTTAAGTTTGATGGCGA
TACTGACCCCCCATACACC
35 CCGAGTCGCTGACIGCTTATGCAACCCGACGCCACCA C.ACTGGT , G
, AACCCGACGCCACCA .
22 GCCAGGTGGAGGITACITTGTFAAGGTATAGAITGC GATaITCAACGTGCAGTAACAATCATGCCACAGGG
GGTGATCCAAACAAATTA ITAACIGTATAGATTGCACA
TGGCG1TACTGOIIILT:ATTGIGTATGTATATGAT GGAACOTTAAGGGACATC AGTGIGGAGACCITTATTT
37 CaTTATTTGC CAGGAGGCC TG GC
TGCTGC.TCAATIGTATATA
CITGTGTTGTAGTGTGCAA
22 GGGATGTATCCAGTGCGCCC.ATTCTTTGGGGGTTTG
TGGGGTAGCTTGGGTGTC TTCTITGGGGGITTGGGC
CCAACAAGTACATGTACAG
22 CACTATAGGbililtiCCTGAITGCTTCCCGACTGITA GTTCCCAAGGIGTCTGGATATCAACITATTGGGATC
CAAACATATATTATTATGC
TTAACTGIT
TGACACTGAAAACACCAAT
CGTTGCAAACCAATAAGT GATGTGCCTTTAGATATFG
22 GC.AGATACC.ATTGTTAGGGCCATCCCAGITGMAAC 1 ACTACACGTAGCACCAATATGACTGCATGTACTCTT ATCCZAGITGITTAAC.AAG
44 AAGC 1 TAAAATTCiGAAG CCTAGTGGGTCCATGGIT C
t GIACTGFIGTGTATGIATGGGTGGCACACAAACATG CCTCTACAAAACGTAAAC
45 C:ATG 1 CATAAGGAAA , GGT
CCAAACCITICCIAAGC-ATG
CCGGTTTCGGTCGTGCACAAGCGCGAAACAGCTAA GTGGTGTCCTGTATGTGAC
46 GACC ' CAGT GGTTOCGCCTTGTGAGTC C
CCCTAATAGGGGCGACAC
GCCATGAATCACTCCCCTG
GCACCATGAGCACAAATC
48 GTCGC GCG Cl AAGAAACACCAACCGTCGC
22 GTACCCCATGAGGTCGGC.AAAGTCGCGCAACGTGGG COCCAGAGCTCTCGCGCATCGGGTAAGTTCCCTGTT
CAATGACCCCCGGCATAG
49 . T GCA G 'FCGCGCAAail GGGI
GCATTACCTGGCAGCTCC
50 GT C A GCTGa GTCCi CCACCT
CGTCATGITCGGCTIGGCCTGGCCAACAGAAGGAT ATGATGATGAACTGGTCG CACGGCTACCATGATMG
22 TCGGGACATCCTG1'CGAGTTGACC1TGCACACCGGCT GGATGGGGCGCCTFGCAATATGCCAGCAATAGGGT
GCCCTGAACTGCAATGAC
52 TIA MCAT r CCITGCACACCGGCTFT A
AGCACCGCCGCATTTGAGGTAA ATTGAACAGCACTCGACC
GITCGGCTGCAC
22 CGIGGIGGAGTGCAACAAAGGAT ICACTCGTGGGCiA CST TGCTCTIACTCGGACCI
GCACGATGTTTI GGIG
54 TCGT GAGGTG CTCACGGCTGC.ATGCAAT
ITCACTOGIGGGGATCGT
22 CCCITGA IGTACCAAGCAGCCACAGCTAGCTGC:AATG ATICaTCACMGCCI
GIGGTGCACAGATGCGTCA t CAGCACIAGACiAAGGT GC
55 . CiCT AAGCA IC
CAGCTAGCTGCAATGGCT .
22 GTAAGCAGGCCCAAGCACCGCCGICGCCATAITCIAC a AAAAGGT GCTITGACGCGCGGCACA
TCCICAGTA CCM GATGGCATC:ATATG
CCGTCGCCATATTCTACCC
FOCI GCTGGGC GICATIGICIGGGGAGai AGACAGCTGCTIGTGGGG
22 CGTGAGCCGGCCAGAGTC111C ICGGGGGTE TT GTG TCACGCAGATG TACTCCACi I
GCCGTGCACGCiaCCA
TCTC.CIGGGGTITTGTGGA
TTICATCCCCGTTGAGACA
ACCACCAICMCiGCATCGGGC.ACAG1TAG ICTG ATM GGCAAAT TCCTCGCC
60 CGCCG ACOCCA CGCCCATC.ACErFACTCCA G
22 CGACCACTACGICTCC:CTGAG I TCCiGGG TATGCiCiCIT
CAOCGACGCCCTCATGACAGTAC:GTIGCAGTC:GATC AAAAAGTGTGACGAGCIC
TCGGGGTATGGGCTTGAA
22 GGTCTCC:GATGGTGIGAGCT CAGTGTAGTGCTCTGT
CCGTCAGGCTC:AGGGCGTAICTGCCTCCC:AAAACI C TAIGTTICCACIGGIGAGC GTG TAG"!
GCTCTGIGAGTG
62 GAGTGC AAGA Ci C
22 CACGGGAIGTGTCAGGGIGACCACACICGIGGGCCC ATACATCGCCACCTGCATGCAACCTGCCAAGACCC:A
IGTITGACTCCiACICAAGC
ACACTCGTGGGCCCCA
22 GCCTGCACAGTGGGTTGTATGTGCCGAGATGCTGAA TTC:TGGGCCAAACACATGTGGATGTTGATAGTCCTG
64 GTCC . CGAGGT GGGCGGCTCTCATTGAAG
GCCGAGATGCTGAAGTCC
22 CGACGAGAGaCCCGAAATGCAGGTAAGGTGCTAGT TGTCGTCAACTIGCTGCCTGGACAGAATGGCCGCGC
CAGIGGC.CIAGTGGGAGC AGGTAAGGTGCTAGTGGA
22 C.ACTGGCCCTCCGTGTAACAATCATGAGAATCACGG CGMACC.CGCGCCAMCITTACC.
TCCGCGTAC.TCTG
CATGAGAATCACGGGGCC
TCGTTCTGCGTTGGGCTTA CAGCTTCCTTGCGACCC
22 CCGGTAAACICTGGIGGGAACCCGACCTTGAGCCTTC CCTGGGCACGGCC:TG11TACCGG1TaITAATCTGGC
ACCCAATEIGTCGAAGAAA CGACCTTGAGCCITCGATA
CCTGATGAGTTGGCCCITT
22 CC:ATCACCGMTGAGGAAGITITITACCCTGACCTCG
TGGATTCCAGIATICC.COCGCTGGTCCTIGTITTC.CG CAAGOGGGGCAAGAAAG
71 (ICG OC:C C
MACCCTGACCTCGGCG
GGTGCTCACCACTAGCATG
rccc CCCCAGACCGGAGTATGA TCCTCAAATGTGTCTGTGG
73 GTGGC AGCCA , TC , C
' AAGGTTACATGGGCTTGA CTCTGCACACATACACTCC
22 CAGGCTGCATAGGCAAGGCGGTTGAGGGCAGCC.AG 1 CGCGATTGATGCGGAMGCCGTTCCAGCACCGTTT CTCAAOGGTCTGCTGATG
75 TM ' TGC C
22 GCACGGGC.AACTACCACCAAGCGGTAATGAGGCCGA CGAGGTGGGCGTAACGCTTGCAAAGTCCACGCCAA
GGOGGIGATTTGIMITC GCGGTAATGAGGCCGAAG
22 ACGTCAAAGGCTGCCACCAGTC:TTCGTCAACGGICAA TCATGCCCGGGAAGAGTTTGCCGGCTTCGTCTTCCT
TCTTCGICAAtik-TCAAGT
AGAGGATGCGGTTTTGTTC
78 GTTCG AC CC.AGCAGGCTC.AGTFCG G
CGAAAAGAAAATCGCCCTC
GATTCCAAACGCGGTGCA C
ATGTCGCCCAAGITTKAG
22 CGCGC-AGTTTGCCGTTGAC1TGACGAAGCCGCCC.TG 1 GCAAAATCACCGTCGCCGCCTCCITTGGCGAGTGCG
ACGAAGCCGCCCTGGT
i GCTCCCAAGCCGAAACGTGATATCGCAAGCCGCITT
83 .ST 1 CG , CCAAGTCGGCIACGCAA I:
GCICITCGA IGCCCII GT T
84 T ' GTGATGCOGCCGCCGACTTCTATCGCGCCGCCAAC
GTTCGGGTACGCTGTCG ,XGCTGCCGTCTIGGT
CGATTAAATCGTGCTCGC
CCGAACAGGCGGTAITTGC
TTGTTGCGATGTGATGACG
22 CGCGCGGCAGTTGATACAGGCGGGTTCTGCCTTTTC.A CCGCCCGCGACAATGTAGTTGGCACGAACGCTTGG
AATAAAACTACTGCGCCG CGGGTTCTGCCTTTTCAAA
87 . AAC AAG CA C
88 GC GG GCGATT GCGTGCCCiA
AGGCGGACACCITGri GC
GGCAGGGAATACCGCCCCiTTACG TOCGCCAGTF CG ACAATTICAGCCACAACTG
TCATATCGGFCAGCGGTA
CACACCAGCGCACCGA
GCAAGATAAACCACGTCGC
TCACCGCCACCAAC:GCGCAAGGACIGGCACC
92 CA G CTTGGTCG1TTCC.ACGCT
TTCCAATGCCTGCGTTTCA
22 ITCGTMCAAGCCGGAC:ACCAAAAACCATGTOTCGT
93 , CGCCAGTTGACCGAATCGGGAACTCTTGCCGCGTTCC GGC CGGCAGGCTTTCCTCGA
AACTC:TTGCCGC.GTTCC .
22 CCICAAAGACTACCGCAGCAACGTCAGC6GTGC:AGGC CCOCGMCGGOCAACT CGGAAT
GGITTGCCOAAC
TC.AGCGGTGCAGGCGA
GTAACGCGCAAACCOCG
22 GCATCCIGCTCGACAACATCCTAGGITTIG IGGAC:Al (..tGCTITCIC:CGTC/GAACAGTICGTCAGCCIGTCi ICC AGGTITTGIGGACATAGGC
22 CGAGGCTTTGGGCITGGGTCCTTITGGGCCICICAGGA TGC.AGGTGGAGGTAGCGCGCAAACCGAAGATGCC
TTITGGGCGGCAGGACG
GTICATGTICGTGCCITCC
98 TCCG TAGAGC.AGTTCGGCZTGIGCMCTCGCGCTCGCCC
GTGAGGCTGGGCGGAT G
22 AGACATTGCC:CTC:CCC:GAGGGTTTTCCAGTCiCGAACG GCCGCCCE ACCGAC TTGATAAF
GCGAAGIGTMA 1' GITTTCCACITGCGAACGC
23 CGCGTG TCGCCCAAAA.TGACTIGCCICCGCCGAACT
00 ACGGC.GGCGAAGAGAAAGOTITCTTGGGCGCMGC C GTTGCCGAGCCAGOGA
TITCTIGGGCGCGGC
cma:Gcr GGC.ATCGGACGGCAAAG
23 GGCAATTTGTTCGGCAATGGCGCGAAGCTGC.G1TGG TTATGCCGICTGTCGAACGGCGGCAGTCATCGTCGC
02 GC . TIC AGTCGTCGAAGCCGTGC
CGAAGCTGOGTIGGGC
GCCAGATTTGTTCGGTGG
CATACCGGGTTCGCCGA
GCACGCCTGCCGCTAT
OS CGTGTGCATCGGTCCTGCCTTCGGCGGCGGCAAT ARC CGTCCGCGCA.GGTTAC
TTCGGCGGCGGCAAT
23 TGCCGCTCGCCAAAGTCGGCGCC:TTGTTOTTCAGGG CGCCGGTGGCT1TGGAGACGTG1IGGAAGTCIAACC
GGGGACGACTICTTITTCC
CGC.CTTGTTCTTTCAGGGA
GGGTTTGACCGCCGTAAT
CGCTTMCAAGGGIGGCA
CCGCACCIGTCAGAATCG
ATCCALAGAIGGAIGTGC
ACiAAAATCACCGCCATCAG
GC GGCGITGarAGITGCAGGT6TCCGGACTCGCCAC TGT1T-GGGCGTTCGICT
C
23 GCGCAGGTGGGGCAGGTAATCGCCTTAAGCCTICiCC CCTGCCJUVGIGCTGGTCGGATTCCGCTGTTGCCGGT
CGCCTTAAGCCTTGCCG
GAACACCTGCCCCGGTAT
13 CCGATCATACCGGTCGCGCCATTGCGCCGCTTCC.AAC GACCCCAGCTTCAAAGCCGCCCTACCCAGCCGGCAA
CCTGCTGCCCGTATTGGC ATTGCGCCGCTTCCAAC
GITCCGACCAGTGGGGCAACITTTIGGTTCAGGCG TACCG AG TICGCCTATTCC
TGCTGCAAGGCTACGACT
23 ACGGCAGGGCGGGGMTGTCGTITTCGAACCC.CAA ATCAGCCCCCTGCOTACCAMACC.CAGGCGGCATCG
TCCATTICGAACCCC.AACC
CCC A GATTGCCCGTCGTGGC
AATCGAACACAGCTACGCC
AAAAAGCCGAAGCCGAAC
TGCCTGCCTCAACATCGG
19 Cl GTGC AGCACTTGGTCGGCTTTG
AGGCAGTCGGCGCAGT
ACCGCGCCATGCICITGGCGCGITGGCGGCTITTIGG GCA AAACAGCCCGCCCTGA GCGTIGCCGGLI
t it, GG
CGAACTICACCAAACCCAC
23 C:ACG 1 UGC GCGCAGGCCCAA AAAC T
CCITTIGGGCOAGCCG
TMIGTTGGGGAACGGCA
23 CATT C GCGTTACGCCGCTGTCA n-GGCGTTGACGAGCAGG TCGCGGATIGCCIGCT
23 GGCTGAAAAACGCCGTCGAACTTC.AGCGTCGGGTGC COCCACGGCGTGTTGTCGACTATCAAGGCGGCTGC
AACGCGATITCGTCGTOG
GTTGGACAGCAGGACTIT AAACGGCATCGGCTTCTTC
TGAGGCGGCATTGCCGTACCGTCC IGTGCATAIMAGCGGCA
27 CGCAG G GG AGGC:CAGGCAGAACAATCi GC
CGATCGGACGCGTT ITGC
AAACAGCAAAAAGGCGG
CAAGGITTGCCAGCGCG
23 GGGACTGACGGIGTCGGIATTGCGCCATAGTATTCG GCCGCCGICTCCATCAGCA TGaGTai IGCi TAT TT C
CGGAC CC AAGTGICTTIGTCGGCGG
CGGCATAGTATTCGCGGAC
23 CGGCAC:GCAAAACCTGIACGC:GGCAGCACCITITC
GCiCGCAAAGT MGM AA
GACGTTGAGCCGCIGGA
GCATTGAAGGGCGAACaiAACTC:AAAATO TCAATCTGITCGGGTGTCA
23 CGGCGACGGGGCAATTTGCAAGGCAAACiGCGIGC
33 C.GTGCGCGCCTACCAGATTCCTTGCGAGGCTTTGCG G TCGCCAGCGAGGITTGA
CCTTGCGAGGCITTGCG
23 TAIGGCTGCACCiGCCAGGCCCAGITG ITCGGACAAA
CCiCCGTAACCCAMCCGCTT CGCCiCATGGC1 CAAC CCAGITGTICGGACAAACG
34 CGC CTC TGCGGCGCiAGGTTITGC C
23 GGCACATTTTTC.ACGCTTGCCATGOTTTGICTECTTC CAAC.ACGCTGTTGGGTATGGGACCGGTAAAGCCTG
GCCC CTACG GGCGATCATGTGGCAGTG
23 CCCAGCAG TT TCGCC:GIT ITGTCTACGAGCTITT GGG GGTCAGGGGCATAGGOCi 36 CGAG ITTGCC.GTATCCGGGCGGTMCGCTTCGGGCAGIGC A
C.TACGAGCTTTICICiGCGAG
23 GGIGGAACGGAAGCTCiCTGGA CAC:GCCGCACATC
37 TGACCGCCATGTACAAGCCGCTGCCC.GCCTGC.ATGA C GGCGCGGITTTGCCT
CTGCCCGCCTGCATGA
23 CGC:CGT1TATGACGCCOCC:AAGGCGCCICACCA AACC CCGCTCTFCGGCT TCAMATG6f GATGCTCCACCAC:A
38 A GCCTG C.GGCGGCATTGCGTT
GGCGCGCACCATAACCA
23 GCTCGCGCGAMACTIAAACT-ACGMICAGGCCiGCC CGTAGCA TACGGITTGT-G
CGTMCAGGCGGCCr.
GTC.AGGTAGTCGATGICG
T ACG GA
CTGGAGCTGITCGTCGGT
23 GTCCCGAAAGCCGCTGCACCIAA ATCCGTCiTCGATGC
GCGTGCCGC.AGATTITCATCGMGCiCCIGTAGAG GT
C.GMATCCGTGTCGATGCA
CGCMCACGACACGCT
23 CGATGACGGIGGTGGCAACC.AAAAGGCCGAAGTCAT AAGTCGGCGTAGATGTGCCCAAAGCCCATGCGCTC
AAAGGCCGAAGTC.ATGGC
GCCAAGCTCGAAGAAGAC
ACACCCGAAACCCGCAGCGCGCCTGACGGATGCTG C ACCC-ACACATCGGATTGC
CGCCTGACGGATGCTG
OGGTGGCGTGC.ATACC GCGTATCATCGGCGCG
23 AAAGGICGATGACGC.GCACGGGCATCGCCATGAAA
47 CC.GCCGACTTCCGCATACGGCGTGGGICATGCCGT GAM C.GTACACGGCTTGGCAAA
GCGTGGGTCATGCCGT
23 AATCGGCAACIGCGG1TCCC.CCAGATG1CGAAGTC
AGACGATGTCGCATTG Tf AAACCTGCCTCGTCGGG
AAGOCTITC.AGTGAAGACT
TCTCGGCAAAGAACGTAC
ATGAACGAAATGACCGCT
MCGGCGGCACAGAAG
23 GGTAGGATGCGGAMCC.1GCTITTGGAAC1GIGGCC C.ATGTTGCGCGGGATTICTGGICCATTGCTGCTGCC
CGCAAGTITACGGGNOT MGGAACTGTGGCCAAAC
23 GCCCACGTCGITTATCalCAAGTGITTGAGCCAGATT
TGTTIGAGCCAGATITC.GG
Si TCGGA GGGCATGACCGCCGCCATATCGAAGCCGCCGACCT
CGATATCGCGGCGCAA A
23 GCAAEITCCTTATGCC.CTGACGAGACCCAATGCGAG AACGTGGICGATGTGGrrGCAGCCCATCTCCGACAC
CCAACAGCGCGGAMTG AGACCCAATGCGAGGTAG
ATACACGGGATAATCAGC
CCCXGCCAT/ACIGCG GC
GTAGGTAACGAGCAATCC CAAGCGTGGCGATAATCG
AATCGCCGTCTGCCCA
23 GCAAATATACAGCCGCAGCGGTTCATGGCAACGTCA TTGATGCCGACGACCTGTCCATC.AATCCGGGCAATT
TTCATGGCAACGTCAATCG
57 ATCGG CCG G1TTFTC.AGCTGTICGGCG G
23 MCCGGTTGCCGATACGCCTGTATTCCATC.ACGCCG 1 GCTTCGCCGTCGCMGMAAATCGCCGITCTGOT
TGTATTOCATCACGCCGTC
'ITCGCGCCC.AACCGTTATTIGGTCGCGACGGCAAGC ACCTG TCGTT/TATIGGGC1 59 GM 1 AG TGCTUCTGCC:GCGCA
23 CGGTATTCCTGAGCGTCGGATCCGTTMGCGCAGGTT AAAACCCTGCACCGCAGTGATCCGGC.AATCTGCACG
CGTTIGGCGCAGGTIGG
TCATCGA
a..GCAAAAAGCCGCCG
23 AGCCATAAGCTGC:CCGMCGGCMGCCGACCACC ACAGGTG1TCG
TCAACGT
TCCGCCATCGAC:GC1 GCCGAGGCiCCiAAIGTG
66 GCAGGICGTCGICGCTGIACCTGCTGCGCGGGTTCA TGC:FCCFCCTCGCCCACGATAGAGCOTITGGGCGCC
CGAAGCCGTCGTTCCCT CICCTGC.GCGGGTICA
CCTATCAGGCTCATACGGC
67 CG AGACGGAI GCGAGGGCG1 ACGACCIGCTCCT MCC:
CGGCGIGGCTCAMGC
GGGTAOC1TCCACGACGCGTAAAICCCCGAATTCGC AGT TGCIGT ItAAACCACG
69 CCACGTIGGCACGCCTGTGGCATCCCITGCX.GTCCA CTGC.GGGCAGTTCGTTGGTITCGTCCGCAGCCTCG
GGCGACGATGTTGCTGT GCATCC.CTTGCOGTCCA
23 CCICAAGAACACAGGCAAAGCCCGGCACTCTGC:ATTI 1TTGGGCGICITCGTG
ItTTCCACCIGCGGACAATACA TITGTGITCCAGCi TAT GCG
C.GGCACTCTGCATTMGC
GCAAGGTGGCGGCIGICGACAACGGACAAACCAC1 GCiA TGACGGICAGTGTG1 72 AAATCCCGCCCTGCTCGCCITGGACGGCGTTATGGGT GCCGCIGCMGGGC1, ii, 73 CTGACCGGC.ACACCCACACGCGGGC.AAGCTGACGT CCTCA AGGATGGTC-AGGAOGGC
GCGGGCAAGCTGACGT
23 A TGAGGAAGTIGCAGI GICGGGCCTC.ACGACAGA A
TGTCGGACAGGATGICGA
74 ACAICZGACGACC1CGAAGC1C6GGACIsGGGG TGGA ACOGAC
TCGGGACAGGGGTGGA
23 CCAAAACCGCCGTCCTACACCAGCiATTGCGCTCiAACA
TGICCAGGCTGTGICCGAACiCGCATCCCGGCATCA CGTATGGCGGGTAAATTG
TS GCCT GC GC
GGATTGCGCTGAACAGCCT
23 CAACAGCATCTCCGTCAGCGACITTGACGGAIGCCTG GGTGC0116TTCIGTTCGCCACICTIGGCAC:GCATC
76 GC AATG GTCAGTTCGAGC.GGCATG
23 GCCATATICGCAGGATTGCMCCGGGGGAAAAGGAG GGAGGCGGTCAGTA'FGCCGAAGAGGCTICAGACG
CGGGGGAAAAGGAGTIAA
TATAGCTGACTTCGACGG
TGTACGACAGGTGCGGC
CTGCAACCGGGGATGC
GGGAAGCGGCCGATGTG1TTEGAACACCT. TCTTCCG C.GGG1ITT11GACGGTTCA
AATGCGITTCAGGCAIGTA
TCGGCAATGGCATGCATG
23 GGTCCAGOCCIGGGTGGAAATTGCGO.iteiti ICC
82 ACCCCGGGGCAAACCATCCGCGCAATGCCTGCTCGA GCA GCC.AAATACCGGCGCG
GCGCAATGCCTGCTCGA
TTGCTGAAAAAAGACAGT
GACGTTACMGCGCCAA
ITGACC:GAIGCCACGACC
OGGGTCGGOETGCsATT
86 ra: CG GCGCCGCGTITTTGC
ATTICGACCGCCZATTCC
23 CCGACGGICAGCGGGATGACACCATCCTGC.ACATC
87 ACGCACCGTTGCCGACATTGATGCTGAtitiGiCCGC ATCG GTAACGGCGGCGGTGT
TGATGCTGACTTTGTCCGC
88 AGC.AGGCGCGATTACCGGCTGCTGCGCTTGGGCA ACGCCATGTTITTCGGCGGCTCCCGTCATCGCCTCC T
TGCTGCGCTTGGGCA
TGGIGAACACGATGICTT
CCGCGAGACAGGTCGT
CGCACCGTCCACGAC.A AAACCTTCCAATACGCCCG
23 CGACGCACTGGTCGGCAAAGGATGCCTFCCATGACG AACCACFFITCGC:RTTGCCITTCCAAGCCGTTGAC
AGATMGTCGGCATC.GG GATGOCITCGATGACGAG
23 CGAC.AGCGCATCGAAGCCATGGGATAGCGTCCGGCA CTCTTGCGCCGTCCTGCCTACCGITTACCGCAATATC
GTTGTCCGCCCGAAAGTT
GGATAGCGTCCGGCAAGT
CGTCCGATGTGTATTCCCA
GTAAACGACGACGCGCG
C.AATTCGGC.GGTGAITTC CGCTGAATCGGATAGGTC
23 AAACCCAAGCC.CGAAGACGCGGGGTGAGGATGTAG 1 TTCGACAATGCCGGTAATGCGTTCATACGGCATCGT CAGGCCGGTTAAAAGATC
GGGGTGAGGATGTAGCGT
CTGCCCAGITCCAAAATCG
97 I CCIACCGCTGGTICGGCIT GCCFGT TGCiACCIGGGA 1 A C
GCCTGTFGGACGGCi GA.
GACGCAAGGAGTAGGCG
CGGATTCAAACGCGCCA
TGGAAAACCCGCCCAGC
TCGGCTTTGAATAACTGCG
CGGAAAGGGIAACGGICG
GCCGCGG1TGACGATG T CCG TCAGCCA.GAGGGT
24 ACCGAC1ACG1CGTCGCCGCAACCCArTGGCTTrT1C TTITGGACACGCTGCCGGAACCAAGCCCAAGCCCIA
CTICGCTGAGGACGGAAA CAAGGCATTGGCTITTICC
24 GCTCCATA.AGCCCTACCGCCTGTCGCGTTCGATFTCG GAACGGATTCGGITTGOTGGCTCGGGCATCAGCA
04 CTG GGAC CGACGCCITCCiCCITCF
TCGCGITCCi ATITCGCf G
TGCTGGGCGCGTCGATCACCGACCTCCICAAC
OS CC C CTCITCGCCiCAGCITGAG
CTCGGGATGGMCGTi CC
24 CCGCAC 1 OCACGCAT TCFGCGG TCGGCGAAACCAAA =
AAGCCACITC.GATAAAGGCGGCCATTCGCACAAAG GGICCiGCGAAACCAAATG
24 TCGGCAATCTGGA.A A IGGTCGTIT TCCAACCGCTGC:
CGTTGC.GGC.TGACTGC
08 TICG C.ACTTCTCCC.GCCCCCGA.AATCATGCGGC.GACATGG
TGOCAACAGCGMCGAG CGAACGGTATTGCGCTTCG
CCCTGA ICGTGCM CGTGCAA ICAGGAANIGG 'ICCAAAGTGICTATCAG GC
24 CCGATAC:GCGCCA TACTIGGTITGCGCC4:CCAT
T ACAGGCGCOCGAGAAAGACAAACGCCGCGCACAC AACATTGGGTGCAGGGC
GGTITGCGCCCCGATT
ATGTATFGAATGCCGCGAG
11 CGAGC AAACAGGTGCGCCGAC.GGCAGCAACGCCCTGCCT AGGCTACGGGCGCAA
24 CGCGCAGTI GT TCCAGCGATCCiCIA TGCCGAAATGA
CCCTAIGCCCAAAICAGC
CGCCGGCTTCCIGC.AA
24 C:ACCCTG TACGAAACCMGCTGAATTCCGC IT TATC
TGAATICCGCTITATCCGC
13 GGCG CGCCGTGC.GGITGCCGATACTCCTGCA TACGCGCG
GTCGTCGTTGCCGCGT
24 GC4:A.TAATGCGCGCMG TACiGCC:AACCTCAAACCGC
CGC.GCCGTC.ATCATGC CCAACCTCAAACCGCC AG
IS A AGCCGCCCGCCICGMTGGIGITGaIGGGTC.ACG
TTATCGGGCGCGTGATGA AACCCGMTGGCGCAA
ITTCMCCTGCCCGTCC
17 TCCATCGC.GTCCGCCAGTATGGGCGGGCGAGTTGAA GCGATGTTTGCCGCCGTCGC.GCACCACCTTTICCG
TTCAAACCGACCACCGC
CGATAAAGGCGGCGACTT TGGCGGGGRI:miGC
24 CGCACCAAATC.CTATGCCCITCGCAAATCCIAGGCAA
TCGCTGTCCICTTICCGAACCTIGCCITCCGTGATCCT CAAATCGTAGGCAAGCGC
GCGCA TAC GGCCGGATATCGCGTTC A
CGGCGGCAGAAGGCIT
GTCTGTCGTCGAGGCATTCCACGGCGCATACCAGCT
AGCGICCGGGATITTCAAC
OCGCGCC.GA1TACTFCG
24 CGGCGCGARITTAAACGGAAATIGGACGACCTGATA C(.1 24 ATCCCC.AATATGCCGTCAGC.ATTGGCCGTAATAAAAA
C.GACTACCC3TTGCTA1C1CGTTOCAACGTACCGATGA CTACATCAITATGACCGTT
ATTGGCCGTAATAAAAAGT
25 GTATGG 1TG , GT ATGG
26 TGGGACAGCTCGGGGCAAGGCAACTGGACGGGGA AATFCCGCGCGCGGGCATGCAACCACC.ATCAGCCC
GGACTGGTGGTGC.ATCCG GGCAACTGGACGGGGA
24 CTCGCGC.ATATGGACGCGGTACATCGAATGCTCGCTC.
27 G GCCAACCATCCGCTTGCCGGTGTCGCCGCAC.GGAT
ACGCTATCTTGCCCACAGC TAC.ATCGAATGCTCGCTCG
24 GCGTCTGCCTCGATACCCAAACGGCGGCTFIGCCC.CA GCACTGTITGCCGAGCGCGGCTTGTCGAMCCTGA
28 A CGT TGTGGGAAGCC.AAACCGG
GGCGGCTTTGCCCCAA
GGGCATGCCGTCAATATC
CGCAAAO:GCCGTTGC
ATTCGGCTTGCCGCTG
ICGGCAGATACGCCGT
OCTTCCTGAATCAGGTCGG
24 GCCTGCTICCGGCCATICTGATCGGITTGTTGGGC.GG GTTTGCCCGAACGTGCGGTCCGCCTITCTCGGTGTC
CGGTITGITGGGCGGT
24 1 GCAGGGC.AGTGTGTATCCAGTC.GCCTACCTGITCGG
CACAGTGC.CGGATAAAGG
AAGGTAGGCGGGCAGC
35 CIGTGIACGGAATCGICGCCGGGCGCATCACiCAGCAT 1 GACGC , AGGCCGAGG
ACGCATGATAITTGGCTGGCGGGGCAACCACTITAA CTGTCGCTGCTGATTCAG
AACGACGATICCTGTGATG
36 TGG ' ACGCC G G
TCGCCGAAACGGGTAAAC
CCAGCCGTTGCGTGCA
AAGCAACTCTATCCT
ITATCGGCC
24 TTGACGGCCTTCCATTTTGGAAAAGAAGATTTGGCTG ATGGTGATGATGATTGCGCCC.AATCGGAGTAAGCG
AAAAGAAGATTTGGCTGA
39 . AAGT GAAA GCAACTFITGGAACG MT AGT
CGGTAAGGATTGGGAACA AAAAAGCGGTAATAAAGG
40 AAAGGGAA AAAACCif A (3 (AA
TCATCGICGAAGRAAOCGGCTCAATGCGTTGCGTCC
CCGAACAACTi COT TTCCI
TAGCTTTCGCCTGAAACG ACGCAGTTGITCGGAAAAC
GGGCAGGATGIGTTCGAT AGGITGATGCTGTCGAACT
43 ACTG AIN'S G G
TCGC1GCTCATC:AGGCUAGGICGICTI6CCA FAGG
CATGGCGGCGACAAATGC
24 GlIGATITGAAAAAAATGCCGTCTGAAACAGITICTC GFTCITTACGITITTGTGGGGCTFACCAGGCAFCCA
AAACAGTTICTC:GAACGGI
45 . GAACGGTA AACG TTCiCATGAGGACAGGTIG A
.
24 ACACCCGACCACCAAAATIOGGCGATTGCOGCATCiCi AGGCGIGGATITCCGTIACCOTTIGCCGCATTGCCC
46 GC (AC 1TACCCO:GAGGACGTGC
CGATTGCGGCATGGGC
TCGGIATCGGTGIAACGGCGTMCGGIA
47 GCCAC TATATGCAAA CTGC.CTICTTATGGOGAAC
GOTATCICTCTACGCC.AC
24 CACAACCGCCGA.CATCAAGCTITCFMTCGGCGACie GGCGRiGCGA TGAT
TGCCGCGAATCCCCAAACCGC
TITCMITCCIGCGAGCGT
TTCAAACCCTTGCCCAAAC
24 ACOICTGGCTITCCATICGGGTCAATfCCCGAAGCT
TGC.CGGAAGAAGG TITIG
50 TCCTCAGGCTCCGCCAC.ACCCGCTCCTTCGCATGG aiC C
OCGCTCCITCGCATGO
24 CATI GGCAAAAATAGCAGCACAMAITAAA FCC:CAAA
ccrAcc:arrrrlaCiaTGCMGCAC:AAACAGTICif ITAAA FCCCAAAACAGAAA
51 ACAGAAATGAC CG CAGCTCAAAATGTfGCTGT TGAC
24 GCCiTATIGTCIGTGCGCGGATTCICCIGCAAGGCAT
52 CGCACCTTCGCIGCACiACTGOGCAACGACAGCGAGG TCAT GCGGGC.GTGTTTGCA
GGCAACCACAGCGAGG
53 ITGCCGGITCGGAAGCCGCGGTACGC.GCATGACGG CTATGG TTAAATGCCGTCCGCGC
GGTACGCGCATGACGG
24 CGC.ATGAAGATTGGCTICCGCMCTGCTGGCGGAAT GCGGCGAAGATGAATGCCAACTCGAAGGCGTAACG
54 CG . AAGTTG ACAGCAGCITTCCCGCTA
CTTCTGCTGGF:GGAATCG
24 CGC.TTCAAGGTAGCCTTTTGCCGATGTATCTCCGCCG TGCACTGAAGCCGAGTA11CCGAC13TGCGGT1TCG
GATGTATCTCGGCCGGCT
GGCTACGGACACGGCA
24 GCGAAAGITTCCGTNVµAATATCGGTCGGATTAAITT AAGCCCAATGGGAGAAATCGTGGAITTGGGCTTTC
AATCGGGAATAGTTGGAT TCGGATTAATTTGTTCAAT
24 CCGCCGCCCAAGAITGGAATCITCAGGAAAGCMGC. ACGCTTCGACCGTCCAATCCCACCGGTCGGATTGCG
C11CAGGAAAGC1TrGCCG
CCAATCCGTGCAAAAACAG
60 C GACCGTITCGCGGCTGTCF CGCCGACAACCOCCT!
GAAATCCGTGCOGGCC GCATTGTCCGCCACGC
24 AC.GGITTCGTCGACGGCACGTCGGGGAIGATGGCGA CCAGC(CGCGATCCTGATACACGACAGCATATGCGT
TCGGGGATGATGGCGAC
AGGAGTGGTAACCATGCCGACAATCCAACGCGGCC
62 AATC TGAC ACCGITGTAATC.GGGCGG
ATTGIGGCTGCCGGTAATC
GTTAGGCGGCGTGGC.A
24 GCAMCCAATC.CCGACACC.GTGCCACGCCGAAGACG GGCGGITTTGATTTTGGAGCCGGCGCTGITGATTTT
GCCACGCCGAAGACGA
GGTCGGCATTAAAGACCT GCTICMGATTACCCTGAA
TTTGITTGATGCCGTGICC
ATGTCGGTGGCACGGC
24 TGCTCGACAAC.GGCGITCCCGTTCGGCGCAATAAAG
67 C GGTGTTGITCGGCGGC.AAGGACGAAACCGCCGC.AC
AGGCGGCTGACGGGAT CGTTCGGCGCAATAAAGC
24 CGTCGCCGCCATAAAACGCTCGAGGTGTATGGGCAG CGGCTGCCGTGTCGGTAAATCACAGGCCTAGGTrA
68 G ca CGGTTGAGGICGAAGGTG
TCGAGGIGTATGGGCAGG
CTITC.AGACGGCAITGGIC
CAAAAGGATGTTGCTGCC
70 (.3 TGC C
CCGCCCAGCCATAAAACAG
GCGGGCGTIATGACGGA TTGGGCGGATATITCGGC
24 GCCCGAACCGATGAAGCAATOC.AITTIGCGCCTGATT 1 GITCITCGTTICCCCCGAAGCCGGCCITCGACGTTFIT
ATITTGCGCCTGAITCCGA
73 GCCGAITITGCCITTGCCGCGCCITACCACA ICGCGC GCATCGGCC.Cf GCCTACGAAGCAAATCGG(XiGCG CCTGCCCGCTGATCCT GCCTIACCACA ICGCGC
ITGACCACGCCTTGAATCA
CGTCAGCGTAAOCATGPM
ITGCCGGCAACAACGAC
,XCATCCCCITCCITTATGC
24 AGTC.CGCTACACGCAAGGCCGCTCGTTGATGTAGCG TGGTGGAAGGCTTCGAACATCGCACCGTTGCCGAA
CGC7C0 ITGATGTAGC:GG
24 GGCTTCCAACGCGTC.ATCCAGGCCAAGCCTTTGCCTG GGCCGGGTTTCGACCGTGTGCGTTGCCGAAAACAC
CAAAACGTCCAAAGGCTC
78 C CC GCCAACiCCM
GCCTGC
GGAAAAGGGCAGCGGATTGACGGCGA ACAATTACCCGCAAACACG
CCACAACAACCGCATCC
TCAGACGACTITGGIGGC TATCGCGACGACTTCCATC
24 GACGAGTICCACCi ICGTTCGCCACGCTI T11 TG IG 17 CGTTGGAACAGGCGCA
24 CGAAGCOCCGGAATT IGCCIGGCAAAT CMGS:GAGA GGATG
ITGGGCCiCGGGCAT TACCGACGCGGCGAA GGCAAATGTGGCGACA IA
=
AACTCAAACAGGTTGCGG
24 CGGAAT ICCGCGCATITC3C:AGAGGICMAC ItGAAG GICTG IGCCCACA
ICGGICIGACCGCGCCCCIGAAC
ITS GCGG TT CTGC:CGGC.GCGCATAT
GGTCAAACTCGAAGGCGG
24 CIGCCCITICIGCAAAGCCGTICAGTGCiCiC:GAACOG
CCiCiGAACGGCICTCiCiC:AACCGCiCGACTICGGGOG
86 GT AlIT CCIAACGCGCACGGCTT
CAGTGGGCGAACGOGT
24 GAACAGGIATTC.CGGCGCGGACCGCAGGGATTACGC
ACGCTrAGGGIGTCTGATCGAAACCATACGCCCA ATGAIGGGAAGAAGGAC
SS CTG AGAC
GGCAGGATTITATGIGCTG
ICGTCCAAAAGCAGGAAACGCCAGCGCAT
TGTGATOAATGICGCGCA
24 AACGTIGTGAAAC:GGT1IGCGCGAAGGAATIGTCG
90 GCGTGCTGCTGCCGGAAACGCZAAC.ATCATGCCCG GGCGAAT TATTC.CCCCGTTGCCCG
CDOCGAAC.ATCATGCCCG
GGAGITGCCGCGTIGGAAGITGGCIGGTGAAACCC CGAIGTAGTAGGCGI GITC
24 ACCGTCATATGCCITATTGTCGTGGGTAAGAAGGAA TTC:TGCGGATGITTTICTCCGTAATTAATCCGCITAT
GGGTAAGAAGGAACAGGG
24 GCCCGGTCAMAGCGCAAAAAGTTCCGATACC.TTTGC GGACAGCCGGITCCGGTCACAGGAACGCGCCGTC.A
TTCCGATACCTTTGCGCC
CGTIGAGGAGGCCGTAAT C.AATCGGGGICTGACAGG
ATAAGCGGTTTCGGGATG
GATCCITAGAGACCGTGCG
CGGTTGCCAAGCGTCC
GGACGGTCGCAGAAAAGC
24 CC.GATACGGGGIAAGGCAGGATGCCTGIGGACGCA ACZGITATTGGCGGATITGGCGCCTTATACGCCGCC
99 AOC GC CCGATATTGGGC.GCOG
GCCTGTGGACGCAACC
00 GTTTGCGCCGCCG1C:17TGGGCGOGGTTGCCGTAT AGGCAA CGCGATIGCSATOMAA
GGCGCGGITGCCGTAT
CATGGGCGAAGGTGGAT
CCGCAACAAGAGGCCG
GTCCTTGACCGCGAACC
ITTTGOGTAGGGITTAGCA AAAACGCTAATC.ATAAGAG
25 GCTTCATCTGC:TTTGTGCGGGICAGGCATCGGGAGG GAAGC.GAGGICAGGCGGC.ATCCGGGATTCGAGCGT
CAGGCATCGGGAGGGAT
25 CGACCACATCGGACAATOCCCiC.GCGGCAGGGGITGT AGCCGATAATCAGGCGTGTTGCCAGGCATTACGCG
CGC.GGCAGGGGTTGTT
TGTGCAGGAATTGGCAGG CCGCCTGAAAGAATCCCAT
CGAGATGCACGGCTCAAA
25 GGCGGITAll. I I I tiGCCTGCCGTICCGCCCGAAAAT
GTGATTCCGGCTGTTCGCGGTCAGAACCGGCCGCA
GTTCCGCCCGAAAATCTGC
GCAGGAGAATCCGAACCT
GCCTCCTGGCCGCCGATTAACACGATGCGGICTTTG GACGATTICITCCTC.AGCC
TAAGGCTTCGCTGCGCCAGATATCGCGGCGG TT TCA ICGATACGAACGICCGTTA
12 AC G TGTCCi ITTCGCiOCGAAG C
TCGGCAAC.ATGAACGAAG
CCCTGCAGATTICAGCCG CGCCCGATTACAGGCTTG
GCTGCATGAGGACGCTCTATGACGCCCGTTTACCGT TGACGTGAAATCCGACATC
TGCTGAAGCACAATCTGA
TCAAACCCCCCTTGC6CCCCTGAcCCCJCG1TrFCG AA CG CCTGACGGCF
GGITTFCG
16 ATCGOGGCIGTCCIGCTCUGGGCGI GGAAATT ItGG GC (ST
TGGGCGTGGAAATITCGG
2$ AAGCA TCAACGGCTIGGAGGCCCGCCACITTGCCGT
CGGGCAGCGTCAACGA CGTCGAGGCAGTCCGA
18 ACCG (SAC GGCACOCCCGAAAGCA TI ATM
TGAAIGCCGACCG
ZS, CGGCAAGAAGCGGCGGTATGTTGCCTTGTGTTGCCG CGCCGCCCTGCC1TGTGAAGCTTACAACCGCGC-AAC
"fTGCCITGTGlIGCCGT
25 CGGACAGGAGGGOCATTCiAIGICiGGGTCGAAGGCA
ACATCGGGCATITCF TC:CCAGCTG7CCGACCTMCiG GGGGICGAAGGCAAGATG
20 AGATGG CGG AlTTCCGTACC.GGC:GGC
25 ATACGGCTCiCATCGCGGTCTGCC:GACAACCiAAGAA
GGACTGGTATCGCGCCA
22 ACAC.CGCCGCACCGTATCCCCGGCGCGTGGCAATA CZGAACGCCTCAAACCOGCCGGCGGACCATTTGCG
TGTCGGTITGCGCCGT CCGGCGCGTGGCAATA
as GTCAAATCGTGGTT TGACGGGCACAACGACGGCCA
23 17CCCACCGCITC:CCCGACGATGTGATIGGGGCGG GCC GATGTITTCGGCGGCGG
CGATGTGATTCGGGCGG
25 AAAAGCCGCiGCGCAAA ICAGTCGGGTCITIGGCCG
24 TCCGGGCGGTTTIGGCGTGTTCACGAAGCCGATGCC C GAACCIGGCX.CCCGATA
TTCACGAAGC.CGATGCC
25 GCCTTGAGCTTGTCGATTGCCEAC.GCATCGCACTTGC CCAAATGGGCCAATTGGGGCGCGCGITTGCCC.TCG
GCCTATGACGCATTGCAG
ACGCATCGCACITGC.0 25 CGCCCATCAACCCCATCACTGCCCGAT iCTIGCCGIT GCGCATATCCCATATG1CG
CX.OGATTCTTGCX:GITGC
27 GACCi GCGCGGCACTICGACCAATOCCCC.TGCCCGCiTTT C
GCGGCAGGCATATTGAGG
25 GAACAC:CAACGCGTA AA!
CX1GC.GGGAAAATGCGC
25 Gc.ciGGCAcGCGAcirrrIcTcMGcAcrGccATcAcc ATIGTCCGACCAT
ACGCCCGTGCTCAGCACGGGGA
CAAGCACTGCCATCACCG
GAGITCCTGITGTC.GC.TATTIGCATTITCAAATGGTA TTATCGTIGGTGAAGGAA CTAGATCGTCAAAGTTTAG
AGCA GGGAAGG GA CA
25 T17TCGCAAATC1TCCGCCCCGGCGAGATTCTGAACT GCGAAGAGCGGAAAATCAAACACC.TTGATTTCTTTG
CTTIGCCTATATCGTGAAA
C.GGCGAGATTCTGAACTT
ACAATACCGCCAAGCCG
GATGACCGCATCCAAAAC
CGCCGACGCOCGATTICACGCAGCATCGCGGTGGA C TCGACCCCGCCCTGTT
GCAGCATCGCGGTGGA
ZS CGGAAGACAAATCCATGCCGCCATCTGTGGCGCATC
ATCTGTGGCGC.ATCAAAAC
36 AAAACC CCLIGCCCCGATCCCITCIATGCFCGGGOGGTATC:C
GGICGATCFGCAGGCCA C
37 ATCAGCGC.GGCA. s IT.CCZCACAAGGIGCGGCAGT
ATTTGGCCGATGCGCTGCAAACTGTTCCCCGCC.GC CCCTC.AATGCGCGGCA GCACAAGGTGC6GCAGT
ITGGIGTCCGACGTGTATCA
38 ATCG GGGC TCZTTGGIGGATTGGC. TG
GCGTTGACGTGATCCATCG
25 GCTGGCCGGAAACCATATCTCGAAAACGCTGCAAAA C.C.CATTAATGCCGMATGCCAGAGACCGACAAAAA
AACGC.00TGAAGTTAAAC GAAAACGCTGCAAAATGA
25 TCC.ACCCGCACCCGACCTTATACACCCAAAGCCACTT TGGTTTTGCCGATGTCTGCCGGATCGGTGAAGTCTG
ATACACCCAAAGCCACTTC
41 CCGTACGACGCTGAACGGCTCGGCCIGGATGIGGC GC.AGTACCGCCCAGCAGGACGAC1TGGCTTGGGCG GG
TCGGCCTGGATGTGGC
25 GGCATCAGCAGCTCCACGCTAAAGICGAAAAaiTCG GGIGGIGCAGCCTGCCGAACTSCAGCAGTTOTTACC
25 CTCCGATCGGCAACTGGCMGAAGAGGICAGCGCGA TGIGGACGAZGTANCCGCGCATGC.AGCACGACAGC
TGTTTTCAAGCACATCGCG
GAAGAGGTCAGCGCGATG
GCCGCGATTTGTTCATCG
ZS CGCCGCGTTCGTACTGCATGGACACCATCGAGCAGC ATTTGGACTTCAAACGCGGCAGTTCGGCGGGGTAC
TGCTGTCCGTAAAAGAAG
GACACCATCGAGCAGCG
AGTGTCCCICGICGCC
TTGACCGAATGCTGCATCA
25 CGGTGAAACACGGCATTTGGTTGACGGC.ATCGAC.GT 1 TTGCCCATGCCGACCATAATCGGATTGCC.TGCCCGT
GACGGCATCGACGTGG
ZS 1 TCITCGGGATIGGATTCGCCGG TATCCiAAAGCGGC
49 GOCGOGCGGC.A.AAGTCi 1TCGCGGTAC:ACCACGCT 1 AOCGG , TTIGGCTGCCGAGCTGC CGCGGIACACCACGCC
25 GCAGOµGCAATGTCGGCATTACAGAAAGCCGGTGTG 1 AATCGTTGCTGCCGGCATCGTTCAGAGGACCGTTCG
50 GA ' GC GTCAAACCGTTACCGTTGC
ACAGAAAGCCGGTGTGGA
2$ CCCGCCCATAAAGCCATCGGCGCCGGAGAGTGCAAT AGCCGCAGTGGATTAAGAAGGGTGTCGAGCTGGGT
CAGACCGTTGACCAGATA
CGCCGGAGAGTGCAATC
TTTGTAGGCAATMGCGCG
25 ACATAAAGCCGGCGGCAAAC.AACTCCGTCGGCGAA
53 . 1 GITIGCCGCGCTG ATCCGCLTATCGGCAOCAG TIT TGG
GAISACAGGGCCGGTGA CCGGCGGCAGCAGTTI
25 CGAC.AGGCAGGGAGTCGGCATACCTGACGGICTT CA
GAGCAGGCGGTATGGTCGGCGATTTCCCCAACAGC
CCTGACGGICTICAGCCC
ZS CGG'16TCAACGATAAGGCACGGGAACATC1TTGACC TCCAGCATCACATACTCGAAAGICAIGCCAALGCTA
ACGGGATTAAATTGAATT
5$ AG I TCTGG TGCA
GOGAACATCT11GACCAG1 .
CCGACITTAACTTCAATTT TTGAGCGTTITTGAATICA
56 TICAGC GC.37 CC GC
25 GCAGCGCAACCTC:AAACCGAAACCTGCGCGAGTG1T I GCAAGGGCGGGITCGGTAAGCAAGAAACCGTCCGT
CCITICGCGAGIGITGC
25 CGAC:ATCTCGACGGCAACCGGITCAGGTIGTIGAGG 1 G6TIITGGCGGCAGGCGTATAACTACCTCAAC:GCC
Sil CGT G CA TGC:GTTGACAAGGTTGCC
59 . CGCCCTTGCCCAACGCTATAGCGOTATITTACGC1COG 1, TTCG C
GCGGTATTGAC:GGCGC1 .
CGACGCCAT TCGGIGTAGAGCGTGAACT
C.GACCTGCATC.ATTTCGCC
25 CGACCACGCTTC i GGTCGAGGGGAAATCGGCGAAAC AAGGCTGGCGACTIT CAC
GGGAAATCGGCGAAACGG
25 CCiCTITCAF6CGCCTGACCGGCTITAI6TGCCTC1TTT
62 TCGCCCTGCACAAGCTCCITTCGGCGGCTITGCG am CGOTATCGCGCGCITT
TTCGGCGGCMGCG
25 TCCGCCCiCGCCCICITTTAC.AGGATTTGGTGTCGGAG
GTTITCAGCCAMACGTG
CCCCGCCAAAAACGGC
CCGAAGATGAAG ?MCC
64 CG GGCCCGGGGCGCATGAMTCAACC.TGCGCCAACG GA G
25 IACAGCACCCGCCACCTCiCTTCTTT TCGGT AATGCTG
ITGTETTCGGIAATGCTGC
65 CG TTGTTCGACC.GCCGCGTCAGCAACCGCCTGAC.0 GTGTTCGCCGTGGTGG G
25 CGA 1 CTCATCCGTATCGCCCCCK:GITGGAGGC1CGAC
GGITCGAIGGCTTCGGAIGCCGMGATGCGCGTIT
66 GA GC C.GACCAGITTGGCGGIT
COTTGGAGGGCGACGA
25 CiGTAICGCCMCGGIATGGCT.T.111GATGGTGTCGGI GGACGAGCSGITGACCiAGCCGAACACGAGCMCIT
G1CCAA.AAGAC:GCAGACA CGITGATGGIGICGGTGA
25 GCAACGACGAACTGGCGCACTCTICGGTGGTCATGC CAAGCGCGCGCAGGAC.GTACCGTATCCAGCGGATC
68 GG . CG GCCGCGAATTGCACTICT
TCTTCGGTGGTCATGCGG
C.GCCGAGTTTCAGCGTG
25 CGGGC.ATCGTTGC.TTTIC.ATTCATAGTAACGGATGTG
AATTTGCAGGTTATCACTATGTGCAGGCAGAATGAA CAATCCATCAAAAGGCTCT CATAGTAACGGATGTGGA
GGCGAAAAAGGCGCTGGCGTTTCGGGTGGCGCAG AGGGGAGC1. :GGAAGAAT
GCAAACTGCCCACGCC
GAACAACICCTGTMCC:TG
ACAAATICACAOGGGCGG
2$ CGGCGCATTGGCAAAAGGTCGCTTTATCAGGCGATG ACITTGGIGGCGACGACTTCGTCCCGMCGGCGTG
GOTTATCAGGOGATGGA
74 CiAGG C CCGGCGGCTIITCGATT GC;
GTATCGGCATC.GITACCAG
75 ACCAGT GCGCCCGAAATCC:CCGTCAGAAACTCGCTGCOCGC
CCCGTCCGTCCGCAAT
2.5 CGCACCACCGGITCITCGITAACATTGICIGICGCGG
CGCCATCGAAGCCTGCACCiCGTGAAAI CGGMCGT
GATTGTCTGTCGCGGCCI
25 GGCGGGAATGGCGGCAAATATCATCGGC.ITC. TGCST
ACCTGAAAGACCTGTICGGCAC.ACCAGGCCAGCAG
ATCATCGGCTTCTGCGTG
25 CCAGACCGATATGCCGTCGCCGCCATATCCGACGCTT GCATTGGCC.GTCATACCGCTAGOTCGAGTGCTITGA
CGCCATATCCGACGCTTAC
GICGMGATACCGATGC.G
79 GG1TTGCGCCCGCATGIGGGOG1TfACCGGITCGAC GGG AT
GGCGTTTACCGGTTCGAC
CGITICAAACGGCGCGG
25 GCAGCGAP.CACGATACCCGTATGAGCAGGGCGCATC CGATACGITCGGGGIGGATGITITGCCAACCTCAM
GAGCAGGGCGCATCGA
GGAAAAAATCCGATGCCG
GCACGATGGAAGCAGGAA
ZS GCGC.AGGCGATGGTGGTGTCAACGCCGAAATCGTC
ITGGCGGCCIGCAGT
84 TAAACGG TATTCCC P.ATTGCCGGGAATGGAAA
TCCGTATCIGGITAAACGG
TAATTTIGTCGGCGTGTTG
AGGGTCAGICCGATITCGT
25 CCAGCITCGCCGAC.ATCITCTTICTICGGTCAACACGA 1 GGGCGATAATGGCGCGGAAGTCITC-ACGCGAAC.AT GMAAGCCGGTCGACTTG
TCITCGGICAACACGATGG
ZS AATCCCGCCGAAGAAACCGATATATAITCGAGTGCG
GCACTITGCCGCCGATGITTTCGAACOGCAAGTTAC AT ATATICGAGTGCGCGCT
BB TCCACCCACGGCGCATTCGTCGCCATCGTGTGTFCG G TCGCGGTAATGCACTGGG
TCGCCATCGTGTGTICO
ZS AGCGCGCCAAATCGACCGITTGGCAATCCTITTACGC TATTGCAGGAGTTCGGCACCITATGCCGAC.AGTTIG
TTIGGCAATCCTMACGC
TTTGGCTGCTTCTATTGGG
CGGCAGGCGTITTCFACC
CACCAAATCCGCCTOTACG
ACAGCCTCGCCACCAACGCACAGCAGCATCGCGAT
CX:GGGTCGGI ATGGACA
2$ CGTIGGCGCGGATTITGGCGAACACGGCGGAAACA
93 'FTGICITGCCCACGCCCGGCGGIGGAAGGCGGGAA OCiC
GCGAACCiCAAGCCGCT CGGIGGAAGGCGCiG AA
25 GGATTTGCGCGCCCAAGCFCGC.GCCGTCTGAAAATC GCCGTTTCGACAAAGGCGGCCACGGATTIGTGCAG
94 C GGC rrACCT3CCCCGACA
CGCGCC:GI CTGAA AA ICC
AAACGCCAGCGACGAAGTGCITCCTFCATCAGCTTG
9$ 1 CIITGACGAAGCGGCGCA TIGCCGCCGCCAAATCCA CMG GCAAGCCGTGCF GT ACCF
GCCGCCGCCAAAICCA
25 CCAAAGCCCITGATCGCGICCGGTGTGGIGCACiGAG GCAGCGACCAGGAATGGCGACGAAC:CCITCAGCGT
96 GA GATG GGGATGCGCC.GTCTTC
GGTGTG GTGCAGGAGGA
97 GATITX.CGCCGCCGTCATGCATACGCGCCATCGT AGCCCGTTCATTTGCGCCACSGCGTACTCGTCCGT C
GC,ATACMCCICCATCGT
f3GCACICAAATICACAGT
ATC:GATTCAAAACACTGGC
TGCiCAGCTG ITCGITGCITIGG CAATCATGTTCGCATICiG
AACCCAAAC.TCAAAGCCA
26 GCCATTIOCCGAIGATTGOGGCCC.CCGGCAGGACiC:A TGTACGGCAAAGGAGACGGCTG
TAATCGCCGAAAC
00 TT GGTCT CCGCAGCATGAGGCP.AC
C.CCCGGCAGGAGCATT
26 TGTTGTCGTTCiTGGGCGGGCGGCTTTGTGCAGATTG CGGCGGCAAGCGTCTGAATGTTGGACTACTATCCG
CGGCTTTGTGCAGATTGC
26 CCGCiGIT TCACATCGCCGTAIGAGCGACGCTITACCG
ACCGTATTCATIGCAGGIGCCGATACAGTIGGGCG GAAGAAATCGCCCTGCFG
TGAGCGACGCTTTACCGA
AAAGCCAGTTGGACTICGCCCCCiCAACACCOGCAT T ACiAGCiGCITTGAGCA
26 ACITCTCGGACGTI CCM TCGGGCTI CGACICCG 'TA
GCACIGCTGGTC:AIACTG
04 GCGCATCGATITGGGCGCGGAAGCC.GGCGMGAGT CCG C
GGAAGCC.GGCGAAGAGT
OS CGTATGTCCGGCGCGTCAGGTCAGGGC. TTC-CiGGCGT
GTCGGTCACCAGCGCCACTACTGACCICGCCTITGG GCGATGTTGCCGCCCA TCAGGGCTTCGGGCGT
26 CAGCGCGAAGTGTTGGCGGCGAC.GCAGATAMATCG ACACGATGGCGGGTGCTTCGGIGGTTAAAGIGICA
CGACGCAGATAMATCGC
CACTCTTGGGGTCC.TGAAG
eg GT GAAC-AG CCCGCTGACGATGTGTAC
GCGCTCAAACCIITCGGT
CACCCCCATCGACCTGAT
GCACIGATTGGCGGGCCATGC.A ATCCGACCGAA ACC TITC.C.GATAGATTCCTGC.C.
CGGTCGGLITTITCCIGC
TGAACAGGATGTTGCGGA
CGACACGCTCGTCCGT
CAAGGATGCGGGCGATG
TTGCTGACGCMCGGIC
26 AAGTCATGCTOCATGCCGTCAAGGTGICAGTGCGTTG TGICGCGGACAAAACGATGOTtCGACITTCGCCAAA
TCGAGGAAAAGGACGAA
AGGIGTCAGTGCGITGTG
GCAGCCCGCATGTTITTCC
26 GCATCGCCAACCGCCTGTTGACGCCGTTliffiiiCAe-CGGCTGCGTTIGGCGACTACAACTOITGCAGCTCGA
ACGCCGTIbilitiCACGT
26 GCGCACCTTATCGCCGAACAAACGAAACGfelfeiGC ITTICCCCGAGCGCGGCATTCGAAGCCGCCAATCAG
CGAAACGTTMTGCAGCG
20 AGCGG TTII.1 CCTACGCGCCC G
CAGACGCGTTCGGCAAC
GCTFCGGCGACAACAGC
AGCCGTATTGTTTCATCAT
CCTACATGACTACTTGGGC
CGCTICGATTACGGCGIT
ATTFICTOMIGCCTGCG
TTGGACMCGCGCMIGG
CTGTCTCAGTTTGATTTTT
GTAGGGGAGGITOTTCTG
ATGCTGAAATGGACTCCT TATCGGATAACATCTCCTT
GTAAAGTTACCCGAGTTA CAAATGTGGGTGACAATC
GAAGATAAGGCTITITTFA
AriGGGIATCTAGGCCACT
TTGAAACCACTTCTGAM
ATATCAACCAACTTATTGA TCCTGATTCATTTTTCTCFG
32 Torrai GCAGTIG GATCC
AAACAGTTITTTGCTGGIT
CTCCAAAGTAATCGGrfCG
CACGCCAAMCGCAAGGT
GCCATGAGCTGCCCTACG CCTCCGCGCTGATGCA
26 CITTGGCAGTGGCGATGlIGCCGTGCCITGGGICAA CCGCTITGGATTIGOTTOCCGTCGAACCCGGACCGA
ATAGCCAAGGCGTGATCG
CGTGCCTIGGGTCAAAGA
CGTGGGIGTGCCGCAA CTGACCGCCATTCACAACG
TACCAAGGCCGCTCGTAC
26 GTTCGGTCACCGTCGGGCAACACTTCCiGCGGTCAG
CGTCCGCCAAGATGCC
CC.TGATCGGCAAATGGGC GGACGCGATACCGACGA
26 CCGATGMaITGCCCGC1TCGCCTGAACATCCCCACC COGGCAAACATCCAACOCGACCGCCGATFCGATCA
GCCTGAACATCCCCACCA
26 GrCIGTCX:CTGCCCGGCAAACAATICTTIGCGGCCT
GACCTGCGGCTCGTW
GICITATTCGGCAACATCG
44 ATCCTCGCCGTCCTCGAGCCGGGCGGACTTCATCGTG . GCCGACACGTTGGCGCAATTCGCCCACCACCCGA
TTGAGGCTTGCGGGCA GGGCGGACTTCATCGTG
GGAAACGCCGAAATTTTCC
GCGTTTGCGGTAGGCG
TCCGAACCGTCITTCTGTG
26 TTGIGCGGACGCGTGGICGATGICCAACAAGAAATli ATGICCAACAAGAAATGCG
TCCAAATCCGCCTGCGA
ATCGTCAATACAGTTCCG ACATCGGATGCTCTTCCAG
TGTCCAACTCGAGGGCG
CCTTGCACGACGTGIATC GGTCTGACCGTCTITGAAG
CGACAAACGTGCCTTCCG
TITTGCCGAATTGATGGC
TCCAAATCAAACZCGCCC
26 GGCGGCATCZAAGGCITCGCGTCTGAACACCCITS.0 CTCGAAACGCGCGTATGC CGTCTGAACACCCTCCCG
CATCGCTTTTGCCGCCG
SS T TGCC TTGTATTTCCTGCCGCTGC
GCACGGACAGGGATCGT
CGACCAACCGACCGATIC
26 ITITCGCAATCGCATTGACCGCTCG/V\GCGGGTrTT
GGCGCAACGGOCTTAC
GCTTIGGCACTGAAAAGC
GTTACTTCCGATGCGGCG G
CAGITCTICTICGGTTTGC
ICGCCTOGGGAITGAACA
ACACTTTCGACGACAACC
CTATCTGAAGTCCGAGCGC
TTCATTGCAAAAACCGACC AAAACTGTACAGCAGGCG
TGCCCITCATTGCCGTTGGGGAGGGCGGTITCGTCA
TTITCGICATCCGTTACCGC
26 GCCIaiCTGCCITCGAGAITCCIACIGGIACATCGAC GCAGAAAGICGGCGAATACCGGGITCGGCAGGCA
TITCAACCGCAAAGACAA CCTACTGGTACATCGACAC
63 ACG I CCITT , GC
OCCGAACACCTGCAACAA
AAAAACGCCATCCCGAAC AAAACCGTTCGTTACCTCC
GCGACAAATACCCCGITT
TTGAGCGGCAAAATCCAC TCGTTTATGCTGATCCCTC
26 CGGGTAATCTITCCGAAACCGTTTCAATATAGCCAAG GTCAGACATCGGGAAATGCTITITICAAAAGCJkGAT
TTCAATATAGCCAAGGGGA
CATACCAACGTTICCCCCA
GCCACTGCGTGTCCATPC
TCGICAACAGCGICGICG
CACGCGCTGAAGGAGGT
ATTGCCGGGAAACACGC
ACTGCCCAAAAGCGTACC
GCCCICGATAAAACCTIG
GITCITTGGCGGCTTCGT
GCGGCGATTATCATCTGCA
AAAATCGGTGCGGACGTG
ATGAGTCCGAGGCTGTGG
TCCGAGTGCTTGAAGACC
TTGCCGATGITGCCGC
OGAGTATGGTGGCGCG
IGGIGTTCCAAAGCAGTAC
ACTGGIGGAAAACGGIAT TGTATTCCTGATITCCGGC
82 GGCC . TGTG GC
26 GGTACTGCTTCCTTTGCCGGCTTTGTTATCGACTTCAA ACAGGCTGGTTACITTGutttlIGCCATCGTGACCA
CITTGTTATCGACTTCAATT
GCGCCGCATAACCTCTT
AGCAGTAGGGICATCAGG
GACAGGLAGCGCATCG
TCGCCCTGATAGCCGTC
SS C ATG CC GAF
ATGGGCGGCGAGICC
GGGGCGAACAACAOCG
90 GCMAGAGITTCGC=GGCZGCTGCTCTGCGGCGGAC.A ATATGTGCCGTGCCGCGCTGCCGGTCGTTGAGTTCG
ACCGCCCTGCCGTTCA TGCTCTGCGGCGGACA
26 CGGATATTTCTGCCTGCCCGTTGAACCCGTATACCAA MAGGGGCGAAGMAC.A3GCITCGCGTTG1TGTA
91 CC GGTTT , TCT
TTGAACCCGTATACCAACC
GCCGATCGTTACTTGTTCG CGGTAGCTAAGACGACGT
AATCGACCAAATCCTGCG
CATTTCGCGCGCACCC
CATGTCCGACAACGCTTTG ACCTCTTCGCGACGCT
26 AAATCZEGT-Cr..CGCACGCGACGAGCAAATGAAACTG
C.GTGGGCTTACCTCiCCGCTCGATGTAAACCTCGGGC ACCAGCAAATGAAACTCiG
GCTTGCCAAAGAAGGATT CGGAATCGTACAGGAAGA
96 GACG ClIG GG CC
CGCGTGGCAGGAACAC
AAAAGGCCGCGCTCG
26 ATGGITTTGCCCGGCGAGTCGAAACTGGTCGAATGC ACCAACAACAGC.ACCTTCCTCGCGTTCTAGGITCGG
ACGCCCTTGAATATCGTGT GAAACTGGTCGAATGCAA
27 GGGCAIAGGAATAGTCGATGACGGATGCGCGMai 1 ATCTGGCGAAGGAMTGC.ACCiTGCGAAATACTGTC
ACATTCATGCGCCITTGAA
00 GCA TA 1 GcaiA I C
GATGCGCGTTTCGGCATA
t GAGMTGCCA
02 CfCGC 1 GITCC , TGCTGCAPAGCGIGGC
CGIATC:CGCACTACCTCCiC
27 GGTCAGCAGTGCCAMACCCAAGGCTGCTGCTGTCC 1 CCGCCCGCCCGATTGTCAAAAGGC.AGGGCGATCAG
02 CT ' C CCGCCATTCCTGCCGA
GGCTGCTGCTGTCCCT
CCTTGGCAGGGATAGATG
MCGCACCCTCTTCCGT
MAAGGCFTCGATGAAGA
OS . GGTVIGGCGGCGGAAACai TCAGACCGCAGGCACC GACAGA CAACCOC:GOCAGAGC1 'FCAGAMGCAGGCAGC
27 GCATCGTATTGACTGCCITTCAACCGATTICCACAAA AGCCCAGTATGAC:AATTTGCTGTGCTGATTATCTGC
TTAAACATACCGAGTGAA CCGATTTGCACAAATTAAC
06 TTAACC CGTAAA TGCi C
TGCCCAAGCACCCAAATCATATTICTITTGCCGTGAT AAAACGACTTITACCAAA
ATTICGGATITAATTGACC
27 .AACCGAAGTCAAACCGCCGGGCGATGAAGCGTGGG CACGCCAATGACGCGCAATCMCGGATTCMCGC
ACCGCAITTITITGACGGA
OS ATT CAC T G G4:GATGAAGCCi TGGGA II
AAGCATGATTTICAGGCTG CTACCATTGATATTGTGIC
09 1GTCTTGCT 1 CCT ATTCTGAATACCGICCG Tf CI
IIC I GTGC CC:GCATTCAAATCATGGG AA ITTGGGCAAAGACCCC
CCCCG GCTCG C G
IGCGTGATTATTIACGCGGC sUCTITGATIGATAATCGTG
11 . TGAMCG 1 GC. CTGACCGAAAAGCACCET AAACG
=
27 TCGGGCTITTGGICATTITCAGAAACAATTACTGGCO GCAAAC:AACAACACTCGGOCCTGC=TTG
AC.AATTACTGGCGTGTC
27 CCGTCT IGGITTTCCAGTGC.AAAAGGGA 7 ACGCAACC CGAA A ITT TICATGGI
AAACGAGCGGATACCGCAAT GCAAAAATTCCAAAGGCA
AAAGGGATACGCAACCAT
27 TCGAATF GCACCGIGAAIGCGCCCiCCGTACTOCIGG GTCCGAAGACGGCGGC
ITCACAACAGCCGAAGCCG
CCICCGTACTGCTGGCT
27 GGTACGCAACZGAACACiCTCTGTTGGCTATGACTGC
AAGCGAAATTCACGGTCG
TCGGIGGTCC.GCTCGGAAGACACC.GACGCTGACCT CCG A ACACCGACGCTGACCT
ICGCICTACATGATTAGACCC:A TAAAACGTGGCAGATACA
16 C ACATGAGiTTG (SAC
AATATTTCCGTTGCCGTa:
27 CTCAAAAAA TACCAOCCCCiAGAGTA TCTGCCAGTFT T CTCiGCiCAAACIGGAAAAACGGATA
7 ATI ATCiCTGA
TATCTGCCAGTITTCCTIGT
27 GCGCGITGAGATAGACOGCGAGCCGAAATGCG (ACC CI GCACGACGGCAAAATC
18 GGC CGOCAAACiGCGTGCACGTAATGCCTGCGTCGCG Ci CCCiAMTGCGTACCGGC
CACGCMITGTGCGTT GGCTCGCCC.GAATCCA
27 ACGCCGTCTGAAAAAACCTTITTGITTTGCAGTAAAT AMCGCiTITTICITTGGCTICGGGGATAMACCGCC
ITGITTTGCAGTAAATCGA
CGAGA . TACGC TTGCAAGCCTTCATCTICG GA
27 GCC,ATGCGCCCAAACAGATCTTGICCITTGTAAGCGG GCGGCTGCCGTACAATCAAAATGATATGTCGGGCG
TTGTCCTTTGTAAGCGGCC
AAGCAGTCGMAATCAGG
CCGGCAATGATTGAGCGT CG
CGCCGAGCAGGTATIGAG AAGGTCGAAATCGCCAAA
TCCAGGCGTCCCCACiTTGTCATCACGCCCAACCTGA
TCGGGGAATCAGAAGCGG
TGACGAAAGAGGCGGAAC
A GG GACTGGCC-CCACGACA A
DEMANDE OU BREVET VOLUMINEUX
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVET COMPREND
PLUS D'UN TOME.
NOTE : Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets JUMBO APPLICATIONS/PATENTS
THIS SECTION OF THE APPLICATION/PATENT CONTAINS MORE THAN ONE
VOLUME
NOTE: For additional volumes, please contact the Canadian Patent Office NOM DU FICHIER / FILE NAME:
NOTE POUR LE TOME / VOLUME NOTE:
Claims (49)
1) A method of detecting a target nucleic acid in a sample comprising:
a) distributing a sample or set of samples into one or rnore individual discrete volumes each individual discrete volume comprising isothermal amplification reagents for amplifying the target polynucleotide, and a solution for rapidly isolating polynucleotides from a cell or virus particle;
b) incubating the sample or set of sarnples at conditions sufficient to allow extraction of target polynucleotides from the sample;
c) generating amplicons of target polynucieotides, wherein isolation of polynucleotides is not required between the extra.ction or amplification step;
d) introducing single stranded regions into said amplicons by incubating the arnplicons with an RNA component molecule and one or more TnpB polypeptides possessing collateral activity, wherein the RNA component molecule comprises a sequence capable of binding the target nucleic acid and designed to form a complex with the one or more TnpB
polypeptides; and e) further incubating the sample with a probe that binds one or more single stranded regions of said arnplicons, wherein said probe is:
i) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
ii) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iii) a cotnbination of an unlabeled double stranded DNA probe and a single stranded RNA
probe with a tluorophore on one end of the strand and a quencher on the other end of the strand;
iv) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other en.d of the same strand; or v) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand;
f) adding an enzyme capable of cleaving RNA in an RNA:DNA duplex to increase fluorescence to be detected; and g) detecting the one or more amplicons, thereby indicating the presence of one or more target polynucleotides in the sample.
a) distributing a sample or set of samples into one or rnore individual discrete volumes each individual discrete volume comprising isothermal amplification reagents for amplifying the target polynucleotide, and a solution for rapidly isolating polynucleotides from a cell or virus particle;
b) incubating the sample or set of sarnples at conditions sufficient to allow extraction of target polynucleotides from the sample;
c) generating amplicons of target polynucieotides, wherein isolation of polynucleotides is not required between the extra.ction or amplification step;
d) introducing single stranded regions into said amplicons by incubating the arnplicons with an RNA component molecule and one or more TnpB polypeptides possessing collateral activity, wherein the RNA component molecule comprises a sequence capable of binding the target nucleic acid and designed to form a complex with the one or more TnpB
polypeptides; and e) further incubating the sample with a probe that binds one or more single stranded regions of said arnplicons, wherein said probe is:
i) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
ii) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iii) a cotnbination of an unlabeled double stranded DNA probe and a single stranded RNA
probe with a tluorophore on one end of the strand and a quencher on the other end of the strand;
iv) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other en.d of the same strand; or v) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand;
f) adding an enzyme capable of cleaving RNA in an RNA:DNA duplex to increase fluorescence to be detected; and g) detecting the one or more amplicons, thereby indicating the presence of one or more target polynucleotides in the sample.
2) The method of item 1, wherein the enzyme capable of cleaving RNA is RNaseH.
3) The method of claim 1, wherein the fluorescence detected is greater than fluorescence detected by unwinding of the RNA:DNA duplex alone.
4) The method of claim 1, which does not include a washing step.
5) The method of claim 1, wherein the solution for isolating polynucleotides is protease-based, detergent-based, or chaotrope-based.
6) The method of claim 1, wherein the solution contains proteinase K.
7) The method of claim 6, wherein reaction buffer contains a proteinase K
inhibitor.
inhibitor.
8) The method of claim 1, wherein the solution for isolating polynucleotides is Lucigen Quick. Extract Plant DNA Extraction Solution.
9) The method of claim 1, wherein the amplicons are generated using loop-mediated isothermal amplification (LAMP), polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplifi.cation reaction (NEAR), transcription mediated amplification (TMA), recombinase polyrnerase amplification (RPA) or rolling circle amplification (RCA).
10) The method of claim 1, wherein the isothermal incubation temperature is betwee,n 55 C
and 75 C.
and 75 C.
11) The method of claim 1, wherein the single stranded region is a LAMP
arnplicon loop.
arnplicon loop.
12) The method of claim 1, wherein the single stranded region is an R-loop generated when the RNA component molecule binds to one strand of the amplicon.
13) The method of claim 12, wherein the TnpB polypeptide enables the RNA
component molecule to bind to a strand of the amplicon.
component molecule to bind to a strand of the amplicon.
14) The method of claim 13, wherein the TnpB polypeptide comprises a Ruv-C
nuclease domai n.
nuclease domai n.
15) The method of claim 14, wherein the TnpB polypeptide further comprises Ruv-CI. Ruv-CH and Ruv-CHI subdornains.
16) The method of claim 1, wherein the TnpB polypeptide comprises about 200 to about 500 amino acids.
17) The method of claim 1, wherein the RNA component molecule comprises a scaffold of about 40 to 80 nucleotides in length.
18) The method of claim 1, wherein a PAM sequence is 3' of ihe target nucleic acid.
19) The rnethod of claim 1, wherein a functional domain associated with the Tnp13 is selected from the following: transposase activity, methylase activity, demethylase activity, translation activation activity, translation repression activity, transcription activation activity, transcription repression activity, transcription rel.ease factor activity, chromatin modifying or remodeling activity, histone modification activity, nuclease activity, single-strand RNA
cleavage activity, double-strand RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA
cleavage activity, nucleic acid binding activity, detectable activity, or any combination thereof.
cleavage activity, double-strand RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA
cleavage activity, nucleic acid binding activity, detectable activity, or any combination thereof.
20) The method of claim 1, wherein the probe is a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand.
21) The rnethod of claim 1, wherein the probe is a single stranded RNA
probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
22) The method of claim 1, wherein the probe is a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
23) The method of claim 1, wherein the probe is a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
24) The method of claim 1, wherein the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or
25) The method of claim 1, where the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
26) The method of c.laim 1, wherein the target nucleic acid is from a virus, bacterium, protozoa, fungus, or other pathogenic organism.
27) The method of claim 26, wherein the target nucleic acid is from human papillomavirus, hepatitis, adenovinis, C'andidia, coronavirus, herpesvirus, human immunodeficiency virus, influenza virus, Plasmodium, rhinovirus, Neisseria gonorrhoeae, Respiratory syncytial virus, coronavirus, or Streptococcus pyogenes.
28) The method of claim 27, wherein the coronavirus SARS-CoV2.
29) The method of claim 1, wherein an extraction-free solution is mixed with a sample at a concentration of about 1:2 to 2:1 sample:extraction solution.
30) The method of claim 29, wherein the sample is from a nasal swab or saliva.
31) The method of claim 1, wherein the incubating step is performed at a temperature of about 20 C to 60 C for about 30 minutes,.
32) The method of claim 1, wherein the amplifying and detecting steps are performed at about 55 C to about 65 C, about 59 C to 61 C or about 60 C for 50 to 70 minutes.
33) The method of claim 1, wherein the target polynucleotide is detected in one hour or less.
34) The method of claim 1, wherein the steps of incubating and detecting are all performed in the same individual discrete volume.
35) A composition for detecting the presence of a target polynucleotide in a sample, comprising:
a) reagents for amplifying the target polynucleotide;
b) an extraction-free solution for isolating polynueleotides from a cell or virus particle;
c) one or more TnpB proteins possessing collateral activity;
d) at least one RNA polynucleotide component comprising a sequence capable of binding the target polynucleotide and designed to form a compl.ex with the one or more TnpB proteins;
and e) one or more of the following probes:
i) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
ii) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iii) a combination of an unlabeled double stranded DNA probe and a single stranded RNA
probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iv) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the sa.me strand; or v) a double stranded R.NA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
a) reagents for amplifying the target polynucleotide;
b) an extraction-free solution for isolating polynueleotides from a cell or virus particle;
c) one or more TnpB proteins possessing collateral activity;
d) at least one RNA polynucleotide component comprising a sequence capable of binding the target polynucleotide and designed to form a compl.ex with the one or more TnpB proteins;
and e) one or more of the following probes:
i) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
ii) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iii) a combination of an unlabeled double stranded DNA probe and a single stranded RNA
probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iv) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the sa.me strand; or v) a double stranded R.NA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
36) The composition of claim 35, wherein the amplification reagents are LAMP reagents comprising F3, B3, FIP, BIP, Loop Forward and Loop Reverse primers.
37) The compositions of claim 36, wherein the probes are selected from Table .
38) The composition of claim 32, wherein the probes are provided at a concentration of 50 nM
to 175 nM, preferably 75 nM to 150nM.
to 175 nM, preferably 75 nM to 150nM.
39) The composition of claim 36, wherein LAMP primers are selected from Table .
40) .......................................................................
The composition of claim 36, wherein the F3 primer is selected from Table .
The composition of claim 36, wherein the F3 primer is selected from Table .
41) 'f he composition of claim 36, wherein the composition is lyophilized.
42) The composition of claim 41, wherein the composition is lyophilized as a complete formulation.
43) The composition of claim 41, wherein the composition is lyophilized as an incomplete formulation and additional components are added later in resuspension buffer.
44) The composition of claim 35, comprising one of m.ore of lactose, trehalose, sorbitol, glucose, raffinose, glycine or histidine.
45) The composition of c.laim 35, further comprising one or rnore additives, wherein the additive is guanidinium chloride (GuHC1), L-proline, L-histidine, b-alanine, L-serine, urea, acetamide, 4-arninobutyric acid, polyethylene glycol, pol.ypropylene glycol, polyvinylpyrrolidone K, 6-0-a-D-maltosyl-b- cyclodextrin, (2-hydroxypropy1)-b-cyclodextrin, a-cyclodextrin, b-cyclodextrin, rnethyl-b- cyclodextrin, glycine, proliiìe, taurine, or a cornbination thereof.
46) The composition of clai.m 35, further comprising polynucleotide binding beads for the capture of nucleic acids in a sample.
47) The composition of claim 46, wherein the beads are carboxylated.
48) The composition of claim 47, wherein the RNA polynucleotide cornponent comprises a spacer specific for the N gene or S gene of SARS-CoV-2.
49) The composition of claim 35, further comprising one or more additives to increase reaction specificity or kinetics.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163242377P | 2021-09-09 | 2021-09-09 | |
US63/242,377 | 2021-09-09 | ||
PCT/US2022/076140 WO2023039491A2 (en) | 2021-09-09 | 2022-09-08 | Coronavirus rapid diagnostics |
Publications (1)
Publication Number | Publication Date |
---|---|
CA3231249A1 true CA3231249A1 (en) | 2023-03-16 |
Family
ID=85506942
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA3231249A Pending CA3231249A1 (en) | 2021-09-09 | 2022-09-08 | Coronavirus rapid diagnostics |
Country Status (2)
Country | Link |
---|---|
CA (1) | CA3231249A1 (en) |
WO (1) | WO2023039491A2 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11453907B2 (en) * | 2020-03-23 | 2022-09-27 | The Broad Institute, Inc. | Crispr effector system based coronavirus diagnostics |
EP4265741A1 (en) * | 2022-04-21 | 2023-10-25 | Consejo Superior de Investigaciones Científicas (CSIC) | Multiplexable crispr-cas9-based virus detection method |
CN117660702B (en) * | 2024-02-01 | 2024-04-30 | 广东省林业科学研究院 | Fluorescent quantitative PCR primer group and method for detecting Liquorice pangolin virus |
CN118006733B (en) * | 2024-04-09 | 2024-07-09 | 艾特生物科技(深圳)有限公司 | Nucleic acid chemiluminescence detection method based on Cas12a and streptavidin aptamer cascade |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4274856B2 (en) * | 2003-06-19 | 2009-06-10 | オリンパス株式会社 | Method for detecting reaction between DNA and DNA-binding protein |
DE602006008150D1 (en) * | 2006-10-12 | 2009-09-10 | Bio Rad Pasteur | Double-stranded probes for fluorescence detection of nucleic acids |
WO2012064978A2 (en) * | 2010-11-10 | 2012-05-18 | Brandeis University | Compositions, methods, and kits for detecting and identifying mycobacteria |
GB201122458D0 (en) * | 2011-12-30 | 2012-02-08 | Univ Wageningen | Modified cascade ribonucleoproteins and uses thereof |
CN106715706B (en) * | 2014-09-30 | 2022-08-09 | 环球生命科技咨询美国有限责任公司 | Method for analyzing nucleic acids directly from unpurified biological samples |
BR112021025669A2 (en) * | 2019-06-18 | 2022-02-22 | Mammoth Biosciences Inc | Microfluidic cartridge for detecting a target nucleic acid, collector, method for detecting a target nucleic acid, and, uses of a microfluidic cartridge, a system, a programmable nuclease, a composition and a dna-activated programmable RNA nuclease |
EP4061941A4 (en) * | 2019-11-19 | 2023-12-06 | The Broad Institute, Inc. | Retrotransposons and use thereof |
-
2022
- 2022-09-08 WO PCT/US2022/076140 patent/WO2023039491A2/en active Application Filing
- 2022-09-08 CA CA3231249A patent/CA3231249A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
WO2023039491A2 (en) | 2023-03-16 |
WO2023039491A3 (en) | 2023-06-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA3231249A1 (en) | Coronavirus rapid diagnostics | |
US20210207130A1 (en) | Methods and compositions for the making and using of guide nucleic acids | |
CA2796578C (en) | Breast cancer associated circulating nucleic acid biomarkers | |
CA3060721C (en) | Method of diagnosing bladder cancer. | |
Kubota et al. | FRET-based assimilating probe for sequence-specific real-time monitoring of loop-mediated isothermal amplification (LAMP) | |
US20220093208A1 (en) | Compositions, methods, and systems to detect hematopoietic stem cell transplantation status | |
US20080194416A1 (en) | Detection of mature small rna molecules | |
CA2100919A1 (en) | Species-specific oligonucleotides for bifidobacteria and a method of detection using the same | |
CA3183566A1 (en) | Methods of detecting sars-cov-2, influenza, and rsv | |
US20210383891A1 (en) | Improved Ordered Assembly of Multiple DNA Fragments | |
WO2003045230A2 (en) | Novel compositions and methods for cancer | |
JP2023547536A (en) | Multiplex detection of bacterial respiratory pathogens | |
CN101849022A (en) | A method of DNA amplification | |
US20220098577A1 (en) | Ordered Assembly of Multiple DNA Fragments | |
WO2020068559A1 (en) | Depleting unwanted rna species | |
US20130231261A1 (en) | Rnase h-based rna profiling | |
US11345971B2 (en) | Primer set for detecting SARS-CoV-2, method for testing SARS-CoV-2, and reagent and kit of testing SARS-CoV-2 | |
US11584960B2 (en) | Multiplex detection of short nucleic acids | |
WO2020146603A1 (en) | Methods of detecting analytes and compositions thereof | |
CA3186629A1 (en) | Compositions and methods for treating disorders associated with loss-of-function mutations in scn2a | |
WO2018199136A1 (en) | Method for measuring expression level of abl1 t315i mutation | |
KR20050114099A (en) | Dna chip for diagnosis of colon cancer | |
CA3218053A1 (en) | Modified nucleases | |
AU2006216122B2 (en) | Detection of DNA sequence motifs in ruminants | |
US20080193935A1 (en) | Detection of Dna Sequence Motifs in Ruminants |