CA3231249A1 - Coronavirus rapid diagnostics - Google Patents

Coronavirus rapid diagnostics Download PDF

Info

Publication number
CA3231249A1
CA3231249A1 CA3231249A CA3231249A CA3231249A1 CA 3231249 A1 CA3231249 A1 CA 3231249A1 CA 3231249 A CA3231249 A CA 3231249A CA 3231249 A CA3231249 A CA 3231249A CA 3231249 A1 CA3231249 A1 CA 3231249A1
Authority
CA
Canada
Prior art keywords
strand
probe
quencher
fluorophore
rna
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CA3231249A
Other languages
French (fr)
Inventor
Hedia MAAMAR
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Proof Diagnostics Inc
Original Assignee
Proof Diagnostics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Proof Diagnostics Inc filed Critical Proof Diagnostics Inc
Publication of CA3231249A1 publication Critical patent/CA3231249A1/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/70Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Immunology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Virology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Systems and methods for rapid diagnostics related to the use of combinations of CRISPR effector systems with optimized guide sequences, OSD probes, RNA probes and/or RNase H for detection of nucleic acid sequences, such as sequences from coronavirus, as well as multiplex lateral flow diagnostic devices and methods of use, are provided.

Description

DEMANDE OU BREVET VOLUMINEUX
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVET COMPREND
PLUS D'UN TOME.

NOTE : Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets JUMBO APPLICATIONS/PATENTS
THIS SECTION OF THE APPLICATION/PATENT CONTAINS MORE THAN ONE
VOLUME

NOTE: For additional volumes, please contact the Canadian Patent Office NOM DU FICHIER / FILE NAME:
NOTE POUR LE TOME / VOLUME NOTE:

CORONA VIRUS RAPID DIAGNOSTICS
TECHNICAL FIELD
[0001] The subject matter disclosed herein is generally directed to rapid single-reaction coronavirus diagnostics including the use of combinations of (1) nucleic acid amplification; (2) TnpB effector systems;
(3) oligonucleotide strand displacement (OSD); and/or fluorescently labeled RNA probes with RNase to release fluorescence. TnpB effector systems are described in U.S. Application Serial No. 16/326,132, filed August 16,2017, published as US 2021/0166783 on June 3,2021, which application is a U.S. national stage application under 35 U.S.C. 371 of PCT/US2017/047193, filed August 16,2017 and published as WO
2018/035250 on February 22, 2018, in Malcarova etal. (2020) Nature Reviews 18:
67-83 (and supplementary materials at https://doi.org/10.1038/s41579-019-0299-x), and H. Altae-Tran etal., Science 10.1126/
science.abj6856 (2021), all of which are incorporated herein by reference in their entireties for all purposes.
BACKGROUND
[0002] Nucleic acids are a universal signature of biological information. The ability torapidly detect nucleic acids with high sensitivity and single-base specificity on a portable platform has the potential to revolutionize diagnosis and monitoring for many diseases, provide valuable epidemiological information, and serve as a generalizable scientific tool. Although many methods have been developed for detecting nucleic acids (Du et al., 2017; Green et al., 2014; Kumar etal., 2014; Pardee et al., 2014; Pardee et al., 2016; Urdea et al., 2006), they inevitably suffer from trade- offs among sensitivity, specificity, simplicity, and speed..
[0003] Sensitive and rapid detection of nucleic acids is important for clinical diagnostics and biotechnological applications. Particularly when responding to outbreaks, such as the novel coronavirus, which has been referred to as 2019-nCoV and SARS-CoV-2, which causes COVID 2019, time is of the essence. Sabeti, Early Detection Is Key to Combating the Spread of Coronavirus, Time (February 6, 2020). The 2019-nCoV has killed over a million people, including well over 200,000 people in the U.S. alone, and response to the escalating outbreak, particularly where there are indications that both symptomatic and asymptomatic patients with 2019-nCov may transmit the disease. Wang, et al., A precision medicine approach to managing Wuhan Coronavirus pneumonia, Prec. Clin. Med. doi:10.1093/pcmedi/pbaa002. Early coronavirus testing kits sent to states and other countries did not work properly, according to the U.S. Centers for Disease Control and Prevention. Grady, "Coronavirus Test Kits Sent to States, 30 Countries Are Flawed, C.D.C. Says,"
New York Times, February 12, 2020. Moreover, early tests provide results in four hours from initial sample processing to results.
cdc.govhnedia/releases/2020/p0206-coronavirus-diagnostic-test-kits.
Highly accurate test results at better processing speds, particularly that are field-depoloyable would aid in addressing the outbreak. Currently, the novel coronavirus SARS-CoV-2 has resulted in an international public health emergency, spreading to over 180 countries and infecting millions of individuals. Testing for the presence of the virus is of utmost importance to both reduce the basic reproductive rate of the virus (RO) and inform best clinical practices for affected patients. However, understanding the full extent of the virus outbreak has remained challenging due to bottlenecks in the diagnosis of infection, spurred by requirements for low-supply reagents and kits, complex instrumentation, and difficult logistics with central lab facilities. A
platform that is simpler with common reagents and can be done in any setting would solve many of the world's testing issues.
[0004] Previously, Applicants developed a platform for nucleic acid detection using CRISPR
enzymes called SHERLOCK (specific High Sensitivity Enzymatic Reporter unLOCKing)(Gootenberg, 2018;Gootenberg, 2017 ), which combines pre-amplification with the RNA-guided RNase CRISPR-Cas13(Abudayyeh, 2016; East-Seletsky, 2016; Shmakov, 2015;Smargon, 201; Shmakov, 2017) and DNase CRISPR-Cas12(Zetsche, 2015 599;
Chen, 2018) for sensing of nucleic acids via fluorescence or portable lateral flow. In addition, Applicants have also developed a platform termed "STOPCovid". (N Engi J Med 2020; 383:1492-1494).
[0005] ). These and related technologies have been described in the following applications, which are hereby incorporated by reference in their entireties: U.S. Provisional Application Serial Nos.
62/818,702, filed March 14, 2019, 62/890,555, filed August 22, 2019, 62/970,125, filed February 12, 2020,62/993,494, filed March 23, 2020, 63/018,487, filed April 30, 2020, 63/019,406, filed May 3, 2020,63/032,470, filed May 29,2020, PCT/US2020/022795, filed March 13,2020, U.S. Application Serial Nos. 16/894,664, 16/894,678, and 16/894,670, all filed filed June 5, 2020.
SUMMARY
[0006] In certain example embodiments, compositions for detecting the presence of a target polynucleotide in a sample, comprising isothermal amplification reagents for amplifying the target polynucleotide, and an extraction-free solution for isolating polynuckotides from a cell or virus particle. In one embodiment, the amplification is isothermal. The isothermal amplification reagents may comprise loop-mediated isothermal amplification (LAMP) reagents comprising F3, B3, FIP, BIP, Loop Forward and Loop Reverse primers. In an aspect, the LAMP reagents may further comprise "oligonucleotide strand displacement" or "one step toehold displacement" (OSD) probes. LAMP
amplification is described in U.S. Patent No. 7,175,985, which is incorporated herein by reference in its entirety, as well as in numerous publications (Notomi et al., Nucleic Acids Research 2000, 28;
Nagamine et al., Biochemical and Biophysical Research Communications 2002, 290, (4), 1195-1198;

Tomita et al., Nature Protocols 2008,3, (5), 877-882; Nagamine et al., Molecular and Cellular Probes 2002, 16, (3), 223-229; Mair et al., Bmc Veterinary Research 2013, 9; Tanner et al., Biotechniques 2012, 53, (2), 81-Suwancharoen et al., Journal of Veterinary Medical Science 2016, 78, (8), 1299-1302; Wang et al., Molecules 2016, 21. (1); Abdulmawjood et al.. Molecular and Cellular Probes 2016,30, (4), 205-210; Song 2016 Analytical Chemistry 2016, 88, (14), 7289-7294; Kong Scientific Reports 2016,6 and Mang et al., Scientific Reports 2016,6) all of which are incorporated herein by reference in their entireties.
100071 In one embodiment, the system includes a CR1SPR-Cas system that generates R-loops on a target nucleic acid. Such a system for the detection of coronavirus is provided. A system for detecting the presence of a coronavirus in a sample, comprising: a Cas protein; at least one guide polynucleotides comprising a guide sequence capable of binding a coronoavirus target sequence and designed to form a complex with the Cas protein; and a detection construct comprising a polynucleotide component. The detection agent may be an OSD probe, which upon binding a single stranded region in a target nucleic acid (generated, e.g., using the LAMP
and/or the aforementioned CRISPR-Cas system). In some embodiments, more than one guide polynucleotide may be used to increase signal and sensitivity. In another embodiment, the detection agent may be a labeled RNA
probe, which, upon binding the single stranded region, can be digested with RNase to release a detectable label. In embodiments, this label is a fluorophore. In embodiments, the coronavirus is 2019-nCov. In an aspect, the at least one guide polynucleotide is a highly active guide polynucleotide.
The guide polynucleotide of the system can, an an embodiment, bind to a coronavirus sequence encoding a polypeptide that is immunostimulatory to a host immune system, and/or binds to at least one target sequence that is a unique coronavirus genomic sequence.
[0008] The systems and methods may utilize one or more Cas proteins. In embodiments, the Cas proteins are a Type V or Type VI Cas protein, and may be Cas 12 proteins, Cas13 proteins, or a combination thereof. In an aspect, the one or more Cas proteins comprise a Cas13, which may be a thermostable Cas13 or Leptotrichiawadei Cas13. In an aspect, the one or more Cas proteins comprise a Cas12, which may be a thermostable Cas12b or Alicyclobacilluys acidiphilus Cas12b. The systems may further comprise amplification reagents for amplification of the coronavirus target sequence. In an aspect the amplification reagents are reagents for LAMP, polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplification reaction (NEAR), transcription mediated amplification (TMA), recombinase polyrnerase amplification (RPA) or rolling circle amplification (RCA).
[0009] A lateral flow device comprising a substrate comprising a first end and a second end, are also provided, the first end comprising a sample loading portion, a first region comprising a detectable ligand, two or more systems of the claims provided herein, and one or more first capture regions, each comprising a first binding agent; the substrate comprising two or more second capture regions between the first region of the first end and the second end, each second capture region comprising a different binding agent. In an aspect, the first end comprises two detection constructs, wherein each of the two detection constructs comprises an RNA or DNA
oligonucleotide, comprising a first molecule on a first end and a second molecule on a second end. In an aspect, the first end comprises three detection constructs, wherein each of the three detection constructs comprises an RNA or DNA oligonucleotide, comprising a first molecule on a first end and a second molecule on a second end. The lateral flow device may comprise a polynucleotide encoding a Cas13 and/or Cas12 and the one or more guide RNAs are provided as a multiplexing polynucleotide, the multiplexing polynucleotide configured to comprise two or more guide sequences. The lateral flow device may also comprise of OSD probes designed to separate binding agents upon detection of sequences.
[00010] Methods for detecting a target nucleic acid in a sample are also provided, comprising contacting a sample with the first end of the lateral flow device comprising the sample loading portion, wherein the sample flows from the sample loading portion of the substrate towards the first and second capture regions and generates a detectable signal. Methods may utilize a lateral flow device capable of detecting two different target nucleic acid sequences. In an aspect, the target nucleic acid sequences are absent from the sample, a detectable signal is generated at each capture region, the detectable signal appears at the first and second capture regions.
The lateral flow device can be designed such that when the target nucleic acid sequences are present in the sample, a detectable signal is generated at each capture region, and wherein when the sample contains one or more target nucleic acid sequences, a detectable signal is present at the capture region for the corresponding target nucleic acid sequence.
[00011] Methods for detection may also comprise detecting coronavirus in a sample by contacting the sample with the systems disclosed herein. The step of contacting the sample with the system can comprise amplifying the one or more target sequences in the sample and incubating the sample under conditions sufficient to allow binding of a labeled, but quenched probe to one or more target molecules. In the case of a double-stranded OSD probe, a labeled strand will bind to the target sequences and the complementary strand containing the quencher will diffuse away, allowing for a fluorescent signal to be detected (Fig. 2). The single stranded region of the amplified target sequence exists either as a portion of the amplified target sequence (e.g., as in the case of a LAMP-amplified target sequence), or can be generated by incubating with a CRISPR-Cas system, which can generate a single-stranded R-loop in the target sequence. Alternatively, in the case of a single stranded RNA
probe with both a fluorophore and quencher attached, the RNA probe will bind the single-stranded target sequences (LAMP loops or Cas-generated), and in the presence of RNaseH, the RNA strand of the resulting RNA-DNA duplex is digested, thus releasing detectable fluorescence (Fig. 3). In another embodiment, one strand of an unlabeled OSD probe binds to the target sequence, and a short RNA probe containing both a fluorophore and a quencher binds to the liberated strand of the OSD
probe, and RNase acts to release the quencher from the RNA strand, resulting in detectable fluorescence (Fig. 4). In another embodiment, one strand of a double-stranded RNA probe containing a fluorophore and a quencher on the same strand binds to the target sequences, and RNase degrades the RNA strand of the RNA-DNA hybrid, thus releasing the quencher and resulting in detectable fluorescence (Fig. 5). In yet another embodiment, an OSD probe containing a fluorophore on one strand binds to a target sequence, thus releasing the complementary strand containing the quencher, thus resulting in detectable fluorescence (Fig. 6). The addition of RNase H to OSD is advantageous because the OSD probes can re-bind when separated thus limiting the overall signal generated. RNase H, however, will cleave the fluorophore from quencher driving the reaction to complete liberation of fluorophore, generating more signal amplification in the reaction.
[00012] The step of contacting the sample with the system can further comprise incubating the sample at about 55 C to about 65 C, about 59 C to 61 C or about 60 C for 50 to 70 minutes and detecting the presence of a positive signal. The steps of extracting, amplifying incubating, activating and detecting are all performed in the same individual discrete volume.
[00013] Methods of detection can further comprise the step of treating the sample with a DNA
extraction solution prior to contacting the sample with the systems disclosed herein.
[00014] In an aspect, the DNA extraction solution is mixed with a sample at a concentration of about 1:2 to 2:1 sample:extraction solution. In an aspect, the method may further comprise incubating the sample and the DNA extraction solution, which may be performed at a temperature of about 20 C to 60 C for about 60 minutes, or 95 C for about 5 to 10 minutes.
Extraction may also comprise the addition of beads capable of concentrating targets of interest of the sample. In an aspect, the beads are magnetic.
[00015] A cartridge for detection assays in accordance with methods disclosed herein is provided comprising a sample receiver, at least a first, second, and third ampoule, and at least a first and second chamber, and a lateral flow strip, wherein the first ampoule is communicatively coupled to the first chamber comprising a heat source, the first chamber is communicatively coupled to the second ampoule, the second ampoule communicatively coupled to the second chamber, the third ampoule communicatively coupled to the lateral flow strip.
[00016] A cartridge can be provided comprising at least a first and second ampoule, a lysis chamber, an amplification chamber and a sample receiving chamber, the first ampoule fluidically connected to the sample receiving chamber, the sample receiving chamber further connected to the lysis chamber, the lysis chamber connected via a metering channel to the second ampoule and the amplification chamber.
[00017] The cartridge may be configured to fit in a system comprising a heating means, an optic means, a means for releasing reagents on the cartridge, and a means for readout of assay result.
The cartridge can comprise a first ampoule that comprises lysis buffer, and/or the second ampoule that comprises a CRISPR system, the CRISPR-Cas system comprising one or more Cas proteins and at least one guide polynucleotide. Alternatively, the cartridge may be as described in U.S.
Application Serial No. 16/894,670, which is incorporated herein by reference in its entirety.
[00018] The cartridge may further comprise amplification reagents. The amplification reagents comprise LAMP F3, B3, FIP, BIP, Loop Forward and Loop Reverse primers.
[00019] The cartridge can comprise a CRISPR system that includes a guide RNA designed to bind to a target nucleic acid that are diagnostic for a disease state.
Alternatively, the cartridge can comprise OSD and/or RNA probes, one of which is detectably labeled as discussed above. The disease state can be an infection, which may be caused by a microbe, the microbe selected from a virus, bacterium, a fungus, a protozoa, or a parasite. The guide RNA may be specific for a microbe that is viral, bacterial, or fungal.
[00020] The cartridge can further comprise a detection construct comprising a polynucleotide component, which may be fluorescent. In an aspect, the cartridge comprises reagents that are lyophilized. In an aspect, the Cas protein of the CRISPR system is a Type V or Type VI Cas protein.

In an aspect, the Cas protein is a Cas12 or Cas13 protein. The cartridge cancomprise a thermostable protein, for example, the thermostable Cas protein is Alicyclobacillus acidiphilus Cas12b (Aap). In an aspect, the guide comprises a sequence derived from Alicyclobacillus acidoterrestris (Aac). The cartridge can comprise a lysis buffer that comprises a DNA extraction buffer.
[00021]
A system designed to receive the detection cartridge as disclosed herein is provided, the system designed to receive the cartridge and conduct an assay comprising isothermal amplification of nucleic acids and detection of target nucleic acids on the cartridge. In one embodiment, the amplification may be isothermal. In embodiments, the system can comprise one or more heating means for extraction, amplification and/or detection, a means for releasing reagents for extraction, amplification, and/or detection, a means for mixing reagents for extraction, amplification, and/or detection, and/or a means for reading the results of the assay. In an aspect, the means of reading the results of the assay is an optic means. The system can further comprise a user interface for programming the device and/or readout of the results of the assay.
[00022]
These and other aspects, objects, features, and advantages of the example embodiments will become apparent to those having ordinary skill in the art upon consideration of the following detailed description of illustrated example embodiments.
BRIEF DESCRIPTION OF THE DRAWINGS
1000231 An understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention may be utilized, and the accompanying drawings. Note that according to the present invention, TnpB effector systems are substituted for CRISPR/Cas effector systems, and should be considered as alternatives in the specification and figures herein.
[000241 Figure I. Cas9 or Cas12 generates exposed ssDNA region via an R-loop. Cas9 and Cas12 can generate ssDNA via R-loop formation. Cas9 blocks about 9 nucleotides of the ssDNA region nearest to the PAM, leaving about 11 nucleotides exposed. This exposed region can be used for generating ssDNA that can trigger one-step strand displacement (OSD) probes and release fluorescence and bind to ssRNA probes with a fluorophore and quencher that can be consequently cleaved by .. RN ase H, release fluorescence Image from: httnts://www.frontiersio.orei articles/10.3389/fmich..2018.00257/full [00025]
Figure 2. OSD probes triggered by R-loops. With normal probes, the OSD probe binds the loop-mediated isothermal amplification (LAMP) amplicon loop, causing stand displacement and unquenching a fluorescence. With CRISPR OSD, the OSD probe binds the R-
7 loop formed in the LAMP amplicon, causing strand displacement and unquenching fluorescence.
CRISPR OSD therefore gives more flexibility in targeting and more signal than the normal OSD
approach because the CRISPR-Cas9 complex can generate many ssDNA regions in the LAMP
amplicon using multiple guide RNAs.
1000261 Figure 3. R-loop triggering RNase H-based release of fluorescence.
An RNA
probe of 5-10 nucleotides with a fluorophore at one end and a quencher at the other end binds to the ssDNA region in the R-loop, and RNase H cleaves the RNA strand of the RNA-DNA
duplex, which releases the fluorescence. RNase Fill will be active in the hot temperature range of LAMP (55 C-65 C). CRISPR RNase H probes will have multiple turnover for a given R-loop, yielding multiple fluorescent events, which yields better signal to noise compared to the ()SD
approach.
[00027] Figure 4. Combination of OSD and RNase H: Approach 1. A double-stranded OSD probe without a fluorophore or quencher is added to the LAMP amplicon, and one strand of the OSD probe binds to the LAMP amplicon loop, thus liberating the other strand of the ()SD probe.
That liberated OSD ss probe can now bind to an RNA probe of 5-10 nucleotides that has a fluorophore on one end and a quencher on the other end. RNase H can then cleave the RNA strand of the RNA-DNA duplex, thus releasing fluorescence. The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H, and also has increased specificity.
[00028] Figure 5. Combination of OSD and RNase H: Approach 2. A double stranded RNA OSD probe with a fluorophore at one end of one strand and a quencher at the other end of the same strand is incubated with the LAMP amplicon. The labeled strand binds to the LAMP amplicon loop, and the RNA strand of the RNA-DNA duplex is cleaved with RNase H, releasing fluorescence.
The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H and also has increased specificity.
[000291 Figure 6. Combination of OSD and RNase H: Approach 3. A double stranded RNA OSD probe with a fluorophore at one end of one strand and a quencher at the end of the other strand is incubated with the LAMP amplicon. One strand binds to the LAMP
amplicon, releasing the quenching of the fluorescence. The RNA strand of the RNA-DNA duplex in the LAMP amplicon loop is cleaved with RNase H, allowing for multiple turnover of the OSD probe and increased fluorescence. The combination of OSD and RNase H boosts the overall signal due to multiple turnover of RNase H and also has increased specificity.
8 000301 Figure 7. LAMP OSD preliminary limit of detection (LOD) with extraction beads. Detection of either pure RNA or mock viral particles ("SeraCare") using a combination of LAMP and OSD at various input concentrations. All samples were pre-concentrated using a bead-based extraction method.
[00031] Figure 8. LAMP OSD reaction and readout on alpha device without extraction bead workflow. Amplification and detection of RNA on device, with fluorescence labeled as arbitrary units (AU). FAM fluorescent channel refers to OSD detection; Syto64 refers to inline control detection of LAMP amplification.
[00032] Figure 9. LAMP OSD reaction and readout on alpha device with extraction bead workflow Amplification and detection of RNA on device, with fluorescence labeled as arbitrary units (AU). FAM fluorescent channel refers to OSD detection; Syto64 refers to inline control detection of LAMP amplification.
[00033] Figure 10. LAMP OSD reaction optimization for different buffer combinations.
Relative fluorescence in LAMP OSD for reactions containing 6 mM additional MgSO4 and either KCI or GuIIC1 additives. GuHC1 additives increase the speed of the LAMP OSD
reaction.
[000341 Figure Ii. RNase H-based detection of oligonucleotides in LAMP
reaction conditions shows that amplification can be achieved with small amounts of RNase H in the LAMP reaction.
[000351 Figure 12. RNase H-based detection of oligonucleotides in LAMP
reactions.
Shows that RNase H cleavage can also be achieved in the same range of RNase H
amounts within the LAMP reaction. Certain RNase H enzymes work better than others.
SUMMARY OF THE INVENTION
[00036] The following brief summary is not intended to include all features and aspects of the present invention, nor does it imply that the invention must include all features and aspects discussed in this summary.
The present invention provides the following:
1. A method of detecting a target nucleic acid in a sample comprising:
(a) distributing a sample or set of samples into one or more individual discrete volumes each individual discrete volume comprising isothermal amplification reagents for
9 amplifying the target polynucleotide, and a solution for rapidly isolating polynucleotides from a cell or virus particle.;
(b) incubating the sample or set of samples at conditions sufficient to allow extraction of target polynucleotides from the sample, (c) generating amplicons of target polynucleotides, wherein isolation of polynucleotides is not required between the extraction or amplification step;
(d) further incubating the sample with a probe that binds one or more single stranded regions of said amplicons, wherein said probe is:
(1) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
(2) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(3) a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(4) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or (5) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand;
(e) adding an enzyme capable of cleaving RNA in an RNA:DNA duplex to increase fluorescence to be detected; and (0 detecting the one or more amplicons, thereby indicating the presence of one or more target polynucleotides in the sample.
2. The method of item 1, wherein the enzyme capable of cleaving RNA is RNaseH.
3. The method of item 1, wherein the fluorescence detected is greater than fluorescence detected by unwinding of the RNA:DNA duplex alone.
4. The method of item 1, which does not include a washing step.
5. The method of item 1, wherein the solution for isolating polynucleotides is protease-based, detergent-based, or chaotrope-based.

6. The method of item 1, wherein the solution contains proteinase K.
7. The method of item 6, wherein reaction buffer contains a proteinase K
inhibitor.
8. The method of item 1, wherein the solution for isolating polynucleotides is Lucigen Quick Extract Plant DNA Extraction Solution.
9. The method of item 1, wherein the amplicons are generated using loop-mediated isothermal amplification (LAMP), polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplification reaction (NEAR), transcription mediated amplification (TMA), recombinase polymerase amplification (RPA) or rolling circle amplification (RCA).
10. The method of item 1, wherein the isothermal incubation temperature is between 55 C and 75 C.
11. The method of item 1, wherein the single stranded region is a LAMP
amplicon loop.
12. The method of item 1, wherein the single stranded region is an R-loop generated when a guide polynucleotide binds to one strand of the amplicon.
13. The method of item 12, wherein a Cas molecule or an argonaute protein enables the guide polynucleotide to bind to strand of the amplicon.
14. The method of item 13, wherein the Cas molecule is Cas9, Cas12 or Cas14.
15. The method of item 14, wherein the Cas molecule is Cas12(b).
16. The method of item 1, wherein the enzyme is a nuclease or nickase.
17. The method of item 1, wherein the probe is a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand.
18. The method of item 1, wherein the probe is a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
19. The method of item 1, wherein the probe is a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
20. The method of item 1, wherein the probe is a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
21. The method of item 1, wherein the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand;
or
22. The method of item 1, where the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
23. The method of item 1, wherein the target nucleic acid is from a virus, bacterium, protozoa, fungus, or other pathogenic organism.
24. The method of item 23, wherein the target nucleic acid is from human papillomavirus, hepatitis, adenovirus, Candidia, coronavirus, hemesvirus, human immunodeficiency virus, influenza virus. Plasmodium, rhinovirus, Neisseria gonorrhoeae, Respiratory syncytial virus, coronavirus, or Streptococcus pyogenes.
25. The method of item 25, wherein the coronavirus SARS-CoV2.
26. The method of item 1, wherein an extraction-free solution is mixed with a sample at a concentration of about 1:2 to 2:1 sample:extraction solution.
27. The method of item 26, wherein the sample is from a nasal swab or saliva.
28. The method of item 1, wherein the incubating step is performed at a temperature of about 20 C to 60 C for about 30 minutes,.
29. The method of item 1, wherein the amplifying and detecting steps are performed at about 55 C to about 65 C, about 59 C to 61 C or about 60 C for 50 to 70 minutes.
30. The method of item 1, wherein the target polynucleotide is detected in one hour or less.
31. The method of item 1, wherein the steps of incubating and detecting are all performed in the same individual discrete volume.
32. A composition for detecting the presence of a target polynucleotide in a sample, comprising reagents for amplifying the target polynucleotkle, an extraction-free solution for isolating polynucleotides from a cell or virus particle, and one or more of the following probes:
(a) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
(b) a single stranded RNA probe with a fiuorophore on one end of the strand and a quencher on the other end of the strand;
(c) a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
(d) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or (e) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
33. The composition of item 32, wherein the amplification reagents are LAMP
reagents comprising F3, B3, F1P, :BIP, Loop Forward and Loop Reverse primers.
34. The compositions of item 32, wherein the probes are selected from Table 9.
35. The composition of item 32, wherein the probes are provided at a concentration of 50 nM to 175 n.M, preferably 75 nM to 150r1M.
36. The composition of item 33, wherein LAMP primers are selected from Table 1A, Table 1B, Table 5 or Table 8.
37. The composition of item 33, wherein the F3 primer is selected from Table 5 or Table 8.
38. The composition of item 32, wherein the composition is lyophilized.
39. The composition of item 38, wherein the composition is lyophilized as a complete formulation.
40. The composition of item 39, wherein the composition is lyophilized as an incomplete formulation and additional components are added later in resuspension buffer.
41. The composition of item 40, comprising one of more of lactose, trehalose, sorbitol, glucose, raffmose, glycine or histidine.
42. The composition of item 32, further comprising one or more additives, wherein the additive is guanidinium chloride (GuHC1), L-proline, L-histidine, b-alanine, L-serine, urea, acetamide, 4-aminobutyric acid, polyethylene glycol, polypropylene glycol, polyvinylpyrrolidone K, 6-0-a-D-maltosyl-b- cyclodextrin, (2-hydroxypropy1)-b-cyclodextrin, a- cyclodextrin, b-cyclodextrin, methyl-b- cyclodextrin, glycine, proline, taurine, or a combination thereof.
43. The composition of item 32, further comprising polynucleotide binding beads for the capture of nucleic acids in a sample.
44. The composition of item 43, wherein the beads are carboxylated.
45. The composition of item 43, wherein there is polyethylene glycol in binding solution.
46. The composition of item 44, wherein there is silica in binding solution.
47. The composition a item 32, further comprising one or more Cas proteins and at least one guide polynucleotide designed to form a complex with the one or more Cas proteins.
48. The composition of item 47, wherein the one or more Cas proteins is a Type II Cas, Type V
Cas, Type VI Cas, or a combination thereof.
49. The composition of item 47, wherein the one or more Cas proteins is thermostable exhibiting nuclease activity at temperature of at least 50 C.
50. The composition of item 47, wherein the Cas is a Cas12b.
51. The composition of item 50, wherein the Cas12b is selected from Table 2A or Table 2B.
52. The composition of item 49, wherein the thermostable Cas protein is Brevibacillus sp. SYSU
G02855 (Br) Cas12b or Alicyclobacillus acidiphilus (Aap) Cas 12b.
53. The composition of item 52, wherein the Cas protein is Aap Cas12b and the guide is derived from Alicyclobacilus acidoterrestris.
54. The composition of item 47, wherein the guide polynucleotide comprises a sequence selected from Aac guide types 1 to 5 (SEQ ID NOs: XX-XX).
55. The composition of item 47, wherein the Cas12b is BrCas12b and the guide sequence comprises a crRNA design 1 to 3 (SEQ ID NO:XX-XX).
56. The composition of item 47, wherein the guide polynucleotide comprises a spacer specific for the genome of SARS-CoV-2 .
57. The composition of item 56, wherein the guide polynucleotide comprises a spacer specific for the N gene or S gene of SARS-CoV-2.
58. The composition of item 32, further comprising one or more additives to increase reaction specificity or kinetics.
59. The composition of item 32, further comprising a polynucleotide binding beads.
60. The method of item 1, wherein the sample is subjected to in-sample multiplexing using intercalating dyes.
[00037] The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular description of preferred embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.
[00038]
DETAILED DESCRIPTION OF THE EXAMPLE EMBODIMENTS
General Definitions [00039] Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains.
Definitions of common terms and techniques in molecular biology may be found in Molecular Cloning: A Laboratory Manual, 2nd edition (1989) (Sambrook, Fritsch, and Maniatis); Molecular Cloning: A Laboratory Manual, 4th edition (2012) (Green and Sambrook); Current Protocols in Molecular Biology (1987) (F.M. Ausubel et al. eds.); the series Methods in Enzymology (Academic Press, Inc.): PCR 2: A Practical Approach (1995) (M.J. MacPherson, B.D.
Ham.es, and G.R. Taylor eds.): Antibodies, A Laboratory Manual (1988) (Harlow and Lane, eds.):
Antibodies A Laboraotry Manual, 2nd edition 2013 (E.A. Greenfield ed.); Animal Cell Culture (1987) (RI. Freshney, ed.);
Benjamin Lewin, Genes IX, published by Jones and Bartlet, 2008 (ISBN
0763752223); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN
0632021829); Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 9780471185710);
Singleton etal., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, N.Y. 1994), March, Advanced Organic Chemistry Reactions, Mechanisms and Structure 4th ed., John Wiley &
Sons (New York, N.Y. 1992); and Marten H. Hofker and Jan van Deursen, Transgenic Mouse Methods and Protocols, 2nd edition (2011).
[00040] As used herein, the singular forms "a", "an", and "the" include both singular and plural referents unless the context clearly dictates otherwise.
[00041] The term "optional" or "optionally" means that the subsequent described event, circumstance or substituent may or may not occur, and that the description includes instances where the event or circumstance occurs and instances where it does not.
[00042] The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.
[00043] The terms "about" or "approximately" as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, are meant to encompass variations of and from the specified value, such as variations of +/-10% or less, +/-5% or less, +/-1% or less, and +/-0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier "about" or "approximately" refers is itself also specifically, and preferably, disclosed.
[00044] As used herein, a "biological sample" may contain whole cells and/or live cells and/or cell debris. The biological sample may contain (or be derived from) a "bodily fluid". The present invention encompasses embodiments wherein the bodily fluid is selected from amniotic fluid, aqueous humour, vitreous humour, bile, blood serum, breast milk, cerebrospinal fluid, cerumen (earwax), chyle, chyme, endolymph, perilymph, exudates, feces, female ejaculate, gastric acid, gastric juice, lymph, mucus (including nasal drainage and phlegm), pericardial fluid, peritoneal fluid, pleural fluid, pus, rheum, saliva, sebum (skin oil), semen, sputum, synovial fluid, sweat, tears, urine, vaginal secretion, vomit and mixtures of one or more thereof. Biological samples include cell cultures, bodily fluids, cell cultures from bodily fluids. Bodily fluids may be obtained from a mammal organism, for example by puncture, or other collecting or sampling procedures.
[00045] The terms "subject," "individual," and "patient" are used interchangeably herein to refer to a vertebrate, preferably a mammal, more preferably a human.
Mammals include, but are not limited to, murines, simians, humans, farm animals, sport animals, and pets. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed.
[00046] Various embodiments are described hereinafter. It should be noted that the specific embodiments are not intended as an exhaustive description or as a limitation to the broader aspects discussed herein. One aspect described in conjunction with a particular embodiment is notnecessarily limited to that embodiment and can be practiced with any other embodiment(s).
Reference throughout this specification to "one embodiment", "an embodiment,"
"an example embodiment," means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases "in one embodiment," "in an embodiment," or "an example embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment, but may.
Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention. For example, in the appended claims, any of the claimed embodiments can be used in any combination.
[00047] All publications, published patent documents, and patent applications cited herein are hereby incorporated by reference to the same extent as though each individual publication, published patent document, or patent application was specifically and individually indicated as being incorporated by reference.
OVERVIEW
[00048] Embodiments herein are directed to systems and methods of detecting the presence of a target nucleic acid in a sample. In certain example embodiments, the systems and methods provide for single reaction (one-pot) detection of target nucleic acids. In certain example embodiments, extraction, amplification, and detection may take place under a single set of reaction buffer and reagent conditions. In certain example embodiments, detection is achieved using isothermal amplification (e.g. LAMP) only. In other example embodiments, detection of nucleic acids can utilize Cas proteins to provide improved reaction sensitivity and/or specificity. In certain other example embodiments, isotheiinal amplification may be utilized with a thermostable CRISPR-Cas protein, with the combination of thermostable protein and isothermal amplification utilized to further improve reaction conditions and times for detection and diagnostics.
In certain other example embodiments, detection of nucleic acids produced by isothermal amplifcation may be accomplished using OSD probes. These probes may bind accessible loops, or bind open areas of ssDNA generated by binding of CRISPR proteins such as Cas9 and Cas12. Binding of OSD probes may release fluorescence by separation of a quencher and fluorophore, either through displacement of a oligonucleotide strand, or through triggered degradation of an oligonucleotide via endonucleases such as RNase H. Advantageous quick extraction approaches for the extraction of nucleic acids from a sample are also provided. Design of reaction conditions and reagents are provided for the identification of primers and reaction conditions, including concentration and content of reagents and additives, that enhance the detection systems and methods disclosed herein. Advantageously, the systems and methods can be provided in lateral flow or self-contained cartridge devices for rapid, point- of-care diagnostics. In certain embodiments, the detection assay can be provided on a cartridge or chip. A device system can be configured to receive the cartridge and conduct an assay.
[00049] In certain example embodiments, the Cas protein may be a Type V
CRISPR-Cas, a Type VI CRISPR-Cas, or combination thereof. In certain example embodiments, the Type V or Type VI Cas protein is a thermostable case protein with a nuclease activity above at least 50 C. In certain example embodiments, the Cas protein is a Cas12b protein. In certain other example embodiments, the Cas12b is Alicyclobacillus acidiphilus (AapCas12b). In certain other example embodiments, the Casl 2b protein is Brevibacillus sp. SYSU G02855 (BrCas12b). In certain example embodiments, the Cas protein, may be paired with the novel guide designs disclosed herein.
[00050] Systems and method disclosed herein include approaches to isothermal amplification for detection of target nucleic acids. In certain example embodiments, isothermal amplification approach is loop-mediated isothermal amplification (LAMP). Design of optimal systems, including primers, reagents and additives to be used with isothermal amplification approaches are also provided. Optionally, CRISPR-Cas systems as disclosed herein can be used with isothermal amplification approaches, including LAMP, that can enhance sensitivity and/or specificity.

[00051] Methods of designing optimal reaction conditions are also provided. In an aspect, methods can comprise identifying the type of amplification reaction and designing optimal primers in accordance with the methods disclosed herein. Methods may also comprise identifying optimum CRISPR-Cas systems, including identification of the Cas protein for the reaction conditions. For example, the Cas protein may be identified based on its thermostability, cutting preferences, or other desired characteristics. Preferred guide molecules may similarly be identified. Once one or more primers and/or guides are identified, salt concentrations and other additives can be titrated and selected for further investigation. Additional reaction conditions, additives and reagents can be identified to optimize the use of one-pot methodology, lyophilization of reagents, and use in the devices disclosed herein.
[00052] In certain example embodiments, the system comprises a Type VI
CRISPR-Cas system, one or more guide polynucleotides comprising a guide sequence capable of binding a target sequence and designed to form a complex with the Type VI Cas protein, and a detection construct comprising a polynucleotide component. The Type VI Cas proteins of the present systems and methods exhibit collateral RNase activity, cleaving the polynucleotide component of the detection construct once activated by the target sequence, which can generate a detectable signal.
[00053] Embodiments disclosed herein provide systems utilized in multiplex lateral flow devices and methods of use. In certain preferred embodiments, the guides utilized are designed to be highly active guide molecules, allowing for rapid and highly sensitive detection of coronavirus. In certain example embodiments, the systems can utilize general capture of antibody that was not bound by intact reporter RNAs as described in Gootenberg, et cd., Science 360, 439-444 (2018). In other embodiments, the presently disclosed system can be designed for detecting two or more targets.
When utilized with a lateral flow approach, two or more separate detection lines consisting of deposited materials that capture detection construct and a molecule specific to the deposited material, allows visualization of detectable signal (e.g. gain or loss) at detection lines due to collateral activity and cleavage of corresponding reporter oligonucleotide. Utilizing guide design that allows for design of highly active guide RNAs for use with the specific Cas protein of the systems for target sequences, for example, coronavirus is also provided. In certain embodiments, the time from processing of a sample in the current methods and using the presently claimed systems, from receipt of sample to detectable signal is less than 120 minutes, 110 minutes, 100 minutes, 90 minutes, 75 minutes, 60 minutes, 45 minutes, or 30 minutes.

OS1) PROBES
1000541 The use of OSD probes is described in U.S. Patent Publication 2016/0076083, published on March 17, 2016 and in U.S. Patent Publication 2020/0255891, published on Agust 13, 2020, both of which are incorporated herein in their entireties.
SINGLE LYSIS REACTION COMPOSITIONS
[00055] In certain aspects, embodiments disclosed herein are directed to compositions and kits that consolidate lysis and amplification of target nucleic acids into a single reaction volume. In certain example embodiments, extraction-free lysis reagents can be used to extract nucleic acids from cells and/or viral particles. In contrast to existing protocols, extraction-free lysis solution does not require isolation of the nucleic acid prior to further amplification. The extraction-free lysis reagents may be mixed with amplification reagents such as standard RT- PCR amplification reactions. An example of an extraction-free lysis solution is described in U.S. Provisional Application Serial Nos. 62/970,125, 62/993,494, 63/018,487, 63/019,406 and 63/032,470, noted above.
[00056] In certain example embodiments, the extraction-free lysis solution is combined with amplification reagents into a single volume. In certain example embodiments, the amplification reagents are isothermal amplification reagents. In certain other example embodiments, the isothermal amplification reagents are LAMP isothermal amplification reagents. In certain example embodiments, the LAMP isothermal amplification reagents may include primers for the target nucleic acids discussed in further detail below. In certain example embodiments, the LAMP
amplification reagents include primer sets selected from those listed in Table 1 A and Table 1B.

Table 1A. LA NIP Primers i O t ACCIGITCC.ACAACTAC:TACAACA1ACCTGAAGAG6A -- GiCthraCTCTGT1TIGGZu..1 sG-_ CiGAGC TTC1 G
_ 'FAGG! GAAGAGGAGGAGC _ ¨ 1 - TGCUSCAUTGIITUaCIATTTACTAACTOMATITA 1TAGCACTiMiGGITCAA' ToGiGICATAitTATAAG TGCAGTAGCTITATACTIT ACT-A-ACTEATCATiTAA' GT7 AGTTFGGAG CAAFAATTGGCFC rrrc:Aci TGGAG
¨.._...
2 GCAGAGIAGCATCATCTAGAAAACCIGTATETTleAT-- CTIGT7 GGCTGITC:AfciaAfACT-TCG-AAii.AliCAC TAATi AGATTITTGAAGG IGTATCTTICATGAATAGG
GAATAGGCAAA GGGTG GCAGT CAM
3 TGGC:GAGCATAATAATATAGCACAFTGCTAAAACAGT GGTTIGCAACCCCTICCTAGAAATGIATAGCTICITT
GACTATTGAATCTITATGA TGCFAAAACAGTGGAAGC

4 CiGIGGGGAAACCiGACiTCGCGCACTCCACIATITCCT
CTCCCACCCAGC;CGGCATACCGGIGGFGCTACTAGC ATAMIAGCTC'AC_ACAGG
. CCC G CC
GC,ACTCCACTA7TTC.C1TX:C .

CIGCTGGAGAAGTGGGAAGCGGCTICACICTICAAG AGGAGAACCAACGFCCAC ACCAGAGAACICICOCAG
A
(IAA TCTTGC C A
S '1GAGCMAITTIGAAA1OCTACAATGCiGGAAGAGG
AAAGCAGMCIAAGTAGTGTGTCTITA I AATGAGT CAACTGIGTTIAGATGGO
AAGATGT CCAAATGTCCAT C
TGGGGAAGAGGAAGATGT

ATACACTITATACAACCCG CCTAATGTAAATGTGTTIG

CGGACATTAAGTAGACCAAC.ATTCATCKTAGCCTAC CGCGCAATTTTATTTGGCT
T TTAATCGAATT AT
CGCCTTTACGGATGACAT

AGAIGCAlTAGCTGAGTCfGCTGTC.AAAAGACICAT C1'GCACAMT1 AT TAIGA
AGGCTGGAGAAATAGAAA
AGAAATGC TATATGGATCT TGTGTC TGC
GGAITTCCIGTIAATCCAATGCCIATGTGGAGGTIGG TTGITCAATAAATAIGCTGACGTCiG 7 GTAATCCFCA
CCTGAAAAGGA.ACG1TTG
AAGGTA TCC.IGTTGAG G
TGTGGAGGTTGGAAGGTA

TGGTGA CTATTAAAT C.CAG
AGCGGGTATTGATCiGTGA

CAGATACCTTTTATCTAAA

AAATGTCCTCCJ'AGATGCTG

GCTGTGGACAGTGTGGTCAGTTTGGCCCCTTTTTAG CTATGTGAGCAATAATC. T
CATGGGGICAATATITTIG
TATTITIGCC CA GACT CC
14 AATGCCTGC.AATAAGCACGCTGTATTGITCAACAACA ACTGAMAATGAGAGGTMTGCTCCAGGCAGTACT
GGATTGTGCAGAAAMAT TGTATTGTTCAACAACAGC
GCTG AATTCTTCAAGA TGAC TG
GGACAACAAAGAGAAAGTCCITCGMGACATTTGCT ATCCTGTGCAAGAAACCGIGGCTACTIGTACCITCT
C.AGAGCAAGIGTTAGATT AAGACATTTGCTATTAGAC
ATTAGAC. TTTTGG AAACTATCA CAC TTTTGG

AGTCCAGAAGITGCAGTTGAGAGCTT ATTTT CATATCTAACCTAATAGAT
CATFG CACAGCCT GATGGA
GGAAATICCCTGGCATTG

GCATAGTAGATTACAGTG ATGCATTACCATITGATAA

13 GCGTGTCTTGGAGGTCGAGGAACCCCAGAAGACTCC AGGAGAGTCCTGCTAGGAGACGTGTCTC.AGGTGAT
ACTACAAGGGACTCCTCC

AACCCCAGAAGACTCCGTG

AGAGGAGGAAGGGAATT
CTCCAG TITAAGC GG
GArfIGTCCCGICTCCAG
CTCCAGGATTTACTCTACTAGGAGTGGTCTCGGGATT GACCC.ACAATTCCTGTTGACACTGAATGACTGCAGA
GTTAAGTAGTGTGATATA
GGTACT GCCAG TCTTGGT
GGICTCGGGATTGGTACT
21 ACCIGTMGATGTCCACTAGCAGGTTGTAACAAC.ACCT TTAGCAATGCAGAAGAAGGIGTAGTGCTAAACGTT
GGTCCTATAGMACTCCTT AGGTTGTAACAACAC.CTGA
GAF C TCGGITIG CIA FAG IC

CTGAATTTGAAATTGAAGCCCCFCCATCTCTGAAGC ClICAGATATCAA %TM
GGIGAC GGGTGAC TGTGGA
TGCACATTFTACAGGTGAC

GGACGTCCTAGATACAGTGAAACCGAGTTTGCAT TGACCCTGATGTAACAATG
ATGA TGTTCCAC , GATTTGAAAATCCCGCCTT A

GTATOTACCCACAACAAA GTACTCCTCTAGAAC.CTTA
ACCTTATGC CCAGTIGG ATGT TGC
CCGAATAGGMAGTATGGGTGACCTTCCATATACTIC AGITCCTC.ATGICTCAGGAAGTGCGATTCGGATCTG
ACGGATGAATATGTTAAA
CACGCTG GTAG GGAAC
ITCCATATACTTCC.ACGCTG

CTGGACATCCTTTGTATAA TAGGAGACACAGAAAATC
AATC:CC CI ATA MCC CAAAC CC:
27 GGIGTICCAAAGTACACATE.TGATAATAGGAGCTCA TAG
TE.GTTCTCTAGTTTCCAGTGATTGITAGTCCCCT CCATTIGAACCAACATCTG TAATAGGAGCTCAAGACA
AGAC:AATCA GAGCA A ATCA

ACCA MCAT CCCTTACGGGGCGACACr CTOMCTGCGAGGAACCA
29 GACTCCGCCAACCATCTGCCACCA.AAAGAAACACCG GGCCCIAGATTGGGTGTGCGTTTGGGGATACGTTG
TGCACCATGAGCACGAAT ACCAAAAGAAACACCGTCC
TCCG GCZ C G

TCGACA ACC GCACCGCMGCGAAA A
i 31 CACTGG1TCC.CAACACCTeCTAAACAGACMCGCCAT CCAMTGGCGGTCCGCTATAGGTCAATATGCGTCC
AAC.AGACAACGCCATCCTA
MAC GC , TCACCAATGACTGOCCCA C

CCATTGAC.AAGGCCGGCGCACGTCACCGGCCACA C TICGATCTACCCGGGCC
ACGTCACCGGCCACA
33 CCTCTCGGGACAGCCACTAGAAATGAL 0. I i I GCAGA
C.AGTGACTTEGACC.AGGGGTGGAATAGGGICGCTG ATGACTCMGCAGACAGG
CAGGG GTCCG ACATCP.ACCGGACCGCTT G

CCC.AGGGIGCCCAGAGTAACACGCATGITTCCTGA GCTCATGG ATGAACACAAC
ACAACG AGCAGT CGCAAGGCAGTIGGTTTG G
35 GGG6ACAT5TCC7C.CCTCiTCT1GCAATTGGACGCGG TGCTCCATTCCACTACGGAGCTGTAGATAAGGCCGG
ACATAGGCTCGATGCAGC
G AAGCGG G
TGCAATTGGACGCGGG

TGATGATGCTTATGGCCCA
CCAG GCCAAG GATCTGCGCCTGCCMG G

TTAGTTACCTIGGGACCGC AGCCCITATTCTCTGGIGG

AMTC.CCGTAGGCCGCCGGC6C1GGACCGGCAC1T AGTG GCCGCGCMCTCACTAT
CGCTGGACCGGCACTT

CTACCCTTG TCGTGAGCCT
40 ACAAGAC.AGGCCCTCCCGAGTACAGCGTCCCTAC.TGT 1 GCCCCCAMGTCATTGCGTTGCTGCTITGGCTACAC TCTIAGGAGGAAGGGTOA
CC 1 (XiC CCi TACAGCSTCCCTACTGTCC

TAACACCTGAACAACCCCCG GCTTAATCC.ATCAGTCGai CGCGG 1 IC , GCCAGGGCTACAAAGTGC 6 AAATGTGATGAGCTIGCCGGGACCCCTATAGTAAG
GAA ' CAACGGC CCACCCCAACATCGAGGA
CTTGGCAACGATGGGGAA
43 GCGAGACAGAATMGCGGGACGACTTCTCCTTGGA GCTCTC.AGAGACGTGGGAGGACGCCACTGGACACG
CCCCA TATC GTGGCGGTTACCCAGACT
GAMCICCTIGGACCCCA

TGGAAGAGTGCTCCAAAGCTCCTGAACATTCCGCCG TACTGGCAGTGTTGTTGTG
TGG ATGG CGTGGCCGCATA MCAT G

AGTCAACATOCTGGGTGGGIGGCCGAGACCACGA A TAC.CTAGCTGGGCTATCCA
ACTGCCAGGTAATCCAACT
. AACTG AGTCG C 6 TGAGGATITCCACGGGAGG GGTCCCTAGCACAGAGGA CIGGCAAACCTCTTACCTG
CCIGC (X; AA T C
47 ICCCTGGAAGGCrGGGAAGTACTGCATGG I CC TATC
GGCATCATGAACACCAAGTGCCCGCATCGATCCGTI
COACT CTTGAC GAGGIGIGGGACIGGGIT TGCATG
GTCCTATCCGACT
45 .ACACGGGCACTTGAGGTCCTTCGTCACAATGACCAGC TCCTICCCCCGAATTCTTCAGCTGGGTTCAGGGGCG
CGGAGTACGTCGAGATCC TCGTCACAATGACCAGCAT
All MSC r T

A ICCACCACCCAGGCOCiPAGGTGCCGTGGAAACCGC CCC
CGAGIACGCACCICCCCA
SO TGGGCGCAAGACiAT CTGACT I CC I GCCCACTCAGCCA CT
AC.AGGACATGACCACOCCGAGGCAGGATITACG MICA GTAGAGGAAGC 16 GAT CAGAAGA CT
CTGCCCACTCAGCCAGAT
51 CCICTCAGI GCGGAT6TCITGTGGC 11CICAT A rCiAC
TA1CAGIGCT6C:AACCICGACKCCACGTAC:AAC.CIC GICCAAACGCACRX.CAT GGCFTCICATA
IGACACCC
. ACCCG TCGGT A 6 6 .
52 CGAGAGCAAGGGAG ItGAACGGGAGTACCIGGTC
TGACGIGCTACAICAAAG
GICATCGCCGCAGACCAGGAMGCGGCTGCAGCAG ATTGCCG CC
AAAGCGGCTGC.AGCAG

CTGGCCGGCTCGATCflICG IGATCCACG TCCGCCT 1GCGCTUCTALTIC TICA
TTCAA C GGGCAGGGCTCGCATA A
54 GC1GA17TICCTCC-T6TI ATICiCT GAT TECTAAAAAAT
AAGAATGAACATCTTCCCCTAA1GTTAAGCACCTCA TCiCiCAC.ATGAAATACTGA AT CM
TAAAAAATG TEGCT
GTIOCTCCTG TAACGATGT TTG MG
55 AGTC.TCCAGCTGMTCATTTGCTICTAGMGAAAAA TGITGriCiAGTGGAGITAATGAACTTCTICAACCATT
GCGACATGCTGAATAAAG CTTCTAGTITGAAAAAATC.
ATCAGAAGG TCCATT C AGAAGG
56 GTGICIATTGIGTAGCC TGITOCCAACA 11 CCCAT ACA.
CGIGATIAGPACACACGAGIACICCACATCCTGIAA A TACAGGCAGCAATTICA.
CCGG CATCAGAAA AC
AACATTCCCATACACCGG
57 CATCiGCA11CTG VGACCCr GC Cr TGGATACi AATGGA I GACAAAT
IGACICAG GGGAGACATATIGTIGIGTTC AATTGGA TIGTGT ICTCiG CTT MCAT AGAATGGA TGA
GAAGAAC AGTGCC AG AGAAC
55 CG1TTCCACCTACGGGTAAAC.AAACTTGGCIAAAAAT AGAAGGCCAAACTA 1CPAAIGCA
TACTGATCCCI-CC TCAGAGGATTIGI ATTAGI AAACTIGGCTAAAAATATC

59 CCCATCCCATTGTIGATCATATRITCiAGICGCMGAG GCACAAAC AGCCA7 ACAA
ITATICAATCfCCCCIGT GGATT TCri A-Cc:wan-re;
TGAATG GGCATTT CAATGG

60 TTCTGGAATATGCAGGTITCTCAAAAGAGATGGICrA
ACMCATAATGGACTZTGAGTACACAMTGTCCIAC GCTATGGGAAAA CACTAA AGAGATGGTCTATTAGTAG
TTAGTAGCAG . APAGGGATTT AGGA CAG
61 AAGGTTGCAAC.ACTTAGCGTAGAAAGGAA CA GATCT
TTTGAGGCTTCICTTTAACAGTGCGTGGGCCATAGCT GGAACTCATAGTTGGAGA
APAGGAACAGATCTATACT
ATACTAAACAC TCAAG ACC AAACAC
62 GCATATCTGGTCCECCCICACCJAAAGCATAACAATG ATTGGATACC.ATGCCAATAATTCCACACAGTGACAT
CAGGGGTTATACCATAGA CC.A.AAAGC.ATAACAATGGC
GCC TCCGCTC CM C
63 ACTGTATCTCGGG TMCiTTCTCCCCAGAA TGTGAT A
TTGTGTTACCCAGGCAGCTTATGMCACGCTGCTG CCAGAATGTGATAGGCTTC
GGCTTCT AG GGATGGCTCCTTGGAAAT T
64 CGCITGTATTGTTGTACGATCCTGTCTGGC:TGACACG AAATGCSAATMTTIGGGGAGTGCATCTGGTACAAT
GTAAACCATCATTCTITAG
TAM GCTC.TTTGT GM
GTCTGGCTGACACGTAAA
65 ATCCCTGAACTACCTCTTTTCGATGGGACACCATAAA CATGAAGACAGAAGGAACACTTGAGTGTAMATTG
GAATGGAATTCTCTTGGA GGGACACCATAAATTTTGA
MTGA GA CTCrrA AAGGA CCC GA
66 IGCCATCCTCCTICTA TAAAACCAGTTCCCCAGATTGA
AGGAATGGFTGATGGTTGGTATGCTGCTGCATACCC ATCGGAGAAATPGGTCTT GTICCCCAGATTGAATCAA
ATCAAGA TGATC AGC GA
67 TITGT6GTGIGGIGITATTGTGGCAAATAAATCAATT GACTGATGATCACAGACATGAGACCTGIGTGTTATG
MACTTGACAGAAGATAA GCAAATAAATCAATTCAGC
C.AGCC.G ATGICTaGG AAATGGG OG

ACACICAACAAAGATCAACITC7GTACTATC1GC1GT CATAGTICAAATGGAGa: TTAAGGAGAGACATAAGA
AAGATGAAAG GCTCCG TG TGAAAG
69 TGGGTAGTMGCCTITATAACGTITGGATAGATCIGG GC.CAACAGCTICTATGAAG lb I I
GTGCTATACC.AAA TATAGCGGCATTAGTAAT GGATAGATUGGTOTACA
TCTTACAGC ATGAACAA , AACCA GC
70 CCCAITATGCCTAGGCCAGCAGCATCATTATTG us. I II
ACAGAGGTACACCAAGGAATCAATTGAGTTGTTC.A TTCFACCATATATTGAACA
AGCATCATTATTGTCTITG
GACT ., GCATATGC ACCCAA ACT

ACACATTAGTAGTAGCGAG
AGCGAGTG ' ATTCCTGA ATGCTTC TG
72 GITTGITTGTTGGTC1TCTGTTGGTTGA1ITC.IGATCA
CAACTCACCCATCC.AACCAMCTITTTCCGGGTGGC TGATAGTGACMTGATCF TGATTICTGATCAGTTACC
GTTACCAATC TAGT ATCACTT AATC
73 TaiGTGTGGATATTTGTTTCACTAGCCAGC.AGATTIA AAACTCAAGAAGTGCAGTGC:TAGAAGGACACAITG
CCAGCAGATTTACTTATAA
CTTATAAAAGAAC GCACAT GGTGCCCATGTTCC.AATC AAGAAC

CATCAGTGTCAGAAATAA ACACACTTGAAAATATAAC
ATATAACAACC GAO I I ca I GGCFT AGATC7G AACC

CCATCATAATCACCAACCC TGTGGITC.AACTAATCAM
AATCAAACA CAGATATTGTGA IC CA

TTAAGACTAACAATAACG

TCCAAAAACAAGGACCAAC

CAACAACAACCAAAATAC
AAACAAC TTGGTAT AACC
AGCCCACGACAAAACAAC
78 TTCCCITTTGACITGIGTGITCTATCAGA/sCTACACTG 1 CCCTCCACTCAACCTCCTCCTGTGATAGGTACTCGG TCAAC.ACCACCAMAC.AA ATCAGAACT.
ACACTGCTCA

i TGITGITF CGTATTATTGCAAAAAGC CAAATCAACCAGAATCAAA
A ICAAAATCAAC 1 GA , CATG ATCAAC

AGTGTCTIAACCAGCAAAGTGTITCACAATAGGTAA CTGTATCTAAGGTC.CTGCA
CTAGAAGGGGAAGTGAAC
AGTGAACAA ' CAACTGTTT C AA

CTATGTACAACCAACACA
TGTTT TCACAAA AAGG
AAGGGICCAACATCTGM

TGITATCATTAATTGCCGT
CRAG TAGATGGGITG TGG
ACTGCTCCIATACIGCAAG
83 CACTCCAACTACACCGAGGGAAATAAGTGGAGCTGC GCCATGAGCAAACTCCTCACTCTTCATTGTCCCTC.AG
TAAGTCTATGGATAAAAG AAATAAGTGGAGCTGCAG
. AGA TIT CATCGA A

GAATGTGATTTATACAACT ATCAAAGTTATCTTAACAA
ACAACCCT TaCIGCTAA GTCT TAG GCCI

GAACCTACATATCCTCACGGGCITCTCTGCITTA TAA TCTGAGAGACAACiCIAAA CAATAGACTGGCAGTTACT
G rr ACTGAG AAGGG TM AT TAC GAG

GGTGTATGCATCTATAGA CTTAGCATAGGAATICTFG
AATICTIGGG AACITGAI AATTFG TAACAAG (31.3 ATCACTCCCCTGTGAGGAA
CiGAAC CG GGCGACACTC:CACCATGA C
88 ACCiATC IGACCCiCCACCCGACC:AAACCi TAACACCAAC CAGGGGCCCTAGATIGGGTCi TGTGGGGA TAGGCCG TGCACCATGACiCACGAAT ACCAAACGTAACAOCAACC
CG ACGTCTA C G
89 16GCACIC:GCCTCCAACACiCiCATCGATACCC71ACG 1 CGCAIGGCGTCCGGGTFCIAGAAAGAGCAACCACiCi CGTAGGTMCGC:AATTIG CATai ATACCCITACGTC/C
, CiC.C. AAGG 6 6 .

CCIAGTAITGTGIACGAGG
TC.GAGGClITTACCCTCGCGGCCGATGC.CATCCFGC C3 CC.
GCCGATGCCATC-CTGC
91 'FIG 1 GGGATC:CGGAGCAGCTTGAIGATGAACTGGT C TCAIGGACATGATCGCIGG
TGCCC.ACOVTGGAGAA ATAACGGGTCATCGCATG TGATGATGAACIGGICCW

92 TGGCiCAAAATCGGTAACCiCGTTICAACTCTTCAGGCT
TCAGTTATGCCAACGGAAGCGGGICTTGGAGGGTA 'I-KAM:ICI TCAGGCTGTC
GTCC GITiCCA GCTGGTTAGCAGGGCTCT C

TCCTTAACAACACCAGGCC
GGCCA GCAG TACCTACAGC1. tGGGTGC A

GTACGIGGGAGGGGTOCi CG ACAC A
GC.ACAGGCTGGAAGC6 95 ACAAGP.CCGTGCGTCCCGGCGGC1116GAGAACCFC CC
ICGTGITCTICRICITTGCGGAGGCCACATCCCCiF ACTCATATMCAAGCGGA

OGGCTTIGGAGAACCTCG

ACCC CIA C.GCGATGCCGTCATCTTAC
TCITGTAGTACACCCGACCC
97 CGGGAGAAGACGACTGGlICCAAGACTGGGCGCAC GGAGACCAAGCTCATCACGTGOAAGCCGITGATGA
AACCATCTCACCCCTCITC
AACG TGTCACC G
AGACTGGGCGCACAACG
98 CCCGTGGTAGACAGTCC.AGCATAACTGC:TACCCAAAC
GGAACGAGGAC:CATCGC.ATCACAC.AAGGTCTTGGT GGTGAGGTCCAGATCGTCi AACTGCTACCCAAACCTTC
MCC . CCACAT T C
99 N \ AGTCCACCGCTITAGCCACTCACGCCGITIGGCCTA
TCCCTGTGGAGAACCTAGGCIACGGTGGAGAGGACI
T TTGTCCG GGGGGTCCGCTGTTGT
CACGCCGTGGGCCTAT
AGGAACTTGCCGTAGGIGGAGTCCTMTATCAGGAC
CGGGTGCTCAGGAGGTGCTTAATG TGGCATCCGTG C-CTAATATCAGGACCGGG
0 CGGGGT GAGT AAGGCCOµTGGGGTTGAT GT
10 CGAGCTCGTCGCACTTCTTCTGGTGATCAAGGGGGG GAAGCTGGICCIC.ATTGGGCAGACAGACACGICAAG
GGTGATCAAGGGGGGAAG

CAGGACIGGCAGGGGGAA

CCATCGTGGGACCAGATGT

TGCTTGAACTGCTCAGCGAGCGATGGAAGAGTGCTC GTCCCGCCATGCAGAGGTTATAGACCTCGAGTTTCT
TGACAGGGAGGTTCTCTA GA IGGAAGAG FGCTCTCA
4 'TCAGC GCCAG CC GC

TCAACATATTGGGGGGGT
5 (31(16 G ACTGGCCAAACCCITXT. CT 66 6 TGCATCGCTCTCCGGCACGCTAATAGCCTTCGCCTCC CGCCA:GCCITCACTGCCATATGCAGTCGCCTCAGGAG
CGAGGGGGCAGTGCAA GCTAATAGCCTTCGOCTfr.
10 GTCCIAGGACCGACCATCXTCATTATGCACACTCCiCT CTGCAGGAACATGTGGAGTGGGCAGGAAGGGGAG
7 GCC TACAGGG , TATAGGGGGGTCTGGCGA
TTATGCACACTCGCTGCC

TTCACAGAATTGGACGGG

GCGCCTACACAGGITTGC
10 AGCGCATCGAAGGAGTCC.AGAACAGGAGATGGGCG
TGTGGCAGAGGAGGATGAGC.GGCGAATCTCCGAG AGGC1. AACCTC.CTGTGGA

CAGGAGATGGGCGGCA
11 GCCGGAAGTTGAGGAGCTGCCCTATCIACTGCCTTG ACGACAACATCCFCTGAGCCCGGGGGC.ATGGAAGA
GTGGTCCTCACCGAATCA CCTATCTACTGCCTIGGCC
O GCCG ATAGGAC AC G
11 CTITGGCAAGCACTGCGTGAACTC.AACTCGTTGCTACG
AGACTGC.AAGTTCTGGACAGCCITTICAC.GCCGCTG AAAAACTGCCCATCAACG

CAACTCGTTGCTACGCCA

GACATCCGTACGGAGGAG ACCAATGTTGTGACCTGGA

ACATCATGCTCCTCCAACG
11 TGCGGCCACCCTATTGATTTC.ATTCAAAGACTCCATG TGCCTCAGA/VACTTGGGGTCCTGGACAGAAGCCT
GCCTGC.TACTCCATAGAAC TTCAAAGACTCCATGGCCT
S GCCfC AGCGC C C
11 CCCATATTCTITICATTGCCTrTTCrACTTTGTGCGAC.A 1 AGATCCGAAAATCGAAACGAACACATGAAACAGAC AGGTACTGATCCAAAATG
6 ATGC1 1 rt CTAAGI GC GAAG

GIATGACTACAAGGAGAACCGATITTAGTATGTGTG TTGAAATAATCGAAGGGA
7 CIGACACT 1 AACTI Ca: LC , GM
GAACAATGGCCIGGACAG

AAGAACATGAGGAAAACAAGCCACTACTITCTCTGG ATGAAAAAGGCATAAACC
TT TGGA ' TGCCATA CCA
AlTACCTCCTGGCTTGGA

GAATTCAACAAGGCATGC TTGACAGATTCAAGTTGGA
9 TGGATTG TATTCAGTAGC G n-G

AAGGAAGACGGAMACT TTATAAAAGGAAGATCCCA
O AGATCCCATTTGAG TCTC AACC TTTGAG

1 . CCT TCA ACCTTC

GCGATACCTAACTGACAT TATTGAGGAATTGTCAAGG
2 GGGA CGCITIC (SAC GA

TTGAGGATGICAAAAATG CTCATCGGAGGACTTGAAT

CATGATTTATAAAATAAG CGAAACCGGTACATATAAA

TACATTGCTGGAAGCCAT
S MCI 1 11616(1 A
AIGCATGGAAATG1 COM' 12 GCTICCACAAAAAACCATCCATTACCATC/GaiGATAA I AA 1 TCii AGAACGIAAAACAGGGGACACTGTCCTCC

CATGGCGGATAACC:AAGG

GGGGCTATATCCCTAGGACCitiCITGGGCCTFATCAA AGATGTAGAGGC.AGTGCA GC1CAC7CCGTA
TOTAAGTC
7 . AGTCC 1, ACAGCC AC C
=
12 AGGATTGCCATCTAATGCATIGCTOCTACAACACAAT TGTGC1T1'GATAGAAAACA 1 TGCTACAACACAATGCTGG
12 Ci MT TGCTAGGG ICTCTATTCFGTCAMITTAGCGCT CCAGGAAAGGCTICTAGACC
TCCAATGTICT AITTG GGAAATCATICTITAAAA

IGTITGICCA

CACITGCTGTGTCJVAGC GCACAAGGCCATTGAACT
13 CCAC.ATAATACACTTTCiTC1TTCTGCACICiTGTGCAGTA
AAAAGGAAGCATCTAACiTATC1CIGGAGAAAAAATG ATGGGGAGGGATGGACT AGGTGTGCAGTAATATAG

13 GGGIT ACCCTCATATACAGGA 1 TGICAGGACCMCC AGAIGT i 2 TTfTTAC aiGGAT CGTTACAGCAAGTGCCTG
CAGGAOCCTGCC.TITITAC

ATITCCACAAACACCCATIGCTATAAGIATAACTAG ATACACCTATACCCACA It 3 ATCC.1 GATGCAGGTAA TAACTGCCTCATCGCCAT CT
13 GAGGTAGCTACATCAAGGGGTAGGCTATGGATTITA ATCCTGA 17 ACCTTAA.A.ATGGCT

4 AAACCCT GCCTTA GATFTGG Cr 13 CCCG Tr GTAACCAGT ATGGCTTATAACTCCTAG iGGC
CCCAGGGTCAAAACAAIGGTATTGGGIGGTATCTA CITGAC.AGITCIAT T1 ATG
AACTCCIAGTGGCTCTATG

6 CTATATTCATG . CCTG CAGC CATG
13 ACTAC.CiTCCGCCGTCTAGGTGACICGCCGGACAAAG
7 TCAGGAGGCACCACAGGCGCCGC.CGO.CTTACCTGA G GGCTCGCATCTCTCCTTCA
CCGCCGCCTTACCTGA
13 ACGAGCCCCCAACTGTGTACTACAACTC.TGCTGGCTC

CGACTGACTGCCGGCTT CAACTCTGCTGGCTCGG
13 CrGGGCTTGGGTTTGGATGAGTCCCTCC. TAGCCAGCC
CGCCGTCATCCTCCACCCAAACATAGGGAGGGGGG AGAAACACCGGTCTGGAT

CCCTCCTAGCCAGCCTAC

O TGC GGGAAG GC
GGAAGCCCOCAGTTTATGC
14 GTC.TfTGGCACTCCCTGGCAAAGGCGAGAATACCAG CCTTCCTGGGCCTCTATCCTCCGAGGCGTTCTACGA
TGICCAAGCCAAC.AATCC AAGGCGAGAATACCAGCA

CTAGGAGATAMTTGCGG

OCTGGACMCCAAAGAC.A

TACCAAAAACAACTGGGC GGTCGTGATGCCTIACAAC

GGAGGCAGMATATCGA

CCCACAGCAGTGTAACTA
GCCT TCCA , CG , CCGGCACTAGATACGCCT , AGGCATACTGATOCCCCIAGACCCGCAGCTGCACTAA TATGATGCAGTCCCCAAG

CTGGGCGCTACCTGAACT
14 IGGGGTAGGGGGGCATGTAGCC.ACTOGGTGTICCA GGGGCAGCTACITGCCTCAGTGGMGGCAGAGTA

CCACTOGGIGTTCC.AGTC

GT CATTACCTTCGGACCCI. TG

AAGGCAGGTCCTCTCAGG
14 AC.ACATAITGAGGAAGTCFTGGGAATAGGCAAGCCT GCTATTC.GCC.ATACTACCCATOCATAGGCTITTAA
GCTATTICCTCMGCTCC TAGGCAAGCCTAGCTACAT
9 AGC.TACATAA GAATGCCATT A AA
GCGCTTACCGGGATGAGAGCATGGAAAGGACCACA AGTTCTGCCOMTGGATCCCGTCCGACGGOCITGG
TCCTGGTCTTAATAGCCGC TGGAAAGGACCACAGGAG
O GGAGG G C G
15 GGGCAGGATTGCAGGGTfTAGACATCCTCGGTGATT GTTC.ATGGACCCICGACCTGCATTAGGGCAGGGIG
CATCCTCGGTGATTACAGC

GCGGAGGCTATTATTCAG CTCTTATTCAGACCCITGTT

15 GCGGGATCCTAGCGTGGGAC.ATAACTCCCTCATCCTG AGCTATAGTCTCCMCCCC

CATAACTCCCTCATCCTGCC
15 GGCMGAGGCCATGIGGITGAAACCCTGAGTC,ATCC 1 ATCCTCCTTCTCCTGC.AGCAACTTGGAAGGCGAGGT AAACCCIGAGICATCCCTG

t 1.5 AGAGCGGGAGAAAGAGGAGGAATGCGCCGTGCCT1 1 5 CF 1 CG , GCGAAGICAGCGGCC T
*FGCGCCGTGCCI TO' TATCAGCCCACTTCCC.AGGGTTCCAAACACGTAGAC AGCACAGTTTCCTCCTCCT
CCAAGGATAATAGCCCGTC

IS GGAGGGGAGTCGAGGGATAAGGGAGCATCAGATCA ACCCTTAAGGTCCTTACCCCGCGAGGAAGGAGGGT
GAGCATCAGATCACCTGG

TTTGGACATTTACCGATGG
8 GATGGC AAGGAAGA crfCCG 17CCACTCAACC C

GAAGGTCAGGGCCCAGA
9 . CtC CATCMG A 'FAAGGCTCT
GACGTCFCCC

ATAAAAGCGTGGGGACAG
O ACAGII GC CAUGGCACGCATACGGCT
TF

TATGGAAAGAATAAAAGA
1 C:TC Alf:CACTI ACTAC:GG

AAGFCGAAAGGTTAAAACAIGGAACGGCGTATM GAGTAAAATGAGTGATGC GOCAGATCGAGTAATGG
2 AATGGT GACITGOTI CC: T

TGGTMCATACATGTTAGAGAGAGATTGTFCCACCA GTAATCATOSAAGTTG ITT
GGATACTAACGICGCAATC

16 CTAGTGGA I CTGCTGATACTGCTC:AAAGTCTAAT TAT A I
TGGAGAIGTGCCACACiCACTC1CCIAAGAATOCC AAGTC1AGGAA TGATGATG CAAAG it-f AATIATTGCAG
4 TGCAGCC.A ACC TTGAT CCA
16 GCACCCITATITTCAATGTTIGGAGACitaiGICATCA GGAAAAGGGCAACAGC:TATACTCAT TGACTG

5 . ATCAAGA CTCITCC GA A
=

AATTGGGGAATTGAACATATCGACACTIGGAGTCAT GCA MAI AAAAGCAGTIA GACCF
GAATTFCGTTAAIA

ACAGGGGACAGAGAAACFGACICA.GGGCCATTAAT GAGAGGGIAGIGGIAAG

ITGACCGOTITTGAGAGT
16 CCM:ATI-Mr GGAATAGAGICCf TCTITACi TTCCTA TGIACF
niCiCiACATITGATACC.ACTCF AC !TICCIT TC1CiAAITICiAGCCATT MA
TCTTIAGITCCIAAGGCCA

16 TCCGGATGTGCCTICATCTGAACAATTC.TCGGAAAGG GTGGAGTCCGCTGITCTGAGTGATGCTTAATGCTGG
CAACAAGACC.ACTAAGAG AACAATMTC:GGAAAGGAT

TATCAGAATGGACCAGGCGATTCACA.C.-MAAGTICG C.T.11-FACCi AACC.XIACATG
crrGAGGAAATGICAAGG
O GGGA CITY A GA
17 ICCATTCAAGICCMCGATGAGC11CCAGGACATACr AACACAGT
MGAGICTCMAAACTCCCATMICATIA GAAATTICACCAF iGarr CTTCCAGGACATACTGCTG
17 TCCATOGCTGIATGGAGGAICICCAGCGCAAAATGC AC:AGGAACAGGAI
ACACCATGGCCACITMCCf TIT CCGACICTACTIITCCIAA

AGC.GC.AAAATGCCATA
17 ACf MCAITCHTTCAAGGC:AIGAATTAIGGCf ITCCTI
CTAACTCAAGGTCGCCAGACTIATGGICfTGGCCAA CAGACTGTGITCF AGAGG
"FATGGCTITCCITGAAGAA

GATAACAACCCACTTIKAA AAGGAGAGTAAGAGMAA
4 AGACAACAT . CTTTTTAACT AGA CAT
17 TCTCACAACATTTGCCAGITTCIGGCGAAAAGC.TTGAA
CTITCACAATCACTGGGGACAACCATCGCCAGGAAC TCGTATACTITGITGAAAC

GCGAAAAGCTfGAACAGT

TGITCTCAAACAAAATGGC

17 AGCAAAATCGTCGGATGATTGGC.AMAGA AATACAC
CTCATAGTGAATGCACCAAACCAAGGTCCTGTAGAA AGICTCGATACTGAATCTT CAAAAGAAATACACCAAG
7 CAA.GACAAT TCTGTCC GGA ACAAT

TCTGGAGTAAATGAATCAG
8 GAATCAGCT TATAT CTACCCAGCTTIGGAGTG a GCTGGTGCATGTGGGGA

18 GCC.ICGAGCTTGGGCTTCTCCTrTrACTCGGAGGCCA TATTGCCACGCATCCGGATCACTGGGCT
GGGGTGG
O CGG TT r 18 AGTrTCGGGGATTGTGAACCGTCAGATCACACGCTCr CAGATCACACGCTCTTTCC

CATGCTACGGCTCGGC
18 AGCAGCGTGGCGGCGAAAGIAGAC.GGGGIAGGGAG
TCCAATAAACCCATCOCCICGCCACGGATTCCTGAG CCCAAGGATAAACGTCa: GIAGACGGGGIAGGG AGA

18 GCAGTACCACTACCGCCAOCC.AACGGATAACGCGGT GGGACGGAGGAGAGACGATGCAGGTCGGACCATC
CCCAGGACCAGCCTACGA

AACGGATAACGCGGTGGC

ATACACCACATCG17GGATAATACGGACGCGTTA AT ACACTACGTC1TATAC.AGC
4 CTCTCC CGAACTTTC Tr C.AACTCTTACTGCTCTCTCC
18 CAAGMGCGACCGCAGTGGGICATAAAGCT. GTTGAC
C.GTTATCATCTGCTC.CCGCTGCCGGCCGCTGAAA AA TCATAAAGCTGITGACCCIG
CGGC AGAGT CGACGTCCCTGATTCCCT C
18 GCGTGCTGCATAGCTTGGCGTACGGGAGATACAAGG CGGATTCC.AAGTCAGAGCGGCGGGTCGGTCCAGGT
TCGAGGAGGACGAGTGG TACGGGAGATACAAGGAG

GCAGCCGGC.AGAGGAT

8 GG TCGT TTACACAACGACtCCAGC TG1TACA
TCGCCGGCGG

CCGACGTGCACGAOCT CACTCGGACCGCTCCT

O
CCGGCGGCTGCAGITGTTGCGAACTGTGCGTGCC GTA TGCGAGGAGGACGAGC GCGAACTGTGCGTGCC
19 CGCCTCGiiiii..CrC11CGGIGCCTGCTGIGCTACTGG
GGAGGAAACTGCCGCGGGATCTTCCTCAGAGACGG

CCTGCTGTGCTACTGGTG
19 GCGCCICTCTCATGGATCCACTGGTGAGCTGGGCAA GC.ACGCTGC,ATGGC:TAAATACCC-ATGACGCGTTCG TGAGTCCTTGAGTGACCsG

GGTGAGCTGGGGAACAGG

'FICCACGCTG1TTIC6AAA
3 AC GAIN: CGCAGCAAGTGACCGC

GCTTAACCAACGACGAGG

CAGGTAGCGCATGCAGTG

TGCGCTICAGATCGTGGA

6 GCA flG GATGAGTrCGCGCTCCTC
GGGCTCCAGAAACAGGCA

GOCGICAACAGCGT GCCT ITTGGATGGGC1CCGGGI Cr ATICTa:CCCTCCCICTCG
TGGATGGGCTCC:GG GT

8 MT ACATGCGCCCiGGCACTGATCGAIGGGICCGCTICA AGIT
CGGCTICTTGCTCG AACtCTGcfGCrCAcr6T

CAGGACGIGGIGACCICG

O CGACTGOCGCATGGT
ITCCICTCACCAAGGGCGCCICT GTG GIGO GCAACCGCICA TC:ACCAAGGGCGGGCT

1 CiGC CiccI TCCITCIGICACX:GMI
CTCGGACAAGA TCACGGC
20 CCCTTGAACATCC:CGCAGCACCICiCeGGGAAACGAG CSGCTGGICGGTTICIGACTGGCGGIGT1ICCAGCA

CCTGCGGGGAAACGAGA
20 GCGCTCCAC:GATCICGGCAIGTGCTAGACCGTACCG
3 CiCGCGCITCATTCTGGCCAATCTGGCCGATGGC7GTG AAT CAAAGGCGATGGGTTCCG
TaGGC.CGATGGCTGTG

4 GCTCGCAGTAGTGGCGCACGTGTGCGCGCTIIT (CA CSGCAGACGGCTGIGTC
CGICTGCGCGCTCTr 20 AGIGai ATACGGGGCTGGGAGGGCIGUCCAACI7 GGCCTIGGIGGACTICCAGGCGGATAATCTCCTGG CfGCLCGAGATGTIGACC
TG GCCA C
AGGGCTGCTCCAACTTTG
20 AGAGGGCTCATCCGACTC1GC.CCACCICTGATGTCC7 6 AAGCAGCCGCGTCTTAAGCGATTGCCIGGACC.TGCGC celoG GCAACAGCCGCGACGT
ATTGCGGGACCRIC.00 GGTGC.ACAGCGACAGTCi 20 TCGCGCATAGIGIGGAAGTGGICGCCCACC.ATGAA
8 TACCAGCGGCCCATGGTGCCGCCGCGGTAGAGCAT GGATTT C.CGACAGGLICTCCACCiT
OGCCGCGGTAGAGCAT

CCCCMCGTAITCTIGICGCCICCACCAGaGGAC TA GGAATTIICCCMGCGC A

GCCGCTCAGATTCCAGGA
21 TGCTOGGCGCCAGGAAAAACITCATGACOCCCAACi TGGCGCAGTTCTGGCACGGC.CACCGGGAACAAAGC.G TGCi ACCiTGGGAGCGC-ACXT

21 AGACGCGC.01 ACTCGGAAACACTACA TCAAAGGG C
ACCTGACGGIGAACCACCCAAATCGCCCCICACGTC: ACTACAT CAAAGGGICCIG

ACGCGIGTTGTTACCCACT ATCAGCAACGTGTTATCGG

21 CGTCGGGCACAGACCC.ATGGCCIACCGGGATCACGA
GCGACGGCGAACTAACCA

GCAGTACGGGGGTCGA

GAACTC.GCGACCCACG

GGTGCCGCCTGGAACA

CGAAGCCCACACTGACG

GGCGTCCGAATGGTGGGTAGCACGCCGATGTGCAGC GC CCGTGTGCAAGCCAGC CACGCC-CAIGTGCAGC

CGCAAAAACACGC.MC CGGGCFCTCGFCCIGT

AAGCTACAGCGCTCGCGGCACTGGCAGACrTTGGT OCAGMCAGCTITTGIATG

O CAG CG TGCACGAACAGCTGGACC
GCTTCCTGCGAC-ACCAG
22 CGCAGATTCTCGGCCAAC.AGCTACCCACCACGCGTTT
C.GGCTTGAACGAGC.GCCTIGTCGCTGCTGATCTGAC AGGTCAATACGGACTACG

CCGACACCGATCACGAAG

TCTCGCGGCCCTICAA GGCACCACCGAGACGT

TGATCCGCACAGCTCCG CGGAATGTCACAACGCCG
22 GGCTGCAGCGCTrECAGAATTGACATTCACGACCTAG GTIGGACGACAACGCCAACCGCAACTCTAGCATITT
GAC.ATTCACGACCTAGACG
ACGC GGCG GGCCTCATCACCAGCATC C
n ITCGGGTAGCTGCAGGCCGGGCCTTGGTACAGGAGC GTGCAGCAGCGCCTACAGAGCGTTTCCTCTTCCTCT

GGCCTIGGTACAGGAGCA

CGAGATGCTCTCCACCATC

OGCACCAACCACTGGAGG

8 CAGTCCGCCGCTCATTTTGTGTGAGCACGGCGCAGA T CAAC.GCGACGAGCAGC
TGAGCACGGCGCAGA

GCTGGCTTITCTGGAGGAA

O
GAAAGCGAAGACGGCGCCCACGTGGCAGGCGGICT ACC CCTGCTGCACAATGTGGC ACGTGGCAGGCGGICT
23 GTCA TGGCCAATCGTCGTCfCCAGCCTGGTGGAACTC
CGCCACAGCAGAGCCAGCACTITGCGACTCIGGGC

AGC:CTGGICIGAAC VCATCA

GGCTITCTIGGCGTACAGC
23 CGTGCAGGATGACGTGGATTCTTCCGCGTCCfCGTCG TCGTITTCCTCCTCTCCGTCGTGAATGICCTGGGCTA

TCCGCGTCCTCGTCGA

GTCCACGCCGCCTCAT

'FAGTTGCACACGGCCGA

CAAAC TGGCCILT-CTGGTACCIA

7 CATCTCTACCGCCGCCGTGCAACiGGCGGCGACA FCC GACGGI
GGAC:GGCGC.AGATTGC AAGGGCMCGACATCT

8 MG 1 GCCICAC:G ITCAC:CTCCI
CGC3GTGACCill6TG641G

9 CACCGACACACiCAGC:ACCAGGTCCGICGaXICIGTV CAG
TGTTGATGATGCGACGCC CCGTCG MCC-CCITT

TCTCCTCACGCACGCGGATTGCGACCGCGA
O GACCi AGM CTCCTGCAACGCGGCA
AACITGTCCATCAGCGACG
24 CGICITITCGGCGCTC:AACAAGAGGCGCACCVCCGAA GCCACAAAG
TCGCTITTGCCGCTCATCGACCCCCITG

ACCiCGCACCTCCGAAA
24 CCTCGCACAGCCGATGCACC:ATACGCAAGGCATCGT
2 ACGCCCGCCAGAAGTACGC.TCGGTAGGCACGC.CIAT C36 GCACAACGCCGCCAG A TCGGTAGGC.ACGCGAT
24 'ICTGTIGGCGGGGCTCACGCG TCGTAACAAAGCACA
GCGATCACCACGCTCGAGGAACCiCAGGAGATTCIG CGICGTAACAAAGCACAG

24 AGAGGCTACC:CGAa:GTAGCACAACCGCAACCiATIX: CAGCCAGACCGCTAGCCGAAAGCCG
CCCGAACA I

ACAACCGCAACGATCCC
24 GACGGTC.GTGGTCTCCTCGC.C.TGGAGGGGICCGGAA ACCZGGCGTGTCGACTCGTAGGGAAAAAAAGGIGG
S CA GGCC ACGCGTITTCCACCCITT
CTGGAGGGGTCCGGAACA

GCGCATTACrACCGAGTC GTCITCITGCGGTACCGG
24 C:ATCCTCiCTGTCACACGACGAACTGTT GACGGA TGTG GCaTCCFCCAGCCAATCG
7 CAA CGACCGCCICGGCTGATGATACCITCTGGCAC.CCTACC A
ACTGTTGACGGATGTGCAA
24 AAGCCiCGTGCCCACCCAAG I CGTCGTCATCGAAAGG
TGAAGAAATAGACCGGGIGCCGICTGGATGACCGT
8 C ATGGGC GCTCGTCC.CGGTAATCG A
GTCGTCGTCATCGAAAGGC

9 GC GAGGC.CACGGCGCAGAAGAACCGCTGOCCGTCIAC
TCGGTC.AGGGCACTCGT CGAGAAACGGCTGTCGG

O
TGGCTTACGACGCTGACGCGGCGGCCCAGAAAACC C CAGCTCGGTGGAAAGCC GGCGGCCCAGAAAACC
25 GGTC.GAACCTCATAACGGTGGGC.ACGCGTCCTATGA AGCGGCTGCTGGAAGAGGGCGGGTGACGGCGACA
CACGCCITCCTATGATGAGT

25 CITGCGCTTACCGATCCGCTTAGATTCGACGGGGAAC CACCTGGACATCTACCGTCGCC.AC.AGGCGGTTGAG

GATICGACGGGGAACGCT
3 GCTCACTCAAC.ACCGCCGTGAC.ACGCGOTIGGGAG TGCCTCAGCTG1TACCGCGGCCITCCCAAGCGGCA
TGTCTGCTAGTCGCCTACG GACACGCGCTTTGGGAG

GCGACTCGCCCGATCT ACGCTACTACATGCGCCT

S

GAATTTGCCCGTGCGGA

G

ACGACCAGTCFCCGGAC

7 AC.ATGGACGCCGCCTCTGCCGGITCTTCMGCGCA G CTTOCAGCGTCGCCGT
OCGTTC1TCOC:GCGGA
25 ATGCAAAGACGGCCiCGGOACGTGGAI CCCCATTATC
ACGIGGATCCCCATTATCC
8 CC CCGTCTTATCCCGTGC.CGCATACC:GMCGGAGAGTC
GGCACGTCACTTTGC:GG
25 C.GTAAGGAAACCGOGGCGGGCCIGGGAAACTCA A

TGGTCACCGTGGTCAGT
26 ACAGTCCCICiCTTGGGTGTCC.ATGITTGCCGGC17C6 O
CCCGGGCACTGATCCTGACCAGCGTGAGAGCCCCGT AGG GTGCCACGAGGGGATGA AGCGTGAGAGCCCCGT
26 C.ACGCTGACCGGCC ATCAGGCGGAAAAAGTGGATG
CGCGGAGGGTGATTCGCACTCCCGGGCGTCTCCAT TGAGAGTCCAAAACCCAC OGGAAAAAGTGGATGCGG

26 AGTTGTTGAGGTCC.ACCAGCAGTAAACATCAAGGG

GCAGCGCAAAGGCCA

TCGTGCGGAGATTTGITCT AAAACAGCAGAGAACrGA

4 G G .C7TGCCCGACGCGTGAAGGCCGAGCGCAAACAC
AGACGTCGGGCTCMa: ACTGGITCTGC7GACGCG

GTTCAGCATCTTGAGCGG AGCGCGTACACATAGATC
CG TGCA

AAGCTTACAGTCTTGCGG

GGACGCCAGTCGCATG

AGATAC.GTAACGTGTGCG

26 CGCTGTGCTGGCATCG/sGTTACGGC.TGGTGAGTCCA 1 CGTGTCGTACGCGCACCGTTGATACCGCGGAATCA

TACGGCTGGTGAGTCCAC

9 ACC: GACCGCC CCAGC(XiCATCAGCAG 'FCC
IGGCACTIGACGAGC

AAGTAATGGTCTGCAGCTG
O TGC CCCAT AGGTGGCGTCGCCAA

2 AACGCGCTGCACGACCACTTCCATGTTACGCGGCAG AGC GGCCACTACCIGTACGCC.
TFCCATGITACGCGGCAG

4 A OC:GACCAGTTCfCCATCGCGTGCTITCCCCGAGCTCC
GGCCICGAACATC1CCItG CX:GGCCiCiGGICTIGA

(31(3 TGG TTGCCiGCECTCFACCG
GCCTGTCCGATACACGTG

CACGCAIGTGCTACGCGGATTAGAGGCTCGGCAAT

GACGTIGTTGlIGCGGATC

ACCGTCAGAATCGACGGT
27 CTGACGCCACCiCCGCCATTCTI GT TCC ICATACiGCCC
GGTAC:GGGGAGAGATGG
TCTCCCOCGATTCCAGCGTGTCAACGGCGGTCGGA GG C
GTC.AACGGCGGICGGA

CIA
9 C TGCTOCTCTGC.ACGCTCTGACACCAACCAGGCGGC
CGCGGCCCATGAGGTA

O
CACGGCGTGICCTIGCTACCCACC.TGC.GAGTTGAGGC GIG CATGTTGGC-CGTCTTOGG
CACCTGCGAGTTGAGGC
28 CGTCGTGGAGACAAGCAACACCAGITT TCiG GGGCG
1 T1CACZTCC6GGTGGCTGCCGTGTTCGAGCACCTGAC. GAGG CTIGGAACIGGICGCGTC
CGTGTTCGAGCACCTGAC

GTGCTCAGCGGCGIGTACAGCGGGATCACGTCGG CC TAGCCACCTGITGGAAC

3 AG GGTAGCGCTTGCGGCACACCAC.C1TX:16CATCTCCA
GTTGTAGTCGCGGTGGC CCGCAGAAGGCCATGAG
28 GCAGCGGCGAGAGGAGAGGAITCACKi TGGA1 CCT
4 GCCTAACCTOCGCTCGCACTGGCGGCCGAATCFCC Cr.GTC CGTAGCGCTCCTGGATCT
TGGCGGCCGAATCTCC
28 CiCGGAGGAAGA TTCCi TCTGAGCCGTCTTCGTC:GICC TCCICICACi TT
ICCC:CGTCFCCACGTCGAAACCCCGG
CTAGT C GGCGCCGTCACTCGTT
CGTC1TCGTCGTCCC:TAGT

6 66CCGAC6CTGG1TCTCCTCGTGATGCGTGACGGAG GT TTTICCGCATGC.GTTCGA
CGTGATGCGTGACGGAG

CCC GCACGCGCGTTICCOCGTAAGCGCCII AC.AACTCCG
ACTGCTCGCACXGTCT GCGCGAATGTTACCACCC

CATGGGTACGGAGGCGT

GC.ATAATC1GCGAGGAGG
9 GAGGAC C.GGAACGTGACGGTITSCCGTTTTCCCCGCCGGTGG
TTC.GGCGCACCACCA AC

GTCTCTACGCTCCSACGAC
O A AGC CTCTGGGTCGCCAGGT A

1 AAC CGC 1TTGGAGGCCGC.ACC6 GCACTGGGCGTGGAAAC

CCTACGATTTGGCCATGTC

ATTGGCGCAAACTTITTGG
3 TGGC Gil CGGACTGIGGICGACTGT C

TCCACGCATGAAATAACG

CGCGAAGTGCAGCAGG
29 GACTITACOGTGC.GCGGC.GATGGAAACCAGCAGCCC GAGCACTIGCGGOCCATGAGCTGGAGGTCCATAC
G GGATT C. TGCGGTACTCGGCCA
TGGAAACCAGCAGCCCZ

GCCCGTCGTAGCGCAGAATAAGCTGCTGTTCACY.AC

7 11TCCCTACCGCGCGCTGGCGCTCCACACCAACAG TCAC , GCTCTGGCGCAACGAG GCGCTCCACACCAACAG

TCCAGCGCCTGCAGAT
29 CC.CCCAGGGCGTTGTAAAAGTCCITATGCTGCGCTAC GCGACriTACCGCCAACGCCGGICTGCGTGGGCGA
alTATGCTGCGCTACATG

O

CCAGAGCGTGCCGGTA

TGTCAGATCAGCTCGCAG

GGCTCCTTCGTGGGCA

GCCICCGGATCACATGGT

S
CGCTGGATGACGGTGATGCCIGTACACGGCC.GGOGA COG CGGCACAGGTCG TCCA
TGTACACGGCCGGCGA

CCAGGTGAGAAAGAGAAGCCGCCTCTCATCGTGCC

GTAGGAGCGACGCTGACG
i CGGCAGCGGTGGTACTGGTATGCTTACTGTGTGAA TGACGGTGITACTCGIGG 'FGAAGTGACGTIAGGGGA
7 GOGAGG 1 GCGG , I GO

CCACATCTGGTATACACCC
8 CO ' GTCCAGAACGCGG1TGGCAACGCCACAGGCCGTAG
CTTTCTTGGTGGCGCCC G

TACTITTCACTCCCGGGIG

TGGGCGATGCACAACTCITTCTGCTACGGCTGCTG A ACGACAGGATCAGAC.AGA
GCGTACAGGAGTCCTAGG
O AGGT TACGA CO T

1 . AG TGIGCCGCACAAGGCCGIGCAAGMCATTCCCCG
GGIGGIGGIGGGCATCGTGCGC1CGGCAGCTCCTT TTCCATCIGCGTCAGCCIG '16CAAGCCCATTCCCCG
31 CCCCTGGTCTrCAAGCACACTTACGCAGCCCAGCCTA
GTCGCCFCGGTAGCTCAGTAGTCTACGGACCGICIG A TACTGTCCCAGTCGCGA

ACGCACICCCAGCCIAG
31 GAGATGCCAACAGIGTIGCTACAGGAAGCTf ACAGG
CGATAGGCGGCTATAAGATAGAGATTGICTCICITT TACCG TGATGTAAAAACA
3 (T Aicrnrr TUG ACA
ACAGGAACCITACAGCiGT

CTTTITCCTTGTITGACGG ATATGTGCGGTATGATTTT

CCGGCGTGCAGCAAGGTTCGCTTCGTAATICTGACG
S CCAGG 1 ca CGGIACATCCCGCACITCG "fGATICCGTAGACGCX:AGG
31 GAGCAAGTTACCATCOCCTGCMGCGaiAATGGAAA = TCCGICGGCGACCAACACCAAGATG i GGGCCGACiA a; IGGGAACGACAGAAAC

TGC.GC.GAATGGAAACCAT
31 AGGGITGAGGCCiCiCCGTTCGGATCrACGTCCCAGTC
GTAGCGGGAGATACGGCGT ICAAGC:CGCCGITCAA GGA ICTAC:G TCCCAGICIC
7 . TCT GC GGIGACCGTCCTCTGTC.0 I .

CGACCITCGTACCIAGTCGC

9 CO Go TT
CGAGAGATTCTCCAGCCCG
32 TGCGCGCTGCTGC; TGITITATIGCAACACiCITCGACC3 CCiACGCGA TCACCICi TTGCAACACICTTCGACCIG
32 ACGAACiGTCCA.GCGGC000CGGATGCTC1CITCGTT
GCGTTGTCTCGCCTATCAGOGTIAACXITGGGITTCA

GCGGATGCTGGTCGTTG
32 CGCGTGACTCTCTGITCGAIGGCAGGICCAAGCGCCG TACGAGACCCCCiAGGIACGCACTGCAGACCGTACCi CAGCTCCAAGCGCC.GT
32 CCC AACGGTCACACGT CCCiGAAAGCCCi TCCCGGICA
3 Tr CGGAACGGCGTTTCACTGCCTTCTGCACCGCCGACG
AGGGGAGCAACAACCOT GAAAGCCGTCCCGGTCATT
32 GCGTCICICAGGGCTGGAACIAACCCAGCACTCCATC:G
4 CAAC.GC.CitiC.ACTTCCAACAGTGTMITGCACGGCGT AC TCGCGCCGCAGAGTT
TCITGGTGCACGGCGT

S TTCG CACACCCAGCCMCGGATC.GGICGTCGCGT CO
GGCZGTTGGAGAATIGGT GGATCGGICGTCGCGT

6 CCGACGTCCCGCATCCAACTGGCGTGCGAGGAAGA . ACATGGGCCATGTGTGGTGGTGTGCACTGCCGCGA
A TGGCGTGCGAGGAAGA
32 GAACACGCGCAACAGGGCCCiATGTCGGGCGTCCACC CTCCGGGTICCTGCCITITCCAAGGCCGTGGAAAAC

GATGTCGGGCGTCCACCTA
32 TATCGTCGGGTCGGCTGTCCAAAGCCTGGTTGGATCT AACAACGCAGACTGCTGAAGGCCAGGC.ATGGGTCT

AAAGCCTGGTTGGATCTCG

AGCTTCMCCCGACTCCGCGAGACIGGC.AGGGAC A AATTTGGGTCTTTTC.ACGC

TAGCCTCCTCCACCACGG
33 GGCCC.ACITTCGGGTGGAGEICTTGAAGTAC.AGCATG
CGATACCACCACATTCACIGGGCCITIGCCGTCCTCIT CITGAAGTACAGC.ATGCCC
O CCCA CCTG
GTTGCCCGACATCC. FCCC A
33 GGTGCCGAAATCACCGTGTGAAAAACCGCGACTTTC C.ITTCCGTACCGGGATTTC

AAACCGCGACTTICCACAC

AACGTCAAGAGICACGTC
2 ATGTCGCGACAOCCGGGFCGCCCACCCCCATTACACG occre; AG
GC:CCACCCCCATTACACG
33 AGAACATATATAATCGCCGTITC.GTACGGATCAAGAG
CAACAATACCCiCCCTACGTIGGTICAGTGTTGGATC CACATCAACAGAAAACCG

ACGGATCAAGAGGICCAT

4 AGC.ACGGGGTACTCCITGGCGCTCTCGGGGCCGTAT GGA TCGTTACCCCAGC=TGCC
GCTCTCGGGGCCGTAT
i 33 GAA6GAAACCGACCCCGMACTAGC.G1TCATAACAA GGCGACGACAGITCCGIGGGAAGGCACCGCGACA
CGGAATCGTATACCGGGC AGCG11CATAAC.AATC.CCG
TCCCGG G , A G
33 CTC.AACCTGTGCGCCACAGCGTTGCAGCaCCACGAAC

GTTCCCAAGCCACGCC GTTGCAGGCC.ACGAACA
33 CC.CTC.ACTGGGCTACCAGTGCACCGGIGTGCAGCTA 1 7 AGC ' GPAACC CGCGCCCAGAACCAAC
ACCGGIGTGCAGCTAAGC
33 CCGTGCCATCGGIGGGTACCAGAGAACGC.AGCGCA

GACCCAAGCCAGACTGC
33 TCAGCGAAACCGGCACCCCIGGCCGGCGAGTAATAGC GCGiGAAZEGGICGCICATAGCTCCTCC.TGGTGCCT

GGCCGGCGAGTAATAGCA

O
GGCTGICATGGGICTGCGCAGCCGCCGAAACACGA CT TGITGCCGCTGICGAAC AGCCGCCGAAACACGA

CtGAAAGCGTCGIGAGCA

TAGGGGACTACCTCCTCGA

CCAATCGGCGTCGCTTAG

GGGACCTGITTCAGCGIG
t GCAGTGCTACCGTGGTFCCCTACGATCCICCGGCGC GCGFAAATCF GCGGGATA
5 'MCC 1 I , GGCGCiCCAGAGAACGT GC

6 TG1TCGACAGCGGCGTAGACTCGCCTCGACAGAGCC ' ACG TTTAGCATTTCGGCGCGG
TCGCCTCGACAGAGCC

ATGGCGTCCAGGTGTATG

AAGTTCACGTCGCGTCC

GICCAGCAGTIGGICGCG TCCGGCATGGTGCTGC

AAATCCCGCAACACTTGTC ATAACGCCATGGGATCCTC
9 . CA C G A
35 AACC.AGATGGCLACGGACGTCCTGAACAGACCGCT A
CCCTGCTGTIGGCTACCTCTCTITCCTGTGC.ACCGIC CGTGAAGAGCCCTCATCC
O ACC ATC A CM
GAACAGACCGC TAACC

CGCATCTGCTCGCCGTAGATTGGATGATTITGCGCG

CfGATGCIGTTGICGTGCA

2 ATCCCGGCAAGTTGC:GCCiC:CACCACCGCATTAACCG CCCi CCFTCFCGCGGGCCTA
CCACtACCGCNITAACCCi TTCFACGGGTATCTGCAGC
3 T GGC.3 7 GGACCITCIGGGACCAGT
35 GCMCCAGACIGCACGGCCA TAGGCAGCCiACGTG I
ATAGGCAGCCACCi TGIAT
4 ATGG CGTGGAAAACCACCC.GGCAGTCCTCCGGGITCCTGC

35 CGGGGGGGTGAAATTIGGAGTGCAGCAC:ACGCAA
5 . CAGAMACAC-CC.CCCCGC.CCGGACGCCCGACATCCA CTACITC
GCGCACCCTCCTAGGT GCSACGCCCGArATCCA .

GCATACCAATGAAA I GGAA AGGTACT GATC.T.AAAATG
6 TGCTT GICTGAATAC.AT GAAG
CITTGTGCCACAGTGCTT
35 'FGTAGFCGIATAGAT OCC.iCAAGAAATGGACAGTGG T AGGAAAACCGATICACTGAAATTGG
ICGGATITT AT AATTGAAGGAAGAGACXX; IGGACAGTGGTGAATAGC
7 GAATAGCA CTTGTTAGC. TIT AG A
35 TICGCCTCFCICGGACTGACTAAGACACKiAAATGGCA AAICACAGGCACCA
FGCGTAAAGGC:IGGAGAAGYF AAGAATAAAAACCAGACT TAAGACAGGAAAIGGCAA

CCCAAAAACAAAGAACAT MACAAGCCAATTAATCiTG

36 CAAIGTGLICAATIGGGGC.AGATF CGAGCFGGATAG CATTGCAGGGCCACTGAGF
ACCATCCATGGCFGCAC CAACAAGGCATGTGAATT GATTCGAGCTGGATAGAA
O AACT AAG GAC CT

TTCTCCCITACFGACCCAAGGCCfACTTCAAGAACAC AAGGAACiACAAACCTGIA GITCA F
TATAAAAGGAAGG

AAACAGATCGGAGACAIGGCCAAGGITCTGCACAC GGGGATGGAAATGAGAC

CTGCCTICTTC.AATCICTIC

AGAATCiGITAGCGGCATIGGGCAPTCAGCCFCCCT GAT CCIATGAACAGATGGA

AAAGGACCCTAAGAAAAC AGGTCCAATTTACAGGAGA
4 AGGAGAA . ATGGG TGG A
36 CCATTTTCGCCTCTCC.AGAAATTC.ATGGAACTGATTC
AGAAGGAC.AAGAATTGCATATCAGAGTGCTGCTGT GGGATAGGGACAATGGT ATGGAACTGATFCGGATG

36 TTGAAGCAATTTGAACCCC. TCTATTAGAGGGACAAGA
ATGAGAACATGGAAGCAATGGACTTCTTATAGC.CCA ACCTTAGGGTC.TCAAGM ATTAGAGGGACAAGAATG

36 GCATITCAGTMCGCGC.CTAAAGAAATGATG t I 1 i I 1 GGAAGIGMAATIGATGACGGCGAGGCTCAGAGIT TTTAGACGCCAMTACAC GAAATGATG GGGC

36 TCCAATATGTCGTCAGCTICC.TGGGCGCCATTATGAG TFTGGTGGACAACTTTITTMCGAGTCGTAMGAGA

GGCGCCATTATGAGAACT

TATTGGGGGCGAATGTAT

CACAACTGCACCCTGAAGA

O Ci AAAGICCATT
CCTCACCCCCGAAACT AG GAAGIGCGGTICCAGAAG

GGAGGITCACCCTATIT ACC GGTGITGCTITGAATAGTI TCITAGTAATGGAGITGGA

37 GCGCCAGCGAGTAAATGCAAACAC.AIGTCAATGCAG CTGCCGACGTIGCAAGAGGATITACITCTCCACCAC
GCGAGAAATar.ACCTGA ACATGTCAATGCAGCTGFG

3 TTGGCCT ATCTAAAAGCA G , GCTITTGCTAAATIGGCCT , AACICITCTGTAAAAAAGT GIGTACITGCTTTAATTGT

TGATGATACTCGACATCCITGCTAGCACAATGGGAC. GTAACITCAGCCACTCAAA
ACTC.AAAC ' GAAG GCGGGGTATG11TAGACC C

CCAGGTTACCATATTGGG AGGATCTAC.AGTGGACGG

GCATTAGGCCATCTTEGGGATCACTAACCCGGAACA TGATACT1b1 I I 1 I I CTGA

GTAAGCTCTTGACCACCG

CGAGGTGACATGATATGA TGATTGTATTTTGAGATTG

AGATGGGGGAATCATGTC ACAGTTTCTGGAGCAGGT

CCATAGGCATGTGGTCCA
O AAGC cATT G
GCTFCAGCAGTIGCAAGC

GGCAAGGTAGCATITTAA

CTCTGAC.ACAMAGCTCTAGTCCAAACGATCGGTAC ACAAAAAGACTLITCAGTG TCCGTATACAGAT1TA A
GG
2 TAAGGGGT 1 CCITCC It GGT
i ACGIGTCAGITTCTATAMCGATGAIGATCAAATGG CACAAMTCTGGTGGTG
3 IC 1 AAGAAAGCCT A , G CT Tai KT
TCATCGCIT TC

TGCGCTCGTCAGATAGCAAGCCGGTGAAGTAGTTA TTGCGGCAATATTGAGTTG
4 GTTGC ' CCAGCG TTTGTCCCTATCGGCCCG C

AGTAGGATAGCACCTGCCGCCGCGCCTTCACATGA TA ATTITCCAGCGCCACCT

GCCCGCTCATAGGGGTTA

CTGATTTACCGCCMAA GAAAGGC CTCL I ItI III tCCATAGT AGGTATFCCICCTTATCCTT

GATTGGGAAGGGCAAAA AAATAATATTTTGCCAGCT
7 . CFC TF 6 TAGGACACAAT GI CTTG

AAGATCAGTAGCTAAGGA TGTAAGOGGMCTATAGG

TACTGCITTCFGGIGCGA CACCAGTTGGAAGAAIGG AAGTAGAAATLTCGTCGGC
9 CGCiCG GC; CI G

O TIT (.3 cm TCGTCCICTTCCTMCIGC TGCATTOCCACTACCGITT

GGITCGGLifCCIGCGGGAAICCCCTGCCAGAGCG T TGAGCCCGGGGACTA ACC ACGGITCCACGGIGCA
39 CA C:TA ITGAACGTGCGGGACATG1TA ICTI GGTAGGC . GLIGGAAGGICAITAGA
TATCCGGGAACG AAATT G CCIGTCTIAAAATCTCC:FG GTIATC TT GGTAGGCAA TT

39 CGTCGCGGCGAAA TGAT I GAAAGIGGGGGITGTCfT GCTCGACC:GTITTCACG ICC;
TGCMCGCCICAT T IC TT ICT TCCTCGACCACAGG
3 , C:TTCG CCAT I
GTGGGGGTIGTCITC:TICG .

1TGAAGIAAGCTGTITTAAGCCCAGCCTGCGAATIC. CITTC:AAGMAGGCATCT
4 T AAC.AAG AGC CA
GTCACAGTCACAAGGT

AGCTCCTGCATTAATGGIGTAGACACAGAAACTG GITGTANT AA AGCTAAAT
5 GM TACAGCC. CTGCC CA
AGCACTGCCTGTTGM
39 GCTGCC:GGCATCACATGGAT CT I CCAACAIGAGGCG
TGITGCTCCiCATAAATICGCGGCGIGGCAACIATIG

CAACATGAGGCGGTG

AGCAC.AAGTGAGGCGCAAGACTCCTCTCC AAAGTTGGTAAGCGACTC CCGTGGT1TGGTGGAAAA

AGCGTCACCCA

TGIACATAATCCAGCAAGTAAACCCCATIGTC KT AC AAAAAGCT ITAATAACCAT

GCACACTAATAGCGCGAG

TGCTGACGTCACTOGCGATGACGCCGGTCGCTCiCA CAT AGCT TACAAAGGGCG
O GA TCT Cr GACGGGAGCAGGAACAGA

CATCCGCAAAGATGAGGCMCIGTIATAGCGTGCC:

CAGTAGTar.C.TCTCGTA

CAAACGTCCAGGC.AGGGITTCTTGCAMTTITTAA A GIGGICAGGAAGATTATA
2 AATGGTT . AGCCTGAG CAAAC
GCCTTGCGACTAATGGTT

CCCGCTCCTGGATTTIACACTGACTGICCGTAACATC
3 AATCGC ATC.0 GTCCTTCTACGCTTCCAA
TCGAGCTTTCATTAATCGC
40 TTTTGCGTCGCTGC.CAACGTGAATGGGAGGAAAGAC
CACATGCGGGGGATGAAGGAAGAGCAAATGGGTT A ACATATGCACAAGAGCG GAATGGGAGGAAAGACAG

GTAGTGCAAAATAATGAC TGGCACGCAAACTATTAAC

GTAGGAAGACAGCATGG TGCIGGAAAGTGACATTG

AGGATCAAAACACTGTAA CCITAAATAAGGCTGCTAG

TGTCCATITIGGTITCGCC

AACAACACGGGCTGGG
40 GCTGC.GOCCGCAACAGCTATCGCGCACT-CAGTATA
9 GCTGCCCGCATCATAGCTOGCGGCCAGAGCG1TGGT CAT TCGCCGGCGTAGACC.A
OGGCCAGAGCZTTGGT
41 AACICGACGCCGTMCIGACAAGC.GCGATATCAAG CGCCCAGACGTAGGGTAC.AGTCCGGGAGAAAAAAC
AIGGCCCNICGGATAAa: CAAGCGCGATATCAAGCAT
O CATG CACTGT TT G

CGGAAGAAAAGCGTGGA TTAAAAGGGAAGMATGG
1 TGGA GTACTTTTAC , A , A
.
42 TGATGGACCTCGTCTAC.GGCTAGGGTACCGTGGGAC CGCCGTAGACGTCGACGCATTGTACAACGCGACGT
TACATCCTTCCATTGTGCC

GGGTACCGTGGGACAACT
41 AACGGATCCGGGCACAGTITGCCCGACGACiGTAATG 1 C.CAACCAA
AGGAGAAAAGCGGCGG1TCTGTIGTGT ATTGCCAAC.CGTITAGAG
3 GT ' GCGTTAC CC
GCCCGACGAGGTAATGGT
41 GCCATC.TTGAAAGCGCTGGTGCCTATGTTAGCGCCA CACICCCTCGATGATGCCGCACTCCGAGGCATCCTG
TCCCATCTGTGTATACGCC CCTATGITAGCGCCAGAGG

ACrIACTCCTACAAGGCTC
GGG GAGCGC G
CITTACGCTGGCTGTGGG

GACAATCCAAGGATGACA

TGCAAGACAGAAATACAG CAGTTAATGCTGGATGCTT

TAATTTGCAGGCTAATTTG GAAGCTTCTTGTACTCAAA
8 .ACTC:AAATGT AGGTATCCAC TGG TGT

GCCATCAGAAATTTGTTGC
9 Cf TAC AAAGT T
CCCGGTTCCTACACTTAC
42 AGGTGCCGTCTAAATAGGGAATGCTGGCiTTCAGGGT 1 O TIGAC : AGCAAAC: AGAAAC
CIGGGTFCAGGGITTGAC
s GGATACCTGGGCCCCACIATCACAGCIGTITGGCCI CTITAGAAACTFTCAGCCC
1 ITGTGGA 1 ATA , Al GAG
TAGGCAAGTIGTai A
42 GGGTCCACAGCCTAGATCTCGACCCTC.CCAGACTGCT 1 TTTCCGGGTfTIGTGTCTCGCGGTGTACGCCCCCAG CGGGTAACGCTACCACCT
2 CIA ' rrsc A
CCCICCCAGACTGCIGTA

TACCCTAGAMIAGTCAACC

42 GCTCCCCCCCTAGGCTTGTTATGGTTGGTGGGCAGG CCACAGCTGGCGCACAA.GTIGGCCCTAATTGCGACT

GCTA1TTGAGCCTTCATTCCTICAG(3AATAA1TCTTT
5 . CG CTCCGGAAA TGTTFAAACAITCGCAGC:G
C.ACTGGCATAAGGAGACG

TCC.ACCAAATGCTCT1TGC CACCATCTCCATTCCATGC
6 ATGCA GAM; (3 A

TICCATTAGCATAG
7 G rr GT TGITCGCAAF CCGGCGCII CCATGCCTITTICC GCAGG CCAT GO
TCCATGCCITTITCC

8 CGC GATG I GCTACGGCT(5TCAI6GAG C

AAATCCFCGCCITAACTIC
9 GCC:AGA AATTITA.CIG 7 AGITTITGCAGTGCCAGA
43 GCMCCiGGICTMCAAGITTTGAGCGCAGIGATGA A I
GGAAAGCCGGTIGIAAGC:GACGCTOTTGIAGTI CCGGCAGIAAATCTICCG TGAGCGCACIF
GATGAATC

43 AAGCATIGCTGCCAAACACCCTGCATTrACCT1TICCT AACCITCGAGAACTAAACAAACTGTIGTCCGITCGT
I . CT CAAAACC CAGAGA1TACGTGC4IAGA
CTGC.ATFTACCTITTCX.TGT .
43 AITGCTGTCAGACTGCGOICACIGAACrAlTAAGE GA

43 'ITGCGGCTITCGGGCiACATGITCCATCACCITIGIGT
CGCAGCAACGCTCAAAGAAACACTGCCTGACTCT CC

ITC.CATC-ACCITTGIGTGCT
43 CA I GAGCACi TAGCCCCGACGIGGGACACCATTGAAA
TGTAGCCTGCTTICAGCACTCiCGAATCGCCACICCA AA1CTCAGAAAGCCAACC
TGGGACACCATTGAAACCA

CCGC.AGCTGTCTCTACCTCAACiTAITTATTGAACAG GAAAACGCATTFITCCCAC

AGCAGCAACAAGGAAAGT T

rrAACCCTCCCIC1IGIR:CAGCCGAGCiCAAAAGTAC ATGAAITGGTMAGCGCC

CCOCACATGATTTCCAGGG
43 CGGGAACACiGACAAATGCAATAGCTAITCiGT GGAAA
TGAACiATTMTGCTGTCATITGIGTMCITC,AGGCG GGAMTGATACACACC(C
GCTATIGGIGGAAAAAGA

43 AAA I GCTIGCGTAC.AGGGIGAT TICAACCTATTAACC ITGlIGCTCCAG
ITAITGACAAACGAAGGAGAAACT AAACIACACAAACIAAAAC ATITCAACCTATFAA(X:IG
O TGCiGA GCCTCG ATCAC Giii, 43 AGCCAGAAGGIAAAGAAAAGAGAAACtAACTTWTT Gf ATGCF
CAAACAGCCGAAAGA.AGGGTrACATIAT ICGCAGGi CTTATTITArf CIAACTIGT 17CTAGIGCA

44 CCGTAGTACCATCGAC:TGACATTATTGTCACATTACA
ATACGGTAAGGGCAGCCCAAAGCTITTAGTAATAC:C AATGCATCATUCTIGAAA TIGTCACATTACAATIAACT
O ATTAACTGCA . TGAN \ CA GTACC
GCA

1 ACCT. CTTATGC CTGC CIG TATGC
44 TTGGATATTCGGGAAGATC.ATCAGTC(CCICTGTAGT CCCCCAGGATGATTTACCCCTAACAACAGAAGGAG
CTGGGTTTATGGAATTAT CfCCTCTGTAGMT.TITAC

TCAACTGTGCTACACGCTGCAATAGGTACCGAGCCC GAM". AATTGATTTACAAA AAAAGGAC GCCTG

44 fiTTTGC.AAGTCTGTCAGGITTGACCATGGTCTCCGTG
AAATGGAAGGTCAGAT(CTTCTTCCCATACTGGGTT TTACCOGAAATCTGA AAA

CATGGTCTCCGTGTACTC

TTAC.AC.TCAAGCTAGGGG CGGAATAAAACTTAATGCC
S AATGCCCA TGTGAGGG A CA

FLAGG TCTCACCACFGAAACTCCC TGTAAATTCTAGCGGTGCC
6 ci icca: AACCGTT T C
44 GCTCGCAICAAAGGGATAKITAACAGGGTCATGAAA CIGTCCTTAAGAC.GGGGATTGGAGITGITAAGITCC
GTAACAAGAATAGGGGAT GGGTCATGAAAGTAAACG

44 GGTTGATGTGATIAATC7TaCTGTCCTAAATATCCAA CTGGITCCCCAAGCT1CG1G1AACAG1A11GGTAGA

9 CTACACC AAAGGGA , GG CC

CGCCCAAGCaATTCTCTCGTTTAATTATGCATCATCGA TGATGGAAACGAATCCC:f O GCATTC GCAATT C
GTCACTCAAGCGGCATTC
45 GATGCTITTGGGACGCTGGTACAC.CITTTGITAATTA 1 C.CAGCTGACATACACTGTAGTGATACATGGTCATGA GTTAACAACCTCCCGATAC
ACACCITITGITAATTMG
1 AGGCG ' GCGGT C GCG
45 AAAGCTACGTTGAGTACTTCGGATGTAAAGTCAGACT TTCCTCACAGGGC.GGTAGTTCAGACGGTATCGCTAC
ATGTAAAGTCAGACTATTA

OCACTTACAGGAGGAGGA CAAGGTAGAGGC.ATAAAA

AAAAGTCTGGATCTTCAA AGATITACATIAGGCTCCA

CCTCAAAATCTAAGCCATG
GG TTCG C
GITCCGTAAAGACTCCGG

AGGAATATCAACTGCTAT CCATGTGGAGGTATAACAA
6 ACA.AAATF GGC GTGC AATF

AACGGTCAAGCACTTCTG CCGGTACCCTTGTATACGC

45 AGGCAGACATCAAGGCIGGGTTMCI. CAAGACCCG 1 GTTGAGGCTTGIGGGITITCCGCACAGCGTC.TTGGG ATAAGGATGCGGCAAGCT ITTCTCAAG ACCCGTCCAA
8 TCC:AAG I mil. C 6 9 GOA 1 CITA , CC
GCCATGGGGAGAICTGCiA

TCCATTGCTCCTATCAAATC
O ATCAG ' ccrG
ACAAACCCTACCAATCACC AG

GCCTTC.AATCATCATGICA

ATAAACTTCCATCAAATCTAAC GACATTGATTGAAACGAG AATGAATGATAACACAAAT

GCTAAGCCAAAACATACAA
3 . CAACA ATGT17TGGC:IC GAISGAGCCGAGAACAGT CA
46 CCTATGGITCCATGTACAC.ACAACGGTGAAGGACCAT GATGGTGACTGCAGGTGGAGTGAGCTGTAGMCT
CCCAAAACATATTCAATAT

GGTGAAGGACCAIGFTCYC

TGACTGGCTGTAAT GTG TCCGACT TAGCAGACAAGGTGACFC TGGTTGAAAAAGIITACIG
5 TACTG AG T 'MCC A AGT

CACCTGATCCCAAATACIT
6 CA.GGA IC/ACA TGA TGGCI
ACAACC:AACAGGA
46 CC.CCGGAGITGAATCATCTCCAGTGTGTGGAGIACA
canGTAGATCITOTAGTGCCICCC.ATC.CCTCTITCI GCCAGGAAAAGAAAAACA AGTGTGICGAGTACAATCT

46 TCCAC:AICAT TITCATCFGTGAC:AAAGTAGAGGACAC A
1 AACGAT T ITGIGGCTAGATCCiAAGGIAGTCTGGT AGTACi AGGACACAATTC16 8 AATTCTGAT ATGAATCTICT AATACa:ACACATGCCC.A AT
46 cioncrrrrrCACCCIGGGAATAIAAAGCCTAICAACA
CGCIGTGlIGAMCAGAAACiCITGGACAAAACAGC CAAGGTACAAGA IGITGG sr AAAGCCTATCAACACACC
9 . CACCA TGTT IC A
.
47 AAAC:ACCCTICTGAAGITAGCTGAGGGAAAACAAGA GTAAICTCFATGCF
GGCTICCATGATCACACCCT ATA AGCTCCTGCAAAAATAAA AGGGAAAACAAGAGCTAF
O GCTATCG GCTGAAC AGC CG

ATGGAITGACAATAACACCTGCIGCATTATCCCACT CfGAAAA IIGTAGCA LAT 'IGATGIGA
I:MCI TCATAC
1 TACCC. TGACTTTC GGTGA CC
47 GTCG TCCAAGGT TGCAGG TCACAGTGCA ITICTACTA TTC: TGCAGACCATICGACAT
TIAGATCGACAG IATG CGGAAGAGATAC:ATATAA ACAGIGCAT TT CTACTAAA
2 AATCC AGAACAA AAACIGG TC.0 47 CTCCTCCTCTGTAATC.TCATACTCAAAATGACTTAGTG
TCGGTGGTTCCATrTAGAATAGACAGCAGCATACAA CCACCGTTGCAGATCTTA AAATGACTTAGIGTIACCA

ATAGATACGITGGG TTAGAAAIGT GAAAGCTGACGITITGCT AAGAAGTCITA ACAGAAAT

47 GGACiT I ATTF TCACAGCCTGCAACCGAAAGIATAACA
TGAAAGACACAGCAAAAGCiAGATTCiAATITTCAGC GAGGATTGTGACANIGCA
CCAAAGTATAACAAAAGTC

CIGMITGGAC:AAACACCGGGTTGCTGCTTGATGA AAACTGTACAAAAG 1113A
ATAFCAAATGITATCT GAT
6 TGTGATCCTCC C.ITAC TGTGT CC:TCC

I.
7 ACAATGG CTGATATTTFCCK. TAG GC
47 AC.AGTCCAGTTCTGCTAAATCTTTGCGGGAGAAAGA GGACTGTGCATITTAAAACCCAACTGICGAAGGAG
TATGATCAICTTATACTTTA CGGGAGAAAGAGCTTATT
8 GCTTATTITAC . GAGGAGTA CAGACC TTAC

GTCAGGAGAAAC.C.COCCGCATTCGGTCTCTCCTCCT GGAGGAAACCGAGGAAA
9 At. CCT TCC
CCACCACGTCCCCTGAAA

TATTGrfAAAGGOCACAGC
O GGCACAGCA TGCAATA
AGCTAGGGATCCGCCTAT A

GTTCCAAATCITTATGCAA

GCAATTGCCTAGCTGAIGT

ITCTGTIGTGCCATTAGAGGATCTC:TC1TGTATCTCC CTAGTGGTACITTAGTAA

TACAGTGCCTGTGGAAAGT

CAATCATATCTGGCGAGG

TAACGCCATTGCAGTGTT

TAGAAAGCCCTCCAAAAA CACCACGTGACGTMAAA
4 ITTFAAAT Ain- CTAG I

AATTGGGCAGGCGGTICATTGATAATICAATAGCA f ATGCTGG C.AGCAGTA CTATAGGGCGTCCAAGGT
TITCAGAGAC.AGATGCTGG

ATGATTCTGATCATGAACGCCIAGGTAGTACCTA I A CT GAAAGATGTACAGG AA
CACAAAGTATTAGTTCCTA

ATTGCCACAGTTTGTAAAT
7 TGTAAATGG TCTCCCAT , TAGTGCTCCITTGGATGIC GG

GATGCCCTACACTCAACAACCCAAA TCTAACTGCAGATGITATG TCGTAATATITTGGATAATT

48 CC.AAATGACTCAGAGAGACTGGATGTGAACTATACT GTOCGCACCCCAATAATTATITGCGGAAATTTGAAG
TGCAAACGTGTATGTATCT TGTGAACTATAGTGAAATA

49 CiTTCGC.AGCTGACGCCGGTTGICCACGACCTACTCAC
ACAGACCAGGATCCCCACACCACTGCTGCCGTGCGA
O C C ATTACCCAGGCAGCCGA
TGICCACGACCTACTCACC

1 Gilt ATCGAAAT AATCC TC

TGCAATATGCTAGAAGAC AACCTGATAAATACTCATG

49 GCACATACATTGCTCCGTCCCfCGCGCCTTCAGACCI ATCACATGAGAACGGCGCGACGTCGCGCGAATTCG
GGATACGTACGCAACAGA

OGCGCCTICAGACCTGA

CAGAAGCAGAGGATTATG TC.AACCCATATTTGATGTIT

GGTAAAACCAACAGGAAA TTGATATCAGTTCCTTGTTG
5 Cf TGlIGG AAG7CT CG G

CAGGATGAACAGTGATCAGTTTGCTGTTGGCATTCA TCAAAAATTGTTGAAACT GTGAGAAATTG AAAG
AAA
6 AAAGAAAGCG 1 rt CCA ill. GTAGC GCG

AACAGCTTATCACCTTCTACTGCTCGCTCTAAATTCT GIATGITCMGATGGAG
'TGTCCAATCTIGCTAIGGA
7 1 ATGGAA 1 TGGAGAC , AATTIG A

AAGCGTGTTTGAATCAGCGGITTCTTCTATGGCCTC TfCAGAACTAGAGC.AAAC
ACCCTATTGAATACAGATA
8 ACAGATATGAGA ' TCC ACA TGAGA

CGAAAACGGTTGTATATA GCCCTAAAATTTAGCAAAC

SO GTIGTTGCATATCCAGCATAATCAAGIGCTGCCT AGA
AACAAGACATCTTAGACGTGCTAATTTTCTACTTCAC GTGCTGCCTAGAATTTCAT
O ATTICA% ACAGCGG
GCGGCTATCCATATGCAG G

ATGGAAGACATGTTACCC AGGATATTGTATTAGACCT
1 . AGACCTGCA GAAATGTICIT TAA CiCA
50 GCACTATAGCTICTACC.ATAAACCAGGACGATTCAGG CCCAACAGGTACAGAAA
TATCAGACCATGTCATACC AAGACCTAACAACGATGG GGACGATTCAGGTACAGA

TCCACGATIGGACGCCATTATGGTTTGAAACAGCCG GAAAG MITT AGGIAGTCC
3 cuvrAIGI IC CGACTGIGCAGGACCTAA AT ATI;
I
50 TGCTCCCGTACACTGTTTGTGAAGGGACATAGAGGG CAGGCACAGCAGGAATATTGGT.ACCAAGTAATGCT
GTCAGGAAAAGGACACA
4 GGA GCTX: GC
AAGGGACATACiAGGGGGA

AACAGTTATTGAACACGG GGCAGACAGTCAGMAAA

SO CITGCTGGICCICAGAATIZIGGGAAAIGTITTITIGA A I GGAAGCAAT AGCCAAGCG
TCAGTACTG TITTC IT AGTG i ATGAACTGICAAA GGAAATG MM.' GAAAG

50 COGITCCP.TACTATACTC.ACi ICGITGTCCGAAGCAAA AACAAGTTATGAAA I
GTGOCAAACAITCEACAGIT f GIAGTGCCACCATIAAAG
7 . AGCAC.A TGCCCCG CT
GTCCGAAGCAAAAGGACA .
SO GTITCCACT GICCACGCiGTCGAGICCAAC:AGICCEST
ACCAGCACCAAAGACGGAACACTITAAACAATTGG

GAGTCCAACAGTCCCCTI
SO 'IGGTAGGGGGTATITITACAACAI CAAACATGCCATT CTGGGAIT I
ATGICACRICACCT AAIGTACAGTIGT AAACATGCC:AITGIAACTG
9 GTAACTGT ATTACACC.AAT TCCTCAAAGGCACCACAT I
51 GGTIG ITAATAGCAGCCACAATAGIATGGATAICTGA MCKIM:TM TCS GT TACT GTCCCCATCAT
TGC IGI GCATTTGIIGTArcirrrrc ATGGA IATCTGAG ITTA IT

GTOTAC
Si CTATATAGGCCCACACGAGGCTCATCTAGTGATAGCG
CCACKITGCAGGTTACAGACC.TCCCCTICATATAC.AG ATTCCTTTAGATACTTTTG
TCATCTAGTGATAGC:GGTC

Si GGAAAAGITAI ATCAGGGCCAGATIGGGGI AACACC
TACTGCACCIAIGGCiAACACC.AAACAGGOCCIGIA CACiATACATAITTAACITC

GGGGTAACACCACAGTTC
Si GCAIGATAAAA TAIGTI GG IGCGAGCCfCCTCCI AAC
CG.GCAGITCTAGACTICTTGCAGCAGITITGITAGC GCGACAGCACAG TAT ATG

OCTCCTCC.TAACCCTGTA
SI GGAACATCI GATTTA ITGG TCTGCAACAGGAT GGCG TCCAGAT TAIT I ACAAATGGC
TGCA.ACATTIGTFCCF CCGCCCTI AGAACTIAT TA

ACAGGATGGCGATATGGT
Si ACAA I GGTA Ill GTIGGCiGIAATCAATGITGGIACT

S

51 TAAGGCAATACCGCACCCTGCC.AAGTTGCTTGCC.AAG
CTTTTATGCACTGTAGCCAACTCTAACAGTAAGACC
6 T . AAAAAAATGTG CAGGTACACATTGCCCTG
CCAAGTTGCTTGCCAAGT

7 CAGG CC AAGCGTCTAGC.CATGGCG
GTGTTGTACAGCCT. CCAGG
Si CTGCGCGGCAACAACTAAACTCAATGGACG1TAACT CCCAGA1TGGGTGTGCGCGTCTTCCACGAGGTTGC
CAAACGTAACACCAACCG AATGGACGITAAGTTCCCG

51 CTGACACC.ATGTGCCAGGGCCGCCGACCTCATGGGA GGCTTTGGAGGACGGGATCAACGAAAGMGTGCCA
CATCGATACCCTAACCTGC

CGCCGACCTCATGGGATA
52 TCGCAGCGCCATACATCGGCC.CCACCATCAAATCCA
CACATCATGCACCTTCCAC
O
CGGTAGGAGTAAGGGCCACCCTTGCGTGCCCTGCGT CAT G TTGCGTGCCCTGCGT

TGAACTGGAGTCCAACAA

ACOTAGITCTCGCCCAGG

52 GccrrGGCCGTAGCIGTCAAGACAGCTCAGGGICIT CCCACTCGGGGTCGCTAACAATCCCGCAIGGCCGA
ACTGGGTTCTTGGCTAGCT

ITCCGA

TGGGGGGAGAACGAGAC TITTGAACTCGACCAGACC

52 C.AGGGGTGATAGCTCTACCCGAGGCATGCAACIGGA CCACTACAGCGTGGCAGATCCATCAAGCCGGIGGA

GGC.ATGCAACTGGACCAG
52 CAGGAGAAAACAGGGOCACAGCTCTGCATTGICTCiG lICTGATGCTGCCTGAGAGGGCGACAACCATGGCG
TCTGCATIGTCTGGC.ATGT
CATGTC CCG , GGTGCCCAAGGCTTCTGG C
52 AATCACTGAGTCGCGAGGCCTATAGCrAGGACCGAG 1 TCTTGCAGTCCTGGTCTGTCCAGCCCTAAGATGGCC GGGCATTGIGGIGGATC.C.
TATACICTAGGACCGAGGC
6 GCTG i AGAAGA A TG
Si CC.CTTATGATGICTC.CGCACGC.GTGTICACGCCCATG
C.CTGITTCGGCC.AGGTIGGGGICTCCACCCCTTTGA
7 GAG TGTT GTGGCCCIAGAGCC.AGTT
GTGTTCACGCCCATGGAG

AGAATTGTGGCGAAGTGC

52 ¨ ETCTGACAACTCAACACCCCCAGGTGCGTGTAGGTG
9 ACGAAGTCTACCGCCTTGGCGCGCCGGC.ATATTCCG CG GGAGGTCCGCTGCTGT
GCGCCGGC.ATATTCCG

ACATGTCCAAGGCATATGG
O TGGC CCG GGCCACACTGGGTTTTGG
C

TCCAACATAGAGGAGGTC

GCCAACAACGGGGGAAAT
53 CCTACCAGTGCGGCCTCTCCGATTACAACCGTTCCCC GGGAGGTTGGGCACATACCGGGCAGTGICAAAC.AT
GACITCAGCTIGGACCCC GATTACMCCGTFCCCCAG

TTCCTATCCCAGACCAAGC ATCGGGTGAGAATTTCCCG

AACACCTCCC.ACTAGTCGAGCACCTAAGGCCrTCTG CTGTCTTTCAGTGGGCAG
GTGATTGTTGGGAGAGTC

t 53 AGGGrITGTTGGGTGGICAGAGCAITGCITC.CCTCAT 1 GATTCGCGACIMGAIGCTTCTTITTGCCAAGGCCC ATC.TCGCTGGACTGICTAC
O GTCCT 1 ACAC , I CAT
TGCT-mccrcA 1 GICCI

TIGGTAAATITGCTCCCGG
6 TGGCGGCGCAAAATCGCCATICTATCGCCCGGAGC ' C C
CATTCTATCGCCCGGAGC

TGCTGAGTGACTTCAAGAC

TICCATTATGICACCGGGG ACAACATCAAGTTCCCCTG

GAGATTGGCTCGAGGGTC
9 . GT GITAG A
CGACCCICGCTAGCTAGI

CAOCTCTTACCGAGACGT AAGCAGCAGGACTACCAA
O AGC AAACAAC GG GC

54 TT GCGGGA.AIGGCTCCGAACCCCCACTCAGCCAGATC
TAACCACATCAGCTCCGIGTGGTGGCCATGAT TOT ITACCACAGAGGAAGCTT

CCCCACTCAGCCAGATC:A
54 GGAGICAAAGC.AGCGGGTATCAAGCGGGTGGAATTI I ACCGAAAAGGACATCAGGGTCGATGACTITGCGGG
ATGGCTTCCAATACTCCCC AGCGGGIGGAATTICTTCT
3 circris 1 CI CC A G

4 C.TACGCCGTCGCICTCAGCGCGGCGCTGAGAGACT AGGAGGACAACCGAGCZCTAGCGTC.ACCTGGGGGA CC
GCGGCGCTGAGAGACT
54 MCCAW"! I GGCAGC:GGCAGTACCTCITTAACIGGGC
ATCGGETTGGITTACGGTAGGCGGTCGGGCATGACi CTIATCGCCCAGGGAGGT
*FACCTCTITAACTGGGCGG
5 . CiGT ACATG A I
.

TCGTAATGAGCAATTCCGG

I AACACACAA TCAA IGGACCCATCTAAGT AGAACCAGTTAAAGAICTC

TATGGGTGIGTICGGGCAAATAC7GTGIAACC1GA T CrTCTUCTGIAGGITITA TIGGATAGTAAGACA
ICC

54 ACCAATAAACAGG1TTGTCCCAAAAGTTACCTCATAC GATGGGGTCITCAAAACTACTC.TTCGTGATCCAGAC

SS CCFGATGICCATGGTATTGICAITAGOAATGLIAGGI
ACrGATCCAGAI ACATACACCAG IGAAGTAT AAGA AGCAA ICiCTAGGTACTCAT
O ACTCATG GAAGTTTGATACCA
TCCACAGGACAGGAGAGA G
SS ACC1AAGCMCAGTGACi CGCTIAGGC TGATGAAAG
GTIGAGAAAAC:GAAACAGACGGTGGGACT MT& GIGCATG ICCAGATTITAA ITAGGCTGATGAAAGATAC

55 TGCCATGIGT AATCGCCCACAAGCCAIGTAIGTI CCA AACrr GCITCAAACCCCAGTG I

AAGCCATGTATG1TCC.ACA
SS GIAGGGTAGrGCICTGGGIGTIATCACAGGGCAAAG
ACA1CAATAGGGCGCACAAATCCITTCCTCr TC11 AA CAAAACICTIG TCAAGATC
3 Cr TTACTGG AGA
TTATCACAGGGCAAAGCT
55 IGGAATAGTAAACACCTGATGTACGTAGICTCAAC.AG
ATTACCCC.ATAAITTGCGAAAAGCACTTGGGTAATA CCIATCCAAATAGAGATIT TAGTCTCAAC.AGGAGG
AC
4 GAGGACA . AGGGCTTC AGCAA A
55 TGTCGTATGTCAGCAAAACAAAC.ATATGCATACATGG GTTGGAGTGTATCGCAGAGGAAGTGAACCCGACAC

ATGCATACATGGTCCCATT

TITCAGCTTTAGTGATCAT AGAAATCATGATGACTTGG

SS CGTCCCACAATGGATGITGTTMAGTGTGTGTATTM TGC.AATTGCAGAACACTTCAATTCCATCAAAGTGCr AMAGTAAACAAAGAACT AGTGTGTGTATTAATCCAT

55 TGIGTTAGAATCTGIGGAGGCTAGACCAATGGCTAG CTAAGCCCCCCAAC.AATCITGTGCAAACATATATCTA

SS GGGGGIGCTACTCAAATTAATAAGTACTTTACCNICA ATGATTATGCAACAAAAACTGGGC.ACCAAAGATCTT
AC1TTACC.AACACTATC1TA

AACCIGCTGTATTGAGIGACAATGCCCTTGTACTIA TAGAGAAACAAGGCCAAG GGAGTTTAACATAAATCCA
O AAAT (CAUCA GAGAATAATGA TT TAATAG
GTCA
56 GGITGCZTAGT1TC.A1ICTCATGTTAAAGICCGAT1A
ACAAAGCATTCGATCAAAMCCCGCACATCAGGATC TAAGG AI GAAT TAAGAAG
AAAGTCCGATTAGGGAAA

CTTACGGIGATGATTIGAT
2 GC TC.ATGGGC.ATAA TGT
TGGATCSACAAGIGTTGG
56 AACAGCCCATGGGTCCAGCITGCAAAGCCGCTATGTC TCAlTGGTCGAGAGAACC.ACGCTCGACCGAAATCCA
AAGTCAATGGCTGAGTCA
3 C CGAAAG , CC
TGCAAAGCCGCTATGTOC
56 ACTGAGCCC.TCCTGCGACAAACCACAAAAGATGCTC 1 TTGCAGATGCAGCAAGTGTGGAGCATAAAAGCGCT GCCACCCGAACATTCAAC
ACCACAAAAGATGCTC.GAC
4 GACT i CGTGGT C T

CGACCTCAATCGTCAAGCACTG
CG CGTAC GACGTCCTCGAAGCCTGA
ACGCGTCTCGGAAAAACG
56 CGATGAGAGCGGAATGICCCGAAACC.AAATICITCC CCGGGGTTGAGTGTGACTGTTCACTGCCAAGCTGG

AAACCAAATTCTTCCGCGC
56 ACC.AGGATACCCCGTGGATTGICACCATGTCCACCCC AAGTGAAAACAACGC.GGCGCGAACGCCTGCAGATC
TGATCCAAAGTGGGCTGT

CACCATGTCCACCCCATG

ATGGAGACTCTCTTCCCTA

TACTGCGACTACTTCAACA

TATTGTGTGAAGAATCATT CCAGAGGAGGAGTTACTG
9 "TTALTGI AGTCG GICA I

TGGAAGCAATTAGTCTTTC
O CTFTCTCC TGCGICC
TGCAGCACTTAGTCCAAG TCC

CGGCTGCGCTATATTMA

57 ACAATCTAGTAATGGCTGAATCC.AAAGAGGTAAAGT 1 AGTAGGATTFTTGGATGAIGCCACTCC.ATCTAATGC GGGAAAGTCGTAMCTG AGAGGTAAAGTAGTATCAT
2 ACTA TCATACAIGA 1 AlITCGC rrr ACATGA
i ACCFACACCAAGACCCCGCAAGTGCTTGCCTCCTCC ATCTTCCCITCTGTTGCCA
3 CG 1 1T.3 , G GI
CTICACCTGICTC:CCCG.

CCAGTAAGAGAAGAAGA AGAACGTCATATCCTCTCT
4 TCFG ' GC.AGGCCC AGAGG G

TGGGAGAGTATTGGTAGC
S CAACGC CACA TTT
TGCAAACAGTAATCAACGC

CGITGTTGAAGGIGTTTG ACTTGGGTATTGGTACGG

TTGTAAATCCAGCTTTTAA ACATTGGTGATCCAGAATA
7 . CAGAATACAT CTCGCG CC TGGT CAT
57 CTGTCTAGAGGAGTTGTCAGCTGGTGATTTGACAGA TATTGCATTTACAGAACAGCC.ACGTAAGCGGCTTAC
GGTGAMGACAGACATAT
8 CA TATTCAM ACCiA AT TTTACAAMCCCGCCITTG TCAAT

ATCAGATGITCATTGTIGGITGTGCCACAIGGTF TA AAACTGGGTGACACFGAA
9 C:ACTGAAA GCTAAATCCC CC ITGGCACIACTGGICA A

GTTTCTTTTATGGCAAGAGGGTACTCCATCTTT GGACATGTGCGATATTGG TTTAGTACTTTTTCAGCGA
O AGCGAGTC ACAAAAC.ATGTG G (SIC

TTTAGAACCTGATAACAAT CCTAAAAATGGATTAGGG

CCATATAAGGGCTIAAATI
2 TTAAATTITTGG GTATAGGTAT TACCIA.U..1C1GAAAAGC TITGG

ATAGCACAAAGTAAAGTGT CTAGGTAAG TACAGTA.AC GG ITTC:FATITGAAGTTFG
3 , AAGITTGTACAC GCCAAAT CGTTAT TACAC
=

CICTAGCAGCGGCCAACT

AAGTGCGT/TCA TGOC TIGAAA MG IGCCC AACCCIAATGTGGTC.T.ACC
S C.CACCG GC A G
58 ACACiGACCICAGGCCiCiGACiGC:GAGGCTA FGT T CCCiC A TGTCACCATCTGA MI CG

CCAGGCTATGTTCCGCCA
SS AGGGCGTACICiTGATGGCTCCGCCATTGGCFGCC.ACT
TGTTCCTTACCCCCGGTCTACCTGGAAATAGGGAGG
7 TT GGGTAC CCTCTCGTCAlTGAGCGG
GCCAITGGCTGCCACTFT

rrACCATMCCACCAGCGGTACITCTGAGC.ATGCT GCTACAAGGITACIGTG6 8 GGGC CX-.C.G TTGCTGCTCCCGCCTAAT GC
58 AAAACCAACACCCGTGGGICA TACGC1CACi itTAGG GGACTGCGATT CGTAAGGCGGGGT7 GTAGGAAACA

TACGC.TCAGTGTAGGCGC
59 CTGCCUCCAACCGCTITACCTITACTGTGCiCGOTCA
GAGGCCIATGAGGGGICT
O
TGCCAGGGACCAC.GTAGGAATGTCGACCCCGCTGAA GC G ATCiTCGACCCCGCTGAA

CTGGGCTGGAGGTGCGCTACGGGGGCAAAAACCG AGACTAA TGGCrCAGAGC "TITTGATGCCAGICAGAGC

2 CA . 671 TGGTGAGCCGGCTCCT
CCGGCCATAACCCACCA

GACTTGCTCCCGCCTCGGTAGC.ACCTCGTATATGC.0 ATACATTAACCCCCCGGCC
59 GCC.AGATTTGCTGTCCGTGCAACCAATAGGCCGACC. F
CTTGACTCAGCCACAGACGTCGACTGCACAACGCCG

CCAATAGGCCGACCTGCC

S
AATGACAACCCGGCGCCCCGGCTTCGCTGCTTTCA G TTGCGTAATGCCTGGCG CGGCTTCGCTGCTTICA
59 GGAGGACCCGACTAGTGGTCTGCTGCGGATGTATGT GGTC.GTTGTICTGGGGTGAGCGTTCiCCGCCCITACI
CTGCGGATGTATGTGAGCT

7 TG rrGcc CCGAGCTCATGCCATTGT
GCGCCACACTGAGAAGTG

TTGTACGAGCTAGTGGAG
8 AGGC AAT GCCATUGGCCACGTACAG GC:

TIGTAIGCAGGTGITGTG

OCTTGGCGCGCTICCT
O TGGGCCTGGTCACGCCAAGGGGCTGGACCTCGTGT GCCGCTGCCICACGICGTAAGCGACC.GOGGTTAGC
CCITCGCCCCCGATGT GGGCTGGACCTCGTGT
60 GGGAGACCCGCGTETTGGTA i LI i IGCCCTCGAACIT
1TACTCCAGCACTGCCCGTCATGGTGAGCTCGGCAG CTITGl.u_ ILE, AACITGAG
1 GAGT TC , CCCTCGGGCTGTTGGA T

TGTGCTITGGC.ITTCTCTCA

AGGTACTACTAAAGCCGG ACTGCTAGTGAC.CAACTGC

60 ATTGGCTGCATATCGACGACAACAATATACCAAAACG ACTTGACGTTCGTAATGGGC1TAAAAAATC:CGTTGG
AACAATATACCAAAACGGC

CATCGCCGGMTACAAA

TACCGAAAGACACAGTGTT

CCTTAGCGTGTGTAAATAT GGGTTGITTATTAAGTTTA

60 AAACTCTTTCATATGCCGTTCCTATGATTCGGGCTATA ACAAACAATGGTAGACCGCGC.ATACGAAACTGTGC
TACATTACCAAGCGAAGG GATTCGGGCTATAGAGTAT

60 ACAAC.ATCCAACGCCGTTTTAGTGGAAACGAAAAAC
GTTGCAAAAACAAAGACAG7TTGAAGGAATACGTT CCCGAA 1 II ili : I ATTGC
GTGGAAACGAAAAACCGT
8 CGTC CAATTAA.AGCTC CT C

TOTTACTGCTATTGGACG

CGATGAACACCGTITTGCACT/sGGCC.AACGMTCC AGTTCGACATACAGTAGA

GGCTATGTTACCTGCACT
s 1 I AT TACiCGC 1 CC , ACT GC

CTAGCTCCTGCAAACTTTCTGGAAGCCAGTTTTTCAT GCCGACTAACAATTATACT
TTTTTCATCGTCGTTACTTC

61 CCTCTAIGGTTGAC.AGCACATITCTTGCTGGGTTATTT

3 AAC_ATCC GATCG A CC

TGTGTAAAGGTAGACAAC CAGAAACTATCCGAC.AGA

TCACCTAAAAACCTGACTC
5 . C GCACGC A
CGITICCTCCTICiCAGAC

ACGAACKGIGT TCCGICG

CACGGGGCGACAACACCGTGIGTACCC
7 CT GC CfCG ICCACAAACi ICGGC

8 I CC; GCCGCICTCACAGCCI CA
TCCTC:CCAGTGACCCT

TGGGCGGGTATCATCGGICTICGGACAGCGTTCCC

"FCTAGGTITGCCKCCCG

AGCTAGAACGATICGCAGITANTCCAGTAITIGTCT CGAGAGCG TCAGT ATI AA
ATIAGATCGATGGGAAAA
CI GAAAAAATTCG ACAGCCTTCT GC AATTCG

TCAGGICAGCCAAAATGCCIGATGTACCATT ATAAAAXT ACAC:CAAGGAA *FTTAGACAAGATAGAGGA
1 . GAAGAGC TGC GC AGAGC
.
62 ATCTGGCCTGC3TOCAATAGGCCAT CAA] GAGGAAGC

CCAICAATGAGGAAGCTG

GGACCAGCGGCTACACIAGAAGCCAAAACfcrr GC TAGACCGG TICTA IAAAA

GAGCAAGCTTCACAGGAG
62 TGCiAATATCGCT GGTGAICC1TTCCCACCAGGGATTA
GIAGCATGACAAAAATCITAGACiCCIGTATTGATAG ACCATACCIAGTAIAAACA
CAC:CAGGC1AITAGATA ICA

TTCCTTTGGATGGGTTATG CCATCCTGATAAATGGACA
S TGGACAGT TAATTGCCTTAC AAC CT

ACAAAAGGAAACAACTCAGGAATC GAGCCATTTAAAAATCTG ATM GCAAGAATGAGGGG

62 CCA lIGCTCTCCAKITACTGICiATAICAGGAPACi ;AC
CTAGIGATITIAACCTGCCACCICTGACAMATCAC GAACAAGTAGATAAAT TA 'FCAGGAAAGTACT
AlliTT

ACTGACP.A1GGCAGCAATTATTCCTGCTT G TATOCIGG TAGCAGTTCAT TACIAAGCAGAAG1TATI CC
9 ATTCCACiC ATTC:CCG CT AGC

TICGTAATAACAAAA TGCCAGI GIAAAACACCATATGI AT
9 GGATGGT CTCITTCTCC G1n:C/4 GGAAAGCTAGGGGAIGGT
63 GCCAAGTATTGTAGAGATCCTACCTATTAGGACACAT AACACCAAAAAAGATAAAGCCAC:CTTICTGGGGCTT
ATTACTITGACTGMTTC ATTAGGACACATAGITAGC
O AGTTAGCCC . GTTCCAT AGACTC CC
63 GCAGAATTC:TTATTATGGCTTCCAC.CTIAGGGCAACA
ACAACTGCTGT1TATCCATTITCAGTCT6TCGAGTAA TMCC:TAGGAITTGGCTC CTTAGGGCAACATATCTAT

TGTAGTGCTAC.AGAAAAAT

CTGGTMGCGATTCTAAA TGGAACAGGACCATGTAC

TGGIGGTAATAGC.AACAATGAGTCCCACTICTCCAA TAPACATEITGGCAGAAAG

CAATGTATGCCCCTCCCA

CAATGACGMACGGTAC
S GTACA GCCAGG
GCAGGAAGCACTATGGGC A

AAGAIGGGIGGCAAGTG TGGATGGCCTACTGTAAG
6 AGGG CTCa GF GG
63 AGCAAGCTCGATGTCAGCAGMAGCATTTCATCACG AGGGACT11CCGCTGCiGGAC1GCAGGAIC1GAGGG
GAGTGGAGGTITEGACAGC TACiCATITCATCACGTGGC

63 CGTCAGAGATTCCCAC.CICGGTCAACGACATGCAGIG ACGACCITCACCGCGACTCCGCCTCGAACATCTCCG
8 Ga TGTC ACGCCTGGGGGATGCT
CAACGACATGCAGTGGCT

GTTTAGGCGCAGGMCCG

O CCCCCTTCAGCACCTACGTGGAGCCGGTCAGCAGCT CGCTCTCAAACAaTCCOCGAGGGCCITTCTGGGGC
GCAGGGCCACGGACAT GAGCCGGTCAGCAGCT

CTTCGAGGGCGTAGTGGC

CGCGACAGGAACCGGTA .. GCGAACTCCACCGAGGT
64 CGCGTACATGGCCAAGCTCCAACACAACAAACTCCC.0 TCCITGTGGGACGAGAACAGCCAAGTTAC.ATC.ACCA GTGAGCACGGGGAGGGT ACACAACAAACTCCCCCTC

TAACGTACATGTICTGCGC

G GATCGA C
AGTACATCCCCGCGTACG
64 GCTCCGEGGACAGTTCTCC.AGGACCTCCGGGACTTC

GGCTOCCITGTTTCGCC GGACCTCCGGGACTTCGA

TCCTGAACACGCTAATGTG

64 CGATACACGGCAACCCCCGGC.TCTCCTCCAC.CCAAAC 1 CTCTCCICCACCCAAACGA

9 CIGGIGTTCGAACTGGGCCACACCGCGTCCICCCiACA CGAGG GICGGCCACGCGOTA
ACCGCGTCCTCCGACA

O CCA TAC .. CGGCCATGGTGCTGCA
CAGACGGAACAGCTCCCA
65 GCAAGGGGGAGCTGCTGATCTAATGGGC.GTAAAGA GAGAGTCCGTACCCGCCCCACATGAACTACGGGCC

ATGGGOGTAAAGACGGGC

CTAAGGCCCACCGICACG

3 GaGCTOCCGACACAACGTaGGGCGGICAGCGAGA CCGAGAACGCGAGGCCCATCGAAACAGCCGCCIGG C

CGCTAACCAGCAGC TCCA

CCACGACATCCGCGGCTITACACGA laGGCGCTGC G1GC TGTCCi TACGCGGGGAA CACGATaGGCGCTGC

6 CG CI ACA AGGTCGCTC.GCTG ICC --CATCATCICTAAGCGCGC:G
65 ACGAAGTGAACCAACTGCCGAATACAGCGTC.LIGAA
7 CACGT GCGCCiACITCGCCCAGTTCCGTAGTaCCGMAGA
CCCAACGCAACGCCTAC "f ACAGCGTCCIGAACACGI

ACGATCGTGCTGCZCGAGCGGGTGGACCACGICA CAC GCTGGITTGGCTCGTCC
CGGGTGGACCACGTCA
65 Cia GACGCGTGC ITTIGITCCGCCAGCGCGTG TAIGA
GGACAAGGCCGGGTCCCGTATCAGCCTCGCAGACC CGCACCAAAACiC:AGACiG

GCCAGCGCGTGTATGAGT
66 AGAGOACGCGTACCICIGCCCGGAAAGACCACTAG aCCCGGAAGGTATTGCTCC.;CAGCTICAIGGAGGG
ATGGTAAACGCAAAGCCI CGGAAAGACC:ACIAGCCC
O CCCG C.AGCC .. CC

1 C.CACCAACGTGCCCTACCCCCAGGGGCAGGACGAA AGG GCCGGGITGAACAGCC
CCAGGGGCAGGACGAA
66 TIGTCiCGCCiAGGGGCTTGGCTIGGGGGATTCGCAG

GAGCGGCCATTGGGTTC CITGGGGGATTCGCAGGC.
66 CCCCGCGGGATCGGATACAAGTAGGTGATAAACAGC CAAGGGGGIGTGGGTCiACGAAGCCCCGGATCCTCG
TAGGTGATAAACAGCGGG

CAGGGCGTTGTAGTGCG

TCAAAGAACCi ICC TGTTG
GGICCATGITCGAGGGCGGCGCCGGCCAATICTICC C GG

56 TcGAAGGCGGAMAGTCGCGCGTCCTG TCrCTGCAG GACCITCA1 GACCGCGCT
6 T C.G T I GTGCGCCGTGCTGTGICTG ATCGCGATCGGGGGAG C
GTCCTGTCTCTGCAGTCGT

AACACCATTACGGCCCIG
TGCAGGCGGTTGTCGAGGCTGCTTCGGCGGCTCCT C C
TGCTTCGGCGGCTCCT
66 CGTCAGCACCTICATCGAC.CTGGTGTAGACCTCCAGG GTGACGTCGGCGCGACTC.AGGGCTACGTGTACTTC
8 GGC ! GAGG CCTTGATCTCGTGGCGC
GTGTAGACCTCCAGGGGC
66 GCCGCACCTCAAGGAGG AAC:TCCAATCAGCGCCCGA
9 AC AGGAGCCCCACGTTCTCGAACAGOTCGCCC.AGCC
TTGAAACTCGCTGACGGC CCAATCAGCGCCCGAAC

ACGCGATGGTCTCTATGTC
O CC CC .. TCGCGCCGTAGTCCAG

TTTGCCCCCCACAACCC

TGGTAC.CGCCTCAAGCC

AAGTGGIGGICCATGACG

AGACGAACTCGAAGGCGG

ACGTCCIGCGGCT CATGAACGACIGGGCCG

G GCTTISTGCGCATGCCCCGTGCGAACGGCGTCCTC
GGACTGTCCCTCGGAGAC ACGAMTCAAGGGGCG
67 CTIC.GCCCACCGATICCTGICCAACAATCTGTCGACA

AGGTGGGTGCTT

GATGCGACGCCTGTGC

GGCGTTCAACGACGCC
67 CLIGCATGCCAATMOCATGGAAACCCACGMCCAGT CACACGGCGTGCCTGAAGAG1TC.AGGTCCCGGAAC
ATGATCGACAGCACGC.TA

ACCCACGCCCCAGTG
68 CGGCGGGATCCATGGCGATATACAGCGCTCCGATCG GCGAGCACCGACMCCIAGATGGGGCACACCGGAA
O G TC CTGCGCGCTGTAAGCA
ACAGCGCTCCGATCGG

CAACCACCTTCGCTCCC

TTGACGTATCTGTGCTCCA

CAGCTCGGCGATGGTCA

OCTIGICGCGGTGGTTC

CGATGGCGTCCACCAGA

S CC CAC GCCCAGGGCGATCCIT
TCCAGACGGACAGCACC

TGCGTTATCACCTCCTCGCTCGATACAACGGLIMG
6 C :CXC GGIGGATGGGGACGGAA
ACTGCGATCACGAAGGGC

7 IGCACAAMI TACGGGGCCICi TCCMCGCCGACAAGA CGCCII:C.CTGACCATGCGT
CGGTCCAAC:AGCACCG C(3cAcrc:cGrcGTc3Tcir FCC ISCGCCGAC.AAGA

C MC TTCCALITTCACGCCCCG
OCCATGTTTTCCGCGGC

GAACCAAAGCCTCTCGTG

CTGCGCTTCAAGCACGG
69 ACGACGATGCATGTTCGGCGCAAAAGATAGGTCGC
o TGGTGAAGGGCGACGAACGGACACCCAGATGCGCCF CGGG TGCFTGGAGGCCCTICTG
GACACCCAGATGCGCCT

CGTGTTCTTGCAATACCCC

ACAMAAAATCACCOMC
69 CGTGGC.ATTCAGAC:AGTACGGGTCGAACCTGT1TTIA
TCGAAMTGTITTTACGGG
2 MiGGC TGCTCAGGCOGAAAACGCCATAALICCGCGCGGCC
CATGACAAOGACGMCCT C

CATMCCAGCCCCCAA

4 AAGTCGACACACCGCMG GCOGAAGGAAACCiGGC
GCATGCMCGCAAAACGCCCCAC:CACAACAMGC GGGGIGGIGGTAGTGGT 434:GGAAGGAAACGGGC
69 CC.CGTGCATGAAGACCT GGATCCACCGGGGTGTTGC
ATGCAGAAGGGGTGCAGGCAACGGACGAGATCGC

CGGICACCiCCCACTAI CA CCACCGGGGTGIT GCA
69 CiGGGCGGGG ITTGITGTGAGAAMTCAGC1 GTCTIC
GGGCTCMCGSGAATC TC
6 GCATCATCCCGMCGCATCCGTCGCC.CCATGCACCIT GC C
GTCGCCCCATGCACGT

ACGCGTGGTAGGICGCTGGGMCCGGGACICGCTG CTCCCCAGAGCC.TGCTGGTTGGOTCGTGCCATCCGA
CCGGGTTTCGTGGCCT GMCCGGGAGCGCTG
69 TGGGATCMCCATMAAGCGATGGGACTCCGCGTC 1TTCGCGGGAAGAACCTIMGOAACACGLI1'CACCC
Il GT CCCS CCTGCTAGTISTCGOGG
TGGGACTCCGCGTCGT

CCATCGGAGGCCCCCCAGA IIGTACGTCT GCTITCG CCCGATCACTG TG TACCAC
9 TACG GGC TTCCAGCCCCC.CAGCA
70 CCAAAACAAACCMiCIGGCGICMTCf GTATGICCI CSC.
TGGCGCTCTCFGCCAATCGCAGGAGCCGT MST
GCG CC GCAAMCGCGACTATGC

1 GGTA ACAG GGICACAC.CCTAAGCGC
GCTGGTTGGGTGGGGGTA

ACGAAGTCITCTISTGCAGCCAAAGGTCGCFGAGGC TITACGCf TTCGACCAGAG
ACTIATCCIGGGGAGAAG

70 ACAGCAATACAATACAGCAAACAGTGA itIGGAGGG AATCGTGC:Ci I/TAM-FM
FATACCCCAAT TCT TGCA TC Tat GCATAGAMIGGC

CATCTGGAGGGGAACTIS
70 CCGIATCCCAGIGT1 GCTIGCTICAGIGTCCATIT !SC
CAGGCACTCCTGGTCAACAGTCGCCA TCTGCT CGA TC:AGISTCCAITTISCCAG
4 CAGES CA CA G TCC.C.CACACC.CAGGAGAA G

S TCACAGCA TCATTGT
ACCAGGCCATTCAAAAGC A
70 GCAC.IGGTTTACAATTATTTCTGGCTGCTGCTTTATTC
GIGGISTCTAATCAGCAGACAGACAGATCGITGTC:C TGLISMTATTCTTCTATC

70 GAC.ACCGCTIATTCACATACTGTAGACACTACAAGAC TGGACCATGMIGAAGCTGATTCATACCTCACIAAAA
GTGAAGAATTGTAGCATA
AGCG TGCACA ATOM' AGACACTACAAGACAGC.G

CAAATCATATMGCCAAT C.T. TAACAAATTTTTGGATG

TGAATGGTATAAAGC:TAGAGGTGGCACAGACTCGT CGTC.TCGGACATTTAAGA

GGCAGCCAGTTACCCTAA
71 GGGGITC.TCTC.CCTCGTGGACAGCGCCCTGGTAAAT AAGGTACIACCIACCGCAMIGGTTTAGGGGCTMIG
CTGITACCTACCGOCTAG
O CT GCGA C
CAGCGCMTGGTAAATGT
72 TGCCTAGCMGCATTGITTAATGATCAAGATTCCTAA AAGGGTTAAACGCGCTAATCCAAC.ATCGGGIGGAC
AGAGAAAAGITCCTCAAA CAAGATTCCTAAAAACATT

2 CiAC TGG G GCiGCA
ITGGCACAGGAC
71 GTCAATTGTGTCCarOGGGCCGTAGGTGACAATGCT GGCGAGGAGGAGACTGCCITATCGCTCCIGTATAG
.. OCG TAGGIGACAATGCIG

GTTAGTCGCCTGGGGCAACGAGCCTACCTGAACAC TTGAGGATGA TCTGCAGC

CGTAGTGACACAGACCTGC

CTATTGAGCCAATTGATGC GAACTAGATGTACTGGGT
GTGA ATCTATTTCCTGCA , AATT GA

AGATTTTACTTAC.C.CCCTC ATAC.AACGOACATTGAACA

71 AGGCiATTTTCAGCATC.ATCAAACTTGATTGAGGTTTC ACTGAC.CAAGGTICAGACITAAGCAGCTGTGMGC

71 ACTCATGTCTGC.ATCCTCAATGTAAAGCAGATGACAA
CATAGGCCTGGGAAACTTGAATCAC.AATATCTAATG AAGCAGATGACAAATGTCC

71 GGTGIGCETTATiKAACTCAAKTiA-WE'ETAACTTTACCA
GAGGAGTTTCAGC:TTGC.CTITAAATTCTCAGGGGTA CTAACTITACCATCAGTCiTT

GTACATACTGTCCATIGCA AAATGTCCCTCTAAGGATG
O AGGATGC TICCTGCCAA AC C

CCAACATIGGCAAACGTCT

CCTACTTTAGGTGACATAA TATGCCAATTAATTTACTGT

AGAGGAGTGTAACAGAG CGAAAGTATGTGATAAGTC

72 AGGAGCTCTCTCATACAGATTC.AT7TGGAAACACAAC 1 TTGCTTTCTCAAATTGAACATTGGACTTTTC.TAGCAT CTGATACCAGAAGAAGAC
4 AGACAC I : AG1A1 AAAGTCAC CA
TGGAAACACAACAGACACT
t ITTGGGTATTGGTACGCAGCAGGIG ATAGAGCTAG TAGAAATA TATAGGAATIGTCAAA TAT
5 AAATATCFGGTG 1 TAATAGG ITT , AACGAG CTGG-TG

AAGAACACGTGACTTATA TATTCAGCAAGTTAAAACA
6 CAACCA ' GGTGCA CAGAC ACCA

TMGGGAACTATTCATACC

72 GGTTTCCTTC.ACATATTCATCCGTAGAATCTGGAAAA TGTIGGACATCCATACTTTGAAGTGCTAACTTTAGG

AATC.CAAATCAGTATCCTCC.AGAGGATCATGTGCTA CCAGATCCAAATAAGTTT ATCCTAATAGAGAACGTTT
9 . TITAGICTG C43ICC:4TC CCITT AGICTO
73 TTTCTGTAAAAATC.AGCAAGCTGTTCTCTTTACAATGT
GGICTTAGCCTGGTTIGGAGAAGCTCAAACTC.AAAC ATIGTAAATACTITCGTGT CTUTTACAATGIATATITT
O ATATITiGIGGC CGTGC GTC1 GTGGC
73 CCGTTGCGCACTAAATAGAAGTGATGGCAATG !TAG
CIGGAAAGGITIGTGIAGACAGITGATATCACGCA TAATICGTTGCTATGGITC.
AIGGCAATGTFAGATCTIG
1 Arcrrerr AAGTAGGT C IT
73 GAAGAGTACGAACAGCAGTAGCACACCTGTTGTCAC TGCTGITTGCAGA.ACTTGGTGATCIGCCATGGCGAT
CCTGAAGAGGAGGAGCA
2 Acrr AG A
ACACCIGTIGICACACIT

TGGTCTTCTGTAACCTG TATGTIGTGACTGAAGCT ATGCGTAGATITTAATTCT

73 ATAGCGACT ACCIGAGAAGIACT1 TT AAACITICAGCi . AGAAACATCTG
VGGIGGAAAACCAC TACACCGTAA GTGAAGGAAAGAGCAAA TT AAACTITCAGGACAG IC:

73 IGCTGAAGCTGITCGATGATCTAATA I GGTGAG ?TIC ACM I
GGGCATITGACAATGACA.CATCMCAGCT AA TAAACG TICAATG ICTAAI ATATGGICi AG
TITCCATCA
5 . CATCATGG AGCATAAT GTAGC TGG
.

GGCGTCGAAACGC7C1 f3AAAC:ATTGGTGTATTTG TETCCAAATAAATT ACCTGI
6 TACCTGTC.ACTG GTCAGA CGTA CAM

CMGCTACACCAAACGGA GCiGAACCCACCAAAAGGA

CCiCGACTICAACGACITCAAGCAGMGCAGGACCE AGACGATCAGTTICCCCEG

GCCAACTGCTGTCCCAAC
73 AGCGCGOTTGATCTATACATTMAAACCICCAAAAC AAACCiGCAC.ACTGCTGAAAATATCAGGAGCICiCATG
AGTTCTGTTAATCAAAGA TTAAACCTCCAAAACACAC
74 TGG1ACAG 7 CTCCGGTAAAGGTA1 AGGTCCGGAAGA GTGGTACAACCCACTGITAAACCI
GTCIAGAGGGAC AGTAITATATACTIAGGTG
O GGATCAG ACCAAAT GGTTGG
GGTCCGGAAGAGGATCAG
74 1CCGCTGCATCTGTA 1 ATA Cr i CATCTI CCI GATCCAG
AACTCIAOGGGCTACAGGAGGAAGGTG IAATITCT
1 CACTI AATAAGGCT GGAACCCGCTATTCiTACC
CTTCCTGATCCAGCACTT

TCTITGCAMGAACIAGCACACCCCATAAAGTIcra ACITTCATAAGGACTMTG TCCGGACTTFAATGTAITT

TGC1CCAGAICCCGCCITFACGTATAI11'CCGCIGGG "ITTGANITTGAAAAICCT-G

74 CCTCCCCC:TAATCCATCTACTATAGGACCCTGATGTOT
TTGATGTTTTTGGTGACTTC.AGTGACTTCACGAAAC TATGATTTAAGCCCCATCO
4 CTGATG . ACATCATCC C
GACCCTGATGTGTCTGATG

C:AGGGTAAGAGGTATCCA

TGATTGTCCTCCAATTCAG

74 TCACAGTTGAACATTCACAGTACATACTGTGAACTAG TGCETTAATTCTACTGACTC.ATGGGAGCTGACAACA

74 TCTGAAC:TTGCACTTACCTITACAACC:TAGGGCTACTT
ACIACTGACTCAMTC.TGGCAGCACTCCCGGTACATG

CCTAGGGCTACTTTCCGA

TGICTTTTCTATTGACAAT AAGGAATACAAAAGACTA
75 GTACTGCTCTC ACACCTTGGATATGGCCCATGCTACT
TACIGGAGTGAGCAAGAAGAATTCTICCITTCTGCT GAGGACCATTAGGAAMA
O CAAGA GAATAAAGC CTC
GGCCCATGCTACTCAACIA
75 AC.C111113TCACTAGGAAGGTC1ATCTCTGATAATC1 lICAGGATGTAAGCMCAACATOCITGITATGGCC
CAAAAGAAAAAATTC:ACC ITICTGACIAATCTTGAAGC

GAAGGGITTAGTGTTTGTGAAAACGCCACATCAATT CAATGTATTITcrtGGAGI GI CATGGAATAGGGirf AA

CTAGGCMCTGCTAGGTA
3 CTAGGTAT TCAAGCAATGA , CGTGTTGIGTCCCTTGIT T

AATGATCTAGMATAAACIC

75 GACTCAACTCAGCAACAGTCTGTAAGCGCCC.TAMAC 1 C.AAGGCTICAAGCAGTTTCAATTCIGTCATCGAATA CAGAGGATTGTAACACAG
GAA ' AGCGTCT CIA TAAGCGCCCIAAMCGAA
75 GCAGAAATATGATITCCCAGTGTCACTSTOTACAGG AATACtailittitGGGGGAAAAGT711TAGTATCTT
ATCATTTCTCATTGCATTG CMCTTACAGGGTGTTCC

75 GGATCTTAAATCAGTITCTCC.CCTTCACAGGAAACTCT
ATTGGGAGCTGriTTAGAAAAGAGCCCAAGTCTATT CAGITAAACCTGGICTIAG CACAGGAAACTCTGACAG

ACTACAAGAGGCGGTCAG

CGGCCCACAATCTTCGATC

GAATTTGTATAGACACTGT
76 TGCTICTGCATGCACACTTGCAGTCCCTATGAACCTCC AGAGAGCTCACTACCTACAGAATCCAATITCACCTG
AATGAAACAGCAATCTTA
O T TAATATTAGCATC GACAT
CAGTCCCTATGAACCTCCT
76 GGC.GGGATITICAAATTCAAATTGAGTTACATTCAAC.
CTGGTGATGTAAGTCTTGAATTTGCAATCTGGATCA GTTACATTC.AACAAATCAG

TGGGAACAAAAGGAGCG

TTAGTACCAGAAGCGGTAC
i 'TIGGAACTCAGTTACATTITCATGCITTCATAGTAGG 'TGTITTGATG TM GCAGA
3 GAM 1 GATGGICIAC , GCAAACGCAP.ACGCAATA IG

CCAATATCGAGTGTTTAGG
4 GTTIAGGC ' ATAAAGGATGACC TCCTAAGGTGTCTGCATG C

TATTGGGATGTTGCTAAAC

CAGACTTCCTTAAAATGG GATGITTAIGGCGATMAT

GCTCACTTAAATGTTATGA GAATGGCAATTAGCTITTG
7 . Gal-1167G ACTAACCAAA ATa: .16 76 GGCTATGTTCCAAACAACITACCITGCTGGAC.AGGTA TCTCTCGAACGGAAGCGGTACGCCAAAATACCAGA
ACTCGACCITATAAGTGTG GCTGGACAGGTAGTATTGT

GIGCAGAAATIGTAAGTCATGAAGGCAAATTAACA GITGCTGIACATG ITTIGC
9 'FGAA GGiCAATACCAAAT T
TCGTIGGAIGATGCTGAA
77 GCAACAAAAAAAAGCCTTACACCTCTGGAGCCAGCA AACTTCTGCTTCAATTCGCACAAGACAGATCACAGG
O AATC:CTT CAATT TCGCCTGA
TATTGAACICA TGGAGMAGCAAATCCIT

ACAACAAGTCCCCAATA GAAGAAGCAGACTGTGTG AGITTAACAGCMAGATG

77 ATICATTAT GGCCIGTICCCATGG 1 GGAATt GTAGAAG CSCATAACACTGCG

77 GAT GG1 GTTICGGAACTGATCGGG ICCrrICCACI KT TCAAGAGGAAGCT
AGGGATCCACACATIAMICTAT
3 , CTC TCCICCAACA GAGAGACCCAGAGCCAAA
GGTCCTCTCCAGTCTCTC .
77 TT GCTOTTCCACC-AATIGGT1TA1TTGC.;CAATITAC.iGA
GCAAAGGTCCTGAAATAALIGTGAAAGGGTAIGAC ACAAATATITGGGAGI AT MGGCAAMAGGAATAG

CAATCAAGGGCAACGAG TCTGGAT TIC CAGGTGATGITGIGGGIT AGAAGAAATTGAATTAGG

TOCCCTGGA ITATAAA CAGCCAACTAA FATATATT
6 CAGTOG CITTGCiGATCT MC-CG
GATCGCCITTTAACAGTGG

ATGGAGCCIAAGCAAAATCAATTCITCCCAATCiTTCT TCC.ATTAGGAATAGGTATT
AATTAGITGATGTAGAAAA

GTIAAATGGGAGCGCAGATITITF GGAAACCCIAAATCTGAA TCCITATAAAGACCIAAAG
8 CC.TAAAGITTTGG AGAAATTGAAGT AAAGC MTGG

AT1TGGCCAGTATTAATACCCACACAGGT TGTAAAG Tr: ITGTACITGAATAI GTE.
ATTICCATCAAGCAATACT
9 ACTGAC CGTCC.AG TTAGGC GAC
78 TGCATGTGTCIAAATCTICTMCTCGACANIACAGT AAAACGTCAACAAARGCCAGGIATCTTCCCATGITC
AGGGGTACITTTCCAAAA CGACAAAACAGTCAAATCA
O CAAATCAAC TAGGTG TGT AC

All CC7 AACiTTGATGGAAACGTMGICAAAGETG CGTCCGTAGAAATGAAGG

ATTCGTTATGGCTACCGATGAAATAATAGGGATCG A GCATTTGITTTGTATTGCA
AAGAAGATCIGTGGATAC.0 2 GGATACCTC . GCTGGG GAA It 78 GCAGGCTAGGACACGGIC.AC:CCCTCCACTAGTTTGGT
GGCGGCCAACCCAGCTTATGCATGCGAGTCTTCACA GCGTTCGATCAGGTGGAT CCCTCCACTAGITTGCITCG

73 GTGAATTCAGTGCTGG1GCCTCCATCTAAGTTCACAG CGAATG1TGAAGC.ATGTGG1TACTATGGTGGAATT
CTGGACAATCTTTTTCAAT TCCATCTAAGTTCACAGAG

TTACACATTAGATAGCAA

AAAGCTACTTCCAAAGGCT
78 TGTTCiTAGATGGCGTCGCTGATCCAGGAGATCGAGG TGGACGGAACKTMTAGGCGTCCTCAAAITTATAA
GTTTCAGTTAAATACAAGT ATCCAGGAGATCGAGGTA

TCCAATAGCTCCACTTAAT

ACCCACTGGATCTTCTCC

AGCTCAGTACTACATCCIGIGTACA AACACAGGAACAAATGAT CAAATTACCTCATACCACT
8 TACCACT GC TGAGTGAAA AATGTC GC:

GTGGTCCAGAAGATAGAA
79 1 GCGTICITGTITITGAITTCTGIGGAIGTAGAGAGGT GCTGGCCTOGATAA
ICAC:AGGAGACTIGAAAGGIT GCATTICAATGGTICAGA
O TOCTCG AATMCCAATC AACA
GATGTAGAGAGCTICCTCG

1 CCM CTGG , TCC
AGGCAATGTAIGTGCCTC
79 TGCTICC.ACATGUTGGCTCTGATGTTCATACCACGAT 1 TGGAITCCAAGAGCACCTAGAGTCTGGGGTAGTrA CATTCAGAGTGGTTAATG
TGATCITICATACCACGATT
2 TGT i GTGCG AACA GT
79 GGAGTAGTAAACACCGGATGTGCCTACTGGAGC.ACA ACAGGAAGTTCTATCCCATCATITGAGTACGAGCTA
TTAGTAGCC.CCTTACCCAA
3 TGGAG CCCTC.AA A
CTACTGGAGCACATGGAG
79 GCAAAAC.AGACATAACCGCCICTAGAT GTATACATG

79 CGACITCATGTATGGATGCTATITCiTAAGCCAGGGAA AATCCCACTCAAGAGAAAAGAGAGOTGTGACATTT
ACTCATAGAATGGATCAA
AAGCTTG GTTCTAACCAC AAACAAG
AAGCC.AGGGAAAAGCTTG

AATTACATGCAGTTTAAG AAAGAACTGAGCCAGTGT

TGGCTAGTTTAGACAATA GTTGMACTAGTAACTTT

CTIACTCAGGAAAAATAT

CACCACAGAGAAAGGGGA

CATATC.AGAGGATTTGGA CrACITTAGTAGTGCACTC
80 CTIGGTTITGGTGGGIGTCiTTTAATIGTATCTAAACA 1 ATTGTATCTAAACAAAAGG
O AAAGGTGAGAG 1 riGICATTGA AACA
TGAGAG

AAATCGAAACAAACAAATTIGCAGCGAAATCTGAA T AGGTACTGATCCAAAATG
I I GCT 1 ACATGAAGCA LAC , GAAG All CCTGITCTCAGCGGTCCAAATGGGTCCTCAATGCTT AAGCTGTCTCAAATGTCCA
GAAGTAAATGCTAGAATTG
2 AATTGFACC ' AAT AA AACC

GCTGGATAGAGCTTGATG

TGTATGGTITCATC.ATMA ATCCCACTIAAGGAATGAC

AATGGGGAATGGAGATG
5 . ICA CTTIGGG AG CGITGTCfCCT
CCAG ICA

AAAGAACTAAGAGATTTG

80 CATGITrowcaTTCGACCIITTCCAAGCACAG TCCA
GGCCCCGTTCACTICAGAAACTAAGATCTGCATGGC AACAGAAATGGACCAAOG CAAGCACAGTCCAT TRIGG
7 'ETA ICCAA CC A AA

ACATG1TGGAGAGAGAACTGGTCGATATACACGCT ACATCAGAGICACAGTf AA

AGAAATGATGATGITGAT GCTGCTAGAAATATTGTIA
9 Ci ITAGAAGA ITC; TC CAGAG GAAGA
81 CCCGACC:ATIGTGAA1TCCICATAGGCAACCTCCAAA CGAAGAGCAACAGCCA I
TCIAACCCACTCACTAT CA AAAGGGAAGAAGAAGTG
O CATTG GTTGG
CTC AGGCAAC.CICCAAACATTG

GITICAAAACIGGGGAATIGAACCIGTC:AGGCAA TA TTIGAACTTGITCAACAGA
I , CC TTCCGATC GC
AAACCAGCGGCTAAATCX: .
81 CAGCTITTCTGTICCCTGIGITCAGAGGGGAAACGIA A TTCATCG 1 CCAIGATGI GGGAATr GATAIGTGITA

CAGAGGGGAAACGTACTC
81 CATCCTACTOGC1ICGGCGCGTACTGGGAAC:AT1-1G
CICTICTCIAACIGIGAATGIGAGGACAGGAGAAIT TA TICCA.ACIAG AT GCGTG

CGTACTGGGAACAITTGAC
81 GCTACCAAATCAAAITICTC ITGCAACC:AT TGCAI CAA AG IGAGAGGGTAITGCiAGAGGCAAT
rGT TACICTA CTITGCAIGATCITTIGGCi ACCATTGCATCAAATITAC

81 CGAGACTGTAGAATCACAATCACTTATTCiTATGGATG
GACGATGAGAATTTICAAGAACAGGACICCTCTGCA CATGGTITATTGTAAATG A
ATTGTATGGATGACATAGA
S ACATAGAGACT CACTGTA AGCAG GACT
82 GT ICAGCTTC:ATC: ITCGCCIAAAAGAAAGACAAAGTA
TrICTGAACAGGTACiAAGTGCAAAACTATIACATGC GGAAGCTGITAAA.ATTTC AAGAAAGAC.AAAG
TAAAA

81 CiCTCAGCGACACiAGITGCi TGGTAACAGCAACGGICG
GCiCiCC:CfCCAGAICCACCGACGOTTGCGICTGIGC AACAAATaCCCTICCIGC
7 GACCi TT C
AACAGCAACGGTCGGAGG
82 GAAG MG.:10,AG ICTCGATAGACCIGCTGAGGAAG GGAGGCTCGCiGA 1 CCG 1 ATACCAAC:ACIT TAATGTA

TGCTGAGGAAGITGGAAG

TIGCAGGCCTIA CTCITGCAGATAITITATE "fTGGAAGT ATICITTATITG

TGGTGAACATCCTTCCGTTACATTGTACTTGAACATC TGC.IGTA ATAGATGITCCT
O CTACAG . TATCTGAG GO
GTCCTGGTTTGCCTACA.G

CTACTATAATAGATAACG CAGCAATAITTTACCTCTTA

82 CGTCATTCAAAACGTCAAAGTAAGGICITTATTITCAT TAAAGCAATAGAGG1TCC.A

82 GGGC.ATMACATTAGGATTITCCGTCCTCTAGGCATA CTGCAGAGGTAGATGACAGTAGAATAAAAAGCTGA
TAC.AGATTGACAGAGGTG

CCTCTAGGCATAGGAACG

TOKTCCCCTGTGAGGAAC

TGCAACATGAGCACACTT AAAGAAACACCAACCGTC
S CGC G CC GC

ITCCACAIGTITGCGG

TAGTGUGGAAGC.GGAGG

C.GGTCACIGGGGTGICCTGAACCTICAACC:AATIGC ATGATGAIGAACTGGICGC
7 TCGC.0 CTGA CACAGAATGGCGTGGGAC C
82 GITCCAAGGGCTIGCATGC.GACCGCAGCAATTITAAC TGGGGCCCTATAACCCACAAGTGCATAGTGCCAGC
CAAACTGGGTITCITAGC ACCGCAGCAATITCAACTC

82 AGAG1TCATCCACGTGCACCCTCATGCTAGAGAGCCT GC1.
TCACCAAGACCTGCZGAGGGGCACAGGAGCTG ACAACTGGGGAGAAAAC
9 GCG GTTA , GAG
TCATGCTAGAGAGCCTGCG
83 GGGCATGGTGGTAAAGCRICACTCCCCTCCTCTTTTC CCTCTACGGAGTCTCCTCGAGCATGAGGACGACATA
GACCGTTGCGACTTGTAT
O AACC CTCOCA GA
GTCCCCTCCTCTTTTCAACC
83 1 TGGACCCTC.ATCITCCTGTGGTAGGTGCATGCGGGC
1 ACACCAACCGGCGGTACCCCGGCAAGGTCGAAGCAG ' A TITGG1TGATGCTGCTCGT
CGGCAAGGTCGAAGCAG
83 GIGGCCACATCITGAGGGGICTGTCOCAGC.AGCAC
CCTCTCAGTGAGTGGGCG
2 ACGCAGCGGTGICTGCACCGCCGTAGCTGTTGAGC.0 CAAGTC A
GCCGTAGCTGTTGAGCC
83 AGCCAGCAC.CGTGATAAACAGTAACAGCCACTCAGA AAGGGGITaiATdikAGATGTAAGTGACCTAGCC.0 GAGGGGGAAGTGCAGGT AACAGCCACTCAGAEfiii¨.

CCATCAGGGCACGTAGTA

CAGGACCATCACCACAGG ATATGGGAAGTTTCTGGCC
.IGGCCG GTGA G G
83 CGCCTGAGGTGGGGATCAC.ATTAGGTCTCAATGCTG GACGGACGCCCITATGACCGTCTACCACCTGCGTCA
AGAAATGCGACGAACTGT TTAGGTCTCAATGCTGTCG

83 AGC1TC.ACAGAGCACGACGCAAAGCCGGGGGTG7AT CACGGGGTGCGCATGGTATGGGAGTGTTTAGGTAG
AAAGCCGGGGGTGTATAG

GGGACCTGATGTGGAAAT

'TGCAGAAGAACTGAAACCCGCTCCACATGIGTITCC GGICCIG TACCAGCAGITT
GCTCTAAGCACATCCCCTA
9 CCTAC 1 TCCAGA , G C
84 AAGCTGIGGACGCGGAGGGICAACATATTGGGCGG 1 TCGTCGTGAGCGGCTTAGCCCGGCAAGGATGTCAA
O CTG ' CAA
CCTCCCAGACCCTGITGC TCAACATATTGGGCGGCTG

CTCTGAAGGCGCTAACCA

ATAGCCITTGCCICCAGGG

ATAGGGGAGTTIGGGCTG

ICACACCACTTGCT

AGGATACATCGTTATGCGC
3 . GCGCC TGACT TCACIGAGGIGGATGGCC C

MG CAACATCACCCGGGITGA

84 GCCGGCTATTGACCCfCCGACAACAGCTGGCTGATAA CATCTCCTGACCCGACAACAGCGGGGCATFGAGCTA
CGAGICTGCTGTATCGCA CAACAGCTGGLICATAAG

84 .ACCCGCACGGATAAGTCTGGATAGICTICTGTGTAGA
TGCGAGAAGAIGGCACTCTACGTGCAGATCCCATCA ATGCCACCCCAATACCT AC
AGTCITCTGTGTAGACCCG

TTGCCAACTAGACCCTACC
7 T 'MAC G
CCCTCACAGAACGCCICT
84 TGG I ACAG I CACGGCT GAGGTAAACATCAIGCTCG f GCGAGCI
GCTIGGGAGACAGTGAIGA IGITGCCCA AACAICATGCICGTCCAAC
8 CC.AACG ACC.AC CCACAGCCCACATACGAC G

TAAGAGCTIACCITGGGCAATCiACT *FGGTATGGCACCITTITCC
9 . CC TTGG CACTCCACTCGATCTCCCT
C .
85 GCAATG TAGGCTTCAATGITCTCAIGICAAAACAGAA
GIAAGGAAGTOGGCCTIAATGGCTITTRAATIGCT TGAT TTFCAAAAA TM CTG
CITCAAAACAGAAAGACGC
O AGACGCC ATGCCAT1T AC.AG C

85 CCA T GTGCT TITCCAT T ATCCA ITT( CTGCTGAAT TG T RAI niCACT
TCITATAGCTGACGCCTCCRITITGCC GCAGCAAGAAGAAACGG
2 TCACTCT ITCTCTC I TC.TGCT.
GAATTCTICACTCT

AAAGCCITAGATAACCAACAAGATGAGAGTAAGTC TTAAGAGC.ACTAGCA AGA
GTAAAAGCAAACTACAATG

ACCCICITAAGTIGAGAAACACiGAGAAAGCACT CAATAGAGGACAAGGCTC GTA ITAATGA IGGCATGCT

RS IGGTTTITTCiGG I CTITT CACIC ITACACCCITIC1711 TA
ICAGCAGATIGGAAGAAGAAGG1CATCTICTIG AATA A A TT TCAGCi ICTGG
ACACCCITICITTTTCGAAT

TATGCCiCTG ITGAAA A TGGA TCCTCAGITTCCTA

AAC.GTITGGGGTATCGTT
85 ACACCTGTICCACAAGIGIGAGAGGAGGACiCAGCAA

GAGGAGGAGCAGCAAGAA
as TCTTCTAGGATTCCACTGTCATCAAACTTTATCATCTA
CCATAGCGGTTCAGAAAATGGCCAGTTGCTCGCCAA AC. ITT ATCATCTAGAAAGC
8 GAAAGCAAACTC . TTAG TGTTGCCGCTTTAAGTCC
AAACTC

AATGTATGATTATATCTGA AAANITCGTAGIGTACCTG
9 GTACCTGTTG GCAGC ACC. ICC. rre
86 GAAGGTICTGTTGGCACATCTATATAGATAGTAAACA AGGGTAGTATGCTTTAAATTCCCAAAACATATACAC
AGATAGTAAACATAGACA
O TAGACAGCCT AGGCTCCC
GAGAGCAGCTCTAGATGG GCCT

AACAAGCAAMCCATAAGTCTGACATGMCTITTC GGGAAACTATTAGAMAG

CACGTCAGCAAGGCATTA
86 GCACACITCCATICTCCAGITITACTTAC.AGAGCCATCA
AAAACCIACACTATCTCCCCTCX:CGTGCTICTAGAGT AAGTG GAC.TATAATGG GC

TACAGAGCCATCAGGGGA

CCAAAGACATAGATCAGT TCTGTCTAGACTTAGAGCG

4 ' F 'MG CCICTGG TGGGIATIGGiAACCGGAA 6 TATAGGTTCTMGAAGAG AATCAAAGGGAGGAGM
GAMMA AGCCCGA ATACC CA

ATTAATGCCACTTGGACA

G1TGATC.AGCTTGCCGAA
86 GACCAAC.ACCTTAGATTGATCTACATITATTCTGCTAC
AGITTCAGGCAATCAGITTAGAGTGATCTATTAAAG GAGTITACAATACGGATG TTCATTCTGCTACTGATAG
7 TGATAGACTC CAAACTTGTTAGG , AATATGT ACTC

GTGCACCACTGGATATCA ATGGCCTGATTTACTTAAA

86 alGTAGTGGAATITTACAAAGCTGAACATTATCTCA 1 ATCAATGCTATGAATCCAGGCATTGTCCTCAATACC ACCAGGATTATAAGTATA
AACATTATCTCAGACATGT
9 GACATGTAGAAGA ' GGATGG AAGCTAC AGAAGA
87 ACTCAAGCTGAATTGTOCAAATTGATTACTACTGTGA CACTIGTCCATITGTGTCATTCGTGTITGTTTMACA
TGGAACATAGTACTGTGA CATTACTACTGTGAAACTA
O AACTACTCTG GCCG ACA CTCTG
87 ACGTTATCTCCCTCTTAAGTTTCCTC.CTCAATGGGAAT TCCATGGGGCCAAAGAAATAGCTATGAGGCCCATG
GACGCTITGICCAAAATG CCTCAATGGGAATGGAGA

GCAGATGCAACGATTCAA
2 ATCATTG TCCTTC.0 G
TTGCCGCGAGTATCATTG

AGGTAOCAATGGGGAAG GGTACGGGATGTAATGGA

87 CTGCTGAGTTTCCACTTCAGTATAAAAAGGAGATTAT TACAGGTAGAAGGGCGCCATCCCCACTTCCAC.CACT
TAGAAAAACAMGTAGAG AAAAAGGAGATTAMGAA

S CAAGGAA GCACATTCTAGGC G
GAGGACGAGGACAAGGAA

CTC.AC.AATATAGTAATGA ACATTACAAGACGTTAGOC
6 TTAGCCT TCITCAC.AAA AAAG MC; r AATCCCACTMACTGACCCAAATGCCCTCCAGTITC GIGGAAGAAACTAGMT AACATCIGTACCTICCATTC
7 Traivircr rcs , AT TGA1 G C

TACGAAAACGACGTAAAC TTTTTCAGATGTCTCTTTGG

TACATTTACCTGACCCCAA AAGTITGGTTTTCCTGACA
88 TCACCATCCTGAATAACTGTGITTAAGGATCCCCATG TATGGITGATACTGGCTTTGGTGAACTTCACTTTTGT
CCACCTATAGGGGAACAC

AAGGATCCCCATGIACCA

ACGMGGGAACAAATGIT TTAGACATTTATTTAATAG
1 . ITAATAGGGCMG AACCAAT TG CiGCMG

CTCCAGCACCTAAAGAAG CCCCTTAAAAAATACACITT

MCCAAATCCCIGTTTAAAAAACAATGGAAT TITTGTAGCTICAACCGAA
3 'CITITGG 661161.3 IT CGGITC1CA
TGCTITTIGG

ATAAA=AGCAGACATITTAT CAAAAGAGAACMCAATG

TICCATAATATAAGGGGMCGTGGCGTGTICTTGAT ACCGTTGIGTGATTIGTTA GIGTATTAACTGICAAAAG
S CAAAAGCCA GM:MCA ATTAG CCA

CAACCAGAGACAACTGAI AT:MITA] GAGCAATTAAA

MCACACAMAGAAGTCTGC.AGIMAIMGCC GAAGACTIT GMCGACAG GCT TCAATCCAA TGATCGT
7 , ATCGTC TCGTIC I C
.

CATTGAAA ITGGCITICTCTAGG GGTGAATAGI ATCTGC.AAC

88 'ICITCAIGT7 ClITC I CCITGGAAGGI GCTAGCAGAA AACAAGCCAA ITAAAGIGGGCA
1CCFCA AAGTCC:AC ATCTCCTGACITGGAAGC

GGTGCTAGCAGAACITCA
89 CCGTCTCATGCT MCAATGTAGAACITGATGAAATAG TT

O GGGA CTCCCTTCA
GACTGACTCAAGCTGGGT GGA

GITGGAACCACATAAATGGGAAAAGCTTGGCCTAC GGATTCATTATAAAAGGA ATTIGAGGAATGATACal A

AAICAGAAACATGOCCA.A iTGGAAAGGITCTGFAC CAGAIMAGAGCATGATT

89 ITCICIGTGIGACCAITITaTriGriATTACAACACAT I
ACMAGACTGAACAAAAGGAGCTACGCA MCITTCi 1 GATTCAAIGGATAAGGAA GAT
TACAACACATITCCAG

MAGGAAGAAAITG TGAAGG TFCTIMTAITTMTAGAAG

OCAGGAGCATCTGTGAGA
89 IGTG TCCGTAACTICATACICITACIGTMTCAAAC.AA
AGTACCAGCGGAAATGCFCGCMICTIGTIGATITG GAAAMTCITAA.GCA 116 S GATGGC TTGAAGT CTC
TGTTCTCAAACAAGATGGC
89 TCTGAGGATTGGAGTCCGTCATCMGGGC.AGAAGA A TGACTTCGCCCTCAT
AGTGAAACCTATCTACTCCTGC AGTCTTAGGAGTFTCAATT
6 GT . CG CTGA
ATCTCGGGCAGAAGAAGT

TTCGTAGCTAACTTCAGTA AGTGTCX:GOGATTAATGA

89 ATCAATTC.CCATITCAAGCAGACCTAGGGCCAAACCT
GGATGAAGACTACCALAGGCAGGATTTCCITATGACT CAGGACrGTTGGTTTCACI

AGGGCCAAACCTGTACAA

AGATGGACCTGACATCTGTACATICICAATAAATGG GTTAACAACTATCAGGCG
GTTCIAACACGAACCITTA
9 CG11TACG GATACGGAGAG All CG
90 TCAGATACCTCGAAAGACTCCTGATITGCTIGTTGTC ATTGGITTAAGGTOTGITACATGCMGAAACAACAG
CTGGCGAGATAATTTCGT

TTGC.ITGTTGTCAGTGCT

AGGAGGAGICTGAATATCCATATGCGCCTIAAC.AGC GCAGACTCTATATAATTAA AGAGCAGCCAAATATAGG
91 PCT/US2022/076140 90 TGCCTCATCCACAATATACC.AATC.ACTGTGTGCCATIC
CGGGATGATGATGAGAGCGAATTATCTGATATGTC
2 GACAC ACMCAGATC CfGCTGTTAGGCTCACTG
CIGTGIGCGATTCGACAC

CAGAACGAGCAGGACZTA

GTACAGTCTCTGGGAGGT
90 AAGTCTAACTGGCTCCAAAACCTTGCATTITCCTTAAC AAGTGACC.AAGAAGACGAGGAGCTTGTAGCCAGTC
ATITGATGAAAATGGTCA GCATTITCCTIAACAGACA
AGACATCA TAAGCG , AC.CG TCA

CCGAAAAAAGGGAATAAT

AGGACAATCCACCCAAACT

TTTAGATAAGGTGGTGGT ITCTGTTATTTTAGGGGC.A

90 TGGGMCGTGTGATTGTAC.GTGATCTAGTTCAGCTGT
TCCITTATTCAGGATCACAGC:TAGTAATATTATCAGA ATCTATITTCAGCTEITCTIG
92 AGAAACGTGTAGGCCTTTCCTAACT6AG1TCACAACT TAGACGCTATTATGAACAAGTGCCGTAACTAAGGA
CTACTGAAGGTGAGCTTG AACTGAGTTCACAACTAGT
O AGTACA AGCAGGCC A ACA

CAGAGCCACTATAAGTAC GAAGTGGTGTTACTATAG

ATCCTACTGATCTTGACAT TTTAGATAATTCAACAGGG

CCGTGCAAATGATAACACT

TGTGCAGGAACACAGTTTCAGCCTCAATAACTGTGT TAATAATAC-ACATGCAACT
CAMATGTTGCTITTGATG

5 AAT 1 AT TCAGGCC , CGGAGCAGCTTGA ! CAAT
*FTCCIT I AGGCCGCAAAT

GGTTACACACCTGTACAG TTGTTAATAAACGTITATG
6 GTTIATGCTGC ' GCG T CTOC
91 C.AGATGCTCTGATTICTGIGGCGATCATATGAACAAA TCGGAAGAATGATTGGIGGAATCGCATAATCACTG
GATCATATGAACAMTGGA

GAACAATAGCAATGGAGT

TGAGGGGATCAGTTGCAC
9 . CACA TTITGGA cATrrTCCIGGCACC3C3TCA A

GCTGCATTTGAAGATTTAA CAAGMCATAAGAGGAAA
O AAAGAAAGT CiTCICAG GAGT GAAAGT

TGCAGGACCITIMCGCCATTIGAATGC TGGGIGGGCTATATACAG AAGACAATAGCATAAGAA
1 ATAAGAATIGG r CTCITCA C ITTIGT
92 CGITGTATTTTAATACkGCCACTGCCATGGGCTGGCT ITGAGGACACAAGAGTCTGAATGTCCATCAGTCATT

%Via GGCTAACAATC

TCAATAGAGITGAATGCA AATTCTCACIATGAGGAAT
3 CiAAI CFTC AATCCAGG TIT CCT CFTC
92 GATCce &AATICKICTCAAACCCAITGGIGTTIGGATA AA( GGATGGACAGAGACTGATAGTCANICAG ICA V AGGGATIITCATATAGGT TGGTG ITTGGA
TACiGA AG

92 GAG ITT TCAAAGATCCrfGGGTGCA.GATIGTGTA ITG GIG TCTIGAAACGA
TGGAAGTTGGTCATAGOTTIG CAGATTG TWAT TGGA AGC:
5 . GAACiC.A GCGACC CGAGTG43GTATGCAC.A.A.A
A .

ATGICGTGAGGAAGAGTAATTGIAAA TGCAAATCAGAGGATTCG TACIT TGICGAAACACIAG
6 TAGCti AGAGAGCTCTGTA TG CC

ATGITCGAAAGTAAGAGCATGAAGCTCAAGTCAAT CAAGGAACCAACCTGAAT GITTACiAAATGTC17AAGC

92 GTCCGICCCACCAGTATG TGAGT CTIAGGACTI TTCAA TCCAATC.CTCTGATGA I

8 TCCT TCATG GTAC er GCTTCTTCTACCIICTATGG AGCCAACTTCAGTATGGAG
93 ATCCTCAAAGCGCGCCATAGCGAAAACGGTGIATAT
CAACACGGCGACCCIAGNAGTMGCAGTGAAGIGT TAGIATATAAAMAGGGA CGAA.AACG GTGI ATATAAA
O AAAAGATGT TCAG GTAACCG
AGATGT

TTGGAAAAACTAACIAACACTGGGITICAACCiGIT T GGTGTATAGAGACAGIAT

TGCTGCATGCCATAAATGT

TGACCTTCfATGTCCATCIATTTCATCCi AATAITAAGTAIGCATGGA

93 GC:ATCATIGIGGACCTCCTGAACATTTTaf GAACAGG ACGMAGITT

AACATTITGTGAACAGGC.A

ATATCAGATAGTGGCTAT
4 A . CACITGTACC GGC
TGTTCTGAAGTGGAAGCA
93 ACATCTAAATCGTATCCACTGTGACGTGCC.ACAATGT AGATGAAGGGGGAGATTGOAGAGGCTCCTAAMA

GTGCCACAATGTGCAAAC
93 ATGTTCCCITGCTGCAAAGAATGAC.ATAGAC.AGCCAA GGCATACAGACATTAAACCAMAGTTCAATAGCTTT
ATAGACCACTATGAAAAT GACATAGAC.AGCCAAATAC

93 CCATGITCCTGC.ATC.AGICATATGGCAACAAAGACAA
GACAAAACCGCTACCTGTGTAACGTGTTGTACCCTT TTAAAMAGGIGGMAAA GGCAACAAAGACAATTGT

93 GGIGCTGCSAATACGGTGAGGGCGETTATCCGC.TACTC GGGCACCGCAMGACCTACCIGAGICCACAGTGICC
ATGIGCAGTACC.AGTGAC

CGGTATCCGCTACTCAGCT

CTACGCCTATAATACATTT ACAGTTTAAAATGTTTACG
94 ATACCAATACCCATGCATACGCT ATGIGIGCTGCCA 1 ATTGTGGTAATAACGTCCCCTGCGCAATAGTAACA T TAMTGCIT TIGTGTATG
O Ci TC GGGCAAT A CATG
TATGTGTGCTGCCATGIC

TGCAGCCITTAGTATCTGC
94 C.ACAGCCC.AAAATACATAACTGTGTGAGGACGTTAG ATTGGGGAAC.4CTGGGCTAATAAG1TCTAAAGGGG
3 GGACAA GGCA , CGCCACGTCTAATGITTC
TGAGGACGITAGGGACAA

ATCCTGATTATTTACAAAT GGGAITCCATGITTITITG

94 GAGACTGIGTAGAAGCACATATTGITGMGCTGGCA CTGTACCTGGGCAATATGATGCATCATATrCCTCAA
TATTCTiTTACATAAGGCA TGTITCOTGGCATAATCAA
TAATCAAT CATGICIG CAGG I

CGCCCIAGTGAGTAACAA ATTTGIGTTTGTGGTATGG

94 CGGTATAAGGGAAAGTTGTGCITATGGATCiTCAATC GAGACCC.TCCTTACAGCCATGACTGATGIGTCCIGT

GGATGTCAATCCGACCTT

AAATAAGGGGGTTTGTAT

GGCAAGGA
95 CCATCCCACCAGTAAGTAGICTTGTTAGGCGTCTMA TCTTCAATCCTCTGACGATTTTGCCGGCTTGAATCCC
CATGTTCAATATGITAAGC
O TCCIG TTCATG ACIGTA
TIAGGCGICTCCATCCIG

ACIAGTTTITIOTATCGTT

TTIGTTGCCAATTTCAGCA

ACAGAAGACCAGTCGGGATATCCGCATCAATTCGG TGAATAC.AAGICAAAGAG

s 3 ACAGGA ICCACC:GCCTlICGCGCCMACCACC1CAA 1 TCAC , GG 1 GGAAGGCCCGGAA
CGCMCACCACCICAA
95 GGAGCCCACAACAATAGTGGGCTC.GGGGACAGCTAT 1 GAGTTCCTCTAGGGATGGCCGATGCCGCCTGTAGC

TCOGGGACAGCTATGCG

TGCTGUACTCTCTACAGAC

IGGAGTTGCTACGCGGC

CCGAGAGGGAACTGTGTG
7 . I GIGGA A CTGTGGCO TGACCGIG GA

GGGGIGGAAAGCAAGGT AAGGCCAAAATACCTGGG

GIGTAATIGGCCAACAGT
9 AAR; AACCA GC
96 GGCGGCGTTACTGACIGAATCACCCCACIGTC.AAGCC TGCTTCGC.IGGTTCCGACITGTGGCGCCAGAAAGG
O MG GC A TCATGGGGITGGACCG I
CCCCACTGICAAGCCTITG
96 AGC¨ATCIGGCAGTGCCGAGAACTGGGGITCCTAGCA CGTGCTGGIOGMATGCACTGCCAACACTAGCATG
GCACAGGAGICTIGGITC
1 Ci1Cf 6 ACT G "f GGGGITCCIAGCAGTGTG

CTGGIGCGTGTTCTACCAA

TGAGCACGCTGGACAI
3 . C:FGGCTICACAGCGACCIGGCAGCGGCAGAGCGGTT CTCFGA 6 CAGCGGCAGAGCGGTT .

CCATIGCCAAIGGGGGCTACCITMCACGACCCAAA CCATGGGCACTTGIGTGA TCATGGCACTAACGGGAG
4 AGG a-A: AT G
96 GACOCGATTGCCACGAIGCAA tClIGICITGACGAGG CTCCC 1 AGGGAGGCTCAGACCAAMATAGCCGGIT
S GCC TTACCCG
GGTCGTCTGGCTCTCCGA ICITGIGTGACGAGGGCC
96 CA 1 GCCCAGAAC:ACTAACGCCGOGCCATGACGTGGT GIGCMG61 TGCTGGCCAACiGG1CGCACTIGGCA AA A TCCCAGAAAGTACCTCC GISCCA. IGACGT
GGTGATCS

CTGGC1TGTGTCGGCACITGACCACATCACCCiCiC1GG TTATGGACACAGGTTOC.0 TTGCAAGATACGCCACTGG

GGCAGGTACCGIGCGCMITACCACGTGACACCAG

TGAGGATGC.AACGCAGGG

9 Al GGT GGTGTCCAGCGGCAGA
TGGGCCCCCAGCTGAT
97 TGTGAAGCCGGTGGCGC1AAGCGTAGCTACC:AGMA
O
TCACAGGGGTAGCGCTCC.CCGGCGAGGAGTCCCAA TGGG CAACCCMT61TCITACCAG
OGGCGAGGAGTCCCAA
97 CCACCCCGAGAGGIGGATIOCITITC:ACCAAIGGTCG CGGCCIC1ITCCTGCTGMAAGGAGCAGAGCAGCC
1 CTCG G CCAGAOCiTGGGAGGCCAT
MCACCAATGGICGCTGG

ITGACCTCCTITCTGGCAA
2 CAAG . GTCG TAACGCGGCCTCTCTCAC G
97 AGAGGGATITTGAGGCGCGGGCCTTGOTCGCTAC.CT GATMIGGTGCAGCAMGGATGGATCCACAAGTACA
GTACTGTGGCACCTCCIT

GCCTTGGTC.GCTACCTGT

CCAGCTTGGTTGCGCAG
97 CATCCTCCAGAGCCTGCAAGGTTGAGGCTATAGGGT CTGCTGTGICCCTACTGCCCTCTC.GGCAAGAACAGT
ITGAGGCTATAGGGITTGC
S CTGCA CGT GCTGAGCTTCAGTGTGCG
A

ACICTGGIOTGCAGGAGAGAAGACC.CTC.CCACAACC TATGACC.C.AGAAGACCAC ATCCGTGATGAGGTGAGG
6 AGGG TAAGGG Cl G

CCAAACCAGCGTGTAAAA

97 GCC.GAGCAACACTCAGCTGTAITTAAGGCTGCTITGG CTTGGCTGAATG
TGTTGCTGGGGGCITCCGCATATC CATO ATGAGAACGACGG TTAAGGCTGCTTTGGCTGA
8 CfGAC CGTAC CA C

GAGITGGIGATGTGCCAGGFC.CGCGAGGCAAGAGC GCTCfCGGGACCATCTTG
ACCCAGTGGTGAGATACO
98 IGCCACAGAAATTGCAAGGTATTATACCITTAAGTAI ATTIGGAA
GCTTGTGAATTCGACTAGCACGCAAACA CACCAACAGAAACTGACA "FACCETTAAGF ATTAGAGA

98 ATTC.CMCCAGGCGTTCCTEAGAAAAGTTAGATTGC AGGCAGTGTAAGCATMTATCATGAGCTCCAGAAT
TATTGATATCAGGIGICAA TAGAAAAGTTAGATTGCTG
1 TGTGG AATATCTTGCA , AMC TGG
98 TGCTCCACITC.AGTATCATCTAACAACAATFATTTGAG
GGCAATTCCCTGCSAACTATTICAAGTITTTGCA ATM; TGTACITGATGTAGAAAAT
ACAATTATTTGAGAGAGAT
2 AGAGATACAGAC C. FCCTC GATFTGG ACAGAC
98 GCAGTGTAATGTATATCTACGTCCCGTGGAGGTACC TGCGTICTAGCAAC.AAAAAAGCAACCTACTCCAAAC
GCTGAAGATGTTACTCCT

GTGGAGGTACCGGCTATT

GIGGGATAGGTGCAATGTCATITATGAACTGICCCG GTTGTCTCTGTATATGCAG CA TGATGATCTATTTGA AA

AGGCTTGCTCCCGAAGACCAAATITGGCCTGGTT GCCCAACAAACTATATTA
GGCAAGTACITITGATTIT
GATTTTTCAG MT GGTC TCAG

6 TGGTCCTC CAATAGAGCTAT ACTCA at TTTGAATTFCCAAATCCIT TATGAAAGCAGATAATACA
7 TAATACACCTC GCTGAG TTCC CtTC

ACTGTGTITGCTCCTGTCA
8 GTCGCGGGGGTGGTGTITGCAGCTCCACGCCF CCA GM' C
CAGCTCCACGCCFCCA
98 CTCCCCTTCTGCAGGTGGTTGTCGGGCCCTTACAAGC GTGGAAGGTC.ACCCAGGCGGTGTGGTGCAGGAGG

TCGGGCCCTTAC-AAGCA
99 AGCTTCTTCC.AGTAATCTTCCAAGGAGGGICACTICG 1 GCCACCCC.CCAGTAATCATITTTAGCTCTGTFGCGG

GAGGGICACTICGATCAG
t ATGCTGCMIGCTACTAACATACTCGCCATAGCAA CATTAGCTTITCTUXIATA AGGAGAGATTTTGATGAA
1 CAAGCCi 1 ATACA AAA F GT , CfCAA GCG

CCACTAGCACAGGTTCCAGTGCTGACCCGTGTTGGA AGTTAACCCCGTGGAACC TCTAACTGAGTCCACAGGC
2 AGGCG ' GGAAT T G

CAATCCATTGTITCTAACT TTCGTTTTGCATTTGATAAT

GGITTGAAACTACAAGAA CATACTATAC.ACAAGACAC

99 GGATC.AAAAGATGTATCCTGTCTGTAAAGATACAGA CCAAATGMATTGTAGGATGC.ACATCATTTTCTGCA
AAAGATACAGAAAACAGT
5 . AAACAGTAATGC CATGGAAC TACGGAGTACIGGICACC AATGC

TCCAGIGATGCTCAATTGITTAACCCATFATFATGA CAAAAATCAGTMACATT
6 TITACA. TICCAGG CX:FTGGGCT GATGACAFTCCACGTGCA.
CCAGG

CAAACGTAAAAATTGAGGTCTGACCITCCTIGAATA MAGATCAATATTCCTTA
7 CAACA CfCiTGIAAAAGT GCiC.AGA
CCAAGCTGCiGTTACAACA

TCGAACAATCAAGCAGTGCCACTATCTCTGTTATGT GGCTOGTTICTATAACTA GACAATATGTTT
ACTCATG
8 ACTCATGCAGA TCCT17C rrGc CAGA

ATCGTGCGGICTIGGTAAATGTITGITGTTCAAAGT CTTICTCCAAGGACAACTC
9 AC:AACTCA 1 GGITCC GGICAAAACCGCAATGTC A

FTCTGAG

ATAGCATGGICCAGCTrA
10 ATACCGGGGGTAAC:AGGAGCCACATIGGTC:CACTGT 1 CCAGA AGITAGATG TOTE
GC.AGAACAT !TA 1ATAT AT TCAT TAGAGAAGGGAA
01 . CA 1 AGCACGGGICTA AATTGTC
CACATTGGICCACTGTCA .
10 AGCGCGAATCITICHCiAlTGTGGIGGGCCITTGACA CIGGTIATGAGACITICAGGOTCGTGACTIGCCFAT
TAATAACGAGAGAGGOG

GGTGGGCCTITGACAATG
10 GGTGITTIATCGATIGITCCGGIATCiCITCGAGCACT AAGAACTGACICCCGGAACCAGITCGT1IGTI6C11 GCITCGAGC:ACTAATAAAA

10 GICX:ACCITICACCATC:ATCTGAGAGATAAAGAAGA (IAA
ITCAAAATCiCACAAGCSIGTGGF TCAGTMCIT CCAAAAGAAACAAACCCC AGAGATAAACiAAGAGCGT
04 GCGTCTAC GICATCACITTG AAC C.TAC
10 TGGGATTGACTTTGITTTTGTCTGATGATTCAATCiCIC
TGAGATTCCAATTAAGCAGACCATCTCATAATC.CTCF GTGATGCAGATGTCAAAG TGATTCAATGGCAAAGAAA
OS AAAGAAAACC GCTGTGTC GA ACC
10 CCGAAAGTGG TGCATAGT 'ICI TAAACCTITAAGT n TCTCTCCTATCATFGTF
GGCA.ACITATAGTAACCGGI CACiGTGACi TACAAGT TC.A AAACCFT1'AAGIT11 GGCG
06 TGGCXiC. GCTGGIT AAC C
10 CTACAGCAA 1 TICTCCAA TGACAACAAT ;GATT errea ACMGTCGATITGAAAATCiAGAGGTGGIAGAAC I AAACGM GGACT1TA TAG AAT TG ATTU TOTTA
ma TCAGTACC1T I GGAGCAAGTAC:CTTAT AA

CTIGTGGAGC.AGGTGTAC

TGAAACGGAGGATCATAACTCFGCGCFGAGGAGAG 1 AGATGA ACT AT Tr GAAG ACTGATGGA
TCTGATG ref AGC.ATTGCCTACAGATATGCCAATGCFTCAGAAACG TTACCAG ATTGGA TAA AA
10 CTGCTA . CAGC AGACAGA
GTCATGAGTCAGCTGCTA
10 CACCCAGGATAGGTTACATC.ATCTAACAGATGTAGTA CGAGGCGCATTAGATGGTA
ATCTTC.AATTGA GTAG CATGGCAAACITTATITCTA ACAGATGTAGTACATTTTG
11 CATTTTGGTT GTGCTCTA TGATGA Gil TACTAATAAAACCTGGACA

AGCACCTCACAAGACACC CTCCOGATCTCTICAAGGG

TCTCCAGAAAATGGGGTTGCAATTTAATAGTGTCA T ATTGTAAC.ACTCAC:TGAAA
ACCAGATACAATAGTGACA
14 AGTGAC.AACT GTGCAAACA CAG ACT

TCCTTTTGCAACAGTGGAA TAACATTTGGTGAGGAAAT
AGGAAATACCA AAACGTCTA A ACCA

IGCGTITCAGTACACTAGGATGTATTACCTCTAATAC TTICAGATGTCfTGGACAAAGTCAGCACCITTGCAA
TCTAGTAATTCTGATATTC TACCTCTAATACCTGGAAG
16 CFGGAAGATCC CAGGCTT Cf TCAG ATCC
10 TO IT IGGATC1GGAAATTGTAGTCTTTACC.GATGAGT

TFACCGATGAGICTAAAGT
17 CTAAAGTAGC AC.FAATC.GC TGGATG AGC

"TAATAAGTAIGTGGATGCT
18 TGAGA GTGGT TAGGAGCAAC.AGGCCATC GAGA
10 AGGCC.AAATACTGATTICATTCGTATGGAAATICAAA AAATGGGCAAGGATATTTATGGTGATCTTGCATAAA
TGGTGATATGTGTGAAAT TGGAAATGCAAACTITGAG
19 CTTTGAGC GCTGTTCCTT , AGGATT C

CTCC ATTTCGCGTATT TACAC
AGTGGCTC.ACTIGTCTCC

AACATGCCCTCAAACTITGGTTITTGTGCTACGTTTG CCATACGAACAGTATACA
TAAACTTGAGTGAAAAGTT
21 AAAAGTTAACTGG ' CAGT 11TTGG AACTGG
10 GGGAATGIGGTGCTTATGGCAGATGTC.AATCCIIACT GGAGATCCTCCATACAGCCATTGIGTTCTGTTGACT
CATGTCAATCCGACTCTAC

ACAATGGAGGCCTIGGGTT TAATGAGCCAAGTGGATA

ACAAACAGACTGTGTCCTG

ATC.AGGAAGGCTAATAGA AGGATGTGATGGAATCAA

10 ATTACCCCC.AACCGGAAGTCCGTTGAAACTTTAGCTA CCAAACTGGCAAATGTTGTGAGTGAAAGAt=AGCTC
CAAATTAGAGGGTTCGTG CGTFGAAACTTTAGCTAGA

10 TCTCTTACTCTCGAACATGTATCCTCATCGCACCAATA TGAAGCTCCGAACACAAATACCCATTGAAATACTfC
TGGTTCAGAAACATCCTG CATCGCACCAATAATGTTC

ACATGCTAAGTACGGTTTT AGTCTCGGTACTGAATCTT

CCC.AGATGGCTCTCCAATTGTTCCTCTATGGCACCTA TGGAGTGICIGGAATAAA
28 AATAAACG 1 TATGT CATGGAGOTCCCAGT11 a;
i 29 C:AGGACT A 1 AAGTCTFC , AC A

CCCTAGTAGTTCATATAGGAGACCGATTCTGGCATC GAACCGCTCTATTCTAAAC
TAGCCAAAGGGGAATTCTT
CTFG ' AATCCGG AC G

AGTTGCCAAC.AATTATTAG

AGACGTGACAACCGACTA

CCTGATATTGTITTAGAAG

10 ATGCACCGCTGAAAGTCTCGAAGT1TTATGCCACCCC AACACCAGAAAGACAATCCAAAAGGT1TTC.AGCT1C
GACCTGGTTACAGTAAAA
33 , G ATCITCAATCC CGA
AAGITTFATGCCACCCCG

ACTGGTGATTCAGTATACT
34 GCCACT G AAG rr AAAC TT CAACCT
GATGCCACIG
10 GCAATTTAGCATCACCCCTG TATCGTFGTAGCTCTGC CCM TAAAAA ICC!
GAACAAAGGCAATGTCCCATA CAAAAATATE.CAGATCPCT
ATGGA GAGTACGTAG ATGAGT GITGTACCICTGCA FGGA
10 TCCAATTCCCAAATTGCCAAAGTATAATTTGAAGGCA CAGGAAGAGG.AACTGGGGGTTGAGTIGGCCTTCCA
AATFTGAAGGCAACACAAT

AAGGC
10 AAAC:OTICCAAAGGIATC1CTTCAT MT TAATG TAT IA TCAAACCT iTGACAT
11TAGAGCCAACAAACCTAI TA TITGAAACACAGAITAATC TG 1TAATCi TAT TAG 1TGAC

TACTCTTATTGTAC AGATTITTIGAGICAACCI 'WI TCAGTT !GAM ATACA
39 , AGAAATCCC CTTCTGAG TCC AATCCC
.

TG CCAG CC.ACCATCT ACC. AAGTACTCCFGTTGCC.AG
10 CATGACC:AGTAGATCCAACACCTAITGGAAACTAG I A
GITGGAGACAC:AGAAA.ATCCTAGCFGTTIGTTIAGG CTCAGATACAGAACGTIT TIGGAAAC TACT
AGGIATA
41 GGTATAGAAGT ATC.TATAGACAAGT GGT GAAGT
10 AGGAAAGITIGCATTACCAAA ICCIAAGTG TCCICCA AACTACAACAGCiACAGAGCACiG
ICAGOCCATAAGC GACAAAAACAGCTTAAAT AAGTG TCCTCCAATACAAC

10 CGTTACAAAGCAGATTAAC.AGGCAAAGGTTGATATT AGGATTICITCAC-AAGAGGCGGCAACTGAAACAATA ACTC.AGGATGCATGGTAA AGGTTGAT ATTAAGCiACAT

GGACTAGGAAATGAAGCTGAAGATMCATIGCTAT 1AT1TTAGCCCIGCGAACi AGATG TAGATTTGAG
MCC;

CGTCGACGCCTAGAAGACGACG1TGlICICK:CG11 GACGAAGACCCAGATCATC
as TCC GGGC AACCGAAATCGCACCACA C

GCAAACTAAAATACAGGCGCMGCCITC:AGCAACC TCTCTCAAGACTTGGACG

CTACAAGAAGAGGCTCGG

PGGTGCCAATAGACGCCCGCACTACIATITCAG

GGAGGTGGTGGAAGAGTC
10 TATAGGGGC.TGCTGGITGC.ACCCGCTGITTCTAC.AAC
CCTAGACGGGTAGCGACTAGCTGTAGTGGCCACA A AlTATAGATGTAACTGGTC
48 T . ATGT CTAGA
CCCGCTGTTTCTACAACT
10 GCTCTACTAAAAGCTCTIGGTGTAGGAATTTAATATT TGTACAATAGAAGAGTCAGAC.AGGTAATITA
ACTG GGAC.ACTTITAATGAGCC AGGAATTTAATN/TGAAGA

10 C.A7GTGAGTTACTGAAATCCTCTGTGAAAACAGTG AT
TCGTCCTTAATACTGGCTCfCCACCTITAATGCTGCT CAGAAAGTACATTTGTAG
SO GCACCTT CCAC ATGCA
GAAAACAGTGATGCACCTT
10 GGCAGC.CAATATGACATCTGTAAAGTATTACATCCCA TGCTTCFGGTAAGGIGTATMCCAATC.ATCTGTACT
GTAGGACCAAGTATTTTA GTATTACATCCCAGTCATA

10 ACCCTATTGCAACACGTTTACCAGACGGTITG1TTAT ACTICTAAGTCTfC.CAAACGCAAAGCAAGTAGAACA

52 CAAATTGG AAGTTCAC.AT TTAGGC TTGG

AATAAAGGICTGATGAAT TTITCCAATCTTCTCATCTG

GCGACAAAAGGTACAAGGTAGTCTACTACTCCTGTA TTCAGGATCITGCATCGITTGAACCATAACAAAATG
ATGGACTTGCAGCAATCA CTACTCCTGTAACTATTACC
54 ACTA T1ACCCiA GICCMCCA C GA

ACCiAAGICTIGTCACCAGA
10 CCACIGTTCTCAAACAATCGICTITCiCAGACATAGAll CAGTGCAGAAAAIGAAGMAAGCATTGITATTITC
GCAGACATAGAMGAGTC

10 TGTAGCATTTCCAGTTGTACACTTATATTCAAATTATG GCATAATGCC.AGAGTGGATAACAATGACTCACTAG
TGITCAATACTAACTGICA TATTCAAATTATGIOTGAC

AATACTGGAAAATC.CTACT GTATTCACTAATACGTMC

10 GCTTCTTGCACAGCATCGAACTC.ACCAGAAGAAGAG
AAATGCTGCAAAAGACATTGATACAAC.ATAATAAC.A CICACCAGAAGAAGAGCA

TGTATTATATAGATGACTA TTGATGAAAAAGCACAAAC

10 CAA TTGICITTGTGCAGTGTOTTAATGGGTAGATTA ..
GACACTGTCACACTACCAMAGGAATTAACCCMT ATTTGAACRZFEATCTAC ATGGGTAGATTATGATAGT

TGGTACGACCTACAATAC

TGGATGCMGGGACCTAC

1TAACATCTAATACACGCA ATC.ACAATGCCAACTATTC
64 ACTATTCCT A.ATATTAGAAGT CTC CT

CTGTTGGTCATCCATATTT

ACAAGGITGTGGTGCCTA
10 TCCCCATAGGATTCaTTGICATTTGIGTACCTTTGGA 1 GIGTACCITTGGACATAGT
66 CA TAGI GG reic:r CAGG OG

TGGAAGAAATAGAGMG
67 CTGATG I ACAGG ACC!
*FTCCICIGGAACCTGATCi TGITTGACTCATGGGGTCAGTCGATGGACAGTCTAA TACCTCAGTGTTACATGTT

ACATGITITGAGCCICTOC

AAGTACAGACAAACTTTG

ACAGTCITTGGCTGCAAC

CCATCTCGGGGGTCTCTAG

71 CAC:GCAATGGACCCCiAGGGATC.ICICTGGGCCACACA AGGITC CCCIGGCCIGGIGTIT ..
ATGCTCTGGGGCACACA

73 T AC4.3 GAGCi IGOIGCGGGACA
GCCACGTACGCOCIGT

GACTCGACAAACTCGCTG

TG111.3GTGTTGACGAC6G
10 GCGAGTAGTTGGTGATGCGGCGCCCGGCCTGACGA ..
GTAAGTGACGTCGTTGCG

10 CAC:OCC TACCTGAAGGTGACCCCGTAAGCACGGGGA GAGCTIGGCCATCi TACGCGCAACCUCCOCCiGCIGG AACICCTIGACCGACACG

CGTAAGCACGGGGAGGGT
77 AGACGGCGTCCCTGGAGTCTC.TGGCGGACOGAACA GTCGGCGAGCCCACACACAACCACACCC.CCCAGTAC
CGTGAGCACCTIT.TCGC TaGGCGGACCiGAACA

78 CCCA AGAACTGCCCC.COAAGOCACGAGCCACGCCAGAAC
CGITCTGCTGCACCGG ITCTCCCTG1TCAACC.CCA
10 .1 CG ICGTCGAACGGT1 TI ACG ICAGTIGCCACTGGG I ..
GTICGTCCTGCCCCACTGGACAGGGCCGAGAGAAC
79 C.CT C GITCGC.TGCCT1TTCCTCC
CAGTTGCCACTGGGTCCT

80 A GICiGTCCGTCTCGCTAAC.GGACICCGTCAAGCGCGT
CCGGGAAACGTCTOCG TGCITITGGCCATCTGCA

TACGTTACAGTGGa:CAG

TCGAACACCAGCTGCATG

GTGGICACGTC.CCCGA

ACACTCGTGTCCGCAGAG
10 GGACG AAACCAGGTGGGCCAGCAGAT caccTrccr .. TGCACIT TGCGTG I CTGGIGTGA T
ACGTCCCGCIT:A
84 6116 GGA CGTCTGCTCTTCiGTEGCT
GCAGATCACCGTCCTGTTG

as CTCGOCGAGCTATGGGCCTCGITGMCGGCACCAGC ATC CGTCGTCGGTGATGAGGA
CGTTGMCGGCAGCAGC

10 GA ATCIGGAGCTCGGGTCC.ACAACCITCCGTCCCCIG
87 GCCCGTTAACCCCCCACGTGC.CCGGGGCTITTCGT C TGAGGCGTCAGAAAGTGC
GCCCGGGGCTTTTCGT
10 ATCGCCTTGIGICITGTGC.IGGCCCAAGATTCGGCGC

ACACAGGCGGGAC.ACC
10 TCAACCCCGCCCTACACTACACC.AGACCCCCCGAAGC GGCGAACACGGGGCTGCATTCCCCCTCGCACATCCT
GTTGGACGTCACCGTATC

CAGACCCCCCGAAGCT

TCGCTGCTTCCTCGAGT

GCTGTTCGGIGGITGGG 'FCGGGGG1TTCCTCMG
10 C1TCCTAACOCAGACCC.CGGGGGGCGCGTCAGATAC GGCCACCTGACACAGAGGCGAGCGGCTCAAGATCT
GGGCGCGTCAGATACAGA

94 CCCACCCCCGAACCATGAACCCGTGGC.CGAGATCGT TGTGGGTGTGTAGGCGATGCTACGCGCGCCAAACC C
CCGTGGCCGAGATCGT
10 TC.AGAACGGGCCGGICGTCGGCCGATTCCITCATGC AAGTCTGCGGGGGAGCGGTGAGGCCGCSGTTGGT

GGCCGATTCCTTC.ATGCA
10 ATGGTCGCC.GTCATTATGGCCGCG/VaCiTGTMGG
ATAACCTCACCGAAACCG

TCCOCCAACACTGACGT

97 GC GG GAGCC.AGGGCCAAGGT
GTCTCCTAGTTGGCCCGC

CGGGGTTTCTGGGGCT

99 GCGCCTf3ATOGTGGAGAOGGCTGTACGTCGCTGGOG TTA CCGGGGGGCGCTTAAA Cf GTACGTCGCTGGCG

TGGAGCTGGCCCAGGA

GTCACGACGTACGAGACC

GlICACGAACGOCGCG

TGCATCGGCAACAACAAA

11 CTCGTGCGCTITCTGGAGCTAGCTC.CGGAAACTTGGT ACAGGGTGTTGCAATACGACCCATGCAAACAGCCT
AGCTCC.GGANICTTGGTAC

GGIGGGCGGCAGCATT

Os TAAGCfCCATCGCCIGGCGGACCCACGCCCACATCC 1 AC TGTCGGTGT7CCCCCAT
ACCCACGCCCACATCC

GGCGTTGTAGTGTGCCC

GC.AGACCGCGCCGM

GCAAGCAGCCCATAAACG
08 TCGACCGCCTGGCCAAACGCGAATCGCGGCCAGCA flG C
IXAATCGCGGCCAGCA

CGTGITGATGGCAGGGGT ACGAAGCCATACGCGC
11 GGAGGGGGAAGGAACGAAACACICTfC1GCGTGCCC CCCCGCGICAGACAAACCCTGAGTCTTCGGACCICG

TCITCTGCGTGMCGT

GACGICTGGGAACACAGG
11 ACAGGC: 6 TGCCCATTTGACGCTCIG C

ACCATCACGGACITICCCC
12 crcr TGGC CGCAC.TATC:CAGGACCGC T

CTICITGGCCTTGTGITCC
13 GAGGCCAACCIAGCMIAGGCTGMCfCCAIGGCAGA CC C
fGCGCTCCATGGCAGA

CGGCTCAGCTGGTGGGAGTCACCTTCGG teGGGGC GIGTACACCICCAGGGGG CAAACICGTGA
SCCTCCAG

IS ACCCCAACGCCATCCiCCTCACGTCGCCGAGCATCC TCAGCTTGCGGGCCTCGTTOCC.ATCGC.GTGGTGC
CACGTGGAGACGGCCATC ACGTCGCX.C.ACiCATCC

CIGAGAAGGGGCTGG TAC
16 ATI (3 CGATCAGAAAGCCM:C.ATT

'IGGATATGGCGTCGGAAG

11 GARIGCCGMTGCAACTMAAMXiCCi IGGAC:ATCC
18 TTCCC.CAACGGCAAGCMGCCGTCCAGAACCACT6 I GCTGGITGCGTTGGAGG
GCMTCCAGAACZACTG

CGACCGTCAGCGITTTG6 CCGACAGAAACC.CG1TGT

AGTACGTGGACCAGGCGGTGCCTCGCGGITGGTGA GCC.ICGCGGTTGGTG A
11 ACAAGGIAAMATAGGC(iGGGCRIAACAGCT(iCAAC CCGAGCCTCCCAG(i TGCAGAACCGAGGGCITCAAG GACACGGCTAAAATCCGG

TGAACAGCTGCAACGGG

22 CACCATCAAGGTCMCC.CCGTTCGGACGGACACOCG GCG GCCAGAAMTCGATGCC
TTCGGACGGACACMG
11 CGGTAAAACAGAGCGGGC.iCGITCGAGGCGGAGGIG CGTCIACGICCOAAGCGGGATGGCCGGGCAGAAGI

TTCGAGGCGGAGGIGG
11 CGCGAGCGGATCTGCTTTCGAGAGCCTCCTCAGCATC GATTCCCCAGAGCAGCCCCaTTGATGGCCIGCCTG
24 C Cr CAAGGCTCACGTGCGAG
AGAGCCTCCTCAGCATCC

ACTC.C.ATCTITGTGC:TGTG
GC cr TCATCTACGGGGACACGG C
11 GGTCCGGGIAAAACAACAGCCGAGLeifemCGTC
AGCATCCGGTMATGAGC
26 CCACTTACGGGGGCCAC_ATGTAGTGCAGGTGGGCGG CACACG C
TAGTGCAGGTGGGCGG

GGGCGGCACACCTATCA

CMGGCGACCTGGACA

TAATGTCGCGGATGCTGC

CCCTACIGGGGCCAATGGT
11 TC.GTGGICACC.GGIGCTCGGCATGCACGATACCGAC GATOCCCCCCGCGTTCCATGCAGGGCACATATGATC
AGAAAGGACAGCGACGA

GCATGCACGATACCGACC
11 GGTAGGC:CGCGCTACACGICCTACGITCTGGCCCIG

GGAGATAGCCCAGC.CCA
I
11 CGCTICITGGCCCTGGTGAGTICTATGCGC:IGGAGGT
GAGGCCCTCTTGC.ACGAACGGAACCTIACC.ACCCCG
33 GC GC , AGTGAGGGTCGCGTCG , TCTATGCGCTGGAGGTGC , 11 ACCCCACACICCAAACGCGGTGTGTATACGGACGCGC CCCAAATGGCCCTTTA &AC

TGIGTATACGGACGCGCX

35 T ' AGC AGGAACACACCCCCGTG
ACTCAGGACATCGGTGTGT

36 TAGCCCGATGCCCCCGTTGACAAGGCGACCCTGCG ... T CCCCAGGCCACCACAA
ACAAGGCGACCCTGCG

TGTTCAAAGACGCGGTGA

11 ATCGCCGAC.AGGTITCTGGAGTCCTIGTAGAACGCG ATCAGGGGCCGTGATATGCCGAGGACATCCGCGAC
CCAGAATTTGGCCAGGAC

TAGAACGCGGGI

AGAGGIGGGICTGGAGTC

CGCTTCTGGTTATGGGCG
40 C.CAT AGAG A
CCGGGGCATCCTTATCCAT

41 TGGGCGTGGCACTATCGGCTGACGAGGCMCAGCT GIG GAACCC.GACGTTCAGT
TGACGAGGCCGCAGCT
11 TTCCGGAA1TTATACCCGGGCCGGTCiTGTGATGATI7 1 GGGGACACGGGC:1ACCCTCATGTGCG11CGATGCG GGTGTGTGATGATITCGC.0 TCTGCGT
43 A 1 GIG , AGCCCCAC:GCGGIGAT
GCIAGGGTCAGCCGTTCA

44 TGGGCACGTACACCCCCCTCGTACAGGGGCTGGGT ' G ACTIGGCGGGGG TGGT
CGTACAGOGGCTGGGT

45 1: GA GCCATCGCCACGTCCT
CGCCGTCTAAGTGGAGCT

GAAAACCCCCAAACGCGT

GGACCGGACGGACCTT

47 . A ACTCGITGGCGCGCTGAATCACCACCATCCGCGIG
CTCGAGGICGCTCCIGT CGCAGAACGCCCICGA

TGCGCGGACAATTAGGC

49 AGAACGAC1GGCGCGCCACT a: rem GGGCCGCCA. A
CAGGCGCCGCATCTTG :FCC TGATGGGCCGCCA

ACGAIGCGGGGGGTGGCCTCCACAAAATOGGG

CGACAACIATCGGACTGCG

AGCCTGICGTGTCIGCG ITAAGCACGCICCGGGC

52 ACCACACGAGCACGAGGCCTCCCTGCAGC.ACCTCIC CCGTCTTCGGTGCCAGTCCTGTTGGTGCCGGTGGG
CATGCTCGCCS7COGT TC.CCTGCAGCACCTC.IC
11 TGIF TCIGCGTCCi IGAGTC:CCGCCTGCGTAGT TCRACi AGACCCIATGGTACACAC
53 , ATCGGCGTTGGTGGAGGGC.GTGCGTCTGGTGGTC.GT AGG GG
GTGCGTCTGGTGGTCGT .
11 GGACAGCAGCGGGGAC.TTGTICCICTCCGTGGGGG T
TAIGCTAATTGACCFCGGC
54 ATICCAGGTCGTCGCGGCGTGGACCICTCC.GAC.AGC CT C
TGGACCICICCGACAGC

SS TTTCGGCCTGCCAGG TGGCCIGGCr.CCGGACATA A
CGGCCATGCACACCAGC.AGGCGCGGAC:CAGGTAA CGACCC:CCCTCACCAA CGGGCCCCGGACATAA

GACCGTAGGACI GC
56 TM. ACCGAACAGCCOTCCGCGCGCCCGACITITTGC TG CC:
CGCMCCCGACITTITGC
11 TCTGGACACCCCC.ACGGACCA TTGGCACCCIACAACA
ATAACGACAAACGGCCCCTCGTTGCTGATCCCCCGC

ATTGGCACCiGACAACAGG
11 TGGGGAGIAGGGCCCGTOCATGGAIGCGCCCCAAA TCCAAOCCAGCC.AAGTIAACGGCAAAATCCGCCGG

GGATGOGCCCCAAAGC

ICGTICAACAAAGATTGGGGAGAAGCA
59 C GGTGTC 1TC7CCC.CCCCCCCTT
CTICACa:CCAGTACCC7C

ITCGACCGAGTCTGGGGA
60 AAACCGC.CCCCC.AAGCCTAGGATGAAGCC.CCCCG GTC: C
AGGATGAAGCCCCCCCI

MICA
61 GAGG TTCTTGCGGACCACGGCCCGCGTGTATGGGC.ATGCC
GGGGGCTAAAGGGIGGI GG
11 TCCTCCGC.AAACAGGCCCGA TCGTGCGCACTAGGTC
62 CAGCCCCTTGGAGAGCACCCGGTGCAGCAGTCGGA . C GGGGCTGGGITGGTCT
CGGTGCAGCAGTCGGA

TGGCCGGACGAACGAC

GGGGAGGGGAGGGTGAT

TACGGGGGGGTAGGTCA
11 CCCCGGAGACCCCCAAACCTTGACilCAGGCGCTCG

GCCGTCCCGGGTGITT

CCCCCCGGTATACGACGA

TCGGAGGGGTGTGICTIF

TCTGGTCCTCCCCAAGTAC

GC.T.GGTCTGGTGATCTTC
69 TCGC.ACGGGCOCCTITTGGACTGCCGTCCACAACGC AMAC.TC Ci ACTGCCGTCCACAACGC

70 C T CCAACTCCAGACC.ACCGG
TCGTCCCTC.GCATGAAGC

71 CGCGCATGCTTCATGGGTCCCGGGGCGGTCATTGGA TG , f. r i i r e GGGGTGTGGCGG
, CGGGGCGGTCATTGGA , 11 GGGTGEICGC3CAAGAAC.AGCGCGCAGTCTGEICATCT 1 GGGACCTGCGGCCAACACACTGGGGTGAGGGGAC

GCGCAGTCIGGCATCTG
11 GIGTTGTTGGGTGOCCTa:GCCCCCCAAACCATGICC C.GGGCTAACCAGGAAATCCGTGTCACACGGCCGGG
TGGTATAAATCACCGGTG

CCCCCCAAACCATGTCCG

GAGGTCCCCCACAAAGC

CCAGCCTGGTTGTCCGT
11 C.C.CCAGCCTGT1TGTCCTGGGAGCCGTTGTACGCCA

GCAACGCGGGACTATGC

GCGGCCGTGGTFAACC

TCAGCGCGATCCGACA
11 GCTCCGCTAAAAGACCGC.ATCGGTGATGGGGGGGA

GATCGCCIGTCFCCTCGT
11 TGITCCCAATTTGTAACATCAAGCTATCAGAAGATAA TTCAATTCAGAC.AGGGAATCAACACTGATITACCCA
AAAGCAGGAGATTAAAAT ATCAGAAGATAATAACCAT

CTCCCTMCCATATAACTC
80 CT 1 11611t A
AGGITIGAGICTGTTGCT

GCTGGCGAAATCACATGTGTCCAAATITTGATTGAA GCAGGCCTCATATAAGAT GAAAAAGGGAAAGTAGIT
83 1 AGTTAAATCAG 1 AGATACCX: , CT T AAATCAG

82 TTFAAATACGG ' CFGTC CFAACGGGGCATAIGGAG ACGG

CGGACCTGCATGACTACT

84 ATFIAAACCCT MC II CCMITACCGCTG1TACC a:CT

85 . CIG GICG TCTCTTFAC.GCGGACFCCC CCM
TGIGCCTICTCAT CTG

ITAGACTCTCCTGAGCATT
86 ATTG ocrGAAcr r TCAGCTCiGTATCGGGAA. G

CACACICTATGGAAGGCG GAAACAACACATAGCGCCT
87 GCCIC GCiATTG G C

GIGGACCATATGGCCATAATTAAGATCCTAAGTGAC CAGGICAATTATATICAGT AATAAAAGAACTACGGAA
88 GGAACCFCi GGGTTCTT A ICiGAA CCIG

TTACCCAAAAGATGITIT AACCT GGAGTAAAATGAGTGATG GATCAGATCGAGTGATGG
89 'MGT ITCGACITi GT CT G I
11 AGATGC TAGIGGATCTGCTGATCIAATTAFTGCGGCC A I TGGAGAIGTGCCACACiCACTGFC
TAAGAA IGICC AA TGACGATGT TCiAC:CAA

ATGACICIAGITCACAAIGGIGGGATCTICIGGITOCT TGGTGGGTTI ACA T !TAW,.
91 . CITCAA TTICTGA AGAAC
AAGCGGGTCATCACITCAA .

11 CI ITTCACTGACUCCICAGGAGi GATCGGT11 TIGAG AT ICA ICGTCGATGAIGTGGGAA
IGATCCATIGATA 'IGATCCGTITTIGAGAGIT
93 AGM: GGTATTGAC GAGGGTGGIGGITAGCAT C
11 CAACiTACG1CTCTCATITGTFGGAACC:AAGGCCATTA GGACATTIGACACCACCCAG
TGCTFTGCITEGGIGG TGAACCATITCAA TCfliA

AAGCC
11 GAAACCCTCTC-AAGACCMCGGAAAAGATGCC.GGCAC AGGTAAGGAAGACAGAAGATACGGC I II i bCAAGG CCACJAAAAGACTAACAA

GAAAAGATGCCGGCACTT
11 CACACAAAT CACOACGACAGAAATATTI TGA1GT ;AC
GAAAGCGICIGAGMAGTATGGAAGTGCAACACA GAACCICTIGCTTCCAGTT TAITTf GATGTUACTOCTGA

11 CGCGATAATATCANTCTICTCC:TCAAAGAGTTAAATG
TOGGTTCTAAATITCAACTGGIGCGCAICTTAGAA AGTAMTACTCAAACT CC AAGAGTFAAAIGTGAGGT
97 TGAGGTGTC GCATTCGC AlTGC. GTC
11 GCAACAAC:AACGACACGAATTACTAGGAGCATCCAG ACTGAAACTGaATTAGAGCATITFTGGGGCAAAG
GAGTTIGTCAC:CAGATTCA AGGAGCATC:CAGTATATAA

GTGAIGAGGTGGATCAGGGGATITGICACAATCCT ACTITGTAACIGAAGCAG GI
GGAIGATATACHATGTA
99 CAATGTA1TGATG CC(X.I AC TTGATG

00 GACAGTGG . AGTTCTTTCCA GACGC G
12 TGAGGCCITCTTITCTCCCATAATTGGACTCCCJWµTA CCCITTACCTGTAGCAGCAGTt. t I 1 t i IAATGACTGCA TCTIGAATCTAATTGAGCA TGGACTCCCAAATAAATTA

TGTAATAAACAAAGTAGA

CGACTCTTGCTGATCGATT

AGAAGCTGGAATTCCTGA CAACATTAATTGAATCAGG

CAATTTGATTTTGAAAATC

CITTAGCACTGACGTCACT

CIGACGCTGC

CAGAAIGGIAAACTGTAT

CAACIACC.AATCGCCAGAG

ATCCTCTGGCICAAACAGATGAAGCTGTGITIGTIT TCTITAACATGGATAGAG
TTAAGAGGTAITGAAATAG

12 CACCAAAGCCAATGICACACATATTCCACCTGTGCAA CTTGC.AATTTCAGAACATITCAGCAGAAAATCTGGA
TCCACCTGTGCAATTAATT
09 TTAATTAATTC TATTTACAGGTGC , ACCTCAGGCTGGAGAATG , AATTC .

ATCAATITCCACTAGGTAGACGGTGCTCTITTCTAAC TGAGGATCAATATAGATT
TGCTGC ACGTFTGT ITTGCAG
CTAAGTGTCCTACTGCTGC
12 AGTACATTC.AGAGACCATTCACAGTATTGTGATAAAA 1 TTTGTCTGAATAAATTGCTGATGCGTGMAAATATT CCTC-AAAATAAGCAATGT
ATTGTGATAAAATACTGTG
11 TACTGTGAATCCC ' TGCCAGCCC ATCCAA AATCCC

CTGACAAGCACGAATCTG

GAGGGTCGATACTGCCAA

12 GCTTAGATGCTCTCTCAGGCGAC.AGATCTCAGAGCTA TAGTGGAGGIGTTGAAGATGAAGCTGTCTTI1TGCA
AGAAAGTTAAATGTTACT ACAGATCTCAGAGCTAAGT

GAGCGACGACGACGAGA CCGAGAGAATCTGGAACC
IS AACCAC CGTGICT A AC
12 ATCCCCCAGACCc1TTTCCATTTGGAAG1TTGGTATAT TTGGATATAGACCATTAGGATCCGCGTCTAGTCACT
TGAAAATTCTACTATTGCA TTTGGAAGTTTGGTATATT

GGGCCTGATATATCTTTTA

12 TCTCTTCAAATATGITGTCCCCAATGIGATACTAATGT 1 TTAAAMC.AGTMCCGCGGGATGAGCTGGIGTTC

i ITTGAGITTGAAAATCCCG
19 AFCCC:CiC I GAGA , CCTICTCGAITGGTGCAG C

AAGTCAGMTCCTTGGGTCGAGCTCGTITACCATIT CAAATCCATGGCTACTAA
CCAGAAG ' AACAAAC ATGC
CCTACTGCTGAACCAGAAG
12 C.AGGAGTFTATAGGTCAGMCCCATGAATATTTTACC GGTCAGTCATFIGGTCAGCAGATCTCACMAIGGAC
ACAAACTGTGATCAATAA TGAATATTITACCGTGGIG

TGCAAGGGATGTTTFACA AGGAGGAGACATAGAATC

CCATACTIGGICAAGGCA
23 . T AGAACC GC;
CFGAGGCTGCAGGGACIT

CaGGICAITCGGTGTTCA TCIGTTAGGACCCTTCTCG

CCMAGACAGGTAGAAG
2$ crrci ceicrorr ATM T
ecGAGAAAcrcceacrre AGIGGCAGAAACFGGAAC ACAAGTAGACCAACAGCA
26 CA.CC: CI CCA IC CC

AGGGCITITTGGTCCCIG C:AAGAGGGTCTGACATAG
27 AC:ATAGC 1 ITCIATGICI TTACCAGCAGGACAGCTC C
12 CGAG TGTCCCTCCTFTCCACACiGGACCAAGCCC TATC: i GTGCCGAGCACCTAGAAGACAGITIGCCATGAIGT GCTGAACiCCCIAAAAGACi 28 (:.CA GTCCTGG GC
GGACCAAGCZCTATCCrA

ITACCAGTTGCCAAGATAGAGCC:GCCTIGGFCCATC AGIACTAAATAAAAGAGT ACCATAAT
GACAGGAGAT
29 , GAGATACC 1 TTTCC AAGAGCC ACC
.

AGAIGTAGGGGAIGCCIATITITCCCTGC.TAGGGTA f3AGGATGCTGATAGATTI AAATAAGG
ICACTCAAGA I.
AAGATTTCAC AATGCAGTA TAGAG AA ITCAC
12 CCC I AGGCCATTI AGAAG ITC( TIGACAGGACAGGTI TCI AC ICCAGATGAGAAGT

12 Gri Ca GGCICAAGATAAT TTIGTICAGAAGAAGF GC

32 AGTGGA CC:766767A ACTCA
C.AGAAGAAGTCiCAGTGGA
12 GCCTGGTATAGGATC.TCC.TACTAGGGAGTGGGACTT GCAGAGACCTTCTACACAGATGGCATATCCTGCTTT
AACTACTGGCAAGTGACA GAGTGGGACTTTGTATCFA

IICCIGGAAAAGATAGAGCCCGGGG1TAGlICITT TAGAAGAAAIGATAAAGA

AGIT:TATG1TGCATGGGTC
12 'MCl/ TTA TIGGCCACCTACTGGGAAGCAGAAGT CA I
GCACACAGACAATGGCC:CCAACCIACCCACCA TGCC GCAGTACA KT TC3CAAGT
GAAGCAGAAGTCATCCCAC
CCCACA AC GC. A

CCAGACTGIGCAGACATCCICITCACCTGCCGTAAA GGAAAAAGT CATCIAGAA GAACCTAACACCAGAAAA

CCATAGAAGCCTTAAACAG
38 AC.AGGG . ATACTFTGTGTAG TTCGAGTGGCTAGAGAGG GG

GGTAATACiAAACTCTGAG AGGAAGAAGCCTTAAGAC
39 TTAAGACAT 'IT GGA AT

ATATCATATGAGCGAAAG
CCA AAGC.ACTTG GGC
AGACGAAGAAGGACTCCA
12 GCTTGACACATGGITITATTGATCITTGCAATAATAC.A ACACCTCTATGIGTGGCMTGAATTGAGGITGTGGA
TGGAATAATACAGTGACA

GATAAGAMAGGAGGTAT C.ATGGTATTFAGAAGATGT
42 AGATGTGGT TAGGGCAAAGC AAGGAC.A GGT

CAAGTATTATAATCTCACA GTAAGAGACCAGGAAATA

44 Ci ACAAGAA TGTCAATG C GAA

GACTCTITATCGCCTGACTGGTGATATTTGCCIC GTOGGGGAACGAAAAAC

AAGAGTC.ACTGCTATCGAG

AGACTTAAGGCGGCCCAACTGCCTCGCGAATGCTT ACTA TCCA AGAACTCCCCG

47 CAGCCC CAATTTATAGGTC , ACAGCAGTATCAGCAGGG
AGAGTACATGAACAGCCC

GGATTGCCTG7TATTCACTA AAC.TGCITAGTACACCCA AACAAGCAGACATGATGA
48 GATGA i TATGGTATCC G TGA

AAAIGTACCCGCTTC.ITGC

GAGAGGCTGGCAGATCGA
12 TCCCAGTGC.AGATICGATCTGAAGGC.AATAATTGTAC
AACATCGTCAAACTCACCTCATGATCAC.ACCAGT CA CGTTGC.ATTITCTAATATC
CAAGGCAATAATTGIACTA
SO TACTCAT CATTGA CAC CICAT
12 GCAGGTTTGACTTCATGGAGTATTGCCTAGFCAGAC ¨ ACATCTGGATGCMCCTATAATGCCICTGAGAAGA

51 C.AAAATT3C TTAGGTAGTIGTC AC
CCTAGGCAGACCAAAATGC

ATGGGAACGGCTICITCA

CCAAATCAAACAGMGAC

MAGGCCATAGGAAATTG CCAATATGGGTGAAAACAC

AGCTCACAAATAGAGCTTGCAGCTCATCTTCACT TTAAGTGAGCTAGAAGTA
SS TCaAGCT GITTATTATCCC AAAAACC
CGGAGCAATGAATGAGCT
12 GC,ATTAAAGGTGCCAGCAGCTGCTITGAAACCAAAC 1 TGCTGCATCTITAMTGATGATGGCCTAGAAGCAGC
CTGCTGTAGAAATAGGGA
56 ACA t AGIT:CAGY A TGG TGCTT
TGAAACCAAACACA
i TATGAGACCCA CTITGTCCAGAMGCCTIA :FGGAGATCCAAACAATATG
57 C:AAT ATCTGA. 1 1 TACAACT G , AA GA U

CGGAAGCCATGGAGGTTGCTCCCAATTGTCCTCATT GACAGATGGCAACTACCA TGAGAACAGAATGGTGCT
58 CTGG ' Gccr CC GG
12 C.ACAATATCAAGTGCAGTATCCCAAGCAGCGATTCAA
TCTTGATCGTCTTTTCTTCAAATGCGAAGGCCCTCTT

GC.AGCGATTCAAGTGATC

GGCCATTITCATCATGGGAGATCTCCTCCATAACA GAGTAGACTCCCTTTGTIC

ATTTGATGAAAGTACAGA CTAACGTTTCCCAATTAAT
61 . AATTAATAGACG TCGGACTAC.A !TOM ACTACG

AAAATIACTGATGCTGAAT

GAGGAGGTTACAGGACCATATCT GGCA TAAAGCCCTTCCATATCCT TGGGATTGGATATACTATC
63 C:TATCAGG GCATCIC AAC ACiCi CTGTAAATGTTCCTAAAG TGACAGITTATAATGTATA

GCACTGGCTATAGACCTA GAGGTACAACAAGGCCTA
65 COAT I AGATGGATC:AGG TAG 1:

TCTICAACTACACATCCIGACCCITTCACCTTGATAA GGGTAATCCTACAGITAT AMA
TACTGCAGCIATAIT

12 CGTICAP.CTAMGGOTGGGAGTIGGCTGAATTIGAA Mal- T GAACACi G
TTCCTACACGAA T ATI CMACTG CCTITAACAACGITITCTCT sUGGCTGAA.TFTGAAA TAM
67 . ATAATC1 GA CArAGCG AM GGA
=

CICGTGCATCTATAACIACCAGAAGGACTCAATAGC CO GATA GTG AITTCAGA All ACATACTGCCTAC MT

12 ACATCTGAGC:GITT GCGMCATCAGATGACTA MT C
TITIGC:AGATCiACTGICTCiGATGAAG TACTCTGGC I CCTAGC:ACAGACAATACA CATCAGA
IGACTA ITT Ta GGCCAACACCTACi TAGGACATCCA rrmrcc GAACACAGICTOTAGI ACT
70 GTAGTAGTACC AGGATTTCZ Tarr ACC
12 GTGCAACC.TACTATMACACCIGATGATGAAAATCTAT
TCCTC:CATCiGGCTTGAATATTCTGCCTCiCiCTTCAAGGG TACT AAAATC.CCACTCTTTA
GATGAAAATGATCACAGAT

1l7GGGCGTAAGGAGCAGCATCCCCAACAMCCT A TAACAGAACAMMGC

GTACCCCCACCACAACi AAGACAAA I A TGAT A ITGAAATI GTAATCi ITIGCAAAGTMTCIGGA

AACiCGCAAAAGGICIAAGTAATTTCGCATAC:AGTAA rr-rac:c:r CAGAGC11TCFC
CGCCGCFTICTATAICAAA
74 ATCAAAC ACATTTC.ACAGTA A C
12 CiACAAACAGCTITCGCAACAGAAACAGCTGAGTCIG TTAACGGCC.AAATA TGAATCAGAGAAA I
AGCCCAT AAGAATTAGCATCCITTIT

AAACAGCTGAGTCTGGTT

TGGAAGATCTAGTGATGC
76 AA . CACAGTCT C
AGCCCATCTATTGTGTGAA

TTGGAAGCAATAAGAATTT

CTGGGAAATCATATATTTA

12 AGICAITAATTGCTCTTGTAGTGC.MCCCGAAGATM AATIGGGACCCACCGATTIATCCAGCATAATACTCC
CTGC.AAAAGCAITTAGAT ACCCGAAGATAATGAATCA

ACAACCGCAGACGACAAGCCCTIGTCCTCCiTCGTCG CTCCCTCTCITTACCACICIC
GTC:ICTATCCGACICCCC.T

GAAAAATTGTAATTATGT TAAACGAATAAAGCGAGC

12 GCATCTGGAGTGATTGGTAGTACTTCCCCAAGACCTr AGAAGAAGGAATTCCAGATGTIGGGIGITTC1GCTC
ACAAACACCAAATACAGG CCCCAAGACCITCTATTCC
82 CIATICCA. CTAAACCAG AAC A

TTAGAGAGGAGTITGATA

AGGGCCaTTMCGAGTA

TTITTC.TCAAACTCATITG TATAGGAGGGITCAGGGA

12 GCAAACTGGITAGGATCTGGTAATTAATAATGTCCAA TCCTGAAAGAGAACGTTTAGTTTGGTAC.CGACTCCA

12 CCCGCCGTCCATAAAAAAATAATTTATTAATTCrAcAT
ATGCCAGACATTICTITTGTAAGGAGCTC.ATCTGGA ATTAATTCTACATCTAMT

12 CCAGCTGC.CAGTTTIGTAATATTGACCMAGAGGCT TCCTCCTCAAGGATTGGAGGATACCITTAGGACACA
AGTAGAAATAACAATGC.A

CaT1AGAGGC1GATG7GT
12 GGICGGGIATCAA.ATACAGCAGMACATATCAGTC.G AACAACCGGTICIGGTCGTFAAGACACAGGCACCG
TATTGACTCATGAGTTIGT TTTACATATCAGTCGTTCA

12 CCTGTCTTCCTGATGTGTACTTCTTC.ACTCGCGAGATA
AGAAGAATC.CCGCTCTTAGAATGATTTGTCAGaGT AACiAACTAAGMATITGA

CACTCGCGAGATACTGAC
12 GGTCCATTTCTGTTCCACCACGCTGGATCAGACAGGG AGTACAGTCCATTATCCAAAGGThiliLLGTGTTITA
TaTTGGAGCAAAACAAA

GCTGGATCAGACAGGGTA

CAGATGTAGACACC.AGGA GGGAGGTAAGMATGATG

12 GTCCAAC.CATTGTGAATTCCTCATAAAGAGAGGAAG GAAGAGCAACAGCCATTCTAAGACCCTTCCGCTGAC
GAGGGTTCACTTTCAAAA AAGAGAGGAAGAAGTGCT

TGAGAC.AC1FTCAAAAGGATGCAAC.ATTGICCATG6 CCGAGGTGATTTGAATITC

i CAGGGAATGGAAAAGITGACGATTCTGGCCCGTTA
95 GTCCGA 1 ATCTCC , GIGGTCGTGAGCAFTGAC CGTTICTIAACCiGICCGA

GCCAGAAGCCAATATAGT
96 GTGG ' CGGTGG GTACC GG

CAACAAGACAACTAAGAG TTACAATACTTGGGAAGGA

AMTCCTGGTATTCGCTCT GCGATCATTCCAACAAATG
98 ATGC CCiTFCCACTG G C
12 TaTCATTCACGAATTTCCCAGGATCCGATTTAATTAT TGAGGCAAATTCTC.AGAGAATCAGGCCATTAGTTCT
GTGACCAATTCCTAGAATT CCGAMAATTATTGAGAG
99 . IGAGAGGCG TATICCACIGTA TTCAG CiCG
13 CCAGITTGITTCC.ACTCCCATATGGGATCCATCATTCC ATCAACAATC-GGGATa:AICATFCCGTAI
13 ITGTCCaCCACTATGATAGCATCCAGAGIGGAGTAC CTIGCCATTICAGAACATAGATAGOCTCCTITGCTTA

TCCACiAGTGGAGTACAGG

AGATTCCAAAAGGAAGAG

TFIGGIGCIATAGCGGGT

CAGAAGTGTGGTCATAC.A CTGAACTCTIGGIAGCAAT
03 CAME; AMGCAACCAGT ATG G
13 ACITGC:CAVCTGCCTrAACAAGGGT1TAACCGAAAA
ACTTaGCCTCATCACAGCCCiCAGATFTG TAAAT1CC GAATIATAAAAAAGTACCi GaTFAACOGAAAACGGT

13 GaTITGACATAIAGCACAGCGP.ATAITTIGGCATAC TATCGCCAAGTGAGAAAGATC:ATCGFCCACCIATIT
GACACTACGACAGATCAG ATTITGGCATACAGFAGAA
OS . AGTAGAACAA AMATGAATCTGA C CAA
.
13 TT CTG1AACGITGT ATG TCOGTITAATTTGACACat GTGIGGIIGTACACAGIGTGGATGCAGGIGTIGTAT GGTIGACCTGTAITGTF AT AATFTGACACCTCAGATGA

13 1 T FCCi GATMCCTGCACCICACGGGAGT ATGGGTAA
AGGGIGGIGGAATIGCTTAAGIGACCAAATGAGSA CAGGAACAGGTAGAAAA.T
07 CC TCCAAACAG GC.
ACGGGAGTATGGGTAACG

ACCACAGTGICIAPAGGAATGTGGCTaATIAACAG GCAGAGGCGTIATATACA CA TACAATGGTIAACCIG1 08 CC:TGTACC ITGATTAGCA CA ACC
13 CAGCACGTTGGGCATATTC.AGCATATGATAATGATTI
ACAC.AGATAGTAATGCAGCTGCTCITCTACACATAAT TTAACAGAAATGGTGCAA GC.ATATGATAATGATITAA

13 ACCiCATCFMACGITIGGCCGAAMCAAGAATCAGA AAC.ATGIGCAGCACTGGAAGFCCATTIGACGAGCCT
AAGGTIGIGGICCAGOT GAATTICAAGAATCAGAG
GGACG TAMT A GACG
13 GCCiTCCCTGIT11T1TAAACACITTCCATGGACATIGC ACIGIGGAGGITGTATTCGATGGCG
FATATAAATGT AGICCACATTIGGTACAG CCATGCiACAITGCAAGAG
11 AAGAGA CCATGaGTA AA A
13 GG11.13GGGIGGGGICTITGTAICACCTACACCX:ACA CCCCOCCGCGAAAACGATACTIGTGACGCCGTAGT
CCACCGACCTATACAAC:AC CACGTACACCCACAACCAT

TGIGITGIGTGIGTGGATfAGTTAGOAGCAAGACA AIGTACAAACATATATAG
"FTACIGIGAGTMACTAll 13 CACTATTGTG GGAGG GGGAAC.A GIG
13 aAAAAACCACAGCAGCAGCCriGTACATACGTGATA TIAAACTACTGCAGACATACGTOTGTACAAACAATA

14 ATATGGTAC . CAAAAACCCAAC TGTGTGGTGTGCATGGTA TGGTAC
13 TGCAGAGGACGTAATATTAAACCCAATTGATrCTAGC GGACATTTCTCCTCCCAC:CAATACGTAAAGTAAAAG
TATTGGAAGAGTCTGCAG ATTGATECTAGCATACC-AG
ATACCAGTGC GATCACTGTA TT TGC
13 TGATGGCCTAGTGATTGGGCTGGGCTCGTATTGGGT AGGCTATAGAATTGCAACaTTAGGTCATTAATTGC

GGGCTCGTATTGGGTCTA
13 GAGGAGGCAGATAAACCTIGITGTACCATATTMTT CTCCTCATTCC.AAGGIGGTCAACTGCTGGCATGATA
TACCATATTTTTFTGCAGAT

13 ACCATCCTGAATAACACTGITITGTCTGCTTGCTC.TGC
TATCGCAGATGTAGGGITTGGAGCACCTCAGCTITT

13 CaCTAGTATGICAGGATTCATGGTTGCAAAATTACA ATTGGAATGTIGGTGTAGCTCCATATGGCCTGAGAC
ATGCAGAGGAATATGATA TGCAAAATTACATTATCTG

20 'FAGTACATGC ATGC1IGTGC TTGT 'FGC

ATIACAGGAAAAGCATIT

riTGIGTTACCTGAATGGGT

AAATGITGIAGGGAAIGTGATGGFCCOGATACTFA CITSCGTICTIAAACAGTA A
ACAGGTCAAGTITGTAAAA
22 CiTAMAGAC ACTA AATTAGCAA TTC GAC
13 CCTCCGACCGTCTCTITAGTCTOGAATCCCAGAGCA GATTCCTCAAGAAGCGGCAGGATCGTC.GTCG1TGTC
1TCCTCCCTCTGITGOTAG CTGAATa:CAGAGCMCG
23 ACIGG CTCTT , C G
13 AGACCTGCTGCTGGTACTGACTGC.TGACATTACTGC.r CA AGAGTTGAGCGACTTAA
ACICAGCTCCAACATITC CTGAAGGACGTGGAGGA

CTGCTGACATTACTGCTGA

TGTAGGTGAGCATCCCAATGTAGCTGCACATCGATT GGTIAATTGACACTCC.TG
25 ATGTTG ' TGAG GT
GCCCAGGICTTGATUTTG

GGAGATATGTGCGATATA ACTITIC.ACICA111TCAGA
26 TTC-AGAAAGT CTCC.ATCT GGTC AAGT

GAC.AATGCTACACCTCCA
13 GGGCTAGCACATCAGC.ITCCITTGAAGTGGAGCTTGT ATTAATGCAATGGACTCTCGCACCTGAGGAGGTGA
AlTATATGAGACATACCG

MGAAGTGGAGCTIGTGA

TACAAGGATTTAACATM ATMCGAGATAAATTIAC
29 AATTTACCICC GACGCTT GGAAGG CfCC

TTACCCTCCTATTATTTGC.CCACAAACGTTTGTGTGC TGTACTGTGAAAATAAGC AACATATGGTGCTATGTAT
30 GTATCCTG GATGTT TGTG ocm AAGCTGGC.AAGAACAACACCCAGAAGATGACTCAA GCTTTCTAGTITA AACC.AC
TTGAACTTAGAC.CGAATIT

13 GC.CGTAGTGC.1. GGCTAGTACC.AGGCAGATGGCGACT I
ATGGCTGGATCAAGTGAGCAGGCCATTTGCCTAGC

GGCAGATGGCGACTACCA
i 13 CCACAATATCAAG %CAA TATCCCAGCAAATGCAGA 1 TaTCAMTGCATITATCGICGCTAGACTCAGGCACF ACCAGAAACGGATGGGA GCAAATGCAGAGATTCAA
33 GAITCAAGT I CCTFC , G SI

ACCTAACAACAAATGGAAGTAATGCGTTTTAGATCT GCTCTR. I i I III TATCTCA
TGGATAATGAGAAACAAC
34 AACATGC ' TTCTCGATCATGT TTCAC ATGC

ATCAGGAAACTATGTTGC TCAACAAMAAAACCM

ATATGTTCCAGCTAGGACAAGCACCAAGTTCATCTT CTGTATCCTCAGAGATCCC CA
ACTATCCTGCCATATGG

13 TaTCCCAAGAATCCATGATTTGATMAGAGTTTGCT ATCAACTAATATCTCCTCGGCCCTAGGTCTICTIGGG
ATCCTATTATACCAGAATA TTTAGAGTTTGCFTGCAAC
37 . 1 GCAACT CCM TAGSGT 1 38 GGACC GIG 'ICI TGAT CAG CC
13 GISITCCIGATGATICTG laGTATGAAAATCTCAAA AATCCTCATCATGGAA1 CICACCAAGTTGIGTITGIC CCACA TCAAAAGGGCAGA GAAAATCTCAAAAACAACA

TCCAATGAGAGAGTATCC GAAAAGATCAAGAAAACT
40 GAAAACTAG 0 17 TG MAT'S-TAIGA ATGA AGGTTFG

GAACAGAGAAAAGCAGTACCCCATCTAAATACCGA TCCAAAATGAGTATAACT ATACACA/TCCCAGAATCA
41 CiAAICATCA 1 GAICCAIGIT AACTCFG "FCA

ACAAAGMCICTIGaCaCCGIACAAlICACGAA AAAGAGA1GGTIGTGIA1 ATAAAACCAGAAC TAT ACC
42 CCC.ATG GATTACTC ACAGT CATG
13 CraCAACTC:CTITTGCGATAATCAAGAGIGGAIGCA 1 AATCICTACFTCAA ICTGGACCICACI

43 . ATT11CC 1 GATTATCCC.1TA CT CC
.

CAGGAMCCCCCCICCAAAG TAATT1TGAGCCTCATAFX: TACiAAGACFCTAACTCTIG

13 'FCTGAGTGlICAGC.AGCC I AGTGA ITACICAATCACC AGAAIGG
rATATTCCICTICCCAGCACATCTGCTCCA TTUACAGAATCAAI AAAG 'FGATTACTCAA TCACCOCC

13 Cta: ITC:ATTACK; IGGITCiATIGATTGC:AAACTGTA ? AG I ACANT
AGGTAICAACGGAAIGCTGIGTAGAA r TrGTCAAS GGAGCi AG MG TaAAACTGTA I AACAACC

AAGCATACCAATC.ACGGAAACICIGACICTTGTTGTTAT AAAAACTTAGGAGTAAAG
AATCCAACTCTACTCATAC

TAACACATGAIGIAGGIAT

'FATGCFCCACTAIACCCAT

13 TCiACIGGG1TCACTCTCGA1TC:ACAGGGAGCNITG 1 G
ACTTACTCAACAGC:AACTGAAAGACCAGCTGAGAG TGAIGCGTATCCACTAAM

CAC.AGGGAGCAlTGTGTC
13 GC:AGAGT;TGAACCGTAAGCATITCiCCTCCTGIGACA AAACGCF GTIGATIATF

ATTACC

GACGAAACAAGATCAGCATOTCTGAATAACCAT TGAAACGATTATAAGTAG TAGCAAAAGAATCTATTAT
52 CTATTATGATGGG . TCTCAATTGCTT CAAGA GATGGG
13 GCAGGTATTAAAGAGGCATATTGCAATCAATCCAACT CAGTGTTGGaIGATTCAATTACAATGCACICAACTG
CAACTATATATTGCCCITG ATFAATCCAACTATAACAC

13 CCTTCCTCCTGATAAATGAATCCAC.ATCAGATATGA A
AAGIGGAC1TGAAACACCTGACCITCTGATCaGTT A ACACTAAGT AGGAC.1TT
ATCAGATATGAAGATATGT

CCGCAATAAGAATAGCAA ATGGGCATTIGGTAATGAT

AITTGGGTTGCAACAGTTMGAAATCTCGTGACGCs A ATCTACAGATAGAAAAGT
GACTITGTGTCTACITITCT

CCGTAGCAAGTGATTCTG AATTTCTIGC.AAATGGCTGT
ST GTT TATGT A T

GATGGCAAACCTGTTATG TTATIGGCAGITTGTTGACC
58 Ci3TGACC.A ATICAATCTCT I* AGT A

TGCTCTIGACAAACTITAC TaCCTGTTAIGACAAATG

TGAAGGIGTCICTGITTG AGAGIGATTGCFCTICAGA

13 ACAGC.ACAGTCAAAGACAGITTGATGCCTGAGAAGT TGAAACTTGTACTATTGCCGGTAAAAGCGCATTATC
GTGGTACTATACAAATTCT TGCCTGAGAAGTTTTTGAA
61 TTTTGAATG AAGCAAA , CGC , TG
.
13 AGCAGTGCTACGTTCAACACAAAGTTAAGTATGCAAC ACTTACAATCGCCAACAArfATTCCACATCGCCAATT
MAGTTAAGTATGCAACACT

13 ICCICAAACCAAGACTCGITAC.TACTTIATGTGAGA G 1 TGATTACAGGGCTTTTATCAGTGTTTTAGACTCTGCA AAITGTCAAACCTAATATC ACM AMTGAGAGACTAT
63 ACTATGTTG ' GCTTTCAC AGCC GTTG
13 AGMTCTAAMACGCATTATAGGCCGAACATGTGCA GGITTCTAC.IGTTAAAATTGGTGGCACAATGTAAGG

GAACATGTGCAAAGCGTT
13 GTCAAAATGTGTAGGGTCTGCAATTCAGGATTAAGTC AGATGCAGAGTTTGTZTaCCAGTTC-ITATAAAAGA ATGGATTGAGT. I I I I MCA CAGGATTAAGTCTTTTAGT

CCITTTGAMTGCCATTTG TACTCAAACAATTGGATAA

ACTGGITTTATGGTTATGG AMTGTAGTTGTACCAGTG

GTATTAAGGATGCAGATT AAGTTGAAATATCTGTTAC
69 Cf GTTACACCA ACAACAGTA ATAATGC ACCA

AACC.AGTGTCCGTAATTAAAGTrGCAC.ACACAACAA ATACAGACAC.AGAGGITT AAGGT1A
AGGATITTGTGT

i ATACCAGCTGACTACATCTIGCCCCTACAMAGAAA CCTICTGGTAATCTIATTA
GITCIGITGITGICTACAT
72 ACATGTG 1 CAT TACAAGTCA , AGT TM GIG
13 CGACCCAATC.AACATCTATTGCITTTAAGGACAAAGA 1 AGATGCTGTTACTTTTGCMCAGTCCATTAACAACA MOTC.ATTTGAACAACAA TAAGGACAAAGAMITTCT
72 TCITTCTGGT ' GCACTTTCA GTTG GGT

AAAACTCTGTTGCATCTAT

IGTGCTAGTGICAAACGT

GGTTATGCTAAGTCAMC TCAAGGATGATTACTGTGA
75 . ACTGTGATGG TGCATAT 111.6 IGG

AAGGTTATTTCTTTTGTCA
76 TGTGAA ACC:ACCATF T GAGA TGIGOTiTTGG
TFGTGAA

GATGTACTACGTAATAGC

'FATG1CITTAGCCGAGTGC

CGCAAGTATATTGTAAAAA
78 AAAAACTAGC CACA CAC; MT TrCTCICT GAAGGI
CTAGC

TTTTTGAAAACITTGAGGA GGCITCACACCTITAAACA
79 ITAAACAAG ACTCC:AAA CrCIG AG

FGAATCCAATGCAGGTGTAACGT AACCAITGG TIAAGTTGCCAGAAGITAI CAAGGCTITGG TIT TAG AA

13 GCAATTAAATAMCACTCAC.ACCAAAA IT GTATF TC IT T

GCTACT A
81 , CiCTACTAGAAGC ACAAAC ATGC GAAGC
.

AGCAGACACGGAC TrAGCAAAAGCAA TWIG CTITICGCGTGATC:ATAAT

CTTAGCMTI CACACCA TTGGIGTIGTAGGTGCTA 'IGCATG GAGTAACICIT AA
83 ACTC. TTAAAATT AMACACC C MIT
13 GCAACCACATTAACTGTTAACATCFGGTTITGAAGAC TTATGCTOCIA TAT TGAATOCi TTITCMAITA 1 AATGC CITTGATGGIGTTAMIAT GGITITGAAGAFX:AACCTA
84 CAAC.CTAAT TCCACAAACA GGT AT
13 ACCC.AAATACFMGMCITATAAACAAAATCCATAACICT TAACCCTGGITTMTTACTCCCiTAMTGTAAGACAG
CCTGCAAAGTGGTAAAAC AATCCATAAGCTIATTIGC

13 AOCAAT GTGTACMCGCMIT TAAT T TGI CT11 TT GIT GCACTIATITGITCMACTC.A 1 TGCACCAGCAAACTA CI GIT AIGCAAATGGACA. ATITGTCITMG TIGCACT

13 CiGCTGGACT AACACAGAA ATCA TRAGCTOTA 1 GTACT TA
IGGTTOCTAATGGTITGAATGCAAGITTAAAACiA GI TIT ACA IGTMT TAGGC GCT GT A
TGTACTAT GGI Tr 88 TAATGGG CAACO AAGTGC. G
13 GCFTGriCACSC.CA ITCTGITAAITTITGAAGAAGGCT
TGCMCAGCTATGrACAAAGAAGTGCATGGC:ACTA ITOCTCACCACAAA TAAM n-GRAGAACX3CTAIGAAT

TACCAGATCATGATTCATT AAATGATGGTAGATGGTTT
90 TGGTITTGTG . CACAAGTC TGTG TGTG

GAAGGTAATGCMTATAC

GGTGGACGTGCATTCATG
13 AC-ATCTITACAC.AATTGCCAACAGGCAGCTOCAGCCT GAC.
TAATGGTTCTGGTAGCGGTICCTGCGTAGTGTT CATTGTICTTTTGC.TGTTG

CAGCTGCAGCCTATCTTG

TGTAAAGTITOTGGITGTT TAATCATGGCTGTACATGT
93 ACA.TGTGAC TAATC GG GAC

GGAGATATTGGTTCTTACTG671111TCTATGGGGTC GCATGATTTC71TAC.TIOG TATGGTAATG1TAGTAGAC
94 GTAGAC.AGGA MACCAATT CAT AGGA

GlITTGACCTTAGACAACC TCTTAATGGGAAITTCTAT

GCATTGITCAAATTITAAC

TCTTCGCAACCACAATTCC

97 AGGATTAACATC ALM i 1 i G CATC

TGICTGAAATCCATAGIAGCTACCAGOCATAAAACT TCATTAACAAAGCGTAAT TGACTCAGTTAAA
TCTTAA

13 CAATGrAACCAAGTCCAGCATAAGTGC.AAAAGCATTT AGCCAl. ti i o i AlTATCAGAATGGTAGATCTTCCTCA TTGATGAATCTTITGTGGA GCAAAAGCATTTTTC. TATG
99 TTCTATGATGA GTCCAACA , TGAC ATGA

CATCACTAAGACTGATGCT

TGGAGGMGAATTATTAC ACCACAAACC.ACATCT11C

14 ACCGTAATCTACiTTCTCAAAC.ACACATGCITC01AAT
ATACGGTTACTTACAAATCCACTGCAGCAACATTAT CCACCGTTAAACCGTAATT CATGC.7.
TCCAAMTACAAA

14 TCGCTCCAGGGTAATACACACTEEETGo74-GTGGA CGCTCTGTGCAAAACICTGICAACACTCAACTCTGGC
GCAAACAGCGTATAACCA

TCCTCCTGGTAGTGGAAA
14 TGTTGAGGATCACCAACATATACAATCTATGTGC.ACT

14 CCTGGCAGCTACATAATTTTGACTAAGCGATTTATAC TTTAGGCTrACAAACTCAGACAGTGGCGAATATAAC
CTCCAGTATAAATAGGCG AGCGAMATACATAAAAA
05 ATAAAAACTOCAC ATAGTCATATICAC TCAA CfCCAC
14 AGACAATGACAAATAAGTAGTGGCAG7TTAAGGATT AAGACTAGTGGTGACTTGGCTCAC.ATGTTCATAGGT

GTTIAAGGATTGTGCACGT
14 TGCCAACATTGTCACCTGTGATCATGTCAG.AGGTTGG TGTACCTCTACAAGTTGGITMCCGTTAGAACACAA
CTAGAGACTTTGCCATGC TCATGTCAGAGGTTGGTTA

14 GCTCC.AATCTTAACAAAATAACGCAGGEOTTTGTACT I
ACATTGCCAATGTGGTACICITGCAATGCATGTTTA
08 TIGGGC : AAGCAACAA CIGGCTCATCTGATGTACT
GGIGMGTACITTGGGC
s 09 GITAAGAATO 1 TACAA , TGACIAGA EGTTIGGCIGT AATG

TCAT6CATTTCAC.ACTCCTGCAACGGTGCTGGTTTCA GAATTGTAATGTGGATAT
TGAATTCTCAATTGTTTGC
TGCAGG ' AT GTACCC AGO
14 TCTGTGAATGTCTGCCACAAATTATAACA1TTITACTC AGAGAACATTGaTTTAACGTTGTICTAATGGCTAC
TCTCTATAGAGCATATGTT ATAACATTITTACTCAAGCT

ACTGTCAAATCAGAAGAT TTAACTGGTITGTTTACGT
13 . TITACGTACC5 TCA GGT ACG
14 CAAC.AGTCAAATCCAAAGACTICAATCCTAGTTCTAA CCAAGGITC.ATGAGGTCATAATTGAGCATTATCM
CTGACATAACACTCAAAT TCCIAGTTCTAAGACTGTTT

14 GCACCCAAGTG TAACACTOCATATTGTGICAATATTT TGGCTCTGATTATGGIOTTGCAAAc-'t AACAGTACCAC CGTG A 'FACCAC

ACTICTGTTAATACGTCAT GTTGTCGGAATTAACTATC

AAATGMGTMGCTIGT

"mccrrGTTGCATATTOCT
14 ACG I AGMCAT IGCCAA I AGTAG TGATOCTCACATA CRICITITGTGCiG I
GCACTAAATOICCIGTOCGTGA CAACAACACTI TAG TT TCA GATGCUCACATACCATITG

14 AOCGAI:ACAATAGACACAGG FAAGAIGGTITTFATIC
ACGMATTGIGTTGIACCITTCIAMACAGTT AAAG CAGTIG TCCIT TGATG LAC AGATGGITITTA
FTCIACA
19 . TACAAGCC CACITGCC C AGM
.

GGCACCAICIACTCTATCACICCCACiCITGCTGATAA GITCCTITAIGAAIGTTAC AATAITTATGATGTAT
CTCi GTATCTOGTGTGG ACAACAA AlTGG 61G160 GITTIAACITATTCIAGTIT

AGCGICATACCIAGCFTGCCCTICTATCICICACTGCG TIAAGAAATAGCOCCAGG C.16CAGATGT TACT
GAGAT

CGAATGGCCATGTATACA TTCTITAATTGGTGGAATT

CTGCTITGAATGTATTCGTITCTCATTGITGTGC.AAG CTATCICIAGETCCATICA. TC1 ATGACAG
ACT TGACAC
24 TTGACACTAT C.TGTCT GGC TAT
14 7 GCCITCITT GTAAAGAGCAACiATTGG ITGIOCGTI G
TCACATCTCGC.ATAATGTTICIAACCGIGACATTGCA ATAAGGATC11 CAAGCGT

GGITGTGCCITGATGGTA
14 GTGCTAATTICACTGGTCAAATTCATACAAATTGCCA AC:ACTGITC:AAAAA
ITGC:AAACTCTACiCCACTT TAA TATRA TG ITAATAAGAC TACAAATTGC:CAAATTACA
26 AATTACACTG GTC:GACTAA GCTGC CfG

GGCITCITTAGTTGTITTGC
14 AAAATTACAGCAATCAAAGTTGCGTTTCTAATCCTTri MTCCATAC.AGCTTGGCACTGGTGTGGTTACATTG

28 CGTTTATGTGC . AAGACAA GTT GTGC
14 ATCACTATAAGCACCACACACCAAGCTAGTGGATGAT AAGCTrTGTTTCACTTGCCATATGTATATGATTGGTA
AATTAGAAATCCAC:TAAG
29 CATGC AATGTG6TACA ATG1TC.0 AGCTAGTGGATGATCATGC

CTGTG TAC.GGC TOOT G

TITTCT ACCTACCTGAATACATGAC TAAMATAGTAGAGTCGG

14 TiCTGAAACGTMTGAACATICCATACCCATCPACAA GAAAGGGCAAACEIGGTGGATGGICCIGTGCCAAG
GGTCATACCTCGTAATTTG TACCCATCAACAAGAAAGA

GTGAATCCAAACCTCAATC GGAATCCTICAAGTGACAG

14 GGCCAAAACATIGIGTGACAlTAGAAAATGCAAAAG CCAGAGACCTTGACCACAACTTTTAGCTTTAACACC
GAAACAAAAGAACAAAAG

AAATGCAAAAGCCACGGT

AACTTCACCIGCAACTGCTGAGCATGMAGITTACT AAGMCITGAGGAGTIA ATICACTAGAGAAATGCAA

TGATCFATGTAFGAIGGTA "TCAGAGCTOC7MAATTA

14 C.CAGGTCiTGGCAAATGGACACTCATGTCCIGTITCTG
CTTTTAGTATTTGGCTTCCGTCCTTAAGTAGCCAAAT ACTGTITGGC.ATGMACA
37 TGT ACCTAGGAC , GT
CTCATGTCCTGTTTCTGTGT
14 AGAAACATCTGTCCCATCGCTGCGATGAGTGTCTAGA GGAAATCCATATGCACTGTTC.ACACTTAGCAGCTGA
TACAATTTATAGAGACTG
33 CTC a AAGCC
GCGATGAGTGTCTAGACTC

AAAAGACGGCTGITTGAG
39 GAGGG GGCACT ATC.G1TGIGTGTGICGCC GG

TTGCACCAT A TGTAGAAGAGTATCAAAT

CCGCGGCATAAAAGTGTT
14 GGCGCATITCCITATCTITTATACWiesCATTGCTAAAA AGGACAATGGATAAAAAAGTGIGGGATTAATTAGC
GCAGATGTAGAACCTAAT GCATGGCTAAAAAGTAAC

CATGGGGACTAACAGATA AGTCATGAACTGTATACAA

AGAAAGAAAACATAGATC

ATATATATAGACATTGTGT
44 TTGTGTGGGGT A.ACCC CGCAAAAGGGACACTGTA GGGGT

CTTAGGCCAAGTGTGGTG

CAGCCGCAGAGGTTATTATTGGTATCC.AAAGGTATA
46 CTAAACG 1 GA MC:ACC-AM CAGTGCTGTCCIAGAGGT
CACTTCCACGCCTAAACG
i 47 CAC:C 1 GIGTA , CCCCACiCiGCCITCCAAAG
*FCCAAAGGGCAAGGGAa:

ATCTCCTGCCTTGGAAACTGTGTTTACCACCAGTAG GAGCCITTACTNVµTGCTG
TTAGAAGATACTTATGCTG
48 CTGAGGA ' AGGTTA AC AGGA

CTGATACATTITATTTGCA

ClIGGTACGTAAACGCAAA

AAAACTGTTGTTATTCCTA ATCTGGGAATCAGTACAGA
SO AGAG TAACG AGGT G

ATGTCTAATGATCCATATG
51 . TTGGT GAATAATATATC5T GC.AG
'FGCCAMITTITCTITGGT

CTCTTACTGCTGATGIGTT TCICATAITAATGTAATGG

53 't GCCCA ICTCCACGCGAAGGICCGCGGA AA TCCAGC ACTGAC GGCAAGGCCT
TCCACA TG alCCGCGGAAATCCAGC

54 CGTC TCCTGTCACACCX:CACCCill GACCCIGICOGTGTGC GA
GCA.TGGAC:ATCTCTCCG It 14 GCGTTC.AGCAGCGCCTC.AGAGTGTCG1TGGATCTGA AGCATGTGCCCCAATCTTGGAGATCCAGGIGTAGAT

SS GC.IG GCAGO AC GG
14 CCACCGGCATAGICATGGICACGC:CTGGTGACCTGC TCCGACTACrTCCCGGAGA I
GCGCTCCTCCACCTCA

GCCIGGTGACCTGCCT
14 CiCCGTGCTCiCACCTCAAAC.ATTAACATGTACCCGGCA
CCAGAGACGGTGITGCCiCCGGCAAAGGATGACCAG
57 . CiTCC GM AGACCCIOCGCTCCAT6 AACATGTACCCGGCAGTC.C. .

ATGC.;CATC

14 CTACCTGAACTAAGACTC.iCiG i GTACCATCCTIT IGCC ACACCCAG GM I
GCAGIGCCATGATAGAATTCCAGG CACAAAAC:AAACTA TC1CC

ACCATCCTMGCCAATCA

GIAGGIGCLICCAGAGCC.CCIAAGT ITGGGGGCGC CCAGGGGGGACT17ATG r 61 C.GGGTTGGICCAGGGCCTIGTCGCTGACTCCGCCA TAG CGGGGCCACCTTCATC.A
GTCGCTGACTCCGCIA
14 CFCGTCTGGGTICTIGGCCCCAGCTCCIAAGAACiGC.A
TCCGTAGAAGGGTCCTCGTCCTACCCC1GAAGGTCiA

CAGC:TCCTAAGAAGGCACC

63 AAGAGGCCCCTCTCC.GCGAGACCCTAGGCGTCCCCT TGGGCCCTCAAGTCC.AGOCAGAGAAGGGCCGGTGG
CCCAGGATGTCCCCCAGA ACCCTAGGCGTCCCCT
14 CCICIGGTAGGACT6GGCGACCCACAC.ACCCACCCGT
GGGCCA.ACAACC:C.AGACGAGTACCACCTCCICTICT
64 CT TGCT AC.ACCCAGGCACACACTA
CACACACCCACCCGTCT

AAGCC:AGACAGCAGCCAAITG7CCACMATACCAG GGCTTG TITGTGAC:FTCAC

AAAGGTCAGGGCCCAAGG
14 GCCTFC7TAGGAGCTGTCC:GAGACACCCAGGCACAC ACCGGTCGCCCAGTCCTACGGACCCITCTACOGACT
66 ACT . CG CTAAGCCCAACACTCCACC
ACACCCAGGCACACACT
14 GAGTIGAGCTGCCTACCATGCTCCAAAATTGCiTGCCT CCCGTOTACTGCCCAGCCTGGAAATGCATAGGAGT
CAACACGACCCCAAGGAA CCAAAATTGGTGCCITGCT
67 TGCT. C GCGA G C
14 CCTCGICITCGGICTCAGCTTCAC.ACGTTAGGGGGCG CGAGTTACGCTCCTTGGAGGGCCTTCTTCCACGGCG

ACACGTTAGGGGGCGC

ACTCGTCCGCCTACTGG TTGTCGGAGGTGGAGAGG

CACAAAGCTGTGTGACCICi1CCC.ATCGAGCTCGCTG AAACTCTAGGCCTOTGGA

GCTCCAGGGAATCMGGG

ATCTGCTGACTGAAACCC

TCMCGCTGACCAGTC
14 TCGTGGIGGAGGCTGCKAGGCAGt. I tJ CC TTGAGCT TCTI
GAGGIGGCGCGGGTAACGCTGCAGGAICTGG

GCAGOTGICCITGAGGAG
14 GCCAGAGCCAACCITAGCTC.AAGTCGAGGGCATGG

TCGGGAGAGGCGCCTT
14 GCACCACCATCGTCCACGAGCCGCTATCiCTATTGGG

AGAGGGCCGGTCTCGA
14 GACCGGACGGCCACGTTGTGAGAGTATGGGGCC.CC
AGGAGAGATAATCGGAG

GCCGAGGCCACCTTGT
14 AAAGCTCrGAAGCGCCTCCTATCCCTGTTGICTGGCCA C.GTCGTCCAGGACCAAGGGGAAGGAGGAGAACCC
GICCTCCTCATCTACCCCA

CCTGTTGTCTGGCCACG
14 GCCCTCCTITGGGGATGATGCATCTAGGICAGACAG CCGAAGGGAGGTGATGGGGAAAAACC.ATCCCCCG
CGGACATGGATAGGAAA TCTAGGTCAGACAGGTAG

14 raCGIGGGAGAAACGCCCGAGAATGGCCGCGAGIT GACIGGGGAATCGTCGGTIAGGGTITGCTCGCACGG

GAGAATGGCCGCGAGTTT
14 AACGCTCTrTTCCTICACCGCTCAGATCCCTCTGGCG

CIGGATGCC.CTCCACGAC

AGCTGCCCCCGTCAAG

AGCTCGTTGGAGAGGACC

GAGAACCACGAGAGGTGC

as C:CCiCAGGCAGAIG ITGGCCGTCiCIGCAGCCCICGA
CGCTGCCIGGTGAATGCGCFCATCCCCCIOCGIGTC CAACCUGIGCGCCGAG GIGCTCCAGCCCTCGA

GCCITCACTGGCCCIGT

GAGGGCCAGATGCAGGAGCTTGATAGAGGGACAT ACGTGACATGGAGAAACT

89 AATCCGC:ATGGAGC:CCCMCCAGGC:CCGCATGAGT TTCAGCTGCTGGACACGCAGCGCAC.CTACTTCACCC
TCCACACTGAGOCC:GG CC.AGGCCCGCATGAGI

GGCACATGMATCCICTMCGG CAGCGGCICTG TGAGGT CAGCCGCTTCITGGGC
14 CCTICAGATATGCCACCCCCCAGGGGTACIGGGGGT GCC.AGGIGGATITTGAGCTCCGCCTCCAGAATCAGC
91 GMT GGCC AGAGACGICIAC Ga:CA
GGGGTACIGGGGCi TGGTT

TAGCCGGGATCGATGATG CAACATGAGACGTGACCG
92 C(36 TCCTGCT

CACCIGGCITCIGACCGG
14 CAGCAGCGTGTTCACAAACTTATAGCCTCAGCCAT Cr ACiAIGGGGATAITIAAAAGGGGCAGGT11AACGAG TATAGCCTCAGCCATCI AC

14 GCCIGTGCTACCGGACACGGAGCAGCTC.AGGGAAIG ACCFACGCCCTTGCC:CCCF
CCGICGCTAAMACACCE

AGCAGCTCAGOGAATGOC

GCATTAGACGCGCGCG

GCCAGGATGCCACCGAGGITGAACAGGCLACICGG 'ITIGI CAA TCCATGGCAGG

14 GGACATIGI ca:cccacAer GTA TGGG 1TACCGCAC: ACCCGGTCCTIGICiACMCITACGTT
GTIGCATCiC:C GTCATCTIGCAGATCCIGG

GTATGGGTTACCGCACGC

CAACTTAGCAGTTCGGCAC

15 CAIGGICGGGCTCGGGAGAGAGTCCGGACAGF T it 00 GIG AAGC.CCGGTGCCIAAACGAGICACTGCTGCTGG ATGC CTCGGTGTCACTGTTGOC
OGAGICACTGCTGCTGG
15 AGCCAGATGTTCAGGAAC:CAAAGCATC:GCTTAAGTA
GGCTGCATTAPCIAAGCCIATGAGCTTTCCICCAGAA AT TAGGCGACTCTGCATC GCATCCiC17AAGTA
MAGI

15 GGGAAC:AGCAGGGGAGGTC:CAGGCTAA TGAAAGGIC
CCAGTITGCCCCATCGTITCCCCiC:AAGGATCCCA IG TGCTCTCAGTTAAC:GAGC
AGGGAATGAAAGCTCACG

IS GCTCTCGCT GTAGICAGACTCGTGAGGGCAGIGATA
CGAGGAIGAAGCAACCCCCGGGTAGAIGGCGAGA CCAGTGACGAAGACCCAT

TGAGGGCAGTGATAGCGA
15 GC:TCCGCCACGTATTCCCCGATCTACGCMTAGCCA

GCAGTTCGCCCAGCTC
is TC.TCTCAGGACCTCAAAGGCGGGCGAACCAAGOCC
CCACCTGCSGTTACTAATGT
OS CATGCACCAGGGGCAGCTGCGTFCCAGCTTCGTCC AGG GC
GCGTTCCAGCTTCGTCC

CCAGTATTIGGCCAGGIC

GCCGCTGGAGTTCCT

CCCOGCCATCITTGCCGGATICTGGTAGAGGCGCIG TGACCATCGAGTACC. ITC.0 AGGAGCGACGAGTATGTG

TCTAC-ATCGAGGACCTCG

CGAAAATACTGCCCCGCG

TTCTACAATGCCAGGGGG

IS GCTCTACGCCTICCTCCGCAGGOTGGTGCCICTGTG
Ci GCCCCCGGCCATACTCCAATaTGAGCCGGCTGGG ASITCGaiGTCCAGGAGG
GCTIGGTGCCICTGIGG
GGACCAACAAMCCCACCTGTGGCATGICTGITCCCC GCCCATC.ATCCOCC.GAGCATCITCCAGCXTCTCTTTC
AGAGTGCCTCTGAGCATG

GGCATGTCTGTTCCOCCT
is GTOTAGI1GAGCATICACCITTIAACTOGALT.C1T1A CATCGCAATCATGAAGTCCTCCACAAGCGTMGAC
TUTTTGAGATCTGAGGAG ACTGGACLaTTAAAACAT

13 TITCC GCC , CCATGGCGCGGTGTCA C

AATCCCATGICAGGGITGGGGTTGGTATCATGCACC AGTTC.AGGITCCACATCTG
14 AGC i TCGCT C
GACAGCACCTCCAACAGC

AATGCCTCTATGTTGGCAC

CCTGGTGAGGTGTGCCA A

GCACCGAGCTGATGGGT
15 CACCGACMCGTICTGCGCCAATCTGGTTGTC.GGCCT AAACACCCCOCCCACTGGCCTCAAGGGGGICAGTAC

AATCTGGTTGTCGGCCTCC

TCCAGAAGCCAGGGAGG CAGACGGATGTCAGGTCG

CTCAAGTGTAAGCAGCCCG

AC AA CTGGAGATCCCCGTGACT
GTGCTCTCATCCCTGCAAC

AGGAAAAACATAACAATC AACCAGAAAGATACCCAG

15 CGGCCGITTCAAATGCTGTGGACACGCATCCCTGTCT 1 CCT.
GaCCAGCGTTGCCTCCCGATTTGACCTCACAC
22 Ta C I GAGA TGTAGGCGGGGAGATGC
CACGCATCCCTGTCTICTC
t IS 1 'ITGGTATAGGGCAAGGTIGGGCGTGGTGCTACGCG
23 1 GCCCITCCACCGCCXIGTGGCTAG CtCaXiCAAA 1 AOTTG , AGCCCCICTaCIGT CC GaTAGCCI:C.C6CAAA
15 CGGGGTTTACAGTGGCCTC.ATTGAAMGGGGGTGGC 1 CCTTGGGGTAGCCGACGGTGGGTCACCTGACTACT
24 GGT ' GAGA ACGGTATGCAGGOGCTG
GAAAAGGGGGTGOCGGT

ATGCTCTCCITCCTCTGAG
TCC TGGCC C
GGTGTCTTGGGCATCATCC

GCCCCAAGCCITCGCA

GAC.AAGGCAGTAGAGGAC
27 . GGACAC CAG TCGTCTGCACCCCAGTG AC

CAGGCACCGTCTGTATACG
26 CGT CT A1'AGCGGCICACAATGC3TG T

TAATGATGAGGAGCACCG

GC:CACCACAGCACAAGTG

GCAGCACACGCCCCCATTCCITCCCCGGGAATGTGT GGCT TGGCG ITTACCGGACACA
CCITCCCCGGGAATGTG T
is AAGGCTGGCAAAGATCCCC-AGTCTCCIAGGAATGCC CTAGATITAGCGATCCCCCGGTACGAGCGGGATAG
31 CiAGC CAGG CTCAAGGCGAGGCTCCT
"ICTCCTAGGAATGCCGAGC

GAGCTIGCGGCTGAGCTCaCCGIGAAGGCAGGG I

GTTGCGCTCGACXIAACT
IS GCCGAGAAGACCTICTCCICCVITTCAGCACCCCCAC CTCCAGGGAGA I
GGGGGCCATTGCCGATUCCIGG
33 . A C CATCAGGGTC7111M.GTC
CGTC.A GCACCCCCACA .
15 GCCATCAAGTGCACGTGCAACGGGGAAAGAGGCCG CAGCTGLIGCTIGTCGAGGATOAAGAAGaGCTGG
34 TTGGC alcc CGGCGATGGAGAGGCA
GGGAAAGAGGGCGITG GC
is GCCGGGACAAGCTC:AGCC TCCCCAGTAGGTGCCIGG ACACGGCGTACATG TCGATGGG I
CTIGCGGGCCAA
Cr CAM CACATCCTCCTTCITGGCC
CCCAGTAGGTGCCTGGCT
15 AGGAGGCCGGGCGCTATACiGACCCCCTCIATGICIT
CtCCIGTCCiACCAGAGCiACGACCIGCCACCAACGAG A TCiai AACT CGTACIACCI

ACC.CCCTCTATCiTCTTCGC
15 GGGACGTGGICTACTGGGAGCGAAGGCCCGICTGTA GCAGGC-ACAGGGTCTCCiCiTGCCAAGCTGCAGTCCT
GAAGOCCCGICTGTAGAG
37 GAGG G TCCC.AGGCIACCCGAGA G
15 CC3aGIGIGGGCGAGC7 GATCi TCCCCICIGCCOTGG
CC.AGCAGCiGCACAGAGGTGACGCTITCGGGGCGTA GCCAGGTGG ITACAGGAG
3$ AA AG A
GTCCCCTCTGCXXITGGAA
IS AGCTACiAGAAGGAGCCCCATGGGCCGCATCCACGTC TICCTICAGCGTCIATGCCCAGACTGGCAGCCGAAC
39 CT CAGAA MACTGGACC.ACTTCGGC
GCCGCATCCACGTC.Cr 15 TCIGGCTGCGTTACACCGATAIGCAAAGATCT GCGT ai TAGAGGACGGAA ITGG
GCAAAGATCTGCGTGGAC
GGACA CGGGCCTCTGGACCAGTCACACCGGCCAGCCTCAACT TG A

TAGCCCiCACGGACCCTGCACACIACTAGCOA

ACCTGTGGCCCGCGTA
15 TCAAGGCCTC:CCTGAGGAAGGACGATGTCGGCTGTC TGAGCCGGC.AGAGGACCAGCGGTGTTC.ACCCGGGA
42 CTGC . TG TACGTGCGTGTCTTTGCC
CGATGTCGGCTGTCCTGC

43 CCICCCCTACGGTTACCCC.ACCGGGAGGCCGTGC71T TGC GAGGTCGTTGGCGGCA
C.GGGAGGCCGTGCITT

CAACAATAAITTCC.TGTGC GAGCGGTCTGTATAAAAAC

CTIAGTGAAGAGTOTTGTCACCACTCTATAAGGGC TTGGAGAAACTGTTGCAG AGACAAATATGATTGGAA
ATTGGAAATAGACC TATGCTCC A ATAGACC
15 CGTITGTAC:TGGAGAACCAAC.TAACGTTACIGCAAG
CCCGCCIATTGCAAGCTGTATAAACAATCGTCTTTTG CCCTGGCATTGTTTAATGA GTTAGTGCAAGATTCCAAT

15 TGAAAGAAGAAAATGGC.AACTGGTGACTGTACTGAA 1TCCTTIMG1TCGGIGATTATCCAGCTGACTCATGA
CAAAAGCTGTTITGTAACA ACTGTACTGAATTCCAAAT

is GGTCTATITGTGTGGCGAGITTTCAGTITATGAAAT A

48 ACTAATGCC.ACC AACTG AIGAGG GCCACC

lICKACTTCICTIAGTF CTCAAAGGGGAGGMCTGA TATTITGAGGAIGCAAA00 is CCAAAACGG7CGTGTAGGICTTAACAAITGCACC.GA TGCAAGCAGGCCCATAGATGGGGGTAATTICTGAT

15 AGGTTGGGG i it 1 t CAATTTCAAATTGGAGAAGAA AT
AC.ACCTTTAGACAGAGTTC.AGAGAGCTGAATCiTGA TAGG11TTGCAGMAATA
TGGAGAAGAAATTCCITTA
51 TCC1TTAGATCC CG CCCATA , TTGTG GATCC
IS TGCCTCCATGAAAGTAATATCCAGTATCTTCCACCTCA
CAGAACGGTTGCTTATGGTTGGITTC.ATCGTCTACA GCAGACCACTGGAN \ ACT

ATCTTCCACCTCAAAAGCC
15 GCATITCCAMTCCAAATTCACACAGAGA1TGICCTC 1GAGTC.117TC.AACAAGACAGAGCTGGCCAAAGAG
TTAC-AAAAC.CATGTAAAG GAGATTGTCCTCCTATTGA

15 TGCAGGAC.ATITTGTAGCTAATGATGGGAACTTGCCT
GATGTTATTCCAGCAGAAAAGCCTGTC.ATATCTATT GCTCATATTAATGTAATGA

GGGAACTTGCCTITGTACC
15 GGTATGTIGGCMGCMATAC.TCCACGTATATTITA CTGAAGCCATTTGCACTTATC17CTAAACGGT1GGT
CTCTACAAATAAATCTGCA CACGTATATITTATTITTCA

AGCAGGGGAAAATAAAA CTACTGGTCCTGTTATGTG

IS GICGATGAAATC7CCTGGATMCATCTGOTCCAGTG GCfGAGGGAGCAATTGAGCTTTCTTTGGGAAATATT

CIGCTFCCAG TGAGA ICA
15 .ACTICTITCCUTTCTIGTTCACATATTTGCTATGGCTG

ATTIGCTATGGCTGACGG
is AGCATACCITGGTGCTATTAGATTITTGCTAAAACCC TTCGCACTGAGTAGAGGCTTTGTTACACTCATGCAT
GAGGATGAACTATTACTG

TTGCTAAAACCCGGAGAC
15 AACCGGCAATGGaCC.AAATGITAC.AGGACTAAGGA 1 TTGAAGGGGGATGGACTGGACCCTGTTCATTCTGAT GAGIGCCAAATTGAGGAT GITACAGGACTAAGGMC
60 ACA ITC 1 GA TGA G Arf C
i CTAICAGATTCTGGCGATCTACKAGMACTGATIG GCATGGAAAGTGTAAGAA 'FGATTATCCCAAATATICA
62 I ATICAGAAGAG 1 CCCCCAG , ATGG GAAGACi AGGTGGTTATTTGATCGTTGGGAGCGGTCATATGA
62 GAG ' AAATGTGTA CAACCGCCAATITCACTG
CAAGGTTCGTMGCAGAC
is ACACCAAATCCTCCTCTGICCTGAGCTACITAATTITG

GGAAATATTTGTATTAGG TAAGTTATTAGACTTAGTG
64 TTAGTGGAGAAG AGC.I. 7 TG PCGC GAGAAG

CGTGCTTGCTACAAATTTA
65 . AATTIAGG GAICIAAIT CGGTGCAAGACTICGT11 CiG

AGMAGATGGTGAITTG ATTIGACGAAGGGACAGA
66 AGAC Gar GAM. C
IS CTGITGTTGGIGGATTGGCCAAATGAAGCTGAAGAT CTCAGGGAACTAAGGGCITGGACAGCTITCACATTG
ACAGCGGACTAGAATGTT AAATGAAGCTGAAGATCTT

GGTAATTGCTGIGTATGCT TGAATGATGACTTAATTGA

IS AGICAAATTGIGTAGC-ATITGCAGTCCTGAATGGATA
TTGATGAAGCTACCATTGCATATCAGTAAAAAAGCC AGTATGA ACCCTAATGTAT
ICCIGAATGGATACITACA
69 CITAC:AC.A CTGGCA 37 A ACGC CA

GACCAAAAGIACAAA TAT CAGCAGACi rf AAATCCITI
70 ATC.CTTIGC CATCCTC TTGCA GC
IS CCCCAATC:GCATAACTCCATTIAGACI7 TMGACAAG
TACCAACCTGTACCTOCCITTGGCAATAGIAAAACCA ACiACT TT AAGACAAGAGC
73 , AGCAAAT TGCCAAT GICAGATIGACCACTGGC MAT
.
15 ACCCCGACCTGCiAAAGGGTGTCTTCGCC:ACCAACIG CGCCAACAACATCGCGACTCGGG/CfGATGICOGG
ICATTGCCCCTCIFIACIAG

TCTTCGCCACCAACTGGA
is CiCAGGGGGTAGTGGAGICTGCAAAGGGGAGGGGA CACAGGCGAAGGGGAAGGGGAGGAGTCiGGAGCG
CAAGGGTCGAACAGAAAC

AAAGGGGAGGGGAGCG
15 GCACTTAAGCA.CATTCiGCGTCCACGA I TACTGGCTGA AG I
ACIACGTCiGICAIGGGTAGAAICA I TCGCGCTC

C.ACGATTACTGGCTGAGG

CGTACTAAACGTGCCTCTGTTACCiTGGACATGTACC CCTAC.AGTAGATTGGICTT ATGATC.TGTAAGC1.
ITAC.T

15 CCIGGCCCTICCCCTAAIGGAA TAAGTACAGGCCG 1 G AG 1TCCiCCC
IGGGGTGA TACCGGGICAMTGGTCT ATAAGI ACAGGCCGTG GC
76 GCA GAC.IGT TGTTTTTCGGGGGCTTGG A
IS ACCTICTGTIACAACIGGAG TATCTAGAA ItAATF GC GGCTCGIC I
GMAT T ITAGAGGATTCiAGITCGTGCA ACiAAICAA ITGCTGAGA TA
77 TGAGATACAC ACAC GTTGACC.TTITACCTGEFTG CAC
15 GGCGTTG TACATCFAAAAATIGCCTIGAGCi TCACACA ACAGAAACACCTCAGGGATATGTTACGI
GTGCCiGA 11CAGTITGATAAT CCAGC TGAGGTCACACA.AA TAT TT

IS ACTiTCAACTCiGGCCTTGTACTGAAAIGCAAFTAITG ACCTGATGf.all GGAGGTGGGAA
FTAAAATCCACA ACCArf AACACAGAAGAA GAAAIGCAATi ATTGGGG

15 GCAAACGGTC.ACTMTTGCATGATACAGTGGCGAGA
GIGGGTC.ATCCATATTATGATGTC.CAGACACTITAG CC.AGCAAGTGGTAAGGTA CAGTGGC.G

80 GTTCAA AG . GGACTTCAAT 7 G
15 ACCTGATGTTCCAACCCCTMAGTTIGGGCC:IGTAGA GGITAGAGACACAGAMACTCTAGCGATGT51717 TTATAATCCTGATAAGGA

GTTTGGGCCTGTAGAGGT

GTGATCAACAACMCGAACAATAGMTACTAATGA AGAGAAC.AATGTTATGCC TTATGrfAGAGGAGG 17CA
82 AGGTTC.AGTT CCCACTTACTGTA AGG 617 AMATGCGGACGTAAGAG AGATCACTGAAAGCCAGA

GaTTOCAAGACAATCAA CITTCTTCTAGCATOGACC.
84 CC.A GTCTC GG A

CTCAAGGGCGTTATATTGG

is GMAATGCCACTGTATCGTITCAAGCAAGAAGGITG AGTCTGCTCCTAAGCCTGAAAAGAMCITCTIACCC
86 ISGAIG TTGGG CATCC.GTTCTTCACAGGA
GCAAGAAGCT 1G IGGA it 15 ACACGITAACAGTACTCGATICCIGGCACiGAGAAATC CITCCCTGAAGGTIGAGACAATTTGAATGCTCAAAT

87 CCATC C.AAAGGCG GCG
GGCAGGAGAAATCCCATC
15 CGTI1TGAITTAGC.C1CA1ICTTCAIGGGGGT TAAAG
AAAICCATCTACGTGCATITCCTTACTCTGAAAIGGA ATCAGAGGATAI AGO-IC
RS GGAGA TTCTTGATG CAAA
ATGGGGGITAAAGGGAGA
15 GTGAAATGGaiAACTGTTIGITCGCATGGIATCAACA AACGGTTITAATAAAAAGCTGTGCTAGACATGAAGT
GCATGGTATCAACAGTGG
89 GTGGG ATGCAGAGAC , CCCCTTGAGGCATTTGTG G

GTACAGCTCACAGTTGGCGTAAGC.GATACGAC:TATC AACATACTGCAGCTIGTG
90 CTGGTG i TAGC G
GTGTAGCTGCTTCTGGTG
15 ACTCGGAGACAAGTGATATAATGGACAGTTAGTCTG C.AAGGCAGCGTGGAATTATACAAC.CATATTCAAAGC
ATTCGTGGAACITTCAGTC CAGTTAGTCIGGAOGGAT

is CCATCTITACATGAGAAGGCATTCCCTACCTTCCITTA GATCITTATGCTCTGGGGGTTGL G
iii i CTGGATCA ATAGGGATGMTGATCG CTACCITCCITTAGTAAAG

CTGTGGCTCCNOTACAG

is CGCACCTTGACAAATCCTITAGAAAITCTCAAAAGGG GCTGTIGCCGAGGGAAGAAACGATTCTITTCTCGIT
TTCTCAAAAGGGATTGACG

CTCTGGAAGCAAAGGTTT TGAAGCTAACAGAACATTG

is AGACAAAGAGTATGCAGCGGTCATCGCAATCGTAAG TCAAACAGCATGATGICCGATAGCAGGCGTTATOG
CATCGCAATCGTAAGTATC

ATACCCTGTCAGAAATACACGAGCACAAAACAATC.A GAGCAAGAATTTGTAAGA

CCTCTITGTGAACAGTGTC
is ATCACCCTCTACAAAAAGCTCCTATACAGGGTGTIAT 1 GCAGACITTCTGGGGGAGGAAAGAGAGCTCAGATC TAGACTAGITC. TGCGCCTT
TACAGGGIGTTATGTITTG

t 99 CIGCLTIC 1 GGC:AAC , TCTCGAAGCCGTAGAA.GC IC
is TTTGCAGGCCTGTAGTCCCTGAAGAAGATAAAACTAA 1 TGCTTGCGAAAGGTGTGTACCCATTATTCCACCCCC AGTICATTCGTTTTGTATC GAAGAAGATAAAACTAAG
00 GAATCAGG ' ATC GT AATCAGG

AAAGGAGAACCTCGAGCCITGTTGGTAAAGGATGT Ca. i la r z I i CTTCTCGGAA
GGCTCTTTTGTTGAGTTIG

CTATTCAACTAAATGGAG CAATGTTCTGTTATCTCCA

TATCGATGTTTCTCGCTTAACAGAACTCTAGATCCTT
03 . GGC AAATAGAGCAM CTAGAMCJW.JATITCIG AT
GCGGITITAAAGATGGC

CCGCTTTCCTGTGTGTAATCATCTCTGCTATCGAAAA GGTATCGTAACGAATAGC TCTGATTGAGGGACTTCTA

GTITCICGGAAATCACAAA
OS 'TCACAAACT GITTCTCI CCAACACT TCCGCGAT TT CT

TGGCAGATICTITTICGTG C.IGAATAGGAATTCCTCGA
06 CTCGACC GTGGA.T AA CC

APAGCCTITTAGITATTCC
07 'MAGI GGC111 AA
CTTCGCCAAGCTITGAGT
16 CGGAG7 AGAGAAAT CAGC:AGACTGCTICGCA 1 TCCIA
GAAGCTGAMCGCCGA I C17 ACITCICC:CAGACT G cnCGCAlICTIATIACCAA
08 TTACCAATG rraT ATGACAGAGACACGACGA ICI
16 CATGCACC.AATAGTfACCiGTGT1T1ACAGTfTGF GGA
CGTAICC.TTGCCGGTAITCACTCGAT A FTACGGAGA AGCAATAGTCCTITCCA IC
09 . CitIGTT CAAAGA 6 TTACAGITTGTGGAGGGTT .
16 Gal GIGIGTCITITGICITMCATAAACAATAT GO-CA.ATAC
CAATACGGG TAGGGAT GA GGG

TAG TGCA.GAAGCTAAGIT TAT

CMCGGNsCAACAACTGC
16 CA 1 CCTGT TCAGC IT TACGAACiCiAAAGCCIGTACAGA TCTAGGGAT

16 GGAAGCAGATAATITAGAGGAAGCTCITGCCaTIAT CAAGTAGCGAAAAATCCAACAGCCCACAAACCAAA
CCAACTCCIGTAA TACATG

CTIGCCXITTAIGTGGITT
16 CCTAAGATACiCGATCAAAGGTIGATCCTACTTAACCI
CTCiCTCAOCTCITGCGGTTATAACCGCGAATCCCAT ICCTACITAACC TAT ITTAC

16 GCAGGTITAGATGTIGIGGGITTIGGCTAIGGGITTI 11TGTCACGAAT Al AGCCACTGAT
TNIGGT TCTICTG ATITCCTTCATTACCiAC:AG

TGGCTATGGCTTITATGGG
16 AATGTICTAAAGGGGAIGIGGATITAAGCACT Ca CC AGATCCTCTAAAT
TiGAACCTCCGCATACTAAGTTGI GAICTCGCATACCIGAAG

AAGCACTCCICCIATC.ACA

TCC.AGACATAGCAACAATGACAGAAACTTC.AATAAA CCCCAATTTITCATGAAGA
CCAACAACATTATACTCCC
18 CCCCT . CGAAAGCG GC CT

TGATGGTTACTICC.TCAMACAAGAACGGTAAN 'µ GA

CTGTTGAAGTACGTCCCTT

TAAGACGTTGCTCTTATCA AGGCTGATAAATGMTCG

TAATTTCGCCACAAGACCA

TTGTCGCGAATAGTCACA

TCTCCAGCAGCAAAAACTAAAATTCTGCTTCGCiTTT 11TGAMTCTCTACTAAGT

CAATTGCACITCAAACTCA

GCACGAGTTACTGGGGITGAAA AACTGTGTATGAGGTTTA
24 AMC Te3TTC ACACT
GTITGGCAAGC:AAACAAC
16 TGLTITCTIGCOCAAGATCTCACCICTIACAAACCGCA CAAGTAG1CAT1GCTATGaCGAA1AGAG11GTACA
ATCACAAACCATAACAAT
25 TIT AC3CGATTIG At-AA
CCACTTACAAACCGCATTT
16 1TCACAGA TCCTCC.GTAAAGAAIGATTGCTICTCTCTT

ATTGCITCICTCITTICCAA

16 AGGTTATGGAATCGCGGTTATCAGGAGCCTGTGCAG Gres- i 1 u. I Go 1 I
AGGGAAAMACGTTITATTTITAT
27 AATGTG AGCGGACTTCT , TAGCTTCAACACGOCTCT
GAGCCTGTGCAGAATGTG
16 TGGAATTTCTCGAGGCGCTCTAAAGTAAC.AGGACTAT
CATTCTGTGGCTTCTGCATTAGATA/kGGATGGCCGA TAAAGTAACAGGACTATCC
23 CCCCT TTCGT CTCCTCTTCCTGCTGTTG C.CT

AGOGCTCTTAAACAATCTC
29 GCAAC.AC AAATAGTCGA A
ACCGGTACTTTGCAACAC

CAAGCATCACAAGATTTAT GMACAAGAAGTTCTTITT

16 TGGTCAAAGACGCAAGCTGTGACCTTATCTITGGGAT AATGCCGCTCGAGICAATGTAGTTCACGAGC.AGITT
GACCTTATCTTTGGGATAC

AGTTTGAAACTCCAATACT

AGAAAACAGGCAAGCCAT
16 AGCTAGCTTC.ATGGAGTATGTCTACCAACATAAAACT

TCAGGAGACGAGAAATCA ATATCATTCTCTGTAACAA

CGGTCTTAATTCCATTAAT

GTAGGTAAAACCTCGTITGGICAACATCTGTATCAA TTTCCATATTTCGTGATTG
36 CCAA : AAAAAGGCACIC CA
GCAGACACTACATGCCAA
s ICACTACTITITIATGGTMGT GIGGAAAAGCTTFGTTAG
37 1 CCGTF I CfC3TGC: , AGG 'FM
TIGTGTCCTTCCGTT

GGAACTGACCCTATAAGGAATGC.GTAACGAC.ATCA
311 CCA ' TTGCATGA GCTO1TGTG11CAAA3CT
IXTAGCAAAAATCTCCCCA
16 GCTCAGTATTTAGCAGGGCAGTCTTCCCAAAAGAC.TT GACAACCATTTTTGCTTTGCTGTGGGATTTGGCGAA
TCTICCCAAAAGACTTCAG

AACTGAC.AAAAGCTGCCG

41 . GGAIT AAGCATAC GCCCC3GAATIGGNITTGG
CCGTAGICIGTFCGGGATT
16 TCAGAAAGGGCTGAAGGCAGAAACGGTGC.AACTCG ATCTGIGCAGCTTAAAACCIAATGTCTTTAGAGAAA
CITTGGATAGAGAAGCTG
42 ATC GTC:ATAAATACGGA Cl' AAACGGTGCAACTCGA TC

GGCTICTGTCACCITGCTCAGCTITAGTTGIGGCTIC CAA.AAATAGGATT AGAAG
43 AGAAGAACITCC AAC CfCAAGCGTACGTCCITC AACTTCC

ACAAGAAGCGGTGTGAA GCTGGATTCACAATAGTAA

TTCTACATTATGTGCALCGATC if 3II i GG CAGAACACATCACTITAGA

16 TGGCTCAGICGGTICITCAGTAATCCACIACTCTICCfC
GAAGCTGGITCTACAACAGAAACIACAGITICTCCA TAATCCAGACI CTICCTC:AT

ICCAAAA.P.MCCICATCTIGGAGCATC17 TTAAATCCCATCCCATICA AAGGC'AGAAGCFAICTIGI
47 . G71 GTGGA GG T
=
16 CITGAAtTICGAAAGAGGATGTFGCCITGCITCAGIT GCCF GACCOCAGAAACCATTICTICTI
CATF AAAATC GACICTGAAGGTAA TATC

GGTCGAGGACAA I A GAATTTAATCCCICITCGA CCTITAATTACAGCTGGGA

16 ACACIGATTCTTGCTITGOCAGAATITCAACATIGCCIA TTAGGICCIGTAGC:ITI

AAICACAATGCGCTCCCAITTAGTGC:ATGAGAATCG AAAGGAGTCGGACCATGA

TGACTGCGCTATGAGGAT
16 G TAGGGGCTCAAGGACGAACTCAAATCAAI cavrnt AGAAGCAAGGATGAGGITTICT
GTGIIGGAGCACA CCAGCTTTCIGTAAAAAG TCAAA ICAATGATITCCAC
53 CACTI-CA CATC.ACG ACAA ITCA
16 GATTCHAGAGEGGGATCAGCAAGTIGGACAATAAAA GeICKTAAAACCCGACAAAGAGAACGTC:CCTANIAT
AGAAAATFGCGGAGGAAC

TTGGACMTAAAATGCGCG
16 CCTACCTGGITGAITAIGGAGAAAITACCGAA 7 CTCf AAAAGCAICIAGGAGC7CCCCAAAGGACIGGA1'AT -1ACCGAA TCICIAATAATG
SS AATAATGGATC.A GAAGTGG GCATCTTCAGGAACA:GTA GATCA

ATCTACTAAGCCCTATATT
56 AAAGCTCCTITITCGTGCCCGTGCTGCTATCCCCGATA . GGTG AAAGTC
GTGCTGCTATCCCCGATA
16 TCC.CATTCTCTICACCIAGGTTTATCCACCATCCC.TGCT
ACCACTCTCITAAGrAAATACTGACAGGTTACTTCCG CAAAACX.GATACAGAACA

ATCCACCATCCCTGCTTC
16 TGITC.AATCITTMTITGGAGCAGAAATTCAC.TGCTITA
CGAAGCGGTGAATGC.ATGAAGGATGAGATGTTGAA AAATTCAC.TGCTTTAGCCT

AAAACGACGAGGACCCTA

TCTGCCATCAAATX.ATTTCGAATGGGAATGACGAGT T/kAMAGTGAGCAACGAA GTOCTGCTTGAGANITCTT

TTCCGACCGAGCAATCTCAGTITGGGAGTAAAAGTT TAMTCCATAAGTAGTTFC GACGCTGTAGCATAATCAA

CAACACFTITTTAAGGCATCTTCCAGTAGTGTTGTTG AGAACGCAAAG Al-TUC-AC

16 AGAAGAAAACTCf:AATIAAAAC.GIGGTGTGOCCAAT GCAAAGCGTTCATATCTCGGACTAAGAAAAGGTCCr GIGTGCCCAATCATITCAG

TTCTCTL. G1TGATA ATATGCTGATCAAATITTG
65 TGGCT GCCT , CGA GCT

AAACCGGC6TA11TCTTCCTGAAAACAATTCCCCIGT GCTTACATTCAAGATGAG AGAATCC.C.CTTAG
AAGITA

16 TCCCCAGGCAATTTCTITGCAAGGATTAACCAGCAAA 1 AGACCCTC.

67 TACAC ' ATAGAGGATT CA AC
16 GGGAGGTAAATCAAGACTCCGATTTCCTGC.TrICATC TCTTTGICATCGATTTGGTCAGGGAGGGGAAAGCAT
CCTICTGACTITATGATCA

ATCATGGG
16 ATCGTAGAGTCTIGTATCACCTGAAANCITGGGATAG C.GTCCACTA&AAFATETTGAT'A-CGCCAATCAAAAT

AAAATTGGGATAGCGCTCT
16 ITACGCTAAAAACGGGCCTATMGATTAACCAAAAG TAGCTCTGCAGATTTTGTAGCACC.CTCCGGGATAGT

TAGCMCGATGCTCAACC

TAGAAGCTATGGCCAAAA ATCGCAGAAGAAAAATTCC

ACCTATGAAGAAGCTCGTC

CTGAAGGAATCGCTC.AAGGACGTGTTCCAAITTGGT AGACTITAGGAGATCGTGT

TGGAGGGGCCATTCCGTAATTGTIGITCGAGATCCT 1TA TCCITTATCTAACAG It 75 c:Acreracr 1 USG , ATCGATGGAGGIGIAGGC 'Farr AACCTGGGAGAC.AAGATGCACCATTGGAGATAAAA Gil ft I I fACTATTAGGAA
76 GGAATTAACG ' GCGC TAACGCACGGGAAAAAGT TTAACG

MCGAAACCATACCATTA TCAATGACCTTCCTATGTCT

TCTTIATCAGGGAGAGGG
78 TTCCC.TCC TGCGG T C CA
ICGACAAGATICCCTCC

GAAACGCAAACATGGGGT
79 . 1 GCC C711 A
AAAGCAGTGAAAACTIGCC

GAGCTTGTCCCAATTGCTCCGTCAAAAAGAAACGGA TTTGTAATTGCTGG 1lilt CITTCAATCTTCGGGACAA

TGAGCGATGCAGITICTACAGCAGAAGATACCCA ATGGCACGTCTTAAAAGA

'TCGTGCAAGTATGITCAAC

TGTTCCCCGTACAAGCAAG7TTCTGCCC.CATATCTGC ACGTTTAGAAATFCTAGAA
82 CTAGAACiAl GC A ACAAGNICAGIGGCAAGC GATGC

CTAAAAATTCGTCIAACTG

16 TGC:GAGTGlIATATGAGCAOGAACRCIACAATAGCG
CSCCAGGCAAAGCAAAAAGCAAGCTC:ACTGITGCT AA TACTICCACTAIATCCC AACTICTACAATAGCG
MC

16 IGTC:CATITATTISGTCTTGGAGI OCT TGAACGCAAC
GGTAACIGTTGTAITCGCACCMGC:GATATATCAATG CATA TCCTIGITGATGITC
8S , GAGAA AGGTAGGAC TTGA
TCTIGAACGCAACGAGAA .

16 'ICCAAGAATIAGACAMAGATCGCACCAMG1TCGC TlITCCIG !GT

CCATTTGTTCGCGAGTCA

EGGGACCCTCAAACTCAA
88 TTTTGC. CAAGAACT ACGCSGATCACCITITCTCA
TTGTCTCGCACGTTTTGC
16 CGATCITAGTAMTGCTCTTGCCTTACATTTCTCGC.AT
TTCGAGCCCTITTGAAGAGTGATCGC.AATCTAATGA ACATTICTC.GC.ATATGAAC

90 TACGTTT MATTI- rrc TT
16 IACC:GICG1 ATG TCGGCTGACCAT MCACCA1T TCAC
AGGGAAA1CAGAAAATCCCGCP.AAACC1TCCTCI TA GGGACAATAATGAAGGCC CCATMCACCAMCACAG

AGCGTACATCC:ATGA TTIATIGIGTIATCGCGGCTA TAT CACAGATACATCAAGA
92 AAGAGCT TCGTCTT C.GGAC.G1TATGGACTAGT GCT
16 GlIGTICTMGCSGTACAACAATCTCCAAAGCGACGA CGCGC1CCGITAAGATGA
TRAAGGCCCIATIGAC:A

CCAAAGCGACGAAGAACT

CAGGAGCMTAAAATTAC CAAGAGATGCAACCAGAA
94 AAG . CGGAAAG AGG G

TCGGACAATAAACTCGTAA

16 ACCTGCGGAAGAAGCTGTTGAGCTICATCTTGAGCTT CAGGACGC.ACCGGAGATAAAATTAAAGATGCTGTT
AAAACCCCCTCTITAGAAG

AAACTTTGGAAGATGC:TG GTCATTTTCIGTTCATTTCG

GAAAGGGATTIGTICTCiT

CGCTATGGTCAACAGCAT
16 GCCC1TTCCCAAC.TCCTTGAG11TAGTGCACCG1TACG
CTTAATCTTTCTGAATGGGGCGTTCCAGTAGGAGCA
99 A TAC.AGC CTITGCCGGAAATTGCTC
MAGTGCACCGTTACGA

TAATGGGAGTGGAAATGG TCGTCFATTAATCCITCIAA

01 AC.ATGIC ATCGATTC CTITTGCGTATGGCTACC C
17 CsIGGGAGCGTICGAGACATGATTAGTTGCTAIGTCT
CAACGAAATACGCTTGATACCCATTGGAAGTTGCAA TAGTGACAATAGTAGGAG ATTAGTIGCTA
TGICTACCT

17 GGACAATTCCGCATAATITCGGAAGCTC. rGTTTGGIT
AATGTCGAGGCTITAAACGTTTICATATCGC.AAACG CGTGATATAGCTATMCG
03 cm CCTTGT , GA
AGCTCTGTTTGGTTCCTT
17 CAATGCAa:CCTTC1TTCTGATCCAAGAGGTTCC1TCT 1 CGTCTGC.GTAAGATGCATCTTAACGTCTITC.CTAATC GTGGAGAAGTAGACTTTG
04 CAA i CCA AGTT
CCAAGAGGITCCITCTCAA
17 CCACCITC.AATC.ACAGCACTTAAGGAATTCGATACTT
GCCITAATGGTGTGATGC.CTGCTGCTTGCGITTC.TAAA CTTTTACTGCTAAGATTGG
AAGGAAITCGATACITGGA
OS GGATTC CTCT ATC.G TIC
17 CGGGATGATTGAAGTACAGTCCATGAL t 1 i ii i CAAG
TCCGTAACTITTGATTC.AGAATTGACCaCCCATGAA ATGAL CAAGCTAA

i is- I t CGCGA TGTTGTTTCGGAGAGAAG TCTTAGATAGTC.GCTTATT

AACTTGCTICTCAGTGAG

GTTGACAATGGCTTGAGC
17 CGGCTTrGCGCATAAAAAACATTTTCAGGTTGAGTAT TTGCCTTAGGAGAAAAGGGAAGGACAATCACACAT
CATCTCTCTCGCTATCTGA TTTTCAGGTTGAGTATATG

TAGAAACTATTGATCAAAG
TCAAAGTATCCG TTTA CGAATMCCCTIATTGGT TATCCG

AGAGATACAAAGTGTCTT AGAGTTAGTGGTCATAC.AA

GAGATC.C.ITMGCAAGA CAAAAAAATTATCAGGACG

s CIMACAAATCAGCCGATCCCIATCGATCAGATITAT ATAACGATTACTf GCTTGI
CCATCACACACAGCTITTA
13 'STAG 1 GCAGAT , GC C

TGCTCGCTGCAAATTCCGAAAGTITTGCGTTAGTAG GACTAC.AGTCTCTCCTGG
ACTGGAATATACACTTCTT
14 TTCTTCCG ' AAGCT 6 MG

ITCGAAGA TGCTTAGT GGGT AGA

AGAAATCCCMCCAGAAA
17 CTGTTCGAAACOCATGGCAGCMATGGATCGMAG CAAGAGCACAATGGTATCCGAGCTAGCATTGC.ATCA
TTGGCTTGGATTACGAAT
17 . ACC GGACTA GT
CCCATGGATCGTTIAGAGC
17 TCCCAACCAACTGC1TAAACATCTAGCT1TGATC.AATT
TGGAAGAGGTTCTITT ATGAATCGTTAGCACTGCAT CIAGCTTTGATCAATICCG

GTTGCTGTTTGCTAAGAAACTA
19 AGGA TC TM MG CAACIaTTICCGCAGCTI
'ITIGGCAGICATT GAGGA

TCET APS AGTFCGTTGCTATCCACAT TT

17 ACTC I i I i tAGACTCTGTGCGTGICAGCAGAGCAATA I

21 CafTGG 1 CGAA C
AGC:AGAGCAATACC1 MG
17 GCCGCAATCGAAAA1CTC1CCTAITAI1CCCCCTTAGG I CrAAGGGAACAGAAGGCITGA I
AATCITC:AAGCAA

TATTCCOCCTTAGGITTGG

TGCFAAAACiG1CGC6CTA6AACICATGA 1 Gra:617G A1GC11TTITGG-TTATGCA.
ACCITSTCTICTITCACi ITC
7.3 . AGTTCTG 1 GAG 16 113 =

GTOGAAATICGGGGGAGTCTMCTAAGGACAAAA AAAATITCCAGGGGATAG

TATTCATGCTTCGATCCCA

ICC( A

ITTCTCGCAAGGGAGTTGT
17 TCCTGGITITCCTCTCITAATAGCICP.MACA TTCIG
GMGTITTIGGCCTTATC
26 GCTCGCAATAGCAGTITTCCTTETTGGGCTICTCTGCG AAATCCAAAT rrre TrITTGGGCTTCTCTOCG

GTGAAAGCACAGGTTCAT

17 GTCACGAA I ATITTCCTGAGAAGGAA i GACAA I GCTC ACIAGCTAAGC:AGITAGGCCTIT
I GACTAAACATCC AGATGCCAAAATGACGAT ATGACAATGCTCTTIATCG

MTh TCGCCCACCCTTra:TGGA AGAATTA 17AAGCAAGGAT

TGACGGATGTTGITAAAGGITCTOCACANIGACATC AGGCTITTIGGGAGITAC
TIGCAAC.CGATAGATTCIG
TCTGA TTCTTGGC C A
17 MCI CCCGCAGT:CATA ICACCCGGAAGIGGCTACTAC:
TTCTCAGAAGAGGATGTGCATGAGAAA I CGGATCA CITITIATTGCIATAGCIA

CCGGAAGTGGCTACTACC
17 GTGTCTCGAAAAGCTGCTTCTAAATGTITaITCC1TTC
TCGCAGAATTGATGATCATATGCC:TGCGTTATCGAG ACAATTGTTAAGCGCAAT AATGTITGITCCETTCGATC
32 GATCG . CGTATG GG G

ATATGCTGAGGATITAGAACAAGCAGTATCCCCTTC CCAATATTGAAAAGGAGC

TTATGCCTTTGTTGCTGC

CAATCTIGG1TTTGTTGAT TC.AAAGAGC.ATC-ATACGAA

TGTCGTTCGGAAAATCTCC
17 ACAAGCTTGCTGTTAAACTCCTCCTTATGGCTC.AGCTT
CTCCTTCTITGACAAAGTCAGAGATCTGICATATTTC
36 CA TGAATTGTCT AAGACTAATCTTCC.AGCGT
CCTTATGGCTCAGCTTCA
17 GCGCATTGAAACTATGC.AGATCAATATTIGTTTTCCTC GGAGCATATITAGAAACAGTTCCMCATATCCTTA
ACGCTCGATAATTTGACG

ATITGTTITCCTCCGGGA

TTAGCACGTATTCCCCAGCTCCTAAAC CAGCAGCAGAATATCTAG TTGGAGATTGAAAGACAC

AGAGATAGGGAAAAGAG GTGTCAGICTITTTTCTGG

GATCATTGACTGTGTTAGC GTGCTrTGG1TAAAATTGC
41 TrGCTAC AA11TTCC , Tr TAC

GATCCCAGCCATCAATGTGATAGCTTTACTGTATCG
42 CATA i GTGTC.AAA ATTCCCGTAGTTAGTCCGG
ACCACAGGATACGGCATA

TATMTATCGCGAATAGA AGCTTAGTCTATTTGTAAA

TATGCCGC.ATATTTTAACGCeirACAACAGAATCCCT GCTTTATTGGGACTGGGT

TAGTTTGGTTGGTGAGTGA
17 GCCCTGATAGATGGAGCTATGGTG/WFTGGCGTTAT TCCACAGrrACAGAAATCGATGACiTAGACCGATGA
GTACTACGAGAGAAGTTC

TGAAATGGCGTTATGTTCC

AACCATCAAAAGCTICTAA GATAAGACTATCATGATTA

TTCGTCTACAATTGAGAG GCCCATTAAAGATAATCAT

17 CCAGC.AGCTACATTGCTACTAMGAACTCTGGATCAG GAAAGGGGGAGCTATTTATGCCAAGAATTGTACAG
AACTCTGGATCAGTTTCCT
48 TTFCCFT GGCCAC CAAGC.AGCAGGATCCAAT T

GCGAGTAGCTTCTFIGGT

ATACACAITTGCAGAGGAGAGCCCAATCTCTCCAAC AACAAGCTITATGIGGAT
50 CGT 1 CAGAC CV( ATITGTFCGGAGATGCGT

GCATCGACACCATCACCATCFCCGGACTGATTAATC GATGGAACGACTGTFTCTT
52 AAG 1 GIGAC , TAAAT GCT
TCCGCiACTATCGAAG

52 TTAAACCC ' GTCFCT A CCC

TTGCTCTGAAACACTTCTGTCGITTC.ATTITTITCTCA AATGCAAGAAGATATAATC

CACC-AAACGGTCTTATGA AGATGGAAACTGATGGGG

GGAAGGATCCTAAGAAAA AGGACCCATATACAAGAG
55 . AACiAGAGT ATTCAAATTCiGA CTGG AGT

56 GAGTTG ATGlIGCAC G G

AAGGACCAGAAGTGGAGGAAAGTACACTGAMGA ACTAGAGGAGTACAAATT CAAATGAAAACAT GGATAC
57 'FACTATGG CCTGC GCTT 'I/FIG(3 rrTGICATAAGGGAACTFGAGTCAAGA
SS GGA AGAAGGTICTG TFAGCGGGCAATTCCTC:F

AACAGAC.ACTATCAAGAGTIGGAGAACACATGCAC TCAAGAMGAGTCAGIC
59 CiAIGG 1 ATTCAGACTC GC CAAGTGCTIGICA
TGAIGG
17 CACTAGAAICACiCiAl AACAGGAGCAGAAAAGGGAA = CACA1GIGTMGCAGGGA1 17 AAACCGIT I CT( GAAC I AATGCT1IGGAGTAAAAGGG TGAGA !GAT
TTGGGAICCOAACGCGATATCITGCrr CAGTA.TCGICIAATGGAG GGAGTAAAAGGG TITTCAT
61 . ITTICATTCA TATTGAGAAGT CA TCA
.

TCTACGGAAGGAGTACCTGAGTTATCGT CAGCATCC ATCATTGGGATCrTGCACT TAITGTGGATICTT
GATOG
62 ATCGTC ACAG Tr TC
17 'FIG I GTAGGCTGAAGCFACiGTAGAGGCTGAAAAAA TGGCITCGGAGGAAATITG I ATG
TCAAGACAGAGG GIT1TICICAll ACTCTTGA AGAGGCF GAAAAAATAAA

17 TCCACATCCITGGTAAAGGTAGTCFTCMGTCIGGT TGAAGGAGTFAAC r GGAGTCCACMAGTTIGATAT CAAAAACTGCTGAT AAGT
64 GGATCCT CACiTCCAATF 7TCGT
TCTTTGTCTGCTGGATCCT
17 AAGCTGTTGTTGCAATTTITTGCCTICGTCiAATGCCTT
AAATCCAGCATTGTACACACAAGAGCAAGATTITC.T TTCACAATC.AC.ATTGC.ATG

C1TGGTGAATGCC1. TGGA
17 CC-Mr ATCrAGTCCTCCAACAACITGTGACAACIACr A I CACCTTACACAGGGAAT 1 CTGCiGITI CAGCTGGA AAGAAGTCi TACAATAAAA TG IGACAACT ACT TICAAG

17 CCTGACAGCAACTCCC1CATCATACAT1GGAGAAGC r ACIGGACTATGAAGCTAGAIGCACCi TAAAAGGGCT CATACA TrGGAGAAGCTGA

68 AATCCCAG AlT6CAAC CCTTGGCTGCAAAGGAAG AG
17 AGCT MAGA i CI-MCA illiCAATICiAATGACAGAAT
AGGAGAATTGGGAATAATAAGAGCCGTICKCATAA CCTAAGAT CTi CIACAAAT GAM
GACAGAATTICTCAT

17 CAGATCAATTITIGIGTCAAGAGGGTA1TTTCAAGrr CAATCAGATCC11TTTAC.TGGGGAGCC.AATGAAAT A GGGTAATTGACAATAACA
TATTTTCAAGTTCAACGAA
70 CAACGAAACTG . GCAGCAG TIMM ACTG

AACATGGATAGAGCAGTT ATACAACAAGCTCAAAAG

17 TAGCAACCTCCATGGCCTCCGTAGC.ACTACGGCAAAG TCATCCTAGCTCCAGTGCTGGICTTCTGGTAGGCCT
GAAAACAGAATGGTGCTG TAGCACTACGGC.AAAGGC

GCCATAGAAAAACACGTA

17 TCAATGGGCTGGACJAGGTCGGAAACAGGC.TACTAT
CCTGC.ATTGTGACCAAGACTTGTC.TATTCTrTGAGG TAGGTTTTGTACAGAGAT GGAAAC.AGGCTACTATACC

1TTGACAAAGACACAGACTCAGATMCCCAATTCT AAGAAATAGCGTTGCT. GT GCAAAAATGGCAGATAAA

RAGTATITTAGTCCTAAAG

AAATGTTGGGGATGCACA

CACTGCAACIACAAGTTAATGACATITGTTCCARTGTT AATCTITITTI CAAAGGCT
GGCAATTAGATCTAAGTGA
78 CiTGACC CAATTTGATCTT TTGCi CC
17 GTGCZTCTTGAGCAAATCTGATAITATCATGriTGCLT AGTGAAACTGGAGAATGGGAAGTGTTGGTGGAGC
AMAGACTGAGGGACATT TATCATGCTGCCTATITTAT
79 ATTTTATGGAT TGGTAAC , TGG GGAT
17 1TTGGATC.CCCCIGGAGGCGTCTAAAGGAAGGTACG 1 GACGACCGAAAGCGTCACCGTGTGATCCCCIATCTC GTACCCACTCAACGGAAG TCTAAAGGAAGGTACGGG
80 GGCG i GATCC C CG
17 C.CTCCAGAGGCCGTAGAGGGOCC.ACTIGTGCCAAC
OGGGAGCAGGAGAAGGA

AT IC AATACTCTTAAGTGTTATC

83 GTA GTCAAT C.ACTGGATATGGGC.CATT
GGTGAAGGGACGAGTGTA

CTCCIOATAGACAATTITT ACAAAAACTTGGTAGGCCT

TCCAGAAGITATTCCTACA GATAATACCCCAACAGTAA

17 TCCCTGACACCTTAGGCACTAAGTTGGACACCCTTAT ATCAGITTAGAGCTTTTCGACTG/kACAGACATGTCA

AGGTATTGGACACTGAAA CAATACTGGAACAAAGGA
87 CAAAGGATGA CATCC ATM. TGA

ATGAAMTAAGGCCITAT ITGGATATAGTAGACTCCA

s CC.AGTGACCAGICTAGAA AAATTGCCAGITCIATCTA
89 CIATCTATITCC 1 TCA TrACICC , GI *FTICC

GTTAACAGCMCAGCCAGGGCATTGTGACCTATGG APACTITATAAAAACATCC
90 ACATCCACTCCC ' GCG CCGCTTCACTGATCTCTF
ACTCCC

AACAGACGGAATTTGTCC AAAATCATGTGATGGTGG

ATCAGTTATATATGATTGA

17 TGCAATCTCATAACATAACGTGGATTCCTACTTGGAG GAGTGCTGCTACTATACGTAGTGTAACCATCAC.ACA
TAAGACATTGCAAGAAAT
93 . I CAGGAA AGIT:TWA GAAGG 'FCCIACTTGGACi TCAGGAA
17 AATCCAACCTITAAARTCRCACGAGGGITTTGTTAIG TGGIAACATGATAGATGGIMGCITGCC.ATCAAAT

ATATGACAGITTGOTTTAT OTITTAGGTTGCAAGGCTA
95 AACiGT3 AT CACCICTA ACTGGA T

ACGTAGAAGAATTIGTCITICAGGCGATGAATGTAA GTTAAAAACATFCCICGIT

GCACAACKATTICAGAGTG

TGACAAATATTACCCIGTIGTGGIGGAACICTCCAT TTGTTACAACTTCALITAC
ATTCTGAACCACCTAAAGT
97 I AAAGTTG GCITCIA I ACC 1.6 TTAACTGTCACAAGIGCIGGACAAGACTICAGAAGA AAA TIGTATIGTGCCT TTA TCCTGAAGATCIA
TGACTTI

TCi I ACTTCCACAAACTGCCAATCAC AT AAAAA If TGCAGGA ICI
99 . AGGATCTTIGG AT.AAGGTTA TTIGGCAGTGATGCAGAA TTGG
.

AAACTGIGCTTAAIGITGITGGACCGCTCTAACAAT f3ACAGCAG ITTGITAAAG CCGA
TATGGTTAAGICTAA

18 'faGTTICGT AAACAATGGAACCICGGGCACTAAGA ATC:AATG ACGTIGCATT TG I

GGGC.ACTAAGAAATTGGC
18 AACAGGCACAAACC I ATT AGTGAAIVAAGCCITGIT IT AGGAAATGIGITITG IGATTIGACi ATTGCTTGCCTATTACAAC

GCAGGITGTIOTTAATGGT
18 Tar ACCAAAA TGCATAACAGCG TATG I AAATGIGGT
rrGAGICGTGAAGATUTGAGATTGATGAATIAM GCAATATGTGA TIT TGAA ATGIAAATGTCiGTG
TAAA

TAGGACACACCGTCTGCIT I AGAGCIA GAAAATTGAG TAT AAACC AATATIAT ICI
IGACGGAGG
OS CGGAGGTAAG TCAAAACCCAAC TGAC.TTG TAAG

GGTGACAGMAGAGAGTGGIGGTITTITAACACC TIAAGCCATGAGAAAGCT TAAFAGACCITCATIGGIT
06 TTGGITGATG ACTTAAC.1 TCC GATG

"fGCCGiTATCACIATTCAG

18 GCGGCCTC.A.ACAGTAATiNAAAGTATACTGGCTITTGT
CTAAGGAATTGAAACGGCCCATTCITAACATCAGTA AlTACGATGITATGGCTAA ACTGGCTITTGTTCAAAAC
08 TC.AAAACATC . ACCGTATGA 7667 ATC
18 ACTGATT AAAAGCATCCACAGAC.CAGCATGTGCAGG
AACTGGTTTGAAACTGAACCTTACTGGIGTACITTA A GIGTTCTGATTCAAAATIC

GCATGTGCAGGGTAATGTT

GCTCTAATAGATC.AGGATT

ACTCGTICTATGTCGTATTGCAGTAAAGCATATACC CGTGCGGTATAATC.I. 'MCI
ATGCTAAAGGIITTATCOG

AAGCTTAAACCITGCTITTGGTGACACC.ACACAATCA TATCAGCTATTTAAAGGIT

GCCTGTGGATITTTTGGC

TAGGTATTACAGCGGTAA

TGCATATAGGGAGGCTGC

GIGGITAAACCTGGTGAG ITTAGCTGCTIATAACGGC
14 'F AACGGCA TTGATGCATAT A A

AGATGCTTTAGCTFCTATGACIGGTACCATTOTAAG TIGTAAITGla i F i G i ACM GI-cirri:1-c TGTGIACAAGCATACAMAGAGGCGTGTACTCAAC AIGTITAGTAITACGMT "FTGTG TIATAAGITTGGCC

18 CCATGTGTAAGTACCAAAAAGGGAAAAGGGTTCTAA GCAAAGGTTATTGCTAAGTGGGTTAGCAC.AAAAGC
ATTTC:FGTMCKTTFGTG AAGGGTTCTAACTTAGAG
17 CTTAGAGGAAG ACTATCTTAAT , GTAC GAAG

TGACTGATGTCAAATGTGCTAATGTACTTAGAATTA ATATATGAATGCTAATGG
CCCTCCTAAGAATAGTTIT
18 TGAAG i GAAGCAACATGC ATTGCG GAAG

AAGTGCCTGACTAGTATT CGATGATTACGCAAACCiA
19 AAAGGAC I. I G I I I IAACT GAAG C

GTTGTGFACCATTGAATGC

TGGCAGCAAATACTCTGAA
18 CAGC:TTGCAATCTAACTGTAGAAGATITTGTAMAGG GAACTGCTACTGAATAT6C1TCCACTTAGGATCTAC
71ACTGI1CAAGATGC1AA TITTGTAAAAGGITCiTAAC

GGATICATATGGIGGTGC TCTGTTTGTATATATTGCCG

AAGGGCATTTGATATTTAC

TITTGGTGACTATGTTATTGC.AGC.CCAGCATAGGCA TAATGTGTATAAAMGCT

CCCTAGTTAGCGCTACIG

ATCCAGCTTTGC.ATGTAGCTICACACTAAAACAGCA GCTACCATTATAAAGAACT
ATTGTGATGAATATGGATG

18 AATGGATAGCCAGCACICTTATC.ATAGTATTTTGAGA 1 TTGGAAAAGCCAGGCTCTATTATGATAGTATAAGCG GGAC.ATTAAGCAGITGTE

ACGCCGCCCAACCCATAAGTA ACTA TGACTGGCAGAATG AAATGTITGAAAAGTA TAG
27 AGI ATAGCA.GC 1 CAG6 , TT CAGC

GTTGATTCAACCTTTGTCACAGAATCGATCATCACTC TICAGMTTAACATATGT GCCAATGTATGTGCCTTAA
28 GCCTFAAT ' AAAA TCATCA CAAGC T

AGATGGATGGTGACGATGICTACAAAACATCCAGC CGTTTTTATGTCAGAATCC
GITGGGTTGAACATGACAT

TACAATGACCTGGGTAAT ATCTTGGATAGCTACAGTG

31 . AGTIGTAMIGG TCTC;CITAAA TCTIGTACAGGAICTCCGI AAATG6 18 ACGTCTC.AAGCACACTATAAACACTAGCTAATTTAAG
ACACATIGGTATGAAACGTTACTGCGCAAGATGTGA TGTAGGAGATGTMTG TT AGCTAATTTAAGTGCTCCT
32 MCI CCTACG CiTCCCIG TFAACC ACG
18 ACACAT Arrr ACGAGIGGIGICKITTATTGTTCCGGC ACCATAA A
TGCATTACCTGAGAIGGTGG TAAGCA TA ITGAAFATAAATGATIGCA
33 CAAGGF CTAAC:FTCATCT CTCG 'FAT MT
TCC:GCi CCAAGGT

AAAGTFCTAGTGCTGTAA TAATAAGTTITTGAAGGCT

ATCACCCAGCTCATGCTCCTAAATCGCCAGTTGCCTT AGCATTACAGTTTACTACA CCTTAGATAAAGTGCCACA
35 CiCC:ACAGG 1 A TTGA GG
18 CACAACAAAATIAATICCIGTGGAAGCGIGATAGCAT .
GAAGCCACIGGTTISTITGCTTC:ACCAGGAGGAGC:f 36 TGGGAC nc CCIGGGTTGGCITTGATG
GCGTGATAGCATTGGGAC
18 ICAC:GCCCTACITITGCAAAGTAGITACATGGGCAGC TA AACGTGCCACAG I TTACAA

37 . CM ACATGTAACACT C
AGTTACATGGGCAGCCAA .

TAAGGACICIFTI ACI TAT CCACATI ACITTGIGITATGAT ATTG
38 ATATTGGCAA ACAGTTCCAAA CITAAAGC.16CCATGC.TCT GC-AA
18 ATCCATA FACACACAAGGCOTATCTGCiCAGCCITTGA GCATOGAIGC I AAGCAGGI
TGACCTAAAT TGCATCT AATAAACAFGCATICCACA

GGCAGCCTITGAGCATTF

IGTGGCCITAAGCTCTC.iGG TACAAAGCTIGGAGAAFG TC:AAGACTGGTCAT TA I AC
40 TTATACAGG TGCiTGT 7I6 AGG

GGTGICTGC.ATGTATACA AATTGAATGFCCTTMGAT

18 CCACCAATAATCTICIGGITGAAGTTATICAGAAGTA TGCATITG ?TAM AGGCI
TGIACCGMACAAACICC A TACCITi AGATCAAG AT TAITCAGAAGTATGGITIG

TICTCCAAIGGAAAGAGITAGTCTCITCATCATACA AGATTITCAGYFTA FGCIT GI
AACGATGAGAP.AGTIAT

18 TCTGCATTC:CAAGAAAACTCTGITAACATAC:ATTTGTC RAT GGGGTAT
TITGCNCTITOGACGCCANI TAAAAA GAGTACAATGTGAGTA AA ACA TACATTIGTC:ATATGA
44 ATATGATTCGAG TCCTTCACTAC; GATGG TTCCiAG

CTGGACATACAGCCIGAAGACICCATCACCCCAATG ATMCCACITACCCA TIT T
AGGITTACAATITCAAATE
45 nrscis CATAT GAG CTCG
18 TGATGAGGGCiTTGATGGTGATITGATGCAGCACTGT CCGGCTAGACTTGAAATAGTTAAGCCCAGTGACTAA
46 CC.ATT . CTATGGTTTC.A GATTGGCCATTGCACCAT
GATGCAGCACTGTCCATT
18 MGGCGTAMATTAACACCC.ICATAGGAGTITTCACT
TC.ATGCCTTTAAATGCAAC.CGTAACAAGCCTTTATIC TATCATCTAAAGCTGGCA
AGGAGITTICAC.TTTACCG

18 ACAGATTGCMCGTAGGAACAGTTCCC.TCTGGTAAT GTAAGGAMTACGCCTGTACAGGGITATCAGACTG
C.ATTACCACTGGITTTGAT C.CCTCTGGTAATTAITTAG

TACCAAGGTTATTAAAGA GTGAGTTCCCTGCTATAAC

18 GCTTACTATTACAACITC.AGAGGCATGTITAATGTFTA
ATTGGGITAC.ACCTCTCACTTCTAGACTCATACAATC 7TATTICATATITTACAGAC
GTITAATCITTAITTAGGC.

CGTAAACCTAATCTTCCCA
SI TCCCAATT GCAGCATCAAT CTISTICAGCCAATCGCAG Alt 18 TCGAGGCTTAAAAACAGAATCTTCf CTGCTAATGITT

AG
52 CTGTTAGCA GCiACCITTGA CA CA
18 CAAGACCOGAACAGTGCfC:AACCIATAAGTGCCCCCA AAGIGATIATTGTOGAGGCAATTCTICTGCAGACCA
TCCTGATCCTAITACATIT

ACTTATAAGTGCCCOC.AAA
18 AGAGITAGCGTGAAAGGCCGC:AAACAGAACTITTAT OCGAACCAGCATTGCTATTTCGC.AGCIGTCGTGTAA
CTITACGGTTCTAGAGACT CAAACAGAACTITTAIGAT
54 CAI TUG 1 GACf ACATAA TCGT
18 GACAAATGCAGC.ACAATCAATAGTACZTTCAGAGTTT TGTGGTGATTATGCAGCATGTAAGCATTAATGTTAT

55 ACTATAGGTAA CACAGAAACTAC , GA GGTAA

TAAAGTAAAGTTATCTGAT GTTGAGGCTTATAATAATT

18 GCGGTAAGACCACCATTAATAAGTCC.AAGAAAITCT TGCITATGITTCTCAACAGCTTAGTCCTICTCCATAG
CAAC.AACTCTCTAATAGAT CAAGAAATTCTATCTAGAC

CACTTTAGTTATGTOCCTA
58 AGGGTT ACTCATA CTA.AG
TATGTCACAGCGAGGGTT

59 GACCIACA AAGCATAGCTAC.AC GCACCAGATTTGTCACTTG CA

GTTAAAGTTCTTAAGGCC AGACCTGAGAAGAAATAT
60 AAATATCTCC GAGTGAATGT ACG CfCC

61 AGTITTf GGA ATACCfTGAATGT TTGT GGA

CCAAAAGGGTICTGGCAT

GAACAGGACCGCATGCTA

18 CAGCCATCiTCAGGTGTTAC.ACCGCGTAGTAGAGCCA

64 An TCiCIGAGG GAGCCICTAGTGCAGGAT
CGCGTAGTAGAGCCAATT
18 AACCIAATIGCGCCGTTATAGCrATCTGGGAATCCTG 'IGACAGTACACMCAGGTTTTGAGIGATAGGCATT
65 AC:GA CAAAT ICITAI , GI TGGCCAAAG 1GCAGAA sf Ara GGGAMCCIGACGA
18 GGTTACCACCAGATGCCGAC.ATACTGAAGAC.ACCTCA
GCAGAAAAGTCGAGATAAGGC.ACCTATAGTCTGCT TICTTAAGAAGATGGATG ATACTGAAGACACCTCAGA

18 ACT7CTGCTTGGGCATTAGCAAACTGACATGACTAT GGAAGGACCTCTTTGCATCt=GATTCGCTTTCAACAT
AACTGACATGACTATTGAG

CTACAGAGATTCGCTTGGAGAAGCTICTGTITTGG A GAAATCICACC.ATTGCOT
TTTTCCAGGACATACTATT

TTCCAACACTGTGTCAAGC
69 . 1 GMT GICCAG I
CTGCTITOTTGGGAT Gra:

GAAAGTGACAGGGCCCCTTITTCAATA TGATGGITT GCTACCTAACTGAAATGAC

18 ICCATTCAAGTCCTCCGATGAGCTICCAGGACATACT AACACAGTTCGAGICTCTGAAACfCCCATICTCATCA
GAAAICICACCATIACCIT
71 GAGE crc3a. CC
CTICCAGGACATACTGACG

GAGGCGACACTCCACCAT GATCACTCCCCTGTGAGGA
72 AGGAA rraxi G A

TGCAACATGAGCACACTT AAAGAAAGACCATCCGTCG

GTCCATCCTGGGGCCCAA GGAGGICC.CGCAATTIGG
18 IGGGGTC:CAGCACGIAGATGTAC:ATTCTGCACACACC
ACACCTACAGTGGCAGTCACiGICGCCiCCTACTAAT A GIGTA.TGAGGCCXAT GAT
75 . CGG GGIVA CT
C.ATTCTGCACACACCCGG .
18 CCACCACCATACCCACAGaiGGACATCGAATGGCTT GCAIGICCTCiC:GITTACCCCAGCCAAGAIGCCCCAA
TGIACMAGGCTATCTITC GGACATCGAAIGGCTTGG

18 GC:TS:YWCA f CAGAAGGACCGGAGCTGCAAGCCCA1 TACMC TGGCACT
ACGCACCGGCCGCAGACACTT GA ACTGGATGTCCTCAAAGG

AGCTGCAAGCCCATCACT
18 GCCCiCTGGGACITCAGCAGGGCGIGCCAACCIACAC CFCCCAGTCiCiTCGGTOCi ITT
GCGCAC:GTC 1TGAGA A GGCACTACTGATCiCCAGG

GCGTGCCAACCTACACC
18 CGATCCTC.GATATCGCAGCGCTGAACATCGATITACC TGACCGCACTGAGCAACATCC.AGCAAGGCAGTATA
AAGGTGAGGATGITTGTG TGAACATCGATTTACCGCC

18 CAGCLIGACC.AGGTi CTCCAACCCMCiGCTGA TGCT CGCTGC1GCTGGGACACAIGT
TACCCCGCACGTACC
BO GAT AC GMTOCTCACTTCTGGCGG
CCCITTGGCTGATGC.TGAT
18 G TGGCCACTCiCTAGGICTITGAAOCIATACGACCACC AGGICATCACCIGGGG
fGCGCIAATCGTGCAGAGA TACTGAGCATTGGCAGAT ACCTA f ACGACCACCTAGC

CCCAACGAGGICC IIGTGACTACCITGACT G GAGACAAGAACGTCiGTGA

18 GAAGGGCACAT AACGGCACCTCGCTCAGTCCTAGGC TGlIGCGGGGA
TCTITACiGGCTTGCTAAGGGIT 7 CC

AGTCCTAGGCT.T. CTC

TACTAAATCCATCGGIGG
84 CTC . CCGTAG CG
CCACATTAGGCTTCGGCTC
18 TCTAC.CfCGACCCGTTCGOCACCTITICCATTGAGACC ACGTACCGATATGTCGCCTCCGACAGAGGACGACC

AGATGCCCACTITCTGTCA AGACTAAGC.AGCAGGGAC
86 ACT GCfT C T
18 GTCTGGAACGAGTGCTGGCTTTCGGCTGCGTTGTGA ATGGAGGAGTGCTCACAAGCTGC.CTTGAACTGGTG

TCGGCTGCGTTGTGATTG
18 TTTGGTTGGTCGTCAGGGGACTITGCCTGOCAACCCT TACTCGGGGGGTGGGTTGCCAAGCCGC.TTACCACG

TTGCCTGGC.AACCCTGC

GAGGACATGGTCAACCTG

GCCCGCCATACTATCTCCG

90 CiGC GCMG GACTGGGTTIGCTCGGTG

GMAACAGCTACCITTGAG
91 AU GG Ci GGGTGGGGGACITCCATT
18 GATGAGGCCtATGLGettuGCATCCGCCAGCCAACT CAGACGCTGAGCTAGIGGACGTGFAATGITGCTGC
CrCCATCAGAGGCAAGCT

ATCCOCCAGCCAACTATCA
18 GITC:TTTTMCCGAGGGGGATGTACCAOCAACTGTC TCAGCTGGACGGCTCTAATOGTCCTGTGGTTTCGA
TGTACC.ACCAACTGTCCAT
93 CATG GGAGG , CTGTTGGACCGCTGGAAA G

ACCGTTAGTGACAGCGAG

18 CC.CTGCTGGACAAGGAGCGAATCACTCTGCCCGATC 1 1TC.CAACC.ACCATCATGGCGAATGCGGCCCCOTTA CTCACCATCGAGGAAGCG
95 GAA ' GC T
TCACTCTGCCCGATCGM
18 GCCrCCGGTTCAAGGTTACAGCCACTGAACAGGACA CTCCC.TCACGGAGCGGCTITACGATAACCAC.ACTGG
GGGITCTCATATGAC.ACCC CACTGAAC.AGGAC.ATCAG

18 TCAGCCACCACGACCAAATCATCAAAGGCCGCAAAC TAGAGCAGC.CCTGAGAGCCITTGGCCTGTGGAGCA
TGTTACATCAAGGCCACA

CAAAGGCCGCAAACCTCC

CGAAAGC.AGGTCAATTAT AAAGAACTAAGAAATCTAA

18 GTC.ATTGGTCCATrCCTATTCCACGGATCAGACCGAG AGTCGAAAGGCTAAAGCATGGAMCGTATrTTGAC
TTATGGAGTAAAATGAAT
99 TGATG TTGGTrTCT GATGCC
GGATCAGACCGAGTGATG
19 TrGGTTATCGTTAGTTGCGATTCCAAGTTGTTTTCCCT AGAGAAGAAAGAAGAACTCCAGGATCTCTCCAACA

00 AACGA.AG TGTATGCAAC GG AG

CCAGGAGGGGAAGTGAA AATGATGATGTTGATCAAA

19 CCTCATGC.ACTCTTATCTICAATGIACAAGCGGATCAT 1 TGAAGAGTTCACAATGGTTGGGATC.AATC.FCCTGGT TTGGIGGATTC.ACAITTAA
02 CAGTC 1 rCiCIT I GAGA
ACAAGCGGATCATCAGTC
i AAAGTGOTTITCAAAATTGGGGAGCAA TATCCCAA TIGTATGATAAAAGCAGI
GATCTGAATTTCGTCAATA
03 Ci TCAATAGGG 1 TCATTCCC , TAGAGG GGG
19 TTCACTGACCTCCTCGGGAGATTGACCGGTrCTTGAG 1 ACACAGGGAACAGAGAAACTGATIGACCAACACTG
GAGAGGGTAGTGGTGAG
04 AG ' ATTCAGG C

19 CCTCATTTGTTGGAACAGAGTTCTrAGTACCTAAGGC GATGTGCTTGGGACATTTGATACCACTTTGCTTTGG
TGGAATTTGAACCATTTCA TAGTACCTAAGGCCATTAG
OS CATTAGAGG TGGAGC CT AGG

06 ACCC CGCAAG Cr C
19 GCCAACCTCTCAGGACAGCCTATGAAAGCCITAACAC CTGCCGACGCCTTAC.CGATTGCTTCCGTTGGCATAA
TCAATAGCACGGCCTTGA ATGAAAGCCTTAACACCGG
07 . CGGC CTGA AC C

09 'FGAGCTCGGACCIGTCCCTGGCCIGCAACTGGACGC GAAC GCACAGGCTGGAAGCG
GCCTGCAACTGGACGC

TACTCAATGCAGCATCCCT
TCCCTG ACATCC AAC1CGGAG6C6GC11' G

1GTGT3GTACACCC6.ACTC
11 ACTCT CIA CGCGATGCCG ICATMAC: :1 IATCTCCTGGCCCCTAC
12 GT G GAGACTGGGC.GCACAAC
CTGC:GAGATCTGGCCGT

AAAGGTCCGAGGAGC CTGCTACCCAAACCTICCr CAACGTGCA ICAATC1C1GG
13 , GOT C13 G T
.
19 ACACCGGGGATCTCATC3GlIGTGGIGTGCACCCGIG TCACCiCIACAACICCICTCCA6CGAGCA1 GCAGGTGG

GGTGTGC.AOCCGTGGA
19 'IAAGCACCTCCTGAGCACCCGGCAGCCCCATCACGTA TlIGTC1ACGAGTGCCAC
TCCACCGTCAAGGACAG I GGACCGGGGi IGAGAACA
1S Cr GCCGA A
GCAGCCCCATCACGTACT
19 AGTAGGCCACGGCAT(GATGCTCAAAGAAGAAG ICC ACCGCGGICTIGACGTGICTGAGCA
TCGGTCGACAC TCAAGGGGGGAAGACATC TC:AAAGAAGAAGMCGAC

19 ACAGAGGACGGACGAGTCGAATCTAC.AGATTTGTGG GAGTGCTATGACGCGGGCTGAGCCTAACTGTAGTC
TCTACAGATTTGTGGCACC

ACCIGGAGGICGTCACCAGMCiACAGGCAMACG TGTGGAAGT GT11 GATCC
18 AT aici GC
GGCCAACACCCC.TGCTAT
19 GMAT AACCTCTGCTIGGCGGAAGGG A TGATGCTC:

19 arTGCAAGAATGFCCACGAGGTGGGCGCTGOCTTA AGATCA
TGAGCGGTGAGGICCCAGAGGP.IGGCGG

TGC1GCGCTGGCTTAGC.
19 TATGGCAGTGACGCGGGCGGGAACCATGOICCCCC CTCAGCAGCCrCACTGIAACCCACACrCCGAGCTIA

19 CrCCAC.ATGTTCCTGCAGGTCCTGGACATGTCAAAAA GGGACGTTCCCCATTAACGCCTAGTTCGGCGCAGG
TGGACATGTCAAAAACGG
22 CGGGA . AAGG CTGCCACTGTGGAGCTGA GA

TGCTGCGGGAGGAGGT
19 CGAATCTCCGAGACTTCCGCAGTCGATCCGCTTGIGG CCGGACTACAACCCCCCGCTCCAC.AGGTGGTTCGTA
AGTGGTGATTC.IGGACTC
24 CA GTC Cr TCGATCCGCTTGIGGC.A

A TCTTCCATGCCCCCCCTGGCCATGACCCGTCGCTGA A
GGCATTACGGGCGACAA
19 GCTGCAAGCTTCCTCTACGGATATrACCAGGACGTGC CTGACGCCCCCACATTCAGCCATGGCAACGGACGTC.
TGAC.AGACTGCAAGTTCT AlTACCAGGACGTCiCITAA

CCTGAGAAGGGGGGTCGT

TTATIGITGGGGGCCCTCTT

AACMCATGCTOCTCCAAC

GCGCTGGAAAGAGGGTCT
19 1677.7CCAAGC1CGCAAGGGIGAAA1CAATAGGGTG GICCGCSICIAGGCTICIGTCMCCCAGTICAAGAGG
TGAAATCAMAGGGIGGC
30 G C.CG 1 TACT GCCTCAGOGC.ATITTCAC CG
19 TGCACCCATGGAC.ATAGCTTCGAATGGGGTACATATA
ACCCACIAAACATATTGCTITTGICITTGTFTTGTGCAA ATCTAGACAATAATAAAT CGAATGGGGTACATATAA
31 AAACAAG AATATACAGCA , AGGGAGG AACAAG

TACCIACGCCTAAAATATT TATTCATATTATGTAGAAG

19 TGCMCCACCGCATCTGTATGGATATCCCTC=TATGCA GCACAACAAGCGTATAGGGIGTGCACTCTACC.ACCA
GTAAAAGATATCGAATTG GGATATCCCTGTATGCAAT

GAAAGITTGCACCCAGTC

AlTTCAGCCCCGTGTGTG
¨....
19 CATTGGCATGGTAGCACTTGTATGCATAAAFACCCTA ¨
GGGAATGACAGTGCTTATGCTGTTTAGCCACTGTTT GCATAAAGACCCTAATACA

GTGCCTTTAAAACATTCCT AGGAGTACCAAAAAAAAG

C.AGCTCTTCACATTFAAAA TGACAACACAAGGAGAAC

19 .ACCCTCGTATGAGACTTITCCTGTTATGTGCAGAACT ATGTGAATTTCAAGGATGACGCTCCCACATGTACCT
TGTGCTATGTACAATGGA TTATGTGCAGAACTATAGT

AAAGCGAC.AGCGACTCGA

19 CCAAAACAAAC.ACAGCAGCAGTGGATTMACTTGTG 1 GCTGAOGTGCTATCFGGCATTGTAAATATAATGCTA
AGATCTATAGGGTGATTG TGGATTITTACTTGTGTCG
40 TCCiACT 1 TAAGGCCC TACAT TC ACV

TGGCACIAICGCATITTGCAGIGGITCCAATGCCTAG
41 CICCIC 1 GCCC , CATMGCCACACACiCITT
ATAGGACCTGCAAGGCCTC

CCCCAGGIGTGTTGGACATCGCAGGA1TTGTAAAGT AGGACTCC.AGCATFATTA
42 CTAC ' TTGTACT ATFC
AGGGTOCACCATACCTAC

TACTTGCTCCAGCTGCCFCTTCACCATCATCAACATC CCCGGGTACATIACTATCA

GCGCTCTAGTGACAACIT
19 AGGC.ACGTCAGACF7ATTTAATFOTCTA17CAGGATG
ATCCAGATTATTIGGGAATGGCCCTGCGC.AAGTAAA CCCCTGGAGCTTATAACIT CTATTCAGGATGGTGATAT
45 . C;TGATATCIGT AGAACAT C CiGT

AAGAATATTTGAGGCACG AGGAATATGAITTGCAGIT

19 CAGI.GGAACAGCAAFTACACAAATITGTIATGAGCAT
GIAAGGITGlICTAGGTCAGGAGAAATTGGTGGTG GITIGIAATGICIATG TAT TGITAIGAGCATGTATGTG
47 G TATG7CiTG TGCACAC G161731 TG

48 AT TCT TG 176T GITIGTCGC:CTGATICTGA
CTGAGGAGGAGGAGCAAT

CACTCTGGAGGATTTGTFT
49 TTICIFTTG AC.AGTCX1CTG ATGTGIGGOTAGGITGGA G
19 GTCCCAGTTICIATAAGGAAALACAITTCAAAAAACA TATCAGGATTCACAFGAACAGTGGCAAAGTAC:ATIC
CACACACAGCCAAAAAAC TFTCAAAAAACATGGCTAC
50 TGGC.TACAC CATCATAATCCAC TG AC
19 CiGGGIAGAGGGGTA fGCGGITCT AGTTCTACCAGGC:
ACCGCAAATCGC.ACICIGGAGTCiAGACAGTGCCTCT *FCTACi TICIACCAGGCAGC
51 . AGCC CTCC T1TTAC.CCMCGCC7CTGT
C .
19 GTFCCCACF GCTICAGCiAGC7GACGAACCACCGAGA
AGACTICAAGAGGAGGC7CGGGTAAGCFGFTfCCC GAATCGCCAACCAAAACIA
52 CAGA GGACC. T CC.
ACGAACCACCGAGACAGA

ICTAITACACGCCCTITGAGIGGGATA FTAA AT TACAAATII TTGG MGT

ATTTGGGCAACTTGGGTA
19 TCCTGGA FCACiCAGTGA I HAM-RAU-Ft ITCAAT I ACATCATTAGGIGGIGCiGCiAGICA
I ATCiCAGGATT ccr AC TGATAT TAT TGACX:

GAGCTCTTCAATTGTFCCIT
19 TCATCATCAAACICiCGGGATTITCATITTGTAAGGCAA
CATCACITTTGONGTTTCACiCAGTGAAAGTCf ATATC AlTACAAACGAGAAACCC
SS CCGTC TGGTGCAG AG
ATTITGTAAGGCAAC.CGTC
19 CAI CAGG FACTATAGG ItTG 7 CAGIACATIGGCCFCI A i CACCTACITACCCITACACACCGGGTGCAAGAAA GAAAATGAGCAACTTf TT ACA
TIGGCCTCTAA.TAITC
56 AATA1TCCT AAATCATAAT AlTCCC CT
19 CCC ITO ACA (ATKA ICCCi IACTCAATCGTCiGAAAGT
ITATGCFGGATCTGAAAGGCTICTITCCAGGATCHA AATCGICGAAAGFIATATC

19 GGGATGACCTGATGTACTAATACC:FGTTGGTFTGGAA
ACAGAAAATCCTAACACGTAICCTGAGACACGTCCA AAA ItTCI GT ATAAFCCAG GTIGG
ITTGGAAAITGATA
SS ATTGATAGGATF TTCTATTGTC ACAGG GGATT

ACTCCr GAT CACCAG VG GGAGAACA11 AAAGGITTGCGAAAA1GTC

19 CCTGCTGAGCACCATAAATAAAtailt.!ACGCGAGGC
ACAAACCTCAMACAATTIAGGACC.TTGAGTATAAA TGT7t. FFFFFF GGAAGAAA
60 ACTTITA . GAGCCACTAGG GGAA
ITTACGCGAGGCACITTTA

ACTTGC ATACTCGGAACATACTATT

ATGCTGCCTTTTAAGCCA
19 GGTCTACCGTIGICTGTC7C.AACTAAAGCAGCTGITA
AAGAATGAAGTGGGCCATGTGTTGCCATATCAC.CCT GACGTEGGCTAAAATTGA ACTAAAGCAGCTGTFAAGG

TCAGAAAGATGAGACTTGACAATGCTTACTC. ICTCA TAGGAACTAAAGGACAAA
TTCAATAGCAGCAGTIACC

AACTCCCATTGTACTGGCATACATACTGGCAGGAAC AATA1TATTC.CCTAAAGAA
AATACCAAGAGAATCTACT

ACTAACTGAATCTAGATCT

ATGATTGTAGCTTGCAGGA

19 Gt. i i i i i i TCAATATCCCCCTGCATATGGGACGGAGA TTCAGCCAAAAAGGAGGACATGGIGTCTTGAGAAA
CGGAACAATACAAAAGAT

TATGGGACGGAGAAGAGG

TGAGGAATCACCTAAAGCAAGTGAGAATGTATAGT GGAGAGATAAATITTCTT

GCCAACTITTATMCCCA

CCAAGATUGGCAAATGGGAAC.ACTAAATITITAA TCTTAGTTIAATAAAAAGG GAAGITATAAIGGGAGCC
68 AGC.CAATG 1 GC.ACCCATTG ACTCiGG AATG
19 GGTCAGTITTCATAAC:CICTIGGICGGTCAGTACAGT AAITUTTGCCTITt R, CGCCAAAAATTGATAA TAATCCCTCAGAAGATGG GGTCAGTACAGTGGAMG
69 GGATTTGC GGCTCTCCATT , C C
19 GOTITTGAAATCTC.CAAGATCAGGGATERIGGAATGC TGAGGAACTTGAAAGACMAAACCGACGGGCMC
TGAGGATGAAGAAAGAA

ATGGGGAATGCAGTACTG

GAGTTTGAGCGATTGACG GTC.TATCAGCTTACGATCT
71 ACGATCTCTT ' GGGAG T CTT
19 AGCGACAATCTATCTTCACATGATTGTTATGCC.CCTCT TCTTGGACAAGAATGGCATGTGAACATGTAGATaT
GAAGTCAGATGAGGGTG

GTTATGCCCCTCTACTGG
19 CACAATGGACAAGAAAAACCATCCGATAATTGTGATT CTAGCGAAGITAAAGCC.CAATCATCATCATGGTTAA
GTGGTITTGITATGCAGTA GATAATTGTGATTTITATG
73 TTTATGGTTGGG C.AGTATCAGTAC TGG GTTGGG

GTTAAAGAAACTGCTOTA ITATGTTCAATGIAAGTGT

TTTGICTITTAGATCAGGCTTGGCCAACAGGITTCTC AGAAGATGATGTTATTGA TGAGTATTGTCAACCACCT

19 GCAACATAAGACAAGCTACATTCACCGGTCCATTM CAGIGGC.AGGAAGCATGGTAGCTAAAACAAGAGCA
TTAACAGTATGTAAATGG CGGTCCATTITTFTCTTTEG

CTCCTCTGAGTAAGGATTTACCTGAGCCTAC-ACCTA TTAAACAAGAAAGTCGTG
ITGGTACATTAGC.AAAGAC
78 GACTG 1 CAOCCATA rrGe TG
i 19 TGAGCAC:AAATATCATGTCCAACT All AAACCAMG 1 TGITGAGIACAAAGIAACAGTCTGGACATATAAATC TACAAAACCTATTATAAAG
ITAAACCATITGCTAAAGT
79 CFAAAGTIOAC 1 ATCACi ATGC:GAA , GCTCACi *FGAC

ACGAAATTGTAGTGTTCGTGTTAAGCCACCATTAGC AlTTGTAGCTAATATGTTA
CGGITTTATATAGTTGTTA
80 GTTG1TACTGC ' AGTAATATCATAAT CCTGC CTOC
19 CAACATCTGAATCAATGGAACAACAGGGIOTGCAAT CAGTAGCTGCTGGTTTGGAATC.ACTCTITAAATATG
TITTGTTAACATTGCTCAT GGGTGTGCAATTAGAARA

AATGATTITTATGCTAGTG CTITGIGTACTATGITTAAA

19 AC.AAGTIOACAGCCCTGCAITAATGTCTGGTCGGATG TGGTAATGTTAAACCTGGTGAAACTGAAATGCCCCT
ATTCTGCCTTATTGTGTAG
83 . AG TGTGGTC AGIT
'FAATUCTGGTCGGATGAG
19 GCAATAAC.ATTARCAGTCTGGACGTTTATGGICCATA
ATTGTGCTIOGITIGTAC.AAAATGATITGCCATAGC CACACTGGCACTGATMA 1TATGGTCCATATAGAGAT
84 TAGAGATGO CCAAACA C (SCI

AAAGCATGTAATATAGCT TGTATGAACGIGATAAAGC
8$ 't AAAGCTG CIAAAAACiC AAGTC TG 'FG

AATTGTGTTAAAATGCTCT ATCATGCTGGTACTGGTAT

AATTAGGCCCTATITTTAA TACTGTCATTITTGCAGAC

19 GGIGTG TG ICAACA TaAAGTICATCCGITIOTTGM
CAGATCCTGCIATGCACGTTGCACTAAAACAACAAG TIGGICCCa MT TAGACA CCG IT TO n GT
ricrArr GG
SS C.TATTGGT TTCGTAAA A T
19 GIOCiGCAGAAC.ATTACGITTAGTATARGCC:AGACTr! TGOATCAGTGCTAAGAATAGAGCI GGCC I
GT(ATA TGATAAAAG TOCTGGT TA AAGCCAGACTITATT ATGA
89 . AITATGAGGC GTACTAAGAAT TCCAT GGC
.
19 CaGGCCAAAACIAAACTACIAACAACCIGT TGIF ATG
ACATGAATTTTGTTGTTCACATGGTRAAAACTTGA CCITCATOTATAAAGGAI

CaGTTCTTATGGGITGGG
19 GaCTAT CAAAAGAACACIGTCAGTIOCAG A ICCTIC
AAGICTAGCTATAGATGCTIACCOACACGAAAGAC ACTA IGTTAGITAAGATA

TTGaGCA.T GGTAGCAGAA TrATGAA TOG TATG611.31 AATCTTGC.ACGGGTTCAC
19 ACACC.AAIGGAACACTATAAACACTGITiC.ACCTACAC.
ATGITOCTAATTATC.AGC.ACATTGOAGACITTCCCG GTTITAACATCACATTCTG
93 TTGIC.C. TACCAGG TAGC
GTGCACCTACACTIGTC.C.
19 TGITCCAATOTMCGCGAGTAGIGTI AGAGGTIGGO (TT
TCCACTGCAAATAGGITITICAAAGTATAACAAT TACTAAACiATGAAGCCAT
94 rrGe. C.TCTCTCAGCAA TAAACG
TGITAGAGGTTGGGTIOG
19 AAACCIT TAGGATIOCCIATG ITAIGGITATTGC.AGC TGATGCTITTCCIOTAGCCAAGA KT
;TA IGCACATC GI ATCCAATAATITO AAT
9S GIOTCA ATAGAC.AT GAGGTC

19 GCGCTICTIAAAGGAACGTAATCAACCITTMCIATT A 1 C.ACACGGTG
TAATCTAGGIOGGGIAGTIACAAT GAACIGITITTGAAAATCT CCITTITICTATTATTC.AGA
96 ATICAGATAChIC. ATTCITCAGC.ATG TAAGCC TACGC

TCGCCIOTCrGTICGGC1 1 AGAAAAGCAAGAAATGG ACT AGGTT AI CAATGATTA

19 CGC.ATATTAACAGGAAC.AGCTAATGACAGGCTGTAT MTACATTTAGGTOC.AGGGICTGCCAGATGGTAAC
ATGGTAAACCTATTAATTT ACAGGCTGTATGATGAAT
98 GATGAATGT . CACTGTC GCCT GT
19 ACCAATAGGACAGGAAGCAGAAACCTICMTGCTTC ATCGTTCTTGTGAGAGTACTACTUCAGGTAAAC.AA
AATACTTITTGTCCTTGTG

AACCTTCTTTTGCTTCAAGT
20 CGAAGAGGAAGAAGAAGGTGAGTTACTGATTATTCT TCGTAAACGTAGAAGTAMCTGCTAMCTGAC.ATT
TGATAGTTATCTIOGITGC CTCATTATTCTGITTCTTCT

TCTCTCAACAGCTCCTTCTCC.ATAGC AATTIGGTGCAATTAGTTC ITACAAGAAATTITATCTC

TACAGGGTGTCAAACTTGGCATACAGTAACATAT AC CCAGAGACCAATAATCTT
ATTGATATCiAAAGGCAAG

AAACTATTAGGATGICTTA
03 ACT TrAGGTTG TACTCC
CGGTCATTATGCTGGAAGT

20 GCMTGGCGAGG I I I Ra I I MAGTAAAGATTCTAAA
TCAACAGTGTMGGTAAAAGAGGACCAAGCTTTAA CGCTAATCTTGITITAGCC GTAAAGATTCTAAACCTCA
04 OCTCAGCA CATTIC:AGCM AAG GCA
20 AGAGTCAAACTGTICIGGTAATTGICAGAACACTGAI a GGTACTCAGCACAITTCAAAT
GGGATCATCAAGA CITGAAGAGAATITAAAT CAGAACACTGATTCTGATI

TGCAGCAATATGCAOCCACTTOGCCITG1TC.ATTGAT AGGTACTGA ITCGAAATG

AGATTTIGTGCGACAATGC
20 OCTCTGTTGGCCATTICTIGTGAAAGCAGGGCTAGGA C.GMAGAGGCGAAGAAACAATTGATTIOGGTGGGA

, AGACTACACTCTCGACGA , GAAAGCAGGGCTAGGATT , CC.TAATOCACCICCTIGTTATCAGTTCAATGCTCA AT ATGTCCAAAGAAGTGAAT
08 GAATGC i TTCAAAGC GAGGGCAAGCMCTCAA GC

AAATICAAATTACCIGCTG TGGAAGCAAGTATTGTCA

20 aCCITATGCTTGC.AATGTACTGATAGAGCTCGATGA GAATTATTTCACAGCAGAGGTGTCOCAGTATTAATG
TGCGAGCT/ACTGATTCA GATAGAGCTCGATGAAATT
AATTGG TATACCCCCTTCA AT GO
ATGIGGCTCAAGTCTCOGGICAGNGATOTGGTAAAC AMTGGGAGAMTACTGTOTCOTGCCITGAAATIT
ATATGGATTCATCATAAA CACAGATGTGGTAAACTTT

TCGAGAGCATGATTGAAG

AAACGATCCTATGAACAG
13 ACGC TR.TTGGIC MG
GAAACTGGTGGAGAACGC
20 TCTCCAGAAG1TCCGGTCATTA660ACAATGGTAAT GGCGATAATGGACGAAGAAC.ATTCCaTTGAGGAT
TGCAGCAATAAAGGGAGT GGGACAATGGTAATGGAA

ATGATGGATCAGGTGCGA AGCAGAAATCCTGGGAAT

20 GGGATC.ACTC.FIGITCCTCTAATGACATGCCATTCTOC. 1 AAGAGGACAACTATCCACTAGAGGAGAATCCMTG ATCXAGC.ACATAAAAGTC

TGCCATTCTGCAGCAT
i AAGCITTCAGCATCTATTACTGACACTGTATATTICT GAGGAGGAGCGCTATAA
GATACTTATTGCFTTGATT
17 1TGCTC 1 CCIACiCACAC , MIT GC( C

AAGAGACGGAGAGAGACCGAGGITC.AATGTTGGA TTCTGCCTCTGTGTCTAGC
18 AGA ' CGAGGAGG T
TCTCAAGGTCCGTTCCAGA

CCAACTCCTGAAGAAGTG

GGGCAGGAACTAGATCAG

GCGACTGTATGTGAAAGA CITTACCAAAAGGCACATC

20 TTACCAMAGGACGCCCAGICTGGTGGTICATTAGG ACACCTGTfAAACCAAATACAGCTGITGCAGATGAC
GTATTGGTACAGGMGGG CTGGIGGITCATTAGGATA
21 . ATATAGG TCAGGAG GI 'FAGG

TGCAGCCIATACCIOAAAGGCCIGGGCIOTTGCATC CACAATAACAGACACCAC GGTGATTCAATAGGGGGG

GIGTATGAAGATATTCCTT GGACICAGGATITGGICA
23 'MGT CAG GGTACCTGT TGGA G

CCAATGAAGATMGCTGA
24 CTGACA I/WAG CNICAGTA.GAGGCTGC:AC CA
20 GaGGTGCAC-ATCCTACAMAAACAGAACAGCGGAT I TACAGGTGAATATTGGGATGTAGCAGMACTAATT
ATTAGAGATACAGAAAAT
IS CiGA TT 1 GI ATTGCAGGACAA CCCA A It CAGAACAGCGGATMATT

TGGAAGACGGCiAGC:AATCACCCATAGICCC TIT RiCACia GCT AA ITTC GCAGGAAGACAGATC1 AG

20 AAAAGCTAATIGCCATTGATCAAGTAIGTAAGGTAAA 1 GIACCICCCWICAGACFCCGGIAAATCi AC:AGC:GG ACTAGACATACTGAAGM AMU. AGGTAAACICT TGAA.
27 . CiaTGAACC 1 GTT TATGAAC CC
=

GCAGAGCATGCAGIA1TGTAAAC AACGGTFTTCf MACAU'.
28 AGATITAAGC AAC.ATTCS TTGGACTGTTGATCTOCA AAGC

AAGICiACAAA TGCCCGGCT ITAAAGTGCA 'IGTCATAATACACIAT AAA

20 GAAACAAACITCCAAAIGTGIGCATAAAGGCAATGA A ITCGCiATTRXATT(CA
ItGACCiATAGIGCATICG CGACAMGCT ICAATCCAA AAAGGCAATGAAAGAATA

20 ACCGGTTCTCMOTAATCATAC.A ATGOTGAACAGTA
TC.ATTGAAATIGGAGTAACACGGATATITTGTTGOC TGGTGAACAGTATATGTAA

20 TCGGACTGACGAAAGGAMCCalITCACIATAAGAC AACiAGGCGAAGAGACAAT I
GAAGGT(CGGTGGGA CAGGGCAAGAMCAAAAC CTITTCACTAI AAGACAAG

AGACITTOATGACTGCAAAGATGTITAGAGATaGG TACAGGACMIGAAAATG ACiATCCCAAGGACAAAGA

GAATACATAATGAAGGGAAAGTCATCC: CI GATTC:AAGCTGGATAG TONfGAAAT AGGAGAAGA

TGCATAGGAACA I ATITGAGAAATGATACIG "FGAACTITGTA AG TATGGA
3S GAGITC GGOCCTCG ATGTGG Gilt TICTIKAGTCTCTICAGCA TTGAGAGCATGATTGAGG
36 TTGAGGC . GTTGAA G C

TGACAAAGACATAATGGA

CGATAACCTCGTTTCAGGT
20 CCTAGGCATCAGC.ATGTACCAGGTATATCACCGATAT CAGAAAATAACTGGAGGCCTTATGGACTGAGMAT
GGTATATCAC.CCATATGAG

ACAGAGATTCGCTTGGGGAACTGTITTOGAGGGAG TTACAGAGG A TGTCAAAAA

20 CACTCTItTaTTCTCTGGAAATGTOTAATOCAATC.AA
ACICiCACAACATGACC.AAGAAAATAGCTCTTMTGT GAAGGTTAATAGATITCCT GTAATGGAATCAATGC
ATA
TGGATAAGGAA CAGCTTT C.AAGG AGGAA

41 MGT TfaCACAAC ACCCGGAATGCAAATCAG
AGGATTCGIGCACTTTGT

20 GCCCATCCCACCAG1ATGTGT1AGGAGTCTCAATC11 TCCAATCCICTGATGAMCGCTCCTGCTfGTATTCC
GCATGTTTAATATGCTAAG TTAGGAGTCFCAATCTFAA
42 AAATC116 Cf CMG TACGG ATCT1 G
20 TCATGTCAGCC.GATICATIAATTCCTAGGCAACITCAG
AACAACGACCTIGGACCAGCTGIAGTCTTIAATGAA TTTTCTACCGCTATGGGIT
43 CATGG C.AGCTGA TG
TAGCCAACTTCAGCATGG
20 IGGTACAT1TGTFCATCCICAAGAAATT(EC:AAGAGA TMCCCTAGCAGTFCMATCGGATCAATTCGGGCC
TTGCAACTACACATICATG

ATTCCCAAGAGAAATCGCT
20 ATTGCC.AAATTCACsC.CATCCTATGAAATCGGTGCATA
CCAGCTACCOGACCCTACAACTTGCAGTGATGIGTC ATFACTGAAAAGGGTGTA GAAATCGGTGCATATATAT
45 TATATAACCC CA , ACC AACCC
20 ACACAMGTC.ATCTAGAGTAGGTTFCCGCAGAGAAA TGGAAOCATATAATGAAGTGCAAGACCTCFGAGTFG

CCGCAGAGAAACACAAGT
20 alGTGCATGTAATAGCGCTTGCTAGTAATACAAATT ACATGCMAAAACGAAAGTATGGACTIGCTGTATC
CAGATACAGGTTCAGACT CTAGTAATACAAATTATAT

20 CTC.AGGTGTGTCTCCTGTTACTGCACTATATTGGTATA
TGACAGTGTATTTGACCTGTCAGATCATATGCTATG GCACTATATTGGTATAGGA
48 GGACAGG TCACTTICATCT 1TGAGCC.ACCAAAACTGC CAGG
20 ACACTGGATTGCC.ATTGCTATCGTAGGGTAACAGTAT ¨
CACGTGGTCCAGATTAGATTTGCAACGTEGGATA ATACAAATCCTGCAACAG GTAGGGTAACAGTATTTAC

GATCAAATAGACTATTGG GTCTGGAAAATGCAATATA
SO GCAATATATTATGC AAGACTCTC AAGTGTG TTATGC

ATTGAATGTGTAAATACCT TACAATGTTCTGAATCTAT
Si TATGTACAG TTGG GIGA GT ACAG

GTGGCAGACCTAATACTGT

AAGTAGCACTAGTTTTACC

TGCATTTACCGACCCTTC

TGCCTTAACATCCAGACGTGGGACATAGTTAGMT TFACCAATCCTGCATATGA ATAGACACT.
ACATTAACCT
54 TFAACC TITTCAC 1 rtACCC:AATC GC TFTCAC
i TCCIGAAACACAACGMAGTATGGIAACACCTAAA TGGACATCCATACTATAAA
55 A FTCCT 1 GGCTGCC , GI TAC:C CGCAAGCAAGACA
TFC.Cr 56 CTFCTGT ' CACA AAATT
CCCATGTTGCTAC.TTCTGT

TTTACGACGCGAACAGTTATTTGCCTGCATCTCCTAT GTACTTTGCAGGATAATA AGGTTCCCTTAGACATTTG

AAAGGGCTCTGTACTATA GGCACATATATGGGAAAG

GTGGTCAAGTACTAGTTGC
59 . ACITIGCC GCCCAT CAGGGIGGAATGCATFGG C

GGAAAATATTGAAATGGG ACTGGAACTGCTAAGCATA
60 TAI CTAGIGTGTG It AGTCf T

CTCGAGGTCTGGATACGAGATGGGCMGATCTA IC CMATCCAGGTAATAATA GAGICAAGGEATTCTCATA

20 GCCATATGGPXACAGTGGTCAGATCTAATGTCGCA AAAAGTACACATCAGG.AAGGCAAATTGCCATCATCC
ATGGAGAGAATAAAAGA

AGATCFAATGICGCAGICC

AAAAGGICGAAAGGTTGAAAC.ATGCCTCCTTATTTT
63 CiCCGIA 1 AACTTGAITTCTG CTGGATCAGACCGAGTGA GI
ATCACCICF GGCCGTA
20 GTACGCCACCATCAAGGGAGM AACAAAAGAGAAG . AI
GCTAGAAAGAGAMTGGICCGTIAAACACTGOCT ATAACAAAAGAGAAGAAA

20 TGCTAATCiGGTCTGCTGACACAGTITGA FTATCGCT G
TCFCICTIGGAAATGIGC(ACAGICTAAGGAIGTCC AGAAATGATGATGFTGAC AGITTGAITATCGCTGCTA
65 , CTAGA ACGAT CAA GA
.

GAAGAGCAACAGCTATTCTCACiAAACFCTCCCGCTF AAGAAGAAGAAGTGCTAA
66 GCTAACGG ACTATCAAC GCGGATCATCAGTC.AAGA C.GG
20 CCCACATCAT TGATGACGAATAAGIGAACG (AC TAT T
AGATCAATGGCCCIGAGICACAATTTCCCAG ITCCT TTAAGGGTFAGAGATCAA GAACGT ACTA.' TG
TCICIT:

OGGTAC.AGTGGATTC:GIA
68 CAAA CAC A GGACAC1.
GTTCCAGCAAA

ATTAAGCAC.ATTAGCCTTC TGCATTGACTGAAGATCC
69 TCTGGG TataTT A
GATGAAGGCACATCTGGG

1TAIGGATGGC.AAAAC CAACAATCA TCTGACTAGC

AMGTGACC:GGGAGAGA
20 AGCGICi 1CAACRTAAATAATTGCTFTAAGCAATGAA GGITAGATFAAGGG1T1G 101 ACGTAAGAGAACiCA AGAATCTI TGGTCi 1TACCF ITAAGCAA TGAATC FCTGT

GIGTA CGAAAGTFTATAGCTAGC

GGATC.GCTCATTTGCATiA
20 ACT7 ClICAGGAGAIGGCACACACAACAAGGAGAAC
GGGAAGTGGAICTAGAACGMCATTAAIGGCGGA ACAACAAGGAGAACGTIC

20 CAGMCACCTAATGGTC.TGTAACTITGGGAATTGGA GTGGATCACGTTCTGGTAGACCCATCTACTGTTACA
GCAGMAGMAMTGO
74 TCTGGG . GATGGTCTT TGG
ITTGGGAATTGGATCTGGG
20 GGTAGCAGTAGAGICAGTGATITiTATTTAGATTTIGT ACTGGCGGTGTCACTCAATCCTAGAACTGCAAAACT
TTTAGAITTTGTAGCFCCTG

20 GCTAMCGSCCAAGTCCCGAATTIGAAAATGCCGCC AGCAGCACC.AGACAATGCATTTTICTGTTAGTATAG

AATTTGAAAATGCCGCCTA

AGACCCTATAAATGCAACAGATGTTGCMCTGC.AT AGTGAAATACCACCAA TA AGAATTGCAGACTATAGCT

CACAACGTITAGTTIGGA TATCAGGTGTAGAAATTAA

TAATAGCCCTATFCAGGAT

CATTICAATGICTTTATCT TTCATTAACCTGCTATGTAT

20 GTTFCTANISCCCTCCTC.OGTGGTGGTGACAGATF GC CIGTCi TGGAAACCIGGATATIGTGTAAGATCITACC1 GAGCAGGAAGAGCAGGA GTGGTGACACiATIGCPAG
81 AAGT TTAGTCiTCTTTGT T T

GIGCCItiGATATTTFACACACCAGITCCTFATATCCC ACAGTGGCTIAGGACATG
82 TAGCGT ACTCC.AAA A
AGGIGGAMAGGTAGCGT

TTATCCITGTITGAGITTT ATGTAGAGATACTGTACAA
83 TGTACAAAAACTG ACAAAACTTG , CAAG , AAACTG
.
20 MAGCACTTCTCATATACACATC.CACCACTAACAGAT AGATGGAAACGAGGTGICTGTGAGGTAATTTAATT
GCTAATAGTAAAAGTCAG CCACTAACAGATGCTAAAT

20 GTTIGTATGCCCTGCCTTCFGCCAAATACAACACTGG C.ATTGGCAGCATCTGAMGCACCATATTGGGAMC
C.CAAATACAAC.ACTGGGAT

20 ATGTCTATAAAGTAC.AGGCCATCCGATATATTATCAA ATG
TGCATITTACCMGGAAGCATTCCAGTCC.TCAC CGAGGAAGAAAAAGCTAT ATATATTATCAAGATGCAG

171TACATGGGTGOCGGATGCCITrGTGATGTATTA CC TCAATATTATTAGTTF
CATTGAAATGTFGGAGGTA

20 TGGGAAC.ATAGCCAGAGGCCCACTTTGGGGGTCTGG CAGGTGGGGGICGGGTAACACAGGAGGCCCAATA
TGGATTGCCAGTGIGGIT

ACTITGGGGGICIGGG TAT

ACAGACCCIGTTACACAATCTGCCAGATCCTICTGTT

AAAGGGICTCIGTAGGCA

GTAGGCCTACTFTATCTGA CTGAAGGCCACATTAGACT

CCTTCTGTGGTAGTAGATT ACTACATTTTATTTACATCC

TTGGGATGCTGCAAAGCCTAAAGGAGGACATGC AGIT ATGTTAAAACAGGA GACAAAGTGTC.FCTAIGGA
92 CTAIGGACC 1 ACC AAGGA (X
t TGATAACA
93 AACACACA 1 TTGCAT AAT , TGT C.ACA

TCTACCTCCTCTAGAAAGCGGTCTTAAACCCCCCGTC ATCTTFTTGGAAGGTGGA
TACAGAACGTTTGTCCTCT
94 GTCCFCTG ' TAM TCT G

CCAACTTTCAGGTATGTAC TCAACAACAAGATTACTCA

TGTAAAMGGAGGAGTT MACCTGTCAGTCTTTTAG

20 TGCCCCTCATTTAATTCAAGAGAATCTGTTTGAGGTG TAGCGAATGGTATGTTGTGAC.AGTAGCTCTTCCATT
GCTTCAAGAAGTGTTCCT
97 . CICCAG GIATCCAAA C
CTGTITGAGGTGCTCCAG

GCTATMAGCGCTAAAAC GTTAACTAAAACACCMG

TGGACFATAATGGACTTFA ATGGAAATAGAGCATATIT

CTCCAGAGAGTCGCCCCGTAACTCTGTCGGTCGACT CGACTACTACCGACGGAG ICTCTAGCTCTACCACGIC

AGACCCTATAAOTICAGGriCTGAATCFGCAGC:ACT GCAAG TGAAAITGATATA

CAGATGTGGTCGGTGTTG
21 AGGIGTACTGGTACGIGGAGAGGCCCIATI AA TCAG GGGGAGGGATITCPACCi ICITATCAATArruca GT AGITITGGAGAAGAAATA
03 . CCT TGGCTGTI CACTI
ACCiCCCTATTAATCAGCCT .
21 CTGCAGATGITTCTGAAAACCOAGGCCAGACCAGCiA GAACAATACGAGTCAGCAGAITAGGC1TICTGACCA
CAACiATTTGF:AAGAGGIG
04 TITTGC. ATAGTTAGTCCA G
GCCAGACCAGGAMTGC
21 'FGT. I GCAAAT ATC:AGCTGIGCATTAAATACCTGACAA TGAAGGAGAATCF ATGATCAT
GCCCCTTGCAGCF A T TTAACIGAAACTACAAAT AATACCTGACAATA TACTA
OS TATACTAGAGGA TTCFGGAAT GAAGGC GAGGA
21 Ca IT TGCGTCGCCTAAGAAGA TATGGCAGF AGA TGT RAI GMTACAGA I
GGCFCTITGGItiAGIAC.ACGAG A TIMM TCCT AA TG ITC TAT GGCAGTAGATG TF TA I
OS TTATGATGA CAACAGG CAGATG GATGA
21 GCTCTGICTATCATTGCAAATTTGTGTAGTGGAAATC TGICTTTAACCCAGAAAGAGAGAGACCACCTC.TACC
GTGGITCAAATAATATTAC GTAGTGGAAATCAGTTTAG

CC.AGCCACTGGIGAACACTGGGATCCTITIGCTGGA A TATGGAGATACAGAAAA CCGCAFATOCITI
AAAGCA
OS AAAGCAG GC TCCTG G
21 AGIGTCiCCAAGIGATICMCITAGGGGTCTACATIT
CTCGGATITAACAAGCA ITGCFGT TGCCI GTITCCCC GCiGGICTACA ITTITGICI

21 CTGCCCATCTCCACGCGAAGCCGCGGAAATC(AGCA GCAGITIGGCITCGAGGIGC.AITGATGGCCTGGAA
AT CFGAC GGCAAGGCCI. TCCACATG
CCCiC.C.GAAATCCAGCAAT
21 GCCGIGCTGCACCICAAACATTAACAIGTACCCGGCA CCAGACiACGGTGITGCCiCCGGCAAAGGAGGACCA

AACATGTACCCGGCAGTC.C.
21 CTGAACTAAGAC.TGGGTGTCCATCCACTAACCATCCT
GTAGACACCCAGTTATGCAGTGTGATAGAATTCCAG CCCFACAACACAAAAC AA
12 TTTGC . GAGGTTT ACT
TCCACT.AACCATCCTITTGC

GOGGGGACITTATGTGAC
13 TGACCC C ACCCC.AGTCCCGTCCA CC
21 CCGGTATCGGGCC.AGAGGTGGIGGAGTGITGGGCT

GTCGCTGACTCCGCCA

CTTCTAC.CICTCCCTAGCC
CGCAGGGCAAGACGAGGGATCCAAGGACTCGGGCC GG C TCCAAGGACTCGGGCC
21 CACGCTGGAGGCACATTGTC.CGCCACTICCIGGGIC GTGGCTGCCMGOGATGCAGCAGGCAAGGCGAGC

GCCACTTCCTGGGTCATG

CAATCCAACACGAGGCAA

18 TGGGCCGCCIAATGCATCCTATGCGCCGAGGCCTU ATGCGGGATCUGCCGGICGAAACCCGATGGCCCCCi GCiACACCGACTICACGC TATGIXTCCOAGGCCIT
21 TCCT1C17C.CACGGCGCCGCGAG1TACGCTCCTIGGA TCAGAAGCTGAGACCGAA
19 G AC. TCGTGAGGCAMOCATC.CTGGCTOCCGCGCATAC GA
C.GAGTTACGCTOCTTGGAG

20 C ACGA A CGGGCCTGGAC. MAGA AAGCT
iAGTcG
21 ACTTGTAGGCCX:GGGGACCCCGATCGTAAGOCGCAA
GTTTCAGATCCCACZC.CTOC.AAAAACACGGCCAAGG

CGATCGTAAGCCGCAATGT

AGAATGGCCATAGGCITGG CCTGAGAGGAGGCTTGTG

CfCGTTTCGGCCCCGA

GAGICCGACAGGAGGICT

AAAGTIGTGGTAGACGGG

TGTAGTTGAGCGGCTCCT

TTCGTCGCGTGACCTGG

GGGTGTTCCAGTCATCGG

AGOKTTGACCAGGTAGA

ACGCAGTTGCTGACGC

30 AGACCCACCACCTCGTGCCCiGGCGTTIGTCCCACG TCA GGAGGATGCGGAATGGC
GGGCGTTIOTCCCACG

GCGGGGGGAGGATGAAA

21 GGGGTGGTGAGGATGC.AGCTCCTTTATGCGCTTTGC CGACCTCAGCTCCGAGTGGGAAGAGGTAGCGCCCC

OCITTATGCGCTTIGCCG

TTCTCTATCGTCAACTGCG

CGGGTCTCCATCAAGICCC C

GGGAGGAACAGAATGAG

TCCCGCFGAGCAGA

36 TGCGGOGGAGGCTAAGC:GCCCGTTGAGGCCCACGT AG AAGGCICAGGACGIGGG
CCGTIGAGGCCCACXTT

CCTACTTGGGAGAGTCCG
37 Cl C C
CAAGOCCAGAGACACCCT

38 TCFCGTCCCACCTGGC:GGAAC:CTIGTCTGGGAGGCG AGGGGICAGMGCTOTCAMX:GGICTTCrTCIGCCA
AGAGGGCCCIGOTGAGA ACGTOICTGGGAGGCG

AAAAGCCTCOGICIGGICC
21 ACC:GCGTT AACA TaCGGCCAGAACTGGGCCTCGTG
ACiATGOGGCGCAGACAGCCACAGGGGCiATGCCAG TAAGCAGCCGTGACTAGC

GAACTGGGCCTCGTGGGA
21 MC ITCIGCCTGTITTG11T0TACGAGAGAAC:CCGCC
CTCGTCAGGCCGCGAGAGGATGGCCTCAAACACT G CAGGTGAAGCTGCAG ITC

ACGAGAGAACCC.GCCAC
21 CaGAGITGATGAAAGGGGCCGGATCTCGGCCGITGT

GTGCCTGGGCGCAAGA GATCTCGGCCGTTGICTC

ItIGGACATC.ATCGATCCCGGC
43 GA TACA GACCCTIGTCCTCC. TCCA
GGGGGGIGGCATATCTGA
21 TCTG ITCCTTCTGCTLYAGTGCFCATTACCTCICCiCT
44 AGACGAGGCACAGGC.ACCCCTGGCTCGCCTGGAGT GTCGA
TGGAAGACATGGCCGCC C. TGOCTCGCCTGGAGT
21 ACCGGTGACACCAAGTCCATCTGICATGICGGGGGC GATGCGCCTGACGTIGTGCGGCCCCACTCiTATCCAC
45 CT C TGCTGAGGTGGGGCTCi GTCATCiTCGGGGGCCT.

IGGATGIGCATATGGTIT ITG I TGCiCCAGICTI AATTIA TT
46 CGGA C.TGATGGC GTC
ATTGGTTTCAGCAATCGG A
21 CGCCAACCAACACCICCCItIACACiGCAGCGTCTTCA
lIGGCTTCTGACATCTCCCAGCTGTGCGTAGTGITCi TIC; TGCTTCOF GATGGCA

CAGGCAGCGTC1TrAGGA
21 GGAGTCCCiGACAGTITCATGCTCOGGCTKAGGGGC AG TAGCGGGCCTCGG TGlOCA
TGGACGAGGAGCT

CGGGCTTCAGGGGCAT
21 GGCAGAC:AICCGCCA ITACCiTTG MCC' GCACITACT GCAGIGGCTA
IGCCTCCATGCCAAGAGCTGGIGGG

TGICGCTGCAGTACTACGA

GAGATGGICiTCCGGAGACCCCACACCGTGGGCCCC ATAACCATGGACGAGGAC GGAAGAGGACGAGGACG

GGGGGGGAAGTCGICTITTCACCACGT GGTCGAGGAGGCAGTGG

C.GGGGTCGAGGAGGTAGT
21 CGTGTGC. TGCCTGGAACAC.AATGAGGTTCAGGG/sCT
AAGCCTGCCTCATCCTTGACCAAAGCAGCAGCCTCA CATAGTC.AC.GGATGCTGC
GAGGTTCAGGGACTTGTCC

CCACGATCACATTGGIGG

CCATAGCTGCCCCTGGT
21 ACCGTCTATAGCGCC.TTIGGGTCCATGACTGCTGACC TGGGCAGTAGGCTCGGAGTGICACGGGAGCCACAA

CCATGACTGCTGACCCGC

GAC.CGGGGGAGATCATG

CATCCGGCAAGCACCAT

ITCGIGCCAAACCAAAAAG

TCCTCTGAAGGATGGGCG

ATCAACAGGTGGGCHITTG
58 TGT CAG CATCTCGGACCai AAGCC T

TGGIGTATAAACCGCAGT GTAGCAGAATAGGGCCCC

21 TGCTGTATGCC.TCGCAGCGC.TATCCTGC1GCAGGGC
CAGTGCCAGITCC-AGACA

TGCCCATACCTGOSGG

TGGCCAGGTGGACGCA

GCAGCAAACACGCGGC AGACCCTCGTGAGACCCG
21 ACAGGCAACTACATGGGGCCGTGTCGGATCTCGGTC AAGGGCGCACACCACTCCTGTGATGGATCCAGC.GA
63 C.AG CT CGCTGGTCCTGTGTGTCT
TGTCGGATCTCGGTCCAG

64 CAACCCGGCAGAGCTCCAGGTAGCCTGAGCCGCACT G TCCTCGCT(SAGCCAGG
GTAGCCIGAGCCGCACT

CCTAACAGGGICATCGTCC

TCGCCGCCGTGACTCA
21 GGTCCTAACTGGTC.AGGGGCAAGCCGGTCMCACGA TGACGGATGTCTTTAACGGCGCAGGATGTGTTGGC

AGCCGGTCAACACGACAT

CTCGGAGAAGGCAAAGG

C.ACCAGCCCGITCACCA
21 CCTACACGACCGCC.AAGGGGCAGAACGTTGICGCTG 1 ACTGACCCCCTTGAGCACGCGAAGCCGTCTCTCTG A GGAAGG TCAAGAGCTGG

TIGTCGGTGA

GTGICCTGGIITAAGGCCGCAGTGCTGC. TCCAATGT AGCCTAG TCAGAGAGAAC

TCTTITGGAC

CGGGACAGATIGTCTTCCA

GCCGTGCACCTGCCAT
21 CCCGGACTCGTITTACGGACTCCCCCUTTTCCGCATC TCGATGAGGGAGCAAACACCCGCAC.AGICACGGGG
AGACTCCTGCCTGAATCA

CCCCCTTITOCGCATCAG
21 CAACGGGACTGTCATGGAAATTATTGCCAAAACAAC GTGGTC.AGCAGCAGATAGTGAATC1TGAGCTCCAA
CAAAAGCAATAAAGTACA

ATIGCCAAAACAACGT GT

73 GOCKi CT GAACGTGACTACCCGGCGGIGGACAGCAGT CGTAGCACCACGTGGCTGCTCCAGCCCCTGCA
TACC CTCFAGGGCCGACCACA GGCGGTGOACAGC:AGI

74 CT MCI CTCCGGGCACGIC Tr AGT TCGGGGCCIGTGTC GAGTi TCCAGCCCTGGGC CA AAA Ci A CCCTCCCCACCT

75 'FGGAGGGGGGCCAAAGAGGTACCACCCCCACCCACA GC CCATGGACGCICACACC
'FACCACCCCCACCCACA

TCCGGAGTCACAGACTTGGCCTTG
76 crcr GG GGAGCAGCCGGGAACT
CGGGAGGACACCAACCCT

CTGGATGAGGGAGCGCCAAITACAAGGGIGGGCTA GTGGCATCCTAAGGAGGG

"FTCIGGAGIGCCTiTCGC
21 TGGCCAACATGACITC:CGTCAAAGGAAGGCGCTGIA
CCAGGGICCCCACAGTGGAIGGCTCTGACCAGCAA AGGAAGGCGCTGTAGCi A

21 CAGATCCACCACCGC:ATCCAG T TITCTCGCCCCCT ICA
GGGCAGITCCSCGTTGC:GCACCCACi A IGCCTCAC:Cf TTTCTCGCC:CCCITCACT
21 CAGACGCCCCC:AOCCCIAAC:AGGCACA CrACAGCA A TM CTAI
GGGGATGAGG
SO AGGCGACCCCATGATGCGGGGTGATGCGGACX:TTGG TCA CC
GGTGATGCGGAC.CTTGG

AGGGGTGGIGGATGT

CCTTGCCGTGCCCTCT
21 TGGCTCCiCrGC7 GTGACCCCIGGCCA TATCAGGCAA CA CUT TGOCCiG
IGGGGCAACCITCATCACGGGGC CICCiCC.AT A TCAGGCi AAG

21 CGA GGGTGCCGCCITTGAGA CAGGGICTC1CiCiTAA AC
CAGGGTCTGGGTAAACAG

GCAGGTGCTCCGAGCT CT

84 TGGCGGTGCCGGIGAATC-CCCAGCCT. GCOGGACTT CXACG
TCAGCCTCCAACAGGTGC OCAGCCTGCCGG ACTT
21 CGTACCTCACCGCCAGC ICCGGACTCCiGGAGCCITA

GTCGGCAAACAGGGGC

AACGATGGAAIGGCCA CCA OG GGCAA TAACCGAG CA CXTCCITGTC:GT AGAGC
86 AGCG GCC1TC.A GCACGC.CCTCTGGGAA
21 CGCTCCT ITC; TGGCATCACCGCCTOIGCAGAGCCT t G
GCACCGTC:AGOCACCIGTGOCITAGGGAGGIGGC
37 AC CAC AGTTGGGGTCGGGCCT CC.
TGTGCAGAGCXTTGAC

CACGTCACGGGGAACTG
21 CGTACGTGCGTGTCTTTGCCCA GGAAGC.TGTACGCG
TGTGTAGCACAGCAa:AC

CTGAGCCGCGACCAGT
21 GCTACCTCATGTTCAGGGCCATGCAAAGGCAGGTOT TGG/s CTCAGCCACCTIGTCGTTGAGGACCMGGTCG GC-AA AGGC AGGTCTTTCTC

GTGACGAAGGGCCCCA GGCTGACCCCGGCAAA
21 TGCCAATATCTAAGTIGCC:TTAGTTGTTITCCGTTTGC
ITTCAGGCTITGCAAACACAGTGTATTAGTIGCTCT TGAAAGTTGTCITGGAGAA

GITTTCCGTTTGCTGCAT

A AAAGACATIGTCCTGCAGC

TGITAAACTCCAGTAATCG

TGGGGAGIGGITATATTA GTGTGCTAAAAACAGAGA
95 C.AGAGATACC TCCC ATGC TACC

GIAGACATTATAAACGAGCAGAGCACACTTCTAAAT ACAGATGAAAGTATATTG 'TGGCTGATGTAGATAGTAA
96 ATAGTAATGC 1 GATATC.C.ATTGTGA GCATA TGC
21 AACCZTAATAGGCTCATGCAAATGITTAGTATTTTAT MGGGIGGGGTIGTC.ATTICATTTTGCATCAGCTA
TAAAGGGTATACCAAAGA TGTITAGTATTITATGGGC
97 GGGCCTG AC.GG , AAAGC CTG

AGAACTGTATGA/NAAGA

GCTGGAAAAAAAAAGGCTACACTGACACATTGTCTT TAAAGACATTACTTAACA
TATAGCAACGAACGATGG
99 GA ' TTCCICTTTG GTCCC A
22 GGTGTCCTGGGTGCTAGACACAGTACATGTGGGTTC AAACX:TACTGACC.ACGGACCCGGGTTTCGTTGGTGC
ATATGGGGAC.ACAGGCAT GTACATGTGGGITCAAGG

CAACATTTGTAACATTGTG ATGATAGTGIGGAACAGC
01 C.AGC.G GTTCCGC GC G

TTITTGTTACTGTGITTCTT GTTTCTAGCACTATATATA

GCACACAAAATGTTACAG

GGAGCAGATGITCCAGIG

ATGCTGGTAGTICTAGATT

22 GCATTTTCAGTGTCATTAAGCTTGTCGGACAGCCTTf GCTGTTTGATGTTAATCCTGGTGAGCAACCAATAAT
OS AGGAAT ACATAGTEGTGT AAGGGTCTTGAGGTAGGC
CGGACAGCCTTTAGGAAT
22 CCAAGGGAACGTCAGACCTATC.ACAGG1TTTGGGGC 1 TATCAGAITATTTCGAATGGCTGCATTAAAACATTG ATACAGGATGGTGATATG
06 AAT 1 cicraia: GTAG
CACAGGITTIGGGGCAAT
t ACTGGCTAC:AACGIGCACAGGITACAAAAAGCAGA
07 ACACCT I TTGCCC , TC54.11CCCCTACA.AGC It *TCTITA.TGCATCCACACCT

GGTGATACCTACCGCTTITTAC.ATITTCTGC.AGGAG MAACAGCTGAAATTATG
C.ATGGATGCTACATTACTG
08 TTACTGG ' GAGGA GCC G

TGCGTGTCATGTATGTGTG

CATATCCMCGRACACAG
ATTIGG GCCAAA GT
TGCCTGGCAGTTATTIGG

GCCACAAGCTACATTAGA GTTTCTGGTATTGTTAGAA
11 . TIAGAAAGATGG AAGCAC CC AGATCG

GAAAGCTIATTGTAGAAA GCCTACTGCTGAAAATTTG

AAAACGAAAGTATCTGACAAGCCCTACAGCTICCAG GIATCAGAITTAATTGATG

ACGATGCTICACAGGGAA

CAAAACATTACTACAAAC ACTGTAGGATTATATGCTT

ACTGAAAATGCAAACGCA C:AAAGCAATAATCAAGCTA
is AAGCIAAACA TO GITTCCAATC G AACATCI
22 TGAAACAGTITTIGGCi ITGGG TAT ICGTAIGCAGCAG AAAAGCTGGCTA

16 AAA= GGCTTTAGC TIGGIC
CGTATGCAGCAGAAACCT

AGAAGAJTC:CAGGTCCCCICAGCATCi T CCIGTICCCA GCCGAAGAATCAC:ACAGG
17 . ATC CCTCC A
CGCCACACCTTCCACATC .

TTCCITIGGAAACITTACKTOCAAGGCCIAACGACT ACICITGCAGATATITTAC

22 CiCTATGTCTTCTTGGCCATGIAGAAATGTA ITIGAGG CIGACCCGAITCCTCCCAGATTI
CAA i AACTACAGCA ACTA IGAGATATTGACAA AIGTA 11TGAGGACTIGTC

22 TGCH AAIGCACCF TITGTGICAAAGTITGGTACGTTIT ACiCCCAATTICGACIATFGTT
GAAAGCATATCTGAA CIAGATCGCCCAATATATA GTTIGGTACGTITTAGCAG
AGCAGA GICiTCATGT GTGAAA A

AATTGCCITTCAGATGACACTCiTCTACTGGCTTTGCA AACTACTACATATCCAGAT CTAACTCTATACCGTACAC

TAACCICATCACTG CCATCCGTIGTTF HATAAA

GCCZATACAGAAAATCOCA
22 CGCTCTGTTAAATCTACGYTCCAAAACCiACiATGTCCT ITTCACiCTGAA T TCi ICCCAGTITTTAGCAGCCCTGTI CTCAGAG TATTGAGGATA

ACGAGATGTCCTCCAGAA
22 AAC:ACGCAGG !TA TOTTCAGGATCCTCCTATIGTGCA 11GCGGAACAGCA
ITAACAGAAC:AACTTGTAGCTCC ATCITCCIATIGTGCAAAG
24 AAGTC TTGT GGAGACTACTGC.CCAAGA IC

GCITGC:CTGCAGTGITGGAAACCGCATCAGGCTGAT GC7ACCT67GCCATAAACC CTGTGCAG IGAGGA.AAAG

22 TCTCCACCACCAGCCITACTGTMACAGAACCAGCA CTC:CAGGAGCTTCTGTTGGGCCTTCACCGTCGGCCA
26 CAGC . TAG GAGGACGGTGAGCAACCT
CCTACAGAACCAGCACAGC
22 TCC.ATCC:CCGGGATGCCTACTATAGTGATGAGGACG ACAGACAGCAGGAGGC.ACAAGGGCCAACGOACTA
TAGTGATGAGGACGAGGA

22 GICGCGTONTATCCACT. TGCGACGTAGACTGTTTGA
GC.ACAGGGGGGGTACAGGAAGTGCIAGTAACTGC.A GGCAGAAGCTCACAAAAA
ACGTAGACTGTITGAGCTG
28 GCTGC CCC.G GC C
22 TGACACCCTCTGCAACTGTGAAAAGTAATAGGACC.AC CAGCAACACIGICAATATGCACAATCTCACTAGCAT
TGGAGCTAGCAAGACAGT AAAAGTAATAGGACCACG

CAAC.ATATGCTCATAGAA CCJNAATAAGACATCCAGC
ATCCAGCG GCATTCTTCT CCTC G
22 TGCTCTGCCCGITTGTAATGGCCAAGCAAAATATGTA GGC.ACAAATGACAATGTC.AGAATGACCGTACTATG
GCCAAGCAAAATATGTAAA

GCCGCAGCCATTATGTGATGCGGTATCTATATAGCT ATATATGGACCTGCAGAC AT ACTTTTGTATGAGCCTG
32 GCCIGTIG CCAGCAC AC 'FIG

ATGACAACAAATACCAAT TACiACTGAAGACAGGTGGA.
33 C.AGGTGGAA TCGTC a:CT A

ACGTTAAGTTTGATGGCGA

TACTGACCCCCCATACACC
35 CCGAGTCGCTGACIGCTTATGCAACCCGACGCCACCA C.ACTGGT , G
, AACCCGACGCCACCA .
22 GCCAGGTGGAGGITACITTGTFAAGGTATAGAITGC GATaITCAACGTGCAGTAACAATCATGCCACAGGG
GGTGATCCAAACAAATTA ITAACIGTATAGATTGCACA

TGGCG1TACTGOIIILT:ATTGIGTATGTATATGAT GGAACOTTAAGGGACATC AGTGIGGAGACCITTATTT
37 CaTTATTTGC CAGGAGGCC TG GC

TGCTGC.TCAATIGTATATA

CITGTGTTGTAGTGTGCAA
22 GGGATGTATCCAGTGCGCCC.ATTCTTTGGGGGTTTG

TGGGGTAGCTTGGGTGTC TTCTITGGGGGITTGGGC

CCAACAAGTACATGTACAG

22 CACTATAGGbililtiCCTGAITGCTTCCCGACTGITA GTTCCCAAGGIGTCTGGATATCAACITATTGGGATC
CAAACATATATTATTATGC

TTAACTGIT

TGACACTGAAAACACCAAT

CGTTGCAAACCAATAAGT GATGTGCCTTTAGATATFG

22 GC.AGATACC.ATTGTTAGGGCCATCCCAGITGMAAC 1 ACTACACGTAGCACCAATATGACTGCATGTACTCTT ATCCZAGITGITTAAC.AAG
44 AAGC 1 TAAAATTCiGAAG CCTAGTGGGTCCATGGIT C
t GIACTGFIGTGTATGIATGGGTGGCACACAAACATG CCTCTACAAAACGTAAAC
45 C:ATG 1 CATAAGGAAA , GGT
CCAAACCITICCIAAGC-ATG

CCGGTTTCGGTCGTGCACAAGCGCGAAACAGCTAA GTGGTGTCCTGTATGTGAC
46 GACC ' CAGT GGTTOCGCCTTGTGAGTC C

CCCTAATAGGGGCGACAC

GCCATGAATCACTCCCCTG

GCACCATGAGCACAAATC
48 GTCGC GCG Cl AAGAAACACCAACCGTCGC
22 GTACCCCATGAGGTCGGC.AAAGTCGCGCAACGTGGG COCCAGAGCTCTCGCGCATCGGGTAAGTTCCCTGTT
CAATGACCCCCGGCATAG
49 . T GCA G 'FCGCGCAAail GGGI

GCATTACCTGGCAGCTCC
50 GT C A GCTGa GTCCi CCACCT

CGTCATGITCGGCTIGGCCTGGCCAACAGAAGGAT ATGATGATGAACTGGTCG CACGGCTACCATGATMG

22 TCGGGACATCCTG1'CGAGTTGACC1TGCACACCGGCT GGATGGGGCGCCTFGCAATATGCCAGCAATAGGGT
GCCCTGAACTGCAATGAC
52 TIA MCAT r CCITGCACACCGGCTFT A

AGCACCGCCGCATTTGAGGTAA ATTGAACAGCACTCGACC

GITCGGCTGCAC
22 CGIGGIGGAGTGCAACAAAGGAT ICACTCGTGGGCiA CST TGCTCTIACTCGGACCI
GCACGATGTTTI GGIG
54 TCGT GAGGTG CTCACGGCTGC.ATGCAAT
ITCACTOGIGGGGATCGT
22 CCCITGA IGTACCAAGCAGCCACAGCTAGCTGC:AATG ATICaTCACMGCCI
GIGGTGCACAGATGCGTCA t CAGCACIAGACiAAGGT GC
55 . CiCT AAGCA IC
CAGCTAGCTGCAATGGCT .
22 GTAAGCAGGCCCAAGCACCGCCGICGCCATAITCIAC a AAAAGGT GCTITGACGCGCGGCACA
TCCICAGTA CCM GATGGCATC:ATATG

CCGTCGCCATATTCTACCC

FOCI GCTGGGC GICATIGICIGGGGAGai AGACAGCTGCTIGTGGGG

22 CGTGAGCCGGCCAGAGTC111C ICGGGGGTE TT GTG TCACGCAGATG TACTCCACi I
GCCGTGCACGCiaCCA

TCTC.CIGGGGTITTGTGGA

TTICATCCCCGTTGAGACA

ACCACCAICMCiGCATCGGGC.ACAG1TAG ICTG ATM GGCAAAT TCCTCGCC
60 CGCCG ACOCCA CGCCCATC.ACErFACTCCA G
22 CGACCACTACGICTCC:CTGAG I TCCiGGG TATGCiCiCIT
CAOCGACGCCCTCATGACAGTAC:GTIGCAGTC:GATC AAAAAGTGTGACGAGCIC

TCGGGGTATGGGCTTGAA
22 GGTCTCC:GATGGTGIGAGCT CAGTGTAGTGCTCTGT
CCGTCAGGCTC:AGGGCGTAICTGCCTCCC:AAAACI C TAIGTTICCACIGGIGAGC GTG TAG"!
GCTCTGIGAGTG
62 GAGTGC AAGA Ci C
22 CACGGGAIGTGTCAGGGIGACCACACICGIGGGCCC ATACATCGCCACCTGCATGCAACCTGCCAAGACCC:A
IGTITGACTCCiACICAAGC

ACACTCGTGGGCCCCA
22 GCCTGCACAGTGGGTTGTATGTGCCGAGATGCTGAA TTC:TGGGCCAAACACATGTGGATGTTGATAGTCCTG
64 GTCC . CGAGGT GGGCGGCTCTCATTGAAG
GCCGAGATGCTGAAGTCC
22 CGACGAGAGaCCCGAAATGCAGGTAAGGTGCTAGT TGTCGTCAACTIGCTGCCTGGACAGAATGGCCGCGC
CAGIGGC.CIAGTGGGAGC AGGTAAGGTGCTAGTGGA

22 C.ACTGGCCCTCCGTGTAACAATCATGAGAATCACGG CGMACC.CGCGCCAMCITTACC.
TCCGCGTAC.TCTG

CATGAGAATCACGGGGCC

TCGTTCTGCGTTGGGCTTA CAGCTTCCTTGCGACCC
22 CCGGTAAACICTGGIGGGAACCCGACCTTGAGCCTTC CCTGGGCACGGCC:TG11TACCGG1TaITAATCTGGC
ACCCAATEIGTCGAAGAAA CGACCTTGAGCCITCGATA

CCTGATGAGTTGGCCCITT

22 CC:ATCACCGMTGAGGAAGITITITACCCTGACCTCG
TGGATTCCAGIATICC.COCGCTGGTCCTIGTITTC.CG CAAGOGGGGCAAGAAAG
71 (ICG OC:C C
MACCCTGACCTCGGCG

GGTGCTCACCACTAGCATG

rccc CCCCAGACCGGAGTATGA TCCTCAAATGTGTCTGTGG
73 GTGGC AGCCA , TC , C
' AAGGTTACATGGGCTTGA CTCTGCACACATACACTCC

22 CAGGCTGCATAGGCAAGGCGGTTGAGGGCAGCC.AG 1 CGCGATTGATGCGGAMGCCGTTCCAGCACCGTTT CTCAAOGGTCTGCTGATG
75 TM ' TGC C

22 GCACGGGC.AACTACCACCAAGCGGTAATGAGGCCGA CGAGGTGGGCGTAACGCTTGCAAAGTCCACGCCAA
GGOGGIGATTTGIMITC GCGGTAATGAGGCCGAAG

22 ACGTCAAAGGCTGCCACCAGTC:TTCGTCAACGGICAA TCATGCCCGGGAAGAGTTTGCCGGCTTCGTCTTCCT
TCTTCGICAAtik-TCAAGT

AGAGGATGCGGTTTTGTTC
78 GTTCG AC CC.AGCAGGCTC.AGTFCG G

CGAAAAGAAAATCGCCCTC

GATTCCAAACGCGGTGCA C

ATGTCGCCCAAGITTKAG

22 CGCGC-AGTTTGCCGTTGAC1TGACGAAGCCGCCC.TG 1 GCAAAATCACCGTCGCCGCCTCCITTGGCGAGTGCG

ACGAAGCCGCCCTGGT
i GCTCCCAAGCCGAAACGTGATATCGCAAGCCGCITT
83 .ST 1 CG , CCAAGTCGGCIACGCAA I:
GCICITCGA IGCCCII GT T

84 T ' GTGATGCOGCCGCCGACTTCTATCGCGCCGCCAAC
GTTCGGGTACGCTGTCG ,XGCTGCCGTCTIGGT

CGATTAAATCGTGCTCGC

CCGAACAGGCGGTAITTGC

TTGTTGCGATGTGATGACG

22 CGCGCGGCAGTTGATACAGGCGGGTTCTGCCTTTTC.A CCGCCCGCGACAATGTAGTTGGCACGAACGCTTGG
AATAAAACTACTGCGCCG CGGGTTCTGCCTTTTCAAA
87 . AAC AAG CA C

88 GC GG GCGATT GCGTGCCCiA
AGGCGGACACCITGri GC

GGCAGGGAATACCGCCCCiTTACG TOCGCCAGTF CG ACAATTICAGCCACAACTG

TCATATCGGFCAGCGGTA

CACACCAGCGCACCGA

GCAAGATAAACCACGTCGC

TCACCGCCACCAAC:GCGCAAGGACIGGCACC
92 CA G CTTGGTCG1TTCC.ACGCT
TTCCAATGCCTGCGTTTCA
22 ITCGTMCAAGCCGGAC:ACCAAAAACCATGTOTCGT
93 , CGCCAGTTGACCGAATCGGGAACTCTTGCCGCGTTCC GGC CGGCAGGCTTTCCTCGA
AACTC:TTGCCGC.GTTCC .
22 CCICAAAGACTACCGCAGCAACGTCAGC6GTGC:AGGC CCOCGMCGGOCAACT CGGAAT
GGITTGCCOAAC

TC.AGCGGTGCAGGCGA

GTAACGCGCAAACCOCG
22 GCATCCIGCTCGACAACATCCTAGGITTIG IGGAC:Al (..tGCTITCIC:CGTC/GAACAGTICGTCAGCCIGTCi ICC AGGTITTGIGGACATAGGC

22 CGAGGCTTTGGGCITGGGTCCTTITGGGCCICICAGGA TGC.AGGTGGAGGTAGCGCGCAAACCGAAGATGCC

TTITGGGCGGCAGGACG

GTICATGTICGTGCCITCC
98 TCCG TAGAGC.AGTTCGGCZTGIGCMCTCGCGCTCGCCC
GTGAGGCTGGGCGGAT G
22 AGACATTGCC:CTC:CCC:GAGGGTTTTCCAGTCiCGAACG GCCGCCCE ACCGAC TTGATAAF
GCGAAGIGTMA 1' GITTTCCACITGCGAACGC
23 CGCGTG TCGCCCAAAA.TGACTIGCCICCGCCGAACT
00 ACGGC.GGCGAAGAGAAAGOTITCTTGGGCGCMGC C GTTGCCGAGCCAGOGA
TITCTIGGGCGCGGC

cma:Gcr GGC.ATCGGACGGCAAAG
23 GGCAATTTGTTCGGCAATGGCGCGAAGCTGC.G1TGG TTATGCCGICTGTCGAACGGCGGCAGTCATCGTCGC
02 GC . TIC AGTCGTCGAAGCCGTGC
CGAAGCTGOGTIGGGC

GCCAGATTTGTTCGGTGG

CATACCGGGTTCGCCGA

GCACGCCTGCCGCTAT

OS CGTGTGCATCGGTCCTGCCTTCGGCGGCGGCAAT ARC CGTCCGCGCA.GGTTAC
TTCGGCGGCGGCAAT
23 TGCCGCTCGCCAAAGTCGGCGCC:TTGTTOTTCAGGG CGCCGGTGGCT1TGGAGACGTG1IGGAAGTCIAACC
GGGGACGACTICTTITTCC

CGC.CTTGTTCTTTCAGGGA

GGGTTTGACCGCCGTAAT

CGCTTMCAAGGGIGGCA

CCGCACCIGTCAGAATCG

ATCCALAGAIGGAIGTGC

ACiAAAATCACCGCCATCAG
GC GGCGITGarAGITGCAGGT6TCCGGACTCGCCAC TGT1T-GGGCGTTCGICT
C
23 GCGCAGGTGGGGCAGGTAATCGCCTTAAGCCTICiCC CCTGCCJUVGIGCTGGTCGGATTCCGCTGTTGCCGGT

CGCCTTAAGCCTTGCCG

GAACACCTGCCCCGGTAT

13 CCGATCATACCGGTCGCGCCATTGCGCCGCTTCC.AAC GACCCCAGCTTCAAAGCCGCCCTACCCAGCCGGCAA
CCTGCTGCCCGTATTGGC ATTGCGCCGCTTCCAAC

GITCCGACCAGTGGGGCAACITTTIGGTTCAGGCG TACCG AG TICGCCTATTCC

TGCTGCAAGGCTACGACT
23 ACGGCAGGGCGGGGMTGTCGTITTCGAACCC.CAA ATCAGCCCCCTGCOTACCAMACC.CAGGCGGCATCG
TCCATTICGAACCCC.AACC
CCC A GATTGCCCGTCGTGGC

AATCGAACACAGCTACGCC

AAAAAGCCGAAGCCGAAC

TGCCTGCCTCAACATCGG

19 Cl GTGC AGCACTTGGTCGGCTTTG
AGGCAGTCGGCGCAGT

ACCGCGCCATGCICITGGCGCGITGGCGGCTITTIGG GCA AAACAGCCCGCCCTGA GCGTIGCCGGLI
t it, GG

CGAACTICACCAAACCCAC
23 C:ACG 1 UGC GCGCAGGCCCAA AAAC T

CCITTIGGGCOAGCCG

TMIGTTGGGGAACGGCA
23 CATT C GCGTTACGCCGCTGTCA n-GGCGTTGACGAGCAGG TCGCGGATIGCCIGCT
23 GGCTGAAAAACGCCGTCGAACTTC.AGCGTCGGGTGC COCCACGGCGTGTTGTCGACTATCAAGGCGGCTGC
AACGCGATITCGTCGTOG

GTTGGACAGCAGGACTIT AAACGGCATCGGCTTCTTC

TGAGGCGGCATTGCCGTACCGTCC IGTGCATAIMAGCGGCA
27 CGCAG G GG AGGC:CAGGCAGAACAATCi GC

CGATCGGACGCGTT ITGC

AAACAGCAAAAAGGCGG

CAAGGITTGCCAGCGCG
23 GGGACTGACGGIGTCGGIATTGCGCCATAGTATTCG GCCGCCGICTCCATCAGCA TGaGTai IGCi TAT TT C
CGGAC CC AAGTGICTTIGTCGGCGG
CGGCATAGTATTCGCGGAC
23 CGGCAC:GCAAAACCTGIACGC:GGCAGCACCITITC
GCiCGCAAAGT MGM AA

GACGTTGAGCCGCIGGA

GCATTGAAGGGCGAACaiAACTC:AAAATO TCAATCTGITCGGGTGTCA

23 CGGCGACGGGGCAATTTGCAAGGCAAACiGCGIGC
33 C.GTGCGCGCCTACCAGATTCCTTGCGAGGCTTTGCG G TCGCCAGCGAGGITTGA
CCTTGCGAGGCITTGCG
23 TAIGGCTGCACCiGCCAGGCCCAGITG ITCGGACAAA
CCiCCGTAACCCAMCCGCTT CGCCiCATGGC1 CAAC CCAGITGTICGGACAAACG
34 CGC CTC TGCGGCGCiAGGTTITGC C
23 GGCACATTTTTC.ACGCTTGCCATGOTTTGICTECTTC CAAC.ACGCTGTTGGGTATGGGACCGGTAAAGCCTG
GCCC CTACG GGCGATCATGTGGCAGTG

23 CCCAGCAG TT TCGCC:GIT ITGTCTACGAGCTITT GGG GGTCAGGGGCATAGGOCi 36 CGAG ITTGCC.GTATCCGGGCGGTMCGCTTCGGGCAGIGC A
C.TACGAGCTTTICICiGCGAG
23 GGIGGAACGGAAGCTCiCTGGA CAC:GCCGCACATC
37 TGACCGCCATGTACAAGCCGCTGCCC.GCCTGC.ATGA C GGCGCGGITTTGCCT
CTGCCCGCCTGCATGA
23 CGC:CGT1TATGACGCCOCC:AAGGCGCCICACCA AACC CCGCTCTFCGGCT TCAMATG6f GATGCTCCACCAC:A
38 A GCCTG C.GGCGGCATTGCGTT
GGCGCGCACCATAACCA
23 GCTCGCGCGAMACTIAAACT-ACGMICAGGCCiGCC CGTAGCA TACGGITTGT-G

CGTMCAGGCGGCCr.

GTC.AGGTAGTCGATGICG
T ACG GA
CTGGAGCTGITCGTCGGT
23 GTCCCGAAAGCCGCTGCACCIAA ATCCGTCiTCGATGC
GCGTGCCGC.AGATTITCATCGMGCiCCIGTAGAG GT

C.GMATCCGTGTCGATGCA

CGCMCACGACACGCT
23 CGATGACGGIGGTGGCAACC.AAAAGGCCGAAGTCAT AAGTCGGCGTAGATGTGCCCAAAGCCCATGCGCTC

AAAGGCCGAAGTC.ATGGC

GCCAAGCTCGAAGAAGAC

ACACCCGAAACCCGCAGCGCGCCTGACGGATGCTG C ACCC-ACACATCGGATTGC
CGCCTGACGGATGCTG

OGGTGGCGTGC.ATACC GCGTATCATCGGCGCG
23 AAAGGICGATGACGC.GCACGGGCATCGCCATGAAA
47 CC.GCCGACTTCCGCATACGGCGTGGGICATGCCGT GAM C.GTACACGGCTTGGCAAA
GCGTGGGTCATGCCGT
23 AATCGGCAACIGCGG1TCCC.CCAGATG1CGAAGTC
AGACGATGTCGCATTG Tf AAACCTGCCTCGTCGGG

AAGOCTITC.AGTGAAGACT

TCTCGGCAAAGAACGTAC

ATGAACGAAATGACCGCT

MCGGCGGCACAGAAG
23 GGTAGGATGCGGAMCC.1GCTITTGGAAC1GIGGCC C.ATGTTGCGCGGGATTICTGGICCATTGCTGCTGCC
CGCAAGTITACGGGNOT MGGAACTGTGGCCAAAC

23 GCCCACGTCGITTATCalCAAGTGITTGAGCCAGATT
TGTTIGAGCCAGATITC.GG
Si TCGGA GGGCATGACCGCCGCCATATCGAAGCCGCCGACCT
CGATATCGCGGCGCAA A
23 GCAAEITCCTTATGCC.CTGACGAGACCCAATGCGAG AACGTGGICGATGTGGrrGCAGCCCATCTCCGACAC
CCAACAGCGCGGAMTG AGACCCAATGCGAGGTAG

ATACACGGGATAATCAGC

CCCXGCCAT/ACIGCG GC

GTAGGTAACGAGCAATCC CAAGCGTGGCGATAATCG

AATCGCCGTCTGCCCA
23 GCAAATATACAGCCGCAGCGGTTCATGGCAACGTCA TTGATGCCGACGACCTGTCCATC.AATCCGGGCAATT
TTCATGGCAACGTCAATCG
57 ATCGG CCG G1TTFTC.AGCTGTICGGCG G
23 MCCGGTTGCCGATACGCCTGTATTCCATC.ACGCCG 1 GCTTCGCCGTCGCMGMAAATCGCCGITCTGOT

TGTATTOCATCACGCCGTC

'ITCGCGCCC.AACCGTTATTIGGTCGCGACGGCAAGC ACCTG TCGTT/TATIGGGC1 59 GM 1 AG TGCTUCTGCC:GCGCA
23 CGGTATTCCTGAGCGTCGGATCCGTTMGCGCAGGTT AAAACCCTGCACCGCAGTGATCCGGC.AATCTGCACG

CGTTIGGCGCAGGTIGG

TCATCGA

a..GCAAAAAGCCGCCG

23 AGCCATAAGCTGC:CCGMCGGCMGCCGACCACC ACAGGTG1TCG
TCAACGT

TCCGCCATCGAC:GC1 GCCGAGGCiCCiAAIGTG

66 GCAGGICGTCGICGCTGIACCTGCTGCGCGGGTTCA TGC:FCCFCCTCGCCCACGATAGAGCOTITGGGCGCC
CGAAGCCGTCGTTCCCT CICCTGC.GCGGGTICA

CCTATCAGGCTCATACGGC
67 CG AGACGGAI GCGAGGGCG1 ACGACCIGCTCCT MCC:
CGGCGIGGCTCAMGC

GGGTAOC1TCCACGACGCGTAAAICCCCGAATTCGC AGT TGCIGT ItAAACCACG

69 CCACGTIGGCACGCCTGTGGCATCCCITGCX.GTCCA CTGC.GGGCAGTTCGTTGGTITCGTCCGCAGCCTCG
GGCGACGATGTTGCTGT GCATCC.CTTGCOGTCCA
23 CCICAAGAACACAGGCAAAGCCCGGCACTCTGC:ATTI 1TTGGGCGICITCGTG
ItTTCCACCIGCGGACAATACA TITGTGITCCAGCi TAT GCG

C.GGCACTCTGCATTMGC

GCAAGGTGGCGGCIGICGACAACGGACAAACCAC1 GCiA TGACGGICAGTGTG1 72 AAATCCCGCCCTGCTCGCCITGGACGGCGTTATGGGT GCCGCIGCMGGGC1, ii, 73 CTGACCGGC.ACACCCACACGCGGGC.AAGCTGACGT CCTCA AGGATGGTC-AGGAOGGC
GCGGGCAAGCTGACGT
23 A TGAGGAAGTIGCAGI GICGGGCCTC.ACGACAGA A
TGTCGGACAGGATGICGA
74 ACAICZGACGACC1CGAAGC1C6GGACIsGGGG TGGA ACOGAC
TCGGGACAGGGGTGGA
23 CCAAAACCGCCGTCCTACACCAGCiATTGCGCTCiAACA
TGICCAGGCTGTGICCGAACiCGCATCCCGGCATCA CGTATGGCGGGTAAATTG
TS GCCT GC GC
GGATTGCGCTGAACAGCCT
23 CAACAGCATCTCCGTCAGCGACITTGACGGAIGCCTG GGTGC0116TTCIGTTCGCCACICTIGGCAC:GCATC
76 GC AATG GTCAGTTCGAGC.GGCATG

23 GCCATATICGCAGGATTGCMCCGGGGGAAAAGGAG GGAGGCGGTCAGTA'FGCCGAAGAGGCTICAGACG
CGGGGGAAAAGGAGTIAA

TATAGCTGACTTCGACGG

TGTACGACAGGTGCGGC

CTGCAACCGGGGATGC

GGGAAGCGGCCGATGTG1TTEGAACACCT. TCTTCCG C.GGG1ITT11GACGGTTCA

AATGCGITTCAGGCAIGTA

TCGGCAATGGCATGCATG
23 GGTCCAGOCCIGGGTGGAAATTGCGO.iteiti ICC
82 ACCCCGGGGCAAACCATCCGCGCAATGCCTGCTCGA GCA GCC.AAATACCGGCGCG
GCGCAATGCCTGCTCGA

TTGCTGAAAAAAGACAGT

GACGTTACMGCGCCAA

ITGACC:GAIGCCACGACC

OGGGTCGGOETGCsATT

86 ra: CG GCGCCGCGTITTTGC
ATTICGACCGCCZATTCC
23 CCGACGGICAGCGGGATGACACCATCCTGC.ACATC
87 ACGCACCGTTGCCGACATTGATGCTGAtitiGiCCGC ATCG GTAACGGCGGCGGTGT
TGATGCTGACTTTGTCCGC

88 AGC.AGGCGCGATTACCGGCTGCTGCGCTTGGGCA ACGCCATGTTITTCGGCGGCTCCCGTCATCGCCTCC T
TGCTGCGCTTGGGCA

TGGIGAACACGATGICTT

CCGCGAGACAGGTCGT

CGCACCGTCCACGAC.A AAACCTTCCAATACGCCCG
23 CGACGCACTGGTCGGCAAAGGATGCCTFCCATGACG AACCACFFITCGC:RTTGCCITTCCAAGCCGTTGAC
AGATMGTCGGCATC.GG GATGOCITCGATGACGAG

23 CGAC.AGCGCATCGAAGCCATGGGATAGCGTCCGGCA CTCTTGCGCCGTCCTGCCTACCGITTACCGCAATATC
GTTGTCCGCCCGAAAGTT

GGATAGCGTCCGGCAAGT

CGTCCGATGTGTATTCCCA

GTAAACGACGACGCGCG

C.AATTCGGC.GGTGAITTC CGCTGAATCGGATAGGTC

23 AAACCCAAGCC.CGAAGACGCGGGGTGAGGATGTAG 1 TTCGACAATGCCGGTAATGCGTTCATACGGCATCGT CAGGCCGGTTAAAAGATC

GGGGTGAGGATGTAGCGT

CTGCCCAGITCCAAAATCG
97 I CCIACCGCTGGTICGGCIT GCCFGT TGCiACCIGGGA 1 A C
GCCTGTFGGACGGCi GA.

GACGCAAGGAGTAGGCG

CGGATTCAAACGCGCCA

TGGAAAACCCGCCCAGC

TCGGCTTTGAATAACTGCG

CGGAAAGGGIAACGGICG

GCCGCGG1TGACGATG T CCG TCAGCCA.GAGGGT
24 ACCGAC1ACG1CGTCGCCGCAACCCArTGGCTTrT1C TTITGGACACGCTGCCGGAACCAAGCCCAAGCCCIA
CTICGCTGAGGACGGAAA CAAGGCATTGGCTITTICC

24 GCTCCATA.AGCCCTACCGCCTGTCGCGTTCGATFTCG GAACGGATTCGGITTGOTGGCTCGGGCATCAGCA
04 CTG GGAC CGACGCCITCCiCCITCF
TCGCGITCCi ATITCGCf G

TGCTGGGCGCGTCGATCACCGACCTCCICAAC
OS CC C CTCITCGCCiCAGCITGAG
CTCGGGATGGMCGTi CC
24 CCGCAC 1 OCACGCAT TCFGCGG TCGGCGAAACCAAA =
AAGCCACITC.GATAAAGGCGGCCATTCGCACAAAG GGICCiGCGAAACCAAATG

24 TCGGCAATCTGGA.A A IGGTCGTIT TCCAACCGCTGC:

CGTTGC.GGC.TGACTGC

08 TICG C.ACTTCTCCC.GCCCCCGA.AATCATGCGGC.GACATGG
TGOCAACAGCGMCGAG CGAACGGTATTGCGCTTCG

CCCTGA ICGTGCM CGTGCAA ICAGGAANIGG 'ICCAAAGTGICTATCAG GC

24 CCGATAC:GCGCCA TACTIGGTITGCGCC4:CCAT
T ACAGGCGCOCGAGAAAGACAAACGCCGCGCACAC AACATTGGGTGCAGGGC
GGTITGCGCCCCGATT

ATGTATFGAATGCCGCGAG
11 CGAGC AAACAGGTGCGCCGAC.GGCAGCAACGCCCTGCCT AGGCTACGGGCGCAA
24 CGCGCAGTI GT TCCAGCGATCCiCIA TGCCGAAATGA
CCCTAIGCCCAAAICAGC

CGCCGGCTTCCIGC.AA
24 C:ACCCTG TACGAAACCMGCTGAATTCCGC IT TATC
TGAATICCGCTITATCCGC
13 GGCG CGCCGTGC.GGITGCCGATACTCCTGCA TACGCGCG
GTCGTCGTTGCCGCGT
24 GC4:A.TAATGCGCGCMG TACiGCC:AACCTCAAACCGC

CGC.GCCGTC.ATCATGC CCAACCTCAAACCGCC AG

IS A AGCCGCCCGCCICGMTGGIGITGaIGGGTC.ACG
TTATCGGGCGCGTGATGA AACCCGMTGGCGCAA

ITTCMCCTGCCCGTCC

17 TCCATCGC.GTCCGCCAGTATGGGCGGGCGAGTTGAA GCGATGTTTGCCGCCGTCGC.GCACCACCTTTICCG

TTCAAACCGACCACCGC

CGATAAAGGCGGCGACTT TGGCGGGGRI:miGC
24 CGCACCAAATC.CTATGCCCITCGCAAATCCIAGGCAA
TCGCTGTCCICTTICCGAACCTIGCCITCCGTGATCCT CAAATCGTAGGCAAGCGC
GCGCA TAC GGCCGGATATCGCGTTC A

CGGCGGCAGAAGGCIT

GTCTGTCGTCGAGGCATTCCACGGCGCATACCAGCT

AGCGICCGGGATITTCAAC

OCGCGCC.GA1TACTFCG
24 CGGCGCGARITTAAACGGAAATIGGACGACCTGATA C(.1 24 ATCCCC.AATATGCCGTCAGC.ATTGGCCGTAATAAAAA
C.GACTACCC3TTGCTA1C1CGTTOCAACGTACCGATGA CTACATCAITATGACCGTT
ATTGGCCGTAATAAAAAGT
25 GTATGG 1TG , GT ATGG

26 TGGGACAGCTCGGGGCAAGGCAACTGGACGGGGA AATFCCGCGCGCGGGCATGCAACCACC.ATCAGCCC
GGACTGGTGGTGC.ATCCG GGCAACTGGACGGGGA
24 CTCGCGC.ATATGGACGCGGTACATCGAATGCTCGCTC.
27 G GCCAACCATCCGCTTGCCGGTGTCGCCGCAC.GGAT
ACGCTATCTTGCCCACAGC TAC.ATCGAATGCTCGCTCG
24 GCGTCTGCCTCGATACCCAAACGGCGGCTFIGCCC.CA GCACTGTITGCCGAGCGCGGCTTGTCGAMCCTGA
28 A CGT TGTGGGAAGCC.AAACCGG
GGCGGCTTTGCCCCAA

GGGCATGCCGTCAATATC

CGCAAAO:GCCGTTGC

ATTCGGCTTGCCGCTG

ICGGCAGATACGCCGT

OCTTCCTGAATCAGGTCGG

24 GCCTGCTICCGGCCATICTGATCGGITTGTTGGGC.GG GTTTGCCCGAACGTGCGGTCCGCCTITCTCGGTGTC

CGGTITGITGGGCGGT
24 1 GCAGGGC.AGTGTGTATCCAGTC.GCCTACCTGITCGG
CACAGTGC.CGGATAAAGG

AAGGTAGGCGGGCAGC

35 CIGTGIACGGAATCGICGCCGGGCGCATCACiCAGCAT 1 GACGC , AGGCCGAGG

ACGCATGATAITTGGCTGGCGGGGCAACCACTITAA CTGTCGCTGCTGATTCAG
AACGACGATICCTGTGATG
36 TGG ' ACGCC G G

TCGCCGAAACGGGTAAAC

CCAGCCGTTGCGTGCA

AAGCAACTCTATCCT

ITATCGGCC
24 TTGACGGCCTTCCATTTTGGAAAAGAAGATTTGGCTG ATGGTGATGATGATTGCGCCC.AATCGGAGTAAGCG
AAAAGAAGATTTGGCTGA
39 . AAGT GAAA GCAACTFITGGAACG MT AGT

CGGTAAGGATTGGGAACA AAAAAGCGGTAATAAAGG
40 AAAGGGAA AAAACCif A (3 (AA

TCATCGICGAAGRAAOCGGCTCAATGCGTTGCGTCC

CCGAACAACTi COT TTCCI

TAGCTTTCGCCTGAAACG ACGCAGTTGITCGGAAAAC

GGGCAGGATGIGTTCGAT AGGITGATGCTGTCGAACT
43 ACTG AIN'S G G

TCGC1GCTCATC:AGGCUAGGICGICTI6CCA FAGG

CATGGCGGCGACAAATGC
24 GlIGATITGAAAAAAATGCCGTCTGAAACAGITICTC GFTCITTACGITITTGTGGGGCTFACCAGGCAFCCA
AAACAGTTICTC:GAACGGI
45 . GAACGGTA AACG TTCiCATGAGGACAGGTIG A
.
24 ACACCCGACCACCAAAATIOGGCGATTGCOGCATCiCi AGGCGIGGATITCCGTIACCOTTIGCCGCATTGCCC
46 GC (AC 1TACCCO:GAGGACGTGC
CGATTGCGGCATGGGC

TCGGIATCGGTGIAACGGCGTMCGGIA
47 GCCAC TATATGCAAA CTGC.CTICTTATGGOGAAC
GOTATCICTCTACGCC.AC
24 CACAACCGCCGA.CATCAAGCTITCFMTCGGCGACie GGCGRiGCGA TGAT
TGCCGCGAATCCCCAAACCGC

TITCMITCCIGCGAGCGT

TTCAAACCCTTGCCCAAAC

24 ACOICTGGCTITCCATICGGGTCAATfCCCGAAGCT
TGC.CGGAAGAAGG TITIG
50 TCCTCAGGCTCCGCCAC.ACCCGCTCCTTCGCATGG aiC C
OCGCTCCITCGCATGO
24 CATI GGCAAAAATAGCAGCACAMAITAAA FCC:CAAA
ccrAcc:arrrrlaCiaTGCMGCAC:AAACAGTICif ITAAA FCCCAAAACAGAAA
51 ACAGAAATGAC CG CAGCTCAAAATGTfGCTGT TGAC
24 GCCiTATIGTCIGTGCGCGGATTCICCIGCAAGGCAT
52 CGCACCTTCGCIGCACiACTGOGCAACGACAGCGAGG TCAT GCGGGC.GTGTTTGCA
GGCAACCACAGCGAGG

53 ITGCCGGITCGGAAGCCGCGGTACGC.GCATGACGG CTATGG TTAAATGCCGTCCGCGC
GGTACGCGCATGACGG
24 CGC.ATGAAGATTGGCTICCGCMCTGCTGGCGGAAT GCGGCGAAGATGAATGCCAACTCGAAGGCGTAACG
54 CG . AAGTTG ACAGCAGCITTCCCGCTA
CTTCTGCTGGF:GGAATCG
24 CGC.TTCAAGGTAGCCTTTTGCCGATGTATCTCCGCCG TGCACTGAAGCCGAGTA11CCGAC13TGCGGT1TCG

GATGTATCTCGGCCGGCT

GGCTACGGACACGGCA
24 GCGAAAGITTCCGTNVµAATATCGGTCGGATTAAITT AAGCCCAATGGGAGAAATCGTGGAITTGGGCTTTC
AATCGGGAATAGTTGGAT TCGGATTAATTTGTTCAAT

24 CCGCCGCCCAAGAITGGAATCITCAGGAAAGCMGC. ACGCTTCGACCGTCCAATCCCACCGGTCGGATTGCG
C11CAGGAAAGC1TrGCCG

CCAATCCGTGCAAAAACAG

60 C GACCGTITCGCGGCTGTCF CGCCGACAACCOCCT!
GAAATCCGTGCOGGCC GCATTGTCCGCCACGC
24 AC.GGITTCGTCGACGGCACGTCGGGGAIGATGGCGA CCAGC(CGCGATCCTGATACACGACAGCATATGCGT

TCGGGGATGATGGCGAC

AGGAGTGGTAACCATGCCGACAATCCAACGCGGCC
62 AATC TGAC ACCGITGTAATC.GGGCGG
ATTGIGGCTGCCGGTAATC

GTTAGGCGGCGTGGC.A
24 GCAMCCAATC.CCGACACC.GTGCCACGCCGAAGACG GGCGGITTTGATTTTGGAGCCGGCGCTGITGATTTT

GCCACGCCGAAGACGA

GGTCGGCATTAAAGACCT GCTICMGATTACCCTGAA

TTTGITTGATGCCGTGICC

ATGTCGGTGGCACGGC
24 TGCTCGACAAC.GGCGITCCCGTTCGGCGCAATAAAG
67 C GGTGTTGITCGGCGGC.AAGGACGAAACCGCCGC.AC
AGGCGGCTGACGGGAT CGTTCGGCGCAATAAAGC
24 CGTCGCCGCCATAAAACGCTCGAGGTGTATGGGCAG CGGCTGCCGTGTCGGTAAATCACAGGCCTAGGTrA
68 G ca CGGTTGAGGICGAAGGTG
TCGAGGIGTATGGGCAGG

CTITC.AGACGGCAITGGIC

CAAAAGGATGTTGCTGCC
70 (.3 TGC C
CCGCCCAGCCATAAAACAG

GCGGGCGTIATGACGGA TTGGGCGGATATITCGGC
24 GCCCGAACCGATGAAGCAATOC.AITTIGCGCCTGATT 1 GITCITCGTTICCCCCGAAGCCGGCCITCGACGTTFIT

ATITTGCGCCTGAITCCGA

73 GCCGAITITGCCITTGCCGCGCCITACCACA ICGCGC GCATCGGCC.Cf GCCTACGAAGCAAATCGG(XiGCG CCTGCCCGCTGATCCT GCCTIACCACA ICGCGC

ITGACCACGCCTTGAATCA

CGTCAGCGTAAOCATGPM

ITGCCGGCAACAACGAC

,XCATCCCCITCCITTATGC
24 AGTC.CGCTACACGCAAGGCCGCTCGTTGATGTAGCG TGGTGGAAGGCTTCGAACATCGCACCGTTGCCGAA

CGC7C0 ITGATGTAGC:GG
24 GGCTTCCAACGCGTC.ATCCAGGCCAAGCCTTTGCCTG GGCCGGGTTTCGACCGTGTGCGTTGCCGAAAACAC
CAAAACGTCCAAAGGCTC
78 C CC GCCAACiCCM
GCCTGC

GGAAAAGGGCAGCGGATTGACGGCGA ACAATTACCCGCAAACACG

CCACAACAACCGCATCC

TCAGACGACTITGGIGGC TATCGCGACGACTTCCATC

24 GACGAGTICCACCi ICGTTCGCCACGCTI T11 TG IG 17 CGTTGGAACAGGCGCA
24 CGAAGCOCCGGAATT IGCCIGGCAAAT CMGS:GAGA GGATG
ITGGGCCiCGGGCAT TACCGACGCGGCGAA GGCAAATGTGGCGACA IA

=

AACTCAAACAGGTTGCGG

24 CGGAAT ICCGCGCATITC3C:AGAGGICMAC ItGAAG GICTG IGCCCACA
ICGGICIGACCGCGCCCCIGAAC
ITS GCGG TT CTGC:CGGC.GCGCATAT
GGTCAAACTCGAAGGCGG
24 CIGCCCITICIGCAAAGCCGTICAGTGCiCiC:GAACOG
CCiCiGAACGGCICTCiCiC:AACCGCiCGACTICGGGOG
86 GT AlIT CCIAACGCGCACGGCTT
CAGTGGGCGAACGOGT
24 GAACAGGIATTC.CGGCGCGGACCGCAGGGATTACGC

ACGCTrAGGGIGTCTGATCGAAACCATACGCCCA ATGAIGGGAAGAAGGAC
SS CTG AGAC
GGCAGGATTITATGIGCTG

ICGTCCAAAAGCAGGAAACGCCAGCGCAT

TGTGATOAATGICGCGCA
24 AACGTIGTGAAAC:GGT1IGCGCGAAGGAATIGTCG
90 GCGTGCTGCTGCCGGAAACGCZAAC.ATCATGCCCG GGCGAAT TATTC.CCCCGTTGCCCG
CDOCGAAC.ATCATGCCCG

GGAGITGCCGCGTIGGAAGITGGCIGGTGAAACCC CGAIGTAGTAGGCGI GITC

24 ACCGTCATATGCCITATTGTCGTGGGTAAGAAGGAA TTC:TGCGGATGITTTICTCCGTAATTAATCCGCITAT

GGGTAAGAAGGAACAGGG
24 GCCCGGTCAMAGCGCAAAAAGTTCCGATACC.TTTGC GGACAGCCGGITCCGGTCACAGGAACGCGCCGTC.A

TTCCGATACCTTTGCGCC

CGTIGAGGAGGCCGTAAT C.AATCGGGGICTGACAGG

ATAAGCGGTTTCGGGATG

GATCCITAGAGACCGTGCG

CGGTTGCCAAGCGTCC

GGACGGTCGCAGAAAAGC
24 CC.GATACGGGGIAAGGCAGGATGCCTGIGGACGCA ACZGITATTGGCGGATITGGCGCCTTATACGCCGCC
99 AOC GC CCGATATTGGGC.GCOG
GCCTGTGGACGCAACC

00 GTTTGCGCCGCCG1C:17TGGGCGOGGTTGCCGTAT AGGCAA CGCGATIGCSATOMAA
GGCGCGGITGCCGTAT

CATGGGCGAAGGTGGAT

CCGCAACAAGAGGCCG

GTCCTTGACCGCGAACC

ITTTGOGTAGGGITTAGCA AAAACGCTAATC.ATAAGAG

25 GCTTCATCTGC:TTTGTGCGGGICAGGCATCGGGAGG GAAGC.GAGGICAGGCGGC.ATCCGGGATTCGAGCGT

CAGGCATCGGGAGGGAT
25 CGACCACATCGGACAATOCCCiC.GCGGCAGGGGITGT AGCCGATAATCAGGCGTGTTGCCAGGCATTACGCG

CGC.GGCAGGGGTTGTT

TGTGCAGGAATTGGCAGG CCGCCTGAAAGAATCCCAT

CGAGATGCACGGCTCAAA
25 GGCGGITAll. I I I tiGCCTGCCGTICCGCCCGAAAAT
GTGATTCCGGCTGTTCGCGGTCAGAACCGGCCGCA

GTTCCGCCCGAAAATCTGC

GCAGGAGAATCCGAACCT

GCCTCCTGGCCGCCGATTAACACGATGCGGICTTTG GACGATTICITCCTC.AGCC

TAAGGCTTCGCTGCGCCAGATATCGCGGCGG TT TCA ICGATACGAACGICCGTTA
12 AC G TGTCCi ITTCGCiOCGAAG C

TCGGCAAC.ATGAACGAAG

CCCTGCAGATTICAGCCG CGCCCGATTACAGGCTTG

GCTGCATGAGGACGCTCTATGACGCCCGTTTACCGT TGACGTGAAATCCGACATC

TGCTGAAGCACAATCTGA
TCAAACCCCCCTTGC6CCCCTGAcCCCJCG1TrFCG AA CG CCTGACGGCF
GGITTFCG

16 ATCGOGGCIGTCCIGCTCUGGGCGI GGAAATT ItGG GC (ST
TGGGCGTGGAAATITCGG
2$ AAGCA TCAACGGCTIGGAGGCCCGCCACITTGCCGT

CGGGCAGCGTCAACGA CGTCGAGGCAGTCCGA

18 ACCG (SAC GGCACOCCCGAAAGCA TI ATM
TGAAIGCCGACCG
ZS, CGGCAAGAAGCGGCGGTATGTTGCCTTGTGTTGCCG CGCCGCCCTGCC1TGTGAAGCTTACAACCGCGC-AAC

"fTGCCITGTGlIGCCGT
25 CGGACAGGAGGGOCATTCiAIGICiGGGTCGAAGGCA
ACATCGGGCATITCF TC:CCAGCTG7CCGACCTMCiG GGGGICGAAGGCAAGATG
20 AGATGG CGG AlTTCCGTACC.GGC:GGC
25 ATACGGCTCiCATCGCGGTCTGCC:GACAACCiAAGAA

GGACTGGTATCGCGCCA
22 ACAC.CGCCGCACCGTATCCCCGGCGCGTGGCAATA CZGAACGCCTCAAACCOGCCGGCGGACCATTTGCG
TGTCGGTITGCGCCGT CCGGCGCGTGGCAATA
as GTCAAATCGTGGTT TGACGGGCACAACGACGGCCA
23 17CCCACCGCITC:CCCGACGATGTGATIGGGGCGG GCC GATGTITTCGGCGGCGG
CGATGTGATTCGGGCGG
25 AAAAGCCGCiGCGCAAA ICAGTCGGGTCITIGGCCG
24 TCCGGGCGGTTTIGGCGTGTTCACGAAGCCGATGCC C GAACCIGGCX.CCCGATA
TTCACGAAGC.CGATGCC
25 GCCTTGAGCTTGTCGATTGCCEAC.GCATCGCACTTGC CCAAATGGGCCAATTGGGGCGCGCGITTGCCC.TCG
GCCTATGACGCATTGCAG

ACGCATCGCACITGC.0 25 CGCCCATCAACCCCATCACTGCCCGAT iCTIGCCGIT GCGCATATCCCATATG1CG

CX.OGATTCTTGCX:GITGC

27 GACCi GCGCGGCACTICGACCAATOCCCC.TGCCCGCiTTT C
GCGGCAGGCATATTGAGG
25 GAACAC:CAACGCGTA AA!

CX1GC.GGGAAAATGCGC
25 Gc.ciGGCAcGCGAcirrrIcTcMGcAcrGccATcAcc ATIGTCCGACCAT
ACGCCCGTGCTCAGCACGGGGA

CAAGCACTGCCATCACCG

GAGITCCTGITGTC.GC.TATTIGCATTITCAAATGGTA TTATCGTIGGTGAAGGAA CTAGATCGTCAAAGTTTAG
AGCA GGGAAGG GA CA
25 T17TCGCAAATC1TCCGCCCCGGCGAGATTCTGAACT GCGAAGAGCGGAAAATCAAACACC.TTGATTTCTTTG
CTTIGCCTATATCGTGAAA

C.GGCGAGATTCTGAACTT

ACAATACCGCCAAGCCG

GATGACCGCATCCAAAAC

CGCCGACGCOCGATTICACGCAGCATCGCGGTGGA C TCGACCCCGCCCTGTT
GCAGCATCGCGGTGGA

ZS CGGAAGACAAATCCATGCCGCCATCTGTGGCGCATC
ATCTGTGGCGC.ATCAAAAC
36 AAAACC CCLIGCCCCGATCCCITCIATGCFCGGGOGGTATC:C
GGICGATCFGCAGGCCA C
37 ATCAGCGC.GGCA. s IT.CCZCACAAGGIGCGGCAGT
ATTTGGCCGATGCGCTGCAAACTGTTCCCCGCC.GC CCCTC.AATGCGCGGCA GCACAAGGTGC6GCAGT

ITGGIGTCCGACGTGTATCA
38 ATCG GGGC TCZTTGGIGGATTGGC. TG
GCGTTGACGTGATCCATCG
25 GCTGGCCGGAAACCATATCTCGAAAACGCTGCAAAA C.C.CATTAATGCCGMATGCCAGAGACCGACAAAAA
AACGC.00TGAAGTTAAAC GAAAACGCTGCAAAATGA

25 TCC.ACCCGCACCCGACCTTATACACCCAAAGCCACTT TGGTTTTGCCGATGTCTGCCGGATCGGTGAAGTCTG
ATACACCCAAAGCCACTTC

41 CCGTACGACGCTGAACGGCTCGGCCIGGATGIGGC GC.AGTACCGCCCAGCAGGACGAC1TGGCTTGGGCG GG
TCGGCCTGGATGTGGC
25 GGCATCAGCAGCTCCACGCTAAAGICGAAAAaiTCG GGIGGIGCAGCCTGCCGAACTSCAGCAGTTOTTACC

25 CTCCGATCGGCAACTGGCMGAAGAGGICAGCGCGA TGIGGACGAZGTANCCGCGCATGC.AGCACGACAGC
TGTTTTCAAGCACATCGCG

GAAGAGGTCAGCGCGATG

GCCGCGATTTGTTCATCG
ZS CGCCGCGTTCGTACTGCATGGACACCATCGAGCAGC ATTTGGACTTCAAACGCGGCAGTTCGGCGGGGTAC
TGCTGTCCGTAAAAGAAG

GACACCATCGAGCAGCG

AGTGTCCCICGICGCC

TTGACCGAATGCTGCATCA
25 CGGTGAAACACGGCATTTGGTTGACGGC.ATCGAC.GT 1 TTGCCCATGCCGACCATAATCGGATTGCC.TGCCCGT

GACGGCATCGACGTGG
ZS 1 TCITCGGGATIGGATTCGCCGG TATCCiAAAGCGGC
49 GOCGOGCGGC.A.AAGTCi 1TCGCGGTAC:ACCACGCT 1 AOCGG , TTIGGCTGCCGAGCTGC CGCGGIACACCACGCC
25 GCAGOµGCAATGTCGGCATTACAGAAAGCCGGTGTG 1 AATCGTTGCTGCCGGCATCGTTCAGAGGACCGTTCG
50 GA ' GC GTCAAACCGTTACCGTTGC
ACAGAAAGCCGGTGTGGA
2$ CCCGCCCATAAAGCCATCGGCGCCGGAGAGTGCAAT AGCCGCAGTGGATTAAGAAGGGTGTCGAGCTGGGT
CAGACCGTTGACCAGATA

CGCCGGAGAGTGCAATC

TTTGTAGGCAATMGCGCG

25 ACATAAAGCCGGCGGCAAAC.AACTCCGTCGGCGAA
53 . 1 GITIGCCGCGCTG ATCCGCLTATCGGCAOCAG TIT TGG
GAISACAGGGCCGGTGA CCGGCGGCAGCAGTTI
25 CGAC.AGGCAGGGAGTCGGCATACCTGACGGICTT CA
GAGCAGGCGGTATGGTCGGCGATTTCCCCAACAGC

CCTGACGGICTICAGCCC
ZS CGG'16TCAACGATAAGGCACGGGAACATC1TTGACC TCCAGCATCACATACTCGAAAGICAIGCCAALGCTA
ACGGGATTAAATTGAATT
5$ AG I TCTGG TGCA
GOGAACATCT11GACCAG1 .

CCGACITTAACTTCAATTT TTGAGCGTTITTGAATICA
56 TICAGC GC.37 CC GC
25 GCAGCGCAACCTC:AAACCGAAACCTGCGCGAGTG1T I GCAAGGGCGGGITCGGTAAGCAAGAAACCGTCCGT

CCITICGCGAGIGITGC
25 CGAC:ATCTCGACGGCAACCGGITCAGGTIGTIGAGG 1 G6TIITGGCGGCAGGCGTATAACTACCTCAAC:GCC
Sil CGT G CA TGC:GTTGACAAGGTTGCC

59 . CGCCCTTGCCCAACGCTATAGCGOTATITTACGC1COG 1, TTCG C
GCGGTATTGAC:GGCGC1 .

CGACGCCAT TCGGIGTAGAGCGTGAACT

C.GACCTGCATC.ATTTCGCC
25 CGACCACGCTTC i GGTCGAGGGGAAATCGGCGAAAC AAGGCTGGCGACTIT CAC

GGGAAATCGGCGAAACGG
25 CCiCTITCAF6CGCCTGACCGGCTITAI6TGCCTC1TTT
62 TCGCCCTGCACAAGCTCCITTCGGCGGCTITGCG am CGOTATCGCGCGCITT
TTCGGCGGCMGCG
25 TCCGCCCiCGCCCICITTTAC.AGGATTTGGTGTCGGAG
GTTITCAGCCAMACGTG

CCCCGCCAAAAACGGC

CCGAAGATGAAG ?MCC
64 CG GGCCCGGGGCGCATGAMTCAACC.TGCGCCAACG GA G
25 IACAGCACCCGCCACCTCiCTTCTTT TCGGT AATGCTG
ITGTETTCGGIAATGCTGC
65 CG TTGTTCGACC.GCCGCGTCAGCAACCGCCTGAC.0 GTGTTCGCCGTGGTGG G
25 CGA 1 CTCATCCGTATCGCCCCCK:GITGGAGGC1CGAC
GGITCGAIGGCTTCGGAIGCCGMGATGCGCGTIT
66 GA GC C.GACCAGITTGGCGGIT
COTTGGAGGGCGACGA
25 CiGTAICGCCMCGGIATGGCT.T.111GATGGTGTCGGI GGACGAGCSGITGACCiAGCCGAACACGAGCMCIT
G1CCAA.AAGAC:GCAGACA CGITGATGGIGICGGTGA

25 GCAACGACGAACTGGCGCACTCTICGGTGGTCATGC CAAGCGCGCGCAGGAC.GTACCGTATCCAGCGGATC
68 GG . CG GCCGCGAATTGCACTICT
TCTTCGGTGGTCATGCGG

C.GCCGAGTTTCAGCGTG
25 CGGGC.ATCGTTGC.TTTIC.ATTCATAGTAACGGATGTG
AATTTGCAGGTTATCACTATGTGCAGGCAGAATGAA CAATCCATCAAAAGGCTCT CATAGTAACGGATGTGGA

GGCGAAAAAGGCGCTGGCGTTTCGGGTGGCGCAG AGGGGAGC1. :GGAAGAAT

GCAAACTGCCCACGCC

GAACAACICCTGTMCC:TG

ACAAATICACAOGGGCGG

2$ CGGCGCATTGGCAAAAGGTCGCTTTATCAGGCGATG ACITTGGIGGCGACGACTTCGTCCCGMCGGCGTG
GOTTATCAGGOGATGGA
74 CiAGG C CCGGCGGCTIITCGATT GC;

GTATCGGCATC.GITACCAG
75 ACCAGT GCGCCCGAAATCC:CCGTCAGAAACTCGCTGCOCGC
CCCGTCCGTCCGCAAT
2.5 CGCACCACCGGITCITCGITAACATTGICIGICGCGG
CGCCATCGAAGCCTGCACCiCGTGAAAI CGGMCGT

GATTGTCTGTCGCGGCCI
25 GGCGGGAATGGCGGCAAATATCATCGGC.ITC. TGCST
ACCTGAAAGACCTGTICGGCAC.ACCAGGCCAGCAG

ATCATCGGCTTCTGCGTG
25 CCAGACCGATATGCCGTCGCCGCCATATCCGACGCTT GCATTGGCC.GTCATACCGCTAGOTCGAGTGCTITGA

CGCCATATCCGACGCTTAC

GICGMGATACCGATGC.G
79 GG1TTGCGCCCGCATGIGGGOG1TfACCGGITCGAC GGG AT
GGCGTTTACCGGTTCGAC

CGITICAAACGGCGCGG
25 GCAGCGAP.CACGATACCCGTATGAGCAGGGCGCATC CGATACGITCGGGGIGGATGITITGCCAACCTCAM

GAGCAGGGCGCATCGA

GGAAAAAATCCGATGCCG

GCACGATGGAAGCAGGAA
ZS GCGC.AGGCGATGGTGGTGTCAACGCCGAAATCGTC

ITGGCGGCCIGCAGT

84 TAAACGG TATTCCC P.ATTGCCGGGAATGGAAA
TCCGTATCIGGITAAACGG

TAATTTIGTCGGCGTGTTG

AGGGTCAGICCGATITCGT
25 CCAGCITCGCCGAC.ATCITCTTICTICGGTCAACACGA 1 GGGCGATAATGGCGCGGAAGTCITC-ACGCGAAC.AT GMAAGCCGGTCGACTTG

TCITCGGICAACACGATGG
ZS AATCCCGCCGAAGAAACCGATATATAITCGAGTGCG
GCACTITGCCGCCGATGITTTCGAACOGCAAGTTAC AT ATATICGAGTGCGCGCT

BB TCCACCCACGGCGCATTCGTCGCCATCGTGTGTFCG G TCGCGGTAATGCACTGGG
TCGCCATCGTGTGTICO
ZS AGCGCGCCAAATCGACCGITTGGCAATCCTITTACGC TATTGCAGGAGTTCGGCACCITATGCCGAC.AGTTIG
TTIGGCAATCCTMACGC

TTTGGCTGCTTCTATTGGG

CGGCAGGCGTITTCFACC

CACCAAATCCGCCTOTACG

ACAGCCTCGCCACCAACGCACAGCAGCATCGCGAT

CX:GGGTCGGI ATGGACA
2$ CGTIGGCGCGGATTITGGCGAACACGGCGGAAACA
93 'FTGICITGCCCACGCCCGGCGGIGGAAGGCGGGAA OCiC
GCGAACCiCAAGCCGCT CGGIGGAAGGCGCiG AA
25 GGATTTGCGCGCCCAAGCFCGC.GCCGTCTGAAAATC GCCGTTTCGACAAAGGCGGCCACGGATTIGTGCAG
94 C GGC rrACCT3CCCCGACA
CGCGCC:GI CTGAA AA ICC
AAACGCCAGCGACGAAGTGCITCCTFCATCAGCTTG
9$ 1 CIITGACGAAGCGGCGCA TIGCCGCCGCCAAATCCA CMG GCAAGCCGTGCF GT ACCF
GCCGCCGCCAAAICCA
25 CCAAAGCCCITGATCGCGICCGGTGTGGIGCACiGAG GCAGCGACCAGGAATGGCGACGAAC:CCITCAGCGT
96 GA GATG GGGATGCGCC.GTCTTC
GGTGTG GTGCAGGAGGA

97 GATITX.CGCCGCCGTCATGCATACGCGCCATCGT AGCCCGTTCATTTGCGCCACSGCGTACTCGTCCGT C
GC,ATACMCCICCATCGT

f3GCACICAAATICACAGT

ATC:GATTCAAAACACTGGC

TGCiCAGCTG ITCGITGCITIGG CAATCATGTTCGCATICiG

AACCCAAAC.TCAAAGCCA
26 GCCATTIOCCGAIGATTGOGGCCC.CCGGCAGGACiC:A TGTACGGCAAAGGAGACGGCTG
TAATCGCCGAAAC
00 TT GGTCT CCGCAGCATGAGGCP.AC
C.CCCGGCAGGAGCATT
26 TGTTGTCGTTCiTGGGCGGGCGGCTTTGTGCAGATTG CGGCGGCAAGCGTCTGAATGTTGGACTACTATCCG

CGGCTTTGTGCAGATTGC
26 CCGCiGIT TCACATCGCCGTAIGAGCGACGCTITACCG
ACCGTATTCATIGCAGGIGCCGATACAGTIGGGCG GAAGAAATCGCCCTGCFG

TGAGCGACGCTTTACCGA

AAAGCCAGTTGGACTICGCCCCCiCAACACCOGCAT T ACiAGCiGCITTGAGCA

26 ACITCTCGGACGTI CCM TCGGGCTI CGACICCG 'TA
GCACIGCTGGTC:AIACTG
04 GCGCATCGATITGGGCGCGGAAGCC.GGCGMGAGT CCG C
GGAAGCC.GGCGAAGAGT

OS CGTATGTCCGGCGCGTCAGGTCAGGGC. TTC-CiGGCGT
GTCGGTCACCAGCGCCACTACTGACCICGCCTITGG GCGATGTTGCCGCCCA TCAGGGCTTCGGGCGT
26 CAGCGCGAAGTGTTGGCGGCGAC.GCAGATAMATCG ACACGATGGCGGGTGCTTCGGIGGTTAAAGIGICA
CGACGCAGATAMATCGC

CACTCTTGGGGTCC.TGAAG

eg GT GAAC-AG CCCGCTGACGATGTGTAC
GCGCTCAAACCIITCGGT

CACCCCCATCGACCTGAT

GCACIGATTGGCGGGCCATGC.A ATCCGACCGAA ACC TITC.C.GATAGATTCCTGC.C.

CGGTCGGLITTITCCIGC

TGAACAGGATGTTGCGGA

CGACACGCTCGTCCGT

CAAGGATGCGGGCGATG

TTGCTGACGCMCGGIC
26 AAGTCATGCTOCATGCCGTCAAGGTGICAGTGCGTTG TGICGCGGACAAAACGATGOTtCGACITTCGCCAAA
TCGAGGAAAAGGACGAA

AGGIGTCAGTGCGITGTG

GCAGCCCGCATGTTITTCC
26 GCATCGCCAACCGCCTGTTGACGCCGTTliffiiiCAe-CGGCTGCGTTIGGCGACTACAACTOITGCAGCTCGA

ACGCCGTIbilitiCACGT
26 GCGCACCTTATCGCCGAACAAACGAAACGfelfeiGC ITTICCCCGAGCGCGGCATTCGAAGCCGCCAATCAG
CGAAACGTTMTGCAGCG
20 AGCGG TTII.1 CCTACGCGCCC G

CAGACGCGTTCGGCAAC

GCTFCGGCGACAACAGC

AGCCGTATTGTTTCATCAT

CCTACATGACTACTTGGGC

CGCTICGATTACGGCGIT

ATTFICTOMIGCCTGCG

TTGGACMCGCGCMIGG

CTGTCTCAGTTTGATTTTT

GTAGGGGAGGITOTTCTG

ATGCTGAAATGGACTCCT TATCGGATAACATCTCCTT

GTAAAGTTACCCGAGTTA CAAATGTGGGTGACAATC

GAAGATAAGGCTITITTFA

AriGGGIATCTAGGCCACT

TTGAAACCACTTCTGAM

ATATCAACCAACTTATTGA TCCTGATTCATTTTTCTCFG
32 Torrai GCAGTIG GATCC

AAACAGTTITTTGCTGGIT

CTCCAAAGTAATCGGrfCG

CACGCCAAMCGCAAGGT

GCCATGAGCTGCCCTACG CCTCCGCGCTGATGCA
26 CITTGGCAGTGGCGATGlIGCCGTGCCITGGGICAA CCGCTITGGATTIGOTTOCCGTCGAACCCGGACCGA
ATAGCCAAGGCGTGATCG

CGTGCCTIGGGTCAAAGA

CGTGGGIGTGCCGCAA CTGACCGCCATTCACAACG

TACCAAGGCCGCTCGTAC
26 GTTCGGTCACCGTCGGGCAACACTTCCiGCGGTCAG

CGTCCGCCAAGATGCC

CC.TGATCGGCAAATGGGC GGACGCGATACCGACGA
26 CCGATGMaITGCCCGC1TCGCCTGAACATCCCCACC COGGCAAACATCCAACOCGACCGCCGATFCGATCA

GCCTGAACATCCCCACCA
26 GrCIGTCX:CTGCCCGGCAAACAATICTTIGCGGCCT

GACCTGCGGCTCGTW

GICITATTCGGCAACATCG

44 ATCCTCGCCGTCCTCGAGCCGGGCGGACTTCATCGTG . GCCGACACGTTGGCGCAATTCGCCCACCACCCGA
TTGAGGCTTGCGGGCA GGGCGGACTTCATCGTG

GGAAACGCCGAAATTTTCC

GCGTTTGCGGTAGGCG

TCCGAACCGTCITTCTGTG

26 TTGIGCGGACGCGTGGICGATGICCAACAAGAAATli ATGICCAACAAGAAATGCG

TCCAAATCCGCCTGCGA

ATCGTCAATACAGTTCCG ACATCGGATGCTCTTCCAG

TGTCCAACTCGAGGGCG

CCTTGCACGACGTGIATC GGTCTGACCGTCTITGAAG

CGACAAACGTGCCTTCCG

TITTGCCGAATTGATGGC

TCCAAATCAAACZCGCCC
26 GGCGGCATCZAAGGCITCGCGTCTGAACACCCITS.0 CTCGAAACGCGCGTATGC CGTCTGAACACCCTCCCG

CATCGCTTTTGCCGCCG

SS T TGCC TTGTATTTCCTGCCGCTGC
GCACGGACAGGGATCGT

CGACCAACCGACCGATIC
26 ITITCGCAATCGCATTGACCGCTCG/V\GCGGGTrTT

GGCGCAACGGOCTTAC

GCTTIGGCACTGAAAAGC

GTTACTTCCGATGCGGCG G

CAGITCTICTICGGTTTGC

ICGCCTOGGGAITGAACA

ACACTTTCGACGACAACC

CTATCTGAAGTCCGAGCGC

TTCATTGCAAAAACCGACC AAAACTGTACAGCAGGCG

TGCCCITCATTGCCGTTGGGGAGGGCGGTITCGTCA

TTITCGICATCCGTTACCGC
26 GCCIaiCTGCCITCGAGAITCCIACIGGIACATCGAC GCAGAAAGICGGCGAATACCGGGITCGGCAGGCA
TITCAACCGCAAAGACAA CCTACTGGTACATCGACAC
63 ACG I CCITT , GC

OCCGAACACCTGCAACAA

AAAAACGCCATCCCGAAC AAAACCGTTCGTTACCTCC

GCGACAAATACCCCGITT

TTGAGCGGCAAAATCCAC TCGTTTATGCTGATCCCTC

26 CGGGTAATCTITCCGAAACCGTTTCAATATAGCCAAG GTCAGACATCGGGAAATGCTITITICAAAAGCJkGAT
TTCAATATAGCCAAGGGGA

CATACCAACGTTICCCCCA

GCCACTGCGTGTCCATPC

TCGICAACAGCGICGICG

CACGCGCTGAAGGAGGT

ATTGCCGGGAAACACGC

ACTGCCCAAAAGCGTACC

GCCCICGATAAAACCTIG

GITCITTGGCGGCTTCGT

GCGGCGATTATCATCTGCA

AAAATCGGTGCGGACGTG

ATGAGTCCGAGGCTGTGG

TCCGAGTGCTTGAAGACC

TTGCCGATGITGCCGC

OGAGTATGGTGGCGCG

IGGIGTTCCAAAGCAGTAC

ACTGGIGGAAAACGGIAT TGTATTCCTGATITCCGGC
82 GGCC . TGTG GC
26 GGTACTGCTTCCTTTGCCGGCTTTGTTATCGACTTCAA ACAGGCTGGTTACITTGutttlIGCCATCGTGACCA
CITTGTTATCGACTTCAATT

GCGCCGCATAACCTCTT

AGCAGTAGGGICATCAGG

GACAGGLAGCGCATCG

TCGCCCTGATAGCCGTC

SS C ATG CC GAF
ATGGGCGGCGAGICC

GGGGCGAACAACAOCG

90 GCMAGAGITTCGC=GGCZGCTGCTCTGCGGCGGAC.A ATATGTGCCGTGCCGCGCTGCCGGTCGTTGAGTTCG
ACCGCCCTGCCGTTCA TGCTCTGCGGCGGACA
26 CGGATATTTCTGCCTGCCCGTTGAACCCGTATACCAA MAGGGGCGAAGMAC.A3GCITCGCGTTG1TGTA

91 CC GGTTT , TCT
TTGAACCCGTATACCAACC

GCCGATCGTTACTTGTTCG CGGTAGCTAAGACGACGT

AATCGACCAAATCCTGCG

CATTTCGCGCGCACCC

CATGTCCGACAACGCTTTG ACCTCTTCGCGACGCT
26 AAATCZEGT-Cr..CGCACGCGACGAGCAAATGAAACTG
C.GTGGGCTTACCTCiCCGCTCGATGTAAACCTCGGGC ACCAGCAAATGAAACTCiG

GCTTGCCAAAGAAGGATT CGGAATCGTACAGGAAGA
96 GACG ClIG GG CC

CGCGTGGCAGGAACAC

AAAAGGCCGCGCTCG
26 ATGGITTTGCCCGGCGAGTCGAAACTGGTCGAATGC ACCAACAACAGC.ACCTTCCTCGCGTTCTAGGITCGG
ACGCCCTTGAATATCGTGT GAAACTGGTCGAATGCAA

27 GGGCAIAGGAATAGTCGATGACGGATGCGCGMai 1 ATCTGGCGAAGGAMTGC.ACCiTGCGAAATACTGTC
ACATTCATGCGCCITTGAA
00 GCA TA 1 GcaiA I C
GATGCGCGTTTCGGCATA
t GAGMTGCCA
02 CfCGC 1 GITCC , TGCTGCAPAGCGIGGC
CGIATC:CGCACTACCTCCiC
27 GGTCAGCAGTGCCAMACCCAAGGCTGCTGCTGTCC 1 CCGCCCGCCCGATTGTCAAAAGGC.AGGGCGATCAG
02 CT ' C CCGCCATTCCTGCCGA
GGCTGCTGCTGTCCCT

CCTTGGCAGGGATAGATG

MCGCACCCTCTTCCGT

MAAGGCFTCGATGAAGA

OS . GGTVIGGCGGCGGAAACai TCAGACCGCAGGCACC GACAGA CAACCOC:GOCAGAGC1 'FCAGAMGCAGGCAGC
27 GCATCGTATTGACTGCCITTCAACCGATTICCACAAA AGCCCAGTATGAC:AATTTGCTGTGCTGATTATCTGC
TTAAACATACCGAGTGAA CCGATTTGCACAAATTAAC
06 TTAACC CGTAAA TGCi C

TGCCCAAGCACCCAAATCATATTICTITTGCCGTGAT AAAACGACTTITACCAAA
ATTICGGATITAATTGACC

27 .AACCGAAGTCAAACCGCCGGGCGATGAAGCGTGGG CACGCCAATGACGCGCAATCMCGGATTCMCGC
ACCGCAITTITITGACGGA
OS ATT CAC T G G4:GATGAAGCCi TGGGA II

AAGCATGATTTICAGGCTG CTACCATTGATATTGTGIC
09 1GTCTTGCT 1 CCT ATTCTGAATACCGICCG Tf CI

IIC I GTGC CC:GCATTCAAATCATGGG AA ITTGGGCAAAGACCCC
CCCCG GCTCG C G

IGCGTGATTATTIACGCGGC sUCTITGATIGATAATCGTG
11 . TGAMCG 1 GC. CTGACCGAAAAGCACCET AAACG
=
27 TCGGGCTITTGGICATTITCAGAAACAATTACTGGCO GCAAAC:AACAACACTCGGOCCTGC=TTG

AC.AATTACTGGCGTGTC
27 CCGTCT IGGITTTCCAGTGC.AAAAGGGA 7 ACGCAACC CGAA A ITT TICATGGI
AAACGAGCGGATACCGCAAT GCAAAAATTCCAAAGGCA

AAAGGGATACGCAACCAT
27 TCGAATF GCACCGIGAAIGCGCCCiCCGTACTOCIGG GTCCGAAGACGGCGGC
ITCACAACAGCCGAAGCCG

CCICCGTACTGCTGGCT
27 GGTACGCAACZGAACACiCTCTGTTGGCTATGACTGC
AAGCGAAATTCACGGTCG
TCGGIGGTCC.GCTCGGAAGACACC.GACGCTGACCT CCG A ACACCGACGCTGACCT

ICGCICTACATGATTAGACCC:A TAAAACGTGGCAGATACA
16 C ACATGAGiTTG (SAC
AATATTTCCGTTGCCGTa:
27 CTCAAAAAA TACCAOCCCCiAGAGTA TCTGCCAGTFT T CTCiGCiCAAACIGGAAAAACGGATA
7 ATI ATCiCTGA

TATCTGCCAGTITTCCTIGT
27 GCGCGITGAGATAGACOGCGAGCCGAAATGCG (ACC CI GCACGACGGCAAAATC
18 GGC CGOCAAACiGCGTGCACGTAATGCCTGCGTCGCG Ci CCCiAMTGCGTACCGGC

CACGCMITGTGCGTT GGCTCGCCC.GAATCCA
27 ACGCCGTCTGAAAAAACCTTITTGITTTGCAGTAAAT AMCGCiTITTICITTGGCTICGGGGATAMACCGCC
ITGITTTGCAGTAAATCGA
CGAGA . TACGC TTGCAAGCCTTCATCTICG GA
27 GCC,ATGCGCCCAAACAGATCTTGICCITTGTAAGCGG GCGGCTGCCGTACAATCAAAATGATATGTCGGGCG

TTGTCCTTTGTAAGCGGCC

AAGCAGTCGMAATCAGG

CCGGCAATGATTGAGCGT CG

CGCCGAGCAGGTATIGAG AAGGTCGAAATCGCCAAA

TCCAGGCGTCCCCACiTTGTCATCACGCCCAACCTGA

TCGGGGAATCAGAAGCGG

TGACGAAAGAGGCGGAAC
A GG GACTGGCC-CCACGACA A

DEMANDE OU BREVET VOLUMINEUX
LA PRESENTE PARTIE DE CETTE DEMANDE OU CE BREVET COMPREND
PLUS D'UN TOME.

NOTE : Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets JUMBO APPLICATIONS/PATENTS
THIS SECTION OF THE APPLICATION/PATENT CONTAINS MORE THAN ONE
VOLUME

NOTE: For additional volumes, please contact the Canadian Patent Office NOM DU FICHIER / FILE NAME:
NOTE POUR LE TOME / VOLUME NOTE:

Claims (49)

What is claimed is:
1) A method of detecting a target nucleic acid in a sample comprising:
a) distributing a sample or set of samples into one or rnore individual discrete volumes each individual discrete volume comprising isothermal amplification reagents for amplifying the target polynucleotide, and a solution for rapidly isolating polynucleotides from a cell or virus particle;
b) incubating the sample or set of sarnples at conditions sufficient to allow extraction of target polynucleotides from the sample;
c) generating amplicons of target polynucieotides, wherein isolation of polynucleotides is not required between the extra.ction or amplification step;
d) introducing single stranded regions into said amplicons by incubating the arnplicons with an RNA component molecule and one or more TnpB polypeptides possessing collateral activity, wherein the RNA component molecule comprises a sequence capable of binding the target nucleic acid and designed to form a complex with the one or more TnpB
polypeptides; and e) further incubating the sample with a probe that binds one or more single stranded regions of said arnplicons, wherein said probe is:
i) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
ii) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iii) a cotnbination of an unlabeled double stranded DNA probe and a single stranded RNA
probe with a tluorophore on one end of the strand and a quencher on the other end of the strand;
iv) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other en.d of the same strand; or v) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand;
f) adding an enzyme capable of cleaving RNA in an RNA:DNA duplex to increase fluorescence to be detected; and g) detecting the one or more amplicons, thereby indicating the presence of one or more target polynucleotides in the sample.
2) The method of item 1, wherein the enzyme capable of cleaving RNA is RNaseH.
3) The method of claim 1, wherein the fluorescence detected is greater than fluorescence detected by unwinding of the RNA:DNA duplex alone.
4) The method of claim 1, which does not include a washing step.
5) The method of claim 1, wherein the solution for isolating polynucleotides is protease-based, detergent-based, or chaotrope-based.
6) The method of claim 1, wherein the solution contains proteinase K.
7) The method of claim 6, wherein reaction buffer contains a proteinase K
inhibitor.
8) The method of claim 1, wherein the solution for isolating polynucleotides is Lucigen Quick. Extract Plant DNA Extraction Solution.
9) The method of claim 1, wherein the amplicons are generated using loop-mediated isothermal amplification (LAMP), polymerase chain reaction (PCR), nucleic acid sequence-based amplification (NASBA), strand displacement amplification (SDA), helicase-dependent amplification (HAD), nicking enzyme amplifi.cation reaction (NEAR), transcription mediated amplification (TMA), recombinase polyrnerase amplification (RPA) or rolling circle amplification (RCA).
10) The method of claim 1, wherein the isothermal incubation temperature is betwee,n 55 C
and 75 C.
11) The method of claim 1, wherein the single stranded region is a LAMP
arnplicon loop.
12) The method of claim 1, wherein the single stranded region is an R-loop generated when the RNA component molecule binds to one strand of the amplicon.
13) The method of claim 12, wherein the TnpB polypeptide enables the RNA
component molecule to bind to a strand of the amplicon.
14) The method of claim 13, wherein the TnpB polypeptide comprises a Ruv-C
nuclease domai n.
15) The method of claim 14, wherein the TnpB polypeptide further comprises Ruv-CI. Ruv-CH and Ruv-CHI subdornains.
16) The method of claim 1, wherein the TnpB polypeptide comprises about 200 to about 500 amino acids.
17) The method of claim 1, wherein the RNA component molecule comprises a scaffold of about 40 to 80 nucleotides in length.
18) The method of claim 1, wherein a PAM sequence is 3' of ihe target nucleic acid.
19) The rnethod of claim 1, wherein a functional domain associated with the Tnp13 is selected from the following: transposase activity, methylase activity, demethylase activity, translation activation activity, translation repression activity, transcription activation activity, transcription repression activity, transcription rel.ease factor activity, chromatin modifying or remodeling activity, histone modification activity, nuclease activity, single-strand RNA
cleavage activity, double-strand RNA cleavage activity, single-strand DNA cleavage activity, double-strand DNA
cleavage activity, nucleic acid binding activity, detectable activity, or any combination thereof.
20) The method of claim 1, wherein the probe is a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand.
21) The rnethod of claim 1, wherein the probe is a single stranded RNA
probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
22) The method of claim 1, wherein the probe is a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
23) The method of claim 1, wherein the probe is a combination of an unlabeled double stranded DNA probe and a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand.
24) The method of claim 1, wherein the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the same strand; or
25) The method of claim 1, where the probe is a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
26) The method of c.laim 1, wherein the target nucleic acid is from a virus, bacterium, protozoa, fungus, or other pathogenic organism.
27) The method of claim 26, wherein the target nucleic acid is from human papillomavirus, hepatitis, adenovinis, C'andidia, coronavirus, herpesvirus, human immunodeficiency virus, influenza virus, Plasmodium, rhinovirus, Neisseria gonorrhoeae, Respiratory syncytial virus, coronavirus, or Streptococcus pyogenes.
28) The method of claim 27, wherein the coronavirus SARS-CoV2.
29) The method of claim 1, wherein an extraction-free solution is mixed with a sample at a concentration of about 1:2 to 2:1 sample:extraction solution.
30) The method of claim 29, wherein the sample is from a nasal swab or saliva.
31) The method of claim 1, wherein the incubating step is performed at a temperature of about 20 C to 60 C for about 30 minutes,.
32) The method of claim 1, wherein the amplifying and detecting steps are performed at about 55 C to about 65 C, about 59 C to 61 C or about 60 C for 50 to 70 minutes.
33) The method of claim 1, wherein the target polynucleotide is detected in one hour or less.
34) The method of claim 1, wherein the steps of incubating and detecting are all performed in the same individual discrete volume.
35) A composition for detecting the presence of a target polynucleotide in a sample, comprising:

a) reagents for amplifying the target polynucleotide;
b) an extraction-free solution for isolating polynueleotides from a cell or virus particle;
c) one or more TnpB proteins possessing collateral activity;
d) at least one RNA polynucleotide component comprising a sequence capable of binding the target polynucleotide and designed to form a compl.ex with the one or more TnpB proteins;
and e) one or more of the following probes:
i) a double stranded DNA probe with a fluorophore on one strand and a quencher on the other strand;
ii) a single stranded RNA probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iii) a combination of an unlabeled double stranded DNA probe and a single stranded RNA
probe with a fluorophore on one end of the strand and a quencher on the other end of the strand;
iv) a double stranded RNA probe with a fluorophore on one end of one strand and a quencher on the other end of the sa.me strand; or v) a double stranded R.NA probe with a fluorophore on one end of one strand and a quencher on the other end of the other strand.
36) The composition of claim 35, wherein the amplification reagents are LAMP reagents comprising F3, B3, FIP, BIP, Loop Forward and Loop Reverse primers.
37) The compositions of claim 36, wherein the probes are selected from Table .
38) The composition of claim 32, wherein the probes are provided at a concentration of 50 nM
to 175 nM, preferably 75 nM to 150nM.
39) The composition of claim 36, wherein LAMP primers are selected from Table .
40) .......................................................................
The composition of claim 36, wherein the F3 primer is selected from Table .
41) 'f he composition of claim 36, wherein the composition is lyophilized.
42) The composition of claim 41, wherein the composition is lyophilized as a complete formulation.
43) The composition of claim 41, wherein the composition is lyophilized as an incomplete formulation and additional components are added later in resuspension buffer.
44) The composition of claim 35, comprising one of m.ore of lactose, trehalose, sorbitol, glucose, raffinose, glycine or histidine.
45) The composition of c.laim 35, further comprising one or rnore additives, wherein the additive is guanidinium chloride (GuHC1), L-proline, L-histidine, b-alanine, L-serine, urea, acetamide, 4-arninobutyric acid, polyethylene glycol, pol.ypropylene glycol, polyvinylpyrrolidone K, 6-0-a-D-maltosyl-b- cyclodextrin, (2-hydroxypropy1)-b-cyclodextrin, a-cyclodextrin, b-cyclodextrin, rnethyl-b- cyclodextrin, glycine, proliiìe, taurine, or a cornbination thereof.
46) The composition of clai.m 35, further comprising polynucleotide binding beads for the capture of nucleic acids in a sample.
47) The composition of claim 46, wherein the beads are carboxylated.
48) The composition of claim 47, wherein the RNA polynucleotide cornponent comprises a spacer specific for the N gene or S gene of SARS-CoV-2.
49) The composition of claim 35, further comprising one or more additives to increase reaction specificity or kinetics.
CA3231249A 2021-09-09 2022-09-08 Coronavirus rapid diagnostics Pending CA3231249A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163242377P 2021-09-09 2021-09-09
US63/242,377 2021-09-09
PCT/US2022/076140 WO2023039491A2 (en) 2021-09-09 2022-09-08 Coronavirus rapid diagnostics

Publications (1)

Publication Number Publication Date
CA3231249A1 true CA3231249A1 (en) 2023-03-16

Family

ID=85506942

Family Applications (1)

Application Number Title Priority Date Filing Date
CA3231249A Pending CA3231249A1 (en) 2021-09-09 2022-09-08 Coronavirus rapid diagnostics

Country Status (2)

Country Link
CA (1) CA3231249A1 (en)
WO (1) WO2023039491A2 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11453907B2 (en) * 2020-03-23 2022-09-27 The Broad Institute, Inc. Crispr effector system based coronavirus diagnostics
EP4265741A1 (en) * 2022-04-21 2023-10-25 Consejo Superior de Investigaciones Científicas (CSIC) Multiplexable crispr-cas9-based virus detection method
CN117660702B (en) * 2024-02-01 2024-04-30 广东省林业科学研究院 Fluorescent quantitative PCR primer group and method for detecting Liquorice pangolin virus
CN118006733B (en) * 2024-04-09 2024-07-09 艾特生物科技(深圳)有限公司 Nucleic acid chemiluminescence detection method based on Cas12a and streptavidin aptamer cascade

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4274856B2 (en) * 2003-06-19 2009-06-10 オリンパス株式会社 Method for detecting reaction between DNA and DNA-binding protein
DE602006008150D1 (en) * 2006-10-12 2009-09-10 Bio Rad Pasteur Double-stranded probes for fluorescence detection of nucleic acids
WO2012064978A2 (en) * 2010-11-10 2012-05-18 Brandeis University Compositions, methods, and kits for detecting and identifying mycobacteria
GB201122458D0 (en) * 2011-12-30 2012-02-08 Univ Wageningen Modified cascade ribonucleoproteins and uses thereof
CN106715706B (en) * 2014-09-30 2022-08-09 环球生命科技咨询美国有限责任公司 Method for analyzing nucleic acids directly from unpurified biological samples
BR112021025669A2 (en) * 2019-06-18 2022-02-22 Mammoth Biosciences Inc Microfluidic cartridge for detecting a target nucleic acid, collector, method for detecting a target nucleic acid, and, uses of a microfluidic cartridge, a system, a programmable nuclease, a composition and a dna-activated programmable RNA nuclease
EP4061941A4 (en) * 2019-11-19 2023-12-06 The Broad Institute, Inc. Retrotransposons and use thereof

Also Published As

Publication number Publication date
WO2023039491A2 (en) 2023-03-16
WO2023039491A3 (en) 2023-06-29

Similar Documents

Publication Publication Date Title
CA3231249A1 (en) Coronavirus rapid diagnostics
US20210207130A1 (en) Methods and compositions for the making and using of guide nucleic acids
CA2796578C (en) Breast cancer associated circulating nucleic acid biomarkers
CA3060721C (en) Method of diagnosing bladder cancer.
Kubota et al. FRET-based assimilating probe for sequence-specific real-time monitoring of loop-mediated isothermal amplification (LAMP)
US20220093208A1 (en) Compositions, methods, and systems to detect hematopoietic stem cell transplantation status
US20080194416A1 (en) Detection of mature small rna molecules
CA2100919A1 (en) Species-specific oligonucleotides for bifidobacteria and a method of detection using the same
CA3183566A1 (en) Methods of detecting sars-cov-2, influenza, and rsv
US20210383891A1 (en) Improved Ordered Assembly of Multiple DNA Fragments
WO2003045230A2 (en) Novel compositions and methods for cancer
JP2023547536A (en) Multiplex detection of bacterial respiratory pathogens
CN101849022A (en) A method of DNA amplification
US20220098577A1 (en) Ordered Assembly of Multiple DNA Fragments
WO2020068559A1 (en) Depleting unwanted rna species
US20130231261A1 (en) Rnase h-based rna profiling
US11345971B2 (en) Primer set for detecting SARS-CoV-2, method for testing SARS-CoV-2, and reagent and kit of testing SARS-CoV-2
US11584960B2 (en) Multiplex detection of short nucleic acids
WO2020146603A1 (en) Methods of detecting analytes and compositions thereof
CA3186629A1 (en) Compositions and methods for treating disorders associated with loss-of-function mutations in scn2a
WO2018199136A1 (en) Method for measuring expression level of abl1 t315i mutation
KR20050114099A (en) Dna chip for diagnosis of colon cancer
CA3218053A1 (en) Modified nucleases
AU2006216122B2 (en) Detection of DNA sequence motifs in ruminants
US20080193935A1 (en) Detection of Dna Sequence Motifs in Ruminants