ES2703769T3 - Biomarcadores de ácidos nucleicos en circulación asociados al cáncer de mama - Google Patents

Biomarcadores de ácidos nucleicos en circulación asociados al cáncer de mama Download PDF

Info

Publication number
ES2703769T3
ES2703769T3 ES11769758T ES11769758T ES2703769T3 ES 2703769 T3 ES2703769 T3 ES 2703769T3 ES 11769758 T ES11769758 T ES 11769758T ES 11769758 T ES11769758 T ES 11769758T ES 2703769 T3 ES2703769 T3 ES 2703769T3
Authority
ES
Spain
Prior art keywords
breast cancer
sequence
set forth
extracellular dna
circulation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
ES11769758T
Other languages
English (en)
Inventor
Ekkehard Schütz
Julia Beck
Howard Urnovitz
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chronix Biomedical Inc
Original Assignee
Chronix Biomedical Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chronix Biomedical Inc filed Critical Chronix Biomedical Inc
Application granted granted Critical
Publication of ES2703769T3 publication Critical patent/ES2703769T3/es
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6851Quantitative amplification
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Hospice & Palliative Care (AREA)
  • Oncology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Un método de detección del nivel de una molécula de ADN extracelular en circulación asociada con cáncer de mama en el ADN extracelular en circulación en una muestra de sangre, suero o plasma de un paciente que tiene cáncer de mama o se sospecha que tiene cáncer de mama, que comprende: determinar el nivel de cada una de las regiones cromosómicas expuestas en la Tabla 7 en el ADN extracelular de la muestra, en donde la presencia de un nivel más alto que los niveles normales de un ácido nucleico de al menos 25 nucleótidos de longitud que se asigna de forma inequívoca a una región cromosómica expuesta en la Tabla 7 es indicativa de un riesgo aumentado de cáncer de mama o de recaída del cáncer de mama.

Description

DESCRIPCIÓN
Biomarcadores de ácidos nucleicos en circulación asociados al cáncer de mama
Referencia cruzada a solicitud relacionada
La presente solicitud reivindica el beneficio de la solicitud provisional de Estados Unidos N.° 61/324.927, presentada el 16 de abril de 2010, cuya aplicación se incorpora en el presente documento por referencia.
Antecedentes de la invención
Los métodos para detectar el cáncer de mama, incluyendo la mamografía, se conocen bien en la técnica (véanse, por ejemplo, Elmore, et al., JAMA 293:1245-56, 2005). Sin embargo, existe una necesidad de métodos de detección adicionales que puedan usarse de manera eficaz y económica. La presente invención aborda esta necesidad.
Breve sumario de la invención
La invención se basa, en parte, en el descubrimiento de biomarcadores de ácidos nucleicos en circulación (CNA, forma siglada de circulating nucleic acids) asociados al cáncer de mama. Los biomarcadores CNA son secuencias de ácido nucleico, en la presente invención secuencias de ADN, es decir, fragmentos de ADN, que están presentes en la sangre, por ejemplo, en una muestra de suero o plasma, de un paciente con cáncer de mama, pero que rara vez están presentes, si acaso, en la sangre, por ejemplo, una muestra de suero o plasma, obtenida de un individuo normal, es decir, en el contexto de la presente invención, un individuo que no tiene cáncer de mama.
Por consiguiente, en un aspecto, la invención proporciona un método para analizar CNA en una muestra (sangre, suero o plasma) de un paciente, que comprende detectar en dicha muestra la presencia de al menos un ADN extracelular que tenga una secuencia de nucleótidos que se encuentre dentro de una región cromosómica expuesta en las Tablas 2-8 (o que tiene una secuencia de nucleótidos que es parte de una de las secuencias expuestas en las Tablas de Secuencias AG). En algunas realizaciones, detectar la presencia de, o la cantidad de, el al menos un biomarcador comprende detectar una molécula de ADN extracelular que tiene entre 50 y 400 nucleótidos consecutivos de una secuencia singular dentro de una región cromosómica como se expone en las Tablas 2-8 (o de una secuencia singular expuesta en las Tablas de Secuencias A-G).
En una realización, se proporciona un método para analizar ADN libre en circulación en una muestra de un paciente, que comprende determinar, en una muestra que es sangre, suero o plasma, la presencia o ausencia, o la cantidad de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30 o al menos 40 moléculas de ADN extracelular, teniendo cada una una secuencia que se encuentra dentro de una región cromosómica distinta, como se expone en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8, y preferentemente las secuencias de las moléculas de ADN extracelular carecen de un elemento repetitivo. En las realizaciones preferentes, las moléculas de ADN extracelular tienen secuencias que se encuentran dentro de distintas regiones cromosómicas en la misma tabla, que se escoge de las Tablas 2-8.
En otro aspecto, la presente invención proporciona un kit que incluye dos o más (por ejemplo, al menos 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 20, 25, 30 o al menos 40, pero menos de 55 o 60) conjuntos de oligonucleótidos. Cada conjunto comprende uno o más oligonucleótidos con una secuencia de nucleótidos que se encuentra dentro de una única región cromosómica que se expone en una tabla escogida entre las Tablas 2-8. Preferentemente, los distintos conjuntos de oligonucleótidos corresponden a distintas regiones cromosómicas dentro de la misma tabla. Además, preferentemente, los oligonucleótidos carecen de un elemento repetitivo. Opcionalmente, los oligonucleótidos se unen a uno o más sustratos sólidos, tales como microchips y perlas.
En otro aspecto, la presente invención proporciona un método para el diagnóstico o el cribado del cáncer de mama en un paciente. El método incluye las etapas de: (a) determinar, en una muestra que es sangre, suero o plasma de un paciente, la presencia o ausencia, o la cantidad de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30 o al menos 40 moléculas de ADN extracelular, teniendo cada una una secuencia que se encuentra dentro de una región cromosómica distinta, como se expone en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7; y (b) correlacionar la presencia de, o una presencia aumentada de, dichos primero y segundo ADN extracelulares con una probabilidad aumentada de que dicho paciente tenga cáncer de mama. Preferentemente, las secuencias de las moléculas de ADN extracelular carecen de un elemento repetitivo. En las realizaciones preferentes, las moléculas de ADN extracelular tienen secuencias que se encuentran dentro de distintas regiones cromosómicas en la misma tabla, que se escoge de las Tablas 2-8.
En un aspecto, la invención proporciona un método para identificar a un paciente que tiene un biomarcador CNA asociado con cáncer de mama, comprendiendo el método detectar la presencia de al menos un biomarcador expuesto en la Tabla 2, 3, 4, 5, 6 o 7 en una muestra de CNA obtenida de suero o plasma del paciente. En algunas realizaciones, la invención proporciona un método para identificar a un paciente que tiene un biomarcador CNA que está asociado con la ausencia de cáncer de mama, comprendiendo el método detectar la presencia de al menos un biomarcador expuesto en la Tabla 8 en una muestra de CNA procedente de suero o plasma del paciente. Un biomarcador puede identificarse utilizando varios de métodos, incluyendo la secuenciación del CNA, así como el uso de una sonda o un conjunto de sondas para detectar la presencia del biomarcador.
En un aspecto adicional, la invención proporciona un kit para identificar a un paciente que tiene un biomarcador para el cáncer de mama y/o que tiene un biomarcador asociado con un individuo normal que no tiene cáncer de mama, en donde el kit comprende al menos una sonda polinucleotídica para un biomarcador expuesto en Tabla 2, 3, 4, 5, 6, 7 u 8. Preferentemente, tal kit comprende sondas para múltiple biomarcadores, por ejemplo, al menos 2, 3, 4, 5, 10, 20 o 30, o más, de los biomarcadores expuestos en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8. En algunas realizaciones, el kit también incluye un dispositivo electrónico o un programa informático para comparar los patrones de hibridación del CNA en la muestra del paciente con un conjunto de datos de cáncer de mama que comprende un listado de biomarcadores que están presentes en el CNA del paciente con cáncer de mama, pero no en muestras de CNA de individuos normales.
En algunas realizaciones, la presencia de al menos un biomarcador en el CNA se determina mediante secuenciación. En algunas realizaciones, la presencia de al menos un biomarcador en el CNA se determina mediante una matriz. En algunas realizaciones, la presencia de al menos un biomarcador en el CNA se determina utilizando un ensayo que comprende una reacción de amplificación, tal como la reacción en cadena de la polimerasa (PCR). En algunas realizaciones, se puede emplear una matriz de ácido nucleico que forma un conjunto de sondas que comprende sondas para dos o más marcadores expuestos en la Tabla 2, 3, 4, 5, 6, 7 u 8.
En un aspecto adicional, la invención proporciona un método de detección de cáncer de mama en un paciente que tiene, o se sospecha que tiene, cáncer de mama, comprendiendo el método poner en contacto ADN de la muestra de suero o plasma con una sonda que hibrida de forma selectiva con una secuencia presente en una región cromosómica descrita en el presente documento, por ejemplo, una secuencia expuesta en las Tablas A-G, en condiciones en las que la sonda hibrida de forma selectiva con la secuencia; y detectar la presencia o ausencia de hibridación de la sonda, en donde la presencia de hibridación es indicativa de cáncer de mama.
Las Tablas de Secuencias A-G proporcionan ejemplos de secuencias que corresponden a las regiones cromosómicas expuestas en las Tablas 2-8, respectivamente. La designación (N)x en las tablas se refiere a secuencias de elementos repetitivos.
Breve descripción de los dibujos
La Figura 1 proporciona un ejemplo de una curva ROC utilizando las 40 regiones de clasificación más alta de las regiones cromosómicas identificadas en las T ablas 7 y 8, en conjunto. La curva ROC real, creada a partir de las sumas de puntuaciones, se proporciona junto con los límites de confianza del 95 %.
Descripción detallada de la invención
Como se usa en el presente documento, un "biomarcador" se refiere a una secuencia de ácido nucleico que corresponde a una región cromosómica, donde la presencia del ácido nucleico en el CNA se asocia con cáncer de mama; o en las realizaciones donde el biomarcador se expone en la Tabla 8, la ausencia de cáncer de mama.
En la presente invención, una "región cromosómica" enumerada en una cualquiera de las Tablas 1 a 8 se refiere a la región del cromosoma que corresponde a las posiciones de nucleótido indicadas en las tablas. Las posiciones de nucleótido en los cromosomas están numeradas de acuerdo con el genoma de Homo sapiens (humano), versión 37.2 de noviembre de 2010 (Tabla 1-6) y genoma de Homo sapiens (humano), versión 36 de marzo de 2006 (Tabla 7, 8). Como se entiende en la técnica, en el genoma de los individuos existen polimorfismos de origen natural. Por lo tanto, cada una de las regiones cromosómicas enumeradas en la tabla abarca variantes alélicas, así como la secuencia particular en la base de datos, por ejemplo, las Tablas de secuencias A-G corresponden a las regiones cromosómicas indicadas. Una variante alélica normalmente tiene al menos el 95 % de identidad, a menudo al menos el 96 %, al menos el 97 %, al menos el 98 % o al menos el 99 % de identidad de secuencia con una región cromosómica indicada en las Tablas que está presente en una base de datos particular, por ejemplo, la National Center for Biotechnology Information (Centro Nacional de Información Biotecnológica) (Homo sapiens versión 37.1, en el sitio de internet http:// seguido de www.ncbi.nlm.nih.gov/mapview/.) El porcentaje de identidad se puede determinar utilizando algoritmos bien conocidos, incluyendo el algoritmo BLAST, por ejemplo, configurado a los parámetros por defecto. Además, se entiende que las secuencias de nucleótidos de los cromosomas pueden mejorarse cuando que se descubren y corrigen errores en la base de datos actual. La expresión "región cromosómica" abarca cualquier variante o versión corregida de la misma región, como se define en las Tablas 1-8. Dada la información proporcionada en las tablas en la presente divulgación, especialmente en vista de las secuencias enumeradas en las Tablas AG, un experto en la materia podrá entender las regiones cromosómicas usadas para la presente invención, incluso después de descubrir nuevas variantes o de corregir errores.
"Detectar la presencia de una región cromosómica" en CNA en el contexto de la presente invención se refiere a detectar cualquier secuencia de una región cromosómica mostrada en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8, donde la secuencia detectada puede asignarse de forma inequívoca a esa región cromosómica. Por lo tanto, esta expresión se refiere a la detección de secuencias singulares de las regiones cromosómicas. Los métodos para eliminar del análisis las secuencias repetitivas son conocidos en la técnica e incluyen el uso de ADN bloqueante, por ejemplo, cuando los ácidos nucleicos diana se identifican por hibridación. En algunas realizaciones, normalmente, cuando la presencia de un biomarcador se determina mediante secuenciación del CNA de un paciente, se pueden usar manipulaciones y programas informáticos bien conocidos para eliminar del análisis las secuencias repetitivas (véase, por ejemplo, la sección de EJEMPLOS). Además, las secuencias que tienen una alineamiento múltiple igual de adecuado con la base de datos de referencia normalmente se omiten en análisis posteriores.
La expresión "detectar un biomarcador" como se usa en el presente documento se refiere a la detección de una secuencia de una región cromosómica enumerada en cualquiera de las Tablas 2-8. Se considera que un biomarcador está presente si alguna secuencia de ácido nucleico presente en el CNA se asigna de forma inequívoca a la región cromosómica.
La expresión "asignado de forma inequívoca" en el contexto de la presente invención se refiere a determinar que un ADN detectado en el CNA de un paciente es de una región cromosómica particular. Por lo tanto, en los métodos de detección que emplean la hibridación, la sonda hibrida específicamente con esa región. En los métodos de detección que emplean amplificación, el cebador (o cebadores) hibrida específicamente con esa región. En los métodos de detección que emplean secuenciación, la secuencia se asigna a esa región basándose en algoritmos para la identidad muy conocidos, tal como el algoritmo BLAST que utiliza parámetros altamente rigurosos, tales como e <0,0001. Además, tal secuencia no tiene otro resultado positivo igual de adecuado en la base de datos utilizada.
La expresión "ácidos nucleicos en circulación" se refiere a los ácidos nucleicos acelulares que están presentes en la sangre.
La expresión "ADN extracelular en circulación", como se usa en el presente documento, significa moléculas de ADN libres de 25 nucleótidos o más que no están contenidas dentro de ninguna célula intacta en la sangre humana, y que se pueden obtener a partir de suero o plasma humano.
El término "hibridación" se refiere a la formación de una estructura bicatenaria por parte de dos ácidos nucleicos monocatenarios debido al emparejamiento de bases complementarias. La hibridación puede producirse entre cadenas de ácido nucleico exactamente complementarias o entre cadenas de ácido nucleico que contienen regiones menores de desapareamiento. Como se usa en el presente documento, la expresión "sustancialmente complementarias" se refiere a secuencias que son complementarias, excepto por regiones menores de desapareamiento. Normalmente, el número total de nucleótidos desapareados a lo largo de una región de hibridación no es de más de 3 nucleótidos para secuencias de aproximadamente 15 nucleótidos de longitud. Las condiciones en las que solo se hibridarán cadenas de ácido nucleico exactamente complementarias se denominan condiciones de hibridación "rigurosas" o "específicas de secuencia". Se pueden lograr estructuras bicatenarias estables de ácidos nucleicos sustancialmente complementarios en condiciones de hibridación menos rigurosas. Los expertos en la materia de la tecnología de ácidos nucleicos pueden determinar la estabilidad de la estructura bicatenaria de forma empírica, considerando una serie de variables que incluyen, por ejemplo, la longitud y la concentración de pare de bases de los oligonucleótidos, la fuerza iónica y la incidencia de pares de bases desapareadas. Por ejemplo, está disponible en el mercado un programa informático de National Biosciences para calcular la estabilidad de la estructura secundaria, Inc. (Plymouth, Minn.); por ejemplo, OLIGO versión 5, de DNA Software (Ann Arbor, Michigan), por ejemplo, Visual OMP 6.
Las condiciones de hibridación rigurosas, específicas de secuencia, en las que un oligonucleótido hibridará solo con la secuencia diana, se conocen bien en la técnica (véanse, por ejemplo, las referencias generales proporcionadas en la sección sobre detección de polimorfismos en secuencias de ácido nucleico). Las condiciones rigurosas dependen de la secuencia y serán distintas en circunstancias distintas. En general, las condiciones rigurosas se seleccionan para que sean aproximadamente de 5 °C más bajas a 5 °C más altas que el punto de fusión térmica (Tm) para la secuencia específica a una fuerza iónica y pH definidos. La Tm es la temperatura (a una fuerza iónica y pH definidos) a la que se ha disociado el 50 % de las cadenas de la estructura bicatenaria. Suavizar la rigurosidad de las condiciones de hibridación permitirá tolerar desapareamientos de secuencia; el grado de desapareamiento tolerado puede controlarse mediante el ajuste adecuado de las condiciones de hibridación.
El término "cebador" se refiere a un oligonucleótido que actúa como un punto de inicio de la síntesis de ADN en condiciones en las que se induce la síntesis de un producto de extensión de cebador complementario a una cadena de ácido nucleico, es decir, en presencia de cuatro nucleósido trifosfatos distintos y un agente para la polimerización (es decir, ADN polimerasa o transcriptasa inversa) en un tampón apropiado y a una temperatura adecuada. Un cebador es preferiblemente un oligodesoxirribonucleótido monocatenario. El cebador incluye una "región de hibridación" exacta o sustancialmente complementaria a la secuencia diana, preferentemente, de aproximadamente 15 o aproximadamente 35 nucleótidos de longitud. Un oligonucleótido cebador puede consistir completamente en la región de hibridación o puede contener características adicionales que permitan la detección, inmovilización o manipulación del producto amplificado, pero que no alteran la capacidad del cebador para servir como reactivo de partida para la síntesis de ADN. Por ejemplo, puede incluirse una cola de una secuencia de ácido nucleico en el extremo 5' del cebador que hibride con un oligonucleótido de captura.
El término "sonda" se refiere a un oligonucleótido que híbrida de forma selectiva con un ácido nucleico diana en condiciones adecuadas. Una sonda para la detección de las secuencias de biomarcador descritas en el presente documento puede ser de cualquier longitud, por ejemplo, de 15-500 pb de longitud. Normalmente, en ensayos basados en sondas, son preferentes las sondas de hibridación que son de menos de 50 pb.
La expresión "secuencia diana" o "región diana" se refiere a una región de un ácido nucleico que se debe analizar y comprende el sitio polimórfico de interés.
Como se usa en el presente documento, los términos "ácido nucleico", "polinucleótido" y "oligonucleótido" se refieren a cebadores, sondas y fragmentos oligoméricos. Los términos no están limitados por la longitud y son genéricos para los polímeros lineales de polidesoxirribonucleótidos (que contienen 2-desoxi-D-ribosa), polirribonucleótidos (que contienen D-ribosa) y cualquier otro N-glucósido de una base purínica o pirimidínica, o bases purínicas o pirimidínicas modificadas. Estos términos incluyen ADN bicatenario y monocatenario, así como ARN bicatenario y monocatenario. Los oligonucleótidos para su uso en la invención se pueden usar como cebadores y/o sondas.
Un ácido nucleico, polinucleótido u oligonucleótido puede comprender enlaces fosfodiéster o enlaces modificados que incluyen, pero sin limitación, un fosfotriéster, fosforamidato, siloxano, carbonato, carboximetiléster, acetamidato, carbamato, tioéter, fosforamidato puenteado, metilenfosfonato puenteado, fosforotioato, metilfosfonato, fosforoditioato, fosforotioato puenteado o enlaces sulfona, y combinaciones de tales enlaces.
Un ácido nucleico, polinucleótido u oligonucleótido puede comprender las cinco bases de origen biológico (adenina, guanina, timina, citosina y uracilo) y/o bases distintas de las cinco bases de origen biológico. Estas bases pueden servir para varios propósitos, por ejemplo, para estabilizar o desestabilizar la hibridación; para facilitar o inhibir la degradación de la sonda; o como puntos de unión para fracciones detectables o fracciones de desactivación. Por ejemplo, un polinucleótido de la invención puede contener una o más fracciones de bases modificadas, no convencionales o derivatizadas, incluyendo, pero sin limitación, N6-metiladenina, N6-terc-butilbencil-adenina, imidazol, imidazoles sustituidos, 5-fluorouracilo, 5 bromouracilo, 5-clorouracilo, 5-yodouracilo, hipoxantina, xantina, 4-acetilcitosina, 5 (carboxihidroximetil)uracilo, 5 carboximetilaminometil-2-tiouridina, 5 carboximetilaminometiluracilo, dihidrouracilo, beta-D-galactosilqueosina, inosina, N6 isopentenilade-nina, 1-metilguanina, 1-metilinosina, 2,2-dimetilguanina, 2-metiladenina, 2-metilguanina, 3-metilcitosina, 5-metilcitosina, N6-metiladenina, 7-metilguanina, 5-metilaminometiluracilo, 5-metoxiaminometil-2-tiouracilo, beta-D mannosilqueosina, 5'-metoxicarboximetiluracilo, 5-metoxiuracilo, 2-metiltio-N6-isopenteniladenina, ácido uracil-5-oxiacético (v), wybutoxosina, pseudouracilo, queosina, 2 tiocitosina, 5-metil-2-tiouracilo, 2-tiouracilo, 4-tiouracilo, 5-metiluracilo, éster metílico del ácido uracil-5-oxiacético, 3-(3-amino-3-N-2-carboxipropil) uracilo, (acp3)w, 2,6-diaminopurina y 5-propinil pirimidina. Otros ejemplos de fracciones de bases modificadas, no convencionales o derivatizadas, se pueden encontrar en las patentes de Estados Unidos N.° 6.001.611; 5.955.589; 5.844.106; 5.789.562; 5.750.343; 5.728.525; y 5.679.785, Adicionalmente, un ácido nucleico, polinucleótido u oligonucleótido puede comprender una o más fracciones de azúcar modificadas que incluyen, pero sin limitación, arabinosa, 2-fluoroarabinosa, xilulosa y hexosa.
La expresión "elemento repetitivo", como se usa en el presente documento, se refiere a un tramo de secuencia de ADN de al menos 25 nucleótidos de longitud que está presente en el genoma humano en al menos 50 copias.
Los términos "matrices", "micromatrices" y los "chips de ADN" se usan en el presente documento de manera indistinta para referirse a una matriz de polinucleótidos distintos fijados a un sustrato, tales como vidrio, plástico, papel, nailon, filtro, chip, perla o cualquier otro soporte sólido adecuado. Los polinucleótidos se pueden sintetizar directamente sobre el sustrato o se pueden sintetizar por separado del sustrato y después fijarse al sustrato. Las matrices se preparan utilizando métodos conocidos.
Introducción
La invención se basa, al menos en parte, en la identificación de secuencias de CNA de regiones cromosómicas particulares que están presentes, o lo están en una cantidad aumentada, en la sangre de pacientes que tienen cáncer de mama, pero rara vez, o nunca, están presentes, o lo están en una cantidad más baja, en la sangre de pacientes normales que no tienen cáncer de mama. La invención también se basa, en parte, en la identificación de biomarcadores en el CNA de individuos normales, es decir, en el contexto de la presente invención, individuos no diagnosticados de cáncer de mama, que rara vez, o nunca, están presentes en pacientes con cáncer de mama. Por lo tanto, la invención proporciona métodos y dispositivos para analizar la presencia de secuencias de una región cromosómica que corresponde a al menos una de las regiones cromosómicas expuestas en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8.
Por consiguiente, en un aspecto, la invención proporciona un método para analizar CNA en una muestra (sangre, suero o plasma) de un paciente, que comprende detectar la presencia de, o una cantidad de, al menos un ADN extracelular en circulación que tiene una secuencia de nucleótidos de al menos 25 nucleótidos que se encuentra dentro de una región cromosómica expuesta en las Tablas 2, 3, 4, 5, 6, 7 y 8 (o que tiene una secuencia de nucleótidos que es parte de una de las secuencias expuestas en las Tablas de Secuencias A, B, C, D, E, F y G, en la muestra. Preferentemente, el ADN extracelular en circulación carece de un elemento repetitivo. En una realización, el paciente es un individuo que se sospecha que tiene o diagnosticado de cáncer, por ejemplo, cáncer de mama.
Por "encontrarse dentro" se entiende en el presente documento que la secuencia de nucleótidos de un ADN extracelular en circulación es sustancialmente idéntica (por ejemplo, mayor que el 95 % idéntica) a una parte de la secuencia de nucleótidos de una región cromosómica. En otras palabras, el ADN extracelular en circulación puede hibridar en condiciones rigurosas con, u obtenerse de, la región cromosómica.
En una realización, se proporciona un método para analizar ADN extracelular en circulación en una muestra de un paciente, que comprende determinar, en una muestra que es sangre, suero o plasma, la presencia o ausencia, o la cantidad de, una pluralidad de moléculas de ADN extracelular en circulación, teniendo cada una una secuencia de al menos 25 nucleótidos de longitud que se encuentra dentro de la misma una región cromosómica única expuesta en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8 (o teniendo cada una una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de la misma secuencia expuesta en las Tablas de Secuencias A, B, C, D, E, F y G). Puede haber dos o más, o cualquier número de distintas moléculas de ADN extracelular en circulación que proceden todas de la misma una región cromosómica expuesta en las Tablas 2­ 8, y, en algunas realizaciones, se detectan todas tales moléculas de ADN extracelular en circulación y/o se determinan las cantidades de las mismas.
Preferentemente, las secuencias de las moléculas de ADN extracelular en circulación carecen de un elemento repetitivo.
En una realización, se proporciona un método para analizar ADN extracelular en circulación en una muestra de un paciente, que comprende determinar, en una muestra que es sangre, suero o plasma, la presencia o ausencia, o la cantidad de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30 o al menos 40 moléculas de ADN extracelular en circulación, teniendo cada una una secuencia de al menos 25 pares de bases que se encuentra dentro de una región cromosómica distinta, como se expone en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8 (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en las Tablas de Secuencias A, B, C, D, E, F y G). Preferentemente, las secuencias de las moléculas de ADN extracelular en circulación carecen de un elemento repetitivo. En las realizaciones preferentes, las moléculas de ADN extracelular tienen secuencias que se encuentran dentro de distintas regiones cromosómicas en la misma tabla, que se escoge de las Tablas 2-8. En una realización específica, se determina la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30 o al menos 32 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 2 distinta (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias F). En otra realización específica, se determina la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30, 35 o al menos 37 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 3 distinta (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias B). En otra realización específica, se determina la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 25 o al menos 30 moléculas de ADN extracelular en circulación, teniendo cada una una secuencia que se encuentra dentro de una región cromosómica expuesta en la Tabla 4 distinta (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias C). En otra realización específica, se determina la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 25 o al menos 27 moléculas de ADN extracelular en circulación, teniendo cada una una secuencia que se encuentra dentro de una región cromosómica expuesta en la Tabla 5 distinta (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias D). En otra realización específica, se determina la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20 o al menos 25 moléculas de ADN extracelular en circulación, teniendo cada una una secuencia que se encuentra dentro de una región cromosómica expuesta en la Tabla 6 distinta (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias E). En otra realización específica, se determina la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30, 40 o al menos 45 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 7 distinta (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias F). En otra realización específica más, se determina la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 6, 7 o al menos 8 moléculas de ADN extracelular en circulación, teniendo cada una una secuencia que se encuentra dentro de una región cromosómica expuesta en la Tabla 8 distinta (o teniendo una secuencia de nucleótidos de al menos 25, 40, 50, 60, 75 o 100 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias G).
En una realización específica, el método para analizar ADN extracelular en circulación incluye las etapas de: aislar, de una muestra de sangre, suero o plasma de un paciente, sustancialmente todas las moléculas de ADN extracelular en circulación que tengan una longitud de al menos 20, 25, 30, 40, 50, 75 o 100 nucleótidos consecutivos de longitud, o entre 50 y 400 nucleótidos de longitud, obtener la secuencia de cada una de las moléculas de ADN extracelular en circulación y comparar la secuencia con una o más de las secuencias expuestas en las Tablas de Secuencias A-G) para determinar si la secuencia se encuentra dentro de una región cromosómica expuesta en las Tablas 2-8.
En otra realización específica, el método para analizar ADN extracelular en circulación incluye las etapas de: aislar, de una muestra de sangre, suero o plasma de un paciente, sustancialmente todas las moléculas de ADN extracelular en circulación que tengan una longitud de al menos 20, 25, 30, 40, 50, 75 o 100 nucleótidos consecutivos de longitud, o entre 50 y 400 nucleótidos de longitud, y poner en contacto las moléculas de ADN extracelular en circulación con una pluralidad de oligonucleótidos (por ejemplo, en un chip de ADN o micromatriz) para determinar si una de las moléculas de ADN extracelular en circulación hibrida con una cualquiera de la pluralidad de sondas de oligonucleótido en condiciones rigurosas. Cada una de las sondas de oligonucleótido tiene una secuencia de nucleótidos idéntica a una parte de la secuencia de una región cromosómica escogida de las Tablas 2-8 (o una secuencia expuesta en las Tablas de Secuencias A-G). Por lo tanto, si una molécula de ADN en circulación hibrida en condiciones rigurosas con una de las sondas de oligonucleótido, indica que la molécula de ADN en circulación tiene una secuencia de nucleótidos que se encuentra dentro de una región cromosómica expuesta en las Tablas 2-8.
En las diversas realizaciones anteriores, preferentemente, las moléculas de ADN extracelular en circulación tienen al menos 25 nucleótidos consecutivos de longitud (preferentemente al menos 50, 70, 80, 100, 120 o 200 nucleótidos consecutivos de longitud). Más preferentemente, las moléculas de ADN extracelular en circulación tienen entre aproximadamente 50 y aproximadamente 300 o 400, preferentemente entre aproximadamente 75 y aproximadamente 300 o 400, más preferentemente de aproximadamente 100 a aproximadamente 200 nucleótidos consecutivos de una secuencia singular dentro de una región cromosómica como se expone en las Tablas 2-8 (o de una secuencia singular expuesta en las Tablas de Secuencias A-G).
En otro aspecto, la presente invención proporciona un método para el diagnóstico o el cribado del cáncer de mama en un paciente. El método incluye las etapas de: (a) determinar, en una muestra que es sangre, suero o plasma de un paciente, la presencia o ausencia, o la cantidad de, al menos 1, 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30 o al menos 40 moléculas de ADN extracelular en circulación, teniendo cada una una secuencia de al menos 25 nucleótidos de longitud que se encuentra dentro de una región cromosómica distinta, como se expone en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en las Tablas de Secuencias A, B, C, D, E, F y G); y (b) correlacionar la presencia o una cantidad aumentada de los ADN extracelulares en circulación con una probabilidad aumentada de que el paciente tenga cáncer de mama.
Como alternativa, el método de la invención incluye las etapas de: (a) determinar, en una muestra que es sangre, suero o plasma de un paciente, la presencia o ausencia, o la cantidad de, al menos 1, 2, 3, 4, 5, 7 u 8 moléculas de ADN extracelular en circulación, teniendo cada una una secuencia de al menos 25 nucleótidos de longitud que se encuentra dentro de una región cromosómica expuesta en la Tabla 8 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias G); y (b) correlacionar la presencia o una cantidad aumentada de los ADN extracelulares en circulación con una probabilidad disminuida de que el paciente tenga cáncer de mama.
Cuando las etapas de los métodos anteriores se aplican a un paciente diagnosticado de cáncer, puede seguirse al paciente en cuanto al estado del cáncer de mama, o para determinar el efecto del tratamiento de un régimen de tratamiento particular, o para detectar la recaída o recidiva del cáncer.
En el método de diagnóstico/seguimiento de la presente invención, preferentemente, las secuencias de las moléculas de ADN extracelular en circulación carecen de un elemento repetitivo. En las realizaciones preferentes, las moléculas de ADN extracelular tienen secuencias que se encuentran dentro de distintas regiones cromosómicas en la misma tabla, que se escoge de las Tablas 2-8.
En una realización, se proporciona un método de diagnóstico del cáncer de mama en un individuo, que comprende (a) determinar la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30 o al menos 32 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 2 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias A); y (b) correlacionar la presencia de, o una presencia aumentada de, una o más de las moléculas de ADN extracelular en circulación con una probabilidad aumentada de que el individuo tenga cáncer de mama o una recaída del cáncer de mama, o una ineficacia del tratamiento para el cáncer de mama.
En otra realización, se proporciona un método de diagnóstico del cáncer de mama en un individuo, que comprende (a) determinar la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30 o al menos 37 moléculas de a Dn extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 3 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias B); y (b) correlacionar la presencia de, o una presencia aumentada de, una o más de las moléculas de ADN extracelular en circulación con una probabilidad aumentada de que el individuo tenga cáncer de mama o una recaída del cáncer de mama, o una ineficacia del tratamiento para el cáncer de mama.
En una realización, se proporciona un método de diagnóstico/seguimiento del cáncer de mama en un individuo, que comprende (a) determinar la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 25 o al menos 30 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 4 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias C); y (b) correlacionar la presencia de, o una presencia aumentada de, una o más de las moléculas de ADN extracelular en circulación con una probabilidad aumentada de que el individuo tenga cáncer de mama o una recaída del cáncer de mama, o una ineficacia del tratamiento para el cáncer de mama.
En una realización, se proporciona un método de diagnóstico/seguimiento del cáncer de mama en un individuo, que comprende (a) determinar la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 25 o al menos 27 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 5 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias D); y (b) correlacionar la presencia de, o una presencia aumentada de, una o más de las moléculas de ADN extracelular en circulación con una probabilidad aumentada de que el individuo tenga cáncer de mama o una recaída del cáncer de mama, o una ineficacia del tratamiento para el cáncer de mama.
En una realización, se proporciona un método de diagnóstico/seguimiento del cáncer de mama en un individuo, que comprende (a) determinar la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20 o al menos 25 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 6 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias E); y (b) correlacionar la presencia de, o una presencia aumentada de, una o más de las moléculas de ADN extracelular en circulación con una probabilidad aumentada de que el individuo tenga cáncer de mama o una recaída del cáncer de mama, o una ineficacia del tratamiento para el cáncer de mama.
En una realización, se proporciona un método de diagnóstico/seguimiento del cáncer de mama en un individuo, que comprende (a) determinar la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7, 8, 9, 10, 15, 20, 30, 40 o al menos 45 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 7 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias F); y (b) correlacionar la presencia de, o una presencia aumentada de, una o más de las moléculas de ADN extracelular en circulación con una probabilidad aumentada de que el individuo tenga cáncer de mama o una recaída del cáncer de mama, o una ineficacia del tratamiento para el cáncer de mama.
En una realización, se proporciona un método de diagnóstico/seguimiento del cáncer de mama en un individuo, que comprende (a) determinar la presencia o ausencia, o las cantidades de, al menos 2, 3, 4, 5, 7 o al menos 8 moléculas de ADN extracelular en circulación, encontrándose la secuencia de cada una dentro de una región cromosómica expuesta en la Tabla 8 distinta (o teniendo una secuencia de nucleótidos de al menos 25 nucleótidos consecutivos de longitud que es parte de una de las secuencias expuestas en la Tabla de Secuencias G); y (b) correlacionar la presencia de, o una presencia aumentada de, una o más de las moléculas de ADN extracelular en circulación con una probabilidad aumentada de que el individuo no tenga cáncer de mama o una recaída del cáncer de mama, o una ineficacia del tratamiento para el cáncer de mama.
En otra realización más, el método de diagnóstico, seguimiento o cribado del cáncer de mama en un paciente, incluye determinar, en una muestra que es sangre, suero o plasma del paciente, la presencia o ausencia, o la cantidad total de, todos y cada uno de los ADN extracelulares en circulación, teniendo cada uno una secuencia que se encuentra dentro de la misma una única región cromosómica expuesta en una tabla escogida de las Tablas 2-7; y correlacionar la presencia de uno de dichos ADN extracelulares en circulación o de una cantidad total aumentada de dichos ADN extracelulares en circulación, con una probabilidad aumentada de que dicho paciente tenga cáncer de mama o una recaída del cáncer de mama. En otras palabras, puede haber cualquier cantidad de, y normalmente muchas, distintas moléculas de ADN extracelular en circulación procedentes de una misma única región cromosómica expuesta en las Tablas 2-7, y todas tales distintas moléculas de ADN extracelular en circulación se detectan y/o se determina la cantidad de cada una o todas, y se realiza la correlación con el estado del cáncer de mama.
En una realización específica, sustancialmente todas las moléculas de ADN extracelular en circulación que tengan una longitud de al menos 20, 25, 30, 40, 50, 75 o 100 nucleótidos consecutivos de longitud, o entre 50 y 400 nucleótidos de longitud, se aíslan de una muestra de sangre, suero o plasma de un paciente. Se determina la secuencia de cada una de las moléculas de ADN extracelular en circulación y se compara con una o más de las secuencias expuestas en las Tablas de Secuencias A-G) para determinar si la secuencia de un ADN extracelular en circulación se encuentra dentro de una región cromosómica expuesta en las Tablas 2-7. Si es así, se realiza un diagnóstico de cáncer de mama. En el caso de un paciente tratado con una terapia para el cáncer de mama, la recaída está indicada si se detecta un ADN extracelular en circulación dentro de una región cromosómica expuesta en las Tablas 2-8. En las realizaciones preferentes, se indica un diagnóstico de cáncer de mama o ineficacia del tratamiento o recaída del cáncer de mama, si dos o más moléculas de ADN extracelular en circulación se encuentran dentro de 2, 3, 4 o más regiones cromosómicas expuestas en las Tablas 2-8, estando preferentemente todas tales regiones cromosómicas en la misma tabla.
En otra realización específica, sustancialmente todas las moléculas de ADN extracelular en circulación que tengan una longitud de al menos 20, 25, 30, 40, 50, 75 o 100 nucleótidos consecutivos de longitud, o entre 50 y 400 nucleótidos de longitud, se aíslan de una muestra de sangre, suero o plasma de un paciente. Estas moléculas de ADN extracelular en circulación se hibridan con una micromatriz que se describe anteriormente en el contexto de la invención del kit, para determinar si una de las moléculas de ADN extracelular en circulación hibrida con una cualquiera de una pluralidad de sondas de oligonucleótido en condiciones rigurosas. Cada una de las sondas de oligonucleótido tiene una secuencia de nucleótidos idéntica a una parte de la secuencia de una región cromosómica escogida de las Tablas 2-8 (o una secuencia expuesta en las T ablas de Secuencias A-G). Por lo tanto, si una molécula de ADN en circulación hibrida en condiciones rigurosas con una de las sondas de oligonucleótido, indica que la molécula de ADN en circulación tiene una secuencia de nucleótidos que se encuentra dentro de una región cromosómica expuesta en las Tablas 2-7. Si es así, se realiza un diagnóstico de cáncer de mama. En el caso de un paciente tratado con una terapia para el cáncer de mama, la recaída está indicada si se detecta un ADN extracelular en circulación dentro de una región cromosómica expuesta en las Tablas 2-7. En las realizaciones preferentes, se indica un diagnóstico de cáncer de mama o ineficacia del tratamiento o recaída del cáncer de mama, si dos o más moléculas de ADN extracelular en circulación se encuentran dentro de 2, 3, 4 o más regiones cromosómicas expuestas en las Tablas 2-7, estando preferentemente todas tales regiones cromosómicas en la misma tabla, por ejemplo, la T abla 2, 3, 4, 5, 6, 7.
En las diversas realizaciones anteriores, preferentemente, las moléculas de ADN extracelular en circulación tienen al menos 25 nucleótidos consecutivos de longitud (preferentemente al menos 50, 70, 80, 100, 120 o 200 nucleótidos consecutivos de longitud). Más preferentemente, las moléculas de ADN extracelular en circulación tienen entre aproximadamente 50 y aproximadamente 300 o 400, preferentemente entre aproximadamente 75 y aproximadamente 300 o 400, más preferentemente de aproximadamente 100 a aproximadamente 200 nucleótidos consecutivos de una secuencia singular dentro de una región cromosómica como se expone en las Tablas 2-8 (o de una secuencia singular expuesta en las Tablas de Secuencias A-G).
Detección de ácidos nucleicos en circulación en la sangre
Para detectar la presencia de ácidos nucleicos en circulación en la sangre de pacientes que pueden tener, o se sospecha que tienen, cáncer de mama, se obtiene una muestra de sangre del paciente. Después, se analiza el suero o plasma de la muestra de sangre en cuanto a la presencia de un ADN extracelular en circulación o biomarcador, como se describe en el presente documento. Los ácidos nucleicos pueden aislarse del suero o plasma usando técnicas muy conocidas, véase, por ejemplo, las secciones de los ejemplos. En el contexto de la actual invención, las secuencias de ácido nucleico que se analizan son secuencias de ADN. Por lo tanto, en esta sección, los métodos descritos para la evaluación de "ácidos nucleicos" se refieren a la evaluación de ADN.
Las técnicas de detección para evaluar ácidos nucleicos en cuanto a la presencia de un biomarcador implican procedimientos muy conocidos en el campo de la genética molecular. Además, muchos de los métodos implican la amplificación de ácidos nucleicos. Se proporciona en la técnica una amplia guía para llevarlos a cabo. Las referencias ejemplares incluyen manuales tales como PCR Technology: Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press, NY, N.Y., 1992); PCR Protocols: A Guide to Methods and Applications (eds. Innis, et al., Academic Press, San Diego, Calif., 1990); Current Protocols in Molecular Biology, Ausubel, 1994-1999, incluyendo actualizaciones suplementarias hasta abril de 2004; Sambrook & Russell, Molecular Cloning, A Laboratory Manual (3a Ed, 2001).
Aunque los métodos pueden emplear etapas de PCR, también se pueden usar otros protocolos de amplificación. Los métodos de amplificación adecuados incluyen la reacción en cadena de la ligasa (véase, por ejemplo, Wu y Wallace, Genomics 4:560-569, 1988); ensayo de desplazamiento de cadena (véase, por ejemplo, Walker et al., Proc. Natl. Acad. Sci. USA 89:392-396, 1992; patente de Estados Unidos N.° 5.455.166; y varios sistemas de amplificación basados en transcripción, incluyendo los métodos descritos en las patentes de Estados Unidos n.° 5.437.990; 5.409.818; y 5.399.491; el sistema de amplificación basado en transcripción (TAS) (Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173-1177, 1989); y replicación de secuencia autosostenida (3s R) (Guatelli et al., Proc. Natl. Acad. Sci. USA 87:1874-1878, 1990; documento WO 92/08800). Como alternativa, se pueden usar métodos que amplifican la sonda hasta niveles detectables, tal como la amplificación por Qp-replicasa (Kramer y Lizardi, Nature 339:401-402, 1989; Lomeli et al., Clin. Chem. 35:1826-1831,1989). Se proporciona una revisión de los métodos de amplificación conocidos, por ejemplo, por Abramson y Myers en Current Opinion in Biotechnology 4:41-47, 1993.
En algunas realizaciones, la detección de un biomarcador en el CNA de un paciente se realiza utilizando cebadores oligonucleotídicos y/o sondas para detectar una secuencia diana, en donde la secuencia diana está presente en (por ejemplo, comprende alguna porción asignada de forma inequívoca de) cualquiera de las regiones cromosómicas enumeradas en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8. Los oligonucleótidos pueden prepararse mediante cualquier método adecuado, habitualmente síntesis química, y también se pueden adquirir a través de fuentes comerciales. Los oligonucleótidos pueden incluir enlaces fosfodiéster modificados (por ejemplo, fosforotioato, metilfosfonatos, fosfoamidato o boranofosfato) o pueden usarse en un oligonucleótido enlaces distintos de un derivado de ácido fosforoso, para impedir la escisión en un sitio seleccionado. Además, el uso de azúcares modificados en 2'-amino tiende a favorecer el desplazamiento sobre la digestión del oligonucleótido cuando se hibrida con un ácido nucleico que también es el molde para la síntesis de una nueva cadena de ácido nucleico.
En una realización, el biomarcador se identifica por hibridación con una sonda que se dirige a una región cromosómica descrita en el presente documento, en condiciones de hibridación específicas de secuencia. La sonda utilizada para este análisis puede ser una sonda larga o conjuntos para sondas de oligonucleótido cortas, por ejemplo, pueden emplearse de aproximadamente 20 o aproximadamente 150 nucleótidos de longitud.
Los formatos de hibridación adecuados son muy conocidos en la técnica, incluyendo, pero sin limitación, de fase en solución, fase sólida, formatos de matrices de oligonucleótidos, de fase mixta o ensayos de hibridación in situ. En las hibridaciones en fase de solución (o líquida), tanto el ácido nucleico diana como la sonda o los cebadores están libres para interactuar en la mezcla de reacción. Además se han desarrollado técnicas, tales como los sistemas de PCR en tiempo real, que permiten el análisis, por ejemplo, la cuantificación de productos amplificados durante una reacción de PCR. En este tipo de reacción, la hibridación con una sonda de oligonucleótido específica se produce durante el programa de amplificación, para identificar la presencia de un ácido nucleico diana. La hibridación de sondas oligonucleotídica asegura la mayor especificidad debido a la transición de dos estados controlada termodinámicamente. Los ejemplos de este formato de ensayo son las sondas de hibridación de transferencia de energía de resonancia de fluorescencia, balizas moleculares, escorpiones moleculares y sondas de hibridación con exonucleasas (por ejemplo, revisado en Bustin, J. Mol. Endocrin. 25:169-93, 2000).
Los formatos de ensayo adecuados incluyen formatos basados en matrices, descritos con mayor detalle a continuación en la sección "Dispositivo", donde la sonda normalmente está inmovilizada. Como alternativa, puede inmovilizarse la diana.
En un formato donde la diana está inmovilizada, el ADN diana amplificado se inmoviliza en un soporte sólido y el complejo diana se incuba con la sonda en condiciones de hibridación adecuadas, la sonda no hibridada se elimina lavando en condiciones rigurosas de forma adecuada y el soporte sólido se controla en cuanto a la presencia de sonda unida. En formatos en donde las sondas se inmovilizan en un soporte sólido, el ADN diana normalmente su marca, habitualmente durante la amplificación. La sonda inmovilizada se incuba con el ADN diana amplificado en condiciones de hibridación adecuadas, el ADN diana no hibridado se elimina lavando en condiciones rigurosas de forma adecuada y el soporte sólido/sonda se controla en cuanto a la presencia de ADN diana unido.
En realizaciones típicas, se inmovilizan múltiples sondas en un soporte sólido y se analizan las regiones cromosómicas diana en el CNA de un paciente utilizando las múltiples sondas de forma simultánea. El documento WO 95/11995 describe ejemplos de matrices de ácidos nucleicos.
En un método alternativo sin sondas, el ácido nucleico amplificado correspondiente a un ácido nucleico diana presente en una región cromosómica se realiza utilizando cebadores de ácido nucleico para la región cromosómica y se detecta controlando el aumento de la cantidad total de ADN de bicatenario en el mezcla de reacción; se describe, por ejemplo, en la patente de Estados Unidos N.° 5.994.056; y en las publicaciones de patente europea n.° 487.218 y 512.334. La detección de ADN diana bicatenario está basada en la fluorescencia aumentada de diversos colorantes de unión a ADN, por ejemplo, SYBR Green, que se presenta cuando se unen a ADN bicatenario.
Como aprecia un experto en la materia, se pueden realizar en la reacción métodos específicos de amplificación que emplean múltiples cebadores para dirigirse a las regiones cromosómicas, de forma que el biomarcador pueda abarcarse adecuadamente.
Secuenciación de ADN
En las realizaciones preferentes, la presencia de una secuencia de una región cromosómica expuesta en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la tabla 7 o la tabla 8 en el CNA de un sometido a evaluación se detecta mediante secuenciación directa. Dicha secuenciación, especialmente utilizando los sistemas de secuenciación de Roche 454, Illumina y de Applied Biosystems mencionados a continuación o sistemas de secuenciación avanzada similares, puede incluir la cuantificación (es decir, la determinación del nivel) de ácidos nucleicos que tienen una secuencia particular. Dicha cuantificación puede usarse en las realizaciones de la invención que implican determinar el nivel de un biomarcador (algunas realizaciones de las cuales implican la correlación de un nivel particular con la presencia o ausencia de cáncer). Los métodos incluyen, por ejemplo, métodos basados en la secuenciación por dideoxi, aunque también se conocen otros métodos tales como la secuenciación de Maxam y Gilbert (véase, por ejemplo, Sambrook y Russell, anteriormente citado). En realizaciones típicas, se secuencia el CNA de un paciente utilizando un método de secuenciación a gran escala que proporciona la capacidad de obtener información de las secuencias a partir de muchas lecturas. Dichas plataformas de secuenciación incluyen las comercializadas por Roche 454 Life Sciences (sistemas GS), Illumina (por ejemplo, HiSeq, MiSeq) y Applied Biosystems (por ejemplo, SOLiD systems).
La plataforma de secuenciación de Roche 454 Life Sciences implica el uso de PCR en emulsión y la inmovilización de fragmentos de ADN en perlas. La incorporación de nucleótidos durante la síntesis se detecta midiendo la luz que se genera cuando se incorpora un nucleótido.
La tecnología Illumina implica la unión de ADN genómico fragmentado aleatoriamente a una superficie plana, ópticamente transparente. Los fragmentos de ADN unidos se extienden y se amplifican en puente para crear una celda de flujo de secuenciación de densidad ultra alta con conglomerados que contienen copias del mismo molde. Estos moldes se secuencian utilizando una tecnología de secuenciación por síntesis que emplea terminadores reversibles con colorantes fluorescentes que se pueden eliminar.
Además, se pueden usar métodos que emplean secuenciación por hibridación. Dichos métodos, utilizados, por ejemplo, en la tecnología ABI SOLiD4+, utilizan un conjunto de todos los posibles oligonucleótidos de una longitud fija, marcados de acuerdo con la posición secuenciada. Los oligonucleótidos se hibridan y ligan; el ligamiento preferencial mediante la ADN ligasa para aparear secuencias da como resultado una señal informativa del nucleótido en esa posición.
La secuencia se puede determinar utilizando cualquier otro método de secuenciación de ADN, incluyendo, por ejemplo, métodos que utilizan tecnología de semiconductores para detectar nucleótidos que se incorporan en un cebador extendido, midiendo los cambios en la corriente que se producen cuando se incorpora un nucleótido (véase, por ejemplo, en las publicaciones de solicitud de patente de Estados Unidos n.° 20090127589 y 20100035252). Otras técnicas incluyen la secuenciación directa por exonucleasa sin marcador en la que los nucleótidos escindidos del ácido nucleico se detectan al pasar a través de un nanoporo (Oxford Nanopore) (Clark et al., Nature Nanotechnology 4: 265 - 270, 2009); y la tecnología de secuenciación de ADN Single Molecule Real Time (SMRT™) (Pacific Biosciences), que es una técnica de secuenciación por síntesis.
Dispositivos y kits
En un aspecto adicional, la invención proporciona dispositivos de diagnóstico y kits útiles para identificar uno o más biomarcadores asociados con cáncer de mama en el CNA de un paciente, donde el uno o más biomarcadores es una secuencia que corresponde a cualquiera de las regiones cromosómicas expuestas en la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5, la Tabla 6, la Tabla 7 o la Tabla 8. Como será evidente para los expertos en la materia, el kit de la presente invención es útil en el método discutido anteriormente para analizar el ADN extracelular en circulación en una muestra del paciente y en el diagnóstico, seguimiento o cribado del cáncer de mama, como se describe anteriormente.
Por lo tanto, en un aspecto, la presente invención proporciona el uso de al menos un oligonucleótido para la fabricación de un kit de diagnóstico útil en el diagnóstico, seguimiento o cribado del cáncer de mama. La secuencia de nucleótidos del oligonucleótido se encuentra dentro de una región cromosómica expuesta en las Tablas 2, 3, 4, 5, 6, 7 y 8 (o coincide con una parte de una secuencia expuesta en las Tablas de Secuencias A, B, C, D, E, F y G).
Preferentemente, el kit de la presente invención incluye uno, dos o más (por ejemplo, al menos 1,2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 20, 25, 30 o al menos 40, pero preferentemente menos de 60, preferentemente de uno a aproximadamente 50, más preferentemente de 2 a aproximadamente o de 3 a aproximadamente 50) conjuntos de oligonucleótidos. Cada conjunto comprende uno o más oligonucleótidos (por ejemplo, de aproximadamente uno a aproximadamente 10.000, preferentemente de 50, 100, 200 o 300 a aproximadamente 10.000). Todas las secuencias de nucleótidos de tales uno o más oligonucleótidos en cada conjunto se encuentran dentro de la misma una única región cromosómica que se expone en una tabla escogida de las Tablas 2, 3, 4, 5, 6, 7 u 8 (o que coinciden con una parte de la misma secuencia una única expuesta en las Tablas de Secuencias A, B, C, D, E, F o G). Cada oligonucleótido debe tener de aproximadamente 18 a 100 nucleótidos, o de 20 a aproximadamente 50 nucleótidos, y tiene la capacidad de hibridar, en condiciones de hibridación rigurosas, con la región cromosómica en la que se encuentra su secuencia. Los oligonucleótidos son útiles como sondas para detectar moléculas de ADN extracelulares en circulación procedentes de las regiones cromosómicas. Preferentemente, cada conjunto incluye un número suficiente de oligonucleótidos con secuencias que mapean en una región cromosómica, de forma que cualquier molécula de ADN extracelular en circulación procedente de la región cromosómica puede detectarse con el conjunto de oligonucleótidos. Por lo tanto, el número de oligonucleótidos necesarios en cada conjunto está determinado por la longitud total de la secuencia de nucleótidos singular de una región cromosómica particular, como será evidente para los expertos en la materia. Dichas longitudes totales se indican en las Tablas 2-8 y también deben ser evidentes a partir de las Tablas de Secuencias A, B, C, D, E, F y G.
Preferentemente, en el kit de la presente invención, los distintos conjuntos de oligonucleótidos corresponden a distintas regiones cromosómicas dentro de la misma tabla. Preferentemente, los oligonucleótidos carecen de un elemento repetitivo. Opcionalmente, los oligonucleótidos se unen a uno o más sustratos sólidos, tales como microchips y perlas. En las realizaciones preferentes, el kit es una micromatriz con los oligonucleótidos anteriores.
En una realización, el kit de la presente invención incluye una pluralidad de conjuntos de oligonucleótidos que tiene la capacidad de hibridar con las regiones cromosómicas expuestas en las Tablas 2, 3, 4, 5, 6, 7 y 8, respectivamente. Es decir, el kit incluye sondas de oligonucleótido que corresponden a todas y cada una de las regiones cromosómicas expuestas en las Tablas 2, 3, 4, 5, 6, 7 y 8 (o que coinciden con todas y cada una de las secuencias expuestas en las Tablas de Secuencias A, B, C, D, E, F o G) de forma que todo el ADN extracelular en circulación procedente de cualquier región cromosómica expuesta en las Tablas 2-8 se puede detectar usando el kit.
También se contempla el uso de los oligonucleótidos incluidos en el kit descrito para la fabricación del kit útil para el diagnóstico, el cribado o el seguimiento del cáncer de mama. La fabricación de tal kit debe ser evidente para un experto en la materia.
En algunas realizaciones, un dispositivo de diagnóstico comprende sondas para detectar al menos 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20 o 50, o al menos 100 de las regiones cromosómicas expuestas en las Tablas 2-8. En algunas realizaciones, la presente invención proporciona sondas unidas a un soporte sólido, tal como un portaobjetos de matriz o chip, por ejemplo, como se describe en DNA Microarrays: A Molecular Cloning Manual, 2003, Eds. Bowtell y Sambrook, Cold Spring Harbor Laboratory Press. La construcción de tales dispositivos es muy conocida en la técnica, como se describe, por ejemplo, en las patentes de Estados Unidos y las publicaciones de patentes de Estados Unidos n.° 5.837.832; solicitud PCT WO95/11995; patente de Estados Unidos n.° 5.807.522; patentes de Estados Unidos n.° 7.157.229, 7.083.975, 6.444.175, 6.375.903, 6.315.958, 6.295.153 y 5.143.854, 2007/0037274, 2007/0140906, 2004/0126757, 2004/0110212, 2004/0110211,2003/0143550, 2003/0003032 y 2002/0041420. Las matrices de ácidos nucleicos también se revisan en las siguientes referencias: Biotechnol Annu Rev 8:85-101 (2002); Sosnowski et al., Psychiatr Genet 12(4): 181-92 (dic. de 2002); Heller, Annu Rev Biomed Eng 4: 129-53 (2002); Kolchinsky et al., Hum. Mutat 19(4):343-60 (abril de 2002); y McGail et al, Adv Biochem Eng Biotechnol 77:21-42 (2002).
Puede implementarse en una matriz cualquier número de sondas. Se puede usar un conjunto de sondas que hibrida con distintos segmentos, preferentemente singulares, de una región cromosómica, donde el conjunto de sondas detecta cualquier parte de la región cromosómica. Como alternativa, se puede inmovilizar sobre una superficie sólida una única sonda para una región cromosómica. La sonda polinucleotídica puede sintetizarse en áreas designadas (o sintetizarse por separado y después fijarse en áreas designadas) en un sustrato, por ejemplo, utilizando un procedimiento químico dirigido por luz. Los polinucleótidos sintéticos típicos pueden ser de aproximadamente 15-200 nucleótidos de longitud.
El kit puede incluir múltiples reactivos de detección de biomarcadores, o uno o más reactivos de detección de biomarcadores en combinación con uno o más de otros tipos de elementos o componentes (por ejemplo, otros tipos de reactivos bioquímicos, recipientes, embalajes tales como el embalaje para la venta comercial, sustratos a los que se están unidos los reactivos de detección de biomarcadores, componentes electrónicos de hardware, etc.). Por consiguiente, la presente invención proporciona adicionalmente kits y sistemas de detección de biomarcadores, que incluyen, pero sin limitación, matrices/micromatrices de moléculas de ácido nucleico y perlas que contienen una o más sondas u otros reactivos de detección para detectar uno o más biomarcadores de la presente invención. Los kits pueden incluir opcionalmente diversos componentes electrónicos de hardware; por ejemplo, las matrices ("chips de ADN") y los sistemas microfluidos (sistemas de "laboratorio en un chip") proporcionados por diversos fabricantes normalmente comprenden componentes de hardware. Otros kits pueden no incluir componentes electrónicos de hardware, pero pueden estar compuesto de, por ejemplo, uno o más reactivos de detección de biomarcadores (junto con, opcionalmente, otros reactivos bioquímicos) envasados en uno o más recipientes.
Los kits/sistemas de detección de biomarcadores pueden contener, por ejemplo, una o más sondas, o conjuntos de sondas, que hibridan con una molécula de ácido nucleico presente en una región cromosómica expuesta en las Tablas 2-8.
Un kit de detección de biomarcadores de la presente invención puede incluir componentes que se usan para preparar CNA a partir de una muestra de sangre de un paciente, para la posterior amplificación y/o detección de un biomarcador.
Correlación de la presencia de biomarcadores con cáncer de mama
La presente invención proporciona métodos y reactivos para detectar la presencia de un biomarcador en CNA de un paciente que tiene cáncer de mama o que se está evaluando para determinar si el paciente puede tener cáncer de mama. En el contexto de la invención, "detección" o "identificación" o "identificación de la presencia" o "detección de la presencia" de un biomarcador asociado con la presencia o ausencia de cáncer de mama en una muestra de CNA de un paciente, se refiere a la determinación de cualquier nivel del biomarcador en el CNA del paciente, donde el nivel es mayor que un valor umbral que distingue entre muestras de CNA de cáncer de mama y sin cáncer de mama para un ensayo dado.
En la presente invención, por ejemplo, la presencia de uno cualquiera de los biomarcadores enumerados en las Tablas 2-7 es indicativa de cáncer de mama. Como aprecia un experto en la materia, los biomarcadores pueden emplearse en el análisis de una muestra de un paciente, cuando el biomarcador también se ha observado de forma infrecuente en un paciente normal, para aumentar la sensibilidad de la detección. Por ejemplo, se ha observado que los biomarcadores indicados en negrita en las Tablas 3 y 4 están presentes de forma infrecuente en el CNA obtenido de individuos normales; sin embargo, dada la baja frecuencia de aparición en muestras normales con respecto a la frecuencia de aparición más alta en cáncer de mama, la presencia del biomarcador en un paciente indica que el paciente tiene una probabilidad del 90 % o más de tener cáncer de mama. Por lo tanto, por ejemplo, las matrices utilizadas para detectar las regiones cromosómicas pueden incluir las que identifican las regiones cromosómicas de las Tablas 3 y 4 que están indicadas en negrita.
Los biomarcadores expuestos en las Tablas 1-7 están asociados con el cáncer de mama, es decir, están sobrerrepresentados en pacientes con cáncer de mama, en comparación con individuos no diagnosticados de cáncer de mama. Por lo tanto, la detección de uno o más de los biomarcadores expuestos en las Tablas 1-7 es indicativa de cáncer de mama, es decir, el paciente tiene una probabilidad aumentada de tener cáncer de mama en comparación con un paciente que no tiene el biomarcador. En algunas realizaciones, la detección de dos o más biomarcadores expuestos en las Tablas 1-7 en el CNA de un paciente es indicativa de una mayor probabilidad de cáncer de mama. Como se entiende en la técnica, también se emplean otros criterios, por ejemplo, criterios clínicos, mamografía, etc., para diagnosticar el cáncer de mama en el paciente. Por consiguiente, los pacientes que tienen un biomarcador asociado con el cáncer de mama también se someten a otros procedimientos de diagnóstico.
En algunas realizaciones, pueden detectarse en el CNA de un paciente uno o más biomarcadores que están subrepresentados en el cáncer de mama. Por lo tanto, por ejemplo, puede detectarse en una muestra de CNA de un paciente un biomarcador enumerado en la Tabla 8, donde la detección del biomarcador es indicativa de un diagnóstico normal, es decir, que el paciente que no tiene cáncer de mama.
"Sobrerrepresentado" o "cantidad aumentada" significa que el nivel de uno o más los ADN extracelulares en circulación es más alto que los niveles normales. En general, esto significa un aumento del nivel en comparación con un valor de índice. A la inversa, "subrepresentado" o "cantidad disminuida" significa que el nivel de una o más moléculas de ADN extracelular en circulación es más bajo que los niveles normales. En general, esto significa una disminución del nivel en comparación con un valor de índice.
En las realizaciones preferentes, el valor de prueba que representa la cantidad de un ADN extracelular en circulación particular se compara con uno o más valores de referencia (o valores de índice), y se correlaciona opcionalmente con el cáncer de mama o la recaída del cáncer. Opcionalmente, se indica una mayor probabilidad de cáncer de mama si el valor de prueba es mayor que el valor de referencia.
Los expertos en la materia están familiarizados con diversas formas de obtener y usar valores de índice. Por ejemplo, el valor de índice puede representar el número de copias o la concentración de un ADN extracelular particular de acuerdo con la presente invención, en una muestra de sangre de un paciente de interés en estado saludable, en cuyo caso un número de copias o la concentración en una muestra del paciente, en un momento o estado distinto, significativamente mayor (por ejemplo, 1,5 veces, 2 veces, 3 veces, 4 veces, 5 veces, 10 veces, 20 veces, 30 veces, 40 veces, 50 veces, 100 veces o más alto) que este valor de índice indicaría, por ejemplo, cáncer de mama o un probabilidad aumentada de recaída del cáncer.
Como alternativa, el valor de índice puede representar la concentración promedio o el número de copias de un ADN extracelular en circulación particular para un conjunto de individuos de una población de cáncer diversa o un subconjunto de la población. Por ejemplo, se puede determinar el número promedio de copias o la concentración de un ADN extracelular en circulación en un muestreo aleatorio de pacientes con cáncer de mama. Por lo tanto, los pacientes que tienen un número de copias o concentración (valor de prueba) comparable o superior que este valor, se identifican como que tienen una mayor probabilidad de tener cáncer de mama o recaída del cáncer de mama que los que tienen un valor de prueba más bajo que este valor.
A menudo, el número de copias o la cantidad (por ejemplo, la concentración) se considerará "aumentada" o "disminuida" solo si difiere significativamente del valor de índice, por ejemplo, al menos una diferencia de 1,5 veces en el número de copias o la concentración absoluta o relativa (por ejemplo, como refleja la señal de hibridación).
Un valor de índice útil puede representar el número de copias o la concentración de un ADN extracelular en circulación particular o de una combinación (adición ponderada o directa) de dos o más de los ADN extracelulares en circulación correspondientes a la misma región cromosómica o a regiones cromosómicas distintas. Cuando se usan en el método de diagnóstico/seguimiento dos o más biomarcadores o moléculas de ADN extracelular en circulación, puede ponderarse y combinarse la presencia o ausencia de, o la cantidad de, cada biomarcador o ADN extracelular en circulación. Por lo tanto, se puede proporcionar un valor de prueba (a) ponderando el estado determinado o la cantidad de cada molécula de ADN extracelular en circulación con un coeficiente predefinido, y (b) combinando el estado o la cantidad ponderado para proporcionar un valor de prueba. La etapa de combinación puede ser por adición directa o promediando (es decir, ponderado por igual) o por un coeficiente predefinido distinto.
La información obtenida del análisis de biomarcadores se puede almacenar en una forma legible por ordenador. Dicho sistema informático normalmente comprende los subsistemas principales, tales como un procesador central, una memoria de sistema (generalmente RAM), un controlador de entrada/salida (E/S), un dispositivo externo tal como una pantalla de visualización a través de un adaptador de pantalla, puertos serie, un teclado, una unidad de disco fija a través de una interfaz de almacenamiento y una unidad de disquete operativa para recibir un disquete, y un dispositivo de CD-ROM (o DVD-ROM) operativo para recibir un CD-ROM. Se pueden conectar muchos otros dispositivos, tal como una interfaz de red conectada a través de un puerto serie.
El sistema informático también puede estar vinculado a una red, que comprenda una pluralidad de dispositivos informáticos vinculados a través de un enlace de datos, tal como un cable Ethernet (coaxial o 10BaseT), línea telefónica, línea ISDN, red inalámbrica, fibra óptica u otro medio de transmisión de señales adecuado, mediante lo cual al menos un dispositivo de red (por ejemplo, ordenador, conjunto de discos, etc.) comprende un patrón de dominios magnéticos (por ejemplo, disco magnético) y/o dominios de carga (por ejemplo, una matriz de celdas DRAM) que componen un patrón de bits que codifican datos adquiridos a partir de un ensayo de la invención.
El sistema informático puede comprender un código para interpretar los resultados de un estudio que evalúa la presencia de uno o más de los biomarcadores. Por lo tanto, en una realización ejemplar, los resultados del análisis de biomarcadores se proporcionan a un ordenador donde un procesador central ejecuta un programa informático para determinar la probabilidad de que un paciente tenga cáncer de mama.
La invención también proporciona el uso de sistema informático, tal como el que se describe anteriormente, que comprende: (1) un ordenador; (2) un patrón de bits almacenado que codifica los resultados de las pruebas de biomarcadores obtenidos por los métodos de la invención, que puede almacenarse en el ordenador; (3) y, opcionalmente, (4) un programa para determinar la probabilidad de que un paciente tenga cáncer de mama.
La invención proporciona adicionalmente métodos para generar un informe basado en la detección de uno o más biomarcadores expuestos en las Tablas 2-8.
Por lo tanto, la presente invención proporciona sistemas relacionados con los métodos de la invención anteriores. En una realización, la invención proporciona un sistema para analizar ADN extracelular en circulación, que comprende: (1) un analizador de muestras para ejecutar el método de análisis de ADN extracelular en circulación en la sangre, suero o plasma de un paciente, como se describe en las diversas realizaciones anteriores; (2) un sistema informático para recibir y analizar automáticamente los datos obtenidos en la etapa (1), para proporcionar un valor de prueba que represente el estado (presencia o ausencia, o cantidad, es decir, concentración o número de copias) de una o más moléculas de ADN extracelular en circulación que tengan una secuencia de nucleótidos de al menos 25 nucleótidos que se encuentra dentro de una región cromosómica expuesta en las Tablas 2, 3, 4, 5, 6, 7 y 8 (o que tienen una secuencia de nucleótidos que es parte de una de las secuencias expuestas en las Tablas de Secuencias A, B, C, D, E, F y G) y, opcionalmente, para comparar el valor de prueba con uno o más valores de referencia, cada uno asociado con un estado predeterminado de cáncer de mama. En algunas realizaciones, el sistema comprende adicionalmente un módulo de visualización que presenta la comparación entre el valor de prueba y el uno o más valores de referencia, o que presenta un resultado de la etapa de comparación.
Por lo tanto, como será evidente para los expertos en la materia, el analizador de muestras puede ser, por ejemplo, una máquina de secuenciación (por ejemplo, Illumina HiSeq™, Ion Torrent PGM, el secuenciador ABI sOlíD™, PacBio Rs , Helicos Heliscope™, etc.), una máquina de PCR (por ejemplo, ABI 7900, Fluidigm BioMark™, etc.), un instrumento de micromatrices, etc.
En una realización, el analizador de muestras es un instrumento de secuenciación, por ejemplo, un instrumento de secuenciación de última generación, tal como los sistemas GS de Roche, los sistemas HiSeq y MiSeq de Illumina, y SOLiD de Life Technologies. Las moléculas de ADN extracelular en circulación se aíslan de la sangre o suero o plasma de un paciente, y las secuencias de todas las moléculas de ADN extracelular en circulación se obtienen mediante el analizador de muestras. El instrumento de secuenciación se usa para secuenciar las moléculas de ADN extracelular en circulación y obtener las secuencias de estas moléculas. Después, se emplea un sistema informático para analizar automáticamente las secuencias para determinar la presencia o ausencia, o la cantidad de una molécula de ADN extracelular en circulación que tiene una secuencia de nucleótidos de al menos 25 nucleótidos que se encuentra dentro de una región cromosómica expuesta en las Tablas 2, 3, 4, 5, 6, 7 y 8 (o que tiene una secuencia de nucleótidos que es parte de una de las secuencias expuestas en las Tablas de Secuencias A, B, C, D, E, F y G), en la muestra. Por ejemplo, el sistema informático puede comparar la secuencia de cada molécula de ADN extracelular en circulación en la muestra con cada secuencia de las Tablas de Secuencias A, B, C, D, E, F y G para determinar si hay una coincidencia, es decir, si la secuencia de una molécula de ADN extracelular en circulación se encuentra dentro de una secuencia de las Tablas de Secuencias A, B, C, D, E, F y G, o dentro de una región cromosómica expuesta en las Tablas 2, 3, 4, 5, 6, 7 y 8. El sistema informático también determina automáticamente el número de copias de una molécula de ADN extracelular en circulación particular. Opcionalmente, el sistema informático correlaciona automáticamente el resultado del análisis de secuencia con un diagnóstico con respecto al cáncer de mama. Por ejemplo, si se identifica que una, dos o más moléculas de ADN extracelular en circulación proceden de una región cromosómica de las Tablas 2-7, y preferentemente dos o más moléculas de ADN extracelular en circulación con secuencias que se encuentran dentro de regiones cromosómicas distintas dentro de una única tabla escogida de las tablas 2-7, entonces el sistema informático correlaciona automáticamente este resultado del análisis con un diagnóstico de cáncer de mama. Opcionalmente, el sistema informático comprende adicionalmente un módulo de visualización que presenta los resultados del análisis de secuencia y/o el resultado de la etapa de correlación. El módulo de visualización puede ser, por ejemplo, una pantalla de visualización, tal como un monitor de ordenador, monitor de TV o pantalla táctil, una impresora y altavoces de audio.
La función de análisis basada en ordenador se puede implementar en cualquier lenguaje y/o exploradores adecuados. Por ejemplo, puede implementarse con lenguaje C y, preferentemente, utilizando lenguajes de programación de alto nivel orientados a objetos, tal como Visual Basic, SmallTalk, C++ y similares. La aplicación se puede escribir para adaptarse a entornos tales como el entorno de Microsoft Windows™, incluyendo Windows™ 98, Windows™ 2000, Windows™ NT y similares. Además, la aplicación también puede escribirse para entorno MacIntosh™, SUN™, UNIX o LINUX. Además, las etapas funcionales también se pueden implementar utilizando un lenguaje de programación universal o independiente de una plataforma. Los ejemplos de tales lenguajes de programación multiplataforma incluyen, pero sin limitación, lenguaje de marcado de hipertexto (HTLM), JAVA™, JavaScript™, lenguaje de programación Flash, interfaz común de pasarela/lenguaje de consulta estructurada (CGI/SQL), lenguaje práctico de extracción e informes (PERL), AppleScript™ y otros lenguajes de script del sistema, lenguaje de programación/lenguaje de consulta estructurado (PL/SQL) y similares. se pueden utilizar navegadores con habilitados con Java™- o JavaScript™- tales como HotJava™, Microsoft™ Explorer™ o Netscape™. Cuando se usan páginas de internet de contenido activo, pueden incluir applets de Java™ o controles ActiveX™, u otras tecnologías de contenido activo.
La función de análisis también puede incorporarse en productos de programas informáticos y utilizarse en los sistemas descritos anteriormente, u otros sistemas informáticos o basados en internet. Por consiguiente, otro aspecto de la presente invención se refiere a un producto de programa informático que comprende un medio utilizable por ordenador que tiene códigos de programa legibles por ordenador o instrucciones incorporadas en el, para permitir que un procesador lleve a cabo las funciones de análisis y correlación como se describe anteriormente. Estas instrucciones de programa informático se pueden cargar en un ordenador u otro aparato programable para producir una máquina, de forma que las instrucciones que se ejecutan en el ordenador u otro aparato programable creen medios para implementar las funciones o etapas descritos anteriormente. Estas instrucciones de programa informático también pueden almacenarse en una memoria o medio legible por ordenador que puede dirigir un ordenador u otro aparato programable para que funcione de una manera en particular, de forma que las instrucciones almacenadas en la memoria o el medio legible por ordenador produzcan un artículo de fabricación que incluye medios de instrucción que implementen el análisis. Las instrucciones del programa informático también se pueden cargar en un ordenador u otro aparato programable para provocar que se realice una serie de etapas operativas en el ordenador u otro aparato programable, para producir un proceso implementado en el ordenador de forma que las instrucciones que se ejecutan en el ordenador u otro aparato programable proporcionen etapas para implementar las funciones o etapas descritas anteriormente.
Los siguientes ejemplos se proporcionan solo como ilustración y no como limitación. Los expertos en la materia reconocerán fácilmente una diversidad de parámetros no críticos que podrían cambiarse o modificarse para producir resultados esencialmente similares.
Ejemplos
Ejemplo 1. Identificación de biomarcadores asociados al cáncer de mama en CNA
Secuenciación de CNA:
Después de la extracción de ADN a partir de suero o plasma, utilizando métodos convencionales basados en sílice, se realizó una amplificación del genoma completo por duplicado. Los productos se agruparon y se utilizaron para un análisis adicional.
Experimentos de secuenciación larga:
Se añadieron al producto las secuencias de cebador para secuenciación por 454, utilizando cebadores de fusión en no más de 20 ciclos de PCR. El producto resultante se trató de acuerdo con el manual del secuenciador 454 y se usó para la detección de secuencias directa.
Experimentos de secuenciación de etiquetas cortas de alta densidad:
El producto de amplificación del genoma completo se digirió con endonucleasa NlaIII y se ligó a enlazadores artificiales que albergaban un sitio de reconocimiento de restricción de EcoP15I. Después de la digestión y el religamiento, se reamplificaron las dietiquetas (di-tags) enlazadas resultantes, seguido de la digestión con NlaIII y la concatemerización de las dietiquetas sin enlazadores. Los cebadores de secuenciación con identificadores se ligan en la misma etapa y el producto resultante, que consiste en hasta 20 etiquetas de secuencia de aproximadamente 26 pb de longitud, se sometió a una secuenciación de 454 de acuerdo con el manual del fabricante.
Análisis informático:
Secuencias con lecturas de más de 40 pb
Las lecturas de secuencia se asignaron a la fuente de la muestra leyendo la cadena de la secuencia del identificador y todas las partes que no eran de la fuente se cortaron (por ejemplo, los cebadores).
El origen del ADN en circulación se investigó mediante análisis de alineamiento local utilizando el programa BLAST, utilizando parámetros altamente rigurosos (30). Los elementos repetitivos se detectaron y enmascararon utilizando una instalación local del paquete de programas informáticos Repeat-masker (31), utilizando la repbase (versión 12.09), que se obtuvo del Instituto de Investigación de Información Genética (32). Después de enmascarar los elementos repetitivos y la región de baja complejidad de secuencia cada secuencia se sometió a análisis de BLAST secuenciales, consultando bases de datos de genomas bacterianos, víricos y fúngicos y el genoma humano (genoma de referencia versión 37.1). Los genomas bacterianos, víricos, fúngicos y humanos se obtuvieron del National Center for Biotechnology Information (NCBI, (ftp.ncbi.nih.gov)). Después de cada una de las búsquedas de secuencias en la base de datos, se enmascararon todas las partes de una secuencia consultada que produjo resultados positivos significativos (e <0,0001) y posteriormente se utilizaron las secuencias enmascaradas para consultar la siguiente base de datos. Los nucleótidos enmascarados se contaron y se restaron de los recuentos de nucleótidos totales, dando como resultado las cantidades de nucleótidos no identificados.
Para cada fragmento de consulta y cada búsqueda de base en base de datos, el resultado positivo de BLAST con la puntuación más alta con una longitud de más del 50 % de la secuencia de consulta se registró en una base de datos SQL. El resultado positivo de BLAST con la puntuación más alta se definió como el resultado positivo más largo con el porcentaje de identidad más alto (máximo de la longitud del resultado positivo x identidad). Para cada una de las secuencias, se registraron las posiciones del inicio y del final para la consulta y la base de datos.
- Etiquetas de secuencia cortas:
Después de identificar la fuente de la muestra como se indicó anteriormente, en primer lugar se diseccionaron las secuencias a cada sitio de reconocimiento de restricción de NlaIII para dar dietiquetas. Estas se cortan en etiquetas (tags) utilizando el siguiente algoritmo: Si la longitud de la dietiqueta es par, usar longitud/2 bases desde el lado izquierdo y el derecho de la dietiqueta para generar etiquetas. Si la longitud de la dietiqueta es impar, usar un número entero (longitud/2) de bases desde el lado derecho y el izquierdo de la dietiqueta para generar etiquetas.
El origen genómico de las etiquetas se investigó mediante análisis de alineamiento local utilizando el programa BLAST, utilizando parámetros altamente rigurosos. Se pueden lograr resultados muy comparables utilizando otros programas de alineamiento, tal como BowTie. Para cada fragmento de consulta y cada búsqueda de base en base de datos, el resultado positivo de alineamiento con la puntuación más alta a) sin desapareamiento si la longitud de la etiqueta es <21 o con b) no más de un desapareamiento si la longitud de la etiqueta es de 21 o mayor, y solo existe uno de tales resultados positivos, la posición de la etiqueta en la respectiva base de datos genómica se registra en una base de datos SQL. Esto también sirve como ejemplo para la secuenciación que generó etiquetas de secuencia cortas de por si, tal como SOLiD™ (Applied Biosystems/life Tech.) o Solexa (Illumina Inc.).
- Análisis de asociación con la enfermedad:
Para investigar qué secuencias/etiquetas de secuencia están asociadas a la enfermedad, se utilizaron todos los alineamientos que podrían colocarse de forma inequívoca en la base de datos genómica humana (Homo sapiens versión 37.1 http://www.ncbi.nlm.nih.gov/mapview/). Los mismos se categorizaron en 4060 regiones de intervalos de 750.000 pb y se seleccionaron según las diferencias entre los controles normales y los pacientes con cáncer, basándose en una comparación de grupos utilizando la prueba de la mediana no paramétrica. Para análisis adicionales se usaron quinientas regiones cromosómicas que tenían los valores p más bajos.
En 335 ciclos de muestras seleccionadas al azar por impulso (bootstrap) multivariadas, se realizó el modelado de regresiones multivariable lineal utilizando las regiones de 750 k seleccionadas anteriores como parámetros independientes. Las MVR se calcularon basándose en el criterio de información de Akaike (AIC, forma siglada de Akaike’s Information criterion), utilizando unos 90.000.000 de modelos (2 a 5 parámetros). Los parámetros independientes se clasificaron de acuerdo con las sumas ponderadas de Akaike de la estrategia de impulso (bootstrap) (utilizando un subconjunto de muestras) y se proporcionan los 42 mejores (Tabla 1).
Las treinta y dos regiones de 750 k de clasificación más alta se aplicaron después al grupo total de cáncer (n=178) y controles normales (n=108), y se reajustaron las estimaciones de la pendiente.
Los cálculos del rendimiento mostraron una sensibilidad del 79 % y una especificidad del 97 %.
Selección de conglomerados en las regiones preseleccionadas
Como segunda etapa, se realizó una búsqueda de conglomerados dentro de las 42 regiones (31,5 millones de pb) y se encontraron 58 conglomerados de muestras de cáncer sin que ninguna muestra normal redujera el tamaño a 1,6 millones de pares de bases. Las regiones de conglomerados se restringieron a las 32 que tenían una distancia al siguiente resultado positivo normal de al menos 200 pb. Estas regiones proporcionan una sensibilidad del 89 % con una clasificación negativa verdadera del 100 %, donde una muestra se denomina positiva si se encuentra al menos un resultado positivo en cualquiera de las regiones de conglomerados (véase la Tabla 2).
En un análisis adicional, se buscaron conglomerados adicionales donde solo se encontraron una o menos muestras normales en un conglomerado de al menos 10 muestras de cáncer. Se encontraron un total de 81 conglomerados (24 con una normal, 57 sin normales). Aplicando las mismas reglas que antes (al menos 200 pb del siguiente resultado positivo), el uso de 37 de ellas produjo el 94 % de positivos verdaderos y el 95 % de negativos verdaderos. Se calculó que la longitud total era de 672 kbp (véase la Tabla 3). El uso de las 32 regiones sin falsos positivos, las cuales se indican en negrita en la Tabla 3, proporciona una sensibilidad del 87 % y una especificidad del 100 %.
Selección de conglomerados genómicos
Como tercera etapa, se realizó una búsqueda de conglomerados genómicos en todo el genoma no repetitivo. Una selección de 30 conglomerados abarcando 301 kbp fue suficiente para proporcionar una tasa del 93 % de positivos verdaderos a un nivel de negativos verdaderos del 97 % (véase la Tabla 4). Excluyendo las tres regiones (en negrita en la Tabla 4) que tuvieron un resultado positivo en pacientes normales sin cáncer de mama, la tasa positivos verdaderos fue del 88 % y la especificidad del 100 %.
Una búsqueda adicional fue la mejor combinación de grupos, donde no se toleraron resultados falsos positivos de por si. La T abla 5 muestra los biomarcadores de 27 conglomerados con una longitud total de 260 kbp, donde la sensibilidad fue del 90 % y no se detectaron falsos positivos.
Utilizando una selección de 56 sueros de CaMa y 35 normales, se realizó el estudio de etiquetas de secuencia cortas descrito anteriormente. Haciendo esto, el recuento de resultados positivos por muestra se aumentó en aproximadamente 11 veces y se utilizó la misma estrategia que se muestra anteriormente para las secuencias largas, para definir los conglomerados de puntos calientes. Utilizando 17 regiones, que abarcan aproximadamente 122 kbp, se clasifican correctamente 55|56 pacientes con CaMa, lo que se calcula al 98 %, cuando todos los controles se clasificaron correctamente (véase la Tabla 6). Si una muestra solo se designa como positiva cuando se encuentran al menos dos resultados positivos distintos por muestra y no se detectan resultados positivos en una muestra normal, la sensibilidad es del 94,6 % y el 96,4 % si se consideran 8 regiones cromosómicas adicionales (total de 25). Lo último todavía produce una sensibilidad del 91 % si el delimitador se establece en tres resultados positivos, pero todavía no hay un único resultado positivo de cualquier muestra normal.
Ejemplo 2. Identificación de biomarcadores adicionales que están sobre o subrepresentados en el cáncer de mama
En un análisis adicional para identificar biomarcadores, se secuenció un subconjunto de 143 muestras de CaMa y 96 controles en un sistema SOLiD4+ de Life Technologies, que es un secuenciador de alto rendimiento que normalmente puede ejecutar cientos de millones de secuencias de longitudes de aproximadamente 30 a aproximadamente 75 bases en paralelo. (Otro ejemplo de tal tecnología de secuenciación es el Genome Analyzer (Illumina)). En resumen, las muestras se prepararon como se describe para la secuenciación por 454 (0051 y 0052), donde se utilizan adaptadores específicos para la secuenciación por SOLiD4+. Para cada muestra, se obtuvieron alrededor de 5 millones de lecturas de 40 pb y se alinearon con el genoma humano (versión Hg18), utilizando el paquete de programas informáticos "Bioscope" de Life Technologies con valores de rigurosidad predeterminados. Todas las lecturas alineadas de forma única se utilizaron para un análisis adicional.
Para definir las regiones cromosómicas, donde las muestras de uno u otro grupo se agrupan en comparación con el segundo grupo, todos los resultados positivos se clasificaron en orden de aparición en los cromosomas. Las regiones de biomarcadores se definieron inicialmente como las regiones donde se contaron al menos 15 resultados positivos que abarcaban al menos 10 muestras de un grupo, pero no más de una muestra del otro grupo mostraba un resultado positivo.
En 1500 ciclos de remuestreo aleatorio, las muestras se dividieron en un conjunto de entrenamiento y validación del 50 % de cada grupo. Para cada región, a una muestra de la región se le asignó una puntuación de 1 si el conglomerado era un grupo de cáncer de mama (CaMa) y una puntuación de -1 para una región donde se agrupaban las muestras normales. Se encontró que el ABC promedio para el conjunto de validación era de 0,88 si se utilizaban las sumas de las puntuaciones de las 42 regiones agrupadas. Las regiones utilizadas en los 1500 ciclos de remuestreo se clasificaron de acuerdo con el número de ciclos en los que participó cada región. La Tabla 7 y la Tabla 8 muestran las 56 regiones de clasificación más alta. La Tabla 7 resume los grupos sobrerrepresentados en muestras de suero de cáncer de mama. La Tabla 8 resume los que están subrepresentados en muestras de suero de cáncer de mama. Estas regiones se utilizaron en un cálculo final en todas las muestras; la curva ROC usando las 40 regiones de clasificación más alta se muestra en la Figura 1 (límite de confianza del 95 %). Se calculó que el ABC era 0,94 (intervalo de confianza del 95 %: 0,87 - 0,98). La máxima precisión fue del 87 % (81 % - 93 %) en el índice de Youden máximo, representando una suma de puntuación de 1 como delimitador. La sensibilidad respectiva fue del 87 % (82 % - 92 %), la especificidad fue del 88 % (80 % - 93 %).
Tabla 1
Cromosoma Posición del inicio - posición del final Franja cito. Orden (AIC) Longitud del cromosoma Hs1 181500001-182250000 1q25.3 18 226,9 Mb 23250001-24000000 1p36.11/12 24
145500001-146250000 1q21.1 39
Hs2 194250001-195000000 2q32.3 6 238,4 Mb 219000001-219750000 2q35 10
75000001-75750000 2p12/13.1 11
207750001-208500000 2q33.3 22
Hs4 173250001-174000000 4q34.1 31 188,1 Mb 38250001-39000000 4p14 42
Hs5 9000001-9750000 5p15.2 2 177,7 Mb 169500001-170250000 5q35.1 20
60000001-60750000 5q12.1 30
Hs7 105750001-106500000 7q22.2/3 14 156 Mb 129750001-130500000 7q32.2/3 15
120000001-120750000 7q31 28
75000001-75750000 7q11.23 41
Hs8 8250001-9000000 8q23.1 12 143 Mb 120000001-120750000 8q24.12 17
123000001-123750000 8q24.13 36
Hs9 93750001-94500000 9q22.31 8 121,5 Mb 123000001-123750000 9q33.2 9
65250001-66000000 9q12 29
Hs10 118500001-119250000 10q25.3- 26,11 4 131,7 Mb 15000001-15750000 10p13 21
78750001-79500000 10q22.3 32
Hs12 11250001-12000000 12p13.2 33 130,5 Mb 51750001-52500000 12q 13.13 35
14250001-15000000 12p13.1-3 38
Hs13 48750001-49500000 13q14.2/3 13 95,6 Mb 42000001-42750000 13q 14.11 19
60750001-61500000 13q21.31 25
33000001-33750000 13q 13.2 26
89250001-90000000 13q31.3 34
Hs14 92250001-93000000 14q32.12/13 16 88,3 Mb 45000001-45750000 14q21.3 27
Hs15 30750001-31500000 15q13.3-q14 23 82,2 Mb 58500001-59250000 15q22.2 40
Hs16 15000001-15750000 16p13.11 7 78,9 Mb Cromosoma Posición del inicio - posición del final Franja cito. Orden (AIC) Longitud del cromosoma Hs17 750001-1500000 16p13.3 1 78,2 Mb 53250001-54000000 16q12.2 5
Hs19 3750001-4500000 16p13.3 3 56,1 Mb 1500001-2250000 16p13.3 37
Tabla 2
Cromosoma Posición del Posición del Longitud Longitud singular n.° ind. En n.° de tabla | n.° inicio final de línea Hs15 58794736 58803474 8739 6434 10 3|2 Hs1 78991300 79003202 11903 9921 9
Hs5 9729109 9741877 12769 9030 10 3|9 Hs15 58818514 58831712 13199 8655 11 3|10 Hs1 182086265 182099422 13158 8420 12 3|5 Hs16 15314189 15328734 14546 7256 10 3|12 Hs1 23712399 23728155 15757 7362 10 3|14 Hs13 42263335 42280086 16752 12651 9
Hs2 75150255 75167108 16854 6889 11 3|15 Hs16 15267421 15284486 17066 8241 15 3|17 Hs8 8539816 8557163 17348 10469 9
Hs8 120597158 120615223 18066 15444 12 3|18 Hs1 146117570 146135831 18262 8667 11 3|7 Hs9 123059233 123077867 18635 8192 10 3|8 Hs16 15656188 15675324 19137 5138 9
Hs10 15267696 15287050 19355 10718 9 3|19 Hs1 84837904 84859107 21204 15386 10
Hs13 42174182 42195708 21527 12804 12 3|23 Hs17 53361889 53384434 22546 14868 10 3|25 Hs9 93851754 93876478 24725 12638 13
Hs4 38893729 38919556 25828 14993 11 3|28 Hs16 15476486 15505558 29073 11073 11
Hs4 38284097 38313299 29203 10008 10 3|29 Hs18 69612722 69643198 30477 15782 10
Hs5 60570005 60603234 33230 22961 16 3|31 Hs19 4176228 4211314 35087 12332 10 3|32 Hs15 58890793 58926229 35437 17261 18
Hs13 89372537 89409391 36855 17599 9
Hs2 75480588 75519417 38830 10976 11
Hs7 75298908 75341904 42997 9795 15
Hs8 8304215 8351172 46958 20562 15
Hs1 23301973 23355587 53615 12393 12
n.° de regiones 32
n.° de PT 159
Sens. 89,3 %
0
Tabla 3
Posición del Posición del . .. . .. . En n.° de tabla Cromosoma Longitud singular n. inicio final Longitud ° ind. |n.° de línea Hs5 9333503 9342335 8833 7802 12
Hs15 58794736 58803474 8739 6434
Hs7 130047281 130058242 10962 8485 11
Posición del Posición del En n.° de tabla Cromosoma Longitud Longitud singular n.° ind. inicio final |n.° de línea Hs9 65803788 65815473 11686 8868 16
Hs1 182086265 182099422 13158 8420 12 2|5
Hs7 120732445 120748893 16449 13715 10
Hs1 146117570 146135831 18262 8667 11 2|13
Hs9 123059233 123077867 18635 8192 10 2|14
Hs5 9729109 9741877 12769 9030 10 2|3
Hs15 58818514 58831712 13199 8655 11 2|4
Hs13 61452626 61467317 14692 9991 9
Hs16 15314189 15328734 14546 7256 10 2|6
Hs16 77305582 77320831 15250 10016 9
Hs1 23712399 23728155 15757 7362 10 2|7
Hs2 75150255 75167108 16854 6889 11 2|9
Hs7 130398172 130415016 16845 7155 9
Hs16 15267421 15284486 17066 8241 15 2|10
Hs8 120597158 120615223 18066 15444 12 2|12
Hs10 15267696 15287050 19355 10718 9 2|16
Hs7 57676796 57679502 2707 1881 9
Hs1 181517895 181536854 18960 11299 10
Hs2 75699375 75718717 19343 11404 12
Hs13 42174182 42195708 21527 12804 12 2|18
Hs4 38634946 38656586 21641 9559 8
Hs17 53361889 53384434 22546 14868 10 2|19
Hs1 181589981 181612578 22598 15064 14
Hs7 130423517 130447831 24315 9298 9
Hs4 38893729 38919556 25828 14993 11 2|21
Hs4 38284097 38313299 29203 10008 10 2|23
Hs12 11306311 11335781 29471 19673 16
Hs5 60570005 60603234 33230 22961 16 2|25
Hs19 4176228 4211314 35087 12732 10
Hs1 146145061 146161296 16236 10912 13
Hs2 208420571 208427991 7421 4684 11
Hs5 9139816 9160426 20611 15318 14
Hs5 9416135 9437285 21151 14292 8
Hs8 120185767 120205636 19870 9008 11
n.° de regiones 37 32
n.° de PT 168 155
Sens. 94,4 % 87,1 %
5
Espec. 95,4 % 100 %
Tabla 4
En n.° de tabla Cromosoma Posición del Posición del
Longitud Longitud singular n.° ind. inicio final |n.° de línea Hs1 16927708 16936899 9192 5680 18 5|11
Hs22 23048139 23058946 10808 7394 13
Hs8 134828201 134841476 13276 8948 14
Hs1 16892504 16901698 9195 5770 13 5|10
Hs5 37802351 37812027 9677 5506 11 5|14 Posición del Posición del
Cromosoma Longitud , Longi ..tud . si .ngul .ar n.° ind .. En |n n °d dee ^ tnaebala inicio final
Hs10 106523167 106534967 11801 7935 12
Hs5 38337804 38351003 13200 8379 11
Hs2 121682895 121691799 8905 5988 12 5|8 Hs8 1001452 1011206 9755 8569 11
Hs11 18383367 18393825 10459 5369 13
Hs4 82734215 82744864 10650 6248 12
Hs9 123940664 123951579 10916 8398 11 5|20 Hs4 187267286 187278661 11376 7591 11
Hs8 61370490 61382562 12073 6913 11
Hs2 15861648 15873720 12073 7806 14 5|23 Hs1 156983200 156995819 12620 5427 14
Hs9 66744276 66749359 5084 3539 11 5|1 Hs14 106780499 106786136 5638 3900 12 5|2 Hs15 92982464 92989482 7019 5084 11
Hs15 71935511 71943475 7965 5400 11
Hs2 107547731 107556647 8917 6714 11
Hs2 1944023 1953149 9127 7379 11
Hs15 62540507 62549957 9451 7634 11 5|13 Hs1 98947389 98957038 9650 7345 11 5|15 Hs4 157820520 157831401 10882 7020 11
Hs2 7908924 7919887 10964 7733 11
Hs5 154171630 154183562 11933 7656 11
Hs12 104286879 104296769 9891 5255 11
Hs1 245718410 245727864 9455 6710 12
Hs11 120178388 120187909 9522 4629 13
n.° de regiones 30 27
n.° de PT 166 157
Sens. 93,3 % 88,2 %
n.° PF 3 0
Espec. 97,2 % 100 %
Tabla 5
romosoma Posición del Posición del . .. . , .. . . . . . . En n.° de tabla | n inicio final Longitud Longitud singular n.° ind. . 1 de linea Hs9 66744276 66749359 5084 3539 11 4|17 Hs14 106780499 106786136 5638 3900 12 4|18 Hs2 168582125 168589072 6948 4841 11
Hs1 152507011 152514345 7335 4952 11
Hs2 103409691 103416973 7283 3441 11
Hs4 187581788 187590239 8452 5505 11
Hs5 2303672 2312431 8760 6870 12
Hs2 121682895 121691799 8905 5988 12 4|8 Hs3 8393895 8402964 9070 4971 11
Hs1 16892504 16901698 9195 5770 13 4|4 Hs1 16927708 16936899 9192 5680 18 4|1 Hs4 31023419 31032806 9388 7983 11
Hs15 62540507 62549957 9451 7634 11 4|23 Hs5 37802351 37812027 9677 5506 11 4|5 Hs1 98947389 98957038 9650 7345 11 4|24 Hs18 41809704 41819480 9777 7064 11
Posición del Posición del En n.° de tabla | n.° Cromosoma Longitud Longitud singular n.° ind. inicio final de línea Hs5 6521932 6532055 10124 7081 12
Hs18 6856187 6866416 10230 7131 11
Hs11 35954290 35964889 10600 6168 12
Hs9 123940664 123951579 10916 8398 11 4|12 Hs19 34840006 34851079 11074 7186 12
Hs20 31428179 31440225 12047 7311 11
Hs2 15861648 15873720 12073 7806 14 4|15 Hs3 45816265 45828522 12258 6771 12
Hs14 62543243 62555478 12236 7688 11
Hs10 15859116 15871760 12645 6231 11
Hs1 227055177 227067787 12611 8536 12
n.° de regiones 27
n.° de PT 161
Sens. 90,4 %
n.° PF 0
Tabla 6
Cromosoma Posición Posición del final Longitud Longitud singular n.° de ind.
Hs14 49987035 50000858 13824 4915 20 Hs1 120440890 120451513 10624 3153 19 Hs17 26267494 26272569 5076 2751 19 Hs15 29402690 29413501 10812 9234 18 Hs5 3306716 3307102 387 387 17 Hs8 142337087 142347990 10904 8145 17 Hs11 120997333 121001541 4209 3857 17 Hs18 77058035 77064649 6615 4816 17 Hs1 173508677 173513715 5039 440 16 Hs2 209406837 209420259 13423 5537 16 Hs6 25097428 25103462 6035 2605 16 Hs6 170477988 170480867 2880 2168 16 Hs10 113831560 113840757 9198 5521 16 Hs15 98411055 98420737 9683 5178 16 Hs17 70072445 70073023 579 579 16 Hs20 12067861 12074879 7019 4288 16 Hs22 49837992 49843427 5436 3372 16 Hs7 24079846 24084079 4234 2954 15 Hs20 36255517 36260170 4654 3126 15 Hs5 167308741 167313570 4830 1745 15 Hs1 10675607 10681070 5464 3480 15 Hs22 35869939 35878394 8456 4977 15 Hs21 46129441 46139409 9969 6709 15 Hs19 13633867 13645688 11822 3924 15
Hs13 50589706 50603142 13437 10021 15
17 mejores regiones 121743 66946
Las 25 regiones 184609 103882 n.° mínimo de resultados positivos para contar como Sens. (todas) Sens. (17)
positivo
1 98,2 % 98,2 %
2 96,4 % 94,6 %
Cromosoma Posición Posición del final Longitud Longitud singu|ar
3 91,1 % 85,7 %
4 87,5 % 69,6 %
5 78,6 % 60,7 %
Tabla 7
Orden Cromosoma Posición Región Secuencias de la Tabla F (SEQ ID
NO) Agrupación 1 Hs3 26,3 Mb 26316904 - 26317098 3023 CaMa 2 Hs5 138,5 Mb 138546637 - 138546876 3031 CaMa 3 Hs5 1,9 Mb 1895609 - 1895744 3032 CaMa 4 Hs10 116,9 Mb 116886669 - 116886802 3051 CaMa 5 Hs13 109 Mb 108983160 - 108983305 3059 CaMa 6 Hs4 15,9 Mb 15922534 - 15922765 3027 CaMa 8 Hs1 21,5 Mb 21461150 - 21461398 3019 CaMa 10 Hs3 109 Mb 109045139 - 109045270 3024 CaMa 11 Hs5 131,6 Mb 131556651 - 131556869 3033 CaMa 13 Hs12 123,6 Mb 123553236 - 123553545 3056 CaMa 14 Hs14 71,2 Mb 71183079 - 71183311 3062 CaMa 15 Hs3 54,2 Mb 54201491 - 54201591 3025 CaMa 16 Hs6 70,3 Mb 70348510 - 70348627 3039 CaMa 18 Hs19 49,7 Mb 49663834 - 49663995 3068 CaMa 19 Hs13 55,5 Mb 55470461 - 55470661 3060 CaMa 20 Hs13 40,5 Mb 40513335 - 40513451 3061 CaMa 21 Hs15 95,9 Mb 95868804 - 95869053 3063 CaMa 22 Hs11 24,7 Mb 24655162 - 24655310 3055 CaMa 23 Hs9 113,6 Mb 113608047 - 113608176 3047 CaMa 24 Hs10 120,7 Mb 120651143 - 120651387 3052 CaMa 25 Hs12 7,2 Mb 7205280 - 7205480 3057 CaMa 26 Hs2 37,5 Mb 37500136 - 37500260 3021 CaMa 27 Hs20 39,5 Mb 39516205 - 39516408 3069 CaMa 28 Hs5 108,7 Mb 108742923 - 108743062 3034 CaMa 29 Hs6 54,7 Mb 54736509 - 54736604 3040 CaMa 30 Hs15 81,9 Mb 81851632 - 81851811 3064 CaMa 31 Hs2 13,1 Mb 13103843 - 13104004 3022 CaMa 32 Hs7 75,4 Mb 75395901 - 75396027 3041 CaMa 33 Hs9 77,8 Mb 77788560 - 77788689 3048 CaMa 35 Hs4 186 Mb 186017668 - 186017837 3028 CaMa 36 Hs12 127,5 Mb 127539423 - 127539595 3058 CaMa 37 Hs5 106,9 Mb 106949296 - 106949480 3035 CaMa 38 Hs4 44 Mb 44016974 - 44017119 3029 CaMa 39 Hs16 26,5 Mb 26462674 - 26462938 3067 CaMa 41 Hs3 4,6 Mb 4567129 - 4567322 3026 CaMa 42 Hs15 32,1 Mb 32072109 - 32072293 3065 CaMa 43 Hs9 4,8 Mb 4816508 - 4816623 3049 CaMa 44 Hs7 3,2 Mb 3239026 - 3239202 3042 CaMa 45 Hs5 149,7 Mb 149653105 - 149653309 3036,3037 CaMa 46 Hs10 131,7 Mb 131665070 - 131665215 3053 CaMa 47 Hs7 12,9 Mb 12932213 - 12932446 3043,3044 CaMa 48 Hs15 66,8 Mb 66763691 - 66763868 3066 CaMa 49 Hs8 120,7 Mb 120684337 - 120684476 3045 CaMa 50 Hs5 121,5 Mb 121461146 - 121461216 3038 CaMa F Q ID Orden Cromosoma Posición Región Secuencias de la Tabla (SE Agrupación NO)
52 Hs4 14,2 Mb 14175097 - 14175323 3030 CaMa 53 Hs8 96,9 Mb 96945650 - 96945720 3046 CaMa 54 Hs9 125,2 Mb 125204413 - 125204594 3050 CaMa 55 Hs1 89,2 Mb 89223352 - 89223503 3020 CaMa
Tabla 8
7 Hs10 75,9 Mb 75892398 - 75892532 normal 9 Hs3 10,2 Mb 10167419 - 10167564 normal 12 Hs18 26 Mb 26012771 - 26012898 normal 17 Hs17 60,4 Mb 60439625 - 60439754 normal 34 Hs13 110,4 Mb 110359320 - 110359567 normal 40 Hs16 13,9 Mb 13939660 - 13939769 normal 51 Hs18 71,5 Mb 71535848 - 71535989 normal 56 Hs10 11,3 Mb 11335204 - 11335346 normal
TABLA A
>Hs1 23712399-23728155
TTGGGCAAGGCTAAAAAGCCCATCTGTGGCCTGTGTGGGCCTCAGTTTCCCTGGGTGTGTAATGAGGGAGCTGGGTTT CACATTG C C AAGTTC CCTT C CAACTGT CTATGACT C TTCAG C CGC CAT GAGTCTGATTCTT CTTATTGT CACGTCAGG G CAGACCAGAACACACTGACCAAGC CT CGTAGACACATAGCAGGGATGATT C TAAACTCAGAGGAACATT GT CTAGAA GCCACAGGTGCACCACATTAGTGTACCAAGGCTTGCCCTTCTCTCTGCAGCCTGGAGAGTTTTTAAGG(N) xAAAGCC TAGAGAGTTTTAACGCTTTCTCCCCTCTGGCTCTTCCCTAAGCTTCTCTCTGATTTTCCACCTCTATGCCCCTGCGGC CTCCTGCACCTCCGGCCCGTCCCCT CACTGT CCTTACCTGGTTATAGGTGCAGTTCTTCTTCTTG CATTTG CTGCACT GGAAGAGGT CAGTGGTGGTGC CGCCAGTCTTGG CCAT CTGGTGCT CACGGATGGC CTCCTGGGTCATGG CAT T CCTCA ACT CC CT CAGTTCAT CACTGG CCATTTCCT GGAGAAAAAAGAGTCTACC CTTCAGGGC TGAGGAGTTTCAGGGGC CCT GCCCTACAC CCGGTTTCTAGAAAGC CTGAACAGAAAAGGAGGTAA CATG CTTTATTGACTG CAAGGAGCTGGAAGGAG G CCTGGATT CCGGGTCCTGCCCCAG CTGGGAGAAAGCTGTC CC CG CAGAGT CCTCCTGCCGCC CACGGCTGGAAGGT C ATAATGCACATTCAGGTGAGCAGCAGG CAAACAGC CT C TGC CCGAGGACGAGCACTC CTGTGTACATTTCTTCAGATC TGGGG C T CATC CTGCCTGCCAGGACCTTGGCCTTGGCCTGAACCTTCCTGCTTGAGCTGACTTGATCACTGCCACCTG GTCTATGATGTTC CC CTTAGC CTTC CT CTC TAGACTTGACC CAGTGCTCTTTTACGCTGTGAT CACTGC CCTGCTGTT CTG CTTAGTGGGAAC CTTGGCGCAACTACCCATGACTTG CTGCTGCTACCCCCATC CTGAAGCTC CTGGGCTCACACA ACTAAC TGTTCAT CTGATGAG CATG CAT CCAAGAT CATCTTT TT CTGAGAGTTTCTACAGAAGAGTCTC CAGT CT CC C ACCAGCTCACAGTAGACACTCCGGGTCTGCATCACCCACACCTTTGCTTTCTCTGTCCATCTCCCTGGCCACTTGTCT C CC C TTAAG CCTCT G CCAGAGGACAGCAGAGAGGGAG C T CCAGT TAAGGGTTGGTGGAAGGGGAAGGAGGGT GGCTGA CGCGAAT CT CAAAATGAGGATTTTAAAC(N) xTAAACATTTTAATTGTTGCTGTCTGTCCTGAGTCCTTAAACCATGG AGAAGAATG CTGG CATAAAGGTCAAAG CAT TTTAGTGTGCTCACCTTT CATTATAAC CACTTAGG CTTCTTGATAAAA TGCAGATTCCTA(N)xAGAATATGCATTTTCACTAGTGTCCTCAAGTTATTCTGGTACGTATTAATGTTTAAGTCCTA GTGTGTTAAATTCAAAGGCTAGGCCCTAACCAATGGCAGGAGAATATAGAAAAACAAGTCAATGAAAGCCATTTACCC ATATGAGTAAAAAGTAG C TGACTTTAAT TTGAGGGGATTTT CCAGAG CATAAAGCATT TGC CCATTCAAAATTTAT TG GATGT(N) xTCTATTGGATGCCAATTTACTGAATGCCTTTAACTGTATCAGTTTGCAGGTAAACCAGCTTTGGTTGTG GGGAT(N} xCAGAGGACAAGAATAGCATGTTTGGGGAAAAGTATTTGGGGTAGGTTCAAGTCAAGCATGGGAGATGTG GCTGGAGGT CAGATTG TGGAAGGAAGGT CTTGTGTGC CAGCTAAGAAATAGAGAACAGAAAAATCTATC CAGC CACAC GCCAATTCATTGACTTCTCTGCCCTGCCTCATTAGAATGAGGCAGCATCAGAGACACGGAGAATAAATACAACACTGT AGAAAATAAGGTGGAAAATGAAGGAAATGAGAGTCAGAGAC C AC AATTCTATC CC{N) xTGGACTGTGAATGCCGGTT CTCAGGGGAC TGACAGGATGCTATGTC CACT CAGCAGCGCT CATGGGTATCTCAGAG CCTACT CCATTCT TTGCTATG TCTGAGTAGGGAAGAAAAGAGAAGAAGAAAGAATGAGGGAGTCAGGTAAACAAGGAAAGAGGCATAAAAAGGGAGTCA GGGAAGGGAGTTCTGAGAGAGACAC CGTCCACATGGGT TCCTGATGGGCGCTTCCCCTCACAGCAGG CAGC CCAAGC C CAAGCTAAGATTTTT CTGTTTTTTTTTCTTTTTTTC(N) xCACTGTAAAGCCAAGTTGCAATTTATAACTCCAGGCAT ATGTTGCTAT TTACC C AAAAAGATCAGATATTTTAAAT TAGGCAGAAAGATAA CC CTGGTCTACCTGTG CCAAGTTC C C CATTTCAT CCTAAAGC CAAGGCT CTTTGC TGTTTGT{N} xACACAGACACACACTTCCACATGCCACACATACACAA ACCACACATGC CACAC ATATG CTAATCACACAC CACACACATACTTCACATGACAT(N!xTCCATACATCATACACAC ATATT CCACACAT CATACACG CCACATACACTC(N) xTCCACATATCATACATGCCACACACATATACTATACACCAA ACACACCTCACACAT(N) xTGCCCACTGCTCACCTCCTGTTGTGTGGCCTGGTTCCCAACAGGTACCAGCTGGTGGCC TGAGGGTTAGGGATCCCTAATATATACCATACGCACATGCTCCATACATACACACACATACA(N) xGCCCCCACACAT G CCACACAAC CTAGCG(N) xTCCTGCTAGTCCAGAGCCCAGGGGGGAGGCTGAGATGAGGCTGGTTTGAAAAGCTCCC TGG C T CACTGCTGATGAGT CACGAG CCACTGAC CCTCTG CACTAGGCAGGG CTAGGGGGACCTGGCTTCCTGCCTCAC CATGCAGATGC CCTGCTCT CACCT CTG CCGT CATCT TGG CTATAAGC CCTG C GGAGATGGC CC CACTGAGCACGTTC C GCCGCAGGCCGGGGTTCCTGGGGTCCTTGAGGTTGCTTATGCGGCTGCGCACGCGGTTCCGGTACTTCATGTCCGTGC TCTTGAGCTC TTGGTAGATATGTGACACAGT CAAGGG CCGGCCAG CCATTCATGGAGGGGCACAGAAGGGTGCAGGC C TGG C C CTGACG CCTGAGAG CT CCAGAACCC TTT CTTCCTCTCCTCGGACTT CTACAGGAGTTGGG CCCCTCTTTTGTG ACCTCTGACTCTC CAAGTGGTGATGTCTCTTGGGCATT TGTGGTCAG CCAAGTAG CAGAGAAAGAAAATAG CTT CTGG GGTAAGATCAGATCAT(N)xCTGTGAGCTAAGCCACGTGATGCACTAGTGTGGTGTCTGGGATATATCAAATGCTCAA CATGTATGGGCTGC TAAGATGATGACAAGGATG CGGATAACTT CTAGTTTGGAAGGG CC CCTGGAGGTTGTGAGT CCA GCTGTCAGAGGAAAAGATTAAGAGCCACATGTTTAAGGCTCTACGAA(N} xCCACGCTGGGAATTGCTGGTCAGAG(N ) xCGGCAGGAAGTCATCCCTGGCCACCAACCACAGAGTTTCTGTCCAAACACTGCATCCTAGGCAGGCTTCCGAGCCT CCCACTCCCTCCACCCT CTGGAGGAGCTCGGGGAACACAGC CT CATTTT CTTGGTCTCCT GAG CCAAAGAAGATCAGG
a a g a c a a a g c t g t t g c t g g g a a g a g c c a t a t a c a a t a a a g a g a c t g c g g a g a a t g t g g a a a a a t g a a a a c a g a t g a c t TGCTAGAAGTGGTGGGAGGGGAGGTGGTATTCTTTCTTTTTTCTGTTATTGTTTTAATGTTGTTTGTACAATAAAAGT GTGAAAAGCAAAGAATATG CCACAATG CATGTGAAAG CCAAAAAATGTGTTTTAAAATGTACCACTTGGGAT T CCTGG
g g t g t a t t g a t a t g c g g a g c c a g c a g g g c t g g c g a t a c t t t c a c c a g c c t c t t t c t g a c t c t g t c t c c a t g t c c c a a g G CCACAGAGGGGACGAGGCAGAGAG CAGACCGGGGGCAC CGGCCCCT CAGAGGAACTT TTGGCTG CATTGTT AGCACA
g g g c c a c t c c a a a g a g g a c a g a t c t a t t t t a g g a c a t a g a g c a g t c a a g c c c c c a a a t g t a c c c t c c c c c t g a c c t c c AGCTGGCTAAAAACAAAGAGCTAAAGTGTT TGATG CAGCTTTGATT TCCTCTCC TTTAGAAGAGGGACAT T CA(N) xG GAACTTTCAGTGGATGGAGGAAAGGCTGGGAGTGGAACTTTGTACTGGACCATCTGCTGTTGTTTCAAACTGGGCTTA
a a g g t g g a a a g g a c c t g c t t g g a g g g a t g a c t g t t c a g g a g a a a a t g a a g a t g a a a c t c g t g a t a g t t c t c a t a c c t t GTGTTCTGCCTTCT TGAATTAACAGGAAATGATGAAAATGTACTTT TT GAGGTGCTGAATGGCTC CAGACTTGT TGAG
g t a t t g c t g a t t a a t c a t t a a a t g a g a g t a g a g a g a a a c c a c c a g a a a a t a a g t t g g a g g g a a t c a g g g t a g g c c a c a GGAAGAAGATGGAAACGCAAATTTGTCGT CTGCAGATTGGC CCCTGGCTCT CCTCAACAGACC CTGCTGAACTCCGCA GGTAGAACCCAGGAGCCCTCAAAAGGGTCGCTCACACCTCGGTGTGACACTTAGACTTCCTGTCCCCTGACTGTCTCC C CG CT CCAATACCATGCAAAC CTTTGTCT CTAGTAAAAGAAGCTC CATGTTTTCATTCAGTGGGGGC CAAGTGGGGTG GGATTTTTTTTCTCCCAAGCCAAACACCTTTTGGTGCAACAGATCTATGAATACATCCCAGTGGTGGCCACCTCTCCT GTCTACACAAGGCCCCTTTCAGGTGAGAAGTTGGGGAGCAGGGGGCACATGGAAAGGATTCAATACATGGAATCCAAA GACAAAGTCTCCAGTCCCGCT( N) xGGGCGCTGTTAACATCAGAGGATGGGAGAAGGGAAAAGGCTGAGCATGGGAAA TGCTGACCT CTCAATCTAGTCTGTT CCTGGAAGTAACGACTTGGC CAACTGAGGCACGCTATTACACAG CCTAGC( N) xATAGAATTAAGAACTCTGGTTCAGGGGCAGAGCAGGGGCTGTTCCCGGAGCCCTTCCTCTCCTTCCTCACTCTCCCA GCTCGCTTGCACCTTGGATGGGTGCTGCTAGACGGTGCAGACACCCACAGCCCCGGCACAGTTCAAGGATATGATCTT CGATTTCTGATGCCATCTTGTCACAGTTGACTCCATAGTCCTTGTAATCATCTAAAAGAGATTCGAGAAATACAGGTA T CAGC CAGACACAACGTAGGCAGTGAAGC CTGCTTGTACCTGGGAC TAGAGTGTTCCTGTACCAT CC CTTGGGGTGCT GAGAAAGACTGAATTTCATCTGCCCTCAGTGAGATTGGGAAAAGGGACTCCTTCCCTCCCTCATOCCTAGCCCAGGCT CTCTCACCGTCCGCCTTCAGGGCTGCTGACAGCATCTCCACACACTTGTCCCGGACAGAGTCCCCTGTGAGATAGCAG GGGGC CAGGAGACACATGGAAGAGG CAAACGTGGGGGTGAAGGGG CTGCTAGGTGTTTTGGGG C T CT CCGCTTTTGAT TTGCTGCTGTTTGATCTGAGGATGAC(N)xTTAACCAGCCACTGTGCGTTACTCATTATAGAACAGTGTAACACCACC AATAATTATATGCAATTGATTAGGCACTGTGCTAGCTGCTCTCTATCTATGCTCTCCTAAATGTATGAACTGTTT( N) xAACACCTATGCTATAATCTATGATGTACTATCTAAGAATTATCTCATTCAGGTTTTGTT(N)xCCATATGTGCACAG ACATTTAGAAGGCTAACTATGTGGCTCAAT(N)xAAAAAAATGCATTCTTGGCAGAAATGATGGAAGTGTAGAAGGCC AAGGTTTGCATGGGCAATGTCAAAGGGCTCAATGCAGCTGGAGCGTGAGGGTGAGGGAGCTTGTAAGAAGGAGAGGGA GTTGGAGAGAAAGGATCTAACCAGACTCTGGGGTTTGTTAAGGATGGTAAAACAAAAGCCACCATTTACCACTGCTCT
TTTTTTGTAGA( N > xGCAAGACTTTGTGATAGTTGCACAACTCCCTCCCTCCCAAGATGTTTTTGCACTATTCTTTAG GGTAGGAGC CAGGAAGTCAGGAAGT CT CTGTCTTCATTCCT CTGCTCTTCT CACCTCCTTTTT CAATAACC CATCAGA ATCTGCCAGCCAGCCCCTATATCTCTGTTGAGAAAAAAGTGGACTCATCAGAGTTGAAAAAATTAAGGTCCCCTCCTG GAGTGAAATTAAGACCCACTCGAGGGTCTTACCAAACTGAGCCCTTGTGGAAGCGCCCAGAGAGCTGC(N)xGGGCCT GGATCCACAGCTGCTTCTCATGTTCACTTTCAGGCTGTGGACATGCTACTTCCTTTCAGGAAGAAGAGGATTCTGGAC TTCTGACAAAGTCTCTCAGGCCCAGGGAGTTGTGTCTGAGAGTGAGGCTAACTGAGTAGCCCAGGCTTCTCAGGATCA ACAACCTGGAGTTACCAGAGAATCCCATCCAACATCCTGCTGGGCACAGCTCAGTGTGGACTCTGCTGGTGACCTCCC TGTCCCCCACCTCTTGCTGTTTGGCCTGACCTTGGATCTCCTCTTTCACCCCAGACCCCTGGACCTTTGGGATTTTTG CATTCTCTAGAGACTTGAGGTTCCTCAAAAGTGCCTCCATGCATACCGGTGAGGTGGGCTGGGGATGGTAAACTCTGT TCTTGTAAGGTTTACATTTTGAATTTCTAAAAATTTTGTCCAAAGGGATTCATTTCCTAGCTTTTATTTATTTTTAAT GCAGTTTGAAACTTACTGTCCTAGATTAATCTAGCCTTAATTCTACCCGACCCTGCCTGACAACTGGGTTGCCTGGCA
g c a t g g t g g a g g g a c a g g g a a g c t g a g a t c c a a g c a g a t t t c t c c t g g a g t c a g a t t g c a g c a g g a g g t t t g g g t g a g GAAAGGCTGGACAGGGCGTGAACTCGCGGGGCTGTTGGCATCCAAAACGGATGTTCCTGCCCCCAAGCCTGATCCTGG
t c a c a g g g t t c c c a c c t g a a c a g t c a c a a t a g c c c c a a a a c t g g t t t c c c t g t g t c c t c t t c c
> H s l 233019 73 -233 555 87
CC CCAGATCAGTGTGTGTC C CACTTACTTG TGGGTAGGAGCAT CCTAGCGATTGTGTTACATGGTAGGTTCTGAGCCA GTGAGTCGGGGTGGTATCTGAG(N ) xCTCCTCTGCAACTCACGGATCACACCAGCCTCCACCTCATTATTTCAACGGA g c c t t g a a t t g c t a t c t c c t t c t a c t a a t g c t c a g g a t t g g g g a t g g a t g g a g g c a a g c c t t c t a g a t c t t c t t c t g t AGCCCCCATAGCCAAAAATCTGAGTG( N ) xATTGATCTGGGACCCAGGTCTTTTTCATTGCACTTGGCTGTTTGGTTA ATCTTCTCTGTGGCTCAGAACTGGCAGCCCGCTTCTTGGACACAGACTGCCTGTTGACAATGTACCACATACGGTCTT TAGAGCCC CCTGTG CTATGT C C TC TG GTCT C CACAAT C CCAGGAGG CAG(N)xGTTCGGCTTCCTACAATGACTCGG( N)xTTTTTTGTTTGTTTGTTTGTTTGTTTGTTTTTGAGACAAGGTCTCACTAACACTACCTAACACTAATCAGTCTAT GGGCAAAGACTGTGTCTTATT C A G G T (N ) xTCCTGGGTTCTGCAGTCCTTGCCCCCAGGACACCACCCCCTCCCTGAG AACCACTCCCTTGCTCCTAGCCCAGTTCTGAAGGCAGCCATTTTTCACTGCAGAGACCAGCCCTGCCCCTCCCGAATT TAACTAGACGGGAACCCACAGCGTGTCATACAGCACAGTGGCCCCAGGATGGCAGTAGATGCCCATGTGTAAGATCTC
t g g c c c a g c c t g a g c c t c c t t t a a a g g g t a a t t t a c t t g c a t g t g t c t c t t c c t (k ) x a t t a c c a t g a g t c t c a t t g g CCTCTCCTCCTGGGAGAAACTAATTGTCCTCACCCTCTGCATCAGGACCTTTAAGAAAGAAGAATGTGíN) xGAGGTT GAAAAGATATCAAATCAAAGGCAAATTAAACTTTAAAAAG(N)x CTTTACAAAGGAAAATGTTCCTCCTAAAGGATAT ATGAAGATACTTAGCATTTTCAGCAGTCAGGCTTTGATTGATTCATTGATTCATCCATTCACTCACTTACCCACTGAG
( N ) xTGGTGGTGGCTTGGACCTCAGCAGGGACAGTGGGAAAGTGTCAGATTCTGGATATACAGTTGTCCCTCTGTCTC C A ( N ) xGAGAGATGTGGGCATTAAGTGAAGGCAAATACCACAAAGTCATTAAGTGTGGGCTCAGAAAAGCCTTTACTG GAGAAACGCACAAACAGAGGGT GGAAG CCGAATGAGG CAT TGAGTTGT T CAAC C G CT CAT CATT GTTTGGAGAGAGGT AGAGGGAAC CAGGG CACAGCAG GAGACACC GTGGAG TGTG CT C CAGAAGGGGAAC CC CACCT TGCTAGCTACTCTACT TTTTGTATAG GCAATTGTGAGG TTAGTCTT GTAAGTAAAGAAAAACAG GTCCAAG CTTAGACT CTAGGGGAATTG CTT ACAGTGCTGAGGACACCTGGAGCCCCTGTGGGATGTAGAGGCCAGGTAGAGTCATTTCTGCTTTTGGGAAAGGCGCCC TC TC TGGAAT C CAT CTGCTAT CTGTT TGCTAGGAG C CT GGAGTGAGC CC CCAGGGGCTT C CTATACAGTGCAACTTGG TTTTCCAAGGTTCATCCCCTTATCTACGGGCTCATCAAACATGGGACTCACCAAATATAAGCATGCCCAACAGGAGTA GAAATCAATTGGGCTTATAATGGGATTTCCAGTGATTGTTGGCAATGCTGGAGGAGACACTGCAGAAGGAGGGAGGCC TGGAGTCATAGTACAAACCCACCATGGAGGCCTGAGGCACCTGGACATGGAGACCATGTGAAGTCTGAGGAAAGAGCC CAAGGCTCAGACAGAGGAGTAGCCTGGAGCCATGAGGCATGTTCCCGTGGTAGCTTTGGGCTTCTGGACTTAGCCAAA GGTATT CATC CAGAAACAGCAGAGGATATTGTCCAGTGAAGCTAGC TACT CT CCTGGCTGGC CAAAAAGAGTGCAAAG CT AT GT GG TT AT CC CG CATGAC CTGAGTTCATGATAA CCT CG C CAGTGAGAACCTT GGCTGTCCCTGTGGCAATTCCC TTCAGGATACCTGAGCCCCTGT GGATTGTAGATTCTA CCTATAATAG GAGGAAAAT CTG CATAAAATG GT GAAATCTA TGTACTGAATACCCAGGAATGTCTTGTCAAAAGCACTAGAGGCTGGGACTGGGAACACTGCTGTGCCCACAGGTCTTG GGTTAT CATGAG GG CT CATATG CAGGTGGTATT CTTTCCAAACAGAAAGCAAGGAAGGA C CTATAT CCAGAG TACAAA TCAAGT CCTGGGGGATA CACAG GACTGCT CAATATTCTGC CAGTT CACCTCCACACCTTC CATG CAGGG GATGGGAAC ATGGTTGTGCCATT TGTAGC TGTGGCCTGATTTCTCC CAACTAAAT TC CC C C ( N ) x CCCTAAAGCACAGCAGTTTTTG GCAATGGCCAGCTAGGAGCAATTCCCTTGCCATGGTCAGAGAGGGAGACACCAACAAGGTCTCCCATAGCTGGGTATG GAGGACAG GGGTAGTG GTGGTGATACGTT CAATACTGTAG GTTTGTGGAATGGAGC CT GG CCTAGG CAGCTG CTAGAG GCTGTTGAAG CTTTTTCTGT CTGAACCTGGAAGAGCAGGTGAGAGGTTATTTAC TCTTCGACCCTGATCTGTTTCCTG ATAT CG CTTG CGTATG TT CTTG CTACATCAGCTGAGGCTTGT CATGTC CC CAGAAT CA G C CAAACGAT CTGTACTTTT TCTTAGAGCATTGACCTCACAG GAACCCAGCAGTCACTATTCTG CC CT CAAAGG CAAT CC CTTGAGACTGAGGATGGT GGTTGCAGAG CAGAAGGAAC CTGTTGCACAGGCATAGTGA CCA CAG CAATGTT CTCCATTTG CAGAGGAGATGGATGG TAGGAACACTGCCTTCCTCTAGGTAAAGGCAGTGTCAGAGTTGCCTGCCCAGGGTGTGTGTCCATCAAGTGCCCTTCA AAAAAGGAAAAAAA( N ) xAAAGACCCAAAGTCATGTCCACCCAGTTAGCCTCAACTAGAGCAATGTGAACTTACCAGC CAATG GTCAAAGTT CTTG CT CAAAAAAGAAG GGGCAGTTAGGGACAGAGATT CTGGGGTC CTGG CACATTGAGTAGAC TCTTCCCCATCC CATGTG C CTG CAAAGAATG ATGAAGTAG TGGCTGATATAGCTGCTC CCGTAACCAG CCATAC TAAA GC TT GATT CTACCCGCCACCTG CAAGTGGAAAGGAGAATT C CATGT TGAAAT GGAATT CAGGGC CCAAAG CTGTAAAC AGCATTCTATTGTCAGCTTGTGTCATACCAGGAAACTCTAACACATGACAAAGATAATCATCACAGTCTTAAGAGTTT GGGTGCTATC( N ) xGAGTTTGGATGCTATCTGCTTGTTGGAAAAGAACAACAGCCACTTATATATCTATAGCATGCCC ATGGTAATCAGAATACATTGTAACCAACTATCAATTGTGAATTGGTCAGACAACAAGTTATTATGCCTAGTTCACTTT GAGC CATAGG C CAATG TTGCTG GCAAGTAATACACTCTC CATGAGCA C CAGCAT CTTCTT CATAGGTCAAAT GAAT CC TTATTGGGTCACCTCAGACCTCCATGGAGATTTGGCTATTACAGTAGATCCAAGCACGGATCTAGTGCACCAAAGGGG ATGTGACGCCCCATCATCTTGATCAGCTTTTTCATTCCAGATGCTCTCTTCGAGATGCAGTGCCTTATTGTGGCTTGT GTATATGTCACATGCAGATTGAGTTTGTGGTTGAATTGCTGCATTTGTTTCCGTGGCAGGGATTACCATATAAATAAC TG GGGTCTGTCATGTCCCCGGGGATCAGTCAATGATCTGTAAAGGAAACCTGAAAGAGGAGAATCATGTAGAGGGAGG CTCCCTGGGACC CAAC CAGC CCAAAACTTGC CTAAGGTGAG C CCA CAG GAGGAAGC TG GG GGAGTAA(N ) xGGACCAG AGCTACCAAGTTAGGG GCGACTGAGGACAGTTTACAGGAAGT GGTCTTAGCTGGTTTTTCTTTTTCTTTTTTTTTTTC CTGATAGGA(N)xAACAAAGTAGGATATAGGAGAATCATCATAAAATGAAAGGGGTGAGAAAATGTCTGGACACGTAA GA CACTGATAAG CAAT C A T A ( N ) xCCCAGCAGACTGTGAAACATTCTATTGTAGAACTACACATAAAATGTGGAAACA ATGCTAAG GAAAAA GATT CATTATAGCTGGCTTGAGG GGAGCTT CCTAGAGGTGATGG CTTT GGAAGAAG TGAA GAAT TGTAACAG CA GAAA GGAGAAAATTAGGGATCACGTGAAGAAACAATGTAAG CTTATGTGGATGCATTTGGGG GATGGC AAGTTCTCTGGTTTGGTTAGAATGCATGAGACTTGCATGAGA ( N ) xTG AG CAATGG AC AT GGTC AG AG CC AGGCTTGG CAGGGCTGGCTT CGGCCTCTGG GTACAGGATGAGAGGGGAAA CTAGGGGCTAGG GCAGGAGT GGGGAGACTCAGAGTG AACAGCACAGGG C CAGGGAGGAACCAGGGTAGAAC CAACTGAAATAGAAGAC TCTAGAAAGGGATGGTTCTAGAGC CC CATCAGGCCGTGTCACTACGTGCTACCTGGTAGACCTGGTGTGTACAGATTTCTTTTCCAGAGGCAATCTTTCAGCCT CTGATC CTCTTCCTGGCTCC TAGGCACCTTTCTGTTATGACTTACTATGAACATTCAATAGG CTGTGACTGG GTTTTA CTGAGGTAAAAG GTAACATCGG CAGAG GCCAGCCCGCCTCGG CCTCATTAGCTGGG GGAACT CGAAGTACAGTCAGAC AGGGTG GGGCAAATTGA CTCATGCCTCCTCATTCTGTGTTTTTT CATGG GAAAATT C CGATGTCTG CTTGACCGCCCG TGGT GATT CATTTTATTC CATATGATGAACATCTGATTTT CTAGGACCTT CATCTT CATT TCACTGAACAACAG CCGT TGGAGGAT CTCGTCCTGGAGGG CAAAGCAGGCATGCGTT C TATTGACTT CTTTTGTGTTCACGG CCTTGCTGTT CTTG GAGTCGTTTTCAGAAGTCTTTGGTTTCTCTTCCTCCTTTTAAGCTCCCCAGCACCAGGAAAAGGGGAAAGTTGGTT(N ) xCTCAAATTCCCGAGTCCTCCACTTGCCTTTGGTTCATAGGAATGCATGAGATGGCCCGCTAATCTGCCAGTCACAG TGGACATAAAGC C CTTAG CCTAAATATAT CTATCTCCAGAGC CCGTTTTTC CAACATG CCTCTCTGAG CATTTTGT CA GG CT CAGTACGATCTCAAAC TG CGAAGTCTCAATGGTCTCAT GACATG CTTTTATCTCTATGATCCTCATCTCTCCCT CC CCTGGTTAATAACACT TATTTTTTTTTTCTTTTTGGATTCAG CTA CAAA CTAATGG TC TTTAAAAATGTTTAAAAA T T A (N ) xTTTTTTTATTTTTTTCACATTAAAGTAGTTGTTACTCTTGCCAAAGCATGTTGACATCAAACATGAAATAG GAGAGT CTGGGGTGGT{ N)xTCTTGCCCCATTATTTCGACACCTACCCTGTTCCCTACCCACCTAATCTAGCTGCAAA TATCCCCTATCCCTGAGCTGAAGCCAAGCCCCATCTTGTGCTCACCTGCTGGGGCTCTGGGAGGGTGAGCTTGTGTTC ATACTTGACCAACCAGTTGCTAAGCAACAGCAGAAGGAATAGGAAC CCAAACAAGG CTGG CTTATAGCTCAT CAGC CA AG CTGC CACTGCTC CTATGGTG CCTGCGGAGGCAAGTATC CCATGGAGACTC CTAAAGAGAAATAT CAGTTCTG CCAG CATTATGGCGTGGAGGCACACAGGCCCCAGGACCCTGCCTCAGGCCCCTCCATGACAAGAGTCACGGCCACCCACTCC TCTGAGTCATAGCCAGCTCCCTCCCCCCACCCAGCTGCCGCATAAC( N ) xATTCACCAGTACTTCTGGGCCTCGCTAC TAGCAAAAAAAATGGCCAATACAGAGCCCCTACCATCATAGGGGCAAAGAAAAGTTATCCAAACTTAGACATAACTGC TTTCACAGTTATA CTAG C CCAT CTATTGGTCACCTGATAC TTTAGGGGAAGGTAAC TTTGTTTAATATAATATGTAAT AT TGATTAGGTTTTG GACT C TGTATAGTTAGACAT T A A T T ( N ) xAGGAATTCTTTATGTAAAAATATCAGTTTTAGTG GCTGTGATATTCCAATATAGAATCATGCCTTGCTAGGAC( N ) xATGAGCCACCGCGCCTGGCCAACATTATGTATTAT ATTTTTATGGTGAAAAATAATTTAGCTCCCAAGCAGCAC C TC CC CCAACAAC CTCCCTACTTCC CTAAAT CC TTAAAA
( N ) xATACATTTTCTTTCTTGCATAGCTTTTACTTTTTTTCCTTTCTACATTTGATTGCCTTTTTGTCTGTTTGCTTA G (N ) xTTTTTGACAATTTCACTTCGAAGAGACAGACTATTAAAAAGñCAATTGTTTGTGTAATAAGTACACGCATGCT AAAAGTACACGACCTGTT C CAGTTTTGGGTTAGCAAGGGATT CC TCGAAAAATTAAAAGAATAAAC CG CTAC TATCAT TATTAG AAAG AATT AACCTGGAGTAGAAAGAGGTGTTATGTG GATG G GGATGGGGG A C ATGAAGGATGGGG G AC ATGG GGGATG GGGGA CATGGGGGATG GGAGAAGCG CAAGACATGGAGT G C GAATGAGG TTGGGG { N ) xAATTCCGTCTGGAC TCAAACCTGGATTTCGTAACTTGTCTCCATTTTACTACCATCGAAACTGCTGTGCTCTCCAAGTTTTTTGCCCAGTCC AAGCAGGGCAATTATGAATAAGTAAGACGCTCTACCAACATATTTCAGTGT(N)xAATTGGGTGAGTGAATGAACGAA CGCACGCATGCAAACCCGAAAGTCCCTGGAGGAAATGGTCACCTTCGGAGGTTTAGTCTGGCCCAGAAGCCCTAAGAC CACGGACTGTGCCAGGTCCCACTCCAAACGCCGGGGAGACGCTCTAGGCAAGCTACACGTTCTTTGCTGCGGTGCCAC TCTAGC CG CGAGAACG CCGCTCTATGGCTG CGGGGGAGGGGCGG GG CT CGTGGGTGTCTC CGACCCTTTTTGTC CCGG CGCGGCGGGAGCGCGCTTGGCGCGTGCGTACGCGACGGCGGTTGGCGGCGCGCGGGCAGCGTGAAGCGAGGCGAGGCA AGGCTTTTCGGACCCACGGAGCGACAGAGCGAGCGGCCCCTACGGCCGTCGGCGGCCCGGCGGCCCGAGATGTTATCT GGGAAGAA( N) xAACCGGGACGGAGGCTGGCCCTGGGACAGCAGGCGGCTCCGAGAACGGGTCTGAGGTGGCCGCGCA GCCCGCGGGCCTGTCGGGCCCAGCCGAGGTCGGGCCGGGGGCGGTGGGGGAGCGCACACCCCGCAAGAAAGAGCCTCC GCGGGCCTCGCCCCCCGGGGGCCTGGCGGAACCGCCGGGGTCCGCAGGGCCTCAGGCCGGCCCTACTGTCGTGCCTGG GTCTGCGACCCCCATGGAAACTGGAATAGCAGAGACTCCGGAGGGGCGTCGGACCAGCCGGCGCAAGCGGGCGAAGGT AAGGCTCGACCCTTCCCTCAAACGACACCGCCTGGTGCCGAGCTTCCCCGAGGCTTCTCCGCCCTCCCCCGCCGCCGC CGGGTCTTGGGCCCTGCGACCACTCAGACCTCCCCTTGTATCGCTGCGCACGCCACGTCCCCTCTAGCTGGACGGGTG GGGTGAGAAAGGAAAAGTCTTTGCCGGTGTTGACTGCGGTCTTTGTGCTGGGTCCCGAGCGACGTGGGAAGGGGCATC GATGATTGCTGGGAGGGGTATTTTATTCCTTAGGTCTTTCTTGCACACCTTTTCATTTTCTTGACTGGAAAATGGCTC CCTGACGTGGGGTTACGGTTGGCGTTCTTGGAGACCTCACTTGTTTGTTCACAGATAGGTCTGGAAGCTGTTGGGCGA TATTGATGGCGTTTCAGCTGGGTGCACGAGGGGGCTGTAGGTTGATGCTGAAGAAATCGCCTTAGCCGGGGAGATGAA AAGAAAGGGTCGTTTTTG CTTTTGTTTTTGTTTGAGCAAAACCC TG CT GTGTGGTAGGAGTT CTAGTCTTTT TTTTTT TCTTTTTTAAAATCCTCTCAGCGATCTTTTGGCGCAGAATTTTGCAGATGAGGAAGCTGCTGATTGAGAGAGCAAATA ATAGTCAAAATGCAAATATATGGGAAGAAAAGATTTGACTCCAAACAGTATATACAGACAGTGACTGCCTCAAATGTf N )x GTAATTCCCACATTCATGTTG'TTACAGGGTATCATCAGCCCTTGAAACAAAGGACCACATGGGACACTGTGTGCA GGGCACTGGGGCAGTATTAGGGCGCAGACTTTAGACTTGGGCCTTTTCTGGAGATTTAATGGTCGGTCTAGTTGGAAA AAACAACCTGGATATATATGTGCGTGTAGTCAACATTTAAATAAAAATTTGCAGATTATATAATTTATGCCTGAAAAA ATTGTCTACATAATTTTTAAGT G GGAATA C CAAGTCAGTGTGTATG GGGAGCTTCTTGAGATGAGTATTACC CAGT CT GAGAGCTCTCATATTTATCTGCCTACTTGAGCCAACTCTGTAAAGTATAACCAGTGCTGGATGAAAGACCGTTTTGGA GGCAAGACTTTCTGGAGCTTTTACCAAAGCATGGGTTAGAAAAGCACAGAGGTTGGTGACACCAGCCCTCAGAAATGA AATAAAGGGCAGAAGTCAAGAGTGTTATTTTACGATTTCAATAGACAGGTGGCCAGAAGCCACGTATAAGATGTTT'FT AAAAACAGATTTATT(N) xCACACTATTAGATTTTACTGTTATTTTTTAAGCTATCTCCTACAAGAAAGATTTCTTTT TTTAGCTAACTGATTAA (N) xAAGATTTTTTT CTTTGTAATTCT CCTGTGTTTATTTTAACT CATCTTACTTAC CCTG TTTTTTMTCCGCCñTTGTAATTAGCTCTTAAACTTTCAGGAAAAAAATTAGTTATGTGTTTGTTTGGGGCAGCAñcT ACTAGGAAGCTAGCAACTATCCAGTCTTGGGATCCAGAGGAAACAGACTTTAGAGGATAGAGTATCCATTTAAGGAAT AAAAAG CAAGCTGAGG CC AGTATAGATAGG CC ATGCTTCAGT AGTT CTTCAGTGTATG CAAGGAAAAAAGTG CATGTA TTTTATGACTTG( N) xGGCATGTGTATGCTAAAATTATGGCAAAATATTTAAGGTTTTAAATTGGTAAATTGGGTTGG AACTTCACGAACACCATACTCCTTTAGGAATGGTGGCCTTGAGCTAGGCCATGAAATGGCAGAATTTAGAAAAGTGTT CTGGAAAAGGGTGGACTTTTTGGAGTTTAAGTATTTCATCTTCCTCTGATTCATTTTCCTCAGGTAAATAAGCCCTAC CTTGTAAAGTTTAGATATAATATAAACAAAATGCTTATTGCAGTGTTTGGCGTATAATTAGTAGGTGACCAGTAATTA AGATACTACCAGTGGAGAAAAGGAAGTGGTTTGAGAAGCATCAGGGAATCCAGAGTGGACAACTGAATGTTGAGAACA ATGGACAAATGAATTTAATTAGAAACAGGAATCATCTAGGAACAAGATAGATTGAAGGGACCAGTAGCGTTGGGGTTG TAGAGTTGTGTCCAGGTTGTAGTGAGCATTAGATTCTT ( N) XCTTGAACTAGATTATGAAGTGCAGACCATAAGTGCT GTGAGACTTCAGAGAAGATGAAAGA(N)xAGGTGTATTTTTTAAATTTAGAATAATTAAGAAATAATCTTCAGAGGAG GTGGACCTTGAATATATGGCATTGGATTAACAGAAAAAGATGGAAGTAGCTTTAGGGAACTTAGAGGTTGGTAGAGCA GCATTGAGGACTTTTTTACTCGTGTGTTAGTGATTAGAATTAAGTTTGAATTCA{ N) XGGCAGTTTGAATAGTCAAGG C CAGAGAG TGTATATG CTGGTTTGAAAAATTTGACGTTCATAT CTGTTAAGCTATTAGAAAATATCACATAATG GGAA AATTAATCTGGCAGCTTTGCCGTAGATAAAAGGGAGAAAGTTTTTTTTTAAT { N) xCAAGAAATATTTCTTTATAAAG ACTGATGCTTTAAGGACATTGAGGAGTGAATGAATCATTGTAATAAGGGTGGGGAGGAGAAGGAAGAGACTAAAGATT TAAATTTCAGAGTCATCAATGTGAAAGTAAGAATAATTACCCTACTCACTCATAGTCTGCCTGTTGGGACATCTTTAT GTCCTTGGTTGTAAGTAGTTGTAGAATCTCTGAAGTCTTTGGGGTTTTTACTTAGGGATGAGAGAGTCCCAATCCTTT AGGTTCAAAGTACACATTCCTTTTCTCAGGGGAATAGTTTTTATTGAAGTTTGTATGATTCTTTCAGATGGTTTTAAA ACTTTATTAAATGGTCACTTCTTGGCTCTGGGC(M)xGCAAAAATCCCAAAATAAAACACACTACTATTAAGTAGTAT TAAGAAGTTAAAATTTATAACATTTATTTTAGTTGTCAGTCTATGAGAAAATGAGGTGAAGCCAAATTTGAGATTGGA AATTAAGCATGATAGTCTTCTTTTAGCCTTAGAATCCATTTTTTTCCCCCAGCGACTAGACCACCAGGGAAGAATCCA ATTTTGTTTCTGTATTTTGCTTTGTTTATGñAGTATTGTGGCAGTCTGGTACTGATTTGACAAATTGTGCAAAAGGTA GAAGGTTGGTTTTGTTTGC(N)xTTTTTTTTTTTTTTTTAAGCTTGCTTTGTCATCTCAGGATGCTCTCAGTTGAACA GACATGGCAGAAGAAACTTCATTCTGAAGTCTACCCCAAATTGAGTTACCCAAGCAGCACAAGTTAATTGTGTTGGTT GTCTCTAAAGGATTAAGATTTGTTTTAACACATGAATGTGTAATACATTTAAAAAATTAAAATTCTAAATCTTGAAGA ATCCTAGGGTTCTGTGAAACAGTTTTGTTTTTAATAATACCATCAGATATTATTGCTTCTACCTGTAGGTATTTTTCT CATCTCTTTTGTTCCTAAAGACTAAGAACCCTTTCTAGACCACCCTTTTCCAAGTGTATTCCTTGTTAATTATTTCTT TGTCAAGGCATTGGATTAAACAATGTTGAAAAGATTGTTTTTGATGCAAAACTTAGCAGTACC(N)xAACTACAATAG AAAAGATTTTGTATTAGTCTTTGTTTTTGTCTCAAAATTTGCACATTGTCCAGTTCCATATTAGGTACTTAAGTCATT ATTGAATGAATAGGATTGATGGGGCAGAGGATCAAAGGACTGAATCTTTGGGGAAGAGTGAAGCACTATTGAGGGGTA
a a a g g a g a a a g a a a a c t g c a t a c t a t a a a a g t t t a t g g t c t a t t c t g a c t t c t c t c c c c c t t t g a a a c t t t t t t t t t t c t c c t t g c t g c a t t t t c a t c c a t t a t c c t g a g a c c t a t t t c c a c a g t t t a t c t c a g t c a g t t a g g a t a a a g t t t a g t a t a a t a t c a t c t a c a t t t c t g g a g t c t t a t t g c a a a g a c c c t g t a t g a g a t g g t c a c c t g a t c a g g a t c a t t t a a t t g c a a c t a g a t t t t c a g c t t t t t a g g g g c a g g g a c a c a g t c a t a a t t g t t t t c t c c a t t g t a c c a c g t a c t c a g t g t t c a g t a t t t g a g t a g g t g a a t a c a a t t a t t c a t g t a t t a a a t a a g a t t c t t a g a g c a t g g g t c a g a a c t a g t c a g t t c a t t c TTCTTAATTTGTGTTTCCATTTTCTAACCCAATCCCTGTTTTCAGTAACACTCTTGTACTTCCAGCAACCAG
> H sl 789 9130 0 -79003 202
CñTGTG ATTGAACTC AGTCTCTAGAAGGT CAGG CTG ATATC ATGTGGT T CAAAGC CCTGñC CCTCTAATCACA (N) xG T CAAACT C TTTATTACACAGTAAGACTTAAAAGTAGTAG CAGAATATT TTGTTTGTTTTAAAATCTTAGAAACTGT T C CAATTGG AAGT AGT CTT CTGG AATT AG CAT ATT CTTATATT ACTAATTCTT C A CATAATGATCGT AC C AAGGAAACAC AGTTGATTTTGGTTGATTGCTGAGACTGGTTTGTACTAGACTTACAGTATGCCTTTAAAAACAAAAACAAACAAGTGA AAAAAAGGT CCTGCC CAACATTG CAAGAAT TATACACAATTTATAT TATTCACT GGGACTAAAAAATTAT TGCTAGAT CTTTATTTTAAAGTGGTCTCCTATTCAGCAATAGAAAATTGAAGATAATTAATGCTTTCTTTATGAGAGGCTGTGGAG TATGGGATTGGTGTGGCAT C AAATT CAACAC ACTG AG AG CTGAGATTTG ATGC ATTT CTAGAGATAGTGT T AAATGAG ATCACTGGATO CTGCTAAACATGTT C TAGTGCT CTTG CCTTT TGCTGTATGTTTGGTGT CATT CATCAGCAGGGAATG G CTGAAATGTG AGGTGT AAATTG AC AG AGTAGAGAGAAC CTATACAGTTT T AACAC CATAGCCACAGACATCCAGGCA T CCTAAC ATTTTGTC CTAAAT CTTC CTG AACCTGG ACGG CAGG ATGAG AAC ACTAGTGATTTTATTGG AATGAAG CC A AGACAAATAAC CTT TTTTTGTGCTCTTTATAATATAG C T CT CAAGAATGAACAAATAAAAAGAATGGGGTCAC CATAT ATATCATTGTTTAGCAAGTGACAAGTGACACAGTGAG CAATCCTTAGTCAGT GGCATGACAAGGAAATAG T CAAGACA G ACACTG AC CTGTATTGT G AATG AGGTGC C TTGGTGAAG CAAAGGTTTGGG CC C AAT AGTG AGTCAAGG AG CC ATTT C AAAGATGGG CTACAGT CACTCATTAAAAGGGATAGAACACAACTGTG C TGATCGTAGAATTAAC TGTGACTTGTC CAT AAAAT AG AGGG AGGATG C AAT CTGTTTTAT AGAAGGC AT AT CTT G CAGGGC CTTGTAGCTTTGT T AAG GAT AG ATGAT AGT CAGCAGGCTGAAATTTTCTCTCTAGTGGAC CA CTTTGACACATT CT C AAGGTTTTATTTGTATA CATTAC CACAA GACATGGACTT C AGTTAC AATGG CAGC AAAT AT AATAC ATG CT AGT AACTATG AATTTTTAGGTAAG ATGAAATG CTT CT T AAAAGACTGTTGGT CAGAAATCTGGAGAT TGGTACTT TTT CT CAGAAAGAAAAT CAGCTC AACCTAATGG CTGT A TACTG CATTGTAACAT TTATTTTTATTTAAAAGTTGGATTTTTATGAT TTAAAAAAGTAATGTGAGT CTTCATTTAAA AGTTT AAATGG CAT AAAGAC AGC AAGTTTT C AT AT AGTAGT AAAATTGT ATGAATTTTAGAC AATGAT TGATGGC CT C TCTTGACTGTATCTTAAACAAAAGGACTTACTGCAATTTTTTCATTATCAGTGTTATGCAAATATACCAGTAAACGAC TGCTTTCTGAGATCTTAGGGAAGTAATGATTAATGAAGTATAAAAATGTAATTGTAATTATATCACCTAAATTATGCC AAAGAGATACATTGGTGACATACTTATGCTAAAAAGAGAGAGAGAAAAAAAGACT CCCTCTCTCCC CAAAT CTTA CAA CACATT AAATCTGTCT T CTGTTG CATGT TGAATATAA CAGTAT CAGGAACTAGATGAAAGTT GGTTAT TCATTTATGA TGATG ATGAGG CTGCT G AAGC AGTTGGT ATAGTT TT GT G CATT AC AG AG AATGT TGT ATGC CAGTTAAC AAAT AG ACA G CCACTATGTG CCCAG CACAGAGAGAAATGAGT CAT GGTTTGTGACCTC CAT GGATC CTAACAT TTCTTCTC(N)xTC ATGGCATAAGCTTCCGAAACAGTTGAAGGATGAGTAATACACAGAAAATTGTCAGGAGAAAATTATGTGGGGTATAAC CATTG C ACT AG ATTGT TTTGT AG AGGAGACAGATTGG C ACTGAACTTTGTGGCTAAT ATTTTAGACC ATGAGATAGG A AATGT ATTCAAAAGG C TGAAAATTACATGGGAGAG AGT AAAT ATG AT CAGAGATTTG AC CCAT AGGCTGTT AC CTGG A G AATG CTTGGGTATACATTGACTTATAACTAAG AAAG ATGAATTT CAATTTTCTG CTTAGGTATG CTGAAG AAAAATT AAAAGTGTAACTCCAGAGAAGTAAGTTCAGAAAGTTAGC (N) xGC AC AACACT C AAATTGT CTTATC CTTTTG AC CAA AC AAC AAGATTGACT C AAGT G CC AC CAC T AAACTG AAC T AATTT T AGTG AAATTG ACTGGATCAT C AGG AAGGGCTG A TTTTTTGTC CCGACCAGAAGTATATATGGATTTTATCTTTAACTAGAC T CATATTATGCTTTAG GGTTAAATAGACAT TT ATT ATT AATTCTGGCTC AACT CTTTTCTGTGTGTTCCACGTGCCTT AATTTGTTGGT CTTCT AGT ATCTTTGC ATT TGGTCTGTAGGT TTAGATTGG CTTTTT C AAAGACAAATTTT CT CTAAAATGTATGGATATCTTT TATT TTTGAAAGGA CACTCTTCT CAT TTAAAACTC CCAT CAAATAAAATAGATAGATGC CGAGGCACAGGGAACCAACACATACATT CCAGT AAAGAGATAGGTCC CAAAA CAGTGAAC CAACAAATGCATTTTAATACATAGTTAGATGT CCAGGCACAGTAAG CAACA CATACAAATTT CAATAAATAAGTAGATGC CGAGGCACAGAAGACCAACACATACATTTCAACAAGTAGATGGATG CTG AGACCGTGAACTAAT AC AC AC AGTTTC AAT AAG CAG ATAGAGG C TGAGG CAC AGCGAAC CAATGC AC ACATTT CAGT A ATGGATAGATG CCCAGG CAC AAAAGAC CTACACACATTT CAATAAGTAGATAGATGAGAAAAAAATT C TCAT TAAATA ATTAATTATTGTACCACAAC TTTGAGAATGCTTTGAAAAAT CTTT CCAAACTA CTGTAG CAATAAACTAAATGGG CCA GT TGGAAATTT CTTG C TGGAAAAAACATAATGTGTTATAGGAGAGATGGTTTAC TGAAACG C CTAACTGGTATGGTCT GTTTTTG CTATTAAAACGACGTTGC CAGTGGTCAAATAAAGTAGC CTT TTATT CCTGAATCATTGTACCTC CAATATT ACAACTAATTAC TGGACTAGTTTTTTCT TTAATTCATAGATGGGAT G CTAAT AATTAC TTCAGTCATGATATTT TAGA T AAAC ATTTTTT AT T AGGAT G AAAATGTCTT ATGC AT CT ATTATT AC CAAT ATT TGG AG CATAACTAATATTTTCTT A GTGTAACTATGTGTTT G ATGT AGGT AATC AT AT CTGG AATG AG CCAG AGGTT TGAAGTAAAATG C CT CAAATT CCTTT TGTGC CAAAGTAAGG CATATATACAAATAT AAGATT TTATAGATGTGT TTTCTTCTTC TAAGACTTTTAATTTGC CTA CT GATTTTTTAATTT CCAC CACTGC CAAGGAATAGAAATAATAAAATAAGGATAACT C TTGGGTGAATACAAAAGTTT GGATTAAAAAAGTAAAAACAAAGTCCAAAAGGACATCTGAATATTAGTCTAAAG(N)xGGTCTCTGGTGCCAAAAAGG TTGGGGACTGCTGGT CTAAAGAATAAGAGAAAAAGAAC TACT GAAATGAAAAT CATG CCATTCAT CATGGGACAGATA AGAAGAAGGAAGATAAAAGGTACTGAG CATTGTTTTATT CCATCTAT CACTATTAGTGAAT CTATTAGTGATACTTTT TTACAATTACCT TT T CATT CTGGGACAGATAGCTGTC C TCTGCACATTCATT GTTTTGACAGC CATC CTTCTGAAGGA CAC CTTGGTTATGTGGAGTTT AACT CATTTG ATTTTC C T ATTGGTTATATCTGTTGTTATGG G CACAGAAG ATTTGG A G C AAAACATTTATT AGTAC CAGTTC CTACT TTTTTAAAAAGATGCT AGTAAAAATGTAATTT TAGAAATATTGGATGA G AGTGTG CT AG AAT CAG CTT G AAAG ACAACT AGACAT AGGC AG ATGG AACT AG AATC ATAGAG AATT AAAAATGAGCT CTCCTGGTG CCATGGAGGCTAACACACAAATCCTTAGTCAT CATCT C CAACTA(N) xATAACATGACTTTTTCCTTCC CTTTC CT CCTT CCTTGAAGfiCACATTGGGGTAAAGAAATTG CATGAT CCCTCCTGTG CCTAAG CCACCCCAGTGGACC TGGTCTTCTTGCACCATCCCTGTGGCTGGAGGTTTGAGATACTGACAGGTATATTTGATGGCCAGAAGTAGGTGGGTC AAAGCTTCTTTAAAGTTGAATCGAACTAATTGATTAGTCAGTGAGTTTGATTTAACAAGAGTGAACTAACAACCTATT GTG CAGAGGATGCAGGGATGG CAGAGCTT CTTTAAACAAGAGCAAGT CCTCTAAGACAGAGACGT CTCTGTTGTAGCA CCTTGAATCCCACCACCTCCATCCCAGTTCCCAAACCCTTCTGGGATCTGGCCCAGGTAGGAGTCTCTACATCCCTAC TTCCTTCTCTTCACCTGCCCCAGTGATCTTCAATTGTGGCCCTTCCTTAAGAACAGCGAAGGCTTATGGGGAGATGTA AGGTCTATCACCATAACAATTAAAAAAGTTAAAACTCACTTTAACTTTACTAGGATTTCTTATCTTGAGTAAATAGAA ACTTTTGGGCAGACCAGAAGTTAAAGGTCATAGAGAGTCTGAGGAATTCCTTGAAACAAAAGTTCATATAAGTTTTCT TTCC(N)xTTAAATCCAACTTGACTCTTTGGACCTGTGGTTGCCTGGCCTGCTTTCTGTTAACCCTCCTAACTCCTGT GCTTGTAATTG CCTGTTTCTT CTTGTCTT CC CCAACT CAAGTGTAGGAGTCTCTT CCTAATGATGTT C TACAAAGTAG AGGAGTAGGATTTGAGAAAGTTAAAGACATAGTTCCTACCCTCAAGACACAAGGGACTATTATACATATACTGACATT ATCTCATTTCAATATGTTCTACATATTATACTGATATTATGTTTATCAATATTAAAGGAGTGGTGTCACACATAATGG CTGTATAGAATGAGACATTACTCAGCTACAGAGAGATCAAGAAAGGTTTTCTGCAGAAGATGGGACAAGGGTAGGAGG ATTTAGATATGTAATTTTTAGTGAGGGAAGGGCAGGGAACATGGCATGTTTTTGGCATTTGTGAAflTATTTTGTAACT AGAGAAAACAAAAT G CAATAAGAAAAACAAATAACTAGAGAC TGGAAAACCACTTAGGAGTCTATAATAT GATGTTTG CTTTTAAAGCGATAAGACACTCAACGAGAAATGACAGAAAAACAAGGTGTGGATGGAGAGGCAACATGAAAGTGGATC AAACAACTTATACATGGGTGCTGGCTCAGACGTGACACCTGAGGCTCCAGAACTGGAAGTTTATGCCGTCAAGTAAAT TCATT(N}xGTGTGTATATATACATACACACATATACC(N)xGTAAATTCATTAATATCGAGTCACAGATATGTCACT GTAACTCCGTGTATATATGTATATTTATATATCCATATTTTCTAAGTATTTGAATAATTTGAAAAATAAGCAGGGAAA AGTGAAGGCTGTCTTACTCTACATATTAACTGAGTAAAAATATTAAGTAAGAGGTTATTTCTTGTAATGTACTATGAA CTCGGATCAATCCACAAGGAAATATAT{ N} xTTTTCTTAAAGAAAGATTGAGAAAACTAGAGTATAAGAGGTAGTGGT TTGCAAACCATTTTGTATGAAAAGCAGTTGAAGAACTATAAGACTAGTATTATTTTAAGATATGTGAAAGGCTGACAT TTAGTAAATGGATTGATTATC < N) xTTCTCATGAGAGGTGATGTAGACAGTCCTCTGTGTGCATAATAAATGGAGCCC ATTGTTTCCTTCTGTTCTCAGAATCCCTTCAGTTACATTTCTGTTTTACTCACATTAACTTGGGTCTTGGCGGTAGTG CCATCTAGCACGATGCCTACATACAGCAGACACTCAAAACTACTTGCTGCTGGATTGATTTATGTTTATGAATATGCA GAGGAATGGAAGCATAATTATCAAATTTACCATTGCTTTGAATAGTAAACAGCCACAGAGCCCTGACTCTGGCCAACA GATGTACAG ACATGGGGGñGAGATGATGCGATAAC AT CC AG AATAAAAC CTTAGC AAACATGGGCTCTATT AT CTATG AGTAAAGTAGC CTGG CAGATT CATTAACTTTGATGATGTTAACAGTTGATAGAATTGT TTTACGTGG CCCAAATGAAT GGAGATTCATTGTATTACCAGCCTATTTTATTCTTTGCAAGCTGGCAATGAACATATACTATCCTTCCTCCATGCTTC TTTGACCTCATTCTTTCCCTTAT CTTAGTGT C CAGTGTTTTACAGTTAT CAGGTGAAT TAATTTTTT CATAGAATATT CAAATACT TTGTC CTGTCTACTT CTCAGAATTCTAAT CATCAACAAGGATATACAACTGTAATGTGC CAATGTTCACA GGAAGAGATTTTAGAAAATAACCCATATTCCTTTAGCTCCATGTACTGGGTAGCTCTTGGAAACTGTATGAAAGACTG TGTGTATGATGAGACTCTACCTAAGTGTCTGGAATACAAGTGCGGGTGGGCTAGACAGAGAATAATGTTCTCAACGCT TGTCAGGAGACTGTTGCCAGAGACCAGAGAAAATGGCTTCCGACTGCTTTTGTCGAACCAGCTTGCAGACATCCTTGT TCTGACCCCAGCTCCCACATGTCTCTTGCCAGCCTTATCAGGGCTTCAGTTTTCCAGGAGATTCTGTGGTAACTTCAG TGCTGCCCTACTCACATCCATAGAGATGTACACAAATAATCCACATTAGTTTTGATTCTTTAGTAAAATAGTATATCT CCACAAATTTCTTAGGGGAAAAAACATAAT(N)xAAAGTCCGCATATATCTCAATTATGTGAAAGTGTTATAATTTAT CTTCTGTTGGATACTTAGGTGGTCCTGAATTATCAATATTATGAATGTATTTGTTATTACTAATATACTAATGATAAA GGATATTTCTTTTACAGAAATACTCTCCCCACATTTATTGGTGGCTGAATGATTAGTGCTTCTCCCAAGTCAGGCTGT GGAATAACCTCTTTCCAAGGCTGGGGTTAGGGGTTAAAGGTGGAGAGAGAGTAATGATGAGTGCTTGGGCAGTGCCCT AGTGTTCTGGGGAGAAGCCTGGGCACTGGGCTCCCTCAGAAGAGGACTCTATGCCTTCCCACATCTGGTGACCACACT GCTGTTTGTAGAGGAGAGAGGAGAATCAGAGGAGTGTGCACCATGTAACTCAAAACAAAAGGAGTTACACAAACAAAA GCTCAGGAGGGCATGGTGCATCTGGGAAAGATGAGGTCTTCACAGTGGTTGTATCATACGGAATTTGCAGGGGAGATG AGAGAGAT GAGATT ATAGAAATAGACTTCAGATGCAT CATCAAGGAG CTAGG TGAGGAGTTT GTACTTTATTGTGTAG ACAATGAAGGTTTTGATAAAGAACAT TTATT CTTCAACCATT GGTAT CATCAGGT CAAAT TAAGATATACATTTCT TT CTAGTAATGGGTCT TGATTTT CTTTGTATTTGTGTTTTGGT CTTTGTTACGTCAT CTT TTAATGGTGGCACAATGGTC CAAGCTTCACTTTTCAGTTTTAC CTT TTGTTTACCTATTT CT TATTTGTTTGTATAT CCAAGATGTTTTCAAAGC CAC CTTATTTGTTCCACAACTGTTTGTTTGGAAGTCAAAGCTGGGATCAGTGATTTGCTTAGTTCTAACATGTAGCTGGGA CAATTTATAGAACTGATTCCATGAAATGATCTCTTCAAAATCATGGGTGGTCAGGAAATGGACACATGAAATCACAAA ATACATTGTTACAGTTAAGTCACAACTGTGGTGTAGAGAGAAACTCCTTTCCTCAAAGTCCTTAGAAAATTCCATTGC ATACTT TT TGGAATGTTTTAG CCAAGTTAAGGCTAAGGCTAAGAGATGTAGAAAGGGAAATAGAAAATTGAAGACAGT GGTATCCTTGTAGTTAAGAAGCAGAAAATCTCAGAGTAAAATTTGTCATATACCATTTGCAAATTAATATCAGATGCT CTAAATTCATTTTGTAAAACTATTTGTTAGCTTATC (N) xCTTCATTGAAATAAAACAAATGTGTGTTGGCTTTGCAT TTACTATTCTCAGAAGTATGAGAAGGTCAGTTACTGGTATCAGTGTTTGTTCTCCCTTAGGTGATTCTGTTTATGCAT ATTTGACTTACTATAGAAGTTAT CAT TTTTGTAGGTCTAATGTCAATTTGAATTAAAATGAT T CATTTGTCTC CATCA GTGTTTCTGCTATCATTGATTTCTTTCTGTCAGTATTTTAAACAATAGCATCACTCTGTGGAAAGTGAAATGAATGTC ATCTAGCATAG CTAATTTTTTAAAAATTATAGGATTGAAAAG GCTGCAT CAACTAATT TCCGTGTTTTCTT CGTTTCT AGGTTACAATGGCCAACATTGGAATAAATGGAAATCATTCTCTGGAAACCTGTGAAACAACACTTTTTGCTCTCCGAA TGGCAACATGGAATCAAATCTTAGATCCTTGGGTATATATTCTTCTACGAAAGGCTGTCCTTAAGAATCTCTATAAGC TTG CCAGTCAATG CTGTGGAGTG CATGTCAT CAGCTTACATATTTGGGAGCTTAGTT CCATTAAAAATTCCTTAAAGG TTGCTGCTATTTCTGAGTCACCAGTTGCAGAGAAATCAGCAAGCACCTAGCTTAATAGGACAGTAAATCTGTGTGGGG CTAGAACAAAATTAAGACATGTTTGGCAATATTTCAGTTAGTTAAATACCTGTAGCCTAACTGGAAAATTCAGGCTTC ATCATGTAGTTTGAAGATACTATTGTCAGATTCAGGTTTTGAAATTTGTCAAATAAACAGGATAACTGTACATTTTTC ACTTGT TTTTG CCAATGGGAGGTAGACACAATAAAATAAT G CCATGGGAGTCACA CTGAAAGCAAT TTTGAGCTTATC TGTCTTATTTATGCTTTGAGTGAATCATCTGTTGAGGTCTAATGCCTTTACTTGGCCTATTTGCCAG AG AA C ATCTT A ATGCAGCCTGCATAGTGAAATGGTTATTTTGAGATCACCGCTCTGTAGCTAACCCTTATAAACTAGGCTCAGTAAAAT AAAGCACTCTTATTTTTTGATCTGGCCTATTTTGCCCCTCATTGTGTAGCCTCAATTAACACATGCATGGTCATGACA CCCAGAATTCATGATGGTTTGTTATAACAACCTCTGCATATTCCAGGTCTGGCAGACAGGTTGCCTGACCCTGCAATC CTATCTAGAATGGGCCCATTCTTGTCATATTTGACAAATAGGACTGCCTACATTTATTATTATGAAGGTCGATTGTTG TTGGAAGTGTTTTTTCATGTCATAGATTAGCAATTTTCAAATAATTATTTTTTCTCTGAAAATTTTGTGTGTGATTGC ACAATAAATAATTTTTAGAGAAACAAAGGCTCTTTCTCAGCACATTGATGGGCAACTAGAATTACAGCAGTTTCAAAA CTCTACCATGGATAATGCAAAC
>Hsl_848379G4-84659107
TTGGGGT CCTGTTTCAAGTAGAAAGTCAAGAGTGG CTGAGGTCATAAACAGTGTCT TGTTG CTGATACCTAGCACAGA GGATC CTGGTCATTAAAACATGACACATTCTGTACAAAAACTACATCACCG CC CGTGTTTT CTGT CATO CTATTTCCT CTGAAGTCCAAGTGAAGAGGATGGTGTGCCTTTGCTTCCATCTGGATTCCAAGACTCCCTGCCCTGTCTTGCCCTCGA AAATCTGAAGGTCT AAGAATAAC CTCCATTC CGTATTGAAGGC CACAGAAATGAAT ACATT CCTCTCAACTACAAGCG GGAGTGT CTT CTTGG CCAACTCAC CT TATGTATTTTGAG CTCCATCT CCTC CAACAGGTGATCATGCTGAAATGTGTT AATACGGCCCTTAAGAGCCATTCCTCTACCTGCTACTGTGACCCCTTTACTGGGGGTAGTCCTCATGCTGATCATCTA CAGTAGCTGATGATC CCAATCTC CAGTAAGC CCTGATGG CATC CCAG CATC CCAGTT CC CTTACGGTTATCAG CTCAG CAAAACCAGGCAACTTCATAAATGGGTAGCAGCGTGGGCCTGGCTGAGCCTAAATAAGACAGGGAAGATGGAGTTGGT GACTCACAAGGGATAGATCTGCCTAAGTTGATATCTATAAAGGCTTTTGTTTTTTCTGCTGAAGGAGAAGGGACGAAC TTGGTTCATGGGTACCTGGGGGGAGAGAAGGGAGAGTTGAAAACAATGGAGGGAATGATATGGCTCTCCTTCTGTCTC TGCTGTGCTTT CAT AAGGTAATGGGGCAGGTGGAAGGGTGACAGGCATCAT CTAAAAAAGAACAA(N) xGAGCACTTG G CAAACC CTAATAG CAAAGTGCCATGG CACGATTATTAAGATCACCT CTTTTCAGTCTCTTTGTTAGTT CCAACCAAT ATCTATGCTTCTTGGACTGTTCAGTCTAAAAATCAAGGGCCAGAGATGTCAGGTATCACCACACACATGGTCTTAGTT GAAGAACAAGAATATAACT CATGATCCTCAAT TAAAATC CAAGGAAG CTCTTAAAAATC CAATGGTTTA CATCACCTA TGTGGTTTCTATTGCTACAGTTAACTCAGGAGCTGTGGAGTTTCAAAAGTAGGTAAATAAAGTCTGTATTTCTGTTTT T CCATGCTT CAGTATACTC CTAT CAGAGAGAACAATAGC CAGTGGTTAGTGGTAGAT CTGCTGATTAAATATGT CCTG ATTAAAGAAGATATTCTGTGCAGAGTGTCCCAGGATAATGCGGTGAAACTGGAAATGATCCTTGTTTAACTTTCCGAA GTACTTACTGGAAGCTACATCTGAAGAAGGAGCAAGAGAAAGATACCAACAATGCCTTAAAGCCTTCAGCTGGCTGGG TTTAT CAGAT T CTAAATGTATGAAGTCATGATCTCACAAAATCAGCATTGCTACATG CAGTGAGTGGAAGCCT CAAAC C CAGCAGAACTACAGAAACAA{N) xGAATGAGTTTAGCAAGATTGATTCATGTCAAGGAATAAGCATTGCACAATTTA GTATCTTTCTTTTCTAGACAGTTCAGCCCCATCAGTCTTTACCATTTCCCTTTCTGGTCCCCTCCAACTCACACATAG TGTGG CAGGGGGTGAAAGGAAGGGTGTTAAC CTTGGCTAAGAACCTCCTCTCTGTAGACATGACTTTGGTTGCTGCAG TTTTCAGTCTTTATCTTACTTAGTTCTCAACACTCTATAAAGCAGTTATTTTAATGT(N > xTTTTTGGATCACTTCCC ATCTATGCAATGGTTCCTCCTTCCTTTGACCCTTTCCCCATTTGTGTTCTATACGTGGCCTTTCCAGGAGACAAGGTT GCCACCTTCTATTCACAGCGTGCTTTAAGGACCTCAGATAGAGCCCCCTGCCTGAAAAATACCATCTCCATTGGTCAG AAACAGAGATTGAATAATCATTTTTAGAGTTGGCTCCTATGTATCTAAAAACCTTTGTATTCCTTAAACATTTCCTAT TGAAATG CAAATAAATATC CCTTAAAGAAGTGACT CCTTAAGC CTTTGTAAGGTA CTATGATAAT CTACTTCC CATGG TAAGCCTTTCTAGAAAACAAAATCCTATAGAAATGAAACATGGGTTTGCAATCAGGAGTGGGGTTAAAATGAGTAGCA GAAACCCTTGCAGGAGAATGAAAAGAAGGTGAAAAAGAACCAAAGACAAACTTTATCTACATTGTTTTCTTCCTATTC ATCTGGC CTTGACTG CTGG CATC CATGTAAG CCAGTCAAACTTTGGCTCCT CCAAAGGTGTTAAC CATAAAAGGATT C CTGGATGí N) xTTGCCATAATTGACTTTTTCCCAATGACTTCAACAACTTATAAATATTTCCCATGTTTCACAAAGTC TTACTGGGTTCTG AAAACTACTT CTAATGTG C CGT CCTATCCATCTC CAGC CCTGGGATAAGATTTATTGACCT TAAG CGAATGT CAGT CCTACTAGGCCACTT TGAAATGAT CTGCTTTCTCACTGGAAT CTAAAAACGGAGACAG CAGTTGTAT TTTTAAAATGTGT CTTGGCATCAGTTT CACACAGAATAAAATGTCTCATGTTC CC CAAGACAATTAACTATTG CCACA GTTTTGG CAGAACACACAC CATCTACT CTGGTGCT CTCATCATTCTATTAAA(N) xTATTCTCTGAATGTTAAAAGAA TTTTTCCCTGG CACACATTAGACATGCTGGATGAAATCAGAGAGCTTATTTTATCTT CT CTGTTGGCGGTGCC CCAG C TTT CTGAACATTGGTAAGAACCTGAGG CTTT CAAAATCATGGG CCCAT CAACTAAAAATATGATG CCTAACACACTCT AAAAAAGATTCCCATTTAGACCAACAAGAAGTAATTCAAAAACTGTCCTTTTCTAGGTTACACCCCATGAAATCACTT AATCAGCTTTGTTTTGTTTTGTTTTTCTGCCTTGGTTATGGAGTAATGCAAAAGAAAGATTTTTCACATTTCACTTTG CTAGAACATAATCTTTTCCAGCTTTTAGTGTCCTCCTACACACTTAGCAGATTGTGGCTTATCTCTCAGAGAATCATA CAT CCTACC C TTTAG CTAT CTAAGGGAAAGAAACT CACTGCGAATTAATAGAAAATAAG CC CAATTCCTTACCTACG C ATCAGGTGTCACAAAGCACAAAGCTTTATTGAGCCAAGAAAAAGATCTTGTCATAAGAGGATAATTGCTTCCACATTT AACACCTCCAAAAAGGACACAAATAGCTGAAAGGCAGATATTTTAAAACCATATTAAATCATTCTAGTAGAAAATCTC AAACTTGTAAAGTGC CAGAAAAAGTAAAGAT CAATTAAATATTTGCCTTGGAGAGTC CATTTAGCTCATGCAGGAAGA G CGAAAGCT CC CT CTGCGCAACTGTGGATGGAAAGAGATGACATCAT CCAGATGGT AGGAAATTAACTGGGCTTTTGT ACAACCAGGTGATGACCAAGAACATTTTAAAAGACCACGAATCTGCCAAGAACTTTTCTTACAGAGGAACCGAAGAGC ACTTTAGTCTTGATATCAAAACGACATAAACAGACTTCTTGGCAGTAGCCATCTCTGTCCTGGATCTCCCACTTCTCC CATTTCACTTCTTCCATCTGCCCATCTTGTCACCTCTTAAATCTGTTCTCCTCCAGGTCGGATTTCATCAGTTGCCTT CCTTACAACCATCCCCCTACCCTCACTACCCTTCATGTCAATAGAAATATAGGAAGTATTTTTCACTATCAGAAGACA CAATCTAGTATTTGAGGCCTCTTCTCCATCTTCCATTCCTCTATTTAGCTATGCTTTAGCTGAATTTCATAATTAGTG TCAAATGGGAGAAATTTGATGCTGAAGAAATTGTGAGTTATAATTCAAATAGTGCAGAGGTTTGGGGGTCAGTTTAGT TCAAATTAGTGACTCAATCATACCAATGACTTACAGCTCCGTTAAGCTTGTAGTTTCTCACTGATACTTGATAGAAAG AGTTCTGAAGATTCAATGCCATATGATACCTAAACCAACAGCCTGCTTCATACACATAGGGCCTAGTTCTCCTCTCAG GGCTTGAGAAGTCACAAACATAAAACCAGCCCTCATTCCTTTGGGCAAACAAATTTCTTTCTCTCTTCCAGATTCTCC TGTGGAATCAGTAGGCTTTGAAACAACTAAAATACAGCCTAGTGATGTAGCTGTAAAAGATCCACTAGGAAGAAACAG TGTAATGGGAGTTTTAGAGCAATTCCCAGGGGTTAAAGTGGTGCTTGAGGTTCAGCAGCATTTCTAGAAATGTTTGTT CATTTGCTAAGTAAGAATGAATATAGTCCATCGGGAGAAACTCAGTCACACCTCCTCCTGGGAAGAGTCCTTTTGGGA GTGGCGTGACTGAACGTGTGAATAAGCACATGTGTTTGAGGGGGCAAAAGTCTTACCATTCTAGCCGGTCCCCCACCA AAGGCACTACTCAGGTGTTTGTAACAATAGCCTTCCCCATGTGAATTTGTGAGTGGTGTGTGTATAATATAATTACTA TTTTCTTTAGTTAAGCCTTTACAAGGGCA'TTTCTTATGACAATGATTGTAACATTCTCTCACATTACGATTCCTACAG TTTCAGCTCTTGGATAATAAGCAAATAAAGAATTAAACTTTGCTACCAAAAGAGCATACATCCCTACTCTCCAAAGAG GGACAGAGTAG AAATGT CATO TT CATATACAC AA CACTGAAAGATACACACACC CATAAACATGTGCACACACATACC TGTATACACTTAAGGCAACAGATATCTCTTCTGCTACCCCAATCCCTGCCAGCCCTGCATGAAATCAGAGGGGGAGTT GGGGGCTGGACTCTAGATGAGTCAAGCTGGAGCTTCTGGGAACACTGCCACAATGATCTCAATACCCTGAAGCTTTGC AGCTTTCTGCAGAAAGAAAAAGTTTTAACCTGATTATGGTGGCTAGTTTTTCCACTCAGCTGCAGCCAGGTGATTAGA ATTCAAGCTGATAGCAACTCTTGGTCATTCCTGGAAAGAACAGGATATAAAGAGAAGTCCGTTTTCTTTTCATGATCA GCACAGCCCTATACTTGGCCACTT (N) XTCGG AACCCCAACTTCTTAACTCTCCTATGGCAGAGG (N) xCACTGCACA TGGGCCTATTTTTTATG CATCTTTGTT C CCCAGTAAT TCAGAAATCCCTAATTT CTTGAAATGT CTTTTT CTTCTG CT T CCATGACACTAT CATC CCTCGT CTTCCTTCTATCTCTC TAAACATTCT T CTGTCTCAGTTTTTAAACTG CCCAAC CC TTACAGTGGGGAATTTTTCCCAGGCCATCTCCTCTTTCAAACTCTTCTCACTTTGTATGTTTTTCTGGAATGGTCCTT CTTCCCTGTGGCTTTACCTAACACCAATCTGCTGTAGCTTCACCTACCACCAATCTGCTGATGGGCCCCAAATCTATC TCCATCACATAGTCCCACATTCACCAGCTTCCCACCTGAAAACTTCCCCTTGTGTGTAATC(N)xTGTTTCTTTAATC TGTCACCATACCTTGTTCAGAGTTTCTGAACAAAAACATGCACATAGTAATCTGACCATTAGAGGCTTTCTCTGGTTC TCTGTTGTCATTCTAAAC (N) xACCACTGTGCAAAACTGCCTTCATCTTATACTAAACTTGTGTATAACACATGAGGC C CTTCAGATTC CAGCCCAAAC CACCAACTTATT TATCTTCATCT CCCTTCTTACATTCTAGAAATCAGAC CCACTGAA TTTGTTACTGT CC CCCAAATG CTTTGC CCTTTC CACAGTTAAATAAATTACTTCCCCGGCATAGTCTTCAGCATGAAG CAAGTGCTCAAAATATTTTACTTCCCC{ N) xAGTATATGAATGAATGACATAAGCACTTTTCATAAATGCTCTCATTG TTAATTAGCTCTCTTGAGGCCTAATCCTAGCTGCTGTTGTTATTTTACATTATTGAAAAATGTAAAAAATGTAAAAAA G CATCCCAAACGT CACTGAAAAG CAGTTTTAAGTTAT(N) xATTTCGAGTTTCTAGTTTTATGAGGGTAGTTAATGAG G CTTGGGTAGG AT AACATTGAGT CAGGAAACTT TTCTTTCTCC T CCAAAG CCACATAT CTAAGACAA(N) xTTAGTTC CAATTTAACTTAACAAATAGCTCAGAAAGCTAATATAGCATATGACTACACATCTGGGTGATAAAGATCATCTTTTTA ACCAAATACAACAGCTATTTAGT CTCTTTTATATAACATTAAC TTTATAACTGTTTAT CAAACGTTGAGAAGGC COTA GTTATGTATAGACTTTAATTCACATAGTGGAAAATATTTAAAAATAGCTAATACACTAGAACTTTCTAAGGAGGGCAT TATCCAAAAGAACTCAGAATGAGAAGGGATCTTATATTTTATTAACCTTATAATAAATCTGATATTCATTAGATGCCA CTTCAAATTCCTACTAAGGA(N)XATAGAAAAATTTTAGAGTCCACCCTTTCATTTTATAACAATCAGTAAAGTTTAT TTTTTTACTGGGATAAAGACAI"T ATTAC CAGCT TTGTCTCCATGGTTCT TAGAGGTCCTCTAGC CTAACT CTTAGC CA ATTATTGAACT CTGCTCTGAC AAGCTC AACAAATCTGTGGAAT ACAAG C TGTGG AAATTTTAAT ATTTGCTAAGTT CA GTTATCAGTACTAATAACACCTATGATATAATTATGTTTCATAATTAATTTGGAAATTTCTTAGGAAATAGAATTATA TAATTGC CATTGCTATCTTAAATGAGATGCATT TTATAG GAATTACAATATTAGTGGTTTTATTACACAAAATAAAAT TTCAGGAAAACTTGACTATTATTTAGTAAATAT TTCCTATGAAGGTTAC TATAATTGTAA CATGATGTGG CACAAAAA ATATTGGTCAGAAAATC CAGAGATTTG CGTTCTTTTCAGGCTG GGCC TTTTTTTATTTTG CCTTACCTCAAAAT CTGG TAATAAAAATGATTTGCCTGACTCAATCCCCAGGGTTGTTGCAAAATGAATTATTTCATTTTTATATTTTTAAGACAA AAAGTAGATAATTCTTTAATATAATTCAAAGTCACAATATTCATTTCTATCTCAGATTTAAAAGCCTTAAAAAGTCTC TACAGTTTTCCTCCATGTTAGTAAACAGGATTGTTTAGAAGAAATGAAGAAAAAAGAAAACAAAGCCACCCTGAAGTT GCTACTATATAATCAAAGCATGAAAAATAGTTTCTTACCACTTCTCAGCTGTTCAACTTCACAGAAGTGTGTTCCAGT GGGAGTGTGAATAAATGCATGGACATGCTTAACTCCATTCTGAAAATTAGGGAAACAAATTATTTCCAGGAAAATTTC AGTTCATGCCATAAAGCCAATTTCTTTAATTAAACTGAGAGCACAAATAAGTGAGTTAACCTTTCCAAGATGCTTCCA AGGGATTTCTTCCATGTAGACTTGAGCTCGGATTACATGGTTAAAAGAAGAAAGAAAATGCTCACAAATATTCACACC AAAGGCTTCTATG CTTTTGATTTGTAAAAAGAAA CAAAATGCTT CAAAGCAGAAATAATTGTCC C(N)xTTACTCATG ATGTTCT CTTT CC TGATGAAT TG CTTT TTAGAACTAGACAAGCAGAGAATAGAGTCTACCTCAAGAGAAAAAGGTGTT TATCTCACTTTTTTATTAGAGCTCCCAAATGAAGTTAATAATCACCATCACCAGGTGAGACTCTCGTTTTTAAAAGAA ACCCATTTTGCAGGAGAGGATGTGTCAGTGGATGAGCAAAAAATAACTTAACATTCAGATTCTTCTCACAGTGCGTTC AGTCATCACTTGTATCAACCACAGCACACCACAACACTCTAAACTCTATGAGGCTTCCCATTTTCCCAAGGAAAAATG TTAATGAAGGGCCACCTCACTCCTTACCTTGCATACAGTCCTTCTATTGATCCCAGCTCTCCACCCCCATAATGCAGC TTCCATATTAATCCCTAGGTAATATTTTATGGTCTTAGCACAGTCTGGAAATTATTAATGGCTTTGCCTATAATTAAA GTGAAGTATT(N)x TCCATAATGAATGACTTTAAGTAGTTTTTAACATTCAGGTCCATTCTTCACTTTTCTGCAAACT CCTACACAGTACAGGAAATACAAGAATCTGTTGTACCTGGCCCTTGTCCATATCACAGCACAGGTCTACTTTCAAACG ATTTTTTTAAAGAATGCAGTACCTTTTTCCATAGCCCCATCTATGTTTGCTGGGTCATTCTGATTAGCAAGAGAAAGG TTCAGGGTAAAAATAGGAGTTGCAGAAATTATATTGCAATGTTTGTTCATTTAATGAATTATGGGTATGGATTTCATG AGAAAACATGAGAAAAGAAGCTTTCCAGTCTACATTTCTTGTCTTTTTAGACTTCATGGCAAGTAAAAAAAAAAAAAA ATCAGCTTGAATTGCCCAGAAATCTGGGTGACCCAGCATAGCTTCACTCACACTTAAAGAAACCATAAGTGG(N)xCG AGAGCTGGAAATGCTAAAATATCAGTTCTATCAGATCTAAGTCTCTCTTGAAAAGTCCTAATAACCACTGTATATGAA CAGTAAGTGGTAATAGAGTTGGGAAAGTTAGAATGGCATTAAAATTTGGAAAATAGATTTCAACAATAAAAATAAATG TATCTCATACTCAAGATATTCTTCTGTATTAATAACAAGAAGTGAAATGATATTTTCTTCTATAAATGCACACCATTC TAAAAAAATAAAAACTAAATGGTAAGACTCAAAGAAATGCATGACAAAAATGTGTAAAATTCTATTCTGAAATTCTTT CAGGAC CTTG ATTTTC CTTTGATT CTAATTAGAGATAACATGTATTTTGT CCAATAAGATTCTCTGCCTAAAAGATTG CAGTGAGATAAG AG CC AAGG AAAC AAAG ATGTG AGGTTTGTG TGTT TGTTTGCTTTT C TG CAAAGTTT TCTT CAAT CC TG AGAC CTTTAGGC TAQ AAAAGAACATTTTTTTAAAAAATGG CTTAAATGTAAAAATAGATTAAAATG CAAAGGGAAC TGGAAATAAG C AAGTAAG AGTTC ATGCTGTTTTATG GC AAATTCTG C ATC AAG ATAGG CGG CAT CTAAG AAG AATG AC ACATACTTCTTTAAACTTTGCCAAGACATGAACTGTGTTCTTGATGGTGTCTGTAGGGATGATGTCTGAATTATCTCC ATGCAGGTAATCTTTTTTGGAACTTAGAGTAAGTTGCACTGAAGTTGCCACCTCTTTAATGCTGTGATATTTTCCATC TCACTGAATATGGAGAACTTTTACCATTTCCTTCCCATAGCCAGTTCGGACAAACTCCACCTCATCATTCTGCAATGA AAACAC AGGC ATCT TATTGAAGAAGGCT CAATGAAAGC CAAAAGTT AGGTGAAATTTACTTAAACCTT C ATT AT AT CA TACAATTGTAAAAGATGAGATTCAGAGTATATATGGAAGCAATATCACATTCTACTTTTGAAGAAGAATGAGAACTTG TATAGG AAAAAG AG AATC CC ATGC AGCAAATG AAAATATATC AGATTGTT CTGAAG ACTG ACAG GTAC AGG G CTTTTT TGATAATAATAATAAATACATTAATAGGACCTCCTGGGTCAAAACTGTACTGTTTTATTTAATCCCCACTAACCTAAT CCTTGTAATAGCACTGTAA (N) XTTGTAGAATGGCCAGTAGGTGAACTGCTTTTCTTCTATAAAATATCTTCACTATG TTACCTGGAAATAATTGCATGCCCTTTTTATAGAGAGTGACATTCTGGAGAAAATGCCCTGTGATCAATAGAAACAAT TTGAAAATTGGCTTTTCTACTTGCTGCCGGGAAGAACAGTAATTAGTGACAGGGTCCAGATTACCTTCAGGAGTGCAG AGCCAAGGCTGTCTGGCTGCAGGAAGAGCTATGCTTTTAGTGAAGAGACACTTAATAAAAGCCTAGTGATTGATTCAT TACTACCCTTGTCTGAAGCTGGAGCCTTGGCTAGTGAACACATTCATCTCCACTGGTAGAATCCATAAGATTTGTCCC TGGGCTTTACTGGTGTTTGAGGAAAGAGGCAGGAGGGAGCAGAAGAACAGAAAAACCTTATTTTATTTAGCACAGAAA TTGAAC ATTAAAATGATAAT GAAAGAAG GTGA C AAAAATAAAAGTAAATG AAAAAC AG CATGCAGTGAGAGT CTGG GT AG GAGACAAAGAAGGAAC CT CATCTGCAAATCACACTTAGGATGTT CAGGTGTACTAAGGGCTG CCTAAATATCTATT TGAATAAAAAATCACCAGACCCTCTGAGGGGAATTTAAGAATACTAGCCAAAGTCAGAAAGTACTGCTGTTCTTAAAG TAGGGGGTGGGAGG CTGC CT CCAATCTTGG AG AT AG AG CTGGGCAGGGCAGGCT ATTGTTGGTC ATTG AAC ATGCC CT CGGGG GTTGAGATATCTG AATTCAGTTCTTTCTTGTGAGAGGTTAAGGAAAACT CTTGGAGAGG CCTC TGAAAAGC CT CTGGCCCAACATTTGGGGACTCATGCTGAAGAAAAGGGTCAATAAGAAAACCAAAGGAGCAGACTCAAGGGATCTCAG TAAGAACTACAAGACTGTCGGGGAAAGAGAAACTGTTGTGTGGTAGT(N)xGGCCTGGCTTAGTTCCCAGCAGCTAGT GGTACACTGAGGTTAATC CC AGCAATAG AGGC CAGG CCTGGCTGTGGAAG CAGC AGCAGC AGAGGTAG C AGAGGGG AC TGTGATTAAGGGCGCCAGTTTTCAGAGTGAACAGCACTGGGAGCAGTAACAGGTGGAAGACTCTGGTGTATATTGAGC CACAAGGGACTCTACCTGAAGGAGGAAGGCATTTGGCTGGGTTGGGGCCCTTGAGTGGTATGGGCTGGAGTCTGGAAT GAAAGG T CTCTG AG AT CATC CATC ATCC C CAACATGTACCAGGACT CGAC AAATGTTACT CCTT AGCTTCAT CCCTGC CCCTATATATTCAAGAAACAGTAGTTTAGCCTATGAAGACAAGTGGTACCTACATGGTACATAGAAAGATTAAATGAT AG CCAC CCTT CTTATC CC C A T T(N) xGGTGCCACCACCACCTCCACTGCTTGTGTGATGACACAGCATAGTTAATGAA AACACACAAAGGATTTCAATACAGGAAGAGCCACTTCAGCTCAGACCCCAATTAGGAAGTCCTTTTCCGGGGAAAGTG G GGCAAGGAATTTATG CC CAATGAT CAC CACAGCCTGGAGAT CACAGGCACAGC TGAC CAAAGAACAAAAAG CCGAGG AAGGGCAGAATTGAGTATAGAGTCGGTAAGGCAGCACAAAGTGTTCATATGGCTAATAAGACAGCCTTCCAGTGCAGC ACAGCCATCCCAGACTGGAGATCATGCAAGACATGTGCGGGACAGGTCGGCGCTCCTTTGCAGGACCCTGACTGGACT AAGCAGAGTTGGCCCTTGAGCAAAGTTTGAGTAATCAAGTTTACTGGCAATGAGTCTGTATGAAGGGAACCAATCATC TGATCC TGGAGCAGGTGGAACAAG CCTGGCATGGGACA CCCAAAAG CATC CCTG CACC CAAAGAGCAGATAATGGC CT GGACCATAACCGAGGCTGTTCTCCCCCAGTTCTCTAATCAGATAAATCAGAAGTAGGTGTCAGAGCCCCATTGTGTAG AATGTAGGGAATGGTGCAGAGATCTCATACGGTAAAATACAAAGGGTGCTCTCTTCTGGAGAGTGGGCAG(N) xCAGC ACTTATTAGCCATAATTATCATTGATAAATAATAATCACTCGTCAAATTTTGCAATAAAGGATAAAGTTACTGACAGA GATAAATACC TAGAATGCACACCAATCGAGAG CTGGAGTGGT CAAATCAAAGGG CTGC TG CTATGATAAGGC CTACTT CTGCCTGCCTACCCTCAGGTACATTGGCCTTCTTTAAATGTTCATGATTACTGACACTCACCCCTTTGGTTGTCCTTC CT CAGG CTTGTATGGTTT GAG AAGTTAT AAAAGTGAAT CGATTTCTTAGAAT AATACT CTGGAAAAAATATT AGC AGG AATAA (N) XAGGAATAAATCTTTACCTGGCTCTACATAAGAAAAATCCCCTTTCTACACATGTCTTAAAATGTAAAGG GATTACTTGT CTGCAAGTAATGTTGGGGAGAG CTCT CAGGGATTTTAAATGACT CATGTATTTT CACTTCTG CAATAA GATAGTAAGTGTTCTCTTATTTTCAGGCCATAATGAAAGTGATATAGATGTCACTACAGGTCTAGGGGTTCTTACCTT AGAGTCTTTTAGCCTCCAGAAGTGTCAG{ N) xCTCTGTGTAATTGGTGTCCAAATTGTTAGGATCACTTTGATCAAAC TGACTTTGCCACGGTCTGACAACTAGCCACGAGGCGGCAGGAGGGAGAAGACTTGGTCATTTCAGTGTACAACTGCA( N)xGTGTCAGTAAATGTTAGCTCTTATTATTATCTGACCTGCCAATTTGACCAAAAAAAAAAAAAAAAAAGGATACTT CAAATCAGTT C CAT CAGTAAGTGGTAGGGAGACATC CTTGTTTATCTC( N) xCATCTCTTTTTGATAAAGAAGACTGA GG ATCC CACACAGGTG CACACTGT TTTGTGCT AT AG ATGTAC AACT AATAAAñ C AAAT CCTTGT CAAT A CTAATTGTA TTTTACATGAATCCTAGC TG CTTATACACATTGAGGGCAAAT CGTTATATAAAG CTAATATTCTTCTC CACC CATGGG CCATACTTTTTAGCTCTGTGAGTATTTATGCATGCTTCCTTTTAATAATAGCACTCACATATTTGTACTTACATGCAC CAGGTACC AAATGAATTT CT CCATTTTCTAGTTAAACACTGAGGCAGAAAGAAG TACATTT CTCTGGC C AAACT AC CA AGATTTGACTTCAAGTCTCTGTGATTTTAACCACCAATCAACACTCTTACCCTCAGCCTAGAATGCCCTCTCTCCACA GAGCATGGTT CATT CCACATTGGACCTT CTCCATAAAACTTT CCCAATGC CCCCAAGAGGAAATGACACCTCTTGC CT TTGAACTTCCAGTGCACTTTAGGTGAAGAGTTACTAGCTGCTGAGGTCACTCTTTATTATTCTCATGTGTGTACATGT CTTAGTATTCCCAATATCAATGTCTTGAAGGCCAGATTGAGTCATCTTTATATTTGCCCATAGCACTTGACAGTGCTT TG AAAATTGT CATAAATTATTTGT CCAAGAAA C AAG CT TCCT CATTTTGCAG AT TAGTTC AGAAAATTTAGGTC ATTT TTATTTTGGCAGGAAAAACATAGTTTTCAGCTCTAAGTCACTCATCAATGGAGCCTCAGGTTCATATTTTCAAGTTAC AGGATG AGGTGG AGGGGTAT AAGG AGATTCAG CCTATAT CAG TACATATT TTCCTACAAG AACT CTTATAGGGAAAAG AAATAAGGTTGATTTTTGTG GCTTGTGAG CTC AAAATC ATTCT CAG AAAATAAAAATC CTTGTG AAACTACC CATGGG CCTCTATATTATTAAATATTAGCAGAGGTAGTAATATTATAACACAGTTTCCAACTACCTAAACCTTTGGCACATATT CATTTATTTTTTACACAGAACTGCCTCATTTTAGAGGCTTCTTTTAGCTATAGTCTTATGTTTTTGATTCTAGCATGT CATTCTT CAACAT CAAATTTCTCTACT CTGCTGGCAAAGGCAGGTATTAAAGGTTATTGTTGTGTTTAATTTTAG CTT TTCCCAATACTTACATATGAATTTTCATTTGGTGTATGTGTATGTTCACCAAATGTGTGTTTATGTATGTAGTTTTAA TTAGAGAAAAATGACTGGAAAATATTAGGAAAGATAAGGGGAAACTTATAAATATAAATTTTAAAATGACTAGAATTT T CT CTCAAGTCTTAGAAGAGCTT C TAAAG CATCATCTCTC TTCTATATAATAGAGAGAAATACAAAAATAGTAGCTAT GTAGGAC CCAT CACATATAGATAAATATCAATT CAGTTG CAGTGTGCCATGATAGAAGGAAAAATTATT CATGGTGAC AGCCTCTTTTGTCTTTGGAATGTGGTAGAGAAGTTCTTCCAATTATTTTCCCAGGTTTCTCCTTTGCCTGCTATTTTT TGGGCATTCACGAACAAAAAGACTTCAGAGAAAAATGCTGAGCTAACATGAGAAAAATGAGCACTCATGTGGACAATC TTAATAT CACTTTGGTAGAT TTTAACAATACACAATAATGATAAGAACAATAATGGTGG CTAATCTTAGTGACAG CTG GTATCTTTTTGATTGCCTAGGATCTACTAAATGTTGATGGGAAAATACTGATATCCAGTAGGGAAAGGCATGTTGCTT TAACCTGCAGAGTGGCCTATTATTGCTATCATGGGGGAAGATATATGTTATGTTCAAAGAGTGACTCTAAAGGGATTT GTCAAGCAGATGTTTCAAAAACAGTATAACAACTCATCTGCATAACTATGGCAGAAACGGTAACCTGTTACCTAACAA AGACTCCATTTACAGGAGTCTTCTTTTCATTCTTTCACCTACTACCAAGAAAGAGAAACTCATCGCTAATTGCCAGGA GAAATACTAATTTTCTTGCCAGCCTTTCCACAGCATCAACAGGCTTAGGGCACTGAACCTGGAACAAGGCCCACAGGA G CTTCTAGGATTAGGGG CAGAAGATCC CACAATAAAGAATTGATGGAGACCAC CAAAAACG CAGCTGAAAGAACATTT TGCTGAAATTCATACATTGAAAGATGT CTGTTT CCTAATGTATGT C TTGAAAATAA(N ) xTAGGCCCTACACCCACTT TGATGGCTGGCTCCTGCGTTCATTAGGGAAAGAAAGGAAGCATTTATTCTGATTCCAGCAAAGCAGGGAATTGGCCAT GCTGCTGTAGAAAG(N) xCTGATACAGTGGAAAAAGTCCTGCTGAGATGGAAAAGACACTGTGCTAGGCAGAACTTAC C CCACAC CAAGAG CTGC CC CTGCACAAAC CATC CCTTTTT CTAGCACTT CAGT CATTTTTTTTTTTTTAGTTACAGAG GATCCTGAACTCAGATGAATTCTGTGTTTATGGTGGGAAGGTTTAAGCTGTGAAACAGGAGGAAAATCTTAGGGTGGT TGCCCCTTTGGCTTCATGCCTTCAACAACATGGCACTCATTTGAGTGGAACACAGACTTTTATCTGATAAAGAGCATT GAAAACACAAAT
> H s l_ 14611 757 0 -14 613 58 31
GTTGGTGGAGATTC CTT CAAG GACAAAT CATGACACAAGGATTGAGAACACG CTTGGGCAGAAATC CCAGAGAG CAAT GAGAGGGAATGGTAAGTGAGACAGGAGGGACAGGAGACAAACCAATAAAGTGGCT(N)xACTGAGCACCTCTAGCTTA GC CCT CAATTTACCAT CTTCT CAGGAGG CGAGGG CACC CT CTCAGG CAAAGGGACACAGCTGTCATGC CATCTGGCCT GCATGAGACCTCCTG GAGTAGG CAAG GAGACAGAGAGG GGACCAAAGG TATT CACT GCAT GTACAGCCACTT CTTATA GCTGTTGAGCTCACACAGTGAT GGAGAAAAAT( N ) xGTGGTGGTGAATGCTGCTATATTTTTTAAAATCAAGGCAGGC CTAGGTTTAGTGTCATAGCGAAGGAGCCAGGCACCATAGCCAGAAGCCCCTGGGTAGGTCTGGGAAGTCCTAGGAATT GGGAAAGG TGAAAATAGTACAG GTATGG GGCTGTAAAAGG CAAACTTCATGT CCTG CAGTGTTCTG CCTGTGGCTGGT CCTGAGGAGCACTGGTGGGTCAGTTTGGCCAGAC CTC CAAAGAGAG CCACAGGAAAAAAAGAAGGCTAGAGGTC CTCA TGAC CCAC CT CCCAAGATACAAGCAGACACTTGAAGTGGAAAAT CC CTGC CACT CTGTAGGACAAGAACTACGTGGCC AAAAAGGGAAGCTG CAATAAATAGTT CTGGTTTAACCTGGGATG CATCACAGCTTCTCCAAAGTTTTG CTTTGACTTT AGGC C CAATT CAAC CAA CATGAAACACC CACAAATTAACT CCCAGGAAAAACATGG CTCTTTGAGAAAAGAACTGACA CTTAGTAAGC AGCTGAGCCTGT CAGT CCAGGTTG CAGG ATGG AG CGAGTG AG CT CT CGGCGGGTAC AGTGGCTTGCAC TAGCTGAATT CATTAC TATTCCCTTTGTGCAGCCTTGTGCTTTGACCACTCGCATATATGGAATAGGACTTACAAAAG AAAACTGTTTTGTTTG GCAGGTGGGAAATC AT TT CAAT TGAG AT ATTTTT TCTGTTT ACT CAAGTAAATATAAAGATG CTTTGATTTT CCACAAAGAAAGAGTT GTTATC CCTATTTGAATTTATG TTGCTT CAAAAACAAACATCTTACTTTTAG TATCAGAAGCAGACCATTCTGTTCCTTAGGGAAGTGATATTCTTTAGTAGACTGGTGGGAGACTTGTAGAATATTAAA AGGATAAGACAAATTC CAGAATAAATAACAGGGAAAATTTATTTTCTGAACT CTAATAAC TATGAT CTGTTCCTACAG TAGAATGGGCATTCAATAACTG GAACATGAAGGATTGG CCTACAGG GAAAGC CCAGGGTGGT CTGGGACCTC CCTGAA CCTCCAGCAGATCT CATGAGACTGTCTT TAATTATCCTTTTCTC TCTCTCTCTCCC TCTC T CAGGC CTACAATGAACA C T G (N ) xTTTGAGTTATGTGATTCCTTGTGTAAGAAATCAGATATAAGAAAACACACAGGTATCTGATCCTTTATGGA AAAAAAAAAT CAAGAACAATAAACAG GATTTGAATAAACT C(N)xTAATGCTTTAAAACCAAAAATAAATAAATGAAA TGGGTCTC TT CTAT TAAGGGTTTTCTAGAAAC CTC CAT CT TTCCAC CACT CT CCTATAAAAGTTTTAñAACATTTTCA AAAG CCAC TTATCACTATTCTTAGACTG CCAG TGATCAAAAGGG CAGC CT CG CGG GTTGCAAATTAGT GAAGAAGAGA AAGCTAGAAGGAAGAGGAAGTGGAGCATCTGAGGGAAAGATTTTTAAATGAACTTCTTTGTGTTGCTAAATTTATTCG TATCCTGGAATGGGCTACAACCATCACAAGGACATAAAAATCATCAGGTAAATTCAAGGACTTTTTAAAAATGCATCT AATCACCTCCTGTCCAAAACATTCAAGCTTAACCTTTCTCTTGAAAATGCAATCTGCTCTTTACACCCTGCCTGCATA CCAGATTCTGAGACCTAGATTGCCACAG GAGCAGAAC CAT CACCTTGT C CTGAT T CAATAGGTTCTTGTTGTTACATT AT CAG CTTAAGATC CTGGGGTATGTG GAGATGTG C CTATT CATTAC TGAAAGAGTT CATG CA CAATTTATGT CCTTCA CATAACTTTGT CCTTAAAGGCT ATTTTCTG AATC CATT TCAGTT ATGTTTTC CC AACAGC CATATT AG GAGGTGACTG TTGTTTTT CCATATTCAAAGTAAAAGAAATGAGACAAATATCT CAAAATGATTGAG GTGC TCACAATAAAAATGTAGT GGTAAAGATAGTAC CAGATCTCGTGT CTGTAGCTTCTGTGCCCTCTTTCATAGT CCAGAG GACTGAGCTACT CTAGCA AATGGGTATATG AG AñCTGGTG ACC AC AGTTAAT CC AG CT GAC ATG AC AC CAGC CATTAC ATTCAAGTGCAGTG ATGT ATTTTGCATTGCTC( N)xATAGCAAAATGAAGAAAAATTACAAGTTTCTGACTATGAGTCCAAAAATGAGTTAACATG TGATTTTATGCTAGAAGATAATCTAATGAGGTCACTCTGCTTAGCTGCTTAGCTACCGCTGTTATCCCAATGGGACTG GGCAAGAATAGAACCAAGTCATGCAAGTAAGAGTñGCACCTGATTGGGCACTGGGCTTGGATTCAGGTTGTTATCTCT ATGGATAGCTGTGGTT CACTGAGGAAC CATTC CAAAGATG TTAGAAGAG C CAGACAGAAC CTACATTAATTCATTGGT TAAAGCTCAGAAATAATCATGAGGTCTCTGAGACAGGAGACAATTAGACATAGGATGGCTCCTATTATACACAAAATT TCATTGAGTCAGCT CT CTCCGCTTTACTTCTTGC CCTAGTATAT CTTAAATATGTGGTTTTC CCCTTG CCTAAT CCAG GAGAGCTGGATTTCATAACTAACAAGACTTTAGACTTGTG CTTCAC CT TT CAGCATGCTT CTGGGTTCCATGATGTCC CAGAGCCTATGTGCTTATGATTAAAAAAAAAAAAAGAAACTGAAGAGGGACCCCAGTCTGCAGGAGTGAGATGGGCAT GGGAAGAATTACATTTTCTCTGGTTCCTGGGTATGCTGAAGATGACATCCAGGAGAAAACTTCTGGCCCCTCAAACTG CCCCATAGCTCAGCCTGCGGGCTGCGACTTACAGTGACATCAGCAACATGATGGATTCCATCGCCAGCAGCCTGGACT CCAGGAGGCACCCCAGGCGTGGCCGATGTGCACAGGTAGGAGCATGTCCTCTGCGTGGCCTCTGCACTGGAAGAGCTT CAAGGCCTTTTCCTCCAGGGCTGAGGGGATTTGTGTTTCGCCTCTGTTTGTTTTTGTGTACTTCTGTTTGGCGAAACT TTCAAGCACGCAGAG CAACGTTCTGATAGATAGTG CAGAGAGGAGCTGCGCAG CATO CTGGAG CCGAAAGTTACAAGG AAG AG AC CAAAAC CGGT AAAGTC ACGG ATGTTGTGGC C CG AACCTGTAAATC C AAAC AATCTC AC AGTTATG AAATGG AAAATTTGTGGGGAAAAACAAATGCTTTGGTTATTACAGTGTTACTACATTCATAAAATGTATTTCATTTGGGGGAGT TTACACTGGGGGTAAT GAGATT TTTGCTTTATG CGTTATTTATAACCGCTTAATC CACGTACTGTGGGTT TTTTTAAT TCCTTCATTTTCACAGCTACAAAACACTTGAGCATTTCTAATTTATTTCTGCTTACAATACAGATGTTCTTAGTTGTG TTTCTAACTATATTTCAACCGTTACGATCACGTTTGTTTGTGTCAGCGCGCAAGCCGGGGTTGTATGTGCAATACACA ATAGTAAGGCTTCAAGAAGCCTGTAAAAAGGCCTCTTGTGCTTTCATTATCAACAAATTAAAATATGCTGCATGGCAC TATCTTTATCATCAGTGGTTTTAATCTTTGGATAATTCCCCTGCAGTTGGGTAGGCAGAGTGACAGCCCTCTCTCAGT GTAGAAAGACTGCATGCCTTAAAATGGCACTTTTCAAGATGACCATTTGGAAGCAACATTTGGAGAACATTTGGGTTA TGTCCTCTAACTGAATTTCCAAGCTGTTGGTAACGGACTAAAGAAAATGTAGTCTGTACTGAATATTAAGGAAGCAAG AGTGTAAGAGCCTTGAAGGTCATGTGTAAATATTGACATTTTATTTTATTTAGTAACCAAATTGGGGCCCTTGCATTG CTTATCTTGCCATAAAAACTCCTCACTCACTTTCACTGCCCTGCAGATTGTCAGGGAAAGATCTGATCTGAGCAGCTC TGGGTGCTCTTCCTGCCATTGCTAACAAGACTGAATTTTGTTTGTGCAAAATCTGAGTCAGAGGGGTGTTCCTCCCCT CTCATCTCCACCACGGACTGATGCACCTGATTAGATAACAGAAATTTGTGGGCTGAATGTTCATGTAGTGACATGAAA GCCAAGATGCAAAAACAGTCAGAGAAAACAGCCTCCCACTCACTCAGAGTCAAGTCAGGTGTCTAAAACCAAAGTTAT CTGGGGAACCAATAACTGGCTAATTTGGCCAGAATACCCTTCTCTGTGTCCTCAGCAGCAGTGTGAGGAAGAAATACC AGCATGTCCTCAGGCCCTGTACCCTTCTCAGAAACCAGGACTCTACCTGCAGAGCTTAATCAGGGCCATATTCTGAAG GTGAGATGCAGGCAATGGCCAGGTCTCTCCACCAGCAGCAGTGGCTAGAATGCCCCACACACTTAACAACTTCCGGGA CACATTTGACTGTTGAC CTACAACAGCATTG CTTT CCTCTG CCAAAGCAAAGAGCTCTG CT CATGTAGAATAGGCT GG CAGTTTGGTAACTTGCCAGAGGCTCCTGTCTCC CTACAGTGAGGTCACCAGAATCGCTT CC CT CCTGAAACACAGTCG ATTGAAATGAAGGTCCAGCAGAAATTAGGGGCCAGTGAGTTAAACATGGAGGCCACTTCCAATGGTCTTTAGCCTAAG AGGTTTTCTCCAAGIGAAAAGAAGAAAATTGAATATGGGAATAAAATGATCTTTTAATGCTTCATAACTCAACTCTCT GACTTACCT CAAAAAGAGAGAAATAGTGTAAAAAACCAACAC TGTTTTCTGCC CT CTAAATTTGAGTTTTGAAAGGTG CTCCTCAAATATTAATGATCTGGTTTCTGTGCCCAAATAAATGTCATGATTTACAACTCATGCTTATGGTTGCAATCC TAATCACCCTTCCCTACGGCAGCTTTATAAGTCTTGCCCAGTTCCACTACTGAAATAGTGCACTTGGATTCAAACAAC ACT CAAAAACCTTGTTTTTTGTTTTGTTT CATTTTTAACTATTACTGAGTGGACAGGGT CAAT CC CATTCAAAATTCC CAAGTAGACAAAATCTGTCTGTGGCCCCTCCCCTTCCCCTCACCTTCTTGGCCGCCTCCCTCCATTCCCACTGATGAA CTT CCAAGCTT CCTACAGCTCCACTAGTC CTTTTCAACCAATGGAAAACAAAG CAGTTT CCTACATTAGT GTATCTGC CTT CATAAAGTAGTTTAGTGTT TTGCACAAACCAAGAGAATGTTCATTTTAAAAG CAGGAAAGAAAGATTACAGATGG AATGT CAGA CACCAGGGCACCT GGATAAAATATAATTT GGGGCAGGGGAGTATATTCTCAT CTTACCTAAAGTAGACG AAC CAGT CAAGTCAGTACCCATAAGGTAGGCTTGATAAATCTCAGGACCAGGCAC CT CCAC CC CCAACAAACACACAC ACACCTTTCCACTGTGATGTTTTAGGCACAGTGTATTGTAGCTCTGCCTATCTACTCAGAACCTAATGTATTCACATG AAAGTAAACTAACAGAGTCT GGCTGTCCTGGTGATTCTCAAGACCC TATTGCCTC CT CACTTACC CAAAAGGAAAGGA GCAGTAGTTTTGTTTTTTTTTTTTTTAATTCAAGAAATCAGCAAAAAAAAAAAAAATGTGTTAGTGTTTCTCCAAAAC TTCTAAGTTAGAGTGGCACAATTCATTTTTTTTAATAAAGAATTTGATTAACTCTGGTTGGCCCAGATTTGCTTCAGC CTAAAGCCCAACATCTCTTTTGCAGCGTGGTATCCTTCAGTTGCTCATACTTGAGCCAATAGAGGTGGGAGGGGAGAT GGAGAAGACTGGTTTTATTTGAGCTTGTGAAAATGGCTTTTTATCAAGTTCAATGTCATAAGTTTTCTTCCACAGGAA AGTAGAATATATTTT CTGGT GAGCAATAGTTTTTTATTATATAGAAATGGTTC CTAATT CTTAGCAGACAGGGATT TG AGAAAACAGGGGAGTTCTCCGAAGTCACAACAAATAATGTGATGGTCAACGTGATAGACTATTGTACGCTTTGATAGC TAGTTTCAGACTGATTTTCTGACACATGAGAGAGGAGCAGTGCCTGTTGCCCATGGAAGGCGATTTGCTGGGAGGAGG AACACAAACATCCCCAAAGGCTACCAGGTCAACTGTCATTTTCAAAATTACCATTAAAAGTAGGCTGGTG(K)xATTA TTATTAAAAGTAATAAGAATAGAAAACATAGGCTGTGTATG CTAGATTAACAAAACATT CC CC CTACAAAAGGGGGAA ATTTCAATTTTCTATGAAGTTTGAAATCTATAATCATCTGGCATTATTTCCAGGCATGTAGAAAAAAATTCATTTGTC TTTTCATCTGGATTAGTTTGTCTCCATCAGTGTGTACATCAAAAGTACCTGGGGATTCTGTGGGTTTGGGGAACATTT TTTTGTTTGCTTGTTTGTTTGTTTTCAAAATATATGGTAGTATCCCACCACAAATCTATACTAAAATAAAATTTCGCA GTGTATGTCAAATAAATGTGCACTTTGAAAAATTC CC CAAQTAAATTTTGAGATTGG CT CTTGATTCAGTGACACTAA TATAAACACTTTCACTTACTTTTGAATTTTGTGGTGACCGATCTTAAGATTTGAAGCCTACACCTTCTTAACTTGGTC CATGGGGGCACAAAAGGAGAGAGAGTATAAATGCTGAGGATGACCTTAACAAAAGTGAACACGAAATTAAGAGATGTG AACCCAAAATAAAAGAAAGCAAAGAACTCTGTCTTCCCTTGTCATCTTTAACTCTGCTCTATCTTGATTCTTCTCTGT GTTTTTCACAGGCTCTTCTTCCTATACCCACTCTGTAAAGTGATTTACCCCATAGCTCTCCACCCCTTTCACCTTGGA TCTCTGGTCAGATAATCTCACCTTCCTCTGTGGCTTCAAAGCCATATACATTTGACTTC(N) xGTGATAATGAGGGGG GTCTAGTTATTACAGGCTGCATTCCCTGATAGTGTGTCCCTGCCACAGGAAGGGCACAGCAGAGGCTGAGTGGACCAC TAT CGTG CCTGTCTTTGGGGAAATCCAGCATAAGC CT CCAAAATCCCTTT TCCTCTTTTAATTTGATGACATTAATGA AGGAATATGAG CT CACAGGGATTAAATCAAGATGATTAAAAC AAAGAAGTAGAAAATATGATCACACGTC TACAGTGT GTCTACAAACACGGCATAATCAGATATGAGAAAAAGGAGTTTTGTGAAATGTTAGAGTCACAC CCTC C CTGACGTGAG CAATACAGTGG CCAG CAAGGCAGGAGAGGATGTGCTCTGTATCAAGAATGTGAGG CCTACGGTAAAG CTGACCGGTAC AGGTGGAGCAAGACCAAC TAAGAAGCATCTTTC CCTGGAGAAAACTGTCACTCAT CAAGAGGATGTC CATGGCAATCT AAGTGGAGATTTTCCCAAAGGGACTACAAATGTCAGAACCCAGGTGATCTGACATCAATGAATCTTCCTGAAAGCCCC TTGGGATGTCATCTGATGACATGAGTTAGCATTGG( N) xAATTTAACCATGCAGTTTAATTTATGTAACTGTGAATTG TAGACTCTTATTTAAACATGTGGTATTAATATTAT (N ) xGTGGAGGTATTCATATTCCAAAATACCACTGGGAGATAC AGAGGAATAAAAAATTAAGAAATTCCATACATCAGAAGGACAAGAGCTCTTGAGGTTCTTTCAGTGGACAGTGGTTTT AGACTTCAGTGTG CAT TAGTT CTGTC CAAGATGCTTATTACAGATGAAGGGTGTG CTTCACTAGAAG CCCTCGCGATC ACTACTCTTGATTGCATGGTTCAGTAGGACAATACTTTGAAAAACATTTCCCTGGTGAGTGAATTCAAACCCTCAGAA GAGGGATTCCATGCTTGATACAACACTATGGAGGCTTTGAAGTGTTTACCATACTTGGGTTCTAT( N) xATTTGCCAA TTG CC CAGAAGGATGAAAG CCTCACAGTG CAATCATTGT CAAGTC CTGCTGAG CTGTGTGGGAGCTGGCTTATGTGGA ACTGCCTGAGCTTCTCAGGGGCCTCCTCCTGGCACATCGAGCTCAGATGAAATGGAAGGCCTCAAAGCCCCACTC
> H sl_1 82 08 62 65 -182099422
TTGTTGACCATAGTTGCTT CT CAAAAAAAGCTCAC CTGTTAAG CTTAACAATAGGTACCAGTATTGA{ N) xCTATTAT AAATTGTTTT TGAGAACAACTTAAAAGAAAGACATGTGG CTGTTG CTGCATGATGGACAAC CAAGTTTTT TGACCAGA AAACATTTTGC CC CTTGCAGAAAGAACTCGGGAAAGCAG CAGG CAGT CTGGAATGGCAATGGGAAGATC CAAAGG CAG CAAAGGCTTCT CT CCGCCATGTG AAAGGC CTGAGTTTAG ATACTT CAGGGAAGTTTTñATG CCAACCTT CTGTñAAAT CCCCATTCTC(N) xCAGGGGAGGAGCAAAAAGAAAAAACGTTTATTTTTTATTGAAAAGAAAAACCGTCATTCTCCAC AGCCACATCCATATTTCAGAATTGTTCTTAGAAACTTCCAGAAACAATTCATTTACTGTATATAACATTATGGCCTAG ATTAAACTGAT CAGTAATTATTAATGCTTAACGTTAGTG CGGAAG CT CAAAAATTATATACTAGAGTGTTTGTGTAAC TCTAAAT CTTAAGATACATGT CAGTGATACC CGAAGCTT CTTTTTTTTTTTATTTTATAGAGAAAACATAATTGACAA AATGGTCACATAC TTT TTAAACTT TGTTT TCGACATCTTTGGGAAT TGTAACCATTGAACC CCATGTTC CTGAT T CAA GCT CCTCTGTG CñTCTCTTAAGAGTTGTTGAATTGAAGT CATAATGC CTGGGATGGAAGAAAAGGCTTC CAAGGGGCT GATCTCACAAATCAGCGGATAGGCTGACCAGTTACTCTGTTTGTTCTCAGGCATGTGAAAGCGTTCTCTGGTCCTAAA CATCCTGTTCTAGTTGTGCCCCATTTGTTGAGGGGTTGAGGGTGGCATTGATGCAGTCTCCATGCTTTAACTAGGCTG CTATTACCTATTACCTGCCTGCCTCAGCTTCCAACTACCACCCAGGTTAAGAGGGCATGTCTGATGAAGAAACCACCT CCCTTAGTCAAGATGATTCAACTT CATTCTAC TCT CATC CCCTTTCTC CAGGTGGAAAT CTGAGGGT CTGAGGAGGAG ACAGAGAGAGAAAGGGGACGCAAAAAGGCTCAATTCTCAGGCTTCTGTCCCAACAACATCAGTATTATCTTC(N) xAA ACAATGTAGAAAGACTGGTCAACGTGAGTTTCATTATTATCCAGAAAGCCAGCAGGAGTTGGATGAGCTCATTCACAG AGACC ACTGGGTTTT AGGAGC AGGGATTTTT ATCC CT CTGGACTGGG AGAGTTGGGAAAAAT G AAAGGATC CAAG CTT TACCAAATTCAGACTCACATTTTAGTTCCCAGCCAAAGGTCATGTTGAAGCACAGTGAGAAGCAGTTTCCCTGAGGAA GGGGGTTGGACTTGGAGGTTGGTTCCCCTATTTAGATTCAGACAATGGGAGGGAAGTTTATTACTTTTACAGCATGGT GACAATG CAATGCTTGGTAAAAAACAGTT CT CAGGACATTTAGAATTTGTT CTCTTTCCCTGC CATGTG CTTATGGGA TGCTC CCATAAAT CC CTTCTCTTATC TGGATACAGGGAC CTATTGACTAGT CCTG CTAGGGATTGGGAT CTGGAAAAA GCAATTTTTT T CTTTGGTTTGTG CAT TTTTAAAATGC CTGTGCAAGGACTAAGAACCTG CTATTCATATTC CCAGGAA AAGATAA CTGTAT CAATTACTAGATACGG CACCTTGGTCATC AAGAGTGAGTC CAACGACTAGCAGGTGTCTTGGTTT GTGAGGCGCCC CCTC CAGCATTAATGGGAAAAGAGATGAGGGC CAG(N) xGATGAGGGTCCAGAGATTCCAGTCTAAT GCCTCACTCCTGGGT CGAT CT CAC CAAATAG CAAAGGGTTTCCTGTT CACCCTTTC CATACTGGAACAAGTGAGAGGT GGAGGATGGTGAATCATTTTCCAAAGCCAGAATTGGCCCTCTTGGAATCTTTGTGCATTTCTGATGGAATGTAGGAAA ATTAAGGGAAATCGGGAGG CT CCAGGGATTTAGCATGGTTCGGGC CACAGCACAAGGTC CCATAGGCTCTGAAAGACT AAGCTATAGCTCCAATACAGGCGTTATCTGACTCCAAAGTTCAGGCCTCTTCTACTTGAGATCTGAGTCCCTCACACT GGAGCATGGTCAGTTAAGGCTGCAAAAGGGAGAGAGTAATGGAGGTCAGCGTTGAATAGATACGAATATAAACTCTTG GGCACTGTTAACG CCAAACAG CCTGACAT CTTACTTCAGATATAAC TGTAA CAGTGTC CAAAATCCAATATTAGCATT TAAAACTTGTAGCACAGTTATTGTGGAGTCATTGAGTGATGCAGAAAGAAAAAAAAATAAAGCCGTGAGAATTCAGCT CATTTAC CCAAGT CACTGACG CATTTAAGGAACAG CT CTTATAAGTC CATTTAGGTGGTTTAGGGCTGCGGAAGATG C CCAGAAT CCC TAAATAGAAATAAGTACTGGT CATCAGAG CAGTGCGGTTAGGC CCTTTCTGGC CAGGAGGT CATCTTG GGCTC CAGAGC CACTTGTT CAGCT CTGATAT C TAAACAG CCAAAGTGAATTATTCACT TAGGAGGGTGAAAAACACT C AACTCCCTGGGATTGTCAAATGGAGTTTTCCATCATAAAATATCTTAGTCATTTGGAATCTAACCAAAGTAAACGCCT GGTCCTGACCCATCCATACCTCTGTGCTGGATGGGAAAGGAGAGAGGGAAATCCCAAAACAAATGGAAAAGCCACTTT GCCAGCCACAGAGCATGCATGGCCCTCCCTGGGCAGGTCTACTCCAGGTGAGACACGGTTTGAGCCAGGCTACAGGTG AGGAG CGGGTCACGGGAGGGCTCTAAATGGCATCT CTGTTTGATTCCAGGCCTGAGTGGGGCTGCGGCTTA CATAACA GTTTGGCAGGGAGATGCAG CCAGGCGGAAGCACGCGGGTTTCCAG CAGCGTATAAT CAT CTT TGATTTCTCTGT TTG C TCAGCAG CTTT CATGTGGTGGGGGAGG CAGTAGTG CAGGAGGAGGGG CAGAATTC CGATACACTGCGGC CTTCTGTTT TCTTCTGCAAAACAAACAGCCCCAAACAGATCCCCATGGCCAGGGAATTAGCCACTGCCACAGAAGTCCCGTGGCAGT CTAGGGGAGGAGCTG CCCTGGAG CCTGGGTCAGG C CTAAGAAGGT CAGAGATTGACTTAAAGTTACGGT CT CGC TAGG TGCACAGGAGC CCAAAGGG CTAC CGAGGAGCAGGGGT CTT GGATG C TGGCAGCAC C TAACATGGTTTAC CCTCTTGG C ACTGAGGGCTGTGGTGTCC CCTT CTGATTACAGAGGTTACTGAAT CC CATACTACGTCCAT CCAGAAAGGCAGC TTGT CAGGT CT CTTT CCTACTAATCACTGCCTC CATACCAGGCTAATTCTG CTGGAATC CTGAGCAT CTGGGAGCAACCAAT TTAGAAAATAACATTTCCTTTGACTAAAATACAAATTTCTAGCTCCTTTTGATCCT(N)xTAAGATGAATTCCCTGCC CTATCCTAAGTATAATGATTTGATAGAA(N)xTGACAGAAGCACAAATTGGTCATTC(N)xAATTTATCATGGGGGCC TTTAGTG CCTGGCACTTAG CACAT GCCGTATACATGTGATTAGATTAAATGAATA CATAAATAATTCTT CCAGCATG C AAACTGTTGATCTTGGCAGTGAGGTTCTGTTACCAATGTGTCCACCCTCTCAACCTTATGAATAGGTTTATTGACAGG ATACATCTTGTTTTGCAAGTGATTTGCATGAACTTGTATAATGTTGTGCTCAGATATTTCCTTACATTCAGATATTTT CTAAC CC CCAG CTTG CATCTCTGGCCT CATGTTTTGCTACTC C CTGC CCTATCAG CCATGAGGAACTGCTTGTATGAC CTC CCTGAACT CACT CTGCTC CTGGGG CCTC CACG CC CTTGCT CACC CTCTCCCCTT(N) xACCCTAGCCAGGCCTGC ACCTGTTTTAGCACATCAGTGCACACACACCCTCAATTTTGACACTCCCCTGCAGAGTCCAGATAACCAACCAAGGAT CCAAAT GGATTCTTGTTT TG GACAGAGTCTAG CTCTCTGTTGACATAT CATGTT CTTT CCTCTATTATAGTAA CTACC CATGCTCTGTGTGTTTTCTCAAGTTTTCTGTGTTTGAATACATGAAATTCTTTTGCTCTTTGTATGAAAACAATGCCA TCTGTT CACT AG AT AGAAATT CTGGAAAGCTT CAGAAAAGCAACCAG GTTTTACTGCAGAAC AGTCTAAG AT AT AC CT CAGTGGGATTCTGGGTGGGAGGCAGATAAGATTAGGACAAGAACAACTTGGAGTTCTTAGTTTTGTGTGGTTCCCTGC CCCCCTTTTTTTGATTTTAGATTTTAGAACTTTTCTCTCAAATTCATTTCAATGAGATTCATGCTCCAGTGAAAACTA ACATTCTTCTTACATTCCAGCCTAGCAGCCTCAGGGGTGATGATAGTCTTCA( N) xAAAATTTAAATAAATTGCAAAA AAAATAGTAAGAATC(N)xCTTTGAATGTCAATTTCACACATTCAGTACAAAAGGGCATGCTGATATTTATCCTCATT GAAGCTGTTG AC CT CCTT CAAGG G CTTGTC AATACC AGGACTGCTCAAAATGAT TAAC C AGATG AT TTGG ACTGCACT GGAATTTGATTAGTGGTAATTTGAAACATC CAATAATATTTG CCTTGC CTAGTGATAAGCAGATACTTGG CAAAGAGG ATGGAGTACAAGCCTTGCTTTATCTCAGAACACAGACATCAGGGACTGTAGGAAAGAATGCTCTAGAAGGAGGTCCCC TCCCTGGGGCAGGCCCTGTGTTGGAATGGAATGCAGATTTCAGGTCGGAGAAGGCAAGTGGAGAAGGCCACTGTCCAG GTCTGTAGGTGGGACAGACACAGAGGGCTGGCTGTTTCCCTTGATGGTTCTCACCAGTAGCCAAAATCAGGGATCTAA AACAGAGT CTTTTAACAG GG CTGC TTTATGGAAGAGAGGGGCTAGAGT CGAAAACCAAAACTGCTC CAAAGTAGTC CA AAGAAGCTAATATGGAAACAGCCATTTTTCATAGACTTATTCATTTAGATTAGGATGATATGAATTGAAGAATCTTTT TTTTTG AT ATTGGT CAGAGACTTAGTGTAAGAAAAT TGGGAATATTAAAAAGAAAC AACATTGTGACTG GTCTATG AT TTATATTACTAATAGTAAACTGCCATTGCTGAACATGCTTAGAAAACATTTTAAAGAAGCTCTGTAGAACTTAGGCTA CAAGACTACAAC CTGTAT CACTTAAGGAAGTTATTATGCTCATAACAT TGC CAACTGATG GAATGTGCCT CTGCTGAC TTGTACCCAGGCCAGCCCACACCACAGACTGCAGTCAAGTCTTCTCTGCATGTCTTCATCAGCCCCCCACCCTCTGTT TATTAAGGCCACAGTGGAGCCAGCCTGGAGGAACAGGGCTAGGAAGGAGGGCGTGGATCCAGGAACCCAGTACAAATC ACATGCAATTCCTCACTGTCTCTAATTTTTATTCTCCAGTGAAGATCAATGTAGGAGAGAAAGATAAATGCAAGGAGG GCACCTTGGGGAGTTAAGGGAAAGTAATGGTGCTGACTGCATGCTCTGGGAGTTGTTGTCTTCTCATATCTGTGCTAG CAGGGCAATTTTCTTCTTGCAACACCACATTCCCCTCTACCTTAACAGTGGATACCTGAACAGTTTATTTTGCCATTT AAAAAGATGCAGAATACTCCTCTCAATCCTTCAATCATACAGATTACAGGAAAAAAAAAAAGAGATTAATACAAGTAG AAATGAGCCTCGGTTTTGCCTCTGGTGTGTCACTGTAGAGGAATCACCTCTTTTTCTAAGCCCCTCTATTCCCTAGGC AG CACC CTAGGC TGTTGAGG CTACACGCCAGCTGGTGCAGCT CCACAGTACCATT CCTAT TTGACAGATCTAAGAG GA TC CCAGGTATGTAATAAATGGTCATGAGAGGTTCTG CTGACTGTGGACACCAACATACTTGCAG CT TCAT TGAATG CT CTAGAAGGAGGT CC CCTC CCTGGGGCAAGC CCTGTC CTGGTAATGGAATTCAGACTGCAG GTCG GAGGAGGCAAGTGG AAAAGGCTGCTGTCCAGGCCTGCGGGTAAAACAGAGAAATGGCTGGCTGCATCTTCCCTTTCTCTATTTTCTTTCCTC CAGTGTATACAAAGTGCTGATGTAAAATTTTCTGATCTCCATCAGCAATCATCTTCAAATAGCAAAGTGGAGAAGGCA GCAAGAAGGGCAATAAACCCAGGGAAAGAAAAGAAAGAAGTCAGCAGCCATGGAGGGAGGCTCAAGTGAGTGGCAGGG GTGATG AG AAAC CTGGCCTC CAG GTA GAGA GC A CAATGTGAAACTGTG CAGGGG AAGT CAGTTTTCTAAACTGAAGGA CACAGCATATCTGTTTTGATGATGTTAGTATTAGCCTATCCCTGGAGATGAGAGTCTCAGCTTAGGCCTCTTACAGAA AGCCTGGTAACTGGTGGTTCTTTGGGAATCCCAGATGTCATGAAAGCTGTCTTCAAACTGAGAGTTAATCGTAGTCTC AAATTTGATGTCCCTTCCCAATCCTCACCATGTGGGTTAGCTTGAGATCTTTTCAGAACATAAATTCTTGCAAA(N) x CTGAG G CT CC AGTG AATG AAG GTG AAGCTTGTGCCAGTGGAACC AGG GTTGGCATTTTTCTGTG CGT C AT CTGT AATG CTCTGCTCAGGGCAGGTGTCCTCACAGGTTAAGTAACAAGCGGAATGATGCAACAAAGGCAGTACTACTCTCTCTACG TCTGTCACTAAGAGGCTAGGTTGCAAAACATGGATCTCGTTTCAATATCTGTTTCTTTAGGCTTCAAGAAACAAGATC TAAAGCTTCCTTTCCTAGTATTATCCCAGGCACCTCCCGGGCCTGAGCTCACCCTACCTGCACGCACAGCCCTTCGGA AGAATCTTGGAGAGCCACTGCCAAAGTCATTACTGCTATCCTCATTCTGCCCCATATCGTGAATATAGCAATCAAAGG GGCACCTCATGCATGATCGGGAATTTTCCCTGCATCCAACTCTTAGGAGTACGTGCACCAGAAAGTGGGCTGACCAAG GTCGAGTGCTTATTAGTTTAATAATTGGAACACCCGTGGTACTTCCTGCCAAGTGTTCAAAGACATGACAATAAAGTT TTTCCATTTGGCACTGGTGGCTTGGGGGTGAACCTTGCTGGTGCACACAGGGCTGGAGTTGAGGAGCTGGAGGTCACC CTGACCCCTGCTCCTGGAGCCATCACCTCCTTGTCCATGATCATCAGACTTGCATTTCCATGTGAATGAGCCAAGGGA AGAAGTGTTTCTGAGTTTTATGGG CCTTTTATTCTCTTGTTTTCTTTAAAAATTGGAC CAGGAACT CCAGCCCTTT CA GCTTTCTTCCTCTCCCTGGAGTCCCACTGGTGCACATTACTGCTAATGGGGCTTTGCGCCTGCCAACCCAAAAGGAAG AACAGGTC CGCG GTGCAT CTGTG G CCTAATGAGGAG CCTTGAATGGAAATGAGAGTCAAACATG CATTGAAAAT ATTC AGTTCTCAGTTGGTCATCCCTCCCCCGCCCCTCGCCATCTCAAGGTCCCCGAGCCCTGTGCCAGCAGAAGGTGACCTG CCAAACTGATAT CC CGTGAAGCTTGGG GAAGG G CACTCAAACGGGACC CAGAAG CACA CC CTGAACTTGTTTTTTCTG TG CTTAGCACTG CCAAGG CAGGGG CTGAAG CAAAGAGGCAAG CGGGAACAGGAG CAGGGTTTGC CTACTGAG CAGAAA GGAAAATCAGGGGAATAGGGAGCTGGGAGTCAGATGCCAGGTGGCAGCCTGAAAACCCATTGTTAGGAAGTTCACTCT GAGCGGCGAGTAGGAACTTTTATCTTGGCAACTTTAATGGAGAAAGAACAGGGAGGCAGGTAGCAGGAAAAACATGGG CCCCTAAGTCAGGAGACACAGTCCCAGGCCTGGCCCTGGGACACATAAGCTGCAGCAAAACTATGACAATTTGGATAA CTTGTTTTTTTTGTTGTTGTTGTTTTTTTTTTGAGCCTTGGCTCCCTACTGTGTAAAATAGGAGTGGTAAGCCATGCT TTACCTGCTCCCCAAGCACATTGTGAGGAC( N )x TCTGGAGGTGGTGAGGCTGGCTAAGCAGTCAGCAAACGGGCAAG GTGGGCATGAGAGTGGGGGACCCCCAAACCTGCAAGTGTTCAGCTAATAAAACTCAGCACTCCATATTACAAGGCAGT TGTTCAAATCAAAGTGAATG GAGTAG CT
=>H s2_75150255 -7516 710 8
ACAGTCCCTCCTCATCTGAACCTCACCTTTCCAGAGTGGGAGAGAATTTTGTCCCTCTCTGCTCCCCACTTGCCCCTG GCCCCGGTTATCTCTGGGCATCTCTCTTTTTCTTCTCTTTTCCTTTTCTTTCC(N)xATCTCTCACCCTTTTGTCCCC CATTTGAAACTTTGCCACCAGCCTGGTCCTGGCCACAAGCAGGGTCACATGTTTTCCTGTCTTACTTCATGCCCC(N) xAT CACCCCTTG CTTTTTGATTGTGCTT CAGCCAGCGC CC CT CCCGAGGCAGGTAAAAGTGG( N) xCTTTAAGTTTTA TGGTAAAAGTGGTATTTAGGAGACTC( N) xGTTTTTTTTAAATGTTAGGCATTAAAATGAAAGACAGTAAACTCAAGA GCTCTGTGTATGTAAGAGTGAGAATGGGGTGGCAGAACTGGGAAGGGGCCTAAAGTAGGTGAGTGCAACGAGGCAAAG TACAGCACATTTCTGCTTTCCTGGAATAAGTCTGTAAGCAGCCACAGCTCTCCCAACCCTGCCCCAGGCCCGCTGTCC TCAGCTATGACTAGCAGGAGTCCAGGGTCCACTGCTGGGAGCCCACCTCCTCCATCATGTCTTGCTGGAATCCCGGGG GCTTATGCAGGC CTGAATGGG G CT CC CTGAGACTTCTGTCTAAGGAAGCCATGCACTGGCCAGCTT CATGGCTGTGG C TGCTCTTTCCCTCTCTGACACGGGATGCTTTTGCTGGGTGATGGCCCCTTACCTTGGCTTCTGCTGTTTTCTTTTCCA GCAAAGGGTGCTTCTCTGGGTGGGGCTTCAGAAAGAGGCTGGCCCCATAGTGCCTCTCCCTTCTAGGATCCGCTCTGC CC CGGTAGTCTC TTTTATTCTCTCAG G GTGCAA(N) xTTGTTCAGTGCCTGCTGGTGGACCTGAGAGGGGAAGAGAAG GTGGGGTGCTGACCAACTGCCTTCATTTATACTTTTGCCAATATTAGAGGCAGATGTGGGTCTGGGGTCTATTTAGGA AAGGGAAAAATGTGATTCCTTTTTCCAACTGTGTCCCCCCTTTCCCTAACCTTCCTTTCTTGTAGGGTGGATTAACAG TTTTTTTTTGGCAGAAAAACACACCAACCAACAAATGAACAAAAATCAACACCCCAAGCTATGCATACAAAACAACAG GCCCAACCCACACACTGAGAAATCGGACACTCAGCAGGCCACTCTGCCAACATGCTTTGAAAGGATACTCTTAAGAGG TTTTGGAGGTAGGGTTAACCTCCGAAGGGGACAATAGGGACTGATGTTTGCCTCAGGCTGGTAGGGACAAAGGGCATT G CAGAGGAGAAGA CAATGGAAGTGAG CC TTGTGGGC TTATTTGGTGGCTGGCAG CTTGAAGGTT TCTC CCTCTAGG G C AAA(N)xCAGGGTTACTGTCCTGCTTGGTGACCTGCCTACTTTGCCCAAGAAACAAGTTGAAAACTTCCACATCTCGG GAAATGCCCAGGTCAGGCTGCATCAGGGCTGTTCCTCAGGAGCTCAGGGACAAAGTGAAAGCAAAGGACACCAATGCT GGGTGCTTGGGACAGTGTCCTGTTCTCCTCATGTGGTGGTCAGTGCCAGTACACATAGGCAGTTTTCTGCCTTTAAAT AAGATAACATAAGAAGCTGTTTGTAAAGTGTAAAGAACTGAGCACGCATTAGGTGATTTCTTTTCTTTTTTTC(N) XG CATGAGGTGATTTCTACCAGTGAAGGGTAGGTGTGGGTGTTTTGGGCCTGGCTGTTCTATCTCAGGGCCTACCATTGT CGTCTTGTTCACACTCCCTGTGCTGATTATGAAGCTGA{ N) xAATTGGCAGAGCAGGGACTTGCCTCTTCCTGGGACT GCTTCTTGTC CCTGTGAGCATCAC CC CAA CAATGACTTTT CACCTT( N) xACATGCCCCTTGGAGGCATTGCCTGAGA AGCACATCAGGGTGTAGCCTTACCTCTCTGTGCTGCTCCGAGGACCCTGATCTCAGAAACCCTGCCAGTGGCCCCAAC ATGTGAACCATCGTGGTAAAAGAACTCTCTGATCACAAGAAAGTCCCCTAATCTAAGATAATGTTGATAAACAAGCTA ACTCCAGGAGGGAGGGTTTAATTTAAGAAAGCCAGTAGGGACCCTAAATCTTTTCATTCCAATCAAAGCTGTCAAAGC CAGTTATCGTCAGAGCCAGTCAGGTGGCCCCAAAGCCGCCATCTACCTGGCAGCTGTCCTGTCACCACGGAAACCACA AGTTGGTGAACAAGGCCCGAAGGAACCCTGCTGAAACTCAGTTGCTTGACATATTTGTCATTATCGAGTCTGCCAGAG TG CAGCATCT CTTG CAAGAGTGATTCTC TGAAACTT CT CCTG CTTCAATC CCTCTACCCATTGTTT CC CAAATAGAGA ATATGCCCAC CT CTGAGAAAGACATGTG CTGGCAG G CAT CACAGGCCCAAGAG GAGGCACTTGTTTACTCGACAACAG ACAAGGTGACTGTCTGCTGCATTGGCACCAGGACTAGCATTCTTTGGGCATCCCCAACTACCTTTTCAGGCCCCCGTG TCTTCAGGCTGTCTAGGTCCTGGACTGGTGGTCGTCAAGTCTGCCCACCCTCACTATCCAGCAGCACCAGTGGAGCAT GCTTCATAACTACCCTTTCCTCTCTCCGAATTGAGTTTGATGTCATCACCCTGCAAACGATGAAACAAATCCCCATTT TATTGATGACTGATAACTTCAAATGTGACAAGAGGATTCATTTCCTTATTCAATCATTCCACAAATATTCACTGAGCA
T(N)xATCATAATAATTTTCAGGTTATGGATAATGCCAATGGGCACTTAGGATGATCAGGGAGGGCAGTTCTGAGAAG GGGCATATGAGCTG( N) xAACTATGACCTAAAGAAATTAGT(N)xGATGTTTCAAATCCCTTCAAGCTGGGAAATTCT GTCTCTGTTGCATCTTTGGGATAACAAACTTCCTGAAGAATCACTGTGTCTTTTCAAAGACTGAAAATGGATTCCCAG CAATGTGAAATCTTGTGCCTGAGTCCTAAAGTGATCAAAGGTAAGGTTGAGCCTAGGACAAAATGAGGATCCCCTGGA GG GAAGCTAGG G CAACACAGTAAG TGGCAAAAGAATACGAGTGAGGTTATTTGACTATCCTCTT CCAGTAGGGCTGGC TCAGCTATGGCAGACCCCAGGGTGGTTAGGACAAATGGCAGC( N) xGATGAACAGTATTTCCTAGTTTAATGTTTTTT TTTTTTTTTCTAGGCAGACTTTTGTAGTGCACAGTGACCgGTgACGñGCAGCATGTGTTTCCñTñGGCATCTTCTCTC AGTACC <N)xGTTTGCGTTTTGAAAGTGAAGAATTGTCGAAGTGATG(N)xACACTCTAAATCTGTGGTAGAAAGAAC TGTTGTTCCCAACTCCTAGTTTCCCCCTGTAATAGGGTGACACAGCCCTGCCCATGGTGTGACTTGCCCATTCTCCCT GAAGGGACACG(N)xGGGTCTGCTCTGTGGCTAGGGCTGTGGCCACTCTCCCATTCTTTTCCCTGGCACCATCTCAGG TCCAGGAATAAAGAAGAGCAACAAAAGTTTTCATTAGACTTTAAAAACTTTGAGTCTTACTCATATTTAATCCTTAGT TGCTGAAGAAGTGACTTTCATATATAGGAGACCCTAAGGGGGTACATTCTCAGTGAAGTGACAGCCATGGGGGAGGGA AAGATTGATCCATATGAGGACTTCAAGGTCCTGGCATATCAGGACAGGTCTTTAGGAGTCCTGTTGACATAAATAAAA TAGGTAGATT AT TG AGACTG AGGC CAGG CCCTAAAC ACTAGT GTA (N) X AAACT AGAATT AATATT CCTGAGAAC AAG GGGTGGCAAAGC CTCCTCCTTCATCTGG GGTGAGGGGAGT CTTCTGAGTAGATAAAGAACTC CTTTTAGGGTAGAAAG GAGCTGAAGTTCATCCAGTGCAAGTACTGAGTGAATTTATTCTAAGGTCCCAAGACAAGAGAAAGATAAAAACAAAAA CGGATGTATCACTGAAGGAGATGTGCTCGGCCAGCTTCCGGCAGCAGCAAGAACAGTGGGTATATAAGCTCCAGTAGC CAAGGTTGCCCAGGTTACCCTTTTCAGGGGCAGCAGTAGAGGCATAGTTGGTTCCGGAGTCAGAAAGACCCAGCATCT TCTGCTGATGCTCAAGCTGCATGTTTTTTTTCAATCTTCGCAGCTACAAATTTTTATCCCAATGGGGAGCACTGTTTA TAGGTTGGAGGATTAGGTTCCATAGGAAAGAAAGAAAATGAGGAGAGACACATTGGACAAGTTGGTGGGAGACTGTGC CACTCGCCCTCTAGCCTTTGACCATAGACAAAGATTAAATAAGGTTATATAGATAGGAAGAAAGAAGTAGAGGGGGCA AAGAACCAGGTAGCTGACTCATTTAATCCTCCCAAAAGGGTTTTTCTTATTGCAGTTTCTTTGCAACTCTGACCATAT TGTAATATTAAT CAATCAATCTAT CACTTGCCTTTGTTGATATTTCTGTAC C CT C(N)xCTCTGTCTCTCCTCTCTTC CTTTTTGCACATGCACAAAGAAGAGGTCATGTGAGCATGGAGCCCGAAGGTGGCCATCTACAAGCCAGGAAAACAACC CTCAT(N)xAAGCCACCCAGTCTCTGGTATTTTGTTATAGCAGCCCAACTGGACTAATGAAATTTATTTAAATTATAT TTCCCCCATTAGAATGCAGAAACCAAGCATCTCCATTTCCCAAATACCACCAGTACCTGGCATG( N) xTTAGTAAATC AATAACATAA( N) xCAGATATGAAAGGATGCTTAATACTCACAACCTAGGGACACAAATGGCATTTCCTTAACCAAAG TTAGTTGAAAGTGGGCCAGGATAAAATAACATCTTCAGACACAAACTACATTAAAGATTGCTTTAGCTGATCCAAGAG AAGGGTCATATTTGTTCCAATTTCAGTCTAGTGTCCTCTCTGTAAAAGGGATTAGAGATTTGCCAATCCACATCTGGT TCACTCCCTGTAAATCAATGCCCTCAGAGTCAGTAGACAGACACAAAGCTTAGAGGAATTTAATTGCCAAAGGCAATG TGGGGATñTGAGAGGATGAG C A T T T (N ) xACTCATATTTGTGAAAGTTCTCACGAAACTCTTCAATAGCCAAATAGAT AGATGGCTTCTTGTAGTTCTATCTTACTTTATTTGAAAGATATATTTGAGAACACTTATCAGAGACTCTATGTCTCAT CTTTGTACTGTGCCCAAATCACTTACTTTCTCCATTTGTAGATAATCTCCAACTTAATTACTACTTGTTGATTTTTGA AATTTGCTCATTGAGTACCACA{ N ) jcTGGCAATGTGATCCATAACCT(N)xGGTGTAATAATGGTAATGTAGGGGACT GTCTATGCTC CAGAGATGTAGG CAGGGTGTTTCGGGTTGAAGAGTTA CAATGTCTGAAACTCGG TGTTAGATGATT CA GCCAAAAACAAACAAACGATGTCT( N ) xATTATGTATTATAATAAGTGGTTTGTAAATGTAGTGGTAAGAATAAAAAT TGAGCAATATTTACTAAAAAGTAAAATT TGAGAACCTAGGAGATAATTGT CT CT TGGGAAA(N ) xTATCCTAGAGTTT GATGTAAAGTATAAATGTAC TCAATTTTGT TTGCTCTC TCTTCTTAGGT AAAAAGACATC CCTTAACTATGG CAAAAC AACAACAACAACGACGACAAAAAACAAAA CAAAACAAAA(N)xCCTGCATCAATTTGATAAATAATCCTCCCCGTTGA TGTCACTGGTTTTTTTCTGCACATCTTCTCACTGATAT TTAAATGCTGAG CTTACACTTT CCAT GGACATGCTTAATG A CTAAACAATAGTTGAAT TCTAAATTAATCAACTGACCAAC( N ) xCTGAGTGGTATACACAAAGCAATTGAGGAACAA CCTAGGAC CTTGTAGT CCATTAGGATTT CCAAAAATAGAAAGAGAATGGAAAGT CA CCTGGCTAGGAGAAGC CAGTGG AAACTTGA C ACGAAGAAAAAAG AG CAGCTAAT TT CATT CCTGTCCACCAGTTATTTATGTGTTTAT CTTT AATT AC AT TTGTTTGATTTC CCTTATTAAAGT CTGATGTCTTAAAAAAG CAGAAAAGTGAGG CAGGTCAG CAGG GGAT GTAAGTTG GGAAG AAAG ACAGGTG AGGG CAAG AATT TAGG CAGG AG CC AC AGTGTTGGTTGTGC AGGTGAAGGT CAGGTG ACGG AG GGTAACCAGT CATGGATGAC CCAG GCAG GAGO CATAAC CAAAATGTTAGAAAAAGT TGGTAAGAAATT GTTC C A { K ) X GTTTTCCAGGAACAAGGG CAGAAC TTAGCTACTGAGTT CTACCGAAGGCCAGGATC TGAGTCAAGCAC CAAG GTTAAC TCC CAGGGAAGC CAAATGTT CATAAAAG GAAATCTTTC CAAATCACAAATAT CAGGGCCTGC TAGGTAAATT CTAG CT T CTCTGTG C CAAG GGCTGTGTT CCTACT CT CAAG CG CCACTCACTGACTGTCTG GGTGGTGCTGTGGG CTTCTGTACT TTTAGCTTCATCAGCTGCACCTGTCCTCTTTTCC CATAAGA CGCCAGGA C CCTCAGAATCTT CC CT CTTAC CAGAAAT TTGGGTTTTGGGCTGGGTCCCTGGACCGAATTCTGACACTCTAAGTGTTTGCAGACTATTTTGATAGAGGTAGTGATT ATGATGCCAGAAAAGTGAAGCCAGTTTTTATTAGAATGTTGGCCATCTTGAATAATCCATTCCCCCACTTCCATGGGA CATCTTCAAAGAGATTTTTAAAAGACAAATATGATCTCTACTAATGAGATATCTTCAGAAGGTAGAATTAAAGCAAAA TTAATGTACACAGAATTC CTATTT CAATATATTTATTATTCACAA CATTT CTACACACACA
> H s 2 _ 754 80588 - 755194 17
TT T CTGTTTACT CAGATACCTT CTTG GCACATAAAT CT T CCTAGCTGACT T CTGAAAGAAATTATG CAAATGTTAG CA GTAACAG TTCTT TAAAAT TTGCTG GTAT CTGAGGAGAAT CATTGT CCAGAGACACT GTGT TATTTGTGATACTTGAAG TAAGGTGAAAACTGAAGATGAAATGGGCCAAGTCCCCTTTCTTTTCCATAGTGGTATTTTAGTATTATGGAATTGTGT GGTTTTATGG GATC CCTAGTAG CCAA GGAATTTG CT GGAATTTGATGGGTAG CTGAAT CACACAGGAAAT CAGTAAGG CTGAAAAAAAAC CAAT CATATATAAAAAATTATATG CTTATATGCTAGGAAGAG CAAATATATG TATATACACAACAA G ATATTCTAG{ N ) xGGGGGCCTTCTTCTGTGCGTTTCCAAAACAAAAGGACAAGTGCAGAGATTGCTGCATTTGCTTG TGTCAGCTCAATTTTCTTTAAATAAATAAAAAGTTGAGAAATCAGGAAAATTTAGCTGCAACTATGAGTGTGTTTGTG CTTACAGCAGAAGATTAAAAAC CAATTT CACACCTGATAAATAGGTACAACTAATC TGTT TT CACAACAAAGATTT CA AAAAG GTCTTCCTTTT CGAAGGTCAGGTTTATTT TG CAAAGGTAGAGAT CT CTACATACATT CTTG CT TT CT TAGCAG TGGTTTATAT CGAAGAAACATACAGAGG CTAAAAA(N)xAATGGTGGTAATAACAATGAAATACCTGCTACCTAAACA T C { N ) xTTTTCTGTTCGGACTCTGTATGACATAGGAATAGTATAACTCAGAAAAAATGTAGTTTCAAAACTTCCAATA ATTCTCTGATTTACACTGTTATTAACTGAAACTGACCTTTGGGCAAATCATCAGTTCTGGGAAATCCACCTGAAAGGT C CTATCCT CGATAT GCAACG GTAAATGAAGfiTATTACT CTAAGAAGTCTC CT C ñ (N ) xAGGCTTCAGATTAAATTATG ACAATCCAGAGAGAAAGGACTGGCATACTTTCAATTTCATCTAGAGAGATGGATGTGTAGCAAATTTCCAGGAAGTCC AAG GTACTGAGTGACT CTGAGTTC TGAGAT CC CAGAATTGTGAGAGATATAGAAAAGC CTACTACTAGATGAGAAAAT CTC TCTGTTTCTTTATTCAC( N ) xGTCACTATATATGTGAGAGGAGTACTTTGAATAAGTAGTCCCAAATTTGGATGA TTTGG GTAGC CTGG GAATTTTC AGTCTGTGTAAG TT AT ATAT AT ATTTTC ATTC AG CC CATTTACATAAC AACGTCTA GGGATCAGAGATTTAAAAAT GAAAACA C A T G (N )xA A A T A TT TTT A A A A TG T A TC C T C ( N ) xCACTTTGTTATACATT CTTATTTG GAATAATT TT TCAATAAC{ N ) xGCTCTTGTGGATTTGCATTATCATCTGGTTTAATTGATT(N )xTGGTT TTAATAAC TT GT TGTTAAAAAATC TTGAAATT GT TTTATATTTCAGATAT TCACAAAAGT TATAGAAGAGGG GAAAAT GCCAACAAAGGCAATCATAGAAGTAG CTAATAATAAGAAGAACTTGTGGAGCAT TG CAGT CA CAGAGTTTATATTGTA TTTCTGAAAATATACAG C CTGATTTCAT TAGCAT TT CT CTGTTATTCAAGAAGC CT CAG GGTAT CATAAGAC CAGATT GTTATAATAACATCAACTAACATTTGTGTGGCACTTAACAATGTTAGTCTTTTCATTTACATAATCTTGTGGTTTGGT ATfiATACTAAACTTACTTAAGATG GA GTGCTAAAGG CTTTGTCATCTCTG CCTCAACTTTTT CTAC CTTTGC TTTG CA TACAGTATTGAAAAAG TGAATCTAAACT CTGTTAGAGAAGAAATGACTATG C CTTCATTCAATCAT TT GTTTTC CCTT TATAAAGCATATGG CT TATTAATATTTATTAAGGTGAATAATGTGCACATTATATG TGAGTACA TGTATGATTTAT TC ATTCTGTACTTTGTTT CAAAAAATATAAAG CACCTTTCAAAAATTTGTAGAG CAAGATAAAAAAATAAGAAATAAAAA GAGGCAAAGGGAAATGTCAGTGTAGAAAAATAAAATGACG{ N } xCCTAAAATATTTACTTAGGAGCATGATTATTCCT AAT TCTATGAGC C T T (N ) xGAGAGGTCAACAATTTTAAGAGAGAATTCAGGATGAACTGGAGATAATGAAGTTGACTA CAGGCT ( N ) xAAAAATACTTTTAACTTTTTTTGGGGGGG(N)xACTCATGATACAGACAACAATCTTGAGTTAC(N)X GAGCTGTATTAG GGAGGGAAAATAGAGT TAG ACACATTTTCCTCCTAACTAAAGAAAGAC TACCATGTACTGAAATAG AC ( N)xCATCAGAATTTAAATGTTAGTTTGTTGTTGCTTCTTATTCTGGGGATAATGTATAGTGAGCACOAGAACTAA AATGCTGCATTGAAACTAACTCATTGCCCTGCTGTTGTTTCCCAATCTGGGGAAAACTTACAGTGGGCACTGAAACTT AAATG CTG CC CTATAACTAAAT CACTGC CCTG CCATTGTTTCCCACTCTGG GGAAGGCTTAAGCACATACTGGAAC TT
a a t c c c g a t c t t t t a g t t t t t t c a g g t c a a a g g a a g g c t c c t t g c a g a c a c c t g a g c t t t g t a g a a a a t c t g g c c a g g CAGCTGGG CCTT CC CATGGATT GTGT CT CCTG CAGTACTATGGTGATGGC CAGCCTCTTCAGTGTGGTGTGT{ N ) xGC TCAGTTTTTCCTTCCATTGATTTTCCCATTTCTCCTTTTTCCTTTAGATAAGATTTTGAAGGATGAACAATAGTTGTT TTGTTTAC TTGGCAAGTAT CAGGAGTGTAG CTAGTATGTGTGGCAAATAT GGAGTACAGTCATAT CAACTAAA CTGTG ATAGAACTACGGTCCATTTAAAA(N ) xGTATGAGGCAGAATGTGATAATAACAAAAGTAGCTAAGGTTTCTTAAGTAA ( N ) xTAAAATAGGATGAGAGACTTACTGATGAGATGTTACTCATTACACTTGGACAAGTACAAATGATTTGCTTGGCG CCAAAG C CAAA CAAGAAGTGGG TTAATTGTAAAAGTAT C CAGGACTCT TATT TAAC CTAAGGAATACTAAAC CTC CAT TG CA3TAAA CTGAAAAAATATACATTGATTTTGCAGAG CCAGACCTTGATTT CATGC(N ) xAAACCTCCTGGTGAGTG CATTCAATTTCTGT CAAGTATT TTTGTGACTCAGCATG CAGT CAATAGTCAG CTAAGATTCAGAAGTCTC CT TGCCAA TAGGGCTGTCGGCATTGTGAGGACAGTTGAAAGGGAGGGAAGCCTGTCGGATAATCAGATGAACCTGTTTGGGGCATA AACT CCTAAACTGTGGTGTG CTGC TTGTGGTTAACATT TATTTC TCAG GGAATAGTTAATAG CC CGTTGTGC TTTG CT TCAATAAATAATGGAAAGAAAGAAAGGCAGGGACAAAAACCCGGGGCTGGATTTAGAGAGTTCTAAACCTTTGGTTTG AC AG AG CATATTTTTT A C ACTTTTTATT CT AG AC CAGGTAAG CTTTTACAACTATAAA G CAA GATG CTTACATC AG AT GC CAAATTGT CCCACCTGATTCTTTATG CCAGCATCTG CAGATG CATT TAAAAAAG CTTCATTT GATTATGCAGCT TC TT CACAGC CCAAAAGC CAAG CAAG CTGTGGGCTAGTTG TAAAGGTGTC CATGAGCTTTGCTTTA TCTAAGGAATACTA AACCTC CAGTGCAATAAACT GAAAAAAAAAAAAAGTTG GTTCTACAGAGC CAGACC TTGATTTCATGGGG GAATAATA ATAAGCAGAAGAACAACGAAAACTTCCTGGTGAGTGCATCCTACTTCTGTCAACTATTTTTATCATTCAACATGTAGT CAGT CAG C TAAGAGTCAGAAG T CT CATT CAG CTGAAATT CCAAAACTGAATAATG CATAGTATAGTGTCATT TACTGG AAAAGAGAGG CTTGTT TATTTC CTGAGT TT TACACT CTTTAATGTACT GC C C CAGAAGTTTT CT TT TTTGGAAACCAC AATAGAACAC TAAAGAAGTGTAAT T CAGAAATGTGG CAGATAATTTGC CATT CTAT TTTCTAG AA CATCAGC TCTTTT TCAGCTAGTAAAAATGGCTTACTAAGTAAATCAGTTTTGGCTCCAAAAATTATTCAGAGGAATCCAGTTTTCTTCCTC CCT CACTAAATAAACAATGAGAAAGATAGATAAAATAT CAGT TT GGAGAAA CATCAAATGATTAGC TTACATAGGCTñ CCTTTG CATTTCAG CTGTC C CTGGGTGG CC CCAGGAAAA CAATCATAT CT G AAAATGAT(N )xTATTTTGGTTATCTG TTTTCACGGTATTGCTGCCTGACAGGCAACAACAAAGTCTCAGTGCATTCAGCAATAGACATTTATTG( N ) xCCAACT ACATAATGCATTCACTTCTTTTTCTTGGCATCTCCCTTTCCAAGCACACCCACTCTCCTGGTTTCCATTATCTCCCTG CAGACAACTCTCAACTGGCTCTAAGAAAGCTCACTATAATTTTCAGCAACCTAATTCAGTCGTTTGTCCGTCACAAAT AACAGCTCAGGAAAATCAAAGGATTTTCTCACTAGCATCTTATCATTCCATGTCACCTAGCAACTATCCCAGTGCTTG ATATTCTG CTACTAGAAACT CC CTACTACACAGATAATACCACTGGAAGGTATAAAG C CACTTGACTGAC CT CTCT CC ATTCACAGCATTTAT(N ) xTTAAAAAATTAAAGAAACATTTGAAATCAATGCACCAGAGGCTAGAGACAGAGTGCTAG GAAGGGAACAG GATGG GGACAGGGACAC CAGAG CAAAGATAGAGTCAAAGACACATTT CCAAAATGGCAATT CATG CC TTG TGT CAGTTGTACC CACCTCACATTCTTTAA CACTTAAG G (N ) xATAAAG G TTTTTTTTTTTTTTTTTACA A(N ) x GTTTTCTAAT CAGCAAGAATATTAGTTT TATTTTGCTTATTATATATT TG AATTTAC(N ¡xTGAAACTATGTTCCAAA TC CT CAAGTAAATCTG TTTAACAGATATGACAAATATCTGTCAATATTAT TATCCAAGTGCTTAAATACATC TGATAA TCTACAGTTAAATATGTTTTAATCTACAGTTAAATATGTTTT CC CCTGGTAC CTCCTT CTTTGAG C CCTTGATTCTAC TG CT CTAATTTGGTTGATTGAT T(N)xTTCTGGGCATTTTGACAAGATCTTTCAACCTAGAAATACGGTT(N )xCTTT TG AAAAAT ATGGCT AT CCTTTTTGATGT CAGTCTTTTT GGTAGATTTTTT ATTTTATTTTGAAGG CTATAGAG C AATA
A (N ) xA T T C C AAGT AATTTT AAAATGC AAT ACTATG ACTAT AAAA CTGTAGTGGGT TAT ACT CC AAAGAC AAAAAC TA TT C C AAAATATATT TAAATAG TTT TCAGTAATCATATTGTTTGT AC AAAT TG ATAC TGTT ATTCTAAGTTGT AG AGTA TGTATTCCTACATAAAACAAATGAGTAATTATATGAAATTCAAATTCTATCATCCTCAGTGTTGTTGAAATCAGAATT TTCATCATGGGAGAAAGGGCACGATAGAAGAGATTAAATAAAAGCTCTAGTCTGAATTTAAATTGAAGCATCATTTTA AATTTGTCATGTATTTTATCTTTTTT (N ) X CTTAGC AT GAGC AATTGCTT C C AG {N ) xTCTGGAAAAGTC AT CAG G AA AACC CAGAATATGG CAAAG CCTGTCAGTGGGGTGGG CACAAAGATCTTGACTTTAG TTGATTTCTT CTGCTCTCAGTC CTAAGCTTCACCCTAATAATTGGGCCTGAGTCTTTCTAGAGTTTGGTGAGACAAGCAGCCTGCTGCTTTAGGCTATCT TTGTCCTCAG GCACTCACA CAAAGTTGT CCATTTTCTGG CCATTTCTC CAT CTACT TTTTTTTTTT G GTGTGTAA(N ) xTATACATATATAATG TATATTTTTGTACAGTGC AT GT GTAACATTG C AT AT ATATGC AT AT AT AT ATAT GCTGTGTT GCTCTTTGTACAGTGACTAACAACAGATAATCTATCAATATGTTACTACATCATATTATTTGTGCCTAAACATTCATT ACTAAGTACACACGTTAAAG GATGTAAAAAACA CAT GTACTAAGTACA CATGTTAAAGGATGTAAAAAGGATGTTTTC TTTTTTGGAC TTTTAAT(N)xCCAAGAGGTCCAGAGGAAGTCAGAGTGTTGGGACATCCCCCACCCACAGGGGAGATG AAAG CT TTT CAAAGAT TACAAAGAG GTC TGTTCTGACAGCATAG GGAGTGAG CCTGAC TTCCAGTG GATAAGAGTGTG TGTTTCCCATGGGACAGACAGGTGAGGACTCTGGGTTATCCTGGAGCAAGAGCCGCTGGAAGCCAGGGCTTGACAGGA GT CATTG GTGGCCACT GAG GAGATATTGTC CATAACAGG GAG CTGCAGTTGGAGG T CTGCAGTT GGGAATAT G CAACC CTGGAGAAGC CAAAGG CAGAGT CAGTGGGGAAGA CGGGTAGAAGATCAAGGTñGGGAGAAATAATGAAGGAAGTAC CA CAGTCCCTTTCCCTATTGCTGATCTCAAAACATGATTCACAGCAGAGCTTAGAAAGGGGGGAAGTTTTAGTTTGTGTA AAAG ACTG AAGTTT GGGT AT ATGGTATTGAAGTT TT AG AATTTTGAACTAAT G GAAAAGTTCTGGAATTTGC CTGAGG TC CATC CAGTGCCATAGAAG GAAGATTAAAGAGAGC CATTTTAC CAGAGGATGGTGAAAAATACAGTGGCATTCTGTT TACACACTACTTAGTT CAGGTCAGTTTCATATATTAGTACTATTT CTTAT TCATGGGTTTT CAT CTATGT CC TATTAT CTGTCTCTTCATCTATATGCCAGAAGTATACTGTGATGTCCATTGTAACTTTCAATGATGCTCTAATATTTGGCAGAT TAAAAAATAGTCTC CATTTGATAATGTATGATCTTGTATACAGTACAC CCTTATCT CATACAAC CCT CAC CC CATACA TGCAAG( N ) XCCCCTTATTAATCTTTTGTCAAGAACTTCACAGATACTCTTGTCTGTTTATTTCTCTGTATTACCTTA GTATTAGT TTGCCAAT GTATTTGG CTGAGCG AGTAATTfiCCTGATCAAGTATfiTTC TCTTTC CAGGTTCT CT CTGC CA GAGGAGACTCTTCTTCTAAG GGACATTTATTTTT TTTACATG GTAGTCAC TC CAGATAGGCAGATTGTCACACTAGTA GGGAAAAATG CCCCATGTATTTTCATAC CTTAGT CT CTT CAT TAGTTC CTTCCTCCCTGGGTTGTT TCCTTATAA CAT TTGCCGTC TAAAGTGTATCATGAT TTTT CCTCAACT CTG CAGTG CTTATATTATTT TC CCCACATTAAA CAT TTGGAA
t t a a c c a t t c c c t t a c c t t c c c a c t t c c a c t t t t t c a a a t t a c c c c t t t c t t t a a a g c c t a a t t c a t g t t t c a a a t g c TCCACCATTATCCCAGTCAACAGCAGGCATCTTTTCAGTCTGAATTTATTGTGTGCCGCTTGATAATGTTAGGAGCCT T C CTGT GATTTTATGTTTGCTATGAAAAGCAT GGCTCTAT CTCAAGG G CTCTTAQTATG AACTGATGAGGGTG G G CAA GC CT GTTATATGGG GTGATGAT GGAATGGG GTGGGGTGGACAGATCAACTTTTGTGATAGGTGTGGCCTGACAG CTAA CT CT CCAAAT TGTCAACAGT GAAG TGTGCTGC CAA C CTAAGCCT GATT CTAG CCAG CATCTAT C C CCAGGGC TGAATG GT CATG C CGTTCACATTCAT CTATGCTCAATG CAG C CAG GGCAACTTGGACATGTT TCAATGTACTGAAGAATTAAAT CTCCCCATCTCTCCCTTGATTCCAGGATGGAGGCACTCGGGCTGTCAGG(N)xGTACAAAGTCAGAAGTTCATGACTG CAGT CATG GACT CACAGC C CAAG GAAGT CTTT CTGAGT CAAAGGAGACATGTAGAAGAGCATTTAGAG C A A G { N ) x T T TGAGAAAC CCTT CT TTAGTG GGAGGG CAAAG AAG CTGAGGGAAT CAGGACTT TCACTGACTGGGCTGGGGTTCAGGGC TGAGAGCCAGTAATAATAGGAATAAGGACAAACAAAACAAAACAAACAAAACAAA(N)xGCCTGAAAACTACGGCAGA GGGTGAGTTGGATGTATAGATTAGCAGCAGCAGTAGTGTCATGCAGCTGTCATAAGTGACAGCTTGGGGAGCTTGGCT ACAGAAGATGATTC CCAGAG CCATTTAAGG CAAAGGAACG C CTACATG CCTGTGATGGAC TCAGAGTGAACCTG CAGG TG CTGG CTACTACTGG CTGAGTGAAG GAAGTCAATAG GGTTGAAAGTGG CCTTTAA G T( N ) xGAAAGACAGAAAGTGG ACTTTCAGTATTTC CTAAATAATTTCAGGGTAGGGACTAATTCT TAGCATGTTTATGTTCTCCA TAATGCTTAAC CAA GTGTGCTATATTTGTTCAATTACTGCTTATTAATTGACTGATTGATGGTGTCCTTAATTGGTCATGCCATTTCCATCC ACCTATGCTCAG CACAGC CAGGACAATTTGGACATCTT CCAATGTATT GAAGAATTAAAT CTCC CAGTGAGGAT CTTG G CAATCATAT CACCAATAGG CT TGGATT CAAATCTG CATT TAAATAG CAATGAAACAAACA GAC TCCATCCC TGTG GA TAGCAAG CAG CATAAT CTGCTTACTTATGGCTGGGCAC CTGTGAGAAATGAAAGTGATAG GCAGTAAC TTCAGTGATG AATTACAGAACAGT TT CATTTAGC TCATTG CACCAAGACAAAGCATCC CAGCAGCCTCTG GAG AAATTAATG TACC TT AACTTGAATTAGTGG ATAATTT CAGATCAG CTATTTTA CATATC TAGC TGCTTCTCCTTCCCTCCATAGGCCACCTCT TGCCCTTT CT CT TC CTAAAT CT GG CT CT CAAAGC TAAG TATTCTAGAAGTTAAACATTGAAGTAGAAATCAGTG GACA TGATTCATGTCCTTACCCTCTCAGCAGATG CCTAG CCTTGTTCCTTTTTTACAGAGCCTTGTTTTCA CAATT GT CATG
t t a g c t c a t g g a g g a a g t t g t t a c t t c c t c c c a t g a t a g t t a c a t g a c c a t g g a g a a c c c t a g a a g t c a g a c a g a c t t TACAAT C CAAAGAAAATTAC TC CT CGGTGGTAGGATTTTAGGTACCA CAAAG GCAACTGTGGACAGAGAAAG CCTCTG ACTGCCCCTAAAAATCACATCATATCTACAAATAAATAATGATGCCATTGCTTGATCATTAAATTCCTGTGGGATGTT
g g c c g t g a a a c t t c a t c t t t t t g t a t c t c t a t t t c t c c c t t c c t g a t g a t a c t t c c t t t c c a t t t g c c a a t c c a c a a c TGGCAAGGAGAT CATCAAAC GCAGTGGG CAAA GG CAGAATTTCT CTCTTTCT CATAGTATTTGCTGTAGTTC CACACA
t g g t g a c t g a a c a c c t g g a g c a g a g g c c c c g t t g c a c a c t g a a t t g g c c t c a t t a t a t c a t g a g c a a g a t c t t c t a t c a c t g t t a c t c c t t a t t g t t g t g a a g a c t t c t t t t a t g t c t g g a a a g a c a t t t a g g c a a c t t a g c a t t g t g a t a g a a a a c a g c t a a a c t a c t a c a g t t t t c t a a g t t c a a a t c a a g a t t t c a c t c a c c c c c t a g c a t c t t a c t a a a a c c t g g g t t t c c t a t a g g g g a g g a t a a a t t g c c c a c a a a g t t a t g a c t a c c a g a a t t g g g a c t c t g t t c t g t g a a g t a g a a a t a a a t g c a g a g a c c a t t t t t t c a t c c t a g a g t a a g t a g a a a a a a a g c a a c a g t t a t t t t c a c t t t c t t t t g g c a c a c t t g g t t t g g c c t a a a a c c a t c a a c t g t a g a g g t a t a t t g g t c t t g a a a a g a a a a a a c a a c c a a (N ) xTATTTCTCTGTCCTGTAAC a c t a a g a a t a a c a a g a c a t t a c a g c (n ) x Ct g a g g c a t g t g g c t a a g a a g a a ( n ) x t g a a a t c t t a g c t t g g a a a c c a c c a a t g a a a a a c g t t a t a t g c c a t a c a a t t t g g a a t t t g t t c t g a a g g t g t g a a a a g t t t t t t t a a a a a g g a t g t t a a g c c a a g g a a t g a t g t a t t a c a t t t g t t t g a g g t a c c t g a t a a t c a g a t t a a a t c a g a t t c c a g c g c a a t g a t c c c c c t t g c t g c c a c a t g t g g a a a a a t t c a c a g a t a a c c t t g a t g g g c t c c t c c a t a t g g g c t c c t a g g a a g a a a g a g t g a g c c AGGGTTCATGATGGGGCAGCACCAATTTGCTTTCCTGATTTTGTAACAGGGAAAGCAAAGTATTAAGCAGTATGGAGT
g a t c a t t t t g c a c t g t a g c a t a c t t c t t a a a g t t c a t t c a g g t a c c a g a g g c a t t a c c a t g c a t t a t g c t a g g a g c a c t g c c c c a t g g c c a g t a a a c t g a a a t g c t a t t t c c a g a c t g g t a a a a a g t g a g a t t a g c a a a t g t g t t t a c a a c t t g c a ATCCTAGCCCTTCACCCAGGAGCTGTGTTGAAACAACCTGAATCTAAGGGCATAATTAAGAGGACCCAGGGATTTATA
g t t t g a g g c t t t t t g c c t g a g t a c t t t c c c c a a t t c a c c c a c t a c t c t g a g c a c a g a a a t c c a c a a a t g t g c t g t c c t g a g g t g t g a g a g g g c a a a g g t c a c a g c a t t a g a g c a a c c a g a a t g t t c a a a a g (n ) x t g g a g g a g t t g g c t t c a g g t c A A A A T T G T A (N )x g t t g c c t g a a t t g a t c a c c t t g a c a c c t c a t a g c t c t c a a c a t c t a c t c t a t a t a t t c a t a a t g t a c a g t t t t c a g t t a a a a a a t a a g a c a t g t a g a a a c a c a a a a
> H s4 _ 3 8893729 - 38919 55 6
TT TAAAAG CCAAGTAGTATGGCTTTTAG CCACTATATTATATGGTCTTTTAAAGTATAATATATT CTTAAAG CTTCGT
g t t t a t c g t c c c a c t t g a a t t t t t t t a g t g c a c a a c g c a t c t a c t c c c a a a a a g g a t t t g a g a t a t c t t g ATa a a a g g CATTTCATTAGCAGGATATTACAATATAAGGAGAAAACAATGTG CTGGTAG GG(N )xATCTTGAAGGTGGAAGATAAG a a g a t a g g g g a a g c c a c a g g t a a c a g a g t t t t c a c a t c a t c c a c t t c a g g t a t t c t a a g g t g c a g t c c a t g g c c a t g a g g a c c t t g c g a t a g g c c a t g a a t t g g t c c a c t t c a c t g t t a c t c c t a a a a c c a g g a a g g t c t t c a g g c c t t t t g c c c c a c c c c a g t g g a c c a t g a c c a c a g t g t t a a g g g c c a g a t a t c t a a c t t (w ) x t c c t c t g t c c t t g g a a a a a t g t t t g a a c t t a g t g g c t a t c a a a a g g a t c t t g c c a c t g g t g g c a c t t a g a a g t g a c a t g c t t t c t a a g c c a a a a a c t a a a a a g g a g g t t c t a a a a g a t t g t t t t a a t t g g a a t a a g t c a t g a a a a a g t a a a t a a t a t g g t c a c t a a a a t t a t a c a g c t c t t c t c g t g g t t t a t a t c a t a a a a c a g t c a t g t g t c c t g g a t g t t t c c a g t t t t a a c c g c a t c c a a a c c a c t c t g g t t t a t a a TGAATTGGTGGTTCGAGGGTTCTAGTATAGATGGAGGACTTACTCTTTCATTAAACTTTTAAATATAAGTTCCTGCCA
c a g t a a a c a c a g a a a t c a t c t g c t c t g t g g t a g c a g c t a t a g g t g c t c t c t g t g c a c a c t t a a a g g t g c a t c t c c c t g t a a c t g c t c t a g c c c t g g t t g g t c a g a c c a t t t g c t g t t g t t t c t c a t t c a g t g t c t g t a c c t c a t t t g a c t t t g g g c t t t g t t g g a c c c t c c c a a g g t t t t c t g t c c a t g a a c t g a c a g c t c t c c c a t t g a a t t t a c t a t c c a g c a c c t t c t t a g g t g t c t t g c t a t a a t c t g a c c t t t g c a c a c a t g c a g c t t t g g t t c c a g t g g a t g g t g c t c c a a g g c c t c c g g g t t t t g TCACCTGGTTCACATGTAACCAGTAGATGGCGCTGACTTTACTGCTTAAAAAGTGGGATGT( N ) xGCAGGGTGTGATT t t t c a c a g c a g g t c c c t t t t g a c t t g a t c c g a c g t c a t t g t c t t a a t c t g c g g c c t g t g c t t g t g t t t g g g g t t g c t c t t a a a g t t t c t c t g t g t c t g a g c c a c c a t g c a g a t a a t a a a t a c t g c a t c t t g t t a c t c t c a c a a c a a t t t a t a a a t t GTGATGGTTCGCATCTGGCTCTGAGTAATCATTGATCCAGAAGAGCCTAGAGAGGTAAATTCCTGTAGGCCATCATGA GCATTGTGCCAGTCACATGGTCACTGTG CCATGAAAAATAATAT CC CG CT TTTTAAGTAGTAATGT CAGCACCATCTñ CTGGTTATATTT GAAC CAAGTAACAAAACCTAGAGG CCTTGGAG CACAGT CCACTGGAAC CAAATCTG CACGTATG CA AAGATCAGATTACAAAAGGACACCTAAGAAGATGCTGGATAGTAAATTCAATGGGCGAGCTGTCAGTCCGTGGATGGA AAAACTTGCCACTTTTCTTCTAGGGTGAGTAAACACGTGGTTTCTCTGAGAGTGTATGACATTTGCACATTCCTCATG CAATATCAGAATGAAGTTCAGTATACCCTCT(N ) xACTTTTACCTGAGGCAGCATGGCCCTGGGGTTGACACTATGCT TGTACATAATGGAAGTTGTTCCTCGCTTGCCTCTCTCAATCCCAAGATAATG GCTCAACATAGA CCACAGATCAGGAA GGCACCCATAACTTTAAAGTCTCTGGAGGATAGGGTGGTGCAGCTAGAAGGGGGCCCAGAGTCGAAGATGAGGCCAGT GG CT CTGCCAGC TG CT CAGCTGGCAC CT CTGCTT TGAAGAC(N ) xCCCGCCTACCTTAATTCTCTTCTCGTCTCTGTT TGAGATCTTTGT CAGTTACACCACCAATTA CAGAAC CTTAGTATTTTC TATGGTT CTGGC TA CATG CTGGAATTTTGA AAAG CAAGGAAATCTTGG CATTTTTCTCTTGTTTG CAT ACAAAATGGC AAAG GGTAC ATG AGGAAAGT CATTAAAATT TT GAAATATTGATG CT CATT CTAAAAGT CC CT TTGGAGAAATTT TAGAGCTGT CTTTAGAAAAG CAGGTACTAAAATT GTTCATCACCACTGATCATCACTACCCACCTCCTCCCCCCAAAATGAATGCAATAGAAGTATAACTGAAAACCCTGCC TAAATTGCTCTTACTGCATATTTTGGAGATTTCTGATGAGGACATGCTGTGACTTTATGTAGTAATAGTAGTTAAAAC AAAAGC AATGTATATT TCAAAATTGGACAAGTGT CTTTGTAGAC CTGTACAGAACTAAAG CATGTAGC CTAACAGGAG GCTG AAGTCATG GGTT CC CAG GCTGGAAACAAGCAC TGGTGGTTGGGAGT TT GACTGCTGAAAGGTAAGAGGGATCAT AAGTTGC CCACACATGGAGC CAGGCT CACAGC CTGAAACAGGTTGGAATG GGGCT CAAAGAT CT GTGAfiCAGAGAATT TAAAGTTATGTGTG CCTT C CTGATCTGTGGTCTATGTTGAGC CACTAT CTTGGGATTGAGAGAGGCAAGCGAGG CACA AT TT CTGCTATGTACAGG CATAGTGT CAAC CC CAGGGCTTTACAGCATTAGCAGTTTATTAT CAT CAGGCAAAGAAAC TTATTGGAAAA CAGAT CATT CACAGTAGTAGACAAG CTTTGAAGATTTATAAATTGCCTGTG GAAAGTAAGGAAAGAA AATAACCACATGTTCAATTGCAATTGATGTCTACTCAGAAACCTCATGTATACCCCCAGGGTAATTCCAGCTGCAGAC AGGGGTGCAAAACATATAGACATTAACAAAACACAGAGCTGTCAATTGAGCACTAGAAATGGAAACCATTACCTCTAA CAGTTGTTGAAAAACTGCAGTTAGAATAGTTAGAACTGTTGAGAGTTGGCAGGAGAATAACAGGAGAATAGAAACAAT TGAAAAñTGGTAAT CT C T T G ( N ) xATCCTGTGTCTATGCATAGAAGGAGAAAGATATTTGTGCCCACTGTGCTAAGTG GAAAAATACAGAAAATTAATTGAAATTATGTCTACTCTATTTTTATATTTTATCTTTTGCCTCAATACTCTTTCACTC TGAGGTTAAG G GAAAATGAGAAAAAG TGAC CT T C CTAATGCTTATACAAAAT CAAACATT TTTCAG CCAACAATTATT TTGGAGAGGACT TAACT CTTGCATTAG G CT CAACTACAGCTACACTGT CATCACATT C TTAGAAGATTTGATCACC TG TCACCCATGC CTTTGG AC AAGTTTGT GAAATG AAGT AAGCTT G GTT ATTATTTCC AGG CAGA CAC AAG AATAG AAAAC ATTTTAGAAAAGAGGGTTTTTG (N) xAAGAAAGAAAAAAGAAAAGAGGGTTTTGATAGTAGTTCCAAATCCCACAGGA TCTTATAGAC CTGACT TG CTT CCAAAATGTAT TA CTAGG CTAAGAGTGATAGACCAGTAGTCAAAATGAAAGTCAACA ATG CTAAAAC ATGT AT AAGT AT ACAAAAGG AAGAG CTCTATA GT AG AATACAGT AATTTATTTCTAG ATACCAC CAAA GG CTTAAATGTTTATTTG AAATTCTA GTTTTG A C AGTTTTATGAGTTT AAAC ATCTTATG CAAG AGTACCTGAGGTTG GGGGTTGGTGTGTACGGAATGTACCTGATATGTATATTTACTTTTTCCAAAATTATCATTGTTCCCCTTACTCATCCT CTCCCAAGGTACATAGTTTTTTCCTTCTGCACATTTTAATGAAACTCTTCTAGCAGCCCGGTCAGAAATGACAGTGTT TATTACAGAGAATGGCAAACTTGTCAAAATGATACTACGATGAGAAATCATCACCGTATAAATCACTGTCGATAGCGC ATGGATT CATTTTTGAAGGT GACATTGC TGATGATATGAAATTT CT CAAGGAACATTGAGACAAAGAGAATAGGAATA ATGC TTAGAGTTTGTC CAT CGTCCTTTAGATT CTAT GAATAAAACACAGGGGTTAATGAGTTTCTGTTG GACTTCTGG GCAAATATCATCAGTT CAGTAACACTTC TTGTCTATTTTGAGCC TAAAGG GAAGTAAG CATC TGATTACAGAGTG GAC AT CAACCATCATATGGAGAGGAAAGACTATTC CAAAGGAAG GTTGATTGG CT CCCTTTTTGC TG CACTATAGTT CG CT TTGGTTGAGCATGG CAGTAT CAACTTGTGCTTTT CT GAACAATTGCTCTT CACAACTT TT TATAATAT CTTGAACT CA T CTAT AGAAG AGTCTG AAA C A C AGTC AGTACTTT AT AT ATATGAG (N ) X C A CTGTGATTC CAGG CAGC AC ATAG CACA CAGCACTATCTGTGTGTG CTGAGACTTCAACTGT GAACTGCCAAATGGGAAAAGC CAG GT CCAGAAGACTTATATTTA GGTAAATATG CTTTTATATT CTTGAAAG GAAT CACAAGAAATAATTAGAACTGTTATCTTTAGAñGG(N ) xTTG CTTA TTACGTTACTGGAATCTATCTAAGCAGAAAGAGGCTGGCCCACCTTCTGTTATCTATGTTTCTATACTCTTTGGACTT GC(N)xTTTTTTTACCACTAATATTTCAGTATA(N)xATGGAGTAAAAATTCAACCAGATATATCAGAGATATGATAT TATGGGATTTTT AAAAGT CT CTTTTT AATACATT CATTTTTT GAAAAAGAAG GAAAAAGAGAAAGT TAAAATAAGAAA AAGAAGGAAATATTGT CT TAAAAAATAAAACAAG CTAAAAGG GAGAAACT TT CCTGAT GAGTTAGCGTGGGCCCTT CT CGTGATCCTTTAGCATATGGGAC CAATAGGAAGTAG CCATTGAGCTATCCAGTCATCCAGGGTG CTGCAGAGAACTGC TGTTGCTTCCCTGAGTTACCACTTGGTCAGAGAGAACTTCCCAGCATA( N ) xCATCATATTCTTTTTGCAAATTATTT CATCTTGATTAT TT CAAATAG CCAGCAGTTAT TT CATCTTTC CAGAAAGAGGAGAGACTT CC CT T CAATGGCTTTT TG ATCCTGGTG GAATT CACAAGAAGAGA G CTT CATAAGATCACA GTGCTTTGCGG GAAGAAAAA GATCAAAGTGGATGTG TAAGATAGCC CTACACAGTGTCTTGA GAATACAG CCTGCACA G CACAGGGAT GATCCTAAAAACA CTAAGCTATG C CA TCACTAGACACAGTTGACflGGGAGAGGGTTAG TGTCAGGTTTCACC CTGGAGGTGATCTT CACCTTGTTCTTTGGTCA AG GTAAGGTAAG TCAAGT GCATTTTAGAATAAAACAATGTTTTAAAAAGTAG GAAGCT CCAAAC CAAGGATCTGATTT CCAAGGAGTTACAGAAATGGGATTAAAATGACATTTAATCAACTATAAATGACACATAAAACTTTCTTAAC(H) xTG A GTATAGATGGTAAATTAAATACCAGC CAAT CATT CAGCATTTATTT TAAAAT GATACATTAT CGAAGCATTTGTACGA CATT CTCATCTT CTTTGCAG CACTCACCTT CTAT TG CAGTGAGC CAAAGAGTAGAGGATATT CCAT CT TTAGAACCAT GGAGTTTAGAG GGGTTTCAGATTTTGGG GATACTTGGGGTTTG GGATGACA CATGAAGAAAT TAAT C CATGAAC TGTA CC CTATAGTATTAT TCATATTTGTTGTT TTTCT C TTATTAGTAGGT CC CAGT CAGTAC CTCTGGAGAT GAAATCAC TC C CAATCAAAAGG CAAGAGTGACTTCATATTAT TATTTTCTTGTT CTGGA CA CTAGATGTCACACTAAGTAAAGT TT TA GAGGTTACTCTTTAGAGTAAGGTGTGGCACAGTGTCTAAAATCTTAGCTATGAAAAATTATAGCTCATACATAAATAT AAAG CTGAA CTTTATATTTATTTTTTGTTT CTTTTTTTTTGT AAAG CTGAA CTTTATATTTCTGGTAG AATG CAGT AA AAGGGTACTTTC CCTAA CAAñAAATT CAGAAAAAGAATTGTTAATAAGAAAATGTAGTTG GAACAG CAGCCTATTTTG ATGT CTGTTAAATAGATAGATC C CATTTACAT GC AC AT AAAATCGT AT TTC CTT CTAGTAA CAAAGTGAT TTTT CTAG AGGCAGCTCGTGTATACATATCTGTGATCGTTTATGTTCTGAGATGGCGAGAAGCCATTCTTCCTGCAAAAGGAACCA GACATTAT CCTAAC CTGCAAGGGACGGAGAAGGTGAAAGAAGAGATACTGAATATGAGTT TCAT TAT CTC CTGAAACA GT CT CAG GGAAGGATGTGTGAT CATCTTTCTTGAGTGG CTTAAG CAAAGAAGACAGAGAAGATGGTGC CTCCCAGCCT CCTTGACCTCATTTTTTTCCTTGATTTTGGCT CTCAGAAATCTAGACGAGAATCATAGGT TGAGATTTGGAGGCACAT TATTTCCTTTTGGAAAATGCATATATATATATTTAATGGCAGGGAAGTTGGACTCAGTATGTAAAAGAGTAAGATATT CTAT C CTCTAA CTTAAATGT GT TATT GAAACTTCAATAAGTTTC CATTGAAATG CTTGAAATTCATTG GACGCTAGAG ATAATTAATT CAAG CCTTAACAGT CGCAATACACCT TTATTTTTTCAAGTAACT CAGTAA TTAA TT TGAACTGT GT CT AATGATCT GCACATGACTATAAACACGTATTT CTATATGTGATTA CAGTTGTAAATAATG TTTACATTTCAGGG TTTT CTAGGTAGTG TATAAG GACAGGTGATTTCCATTGGACATAATTC CTTC TACATT TGTCAAATAT CAGTGCCTTG TTAA TG CTAAGATCACTA GGATAAATAAG GTTA CAACTG CTAAAATG CTACT CTTTAGGTTTGGGAGT CATC CATGTGTT CC CCTTTCTTATTCAG GAAATGGG CAATTGGTTA TTTT AACAGTGC CTTCT AAG ACTAGGTGGTTCTAGTTAGCTCTC AA ATTCTGTTGGGACTTGGGACGTTTTTCCTCATATTACTGCCTTGTGTTTTAGCAACCTGAGGTCACCCAGGTCCCGAG AGAAATCTTT CATTTT CTCCGT GAGTGCT CGGGGGATG GGGAGTT CTGTGATGATGGCAG CCTGTGCTGTTTCTCCTT CAAGAAGT CAGACAGC CACTTTGGACCATC CGTAGG CTATTCATTCCT TCAACAGTTTGACAAC TCAGAAAAGCTATA GCTCATCGTGAAGG CCT CAAAGGGAGGAAAAT CTCT CCAGAATAAAATGTAGACAAGACTATGT CCGTAT CAGGTCAA AGGTTGGGGTTTTTCCCTGTGGGTAGGAAGGT CAGCTCTGTATT TACCAAATAGTTATTGAATAAC CGCTCTGCCTGG CT CAGGGAACAAGGGTATTATC C C TG CTTAG ATGGAGTAAAACC TGTGCTCTTCGCCTGCCTG ACTGCTGTTAAG GTT GATTTATATCATTATTAAGACAACTGTCGGG G CTGTTG GGTG GCCTTCCTGGTTTCTC CACTGACT TG CCTTTTGATT TT TG GATG TCAGCT CCAGGACCTT CACTCTCCATGC CTAAAGGAAG GC CTAATG CTTTAAAGAAA CñTTCTGAGGCTC AGGTGATTTCA(N)xAGACTGTGTTGAGCTTAGAATTCCAGGTTGATAGCTAGGTGGTGTGAGAAGCAAAAGCAAACA GTGAAGCATCCGGGAGCACTTTACCTACGTGTATCTCATTTGCACGTCACAATAACTCA(N ) xCATCTCAGGGAGCAG GGATTCCTCTTTCTTTCAATCAGGATGTACTTCTACA(N ) xGATGTACTTCTGCAACTTGAAACAGCTTTGTAAGATT TGGTGGAG GATTTG CAGACCGT G GATTTCTAATGTATC CATGAAC CATATTTTTATTT CCAACAGGTCAT GGATTGA C GG CAGTCAAGGAAAAAGCAG GAGCCACTCTACGGAT TCATGGTGTAAATTCT GGñTCTTCTGAAGGAG CC CAAC CAAA TACTGAAAACG GAGTC CCTGAAAGTGAGTGAT GTGT CT CCTCTG GGTGTTCTTGGACTTTATTACACCATGTG CATAA T CAGAGGTTTTCCAAGTTCAGAT CAGCGACGT GAA CTCTTAAAG GATT TCTTTTTTCT CT TTAGTAA CAGATGCAGCC ACAGATCAGGGCCCTGCAGAAAGCCCACCCACTTCCCCTTCATCAGCCTCTCGGGGTATGCTGTCTGCCATCACCAAT GTGGTTCAAAACACAGTGAGTCGCTGGCTGCTTCCTCTCTTTCCCCTGTATTTCCCAGGGCCTTCTTTTGATATTTCC ATTTACCAGTGACCTAGATACAT CAGAGAAAC GATGGC CTTG CT CAGAACTTTGGATTATGGTT TTTTTTTTAATGTT TT CTGCTACAAAGT AG ACGT AACCTTGG ATTC AAAATAATAA GGGCAAAATG AAAT ATATTTTATGTTTAAAAG AAGG GAGCTGTTTG GAAGGAA CATTTTTCAGTCT GATTTAACAGTGTC CTTTCCAGCTGCGG CAAGGACAG CCT CTGGTTGA GGGGAGAGAGAACTCTTTATCAGAGCTGGGACAGCTGTTTTTCCTAAATGCTCTGCTTATCTTATCCCTGAGTTTGAA AAAGCATTAAACCGTTTAAATAAAAAGTACCCGTTACAAGACAGGAAAAACAGCCCAAATATGTTTCTGAAGTGTTTC CAGAGACCAG CGTTCTTCCCCCATCCCTACCTCTTCATTCCCACCCTTCCTCCTCCTTACACCACC CTAAGTGTAT CC CCCTAGAT CC CTGC CTA CCT CTTAGAAGAAGCT CCATATTAG CAGCAAAACATTGGTGAATAATGAAAAAAGGGGAGG CACCTCAACTGAGATAAGGCAGGTAGCTGGTCATCAGGGTCAGGGAGCAAAGA(N ) xAGGTGTATTTCTTACTATGTA AACTAACTAGTTTTTCTAAAAATATGGCTCGTGTC(N)xGGGCACAATATCCCAGAGGACTGAGGCGACTGAACAAAT AAACA CAATGTAATTCG GGAAGAGGTGAGGGCTGCGGT CAAACATGAGAGGATCTCAC CCAGAC CT TGGG GTCAGGAC ATGC CAGAAATATAGAAAAT GACTGGTCTTCTCCTTGCTATTTGAGGACCT CTAGACTGGGGTGGGGGAAGTAAGGfiG GAGCAGCAAATCTGTCTTAGGAGTCAAGTAGCAATACAAGTCAACCACCCCCGTGGGGGGTTTGTAGGATGGGAGATG GATGGAGCTCTGTCAGTACCTCTGGTATAAGAAATGAATTCACGGCCTATCTGAGCTACCAAGAGGGCACTATGAGTT TGTCTTCAGCAGCCTCACATTCCTTTCTGTGTTCTCTC CAAC CTTC TTGAC CCCTTTTTCC CAAGTTC CAGACACTA C CC CT CCAAGT CTfiT CCTTAACACAAATCCCTGTGTC CTTTCTTC CACCATTTTATTCT CAATAAAATAATTTTT CATA GGATGTTAGCAGGCACTAAGTCAAATTACACA GAAGTGTGAATTGCTG CTGT CC CATCAT CCCC CCGAAGTGTG CG CA CTGAAAACTGATTTACAGCATGATTACTGGTTGGCTCCCAAATCTAAAAACAAAGTCTAAATACCCAAGATGAAAATA AAGT ATATTAATTCTATGAAATTAAATACTTCTATTTAAATGAT AATT ATG AAG ATTG GAAAAT GC AT AT CAG (N ) xG GTGGGGAAAAACACAGATGTATGCATATTATAATTGCAGTTATACAGTTGGAGTGTAAGTATGAAAGAGTTGCAAAGA CATGTATG CAGTT C TGTTCACTTGAATAGTTTATG TGTGAG TAAGT CT GGG TTATTTTAATTTT TTAAAAAAAG CAAC AATGACTGACAAAGGGATTTG CAGGAGGATAG CTATATAATACACCAGTTTCTAAAGC CAGAAGTA CAAGGAAATCAA CCTGTCTCATTTCATGGATATACTAATTATAGG CAGTTATTG CTAGTTTCTGAGAGTATG CAAGAACCAT CTGAAACA G C CTGAG G GT CTTACC CTCCAGGGCC CTAGATTAAAGAATTC CCTAAATTAG CAG GACTGTTTGA CATTTTAATAGAC TG CT CCAAGCAACT CTGTTACTTGGGTTAAAAAAATTT TTAAA CAGAAACAG CAATTTAT TTTA TGTTAATTTTATGT CTTAGCAATTTATTTTATGTTAATTTTATGTCTTAGATACACATCCATTCATACACAGAGAGCCAGTAAAATTAATCC CTTTGTGGGATTAATTTTATTTTATGTGCTTTGGTTTTTTTCTAAGCATCAGTCTTTAATAACATTGTATGTCCTACA AATTATATAACATATG CTTT TC CAAGGAAACGAGAATAGGAT TAAATGGATGATAAAT CT CCTGATGAGAAAAAATTG TGAC TTTATATTGATT TTAATATT TGAAGATAGTGC CAGTTACCTAG GAGTTGCTGACTG CTGC GAAACAG CAAAACC TAAACATC TAATCT CATGCAGTTATCTCAT CT GCAGGGTAAAAGTGTC TTAACTGGAGGC CTTGATGCGTTGGAATTC AT CG GCAAGAAAAC CATGAATGTC CTTGCAGAAAGT GAC CCG GG CTTTAAGCGGACCAAGACGCT CATGGAGAGAACT GTTTCCTTGT CTCAGGTTGGATTATATACGTT TGCAATTTTT TCTTTCAGT CAATAAA( N ) xCTTTTTGAATAGCTCA TTTAAAGCATGTGATTCAAATCAAATGACACCAAAGAGCAGAGAGAGAAAACTGAGCTTCTAATACCCCCATTTCCCC AGTCACTTAATTCCTCAGAGGAGACAACCAGTTAACAGTTCCTTAAGGATTCTATCAGAGACCGCAGAGAG(N ) xGTC TTCTCCTCTCACACTTCCAG CACTAAAGTTTT CCAG CC CTTCTTGCAAAACATAAAACAAAGAATT CAAT CTGTCTTT GC TTACTTACAATAGTTAAGAAAATTGGAAATAATATTAGTCTTGTTTT C CAAATATGAC TC TAATATTAAACTTACT TTTCCAGATGGAACATGCCTAGCTTACCCCACAATCCAGGCATGCCCCAGTCCACCTCCTCCACGAGAGACTGATACA CCAGGTCTCTGTGCTTTC CTGAAATCAGAAGCAGCT GATG CAG CAGTGGATTTT GTAAAC GT CACAGG CAGGACTTGG TACTAGGAACCCATGAAGGTTGCCTTCATGAACAGCTATCACTGAGAATATGTTCATGGCAACCCCTAGAAGCTTTTG TTTTAG GCTATTTCTT CTTGTCTCTT CTGGAATAAC C CAAGTATGAGCACTC CGATGTTATAGT CCAGGGAGGGACTT TG CAG GATCATCTAGACCATCACC CTGCTG CTCCTTGAAAAAGAAAATAGAT{ N ) XGCATGAATATTCCTGCTCACAC AGGAGCTCGTCCGAGGGGGCTCTCATGGCTAGCACACTTCTGAGCCCACATCTTCTGGCCTCATCATCTTGAAATGCC TTACTTCCAGCCTCACAGAGAGGGGATGTGGAAGGC( N) xTGAAGGCTGGATTTCGAAGTCCAG CATTTATGCTCCCA GGAGTAGGAGGGAGGT CTGCAGGTTG CACTGTCACACC CT CCACTGGAGG CAATTG CACTTT C A (N ) xAAGCGCAGGG AATACT CTTTAAATAACACGGGGACAG GGGGTCCTTGGATTC CACACATGGAAGT CTCCATAGC TT CT CTGTGT CAGG AATTCTTCCAGGCCAGGATCATTCCCTGCTATTGAAACCCACTTCCCTTTCTTCTCTAGTGGGAAAATAGAACCAGCC TCCGTTGTCAAGCTGGTTATCCCTCAGGCAGCTTTAATTTCACTCCCCAGTTGTG(N) xTACTCCCCAGTTTTGAGTG AGTC CACATCTT TC CAGAGTGACC CTTTCCCTTTCTCT TCAATTTCAT CCCTTCTTTTTTCTCTCC CGAAGCCCACAT TCGGAG GTGGAAGCTGGTGTAGAGAAAGGAAGTGATTT CAGTGTCACTTTGTTATTTTATTTTACCAC CACCAAGT CC TGAGATCAGTGCGGGTAATTAGTTTTTATTGCCAGTGTTAAAATTTTCCAGTGATGGTTTCATTGGGTTGTTTGAAAA ACTTTAAGATTCAGAGTCTGTGCTAATTTAGTCTTAAAGACAGACAGGAGAGAGAATTCTCTTTGCGGCCCTGTGGGT TCTT GG CAGAT AAC CT CG AG AACC C AG AAGG AAATAAGGC TG ATTGTT TTTCTGGC AC AC AATAAAAACCAAAGTC TA TTCTTCAGTTGTCTTCCC CTGTGACCTTGTGGTGGTGG C CAGTGTCTT TACAGTGC TTTC CACAGAAATTAAAGAACC CCATA CACGTAATG CCT CAGTAAACTTAATGAGGAATCACATAAAAGATG CTTAGAATCAAAAACG CAAGTGCAG AAA ATTGAGAGTGACTGATGGA CTTGTGT TTATTTGTTGTCTTTTTGCTTAA C CAACAAAAAATAA CTGGAGAAAAGTTAT TAATGGGGAGAGGAAACTTGGTTCACAGTCATATTTCTGGAAACCAGACTTAAGAATCTATAAGCTAATG{ N ) xCTAA GTTAATATTCTTAATTGCAGTAAG TATATGATGACAAATACATTATGAGATTATGTTTATTAAT TTTTAAATTGAG GA AATT TAATAACCACGTACAC{ N ) xTTATAACTTAGAAAAAATCAGGCCAGCTTATTTCTGCTTTGACGTCTGTGTAGG TGCAGGACTTAAAGTTAATCCTGTCATAAGAGATTCAACTTTTCAAGCATACA(N)xCCCAGGCATTGAATTTTTGCT TTGTTT TGCCTTTCACATAATGAACATGTAAAATGT GAAGTAGGATATGATGTAAAAGTTAT CAATAG GATCAATCAA AAAAAT CATTTT TTAGAAATTCTAAAGTGAC CAGAATCT C CC CCTCTC CAA CACAAAATCAGTC CTCC CAACCT CC CC CATGTCTTAC AC AGGAAACGGTGC TTTT AAATGT ACAT CC AG AGTAC AAACATTGTGT AT TT CT GC AG ATGTTAAG GG AAGCTAAGGAGAAGGAGAAGCAGAGACTGGCACAGCAGCTCACGATGGAGAGAACCGCGCACTACGGGATGCTGTTTG ATGAATATCAAGGCTT GT CACACCTGGAAG CCCTG GAAATTCTGTCCAATGAAAG CGAAAG CAAGGTACTTCTG CACT ACTCGTTTGAAATGGCATGCTTAG TCATGTGTTGCTTAAACATGAAAGAAAC TGGAAAAT CTGC CATTAAAAGATCTC AT TTTTAGAGCCTCATTATTATTGTG CTGAGGTCAGAAATACTGTACAATGG GATGACCCTCGAG CTC CATATCAT CT CTTCTCTGTCCTATGCTAAGCTATTACCCCACCCCAGCTCCAAAAAGGCCTGCACAGTAAGAGCCACCACTATCCCCC CACCTCATGGTG CAAAAGGCTGCT CAGAAATGTGGCTGTC CAGCTGCTGG CCCTCTGCCTCCCT TTGGTAACGAAGTA CCCAGGTCAGAGATGCCCTGCCAGCCCACATCTTCATTCCTACCCAATGCAAACAAAGCACTTGATGTTCCTTTTAGA CTTCAGGGTTTTAATATATTTTGT CTTATGGGTTATATAGTTATTTATTCAG CCAT CTTGTTAGGACT CCTGGG CCAC AAAG G G CAGGATTATCTTACTGAAGGTTAAGGTTCTATTAGGGAATGCT CCCGTACACACAC CTGGGAAACCCAGC CT AC C CAATGGCCCTG CACATTCATACT CCTC CACTTG GT CC CAGGAAAACAAAAGATGAATAG CAAAAG TCAGCAGG GT TTGGTGATTTTTTTCC CTTAAAGCAT CTATAGTG(N) xTTATTTAGCAAGCACTTAATGGCACTTACATAGCCACTAT T C TAAATACCATAAAACTGTTAACTTAT TTAACTGTATATATGTGGCT GT TTTGTGGATGTCAGGC TGACTGTG CTCC GGAGTGGTTTTCTAGTGGAGGCCACCAGGCAGTACCCTCTGGGCCTCCAGAGAGGGAGGGGCCGTGGTGACATTCATT GAGGAGAGGCAGGTCCCTAGACAAGAAGCTCAATTTGGCAGTGAGCCTTAGAATGGAGAAGTAGTTTTTGACTCTCCC TAGACCTTATCCTGTTATGTTTGGACTTTGAGCCATTCTTGATGCCTGTCTAATCAGAGAAAGACAACATCTGCCACA TAACAG GCAACCAAGAAC CAAATAGGATGGAGACATTG TC CC CTGCCC CAGATGATGGAG CTA CATGTTACTCTGT TT ATTTCCTTCTTTGTTATCCATGAGCCTTCAGGGAAAGTCTTGTGGTTAAAAC( N } xGGCAAGGTCTGGAGTCAGACAT TATTTAATTAAACACCATGCAGA(N) xTGTACACATGTTGATGACAGTTGATTTTTATATGCGTTCTTATGTCATAGA CATTTGTTGTTATTAAAAATATCACAGGGGAGCGTGGCCCTGCTGTAAAAAAAACATTACAAGATATCATGGTGGTGG AGAATGATTCCATTGT GGTGACTT CCAT CACAGCTCTCAG CT GTGAAACAGC CC CAGGGC CCTG GAAAAGAGCC CC CA AGGACT TAGTTT CTAG GAAGAAGGTT CG GC CTTTGGAGGGGAACAGTTTG CC CATTGCCT CCACAAGCTCTGCTTT CT TT CC CTAGCCTGGAGATTTTTGTGATAT TCATAGTAAGGAAAATAAATTTTTGT CTTGACTTAGTAATGATAGTTAAT GGGTACTATAATAAACAGTCCTACAG
> H S 4 _ 38284097 -38313 29 9
TTTTTGAGTAAG C CTGTTTCATTG CAAGT CATCATTGC CAGCTTCAGGTGTTTGAC CCAAAGAAGGATATTTGTATT C CG CTTTGAGCTACGGC TCTACTTAATTAAGTAATCTTAAGTAGATGCT CCTG CCTCAGAAGG CCTC CAAGCCTCTT CC TTGTAAAAGGAGTTGATTGGTCTGTGTGATTCCTTAG G CCTCTCCAGCT CTACAG CCCATCCCCATT CAGCCAGAGGA CATG GCG CTTCTTCGAGGT CTCTG CCAG GACATTAG TAGG CAGTGGAAAT CTTC CACCAAAATT CAAG GGGGATTTTG AATATCAGAAGATGGAAAATGAGAAT CAAT CCATCAAATT TATTATTT GTTATTAATTAAGGGT GAAAGAAAAGAGTG AAAGTACTGACCAATATG CATCTGTGAT TT CAATTGGC CATGTTGTCACAGT CTTCTGTTTC TGTGTAAAATTAAGAT TTGC CTTTATGT CAGCTGCTGGGTAT T (N ) xCAAAAAAAAGAAAAGCTAACATTTCTACTGACTCTTCCTAGCTGATT TGTGAAAACAAGGTGATGGGCCAATTTGGAGAAAAATATGGGTAGAAGTTGGTGACCTGACTTCACTTAGCCAAGTCT TTTATTCACTGACCATATTGCCATCATCTTACCTGTGTTCCAGAACTGGTTAATTGTTACAGTATAGTCCCAGGGTTT TGAAGGAAAAAAGTGGGAAACTTG CTGCAAAGGCAG CCAGAT CTGTCC CCACTCTC CAGCTC CCTG CGGTATTTTAAG T CAG C ATCTC AG AGGAG GAT GATGGT CACCAT CC T A { N) xTGTTGTCAGGCATAAAATAATGATTCTGTCTTATTTCT GAATATTTAACT TATACGTAATTT CACTCT CACGTTTTCACT TC CTTACAGAATAT CTGCTT CT TT CAAGAAGTAT TT TAAAGAT CAATG CTGTCATATT CATT TATTTTAC TATAGAT CAAATTCTATG GTAACAAATAAGATTACCAGAAAT CC CCAGG(N)xCTTTTTCTTTCAAAATGCTCTATCCAATAGCCCAGAAATACATCCAATTTAAAAGAAAAATAAATGCAT T CAATTTAAAAT CC CAAATGAACTTG CTCAAT TTTGTGGGGAGG GGGGTTGT GTTGAAATTTAATTTCTG TG CTATTT GGGATTCATG CTACTTCTCAGAAGAGACA CAGTTTT CCAGTAGAGATAAACGAC CT CTTC CAAGAACAAT G GAAATGA AAGG GGGATGGGGGTGGGGTGGGGAGTGCTGTGGGTAGAGGGGCAGGAAG CACAGATGAA TTGAAAAGAGAAGAGTTA TAAG CAGGTT GG CT TCCTTTAAAAAGATGGGATAGG GTCT C C { N) xGTTAGGCTATCTATTGGACTATCCTTCAAAGA GCGG CTGTGACT TAGTGTGAAAGCAGATTTTTAAAAAATATGAATACAGTGAGT CACAAAATTCT CTCAC CAAAAC CA AGATATGAAAAAAAAA{ N) xCCATTTAACCATTCAGTCTATTCTACTGGGTATCATCTGGTGGGGTATAAGAAGTCTC TTTAGATAAATAATGTTTTCAGAGCATGCCTTTTTCTTTCCTTTCCCAGGATGGTGCATACCATGGGGGATGAGGGGG TC TCTTCCCTC ATACT( N ) xAGGGGCATGAGGGCTCCTTCGGTTGTGAGTAACAGGAAATATGGCCTTGGACTCATCC ATTCATTGCCTTCCATCACCTCCACATCATTATTCTTCAATGGTGTTAAACATGTCCTCAAACTAAAGCCTACAACCT CCTTGCATGAGT CCTGGAGAAATT CT CAT C CACTGT CTCATC CCAGTATCTTTTAT CTCC CC CAGAGAGT GGATGTTT CAGAAGCATC CCAGCCTCGTCTTGAGTTTTCTTCCCTGAGCTGCTGTCTCTGACCTACTTT CTTTACCAC CATCTC CC CTTCTCTCCCCGTCTTGGCT CATAAGTGAATG(N)xATGGCAGAAATTTGAGGAAAATCAG(N)xGACGAATATGTTC TTATTACAAATACT CAACAACACAAACATATATAGTAAAAAGAAAAATTCTGTCTTGCCTTACACC CCATCCTACCAT CATAATACCCCATCCTACCATCATACTTTC( N ) xGTAGTTCAAGCATTGCAGATATGTTTGCTTCATAAGAACCTAAT GTG GAAAGTGATGG CATTC CATAT CTGTCTTTTGACCAC CTGTAAACATGAAAAA CAATG CTGT CAGGAG CG CCGT TG CCAGAAGCTGAGCTTCTGCTTTAATTTTGGTCATTATTCTTCATTTTTACCACTTAATCAGCTCCCCTGAAATGGCAG A G TTTTCAACTTAAATTTTTCTTCTTTAAAAA C(N )xATCñACTTATATTTTTCTCAATC AAAAAGTACCACAGTATA GATAGAATAAGG CAAATTC CAT GTGTTAAT CAGTAT GGAC CTGT TTTTAGTATCATTATT GCAGAGAATAAAGTTGAA ACTTCCAGTG CATCATC CTCAAAACTGCAGAG TGGT CTTTTAGTGGGTGG CAGAGT CATG GCAG TGGGTT CT CT CAGT GGTGGTCATGCAGGACAGCAGCATGCCCAAGTTACCCCCATTGTGATGAAGACATGAAATGCTTGCTTCCCTGCACTG TACAGGGCCTCAGTGGAGAACAAGCTTGTGGCTGCAGGCACAGCACAGCTCAGGTGCCAGTTGTCTTGACACGGGTTT GT CG CTACCCAGTTTAGTGAAT TCACTACTGG GGAGACTT CATG CC CTCCTTTCTG CTTC CATT GC CTTTAC CACCTC CGGCTAAACTATATTCTTTCTC( N ) xCAAGACCGTGAGATAAGGCTTGAAACCTAAGGAATTTCAAAGAGGGGGCCAT GATGGGCATTCAGCGGCATCTCAGGAGGGGGCAGAGTAACCAGCAGGTGCTTTGCATGCTGGTCTGGTGTGTCTGGAA AAGGCATCCCGCTAAAACGTTTGTCAGGAAACTTCTCATCAGCATTTCCAGCATGCAGGAGCAACCAAGAGAGCCACT TAGATACCATTTGATTATAATGTG TTTCTCAGAGAAATAAATAG CC CAAATGGAAG CACACTTCTGATTGT CAATT TT
t t a c a a a c a c c c t g g a a c t c a a t g a c a g t t c t c g a g g a t c g a t a t g g t c t g g t g a c a g t g a a t t g a t t g t t c c a g c a g AGAT CTTTAATGAGATTGTT CACACTATTCAGGCAC CACCAT CT CCAGCAGATTACAATGACAATGG CGGG GCCTTGT
g a a t a t a g a a a g c a a a g c a g c c t t c c t g c t c a t t t c t t t t t g a a a g c a c t a a c c c c t a g c a g c a a g g c c t t c t t g t t a a t t a g t t g t t a a t t a t t g g g a c a a c t c g t g a a a t g t t t a t t a a a g a g c t g g t c c a g t g g c a a a g a a a a g a a c c c a g g g c a a g a g a c a g a a t t g a a g g g t a a t a a a a g g a a c g a a t a c g c t t t t a c a c a g t t c a c t g g c a c c a a g g a a a t t c t a g c a a c t c a c t c t t c t g g t g c a a g a g g a g a g g t a c t g t a g a a a t g c a t a g c a a a c a a a c t a g a a t g g a t g a a a a t g t a a c a g a c a c a t ( n ) x a t g a c a t g t a t g g t c t a t g a t g t a t a t a g t c a a a g t a a c a c a c t t a c a g g c t a t g a t c a a a g c a g t a a g c c c a c a g a t t t a g a t t g t c t t t c a t g t a a c t t t t a a c c t a t a t c c c a t g a c t g t a t a a c c a g a ( N ) xAAA ATTTTTT t a t c t t c t c t t c a t t t t a a a a a a a a t a t t c t a a a a t c t a c a a a a a t a g a a g c a g t c c a g g c t t c t c g a a a c t c t c g g g GCTCTTGTTCCCTGCTCTATATGCTCT CATTGTAG CATGTCACTTT CATAAGTGATTTAAGTTGGCA CAAT CAGGT TG TAGCTTTGATGATGTTACATTAACATTATCTTACAACTTTTGGCTTTTATAAAGGATGACTATTTTTTTTAAAGTAGG
a g a t t c t g t g t t g c a t a t c t g t a a a g c t g t c a t a g g a a c c c a g a a c t t t g c t a c c c a t c t c c t g g g a g g t g t a g a c a a CCTG GACAC CAGTC CTG GTAAATT CTTAGGGAAGAG CAGTTAGTATTTT CGT TCT CTATACTAG C CACGC CGATTTAT TTGGGTGGCTCTGAAATACGCCTTCAAGTTCATTCCTATGCACAATCATCTTTTAACTTAATATAGTTACTTAATATA GT TGACTTAGTACAGTAAAC TTAA TACAGT TAAT TTAATA TAGT TTACCTGCAATA TAG CAGGTAAAAAAAT CAGAGT AG GGGTAAAG TAAGAATATTAT TAAAAGCATT TT TATTCT CC CT TGAAGAGTTG CT GTAAAATGTTATCT TTTCATTC CTGCTTT CTG CTTT CAGAAGTGATTT G GAAAGTATTGCTG GATGATGCCACTTT TATACTGAAACTATGTAG GTTC TA TGCAATCTATGATAATTGAAGAAAAAGATAGATTCTATTCCTTAGGAAACCCTTCAGTCCCTTACACTGGCCCTGTTA CGTGACG CCAGG CATGCAG T CTTACCTGATGCAG CTAGTACTTCT CTCTGTCTCATTCCTCTCC CAAGTCACACTGTG T A G T C T ( N ) xACACTGCATTTTAGTGTTAGGCATTTTAGAAGCACCCTTGAGAAACTCTGATCTGCCCGCATTTTCTT A C CAGTCCCTTGACATGGAT GCATGTTTTTGCATCCATCCAA GGGACCTGGTAAGAAGCCTCTGAATTA CAGTCTTTA TCTGTTTATTTGGGTGACTCACTC CTGGGATC CCTG GGGAGATG CC CCCT CCTCACTCAG CCTGTCTCCCAG GACCTG CCTCCACTCCCCTTGCCTCTGG CAGGGAGAAGAAAAGGAG CCTGGAGAAGGG CAGA CCCC CAGCAGGATAATGC CTAC CAGTGGGACCTTCCTTCTAGGCTCACTGCATTGCCTATGATTATTTTTTTAGAATAGATTTAGAAAAAACCCAGGCCA GTTGGGGCCAGT TCTGGTGATT CTGGTTTATGAT CAAGTGGGAGAATTACTGTT CAGGAT CAAG CTACATATAACTAT AACACACATCACAG CTGGCATT CTGG CTTGGGTCATAAATGCAT CCGTGG CATTAGATGCAG CGTT T CAAAAATACTC T CTG CAGTGATG GTAGATATCACAGC CAGG CTGC C C TGAATGAACT CACAGGTTG C TTACAACTTG CTCTTGTT CCAA CCAGTCGGGCAG CCATTTTTATTGGCTGAGTGTCTCATTGATTCATTGGTACTCTGCTATGC CAGTTGTTAAATAC TT GTACTACTG GAATTAATATATAATTTTTCT CCTGAAGGA C GAAACT CCTGTGAATATCT CAATT CAGAAT GTTT CT TA AAAATCTAAT CT TGTGAACTAATAGT CAAT CT TTG CAAAT GATGTAAGT CATAT GACTCACATT CTAAGT GAAAAT CT A CTTTAAGCCAGGTTAATATTGGT CCTACATTTATT C CAGGTATTAG CTTATATAAGCATGAAAT CT CCATG CTGTTT TTTCTACTGTTGCAAGCATTCACTTCCAAAGTATGCATAAGGTTGGAAATTTAATACTCCCAATACTCCAGCAAAAAG ATGC CTTTAACTTT CTTGGC CATATCTTTAGCTTGGTTTC CTACAGATGGTACT CTGCTGATTCATTTC CTGATGGTG CACT CAAGTTGATTGTTTTATT CTTT CGAAGAATGTTATAAATTGCAGTT TG CTTAGAGT TGAATGATGATGG G GTGG GCTG GGGTTTA CAGTTGAGAAG CACC ACTTTAGC TT AT AATAAG ACGC AT AT AATT T C AATT CACG AAATAG GCTGGG GAAGTATTTATAAATAATTATATGTCTGGATGTGTTTGTATATATAGTTACATTTATTTTTAAGCATACATAG(N) xC ACA CATATAGTAATG CAAAAAAC CACTATACCATTAAAAAATTT TTAACTTAAGTT CCAATT CTG GGGATACAAGG{ N ) xTTAAAAAGTAACAGTGTATCATGATAAAAATACATAGGCACATGAATGAAATATAAATACAA{ N) xGCCATCTTTG AAAACAAT CTACTACAAGTAGAATTTGC C CCATT CTCAC CTTTTAATTAGTAGATTATCACGTCAT CATAAT CATCAT TTTGTGCTAAGACAAATTATGAAATTAAAACCAAATCACTTTAAAGAAATACTTAGCATTTTGAGGAACACATTTATG GTGAA CATATCACTTTTTTC CGGCTTTGGCTTAAAGCAAGTTA CATAGTTGG GATT CTATTGGATT CTTTCTTGCTCC CTGGTTCCAAAAGACACCATAGTAATACTCCATACTCTTATAGCTTTAGGCTACTTGCTGTAAGAGATAAAAAACTGT TT T C CCACTTTCAAAT CAC (N ) xT T T CACATTTG CCTT TAATTAGAATTCTGTATTGCAGAAATTCACACACATACAT ATG CTCAC TATGCTGATATATGTAGA GT CCAATGAGAATAACAATAATTT CAGAAATGTCTGATTG CCAAAATCAGAT ACTATTTTTCTTAG ACTTCAAAAATAGCGTTG CC ATG ATAATCT CT CT TTTC AAAG C ATAGCTCTC ATGCATTTTGCA AAGTTATTGGTGAT CCTTAAACAATCAT TAAG TAAGTGACTGAATCTAATGCTAGG TAAATATTTGTTGTTTAG( N ) x ATTTTTCCATGGACTCTAGCAC CAGG CCTGTCTCAGTG GACTCTAATACCAGGT CAGCTC CTG CAACC CCTGGCTCTA GGCTCACCCTG(N)xATCAGCTGCTTTATTTATTTATTTAAACCTTTGTTTTACATTCAG<N ) xAACCCATCCCCCAA AATTTGTG GTTTAGAGTTAG TATGAATAATGC CAATG GGTTTTCAGTT CATAAAAGAGTGAT CTGCTGACAAAAGAAT GAGGGGAG CAAGTTTTAGGAGATGTGTTGACT GAGGAAGT TTCTTCAGAAGGAG CTGCTGTATATT C CATTATGTTTT TTATTCTCTGTTTGATCTTG CTGTCCTGTGGAACTTTGATGTAATTAGGAAACACATTCC TCAGGTTAAATG TGAATA GGATGGGAT(N ) xAATGTACTAGTATGAGGGACATCATTACATTTAAGGAGCCTGGGAACACGGTTCTCAGGCTTTCG GT TT GGTTGT T CTG CACAA(N ) xTCTTCTGCACAATTTTTTAAGTGGGGGGACTCTTTTGAAAGGACTTCAGGCTGAG ACTT CTGAAAGCATGTAGTGAAGGGCTGAGG GA CAAGAC CAGAAGGTGG GGC CTAG CCAAGG CTTTGTACTCGCATTC TTACACTCTC CTGC CATTGT CC CTGTGATGAAATGAGCTT CATACATATAGTTT GCTTGAAAATACAC CTCATGGGCC AGGTGCAGTGGCTCAT( N ) xCACCCCATGAAGTGTCCTATGATCAAATTTCTGAGACAGAGGAGAGATATGGCATGCT CCATTGTGATTAAAAAAAACGTGGCTTGTAACCTTGGCTTCTCCATCTGTCAAGTGATGACAATCAATGCAGCAATAA CTGCACACCTCTCTCACTTCCTCTCTCTTGAT CAAGGAAGACTGTGAGAATTAAGAGTTGAT CAGAAAGCAAAGTAAC AATGATATGTTACAGAGGAGTGAGCTTG CAAT GAGATCTGTGAAATTT GG CAAAGCACAG GAAGTGTC CAATT CAGCT AGATTGGATTGGTG CTAAGG CAGTTTGAGCTT GAGGTTGT CCAGAATC CTTT CTGCTAACGAATGC CAACTACAAGGG GAATCCCTGCCTCACCGTTTGCATATAGATTTAGGAGGAAATGCAGATTTGCTCATTTGGTTTCTGTGCAATTTTAGA GTCAAAATGAAACTAGAATGTGAATATTAGAGTATGTGAACCTTCTTAGTATAAAGCTGGATTCCCCTCTGACTTCTG TAGCATTTAGGAAATT GTGC CTTATAATAGCAG CAAT CAATAATACAT TGTC CTTT CATCAAGTAA CAACAATGACTG CTGATGCT C A GATC CT AGTGTT CTGT AG AATCTAG CAACC CATTGTTTGG AT AC AAGGGTGC ATGAATTAAC AATATT CACTGCTCATTTTCTTTGTGTTTGCATTCTAGTCTTCTGGAAATGGATTTACTAAAAAAATACTTTCCAAAAAATGAA GTGTTGATTTTTTTTCTTTGTCAGGTAAGAGC CATAATACATGT TCAT CATTTT TAAAAAATTGGAATAG{ N) xGATG AGAAGGAAGA CTAT G CAACC CAAACA CAATCATTGTTGAC T T G T A T A (N )x TTTATAAAGTTTGCAGTGAAAAAAATA TGTAATTAGGAATTATGTAAATGGATGAAAATATAGCAAAGAACATTCTCCCCACCATCACCAAATGC( N) xAGACTG AATTGTTAGGTATGAAATACAAAATGAG CGAAGATGCT CC CAAAAACAAAATAACGAAACAAAAATAAAAAC CCAAC( N)xTGATTCTACAGCAATCTTTTAAATTTTTGAGTAAAAACTCAATTGATATATGTTTGTATCTTTAAAAACAAATTA TTTTGGAAATCCTGAAGCC(N)xGACAAGGACTTCTTTTAAAAAGCATAGCCATAGCC(N)xTCTAAACGTGTCCGAT GAGGGTTTTTTTCCTGAGTAATTTTAAACCTGTAACTTCACAAATATTGATGTGCTTCAGTCTACTGCAGTTATTGAT GCTT GAAC TGTATTAT CTTTAC CTGGGTAAA CAGAACTT CCTCCATTTAGTTATTG GGTACTTGTAA CACTACC C CAT CATATTCTAGGCTCAACCTGTTCTATGTAATATTTTTAATGATATCTCTCCATATTATCTTTTGTTCTATTTTTCAAC TAGCTGACAAACAGCTAGGAGGCGTGGCTGTTG(N)xAAATAGTAGTTATTTTCATTAGGTTTAATAAGACTGCTTTA GTGT GTCCTCTTTCTAAATTTTTCTAAG CATT ATTG AC ACAGTTGTGATT AT AATTTACATTTATTTG CTTC CTGACA TTTTTTCTTGTAACAACATACACATTATATATGTTCCAACAAACATTTCAGAAACACAAATTCC{ N) xTTTGACCAAA TTTATTTCCC CATGGTTTGTAATCGATT CACATT TACGTT TTTAGTGTGGAC TGAAAAATGT TTATCCACTG CCAATT ATTTTTCAT GCTCTTTTATACAATCCGAT GACAAAAT CAAGCCATTGAAATTGGTCAGCGCTAT TGAAAAAATGGTGT AGTATGGT CACAGT CT CCTAGCAGTCTATGTACAGCAATGTCAG CCTAGCTT CGTTGAGT CAAACCAGGGAGAT GGAG GGAAATTTTGGCTG CCACAG CC TTGAGTGCGGTG(N ) xCCAGCACTCTGAGTAATCTAGTAATTGGGTTCTGCCCTCT CATGAGCTGA CAGGAGA CCACAATCA CGTGAAA CAACCAC CAAAGATG GTAACAG CTCAG CAAGAGTGACTTGGGCTT AACAGGTGCAGAGTCC TGGCTTGGAAAGGGAG CTTGGCTTACCAGGGCTCTG CTGCACTCTTGAA(N)xTCCTCCCAG AAAGAGCAGATTTTTGCCACT(N)xCATAGGTTCCTGACATTCCTTTGAGTGAGAAGCCTTGATCTGACAGGAAGACA GCTAGAGAGTG CTTAACTGTTG CTTCTC CAGAGT CAATTC CTTGTGGAAACTTT CTGTCC CTGCAC CTGTGAAATGTG CTTAATGGATTTTGGAGACCTACGGTAAGATGGAAGATGATAGTCCTCAAAGGCCAGGAATCCCAAGGTGTATTACTT GAGC CACT CCTAAACC CAACAAAATTTC CCTT CTAAATTG CTAT CCGCTC CCAAATAAGTAT CTTAATTTAG CTAATT
T (N ) xGTGCATTTGAGCCACCTTTGCCTCCTACCCCTTGTTTTGGGTAGGGAGTGGGTTTCTTCCTCTTGTGATGGGA ATAT TTCCATTTAC CTGTGT TATACAGT CTTGGTTCTGGG GTTTGCATGCTGAGTGTAGCAGGAAGGGAGGAGGAAGC ACTTACTTCTAAAGGGCACAGAATAGTTGCCACCTTCGTTAGCTATTCCTTTTCTATTTCCTATTCAC
> H S 5 _ 9729109 -97418 77
GAGAGAAAGAGCAGGCAGGAAGAAAAACCAGCATCCAGCTGTGCCACAGATGTGCTTCCACCTCTGTCTTGCCCTCTC AC C CT CGC CTTTAATC CCATTTTATAAAAACT CATACT CAGAGGTTAG CTGGGTTGGACC CCAGTCTGACCAA C CCAC AG C C CAGG CTGGCCACAATGTG TGTGTGTGCT TAGCAG GGGGGCAGTTTTTGGG GACTGCAAAAGGAGGGTCTATAAA GAAGGACCTCAT CT CT CACC CC CTAGGGGCAG CCTCTATGG GGGAGGACGATGGTCCCTGACAT CATGGGACAGTCAC GAATGAGGGTCTGTCCTGGGTCAGGCTGGTGGGCAAGTTGATCTGGCCTCTCCACAAGGCTGGCTCTTGCCTCCCCCA TTAGTCAG TAGCACTG CTGCTC GTGGCTCTTTTTAGTTT CTGCCTTATGACCATATTTGACACACTGCCT C CAACCTñ AGGTCCCTATTTATTCTTGGGCACACGCCTTGGTGACATGCTACCACTGCCCTCCCTGCCCAGAATTGTGTTTGGCAA GCTGTG GAGAAG CTT C CAGAAT CCAAAAGAC C C (N ) xT GAATGGAGACAATTAGTTTAAAGCAGGCACGCGTAAAACC GGAGGCAG CACTTAAATAAGGCACAAACATGC CTTTAAATGTTGTTTTAAAGT CAACCTTTACTTT CATO CT CAACGA CTTAATTGACAGAAACAACAAGTGTATCTGCCAATATGTGGGCCGGCTCAGTGACCCATCCACTGAGAAGTAGAAAGC ATATCATGTTTTGAGTTACAGTTTATTGAGCATTAAGCTTTCTTCCTGTTTTTTCTCACTTTTCAGAGTGATATCCAC AGATAT TTT CCTATAACTGTGTGTAACATGACTGTGCC CAACGT TAGACTGGTGGAAT G GTAGATAAAGGG G G CAAGA ATTCACAAAAGGGAAGTCCTGGGAAAGGAAATTGAGAGTTTATAAATATATGAATGCCAGCTCATTTTTATTTTTTAA ATACTCTGTACAACACACCAACATAAAATCATAGGATATAGTGGTGATTGCATTCTTATGCATATATCTTTGGGTATT AATTTTACGAAAGATATAAAGATGATTT TGATGCAGATGGATTC CACATGTTAC TGAGAACATGAAAAGGAGGATCAG AGGGAAGCAAGAGAAAAATTCCACCTTAGGCTCTTTCTTGAAAGGTGCAGTTTCTCCACCATTGACATCTCCTTAGAC CT CACAGTGAACTGG CAAGTAG GTCCAGGGAAGAGATTTC TATAGTATCTTGGCTTTGAGTAAAAGAAAATGGACC CA AAAGCCTCATCT T CACAAGACAACCTTT TGGAAATAGATAGAAG G CTTGATACCAAATGTGCTTGAGGTGTAGCTC CA AAGCAG CAAATAAAG G CGCTGC CACACCTGTG CAGGTG CCACAC CCGTGCAG GTACCATC CAGTGGGGGC CTGGGG CA TCGCATGTCCCAGAGCACCCAGGCACAGGGAGGCTGCTCAGCAGTAGGATGTGTGGCTTTTGAGGACTCAAAGACACT TAGGGGGT CTCTGC TT CTGCATTTC CAAGAGGACAGATAG CTGGGG CAGGTG CACCAGCCCACT GAGACAC CAGGG CA TGAAGAGTGGCA( N} xTAGCACATAGGTGGACTGAAATCCGCTGTCCCCCATGAGTGTGTACCGACAGGTATGCCCTC CACCAGCTGAGCACACCAGCACCCACCTCCCCTCCCTGACCTGCTCGCAGCCAGCCCACGCCCAAGCCCTGCTGACAT TCATCTCCTTTATCTCTTAGCCTGACAACTCCCCTCCTTCCCCACCACACTCCACTCAAAGCAACGTCATCTATCACC TGAACCTCCTCACTGACTGCTCTGCCTCCACTCCTGCCTCCCTCCCACCGGCATCTGTCCCCAAGCTGCATCCAGACT CCAGGGAACACAAACTCAGCCGCGGCACCCCCTTCGCTGCCTCCTCCTCGTGTCTTCTGTTGCCCTCGTGGTAGGGAC TGACATCCTGATATGGCCCATGAGGCCCCCTGGAGTCTGCCCCCACCCAGCGGCTTCCTCACCATCACAACGGATTCC CCTCTCTGAGCTTCTCCTAGGTGGCCATCCGCC(N) xGCTTTTACAAATCATGGTAACCATGGTACTTAGTTTAGTTG TTGACATGTAGC CAGOACTTTCTTATTCATTT GCTAGT TATGGT T A A (N) xGTGGCTTGGATAGGGCAGAGCCATGGA CAAGGGGAAGATGGAAGGAGAT CTGT CC CAGAGG CAT(N)xAATTCTGGCTTCCATAGCAGGGTGGATGGTGGCTCAT TTACTGAGGTGGAGAAGACTATTGAAGAATGCATGGGTTGATGGATGAATAAACAAACAAGCAAATATGTAGAGATAA ATAGTACAATAAAGGAGACTGGGCCTTCAGGGTGAGATATATTCATAAGGCAAAAAAGCTTCCATATCGGTCAGGGTT CTGCTGÍN)xACTCTAAATGATTGTTCCGTGGAAACCACTAGGCTTTACCAATAATATAACT( K ) xGTGTCACTTCAT TTTGTGGAATTACTCAGGCAAAGTGACTGCTGCTAGGCAATAGTCAAAGTCAAAATGGTTAATGGGTCTGAAACCAGT GGAGATCTGGGAAAATTTTAAAAGAATAGTCAAACATGACAAGAGGTATCACTGGGGAAAAGAAACCCAAAACATTAG CATTTGGCTTCCAAGCAGCTCACATTCTGGCCTTAGACAACTGTTACCTCACTGCTCAGAAATGGAAGCTTGAAGCTC GTGTGGAAAACCGACCTCCTAATAAATCTCACACAGAGGATGCAGCAATTTGTTATGCGTGAGAAAATGGCACTGGGA AAGCAAAACACAGCCAGCTTGCTTCATATCACAAGTAGCAATTGGGCTCAGGTTTTCACCTAATTTTTAAGCAGCTTA TCACCACCTCCGAAAAGAAAAAAAAAATTCTAAAAATTGCATTGCTTTTTTTTTTCTGAGAAGAGCATATTATGAAGT GTAAACATTAAGAAAGTCATTTCCCAATATAACTATTTCCAAGCTTTGTCTACCACTTG(N)xACTCACTACCTTTAA ATATTTTTGTCCAATCAATACAGCTC CC TCTC CCTTTTTTGGGGAATAAAA C CTTGCTGAAATGAACGGACT CTAATA T C TTGATTTATTAACC CACATC TGAG CCAAAATGATGTTTGTTTGAGAATCTACTTAGTACTGACTGAAAGAAACT GT A CAAGAGG TTTT TTTG CCAATGTGTGAGAAAATAGATCAGAGATAAGATTAATCTTTACAAAAGA C AAAAAG CTTACA ATGCTTTGCCAAATAAAATTAACTTTCATAATTTGCCTACTATGTATTTACTATGCTATGCTATCTGTTTAGCATACA AACCCCCTACAATCCCTACAGTAATCCTAGATGGTGCCTACGTTTTAATCTCAGGTTTATAGATGAGAAACTAAGACA ACCTGGACTTAAAGCCAGGCCACCTTACTCCACAGGCCACACCCCCCACCACGGTGAACACTACAGCTGTGATAAGAC AACAGAAAATTGCTTATCC(N)xTTGACTCACCTGAATAGTGTTCATTTTTATTTAGAAATGTAAC(N)xTTCTAACA TCTTGATTTACTAACCTACATCTGAGCCAAAATAATGCTTGTTTGAGAATCTACTTTAAGTAACAACTGAAAAAAATT TATGAGTTTTTTATTG CCAATG CGTGAGAGAATATATCACATAC TGA CAAGATAAAATAAGATTAATATT TACAAAAG GCAAAAAGCTCACGATGCTTTGTCAAATAAAATTGATAATTTCCCTACCATGTATTTACTATGCTATGCTGTGTATCA AACAAAACCCCCTACAATCCCTACAGTAATCCTAGATGGTGCCTTCATTTTAACCTCAGGTTTATAGAAGAGAAACTA AGACAACCTGGACTTAAAGCCAGGCCACCTTACTCCACAGGCCACACCCCCCACCATGGTGAAAACTACAGTTGCGAT GTTAAGATAACAGAAAACTG CTATTCATAATAGTAAATTATATACTTACATC CCTCGACT CAT C TGAATAGTGTTCAT GTTTATTTAGGAATGTAACTTTTTAAAGAATG CAAAGTTTTATTTAAGAACCTG CAAA GTTACCTCATATAACAG CAA TATTCCTGCCATTATG CTGCATATGGAGATAGAAACCTATAACAAT CATCTTTC CCTCTAACTTG CTTCCACCTTCTC CAGTGGGTGACAGAAAAGACAGATAGGACCTGTACATC CAGCTT TCTCTGCTAC CCCTTGAAAAAG CTCCATCTATTC CAATCAGTGGGCATGCAGCTTCCTGGTTTTCTTACACTTGCTGTTACTCATAGAGTATCTCGTGCTGTTGTTTGCATT TT TACTAGTTTCAG CTTATTTTAGGGTT TCAC TTAACG CACAGCTTTGTACGGAAAGT CATAGATT CCACTGCTCCTC TCAGTGTTTTACACAGGTTTTACTGAAACCCAAACCCCTGCCTAGCATCCCTCTTTTTCTTCCTCACTTCCCATGGCA AAGTCAGAATTTATTCTTGAGATTTCAATCTTCTCAGCCTTTTAGCTTTCCTGGCTGTGGGACCGCAGAAAGCTAGCA CTGCAGCTATGGTC TAGCAGGACAGCGT CCCAGC CTTATT C C { N ) xGCATGCTGTATAACATACCCCATCTCCCCTCT CTATAAGGA CAT TT CTTTTAGACTAT CT CTTATT TCAGTC CTGAGC TCCAC CTT GTCAAGTATT CTGGATACCCAT GA GGGCTTAGCAAAAGGGCCTATGTCTGTCACCCTGGATGGGAACTAGCCACTTCAGGAAAGCTCTGTGATAGCCATGTG TAAGGTCATAGTTAGAAATGCCCCCACCCTTTCTCATTAAGTTCTCCCTACCCTTATTCACATTTGGCACTTTATTTA ACTCTTCCCATTCATATATTTTGAGACTTAGGCTGAATTTAGCAACTTTACATCTTCTACCCTTTCTCATTGTGCTAA AGTCTCCATCAGGGTCATGCAAATGGGGAAATAAAGGTTGAGCAAATGGGGAAATATTATTCTCCATATGGATTGTGA AAGTGTGCAACC TATACCACTGTCAG CAGCAG CT CTGGTCTT C ATC CTGC CCCATTGTTT TC TGAAGGTGCACAAT CC AGCCTCCTACTAGAAGAACCCAGACTCTTGTTTCCCCATTCACAATGGGGGTGTTCACTTACACATTAGGTCAACATA GTTG( N) xCCACCGACTCTCTGTCTTCCCCTTCTGCAGGTCCAGCCTCCAGGGCGATGACAAGGGTGAGAGAGAAATA TGAGCCCAGCCTGCCAGGTCAGGAACAAGGAGAGCGAGGAGCTGCCTTTCCTTTTCCCTGGCTCCTTCCAAGTGGCAG TGTGGACAAAGAGAACAGGCAAGATGCTGTGGATGTGGAGCTGAATGATTCATCATGCACATCAATTTAACCAGAGTC AGTTAGCAGTATCCCAAAGCTCTGAAGGACAGGGGCTCAATTAACCTTTTAATACACAAGTCAAATTTCTTGTCTTCA TGACTTACTTCTTTACCAGCAAAAAGCCTCCAAAACCCTTCTCAGATCTTTGTCCAAGGTGAGGCTAAACTCTGTCAC TGGTGGGGCTTCGTCTAATTCCCTGTGTTGTTCTGTCTGCTGTGATGTTATCACTCTAAGAGGGAGGGTGCATTTTGA GGATGACAGAGAAGGACACGTATGATGCAGTGTTAACGCAGACCTTTCTCTCCACCTGGATGAAGAACTACGTTCTCT GGCCGCCCTCATACTTGT CGTTAAAAGAACGTAAATTATTTCACAATT CT CTCCC CTCAAACACAC CCACTATTTC TC TTTCTTCCTG CAAG AG CTGAGTTTTTTT CT CTTC AGGATAAACTAG CACATTCTTTTAATTC CC AG G AAATT AT ACTT TTCCAGGGATGATTACCACCCATTCTCAGAGTTTATAAGTCCAAGGGTAAGGCCCTGATCCAATTGGGGCTGGAGCTA CAGATGAGAATGAAAATGACAGACCATGTCTGTGTGCTCCTGGACTGTGAGAGGTAAAATTTGGAATCTGACAGCTGT CACGTTTCCTGATAAATGAACCAGAGAAGT GGAG GGAGAAGGGAGG CCTG GGAAAAAGGCGCTC CT CCTAGGCTTCAA GT CCTGAAGC CAAAGCACATTCTTTACT TTGAGT CCCAAAAGGCAACT CATTATTATT CCAATAACACCACCTTTGTT TAGTCTGATAGGGGTACGGTTTCTGTCACTAGCCAACCTAGCGATGATTATTACATTATTTAAATTAAACACCCATTT CTATTTTTGGGAGG CC CT CGTGGGTGAT CAAC CTGCTGATTTATGAAC CAATTCATGCTACGGTAGTGATGGGGAATA GCAGTTACCAGACTAT CTTTTGTTATAC CAACTTTGCCTTGAGACC CAATGTTTAATGTC TTGACCTAAAGTAATGAC TAATAAACCCCAATGCCAGCAAAGAGGAAGCCCACAGAGAACCAGCATCAGTGCAGATGAGTCTCAAGGCAACACCCA AATATCC CACTAGG CCTCTCTCTTCCTG CATACACATATGTTAAGG CT TGAATTTAATTTTTTT CT CACACTATAAAG CTAAGCTGTGTCAGTAATCCCCATGCCAACCTGCTGGAGAAAAAGTCAATCTAACATCATTACATTTTTCTGGCTATT TTAATTCAG(N) xCCACATTTTCCCTAGTACCTGCCTATTTCTAGAAAAAGCTAACTGGAAATCAGAAACAAAGAAGA CAGCCTTTAACTGCTTTATTAATGGGAGCTTGGTCATGTCATTTCCAAGTTCTTCTTTTTCCAAAAGACCCAACTGAT AGTCCCGTATTCATAAGGCACCTGCTCCCTCACCCCTACAAGTTCAAATCCAGTTTTCTTTCCCTTTTGGAGCTTTTC CTGGTGAAATTATAGCTTCTCATAGTGAAGGTGAACAGCGTGCTGGCTTATCTGGTCCCTCCGTCACCTTCCCCACCC CT CATCTGCC CT CTGCAGGTCAG GGTGT CAACAGTCTCCAGGACAGGT GT TAACTAAC CTAC CC CTGGAGGCACTATG TGTTGGGC AG TG AG AAAC CCAAACTACATT CC AG A CTCACTC AT TTGAAAATGG AATT AATAAAACTGGG AATT AATG AATAACAATATTTTAGAAGGAGTCACAATTGCAAGATGCCCTGCTAACCCAGCATCCAAAAAACTGCTGTTTACTCTG TCATCTTTGGCAATACACACCATTATTACATCTTAGTTGCTTAATCCGTAAGñGGCAGGTAAGATGCCAAGCAATTCC TTGGAGTTATGGTAAGTTATTCATGCAAACACTATATTTATTAAGTGATTCAAGCATAAAGGTCATTCAACGCTTACG CT CTCTTTCTAATG CC CTTCTGATGCTC CTTCTTTTAAAGCCTGGTGG CAAAAGCACT CC CAAC TCAGAACACA GGAG TACAACTTGAATTCAGAGAATTTAAGATG(N) x CAACTGAATTAGTCCTCCCAGAATTCAAGGTCTTTTTAAAGCAGG ATTTAATTCATATCTCAGTAGCTTTC CC CAAAATTTT CAGCAATTGCACCCTGCCCAC CTATTCATGAAGCTGAGAGG GATCACAGGAGTTGTCTAGTCCCAAGTCCCCAGAGATGTTTTCTGGTGATAGACTGTAACCAAAACGACCTCTGGTAT TTATTAAGTTTGCAAAATCTTGTGTCAAAGTT <N)xATCCTAGGACTGATTTGTAGGCGAATAAAAATGCTGA(N) xA TGGATCAGTTTT CAAT GAAATGAATAAAAGGCAC CATGGTGTAGTT CT CT CTTCACTCACGCACTT C CCTCAGAATTT CAAAGCCAACGAGTTAACAAAAAGACAAATTTACACTGGAAATCAACAACCATCTAAGCTGTGGAATTTTTTAGTGCA TTGTGATGTC CATTAAAATCTGTCAC CCAATT GC CTTCCAAAAC CC CAGAGCAACTGAAACACACAAGCGTTTTTGAG TGGGCAGGGGTTAAGGGGCAGAAGCCAAGAGAGGATCAGAGACATCTCCCAGTAACACAGGGGCCCATTTCCCTGAAC CTGTCAAAACCTTTGAGCAGGTAACACCGAGGAGTGGGTCTCCTGTGGCTGGCTTTGCAACTCCTTCTCTTCCGATTA TGAAAATATCCTTCTTACACTGCACCGTAATCATTTGAATTTTTATTGTTTTCACTCAAAAGCTCCTAAGTTATTTAG ATAAGCACCATTAAACAAAAATCTGAGTAATTAGTTCTTTTGATGTGTCCTAGTGGCAGTGGAGGAACTTGAAGACGT TTATTAACTGTTAGATCAGTCCCAGGTGGGACTTATTATAAACAGAGCTTAGCCCCAGAGCAGCCGCTCATTAAATAC TGAACAGGAATTAGGTATTTCTCTCTTCCCATGCCATTCTTCATCAACCAGCCTCTACCACTTGGGTA ( N) xGCCACT AAATACCACAAATCATTG CTTACGTGAAGT CC CGGAGGTGGGGCTAGAACACAATCACAC CCTC CTTTAGAGACTG CG AC CCAGAGAGAACAAT C A (N)xGACCCGCTTCTCCCATCTCTTGCTTTAGTAAATTTTCCATGCCAAGACATTTTCAA GTGCTTGACAGT CACC CACGGATGCTQACTGCTCTGAGTGAT CTATGG CAAACCACCCACTCTTTT CCCAGAAT GATA AATGTTT CTTATGT CTAATGTTGATATAAAAT GT TATGAAAACGAGAAAATTTGGGGGAT CTTAGAAAATAAAAGAAA GATTTACATGAAATAAAAATGAG TGGTT TTAT CT CGTGTATGTTGGAAAACAGAATAGAAGTTGATTGGCATCCATGG CAGAAGGGGATCTGGGTT CTCACTTGTT CC CACCTGTTCAAG CATC CCAGTGGAGTCCTCAGTC CT CAAGT CTGTAAA CCAGGCTGAACAGCCCCAGGTGACCGGGACACAGTGGGAAGTGAGTCAACTAAC
> H s 5 _ 60570005 -60603 234
GGAGCATAGC CCTC CTGATACACCACTCTGGCACATAGTAAATG CCAAATAATCATTGTATGAACTGATGAATG CACG ACCAACCCCCATTTCACAGTGTATTTTAACAAAGAAACATACACAGAGCCTTGCTGCCTTTGTGGAATCACCAAGGGA AAATAGCTCAGCCAAAGGTACTCTACATAGTTATAATTCTAATTCTGACTAACTTGTTAGGGTTCTCCCTCTGGTGAT ATATTGTAGAAT TTAAAACATGTTTG CAATAATG CATGC CTT CATC TT CC CTTTAAAAAC TGTGTGTGTGTGCATGTC CACGTCATTGTGTCAGAGAATGGGTTTGCCAAATAGGCCTTTGTAGTCCAGTAGTTTGAACTATTAGATTACTAATAG AAATCTATGAGAAGATTGTTCTTCAGCAAGTTTTCCAGTAAATTATCTCTAGGCAAGAATGTAGCAACTGGGTATTTT AAAATGTATTTTTGTAGCAGCTATTGACAAAGTACAGCCTGTATTGCTTAAGAAATTTTAACATGTAGACATCTAAAG TG CATTGTTGAAATAAAATGATACATACATTC CAAGGGT CAAGG CTGACCAATGT CAGATTTAAGAATGTCAGTTGGC AGAAGTGGAGGAATTTGGGAACTAGAAAACATGGCTATTTGCTCAGGTTAATTCTGAAATGGGTGGAGGGAGAATCAA TGAG CATGTGAATC CAAG CAGT CTTCAGA CAGAATTTTTAGCAGTG CACTCACAGT GAAAGTAAAAC CTTTAGAAACT AAAAAAATTAAACATCAAG CTGAACAAATAAATAAAGCAGGAACAAAACTGG CATAGGT CTG GT TT TTTAAAGAAGñG AACCAAAG CAGGCAGTTCTTAGCT GATCTCCTGGAATTCCTAGTAAAAGAAAGAAAGAAAATAAGGGTGC CCACACTG GATTATCAAATCTGTTTTCTTTTCCTTAGCTCTAGGGCTGTACAGAAGCCTCCATGCTCTTGGATATGTGTGCCACCT CAAACCCAGCCCCTCCCAGACGCTTCAGCCCCTGAGCTGCACCCCCACCTGGCTCTCCTTCCTCTCTCCGCTACCTGG CTCTCCCTCCCCTCTCTCCTACCTGGCTCTCACTCTCCCCTCCCGTACTGCTGGTCTGGGTGTTGTCAGTTCTTTCCA TTATTATGGGTCTGTCCATGCCACCACCCTGñTGCAAGCCCCTACTGTCTGCTGCCCAGCCTGTGGGCAGGAAGGAGC CCCTGGGATG( N ) xAGCATGCTCTGTACACACCAGCTACTGTTATTTCTTATTATTTCCTTGCTCCCTTTCTTGCCTC CATTGAAT CCAT TTTC CTGCAGTTAT CACTTT CAAATAAACT CTGATCATAACACCTTCCTG CT G A (N )xTT A A G A TA GAAGGGAGGGAAGAAAAGAAGGAGAGG GAAGAAAAATGCATG CTGGTACCCTTG CCTGTT CTGGAGGCCACATT GC CC TGAG GATTTCTGTCAG TCTCCAG GAACCATCT CATT CCATAA GC CTTGTGTG CTTTTCTCACTT CCTACTGC CTTTGA CTCATTAAGGTGTTGATTCTCAATTCAATTCGAATGTTTGTTCTTAGGAAAAATGGTCCTTACATTGGCATGATGGTG TGACCTCTGAATGCAGTTTATGGCTTTTTTGTGTGTTTGTTTTATGTACAGACATTTTGTGATTTCAAGAAAGGATAA AGTGATAGGTATGAAGTT GCAG GGAAAAAAAATT CCATCAAC CCAAATCCAAAAGGGTTC CTTG CC CAGCACAG CACA GCAGCTCACTCT CTGGTATTTCTG CC CCCACACTGAGGAGAGAC C CACAGGAGGAG CTCCAG CACAAGCCGATG GCAA AGGAAGGGAAACAATAGGAGGGGAGAACAGCCTCCCACATCCACCATTTCAACCTCTCAGAGAAAGTACTGAAAGTGA AAGGAG CATGAATAAAGAAGCCAT TT CTCTGAACTAAAAGCAGAGGAGAGGG CTCTTCCTCCTG CTGCACATGAGTGA ACAAAGATGGGC TG CAGAA(N ) xAGG AAGTGGGGTTTATTGAGTGACT ACTGCTTTGGGC TC CT CC CTGT AG TTT AGG TATTTT GCTCTGATTGAAT CTTACATAACTATATGAAGTAGATT TCATTAC CACATTTTACAGAAAAAAAAAAAAAAA AAAAAACCATGGATACTCCAAAAGGCCCAGGACGAAGATATAAAGCTGCCCAGTGGAAACACGCTGTGGCTGACACTG CAAAGAAT CCCGGCTCAGCCACGTCCTTGTTCTTGTG CAAGC CAGCGCCCTG CAC C TGAAGG CC CTTATGTGG(N ) x T T TT TTTAAATGTAAAAGGTGCCCTGTT TGGGATAACGAATT TATTGTGTGATCACCCTACT G ( N ) xTATTATTAGGGT GGCATCTAGAGAGGCGGCAGCAAGGAGTGAAGCCAAGCTTCCCACTTGACATTCCCACCTTGGTCTCTGTGAATGATG CT CC CT GT AGTTGTGC AGTGCC AG CCTTTGTGGCTT CAGACTGAC CCGGCATGGGCAGCACTAG CTACTCTAGATTAG AGACCATATGGGGGCACCACACTTGCCACTTCTACCTCTGCACCAATCAGATTCACCACCTTCTCCTTCTTGGGTTAC AT TGTTG G GACTGAAT TTTTTTTCTTTTA CTGAAAAGTTTAGAG CC CAGATTTCTTAAAG GGGAAGAATGT CAAGCAG CGAATC CTGCAGAG CAT C CCTACC CCTGGCCC CAGC CCATGCTAATTAAGTGAC TGACTTAAGGAAGTTG CC TC CT CT TAGAAATTAAGAGGTG GATTGAGTATTTTCCT CTGG CCT CCTTAAATGTTCTTGGTTACAGTGATAAAAATG CT GAGC TCTATTGCCTACTATTGGAGTGAAGGCTGCTGGAAACTCCTCGTTCCCTGAAGAACCTAATTGTCGATGCTAGTGCAT CT AAAC AGGTGT TCTT TAGATT AC TTTTGCTAATGC CTTTTC ACTGGAAAGATC AATTTAAATGTTGCTTTG AG CT AA TTAGTTATTGCTTCTGTGTCCATCCAAATTTATATATTTGATTCTGTGTGCTGGATGAGGCATTACCTTCAAAATCTA TAAGCTTACACAAGGTAGGTCATTTTAAATTTAGTTATCTGTCCTCTTCTATATTTTATACAAATATGCATATATTTC TTTTATTATTTAAAATGCCA( N ) xTGAAATGCCTAGTTTAAACATATATTAGAATGTCCAGCATCATCTTTGGATGTC ( N ) XTATACAGAGTATGTGTTGAATGACTATTTGTGGAACAGATAAACTAATACCCTAACTTAAATGGCAAACAGATT AGATTTGTTGTCTCTAGGTGTTTTCGTTTGGGAATCCTTTAAATGTTGCCTAAACTTTCTGTGTTTATTCTGTTCTCT GC CT AG AATTTC CT CCAT TTCATC TCTATCTCTGGAAAT (N ) xA T C ATGGAGTC ATTCTT AG AACAAAG G AT CTGG AA AGGG CC ACGTCT AGTTTT C CAG AT TC CACATT ACT C CAGGTCTC TGTTACCT TATCTGGTGGGT ATGTG G AG AATGTG AACTGCTTAAAATGAGACAGATGGTATACTCTGCTGAACCC tN ) xAGTTTGGAAGAAGGATGTTTAAAAAATAGGGTA CACTTGAACCTATTAATAAAAG TAATTAATGT G G CT CTTCAATCAAGCCCC CAAAACATAGAGCTT TAGAAC CTAGAA GT TGTCTG TTCAGACT CT CCTTTTAGAGTCTTAGGATCTTGT TTAAGGTCA C CT GG CTGAGGAGGAG CAGAG TC CAGC CAACAGGTCTCCTGAGTGCTGTTCCAGACATTTCTTCAGGCATAATTTAACATGTATCAGTTTATCCTTGAGTGTAAA GTACACTTCATGATTGAATGCAGGTAACTGGCTATTCATACTTCCCCCACCCCCTTTCTTTTAGTGCCTGTTTCTAAG CAGCAGAGTTGTGGTTCAGTGACTCTGTCTGTATAATATGAAAACAGGGCACCCAGCCAATTCTGATGAGCTAATTAT TT CC CTAAATTATCATGC CTAAGCAAAGGGCC CT CAACTCTG CTGTTCAACACTTATCATTGTATTTGG G G G GTGAGC TGTACATAAGGCATATCATCAGATAG CTAGAATTTT CTATTT CAGATTGTCAAG CAGTGAAAATTGTCGT TCTTAGAA ATGACTGACAAT CC CC CT TCCT CATTTCTGCTACATTGAGTTAG GTAAAGAAAAAAAAAAAAGACT TCAGT CTTTTTA CAGATCT C TTGCTGAAAACACGTGACAGAACTATA CATTTT CAC TAGAAATTGATG GAAAAATGAG CACTT C CAAATG CCTGAAAAACTTTGCAAAGGTCCCACTTAACAGAATGTGGCTAACACCTCAAAAAATAAAGCAAAATAAATGTTGCAG TGG G CATGAATCTTGCTT CAAGTG CTGGGTTGTTGGCTGGTTGCCTGGGGAGAGTGGGGC CAGG GCTTGC CAGCTGGC AGAGCCACATGC CT AT AAAGG ATG CAGT CAGC TT AC ACCTTGGC AAACTATG ATTTTG ATGG AAAT CTTAGAGGTC AG CAGACGTTCTCCCTAGTAGAGTTACGTTAGCTCAGGTCATTGCTTTTATTTGAGTGAAGAAAACATCTTGATGGTCAT AATT CATAG GAC CAAG TCATTG CC CTGATCTAAA CCAGAGTTTGTTACTGGATCTGAATAGATAGA CATACC CTAAAG AAGG CACAGGATGTAA TT GAAGGTGG CACTTT CAATTACTAATC CC CAAGTT CAACTAGT CATG C CACTGAAGCACA C CAACTGGCTCATGAGCAAGCACACATCTGCCCCAAAGTCCCACTCAGCAGCCATTGTTTTCAAACTCTGAGCTCCAAA G CT ACT AC CATC CTATTCTCTGATGCTCATTCATATTCTGCTTTGTGTAGTTGTTGTTTTGTTTCTCTTT CATAGAT C TGAAGGCAGTATCTCCAAAATAGCTAGCTTAGTGCTTTTTAGCAAAGGAAAGAACTGCAGAGGGAGATTCTAGATTTC AGAG TC CATGTTGAAAAT TATAGC CATTAATATT TT TAATTGACAAAATAC C CATCAGTC TC CCAATGGATT TGTTGG GGTTTTTT TGAAATGGTCTAAACT GATTCTAATATTTATTTTTC TAAATATTAGAAT CAATATT GG CATG CAAAATAG CCAGAAGAAATACTGTGAAAAAATTCAAAAGACAAATATAGATTCGGAAAAACATTTGCAATACTGTATATG(N)XTC TTTCTTTCTCTGTCTCGCTCTCTCTCTATCTATATAGATTCAGGATCAC(N )xTAAATATATATACATATAAAT(N )x GAGAGAGAG ACTGT CATATAAT ATGT AT ATATGAC CT CTATGTATG CAG GT CT ATATATATCTATAT CT ATATCTGT ( N)xTCCTGCAGAAACTCTTCCGAAGGGAACAATCAGATGAGTGTGTAATGGTTTATGT( N ) xGTATTTGTAAGAGTAT GATCATTTTTTGGAAAAATATACGTATATATACATATATGTTTACATATTTTTATACATATACACATATGCGTACACA CATATATGTTTACATCTTCATGATAGCCATGATTTATGCCCTTCCAGAGTGGCTTTCATATGAATGGATTGATAGAGC TTAGTTTTCTGATAATAAATAT GAGGGTTTTGTTCAAACG TG C TTA A (N ) xTAAACAGACGTATAAAATTTTTAACAG TAAAGACTTAAT GGAG CTGACATATT TGGATATGGG GACTAGAACAGAGGA CTT GAAAGAGAñCTCTTATTGATGCTT ATGTGAATA CTG CAATGATGAATG CATTTTTTTCTTG C CAGATATCAACCTGGATAAGGT CTTTAGATTTAAATATTA GT CAGTTGAGAAATGGATATACACATACTAATAAGGAAGTTGAATTAGATGAAAACTAACAAGT TTTC CATGCTTTAG GCTGCCTGCATTACGAGTATATACAGGTGACCCTTGACTGACTTACCCCTCAGTTGGATATTGAGGGATCATTAGAAT TCTAGAACTGATTGTATAATTT CAGATA TGTGGAGGAGGGAAAATTGATGAAGTGAAGAA TAATAAGAGAGTAGGT TA AAAAGAAACTGTAAAGAAGTAAAGAAAAAAATACTGTTTTAATGCTATAAAAATTAGTTGGTGGAGGATAGTTTAAAA TTATTAAAGTTGAATATAGTAAATCAAATTACCATTAATATTCGTGAGCACGTGGTTGTGAAAAGATGGATGAAATGA CTAGCTAACTGT CTTTTTAATAAGACTATTGGTAATAGAATCAGTTGT GG CT CATTATTAACTGTATTAACAGAAAAT TAGCCTACATGAGCACTTAAAAATATGAGGTTGTTTTAATAGACTGTGGTCCCTTATCAACATATGTAAAACTTAAAG GTTTATAAACCATTTCTGTTCCCAGAACTAAACTAGATTAGATGATCAATTAATGGCTAAAATGTTATTATATCTCTT TTTAATATTT CT TGTTAAAAACTTAAATATTCCTGATT CAACTTGTGCTT CTATTTGCCTGATA TAATTTGGTCATAA TTGGTTTCATTTTGATTGTCTTTACATTTGGGGTACTTATTATCTTTT CT CT CATCCACTGGGAATGC CAAGCA CCAA TTTTTGCTACTTAGCAAGTCAGTGAGGAGCTTCCCACTGAGTGCCTTAATTATTGTTAATTGAATATTGTAAATGATT TTT CAGGATCAC CATC CTGGCAAT CCACATAGCATT TTTC TCAGTTAG CAAAAGTTAAAG CAC CAAGTGTCCCTTCTA CC C CTTACCATACAGT CATCAAAT CCAGTACCAGCAGTAACCAGTTCCTGTATAAACGAG CC TT CT CTTTTCAAATGT CACATCATGTATATGC TTCATACGACAC TTAGAACATGGTGG CTGCTCAATGAATATTAAAC GACAGCTGTCCTTTTA TATAGAAGGAGTCCCAGTTCCAGTCTGAGAGGAAACATTCAATTTCCCTTTCCAGAGCTCTCACGGTTCATCTCATGT AG CATGGCATGC CCGGTACCCCAG GGTAGCACTCTC CCATTTACAGGCTC CAGGACAAAG CTTT CTAGAACTTTTTGC TGCTGTATCATCAAAAGGAAGCAAAAAAGTAGTAAATAAAGAGCTTCAGAAGATCCGAGAGTATTGGAAATGCTACAG GCAACAAACTGGAGTGACTGAGCCAGCCTTCTTATGTACTTCTTGCCTTACAATACACAATAAAAAAGAAATGAGAAA TGTATGACCTGCCTTTTCAGCTTTTCAACTGTGTCTTCAACTTTGGAGGGGGCTGGGGGCTAAGGCTTCAATGGAATG AAGTTGTGTTGGAG GTTGGTTTTAGCTACATAAAATTTTG CCTTCTTAATGATTTGTTTT CTGG CC CAGTCAATGAAA TT TCAACCACTGTTATGAAATCAAGACCAGAAGAAG CC TTGAGACATAATTTAGAGTCTT CT TC CCTATCAAGG CAGC CCACAGATCAACACAGAAAGAAGAGAATCTGTCTTAGAGAGACAGGCTTCCACAGGCTGTGATTGTAACCTATTCTAT AAATGACATTTT CTGC CTGGACTTT CCT CTTATATGTAGC CACTTGCTAT TAAGTTTGGCAGGT CT TAGATTTAGAAA CAAATTCAAGGCCCAC( N)xCCCACTTTTAGCAAGTTTCTAGTGTTGAGTGTTACAGACTACGTGGTACTAATTTTGT c c c t g t g a a a t a g g g a c a a a t t c t c t t t t t t t c c c c a t t t g c c t a c a t t t c t c a a c t g t t t g a t c c t t t t a t a t t g t a CATAGTTGTGTAAATCACCTCAAATCTTTTGTTATAGAGACAGGTAAGAATTAATACATGATTGAATAAATAAAACAA TAAAACATGCCCTCAGGCCGCAGTTTAAGGGCATTTTTGCTGGTTCTGAAGGAAAGCTGGTCCTCATTAGTAATCAGA GCCTGCCTGTGCCTCAGCCCTTTCTGATGAGCAATGCTCTGACCACACCCTTTTACCTCCCACCCATCATCTGCTTTG GTGA(N)xTCTTGGTATGTATCAAAATATTTTCTTTTCTTCCTGTTTTCTCTCAATTATAAAGTTTTTACCTGCTCAC TTAGGGCTTTTCTGTTTTTCCAATAAGATATAAAAATTATTGTCTCGTAGAATGATGTTTGGGAACAGAGGTTTAGTA GAAACCAAACAGAAAAGAATAATTATTC CCATCTACTTGT CT CCAAATAAATAC TGGATTAAAAATTC TAAATATT TA TGGAAGGAGATT CC CTAACTTT CT TCAATGTCTTGT TCAATCTTCTTACAAG TAGTAGAACTTAT CTG CCCTGAGG CT CCACTGTCTAATATTTGCAGATACTGAGCTTTTCCCCCTCCTTTTCTACTTGCTTTGGGTAGAATATCTGGCTTTTGA CTTCAATGCAATT(N)xGCCATAGACTGGAGGCCTATTTGTCTGGAAGGAATGGGAGGGGACCATCAGCCTGTTTTAA TATGTGAGGTAGCTGAACTTTAAACTTTGCACACGGTCAAACTGACACACAGTGCCAGAGAGATCCAGAATTTTTGGT CATT CCACCT CCTACAAGGGAAGACCGGTGACTAGT GT GACGAAAATATAGATCTCATGAGG GT TT TG CCATGATGAC ATGT CTTATGTCAC CATATTAG C CA CTGTCCCTCATTTATTG GTGTTT CCTTGAGCCTTTAGAAATATGTAAA CATAT AAA C CCCTAGAAATATGTAACACT CCTATGTTTTCATTAC TTTCACATT CTTTGG GCCTAGAATAC C CACCTGCTACA AAGAGTTCATAACTTTAAATAATT CATGAATTTGTCAG TGTCTAGTTT CTTTTG TAATCATC CTAATGACAAAATT CA CTATACAGCAAATCAAATCAGCTGTGCATCATCCAATTTGGTCACCAGCTGAAAATATCAGAACTTTTCCCTTTGAAT ATGTGTGGATGCTGATGTTTCAAGAAATATTATTCAGAATCACTGTGTGATATACTTGGACTATAGATAATGCCTATT AATGGTGGCTAGATTTGCCTGCTCATTCTTAGCTTTATGAACAGTTAAGGCCTGCTCTGCTGAGGAGTTCGTGGTGTT AACATCTCACTTACTTTCCATGGGAGAACCCTAGCTGCTTAATGGCACTCATGTATAGAAACATCCTGGATATTCATG CCTCTAACTGTCCCATGGTGCTTTCATTCATCTTAGCTGAAATATCCTCCCTTTTCTGTGCACCTTTCAAGGCCCAGC TTACGTCTCCCCTCCTTCCTGAATTCTTTTCTGACCTTTCCAGCCCTCCCTGATCGCCAATTCTTTCAAATTTCAACA GCCTGAGCCATCCATACCAGCTTACCTCTTTATGATAAGCTGCAATGGATAGCTTCCTGCTGTTTAGCAGGTCAGCCC CT CAGCCAAATCACAT CCTTTGAAATGGGCAGAGAC CACGGC CTTTACTATG GCTTTGTG CT CTA CACACCTTG CATG A C CACATTCATCTT GATTCAAAAAAATATTGTTTGGTTGATT CACTTCAATT CTATTTTCTTTG CGTGTGGAAAAG CA ATATGAGGCC CAGT CA CCATCATTTCTATAGGAAAATCTA GAGGGGCCAC GAACATTTCAATAGTT CTGTAAAAGAAC CCTTATTCAC CCAGGACTATATTG CñGTAGGGCAGC CT GGAGAGAGGG CAGG CAGTTTCC CAATTCTT CCTAAA CTTG CTTCATTAAGTCATTCTAAGGGAATGTTCAGATGCTCTGACATCTTCATATTTAAACATAAGTATAAAACCCCCTCCG AAATAAAACCAAAATGATTTTTTTCCATTATCCTTCTGAGTGTGGCCAAGTGAGCACTGACCACAGTGAAACTGGCTA ATACTAATACTGTGGTGTCATCATCGCTCCCCTCCTTCCTAACTCACAGGAGGAAATCGCAGTCACATTTAAAAAGTG CT CTATCATGATTTGAGTTGAATTATAACTAGAAGATCTT CTTATAAT TTTTACATATTC TGAATGTCAAAGATG GAA AATTACTCTTAC CGTG CCATTCAACG GC TCCATGATTTTTGACAATTAAAGTTACTGTGG CATT GC TTACAACCGGTA GGCTTGAGTTTGTAAAGCAAGCCCTGTTAAGACTGGC( N) xGGAAAAAAAGACTGGCTAATTTCCCCTAAGCCCCAGG ACTGCAAATTAGGTTGCTGCCACCAGGGTTTTCACATGACTCAACTGTTCTCCAAGCTCTCTCTGCTGAGGCAAAGCA TT TAGCACA CAATG CAGTCAATAC CTTCAAATGAGATAATGAGTGTGAATG GG CTTGGAAGTGT CC CC CTCACTGCAG GGGACCTAGCTCTGA CAGCCAT GATCAGAGAATAGCAAAGACTGTTGTAGTC CT CAGCCAGG CTGAAG GACTC CAAGG CAGAGGAAAAGAACTCTAAGACCATCCGTTTCTGTATTTCTGCCCCAATGGCTATTCCACATAGTGGTCCAGGAGGCA GGGCAGAGAACCTG GAAGTAGGGG GCACCAGGAAGTAGT CA CATCCAAGTAGGGAG CACAGCAAATGCTCAAGT CCCA GCAG CACTTCCTGC CCGGGGTGGG CATCAGTGAGGTCAGG CTGGAGAAATGCAAAC TCTGGACTGGCTTGGT CACTGC TGTTCTGTTCTGATGAACTCTCCTTTGAAGGCCTCTTGCACCACTGGCCCTGGGGCACATTTGCAATCGTTAAGACAG CACAATTCAGAACTGC CTTACAC CAAACTG CTGGAGCATAAAAATGAGAAGAACTGATGAAAGGGAATCAGAAG CCTT GGATGACCGCTC CATG CCATG G GATACCA CAG( N)xCCACTACGTATCTGCCACATCAGCCACGAGAGTCTGGGTTAA TTTTTTAACTTCCCCAAATCTTACACGATACTTATTTCACAGGGTTACTGGTGTAAGTAAGATAATTAATTTGGAATT TATCTGTAAAATGACAGTTTGCACTGGTTGAAGTAAAAGTTAAAG(N)xAAAACATACAATAATAATCTTTAATTACA AACTTCCGATATGATGTAGGACAAGATTTTAGAAAGTGCAACACTTAGGAAGCAGCATATTTGTGTAAGTTAAAGCTT TACATA CAAGTTACAATACTCT TAACTCAAACTGGTTT CAA CAATGAAGTGAGT T CATTGTTAC CGAGATTAAAAAGG CCAAAT AG AGTG CT AG CTG AAT CC AGCCAG CTGAATTG AG AGTCAAGG CTTAAT CT AG CAGCTC AG AC AGTGTC ATCG TAGAGT( N) xTTAACCCAGGTTTGCTTTCTCTGCCCAGTAGTTTTGTGTTGGCAAGCTTCACCCGCAGGCATCACATG ATACATTTTCCCCACC CACTTC CTGTAGCTC(N)xCATGTGG CCATTCCATTCAGGAAAATTGATGGACTTAATCAAT CATG CT CATGTCGAGAGTGAGTTTGAGAT CATTCTACACAAATCA CAGAG CTGAGTGTTAGGAAGG GGTGAATATCCC CAAGGAAGAGTCAGAGATCACTGGCAGTAGAAGAAGTGAATGAATCCCGGGAGGTAAGGAGAGTGCTCCAGGCCAGGG CAGAACGGTGACAGTG CTGGAGGAGGAACCGG CTGACCTG GCTTGTA(N) xATGTTTCAGTTTCATGTATGGCTATGT TCTGTCAT TAGT CTTCAGGGTTAT CAACTG CAGGTTTATT TT CTTACC TGATTTTC TCAGTAGG CCTAAATAGAACTT GCCTTGGC TTTGTTTCTAATGCTAGAAAAC TT CCAACT CTGGTGGAGATT TGAAATATGCCATCTT CTATTAAC TAAG TCATATTGTTTAACTAGCCAACTTTCCCCTACAGTTTACAACTTGTGTCAGACACACGGAAGATTTGCTTTTGGTTTT TCCCTTGTGAACAAAACAAGTCATAAAAAAAATTATATCATGGTTTCCTTACAGTTGCATTTCATCCTGCATTTCTTA AATCAAGAAATTGACTAACATAATGGCTGAATGAATAAGACACCACAGCAATGAATGACTGCCTTTAGCAAACTGAGA GGGAGT CT CATTATTTAAATGTTC CTGGTTATATTAAG TGGAGCAAAAGG CTTG GAAT CAAAAG TGAGAGAAATAGGC AGCCAAGTCAAAAATGCCACCAAGCTGTAAATAAAATATGAAACTAGCTTAGAAAGGGAGCATTACTTCGAAGTCCAA CACTGATATATGGT CCATAATT GTGGGACGTTTGTAAAAG GTGACATGA CACCTGT CAATTATC CACAGCAG CCAGGA AATGTAGATTACATGAATAATT CACCAACCCAGTGCCCTTTGTTGCTAAGCTGTCT CAGCCCAGTT CCATAAAATAAC TGCTGATATGAAAGAAGAAAAAAATACGTTTAAGTTGCAAAAGCCTTATGTTACACG CGGAAAT GT TTTCAAGG CAGO TTCATCATATGACTATGTTCCAGTTAGCACCGTTGGCTAAATAATTAACATGAGCCTTTATCAGGTGTAACAAAGGAG GGGG CTAGGACT CTGGAAACTT TATATGTT CC CTTTGTGG CAG GTGGAAG CAGGGGGT GCCGCATAAGAGAATCTTCA GTATCAGAAACACATCACTACATATGTCATAATTTTCTCATCCTCCCAAACAGCACACGATCCTTTAAAAAGTCTAAG TCTGTGTTTTTGTTTTG(N)xTGGTGAGCAGGATTATAGAGGATGGCAGGAAACTGCCTGGTCAGCAGCCCCTACTCT GCAC CC CACTCC CCAC CCTTTC CCAAGCACTTA CAAGGA(N) xTGCAACACACATACACGTGTACCACTGTAAGTACA GCTTGTCTACACAGCCATGTGAGCCACCAGGATTTGGAGAAATACCAGTTCTGCAGCTCCCGGGACGGCACAATAGAG CATGAGGCATGGAAACAACTGACTCTTTCATTTGTAGCCTGGGCCCAGGCAGCCTGATACCTATACCCACCCCGCTCT TTCCCCACACCCTTGCTGGCTACAGGGCCTGGGTCAGAGGTCATGGGCCAGGTGGAAGGTGGGGCCTTGGAATGCCCA GGGGACTGGGCAGG CAGCCATAGGTAACCAGAG C CAGC CATCTAAT CCATCTGCTTCC CTGGAC TG CTGTGAGAAAAA GCCTCATTGGAGAGACCTACCAAAGTGGCTGCTTCCAGGAGCAAATCCAGCCACACTGTGGAGAAAGATTTTCTTGCC CTTACAAG GTGGTT CATTCCTTGTAACTTC CTTC CTTCAAAATGAAGTT C CAGCAGACAAAGAGAACAAAAAAGAGGT CTGTTT CT CATGGT TTATAAGGAAAAGAACAC CTTGTAGATAAAGTACAG CCTTTC CCATTTTT CAGAAGCAAATGAG TGTTTCTTTCTCATTT GAGGCTTATTTGCAGC CAAGGTTTG G CTTTGTTT TTGAAAGTTTGGAGGAGGTTA CATTCTT GTGTATACCCACTGGGTGCAAGTGCTTTGGCACAGGTTTTCTAATATTTGCTGTTTTTCTCCAACTGTCCCAAATGGT AGTTTATAGTATTT CAGCTCTTT CAGAGGC CAAC CTACAGGGAAATG CAATGAAAAGG GAACTGAGTTTCTGATTCAG GTGAAAGGCAACTGTAATTTCAGGCTTTCTGCATGTATCTCTTTGCAATCAGTCATTACTTTTGAAATGAACATTATA TTTAGT CATATG CAGAAATAATAACAAACTGCTGGTTT CATCGTGAAGAT CTGAAGTTGAAAACTT CATGTTTTAACA TTGACCATTCCAACAAGCGTAAAACAGGAGACATCGAACATCTTGACATTACAGCTGCGCTGAGACTATTTCTCCAGT GCATATGT CACAGGAT CCCTTTGACAGCGTGG CTGAGAGGGTTTCT TT CAACAAATGGAGGTAATTGTTAAA TGAATC AATG ATTG CCTAAG C C AATT AATG CCTTTT CT AATAATGC TG AAATGTTT CTCAAG AATAGTCT GG AAAACT CTTTCA TAGTAAGAGACTGGTGGAGCTGGAGCAGCAGA(N)xCAGCTACCTTGTCTGCCTGGAAAAACCCTTGCCCAGTTGG(N ) xGGATTTTATCTGGCAACGCTGCCAGGGCAACTTGTGATGGGTTGCTGGACCTTATGGGGCTGTCTGAGATGCCAAG CGAAATCAGTCTTGGGATCCATCCATTTGTTCTAGGACATTAACTTCTCGAAGCACTTGTGGATTGCTCATGGCAGGG CGATGAGTGGGCTC CTGTTCCACT CTTCAGTG CC CAGTGAGAAGCCTG CGGTGAGG CGTTGTCAGG( N) xGCAGGCGC AAGACAGCAGAGGGTGTGTAAGAAGCATCCACAAGT( N ) xTGCAGTTATTATTCTGAGAGTAGCAGATATCAACCACT GTAG CTTG GCAGGCTGGTGTAT CTTGAACACCTGACAACTAA GTGTGTTG CCAGATTTTGATCAATGTCTAGAATATA TTTTTCCCACAAAGGAGGCTAAACATTATCATCCCCTTGTGGGTTTGGTTGTTCTTCAGCACTGAGAAGAATTATGTT GGTTCCTAGCTGCAGTCTTTCATGTAATCAAGAGGAAAAGCTTTTCATGCCCCATTGAACACGTGGTCTTCCTTTGGA ACCAGCTCGATG CC CTGAACTT CC CACATCTC CCTCTCAG CCAGTC CCACATCTCCCT CTCAGC CAGTACCCTCTGTC CTGAGCCAAAAGGTACTTAGCTTGACACTATTGGCTAACATTGAGGTGACCATATGTATCCAGTTGTAGCAAAAATAC ATCTTACATATTGATCAGAAGACAAAAACCACATATATCTCCATATACCAAAATTAACAGCAAGTTTCTTCAATTCTC TGAAACTTGCTTCTTTGGCCCAGTGTTCTTAATTTGTTAATTTATCAAGAAATTATAAAATTTGAAAGTGGGCCTCAA ACACAAAATTGCTGAAAAATGCTGGCTTAAACAGCCACTTGAGCAATGCTCCTTTCACAAAG( N ) xGTTTTCATGTGT GTGTGTGGGTGGTTTT TGTGTGTG TGTG( N ) xGTGTTGAAGGAATAAGTAAACAAACAAATGAGTATGTAATAGCTGT TTAAAATGCACTGCAAATGATAAAGTACCATGGAAACATGGCTGGTTATTAAAAGCTGACAATTTTATTGCTGTTTTA GACTT CT C CTGC CCTTTAGGACAT TATGCT CCAAGGAGTT TACATTTTTTTAATTGAAAAATTTT CTTAGT CATATCT ATATTTTCTTCCTCTGACATAAAATCTTCCAGTGACTTTTTATGAAGATATTTGGGTGTTGATAGGAAGAGGGCAGTT AATGCACATAATATGAAGTCTC CACTAGCCAGAAGACACATC CTGATAGCTCAGTATTC CAAA CTT AAAATG AACC CA CTGCAT GAAAGCTCAATT CCTACAGAAGGTAGAC CAAGGGAACTGTTTTTG(N) xGGGGACTGTTATTTTTTTACAAA AAAGGGATCTATAAGCCTTAACTGTCATGTGTGTATTATTAACACTCTGCAGCCATGGTTCTCCTCTGAAAAAAAAAA AAAAAAAAAGAACTAGAGCATTTCTGAGGGGATCTGGTGCTATAGTCTATCCTTTGAGTTGATTGATGACAAATGAAT CCTCGCACCTTCTGTTTTGGAAAATACAAATGCTGGGGATGCTCCAGCTCAGGAGATCTTCCAAGGGATTTTGAGGCT GTTCTGACCTGGACAATG CTAGAGGAGAGGATAC CCTG G G CTTC CT CACTTTAC CAAAGAGAAT CCAGGCACATCCTT TT CTGG CAGT CTTGAAGGGCCTAACTATCTAAGAAAGAAT CTTCACTC CCTGAAATGATT CTCAGTGGAATT CTTTAA AATATGAGTTATCATAGTACATTAAAACATGAGAGTACAAAATGACAACAAGTGTGGACTTAAGGAAAGGATTTATTT TT CCTTTTCTTTCT CTTCTATGTTTT CTAT ATGTGGCT AG AG TG GT CTTACATAAATAAATG AACAAG AC AATTTATG GAGCCCCAAG CCCTGCTCGAAG CATG CTATATATGTTATTTC CTTTAAGGAGAGAAATTAAATGTGATTTC CAAAAGT TGTTTTGTGGAATAATTGAGGAAGACATTATTGTTCTACCTGCAACATTAGGATGAAAGCACAGTGAGCTATAATAGA AAACCTGCCACGTTATTT CCTT CCTGTTTCAATT CTTGACTTGTATGT TC CTAGGC CAGACTTC CTGGAGGTGAGCAG CCAGAG TGGACAGCGAGAGCAG CAAAGGGTAACTACCACCTTGTGAGTGC CT CCAGGCAAGCTG CATT CTAGGCAT CT TTTATCACGAACAAATATC(N)xAACCGAAACCCCAACCCCAGGGTTATGGAGCATAATATAAGG(N)xTCAACACAC AGAGTCTCCACATTCAGAATGGCACCTGCAGCCTGTTTGCTGAAAACGGATACAGTAGTGACAGAGAGCCTGGAGAGC AGAGAAAGGAGAGGAAATGATGCATGTGGGACAGGGCCGGGAACAGAGAAGAGGCTCTGCGGAAGACAGGAGAAAGGG AAAGGGGAACGGCGCTGTGCTTTGTGGAGCAATAGTGGAGATAGTGCAGCTTTCCAGGACACAAGGAGAGAAAACAAA TCAGAAAATGGGTAAGGCAGAGAGATTTAACAGAAGCTTGGGGCTAGGGGAGAAGGCACTGGAAGGGGAAAGAAGGAG AAGGTGAACAAATGATAGAGGCTACAGTACAAATGTCTGGGTGAGTCCCATACCACCAGCTTATGGAAGGCCAGCGTT GGGTGGGTGAGGTGAGAG CTCAC CATG CAC CGAGATAT CCTG CTGTTG CGTG CAGAAGA C CACACACT GCAG CACCAG GCAGCCCTCTCTTCATGAGGGTCACACACATGTGAGGATTGTAAGACGAGGCCCTTGGGGCTCTGTTCACAACAAAAA CATTGTACGAGCCTTTTACACATGACTCAGTGTGTATTGTGTGTCTGACTGAAAAAAGATCAGCTTTACCTACAAGTG CTAACGTGTTGGCCGAGTTGAAAGGAGTGCACAATTACATAAGAAATAAAAATACCAGGGGAGCAAGCAGTTTACCAC CTTGTATGGGACACGCTGAATCTCCAGAGGAAAAGTGCCAGGTGACTCCTTTGCCAACTGTGAGAATAGCTGTGGTTG GAACTGGGTGTTTTGTTTTGTTTTGTTTTGTTTTTACCCCAGTGTTTGGCACAGAGTCCTGAACACCAAGTGAGGCCC CTGAACCCTGCTTGACACCAGATTGCATTTTCTAATGTCATTCCTGTTTCACTTCATTTGAGAAGGACTGGATAACAA CAAAAG AAAAAGCG AAAG AGTAATTAAATTTTGT TGTATG GG AAAAGT TT ATGTGCTAAG CCTG CAAAA (N) xATCTT TAAAACGGGAAAGATCCCTTCCCCCAGTTTGCTCAAGGGAGG( N) xGGAGGATAAGGCTGAAACAAAAGGTTGGCAAG GTGGAAGCTCTTTGAAAAGCACTTATTAATATCTGTTTTATACTGTATGCAGTGTGGCATAGTAAAAACTATATTAGA TGAGGAATTAGAAGATGTGGATTCTGG(N)xGTGCCCATTCATTCCCTTGCCTACCTGCAGCAGGAAGGCCGGAGAAT CTCCCATTCCCTGCTGGCCTCCCAGAGATCTAGCTGATAGACTTAAAAAAGAAAAGAAAGAAACCACTCACACATACC TGGGAATATATCCCAGGTACTTTAAGTGAAAAAAAAAGATAAGTTGAAGTACTATGTGTAATTCTAATATATAAACAT TC CATAATATTTTTGGAAATAT AGATTTGGTAGAAACT AT CT CT TAGG CAGG AT TG AAGG AAAT TTTACTTT AG AT CT TACACATTTCTATATTGTTTGAATCTCATACTGTGAGCAAGTTTCATTTGATAATCAAGATAATAATAAAGATGTAAA A A ( N) xCCCCTTTGTAGTATTGACAGGCTTTTGGAGAATAATGAAACGCTGAAATGTTTGGCTTGGTTTCCCTCTTGG GTAGAGTCTGCTGCTCAGTCTTTTCTTCTGCCTCTTTGTATCCTTTAGCTTTTGGGTGGAGGCCCAGAGAGCTCCACA AGGGGTGATGTGTGTTTAAGAAACTCAGCTAAAGGAAACAAACTGATTGCAAAGAACTCTGAACCTAGGGTGTTGGGG GAAATG CTGAGCATTGTAATTGGCCTAATG CTGCTCCTGTGAGAGATG CCAT CT CC CTAG GCCT CCAAAGTT CTGG CA GGGATTTTAACTCTTGTCATCCAGCTGATGACAGACAAATATTCATCACTTCAACAGTTAGGTGCAACCCCCAAAATG TTTCCTCAAGTCCTTCCTCACTGCAAATGCCTACCAGCTCATCTCTGAAGATGGAAAAGCAAGCGTGGTGACATCTGC AAATCTGGACAGTGGGCTTTAGGATTCATCTGAAAAGAAAGGCTCTTGGACAGTGCTGGACCTTGCTCCCTGAGCCAG GCTGTGGGTC CAGTG GTGA CTCAGGAAGGAATGC CACTGACAAG CCACAGGCATGC CAGAGAATTACT CAGCACAAGC CAA CAAGTAACCTCATACGAGAATCAAACG C CAG C CAATCTG CGAACG CTTTGC GTAACAGCATATTT GGATTTGG GG CTTAGCTTTCAGGCTTGAGTTT CACGTTTG CCTGAGCATACT CC CAGC CT CC CAG GGCCC CTTCTGTGTGCC CTGC CT CTGATTGCAAGTGCAGAGAGCAGTTCTCCCAAGAGCCAGGGCTCAGCTCACCCCTGCAGGTAACGCAGACCTCTGCTA GGGATCAGCTGTCTATCATCTGGTGGCGCCCCTCTTTCCAGCCAGCGCTACATCTTCATATGCCTTTTCTTACCAACA GGTCAGCAATGTGGGTGACAGATACTTCTTTTTGCTGTTGGAACACTTCATATTCATCTCTGGTATTGTCATGGCCTT TGACATACCTTACCAACAAGAGAGGTTTGTTGACTGATGACAGCAGGTATAACAAAATCTAACTAGTGACAGTTGTAA GAATTGGCAAAGTATTACTGCATCAACAGGTATGTGGACTGTCTAAGAGCCTCAGAAAGGACAAAGAAAGAAGCTTAC AGATTC CGATGCCG CTTA CTACTGTATCTGGCCTTTCT CCTG CCGGTATAGTGAGAGTTGACAGTTTGATGGATTTAA TTTAG GATCTATGTTAGGATGAGATG CACT CTGT TTTC CTAAACAAATTTG CTG CAGAATGGCAGCCACAGT CTAGGG CCTCCTTGAG CTTG AGGCTAAATGAC CTGATGGC CCAC CC ACTT CATCTATACTGC CTTT AATATGTC AGCATTTC CT CAAGGC CTGGGTTT CGTG CACT CAGGGTGATATT GAACATGAAGTTTT CAAACAGCAATACCTTGGGTAACAGACAAG GGCTAGTTAATGATCTCATTCATAATTCATCCCGTGAAGGAACCATTTACACTCAAACTCCAAGCAGGAAGAAAATAT TC CCATTAGGTAGGGCTTTTT AAATG CCTCTGCCTTTG CC AC AGTG CTGAGAGAG AACTGTGGT CTTC AGTAGT AAGG GCATATGGCTGCAAACAGAGTG CCCAGGTTAGAGACAC CAAGAG CTGCAG CATCAC TCAGTAATAGCTATTT CAGAGG GATGTAAAGC CTTC CCTGGGAAAAAGTTGAATAGGACT CAGTGC CTAAACACAG CACTTT CAAAAAGAAATGAATCTC TAATTAAACATCCTGAAGACAGAAGCTGACATGTCATTGCAGAGCTAATAATAAGACCTCATGAAATGTTCAAAAGTT CTC CAGAGTAAGTGTGAACAATTACATTTC CCTCTCTC CATCAGTGAAGGGTTAC CACCAGTTCTTTAAGAAAGAGAA AATGAAGGTTAGTATCTGAAAGTATTATATAATGTGCTTATTACTATGCGTGGGTTGAGATATTCTCCAAAAAAAAGT TCCTTTTCCTTCCAGAATTAAAAAGAACCCCTCTAACTTTTGTTAACTGTAGATAGAATCTTTTCTTTTTTTTTTAAG GAAGAATCCTTACTGCATGTCTAAAACTCGTGGCTGAGAAAAAGTAGAGGATGTCCAGCCTAAGGAGCTATCAGCATT TTTTTG TAGCA CTC GATATAGCTGCAAGCCAAGGTCCT CACGAAAGTGAAAGTTTT CATT CAAAGTTAAAAACATACT AC TTGCATTTTACAAG CT CAAGAGTAAAGCACAATAATTATCAGTG CTTTATTGGTAGTT CCAAGC CT CCAAAAAT GT CAGTAAGTTGAATCTACACTATCTTTGTACAAGAACATAAAACATAGCCTCGTGATAAAAATTAAAGGAAAGGATAAA TT TGGTGAATTC CCTG CATAGTT CATAATAGAGATTAGT CAACC CAAGTTA CAAAGATAA( N ) xAAGTTACAAAGACA ATTTTT AAAG GTTT AAGAGGTTT GCTTC AAAG AG CT A ( N ) xTGG CT AAAG AG CT ATTT CTTTTATATAAATTAT ATGA AGATTGCTAAAAGGTTTTAAGTCACACTACATGTAAAAACCCCTTAGGCTCAAACTCACCAGGAGGATCGTGGAGCTT GGAG CTAAATAAAG CC CAGGTGCTTTGC CCTCTG CTGCTGACTTCGGGTG CTG GCTGGGAAG GATCTATTTCCAGGAG GGACCCAGCAGAACCTCGGCGTCCCACGGCCCTAATAGGCAAATATGAGCCGGCCTCCGCCCCTTAAGGATGGAGCTG CTTAGCTTCGCCACTGCTGCCTGGAGTTCCTTGTCTGAGATAGGCAGGGCGTGGTGCCCCAGGTCATCTCCCAGGCAT GCTTGCCCCACGAGCAGCGCTGTGAGG GTGTACT CAAGGCAATC CAAGTT TCAACGT CATAC CATTGT CTTTAACAAA AC CCTGCAAACATGTAGTGACCGAGG CATGATGCATCCTGCCTCTTGCTCTGCTATTTATCTGCTTTTCTTTTTTTAA CT TTAACTTGTT CT CCTC CTATTTATTGGG CAAT CACCTAGATT CTAGGAAACAAATATATTTAAT TTTTCTAATCTA AAGGTTAATAAGGACT CTflT CTTCTCTAGT CAGAGC CACAGGATGAACTTTCT CAACC CATC CT CTTTAAGGAAAAGA AGTTGTTGGCTGAGTCACGTGAGTGTCTGTTTACACACGTCTTATAACAGCATACAGGGTACATACACAGAGCAGCAC ACCTCCCCACCAGGAAGGCTGTGCTCCATATTCATGTGTCAAGTGATGTAAAGGTTGCCATGGTTATAATCTCTGTTT AGTCAATAGTGTGTGTGGGGGGGGGGGCGGGGAATCACCCCAATGGGCGTATTGCAATTTAGAGGAATCCCTGGGTAC TT TTAAAGTATTGAAT CACAATTTACATTTTAAAAAGTGTTGTCAATGA CAG CATATTGGTTGC CATT TCTTGAAAAC TCTTCTTTGTAG GAGGAC CTTATGTATG CCAAGAA CñTCGTTTTTGCCTC CTTAGGTGAAAC TT TATTTCC CACATTT AG GATTCATTAG GATTAT CTAAGGTAAGTGGCAAGAGGGGAGAAGGAT GAAAATCATGAGGGTGAATGTGGGAAAG GG ATGAGGGAAG GT GAGCTTAATTATTT CAAAGATAATAATGAGTGAGGCT C C C í N ) xGGAATAAAAATAAAAGTGTCAG TAATTCTTACTCTT CT ( N ) x C AC CC AAT CTTATT C ATATTCT AGTC AATAC ACAAAG AAT CTGCTGTG CTTTTGGATA GAGAGAAATG CTTC AT AACAA C ACCATC AC AC ATTATTGAAñAG ATTTTGGTGTGATT AGGG CCGACTGACATC C AAA CTTCCCTCCCACCTTTCTGGTCATATCTCCTCCTTTTCCCACACTCAGTATCTTTCCTTTATCTACTCTAAGCAGTAA CATG AT AAGTTATATAAATT AT ATG AAG AT CATT AAAAGGTTTT AAAG T C AC ACTACATGTATG ACTT CGTATTTG AA AAAGGCTCAAAACAGTAAGTGAñAAAAGTGACAGTTGGTGGAATTTTAGAAGTTGAGGGTGGAAGGTGTTTCCAAAAC CC CT CAAATCACACAC CCTAGATGTGGCTGCTTC CTGGATCCACTAGTGAAG CTGACT CTGG CAGGGAGTCACTTG CA TT TTTCACTAAC TGAGGT CCTGCTCTCT CAAATT TCACAAT CTAAGGGAGAGAAT CTT GATC CAGAGTGATAA CAAAA CAAGTAATG CAAGCTCAG CGATGTGG GATAACT CCCCTGTCC CTGGTATTAG CATGGA{ N } xGAAAATGTGTATGAGT AAAAGGAAGAGATAGGTATGTTGACAGAAGACAG CCTGAGGAGTTTAGAGAATAGAAACATTTC TCAAGAACTT GAAA GCGAAAGAG T CT CTTG AAAGG GTTTGTTGCACACTG CCTGAAG G CTGAGGTATGGACTG GAAAGTG ACTT AGTC C AGC CC CTTCTAGCTGGGTG AGT AACTGAAC ACT CC AG CGTGCGGATCTT CACTGG AG AGAAATGGGAG C ATGATTC ATCTC TTTGACATCATCTTTAGATTTTTGGTTTAACTTTTCCTTTCCACATCAGACGCCCCTGTTGTTGGGTGGTCTGCCATG CCACTTCCTC CAAGTC CCAG CTACTTTACT CTAGGTAGAGGGCTGA GGGTAATGGATCTTATTT CAAAAGGAGA GGGG AAATTGAAAC CAGAACAG CTTTGTGATGAACTGTAGGGGAAT CCTTTTTTCT CTAGCAAAGGTT TC CCAAGGCTGGTG TCTC CAGCTTGGAG CTTGATCGTGTGTGGTGTGTGTGTGTGTGTGTGTGTGTGCATGTGTGTGTGTGTGTGTGTGTTT GTATTTGGTGAGAGTTAACCATTCAAAATGGAAATCCTTTCATAATTTGATCTAGTTATGCAAACACAGCCCATTGAT GTTAACA
>Hs7_75298908-75341904
ACACCATACCTGTTAGTAGTCACTCTGTATTCATCCTTCCCCCAACCCCCGGCAAGTGCTAATC( N ) xTAATAAAGTA AAATAAAATATGCAACAAATGTCCCTTGAGAGAGGCATTTAGCACTGGGAGAGTTTAAGTAAGGTCAAAAAAGACAAG CCTTTCAGCCAGG(N)xGGCCAGGCCTATCAACCTCTCTCTCTCCGTTTATCCTCACCTGCTTTTAAACTGGGGTCGC TAACTTAAAAACCAAACCAAACCATCTTCAACCTCACACTCTCCTCCAGCTATGACCCCCCTTCTTCCTTTCCCAATT AAACTTCTTGAAACACTTGTCTTCACTGACTGTCTTTGCTTCACAGATTCCACTTTTGCTAGATCAACAATACATC(N ) xAGTTCCCATCTAACTAGAGCTTGTTTTCCAGGACTTGCCACTCCGTTGTTCCTGAAACTCCCTTCCCTTGACCTCC ATGGGCCAGG CTTT CCTGGTTTCCCT CCTTTCTCTATT AATTTT CTTGTT TGGCTTCTTCTC AACCTGTATGTT AAAT GTTGGCGTCCTCAAGGTAGACCCAGCCCCACTTCGTCCTCTGCACTCTCTCCCTTGGTGGTCTCCAACAGCCCATGCC TTTAACTACCACCTATACAC CAACCTAACGTC CATCTCTCTGACCTCCAGATCTGCTTAG CAGTAT CT CACAGT GACC TCAGCATCAAGATGTCCCAAATCCACCCCTGACCACCTCTGCAGATCTGACAACAGCATCCTTCTTTCGGCTGCAGAC AAAGCCAGCATCCTAGAAATAGCCTGTGACTAGCCTCCCTCTCACATCTCCTCAATGAACAAGTCCTCAAAATGCCAG C T { N ) xGAAACTCCAGCTTTTAAATATCTCCACCCATTTCTCTCTGCCTCTGTGGTCACCATCCTAACAGATCCCTTG TCCACCTCCAATCCCTGCCAACCCATTCTTCACACTGAAGCCTCTCTTTCATCCATCCTCTCGCCTCATTCAAATCTT TCTGGGGTTGCCACTATATG CCAGAAAGATTT TTTATTAAATA CAGAT G CAAT CATAACATT CA TCTG CTTCCAAACC TTTGAAGGTTTG CTACTGTT CTTAGGATAAAATATAA C ( N } xTGATAAAATATAACTTTCTCAGCAGGGCACCAATGC CCTGTG( N ) xTTAAATGGTATGAAAGCTATTTATCTAAGGCAACAACCTCTTTTTTTT(N)x AACATCCTCATTTTAC a g a c a g (n ) x a a c a g c a g c c c a g a g g g a a t t a a t a c t c c c a t t t c c c a g g a g a c a a g a g t t t t a g t t g c g t a t g c a g g t c t t t a a c c c t g g a a t g c t t a a a a a a a g c a c t g a t a g a g c g a g a a a a t a c a t a c c t t a c t c a c c t g c t a c a a a a a c a c a c c t g c t a c a a a a c t c a g a a a c a g g a g c a a g c t a t t c t g t c c a t g t t g c c a c c t c c t c c t t g c t a a t g t g t c c c c a c a t t g c t g c g c t g g a g g g a c t g t g t g c t a a t c t g t c c c c g c t c t g c t g g t c a g c a a g g a c a t g g t g c t a t g g g a t g g g g c g t g c t a t c t a a g g a c t t t g g g a g g g a g g c c a g a g g a t t t a a a t t a c c t g c a a g a g g t a t t c g t c a t t c a c t t g a g a ( n ) xTCACTTGAGATTTATGAAGGGTCTTCGAGGTGCTGGTTGGGG( N ) xGGGGTTCCTGGCCTGAGCCAAAGAGCAAGC AAGAGAAGCAAAGGCATTTACTCGGACTCGTTGGGAGTGCACAGGAGAGCAGG( N ) xGTGTGTACCTGTGTGTGCAGA TGCACCCTGGGGAAGCTGGAGGGGCAGGAGGAAGCTGGCTCAAAACAGACCACTCAGAGCTCCCACCTTTCCAGCATG GAGGATATCTGAGGCCCGGAGCCTCTGGAGGTGCAGCATGCCCCATCCCATAGCACCACGTCCTCCACCTGATGCTGG AGGTCACT GACATCTGTGAACCTC CCATCCCTCCCCATCCTTGC CACCTGGCATTG TAGAAAAAAGGACACTTGCT CC CAATTT GTTAAAGG GTCTGTTCTGGTCT CTGAAGTT AGGAG AAAAG AG AAA CAAAT G G { N ) xGTACATAAACAGCAGT ACCATTTCAGCTATGCCCTCTCTTTTGTTTTCGTATATCCTAAGCAGAAGCTGAACTCCC( N ) xTAAAAATAAAAATA TATTTCCAGGCTAGGCAGAG(N)xTTTTCTTATCAACTGATGACCATTT(N)xTGTTCTGGTCTCTGAAGTTCATTAA GGAGAAAAGAGAAACAAATGG(N ) xATCCCTAATGAAAGGTAGTGGAACTTGGGTCGAATCCTCGACCACTCTGAGGC TGTTA C CGAAAGTCTCAGGCAG GCCCTGGCCTTCCCTC CAGGAGTCTA GAGGCACCTCCCAGAA GCAGGAGC CAAT CC CTGTGGCTGGCCCCCAGTACCTTTCTCATGTCTGAACTTTTTACAATAACTTCCCAGTTTATCTCCTTGCTGGTCTCT CTTGTTGC CACCTCTT CTTTATCC TGCTGTGAATAT CTATTTTT CTAAAACACAT C T G ( N ) xCTCATCTGGCTCTTGC CAGTCTCTC CACCTTTAAAGTC T CTTATGA C C ( N ) xCCCACACCACCAACACCCCTCCGTGGGCCACAATCCGGGAGG TGGGACCGCC CTTGTGAAGC CACCAGGGGTTCCTCCCCTGGGGCAGGG CAGGAGCTTG CAGA GCTGTTGGGTGGAGGG AAAGGGAGACTCCT CA C CACACACGCAGAAGTCC CT CAGCCCTACAAT GAGAGCCAAGGAAG{ N ) xTCCGGGCTCCCA GGGGAGTCCATCATGCTGCGTCTG CAGGGGGTGGGGGGGATGAG CTGTCGGTGGAGCCCCTCGACGGGTGGGGGACGG GCAGGCAGCCAAGCGCACñAGTGCñGCCCGGCTGACAGGAAGCCGCACTAAGGGGAGATCTGCTTGGGCATCCAGGAC ACAAAGCCCCACCTGTCAGGCCCCCCACGCACACCCTCCACCCGCCGTCCGGGTCTGGCCCCGAGCCCACCACCAGCA GCTGCCTACTCCCTTCCTCCCTCCCTCGGTGGCCTTTGGTCCTCTCCCGGGACAGCTGCACCAAACCAGCAGGTGGGA GGTTGTTTGCACGGGCCTCCCAGCTCTGCCCCCCAACAGCCAGCTTCAAATTCCAGAGCCTCCCCAGGAACCAGCAGC AGAGAGTGGCCCCGTGGCCCCGGCGTGTCC CTGACTTGGGTGACAAGCAGAG CAGCAGTGCTTCCTACTGACCAGCTG ACCGGGGCTTTCATCAAGGCCCCCTTTGTCCTCTGAGGTCCCCTGGCTCAGGTCCACCCTGCAGCCACGCGCAGGACC CCTGTCTACTGCAGAGCCTCTGGGCAGCCTGCCGGCAGCCAGTCACCATGATGTGAAAGCAAACAAACAGAGTGTACG CCCAAAGAGAGACACAGGTGGGCCTCGATTTCTGCCTTATAATAAAGGTA{ N ) xTA A A A TA TA TTTA A A A A TA (N )xG GTAATATCAACTCACTGTAGAAAAT(N)xAAAATGTTGGCTGTGCCGTGGCATTCAAGGACACGCTCCTTGGAGTACA GTTTTCGACTGCAGCTGTGCCTC CACAAAGAAAC CACAAGAACAAATC CAGGGGGAGGGCTATTGTTGAAAGAGGTAC TCCGATACTTCTCCCT TAGGTATCTCTCTGAGCAAC CT CAGC CAGGACTGGCTGTG GGACGAGGGATGT CAACTAAGT GGATTGTGAGGGGTGTGCTAAATGGCATTCAGCACAGAATTTACAAGTGGAAACCTTGTAAATAGTGTCATCGTCATT CGTCACTAGGCACACATGGGGATCTGCGAGCCCTGGCTCCCGGTCTGCAGCCCTGATGGCATGATTTGATTTTATTTT TT TT C CGAAAGAGGGT CTTT CT T (N ) xTGTGCAAGTTTTACCTCAATTACAACCAACCAACCAACTGCTAGGCACAGC TT CTAATAAG CAGC CAGCAGTT TCCAGTTTCCGTTGTCTCCCCCCGTC CCAATTC CACTCCC CAAAGGCAGT CTTG CT TTTTAACCTCTTTTCCCTTCTGGCCAGTAATGATAAACTTGATATTTTCAATTAATATATCTGCATTCC(N)xGTTTT TCTTGGAGTAACTC CTTTGTTGTTTTATTTGTACACCTTCCTTTGTGCCACAGACATTCTTTTC CCAAATAT CCCATT GT TTAAG G CATTCTGCGGGG CG CC TGGTTT CCAG GAGG CACC CC CCACGC CAGTGC CCTTGGTACCGCACTCACTT CT TCTGGCTGTCATCTTCTCTTC(N)xAAATAAATCACTOCCTGCACTGCACCAATGCTGATGACAAGCAATA(N)xTTT ATT ATG CTGGGCATGCGTGGTCGG C AGATGTGTTTCTGGTTAATGTTCTTGC ACAA CTTCATTTCCTTACCTCTATTG TCTGTTTT CT CTTTTT GCCACT CCAAATACTTTCATA CTGAA CCTTCTAGATTGATTCTTT C T T T T T T T C T T T (N ) x A TGTCAGATTGTGCAGGCTTTTTTTTCTGAGACCTTGGTGGTTTCTGCAGAGACGTATCCTCTGATCACCTGGATGGGA AGTG GATG CCAGATGG CTTG TACT TGGC TCATCTGGATACAGGAAATGTT CTAGAC TTTCAGTTAT T CTC CT TGTTTT CAGCATCCGCTATACCCAAACCTCTGCCACATGAAGTCTCTTGAATTAGGAGCTCTGGGTCAGCTTCTCCAGCCCACT CTCCACTGCT CCAACC CAATGCAGTTGTAGGATAAATACTTGCCTGCCGATATTCAGAAGGGGTTGTGGAGCGGTACC A G ( W)xGGTACTGGTTCTTAGACATCCTCTCCACCTGCCTTCAGTTCTGCCCCCGTGCCTCCAGTATGATGTCTCCCA GT T CAAGT CTCTCTTGGCATTTGCTTTCCTCCTGTCAG C A TCTC( N ) xTATCATTTCTTCTTTTCTTTCCAAAATGTG TTAAAATACC CTATTTGATAATGATCCC TCTCTCATTTTCTTTGTCCTTGTGGGTCTTTCTCATTCTTATTTCTTTAC TATTATGCATGCAGCTAATCTTGGGAAGGTGTGTAGCCATTCTGCTACTTTAGCTAAAGCCCTTTCTCTTCCTTCCTT T C C T T C C (N ) xCAGTG TTTTTCTCACTTG CATATTT CTGAGTGCTTTACAACTCACAGGTCCTCCCCCCATCCCAAAA TG CTTC CTTCTGC CTT CCCAAG GATTTTTTTTTTTT AATGGTTTGATTTTTCTCTT CTTG AAACTTAACTTTTT AAGT CGACCTGGAATTTATCCAGGCATGCAGCCTAAGGTGAGAATCTAAAACATTCTCTTCTCAAAAACAGCTAACCAAATT AGCCT(N)xAAATTAATAATTTTTTAGTAAAAAAAA(N)xCCAGTTTTATAACTATCAACAGGTCTATTCTATGTAAA GTATCCTC CCAAGT CACGCTTTAT CCTT TTTCCTATTT T CTATT CATTAC TATTAAAAGACTAAGAAAAGTAGAT(N ) x T CTAT TATATGCAAG GAGCGAGGAAGATAATAT TC CAATATATTTGTAT T CACTTGTAGGTAAAAATAAAACACTTG GATGGACACAATAAGAGATCAGT CAATTAGTGGT TCTCT CTCTCTTAG GGTTGAG T GGGAGCTAGG CAGTGG GAG GTC AGGAGTGGAAGGGACG CCTTTCACTTTT TTTTT CTGGAACCACT TGAATGTATTAC CTATTTAAAAAATGAAATACAT TAG CAGATATG CAGTAACTATGTTTTAAGT GAATGACAAAAGATTTCATAGG ( N ) xGCTTTCAAAGATTCTTACAAAG ATATTTTCTGGTCTTGCTATTTTGGTGAACAGGATGTTTTCTACTACAAACTAGTTGCTGATATACTAGAAAATATAC TCTTGATTTTTATATAG CTCACTC CCAGTACTCTTATT CTCTATGGTGAACTTGTTTATTAATT CTAAGACCTTTT TT CCATCGATCTCCTGGGTTTT CAGTGGATAGAACAAGGTTCTAGG CTGGTT CCATCAATGTGTTT CG GCAGAT CAGGAT AG AAAACGGGGATGGTGACGTCTGTGGG GAAGGAAAG AGTG AGT TCTC CAT ATAAAGAC ATT CAGTTCC AAGT (N ) xC TG CAGAGTGGTCTCTG CTTACT CACATCGT CCTCCTCT CTTC CC CTCAGC CACACC CT CCAGTCCTT CAAGC CCCTGA TCCGGGTCCAGGCCTACCAGACCCTGAAGGTGGGTTCCTCCTTCATCTGGCTCCTCTGAGCTTCTCTTCTCTTTTCCG TG GTTGTCAGGTTTGGAATGAC CAACCACAAGCTGT GCAGCCACGGGAGCAG CACTGA( N ) xGGCACCGAGTTCTTGA ATAAA CAG CTTGAGACAGTT CATG CTGG CTTCCTGGGCTCAT CC CACC CT TTTCACAT CCTC TG CTACTGTGGTTATG AAAGTGAC TG CGGTGAC CATGAGCCCGT CAGCTCTT CCTGGGGT CCAAGAG CAACT CGAACC CAGCAATCAAGCCAGG TTCAGCCCTGACTTGTGGCGCCTGGCCCTGTAGGCACTTGGGAAACCTTTTCTGTGGCATGAAGGGCACTGTTGAGGG CAGACTGACATTCCAGCTAACCTGCTCACCCAGAACCCTGATGGGCCTTTGCAGAGAAAGCAGAAGAAGATGACAGGT CTCCTGTCCT CAAGGGACCAAG CAGAGT CT C CATGAGG GTGGGATCTG TTTTTTCACACCTGGTACTTACAGTGAGTC CTGAAAAACATATTTGCTGGTTAACTGACATGGAGCTT AC CATCTACAGT CACT CAAAACACAG CC ACGG CAGAGACA AAAGACAACACCTGAGGAAAACCAACGAGAAGGGGACAGAAACGTCTCCTGGGGTTACAGTGACCAGGTGAACAC(N) xC CT GACC CGGT GAAAA C TTTTAACCCCTAAACAAGAATGACTG CTGTAG CCTCT CCGGATGGGATGAGAATAT T C AC ATTT CATT CT CATAAAAATG CTAC CAA(N)xCACTACCAAATGCCAGTCT(M)XCAACAATAACAACAGGACACAGAA AATC CTAAACTCTG CCAGGTATGG CTGT CCTCACTTTA( N) xGGGAATTAAGACTTCTGGTAGAGAAAAAAATTGAAA AATATTATTAACATAAAG( N)xTACTCAGGAGGCTGAGGCAAGAGAATTGCTTGAACCTGGGAGGCGGAGACAACAAG AGCGAAACTCCGTCTCAAAAAAAAAAACAAAACAAAACAACACACACAAAACAAACAAACAAAAAAGCCAAATATATA GATGAGCTTGGAAGTACCTC CAAGTACCAGAAGG CCATA(N)xCATGCTATGGGATAAACAAATCAGCTCTAAAGGAC CAAAGTTGAGACCACTGACTGTGTGTGCCTCAGGGGTCTGGAACTGAGCTCCTGGCCAGAGCTGCCGACCCGAAGCCT TTGGTGAAGAGTCTAGGGAAAAGCCAGGGCCCCGGGCCTGGGCTGCATAGGTGTGCCAAGTTCCCCCAAGACTCATGC AG CATGGAAG CCCTGCCCTG GGAACCAGTGAACCTGGTAAGAACTTAG CAGAGACAAC CATAAAATTGTC CCAA GAAA TACCTCCATAAGTCAAGACACATGCAGGACCCCACAGAATATAAGCTCCACCAAAGATGATCTCACAGACAACAATGA CAAAACAAATAGTCTTACAC CAGTTCTAGTGGTAGTT CTTACATAACAGC C(N)xTACATAACAGCTTAATCAACTG( N)xATGAGCCACTGCACCTGGCCAACTATGTGGATAATGCAAATTCTCCAGTTCCAGAAACAGATTTTTTAAAAGCTT GGAATTCCACCTTATCCCATGTTTTAATACATCTTTAAATAAGTACAAATGCGTTTTACTTTCTTTTCCAAG( N} xTT TT TT TTAATTAATG CCTT TT TAAGAGTTAAAAT CATGTAC TGGTTAATTT CACTAAAACCAGAAATTAGTGGAAGGCT TGATTAAGGGAGGC TTTATTTGATGTT AAC TTAC CATT C CATAGACTATAAAGAACATTATAAAAAAACC CTCTAAAG TGACACATGCCCCAAATGACCAAGACATAAGCAAACCTTTTAAATTACTCATCTTTCATATGTGTGTTTGTTCCCCTA CTATTATCACTGTGTCTT CTGTCTTTTGTC TACC TATGAGAACTGCA CAC TATCTGTG GCAATATTGTGCTCCCAAAA GT CCAGAATTTC CC CCAAAGTATTAATATGTTGTGGATATAATGTGTAATATAG CACATC CTGTGTTATCAGGATATT CC CTGTGAAAGACTTACAAATTAG CAAC CTTCTAAGATTT TGAGTTCTAATGT C CTAATTTGTGGGTAAATTAT CTCC AACT CTTAGATTGCAACATCACTTGAGTAACTCTTTAG CAAACT CTTTAG CAAAGTAT CACAAT TACACTGCTTAGGC TTTTTAATACTAG CATTAATAATT CATGATGCTTGTGTACTGAT CAC CAAGCAAGGAC CAATAAAATCTT GGCTATAA CAAAG GAG GAAAAAAAGATGGTGAAATGTAGACATTTACAGACATACAAAGTTTAAGAGAGTTCA CTGG G CAGAGAG C TTCTTACAAAAAGAGCCATGTAGTTTCTCTCCCAAAAGAGATACAAGTTCAGCCTCAAGGAAAGTTCAGAACTCCTAG GATAGTCCTATTTATAGTCTGGAAGGGACATTTGAGATCAAAAACGCCATCCTAAAGTTTTTTACTTTTTTGGACATT AT CCTGATTACC CAATGAGC CCAAGATG GAAACCAGAGGAGAGGATAGAG GGGAAGCCGT CTGG CTCCAGG GAAGGCA TGAGTGGAATTTCAGTCCAGACCTCTCTCTGAGGTATGCCCTCTTTCCTGGCCTCTGCATCTGCAGCTGACTTTTCAC CAGGAAAGATCTCAGGGCCTCATCCCTTATCACCTGCTGCTTGTCAACCA3CACTTGGGAGAGAGAGGGCTCCTCCAC CAGTCCCCTGCCAGTCTTGGGGTGAGGGGGCAGCGTAGCTCTCATAATGCCTCCCTCTGGTAGCTCTTCTGAGCTTGG TGAGAAGCGCACACTCATCTGGGGTTACTTTCCAAGGAGAAGGTATTTTTAAATAGAAACGAGAAGTAATTATAAAGA AATGACTGCCTGGCAACAGAAATCTAAATCCCAAGCCCCAAGCTGGTGACAGTTGTAGCATGAACATCTGGTGATTCT ATAATTAAAACATTGACACCTCACCGTCGCTCGGCTCCCTGCAGCCCCACAGAAAACCTGGTGTAACTTTGCTCTTCT TTACGGGGGGAAAAAAAGTTATTTTCTT CC CTTT CTTCTC TGTTCAGACATAACATCATCAACATAGCTCTTTAAATA TCTGTGCCGGGAAGCTATATTTAGCTACTATTTGGCCCTCAAAATGAGCCTGATTTCCTGTGGCAAAAAGTTCTCAGC AACAAGGAGAGAGGCCAGGAGTGTTCCAGGCACTGACAAACAGCTTTAAAACTTGAGATGCCCAAATGATGTGAAAAA GT TAACTGAGTG CTTTTT CTGGATAGGTGG CTCACACT CCAATAGTAGGGGGAAAAAT CACAAAGGGGTGTGTGTTTG GT CT GAACGTTT CCAGAGTCATCC TGGAGAACAG CTTGACGAG GGGCAACTCTTTGGG CC CATATGTTGTACACATCC CT CTTTCACAGG CAG GAGGGGAA
> H s 8 _ 8539816 -85571 63
TTTTTGCACTTTTTCTTATGGAGCCTGGAGGAGCATGATGAGTGATGGTGGCCAAACTGGTTCTGCAGAGATGGTCTG GCACCTGCTGTGGTTGAGCTTTGAAGCCTCTGCCAAACACAAGGAGGGCACAGGAAAAGTCACATTTCACACCCCGTT GCAAAAGAATAGGATGAGTTGATGTGAGCTAAGTACTATGCACAATGCCTGGCACCCAGGAAATGGTATATGGAAGGT ACAGCATCTTCATTTTACTATAGACAGAATGGATGCTAAAATTCATCCAAATCTTTAGTTGTCTCTATAATCTGAAAT TC CACTCT CTGATT AATAAAGTTGTCTTTAAGAAAGTGGTTTTT CATT ACGG AAATGAAAATTGTGACTAACAAACTC AAGT CCAG C CTCAGTCTCAGGAGGATCCGTTATG CAAAAAGGAC CCTG CCTTTTGCAACATACAACACACAAAC CCAC AGCGGGTTCCAGTCTCAATGGCTCACTTGTCTGACAGGATCTTCGCTCAGAAAGAAACAATTGTCAGGTTCATGTTGT GG CCTAGCAAGAGTTTGGAAAGTT CCGTAAAGAAAAATAATGTAGACAAATTTAGAAC CTGCCC CAGGTGTGTTGATT TGACGCCAAACCATGGATAGGTAACTTGATTCAGCTGAACCTCGTTAAAATAATGTTGGAAAACATTACATTACTGCA ACGTCCTT( N) XCATTACTGCAACACTCTAAGAAATCGGTAATAACCATGTCATTACAGAGAAGCCCAAACCTTCCTA TTCTATTTGACATGCACTAATTGAAAAAAGGCACTTTGATTGCTGTAACATAAGCCTATCATCCACTGAATTAGAGTC CATAATTATTAGAATCTATTTTGATGAG CACCTG CTTT GGGGGATGAAATACTCTATATGAGG C CTCTTT CTAT CTAA TATAGGACTCTTTATCTTTTTAAATGGTGATTTGGTGGCAGATGACAGTCAACATGTCTCTGAATCATCATGAGTTTT TAGCATTT TACG GAACTGGT TTGGAGATTAAAA CTAACAT CATTTTGTTACAACAGTGGTGCTGTGTAATAAAAAGGA GAAGTCATTCGTAGAGTT CATAAACCTTTTTTTAATAATT TAGATTTTAGGCCGGGCACAGTG G CTCACT C (N ) xCAT AGGTAAGTGTTG CAATAARAAGAAGAAATTTTTACATTTC TATT TTTTAAGGAAGGCATT TTCT GACATTTATGAGTA AGGATATT C CAAGAATATTGTCATGTAT CTTGTTGTTT CACAGC TCAAGC TATG GGAATGAAAATTTTAATCTGAGTT TCTAATTATTTC CC CG AG A C AGCTTG AG AG AGGGGG AGGT AACTTCCC C C ATCT CCTAGGTAGGTGGT AGGTAACT AA CCCTGCAGTG CAGAACCAATGCT C TGAAAATGAC CAGGAAAAATGCCC CAATAT CTA CTG CTTC CTT CTAAGTG GGAA GATGGGAAAAGAATATTGGAAGAAAAAAAT CAAAGTGATTATGAGAAAAT CATAAGGGTAAAAATAAAA CATGAGAGA GAGGAAAAGCAG CAGCT CTATATTATTTTTTAAGTCTT GTTATG GCAG TTGGTT GCTTTTATTT T CAAGT GGGTTAA C CTGACATTGTATTCACTTTGACTTTAAATGCTAGAAGAGTACATTTCAGCAAAAAGAAAAAAAAAAGGCAATGGGAAA CT TTCATTGT TATAAATGñAAAGTT(N ) xGAAAAGTGACCGTAAACATCTAGGGAGCTGCAGGTGGAAñGAGñCAACA CC CAAACAGATAAACAAGTGTCTAAAAATG CTGGCAACCGAG CAAG CAAACAAAAG GCAT CAGACT TCñCCTTGAT CC ATTATCAGATCTCAGAGTCATCTATGCAATCTTCCTCCCTGAACTTTCTAAATATCCAAAGCCAAATTCCTCCCCTCC CACGTCCCTGTCCCTGGCCTCGCCATTCACACACACCTTCTTTTCCCTGTCTTGCTGTGAAACTTGGCTTTCGCCATG CACCCAGTCC CCAGGCTAAC CAC CAC CC C CAGTAC CACATTACTTTTG CT CTTCTCTT CTGGAATACTGACTTCCAAG TCTAACATCTGTCACGTGATACACCGCAATTAAATAGCCTCCTAACCAGCCCCCCTTTCTCTCTGTTCTGCTACAGAG TGATGGCTCAAAACCAAATCAGGGCATGGCATTACCTGGCTTTAAAACCACTCATAGTTCTCTCACCTGCAGAATAAA GACCACACCCTCCAGCGTGACATTCAAGTTTTTTCTGAACCTGCCTCAGCAGCGTCACAGTGATGGCGATGGTGAGAT CACACACACACATTTGTACATT TATAGTAGGACAAACTT CAT GACC CTCAGGAGTTACT(N ) xCCCAGCTCTATTACT TC CTG GAAGA GAGCTCA CAT GGAGCTAGAGTTAAGAGTG GTT CCTG CTGACG GCCCAG CACAACAAGG CAGAATGAGT GTGTGTGTCTACAGGCAGGAGTGGCTTATGCTACTTAAGAAAATCAAAGAAACAAGAGGACATATTTTTAAAATAACA GAAATG GAAG CAAACT GAAT GAATTTTT TAAAGTTG CTGGAAAGAGAAGAAT GAAA CATGTCACAAAATAAA GAGACA AGATGATTTCTGGTCCCTTTTTATTTTCAGATGGCTTCACTTGACATTCAGTTTCACTGTAAGCACTGGAACAGCACA AG CCAG CCTTTT ATTAAAAAGAG AG AGAGAGAGAGCGC CATTT ATTTC CAT ATT AACTAG AAAC AAGCAATGAGTCAG GTGTTTAAAAAAGAACATGATTG CCTTAATAT CAT CAACAGTGCAACCT CTGAATGAGGATGTGGG CTAGTAAAAATA AAAATAATGATTGATACATTTTTTTCTATTTACTTATCTACAG GGATTAAAAGGAT TAGC TAG ATGAT GGAT TGGAAT GTTAAAACT C TCAGAAAACATATTTC CC TAGT TACAGATACAAACAAT GAAGAACACTTGTC TTTGTGTCAAATTC TA ATAAAT GATATT CAG GTATATTA CAAAGAAGT CAAAAC GCTTATTT CAATGC C AAAGG GGGAGCTGGTGAAGTAAACT GGTTGTAACCTATTAGAGTT CTCCTTTCCCATGTTCTG CAGGGAAGAAAAC AAAT CAGTT CATTTT CCAAGT GCAGAT CATCTCCAAATGTCATGTGCAAAATTATGCTTACCTGTCTATTAAGATAAAATGAATCAGCAGTGAAACTTACATTAT GTTAAAGACAGAA CAG CAAAGACGAGACAAGG TATAAATT CTGAA C CG TCTAGCTAAATATTATAGGAGGTGTCCTTT TCACTTTGTAGGTCCTAGCTGAAACCTGTTGCAAAAAGGGTGGTGAAGAGTTAGCTGAAATCAAGATTCCTTGGAAGG CAATCCACAATGGATTTGCAGAATGAGGT CTG CAAGGACAAAAGT CAAATTAAATCATGT CCTTTTAT CACCTGGCTA GAGACTGGTCCATACCCCCTACCATTTCCTTGATTTGGTTTACACTAAGACATTTCTGGGCCTTTGCTACTCCTGTCT
C (N ) xCAGCCTGTTCCATTTTTCTGAGCACCAAGCTCAGTGAGCATTCAAAGAATGCTTGATGAAAGAATTACTAAAT TAATGCATAAATGGAGATGAAATTGCTG GC CT TAAACCTG CC CATGTCAT CC CCT C TCTTAAACTGTCTACT CCCAGT TT CCAGTATC CC TTTGTTGTAAGGCAAG GC CATCAGATGATTTCCAAAATAACTACAACT CCGGGATT CTAAAGCAGT TTTCTTC (N ) xG ACTATAT ATC ATTT CT CT AT CATTTG ATGTTCTTGT CTG GGCT C ATTC ATGAGACT ACATTC AG CT GG CTGCñGTG CAGGGACCAAGATGGACT TGTCAGGAG CCTTGGATC GAG CTGGCAGCTGGGTACCTTGGTCCTCTTCC AAG CCT CCTTTCATCCTCCACG CACTAGACTAG CTTTTTTACTTGGACAAGCTCCAATGCGTGAG CAACTAT CAAG CC TCT(N )xGACTTCATTGAAATCCTATTAATTTTCTTGTCACCACCATTGTAAATGGGATCTTTTTTCTATTTTGCCTC TT CCTGTC AATATTTT TACC CTTAAATGTAGG CCCTTATCTAATTTAA TTGGTTAGCCCTTC CAGCTCAATG CTTAAA AAATAGTTCT GAA CTT G CTGGG AATG CTT CTG CTATTAAACCACTTATAAAGTTG ATC ATTG ATTT AAGTAAATAT TC TTTACCACTGAG GGAAATAT CC CTCTAT TTTATTTCAC CACGG( N) xTTAATTAGTCGTTCATGTTGAATGTTAACAA AATAGAATCTTA(N ) xTTGATTTTAAATTTTACAAAAAGAATCTGTTAGTATTTTTTAATTAATATAACGGGTTATAT GAATAGATTTCTCACTACCGAGCATTCATTCTGTTTTTTATACAGATCCTCCCTCTCCGGCCTCCCACCCTCAGCTGT CAAGAGATGACTATTCTGCATACAGACTGTCAATGGATTTCAGCCTTCCAGTTCCACTTCTTCCTCTTCCCCAGCTGT TAGCAT C TA G TT( N ) xTCTATTGGACATCACTGATCTAGAGGAATTCTACTCTTTGGTTGTACATATTCCTAAGATAT TT TGATATTC CTGTTTAAAGATAGTTGACCAGGGA C GTTG GCTAAC( N ) xTGGAGATAGTTGAACGTACTTGTGATCC ATAGTCAGAGTTGTGACCTGGCTTTGTTCT( N) xCCCTGAACCGCCAAGCGAAGCAGCCAGGCCAGCCTTTTAGAGGA TGAGAGACCATAAAGAAAGACAG GCCATGCTGT CT CACACATTTCTTC CAGC CAAG CC CAGATCCCAG CTGACATCAG CCCCATGACTAACCCCAGGCAAGACCAGC(N)xCAGGCCAACCAACTCAATGCTCTATGTCTTTAAGATGGAAATCAT GCTATGTGTATGGTGAGCACTG GCCATC CT CAATTTAATTAGTGAAACTGATTGC CAACTTCTCTAGAGTCT CTTG CA ATTTTATTTTAAACTfl GACAAACCTGTCACTGGTAT GT CATTTCTTAC CAATGTCTTCTC CCTTACTTTCTATCTTCC TTATCATGCAGT TCAAAATT CACATACT TTATGGGTTTCTCT CAAG CAACTT CCTT CTTTGC CAAC CAGTGACAGATA GG CACT CTTG CC TCCATGAG GTAACTTT TATT GCCTGCACAATAAGTAACTTTTCACAG GTAATGTGACTAACAG GTG ACACAGGTCATTACCTCTCCCACGTATCTTTCAGCTGACACATCCTCTAAATCCAGCCCACCTCCTCCCAATGGGATC AG CGG GTATT CC CC AG GC AAGAGAG GGAG C AC ATAT CTT C AAATT C ATTAAATCTGGT AT TT AG AG CAGAG TTGGC AA CCAAGCAACATCATCACTAGAATTTACTGTTCATGAATTACTCTCTAATAGAATCAATTTTCTAAGAGATTTAAAACC TATTTAAATCAAAACACAAATAACATGTGG CTGGGG GCTGATAACA GAACAG GGCGTTGACTGGCCACTGTAGGTTTC TT AAAG AAGG CTGCCAATTC CT AATGTGTTTG CTTT CTTTTG ATTTTCTATCTTCC CC ATTCTACC CACTGC AATTTG TATCCTTTTCACAGCTTCATTTTATCATTCCAGTTTGAAATTTCACATTCAAATCGTCCAGTTACATATAAATATTCT
T(N)xTTACCTAAGTGAAATTAAAATGAAAAAATTCTACTTTCAGAAATATTCAATTATAATGTCAATGAGAGAAATA TT TT T (N ) xTGTTGCAATGTATGTTGACTCTTTAGCTTATATTCCATAGGTACAGAAAGCTAGATATACCTAAATATT GC CTTCTCACAT CACTAACAATGTCATCAATACAAATGTACTGCTCTCTG CTATAC TCTTGTAGGT TC CCAT TTTTAT TATAAATTTCGGAAACACTAGATGATGAGTATAAGGAGGT TGAAAC TCAATATTAC CCAAATATAGAT TAAT CCAG CT CTGTTAGTATTAACAAAAAATA T CATTT T T ( N) xTGATTTTTATATGTTATGTATTTTATAAAGGTTTTCTCACCCAT CC TTAT GGAAAT CCTACTGAGTAGAAAC CATTT C A (N ) xTTCATCTTACTATTGAAGAAAAAAGGCTCCCAGAGTTGG TG CTAATTAC CT TCCCATCATCAGTT CGAAAGT CACTTCTGCCTGCAC CATGGTAC CTA CATTAAGAAflGTAAGAC GT t a g a g t a c a t g c t c t c t a a t g t c c t t c c a g c c t c a a c g t t g g c t t a t t t g c a c a t a c t a t a g t a g g t g t g c c c a g t t c CCTATCGCTGGGCATTTTTCCCAGGTCTCTGTGACACGTAACAGGGTCAGGAATGAAAGTATACCTGCCCAATTGTTA ACATTATATGTTGTTATTGATC CAAATGAACTGAAAAC CGATAAAATG CCTAACC CATGG CAT CAAGCTTTG CAAACA AATGACAAGTTTG GTAAGTTTAACAG GT TC CGTTTTGAAAG G C ATC TG CAAG GTAGAG GTGG CAGAGAAAGCTAAG CT GATGACAAAATGGTCTTTCTCAAAAGCATATGCTTAGAAAGTCTCTATTTTACGACATTCACTTTACTAGACTTTAAA CAATTCT CA CAT CAAC CATAAAGCAAGCAATATGCATTT C TG CATGTGTCTATATG GTGTACTTTTGAAATCAGTATG TATTAAT C AATGTTAAT AATGATTAACT CCTCTATTT CT C AT ATTGTAGCTGTTT C AGTGGAATTT AT C CCATTTC TG TCTGGGTAAAACATAACTGAGAGCAGCAGTGAGACAGGATAAAGAACTTTCCAGTGCAGGTTTACAGTGCGTTCCACT CTTAGCAATGAC TGGTGAAAAATCACATGT CCGTCCTG CTACTCAGATTATGACAATGTATTACTGTT CAG G GAAGGG GAGGCTATTATG CCCTGAAAATATGGAAGGAGACACTTGCTTTTATACTC CTTTTTCTTTCC CAAGTC CAGGGCAT TC AT CAGT CAAAATGGCTTGTTAAATTAATTAGTTTTTAC TT TCTCCAAAA CAGTAAGTCCTATGGGATG GGGCAATACT TCTCTT ACGAAC CACC AC AGTTTTATTTGACATCTGTCG G CTñAAATT TT CC CTTTTATATAATTT AAACTTT CTT TT AATTAT TTGT CATTATTACTAC TATT G TTAí N) xTAAACATTGTAACTGAAGTTTATGAACTAGAAATATACAATGTT TC CCTG CGTTACTAGG TTCAACATATTTTGGACATAAGTT CA GAATAC CAATAGCATAATATTGTA CACATAAAATTA GACCAT TCATAGAGTTGCCTTATTTG CT TAGCGACCACAGGAAAGAAAGGGTATTCAGAGTT CCAGGAAAAAAATAAG AAAATATGTTTGTTAA TACTTCTAAGGATG CTATGCTCAC CAGTATTAATATGATAATAT TTAAAATGACCTTTGGAA AGAAGAAATAATATAAAGATGAAAAATCAGGCTGTCTGGTTTTTTTCTTTACCAGTATACAGCGTATTTGCTGAACAT ATTCTTGAAGTTCAAAGTCAATGCCATATATGAATTTTGTTATGACTGATTTGTATGTTGAATTTCATCATTGCTATT TGTTTCCATGTAAAGACTGTTTCTATATCATGTTTAAACATAAAAAATACTTACAAATGTGGAAATATACCACTCCAT TTTTTCCTGGAATATGTGCACATCATTTATTCTGCATGTATTTCTGTGAATGGCCTGACTGTTATTGTATCAAAATAG AATTTAGCTCTGTTCACATAAGACTAGAGCTAGTTACGGCCATAAAAGACTAGAGCTAGTTATGGCACATTCACAATT ATGGGAGAAAAAATGT GTGT CATAG GAATGTGAGATTTTACTTCAAAATTAGAAAAACGGAGA CAC TTAACT GATCAT GATTTTTTAAAATAAT CTTTTT CATATGTTTAAAATTCTAGAGAAATACAAT GTGTAAAA TCT CAAACAAAATAAAAA TCAATTGTACATCCCATTGTTCCTTCTAGTATTTTTGTATACACAGATATTCTTTTTAAGATTTAATTGGTTTCATAC TGCCCCACTACTTTATCATCAGCTTATTATAAAGGGTACTCTA(N)xCAGATGAAACATCAACTTCTTTTCCGCCACG TAAACTAATAAAGATT TGTCTTAATTTTGTGTGAGG CATG TTGAAAT CAAAATATTTTAAACTCTTAC GTAG CTAAAT CTATCAACCTTTTTCTCCATCATTGCCATCTTGGATGTTTTGCTTAGAAAACCTTTTCACTCCTGTAGCTTTACCTCC ACCTCCACTTCCTTCTAATTCTTTTATTATTAGTT(N ) xACATTAACGTGGAAAGGGTTACCATCAGTGTAGTACTCG GT TTTTACATTTAGGACTTTGC CCAG TG GC CATGTGGT TTTC TCAAAAGATAAATTTT GC CAAAAATGAAACAAAC CC CTTGTGAATAAAAATAAATAAATAAATAACAACAGAAACTGAAAATTATCTCAAAGACCTCAATGAATTTTGTCCATG GCATGGTAAATATGTTTGCC CTGCAG TAGA CAGGAAGGGC CAG GGTTGAAGAGAAG CAAG CATGGCAAGACAGAG G CA GCTGCTGATT CCTAGAA CAG CAACTC CCAC CC CAACTC CCACCCAG CATTTC CCT CTTGT CAAGGGAGGCCACTGTGA CTGTACGCCAAGAAGGGTTTGAAGCCACTAGAAGATAAAACTTAAGCATATTTTTTGAAATAGTGGCACAGTATCACA GGATGCCTTCTGATTTGTGTGAGAGGGTGGCAAGTTGATTCCTGTTTTAGTGGCCATCTAAAGTCGCCñ(N)xTTTTA AAAAGCGATGTATGCTACATTCTTCTATTGTGTTACTATGCAGGAACTTTACAAAAATTCT (N ) xAATGCGGTGTAGG AAACAATATAGATAAGTCAGTACACACATATTAGAAAACACGTGGCAGTGATAAATTCTATAAACAGAACAAATTAGA GG GAAGTGATACTGAG CTAACAAGAGGG CTATTTCACACT CGATGT GAAAAGTTATTGATTTATTTTCTACAATATTA GG CACT CACAGT TGGTAAAACAGCTGTTGGAC TACTGG CCAAAAAGAAAAAC TAATACAATATTGAAGGGGC TTTGTT CAATTC AGGCTGTTTT AATG CT ATTT AAAAAT AAAG AT CATAGTTT AAGTTT TTTC AT AGTCTAAT TT TTATTCTATA ACTCACTGAAGTAGATTTACCATAGGATGATTATCAGTGGAGTTCATTTCTCACTCCTATTCAGAATTCTCATAGTAT AACATTGTGAAC AAAATTGT CTTATCGC TT AG CAAACACAGGAAAGGGTGTT C AGAGTTC CAGG AAGAAAAAGAG AAA ATAGCC TTTG GAGAAAGTAAAATAAAGACC TGGCAGAG GGAATCTGAGGACATCTTATTATGGCAAAGTGAT CTAGTG
( N ) xATCTAGTGGACCCTCTGTCCTAGCCAAAGTTGGTAATGTGACTTTCCAAGATGAAGCTGTTGCAGGTTGCACCT TTGTGCGAGGTGGTAGCAGAGGCCTAGAGTGCAGGCAAACAGGGGACTCAGCATCAGTCCCTTCAGAATTCACTGCTG GGGGCTCACTATTCATTTCAAAAGCTGCAGCATAGTGAACAGATTCTTGCATTACTACCGGGCCCTTCTTCAGAAAAG AAATCTGTGGTAGAACCTGGGTATCCACTTACACAGCCAGGTCCTAATGTCAGCAATCATTCATTCATTCCTACCACA GAGTGACTCATGCTTTATTTCTCCCTCAGACTTCAGATGAACTCAGAAGGCCAGGGAACAGAGGTCAGCTCTATCCCC AGATGC TATG CTTAACACAGTCACTT CTTT CT CTATTGTCGCAGAAAGTGAG CCCCACAGGCCTACGCTTATTTGTCG TAAATATCTGGAGCTATCCCATACTCGTATGGGATAGATTGTTTTCTATGCTAAGGAAAAAGAAACATTCAAACAAAT CAACCTGCTTACTGAGATGTATGTCACAATCTGTTTTGGCCAGACGGTCATTTGTTGGAAAATTCAGTAGGTACTATG TGGAAGAACT CAGGGCAGAAAGACATAATATT G (N ) xATGCAGGGCTGGGTGGGACTGGGAATGGTCTTGTAAAGGGT TG ATGC AGG C CAGCTGTGñGGTT CGTGTGG CT AATAAAAAGGGAGC CACAAAGCCCGGTGTAGG AG AC CAG TT AGC AG CTTAAGGAGAAAGATGGG AG AATGGC CTGT AC TGGG AAGAGG C AGT AGAGAAGAAAGAG ACATGGAAGGACG AAAG AC AGATTTAGGAGGTAGAACCCCCAGGATGCCCTTCCTCCCCACAGGTAAATAACACCTGGGCTTCACTCATTCCACAGA GATCTAAGCAAAAACG
> H s 8 _ 8304215 - 83511 72
TATGGGTTTGTTTGCTTGTTTTTGGTGGGTGAGGCATCTATTTAGAAATCATTTTTTGTTGGGCCCCTTAATTTTCAT AAAATAATGCAAAGACATGTGTACTTTAATTAGCAC CATTGAGCTT CTGATT TCAGTTTCTTGCCC CAGAGT CCAC CA TGCACACTTTCTCCTACTATGCCTCATGAGGGCTCTTTTACTTGAAACACCCAATCTTTAAGGTGGGAAAATGCACCA AG C CTGATAAGT GACT CTACTT C CTG GT TC CAGACC CAGTGATTGTTC CATT CGGTATGGAC CAGATTATATTTCATG CTGGTCCAGATTATATTCCATTAGTGAATTTTCTAAATTTCAGTTTTCTAAGTTCAATACCAATATTTATCCATTTGG CCTTCGTAA(N)xTTTTAAAAAATAAGAACACCACCCTGTCCTTGGGAGAAA( N } xAAATCTGAAGTAGAGAATTTTT TTTCTTCTTGCCTTATGAGCTCTGCCTAGAATTCATGCCTAGAATGGTACTTGGTATATGCTAAGTGTCTCTTCTCAT TCTTGTTATCAGGGGATGCATTCAGAGCTGCCAGTGGCCCTGGGTCCCGTCCACACGAAGAAGCTGGAAAGAATGAGG CCATGC C AAAA CAGAGGC TGAGATGAGAGGCGAAGGCAGAAATC TG CC C(N )xATAAAGTTCATGCTGTTATTTGGCA GATG CT GAGATGTGAG CAAGAGAAAAGAGTCAAGAATCAAGCAGGAATAT CGAG CAAGGT GGAGTGAGTGAC TCATAG AAAACACACGGAGACAGGTTTTGGAGATGATGATGATAATAATGATGAACTTAGTTTTGGGCAAAATGAGTTTGAGGA TC TTTGAATT TT CC CATGTGGAG CTGTTCCATTTGTTGTT TG GAAACGTGGATAGAAAAC TCAG CAGAGAGT TCAGGG ATGAAG CTGTAGAT CTGGGAGC CAACTGAACGGAGCTGCAGCTGAAGG CCATGACACTG CAAGAGGAACAGAGTGG GG CTAGAAG GAGAC CAAGGACTG G GCAAGGAACAGAAGCAG G CGACA CGG CCTTGT GG GAGOACGACAAGGAGAAT CGGC ATGT GTAAGGTGAGAGGAGT CACAAGGGGTAGTTCTAATGACA CATAGAGTGTTGTTGTGCTCTT(N ) xGTGTCGTGG CACTCTGGATGCAGACACTGGAATGGATGAGAGTTTTGAAGAGGCATTGCATTTCTCTGGATGCAGACACTGGAATGG ATGAGAGTTTTGAAGAGG CATTGCATTTCTCTGGATGCAGACAC TGGAATGGATGAGAGT TTTGAAGAGG CATT GCAT TT T ACAAGTAGTTT ACTGGC AC CCC CTGAGAG AAGATTT C AGTTTAGC AGGAGC AAAT CC CAGACTGC AC AAGG CTGA GAAG GATG GGAGAT GAGAAG G GG GAGGGGAGGCACAAAAAAGAGGGAAAGGT GTGG GAATTC CAGAGTTGTG GTGAAA TCAGATCGCCTCTTGTTTTT CTTAAGAATAATTGTGATGTAAA CA CAT TTGCATGTTGAAGAGAAGTCAAAAGACAAA AAAAATACAT CAGACGAGTAAT TATTGACAAAACAAGTTAAAAGAG GAAGTGATAAAGGAAAAAGGAAGTAG CTTCAA GGAG GG GAAAATATATTT CCTTATTTAAGACAAGAGAGGC TA CACATATCTGAGAGGTGTTAAAAGAT GAAAAAAAAG ACAAAAGGGCAAGAGAAAATGACCATGATGAACCAGAGAAAATT TAATGTAGAAGAGT CAATGT TAGC TGAAATTC CA TC CAATGATCTC CATCT C A T { N ) xTTATTTTTAAGAAAGGATTTTGATCTCAGGAAGGTAGGTGACAAAGTCATGGGC TG CCTGAGAGAGGG CTAAGGTG CTGAGTAGAAAATATTTG GAAC TGTT GTGAGAAATTAAGT TTAT TATAAC TTTG GA GAAG CC CTGAAGTTAGAAAG CATGAATTGGTAGTAGATC CAG CTGATT TTTTAA CAGC CCAAGATAAGAAATGG CAAA AAGCACAGTCGT GTTTTT C CAGTCTGATG GGAAGT CAAGGAACAAAG CAAT CTAGGGTA C TGGT GAGTGT TACTGAGT TAAG GGAGCACACTTGTCT CATTTCTCAGCCTGACATAGG TGACTTGTCTGGTGCTCTGCAGATGCTGAGTAGCACAG TATCTGG CACAGAAG GGT CAGG CAATAAAGATTTATTTCACTGGATTAAAAAAAGG CATT CTTTAATTTTTATGATGG ACTCTTTCATCCACATTCAGCT CTGACACGTAG GAGTCACTTTTCTTTCTTTC CTACATT TGTT CTGT CGAT CTGATT TAAAAAAAAACAAGATGTTT TC GGAAT CATC CCTGTGGAGAGATAAGG TGGT GATGGCAATGGGAAAATG CAAAGTAG AAATTAGAAACAAATACATGAATATATTTGCAGTAGCATTTGAGGAACTGGGAACGTTTAGTGACTGTGTGTGTGAGT GTGTGTGTGTGTGTGTGTTTTC TTCTAAGCACAAGTGCTGATGTTCTGAGAT GAGGTAGTGCAT CATGAATCATA CAC AGAAAGGATTTATGAACCAAAAAAAAAAAAGAATGAGATG GTGG GCGAGGAAAT CTGGGGAGAATA(N)xTTACGTGC AG CCAAGATT GACAAC CCTTGAGCAGAGCTCAGGAGACACAG CC TA G G T(N ) xAATGATTTTTCACCTTAACGTGTGA GGAGAAACAT G G CCAATAATAC T CAATATTCATTGCTGCATAAATAGAGT CTTT T CAGAACT CC TGGAGACTTC GTGG AAAATGG CAC CCTGGGAAACACT CTGTGAATTAACATTTGTTTTATATTGGCAGAC TGAAGATT GT TAACTC CA C AAA CTGTACATATTT CT AAATTGGG CTCTTGGAGG ACTTCT AAAC CTTC CAAGTAATTG AT CTGAAG CT AAGGGC AT AAGC TTAGCCTTTATGACAACATCTTGTACTATTCTACTGCCC( N ) xAAAAATATAGTACTTTCATTAGGACCTGAAAGAGT TGAT CCTAAGTCTACTGACCAG CTTTCTTGCAGTATAATC CACAGAAAA CTT CAAGAATAACTGAT TCTGTTTT CCTA TC CTTG CC CCTTGATC CC CCAG CTGAGGAAGCACTGGTGA GAAT CACCCCCTCTCA CTGATGTTAAGTAT CAGGTGAA ACCTTTATGCGTTT CAAAGG CT GAAGAAAACATAAAA CAAAGGCGG C C T T ( N ) xAAAAGGCGGCCTTATTAGCCTGCT TTGGTGCTGATC TGAATTAAAAGCTGAATAAATGCATAAACATAA CAAGAfiCACTAGAC AAAAT CCAACAGAAAAGAG ACATACATGT CAGAAACATAG TTATCTCTAGTGTGCAAATTTGCGTG G CAT C CTAAGGGT CTGGAATGAATG GAGG CT CCTC TT CAGT CTTC CTTAGCATGAGCCAAAGAATT CAACTTCATTGAAT CAACAGTGGTACTGGTGGGTGAATATGTA GG GGTAACAAGTGTAT TCGAAACTTCTCTAGGATA CGGAATTTG CACAGAA C CGTTGAAGAAGTAGTGGCAG GAATAA AACAGAAACACTAAA CA CAG GAAAAGAGGTAGGTC CCCAGTATT TGGAGT TGATGGCTACTGTTTTATGTTGCGGGAT TCTAACAAACATTATGAATTACTTACAGATCCATTCAAATTCTTGTTCTGAACCAGTGGGACTCAAGCTTGGC(N)xT CAG GTACAGC CAAGATTGA CAGC CATGGAACAGAGCTA( N ) xAGCCGAGGTGTACCATGTTACCATGTGCTGAGACAA GGAAACCATCCAGCATTCCTTGTGGCCCTCTCAAATCAGCGACAGAGCTCCCTGGATGCTCTTTTGCGTGTGCTTGAA GGGCAAGTTGTG CATT TTATTTGCT CAAGTCACTGAAGC CTGGTGAAGTGGGACAG C CT CTT CT TTTC CAGT CT CAAA
a t a t a g g g g c t t c a g t c t t t g c t g t t c t t g t t t ( n ) x t t c t a g a a g c c a t c t a a t t t c c c c t g t g g t g t c a g a t a a t c CCAAAGAACATGGTGT CCTGGG CATACATGAGCCAGCTGC CTGGAGTCCCTCCTACCTACTGGCACTCCATCTATGGA GGGAAGCCTTCTCTCTGTCCTCTGCCCCATTCTGATCTGCGGCCTCACTCTGCGCTCTGAATCTGGCCTAATCCAATA AGCAGGACCCACAGGGCATTGGGAATCCTGCCCCAGATAAGGACCATCATGCACAGAGAACTGTAGTGGCCAAGGTAA TATTAATAGT CATAAGTCAT< N ) xGGGCTACGGGTGGGATGGCACTGGGGGCAAGAACGCTGGACGCCGGCTGTTAAA TGGTGTGGTGTCCACT CCTAGATGCAGCTGATGGATG GCGG C CCTGGG CAG GTTGCTGGC CTGACC CCACCCTGGGTG CCTGTC CT CAGATG CT TATCTA CAGAAAGATGGGCTTTGT CTAGATCACTGCAGCCCTTTCTTGCTCTGCTTTTCTAG AAGT CT CT CTGTGACT CATG CAGCTGTGCCTCCTTTCCCAGGACTCTG CC CTGC CTGGTGAT GACACAGCACGTTCAG CACATTTGTTTCCAGGGAGAGCACTCTCCAGAAGGAACCAGGGCCAGCTCCTGTGGCGCCACTTGGTCAGGCTAGCTG GAGGTTGTGT CAACGT CT CGGAGAGCCCCTTGATTACAGTTTATAAGC CGGTTG TAAAGTGC CCATTCTCGGGC GAAA CTTGGCTTTCAGTTCT CTGAGATGG CAGAGCATTGCGAAC TGGTCACTTT GAAAGC TGAGGGAGAG GT GGACGGAAAT CAGGTC CGCTCCAGGTGG CAGC CCCGCCCTTGCTTGTCAGTCCCAGAG CAGC TGCTGTGG GGTGTG CC CT CC CAGGAG GGGATGGG GCAATG CTGCGAGCTTGCACCAAGCTGGGAG CAGTTCTCTGTAGCTCTCTCTTCTCCTAGGG CCAGTAGG AGTG GGTGTG CTGATGGGGG CT CTTCTTACAGfiTTTTTGTTATGATAG CTAC TAATAT CATGAGTTGT TGCTACTGAC AATACATAGGATGñTGTGTTACTTTCCTCTATGACGACTGCATTTGACCTTGAATTAGGAAACTTACATTGGAGCTGG CC CTTT CATT CGAATGG G C CTCATTCCTCATCTATTA CAT G G GAAGAAG GAGTGAAATAAAC CC CT TTTC TTATCCTC TG TGAAAATT CTA CAC CAAACACAAGAGCAATCACACAAATTGAGAAACATG GGGG CACAGAGAGT TACCTGAACAAT ATAAAGAT TTTAGTTG CTGCAGTGATTTATTTT CATA CAT TT A T T T T A T T T A ( N > xAATACAAATATTTTAGCAGCGT TGG CATAG CACTAAAAATAACAT CTCAAAAATCAATAGTGTTTTGAATTGATAAAC CT CC TTTGGAGGAATT TAAGGC
A(N)xAGAATAAGAGAGTGGTCAGTTGAGACAGAGAGAAAGAAGGAAAAAGAGGAGAAAATGCAATTTTGCCCAATTT GCAAATGACCCAGAGAGGCTCAGAGAGGCTTAATGAAACATCATGGGTCTCACAGGTATCGCATGGATGATGATCTTA GAACCTGGATGTTTGGTTATTAGTTCAATTTTTAGTTTTCCAGGCTACACTATGATGTCACTGGGTGATTACAGCTTC AACCAGAGAGGGGAGGATATATTTTTAATGTTA CAGGACAGC CATG CCTGTAA CATTAAAAAGGGG CTCTCCATGCCT TTCAAGAGGGTTTTTCTTCGTGGACACATTGTTGGGATGGGTGTGTCAGTCTGGGAGTGGGGACTGGGAAGAAAAGGC AAGTTCTGGCAGAGGCCAGAGTGGGGTAGATTATTCTCCCAGGTTCTGTCGGTTTCTGGGCAGCTGCTAATAGGAGGA TTTGG GTTGTAAGACCTGACTCG GTTGTAAGG CCGAGAGTAAAG CT CT CACCTGTGCAAAAG CTTT CATT GCCTGGGA ATTAAAGTTTCATGGCTCCAGCCTGTGTGCGTGTGTGTGTGTCTCTA(N)xTCAGATATAAAAGAAAAAAGTGTGATT TTTCCTGGCTGTGAATGAGGGATCAGTTGAAGAAATTAGAAACTAAAGTTG(N) xTTGGTGAGTCTTTAAAGAACTAA TTTTCTTTGGTGGAGAAGACAACAACAACAAATTGTGTTCATTTAAAACTGTGTTTCA{ N) xTGTATTTCAATTGGAC TTAGTCTTTTTGTTACTGGAAAATTGTACAATACATTTCTGGGAGCAAGAACATGTTACTGCTCCTGGTGCAAGGTTT GAATTACTAA GATC CAGGAACAAAGATGTATGTAAACTTTGATTAAAAGA CAGAAATGAACCTGAGAAACAGTCTGGC AATTAGAA CCAAGC CAGCAGGGTA CTGT TATT TAGCCTGGAACATC CTTT CCAAGT AAGTGGGGAGAAGGGCGCTGCT TCTTAGTGAAAGGAGTATATAATTAATTGGTCTGGTGGTCTCCACTCAACCAGTGATATGTGTAATTGGTTTCTCACT GAGAACAAAGGAAAGCACACAATAAATGCTAGCTTTGTTGTTGTTGTTATTAGCTCTGCAGCTGGCTAAGCTCTGGTG AACCCACAAGAAGACCAAGAAAGAGTAG CTGACTTCAAATCT CCTT C (N) xATGTTGAGCAACTGAAGTGAAGGTTCA CACTCCCCAGAGGTTCTTTATGAAATGATTGCTGAATGGAATGGAGTCACTGGCAAAATGTACTGCCTGGGGGAGAGG GGCACAGATGGTCAGGCATCCTCAGAAGAAAGGAGGAAGGTGGAATGAATAATGGCAAGAAGAAAAGTGATGAAACAA TAAATGAGAGTAGTTGGATGATAAAAGAGTACAAAAAAGTGGAAAGGT TAAGAAAAATCTTC TATTAAAC CTC CGAAG AGGAATTACAAA3CACCCAGAGTTAAGCCAAATGATCAGCAGACAGGCCTTTCCAGTTGATGTTTTCATTTTTAATTT TTTTCTGTGACATCAGAAATTCCTGCTAACCTTTCCACTTTAAAGCTGGAGATAACATGGCCTGAATCAGGCAAAAAT CCTCAACAGAACTAGGATGATGCCATCTAGGCGGCACAGTCTCTAAGACTAGAAGCCATCATTCTCTTTGTATCAGAT CCAGCCTGGTCCCACTGCCTGCTGAAGCTGCACCCCCAGAAGAGACAGAGCCCCTGGATGTCTCCAAGGGGCAACAAC CAGCCCATCCTCTGTCCTGTGCCTGTAAACTCATTCATCTCCATCTCAAGTGTTCGCCATGATCCCAGGTTCCAGGAG GCCGGGAGGATCAAGAC-TCTCAGAGAGCAGAAAGCCCACAGAATCTGGCTACCACCGGGAGATCAGTT3CCC( N) xGA GACCAGTTGC CT CAATTGTCAGTTTC CT CCTCTCTCCCCTTCTCAGGT CTAT CATGGCAGAATTTGTC G CAGAACATA AAACTACAGAACACCCCAGTTTCGGTGTGAGAAAAGAGCCTAGGCATCTCCTAATTT (N) xTACCTAATTCAGCCTGA TCTGCTAATCTCCTCTGTAACTCCCCACCAAGAGGAAATTCAACCTTTTCATGAACACTTGGAGGGACACCCAATCCA TTTTTGGAAACATCTAGGCGTTTCTTGATTACCATGCTCTGGATGGCCAAATCGGGGGGTTTTCATGCAGTATCACAC CTGGACTTCGAGCCACAAGTGAACACTTCTCTGGGCGCCTCACTTCTGGCTCTCCCTGGATCTCACTGCAACCTGAGT GCCAGAGC TGTGAAGACTGCAGCAAAGT CCTC CAACTGCCACGGTAAATATTATCATTGC CT CT GGTATG CAGAAAAG CAAGCCGAGC CG CATTTGCTGCTT CCAGGTGG CAGACACAGAAATGAGAG CAGGGAGGGAGGGG CTGG GAAGGTGTCT GGGAAGAAAGAGATCAATAAGACCTCACTAGTGTGCAGTATTTACTGTAAACAGCCCAAGGTGGTTTGGGGTTGGGGT GAACTG TCAACATAGAACAGTTCTTT CT CñGGGACTGTGGGAGCAGGTGTTCTAAAAGGAAGGA TGAAGATAGCAGTT GCAGTGGGATGGGCAATGAGACCCTGGGCTCAGGGAAGTCACTGAGGGTTAATGTGGCCGGGCAGCCAACGTCACAGC ACACGAGG CC CCTG CCACATCTTACC CACACATTTCAAGATC CTGTATCGCCACAGTGTGTCATTACTTTGAGACCAT GGACTACTTTCTCAATTCTTCAGCAGAGAGGTTTTTTCCAAAGCACACAAATGCTAGTGGTGGAGCTGGTTTAAGGCC AAGAAT CCTCAACTGACAAAGAGC TCAGTAAG CCCTG GGTAGGATAAAAAATGCAGGTGTGT CTTG CC CTAAAACATC TCAGGATGAAAGAACTCGTTGGAAGTGCCTCTCCATCCAGATTTTGTCTAAAGCGTTTAAAGGCGGCTCTCTTTCAAC AATTAfiGCTGGTGTGGGATGGAAGAGCTTTCCTGCAGTTAGGGAAAAACCTTCTCCCTCTTCCTCTTTCCCAGTTCCA GTTGCTTTTGAGTGACAGGCTAAC CTGAGC CCTGAAGCCAGATT CAGAATTACTATGGCT TTGAAT TT GATCTCTTAT TGTTCTATACTGAAAGAGCAATATTCTTTTTGAGGTGAGTGAGTTCCCCATATTGGAAGCTGCTCTGGGTTTGGGGTA GAATCTTCAGGATCCAAGGGAAGAGGACTGAACTCAAAGGAAAAACAACTTGGGTTGTAGTTGGTGGTGAGACTACGC TAACTTTGTTTTTTTTTGTTTGTTTGTTTTGCTCGCCAAGAAAGCCTCTCCTAATTCACCAGTTCCAGAAGGCTGGCT GCTTCCCTGGCCACAGGGGTAGGTATGTATTCAGGTGTGACCAATCATTCTGTCCTGACGTTAGTGATTGACCCAAGG GTCAAACTGGGTCTTGCCAAGGTCCGATAAATATAAAGTAAATATCCATGGTTGCTCCATAGGAGGCTGTGAATTTGG GGCTCTTGGGCACCGTGTTATTACCTGACATGGGGAGACTTGGAñTGAGGCCGACAGACAGGTGGCAGGGATGAAAGA TAAATG CAGG CG GG CATGAAGCTG CATGACAGGGAGAGT CCT GG GAGT G (N) xTGATGTCATAATTTTCATCACCGTG TAGCTGAAACCTGAAACGCTCAGCTGAAACACTGAATGGCCCCTGAAACCTAGTGACTACAATCAGTTGCAATCCTGG CATTCCGGGTGCTGGTCCCAGCTGTTCTTTCCTTTATTCTGCTGGTTTCAACAGCGGCCTTCCCAGCTACCAGAGCCT ATTAATTC CC ACTTTTGG ACAGGC TAACTTGACTTGTGTTGG AGTACTGC C ATTTTTAAA CCTGGAAATTTGGAGGCG GATCTGTñAAATGT CT AAAGATGñAATACATTGGTTTCCCCC CC CC CG CC CAAGCTTTTG AT ATTGGTTC AAACTATG GGCTTTCAAGATGCCAATAGCTGAACCCAACACAAATGTAGAGCTTATTCATAGTTTTCATATCATTGTCTCTTGTTA AATTGTACATTCTTAAATTTTTTTGCTACATTCTTAAGCCTGAGTTTTGGTATCTCTTAGGGTCAGCATTCTCCTGTG GGAATTTATCTCTGAAATCAAACC TAAT TAAGAGAGTACATGGGGTTCAATTTAGATAACTG CTTACAGT CAACTTTT TCAGATGGGTAACTCATAAAATGAGGAGTACGGTGGTGTTAACACTCCATTTCCTTATCATCGGGGTAGTGTAGGGCT GAAGTGGAATGACCGACAAGGAAACATCTGGAATCAGCTATTTTGTGGCAGTAATTACATATCCTCACCCACTTCTGC ATGGAAGCTTCCTCTAGGCTGCAGGC { K ) xGAGAAC AACAGC AGGT AGT CC (N) xTCACAATTTGCTTTTTATTTAAA ACATGAAATGTACATAATCAATTT TA TTA (N) xACCAAGGTGTAGAATCTGCTGTTATCTCTCTCAGAGTAATCATGG TTGTTGGTTTC(K ) xTTGTTTGTTTGTTTGCTTTCATTGGTGGGTGAGGTGTAATCTCATGTTCTTGCATAGTTTCTG TTTACTTCAAGT TGCTTTTTTGTTTGTCTCTT CTGGAAT CCG CCTCAGATGTTCTGTGATCTTTGGCTGTTTGCT(N) xAA CGTTT CCTCAG GAAACTC CAG CATCGGTAT CA CTAACATGC( N} xTGGAGGAATCAGAGCGCCCCATCACCACTT TTATTAAAATGATAGAAAATCGACTGTCTGCCTGCTAGAGAGTGTGTATAAGCACCTCCACAAGCCTGCTTCACTTGC TTCTCACAGC CGTTTTAGATAGGT CAGGAGGC CTTCATATGC CCAC GGGAATAATAGGGCGCTTAG CC GAGACCACGT CGCAGGTCTCTTGACATCTAACCCAGTGCCATTTCCACGTC-TTCAGATGCTATCCTGGGCTATTCCGAGGACAACACT TGAAACGGGAAAGAATGATGTGCCTGGGCCCACCAGAGATTATGGGAAACCTTGATGCTGTTTATCACCTTGATTCAA AT TT CATG TT CACAGC CTGAGT CTA CAAAGAAGAGAAACAGAAGAGGGAAGCCTGGTGTAAAACTCTTTCATTC CATT CTTGTCTGTCTTTATCTGCTCCAACTGTGAGGGCTGCTTACCCTAGAAGCAGATCAATGCAGAGATGGACACATTTGC AGCACCTACAAAGGGTGAGAAAAGACAGGGGATTCCCAGGCACCGGAAGAATGAGGAAAACAGTGACCTTTGTTCCCA GGCTACAGAAAAATGCTGTATCTAAACTGACATCGGCCGGCTGGATGGCATTTAGAAGCCAAATGAACTGCTGTTAAG CGGCGAGGTCAGGGGTTCAGCTAATTGTGGAACCACGGGCACTCCCCACAGACCCAGAAATCTTTGTAATCTATGGCT TAGG GACATACATTGTGAAG GTGATTTTTACAGTTCATCTTTGAAAATAGAAAGACGCGGGAAG CT CTGG CTGCTACA CATACTGATAGTAGCCTTGCCTGGTGTCTTAGAAACATCTGTGAAATATAGCTTTCTGGCTTAAATAATAAAAGGAGA TAGAAAGATAAAAAGAAATTACAACAAATACAGTGAATGAACCAGCCTTGATAAGGTACCTGCAGAAAACGACCTTAT TT TCTCTC TC CAAT GCAT TAGAAT CCATGTTGTAATTTTGTACATTCTG CAACG( N) xTGAGACATGGGGTGTAGAGA TGAC CAGTTCTACATCTCTC CCTT CC GGGACCTTTTGGATTGTGAATCTCTTGATGCAGG CAGTGC CGTGTTCTGGGC AAGCAGCTTTATGAAGATTCCTCCCTCGATCCTGGGGGTTCTGTGTGTCGGTCGCTCTGTTGGGTCATGGTGCCTTCG AG GAAG CCAT CTGTGCTG GCAGGGTC CTC CATGGCTTCTCATGTGGTTCATAAGAGACTCTT CTCTATCCATAG CAGT CC CACC CCTCTCTCACTCGGTT CTTCTAGGTCAGGCCAGTGGTCACAGTCTCTGCTGTTAGTTTTC CC CAGAAATAAG TCAGGTGCCAATGTCTGCACTCTGCACAGTTTTAGGGGATGCAAATCAAGTTAAAAGTGGCTACCCTGCTCCCCTGTG TTCCCAGCAGCACTGTACATAGCCTGATGGGAAGGGAAAAGGGGAAAAGGTCCCCTCTCCTGCTTTGTGGGGATGGGA GGAAACTCATCCTGCTGACTTTGCACAAGCTATACCCCTGTTTGGGCCTCCTCAGTGTCCCCTGCTGTACATGTAGGC AGGTGACGGCAT TTGACT CTTTTTTTTTTTTTAATTTACCC CTTTGATGCATTTTGAGTC TGGCAT TAGC CTGG CT CC TGGT GGTTTGTC CCTGGC CTTT GG GAG CCATAGTAGAATGGAAAACATT(N) xTAATACTGAGCTTTCTCAGCTATCT ACTGCTAAAGAAAATAATGAGCTCTTTCAGGTGTTTCTCCAGGGCTGCCCCA( N) xGAAGAATCTTAGCCAACAGTAG GGTG GAAAGAATTC CATTTTAAAT CCTG(N)xTTAAATCCTGTTTATAAAAGAAACAATTTTAAAAATTATTTCAATG CTTCTGAAAAGAGCTGGTTATCTGTCTTGCGTGTTATTTTTTAAAA ( N) xCTAAGTGTTGATATGCTT GAGGGG CATG ATTTGATTGGTGGTATCTCTGCATTTGTTATTTTCTCTGTCTGAGTAACAGTGGGGCCTCATTGTAGGTCCTTAGTAA CTTATTGGCCTAAGTCTTTCATTGACATTATCTCTGTAGACAGAACCTTCTTGCAAGTCCTATTCTCACGCAGCATTC TTGATCAGTTACGCTTGAAATGACACTATCGATGTGTTTGTTTCTTTCCT (N) xCCCCAGGGAAACTCCCAGAGACAG CATGGAACAACTTGCCTTGTGCACTCCTTGCAGAAAGAGGAGGATTCCCCTGTTATTGATAAGAGATAAGAGATGTGt N)xTTAAAAAAAAAAAAAAAAAGAGATGTGATTGACACCTCCGATGCTGAGCTGGCTCCCGGTTCTGGCATCCTGACT GT TG AC AC AC AG AAACTT AAAGGT GT ACTAGAA CTTGAG GTTTTGTAAT CCGAACCCTAG AGTTTG CAAGTGAGGC CC CTGAAACCTATGGAGCTCGTTGGTTGAGGAGCGCAGTGAGCGCAGGGTCACTTCCAGAATCATGCTGGCCGCCTTTGG CCACTTGGTCACACTCTTCCTCTCTCTATGCTCGTTTCTCCA( N) xTCATTTCTTCATCTCTAAAAAGGAGAAGAAGG CAGG GC CAGGAAACTT CT CTGTTTGCTCATCCAACAAACAGTTGTTAAGGAAGCGACCCCTTGC CAGACCTC TGACTA GGTG CCAGAGTTAG GAACACAAAT CTAACAGTTCCTTTCCGATGGAACCCAGTCTGATGAGC CA CATACTTC C C T (N) xATACTGGGGAAGGCAGGTAGAGGCAGGGTAACTTGGTGCCCCATAAACTTGGTACCTTCTACTTCCCTGGCTATTCC TAGGGT CT CAGGAACT CTAAGACT CAGTGGTGTCTCAGCTGCAAAAATG CAAAGTAGAGC C (N) xCAGTCCACGTGTT CATGAACTGCCCTTCAGCTCTCTCCAATCCAGCATAGCTGTCCACATTCTTCAATCAGCGTGGATGACAGCAATGGAA CAGACTTGAAAGTTTAGGAAAATTCCACTTGACTTGAAAGCACCTTGATGACAGATCATACGATGCTTGCCTTCTAAT GGTCAGGACTGAAC TT CAAGAAGC TCTGGACACTGACTGAGGCCAAGCCCAGTCAAGGG GAACAGAAGGGAAGC CATC TGGG ACTT CATC TCAGTCTG CC AGGAAACTGACATCTTTTTGC AAGTT AACAACCAG GGAAT CAGAGG GAAATT AC TT TTATCAGAGGGTTATGGCTTTGCCACTAGAAAGGGAGGCTTTCCTGTTTGCCGAGCCTTTAGCACAGGTCCAGCAATA ACATGAAGGAGAAAGGCTGGGAATAACCCAAAGCAAACTGGAGAGAAGCTTCAACTTGGAGATGGGGTGGGAGTTGGA GGAGAGACTGGAAAGGAACAGGAAGAGAGGCAGTTTTGGACCAAGTGTCAGAGATGGAAGGGGAGGAAAGGAAGAAGG CTGAACGGTCTTCTCACTAAGGACAGTTCATCATCCTGTAGACTCATTGCTCAGGGACCTGGGGCCTGCATTCCGAAC AGTGGC GAACAGATGTAG CCTG GAGGGCCTGCGGATGACCCTGGGAAAGATGCAGCTGGTGGTGGAGGAT GC CCGCGA GGCAGTGACCACCTGAGCTCTGGCACCCGCGCTCCCCACCTGGCCTGTGTGCATTTACTCAGCCCACCGATAGCATCT GCAAAAGCTCCAGGGGCTCCAGATCAAGTTCCCAAGGAAGGCTGACATCAGACCCAAGGGAGGAATGCTTCCTTCCCG TT CC CATT CGAACCAACAATGCAG CCTGTGGCAGGTGTCCCTGCGTCACAGAAACCACAAGGACAGTGAGAC CAGG CT GAGTGCTCCTGGCCCTTGCCCGGGTGCCCTTGCATTCGTATGTTCTACACCAAACCTGTGAGCCCAGCCAGTGGGAGG AGGAGGAGAAGGAAGGTGAGCGGATTGGCTCTAACTTTCCACCCTGCTGGAAAAGAATCCAATCCCTTCTGGACGAAT GCCACTTTCACAGATAAAGGAAGAATGTGGAAATAGCAGTCAGAGCTGAACCAACAAGAAGGAAGGTCGGCCCAACGT GTATTGGATGTCTACAACATACCAGATATTGGATCTGACAATTTACGTTCACTCTAGTTTATTCTCAAAGATGAGACC T G ( N)xTGATGAGAGTAGAACCCAAATACTACCTGCCGACTGGTGTTGGAAGCTGCTTAAGGATACAGATGTCTGCTT CTAGCTCTGGAAAAACTGATTTAGAAGTTCTGAGGTGAGGCTGAGAAATACGCCTCAGCAAATATTTCTAAAGCCCTC CTGGTAATTTGGGTCCTTAGCTGGATCAGGCACCAGATTGTCACACGGCACCAGCCCCTCCCACCGACACCTCTGACC CTGGTGAAAG CAG CAT CAGTGCTTGTAAGAAGG CCCGAGACCTGAGATTCCACCCACCAGAT CTGCCTTGAC CCACTG CAGT TC CC TT CATC CTGGACCCCGGT CTCCTTCAT CAGAAGGACATTGGAACTACCCAGT CTATCTGCTTTC TACTGG GTTGTATAGCTCCCTGCAGATGGGATCTGGCCTCCCTACAAACTCCATCAGAT(N)xGCACTGTGTTTTACATACA(N ) xTCAAGAACTCAGAATGAGTTCCAGAATCCTGAATAAGTTGTAAAAAAATTTATTAACAGATTCCAAAAA(N)xTGT ATGAGGCTGAAAGTTAACCCTATAATCATGTTGCCCTTGAGTGATTTCñAACCGTGGACTGCACAGCTGACCATGACC AGAG CCTACC CATCAAGGTGACAC TGATGTAATTAAAAAGGAAAGA( N) xAACACAATACTTTTGTATAGTTTTAGGC ACTTCTAGATATTCATTAGGCCTGTTGCCTATGTCCTCTTAACTTTGAGTTTCTCAAACTGTTCTAGGAGCCCTGATT TCAGAATCATCTGGTGTAATACAACTT (N) xTTTCGGATTTAGAAAGCACTGCCTAAAATTCAATGCCATATTGAATC AAAATTACACAT CAAAAATAATATTT CTTTTTGAAATTAGTAGCATTCAGAGCTCTTCAGACTT TT CTTTTTAAGGTC ACAGTTTTGTTT TC TTAATAGGTTG G GGGGAGGAAATT GGTCAAATAAAAATTATCG GAAATTCAATTAAAAAGTAGA AAGAGTATTTTT TTTTTAAAGACTTG CT TCAAGATTTCTATAATAC CAAAAACT{ N) xGATTTCTGTAATACCGAAAG TT AAAG AAACTTTG GCTT A C TCAAAG AT TTGATT TCTCT C TG AAGG GTGGGTTCTTACCTAGCT CAAACT CATAAAAC T CTTGT C CAATGAAGAAACAGGAGGTTGTTAT CC CTT CT C TGTTTATCAACTT CTC CAAGAT CTGACTTTT CAAGAAA AAAAATTACAGGTT CAGTTGAGAACAATGGAGAAACTAGG TTAG CC CATTAATGTTGCTTTTAGAAAT CTGAGCACTT AAAAAT AATG ACTAACTGGAAGATTGTT AC AG CAGTGT GGATTTGT CAAAAT CC ACTT AGCT A CTTGTG ATTTGTGTA CTGCCATGTTAA TAAATTATAGTT TGATATGAAATACACAGAAAAAGTTT TTAAAAAT GAGTCAGTGCTT TTGCGTAA TTCTACTCCTCACATTCTATTTTTTTCTGAGTTATCACAGAAGACAAGGTCTGTGTGTGCACTAACTTTTTTTCTGAT TTTGTTCTCACT TGTTTAGT CCTGTG CAGAAGT CAAAGGAGACATGGCCAGAGGGC TG CTGAAATATT CACCAG CT TA ATGTTACCTCTG GGAAGAAGGGATGGGCATGGACTTGC TATGCC TTCTGCTGTGTTTCTCTGí N ) xGGAATGCACTAT GATGGTTT CAAAGAAACAGTTGTG CACCACAT CAAAG G CC CTCACTGTGAAACCAG CTACCATC CAAG GTTCCA GT CA TCTCAG TGGTAACGGAGC CTGAGCAAAGGG CAATATGGGC CTCTAAGATTTTGTCTGGGCAG CAGTCCAC CGAT CT CA TCTTCCTCTATTATATTACCTCATCATTTGGGAGTTTGGAAGATGAATGGGTGAAGGAATGCATCTCAGTGGTAATGG AGCCTCAGCAAAGGGCAATATGGTCCTTTAAGATTTTCTGTCTGGGCAGCAGTCCACCAGTTTCATCTTCCTCTACTA TACTAG CT CATCAT CCAGGAGTTTGGAAGATGAñGGGGAGAAGGAATGCTTTGCAAT CATCTGCGATTGGAAACTGAG CC CAGACACTCCGAACTAC CGTCCTGGCTGGAGCTGTCTC TTTCACAGACATCTTT CTTTAG CAATAGTTG CTTTG GG GATTACAAAAAG GAACGG GAT(N)xTAATTTTCTGAAACTAGAAGAAAAAAAGGGACAGGTTGCTTTTCATGTACAAA AGGACCCCTGGCTAGCTTGCTCATGGGATAAGAAGTTGATGCTTATTTGTCTACCCTTCCTTAACCTGCTTCCCCTGG GAGGAAC CACACACAGTGGGAGGTAC CAAT CAG GACTCAC CGAACT CTGTGAGTTTTGTAATG GG GAAACACACAC CC CAACCAGCTGCTCCTGATCTGAATGCTTTTCTTGGGCTAAGTGATTTGTTCTGTGACGTTCTGGGCTACGTCGTGTAA TAAGTATTGCCAGATGGCAATCACAAGCATCATTTCCATTTCATATTTGTGTTGCCCCATATCTCTGTCACTTTGTGC T C C CAGAGTGGGTGAAGAGG CTGGGCCCTCTGATAGTCACAGACATAGCCAG GCTTTAAAAAT CT CACAC CAAT CACC GTAAATAATTTTAAGGCTTCTTAT C CTTTAAGAT CTGATATCTG GAAAGTGT GAGATTTTTTTT T CCACGAGGACTAT TCATAC GCTTGTTTTCTTGAG GACGGTGAC CC CCTTT CAAACTCA C CTGGATGATG CATTTCAAATTCTCAGAGAAGT TG CATGAG CCGATAGCTTTGAGGGAATTAAAAAAGACT CAAAT CAAAGTCGATAAT TAG CAGTTATGC CCGTGGATGT CAAAACACCTGAGAGGCTCTCAGTTGGCTTGGAATGTCCAAGTAGAAATCAACTTCAAACAGTGTCCTTAGCAAATTT GTAAAATGTGTATACGTGTGACTATG TG CTTT TT TTTTTTTTTT CCTGCTTGTCACTG TGCTGCAAATTGGTTG GACC CTAAGT CAATCAGTTAAAAAGTAGGCAGTTAG GAGAAAAATGCC TTTCTGCCAGCAGT GTGGAACCAC CC CTTAGC TT TAAGAACCCTGCTCGGGAGGAGATCAAACTGCCCCTGGATGAGCTGACTGTGAGAAAGCGAACCTGCCGAGTGCACAG GCCAGGGCTCATC CAAAAGTTTTG CTTGGCAGTTCTGC CCAGAACTATCT CTGTGTAGCTACTCCCTTCT GAGAAATG CAAATGTCATGCAGGCCCTCATCAAGAAAAATCACCAGCACTCGCCTTAAGAAAGAAGGGATAAATGCTCATATTATC CATTCAAATGTGAGTGTATTCAATGACCACACATGGGACTGAGCTAGATGGTATACACAATGAATACCAAAGCTGAAT TTCGAGGCAGGTGC CTATAATCCCAGTG CCTGTAATCTTTGTAATC CGG(N ) xAAAGCAGAAATTTTACCCTCCAGGG ACTTATAGTCTAATATTT CT CATAGAAG CAAATTATGAACATACAGT(N)xAAGTATTATATTCTTTATTCTTCTGAG ATTAGACATCAAATGCTGAGGGGAAATATGTAGATGAAGAAAAACAGTTGTTGAGAATGTGGAAACGTGCATCTGGCA CTGTCGTGATATTAGAGAGC TCTAGACG CTG GTT TTCTG C CTAAAAAGTGAGAAGGATAATTAGGCGG CCTTTAAGTC ACTCCTTCAGCTACGGGCACTCTGCAATTCTACTGCCTCTAATTCCCTACACTGCTAACAATTCA(N)xACAATAAAC CAGCAGTTACACACATCATCACAC TGTTGTGGTTTATAAGATAAATGTCG GCTTTCCCCAATG AATTATCACTATTGC T C TCAT GAAAAGGCATTC CTTATGTACTTCGTTG CACATAAGGC TGTAAAAGAAAACC CTAC CAAATACTTGAT CC TG TGAACTTATTTAAGAAGCAGTTGTAATCTTGGCCATGTTCAAAAAAGCTGTTGCGACGGTGACTGTTTTCTTGACACA TCTACATATCAGGCCAATGAAAGAAGCTGCTACCCAGAGTTTAAGACAACTGTGTTTAGCTACTTAGTATTAACCACT TGGCGCATGCAAATGACTCCCACCCTCACTCCCTCTACCCATGCAAATGCCAAAAAAGA(N)xAGGGGTCATGGGCTC TTTTTT CATTGG CC GTGAGAAATGAACCTT AGGTGAAGAATAGACT CATGTAAGAATAAGTAAAAGGAAACATTTAA C TAAAAG CAAGTGTT CAGAGACAATATAATACATTATCTTAGAAGA CAATGAAATCT CTTTGTTTAGCC CT CTTAACAG GGTACATG CAGGTT TT TT TTA A A T( N ) xACAGTACATTAGAAACCTCCTTTTTAGGTGTCCCCAGATTTTACCTCTCT TT CTCCTGAATGTGAAAG GG CTCACAGTATAGAAAGTGAG CATGAACAACTC CAAG GATCAGAAGAGAATGTTGAC CA GAAAAACACTTTGCAAGGA C CTGT CAGATATT CC CTCCTG CACAGAAAGG GAGCTG GCTGCTTC CTCAGAATGGTCTG TCTGCC CACATGAAATCGAGTACCAC CTAC CAAAAGTAAC CAAAGTTATG CC CCAAATTTAC CATATTGñGGAG CTTT GCTGAGTTAGATCATGGATTATATñTTTAGCAATGATATTGGCAAAATGTCTTTTTAAACAAATCAAAAATTTTCAGG GA CTTTGAGGGT TT CATAGTAATAATAACAATAC CAACATTAATAACAACAT CTAATGTTTAAT GACATTTACTTG CA CCAGGTAC TCTTTCTAAG GACTTC( N ) XCTTGTCTAATGACTAGAAGGTACAGCCTTCCAAATAGCACAAGGCCTGGG TGG CTGGAACCAGC CTCATC CACACTGATG CTGTTGTGTG CACAACñTGGTT CTATGTG CTACAAGGTGGG GCACTAA ACAGGG CGAGGGAAGGACAAGACAGAAT CC CT CCTCTGAAACATGAACAATGT CACAGGATACCTGGGAGAACATT TA ACTCACATTCCCAAATATTG CATGAGTAAATGTTAAAAAG CAAG CAAGGACCAACGTTTTCCAT CACTGG GGTC CAAG ACTCCCAAAGAACACTCAAGTTTAAT( N } xTATTCATTTGTAACGGTTACTGAATTCAGGTGATCATGATGGTTTGTT TTTTCAACTCCCAAGCGTTAAAGGTATAGCACAAAGTTATACATAAAGTTTTTGTTACTGGAACAAAAGTTGGGAACC ATGGGCTTGGTCAGTGAGTTCTTCCTGGTGTGCAGAAAAGCAAGAAGTCAGTATGGTTGAAGTGGTTAAGAATAATTT GTAGTTGG GCACAGTGGCTCACGC CT(N)xTACTTCATAGACGAGGTGCAAATTGAGCAGAGATTTGTTAT(N)xGTT ATGAAAACTGAGTAGTTCACCTGATCATTCAGGATTCACAGATACTTCTGGTCCAGCTAAAGTAGTGCAGTTGCTTGG CTGTATTT CATAAAGAGCAACATT CTGGGCTG CATTGTAT GGG(N ) xCCTTAGAACATCTCTCACTGCTTGAAATACT GTCTTCTACCCACTCGTAAGGATAAGGAAGGAGCAACTATTTCCATGGGAAGCGAAATTCCACAGTGGGAAGAGCACA AGATCTAATCTGGTTCTAAGATTAGAACAGCATTAACAGAACAAAGATTTTTGTGTGTATGTTGTGGTGATCATTTGA AACATG CACATACAGTGATCATATTGTT CAAAGCAAATTT CATTA CCCACAGTCCCAGCCACGGGTTG CTTTTTTAGA ACCTTCTTTGTTTCAGTCTTGCTGAGTTATTCAGACTGTTAAAAAGCTGAGTGCATTAAGAGCCCTAAATCAAGTCAT CTGT CTAGAATATTT C CT TTGAGGAAAGTT CCTTTTTT CATT CTTTCTCTCCACCACTCTTCCCCCCACC CTTTGTTA ATGAAAAAACTT CAAAGGGTAAATTGCTTTACGT TT CCT CTAAT GGTTATTGGAAAGAAGAG GCAG CTTTTTGAAATA CATGTCTCTCTGCGAATGCCAGAAAAAAAAATGTGCAGTTGTTATCGGCTCCTGCCATCAACATTCTTTAATTTGAAG AAGGATTCAG CTGTATGAATTAGGTTGCAGGCT CCAGGGTTGTGTTTCTCATGGGTGGTATTTCñTGGTGAAATAAAA CAGCATTTAAGAAAGGATATGA( K ) x
>Hs3_120597158-120615223
TTGGGATAATAAGA CATATGCATAC AATAC CT CAACAATG C AAGATGAGTTAAATTTAAAAGAT GCATTGTTAACCAA CATG CCTGGTTTTGGG CTACACTAATTTGC CCAACTGT CATCAT CTTTCACTTAAC TTCTACTCAGCTCC CATGTTTT TGCCATATGAACCTACTACCTGCGCCAGTGTGACCCCTTCCAAGCTGCCCCATACATAAAAGGTCCTTCAGGAGCACA TACCGGGCCAAT TAAT TT CAGTCACATC CCAA TTAATT TA CCAGGCCAGTAAATTCTC CATGAGGATTT C CCACAACT GTTTTAGAAGACT C CAATAATTCCTGTT CTTCGTACAT CTTT CTT CATAATCCCACATAT CTAACT CTAAGTACACCC TGGAAAAT TACAG CATGATATTTATGTATCTGTCAAATGTGT CT CTCTAAACAT CC CATACTGATACAAGTTTCTAGA GTTCTAAGTTCTTCTTAGAACTTGAGTTTCTAAATATTTTCTTGTTATCTTCTTTCTCTTGCAAACTAGTACCCAGCT AGAGTACATGTGGCAGTTGTTCAATTATTGAGATAGACAAATTCACAGCACTTTTCTTAAGTAGAAAGTTGGAGCAAG AGTGAGTGGGAACAGCTACTTCCAGGTCTGTGTGAAAATGGATAAATCCTTAATAATTAACTCATTGGAAGTTGTTAG AGCCTCCAGACATGTCCTAGACCACATTCAAAAGCAAGGAACATTCTCTTCAACATTGGAGAGTGTTTTGTAGTTCAA TAGAAGGCAAA CATTGGGAATATTTAAAGCAATGTAAAAAGTG C CCA CAATAACAGAGTT TTTAA CATTAACAACATT TTCTAAAGTCCTCTATAAATATTGCTACAAGCAAAGTCTCAACAACCGAGATCAAATCAGCCCAATATACTAAGGCAA GCTCCCATTTATACTT TTGACTTACTCAAGTAGTAAGAATAGAGTTTAGAATGTAAATT CAAAGGATGAAAGAG{ N ) x GATG CACTGACAGACTGCAAAGAGACAC CACAATGCTGAGAC CCAGAACATAGAGATATATTGT CT CAGCACTGATGG TTTATGTAGC CCA CAAAATCTTTGGAAACAGTGAGATTTC CAACTGATATTTCTTT CAGTGATGA CTAAG GATGATAA AATATTTCCAGTTGC CAAAGGGGTGGTTGTGGAAAT GAGAAACAGAAATAGCCCATATAC CAGTTACC TTGCAACATG CCATCTGCGTTC CAC CAATAAATGGATATC CT CAATTCTT CTGT TGTTGGCATAGT G CAAACGTTTGGGAAGGTGCTG TTTCAAGTAAG G CTTAAAGTGCTGATCTGGTTTT TTACAC TGAAATAGAAATGGAAAT CAGACT T CAGATGGAATGTC TTTTGGAAAAATTCTTACAAATTCTCTC TCTAGAAAGC TGAAG GAGATTTTAAG CCTAAC CAAAGGTTAATG( N ) xGA CCAT CG CTAAGAG CAATT CACACTCCAAGG CCTT TC CCAACT CCTTCAAGTCAAG CAGTTGGGGGTG GGGTGTCGGTG GGAGGGGAGGAG CACATGGGTTTAGGCT TATC TGATTAGATG CT CTTGTACCACTAAGGT CGCTACCCAG CATCCGGG AACAATG GAG CAGAGTGGGTGGGCAGTT CTGGTT CAG GGAAAGTAAAGATTTGTTTAACCAT GAATGAATACTTTTCA GTTCTAATGGAAGGATCAGATATTCTCCATTTGTATCATTGCCAAGACCATTAGGGAGCCTGCAGCTGACCACAGGCC AGAAGT TTTGAAAGTT CTTAAGGAAACAAATA TACAGGATGG CAGGGCAGTGAGGAAAGTGTTC CT CC CACCCCTCCC CGCCGG GGGAGG CACACACAGGAGGGAGGCTG CTCCAGGGGCAGAGGCCTGGGCAGGG CAGAGG CG GGA CAACTGGAA ACACTTACCGTGAGAT TGGCAATAATGG CTTTGGGGTCATCTGTT CAAAGAGAGGAGAAAGATT TCAAAAGAAATAAC AACCATTACCAAGAGAAATCATTTTTGCAGCAAGATACTCAATTCTTTCTCCTTCCATCCCACAGAAGACTATCACTT TAGTGTTTTGTTTGAAAGATAAATAATTGTGTAGGG GTAT TTAACATTTCAGTCAT TGACATGCAG CTAAG CAAGTGG AACAAATCCTATAGTAAACTGCCCTCCTAGGTCCAGTTCTGCTCTTCTGGAGGGAATTCCTCCTTCCATAGCACCAGG ATACTGCTAG GTAAATTG CTTCTTGTTT TAAGTT CC CC CTAATGGTTCCCCTTGAGACATAATTTATTTG CAGTTTGA CCTCAGGAACTAAACTGGGCATTTGGGGCTTTTAATGTAACTTCTTATTTATCTGCAGATTTTTGAAACAAGTCTTCA CCCACTTTAAGTTGAAACTTCTAATCCATTGAGTTATTTTTTTAAAAGGGAGTTTGTTTCCCTATTTATATCCTGGGT GTAATTACACAGGTAGCCGCACCTCTCCAGCCATAGACACACACGTGCATGCACCTCCTTTATTTTCAATAGCATCAA GTAAAAGATTTTAGTCAATACTCTGAACTAGCTAAAAGCAAAATTATGTAAATGATGGCTGTGCATAAACTGTGGACA GGATTATATTATCTGAAGCTCCCCTTTTGGGCGGAATGTGGGGAGAAAGAGGAGATTGTAGGTTGCTTGGATTTTGTT TTGTTT TCAGGAAGTTTG CTTCAGTATGTTATTACAGAAT CAGCAGAAAAGGAAAGTGGTGT TCAGAAGAGATAGCAA CTAT CATGAG GCAG( N ) xATTTGTTCATATTTGTTAAAAATGTATGAATAAACTTGCCTTGGGATATAAATATACAGA CTCCAAAGT CTATTTT CAGAATCTTAGC TCTAAAGCTTGAAATC CGAAACTAAAATGTGTATTC CATTGG CCTTAATT ACATCTCTTACTAGAAGGCAAATAAGTAATAATGTAATAATAATAGCCCGAATAGATTTCATACTATGTGCCTGTCTT TCTATAAGAG CTCTGCAGGT( N } xAAGCCAATGGGTTTTCAGAAGGAGGAGGAGTTGGCACTCTGACACTCAAGCAAC CTGATTGAGCAAGC CAGG CCTCATAAAT CATATT CCTCAAAGGGACGAAATAAAGTAAGT TCAT GGTCTTTGACAATG AAGAGAAGTAAATA GCATGTACTCAGATAA GT CC CAGAGAATTACTAGGAATAT CCAATG CCTGAGTC CATACTGGGG ACATCAATCCTTCAATTCTGAAGAACTTAGTCTTTGCAGGCTAACAGTCCTCATTTT(N) xAATAAGAGTGTATGTGT ATGTGTGTGTGTCTCTAT CTGAAATAGTGGTA TGACTAAGACAT CATCATCATATC CAACCTTTATATAG CTTTGACT ATGTGCTAAG CACT GATCTAAATTATGTAAA CACAATCTC TTGTTTAATCTTCCTAACAAT C CTTACCTT GCAGATGA GGAAGCAGAGGTA CAGGG CGTCACATGGTGAGTACCGAGCAGAG CTGAGATATGAACC CAGACATCTGTG CCTGTCAC CACC CCATGCTACTGT CT C CCACAAAATGAAACCACATTG GAAAGTGTTTTTGTAAAGTACAAAAATT TTTATCATTG ATAG TTATGAATTCATGGGCCATGGGAAGGTTTT TTTTTCTCCT CCAAACCAAC CTCC CCATTACATGTACTAATGTC AGAAATAT TGTTAAAAATAGAGAAAATC TTTTTTGCATAGCC CTGAAGTGCAAATACTAAAATCGAGGGGTGCCATAG TGACTG CATC CAGT CTATGCGAGGGCAGAAAAG GGAGGACACAGGGACAGGAAAGGTCATG GAGGAGAGAGGGCAGAA ACACCCACATTT TGTTTATTTCACAACATG GTCTGGAGCTGACC CTGGAACAGTTC CAGT TGTGTTGACT CAGGAACA TAATTAGGGAAAAACTTAATTTTCCAAAGCTGTAAGGTTGGTGTCAATGGTTTCTTATTCCCAAGGCTAAAAACTCTA AAGAATTT TTTTTT CAAAGTG CGGCATAAC CC CAATAAAACTATGGACTTCTCAAAAG CTAGGTAGACAG CACCTAAA AATCTTAG TC CACAAATTAAGTTTGTAT CAGTGGTAGAT CATAAC CTGAACATG TAAAA CATAAGTGACAAGATTATT CC TGTTTATTTT TATTT CTAAT CTTT C AG GA CAG CTGCCACTTC C CAAG GGCTTAATAT C TTTT CC TACTAGGTTC TA CCAATCACTTTTACAAAGAGAATGGACTCCATTCCACAAGGCTCCTAGGAACCCCCACAGTCCAAATGTGGTTCTGCC AACCCTTACATAGACCCTCTAGTTTCATCTATAAAAATGATCAGCCTGAACTTCTTATTGAAAAAATTCTTATCTTAT AAACTTCTGTTTATGGTCATGAATATGCTTCAAACTAAGTGCAATTTCCACATAATAAG( N ) xCCTTTGTATTACCAC T CTAACCACTTAATATTTTAAGAAT CAACTTACATT TAGCATTGTTGCTAAATTTGGAT C GAAT T CTTC CTAGAGTTC CAGG CACTAAAGTAATATCATC CACATTAGTTAGGTAATTACTOAAGAA CTCAGTT CTAT CACATGTGACAT CTTC CA TT CCTATGGGGAGGGAGAAAAAGAATATTT TCAGGCAAGACTAAAA CGAAAATC TTAACATAGA CACGAAAGATAACA TC CAGGCATCTCTT TAGTTAGTTT TTAGTAACTTAATGGGAAGAGC CCTGACTTTTGAGATTTTTGTATT TGACTTTA AATGGTTTAAGGCTTCTCTGTATAATCTTTTCTCTAGACTTTTACTTAGCCATATCATCTTGCAAGCACTAGAAGGTT GAAAGTGCATTACCTCGATTACATTTTAAGTCACT CAGC CAGAATAG CAAGACAAAGGAAGT CACCATGATTTAAACA A CAACAACAACAAACATAACTATTGACAGA GT CTTTATGAAAAAGATCTATT C T T T { N ) XCCCTAGGTTGCATCAAAC TCTTCCCAGCTGGGGGTGGTATAAATGGAATATACAGTTCTGTGTCTCTTTTACAAAATGTCATTTTCCAACTCTCTA ATATATCTCCTGTCTTGAGTACATAAT CAC CAGT CCTCACTGTCTCCACTGGTGGC CTTGGATGTTACTGACAAAACA ATACTGGCAACAAGAGATGAGAAAGAAAAATGTCAGTTCCAATGACCACCCAAACTGAAAAACAAAATAACACTGAAA A CAAGTTGTAAACAATGTAATTAGT CAGAACTGAAAGCTAG CAACTAAAG CAAGAAGGTT CTAGGTAGGGAATTTT TT TAAACAACAAATATTTCTC C TC CACC CCAAGATT TG TTGTGTGT CC C CAT CAAGGAAAATTC TTTTATTAAAATGC TT TTATTTAAAACATTGTTGTAAACT CAAATC TTAG CATCATAGAC TATGGAGG CTAC TAGAGATCTTATTGAAATGTAT GAGATGACCTGGAACGTTTTAATATATTGC TCAGTACAATT CAGTATGTACATATT CTCTAACAGG GAAAfiTAACC CT GT TG GAGAATA C TCATATATTTATAT CCTGTAAAGGATGC C CAAAGGTTT TAAAATAAG GAAATA CAACC CAGAGTAC C C CAGAT CTGGTAAAATTTAA CTAAT CACCTC CTAG GAT C CCAGAT CCAGAACCTT CCCT TGTT TAATA CAAAT TGGA ATGAAATGCAAGAAAAAACACACAGCATAAGGATTATATATCTTGCACTTTAACAGTTAAACCCTTAGGAAGAAAAAT AGAG CAATGAAGAG CTACTATG TTTTAGTACAATGT TAGCGTGG GAAGGGAAAG T C CCCAAATACCTATTAATAGAGG GC TGAAAGTCTTTC C CATTT GTTATATTTCTAGGAAGTCC CTTATG C CAC C C CAAATGTGAG CAAT CTCCTTTACTTT TG AAGTCCAT ( N ) xAAAACACTAAACCATACCATCTTTTAAGAGAGAGGTGCTGTA( N ) xGAAGAAGTGCTGTAGTTG TC CCAAACCAAA TGGATCTACT CATCTTGGAAGC CTA CACACTGACAAGC CAAGGT TTTAAC CTAT CCATGACCTTTA CAATGCCTCAGCACATATCATTGGGACCCTTCTACTTAGATGTCTGGCTCCAGCATGTGAGCAAATGCTCTTGACACC TCTGGGCAAC TTAAATTGAATCAG CT GTGACACATG CTCCAACT TCTCTGAC CTGTGTGTAG CT CT CCACTAAG CCGA ATGCATGGAGCATGACTTCG CAAAGGGTTATATCTAGCC CA C TCAAGAGGAGAAAG CTAATC CC TT TG G C{ N ) xCTAA TT TCTCATAGAT CTTCTAG GACATTCTGGAAAGACCTTCAGGAC CAGAG CTAGGGAGATT TG CCTACCATT CTTAACA GAAC TGCCAAAGAGT CAGCCAATTTT CATC CT CCGAACT C CACTAGAAGGGAGAAG CCC CAAAT TCCAATGCTACTGT TTCAATCTAATAGTTCCGGAATACAGGAGATACCCTTTAGAACTCAGTCTGCATGTCTCTGAGCCCCCACCCCCTGCA CAGCTCTACA GT CATATAATGTTTATGATG CT CT TGATCT C CTCAAAGAATT GATCATAAATTTTT CTACTT CATAAA GT GGGATTTAGAAAAACTACTT CCTATAATGCTT CTTCCTCTCT TTAACAGT CATCTGAATTACAG CAGGGC CTTC CG GTTTGGACCTTGACTTCCCTCAGTGGATTCGGGGAAATGATTTGACTTACACGCCACTTCTTTGACAAGAATCCCATA TCTCTGCTTCTT TATCATATTTAGATGTATTT TTTAACCATATAAACTCCATGGGC TCGT TTTGGGGATGTGGCTG GC AGGGATAGGAGATTATCACCTGATATGTCACATTCACAGAAGAAGCTATTAATGTCACAAGGAAACTGTCTGTGTTGC TT TAAAGAGAGACAGATTTT TGTAGAAAGGTCTTT C CTG CAAATAACAGAAATATATAAGAAGT CC CAGCTTTTGATC AAAATGAACC CAGGGATCTCTCAGGT CATC TCATACTTTT CAAT CATGGC CT GATT TCTATC CC CATCTTAAAGAAAG CGGGGATGAG GT TAAGAATG GAGT CTTGGCTT CTAAGGGCAAGGATTCTT TT CC TCAATATGAT CAGAAACATTAT CC AT CTATC CTATTAG CAAAGAAAAAA CAAAG CT TTA C CATG GT CT CCGACAAAGATGA CGTTGACACACCGATGCAGTT TTAGTTGTTT CAGT CCATC CATTñATTGCC CCACAñTTTTGT CGATTTCC CT CAGAGGATTTGT CAT CTAG GAAAAAG AAG CAAGTTAGT CC CCA CAGGGTTTTCTTT CTTT CC CTTT CAGT CT CTCATAGAT C TTCTAC CC CTAGAACATTTTGG AAAGAAAACTTC CGGCCCAGAGGCAGAGCTTTGTCTGGGGAGATTTGCCTGCCATTCTTAATGAAATCTGGTCTAAGC A CGTTATATTTGTATTCTTTTGAGAGAATTAAAACCTCTC CT CATC CAAT CC CTTTAAAGATTCTTTGATATAAAAAT GT CAAAAATATTGT TAATGAAAAT TCTTTAAGAGAGAAAATC GACT TTCTAATATGTGACAG CATC CTTTAAAT CCAA GAGTTTTGAGAGTAAACAGAAAGTCCCTAAGTGAAATCACATTAGATTTTGAACTATGGCAAGAAGATAGACGCTAAG TGAATTCAAAGC TGTTCTGAGGTTAAAATCA CAACGTTGCAC TG GATAATAAAT TAGAATAT CATTAATACAAT GAGA TCCATCTTGTTC TAAGTAATTTAACT GTTGGCAT CTGAAAGCAATAGATT CTTAACAGAAGAAC CTATTC TAAGAAAC AAG TTATTG CTT CT CACTTTAGTT CTTTTAAAGCTGATTT TGAGTGAAA CAAGAAT CTATAAGAGT CCTAACTTAAAA AGAAGTGAAC CATGAGTTTGTT CTAGAAGGTT TCTT CTTTAACAAAATATGAAAAC TCAATT CT TTTCTGAAGACCAT TT CCAAAGCCTAAAAAA CATTACC CATATCTCAT TTTCTTACTT CAAATG GAAGTTTCTAA CAG CATGAATGTC CT CA CATACTAGAGGCTCTCCATC CCCCTTTCAT CTTTAG CTGTGCAATGGAAAGATTTTTAAGAAACAT CTGATCAGACAA AGAATAGAAAGATACAGGGAGTATATTTAGAT GAATTTTATAGCTTAGCTAACAAT CACATAAA TATACC CAAGAT GA TTAAGATAAGAACAGAAAGGAT GTGAATAT CATT TTTACT CGGACACATAATATTATCTT C CACTAGCAT CTTCTCAG AT GAATCAAGAC CTTTCCTCCTGTTATTTCATTGGGAAATCAGGTGTAAACTTTTGTTTTTTTCTTCATGñCGGGAGT CAGTATTTCAAGTC CTATC C CAAG GAG GAAG C CACAG TTGAAAGTT TCAGAAAGACATGTGCAGAATAAT CG GAAG GG A CTAACGGGTGTGGAACAAAGTGGTATTTG CTTCTGAGAAAATCATGCT CTG CT CCAAGATC TCTGT CTAGGTAAAGA AATACCAAATGAACAATGTGTTTTGATCTTTGGCAATGACTTATTTTCCTTTACTCAGCAGGATGACTCAGGTAAAAT CCAGAAC CTACT TC TTAGG GAT CAGTT CAGAGGAT CA CAAATAAGATACTGC CT CAGAGGTATT CAAAGG TTTTAGGA AAAG CAACAAC C GATAGGCT CTTC TGAACATACT GG GAAGAGTAGACATTA CTT GT GGAAATTACATACAAC TCTATC CGCCCATTATTTTCTCTTTCTTTCTGTCTTAGCTATTTCCTTCTCTATAACCTACACCTTTTCTGATCTCTGAAGGTA CAGAC CC CTAAT CTAAGCCAA CTT CCTATTGAGT T CTAAT CCACATTGT CTTTGAGAATACCTC CTTTTAC C TTGCAG GGTTGTTCTCAAGAACCAGGGCCACAAACCATGTGACTGATTCTGTGAATAAGTGCACTCTTAATAGTCACTCCAGCC TAAATGTTAATACTAGTT TTAGGG CAA C TAACTC TTAAAAAAAGGAAAAGACTT CTTTTT TT CCTGTCAT GTTGAGTT CAGGCAGT CT T CACAT CAGTGTTAA C CT CTGAAGTACCACAT TCTTCTGGAC TCAAAGC CAAGGTCAATG CACCAGAG TTTTTCTAAAACTTGCTCAGCCGATTTTCCAACCAAGGCACTGAAGGCAACCTTCAACTCflGflCCTGCCCGCCACAAA GCTTTGGAACAATAGATAAACTTACTTTGTCCTGACGAGTTTCCGCAGCATAATGATCCATCCTATGTATTTTTCTTC TTCTTTTCTTTGGAGGAGCAACTGGTCTTTCCTGTCTCCTCTTAGGGGCAACTTTCCTCTTAGGTCTCTTAGCCGGAG TAAAAG GT GAGO CATAA CTACTCT CCTGTACTGG CGGATAGAGATGTCTGGACT CAGAAGT CGTCTGTCCCT CTGAGT CAGATATAAAGCTTACACTTTCC CAT CTGGGATCAG CAGAGT CT CAGAATAACTAAGGCT CCTTAAGT GAGAAAAAAT TGGAATGAGGGATGGATGATTAGAGGAACTCTATGGAGAGAAGCCTCCTAAATTGATGTTGATACTGTTGTCACAGCT TCTTCGCTGAGAACAAGAATCCAATTTAAGAAAATCTAGTCATGTGGGTTTCTTTTGGCTGATATTTATGGATGTATG GGAAGGTGGCATTAGACAAGCTCTGTATTCCATAACTCCTGCATTAGGGAGAGACTGGTCTTTGGTTATGAAAAGCTC TCCATTTATAAAGAACTAACTGAAAA GTAGGC TAAATACTAG CCACTTGGTCATTTCAAGAGGACAAT GAG GTTCATG TAAGTGATTC CAATTCAGAAT CTGACAG TTATCTTTCTGTTCACGT CAGCAACTATGTCAAAGCAACTGT CA CCTTCA TTGGCAAGGATC CAGT GAAAAATTTTAAAACCAAGGACGGAATGAAACATAAATAATAAGTAAT CCTC CTTAATCATA AACGTACT CC CACAAATCAGGACTTT CAGCAGGAGAGAGAACTCA CTGACAATTAAGGTTTT CAGTACAGAGGAACT C TAAGTTAAAGTC CTTT TAAAATTAGGTAAAAT CTAAAAAGTCAT CAAAAAGT CATCTCTTTTTTGTCTTGGTATATAC AGACCAAGAAAGAGTCT C CTTATG CTCTTGCTAT CAGAAACCTG CTTTTAAG GGGCGTTG CATG CTATGGGTTTTGTT TTGAAAAT C CAGAGAG CAGTGAAG TTGACACATAAACTGAAGATAACC TTTTGGGATTTC CAAATG CT GGAGGAAAAA CAGGCCCCTGATTTGCTGATGGTG CAAAGGTGAATTAAAGCAGCAAGC CTTGCTGTCTACCT CACTGACTTC TCTCGG AGGGTGGGTGGGGGGAATGTCTTGAGAGCCATTCAGATGCAGGAAGGGGCGGGAGGAGGAACAATGTTAGTGTATCCA AGAAGGGGAACATCTGCCATGCGATTTCACTGTTTCCAGGGCAAGCCATGGCAAAGGCACTTCTGTTTCTGTCTCATC CTCTCTCTCCAC CAAAGCAAT CAAAAGACACAGC CC CAAAGACTTTGGTTTTGTTTGTTTGTTTTTTAATG C TCCTAG TTTTAG GCAAAT CT CCATAAGAAAAG CAGCAAAAGGAAAAGAGGATTT TC CGTTTCATGGTCA CTGAATACATTCTGC TAGCGCTCAAGTCGGCCCATCAAACTCTATGCCATTTGCAAGAGACCTCAACATTCCAAAGACTCCAAAGATGAGCCC TTC CAATTG CAG CAGGTTAGAGGAG CAGAAGAGTAGG GTTTAGT CAAACTGGGTTTTCAT C CAGTGATTATATCGCAT GTATTTCCTT CTTCAAATGTTAGAGGTACATGAAAG CTAAGTTTTCACTGAC CAGCCATACT TTAG CC TGAATTTAGA AATTTTAG CCTGAGGTTATTñAGAGGATGGGAGGTAGAG GTG CAAATT TC CCTG CATTTCAGGATACTTTGC CATCTG AAGGTAGAGACTAG CCAG CCAAAG CT CTGC CT GTAGGCCCGGAAGAGCAG CCGAACAAG G GGTCACAGAAAAAGCATA GAAGGT CAAAAAGAAAAAGATTGCGC CAATAATGAAAAGCAAGCATATTT TTGAACATTCAC CATTACACT CTGAAAC CAG GAG CAATACAC CATT TCTTGAACTATTAACT CAATG TTTG GT CAT CT CATTTAATAG CTTTTGAGAAT CATTCTG AAAAAATG TGGACT CAAATATTTAGCAAATAAAG CACAAATTTCAACT GAATTCATCTTATTGTAG CATATAGCATTT TGGTAGTTAAAATCATTT CAATGAAATTTCAAATATAGGTTATTTT CTTATATGTAAAAGATACGT TACTGATTTTAA AATAAT CTTT CAACTAGTATGTTATAAT CTTTTCAT GTATAATAAGTG TAAATGATGATATATAGGTAG G CATATAGG ATACACAGGTGTATAAAC CAATTAGC CTACAA TAGAAGAGAATT TTTG CTGACT TGCTTTGGTT TCATGTTTTAAAAG CATAAACTATTTTGTTTT TCATTCAGTTAAAATAAGATTCCTAGAACCAGAT GAGAAATAAT TGAAAT CT GAAAATAT CCTTTG CC TTATGAGAAGAAAATGAC CAGGTTAT CAGATTAC CATTAC TGAAGATATACT C CAAGATACC CAATTAAG AGTTTATC CAGTAGGCATATGAAAATAC CCTAGCTG CAAGAAGTAGTAATAC TGAAAAAAACATAACATGAATACTTT CAACTCTTAATTCAAAGGTACTATAGGAAAAATCTCTGTTTATATATTTGTGAGCAACCCTGGGCATAGAAAAATTAT TATAGCAAATTAAAGCAT C CTGTATG CCGC TT CATTTATTCC CTAC TCAGTACTTTCAAAAAAACTTATTTG TTTCTT AATTCACACAAACATATACTTTCATGCATTTAATTGTAACTGAACTACACATATTAGAAACCCAAAGATGTTTATTGA TTAAGAACAT TTAT CATTAAGCCATATAACTTTTAAGGTTTGAGTTATAC CT CAT CTGTATTAT TAAAAG C CAAA CTT TTGCCATAAAGAAAATGTT CTGTATAATTTGAACATAGTTATTAACTGAC CACTTTGCTTAT CT CCTGGACTGCAGGA G GGTTAAATG GTTC CATGGAAGG GAGTGTGCTCTG GGAGTCAAAAGTT CTGTAGTGCTATCTTCAGGTTCTCCCCTTA GCAGCCAAGCGAAAGTACCGGAGTCCTACTTTTAAAAGGATGGCTATACCAATCCGTTTTGCCTACCCCACAGGTTCT TTGAG G G GATGCTATTA C C CAAGAAGTCAAACTTTTAAGTAG GGAACATGGAACTGCTTAGGTGTTTTGTTTTGTTTT GTGAA CAT TTTGATAAAATGTGGTTC CTTTTGTTCC CTTTAGGTTAAT CT CAAAGAAAGT CAAT CTAAAAATAAAACA GCCATT CTTG CT TTGGTGTTCTTTAC TG GCAT CATGTATGCAGATGTT TATCTCATTTTATCAT TTGTGT TT CACAAT CATACAAGACGGGTTAAACTTCAAAGGGACAGAAAAAGAGCATCTGACTTATATAACTTCAGTGATTTAAGACTAAGA GTGGTTTTACGTAGCTAT TAGTTACTAGTG CTTAAATAAATCAC CT CT GAATTGATGTCAAAGCAT TCAGAAACACTT ATGACT CCTTGGTCACATGCTGCAAGTATTGT CAAAATACTTAAGTAT TGTCAAAAAACT TAAG TATTGT CAAAAATA CTTAAACCAAGTATGT TTAACCAAGTATGTTTAAAC CAAGATATGTTTAC CTGTATTTCTAATT CATC TGAAAACAAT GAACTTAATTATAGTCAG CAGTGT CCTATGGG TTACTAG G C T A (N ) xAGGTTAGATTTGCAAACTCCATCATATTTTT TACTAT CTATG CTGAAAATGTGG GAACATTTC TAAC CATTCCTTTC CCAC CTTCTGAATTTGAAAT TATACTGGATAC TACCTCACAGATTGTCTT CCACAATGATGG CT GAGTGTTTGATAAAAA TATGTCACCCTC( N ) xGTCAGCCTGCTCTG CTAAGCAGATTCATTCAGTGGTATATTTTT CT CAAGAATTTATGTTACTTTGG(N ) xCAAAAGGGTCCATTACTATAA CTTATG CCAAGT TGATTGTATTATATCTGC CAAGGTTATTGGAG CTGCTTAGCTGATCTACCCTCTCTAAACTGCCTA ATCATTTCAAGG GTGCTGAAATTCATTTACAATGTT TTG CATTG CAGGTT T CAT CTCAAGATTTTT TTCCATGCCTAG CCCATTGAAGGCAAATGC CAACTC TT CCGAAT TATT CACCCT CCAACTGACG CC CACAAG GC TAGAATACAC CCCATC TCCTGTTAACATCAAGTTTA(N ) xCAAATCAATCTGAAATGAAGCTCCTGTAACAGAAACAGGTCGTTTTAAGATTTT TTAAAAAATTTT TAGT CTTGTAGTTATT TGACTTTG CATAAACC TTGTAGATA CAAA CAT CCAT CCAGACTTTATACA ATTCCTGAAATAAAGTCAGTAGATCAGGTAGCCCAGGGCCACAGATTCTTCTCAGGCAGATGCTAAACCACTAACCTC AGG GC CGAAAGGGC CATATTTGTGTC CAGAGAAAT CAGGTTGCT CAGAATAGAAG GCATAGACCGAAGGC CTATAAGA AAATTG GAAG GTTCAGAAT CT CT C CTAAAC CACATGTAACAAAAAATATCA CATATTTTTAAAAAACG CATT TAAAAA AAAATG TT CT CAAGTT TAATTTAT CCTTTAACTACTATT CATGTTAGCATGAAAAACTTTTTTT TTTTTCTGTTGCTT CTGTGAC CAC CACC CCAT CTGTAGTAACTGTCTGGAAC CACCTT CAG GAATCAAAAGT CACAAAG GAAAACTTGAACC TGTACTCG GATCCTTGA C CCTCAAGATAACAGAG CCTGGC GGTAATT CACATGGGCCGTG CAAT CTGCAGCC TGAAAC AACAACTAA CTGCATTA CAGAGAAGCGTCT CT CT TTATAC TT CT G CAATC CTTGGGATGCTGCTGGTT CATTTATCAA AG CCAGAAA CTCCCTT CGGCT CTG CTGTAGTGGACAGAAT TAATAG CTGCAGTAAT CCAAGTAATTAAATG GTC CCAA TAACCAAAGT CTATTG TATCCTTCTTAGGGAAGTAACACTTAAG GCTAAGAAC CACAAAACTTT TAATTC CAAACATT GG CTGT GCTT CTATTC CACAGT CTAGTGTTTCA C GCAATT CTAAA CTAAGATAGAAATGTA C TGAAGAAAGAAGAAAG AGAGAAATAAATAC CTCTCATGATCT GGCAGGGTGAGC CACT GCAATATGGT TAATATTCTC CG CT CGTGAGGGATGA CACTGGAGGGTAAAAGCAAGCAAAGCATGTTGTTAGGTTTCATTTCCGCAAACTCAAGCCTGAGGAGTGCTCGCTCTT GCACCCAGGC CAAATG CT CTCTATTTACCAC CAT CTGT TTT CAC TT CATT TGTAGTAAAAGG CC CT CTTT CGAATTAG CGTTTAAACTACTATTTC CATT TG CAACATATTACGTTGCTCAACCTC TT CCACAGAAGACTACACTGGAAACT CTGA TT CACAGT CA GAAT GAGGAACCTGTCTTGCTAAT CAGAAACT CCACTACAAG CAGAAA GGACCCTGGTTT CAGAAT CA TTGTCATTTATTCTGTCGAGCCTTTAGCTGTGGCCATAAAGGGATCAGTACTTAC(N) xATGATCATAATGCGTTATC AT(N)xTAAAAATCTACCAAGAATA(N)xTAGGCACATATAATTAAGAGACTGACAAAAAGGGTATAATCTTCATCCA CAATAACTTTTGTGTTTAAGTCTC TTTCCTCT GTTGAAATATAATTAC CCTAAGTC TGGCA CTAGCTTTAAGGG CTCT AAGGCCTCACAGCTAACATGACAGTATTG CAT CC CCGG CTGAGT TACATT CAGATG CT CACAAG CTGAGTGATATCTT TAGTCAGG GGAAATTCAC COACTO TTTCCCAACAGGGAG CA CAAAATG CATT TT CACT CC CCGAT C CAAGAC CAGG CA TTAGAGGCATGTTCAT GATGAATT CAAACTAAAG CACAAG TTTAAATGTG GT CAGGAAAAATTATTGT GGAACATTGG TGAGCCATTTAAACTTTTACTCTCAGTGGACATGAGCAGTGAGAGAGCTGGGAGAAGGATAAATCCTCAAGCCGGGGC T C TCAT CAAGTGGTTT CT CTTC CT CC CCCATCAC CATC CAGCTCACAGG GATTT GT CATT CC CCAG
> H s 9 _ 123059 23 3 - 123077 86 7
TTTGTTAG TGTAAAATAAGTCACT GAAATGGAGCAAAG CAAT CCTTT CTGTCATATTACCAAAT C CAAGACAGTTCTC AT CCAG GAATTTA CTGGG CCAAC CTCACTGTCGGGAAAATGC CTAAATTCTCAG GAGACGAG CAGTGTGCTATG CCA C ATAATC TCAG CCTCTAATACTTAACACTATAAGCTCTG CTAATAACTC CAGGTCACTT GT TTAGGAAACAGAGATAAA GATTTGAAAATTTTGCATATTAACACATA CAAT CTACCAC CCAATGTT GT TGACT C TGAGTC CTAGATAT TT CAAT TC ACTTCCCTCCTGCTTG CGTTCTGAGAACA CAGTTATTT CAGGACATGTAAATGGAGAG GGAT CATTGCTATATG GT TT CATTCTGGAACTTTTATGTCAT T C CACATAGAGC CAGT TTCCACACACATTG CCAAGATAAG CCCCTTCCCTGCAG GA AGGAATG TT T CTGGAG CTAAACAT GAACAATGATACCCTTAAGT TTGAGT CTGACT CT CCTAGT GATCTGTGACTT T C TG CAAATTGTATAAAT T CTC AATG { N ) xCCTGCCTTCTTCCCCTCTCTTAATCCTGCTAATTTAATACTGAAACATAG A C G T A C T C T T T (N ) xAACAGCAGTAGATGACCAAAACATCTTTTCTACAGAGAGCATCAGAGCACATTGTGATTGAAT CTAAAGGTGTGTTTGCTGTTGAGTTCTTCTTG TT TGTTGTTCATTT GGGGAATT CACGTTTGTAAA CAAAAATC CAGA GAGAAGACAG CACCTAGAAAGAAAGGGAAGAT CAAACACAAACAGG CCGTTTTAAAAAGAGGTAAT G A G T{ N ) xGTGT TTTC TTTTTC TTTTA G CAAGAAAATGGATT GT CAACAGTGGTAAAT CTTATGTC CAAGTTAGAAGGTATC GGGAGATT TTTAAGAGTCTCTGGGAT TGAAAACCTGTGGTAATACAGACCTCT CAGGGTGAAGTTTACTTTTAT CC CTGT CTATGA TAGGAACAATATTTGCTGAGTG CATGAATGGTG CAAACACAGACTCTG CCTC CACTTCAG GGTTTGAT TTGATAAGTA AACACC CC TCACTC CCAGGATGTAGTTCTGG G CT CACCATATGAGACT CGTT TGGGACTGAGAAA CTTAAAACCACTG AG GCAAAT GAAAATAGGT CAGAAT CCAGGTAAATTGTC CT TAGACT CAAAAT GACAAAGT CTTT GAATTT CAGACG CC AG GAATTG CTTCTGTGGCTCCCTGGGCACCTGTT TTTGTGGCAACTTCATACGATGTTAT TTAT GAAGAAAT GAAT GG GAAAGCAGAACTCCTCTAGTA( N)xGTTGCTGCACCACTCAGTGCTGCCTTCTTCTGGGCCTGACTGGCATCAACTCA CTCGGTGATACTCAGC(N)xGGATGGATGAATGGATAATAGATAGATGGAC(N)xTCTCTTCTCCTCCCATGAAAGGA AGTCTAAAGCATGG CTATTGTCAC CTAAGT CTTGACATGGAGGTACCAGTTT CACACTGG CTTCAGATGTTTGTGTGT
GñTGGACT CACCTG CATATTCT GGGACAATGT CTTCTCATATTGTGTC CATCAAAATAAGTGT CAGGT CAATTCAATA GCATTTATTAAGTATCTAGGTTAATG CTAAGATCTAGAGAGGTCA CAGAGAAAAGTTTTCAAGC CTTCAAGTTTTTGT AGAGAAAGAACTGATA TATATT CAGACAAT TAAGAAGT TTATGTATCCATGATTCCTCAGCTCAGGAGAGGGGTAACA ATATCC CTAGACCAGG CACAGATG GCTATAAGA CAAGAGC T C CAGT CCGG CT CAAAAT CTCCCCTTGTATAG GAAT CT CTAGGAGATATGAC C CATAAACT CAAACTATTAGATGT CCTGGCTGAGATGTAG CACCTG CCATTTCCTCCTCACTCG GAGTAT TGCTGTGTGGATTCACCTTC CTCCAAGT TCAGTGGT GATGTCAATATT TAAAAG GC CT CAAAGAAAGGAGAG CTCACTGTGTGGTGAG CAGCTAAG GATATCTC CTGGAAGAGGTGG C CCTGGG CTTAGAACAGGGA CATACATACACAC CATTTGTG CC CTTACT CC CCTACC TCATCCAAT CAATTATGG CACACC TTACACTGAACT CAGAA C C C T (N ) xAAATT TAAAGAGTTGATCCATTT CTCTAATT CAAGACAG CTTATTAA GTAAAGATGCTCATG GAATC CTAAACTTAC CCTAAA TTGGGATAAACTGAGG TCC TCTTTTT C C T T T T T T T T T T T T T G ( N ) xAAGGTCCACTTTTTCTACCAGGAAGCTAAGGA TC CTAG AGGGTCAAAG ATGCC ATC ACTAAT AT TTTTG AGT AAATGAAGAAGG AAAG AAGT AGAA TG AATG AA C AGAGA AACACACTG CAATT GTAT GGGGAAAAGAAAATAGAGGGAAGA CG TT G CAG GCAAAATCAT C CA CAATTATACAAAT T C CACCTTAGAGTAGAG CCCAGCACG CCTGGGAAGGGCAGAGAGAAG CTGAT CGTACAGG CTTT CTGGGAGC CTTAACAT CATTGTGT CTTAAAAGGGTGAGGAGCAGGGGAAAGGAGGAGGAAAGAGACTT TGGGACTCAG CATT TC CATATCAGAT AATGGACTTTAAGAGCATTTG GTTTATGAC TGAATCTTATAC CCTACACT CC CT CC CACCTCTATT CT CT CCACTGAA CAAGTAAAAGTGCCTCTTAGCACAGCAGATTCAAGAGCAGAGATTCACTTGAGATGACTGTACCACAGTGTGCTCATT CC CGAT TGGAGTGAAATGGTCT CATCGCCTTT GAGTTG GGTCAGCTGCTGAGAT GATCAGG GATCTGCTG GAAAGAAG CAGGGGGACAGAGG CTGATCCACATC CCAC CT CTGCTCTGAG CTCT CAGGGAGACTGACATGAGGT CAGT CACCAT CA CCGTGGGAGTTCACAGAAACAGT CAG CATAATG GAGAGAAAG CCAGTGGAAGGGGAGCTT CGAAGATGGAGACT CCAG
c c ttc tc tttc c c c a tg g g a g tg atg g g c c c a a g g ttac c aa g c a a c aa a a g g c c ttc tc tc c c c c tg c c ac tc tc a a CTGGGCTGTGTTGGGTGGAATATGGG CAGAGCT CAGTT CATCACTGA CTTTATTTATTGAA CAGGTTTTCTAC CTGAA CTCCTCACCTCCTTTCCAATCCTGAAATGAATTAGGTCATATGAAGTCATAAGTGCTCTTCTGAGCCTCCCCTATTCT CCAACTTT CCAGTCATT CATTCACTCGT TCAACAAACACTTACC CAGCTGGTCGAAGTGGAAAGGG CACAAGCTTTAG AATT GT GC TATT TñTGAT TAAGTCAT{ N) x CTTGAAAGACTAAGCCTGAAAGGTAGTAAGGATTAGATTCTTTTGCCA GTCATTAAAACAACCACTC(N)xAAGGTAGTAAGGATTAACTGTGAACTTACTCATAAAGCTTATAATGCAGTGACTG G CAT { N) xAGAGACAGGATGGGGGTAGCACTTGGGCAGACAGTGTGGACAATTCACTGCAGCAAAACTGTGGAAGCAG GAAGGC C CAGGG CATGAACCAGGGTGGAGCAGTGTGTG GT CTGTATGGTGGAG G GT CT CAATGAAGAAGT GCAGTGGA GATGAAGCTGGAAGTCTG CAACGACCCTGTAGGG GC CTGAATGC CAAGTTGGAAGTTTTGGC CACGTCTGGGTAAATG ATGACATTAAA CATGGACAATGAGGTCAGT TAGGAT CTGCTACTAAGGTAATGT TTGAGGCCATGCCTTC CCTTCTCC TTCT CT TTGGATAGACGCTGTAAGGACCAAAGTG CCGGGAGG TATTC CATGCGC CC CACCGT CACG GCATATCATTAA CCCACCATCATTCCCAGCCTCCACAGACTTCTGACTCCTCTGTTTCCCAGGGGGAAACTCATTCTCAACAATGTGTCA CCCACATAAGGGGACCAAGTCTGGGAGGTGGTGTGGGCCACGGGCTTGCTCACAACTCAGTGTCAGACTCTCGTGAGG CCCCAGTCACTGACGGCCACAACAGCAACCAGACAAAACAGAAACTGCCCTTCACCAGCTGAGCTTCCCTCTTAAACA TCTCTGAAACCCATCAGCTTCTGCATCACTGCCTCCCTGCTAGCTCAGGCCCCCACCTTCCCAGCCAAAGGGTTCTAT CCAAAACCTCTGGATTGTAACACACCCTGTGTGAAACCTTCACTGGCTCCCATTGTCCAAGGACAAACTGCAGACTCC TGTGCATGGACTTTAAGATGCTCCCA( N) xGAATGAGTAGAGGCTAATAACAGCCAGGGCTAAGATAATGTATTCAAA AAGAAG CAAC CAGAAACT CAAAATACTCAGTGTT C C CAAAAGTGTGGAGAGCTCAGTTTGAACCGGGG GCAGGTCTGA AAGCTCCCAGCAGGCTGAGGGGAGCCCTGTTCAATCAGGCTCACCCCAGCCTGGATCAGGGTATTCTGTGGAGAATTG TGCAAAATGGTTTGGAGTTTTTGTTGTTTGGTGGGGGTGGGCAGGAGCCAACTCATGAAGGGTGGGGACTGCTGAGAG CTGGGTATGACGGCTGGAGCAGGATCCCCCCAAGCAAGAAGGGCTCCATCAGGATTCAAATTCCAGCTTCA(N)xCTA GCTGAGGGGGAAATCCTTTCATGCCAAGCACAGAAACCAAATCCAAGCCCTACAGATTGAGCTGGGTAAGTGATGTTT AGGG CAGAGTTG CAGC CTGGTGAGCACC CAG GAAGCAGACA C CC TGAAGGGTGAG C TCAAAG CAAAGAC CAACCCAGA CTCCACCCACTG GAGCATGGATATGATG CC CACAGC CT GGACTG GCTGGGTGGG CAGGGCTGTGCCTCTT GAGGGAGG AAGGAGGAGCTCTGAGGGTGGGGAAGCTTCCTGGACCCCTGGTTCCCTCTAGGCCTCCTGATGTTGACGCAATCACGT ATGCAAGCAG CTGG GG CCACAGCGGTAGATAAGCAATT CC CAGC CAGGTAGTCACAGTGCTG CCTGTGAGTGCAGGTG GGAACTGCAG CCTGGGTGAACAGAGTAGGTGC CTGACT TC CTGGGGGTGGGGGGTGGCTC CAAGTGTG C T G (N ) xAAT GTTATGAT CCGGTTACAT TGTAAGAGAATGACTGGCTG CTGGGTAGAAAGAGfiCTG GAAAGGGT CACACTAATATTCT GAAAAG CATG TAAG CT CTGGAAACACATTTGGGT TCACATTT CTGCTTCAAATCAG CAAAAT CCAGACTG( N ) xAATT TTTTTTTTAATATATAC(N)xGCAAGGGCTATGAGACCAGTCCTTGTCTGGACGGAGTCCCTCCTGGAACCCACATCT TCTCACTGACCTGGCAGCCCCAGGGATGGCCTCTCTGGGGGAGTCTCAAAGAAGAACCTGAATTCCAAGGGGAGCTCA CTGAGAGC TCAGACTTAAAGTCTCTCCAGAC CAGGATT CCAG CC CTAGGATGAC CCTCAGGC CAAGGC CAAGTCTGCC CCTATAAACCTCTC CCAAG CTCAGCCTC CCTATCAGAAAG CCAAGGCAGGAGGGTG GCAT CCACATAT GAAATCCCCT AGATGGCAGCTCCTCCCCACTCCCTAGGACCCCATGCCAACCCACACAGTCCCCCGGCGACACAGTGCAGCCATCCAT GGGCAG CAGC CAGC CTGCTGGTGTGATTATAA TTAAGT CGGTTT CCAGCATCCT CTAGGGGCTGCTCTGC CCTTTTCT CCCTTTGGTCATTTTACCTGTCATATGGTCCCCAGGGTCACAAACAATTCTCAACACGGTTTTTGATTAAAATCCAGT GAAG CT GAAAAC CAAG CCAGATTCATAG GATAAAAAAATGTTAG C CACTACCTCAGGGAAGT CAACTTAATTCACTGT TTGCAC CAGGGAAAG G CTGGTATCAGAAGTAGGC CT TAAAGT CCAAGGACTGACAT CTTTGAGCAC CTAC CATACTCC AGATCTGTTAATTACATTCTTTCACTTGATTTACCAGGAGTAGGTGGTCTTCATGTGTGCCTCTCTCCTGGGAGCCCC TAAAGTCAGAACAGAAACTCCAGGCTTCTGAGCCCTCTGCTCCCTCTAACTGGGATGTCATTCCCCTGTATCTCTGCC CTGAA CAT CATC TTTC CATGA CAGCTAAAT GT GACAAAC C C CAAGTC CAGCAAAAATCAC CCCCTTTTCT CATCTCCC ACCACCCACATCCTTTACAGTACTTACCACACAGCATGATGATAAGCCATACACATGCTCACCTCCCCCTCAATCCCC AGACTACAGATT CCTTGAAGGTAGCAGGGAAAAAGG CATT TTTTTAAACTGGAGAATGTC CAAAG GAGAG CGGGAATA GTGG CAGC CCATAAAGAACAGTAGGAGGAACTG GGGTGGAGG CAATGAATAATAAGAAAAATAATAA CTAA CATTGTT AAATAATTCCAACATCCTGGGCACTGAGCCAAGTGCTTCACCTTTGGCAGTGCTTTCTAAAGC(N ) xCGAATGCTCCC AACATAGATGAAATTGAGAGCCAGAGTTTAGTTAAGGGACACAATTAACCCCCTGCCCCACTTTGCCAATTATATCCC CAGGA CAAATTAGC CAAGTTAGAAGGACACATATTGTCAGGAGCAGGGAGGGGT TG CT GAAATAGTTT CACCTGCTGT TCTGCTCAAGTTGGTGGCTGCCATTTATTTTGGTAGTTAACACATTAATTTTGGTAGAACACACTGTGTTGCTCGGAT TTTGTTTTAT CAGTTT TAATCAACCAAATTAT TATTTTGAAG TG CAATGACAAAATAATC TCTAGTGCTAATGTTTAA AACT CC CT CATGATTTAAACTACAGTTTGCGAGT TTAAAAGAAGTCTTCAGCCTTCTTTTTA CCCCTTTTTACCCGTT TACCCTCGTTTTAGAAATGTTTCTGAAAGACAATTTTTGCGAATGTGAGTTTCTTAGGAATGAAGTTGTCATGTGATA GCTGATCAGGCTGTAGC(N)xAGCAGTAAAAGCCATGTTCCCAGTGCCTAGCACTCTGCCTAGCCGCCGCACCCCCCT TCCTGCTGTGCT CAAT CGATTATTAGAAGAATTAAGACGTTAAATTATGAAACGATTGTCAGAATAATATAGAAATTT GGACTCCCAG GGAAGCGTTCTTTTCTTT CT CAGAAACAGC CAAAATGCCTCCAACACC CTTTCTCCAGAT CAGGTCAC ATGTTAACTCTTTTCATCAT( N ) xATCATATTAAACTCTCTACCCATTTGCTTTATGTATGGCCTCACCTGATTTGAT CAGGAAAC CAAT TTATTCATAATAATTACTTACAGAGCAGGTGGTACAGGAAAAAT CTTTCCTGTTTTTC CCAGAATT GGCAATAAG CAT CG CAACTTAGAAAAAAAGGT TG CTACGGACAAC CTTAGTTTGGAGACCAG CC CCTGTGAGCTAGTG GTGCTGGGGCCTGTTTCCC CATCCTATG GACAGAC CGCTGGTGGG(N)xATTAGGAATGGAAAGGGAGGATAACTCTA TACACTAGAAATTTAATAAAAACCTTAGATAATAGTGAACATTTTATATTAATAAATGTGAAAACAAACTGGACAGTG TTCTGGAAAAAAAAAATGAACACACTACAAAAATTCACTCCACAAAAGTGGAAAACATGACCACATCAGAAAAGAAAA CTTAAG CTAGTATCATTTACATTAATGT TAAAAG CTTG GAAAAC TAAATAACTTTAAAAGTTAACTAG CTTAACTAAA AAGC TAAATAA CTAAAAG CTGTTTTAAAAATTAAA CAACTAAAATCTTTAAAAACTAGAAGCTT TATTAACTAAAGCT AGCAATTTTTTTC(N)xAGTAGGCTGAAGAATAGCTATGTGGTGACTCCATAAATGCAGAGAAAAGTATTTAATAAGA TTAATG CACATCAAGATGAAGAAAAATT CAGCAGGCAAGACATAAAAAAACTTTG G CAAT CTGCTTAAGTGTGTGTCT CTAAAAAACCAACAGCAAGTATCTCACATA( N) xGATTTAGTTGACATTTTCAGTCAGTAAAAGGGAAATGATTATAT GTGAAAAG CAA CATTTTAAACT TTTTTTAAA CAGTCTATCTGTTAC CTA CAACT TTGTTTTTTTTT(N)xAAGTCCCT AC TGAAGATAA C GG GT TT CAAATGACTC CAAGTG CT CCACAT CTTAAAAGTAGT CCCACAAAG
>Hs9_93851754-93876478
GGGATTTGATCAATTTAG CC CCTTGACAG CA CAAGC CCACGAGCCT CACTTTGACACCTTT CCC CAGAATACTACAG C TTGGGAGCAAACGG C CTG CACAGC CCCCAGCTA CAGAAAC CATGCATGAACTAC TGCT GTAAAGTGTAAT TCTCTCTC CT TTGCAT TAAG CTCCTGCAGGGCTCTGAT TACACAGGAAAAGAAAATGTTG CTGAAGAAGATGGCTC TT CTGGGAAA CCTGTCTCCAGCACTCTG CAGGAG GGAGAGGTCCTCC CAAGC TOA CAT TTAGTAAGAAGAGAAG GGGACTCTT CAAAG AGATGAGAGTGCATTT GG CTTCAGAGTGTG CCTT CG CCAAGAGAGGGT GCACTTCAGCAGGCAGTGGCATCAAGCTCT CACTCCACAGGATGAC CAAC GATGTTTCAGTCG C CCTTTTTCTGATGCTTCCTAAACAAATAAAGTGG CAGTAATT CT AGAGCTTAACCAGAAGAAAACTGCACAT TCCCTCCTGAAGATATTGCC CTGACTATAT CCATCCGGTTGTTACT CCTA CATAAATG TTAGCCTG CTACATGAGTGT CCTGAGACTTTGATGACATT TAAATCAGCAGCTG CTCCAGCCTCCTACCT TCAAGC CAACTAGCAGTT CATTTC CCAAAAGAGG GCATGC TTTAAGGG CTAATGTCTCAG C CGACACTGG GC TG CTA C TACAAAATG CAGAG CAGGGAGG CACG CG GGGCCAGGTGTC CACAA CAAATAGAAGGTGTC CTGAG C CT CAAAAACCGG CCGTCT CCCCATGGAGGT CAGCACAGGCTTATT CAT CAACTCTGCT CTTGTAAACACAGG CATCACTG GGTATAAAAC ATGCCATCAACTGAGT GG GGAG CATTTT TT CTATAACCC C TGTTGC CAGGTCATATAACC TATT GC CAGGTTAGAT GA CCATCACAAGGATGAT CAGC CAGGACAGAGGGGTGACCTGGTGACC TGGGTCATGGTACT CTGCAGGGCTCCCTAGCT GT CTGG CCAGTCATTGGCAG CT CT GCAGAACAT CAAGCAAGTTACAGGGTGG CACTGAAAGC TGATGGGTAT CGAGAA GGTGGTGGCCA CTTAGAAG CCCCACTGGGGCCTT CAAGAAGCTGAT CACAAG GTGCATGTTCTTAGATGGACCTCTGG GCAGCTGCAGAGTGCAGGCTTACTACCCAGATCAGAAACCCCATGTAGGGTGAGGATTGTCACTGAGAGAGGAACTGG TGGAGGAAGTT CTG CTGAAAAAGT CCTG GTGACC CCATCC CCTGAG CAGAAT GC C CAGGTAAG CCACCTCCTGTCTCT TCCTCCTTCCTTCTCTTTCATG CGAACACAGAAAAG GCAG CACAGCAG TGACAC CAAC CACAC CGCAGTTCC CACCTG GAGAG GAG GTGAG GTCAAAAAACACATG C C CTGC CATGC CGG CTGG CC CTGAACACAC CTTCCTCCATGCACACAGGT AGACAGGTGAATCCTGAGGTTACTGTTC CAAAA CTC ACCATG AG G (N )xG AT TCCATCTCAAAAGAAAAAAAAAAGA( N)xCTAGCTATCTGGCCAGTCATTGGCAGCTCTGTGGAACATTAAGCAAGCTACAG(N)xGTGGTTTGAGTGCCAGAT CGGCCG CAATACAACAGAA CACTAGATAGACTT CTAAGATTTTTGACT CTACTTCCTGACTCCCAGACAG CACCTC TG G A C ( N ) xCAAAAAACATACAGTGGATATCAATCCATCAAGAGAGCTCACCTGAATATGCCAGTCCCTGGTGGATGCTG CTGTCCCCAGTGGTGG GATCAG CAAG CT TATTGGTCTGT CTATCC CAACAAGTGAAATAAG CATGC CAAGG TTTGAAG AA CTACACAGAACCAG CCAAATGGAAAAAAAAAAAAAAAAAGGAATTC TGAATAGTTGGG CC CATGTGAGGTGT CAGA GAAGCTATTATTTTTTGAGATGTAAG CCTTTCTAAC C CAAACTGCCTTAAATGCTTGCTT CGTGAG CTTCAAGA GTGT CG CACAAGTGGGGAGCAGGT GGGGATAGGT CCCAGCTCAC TGTGGA GGAAGTTCTGGGTT CCAGGCAG CAATGG CTGC CC GAAGAA CTGACATG CT CAGG TAGCTGGGAACAGC CCTGAATCCAGAGGGCAGGGAG CCCTGTCTGGAGCCCCCCAC TT CACACC TGGCTATG CCAGCTGCAGGGCT GAGACT CCTG CCAGTGAGATATGAGCCC CTGGGT CATGGCTG CAGC TT GCTGGCTGATACTCGGTGTCCCTCTCTCTGGAACTGTGGGCTCTCTGTCCCCGACTCAGAATGGCAATTCTGACCCCT CTACCACC CTCAGT CC CATGTGGAATGAGAGACAGAGAGGTGGAGT CTAGT C TCTATGAC CACGTG C C CATG CATG CC TG CAGAGC CCAGGC CC CACAGC CCTGCCCC CGAGTCACATGCATGG CC CACAG GGACAAGAT CAATGT CCAGTCGGAA CG GGGAGC TCGGTTGC TCATGCTGGACCGG CATG GC CGTT TC CTGC TC CTTTTCAGCAGTGACCACAAAC CTTCTCCT CCAACATT CAG AAACAGC CTGCAT CCGG CTGAAC CACACT CAGGCTTC CTTTACCCCTG GAAAAGC C CACTCTACC CA GATGCTAC TGATGTTTCCCTCCTGGTATCC CAAAGGATGC CACACG CCCCTCGCCTCCTGTG CCAGTACT CCAC CAGC ACACACCTTTGAGTCGTAAGAACAGGGCTGGCAAGGGGCAGAAAATTGTGTGGACTGGGGTTGCCATTAGAGGAACTT GGTGAACAAGTGAACCAATGAACAAG CACTTTGAGATTG CAGTGTTTATGCATTTGAGTGGAAATGGG C CAAATGC CC CTCTTCTTAAAAGGCGATAGCAGTTTTCTAGGAGCACAGCTTACACCTGTGAGTCCTAAAGGCAGCTCCATAAGACAC CAACTG GC CCTGGAGAGGTGAGAAGGGCGGTATATT CATCAG CTGGAGCTGCGTGGCTGTG( N ) xGGCAGTTCCCTCC CTCTGCTCTCACCCCTTGACCTCTCTTC CAATTTGCATTCAC CTGCTTTGGCTTCCTCTGAGATGTTC TCAATACT CC
t t c c t t t t a t a g g c t t c a t t c c a g g c a t t t g a t c a t t t t c c g t t c a t g a a c t c t t c g t c t c t g t t c t a t g t t t t a g g a TCTTTTAAGGGTCCTATTATCCTTTGCTAACCACCAAAAAGTGAAGTCTGCAACCTGGAGAGCCTTCCCCTGCCTTGG GCTGCTCTGTGAGCTGAGCTCTGTGGTGCTGGGGTCTGCCTATCCTGCTTTGGAGATGGATTACTCCAGTGTCTCCCA CATGATTCCAGGCTGTGGTCCCTTCAACGTAGCAGAAAACCTCTCCCCCACTGGTGCCCTTTTCCCCCACAAATCCAA GC CAAT CC TGAGGAAT GT CACTGGGATGAC CAA CTTAAG C CAGATA CAA CAGTATTGTCAATAT CAGTG CTGGCACTG CC CAGACACAGACT CT CAG CTCAAAC CCAT GGACTAAAAC TACCTT CAGACT GAAACCTCTGATTCGTAC CAGGGC CA CGATTCAG CCTGGATTTCAACACATCTGGGGCCATCAGG CAC CAGA GCA CAATG CCACCCTCATAG CCAACAGCAACT ACTGAC CAGGGCTTTAACACAGAG CCTGCCATTGGCTGCTTT CACTGAGGTGTAGCAGGT CATT TTTGGAGGAG T CAA AAAACATTTGTTGGAGATTG CTAGTCAATTAGAG(N) xATGGTGCTTGCATCATTATTTCATACTCACTGTAGGGAGT TT CTCATC TGCCTCTCAAACTGTGTT CCAC CTTATCTGTT CTGGTGGGTAGAAGGCAAAG CACC TGGCCCCGTTACAG TTATAG CTGAAC CATGAC CAATGAGT CTGAGCAT TG GATG GCATGAACATCG GAGATC CT CT C CTGTGAATC TCAAGT GACCCTCCTTGCCCCTCCCCAACTCTCCCACTGCGGGATTTTCCCAGGACAGAATCTCAGAATGAGCTGGCTCTACAG GCA CTCACAGATGCACCCTTACCTTGACCTGCAGGC CTGAAG CCGACC CAGC CCACAGGCAG CATT C C GAGC TGACAC AT CAAGAAGCT C CAGATG CCTCCCCGACCTTCA CAAATGT GAAACAAGATGAAG GTGTTGACAGGG CTGGTCTGTAGA AATAGTATTTTGTTAT TACT GAAATGTTAGAGTGTAAAATAGAATGTT CTGTTCCTCTCTAGCCA CAATGT C CCAT TG ATT(N) xCTGTGGCACTGCCATTTCTCAAGCTACCTAGAAGCTAGTTTGAACAGTGTGTGTATTTAAAGTAAATTTCA TTTGGGACATTTTATTA CAGAAGC TTAT TATCT CTTTTTAAAGGAATCTTT CTT CTCAGGAT CTTAAGG CAT CTTATA TC CATT GAAAAG GC CT CCTGTAC CATGACAGCTT CT TCAAAGGACACC CTAGATAATAGCA CAGGAAT CC CAGGGAGA TGAAATACC CTGAGAT CCACAGACTTCC CTGAGATGGG GAGT CC TGTTT C CT CCTCTAC CAAGCTG CTAAC CAGTGGC CAAGTCAGATTAAGCTGTATTTCCTCCCAACCCCCCCACCCCCAATCTGCAGTGGAGGAGGTTTTAAGCAGGCAGTGG TTAAAT C CTGGAAG GAGAAATTAGAG GAAATGAAAAGACT CT CC CAA CTTGGAT GAGAAAGTGGTGGAGTGACAGGGT GGGCAAGACCCCAGCAAGCTGTCCGGGGCCAGGCCCATCCAGGAGTCGGGTGTGTTTCAGTGTCTGCCCAGGCCTCCT GTCTAGGAAGCTTTC CTGC CAGGTTCAGAAAATGAAGAACATAAA CCACAGGAC C CTATT TTATGGGTAGAAATAATG TCAATGAGGG CAAATGGAGAAGAGTGAGG CAACAGC CACATC CGTCCAGGAAG GATGC CTGG GACACAG GTG GTTCTG CCAAAGAGC CAGAAGGATGGAAGGAAAT CT CAAAAT TCAGAT GAGAAG CCAGA CAGTA GGAAAG CCAAAAAAGCACAA TCTAATAATTTAGAAAATGAAAGAGGAGGAGGGCTGTTAAAGATATTCTCTG CT CCTT CGTAACAT CT CAGCTC CT CT CT TC CTATGTAGAAATAAA CGCT CAC TG CAAAAG GCAATGATTTGCTTAGAAAAAATT CATTTAAAATGTCTTG CAAT TATTGACACT GT CATCAGAT CTTGGCAGTAGCAG CTTATTATTACCTC CTATGTATGTTTGCCG GGAGAAGAGT CT CA TTGTTTAGGA GGGA GAAATAAAG CAAAAGT CCAAGG GTATTGTG CTTGAAATACATGC CTAGAGATATG GAGTTGATT GC CC CAGAAAAAAAAGGAAG CCTATGTGTATATTATTATT CACCTCGGTAGC CCGAAT C ( N ) xATACAGTTGAGCTCA GCTGGGCAGCCATCCAAGTGATTCATATTAAATTCTAGCTATATTCAGCACAAAAGTCATCTAGCATCAGAGGATCAA AG TTA CATCAAC TAA CAAGCACTGGGGGATTGATAAGCTG CTAAAATTAG GT CATTG CATAG CAGCAG CCTTTGATGG AT GAAGTGAAAT TTAG CCGCGGGTTTGCTATTTAAAGATC TGTTACAG CACC CTTTCCTGTAG C CACAATGAACACA C AACAAAGTAT CATGTGTCTGATTAAATGTTTAAT CATCG C TGTTGAGCTTGC CGACTGTG GCAAAGAT TCAGTTTC CT TAAGAGTGCATT CAGCTCTTGGGC CAAG CCAGTACCTG CATTTTAATT CC CACATCCAGAGC CTTACAACAGGG CC CA TTTCTCCCTCCTCAGGATGAATTCTGTAAAGTTAGTTTAATCGTTTTTAAAAGCATTTGGTCACACAGTTCCACCTCT AAGT CC CTGT CAGTTCAGTGTTTCAAAACT CTT CGT CAGCGT{ N } xGGCTCCAGACTGCCCAGTGGATGCAGCAACTG GTCTGGTGAGCGCATTTCTGTTTCTTGCTCATTCCCTCAATTTCAGAATCACACACCTGTCCAAAATGGCTCACGTCT CTCTCTTCTTTGGGACTGTTTTTGTTTC CAGATGG GTTAAGGTGGGGATT TGA CTTCATTTG GTTTAGGTTTTGGCTG TGGCTGAGGCCC CAGG CATGTGGCAGTCA C CAGTTC TGATTT CATAA CTCCCATGGCCAGGC CAG GAG C GACACAG GA AG CC GGAAGGGG CCT CGGGT CAGTGTGT CCTGTCCCAG CAGCA CGGGGTGGGGGACTGTGTGCCCC CAGGTG TC CTAT CCAC CC CTCAGCTC CAAAGGGCATGTGGA CATGCGT CAG GTC CC CTGAGG CCAT CTflCAAA CAC CAGATGGACTGTTG AG CT TC CCAGGCTCAGAACC TAAGAACC CG CAACTG GAAG GAAAAGGCAG CAGAAGAT GAGAAAAACG GTCAGGGAGG AAAGAAAACACT TTAACCCAAACAAAGT TTAAAATAAAATAAAGAGTG GTGAACACT CAG CTGG GAGAGGAGAG GAGG CCTCTGTGGG CAAAGG GAAATGCTG C TGAGAGGCAG CTAAG C CTGAGT CTGGAAGACT GGAGGGAGGCAGCTCCAGGG TGAG CAC CTGGAGTGCACAAGGCT CT GGAAGGGGAGGG CAAGG GGTATCACCCTCCCCAGAGCC CCAGGGAAGC CTGA TT CATCAGTA CACC CAGGAAG GGGCTGGCTGCAGCGATAGCCCCCTGCCCAGTTGTCCCAGATGAACCCTGGGCTTGG AAACAGT CCAGGGACACTGG CAGACT GAGAGCCA CCGGAGAGGGTCACCCCGTCCCCCTCCTGCCTGTAGGGGCACAC AGGTTTCCCACTAACCCTCAGTCCCAGTTGACCAATTACCAGATAAGAAGATTGGATCCTCCTATCCTTACAGACACA AG GACACAGGTGGGAATGG CTTTT CATATG CAAATT CCATGC CATTTG CT TC CTATTC CAGC CAAA CAGTTGTG CAGA AGAT AAAACGAATCAGGGAGTGCT CCAGGGAGTGCTCC CGGCTCTCAGGAGTAC TGTAGGGAGT GGAGACAGGAAGGA AG CCAT GACAGGGAAGGTGACAATATTGG CCCGGTGCTCTAGATGCAGTTCTGCAGGCGCT CACTGAAGAGACTGC TG GAGAAGGCAG GAGGATGTGC TCGCGGTTTCCTGT GATGTGGTGAGGG CAGGTGGAGCTTCAG CACC CAGAGG CTGGAG GGAGAT CCTTAGGTGG CCCAG GTGACGCAGGAGC CTGGGCTTTCACTCAGAT TCAGAC CAGAGAGACAAAGTAGGAAA CT TGATTTTAAACCTGGGAGTGG CTGAGTGCCTTGTTCTTGGCTATGTTTCAGTGTGATCTTACACATAGCTACCCAG CACATCTG TC{ N ) xATGCTGGAGGGTGGCAGTTTACAGGCATGCACTTTCACAGTAGTAGGATGGGGCTAGGAGCTGA AGAG CATGTACACAG CACTTGCTACAACAT CCAAGATT TTATTT T CAAGAGATG CTTT CT GAAAGC TTAGAGTTGAGC C CTATGT CATGGGAGATAGGAACAAAAAAACCT CATTAGAGT CACAAGTTGTTTT CAAGGTTAATT CAAAGG GT CACG T CTATT CATGTT CAAAATC C CAGC CATGTGTAT C TTTACACACAGTAGAT GCT CAGAT CATC CACCTAGGTGATGAGA TGTAT CCCGGGCATGTGGGCACTT CAAC CAGAGG C CTAT CCTTGAGGC CAGTAAGTTACTGCACGTA CATTACACCAG TATGATGCTGTCATTATTAATCACCTTCATCTATGTCATTTAAATTGTATCAGGATGGTGTTGTAGGTCATGT( N ) x A TG AAAG GAATTTTG CTAGAATGAATAAT TATGAAATGCAAATTACAAGGAAGGTAACACACTAAGC CACTG GGGGCA C AAAGATGGCCTTGGGGAAAG CCCCCTTGCCTATGCAACTATGAGCCCACATGCCATGAGCCTTTTCTCCTAAGAACTA TGTAAGTAGATC TGTT GGATAAGTT C CAGGAAGTGACATATTTC TTTTAAATTT TTTTTAGATAGTGT CTCAT C TT CC CC CAAAGAGATG GCAATAATTTATAC TC CACTCATAGC CT CT GTGAAT GC CTGCTTCAGTAGAGAGTCGACAATAT TA TGGGTTGTAGGTGT CTTTCCTTT CTGATAATTGGAAAAGCTTTT CATT CT TT CA C A A (N ) xTACAGATGGAAGGGGCC CTGGGATTG GAGGTGG GGAGGGAGG TTCATACT CACTGTAAA CT C CAT CCTACAG TTAGAA CTTTT CATTT C CATGTT CAAG CATTTCTT CT CACACACAGAAAAC CTACTTAATATAAAAGTGGGTT CTTATAAAAA CAAAAGTTTGAG CATC CC TGCTGTACACCCTTAGGCTTTTG CACAAAAATGGGTTCAG GCTTCTTT CATC CAGATCTTGAAC TAAG CAAC CCATAA AAACACAGAG CTGTGCTCTGTGTCAC CAGGTACATTAGAAACTT CAGT CCAACAACCAAAGATAAATG GAAG CA GTTT AGGT CTTTAATG CAGT CTAGGGATGACAGACAAT GTTCCCATTCAGTA CACCTAGAAAGAATAC CTGAAAGAAATGTG ACCTTACAGGTCCCTGACTGCTCAGCACCCCTCCCCTAAAACCTCTCTCCTCCTCCCTCCCCTCCAGCACCCTTCCCG GCCTGTCCAGGAAGAGCAACTCCCAACCCAGCTGAGCCAGCTTGTGCAGATCACATCTGCTTCTCTTGTCACCTGTCC TGGATT CTGGACAACAGAGG CGGG CCAAGAAGCAAG TGGAAATC CTGCCTTTTCTTGCAGCGG GGAT C TCAGTAGACT G CAAACACTGACAAGTTAT C TTTCACGCATGCCT CAGTTT CCTTCCAGCTTTTTGTGCTGTTCCTTCTACAGCAAGAA TTGCTTTGCTGAATGC TCTCCTGG GGAAAATAAT TTAATTGTTT CAAT TGTG C CAATTATGTAC TTAAGGGT CAATGT TTGTTTCCTATAACATAGCAGCTTAACTAATTAAAAAGATTAATCTTGGGTAGAAAAGAGCCTGTATTTACGCAAATT AGAT GAAGCAAT GACT TGCACACAGGGACAAGAGTGTC CTGC C CTGCAAAGG CTTACTGAGG CTGCGTGGAGTC CAGG GT GC C CTAA CTGTACATTA C CAC C CT CGGG G GCAT CTAGAAC CCAAGATGAC CACAGCAGAG CCTTGAGGTG CG GCAG GATG C CAGAACTTT CCACAGCTCCCCAGAGGACCCCAG CAAGTGGTGGG GGC CGAGA CTAGAATATATGGC C TTTGGT TGGTGATCAGAGCCTTTTTTCTTCC CAAAGGGAATT TTCTAGAG GGGAATTTAG GCCAAAT CAAAC CCTCACATCCCT TTCACTCACAGATG CTAACAAATAGTAACAAACCAGAG GAAAAAGGAAT C TTATTTG G GAAGAGTC CT CAAAAAGTAT GCTTTTTTTT CT CC CT CACC G A ( N ) xCTTCTTAAGTGATCTCCTCTGCAAAACCACAGCTTCCGCACCTTCACACTGA C CCAGGGATGAAGG CATG CCTGGAGG CCGAGCTGATTCTAATGGCTCCACTGCCCC TTGGACTATTAG GATG GT CAAA GTGCCTGCATGTCACC CAGTATGATGTGAC CTGATGGTAGGAACCATGCGTAG CCTGGGTTCATCT CAGG CAGG GT CA TGGACTCACT CAGCATATTACAGTTG CTGTT CT CAATC CAC CAGCCGCCCTCTCTGGTTAGC CGATTTGGGGTCTCCT GCCTTAGGTGGCTG GG CT CTGC CT CATTA CTT CC CC CCATCTCCTCCCTCATG GATGT CAATTC CAGGGACT CTGTGG TTGTTGGTGC CCTATCAC CACCGCGT GCAT TTTAGG GT CTGGGGGTAACTTGTCAC CT GATT TGGCTCTC CATTATTG GCCATGGGTTTTTGGTGTTG CT CT CTAGGTGCTCAGTT TTTCACTGTGTGTGTTGGAGGAGCAT CCTGAG CAGAGCAG ATCCCGACTTGGACAGAAACCTTCTTGGTTTCTCGGAGGAGTCTCCTCTGTGAGTCTTCCAGAGCCCGCCCTGAACCA CATGACTGGGAGCTCATCGGGCACCTCTCCTG GGAAAAAAAAATGCTAAGT CAC TAAG CAAAAG GT CAAGTTGTTTTC AGAAAATGTTTTTTAAAAAGAGAAGGAAAATTATGATATGGCTTATAAACATTTGTTTCAAAAATTCTTATTTCTAAA TGAAATTTAATGCCACTTGGAACTTTATGCAGAATAACTCGGTCTTCGATAAATCTGAATAAATGTACATATGTAAGT ATATATGTGTCTCTGGCTTT CTGTAACT CAAA TT CAGGGTTTAAGGTTTG CATTTTTTGAGAATGTTG( N ) xTGAAGG AACATACC TCAAAATAACATGG GCAAACATTATAAGAAAGAATGATGTTGTAGCAAAAGAAACTAT CAACAGAGTAAA CAGAC(N)xAGGGTATCCCTGTACTTATCGAGTATTAAATTATGTTGTGACTTCTTTCTTTCCTTGACTGTCCGTCGG GCCAGAG C CAGATTTTGG AT TC ATGCGTGTGC AGGC TG CTCAGTGTCCATGCTTGT CACGGGTGTTGG CATTGG AATC AGAGGCAGCATCTGCTCTGT CCTAGCTCACAGGCTT CCTCTGGCAGAGCACAGGGACCAGGG CT CCAGGTTAGG CTGC AGGTCAGGAACGAGTTAGTTCAGAATGAAGGCAGGTACTGTGTTTCATTTTGGTTTTATTCTGTTATGATTTCCTTGC TTCTCTTTCCTGGTGAGGCTTCATCTTACCTGTCCTTGGCCAGTCCTTCCCCAGATCTTTCTGGTAATGAGCTTTGTT AGCTCAATTAAATAATGCCACATTCCCACAAAAGCTCCGTGAGTGACAAAGGATTGGCTCCAAGCATCCAATTGCCAA GCCTCACATGGCTTCCTGAGTATCAGGTGACCTACATCCTTCCTGCATTTCTGAAAAGTCATCAGTTAAAATAACTCT GGAGCTC CAGGAAGACTGGG CAGCTGGTATAGCCCTTT GCAAAGGTGAAAGC CCTAAAAT CACACATGACAG CTGGTC AAGGTGTC CC CAAGAG CTGGTCAAGGTGTC CC CAAGAG CTGGT CAAACA CAAACTAGACCAAGGATGT CT GCACAT CA GAAAAGCT CCACTTAGAAAACAAG GAATAAAAA CAACACGGGAGAATAAGG(N)xCTCTTGCGGCAGACTTGCCAGGT GCCAAGCTCTCTCCCCTCCCCTCCCTGTGTGTTTCATGAGCTATTTCCCACTGGTACATCCTCCTCTTTACTCTCTGG CACAGCCAATCCTGCAGCTTCCCAAAAGTCCAGCCCAGTCTTCTCCATTCAAGCCAAATTGAGTCCTGCCAAGCTTTG TTCTCTGATAACTCACTCAAGTCTGGGGACCGACGTATATCTCCTAGATGTGACCCTGCCTCAGGTTTATAACATCAT TGAATCGG C CAAAATGAAGTTAAGAACC TTGGCTTC CATTGGCCTCTTCCT C CAAAGAAC CCAGATTCTGTG CAATGA TAAACCAG G CAACTGATAG GTACACAGCTTGGTGGCTT CTGCAGAGCCTC CTGCACTTGTAAACAT TCAT CATT CAAA CCCTGGGCAATGTTAAAA CAGCATCACATCACACAGTG CATTTCCCATTT CAAAG C CT CTGAAAATGACAGGAAGATG GTTCCCAGAG CACC CT CAGAGC CC CCTC CCAGAGAC CACAACAACTGGTTAC CAGAAA GAACACAG GTTATGTCTCTC GTTCCCACAACTTATGATAC CAGCGCACAGATAT CAGATGTGTAAAAGCCTGAC CCACAGCTTC CT TACAATATAAA C ACATCATTTCTACC CTATTTCTTTTCTTGGACTGGTGT CTGATGGGGAGAAC CTACACTTCCAC CACCAATAGGGAGG ATTAGTTTTCTGTTTCTTTTTTTGGT CTG CGATGTTA CTTATTTCTTAAT TAGCATTTGTGCTGACATTT CTCCTGCT ATCTTAGAGC CTAGAAAT GACTTATTTG GGGAAAGCTTAAAAAGGTATTCATTCAG CCTTTCTTTCCC CAGTGG GAAA TGCGATAGAGGAGAAAAACAATTG GATG CCCTTCAC TCACTCAAAATCTT CAAGAATTTAAACCAT CT CCAC CAGCAT GAAAAGTACTCAACAGTAAGGGGCATGGTAAGGAAGAAAACGGATCTAAATACATACGACTCCGTACAGAAGGGATGA GGGTTCTG TAG GGCAGATACACACAGATAC TCAAATATAGACTTCAGCATAAATAGATAC TT CATTTAAATACATT CG GCCAA CAGACATATGTG GAATTATGG GG CACCACTATTTGAAACTCA CAATGGCAC CAAGGC TGGC TCTGAC CACCAT CTCTACCTTTCC CATC CAAGAG CTGCTT CAGGTGGAGAGAGGTGAGGCAGAGAAAACTGATCATGATGACACAG CTGC TGCTGGT(N)xCCAGGTGGACAAATGAAGGTGGGAAGGTACAAAGGGGCCATGCAATATGCCCAAGGTCAATGATAGG GCTGAGATCTAGACCCAGCTTCATGCAGCTCTCTGCATATGTGCTAACCATGGAGTGC
> H s l O _ l 5267696 - 15287 05 0
TGTTGCAACCAG GAAG GGCGCCCTTTCATCATGCGGTGAGAGCCTCGGGCACTGCT GAGACAGAGCAAGGGAGGAACC AGGTTGTT CCTG CTGCACGAAC CAAAGAAAGCT C CACTGCAAGGCCTCTCAAGT CGGCCCACCT GC CC CTAAGCATGG AGGCCTTGGGATGCTCCTGGGTGAGGACATCTGCCCAGAGGGCTCTCCTTCTTAAGCCTGTCTCCTTGGGGTCTGACA GACAACAGTC CCTAAT GTGACAGCAG CC CAGCTGG G CTACCTGCTCAAG C CC CCTT CA CC CTGAGCTT CT CTTTCTGC CCTTCCCAGCCTCTTCCAGGGGCTTACCACTGCCTCTTAGTGTCCTCTGCACAAGCAAGCAGCTGAGGTGGCCGCTGG GACTTGCAGACTCAAGGCAAGGTCATGTGGTGAAGGCAAGCAGGACCCTACAGCCTCCTCCTGCCTCACCCCAGAAAC CCAGGCTTCTTC CTGCAGAAGC CG CACCACTG CAG CTG CGCCAACCTGGT CGTG CAGC CGGCATTT TAGGAG GTGG CT GGACAATGACATGTGC CT CT CTGTTC CAAT CAAC GC CTATGGCTGGGCCTGTTACCGGGACTGCAT GTGTGTGAAGGT TACGGGGTGATCTCTCTGGCATCATCAAGGGATGAAATTCTGCGTTACTGAAATTTCTACATCATTGATTGTTACATA
G(N)xTGAACCATCTGGGAGCTACTCTT CTACGTAACCTGACTCTTTGCCAAAACTTGAAAAGG CCAT TTATAAAAAT T CTAAAAAGCAT CATTTC CT CCTG CT CTGTGAATG(N)XCCCTACCCTCATGTTTATTTCCCTTTTCTTTCTTCCTTA TAACGTGT TATCAGTGTT CTGTTATCAACT CTGCTCCC CCACCACAACCAC CCCCCAGCTTC CCAC CTGGAG CTGTAT GCTATTGG GAAACACACAAC CTTATTACAGTT CTGT CTGAAATAAGATACA CAGGCAGTGTGAGGGACAGGAGGACAA AAT CAGATA CG GAACAGTGATACATATA( N ) xGGAACAGTGATATTTGAAATCCAAGTTCAGTATTGTTTTCTTCCAC ACCACACTCT CAGGTCAGTTTTTG TCTGAACTGT C A { H) xACCTTGAAGTATCATTCTTTAGAGTTAGACCTTACACA ATG GAAAG A CTCGG AGATATTTTC AAGT AT AAA C AATTTGTTGAACTAT CTT ACTAATTTTTTG AAAATG AGTAATGC
a t c c a g c t g g c a a g g t t c a c t g g g t g t a a a g c a t c a t g c a t t c t t a a t t t g g a a g c t a c a t t t a a c t a c a a t g a t t a t TACACATAACAACTGACAGG CAGGGAAAGATG TGTC CT GCAAGATCTGATGCTG TCTCCCAG CAAT CG CCAAGTGTGC TGCTTT C CTG CT TGGTTAC CAAC CAGTAAT GAAATCTACTTTTATTTG CAAAGTATAAAGAAAAGTGAGT TCAAAGAT TTAACAAATGG C CAT CAAAAGAGAAAAGAACTCTAAAGTATGAAAAAC GG CC C AAAA CAATGT CTAAT TT GAATATGG C CTAAGGAGAT CACAGTTA C CT GGCTTT TC CTGC CT GG CACATT TACT CTGC CTGGAAA CTCCCCCGTAGCAGC CAAT TAC CTT CTCACCATTGG GTAAGAAGCAGACAGAC CTGATATTACAAG CACTATTGACTTTGAAAAGAT CGGACCTGAA T CTCACATGTTAAGAGATTCTTAAAATCTC TTTTTGAGAAAGAGATTAGC CAAAGTACAAGAA CAAAGTG GGAGTA( N ) xAGCCCAAGTGACTGTTACTATGCTGAGATTTGTTAGCATGTCAGACCTCAGAACACTGGGCAGATAAACACCCAGC AGCATAAAGGGT CAATGGAC CATACAGA TGACTTCCCTGGATTGACATGATC CTAAAT GTGATA CC CAAGTCTG C CAC TTCCTAGTCGTGTGCGCTGAGG{ N ) xGTATTTTTCCAGAAAGAGGCTGCGGTGAATTCTACTAGGTCTATAAGCTGGG ACTCGGAAAGAAATGCCACAGC CACT CAGAAAT C CCAT TG CCAAAAGT T CTATCAATGAAGACGTGATTGT C CACACT GTGAAACATAGC CATTTCTACT CCTACAGCTACATT CACACT GTGAGAGATTTGGGAACATTTAGGGTGGGGGG C AAG GTCACTTGTGCTGGTGCAGGCATGGCCTTGGATTGGGGAACCTGCTGTTGTTTCATTTTTCAGCTGTGGGCTCTGCCT AATCACTAGCTT TCTTGGGCTGTCATGGCT CTGGATGAATAATTAGCA TGGAGTGAGCACTAGCTC GTTTGAGGAATA ATCTTCTTAAGAGGTATGCACCTCTCAGATCCATGGCAAACACTCATCATCATCATCATCGTCATCA(N) xCCATCAG GCGTTT CCAGAT CC CTGAATAT G CTAGATTTCTCTAGAAAACAGATGACTTG CTTC CTAGAGCTTT CATCTG CATTAA G CAATAATCT CCATAAGTTC CTACCAAC TGTATATATAGAATTAT CAATACTTAAGT CTATGTTATTGTATACTTTAA CAAAAT CAGCATAGAAT CACATTAAAAT CAAATAAAAAG C CATCAGCC CT TAAACCACATTCTT CAGTATGG CAGAAA CAGT GACAATTT CTACATTTATGACACTGTGGGATAGT CCTCTTTTTTTC CAAATTAAAGTCTACG GCTTTGAGTAGC TAGGTTT CCTTC TTTGGAATTGTTTTTC CATGAAATATGT CAGAAGTAAACATTC C TTGT TAAT TATATAGCAAAGTA TTTG CC CGGTGGTCAATTAT CTAATCAT TC TTGT TGAAAG CAAT CAAACTGAAGAG CAAAAGTTATGTGCAGATGGTT GGATTCTCATAGCTGTAGG CAAGTA CAT TT CAGGACTGGTTTAAA CAAGAATAAAATTAAGGGCAACT CATT GTAACC AACGAC CTTCTGTTTGAAGCAT GAAATCATGAATGTGTTGAATATTAGAAATTAA CA CGATGACTGTATT CAGCTGCA ACACATTCCCTGGGTATTTTTAGGGTAGTTTTTGAGAAAGATTCTTAAATTTAAGCAACTGATTTTTTTTTCTTTAAG CTTTAG CTGAGCTAT CGTAAGACTTACT GAAACAAGTTTTAGGAAAGAAG CAGTTATG GGAAA C CACAAAATACTGGC AGACTTATAGAAACTTTATTCCTTAAATAGATAGCTAGACGTATTAGGATACATAATAGCATTAATAAAATTTAATGA GAGACT TCTGAA TT TACAATAGATATATATACAT TT C C TGAAAGTAGGTCAGAATACAAG CTAATTAGCCCCATCTGG AGAGAAATTTTG TT CATTA CCCACTGAGGC CAATAC CCAT CTTG CAAT TACATGT CAGAAGGGACTGGGTG G GGAGCA CACAAGGATT GTGGGTGG CACAGCCATTAGTCCC CTGCATTG GAAAAATC CT CAG GAT GAAAGACAGCAT AACAGATA ATTTGCAGAGAATAACTGAG CGTTCT CATAACAGAGAAATTA CAAATC TGGAGACATG TG GTAG GATG C T T G { N ) xCG CCTCTCTAAATTATTTGCCCTTTTGTGGGTGTGGCAACTCTGTCAGGCAGAGTTGCCACACCCGCAAAATGCCCTAAT G CCATTT CCTAC CATGTGGCTGTAGCGGGTT CACTAGGTGGC CAGTGC TTGAAGCAACTT CTGAAATATTT C TGATAT ATTCAAGAAG GAAT CTGGTCAATTGT CAGACTATAGGC CAGGAGTCAG CG CACTCAAATG CCTACAGAT( N ) xT TTC A AATG CCTACA GATG CCATTGTGTCCTTG CCTAGAGG GATGTGTTGCTTAGTGACCTGG CTGGTT CCAAGCTTGA GACC TCTG CACTGACCACTGTTGCTG CCAGGGATGTCCTGTCCC CGAGATAAG CCCACGTTAGGATTCTGGGTG TTGTTAGG ATCTCAGCTCAAATGTCACTTCTCCAAAGGCTCTGACCAGCTAATGACTCTCTCCACCATCACTCTGCTTAATTGTTT CTCCGC CATCTGAAATAAAC TAGCTCATTCTCGGTCTCCT(N ) xGAAACAGGTGGTTGTTCTGCATTCCCAGATCAGC CCATCATTCTATTCAGCCACATTAAGTCTTGGCATTGAAGCGGGAATCCACAAAAATCTGATAAACTGGGAAAGGGGA TTTGGAGTCTGAAGAAATCACGTTTTGTGTCTCCCCCTTCCCTAGGATGTCCCATGAGTGTGGGAAACACTCAACTCT ACTGAG CTAAAGTT CCTTTCTGATAAAGAC GGGAGT CC C C CACC TCCAGG GTTGCTTTGG GAATAATGGTGTAAAATG TTGGTTTATAG GAATACC TGTTTTTCTAATG GATAAGCTCAG CTAAGAATGAATAGATAAAAA CAT TGTTTTTT C A ( N ) xCATTTTTTTTCCCTAAAGGATCTAAGTTGTGATCCCATTTTTCTTCCCCTTCAGTTTAGGAAAACTTCAAACTGCA TACTGGAAATGG CTG GGCTG CCAGATACAAATACGTGTACATGG CGTACTGCAGG GA CAAGCCAGACACTT C TGTGGA AGCTTCTAGTGAAG GACATT CTGCACATGCCGGGTGTATTCAGG{ N ) xAAAAATAATTTTAAAGGGGTGATTCAAAGG AAGGGGCAAAAAGAACCACGAT(N)xCCACGACATCAAAAATGTGATACATGGCTGGAG(N)xACGGGCCCTCTCCTA A CATTTACAGAACTTGAGAAGTTTCAGG CGTGATGAAG GCACAAAACTA CATT CGAAAGGAAAAAAAAAACATACAAA GTGTTGATCTTCTT TTAAAAAATCAT GTGT TAAGTGAGTG CTAACATAAAAGAGAT TATAGCTCATACAAAAGTATAA AACCAAAACAAACTTACGAAAGAAACAAGAAACAAAGCTACAGTTTCAAAATAAATTTCGAATTAAAAACATCAACTT AATGGTGTAC CTGATGACATAT{ N ) xATATTACTGAACTGAACTTGTCACTTGCAACATACATACACATGACATCCAT GGCTAGGTGGGGCTTCTTTCGAGCTTGGCATGACTATGATTCAATTCTTCCACTACCTTTCTCAAACTGAACTGTAAA ACTTCCCATCTCTCAACATACTTTAATTCCACCTGTCCTTATTATGCTGGTGTTGTATCATTTGTATATTACATTTAC CATGGTTCATTCTCATTTGT CCTATT CTAT C CAA CAGTGTGTTCATAAATGCAGAC TCAGGCAATAACATTATGTGAT CGTTTTAAATTCTGTATTGAATTCAGATGATGCAGAATATTTTCATGTTAAGTTGTAAAAGATCATGATCAAACCCCA AACT CCAAATTTGAAAAT CCTTGCAGATGATTCC CAG G CC TCAAAGAAG CGGTCTG GATCTACAGG CTTCAGTG GGAA ACGCTGAATCTCACTGGGGAAACCTTGGATACCT GGAACCTT CATCTCTCCCCAGTACCCATCCTCGGGGTCCTCACC TACAAG GTGAGGATAACACAGGTTG CTGGAAAGAGC CACGGTGTTCAGCCAG GAGATAGGAGTGAG CAGAGC CAGGGA G CACATTAGG CT CACACACTAGACGAACA CTCG CTTTGGACATCAGAGACTTACGTTC CTG CTGTACC CAAATTAAAA GGCACCACTTATTGTGAACT TTAAAGTTATTTGGAAG TT CAATC CCAAA CAATGATGT{ N ) xTCCAGCTAAATAGCAA CCTTGGGTGAGACTG GGATG TTAAATTTAAAAA CAG TAAG GCTTTCCTGTAC CTGT CCTTTTATAT CATG CTTC CAAA AAAAAAAAAAAAAT CACACACTGCATATGGGCTAAATAATAG CAGCG ATGAT GATGATAAGAGCGG CAAACACATACG
( N ) xCGGTGGCTGGGCTGCACACTGCCTCTATCCTGTTCACGAATGCTATCAGCTGTGTCTCATCCAGTGCAATATTC ACAT CTATCAATAATATT CTAC CTTCTAATTAC C CACT CAAGTGGAGG CAAC TGAAAG CAGGTGATA CTTAAAAGTAC TGAATTCTGAAATATAGTGGCTTTACTTTCCTAGTTCTTTTTCACCCAGGTTAATATTAGAATGATTGTGTTAGATGC TGTC CTTAAAAT CCAGGC C A ( N ) xCTTGAACCTGGGAGGCAGAGTGAGACTCTGACTTGAAAAAGAAAATCCAGGCCA AATAGCAGCAAACTAAGAGG GCCTTGAGGCTCCTGGGTGATGGG GAAAGC CC GCAGGAAGA CA CAAATTTGC CACAGA TAATCCACAAAGTGAATACTCCATAAGTGGTAACTAAGAATGAAGTGTAAAATAAAACCTCTACAACTCCAAGATCAC AAAAGCAGGCTATACCAAGACGGGAAATTCTAGT T CTACATAAAATGATACGTTAAGAGACCTAAG TC CCTAAGAATG GGGTTTTTAAA CAAAAGTG CCC C CTGAACC CGAT CATACTAGAACAAATGTGAT GCTATTATTT TATGAAAG TT CT CA TTTTGGG GGG AGAC AAAC A CAAAAGG CTTAATGATTCCTAGG AATG GATT GAT AAT AC AAGATTGTTTTTTG AGTTGC TAGGCAACACTGAAAAACAAATGTCGATGTTCTGAGATCACAGCTTGGTTGTTCTCCCATGGGCAGGACCAGTGCTAC CCGGCCTTCCTGCACTCCATTGGAGAACAGGAGTGGTACAGAGCCACTGCGATGCTGAGTCCACAGTGGCAACAGACA CCAACTGGCATCTCCATGAAAAGTCACAACTGACCGCCTGGCTGCCAAGATCTTTGGTGAGACAAAGTTTTCACTGAA CTGTCATCACACCCAAACAAGTCCCATGCTCTCCGGGTTCTTTGCTATCTGTGATCACTTCAATCGCTTTTTTTTTTT GCAGGTACATTACCCCCTTTTCCCCTACCGGTCTCATTAAAGCCCCATTCACCCCTAAGAGTGATTTTGACATCTGGG AAAGTñCAGACACTTTTC CTGñGAGTTATTTCTTTATT T CTAAG CCATAG CCTTGACT CTGAATAT TT CCACTG TAAA AAGGCCTATTTCCAATAACAGCTTTCTTAGGGATATGTTGATATTACCTTAGGTAAGTTGAATTCCTCCTTTGCTGAA TG CCCTGAACTACGGCAT CTATTCTTTTTCAAAAAATAAAAGATGATG CC CAAGGTTCTAAAAT CACGTCACACAT CT AC GCACAGACTTGGTAAAGAAATGAACTCC CTTCTTTCATTCTAATGT TTGGGGGCACAGTGTAGTACTG TTTATC CC TATAATCCAATCACCCACCTCAGTATTCCAATCTGGGTTACTTCACCTAGTCATGTGGCTTGTGATATATAGAGACAA AGATAAGCTAATATTGTAATAATGATTTCACTGAAGACTATTTTTTAAAGAGCA(N ) xGT TCAATAATAGTTTCATTT TCTTTACGTTGTAAACATAATTTTGTGCTTTAAGAACT CC CACT CTTTGCCGCCCTTCCCCCCCACCC CCAATTAAAA AC C CAATGAACCTGAAAAA CAATCTTAGAT CTGTGTTTTAAATGAGTCTC CACCTTCAAGGG CAAGTTGT C CGT CACT TAGGTAAAAGG ACT AATT AAATGAGCTTGT CATAGCCGTAAC ATTC AC CGTCTG AAGTGC AG ACGAAT CT AC TACAGG CCAATT GTGAGTGGGTATTCCT GAGAGCCTGACACGAATG CATCGTA CGTGGTAAGAG GC TGGTTCAGTAAC CACCAT TTCTATAGCAACCCCCACACACTCTCAGGCTGTGTTTTGAAGAGGCAGTGAAAACTGGATTCTACAATGTAGCTTCAC TCTTAGAATATGCTTTTATTTCTATCATCTCATTAGAATGAGAGTATCCAAATCATATAATCCAGGTTGCCATTAAAC AGGAGAAACTTATCAAATACAG GATC CGTAAATCTTTATTTATT CTTTTTGCCACTTGATGTCCATCG CTTTAAATTC AAA CCACTGACCTGGTTTGCCTGTGACCATT CCC CAGG GCAACAAGT CAGATGTCCGT CTGACAAC CT GTACAT CAGA CAGCCATCTGCTGCCCCAGTTACAGGATAAAGAAAGGGTAAACGTCTGTAAAATAGAGCTGACATAATCATCTCTTAC TGTCCAGAAAGGCTCAGCTAAGAGGCATTCGCTATGACTAACAAAACTTCCTTTGACAGACATGGGAAGGAAAACTGC TT CTTT CAAAG GTCAAAT TGGC CACT GCACAATAAATATT CACTTTATTTA(N )xACAAATTCATTTTAACCCATTCC CCTCACATACATCTTGACA CCCG GGACACAGAGAGGGCACAATTGGAT CATTACATATTC CG CT GATT CTTTAAG GAA AAGAGATCCACTAAATAGAGTGCTCCTTCGCTCTACGAATCACAGCATTTTAACAAAATAGCTTTAACAAAACATGTC AATGTGACAAAAGT CTCGATTC CAG TATG(N )xATATATGCAAGTCGTGCTTCTTAAAAAATTTATTTTA(N )xCACA CA CACACATAAAATTATTTTTATTGAGGGT CCATATAG CAATTAGC CAACTCATGC TGTTGTAACATG CTAAACAAAT TC AATGTTATTTGT ATTCTAAG ATGGTGTT AATCTAGTTG AT CACCTT CT AATG AATATATA GG TTGAAATATG AAGA TG CCAATTTCAAGAATGG GAAATTCTTTGCTTGT TCCAAAAGAAAAAAATTAAAAGACAACTGTGAAAAACG GCAG CA TCAAAAATTGATCC CACT CTCACGGGGTTTGAAAGAGAGCAT CTCTAGTTCTCTCT CCTAAGAGTAAACTTT CAAT TT CCTGGCAGTCAAGGATTAAAGTGTGTGAGTTAGAGTTAAAACATGGATATACTGAGTTAGTGTTAAAACGTGATTTTG AGGCAGG(N) xAACTACCATGTGATTGTATCATTTGAACAGAAGTCCCATCAAACCATAAAATAATTTCATCTTTCTT TACAGAGTTTGAGGTCTGGCCCTACAATGAAGGTGCTGTCACTCTGAGTATCAGTTGTCAACTCTGGCAAAATACAGG GGAGCTGTCAGCTAACAGGAAATATGGACATTGGGGCTGT CTGGGCACCC CAGTTTGCTC CTGTAGAGTGTG GAATAC ACTGGCACTGTAATATTAGAATGGAAAGAACTTCTGGAAGGCATTTAGACCAGAGGTCA(N)xACAGTTGGCCAAACC CAATTTAGATGAAAAATTATTAATAGATTCA CAAGTTG CTATCTGTGC CATTTCTGGCTGTTAAACAATT TGGAAT TG GC AGTT ATTT TGATTTAC CTC AAAAATAGATAAT TTAAAG TATT AAACTATTTG AG ATGATCGATT A CTGT AAT CATA AATAGCAGCTAATTATTAAATTGTTTTATGTGCTTAAGATGTGTTGGCCCATTCCCCATTTTTTTCCTGATCTAATAC CGTCTTTAG(N ) xTAACTAAAATATTTTAATTTCTACTTAGATAATATTACAATTAATGTAATAGCAACAAAAGTCTT ATGAAATATG CAGG CTTCTATT GAGTATAATAAGTGAAAAAACATATGGTTT CTTTGAAATGTC CATCTTAAAGTTTT CAGAAAACATTTTAATTCATTTGGTTATTTTTGAATGACTATCCACAAACGTTGGCTTTTTTCTGCATACATGCAATC TGTTACAACCAGGAATTTGGCAGTTACATTTTTCATAGTAAATGTGAAGTTAAAAAGACCATTTC(N ) xGGAAGCGGA AATAGGT CAACTTCA CAATGAAGATT CCCAATGTA CTGT CATGGTTATTTTTTCAGAGAG GAAACAGAGAATGG GC TA ACTTATATTT CAGAGGCT TCCTAACAATGCACACA CCT CTGTTT TCAC CAATTT C TG TA T( N) xAGTTCCATCTGCCA TGGACAT CAAAAAC TTCT TTAAGTTT GTTATTTGTGAACTGTATATAAAGT C CAC CGAG GAAAG CAAGGATG CT GC TT TCATACACCTGTGC CATATAGTTACAGTGCACACAACC CATA CAGGAATC CCTAGGGAAGGCTT CCTTATATTAG GAG CCCCAGCCTTATTCCTCCATATTCAAATATACTCATTAAAAAAGAGAAGGCAAAAATGAGAAGCAAAAAATAAACAGC AG GGATG GAGG CTC CAAGATAGAAAAAAAATCCAATAGAACTGT CCTC CCTCTAAACAAGAG CTTAAACATCAT CCTT TTATAACGTGTACATATACATATATACATAAAGGTAAATTCTATGTATACAGTCAATGTATACATTGACTGAG(N ) xA TCAGAG TATCTTAAATTG CCTTTTAAAGATATCTAAAT T(N )xCC C TA A A A TTA TTC TC A TG TG (N )xA TTG A TC A C A AG CATT CAGAATAAATAGACTT GGTGAGGG CATAGGGAGTAA CTGGTACC CACACAGA CTTGGT GAAGGT GCAGGGAG TGGCTGGTACCTGCACAGACTCGGTGAGGGTGCAGGGAGTAACAGGTCCCCGCACAGACTCAGGGAGGGCACAGGGAG TAACTGG CAC CCGCATAC CGTG CCGTTCAAGCTGATAGGACCATTG CAACGTGCACAATGTG CT CAGG ATGAGG CCGG GGAGGAGACTGGAAAGGAT CG G CTGTGCTCGTGAAAGTGACAAG CTGT C A TTTA C TTT( N } xGACTGGGGAGGTATTT TGTTTAGAAGAGATAAACTGTT CAGC CAGTTGACTTTG C
> H S 13 _ 42263335 -42280 08 6
ACATATTTTCTACTGACTGATTTGGAAGGAAGACTTGAT C CAACATGTTTA C CAG GAAATTATGTTTTTAAT TACC CC TC AAAC AGGCTTAC AGATTATAGGTC AAACTAAGGGAAAGAAA CATTTATCC G ATAATTT AAAAAATGGG TG CTTACC ATGAGATCTGGAAAACCAACAACTATTGTAGCATAATTTTTCTCATCTGAGAGAATTCTGTTGGGAGAGGCAATCTTT TGTT CCACAGCT GAA CTTAGAT GT T CAGAT GATAGTTGAT CACT GGAAAT TTTATGAGGTATGCTGAACT CTGT TTCT GAAAAGTACAAACATTAGA CAC CATATG CT CCTGAAAT GC TC CAGGAATTAATTTTTTG CAAGATTTTATTCTCAACA AGT C AAATGGAAAAGAAC AAGTTAAAAAAG AG AAAACATT TT AAATTACATT CATGGG CTTTTGTTTG AAAAAATGCT GTTT TTATAAT CAAGGCATCTTCCCAAAGTGATTTCAAGTATTTACAGAATTCAAAATTTTATTA (N ) xGCTCCCTGT AGAAAAGATGTAAAAACTGACCCAACCAAGGGTTGGCAATGGTATGTTTTAGATTTTTTTCGTAAGTACTATTCATCC CC CAAAACTGTGATAAATTAAATAGAAA TCCCAATAGT CTGCAGAAAAAAGATGTTAT CCTAAA TTAGAC TTGCAACC TAAATGTATATAAAAATACT GATCTGTCAGTTAAAAAAAT TCAAGGACAGAG CT TACAA CAGTGAACT TC CTTATGAC AATTTG AG ATACTAGATT CTTATTTACT ATTT CTGGTGTAGT GACTAAGT TTTTT CTTTTGGTGTAAACTTAAC AT AG GCTGGTGTTTTGTT CTTAAATGATTCTAGAAGACAGTATATG TACAAG CATG CACTOACACAGC CATAAACACAGATG ACATAGTATTTACCTTGTGTAACTCCAAACCCTGTGCTGGGCGGCTCCTCTTTCAGCACATACAACTGGCAAACCCCA CTAC GCTCAGAT TCGATGTGTG CAGGCT TAGT TAAAAGATATTT CCTG CAAACAAACAAAACAT CTAG CTTAAGTGAA ATTTTTCTTTTT CTAATAGTAATAAGGC CAGA CTAAGAAAAATTAGAT TCTC CT CAGT CT CTTT TTTATAGTTC CAGT ATAATTTG CAGTTTTAGAGTTT CATCATGAATTAACAAATAA CTT CTCAACACT GTGTTAAAGAG CAGTG CACAACAG CAAT AC AGT AATGAATGT GAGG C AAAAATC CAATGCTT CCTATAAGAG AAAGGGTTGTGG GG AGGGGAAG CAACTCTA ATGGGGTG CACT CTTATT AT CATGTAAC CCTCT CTTGATAAAAC TCTTTG AT CAG CCAGG AAG GGGGAGT ATGACTTC TGATTTGTGATCAG CTAG GAQATAGATG CC CT CACAACTG TAGT CTCC TTGATCTACTTATTTT TAACATTTCTAAAA TATGGATG TATATATAGG CAGAAATCCT CTGCAATGAGATACTT CTGCTTGT CAGCAGAG GGGCAGAAAGAGTC TGCA CTTAGCTAGTGGAGTTGTGCAT GAGGGAAGGT GC TCAAAGTCGACACCGC CTTT CATCAAGAACTGCACAGTC CAATG GGT CAGATAACATAAACATAAACATAA CCACATCCAAACCAGAATACACAAGATAAAAAGAGAAAGACACTTCTAAGG TGTGAACT CTGTGAATTTA CAAAGAGGCAACAGCT CTAG GATAGAGACTGTGG CT CTTTGAAG GGAAAAT CTA CTTTG AAGT GTCT C CAGTTTAGCAAGAATACAGTGAATT CTCAATGTA CAAATAGGAAAAAGC C CAAATAAATAT GGAT CATA TGAAAAAGT CATGATTTTTTAACTGTTAATTT CAATGT CATTTT CTGAGAAGTGACATTTATTT TTTCTC CCCT CCTC AATT CTTATAAGT CTCATATATAGAGATG CTG CCAAACACAGAAATTGTTAAAG GACAGACACAGGAAAGACAGAACA GTAAAGATGTTGGATAATTCATAAATAAGATTGAGCTACAACATACTCCATGATACACTATATAAAGCCACTCACTGA TT TGTTTTGCTC TC CACCAGAAGC CATT TGTC CT CTGCTACAAG GAAAACTGTCTTGAG GTTGATGGGAAGTGAGATG GTGTGAGTT CGC CCTTCT AGAACAT CCAGCAC AGTC AGGC TG TT CCTG GAAAAG AAGAAAACTGTATCTG AAAG GAAA TGGTGAGGATAATGGTTGAGTAAT CTG G CATAAT CATT T CAT GATACATTATGCAACC CATAAAAAGAATGTTTTAA( N)xTAAAAAAACAGATATGCAAAATTCTAAACTATGTCAGGGTGTGTATAAGTAAAAATAATGGAAAGTAGTAACCAT TG CC ATGTTAGT AGTGGCT ATGT C ATGG AATT GTGAGC AC TAGCTTTT CTTAT AT CTGTT TCTC AATATT CTGC AATG TGGG CAAGTTACTTTATATT AAAAC ATT CATAAT TTGGTT AC ATTCAC TT TATAGTAAAC AA GT ATAATC AAAATAGG AAATAGAATGGG CCAAATAAGAAC TCCAGTTTAAATCCTC TATAATAAAGAG CT CACC CTTTTT CTTT GTAGAACACC AG CCAGTTTTTGTGTGAAAATT CTTTACACATTTTATAAGTTTC CTAAAGACAAACAAAACAAGATATT CATATTAAA TGTGTCAGATACAGTCATGT CC CT CTTTAG CTACTTCC CC TTAAATAAAATAAAGTTCTT CAACTAGTAAAACATGTA AT TC TC AAACAAG CTATCTATTTATCTGCT CTAAGACT CTTATG GTAAAATGGATTAAAC TATATATCAAAATTTACT GTGAACTT TGTACTAAAG C C TCTT CAGAATAAAT TTTTAAAAACAACACCAAACATTGTTAAGAAATAACAGGCTATC TGAT GATTAACCATGGAAT T TACACCATAAAATAAACACATGGGAAACA CTT TGTTTCTTATAT TAGGATACATTTAT GACATTAAAAAAAATCACTCTTTT CATAGTCTTTTAAGAC CTAAGTAT GTTGACTTGCAG GG CAGAGGGTAAA CCTTA TTTTAAAT CCATTATTTTTTTCTGAATGGCAGAATAAATT CT CTTGTGTTGG CCTAAACTATTTACTTACTAGTTTAA TTTAGTTAATAAACTAAAACACTTATTTGTCTAATGGAGGAAATTAGGTATCATACTCACAGTTTCATCATATAAATC CAAAACATAACTACATAACTGGAAAACAGGTTAAATTGACTCCATGATCAGAAATTTGTTAAAAATACGATATTCAAA TAGTTTTTAATTTTCTCTTATTGTTAGTCCAGTCCTGGCAAATCTTCATGCAAGTATAATGTCATAAAAAGAATTTAT TGATATTTAATTATACACATATCTATGGTTATATATATCATGTTAAAATCTATACGTGCATAACAGGAAATTCTCTTG TGCATTACAACAAAAAAGGCACTTAAAGGGATGGGCATGCATGCATCTTTCTGGTTAGAAAGCTATAGAAATCCTTGG CAATTTGTGGGACAAATGTGGTGTTTCTTACAGCTTCTTCTTTGTTCCACCAGAAAGGTTTCTTAGATGTAAACTTCT CGGAAGGGAGGATGAGACGATGAAGGGCCCGGCCAGTAGTATCTAACAACAGGATAACATTACTCTGCAAATGATAAA AACAGTGAAAAGCTCAAGATTTGTTCTTTTTTTTTTTTTTTTTAAAGCAGCAACCTCTTGGCAACCTGACAGTGAAGG TTATAGTGATAAAT CAAAACAAAGA CAATATT GTAATACTTAAGAAATAT CACAAAGACATTAAGTGATT TTTCTTAA GAAT CAC AAAGAATTAAGTG CTT CTT AATTTAAG AATTTAAAAT TTTT GGTGGTTTT ATG AAGACGTTTCTAAATATG TGATG CAGAAGGTATTCT CTTTAAATAT TTTTATGTAAAAAC CATTCC CTTTTAAATATAAG CTAGTTGAACCATGAT TCTGTTTTATTATCTAGTTATAAGGAAATTACAAGGATTGATGATTTTAAGAAATTCATCTACCTGTAACAAAATTAT ACTTAGAGAATG CT CAGT TGAAAGCTCT CTTATATTGTGCATAATTAAAT CT CATTTATCTTTTTCACAAATCCAGGT TCAATTTA GAAAATAGGTATAAAATAGATATAAAGAGGACTAAG CAAGAAATATGAAGGACT CCAAGAAACTAG CTAG TAAT GAATGTTC CTTCAT TTAGAAATTCA C CAAG CTTGAAGAAC CTTCATAACT GGTG CTGTAATAAAGTACAAACAA CAGATTTC TGCAG C ATTATT AATC AC AG CT ATG C AATG ATGT AAT CTT TC AGTAATT AGG AT ACGTTTTTTT AAGAAA ATAAATAAACCTTATGTGTTGAAATAAATGAACTGAAATACATGAAATAAAAACTTAACATTGAGAAAGCCAG(N)xA TG CAAAGAAGTTGG CTTT TT AGTT AAAATATAAAGAATTT TG AAGATT AG AC TAACAGGTTTGC AGAAAGGT AC CTGC AAGTTATT TGTCAGACCT CATTGT TATGCGGGTGTCTTTTCTTTCTTTGTTTTATTTTATTTTAAATGTGGGGCTCCT CCAGTTAT CCTAA C TCTTGT GAA CTTTT CATT T CAAGAAGAATT TCTTTTGGAGAAAATAAAGT CACAAAGCA CATGA AGAATTAATAAAAACCTT GAAAAATATAGAAAA CAATAAAATGT GAAGTGTTTG(N)xTGAATGATTCATCTGCAAAT CTAGCATCTCAGTGTGGGTTTAATTTTATTTTACAAGCTAATTTTGTGGAAGTGTTCAAGAATTAAAATAGTATTTGC ACCATATAACTCACTTCACACCATATACTCTGAATTTTATTTGTATTAAAAAATTTTACTAAGCATGTTAACCAAGCT TAAATTT CTTTCATAAACAT CACC CTTAGAATGTATCATT CTGTATAAGATATAATTCTC CATTAAGAAT GTGTGTTA GAAATATT CTTT TAAAAATTAACATATAAGAAGTGT CATT CACCAC TACT TG A CAGTTTT GAAAATGGTAAT GATATC TTAGAGTTTT C C T TA A G (N ) xCTCTATTCTfiGATTAAATflTTTTTTGAATGAGTACTTTTTGATGGAAAAAATATTTC TTGTATAATAAATATGTACAA C GTTAAAATGAATT CTTAG CACTTAAGCAATTT GG GATAATAT TAG C CTTTGTAAG C AAAGGAAGTC CTAAAAATGTTT CATG CC CAflAAATGAGTAAG CATAAGAATAAG CAGTTGATTAAACAAATATGATCA AGCAGAAAAAGAAGTTAGTATC GT CTTC CATT TTC CTAC CTTTTTAGGGT CTAATACTTACATATT CATAACATGGCA TAAGTGAACTGAAAAACTGTAT CTTT CTAAGTGACAGAAAACATGTATTT CCAG CCACTG CTGTATTTAGAATTAGTT TATAAGCAATTGGCAACATGAAGGAGTTCTGTTGCTCCATCCTTTTAGATGTTAATGGTTTCCTTCTGTAAATGACTA ATGATAGATT CTGT CTGGGTTTAAAAATAGAATTTT GCGT CACACATGAGTACATGTAA CTGTGGCTTTGGCACTCCT TTAAGGG G GG CAGGGAGGAGACGCGAGTGAGCAATTTTATTATGTTTGTATTTAAC TG GGAGAAAAGTAACC TCTGGA TGAC CTCAAT GT CC CTAAATCTGAAATT CTAAACTTTGTACAGATGTAATAAATAATCTCATGG GGAGTGGGGGAAGG GGAGGGTCTTTATTTTTAGGCATGGCAAAGTTTCCCTTTCCTTACTTAAGATTTTATTTTAGCAAGGAATTTTTGAAC AACTT CTTAAACTAACCAGGGACATG CATCTGAGCAAAGCTTTATGTTAAAAAG GCAAGGAATAGC CGAATT CATGGA AGTACTTCTCTATAATATTTTG CTTCATTTTCAGTTAGGG TAGAATGAAGAAGAAGGAAT TGATGG CAGT CTACTTAT TTTTCATTCCACTAGAATTATTTATCAGTTGTTAAATTTCACAGCTGTGTGCTGTTCACATGGAATTTTACAGGTCTG A CAT CTTGACAGAGGTATATTGTAAAATAAGATACT CT CT CCGGGAAAAGAGAAGG CAGT TTAGAGGACAGGTAGAGA GGAAGTATACAGATAAGGTGTG CAGGTCTGCT CAGG GTTC CGAAGCAGTTTC CTGGAAAACTTGAAACTGTGGC CAAA GGGGTAAG TAAG GCAGTATATGAG GAGG CAAGATGCTGGC TATGGAAGTCAAGCACAG CTTCACAATGAAAAT CAGAT TTTCTTTCATGAAAGTAATTGAAATTAAATAAGGTATTAATCAGCAATGTTACCAAATAATTGGATCTTTTGAGGAAA TAAAACTCTTGC TGAGAAC CTACAAATC TGGCA CTTTTATTAAAGAATTTGTGTAT GTAAATTT GACAGT TAATTTGA CAGCTGAAAATGTGGCAGGACTGTTAAAATAAATATTCCAAATGTAGTCTCATCCAATTTTGACTGATTTACTAAATT A CAAACCACCAAGATAT CATGTAC CTTTTATGA CA CATTGTTTTGGTTATAGAGTTAAAT TCAATGG CAT CAATACTT TGTT GAAAAT CTTTAAAGTAAAAC CCTCATTC CAATTTT CAG GAGATAAAGT TTTATAAACAATAT CTGGATGTTTAT AAAT TCTG CT CCTAGGAGAAAG TAAAAGTAAAAATAAAAAAGTGAAAACAAAAT CAGAAAAATACTTATAACTCAGAA TTTGTGATAATTT CTGGTGGCAAG GACTAAAGTTAAAGTTGACTTATTTC CT CAAT GAACTACTGATATATTATGCAA TACTTTTCTCTTGATTAGTGAAAGTATTATTTCTTGGTAGCACTTTGCTCTTTAAATATTCATAAGTAACAACACTGC ATTTAAAAAATATTTTCCTATTAT GAGTA CTT TTTAAT TTTCAAACTAAT CTTT TT CC CCAAAT CTGTAGTGAG TGAG AGAGTAGGTGATGTTTAATTCTGGTAGGGTCAATCCAATATGCTAT( N ) xATATGCTTAGTGTAGTACTTGGTATATA CTAGATAGATAATAGGTATACAAAAACATTAG GCTCCTTTCTATTGGTAT TGTATAGATTAACTTAGACC TTTAGAAT GATOACTAATTGAAGCA CAATCTGTCTT CTGATATATGTGTTTTGTTTGGAATTATTTTTAGCCTGACTO CGGG CAT C TGTGCTTGTCACTAGTA CAATCTTAGTGTACTAATC CATAGTAAAT GACGGATAGAACTTGAGCAT CATTAGATACTG ATGGATCTCACTGTATTC CCTTTG GTGTATTTA CAT GACTGT CTCTACCAAAAAACGT CATTCGGGTCTCAGATAAAA TGTCGTCT CCTAAAAGGAAATATCTCAG CACT CTTTTTAATACCATTCAC CT CAATTC CT CCTACC CCAT CTACATCA C T C C ( N ) xACTTACCTGTTTCTAGAGGTATAGGTCATGTTGAAGTCTCTAGAGAAAAGAAGATATTAGCCTGCCTTTG TAGTACTGATAAAATGCAACTC CTGACC CATC CCTC CAGC TC CTCT CCTTGATCAT CATGAAAG CACATCTTTT CAGT TATAGGTTTCTAAT CACT CTTG CCTTTCATGTAAGT CATT CCTCA CATAATTGTAG CCTGGTGTGCATTACCTGCTGC T CATG GAGAACCAC TTGAC CTTTGAGAG GACTT CC CAG CG GTGCCACTGT CACAAAAGGGTGC CAAACGC CATTGGCT GTTCTTGGGAAGATATCAAAAAAGTCCACAAAGAAGCCACTTTTCCCAGTCATATTCATAAAGTACAGGGAAGCGGGA TTGCATGTAACTACATAGAGAGTATTTTGCTCATTTTCTGAAATGACAAAAAGGTGTTAATTTTGTGGCTGGCAACCA ATTCTCTAATTGGC CAAGAGGT CTAGTGAAAACTGAAT CATGAATACACAGCTG GAACAGAGTT G GAACT CAGCAGAT GATTTGTATCATGGCTTGATTAGGCTGAAAAGACCATGAGAAACAACTTAGTTTTGACCCTAAATGTAGATAAGGTCA CAC CTAC C TACTTGAAAAAATTG G TT A TC TA TTTG T( N ) xGTCTTTTCAATCTGGGTACCTAAGTCCATTTCCTTCCA TTTGAAGAAGT( N ) xCTCTGGACTAAGTTTCTCATAATCTCTTATGTGGGCACCGCTGAAGCAAGGGATTAAGCTACT TTTGCATAATGAATGGTATCAGCATCACCTATCACCCAGGTATCTATTTTACAACAGT( N ) xTGTGTCTGTGTGTGTA TG C A TC A TA TC ( N ) xCATATCACACATATTATATATAAAGATACAGTAAATTTAAATAAATAAGTGAACTGATAAGAA A CACAAGT CAA CAGAAGGACTGAAAAAGACCTATTTTT TTAGGAGACAATGCAAACTGTAGATGTTAGTT TCAATTAT CAGATAGAAC CACAGAAT TTATAATTTATGGATCTTTCATAATGCTTCCATTAT CAACACAAATAAGTTCTATT CAAC ATTTATG G CATAATATAG GGCACT GG GAATA CAGTAAAAGTG TTAACAGAAATGTACTTAATAAACTCTT TAAC TGTT TTCTCTTTTTGCAGGCATTAAATGCAGTTCAGTCTTTGTCTTAAACATACAGATACAAGATTTATATTTATCTTTTAC CAATTTAAGAAAAACTGTAAATTTTTAAAAGCACATATTTTTATTTAAAATAGCAGATTTGTGGAATAAACTACATCA ATAGAACTTATAGG CATAAAGACAAGATACCAATTTTTGT CT CAACAATT CT CATC TTAT CAAGTTAGAGAC TCTCTC CCTTCCCCCñGCCACCACTTCTGTTTTTATCACCTTAGTAACTGACAGCAGTTGAGGAACTTCTGCCAAAATGTACCC AGGGTCCACTG CAT GAAT CAGTAG GTAATGG CAACAGT CCAGAAGT GTGGAT GC CAAAAATTCACATT CTTTATTTGC CATCCTTTACAATTAGAAGTTTAATGACTGCAAATCACTCCAGCCCTACTTTCATGGTTCATGTAATCACCAAACTGG AATC CCAG CCAAAAAACC CAAT GTTT CT C CTTTTTATTTTTC CCTT CTAT TTTTAG CA CATGCCTG GGTGTGTTTAA C TTTGTGCTACAATTGTAATGAGAAACAGACATATGCATACAAACTCAGGAGTGCTTATGTGGATTCCACAATAGCATT T T TT TAGGTTGTTAGAGGAAATTAGTTC TAAATTACATATAATTGC TAT C CAAT CCAAGAATGACAGACACCAAGTCT ATCATAAACTTTTTGATGT CTAGAGATTATTTTAAT CCAT CTTTTAAAAAAACATGTG CAGGAAGGAAAGAT CAAAAG AATAAGCAGACATACCATGTGATG TAGCAATGTCA CAGAT TATATTAATT TCAT CCAATGGTATTCTC CATGAGGCAC ATTCTTCAGTAAAGTTTAGGGATCTTTCTTCATGTCTTTCTATTGGATACTCCTGTATATTTATAAGTGCTGGACCCT ATAATAAAGCAGTTATCT CAGTTAAGATATTTTGGTTGTCAC CAAAATGTAGAAAAGCTAAAGT TC TT CCAAAATATA
a a c t t t a a c a t t c a a g t a t t t a a t a t c c t g c t a t a c c t t t t t t c a a t t t a t t g a c a g g c t t a g t a c t c a t a a c a g t a a TAGTATCACACACTTGCACTTATAAATG CTTT CACATGTG CTATCTTATT TAATTT CT CATTTTTT CTTTTTA CTCAC
a a a a t a t a c c a g a t g t a a a a t c c t a t a g c a t t t c t c a t t t c t t c a t t c c t t c c t a a g g c g a g a g c c a g t t t t g a a c t c CAAGTATCAG GACG CAGAGCTGTCCAAACCAGGAGAATAATT CG CAAATGAATGAG CCAGAG CCTGATGCATTT CCAT TGTGAAGAAGGATTTTTATGTC ATTACACAGAAATGTAATGGTT AAAC CTGGGTATTT TAAATTAAAT GCAAAACATG TGG CATTTGAAATTTTTTAAAAAACCAAACTTAC TGATAACCA CTAGTGGTGACTTATGAAG GATAGAGTATTT CACA CATGGTGATAATTATTCAATAT AAAT ATAAG CAG AGGAGC TAATTT AATAATGTTAAG AATATT AGTT CTG GGT AC TT AGATATGACTT CTTGAAAGAATGCTATAAATGTAAGCAAAAACCAGTGATTT GCATGC CCTACAGT C CTTAATTAATC AAAAAT GTTTGGAC TAGATATT G GCATCCATGTC CCCCGC C (N ) xACAAGGAGGCTCTGACAAAAATGAACAGTTGGA TTACCACATGTCTGGGTGTGTTTCACTTTGTCCTACTGT(N)xACTGCTCTACTGTGCTGCCCTCAGCAGCAGGGATG ACAGCAGAAGCAGAAATACTGATCTTGAAATTACAAGTC(N)xTGCTTGAAATACCTAGAGGGACTTCTATTACAACT AGTAACG GACAGTT CTTTA CTT TACTTTCACATGAAGACT GAAC CTGG CAA CAATGTG TG GCTAAAAAGAG G CCAAAA AT CAAAATGCACAAAGTT TTAAT CCTATCAGCACAAGñCAAATACCACAGAG CAACTTTC CTAG GAAAATATGTAACA TACCTGACTCTCAT CATC CAATTAGCTCTTTAGAAATT CTATTTAGCCTT TAG GAATTAT CAACTGAGAGGC CA GC CT GATTTGAAAGTTAGAGCAGAAG CTAAAATAACAT CCGAACAAAT CATT TCTGAGATTGTTTGAATTATCT CCAGGGAA GATACAGCAAAATTTATAAGGT TCAATATTTTAAGAGAGAATAATGAT GATATCAA CCTT TATGTCTATATGATGAGT TT CCACTGGA CACAAGAGTTTTTGCATCCCACTT CTTGCCTGAC CAATTGTC CAGTAG CC CATGAACGTTTGTT CTGG CAGAGT CAAC CTGTTAAGGATG ATTTGCAAAGTAAATATTTCTT ATTGTACC C AAGTC AT AG AAAAAAAT AACACG GC ATCTATTATCAACCACACAAATAAATTTGAAGAGTGCTTTACTTAGAAGTGAGTTATACATCATTTTATTAAACGACA GTGTATAAAATCATGTTTAAAATATACTTTAAGACAAAAAAGAT CT CG CATTTCAGAAAC TAAGAAGGTAAG GCAAGG TC CTGTTTTTACCTTAATGACAAATACAATTGATTTACATACATAC TC CTTTGAAT TG CAGTAATTGAACAT CTAAAT TT CTGGTAGAGTATAGGAAAAC TTTT CTTTAAAATTGGAG CAAT CTAT CAATTTTTTCAGTC T A ( N) xATGAGCCACC GTGCCCAGACAATTCTTTTAAATTTGGAATCAAATAATGAAGAGTACACCATACTACTAGTCCCACATTTGTATGTTG TATAGGG CAAGTTT C ATAGACTTTATTTTG GCAT CTT AA (N ) xG ATG G AAT C ATT C TGTATTTAGATT CC CT TTTATT TAAAATGACTTTCAAATAAGGTGATTTGCCAAAACCAACATGGCAATACTAAAGTATAAAACAGCAGATAATTTTTTT TAAGTCTTCAC(N ) xAGTCTTCACTTTTTAATCATACTATCTTTTATTTGAGTCTTTAAAATAGGAACCATTGATTTG TCATTTAATGCATTAGAAAATTAGTGATTCAGAGTTAAAAGCTCTTTAAAAATAATATGCACATGGCTACAAATTGAA ATGTAT CCTT GGAATGAGTTTGATGAT CAT TAAAAGAACACG( N ) xGTAAAAAGTAGTCTGACATAATATTTCCATAT GCTTAATAAGATCATGTAAGGCTAAGATATTTTT TGCATAAAATAATG GAAG CAAACATT CTAGTAGAATAT GCTT TT CGTCCCTTTT GAATGCTAATGT TTAT C ACAAGAG G AACGAAATT AAAAAG AAAT TT AC AT TG AAAT TCTT CT AAGT CA TATGACATAAACCT CTCAAAAG CACACACACACT TTTGAATCTATCAT CTAAAATATATTTTAAGTAT CTGTACAATT ATCTCAATCGGGGACCCACTTACTTCCATCTTTTTACCAGGAAAGAATGTGTGTGTGCAGGGGGTCGGGGGAGGGTAC TG AATT AGGGG AAT AAAGGGTTTTAC AATT ACTGTGTT CT CT CT TTTG AATGT AAT AAAAAT AC ATTT CTGTGTTAAT AGGCCAGATCTTGAAAACATAGACTGGGATCTCTCTCTTGGTTTAGGGAATAAACTATATTTTACATTTGCTTTCAGC AGTTCACCAAAGGAGAATTGGGGCTCCAAATGTAGAAACAGAAAAATAACAGAAGGAGATACACTTAATATTCTGCCA GCATAGAAGACATAATGTA(N) xCCTTACAGCTTGGGGAGTCTGTGAGGATCAAGGAAGAGAGTAGAGAGAAGTGTCA CTGGTAAGTTTTATAGTCTTGCATTGTCTT GATACTCT GAAGTGAAAATAAATAAT CAGAAT GAAAATAT CTATAGTT AC CAACAGAACAGATCCC CTGATAGATACACATG CTTTATTAAAGCATATAAAAACAAGC TCTGAATACATC C
>H sl3_42174182-42195708
TGTTAGTGACATTCTGTTAGCTCTCATGCAGTTACTCATGAATAATGCTATTTCTCAGGAAGGTTTTCAAGTTGTCTT TGAGAGGCGT CCCAA CAAAGTATATT TAGCAAAAATTCTT CAAAGCTGTT TG CGTG TCACTAGC GC T CTT CATTAATT TCT CCTGACGTGGG GTTTT CAATATGTAAAG GTTTTGACTACTTAAATAATTTTAATG GT CACCTGGCAAAGACTGAG TTAGAGATGCCCATAGTCATGCAGCC CAGAAGTCAGGACTATATTT CTAGGCTG CACCTGGATGAAGTTGT C CT CT CT TGGACACTGACACCATGTCTTGAGTGATGAGCCATTCATGATGTGCTAAGAAAGAGTAACTGGCACTTTGCTGCTCTC CTGTGACATGTAGACGTT CCTTGACT CGGCTGTT CTCACC CTTCCCCTCTCTTTGCTCTG CACT CTTG CCTTAGTTGA CACATCTTCCTTTGGCTTGGGCTGTCCCTCTATTCAGATGACCACAAAAATACATATACAGATAGCCATATAAACATG AAACCCTATCTAGTCCTCACCTTGCTCCTGAACTCCAAATTCAGGTTTTGAAAACTTTAAACATCGTGCTTGATATTT TTA CATAGGT GTCC CAATAGTGAGAAAACATACTATAACATC CAAAGCAAA CATCTGCTTTCTTCACAAGTCTGTCCT CTTCCTGTGTCTCTTTCTGTCCTCCAGGGCTTGAAAACCTCCCATTTCCCTTCCCTCGCATTCAGTGTCACTTGTCTT TTTGTCCTACCTCCAAATTGCTTCTTGAATTTGTCTCATTCTTTCCATTTTCATTTCAATTACGCTGGCTCAGGTGCC TA T C A C T TC TC A C C C (N )x TGAAACTTTCTCCTCCATGGCTACAAAACTTCCTAAAGAAGATAGAACCAGCTCTCAGC TCATGCCTGAGACCACACAGTGTCAT( N ) x AAGCACTATTTATCTGAACAAAAGCTTATTTTGAAACACCACAGGGGA AAAAAGGAACATAATTGTTTGTTCATGGAGATACCATGCTAGCCAAAGCAATCAGAGAAAGGTTAATTTAGTTTAGCC TGAGAATGAAATTATGATTATAGATCAGAAAAACATTGGAGAGAAGATATGATTAAAATCAAGTGGTTTTTTGTAAGA ATGGATGTTTTCTTTTGTTAGAGCATGCATGGTGCAATAAGAGATTGAGATTCTGAACCAAGTATAATTGAAGTATTT ATGGACTGATGAGAAGATTCATGCAATCGATTCACAAAAGGGAAAAGGTTCAATAACTCAAAGAAGCCAAGGAAAGTC ACCGGTGATAGCTATCTTAATCGAGAGGAAAGACATGTACTGCAACATAGGCACATGTTAAAGCAGACATTAACATGG GTGGCAACAG CATTTCCTGCTC CTCAAATC CCAC CTG G AATC AATG AGTGGG CCTGAGGCTGTGCTTTG G AAAGTC AT CTCAAGTGATGATTTGCTAGTTCCTGGCCAAACTTTGTGCCGTTCACCAAGGAAACACCCATGGTTACTTGGATTACA TAGCAAC CTCAAAACAGC TGGAAGATATTAGTCAACTG GAGC CACACAG GACTG GCAGAAGATGGCAAAATGGGAC CA AATCAAAGGTTCCTTGCTTTGTTTTCAATCTAAAGTTCCATATTCTACACAAGGCCGTGTCCATGCTTGGTGAGTAGC TAG GTTTTAAACTAG CCCTATCTTCCTAAATGGAG GACTAGAAGGC CCAC CTGAGAATAACT CC CCTATGGATTAAAA AT AGATT CAATAAG GAAAA CACTTG AAAAATGCTGTGTTGGC ATTTG GAT AATG ATGC AAAATGTTTT ATGAGT ATAT GTAAGCATATGCTTTTTATGGAATTGTTGAGTTGGAAAAAGAGAAAAGAGACCCTAACAAACAATAAT( N) xAAGTAC AATGAAGGTAA CAATGAAGT GAAT C CTTCTATAGAGAATG CTGCATG CTAGG CATT GTTTAAGCATATCATCTGAGTT CT TCACT AA CATACTGGTAT CT GT CATTCC CATT TTA CAGAACC CAGGCTGT CT GG CTG CAGAGAATGA CAATAAT GA TGGAGGTGATCTTGGT( N ) xAGATAATAAAAACTCCCCTGAAATAATTñCTCTCATTGAGGATATTTAGTAAGTTATT GACT C CAACC CC CTTAAGGATTTT TAAAAAATGGGTG C TA C TTT G (N ) xTGCTAACTCTAATTCTAACCAAGGCTTCA TTTGTGTGTGATTGTGAGAAGT CAGAG CAGTGGAC CTGACTTTT CAT CTTTT GG CTGATTGTATAAA C ( N ) xCCAATT AAGACATCAACTAATACCAGAT GAGGAAT CTGATAC CTTCAATTAAAAATGTATTCTTATAT CC CTTTGTAG CACTGT
G (N)xCATTTCGTCTCATAGCATTGTGTGCAAAATTCTCTACAGAACTCAGGAAATTTAGTT(N)xCCATACTCATTC TCAAGTCTTA GGATAGGCACCATCTCCCATTACTCT CACCTC CT CTGAAT CATC CATTTTGG CC CTTAGC CAACTA TT GC CTTACCCT GAAG CTCAGAGC CAAC CTGG CAGTGT CCTG GT GC TCTCAATGTC CAGGGAAAGGAACACT TGTCAAAT GAGTGACTAT GTAAGAAGAT CCAATCATTCAATAT CATGTTGTTCA CATATAGT TTATATAAAATCTTATGCTG TTTT AT CCTAAATAACTG CATATAGATTTTTTTTAAAGTACATG GTTTAATTTTTACATTTCTTGGATTT TGAAATTA GACA TCTGTTTAATTATTATAAAGAT CTTTCCTCCT GGAAATGA CAGAACTTTTATTAAT CTCTGGATACGGATTAGATCTT TTGAAAAAATAAGGCTTTTCCAGAGTTTTGGGTTAAAATGGTGGATTAAAAACGGCATCACAGAAGTTCTAATCCCCA ATACTTAGCAAAAATAATAAAACTTC CCC CAGAAGGA CAATTACAAATAAGAAC CAAAAAGGGAAAGGCAAT CA CCTT GAAATACTTCATGGAAAAGAAACACAAGATTC CAGAGTAGAATG GAGAAACAGGGCACAG CTTGG CTCCCAACCTT CT T C CT TTGTCCTGAAATGAGG CTTTGCTTTC TCTT TAGACAAAAAAT CTGATT TTGT TTCTGATT TTAAG GAACACT CA GC CC CTC CTAAATTATTTGT GCAAAC TGAC CACAGAGCACAATTTC TAAT TT GAAATCTC CAAACAATAGT CATGC TT AG GAAATACTGAAGTGTCTTAATATTTTCAGGAG GAGTAG CAGTAC CAACTT GTTC CTCAGCAT TGAGAG CT TTGT GA AACTGCAAAAGGGCAGGCAATGTGAAAGCAAGGGGAGGGCAGAAAGAGGAGGAGACAGGGGACATACAGAATACATAC A C TG GATGAT CAAAAAGAAAGC CATGGGATAGTC CAGTTTTATG CAGACATTTT CACACT GAGACATTTTTT CGGACT GGCTAATTAGGGTTCAGATAATTTTCTCCAAGACTTTAGGAATGTGTCCAGGTTTGGGCCCAGGTTATTTTCTTGGTC CATAAGTAGAAGATTACCTGTAAATTATC CAGGATGATTCGGAG GGAGTGCACCTGTCGC CGAACAGCAC CT GAAAA C CTTT CATAGGTTGCAGCATC GTATTCACTCATTTGGATCTC CTTTAGCCTGAAAT CAGAAGAGTATAAAAAG TTAACA GC TTAAATCACGTATATGTGGGTTTTTACTTC CATT CGC CTC CTG CTGGAACTTTAG CAAA C TC CTTGGAAACT CAGA TT CATGTTGAGGAGAAAAGG CAAGAAGTGATGTCTC CAGCTGTG CT TCCAGATT CAAAGGAGGTAACATC{ N ) xC T A T AT CC CAAAG C CATATTGTGATGTTTG CAGO CGAT CAGTT CAT GTGG CAGATT CCTAAGAG GGTT CC CATGG GTT GC TT TT TAAGATCGAGAT CATGTATT TGGTTGAC CAAGATAACAGTTCAAGCCTAAAGAATGTAAGTTAATGTG CTGCTATG TATCCCTATGTGAGAGTTTTCTAAAATTGCTGACAGTGTAATTTAAACAGGACAATTGTATTTTAATATTATTGTGGC TT TGAAATTCTATCTTCCCAGTAT G CTGCT CCTT TTTAT CAGTT T T T A T ( N ) xCTGAAATAAAAATACAACTCAGCAT TA CTTAAAAT GCAG CAT CTTAAAC CGTCAGTAGAAAGTT CAGAGTGAACT CCCTTGGGTTTTACAGTGTTGTATCCAT AATACATTAACTTAATAATACATTTCTATAAATTACTATAAATATAAAAGGTACTGATATTA GGA CAAG GAATAATAA GTGCATATCTCAAAAGCCCAAGCATAGCATAATCTTCTAAACTAGTGTGTTATGTATATGCATGCAATGTTGGAGATA TGAATCCTAGGTGTATTGCTGAAACTCTTATCTAATGTGTATTTTTTATCACCTTGAGCTAAAACAGAAACCTCAAAT ACTGAATGAGAATAAATGTT CTGTATTCCTCCCTGT CAGGTT GAG CACTG GGAAAAATCCATTCAC CATG GGGGAGAG CG CCAGAGGCTTATTACAATTATGTAATGCTAGG CC TAGACTAC CATGGGGCAGAC TG A C A T { N ) xCAGACATCTACC ATGAAACTCG CATACTTTAT TG CT TTAATAGATAGG CCTGTTGC CC TCTG TTATG G T(N ) xGCACTTTGAAAACAGTG A CTATTAAAAGGTAAATACAGACAAGGCCACTATAATATC TGAATGGTTGTATCTTGAGGTGACATGCCAAG GTGC CT TT CTAAATCTAATTAACTCT CAGT C C TGAAGAATAAAAAGAAATATGTGTATAGAACTTGAC CT CCTGACAATACC TC TTGTATTTAGTTTC TGGCAGTC TCTGAAAT CT CT CTATGACTTT TTATTAGATTAAGTTATGTCT CAAGTAT CAAGGC TGACAGAAGCTGGG CACAAT GAACAATGCAGGACAT GCCACCTACC CCTT CATAC CTAAACCGGTGTGTGTTTATATG TCAATATGCTTCTTCTCAAATAAGTGAGATTAGCACTGATTAGAAACAAATGTAGACACAGCTGTCTGTGGGGTGGGG AGG GGATGGAGGAGAGAATATATAGAAATATGTGATTAAATTAG CGCATTATGGATA CGACT CTGTAAAAC C CAAGAG AGGT CAC CCAGCAAATGAAC TTTCTACTTACAGTTAGTTTAAGATG CTGCAT TTTAAATAAT CCTGAGTT GTATTTTT ATTT CAGCCATT CAATTCATTTTCAG CAT C CATTATATGTTAGGTAATATGC CATATACTT CTATAAATATAAAAGGT ACTGATATTAGGACAGGGAATAATGAACATACGTCTCAAAAGCCTAACCACAACACAATCTTCTAAACTACTGTGTGT GTGTGTGTGTGCACGCACACACACAT GCAATGTTGGAGGTAT GG GTTGTAGGTGT(N ) xTCAAGGAAGGAGCTAGCCA TT TGGTAAG C CACCTCGAGCAGAC TGATCTTT GCAGAGTGAT CAGAATTAGCTT CAGTTATAGG CCTACCAC TGGTGC GATCATGTTG CACC CAAAACACAC CACATG CT GT TTTAATTATGAAGAAAT CTGGAAACCATATTC CAGATTGT GT CG GT TTTCAGAAAGAT CACCTACAAG G CAAT CAAAG CTGATAAATT TTGAAAGATGAT GACTGAAG CAAGG GAT GTAGAA AGAAAAGAAATTTTAGCCTCTGGATCAAATGAGGTACCCATTGCTAATACTCTCAGGCCCACAAAACTAGCTGGAAAC ATAT CTTTCTG CTT CTTTAACC CTGAT CAAAACTG CAGG G CACAGCTGCT TCACAT CCTAGTAATCTTTGTG GACTAA CTGCATTTTGTTGGCACAGTATATTTTTAAAGGTTATATTTATTGTTACGTAAGAGTCTGGGAATTTTAGTTCTCATT TTACTTTTTCTTATAGGTCACACTTAAATACTATAATTTTGAAT CAATTAT AAG GAAAA CAAACATAAGC TTAGAT CC ACTGTGTGAGGC TGATTAGTTTTG CAAATGGTAGTG GATG CAAT CTCCACAGCT GGTGCCTACC CTAGACGCTTTTCT AAGTGTGTACATGTACCTCATTGTCTATTTCATTTGCTTTTCACTGGTATAATGTTTCTGTTCTATTTTATTCATGTT TTATGCTTGT CTCTCTCTGT TAGT CTGCCTGAGT TAATACAT TC TTGATATG TAAG GGCTGGTAT CTAGCTTAAAACC AGATATAAA CAGATGCTAT CAAAATG CTGTGTTGAT GATGTCAG GAACCCAC TTAGCAGCATTAGGCTCT CCAAGG CA CTACACAAAC TTTCTGGCTCACAG CTAGGACT GCCTAGTTTCAGCCAGTC TCTGAATTAAGTAC TAAAGGATGATT CC ATTACTTGTATGCACGTGTGTG{ N ) xGACCTCATTCTGCATATGCCTAAGTGGCAAAACCGTTTTAAAACCTAT( N ) x TTTTCCTCTTGAAGTTGGGGGGTCGGGTCCATCGCATTCATTATATGTTCCTCTCAACTTTTATGTTTGAAAAGTTCC TAGTAAAAGGTTGAAAAATTTT CAGCAGCATTAAATGAAATATT CACCAAATGGGATGAAAGTC CCATTT CTACATGA AAG TAGAAAG C AACATCTTT CT CT TG GTGTTTTG CTGTGCCC CATCTTT CT CAG GATCTC CATGTGTTT CTATTTC TT TCTCTCCTTTCTCACACGCTTGGCCAACTCCTCTGTCTTTCAACCACCTTACGGTATTCTCTTTTACTTCAACTGAGA CTCTTCTCTGCAGCATTTACTAGAGGTGTACATTTTCCTCA.TCCCTTCCCCATGTCCTCTCTTTCCTCTCTTACATCC GCTAAATT CATTTG CGTTAAGGGCACAGATGC C C CTTACTTCTTCCCCTGTAGGGCAATT GT CCTGAAGAGAAC CACA CTCCTTAC TC CTATTATGGCAACAGGAGAAGGGCGTA CATA CTTGTGGC CAGGCAC TGTTAC CT TT CT GC CT CC CTGT GAA CTATGTGTGTG CCAGAACACT CTTATCTACCTGT C CACTTGGGCAAAAAATTAGACTGAACAATTACTTGG CAGG TTTTAAGGGTGG CAACAACAGAACTCAT TAAT TG CG GAGGGGAGAATGACACATTT C CAATTGT TTAGTT CTTTTT CT AAATG GGTACTTTATAATTCATAAGAAATG CAGAGAACTTTCATT CTAA C AAGACTGTTCACTACACTATTCTCTACC C TTTTTTG AG AG G CA(w )xGAGGGTCTGAG CTGG CAGG CTTTCCTGAGGGTGTGGCGG CAGGGC CTGCAT TCTTGT CT CTATTGTCCTGTACTGTTTACTTG CAGGATGTGGAAGCACAGCATGTAATGTAATAC CAGATGTTCTACGTGGGAGGA ATTTCATAACTCATACATTTTAGCATGCTTTTTTCTAACTGCGCCCATCTCCTTTCATGCAGCACAAAAATGTAAATC TGTTTCC CAAAC TA CTAC CAGGTC CT GGAATC TAAAAGGAAATACTGGTGGTTC TTTGGTATTTTTCACT TTTAGTTG AAG CACACATGTGGATGT CTAGGGTAACAT CATGTG GATTCTTCTTATTT TTTAAATGGCTC CTAT TGTAGTGG CC CA GAACAGCTGGGGAATGTC CGAGGG CCACAGGCTCACTGGCCTAGATTAAGGC CT CTACTCACTGATGAGG CCTC CCTA TCCAAACT CACTGTCCCT CCTG CCGAGCTGGTAC CCATTATCCTGAGTGGG(33)xAGACTTACCTGAATTCCCTATGT TGCCTAGGAA CAGATAGACATG GATGGGG CGATTTGAG CTTTAGATTTTAAGGAGC CTGAGTAGAAAGAACAGGTGAT TTTGTTTATTTGGCTGTTTCCGTTAGGGACGGGGATGG CTGTGGATTTCAGT TTGAGGTCCGAGTTGCTGCATGAGGT GGAGGTGGAAACACACACAAAT CTAGGTTT CC CCACAGTCCATCATGACATT TCAC CATAGCAGTAGTGCTG GT CTAA ATGTGATGGGAGCACTGACCTCTGCTGGAATGCTCTCTGGCCCATCTCTCTAGCAGCTCTCTTGACCTCTTCGGGAAC TGCATCTTTCTCAGCCTGAGAGACCTGGTACACCGTATGGCCTGCATCCAGCCGGTAAGGGCCTCCTTTGCCACCCAG GCCTGCCGTGTCTCTTCCCCCTGGAAGGAAAACGTGGAAATTCAGATGATAAATCTGAACTTGACCACAGAACAAAAG TTGGGTCATGGT TTTAGGACAG CC TTGT GGflGGATATT AATTGT ACTTAACT AGT AAT TAAACAGT CAAATTA C CT TC t g g a a a a a g a t g t a t t c t c t t g t t t a g g a t g c t t t t c c t t t t t c c c a t c c a t t a a a a t a c t a c c t a t a t a t c a a g g c t TACTC CAG CTTCTATGACAC CTTAGCTAATGCTG CTAC CTCATAAGGATC CATACT CTTGTGTACTATTT GTTATC CT TTCATATTTATTTTTCTCACCAACTATATGATGGATGAGA( N ) xAGCCAAGATTAGAAGCCAATTATTACACCAACTT TGACTTCTGGTTTGAACAATATATAACATAAACACCATCAGCTTTAGAACAGTCTTTCAATAGAAACTCAAGTTTACA TAATACATATTACATATACATATTACGTTTTATAACATAACATTGTATGAACTGTTCATATTGTAAATGCCTGTAAGG CCAGGTAGGTCATATGATAAAAAGTGGTGGGGACATTGGCTAACTTAGGATTTACCTTGTCTAAATACATGAAAATTG
(N ) xATTTAGGGGTAGATTAGAATACTAGTAACCTGTTTCTTTTGTTCATTCACGTATAACTTGAGAACTTAGAAGGG AAT CCAAT ATTG GAGG GG C AGT AT AG T ATAAT AC ACTG CT ATTGTAGAGT AT AT TATG AT AT AGTT A (N ) xA T T AAAC AAAGCCTAG TTACACAGAATTAGG CAAT TT CTAGTTTTATAAAGTGGAAAATTTGAAGGCTTTTAGACTTTG GGTAAA TTATGTTACATT TTTCTTTCACCATCACCACCCCCACCTCCACCTTACCA CTGAGC TTTAACTGGT CTAACAAAACTA AAACCAAGAACT CAATATGAACTG CTTT CCTTATGAAAGTCCAGGTAGGT CATTTC CTTTTGGGAAATAC CATGAGAA CTG CCTATTAATTACACTACTTTAAT CTTTTGATTG GAACATATTTCACTTTTATTTA GAAATGGGTG GTGGAAAATT CCTTCTGGGATGTGTGTG ACTC CC ATTT CTTA CT CATGGCC AGCCAATAGGC AG AGGGGT AC AGTAACTGGA C AGTGA CAATGAGC TATAGGAGGAAAAG GTGTTT CT GGGATCTAACTCCTTTCCTACAGGAGGGTATC CT GAGTAT GGGTGGAT CATGAATT TGAG CTGAAAGC TT CC TAGGTG CTGAGATGTTATCATTCTGGTGAAAGGGAGAAAGTGGTGGAACCAT CA GAAGCTGGTGTGAGTGACTG GCCACCTGAGTCGCAGGG CCCCAGGCAAAGT CACACTGGGGTGGAGCAGGAGGTGCTC TTCATGATCTCCACTTCC CAGAGC CTTTAATTTGTACTATTTAAATGAAAGTGCTTAC CAGAG GTT CTAAGACTGCAG GGCATAAG C CTGGACATT CCAAAC CAGTTCAGAAGATAñTGGG GAGGGTGGGCAAGGGAG GAGCAT TGATGAAGñAAG G GGAATCAAATGAC TTGT TGGG CCAGAG CT CAGT TCAAAGTAAAGAATT C TGAG GGAAAAGAAT GG CAGAGAGAAAAA GAGAAATT GAAG CCAAGGAG CCTT GAT CAATTGAAAACTAAAAGC CAGGT GAAACAAATGTAGC CT CAAT CTGGACAG ATA CGTGAGCTAGTAAATTCAAGT TTGGTAA CAAATAGGAATTTCTCCCC CfiGAGTAGTC CACTGCTGGGAAAC CACA T CTGGCCAGCTCGT CC CACAATGCATTC CATATGGATTAAATCTTGTTG G CTTGACGCTAACTT CCAGAT CAAAGGAC ATTTCTAAAAATAG CCTTAC TTTAAAGCTGATGCTT GATGAATAAACAGAGTAG GCAAGAAGTC CACAGG C CTATT TG ATAGCACATTGTCTAAGCAGAGAAACCCATTTCCTCCCGAGGCTGTCAAGGCCAAGAGGCACAGACCATCCTAAGGAA GGTGAAGTACAGAG CTCCTGCCAGCACT CCTCATGGTGAGATAAGCTCAGACAGTG CGCACACAGTCTGGCCCTATTC CTGTGGACTCGCAGATGTGT CTGC TC CAGCTG CCGCAGAGGAAAAAGGAAGACGGCAG TC CTGG GAACAATG CC CTAG TTGGGCAGCCAAGAGGAAGGAGTCGGCCATCGGTTTAGACTGCAACCAGCCTTATCTCGAAATCTTGCATCCTCAACG TACTTCGAAGATGTGGAAGACAAGGACCATTTCTTTTCAGTCAGTTGCTGAAGAAAGAATTTACTGTGGATTTCTGGC ATGATCCCAAG GT CAT GGAATTTAAAAGAAGG GATTAACAC( N ) xCTCTTCACATGCCTTGATGTCACGCTGGGGCAG AGGGC AAGACAACAAT CCAACTGAGTAATGAGGGAGGCTGAGAAGGAAAG CCCGTCTCTGAGTCTTCT CAGGGG CGAT GTG CTGGC CT AATGGG CT TG GGAAAC CTGGGGAAGG CTGACTCAGGAGAATG CCTGTGCTAC CAAC CT GTTCCGCCAG CCCAAGTGTTGCCGCCCACGTGAGGCATGTTGTCTGGGTCCTCCTTCCCGTGTTTGGGGGAGCTTACATCTTCACCAC TGTCT CTG TTGATTGT TATCTAAAAATGAACAATTGAGACATTTTTACTATC CTGTGAC AAACA CCAGGGACTG CATG ACAGAAAAAAAAATTATT TT CT CC TAAG T C CTTAAGGTATCACATAACCT CAGG CCACTGGACTTGATAAATGCTGTG AGTATTTGG CAAAAATTC CATCAGATGGATTC CAACñT CCATCTAGTTGGGAAGAAGAAC TAAT TTTTTAAAAAAC T C TTAAAGAAAAGTAAAGGAGCTGGACTGCATTATAGCTACAGAATTTTTCTGGTGCAAACCAAGCTGTGAGCATTTACA CATGGCATGAGACTTACCAGATGCAGGGAAAATGGATCTCACTGGAGAGGGGTATTTTGCCAGTGGTTTAAAAGGATC ACCACCTAGCTGAT CCAG CTGAGT CATTATATACA CACTTñCCCATTCTGTTTT CAAAGAAATC CCATTACT CTGAGA GATGCCATTTTACATATAATATATGGAAAAGAAAAACAAAAAGTTATGTATGTATGCCTTAGTGATAGATTTATGTTA TATACTAAAAA C TG CT TATT TATTTT TTTGACCCACTT CAATTTTTATTATG GTATTT TGTAGCTT TC CTAATT CATC TTTATTGT AAAACTAGATTTATGATTCCTGCAATTC AAATAGAATAAGAAGTACTTTAGCCTTCCTCTGTGAGAGCCT
g a a g t t c a t t g c t t c c t a t t t c a t t c a g t a c a g t g t c a c c a a a a g g a g t g a c a a c c a c a c a g a a a t t t c a a t c c t a g t AGTCAGTT CAGTAGATACTACT CCACAATAAAT CAAAG CATTCACATTTTAT GCACAC CATGAG CCTG CT CT CAGATG CAATGATTTCCCAGAAGACAAGATCAAATTCTC ( N) xCTGAAGTCAGTCT'i’TAGAGTT AACC ATTCCATGTTGGTAC'A TGGGATTTTCAT TCTTGATGTTGGGATTAATG CAGTTAGG TAGGTAATTTGAGGT C TT TG CAAATAAAAT GGCCATTC AGAATAAAAT CAAGA CAACTTCTACTGTCTTACGTC CAAG TTGGTT CATTTCAAGTGTTATGACAT CGTAGGGATTTT AATTAGCTGGACTTTTCTGCAAATAGCTTCGGCAGGATCAACAAAGTTTTTGCCAAAACTTCAGCATACTCCAAGAAA ATTTAG CAGC CAGACATGACAATTGATATAAATT C CACACACATAATCATGCTCTAAAAATGGG TGAAAATAATTTTT CCTCCT GTTTTCTTAAA CAAGCAAGATAG CTAGGAACATTTAGAGAAAAAAAAAAGATTTAAAATACTTT CCCAAAAA CTTGCCAC CATT TT TCAACT CGTTTñTAAATAATGTAAGAGATAAG CAGTAAATTAAAATACATATACTGGCTACAAG AATTCCAGAAG(K)xCTCAGGCACAAGCTTGCCTCAGACCTGGAGATCCTGTTTAGAGATCTTTGCAGCACCCTCTAG ATGTACT C CATGAGAAATGAGAAGGACAAAGAACATTT CAAAACTGGTACAGTTTTATTC TGTT GCTAATTTAATAAA TCTATATGAG T(N)xGCGAAAAACAAAACAAAACAAAAAAACTATATGAAAGGTATGATTACAGTCTAAATTTTTTTA AAA CAAACAGAATGGAACTTACATATGGAGATACTA( N ) xAAGATACATTTTTTTTTAAATAATAAGAAGATACTTTA TTTTTTAATGATAAAGACACAATTTATTTTTAC'ITTTTA(N) xTGTAAGTCAAGGAGCATTTGTAGTTTGCTTATATG CGTTAAGA CT CT TCTGTAGG CCAATTCTAAAAGAGAAG TT CCAAAAATAGTCT AAA CAAT TGGAAGAACCATTCTGAA CACCATTC CT CACTTT GTGGGGCATGTGTGTGTATT CT GGTA CACTATTTAAAAATAAATTTTAAAAA
>H sl3_89372537-89409391
GTGGTTGGAAAGAAAATGATTAGTAGATAGAGAGAAAGAGGTTTAGTGTAGAGGCATAGGGTGGAAACTTTAGGAGTA TACATTAGTGAGAACAT C TTTGAATCCTGTAT GATTAC CAAACAGAAAGCAGCCAC CATTAACAA CGAAGAAGATAAA ATAACT CGAAAAGTTGATGT CAACAAACCTT CAT CAGAATTGGCACAATAAGTCCACAAGTC CAT CACTCATAT(N ) x AACACTGACC CAAG GTTATACAACAGCACATGAAGGAG CAGCACAAAATGAGGTCAGAGGAG G GAAGAAAAGATGGAT TTCAGAACAGATGATTAGCTTTAGAAATTAGGAGATATTTCCTCCATTCTGAAAGTGGGAAAAAGAAAAACGTGTGAA TGAAATTGCAGATACTATAGTTTTGTTAAATATGATTTTACTGTATGTTTGAATTTATGTAAATATTTTAATGAATGA ATATTATAAAGAATTTTAATACTTTGTAGTGTTC CATG GCATATAACAATATGAGTTTTAAAATAT CTTT GAAAAGAT TAATAAATTAGCAGAGTTTAATCCCAAATTGACAGTATTATTTTCAATTTTAGAATTGCAATAGGGTTGATTACCACT AAAAGAACTAGCAAGAAAAAAAGTCTTCTATAAGTGGAGAATAAATCAACACTATAAGAATTTGTTGA( N) xAAAAAT TTTATACTATGTATTATGAGCCTAGCACAGTGTTAGAGGTATAAAAAGAGATAAAATACAGACTCAATGATTAAGACT AAGCAAGAAGTT CTGC TCAACATAAAATATATAAAGAT TAAACAAATCTGAAATAGATTGTGGGAT CGTTAAAATATT GTATACAGAAAT T CAAAATGTCTTT AAAAATT G CTGTTATAAATAACTATAAATGAATGTGAGGAT TAAGATGTCAAA TCTAGATG CTATACAT GGTATCTTGGAATAAGAAGTTATATTTAAT GTCTGTGACTAGTT TT CATTTAG C CAGACTTC AGTAGATTCAAAATATTTATTTAACTTTCTTATTTAAATGACTTATAAGTCAATTTATTAAGTTGAAAGACTGAAAGG AAGTTTGATTTTTAAATGTTATTACTTAGAGTTCATAAAAAAATATGGAGTTAATATACATTTTTATTTAATTTGCTT AATTTGCTCATAAGAGATTTATTGCTCTATAATATTGAAAGCTTCCTGAGCAATTGATTAACATTTGCTTGTCTTTCA GTGGTTTT CTTT CAAAATAGGTGACAATGATTTTTTAAGAAT CCTCTAATTTATTC CCTAATTTATTCAT CTTTGTTT
{ N ) xTTTGCCTCCTTG CCAGAGTACAAAGTGAAAATATTT TACCAAATAACTTCACAGTGAATATT TATC CCATATTT CTCATATATTTAAAAAT CTT TTCATGAGCT CAAGAT TCTAAAATAGAGTTAACTAGATAATATT CAAAACAGTTAAAA TGAAAAGAAAATTACTTGGCAAAAATCCAG CTAT TT TATT TT CTTGGAAATATTGAGGACATTTTGGTAATCTTAGAT CTTCATTG TAGT TC TT GATTGTTCAAAAGTAT TTACATATGTCATTT(N)xGTTGGACAATTTGATATTCCCAAG(N) xACTTTCCAACTGCAATGTAGGCAGAGGTTCCCAA(N)xGTTTCTGGGTTACTCTGCCTTGAACATTGTCATCTCTTG CTGTATTTAGGATGACAGTTTTCCACTTCTACTGTTCCCGTGCATCCTTATGAGAAAACTTTCTGTTGGTGTAAGTCT GGAGT CTGAGAAAG GACT CAAGTCTTTAACATTAT CAATT TG GAATTTGAAATTT C TTTGAC CATGTAGTTTTCTTAA ATATTTAGTAGGTATATGAC CTAGTTGTTTTAGAAT CATG CATC TAGATGTAT CTGTGAAGGTTT CTTTTTTTTTTTT AATCTCTCATAGAGAT TTAATTTTT(NixGCTGAGCATGGTGGGCATACTTAGGAGGCTGAAGT(N)xGTATTAGATA GTGAGGCAATATTACAAAA(N) xTCATTCCACTCTGTATTTATTTACAAGCAAACAGATTAAAAACATTTTAGGAACC CTCACCACCCTGTGAACTGTATTTGCATGAGCCTGCTGTATAGATCCATTCTAGGAAATTAAAAACTAGAGTCTGAGA AGTGTT CTflGAAATGGGT TTTCGACTCTTGAAAT TCCTGTTTTT CCAAATTTATTAAT GT TTTT GTGCTGATTTTAAT AAAGTTCTTTTAATTCAGATATTTGAAAGTATACCCTTGACTACTGGGGAGGCCACTGCCCCAATTGACCTTCTGGGA GCATCTAAATAAGTAT GTTTGCTCTATTTTTC CTGTAAAGAGAG CAACATAAGGAGACTCAGGAGAAG CC CAGATGAT CCATGATGCTTGGTTTTTCTC(N)xCTTACTTCTCTATATATGCTGGTACATTCATTATTGGATAAGAAATTTTGTCT AGTTTTAAATCATATAGAGTGTCAGATATCATATTTATAACACTTCTTAATTGTATTCTGAAAAACAGTCTTTTTAGG AATAAC CACG CTTC CT CTATTCTCTAAGTC CCTTACATATAGTGACATACTAAATGCC CTGA GAAATATCAAATGTAA TGTTTTTCA(N)xGATTACCAGCAACAGGGATTATTGCCACTCCATCAGTAAATAAGAAACAGATACCAGGTCATACC TTACATACTACT TC CCAAACAGCACAAGTG CTGACATñTC CTAGGCTTGGAACAATGAGGTC CTCACTATCCTCCTTC TGCCCTTT CTTTTCATTCAG CCTGAG( N } xTTTTTTTTTGCTTTGACAAATCCATTCCATTGTATGTTCATGGCAATG CCATTACTTC CTGGTC TCAAAATCTTTAT CAGT(N)xAAAAAACCAAATTAGGCTAAATGCTTTATGAAAGAAAAAAG AGGGAGGAAAGAAAGAGATG G GGGAGATTGAGGT TTCCACATAT CTGACACGGAAGTTGG GAGCAATTAG CATCATGA CTT CCAG CTAATTT CTTATGATAGTTTTTG CTTATATTTG GC CATAGTTTGATCATACTTGTGAAT CAAACACTAGCA GATACATTGC CTCTATGT TGAGTATTAGGTAAAATGACTCAACATT CCACAGTAATAT CCAC TATTATATACCCTTAT AGTTCCTGAATTATAAA C TTTAATTTAGTT CAGT TTTTAAATTTAAGTAAATTGAAGTA CAAAAT CATAACTCTTTAA AAA CCAAAAGTT GTTACATG CCAATCTTTT CCCCTGATGG TCTCATATTTCATTGCAAAACAATGTATTTTAGGTAAT TACAGAAT TAGGAAACTACAAAACAGTGTTTCAAAGTGTTGGTTTC CAGCTTTGATATTTTTAC CTTTG CATATATCT ATATCTATATACACATATTTGAAAATGTCATCTTTCTACCTATAAAAATTACAATTCAGTAAAATTGTTTGCTAGTGT GATATAGAATAAA C TAATTC CTACAATACACAAATATTTACAAT TGAAATACTTCACTTGTT CATTGATAATCACTGT TCTTTCTACAAT TT AT A CATTAGTCTAT TC CT CTACAATACAATGTTTGGCAAGATTATT CATATAAGTATTAATGGA TTCACATAAT TT GTTTAAT CAATCTTTT TTATGTTGATTCTTTT TATGGAAAGAATGCTATGATAAAT CATGTTTATA TTTTAATATTGATGTG CñTCTCTGTTTT CT CCAAATA CATTT CTTAATGAAATGTTATAGGT CAGTAATTTAAACAGT TTTCAAAGATTTTCAAATTGTTCTGCAGAAGTTTTGGACTTCATTCCATTTATAAAAGAAGTGGTTGAGACAACAGTG CCTATCATTG CTAT CTAAC CCACCCTCTTG C CAACAATTTGAAAAGGAACTATT CTTTAAAAT C CTACAAAATGTTTG TTTAACTTTG CATT CATTTAGCTAAATGAGAG CCTAAAACTTTAAAAAATATTTTAATATGTATTG CTTGTGTGATTT CATAATTTTATTTTCAGCCAATTTTTCATATAGGTTATTATCTGTAATGATTTGAATTACAGTTAATCTCATAAAATG AAAATATTTT CT CT CATGTGACTTTAATAT TG A T( N ) xGGGGAATTTAAATGTCTAAATTTCTTAATTTTATCAGTGG TTTATATCATTCTCTCACATTAAACATTTATACATGGTGTACCTATTTGCATATTATATGTAATAAAATTGTCTATCA TCTTTATTAAAATAAT CTGTATATATTATCTTATAATTAAAACCATATTGATAATAAAAAAGAATT T CACACACTTTG TTCAGGATATTTTA GT CTATTTTTCTGAGC CATT TATTTTTATTTCAGAACTATAC TG CCTACAATGTATTTTTATGT TT AG AAATTC ATGCTT TT CACCAT AT AT TTTT C AAAAT TATGTTGTCTATTTTC AAAT ATGC AT T ATG C AATGGATGC TAATGTATTTTCCGTGATTCAAAGG(N)xCCTATAACAGATAAATATCTACTATGTTAACTTTTCTTTCTACAGAGCA GTAAGGTAGATCAGTACATACAAAATCAATTAAAGAGGAAAGCGCGGTACATCATTGCAAGGAAAATCATCAGTTTCT TTAG CCTACT CC CAGGTT CTGATAATTATC CTGACAAT CAAACCATGTATGATCATAAATAATC TC CATAATCGCAAA AGGGCTGTTCTCATTAAGTTTGGACTATTTATTTCTATAGATTCACCAAGTGTATTACTCTTGCTTGAGACTATTTTT AAACATACGTTATTGAGAAAACTAAGAC TAG ATAATTAAC TT CTATTTGTAAACTT TCACACAGACTCTTGGATTGAA CTTTTAAAAATT TCAACAATTATCCCTT CTAT TT TGACACAAAGAAAATAAGCT CAAGTCATTCCCT(N)xTCCCTTC CCACTAGCTACCAAGCTGGTTTCCTTGAGAGAGTTGTGTGGGCATTGATTTAACGTATCAAGTGTTTGTGTTCCATAA TAGT GTCCTTGT CAGAA CAGATTGTACAAGAAC CATA CT CAAATATGAAACCTTATATA CAAAAAGGAA CAAAGAAAA AAAGAAGCCAATTCATATTTACTCTTAGAAGCTGAGAAAAGACATGTACGATTTAAGCAGCCCATGAGGAACACTTCT CCAGTACACAAT TTAT TG CTCTATCAGGTGAG TT CT CATTTATGA CTTGGGAATAAGTTACACAAC{ N ) xATCTTTTT CTTTCCTCTTGCAATCTCTTCTTTGAACACCCTCAATCTGATTTTTCTTCTCTTTAGAGATTTCAGCTATTTCTCATA GAAAG G G GAAGAAATATATATGTTTTAGTACCTTTGTCTTG CAAAGGAAACTTT CT CAGACATTGACTAG GTGCAAAT GGAGCCTCTCTTGTATCTGTTTCTAGACTGAAAGGGAAAATGTTTGCTTTGGCTCATCTTTTATGTTGTAATGCCATT CATTACAGTCTC CAAT GAAAAAAGAACT TTATTGATTT CCATGCTAGTAATTTAAAAAGCAAATACA CAT TACAAATG TGAGGATTAACCTAATTTTTTTATTATAATTACATTTATGACTACTTTCCTGGAGATAATTCATTCTAACGTTATATA TATACACACACAAAGCATATTCTTCTCATTATTAAGAATCTTGTCACTGTTTCTAAAAACATAATAGCACTGACATTA TAATGCTAAGGCACATTTGACTCCTTTCACCTGACTTGCATTTGTCACCATGTTTACAGTGAATAATTCTGCCTTAGC CATACACTGTAAGTGATACCGGCCCTAGTTAAATGTTTTTCTGACACTTCAAACTGTTCCCTGGGCAATCACTGTTAA GCCAAGTTAGAGGAAAATAACATTGCTGCAGTAATAGGAGAGTTATAAAATGTCTCAATTCTTAAGTTTACCGTTAGT TAGTACATTGAACATT CTGTATGATAAGGA GAAAAATCTCACAGGACTTTGTGG CTAGAAATAACAAGACATGTGAAA ATTCTCTG CCAAATAGTC CATGTCCTGCAT CATTTTTCAG TGAATTATGCTTTGAGGCTT CATT TTAGGT TGCATTCT TCCCATTAGTTTCATTCTTCATCCTCAGAGGAGGAGAAAAATGGAGCAACAGAGTGTTCTCTGAAAAATTGTATACCC ATAGCATATCTCTTGAACATAGGAGCAGGAAAAAAATAACTCAAGGATTTCATTTGAATAAAAATGATTTATCTACTG AGTG CTGAAGGACATGAATATATTATGG GTGTCTTTTG CATTTGACTTTTATTT CTTT CTTAGCAAGTAGATGGCTTC ATAGAATG CTTAAAAAA CAATTCTAAAATAATATGATACATTTATTTCTTTCAATGACATAGAT TT T C TTATAAAATA TATT GAGC TATATATGAAATTTTAGACCAC TCTATATGTT CAAATTGATCCTAATATTATAATTAC CTTTGCATCTGC AAAATT CACT CCATG GTT CTATAAACTT TCATATGTTAA CATA C TTTTACATTTATAATTAATT GC TTTCAGTGTTAA ATTAATATTAA CTCTAAT CGTGAAGTAGACATGGGCAGTG GT TCAGGAATTAATTT TTGAAAATGAA CTTTCTTTCAT TAACACAACCATATATAATTATGTACAAAAATGGAAGTATTACTGTATATACTGTTTTTATGCTTGAGAAATTTTGCA ATGCTGTCCTTGTT CACATGTATAATTTTTTGAT GTATAT CGTGTATAGTTTTTTGTATTTAAAATAAC CAAATAATT TTCACATATATGTGATAACAAGATTTTGTTTTAGTTTATTTGATGGGGACTATGACACTTAACATAACACTTTTAACT GCCCTTTGTTTTTAATGCATTTAATATCTTCAGTACTAGATATTACTGAGGCCAGAGCTTGTGCAACTTAATATTTTC AGAT CAAT TCAGAGAAAGACCACTCTAAAT TATGTAGGATAAAA TAATGTGATAAT TAGAGAATAGAAAAGGTACAGG GTTCTTATTTAC TCTAAT TTGAATTCTAGGTT GAATGATTTAAGAAGAGCAGGTATTACT CAGC CATCTT TATTTTCC TAATACTTAG CACAGT CT TTGCCATACAGAAAT CAGAT TATTTATGAAGCAGTGGC TTTTGAGAAAGCAACTTACTAT TTTCTGAATT GTAAAC CTGACGGAATGTAAACTTATGTAT CGGTGATACTATTTAT CATT CAG CAAAATAATATAAGA ATGTAATTAGAGATTCTAGACAGATGAGTGTATGGGAAGTCAGTTAAGTGACACTCTGAAAGTTAAACA(N)xGTTAA ACAAAGT CCACCATGT(N ) xGGAGTTAAGCAAAAACATTGGAAGAATAAATAAAATTTGTCTCCCTTñGCATGGTCAA GAGATTCCTCTTGGATGCAGTTGTAGCTTCAGAAACTACAAATATTCCCACACATTTATCCTATATTCAGAGATATTT GCCTTGTGCTCCATGAAAGCAAAGGACATGTATTTTGCCCACCATTGTCAGATTTCCTGCACTTTGTATTTTTTAATA ATAATG TTTTTCATCA TTTTT CCTTCAACATCTCTTT C CAATTC( N ) xCTTACATTGTTTCTCACATCAAATGGATGG AATGTGATTATC TGAAG CATTAGTCATGTTTCTT CATT GTTTTTATAGTGCCTAGT CTTGT CTCACAATCATTTAGGA AATACT CACTACTTTTAT GATCAATGGATAATTGGC TG CATTTACAACTACAGTGT CCACTCTACTTGAC CAGCAGCA GTGGTCTTAGGTTTGAATTAAAAATCAATTTATGAGACAAAATAGGCTTCACCCAATACCAAGTTTATTTAGCTTGAG GAAT GAATAAACATGAGT CAAAGTCACG GAATGTATAG CAATAGTATAGTAGAGAT CACTGGACT CATAGG CACACAT TACAACTT CACAGTTT CT CTGCACTGG GATTCTGGATGTT CACAAGTGATAAGACTGTGTGGGT TT AAATGTTTCCAG TTCAGGGAAACCTTCGGAATCCCTTCTGAGTGTTCTCTTGCTAATTAAAAAGCACCCTCAAGGTTACATGGCTAATGC CCATTTTCTGGTGCAGCTGCATCTGATTATTCATCATTCAATAAATAACCTCGACAGGAAGTAGAGTTGTTTGTTTGA TACATGGGAATATGGCCAGGATGTCATTTTTCTTGGATCTAGTAGTATAAACCTATACACTTCTTGATCTAGTGTTGC AAGTAGAAGGAGGATTTTATTTAAGGTCAAAATTTCAGGCTTATATCAACCTAGAGGCACAGCAACAGAATCTAAGAA TAACCTTTTTTGTATAACTACCAAATTAATTTGAAAACTACTAACCATTATCATTTGAATGTGACTGTTTAAATCAAC ATCTTTGAGAGAGTCTGTTTTCAAATTTACTTTTTAAATAGACCCTGTTAATTCTTGCAGCCTATTTTTAAAAACCTC TGAAGTTATAATAACCT CATAATAAGTC GATATTTTAAAT GGATATTTA CAATTTTTTAT TTTG C CAAATTT CACAAT TT CATT CATAGACTGTC CAAGAAAAAATGTAT TGTTTAAGATGCTTAAGGATCTT C CT CAAGAAATCATGTGAT TT TT CT AAAAAGAT C AAC ATTT AAAAAC AGAT AAAT CTGTTGTG CT CCAATCAAATT C ATGG AGGGAAATTGTT AT AAAT AA TATATT G ACAGATAT CTT AAAT AACTAC AATT AT AGTT ATTATTAT AATGTT AG AC ATTAC CAT AGAT G AATGTTTGA ACTATAAGGAACACATGTGTCTAAGCTCTTAGTCTTATTTATTTTTAATAAGCAATAATTTAATGTCAATTTCAATAG CACAGAGAAT GTTT TTGCTTAACT TCCATATCTTTT TAACAAC CCT TTAAAAAAAGAGGTGCTA GAAAGGTT TATACC TACCTATGTTTTCATATATTTCACATACAATAATTAACTTTTTAAAGTTATATATGTCTACAATAGTTTTCATGAATC ATATAAACACATATATTT CAAAAGTTC AAACATAG ATT ATTAAAAG CCTAAGAT TAAAGAGTAT TAATTGGAAATC CA TATGTATACAGTTT GTTAAATA CACTCAAGTAAAG(N)xGCATCTGATTCAGTTAATTAACATAATCCAAGATGTAAT CAACAATTTCTTTCATTGAATGTTATTTGTGTTTGCTTTAATGTTCATAAGACAGTCTTATAGAAACACACTACAAAC AAAATT TCAATTAAACAGTTTATGACAATTTT TTAAAAAG CCA(N)xAAAAAGAATAGAGTTTCAAATGTTTTTTGCA GTGGGTTCTCATGAACCAGAAGTATAGAGCTGAAACATCACAGTCCCCTGAAGGGCTCCACCCAAGGATCGTAGGAAG GAAAAATCCTGCCAATAGGCAGAACATCG(N)xAAAAACCACACTCCTGACACAAAAGAAAACAGTCTCTATTCTCTT TATTTCAAACAGATAGACGT CTATATGG GAAATTTGTCACAATGTTGCTGGAACTG CTAG GTCAG CAAAAAAATAAGA TGGCGT CAAGATGTGCAAAAG GAT TCTGGCCCTGAT TC CAGTAGTTGTTG CTTT CATGGCAGAAGTCAGTTT TC CTTC TGGTCCTCAGTGGGTATGACAGAGTCAGCACAATTTCAGTGACTCTAGGCTGATTTGAGGTGGTGCATGTTAGAAAAA GAATTAAATG CAACATTATCTT CAGCTG CAGAGTGGGTGATGACTTACATA CTCTAGC CCACAGGTATCTCCTTTCTA TTTCTCTTGGCCCTGCCTCCTGCAAGAGTTTCTTTCTTTCCCTGCCCTATAGATTTTTTCTAATGATTTTGCAACAGA CGTGCTGTTTCCATACCCTCGATTGAAACATCTAGACTGCTTTTTGTTTTTATTTTTGCTCTAAACCTGGCAAAAACA ATAATAATTAAAACATCTAG C CAT TAATGGTATGTAACTT TAGTTAAGACTTATTAfi CTATTGACCATATAACCTTGC TGTT TTCCTGCTTTGCATGTGCAC CAGTATGACATGAGAAGCAACTTTTGAGTTAAGCTCTTCT T CAG CTGC CCTACT TGGCTTATCAGGAGATCCATGCCT CAATAATTATTCTTACAATTTTTGTTTAGTAACTATATAAAAAAC CTGGGTCAG ATTT AGTTGG AAAT AT AGAT ATTCTAAAT CTG ATG GGAAATT AATATCTC C C AAAAGAAAAGGCTATG CTTT AAAAGG TTAC TTGAATTTGCAATATGTATT CTTCAGTGTTAT TAGTTT CTTAGCTT CACAGACATTGTCTTTAACTGAGACTGG TGGTCAGATTTACAGCCTAATTTAATAGTGATTCGCAAATATTCACACTTTTCAATTTTGTCATCTTTTCTTTCAGCA TAATTTTATAATTTAAGACTAAACAGGATGAAATGAGTATTTAATCTATAATTTGAGTAGCCCAAAACTTGAATTACA ATAAGCAATAAACCAAAAATACAGGTTTCCATCTATGAAAGCAAGGGGAATACACGTGGCATATACACTCAGTAAGAA GAGACTAGAAACTGATA CTGTAAG CAOAGCTACAAC CTTTATTCATGTAAAACAGACTAAGAGGTAAAGGAT CTAT TA CCCTTCATAATATGCTCTCTCTTTCTCTCTCTATATATATTCTTCTCACTAATTTTATTACACATTCCTATCTTTGAG ATTCCAACATGTGACAAGAAAACAAGATGGAAATATATTCCAGAAGAGCAGATGTTATTGATTTAGAATCATTCATTT GAAAATGTTAAATGTCAAGTTTTT AAAGTTTTTAAT TAAATCTTATTGCCñTTTGATTTT ATAAG CATA C AAAC CAGG TTAGGT CAAAATTCATGTGT CCATGTGT{ N ) xTTTCAGTGTAATTGCTCCTCCAACTCTGCACGGGTCCAACACACAT TTTTACGGAAACAGAAAGTACTACAGTTTGGGTGTGAGAAATTTCTGGATGACAATTCAATTG (N ) xCTCAGTTTTTC CTATAG CGTT CCCTTCCACACAGC CAGGAAATAGAAGTAT TTTATATCTCTGAATCAC CGTTTT CCAAATAGGGAAAA TC AT TTTGGATT AAATTAAT GAAG AAAATTGTT CTACAATGTTAAG C CAT TG AG AC ATTT ATTTTTGT TGGT TATATA TT TT TATTGATATTTTTACTGATTATATTTATTGATATGTTTGGTAATAC CACACTAAATAAAAAGTATCAGACAACT CAAATTTTAC CCATTTTTAAAATTAATT TGAAGTGCAAAAGTATAG CAAATGCTTG GGAATTTTAAAGTTTACTAT TC
( N ) xTCTACCTTCAACTTATGACCCGGCAAAGAAAATTTCTAATTCTTTTGA{ N ) xAATAAACCTGACATTGTCAACA TAT CTCTTAT TGTTAAAACAA CTTT CAATTTAAGGAAAACAT TTTAT CATATTAATAAAATAACTCCAAAGG GACT TT ATGC AAT CATTATG AATGTAAAAAACTT ACTATATATTT C TT AAAAATTTTACTGT ATTT CCTATGTT ATGTTTTTGT TCATTTTTTAAAATGTTATGTATATAGGTGTAATGTAC CT CATAACATTGA CAAGAATTAAATGA CA CTAAAAAGT TA TGTTTTAGAGTTT C T CAAAGAGATTATG CTTTAAAATTAATGTTGTA CAAATGCTTTGATAAAC CAGAATGCACAAAA GTCCTGAAAACATTATTTATGACAGGGCTATTCACATACATTTTGATTATTCTGTGTAGAAGCTGAAAGGTACAAACA ACATATGTGATATAATCTTGTAGCTTCT CAGA GCAAAACAGCACCT TTGGCCTTAGATTT TGAAGCATAATG CTTATT TGTT ATAAGATCTT AAAG TAAACTG (N ) xG TT TGTGTT TT ACGCTAAAAT ATTT CAG C AT ATGATAA CTCTGTTGG GA AATAAAAACT GATCTGACTT CAAATAGAAGGAAGAGAC CACT G GGAA CAT TTTT TGTAGAGGTCATGCACCATTTATC TATTTG CTTTAACT CAGAAATT TCAAG GTGTATGTTAAA CAGATGC TGGAGAAC T CAATATTAAATAAAGTT CT CAGA AATC AG CTCC CTGTGAAGTGTT AAAAGAGTTTTG ATTTT C AATAATGCTT AGCATTTC AGTTTAT CTCAG AATGTT AA AAAGGCAATGTTGAATCAAACCTACAAAAGAATTTGATTAAACCTGAGATGAGATTCCCAACAGATGAAGCAATGCCC ACTAAGGAAAAAGTCATTGTAAGAAGTGCAGAACAGGTATCATGTATGGGTTACATGAGCAAGAGGCTGCCAGAGCAA TCATTTTGACTCCATGATTTGAATTAAACAAAATTATTATGTTAATATGCTGATTGTTTAATCAAATCCAATTATTTA TTGTTATGTTTTATTATCACAG TCATGT TTTCTTAGAC CTGCTCCTGAATTAT CAAAGGTA CAA CTGGTTCCAAAGGA GAATTGGTAAAGGAAATAAAAATCATATTTGAAGGCTCAAGACAAAGGTGTTATTTTAATTTACTTATTTTTATAGTT CAGATAAAATGAAATAATTTTTCTAACACAAAATTATCAGGGTCTTTTTAACAGGTGTGATCTCATGAAAAAATGAGC CAAATATTCATACC CACAAATCATGGATGGTCT CCT GAG CATGTTTAAAC CAACTAAATTTGGT CTAAGAAT CTTG CT TATTACTAGAAAAAAGTG CATAAACCAGAAAT CAATAGAAATTAAT TTAAAGAAGGAAAG CTATAGCT TATGTAAAGT TATC CTGCTGAAAGAATACTATTC TTTATTTCAAGT GC CATATGAATTTTAGAAAC TA T G ( N ) xCTATGTAAGAAGAG AAAAAATAAG CTATGCAT TT CTTTTCCAGAAT GAGAAATATGTCCAAGTAAAGAAGAAACA CAAGATGACTACT CT CC TCTTTATAAAGATATGAGAAAATTATTTTGTAAACATTTTTACAGTAAGTCAATTAAGAAAGTCATTTTTAAAATGAC AT CAGGGAAGTC ATTACAAAG C AAAATT ATGAT ATAAT GAAGTATC CCATTTCTTT TT CAGAAAATGT AAAT ATGAAA TATTTT CATTATGTTAAAAC AAAC ATTT CAAT AAC C CT AC TT AGGTTCAT AAAGTT TTTATATACAAT AATG CATT CA GAAAATTGTAAATACATG CATACATGTGTTTTfiTGT CATGAT CTAACTGTATCATTTAGTATAT T CCAAGAAGCTT TG TCAATGATAAAAATGT TTG G GAGAATG AAC CAAAGAATCAAT C ATG CT GTATTTAATGTG CCATTCTG GAAAAT CCAC TCATAACC CTTTACAGTAATTCTGATA CAATCTATAAACAATGTAG CATAATTT GTTTAAA CAATAAAGGAG TGGG CC TCTTCTATGGAATGACTATTACTTAGAGTATGTGATAGCAAAGCCCTCAAGGGTTAGCAGGAAACATTCTGGTTTTTG TT GTTACTGTCCAGA CAGAATTAT TGTTAGTACTTT CTGACACTTTTT CTGC CTTATTTTATTAAAGTTGGT TTTAAA TATGTCTGAACTTAAG TGTT GAGAC CAAATATTAAATAAACGATGT CCT CTGATAATT CTAAAACATAGATAAACTGA TAATG G GTTTTACAAGGATGT CAGAAAATTTTTTTT CCTAGGACAATCAGAAATTCAGAATGATAAATTT CTGGTGTA AGATGGTTGGGAAACAAAGAAACTTAATTTTTTGTCCTTTAAAAACTTTACATTATGAAAAATTTCAAACATATTCAA AGGTAAAAAAAATAATATGACT TTGATC CCTCTCAACTTAAG CATCAACAAAGGTCAATT TATTTTAT TTCCTTGTCC TAGTAT TATTTTGTGT CAAATACCAACT TAAAAAATTATTAC CAGT CTATTAACATAG TT CCTT TATATTATAATATA TCCAATCAGTTTTCCCATTTTT TGTTTCATAAAAAAATAAAAGTTAAGTTTATT C CAACTATGTAATAGC TGCTTTAG TCTCCAGTATGATCACTCCATTAGTATCTCTCTTTAGAAATG{ N ) xTTACATACTAACAATAATATAATGAGTGTTTT T T G C ( N ) xGTCACTTATAAACCACATTCTTAGTTTCATACAGTTATCTCCAAAAGGAGCCATGAACCTGAGTCTGATT TATACT TCAAGT CATT TTAACT CAGTTATATATACCTGCATGAATCAT CTGTAATGTTGT CTTTGTGTAGGTATATTT TCATTGTCATGAAATATGTGCTCTA(N ) xTGTGCTGTATTAATTGCTTTGGAGGTTTTTTTCATTAATATTATTTTTT AT CTAAAT CTACTTAGAT CTAAGT CATT CTCTTTATTTGG CAGACAGTATTC CAGCCATTGAGT CTAACT GAATAATT
( N ) xAATTTTAGTCTGAAAAGACATAACTTGCTATATGTAATTCACATGTCATCCTACAGTTTGTCACTTTGTTCATT TTATTTATTTATTTATTTTGATGT GAATGTGTGTGT GCCTGGGTGCAT GTC CATATGACGTG CTTGTACTTT TTTTAA TCTGATATATATGTGAGTGTTCTT TGTTTTAAGAATGTTT TT CATTTC CACATC C CAAAGTCATAAAGATAT TGTCAT ACATACATGTTTGCAAGAGCTTTGAAGGATTTGATTTTTTTCTTCTTCTTTTCTTTTC{ N ) xCTCTCTTTGGATTTAT TTTTCTGTATAG CAT CTGTTATTCATGT TATATACAATAAAAA CCTGATATTTCTAATAC CTA CTT TTAAATTATCTA GATATTTATCTTTGTTTT G GAATATCAC CTG GAACTT CTACCAATGTTT CAAGTGTA CTTATTTTTTGTTAGATTTTA ACTTTCTCTTAAATGTATACTATACTTTTCATCTTGGAGTTAGTTTTTGTTGTCATCAGTGTTATATAAAATCATAGG AAATGAAACTAAATAGACCTTACAAGTGAGTATTAGGTAAGAGGATAACACAACTATGTGAAAAAAATAAGATTAGAA CTTAAATCTTTTTTTAATAACTAGACGTATTGTACTGATTTGGTAAGTATAAATTCATCATGTGTCACAAGTGTAACA TATTGAGGAGATGAAAAATAAATGCTAATTACCATGAGAGGGCTTGTCATCTTCTGGATAAGTCAAAAGAAAAAAGCA TAGAGGAACCATATGTAATG GAAG CTTGTG CTA CAAAATAGTTACTAC CTTAGACTTT G GAAAG CACATCAGTTTGAA CC CCTGGTACTAGCAT GCTATTAACTAACATCTT TGGTAACT GAA CATATATGTAAACAT TTTT GGTAAAAAATTATT TGATGTTACATGTATAAACAAGGTTCTGGTTCACAGAAAAGACTTAAATGCTAACTATTTTATAGCATAGCATTTGTA AAATGCTACAGTAAGATT TTTTAAAAAAAACTT CTG TAT(N )xATTAGTCCAAGAACTATTGTTTTGTCATAAATATT AGAAACAT CTCT TCAACATACAG CACTT CTA CTTTATG G T{ N ) xCTGTCTGTGGCTAGCAATTAATTTCTTGATGAAC T T G A T (N ) xCACTGAATAAAACTTTGACTTACCTCGTTACAGACTGACATAAGTCCAGACACTAGATGCCAGATAGCA C(N )xAG TAATTTATAG TATTATTCATACTATAATAG C(N )xTTTG G CAAAAATG TAG AG TA AATACAA GTATTT(N ) xGCCACATGCTTTTCT GAAATT TGAGGTTGTTCCCTGTAC CACACTGGAATTGACACTAT CAAA TAGAAGATATGGAG GG GAGGTC TTAAAAAATAATTACC CCTT CTTTTTAAAAA ACCTTAACTTTAG TA CATTTTG ACATATTTTTTA(M )xA GT GATTTT TTCCCTGTTT CTTTTATTGTAATTTAAAAACCATAATTTC TATT TGATGCTT TTAAAAAT CT CTGC CAGG CCGGGCACAGTGGCTCAC( N ) xCTTAGCTTCCTAAGGGTCAATCCTGGAACAGCA(N)xTCAGAATTTTG AAGCAATA TAGATTTCTCTCTGG GAT CT TGTCAATGTCTACTGTGTCTGT CTCATTATCTAGATTT CCTTGTTCTCTTTCTGGCTT GC CTAAATGTATGC TT TAA CATTTTTAC TTAATGAGTATCAAAACTTCAGA CATCTGGGATGTTAGTCTTTCCT CATG TTTACACTAGGATCTGTTATTTTTGACAATTCCTATGGGCACGGTGATTTTCTCTTCTTCTCTTAATCAGGCCAGCCA CT T CAACAGCAGAACTGCAG GTACTCATGGCCTG CACAGC TCTGGAAT CACTACT CTG CCAT GAAACT C A G T (H )x C A CTAGTGTGTGTGTGTAGGTGTGTGTACG CAG GCATGTATG CTTAAGTTTTCATCTTGATACT CCATCTTACTCTCTTG GTCTCCCTAAACT(N)xAAGAAGTTCCTTGAATTATCTTTCAGATTCAATTAAGTTTTGGTAAGAAATCTAATGAACA TGTATCATAAGCACAAñGAATAGTAAAACTGTGATAACTAACAAAAAGACTT CTTATT CTTTTAAATTAG CAGATAAA ATAAT CAACTTCATAGATAC CTACAT C A T (W) xATTTAGAAATGCTCTACTTTGAAACAGTAGCTTCAAATTCTTCAT TCTTGCACTATAATTTCTGATTGTTACAGCAGTCTAATTTTGTTTCTTAGACTGTTCTTCAAACTCTAGCCCTCTGAT TT CAAT CT CAAT TCAAT CTCTGTAT CTATCTTCATTTCTTTC CCTAAAATATTT TTAT GATATT TGA CTACATTTAAG TCAAGATG GTCAACTT TTATAT CTATTATATATAAGTTAAGAAATACCACAT CTGACT CACTGTACTTATTT GACTAA TT TTGT TTTAAG GCAAGGT CTC TGAG TGATATTAGAGAAAATAACT CATATAGAATGTTTGCAGTAAACACAGT GT CC AATTTTAACTGACC CCACAAAG CTGT CCTAAAAT CT CGAGGTTTCACTATTATCTTTT CACATTTACT CCAT CAGGCA TTAGAAAG TTTCTC CACTAT CAAAACTACACCT C CATTATGC CTTTTTTCACATCAGCAAATGACCAAGGTGAATTAA AC CGTTAACTAGGAAACT CTTACATGTCTTGATCAT GTCCGCATTTGTAATAGCAGTCACAT CTACATTTCCTTTCCT CT TACAAAATAAAA GTATCTTTTCTC CCAAT CAAAACCAGTG CCTC CACGAGTGAACT CAAT CAGGTTTTAAAGAACA ACTGñTAATAAAC(N)xATTTTGTATTTCTAATGAGTTTTGGGAGGATGCTTAATGCTTCTGGCTCATAGGACATGCT TTATGTAAAATGAATATAACATATTTTAC CATATTAAACACC CTTTTTTCCTTATCTTTAA CTTCTTTAGTTTCTCCT TTA TTTTTA CATGTTTATTTAAAC T CAAGTTGCCAT CTTACATCTT
> H sl5_58794736 -58803474
AAACGTAT CATT TAGC CTGC CTTAACTC TTGCTT TGAGGT CACAAATGT CAAAGTGGTTAAC TC CAAGTCAC CTC CAA GAA CATTATTCCTATTGGGTTAAG GCTCCACCCTTTTGAGCT CTAATTATCT CCTTAATGTGTGAGAACT GG GGGAAT AT GTT C CCAAATTAGAAATAAGTT CT CTAAAACTAAGGCT TC CAATGCTGAAGAGAATGTACAAAATGAGAAAG GCAT ACTCAGTCCTTCACTTACACAGTCAAAGAGCAATTTTTAGGGCACTCCCATGTGCCAGACAGCCCAA(N)xCTTTCCT AATCATTCAACAGGTATTTGAG GTGCTGAAA CATTGTTTAAAAAAAAAAAAAAAAGCC CAGGTG CAAACACTTCTATC AGATGTGC CATG CATC AT TG A CAATAGTATA Cfl CATATTATTTATAATATGTATAGGTGC TTTATAAATATTAACT CA TTTTATCT CCATAATAAC TGAG CT ATGG AC A A <N ) xGTGTAAGAGAACTTGGTGGGTTCCTTAAACCCTGTTCAGGAT CTGTCACTGGATTTGCCTGTTGTGCATT CAGAGG CT GGAG AAGG AGTGGAGGGGGCAG AGTC CAAGGTAG GACACTAA GAGT CAG GGAGAGGTGATTTGTGAAAGG CTTTAGGGGCAGCGCCAGCCT CTGGATAA CAGGGACCCTGGTG GTACCAA GGAATCATTGGCAGGACT CT GG GAGATGAGGACTTGGT CTGCATGGGGCAGCACCT CGCATCTGATACTGGCGCATAT CAGCACCCTACTGGACACATTCAAACAGGCAGCACCCTGTGAGTGTCTCAGCACCAGCACATCAACCTTCCAATTCAC CAACTCACTCCAGGGAATGTGAAGGGCAGGGTTCCACTGGGCCGCACCACCCAGCCCGGGGTGGCCCCAAAGGCAGAG GGGATGTGAGTG CTGT CT CT GTGAGC CTAAGTAGGC C C TGCACCTGC CAGAGA CAGGG CACAGA GCAAGTGT CT GAGG GCAAGT CACC TG GACACAGCAGTG GCAGAAAC CAAG GAGTAGGCATGCCAGGCAG GTCTC CAGCAG CCTCTGAGTTTC CAGG CACAGGGC CCACAC CAGGGTGGAC TTTTTATATGTACATACACACCCACACACC CACACCCCCACC CAAA CACA CACACTGCAC CAGT CC CGTTTC CT CA GATGGACATTTTTCAGAAGAAAG CCAATGACACTAG CAAGTC CATGAC CCAT TT CAAGATATGACAGGACAT CATTTT CCAAGTGGAGAAAACAAAAATTCTCAGAAAGC CCTGTA TACCACAGAAAT CC ATACAC CAGTTT CAAGTCTCAT CCAACAGACCAG CCñGAGTGCAGGGCCACAGGGCAC CCTGGCTGGAGTGAACAAGC TC CCTC CAAA CC CC CT CAAGATTC TAGG C C CACCAG CATAAAAGT CCTCTTGATGGTGGGGT CATT CCAACC CCAT CT GGGGTTGTAACCTCTCAGAG CAGAGGAAAATAAACAGG CTGTTGTCAGG CAGCCCAGG GCAGAAGT GATGACAC CTGC T CAT TCTGAATTTATCTT TATTATA CATTATGGC CACCTGTGGACCATT CAGAAAAGATG CTTTTTGAGT CT CAAG CA AGTGACTCTTTAAGCCGAATCACAAAGAAATTCCCATTTTGGCCAGTTCTGTTTCTGGACGTCAACACCCCCCTCTCT GTAT GAATAAG GGGTCTTCCAGATGGCCAGGAACAG CACTCAACAGGCCAAAACAAAG CAGAGGGTGG CTTGAGAACT ACACTGCTACATTCAGCCTGAGGCCCAGCATCGACCTGGCCTTCTCCCACTTCCAACAACTCATTTTGTAGCTTTTTG GTTAAGAACAAATTTGGATTTCTT CTTTTTCTCCTC CCAGTAATCTCAAATGTAT CAGAAGAAAGGAAATTT CTAC CA TTATTGTCAGAAACAAGACAAGTAAAAGGC CAT C CT CAAATACTAGTGTTCTCTT CAC CAGACAGCAG CACACGTGGA GAGTAG CAGATCTCTAAG CACGACCCAGTGTG TAAC TC CAAATGGCCCCATTATCCTATCTC GAGGAGAG CTGCGCAT GCTGTACC CTGT TTTACATG GCCTGCCATGCT TCTGGTAGCAAAG CAGTAATCTCCTGGTTATG TAGCACTGGGGATG CCAATAACAG CCAGATAAAGAATAGAAC T CATGAGG CCAGTTATTTTCAGTCAAAC CAAG CTA CAAAAAC CACAGT CA TGCAGAAGCTGGAG GAAC CAC CAAGG CAAAGA GAAT GGAAATTCCCTTATGCTAGACAG CACAGGTCCCCAGTTTTCT GGGG CATC TGAAACTT GATGTGACAG CACTGG GAGAGG CCAACAGTC CACGAATAG GC CAGGAC CTAAGGG GAAAG GC TACCACTCTACAGCCGGCCATCAGGGTGCTGGGCTACAGCAGGGCTGGCAGGAGCAGACGGAGGCCACATGCCAACTC CAGCCTCTGCTACATATGGTCCCAGCCCCCCTACTCACCAGCCATGCGGCCTCAGGCAAGTGGGATAATCCCCGTGTC TCAGTTTCCT CATC TTTAñAAAGT GGGATAATAAAAGATTCATGTACTTTATAGG GTTGT TGTGAGGATCAAATGATT TAATATACAAAAAAAT GTTGACAACAGGGCTCAA CACAAAGTGTGCTCAACAGCTG CTGTTGGTATTGGTGGTGGTGG TGAATGGT CTGC CA GC TTGGTAGG GATGTCTATGAAGTG GGACAAACATTTCATGAGTATGAAGATAGACTGTT CAGA TGATACTCAT CTAATG CCATTGG CAG CT CCTCTCTTTATTGTACCAAGATCTGTTT CCTT CACG TTTCAC CCAACGTC AACCAG CC CTTC CCTT CTTTGGACTAAGTCTC CCTT CAACACATCCTCAAATGACA CAGTTTCCAGTCCCCTCACTCT TAGT CACC CT CCTT GGAACAAG CT CAAGTT G CTAAACAACTATCT CTAAAGCCAGTTC CCAC CCGAGC CAACATTC CA AGGC CAAAAACCAT CACCTCTATCATGAGTTGAGTATC CTGCTTT CATTAGGGGAC TG CACTGACGTGGCTG CAGGTT GCTGGTCAGCCCTAGGGAGCCAGTACCATACAGCCAGTACCCGCGAGCTCCCTCCTCACAAGAGGCGGCTGTTGCCTC CTTGTGCCAGAG CT CCTGGG CTTTGGATAGAGGAAT TC CACTGCTTG GGTCTCATC CTTACT CACCAACTTG GAGGAA CTTC CATAGAGTGGAT GG CAGC CGCCCCTCCCCACTCTGTCTCTGTGGTGTGTGCCCCACCTAGGCTGTGTTTGGGTG CC TGTAGTAC CT CTTC CTGTTCTTGT CT CAAACACACAC CTGTGGGGCTCCTTTCCCAGTACCGTG CTTCACAACGGG GCTCTC CTTAGC CCAGGAGG GGAGAGGCTGTG CAGAAAGGAGA CTTCTATGACACCCTTGGG GCAAGG GT GTTGTT TT GCT C CTTC CATAGATCTCTGAAGC CACAGCTATG CAGGAGAACAGAAATAGAAGC CAG CCACACCCAGTCTTCTTTGG CATGGG CACATTGAACGGGCT CAGTTCTCCTGCCA CGACTGAAAGGGCACC CT CC CA CAGAAG G CACATGAC CACTGT TC CAAGACAGTC CAGATCAG GAGAGAGG CCTT TGAC CAGGGCAAAGCAGGTGCTGATGAGAAGG CACAGCAAAGGTGG CC CAAACCACTCTG CCTATT CCCTATCCCTCC CCTGTTAACTCCA CTCA CAATTC CAAAC CCAGAGCCCTAG CTTCTG AGATATCCAGGTAGGC GACAAGATTTTCTT TCTCTACATGATACAGACTGCTGAGAGGCTCC CCTAACAAAG CACCAG TATT CT CC CT GAAGACACAGGCT CAAAT TATG CT CTAT T CAAAGCTGGCTGTGTT C CCAACT CCTAAG CC CTGCTG GC TCCCTCTGCCCCCCACCCTGACGT GG GG GAAGATGCTACAGGTGTGGGGG( N)xGGAGTGGGGAAACCAGGAGAACAG GGAAATAAAATTGGTTTT CACAATAG CTAAAC TT T C CC TATCCTCTTGAGAGTATTTCTAAG CAAAAGAGAAATATAC TT TT CTTTTTG CTTTCTCTAAAA(N ) xTCAAACTGACTTGATTTAGCTCATTATTAAACATGCAAATAAGAAAATTCC TGTT CTTGGAATGTGATCAGTCAT CTGCATAAGGTCTTG CGACATATCTACGATTGAATTATTT CTTTGAAT CAAGGA TTCCCCCGGC CACACACAAATCAGTACTTG CCTTACTTTGGAAAACGAAAAGAGCT CACTAAAG TGGTCCCT CTTATA ATAAG GTGAGAGTGGC CT CTAGGACAAATTTCAG CTTGTACAATTCTGGAATACAGAAAGGGTAG CTG GCTAGCGACC TTTTTCCATACTCCAC CAGG C A C TG (N ) xACTGCCACCCCATAGTCAAATTAAGCCGATCATCCCCAAAGCAATATTA GTCCAGTC CAGTTC CT CT CATGACAAATGAC CTGAACACAGGATT CAGCGAGTCTTACTCGCTGTGGT CATTTTGGAG GTAGGGAGAT CATTAACC CACATACTTTGATAAAATAC GAAACGC CTTTCCTAGCTGGTT CTTTTCCTCTCC TATAGG CCGACC CC CCTC CATC CCTT TATTAGTATTGAA CTGAACAAATTAGTTAATATGGGAACATTTGGATG TATCAACTTT GCT C TGAAACAAAAATGTAT TCAT CC CCAAACT CTATG CTGGGGGCTGAGGGAGAGTC CTTACGA CTGACATAGTC CC CAACAAGGTTCTCATTAGTCCCAGTTTTTTTTTGGACACCTTTTCCTCCGCATTCACATCCTTCTGCCCTCTCCAGCA CTGTGCAGGAAGAACCAACCAAC CTGAC CTTGGTGAAAGGGCTGTGG GTTTTTCAACC CTTCATTC CTGCTACTTT TT TTTTTTAATTTT TTAGTATT CTTCTGTG CATC CTTTCTTCTCTCCTAGGCTTCTCñCC CAGC CT CT CT GTTGAGTACT ACTTGTAGGCAGAGCTCCCTCATTCACAACATCCCAATGTATTTGTCCTGTATCAAAGCAAACCTTATACAGTAGCTC TTTTAAGAACTG CTAAAGGAAT GAAG GAACAAAACTGAGCCAAATAAAGCAAAATGAAGAGGATAT GTATGCAG GCAA TTGTGGGAACATAT CACCAGAAT CCTCTCTTGTTATTG TACCCAAAT CCACACATT CTCCTCAGTGTTGT CTGGGAAA TACATCAATCCACTGTGAATGGTTTAGATAGAATAGAATCCATCAATGTCACCAGCTTGATTCCTCCTCAGATGATGA GCCACTTTCTCCACTCCACCTCCAGGGCTGGCCCCAGTCTTCCTAAATGTTCATTAACTCCTATTAGGTATTCCGAAA CTCTCAGGTTTTCT( N ) xCCACGTGCTAACCTAATAGGTGTTCAGTAAATAAATATAATTTTCTATTTTTCCCTGCAA AATTCATT TCCCAAA CA CAAT CTT TATTCTTTTTTTTAA CATATAATAGC CATACTATGG CTTACCAACTC C CCAGTA AG C CAG GACTCT CTGTATAC GCAGTGTTTTGC CACATCATTTACATGGATGAAATTGTTT CCTATACT CCCAAATTTA CC CTGCTTGGTGAC CAAGAATCACTT CTGT CC CTATTACTñGTGATñT CATGTCTG CATTTAGAAACAGACAAGTGTT ATACAAAATTACATAATATGATAGGAATGCTGTCAATGGATATCTATTATTATATTTACCAGTCAAAATTTTAGTTGT TATAACTGGTATCAC CAATTAAAAAATTTTATAAGAAAGGGGTTTTTGTACCAT TGGATCTATTGAAT CATTAAAAAA AAAAAGAGTTTAGAAAAAGAAGAAGCTATTTATATACTATTT CATTCCATAAGACTTTAT CCAGTT CCACTGTTAATA AAATGTACATGCGCAGAATTGTATTGTTTT CT C (N ) xAAGGACACAGCTCCCTGGTATGGGAAACAGATACTTTATGT CT TC AGTTGT AAGGTCCACTGTTGTT CCTATTTAAT AT CTTTGAATGTGG AATATATCTT A C AATG CTGTG G GCTAGG GCATGGTTGTGACATTGATAAAAAGGGCTCTTA(N)xTATAATAAACAAGTAAGCAAAACATAAGATCATAACACGTC CTGAAAGGACCTCTCAGCAAGAGTGACACATTGCCAAGTAATGCACATTGTTCCACATGGAGTTGGTGAGGATAACAG CACTTCCTTCCAGA( M) xCTTCTCGGCTACATGAAATGGTACAAAATACGAATTAAATGATTATCCATGGTCTTGTGT AATTCTGGCATGTTTCCAGCTAAGACTTCTCAGCTGTTCATTCAACTCAAAGGCATCCTTCAGTCCTCTTCCCC
> H s l5 _ 588185 14 -5883 17 12
GTGTTTTGCAGTCAGACTGATTTCCTCATTAGCAGAGAGTGTTAACCGGTAGCCTAGTTTGGCTCCTCGAGGTGCCTG ATTCAGACAGAGCATCCCACAGCTGGAGCACCCTAGCTGGTGGAGGTAGGGACTCAGCCAAGTAGGCTCCAGAGCTGC CACCTAAGCCCAGTAGATTTCTCTTGCCCAGAACAACCTGGCCAAATGGAAGATGGGAGGCTGAAATCCTCACTCCAA GGAGGGGCCTCCTTTTCTTGGCACAAAGGCACCGCGCCAAGCTACACAGCTAAGGCAGTGTGCTACCTAAAGCAGGTG CACAGAATTCACAGAGAGGTACCAAATAGGCTGGCAAAGACCTTGGTGGATAAGTACCCACAAATATCTGGAAGGAAC TATGTT GG GTTGAGA CATGAGAAG GAAGAGAT CTTAGG CAAGTTTGGAAAG GGAAACAACA C ( N ) xCCATTGCTCCCT ATGGCAACCTGGAAACTGTCGGGTACCTGCCCGTGGTCTGAAGTGGGCAGGCAGCAGTCAGCTGGGCCTCCAGATATG GGTGGTGAAGGGAGGATACAGGTCCAAAAGAATATCTTAGGGGAAATAGTGCACATGCCCAGAAAACTTTTATAGTCT C A (N)xGAGTCACCAAGCGCCTCAGGAAAGCTGAAGGGCACAGGGTGCATGCTGCAGCCATGACCTCAACCTCTGTGA GTCTAGAGGTGACCATATTGGCCACACATAAGCTCACCTTCCCCCTACATTTCATAATCCTTCCCTCCCATCAGGGCC AGCCTTTGATTAAAATTCCCAAGATTATCTGAGGAGCTGGCGGTACCACCCACCAATGGAGTTCATCTGGCTGCCTTT ACACCAGCCACCTTATGGCTGGGTGG( N ) xGGGATCAGGGTGGGCCTTCAGTCTCTAAAAGCATACCCAAGAGCCAGC CCTCTCAGCCGCAGGTCCGCACATGGGGAAGCCCTAGGGAGGTTCAAGGACTAGATCAGGCCCATTTTCTCATCAGGC AG CCCC CAAACCTCTGCC CCAAAGAG CACATC CAACATGCTTTCAGCC CACACT CC CCAC CCACGGTTTACT( N ) xA T AACCTTGGGCTGTTTGGCCAAGTTAATGCTTAATGCTTTCCTCATGCTGCAAGCGTTTATCTTTGCCCAGATTCCAGC TCCAGGGAGGGGAACATTCCTGCAGAACAGCCAGGCGTTCCCAGGTCACCTCTGCTCTCACTCTCACTTTTCTTTTTG CATGATTAAATCCGGCCTAAAAACACGTCTCAGGGGCACATATCTGTCA(N ) xGAGCCCAAGCTCCTAACCGCTTCTG CTGTGCAGTAACAAGCCACGCTCAGGGCATGGGCCCAACTTGAGTTTCTTCCATCCACGGGTCCAGACTGGAAAGGAA GCAGCAATCTTCTCTTCTTTTCTCACTGAGTATAAGGCACCCAATTCATCTTCTGGCAGCTGACCTGGAGGACGGAGT GGCCCTGATTTTCCTTTTACAGGTACAATCACTGCCAATACTAAAGCAGTGCCTGTCCCTACCTGCCAGGGCACCCAG GATCCTTGTCTCCAGCAGTAACTTCATGAGGTAGGTGGCTCTGGGCTGAGACAGGACTTGCATATTCCTAGATCCCTT GACTGTGATGTCGAAGGTCCTCTGCCCCCACCCCTCCTTCTATTCCTTGTAAGCTTCACTTGGCAATTGTGTAGTTTA TGT CTGGTA CAGGAATGAAGGATAAG GTCAAATGAAGCT CAAAG CCAGGAGCTAGATAGT CATGATATATCAAATAAA GTTTTGGAACAAGAACACAATGGGATTAATTTGCCAATCAAAACAGCGCTTGGCAACTTGACTGTAAGCCAAGATCTC TCATGGCAAGAGCAGCTGAAGGGCCTGTGATTGTCCAGCTCCTGGGGAGCCTCCCTGGCCTAACCGTGGTCAGATGCC AGCATCACAGCCCCCTTAGCTGGGTAAGGATTCTCCTAGTGCTACCTCTGGGAGAGAAAGAAAAAAATCTAGTCTGAG AATGACAC CAGCAGACAG CAGCTCTG CCAGGTGTTT GAAGGGTGAAGGTCATGGGACTCTGT TTTTACTACCCTGGTC ACCCCAATTGTTTCGGGGTATTTTCCATAGACAAGAGGATGCCCCAGTCAGCTGTCTTCCGTGATTCAAGACCCAGTC TACTGT CTTCTGTGATT CAAGACCTGTTTC CT CTTGTG CCGATCAGAG GC CTTATTTGGAGG CATCGG CCTTTATGTG GCATGGAATGTAAGCCCTCTGAGCTTTAACAGAGATAGGAAGAGTCCTAGAACATGGCAGGGAAGAGGGTCGGAACCT CTGTCCTCCA( N } xGGTCCTTGGGGAGCAGTGGTTGGAGTGGGAGAAGGGCATCGCCCAGCTATTTTCCAACACATCG TAAAAGATATC(N)xACTGGAAGTGTAGATGGGTGGTGTTACCTATATCCTTGATTGATAAGAGAAGGAAAATGAATT TTTCCT CAAT AAGT ACAGGTTGTTTG AAATGC AGGCTGG GTG CAGCGACG CTGC ATGGTCTC CAGG CG CCTTGTGTTT CTAACCCTCTTGCTGGACAACTTGGCCTCCACAGGGTAGTCCCCAGGGAACCCTTGGCCATCTCCGGCCTGAAGGAAT CAGGGAAGACCTTCCGGACATGCCAAGTCTCCCTGAGACCCTGCTCCATGAGGGCTGGCTCCCTCCCCGACCTGGTCT TTTGGTTCAAAGTCAGTCCTGGAGTTATGTCTGCCAATTGACCCCAACCTCCAGAAAACTCAGAAACAGCCTGTCCTC TCñGGC CTGCAAAC CAGGAG CAGC CAGGCC CAG CCC TGACTTATG CAAGTAACTAC CACAGAAGG CCTGCTGTGCAGC CTATGC CG CTCAAC CTGGAT CAAG GG CAATGTGGAGAAACCACATTATTG CTTGAC TATAGTGCAGTATTCCAACAAT C A T T T T ( N ) xTCTGTTCAGTTCTCTCACCCTACCTTGAACCTACACAGGGTGGAATGTGCCTATGACCTCGGAAGGAC TT TAGGGAAGCAAG CCCCACTTTCCC CAGAGAGGATTC CAGTACAAACGGAAAG CT CTTC TGAATG GCATACTCTTTC ATAACCAGATTT CTTAAAAGAAAAAAATCATT CTGATT CTAG CC TGAAGTGCAGTAGAAT TCAAGAGAATTTAAGAGA AAAGTTTAGAAGGAAAAAATTTTT GG CAAGAAAAAAGACTCTACTTACTT TTTT CC CTGGGATCCACAAAAGAAGAAG TTGTGGCCTCAGGCTACAGTAGTATGGAGCCAAAATGGACTCAAAAAAAAAAAAAAAAAACCCCAGAATAAATCAGAC AT GAAAATAACTTT CCTT CCATC C CGTATCTCATTTTCTTATTTGACAAAAAAATAGTAAAGAGCAA CTACAA CAACA AAGAAGAGGTGAAACAAGAATGCAAAAAATGATGGGAAGATGGCAAC CTC CAAAATGTAATAAATTAATTTCAG CAGC TTTAAAC CTAAAGC TATTTG'l'AAAGAATAG CTAGAAAAGAAACC CACTCAAGA CAAA CTTAG GAAT TT CTTCTGAGTT C A ( N } xTAATTGAGGTTCGACATTATAGTGAAGTGATGCTAGGGTGTCACAAGAGGTGAAGGACAATACAAAGTTTAG TAA CTATTGTG CACAT T CAGAAATTGAGACA CAGAATTAC TT T CAAGAAAAAAGTAAATGGGAGGGT CAGGTGTCCAG GTCAGGGCAAAATTCAAGATGGTGGGAGTGGGAGAAACATTCCTCACCATGGCCTTGGGCAGAATCTGCTTATCTCCA GACAAAGCTCAG CAA CAG CTAGGAGCTGAAT CTGTAAG GT CCAG CAGTGACTAAATTC C CACACAAAT GTTTCCATCT TTC CAT CGTG CAAGAAATTCTGAAGCCAA CATTG CC CAGCTACAGATCATTGGCCTGACCA CAGAAGC CAG GGACAGA GCTGCGGTTACATT Tñ TTACATAGGATTGT TC CATC CTGAGCAGAGTACATGGCCTGGAGAGTTTT CAGGGCAATTAT GAAATTTCGGTCTTCCAGAGGACCCTTGCCAGGTCTAGGAATATTTTCAGCAGTGGATGTTTACAAAGCAAAAGCCCC CCTACACACACATTAG CAAATTAAATAATG CATGTTGTATGAAAAACCTTGTGAGG GG CATAGCAATACC CGAGGCTT CTTTCTTATGCACTTGTGTGCCCAGTACCAGCCAATACATGGAAAGGGCTCAACAAACACTGAACTGAGCAACAGCTT CTAGCCCAGGCATCATGGGCGGGAGGCAGATCCTGACTCCAATCGGGCCTCCTATGGTAGAATCCCTGGTTCTCTCCC ATGGAAGG TGCCCCACCT CAATTGCCTGCAGT CACC CATCTG CTGATGTCCTTACC CAAAAGAAAATGAAATAATTTC TTCTTG TATGAAATAT CAGGAACCCTCAAACCTAGGAG TTAACTAG GAATAGAAAAAT CACTTG CTGTAAGAAATAAT ATTAGAAATGTAATGGCTGGACCCAGCTTCTCCTTCTAACTAATTTTAAGGATTGTTTTGCTGCTCTAGGAACTTACT GTCATACAGAGGACAACCCCAAGCCTCCCTAGTCCATCAACCCCTTTTCCTGGATACAGGGGCTGAATCCCTTGATGG ATTTGATCACTTACCTCAATTTAATATTCCTCCTCCTAAAACCACAGTATCATCCCCAGATTCTTATCACTGGTGTGA CTA CCT CT TCTGAAGACATGAGAAAAGCTC CAGTTT CAGAAGGTATGCTTGGAGAATGTG CTGCTTCTAG CTTATCCA ATATGGAAC CAATTTC TT CACTCATAGCTAAAGCATAT CCTTTCCT CACAGCTTA(N ) xAAAATGCTGAAGATCAATT TGAATATATATGTATATATAAAGAGATATAAATAGGATCACACGTAAGGAAC( N ) xATATGAAACAGTTAAAAGTCTA TTG CAACACTATAATT TGAAGGGGAAAAAAAAGG CTAAGGA CATACAGTAGGCCACTGGG CAGACAAAGGAGATCATT CTGAG C CTGAGT CCTCAGTATTTGGTGACC C A G TA (N )xC T TC C CTTAGGATGCC CAAGTGT CCTAAGTGTTCTAGGA ATAAGC GACAGACT CTGATCTCATGGAGGCTCAGAC CCCTGGGCTT CAGGGCTCTATT CCTTCCACCC CTAAATGTGA CCTGAAGTGG CC CCTG CCTC CCTCAGTCCTAGAGGCTG CTGCTCAATTACAGTGAAGAAT CT CT TATAGACTCACGGT GACTTAAACGCTATTTGGTGCTGTGGCTGCTATTTCACTGAAGATTACAGGGCTCTCAGAGTTAACCACATAAAACTT GCTGCTATATATTTGTAACCTCCTTTTTCAATAACACATAGGGAAACCATCTTGGGTTTCAGGACCAAAAAAGTGTGA GTGCCTGGCCACCAG(N)xATAAGATCACATAGTGTACATTTTTGGGTGTTAATATTGACATTCCTATCTTTATTGTT TTA CTG CC CTTATTTT CT CTTTGTCCCAAGTT CT TCAACTAT GAATTTTTGCCAAC TACT TCAAGT TAA AA(N) xTAG ACTTGACTGATGCTTTATCTTCTCTCGACCTCTTCCCTAAAGCCCCTACGCAGGGCATTCCTCCGCTGCAACACAGCA ACCACTTTTAATAGTTGCTTTTCAA(N)xAATAAGTGTTTGTTGAGGAAGTGAATAAATGAATGAGCACATACCTGTA TACAGTATACAGCTTCCTCGCTCATACACATGTTAAAGAGAGCTAAAGCCTCCGGGAAGGGAGAAAAGAGAGGGCCCT GTTCAGTCCAGTTGTTTGTGGGGCTTTCCATCTGACAACTGTTTTCCATGGTCTCTCCTGAACAGCCTCGAAGCCTCA GCCACCTGGG CTGGAG CGTATCTCACAGCTAAAT CCAGAGTAAATATACACAAGACAGAATTGG TC CTATGAG GACAA CTGGGG TGAATT TC CA CAGAAAGAAAGGATTT CACTGATCAGAAATGTAAGATGCTAGATGCTC CAATTGGTACTTAT TCTAA CAACG CT CT CTTCTCTAGGTT( N } xTTTTCATATGACCCTCTTAACATTTGAGGCAGCTATCCTAAAATGAAG AAAGGTGTAGTGTTTCAGTC TCTGAATAAAAGAAAATGAG CTACATTAGGTCCAATGTATAAATGG CTGAAGATTGAG TGC CAG CCTTTGAT CCAAGATCTGGATCTGGCAGGACACCTCAC CG CTCACTAACATTTATTGAGCAC CCACCATTGT GCCCAGGG( N ) xACGAAAAGAGAGGAGGAGGTGGGGGATGAGGGATCATCACTCAGGTGGTGGCCTTGGCTGCAATAC CATGAAGCTGAAAGTCACTAACAGGTATAGTCATTTCTGTTTTAGACTCTTTGAGCAGCTGTG (2S!) xCACAGTGTGCA CGGGAACTAAGG CACAG GATTCTAGGGATATTTAG GAAAC CTGAAG CTTTGGACTGAT CGAGAAGC TGGCATAGCAGT GCAAGGTGACACCCCGGCTTCTGACTTAGACAGGAGGGCAGCCAGTGAAATCAGCAGGCTAGGTTGGAAGCAGGCTTC CAGGAGTGAGGGCAAGCTGAGACTCTCTATAAGCTCAGAAAAGCCAAAGAGAGCGGCAGAAATTTTCTTCATTATCTC AGAAAAGT TATT CAAATT CTTGCTCACACCTC CTTTTAATAAAATATGATGCATTT CATT CTGT CTT CATAGTGAAAA ATCATTGCCCTTTGCTGAGAATTAATAACCTTTTGAGTTAATACCCTTTGGGTAGGAAAATTGGTTTCATTAACACCT CCCCTTGCACTTTGTAGGTTCATCATCTCTTGGTCCTGCTGTCCCCTGAAACCAAAAGCCCAAGACATATTATTTGAG AAGCCTGAATGATTTTATTC TCAGAAAGAATGAT CAGGGTCCTTAT TAAATGGGCTGAATTGAGTT CACTGACCAAAA AGGTCAGAGCGGCCTTGTGTCTGCTCCTCAGCCGCAGGGGAGCCCACCTTCTACCAGAAGGAAGGGATGAGAGCTTCC TCTCCCAGCTCTTGGCCTGATCCCTAATCTTGGCAGAAAAGATGCGCCCATCTTGACATGCCACGCCATGGAAACGCG CCACCTTGCCAGGCTCTACCAGGCCTCTTGGCAGAGGTGTCCGTCTCTGGTGCAGCACCACAGACAGGACAGCTGTCG GGCTGCCTGGCAGAGTTGGATGGCCACTCACCCACCCCTCCAAAGTGGCTCATGGTCATAGTGAGGCTCCCCCAGAAG AGAGGCTGGAAGGCCCTTCGGTCAAACAACAGAAGGTGATATGTTAAAAACAATCTTCCTCTCCTTTTCCACCTCCCA CGGCCTTG GAAAGC CT CCGAATG GACATTC CT C TA C T(N ) xATGTATGCAAGGTTGCATGCCTGTGAGCCTCGACTCA TGTTGCTC CCTCAGCCACCC CTGACCCCCG CAA CAAAACACACACACACACTCAAT CATC CTGCAAGG CCAGGGTCAA CATCACATTC CCTGAAAAGC CCTCCCTGAATG CCACACCCTC CCTCAAGCGGCATC CACTACTCCCTCTGTGATTCCC CCAACC CTGC CC CACACT CACCCCTGTAC CTTGTCCCT GCAAGG CTTACCTGATATAATCTGTCTG CACGTGAGGACA AGTTGAGC CT CAGGAG CTAATACATAGTGT CACC CAGCTAAGATGCTCTATTCACTGT CT CT CTAGGTAGGGATGTAG GGA(N ) xTTCCTTATGACCTAGAGGGTGACTCTAGTGCTTGATTGGCTGGTCACTGGGATAACAGGCTAGAGGATCTG GTGATCTCAACCTC CTAGGC CACCCTCCAT CACTTG GC CATT CCACTGGAAACAGT CTTTGGGGCTCC CCACAGTTGT GGCTCATTCTGATCTCCAGCATCTCCCAGTAACCTCTTTGGCTTGTGCTTGTAGAAGCAGCCTTTGAGAAGACGGAGG GCTTCAGATGAAGCAGATGCCAGGCTAAGCACCGTCCCCAATCTTATATTGCAGAGCCATTTGGAAGAAGAGCTCAAG CTGTTGAAA CAAA CAAAACG CTGCATGAGATGAAGACCAGATTC CTGCTCTTTGGAGAAACCAAT CAG GG CTGTCAGA TTCGAATCAATCATCCGGACACGTTACAGGAGTGCGGCTTCAACTCCTCCCTGCCTCTGGTGATGATAATCCACGGGT GGT CGGTAGGAAATG CTGA CATG CCGTTTTTCT CTCCGATTT CACATTTTCTTTTT TTCTTTCTAG CGTGTTCATGTT CATAAAAAGATAGGGAGCTGG(N) xCCACACAGCCCATTGGGGCCCACAAGGGATTGCAGTCCGTGCAGGAAGAAGCC TCCTTCCT GAAT CTGCATGTTACACACAG GGGGACATAGGAAGC CAGAAGAATG GACT CC CAAT GAAATGT(N) xGGC TT TT GGTATAAAAGAGTGTTTTTTAGGT GT CC CAGAGC CT CTGT GAAATT CT CC CAGT TACATC CATT GACCATATCT AAGT GGAAAGGT GG CTGAGAAGTATAGG CTGT TAAAGAAAATGTAGCCTT TTAGTGATGCTAAAAATT CT ATTACTAT G GAAGAAGGAGAGAAAG GATATTATTGG CAATAGATGAAGTCAAACCATGTGAGAACTTGTCTC CAAGAGGAAATCCT GCCT
> H s l 5 _ 58890793 -5892 62 29
CT TATGAGAAGT CTGAGAGCGG CAGGAGAGGGAACTGCTGTCATTTAGTT CTGG CTCT CC CCTG CTGCTG CCCT GAGA CAATGTCTGTCCTCACACCTGTAGAGAATAGCTACGAATACCTTGGAGAAGGGGAGGGATAAACAGAGGGAACTAAAC AATACGAAGGGTTTATTT CCTGTC CTTTTCTC CT TCCAATGC CCACAG CT CATCTAAA GAGGGTTCTT CCTCATG CTA TGTCTCCCTGAGAGGTAAAAGGTGGAGACCAGAACTAAAGAGTTGTTTTCAGGAATGGAGAATACTTGACTGTGCTTA TTTGATGAATGAAT CCTCTACT CTAAGACTTT CC CACAGA CTGGGATCTGTATGTCCC CACAAG CATCAC CCTGGGGG AAAATAAGCTTATGTGCCAGAAAATATTTCAAGAGCACAAACACTAAAAACGATAAGAGA( N ) xAATGAAGGACAACA ATATGAGAAGCACAACTATTAGAATAGAAAGATCAACAATTTTAAAGTAGTTCACCATAATCATGGCAAGGGAATATG TTAG(N)XCAAGAAAACAAATACCAAGCAAAATTTAAAATTTCTAGTTCTAAATATTTTCAGACACTCAGATAAATTA GCATTAAC TTAG CAGTCACTGG C CAGAT TGGATTGACAAAAAGCACACAAATAT CAAC TAAATCATTCAT CC AATAAT CC CATTCT TACG GAGAAAGTACTG TAACATTAGTTACTACAGAC TCAT TTATAATTCAATTC TACCTG CTAAACAATT CTGAGTTACATAAATATGTATCTTGCTCAAAT ATTCTT AC CTGG AAGTGGTTTAGGAG GAGG CAACTTTG GATTACTA CT TG GAGTATGAACACTG CATAT CTTAATAAATC CAGC CATTAG CATGAT CAGAGCAATT CC CATAAGTAATACTGCC CACCAATGAGCCTAGAAATAAACAGATTTTTCCACTGAAAAAAAAAAAACTACAAAATATTTTAGAGTATCTTTTACA GTATATAAAACATAGAATAGAACACAGTATATTGTAATTTTTAAAGAACATTTGTTTAGGAGTTGTATACATATGTTA GATATTTGCACATATATATTATTT ( N ) xGAAACCGCTCAATCTGATTTTTTTTAGTCTCTTTACATATAACAACATTT GTCATACACATGAACACAAATACAAAACTGGGGTCATCTTGCATATAGTTATATTGCATTTTCCCATGTTATTAAATA TTTT CTGGTTTAAATATTGC CTTTGTTTTTTCCCCTTACTCTTTTTTCTC CCTCAAAT GGATAATGTG C CAATG CCCT AACAAGGTTTG(N ) xTTCCTCTTTTATTTCATTTGGAATTTGAGACTCGGACTAGTTATGAAATTACTTTTTATATAC AT TAGTTTT CAT TACATTATATATAATATCTT TATTGTATTCAT TAGT TTATAGGCTATC TTTT TAAATATGAAAAGT GGAAGAAGATACATATTT TAGCAG CAGG CAAAGT TCATCCCGTC TTAGAAGAAAC CAAAACTTC TATTTTAAC CTGAC CTATATTAAACAAATCAATTAGCCAAACAGAAAAACCTTCAAAAATTATTGATAAATTTATAGGAAAATTACTGTGTC TAGT CTTAAATTATATTTTG CAATTCCAAATATT CAAACTTTGT TTTAAGAC CACATGTACTAATAAACATCATA CAA AATATTTACAGCTTCAAAACTGAACCTTTACATTTATTTAACACAGGTTACTGTAACATATATAATTACTACTAATAG TTTCCAGCAAAACTCCCATTTAAGAAGACTGTCTATGACTTAGGGACTTACAACAAACTGGGTACTACTTCTACACTG TTTAGTTTTTCCAGCAGTGTTTATATAATTAAAGTAATACTAATGCTAATTTTAGCATTAATGAGAGTAATGCTAAAA TTAAAAGG CATGTTGTAG CAAAATATCC CAAG CCACTTTTTCTTTGTGTACACAAAGC CTTT CCTACTTT CTAAACAT TTG (N )xAAG TACCTTTTA AAAACCTGA( N ) xGAAAAACTTTTAAAAATCTG( N) xATGTATTCATTTATTTATTCCC TCATGGTTTCC(N ) xCCTACCCTTCCTATGTAGACACCTTCCTCATCCTACTTGCACTCTTGACATCCCATGCCAGTC ACTCCTCCACAGGGGTGCAATTCTCACCCTGCTTGGGCTCTGAAACCCCATACTGGTAGTGTTCCCCATCCCACACCT TCTGAACCTGGTCAGCTTCCAACACTCTGCTCTGTGCCACTATAGCTCCCCAACTTTGGCGTGGATGCCTACCTTGCT TGGG CTCACCTAATGGCT TTAG GAATTAAC TG CT TCAAAAAAGGAGAGG GAATAAAAAAGGGAAAGAG( N } xTCAATT TTAAAAT CAAACTTAGAATGATTTACA CTTGAGTGTCATTTCT CAGTCTTATTTAATT CT CAGATTAGAT CTCTATAC AG AAGAGT CCCACAGACAC CTTT C TTCT CTTT ATTTTC ATGTTTATTGTT CAAT CTGT AAAAACTGTAT C ATGGTTGT CCCTCACATTGCTTGTTATTGCAGGGACGGCAAACTCAAAAGCTTACATGGAAGGTAAGGTGGAATGCAATTCAACCT TACT GCTTAAAGA(N)xGAATTTAAAACTTTACAATCTACTTTCTAAAAGCCACTTC(N)xAGGTATTCCATAGACTC CTATTGTTTTTTGTTTTTATTCAT CCTCAAATAACAGACCTG CTA CTTAT CG CAAGACAGGGTAGAAGGC CACC CTCA GTCTGTCAAAGCAGTACACGACCATTTTGAGTAATGCAGAAAGTTTAGCATCCCTGGCCCCCAACCATCAA(N) xATC CATGAGTACAAAGG CTTAAGAGTTTTCCTACTAG CTTT CT CACGATGATG GG CAAGTT GG GTACAGGAGACACACTTC TCTT CCTAAAGT CATTAT CATCTT CAATATAT TTCCTTCC TT CATCTT TACTTT CATTTT TATGAAAATATGGC CATC AGTGTTATGGCTACCTAAAATATGGAACTGACCATGTTACTCCTTCATGTCAAACCCTTCTTCGAATGCTTCCCACCA AAGATTT C CAAGATAGA C CACTAT GTAGGAGAGAGGAAAAAAAG CAAAAG CAAAAACTTGAG CACACGATAAAGTACA GC AAAGTG TATATG ATT C C ACCTTTTTAAAAAGT AAAT AT CCTTACAGTGGTATAGACAGTTTC GAAGAATAC ACAAG AAAT CC AT CCTAGAAAGATTGGG GGTCC AGGGTT ( N ) xG G CCTACTTT AGAATGTATTACTTTG TCAACTAAAAAAAG AATAATCAAACTTTTCAAAAGAAAAAAAGTAGAAAAAAATGTTTTGCAAAATTCATACTTCTGTGACTTTTTCACATG CTCTGTCC CCATAAAATATTTCTG TACT CT CTACTTCTACTAAT TCTTAAGAACTTAG CT T (N) xAAATAAAAATAAA CAAAGAAC TTAG CTCTGGCGTTAT CTCCTTAAAG GCTTTT CCTAATAGGTTCTC CCAGTT CTGT GAAAACAGCA CACA GACTGAAGATATGTTTCTCTGGCCTGTCTTCTCTATCCCAAACTGACCTGTGAGCTCCCTGGACTTATATCCTATTAC CT CAGT AC CTAC CC AACAAACAAAAT AC CTGTTG AACT AGTT ATTACAAGTAAGTCATTATTTAAATATTGACCTTGA AACATAACTAACAG CTCATT TATTACTT CCAGAT TGCCTAACACATACAAATAATGTGACAAG G CCGT TACAACAGAG ATAGGACAGTAAAACTGGTTTGGATATATTTAATGATAGTCTTTTTAGTACCTACACACTC ÍN ) xTATAGATTTAAAT AAAGAGC CATACGAGTA CACGCTATGGCAGTAAGAGAG GCAGAT TGGAGACAGAAAGC CCAGATTTAGTAGTG GTTAA AACT GTAGAAAACAAAGGTAAAAAGAAC CTAGTCGCAGTTTAGAT CAACAGATGTATAATAAAATATG GCACTCAAGC CCTT CAGAA CTT CC AGTT C CTT CT AGCACAGTGTT CCTGG CTTT CTTCTAAATTTTTT CAGGTAACTATGTATCTC AC AAGC C C AATGCTTG AG ATGTTTTAGTGC CAATGATAGT AG CTTTTGTG AATTT AGCGTGTGG G G AAATGG AACATGGC AATAGACAAAAGA CAGTAAGAAAAA CTAAACAAAATAAATAGTG CATCTGAAAATATTAGTAAG TACGGC CACCTTTA TAGTTT TC CTTT TTAAAAAAAG AATG AGTT AAATGCTAAATTñ CATGCTATC CC CTTTAAATATAC TATAAAATCAG G GAAAGGAATACTAACTAGGGTAGTAAAT GTAAAATATGCATT T C TCTCAAAAGAAAAGAT CACAAATT GCCCACCTGT TAGTCT GAAAGAAAGGGGAGCTGT CATT CATT CAAT GAACATTTACTGAGCGTAAACTGAGTGC CAGG GT CTTATATC CCAGAGACAAGGGTGTTAAGCAAAñ CTTTAG GGACAAAGAGAfiGACAT CCAAC CATTTTT CAAAAACT CAAGAAGTTT CTCAACT CAGAAAATAATTCTACATAAGAAAAAGAGAGATGATGTT CAATACAG GCACTTAAA CATAG C CAATAATAA AGTAAAAAGT CAAAATTAAATAGT CTAATT TATTATGTATTAAT TACAATAGTATTTAAT TTGATAAT CTAAATATAA A CAAAAAGAAC CTATATGGTAACAAAATGACAAAGCAAGTAT CATCTAAAAAAAAGTGCATGAG CGAG GACAGGGAAG A CATGCACTCAAGT CT C T T (N ) xGGACTTCCCATGGGTCACAAGACCATGTGGGGCCCAGAGCTCCTTGCCATGATAC TGATTATGT CAT CC CTATAGGGGCAGAAAGTT TC CTTTCAATTT TT CAGAAGACATAATACTATAGTCTGG G CTTTGA ATATATACTCGT CCTATC CTTCAAACTC CCAC CACCACTAGCAG CAGCAG CAACAGCGATCTGC CAAG CAGAAAGCAA CTTGCTACTAAACTTTTGGTTACTTTAAGCTCAACCTGGGCAAGTAAGATAAAATACCACTAACAGGAACCTGAAAAC ATTCCAGTTT CAAAGT TAATATGCACAAGTTT TACTCTTTTC CATATAAATTAT TACATT TGAGAAAT CTGCTTCATT CATCTAGCAAATGTATACTGTGTC CTCACTATAT GCTGAAAACCTTACAATAT CATTATATCTT TGTTGTTAAAGTTT CATTAAGTGCTATG CTATGTGGTC TGTT CCACAAAG CAATGAAAC CAC CTTCTATTTTCAA CATAT CAAGATGATAAA ATGGAAAATTTCTGTTACAAGTCTTTTTAAAAAGGAGTCACATATCACAGAGACTGAAAAAAATGGCAGACAAACTAT TAGCACGCACCATCACOTA(N)xCTATTAATCAATATTTTTCCAAATGACATCTAAGGATATTTAGGACTTAGTCTTT T C A T C A T (N ) xCTGTTAAGCATTTGGGAAACAGTGCTCTAAATCTTTCATTCATAAAGAATACCAGATACTTACTACT TAGATGCATAAAATTATAGCAAAATATGACAGCAGAAACTAATTTTAAAATGAGAATGTGTTTGACTCTTGATAAATT AGCAATACAAAAATACAAACTCCTGATAAATCTGAACAAGTTAAAAAGAAAAAAGCCACAAAATCATCAGGCAATGAT TTTTAAAAGGAG GG GAAACACAACAATACATT TGAACAATAAAAGT CAAGGTAAGAGGTAGTTT TC TACAAT GTCAAA CCTAA CAT GAAAGATACAGAAAAT CAGAACACAT CAACTATAGAAGAAACTGGAAATGAATTAC CAATG C C C TCAATC CCTAATTAAAGAAAAGGAGATGCTGAAATAAGGT CT CAACAGfiTTTACTGAT GAATGCTCTTAAATGTTT CAAGAACT AGATGACT CTTTGATAAATCCAAAGGATGTTTAT CATTTTAT CCTGTC CAAACACACAGTAGG CACTCAG CAAATACA GTAAAT GTATTTGGAAAAATCAAAACATAGTTGT CAACACAGAAAAGAAAGTAT CACTTC CGTGGTAC CAGTAGCAG C AGTCAATAAC C CATGC CCTGAAGTTCTAAATTGTTAAGGTTCTT CCTACTCTACC CTGCATGAGGAGAAAAG CTGCCA TAGTGTAACACAGAAAGAAGACAAAACG CCAC CAGG CACCAC TG GAAAGGAAAGTAGTGAAACTAACC CCTCACTTCT TTCACAGC TT CT TACACAAGAGCAT C CTGATAAT CTGTATTACTAACT GGAAGGGCACTT CC CTAACATGATAAATCC CGGTATGCAAGTAAACTATGCACATAAG CAAATGAGACAAGTAACTTGTAAAGA CCAGATGTTTATTATCTTACTGCT TGATGTG GTTAATG CC CACAAATG CA CCAACAG CCTAGCGGTACTC TT CACACATTATGAG CAT GCTATT CGAAAGAC TGGCATATGCCTGT CAA CAATGATATAAAATAAAGG CATGTT CTAT CC CA CACACACTTAAAGT CAACGT CTATGAAC TTGAAAGTGT CTGATTAATTAGTT CCAGAG CACAAGAAAGGACTTCATGTTG CCAATTCATTGTGAAACATTTCATTA ATCCTCACAT CTAAAATATGG GGAGAAAAGGT CT TAAAAATACTTAAAAGAT CATGTCTATTAAATATTTTTAAAGTG AAAATAAATGGGCTTTTGAAATTCTAAATAATGTAATGAGTGAAAATTAGAGATAACAGAGAAAAGTAAAAAAATGCT ATGTTTAGATATGACTTAGTACCCCAAAAATCTAAAATATCTAGCACAATATATACTCAGTAGAATGTAATTCTGAGA TCTCCTTTCGTTTTCTGTATACACTGTCTTCTACAGCAGACAAGTATACCAAATTTCAACTTTTCTTATTAGAGACAA T CAGAAAAAAAAATAG CTG AG TTG ñ(N ) xAGAAGACAACTTCAGTTTGACAGGTATTTACTAGAACTTACCACGCAAC AAGCACAG GAATTTAAAAAAAGAGTAACATGTTT TAAGGAAACCTAAGTTAAG T CAACTG CT CAAGTATTTG GTTTGA AAC CTAGG TAGATC TC CTAAATC CAATT TCAACAAT CCTTCTACTC CATGAT CATTACATAT CTAAAAAC CACATATC CTAATGTGTC CTAG GAA CGGTAAACAT CATTTTAAATGCTG G GGTTAACTATAAAATTTT TACT CTAGAATGTTATTA GAAGAGTACAAAAAAGGC TGA CAT CC CACACAAACAA CATCTTTTGTGCT TTAAGGCATC CATAGT T C T G A (N ) xT A A TTTGAGAAAG GGGAATGTGAGTTATAAGTCAATTT C CAAAGCAAACTACATT CAGTTCAATTATAG CACATT CTCAAA
t a t a t c a t t t g c a a g c t t c c t a t a g a t a t t t t a a t g t t a t c a c t t t t a a a g c a t g a a c a c t t a a g t t g a g a t a g t c a a TCTTGACAATCT TTTAAGAAAACAT CAG GTAATT CTGTGTTTGTA CAACTTG GAGATATGTGGG CTTAAAAATAAGCG TTCAGTGGATTACTGCTGGATGAACATCCATAATCATTAGTAGATCTACAAAAGGCTGTGTATCTGATCTTATTCTGT TCAACATTTCCTATTñCAAACAAGATAAAGA(N)xGAAAGATCAGGAAAAATCACCAATTAGTCCACGATTTGGGCAA ATAAGTTGT C TTGAAAAACAACAAAAAAAAAT GT TTTTTAAG TC CCAT GAAAATGCCCTAATAATATG CACCAAAAGA AGTAAC CCAT CCAGGAGATGAACTAAGGAG CACGGTAAAACAGAATGACT CC CAT CACCTA CTAAG CAAT CTATGAAT GATACACATGATTCTGATCACTTCAGAAGGAAATAATAACCAGAAATATTAGAACTTTAAAAATCTTCATATAAAGTA ATTCTAATATAAATAGTTAAGTTGTTTT CT CGTAACTTTGATGACTTCTGGC CAAGTAAAATTAAGAT CAAACCACTT TAAGTTTTCTTTGTAAATAAAAAACATACTTACCACAATCCATTCAGCAATGTTTTCATAGAGCTCTGGACTAAAAAT TGCTTTTTTAAG CCTAG CTAGAG GAC CATCAG CATCTACTAATCTG CACCG CATGAAAACAT CACAGTAACC TCTAAA ATCGTTGCAAGGGGATCCAGGTTGCAGGGTGATGGTTCGACCACTGAAGTGCCTACTCCACTGCACAGACCCTGTACT GGCACAAG TTGATGGGTC CACTGGAAAAGAAA TG CCAAATATAAGC TGAAGGT CAGATT CAAATATAAGT TGAAGATC AGATTT CCAGTTGT GCAAATA CAAAGAG GGAAAAAAAACTGAAAT CATTACCATAACATGTTATACAGAC CTTCTCTA GTTAGGCTGGGTAACGCCATCAGTAATTCATATTAAATGTTACAATGTAAACAAGCAACCAGCAAGTCTTGACTCAGT GTCCACGATGACTTAAAGTCTAACAGTGTTTATGTAAATTACTGCTGTAAAGCTGGCAACGTTTTGTCTAGTTTATCA TTT TTAAAA CATTGTAGAGACCATAATAATAGAGACACAATG CTACGT TACATTTATAAT CT CCTCAAGAGGACAGAT TTAGCTGGAAATTATCTGGCCAAACTGTAGGTCAAAGTTTTACACTTAAAATTTAGGAGAAAATAATAGTTTGAGGCA AATTTT ATACTC TTGT GT TTTT AAAAGC CTTACT 'i’TTCTTCATACAGC AT AC AT GGCATAATTCTT TATC AT CTTTGC CAT CAGAACTGG CACACGTACACT CCTCTAAG CCATATTTCT CACAGATAGAAC CTGCACATTG CTAGAAGAAAAATA AAAGA CAAATTGTTTAAT C CCTATA CAGTCAAAC CTATTTTT TATAGTAACAGACAGGCAAGTATGAG CTTC CAATGT CAAAATATTTAAATGGAAATGAAAGTGAAAAACAAACAGAGATGAGTATCAAAACTGAATTACATAAATACATTTGAA AAAATTTGAAAAACAG TAAATCCAAGAAAGATAGTAAATCTTTTAAAG CAAT GG CAACTTAAAAAAAT GAAATAAAAG CA3AGCAA3AGATTAGATGAGGATCGATTTTATGTTGGTTATTACGGCTCTTCTATAAAGAAAGAAAAATTAAATTTT TTTCCTGTAGAGCAAAGTGATATTCAGCCACAAAAGCTTTGAAGGTTTGATATTCAGAGAAGATGACCAAAGAGTTCT TTTñTTTAGG CTACTG( N) xGACAAATCACATGCCGAGTTGTACAACTATGTTTCTCTAGAGAATATCAACCACATGT ACATGGAAAGATGAAAGACAAATTAGTAGCTAATTGAATGCCACATAATATATTCATGGTTTTGTGAGGGAATATTAC TTAAACAGAGACCAAGATTAATTACTTTAACTATGTAGCTTTAAGCATCAATAATTCTTCTGAAAAACAAGTTTTCTA AAATTAAATTTT AAAACATATAGTTAAATG CTTACC CCATTAATGCACACTTGTGTATGC CT ATTACAGT CTGTGAAG TTTGGTTTAGGGTCAGATGCTGGG CAGAGAGCTGTGAAGCCATTACATATTC CT TCCCTTGCACAGTCTGAATCAT CC CGACACTTCTCAGACTTTGACTTGAATGCACACTGTGCTGTACAACAAGGACCTTGACTTGGACTAGAGGAAACATTA AGTGATTAAACAAAGTAAATCACTAGTTATTCCCTATATACTATTAGATATTAGATTAGATCCACATTATTAGACATA GTAC CAATTTTTAAAT CCTAAGT CTCATCA CTTAAATGATAAGTATGTTACTGGTATACAACTT CAAAAAACTGAAA( N)xCCAATCAATCAATCACTTAAAAGCTACAGAAGGAGAAGAGATTTAAAAAGGCACAGAAAACCTTTGTTTTGGCTG (N)xAAAAAAGAAAACCTATTTflTTAAAATAATAGCTGAAAATGG(N)xAGACTTTATTTGTCCCTCATTT(N)xGCC TATTTTAATCTGTT CTCCCTCTGTAG CTTTTAAATTACTTATGTATTC( N) xTACTTCCTGATCACCAGGTAAAATAA GTAAGAAAAA( N} xGGAATAGGAAGCTACTAAGCCAAACAGTCAAGAAAAGTGTGCGACTTTTTTTTTTAATAGTCAA AGCATACTACATTC CACATT CTTG CC CTGTTCACCTTTAGGAAAAGGC TACAATGGAAAATGGATTTAATATTAGG CA AT TTATCAGTATTTTTT CAG TTACAT CACACATTAG TTAATTTATCGCTTT C TCTCCCATTTTCATACTAAAAT CAAA CTAGGATATACAGACCAGCCGTTTAAAACATATTGAACT(N) xGTATTGTGCAAACAATTGCTTACACTGCTACCAAA GTAAATTTGACAAGTGAAATAGTTATAACTTGCCAAGTTAGAAAAGTAAATGGATTCCACATTAACTAAATATTTAAA GTTAAGATTTTTAAAATATGGAAGATAGTTACATTTTCACAATAAGTAACTAAAGAATATGTA(N)xAAGAATATGTA TT TTTTAG GATTCTTT TTAGTATG TC CTTTGAGTAT GT CCTTTTGTTAACATTCACCACACAAATTA CTATTAAAAGT AG{N}xGTAGTTCAATCTGCCAATAACAGAGCAGTAACCAACACTACTTCAGAAAAAGAAAACAAACAACAGATAAAA GCAAAGTTTCAAAACCATAT GCAT CT CTTAAT CTCAGTTTTAAATTTG TCATAACTGCATTGAGGTAACTTTACAAGA ATGTTAACATCACTGAAATTAGCAAGGTACCTGCACTGTTTCCCAGGTTTCAGTTTGCATTTTCTTCCCTCTGGTTGA TTTG CATCGAAGCAGCATTCATCTTTACA CTGGTCACTATAG CCA CAATCACATTCTT CAC CTTGTTCTACCATTC CA TTTCCACAAATAGGTTGGCCAGATTCTGAGGAAAATAAACAAGACAAAGTAAGCATTGTGTGCACACATTAATTTTAT TTAAAATT CACAAATT CTTTAGAT CATCATAACAAAGGGATAAAGGAAAACTTTGA( N) xTCTAGAAGCTGTAACTTG CAACTTAG GC CATG CT AC ATTGAG CTTTATTG ACTGTT CTATTCAT AT CACACAAATACAAAC C AAGTTAGTGCTT CA CTAACCAGGAAAAAAAAAAATTAAGC CTTTTTTTGCAACAG(M) xCAGACCTTAATGGTATTGAGCCAGATCTTGGTT GATTAAGATTAGACAGT CTCATCTAAGCATAGTGAATAATTTTAGAAAATTT CTT CAG CAGTCTTCTAAAG TTCTT TG ATTTTTGTCAATGGAATTTAAAATTGGAATAGAAATGCTCCTCTTTGTAAG( N) xCAAATCGACATGGAAAACAGTCA TTGTACAG GCTTTAAG TTG C GAATTGAATAAG TAGAAGG GAACACATACATCTCAACACAGGAACCCATACATCTCAA CACAG GAAAC CCACATGAGG CCTATCTAAATCAAAAATGTTCTAGAAAGACTGG CTTTATATTCAGATAATTATTT CT TTATGGTAAATTATCTTCCTTTATTTATATGTGCAATATAAAACAGAAGCTTGGCACAAACATATGCACTCACATCAG CCCAGAAGAAAGGATCTGGGACAAAGCCACTTGGCAACAGGACTGGGAACCCAGACCCTCAGGGCACATAGGAGCCTA G CTGAACC CTGATTTCAGGG CCCC CAGGGATGTTGCTGTGGAGGCACC TGCT CT CCAGAG C CAGACTGGC CTAGATGA AAGCAGAAAGTTCATTCTTAACCACATCGACTATTTTTTAAAATCTTGGAGGTCATCGAAGCTTCATCCCTGTTTCAC TT TAAACCAT CTCTTTCGTT CTCAAAAACCAGAAGCAT( N) xACTAGAAACATGCATCTGCACATATATACTAGGTGA ATAATGATTTTTACAAGTCTAGTAATAATTTTATAATTCACATGACACGTAATTTATAAGAAAACAAATAACGAAGTC AGAAAATGTTACATACATTATTTACTAC(N) xGTACTTCAAGTTAGTTATTTGCTAACAGTTTAAACAACAATGAGGT ACTG(N)xTAAAAATACTGTTTAAATGCAAAAATTATAATAGGTACACATTTTGTTATAATTTAAGAAAAAATTTTCT TTCCTGTTAATACCTACAAAGCAATATGTACATTTTTAAAAGTCAACTTATTTCAGAAGGTTTACTGTGAAAACCCAG TTTACAAATGAAGAATCTTAATGTACCTGTACATTAAGAAGCATTAGAAAAACAACCACTGACAATAACAATAACAAA AATGATAACATCAGTT TTTGAAGATTGAAGATGTAATTTCAATCTG C C CTTTTAAAAGAAAAAGATAATG C (N ) xCAA AATC CCATGCTTAT CTGGAAATCTTC CTGAATTCTTTCTCCTTTG CTGTCACAT GTTTAACCCCTCTTT CAGTG CT CT TAATG(N)xATATTTAGAGTATGTTCAATGCCGCAAGAAAGA( N) xTACATACCAACAAAACAGTTGTTTCTCTTCTT CTCAAGAACTTGGCTTATATTTCTAATACTACAGAGTGAGAATTTATTGTTGTTAAGTTTGTCCCCAGATGTTGCTCT TG CATACATGATGTAATTGC CATTTT CTTTTT GACC CAAATT CTTAGATTCT CCTGGTGTGCACTCTGTT CCAGAATC ATGCTGT CAAAAAGAATATAAAGTTACATAAACAGGAAGTAAAATAAG GTTTTCAGGT CAACTGATTAAACGTAATGT T CTCATCAGGAAAACAATGGAAG CTTGTGGACT CTC CTTTAAAAAGGATCTTACAATTGT TTTACACCTTGAG TAGGA AAAGAATTTG CTCTTT TAGGGGT CACTGTCTTAGAAAAGGAGGAAAAT CAGTGGAGGT CTAGC CATAACAATTTATAG CTTTGTAAGC AGGG ACTAGAATTG AG GACAGACTGT CATAAAAGTCTATTAAAAAT AA GCTC AC ATACTTTTAAAACT AGTAACAGTAAAATAAACTACTTTAAAACTACTAAATAGTCTCTAACACATATGTTAAATTACATGTTATGAATAGTT CAGAGAAGACAGAGGTGGGGAAAGACAAAAGAATCAAGAGACTGACATGGGGCTAAATACACACAGAACAGGAATCTA AGACAGCTAACTTAATAGCCAATATATAGGAAAGGTAAGTAAAAATGCTGACTAGGAACAGAAACACAATGATATATT TTTAAATGAAAAATGGTAAAAGCCTAGAGATAAAAC CAAAATACTG CT TTGC CAAATAAACAGAGTTCTGTGTACACG TAACATATAAATGCACATAAGTGGTGTTCGGTTTTGATAAATTTTATTGAAGAGTCAGATACATACAGCATAGCTCCC TGAATTTT CT CAAAGTAATG CTTATGAGAC CAAGCAACAGAACACTACTAC CAGATCC CAGCAG CCC CTACCAT CCTG GG GATATG CACTAAAACAA C CCTCACAATG CC CCTCAAACAATTCAGAACA CAT CAGAAGGGGCAAAAAACACTGGAC AC TAGAGTAATTGTGCTATTTAATTAAAAT CTTGTTTTAAATACCCAACAAC CC CACT CT CAAGAAAGAAAAAGAC TG CCCAAGGACGTTCTGTTTTTTCCTAGAGCCCAACTAAGTTACTTTCATTTTCTATTCCCTCTTCTGAGGCAACTGTTA AATTAAAAAATGTGTCCTCTACTATCACCTTTATTCTTCCTGATACCTCTTGCCTTTCTTCTGAATCTAGATGCAGCA TTTAG CAACT CTATACTATT CAC CAATATTAT CTTG CTTTATGTTTTCAAC CTATAAAATTCAACTCAGTTTCTGAAA AAGGTGTAGCACTAGAATTATTCTCAGCATTAGAGTTTTAGAAAGGAAAGAAAAAAATCCCAATTCTGTATAACTCTT GATGTCATCACAGAATAT CTTGC CACTT TT CAAT TCAGTAAATATTAATGAATGTCCAGTGC TATAAAAGGAG G C ACT AAAATATAAAA(N)xTTTTTAAATGATAGCAAATCTTGACATAACTGCTTAGTCTGACTTGCTGCTGCATAAATCCAT TACAGATATTGCTAGT CAATTATTATTTTACT AGG C CAGGAGATATTACAATAGATCACCATTAA CAG GAATAGAAGG GGAAACAGGTTGTAAG CAAG CATGCCAG CTATTTGC CAT CCT CAGATAACGAATAAAACAAG CACATGAATAAG CAAA ACACAAAACAGAACATAAAGGCCATCTTGACTGGACTCGCACAAGTATTCAGAGTTCAAATCAAGAGAAGCGTTTCAC TAAGAAACTATAATCTTTGTGACAAGAAAACAGAAAAATTTGTCTAACTATAAGAA{ N ) xTTAAAACTTTTTGGTATT ATAAATAAAA CTATGATGGT CATATTTT CT TAATTT TTATTAAGGG CCATAT TCATTTACATCATATAA CAACACGGA TCAAGATTTCAACAGGCAGACATGGGAAGGAACTGACTTCCTAGCGGAGGAAAGTTTTAAAGGTCTGTGAATTAATTT GACAGATGAG GAGAGATAA CACTAGAAATATACT TAAACCCCAAATTAT CAAACACCC CAAACATTAAGATAAG GAAA TTGTATCAGCAATATTAGAAAGTAGTAGAAGTGAGTTAGTAAGCAGAAGTGGCATTATTAGAACTGTATTTACATGAA GGTG AATCTG GTGA GAGT AG GT ACAT ATTA GAAT TTTC A (N ) xC AG AT AG GATAAATACATTTAAG AAAT AATGTG AA CCACTCAGAAAAAGTAACATTGACATTACTACAAACGTATAAGAGAGAAAGAGATTTCAGATAAAGATGTTAAGGGAG ACAGGTTCAGGTGTTTATGATTCTGG CTGATCTTACTATTAAAT TACT CAACAAATATTTATTAAG C A T T G (N ) xACG ATCTACTACGGATACAGATGGCAGAATTAACAACATGTGCAAAAGAATGGGGTACGATTGAGCGTGGTAGGCAAAGAG TG TGAACCAATT TCTC CATTAAACAATAATATAACATTAATTGAATAAAAACATCTCACTGG CAG CTAGGTAGGAAAG AGATTACACAAGAAGTGAAAAGAGAATGGTCTAGATTCCCTACTGCAGTGTGGTTCACAGATG(N ) xGAGTGACCTGG GACTAAAGG C CAGACAAGTAGCTGGAG GAGTACTAAAATTCC CC TGGTAAGG CCTGGAAC CAAGATGACTTGGGGAGA AGAGAGGACACTTATCTG CT CTTGAAAATG CAAATCAACTATGAACAATAATTCTCTG CT CATCTCTAAAATTC CAAG TACT CTTTTAAGAGAAGAT CA CAAAAGCAGTCAAGAGTTACAGAGATTAAGCACATTTTAAATACC TTATC CAAGAAC AT TAATAATAAT CTAT CAATTTCTGATT GG GATGTAGAAGAAGAAAGACT TGCAACTTAGTGAGATGG CAC CTATT TT ACAAACATGGGAAAAATTAGAGAAGTGCATTTGGGGATTTTAAGCTACAAGTAGACAACTTATTTTTAAAATCATAAA AATTTGCACTTTTGCAAGTGATAAGGTGAAAGTAAAATGAATTTACATACAGTTTTCAAGTGTACATTTTCAAAATGT TATCTTTAATAAGCTCTACCAAAATCACAAAGACAGAGTATCATTCAGTATTTCATATTTCTGTGTTGTACTGATCTT TAAGGTCACTTAAAAGATCAATATAATG( N ) xGATAGAGTTCCTTTAAATACCTTAGCAAAGAAAAAGAAATACAGAT AGAAGATATGTCAAAATCTTCACAATTATTGGCTCTGAATTAGGGGTTGACAAAG(N)xTTCTGAATCATGGGCGTAT AACG CTTTGTTG CACTAT TTGTT CTACT TTTGTGTATGTTAGAATTTT TT CCTAATAAAAGATT TTAAAATAT CAATC T CATTCCACGGTAGAGAAAATACTTTAAATTAT CAATCT CATTC CACAGTAGAGAAAAGATTTTAAAATAT CAATC TC AT T C CCTAAAAATG CTTTAACAGAAACAAAACAAAAAAAGAATTTCTCACAATTTGCT GAACATTCAAAATGTACTTT TAAACAATGAATAATG( N } xCTAGCACTGTTATGCTTACTTTTCTTACTTTAACCACAAACATTCTATCTTCCTCCCT TT CAGTATGGTCAAATA CATTACTGTGCTCTGACATAAAAACTAAACACT CGTAAGTACATTTGTTTT CACGGTGAAT TT TAAATAAATC ACTC AACATAAAAATT AG ACTAAT AAGTATTTTTTAAG CT AATAATTAGACAAT ACTT ACTG GGGA TC CAAAGTTATGTC CAACTT CGTGAG CAAAAG TAAT GTGAGA GACTTTGGGAGGTACATGAGAC CCATAGTTCTGAAC AGTAATAATTCCAGTGTTTAAGGACTTCTTCTTACCATCTGAATAGAGTTTACTTTTTTCACATATTCCTCCAGAGCT TCCTAATCCAGAACAAAAAAATGGCTAAATTAGTATCTGTTTCCCCTTACCCCCAAAAAGGTAATACATGCTATATTA AATG AAAACATT TTAT A TTTT AATCT AAAATAGAACTGGTACTCAAACGTAT C ACAT ATG CAAAAGTTTCTTTGGT CC AAGACTAAGAAGTCACTTCATATAAAAGTTTTAATGACATTCAGAAAGAAACTCCCAGGACTGAGTACACAGCCTTTA CAAAATGTTCTT CACAAT CAAAATGT CC CC CAAATG TTATTC CCAT TAAT TTAGGAAAACAAATAT TTACATTTCCTT TCAATTTGAATATATCAGTATTTACACTAAGT CAAAGTGTACAT CC TT TCAT CTTTTAAAAAGAAATCTTATGATT TA AGTTAAATGAAAGTTGAACAAAATAT CC TCAAGAAGATTCTGAGAAAGT CACTTCTA CAAGAAG CAGTGAGGCACATG AGCACATGATTCCCTGTCCT GAGATATGACATA CAATGTATTTGTAAAAATAGAAAATAATCATGT CAAGAAGGTT( N ÍXAAAAAAAAAGAGGGGGTAGAGGAAAGAAGGAAAAAGAGAAGATGGGTGGGAAGGAAGGGAGAGATAGAAAGCAAAT GCGACAAAAG GTTAATAATTGATTAATC CATCAAGAATT CA C CACTG
>H sl6_15314189-15328734
CATGAGGACAAT CACACC CACCTTACTG GGAGACTGGAAGATTAAAAAAGAAAAATGACATAAACG(KT)xTAGCATGC AGTCAGCAAACT TATC CC CTTCCGT CACAGTG CCATAACTCAGT CTACAGA CGAGTAACT CGGGGAGGGGGTTT GT TT GG CT CAT CTGTAAAACAGGG CTGCTGGAAG CC GACATTGCATAACC CATATGTACAACAACTGG CACACAGTGAGTGC T C AG CTG TT AATAAAG GG AAG G AAAAGAAACT GTñAATCTGG CT CTGTTATAG GT CCTGAGGTT AAGC AAflTAG AG AG ACAGTTATAAAGATAAGTAAGACAAGAGAGCCAAATGTAAATAATAATATCAGGTAATTAATTAGATGTATGAACAAA GT TCAAATCA GTGTAG CTTC CAG GCTATAT CAAT TACCTCGCTTAA GC CCTCTACGCC CTTTACAACACAG GACTGTC ACCC CTGCCCAGTTTACGGG CCAGCCAAGG CGGGTT CAGAAACAGCAGAGAATGTCCTTTGGGCTTACGCTGTTTGTG ACCCTGCTTCTTCT CGGATC CTAGTCCATCTT CATCTTTTTTTT CC CAAAATGGTTTT CT CATT TCTG GCACTGAAGT TGAACAAATTCAACATCTGATTTTTCCTTTCCCTGTCTCCCCTCTGCCCACACCCTCCCCCAGTACTTCATCTTCCTG AGGTCGGCTCGATCCCAACTTCATGTCTGTCCCATCCGCCCTCTTCAATTCCACTCTCCTCTCTCCAGTTATGACCCC TGGATGCCTGAAGAAGCCTTCTAACTGTGCTCTCCACTGTTCAGCTCTGCTCCCTCCAATCTGCCCTGCATGAGCCCC TGAGGGATCTCACCCTTT CAGCAGGT CCAC CTA CAAG TATAACC CCTCTAGG TGACATACCTTCCTTCG GGAAG CTTC TC C CAGCAAC CACTGACC CACAAGAATC TCTGCTTTCCT CAAACACAAGTAGACTTCG CAAC TC TG C CTAC CTGTTAG AATAA CCTGAAGAAGCTT CATAC CA CAC TGTT C A { N)xAAATAAACCTCCCATTCCTATCACTTCCTCATTGTTCAAT ATGGCTTCATATATTTCTT CATAAATAA( N } xAGTGCCAGTGTGTGAATATATGAGTACCTGTAAGAATCAGCAAAAA T(N)xCTGATCCACAGCAGCTGTCATCTATAACCCTAACCCTGTGTCACCCCTGCAGATGAGGCTTCAGTAGCTCACC ATTACCTCTACAGACATTCTAAATTCAAGTTAAATAACGCAGATGAAGGACTCAGGTGAGTCTGGGACTTTCTGTATC AGAATCGTCACAGATGCCTCAAAAAATCAGATTCCCAGACCCCGTTGGAGACATTCAGATCCTTGCAGñCCCCTGCCA G G GAAT TAATGAGTTC AGGTTC ATTC CACATG CAGATTCTCG GGAC TCCCTC CAGGAAAT TAAATTGAGATT CC CGGA TTTAACCCCCTGGAAGGGGTCCCAAGAATTCACGTGTTGAACAAGTGTCTTTTCTCAGACACGCTGAAGGTAGGGATC TACAGAGTCACATTCATGCAAGGTCTGTGGCACAAGGCCTGGCTCTGCCTGCATCTCTGGCTTCATCTCTCAACATTC ATGCCCCCACCCTGTGCAGGTCCACCAACCCCGGCTACTCCACGTAACAGAAATGGCCTTTCCCTACTCCTTCTCTTG GCTT CT GCAACCAG CCT C C CAACCTT CACTCAGGTATCAAAGTG GCAAGCTTTT CTGGGTTCCG TGTCTATT CTGAGC TCCCAAGCCTCCTA( N)xACTGAACTTGAAAAACTCTGCTATAGTCTATTGTCATGTGAGTTCCTCGAAGGCAGGGAA TGTGTAACACTCACCTGCATAACCTCAGATACCACATACAGAGCCTGACACACCGT( N ) xTATCCGTACCTTACATCT GG CATT CAGGTGTGTT CATTTC TGCCCTATTTTATTC CAGAAAAGACGTGTAGATACTTAAAAAAACTTACT CAATGC AACAGGATAAAATCAGTGAAGGTATCGTAGAAGCAGCAGCTATGATGAAGCTATAGGTGGGTTGAGAGCAGTTATTAG GATTAT TTTTATAG CCACAAGGTAGAGATAAG CTGAAATTTGAC TCTGGGCTTC TGGTTG GCAT CCATGCAATAAAAA TGTC CCAGTTG CTCAAGG GAAG CACA G CTCTCTCTAAAGTTGAGAGGTGTGG GG CACTGGGC CATAGTGAGAGTGAAG TGGACAACCC( N ) xATCTGGACAACCCTTTCAACAGCACCAATACACCAGGAGTCTTTCACAACTAGTTGTTATAACT TC CT CTGAGTTCTTTT CCTCTT CCAGAACTCA TTCTCTTTCTCTTT CCCATGTAATTTGAT(N ) xGTTTCTTTCTATC TCTTAAGTTTGAGCTTGGCCACATGACTTGTTTTGGTGAATGAAATGATGATAGACATGATACAAGTAGGAGCTTTAG ATGTGTTTGTGCAGCTGGATGT G AATTCCTGAAT CT CTCCCATC AC CATGAG AAG AGC ATGC CC TGGCTAGC CCTCTG GTTCAAAGAAGGTAAAAAAATACCTGGAGTCAAGCTCAT(N)xCCCATTATTGACAATGGGTAACCAAAATATCTCCC AATAGCAAGTTAAAGAATTACGCCAAAGCGTGGCTAAATAAATAAATGAAAGCAATCTTTTGGGGGGCTAAGATAGAC CAAATTATTCCATGTTACTTAAGATTCCTCCCAAACTAAGTAGTAACTCCTTGGTGATAGATCATATTCTTCCACCAA GTGACCACCACTGAGCTGGGCACATAAGAATGCTTGATGCCAATGTGTCAATCAGTATTTGATTGGGAATATAGCCAA C C CATTT CAAGAGAAATG CTGT GAACTAAAGT CAAAATGTTGGCAACACTGAACAAT CAATCGACT GAATTG GACAGT CAATAGAT TCAGATGCT CTAAATG CCTTCCATTATAT CCGAT CTGC CAAAG GTC CAAAAACCAATGATG CAGTG GCAA ATTTCCTGGTCTCATCTCCCACGCAGCATCCACTCTCCCCCAGATGGTTGAGTTCTTTCTGGTCATCTTAGATCCCTC CAGAATGGATTTTTTTTACCTCTCATGCTTGCGCAAGGCTCAACTTCAATTCGAAGAAAGCCTTCCCATACTTCCTTT TATGAAATTACAAG CCTGGGAT CC CTACCTTGTCTG CTCAGAAATAAAA CAGACTCAGCC C CAT CTACACTTGAGTTA GAGT TGAGAGTTGCATA CTCTCAACT CCTCATGTGG CTGAGTTG CTAAAGGT G T { N ) xCTCTTATCACTTGATTTTGA AT TCAGAGAGTAGACC CATCACAG CAGAGCTACAGTGGG GTG GAAAGGGGAGGTAACGGAGGTAACAGGACC C A C T A ( N ) xTAGCAGCTGTGCTAAGCACTTTAAATACACTCTCTCATTTAATCCTCACCACCACCTACTTTGTCACCATTGCTT CTATAACAGGACAT CAAGTAGGAATT CTCCAGTTGTTCTGGATCAATGACTACAGCAGACAGTT GGTACCTAGC TCAG GGTGAGAGATCC CCAAGAGAAATT GCTCCAAAAGAGG GTGAT C A { N ) xTGTCATGGTCACCAGCCAATTATATGGTCA CCTGCCATGGAGCCAT GGGGGCTGATTGCAAGAAGGñGGGGCTAGCATCTCTAT CC CTGAGCTG CAGAG C CATGAG CC CCTCTAGTGCGATCAT CTGAGTAAGATGGAGTGAGGACACTCTTGGAAAAG CTGAC TCTAAG CAAAG CTTAATGTAGG AAAAGGACAGGGGCAGGCATCTCCCTCTCATTCCCTGTGCCCTAAGAGTGCGTCACAGATGCCTGTGATTTCATGGTA GAAGAGGAAGCCAGAGATACAGATGAGAGGCTATCCAAGGAGCCACTAAATGAGGGGCTTGGGGCCACAGATGATTGA AAGGGG CCAAAGGGAGGCAAGG CTGGCCTTCC CCTAAGGGTCTCAGTGACAGAT GGGGCACATATGACATTGAAAC CA AGCACACGATCTACAC CAGAAC CATCTCATTTGCAAAGGTTTTGAG CAG CAAAACCAGTGTTACACAG AACAAGAT CA GAAATGCCACCCATAACTACGACCTTGCCAAGTTAAGTAGAAATAAGCCTAAACTCCCCGACCCACTTTCCTCCCCAT CTCCACACAC( N ) xTTTAATAGCTGAAAGTGGCCAGAAAGTTAAGGGACCTAGCACCCACTGGGGTAGAAAGACTGAA CATACAACTGATAATAATATTTAAGAAAAAAAACAAAACTAGTTCATGCTTGTACCCC( N ) xTGTCGTTTAATAGACA CT TGGT GCTGAGTG CTTTACATATTTTCCTTTTAGGGAAAACTC TGAGA CACAT GGGCTCAGGGGAAATATTTGTT GA CT TG GATG GAAAAG GCCCATCCTCAGCTTTTGTTTTGCCCTGTCCTGCTGTCTGT CTGTGTTAAGT T CAATG CCTGAG GCTTTG C CAACTTATATT C CCTGTAAGGGCAGAAATT CAGCT CATGTAAAC CTCACAGCAAT CCTTATG C C C A T ( N ) X TGCCTTATTGAGCTGTCTCCTCAATTTCATCTTCACCAGCCCCAATTCTCAGATGTCCCCTGGGCCTCCTCAATACCA TGTTGCAGGCCATAGCCCCAGGCTGCTAAATGGCACGCAGCCTCTGTGGGAGAAGACTCTAATTGTGCCTGACCACAA GGTCTG CAGGGAGTTAAT AAATTATT CCAAGAGAAT ATTTGC ACTG AAGG CCTT ATTGACTG GAGT C C ATTT CCTGAG TTGG CATAATTTTT TTTCTCCAGCAATTGTTATAAAGCAAGT GCAAGTCTTGAAGC CTCAACTGTAGAAT CT CTTGAT TATCACCAAGTATATGGGCTTCCAAGCCTACCTACCTTCAAAGTCAAGGTCATCACAAAGGGATAATCCCTCAGGCAA AAAGAAAGATGAGTTT TAGGGGTC CC CTGGTT TC CATAGAAAAG GCACT C C A ( N ) xAGAAAAGGCACTCCAGTTCCAT TCTATT CTTTATGGAACTATTCATAATTTACAGATTAATGTCAC TTTTCTCCCTCCCACCCT CATCTTTC TC CCTAAG AAAACCACGACTG(N)xCCTCTCTAAGGCTGAGAAG(N)xTAAAATTATTCCAAAACAACGTGGTGATTTCTAAAAAT ATATGAATTTCATACACGAGTATTATGACATGAA CAAAAAGCAATTAAAGGT CAT C { N ) xCACTCCTCCAACCAGGGC AA CT AACC ACTCTGTACCTACT CTGA GG ATGG CT TTTTTATTTAGTTATTAGTTTG GG AC ATTG TT AATGTT CTGT AA AG AC AAAT ATAAAACAAAATTTGAAAA C AAAAAC AG AAAGAG CC AC AATATT GGGC ATGAAT AATC CAAATC TG AG CA CTGTCACCACGTGCTCCT CTCAAATT CAGGGT T T T A A ( N ) xTCAAGTTCAGGTTTGAGCAGCTCTCACCAAAATCCCA CTGCAGTAAAATTCGCAGTGGG CTTCAAATTCAG GTCCATCT GAGT C CAGAT CC CAAAAG CTTT CT GACAGATCTT CT TGAGGGTGACAAAACAGAAATAAAAGATCAGAATGGGAACTTGCAGTTCCTCTTCTGTCCTTTATAGAAGTGTCAACG GGGATG CAAAGATT TCAT TATCAACGGGACTGGCAC TGTGCC CTTCTAT CAGGCGTCTGCTCTCA CAGG GAAAT CAAC TTGGCACAGCCT CAAG CAT TGAAC CTAACACTGAGAG GCAGC CG GG CAGAGCAGAAAGAAAG CTGAAGACAG GC CC TT TC CATC CAAGTCAACAAATATTTC CTTTGAGTACGTGTACCT CAGGGTTTTT CC CAAAAAGAAAAATCCCTC CC CATA TAAACCCCCTAGAAAGAAGTCATTGCTGCCTCAAATGCATGTATTCAGCTGCAAGGCTTCTATTGCTACATGCTTTCT GATTATAGCCACCATTTCTAAGCCCTGCAGGTGTTTG( N) xCATCTCCAGAGCCTACTATTTTTTTTCGGTGTAAACA GTA(W)xTTCCAAAATCAAGTACAAAATCAAAAGGCAGATTTTCAAACAAAATCTTTGAAAAGTGGTTCT{ N ) xATAT ATAAATAAAAATAAAAGTGGTTCTGTCCTGCTTACACCCTCTTCTCACGCTTTTCAGGCACAAAGAAGGGCCAAGAGC AAGAGAGGATTTCTTTGGAGAGAGTCTGTTGAAGACACGCCTTCATGTTACAAGATCTTGAAACTCATGGGCACCATG GTCTGGGATCTGAATCCTGCAGCAAAGCTCTTCCAGGCTTCTTGCTATGCTGCCTGGGGTAACTGCACAAGGCCCCAT AATTAGACGACCCCACTGTGATGCTGGCATTTTG(N)xCCCAAGATGCTTATTTTATTTCA(N)xGATGGTTAATGGG TACAAGAAAAATAGTTACAAAGAATGAGTAACACTTACTACTTGGTCAGGTGTGGTGGCTCACACCTGTAAGCCCAGC TCTTTAGGAGGCCA { N) xCTAAGTCCACTTAGACAATCAGAGGTGCTTGTTTTAAACTCAGGTTCCTGGACCCCACTG TTTT TAACAGTT CAGCTTGATATC CAAG CCTGTAAGAGACACTTTGAT T CTCTG GATTAAAATGAGAGACAGTGAGAG GAGAG GAAGAAGAATAGAGGAG GAGAGATTTCTCTCTAAAAATGTAAGGGATTC CAATTAAAAATCAC CTCTCATTGT A C AT GCAGAAAA TGTACAGTGAGACACAAAGTTGGGTTGGAAGACAAAA CTC TGAAACTAGT CTGGGAGGGTTAAGTG A CTT CATGGAAATAAA TACTAAAA TAT CAAAATGCT CTAAACTTTCCATT TT TACTTCTAAATGTG CT GAGAAAAAAA T CTGATTATTTGA CAG GAATTT TAAAAAGAACAATTTT TATAAAACTAAAAT CACATTCC TGAACACT CAAATG CT TA GCATAGGGGATGAAATTGACCTGGAATTTTCTTTGTGAGAGGAAGGACCAAGTTCCCAATGTGGATACATTACCAGTT AGATTTTCCAAG CACAGTTCATAC TG CAGAATTCAC CTAACTTGATTTAGGTTT CTCCTTTATAAT CAGTTTCCACTT GAACTTCAACTACCACGAAAAAGTAAGTAAGTAAAG CAAA GACGTAAA TT CTTGGGAACT TTGT TATGAGAGGAACTT TCTAGACTGCACTTAGTTGTCTTATTTAATTAGCTACCAGGGAAACAAAACATATAGTATGTCCAAGGAGTTGACTCA ATGGAATGCTGT
> H s l6 _ 1526742 1 -152 84 48 6
TG CGAAAACT CAGGAATCGTTGAAAGG GACTTGTTATGAACAACAAAAGATG CT CCCCTAAACC C A TC ( N) xATACTT CT TC CTTCAT CG CAGAG G TT A (W) xAATTTATAATTCATTCAACACCGTTCTCAAGGCTAGGAATATGGCAGTAAACA ACAGATTGGCCCA(N)xGGCCAGTGGTGCCTCTGGCATCCCCAAAAGCAAGCCTCCAATACCACAAAGGTACACAGGG CCACTTTCCCTGCACCTCTCGGCTCCTGATCTACCCAACTCCAGACCCCAAGGTTCACCTCTAGCTCTGAAAAGGAGC ACAG CTTCTC CAGC CACATCTG C CTCATTTTGCCTCAACT GCAATTGGAATC GTTCTCTGATTGGCTC CCCTGGGTA C GTCCCCTAGCCTGACCTTCTCAATCCCCCACTTCTGTGCCCTACCATCCCCTTAAGTGACACTGTCAGCCAGGTTTGC CTGGCTCTCAAGCTGCAAATCTCTAAGACAGAAGTCAATGGGTAATGAGAATAAGAAAATGTGAAGGACTAGAACTCC AGATCTTGATGCTACTGGTATTAGTTAGGTCAGAGAAGATAATAATTTGACATCCTAGGAACCCAAATTTCCATCCCC GTCTAACTCCACTATCCAATGCTAAATTCAATGGCTTATACCAATGCTATTTGATTCTTAATCAGTGTCTGCTCCATC TG GT CCCTTTACAGTAC(N)xCTTTTAAGTAACACAGCTCACCCTGCATAAACATTA(N)xTTGAGCCACCGTGCCTG GCCAGAAGTAACATTTTAGATGTACAAATAGCAATAGAGATGGACCTTACAGCTTTAATGAGAGTGAGGAGGGGGAGT TAAAAAAAAAAAAACAAAAACAGCAT(N)xCATGCATCAATCTTTTTCAAAGTGTACACAT(N)xCTACAAATAAAAC TTAGGATGGAGGACTTTCAAAAGATATGGATTCA( N >xACTTAAAGTATAAAATAAATGAGATATTGATTCAGAGTCA TATT CCACAATTAATAACTATATTAG GAATAAAC(N)xTGAACCCAGAGACAAATCAGTGAGAGA(N)xCCTGGGTCC TTG CAGAGAC CCATGTGGATGTGC CACGAGCCTGTGAT C C ( N) XCTAGGATCTGCTTCTGGTTTTCAATTCTTCTATT ATTTAAGTCATGCAACAGTGTTAACCAGGCAGGCTCACTAGCAT{ N ) xTGGACACATGGATGGATAGAAGCAAAATAA AATCAATACTGTATGAGCCAAACAAAACAATGGCTTGAGCATTCTGTTTCTAATGATGCTCTCAGGAATGCAGCCACA AGGAAACTCTGCCCTCACCTCCACCCCATGAGGCAGAAACTGGCCACCAGTGACAGGGAAACAAAGTGGATTCCTCGA ATGT CATTGTTATTAAAACAAAAC TAGT CCCAGAAAACTGAC CTGAATGTAT TTTTTCTC TCACTT TTG GCCTGACTG T CTAATAACCAAAAGT CACCAGAAAAAAAAAATTTT TT CT CAGCAAATGCTTAAAAACAGAGAAGAC C CCCAAAGTAC ACAGGGGCAGAGAACAACTCACAAAT CT CAAGAATAAAATAAAGACGGAAGATT CTCACCCTACTG CAATGACT CT CC CAGC CCACTG CCAC CAAAGTTT CTGAGATTAAAAAGAGAAATTATTTAATñTAT GCAGTT TCAATT TTTAAAGTTT CA GGCATGAAAGAGCTTTTGCTCCCTAATGATGTCCCTCAGATGTAGCAAGACCTGGTCTCTCTTTCACTGAAGAATTGC CTTCATTTTCACCACAGACTTCCCTTGCCTCAAAGACAGATGCTCAGTCAGAGAAAGGAGCCTGAGGATCCATAGAAA CTCTCTGATATTAATTGATCTGCTCCTAGTTTTCAGTTATTCTATTTTTTAAAACATGCGACAGTGTTAACTAGGCAG GCTCACTAGCATGATGGATGGATGAGTGAGTGGGTGGGTAGGTT{ N } xTGGGTGGATGGATGGATGGATGGAAAAATG GATGGATGTATAGATGGAGATAGATATAGACAGAACTAGATACAGACAAATAGAGACATCGTTAAAGAGAGAGAGGTA AT TATTTATTTTTG CAACAAGTAT TTATTATCTACT TTAGGT CAGGCACTGCTC CAGCTTATAA CTGT TTGTTATAAC AGTAACCAAAACAAA CAAACAT CCAC TA C C { N ) xTGTTCAGGTTCAATATTACTTTTTAAATAGACAGACTTAACTAC CTGAGAAAAAAACAGG CATATC CCAAAAATACTAGGACAAGG GAAAAT TATAAAAGTTGATTTAACA C CAACAG GTAT GTAAAGGAGATGATGACATAATGCTTAGTTCCTGGCAAGATTTAGATTGGAGAAAAATAGCATGACATGGGCACACTT GGAAGAAACCAATGGG CTATCACCAAGAAGGACCTG CT CGAG CTCAAGATAACCAGGAAACATAA C CACCCCTTATGT TATAATAATGTGGTAACACAAAAAGGTGGGATGGCATGTCACATATTGACTCGGAAGTAGGCAAAAACGTGAGACCAA CTTG GGCTTA GATACACATACCTAG GAATATACTAG GTGAAGTCAAAGACAG CAG GAGATTGATAT TCTACATAAATA TGGAAATGCACTTTATGAGTTCTTGAGAATCTCTGGGGATTTTTCATAGGTCTCACCCATCACTGGTCATGTAAATAT GT CATTTTTAGCAAAA TCACATTCTCTT CACTTCTñTCTG TT CTGTAT TG CTGCAAGGACTG CT TTTTGAACTAGGGT ATGACAATGATCCGTCTTGATAGTGTTTGTGAGTCAATTCAGAATTACAGTGATTACTAAAGTTGGTCAC{ N ) xGCTA GACT CCTTGAGG GTGAGAAACT ATGTGAAGCAAGGT CTGG CC AATAG ACAGC CC CAAATG C C AG AATTGGGAGTGAAG TCATCATAGACAATCTGTCTCTAATGGAGTCCCCAGATGTGTGCAGAATAGGAATAGCCCCAGGAAAGACCAACAGAA
{ N) xTTTTGGATAATTTGTTATGCGGGAATAAATAACTAATACAGATGGTAGCTACAAGTCTCTTCCTATGCCTATGG TTGTCAAGTAGACAGGTGAAGCAATTAGTAGTGGAAACTATCCATTCATTAAACAAGTA(N)xACACACAGCCTAGTG GCTCACAATAAGCAACCTTTACAGCCACTGCTTGATGAAATATCTCCAGACAAAAGTATTCCTCCACCATAGCATAAG GTGATGGTCTGC CCTTGTGTAT C CAT CCTTGTTTAAGAAACAAACTAATGTC CCTTAACTAAAGAT T CTCCTTC CAGT GAGGACAAATGCAACTCCTCGTGACCACTGTTTCTCCTCTGGTGCCCTTCCAATAACTCñGAAAATCTAGATTATTGG TTGTTTATGACTTCACTGGATAGCACCTCTATTGTCATCAGGCCTTCAGGAGGTTGGGTCTTCTGAGAAATGCCTGCC CCTCACCCAGCCCTAGCAGGCCAATCCATTCCTCTTTGTGTGGTGGAGTCACGCCAGTGAAGTGGATGAGGAGTAGGA t g g g a g t c a g a a a a c t t a g g t t t c a t t t g t g c a t t c t c a g a c t t g t g g g a a g a t a c t t c a c c a g c c c c a g c c t t a g a a ATGAGAGTAACAGAGCTGTACAAT CT CT GAGGTGTTAT GTATTCTTAAGGATA(N ) xTTTAACTCAGGCCCATCCAGT CTTAAAAACATGGACTGAA C CAGTGTACATCCAACT GACAGATGAGTTTAGAAG CTAT CTAGTC CAGACAAG CT TGAA CTCAGCTAGTCCAGCCTTTCATGCACCTCACCTATTTAGTGACAGGGAAGTCATTACTTCTCCAAGCAACTTACATTA CGTTAG GCTGAAAATTGC CT GC CT GAAATTCCTG CT CACTAATCCCTCTTGG CC CT CTGTTAA CAT CAATAATT CCAC CGATAATTACAAAGCACCTACTAGGCACCAGACATTATCCTAAGTGCTTGGGATACACGTATGAAATTGCTCATTCCA ACTTGGTTTGGCCCAGTTTTCAAGGGTTT(N)xACT TC CCTAGCCTTTCCTCAGAG CCCAGCTGCTGTCATTTTTGGG GGGTCCTCTACCAAGTCTCTTGAATTCCACATTCCCTAGCCTTTCCTCAGAGTCCAACTGCTGTCATTTTGGGGGGAT CCTCTACTAAGTCTCTTGAATCCTACATTCCCCAACCTTTCCTTAAAGCCCAACTGCTGTCATTTTGGTGGGGGGTGG GGGGGTCCTCCACCAAGTCTCTTCTCTATTTCCTTCTAATTTTTCGCAATGGTCAGTACACCCAAGACCCTCTTTCCC TGAG CTG G GAAAAGACCT C (N ) xTCTATTAAATAATAGTAATGTCAACTATTAAACCTGCACCTTGGCATATGATATG CTACATTCATT(N)xAGTCCCCAAACCAACACGCCTTTC(N)xGAGCAGGGTACCTAGAATCCAAGTTCAGGCAAGTG C(N)xTACCTTGTCTTCCACAACCCTGAAATGCATGCTGGACAGTGACCATCCTGACTGGGACACTAGCGTGGCCCTC TGGGCTTGCATAGCAATTGATGCCTGGCTTTCCCTGTGCAGGTGGTGCTGATTAATGGTGCTGTTAAGAAGAGATCCA GCTG CTGC TTTCAGTTTC CT CTTC CTGCAGAGAT CCTTAATTACTGGGCTTCAC CTGTTGAGTT TC CTTAAATGACAT GAGTGTGTATTTTTAAGTGG GAAAA CAATTCATTAG GAAGTG CCAACAT CT CAGAG GAGGTAAAT C CT CCAGTT AATT TCAGAAC CACACTGCTTCAG CCACATAAGAGAGCGGTC CAAGGGGGTACTGCTTTTTT CTGAACGT CACCAAGC CAAA CAA C CAAC CTTCAGCAGATGGCAGGGAG CGGCCTTGAGAAAC CACAGGGAGGGGTCATGAGATCAAAC CAC C CTTAGC CATCAACCTTTAGCATCCTCCCTGGTTGCCCCTTTAGCTCAGCACCCTGGGAATGATGGGTCCACACAGCCATTTAGG TACT GCTT CCACACGGCTGAAATTGCAT TTGCAT TTTATTAAATAAAAGCAATCTG TG CAAAAG C C TAGC CTAAAGTT ACTAACAGATATTTGAGTCTAAGATAATACATCTTACAGTATTAGAATTCAGGTTCTAAGACAATCACCCTCAACCAT TGTCAATCTTTACATTAGGGTAGAT(N) xGCTAGTTGTTTTATACATAGAGAGAGAGATTACAAAATACTTTCTAAAG GATAGATC CTGG GTCTTATCACTGTATTGCTCATAC CAAG CAAAATGCCTGG CACAGAGTGTGC CACC GATACATGTT TTGTGCATAAAAATAAATAATGTCACTTGA{ N ) xTGTTAATATGTCCCTTGATTCTTGTTTGTACCTTTTTCCAATTA TTTAGC CT TACATATATATC CATGTATA TATAAATT TTACATAAATGCATAAAAAGTGAGCATT C C T T ( N) xCGGCAA ACTTTTTTTGCTACCAATTTTGGCTCATTTGCCCTGCCCCTCTATTCTCACCCCCATTGCCATCTCCCCACCTGCTCC AACACCTG CC CCACCAGAAAAC CTGTAC TGACAGTC CAGT CAGTGGTCTT CTATATAG TGTCCATC CT CATTTAATTC TAGACAGC CATACAGACACACGAGTGTGGGCAAT CATTGTTTTATAGAAATGGGAT CATATTCTAC CAACCGTTCCCA TGTATTCAACAATTCCTTGTTGTAAGCCCTCCAAGTT(N)xGCCCCCAATACCCCTGATATCACTCACCTTCACTGTC CTTATTGGGC ACGT ACCAGT CC AT AGGTGG AAGC CC CATTGCTT AGTCAG CCACTCCC CC ATGG ATGAGC AC TC ACTT TGCTTC CCATTT CTGCCCGT CAGAAATAACTCAG CCACAAACAGGAGGTCAT CTGATG CCTGCAAC CACC CTATACAG CACAAATATC CATGTGAATCTTTCTT CC CATCCG CAGACACATAGCCAAATG CCTAGCAGGCCTTACCTGAC CC CACT AAAACAGT CTTT CTGGCCAGGGAAAATTATAAAGAATTATATGAGATCACACATACAAAAAAAAAAAAAC CTTCACAG TGTGCAGAAGGCTTTGTGTCAAAGTCGAGCTCGCACGTGGCCGTGGCCGGGTTCTCTGTGGCCAGCCCCATTCACAAC ACAG CT CTGG CAGAGCACTCTT CAATAG CATCATTAATAG CC CATTGATC CAATTACATTTCTTTCTGTT CCACTTCA GAAT TACAAT CT CCTGTT C C CC CTGAAAGAGCCCAGATAGATTCCCAGATGAGTAT CGGAAGAGGTAG CTGC CATCAG GACT TG CT TGATGGACCCAAGCAC TCAT CTAATAGTG G CACTGGAAGAAT TAAT CTGG CTGATATAT CAGAC CT CTCT CCTC CGTG GGAACTTGCCATTTTC TC CAGCCTCTGAGTTT TT CTAGTGAGATTCAGAGATACAG CCAAATGTGAGTTA ACCCAGGGACATCCTAACACTGCCCTCATTGTGTGTTCCCAGCATGGGGTAGGCACTCCATAAATAATCGTTGAATTG AAAAAT GC CTAAGGAGGGGTATATTGGGGCTCAAAG CAGACCAGGTCGGCTGAAGAAATGGGCT TTATCCAGATTGTG ACTTTGCAGGTTGGACCAGAATGAGAGTTTGAATCTGATTCCATAGGTAATGCATAAAATGAAAATATATACTATAAA AAGGCAAAGTCAAGACCTTGAATGATAGCTGAGAAAAAAGTAGAGAGAGCTAAATACATATTCCAAGTCTAATGGGCA GGATAATGAGGGACAGAGGTAG GCAG CAATAGGAC CAAAATAGC{ N ) xAGGATGAGGAAGAATTAGTAATGTTGAATA CTGG CAG GTGAACAATTCATACTGTGAGAGGCTGTGGC CC CAGTACCCTTGCAT CTGCTCTAGACTTTGACTTCTACA GTGAGACT CCTGGCCAATGAGGAT CTATGTCCACAGTC CCTGGGGGTCAT CAGCCCTGCA(N ) xGATTATGTTTACCA GGTT CCAG GTAAGTGGAAGGAAATTAAAAGACAG CC CAGC TG GAAACAGTGGGCAG GCATTACAGG CAGG CC CAGCTT TCTAACAGCTATCATAAAACTGGCTGCTCGCTGGAGCTGTGAATTTCCCATCGCTGGTTAGAGAAATTCTTTAAAAGG GAGATT CTTTTAAAAATGGCTGT CCTAGATGATGT CTAAG CT CATCTAAT TTTCACTAG GTTAC TACAATGGAGAATG GTTGATGAG TAAGCAAAGACAG CAGAAACCCACTACACAGTGAAATAGGAATAGAT GTGTGTTATTG T CAAT GTGATT GCTCTTGGTTGAATGATGAGAT CAGGGG( N) xTTTAATCAATTAAACTAAAAGATCCATTCTCAGAGAAATAATGACA GTATGT CATGAG CAAGGATTATAACTAACCTAAT TCTAAACATCCGAGGAT(N ) xTGAGAATCATTTTTAAAGTTTGA TTTAAACAAT CGCCCATC CC CAGATCTGATTTGGGG CATCAATGGGGCCAAATGACATGGGTTT CCTAGAGAGCAAAT CTCCGGATGT CATACAGATACTTAAAAC CTTCAATGGCTC CC CATTTCCCTACAACACATTCCTAGTC CT CAGCTTGA TGTTTGAGGCTTTTAATAATATAGCTCTTGCCAGCTGTACCTTACACAACGCCAGACACACTCAGACCAGAGGTTTCA ATCT CAAGGG CATATAGGGACT CAGTGGAGTGTG CTGGGC CAGGACAAAACCACAAGGGCATGC CC CAACTCAACAAC TGAG CAAC TT CACAGGT C CAG (N ) xCTCTATGATTTACTTAGACTCTATTTTTTAATTACTTAGATTTACTAAAAGGT GCTATGCTCGCACCCCTTCAAATATCCCTCCCCGATGTTTCCTCTCTCTAGGTTACTCCTCTCTGTCTCCTTTCCTGG ATTTATTCCAGCCCAACCTTTAAGACTAGCTTCTAGTTGCAGTCATTCCTGCAGAATTACCAATTTCTTTTTACTCTT CCATATTGTCACAATGTTTTACAATGAACCTGA(N)xCCACAGTTATCTTCTGCTAGACTTTGTTTGCTATCACACAC CTGCTTGGGTTTTTCCTGGAAACACTTCCTATTAAACTGGTTTTAAATGAGCCCTCATTTCAGGATCTGCTTCTAGGA
c c t t t g a t a g a c a g c t t t t g a c a t g t t c t c c a a t g a t c t c c a g c a t c t t c t c t g t t a a t g c c c t t a t a c a a t c c c c t
> H s l6 156 56 18 8 -156 75 324
TAATTAAAAATÜAGTTAGGTGACACAGAAAACGAACTCACTÜTACCATGCCCCTTTGAGCCAGAGTTGTTTGÜTGCCA AAA CTCTGCCTCCTAATCA CTACCTGTCTCTC CTTTTGTCCTTTCTCCATCTTCTCCTCT CTTAC CTCTCAAAAAT TA TCAATAATTTGATAATTCTGTGGTTAGAATC(N)xTGATGTAGCCTGATCTCTGCAAAAGCGTCTGGAGTCTCTCCAG TTGAAC GATCGCTGATAGAAACTCACGG GAACAGTCTG TG CC CT CC CGGCAAGTCCACGCT CAT TCGAGGCAGCTGTC A C CTGCTAGG AC ATTCCAGGGC ACGACTTG AAAAATGCTTGG CGTG CCTCTG AC AACAAG AGATGC AGGCTGAAGC AA GG CATT GCTGCCTCTTGT CCTGAAACTGGGATGCTG GCAT TA GAATGGACAG CATGAGTTGTTCTTGAGTGG CAAAAG GCAAGGGGTCC(N)x GATATTATTTC CT CTTCATTAAAGAACAGCTTTTCTAAATGTTGGGGGAAATGTCCATAGTCA TT A CTC AAT C AAAACTTGTGTT CCCATAAG CCTAAGGACC ATTC TAGATTTTTT AC ATGT TTTTTTGTGTGTGTGT AT CTATAAAATG CATA CATAAATT TTTTTTTGTTTT TAAG CATT CACC CAAACAAAAAATCACAGGTAAACCCATATTTC TGAGATGCCATTATTCCGAGCTAAATAAGAGATAATCACTTCAAGGTAAATTGAAAATTTTCCTGAAGCCATACATTT CAAGTGAAATAAGTAATT CTAAATAGGACAAT TTAAAT TGGATAATTT TAAAG CGT CTATAATTGGTTTATTTG CAAA ATTCCTGAAAGGAAAAATTTTAT CACTGCCAT CACAGCAGGTTT CCACATCCAGATGAGAAAACAAGACAAATG CTAG TGTGTTTTAA CTAG CTAAACAAAACTAAGTTAAATGAATATTTAAAAATTTC CCTAGTGGG CCATTCCTTAACAAAAT GT TGAAATCC CTGTTGCTACAT TGACTAAAAGAT CATGTTGAATGGAATATGTAAGACTTGGCT CATAGAAACCTAAT CAGATGGTTAGAGATGCTGGCAGTTTAGGACCTGCTGCCATAAATGTGTGAACAACCTTTTGTAATCTAACCTACTGA CCTGCATGTTTTTTCTTTACCC CAGCTCATTC CTTACATG TAGC CT CAATCTTCAG TTTGCTTTACTGGTTCAG CAAA AG C CAGGAAGAACAACTTTGTAGTAATCAGAATGTTAT C CAACTGTATATTGTTTACTTTATCGTAAATACTGGTGAA CAGTGGTTAATAAATAGT TTTATATT CC TTTATG CAAAAAAAAAAA( N } xGGGCCCCATGCTGGTATACAGATGGCAC TGCTGAAAGGAGTCAGCAGCCCCGGCCCTTGATGGAGACGCTGAGGAACGAG{ N ) xTGTCTCAAAAAACAAAACAAGA ACAACAAAGACAAAGACACGCAGAGTGAATGTGAACCTGTGTATCACGGGGAAGTTATTTGAAGATACATCTCAACAG CTATGGAAATATCAAGCC CTTTGTACACAGTGATTC CACTTCTGAGAAT CTAAACC C CAAATAATCCAACA(N ) xACA GCAAGATGAGTTAATTTTTGTAGCACACACACAAAACCATAATGTGTAATGCAAAATTATAATGTGTAATGCAAAACT AGGTTG CAAAAGCAACCCAATTACTGAGATGACAG GAG C CAG CACCCACTGTCCCC CATGG C C (N ) xTTTTTGTGCAA TAACAATGGT GTCATATATGATATTATAAG CATACG CT TT TTTTTTAAATCT GGAG G A ( N ) xTTAAACATTTTCAAAA TAAAAATGTTTTTAAAAAATCCAGAG GAATA CATA CTAAACCGT TGA CAATGATT C TCTC CGAAGGACAGAG GGAGAC TTTCCTCTATGTTGTGAATTTCTGCAGTGTTTAAAAAACGTAAACTTTAGCTGAAAAACGAAACTGAATATGCCATAT GAACCCACTTATATAAGCAAACCTAAGTCAGGCACATGAAGACTGGAAGGAGCTCTATCGAGGTGTTATTAACAGTGA CTGAT(N)xGTGATCATTTCTAGGTGGTGGGATAATTGAGCAGTGTTTGGCTTTTTCCTTTTGAGGGAAAACCTCTAC AATGAG CATGTATGGCCT CTTCAAAAGATAAC CCAC CTTTAATAGCTT CTTTGTGTTAAC CGGGTGATGCTCAGGGCG TAATGCTCTGCTGATGTAACCTGCTGTTGATGGTGTCCTGTTTCCCCAGGATGGATGACATCCAGCTCTGCAAGGACA TCATGGACTTGAAGCAGGAGCTGCAGAACTTGGTCGCCATCCCAGGTAACCATTTGCAACTTCACCTTGTGCTAAACA GGTGCTTGGGGGCCCCCAGCCTGGCCCATCTGTACGCCTGGGTCTGGAAGACAGAGCCTCATTCTTGCAATGCAGTGC TCCCTGACTG CATGTGTCAAACACA(N)xGTAAGAGGGTCTAATTCATTGAGTATAAATATCCTAAAGA(N)xAAATT AAATAT CATAAAAATCGATGAGATGATG CAAACTAAGTGC TTAGCTTGATATTTGG CCTCAAGAGTCCAGTAAATATT AGCAATTATGACCATTATTATCCTACTTGGGAACAGATGAAGTGTGTACAAACTCTAACTTTGAAGTCACCCCAGATT TGGTAGTTAATAATGCCTGACTTATTTG GCAT CT T C TC CAA(N )xTTATCAAAATGCTAGTCCCA(N )xAATAGGCCA GATAAG CACC CACCATTAC CCTTGACTTGAAT GTTTTTGC CATG CTGT C A G (N ) xTTCTTACTGAGCTACAAGTGCAG CATGAT GAGC CTTAAAGCAGCGTTTC C CAACCTT CAACTATTGACATTTGGGGTTGA CAAATGT CAGTATCTTC C (N ) xCAAGCGAGGTCAGGGGGGTGTGCAACATTGCCC(NIxGCTCCCACATAAGAACTACTACTTTAGTAA<N)xCACAGT CTTGTTGTAAGGGTAGTGTGCGCAATGACAGTAGAACTGAATCCTCTAGCTTTCGTGAGCATTTACTAGGTGCCCAGC A CTGTGTAGG CCCTAAG GATAAA CGAGT GAATAGA CTTT C CTAGTTGAAAGAGAG CGACAGCTAGTG GGTATGGGGTT TCTTCTGGGGGTG(N)xTAAAGCTGTTACCAAAAAAAGTTAAATGTCAAACATAAAACAACGAAGGAATGAGGGAAAA GT ACCAAAGT CT ATTCTC CACC CACT AG AAAC CC CAAT GCAATTGATCTGGC ATTGTTGG CTTTGGACTCTGGC ATTG TAATAGAACTCAACTTG(N)xTTCCTAATCTTCTGGTTAATTTGTTA(N)xCCTGATCCATCTCAGCATTAGTAA(N) xCAGGCCTGTTCCCCTCTCTCATTTCACAATTCTCCAAGGATTAGTCTGGCTGAAAGGTACAGAAACCCATCCAAGCT TATGTAAGCACAAAA CAAA CAATTAC TG GAAG GACAGTG GAG TCTC CT GAAG CCCTAGG G C A TTAC{ N)xAGCAACTT ACAAAT TACTTTGGAAATAGAGTATTTC CAAATAGAAGAGATATAAAC TGGGGAAAACTATTTC CAAATAGGGCACAC ATGAACTTTGGGGGTGCCTTAGTCTCTGTGGCTTGCTC( N ) xGTGGGTGGCACCTGTAGGCAAGTGGAAGATGATGGG AAAGGTATGTGGCACACACGCACGTGAGCAGCATTGGCCAATGACTTGGAGACTTGGGCTAGTCATAGGTCTGTGAGC TGACGCAGGCCACTTGTGTGAGTAGGTGGTTTCTCCTGGGAGTTGCTGCAGCTGAGGCTGGGGAATGCCTCTATTGGT GAGTGACAGAACACATGGATGAATGG CTTGGAGACAGAG GTTTG CA GATGTT GGG G GCCCAGCCAGTT GTTG GGTGAT TCTGGCTAAGATCTAGAGAGCAGTGGAGGGCACTTGGGAATGTCAGAGTGCTGTCAATGGTTATTTGGAGAGTAAAGT CACACCACGAGCTCCTGAGTGAGCAGGGGTCCTGGTGAGTCGATGGGTGTTGGTGCGTGCTTGGAGAGGGCAGGGAGC GG CTCTGCGGTGCTGACGAGTGATTTGG GGAATCAGAC CCAACC CACAAGATTTAC CACG G CAGGAGCAGTGACAC CA T C ATCC AATG CACC AATGGCTGGGAATGGGGAGCGG CTGG CAAG CCTATTGC CCTT CTTCGTGC AGTG GTGG ACTC CT GGTTAATGAGAGAAACCAACTTTCTGTCCTATCCAAATATTATGTTCTCCTTACCCCTGGTTCTGCTTCCAGATCCAG AAAAAATGAT CAAC GTCAGTCGTGGG CAGG CAGACAAG CTGT CC CACATTTTTTAT TCTGAAGGGATTTGAG CAGAAA GAGCCCAAGTGGGTTTAAGTTTTTGC CT CTTAATTTTCTCAAAC CT CAT CTT CAGC CCTC TCTCTTTTTTTTATTT TT TCTCTTTGGCATCCTAGCACTTGGTGTTTGGGACAGGGTTAGTGTCATTGGCATGGATGATTTTCTTCTGATAAATTG GTTTACCTTCACCAAAGGGAAAAAAACAACTAGTTCCCTCCCAGCACCCCTATCTGAGCCTCTCCTCACCCCCTGCCT TTGCAGAAGACCCTGGCTTTGTCATGACCATTTACTGAAGGCTCTCAGCCAAGGCCAAGCACTCCACTAAGCCATTTA CTAACACTACTGCA{ N ) xCATTACTCCCATTTTAAGCCTCACTGTGT(N)xGTGGTCACCCTCTCATCTGTCTCTCTG TAA(N)xCCTCAGCCTCCCAAAATGCTCATCTGTCTCCTTGTTGATGTCTTCTTTCCAGAAAAAGAAAAAACCAAACT GCAGAAGCAGAGAGAGGATGAGCTAATC CAGAAGAT CCACAAA CTGGTGCAGAAGAGAGACTT C CTGGTGGACGATGC GGAG GT CGAG CGGTTAAGGT GAGTGCACTGCGGGTACCCGATCACTGGGCTGCAGGACAG CAAC CTTCTGCTTCCTTC CAGAAC C CAT CTTCTG CTCCTGGGACTCACAGCCTTGC CTAACAGGCATGGC CTTGAT CCACAGT CTCTGGACACT
>H sl6_15476486-15505558
AC AC AG CAG AGAACGAACTG ACAGG ATT CCTCTCTT ATGT AACTCAC A TT CTT ATATG AT AATG ATAAGGGT T AA CAT T A { N)xCACCACGTTATGTGTTCTCCAGCCCATGGAGAATAATTTTAACACAGTCAATGAAATTTCTACACAACAATG TTCTTGTCTCAAGTCCAAGAAAGCCATAATACTGGCTTTCTGGTGAGTAAAGATGCCATTCTCATGTGTAATCAGGTG GCAAATG GAGATATGACCAAAGTAAC CCTTTGCC{ N) XCTCAGTTATATTTATGGGTATGTTGTTGATATACATGTTC CAAAAATTACATACAT TTATACAAATTTAATA TGTTATGATTTGTAAT TTTGATAGTTATACTAAATATTTG TTAAAG TTATATTTGTATAAACATGTTATGAATGGCTGGGCACCGTCACTCAT(N ) xCAGTAAGTACATTTTTTATTATCAAAA AAGAGTAGTGTATGATTGGCTTATTCTGTGTAGAATGTATTTTATCGATGGCTTCTATTTTTATAATTTCTGAGTTAA GTAATTTT TAATGAATG CTTTTTAGTTTGGGG CAGATT CAGTTGACTAAAGCACCTCATT T C CCAGATACATGAAATA AAATATTTGG CTTCTTTT C CAATTTCACAC TGATGT TATT TTGTGAAAAT CAGTGCTTTAAGATAAAT CTTTATACTT TAAGGTAAACATGAG AAACTTGATCTAATATT TAATATTTATTCAGTT CTA CACTTTATTAACTTCTACAC CAG CAGA TTTAAAAATTATGTAACTAT CTCAAGAAGTTT C A C T T (N ) xGTTTCAGCAAATTCCACCTAAGAATTCCACCAGAGTT CTGTTGTCTCCAATGTCATGTTCCACAGATTTCAAGTTGTGAAGCCCTGAACTGTTAATTTATCTTGAGAATGTACAT TTAAGCTTAATTTAAGACTATATACCTAAAAATTGAGCATATAATTTGTATAATTTGTTTATGTAAGTTTTTGTAAGT CATAAGTATGTAGTTT CCAAGTATATAATTTAT CTGAATGTAATAGGCATTAATATATTTTACATTATTGG GAC CATA ATACAGAAATTTCTAAATGGTTTGTAAAATAACTTGTTATTTGCAATGTTGTAAAAGTAGTTAATACAATGGAAAAAC TCATAATAAGAAGATACATT TTAACAT CAAGAAGTT TACC CAAGGTAATTAT GAATACTACCTGGCAAACTTTACAGA AGCTGTGGTATCACTTTTATGATAGAAGAATAGTGTTTGCATTTTGTGTAAAAGTACTTGG(N)xGCAGAAGATGAAT AAATAAATGGATGCCA CTGAATGAGATGAGGT CTCT CT TGAAGGAGAGAGAAAAAGAGAT TTAAATAGTAACAATTAT A A T A A (N ) xTAAATATAAAAAGTATTAGAGTCCTAACAGAGGAAAGTTTCCACTGATCACCTTTTAGCTTTGAACAAT GCAGAAGCATTTGCC CAGTTTACTTG TAATTAAAAATCATGCATCATT CACAATTTAT CAGT CTTTTTTGTTTG TACA AAAACTAATATAAGTTAT TCTCTTTTGT CTGTATTGTGACTGGTTTGGTGAGAGGGAATTAGGC CACTCGAGAG TTTG TGTGTGTTTAAAATTTTCT(N)xCTATCACAGCATAAAGTAGGAGGGATATTTCATTACTGTCTAGTTAAACTGGTTA ATGCGGAAAGGAAGTCTGGAAATTCCAGTTTTAAAGTAAAATTTTGGACATTGTAGGATTGATTATTTGGCATAGCTG TGATGTTTGTTGCTGCATTATGGTTTTGTTGGCAGGGCAGCCTTTAAGGACCTGTATATTTTCTTCTAGACTCTATAT ATTCCCCGTCAGTATTAGTTGCATGGTCAAACTGGCAAATTTTACCATAGGTATAAATAATAGAGAATGTGGAAGAAT AGTGAATAGTGTCAGAGATAGTTAAAGGTCCATACAGAAGTAGAGAAGGTAATAAGTAATAGTGGCTTGGACTAAATA TTTGTT GAATAAATTTTTTAAAAAACAG C CTACCTAAAATTTGTGTTGAAGATATGAATGAATGAAGTTT CCGCACCC T T A (N) xCTACCCGAGACACCTTAGGAGATTTGTAACAGCTGTAATGCCAGGTCCACCATATTTTTAGCATAAAGCAA ATGT TTAC CCATGATATGACTGCACAGG CTTT CAGC TGGAGC CATAGCAACT CAAGTAGTAACC CTAT CTTAGT CTGA TTAAAAGTAAATATTAGT(N ) xCTAATATGACTTAATACATTCATTTTGGAGGGCAAGTCTCTCAAAATGGGCCTTTC ACTGGGGGAAAATGGTAAAAATACTC CC TGGTAATT CAAGAATTGCAGACTC CTGAGATG CTGCTCATATTAGCTGAA CACTTACCAATACTTCACTTTTTTCCATATATACTCAAGGAACAAGTGCTATTTAAAGTGTTTCACTCTGCTGTGCTA GGTGCAAGACTATAAAGAGGTGTGAGGATCAACACTTTTATGAAAACCAACGTCATTCTGAATGTAGTTTCAGATGCT AGTG CAAAGGAAGTTCTTGGTATACG GAAAAAGTATT CAACAATAAATTAGG CATGGTTG CTTC CATTTT CTGC CTCA CACACTTT TTTTTCGTG GTTAAAGTGATAAAACGTCTATGATATTTTAGATTGGCAGTTG CAAACTAGTGGT CCTCAG CGGGTTTTTTATGACACCTACAAGGTTTGAAGACTTTGATTT CATATTAAAAATCTGGGT TT CAG T{ N) xATTCTGGG TTTCTGGCATCTCAAAAAAAAAAAAAAAAAGGAAAGGTCAGGGCAGCTACAGTCCTCTATTAAGCAATGTGCCACAG t N)xTGCTTTAGAATCTGCCTGGAATCTGCAGCCTCTATTATATAGTTCCCTATAGACTTTGCTTCCTACCATCTTACA TTCTGC CTTATAGGCATT TGTGTTTG CAACTCTTGCTTTTGT CAGTAT GCTACGCTGG TGA CAT TGAC CAAATT GACC ACACATTAATTATAAGCTTAGTTGGTGATGACCTCAATGGAATAACATGACATAAGTATTGTGACACTACTTCTTGCA TGTATCTGCAGGTGGAATTGTAAACCTGGTGGTCCGAGATGGTCTAATTCGATCTTCCTATGTATCTCCTTATATTAA TAG TG GTAACATTTGT GGTGGTGATT CAGCGTTTCAATG CCT CTTCTCATG G CAACAA CAAACGTTTT CCG T CTGAAT CAACATTAAC CTAGAT GTTACTGCGGAT CAGAATTAGACT CTA CATTT T CAACCACAGAAATAT TGGG CAGTAAA CAT TTTT CTTAATATTGATTG CCTACATA GGTTGTGTAATTAGCATATGTTTATAGTTCTATGATTTGTGCATG GCTGCTG CAGAGCTG GA GGGGGTAAAG CAACAGTGTTTT CTCAGTTGTG CGAGCAG CATTACATTATAATGAATAGGTAATATTA AACTGGGCTGATAAGAGTTG CAAAAGACTACT TTAATG TT C ( N) xTAAATAGGGCTGATGAATCATATTACAGTAATC CAATATTT CC CTTGTAAAGT CCCTGAGGTTTACAGAAG GAAATCTTTT CTAATAACAAAAAGAGTGAGGCATGTGCTT TGGT CTTTATTTCCTAGT TATACGAT CCTGGGTAAGGAGC TCAACTTTTCTC TGCCAAAATTAAA(N ) xATCTCTAAA ATAGAGAT TGAGTCTGAAGATCCACAAAGTG GTAGAAC CGAT CTTAAGAT T C CCAAATAATAGAATTGTG CT TTATTT GCCTAAGTAGTATGGTAT TACTTATTGTATGC CCAG G GGATC{ N ) xTTCCCTTCTTAAATAGTTTTAAGGGGGAGAAG AAAGAAAAAGACTGATTTTCTTCCCTTCCACTACCAAAACAGGCAAAACACACCAGAACACTTTGCTTAGCA{ N ) xGT CCCATCATTTTTTAAGATAAATTCACATACCATAAAACTTACCATTTTATTTTCTCTCTCTTTTCTCTTTCTTCC(N) xCCCCATGCCTTCTGCCTCTGTGGCTTTGCCTGTTCTCGGTATTTTGTGTCAATGGACCCATGCATGGGGGTGGGCCG GGCACCTTTCAAACGTTCAGCCCCCAGGACGGCCCGTGGCCTCCGGATTGGACGGCGCGAGCTGGGTGTGTGTGGGTC GCTCAATCGCTCCGGAGCTTCTGGAGGGGGCAGATGCAGGTGCCGGCTGCTGCAGTGCAGTAGCTGCTGGAGGCTGGG GAGG CC CGGACCCGGTGCAG GAAGACGC CGAC CACG CG GG CT CCTGAT CGCGGGCGCC CACAGCGCGGACATGG CGGG CTGGTGGCCGGCGTTGTCGCGCGCGGCCCGGCGCCACCCGTGGCCCACCAACGTGCTGCTTTACGGCTCGCTCGTCTC GGCCGGGGACGCGCTGCAACAGCGGCTGCAGGGCCGCGAGGCCAACTGGCGCCAGACGCGGCGCGTGGCCACGTTGGT GGTGACCTTCCACG CCAACTTCAACTA C GTGTGG CT GCGCCTGCTG GAGCGC GCGCTCCCGGGCCGAGCGCCGCACGC CCTGCTGGCCAAGTTGCTGTGCGACCAGGTGGTCGGTGCGCCCATCGCGGTCTCGGCCTTCTATGTCGGTGAGGGGCC GGGAGG GGACCTGGGGGGTGGGACC CAGTATTGGGG GACTGGAGGCTGGGACTCGGGGAT CAAG CGGC TG GAGGGAGG GCGCTGCAGGAGCCTGGGGACCCGGGCAGGAGCCGGGGCTCTGAGACCGGAGCTGCTTCAGAGGCCTGAAGGGCCAGG GCAAAGGCTAGAGG CTGGGGTGTTTGATTG CGAAAATTTGCAGGAGGCACTG GC TGGGGAGC CGAGAAAT TGGGGAGC CAGAGGTCCACCAGGTCAGGAATTTAGAGGACTAGGGTCGGGGCTCAGACTCTGGGGTTGGGTGTAGACCAAGGCTGG AGGTAGGCAGGGCTGCGGAGGTCTTTGCCCCCTGAAAAAGGCCCACGGGAAAATAAAGGAGATCCCAGTTCAAGGCCG TAGCAG CTGCAATGAC CGTGAAC CAGTCGTGG GACAGGGAAGGAAC CGAC CTGC TGTTCAGAAAAT CCACGCCTGGGG CCCCGGGG CCAACACAGGAACAAAATG CAAGGGGAT CAGTGACC CTGAGGGG CAGAGCAGGTGGAATCTGAG GTTGGG CCTGGATCTT CAGCAAATTCATGC CCAGGGGCCTAGTCCTGGTT TGGTAACAAGATTTTC CCCTGGGGTC TCACGGAA AACAT CAAGGGACCTTGGATCCTTGC CATGGAGGACTATATACCTGTGAGGGTT CAGGGAAC CCTCTC CTTA GAAAAT CCGACC CGGGCCTGGGGTGTGGGGGA CAGGGG CTTGACTGTGGAAACCTT CT CTACTGCATACAATAAGAACAATGGA AATGGTACCTGAAAAATACCGGGGTTCAT(N)xGGAGGGGCCCTGTGGCCTCTAGCATCATTGCGATTTCTAGAGCCT TGAAG GGGATGG GGTGGGTAACTG GGGAGAAAAT CTCTACGGGTCT CAAG CCTGACCTTGGGATTGGG GATOACAGAT CTCCCCCCAAATGGGAGGGAACTTCTCCTAAGGGCTGTTGGTGCTTTCTAAACCAGGTTGAAGTCAGACAAGCCACCA TCTTGAAGAAACCTACCCTGCCCCGTGTGCCTTATTTTTAAGAGTTGGATTCACTTCCTTTCCCTACAGACCCCACCA GAAACACAAACCAC CTGGTTGCCTGTTGTCATTGCGGCCAA CTACATGGC TGACGGGGTCAGTGGGTCAG GAGAGCAG GCAGCCTGAGAGCTAGGGCCTCCTGTTTTATCAGAAGGTCTGTGCCTGTGCCCGGTGTGTGATTTGCAGACTAGATTT GCAAACAG CTTC CTAG CAC CTGTTA CAGGAAC CTG GTACACTGGTGAAG GGGTAGGGGAC CAGAAT GCTATT TAATCC T CAAACAACTAGTGAGATTGTTCT CTTTTT CT TCTTTTTTTTTTG AG AC(N ) xATCCAGGCACTTGATGCTCTCATGG TGGGAAGTTTGACAGTTTGCCTGAAACTGAAAGATGCTGAGTTACTAACAAGACTTGAACCTAAGTCCAGGCCAAGCT CTCTGATACAGATTCCTGGCCAGATTGCAGCTTTGCTTTGGTCTTCAGTTTTTC( N } xGGTTTTCAGTTTTTCTGGTT CAGTTTCCCTTACCTCACATTTCAATGAAAGGGGAAGTGGAAGGGGATAGAGTGGAGGTTTGCGTAAGGCTTTACATG GAAATTGCAG GAAT CAAT TGATCTTTGTGCTGGGGCTACCTT GG GGAACCAGAGT CGTflACAGATTAACAAC TCTGGC TTGGGAAT CAGACACCTG GGTTGAACTC TGTCCGCTA(N)xTTTAATTAATGCTGGGTAAGTGGAACACAGTATTACC AATATT TTGCTG GGTCATGTAAGAAT CGTCTGTAATTAACTAATAAAG CT CTTATATTCC CT CC CT CCTAAT CAG GGA TTG GAAAGAATT TTTAAAG TACT CTCAGTTTTACAACCTTCCTAAAATAGGAGAG CTTTG CCTT GCTACCTGAATTTA T T G T T A C (N ) xTGAATTTCTCTTCAGGGATATTGAAGATCAAAGTGTTTTTAGGAAGGGTGAATTGGTGTTCCACTTT CAAATTTTGAATAGGG TAATACTGATGGAAAATT TGTAGTCG CCAGAG TATTTGG GTCCACTGG C C ( N } xCACTAGCT rpTTTTAAAAA GACA GAAATTAAAATTTCAAAATGTT CAAATGTT CAACAT TT CACAAAATGTTTGGAAG GTAGAGAAA GTAACCAATAGACC CA CTATTGTAAC CTGATTGTTTTTGTGAAT TGTTAGTTTT TGAAAAAATAAT CGTT CTGCTATA ATTTTTATACTC TG CTAT TCTCAC TT CGTATTTATCATTCCTTTAAAATT TATCTAGTTT TAGAGT GGAGATTATG(N )xTCTATGTTCTCTTTTCACTGTTGGGT CAGAGGGT CATATTGT TGGATGATAGTGATGGTTAC TT TG CATT TTGGAA TGGGAAACTTAAACATGAACTGTTATACGGTACTCTACTGAATCTTTTAAAATTTTTTTTGGGTTTTGCAAATAGGTA TGAGCATTCTCCAAGGAAAGGATGACATATTTTTGGACCTGAAACAGAAATTCTGGAATACCTATCTGGTAAGATAGG CGTTTGAAAATGTAAT CAC(N )xGATAATTTGACACGTGTTTAATCACATCAGGATTTTTTTG (ti )xAACATTAAG TA CATTTT TATTAAAC TATT C CTAAATTAGAG CTCCAGTTTGGTTGA CAGAAGGGT C CAATATTTT GACATAGAAATCTT AAAGTTATTT TGATAGATGTATGAGGAGAG GTTT TAAGAGTATCTGTACT TCTCATGAACTG GAAAGAAATAATGTCT C T G ( N ) xGAAATAATATCTCTGAATTTTAGTCCATGTTCTCAGTGATTTCTGACTTTATTATATTAACTCTTAGTAAT GAT CA CAATT TACTTT GTAAATTT TG CAGC CAAAG GGACAA CAATTTATT CTGTAAATTTGG GATT CTATTTGCAAAA TAGAGATTATAGATTG TGAGTGGCTTAATGGAAT CATTC CCACATT CAGAGC CAATGTTTT CAGGTGAGCAT C CTGCG GAGTCCTG GTGAAAAGAACTTTAACATCACAAAGGTATTGATAGAGAAGGAGAAG GTTTC CAAGGGTTATTC CAAGGT TTGAAG CAAAT CAGCATTTTTACTGTTTGCTCGG CACAAATATA TACAGTCCAGTCCAATACATATTTATTCATTGAT T CAGTTAATATT TATT GAA CAGTGT C C(N )xATTATGTTGTAAAGTCTATGGAGATATGTACATCTAAC(N )xGAGAG CAGGAAGTAGAATTGGGAAAGAAGCTTCAGGCAGGGATTTGTAAATGAACAAAAC(N)xTCTGAAGTTATGCACATAC TGGGGGACñGGT CATT CCAGGGGCAGAAGAGC TGAGTACAGTTGACAATG CGTAAAGCAC CATG GTATGCCTGGAGGG GAGAAACCACAGGCAGTT TGGTG T TCACAGAGTATTTGTTGAGCAACT TGGAATGAG TTTGGAT CCTGGAGACTGTTG AACACCCAGAGCTGAGGATCTTGAACTTCACAGTTAGGTAATGGCAGCAAAGGAAACCAAAGTGGGTGGTTATCGTGA CCAAGTTCACATAA GAAATAAGGAAG CATAT(N)xAGCACAGAGGAGAGTTCATGGCAGGGT( N ) xATACTACTACAT GATACTACATTCTGACAAGAAAAAAAACACAG CCATAAGAAATAAA GC CAT(N)xAGAAAGAAGGAAAGCCATGTTTG GATAAG TTTAGTCTTCTGATTATTTTTG CTAAAGTTTTAA( N ) xATATTTTATATTTAAAGCATATTTAACAATTAAG { N ) xGCTTTTATTGCAATTGTTCATTTTAGAAACTTCAGAAAATACAAATTAGTAAAAAGAAGAAAATAAGTCTGTAA TACCACTTTTAAGGAC TATTTTGAAAAC CT CTAG CTGTTTTTTGAGATAAATTTACATAC CAAGACTTTTTTTTAAAA AAT CATAATGATGACTTT TTCATAGGACTGTGTAGGATAAAG CTTGTGAAATGT CTATCTAACATATT TTGTAATATA TGAAAATT TCAG CTAAA CTTGTTC TATAGCAT CACATGGTTT CAATATAT TATTAGAAAATCATTC CATAAACAATGG TTTGGTATATAGTTTCAACAATTCAA( N ) xTTAATTTAAAAAAAAAGAATTCATTGATGATTCTGTATTTACAAATAT ATTCCTGCATCATTTGTTCATGGTAACCAGGAAATTGCAATGCATATATACATGCTGTCATTTCCAAAGCCGCTGTCT TCCAGAATTGGTACTGATTCTAGTAGACTTTTCATTCTGGTGTTAAAAATATAATTAATATAGGAATGTCTTCTACCT AATAG CTT TTTT CC COA CAACTC C CCAACC CC CT CACTCTAGAGGC CTTAAGTAGTCTAAGTTAAT CAG CTTAGTCTG AAATATAATACTAC CTAGGCATCTTGTTGG CCTAGTATTTCTTACT CTTTTATGGAATGAACTGTG CAATTGTTTTGT GTGGTTAAAAGGTATTCTA(N)xAAATAAAAGGCAATCTAATAAAGTTGTTGCTTTTTTTTTTTCCCAATTCCAGAGT GGACTGATGTACTGGCCCTTTGTACAGGTAAGTTCCACCTACTCAGTAATATGAACTCAGGGCTTTGGTTTCCATGTG GCCCTAACGGACTCTCTCTCTGTTCCAGCTGACCAACTTCAGCCTTGTTCCTGTTCAATGGAGAACAGCTTACGCTGG AGTCTGTGGTTTTCTCTGGGCCACCTTCATCTGTTTTTCCCAGCAGAGTGGTGACGGCACATTCAAGTCAGCTTTCAC CATTTT AT AT AC AAAGG GGACC AGTG CCAC AG AAGGGT AC CCGAAG AAATGAGAAGTC AAGGACT CT CTT AAAG GG AC CACATTTTTTAC CTAAAATG CACAGAATTG C CTGCAGACAAAATATTTGATGTG CCAATTATGCACTTCATTTTGAGG AATTACTACTATTTATAGACCCACTTTTTAAAAAATTATCAATGATTATTTTTGAATTGTATTCAGACTTTTTTCCTG TTCTAGTCTGAAATATTACTTCTCTAATATTTTGGTTAATATGAATAATAGTGGCAAAATGGCATTTTAGAATTATTA ATAT TT CTAATATTTTAACACAAGTTTCAGGAAACTTGGTTTTGAT TGTT CACATTTCTATTCTAAAATC TCAGGTTA TCTC CTGAACAC TTTT GG GCAGAT GAAGTTTTATAC CAAAAAATAGTTCT TAGAGTGAAT TTTAAT TTACATAGAACT CAAT CGAAAT GAAGATTTAATAAC CAGATTTC TTTCTC CAAAACATACACAATT TGTTATTTTAGTAAATAC C ( N ) xC AGTAAGTAC ATTTTTT AT TATC AAAAACAG AG TAGT GT ATGATTGG CGTATT CTGTGT AG AATGTATTTT ATTG ATGT CTTCTATTTTTATAAT TTTTAATGAATGCTTT TTAGTT TTGGGCAGATTCAGTTGACTAAAGCA CCTCATTT CC CAGA TACATGAAATAAAATA CTTGGTTTCTTTTC CAATTT CACA CTGATGTTATTTTG TGAAAATCAGTG CTTTAA GATAAA TATTTATACTTTAAGGTAAACATGAGAAACTTGATCTAATATTTAATATTTATTCAGTTCTACACTTTATTAACTTCT ACACCAGCAGAT TTAAAAATTATGTAACTATC TCAAGAAGTTTCA C T T ( N ) xGTTTCAGTAAATTCCACCTAAGAATT CCAC CAGAGTTC TGTCAT CT CCAATGTCATGTT CCACAGATTTCAAGTTGTGAAGCCCTGAACTGT TAATTTAT CCTG AG AATGTATATT TAAG CT T AATTT AAG ACT AT ATAC CT AAAAAT TG AGCAT ATAATTTGT ATAATTTGTTTATGT AAG TTTCTGTAAGTCATAAATATGTAGTTTCCAAGTGGATAATTTACCTGAATGTAAAAGGCATTAATATATTTTACATTA TTGGGAC CATAGTACAGAAATTTCTAAATGGTTTGTAAAATAAC TTGTTATTTG CTTTGT TGTAAAAGTAGTTAATAC AATG GAAAAATGGTTT CGTAATAAGAAGATA CATTT TAACATCAAAA CGTA(N)xCTCAAAACAAACAAACAAACAAA AAACTGTACCCAAGTTAATTATAAGTACTACCTGGTGCAAAACTTTACAGAAGCTGTGGTATCACTTTTATGATAGAA GAATAG TGTTTG CATT TTGTATAAAAGTACTTGGGG CT GGGCATGAT CGCTTAT( N ) xGGATCTCTCGAGCTCAGGAG GCAGAAGATGAATAAATAAATGGATG CAAC TGAATGAGATGAGGT CT CTCTT GAAGGAGAGAGCAAAAGAGATTTAAA TAATAACAATTATAAGAAGG CTG GGCGCGGTGGCTCATGCTTGTAATC
>H sl7_53361889-53384434
GCTCATCAACTCAGCTGCTC CATAATTAGCTG GTGATGGCTGATTG CCTG CC CCATGGTTGGGAAG CCTGGG CACTGG GCGACTGCAGGGGAGGGGGACCAAGGGATGACTTCTTCCAGGACCTTCTACTCTCTCCAACCAGAGGTGCCCTCTAGC CCTGTAAACTTCTGCATT CTCCAGCACAGTTC CTGAGAGGTTCAGAAGTTAC CTGGGT CCTGGTGCTCTG CTCCTGCT AC CTTT CAGCTC CTGC CC CCAACACC CTGTTCAAATTGG GAACAAGTTGACCACAGGG CACAGTGAATTTGAAAGTCC GGAGATTAAGTCATATGT TGAAAGAGGGCC CAGTGAGCAGGGGC CACTAATC CC CTTT CTTCCT CAflCCTAG GAACCT CCTCCCTTCC CAAGATATAGATATATATAGAT TGAGAC CCTGAG CAATGGAT CT CTTTTGAGGC CAGTGGAGAC CAAG ACCACTGAAGCAAGGAGTCAGGGACTGCCCACAACAGCAAAGCTCCCCAGAGTCCAGGCCACCCCATGGTGATTGTTT GGGTTC CCTGTGATTG CCTCCCATGGCTGCCC TCGTTCATAGAGAGTTGACTGCATTTGC CTGGGTCCCTGTGGGGTG AGAGTATGGAAGTATGTGTATTAGTCACACCTTACACCAGACCAGAAGATATACAGTTTAACCTAGGAGTCAAGCTCC TG CT AT ATTTGG AATGGT CT CAGTTTTGAG ACGTGGTACCTTGG AT CCCAGGTATATC CAAAGT CT TGAG AGTGTCCA TTGAGGACTTTG GTCTGTAGCCAC CAAGAAAGAGTAAACATTTGTG CAGGAT CATGTTTGATGGGATTCCATGGATTG TTTTCATTTCTTAAGCAGCATTCCCCTGAGCCTTTGACATGTCTTTCCATCTCTGGCTGTGTTCTCACTTGCATTTGC CAGGCCTGCAATGCCTCTGCCTTGATACAACACAGTGTGCCAGCAGCTGCAGTCCCTTTTCAGCCAGGGCCATGGGTT TC CTTGGGTC CTTCTCTCTGGCT CA CGTTTTC CAGGGCTGTTGGTGTGTACAGT CTAGAGATCACATTTTATTAACAC AAAGAAATCTGCTTGGGCTGCCTGGCTCTTGCAGTCACGTGGAGAGAAATGTGTTCCTTTGGTCCTTCAGCCTGCAGC ACTCTG TGCAAAATACAG CAGAGAGCTGGG CACTGGGC CAAGCTTACATTTCACTTCGGAACACACTTTC GCAT CACT CCTTG G CAGAACATCCA CGGTGGTGT CTATTTGGGGAGTAAGTTAGAGAAGTGTTGTTTG C CTCTTATGAGTGT CGAG AGATAGGACCTTTAGAAAATGTTTTACTTTTGCGCCTGTCTTCATCTTCTTCGCGGTGCCCAGGTACTGAGGTGATCC AGCTAAATGTGGCAGTGATGCCTGAGAGAAGCCCCTTCCTTCCTCGGGCCACCCTTTTCTCTTGCAAAATCAGTAAAC AGAGCCAGGACTGCCTGGGACAGGGAAACTTAGGCCAAGGTAATAACATCAAAACCAAAGTCCTGGAACTCTTGAGTA AGCTAGTTACCC TCCCAGATAAAACAACTTTGTTGT CCAAATCTTCTTTC CAAAGATCTAAGAC CT TTCTAATG TTGG TCCAAGGCCAGCCATGGAAGGAAGGAACTTCAGCAAGCCGCCTGGAATGGCAAACACTGTTAAAATAAACAAATGGAC CTTTGC CATTTGGTTT TCCCTGACTTGTCTAT TTTTATTT CTTTTCTCCCCC CACCCTTGTACT TTACCTCTCGATAG TACC CC CCTTTCTCCACC CCTACT CC CATT CT CATCTCACTTTT CT CAGGGG CCAAGCTGGCTGTTTTGC CTTCAGTA AAACAAATATCTAATTTCAAAAAGAGGAATTAGGAGAAAGAGGTGGTATAATTCATTTCTTGGGGTAATCTCGCTTCC CTCAGCTCCTCTATTGTCTGTTCTTGTAGGTGGCCACCTGAAGCTTGACTTGCTTTTGAAAATAAAGTTGGTTTTGGA ATTTTAAACCTT C CTTAG GAAAGACTTGAATGGTTAGAAAACAAAGATTTATTG GAATAATAAT GACGTTTC TTTTCT CAATGAAATAGACCCC TCTCATAACAAACGGTTGAAAATAAACT CTTTTT CCCCATCACAC CAAG C CATAAATT CCAT AAGG ATTTCC CAGCGC CAAG CCTT ATTCAG AGT CTT AT AT TT ACTG CTTTGG AATGTT ACTTTTGC CATC AGTATC AT CCAG CCAAGCAACAAC CAAT CAAGACTTACAGAAAC TTTTCCAG CTAGATGCATGTGC CAGGCTGCTTCTTC CTTCCA TGATGGAAGGCCTCCCTTGTCATAGAGGATGCCCAAAATCTCAGGGACTGGGATACAGCCTCCTGCCAGTCTGTGGGT ACAGTCTTTG CT TACT CTGCAGAAG G GAG GTGA CAGAAAC C CAT TC CACC TGTCTCAGAAACCT GC TGGTAATTAACT TT AAAAAT AATG CCAAGT T CTC ACGTGGCTGGTAGATG CATT AG CT CAAT C AGGGGGG AGGCAGGGGAGAAG C ATGCT GCTT CAAAGAGTAATACTGTTGCAAT CACATTAGTCATT C CAGGGGAGGTTAGATAAG GT TATT GAAAAGGTGAAGTC ACCCTGTACATTGAAAGAGGTCTTCTGAAAGTGTTGAGAAATAGTTGAAGTTCAGAAGTTTTATTTGGAGCTTTGCTC CC C CTT CCCCTTGTAAATAAGAC CAGTATTTT TTAAAAGC CACATTTTCTTT GTTTGAT C GATGAACACT CT CATAGT TGATGATTGAGAGT CAT C TACAGC CAGTAT CAAGGGTCAGTT CTGCATTACT CATGTGTAGATTAT CT CTTTG GTATA CTAGCCATGGTCACTGTGTAAGCCCTGGCCAGTTGCCACCCAGCATAACAACACCATAAGGCATCTACCCCCTGCCTG CACTTGGGAAGTGTT CAGGGAATG TAAT TT CATTTTATAATTAAGTGAAACAAT GCACAGTG CATT TC CTTGGTGGAT T CTAACAGCAGGACAGTTATTATT CAGAAAGGACTGTGT C TG GACTACTTTGAGAATCAGTAGTTTAAGTTGTTGCAC CTGT GTTTGGCCTAT C CT CATACCAGTGTAAATACTTCAATC CTGCAAGATTTT CT CATTTTGCATAAAACAGGTGAA GGTTTGTTGAAGT C CATCTTCCAAACTC CCAAAGTACT CTGGATACCTTGGTAG GT CCTTGTGGA CTC CCTGTTGTTG ACTT GCAAGTTT GC CAGAGGTGGGGACTGG G T T T ( N} xGTAACAAAGATACTAATTTCATTAGAAATGGTTTGAAATG AGTG CT TGTGTTGG CA GAATTTTGTC CAACATGGTT GGTCTTTGCAAA GGGT TATGATAGTATGATGCACGCCTTTGG GC CAGATGTGAGTCAGGATGGTCTTGGAGGAGGCGAAGTG CAGATATACC CCTG CTGATAGG CT CCTCATAATACTTC AGTGAGCTGTTTTTCTCGTGACTGTTTCACTA(N)xAGTTTCACTGTTTTCTGATTGGTAAACATTCAAÍK)xTATTC AAGAATGAAGAGTGTTTT CAGAGAAGAGGATTTAGATG CTTACAAATAAAAGTGGATATTGACC CC CCAATTTT CCAC CCTTTAATCACAGAGACATGATTGGAAGTCAGCAGCTCCAGCCTGCATGTTCCCAGATCACCTGTAAGAGTTCTTAAA ACTCAT CCCACAGGTT CTTAATTGGGAT TTTTAACTGGGTAAGTAGAGAAGCT C CATGTTGGGTGTGGGGAAGGGAGG ATGCAGATGG( N)xCATCAACTTATGTTTCTTTGGTTAATAGATGCAGGAGAATGCCTCTTCTCTCCATGTGGCATGT CAGGGTATAGCT T CTT TGAGCACAGGAT TTGGTTTACAGAATGGCATGTCTC CC CATGCCAG CCAGTGATAAGG GG CA CTAAAACAGACTTGGC CT TGCTAAAGGC TT C CAAGCAC CAGG CCATGAGCAGTTGACCCCAC CACAGAGGTATGTACA GCTGGCAGGAAGCTGCCCTTCTGGTTCACTGACTCAAGAGTTGGAGCAGATTGATCCCTGGATTTGCAGTGCCCTCAA TATCGCTGCACAGAATTCATTCTATGTACCCCAGGCCCTATGTGCCCACAGCAGGAAATATAGTGTGAGCCTCTGGCA ATATCCAGAGTTGCTTGAGCCTCCTCACATTTGCTTCTCCTATTCAGTTTAGTGTGCCTCAGTTTACTAATATGCTGG GGAAAATAACACCCCTCTTGCCAAGGTGAGATAGCAGAGTTCTCTGCATAAAATAGTGCATAAAATTATTCCTATGAC AAG G CATTTGAAAAATTACTTTGAGAATGACTTAAAATACAGAACAATTTTTTTAAATGTGG CAAACAT CCTTATT CT TATCAC C CAGAATTAACAATTCAG TACAGTTGCCATATATTCATATATAAATTTAC CTATATATT CATATGTGAAT CT AGACATTGCAGATTCACTTGGGGGTCTCAGGGAACTTTTTATAAAAAGAAATGCATTTAGTTCGTAAGGCCTGATTGA GATAATTCCAAGTCTGTTTTCTCAGCTCTTAATCCTTCCTAGCACTTTGCATTCATTGTATTAATAAATAAGCCTGTT TT CAGC CTAGTTAGAAAAAAAAAGGC CAGCCTCATT CATTAGAATTCTGCGTTC CCTTTGGTTATT CTTTGTTGGAGA GAAGGGGGCTCACTGGTACAGAATTCCAAGATATTTCGCTTTGGAATGTGAAAGGAGTCTCTCTTGACAACCCAAGGA GAACCTGCCTCAGAATGATCTTTCTTCCTCAGTGCTGGAAAATGAAAACCAGACAAATAAGCACAACCAACATTAACA GAAGGC CTTGGTACAG CCT C A A ( N) xTTTTTTTAATTAAGATTTGAGGTGCTATATGTGCCTGAGGAGTTATAGGGGA TG CCG GTGGAACGACCAT TACATGTGGCAG CTGGACTGGGCAACGGCCGATCGCCACTGTGCTCTGAGGAGAGAAGGC AGAGCTTCCCCCTTTATCTGCTGGCTGGTGAAGTCCTGAGAACCGGGCAGTCAGCTAAGCACCAAGTTTTCTCTGACT ACAGAGCTCCTATCAAACGGATGGGTAGAAAAGGAATGTTGTC(N)xTCATGGAGCCCCAACTCAATGGGCTGCTTCT CAGTAGAAGGCTTTGC CT TGTGGAAGGTAGTAGAATGT CTAAñATCAT CAGAACAC CTGTGTGGATGGTACAACACGG TGACAGATATGTAC CAGGGACCTAGATGTGGAGGGAGTGAGGAGGAAAGGAG CAACATATGCACCT CTGGGGAGGC CC TAAG GT CTCCTCAG C CAATTCCTAGGGAATTTAATTTTAAAGGACACAGAGAGG GTAGTTTTTAACACAGGTTTTACT TT TAAGTCATGAGGGATGACTTTC TAAG C CACAGTT CCTT CTGATAACGGTC TTAAAAGAGAAG CACCATGAAGTGAT AAACACATCAGTCCAGGAAGATTTAGAGACAGCCTAGTAGAGGAGGATTGGAAACAGTGTGGCTGTGTCCCTGAGGGT GCTG GTG CTAAACC CACGGCACAGTC CATATCAGTCAAAG CTTCATTCT C CT TGTG CCCACACTATGC TGATCTGC CC CAGT GCAACAG G GAG ACG CATATG GT CAAGAGAATGTGTC TTTCACGAATAGATTT TCTC CTTCGAC CATGGTGAGAA GT C CTT GTAAAGAAGCATACATGAATATATGGATGTGTTGG G CATCAG CTGTGCGTGTGGCCCT CAGC CAGGTGAGTG GAGGAAGTTGGAGG CATT CAGTGTTCACCT CCATGT GCTTTT CAGAGACATACCTG GTGAGCTGTCTG CAGATACACA GCGTAGTGGGAGAGATACGCACGTGAAAGCAGCACAGGGCATGTAGCTAGCACTTGGTAAATATTAGTTTCCTTCCCC TT T CTGTGTGTAAG G CAAGTTGTACGAG CAAAGCAT C CGCAG CTCAGAAT GCAAAAATAGAT CT CCATGTGGGGATGT GG C CTCTTTTCC C CAAGTGTGGAC CAGATGT CAGGC CG CCTTGTGAGTACTCAAGGAATGTTACGG CTTCCTTCTTCA GGACTT CACTGCTT CT CT C CATATGGAGAAGGCTCC CTTT CCATGCACTT CCTG GATCTGAAAAA CAAGAACAAGAAA GAGGATGGCTCTAAAGTCTCAGGAAATGGACCTTTCTATATCCTCCCTTTTAAAAAGCATTTGTATGTAGGTTTTTTA GGTCTTGCTGTCAAGT CG GATACACAGC TGTCATCTGT TATATCGGTGAT TTAATAA CATAATAAAAATTTATAGAGA AC CACAAGCTCATGGATGTTTGTTTT G GAGTGCATTTATT CATCATTAGTATGGGTATTT TTAAT CATTTACTGAG GG CCTAGCAAGGTG CTGGTAATTCAGAGATAAGGAAGATGT CAT CCTTTAAACTTGAGAAGC TTAT GAATGGCGAAGAGA TAT AATTAATTGTG AATGGAGTG T GGTAGATAG CAT CAG GTTGTAT AT GAG GTAGAATC AGG AAAG CAAGAATGGC CA G CTCTGAATAGAGTGAGGAGGGAAGATGGGAGATGATC CTTGAGTTGT GT CTTGAAGAATGAGCAAAAATTATATT CA TGGAAAGAAAGGATACAATATAAGTGACTTGTTGAGTTTTTCTTGGCTGATACTAAGAATGAATTTTCATTAAATATG CACCAGTAATTCACAGGAAATGAT CA GGTCAGAATGTTATG G GACATTGGTCTGAAATATAATGAGAG GGGAAAAAAA CAGAAAATTTAAAACGTTTATTTATGATGG CTAATATT CAGAGGGGCTAACT CCCTCCTGCC CAGATAAGGAGAACTT TG CCGC CTGGCAAGGGAC CTGCGG CTCTG CACTGCCAG GG CAACAGAACCACACGT CTCAAACGTGGTGTTTAAAGAA GATGTTGGAAGGTTAGAGGCGGTGTAATGGATGGGATTTGACAGTTGC TGTGGCAACACACC CACTTC CAGAACAT TA GTGT TAGTGGGAGGGCAGGCAAAC TCGTGGTACTTG CC CTGGGCTGCC CTATGTCC CGTACCAC CTGT TCTATG CCAA AGGTATGAACAAATTCAAAAGGACAAATGCCTCACATGGCTGATGGCTGCCTGCCTTTGTAGAACTGACCCTGGAATA AGATAAGAAGATGAGTAAATCAGAGAAAAGACAACAGGGAAATAGCGTGGAGGAGGAGGGGGAGGCCAGAGCAAGCAG TGTTAACTCCTTTCAGGTCAACATTTCCCAGGGTTATATTAGAATGTATTGACTAACTGGTAGATTTTTATGGGGAAA AAATGCATATGGCATCTGTGTGTCTAAAATGCCCTTGATAGAAGCAGTAGTGTAGAATGCTTTGGGGCATAAATAGTC CTTAGAGTAGCAAAAAGACAGGTTAATG CAAAATTGGTTGGACAGGTTTTG GTTGGAGGC CTGAGT GATTGCCT CCGG AGCACCACCTTCTGGGGAGTCATGGGTATGACATGGTGTGAATATTCCTGAGGTGTGGGTGGTTGGTTTCCTTACAAG GTAGGG TTTGCAAG CT CCTTACAT C CA CAGGGGCTGTTTC CT CTGGAC { N) x GAAATTTAAACATTTTATTACAAAAT TAATATACGATTGTTGTAAAATAGGACTGTATAAAGAGTAACTT GAGTT C CCATATTC CATTTC CAGATT TAACTGTC CACAGTTTCTGCGTACTCTTCCAGACGTTCCCTT TGTGTTTATAAGCATT( N ) xTTACATATGTTCTTATCTATGGCC TTAGATAGAGGGTGTAAAATTAGGAGA CGCAGCTCCCACT GATGATGC CTGGGTCATGTCCTTT CAGATC CAAATTGC AAGATCTCCTTTAATGTGGTTAATTGGGAGAAATTGAATGACCCATTTCTTCTGAAATTTCTGTTAAACTGGAGGTCT TTATTTAGACAAATGTAAACATTAATCCATCTTTATTAAGATAATGTATGATTAAAGCAACAAGGTACCATCTAGGTT CTGTTTGATACTTTTCTTGTTATTCTGTAGTTATTTGTTCATAAACCATTTTTTCTTCTTATAGGCTTTCAAGGAAGA ATTATCTAATTC CTTTTGGTGTGGCCTCAGATTT CATTAG CAGTAGGTAGAGTT TGTAGT TATTGAAAGT CTTT CTCT TATT TCAGTTTGAAATGT TCCCAGGTTGGCACATAGCT TTGGGAACTGGAAGTGTCTCTTTT CC CTTG CT TAGGTAGC CT CAAT AG ATTG AAAG AAAATGGAC ATCTCTT CT TC AG AAAACCTGTT CT AAGAATTATC CATCTTAGGAAATT AGGG TGGTTATTGCTCTTCTTTACTGCCTGCCTGAAAAAAATTTCCCAGCCCTCATTTCTTTCATCCTTTCCTTTGAGGCCA TGACTTGT CACC CT CTG CTGTGTT CCTCTGTC CC CTGAGCAG CTGTAAGGGGTGAGAGTCAC CTA CCCTCACCC CTTA CACT CCCA CCAACCTGAGGACCACGTGT CTTCAT CTAACAGCA(N ) xAGCATTTCTTTCTTTTCCTGTTTGAAGAAAC TGAGTCCTGTCTACTCCCACCCAGCATTTGCCTGGCATATCTTCACAGTTGGTCTCCTTATCAACCAACTGACTCCCA TGATTAGGTCAGATTAGATCCCCTGTTCAGGGGATTTATGTGGGGAACTACACACTGATTGGGACAGGATTTTGTCAA TGTACTCAAATGACAAAGGACTTCAGAGATGTTGGAATATTCTTGGTCTACCCAAAGGTTGCAGACCACGGTGGCATT TC CT TCTT ATAT AGGTCCTG GTGG GC ATGGGCTTGAGAAG CTGAAAGAAG ACGC ATGG AACT CAGAAT CTGATCTTTT GATGATGACTTCATATATATGGTC TGTTTACC CACGAGGCACACAAAC CTTACCTTATGC CTGT GAAATATAAAG GCA GATTTTTGTCCCATGAAGGTTTCTCCATAGACTACATTAAGCAGGAGACAGATGGGTTTAGCTTGGCTAATCTTAGTG CAAGCTGGAAAATTTCAGTTTTTCTGAAACAACATATTTCTAAGAAATTGTCTGGTCTAGAATGTTATTCTAGTGTAT GT TAGGGATCTGTATGA CTGTAAAGCAAGG CTGAATTACTGCAGGCCTATAAATATGTGT GAAAATTTAC CTA CTTCT CC CTA CCAA CATGGAGGT G GAAAGAAGTAG GTTTAATAAACC CCATTTTTATTACGTGTGTAAAT CAGTGGCTGTTTG GC CATTTTTCTC CTATGTAG GACTTGAATTTATATGAGAAAACTTGAG TGTATG C CTTTT CC CAAGTCTT GGCTTGCA TCTGAATGTAAAAATCAGATGCATGCTCAATGAATGTTAATTCAGATTATAGATAGCTATAAACTATATAGGCTGCTC TGTGGTCTTTCCTTTCCCTCTGCCTGCATGACTGAAGACCAAACTGTCACTTCCTTATAATCTAGTTCTTCCCATGAT AC CT CATCACAAAGGTATATAT CAGTTGGT CAATAACAGGACTTGTGCAGTGAATACTTACATGAAGCAGACACTGAA GTGCCCAG CAAGAACTGCAAAGGC TGGCAGTATTAGGAGATTGC TCAC CGTACC CCAAATGAGATGCCAGAGG CATAA AGTC CATTTTGT TTAGGTAC CAGATGAC TCTGTG GAAT TAAC TTGATT TGTC CC CAGTAATC CATATT CT TTAG CATA CACTTCCAGGCAACACTTATTGCTTAGCAACACTTACTTAGTAACCTTTCAGAATGATCATTAACTAATAGGGAGACA GCCTGTGGTGCCGATCGTAGGCTGTTGAGGCTATCTCTACGGGCATTTTCACCTATCCTGCCTACTTCCGTAACCTAT GC CC CACAAAACATACACTCTC CAGACATGAAATTTAAAAATAATGAAAGATGATGCAGGAACT T CAGACACTAGATG CAATAGACAGCAAAAGGAAGCACCTATGTTAGCCTTGGTTCTCAACATACCAAGATGCTCTTATGAACACTTCTAACA AAATGTTCCCAAGGAGAACACAAAGCAGGCACCTAACATCCAAGAAGCATGTCATCACCTCAGCATGCCTGCCACACG T C ( N ) xCAGTCATATTGTTCCTCATTTGC(N)xTGCTTGG CCTT CCTCATTTGTTTAACAGATATAGTAATGCAGCCA CTTGATTAGGAGGCACAGTGACCGCTCTTGAGGGATCAACAGTGTAACCAGGGAAACATCTCTTTGGTCAGTCCCGTT TCACTAGC CTGT T CAAGAAC( N)xGTGAATCTTGCTCTCTCCTCCTCTTTCCCCATGTTCAGTTGGCAACAATTCATG TTGC CCCCATCT CATTAATAACTC TTGAAT CCAC CTAGTC CT CTGCTGTTGATGTAGGTGAG CT CTTACCATTC CTGG TT C C CCTGTAAG CCAAGCT C TG CCTGGTTG CCA CAACAAT CTTT CTGAAACGAGTCAT CTGATCAAGT GAGTCCTTTG CT CTAATTGCAT CGATAGTTCCCCGTTCTCTT GAGGATAGAACC TGAACATCTAAG( N) xGT CTTAATCAACAAAGAC ATAGTTTATCAT TTTCTGTGTAAGGCAGTGGCATGGCCTTTG CC TGGAAAGATTTGCAAT CT CGTTGATTAAAAATTA TAGAGTTACTTAAT CTGATAAAAAAATTAT CAGTATAGACAT TGTGAGAGAAGTTAGGAGGAAT TGTC GC CAA CATTG GTATAGTCAGGAGTCGGAGGGCTTGAATTAGACCTTGATATGGATGTTAGGACAAATAGAAGTGGCGAAGACGCAAAG CAAGATTGTCTAGAAACAAAGCAGA CCATTGG CTTAAG CAGAGAGCAT GAGATAG GAACAATATGCCT CC CTGTGGAG GTCTCTCCC CAGTAGACAAAGATTTCAGA C CAG GAGGCAATGGGAAGC CATT GG CTTC CTTTGAT CATGGTGGCG(N) xTTCCTCATGACTGCACCTGTCTTGTGACAGGGATCTGAAACACAAAGGAAAGCCTCCACTGCACTCCA(N)xATCTC AAGGTTCC CATAACTGATGG GGTCTTTGGATT CCAATC CTTGTCTCATTTTCC CAACCTG CCTGGGAAGTGGGTGGTT AAAGTTTAAAGC CTGGAAGAATG G GGGAAATAGT TCACAG GCATGAGGTGAC CTGCAG GAGAGG CTGATGGTCTATAG GAGATTG GAGGATGAGTT TGGCTTAAACATGT TGATTTAGAAATGACAGAGGGATTGGAG GCAATGAC CAGCAG TCCA CAGATCAGTTTAGGACTCTGTGAACGTTAGTG GAAGAGATTA CAGAGAACACTGGCTAG GAG CCAAGG CAAGAAG GCT TTG GAAAG G CATGTTTG C CTTTGAGTGCTTAACT TAGAGACT CATGCG GGAAAGAATCTGTGGAT CTCTTTGAGAAGC TG CAGAACAGCAGATACA CCTGAATCCTATTCATGTTCAT CCAGAAAC G CAGACTTTAAAACAACCTTGTTATT CTTC CTTT GTTTTGTTTCGTTTGGTTTT AGGGGGTTGG AGGGGAGG GAAGGG TGTC AC CGGGGG AG CATATG GGTC ATTTGT TTGCAGTT CACAGGGGTT CTTAGTGAACAT CAG CGAATATTGTCGCTGATTT CGTCTC CTGCTT CACT CT CTTTTCTA AAAACAAAATAT TTAATGTAGC CCAAAGGAGAATAGTCAG TTTAATAA CTAG CCATGG CAGCTCTTTGAAAACTGCAG GTTTCAAATAACTAGGCCTGCACATTTTCACCAACTCAGACAATTAAAAAGACTCCAGGGACCTAGGTGGTGCTGATT CAAGTGGCGCAT ATTGTC ATTT AAAATG GAAAGT CT ATTT CTGAAAAT AG ACTC CAGGT ATGGG AGAACTTGTTTCTC TG CTGGAGAAGGTAGCCCAT CCTAGGGG GACC GAGGTAAATTGCTTAGTGGC TCAACATC TACATAGT TAAAATGGGC CCATTTG (N ) xGCTATAGATAGGCAGCATGTCATTGGACTAGGTCCGCCAGATGCAGAATGTTGACAAAAGCAGAAGT CCCCTCCCCTCAGACTCCTCTGGGGAAAGCTGGCAGCCTTGACCACCATCACCGCTCTGCACACCATAGCAGGGATAT GAGAAGGGGCTGAGGTGAGCATTGGACTGGAGGCTCCTCCAGCTGTCTCTTGTTGCCTGTCCTCTAGTGTGTAGAGTT GCAGTGTG GCTT GAAATGGT CAT CACACCCAC CAT CAC CATGG CAGCCCTGGGAGGAGGGACTGGCTGCTACTCTCTC AAGAAGGGATAACTGTTGCACAAAACACTGCAGACTTTGAGCTGGCCAGCAGTTGACCAGCTGACCTAACCTCCCGTT TTCCAGGTCTTAACCTACCATCCATTCCCTGCAAGTACAGAGGAAACTAGAAAGCTACCCTTCTCCCACCATAGTTCT GTAACTTTGGGATGGAGTGAGAAG CATG G CAT( N ) xATATGAGTGTGGGAAAGGGGAAGCCATTGCCTCTTGTTTCCC AT TCGT GAC CATCC CTACC CACTT CAGTGGGCCCAGTGTTGC CAGACCTGAAATATAC CTCATGTGGGTGTTTGTC TC TGGTACTCTGGGTT CATTT CTGGT C ACC CT ATGATATTT CCTTG CCCT AAGGT ACTGG CACAGCTAT CGGGTAGGAGG AGG C CAAG GTAGAAGG CACAGAGC CATTCTTGCCTCACCTTCTGCCACACGGTGACCC CACATGTCTGGGGAGAGG CT CCCGGCCCAGGGAGGAGGAGGTGGAAGGTGCACTGTCATAACCATCACAGGGGGCAATGGCGAGAAGGCTCATGGGCC ACGTGCCT CGGCCATTAGTT GT CATGTCTGTTGTAC C C CAACTTAACTGACTTCTAAC CGGCTG TGGTTATGAGTT CC GCTGGTCTCACCTTTCAGTGGGTT CAAGGG GGCAGATTTTG CTGA CAC CATAG GCAAGGAAG CATGATG GTAAACACC ACTG GAATGAGAGG CCTTGT TAATCCAGCCCAAGCTCC TGCCAAGTTAGAAG CTGAGGAGTCTfi GTCTGGGCCACTTT GAGTACTGATGCATGATCCCAGGGCGATGTGCTGTTCTGGGCACAGGGTCACACTGGGTGACAT CTAACAGG CTTTGC TTGGGCTGTAGCTAGAGGGAAATGGGAGCTGGGGAGTGTGACAGGAGGGAAGTGTGGACTCTCCACCTACCAAGGGGC CT T C CCACATACTGTC CAT C CCATTCAG CG CCCTGAGGTGGG CGTCATTTCT CCCACTTGACAGATAAGGGAATCAGA CCAT GAG G CAGAGTAACTGG CC CCAGG C CACACACAACTA{ N ) xACACCTTCTGATGAACTGAGCTACCTCGTGCAGG ACAGGCCCAAGGAAAGGGGCTCAGGCCAGAGTGGGAGCATTGAAACAGGGTTCAAAATGTCTGTCTGCCCAAACCAAA CGTTAATTTG CATACTAATAGTTTAAGGGCATAGATATAATACAGACCTC CC CCAAGAAGTGAC CTTTGAAG CCAACT TCTTCTTCATGTGACGGAATACATCAGACACCCTGTCTGCCGGGAATCCTTACCCTTCTCCTAGTGAGCCGTGGTCTC TGGCCTAT CCAGAAGC TCCTTAAAAATACTAGTG CATGAGATGTAGCATT GACAGGAG CCCTGG CCTGTGGGTCAACA GTTGCTTTGT CAAATG CAAT CTTGCCCTTGGCACAGCT CGCCATTCAGGAGGATGGAGGAAGTGGACGAGAGAGAG GG AG GTAAAAGAAGGGAGGGAAAGGAAGCAGGATGC CTGGCTGCCTGGAGCTTCCTTGGGCTCTTGTC CATGAGAGGAAG CCATACCTACCTCTCTGGGGTTTGC CAACT CTTGTC CAGGAACC TATT GG GCTAGAGTG GAGGGGAACTG CAGAGT CT CCCCTTCCTTTCTC TT CTGC CC C C A T ( N) xGCTGATGGTTCTGTGGAGTGTTGTGCACAAGGCTCCAGCCCAGGTCCT GATG CAGGACTCTCATTCTGGAAG GCTGGGTGCCTGGCTCTCCTTGGGGCA CAATTAGGCTT CCTGGCTG CATTCCTG GGGCATTCGCATGAA (N) xCTCTGAAGAACCTTñCTATCAGTATTACTGATATTACCAGAGA { N ) xTTGCTGATATTG ATTATT CAGTAGGC CAG GGAAG CC CGTCTAC CAG CCTACAGGAC GTCATTGCATT CAC CTTCTCAGTAAATGGCAACA GACTCTTTTC CTAATG TTTACACGTAATGGATTGAACTGGCTTAGCAT CATG CTTTACCCATATTT CTGTTGACAG CC TGTT ACTGTGTGGAGAGAATTG CTGG AT AAGAGC CACT GAGATCAAAGGG AT CTTT AC AATG GG CCTTT C AGTATATT TGTT CTGTAAT CATGGA CTGAAAACTTACTGAGCAC TTGCCATCTGAATGAG GCAAAAAGGAA(N ) xTATGTGTATGC ATGATTTGTTTTTGCCTTAAAATTAGTACATCAACCGTGTAATGTGCAGGACATTTAAAATATGTCAGAGCTATGACA GGTTCCCAGCTACTGGGATCCAGGACACCTTCCAATTTCATCTTTGGCTAAGACTTTCTATCTCTTTTTTCTTCTTCT T T C T T T C T { N ) xGCCCAATACACTTCTGACCTCAGGGCATAGATGCATTGAAGGGCATCTGCTGCTGTGGCCTTTTGG ATG GGAAC CCTGTAGAT CTCAG CTAGGT TGGTTTAT CAG CTATAATGTAT TC CTTGATTTGGTAAAACAG CCAGCAGC AGAG CAACATGCTTTTACCT CCTCTTTTTTTTTTT
>Hsl8_69612722-69643198
GAAT CT GAAAAGAAAAATCTTTTC TGAGTAGATAGTATTTTAGG TGTGAAAAGGAATATTAAAACG CAGOAATGTGGC AATCTG G (N ) xTACTTAAGATAAGTGAAACACAGTATTTAAGAAATAACATCTCCAGAAACTTAATGGTACATATACA AT AG G GTGTCTAAACC ACGT AAAATTATGATT ATGG CC CCTC CGTCTTTCT C TCTC TC AT AT CATGGTATGT TCTG AT TAATACATAT CTCCTTTAT CACTTTTTTTTTAGCTT CAAATATT CCTC CT CTAAAAAGTTCTTG CC CTCACTAGGACC AGCTACCTCATGTC TTTGTGATTACAGATTGTTTTAAAATTTAT CTGGCCTAATAT TCGGTTAGTACATT CTAATTTA GTCTATGGGTGTTATAGTTCATCACTTGAGACAGAGCATAATTGGGTAGCAATAACTCAGACACTCACCCCAATCTAA AGGGTT CTAAAGAAGAAAAATC CC CATAGT CTAAATATTAAAAC CTAAGACACACACTATAC CAGGAGTAAT CGAATT CTT CTGTATTAGAAGTT CCAGAC C TTAG CAAATATC C CTAATGCAAATATTG CAGAGTAAAATGACTGT CAAAGTTAC TGATGAG G CATTTC CGATCATAGC TTTAGCTCTGAATATGTTAG<N)xTATAAACATGAAAGTGCCT(N)xTCAGTTT GCTATC C CAAG CA CAAAGAACAATTTAAGC TTA CAAAATTGATTAATATACACCCT GAATAAAT TTATGT CAAGTT CC TTCCTGTAATATAATCTTGTTTTTCTACAACTGAAAAATGAGTCTTGAAGGAAAGTAAACTAAAAGCTGAAAAAGCAC AATTTCTATTTTTA CTTTGAGGTATAAA TTGCCTTTGT CATTTTAAAATATG C CAAGTTAAATA TTATGATAAATG CA TAGG CTTTT CAGA CACATATTT TAAAACATATTTAAGTATGTAAGTTT TC CCAACATG TTAAGTAGAAC CAAATAG CA GCCAAATTCCAAGTACACGATTACAGCTGAATGTAGAAAGCTGTGAAGAATTTAGTAACGTATAGAAAACCCGGTCCT CAGTTTAGC CAAATAC TCTAAT TTGGAAGTGCTAAATC TGAT GTAATT CCAAACCTTT GCTAAAAGAGAC CAAGACAT GACATGAGACAAAGCTATCTTTGGTCTTCTACAGTTTATGCAAAGTCAAAAATTTTTTCTATCTGCAAATTTAATTTC CATAGCTTGATAAACTCATGTCTACCTAATACGTATTTTCATCAAGATAGTTACTTTGATTTAAGGTAATATATTTTA CC CTAGGTTT ATTT ATTATT AAG CATTT AAAATTGTGTTTTAAAAATCTGTGGCACTA GTTT CACT AGAATTTT ATTG ATTCTGTT ATTCTATT AGTC AAGATGCAGGTATT TG AGTGATTG CCTC AATCTTAT CT CT ACTATT CGCC CTTT ATTT TATCATGAAATTTTGCAGCAAAATGGAT TGTCTG CAACACCAATACTA TGGCAGAAATATAAAA TT CATAAA CCCT CT TAAATTAGGAAAAGAAAAGGAAAAGAGGAAGGAAGAGGGGAAGAAAAAAAGGAGAACAAGAAGAGGAAG(N)xTAACA CAT AGG CAT CATGG ATGGCATT CTGTGT ACGATGT AT ATGAAAG GACTTGTTTCTTTGT ATGTGTG AAAGGACTTG CT TCCTAGCT CATACT AC AATTGC AATGGTTTTTTG AG GAAAAAACTAACTAAT AGATTAAACATAGAGTAT AG ATTAAA CATAGAGTGTAAAATAGTGAGATGGAAAGATAACATTATAGACATACATATAATTATAGGGGAAAATGCCAAATAGAT AATATAGTATTTTC CACCTCTGAAATATATTTATAT CT CTGAGATGG ACACACATG TT CAGTTT TATGTAAAGGAAAC ACAGGATATTTAATAACAGTTTTG ACTT AACAAGAAAATGAG AATG AATGTCATTC CC C AGGTGT C ATTCTGGCTG CC TCAGACTGTTCTCGGC CACC CAT CAGTC CTGATT TTAGTGCAGATAATGT TGACATAGGCGTGTGGA CAAAC CATT GT AT CAC CTATG AGGG AT CT AACAGAGTGT AATCGATAAAGTGT AGTCTC AAGACAAT TATTGT AAATTTAAGC TATGTA CAG G AC AAAAAAGGTAAATACT AT ATAC AAGTGAGTG C AACTTAAAAATAAATGTGTTTC AATG AG AAAAG C A C AAAT TTAAAATTGTAGGAAGAAGAAAAAATTTTGTCAATATTAAAATTTTTTCATTAACAGGAAAAGTATCTTCACAAATAA ACATTATTAATGGATG CAATATAGAATATACTTAAATATTTATG CCCTAT TTGGAATGAAA CAAT C TTTAAAAATATT TTTAACTTGATGTCTGGTATTCTTTTACACCGATTTGACCCCACATATTTAGAAATTGCCATATTCTATAGAGAACCA AT TTTTATTAATAATT CTAT CATTAATATCTTTGCT CAAATGTT CATTTAAATCTATT ATTTTAT CA CATTATTTAAA ACATCTATTATCTGATGCTGTATTAATGGATATAATTATAATGAATAATATTAACAGATAATATATAGATGATACTAA AC C A G (N ) xAGATTAATATTATTAATCTAGATAATATTTATCTGGATCCTATTAATGAGAATATATAAGTAATATCTT TTAT CATATGTAAAATAATTT CAAAAGAGTA CAATTT CAGGGTAA (N ) x T CAACC CTCTTCT CGTTATTG GCTAAAAA TGACATTTCCCTTTTATTTGAATAATGTTCTGTATTTTTCCCATCATAACTTCTCAGAACAGCAGTGAAAATCCATGT TACTGTGTTTGTTT CTTAAAC CAT TTAAGGAATTAGTT TGTTTTTTATTTAGAAAGTCAGTT TTTACCAATT CTGATT ATAATGATGTTTTAAAATAAATGTTTTAAGAACTTAAAACATTAAACTACATTGAGCTAATATAACTTCAAAAATAAC ATGCTC TTTGT AGAAC C AGAGAC CTTC AGAGAG C AACT C ATG CTGTTT ACTTGATTTT GGTTATTGT CTATAAT AAGT GAAGTGATGC CTAATC CCTT CCTATTTCTAAAGGAG GATATAAATATAAAAGACTC TTTGATTAGGAGGAAT CAGACA AG AG ATTTTTTAGG AAAC AT AAAAATAT CAGCTG ATGT ACAATC CT AG CATTT ATTTT ACTGTT TCTTAATG TGTCTC TTTGTC CAAT CCTCCCATGGCATCAGAT CCTACAGCAGAAAGTG CTTA TCAAACTG TC CAGGGG CCACTG CATCCCTG GGCTTATTACAATGCCAGTATTGGCATCGTCTGATTTCCATCTCTATATTTGCCATATTAACAAGTTGACTGCCAGAG TAGTACTTAAATGAGT TTATTAAATGCATTTTATTTTTGAAG CAGTAT TCTATAGT TTTAGAGGTGACCCAC TTTGTG TT TAAATT TCAACACATTTT TTG CAAAGATTGGT TT TATAACTCAGAGAAGAATCACAAGCACTGTGGTAACATAAAA CAGGATGATTTTTT TT CCAAGTGT C CAACATGAGAGAGTGTTTAAATGTACTGTAAACAGATTT GCTGAATAAACG GA ATTGAAAAGCACTCCCATGGGATGACAGACAGAACAGGGTGAAGAAAATCAGCCCAATAATGGAAACTGAGAAGCAAA TGTATAAAACCAGAAAAACATTATGAGATAAAAGTCAGAGCATGTAGCTCTGCGGAGACATGGTGGAAAGGGTCACTC CTCCAGCTACAGCTTTTTTCTTAAAGCCTGTTTAGTGCACTACTGGTAGAGCCACTGTCCAGGGGCCACTGCATCCCC AGGCTTAT TACAATGC CAGTAT CAGCAT CATCTTATTT C CAT CT CTGTACTTG CCATACTTAACAAGTTGAC TGCCAC AGTAGT GCTTAAAAGAGTTTATTAACTG CATTTTAT TTTTGAAG CATTATTGTATAGTTTTAGAGGTGAC CCTCTGTT TGTTTAAATTT CAACAñTTT TTTTTTTTTTTTTG CAAAGATTGGTTTTAT CACCCAGAGAAGAAT CACAAGCACTCTG GTAACATAAAACAGGATAATTTCTGTTATCTCTCATGGCTGCTATTTACATGGCATTTCTTTTTTATTTTTATCTTTG AATTTAAAGTTGCTGCTGCTTGTTTGTTTTACGGATTTGGCTGCACGAGTCCAGTGCATCCCCAGATGCCCCTGCTCC AATTACAC CC CTGGAT T CGAAC CATTTAGTGAT CAGCCACTGATTGGGCT TAAGT C CCATGAGT CAGTAAGGTTTC CA CCTTTAACGTTGGATCTAGGCATGGCCTGGACACCACCCCACAGTTCAGGGGGTTCGTGACTTTATGCATTATTCAGG CAGGAACTGGG GAATCGCAGAT T CTGGAGATCTGACTC T G T T T T { N ) xTTGACTTTCCCTTGTTACATCTCTTGATTC ACTTTTCCTCATTTCTGGGACTGTTTTTGTTTGTTTGGTCATTTGACTGTTTAGTAACTTGGCTGTGGAATGTTTTCC TTGAAGTTTGTATT CT CTGCT ATTTATG CT CAGATGTTTGCCTT TATT TT AT CTTG CATT CTGG CTTCTTAGACATTG CTAATG AATC AACATAAGCC ATGT ATGGTT CAAGTTTT ATGATT AGTC CC CCTTAGCT AGGG AG AT TTTC AC CCTTTT AC ATTG AATGTACGTG CAGCTTAGAGACAG CTTC CACAGTTC AG AAAGGTTACGGTTTTAAC CTTC ATTG AGCCAAGA ACAGTGACTTCAGAGGTCCTTCTCTGGTCTTTTCTCCAAGGTAGTAGTTTGGAAACATACACTGTTTTCCAAAAAACC AG GAATTAGTGTGATTTT ATGTTAAATC TAG CTTTC CAG ATGGTA C AC CTGG CTCAGAGTAG CTTATGGCTC AGCTGG TGTTTTAAAAGAAGTTGTGTGTAAG CAC CT CAAT CCAGTAAG CATTCTAGATTTTT CTGATGAAG C TGTGTAAATTGC AGAATG AATT CAAGTCTGC C CTGT TTTT CATGGTGCATGGTC CCGAGAGGTATAGTACAGTG CATGTACACAGCCT CC TGGACCTC AGGAAAAAGTTG G G (N ) xGTGTTTTT TC TATTAGTGTCTGTATT AAAC TT AC AAGT CT ACCATT TC AT AT GATG CAAT T CATATGGAGAATTAC CAATTTTCTCTTTATTGC CAGCTACGAAGATC CT CCATATAAA CTCCTCCAGGC TTAGTACTACATAAAGTCAGTCCCTTAAGGCTATACTATGAAGCTCTCCATCCTTATAGCCCACATTTTTACCCTTAG GAAGAAC(N)xTATTTTGCCCAAATACATTTTCAATATGTC(N)xTTTCTATAGCTATCATATCATTTCAGTTAGAAT GATCTTGAGGTATTTGATATTATTGTAAGTGGTGTAATTTCTTTAGTAACGTTTTCTGTCTCTTGCTGATATAAAACA TGTATAAACTTTTCAATATTAAT C TTATAT CAAACAT CATCTTAAATT GTTTTATTATTTAAGATAAAAAAT TGAATT GT CTAATATAAACAAT CAGGTGAT C CAGATTTAT CTTCTTTAATTTTATTAT CTAT GAA CCATAGAAAAACC{ N ) xAG AAAAAT CTTG AGT C AAAATG AT ACTGGATATCTTTG CC ATATTTGTCC AC TT AAG G GAATTTTCTAATATTTTTCCTA TATAAGATGGTAAGTTAATATTTATGCTTAAATAACTTTATTGAGTTAGCAAGTGTTGTGTTACTAATGTATGTTGAA TT TT AT CAAG G GATTG AAAATT T C AATT CATO AT TTTAAAñATT CAAT CT TGTAAT AT G GTG AATT A CAT TT ATflAAT TT TCTT TTATTAAATT CTACTTAC CCTT TGGGGATAAACACGAAATGATATAGACTTAGACATAAAGTTTGTTAATAT TT CG CT TATAAATC TTATACCCGTñTTTATCTTTTAATAGTTTTGTTGTG CTTATG CT G GCCTCATATTAAGACTTAA ATTATATTCCTCATTTTGTATTACTTATATAATATTGGGAGGACTTTCTATTTTTAACTAAAAGAACTATTCAGTAAA ACCAACTGTG CCTAGT GTCCATTTTTAGAAAAAAAT CACCCTTGTTTTAGTTTTCAGCAGTTTCATAAATAAGCTT TT ATATTTTCAAAGTTTACAAAATTTGTGT CATATG AT(N )xTATATAC TCCCATAAG CCTA( N ) xCCCTATGTTTGTTC AACTATTTAAATAGAATATA ( N ) xTGAGGC CAAG CAAATCTGTG CTATTAGT GATC TGCTTTATTT TTGTGAATGTAA AGAGTGGTAATTGGGTATGCAGAATGTGACCTTTTATTGTCTTAGGAAAGTTCTACTTATTCACTAGGTTAAGATTAC ACAGTTTCAATCATTTTACAATAATCATTGTAAAAACAAACAAAT CTC CAAATAATTAAATTGCAATGTG GAAGACAG CCTTGAAATAATCACAAGTATTCAAATCACAGGCCTTAAAAAATGAAGGAATATGACATAAAAGACGTAAGCATAAAA TCTG CTATAAGAAAAATGGT GTCC CTAAAGAAAATACTTGAACTGAACAGGCAAGAAT CTGAG GGATGATATGAAACA AC AACAAC AAAAAC AAAAC C ATGG GCCT AG AC AATATTGAAAAAGAATGC AG TGATGT CTTTACATT CAAGATTTAAG AAAATTTC CAATAACTATTCATATAGTTATG(N) xGTGGAAGAAAATAAAAAATGGCATCAGATATTTCTATTTTGAC AATATTAAAATATATT GAGAGTATA CAGAGATTAAAAT GAAGAAGTTGG GGTA CAAAATAT CAATAGTAATTTATGTT GTAACATACATGTGATATGTA CATTGTTAGATATGT GAAACT TGGGAATGTACCC CAACTATTT C CT CAT GC TGAGGA
a a a a a t g c a c a c a a a t a g a a a a a a g a a c g c t t g a a t a t t t a c t c t t g c t c t a t a a a g g t t c (n ) x CCAGt g t a t t a t c ACTAATTT TGAGCAATG GAAACTAAATGTCTAAT GAAATTAATTTGG GTACACAAATTACAG CACAT CTG CAAGATAA CAAAGCTTGTATCAATTTAAAATGCTTATGAAATAATAATATGCACGGTATATCAGGTATAAAGTATACTTCAAAACA GTACTGTGGTTTAATTCCAATTTAAAAGAGGAAAAAAATAAATGTCTGG(N)xCACATGGTCTTTAATGGATAGATAC TATTTTTG TTAAATGTAGAAATAAATAT CTAT TTAAAAGAAAATATTCAAAATT GTAAAT CAACTATACT GTAATACA TTAGA CTGAAGAAACTTC CAGAAAAGAAGCTAAACAAAACGC CT C (N ) x CGTA CTTTGTAAGATGAAGAT GTTT T CAG ATTTA CATATTTTAATTT CCTGAGTGCTGCAATC CATCAACTTACTTG CC AAAATATACAGATGT CCTT ACATTGCAG TAGATTACTTTACATTTTATTTTCTCCAATTTGTATGGAGTCACAAATCAAAAAGTTTGCAACACAGTTTTGATTAAT TAAGACTCAATAGAAGCTGTTT C CTGTTTTAGGTTTGACAGTAATTTAGC GTAATATAAGTG GCATGG CATTAAACTA CATTGCTGAACAGAATTCACTTCCATTAACAAGCTCCATTAGAGAAACATAGCTGTCTACAAATCAACCATTTTGTGT CAGT TTCAGTAA TTACGGTT TGGGGATAAT TGGTGTAATA TCACACCTTTTCTTATAGTGCT TGAATT GT TTGG CTAT GCAAGGCAAGGCATACTGTTGAATAAAG TATAAG CTAAAT GGAATTTTAAATTATTTG TATCAGAGACTG CTACTCTG AGAAGAAAAAAG CCAATTGCTT TATGAAATTCATGACTGCTATTTTAACT CTGAATGAATAATATATTTATTAACAGA CAATTTAAGGTAAATTTAAAAAATGCATATAAATATTAACAAAAATTTACCTTTCTTAAGACATGGTGATTTGCAATG TCATACTC CATTAAGAAAAACAAATGTTGTACTTGTGT TATCTATTATAAGGGTATATTT TTACTCCACTAAATTAAA AAAC ( N ) xGCAGAAATGGACTG TG TACCTG CT TCTAAAAG TGTGTCTAAGTTTAATACTGGGAT TAACACATACTAAC ATTGTGGAAAAT CTG CAT CACAGAGAGCAGTTATATGTTGTTTG CTCTAT CATTTAATAAATAAGATT CTTCAGTGCA TATAAAGTGCTCTT CCTAAG CAATTTCCT CTCATAATT CAGCTGTGTGTTTG CCTTATTG CTAAACTAATTATACCAA ACAGAATTGTGATTACTAATAAAACTACTGGAAGCTTGTGATTTATGATTAAATGTAAAAATATTATTACCTCATTTC TAAAAAGTGTAATTTTTGACTAT CAATG CG CT GT CAAATT TCTAATTCATATTCTTTC CATTGTAAATAAAACCGGCC ATTGTATC AGAC AT ACTAGATG AC TCTATTTGAG CTTG AAATG G AGAT AT AGTATAGT AAAT AG CTTTGG CC AG ATTA TTAT CCTGAGTTAGTTGT CATTTACATATTAAAAAAAAATTATGTCACAATTTAAAATTATAATGTAT CC CTTTTTTC CT TT CATG TCTAGCTATTGGAGTAGCTATAAT CGCATTCTTGC CAAAATAATTATGAATAGG( N ) xGTCATTTAAAAA AATACTTAGTCCTTGTATCAGCAATTGACTGTGTAAAATTGAATTAATACTATTATGCTAAATGTTAAATATCATCAA AT AAAGCTGCCAT CAATAAGAGACATAATTTTGTGTGCAAAG CAAGAG CAAAATAATG CCAC CAATTAAAACATGA{ N ) xGAAGATATTTATCATTTCAAAAATGGCTGCAAAACAAT(N)xAGTATAGAGATCCAGTCATTCTTTTCCTTTTTCT TTTTTTTGATTAATGAAAT(N)xTAAGCTGAGCCACTGGTGCCTATCAATGTATATCTAAAGAAGGTATTGATCTTTG AAAGAGAGGCAATAGATTTG CT C CAAAACC CTAGTGAAAT TTAAAGGG{ N ) xTTTATCTTTTTGGGAAAATAACCAGA AATGTAATTCTTCAATGTTATAA(N)xACATTTCTTATGATAAAAAATC(N)xTTTTAAAAATTCTATTATGTCTACA TT TTTGAAT CAT TCATACTTAAGC CTTTTTACTG GTTAGC CTAG GATT CACAATATCATT TGAATTAAACTTGT GTTA TCTT CAAGTATT ATTATT T C AT TTGAC ATATAGG AAAC TTTG AAGAGTTTGTTC CTGAGATCTAAATTGG ATGGTTTA TTCTTTTGTTTTCACATA( N ) xGCTTCACGTTCCTCTGATGAGAATTTACATATAGGGCTATGTCTCCCTGACTTTAG AAAC TATTATCAGT CCACAGAGGCTTGGATATCACTTGTCATCCATCCTACATCTGACCCCAAAACCCCTCCTTAATC TG CTTGAGCCCTGG CATC CCATAATGAAGCATG CTCCTTTGACTGATC CCAGTGATCCGTTT CATGTTATGATGTCTT ATG C CACACTCTTG CATGTGTG CTCTGTAC CCTGTAG GGTTCAGGGTCTGATGCTTAACACTAGG CAG CT CACCTGTG CATATTGT CTTCTCATTCTCATTGGTTC CC CA CACCACAAGC CAAATTATGT CTTCTAAACAGA TGCCCTCCTCACCT TG CTTTAGGTGAGTAATAGCTCTC T C T C TC T T( N ) xTCAGCAAAATGGAAACCTGTTTCAACATCTGCTTTTTTAATG CACT CTCATTTT CAGGGAGTGG( N ) xCTGGTTGATTATTCAGTAGACTAGACACAGCTGATGAAAAAGTAAGCACGGG GT CACATTAACAGTAGAGGAAG CAAACAAATATGAAATTAAAGCAAACAATTGCTAAGAACTACAAAT CGTTCCTTGG TTAT CTATAATG GG(KT>xATTCCTGAAGATAGCATGGATTTCTTGAGGACCAATTTAAATCTAAGGAAACCTAGCAAA AATTAAATAGAT CAAATTAGTGTT TGTAAC CAGAAAAAGC TGTG GCTGAGAAAACTGATAAATAATTT CAGGGG CAGC AATTTTTACTCT CATGTGTAGCTTTAGGAGAT GACATAAGTAAACTCGGAAACAGGCAAC TT CCTAAG GACTG G GGAA AATG GTTAGGAT CCTG AACT GT CT AAG AAT AT AT T C AGTG ATAGGTGAGATTTAGTCAG AAAAT AGAAA C AAGGGTAG GG C CT ATAATGAGGTGG G AACT AAAT CCAT ATAT GACC CAAT AATTAAATTAG C CTTC AG AG CTTGTT AT ACGGTATT AAC C CTTGA CTC CTTTTTAACTAGAAAATAAAT C C CTCATATAATGATATTGATGTTCAGTG CT CTAATTATAGAAAT AGTGTTTGGAAGTTGGAAGCAAGATTATGTTTGGATTACAGCCTTTAGGTAAAAGGCTAATTACCTTTACTTAATTAA TTAATTC CATCAGT CATG CAAT CAATAAAT CAGTTATCTACTTACTATATATTTATTGAATCTC CCTTACTTGT CTGG TG CTGGGCTTTC CTTTTGAT TATTAGGGTAGGGAAAGATTTCTACCTCTAAT CC TAAGGGGAAATGAAGC CTATAAAC ATAAATTTT CTATTTTAAA CTT TGTAACAATACAGCCATGTACT TTGCTGAAAG CTAATT TTGCATAACAGTGACACA AAAGTGGATTTTAGTACACATG CCATTTAT CAAC CTATAAACAATGCCTT TAACATAAAACAAAAGCC CT TTATTTGG ATGC AC AC ATTG CC AAAAAT CT TATAGACTATTAT AAG ATTT ATGATAGCTTTC ACAT CATT CTTTTTTC TGGCATCC ACTTGAGTGCATTAACTTTGAAAAGAAACTATTTGGCTTGAATTGTGAATATGACTTCAAACAATAAAAGCAGAGGAA TCTAT AGAGCAG CTGCTTTCTTGTATAT CAAT A C ACAAGTGTAT ATAGTTTTAT ATAT AG AAC CATATCCCTGTGCTT TTAGGTAG CTGCAAGTTCACTTTACATT CATC TGAAACACAñ GAAGGG CTGTGTG GCAAATTTAGGTGTTAGAAAATT GTTAAGAATTTGTTGACACAGTGACAAAAAAACACAATGAACTCACTATTATGTGTTTATTACACTGATAAATGTGAA TTCTATAT CTGATAAAAT GATT TTTCAGAACT CAAATT TAATTACAAAG CAAAAGCTGGGAAGCTTGGTATATGATTT TCAAGGAGACAATACAGATATT CTAAAAATAATTAAAACATTATTGTGGT TT CATCAGAAAGAAATATTT GTGAAACA GAGCTAGACTCT CTGAAATATAGT TCAG GTTT TGTGATAACATT T CAT CAGACATGT(N ) xTTGTCATTTAAACTCTT TGGTCCAAATGTGAGTATCGAACACTGGAAAATCATCTGACTTCATTTGACTATTCCATAGTGTTTTTCTGTCATTCA TTCAATTAATATATGAGAAGGAGAAAAGAAAAATAGGCTTTTTTGTTCACTACTTGCAATAAATAGTTCTGTTAGAAC CTGGTGTTGAATTC CAAAATAATGATGAGATT TACAAAAAAATTATCT GG CCTGATTATC TTTCAATATCACAGTGAC TGATTATGA CAC TAAAATAT CAGCA(N) xATGGGGTTAAACACCATGAAATATAGTAACTCCTAAATTAGAATATGAA GAT AATGG CACT TG AGTGGATT AG AAG AAAAG CAAGTTTTTC AT CTTGTACAAAC AG AGC CTTAAATTTT ATTG A CAG TGTGTTAAATGGAGAGCATTAACTAGAAAATTTGGATGATTACTTAAAAAATTGCATAGTCCCTGTTTTGACAGTAAA AATATGTAGTTAATATTTTTGTAAAATGAAGTTG CTTAAT TAAGATTAATTTAACAAAATATGTT CCATATTAATTTT TGTTAAGAGTCACCTTTGCACCTGATATAAAACAGCATTATCTTG(N)xAAAATTAACTAAAATAAACTATGCTTAAA CAATGTTCCAACCTTTAAGATGAAGTTAATCATACTTTGTTTTACTTGTTTATAAGCTTAAGAACATATTATTTTCGA AGATATAATTCACTGAGATATAAACATG CATTATATATATTT TTTTCTAGñT GAT AA CTTA CT CAATGfiCAC TAAATT AATGTTTG CTGAGAAATT TCCATCTTATTTTGAGTTTTTC TAT CAAACATATTTTACTGTATTACAT CTAGTAACT TA GTAAATAAGCTATGATTAAAGTAGAATTTTGATAATG CTATGAAAT CATATATGAATTTTGTA CAAAAATTAAATT CT GTGATTTTAAGACTAAATTCTATGAAGAAGGGAAGT CTAGAAA CTGACTTTT TATAA C CCAAA C CTTAATTAT CTGAT TCAATTAAATCATATTGAAATT GTGTGT TTTAAATAACAATG GATTATTAGCGT GAGAAAAACTATTTTG G G GAAAAA GATGTTCTTAATTACCATTAAACAGGAATTAATCTGTACTACTATAAATGTATAAATATGGTGCTACGGCCTTCTGAA CT TGATGT TGTTATACTATAGGGAGAAAGCTATT TT TACATGTACTGGTGGAAT TACAAAAA C CTGAAGT CAATAA TA AAGTCCAT CTGTATGT CACATC CACTGCACGGCTGG CAGOAT GGTTTTCTCCATTGTG CGAATT GCAAAAAT CAAATT TATTGAGGAGATATAGAAAG CT TAA CATATTTTATGAACAATATAAAGTGTGAC T CACTC CTGTAAGT CTAAAATCAA TAG CTAAATGTCTAAGAATAAAAAAA GG CC CTTAATTTCAATAAATACAACACATTTTTCTACTACG CTGTTTCTACT AAAATATC CCTT CTTGAGGGATTAAATCTCATCC CTGCTGACTCCCAT TGAGTAATAATTAATGTGGTT CATTTTTGA ATAGAAAGATTGTGATGTTTTTAAAATTAAAAACACTGATCAA( N ) xTGCCTCACTATGCCTGTTTTCCTCTCCATAT TAATTGTG CATGAAAGAAGATACAGTTAATTAAACT GATAGAGATACATTATATATTT CCAAAATGATAATAAGTAT C AATCTCAT CAAG CTGAAATC TTTAAGAATTTTAGTAATAT TAG CAACATTTTAATTTATAAATT TACATATATG GAAA AAGATATAA CAATATTAGGTATGAATTACTGTTGTGATGTGAAAAAAACAAAAAAGATATATGTTT TTAAATTAAATA
T ( N ) x CAC TGGC CTTCCCAAGTCCAATGTCTAATGGAGAAAACTTATAAAACTATAGATGAAATTATTTTCCTGTATC AGTTTTATGTTAATACAT CTTGTATC CTCTATGCCTTCTT TCTATTAACCTCTC CTCTTGCTTTTTCTCTCTGTGATA TT TTGATTTAG GATAACAAACCTAGT CT GAA CAATTTTCTTAGCCC TAG GAAAT TGGTTC TCAAAATACAGCTTTATT AAATGGTAATTAAAGGCTTTAAAATAAAATTTCAAATACACTGGCTTGATAATAGATTGCATAATGTTCATTTGTTAT AT C CAC CAGAGACCTT CTTGAAAT TTATAATTTGTAGGT CATTTTATTATATTTTTTCTGGTGTGTAGTG C CATGATA ATTGATTATATATGT CAGTAAT C CATGACGGAAATT GCTTTCTTACCTATTT CAATAGTG GAGCTACAGTAAAGA CAA AAATAACTATCTGTTATAGAAT CAAAAT GGGATTATAAACATAATGATGAAAATATCTGT TTTAACTATTAG CAG CTT TTTC T CAGTGCATGATTG CTAAGAAATC TTATTAAATATCATGAC CACAACTGC GTTACTTG GAAAAC CAATAAATTG ACGCATGT TGGAGGGGGT CAGTGACTTTTATACAAAA CTT CAAAGG CTGCCATGTTGCTT CAGAATGAAG CACT GAGG GATGGTAGTAGT TTTATT TTAATGTTTC TTAATT TGAATAAC CATAAT CTTGAGTTATGTTTGT GCAATATAGATGAG GTGGAGTTTGCAATTATAGAAGGAAAAGAGAAAAGAGAGAACAGAGGGAAAAGGGAAAATAAATGAAATTTTAAACAT GAT CATATTTAAAA CAAACATAATTG CTATTAGAGCAATGAGAATC TT GGGAAAAAGG CT CATAAAATTACTAACTAT CTTGAGTCACTCTTGCCTG TGCTGATTGAAAGTTAGGTAGAAGAGGAACATTTTATCAG GAAGAAGACATTT G TACAC AAAAGGGC TGATGATT TAGTGTGAAAGAGG CACAGAGAAAGTAACCTATAATGGG CATTTATTT GGGTAAG CAGGTGT GAAAAATACTCTAGGGAG CT GAGTGC CAGT GTTTAGAAACAG CTCA GG CTCT GAG CACTTGTAATTAT GCAGGCTTTC CCTTTGG GCTTACTTG TTTATCTGTCTGTTGGTGGTTGG C TGAACTTTTTAAATTAAAATTTTGATGAAAAGAAAG TT CCTGCCTTAAAACCATACTTTG TAAG CCTAAAAAGGACTCTAGAGAGCAACTTTAAAGTT TTAT GAGGATATTTTTAA ATATCAGTTTTATTTTTATTGTACATTTTACTTTTCTATTTGCACTGCTCTGCTTAGGAGGAAATTAATTTAGAAAAT AAAAGAAAGTGGCCCCCAAAGTCTTCCAAATTGTTTTTACTGTACTTTGTAAAAATTAAATCTGGTTAGCTTAACTGT GG CGAAGATTTAGT TTAAGG GCTG GAAG GATGCATTATTT TTTTCT CTTTTTAAAAAC CTTAAT TATATTTATTAAGA AAAATTGTTTCAGTñTAAAG GTTTTGTGTG CTñAAGTCTGACTAAAACAAAAAAAAATATATTTTT CT TTGACTTT CT ACAATT CAAATATGTAAAAACAATAAAAATAAGTGGAAAATAAAAATATACTTA CATAAAAT CATGAAAAGATAAAAA TGGGCATATGAT TG CACAAAAG C CTTGC TAATCTGACATTTT CATT CT CAAT CTTGTCTGACAT CACG CT GG CTGCAA ATA CC CAAATAT TT CCAAATAT CTACTTA CTG TTTTTTATATC( N ) xAATATACATTTTAAAATGGATTACTGAATAG ATAGTTGTAATTTGGCCATCAGAATGACAGAGACAACTATATATTTTATTTTACCATCTAGAAGCAAATGAAAGAAGA AATATT CCATAAATA CTTTATG CT TTTGAACTA CATGGAC TCAATAAATTGACAGAAATATGTTATTAAGAT TACAGG ATAATATAAAAATGGAGGTTACTGACATAAAAAGTAAAATGAGTTCAACAAAACCATATTTCACTAATAATTAAATAA ATAGGATAATGTGAGAAGACAAGAAAGACTGC( N ) xAGTAACGGGGAACAGTGATGACCATGACAACACACGACTCAG ATTTGTTGCCACACAGCACAATTTAGAGACAGCACCAGATTCTCTGCTCATCACCATGTTAACAACAGATTCATTCGT GAGOAT CAGATT CATG C T ( N)xGTGAACACATTGCATTGGTTTAGTCAAAACCCCATTAGAACTGTGCTGCTGCAGTT CTAGAC CTT CAAAC C CAACAT
> H S l 9 ^4176228- 421 13 14
GGCGACAGGCCGAGTCTGGATTCGGGATTAGAGAAGGCGATGTCCACTTTACTTTTCCTGACTTTAATCGTTATACTG GGCCGGGCGTGGTGGCTCAC{N ) xGGATGTACCCCCCACAGGAGGGAGGCACGGCCCCCCCAAATCCCTCCAGGAGGA AGACATGCCCCC CAAA TC CCTC CAGGAGGGAGACATACTC CTTATATC CCCCCAAGAGGGAGACACGCCCCCAGATCC CCCCAGGAGGGC CACACC CCAGTCCCCC CATACC CTCTGGGTCTCT GGGACATC GGATTCGACC CC CAAC CCCCTCTG GCAGAGCCCCCACCCCTGCACCCAGGTGGCACTCACGTCTTACACTTGGCACATTCTTCCACAAACATGTTCCCGTGG AG CTCTGC CAGTTTGT CCCTGTGG GAAGAG CAGGAAGAGG CTTGATGGTGGGAAAGGATC CC TGGATG CCCAGGCTTT GGAGCTGGTGCTAAG CCCCTCTCCTC CAGGAAGG CTTCCCGGACTCCTCTCC CACAA CATTGAT CAAC TCCTTCTTTC TGGCTC(N ) xGAATCATTAACCCCCAAGGCTGAACCCACCTTAGfiTAACGGGAAGCCCTTTCCACTCACAAAATGCCC GT C CCC CT C CGAGAG CATATGG GAAC CT CTTGATGT CAC CAG GTCAGAAGCACAGGAATT CAAATT C C CAGTTTATAA TGGAAGAAACCGAGA CAT TGAATG GC CCTGAAGCAT GAGATC CAAT CC CAG C ( K ) xCTGCAACTTCTAGACCCTGACA GCTCTCCTCCCCTTAGCCCCATCTCGAGGGAAATTCTGCCTCTGGCCTCAGCCGGGACAGCTCTGAGCCATGGGACTC AGAGATCTCCACACACTGGGC CAAGAAC TCAGG G CTAGAATCTTATAGAAC CAG GAAAAGGG GACTCTCCCC CTAAAA ATGCAAACACGGTTTG TGGGGCCC CAAG CTCTTTTGTGGGGGCGGGGC CAGG GT GTTACCTGGGGAAG CCTGAGCGCA CATG GAGCCCGT CCACGTTCTGGCTGACCAG GAAGCGGAGGAGG CC CACG CGCT C CAG CT GCAC CAGCGC CATGTGGG TCTGCGTGGGCCGCGCGCTC TCAAAGGT GGTGT CGAACTTGGGGGC CAGACCTC GCTC CT CCATGGT CCAGACT CC GT GGGGACCCCTGAAG GTGGCAGGCC GGGAGAGATGGACGGG( N ) xAGAGAGACAGGGAGACAGAGAGAGGGAATCAGAA ACAGAAATAGAGGGTGAGTTG CAAGAACAG G GAGAGGGG GATAGAACAAGA CACAGA CAGAGAGATAAAGAGTTAAAG ACAAAGCAAGTCAGAGACAGAATTG(N)xTGAATTCACACGAAGGCAAAAATG(N)xAAGGAGGAGAAATTCACGCCT T G G C ( N ) xAGTGGAATGCTGGTTGTGACAAGGGACACA( N ) xATGC CCAGCCAAGGGACACATCTTAAAGGGAGTACC CAGGACACATCCAGGA GGAAAGCAGATGGATGTGTTCTCATT CC CAGñTGTGTGTGGCTCGCAGGGAAGC CT CT GG CA GCCTGGTAGGCCTTGCCTCGACTTCCCCTG CA CAATCACAGACCTGAA GT CGGGGATG CCAGAGGCAGTG CTGATG CC GGCACCCGTGTG GAACAC CACACT GGAAGACT GC CAGACCAG CC TCGC CAGTTC CCACACCTTCCGCTCCAGCTCCTC CGGGGGGTCGAAGATC TGTGGGGGGAGAGAGCAGACGGAGGGGT CAAAACAGTT CC CC CAAGTG GCAAGG C CAC GCAC GAGC CACAGACTGA GCTGTCCCTGTCTTGC CTGAGGGAG GAAAATGGATTGGAAAAGG CAGCTTGTCC( N ) xCCATGC AACC CCTAGT CAGTCTCCCTTACTA(N ) xGTGACTAACAGCAGCAGTGAAGAGTGGGGCTTCTGGACGGGCGCACACC ATC(N)xCCGATCCCTCCACTTCCCCTCTG(N)xGTTCTCCAGGTGTGTTTATTCCCACCTCCCTTTGCGCCTGTCGT CC CCTCTTCCTGGAACTACATCCC CT CT CT CC CCGATCAG CC CCGACAGG GTGACCACAGAGTTGGGCGT GTTGGCGT CCCCCGGCGCGCAGGGGCGTCACTTCCCGCCCCTTCCTCACTGGGAAGTCCCTCCCATTGTCTAGCCTCAGTGCCCCC TGATATTCCCACAATGCCCCCCTGCCATCCGGCCGCTCCCAGCCCGCGGCCCTGGGGCGCGATGCTCGGGACCCTCAG ACGCGCTCACCT CCGGGAGG CCGCACTTGCCCTTGTCCGCGTACGGCGACAGCCCCGCCG CGTAATTCAC CGACAT CC TCGACTGCCC CACGGGAACAATAAAGTT TCCCTTGTTGAGGCCGCTTCCGC CGGAAGCGGGG CGGGGCGGGGAAGAAG GAGGAACCGCGTTCCAGTCCTGCGCACCCGGCCTCCCACGGCAAGGCGCATGCGCCTCTTGCTGACGCCGCAGGCGAC ATGTTAT CTG CTGT CAGAAG GAAG CCTGCCTCTTTGCATGCAGGTGTTTGCGGGGC TTGGGAAGGGGCTC CC CGAT GA CCCGGGCGGGAAATTGGGGCGGCCGCCTACGTGAGAGTTCTCCAGTCACCTCTAAAATGCGGGACACAGGCTATCCAT GT C C C CACAACC C CTT TTAC GGATTAGTAAAT T C CGTAAGGACAGAGC CGGACAGCAGGGAC C C CAGCCTAAGG GT TT TGAATGC CACGCAGAG TTACCTGAATATCT C A T T ( N ) xAAGTTATCTGAGTATCATCCAACTGGAGGTTCTGCATTCT GCAGGACAGGTC CTGCTCAAG GTCAATGGG GCTGGTG GC CTGGAGG( N)xTCTTCCCGCCAGAGCCAGTTGGCCAACA GGCAGTCCGCTGCCAGCCTGTCTCCTCCCTCGGCTCATCCCCGGCCCCCTGGGGCGGCCGCCTGCTCAGAGCCGATCA GGTGTCAGAGTG GAGCGG CTGGAGCTGCTCGTGG CCCAGC TG GGAGTGGGTAAC CG GAGC CGGCAGTGGGCACTGTGG CCGGGAGCT CG GGGCACTGGAGCTGCAG GAGGTAGGGGAG GAAG GAGGTG GGAGAG GC CCAGGACGGAGGAT CC CC TT ATGGCCCCAGGCCCCTGAGCCCTTGTATGTGG GAGAAAGAAG CCGTGCTGAGTCCTAGCCAGGC TGCGGAGCTACC CA GGTACCCTGA CACACACATG CACT CGTT CACO CACGCACTTC CCACTTGCCGGCCTGTGCCCTG CCTAGCTT CCTC CA AGGCAGGTGCTCAACAT CCCACCCGGCCAT CAGG CCCTTCTTATTTACCATCACCTCTTC CAGCAAGCAG CTTC CTAA ATTTAGCAGGCGTTTCTCCCTCCCCCAAGCCCACGGATTGTCTTCTCCCAAGCAGCCTGTGAACTCCTCTGTTGGGTG TCTTATTACCCCCTGCTTTGGGTGTGATTGAATCCTCCAGTCAGAGGTAGAAGAAAAAA(N ) xATCTGATTAAAGGGT GATGAAGAACCATGAGAACATGAATTGGGTGGGTGGGTTTGTGGGTGGTGGCCAGTGGGTGGTTATACAGTGCTTAGG TG GGAGGATGGATGTC CAGGTGAGTG CATAGGTG GTAAATGCGTGGAAGG GTGG CTTGGAGAATTCACGAGTAGATAG GTGGACAGGTAGAGGC CATG CCAACC CCTACT CCTTGGTAAGAG GCAAATACAGATTCTC CATCACAGAAAT CC CT TT CACACATAGGGC CT CACAGT CACACC CCTCAAAAAACACACC CC CT CAAGATAGG C CTGC CTGACCT CCGTG CCTG CC CATG CGGAACGTAGAACAGG CCCAGGTG GACAGAAGTGCACGTGTG CACACGCC CCACAC TCAACGT CACACACATGC ATGCAGCCñTTGCCCCAGACCTCCñCCCAC TATG GGTGAATTGC TGTCACATGGAGGTTCCC TCTGACTATG TAGCAA ACATGCCTCTGTGCTGAGAGCCGTC(N ) xAGGTCTGTGTCTTGCTCTAGGGGCCAGGACTGGTCAGGAAGGAGGAAGC GGTCCTTTTGAGGAGGGCGAGGCAACGGACCCTGCCGCCCACCAGGACTGGTGTCCCTCACCTTACCCCCACCCTTGC CCTC CTCAGGTGGC CTGT G GAGAGGAGAAACACAG GG CAC CAAC TATGAAGACT CT CAGGGCGCGATTTAAGAAGACA GAGGTGAGTGTGAGGCCCTAGATGCCCGATACACCCCTGCAACTTCAGCCTTCTGTCTCCTGGGGCTTGGCTGAAGAT CCACCCTTTCATTCCAGTACCCACCTCTTCTCCTTTCCTCTCTGCTGCCCCCTAGCGTCATGACCTCCCCACTGCTCC CCGTACAAGCTTTCCAGCCCAGGCCTTTACCCAGGCCCTAGTCCGCTCTCCATGGAATTCCTTCTCGCCGCATTGCCA CCAATCAGAC CCTCCTAATT C CTCACAATT TGGC CCCAGTAC CACCTCCCCGTC CAAT CTCCGCTCTCCCCGGCCACA GTAACCCCTTGCTTTT CCAAGGAC CCAC TTTC CCGAGGAT CC TGAT CTTTTGCTTTGG TG T G ( N ) xATCCAGCTGTCT TCTCTTGCAT CCGTGT CTTTGGCTAT GAACTC CTAGAAAGACATGCTT CT TCCT CCATTGTC CC CATTCT T C TCAG CC CC CT CAAGTC CCTGTGGCGG CCAATT CAGGGCACAGATGGGGAG GATGAAAGGGAGGG CT CA CC TCCAGGGACAGG CA ATCAGCCTTCCTGAGCACCTCTTTGTGCCTGGCCTGGGTGGGCAAGCTTGACTTGGAGCCTTTAGTTTTCGTATCATG GGATACAACATC C C <N ) x TCTTTGAAAGCCATTTTTGAAAAGCCAAAAAGAGAGGCCTGTGGAACAAGCCTGGATTTC A T T (N )x CATTTTATTTTG TTTTTAAAC TATTTATTAATTATC ATG TAC CCATTCATTTCATT(N )xT TA TT TA T TTT CA T TTT TG TT G (N ) x ATGAACATGATTGGCCTGGAGGTGGGAGGGGAGTGCATTGATTGAAAAGAGAATGGGTGGAGA CAGACAGCGGGAGAGAAAGAGGTGAAAAACGGTCCATTAACTGCCTACAAAATGTGGCATTGGGTGAGCTTCATTCAC
t g t g g a g t g a t c c t a a a a t t t t t c t t c c c a c g t g g a c t c a a t t c c t c g a t g a t a g t t g a c c c t t g a a c a a t t a c a g c c CACCCTCCTG CACAGT CAAAAATC CATGTATA( N ) xCCTGAATGCTTTTTTTCCCCAAGCATTCCTCAGAAGTAGTAG CCACTGGTAAACCAAGCCATATGATTGCTTAATTTCCAAAGCATAGTTATATTTAATATTTAAGTAAATCTTTCAAGA AGTGTGGGTGGCTAAAAAAGAGATAC CT CATT CGGGATGTGGGTGTGTTT TGATATGTTTGCAATGGGAGGG GT CGGG ACAAAGTCAAAATTGGATGGGGACAGGATAGGTGTTAAAACCTCAGACGGGTGTGAGGGCCTCTGAGAGTGTCTGGAG CTGGAGGGGCCTGAGGTTAG CCCATAGC CTG GGC TGGAACTGTTGCAG CAGATG GGAGGCCTGGCTGTTG CTGG GAAG CTGCAGGGCCTC CATC CTCCCCTCCCAGAGCAC CTTGCCATC CATCCCTCCACCTGCCCTTCTCCAGTTTCGCAGCAG GCAG C CG GAGTCAAGTT CAGCTTTTGTTCAGCAG C CC CGAGG GAAGAAAG CAAACACC CATGACTGCAGGG G CCAGGC
c c c c c c c t c c t g t c a c t g g g a c c t t g t g t t c c g t g t c t t c t a g c c a c t g c t a c c c c t c t c c t c c t c t c c t t a g g g c a c TCTCCGGCCTTTCCTGGTCC CTAACACCGG CT TTTTCTTATTTATGTATTTACTTTTATTTTAT T T T C T T T C T T ( N ) x AACACCGTTTCTTT CTTTGGGGTCC CAACAGTTGGCATTG CTTACTCTGATC CC CGTTGGGC T G ( N) xTTACCTGGAG CTGTTCACTTGGGG CT T CA CTGACCT CCTG CCTTTACTTTAGGTTTCAGGAAAATCTCC CAAGTGGGAAGATTGGGGT TTCTAGAGAGTCACAGACTCCATACATCAATGCTTCCTGGAAAGTCCTCTTTCTCTTTCTGTCTTTTTT(N)xAAACT CCATGAG GTCAGGG GT CAGG CACTGG CAGAGTG GGGCTGC TCAGAG( N ) xGGTACACACCTCAGTGGGCTCCGAGGGG CATTTGCTCTGATTGATTGATTGA(N)xGTGAGCCACCGCGCCCAGGTT(N)xGTGAGCCACCATGCCTGGCCATTTG CT CTGATCTTGTATGG CATC CAGGAG GACAAGTGTAGGAGACAGTTGGAAACTGAAGCCCTGAG CTGGGAGATC CGTT TCGGTCTCTTCTCTGGAGAAGCTCAGAGGACCTCCCCACCCCCAAGGGTGGAAGGAGAGAAGAGTTCCAAGGACACCC TCCTTCCCAAATGCCTGTCACTTTCCATTTTCCTGCCTTGGTTCCCACTCCTCCCTCCTCTTGTCCAAACATGTCTAA GAG(N)xGTCTAAGAATAATCACGGACCACACCGACGTGTAAATGCTCAATGCTCACCCAGCCCTAAGCCAGTGGTCA ATGTGTCAATGGA(N)xATTCTGGCTCTGGAAGACCTCGGTT(N)xTCCAGGGCTCCTTCTAAAATGCCAATTACCTG CAAGG(N)xTGGCTAGAGAATGTCAGAGCTGCGCCGGGACCAGCTAGCTCTTTCTGTGAGGAACGAGTCACAGT(N)x TGTGTAGGATTGCñGCTC CC CAGTGG C (N ) xAGTGTGAGTGGCGAGAGCCCTGCCGGCCGCGTGGAGGTGCGAGGCTG CGGñCG TCGCGGGC CCGGAGGCACCTGCGCGC CCTTGG CCGACTCGGAGGAGGTGGAGATGGACGC CCGCGGGT CC CC TGGAGATGCAGCCGGCGGCCTGCGCTGGTGAGGGAGCCGGGCCCCCGGCGCCGCGTCCTCCTCATCCTCCAGGCGACA AGGTCAGGAGGGGCCGGGGCGGCGCCCCTTCCCTCAGCCCCCAGCCCCCAGCCCCCTACCTGGGTCTTCCCATTCCAT CCCGT(N)xCTCCGGCAGCGCCCAGCCCCGCCCTCCGGCCGCTCCCCGCGGTCCCTCCAGACCCTCTGGCCGCCGCCT CCTCTTGGAACCCCGTGC( N) xCGCCATGAAGCAGCTGTGTCTGTGCGCAGCCGCCTCCTTCGCGGTAGGGCCCGGGG AGGGGGCGCAGGAGCGGGCGGGGCGCGGGTACCTCCTCTCCCCTCCCTTCCCCGTCCCCGGCTGACCTGGACCACCCC CC CATT C CAGACCCG GGAAAGATGGT CGGCG G CGGGGG GTG G GGGGGAACAGAG GTTGGGGCAGCTTTTGGGG GAATG GAAGAGACATTTGG GGAGA CATGTGGA CACGT TTTGGAAGACTCTTTGAAA CAAATGGGGACAT TTAGGAACAT TTAG AGGGCATTGGGGGGTGCAGTTTTGGGAAAACATTTTGGAGACAGATGGGGTCATTGAGGGGACAACTTGGAAGCTATT TTGGGAGAGCCGATTTTGCGAAGAGGTAATGATTTGGAGAACAGTTTGGGAGAACATTTGAGGGATGTTTTTGGAGGA TATTTTTGGGACATGACGGTATCATTGGGAAGGATGGTTTTACAGAACACTTCAGGAATTTGGGGGAGACATTTCGTA ATGCATTTCAGGCTATAGTGTTGAGGGGAGAAGCCTCGGGATGTTTAGGGGACAGTGGTGGCATTTCAGGGCAGTTGT GGATAAACAGGGCAGAGGATGGTCCTGAGAGGCATTTAAAGGGGACTTTTAGGGGTTCAGATGGGACTTTGGAGGGTA GT TTGGAAAAGACT CATG GATCCATTTG CTGG GGGCGATGTT GCAGGG GTGG CTTTCACTAAGG GATGAATTGG GG GA CAAC( N) xCAGGGGACAACGTTTTGGAAATAGATGCCAGGGGGCTGCTCAGGGGATGATGCTGGTGGTGGGCTTTTGT TGGACAACTGGGGTGATGGG CCTGGGG G CAGATGCTGGGGGT CGCAGG CT TGGGGGACAC TGTCTGAGGACCCCCTCG CCCAGGACCACCTCCCCCTGCAGCTGCGGCTCAGCCCCACTGACCTTGGCTCCTGCCCGCCCTGCGGCCCCTGCCCCA TCCCGAAGCCGGCAGCCAGAGGCAGGCGCCAGGCAAGTGCCCAGGGGCAGGTGGTGAGAGCCCGGAGCCCCTGTCGGG GG CGTGG GGAGGGGACAG CAGCCAACACTG CC CCACGCAC TT CTGGGCGTGC CCTGCAGAGT CAAGACTGGGGCAAGA GTGACGAGAGGCTGCTACAAGCCGTGGAAAACAACGATGCACCTCGGGTGGCCGCCCTCATCGCCCGCAAGGGGCTGG TG CC C ACG AAGCTAGA CC CCG AGGGC AAGT CCGCGTG AGTGC CCGCGA CC CGGG AGTGAG ATGG CTGAGGGG TGGC AA CCTTGCGGCTGAACCCTTGTCTCTCACCTCCAGGTTCCACCTGGCGGCCATGCGGGGTGCGGCCAGCTGTCTGGAGGT GATGATAG CTCATGGCAG CAATGTCATGAG CG CGGACGGGGCAGGTAC TG CCAG CTGGGC CC CGGGGAGGGAGGAG GA A CTAAG CC CAGGTG CC CAGC CTGAGGGT CCAG CCAGAC CCTG CTCCCAGGT C TCAATGTT CC CCGCTGAAAAAAGG GG AGGCTC CCTCTGGTGCT C CT CCCCTG CC CC CAGGGAGACACACAGGGG CATGTTAGGATGGGGTGG CCAAAGATAGAG ACTGTGGC TGGCAC CGAG CAAAGCCAGTGGACAGCTTGTGGATGGGGAGTGACTAGGATT CAGGTATTGGTT CAGTTG GATGGG GAAGTAGC TGGCATGGGGAAGTAG CTGGGGCTTGGG GGATAG CTGCAAAGTTTC TGG GG (N) xTGGG GAAGT TTCTAGAAAGTGACTGAGGCAGAGGAAGAACTGGGGCTTGGGAGATAACTGGGACCATTGGACGAACTAAGCTTTGAG GTGCATCTGGGATATAGGAGTATCCGAGATAAAGGAAGTGACTGAAGGGAACTGCTTGGAGTTTGGAGGAGGAACTGG ATT CATG GGAGTAG CTGAGCTTGCAAAGTATCTGGGG CAAGAGAATAACAAGTAGAGTTTGG G GATATCTGAGGTATA TGGGATAGCTGGGATCTAGGAGTAGCTGGAGCTGTGGGAGTAGCTTGAGCTCTTGGAGTAGCTGGGTTACGAGGAGAA CTTGTATGTATGTATGA CTTAACTGGAGA C CTGGGGGGAATAACTGAGATAAAAATGTGTGG CCAAAAGGAGTAGTTG GGGC TCAGGAGGCAACTAGG CATGGGGACACAGTCAAATCTGAAGCTTTAGG GT CACAGG CATGGTGGGCAGGGAATA TGGAATAGCTGGGGATTCATGGAGCATCAGACAGGGACAGAGTTGGCATTTGGAAGAGGAAGGGTATATGGATTGGCA AGAACATATACAGCAACTGTGGGTTG TGAAGGGCAGAG CT GAAACTTT GGTGTAGTGGAC CCTGTAGAGTAT CT GG GG ACATGGGGAGTAGCTGAGGCAATAGGGC TTGAGGAATAGGTGG GTGTG GACAGT CGCTGAGATG GGATTGT CTGGAAT A CAGACTAS CCATGGCTTAGAGAGTCTCTGGGTACAGG GAATAGCATG GT TTTGAGAAGTA C CT G A T (N) xTACTAGG GTTTACGGACTTTG CTAGACTCAGAAACTCAT CACAAGTAGTTGACAT GGCTTGGGGAGCAG CTGGGAGCTACTTGAA TAGCTGGAGCTCCAGTAAATCTTCTTTCCTTCCCCAGGTTACAATGCCCTCCACCTGGCCGCCAAATACGGGCACCCA CAGTGCTTGAAGCAACTACTGCAGGTCATTTACTGTCTTATCTCAGCTACTCCCTTGGCCCCTACTACTCCTGAGTTC CAGCAATT CCTTGAAC CC CCAGATGCTCTATGAATCCTAGATATTCCT TGGC CC CTTTTACT CCAGAACCCTAG CTAC TC CCTG GATAACCAGGTACT CTCAGA CCAT CCACATGCTT CT CCTATTTAAGAT GTACTC C C A (Jf) xTATATATATAT ACTCCCAATGCCAGGTATTTCCTGAGCTCTGTTTCTCCCCAAGCCCTGGGTCCTTCCTTAGCCCCAGACACTCCCAGA GTTCCCAGCCACTCCTCAAATTCTCAGCTGCTTCTCAGCTCTCAGCTATTTTCTTTTCTCACCAATTTTCATGAAAAT GGAGTC CCTTGGGGGC CACG GAAGGGTGAGGCTGAAGTATAG CAGAGC CCAG G GTAGGGAAGGCAGTCCTAGAGAAAT CAGGCAGCAAGAAGGATGGCTCCGCCTCAAAGGACTCGCCCCATCCCCAGGCTTCCTGCGTGGTGGACGTCGTGGACA GCAGCGGGTGGACTGCCC TACACCATGCAGGTG GGTGCAG CC CAGCCCTG C C CT GACCCCGAAG CC CAGAGGGTGC TA GACTTGGG GGTTATTCTGCC CAGGGC CCTAGGGACCTGAC CTGCTGTGAAGCTT CCTGTCTC CT C C ( N) xAGCAGTCG AATGTACT CCTGGG CTTTGTGAGGATGAGGAGAGGTC C CTAG GGTTTAATTT C CTTTGTGGAGC CC CTAGGAGATACC AGGAAAACCATAGCTCCAGGCAGCTGGTACCCTCATAATCACCTAATCCAAACTCCTTCCCAGAGGGAGGGAAAGGGA CCCACCGACTTGAACCAGAACTCAGCCCCCGACCTGTCCGCTGTCCCCGTGAAATTCAGGCATGGCTTTCGCTGACGA CT CAAGAAA CATCTGTTCTTTC GGAAAC TCAGAATAAAATAG GG CTTT G GGGGTTTTAGT CACT G GGTGAGAATTC TC TG CAGTTG GAGTATGCTG CCAC CTAGTGGCGG CAAT GCTT CTGGTCA CAATATAGGAGT C CGTTGGTTTTTG TCA CTC AACAGTTAACTG{N) xCTCATACTGCCTCAGACATACAGCAGGTATTCAAACAGAGTTTGTTGACTGAAAAATGGATT TTTGTGCT TAAT CAGACTG GATGT CAAC CTGCTGGAATT CATACCCAGGGATTC CAGAGACCATGC CCTTGTGT CAG( N)xATTAGCCAGGCACGGTTGCACATGCCTTGTAGTGTGGGT(N) xAGGAGCAGGGTCACACCCAGGCAGAAAGTGCC CAGAAAATGT CGAAGT CT CACO CTACCCCTACTCTTCCCCCT(N) xGGACAACCTTTTCTCTTTTGCAGOGGCTGGTG GCTGTCTCTCCTGCTCAGAGGTGCTCTGCTC CTTTAAGGCACAT CTAAAC CC C CAAGATCGGGTAAGC TT CTGGGATC TCTT CAGGGAAGATGTTATG CTTGAGGTATGCTGCCAC CAAGTGGCAGTATC TCAGGCAC CCTTTGAAAGTCTGAGAA AGTTTGAGGT GAAACT CAAG GG CAGTTATATAGG CTTCTGCACCTCCCGGGGGT CAGTTT CT CATG CCAT CG CATC CC TCCTCCTCC CTACCAGTCAGGCGCAACACC CC TCATTATAGCAG CT CAGATGTGT CACACAGACCTGTGCCGTCTCCT ACTGCAGCAAGGGGCTGCCGCGAACGATCAGGACCTGCAAGGCAGGTGAGCATCTCCCC(N)xAAGTAAGACGATCTA GC CAGC CTCCTCTTCC CAGC CTGGCCTGGGTG CTGAGG TGGGCATGGGGGCTTGGGGGATGTTCTCATCT CCTCAGTA GCCCCCTCCC CTGGTAGGACGG CC CTGATG CTGGCCTGTGAGGGGGCCAGCC CCGAAACAGTGGAGGT CCTGCTGCAG GGCGGAGCCCAGCCGGGCAT CACCGATG CGCTGGGGCAGGACGCGGCTCACTATGGCGCCCTGGCG GGGGACAAACT C AT CC TG CACCTT CTG CAAGAGG CGGC CCAGCGCCCCTCCC CACC CAGCGGTATG CAAG CCCCACCTCC CCAATG CATT TGCTTCTTGG CAGC TTCTTGTCACTCCCCTTCTCTT TATCGTGAATAGTT T CAAGGTACC CCCGATTGGCTG CATT CT AG GAGGTC CTAGAGCT TAC C CAAT TC TACT CAGAA CAGTTT CAAGGAG CCCCAGGG CATT TAG AGTAGTCTGGGAG GG GGTCTGCTTCTGCTTTCCTGGGTCATCTGTACAGTAAGAACTTTGCCTGTCCTAAGAAGGGCACCCCCT(N)xTATCC T C TCTTTTTAACAGTC CCAAGT CCCTCTCT CAGCACTCTACC C
TABLA B
>Hs1 23712399-23728155
T T G GG C AAGG CTAAAAAG C C CAT C TGTGGC CTGTGTGGGC CT CAGTTTCC CTGGGTGTGTAATGAGG GAG CTGG G TTTCACATTG CCAAGT T C C CTT C CAACTGT CTATGACT CTTCAG CCGCCATGAG TCTG ATTCTT C T T ñT T GT CAC GT CAG G GCAGAC CAGAA CACACTGAC CAAG CCTCGTAGACACATAG CAGGGATGATTCTAAACT CAGAG GAACAT TGTC TAGAAG CCACAGGTGCAC CACATTAGTGTACCAAGG CTTGCCCTTCTCTCTG CAGC CTGGAGAGTT TT TAA G G { N ) xAAAGCCTAGAGAGTTTTAACGCTTTCTCCCCTCTGGCTCTTCCCTAAGCTTCTCTCTGATTTTCCACCT CTATGCCCCTGCGGCCTC CTGCAC CTCCGGCC CGTC CC CT CACTGT CCTTACCTG GTTATAGGT GCAGTTCTTCT TCTTGC A T T T GC TGC ACTGGAAGAGGTCAGTG GTGGTG CCGC C AGT CTTGGC CATCTGGTGCTC AC GGATGGCCT CCTGGGTCATGG CATT CCTCAACT CC CTCAGTTCAT CACTGG CCATTTCCTG GAGAAAAAAGAG TCTACC CTTCA GGGCTGAGGAGT TTCAGGGG GC CTGC CGTACA CCCGGTTT CTAGAAAGCCTGAACAGAAAAGGAGGTAACATGCT TTATTGACTG CAAGGAGCTGGAAGGAGGCCTGGATT CCGGGT CCTGCCCCAG CTGGGAGAAAGCTGTCCC CG CAG AGTC CT CCTG CCG CCCACGG CT GGAAGGT CATAATG CACATT CAGGTGAG CAGCAGG CAAACAG CCTCTGCCCGA GG ACG AGC ACTC CTGTGT AC A T T T CTTCAGAT CTGGGG CT CATC CTGCCTGC C AGG AC CTTGGC CTTGGC CTG AA CCTT CCTGCTTGAGCTGACT TGAT CACTGC CACCTG GT CTATGATGTTCC CCTTAGCCTTCCTCT CTAGACT TGA CC CAGTGCTCTTTTACGCTGTGAT CACTGCCCTGCTGTTCTG CTTAGTGG GAAC CT TGGCGCAACTA CCCATGAC TTGCTGCTGCTACCCCCATCCTGAAGCTCCTGGGCTCACACAACTAACTGTTCATCTGATGAGCATGCATCCAAG AT CATCTTTTTC TGAGAGTTTCTACAGAAGAGT CTC CAGT CT CC CACCAG CT CACAGTAGACACTC CGGGTCTGC AT CACC CACACCTTTG CTTTCTCTGTCCATCTCCCTGGC CACTT GT CTC C CCTTAAGC CT CTGC CAGAGGACAG C AGAGAGG GAG CT CCAGTTAAGGGT TG GTGGAAG GG GAAG GAG GGTGGCTGACG CGAAT CT CAAAATGAGGAT TT T A A A C (N ) xTAAACATTTTAATTGTTGCTGTCTGTCCTGAGTCCTTAAACCATGGAGAAGAATGCTGGCATAAAGG TCAAAG CATTTTAGTG TG CT CAC CTTTCATTATAAC CACTTAGG CTT CTTGATAAAATG CAGAT TC C T A ( N ) xA G AATATG CATTTT CACTAGTGTC CT CAAGTTATTCTGGTACGTATTAATGTTTAAGT CCTAGTGTGTTAAATT CAA AGGC TAGGCC CTAACCAATG GCAGGAGAATATAGAAAAACAAGT CAATGAAAGC CATTTACC CATATGAGTAAAA AGTAGCTGAC TT TAAT TTGAGGG GATTTTC CAGAGCATAAAG CATTTGCC CATT CAAAAT TTAT TG G A T G T (N ) x TCTATTG GAT GC CAAT TTA CTGAATG CCTTTAACTGTAT CAG TT TG CAGGTAAACCAG CT TTGGTTGTGGG GA T ( N ) xCAGAGGACAAGAATAGCATGTTTGGGGAAAAGTATTTGGGGTAGGTTCAAGTCAAGCATGGGAGATGTGGCT GGAGGT CAGATTGTGGAAG GAAG GTCTTGTGT G CCAGCTAAGAAATAGAGAACAGAAAAATCTAT C CAGC CACA C GC CAATT CATTGACTT CT CTGC CCTGCCTCAT TAGAATGAGG CAGCATCAGAGACACGGAGAATAAATACAACAC TGTAGAAAATAAGGTGGAAAATGAAG GAAATGAGAGTCAGAGAC CACAAT TCTATC CC{ N ) xTGGACTGTGAATG CCGGTTCTCA GGGGAC TGACAGGATG CTAT GT CCACTCAG CAGCGCTCATGGGTATCT CAGA GCCTACTC CATT C TTTG CT ATGT CTGAGT AGGG AAGAAAAGAGAAGAAG AAAG AATG AGGG AGTC AGGT AAA C AAGG AAAG AGGC ATA AAAAGG GAGT CAGGGAAG GGAGTT CT GAGAGAGACACC GTCCACATGGGTTCCT GATGGG CGCTTCCCCT CACAG CAGG CAGCCCAAG CCCAAGC TAAGATTTTT C TG T TT TTT TTT C T T T T T T T C (N )xCACTGTAAAGCCAAGTTGCA AT TTATAACT CCAGGCATATGTTG CTATTTAC CCAAAAAGAT CAGATATT TTAAAT TAGG CAGAAAGATAAC CCT GGT C TACCTGTG CCAAGT T C CC CATTTCAT CCTAAAGC CAAGGC TC T T T G C T G T T T G T ( K ) xA C ACAGACACACA CTTC CACATG CCACACATA CACAAAC CACACATGCCACACATATGC TAAT CACACAC CACACACATA CTT CACAT G A C A T (N )xTC C A TA C A T C A TA C A C A C A T A TTC C A C A C A T C A TA C A C G C C A C A TA C A C TC (N )xTC C A C A TA T C A TAC ATGCCACACAGATATACTATACACCAAACACACCTCACACAT(N )xTGCCCACTGCTCACCTCCTGTTGTGT GGCCTGGTTC CCAACAG GTACCAG CTGGTGGC CTGAGGGT TAGGGATCCCTAATATATAC CATACG CACATG CTC C A TACATACA CACACATACA(N )xG CC CCCACACATG CCACACAAC CTAG CG (N )xTCCTGC TAG TCC AG AGCCC AGGG GG GAGG CTGAGATGAG GCTGGTTTGAAAAGCT CC CTGG CT CACTGCTGAT GAGT CACGAG CCACTGACCCT CTGCACTAGG CAGGGCTAGGGG GACCTGG CTT CCTG CCTCAC CATG CAGATG CCCTGCTCTCACCTCTGCCGTCA TC T T GG CTATAAGCCCTG CGGAGATGGCCC CA CTGAGCACGTTC CGCCGCAGGC CGGGGTTC CTGGGGTC CT TGA GGTTGCTTATGCGGCT GCGCACGCGG TTCCGGTACTTCATGT CCGTGCT CTTGAGC TCTTGGTAGATATGTGACA CAGTCAAGGGCCGGCCAGCCATTCATGGAGGGGCACAGAAGGGTGCAGGCCTGGCCCTGACGCCTGAGAGCTCCA GAAC CCTTTCTT CCTC TC CT CGGACTTCTA CAGGAGTTGGGC CCCTCTTTTGTGACCT CTGA CT CT CCAAGTGGT
g a t g tc t c t t g g g c a t t t g t g g t c a g c c a a g t a g c a g a g a a a g a a a a t a g c t t c t g g g g t a a g a t c a g a t c a t (n ) xCTGTGAGCTAAGCCACGTGATGCACTAGTGTGGTGTCTGGGATATATCAAATGCTCAACATGTATGGGCTGCT a ag a tg atg a c a a g g a tg c g g ata a c ttc ta g tttg g aa g g g c c c c tg g a g g ttg tg a g tc c a g c tg tc a g a g g a a a a g a t t a a g a g c c a c a t g t t t a a g g c t c t a c g a a (n ) x CCa c g c tg g g a a ttg c tg g tc a g a g (n ) x CGGCagg AAGT CATCCCTGGCCAC CAACCACAGAGTTTCTGTC CAAA CACTGCATCCTAGG CAGG CTTCCGAGCCTCCCACT
c c c tc c a c c c tc tg g a g g ag c tc g g g g a ac a c a g c c tc attttc ttg g tc tc c tg a g c c a a ag a ag a tc a g g aa g ACAAAG CTGTTG CTGGGAAGAG CCATATA CAATAAAGAGACTGCGGAGAATG TGGAAAAATGAAAACAGATGACT
t g c t a g a a g t g g t g g g a g g g g a g g t g g t a t t c t t t c t t t t t t c t g t t a t t g t t t t a a t g t t g t t t g t a c a a t a a a AGTG TGAAAAGCAAAGAATATG COA CAATG CATGTGAAAG CCAAAAAATGTG TTTTAAAATGTACCACTTGG GAT TCCTGGGGTGTATTGATATGCGGAGCCAGCAGGGCTGGCGATACTTTCACCAGCCTCTTTCTGACTCTGTCTCCA TGTC CC AAG G CCACAGAGGGGACGAG GCAGAGAGCAGAC CGG GGGCACCGGC CC CT CAGAGGAACT TTTGGCTGC AT TGTTAGCACAGGGC CACT CCAAAGAGGACAGATCTATTTTAGGACATAGAGCAG TCAAGC CC C CAAAT GTACC
c tc c c c c t g a c c t c c a g c t g g c t a a a a a c a a a g a g c t a a a g t g t t t g a t g c a g c t t t g a t t t c c t c t c c t t t a g a AG AGGGACATTCA(N ) xGGAACTTTCAGTGGATGGAGGAAAGGCTGGGAGTGGAACTTTGTACTGGACCATCTGC t g ttg tttc a a a c tg g g c tta a a g g tg g a a a g g a c c tg c ttg g a g g g a tg a c tg ttc a g g a g a a a a tg a a g a tg a AACTCGTGATAGTTCTCATACCTTGTGTTCTGCCTTCTTGAATTAACAGGAAATGATGAAAATGTACTTTTTGAG GTGCTGAATGGCTC CAGACTTGTTGAGGTATTGCTGATTAAT CATTAAATGAGAGTfiGAGAGfiAA C CACCAGAAA ATAAGTTG GAGGGAATCAGG GTAG GC CACAGGAAGAAGATGGAAACGCAAATTT GT CGTCTG CAGATT GGCC CCT GGCTCTCCTCAACAGACCCTGCTGAACTCCGCAGGTAGAACCCAGGAGCCCTCAAAAGGGTCGCTCACACCTCGG TGTGACACTTAGACTTCCTGTCCCCTGACTGTCTCCCCGCTCCAATACCATGCAAACCTTTGTCTCTAGTAAAAG AAGCTC CATGTTTT CATT CAGTG GGGGCCAAGTGGGGT G GGATTTTTTTT CTCC CAAGCCAAACA C CTTTTGGTG CAACAGAT CTATGAATA CAT CCCAGTGGTGGC CACCTC TCCTGT CTACACAAGG CC CCTTTCAGGTGAGAAGTTG GGGAGCAGGGGG CACATGGAAAGGAT T CAATACATG GAATCCAAAGACAAAGTC TCCAGTCCCGCT( N ) xTTTGG GAGTAAAAAAACAGAGGG CAGGGC( N ) xGGGCGCTGTTAACATCAGAGGATGGG AG AAGGGAfiAAGGCTGAGCAI' GGGAAATGCTGACCTCTCAATCTAGTCTGTTCCTGGAAGTAACGACTTGGCCAACTGAGGCACGCTATTACACAG CCTAGC( N ) xATAGAATTAAGAACTCTGGTTCAGGGGCAGAGCAGGGGCTGTTCCCGGAGCCCTTCCTCTCCTTC CT CACT CT CCCAGCTCGCTTGCAC CTTGGATGGGTG CTGCTAGACGGTGCAGACAC CCACAGCCCCGG CACAGTT CAAGG ATATG AT CTTCGATTT CTG ATGCCATCTTGT CACAGTTGACTC CATAGT CCTTGT AATCAT CT AAAAGAG ATTCGAGAAATACAGGTATCAGCCAGACACAACGTAGGCAGTGAAGCCTGCTTGTACCTGGGACTAGAGTGTTCC TGTACCAT CCCT TGGGGTGCTGAGAAAGA CTGAATT TCATCTGCCCTCAGTGAGAT TGGGAAAAGGGACTCC TT C CCTCCCTCATCCCTAGCCCAGGCTCTCTCACCGTCCGCCTTCAGGGCTGCTGACAGCATCTCCACACACTTGTCC CG GACAGAGTCC CCTGTGAGATAG CAGGGGGC CAGGAGACACATGGAAGAGGCAAACGTGGGGGTCAAGGGG CTG CTAGGTGTTTTGGGGCTCTCCGCTTTTGATTTGCTG CTGTTTGATCTGAG GATGAC{ N ) xTTAACCAGCCACTGT GCGTTACT CATTATAGAACAGTGTAACACCAC CAATAATTATAT GCAATTGATTAGGCAC TGTGCTAGCTGCTCT CTATCTATGCTCTCCTAAATGTATGAACTGTTT(N)xAACACCTATGCTATAATCTATGATGTACTATCTAAGAA TTATCTCATTCAGGTTTTGTT(N)xCCATATGTGCACAGACATTTAGAAGGCTAACTATGTGGCTCAAT(N)xAA AAAAAT GCATT CTTGGCAGAAATGATGGAAGTGTAGAAGGC CAAGGTT TG CATG GG CAATGT CAAAGG GCTCAAT GCAGCTGGAGCGTGAGGGTGAGGGAGCTTGTAAGAAGGAGAGGGAGTTGGAGAGAAAGGATCTAACCAGACTCTG GGGTTT GTTAAG GATGGTAAAACAAAAGC CAC CATTTAC CACTGCTCTGCTG{ N ) xCTCCTGCCTCAGCCTCCCA AGTAGCTAGGACTGACTACAGGCAGATGCCGCCACACCAGGCTTTTTTTTTTTTTTTGTAGA{ N ) xGCAAGACTT TGTGATAGTTGCACAACTCCCTCCCTCCCAAGATGTTTTTGCACTATTCTTTAGGGTAGGAGCCAGGAAGTCAGG AAGTCTCTGTCTTCATTCCTCTGCTCTTCTCACCTCCTTTTTCAATAACCCATCAGAATCTGCCAGCCAGCCCCT ATATCT CTGTTGAGAAAAAAGTGGACTCAT CAGAGTTGAAAAAATTAAGGT CCC CT C CTG GAGTGAAATTAAGAC CCACTCGAGGGTCTTACCAAACTGAGCCCTTGTGGAAGCGCCCAGAGAGCTGC(N ) xGGGCCTGGATCCACAGCT GCTTCTCATGTTCACTTT CAGGCTGTGGACAT GCTACTT CCTTT CAGGAAGAAGAGGATT CTGGACTT CTGACAA AGTCTCTCAGG C CCAGGGAGTTGTGT CTGA GAGTGAGG CTAACTGAGTAG CCCAGG CTTCTCAGGATCAACAAC C TGGAGTTACCAGAGAATC CCATCCAACATC CTG CTGGG CACA GCTCAGTGTGGACT CTGCTGGTGACCTCCCTGT CCCCCACCTCTTGCTGTTTGGCCTGACCTTGGATCTCCTCTTTCACCCCAGACCCCTGGACCTTTGGGATTTTTG CATTCTCTAGAGACTTGAGGTTCCTCAAAAGTGCCTCCATGCATACCGGTGAGGTGGGCTGGGGATGGTAAACTC TGTTCTTGTAAGGTTTACATTTTGAATTTCTAAAAATTTTGTCCAAAGGGATTCATTTCCTAGCTTTTATTTATT TT TAATGCAGTTTGAAA CTTACTGTC CTAGAT TAAT CTAGCCTTAATT CTACCCGACCCTGC CTGACAACTG GGT TGCCTGGCAGCATGGTGGAGGGACAGGGAAGCTGAGATCCAAGCAGATTTCTCCTGGAGTCAGATTGCAGCAGGA GGTTTGGGTGAGGAAAGG CT GGACAGG GCGTGAACT CG CGGGGC TGTT GG CATC CAAAACGGATGTTC CTGC CC C CAAGCCTGATCCTGGTCACAGGGTTCCCACCTGAACAGTCACAATAGCCCCAAAACTGGTTTCCCTGTGTCCTCT TCC
> H s l_ 14614 50 61 -146 161 29 6
ATTTTT AGAG AAAG ATCT AGACAT CT CTG CTTTTTCTAG GAAAAATAAAAACAT AAAAAC TAAAAACC CCGCAAC TCTCAGGATAGACCGGATTATGCTGAGGTAACAAATTAACCCTAAAATCCCAGTGGATCATC{ N ) xGGTCATGTC CCTACTGTAGAAGACAAAGAGGAC CT CATAGCATTCTT CTAT GC CTAG GC CCAGAATGTACATTGT CC CACTT CA GTTTATAGCCCATTAGCTAGTGCAAGTCACAAGTCCCCTAAGTACCAGGAAATGTGGTCTTTTCTGTGTGCAGAA ACAAGAGAAGAACAAAAAAAGTATGCAAACACACTCACATCACCATTGCCTACATTAAGATTTTATGCTTCAAGA AGAGTGAAGCCAACAGAAAAGGCTGAACATAAAGGAAATCTCTAAGCCTGGACACCCGGTGGATCCAGAATAACC AAAACATAATTAAAATACTTACTGAGCACTTACTGAGTACTTTTCTCAAGATTAATGCAAATTTCTATTCTGCCT TTAATT CC TTGATTTTTAAAAATATTTTAA GATTTTTG CCAA GCTTTTA CATTTTAGGTTATAAAAGAATTT CCA
A(N)xGAAAAGAAAAAAGAAATGATCTTCCAAGAAGAAAGGTTGATACTAATTTTTAAATGTAAGGTCAATCACT ACTTGG TT CACATTAAATTTTCTA GCñAAT CT TGTGTT CCTCAT TTGCAAGATTTG CTTCTGATTTGCTAGATTT GCTTCCGGATTTCCCTTGGTTTGCTTCTCTTTGTTCAGAAGATAAATTCACACTCAAAGTAGCACAGGGCCTGAC ATAGATGT AAATGCTAAG CATATATACCAATG CTTAGGAATAACAAATAAATGAGTGTGAAATTAGAAGCAT CAC TAC C AAGAAT ATTT ATA C AAATAT CT AGAC TT ATAC TC CATG CT ACAACAATAAAATCCAAAATCAATGTGT CTT AC CTTT CG CATACGATCATTAATG CCAAAGGG CAGTACTGATTT CATTAAAAGAACAATTAGAAAAGATGAAGAG AGfiTTTTCCTGTTTTCCTCCAATTTTGAATCTGTCCTCCTTTTCCCATTTTCCCCATTTTCACAGATTCTTATGA TAAAGTTTTCAT TT CTCAAGGAGC CAACGT CACCC CAGACTCACACAAGGGAACACAACTGTGTTATCATTG CTT GT CTGAAG GAAGGGAAGT CACCAATC CTCCTT CAAACAGGAAAC CTTGTGGAAAATAATTGTGAATGTAGAG CAC AC CTGAGC AG AAAAAAAATG CTGACC AGAACTGTG C AATAAGTTTCTT CCATGTTGG AAG AGTTTG A CTTA CAAC TCAGCCATTCTCTGCTGCTTATATAAGAGCAAAGTCACGTCAAAAAAGAAGCCAAAGGAAACAGAATGTTCTCCñ TTGCAG GAATGACTGAAGTTTCGC CAATTAAACTTTTC CTTAGG CATGAACATAAG TAATAACCTAACATAT C (N ) xAGGTTATTGAAAGTGG AAGT AATGCTCCCAGTATTTGTGCAGGATAAAAAGAAAATATTAAATTATTTGTTTT TGTTTATTTTGTAACCTTATTATTTAGT TT AGTT TTTT GC AT TT TATTATTTA CAT CATACTGTATATAGTATAT TGGTAC CTATATAAAATAAATATATACT CACACACACATATGTT GAGAGCATA CGAATAG CAGTTGCAGCTCAGT TTATTTTTAATCAATGAGAAAGG(N ) xATAGTCCGGCGTGATCATTAAAGGAATTCCTTGTGTTCAAAGAGCAGT GCTTGAAAAC CAAG CTCATTTG TGCCTG CAGAAATG CCAG CCTCTGGGGTAGAAGAAATGT CATACTTTGAT CAT CTTCCACACTGAATTAGGAGGAAAAG CAGAAGAAAAGAAACTGAGG GTA CCAATGC CAGACAGT CACTAGGTAGT AAATGGAAAAGAAT TTAAAGAAAGTGAAAAT CAAAG GAAAATAT CACAACGT TTGGTTGACACC CTATTCCTCAC CCAGCC GCTTAATGTCTTTCTACAGAAA TCAGATTTCTGGTGCAGGCA GTTCATTGTTTGAACATGCAGTAA TGG ACACATCCAGATTCCTAACACTGGGGAGTTCAGAGTCAATCAACATGGATTGAGACACTTACAGCTTCATCTTTT GCTGATTGAG CACTGAGGGGTGTGTATC CCCATGCGGTCC CTAAGAGC TACACCCACGG GAGAAACATTTTATG T TGTTCAAAATACTTGGTGCTATGGTTGTTATCTTCAATTTTCTTAACTAGTAATGAGTGATCCTAGTAAGTGTTA AACC AATATG C ACT AGGCTCTCTTTAGG AT AT TC ATTACGGAAGGCTT TG AAGCTG AG AAAGAAGTTT AGAG TTT AGGGAAAATTTTTAAAACAACT CTATAT TATAGG CATT TTAAAAAATAAGTG CAATGGAATTCT TTAGAGGTAAC CAAGTTTCTCAGGATATCTTTGAGGCAGTGATAAAGATTTAAGTTTTTAAAATCACTATCAAACCCCTTTGTGCA ACATTTGCATAAATTCTCAAACAATGGGACTGTTTTCATGTAAATTTGTTTTCAGGGAGATTCCTGCTTTCTGAA GG CAGTGTAACTGTAGCTGAGTTACATAAAGTGACATTTTATTG CCTC CTGTGGAAGATG CTTTGCAC TAATGAT CTGAAATCTGATGTCTCAAGCTAGGAATTAGATGATCTGTGGCTCAAAGTGTGAAAAGAGAAAGTGGGGCCTTTA TGTTTTAGTTAATTACTT CTGG CTAAAATGTGGG GAGCGC TCCCTGGACCTT CTGGGACT CCTGGAAT CCGGAAA CC TGTCTACAAAAC CTAAAGGATTGAAG CAGTTGAC CT CATOA CAGGAGTGACCACAGG GG GAAAAAACAGT GAT TCACAAATCCAAGCCTGATTTCAGTCAGAAATTTTTCTGTGTTTCATAAATCTGCCTTTTTTCTCTAACTGCCCT TGCCACCTAG CAAG CCAGGAACAGAGAGAC CAAAAATCAT CATAAATATGGAATATGCAGTCG GAAATGTGAAAA AAGAGCTTGCTACC CAGGGAGAACAAAG CTTTCACCCCTCTGGGGCTT CACT GTGAAAAAGAAACAAC TCTG CTT GTTTTGTCTTATTTTCCCAGATACTT CTATTTGAAAAATATAGTAAAT CAATTAAT CAATTTTT CTCTCCCTACT ATCCACGCCCCCT CATTT CTCCAGTT CT CTAATT CT CTAATT CCTGAGTTGT CTAATTCCTGAGTACT CCTAGGG CAGTTCAGCTTACTACAAGAGAATTTTTTGTGGCTGTTCTCAAACTCCATATTCTGTTTGCAGAGAATGCTGTTC CT TAAT TCTG TTTGAGTT TCAAAGAGAAAAAG CT CATACCAG TT TAGATTA C CCACAATC CTCAATGCATAT CGA A CAAAATTGT CATT CCTAATGG CAAAGT TCAG GATATAAC TG TGTTAGTTAATTTAATCATTTACCATGAAATAC CAGGTGT CTTAATCTTATATATAGTT CTTAAGTTGG CTGCATATTTTT GTGCAATT GAATACTGACCCTACATGA AGAATACAAGAATTTTGTTTTTAAGTAAATGTTGTTAGAAATAAAATGTTTCTACTGGAAAAAAAAATATGTTAT TAGAAGCACTGTGAATTTTTCTCCAATGCTTGAAGAGTGACCACTCCAGCAAGCCTGGCTTATGTTGGTGGCATT CTGCCCTGCCACTACCTCCAGCACATCCACAGACCTTTATTTTTGTTATTTTATGACCACATGCAGGATTCTGGC AC CAAGGACT CCATTTGGACTTAGGAAAAGTAGAAGAG CACT CTGAAAACTG GACGTCGGGAATTGAT GAAC TGG CATCGCACCCCAACAACACACTCTGCTTGTGGGCCTTACTCTCCAGATACCTGATGTATTACTGGAGAAGCAAAT G CAACAAGG CAAAGATGT CTTCATTTTCTGTCAAAG CAAGGC CT CAACTATGGTGT CATTGAGG CCCT GAAT CT C CG CAGC CAGC CAGT CAGACCCCATCAACAG CTGCAG CAAG CAGG G CTGAGGAGGGGAATG GGAGG CGATAGGAG C CAATGAGGGCTGAGCCCGCAGCTCTGTGTCTGACTGGAGCAGAGAGACAGAGAGGCCAGGGCCAGAGGAAACCCC AG CAGCAAGGGAAAAACAGATTGACT CT CAGTGACT CATT TAGCTC CTGGGT TGGACAGATCTCT CCC CTGCTGG CAATTTGGCT CTGCATCAATAGTTCGAATACCAG GCTGAACAGC TT CAAATAATTAGGG G GAAAAATCTGCAAAA CTGCTAGCCATCCAATTCTTTACTCTGTGCTGAAAGACACAGTACAAATGACATCTAGGTTATTAAATATGAAGT CACTGGATTTGCTGCCCATTTGGTCCAAAGGCAGAAAACTGGATAAAACTCAAATTTATAATTCAGTGTAGATGG CCTGTTTGTC CTTG AGGCGATC CATG AATTTTGG CTGGGC AG GCGCTGTG AATGTGGAAGTAATGTCGTGG GGG A AGGGGGATGAACAAG GTTTGTTCCCCCCGCCCCGC C CCGACT CGGCTTACACCCCC TGGGGAAAAAAAAATGAAA ATGTCAATAAAGACTGTGAATGTGAAGAGAAATGGAATCTGAGCCCAGAGTTTAGTAGCTGGCAGTTTAAACTTT ATATACTTTAGGCCGACAGGCGTCTGGATCTC CTGTACTTTATAT C CAGCGTACAAAAACTGAAGCAACAG G CT C ATTTGGTATCAGTAAAGGTCAGTTAAAATTATAGAGACATTTTTTGGAGCAAATCTCCAGTCTTGTATTTGCATA TT TATGAACTAGAGATTAAGATAGGC CATT C CAC TTAT TAGAGCA(N)xTTGGCAGGGTGAGCCGGTGGTGGTGG TGGTGATGGGGCTTGGGAAACCAACAAAGACATTTAAATCAAGCCAACTGTTTTTACATTGGATATTAACAGTAA ATATGAAATCAGAGATTT TGAGGGGAGTTGGT GAATTC TTCTCCTCCC TCAACACC CCCC CACCAAATTATCTTT CCCCTGTCTTATCCTTGTCAGTATCAGGGCTTTCACTCTCTATTTAAAGGGAAGACTCTGGGATTTTAGGGCAAA GATCCAAAGTTTTGGCTTGGCAAAATAGTTTTCCTCACTTCACTGGAATTGTGGGGGTAGGGGAGAGCATCAGAG TAAAGCAAAGATGTTGAAGA( N} xCCATCACACTGGTACCAAGTTTTCACAATAAAATTACAGACTCAATAGCTG GTTCCATGCTTTGCTTTCTCTTTTTGGCCACTTGGGATATCCCCTGCACAGAGGCACAATTGTGAGTGTGCTTAG CTGCAT TCTCAAGAGAAGGCACTGCTAT CTAGGGACTC CTTATCA CATAGGAAGACAGCCTGCT CTTT CCAAAC C AAATTCTTGTAAATTCACTGGACACTAACCTGTCAAGTAACT CAGT GATATT CAAAGATC CTTT GGGATTTGAGG AGAAATTGGAAAAAGAGAAAAGAGGAAG CAG C TT GC TTTC TCAAGT CATTCATTCAAAAAAAAAATGAGTAT TGG CTCCTGTTATATGGCTCCT CGATGTGTTTTAGAAAC CACACAAACACAT CCT CAACTGGACAAC CATAGAGCAGG TTGAAAAATT CCTACTCTACCTGGCGGAGTTT CCAG CC CCTCTG CAGAAGTG CACACCTATGCAAAAACCTT CAT CAGAAGAGAACCTTAAGGGATAGTCTGACAGTAGGGAATCCCAATGTGTATAACCAGTGGTCCTTGCTATCTGAC TC CGAGAGOTA CAC TGAGTAATAGAAGCAACT CT CCTTGC CCTGTC TT CTAGGTTT CTAT TATACTTGGAGTAG G TAAGCTTTATTTTATTTGCTCAAGATATGTCACACCCCTGGACACACAGAATAAAAATTCAAAGTCCTCCTGGAA TT CAAGATGTGTCATACC C CTGGACACACAGAATAAAAAT TCAAAGTC CGA CTAGAGAGG CAGAGAATATCATCA CTGTATCACTCCATTTTACTTTATTGAAAACCTTAAAAGTAAATATAAAACGACAAATTACGGTGCTGCTATTGT TGCTTGCTGTCTGGGAAAAAAAGAAATGTTGGATAAGAAGGTCAAAATGTGCCAATTCAATGCCCTGAGAATGGT CACCTAAT CTTC TATCAAGAGATGTGAGATA C CATCTAAAGAACGG CTGCCCAGC CAG GCGCAGTTGGTCACAT C TGT(N)xTAAGTAGAGAAAGAATGACTGCCCTCAATGCCTAGGAGCTTGACTCTGCTTTACTTCTGCCCTTTCGA TTCTCTGT GAAATGAACAGGACTTATGTAATTGCTTTCAG CAGCAT CTTTGAATTACAATTACATAATAAGACAC ACAC CAG GATGTGTGTAATTGTGTAATAACAG CATGCCT CACTGTG CTAGGTTGGTGG CCAATAAC CACTTAGT C ATGCTATTATATTATAAGAACATATGCCCT CAAGGCATGT TGAAATTATT CTGG GGAT CTCTAAAGGAAAGCTGA CTGAT CAAAAGAGAGC CCCAACTTAAAAAG C(N)xCTGATAGTAATCCCAACAAGAAACAAAAAAATTACACCTA CCCTAAAAACAAACAAACAAAAAACCTGTAGTGTAAAATCAAAATCAAAGTGTTGTGTAATGATTCACTTACGTC ACCACAGGATTTAAGATGACAGAGTGTTAATAAGGAAAAAACAAGAAGCTGGCCAGAAGTCAGGTGTCGTGATGC CCTCTT CATAGTTGGT CAGCGTGAAG ACTACAGAGGAGTT ATGTCT CAAC CC AG AAAGTTCTAG CTTCTTTG GAG CCTACTCATTCCCAAGAAAATGACACCCCAAAGGCTCTATAAACTTTCCACACTTTGAACTTTTGTGTTGTTCTC ATAGTCAT CAGATCAAAAAATAAT CTCAGAAATGTGAGAT CA GACTTTTG CAGTTGAATCTCCC CAATGATATTT TCCTTACAGTGAAAGTATGTGCTAGGATAAAT TGCCCACACTTTATTCTTACAGCA CAAGATACATATATGTTAT ATATATAG TATGTGTA CATACG TTTCAATACCTAATTTTT CTGTATATG CAAACTA CATTTCCATCTCTTCCACT CTTCTAAC TCAT TT GG CAGGAAAAAAACCT CATG CAA CAAAAATAAGG CAGTAT GAGATGAGGGGAGAGGTGGGA GAGATACTTTTGCAATTTATAAAATATTAGTTTGTCACTTCTTAGATAAGGCTGTATCAATACAAGTTATAATAT CGAG GTAAACATAAATGTATATGTGTAATATACT TGTAAT TCACAATTGATGTTTTGTTTTTTC CT TATGCAAT T AGAACCTTAAAATATAGAAGTTTTGATCATTAC(N)xTATACATTTATTACATTATTATATCAACTGCACATTTA TTA CAT CAACAAAC TT TTTTTTAAACAATG CT GAGTATAAGTAATGAGAATATAATGGAGAGGAAAACAAGG TC C ATGTTGTTTTAGGGCTTATAGTCCACCAAAAAATAATTTTGACAGGACATGCATAGGAAAGGCAGCAAATATAGT TGTGGGTTTTAATC CATTACGTATGTGGCATTTC CAAAATAC CATATGTAAGCT GATT TTAAATAT TTTCAG TGA TCTTAAAGGACATCAACTGAGTAT TTCAGT CACCTTAACAA C CAAAAGTATTCAATGATTCAA CAAATGAC C TG C AACTAATGACTCAATCTAAACC CAATCTAAA C CT CCATTGAGTTAAAT TTGAGGAAGTACCTA CAAAATTATTGT TACCTCTCTGCAGG{ N) xACCCTTCACACTGCCCACTGAAGGCTCCACTTCAAGTCTGCCCTCCTCTGGAGCCTT TTGTTAAATCTTTG CCAGCACAC CATTCGT CAGGGTGCAC CC CCAT CAA CAAGACAGCTTAAAGA CATTTAGAAA GCAAAACCATGACAGAAAATGTCAAAGGGGAAGAGAGGAATATACAAATATGGAAATAAAATAATAAAGTGGAAG ATTCTGATGTAT TAAG CCAGAG CAGCAGTT CT CAAATGTTATGGCCTATATCCAAAT CACC(N ) xATCGGTATGG CTTTTTCT CCTAA CAC CAGTTCGT CTCCTGTG CT TACCAC CTGGTTGAGGAGGGGTGGAAGTGCATGGCAGGTG T GGCTAATGCCATTGGATTGGCAGCTCTTGGTAAAGGAAATCGATGGCCATGTATAAACAATGTTTAAGCTGAAGT CCAGTGAAATGGTCCCTGGGGACCCACAAGCTGCTCACAGCCAGGAAGGCAGCACTGCCCAATGTTGCCCCCACC ATGGAAGAAAAAGCATGCCAAACACCTTCATCTAGCCCTGAAGGTATTTTCACTAGGCCTATGTAGAAGGTAGTT TGGCTAAAACAG CTTCAGTTGG CATATGCA GAAAGACACAAT CTTA GAG C CACAGTGAATTATTTGTTTGGAAG G CAGGAC CCTATTAA GACACTGGATGTTGCCTTAT CCATAT CTGGAT CACCACAATG CAAACCATG C CCCTGGAGA CAGAAATCTACAAG CCACAAAGACAGGGAGAAAG CTAACATGACATTA CTGGGAAGGTGAGTATACACTAATTAG ATCT CACAAAATAAGGATGATAGAAGAATT CCATTGTGTT CA CATTTCTTACCCTC TGTAATGTTGAAGATC TGA AATTTGGTTGTAAATACATTGGTAACTTTTTTGT CCAAAT TACATT CC CAAGAAACAGATAAAAAC TGAACACAG GTTGAAGTTTGGCTGTTTTATAAGTATGCAGTTACTGAAAGCACTGGTGATAACACTGTCAGAAGTGAAAAATTA TAGATGATTAATAAGAAATTGTTGAAAGAAATGGTTCTTGCCAAACAAAACATCAATGAGATTGAAAACAATTTT CATTGACAAAACATGTGCCTGTTGGGCATTCCAGAGGAGTGTGATGAGAATAAATTTGACTGAACCTTTAAAAGA GTTTTTACTTGTTGGGTCAGATCTTTCTGCTAAATCCCTCAGTGTCAAAATGCTGTTTTGCCTTTCCTACCTGCT CTGGGCATACTCCGTCACAAACTACGTTGTGTGAATCTGAGGACTGCTTTAGAAATGAGTTCCCAAATTGCTCAA GAAATAGTTCAGGCTGTAATT CTGAAAAGC CTTCTGCT CACT CAGGTC CTGTGGGGGGAGTTTAGC CAATAAAG G AAAGAAAGAAAGAAAGAAAGA(N)xTTAAACTCTTTCAAAAGCAAACTTTGGAACCATGACAAACATGGAAGCCA TGTGTATACCATGCTGTCAGCATTTCCTGAAGAAGCTGTGTGTGGGCTTCAGGGAGGGCTCTGGGATGATAATCT GGAAATGATGAAGAATTCTTTTATGTATTTTATTGTAGTACAGGTCTAATAAAAGCTAAGTATTATAAATGTTAA TGGT GCATTAAT(N)xCTAGAACCCTAATTCAAGGAAAACATTCTATAAAACTTCTAGCCAAATGTCTAA(N)xG CTTTTAAATCACTAGTCTGAGCATAGTTTAGATTTAAATAATTATTCTTTCTCTCTCTTTTTTTTTTTAAAGATT CAGATT CATAGC CATG CGATGAATGCTACTGT TATCAT TGACATAGAT GACCAG CC CGGAAATAAGATTGTATGA ATAGTC CATTTAGTTAAATGTT T CGGCCACTTAGATCTATGT GAATGT GAGCATAAAATGTGGAAACACTTAGGA
a a t a a g c t t t c a t t g t a t a g t a t a a t a g g g g g t a a t g a g a t t c g g t c a a t c t g t c a g a g a t t t a a a c c t a c c c g a TCCTAAAGAAAAACAGGCCTAAAATATTTCAAATGTGAGGTTATATTAAAAGCCACAGTTTCAAACTGTGAAGCT AAAGAAATCAGTTGTTTCAGTTTATTGTCTCAGGGAAAGCACAGTGTAATACCTGAAGGACTTGAAAGAGAACCA AAAG GAG G GGTAGC ATTCCCAG CAGCTAGC ACTTTCAT CC AATGCC AGGG CTTT CATTGGCCAAAT TATTTT CAA AGATACTG CCTGAC CGAAGCACTT CTGCAATAAAAGTATCTCTGTGTTAGTCAATGGAATGGGGGAATGTTAAAA TTGAAAGAGGCCTTTGAGACCAGCTAGGTCAGTGTCC(N)xCGTAACAAAAACAAACTACAGCTCAGGTCAACTG ACCATCAGTATAGTATGCCTTCTAATACACTACTGTCCTCGCACTTTTTGTATTTTAACGTAATGGTTTTAAAAT GCACATGGGAAAAATAAATATAAACTATAATATAGAATACAAAAGCTCCCAGAGAAGAGTTTACTTCCTATTCTT GTAC TC CAAGTC CCAGTTACTCTC CTCAGATATGACCACTGTGCTC CT CAGATATGAC CACTGTGC(N) xTATAA CTACTGTCAGCAGTTC CTCTGATAGTCTTT CAGAGATG CT CTATGAAC TT CTAAGCAAGTGTATATATGTGTGCA TGCATGTGT(N)xAAAGAAAATATAGTCATTATTTAAGTTTTATTTGTGATTTTTTTCTTCACTTATTAAATGAT AAGTTCCTTGAAGAAAGAAACCATAGCTTCTACAACTTGATTCCAATGCCCATGGCCACACATATCATCATTCTT TAACAAAGATTGAATGACCAAAACTGTGCTAAAGACTGAAATACAAAAATAGATATATATTAT(N)xTGCATTTG CTTAGGGCAGGTGG G GGTCTTT CAGGGCAGAGGAAACAAGA CTTAAAG CT CAGGGT CCACCTTAGCTTAAGACT C AGATAAAC CACCTG C CACTGTAAGGTAGAAC CTAGCAAGG TTATCACTTCAGGG TAAGGGTAAACCAGAATTAAA CCAGTCCTT
> H S l_ 1461175 70 - 14613 58 31
GTTGGT GGAGATTCCTT CAAGGACAAAT CATGACACAAGGATTGAGAACACG CTTGGG CAGAAATC C CAGAGAG C AATGAGAG GGAATGGTAAGTGAGACAGGAGGGA CAGGAGACAAACCAATAAAGT G G C T { N) xACTGAGCACCTCT AGCTTAGCCCTCAATTTACCATCTTCTCAGGAGGCGAGGGCACCCTCTCAGGCAAAGGGACACAGCTGTCATGCC ATCTGGCCTG CATGAGAC CT C C TGGAGTAGGCAAGGAGACAGAGAGGGCACCAAAGGTATTCACTG CATGTACAG COA CTT CTTATAGCTGTTGAGCT CACACAGTGATGGAGAAAAAT{ N ) xGTGGTGGTGAATGCTGCTATATTTTTT AAAATCAAGG CAGGCCTAGGTT TAGT GT CATAGC GAAGGAGC GAGGCACCATAG CCAGAAGCCC CTGGGTAGGTC TGGGAAGT CCTAGGAATTGG GAAAG GTGAAAATAGTACAGGTATGGGGCTGTAAAAGG CAAACTTCATGT CCTG C AGTGTT CTGCCTGTGGCTGGTC CTGAGGAGCACTGGTGGG TCAGTTTGGC CAGACCTC CAAAGAGAGC CACAGGA AAAAAAGAAGGCTAGAGGTC CTCATGACCCACCTCC CAAGATACAAGCAGACAC TTGAAGTGGAAAAT CC CTGC C ACTCTGTAGGACAAGAACTACGTGGC CAAAAAGG GAAG CTG CAATAAATAGTTCTGGTTTAACCTG GGAT GCAT C ACAG CTTC TC GAAAGTTTTG CTTTGACT TTAGG C CCAATT CAACCAACAT GAAACACC CACAAATTAACT CC CAG GAAAAACATGGC TCTTTGAGAAAAGAAC TGACA CTTAGTAAG CAGCTGAG C C TGTCAGTCCAGGTTGCAGGATGG AGCGAGTGAGCTCTCGGCGGGTACAGTGGCTTGCACTAGCTGAATTCATTACTATTCCCTTTGTGCAGCCTTGTG CTTTGAC CACTCGCATATAT GGAATAG GACTTACAAAAGAAAACTGTTTTGTTTGG CAG GTGGGAAAT CATT TCA ATTGAGATATTT TTTCTGTTTACT CAAGTAAATATAAAGATG CTTTGATT TT CCACAAAGAAAGAGTTGTTATC C CTATTT GAAT TTATGTTG CTTCAAAAACAAACAT CTTACTTTTAGTATCAGAAG CAGACCATTCTGTT CCTTAGG GAAGTGATATTCTTTAGTAGACTGGTGGGAGACTTGTAGAATATTAAAAGGATAAGACAAATTCCAGAATAAATA ACAGGGAAAATTTATTTTCTGAACTCTAATAACTATGATCTGTTCCTACAGTAGAATGGGCATTCAATAACTGGA ACAT G AAGGATTGGCCTACAGG G AAAGC CCAGGGTGGT CTGGGACCTCCCTGAACCTC CAGCAGAT CT CATGAGA CTGTCTTTAATTATCCTTTTCTCTCTCTCTCTCCCTCTCTCAGGCCTACAATGAACACTG{ N ) xTTTGAGTTATG TGAT TC CTTGTGTAAGAAAT CAGATATAAGAAAACACACAGGTATCTGAT CC TT TA TGGAAAAAAAAAAT CAAGA ACAATAAACAGGATTTGAATAAACTC { N) xTAATGCTTTAAAACCAAAAATAAATAAATGAAATGGGTCTCTTCT ATTAAGG GTTTT CTAGAAA CCTCCATCT TTCCAC CACT CT CCTATAAAAGTTTTAAAACATTTT CAAAAG CCACT TAT CAC TATT CTTAGACTGC CAGTGATCAAAAGGGCAG CCTC GCGGGTTG CAAATTAG TGAAGAAGAGAAAG CTA GAAGGAAGAGGAAGTGGAGCAT CT GAG GGAAAGATTTTTAAATGAACTT CTTTGTGTTGCTAAATT TATT CGTAT CCTGGAATGGGCTACAAC CATCAC AAGG ACAT AAAAAT CATC AGGT AAATTC AAGG ACTTTTTAAAAATG CATCT AATCACCTCCTGTCCAAAACATTCAAGCTTAACCTTTCTCTTGAAAATGCAATCTGCTCTTTACACCCTGCCTGC ATAC CAGATT CTGAGACCTAGATTGC CACAGGAG CAGAAC CATCACCTTGTC CTGATT CAATAGGTTCTTGTTGT TACATTAT CAGCTTAAGATC CTGGGGTATGTGGAGATGTG CCTATTCATTACTGAAAGAGTTCA TGCACAATTTA TGTC CTTCACATAACTTTGT CCTTAAAGGCTATTTT CT GAAT CCATTTCAGTTATGTTTTCCCAACAG CCATATT AGGAGGTGACTGTTGTTTTTCCATATTCAAAGTAAAAGAAATGAGACAAATATCTCAAAATGATTGAGGTGCTCA CAATAAAAATGTAGTGGTAAAGATAGTACCAGAT CT CGTGTCTGTAGCTT CTGTG C CCTCTTTCATAGTC CAGAG GACTGAG C TACT CTAGCAAATGGGTATATGAGAACTG GTGAC CACAGTTAAT CCAG CTGACATGA CAC CAG C CAT TACATT CAAGTG CAGTGATGTATT TTGCATTGCTC(N)xTCCTTTTAGTAATCGTAACATCATGCCATTCAATAT A TA A ( N > xATAGCAAAATGAAGAAAAATTACAAGTTTCTGACTATGAGTCCAAAAATGAGTTAACATGTGATTTT ATGCTAGAAGATAATCTAAT GAGG TCACTCTGCTTAGCTG CTTAGCTAC CGCTGTTAT CCCAATGGGACT GG GCA AGAATAGAA C CAAGTCATGCAAGTAAGAGTAGCAC CTGAT TG GGCACTGGGCTTG GATTCAGGTTGTTAT CT CTA TGGATAGCTGTGGTTCACTGAGGAACCATTCCAAAGATGTTAGAAGAGCCAGACAGAACCTACATTAATTCATTG GTTAAAGCTCAGAAATAATCATGAGGTCTCTGAGACAGGAGACAATTAGACATAGGATGGCTCCTATTATACACA AAATTT CATTGAGTCAGCTCTCTC CG CTTTACTT CTTG CC CTAGTATATCTTAAATATGTGGTTTT CCCCTTGCC TAAT CC AGG AGAGCTGGATT TC AT AACT AACAAG AC TT T AG ACTTGTGCTTC AC CTTT C AGCATGCTT CTGGGTT CCATGATGT C CCAGAGCCTATGTG CTTATGATTAAAAAAAAAAAAAGAAACTGAAGAG G GACCC CAGT CTGCAGG AGTGAGATGGGCATGGGAAGAATTACATTTTCTCTGGTTCCTGGGTATGCTGAAGATGACATCCAGGAGAAAACT TCTGGC CC CT CAAACTGC CC CATAGCTCAGCCTG CGGGCTGC GACTTACAGTGACATCAGCAACATGATGGATT C CATCGCCAGCAGCCTGGACTCCAGGAGGCACCCCAGGCGTGGCCGATGTGCACAGGTAGGAGCATGTCCTCTGCG TGGC CT CTGCACTGGAAGAG CTT CAAGG CCTTTTCCTCCAGGGCTGAGGGGATTTGTGTTTCGCCTCTGTTTGTT TTTGTGTACTTCTGTTTGG CGAAACTTT CAAGCACG CAGAGCAACGTTCTGATAGATA GTGCAGAGAG GAGCTG C GCAG CATC CTGGAGCCGAAAGTTACAAGGAAGAGAC CAAAAC CGGTAAAGTCACG GATGTTGTGG C CCGAAC CTG TAAAT C CAAACAATCTCACAGT TATGAAATGGAAAATT TG TGGGGAAAAACAAATG CTTTGGTTATTACAGTGTT ACTACATTCATAAAATGTATTTCATTTGGGGGAGTTTACACTGGGGGTAATGAGATTTTTGCTTTATGCGTTATT TATAAC CG CTTAATCCACGTACTGTGGGTTTTTTTAATTC CTTCATTTTCACAG CTACAAAACACTTCAG CATTT CTAATTTATTTCTGCTTACAATACAGATGTTCTTAGTTGTGTTTCTAACTATATTTCAACCGTTACGATCACGTT TGTT TGTGTCAG CGCGCAAG CCGG GGTTGTATGTGCAATACACAATAGTAAG GCTT CAAGAAGC CTGTAAAAAGG CCTCTTGTGCTTTCATTATCAACAAATTAAAATATG CTGCATG GCACTAT CTTTAT CATCAGTGGTTTTAAT CTT TGGATAATTC CC CTGCAGTTGG GTAG GCAGAGTGACAG C C CT CTCTCAGTGTAGAAAGACTGCATG C CTTAAAAT GGCACTTTTCAAGATGA C CATTTG GAAG CAACAT TT GGAGAACATTTGGGTTAT GT CCTCTAA CTGAATTT C CAA GCTGTTG GTAACGGACTAAAGAAAATGTAGTCTGTACTGAATATTAAGGAAG CAAGAGTGTAAGAG C CTTGAAG 3 TCATGTGTAAATATTGACATTTTATTTTATTTAGTAACCAAATTGGGGCCCTTGCATTGCTTATCTTGCCATAAA AACT CCTCACTCACTTTCACTG CC CTGCAGATTGTCAG GGAAAGATCTGATCTGAG CAGCTCTGGGTG CT CTTC C TG CCATTG CTAACAAGA C TG A A TTTTG T TTG T GCAAAATCTGAGTCAGAGGGGT GTTC CT CC CCT C TCA T CT C CA CCACGGACTGATGCAC CTG ATTAG ATAACAG AAATTTG TG GG CT GAATGT T CAT GTAGTGACATGAAAGC CAAGA TG CAAAAACAGT CAGAGAAAACAG CCTCCCACTCACTCAG AG T CAAGT CAGGTGT C TAAAAC CAAAG TTAT C TGG G GAAC CAATAAC TG G CTAATTTG G CCAGAATACC CTTC TC T G TG TC C T CAG CAG CAGTGTGAG GAAGAAATACCA GCATGTCCTCAGGCC CTGTACC CTTCTCAGAAAC CAGGACTCTACCTG CAGAGCTTAAT CAGGGCCATATTCTGA AGGTGAGATG CAGG CAATGG CCAGGTCTCTCCAC CAGCAG CAGTGG CTAGAATG CC CCACA C ACTTAACAA CTT C CGGGACACAT TTG ACTG T TGAC CTACAACAGCATTG CTTTCCTCTG G CAAAG CAAAGAGCTCTGCT CATGTAG AA TAGG CTGG CAGTTTGGTAAC TTGC CAGAGG CTC CTG TCTC CCTACAGTGAGGTCAC CAGAAT CG CTTCCCTCCTG AAA CAOAGT CGATTGAAATGAAG GT C CAGCAGAAATTAG G GG CCAGTGAGTTAAACAT G GAGGCCACTTC CAAT G G TCTTTAG C CT AAGAG G TT TTC TC CAAG TGAAAAGAAGAAAATTGAATATGGGAATAAAATGATCT TT TA A T G C T TC A TA A C T CAACTCTC TG ACTTACC T CAAAAAGAGAGAAATA G TG TAAAAAACCAA CACTG TTTTCTG CC CT CTA AATTTG AG TTTTGAAAGGTG CT CCTCAAATAT TAATGATCTGGT TTCTG TGC C CAAA TAAATGTCATG ATTTACA ACTCATG CTTATG G TTG CAAT C CTA AT CACCCTTCCCTACGG CAGCTT TATAAG TCTTG CCCA GTTCO ACTACTG AAATAGTG CA CTTG G ATT CAAACAACAC T CAAAAAC C T T G T T T T T T G TT TTG T TTC A TTT TT A A C TA T TA C T G A G TGGACAGGGT CAAT CC CATT CAAAATTC C CAAGTAGACAAAATCTGTC TG TG GC CCCTCCC CTTC CCCTCAC CTT CTTGGCCGCCTCCCTC CATT CC CACTGATGAACT T C CAAG CTTCCTA CAG CT CCACTAGT CC T T T T CAAC CAATG GAAAACAAAG CAGT TT C C TACATTAG TG TATCTG C CTT CATAAAGTAGTT TAG TG TTTTG CACAAAC CAAGAGAA TG T T CATT TTAAAAGCAG GAAAGAAAGATTACAGAT GGAATGTCAGACAC CAGGGCAC CTG G ATAAAATATAATT TGGGGCAGGG GAGTATAT TC TC A T CTTACC TAAAGTAGA C GAAC CAGT CAAGTCAGTACC CATAAG GTAGGCTTG ATA AATCTCAGGACCAGGCACCTCCACCCCCAACAAACACACACACACCTTTCCACTGTGATGTTTTAGGCACAG
t g t a t t g t a g c t c t g c c t a t c t a c t c a g a a c c t a a t g t a t t c a c a t g a a a g t a a a c t a a c a g a g t c t g g c t g t c c TGGT GATT CT CAAGA C C C T A TT GCCTC CTCACTTACC CAAAAGGAAAGGAGCAG T A G T T T T G T T T T T T T T T T T T T TAATTCA AG AAATCAG CA AAAAAAAAAA AAATG TGTTAG TGTTTCTCCAA AACTTCTAAG TTAG AG TG G CACA AT TC A TTT TTT TT A A TA A A G A A TTT G A TT A A C TC T G G T TG G C C C A G A T TT G C TTC A G C C TA A A G C C C A A C A TC T C TT TTG CAG G G TG G TATCCTTCAG TTG CTCATAC TTG AG C CAATAGAGGTGGGAG GGGAGATGGAGAAGAC TGGT T T T A T TTGAGC TT GTGAAAATGG C T T T T T A T CAAGTT CAATGT CATAAG TT TT CTTC CACAG G AAAG TAGAATATATT T T CTGGTGAG CAATAG TT TT TT A TTA T A TA G A A A TG G TT C CTA A TT CTTAGCAGACAG G GATTTGAGAAAACAGG GGAGTTCTCC GAAGTCACAACAAATAATGTGATGGT CAACGT G ATAGACTATTGTACG C TTTG A TA G C TA G TTT C A G ACTG A TTTTCTG A CA CATGAGAGAG GAG CAGTGC CTGTTGCC CATG GAAGGCGATTTG CTGGGAG GAGGAACA C A A AC ATCCCCA AAG G CTACCAG G TCAACTG TCATTTTCA AAATTACCATTAAAAG TAG GC TG G TG ( N ) xC A A C A A C A A C A A A A A A A A A C A A A A A A A A A C C C (N )xA T TA T TA T TA A A A G T A A TA A G A A T A G A A A A C A T A G G C TG T G T A T G CTAGATTAA CAAAACATTC CC CC TACAAAAGGGGGAAATTT CA A T TTT C TA T G A A G TT TG A A A T C TA TA A T CAT CTGG C A T T A T T T CCAGGCATGTAGAAAAAAAT T C A T T T G T C T T T T C A T CTG G ATTAG TTTG T CTCCAT CAGTGTG TAC ATCAAAA GTA C CT GG GGATTC TGTGGGTTTG G G G AACATTTTTTT G T T TG C TT G TTT G TTT G TT TTC A A A A T A TA TG G TAG TAT CC CACCACAAAT C TA TA C TA AAATAAA ATTTCG CAG TG TATG TCA AATAAATG T GCACTTTGA A A A A T T CC CCAAGTAAAT TTTGAGATTGGC TC T TG A T T C A G TG AC ACTAATATAAA CACTTT CACT TA C T T T TGA A TT TT G TG GTGACCGAT CTTAAGATTTGAAGC CTACACCT TC TTA A C TTG G T CCATGGGG GCACAAAAG GAGAGA GAGTATAAATGC TGAGGATGAC CT TAACAAAAGTGAACACGAAATTAAGAGATGTGAACC CAAAATAAAAGAAAG CAAAGAACTCTGTC TT CC C TTG TCAT CT TTA A C T CTG CTCTATC TTG A TT CTTCTC T G TG TT TT TC A C A G G C T C T T C T T CCTATACC CACT CTGTAAAG TG ATTTACCC CATAGC TC T C C A C C C C TTT C A C C T TGGAT CTCTG GT CAGAT A A T CTCACCTTCCTCTG TG G CTT CAAAG C C A T A T A C A T T T G A C T T C ( N ) xA C T TC C A G T TA T A TA A C T TC A A T G C T(N )xG TG ATAATG AG G G G G G TCTAG TTATTACAG G CTGC ATTC CCTG ATAG TG TG TC CCTG CCACA GG AA GG G C ACAG CAGAGG CTGAGTGGAC CACTAT CGTGCCTGTC TTTGGGGAAAT C CAGCATAAGC CT C CAAAATC C C T T T T C CT C TT TT A A TT TG A TG A CATTAATGAAGGAATATGAGCTCACAGGGATTAAATCAAGATGATTAAAACAAAGAAG T A GAAAATAT GATCACACGT CTA CAG TGTGTCTACAAACACGGCATAA TCAGATATGAGAAAAAGGAGTT TTG T G AAATG T TAGAGT CACACC CT CC CT GACGTGAG CAATACAGTG GC CAGCAAGG CAG GAGAG GATGTG CTC TG TATC AAGAATGTGAGG CCTACGGTAAAG CT GACCGGTACAGGTG GAGCAAGACCAACTAAGAAG CA TC TT TC CCTG GAG AAAACTG T CACT CATCAAGAGGATGT CCATGG CAAT CTAAGT G GAGATTTTCC CAAAGG GACTACAAATG T CAGA ACCCAGGT G ATCTGACAT CAATGAAT CT TC CT GAAAGC CCCTTGGGATGT CATCTGATGACATGAGTTAG CATTG
G (N )xA A T T T A A C C A T G C A G T T T A A T T T A T G T A A C T G T G A A T T G T A G A C T C T T A T T T A A A C A T G T G G T A T T A A T A T T A T ( N ) xG TG G AG G TATTCATATTCCAAAATACCACTG G GAGATACAG AG G AATAAAAAATTAAG AAATTCCAT A CAT CAGAAG GACAAGAG CT CTTGAGGTTCTTTCAGTGGACA GTG G TTTTAG ACTTCACTG TGC AT TAGTTC TGT C CAAGATG CTTA TTA C A G A T GAAGGCTCTCCT TCACTAGAAG CCCTCCCCAT C A C TA C TC TTG A TT GCATGGTTC AG TAGGACAATACT TT GAAAAACATTTC C CTGGTGAGT G AATTCAAAC CCTCAGAAGAGGGATTCCATGCTT CAT A CAACACTATGGAGGC TT TG A A G TG TTTA C CATACTTGGGTT C T A T ( N ) xATTTG CC AATTGC CCAG AAG G ATG A AAGC CT CACAGTGCAAT CAT TGTCAAGT CCTG CTGAGCTGTGTG GGAGCTGG CTTATGTG GAACTG C C TGAG C TT CT CAG GGGCCTCCTCCTGGCACATCGAGCTCAGATGAAATGGAAGGCCTCAAAGCCCCACTC
> H s l 1 82 0 86 2 65 - 1820 99 4 22
TTGTTGftC CATAGTTGCTTCT CAAAAAAAGCT CACCTGTTAAGCTTAACAATAGGTACCAGTATTGA{N>xCTAT TATAAATTGTTTTTGAGAACAACTTAAAAGAAAGACATGTGGCTGTTGCTGCATGATGGACAACCAAGTTTTTTG ACCAGAAAACATTTTGCCCCTTGCAGAAAGAACTCGGGAAAGCAGCAGGCAGTGTGGAATGGCAATGGGAAGATC CAAAGGCAGCAAAGGCTTCTCTCCGCCATGTGAAAGGCCTGAGTTTAGATACTTCAGGGAAGTTTTAATGCCAAC CTTCTGTAAAATCCCCATTCTC(N)xCAGGGGAGGAGCAAAAAGAAAAAACGTTTATTTTTTATTGAAAAGAAAA ACCCTCATTCTCCACAGCCACATCCATATTTCAGAATTGTTCTTAGAAACTTCCAGAAACAATTCATTTACTGTA TATAAC ATTATGGC CT AG ATT AAACTGATC AGTAATT ATTAATG CTTAACGTTAGTGCGG AAGCT CAAAAATTAT ATACTAGAGTGTTTGTGTAACTCTAAATCTTAAGATACATGTCAGTGATACCCGAAGCTTCTTTTTTTTTTTATT TTATAGAGAAAACATAATTGACAAAATGGTCACATACTTTTTAAACTTTGTTTTCGACATCTTTGGGAATTGTAA CCATTGAACCCCATGTTCCTGATTCAAGCTCCTCTGTGCATCTCTTAAGAGTTGTTGAATTGAAGTCATAATGCC TGGGATGGAAGAAAAGGCTTCCAAGGGGCTGATCTCACAAATCAGCGGATAGGCTGACCAGTTACTCTGTTTGTT CTCAGGCATGTGAAAGCGTTCTCTGGTCCTAAACATCCTGTTCTAGTTGTGCCCCATTTGTTGAGGGGTTGAGGG TGGCATTGATGCAGTCTCCATGCTTTAACTAGGCTGCTATTACCTATTACCTGCCTGCCTCAGCTTCCAACTACC AC CC AGGTTAAG AGGG CATGTCTGATGAAG AAACCACCTCCCTTAGTC AAGATGATTC AACTTC ATTCTACT CT C AT CCCCTTTCTC CAGG TGGAAATCTGAGGGTCTGAGGAGGAGACAGAGAGAGAAAGGGGACGCAAAAAGG CT CAA TTCTCAGGCTTCTGTCCCAACAACATCAGTATTATCTTC(N)xAAACAATGTAGAAAGACTGGTCAACGTGAGTT TCATTATTATCCAGAAAGCCAGCAGGAGTTGGATGAGCTCATTCACAGAGACCACTGGGTTTTAGGAGCAGGGAT TTTTATCCCTCTGGACTGGGAGAGTTGGGAAAAATGAAAGGATCCAAGCTTTACCAAATTCAGACTCACATTTTñ GTTCCCAGCCAAAGGTCATGTTGAAGCACAGTGAGAAGCAGTTTCCCTGAGGAAGGGGGTTGGACTTGGAGGTTG GTTCCCCTATTTAGATTCAGACAATGGGAGGGAAGTTTATTACTTTTACAGCATGGTGACAATGCAATGCTTGGT AAAAAACAGTTCTCAGGACATTTAGAATTTGTTCTCTTTCCCTGCCATGTGCTTATGGGATGCTCCCATAAATCC CTTCTCTTATCTGGATACAGGGACCTATTGACTAGTCCTGCTAGGGATTGGGATCTGGAAAAAGCAATTTTTTTC TTTGGTTTGTGCATTTTTAAAATGCCTGTGCAAGGACTAAGAACCTGCTATTCATATTCCCAGGAAAAGATAACT GTATCAATTACTAGATACGGCACCTTGGTCATCAAGAGTGAGTCCAACGACTAGCAGGTGTCTTGGTTTGTGAGG CGCCCCCTCCAGCATTAATGGGAAAAGAGATGAGGGCCAG{ N} xGATGAGGGTCCAGAGATTCCAGTCTAATGCC TCACTCCTGGGTCGATCTCACCAAATAGCAAAGGGTTTCCTGTTCACCCTTTCCATACTGGAACAAGTGAGAGGT GGAGGATGGTGAATCATTTTCCAAAGCCAGAATTGGCCCTCTTGGAATCTTTGTGCATTTCTGATGGAATGTAGG AAAATTAAGGGAAATCGGGAGGCTCCAGGGATTTAGCATGGTTCGGGCCACAGCACAAGGTCCCATAGGCTCTGA AAGACTAAGCTATAGCTCCAATACAGGCGTTATCTGACTCCAAAGTTCAGGCCTCTTCTACTTGAGATCTGAGTC CCTCACACTGGAGCATGGTCAGTTAAGGCTGCAAAAGGGAGAGAGTAATGGAGGTCAGCGTTGAATAGATACGAA TATAAACTCTTGGGCACTGTTAACGCCAAACAGCCTGACATCTTACTTCAGATATAACTGTAACAGTGTCCAAAA TC CAAT ATTAGC ATTT AAAACTTGTAGC AC ACTT ATTGTGGAGT CATTGAGTGATG CAGAAAGAAAAAAAAATAA AGCCCTGAGAATTCAGCTCATTTACCCAAGTCACTGACGCATTTAAGGAACAGCTCTTATAAGTCCATTTAGGTG GTTT AGG GCTGCGG AAGATGCCCAGAAT CC CT AAAT AGAAAT AAGT ACTGGTCATC AG AG CAGTG CGGTTAG GC C CTTTCTGGCCAGGAGGTCATCTTGGGCTCCAGAGCCACTTGTTCAGCTCTGATATCTAAACAGCCAAAGTGAATT ATTCACTTAGGAGGGTGAAAAACACTCAACTC CCTGGGATTGTCAAATGGAGTTTT CCAT CñTAAAATAT CTTñG TCATTTGGAATCTAACCAAAGTAAACGCCTGGTCCTGACCCATCCATACCTCTGTGCTGGATGGGAAAGGAGAGA GGGAAATCCCAAAACAAATGGAAAAGCCACTTTGCCAGCCACAGAGCATGCATGGCCCTCCCTGGGCAGGTCTAC TCCAGGTGAGACACGGTTTGAGCCAGGCTACAGGTGAGGAGCGGGTCACGGGAGGGCTCTAAATGGCATCTCTGT TTGATTCCAGGCCTGAGTGGGGCTGCGGCTTACATAACAGTTTGGCAGGGAGATGCAGCCAGGCGGAAGCACGCG GGTTTCCAGCAGCGTATAATCATCTTTGATTTCTCTGTTTGCTCAGCAGCTTTCATGTGGTGGGGGAGGCAGTAG TGCAGGAGGAGGGGCAGAATTCCGATACACTGCGGCCTTCTGTTTTCTTCTGCAAAACAAACAGCCCCAAACAGA TCCCCATGGCCAGGGAATTAGCCACTGCCACAGAAGTCCCGTGGCAGTCTAGGGGAGGAGCTGCCCTGGAGCCTG GGT C AGGCCTAAGAAGGT CAGAGATTGACTTAAAGTT ACGGT CT CG CT AGGTGCAC AG GAGCCC AAAGGG CT AC C GAGGAGCAGGGGTCTTGGATGCTGGCAGCACCTAñCATGGTTTñCCCTCTTGGCACTGAGGGCTGTGGTGTCCCC TTCTGATTACAGAGGTTACTGAATCCCATACTACGTCCATCCAGAAAGGCAGCTTGTCAGGTCTCTTTCCTACTA ATCACTGCCTCCATACCAGGCTAATTCTGCTGGAATCCTGAGCATCTGGGAGCAACCAATTTAGAAAATAACATT TCCTTTGACTAAAATACAAATTTCTAGCTCCTTTTGATCCT(N)xTAAGATGAATTCCCTGCCCTATCCTAAGTA TAATGATTTGATAGAA[K) xTGACAGAAGCACAAATTGGTCATTC(N)XAATTTATCATGGGGGCCTTTAGTGCC TGGCACTTAGCACATGCCGTATACATGTGATTAGATTAAATGAATACATAAATAATTCTTCCAGCATGCAAACTG TTGATCTTGGCAGTGAGGTTCTGTTACCAATGTGTCCACCCTCTCAACCTTATGAATAGGTTTATTGACAGGATA CATCTTG TTTTGCAAGTGATTTGCATGAACTTGTATAATGTTGTGCTCAGATATTTCCTTACATTCAGATATTTT CTAACCCCCAGCTTGCATCTCTGGCCTCATGTTTTGCTACTCCCTGCCCTATCAGCCATGAGGAACTGCTTGTAT GACCTCCCTGAACTCACTCTGCTCCTGGGGCCTCCACGCCCTTGCTCACCCTCTCCCCTT<N) xACCCTAGCCAG GCCTGCACCTGTTTTAGCACATCAGTGCACACACACCCTCAATTTTGACACTCCCCTGCAGAGTCCAGATAACCA ACCAAGGATCCAAATGGATTCTTGTTTTGGACAGAGTCTAGCTCTCTGTTGACATATCATGTTCTTTCCTCTATT ATAGTAACTACCCATGCTCTGTGTGTTTTCTCAAGTTTTCTGTGTTTGAATACATGAñATTCTTTTGCTCTTTGT ATGAAAACAATGCCATCTGTTCACTAGATAGAAATTCTGGAAAGCTTCAGAAAAGCAACCAGGTTTTACTGCAGA ACAGTCTAAGATATACCTCAGTGGGATTCTGGGTGGGAGGCAGATAAGATTAGGACAAGAACAACTTGGAGTTCT TAGTTTTGTGTGGTTCCCTGCCCCCCTTTTTTTGATTTTAGATTTTAGAACTTTTCTCTCAAATTCATTTCAATG AGATTCATGCTC CAGTGAAAACTAACATTCTT CTTACATTCCAG CCTAGCAGCCTCAGGGGTGATGATAGTC TT C
A (N) XAAAATTTAAATAAATTGCAAAAAAAATAGTAAGAATC (M) xCTTTGAATGTCAATTTCACACATTCAGTA CAAAAGGGCATGCTGATATTTATCCTCATTGAAGCTGTTGACCTCCTTCAAGGGCTTGTCAATACCAGGACTGCT CAAAATGATTAACCAGATGATTTGGACTGCACTGGAATTTGATTAGTGGTAATTTGAAACATCCAATAATATTTG CCTTGCCTAGTGATAAGCAGATACTTGGCAAñGAGGATGGAGTñCAAGCCTTGCTTTñTCTCAGAACACAGACAT CAGGGACTGTAGGAAAGAATGCTCTAGAAGGAGGTCCCCTCCCTGGGGCAGGCCCTGTGTTGGAATGGAATGCAG AT TTCAGGTCGGAGAAG G CAAGTGGAGAAGGC CACTGT C CAG GT CTGTAGGTGG GACAGACACAGAGGGCTGGCT GTTTCCCTTGATGGTTCTCACCAG TAGCCAAAAT CAGGGATCTAAAA CAGfiGTCTTTTAACAGGGCTG CTTTATG GAAGAGAG GGGCTAGAGT CGAAAACCAAAACTGCTCCAAAGTAGTC CAAAGAAG CTAATATG GAAACAGC CATT T TT CATAGACTTATT CATTTAGATTAG GATGATAT GAATTGAAGAAT CTTTTTTTTTGATATTG GTCAGAGACTTA GTGTAAGAAAATTGGGAATATTAAAAAGAAACAACATTGTGACTGGTCTATGATTTATATTACTAATAGTAAACT GC CATT GCTGAACATG CTTAGAAAACATTTTAAAGAAG CT CTGTAGAACTTAG G CTACAAGACTA CAACCTGTAT CACTTAAGGAAGTTATTATGCTCATAACATTGCCAACTGATGGAATGTGCCTCTGCTGACTTGTACCCAGGCCAG CC CACACCACAGACTG CAGTCAAGTCTTCT CTGCATGT CT TCAT CAGC CC CC CACC CT CTGT TTATTAAGGC CAC AGTGGAGC CAGCCTGGAG GAACAG GG CTAGGAAG GAGGGCGT GGAT C CAG GAAC CCAG TACAAAT CACATGCAAT TC CTCACTGT CTCTAATT TTTATT CT CCAGTGAAGATCAATG TAGGAGAGAAAGATAAATGCAAGGAGGG CAC CT TGGGGAGTTAAGGGAAAGTAATG GTG CTGACTG CATGCTCTGGGAGTTGTTGTCTT CT CATATCTGTG CTAG CA G GG CAATTTTCTTCTTG CAACAC CA CATTCC CCTCTACCTTAACAGTGGATAC CTGAACAGTTTA TTTTGC CATTT AAAAAGATGCAGAATA CT CCTCTCAATCCTTCAATCATACAGAT TACAGGAAAAAAAAAAAGAGATTAATACAAG TAGAAATGAG CCTCGGTTTTGCCT CTGGTGTGTCACTGTAGAGGAATCAC CT CTTTTT CTAAG CCCCTCTATTCC CTAGGCAG CACCCTAGG CTGTTGAGG CTA CACG C CAGCTGGTGCAG CT CCACAGTAC CATTCCTAT TTGACAGAT CTAAGAGGAT CCCAG GTATGTAATAAATGGTCATGAGAGGTT CTGCTGACTGTGGACACCAACATACT TG CAGCT TCATTGAATG CTCTAGAAGGAGGT CCCCTCCCTG GGGCAAGC CC TGTC CTGGTAAT GGfiATT CAGACT GCAGGT C GGAGGAGG CAAGTGGAAAAGGCTG CTGTCCAGGC CTGCGGGTAAAACAGAGAAATGGCTGGC TG CATCTT C C CTT T C TCTATTTT CTTTCCTC CAGT GTATACAAAGTG CTGATGTAAAATTTT C TGAT CT C CAT CAG CAATCAT CTTCA AATAGCAAAGTGGAGAAGGCAG CAAGAAGG GCAATAAACC CAGGGAAAGAAAAGAAAGAAGT CAGCAG CCAT GGA G GGAGG CT CAAGTGAGTGGCAGG GGTGATGAGAAACCTGG CCTC CAG GTAGAGAG CA CAATGTGAAACTGTG CAG GGGAAGTCAGTTTTCTAAACTGAAGGACACAGCATATCTGTTTTGATGATGTTAGTATTAGCCTATCCCTGGAGA TGAGAG TCTCAGCTTAGG CCTCTTACAGAAAG CCTGGTAACTGGTGGT T CTTTG GGAATC C CAGATGT CATGAAA GCTGTCTT CAAACTGAGAGTTAAT CGTAGT CT CAAATTTGATGT CC CTTC CCAATC CT CAC CATGTGGGTTAGCT TGAGAT CT TTTCAGAACATAAATT CTTG CAAA{ N ) xCTGAGGCTCCAGTGAATGAAGGTGAAGCTTGTGCCAGTG GAACCAGGGTTGGCATTTTTCTGTGCGTCATCTGTAATGCTCTGCTCAGGGCAGGTGTCCTCACAGGTTAAGTAA CAAGCGGAAT GATG CAACAAAGGCAG TACTACT CTCTCTACG TC TGTCACTAAGAG GCTAGGTT GCAAAACATGG AT CTCGTTTCAATATCTGTTTC TTTAG GCTTCAAGAAACAAGAT CTAAAG CTTC CTTT CC TAGTATTATC CCAGG CACCTC CCGGGCCTGAG CTCACCCTACCTG CACG CACAG C CCTT CGGAAGñATCTT GGAGAGCCACTG CCAAAGT CATTACTGCTATCCTCATTCTGCCCCATAT CGTGAATATAGCAATCAAAGGGGCACCTCATGCATGATCGGGAAT TTTCCCTG CATCCAACTCTTAGGAGTACGTGCAC CAGAAAGTGGGCTGAC CAAGGT CGAGTG CTTATTAGTTTAA TAATTGGAACACCCGTGGTACTTCCTGCCAAGTGTTCAAAGACATGACAATAAAGTTTTTCCATTTGGCACTGGT GG CTTGGGGGTGAACCTTGCTGGTGCACACAGGG CTGGAG TTGAGGAG CTGGAGGTCACCCTGACCCCTGCTCCT GGAGCCATCACCTCCTTGTCCATGATCATCAGACTTGCATTTCCATGTGAATGAGCCAAGGGAAGAAGTGTTTCT GAGTTTTATGGGCCTTTTATTCTC TTGTTTTC TTTAAAAATTGGAC CAGGAACT CCAGCCCTTT CAGCTTTCTT C CTCTCCCTGGAGTCCCACTGGTG CACATTACTGCTAATGG GGCTTTGCGCCTGC CAAC CCAAAAGGAAGAACAGG T C CGCGGTGCATCTGTGG CCTAATGAGGAG CCTTGAATGGAAATGAGAGT CAAACATG CATTGAAAATATTCAGT TCTCAGTTGGTCATCCCTCCCCCGCCCCTCGC CATCTCAAGGTC CCCGAGCCCTGTGC CAGCAGAAGGTGAC CTG CCAAA CTGATATCC CGTGAAGCTTG G GGAAG GGCACTCAAACGGGACC CAGAAG CACACC CTGAAC TTG TTTTTT CTGTGCTTAGCACTGC CAAGGCAG GGGCTGAAGCAAAGAG GCAAGCGGGAACAGGAGCAGGGTT TG CCTACT GAG CAGAAAG GAAAATCAGG GGAATAGGGAGCTG GGAGTCAGATG CCAG GTG G CAG C CTGAAAAC CCATTGTTAGGAA GTTCACTCTGAGCG GCGAGTAG GAA CTTTTAT CTTGG CAACTTTAATGGAGAAAGAA CAGGGAGGCAGGTAG CAG GAAAAACATGGGCC CCTAAGTCAGGAGACACAGT CCCAGG CCTG GC C CTGGGACACATflAGCTG CAGCAAAACTA TGACAATTTGGATAACTTGTTTTTTTTGTTGTTGTTGTTTTTTTTTTGAG CCTT G G CT CC CTACTGTGTAAAATA GGAGTGGTAAGCCATG CTTTACCTGCTCCC CAAG CACATTGTGAGGAC{ N ) xTCTGGAGGTGGTGAGGCTGGCTA AG CAGT CAG CAAACGGGCAAGGTG GG CATGAGAGTGGGG GAC CC CCAAAC CTGCAAGTGTTCAG CTAATAAAACT CAGCACT C CATATTA CAAGGCAGTTG TTCAAAT CAAAGTGAATG GAGTAG CT
> H s l _ 18151 78 9 5 - 181536854
Gt TTTGTGTCTCAGAATGTGTCTC TAACCACATCTCAAAAC CAAATATGATAAATGACTTTCGT CT TATC CATCA TGTTTAGTTGGCTAAG CCTCTT CC TCTATCAG CC CAGATAATGAAGAGATGG GATTTTTATTGTTTTCTCTGTGG CACAGAGAATAAAG CAAGTTCC CAGGTAACAAGG CACAGCAG CATCTT TCAAGC CC CGAAACTGAGATGG TCTGA GAAGAACAGG CACGTCTAGGAGAAACAGAAAGGAAAAG CCAG CCGGAAAG GGGAAT CATT CTTAAT CGAT CC CAA G C CTCTGAGAGCTGGAAGGTCTGACTGCATTT TACAAATAAAAAAGTCAAAG CC CCATAAAGGAAAGTAG CC TAA AG CCACAG GGATGGTGAAGGAGTG GCGTCAG GATGGGCAGAAAGAG CTCTGGACTCTTGGGCTGCTGCTACAACC A C CTG GTGTGGGTTTT CTTAACAC CACAGCTC CATCAGGTTCTTAGGAAAAATGTCACACAACCAGGT TGTCAAA AGGGCTTTGAGGGCATTAAGAAAG GGACAGAAGTGGATGGATTAGAAT GGACAGGTAGGGTACCAAGGGG C CAGA GTGAGAAT GAG GAAGAAGGGCT GAAGAGAGAGG GATCAGT CGTATTTG CT TCTGTTTCTGAATAGT TTCCCTTTT TGGAAAGAGAAGTAAGAGAATAGAAGGCTAAAAT CATG CCACAGACTGGCAGGGGTGGTTGGGTGTAGA CACAAC AG CTCAC CTT CAGG CTGCTTTCACATTCATTCATCTCTTTCC CTTATAAT GATGATGCTG CCTC CAGTTTTTAGA ACTTT CTGGAAACTTCTGAGAAGTGACTGT CTAAAGGGG GATGATG G GAGAAGGG GAGTTGCCCATGGGTTCCCA T CGTGCTTGCTGGAGCAATGGG TGTTGGCAGATTTCTT CAGAAG GAGG TTGT CAGCAGTG GGGTGGGGCT GGGAA GAAAGC CACAGT GAG G CAAC AG GTGCAGGCATG GAG AC CAGAGCAAGCACAGTGTC CGTGGAGAñ CGGCATG CAT GTCACGGCAACCACTGGAGT GGGCGTGGAGTGGAGCAT GG CGGAGAGTGGGGGT GAA CAGGCAGGCAGGAGC TCT AGGTCTGTGTACTG CGGTGACACTTCTGGATT CTACTGTGAAGACAATAGAGGG C C AT TAAGAGGCTTAAAG TGA GGGGGTG GTAGAAT CCTGGCTACATAGACCTC CCAGCACT CAGAGC CTTGGAAT CT TG CCATG GTGGTACATGTG TGTGTTGGGGGTAATTAGGGAGGAAGGGGTGAGGCTGAGCTAGGATGACTAGTTAGGACACCATCTTTACTGTCT AGAATATACTGAGG GCTGTGAC CTGGGCTGTGTCAGTG G GAC CAGAGGAGGGGACAGATACAAAAAATACTTAGG GGTC TT TAAACTTGGGGGAT GAGAGGGAAGAATCCAAGATGAATCC CAAGTGTAGGGGAGGGTGGCTTTT TT CAC CCTGGATGAGGAGCAGTCTGGTATGGAGAGGGGATGAGGGTGGCTCTGGCTTTGGCGGTTGGGTGTGTCTTTTGG ACTGGGTGTGGTGC CCTATAGAAAGGCTGGAG CTTGGGACAGGTCTGT GTGGACACCT CGATTCAT CACTGCATG GTTGGTAGTTGAAA CCATAGACATGGGT CAGATTGTCC CAGT GAAATG GCAGAGAAGAGGGCAAGGACAGACTC C AAG AG G CC CTGAAT CAAATG GGTATGGAAAAT ATGG AGG GCCTGGGATGGAGGCTGAGAGGGAG CTGGCAG G AGG GGTGGGGG CAGATTGGTTGAGACCATTAGCAGGGAAATGAGGGAGAAAGAGTTT CCAGAAGGGAATGGTTAC CAG GTTC AAATGTGAGGGAGGAATT CA CTGCTGGC ATTTGT TTGAGAAG CC CTGACCTTTTTCTGTT CC CTTG CT AG A AATAAATG CTTTTTTTTTTTGGTAG CAGA CAGTAATGTAC TGGAAT GGAATCAT GATTGTTC CGTGTCTCAGAAT CCATATGTAAAGT C CC TTGT CAGATAGCAGGGAAACAGTCAT CTTTGGG CAGTTTGTACTGAGAGGGAGGGGAGG CCGAGTGTAAAG GATCGGTG CCTTAATTTTAG CAAGGCTCTGAGCC CCTTGGAAGG CAAGCTTTAT TTCTACATA GATT GAAAGTAT TATTATTATTACATCC TTACGCTGTAT CAG CGCCAAACACTGGAGT CCTCAATACTGTAT TTA TTTCTCTGGGCTTTTGGAGGGGTGGGGAAGAAACGGGACATTTAGGTT GCTAAGTGATTGTGCTTG CTTCTAAAA ATAATGAGATTC CC CAGTGG CT CGTTCT CC TGTGTTTT GCTTAGAGTTAGAGGATG C CAAGGTCAAGTTATAGAT GACAGG CTGTGTCTCCAGGTGGTG TCAGGG CATCCTGTTG GGGAAG GACAAGGTAAACTCTC CCTCTCTGGT CCT CCCAC CTG CATC CC CTTAGGG C C CA CTACTTTG GGAAATTGCAGGAAAAACTGGGC CT CCCC CTGCTTGGAT TGA AGAGCCTCGCCTAGGCAGTCAGACTCAGTGGTTGCCATGGTGATGGTGGGTGGTTGGTGGGGTGGTGATGCAGCG TTTGGCCTGTGCAG CTGAGGTC CT CAATGCAGGCTGCTTGGAGCCCAGGCTGCT GAAG CTTTTGACAGTTTGTTT AGGATT C CAGAGGGTCA CTTGCTG CGGTGT TGAAATCTTCAGACAGAC CCGCCAGCTCAGCCAG CTGAAAC CTT C ATTGTTGAATATGCAT CTCACAGG C CAAAT TTTGGCAGGCGCTCATGGGATTAGGAGCTCCACAGCAACC TTGGT GGGGGAAG GGGTGTGC GAAAAAGC T CTGTG CAGGGGAGAGTG GCTGGGATAATAGACT CTGAGGACAACAGAAG G CCAGAGGCCCTT CAGAT[N ) xTTGGCAAAATTCTGGCCCCTTCTAATTTGATTCACTGACTCAAGTTATGCCAAG AAGGAAAG CAGTGTAAGCTTTCAT CAAAGTGACAACCACTTG CCAC CCAACAAAACATTCATAC TTTTCTCAGCT ACTT CTGACATTATTT CAAGAC CC CTTGTTATTTCCCCAT CC CCTATA( N } xGACTTTAGGGTGACACCCATGTT TGTGAA í N ) xTATAATAGTATTCATAGCTTTTTTTTTCTGTTTTTATATCCAACTATCTACTTGTCTAACCAATT AATCACTGATC(N)xTTTGCAGGGACATGGATATTTT(N)xTGAATTTAGTTGTTATTCAGTAATCCATATAATT TGGGAATGATTGTTCAAATGGCACAGACTCTGACAGGGGTTCAATCTACCATTGTGTCAGTGTAGCAGAT( N) xA TAATAAGAAACAATGC CAATAG TGATACTAATGGTCCT C A ( N } xGCTTTCTCGTTGCTCACACACTGTGTACTGA CGTACGGCTGAATTCTCCTGGGATTGACAATGAGCTTACTGGTTATCACTGTAGAGTCTTCTGTTTGGAATCAAG ACATTTGCTCAGGTTTTAAGGCCTCTGTTTTTATCCCCAGTTTCTCACAAGCTGCTTGTTTGGATGTCTTATCAG TGCTGGATCTAAGT( N } xTCAGTTGCAGACACACTTGAGCCATTTTTGTTATTTAAAATGGTTAAGTGTTCCTAC TAGTATTATCTC CTAT CCAAGTTT CACTTTTAA CAGAATT TATCCT CCCC CTTTTAGATTTTTAAAAAATAT TCA TAAGACAC TCAGAGGTATGACTGAATGTTGGT CACTAGACC CAGAAAGGG GTATAC CTACCAGCTCTTGC C CACT GAGTTGTG CTTGAGGT GGAAGAAACAGAATTG CAGACT C CAG GAGGAAAAGGTATG CTTCTAGAG GTCTTTCAT C TAGAGC GAATCT TTCCCACCTTCT CCTAAT CAGCAGAAAT CATTACAGGAA CAAGACATATCGATC CTGGTCTGA GGGCAC CATATACATAGAGTT CTGTTTT CTTCTTTTT CTT TC CACC CT CC C CCAGC CT CAGAAGGCTAAT C CAGT GCCAC CTGCGATTTCTT CTGATGT TACTGTG GAAGGGAG GGTTACATAAAGAGAATAACCAG GTTTT CTC CTGTG GCACCTGCAGCCCCAT TTTCTCATTGGGAAAACCAAGG TATGAAGGAATAGGTTAGGACTTAATGGTGCATAAAC TCAGAG C CAGCAGT CATTCTTC CATCCCTG GT CTTGGCATTTGGAAGATGTGAT CCACTATGTTGTAAGGTCTAA GGGG CAGGAAATACAA GATGATGTATTTATGTAAATATAATGTACTATGTTGTAAT CATATGTG CATGCAGGAAT GAATGACT GAATAAGT GAAGGGAGAATT CTTT T CTAC CTGAACTGTGG CT CCTG CTAG TCTGAAAG GCATGCAG C CTTTCTTAAGCTTTATGAAGAGGAAGGCTAGAAAGAAGTTCAGAAAAGCTTCCCCCATCTTTATTCTTTAAACTC TATATAAAGGCATG AAATGTTC TGñGAC TGAC GAAAACATGGGCCTAGAC CAGCAAAT GATTTT GGAATAACAAA GATGAG CCTAGT TCTCTAGCTATT GTCACCTC CTTTGTTGTG CCAGTG GTGGTTAC CATGTGTC CT CCTT CACTG GAGGATACAGCCAAAGTGAAGGATGCTG CAGT GTAAGGATGCAGGATGGGTTTT CTGG CTTCAAATTCGTTATAG ACTAGAGTAGCC CAGGG CAT CTTC TTACTA GATGTCTTGCAT CATGGCAG CAACAC CC CTGGTC CCTCTGG GGTG GATG GGGTGCAG CACACCTTGGTGAGGGGTTTA CACTAAATACTGCTGAC CCCAG GACAGCAATTT CTCATACAT TTTTATTC CAAAGTAT TCCTGAAAGGTACTTT TGCTTCCTCATGTTTC CACAACAACC CCCT CTAGAGCAAGGAT ATAAATAAGAAAG CAAAATTTCAGGCTC C CTTTACAAT CACAGTAG CT GT TAACTG CTGAAAGGGACTCAGAGAG GGGCCCCTGTGG GAGGGGGAGG CAGAGG GTGGAGCAGG CCAGCCTGTCTCCCTGGGTAGCACATCC CCAAAGAAT CCTCTGAGCATTGATCTTTGATTGTGCCCCTTCCGTGCTACCCAGCAGGACCTTCTCCCCTAAAGAGGGGTCACT TTGTATTGAGGATCTCCTTCTTATATTGCATTTGCTTAGAGGGTGCATGGCCTCAGACCACATTTTGGGGCTGAT GGGAAGAAGTTGGGGCAGGAGGGCACTGTGGTCATAATGAGGTCATGGACTGCTAAAGCTTCCAGTTCTGAGGGG CTCAGGTTTCTCAACAGAGTGAAGATCAAAGTGTGTTCAGTGTGGCTGCTGCTGCTGCTACAAAGAGGTGTGAAA TGTGGGTTTGGAAAGAGAAC CGAG GATAATTTGCTAGTTTATAGCCTTTC CTAGAGTT CACACAGAATTTTTAGT ACTTTCTCTCCCTCCCTTTTTTGTAACTGTTTCAGAGAAGGTGTGAGATCAGCAGGCTGTGAACATCGTGCAGTG TGG GAAAGGGGAGCTGAGGCATG GGTCTCTGTCTCCTTGGCCCTGG GAGAAGGGATTTTGGTG GGAGCTGG GGGA GTGTTTGGGTGTCATCTTGGGGAGCATGTTGGGGAGTGTGTGGGATCAACATTCCCACCACCTTCCTCCCTGTCA CCTGGGTGCTGG CTTCTGGAAG GCTCAG CC CTGCAGGT C CATAG CAG GACTTTC CTG G CTGTGC CGTACAAAGTA C CTGAC GATGAGATTG CT TG GAAG GG CATG GGGAGGAG C CAGTGACAGATGT GGTG GGAT CC AT AG AAAGGG CTG CTGTGGAGCCTCTG CGTATTACT CAGGAGCAC CTACTGAGTTTTCTGTACAGCCTTTGTTTTGCC CTCTGGAAAA TG C CAC CAGGAG CT CTAAC CTTAC TGTGAT CACC CTTTG C CATT CAGTGC CTGCCTCCATTCCATCTGACAGCTT TAGATATTTGAACAGT CCTCTTGCTGACAGCTGTGTGGAG CAGGAGTGTC CTGTTCGCATGC CAAGTACT CAGAC TCTGG GAAGACAGAGATAATGAACTT CCTTTT CACC CCAGTGATTG CCTCCGAGTT CAGAA CTCTGTGCCCTCGC ATCAGGCTGTGG CTGAGC CT C CAACTTG CTTTGCTCCCTTGGCCCAGCATCTCTCCCCTTCCTGTCTACTTTCTC AACCAC TG CTGGGAGACCTT C C TAAAG CATAGCTTTGCTCATGTTATCCGCTGGGCAGAAACCATCCATGGGTCT CAGTGCT CAT GAGAGGAG GGGG CAGAGC TGGCTCAGCGGCACTCTGTGGG GAAAG GACTCGGGT TATGTTAAGAA AGTGAGATTGCTGC CAAAAGGGAT CCCACCCTCTGGACCTGCTGTGACTGGGATTCCAGGCCTT GAATAAGATT C AG ( N ) xAGTTCAGGGATGGGCTGTGCTGCCCTCACTGCAAGAGTTCATATAGATGCTGGGATGAGCTTCGCCCAG GAACACTG TGTT CCTGGAGTTTGGGCTTGATT TT CTAAGATCATAT CCAACTTGAAGTTTAT CAGTTCAC CCT ( N ) xGAAGACTGAGGGAAAGGGTGGTTTGGGAAACCTTCCTGAAGCAAGTGCATGCTGAACAGTGTTGAGGAGGAGC AGTTTAAACCTAGC CAGG GTGAGGTG CATG CGAGAGCC CTGAAATGTGAAAG CTGGGCACCTGCAG GAAACCAAA AT TAGTTCAGGTGACTGGAGGGTGGCAG GCGGAGGCGTGG CAAGAGGC TGGAAGACAAGGTGGAAGACAGAT CAT GACA( N ) xAGGTATTCTTGTGAAAAATTATGAGGACGTTGGCAGTAGGGATAGAGAAATGAAACTGTTAATAGAG AAAAAG CTATGGAGTTCCTACAAGACGGTCTT TTGGGAAAGACA( N ) xAGTGGAAATGCCCAGCAGAGTACCAGG TATGCA(N) xGGTCTGTCCTGGAGATAAAGACTGGGAGTCATTTGCATCCAAGTGATAGTTGGGCACTTCGTTTC AGTAAG GC CTTT TAG GGAGTGAGTGTGGATTGAAAATAAAAGTGGGGAAAGG CAAGTATGAAA CTT TAGGAAATG TGGCTTTTAAAGGGAAGTGGAGGAAGAGGAGTAG CCAAAG GAGAAA( N ) xGAGGAATTTCAAGGAGAGGGTACTA AACACTACAGAGTTATTATAAAGTGCTG CAATA CAATAC CGACAAGTACAT CTTGAATTTGGAGAC CTGCGGGCC TCTGATGC CCTGTAATGAGT CATTTT CACGGAGTGAGGAGGACAGCAG CCAGGGGCAG CAGG CTGGGTAGGTAGA GGGTACAAATGTGCAC CAGCTT CAGAAG CTGGAC CAGG GAAGTGAGGAGG CTAAGGAGGG GC CACAGAGATG CAG GGAAGAGCTGTGT CAGAAGAGAGCACACAAAAAG TTGAG GAAACAG GAGAGGG GGT CATCA CAGAG CTGCTGAAG GCGT GGGATG CAGAGCATCGACATTCTT CTGCAATGTGAGAAAAAGGGTCAC CAGCTTTTAG GAGACTTCAGTCC ACATTGGTATTAGAAAGT TTTGATTGTT TAAAATAAGGAT CT CTAATGAT TATGGAAGAG CTATTTTTTTTTTCT GTATTTGTTGACCTTTAGTCATATTT GAAGTT TAGCAAT CAGAACATGTACTACAG CT CT TAAT TTGAATGTAAA CTCACCATCCAGGCCCACCTAGACAGG GTCACAAAGAGTTGCTTTTGT GAAGAGCTCTCAATATTTTTGCCAACT TCAGAAGGTTAGGAAGATACTGCAAACAACCTTGCTTTTTTTTTTAAGT(N)x ACATAGAAATTTGCCATGGCTG TATCTCTACAGGATTAGTTCTACAGTATGTGATG CTCCATGCCAGGAT CAGCAGTTTACAGTAT TTTCATAAAAT CTTAACTTGGTCTTAGCCTCGTTTCTTTCACTTTCGGAGGTCACGAGTTAATTTGCGTAGCTGGCAAGGTAAAAA
t a a g a g t t g a a g a g a a a g a t t t t t g c c c t c a a g t t a a a c t t g g a c a c t g g g t g t c a g c a c t t c t g t g a g a g g c a g ACATCTCTCTGTTGTT CCTAATAAATAGAAAAAGTGTT CTTTTAAATT CCTCTCTGAGAAAATGGC CATG CCACT TAGTTTCCCTGTTGTGAGAGTAACCTCTTGATTCCTCTAACAACTCAGCATTTTATGTGTTCACAATGAAAGTAC AATTAAAATCACTTAC TT CATG CTTG CATAGC TCTATCAAATAGATTCAGGCAG CAAACAGTTC CTGTATATAGG CACAATTC( N ) xTTATCTTTTATTTTATTCCAAATAGATGGTATGGGATGTGATATGACTCAGAAGTACTAGAAT GAAT TTTGACTTAATAA CT CTAAAAATAAAAAGCAT TTTGTTTTCT CTGACAAATAGCTCTCGTT CATTTTGATG CTCGACAGACAG CTGATATTTTATAAAACTTTATGATTGC CTCTTCATCAAG CACTGTAATTGTAGGTTTGT CTT CT TTGCAAGTAT GGG GAACTGG GTATTTTC TTTTAAATACTTGT CATC CTAGGAGG CG GATTGC CTATGCCTGTC ATG C AAAT CAA CTTTT CAGG CAT CATAGGCAGAACTAGGAAAGTAGAACAAT GGAAAAATATATAT TTTTGTCTA AAATTTATGGAT CTGTGACTTG C C TGAAAGAGTTATTTGAGAAGGTTTTTATAAAC TCAAACAGAT C CTTTAAAC TAGATATT TCCCAGACTGCC CAAAAAAGGG CCCCTGTCCC CCAATACCACA CAACACTATGAGAAAGTAGAATTT TATT GCTTGGGTTTTTCTTTTCCT CATACTTTGAAAGATCTCTTTGGC CAGCTTTC CATT CAGTGTTAAC C CACA CTCGTCAGTGTTTGATGT CATGAGAAGAGGACTCTGGCTTG CTCAAGGAT CATGAGTGAGTAAATTTTTCAGTCT TC CAGAAGGGGAAAAGGAAG CATCATTTTGGTAT CTTAGGGATTAAGA CACTTG CTAACATGATAT CAAGAAAAA AG CTTG CACACT CTC CAATTGGTATACTGG CTCCATGC CAGTTACATAGT TTTGGGTTTTTCCTTGTTTTTCATG T A (N ) xTATAAGTCTGGCCACACCAGTTTTACCATCTTTCACTTTGGTTTCTTATCTAGATGGTATTATCTATAT GGAATAGG CAAGTGGGTTA(N)xGGAGGTACTTTTCAGGGAAATGATGTTTCACAAAACTGCCACGCTGCAGGCA GAATTAGAAAATAGGC CT TGG GAATTGCAGAAATAGG GCTCCAGGTCCAC CAGAATTATT TCCTCTTTTCATTCC CTCTTGGC CCTGTAAGATTGGCTTTATT TTTCTCTCTCAC TGTAGC CAGTTTTCTT CTGGAAGAGAAAACATGGA TAGT GACAGCTCTTGAG C CTTTTAGCTTATGG CTTCAGCTGCTTCA GAAGAGAT CAAC CAAACT CTATTGGTCCT AACT CCAAACAT CCAGGGAAGGAACTTC CTGCCCCT GATC CTGTTAACTGTG GC CAGTAGTG CCATGTAAGAGCA CT GAAATTTTTGGAAAGACCTTAGGGAT GAAG TATTGTTGTCAGAG GAAGAGGAGGGACTGGTAGTAAAAACAAT AGGTGTACATTCTTTGTGGG CAGTGCTGGATTATATAG CTAATCATGGGG CTTG GATA CTGGAGTT CATTAGTG C AT CTTGTAACAAGG CCACTTTTGGGGGTGTGGCTCC CACATGGATAATTAGCTGGATC CTGGTCTT CAGCTGTGT CCTGCTCTATCC CTGACTAG CCAT CT CACAACTG CTTGTTCCTGGTCCTGGGGGGTACCTGCG GAGAGAAAGTGT GTGG GTGGTAGATCAACAGAAATC CCTCGC CACTGC TAGAAAAACAACT C (N > xATTTATGTAGATGATTTCTTA TAATGTGG GATTTC CCTCTC CAGCAATT TTCCTTTG CCAACT CTCTGAAGCACAGT TTTGGTAACATT CATATGA TG CTGCAATCATAGGATAC CATAAAG CTCTGCAGAGAG GCAACT CTGGTTGT CCTAGAG AATGCAT TTGT CAAAT ATATGTGCCTTTGG GATATATTAC TAGCTATTAG CCATTATCTT TCGTAGCTTTTG( N ) xATAAACTGTGAAACA A CAATAAGAAATGTTT CTTTTTCT TTTTGTTAT CACCTGGGC CTGAATGACAGAGT TTTTCTCCATCTCTTGGCA TTTCTCCTAGCTGTGT CTGAGTGAATAATCAGG GA CTTT C TCAGAAAAGAGAAG GAAATG CAG GAC CATTG GTAG GAACTT GACCTTTGTTT CTATCATACTGAGGT TAAG CAAACT CTTTATTTCTGTATTCTTCGTCTGGCATCCTTT G GGGAAC CTCTT CTTGAGGTTAAGGTGAGATGAAGG CTTC CTTTGGAGACACGTTCAGGC CAAG CT TCATGCTCT GAGGAGTTGTTT CAGGAAGAAAGAGAGAGGAGCTAT GT GTATGTTCTC C CAGTGTGAATAGG GACTAATGTGAAA CATATG C CATTG CCTTTAATCAGG CTAT TAAGCATTTCT CAGTTTTGATGGAAC CT CTAT CTGAGAGGTTCACCT A TA G ( N ) xCAACATCTTCTTCTTAGTGTGGTGGTTTTGATTGTTGAGCTTTCTTTTGGTGGAGAAGGAGAAAGGA TTTTTAGGCCATGATTTTTATTCTCATCAAAAGGCATATATTGAGACCACTTCATACCCGCTAGGATTTTTCTGA AATTTAATTATGAATG CCATTTTAAATT C CAGAAGGG GTTGC CTTACTTTTGTTAAATACAGTACT CTTTATTC C CT CCTAGAAATGGCACAGATCTCT CTTGGAGAGGGCAAGGTGACTCTA CTCCCCTTCTCATCCT CTTAAGAAATA TCATGG CTTCTTAT CTGTTATATATTGT CTGTGGTCTTAGAAGAAAGT CTGTGCTCCAGTTTGG CCATAAATTAA CT CAGAGCCATATGTATTGAGCAATATACC C CACCAAG CTGTAAAACAAATGAACAAATG GAAAACAACAAAAGA AACT C CAAACCAAC CAAC CAATCAGAAACAGGTCACTG CC C CATGACAGT TTTGAAGTGTGTGT GAAAGAGGAGA AAGAGCTTTAAGACTGTCCATTCTGGAGGGCAGCAAGATGTTGTTGCTTCAGGTAGCATAAGAGGGAACAGATTC TCTT CTACAATG CCATGTTTGTGG TT CT TG GAAAAG CACAGCAAAACCTTTCAAGG CAAACTGT TT CAT CATTCT GGGATAACACCA
> H s l _ 181589 98 1 - 181612 57 8
GTGGGAAGACTTAAATAAGTAAAGAGGAAGAGGGCTTC CTAGTGTGGT TGTG CAGG TTGTACATAAA CAACTCTA AACAGCACCATAAATGTCA CAATC CATGTGTATGGC CC CAAGATGGAAAGA CACTC CAG GAGGGAAAAAAAGCAT G GAAAAAAACAAG GAGATAGAGTTA CATAATACATG GAAAGTTTATTT G G CTGATG CTAGTCAAATAT CCATCTA CT TATTTAGGAAATAT TTATCGTCTGGCAT CATTGAGAT CACA CTGAACAGAGT GGAAGATT CATT TAGGAAAG C AGTGACAAAAATGTGTGGGCTGTTGCTATT CAATGTGAG G CCTTGAATGT CAAAC CAAGGACTTAGGACTTTGC C CCATGGGCAATCAGGAGGTATAGAAGGTTTTGGATGAGAGGTTGATGTTTTAAAAATAGTGTTCTAGAAGGATAT CTTG GCAACAATGTGC CAGATGGCTTGG GTTATGGGGT GACTTGTGGTGGGGAAGT GAAG GAAAAC CATAAACCA GGAAAAGAGTGGAGAGTGTTAAAG GTAATT CAGTATG GGTTAACAGAAAGACAT G GAGTCATTTTCAAACATACA GAGT CGACTGTACATT CACCAATTAGGTGCTTGGTAAATG CTTACTATGT CCGTGGTTCT CTAATACATGTAAAA GG GAGACAGAAAAAAATT CAACAGTAATAATCCCAGGTATAT CTGTGAATATAAAACACATATC CACATTGCAAG TCAGCCTGTCAATAAGGGAGGACAGAACATAGATAAGCAGAATGAGATATAGTGAATAAATTTAATAACTGCCAA CTAGTTAGCATTTATGGAGCTCCTTAATGGCTTAAAGCATGACTGTATCACACGAGTTT(N) xATTGTGCTACCT AG CCTTGCACTGTG CCAAGCTCAGTATCTCTACAATACATAGGTAGGG GAAC CT CTAGTTA C TGAG CCAGCTTTG TCAGCTGTGGCGAGGGTCATACCTGATGCAGCCATGAGTGCATCCCTTCCTCAAGGGGCATACAAGACCCTGCCG AG CTAAGACATC CA GTATGGGGTC TTTGTCACTGCCAAGAATG CCCCTGG CCATAATAACTGTC CT TGAGTAGCA AGAATAG CAGAGTC CCTACTCCCTGCTC CTGGATGAGTTAG GTCAACAGG GACT CCACCAACTCTC CATGGCACT GGTC CATCAGAG CTGC CT CATGCCTATT TC CATGCCTAGG TTATGCTGTAGGAAGCATTGGCATAT CCAAGGCTA GATCTTTCATTT C CTT CC CCTGCGAGTCTT CTGTAATT GC TT GGAACT CTGGGAAGTCCAG CTTTTGG CTTAGTA GAGGGCTGTTCTTTCACTGAGTCAGGGCCCTTTCATGATGGAACCTCTTCTGATTATGAAGGTAATGTTAGAGTA GTTGGTGGGTATA(N)xTCCTGGGGAAGCTCTTCCCACTCCACATTTCATTGTCCTGTTGTCCTTGCTCTCCTAA TTCTACTTCCCTTCCCTCACCTTTTCTGGAAGTCTATTTGAGCCTGGATCCAGAAGATGCAAGGGAGAGATCTGA AGTGTTGCTTAAAAAAACCGCGACAGCCCACTTATGCTTAGGAGCTTTTTCTGGGCACCTTTCAGCAGCAACACA CC CT CACAGTTGGACTT C CTGTGGTGAAG GAAATTTTCTAATTTCTTTTATTAG GCTAAT TAG GAAAAGAAAAAC AAT CATTTTTTCTCAAAT CAGATACTATAACTGGGACACAATGATTATTAAT CAT CATCAGCTG CAATTGAAGT C CATTTGGTCATTCAACAAACATTCCCCAACCCCATTCCCACCTTTCCAATTATGCAACCCCTCAGTTAAAATCGC CAT CT C CAG AAGGCTCT C C CTAAC TAATTTGGCCCACAATGGTAAACC CCTTTT CAAAATTTGTGTAACATTTGA CATCTG TATTACTTAT TTGGTATGTATTATAGTGTACCTTAAAATCTT TAAT GTACACATATA CATAACAAAA(N ) xAAAAGTTCTCAAATGTTTGCTGAATGATTAAATGATTAAATAATTTGAGGATGGGTATTTTTTGGGCCCATAA ATTTAGAAATTACTGTTAAATAACATTATATAGATTTCGTAGATTACAAATTTTGAAAAAATTGAACAGCAATAA AATACATTTTAAATTCAAA CAATATATG GG CATACCTTG G AG (N )xTAG AACTAATCTCG TG TCTTAG AT(N )xC AACTAGATTTAC CTTG CATGCAGTTTTCAGTGTATGGACATG GTTTACTG GAGAAAGATT CTGAGACCATGTCCA AACC CAT CTATGAAAAGTACCTGGAATTGTTAGTCATGT CAG CAAATG CAGAGGAGGAAGTATAGGTAGTGTAAG TATAGACGTTCTCTGAGGTCCCTCCG(N) xTAGTGTAGAGTATGCTACTTGATCTAAATTCAGGAAAACATGCCC AGGG CGTAGGACTT CCATACTACACAAATTTTCTTAATTG CATTGGTGTAAAATGTGCAATTAAAAGGGTAATGA
G(N)xATCCTATAAAAAAGAATGATAGATACTCTAGCAAGATGAAAGAACAAGGAAGAATTTCAAAATTCAGTTC ATTT CAGTGAACATTTAT C C A T < N} xCAGGGGTTACCAGTAACTGATTCAAGAAGGAGAGCTGTTCACAGAGCAA GATGTT TCACAGAAGGGAAAGCTATTGAACTAGGTCTTATAGGAAGAG TTATCAATTGACAGAA TATGAAAATGG GGCATCTTGGGAAGAGGAAATGGTACCTGTATCTGGAAAATGTAATGCATTATCAAAGGACCGTGAATGTTTCTG AGAGGGATCCTC CTTAGGAGTGGGGGGAGATAAGGC TCGGTAAATGGATGAGGC CCAGACTGTTGTGT CATGGTG T CTAACTAGTGG GCAAAGTAGAGC CACTGCAGGGTT CTAAGCAGAGGAAGGC CACCATGAGAACAGTGTGTTGGG AACACTGATGTTGGCTTCAGGAGGGTCAGAGGGAATTCACTGGGCAGCACAGTGGTTCAGGGTAAGGGAATGCTG TATAAG CACCAGGCTTCTTCCCAGGTGTTC TTAATTATTACAAATTAGGAAGATTAAGGTTTAAATTGTGTCAC T AACATG CTTATT CAGT G GGTGAGG CTACAGGTTGCTAAA CTTGTTAGAAT TGTGATA CAATAAT CT CATTGTTTT AGTGTCATGTCTTTTCCACCCACCTCTGCTTTCTCTTTCTAACCGTCTCCCACCCCATGTTATTGGCATTTAGTT CATATT CATTTTAGTATTAAAGG GAAGTAATTATTT CTGTATTGTGAAATTGGCAT TTGAGATCATTGGAAATAT T CTTTAATGCTGTAAG CTAGAGTT TTAAAAAATAG GATTAGTTAAGTCATATTTTT TAAAAG CGAAAAAAATCAC C CAGAAGCTATTGAñGATGATGAGATAATGAGATGAAGAAGTTGAGATGGACTG TGTGTCTCTTGAGT CAATGT C GATAACCTTTGGAACCTGTCCTAAGTGAGAACCTCGATTGCCCTACTGCAAATGCACGTGTTCAGCTGCTGATTC AGGACTGATCACTGGGGGTGCCTCAGAACTTTCCATTTTCTTAGATTCGTCAGCAAACTCACACTCAGGGACCTT ATGAGAC CAAGT CAGAGTAAATGT CCAGGGTTTACAGAGT GGATACAGTGGCTCTG TTTCTCAAGAAG GACTCTG TATAATTAAATTATCATTCTTCTATTAAAAAATTGAAATTCTTTCTCCCAGGAGCTCATTGCCAGTGCCTTCTCT GTAGTCTTATAGT CAATT CAAAAATATATCACTTCT GGAAGATGTGAG CTTGAAGTGTGAATGCTATCACATGAA TCTTCTTTTTATCATAGAAAAGGCTTTTAGGTGGGGTGTTGATGTCAAGATTAACTCAGAGAAAATTGGCTGCAG TTAG GG CTGAATGATA GTGATCATTC CAAAACTCAT TAGGAGTCTAAAAAGT GAATATTTAGACTTTGACCAAAA CCCATTTGTCCTAGAGTGAAATATGGTAGCGAGAGAAAGACAGCCCTGCAACCCCGAAGTGTAGGCGGGCCTTTC TT TAGGAACCAG GCAT GT TCTTGAAATCTTGTGTGT CAAT CAGCTTTTTGTAAATG GAAT CCTTTTTT CTCCTT C AT TAAC TTTCCC TGTAAT TTCAGCAG TGTTTAACATTCATTACTGATT GAT CTTGAGTTG GCAGTTAACTGCTAA CATTCAGATTTGGATTTTAGTGCTTCTTTCTTGATTTTCTTGCCTTTTTTTCTTTGCTCACT(N) xGTAAACAAT TACCGTAGGAAATGGTAAATGCAAATCCTAGGAGTACAGAGAAGCTGTGGTAGCACGGAGTCCCTCTGTCTGGGG AGGGCTTCCCAGGGCAGGGGTTACTGAAGGATCTGAAGATGTGAGGAGGTGCTTGCTTGGGTGATGCCCATCCAA GGAAGACCAGGATCAGAGGCAGGTTTGGGATTAACTAACTGGAAGGACTTTGGTGTTGAGGAGTTTCAGTCTCAG GACAAG CAGTGAAG CAG C TGATGC TCTT TG CTTAGC CTTC TACATGGAGT GAGTTGTGGG GAAAAGTTGTTTGCA A CATGTATTTTT CTAATGGCCAGG GAGG GGTTATTT CACTATTATTTCT CT C CAAC TTTTCTAT CAAATACACTT GTGGAGTTTCTGTAGT GTGAATTG TCTTAC TTCAAT CTGTAAGTTCAATCTATATT CATGAAGTAG GAGGGCAC C CCTAAATAAGAT TGAGAG GGAAAAAACATGAATATTTT CAGTTTTAGT GG GAACT CTCATATTT GACTTAGTAAC
( N > xGGTAAAGGACACCATAAGTAATGTCAACAAATGATTAATACACTGAGAAAAATGACACGTGA{ N } xGAGTA GGAAGATTAGTGGAATGT TGCCTACCTGTG GTTCTGTG CG TCTGTGTGGGT CTACTTTAA TGAGTT TAGATCTAG GTGCTTGTCCTGGAGGCACATTCACAGTGTGTTTT(N)xTAGCAAAATGCTATGTGTAAGTTTGAATGTGCATGA AGATAG GTGGGAAGAAAACACATCGGGC CATAAGCACTTAC CA CAGCAGGAT GGGAATGGGGGGAGGGTGGGGGC AAGTAACTTTTT CTATATA CATCTTTGTATGCAGTT GTATACATATTG CCTTTG TAATTTAATG TACAAAGACAA AATAAGTACCTAGAAATTAAGAACAGATTAGTACACTTTTTCTGCTACTGTCTTGCCATAGTGTTGTTCAAACCA GCTAGAAAACAATT CTGTGGTGCCAGGTGAGATGAACT CTCCACCTCTTCCTGAGGGCTTTGTCTT CATGCATGG GACAAAGCAGTG TAAT TAA CATGC CC CC CCACCCCG CCTTAAAACCTAATATTCATT CAT CT CT TACTTTGGTTA TTATTTAAAAAGAAAACATTTGTCAATTA CTTCACACTGACAAGCACCAT CC CACTGAG G GGGCAG GCAGGAGG T CCATCTCACCACCCAG CTGTATTG CAACAA CGCCCTCTCCñT CTGGGAATACGG CCCAGGCACTCCT CTGGAAT C CTGAACACACTC CAGG CACCTCTCTT CT TACTGAACAC CACCTATGCTTGGAGCAC CATGTGAGAGGACACCTCT GGTATCAGGCAG CTTCAGATGTATTCAGAGAGATTGAAAACTGCAAGGATTCAAGG CATAA CAAATGATACACAG A CñAGAAAAAAAGACATT TAGAGTGGGGGATAGGGC CTGAAGG CTTGAATGGTC CTACAAGACTTT CT CAAAAGT CGTCAC CCTGGGAC CTGAAACATTAGAGCT CTTTCCAGAGTAAGTCTATT CT CAGC CAGAGGTCTC CAACAGAGG CTGT CCATGTGTACTGGTTTCAAG TT CTGCTTACCT CC CAAGTGTTGT TTTTTCTGACCCTGAC CTAATTCCTC C CAGGCCAGGCTGGCCAGCACCTTCTGAAATTGCTCTCAGGCAGCTGGAAGGCCTGGCTGGCAGGAGCCCTGGGTT TTTCACCCTCAGCC CCAGGGCTCTTTGAAGGCCTACTCTC CAAGAACAAACAAACC TGGC CTGACG CTGCATAG C TG CAGAATAGGG GAGCTTT CTCGG CCTGGTGCCCAG G GTG GC CGAGCTGC CAGC CAGGTG CATG AT C CTCAGTT A AT TAG GTAACTATAG GTAGTCCTG CAGC CCAGGCTGTGTG CTGATTAGT CTGGGCTGTTGTTGGAGAC CTCTGAG AATAGACTTACTCTGGAAAGAGCTCTGATGTTTCAGGTCCAGGGTGATGACTTTTGAGAAAGTCTTGTAGGACCA TT CAAAC CTTCAGG CT CTATCCCC CT CT CTAAATGT CTTTTTTCTTGC CTGTATAT CATC TGCTATGC CTTGAAT CCTTGCAGTTTTCAGTCTCTCTGAATATATCTTAAGTTGCCTGATACCAGAGGTGTCCTCTCACAGTGTGCTCCC CCAGCATCGTGCTCTGAGCATAGGTGGTGTTCAGCAAGAAGAGAGGTGCCTGGAGTGTGTTCAGGATTCCAGAGG AGTG C CTGGGCCGTATT C C CAGAAGGAGAG GGCGTTGTTG CAGTATG GCTGGGTGGTGAGATAGGCCTC CTGCCT GC C C C CTCAGTG GGATGGTGCTTGTCAGTGTGAAGTAAAC CCTGGAGCAC CT GAGGGAGGAAGAGGAG GAGATAG ATCTTCAGCTTCCAGGGATTAGACTGAAAGCTCAACTGGCAAAATGCCCATTTTATTCCCTCATTGCAAGCTGAA AGTAGAGAGCTGAGGAAC CCCGGGCTTGACCACTTTTTTG CATTGACA GATC CT TTGTGGGAGACACATGATCCA CATGGG CATTCCTAGATC CCTTTGAAGACT C CATAAGC CCAG GGAGAT CAGATT TACCTCAGGT TT CAAGTAGAT CTCAAGATCCCACTGGAACTCAGATTTTCAGGGGAGTTCAGAATTAGGGCACGAAGTGCAGGAGATGGTATCAAC AG CAACAAATCTGT CT GATATTCTAAAAGAGTTCAT CC CAGT GTAGCC CCAAAGACTTCAGA CTGAAC CAGATT C A C CTTGAATTGAATAAG CTGATG CAATATTGGCAGCTACACATTCTTCATATGACACCTCTTTT CT CAACTTCT C TT CACC CTGTAGTCTC CA CAGAGATATT CCTGAGGGATAG CAAGAGTGGGGAGCTAACCTGTTT CTTTGCACTGA ATGAGCAATGCTTGTG CCTTTCCT CTTTTCTTGTAT CCGTACATTTCCAGAG GAGC CAGTTTATAGAG GGTATA G AGTAAC CCA(N)xTGGGGTGGGAGCA GGTGTTGAAAG GAGCCTGCTGCTGGG GATTGGCTGG CC CC CAAGGAGCT TATG CC CCTCTTGCTAGGATGCTGGGGATCAGGTCAGTAA CACCTGTA TTTACAGATGTGGG CTTTG TTAC(N) x GACAGT CATAGC C CATATATAACC CA CTT C CTGATGTGATTTGTTTGG CC CAGTTTTACAAAGTTGTC CTGTCAG AGGAATTTCATTTATAAAGTTCCTTATTGCTGAGCCTCTGTGTGTTCCAGTTATGAGTCACATGTGTCAACACCG TGGAAATGAAATTTGTAC C CCCTG GACC CATGCAGATTGATTTGATTCTGGG CTAAATGGAAATAGAACCTGTAC AAGT < N ) xCCAACAATGTTTTAAAGCTTCCAGTAGTTTCCTAAAAAATACCATAGTCTTTTTGAGTTTGGAGAAT ATTATTACTGTTGTCAATTACTTTTTATTAGCACCCTGTATGATATAGGTAC(N ) xTGCAAAAAAGCCAAAGTTC AACAGAGGAAAGGC TGAC TCGCAC CACCAGAACTGGAT GT CC CTTCAGAGGTGT CT CTAATC GAG G CAAGGGGCA GT TT CCAAGGAAGG GTGCAGCAGC TGGGGAATAAGAGT GT CTGCCCTGAAAG CAGAACCT GAGTATAATATCATG GTATCCACTCCAGGCATTCAAATTCATTCTTATCACCATAGCTAATACTTACCGAATGCTTTCTGCTTGCCATAG CTTGAG CGAGTGTTTTAG CTGCAC TGTTTC CGATTCATG CAATGTAGAGATGAGGCAGTTAAG GG G CAGAGAAGT GAACTGCACAATGAATACCATGGTAGGCAGAGAGGTGGAGAGGGGAATTATTAATAACTGTCAGACTACAC(N )x TTCCTATTAGCCTATGATGTCTCACAGAGATAACCTGTCCAGGTAGATACCCAGTTTTGGCTGGAGAA3GCATCA CAGACCTTGGGTAATTGATGGCCCACCAGTTTGCACCCCTTTCCATTAATGTGGACCGATCATGCCCTTTGTTGT GGGTCAGT( N) xT T CTCCCCTCCCTGGCAAAAGGGTTCAAGGCTGCACAAAGATTATCTAAGGGCAGATACTCTG AACATAGC CATAGTTTATTGATTCAGCGTTTTGT CATTTG CTTCCCTCTCT CAAGAGGCAAGAGAGAAAG CGAGA CTGTCTTC CATGCAGT TTGCTCAGAATT CTAACACTAGCCTGTTAGTCTñATCATAACA CATTT CTTCTT GGAAA CAAACATAGCAACAGCAATTATATCTTACTTTTAATGTCTAATTGGCTGTAAACTTGCTTTTACTAAGTTATTTC CATCGAACATGTAGAGCATTGTGCAGTTTGCTTATCTTGATGTCCTCCTCTTGCAGTCCCCATCTTGGCTTTATT CGGGAGGCCAATTATGTGTATGCTAGCCTTGTGGGTGCCCTAGAAACTCCAGCAAAGGCCAGTGACAGTGTTGTG AGCCTTCTGCCCCTCATTATCCACAGAGGCAGGAGCCTATCAGAGCCTCCAACCTTGAATTTTTGTGTGAGTTCT GCTTCTGTTCTCCTACTCCAGGTATGTCCCCTCACCTGGCTTCTGGAGTAACCCATCCCTGCTCTCTGAACCCCC ACTTCGCTGTGCCTTCCTCTGCTTCTAACCCTGCCTCTACGATGGGCTCTCTCTCCTTTTCCTCATAAGCCGATC ATTCTTTTCCTGCAAGAAGTCCCACAGGGAAGCCATGCCATCCTAGCCAGACTTGGTTATGGTATGACCTGGCGT TCAGAGGACACACCCCAGCACCTCTACACTGGCTTTCAGCCAGATATGTCTTCACCGCTTGTTGAGACAGAACAG CTTTCTTCTACCTTTATGCTATGAAGCAGCCTTTCCAGGAGCCCTACTGGGGCCAGAGCCCAAGTGGGAAGATCC TCCCTTAAGCTTTCCCCTCTGTTTTTCCAACTCCTGAAACTTACATCCTTCTGCAGGCTTCAGGACTCAGTGTAG GTGTGTCTGATGCCCGGTCATCCTAGAGAAGGGGTTGTGGGAGGGGCTGTACCTGTTGCTATGGCAGCAACCAGT TGAGATCATGAACCCATTTGATTCAGAGGCAATTGCAAGGTGAATCAATCTCCTTGTCACCCACTCCTCTTCTCC TCCCTCTCCATCTCTCCCCACAGCCTTCTGTCATCTTTCCTCCTCCCCACTCAACCTCAAGCTTTCTCCTTCCCC AGTTCTCTCTCTCTCTCTTTTTTTACTCCTAGTTATGATACATCAATGTAGCAGCTAAAGCTACTCATGGTAGAC TT CTTATTTATATTATA CTTTAGCA(N) xCGCTAGACTATGTTTACCCAGTTTGATTTGGTTGTGGCTGGAATAT TC CTCTGCATTTTGATTCTAAAGTATTTGATTAG CT CATT C(N)xAACCTGCAAATAATAAGAGATTTAAAAGTA TACAAC CAACT CCTTG CTCATG GCAAGTAAAATATT TAGATGATAGGAATATAAATAATAATGACCTATT CATTT ATTTTTCTTTCTTTGCCAAGAATT{3CCTGAAGGGAACTGAGCTTAACCTATTTAAATAATAATñTTAACATCTAA TGTTTGTGGAGCATTTACTATGTTTCAGATACTGTACCACATACAAG(N) xACCAACATATTTTGACCTGAGGCT GAATATCACAGAATTGGCCTAGAGAAATTGATATCTTCCAGACTAAATAAATCTCCAGGATTTTGTCAAGAGAAT GGATGGGGGCTGGACCTCAATGTGGGGATTAGAGAATTCCAAAACAGGCCTGAAAACTCCTTGGAGCACCAGTCA GAGGAAGCAGCCCTACTTGGCTGTCGGAGAATCAGTGCACCCCAATGGAATATAAGAAAGGCCCTGATGGATGGC CAGGATTTGGACCATGAAACTTGTTTTGGAGGAGGGAGGGGGATTGTCAGGACTGAGGCTTTACAGTCTCTGGTT GACACAGCTGAGGAAGGCAGAACCCTCAGGCTGGGGTGTGAGAGTTTCAAGCTTAGCCTTGACCAGTCTTTCCAT CTAGAGAA CCAGGGTCACTTTCAGGCCCAAATGC CACGGGGCTGGTGC TGTCAG CATTATCAGAG CGTTT CCTAG CAATGCCCATCAGCAAAGAGGCTGTGTTTTCTGAACCTTCTCCAAAGAGGCTGCCCTCTCCCCTCTTCAATATGC ACCCCCATCCCTGAGAGAAGTGATTTCCTCTTGTTATTTGGAAATCAGGGAGAGAGTTTCCCAATCTTTGTCTGG GGCTCTACAGGGACCA CAGT CCACATTCATGTTGAGTCAGGTCTCTCTTTTCTGTCTCTTAGACCCAGCTTCACA GCTGAGAC TGTAGCTATATGAGAAAGCAGAATCTAC CAGTTTTGTTTTAATCCTTT CCTGAG CT CATOAGAGATA AGAGTGATAGAAA CTTTCTT CTTACAGAGAGGATTAGAAGTT CT TTGTGG CAAGATATACCTGGGAAGGGGTTAT TAAGTGATCATCCCTGATAAAGCCCCTCTAATCATCTACTAGTTTTGCTATTTTAGTGGTGGGAAAAAAAACACA AAACCCTAGGCACTTAA CT CACTCTTGG GGAAAT CT TTCTAA( N) xTTCTGTGTGTCTTTGTGTCTTTCTGTCTT TCATCCTTTCTGCTTGTCTCAGATTTAATCGGAGCCATACATTCCGCTTATTCACTTATTTG( N ) xTTCTTGTTC TTGAGG GT GTCAAATATTGTAGAAGTGAACAACATACCAG CC CACAAT GATTAT GTTGCAGTTAAGTGGTATG GT GGAATT CAGTGTAG GG CATAAGGGAGTGACT(N) xGTAGGGGATTGGCATGGGCAAATTCCCATTTCAAGGTATT AAGCAGaGGAGAGACAGGGCAAGATGTGTTGGTTAGAAATGACAACTCTTCCCTGCTCTGAAGAATGAAACCAGC TGTGCCTGGGAATCACTTGCCCCACTTTACAAGATCTCCCTGGCAACAGAATGGCTGTTGTACAACTAGCCAGGG AATTTG CTGTGñCñGT CTTC CAACCTTATCTGACAT CCAGAG CAGGGAAGGAATTG CCAAGTTAGGGTATAGATA TCAAAG GAATTGAAGCAACACAACATTGGAATAGTATTACAATATGAAATTAGAAACTATTTTCTCTG CT G GCAT TGTTGTAAATACCAGCATCCATCACTCCCTTCTCAAATATATTGTAAGTTAAAACATTGCTATTTTTATCTCTAA CCCCCTTGCCCTCTAACCCAGGCAATACAATATAGCTTCTTGAGGAGAGGCCCAAGCTGGGTGTGGATTCCGAAT CTTCACCCAGCTTCACATGCTTATTGTGGGAGCTTGGGCACATCCCCTGTGCCTTCTGAGCCTGGGTTGTGTTGT TATTGT TG CTTGTTGGTTTG( N ) xGTTTGTTACATGTACAATTGCAATAGCACCTCTCTTGCATGTGGTTGTGAA GATTAATGTATGCTAT G CAC CTG GTCCAATGTCCAG CTTAGAGTAGAGTAGATG C C ( N) xGAGTAGATGCTTAAC AAGTGGAAACCATTATTATTTTTCTCCCCTTCGATACATGGAGCCTTAGGGTCTAAGTGACAGGAAAATGTTCAG ATAAAGATGTTTTAATTATGTCTTCTTAGTACTTCCAGAAAGAGGTCTGCATGGGTTTAGGTCAGCTCTGCCTTC CGACAGCTTACGTGCATTCTTGTGTTGCTCTCCCATCTTCACTGCTCTTCCTACATTCCACCCTAATGAACGGCA GAGAGAGCTATGCAAAGTGGTCAAAGTAAAAAGGGATTCTTTCCGATGTTTCCCAGATTGGAGAAACAATGAACA TGATTACTTTflTTATCTCTGTTTTGTTTTAATCTTGTTGTGCCCCAAACTAATGGGGATGCTCATTTGGAATGñG CTTTGTGTATCTTTGGG CAAAGAGGAAAAC CAAGATGAATTT CTAGGAGGGCAG CAGGAGAGTGAATAAGAGAGT AC AAAT CATCAC AAAT CAAG AGATGAGT AAAG AGAAG GGG GTTG GG AATGGCAG AATTGG AACCTG AG AGTTAG A GT C CATGAGAGAGAGGAGAAGCAAAATG( N>xTGCCTGACACAGTGATTTCTATAA( N) xGTGTGCTATGAGAAA CTGACCTCATAGGG CCATCTTGGAGTGACT CACC CATGTACCTT CAGG CCTTCAGGTGATTAATAGGCATGTAGG AC TGAC GTTTCATTAT CACTGCATCAACGC TGATGTAG{ N) xCCTGTTTTTGTTAAGGAATAGGAGTGTTTCCTG TGGCCACAGAATAAACGAAACTAACTGTCCTGGGGCTTCTTCAGGAAACCAGTGGGGCTAACCAGACTTACATAG TGTTTGATACACAATCATTAATTAAAACAGAGATGTTCAATGATAAATTAATTCGCCTTCTGCTCAGAATTCATT GAGGAGGGAAGAATGGAAGCCAAGAAAAATGATTATCTGAAGCTTTTTTCTTGAGCGTCTGCCCTTGTAGGCTGT GCTGAATATTTGAGTGAGCTAAGTGTACTTGCTGTATGTCAACAGAGATGACATTTTACAGGTGATATATTTATT TCCAGAGAAGGAAACTTATATAGGCTTGTGAA3CAGAAGAATGCCAGCAGGTTTCTAATCTCCTATTAGTAAACA ACATTATCGAGAATCTCCTGTTTGTGGAGTAATATTCTAG3CATTTGTAGAGTGAACACAGAAATAAAGATGCAG TTCT GTCTTTAAGAAACATT CAGC CCTAATGGGAAT GGAG GAC CAATATGTAAAAGAGA CAGAACACATATAAAG AAGGTCCATAAATAAGTG CAAATAAATG CCTGAGTG CAGCACTTAflTGTT CTGTTTTCTATTATAGGTATTTGTA CTCACGACTTAGCCTTCTTATGGATACCAAAGCCCTTGAGAACTGTTTTCCTTTCTCATTCATCTTGTGCAGAGT ACACACTCAAAAGTGTTGAATGAATATGTATGCTACTGTGAGGTGTGCCTAATTGCTGGGGGAATTCTGGAGTGA AGTAGAATGACATGTTTAGG GGAAAC CTTATAGCGG CT TAGCAGTGTTTT CAAGTATGAGAGGACTTGTAATTGG TGGAGTAAGGGGACAGGAGGGAGAAGAAAGAAGGAAGAGTTCACGGGGACTGAGCAAATCCAAACAATTTATGCC ACCATTCAGGGT CCTC CATGAT CT GATC CTTCAG GT CT TACCTTGC CTTGCTTGCTGC CACCAAAG GCAGCCCAC TGTGGCCTTAGCATTCCCACCTCTTTCTCTCTCCTTGGAATGTCTTCCTCTGTTGATTGGCCATCTGAATCCACC TTCCACTT CATGTCTGGGTGAAAGTA GAGT CT CAGGAAGC CTTACATGACATCT CTATCCTTGOTCCTATCTTCA TTAAATGG CAGTGTTTGCAGGT CTGATT TCAGTGGT CTTAAC CTTCAGAGGAAACGAGAGA CAATCATGGTCTTC AACCTCGGGTAG CTTT CAfiATGAGAT CATGAGGCAACACTAATTTTTTG(N) xTGGGGCAACACTAATTATATAC AAAATAATAAGAGGGAGCAGGG CTGTATATAATTAGGT GC TAAACAGAGTAGA CAGAA( N}xAGAATCTGCAAGA TGATAAAGGTTAGAGTACTCATATTAGACTGAGTGGGAAATAGTGATTTGTTAGTAAGAGGGGAGAGGGGAACAT CAGGAAAAGCAATAAATGATGG CGAC CAGAAGGACT GATT CCTGACAACCATAGGACAAAGTG GAGAATCAGAGA GGGACTGGTTGAAGAG CGGTTAC C TTGGTATTAT C CAG CATGTTCTTTCTGGAATAGGATGGGCACTGCACAAAA CGTT CACAATAGAGG ATATATGAAGC CTCCTGTACTCTTTTGCGTTTCTGATACTCACCTCCCCTCTTGTTTTCA GAGCCCTTTCTGGTGCTTGCCACTCTTTAAAGAATACAAATCTCTCATCAGCAAGCCCTGTAGAAGGGGCAAAGG GGTGATGCACTTTAGCTCATCACCACTGGGCACGTTCTCTTTAATCCTCTCTTAAATTCACCTGCTTTGGCTGAT CTGCATACAGTGGGAGTGGTGT CAGATCAACACACTGG C C TTTTCC CAAAGCAAAGCG GGAT CCCAGCCTG GATG GCCCAGCACAGGGTCAGCTGCCTCCATCCTGGCCCCCCAGAGCATCTGCCTCCCCAGCACCCTGGGCTGCAGTTG CAGAGGGC CTTCTTTT TAAT GTTTTT CTAATAAT CATTTTTTTCAAACCCTAAAACAACACTTTGTTATT GCATG ATAGTTGC ( N) xGAATAACT CATTTCTACC CC CT CACT CC CC CCACAGGTTATCATCACTATTATTAATAACATT TTTCACTGTTTTGACTAAAAAAGGTTTTCTTTTAAAATTCACCCATGACCCAGAACTATTTAAATAAACTAATTT TATTTTG GAGAAAGAC CCTAAAAC TTTTATTATAGAGATTTT CAGT CATG CACAAAAG TAGAGAGAATAAAATAT TAAACCCTATGTAACCCACATCAGCCCCAGCAGTTACAAGAAGACCTTAAAGTCTTTAGAGGATCCGGCATCTCC CTGTTGGG CTTC TGTGTCAGAGTT CAGAAAGG GAGAGGGGAGG GACAGGATAAATATCACATGACAT CAATGTT G GAATTACTAGGATTTAAGCAGAGGGATTTTAAGCACAGCAGGACCCTGAGCCCTGCCCATATTGGAGGTGCCCAC TTTTGCAGACCCTGCCACGGCAGCCTTCGT
> H s 2 _ 75150255 -7516 71 08
ACAGTCCCTCCTCATCTGAACCTCACCTTTCCAGAGTGGGAGAGAATTTTGTCCCTCTCTGCTCCCCACTTGCCC CTGG CCCCGGTTATCT CTGGGCAT CT CT CTTTTT CTTCTCTTTTCCTTTT CTTT C C { N ) xATCTCTCACCCTTTT GTCCCCCATTTGAAACTTTGCCACCAGCCTGGTCCTGGCCACAAGCAGGGTCACATGTTTTCCTGTCTTACTTCA TGCCCC(N)xATCACCCCTTGCTTTTTGATTGTGCTTCAGCCAGCGCCCCTCCCGAGGCAGGTAAAAGTGG(N)X CTTTAAGT TTTATGGTAAAAGTGG TATTTAGGAGACTC(N) xGTTTTTTTTAAATGTTAGGCATTAAAATGAAAG ACAGTAAACTCAAGAGCTCTGTGTATGTAAGAGTGAGAATGGGGTGGCAGAACTGGGAAGGGGCCTAAAGTAGGT GAGT GCAACGAGGCAAAGTACAGCACAT TT CTGCTTTC CT GGAATAAGT C TGTAAGCAGCCACAGCT CTC CCAAC CCTGCCCCAGGCCCGCTGTCCTCAGCTATGACTAGCAGGAGTCCAGGGTCCACTGCTGGGAGCCCACCTCCTCCA
t c a t g t c t t g c t g g a a t c c c g g g g g c t t a t g c a g g c c t g a a t g g g g c t c c c t g a g a c t t c t g t c t a a g g a a g c c a TGCACTGG CCAG CTTCATGGCTGTGG CTGCTCTTTC CCTCTCTGACACGGGATG CTTTTGCTGGGTGATG GCCC C TTACCTTGGCTTCTGCTGTTTTCTTTTCCAGCAAAGGGTGCTTCTCTGGGTGGGGCTTCAGAAAGAGGCTGGCCC CATAGTGCCTCTCCCTTCTAGGATCCGCTCTGCCCCGGTAGTCTCTTTTATTCTCTCAGGGTGCAA( N) xTTGTT CAGTGCCTGCTG GTGGAC CTGAGAGGG GAAGAGAAGGTG G GG TGCTGACCAACTGCCTTCATTTATA CTTTTGCC AATATTAG AGGC AGAT GTGGGT CTGGGGTCTATTTAGG AAAG G G AAAAATGTGATTCCTTTTTCCAACTGTGTC C CCCCTTTCCCTAACCTTCCTTTCTTG TAGGGT GGATTAA CAGTTTTTTTT TGGCAGAAAAACACAC CAAC CAA CA AATGAACAAAAATCAACACCCCAAGCTATGCATACAAAACAACAGGCCCAACCCACACACTGAGAAATCGGACAC TCAG CAGG CCACTCTG CCAACATG CTTTGAAAGGATACTC TTAAGAGGTTTTGGAGGTAGGGTTAA CCTCCGAAG GGGACAATAGGGACTGATGTTTGC CT CAGG CTGGTAGGGACAAAGGGCATTGCAGAGGAGAAGACAATGGAAGTG AGCCTTGTGGGCTTATTTGGTGGCTGGCAGCTTGAAGGTTTCTCCCTCTAGGGCAAA(N)xCAGGGTTACTGTCC TGCTTGGTGACCTGCCTACTTTGC CCAAGAAA CAAGTTGAAAACTT CCACATCT CGGGAAATGCCCAGGT CAGG C TGCATCAGGGCTGTTC CT CAGGAG CT CAGGGACAAAGT GAAAGCAAAGGACACCAATG CTGGGTGCTTGG GACAG TGTC CTGTTCTC CTCATGTGGTGGTC AGTG CC AGT ACACATAGGCAGTTTTCTG CCTTT AAATAAG AT AACAT AA GAAG CTGT TTGTAAAGTGTAAAGAACTGAG CACG CATTAG GTGATTTCTT TT C TT TTT TT C (N )xG CATGAGGTG ATTT CTAC CAGTGAAGGGTAGGTGTGGGTGTTTTGG GC CTGG CTGTTCTAT CTCAGGG CCTACCAT TGTCGTCTT GTTCACAC TCCC TGTG CTGATTATGAAG CTGA{ N ) xAATTGGCAGAGCAGGGACTTGCCTCTTCCTGGGACTGCT TCTTGTCC CTGTGAGCAT CACC C CAACAATGACT TTTCAC C T T (N) xACATGCCCCTTGGAGGCATTGCCTGAGA AGCACATCAGGGTGTAGC CTTACCTCTCTGTG CTGCTC CGAGGACC CTGAT CTCAGAAA CCCTGC CAGTGGCCC C AACATGTGAACCATCGTGGTAAAAGAACTCTCTGATCACAAGAAAGTCCCCTAATCTAAGATAATGTTGATAAAC AAGCTAAOTCCAGGAGGGAGGGTTTAATTTAAGAAAGCCAGTAGGGACCCTAAATCTTTTCATTCCAATCAAAGC
t g t c a a a g c c a g t t a t c g t c a g a g c c a g t c a g g t g g c c c c a a a g c c g c c a t c t a c c t g g c a g c t g t c c t g t c a c c A CGGAAAC CACAAGTTGGTGAACAAG GC CCGAAGGAAC C CTG CTGAAACT CAGTTG CTTGACATAT TTGT CATTA T CGAGT CTGC CAGAGTGCAG CAT CTCTT GCAAGAGT GATT CT CT GAAACT TCTC CTGCTTCAAT CCCTCTACCCA TTGTTT CC CAAATAGAGAATAT GC CCAC CTCTGAGAAAGACATGTGCTGG C AGG CAT CA CAGGC C CAAGAGGAGG CACTTGTT TACT CGA CAACAGACAAG GTGACTGTCTGCTG CATTGGCACCAGGACTAG CATTCTTTGGGCAT CCC CAACTACCTTTTCAGGCCCCCGTGTCTTCAGGCTGTCTAGGTCCTGGACTGGTGGTCGTCAAGTCTGCCCACCCT CACTAT C CAG CAG CACCAGTG GAG CATG CTTCATAACTAC CCTTT CCTCT CT CC GAATTGAGTTTGATGT CATCA CCCTGCAAACGATGAAACAAATCCCCATTTTATTGATGACTGATAACTTCAAATGTGACAAGAGGATTCATTTCC TTATTCAATCATTCCACAAATATTCACTGAGCAT( N) xATCATAATAATTTTCAGGTTATGGATAATGCCAATGG GCACTTAG GATGAT CAGGGAGGGCAG TT CTGAGAAGGGGCATATGAGCTG{ N)xAACTATGACCTAAAGAAATTA GT ( N >xGATGTTTCAAATCCCTTCAAGCTGGGAAATTCTGTCTCTGTTGCATCTTTGGGATAACAAACTTCCTGA AGAATCAC TGTGTCTTTTCAAAGACTGAAAATGGATTC CCAG CAATGTGAAATCTTGTGCCTGAGT CCTAAAGTG ATCAAAGG TAAGGTTGAGCCTAGGACAAAATGAGGATC CC CTGGAGGGAAGCTAGGGCAACACAGTAAGTGG CAA AAGAATACGAGTGAGGTTATTTGA CTAT CCTCTTCCAG TAGGGCTGGCTCAG CTATGG CAGACC CCAGGGTGGTT AGGACAAATGGCAGC(N)xGATGAACAGTATTTCCTAGTTTAATGTTTTTTTTTTTTTTTCTAGGCAGACTTTTG TAGTGCACAGTGAC CGGTGACCAG CAGCATGTGTTT CCATAGGCATCTTCTCTCAGTACC( N ) xGTTTGCGTTTT GAAAGTGAAGAATTGTCGAAGTGATG{ N ) xACACTCTAAATCTGTGGTAGAAAGAACTGTTGTTCCCAACTCCTA GTTTCCCCCTGTAATAGGGTGACACAGCCCTGCCCATGGTGTGACTTGCCCATTCTCCCTGAAGGGACACG(N)x GGGTCTGCTCTGTGGCTAGGGCTGTGGCCACTCTCCCATTCTTTTCCCTGGCACCATCTCAGGTCCAGGAATAAA GAAGAG CAA CAAAAGTTTTCAT TAGACT TTAAAAACTTTGAGTC TTACTCATAT TTAATCCTTAGT TG CT GAAGA AGTGACTTTCATATATAGGAGACCCTAAGGGGGTACATTCTCAGTGAAGTGACAGCCATGGGGGAGGGAAAGATT GATCCATATGAGGACTTCAAGGTCCTGGCATATCAGGACAGGTCTTTAGGAGTCCTGTTGACATAAATAAAATAG GTAGATTATTGAGACTGAGG C CAGGC CCTAAACACTAG TG T A ( N ) xAAACTAGAATTAATATTCCTGAGAACAAG GGGTGG CAAAG C CT C CTCCTTCAT CTGG GGTGAGGG GAGT CTTCTGAGTAGATAAAGAACTCCT TTTAGG GTAGA AAGGAG CTGAAGTT CATCCAGT GCAAGTA CTGAGTGAATTTATT CTAAGGTC CCAAGACAAGAGAAAGATAAAAA CAAAAACGGATGTATCACTGAAGGAGATGTGCTCGG CCAGCTTC CGGCAG CAGCAAGAACAGTGGGTATATAAGC TCCAGTAGCCAAGGTTGCCCAGGTTACCCTTTTCAGGGGCAGCAGTAGAGGCATAGTTGGTTCCGGAGTCAGAAA GACCCAGCAT CT TCTGCTGATG CT CAAG CTGCATGTTT TTTTTCAATCTT CG CAGC TACAAATT TT TATC CCAAT GGGGAGCACTGTTTATAGGTTGGAGGATTAGGTTCCATAGGAAAGAAAGAAAATGAGGAGAGACACATTGGACAA GTTGGTGGGAGACTGTGCCACTCGCCCTCTAGCCTTTGACCATAGACAAAGATTAAATAAGGTTATATAGATAGG AAGAAAGAAGTAGAGGGGGCAAAGAACCAGGTAGCTGACT CATT TAATCCTC CCAAAAGGGTTT TTCTTATTGCA QT'TTCTTTGCAACT CTGACCATATTGTAATATTAAT CAAT CAAT CTATCACTTG CC TTTGTTGATATTTCTGTAC C C T C ( N} xCTCTGTCTCTCCTCTCTTCCTTTTTGCACATGCACAAAGAAGAGGTCATGTGAGCATGGAGCCCGAA GGTGGCCATCTACAAGCCAGGAAAACAACCCTCAT(N) xAAGCCACCCAGTCTCTGGTATTTTGTTATAGCAGCC CAACTG GACTAATGAAATTTATTTAAATTATATTTC CC CCATTAGAATGCAGAAAC CAAGCATCTC CATTTC CCA AATACCACCAGTACCTGGCATG{ N) xTTAGTAAATCAATAACATAA(N)xCAGATATGAAAGGATGCTTAATACT CACAAC CTAGGGACACAAAT G G CATTTC CTTAACCAAAGT TAGTTGAAAGTG G G CCAGGATAAAATAACATC TTC AGACACAAACTACATTAAAGATTGCTTTAGCTGATCCAAGAGAAGGGTCATATTTGTTCCAATTTCAGTCTAGTG TCCTCTCTGTAAAAGGGATTAGAGATTTGCCAATCCACATCTGGTTCACTCCCTGTAAATCAATGCCCTCAGAGT CAGTAGACAGACA CAAAGCTTAGAG GAATTTAATTG CCAAAG GCAATGTGGGGATATGAGAGGATGAG C A T T T (N ) xACTCATATTTGTGAAAGTTCTCACGAAACTCTTCAATAGCCAAATAGATAGATGGCTTCTTGTAGTTCTATCT TACTTTATTTGAAAGATATATTTGAGAACACTTATCAGAGACTCTATGTCTCATCTTTGTACTGTGCCCAAATCA CTTACTTT CT CCATTTGTAGATAATCTC CAACTTAATTAC TACTTGTTGATTTTTGAAATTTGCT CATTGAGTAC CACA( N) xTGGCAATGTGATCCATAACCT(JMJxGGTGTAATAATGGTAATGTAGGGGACTGTCTATGCTCCAGAG ATGTAG GCAGGGTGTTTCGGGTTGAAGAGTTACAATGT CT GAAACTCGGTGTTAGATGATTCAG CCAAAAACAAA CAAACGATGTCT(N)xATTATGTATTATAATAAGTGGTTTGTAAATGTAGTGGTAAGAATAAAAATTGAGCAATA TTTACTAAAAAG TAAAATTTGAGAAC CTAGGAGATAAT TGTCTC TTGGGAAA( N ) xTATCCTAGAGTTTGATGTA AAGTATAAAT GTA CTCAATTTTGTTT GC TCTCTCTT CTTAGG TAAAAAGACATC CC TTAACTAT GG CAAAACAAC AACAACAACGACGA CAAAAAACAAAACAAAACAAAA( N) xCCTGCATCAATTTGATAAATAATCCTCCCCGTTGA TGTCACTGGTTT TTTTCTGCACAT CTTCT CACTGATATTTAAATG CTGAG CTTACACTTTCCAT GGACATG CTTA ATGACTAAACAATAGTTGAATT CTAAAT TAATCAACTGAC CAAC( N ) xCTGAGTGGTATACACAAAGCAATTGAG GAACAACC TA GGAC CTTGTAGT CCATTAGGATTTCCAAAAATAGAAAGAGAATGGAAA GTCACCTGG CTAG GAGA AGCCAGTGGAAACTTGACACGAAGAAAAAAGAGCAGCTAATTTCATTCCTGTCCACCAGTTATTTATGTGTTTAT CTTTAATTACAT TTGTTTGATTTC CCTTATTAAAGT CTGATGTCTTAAAAAAGCAGAAAAGTGAGG CAGG TCAGC AGGGGATGTAAGTTGGGAAGAAAGACAGGTGAGGGCAAGAATTTAGGCAGGAGCCACAGTGTTGGTTGTGCAGGT GAAGGT CAGGTGA CGGAGGGTAAC CAGT CATGGATGAC CCAGGCAGGAGC CATAAC CAAAATGT TAGAAAAAGTT GGTAAGAAATTGTT C C A (N) xGTTTTCCAGGAACAAGGGCAGAACTTAGCTACTGAGTTCTACCGAAGGCCAGGA T CTGAGTCAAGCAC CAAGGT TAACTC CCAGGGAAGC CAAATGTT CATAAAAG GAAATCTTTCCAAAT CACAAATA TCAGGG C C TG CTAGGTAAATTCTAGCTT CTCTGTGCCAAGGG CTGTGTTC CTACTC TCAAGCGC CACT CACTGAC TGTCTG GGTGGTGC TGTGGG CTTCTGTACTTTTAGCTT CATCAG CTGCAC CTGT C CTCTTTTCC CATAAGACGCC AGGACC CT CAGAAT CTTCCCTCTTAC CAGAAATTTG GGTTTTGGGCTGGGTC CCTG GAC CGAATTCTGACAC TCT AAGTGTTTGCAGACTATTTTGATAGAGGTAGTGATTATGATGCCAGAAAAGTGAAGCCAGTTTTTATTAGAATGT TGGCCATCTTGAATAATCCATTCCCCCACTTCCATGGGACATCTTCAAAGAGATTTTTAAAAGACAAATATGATC TCTACTAATGAGATATCTTCAGAAGGTAGAATTAAAGCAAAATTAATGTACACAGAATTCCTATTTCAATATATT TATTATTCA CAACATTT C TACACACACA
>Hs2_75699375-75718717
TTGGTATGGAGAAGTTCGAATCCCCTGGATCCACCTGGTGGATTTCCCAGTGCTCACGAACATGTTCTTTATGCC ACAAAATGTCAAAATTAAACCCTGCAGAGCAGATAATTATGAAACATGGAAAAGGATGCATTTGTAGACTCATTT CTGTTT TC CTAAAAATAC CCCT TCCACTTA( N) xTGTTTCAAATATGGCCCATCTCTGTCCTCCACCTTGAGGGT TCTTTGTTTGGAAGGCCACACTCCTGGGAGAAAAGGAGTCTTGGAAGTACTGACAGAGATAAACAATACTTCTTG AGTTTGTATTACATGTTAGACTGTTTTTAAAA ( N ) xGCTAGACC CCAGGGCC CTGC CTTAGCAGTAGAGCATGAA TT CATC CACT CAGAGTGAAGGGAGAGAGGTTGAC CAAT CTGGTAAA CACTTGAAATAATGAGAATGAAGGGGATG AGTGAAGGGTGAGATGGGGAGGGTGAGAGACCTAGCCAGTAATGGTTAAAAATCTTGTGGGGCAGCAGCATTGGA GAGCAGGGATTTGTAGGAGCAC CAAT CTGCACATTTGGGTGATT TTTGTTTTTCAT CAGTGACCTATACATATTG AGACAAAGAAGATAAGCAACTAGGCTTCTC CAGACCAG TA CATAGC CAGATGGTGTACAT CTCAACAAAAATGGG GCTTTTATACTA( N) xTAATTGTTTCTTACCTTATAGTGCGGCGTACATAGAGCTTAACAGAAAGCCTTG( N > x C CTTGGAAGTGAGAGAGGAAAGAAAGCAAAAAGTATATGGGAGGCAAAAAGTATACGAAAAGCAAAAAGTATATGG GATAGGGTGGCCAGGAGGGGAACGGGCACCAAGAAGGCTTGCATTATCAGCGTTCGAGGCATGGAGTGTCGGGTG TTCACCCTACAGATCCCACAGGACTCGGGCACAGGCACTCCCTGGCTAGTTAGGCTGAGGTTTGGGGGTGTTGGT GTGTGATC TT CAGAGGCTGAGGTGGG CGGGGACTGACTAGGG CCAAATGAGTTTCT CTCC CCTTATTTTCAATGT GTGTTCTTTCCACAAGTTGCAAGGTGGGTGAGGAGGAAGCACAGAAGTTGTGTGGGGTAGAAAAAGCCTGGGGAA AATGGTGCCTTTAGGCCACCCATCACGTGCCTCAAGTTCCAAAGAATGTGAACAGCAGGAAAAGAGATTTGCCCT CTCCCCACTGTCATACATGTGACAGTAGAAAGCTGAAGATTTGAGCCAGAAACTGGTAAGTACACCCTGGTCTCT CAATAAAG TATGGCACAGGACTTTTT CTTGGTCCTCCC CGAG CCTGGT CTGATAC C CAGGAATT TATATCAG CAG GGTGGG TGGTGTGGTTCAAGTTTTGG CCTCAGCACAGGAGTT CTGGTGGCTGTTTAAACAGAAAGCAG CCAGGGT ATTTTTGCTTGTGATAAATGACAGCCTTACCTGCTATTCTTTTCATTTCTTTTCTTTTTTTATCTGTTCCATAGG TTGATT TT TTTTTCTGTT CCACAGGT CACCTTTTTATT TTAC CTGAAGGTAAAACCTGAC CCAATTCATCAATAT TCTCTCAAGCTTCCTTGACCAGGAGCCCAGGCAGGTTTGAGTGCAGCATTTCTCCAAGTGTGGTCTGAGGACCAC CCACAAGAGAGAGGCCCTGCCCAGGTGCTACGTGTTTATCAAGTTCCCCAGCACACTCTCAGGCACATTCTACTT TGGGAACCTCTGTTGGAATGTC CTGCCCCCTCTCGTTG CTTCTTAATTAATTTCT CTTTATTTC CTCCATGACCT TTAGTCATTTACTTACCAAAGCCCCTGAGAATGTAAGCCCAGGCTAAGTGCTGGGGATGCCACAGCAAACCACAT AGGCACAGGCCAGCCTTCTCAGTTTAGGGCTTCTGGTATTTGGGGAGGGGAAGGGGAATAGTAGAAAGGTTGGGA GACTAAACAAGCAATCATAATAAAGCATGAAGAGCATTACAATAAACTGTGGCAAGAATAATATCAGAATTTGTT AATGTGACTCAGGAGGCCACTAAAGAGAGGGAAAAGTGTGACACCCCCACCTTTATGGTTCAGCTGCAGATGGCA CC CC CTGGGT CCTAGTATATACATGTAGGAGCCCATACAG CCTT CCAGAGAAAGGTGAAGATGAGAGACAGA CTG AG CT CTAC TCATGGGATACAAG GAGGGATGAAACAGTGTATAGAGGGG CTCTATAC CATC CATGGCGTGGGGAGT TAAGGACAAAAGTCTGGAATGATTTATGTTTCAATGTCAGTGGAAGATGCCAAACTTTCCCAATATCTGTGACCT CACCCAGAGTAACAAGCAGTAAAAGCCAGAAAGTAAAAATGAGGGTGGGCTTGTATACAGCCTAGCAGAAAAGCG GGTGTAGG GAGATC TGCAAACCATGCTGCAGACAGCAG CTAG CC TGGTGGAAAGGACAGACAGTGAGT GAAAGTT CTATAGTGAGAAATAAGG GTGCATGGGGAGCCAG CTTCAGGAG GAAAAAAAGACC CAAGCAGCAGACGTGATGTG GCCCTAGGCACGTCCACACTCAATTTTCCCCACTGCCTTCCACATAGTCAGACCTTCCCTTACAACTTGGTACTC TATCTCTT CCTATAGGCAGATT CAGTTCTTTTTATTTT GAATTT CTTTTAAAGTTT CATT TCATCCGGC( M) xT T TCCTAGGCATTCAGTCAATTTGTTTCCTGTTCCTTCCTGTTTTAATTTTTTTTTCTATTTTAACAGCTGCACTAC AATTTATT TCTATT CTTGAATG CTGT CTTAACTTTTTTG GAAGTTTATGCAGATATAAATTATGATGAAATGAAA CACGTGTGTCCATTTTCTCCCACCCACCAATCCTTTACCCCTCCCTGACTGCAGTCATATTGCCATGGAGTCACC TCTTCTTATGGTCTTTCCCCAATCTCCATAATAATCTTTTGTTACTATTCACTACAGGCACAAACACACCTGCCA GACCTCCCCATTCTCTTCATTTTTCTCCTTGATTTCCACCTACTGGCCTTGCTTGTGGCTCCCACCTCTGTCCTG AC CCTAACATTTAGTGTT CATTGTTCTTCC CTAATTCTGATG CATC CC CTGT CTCC CAGATGTT GGCTAACATAA CACC CATT CC CCTAGAGAAGAATTAG TGTCAACTGTCTTT CCACAACAGATT CCCCTAGG GAAATTGAGCATT CA GTTATGAGGGACCAAAATGGGACTTAGTTCAGAGATTCTTACACCCTGCCCCCAAATCCTGAGCAATGAAATAAA CTTAACCAAGGTCTTCTTTTGTGCATTTCTGTCCTGCTGCTTGGTAAATGTGTGCTCACTCATTCATAATGAAGG CTTATCAGCCCAGCATTGCTCATCCTGCTGTTTGACAGTTGTGCCTCCTAAACCCTTTGAATCTTAAATTAGCGT CACAGCAATGTTATGATTCATTGCTCTCACTCTTTCTCGAGGTTGTGGTTGTTTCTTTTAAACAGCTCTTCCATT AAGGTGAAAGAGCCAACACAGAACACACACATATATCTTCCCCAGTCCCCAGCTCTGTGGGTGGTCCAGCAATTT AGTTCCAAATATAGCCTCTGAGAAGGGGAAAGCATGCTTGTTTGTCTGCTCTTGAAACTTAATTTTCAATGGAAG TCAGTACTGGTAAAAGATAGTTTAAAATTTTTGAGGGGGGTATAGCTGTTTGTGTTCCACACTATCTGAGTTCTA CC(N)xCACAGTCTTAAAGATGAACTTTAGCTTTGCCCAATTTGGCTAATACAGAGGATGAATATC( N } xTATTC CAAAAATG CATTTCTTTT CTAAAATAAGGAGGAAAATTTAAAATAGAGAATG GATCTAGG GACAAGAT TTTACCC ATCATTATTTAAGAGCTTAATGATGTAGGATTGTCTCC( N) xTCTCCTCTTCTCTTGTCAAAAGGAAGATGTTTT TGTC CTTT TGTGATGATTACTTTGTATGTC TGAGTATATAACTCAC CT CTTTAGTGGTTTATGG CCCT CTCACAG TCATGGGGCTGTGCAGCTACCTAGCTTTATTAGTCCTAACTACCAGCTTTAAATTGTGGATGCTGTCATGCCATA GG CATATT CTTATATGAATATATAAGAATATATACATG C CTGAAGCGTGTTGTAAG TAGCATGT TTTC CTTAA CA AAATCCATTTTCATAAATAAAAACACTCTACACATACATATCCCAATTTTCTTTTTATCCTTT(N)xTACTGAGT CTTGTATAAATGCTAATGGAG(N)x CAGGGGAAACTCATATCTTATTCTCTTATAATTATTTTTAAGATACATGA CTCTACAAATCACAACCTTAGTGATCACCAATGACCTACAGTTGCCAATCCAGTGGGAATCTTTATTCTACTTGA TTTCTACTTAGCCC TGAGGACCATTTCCTG CTTTACTC CfiGTTG CCTTTCTCAGTT CCTGTTGTTATATATT AT C TT (N)xAATTCTGGGACCTGAAATTCTATATGTTC AAGTCAAACACATGATCA CTGTAGTTACTAG CGTCí N) xA CCCACTGCCCCCAAAAAACCCTCATCAACTCCATTATTTCTATTAAACAAGATAATGCGTGTAAAACATTATGTA AATGTAACAGTACT CCT CTATGTTATTAATTCTACTGGTATAATTTTTTT( N) xTCTACTGGTATAATTTTAATA ATATTGAAAGTTTAAATAAAA(N ) xCTTTCCAATCCAGATTAACTTTTTTTGAAGCACAGTGCTGATCATATTTT GG CT CATAAACCTT CCATACTCTC CACTG C CTAAGGACAAATAG C CAATCATAT CAGCCAGTTAGTTCTCAAAGT CCAG GAACTAGATC CAACTTAGCTTGCCAGATGTGCTCTC CACCAT TC CACTTAGCATGC CCTCTG CCTGTAGC C AACAGGAACTACTCACTGTCTGTAAATTACTCCATCTC TGAAACTGTCTCTGGC TGTTTC CTAGAGATGAAATG C ATCACTCCCCACTTTTCTGTCAGCATTCTCATCACATTTTGGACCCACTTTTAGGATCCCTGTGGATATTTCAGA AAGGTATAGT GACATAGTAATTTAAGATTATTTCAAATGGACCAAAAT CCTGGAT CAAACAGACAT TTTC TGTTA ATCTAGGATGGTAGATTCTGGTCTTGGAAGATTCTTTTATATATGAGAATATGTTATTTACCCACAAAGGGCTTT GGTAAACAATAAATAATCTAGTGTGCTGGTAACTCGAATGTACTGATAAGGCCGCCTTTCTGAGAAAAGAAGGCA AATATGGTGAGTAC CTGCTGTTTGCCATCTGCTGTTTG CCAACTGACA CAGGAAGTGTAAGTGT TCAGCTTTGAG GAAAGACGGCTAACGGATACATGTTTACTTTTAATGTGTATTTATTTGGCTTTCTTTGTGTGCTGAGTCTTCATT CT CACTAAAACTGAATACTT CAGGGAAATGGAGG CAGCTC CATGGCTGTGTTCT CCAGCTGCCAGTTTTGTCTTT ATT CTCGGAAATCCTAA CACTTCATATCTTTCAGT CTCATTCGTAC CAATTTATGCTTTG CTAAATAGCCATTAA GCACTTTT TTATCTTCTCTGGCTGACTGAGGCAG GGGACATAAT CC TTTTGATGTT CTGAACATAGT AATTATT C AGTAAACATTTGTTGAAGTGCGAGTAGGAACTCCAGAATCTATGTCAAGGGATCTTGGTGGGTTAAAAAAAAAAG TGTG CTTTA CAAACATT C TACCCTTTCACAGATT TTTTTCATAT TAAATAATAGAAATTT TCAGA CATAAC CAAG AGTTAGTTGTATTTATTTGTTCATTTTTCACCAAAAGAAGGCATTTTATTCTCTTTTGGCAACTTCAATTAGCTT TG CAGAAAAGAACT CAAAATAATACAATTAGTATTACCAC CTTGTACAGCAATTG CAACAACAC CAAAATTTCCT CTGCATTT CAAAAAAAAG CT TTTGGATGTGAAAACTTAACATTC C(N)xTTACCTGACTAAATATCCTTTAACAG CATTAACAGCATTAA CAG CAGAGAAATT CAATTAATAAACTGCCTG CCTCTTTCATTTGCAGGCTATCAAGATTG ATTCAGGTTGATT C CCAGGGTTCTGTTTTATTTC CACC CAGGAAA CAC CAGCATTCTTGAATCCTTTCAG CATTA GGCAAAGGAGCAGATGTGGGCATTCTGCATCTCAAGGAGGTCTTTAGCAACACCTGCTGCCTTGCTGGTCCCTAA AGAAGTCTTG TCTGTGGTCC CCAT CACCTAAAGG GAGCTG GTGATTGT GGTTGAAGACAT CCTGGGTAAAG GAAC AT TGTGGAT CATTG CACATACATGTAAGTTGGTT CAGAAAGTTCAGAAACTACT CAGGGATTTG GAGAGAACTT C CAAAGCTAGCTTTGGATTTT CACATTTTTATGTT CATATTGTTCACAAATAATT TTTAAC TGTGAGCCCTGTACC ATTATGGACTCAAATTTCTGAGAAAGCCTCAGGAATTCTATGTACTGGTGTCTCTTTTAAGGCTAGGAACACAAG CAGAAACT CACTCCAAATTTGAATTTTAATATGTGCTCTG CACT CAAT TTAAATGC TTTAAGGAGG CACTTCCTG AAGTTTG C TCTATAGCTC CT CAGTTCCAAGGGGTGGTCAC TGGTGAATATAATTTGAGAAATGT CT TAAGACATA CTGGTTTGTTTAAT GAA CAGTTTC TGGAAAAAGGACATAATTTTGAACTCTAAATGTTGTATCTG C CACAAATAA CTGGCTGCCAAGAATGCCACTGTGTGCAAATTACAGAGATTATAATAAGCAGATGGATTCTAGGTTCCGGGGAGG AAAATTCATCTGAATTTAAGAGGTAAGTGGAGCTAAAGATTCTAATTGGACAGAAAGTCACATGGTTTATATGGC TGACATAAAAACACACCCTGATAATTTT TAGGG CTCTCTTCTTTCTGCTCTTCTGCCAATGTGGATATCATTTAG TTTCATCTAGACACCTGGGCAGAGGCCACATAGAACCTGAAGCTAGCTAACGCAACCATATCTGATCTCCCACAT CT TG CAAT CATCTTTGT C C C CTTGACTTTTAGCCATATTC TACTTTGCAGTTCTTC TCTTTAC CA CTGAC CTTAC AACTGGGCTCTTCCTTTT CAAAGTGCTC TC TAAAACTAGAATTC TATGAT TATCTG TATGTTCTACACTCTGAAC TT C CTCTTA CAACTGTAC CT CTC CTCTTAC CCAG CTGG C CTATGG GAATT CATTATACAT CATGAGACTCTGTG G GGAACCCTGTGATTCTTCTACAACCCGGGGCGTGTAGGGCTGAAAAAGT(N) xGTTGGGGAAGGGGAGATTAGGT TCTGCGTTGGTCTTTTTCCCTATTGATATCTCCAGATCATTTCTCTACAAT(N) xCTCTGCAATCTTTTGTAAAC TACTTCCTTAGAAGCTCATGCCATTCAGTTAAATCTCTTATGTTTTATCTTGCCTTTGTCACTTATCTATATCCT GGTCACCCTT CAATA CACATTGAGGATGAATTAC C CACTGGAAGACATAT CCAAGAACGTAGAAACAGTCTCTTG AGTT CCAGATAAG CAAAAGT CTAATCCTA CAGAGTAACAGTAACAT CATAAAGAGAAAATGTATTTTTGTTCATG TAGCGCCTTTCTATAAAAAGTTCCTGATTCAGAAATGCACTTTTCTCATTTGAGAAATTCACCTGGAAATACCCA GGTATTGC CC TAAATGGC CACAGTTTTT CTATT C CAAAGTATCTGC CAGATTTAGTGTTGTAACAG GGTATAAG G CATCAGAAACTTTC CATAAAAAGACTAGAGACTG GCTAAAAACAAATT TT CAGGAC TATAGCGGAAAATT TTTAG TTAATCATCTCATCTTTTTTGATGATGTGGAGCTAGTTGCATGATTCTGGTTCTGAATGGAGTCCCTAATATGGG CC CTG GCCTCAGGC CCAGATAAAC CTCT CTTCTGAGAG CCTCAGAGGTTC CTTGTGTTGT TCAAATGACAG CTGG AAGTGAAGGTTATTTTTAATTTCACTGT CACCAC CAGTAGGCTCAT CT CTACCTAC CACTGAAAGAAAGAAAATG AAAAGGTAATAAATGGTTAGGAAAAGAGACTTTCCTTTTTTTCTTCTTTTA(N) xACAGGGTCTTGCTATGTTTC CCAGGCTCATCTCCTGGCCACAAGTGAGTCTCCCTACGTGCTGAGTTACAAGCATGAGCCACCTTAGCCAAGACA TTTTGTATATTAGTTCAATATCTCAGTAACTAAATGATGG TTTT TC CTATAGACAGGAGATAAA TTGTGT CTGAG GGTCAAGTGT CATATGAGTGAGAC TGAGAT GGTCTTGAAGGCAGAAATAATGGAATATATGATGTGACTATTTTG CAGACACAAGAGG CATACTTCCAGTGTT TAAGTTTTAGAAGTTTAC CTTTATTTGTTCAACCCTTGAGGGACTGA GGACCTGTATTTCGTGAAGTTAAGAACAGCCATTGCAGTTCAAGAAGAAAAGCCCATGCCTTCAAGACCTGTAGG AATT TTGTAACATT CTGAGTGCTAAATTGAACTTGAAAAAATGAACTG CAAGCT CTATCACACAGC CACGTCTAA ATTAATGTAGGAGAG ATG GGAGAGACTC TAAAAT TGAAAGAGACAAGGAAATGATTAATG GTAGAG GAAAGAATA AATGAAATCAGATAAAAGAGACCTTCACACACTTAAGAAAATTATCTTCATCACAAACAATCAATGCCTTGGGGA TCTATGGAACAGCATGCTTCAATGGGCTGTATTTCAAATTCTCTGAGACTTTATGCATTAC(N) xATATTCTTAT TCCAGGGTGAAAAAGAAATTCATATCTTTAAAATTAACAAAAAAGCCATAAATTTTCAAATTTGCTGTTGACATG GGAAGACTATCACTTTGAAGGTTGATACTTTTGGTGC(N)xGACAAGGAATGTATCCTGATATATACTGATTTGG GCTG( N) xGGTTGGCAAACAATTAAAAGATTGACACCCCATATCTGATCACAATTTTTCAGAGGAGTTCATTTTA CAATTTGTGCAATCTCAACATTGGTCGAGAAGATAAGGAAAGACATGGGAAACAAGGCTAATTTTGCTCTGCTCA AAGCTGATTAGAATGATGAGTTTAGCTGCCTT TAT CAGAACTGT CACTCTGATAAG CATGATAT TGAAGGTTTTG TGACAGTTATGATAATAGGT GGAAGGTATTGC CTGACTCACT TTACTTCAA CAAAG TAGAATATATGTATATATG TAAATAAAATATTAGAATAAAAG GGGAGAAGTAC CATGAAAAGCATATAAATAAAT TAGAAACTATT CAG CACAA GGAACAAAAAGACAAGAAACATGTTAACTTTGAATATTCATTATTGTGAATTATTATAAGGGGAGATGAATGAAT AT CCTTC CTGAACATAAGGTAACT CCAACTGAAG CATAAGAAGTTAAAAGTAGTAAAGGAAAATGCTTAC CTAAA GAGGGTTACTCTGCTCCATAGAATTCCCTTTGAAGTTTTAGGATCAATTTTACTAATGAAGAATGAGTTAGGAAT AAGC CTGTCTGAAGACAGñGTTGGAAGCAGAGACTT CAAGAG CC CT CCCAGC CC TCATTT CAGGGTTGTATTTCA TCA G C C( N ) XGTTTAGGCAGGAGTTGATAAGGGCTTAAACTAAGCTGTAAAGGAAGAGTCAATGCAAGAAATATT AATGACTTAAAATTGGGAAGAT TGTTAATTGTTATATTGAAG CTTCAAGGAAAAGAAGGAAA CT GGAATGGT CC C AGGCTTCTGATTTGGGTAACCTTGTAACTTGCCTT(N)xGAAGTTGTCCAGCTATCTATTGTAAACTTAGGGGCA TACTGTGTGTAGAAATGTTTGCTC CTCTATCCTTTGGGAA CAGGAATACATT CATAACATGCTA CATGTAAAAAG GACTCCCCAGGGTGGCCAATCTTATGGTAGAGGTTGTATGGTAATATATTTCTCCTTCATAAAAGTGGGAAAAT( N Jx TGGGACAACTGTCTCCTGTGGTTAAGGTCAGTGGGCTGTATGTGTCTGAAACTTCGTCAGGACTGGAAAGAG ATTTTTCAAGTCATCAGTGCGTAGGTGATCTTTAGAGTCACTGCAATATATGAGATATCCCAGGATGTAGGGTGA
g a a g a t c a g c t c t t c a a g g t t a g a a g g a t g g a c c c a t a t t g a a g g g g t t g a g g a a a a g a a g a t a a a g c a t t a a g a AAACGAAAATAGAAAAACCAAG( N ) xACAGAAAAGAGTGGTGCAGTGGCTACTGCGAGTGCCTGCCTAGCATCTT ATCTCCACTTTCTTCTTAAAGAAGCTCAGTCTCTTGCCCACAATGACCGGTGATTTCATCCATGGACCTACACAA
g c c a g a c c a t a a a g t c c t t c c t g t g a t t t c a t a t t t t a a g c c a c a g a g c a a g g t g t t t c t t t t c t c t t c t g t c c t t a t t g a a a c t a t g t t a t t t c c a a a g t c c c t t t g g t g t g c t c c c t g c c a c a a g g c a a a g g t t c a g c t g a g t g c a t t t a g a g a t g c c a t g a g t t g a a a a t g a c c a c c t g t c c c t t g c a g c t c a g g a a g a g g c t c t t a a g g g a a g t t g g t g g g a g c a c t a a t g c c t g a a g a g g a t t t c t t g g a c t c c t a a g t c a t c t t t a c t t c g t a a a g t t a t c t c a g a a t t t t g t c a g c t a g t g c t t c t c t g t g a a a c t t c c a a a g a c c a g g a a a c c g g c a c c a a a a g c a t c c c t t c c a c a t t g t g t c t t g c t t c t g a c t t c t g a g c t a a a c g a g c c a t g a a c c t g t g c c a a a t a t g c a c a t a c a t a g t c c g t g a g c t g t g t g g g g g c t t g a t c a g a a g c a a g c a c c c c a a t a c a c t a t a t t g t a c a a g t g t g t t a c a g t t t c a t c t g c a g t a g a a ( N ) xA AGTGGAATATCTTTCTGTTGCAAATGCAAATGTTCTAATACAGAAACTAAAACCAGCTGTGAAGTGATAAACCTA AAATGTGGAATTGGTTGAATCAAGATAAGGAAGGTTGAAAATCTGTATTCCTTGTTATATGAAAGTGAAGCAGCT GCTACCTAGT GTATTTTATG CC TCAGGCAATGTC CC CAC CñC CATAATTACAAGAACATTATGGACTTGGT CAAA AGGATATGGGAGTGAGAACTGAGACAGGCATCTGCCCTGGAGCCCTCCCAAACACCTCTGCTACTGCAGTCCCTG CC CAATAAGG CT CATAATCC CACC CCATAAT CAG CTTTGATG CTCCCTGGTGAT GGAAAT CAGC T C CCGATTGTT CCATTTTTCTAAGA GGTGGAGACAAG GATA GATGACGTTAAT CTAAAGCC CG GGG GAATCAGATGATTCCTG CAA TTGTTGCTAA GGGACTGAAACCACAA G
>Hs2_2084205 71 -208427991
AG CAGAGATACT CTTACAATAT CAAAATTC CTAT TTACTTAGTATTAGAGTATATT TAGAAATGATAGTATTAAT T CAATAACAGTAAAATGTAT TATTAAACAATAATACAGTATTTTTAGTGAAATGAT CCAGAAATAATTACAT TTT AAACTTATGGGTAAATGGTG CT CT CAGATAAGGGAAGAAAAC CTTGGCTATGTATGACAACTGCTTTTTTTATTT TT TTAATATGACACAGATTTAT TTTTTAACAGTGGGAATG CAGATAATATGAAAGC CTAAGTTTATATAACTAAA CTAAAAATTAGGATAAGCAAATAAAT TGACATGTAAGGAAAT TT TAAGGATGATAG CAGTTTTTGTTTCCTG TAT ATTTTAAATATAATTTCTGATT CT TT CATTTCAT CTTTCT CTTTAACAT C CTAT CTTTAAGAATATTTTCTCTCA TT CTTATGTTTTTGTTTGCATG CATT CTCCTCAACTATTT TATTAACAGCTGTTTACATAGATAG GAATGAGAAA GGTGGTTAACTTAAAAAACAAATAGCAGATACAGACCTGCTAGGTAATAGTGTGATACCGTCTCTTTCAGATGAA GGTT CCCAGGAGAATATAAT CCTAG CAGAAGCTCTGATflTATGTAG CTTCAGTTTCTCCAAGAGTTTAAAAGTT C CTAGAGAAGTCACCTAGCAAGAATGAATCATGGGAAATAGAGGACTCACCACACTTGGGTGTCTGGCTGAGGTAG TGTGAGCCAGATACTGGATTTCCAGCAAGAATATAGTCAGAAGGAGTGAGCTGTTCTGTGTTTTCTCCTAAGACC AC CAGGGTTTAC CT CCAGCCTT CAGGAATAAGACAAAGCCAAATA CATCTAATGTACTCATACGAG GCAGGCTG C AGGATTGAAGGG GAGAGAGC CAGGACTTCCAT CAATATAGGG TCAA CAGATAAGAATCTG GG CT CAT CTGTATAT TT TG CCCTTTAGAAAGAGGAAGACACAATG CTAGGCCGGGCGCTGTGGCTCACG CCTATAAT CCAAACACTTTAG GAGGCTG(N)xGATGCAATGCCAGCAGGCTTCTTTATGATGATAGCCCTTAACACATCTTATCTCTGGAGTCACA CTATTTTGTC CAGT CTGGTTGTCC CCAGAGTAGG CTGTATTGTGTCATCCTG GGAATTCCTTTCTC CCTACTTC C ATTTTCTCACAGTTTGTGTTTTTGATATGGTAAAG CATGTTTTT CAGTGGTTTTATGAGAAA GAAAGGTTG GCTG AGTAAAGATAGCATGTTATAAAAATTGAATGTATGGGCCGGGTGTGATGG CT CACA CCTATAAT CCCCACACTTT G G G AG (N )xTTG AATG TACATG TG CCTTAACAT(N )x AGGAGCCATTATATAAAATTATTAGTGTAGAAAAGGGA TTATTCA(M)xAATAAGGTTATGGTTCTCAAACTTCAGTGTACGGGAAAGTATGATTGGGTTCCTGTTAGAGTTG A C T TTA C A G A G TA (N )x GGACCACATTTTGAGAATCAAATAAAACTAAGCAAGATTTATAGTGGCCAGAGAGAAA GTGATTAATAAGTAAGGACT TTATTACTAT CCAGAGATGGAAGC TTCTGGTTTTAC CCTATATG CCAAAGACACT GTCTTTATAGGGATCTTCCCCCCAGGCTTCACAGTCAC(N ) xTGTGTGCTGTTTCTCTTTGCTTTTGCCACTACT CCACTCTGTG CT TGAAATTCTGTGACTCTT CC CTGTAAGTAATAATAACAAGTCTAACTT TTTT CT TTTGTTTTA AATT CACATT CTTCCCTGATAGCCTC CCCTTCTCTTAGGTTGGGTAGAGTAATT CTCAGTGTGTCCATTTTTCTT TTAG CAAACCTCATTACTAAA CTGAATTGTATTGTGT CT C TTTT CCCCATTC CTT CGCATACATAC CCC CAATAA T T T A T T ( N ) x TTTAGCAAATAAAATTCCTATCCTTGAATGTGTTCAAGCATAGGAGCAATAAATAACATCTGATA GACATGTATGATTTAAGGAC C CTTTTAACCTGGGTATTGACT CATTAAAACT GC CTTAGTTCTAGACTTAGTAGG T CAGGTTTGCTT ATGCTTGAGAGCAGTTAG ATTT TAGAA CGTGACTTATT TAA CATGATATT CTTTAGAATTTTA TGTTTCTT CAAGGATAGTA(N ) xATCTCATGT TCAGAGAGTCTGTATTACATATTTGGTATAfiCATGGGAATAGT TTTTAATTTTTTGTCACTTGGGAAGAGCTGAAAAGGATAGGGGTATAGTTTTTGGTGTTTTTTATAGTGTTTTTT CTTAAAGAATAT TAAATTTAGTCATTAC TTTCATAT CAGTGAAATACATAT CTAG CAAAAAG GT CACGTTTT TGC TTCTAAAAAGTTATTT CATATTGAACATGAGTACTG C CAAAGAGTGTCTGGGAT GATAATTCTT TTCTGTGTTCA ACACCATAGGTATCTATGCCAGCAGCTCATGCAACATCATCTGCTCCCACCGTAACTCTAGTACAGCTGCCCAAT GGGCAGACAGTT CAAGTC CATGGAGT CATT CAGGCGGCCCAGCCAT CAGTTATT CAGTCT CCACAAGT C CAAACA GTTCAGGTATGTGTATAAAAAGTT CTGCAT CTATTTTAATAACTTT TGTTTATAGCCATATCTCTCTC CTTT CAC TAGTTATGAAAATAAG GACACAT C TAGT TCAGTT TATTTTATTCA CAGTATAGGAATATG CTGCAT CTGTATAGT CACCTTATGTATAGGAACATATATACAGAAGAACAAACATAAAAGAAAATGACTAATTTC TTAAGCTG CAGAGGA AAAAGAATTGATTGGGGAAAATAATGAAGTACATGAGATAAAGTGGAAGTGTTTGGGAGAAACTGTAGGACTGAA GTACCT CAGT CGAGGAAAATTGAGATGCAAAAATAGTAACTAAT TTTCTAAAT C TGCAAT TGGAGAAAAATATAC CCTTGATCAAAAATTAATAACCTTAC CTAAGA TATTATTTGACCTTAGAGATTACATAAAGCAT TTGTAAAATCA GCTGTG CTTAGGGTTTTCAGTTACAGTGTGTAACTTTGAAATAATATATT CAGAGGTTñCAT CTTAAGGGTCAGG AATTATAC TATG CATGTTATCTAATT CTTTGGAGAGATCTTT CCTTTGGGTTTGATCCACAGGTAATAAAGG CAC TGTAGACC TTTGAAG GTGACTACT CTGTTTAACTGTAGGGAACCGAGAGCAGTATAGATCATTG CAATTGAG CTT ATTGAAAAAAAAGACT TGAACGTT CATT CAAGTAGG CCATAACC CCAG CCTC CCATTTCATATT TTTTGCTTCAC AGTCTT C C TGTAAGG ACTTAAAAAGACTTTTC TC CGGAACACAG GTGAGT CCTAGGTCCTAGAT TTGGAGAG AAC TATTATTAGAAAGGAAG GTGAGAAATTATT CTTTATGTCTTTGTCC TTTT C C { N ) xTTGTTACCTCCCCAGCCCT TAGTCT GTTG CT TAATTCTGTTTAA CATTTCTCCCCTCTTCCT(N)xTTAATATTTGTAGAGGGGAGAAGTGCTG T CTTTGT C CTGGAAGCATT CTATTAATGTC CT CAGTAGAATTTGTCAG CCATTTTATACTATTGAGTCAT CGAAC TTCTATTAAC CACCTCTACTTCATTCTGTCTT TGTG CTTTTGGTTCACTTTGACACTTAAGTGGGAGC GT GGGGG AGGAG AAG TAATGTTT TTGTGTG AAAT CAT CAGTTG GAT CAATTTC CATAAGACACTGGGAATGGCATTT GTTAA CAAGTAAAAGATAGTGTTGTGCATGTAAAGATCTAAGAACTTGATATTTCTATGAAATCACAATGACTGAGCAAT AGTCCTTTGCCTTAGTTTTTATTCCATTGAGTGCTACCTTACCATTGTCCATACTGTGCCATTACTAATTCCCTA ACCATATG CT CCTCTTAGTAATAGTGAGAAAACT TGATTTGCTAGTGAGGAGAC CTAGATTC CAGACC CñGTTTG G A C A T( N ) xCTTTTTCACTTATAGAAGATCTTTCTATAAAGAACACAAAAATTACTTAAGTTACAGCCACAAAAT TTGCTAAGATTACTGTTT TTTATTGTAATATGTGTTATTGTT GATTGTAT T CTTGATTTTAATC TT TATT CTGGG G CAACATAATTTGGACATGAAGATTTTATGGC CATTAAG TATAGAAATAT TCTCTTTTTT TTATTTGCTTG TATT CATCTTTAAGGAACACATTTGTCCATTAAGGTCCATTAAGGTGAGCTGTACTTCTAATTTAAGAGTGTGTATTTA TAAAAGAAAA GTTGGACT TATTGAAC CCTGGATT CTA CAAATAGAGATACAAATATAATATTTTTTTCTTTAAAA GGCCTAAACTATAGAATATCTATTTTAATA GTTATCTTAGCCAGTTAATG GT GATTAGTGTTAACAACTAG GAAA TAGTTT TTAT CAAAAGAGATGAT CAAACAAGGAGAGGATCTT CTAGATGGGAGACCTCAT TAATATTC CAGAATA AAGTTT CCAAAATATACAAGGTACAGTT CT CTTGTTGTGAGAATAGATTATTA CTTTAAATTAT C (N ) xGAGGGT ATACTTGGGATATCAAGGAATAACAGAAAGGTATGGCTGGAATAGAGTAAGTGAGGAGGGAAGTGATTGGAGATG CAATTTA
>H s4 _386349 46 - 38656 58 6
GTTGTT GTTGTTGTTAATG CACTTGACCAGCTTTT CACATATTTATTGACTAAATGTATTTTTGTCTATGAATTG CCTTTTTATAAACT CTGCTGT CC CGAGATT CT CAAT CTGTGGTC CCAAGG C C C CTGA CATGTAATATAAAATGTT CTATGCTTATTGGTTGTG CATTAT TCTCAG GATAGAGGACAAGGAACAATAAGAACCACTGATTTTTGACTT CCT GCTGGCAAGGATAA( N ) xTAAAGAACCACTGATTTTAACGGGTATACGCTGAAAAGTATATGCGGAAAGAAAGAA GCAGGC CTAGAAGAAAGGGCGTAAAAGAGGTGAGGCAGGCTT CTATGATT C CAAACCAGAGTGTGCACACACACA CACCCTTTTATTATCTCTCCCTCAATTACACTAGGACTTCCCAATAGTATTTTCTCATTTTCCTTCCCAGAAGGC CCACCAGCCAGAGGCAGGCTCCTAGATCTACTGCAGATTGGGCTTATTTCTTTACCAATATCTTTGACATATCAC AGAGAATAAGGACAAACAAGTCAAGATG CTGCTT TATTTACATT TTGGTT TATAGACACATT CAAAACTT TATAC GGACAAGCTGTCAC CTA TTTTTTTTT GGAATCAGGACAGCCATTAA CTTCAATT CTAACT CCTAAATATTT CTCT AATCTG CC CATT TCTCTT CATTT C CA CT CCAACTAT CGCATCTTG CTTTGAC CATGG CAG CACT CT CCTACCTGG CTTCCC CATATC CTCT CTGCCATACT CT CCAATC CACTGTCCATGCTT TAAC CAAAACTATC CCTTCACTCACCA G CTCTCAACCTGTC CATGG CTGCTCTTTGCTCTCTG GCTAAAAGGC CAAC CT CTTGGTGCATGTGT CCAACTGTG CGGTCTGCCTGACCTACCTCCTCTATGATGCTGAGCCCCTCTTCCCTCACCCTGGCTCCCACCACACCTGGGACA CTCAGC CATTCTCTCTCC CATACCATGACCTT TGTCATTCTATT TCTT CTGC CTGAATGCTGTCTCTC CCACTCT TTTCTCTCACTGAGTT GACTTCTACT CAGCTTAAAAATAACTTC CCTGACTTCC CAGGCCAAGT CATGTG CTGCT ATTTCATGCTTTCATAACAAAAGCATAAAATACTCCACTTGTTTTCTTTAGCAGCTATCCCAAATTTTCAATAAT ATAGTT CATTAACATCTG CTTCCACT CTGG CAGGAGACGTCCATGAAAATAG GATGGGAT CTGTTTTG CT CTTCT TGAAAT CC CAGAGCAGGG CTAACC CAGAGCAGATGACCAATAAGTACTGT TGAAG CAGGAAGAGGAAAAGGATGC TTTTAGAAATGATC CTTTGTAGGCCTTTATTTTTCTTTACACTTTCCTTC CAAC CTGAGAGGATAGTAACTATGT CCTGGATATG GAAATTTG CTAACC CTTT CAAGGCATTCCCG G C (N ) xTTGGGTGAAAGAAGAAAGAAGAGGGAAC TCA CAC C CATTCGTTGTATGTTAAAAAACCAGGTTGGGGA( N ) xCCAACTAGGTTGGGGAGATAGACATAGAGGG GGTGT CATGAAGTGACATAAG CT CATGAAGTTTAAACTAAGTTTGAGAAAAACAA CAGAAATAGATACACATA CA
t a c t a a c t c t a g t a g t a a a a t a t c g t g t a c a c c g g c t t g t g a g c t g c c t t g a g c a g c g t g c t g g c c g g t t t g a g t AGCTCCTCTCACTCCCCGAGTCCATCCACATCTTGCTCAAGAACTTCAATTTCAATGTTAACTCTATTCCTCAGA ACCCATTCCTTCTCCTTGGCATGGCTGGTGGATCTTTCTTCCATGTTCCTAAGGCCTTCTGTTCACCTTTAAGGT GTCACT CATCTC CCTGTCCTA( N) xGAAAGAATGGGTAGAGAATGAATGAAGGGAGTTAGAATGAGTGTAGCTCT GGCAATTGGAAT GCACCTTGTTCTGAGCAG CTGAATTCT C CAGAAATT CTCCGTACGTGTGTGTGCTGATCTTCA GCACTTCTGCAT CC AAGAATACTGGTAGTCAGA CAAGTTATT CT CTAGAGGTAAT CAGAAATGCTTG GTTTAGAC TCATTAGTGG( N ) xAAGGTCAAGTGATTTGGGGACATGTGGCCCAATTTGCCCAGAACACCCTGCTTTATTCTTA TTGG CC CAG CATGGTTGCTACCTTCAAAATAATAACTTTC CCAGTGTGTACTGTAAATTGGGTGAT C A C G (N ) x T TATGTAAGATAAGGTGAATATAGAAAACAAAAATTAAAAAATTAAAAGTTTGATAAACACTTTTCTTGAGAGACT CAAACTTCTGAGACTCCGGGGAAGCAGCATCGAGGAGAAAAAGCCAGGGAAAAGCAGAAGAAAGAAAAGAAAATT GAAGTGTGGCTGGATAGCTT( N ) xACACACTGAAAAGTAAAAGCATCTTGTTCTACACATAGCAGGGGAAAGGAT T T TT TT C CACTT CTTGGACACCACAATATTTGTAAC CACATCAAAATT GAGGAGGGGATAATGGAACATAT CTC C CAGT GT CCTGTTAT GACTTTTGTAATTTAA GAAA CATACTTTTATTTTGGAAATATTCTG CATTTGAAAGTAGTT AGCATTTTGATTCAGAGATCAGTTCTTAATTATGGACCAAAAAAGTGGGTAAGTGTTTCAGATTGATGAAATAAG CCTTGC TAGG CAAAAGCAAAGAAAATGAGTAGTT CTGTGCAAAGTGT(N)xACCACAGTCAAAATAGAAATCAAG ACGATT GGAG CCAAAGGTGCTCTCTT CTTACTTT TATTTGTT CTT CC CAT CATCATACGC CCCCCACCTT CGAT G GTCTTC CTCTAG CTG C T C A (N ) xACTCCACTGAATGCTTCAGGGCTGAATGAGGACTCCCCTTTGGCTGGTCAGA CAAAAAACACTTTC CCAT TT CATGTGTG CT CTGGGT TTAT TCAG CTTACAAC TAAAAAGT CTTTTCTTGC CCAGA CTTGTAGAGCAT CATTCTA CATACTTGTAT CCTAACACTCAG CAAAAATT CAAGAGATTC CTAT GCATATTT CTG GAGCTCTTTTTC TG CATAGCTT CCTCCTTC TCAGAACT CTGC CC CAGTAC CTCTCACCAC CTCAGC CT CC C CAAA CTCTGATCTT CATC T C A T ( N ) xAATTACAATAATTTATAATGGGCAGTTAGGTCAGGCTATTATTATTCCATCAT GACTATAAGGTGATGTC C CCTCTACT TATCATTT GATAGG TAGAACTTA C TATGTAGTATACT CTGGGTCTT CAG A CTTTC CAAAGATTTGGTAAATTTGACTGACTTGTATGTAGCTAGAGAA CAAAGTGTAGAACTTTT CCTGCGGTT TGC CTAAATAGT TGATATGCTATATAAT CATTGACCACAATTTCT CATGTTCAGGTTACGA CAGAGAAGACAAAG AGAACTTGATAGAAGAAG CT GAAAGAAG GGTAGCTTGGAG CTATGCATTTCTGAATCTATTTGTACATTTATATC AGAGGTAGAACAAG CATATTAG CAGG CCTAGAAAAGAT G A T IN ) xATGATGTACCTCTTTTTAAAGCATGTGAAA AGTT TAGGAAG CAATTAT CATTAGCTC CAGATGCTATATTAT GTGAAA CAAATAGAGCTGTATA CCTC CAGGTAT TTATAGTGAAAAAAAAGAA CTGATCCAAAATACAGATAAATGATAGAGATAGATAGAT G GATAT CC CC CAAAAAG TTTGTT TTATAT TAGTGG CATTT CTTTATTAAGAGATAGAAACTA CAAT CAG CCGATAAAACCAAAAAGGAAATT AAAAGTAAAAACAAACAAGCAT CTATTTTCTCGCAC CTGATTAGAAAAATATTTATTT CATAAAAATGTGAAGAA CCCATTTGTTGAAAATAGAATGAGTAGT TATACT TTTAG T CAAGGTGGAGGAGATG GAAGAGAACTG GGAAGAGA ATAACAT CAAAGATGGGTGT CAAAAATAGAAGATTCAAGAAT CATATAAAGGGGTAGAT(K ) xTGGATGGATGGA TGGATGGATGGACA GACAGTAGGAACAATATAATAGAAGAGAATAAACGTGATACC TGAAAGTGAG CATTTGAT C TTGTAGAACA GAGAAGAAAT CTAATG GAGATAGA CAGAATTGTCATGAAGGGTTTTGTAGGATGGTAATTTGTAA CTAGTT TAAGTG CAAATAAGTTTTCC CC CAGTTAATT CATGATT GAAA CTTCATCT CAAAGGTCAACTTT CTGAA AGAGAC CTTGAG CATCAGTCTAGATTAATT CACCTGTTTTATTTGTCTTCTGGTAC C A T (N ) xATGTAAGTAAGT G CñTGAATTCAAGAATGAAGGAGGGTGGTACTTG CGGAAATG CAGAG CTGTGAGACCTACCCAGCCCCCATGAGG ACTGAGCCTATTCATCCAAGGAGTGGCTT(N)xGTGCTTAGAACTTCTCCAGTGTTTTCTCCCAAGAACCCTTCT GGAAAAAGCGGG CATTCACATCTGTGTTGAACT CAAG GTACACTAAAAA CAAAAG CAACTTCTTTGAAACAAATT TTAG CTAAAAAGTT CGTG CAAATGCTTCATTTTATACCATAATATATATCTACATTGT CT CAAT GT TGAAAT TCT GTTTTAAATTATTCATCAGTTAATTTAGATTTTTATTTCATAAAATAATGGTATAAAATGGATGTTTATAGAACA CGCACCATATTATC CTTTGCTTTTCCATATTTTT GAGAAACATGGATTGATGTAAAATGACCTG CTACATGGTGG GCAGGACCTCCATAAGTGTTTGCTGAATCCGAATCTGCCCACCAGAGGAGCTGCAGGGTTGAATCAAAAGCAAAG CTTTTGATTGAT GCAAAGAGGATTTCTC CCA CAGGAGGAGTG CC CAG G CAG G CTC C TAAT CTG GGGATAAAT CAA AAGGTACAGGT CAAAAATGACCTATAAACTTATTTGAAT CTAGATAAT CAAAAGGAAATGATCAGTAC TATATAA AATGTAGGCTATATGTAAAATGATTATATATTTACC( N ) xCTGGACTGTCTACAGAGAAAAAAAACATACTTAAT TGTAAAGATTACTC C T ( N ) xGCCTCCTATCTTTGGTTTTATTCTTAGAACCTAATGTAAAACAAACAAACCAACC TTCC CCACCC CCAG CTCTATTT TAACAT CCAGACTTTGGT CT GC TCTGAATCTGAATT CACCAT GAATTCAG CTA CACATT TAG C TGTAGCT CAATTT CTATT TC CAAG GCTT TGAC CCAATGAAAG CATTAAACATA CTATCACAAAAG CAAACC T CCCTATGAACAAACGTGGT C T G C (N ) xACCACCCAACTTCTAGGGTTTATTCACTGCTTTATCACCAA TT T C C A C T ( N ) xCCTATTCCATAAATGTCCAATCCTGAGCCTGGCACAGATATTTAGCAAATATTTGAAGACTGG ATAAAT GATT CTATTTCCAAATACCAAATTAGAA CACAGT CT GAAAAAAAATGTATTACCTG( N ) xGTCACACTT AAAT CACCTTAATTGGAAAGGG CAAAATGT CATGAAAT CCATGCTGGT CTTG GAAAAT CCAATTTTGGTAGAGTA CATC CTTAATTGTTTAA CAT CCAAACTC CTTGG C CTAAATTTAAG CACTG CATAG C TCTCTCTCTTGC CAGT COA ACCTCATTTCCC CAAGTC CC CCACAGGATC CATCTG CTGAAACCACACTGGT CTCTGCAT CACT TC CAGAAATAC CATC CT GCTAGG CT CAG CTCAAACCC CTCTTCTCACATGAAGCCTTCTTCATCCTCTGAAGCCAAAGTAATAAAG TAATAC CACTTTCTTTAGGGACTTTTTT TTTTTTAATGTATCACTTAC TTCCTCATACCATTAGTTATATTTGTG TGfiG TTTATCTG GTTTCTAT TCTAGGTTAAAAA CTC CTTGAGGACAG GAACT TTTAAAAGACTTAAT T CTAC CCT ATCTAGGTTGGAGT CCAG CAAATATTATAAAGAGAT TTTAAT TAAATCA CTG GAAAATACACAGTCAG CATT TAC ACACAACCCCAGCCTTGAAAGCACAGGGAGCCAGAGAAGGCACTGGCAAATCAAAGTCTGTTAACATTAACAACT TGAG CACAAGTT CTA CT CAAAGTGG GAAATAGAGAAGAACTACAATAACATAGTTCAAAGTGAT GCTGTCAGTTC CAAAAG GTATCGCTTTAC CGTTAACAC CAGTATAATAAATAT CCATCATGCC CTACTT TTGAATAATATCTATT T GATTAATGCTGGGCATTT C CTT C CCAAGGCTTCATACTAT TC CT CTTG CCACTTG (N )xC C C C TC TTG C TG TTTT TTTGATTAAGTAGG CATTTGCCTCAGATAACCAGATCAGCAGCAGGACCAAGAAATACACTTCAGCTGTTTTGTG GGCTTT GGCAAACAGTTAAT GTGTGTTTGTG CAATCATAGGTAGAAGGAATAC CATAAAGAATTAATTACTTTTT TAAAAAATAT CTGATTCT TCTGATTCAC TTGATATGAAACATGAGC CAATGT CAAACAAATTCTTTTCTCTT( N ) xTTATTTTCTCTAATTTAATGGGAAAAAGCAGTGTAATATTCCTCAAGTGGCTTTCTAGCTGCTTGATCCCATCT TCATAT GACCTTTAAAAATTTT GTTGCTATATTTCTTTT CATATAATTGCAT CCAAGCAATATGATAAATAAAGG TAGTTAGGATGAGGCAGGAGAGTGGTTCCTAACCTCACACATAAGCACGAATGTCAGTCCTTGGCTATTTCAGTA AG CATT CTTCATTTAACG CCCAGTCTAATATT TTTATG TGACATAACAA(N) xATAGGACATCCATTTTTATGTG ATATAAAAGGACGTTTTATATGAAACTC{ N) xCCTTGAGAAATGGCTTTAAATTATACTATTTTGAGTAAAGTTG ATACTT TCAAGAATATAACAGAGGTACTAAAG CAAGTGATAAGC CATAAGGTATAAGATA TAAAAATACAAATCA ATACTAGATAAATAATAGTATAAGCATCCTATTTATGGAGAAATAAATTCAAATTTTAAAAGTCATGTATATGT( N ) xG AATAAG AAAAGTTATTTTTATGGC CAAG GAAATT AGTTGT TAGAAAAATATAGAACTGTCTGTT AAGT AT A AAAATTGTAACAGAAACTATAAAAAAGACAAAGCCCTTGCTCTAAGTAGCTTATAAACAACTACAAGAGTTCAAG CAAACTAGAAATGTATTTGGGTTTGTG(N)xGCTAACTAGAAATGTAAAGTGCACAGAGTGGTAGTGCTGGTAAT AATTCTAGAGTATAAAAACAATTTAAAATTTTTTGGAGAATTTGTTTTTCAGATTTGAAAAGAAAAGGTGAATGA TACACATATCTGTTTAAAACAATGATACAGGAAAGGTT TT CTTTAAAA CAGG CTAAAAAT TTTTGCCTTCCTTTC CTAATTTCTAAAGATGATGGAATAGAAAGACCATTATCTG( N } xGAGAGAGAGAGACAGAAATTAAGACTGTTAT CTGGAATTGAAATAATAACATAATCAACAGTGTT( N } xTCAGACACAGCAATTGCTGTGTCTGAATTATACAATG CAT CTTTGTC AGTC ATCTGTAC CAAATG GT CT AG AT TAACTGGGGTTTTTTT TTTTTTTTGGAGGGGG G GGCGTT GTTGTTTGTTTGTTTGTTTGGATCTAAG GGTACTTCTG CAGGTG CAGAAGAAAGTTAAG G GAAATACATTAATAA ATGTGACTAATTTTGTCATTAT C CAAAC TACACTA CTCAT TTGATAAC TAC CACCTAATGAAAATTGAAAAG TC C TGAGGACTG GACTC CAGACAGGTACTTT CAAATG CATCATAACTGGGAA CAC CTTGAGATATGT G GTT CGAGAAA GTAAGCTGATATCCTAC CAAGAGTGT CT CT CCTGGGAGGTGGTGACAAATGCTCCT CAAGACTTCTCCTGTTTCC T C CTAAATCTTGGGAACT C CTTTAAGACGC CCTCAT CTT CATAGTACCAAATGAAAGAAAATCAATTG CATT TGA AACAATTTGCCACATAAGAAAATAAACAAACATGGAAAAGGAGACACATTCGTGTGAATCTCAGGGGAAATCTTA TTTTTCAGATATTGATGAAATAGCAAAAGAAACAAGATGAGTTTAGAATACCATCATCCATTCCTGCTTCTTTTG AATCCATTTCCCTTATCTTTAGTGCAGAAAAGGAAACTGATTAGATACATTACTGTGTAGCAAAAGTTGTACTCA AATTTTGAAAGTGTTTCCCTCCTTTTTAAAAAACTAAATGTTTCTCTTTTTTAGAAAGTGAATTTTCTCTATTGA GAAG AT ACCATG AC CACT AGGC AGAATT AT AT AT TT AGTTGTGCTAGAGC A CTG AT GACT TCTTTTATGATG ATT TT TTTAAATG TTTAAATATTCCT CATAAAC CT GCAAAACTTAAGTG CTAGA CTCTATAACGTTT CATAGTTAATA GACCCATTTG CAACAGCAATGAGAAAAATTTAAATTTAAGTC CAGA CACGT CACATAAAATTATTGACTGGCTG{ N ) xCCAACAATTATTGACGGGCTAAAATATTCTGGAAAATCTAATAAACTAAACATATTGATGTGAGCATGAACA CAATATATATTGTAGATTTCTTGTTC CTAT CACAAG CAAGTAACTTTAAGGG CAAAGCACACCAAAAAAAAAAAG CCTTTTACATAATT CAAAGCTGTCCATACTTT CT TTTATC CT CTTC CATTCTTTTTTATTATTTTTTTTTAATTT T T A T T T A (N) xAACAGTATCTTCAAGACAAGGGGAGGAAGGGAAGACACACTGCAATATACATCTGGACATATTT T AAAG ATATT C ACG ATAG AAG AAAGC A C AAAT TAAT GAAT AAA C AATTGG AT AG ATGACGTGCCTTGTTTGATGT GAGAACACTTAATAAAAG GTTAAAATTAAT GAGCTAAG GGAAGAAAACATGG CTAAAAGGG CTGGTAG CTCTGTA AGGGTGGGGACAGGGCGGGGAACAGAGTCAGGGAGACAAACAGGATGCTGGGAGGTGAGAAATAAGGGTGGGGAG TAGTG GG CAATCTCAAGAGGAACTAACAAGGC CGTTGCTTATAGTTAC CCAGAAAACCTACGAACAATAACAGAC AAGCAGATGCAGCT CAGAAAGAAACAGCGGAG CAGO TAGAAG CCCCTCCTGC TGAGACAACGCACCAGACAC GTT AATGGTGTTTGGGGGCTG CAGGTCAT CT CT CACATACT CACATC CACAACTG CTGACACC TCTGTTTCCCGTCGA ATCAACAGATCCTGGCAAAAGACCCCCAGAGAGGATGAATGGACCTTGATCCAGGAGTTTTAATATATTCCTATC TCTTCAGAAATAGGAACT TCGGCTCTACTCAT TCTAGTTTGCTGGCT(N)xTTGTGTTTGTTTTTAGCTTTTCTT CTTTGGAAAG GTTTGTCAAGTAAATACTGTTACGTGTCAT TTTGAAGAGGAAATT C CAGTTCCAAGAAC CAG CAC AGGTCCAACTATCCAAGCAGTCTTTAAAAAGCCAAAGAGTCG
> H s 4 _ 3889372 9 - 389195 56
TT TAAAAGCC AAGT AGTATGGCTTTT AG CC ACTATATT AT AT GGTCTTTT AAAGTATAAT ATAT T CTT AAAG CTT CGTGTTTATCGTCC CACT TGAATTTTTTTAGT GCA CAACG CATCTACT CCCAAAAAGGAT TTGAGATATCTTGAT AAAAGGCATTTCATTAGCAGGATATTACAATATAAGGAGAAAACAATGTGCTGGTAGGG(N ) xATCTTGAAGGTG GAAGATAAGAAGATAGGGGAAG CCAC AGGT AACAGAGTTTTC AC AT CAT CCACTT C AGGTATTCTAAGGTG C AGT CCATGGCCATGAGGACCTTGCGATAGGC CATGAATTGGT CCACTTCACTGTTACTC CTAAAACCAGGAAGGT CTT CAGGCCTTTTG CCC CACC CCAGTGGACCAT GACCACAGTGTTAAGGGC CAGATATC T A A C T T (N } xTCCTCTGTC CT TGGAAAAATGTTTGAACTTAGTGG CTAT CAAAAG GATCTTGC CACTGGTG GCACTTAGAAGTGACATGCTTT C TAAGCCAAAAACTAAAAAGGAGGTTCTAAAAGATTGTTTTAATT GGAA TAAGT CATGAAAAAGTAAATAATATGG TCACTAAAATTATACAGCTCTTCTCGTGGTTTATATCATAAAACAGTCATGTGTCCTGGATGTTTCCAGTTTTAA CCGCAT CCAAACCACTCTGGTTTATAATGAAT TGGTGGTTCGAGGGTTCTAG TATAGATG GAGGACTTACTC TTT CATTAAACTTTTAAATATAAGTTCCTGCCACAGTAAACACAGAAATCATCTGCTCTGTGGTAGCAGCTATAGGTG CT CTCTGTGCACACTTAAAGGTG CAT CT CC CTGTAACTGCTCTAGC CCTGGTTGGT CAGACCATTTGC TGTTGTT TCTCATTCAGTGTC TGTAC CTCATTTGACTTTGGGCTTTGTTGGAC CCTCC CAAGG TTTTCTGTCCAT GAAC TGA CAGCTCTCCCATTGAATTTACTATCCAG CACCTT CTTAGGTGTC TTGCTATAATCTGACC TTTG CACACATG CAG CTTTGGTTCCAGTGGATGGTGCTCCAAGGCCTCCGGGTTTTGTCACCTGGTTCACATGTAACCAGTAGATGGCGC TGA CTTTACTGCTTAAAAAGTGG G ATG T{ N) xGCAGGGTGTGATTTTTCACAGCAGGTCCCTTTTGACTTGATCC GACGTCATTGTCTTAATCTGCGGCCTGTGCTTGTGTTTGGGGTTGCTCTTAAAGTTTCTCTGTGTCTGAGCCACC ATGCAGATAATAAATACTGCATCTTGTTACTCTCACAACAATTTATAAATTGTGATGGTTCGCATCTGGCTCTGA GTAATCATTGATCCAGAAGAGC CTAGAGAGGTAAATTC CTGTAGGC CAT CATGAGCATTGTGCCAGT CACATG GT CACT GTGC CATGAAAAATAATATC CCGCTTTTTAAGTAGTAATGTCAG CACCAT CTACTGGTTATATTTGAACCA AGT AA CAAAACCTAGAGGCCTT GGAG CACAGT C CA C TGGAAC CAAATCTGCACGTATG CAAAGAT CAGATTACAA AAG GA CAC CTAAGAAGATGCTGGATAGTAAATT CAATGG GCGAG CTGT CAGT CCGTGGATGGAAAAA CTTGCCAC TTTTCTTCTAGGGTGAGTAAACACGTGGTTTCTCTGAGAGTGTATGACATTTGCACATTCCTCATGCAATATCAG AATGAAGTTCAGTATACCCTCT( N ) xACTTTT A CCTGAG GCAGC ATGGCCCTGGGGTTGACACTATGCTTGTACA TAAT GGAAGTTGTT CCTCGCTTGC CTCTCTCAATCC CAAGATAA TGGCTCAACATAGACCACAGAT CAGGAAGG C AC CCATAACTTTAAAGTCTC TGGAGGATAGGGTGGTGCAGCTAGAAGGGGGC CCAGAGTCGAAGATGAGG GCAGT GGCTCTGCCAGCTGCTCAGCTGGCACCTCTGCTTTGAAGAC(N)xCCCGCCTACCTTAATTCTCTTCTCGTCTCT GTTTGAGATCTTTGTCAGTTACACCACCAATTACAGAACCTTAGTATTTTCTATGGTTCTGGCTACATGCTGGAA TT TT GAAAAG CAAGGAAATCTTG G CATTTTTCTCTTGTTTGCATA CAAAATG GCAAAG GG TACATGAG GAAAGT C ATTAAAATTTTGAAATATTGATGCTCATTCTAAAAGTCCCTTTGGAGAAATTTTAGAGCTGTCTTTAGAAAAGCA GGTACTAAAATTGTTCATCACCACTGATCATCACTACCCACCTCCTCCCCCCAAAATGAATGCAATAGAAGTATA ACTGAAAACC CTGC CTAAATTG CT CTTACTGCATATTTTGGAGATTTCTGATGAGGACATG CTGTGACTTTATGT AGTAATAGTAGTTAAAACAAAAGCAATGTATATTTCAAAATTGGACAAGTGTCTTTGTAGACCTGTACAGAACTA AAGCATGTAG CCTAACAGGAGG CTGAAGTCATGGGT TC CCAGGC TGGAAACAAG CACTGGTGGTTGGGAGTTTGA CTGCTGAAAGGTAAGAGGGATCATAAGTTGCCCACACATGGAGCCAGGCTCACAGCCTGAAACAGGTTGGAATGG GG CT CAAAGATCTGTGAACAGAGAATTTAAAGTTAT GTGTGCCTTCCT GATCTGTGGT CTATGTTGAGCCACTAT CT TGGGATTGAGAGAGGCAAGC GAG G CACAATTTCTGCTATGTACAG G CATAGT GTCAAC CCCAGGGCTTTACAG CATTAGCAGT TTATTAT CAT CAGG CAAAGAAACTTATT GGAAAACAGAT CATTCACAGTAGTAGA CAAGCTTTGA AGATTTAT AAATTG CCTGTGGAAAGT AAGG AAAG AAAATAA C C ACATGTTCAATTGCAATTGATGT CTACTCAGA AACCTCATGTATACCCCCAGGGTAATTCCAGCTGCAGACAGGGGTGCAAAACATATAGACATTAACAAAACACAG AG CTGTCAATTGAG CACTAGAAATGGAAAC CATTAC CT CTAACAGTTG TTGAAAAACTGCAGTTAGAATAGTTAG AACTGTTGAGAGTTGG CAGGAGAATAACAG GAGAATAGAAACAATTGAAAAATG GTAATCT C T T G (N ) xATCCTG TGTCTATG CATAGAAGGAGAAAGATATTTGTG CCCACTGTGCTAAGTGGAAAAATACAGAAAATTAAT TGAAATT ATGT CTAC TC TATTTT TATATT TTATCTTTTG CCTCAATACT CT TT CACTCTGAG GTTAAGGGAAAATGAGAAAA AGTGACCTTCCTAATGCTTATACAAAATCAAACATTTTTCAGCCAACAATTATTTTGGAGAGGACTTAACTCTTG CATTAGGCTCAACTACAGCTACACTG TCATCACATTCT TAGAAGATTT GATCACCTGTCACCCATGCCTTTGGAC AAG TTTG T GAAATGAAGTAAGCTT GGTTATTATTTC CAG GCAGACACAAGAATAGAAAACATTTTAGAAAAGAGG GTTTTTG(N)xAAGAAAGAAAAAAGAAAAGAGGGTTTTGATAGTAGTTCCAAATCCCACAGGATCTTATAGACCT GACTTGCTTC CAAAATGTATTACTAGGCTAAGAGTGATAGAC CAGTAG TCAAAATGAAAGTCAACAATGCTAAAA CATGTATAAGTATACAAAAGGAAGAGCTCTATAGTAGAATACAGTAATTTATTTCTAGATACCACCAAAGGCTTA AATGTTTATTTG AAAT TCTAGT TTTGACAGTT TT ATGAGTTT AAA CAT CTTATG CAAG AGTACCTG AGGT TGGGG GTTGGTGTGTACGGAATGTACCTGATATGTATATTTACTTTTTCCAAAATTATCATTGTTCCCCTTACTCATCCT CTCCCAAGGTACATAGTTTTTTCCTTCTGCACATTT TAATGAAACT CTTCTAGCAGCCCG GTCAGAAATGACAGT GT TTATTACAGAGAATGGCAAACT TGTCAAAATGATACTACGATGAGAAAT CAT CACCGTATAAAT CACTGTCGA TAGCGCATGGATTCATTTTTGAAGGTGACATTGCTGATGATATGAAATTTCTCAAGGAACATTGAGACAAAGAGA ATAGGAATAATGCTTAGAGTTTGTCCATCGTCCTTTAGATTCTATGAATAAAACACAGGGGTTAATGAGTTTCTG TTGGACTT CTGGGCAAATAT CATCAGTTCAGTAACACTTCTTGT CTATTTTGAG CCTAAAG GGAAGTAAG CATCT GATTA CAGAGTGGACAT CAACCAT CATATG GAGAGGAAAGACTATT CCAAAGGAAGGT TGATTGGCTCCCTTTTT GCTG CACTATAGTT CG CTTTGGTTGAGCATG G CAGTAT CAA CTTGTGCTTTT CTGAA CAATTGCT CTT CACAA CT TTTT AT AATATCTTGAACT CAT CT AT AG AAGAGTCTGAAACACAGT CAGT A CTTTATATATATG AG { N ) xCACTG TGAT TCCAGG CAGCACATAG CACACAG CA CTAT CTGTG TGTG CT GAGACTTCAACTGTGAA CTG C CAAAT GGGAA AAGC CAG GTC CAGAAG A CTT AT ATTT AGGT AAAT ATGCTTTTATATTCTTGAAAG G AATC A CAAGAAATAATTAG AACTGTTATCTTTAGAAGG(N ) xTTGCTTATTACGTTACTGGAATCTATCTAAGCAGAAAGAGGCTGGCCCACCT TCTGTTAT CTATGTTT CTATACT C TTTG G ACTTG C(N )xTTTTTTTACCAC TAATATTTCAG TATA{ N ) xATGGA GTAAAAATTCAACCAGATATAT CAGAGATATGATATTATGGGAT TTTTAAAAGT CTCTTT TTAATACATT CATTT TTTGAAAAAGAAGGAAAAAGAGAAAG TTAAAATAAGAAAAAGAAGGAAATATTGT CTTAAAAAATAAAACAAGC T AAAAG GGAGAAACTTT CCTGATGAG TTAGCGT GGGCCCTTCTCGTGAT CCTTTAG CATATGGGACCAATAGGAAG TAGC CATTGAGCTATC CAGT CATCCAGGGTGC TGCAGAGAACTG CTGTTGCTTCCCTGAGTTACCACTTGGTCAG AGAGAACTTC CCAG C A T A (61}xCATCATATTCTTTTTGCAAATTATTTCATCTTGATTATTTCAAATAGCCAGCA GT TATTTC AT CTTT CC AGAAAG AG GA GAGACTTC CCTT CAATGG CTTTTTGATC CTGGTGGAATTCAC AAGAAG A GAGCTTCATAAGAT CA CAGTGCTTTG CGGGAAGAAAAAGATCAAAGTGGATGTG TAAGATAGCC CTñCACAGTGT CTTGAGAATA CAG C CT G CACAG CACAGGGATGATCCTAAAAA CACTAAG CTATG CCATCACTAGACACAGTTGAC AGGGAGAGGGTTAGTGTCAGGTTT CACCCTGGAGGTGATCTT CACCTTGTTCTTTGGT CAAGGTAAGGTAAGT CA AGTG CATT TT AG AATAñAAC AATGTTTT AAAAAGTAGG AAGCTC CAAACC AAG G ATCTGATTTC CAAG GAGTTAC AGAAATGG GATTAAAATGACATTTAATCAACTATAAATGACACATAAAA CTT TC T TA A C (N ) xTGAGTATAGATG GTAAATTAAATACCAG CCAATCAT TCAGCATT TATTTTAAAATGATACATTATCGAAG CATTTGTACGACATTCT CATCTTCTTTGCAG CACTCACCTT CTATTG CAGTGAGC CAAAGAGTAGAGGATATTCCAT CTTTAGAACCATG GA GTTTAGAGGGGTTT CAGATTTT GGGGATACTTGGGGTTTGGGAT GACACATGAAGAAATTAATC CATGAACTGTA CC CTATAGTATTATTCATATTTGT TGTTTTTCT CTTATTAGTAGGT CC CAGT CAGTA C CT CTG GAGATGAAATCA CTCCCAATCAAAAGGCAACAGTGACTTCATATTATTATTTTCTTGTTCTGGACACTAGATGTCACACTAAGTAAA GTTTTAGAGGTTACTCTTTAGAGTAAGGTGTGGCACAG TGTCTAAAAT CTTAGC TATGAAAAAT TATAGC T CATA CATAAATATAAAGCTGAACTTTATATTTATTTTTTGTTTCTTTTTTTTTGTAAAGCTGAACTTTATATTTCTGGT AGAATGCAGTAAAAGGGTACTTTCCCTAACAAAAAATTCAGAAAAAGAATTGTTAATAAGAAAATGTAGTTGGAA CAG CAG CCTATT TTGATGTCTGTTAAATAGATAGATCCCATTTACATG CACATAAAAT CGTATT TC CTTCTAGTA A CAAAGTGATTTTT CTAGAGGCAG CTCGTGTATACATAT CTGTGATCGTTTATGTT CTGAGATGGCGAGAAG CCA TT CTTC CT GCAAAAGGAACCAGACATTATCCTAAC CTGCAAGGGACGGAGAAGG TGAAAGAAGAGATACT GAATA TGAGTTTCATTATCTCCTGAAACAGTCTCAGGGAAGGATGTGTGATCATCTTTCTTGAGTGGCTTAAGCAAAGAA GAGAGAGAAGATGGTGCCTCCCAGCCTCCTTGACCTCATTTTTTTCCTTGATTTTGGCTCTCAGAAATCTAGACG AGAATCATAGGTTGAGATTTGGAGGCACATTATTTCCTTTTGGAAAATGCATATATATATATTTAATGGCAGGGA AGTTGGACTCAGTATGTAAAAGAGTAAGATATTCTATCCTCTAACTTAAATGTGTTATTGAAACTTCAATAAGTT TC CATTGAAATG CTTGAAATTCATTGGACGCTAGAGATAATTAATTCAAG CCTTAACAGT CG CAATACAC CTTTA TTTTTTGAAGTAACTCAGTAATTAATTTGAACTGTGTCTAATGATCTGCACATGACTATAAACACGTATTTCTAT ATGTGATTACAGTTGTAAATAATGTTTACATTTCAGGGTTTTCTAGGTAGTGTATAAGGACAGGTGATTTCCATT GGACATAATTCCTTCTACATTTGTCAAATATCAGTGCCTTGTTAATGCTAAGATCACTAGGATAAATAAGGTTAC AACTGCTAAAATGCTACTCTTTAGGTTTGGGAGTCATCCATGTGTTCCCCTTTCTTATTCAGGAAATGGGCAATT GGTTATTT TAACAGTG CCTT CTAAGACTAGGTGGTTCTAGTTAGCTCT CAAATT CTGTTG GGAC TTGGGACGTTT TT C C TCATATTACTGC CTTGTGTT TTAGCAACCTGAG GTCACCCAGGT CC CGAGAGAAAT CTTT CATT TT CT CCG TGAGTG CT CGGGGGAT GGGGAGTT CTGTGATGATGGCAGCCTGTGCTGTTTCTC CT TCAAGAAGTCAGACAG CCA CTTTGGAC CATC CGTAGG CTATTCATTCCTTCAACAGTTTGACAACTCAGAAAAGC TATAGC TCAT CGTGAAGG C CT CAAAGGGAGGAAAATC TCTC CAGAATAAAATGTAGACAAGACTATGT C CGTATCAG GT CAAAGGTTGG GGTTT TT CC CT GTGGGTAG GAAGGT CAGC TCTGTATTTACCAAATAGTTATTGAATAAC CG CT CTGC CTGG CT CAGG GAA CAAGGGTATTATCCCTGCTTAGATGGAGTAAAACCTGTGCTCTTCGCCTGCCTGACTGCTGTTAAGGTTGATTTA TATCATTATTAAGACAACTGTCGGGGCTGTTGGGTGGCCTTCCTGGTTTCTCCACTGACTTGCCTTTTGATTTTT
g g a t g t c a g c t c c a g g a c c t t c a c t c t c c a t g c c t a a a g g a a g g c c t a a t g c t t t a a a g a a a c a t t c t g a g g c t c AGGTGATTTCA(N) xAGACTGTGTTGAGCTTAGAATTCCAGGTTGATAGCTAGGTGGTGTGAGAAGCAAAAGCAA a c a g t g a a g c a t c c g g g a g c a c t t t a c c t a c g t g t a t c t c a t t t g c a c g t c a c a a t a a c t c a ( N) xCATCTCAGG GAGCAGGGATTC CT CT TT CTTT CAATCAGGATGTA CTTCTACA(N)xGATGTACTTCTGCAACTTGAAACAGCTT TGTAAG ATTTGGTGGAGG ATTTGC AGACCGTGGATTTCTAATGT ATCC ATGAAC CATATT TTTATT TC CAAC AGG TCATGGATTGACGG CAGT CAAGGAAAAAGCAGGAGCCACTCTACGGATTCATGGTGTAAATT CT GGAT CT TCTGA AGGAGCCCAACCAAATACTGAAAACGGAGTCCCTGAAAGTGAGTGATGTGTCTCCTCTGGGTGTTCTTGGACTTT ATTACACCATGTGCATAATCAGAGGTTTTCCAAGTTCAGATCAGCGACGTGAACTCTTAAAGGATTTCTTTTTTC TCTTTAGTAACAGATG CAGC CACAGATCAGGGCCCTGCAGAAAGCCCACC CACTTC CC CTTCAT CAGC CT CT CGG GGTATG CTGT CTGC CATCAC CAATGTGGTTCAAAACACAGTGAGTCGCTGGCTG CTTC CT CT CTTT CC CCTGTAT TT CC CAGGGC CT TCTT TT GATATTTCCATTTACCAGTGACCTAGATACAT CAGAGAAACGATGG CCTTGCTCAGA ACTTTGGATTATGGTTTTTTTTTTAATGTTTTCTGCTACAAAGTAGACGTAACCTTGGATTCAAAATAATAAGGG CAAAAT GAAATATATT TTATGTTTAAAAGAAGGGAGCTGTTTGGAAGGAACATTTTTCAGTCTGAT TTAACAGTG T C CT TT CCAG CTGCGG CAAG GACAGCCTCTG GTTGAG GGGAGAGAGAACT CT TTAT CAGAGC TGGGACAG CTGTT TTT C CTAAATGCTCTG CTTATCTTATCCCTGAGTTTGAAAAAGCATTAAAC CGTTTAAATAAAAAGTACC CGTTA CAAGACAGGAAAAACAGCCCAAATATGTTTCTGAAGTGTTTCCAGAGACCAGCGTTCTTCCCCCATCCCTACCTC TTCATTCCCACCCTTCCTCCTCCTTACACCACCCTAAGTGTATCCCCCTAGATCCCTGCCTACCTCTTAGAAGAA GCTC CATATTAG CAGCAAAACATTGGTGAATAATGAAAAAAGGGGAGG CACCTCAACTGAGATAAGGCAGGTAG C TGGT CATCAG GGTCAGG GAG CAAAGA( N) xAGGTGTATTTCTTACTATGTAAACTAACTAGTTTTTCTAAAAATA TGGCTCGTGT C (N) xGGGCACAATATCCCAGAGGACTGAGGCGACTGAACAAATAAACACAATGTAATTCGGGAA GAGGTGAGGGCTGCGGTCAAACATGAGAGGATCTCACCCAGACCTTGGGGTCAGGACATGCCAGAAATATAGAAA ATGACTGGTCTTCTCCTTGCTATTTGAGGACCTCTAGACTGGGGTGGGGGAAGTAAGGAGGAGCAGCAAATCTGT CTTAGGAGTCAAGTAGCAATACAAGTCAACCACCCCCGTGGGGGGTTTGTAGGATGGGAGATGGATGGAGCTCTG TCAGTACCTCTGGTATAAGAAATGAATTCACGGCCTATCTGAGCTACCAAGAGGGCACTATGAGTTTGTCTTCAG CAGCCTCACATTCCTTTCTGTGTTCTCTCCAACCTTCTTGACCCCTTTTTCCCAAGTTCCAGACACTACCCCTCC AAGT CTAT CCTTAA CACAAATC CCTGTGTCCTTTCTTCCACCATTTTATTCT CAATAAAATAATTT TT CATAGGA TGTTAG CAGG CACTAAG T CAAATTACACAGAAGTG TGAATTGCTGCTGT C CCAT CATC CC CC CGAAGTGTG CG CA CTGAAAAC TGAT TTACAG CATGATTACTGGTTGGCTCCCAAATCTAAAAA CAAAGT CTAAATAC CCAAGATGAAA ATAAAGTATATTAATT CTAT GAAATTAAATACTTCTATTTAAATGATAAT TATGAAGATTGGAAAATG CATATCA
G (N) xGGTGGGGAAAAACACAGATGTATGCATATTATAATTGCAGTTATACAGTTGGAGTGTAAGTATGAAAGAG TTGCAAAGACATGTATGCAGTTCTGTTCACTTGAATAGTTTATGTGTGAGTAAGTCTGGGTTATTTTAATTTTTT AAAAAAAGCAACAATGACTGACAAAGGGATTTGCAGGAGGATAGCTATATAATACACCAGTTTCTAAAGCCAGAA GTACAAGGAAAT CAAC CTGT CT CATTTCATGGATATACTAATTATAGG CAGTTATTGCTAGTTT CTGAGAGTATG CAAGAACCATCTGAAACAGCCTGAGGGTCTTACCCTCCAGGGCCCTAGATTAAAGAATTCCCTAAATTAGCAGGA CTGTTTGACATTTTAATAGACTGCTCCAAGCAACTCTGTTACTTGGGTTAAAAAAATTTTTAAACAGAAACAGCA ATTTATTT TATGTTAATTTTATGT CTTAGCAATTTATTTTATGTTAATTTTATGTC TTAGATACACAT CCAT TCA TACACAGAGAGCCAGTAAAATTAATCCCTTTGTGGGATTAATTTTATTTTATGTGCTTTGGTTTTTTTCTAAGCA TCAGTCTTTAATAACATTGTAT GT CCTACAAATTATATAACATATGCTTTTC CAAG GAAACGAGAATAGGATTAA AT GGATGATAAATCTC CT GATGAGAAAAAATTGTGACTTTATATTGATTTTAATAT TTGAAGATAGTG CCAGTTA CCTAGGAGTTGCTGACTGCTGCGAAACAGCAAAACCTAAACATCTAATCTCATGCAGTTATCTCATCTGCAGGGT AAAAGTGTCTTAACTGGAGGCCTTGATGCGTTGGAATTCATCGGCAAGAAAACCATGAATGTCCTTGCAGAAAGT GACC CGGG CTTTAAGCGGAC CAAGACGC TC AT G GAGAGAACTGTTT CCTTGTCT CAG GTTGGATTATATACGTTT GCAATTTTTTCTTTCAGTCAATAAA(N)xCTTTTTGAATAGCTCATTTAAAGCATGTGATTCAAATCAAATGACA CCAAAGAGCAGAGAGAGAAAACTGAGCTTCTAATACCCCCATTTCCCCAGTCACTTAATTCCTCAGAGGAGACAA CCAGTTAACAGTTC CTTAAGGATT CTAT CAGAGACCGCAGAGAG( N) xGTCTTCTCCTCTCACACTTCCAGCACT AAAGTTTT CCAGCCCTTCTT GCAAAACATAAAACAAAGAATT CAAT CTGT CTTTGCTTACTTA GAATAGTTAAGA AAATTG GAAATAATATTAGT CTTGTTTT C CAAATATGACT CTAATATTAAACTTACTTTTCCAGATGGAACATG C CTAGCTTACCCCACAATCCAGGCATGCCCCAGTCCACCTCCTCCACGAGAGACTGATACACCAGGTCTCTGTGCT TTCCTGAAATCAGAAGCAGCTGATGCAGCAGTGGATTTTGTAAACGTCACAGGCAGGACTTGGTACTAGGAACCC ATGAAGGTTGCCTTCATGAACAGCTATCACTGAGAATATGTTCATGGCAACCCCTAGAAGCTTTTGTTTTAGGCT ATTTCTTCTTGTCT CTTCTGGAATAACC CAAGTATGAG CACT CCGATGTTATAGT C CAGGGAGG GACTTTGCAGG ATCATCTAGACCATCACCCTGCTGCTCCTTGAAAAAGAAAATAGAT(N >xGCATGAATATTCCTGCTCACACAGG AGCTCGTCCCAGGGGGCTCT CATGGCTAGCACACTTCTGAGC CCACATCTTCTGGCCTCATCAT CTTGAAATGC C TTACTTCCAGCCTCACAGAGAGGGGATGTGGAAGGC( N) xTGAAGGCTGGATTTCGAAGTCCAGCATTTATGCTC CCAGGAGTAGGAGGGAGGTCTG CAGGTTG CACTGTCACAC CCTCCACT GGAGGCAATTGCACTTTCA(N )xAAG C GCAGGGAATACT CTTTAAATAACACGGG GACAGGGGGT CCTTGGATTC CACACATGGAAGTCTC CATAGCTT CT C TGTGTCAG GAATTC TT CCAG GC CAGGAT CATT CCCTGCTATTGAAACC CACTTCCCTTTCTTCT CTAGTGGGAAA ATAGAACCAGCCTCCGTTGTCAAGCTGGTTATCCCTCAGGCAGCTTTAATTTCACTCCCCAGTTGTG(N)xTACT CCCCAGTTTTGAGTGAGTCCACATCTTTCCAGAGTGACCCTTTCCCTTTCTCTTCAATTTCATCCCTTCTTTTTT CTCT CC CGAAGC CCA CATTCGGAGGTGGAAGCTGGTGTAGAGAAAGGAAGTGATTT CAGTGT CACT TTGTTATTT TATTTTAC CACCAC CAAGTC CT GAGATCAGTG CGGGTAATTAGTTTTTAT TGCCAG TGTTAAAATT TTCCAGTGA TGGTTT CATTGGGTTGTTTGAAAAA CTTTAAGATTCAGAGTCTGTG CTAATTTAGT CTTAAAGACAGACAG GAGA GAGAATTCTCTTTG CGGCCCTG TGGGTT CTTGGCAGATAACCTCGAGAAC C CAGAAGGAAATAAGG CTGATTGTT TTTCTG GCACACAATAAAAAC CAAAGTCTATT CTTCAG TTGTCTTCCCCTGTGACCTTGTGGTGGTGGCCAGTGT CTTTACAGTGCTTT CCA CAGAAATTAAAGAAC CCCATACACGTAATGC CT CAGTAAACTTAATGAGGAAT CACAT AAAAGATG CTTAGAAT CAAAAACG CAAGTG CAGAAAAT TGAGAGTGACTGATGGACTT GTGTTTATTTGTTGTCT TTTTGCTTAACCAACAAAAAATAACTGGAGAAAAGTTATTAATGGGGAGAGGAAACTTGGTTCACAGTCATATTT CTGGAAAC CAGACTTAAGAATCTATAAG CTAATG(N)xCTAAGTTAATATTCTTAATTGCAGTAAGTATATGATG ACAAATACATTATGAGATTATGTTTATTAATTTTTAAATTGAGGAAATTTAATAACCACGTACAC(N)xTTATAA CTTAGAAAAAAT CAGG CCAG CTTATTTCTG CTTTGACGT CTGTGTAGGTG CAG GACTTAAAGTTAATCCTG T CAT AAGAGATT CAACTTTT CAAG CATACA{ N ) xCCCAGGCATTCAATTTTTGCTTTGTTTTGCCTTTCACATAATGAA CATGTAAAATGTGAAG TAGGATATGATGTAAAAGTTAT CAATAGGATCAATCAAAAAAATCATTTT TTAGAAATT CTAAAGTGACCAGAAT CTCCCCCT CTCCAACACAAAAT CAGT CCTC CCAACCTC CC CCATGT CTTA CACAG GAAA CGGTGCTTTTAAATGTACATCCAGAGTACAAACATTGTGTATTTCTGCAGATGTTAAGGGAAGCTAAGGAGAAGG AGAAGCAGAGAC TGG CACAG CAGC TCACGATGGAGAGAAC CG CGCACTACGGGATG CTGTTT GATGAATATCAAG GCTTGT CACACCTGGAAGCC CTGGAAAT TCTGTCCAATGAAAGCGAAAG CAAGGTACTTCTG CACTACTCGTTTG AAATG G CATGCTTAGT CATGTGTTGCTTAAACATGAAAGAAACTGGAAAAT CTG CCATTAAAAGAT CTCATTTTT AGAG CCTCATTATTATTGTG CTGAGGTCAGAAATACTGTACAATGGGATGACCCTCGAGCTC CATAT CAT CT CTT CTCTGTCCTATGCTAAGCTATTACCCCACCCCAGCTCCAAAAAGGCCTGCACAGTAAGAGCCACCACTATCCCCC CACCTCATGGTGCAAAAGGCTGCTCAGAAATGTGGCTGTCCAGCTGCTGGCCCTCTGCCTCCCTTTGGTAACGAA GTACCCAGGTCAGAGATGCCCTGCCAGCCCACATCTTCATTCCTACCCAATGCAAACAAAGCACTTGATGTTCCT TTTAGACT TCAGGGTTTTAATATATTTTGT CTTATGGGTTATATAGTTATTTATT CAG CCAT CTTGTTAG GACT C CTGGGCCACAAAGGGCAGGATTATCTTACTGAAGGTTAAGGTTCTATTAGGGAATGCTCCCGTACACACACCTGG GAAACC CAGCCTACC CAATGGC CCTGCACATT CATACT CCTC CACTTGGT C CCAGGAAAACAAAAGATGAATAG C AAAAGTCAGCAGGGTTTGGTGATTTTTTTCCCTTAAAGCATCTATAGTG(N)xTTATTTAGCAAGCACTTAATGG CACT TACATAGC CACTATTCTAAATACCATAAAACTGT TAACTTAT TTAACTGTATATATGTGG CTGTTT TGTGG ATGT CAG G CTGACTGT GOTO CGGAGTGGTTTT CTAGTGGAGG CCAC CAGG CAGTAC CCTCTGGG CCTCCAGAGAG GGAGGGGC CGTG GTGACATT CATTGAGGAGAGGCAGGT CC CTAGACAAGAAGCT CAATTTGG CAGTGAGC CTTAG AATGGAGAAGTAGTTTTTGACT CTCCCTAGAC CTTATC CTGTTATGTTTGGACT TTGAGCCATT CT TGATGC CTG TCTAATCAGAGAAAGACAACATCTGCCACATAACAGGCAACCAAGAACCAAATAGGATGGAGACATTGTCCCCTG CCCCAGATGATGGAGC TACATGTTACTCTGTT TATTTC CTTCTTTGTTAT CCATGAGC CTTCAGGGAAAGT CTTG TGGTTAAAAC(N)xGGCAAGGTCTGGAGTCAGACATTATTTAATTAAACACCATGCAGA(N)xTGTACACATGTT GATGACAGTTGATTTTTATATGCGTTCTTATGTCATAGACATTTGTTGTTATTAAAAATATCACAGGGGAGCGTG GCCCTG CTGTAAAAAAAACATTACAAGATATCATGGTGGTGGAGAATGATTCCATTGTGGTGACTT CCAT CACAG CTCT CAGC TGTGAAAGAGCC CCAGGGCC CTGGAAAAGAGC CC CCAAGGACTTAGTTTCTAGGAAGAAGGTTCGG C CTTTGGAGGGGAACñGTTTG CC CATTGC CT CCACAAGCTC TGCTTTCTTTCCCTAGCCTGGAGATTTTTGTGATA TTCATAGTAAGGAAAATAAATTTTTGTCTTGACTTAGTAATGATAGTTAATGGGTACTATAATAAACAGTCCTAC AG
:> H s4 _38284 097 -383 13 29 9
TTTTTGAGTAAG CCTGTTTCATTG CAAGTCAT CATTGC CAGCTTCAG GTGTTTGAC CCAAAGAAG GATATTTGTA TTCCGCTTTGAGCTACGGCTCTACTTAATTAAGTAATCTTAAGTAGATGCTCCTGCCTCAGAAGGCCTCCAAGCC TCTT CCTTGTAAAAGGAGTT GATTGGTCTGTG TGATTC CTTAGGCCTCTC CAGCTCTACAGC CCATCCCCATTCA GC CAGAGGACATGG CGCT TCTT CGAG GT CT CTGC CAGGACATTAGTAGGCAGTGGAAATC TTCCACCAAAATTCA AGGGGGATTT TGAATAT CAGAAGATGGAAAATGAGAAT CAAT CC AT CAAATTTATTATTTGTTATTAATTAAGGG TG AAAG AAAAGAGTGAAAGTACTGAC CAAT ATGC AT CT GT GATT TC AATTGG CCATGTTGTCACAGTCTTCTGTT T CTGTGTAAAATTAAGATTTGC CTTTATGT CAGCTG CTG GGTATT(N)XCAAAAAAAAGAAAAGCTAACATTTCT ACTGACTCTTCCTAGCTGATTTGTGAAAACAAGGTGATGGGCCAATTTGGAGAAAAATATGGGTAGAAGTTGGTG A C CTGACTTCACTTñGCCAAGT CTTTTATT CACTGACCATATTG C CAT CATCTTAC CTGTGTTC CAGAflCTGGTT AATTGTTACAGTATAGTC CCAGGGTTTTGAAGGAAAAAAGTGGGAAACTTGC TGCAAAGG CAGC CAGATCTGTC C CCACTCTCCAGCTCCCTGCGGTATTTTAAGTCAGCATCTCAGAGGAGGATGATGGTCACCATCCTA{ N) xTGTTG TCAGGCATAAAATAATGATTCTGTCTTATTTCTGAATATTTAACTTATACGTAATTTCACTCTCACGTTTTCACT TCCTTACAGAATATCTGCTTCTTTCAAGAAGTATTTTAAAGATCAATGCTGTCATATTCATTTATTTTACTATAG AT CAAATTCTATGGTAA CAAATAAGATTAC CAGAAATC CC CAGG( N ) XCTTTTTCTTTCAAAATGCTCTATCCAA TAGCCCAGAAATA CATC CAATTTAAAAGAAAAATAAATGCATT CAATT TAAAATCC CAAATGAA CTTG CTCAATT TTGTGGGGAGGGGGGTTGTGTTGAAATT TAAT TT CTGTGCTATT TG GGATTCATGCTACT TCTCAGAAGAGA CAO AGTTTTCCAGTAGAGATAAACGACCTCTTCCAAGAACAATGGAAATGAAAGGGGGATGGGGGTGGGGTGGGGAGT G CTGTGGGTAGAGGGGCAGGAAGCACAGATGAAT TGAAAAGAGAAGAGTTATAAGCAGGT TGGC TTCC TTTAAAA AGATGGGATAGGGTCTCC(N ) xGTTAGGCTATCTATTGGACTATCCTTCAAAGAGCGGCTGTGACTTAGTGTGAA AG CAGATTTTT AAAAAAT ATGAATAC AGTG AGTCA C AAAATT CT CT CACCAAAAC C AAG ATATG AAAAAAAAA t N ) xCCATTTAACCATTCAGTCTATTCTACTGGGTATCATCTGGTGGGGTATAAGAAGTCTCTTTAGATAAATAATG TTTTCAGAGCATGCCTTTTTCTTTCCTTTCCCAGGATGGTGCATACCATGGGGGATGAGGGGGTCTCTTCCCTCA T A C T ( N ) xAGGGGCATGAGGGCTCCTTCGGTTGTGAGTAACAGGAAATATGGCCTTGGACTCATCCATTCATTGC CTTCCATCACCTCCACATCATTATTCTTCAATGGTGTTAAACATGTCCTCAAACTAAAGCCTACAACCTCCTTGC ATGAGTCCTGGAGAAATTCTCATCCACTGTCTCATCCCAGTATCTTTTATCTCCCCCAGAGAGTGGATGTTTCAG AAGCATCCCAGCCTCGTCTTGAGTTTTCTTCCCTGAGCTGCTGTCTCTGACCTACTTTCTTTACCACCATCTCCC CTTCTCTCCCCGT CTTGG CTCATAAGTGAATG(N)xATGGCAGAAATTTGAGGAAAATCAG(N)xGACGAATATG TT CTTATTACAAATACTCAACAACACAAACATATATAGTAAAAAGAAAAATT CTGTCTTGCCTTACAC CCCATC C TACCATCATAATACCCCATCCTACCATCATACTTTC( N ) xGTAGTTCAAGCATTGCAGATATGTTTGCTTCATAA GAA CCTAATG TGGAAAGTGATGGCATTC CATATCTGTCTTTT GACCAC CTGTAAACATGAAAAACAATGCTGT CA GGAGCGCCGTTGCCAGAAGCTGAGCTTCTGCTTTAATTTTGGTCATTATTCTTCATTTTTACCACTTAATCAGCT CC CCTGAAATG GCAGAGT TTTCAACTTAAATTTT TCTT CTTTAAAAAC(H )xATCAACTTATATTTTTCTCAATC AAAAAGTACCACAGTATAGATAGAATAAGG CAAATT CCATGTGT TAAT CAGTATGGACCTGTTT TTAGTATCATT AT TGCAGAGAATAAAGTTGAAACTTC CAGTGCAT CATO CT CAAAACTG CAGAGTG GTCTTTTAG TGGGTGGCAGA GT CATGGCAGTGGGTTCT CTCAGTGGTGGT CATG CAGGACAG CAGCATG CCCAAGTTACC CCCATTGTGATGAA G ACATGAAATGCTTGCTTCCCTGCACTGTACAGGGCCTCAGTGGAGAACAAGCTTGTGGCTGCAGGCACAGCACAG CTCAGGTGCCAGTTGTCTTGACACGGGTTTGTCGCTACCCAGTTTAGTGAATTCACTACTGGGGAGACTTCATGC CCTCCTTTCTGCTTCCATTGCCTTTACCACCTCCGG CTAAACTATATT CTTT C T C (N ) xCAAGACCGTGAGATAA GG CTTGAAAC CTAAG GAATTTCAAAGAG GGGGCCAT GATG GG CATT CAGCGG CATCTCAG GAG GGGGCAGAGTAA CCAGCAGGTGCTTTGCATGCTGGTCTGGTGTGTCTGGAAAAGGCATCCCGCTAAAACGTTTGTCAGGAAACTTCT CATCAGCATTTCCAGCATGCAGGAGCAACCAAGAGAGCCACTTAGATACCATTTGATTATAATGTGTTTCTCAGA GAAATAAATAGCC CAAATGGAAGCACAC TT CT GATTGT CAATTT TTTACAAACACC CTGGAACT CAATGACAGTT CT CGAGGAT CGATATGGT CTGGTGACAGTGAATTGATT GTTC CAG CAGAGAT CTTTAATGAGAT TGTTCACACTA TT CAGG CACCACCATCT C CAGCAGATTACAATGACAATGG CG GGGC CTTGTGAATATAGAAAG CAAAG CAGC CTT CCTGCT CATTTCTTTTTGAAAG CACTAACC CCTAG CAG CAAG GC CTTCTTGTTAAT TAGTTGTTAATTATTGGGA CAACTC GTGAAATGTTTATTAAAGAG CTGGTC CAGT GG CAAAGAAAAGAAC C CAGGGCAAGAGACAGAATTGAAG GGTAATAAAAGGAACGAATACGCTTTTACACAGTTCACTGGCACCAAGGAAATTCTAGCAACTCACTCTTCTGGT GCAAGAGGAGAGGTACTGTAGAAATG CATAGCAAACAAACTA GAATGGATGAAAATGTAACAGACACAT(N ) xA T GACATGTATGGTCTATGATGTATATAGT CAAAGTAACACACTTACAGG CTATGAT CAAAG CAGTAAGC CCACAGA TT TAGATTGT CTTT CATGTAACTTTT AACCTATATC CCATGACTGTAT AACC AGA (N ) xAAAATTTTTTTATCTT CT CTTCATTTTAAAAAAAATAT T CTAAAAT CTACAAAAATAGAAGCAGTCCAGGCTTCTCGAAACTCT CGGGGCT CT TGTT CCCTGCTCTATATGCT CTCATTGTAG CATGTCACTTTCATAAGTGATTTAAGTTGGCACAAT CAG GTTG TAGCTTTGATGATGTTACATTAACATTATCTTACAACTTTTGGCTTTTATAAAGGATGACTATTTTTTTTAAAGT AG GAGATTCTGTGTTGCATATCTGTAAAGCTG TCATAGGAAC CCAGAACTTTGCTACCCATCTC CTGG GAG GTGT AGACAACCTG GACACCAGTCCTG GTAAATT CTTAGG GAAGAG CAGTTAGTATTTTCGTTCTCTATACTAGCCACG CC GATT TATTTGGGTGGCTCTGAAATACGC CT TCAAGT TCATTCCTATG CACAATCATCT TTTAACTTAATATAG TTACTTAATATAGTTGACTTAGTACAGTAAACTTAATACAGTTAATTTAATATAGTTTACCTGCAATATAGCAGG TAAAAAAATC AG AGT AGGGGT AAAGT AAGAAT AT T ATT AAAAGC ATTT TTATTCTC CCTTGAAG AGTTGCTGT AA AATGTT ATCTTTTC ATT C CTGCTTTCTG CTTT CAGAAGTG ATTTGG AAAGTATTGCTGGATG ATGCCACTTTTAT A CTG AAACTATGTAG GTT CTATG CAATCTATG ATAATTGAAG AAAAAG AT AG ATTCTATT CCTT AGGAAACC CTT CAGTCCCTTACACTGGCC CTGTTACGTGACGC CAGG CATG CAGT CTTACCTGATGCAGCTAGTACTTCTCTC TGT CTCATTCCTCTCCCAAGTCACACTGTGTAGTCT(N ) xACACTGCATTTTAGTGTTAGGCATTTTAGAAGCACCCT TGAGAAACT CTGAT CTGCCCGCATTTTCTTAC CAGT CCCTTGACATGGATGCATGTTTTTGCATCCAT CCAAGGG AC CTGGTAAGAAGC CTCTGAATTACAGT CTTTAT CTGTTTATTTGG GTGACT CACT CCTGGGAT CCCTGGG GAGA TGCCCCCTCCTCACTCAGCCTGTCTCCCAGGACCTGCCTCCACTCCCCTTGCCTCTGGCAGGGAGAAGAAAAGGA GCCTGGAGAAGGGCAGACCCCCAGCAGGATAATGCCTACCAGTGGGACCTTCCTTCTAGGCTCACTGCATTGCCT ATGATTATTTTT TT AG AATAGATTTAGAAAAAACC C AG GC CAGTTGGGGC CAGTTCTGGTGATT CTGGTTTATGA TCAAGTGGGAGAATTACTGTTCAGGATCAAGCTACATATAACTATAACACACATCACAGCTGGCATTCTGGCTTG GGTCATAAATGCATCCGTGGCATTAGATGCAGCGTTTCAAAAATACTCTCTGCAGTGATGGTAGATATCACAGCC
a g g c t g c c c t g a a t g a a c t c a c a g g t t g c t t a c a a c t t g c t c t t g t t c c a a c c a g t c g g g c a g c c a t t t t t a t t g GCTGAGTGTCTCATTGATTCATTGGTACTCTGCTATGCCAGTTGTTAAATACTTGTACTACTGGAATTAATATAT AATTTTTCTC CTGAAGGACGAAACTC CTGTGAATAT CT CAATTCAGAATGTTTCTTAAAAATCTAATCTTGTGAA CTAATAGT CAAT CT TTGCAAATGATGTAAGTCATAT GACT CACATTCTAAGT GAAAATCTACTTTAAGCCAGGTT AATATTGGTC CTACATTTATTC CAGGTATTAGCTTATATAAG CATGAAAT CT CCATGCTGTTTTTT CTACTGTTG CAAGCATTCACTTCCAAAGTATGCATAAGGTTGGAAATTTAATACTCCCAATACTCCAGCfiAAAAGATGCCTTTA ACTTTCTTGGCCATATCTTTAGCTTGGTTTCCTACAGATGGTACTCTGCTGATTCATTTCCTGATGGTGCACTCA AGTTGATTGTTTTATTCTTTCGAAGAATGTTATAAATTGCAGTTTGCTTAGAGTTGAATGATGATGGGGTGGGCT GGGGTTTACAGTTGAGAAGCACCACTTTAGCTTATAATAAGACGCATATAATTTCAATTCACGAAATAGGCTGGG GAAGTATTTATAAATAATTATATGTCTGGATGTGTTTGTATATATAGTTACATTTATTTTTAAGCATACATAG(N ) x CACACATATAGTAATGCAAAAAACCACTATACCATTAAAAAATTTTTAACTTAAGTTCCAATTCTGGGGATAC AAGG(N)xTTAAAAAGTAACAGTGTATCATGATAAAAATACATAGGCACATGAATGAAATATAAATACAA(N) xG c c a t c t t t g a a a a c a a t c t a c t a c a a g t a g a a t t t g c c c c a t t c t c a c c t t t t a a t t a g t a g a t t a t c a c g t c a t CATAATCATCATTTTGTGCTAAGACAAATTATGAAATTAAAACCAAATCACTTTAAAGAAATACTTAGCATTTTG AGGAACACATTTATGGTGAACATATCACTTTTTTCCGGCTTTGGCTTAAAGCAAGTTACATAGTTGGGATTCTAT TGGATTCTTTCTTGCTCCCTGGTTCCAAAAGACACCATAGTAATACTCCATACTCTTATAGCTTTAGGCTACTTG CTGTAAGAGATAAAAAACTGTTTTCCCACTTTCAAATCAC( N)xTTTCACATTTGCCTTTAATTAGAATTCTGTA t t g c a g a a a t t c a c a c a c a t a c a t a t g c t c a c t a t g c t g a t a t a t g t a g a g t c c a a t g a g a a t a a c a a t a a t t t c AGAAATGTCTGATTGCCAAAATCAGATACTATTTTTCTTAGACTTCAAAAATAGCGTTGCCATGATAATCTCTCT TTTCAAAGCATAGCTCTCATGCATTTTGCAAAGTTATTGGTGATCCTTAAACAATCATTAAGTAAGTGACTGAAT CTAATGCTAGGTAAATATTTGTTGTTTAG(N)xATTTTTCCATGGACTCTAGCACCAGGCCTGTCTCAGTGGACT CTAATACCAGGT CAGCTCCTGCAACC CCTGGCTCTAGG CTCACCCTG(N)xATCAGCTGCTTTATTTATTTATTT AAACCTTTGTTTTACATTCAG (N) xAAC CC ATCCCC CAAAATTTGTGGTT TAGAGTTAGT ATGAAT AATG CC AAT GGGTTTTCAGTTCATAAAAGAGTGATCTGCTGACAAAAGAATGAGGGGAGCAAGTTTTAGGAGATGTGTTGACTG AGGAAGTTTCTT CAGAAGGAGC TG CT GTATATTCCATTATGTTTTTTATT CT CTGTTTGATCTT GCTGTCCTGTG GAACTTTGATGTAATTAGGAAACACATTCCTCAGGTTAAATGTGAATAGGATGGGAT(N) xAATGTACTAGTATG AGGGACATCATTACATTTAAGGAGCCTGGGAACACGGTTCTCAGGCTTTCGGTTTGGTTGTTCTGCACAA( N) xT CTTCTGCACAATTTTTTAAGTGGGGGGACTCTTTTGAAAGGACTTCAGGCTGAGACTTCTGAAAGCATGTAGTGA AGGGCTGAGGGACAAGACCAGAAG GTGGGGCCTAG C CAAG GCTTTGTACT CG CATT CTTACACT CT CCTG CCATT GTCCCTGTGATGAAATGAGCTTCATACATATAGTTTGCTTGAAAATACACCTCATGGGCCAGGTGCAGTGGCTCA
T(N)xCACCCCATGAAGTGTCCTATGATCAAATTTCTGAGACAGAGGAGAGATATGGCATGCTCCATTGTGATTA AAAAAAACGTGGCTTGTAACCTTGGCTTCTCCATCTGTCAAGTGATGACAATCAATGCAGCAATAACTGCACACC TCTCTCACTTCCTCTCTCTTGATCAAGGAAGACTGTGAGAATTAAGAGTTGATCAGAAAGCAAAGTAACñATGAT ATGTTACAGAGGAGTGAGCTTGCAATGAGATCTGTGAAATTTGGCAAAGCACAGGAAGTGTCCAATTCAGCTAGA TTGGATTGGTGC TAAGGCAGTTTGAG CTTGAGGTTGTC CAGAATCCTTTCTG CTAACGAATGCCAACTACAAGGG GAATCCCTGCCTCACCGTTTGCATATAGATTTAGGAGGAAATGCAGATTTGCTCATTTGGTTTCTGTGCAATTTT AGAGTCAAAATGAAACTAGAAT GTGAATATTAGAGTAT GTGAACCTTCTTAGTATAAAGCTGGATT CCCCTCTGA CTTCTGTAGCATTTAGGAAATTGTGCCTTATAATAGCAGCAATCAATAATACATTGTCCTTTCATCAAGTAACAA
c a a t g a c t g c t g a t g c t c a g a t c c t a g t g t t c t g t a g a a t c t a g c a a c c c a t t g t t t g g a t a c a a g g g t g c a t g a ATTAACAATATTCACTGCTCATTTTCTTTGTGTTTGCATTCTAGTCTTCTGGAAATGGATTTACTAAAAAAATAC TTTCCAAAAAATGAAGTGTTGATTTTTTTTCTTTGTCAGGTAAGAGCCATAATACATGTTCATCATTTTTAAAAA ATTGGAATAG(N)xGATGAGAAGGAAGACTATGCAACCCAAACACAATCATTGTTGACTTGTATA(N)xTTTATA AAGTTTGCAGTGAAAAAAATATGTAATTAGGAATTATGTAAATGGATGAAAATATAGCAAAGAACATTCTCCCCA CCATCACCAAATGC (M) xAGACTGAATTGTTAGGTATGAAATACAAAATGAGCGAAGATGCTCCCAAAAACAAAA TAACGAAACAAAAATAAAAACC CAAC(N)xTGATTCTACAGCAATCTTTTAAATTTTTGAGTAAAAACTCAATTG ATATATGTTTGTATCTTTAAAAACAAATTATTTTGGAAATCCTGAAGCC(N)xGACAAGGACTTCTTTTAAAAAG CATAGCCATAGCC(N)xTCTAAACGTGTCCGATGAGGGTTTTTTTCCTGAGTAATTTTAAACCTGTAACTTCACA a a t a t t g a t g t g c t t c a g t c t a c t g c a g t t a t t g a t g c t t g a a c t g t a t t a t c t t t a c c t g g g t a a a c a g a a c t t CCTCCATTTA GTTATTGGGTACTTGT AACACT ACCC CATC AT ATTCTAGG CT CAAC CTGTTCTATGTAATATTTT TAATGATATCTCTCCATATTATCTTTTGTTCTATTTTTCAACTAGCTGACAAACAGCTAGGAGGCGTGGCTGTTG
( N) XAAATAGTAGTTATTTTCATTAGGTTTAATAAGACTGCTTTAGTGTGTCCTCTTTCTAAATTTTTCTAAGCA TTATTGACACAGTTGTGATTATAATTTACATTTATTTGCTTCCTGACATTTTTTCTTGTAACAACATACACATTA TATATGTTCCAACAAACATTTCAGAAACACAAATTCC (N) xTTTGACCAAATTTATTTCCCCATGGTTTGTAATC GATTCACATTTACGTTTTTAGTGTGGACTGAAAAATGTTTATCCACTGCCAATTATTTTTCATGCTCTTTTATAC AATCCGATGACAAAATCAAGCCATTGAAATTGGTCAGCGCTATTGAAAAAATGGTGTAGTATGGTCACAGTCTCC TAGCAGTCTATGTACAGCAATGTCAG CC TAGCTTCGTT GAGT C AAACCAGGG AG ATG G AGGGAAATTTTGGCTGC CACAGCCTTGAGTGCGGTG(N)xCCAGCACTCTGAGTAATCTAGTAATTGGGTTCTGCCCTCTCATGAGCTGACA GGAGACCACAATCACGTGAAACAACCACCAñAGATGGTAACAGCTCAGCAAGAGTGACTTGGGCTTAACAGGTGC AGAGTCCTGG CTTG GAAAGGGAGC TTGG CTTACCAGGG CT CTGCTGCACT CTTGAA( N) xTCCTCCCAGAAAGAG CAGATTTTTGCCACT(N)xCATAGGTTCCTGACATTCCTTTGAGTGAGAAGCCTTGATCTGACAGGAAGACAGCT AGAGAGTG CTTAAC TGTTGCTT CT CCAGAGTCAATT CCTTGTGG AAACTT TCTGTC CCTGCACCTGTGAAATGTG CTTAATGGATTTTGGAGACCTACGGTAAGATGGAAGATGATAGTCCTCAAAGGCCAGGAATCCCAAGGTGTATTA CTTGAGCCACTCCTAAACCCAACAAAATTTCCCTTCTAAATTGCTATCCGCTCCCAAATAAGTATCTTAATTTAG CTAATTT(N) xGTGCATTTGAGCCACCTTTGCCTCCTACCCCTTGTTTTGGGTAGGG AGTGGGTTTCTTCCTCTT GTGATGGGAATATTTCCATTTACCTGTGTTATACAGTCTTGGTTCTGGGGTTTGCATGCTGAGTGTAGCAGGAAG GGAGGAGGAAGCACTTACTTCTAAAGGGCACAGAATAGTTGCCACCTTCGTTAGCTATTCCTTTTCTATTTCCTA TTCAC
> H s 5 _ 9333503 - 9342335
GTTGGCATTTGGAATAATACTTGGATTCAGCACAGAATACATATCAGAGTAATTCGGGATGGAATCACTTGCCAC CCTTAGAATGAACCTGAAACACTAAAGACGAACTAATT CATTTGGGAAATAAAAATGTAATCAC CCTC CCTTGTA TCTGTTGCAACAATTCAATCTGCAGTGAAACTTACACTTGTATCAAGTGTGTGGAGTGTTATCACTGCCATTCCT TTTATAAT CTTCTATACAAAAACC AGTCAT AATATATT CTTT CATTTTGAATTTTTTATACCAAGAAACT GATCT TAGTAGGATTTGTGACAAAGAAGTATGCAATCATTCAACAAATATTATCCCAAGAGTAGGAACAAATAGAAATCA AATAATG C AAGG AAAT AACTGAAATG CTTCACTGTGATTATCTCTGAAATTGTTTTTTTCTTTATG CTTT ACTGT ATATTCTACATTTTGTAAATGAAGATACCTAAGTTTCATTACTGAAAACACAAATATTATTAACTTTTAAAGAGA CTGGGGGAAAACAAGATATGACTGTTGATGCATTTAACATAATACGAAAGTTCATGTGCATGCTGGAAATGAAAA TGTATGACTTCTTCCAGAAATTCCCTGAATTCAGATCCAAGTACCTTCCGATTCTCCATAACCACAAATGGCTGA AAGAGAGGCTTTTCTTTTGCAGCCTAACGAACTTCCTTTATCAAAGAGTACAAAGAATCTTTAACTGTTTCAATT TCAGGAACATACAGTATCACTAGTATAAAGTGAAACCATCCAATAAATGGTTCCACAATGCCTAGTGCTATGAGA AAAAAAAATCGACTATTATTCTATTTTTTCACAACTCATATTTTTAAACCATTACTGATTTCTGAAATATACACC TGTTATCTAAACTTACTCTTGCTAAATCAAGCTTTGCCAAACAGGAATCTGAGGGCTGGGTTATTTTTAAAGTTT TATATCTTACTCATATGCCCTCTCAAACTTTCTGAG CCTAAT CCCATGAAATTT CC CAGGTGAAATTC CT ATTCT CTTCTTTAGGTAGCGTCTGCCTGAAAGGCATATGCACACGTATGCGAAACCCATGAAGACCCTCAGAGCTTCCCT AGGGC(N)xGATAAAATCAGGCTCTGAAACTGGCTTAGAGTCACTTAGGTAGGGGCCAGCCTCTTTTTTTTTCTG TTATTATTTATCAAGTGTCTATAATG TACCACACA CTG CT CTAGAATAAAAAACTACAGCCCAT GCTC CCG GGGT CACCAAACCTTTGGAACATACTTATGATAAACATATAAAAGGTTGTTATTTGGAATTCAAATTTAACTGGGCATC ATGGATTTTTATTTGCTAAGTCTGGCAATCCTATGTATTCATGTCCACCATGCCATACAGCATGCTAGGTTGACT CAGCGGCCCATCCGATCCTTCCCAGCAGACCATGTTGATATGCAATGGATGGAGACCTCCAGGAATGAGGTCTGG GATGAGGACT CCAGGGATATTGAATTGCAGATTCCAQGG GCC CTGGGATGGAAG CCACAAGACACATTGGGATGG AGACTCTAGGGACG CTGGGATGGACACT CTAGAGACATTG GGATGGAGACGACAG G GATGCTGGGGTG GAAGCTC CAGGGAAACTAGAAGACACAGAGATAGAGACTTCCAGGAATGCTGAGATGGAAACCATTGGAGATATTGGACAGA GACCTCTAGGGATACTGAGAGAGAGAGAGACACCCCACTGACACTGGAATAGAGAACTCCAGGGACTCTGGGATG AAGACCTC CTGGGACAGGTACC CCTTGAAAACAATT CC CAGCTGCCATCCACAG CCAGAACAGGAG CATAAGACA TCTCTTCAAAATACCCTAATTTGCTCTGTGGCTTCTCACATCCCTTAGGAAAATCTTGAGTCACTGACCAAAAGC CAACCACTTGCAGAGAATGAAGCGCTGGTCTGTTCACACAGCACCCAGGACCTCCAGAGACAACAAGAGAATCTC ATGAGTCAAAGT CATAGAAAAG CC TTTC CAAAGTTCTTTAGAGATGAATGAGGGATGCAAGACT CAAAACACTAA TTGGGGTCACATTTGTACCTGCCCCACCCCGGTAAGGGGTCTGTTCCCTTGACTCAGAGTGAAAGCTCATTTTCC CTCAGGGTGAGCTTTGGTGTATACAGCCTGCCACTGGCTCATGCCCTTGCCAGATGTAGGTGTCTGAGATAAATT GTTATAAT CC CAAC CCCCTTTT CC CTAAGTGTTCAC CTGGATGCAGAGGGGAACATAGACAAATT CAACT CTGGA AATAAATGTGTTGAATCAACAGGTATCTTTGAGCTTCTATAAGCAGGAAAAATC(N)xAATGAAAACAGGCAAGT CTGGAAGGGAAGTGGGAATGCAGGGCAGAAGGGCTGGC GTAC CACACCAGGGAGACAGCTTATATC CAAT GGGAG CT ACTGAAGGTGGG CATTTTTAAAAG ATGATGTTGG CTGTGGTATGCAGAAGGAG GAG AACATCTGGAG G CCAGG AGACTGATTTAACACCACTGCAAAACACAGGCCGAACTCCAAGTGATGAAATTTGTTGGAAGTGAGCATCAGAAA TCCCCCAGTATTGCTACTGGACTAAGGTAAGGAAGAGAACACCAAGTAGCCTTGTCCAATCCATACTCACCGCCT CCTCACAAACGT CAAAAGAGGTGAAGGT AACTTGCT AGGTGC AATAAACT ATGC CAACTTCTTC CT A CTC ATGT A AAATTAC AAC C ATT C ATCACTT C AAGTGTTGAGTGG ATGT AC AAGG ACAC CT CAAGATCC AAGAGC AAAAGC AAC GATCCTAGAAACTGTTCAATGTTTATCTAGAGTGCCCAGACTATAAGCTAGATTTAGCTGCACTCGTGTTTATAG AGAGACTG CAAAAC CTCTGTAG CTATGG CAGGCATG CTTTATACTCTGGTACATTTGGACAAAAAATATAATGAA CTCTCCACTGCCCAATGTGCCATCCTTAGATGCCAGAGGAACCCAATTCTACAAATGGACCCTTTCTTAAACTTT AATGTCAATTTTCCTAAAAGATGGATATATAATCCCAAAATAGGTAATTTAAATGCAAGGATTATGCAGGTTTTC TAAGGAAGTGATAGTGAAAACAGGGGAAAGAACTCAAATTTGTTTGGATTGTGTTCCAGTGGCCTCATTGTCAAA CTGGGATGCAGATGTTGACCTAGATACGGGAACACTCACACCCTTAAGAGATAAAGATAAAAATAACTTCGTTGA ATCTCAATTTACTGGTATATGGCCTAATCAAGGTAACAAAGAAAGGTGGTGGTGGGCCTTGGGGTACACATCAAG ATCTCTGGATCTCTAGAATCCTACTGAAG(N) XTACTCTAACGGGCAGCCATCAACATCTTATTAGCAGTAGTAT ATTTTATGACACTCATACTGTATTTCATTCTTATAAGAAATAATATTATTCAGCATTCTTCAGTTTTGTCAGTGA AGGTCACATGTAACTGCACAGATGTAAAATGAAGTCCAAAAATATCTGTGGTGGCAAAATACAATATCTCTCACC TTGCTCCTACAACAAGTTCTTTCTGTCCTGGGTCAAATGTTAACTGCGAGAAATCCACAGCATTCTTCGCTCTGA ACTCCCGTAACCAGGGGCCAATTTCTAAATAG( N) xCCTCTGGGGACCACAGATAGTGCACATCAGGTAGCAGAC GTTACATTTCAGAGAGGATTCGGAGGTC CCTCTATG CCAT CTTTCAGAGG C CACTGGAGGTTTC CCT CAGGATGC
c c c a t a a g c c a t a c t a a g a a c t t g t a a a g c a c a g a g g g c a g c c a c c c a a t a a a g a g c a a g c g a g g c a t c a a t g c t TGGTTGTGATTTACAAGCCAGATGTGCATTTTCCTTTTGGCTTTGGGCTGGGCAGAAAATAAGACAGCATTCACA TG ATTCAT CAGAGG AATACTGAATGCAAACAAGAC CATTT CCTGTG CAACTTGñCT CAAAAG AGCTGC AGATTTT TAAATATTACAAAAGCCTCCTTAGGAGCAAGGACCAAGCACACAAGCATCATTTGCCTTCTTTCAAATTAGATGT TCCATTTCCCTGTCCTTTGGGTTTCTATTTTGGCCCCATGATTAGACTTGTTAAACACATTCCGACCTTTCTAGA TG AG AAAT CCACAGTTACTT CT AGGCTT CATTTG CATTTT TTTTTCTAAGGG AAAT CAGAAAATAGCC ACTCTAT AGAGACAGGCTTTTACAAGCAC GGTTAACTCCTATTAGACAGCCTC CAGC C CAGAGTC CGATAATTATAAAATGA CAGTATTTGAGGAGAGATAGGATGCTGCTTCAATGAGCCATTTTTGCAACTTTGATTAGCTCATTTTTAACACTA AAAAATATTCAGTGAGTTGC CCACCAAACGTGGGATAC CTTATCAT TATCAT TGACAATTAAC CATAACATCTGA TTGT CGTTTTTTCT TGGATT TAGTATTTTGAATAACAC CACCCTGACTAACCAGGCACATAA CAACTGTCAAAT C AATGAAGATG TTGGTTTGAGGAAAAAGTGTGT GATACAG GAAAGAATCTC CT CCAACAAACG CTGATG CTAAACA CATCAGTGTTATGTTTTTAATGGGGGCATATGGAAATATGCCCCCATTCCGTCTCATATCCTAAAGAAATCGCTT TGTTGATGTGAATGAGAGGGACTATTTTTTCAGGAATGGTTCTGTATAATAAACATGTCTTTTCTCTACTAACAG TT ACG AGGGGGAAGTG ATAT C AAGTAAAATTT AT CTTGTG CTTAAAGG AAAC CAGGGGTGAGTCAT AT AGAG AGG GACTTGTAATGTATATGATAGCTTGATTTTTTTAAGGATTTTCATATAAATTTCTACATTTTATAGGCAAATCAA TTAAAATACGTAGAACAAGGTATGTACAAGGCAATGGAACTATGAAAAGTTATTAATAATCCTTCCAGGACTTTA TAATTCAGTTAAAGAAACAGCACTAGTATCTTTAAACAGGGTGTCTATCAATGCAAGTGTTTATAGGGGCTTGTG GGGTTACAGG AATT ATT CTT AT ATGAAAAGAAGAGATG ATTG AG AG CTGCAGTAAAAC AAGAG GAAAG ATTTGAC CAGAGAGTGACAGGAAAGATGT C CTAGAGGGG CAAAGCAAGACAGCAACATGGGAGGC CATGGTGCCC CCTGTGG CCATAACTGGTATCAC TCAT CTTCACTC CATCAATGCCTC CATT CC TGGG CAAAAGAGAACC CAATGG CTCTTG C GT CTGCATTG CTCC CTGCAGTG CATG C AGCAT AC AAAACAGC ACATTG AATG CG AT AGGAAATAAT AAATATGTG AAAT GAAG TAAGAACAAGTAAGTGAGTT CAAGAGAGAAATAATC{ N} xGTAATAGCAACTTAAGAAACTACTAGG AAAGGAAACTGAAAG GTGTATTTAAATACAAACTGTAGAAAAATTCACTTT CAGGT TAAAAG GTTTG GACTTGAT TT CATATGTGACAG CAAAGCTTTGAAAC GTTT T CAGGAAAACTCTTGACC CATTTGAG CAGATGTCTTTGGAATA TCAATGAG CC CTTGATAGAAAGAATGGT CTGGñAAGAG TAGAGAATGGAAGGAAAAGTAGTATATTTTTAAAAAC CTAAAAACCTACAATTTAGGTATTGGCCTTGAAAATGAATAAAGAAGATGTGGGATCACTGATTAGAGATGAAGT CAAGGTAAACAACTTAAAAAGAACAAAACAGTTACCAAATTAAATGAAGAAGAAAACAGTGATGACTGAGACAGG CATC AGAGTTTGAGGT GGCTTT AGAG AAAGAT AGTAAATT CAG GTG CAAG AATGG G AATGTTTCAGGTGAAAAAT TT CCAATG GTAGGTTGGCCCACAGCCAT TGTTAAA CAT CG CTAAGACCTAGAAATGTC CATAAAGATGTTCTAG T TCAGGAGGTTCATCATTCAGATGGGAAACACATGACCAGGGAGCACTGCCAGGACCAGGCTCCAGAGTGAATATC AGGGGCTCTGTTCACCACAACTCACAGCCCCCAGAGCTGGGCTCACATCCAATCTTCAAACCACATGAGGGAAGA ACAGAGGGCCAAGGGCTGACCTTCCTGCTAGGGGAGTGAAGAGATGAGCCTCTAAGAGGAAAAAAGACTACGAAA GATAGAAG AAAATC AGGTTACT CTTTTC CCATG CATTCATCTTT CTTTTT ATTT AG AA GACTTGGAAG AAAGTGG CCAACAGGAACCAGGCTAAAGAGACATACATCAGGTTTTGGTCCCTATACTGAACAACGGGAAGGCAGTTTTGCT ACTT CCTAAAATAC CCTGAAGTAATAAAAGGAGT CATGTA CAGAGGAAAAAATGAG TTAACAGCATGTAGTTCTG TC AATATTGTTCAAGTTTCCTT ATTTGAAATT CT AATG AACTTT CC CAAT AAAACTGTGTATTTTT ATGGTT AAC ATAGATCAAAGTTGAGTAATCTGCATATTCCCTTCTCTTTCCTCCAGAAGGTTGGCCATCTCAACTTTGCATATT ACATTGGCTTCTATCTCACTGATACCAAGAGAGCCTTGCAAGAAAGTCTCTGCACCAACTCACCTCTTGAACTAA AAAGAAGCCTGCACCAATACCTATCCTCCCAACTTTCCCAACAGATCAGAGAAAAGGCACTCAATCCTCTTTTCA AAGACCAGTTGCTCTCATCTCTTCTTTCCCCTTTAATCGCTTCTCAATTACCTAGGCTTTCCTCTAAAATCTCCA ATGTTACCTAGTCC CTTTCCCT CTGC CTATATGTACAT CCTCTCCTATTT CAAAAAGCATTACATATTTTTATTA CACTCCCTTGTTATTATAGCTTCAAAAACCCCAATAAAATGCATAATAAACTTCCAAGTACAATATAGCAATATA TCAAAAAAAAAATCACACACATACAAA(N)xCACAATAACAGATTTCAGTAGTAGCAGGAAAACGTCATCAATAC TGCTTTCTAAGGAATCAGAAAATGAACAGTTTTGGAAATAGAAGATCTAAAAGAGTTCAGGGCAATTGGTCACAA ACTAGATGCCAGAGGAAGTTCGGGGAGAGTGAGGGAGCAGGAGGGGGTCCACAGCAAGTGGTGAATTGAGGGGAT ACAATTTTCAATAACATTAAGATGCTCATTGCCCCCAGTGAGTTTACCTTTTTCCTCCTATCCCTATTTTCTGAT ATTGCTGAAATAACCATATGATGAAATGGAGCTTAAATATGGCAGCCAATATGAC(N)xAAATTGGAAGTTAAGT TATAACATAAAATTTGTAAATTCCAACTGAACACTTGATAGATAATTCATAATATTTGAGGTTAACTATCCTAGA TAAT C CAATTTTTACACATATATTTACGTGAT TGTTGAAATATG GTCTCCAG CAATGATTTTTTATTG CAAAATT ACATTCATTGATTCTTAGTTGC CTTCTC TTACAG CTACATAAGC TCGCATGTGATTGGTGTATAAAATATTTTAT CAAAAGTATAATTCCATTTTTTTTAATAATCCAGTCTTACAACTTCAAGAAACGATTTATTTTATGAACAAGTCA GTATTAATAATTTGGTCCAAAATACTTGACAAATATAACTAGAGATAAGCTAAATAGATTCAAGTTAATGTCCCT TCC (N) xAGAAAAAGAAAGGGAAAGAGGGGTAAAAGCTACTCTTTTAAAAAAAGCAGACACTCTAAATATAGCCA AGGTAATGATQTTTGTAAAC CAAAACAACAACAG
> H s 5 _ 9729109 -9741877
GAGAGAAAGAGCAGGC AGGAAG AAAAAC CAGC AT CCAGCTGTGC CACAGATGTGCTTC CACCTCTGTC TTGCCCT CTCACCCTCGCCTTTAATCCCATTTTATAAAAACTCATACTCAGAGGTTAGCTGGGTTGGACCCCAGTCTGACCA ACCCACAGCCCAGGCTGGCCACAATGTGTGTGTGTGCTTAGCAGGGGGGCAGTTTTTGGGGACTGCAAAAGGAGG GTCTATAAAGAAGGACCTCATCTCTCACCCCCTAGGGGCAGCCTCTATGGGGGAGGACGATGGTCCCTGACATCA TGGG ACAG TC ACGAATGAGG GT CTGT CCTGGGTCAGGCTG GTGGGC AAGT TG ATCTGG CCTCTCCAC AAGGCTGG CTCTTGCCTCCCCCATTAGTCAGTAGCACTGCTGCTCGTGGCTCTTTTTAGTTTCTGCCTTATGACCATATTTGA CACACTGCCTCCAACCTAAGGTCCCTATTTATTCTTGGGCACACGCCTTGGTGACATGCTACCACTGCCCTCCCT GC C CAGAATTGTGTTTGGCAAG CTGTGGAGAAGCTTCCAGAATC CAAAAGAC C C ( N) xTGAATGGAGACAATTAG TTTAAAGCAGGCACGCGTAAAACCGGAGGCAGCACTTAAATAAGGCACAAACATGCCTTTAAATGTTGTTTTAAA GTCAACCTTTACTTTCATCCTCAACGACTTAATTGACAGAAACAACAAGTGTATCTGCCAATATGTGGGCCGGCT CAGT GACC CATCCACTGAGAAGTAGAAAGCATAT CATGTTTTGAGTTACAGTTTATTGAG CATT AAG CTTTCTTC CTGTTTTTTCTCACTTTTCAGAGTGATATCCACAGATATTTTCCTATAACTGTGTGTAACATGACTGTGCCCAAC GTTAGACTGGTGGAATGGTAGATAAAGGGGGCAAGAATTCACAAAAGGGAAGTCCTGGGAAAGGAAATTGAGAGT TTATAAATATATGAATGCCAGCTCATTTTTATTTTTTAAATACTCTGTACAACACACCAACATAAAATCATAGGA TATAGTGGTGATTGCATTCTTATGCATATATCTTTGGGTATTAATTTTACGAAAGATATAAAGATGATTTTGATG CAGATGGATT CCACATGT TACTGAGAA CATGAAAAGGAGGAT CAGACGGAAG CAAGAGAAAAATTCCACCTTAGG CT CTTT CTTGAAAGGTGCAGTTTCTC CACCATTGA CAT CT CCTTAGAC CTCACAGTGAACTGGCAAGTAGGT CCA GGGAAGAGATTTCTATAGTATC TTGG CTTTGAGTAAAAGAAAATGGAC CCAAAAGC CTCATCTT CACAAGACAAC CTTTTGGAAATAGATAGAAGGCTTGATACCAAATGTGCTTGAGGTGTAGCTCCAAAGCAGCAAATAAAGGCGCTG CCACAC CTGTGCAGGTGC CACACCCGTGCAGGTACCAT CCAGTGGGGG CCTGGGGCATCG CATGTCCCAGAG CAC CCAGGCACAGGGAGGCTG CTCAGCAGTAGGATGTGTGG CTTTTGAGGA CTCAAAGA CACT TAGGGGGT CTCTGCT TCTGCATTTCCAAGAGGACAGATAGCTGGGGCAGGTGCACCACCCCACTGAGACACCAGGGCATGAAGAGTGGCA
( N) xTAGCACATAGGTGGACTGAAATCCGCTGTCCCCCATGAGTGTGTACCGACAGGTATGCCCTCCACCAGCTG AGCACACCAGCACCCACCTCCCCTCCCTGACCTGCTCGCAGCCAGCCCACGCCCAAGCCCTGCTGACATTCATCT CCTTTATC TCTTAG CCTGACAACTCC CCTCCTTCCCCACCACACTC CACTCAAAGCAACGTCAT CTATCACCTGA ACCTCCTCACTGACTGCTCTGCCTCCACTCCTGCCTCCCTCCCACCGGCATCTGTCCCCAAGCTGCATCCAGACT CCAGGGAACACAAACTCAGCCGCGGCACCCCCTTCGCTGCCTCCTCCTCGTGTCTTCTGTTGCCCTCGTGGTAGG GACTGACATCCTGATATGGCCCATGAGGCCCCCTGGAGTCTGCCCCCACCCAGCGGCTTCCTCACCATCACAACG GATT CC CC TCTCTGAGCTTCT C CTAG GTGG CCAT C CGC C (N) xG CTTTTACAAAT CATGGTAAC CATGGTAC TTA GTTTAGTTGTTGACATGTAGCCAGCACTTTCTTATTCATTTGCTAGTTATGGTTAA <N) xGTGGCTTGGATAGGG CAGñGCCATGGACAAGGGGAAGATGGAAGGAGATCTGTCCCAGAGGCAT(N)xAATTCTGGCTTCCATAGCAGGG TGGATG GTGG CTCATTTACTGAG GTGGAGAAGA CTATT GAAGAATG CATGGGTTGATGGATGAATAAACAAACAA GCAAATATGTAGAGATAAATAGTACAATAAAGGAGACTGGGCCTTCAGGGTGAGATATATTCATAAGGCAAAAAA GCTT CCATAT CGGT CAGG GTTCTGCT G (N) xACTCTAAATGATTGTTCCGTGGAAACCACTAGGCTTTACCAATA ATATAACT { N) xGTGTCACTTCATTTTGTGGAATTACT CAGG CAAAGT GACTGCTG CTAG GCAATAGT CAAAGTC AAAATGGTTAATGGGTCTGAAACCAGTGGAGATCTGGGAAAATTTTAAAAGAATAGTCAAACATGACAAGAGGTA T CACTGGGGAAAAGAAAC CCAAAACATTAG CATT TGGCTT CCAAGCAG CTCACATT CTGG CCTTAGACAACT GTT ACCTCACTGCTCAGAAATGGAAGCTTGAAGCTCGTGTGGAAAACCGACCTCCTAATAAATCTCACACAGAGGATG CAGCAATTTGTTATGCGTGAGAAAATGGCACTGGGAAAGCAAAACACAGCCAGCTTGCTTCATATCACAAGTAGC AATT GGGC TCAGGTTTTCACCTAATTTTTAAGCAG CTTAT CACCAC CT CCGAAAAGAAAAAAAAAATT CTAAAAA TTGCATTGCTTTTTTTTTTCTGAGAAGAGCATATTATGAAGTGTAAACATTAAGAAAGTCATTTCCCAATATAAC TATTTC CAAG CTTT GTCTACCACTTG( N}XACTCACTACCTTTAAATATTTTTGTCCAATCAATACAGCTCCCTC TC CC TTTTTTGGGGAATAAAAC CTTG CTGAAATGAACGGACT CTAATATCTTGATT TATTAACC CACATCTGAGC CAAAATGATGTTTGTTTGAGAATCTACTTAGTACTGACTGAAAGAAACTGTACAAGAGGTTTTTTTGCCAATGTG TGAGAAAATAGATCAGAGATAAGATTAATCTTTACAAAAGACAAAAAGCTTACAATGCTTTGCCAAATAAAATTA ACTTTCATAATTTG CCTACTATGTATTTAC TATG CTATGC TATC TGTT TAGCATACAAAC CCCCTACAATC C CTA CAGTAATC CTAGATG GTG CCTACGTTTTAATCT CAGGTTTATAGATGAGAAACTAAGACAACCTG GACTTAAAGC CAGG CCAC CTTACT CCA CAGGC CACACCCC CCAC CACGGT GAA CACTACAGCTGTGATAAGACAA CAGAAAATTG CTTATCC(N)xTTGACTCACCTGAATAGTGTTCATTTTTATTTAGAAATGTAAC(N)xTTCTAACATCTTGATTT ACTAACCTACATCTGAGCCAAAATAATGCTTGTTTGAGAATCTACTTTAAGTAACAACTGAAAAAAATTTATGAG TTTTTTATTGCCAATGCGTGAGAGAATATATCACATACTGACAAGATAAAATAAGATTAATATTTACAAAAGGCA AAAAGCTCACGATGCTTTGTCAAATAAAATTGATAATTTCCCTACCATGTATTTACTATGCTATGCTGTGTATCA AACAAAAC CC CCTACAAT CCCTACAGTAAT CCTAGATGGTGC CTTCATTTTAACCT CAGGTTTATAGAAGAGAAA CTAAGACAAC CTGGACTTAAAG CCAGGCCACCTTA CTC CACAG G CCACACCC CCCACCATGGTGAAAACTACAGT TGCGATGTTAAGATAACAGAAAACTGCTATTCATAATAGTAAATTATATACTTACATCCCTCGACTCATCTGAAT AGTGTTCATGTTTATTTAGGAATGTAACTTTTTAAAGAATGCAAAGTTTTATTTAAGAACCTGCAAAGTTACCTC ATATAACAGCAATATTCCTGCCATTATGCTGCATATGGAGATAGAAACCTATAACAATCATCTTTCCCTCTAACT TGCTTCCACCTTCT CCAGTGGGTGACAGAAAAGA CAGATAGGAC CTGTACATCCAGCTTTCTCTGCTACCCCTTG AAAAAGCTCCATCTATTCCAATCAGTGGGCATGCAGCTTCCTGGTTTTCTTACACTTGCTGTTACTCATAGAGTA TCTCGTGC TGTTGTTTG CATTTTTACTAGTTTCAG CTTATTTTAGGGTTTCACTTAACG CACAGCTTTGTACGGA AAGTCATAGATTCCACTGCTCCTCTCAGTGTTTTACACAGGTTTTACTGAAACCCAAACCCCTGCCTAGCATCCC TCTTTTTC TT CCT CACTT CCCATGGCAAAGTCAGAATTTATT CTTGAGATTT CAAT CTTCTCAG CCTTTTAGCTT TCCTGGCTGTGGGACCGCAGAAAGCTAGCACTGCAGCTATGGTCTAGCAGGACAGCGTCCCAGCCTTATTCC{ N) xGCATGCTGTATAACATACCCCATCTCCCCTCTCTATAAGGACATTTCTTTTAGACTATCTCTTATTTCAGTCCT GAGCTC CACCTTGT CAAGTATT CTGGATAC CCATGAGGGCTTAG CAAAAGGG CCTATGTCTGTCACC C TGGATGG GAACTAGCCACTTCAGGAAAGCTCTGTGATAGCCATGTGTAAGGTCATAGTTAGAAATGCCCCCACCCTTTCTCA TTAAGTTCTCCCTACCCTTATTCACATTTGGCACTTTATTTAACTCTTCCCATTCATATATTTTGAGACTTAGGC TGAATTTAGCAACTTTACATCTTCTACCCTTTCTCATTGTGCTAAAGTCTCCATCAGGGTCATGCAAATGGGGAA ATAAAGGTTGñGCAAATGGGGAAATATTATTCTCCATATGGATTGTGAAAGTGTGCAACCTATACCACTGTCAGC AGCAGCTCTGGTCTTCATCCTGCCCCATTGTTTTCTGAAGGTGCACAATCCAGCCTCCTACTAGAAGAACCCAGA CT CTTG TTTC CCCATTCACAAT GGGGGTGTT CACTTACACATTAGGT CAACATAGTTG( N) xCCACCGACTCTCT GTCTTCCCCTTCTGCAGGTCCAGCCTCCAGGGCGATGACAAGGGTGAGAGAGAAATATGAGCCCAGCCTGCCAGG TCAGGAACAAGGAGAG CGAGGAGCTG CCTTTCCTTTTCC CTGGCTCCT TC CAAGTG GCAGTGTGGACAAACAGAA CAGG CAAGATGCTGTGGATGTGGAGCTGAATGATTCAT CATG CACATCAATT TAAC CAGAGT CAGT TAGCAGTAT CCCAAAGCTCTGAAGGACAGGGGCTCAATTAACCTTTTAATACACAAGTCAAATTTCTTGTCTTCATGACTTACT TCTTTACCAGCAAAAAGCCTCCAAAACCCTTCTCAGATCTTTGTCCAAGGTGAGGCTAAACTCTGTCACTGGTGG GG CTTC CTCTAATT CC CTGTGTTGTT CTGT CTGCTGTGATGTTATCACTCTAAGAG GGAGGGTG CATTTTGAGGA TGACAGAGAAGGACACGTATGATGCAGTGTTAACGCAGACCTTTCTCTCCACCTGGATGAAGAACTACGTTCTCT GG CCGC CCTCATACTTGT CGTTAAAAGAACGTAAATTATTTCACAATT CT CT CC CCTCAAACACAC CCACTATTT CT CTTT CTTCCTGCAAGAGCTGAGTTTTTTTCTCTT CAG GATAAACTAGCA CATT CTTTTAATT CC CAGGAAATT ATACTTTTCCAGGGATGATTACCACCCATTCTCAGAGTTTATAAGTCCAAGGGTAAGGCCCTGATCCAATTGGGG CTGGAG CTACAGATGAGAATGAAAATGACA GACCAT CT CTGTGTGCTC CTGGACTGTGAGAGGTAAAATTTGGAA TCTGACAGCTGTCACGTTTCCTGATAAATGAACCAGAGAAGTGGAGGGAGAAGGGAGGCCTGGGAAAAAGGCGCT CCTC CTAGGCTT CAAGTC CTGAAG CCAAAG CACATT CTTTACTTTGAGTC CCAAAAGGCAACTCATTñTTATTC C AATAACACCACCTTTGTTTAGTCTGATAGGGGTACGGTTTCTGTCACTAGCCAACCTAGCGATGATTATTACATT AT TTAAATTAAACACC CATTTCTATTTT TGGGAGGC CCTCGTGGGTGATCAACCTG CTGATTTATGAACCAATT C ATGC TACGGTAGTGATGGGGAATAGCAGTTACCACACTAT CTTTTGTTATAC CAAC TTTG CCTTGACACCCAATG TTTAATGTCTTGACCTAAAGTAATGACTAATAAACCCCAATGCCAGCAAAGAGGAAGCCCACAGAGAACCAGCAT CAGT GCAGATGAGT CT CAAGGCAACACC CAAATATC CCAC TAGGCCTCTCTCTT C C TGCATACACATATGTTAAG GCTTGAATTTAATTTTTTTCTCACACTATAAAGCTAAGCTGTGTCAGTAATCCCCATGCCAACCTGCTGGAGAAA AAGT CAATCTAACAT CAT TACATTTTTCTG GCTATTTTAATT CAG(N)xCCACATTTTCCCTAGTACCTGCCTAT TTCTAGAAAAAGCTAACTGGAAATCAGAAACAAAGAAGACAGCCTTTAACTGCTTTATTAATGGGAGCTTGGTCA TGTCATTTCCAAGTTCTTCTTTTTCCAAAAGACCCAACTGATAGTCCCGTATTCATAAGGCACCTGCTCCCTCAC CC CTACAAGTTC AAAT CCAGTTTT CTTT CC CTTTTGGAGCTTTTCCTGGTGAAATTATAG CTT CTCATAGTGAAG GTGAACAGCGTGCTGGCTTATCTGGTCCCTCCGTCACCTTCCCCACCCCTCATCTGCCCTCTGCAGGTCAGGGTG TCAACAGTCTCCAGGACAGGTGTTAACTAACCTACCCCTGGAGGCACTATGTGTTGGGCAGTGAGAAACCCAAAC TACATT CCAGACTCACT CATTTGAAAATGGAATTAATAAAAC TGGGAATTAATGAATAACAATATT TTAGAAGGA GT CACAATTGCAAGAT GC CCTGCTAACC CAGCATCCAAAAAACTGCTGTT TACT CTGTCATCTTTG GCAATACAC AC CATTATTACATCTTAGTTGCTTAATC CGTAAGAGGCAGGTAAGATG CCAAGCAATTCC TTGGAGTTATGGTAA GT TATT CATGCAAACACTATATTTATTAAGTGATTCAAG CATAAAGGT CATT CAACGCTTACGCT CTCTTTCTAA TGCCCTTCTGATGCTC CTTCTTTTAAAG CCTGGTGG CAAAAG CACTCC CAACTCAGAACACAGGAGTACAACTTG AATT CAGAGAATTTAAGATG( N ) XCAACTGAATTAGTCCTCCCAGAATTCAAGGTCTTTTTAAAGCAGGATTTAA TTCATATCTCAGTAGCTTTCCCCAAAATTTTCAGCAATTGCACCCTGCCCACCTATTCATGAAGCTGAGAGGGAT CACAGGAGTTGT CTAGTC CCAAGT CC CCAGAGATGTTTTCTGGTGATAGACTGTAA CCAAAACGAC CT CTGGTAT TTATTAAGTTTG CAAAAT CTTGTGTCAAAGTT(N)xATCCTAGGACTGATTTGTAGGCGAATAAAAATGCTGA(N JxATGGATCAGTTTTCAATGAAATGAATAAAAGGCACCATGGTGTAGTTCTCTCTTCACTCACGCACTTCCCTCA GAAT TT CAAAGC CAAC GAGTTAACAAAAAGACAAAT TTACACTGGAAAT CAACAAC CATCTAAG CTGT GGAATTT TT TAGTG CATTGTGATGT C CATTAAAAT CTGTCACC CAATTG CCTTCCAAAACC CCAGAG CAACTGAAACACACA AGCGTTTTTGAGTGGG CAGGGGTTAAG G GG CAGAAG CCAAGAGAGGAT CAGAGACATCTC CCAGTAA CACAGGGG CCCATTTCCCTGAACCTGTCAAAACCTTTGAGCAGGTAACACCGAGGAGTGGGTCTCCTGTGGCTGGCTTTGCAA CTCCTTCTCTTC CGATTATGAAAATñTC CTTCTTACACTG CACCGTAATCAT TT GAATTTTTATTGTTTTCACT C AAAAGCTCCTAAGTTATTTAGATAAGCACCATTAAACAAAAATCTGAGTAATTAGTTCTTTTGATGTGTCCTAGT GGCAGTGGAGGAACTTGAAGACGTTTATTAACTGTTAGATCAGTCCCAGGTGGGACTTATTATAAACAGAGCTTA GCCCCñGAGCAGCCGCT CATTAAATACTGAACAGGAATTAGGTATTTCTCTCTT CC CATG CCATTCTT CATCAAC CAGC CT CTACCACTTGGGTA( N ) xGCCACTAAATACCACAAATCATTGCTTACGTGAAGTCCCGGAGGTGGGGCT AGAACACAATCACACCCTCCTTTAGAGACTGCGACCCAGAGAGAACAATCA(N) xGACCCGCTTCTCCCATCTCT TG CTTTAGTAAATTTT CCATGCCAAGACATTTTCAAGTGC TT GACAGT CACC CACG GATG CTGACTGCTCTGAG T GATCTATGGCAAACCACCCACTCTTTTCCCAGAATGATAAATGTTTCTTATGTCTAATGTTGATATAAAATGTTA TGAAAACGAGAAAATTTGGGGGAT CTTAGAAAATAAAAGAAAGATTTACATG AAATAAAAATGAGTGGTTTTAT C T CG TGTATGTTGGAAAA CAGAATAGAAGTTGATTG G CAT C CATGGCAGAAG GGGATCTGGGTTCT CACTTGTTC C CA CCTGTTCAAG CATC CCAGTGGAGT CCTCAGTCCT CAAGTCTGTAAACCAGGCTGAACAGC CC CAGGTGACCGG GACACAGTGGGAAGTGAGTCAACTAAC
>Hs5_9139816-9160426
TTGAGATATACATGTTATAAGATTTTATG CAGTAAAACTTATGAACAC TTTTCTTTATAG CATC TAATGTGTGTT CCATGTTAAGAT CTTT CT CACTTAAATC TT CCCAAC TTAAAGATTAAG CAT CAAACATTATAGAAAGATTAAAGA TTATAGAAACAT CACT CC C CCGATTC CTTCTGATTC TT CTAAGGTATCAT TTTTTGGCTC CCTAT C CT TTATCCA CTTGAAATGTATTTTT CT CCTCCT CATAAACAAACAAACAAATCAAAAAT CACAAATACATATTAAAAAATCTAA A CAT CT CAAACC CCA CA CAAAACAAACAAATTCAGGTGTTAGAGGAAGAGTAATAGATGT GAGTATGGTTTCTTT TC CC GTTATTCAAATGA CATCCTTTT CAATTACTACAAGTTTATAACAGAGAGT GT GAGCATAAGCATTTTAAAA TCTCTAGCTTTGGGAATATGATTCCCAATAATGTAC( N) XCCAATGATCAGTCTCATTGAATAAAACTTGAACCT GTATTTTTATTGTAGAAAACACATAAAAATTTTCTCAATTCTCAAGACACGCAATTGATTTAATAACACTTTTTT GTGCAG GGAATAGAGAAGATGAGAATAAA C TATCACAAC CAATGACTAAGAC CATGAAAAGTGTGCATTTATTAT CCAGGT GTCAGCAATTT C TTCCTGAGTAAC TTTG CATGTG CAGTGTTGTTCTGTAGAAAAATGC CCAGAGGAAGT TG CTGGAAAT CTGGAGA C CCTT CAAGTG CAA CTCAT GAGACAGACC CGGGTGGGGTGAGC GCTAAGAGTCCTTCT GCTCCCTGCTGTGñCTCCACCTGGCTGTGCCAATGAGATCGAATCTGAGAGTGTCTTCATCTTACTAAACTAGCC AAAAGTGACATATTTATATTTG CGGCTC CTATGG CAAAC C TGAGAGTAATT CTTTTATTGACAAAATAA CATAGT GTATAT CACATATGGGAATTCT CTGTGTATGATT CTTTACACAGAG CAAAAG CATAGCGAGGCTACAC TTAGAGA TCATTCAAAAGGCCTGCATTCCTTTTCGTGTTTCTTATTAAAGCAACATGACCAGTAGTTAAAATTAAACCTCCA AAAAGGGAAAAATTCCACGGGCTCCTCACGTTTACCCCAGACTTCCCATGCAAAGCTGCTTGGAGACTTAAAACA ATGCCACAGTGTTAGAGGGAAGTATTTCAAGCTGTTTTAAGTGGAAAGCTGCATGTAATAACATCTCTTTCTTCT GATGATTCCAATGGTTCTGCACGTTCTTAACATTACACTTATCCTTTAAAATGTTAAAATGTCTTTACAAACACA AATGCCCATTCAATAAAATATCATGTATATTTAAATATAATTTCCTCATGCAAAATATTTTACACCATGGGCAAA ATTCTGATGAGATCTACCTTTATTCTAAATTATGTCTTAAGAAAATCTATTACACAATTTTATTTTTGGAAGCTG GATCTCTTTAAGAAACTGTGGAAAAT CATGGGTTTC CAATTAATGCAG CCATTGTGAAAGTGGCTTTT CTGTATT CCACTTTGAAAATGTTTTTCCTTGTGTGGTAAATATTTTTAATAAAAGTT( N ) xTCTGTTCAAATATAATTATCA ACATTCflATTTTAAAATTTCCATAATGTGTTAATGACCTTACACTATTCTCATTTTACATATTACCCTGGTATTT GAAAGTATCTGAATTTAGATAAAAATGT TAGT CAGAAT CCGAAG GTTTAATTTTTT TCAAAAACACAC TTAT TGA GGGTATTTTTTAGAGCTAAACATATATTTGGAGACAGGCCTTAATTCTAAAAAGCAGGATAAATGTCTACTTTCC TTTCCACATGACAATAACATCAACATATATCTAAAGTATTTCTTTACATTTTAAAAATTTGCAATTACGTTTTTT GTTAGTTGGAGTAACAGAAAAATAAGTTTATTCATTAAAATATTTTCACTGTATTTCTGAAAAGTTGAGATTTAT TAATAAGTA CAATT CCTCGTGC CTATTAGATTAT CTAAGTAT CAGTGATGACTTTATCTGGCTTGTCACTGT CAC ACTAATTATCATTAGGTCAGTTTATTTTTGATATTAAGAATTTTTGCCATTTTGTTAATATAAAATGTATTCTTA CACATGGGATTTTAAAAGTGTTT CAATCATGTAT C A ( N ) xCCTTGGGCTCCACTAAAGATTGAGGGTTTCCTTGC AAGTGTTGCATACTTTAAAGTG CTTC C CAAATACTGTG CAAGAG CCACACTT CAAACAT CAATATTAACTTCTAT TATCATTACAATGATTGTGGTAGTGATCAAflCTG C CTATGAATTAT CC CTTC CCTAATTC CTATGAAAATCTGT C CCATTC CTAAGTA C CATGTTAC CATATACTTCATAAGCTATT CTTGTACTTTAAGAG CTG CAATTAAAAACATTA T C T (N) xTTATCTTACAAAAAATAAGACTGAAGAAAATTCCAAGTAGñC AATAGCCTTCCTGAGACATCCACAGA TCCTTCAATTGGTAAATCCAACTAGAACATACATTTTCATTGT(N)xATATAAAAAAGGTTTTTTTGAGATGGAG ATGCATACAG TTGTTATAATTAGTCC CTAAGCACAATT T CAAGT TATTATA CACTATGGATTTCTGAGGTAGTT G AAAACATTCTATATTGAAAATACTAAGT CCAG CAATAATT CACATTACATGCTATGñTTTGTAG GTGAAATTAAT GTACTTGATTTCAATTAGTGATTAACTTTTAATACTTTTCATGTATTAAAACTACTCGGCTTAACATTAAATAAC ACTGAT CAAATACAATGG CTGTTTTTTACATT TCATTTTG CACT TAATATTAAATTTAAATAA(N ) xGACTTATA CTGGCACTAATATTAAG CATACTAAG CATATG TT CTTTTAGTTAAAAC CCATAGG CAAATATAACATTTCAATTT TC CTGAAAAATCTGTTTTAAAATTGAACAA GATTGATACACATTTTACAGTAAAG CfiTTT CAGGAAAATTGATAA TTTAAAAAACAGTGATGGGATTAGATAGAG CA CATGTGAAATAAGCAAGTCAAAAG TTCTTCCTGGAT CTAGGAC TTGTGATTAA CTTGGTTTGCCAGCTTTGGTTTCT GACCTGATGCAGAGACCC CGTT CCACT CCCTCTTTAAG CAC TG AATC CTG AGCTGTTTATCTTGACCTC CTTGTGGG CTTC AATC ATTGTAAATTTT CTCT AGAACAAATATAAAG T CTTTCTAGAAAGG CCAAGACTG CCT CCAGGACAAAGC CAAGAAAGGCAGC C TCTT CCATGGCTAGGATTGTAC C AGGGGACCAAAACCAGGACAAAGGAGGCACAGGGTG TTTCAATGGC C CAACATAAACCTT CCATGGTT CCAC GAT GTGGAAGTAACTCACCAG GAATGGCAAC CAG AAACAGCTGTGTC TATGTGCT CAGGAAGAAAAGAAAATAAT TAC TGGTCACTTCATTCAGGGAGAAACCCACACCATTGGTATTCCACATTCTCTATTCCATAAGTAGGCATAATTATA GGTACCTCCAAAGTT CACCCTCCCCCAGTGTCCTCTCCTCTTGG CTAATTAAGATTTAAAATGAAGACA CAATTA CTTATTTGCTATATAACATGTTTTTCTTTTTCAAGAGAAACACTTCTGTGAAATAGGGTTTAAGGGTCAACAATT ATGCCTGAAG AATG C AGAAGGTGTCAAGTAAT AC ATTTTAATTTTT A CT C AAAAC C TT AAAAG GGCAT AAGT CC A GAGGGGAAAATATGTATTATTCTGTTTT CTTATAAT TCAC CTAACTTAACAGTTTGACTG CAAT TGAGTGGAAGA GGTTCAATGC CAAG GAA C CCTT CTGTGTAGTACAGC CCAC CC CT CAGAAAGAACATTTGT CCCT CTCAGCG GGTA GT TACAGCAAGTGTGAGGTCTG CCTT GAATGTGGAT TCTCTGGTGCCCTCATGACTGCTGGGCAGCCGGTGACAG TGGTGAAGGCAAGCACCTGGCTCTCTTCACACTTGGCCTTGTGTAGCCTTCATCAGTGCCTGAATCCCAGTTGCT CT TGTGTGGATAGAGACATGTT CACTTGAGACAG GGAATTATTG CAAATCCATGGACAGATGCT CCCG CTGCTTG GATGAGGCCATGCTGACAT CAC T CAC CCAAGCACAT CATGTGGT CATG CTAGAGT CTCATGAAC TGTTGATTTG C CC CGGC AGAGTC AGGTAACAATTGAATG AAAG GAGAAG AATC CT CCTCT C AG AC AACATG AAGTTGTTTGAAGG C AGAGCAGCAAATGATTGG CTGGT CAGTT TACC TT CAGTTTAGTC CGGAGACCATGAGAAGTATATGTGGGCAAAG AAGGCAAACCACACATAAAAGAGCAGTCTGTCA (N ) xG AC TT AC AT CT AGG ACAGAGGGAGACTTTGAATCC CAG CCTTCAACATAGTGTGCAGAAAGTGTAAAAGACCACTCACAGGCAATGCCAGCTCAGAAGGACCTGCATGAACTG AGTATAGTATTAAAACAGGAAAGGAACTAT CT TGTGTCAAATTCTTAT TAAGTACCAGCC CCCTAACATAGC CTC TCTCTCTTTATCT C CAATAACACACT CCTATGGAAACAGGGT(6T>xTAGGCAAGCTGTTTTGCAAACATATGAGA ACAGTTATCTCCTTAAGAATTCACTGGACTATGCAGCAAATAAAAATCTTAGGAGAAGTCCAGGTGCTCGTATTT AG AATAACTC AGGTGCAGGGGT CTGG AG CC AAGAAGTGGTGG CAGCTC AAGCTTTC CT ATGGAATTCT TACATAA CT TTAAATTTGAGAGCT CAAAAG CAGGAGG CCTGTG GAGTAGATATTAAAGAAGAAAAC(N) xAAGAAGGAAACT TTTTCTTTCTGATATCCTTTAGTTGTGTTT CT CAA CTGAAGTACAAACAGTT GGTAATAGATTTTCAAfiCTT TAC CCAGCT GACTGTTGAAAACACACATTGC CATTTAGAGAC CAAGGACA CATTCTCTTTGTAAAGCAGTTTCACGTG TTGATGGGTTCTTACCCACCCTGCCCAGGGTCTGGGGGTGTTACCTTACGACAACACTGGAATACACTGTGCTCC AGCCCAAGCCTACCTTGGAACCCCTCTTATCATGTCTTCCTCACTGTCCTAATTAGCACTTACCACTTTTGTTAG TG C C AG CACO AACAAAAACTCGGTGC ATTT ACTTT CTG C C ATTT CT CTATTTTT AAAAAC AGATTTAATGGTGG C AT C CATATAATGTTGCTGTGCTTTATTACATT TTTGTT GTGT CATACAGATGTCTT CATAGGTG TCGCTTAATGG CT TCAGAATAAC TCAATCAGAGAACAAT CGTT GAAAGGAT GTAT CATAAGTGGTGATTAATT CC CTTACTACTG G AAATTGAAAATTAGAACACTGGTTATGATACAGATATAGCTTTAAAAAATAATTGGAGAGGAACATTAGGTGAGG AGATTAGG TCCTAGTGTATT CCGG GATCAGACTTTAAAGATAAATCGATT TT GGAATAAATT CTAAGG CAACTC C AAGGTC CT TCGGTT CAGTT C CATG A CAG CAG GTGATTTTACTTACACCAGAA( N ) xAATATAAAGGAATTAATTA CAAAAAAT C CACATTCTCTAGAGG TTAAAC CAAAGCCAGCACAAA CAAAAAC TCAAGACTTTGAGATT TG GCAAA TATT AAATTC ATT CTGAGGGGATAAT AT TT ACTTTGTAAAATGGGT A C ATGT CAAAATGTCCTC CTGAAT GTTT A TT TGAT TAATGACCAAATGAGTAAGGTTGT CTGG CTGGGAGACCACGTGGAGACCCTG CAATTTCTCT GTACACA TTGTATGCTTTTTCCATTGG CAGATG CTGT CTTTAGGGAATATT TAATTAAGAAAGGT GTCAAC CTAT TAACTC C GG CTTT TAAAAC TTTGAC CTTTAGAG CAATAAACATCCTGTAGTAATCTAATTTTC CATACTAATTTTTTGATGA GGTGTTAATTTC TTTAACTAAATGATAC TATTAAAGGC TAACTG CATTAAGAACGC C CAAATAC CAGTTCACTCA ATAGAACCAATGTCTTTTTC C (N ) xCAATGTCTTTTTCTTAATGGTCCATCTTTCCCTCTAGTCTTATAAGGCAC AATTTAAAAGGTGTGACT CTTTTATATGTAAG CTATAATA CTTC CAGTTC CCAGATTTTCTCTCCTTTCC CCCAA TT AGTT CATTAAGC ( N ) x CC C ACCTAAGGC CC AAAGGCTTTC AATC CTTTTG CT ACTGTATG AT ATTG CC ACATT TTTCTGCAAAGGAATTTTATTTTGTCACAAGAATTTTGCTTCCTTTCAAGGTATGAAGTAAATTAGCAGATAAGG GAGAGG CAG AATGT CCATTTGCTAATTT AACAAATATT AT ATTT A CTG AC CCTTTTTTGAGCTGGAT ACC AGGCT GGGGGCTTCATTGAACTCAGAAGCACAGAGGAAGCCCAGGCTTTGGGAGCTTATGTTCCCACAGCTTGAATGAAG CCTGTG CTGGCATG CAGGAATTTACAGT CTACTTTCTGTT CTATGATAAAAGATTAAGTGCACACTCTTT CAATG AATGTT AAATAAAAAATT AAGATTGT ATGAGT AAGT AT C C AAAG GGTAGAAAATTC C ATGGTGT TCAG AAAATAT TAGGGTACAGATTAACCTATGTAATTTAATAACTCATTTGATAAATGAAATACCTAATTTTCTAAATTAGTGTTA TTAAGG AACATT CATTTATT CAAAAGGTAGATGATAAT CTAACT TAGGGGT CAGCAAACCACACAGCCTGTGGC C AAATTCATA CGG CCACCTAAACTT TT CC TGAAACACAGAACTGGGGAATGTT C C T (N ) xATGCTGATCCACTCCA CACAGAGG CCTTAAAATGACTTTAAAAGTC CACATCAG( N ) xGAGTCCAAATCAGCGGCCCTCCCATCACCACAC CTCTAAGCTGTTCTTTTCTTAGCCTTGAAGAGGTACAATCAGTCACACTACTTGGAAGGCTATAACATTTTCTTG TGGGAGAAAATAA CAACCTGATTT CTAGACAAGATTATTAGAGCA CAGGTCCTGTGGCTGCT CTTTTC CT C CTGA AT T ACAG AGGTGGG AAGC CT ATGAGTTCTTGG CC AG AG CT C ATAGGGT AG CAGGTT CT GACT TCGCATTGGTGTT TACACAAG GCAG GAAGTCAGTGAGTG GC TG CT CT TGGCTG CTCCATCCTGTGCCTCCTCATGGAGACTTTTTCCA CTAAAT T A A T T (N ) xTTTTCCCACTTTAAAGACCACATGATGCTACTCGACCATGAGCCAGGTGCATGGCCTGGC CCACAGGATGAGGTCAGCTTTCCTGTCTTCCCCTGTTACTCATGGTACTCATAGCTGGGCCTTCCCAGATCCCCA AATTCCT CAGAAATTAG TGAATGT GAACTCACTT TTGCAACTTATACACAGATGTGG T CAATAAGGCCTAAAATG AAAAAGGATATGCATGAATAGTGCAGTTTCTCTGGTACTCTACATCCCCAGAGATCACCACCAATTCCAATGAGT TAG CTTTGTCTTTT GAATGATTTT CT CACT CTAT CTATTAAATGGCTGTG CTAAAAAAAAAT GC TAACAT CAAAA CACAATTTTTTTACTCGT CTTTGAG GTCAAAC TTGACGGAAATATAATTT TGAAATATTGAT CTTTTAAAATAAC TTTCTAAGGCAACAGTGCTATGTT CTGATCAGTCACAACA CACGTGTGGT CAGTACAGGCAGGTGTTCT CTCAC C ACTCCCAACCCCCGCCGCCTGAGGTCCCAGGATCTCCTTAAGAATAACCAGGGAATGGAGACTCAGCTTAGAAAG CT TTGGAAAAACAGAGTATAAAAC CT CCTT CATGTTTG CTTGATCCAGTCTGATGG CAAGCAGAATGAGGAAGAG GCTAACGG GAAGGGG GTTGAGCCT CT CT CAGAAGACCTGG CTTCTGCCTGTC TGGGTACTGACCAGCT CTG CCT C TCAGTTTCACACATGAAGAGTGCAATCCAGCAGCTCCTTCAGCTGAAATCTGACCACTGAATGGGTGGTCCTCGT TGTACCGGGGCAGCAGAAACAACCAGTAATAC(N ) xACATAATAATCCCGCCAAATTCTCACAAAATCTAGCATA TTGTCAGATATGTGAAAGAAATTTGTGAAATTGTATCAGTTATCATTTATACACATTTTAATAAATGTCATTGTG CATTGT CTTTGGAAAAAAGACTCAAGAAAATTAAACAT GTAAA CATGATT TAGAAG GAGATG GGTAATTTAGATA AG CAGTTT CCCTTTTTAAAT ACTT CAAGATTC ATTT AT AAG ACTGGTTTT C C AATAAT AAATG G CATTACTGAG A TTCTATGATGGAAAGTATATTGGCAAAGAAATTAGTATCCAATCGTTACCCAATTGTTTTCCTATTCCTATTTAG TGTACTTCTTTAAGCATCTTTCTGAAAGATTAAAATTGCTTTTGTCAAAAAACCCCACAATTTTTCCTTATAATT TATAAAAAATGTTTGTGCAGTTATTTTTTTTTAAAGATTCTTTT CT CCATG GAATGTTAACTAGGGAGG GTTTC C TATCTT TCTACAGGAGTAGGATAATAGAGAAACACATTGGAAATAT TCAGAGAAATGCTTGGGTTGAAAT CTTTA GTTTTCTTTTCTCGAATGAAATACTAAAGGACCAGGCAAAAACTAACCCCCTCTGAGGAAGGGCACACAGAAGAC TATAGTCTATCAGTTGAAAATGAAGTTACGATTTCTTACTGCTTCTCTCAGAGTAAACAGCAGGATGATTAAGCT CCAGCTATAAATGT CAAG CT CCTT CC CT CT CAGGAGCC CAGTGAAGACACAC CAGTGT CATCATTCTCCT TGAAA GGAAAGAGGGAACAGTGAACTCCTCCCCAGGCCTCTGTGCCTGGAGCACCAGCTCCTGTGGATCCACAAGTGGGC TGGCAGTTCAGGTG CATGAGAGATGC CAGGGACA CTGTGCCTCT CATCACTGATCTGATGCAGAACTTAGTTCCT AAGTTG CGTGTACAAGAGG CTGAGCG ATTATG AC CC AG AC TC AAGG AAGG AGAGCAAC CTGT AAAAGATG AATTG CCTCATGCTCTGTAAGCCATGCACTC CCATTCTGGCAC CAGAAT CACCCTG CAGACAG CACAGG CCTG CGGCTT C CTGTGGGTTATC CCGTGTCACCCT CCAAAAAGGT CATC CTTCACTGAGCTGGAGTTGAAAGACTGGTGTC CTTCT AGATATATTCCAGCTATGACATCCTGTTGGCCAGACCCCAATCCTGTTACCCACTGAACCAACAGCAACTGGAGG AAGAAT AATT CCTT CCCT AGGCTAGTTCTG CAGGGAAC AC AAAC CAG AAC AC ATTT CAAGCAGC CAGTGGTTAAA TT TAG GAAGTAAGCATATATAAGAATGG CAGT TAATAAATA CCACTAGCTGT TATTAATGTT CACAATAT CATTA TGATTAG CTGGGAGGTTC CTG GCATTATTC CT CAACTCTC TCTCTCATTT CAAAGGAGACATTAAAGACTTCATT TT CTAGAAT CAT CTTGACAGGCTTATTG CTGT GATTACACATCAAAACCTTC CTTC CACCTAAGAATACT CACTT TG CAGAGTGCTTAAATTT TATACT CTATATATTATCCTATTTGAGT CTACAG CAATTCACATGT GTTGAGTGACA AAGTGAATTCTGAAATAC TTTTGC GACCTGGGT CAAGAAACAG CAACAACAG( N ) xGGCAACAACAGGTCTGTTC CATGAACCCTAGCTTCTGAGTTCCAAAGCCAGTGTAACTTGGCACCTTGTTTAGAGAGAGAGTGGCCTGTGTAGG GTAGATGT CTGTGAGTGATATATCAAAAAG GACTATTT CTA CTC CAAATCTG CAAT TACACC CTGCTTTAG GAGT G GAATTAAACAAGATGCCTAAGAACATTTAAGATTCTGATTTCC CCAAGGGC GATCTTTTGT CAG GGATCAGAGT GTATATACAATCTC CCG GAGTAAAGTGC TGGT CAGGTGTGTGATGTATA CAGTACTTTTTCATTTC CTAAGTAC C TGGGCCAACATGTTTCCAAATTAAACTTTAAACACAACTGTGGTTGATCGTGGATGTCCCACGGAAGATAAGACT GGCCTGAGGGTGAAGGAAGATGCAATGACTCTTGGGTTTCCCTGTGGCTGGAGACCTCCCTCCCTACCCTCTCTG CAGG CT CAGGAGAGAAGCAGCACT CCAGGGGAGCTGGTGCCAAGGACAGTGG CGGGGGAGGGGGGGGTGTCCCGG AAAGTGGGTGCCACTGCAGCAACATCAACCACTCCTTCTTCTCAAGGGGTTATCAGCATATGAGCTGTGGTCGAG AAGGACGTATACCTTATAATATTTTTTGTTATTT CTATATTCAAATAAAATACAAG CTTGAG GC CAGGTGCAGTA GCTCAA( N ) xAGCTTGTGCCAGATTAACATATTAATACAGTCATCAAATATCAGTATTACCACTGAGTGTTCTAA GGCACTGAATAATTTATTTTAACCCTGACAACTTCTTACAGGTATTATAACGTGGCTCCTCCTCTGAGGAGAACT GGTGGGTTATTCTCAT CACAGTGAAT CTATTT TAATGAAAGCTC TG CATCCTAGAG CACAGGACTCTCAGAG GCA TGAAGGGTGGCTCTGAGGCCATTCAACTTCAGGGCATCTGAAGCCTCCCTGGCTCTCTCTGGGTGGGCCCTTCAC A CACACAC CAGTGCAT CCTGACCC CG GAGATG CC CTACCTGCGTGTGCGGTAGAAC TGG CAC CT CTTCAGGG GGA TCTTGACCACGTGCTC CCGCAGGC CCACGAACAGGACACTCTGG CTGTGCAGGATCTGCAGG CT CCTGATGGGCT CCCTCCGCCTCTCAGGGAAGAGCTCAATCTCTTCCAGCAAACAGCTGCTTGAGGTCTGATTCAGGGGTACCCGCA CTTT CTTAATGGTT CCGTAATCTATGAAGGTCACAGGATGAAAAGGAAACAAGGTCAGAGGGTCTCACAAAC CAA TCCCTGGTTAAACAGAGTGTGCTGTGTATGCAGCCATGGATCTTCAGCCAGGAAAGC(N)xATCATGTCCGATTG GAC C CTTC CAGTTC CAAGT CTCGGGATGGAGAGCTC AAAGTAOC AAGCTTTGTG CC ATCCCATGGAAAAC AT CAG GAAAGAAAGACCCT CAAG GGACGATCTTGGAT TCTTAGTGACAGAGATGCTGATTTTGGTGCCCAG CAGGGCAAG GAAAATGCAATGTGAT TG GCAGCT TCAAATAT CCAGTACGACTGTGATGCCC GCTCCTCCTTCGTGC CACGAGAA GTACACCCTCCTCCATCTCCCTCTCTCCTCTGTCTACATTAGCTCCTTCTTTCCTCCTAGTGCTTGCTCTTTGGG ATGATTTTCTTGGGCTCAACTCCTCCAAACTTTCTTTCTTGTGCTTCTGAGTATTAATTTCTGGCCTGATCCAAG CCCTCCCCGCTCCAGGCTTCTTGCTCCCTGGGTGTTTCTGTGCACTTATTTATCTGCTTGACTGCTAGCACTGCT GGAGGGTGAGCGGCCCCTGAGCCATGCAGCCTTCCCAGCTTAGGGGGACACACAGTTGAAACGAACATTGCTCTC CCAGGTACCCTGTGCTGCACCCCGCAACTCCTTGCTGACACCCTGCAAATCCCAGGCATGAAATCCAGCCTTCTG GAAG CC GCTTAGAGAT CCACT CAAGTAC CTGGAATCATC CTGAAGG CCCTGACT GAATAAAACCTT C CATGATC C CGAGTT TCAAAACATCATGGTTCTTATTTTGCAAACACAGAAAA TGAGTTTAAGGAGTTATT CTAAAACCAGAAG TAGT CAAGAATATTTGACTTCAAG GC CT CCTAT CAAGCAATT CATTGCAAAAGACC TTG GATAAAATACTTTACA TTTTTCTCTCTCTATGAAAGCCATAGAGATGGAGGCTATTTGTGTTTAAAAAAATACATAAGCAGACATGTTTTA GCCTGAATTTACTTTAAAAAGCATATCTGAAACATCTCCAGTGAGCATTTTCTGATTATACACTGACTGAAAAAT CATGTAGACAGTATTGGCAAACAATTGCTTGCTTTCATTTTAAGCTAACTGGTTAAATATGCTACTTTTTAAAAA GGTCAATCATACACTGAAGTATGCTATATGCGGGGAAGATTTTAAAATGATCTTGAGGAAAGAGCATGTATTTGT GTTCATGTAAAAGGGTGAGGACAAAAGAGAGAACGAGAGAGGGAAGAGAAAGGGGCAGCTGTGGCGTAGAGAGGC AGACATTTTAGCAAAT( N) xCTTCAGGGAAGGTGACATCAGCTCACATGTGTCAGCCACAGGCCCTGCAAGGAGG TCCC CATGAGGCAC CAGCAGAGGGAGGTGTCAGC CTTCCTGTGGAG CAGTGATGGG CATAGAGC CTGTGGCTGC C TTTGCGACATCATTACACAAGAGGACATCATCTGAGACACATGTTAGGAGGGAGGCAGGGGAAGCAAGGCTCATG GGAGAACTTATGTGAGAGACCCCTGGGTGAAACAGGAGCTTTCATAGAAGTTCAGGGGTGGGTGGCCTGCCTAGA GTCAGGATCCAGCTGGAGGATCACTGTGCTGTGACCAGCGTCACACTAGTTGTTCACCAGCCTAGAGTGGCATGG GAAACAGAGTTTACAGAACTATAG CCTATCTGAGAATTTAACAAAGACT CAT TG CTGTTTAT TATT TATTTTGTT GAAGGATTATGATT CT GAGAAAC C TG CC CAAATTAACAACAGTC CAAATGTCAATATGCTCATATGTATTT CAAA TTCTTAAGTAATTGTGGAAAGAATCAAATTTTTAAAGATCTGTGATAGTTTGACTTAAAAGAGTGAGCCATCTTT GAT CATAAA CTTTTACTGCATATTTCTG CATAC CTT GTATCTGATTTTC CAACTAG TGGGAACTAACAAAAC TAC AGGG CTTAGCATTT CAGAG CAAAGTG CT CAGCGTGGGCTTGCATGCTGGTC CA CAATGGATGAC CGTGCAT C CTT CACTCCCTGGAGTGTCTAGTCATGTGACGAGTCCCCTCCACRGTTCCCCCTTCCCTCTGCATCAGGTTTAGCACA CAAAGACCCTTCCTGTCTCCTTGAAGCTCTTCCTTCCCCTGCCTGTGCCCTGCCCTTGTTCTGAACTCCTGCATG CTCACATGTTTGGTCACAGCCTTCTCATCCCCTGTCCTCATGATTACCCTAGGGTCTGTCTTCAAAGCCGGGATC TGTCTTCAAAGCTGAATGTTGGCTCTGTATCTGCAACTCAACCGTTAGGGACTGGAGACACCAGAAACTTGACAG GCCTCCCTGACTCTCCATGTGGCCTACCAGCATGTCCCACCTCCTGTGACCTTGTCTAAGGGACAGCAGCACCAT AATC CCAG GGCCGATGATGGGACC TCTC CTTC CCTGTCACCCACAG CTCACCTAGTT CTCGT CCAC CCGCCC CT G TCATCCCTGCCACTCCTGCAGCTTCCTCTACACCTCTGTCACCCACATCTCACTGCATCTAGAGGCAGGCTCCAG GCATGCACATGGTGGCCCCATGCTCCTTCATGCCTCCATCCTTGCCTGTGCCTCTCTGTGCTCTGGCAACGGCCA CCAGGAAG CACTTG CAGC CCAAGAGC CTGTGñTTGTGGCATACC CTGGACATGCAG CTCCTTTG CCTGAGGTGAC AAAT GACAATGTTATTTT CTTTCAGTGG CAGAAAGACAGAGATGGC CCTATAGAAGAGACAC CC CGAAAAAC CC C ATAGATTT CTATTT GACATAGGAG GACAATAG CAGGACCCTGGTGGATCTTG CAAAGCTATTTG TATTTGGTTG C TGTTTTTACTTTACAACATTTACCATGGTTTTAGTTACTAACTTACTTGTTTTGTGGTTTTTACACTTGATCTTA GGCTCCATAGGGAAAAGAACTATGTCAGTTGTGTCATATGATGTATGTGTAGTGGCTGCTATTAGGGCCAGTAGA GATTAGATGTTTCATGAATATATTCTGCTGAAAGACTGAACAAATGAGTGAGGGAAAGAAGTGTAGCCAGCCTGG AATGAGAATGTCATTATTACTCCCACATTTGCTAAAGAGCGCTGATTAATATTTTTCCCATTATAATCTCTAGCA GG AGTT ATGATATT AGTC AGTGTGAAGGTG AAA CAACAAACATGAATTT C AC CAAT ATGG AAAG AT TTCAAC CC A AGAT GT CATAAACTAAATATGAATGAAGAATT GGTTTCATC CAAGAAAATATTAAC T CCGAAAATTTGCCACAAG AAAT AAAG CCATA C AAAACTTCTATACT TCTG AAATTTAATACT CT AACC AAAGTT ATTTGC AT GAAAAAAAGTG CCAGAAATATGTAAAATCTTCTTAATCATTCTTTAACATTCAGAGATCAACAAGTAAAAAAAAAAAAAAAATAGG CATTTC CTGTCCATTT CAGTTCCC CTTC CCAAGGAGAGACGACAATGTGAGGTGGAGCTGCAGAAT CAGTGC TGA CCAGAGGATGCTCAGAATGGACAC TG CTTCAGTG CAAACTCT CATAGTATTTG C CATTTGTGAAAATGT CC C CTA CTCATTATGCTATGGACACAAATCTGAGTGTCATCTATTTATAGTTTCTCTCTCAAATAGAAAACTATAGATACT CATCAT GAAAGGGACAGAGCTACTTCAAGACTTTAAATAT CCAT TAGTGT TG AT ATG GTTATTATT CAAAAGACA CAAATACTTGTAATTACATGAAAACAT CATTAAATATTATA CTTCACTACATATTTGTCT CATCTTAG CAGAAGA GAAG GGATGT TATATT C (N ) xACAGAACTGATACATATTTTCTATGAACTGGTTAGACATGAAATGCCTTTTCAC TT CAATGT TGATGAAT GAAATAAAGGATTATAA CAATGGACT CCACAGTGAACATTAATTATAAG G CTATTACTG AGGATACAGACTGAAACAGGGGAAGAGGTAGT TGTG CTTTGATTGCAT TGTC CTTTTTTTCTATCCGCTCTTCTA GAAGTCATAGAACTTAGC CAACTG GAGGG CAATGAAAAGAAAGT CAGTATTCTAAT GAATG CTG(N )xAAGTCAG CATTCTCCTC CATTGGAAGACT TTGTCTGTCATCCATCTCAAGGACAAGCAGAGGCACATCTGTGTCAGAAGGCA ATATGTTCTTCTGGC CTCAATGAC CC CCT
>Hs5_9416135- 9437285
GGGTAACCACAAAAAGAGAAGACTTT CTACTTGTTGTT CCAAGTATACATTTTACA CTAAAC CCTATG CTATTTA GAGAGATATTGGTAAAAAATTATGGCAGTGGGTCAT CAGATGAGGAAT TC CCT CTCAAAACC CT TCTAGATAAAC TGACTCTCAATT CC CC CAAC CTTTACTCAAGAAATTGGATTTTATGTG CTCCATCCTTTTAC CCTAAAACTCATT AGTACATAAATGAGGAGAAAAAGTTGAGAATTGGAGAGGTGGGAGGTCAGGGAAGATAGAGGGTCACTGGGGGAG TGGACTGT CCACTCTAAC C CGTTT CTGATCACTGGTAT CCACAGAATGAAC C CCAAAG CCATCGCC CATCAAAGG GCAGGTCCCACACTTCCCAGCAAT TAGATC CAGGTGGAGGTGAG CCTCTGGTGGGGTGGTGCGGAAGGGCAGAG C CAGGTAAC CAGGGTAGTCAGGG CATG CC TACCTCGC CCCTACCC CAGAGC CT CT CACCTG CAAACTGGGTACAAG CAGTTG GTGTAAG GTCAGGAAT GTTGTCAAAGGCAACATGAAAGTTATAGTGTATCAAGT TCAAGAC CTG GACTT CACAGATTTATGGCTACATAAATACACACAATGT CTTTATGAATTTGCCCCT GCTTTTAATTTAGTAAGAGACCT TCAAAG( N ) xCTTTACACTGAAAAAAAGAGAGATGGAATTTAGGTAGCATTATGTTCTGTAAGTAGAAAAGAAAT TGGTAAGAC CAGAAT CT CTTAC CACTGAAATG TTTCTCAG CTGCTAAAATAG GGAGCCTGGTA CACTCATTGAAC AG CTTT CTAATT TT CT TTTATATC CTACACAGATTTTCTCTTTATT CTGC CATG CAGAAAAATAAGAGAG GC CAT TTATAG CAAC CATTTGAATT CT CT CCTGTATTATTTTAAGATGCTG CTGCTTTTACATTTTG CC CATAAGAAGTT TTAC TAGAATAAATAAAACCAGTTGTA CACAA TT CACAGAC AATTGTTTT CCAAAGGATTTT TTTAGTATTATAA GCTGTAAC CT CTAATTAAATATGTGATC CCACTG GATT TTATACATAGAATATAGTTACCATAT TTACGTGTGCA CATTAT CCAC TT TACTAACTTT T C TAAAATTT CT CACATAAATC TGAATT TTTAAG CC TTAAGT GG GATO CCAAT TTGTACATTTTTCTTT CAACTAT C CATATCTTTTGATAGTGAATAGAC CAGTCCTCAGTTACTCCTTAGCACCTC CTATATTTG CAATG CAGGAC CCAGTGAT TACTT CTATTACTT CTGGTAGT TCTCAAAGTTA CAAGGAACAAGAAT GATCAT GAAGGT TCAATTCACCTGATTC CATAAT CACATTCTTTGCTC TTGAGACATT CG CC CC GTTACTT C TGT G C CTTGTCTTTATCAC CACAAATC CT CCGATCAG CACATC CCTTGCCTAT CAGC CT CT CTACTTAACAACTT CAT TTTTACATTTATTACATCTTTC CTGC CAGAGT CAGTTTG GACTACTATAATAAATTAC CATACACCAAGTG GCTT GAACAAGAGAAA{ N ) xTCCATAACACTTCCTATATGCAAAGCAGAGCAAAATAAAGGCAAATACTACATTTTCAA AGTTATTTGCCT CAGTAG CATCAG CATTTGGTTC CAGCTGTAAT C CTCAG CCCCCTCC CATGAT GGTAAT CAAAG G CTTGC CATTTG CTGGAAATTC CTGCCCTG CAATAGGAGTGTTCTT CACCCTCTACAT CTGACAACTG CC CT CCA CAGT TCACAAATAC TGACGGCG(N ) xGGAGAGTAATCAAATTGTACTACATATACTCCTGGTGCTGATGAGGACC AGTAATGAGACAGATAGAATGTAT CCAGAGACATCTGTTG GTGCTAAGAAGG GCTCAGGCAGGTAATAG G CATC C TAA C CATT TAG GACAAAGTAAAGGACAAATAAAT( N ) xAGAAAATGTTTTGGCCAGAGTGTGTGCCTCACTTAGA AAGGAAGGG GAAAGATAGAG GAAGG G CAGTGCAAAGTCATGTTCAATGGT CATG GT CAGT GAG G CAAAACTG CAT CT CT GAGC TCAGGATGAAAGAGGATGAGGCAACGAT GGT CGAGGATGCTGATTACATCTC CTAAGTA CAGGTTTT AAA CATGCACACATCTGGT CAAC C TCTT CTGATTGTGTGCAAAC CTAAG GAGTCACAGGTGGGCCCTGATAAGGT CATGAGATAAAATAG GG GGCTTTATGTTTC CTAGGG CATG CATG CACACACAAGTACATGGCAGATTTAACATTT TATAAATGACAC C CAAATAACT CCTGTTTAT CATCTGGGAGCATTC CATTTTTTTCTCAAGAAT CATT GTATGAC AAGATATG G CAAAG CT CAAATTAAAT CAA CA CAACGAATAAC CAA CAAAAACAAGTGAAGTTTCAGAAG G GCAAA TTAAGAACAACTTT TGTTTTTCTGTGGTGT CACACAGAGAAAAGTTATGT CTAGAGTAGATACT CTTT GATGTAC AT CC CAAAAG CT CTAT TAGCACAT CT GCAG CCAAA C CACTGGTT TCAG GATGTGTACTGTTGGG CCCCTCGGTGG CCATATTGGTTC CAGATG GAGTT CAAATAACATAAGGG CCTTTGTTCCCGGG CTGAACTATTTCAT TTCTCCACT GGTAACTTGCACACAT GATGGAGCGCTC CATCATGAACGATGTG CAAT GCCCAGGGGTGCTT CAAAAAATGTAC C ACAGAGACTG CTAAAACT CATGGAAG CCCCTGCT TTTAAGTGATAG CAG CAG CATTTG CCAACAGAGTG GTG CTA GTGGGCTGTATTACAT TCCATCTTTGTT TATG CAACAC CC CTGGAATAAAAAGTTAGG CATTTACCATTATAATT TACCTTAATTATAGTATT TGTCAC TACCTGTAATG CACAAAA GAATTATGGTAGTTGCAC( N ) xTCCTATGCTGG AGAGAAG G GGAAATATTAAATACC TTACAT CAAATACCTAAG GG CA GACTTAAGGATTTCTAAT CCACTGCTGCT GACTGCTGGTAT TTTAAGACGGGAACTT TCAAAGAAGAATGAATGTTG TCAGAAACATGATT CCTAACATAAGGA AG CAAT GACTGGGGGCTG GGAGATGG CCTATATC CCTT GTAAGAGTATTTTGAG CC CTTTAGACAATAGGAAGG C TCTCTGACAGTCTTACTTGTCACCAGGCCTCTGCTATGTCCAGAACAACAGGGTTAGAATTCATGGCCCC< N > xT T CATG G CCACATTT C CAAAATCTT CATACTTTTG CAAG CT CACTGC CTAT TT CAGG GC CATTTCTTTGAACT TAT TTCTTCCCCTAC CCACATAATCAGTGTGAACAGT CTAGTGTACTGC CTT CTG TATC TT CCTC CATGTT TCTG CAA TAACATACAAATACAGGTATTCAGACATGGGGGATGGAGGTCATTTTACTTTATAATTTGAGATTTTCTCATAAT AT CATAAAAT CT CT CCAAGTTAAC CAGGTATGAT TAAAATTAAT CTTTAAT CA CAT CTGAGACATT CTTACAA CA TAA CTGACATATAATCTATACTTGATTGGTA CA CATTCAATATGTGAGAT CAATTTTTGGGAGAGAGGAAAG GAA GCAGTAAAAAAG CTTCTATAAACATT CACTATACATG CCCTTATATATGT( N ) xTATCTAATGTTAGACTTTTTA GATGGTTTATGT CATCTTTTAG CACAGTTTAGTTTAACGATAAC CAAATTA CAGT C TATC CCTTAAGTTG C CAAA AAGAAG G AAT CAG CAGAAAATT CTCGTTCTG CAATTACTAAGATAAAACATC CAGTGAAAGAGGAACTTG GAAAA GATC C AGT A CAAATAAAACTGCATGTTCAAGAATGCTTTCTTAACC TTTGGCACAGATGC CACCTTGC CC C CTC C TTGGCTTATAAGAAGCTGTCATACACCATCATTAACACAGTTGGTCTCACCCTTCAATGCAGATGTTATAGGGTG CTGCTCCTGAATAATTCTTCAACCTGGGCAACGCATTTAAATTGGGAATTTTCAGCTATAATTGTTACTCTCTGC TGTCCATAGGATGAAGTCTTTGACCCGAACCTAAATACAGACCAGAATTCTGAGCCCAAAAACTATCTCCAGCAT C CGGAGAAGAAG CAT CATGAAAGGGGAG GAGAGAGCTGTTTATATATAAAATAAGTTGAATTTGCAAGAAAGAAC CGACTGTCTTTTTCTTTAAAAATTGTAGGTAGTATCAAAGTTTTTCTCCTCTTTACAGATAAAAACAAAAAGGCT AACGGAATGCAGTCGACCGAGAACTAGTTTCTTACATTTCCATTGAGGGTATCTTTCTCTAGATCTGTCATTTTT CAGAATTTTCCAAGTAATAAAGTCACAGGCTTTTTTGTTAGGTGGCTGGAAAGAGAAATGTATTTGTCATTCTGT CTGATCTTCCAAAGCATTTTTCAGTGTTGTTTTTGTTGCTGTTATCTTGTTGAATTTTTTATCAGACTATTATTG GTAT CAO CAATAAAGAGGATGTTGAATTATGTTT CT CATTC CAAAAATAGTG CAAAAAAATTAATCTTAAAAAGT ATAATGGAAT CCACTTTT CTTTCACTGACTCCAT CTGCAAGC CAGGATCAACTGTAAAAT CATTTTAT GAAGAAG CAATATTGCAGGGGATTTATTCTCTGGGTCTAACAGACTGCTTGCAAGCCAGCGTTAAAGACCTTGAACAGTTGA TGCAAATTCCACCTTTGTATTTTACTGCCCTCAGTGAGCTCAACATCATTCATTAAAGGGTGTATTAGCTGATAT CTTACCACCGCCATGCAGCAGAGAAGGAGAATCACATAATTTTAAAATTCAGGCTGAGGGCCACTGTGGACGCAG AAGG CTGACTTC CGACCACCTGCTTC CC CACTAGACTTGTTC CAAATGG CTTTCTTG AGAGGTG ATTC AC AAGTG CTAAAAAATCACGAATTT TTAAAGTGAGATATTT TG GATTACTTATAAA CTACTTTCCGTAATGGCAT TTTACCA ATGAAATGAT CCACTGGATGAAAATGAC TTATCCTTAGTTAT CTGTTTTATT CCTGAG CT CATCTATTTTGACAA GTTCTTGATCATTGTGCAGTACAAACTATGCGCAGCCTTTTCACAAAGGCTTCCCTATGGATTCTAGGAGCTCAC AACC CAC CGCTT CTCAA CAGACAGTCATATATTCTACTTTA CAG CAGAATACTT CTGC CT CAG GGAAT CCTAATG GCCAGTGCCAAATCTTGTTTTCACTAACTGTGAGAAATGTGAGCCTTATGCTTTACATTAATAGGCCAAATGTAT TT CT CTTTGATAGAAAAT CAAACTTCTAAGCCACTGATAAAACT CTGGTGACAT C CTGAG CTTCACAGGTTTGAT CCTCTTACAAAAGGTGTGAACAATGCAAGTGAGGTTAACAAAAATC CTTGCAGGGAAT CACACACTGAGT GGCT C TCTGGCCTGCATCACTAACTGCTGCATTTTATTAACAGCAGGTCAAAGAGCCTCTCGCCCAACACATCACAAATG TT CACAGAGATT CAATGAGT CAACAAGTATGCCTACTGCAG CTGGAACTGACACTGAAAATGTGAACTACATTGA CAAAAATTCTCAGCAGGCCTCTCAAACACAAACTTTTCCCTAGCCCAAGGTCTCTGGGGAAAAATTATAACAATT AAAAGTGCAGTGTGGAGTGAGAACACAGTGCAGCAAATTTTTAATGTCAGAAAGTAGCAATTTGATAACTAAGAA AACTGGAAAAAT CATTATAACCTATG CCTTACTTTG CCCAGCAC CT CCCAAGACT CAACTGGAAAGAG CCTGGCT TTGAGAAGAATGAAAGCCTGAACTGCTGATTGTGAGAAAAAAAAAGG CAAAATAAGTG CTGAAAGGAACT CACAT CAAATATTTCAT TGTAAGAGTCTGAT CT TATTT C CTATAGCCAGAATGG CATTTGACGATACAACAGTG GAGAAT TTTGCTACTAAGATGTTGAATGAAATCCCTCAAGAGGAGAATTACCTTTGAGATTTTCTGTCACTTTTAAGATCA CT TCTGAAGTAT CAAAG CACTACTATAG CAAATGAAATATTATGAGGAG CTTTG CAACAGATGCCAATAC CTTC C CAAACATGTCTGAGGCAT CT CCTAGGTCAGAGG C CCTGGATTTCAAAC(N) xCACACACACACACAAATACCTGG GGAGGAGGGAGAATGCAAGTTATTTCTCATGGTCACTTTGACTGCAATTTTCAAGAAAAGCAGTCAGTGTGTGAC AGCAACAGTAGAAGCACGATTTCCAATCAGACATTAATTGAGACGACCACTAGCCCTTGGTGACCCCATGTACAG GAAGAGATGTTTCAGGGACACCATTTGTTTATACAGATGTCTGTCCTTAAGGGATTTTTGTTTGAAATCAACCAC CAAT CATGTATTTTCTG CAAG GAAATTACACAGTAAG GGTGGGTTC CAAC CC TCAAAATATTTGCTAG CT CAGTG AAATAAAACCAAGATTCCATTTGGAGAAACCTATAGCTGCTTTACTTTGCAATAAATTGTAAAATAGGTGGGTTT TATTGTTTTATAATTCTTTTCTGTAATAGTCTACCAAGAGGAATCCCTTGGGGACCTTTAGAAAATAAAAGGGGT AAAACACATG CTGCATTAAGAAGTTATATATTAT CCACTG( N) xAAGCATTATGAATTATCACCAAGTTATTAGC AGTG CTT CAAAAAATATT TCAGATTTGGAAATGT TG GAAAAT CCACATTATTAAATTTGTATTGT(N) xTGTGTA TTGTCATTGCTACATTACAAGGAACTGTTTAATGACTTGCTCTGAGCATGTTCAAGTGCTCCTTTGGCTGCAAAC ACAGATGACTACCGCAGGCCCGTGGAGGTGACCCAGGATTAGCTACACTCTATTCTTTCTACACTTGGGGAACTG GCAT GTGCTTAAGAAAAGACAAGAAAAG CTACTT CAAAT CAATAAAGTCTTCACAAAT CATG CACACGTGATTAG AAATGTTCGTTCTGGGTGTATCTGAT CATTTAGTAATTGCTCAAGAAAA CAAGAATAACTTTGTA CATTATCTG C CATAAAGACAAAAAATGAGCAAGAAAATGAGAGTCAAAGTTAGAAAACAAACAAAACACTTAATATTATAATCAG AGAACTGAGTCACACTGTCATAATTCACTTTTTTATTGTTGCTAATATTATGATGTCCCAGAAGACACTTCTAAG ACATTTACATAGACCTTAACACCTGGTTTATAAAATAATATTAAAAGAATATAGTCCAGTACATAAAAGAGAAAG TTATTTTAAAAAATGAAATT(N)xGGGGGGGAAAAAGATGAAATCAACTCCTAGAAAATACTGCAGCAGAATGAT AAAGTAAACATACAACTACATAT CAAGT TGAGT CGAGAAGAAAGGAATTT CAG GAAAATACACAAAAATGATAGT GGCTTGGGAGAAAAGAATAGACCTGCCTATCAAAGCATGTGTTGGTTCTCTGATGAGGTTTCAGCCTGAGAACTA ACATGGGAAATGTGCTGTCCTTCTCCTGCTTTACTAACCTTCTGCCTTCTTGGTTGGTGCATCCCTGTGCCGGGC TGATGAATACATCACAACTGCTGCTCACAGCTGGGAACACAACCAGTAATAAGGCACAGTTATAGCTTCAGGGAT GTAAGGAAGGTGAGTGACTGTCCATACTGTGCGTGTGTGTATCCACAGTCACACATGTGTATGTAGCACCAAGTT AGTGAAAAGGTTTTATTTAAAAGGTGCTACCTAGGTTGTGATTTTCCTCTAGTAGGTGGATCCTTGAGCAGCCTA CCCCTAATTTGCCACAAATTCATAATTAAGTTACATAGCTGGGTGTTAGTATACTGTTTTACATATAGATTTCAG AG AT TTTAAC CCTGCCTT AGTGCACCTAATGC AGTAG GGTTT CATC CAATTATAAGGCTTTCTCTAT CTGTTTCT ATAT CTATTTAT CTATCT CTATATAATTTACATACATGCATTTATATAGGTG GATATT CTATCTTTCTGTTTATT CAACTCTATCTACATATAT(N)xCAATTGACCCAGAATTGTGGTCACAGCAAGGCAGATACTCTTAAACATTAGT GCTGTCCTCTGGACATGTACATCCAGATAACTATGTTACAGGAAATACAATTTCAGTCTTCTGAGCTTGGCTCAC CCCACAACCTGACAGTCATGCTGTGCTGCGCTATTTTTATGTTTAATATGTACTCCTTGAAGATGATACTATGAA ATTC CACTGTAACACTGATAGTGTTCAAATGCT CACTTTAAGAAAG GTTAGATG C CAG GAAG GAAGTGATACCGA ATCCTACAGTGCAGGAAAGCAGCTACATTGGTTTCGGTCTTGGTGCTATTTGAATACCTCACCTTCCTTTTCTGT TCTCTTCTACATCTCATAGATAACCATTAATAAGAAACGGACTTTAGAAGGATCTCACAGTCTTTCCTGTCTTTG GTACACTGTAAGTGATGTTCAATTACACTGGAAAACAGTGAAGACTATTTGAGAAAGCAGTAAAATGAATGGCCC CAGCTGGACAATATGGGGA CAGTCTATAATGG CTA CA(N)xGCAATAAACTTCTTAAACTAAATATAGCACAATC AGAAGG CTAGAG CAGTA CATTTAATCACACTCTTAAGGAC CATTAGAGAAGTTATATT TTACAAAATTTC CATCT GAATCAC(N)xAATGCATGAATCTCTGATTGGGGTGGGGGAGGAAGAATCCCATTCCTCTTCTCTTAGCCAAGCC CATAGC CTTC CAGCT CTGTGTCCAAATTAGAAATAG GG TATTA CTCATGCGATTCAGAAAGT C C CAGAAAAATCC ATC CAT CAAC CCAT CTA CACAACAAATAATTA( N ) xGAGCAAAGGAATGGGATACAGTTTGGCCAAACTAGGGTT AAGAGTTTCCATGAGAATTAATCAGTGTCCACTAGCTGGAATATTGTGATTGGCCCACTTGGGTAGGTAGCTCAC CTTTAAAACATCACTGTAGTAAGGAGGTGG G CTG CAATGAAGAAAAGAGGAACAGT TCTCAAAAAGTATGTGGCC CCTGAGCTGTGATAAGGGTACCCCAATTTATATGTACTAAAAAAAATCTGAGCTACAGTAAACAACTTAGATCAA CCCACT TAATTC CT CAGCACTGGAAATGTAAGTG CACTATTCTTAGATACTGTTACAT GAAAAT GTGGTGATATT TACTTCCCAGAATCTCCCAACCAACTTCCCAACCTTTGCTTTTTCTGCTTTGGTTTAACCAATAGTGGTTACTGC TCAGGCTTTAAACAAAAACAAATGAAGCAAAA TGACTCTGATTT CTTTGTGGCCTCGGAGGCTCAAAAAGACACA AACACACATGAAA C CC CCACTAGCTTTCCTAGATAAAG CATGAACTGATTAGGTTTGCAG CCCTGCAGCTATAGA GGCCTCTCCAATACTCTGATCTGGAAACCACAGGGCAGGAAAATCCCCAAAGTCCCATGAAATCCAAAGATCCAA ACGTGGGGTCGT TTTTATGC CCTCAGGAAAGTTCTG CCAAAAAAAGAGATCTTACACACAGTGTAATTGAGAGGG CAGTAT CCTTACGTGCA CATGTAAATCCAAAT CCAAGCAATTAACC CCACACACAT CTGCCCCT CTAATGGGTGT CTGAGAATAAACAG CACACGTCAATCCCTC CCAGCCTCAC TCAATCACGAAACTAAGAAAAGAAAATGAATTCTG AACTTCAC CACACAGCTGTGATTCACAAATAT TTATAGTAGC CATAAAGCTGGTGAC CTT CAGGAAGATAACACA CTTGGCAGACATTCACCTAAACAAAAGAGGGGAAAAGGCCCAAATCTCAAACAGCAGAGAACAGAGAGAGAGGCA AGATTT CC CAAGTT CCATAATGGGGAATTC CAATGCATT C TGAT GGAGATAATGGAAG CTGACAAT TCTCCTGTC AGTGAGCTTTGGTAAGCGGAGTGAAATGCTTGTGACAAAACAGATCAGCTCTGCACGGGCCTTTCTGGATGGCCT CACAGAGATTTTCCAT CAGG CTGAGTACA CAATCGGAT GGGACT CAGGAGTGGTGT CTACAGCCCTGTATTTCTG ACATTATTAGGAATGACAGGCTAAACCTTTATCCCAAATTACAACCACTCAGCATGTTTAAGCAAGTGTGTACCT TGTAT CAACAATATA CATAGAGATGCCCTGAAAATATCACAAAATAAAATTCCTTT CAGGAACGTGGCAC CGAAA GAAACACCGT CAGCAACT CAGCAAGCACTT TTTTTTTCATATTTGGTCTCTTTGTCTAGT TTAAAC CC CT CATTA CCTATTAT CACACC CAAACCTATGTCTTCT CACT TTCCTTATTT TATAGCCAAGAAAT CTAACAAAAT CAAGACA AGTAAAGACT GAGñGCTGA CTCTTGGTCCGAT CTGTCCACGG TCAGGTGGTTATGAAT CT CAAAAAGCGAGATCT AGATTCAATCTTTTGGAACTCCTGCCCTTAACATTATTGTACGCACATCACAACAAATTACTATTTTACATCAGT GCTCGAGACTGCACAAA CGAG GACATCTTATTTGTGATTT GA CCAG CCTTAACTTGGT CT TTGATACC CACCATG TAACATTTAATG CAAACA CCTTAAGTTAGCTACAAGAATGAG CCAAGAAGCAAAGCACTGTGAT CAGGATTTAAT CATGGTTAATATAATGGGTTAAGTTCTATAAAATTTTTTAAAAATGAAATATAGTTGTTAAAATATAGTCATAGG TTATATAAAATCTTAAAATAGAAAGTTCCCTT CT CC CTAACGAAGAATGAATAAAAGAATGA GAAAATTTAAGTT GTCTGG TT CTAAA C TAGGTCTCTATTTGTA GAAAGGAGAA GTGTGTTTCTGGGATTTT CT CC CA CTTGTACAAAT ATTGTTATTGTC CGAACATC CACAATAAATTAAAñTTG GGGATTGATTTTTTTCAGTGAGGACAAAGAAAGGCAG AATAAAACATTGAGAAGGTTTTAAAAAGCAAAACTATATTTTAAATATCTGTTCCTGAATTAAACAGTTAAATAT TTTTAATACAATCAGAAGAAAGTAAAGAATTTTTGTTCATTTTTTTACTATTCCCATTTTATAGATGAAAATGTT TTGTTTTATT CCTAAG CACTGATTAAAATCATTAT CAAGAAAAAAT CTGACATCAC TAAAATAAA CTTATTT AAA TTTAAAAGGATAAAGCAAACAAGCAAATGAAATAGCGGAAAAAACAGTGACAATAATGAGTTAATATTCTTAATA ATTAAG C CTTAAAATCTGTAAGAAAAATGTTGAGTCTCTATAGC TAAACAGTTATAAAAT CTG GA CAAATAATTC TCACACACGCTTGTTTG(N)xACCTCTTCTCAGAGTAATGGCTAATTATTGCTTGAAAAAAAATTAGCAAAGAAA CACAATGAG(N)xCAAATTGACTAAAACAATTAATAATACTACTCAAAGTTTTTGAGTAATGATATTCAAATTTG TTTTAG CAAAATAAATT CTC CAG CATATAAATGAAGTATAAATATCTACAATA CTT TTACAAAG CCAACTG GCAG ATTTTTAAGACT CC CTGGGTTTTATGTAGTAAAT CAAC CATAGCTCTTCTGAGCTAATGAATTAAC C CTCAATTA AAAGAAAGTATTACAATAACAATTGCTATTGTTTTTGAAGCATGTACTTCATGCACCAAATACAGTGACCTGCTC AAAATGAGAAAG CAGGATTG GAACTCAGGTGTATGAAGTCTC CCAT CACCCTTTTC( N ) xCTGCTTATCAAAAAT ATTCCATAAAAGACTACATTAAATGTCGTCTGTGATTGTCTTCAATAGAGGCATTATAGTTGAATTTTCCTCTTC TTTTATATATTTTCAGTAATGAGCATTCACTACATTAATGCAC(N) xCTCCTGAAAATCTTACTGACCACATGGA TGATAAAGTG CAAA CACCTGTACTCCAGAGATAAT CTC TGTTTAGAGTATGGTATATATTAATATATATT CTAGG AGACTTTATAATACACATGAACTTGTGCATAGAGATTGTGTGTGTGAGTCTGTGTTTATTTTTAAGAGGAATGGG ATTATATTAAATATAT TATT CTGCAATTTG CT TT CT CAGTTAATAAGGAATCAAGGATAGTTTT CCATTACTTTT ATGGTCAGAATTTCATTT TTTATATCATTG CTATTAAGGTGAAG CACACTTTGCCATGG CACATGGAATAAGTAC CTAGGGGCCAGTGCTCCTTGACAAAGGCATGTCTCCAAAACTTCTGCTATATTATCAGGCTCTTGAGTTTGCTGC AATTATAGATTTCTAAGAATGAACGAATGAATGAGTACAATAGTTCCTTGCTTAAATTGGTTAAAGAAACACAAA TTAATT GGTCTTT CAATT TT CTTTGCCCAT CTTTGTAAATGATATAGTAGAGAGTCAAATAT G C CTTT AAGTATC AGAAGTAT CCGT TTGTAATAAAAAAGAAAAAT TTAAAAAATCATAC CACAATTTATTTATGTGG CAAAGGGTATT GCCATGTTATTTCATCATTTAACTATGTCAATCCTGTTCAAACTCCATCTTTGCTCATAATAAAAGAGGGATATT CGAGAC CAGACCAGAACATT CTTGTTCAACAAAATTAT CAGAAGGC CCAAGATAATTG CATGTTTAC C CTAATAT ATATAAAAGG CAGTAT GT AT AAGGGTGCAAAC TAAC AGTG TC AC ATTC AG AAACTATAT AAAAACCAAAGTAGG A AATACCAT CTAAAC TTAT TT CATAAATGAGTTATATATAC CAGC TTAACAAAATGT CAAGAAAAT AATAT TTACC AGGCCCTAATTTGAGCAAAGCTGTAATTTAGGTGGGACTTAGACTAACACATTTGGGGCATATAAACCATGTCAA GCAATAGAA CAT CAAATGT C CTTATTCTAAGGG GTAAGGT TTGCAGTTACAATAGACTTCAATAAAATTTGTTTA TAATTAAAAT CCAAG CATO CAAGATAAATTG CACTGTGGGA CTAATACCTACTGT C CC CGAAACAC( N} xAATCA TAGCTCAGAAAGCTGCTTA(N) xATTTAGATTACTGTTTGAAAAACTGATTTCTTAATATTTGTTCTGAGTTTCT CTCTACAGGTCTGAGTAACTCCAGTAAGGAATAAAAACTAAAATATGACTGTTCTTATTCTA( N ) xTCCTATCAT ATATCATGATAACCGCTGACCATCCAAAGCAAGCCAACACAGGCTCCATGTGGTCATTCTTCCAGCTCTCAGGTG CTGT GGTT CT CCATGGGGGAGGGAACGGGAGGTTCC GGTTAAGACACTATGTGAACCTGAATCTA CGT GGGCTGG AGAAAGATACAAAATGCAGACCTCCGTTTCTTCTTCTGCAAAACGTAAGGACGACACAGGAGAACCTCTCGTCTC AGCTGAGCACATCTACACATCTATACAGAGGGCCCAGCTCTAAGACCCATTTAGCCTCCTTCCTCATAGACGAAG CCAGGGACTGGTAACTCATAAT CCAAGAGCAGGAATG GTCAC CAATTATT CAGAAAAT CT CTGGATTATCTAAAA TCAG CAAAGT GTTT TGTGAATGGGGAAAAAATATAAGCCTGT CCACAATGAGACGGTAATTTAGTTAG GGAGAAA AATAAATAAGTTAATAAACAATGGTAGTGTCTTTGCAGTGACTAGTATATGGAATGAAATTGTCACTTTCTCTCA CTTTACTTGTTCCTCCCCTACACAACTAGTCCTCTATCCCCCTTCACCCCTTCCTCTCTTCTAGCCTTCCCAGAA GGTAGCAACACCTAGGTCTAAAGTGCTTCAGAAAGGCTCCTGTGAAGCCAGCATTTCTGAGCCAGCCTTTGGAAG ATCCAAACAAGCACGATTCGTGGTGGGAACAGATCATTGTGAAGTGGGGGGTCAGTCCCTACACTGTCCCAGGGC CACAGGAAGGAGGAAATTGTGTGGAGGCCTGACCAGGAGGATCCCTGAGAGTATCATGAGAGGTTTAGGGCTTAC CCAGCAGGTACAGGGGACTGCAGAAGGTCCCAGGACAGGAAGTGAGATGACAGTGGTATGCACTGCAAATAAGTG GGGG CAGCGCACAGAGGTAACT GCAGGAGACAAAACTG
> H s 5 _ 60570005 -6060 323 4
GGAGCATAGCCCTCCTGATACACCACTCTGGCACATAGTAAATGCCAAATAATCATTGTATGAACTGATGAATGC ACGACCAACCCCCATTTCACAGTGTATTTTAACAAAGAAACATACACAGAGCCTTGCTGCCTTTGTGGAATCACC AAGGGAAAATAGCTCAGCCAAAGGTACTCTACATAGTTATAATTCTAATTCTGACTAACTTGTTAGGGTTCTCCC TCTGGTGATATATTGTAGAATTTAAAACATGTTTGCAATAATGCATGCCTTCATCTTCCCTTTAAAAACTGTGTG TGTGTG CATGTCCACGT CATTG TG TCAGAGAATGGGTTTG CCAAATAGGC CTTT GTAGTC CAGTAGTT TGAACTA TTAGATTACTAATAGAAATCTATGAGAAGATTGTTCTTCAGCAAGTTTTCCAGTAAATTATCTCTAGGCAAGAAT GTAGCAACTGGGTATTTTAAAATGTATTTTTGTAGCAGCTATTGACAAAGTACAGCCTGTATTGCTTAAGAAATT TTAACATGTAGACATCTAAAGTGCATTGTTGAAATAAAATGATACATACATTCCAAGGGTCAAGGCTGACCAATG TCAGATTTAAGAATGTCAGTTGGCAGAAGTGGAGGAATTTGGGAACTAGAAAACATGGCTATTTGCTCAGGTTAA TTCTGAAATGGGTGGAGG GAGAAT CAATGAGCATGTGAAT CCAAGCAGT CTTCAGACAGAATTT TTAG CAGTGCA CTCACAGTGAAAGTAAAACCTTTAGAAACTAAAAAAATTAAACATCAAGCTGAACAAATAAATAAAGCAGGAACA AAACTGG CATAGGT CTGGTTTTTTAAAGAAGAGAAC CAAAGCAGG CAGTT CTTAG CTGAT CTCCTGGAATT C CTA GTAAAAGAAAG AAAGAAAATAAGGGTGC CC AC ACTGGATT AT C AAATCTGTTTT CTTTTC CTTAGCTCTAGGGCT GTACAGAAGCCTCCATGCTCTTGGATATGTGTGCCACCTCAAACCCAGCCCCTCCCAGACGCTTCAGCCCCTGAG CTGCACCCCCACCTGGCTCTCCTTCCTCTCTCCGCTACCTGGCTCTCCCTCCCCTCTCTCCTACCTGGCTCTCAC TCTCCCCTCCCGTACTGCTGGTCTGGGTGTTGTCAGTTCTTTCCATTATTATGGGTCTGTCCATGCCACCACCCT GATG CAAG CC CCTACTGT CTGCTG CCCAGCCTGTGGGCAGGAAGGAGCCC CTGGGATG( N ) xAGCATGCTCTGTñ CACACCAGCTACTGTTATTTCTTATTATTTCGTTGCTCCCTTTCTTGCCTCCATTGAATCCATTTTCCTGCAGTT ATCACTTT CAAATAAACT CTGATCATAACACCTTC C TGCTGA( N) xTTAAGATAGAAGGGAGGGAAGAAAAGAAG GAGAGGGAAGAAAAATGCATGCTGGTACCCTTGCCTGTTCTGGAGGCCACATTGCCCTGAGGATTTCTGTCAGTC TCCAGGAACCATCTCATTCCATAAGCCTTGTGTGCTTTTCTCACTTCCTACTGCCTTTGACTCATTAAGGTGTTG ATTCTCAATT CAATTCGAATGTTT GTTC TTAGGAAAAATGGT CCTTACAT TGGCATGATGGTGT GAC CTCTGAAT GCAGTTTATGGCTTTTTTGTGTGTTTGTTTTATGTACAGACATTTTGTGATTTCAAGAAAGGATAAAGTGATAGG TATGAAGTTGCAGGGAAAAAAAATTCCATCAACCCAAATCCAAAAGGGTTCCTTGCCCAGCACAGCACAGCAGCT CACTCTCTGGTATTTCTGCCCCCACACTGAGGAGAGACCCACAGGAGGAGCTCCAGCACAAGCCGATGGCAAAGG AAGGGAAACAATAGGAGGGGAGAACAGC CTC C CACATCCACCATTT CAAC CTCT CAGAGAAAGTACTGAAAGTGA AAGG AG CATG AATAAAGAAGCCATTTCT CTGAACTAAAAG CAGAGGAGAGGGCT CTTC CT CCTG CTGC ACATGAG TGAACAAAGATGGGCTCCAGAA( N ) xAGGAAGTGGGGTTTATTGAGTGACTACTGCTTTGGGCTCCTCCCTGTAG TTTAGGTATTTTGCTCTGATTGAAT CTTA CATAACTATATGAAGTAGATT TCAT TACCACATTT TACAGAAAAAA AAAAAAAAAAAAAAACCATGGATACTCC AAAAGGCC CAG G ACGAAG AT AT AAAG CTGC CC AGTGGAAACACG CTG TGGCTGACACTGCAAAGAATCCCGGCTCAGCCACGTCCTTGTTCTTGTGCAAGCCAGCGCCCTGCACCTGAAGGC CCTTATGTGG( N} xTTTTTTTAAATGTAAAAGGTGCCCTGTTTGGGATAACGAATTTATTGTGTGATCACCCTAC T G { N} xTATTATTAGGGTGGCATCTAGAGAGGCGGCAGCAAGGAGTGAAGCCAAGCTTCCCACTTGACATTCCCA CCTTGGTC TCTGTGAATGATGCTC CCTGTAGTTGTG CAGTGC CAGC CTTTGTGG CTTCAGACTGACCCGGCATGG GCAG CACTAG CTACTCTAGATTAGAGAC CATATGGG GGCACCACACTTG C CACTTCTACCTCTG CACOAATCAGA TTCACCACCTTCTCCTTCTTGGGTTACATTGTTGGGACTGAATTTTTTTTCTTTTACTGAAAAGTTTAGAGCCCA GATTTCTTAAAGGGGAAGAATGTCAAGCAGCGAATCCTGCAGAGCATCCCTACCCCTGGCCCCAGCCCATGCTAA TTAAGTGACTGACTTAAGGAAGTTGCCT C CTC TTAGAAAT TAAGAGGTGGATTGAGTATT TTCCTCTGGCCTCCT TAAATGTTCTTGGTTACAGTGATAAAAATGCTGAGCTCTATTGCCTACTATTGGAGTGAAGGCTGCTGGAAACTC CTCGTTCCCTGAAGAACCTAATTGTCGATGCTAGTGCATCTAAACAGGTGTTCTTTAGATTACTTTTGCTAATGC CTTTTCACTGGAAAGATCAATTTAAATGTTGCTTTGAGCTAATTAGTTATTGCTTCTGTGTCCATCCAAATTTAT ATATTTGATTCTGTGTGCTGGATGAGGCATTACCTTCAAAATCTATAAGCTTACACAAGGTAGGTCATTTTAAAT TTAGTTATCTGTCCTCTTCTATATTTTATACAAATATGCATATATTTCTTTTATTATTTAAAATGCCA( N) xTGA AATGCCTAGTTTAAACATATATTAGAATGTCCAGCATCATCTTTGGATGTC(N)xTATACAGAGTATGTGTTGAA TGA CTATT TGTGGAACAGATAAAC TAATACCCTAA CTTAAATGGCAAACAGATTAGATTTGTTGT CT CTAG GTGT TTTCGTTTGGGAAT CCTTTAAATGTTGC CTAAACTTTCTGTGTTTATTCTGTTCTCTG CCTAGAATTT CCTC CAT TT CATCTCTATC TC TGGAAAT(N ) xATCATGGAGTCATTCTTAGAACAAAGGATCTGGAAAGGG CCACGT CTAGT TT TC CAGATT CCACAT TACT CCAG GT CT CTGTTACCTTAT CTGGTG GGTATGTGGAGAATGTGAA CTG CTTAAAA TGAGACAGATGGTATACT CT GCTGAACC C (N ) xAGTTTGGAAGAAGGATGTTTAAAAAATAGGGTACACTTG AAC CTATTAATAAAAGTAATTAATGTGGCTCTTCAATCAAGCCCCCAAAACATAGAGCTTTAGAACCTAGAAGTTGTC TGTT CAGACT CTCCTTTTAGAGTCTTAG GATCTTGTTTAAGGTCAC CTGG CT GAGGAGGAGCAGAGTC CAGC CAA CAGGTCTCCTGAGTGCTGTTCCAGACATTTCTTCAGGCATAATTTAACATGTATCAGTTTATCCTTGAGTGTAAA GTACACTTCATGATTGAATG CAGGTAACTGGCTATT CATACTTCCCCCACCCCCTTTCTTTTAGTGCCTGTTTCT AAGCAG CAGAGTTGTGGT TCAGTGACTCTGTCTGTATAATAT GAAAACAGGG CACC CAGC CAATTCTGATGAGCT AATTAT TT CC CTAAAT TATOATGC CTAAGCAAAG GG CC CT CAACTCTG CTGTTCAACACT TATOAT TGTATTTGG GGGGTGAG CTGTACATAAGG CATATCAT CAGATAGCTAGAATTTTCTATT TCAGATTG TCAAGCAGTGAAAATTG TCGTTCTTAGAAAT GACTGACAAT CCCCCTTCCT CATTTC TG CTACATTGAGTTAGGTAAAGAAAAAAAAAAAAG ACTTCAGTCTTTTTACAGATCTCTTG CTGAAAACACGTGA CAGAACTA TACATTTT CACTAGAAATTGATGGAAA AATGAG CA CTTC CAAA TG CCTGAAAAACTTTG CAAAGG TCCCACTTAA CAGAATGTGG CTAACA C CTCAAAAAAT AAAG CAAAATAAATGTTG CAGT GGGCATGAAT CTTG CTTCAAGTGCTGGGTTGTTGGCTGGTTG CCTGGGGAGAG TGGGGC CAGGGCTTGC CAGCTGGCAGAG CCACATGC CTATAAAG GATG CAGT CAGCTTACAC CTTGGCAAAC TAT GATT TTGATGGAAATCTTAGAGGT CAGCAGACGT T C TC CCTAGTAGAGTTACGTTAGCTCAGGT CATTGCTTTTA TTTGAGTGAAGAAAACAT CTTGATGGTCATAATT CATAGGAC CAAGTCATTG CC CTGATC TAAACCAGAGTT TGT TACTGGAT CTGAATAGATAGACATAC CC TAAAGAAGGCACAGGATGTAATTGAAGG TGGCACTT TCAATTAC TAA T C CC CAAGTT CAAC TAGT CATG C CACTGAAGCACAC CAACTG GC TCAT GAGCAAGCACACAT CTGC CC CAAAGT C CCACTCAGCAGC CATTGT TTTCAAACTCTGAG CT CCAAAG CTA CTACCAT CCTATT CT CT GATG CT CATT CATAT TCTGCTTTGTGTAGTTGTTGTTTTGTTTCTCTTTCATAGATCTGAAGGCAGTATCTCCAAAATAC-CTAGCTTAGT GCTTTTTAGCAAAGGAAAGAACTG CAGAGGGAGATT CTAGATTT CAGAGT C CATGTTGAAAATTATAG C CATTAA TATT TTTAATTGA CAAAATACC CATCAGTCTC C CAATG GATTTGTTGGGGTTTTTTTGAAATGG TCTAAACTGAT TCTAATATTTATTTTT CTAAATATTAGAAT CAATATTG GCATGCAAAATAGC CAGAAGAAATACTGTGAAAAAAT TCAAAAGAC AAA TATA GATT CGGAAAAACATTTG CAATACTGTA TA TG { K ) xTC TTTCTTTCTCTGTCTCGCTCT CTCTCTATCTATATAGAT TCAGGAT CAC(N)xTAAATATATATACATATAAAT(N)xGAGAGAGAGACTGTCATA TAATATGTATATATGACCTCTATGTATGCAGGTCTATATATATCTATATCTATATCTGT(N)xTCCTGCAGAAAC TCTTCCGAAGGGAACAATCAGATGAGTGTGTAATGGTTTATGT(N)xGTATTTGTAAGAGTATGATCATTTTTTG GAAAAATATACGTATATATACATATATGTTTACATATTTTTATACATATACACATATGCGTACACACATATATGT TTACAT CTTCATGATAG CCATGATTTATGCCCTTCCAGAGTGGCTTTCATATGAATGGATTGATAGAG CTTAGTT TTCTGATAATAAATATGAGGGTTTTGTTCAAACGTGCTTAA(N) xTAAACAGACGTATAAAATTTTTAACAGTAA AGACTTAATGGAGCTGACATATTTGGATATGGGGACTAGAACAGAGGACTTGAAAGAGAACTCTTATTGATGCTT ATGTGAATACTG CAATGATGAATG CATTTTTTTCTTGC CAGATATCAA CCTGGATAAGGT CTTTAGATTTAAATA TTAGTCAGTTGAGAAATG GATATACA CATACTAATAAG GAAGTTGAATTAGATGAAAACTAACAAGTTTT CCATG CTTTAGGCTGCCTG CATTACGAGTATATA CAGGTGACCCTTGACTGACTTACCCCT CAGT TGGATATTGAGGGAT CAT TAGAATT CTAGAACTGATTGTATAATTTCAGATATGTGGAG GAG GGAAAATTGATGAAGTGAAGAATAATAA GAGAGTAG GTTAAAAAGAAACTGTAAAGAAGTAAAGAAAAAAATACTGTTTTAATG CTATAAAAATTAGTTGGTG GAGGATAG TTTAAAAT TATTAAAGTTGAATATAGTAAAT CAAAT TACCATTAATAT TCGTGAGCACGTGGTTGTG AAAAGATGGATGAAAT GACTAG CTAACTGT CTTT TTAATAAGAC TATT G GTAATAGAATCAGTTGTGG CT CATTA TTAACT GTATTAACAGAAAATTAG CCTACATGAG CACTTAAAAATATGAGGTTGTTTTAATAGACTGT GGT C CCT
t a t c a a c a t a t g t a a a a c t t a a a g g t t t a t a a a c c a t t t c t c t t c c c a g a a c t a a a c t a g a t t a g a t g a t c a a t t AATGGCTAAAATGTTATTATATCTCTTTTTAATATTTCTTGTTAAAAACTTAAATATTCCTGATTCAACTTGTGC TTCTATTTGC CTGATATAATTT GGTCATAATTGGTT TCATTTTGATTGT CTTTACATTTG GGGTACTTATTATCT TTTCTCTCATCCACTGGGAATGCCAAGCACCAATTTTTGCTACTTAGCAAGTCAGTGAGGAGCTTCCCACTGAGT GCCTTAATTATTGTTAATTGAATATTGTAAATGATTTTTCAGGATCACCATCCTGGCAATCCACATAGCATTTTT CT CAGTTAGCAAAAGT TAAAGCAC CAAGTGTC CCTTCTACCCCT TACCATA CAGT CAT CAAATC CAGTAC CAGCA GTAACCAGTT CCTGTATAAACGAG CCTTCTCTTT TCAAATGT CACATCATGTATATGCTT CATACGA CACTTAGA ACATGGTGGCTGCT CAATGAATATTAAACGACAG CTGT C CTTTTATATAGAAGGAGTC CCAGTTCCAGTCTGAGA GGAAACATTCAATTTC CCTTTC CAGAGCTCTCA CG GTT CATCTCATGTAG CATGG CATGC CCGGTACC CCAGGGT AGCACTCTCC CATTTACAGG CT CCAGGACAAAGCTTTCTAGAACTTTT TG CTGCTGTATCAT CAAAAGGAAG CAA AAAAGTAGTAAATAAAGAG CTT CAGAAGAT CCGAGAGTATTG GAAATG CTACAGGCAACAAACTGGAGTGAC TGA G C CAGC CTTCTTATGTACTT CTTGCCTTACAATACACAATAAAAAA GAAATGAGAAATGTAT GACCTG CCTTTT C AG CTTTTCAACTGTGT CTTCAACTTTGGAGGGGG CTGGGGGCTAAGGCTT CAATGGAATGAAGT TGTGTTGGAGG TTGGTTTTAG CTA CATAAAATTTTGC CTT CTTAATGATTTGTTTT CTGGC CCAGTCAATGAAATTT CAAC CACTG TTATGAAAT CAAGACCAGAAGAAG CCTTGAGACATAATTTAGAGT CTT CTTC CCTATCAAGG CAG CCCACAGATC AACACAGAAAGAAGAGAAT C TGTC TTAGAGAGACAG GCTT CCACAGGCTGTGATTGTAAC CTATT CTATAAATGA CAT TTTCTGCCTGGACTTTCCTCT TATATGTAGC CACTTG CTATTAAGTT TG GCAG GT CTTAGATT TAGAAACAA ATTCAAGG CCCAC(N)xCCCACTTTTAG CAAGTTTCTAGTGTTGAGTGTTA CAGACTACGTGGTACTAATTTTGT CC CTGT GAAATAG GGACAAATT CTCTTTTTTTCCCCATTTGCCTACATTTCT CAACTGTT TGATCCTTTTATATT GTACATAG TTGTGT AAAT CACCT CAAAT CTTTTGTTATAGAGACAG GTAAGAATTAATA CATGATTGAATAAATA AAA CAATAAAACATGC CCTCAGGCCG CAGTTTAAG GGCATTTTTGCTGGTT CTGAAGGAAAG CTGGTCCT CATTA GTAATCAGAGCCTGCCTGTGCCTCAGCCCTTTCTGATGAGCAATGCTCTGACCACACCCTTTTACCTCCCACCCA TCATCTGCTTTG GTGA( N ) xTCTTGGTATGTATCAAAATATTTTCTTTTCTTCCTGTTTTCTCTCAATTATAAAG TT TTTACCTG CTCACTTAGGGCTTTT CTGTTTTTC CAATAAGATATAAAAATTATTGTCT CGTAGAATGATGTTT GGGAACAGAGGTTTAGTAGAAACCAAACAGAAAAGAATAATTATTCCCATCTACTTGTCTCCAAATAAATACTGG ATTAAAAATT CTAAATAT TTAT GGAAGGAGATTC CCTAAC TTTCTT CAATGT CTTG TTCAATCT TCTTACAAGTA GTAGAACTTATCTGCCCTGAGGCTCCACTGTCTAATATTTGCAGATACTGAGCTTTTCCCCCTCCTTTTCTACTT GCTTTGGGTAGAATATCTGGCTTTTGACTTCAATGCAATT{ N ) xGCCñTAGñCTGGAGGCCTATTTGTCTGGAAG GAATG G GAGGGGA C CATOAGCCTGTTTTAATATGTGAG GTAG CTGAACTTTAAACTTTGCACACGGTCAAACTGA CACACAGTGCCAGAGAGATCCAGAATTTTTGGTCATTCCACCTCCTACAAGGGAAGACCGGTGACTAGTGTGACG AAAATATAGATCTCATGAGGGTTTTGCCATGATGACATGTCTTATGTCACCATATTAGCCACTGTCCCTCATTTA TTGGTGTTTC CTTG AGCCTTTAGAAATATGTAAACATATAAACC CCTAGAAATATGTAAC ACTC CTATGTTTTCA TTACTT TCACATTCTTTGGGCC TAGAATAC CCAC CTGC TACAAAGAGTTCATAACTTTAAATAATTCATGAATTT GTCAGTGTCTAGTTTCTTTTGTAATCATCCTAATGACAAAATTCACTATACAGCAAATCAAATCAGCTGTGCATC AT CCAATTTGGTCACCAG CTGAAAATATCAGAACTTTT CC CTTTGAATATGTGTGGATGCTGATGTTT CAAG AAA TATTAT TCAGAATCACTGTGTGATATACTTGGACTATAGATAATGC CTATTAATGG TGGCTAGA TTTGCCTGCTC ATTCTTAGCTTTATGAACAGTTAAGGCCTGCTCTGCTGAGGAGTTCGTGGTGTTAACATCTCACTTACTTTCCñT GGGAGAACCCTAGCTGCTTAATGGCACTCATGTATAGAAACATCCTGGATATTCATGCCTCTAACTGTCCCATGG TGCTTTCATTCATCTTAGCTGAAATATCCTCCCTTTTCTGTGCACCTTTCAAGGCCCAGCTTACGTCTCCCCTCC TTCCTGAATTCTTTTCTGACCTTTCCAGCCCTCCCTGATCGCCAATTCTTTCAAATTTCAACAGCCTGAGCCATC CATACCAG CTTACCTCTTTATGATAAGCTG CAATGGATAG CTTC CTGCTGTTTAGCAGGT CAGC CCCT CAGC CAA AT CACATC CTTTGAAATGGGCAGAGACCACG GCC TTTACTATGG CTTTGTGCTCTACACACCTTGCATGAC CACA TTCATCTTGATTCAAAAAAATATTGTTTGGTTGATTCACTTCAATTCTATTTTCTTTGCGTGTGGAAAAGCAATA TGAGG C CCAGTCAC CAT CATTT CTATAGGAAAAT CTAGAGGG GC CACGAACATTT CAATAGTTCTGTAAAAGAAC CCTTATTCACCCAGGACTATATTGCAGTAGGGCAGCCTGGAGAGAGGGCAGGCAGTTTCCCAATTCTTCCTAAAC TTGCTT CATTAAGT CATT CTAAGGGAATGTT CAGATGCTCTGACAT CT TCATATTTAAA CATAAGTATAAAACCC CCTC CGAAATAAAACCAAAATGATTTTTTT CCATTATC CTTCTGAGTGTGGC CAAGTGAG CACTGAC CACAGTGA AACTGG CTAATACTAATACTGTGGTGTCAT CATCGCTC CC CT CCTT CCTAftC TCACAGGAGGAAATCG CAGT CAC AT TTAAAAAG TGCT CTAT CATGATTTGAGTTGAATTATAACTAGAAGATCTT CTTATAAT TTTTACATATTCTGA ATGT CAAAGATGGñAAAT TACT CTTACCGTGCCATTCAACGG CT CC AT GATTTTTG AC AATT AAAGTTfiCTGTGG CATTGCTTACAACCGGTAGGCTTGAGTTTGTAAAGCAAGCCCTGTTAAGACTGGC(N) xGGAAAAAAAGACTGGC TAATTTCCCCTAAGCCCCAGGACTGCAAATTAGGTTGCTGCCACCAGGGTTTTCACATGACTCAACTGTTCTCCA AG CT CT CT CTGCTGAGGCAAAG CATTTAGCACACAATG CAGT CAATAC CTTCAAATGAGATAAT GAGTGTGAATG GGCTTGGAAGTGTCCCCCTCACTGCAGGGGACCTAGCTCTGACAGCCATGATCAGAGAATAGCAAAGACTGTTGT AGTC CT CAGC CAGG CTGAAGGACTCCAAGG CAGAGGAAAAGAACTCTAAGAC CATC CGTTTCTG TATTTCTGCCC CAATGGCTATTCCACATAGTGGTCCAGGAGGCAGGGCAGAGAACCTGGAAGTAGGGGGCACCAGGAAGTAGTCAC ATCCAAGTAGGGAGCACAGCAAATGCTCAAGTCCCAGCAGCACTTCCTGCCCGGGGTGGGCATCAGTGAGGTCAG GCTGGAGAAATGCAAACTCTGGACTGGCTTGGTCACTGCTGTTCTGTTCTGATGAACTCTCCTTTGAAGGCCTCT TGCACCACTGGCCCTGGGGCACATTTGCAATCGTTAAGACAGCACAATTCAGAACTGCCTTACACCAAACTGCTG GAGCATAAAAATGAGAAGAACTGATGAAAGG GAAT CAGAAGC CTTGGATGAC CGCT CCATGCCATGGGATAC CAC A G { N ) xCCACTACGTATCTGCCACATCAGCCACGAGAGTCTGGGTTAATTTTTTAACTTCCCCAAATCTTACACG ATACTTATTT CACAG GGT TACT GGTGTAAGTAAGATAATTAATT TGGAATTTATCTGTAAAATGACAGTTTG CAC TGGTTGAAGTAAAAGTTAAAG(N)xAAAACATACAATAATAATCTTTAATTACAAACTTCCGATATGATGTAGGA CAAGATTT TAGAAAGTGCAACACTTAGGAAG CAG CATATT TGTGTAAGTTAAAGCT TTACATA CAAGTTACAATA CT CTTAAC TCAAACTGGTTTCAACAATGAAGTGAGTTCAT TGTTAC CGAGATTAAAAAGG CCAAATAGAGTG CTA GCTGAATCCAGCCAGCTGAATTGAGAGTCAAGGCTTAATCTAGCAGCTCAGACAGTGTCATCGTAGAGT(N) xT T AACCCAGGTTTGCTTTCTCTGCCCAGTAGTTTTGTGTTGGCAAGCTTCACCCGCAGGCATCACATGATACATTTT CC CCAC CCACTTCCTGTñGCTC( N) xCATGTGGCCATTCCATTCAGGAAAATTGATGGACTTAATCAATCATGCT CATGTCGAGAGTGAGTTTGAGATCATTCTACACAAATCACAGAGCTGAGTGTTAGGAAGGGGTGAATATCCCCAA GGAAGAGT CAGAGATCACTGGCAGTAGAAGAAGTGAATGAAT CC CGGGAGGTAAG GAGAGTGCT CCAGGCCAGGG CAGAACGGTGACAGTGCTGGAGGAGGAACCGGCTGACCTGGCTTGTA(N)xATGTTTCAGTTTCATGTATGGCTA TGTTCTGTCATTAGTCTTCAGGGTTATCAACTGCAGGTTTATTTTCTTACCTGATTTTCTCAGTAGGCCTAAATA GAACTTGC CTTGGCTTTGTTTCTAATGCTAGAAAACTT CCAACT CTGGTGGAGATTTGAAATAT GCCATCTTCTA TTAACTAAGT CATATTCTTTAACTAG CCAACTTT CCCCTACA GTTTACAACTTGTGTCAGACACA CGGAAGATTT GCTTTTGGTTTTTCCCTTGTGAACAAAACAAGTCATAAAAAAAATTATATCATGGTTTCCTTACAGTTGCATTTC AT CCTG CATTTCTTAAAT CAAGAAATTGACTAACATAATGGCTGAATGAATAAGACACCACAGCAATGAATGACT GC CTTTAG CAAACT GAGAGGGAGTCT CATTATTTAAAT GTTC CTGGTTATATTAAGTGGAGCAAAAGG CTTGGAA TCAAAAGTGAGAGAAATAGGCAGCCAAGTCAAAAATGCCACCAAGCTGTAAATAAAATATGAAACTAGCTTAGAA AGGGAGCATTACTTCGAAGTCCAACACTGATATATGGTCCATAATTGTGGGACGTTTGTAAAAGGTGACATGACA CCTGTCAATTATCCACAG CAGC CAGGAAATGTAGATTACATGAATAATTCAC CAAC CCAGTGCC CTTTGTTGCTA AG CTGT CT CAGCCCAGTT CCATAAAATAAC TGCTGATATGAAAGAAG AAAAAAATACGTTTAAGTTGCAAAAGCC TTATGTTACACGCG GAAATGTTTTCAAGGCAGCT TCAT CATATGAC TATGTT CCAGTTAG CACCGTTGGCTAAAT AATTAACATGAGCCTTTATCAGGTGTAACAAAGGAGGGGGCTAGGACTCTGGAAACTTTATATGTTCCCTTTGTG GCAGGTGGAAGCAGGGGGTGCCGCATAAGAGAATCTTCAGTATCAGAAACACATCACTACATATGTCATAATTTT CT CAT C CT CC CAAACAGCACACGATC CTTTAAAAAGTCTAAGTC TGTGTTTTTGTT T T G (N) xTGGTGAGCAGGA TTATAGAGGATGGCAGGAAACTGCCTGGTCAGCAGCCCCTACTCTGCACCCCACTCCCCACCCTTTCCCAAGCAC TTACAAGGA(N ) xTGCAACACACATACACGTGTACCACTGTAAGTACAGCTTGTCTACACAGCCATGTGAGCCAC CAGGATTTGGAGAAATACCAGTTCTGCAGCTCCCGGGACGGCACAATAGAGCATGAGGCATGGAAACAACTGACT CTTTCATTTGTAGCCTGGGCCCAGGCAGCCTGATACCTATACCCACCCCGCTCTTTCCCCACACCCTTGCTGGCT ACAGGGCCTGGGTCAGAGGTCATGGGCCAGGTGGAAGGTGGGGCCTTGGAATGCCCAGGGGACTGGGCAGGCAGC CATAGGTAACCAGAGCCAGCCATCTAATCCATCTGCTTCCCTGGACTGCTGTGAGAAAAAGCCTCATTGGAGAGA CCTACCAAAGTGGCTG CTTCCAG GAG CAAATC CAGC CACACTGTGGAGAAAGATTTTCTTG C CCTTACAAG GTGG TT CATT CCTTGTAACTTC CTTC CT TCAAAATGAAGT TCCAGCAGACAAAGAGAACAAAAAAGAGGT CTGTTTCTC ATGGTT TATAAGGAAAAGAACACCTTGTAGATAAAGTACAGC CTTTCCCATTTT TCAGAAGCAAATGAGTGT TT C TT TCTCATTTGAGG CTTATTTG CAGC CAAGGTTTGG CTTTGTTT TTGAAAGTTTGGAGGAGGTTACATTCTTGTG TATACC CACTG G GTGCAAGTGC TTTGGCACAG GTTTTCTAATATTTGCTGTTTT TCTCCAACTGTC CCAAATGGT AGTTTATAGTATTT CAG CTCTTTCAGAGGCCAAC CTA CAGG GAAAT GCAATGAAAAG GGAACTGAGTTTCTGATT CAGGTGAAAGGCAACTGTAATT TCAGGCTTTCTG CATGTATCTCTTTGCAAT CAGT CATTACTT TTGAAATGAAC ATTATATTTAGT CATA TG CAGAAATAATAACAAACTGCTGGTTT CATCGTGAAGAT CTGAAGTTGAAAACTT CAT GTTTTAACATTGAC CATT CCAACAAG CGTAAAACAGGAGACATCGAACATCTTGACATTACAGCTG CGCT GAGA C TATTTCTCCAGTGCATATGTCACAGGATCCCTTTGACAGCGTGGCTGAGAGGGTTTCTTTCAACAAATGGAGGTA ATTGTT AAATGAAT CAATGATTGC CT AAGCCAAT TAATGCCTTTTCTAATAATG CTGAAATGTTT CT C AAGAAT A GT CT GGAAAACT CTTT CATAGTAAGAGACTGGTG GAGCTGGAGCAG C A G A (N ) xCAGCTACCTTGTCTGCCTGGA AAAACCCTTGCCCAGTTGG(N)xGGATTTTATCTGGCAACGCTGCCAGGGCAACTTGTGATGGGTTGCTGGACCT TATGGG GCTGTC TGAGATGCCAAG CGAAATCAGT CTTGGGAT CCAT CCATTTGTTCTAG GACAT TAACTT CT CGA AGCACTTGTGGATTGCTCATGGCAGGGCGATGAGTGGGCTCCTGTTCCACTCTTCAGTGCCCAGTGAGAAGCCTG CGGTGAGGCGTTGTCAGG( N ) xG CAGGCGCAAGACAGCAGAGGGTGTGTAAGAAG CATCCACAAGT(N )xTGCAG TTAT TATT CTGAGAGTAG CAGATAT CAACCA CTGTAGCTTGG CAGG CTGGTGTAT CTTGAA CAC CTGACAACTAA GTGTGTTGCCAGATTT TGATCAATGT CTAGAATATATTTTTC CCACAAAGGAGG CTAAA CATTAT CATCCCCTTG TGGGTTTGGTTGTTCTTCAGCACT GAGAAGAATTATGTTGGTTC CTAGCTGCAGTCTTTCATGTAATCAAGAGGA AAAGCTTTTCATGCCCCATTGAACACGTGGTCTTCCTTTGGAACCAGCTCGATGCCCTGAACTTCCCACATCTCC CTCTCAGCCAGTCCCACATCTCCCTCTCAGCCAGTACCCTCTGTCCTGAGCCAAAAGGTACTTAGCTTGACACTA TTGG CTAACATTGAGGTGA CCATATG TATCCAGT TGTAGCAAAAATA CATCTTACATATTGATCAGAAGACAAAA ACCACATATATCTCCATATACCAAAATTAACAGCAAGTTTCTTCAATTCTCTGAAACTTGCTTCTTTGGCCCAGT GT TCTT AATTTGTT AATT T ATC AAGAAATTAT AAAATTTGAAAGTGGGCCT C AAAC AC AAAATT GCTG AAAAATG CTGG CTTAAACAGG CACTTGAG CAATGCTCCT TT CACAAAG(N ) xGTTTTCATGTGTGTGTGTGGGTGGTTTTTG TGTGTGTGTG <N } xGTGTTGAAGGAATAAGTAAACAAACAAATGAGTATGTAATAGCTGTTTAAAATGCACTGCA AATGATAAAGTACCATGGAAACATGGCTGGTTATTAAAAGCTGACAATTTTATTGCTGTTTTAGACTTCTCCTGC CCTTTAGGACAT TATG CT CCAAGGAGTTTACA TTTTTTTAATTGAAAAATTTTCTTAGTCATAT CTATATTTTCT TC CT CT GACATAAAAT CTTCCAGTGA CTTTTTAT GAAGATATTTGGGTGTTGATAGGAAGAGGG CAGTTAATGCA CATAATATGAAGTCTC CACTAG CCAGAAGACACATC CTGATAGCT CAGTATT CCAAACTTAAAATGAACC CACTG CATGAAAG CTCAATTC CTACAGAAGGTAGAC CAAGG GAACTGTTTTTG( N ) xGGGGACTGTTATTTTTTTACAAA AAAG GGAT CTATAAGC CT TAACTGTCATGTGTGTAT TATTAACACT CTGCAG CCATGGTTCTCCT CTGAAAAAAA AAAAAAAAAAAAGAAC TAGAGCAT TT CTGAGGGGAT CTGGTGCTATAGTCTATCCTTTGAGTTGAT TGATGACAA ATGAATCCTCGCACCTTCTGTTTTGGAAAATACAAATGCTGGGGATGCTCCAGCTCAGGAGATCTTCCAAGGGAT TTTGAGGCTGTT CTGAC C TGGACAATGCTAGAGGAGAGGATACC CTGGGCTT CCT C ACTT TACCAAAGAGAATC C AGGCACAT CCTTTT CTG G CAGT CT TGAAGGGC CTAACTATCTAAGAAAGAAT CTTCACTC CCTGAAATGATT CT C AGTGGAAT TCTTTAAAATATGAGT TATCATAGTACATTAAAACATGAGAGTACAAAATGACAACAAGTGTG GACT TAAG GAAAGGATTTAT TTTTCCTTTTCTTTCTCTTCTATGTTTTCTATATGTGGCTAGAGTGGTCTTACATAAAT AAATGAACAAGACAATTTATG GAG CC CCAAGC CCTG CTCGAAGCATGCTATATATGTTATTT CCTTTAAG GAGAG AAAT TAAATGTGAT TT CCAAAAGT TGTTTTGTGGAATAATTGAGGAAGACAT TATTGTTCTACCTG CAACATTAG GATGAAAG CACAGTGAGCTATAATAGAAAACCTG CCACGTTATT TCCTTCCTGTTT CAAT TCTT GACTTGTATG T T C CTAGGC CAGACTTC CTGGAGGT GAGCAGCCAGAGTGGACAGCGAGAGCAG CAAAGGGTAACTAC CACCTTGTG AGTGCCTCCAGG CAAG CTGCAT TCTAG GCATC TT TTATCACGAACAAATATC( N ) xAACCGAAACCCCAACCCCA GGG TTATG GAGCATAATATAAGG(N ) xTCAACACACAGAGTCTCCACATTCAGAATGGCACCTGCAGCCTGTTTG CTGAAAACGGATACAGTAGTGACAGAGAGCCTGGAGAGCAGAGAAAGGAGAGGAAATGATGCATGTGGGACAGGG CCGGGAACAGAGAAGAGGCTCTGCGGAAGACAGGAGAAAGGGAAAGGGGAACGGCGCTGTGCTTTGTGGAGCAAT AGTG GAGATAGTGCAG CTTTCCAG GACACAAGGAGAGAAAACAAAT CAGAAAATGG GTAAGG CAGAGAGATTTAA C AGAAG CTTGGGGCTAGGGGAG AAGG CACTGGAAGGGG AAAG AAGG AGAAGGTGAA C AAATG AT AG AGGCTA CAG TACAAATGTCTGGGTG AGTC CCAT AC CAC CAG CTTATGG AAGGC CAG CGTTGGGTGGGTGAGGT GAGAGCTC AC C ATGCACCCAGATATCCTGCTGTTGCGTGCAGAAGAC CACACACTG CAGCACCAGGCAGCCCTCTCTTCAT GAGG G TCACACACATGTGAGGATTGTAAGACGAGGCCCTTGGGGCTCTGTTCACAACAAAAACATTGTACGAGCCTTTTA CACATGAC TCAGTGTGTATTG TGTGT CTGACTGAAAAAAGAT CAGC TTTAC C TACAAGTG CTAACGTGTTGG CCG AGTT GAAAGGAGTG CACAATTACATAAGAAATAAAAATACCAG G GGAGCAAG CAGTTTACCACCTTGTATGGGAC A CGCTGAATCTC CAGAG GAAAAGT GCCAGGTGACTCCTTTGCCAAC TGTGAGAATAGCTGTGGTTGGAACTGGGT GTTTTGTTTTGTTTTGTTTTGTTTTTACCCCAGTGTTTGGCACAGAGTCCTGAACACCAAGTGAGGCCCCTGAAC CCTG CTTGA CAC CAGATTGCATTT TCTAATGT CATT C CTGTTTCACTTCATTTGAGAAGGACTGGATAA CAACAA AAGAAAAAGCGAAAGAGTAATTAAATTTTGTTGTATGGGAAAAGTTTATGTGCTAAGCCTGCAAAA( N ) xATCTT TAAAAC GGGAAAGATC CCTTCCCC CAGTTTGCTCAAGGGAGG( N ) xGGAGGATAAGGCTGAAACAAAAGGTTGGC AAGGTG GAAGCT CTTT GAAAAG CACTTATTAATATCTGTTTTATAC TGTATGCAGTGTGGCATAGTAAAAA CTAT ATTAGATGAGGAATTAGAAGAT GT GGATTCTGG(N ) xGTGCCCATTCATTCCCTTGCCTACCTGCAGCAGGAAGG CCGGAGAATCTCCCATTCCCTGCTGGCCTCCGAGAGATCTAGCTGATAGACTTAAAAAAGAAAAGAAAGAAACCA CTCACACATACCTGGGAATATATCCCAGGTACTTTAAGTGAAAAAAAAAGATAAGTTGAAGTACTATGTGTAATT CTAATATATAAACATTCCATAATATTTTTGGAAATATAGATTTGGTAGAAACTATCTCTTAGGCAGGATTGAAGG AAATTTTACTTTAGATCTTACACATTTCTATATTGTTTGAATCTCATACTGTGAGCAAGTTTCATTTGATAATCA AGATAATAATAAAGATGTAAAAA(N) xCCCCTTTGTAGTATTGACAGGCTTTTGGAGAATAATGAAACGCTGAAA TGTTTGGCTTGGTT TC CCTCTTGGGTAGAGTCTG GTGC TCAGTCTT TTCTTCTG CCTCTTTGTATC GTTTAG CTT TTGGGTGGAGGCCCAGAGAGCTCCACAAGGGGTGATGTGTGTTTAAGAAACTCAGCTAAAGGAAACAAACTGATT GCAAAGAACTCTGAACCTAGGGTGTTGGGGGAAATGCTGAGCATTGTAATTGGCCTAATGCTGCTCCTGTGAGAG ATG C CATCTCCCTA GG CCTCCAAAGTTCTGGCAGGGATTTTAACTCTT CT CATC CAGCTGATGACAGACAAATAT TCAT CACTTCAACAGT TAGGTG CAACCCCCAAAATGTTTC CTCAAGTCCTTCCTCA CTGCAAATGC CTACCAGCT CATCTC TGAAGATGGAAAAGCAAG CGTGGTGA CATCTG CAAATCTGGA CAGTGGGCTTTAGGAT TCATCTGAAAA GAAAGG CT CTTGGACAGTGCTG GACCTTG CTC CCTGAG CCAGGCTGTGGGTCCAGTGGTGACTCAG GAAGGAATG CCAC TGACAAGC CACAGGCATG CCAGAGAATTACTCAG CACAAGC CAACAAGTAAC CT CATACGAGAATCAAACG CCAGCCAATCTGCGAACGCTTTGCGTAACAGCATATTTGGATTTGGGGCTTAGCTTTCAGGCTTGAGTTTCACGT TTGC CTGAGCATAC TC CCAGCCTC CCAGGG CCCCTTCTGTGTGCCCTGCCTCTGATTG CAAGTG CAGAGAGCAGT TCTC CCAAGAGC CAGGGCTCAG CT CACCCC TG CAGGTAACGCAGAC CT CTGCTAGGGATCAGCT GTCTATCATCT GGTGGCGCCCCTCTTTCCAGCCAGCGCTACATCTTCATATGCCTTTTCTTACCAACAGGTCAGCAATGTGGGTGA CAGATACTTCTTTTTGCTGTTGGAACACTTCATATTCATCTCTGGTATTGTCATGGCCTTTGACATACCTTACCA ACAAGAGAGGTTTGTTGACTGATGACAGCAGGTATAACAAAATCTAACTAGTGACAGTTGTAAGAATTGGCAAAG TATTACTG CATCAACAGGTATG TGGACTGT CTAAGAGC CT CAGAAAGGACAAAGAAAGAAGCTTACAGATTC CGA TGC C CCTTACTACTGTATCTGG CCTTTCTC CTGC CGGTATAGTGAGAG TTGACAGTTTGATGGATTTAATTTAGG ATCTATGTTAGGATGAGATGCACTCTGTTTTCCTAAACAAATTTGCTGCAGAATGGCAGCCACAGTCTAGGGCCT CCTTGAGCTTGAGGCTAAATGACCTGATGGCCCACCCACTTCATCTATACTGCCTTTAATATGTCAGCATTTCCT CAAGGC CTGGGT TT CGTGCACT CAGGGTGATATTGAACAT GAAGTTTT CAAACAGCAATACCTT GG GTAACAGAC AAGGGCTAGTTAATGATCTCAT T CATAATT CATC CCGTGAAG GAAC CATT TACACT CAAACTC CAAGCAGGAAGA AAATAT T C CCATTAGGTAGGGC TTTTTAAATG CCTCTG CCTTTGCCACAGTGCTGAGAGAGAAC TGTGGTCTTCA GTAGTAAG GGCATATG G CTGCAAACAGAGTGC CCAGGTTAGAGACACCAAGAGCTG CAGCATCACT CAGTAATAG CTATTT CAGAGGGATGTAAAGC CTTCCCTGGGAAAAAGTTGAATAGGACT CAGTGC CTAAACACAG CACTTT CAA AAAGAAATGAAT CT CTAATTAAACATCCTGAAGACAGAAG CTGACATG TCATTG CAGA GCTAATAATAAGAC CTC ATGAAATGTTCAAAAGTTCTCCAGAGTAAGTGTGAACAATTACATTTCCCTCTCTCCATCAGTGAAGGGTTACCA CCAGTT CTTT AAGAAAGAGAAAATGAAGGTTAGT ATCTGAAAGTATTA T ATAATGTGCTT ATTA CT ATGCGTGGG TTGAGATATTCTCCAAAAAAAAGTTCCTTTTCCTTCCAGAATTAAAAAGAACCCCTCTAACTTTTGTTAACTGTA GATAGAAT CTTT TCTTTTTTTTTTAAGGAAGAAT CCTTAC TG CATGTC TAAAACTCGTGGCTGAGAAAAAGTAGA GGATGTCCAGCCTAAGGAGCTATCAGCATTTTTTTGTAGCACTCGATATAGCTGCAAGCCAAGGTCCTCACGAAA GTGAAAGTTTTCATTCAAAGTTAAAAACATACTACTTGCATTTTACAAGCTCAAGAGTAAAGCACAATAATTATC AGTGCTTTATTGGTAGTTCCAAGCCTCCAAAAATGTCAGTAAGTTGAATCTACACTATCTTTGTACAAGAACATA AAACATAG CCTCGTGATAAAAATTAAAGGAAAG GATAAATTTGGTGAATT CCCT GCATAGTTCATAATAGAGATT AGT CAACC CAAGTTA CAAAGATAA( N) xAAGTTACAAAGACAATTTTTAAAGGTTTAAGAGGTTTGCTTCAAAGA GCTA( N ) xTGGCTAAAGAGCTATTTCTTTTATATAAATTATATGAAGATTGCTAAAAGGTTTTAAGTCACACTAC ATGTAAAAACCCCTTAGGCTCAAACTCACCAGGAGGATCGTGGAGCTTGGAGCTAAATAAAGCCCAGGTGCTTTG CCCT CTGCTGCTGACT TCGGGT GCTGGCTGGGAAGGAT CTATTTCCAGGAGGGACC CAGCAGAACCTCGGCGTC C CACGGC CCTAATAG GCAAATATGAGCCGGC CT CCGCCC CTTAAGGATGGAGCTG CTTAGCTTCG CCACTGCTGC C TGGAGT TC CTTGTCTGAGATAGGCAGGGCGTGGTGCCC CAGGTCAT CT CC CAGG CATG CTTGCC CCACGAGCAG C GCTGTGAG GGTGTACT CAAGGCAATCCAAGTTTCAACGT CATACCATT GT CTTTAACAAAACCCTG CAAACATG T AGTGACCGAGGCATGATGCATCCTGCCTCTTGCTCTGCTATTTATCTGCTTTTCTTTTTTTAACTTTAACTTGTT CTCCTC CTATTTATTG GGCAAT CACCTAGATT CTAGGAAACAAATATATT TAAT TTTT CTAATC TAAAGGTTAAT AAGGACTCTATCTTCTCTAGTCAGAGCCACAGGATGAACTTTCTCAACCCATCCTCTTTAAGGAAAAGAAGTTGT TGGCTGAGTCACGTGAGTGTCTGTTTACACACGTCTTATAACAGCATACAGGGTACATACACAGAGCAGCACACC TCCCCACCAGGAAGGCTGTGCTCCATATTCATGTGTCAAGTGATGTAAAGGTTGCCATGGTTATAATCTCTGTTT AGTCAATAGTGTGTGTGGGGGGGGGGGCGGGGAATCACCCCAATGGGCGTATTGCAATTTAGAGGAATCCCTGGG TACTTTTAAAGTATTGAATCACAATTTACATT TTAAAAAG TGTTGT CAATGACAGCATATTGGT TG CCATTT CTT GAAAACTCTTCTTTGTAGGAGGACCTTATGTATGCCAAGAACATCGTTTTTGCCTCCTTAGGTGAAACTTTATTT CCCACATTTAGGATTCATTAGGATTATCTAAGGTAAGTGGCAAGAGGGGAGAAGGATGAAAATCATGAGGGTGAA TGTGGGAAAGGGATGAGGGAAGGTGAGCTTAATTATTTCAAAGATAATAATGAGTGAGGCTCCC( N) xGGAATAA AAATAAAAGTGTCAGTAATTCTTACTCTTCT(N)xCACCCAATCTTATTCATATTCTAGTCAATACACAAAGAAT CTGCTGTG CTTTTGGATAGAGAGAAATGCTTCATAACAACAC CATCACACATTATTGAAAAGAT TTTGGTGTGAT TAGGGC CGACTGACAT CCAAACTT CCCTCC CACC TTTCTGGT CATATCT CCTCCTTTT CCCACACT CAGTAT CTT TCCTTTAT CTACTCTAAGCAGTAACATGATAAGTTATATAAATTATATGAAGAT CATTAAAAGGTT TTAAAG TCA CACTACATGTATGACTTCGTATTTGAAAAAG G CT CAAAACAGTAAGTGAAAAAAGTGACAGTTGGT G GAATT TTA GAAGTTGAGGGTGGAAG GTGTTT C CAAAAC CCCT CAAAT CA CACAO CCTAGATGTGGCTGCTTC CT G GATC CA CT AGTGAAGCTGACTCTGGCAGGGAGTCACTTGCATTTTTCACTAACTGAGGTCCTGCTCTCTCAAATTTCACAATC TAAGGGAGAGAATCTTGATCCAGAGTGATAACAAAACAAGTAATGCAAGCTCAGCGATGTGGGATAACTCCCCTG TC CCTGGTATTAG CATGGA(N ) xGAAAATGTGTATGAGTAAAAGGAAGAGATAGGTATGTTGACAGAAGACAGCC TGAGGAGTTTAGAGAATAGAAACATT T CTCAAGAACTT GAAAGCGAAAGAGT CT CTTGAAAG GGTTTGTTGCACA CTGCCTGAAGGCTGAGGTATGGACTGGAAAGTGACTTAGTCCAGCCCCTTCTAGCTGGGTGAGTAACTGAACACT CCAG CG TGCGGATCTT CACTGGAGAGAAATGGGAGCAT GATT CATCTCTTTGACAT CATCTTTAGATTTTTGGTT TAACTTTTCCTTTC CACATCAGACGC CC CTGTTGTTG GGTGGTCTGCCATGC CACTTCCT C CAAGT CC CAGCTAC TT TACT CTAGGTAGAGGG CTGAGGGTAA TGGATCTTAT TT CAAAAGGAGAGG GGAAATTGAAAC CAGAACAGCTT TGTGATGAACTGTAGGGGAATCCTTTTTTCTCTAGCAAAGGTTTCCCAAGGCTGGTGTCTGCAGCTTGGAGCTTG AT CGTGTGTGGTGTGT GTGTGTGTGTGTGTGTGTGCATGTGTGTGTGT GTGTGTGTGTTTGTATTTGGTGAGAG T TAACCATTCAAAATGGAAATCCTTTCATAATTTGATCTAGTTATGCAAACACAGCCCATTGATGTTAACA
> H s 7 _ 5767679 6 - 5767950 2
CACC CCTTGATCTGGCAATTCGACAT CTAAGAAATGTT CATTTCTTCCAG GATTTTTTTTTTTGAGTCAAGACTT CCCT(N)xCACAGACCCTCCTGATTGAGTCAAATCTCCTGTCACTTCTCTCACACACCACACACCTCCCTCCTTT TC { N ) X CTGAATCTATATTTCTTAGTATGGAAAGATGTTTGAAATATGCAATTAAGAGAAAGAGCAGATCATAAA ACAGCATGCGTGCTGTAGTACACACTGCAAATTCCCTCCTTTCTTGATCATCTTTGGGGGACCACATGCGCACTC TCAGTCAA{ N ) xAAGAATACGGAAGTGACTGCTTTTGCTGGCCTTGTAAACTCATGTAGTGGTTGGAGCTTCGGT AG C CGT CTTACAGC CATAAGCTAAATT C CAATAGACTCACAAAGATGCTGGG CCTAA CAT TGTT GAAC CAGTGAA C CATAG C CAGGAAC CATAGATACT CTAACTATGTAAGAAAAATGAACCTGTCTTTG TTTAAGTGA CTGAACTTT C AGATTAATGCACTCCCAACAGATACAGAGCATGATCCCATATATACCTGTATGTGCAAATATGAATAGAAAAATA AAGTGATTATCTCTGCGTGGTAGGATTCTCCTTCATTTTTTTTATTTTATAAAGTTTTCCCAAAGGGCATGTAAA TAATAATCAGACAAATATTTCTGTACATTCAGAAAATAAACAATGGTAC(N ) xAGAAGATAGTTCACTACCATAT GATAAGTTGTTTAGCCACGACCGAGATGAACACTTGTGGGTATCCTGGAGGACAGGTTCGGCAGCAGTGAGAGGT CC CCTC CAGAACAGTATGTGGTTC CAAACT C CTGAGGGAG CGGGGGCAGACC GCAAGGAAGTAGATGAGAACCTG AATGAGGCTCTGTGGGGAGAGAGGATGTTCCTCAGCCCTCACAAGGACATGTCAGTTTCACTTGAGATTAGGTCC TGGCCTTGGCTGGGCAGCCTGCCAGGCAGGTTCTCAGCAGCATGGGGTGATAACAGGAGTACGGGAAGCCTGGCA AGCTGGGGCAGCTCTGGGGTCTTCAGGTGGGGCCGTCTTGGGCAGCAACTGTGTTGGCTGTAAACACAGATTTCC AGGAAG CAGAGCGTTG CAATCTCT CC CACCACAAAC CTG GAC CATCTTAGTTTC CATAGCAG CAT C CATGTCAGT GAGGAATCTGGACATGATCTGTGATCCCCTCTCCCACTTTTCCTTTTCTCTTAGAATGGGACAGAGCAGCCTGGA GTAG CTAGTGTC CTGTAGTGACTC Cñ GTTTGCTGCT CT CTTC CTTATCTT CTTCCTCCTGCTCTCTGT CTCCTCT G CAT CCTTCACGTT CTACATGAG GñAGCTCAAGTCACCG GGCTGAGCCTGGG CTTCTCAT CTAAAT CACGGAGTG CCAGTCAATGCCTCCCTTCC( N ) xGGAGAGCTAGACCCTGTTCCTTATTCCCTGTGGGACCATCCGCAGCCTGTT TCCTCACCCCTGTAATGGGACTGCAGTCCTCCCTGCCCACCTGGCAGTGCTAATTCCAGGCCGGGAGCCCTCCTG GGAAAATCACCAGCGTGGGTGATACGGTGGGTGTTTGTGTGTGTGGAAGGGTGGGGCTGCCACCTGGCCACTTAC AGACCCCATCCTTCCTCTCTCCCTGGGCTTGTCGCATGTCAGGAATCCCTCTCCTTTGTGCCATGAAAC(N)xCT GTGGACTTATAGTGGAGCTGGGATTAGGATGCGTGTTCTGGATTCCTGGCGGGGTCCTGGCCCATCTGCAGCTAA GGCCTGTCTTTCTGCTCCCATGAGGTCCCTTTTCCATTCCTTTCCTCCCCATGACAGCCACTGTCACCACCACCC AGGGTCTGTGCT CACTGT CACACC
>Hs7_130047281-130058242
TTTGGAGACCTGCCTTAT CAAATAGTAACATATGTTAAAAAG GTAAATGATCAAAATAGTACAG CACTACAACCA GAACAATGGAATATAATAGAAAACATGGAAAAGAATATAAATAAAAAGTCAATACATCCATATCACAAATCAGTG
G(N)xTTTAAAAAATTAGCTCTTTAATATAATGTGAAAAGAGTAAAAAAGCAATAGGATCATGTGCACGAGGTAA AAATGCAAGTAAGATTAAATCGTTAACATGGAAAGGAAGTCTTTCCCCCACCGAAGACATGCAGGCCCTCCTGCC TC CTTC CCCAAG GGAAGTATCAGTATAT TAAACACTTT CACG TATTTATGTCTCATTCCAGAAATAAAACATTTA CATGTGCTTGCATATATTTTACTTCTTCCCCCTTTTACAAATCCAGGAGAATTCTATTCATATTATTTTATACTT TGG GTTTTTGTT TTATTGTTGTTTTATT TTTACTTAATATAG CCTGGAAATTGTT C CAACTAAT CAAGATTTTTT CT CATC CATTGT CAAGTTAATATTGTTCTC CGTTGTACAAAGATATCATAATTATT C (N ) xAATATAGATTTAAT TGGG GTGAGGGGTTAGAGAGGAG GAAAG CGGACCAAAACATA GACAGTGATTATTAGTTATTTTTCTT CTTTATA TACTTTTTACGGTTTCCTAACTTACTAGAAATACACTTTTAAGTTAGGAAAAAGCGATAAACATTATAAAAGGAG ATTTTTATTTAT CCCT TCAAAGATAG CCAATATGCATACAGATACAAT CTGTTCCCCAGGCACT CCTGAGTGAGT TATTATCTGTGTAGAGCACTAGAAAAACACTAATATTACTCACACATTATGGTGTCCTACTTTACAAGAGCCTAC TTTG CTGTGCAAATTTAT CTCAGAAGAGAAAATTGCATGTGC CCATATGTTCTGGTGCTG CAAT CT CAACAATGA ATTAAAGGACCAAATCTAGGTGCTTATATAACCCAGTTAAAATTGGAGTACTTGTATGAGGAGACTCAAATTATC TCTGCTACAAGGAGCTCTTATTTCTCTGCTTTGTGAGAGGCATCTGTCAAAGATAAATTCCTGTCGTTTTCAGAT TCTCTTTATCCTCCCTAACAAAAAACAGTCCAGGGAAATGACAGTTGAGTTGCACAAACCCAGTGTTCAGACTTC CTGAGAAACTTGTC TTGAAATCAGAAGT CAAATTCCTGAC CTA CATCAA C CTTAA CAAACACATT C CT CTTCAAT AAA CAGCCACACACTGATTGCTGGGTACTT CAGGATAAAATAAAGACCTTAACT CTCTGTATTCTTTT CATGAGA GTACAACTAGGAGGTTTTATTTTCAACTTGGTATTTAAGAAAGGAGAAA(N ) xTAACTGGAAGTGTCTTTCCTTC CTTGGAACTCTCCCACAACATCCTGCATCTTTCCTGAAGGTACTTTTCAGTCCCTTAAG(N)x TTCACCTTGCAA TATACAGCCAACAAACGATGGTTT CCTTCCATTCCT CCATTTTAAAAT TCTGCCAC CTTAGTAATTTT GTGGTAC TGTTATCATTCAGCCCACCCAACACACA{N ) xACATGCACACATGTGACTGTCAATTTCATGTAAGCAGAGCTGT GTTTTGTTCACCTCTG CAGT CT CATAGTACTTAATGTGTAGGTACAAAGAATGCTAAAAATATT TGTTGAAAAAA GTAATTAG TATATTAAA CACAT CTAAAACATT TAAAACTG TAAGTGTTAGAGTCAGAAAACCAGATGGCTTTTTA AAATAAGAATTAACAACTAAAGTATCAGGTAG CATTT C C C TGAGAGTAA CAATCTAAT TACAAATAGATACATCA AAGT CC CAGTCC CTGGGAACAGACAGCAAATG TGTACAGGGTGAGTGT TT CAGT CAAATGGT CAGC CTGTTATTT TCCC CACACCCT CAATTCTCATTACCTT C CAG CCTTTGAATCTCCT CAGCTGTCACTT CCAGTGTTTGAT CAGAG AGGGAAGCAACTTGGATGATCTGATAGAAAAAAGAATGAAATGTGTGGATGTTTGTTTGTTTTTAAACAGACGCA ATTTTGTCTTTTACAAAAGACGAGGGAAGAACAAACAT CAAAGATCTGTTGTTT TAAGAAGCTGTCAGTGTC CT C AACTGAACGCTGAATCTCAGGC CT CCTT CCTTTTGTGTTG GTAATATACTGCAGATGATCATTT CAGAAGAAAAG CAC ACT TAGAAC ACTAC AT AAACTTAGAATGCTAAG AGGAAAATTT AG GAC AAG AAAC ATGAATAAGAGT TT AAA ACCACATCTGTATATATCGTGGTC CAAG CACTA CACATTATTTCTTGAAAATGCAATACTGACCTAG CATAACAT AGAATGGTATCAGCAATTAT CCACGCAATGAAGAAGTAAATGTAATAA CAAATACA CTGATTACAGTGTTTTATA CTTCTATA CAGTATTCATTTTTAAAGCACTTT TCACAACTTAGCTCACAAGACC CAAGATTCAA CTTGAG TTATG CCAGTGGAACTCTTGGAACTGGAAAGACACCTTCTCCATAATTTTCAAATAAAAGAAGAAAGTGACTTGCCCAAA ATAACAAAACAG CT CTGGCAGGATGATT TGGAAAAGAT CAGT CCAAGAAAAGGGGAAATCACTTGGTTTC CAGAG AAAAACGTTGAAACAAT CCAG GAT CCAGAATT TTAGATTC TAAATATTAAGTTAAATGTATAAGGAAAAAAACC C CTAAC CACAAGT TTAATGAATTAAAAGTAGGG CAAACATT TACTTTATAGAATT CT TATCAAAGAAGAGAGAAAC ATTTCTCACCATCTTCATGT GAACACACACAC CACATAC CAATTAATC TT C CCATAGCTGAACGACATACAAAGA ATCAGAAGAAAC TC CT CTGCAAACTTTC CC TAGAGTTGAATTTAGAAATACATAAAGTGGCACAAATTAGACAGA CTTTACTCCAGT GGAAAAAT CAAGAAACTTAC CAGCTGGG CAAAAGTTGTAACTTTTAGTCT CT TGAAAAGC TCA TCTTTTTTGTAT CTATAATCTGAAAAATATG GTAAGACAAGTATTTAT CTATTTTG CTTTTTTAGAGCTATT CTT TCTAGTTGTCAAGTGACGTGGTCTTTCAAGTTATTTATAT( N ) xATTAGGTGTGGCAGGTAGTAGTATCATCTGT TCAGTTTGTTTCCCTTTGCTTTGT CATTAGGAAGCTTAAAGTTGAATT TATAAG CTACCTTTGG TCTAGT GACTA CCCCTTGCCTTTATTTTATTAGTCTACCATTTTCCTTTAACTAAGCTGTCTCTACTTCGACCACATTGTACTCCA CGCATT TT TGTAAG C CGCCT CAAATCCT TT TCAGAGTAAAGCAGGGAA GAAATGTC CGTGTTTTA CAAATGAGGA CAGAGACT CAGAGAGG CTGAGCTG CCCCAG CT CAAACAGC TG GTGAGACGTTCATG CC CCACTATT CTGACT CCA GCTTAAAAAGAAAG CAATATTATT CCTG CCAT TACACACTAT GCTGAAGC CACAG GACAACATT T CTAAGTGTAT GCTGGG CACACAGCTGTGATTGGTTCTGAT CATCTCTCAGCTTCTCTCCAACTTCACT CCAATAAAAATTAACCT ACTTAGTTTCGC( N ) xGTTCTTGTTTTTGTTGTTTACTA(N)xCCAATTAATAATACAACTAATAATACAAAATA CAACTAATAATA CAAAT C C C A (N ) xCTTAAAACATCAATATTTATTCAGTGCTGTCTTGACAATTCTGAGCTATA ACTCACATATAGGTACTTCTGCAC TTGTGTTCATAATT CTTC CCAT CACATTAAACAGTTATTAAAAACTAAGC C AGTT GCAC TTTATCTATTAT CTGTTCCATA GATACGTGTTAAAGGTTAG CTTCCTTAAACTGTTTTGAAGATGAC AGTCAAATTCTGAATGGTTTT CAG TGTTGAACTCTGCC CG CC CTCATG CTTTCCTCCT CACCTAGGTTTTTATAT TGGTAC CCATGACT CTTTAACT GC TGCCT CAT TATTTCAGAATTATTAAAGGTTAG CT GGTTAATT GCTATGGC C TTCACAGCATAAATGT CTCATAAAC CAAGACT CCTAGCAACAAAGGAAAGGATTAAAAGTATTTAAGATTAT TTA CCTG GTGTAACTAATCACTT TCATGAATTTTC CAAAAACCAG CACAGACCTTAAGACT CAGAGT CGGGAAATGAT ACTACTTTAAAAACTAGCCT TCTGTAAATATAT CAAG G CT TTTTAAAAATAGG GAATTAAAC TT CT CTGTTC TAA AATGTCTTAAAGAGAG CAATTTGG CTAATAACAGACCATTAC TTCTAGGAAATG CT TAGAGATG CTGCTT CTGGA TACAACAAGGTAACAAACCTATATGCAGTGGAACTTCTAAAACCTGCCGGGATGCAGACCCAAATAGGCCGAGCT GAGG CTTTA CAAAGTACAAC CAAATAAC TACAGTGTTC CTTTTGATTGC CA CCAAC TTGCCCAC CAAAAGT C GT T TATG CAAATCAAAACAA CAATAAATCAG CAAATGGAAACAGC CAACAG CTTTCAAGTAGTTTGCACTGTGACATT TATCCACTGATATTCTTTTGGTATGGTTACAAGGTTTAAAATGCTTCTAAGCATAAAATGTATACTTTAAAATCT CCTTTAACTAAACAAAGATTAATTTAAACAAACAAAAAAAAGACTGCTTTATTCTAAAGATTGGGATTTCTACTG TCTT CT GCACAAG CACTAGCAG CC CCCTA CATATTTGAAC CTGAACTC CACGTAC CTGGTTCTGGTTTAGAAGGA CAGCCGTCAGTAGGACTTTTTGACATGATTACTCAGAGATAAAAGATCAAACTAGATGTTCTACAGTAAGTTTAA ATTTTTATAAAAAGGATGACTTGGAAGGCTCTGTCAGTCTGTCCTCGCAAGAGCGAACAGAGCTTTAAGAATGGA GCTTCTGCTATGTGTCTTCTCCCAAACTCTGTGTTGAAGACAATCCTAGTTTCAGATATGTTATCAAGGTGTGCT TCTC CAGTAAGT CTAG GTTCTAAGTCTGTCTC TCTCGAGTTACTGGAAGGACAGAGAC CTGA CTAT CTAGGATGT TTCACACTGGGT C CAACTAGTGT CATCTAACCA CCAGAAACCAAGAAT GGGCTTTGTAAATTGT CAATAGACATA GAGTGT CTATTTTCCC CAGGTAGAGTTG AAAG AATATT CAGATT AC AAAGGATTT CTGCATTTT CAGGGTT C TGT TGAAATG CATTATTAG CAAAG GGAAGTTGGGTTCACAGTG GAAGACAAAT CAAAG GAA GTAACAGAACAAG GTTT GTGGTTTCACAGAAAGAAATGG CATCTAAGGTAATCTGAGTAAAGT GATGGGTCTAGGGTAT CAAGAAACAAGGG AATATT CTCCTCCCTT TCAACC CCAAATAATA CTTACAACAGAAAATGGGGAAAGAGAAGGAAA TATCTT CTAAA ATGTAGATGCCAAACCAGAATCTAAAGCAATTCTGTCAGGGAATCAGCACAAAGTGTGTGAAATGGGGTAACAGG AATCTC CTGCTCTACTAAAAGGAAGCCCTC CACTCCTACAGAGGGAAAGT GAAG CAAG CATTTT TCAAGGTT CTG TTCCAAGG GTATTGTT CTGG GAGATCAC CT CCACGTAAGGTTTAGATATTAAAGAAAC CACATC GTACTTTGAAG AAAT GTTT TAAGTGAGGTTATGG CAACCTGGG CAGAGTGG GACATT TT TATGG GACAAATTT CTAGTGAGGACAT GTGAACT CACACTCAG CTGCAAATGGATGAG AGGTGACAGAGACGACAAAC CCAAAAAGAAG CAGG C CAG CTGGA TTCTGC CT GTCT TGGATGAGTCAAGGAT G G CAA CAATGAAGTGTAGGT GAGTAAAT GCGATC TTT C TCCAGAG(N ) xGTTCAGGAAGAAAAGAGAAGGATACTGTGGCATTACATATTTTTAAGAAGTTTGTAGCATTAGCCCTTATAGC TGAACATGGCAAATGAAAACACAG CATAAAGCAGCTC CTGAGTCCACAT CTATTG GTTGGGGTTT C C CAATATTG CAG C AATT CAAATG GAACTTTG AT TTCAAATAAAAGGT CTGCTGTGTAAAAG AAGAAAATGTTCTG ATGT AT TCT GGAT GT GCAATTTTAT CTTC CAAGAAGATGTACATTAC CAGAGCCAAG T C CTGAACTATGATTAAT CAGAGTAC C TCTTTACCACCATTAACTAAATATCCCCTAGCACTGGGTTCTCTTATTCAAGATATGAGCAAATCCATCTTAGAA AATñGATG TTTCTACTTCTT CCTTAATT CTGG C CTTGAAG TñTACACTTTA CATCAGTTTTG GATG CAGACT GAA AAATTATG ATTT TGTG CAAGGG CAATGC ACT C AATG ACTG GT AACATATCTTTG CAG ATAAGTT TT TTAAAAGG A TGTAGATGAAATATGATTTTTAATTAAAAGAAAACAGACTGCAATGAATTGATAATGCATAACACTAAGTTCTCA CGTC C C T (N ) xTGAATACTCTCATTTATACñCACAAGAATGTTCTGAAGGAAGATATTATTACCCCCTATCTCAA ATAGAGACAAAATAAGA CAGAAAAGATGAAATATCTTGAAGAAGG G CATGAAGACTGCAGTACAATGACACTAGA AGCT TAAG CAGAGCTCATTC CACCAGAT CTGACTCTGCAGGC CAACACT AATGT CT CCTATGGGATGGAAATAGG GCAGAATT TAGG CCTA CAGG CCACAATCTGAGAAGCAG CC CAGTACAG CACCCAGGAT CTCCCACAGCCCTTGAG GCAC CTGCTAG G CTTAT CGGAC CATCCATAGCTCTCTGTG CCTGTCAC CTGTGGAACACAATTGTTAGTTAAGAA CCAATT TATAGAACAACTATAGACTTCCACTC TCCCAAAC CATOAAGAGTGTTACTTACTTTTCTTAATCT CTT C GAGCTT CT CAGTATATTTAGTCATACTGTTAC CTAAAAAGAAAATAAGGGGTTTTAGATATACTTTAAATG GAAG CAAATGAG TAACACTGTGGAAACAGAATGGAGGAAAAG TATC CCCTAAGCTTTC CTATCCTGTT CTTAGGGGATT CTGGACAG CAGCAAACAGGC CACA CACAGTAAAATCTTTTGGATAATA CAGCGTTATTAAAAGT TG CAGAGC CGA GGGAAATGTTTTGGTAGAATGT GTTTAAG CTGGGATGCATAC CCCCAAAACATT GACAAGTT CCAG GAGATACAG ATGATCATTTTCTATCAGTTTGTTTTTCAAGT CTAGGCTTTC CTCCTCCTTTTATCTCCCCTGCCC CCGCTGACG ATCTTTTTGTGG GGAAGATACACAGAAGAGGAGCAAAAAG GAGGCAGTGT T CTTTGAGTGTCTAAAATGTG CCTT TGTGGTTACCCAAG CAACTCTCTC CTCT CGGCTTTGTCTT CACAGTTT CCTTCTCTTT CCTGAACTTACT CT TCT ACCGTT CATGCT CACGTTGCAGTC TGAACTAGACTACACAGT CTACACACTAGTAT TCACCATAGATTTGATGTG CTGAGAAATAACAG CAGATT CTTGAAAC TG CAGAGAGG CC CCAGCC CT CATGTTTTATAGTATGTT TCTATAATA CTAGTACTGGCCAAGGTGGTTT GGAGGT TTT CAGACTTAAAAAACT CAGC CAGGAACCTTCTGGTT T CCAACTCA ATTCT AGC TGTC AAGC CAGGGG CT ATTATATTTTACTGG GTATGCTTCGATTTG AT TC AATT C AACGCAGGAAG C TCTTTCTGTCAC CTAAAAGGTATGGCAAAAGGAAAACATT CAAAATAATTTCTCATTT CTAAAACTGTGCTC CTA ACTTTTTAAGCCTGGAGAGGATCTAGTCAGATGCAGCCCCAACATCAGATAAAAGAAATGCTTTCAATTAATAGT AGCATGAACTGACATTTTCAAG GACTATT CTGAACTTTTCTT CATGTTTGGGTCAGAAGCAGAAAAAGAAAATC C TTGC TC TGAAAC CCAAAATACATTTTCAAAAAAACAA C TTATTTCATTGG CTAT TATTAAAT CTTTACCATGAAT TCATCATGGAATAAAAATCCACCATTG(N) xTTCTTTTCTTTTCTTTTCTTTTTTTTGGTGTTAAGACATACACT CTCTGTAT CCAG GAC CAGAAAGGAGGAT TCAGAGCAGG CC TGGACCTGGAAGTT CCAAGAGAAATGAAGG CAGT C AGGGCTGCCAG GAAGG CCTAAGCCTTCT CAAGAGAGACACTGAATA CCTCTTAGAAGAGGCC CAAATATCAC CAA GATTAGAACCAAATACAGTCTTTCCTAGGTTCTTCAAACTGCAATTGACCCGAGATGTGACCAGATCTTCAGAGC AGAAG CCACAATTTCT T CCTGGACACACAT CTCTCCTGCCCACT
> H s 7 _ 1207324 45 - 12074 889 3
GTGGAGTT CACTTATTAAGTATAGATTTAT CT GAGATC CTATATTAAGTG CCAG GATGTGTTTGGTGCTT GG CAC TAGT CATT CAGTGGAGAAAAAAAAA(N)xATACCCTTAGATGCATTCAAAATAACCCTTCTCTTCTAACTTCCAA TGGGTGAT CAAG CTTGTAAACTTGACTCTACCTTGGGATATGTTTT CT CAC CATTTTATACACTGCTTAAGAGGT GAAAGGGTGGTTGATGCTCCTCAAATAGTAGTTAGGGGGTAGGAGCTTGTTCAGCTAAAGCCTCTTCCCACGAGG CTT ATTGAGCCT CACT CAAATG AAATTGGAGCTGATTGAC CCTGTGG AGCACTC AT AATGAC CAGAACATGñGAA GTTAACAGTGGGAC TG CATTTCAG CTCTGGTG CTGAGACC CTGGCCACTTATCTTCTC CATTAGGC CATTTATAG CACATATTTAAATCATAACTTTGTATC(N)xAGAGCTGAAGATTTTTGTTCTCTTCTAGCAACTGGCACAACTTA GACTAGG C TTCTATTGTTTT CAAAGCACAAGACGAACACT CATTGAGTGATTTCAAGGTCCC CTTTTTGC CATTT CTAC CTAC TGACATTCTAGACACTGGAACTGTTTTTCACAGTAGTATGTTTAGT CACT CTTT CTT C C CCAC CGTA GGCATCATGGTGAAGGAAAAAAA(N)xTGCTATCCCTTGCTGCCACATCTAAATGCTCTTATTTGAAATCAAAGC TAATATTGCTCCCTTTTTCTGT( K ) xGTTGTTTGTTTGTTTCTCCAGGACTTGTGGCTACTGAGAAGACTAGGGC CAAAAC TTGTAT TTAC CAAGTACAGTGTAGAGTT( N ) xG AGTGATTTCCTGGCTGTGACTTCAGTTTGTGCATAT ATTAGT TT CTAT TTGCTATATGTGAAAGTT CT GATGTTATATTTGCAGAG CTGT TATACCTGTAGG CTAAGTT CT AAGACT TAT CAATG CC CTCAGG CAGGAG GAA CAATTAATGTACTCATAAATGGTAC TTATTTATACTTCTAAT CT CAAAAC TAATTG CGAATTATTAAGTAAG GC CTTTATCT TTflTTGAAAATT TTAAAATAAGAAAATG CTGG CTAAA TCTAATTTTCTAATATCAAAATGCAAGACAATATTCAAACATCTGTGAAATATAATTATACTTTTATTATTACTT CCCTATTAATTATTAGTTGTAAGG CAAAACAT TAAAT CAAAT GCAATATGAATTTGGCATCTAAT CTTTG CCAGT GTAAGTGTTTTTTCAG TGTTTAATATTTGACTGTGGTGTATATACT CATATGTCACATAGAA GGGAAAGAAGTAA TCGAATAAAATGTTTTTAATTCATTGGAATATGATTTTGTTCTCTAACCATAGATTGGCTTAAACCTCCAAAACA AGTT C ATTGTTAGAAC AAAT AT TT C AAATATT TCCTG AGC ACTT ATTTGAGTGCTT ATTT TAGT CAGTAT AG TGT TCAGTGATATGGGAAGTTCAGAAGATTCATGGGATTTACATTCTGGATTTTAGAAGGCTTCCTAACAATGTCACA AGCC TATTAATTGTTGGTAGACAAGAAC CAAAACGAAAGC CCAGGTCTCCTGTTTC TGGTTT AATCATCATTTT C ATAATG CTTTTTCC CCATAAGCAG GCGCTCAGTATCTAG C CAACCAGATATTAAAAGATATTGAAAAGCAGG CTT CATG CATO CCTAACAAATTGTT CACTTTTATT CTGGGTAAACTTTACTGAT CATAAAGTTACAGGCTTCAGG GCT CTCTAAAATAAG CTAAGAGTAG C CAATAAATAAAAGTACAGCACTATC CAC CATATGGTATTTCACATTTTAAT C TATTTT C CATTCAGTTTGCATATTATGCG GGCAGGAAAGC CCTAAAGAGT CATG C CTAGGTATT CCATAT TAAGT TACTTTAACATT TTTAA CAAGTATATGT TAATGAACTAGG CTATTT GGAAGACT GTGT CATACAGC CTCAT CACC CAAATCCCTTCCAAAAACAGAAGGTCGTTTGAGCATTGTTTGTAGGAGATTTCTAGGCTCTTTGAGTGTGTTGGT TGGG CTATGGAGAAG GGGGATGGATTAATAAATGGATTAGA CTGGACTGTTGAGAGAG CCCTTGAAGGCTATGC C AACñGATACAAGAAATñGTG CT CCAGACATAC CTTTAT CTGTTAAATGATAGCCTAGGACACTGAAATGTTTACA CG ATTATTTATAAAATT CCATAAAATGGATTT GC AG CTTTTAAAGATATAAATGAAGCTTATAT CTflAGT TGTG G CTATAAT CAAGTTAAATGAGAATAAT TTTAGAGGGGACAGCTAATTACAGAG GTTTTTTTTTTTCT TTTAGAATA ATAAAAAG TAA CAT CCAAG C CT GGAATAATATAATT GGAAGC CTAGAATTTATT GATATATGTATACATATAAAA ATAAAGGTAACATTATTTTG CT CAGTATTCATTTC CATTATC CGAT CACAGTTT CAGAGATTCATGTGGAAATCT GACATAGTCTTGGCAATTCCTGTATTGGACCCCTTCAGTTTCATGTATAGCTCCACTTGAGCTAAACCAGAAAAA AGTGACATAAAATTGTTTATTCTTTTGACATGTTTTTT GACAACTT CCTCAAAAC CATTGAGATA CTATCTTGG C A CAAAACTTAATATGT CTAGCTTCTTTAATGT CAAG( N ) xACATAACTCTATTTAGTAAAATTTTAATTAAGAAC ACGTATTTTATTACTGCAAATTGAAAGGCTTACATAAACATTTCAACTTTTTAAAAGTTGTGATTCCCAAGAAAA GAAAATTAAC TTAATG GCTGTT CATT CATCAC CTCCAAAACCAAAG CCATTTGAATATTT CCTT TATTATGATG T ATTAGAACTT CATTAAAAGGGAAAAC TTACAAGTAAGT TATCATTTTT GGAGGAAGAT CACACT GTGCTAAATAA TGAGATACGGGAATACATTGAACAGCTTGACTTCTGCTGACTTTTACTTAAGAGTGTGAAGTGTTTTAAATTAGA CTATGTTAGTATTGAT GAAATG TG CACAAAGGGGAAAAATAATT TAAAGGCTTTGTTCTATTTTGAGAGC TAGAT
(N)xACTGTGACCTGGGATTAAGAAAAC(N)xACATATATTATTATGACACCTCTTTGGGATAATTAAAAAAACA AACAGAAATA CCTACATATATAGGTT CATGTAGATACATTGATATTTATATT AATT AC AT CAACTTTT AT ATTG A AG AAAT AT ATGC AT CTGACC CAATGTGCTTATTCAGAT CTT AC AAGTT AAT ACGTAAG CATTGGTTTTTGGTGGG AAGAG GTT CTAGATAGAAGT GAAT TGGAGGATGCTT TTTCTG CT CATTTGAT TG CTTT CTGTAATATT TGATTTT TGTAATCATGTATATGCAATAACTTTATTTTTAAGCATGTCCATATGAGTGTTCTTAAGTAAACCATTATTATTG TTCTTTAT TTTTATATACCTTAAAAT( N ) xGACAATAAAGCCCTGTTGTTGAGATTCCCAAGGTCAGGAAGCATA ñ CAAAGTTTAGTATAT CAGCATTGTATTGT CAG GAAAT CAGCTCAAGTTTTCTATGTAATATTCT CATGACAAAC AAAAGTCTGCTCTTGCAGCCAC CACC CCCT CATTGAGAATGCAT CC CT TAGCTTTGTAGGATAGAT CAGCTTGAT GTTGGACAGTTAGAGACATTGTAT CTACCTCCCCTT CTGAGCAAGAAAATGAAAT CAATTATTACAGTTGTTTT C CTCCTCTGTAGTAAGAGAACATATGGCAAACAAAGGCCTCTCTAAATATGCATTCATACTTAGTGGATCCATTTT TGAT TC AGAATGGGGAGAAAAATGGTTGCATATTTCTC ATGT A C CCTG CAGTTT AATACAGCAT ACTCTATTTT C AAAG CAGTTC CAGAAAGAAACAATAAAATGACTGTCTT CTGCTCT CTTGAAT CTTTTCAGAACTGCAACTTCCAG TGAGTCCCTCTGTGTGTCTGGATCAGGGAATGCAATTAAAGCCGAGTACTTCGAGTCACCTTTTAAAAACAGTGA AGCCACGTGT GTGGAAACCAGGG GACTGGAGT CGTGAACAGCTGTAAG CTAATCATTTTGAATT GT TATAACCAG CATGCATTGTAGGAGACAGCTAACAAGAGAAAAGCACAGAACAGATATGGGGGGTTCACATGTGTGAATTATTAG TTTCTCTTACATAAATTTTCAGCTATCCTTTTTAAATAAAGAATGCCATCATTGACTTTCTCCTAGATTAATTCA AACCTGT C CTGCCAGAAAAT AAAC ATTTGGGT AAATTGGACT CT TATCTAAT AAG TT A CATTAAAGGAAATTG AA TTTAAAAC CTTGACAATGCAAAAC TATAAAAT TTTAACTTCCAT TTTATACT CAAATGACTTTG CCTTAGACTAT GAATATGGCTGGATGCATTTTTTTTCTGCTATTTTTTTTATGGCTCTTAGCATCTAAGAAAGCATCTGCTGTGTA TC AAGTTATC CAAGTTTTCT CC AG G G AAAGGATTTT AT AATGTAGATAAATT AT AGTT AAAAATTAAG AGGTTTT AATTATCTTACCCAAGACACAAATAAGTTTGCCTCATCTAATTTTATAATTATTAAGTAGTATTTAAACAGTTAA TAGTATTTATTGAAGAATCATATGCATCTTGTGCCAACCTAGAATTTTAAAGTTTAAAGTA(N)xAAAGCACAGT CCTTTCAT TGATCAAACTTAGATATTTAA CAT TTAGAAATATGATGTAGAAAAG CAAATAATTATACATTTTGGT AT CATTTATGATGAAAGCATATTTGTTTCATTA CAAAATATTAAACTGGTGT TACTTTTC CTTAGTAG CCTGAGG T G ( N ) xCATCCCTTAAGTTAACTCTTCATTATCTGGGAGTAGGGACATTAGGTAAGATTGAACTAGTGGGGAAAG TATTTTCGCATATTCTTGGTGACCTGTCCACTATTTAAAAACTTAACATTTTTTTCTAGAGAAGAAAAAATAGAG CAAAGGTñGG CAGAGGTAGAAG CTTTTAGAAAGTTT CAC CCAAACTGTAATTTATATCAC TTTTGT TG CAATG GT ATCATGCATCTGAGGTCACG GGTACTGTAAC(N ) xCAACTTTATTCTACATACTACCTTTCCACCTTTTTATTTA ATTAAATTAATTAA( N ) xACCTTTTTTACTTTTTAAAAATACTTTGATTAAATCTGCCAGAGTGGGGAACTATTA CCAATTTGACTCCTTCTCGCTGTACCTCTCCACCACCTGGTGGTGATGCTAATAAGCAACTGAAATTGTTCAAGG AAAC C CACACACTGAGGGCT GG CGTC CTACAAAGGATAGAAG CAGTTTA CAAAG CTTGTAGGTAATTTACAAATT ATGT ACTCAACCCT AC ATTATG GAGCGCTAACTATGTTTTTTTTTTTTTTTAG G AATG AAA CGACAGT CCTTGCT CCACATGAAACAAT CTTTCGAGCCGAAGATCTATCTGTGATT CTTAAA GCGTATGTGTTGGTGACGTCCTTAACC CCTTTGCGTGCATTCATTCATTCGACGGGCACAGTTTGGAATCCACCAAAGAAAAAACGCTTCACTGTCAAGGTA AGCTCTCG GTTGAAGCTATTATTT CACGTAAGTACACTGAAG TAAAGTAAAGTGAGTTGG GCTG CCATTTTCCCA CCTTTATGGTTATCAAATCCCATTGAACAAGGATCACATCTTTTTTATTGCCGCAAGAAAAGATTTTCACTTAAT AGG TGCAG GTTTAAAAATGAACTG GAAGCAGCTTAATT CACTAGGGAT CACAAAG GCTACATTCTT TTTT CTCAA TT CATCATGATAGTTCAATT CTTTGATCTACT TATTACAGAGCTAGTT CACAGCTATACT CTGT TTTT CTTAATT TTTCACTTATTCAAGAAAAATATAGTCAAGGCAACAAATGAAAATACAATTTTGCCTTTGTGCTTTTTGCTTCAC CCTT AGGAAG CATAATG CTAGTTC AT CACACC AGTC AATATG GAAT ATTTGCTG AGTG AAAAAT AAAG CT CTCAA GTGTGCCTAATATAGCAAGCTATGACATAAAGGATAATAGGGAAGATTTCAGGGAAGAACTCAATAATAATCCCA AAAAGTTTAATAAGTTATAAAAGGTAGCTCAGGTCAACTTGGCTGAAAATGAGATCTAGTGACAATATGTTTGAT AAATTTGAAAGAAATTTTGATTTTACAGTATTTTATTGTTCTGCATACCAAAAAGAGGAAATAGAAATGGAAAGA ATGCATAG GGTCCCTCATAG TTTGAACAGATTAGAAAATGACTGAACTTGAT TTTTTT CAGAATTTAC CTTCATA AACT GAAT TTTGAATTTTGGAAAATCTTCCAC CTCAAGAAGAAG GCATGAGCAAATTTTAGAATTTAT TT CTGC C AT CTTATT TTTTTATT TAGTATGCTTTCTTTTTCTCCT CTTAAACTTT GCTTTTTCAC TGAAAAGCTTACATATT CTTCTGTAAGTTTAAAAGTTGTACATTCTTTACCTATATAACATACAGTGGTCACTGTTAATTTAAAACATGCAC T C ATTGTTTATGTTTT CCTATG ATTT CTTAGACTTTTC AATATCTCTATCTG CTT CCC AACCTTGC CTTCT C AAG ACTCTTTTGGTCTCTGTACCATGTCCCCAAATCCCATTATCACAATGTAGATTTTTAGTTTTAGTTGAAATAATC ACTTGTTC CCTTTTTTG GTTTTTT TTTTCCTCAAAAACAAAACAAAACTTTATCAAGACACTTACCAT CCTTA CT
t c c c c a c a t t g t t t a c a a t g c c c t c c t g g a t t t a t t t c t t t t a t t a t t t g a g t g t t t t c a a t t t g a g t c t g a t g t CCTTAATCTTTAATTCTAAAAATTTGTTTT CATGATTT CTTGAAATATTTTTGCTCACTGTTTTTATT CTTCCTT CTCCTGTTGACCAGATTTGATCCTTCCATTTCTATTTATATTATTTGGCTTTTCTTATCTGCTTTCTGGAAGCAT TTCTGTTTTCTGAGACAGTTCTTCAATCTTATCTTTTGAATCGTACAATCTGT( N) xAAATCCTTCTTACGCTAT AGCTTTGT TGCTGTCT CAT C CGTTGTTTCTTGTATTT CAGTTATTTATTTAATGCTGCATGTATAACTTAAATGC TCATGC CC CTTCACTTAAAAACTATGTATATT TTTATTTACTTCTGATATTTTTTAC CATATTTCCTCTTTGTTC TTTTTAATGACTT CTTTTTCACATTTTATTTTGCTACTATCTTCCATTATTTCCTTGAGTATATTGCTGGCATGC TTATTT TAAATT CCTCAT CTATCAGTTTTGGTAG CATGATATATAT TGTTCAGTTTGTTG CCTTTCTTTTGAAGT AGCTGTATTTTTTGTGCATTCTCTATTTTGTCCTTTGAGCTTAAGTCCTCCTGCAGACATTATTTATTCCAGCTA GTATGATGTTTG GGGAAG GT CAGTCCTTTCAGGTGAAT CCAAG GAGAAGGGACAGAATGTGCTC CAAACATGAAG AAACCACTTTTG GTCACTTC CCTTTGTAGC CAGGTAACTTCC CTGGTCAGAATTT CAC CC CCTAAACT CT CTTAG GCATAAAAGCACAGGGAGGAAGCACATCTTCCTCCGTTATATGTCATCAAGGCTGAAGGTTGACAGACAGTAGGG TGGAGGAG TGAATG CCACATGTGACCAGTAACTC CATACAGTTT TT CCTCCTTTCT CC CAAGAAATTTAG CAATC TGAATCCTGGGCCTGGGCTTAGCATCCTTAACAAGGAAGGAACAACCAATAATTGCCCTAGAGGAAGACAGGGGA GAGTTAGGAGGAAAA CGAAAATATTTTCCTTCAG CCTGT'C CT CT CAAATAGCCTCCAA CAA CT'C CC CT CC CCCAA CTCCACTTATCC CTGACCGTGGCTACCACTGGTCAGAC CAGTTCACAATAGCTCTAGACT CC CT TT CAAAGAGGT GTTTGTTATTTTTCTTTGTTATCAAATTCTTGCTTCATTCTCATTTCACCTGGGCTAGTGAATCATCTTAACAGG AAA CTGAAGC TC TTAGGTATTTTAAATAATTT TTAAAGTATTTT TC CTCTTTGTATGTAAGTGC CGCTCCCCCTA CCCTGATACAAATAACAGGCTAATTTTTATGATAGTTTTTATTTATGTATTTCCTGAAAATAAATGAAAAAAGAA ATCTCCTTGTGATCTGGTTTTCTTTAGATTTTCATGCTTCATTCACTATCTCCTTTCTTATTCATATTTTTGTCT TTTCTGTGAGACAGATGCAAGCTGCGGTTGTAGCATAGGAAAGCATGCAGTTCTTGTCATTTTCATCCTCTCCTT GGGATTTATCTTTTCAAGATCTTGCCTGATTATTCTGGACAATACTGTTATCTCCTCAGAGGGAGGTAGTGCAGC TTCCTGGGTCAATGTGGAGAAAGTGATTGTTTCCTTACTAGTTCAAAGTTACGCTTGATGATACAGATAAGTTTC AGAAAACAAGAGGATATTTATTTACCCTTCAACTGATGTACTTGTTATTTCAGTTAAAACAAATAGATTATAACT CATCTGTTAAATTACTTAACCATCCTCAGAAATACTGCTTTCTACTGGTATTTTTAGATGTGAAAAAATACACAG AAACTTCTCACTGCTAATGTGGAGGACAAAACCAAACCCACTTTAAAAACTACCCACGCTATACCAGCATGTTAT TAAACACTGT CC TTAAAT TC CACAAAACATAATT T(N)xACTGAGTACTGAATTCAAAGCCCAAATGACTTAGTT AAATCAGTAG TGATTTATGTACTGTGTGATGAGG CAAG CC CACTAAACTAGTTAT CAG TGAACATT TATAGAACA TTCTGGCAGCTAAAGAAGGTATGAAGGCAATAACAGATATAACTATGAGCTCATTTATTTCCAATTAGGTCTTCT GCTGAGATACTTTATCTGTTAAAGCTTTGGAGAGAGTAAAAAAAATCAAGAAATGCAAATTTGAGGAATAATGGA AAATAGAAGCTT TGTAAA CTATTTTTAATG CC CTTCTACATAATAAAATATTTAAAAATT TTAGTGTTAGATGTT TCCCAAGGATGTTCAAACCTAAAATCAGAGGAAGTTTTGAAAGGGAAAAATTCGCAAATGTAATATTTGCTTACT ATACCATTTGAT CT GATT TTAAATCAACTCTATTTGTTGC TGTTAA GACAATATCATTTACTTGATGAG(N) xGT CTTCCCTTAAAAAG CT CTTAATTATAAAA C TT CAGTTTGTAT CTATAAATTCACGAGTAAAATT CTTT CACAAAT ATTCTAAC TATAA CTTTTGGTTAAATGCAT CTATGTCTGT CTTAAAACACAGGTCATAAAGATTAACTGCAAGAA ATGTCATAAATT TAAATT CGGTTCTTAGAG CAAAAAGT CACAAGTCATCCCTTCAT CCGT TGTAAATCACAAAAT AAGTTACCCAACAGAGAGGCAACTGTAATATCTGTAACTTTTTATGATGTCTGAAGATTCTTTGTACTTCTGACC ATATTTGACTATTTTTATGAAAATGTATCGCTGAAGAGATTTGTGACTTCTGGCAGAAAAGCTTGTCCAAAGCCT TATTTTGT C CAGTCAAGACATAAATTAATT CATTGTTTGATCAG CCATCCCATATT TGAAGATGTTAGATTTGTT GCTATT C CATAT TTGACAT CAGATATAAAGTTATAT GAAGATGATT CCTTGCATATAATCAG CT CC CTAT AAATG TCT CTT CGAT TGAATTGAAT CAAAGGTCATTAAAAAAATTTTAACTGCATGAAGAAATTG CCTGAGTCTACTTTA TCCCTAACTGCTATAAGCCAGCCACTCTACTTTGCAATGATATATACATGATATAGGTAAATATTATACAAATGA TCATTTGT TTTC CAGTAAATGCAATAGACAGATGT CATAATCTGAATTTCTGGATAAG GGCCACTTTGATAGAGA TAGGGAAATCTG GGATAC CGTAGCCTTTTTT CAAGT CAGATTTGTAAGTTCTAAAT CACCAGCTCTCTAC CACCG CTGCACTGGC CTT CAGGGGACCATTCTTC CAT CTCCAGTGCTTG CAGTTCCCTCTCACAT CAATA CAATCAGCAA ATGTGCACAGAGGG CAGAGGAGAACTGTGG CAAAGAGAGGAATGGTGTTATGTACAAGATTC CTTT TAAAAAGAT GCATATAT CC CT TATT CATTTTG GAAATATGACACAGT TGTAG G GATGAAGTTAGC CT CATTTGT CGT CAATATT TATGCACTGCCTTTGATAATGTTGGGTGAGATTGCTTATATATTCCCAAGGTAAAAATGAGTAATGTCTTTGGAA GAAAGTATAAAGTGACAGTATGGTTTTTCAACCTTAGGAAGTGTTCTTCCCCTGATCTTCATTTTATGTCCAAAA CTGCAGG CTA CGAAGCAGAC CTTGTAAAAGGCTT TTGACT GAGCAT CTGGCCCTT CG CTAGG CCCGGGAGATTTA TGTCAG CTGTAGAAAGAATGTGAATAAAAGTC TT CT CCTGAGAGTTTGTAAAAAGAGTTGAC GTATTACTTAACC TACTTTTG CTTGAATTAAATTTTAAAATTG CATCAG GGTGTT CATTTTAAAGTATG( N ) XCCAGCATGTAATTGA ATAAACTTGTGACCAAAG CT GACTTACTTTAAAAACTTTTATAATAACTGCAAAGTAATCAG CC CACATTGATGT TAGATTGCACTAATACTCTTTATAGCAACTTTGTGTCTTTTTAAAAAGCA( N ) xCCAACTTGGTTCTATTAAAAG ATTTGT CAAGTCATTTTGTACATTCGTTAATT CT GAAG GC TGAACTTGGATGATTATC CTAAAT GTAAAGAAAAC TATTCATTAATGATGATCATCTTTATCTGGCCAAAATGATAACTTAAGCAGAT(N)x CGT TGTCTATCATCCTTA TTTTAATTAAAGGTACCTCATTAATAATAACTTATTAAAAACTGCCATTTGGGGAAAATATGATTTAAGAGCAGG TTCTACAACAGAAGATAGTATTATGGATTTACATAT TC CATCATTGTTAATTCAGG CATATTTT CC CT TCTGAAC TAGGGTAATGAACGCATAGACATATGCAAATATAATTAACTAACATAATGCTTTATATAGGTAGATGCTATATAA TGCTT(N)xATGTATTATGACAGTGTAAAGGTAAAACCTAACACTAATGAATAAATTTGCTAATTATTATTGCCT TTAATACTAGAAAAT CAGTAGGCAAGTGTTAATTATATAT CCAGAAAAATGGAGGTAAG CTAATA CTCAGGGATT TCAGTGTGAAATCAGGTGAGGTAAGTTCATTCCTTCTTCCCTCCTTCTGGATGCTTGAAAGATGGAAAAGTTCCT CAATAGTACTTTGGAGGATAGAATGTATGTTT CCCTTCTCAGCT CCTCCCCTT CACCTGTCCTGTT CACAG CCTG CTTCTCCCTGGAGCCATCAAGGGAAGGTATATGCTGGGATGAGGATGGGACCCAGTTATGGACTTGAAAAGAGCA AGTGTTGTTTCTCTGAAC TGGATC CTGGAAGTGTATTG CC CGTGAACTT CACAGAAGCTC TTAAAAATTAAT CT C TGCGGGCTCTTCCTCTTTCTGCCGTTGCCCAGCACAGTAATGTCACTCTGATTTCTGTTACACCTTGCCGATAAC AAGTTTATAAAATAAACAAGTGTTTAGAGTAAAAA C CAAG CCTGGGAAAAGAGGT C TAGT GTAAGCAATCTTTCA AAGCACTTTAAGCGTCCATAGAGAGCAG CACOAT CTTAAT TTTTTAGTTT CTTATTATTT CTTTAATAGATGAG C CAGCTAAGCCGTAGAGATGTTC GAAGA CAGAACAGCTG TATTCCCTTT CTTCACTACC CT CATG CAAGAAAGGAT TGTTAAGATGGAT CTTTGGGAAATGGAC CATGAATTAAG G CAAAA CTC CCTG CAAT CTTC CTAAATGTTTTCATT TCTGTACAGACACCTAGAGAGT TATT GAAT TTAATT TTATAAAT CTTTATCC CTAAGACTTTAACAACAAGG CTT AACTTTTTCATCACTGTGGTAATGTTAAATATCTTCTGATGACAGGAAAATATATTGGAAGGAAAAAATCACATG AAGTCAGCAAACAGGAGAA CAAAAAC CAAGAG GAAAAAT C TACT TG CCTAAATCTTTTTGAAAT CCTTAAAAGAA GAGAGTTTGCTTATTCTT TATT TATG TAGGAAACAAAT CTTCAGTGCC TGGCATAGTAGC TAATAT T CAG CCGAT TCAGGGAGAGAGAAATGAATTAGAATAGAGACTC CACT CATCAAATTG CTGAAAAGAATAATTAGTAATAAAAAT AACTCCTTTACAAAGTGGTATG TGGGTG CAAAGTAATAACAGTGAAAA TAATGG CAGTAATTGT CT CCAAAAATT AATCGCTTTAATGCTTAAGTTTATTACAGAATATTTAC TCAGTGGGGAAGTTAAATGATT TTTACACTTTTACTT TGGGTATAGAAAAAAAGTAAGTTTTTACCAGTACGTTTTGAAACTTATTAAATGTATATCATTTTTGAGATTTTT ATTAACAATATCCTTTATTC CTTGAGATCTGTCT CAAAGG CTGACAAATT CTAAATATAAAATATC CT CAAATGT GCCAACTAGGTAAAACTGAT TTGT CTGCTGAAGGAAGACAGG CTTTTGCTCC CTTCAGGAGAAAAT CTGACTTG C ATTTGTCTTATGTGATTTATTCAT TGTAAGTT TAAAATGAAATACAGC CTTTTTCCTT CCATATATTAAAATGTG GTTTCAAATTTAGAAGAAA CAG CAAT CTTTTTTCACTAAGGAAG CC TACATAGCAG TGAGATGGAG CAAAACAAT TACTGTATTGCCCCCTAGTGGT CTTTTTTG CTATTGTACAAATAGT CT GAGCTGTC CAATGG CAAGAGGAAAAAA GGCAAAAATGCTTAGCAAAG CATG GAGCATTT TT CC CC CATTGGATTAAACATATTTATT TTAAAAAAGTTATTT TATAACCCATACTTACAGTAAGAAAGCTGTGCTTTTAAGTAGATTGCGAATGATGATAATTCCTCACTACAATTT AAATTACCTAGGCCA CACAATTATTTGGAATGATTT CAGCATTTGC CAGC CACTGGACATTGTTTTTT CACTTAA GAACACTTTGTAAAATTTAATATG CAGTGCTATTAAAGAATGATAT CC CT CAGATGACTATATG TTAAGAAAATT CATTCTTCCAGCT CAAATTCATAT CATAAAATATAATAAT CTTAATTT TTTT CTATGGAAAAGTTAAG CCTATTA GTCATAAACTTGTGAGTATACATT TAAGTGAA GAAATGTACATTTTGTAATGTT TT TTTCATATAAGGAAAT TCA ATGATATTTTCTAGAGCT CTTT CT CCTCAAGGAATTTACTTGAGAGTGGTATAAATAT GCAAAGTACACAAATAT AGGATATTTTTATGGAATATGGAATACTGACTTAGAAGAAATGACTTATAAACATGCTATACTGTCTTATGTGTA GATCATACAAAATATGTC AGGG ATGTTG GA CT G ATTTC C ATAAAACTC TAGT CT AAGC CAGT C ACC AC AA
> H s 7 _ 130398172 - 13041 50 16
TTTTGTGACTCCTGCCTGTCACTGTG CC CC CT CAGATT CTGAAG CTAAGTAACCTC CAAACCTT CCTGGTAAGAG CGTTCATAACTTTCACTGTCAAAAACAC CATG CCTAGT CACCTTTGGCGTGCATTGATTGGC CTAAAGAATTTTA CTTTTTCCCCTTGC( N ) xACATTCTTACAGAGAAAGTATCTTCAAGAGAAATAATGTTTCTTACAAACCTTAACG AGTCAAAGATCATCGTGTGT CAAT CAGCAATTAGAAAGTAATAAATGT CAGCTCTAAGAGT CTG AAATATGG CC C CCAAAGCAGGAGGTTCTTACTAAAGGTCAAG G CAGCACTGACTGGCTGT CAGAGGCAGGGACAG CTACTTAC AAT TTAAGATTAGCAAGCCCCTAAAAATAAG GAAAGTAT TTTCTCTTCCTTCC CATC CTCCTGTTCCTTATTTGTTTA GGAGAGTGTGGGTAGGGAAAAGATAATGTTGTCTGATATTTGTAGCAATGATTGGTATGTGCAATTTCAAAGAGA GGCCAGATGCTAATGTTTGTAATAACCAAATAGCAGGCATGTCACCTGTTTGATTTGCCAATTTTAATAAGTCCT GTGCTTTAATTTGTAAGGGTAATTTGAGTAAT TATAAATATGTAAAAATAGACT CATTAAACATTCTG GGGAAT C AAAGGGCTGAATCTCTGC C C CTAC TTATTTATGATATATT CATATTTTTT CAATG GGT CAGTAGTTTACTATGAT ATATGGGAGCCTAAAGCTTGGGGGGTAGACACTCCCGTTTTCCAACTGGTAGACACTAGTCTACCCCCCAAGCTT TAAAAGTAAACAA CTACTTCAG GATGGCACATGCCACT GAAAGGTCAG TAGG GT GCTATG GCAGATTCTCTCAAT TTTCTACGAAAAATATATAT CCTC CCTTTATT CT CTGCAAAG CAAACCTCAGTTTT CTACAG GGAGGCAT CAGGA TATGGACCACAATGAGATTAAGTT CATTACAACAATCC TTTAAG C A T (N ) xACTCATCACTTGGAATGGGTAGGA ACAATTCTGGAAAGCTTT TG CTTT CTGAGTAAAATGGAGAGATG CCAGTGGTTC CAGCTTAC( N)xTGTGGCTCA AATTATACCTTACCTTT CTT CCTG CTTTAAA CATGAGTAAGATG CCTAGAGCTGGC CCTATTGTTCTGTGGC CAC GATATGGCAGAATAAGAGATGC TGAACC TG CCAGTATAGAGT GAAC TGAACAAAT CACAG CAGCACTTT(N) xGG TGCTTTAGCAGAATTAAAGGATTTCAACATATGTATCTTTCTCACTAGGGAGACAGAAGCTTCTAACGTCTTCTC CCCCTAGGATCCAGATCCTACGTTGG CT CTGT GAGATGGC CAAGGATACT CTAAATACTAATAGGA CCTAGACAG GGGACTACATTCTCTCTTTTATGAAG GT CAAACAGTTTTAAGTATAAAATG CTAGCATACAATAGTAATCACACT CCAGTTTCCAACTGGGAGATAAAAAGACTGGC CT CTfiT CCTTAGAACC CAATTTATTT( N) xCTTAGTACCCAAT TTAAATTTGCCTGGAGTT CCAAGTTGAGGACCATGGAAAATCTGTATGGGACTAAGGATG CTGGATTCTCCCTTC CCCTACAGCAACCATTCCTAGTAAAATTTGAT CTGTGATC TGTG CTAAGAGTGC CC TGGGGACAGGTG CATTAGG GTGGTGCCTCTCAGAGAGGCAGTTAGTACAGCAG CATAGT CTGGGCACATAC TCAATAGCTTGGTGGT CCAGAAA GAAGAGAACAACTTTTGC C CTAGGTGAC CACAGTAACTG CAACCTCAT CCTG CAGTTATTTGTTTAACAAAACTA AGGAAGGTAGAGAAAGT CTT CAATAAGGTTTT CTGGGCTT CTTGGAGC CAAACTG C CCAACCAATGGTACTAAAT GGAACTAAGGCATGATTTTC CTTCACTC TCATATAGTATATTTT CCTAAGTATACT CAAG GGAT TAAAGGGACCT GCATAAAATTAAGATGTGAGATGGAAAT CTGAATGACT GAG GACTGTGAGTC CAGTATTC TGG G CAATAT CAG GA GAGTTGACGTA CTGCTCC CAAGT CAGTAAGTGTCTT GTACCTTGTGTCTCCG GCAAAGTATAAAGAGGAGATGCT GAGGCCCTTTAATCTTA CA CAT CTGCACAACC CACTGTGAGG GG CTAAG GAGGTATCT CATAAATAT C C C T T ( N) xA T A TA T A TT TTT T{ N ) xGAGATATCTGAATAGATATATATGCCTTGAGATTATCACTATGCCCTTGTGTCAATG ATCTAAAACCCATTAAGCTAACTATAAT CTGTGAAAAG TGTTTTGCTT TATCTATACAAATAC CTGGGAAGATGG GAGG 3TAGATTT TC TGG G AC'I'TTAGAG CAAAAATCATCTTTGTACCA CAGGAAC TACATCAG GATAGG CCATGCT GAGAGCACAACCACAGTGGGCCTGTGACCAGGGTTGAATACACGTCCTGGGCATCCTGAGCAACGAGTCAAGGTC ACCTGTGGCAGGACACTAATGTCACTGCAATGGAGTAAAAGAGCCAGCCTGTCTCAGTCACTCCAGTTATAAAGG AGATCAAAATGCTCCAATCACACTTGAAAATGGTGCCGGGAAAAGAGATCATGAGATATTAGCATTCTAAAGAGA ATGGTGTATATGATCTGGAAATCAGAACTTAACACCTCAGAAGTCTTGGTAGATGTTTTCTTTCTACTGTCCAAC CTG GAAATTT CT CAGGAAAGAT GAATGG CCCATCAAG G GTTTGAGACTAGAAATTGAGTGA CTTAGGGGCCACTG TAACACACAGCAGGCGTTTATGGAGAAGAAATCTGAATCTCCCTCAAGTAAACAGAGGCATAAATAAATCATTGG AGCAAATGGAATGTTTGTTGTGTTGGGGGCAGCCAATTTATTGCTCAAAGAAATAGTACAATGTTACTTTTTTGT TTTGGCACAATTCTTGTGTTCTGTATTACCTGACATGACTTTTTACCAATAAATATAATATGTGGTTGCAATTAT AGGAATAACA CATGTAATGTATGAATTAATTGTGTGTTAGTGGAAGAAAAAACAAAACTGAAAAGAAG CAGTAT C ATTAAGGTGAAGAATGTGTCTCTTTTTGACATTGCTGGCATAAAGCAGTAACCCCACTCTCAGGAGTATACCTGG TAGGAAGAAACCCTTGAGAAATGGATAGGGATCCCTAGGAAAATCCCTCTTTGTTGTGCTTCTTAGCCTTTAGGC AACTATTACCTTGTAATAATATTTTGTTTTGTTACAGTGTAATAGTTAAACAACTGAGGTAGAAATTGTATGTTT CGATTGTAAGATTCAATTAGTGATTTACTTTAAGACTATCAGTGGGTCATTCCACCATA(N)x ATCCTATACTCA AAATACATACAATATT TATTTACAAAAATTACATCAGGTAATACAGAGGG CAAATATGTG C CAAAGACAAAAAGA A CAT TTTTAGAG CCATAAAAAGAACATTTTAAGCCATTTC CTGAAGCAATAAACTGGCTT CCATAAAAAACACAC ACA(N)xCTGTATTTTTTAGCCCATAAAAGCCTGGTGCTTTTTGTTGTTTTTGTTTTGTTTCATTAAAGATAAAA GCCATGACTACACATGACCCCAGAAGGAAAGCCTCTAGCTAGCACCGCCTCTGGATGCACTTCAGTCCAGTCCAT TGCTGACCCCTGCTGGAACTCAGCGACCAAGTTGCTGTCTGTCCCTTGACTGGATGAGTGGACCTGGAACTCCTT GGGGACCGGCAGGGTGACAAGAGGCTGCTGATATCCTCTTTGCCACCTACTTGTTCCATACCCTGTTTGTAAGAG CCGTATCTGTAGTCCTACCTCTCCCTGGGGATTTTCCTAATTCTAGGGGTCTGCAGAAGACAGATTAAAGCTTTC TTTT GTTTTATT TATACTTGGTAACTGT CTTCTCTTATAAAGACCCAG TAAGAG GGGG( N ) xCCTGCTCTGGGTT CCACTTTCAGAATTAAGAACTTAATCTTTAACCAGACTAAAACTTGACATGAAGTAGAGAATCTG(N) xCTTTCA AGTTGAATCGTAAATGTTTGTATGTTTTGACCATTTTGCTTTTCTTGCCCTTTACTAAACTAGAATATTTTGACA GGTGAACTAGGATT CTGGGTCAAG CT CTGGATTATCAAGAGCTATTGCGGAAAAATAAGAGACAAAGAGTATTC C GGAGTGGAGT GTGTGAGATTTT CC TTTGAGAATTCAGATTGGGCTCAT TT CCTGTGTGGT TTGT GT TGACACTC C CCTAACTCTATCCCTCCAATGAGCTTAAGAAAATGGGAGTGAGTCTCTGAATTCCTAGTGAGGGATGAGGGGCAA AAGAAACTCC CTGT CC AGGTCC TAGAAT AGGC ATGAGAGGGC CTGAGCTG AC AC TCCTAT AC AC GGGC AC AAAGG TTCTTATCGATTTGGATTCTTGGTGTACTATGATTTTTCTCAATATAAAATTATCTACCTTTGCTGCACTCAAGT TTTGGGCTTGGAATTGTATTTTGACT CTGATATTAATTTTGT CACACC TT CTTG CTTTTATT AACA CTTAACATT TGTTTGGTATATCATTTCCAAACCTAGGA(N)xTGCTTTTGCTTTTTTAACCAACATTATTTTGTTTCTGGCCTT TAAAATACTTTAAAAGGTCAAT CTTGTTAACTTATACAAG CAATACATGAAG GTATTCTG CTTTTTAAAAAAAGT CAAAGTA(N)xGTAATAAAAGAAAGAATAAAAGCATTATATATAAGGCTAGAGAGCCCCTAACCAACACTTTCAA TCC TTG TCTTC TCTTTAA(N )xCTTTTCTTTTTTTCTTTCTTTCTCTTT(N )xAGTCCTATTTTCTTTTATGTCA TTCAGTAAAAGTTTTATAGTCTTCTTCATAGATGTTCTTTATATTATTTGTTGCTACCAAATATGGAATCACTTA GTCTATTACATTTTCTGATTGATGATTACTGGCACATAAAAAAGTAATTGA(N) xTGTGATTACCTGCCTGAAGT TTGT CTAGACTTTC TTTGTATCTTAC CATGTGGCCGATA(N) xAGACGGTTATGCCTTTTTGCTTCCTAATTTCT ATTAATTTGATCAATAACTTTGATATATTTCATGTGTCTCATCTTCCCACACTTTCTAAAGACTTACTTCTTTCT GTACGTTCTC CAAG CAACCATCTATATT CCCACTGACAATG GTAATAT CACTGAAACTAAGC TAAAAATACTTG G AAGTATTGTTTACATTTTCTGTAATCATCTACTATACCTATCTCAGTGATAGGGATGATTTCCCAGTGCTTTCTC TG CAAATTTAAACAG GTATCT CT CAATATTTTCTTGGG C C TGAGATCCTGAGTGAAGACAGACATT CATGTAAAG
c t t g c g t c c a t t c t t t t g t t a g a t c t c t t a g g g t t c t a a t c a c c t c c t c t a a c c t a c t a a t a a t t a a a a t g c c a c TCTC CTTTATAATT C CATGTTTAAGATTGTTGGCTACTGGTCTTTCTTATTTTT CTTTTCACAT CATCAATACTG TCCTCTGAAAACACAACCTTTCTTCTAAATAAATCCTTCTCTTCTTTCAGCTTAATCAAACCATCATCCTTTCTT CAGTTTTCACCCTTGTACTGCTTCTATTTTGATTTCAAGATTTAAATATATATATTTGAAACAAATATGGCTCCC ATTTTATTTCCTGAATAGACCCTGATTCCTAGGGAGAGAAGGATTCAGGCAAAAGCCTTTTTAATACTTTTCCTG CCTTTGGGATTATGGTATACAATG CTTT TCTCTTCCTCAAAC CCCTATATAACAAAAACT TC CATT CCTCTTTAT GTATTTGTGATAAAATCCACCATTTCTCTGTGGTTGAACACACAAGAAATTCAGCTGAGGCCTTGAAATTCCACT GT CAG GGCTTATATAGTAAACACATCTGGCACCAACTTG GAATATAAATAGAAGG TGATT CTGGATAT CCCATAT GAATTAAAGGATTTGGAGCCTCCCATTGGCCTCTCCTTAGCCCTATTAAAGGGTGCATTTCAAATCCAGATAATG GGGAAACTGATCTGATACCACCTGATACTTCCCAGAATAATTTAAGAGCTAGAATGTCCCTAACCTGTGCTCTTA CATATACATGGAACTTATGCCACTGACCTTAGAATCCTCAACTGCAATTCTGGCTGAGCATAAAGGCAGCACAAA TTGATCTATTGAAT AG CGCCTT CC CTTT CTGC AGTG AT CCTGTCTCAGTT AC AT CT ATGTGAAT CAAAATCAGC C ACAGTGTTCTGTA CAG CTCACTTTTG CCAAAGTCTATG CT CAGTGACT CTAATAAATTAGTGGCTTGTGTGAGGA CACTTGCGATCATAGGCTGCTAAGATTTGAAATGACTAAAGAACTGAAAGGCTGAAGATCATTAAAGCATTAAAG CC CTGTACTCTGGCATTACAGACATC CAATCTCCACACAT CTTATTTTGATT CC CTAATT CCAGGTGAAGTGCCT ATTTCTCAAGATTTAACAGGCAAGGCACATCTCAATG(N)xTTACTTTGCTGATTAAGCTGTATCATGGTCAAAA CTGAGCTTTCTAATACACTGTCTC CC CTACTCCTCCTAAT TC CTGGTCTTGGAT CTGGAC TACC CAAAGAAAAAC ATACAGTGGAAATACTTACCTGGATAGCAGCAAGGAACAGAGCTGCCCCAAAAGTAGAAAGAGAGAGAGGTGTTC TTCAAACCTGCCTCTAGAAGGCAGGTCTGGGGCTCAGCAGGTGTGTTTACCCTATGGAAGCAGGATCTGAGTGAA GTGCATGCACACTGCAGGCAGGGAGGTGCAAAAGAGAGAACTTAAGAGTGTGCGTGTACAATGACCCCCCCCCCC CCATGTCTCCATCAGAGAATACAATCATTTCTCCTTTGGGGAGAAGTGTGTAGAGGTAGAGATGGGGGTCAAAAA TGATATTCCAGAATGGTTCCCTGCTTGGAGAAGAACAGGGAAATAGGGACAAATGACATATTCTTGCCTCCATGG GTGGGGACAAAGCCTGTATTCTTCACATCCTCCCAGCAGAAGCAGGGGATTCAACT
> H s 7 _ 130423517 -1304 478 31
TTTT GT CTTAACAAAGATTT GT CTGTATATTTAGACAAACTTA CAGAGGTTT GTCTGTAGTCTT TACAAATAGG C AGCTTTTGTTGAT CATATAT CTGTTT CTTTTAAAATGT ATAT ATTTGTTTAATTTCTATATTTATTTT ( N) xAAA TATG AAATTTTTGGAAGGGTGC TGTC CCCGGATTTC CC TGGC ATTGTATCACTTGAACTATT CT ATAAC C AGTG C TCCTTGTGTAACCTCTCAGGAACTGGGTCTGGAAGTATAATTGGGGATTATTTGAAAAGGAAACTAGAATAAGGG TAATTTTTTAATGCAG CGTATACTGTGGCCTT GAAGTAAATGTT AC CAAAATGTTATCTCTC CC AAGC CC ATTC A GAAGCTTGCAGGAACAACATCAAAGAGATGTTCCTGAAGGCCTTGAACAGCTGACTGGTCCATTCTTTTGGTCAC AAGT AGTTC A QGGCATGT AC ATAGTA QGTGTT ACAT AAATATQGTA GAATAAATGAAT AGGAAT ATAC AAGC TG A CAGATTAGTATTACTA TTCATT TATGATTATTAATC CCAAACAGACAT TAAACATTCCTAGAAA TACTGGACAC C AGTTTG CATCTGTTGC CAGT CT CCTT CGTG CCTTTTGG CAAAAGCTAGTCTGTTCTAGTT< N) xAGTGGGAGGCA GGTCATGTAGGGCCTTGTAAACAATGTGGACAGGGAGATGGGAAGTCTTAAGGAGGATTGATTCACCTTGGTTGC TGTGTGGCTAGTAG( N) xCTTGGAGATAGGGAAGCTGAGTTGGCTCTTGGATCTGAGTCAGATGC(N)xTAGTCT GATGTTTTTATAGTTCAATGAAGTCCAGCTACATCCCTGACTGGGCAACAGGATAAAAGGGATTTTTTAAAGCTA TGTGTTTTTATACGTATGTTTCTGGAGAACAGGAATTCTAACTCTTAGTAGAGTTTTTCAAAATGCCATGTATAA TTATTTGTTTAATAGTTTTTTATTATAATGATCACAAGATGTTATCTAAATATACTTAAAACCGATTGTTTTGTG T C AAACTATAAAAACAAAGATTTATATTTT AGGAAATCTAAT C ATTTAATAT CT AATTTTTTTAAGTT C AATTT C CATTCTTGATAAAACCTCTATGTTCTTAGCTGTCAGGCTTCTGTTAGATCCGCTGAACTTCTTTCCTGTGACATT CACGCTCATGTTAATTTACAAGTCTAAATGAACTATATGTGACCTTATGAAAAACAGATGAGCTTTTCTTTGGTG AATTTCAAACACTTTT T CAGTG CCTGTAGTGT TGTGGG CACATAGAAGGTAAAGGCACATAG CAAATAATTGGAA AATATTTTTATATAATGAGACTGAGAGCAAAGGTCCAACTGAGATGTTTATATGAGACAGAAAAAAAAGTGGGTG GTGTTAAGATTCTGCAGTATGATAGCTCAATAGTTCAGCAGTATGTGTGGTGACTTTGGAGTCTAACAAGCCAGA TTCAAATCTCAGCATGAGACATAGGACCCTCTCCTTTCCCATGAAGGGAATAGAAATTGGCGCAACAGTTTCATG ATAACCTGTCAAGATGATGGTGTTCATA( N) xTGCTGCAAATTTTTGAAGTAACTTATTCATTAAAATGTAAGGC AATACTACACAGTAGTAAAAATAAGCAGTGA(N) xCCAAAACTAGTTAGATCATGGATAATGCTTTCTGCTTAGT AAATAGAATGAAAAAAATCTAAGATTTTAGAAGACTGTGTCAAATATGATATCTTTTACACAACGTAAATACATG ATCATAATTACTATATCAACACGTT (N) xCTGTATTCATTATTATGTATTGTTATCCACATATTATTCATATATG TCCATAATACACCCATACATTAAAAGTAGAATACATTAGCAGGTGTGATAAATACCAAATTTAGAGCAGTAAGTG TCTCTGGTACGGGAGGAAAGGGAATGGAGTTTCAACAGATTCTTTTGTTATTTGTTTATTTTTTAAAAAATCTGA AGCAAATTAGGTAAATGAGATTTGTCAATATGAGCCTAGGTACAGTGCTGTTTGCCATATTATTCTCTGTACTTT TCTGTGTGTGTGAAAT TTTTTTTTTT AAAT CG CAGAAC AGAG CC AGAATATT AG ACAC A C AAAA CATACCTG CC A AATAAGTGAGTGGACTGGGGAT GCAG CCCTTGGAATGT CTGG CTGGACTCCC CCTTTATTCTGC CTTC CAGAATA AAGGCGAGCAGGGGGCATCTTGGCCATTAGGAGGGACACTCCTCTCCAGCCATGTGGATG( N) xGAGGAAGGCAC AAGGGACAGGGAACCTAGTGACAAAACACAGCATAATTTAAAATAATCATTGATTGGGGGTAGAGGG(M) xAATA ACAACCATCTGACATTCTGGAATTAAAATCCCTAATGGTCTTTACATGGGAGGTAGCAAGAGGGCAAAGGAAAAA ACTAGAGGCGTGAA(N)xGAGAAGGGAGGGTAGAAAATTCAAAACCCTAAAAAGCTATATGCTGTTTATCTAAAA CAAATGACCCAGGAATTTAAAATTGAA CAACAAAAG CAGATGTATTAT GAAAATGTTAA CAAAG CAAACCAAATT AACATTT CAAGT AGAAAAAAAATAAGGATAAAGAAG ATT CTAATAACATTAT GAGAAACTTTATATACTCAG TTT AAATGTACGTGGAATATTTAC (N) xT AATT AAAATC AAGATAAAATGTTTGAT CCATAAATG CACAAATT AC CAA AGTTGGTTCAGTAAGGAGCAAAGCTCCATGGACCATTACATAAAAATAAATGGAAAAATAATTAAAGGTATACTT AAAACAAAACAAAACACCAGAT C CAAGTAGTTTCGC CAACTGGAGCTAC CAAAGCTAAAAA CAGGTATTTCC CAT GTTATATAGTGGTCATAGTCCATAAAACAGATTGAACAATTTTTCAATTCGTAAGAGGTTTTTTTTTAA (N )xC T ATAAAGATCTTGGTGAAGAAAAGTATACAGCTGAATTAAAGTTCTCATATA(N)xGGGAGGTAAGCATAAAAATC AAATTT AAAG C ACTGG A (N) xTCTGTATACTAACAATGAATACATAAAAATCAAAATTAAAAAACAGCATTATTT CCCACACACACCAGTTCCATTTAGCACC( N) xCTTTTAGAATTTTTTAAATGTTCTTTTTAAACCAGACAATACA GAGCAT CTGGAACTCT CAAACT CTGGAGCCCTAA(N) xTCTATTTCTAATGTGTAGGCCAG(N)xTCAAAATGTG a a t t t a a a a a t t a c t g t g a a a a a a t g t a g a c t a t c t t g g t a a t t g c a c a g t a g g a t a g g a a g g t g t g a a a a c t c a GACGCTGTGAAGGAAAAGACTTACAGATCTGATGACTACAAAATATTAAACTTTTATAGGACAAAAGTCACCCTA AACAAAAAATTAGGCCATCGGCTTGTGAATATATATATTCAGCAAATATTACAGACAAAGGAC (N) xGAAGTGAG TCCGTAGGGTGGGCTGCTGTGACAGGATGGATGACACATATGTGAGTACTGATAGAGTACTGTGTAAATTTATTT ATAATGAAATGCAT(N)x AAGGTACACATTTAACTCAACAAAATTTGGTCTTACTGGAACTTTAAAAATGCAAAT GTGTTTATGTTTTGTTTTATTTTGCTTAACAAATATCTTGGTATTTTACTAAGCTTTTAAGTTAGTGAAATAAAG AGGCATAATT CAAACATATTTATACACTTTAAATTAACTAAG CCCAAGTAGAAAACAAAAAGGTATATAATT CC C AAAT CAT (N) xATGAGAATATCAAATGGAA (K) xGAATAAGTGATATAATGTACTATGAAGGCAATACAGTCAAT ATAAATG CTT CCAAATAGGGAACTGGTGAATTATAT TATGATACCTATTTAT TTACAGTGAACCT CTACACAGC C AGGAAATTGCTAATGTAGAAAACATATAATGACCTGCAAATGTTACAAACTATAAAGCAAAAACAAGGTTGTATA CTGTAAGTATGACATGATCTCAATTTCGTTTTAACATTTTATTAAAAAGAAAAACGACTGGAAGGCTCACAGTAT CAGGGTGCCATGCGGAGGGAGGACAGACATAGCGAAGCAGTCCTGAAACAACACTCGGTCACGTGTGAGGACATA ATAATGGGTGTTCAGCCTGGTGGCTGGACAGCTCTGCCTTCAGACCAGCTTTAGCTGCTATGTCTTTTCCCCTTT GTTTTCCAGCCTTCTCACTTCCTATATCTCTTGGCTCCGTAACAGGACTGCCTGGTAATAGCTCATAATCCTTTT TCCTGAGTCTAGTGTGTACCATTGTGAGCTATCACTCTTCCCGTAAGGTTTATGTGATTTCTGTATTTGGTTCCA TTATTGATGTTGTGGGATATCCCATGGGA(W) xACTGGAATTTAAAATGTATCCTAGACTGGTACCAGTAGGGAC AGGGGAATTTAAGTTACTGACAAGAGTAGTTGAGGAATAGATTGGACACTGACAATAACATCTCAGGGGAAATAA TTGG GGGAAGACTAGTGC CAAAACTACAGTAGGAAT CTGTTACT CTGGGACACAGATG CAGG GT TAATTGGAATG AAATGTTACTACTTAGTGTGTAAAGGGAGGAAACTTTACTAAATTTCAGTAGATTTCTATGAGACAGAAAGGATA TG CAGACTGGAAAT CT T CTCTTGCC CTTTACTAT CTGTGACTTAGGGCAAAC CCC CTAGAA(N )xCTGCTTAGAA TCCCTATGGACAGGTAAAGAAAGTTAAGTTAGTGAATGAAGGATCTCAAAAGATCCTCACTGTATTGAAGAACTT CC CCATT CAAGGGT TGTTAGA CAACT GT TAACATTTAAATGGGAATGG CTGTCTGGCT GCTCAGGCAATGACCT C AGAGGTGCTGTCCCTCTTTCAATACATTGGGTCTGGTCCTTCATCTTACAGATTCAGTAAAGCCACAGATTGAGG AATGGGGTAAAG GAAATAATATAGAATC( N ) xAGGGATGTGGCAGACATCGTTTTGCCTGTCCAATAGCCCCCCA CTT C CTGGTCATAGAACC CCAGTCAGCTTTGGGAAGCTGCCCCTCC CCAGTGGAAGTGGATC CAGGGGGCTGGTA AGGTGTCCTGTCAACT CTAG G GCAAG GACCTGGGAAGTTGCACT CTGAGTCTGGAGTGGGCAGCTGCC CACTGAC CT CAAGAGCTGCAGAC CAGCTGGGTC GC CAGCAC CACATTTCTGATCCTTGCTTTTCAATAGTT CCTTTAATTCT TTGATTTGCTGCATAT TTTC CAATAAAT CCTGCCGTT CTGAAGT TAGCTC CAGTAGCCTTGCTTGTAAAGAAAAA GAAAAAGCCACAG(N)xAAGACAATTCTCTCTTGAAGAGTAACCAAGAGCAACAGCAAGCCTGCTGTTCAAAAAG TACATCTCACTCTG CAA CAAATCAGTGC CTAC TATGTACCAGTT CTTACAAAGTGACACGTCAT CTGATGGAGCA GGTCATTGTATCTTGTGAATGCTGAATAAACCAACAGAACACTGATTTATAAACCATAACAACTTATTA( N ) xC T CAA CAATTTATTAATT TTAATGTAAT CAACAAACAAATGGATTAGAAT G(N )xGAACAGAATACTCTTGTAACAA GGTCCTTGAGACTGGGATGGATACCCAGACCT TAGGAAATTGAT CAAAGTTGGGCTATTTTATT TG GGGATTGC C TATTTAGGC CAGGAAG CC CTTTCTTC CTGT CATTATAGGAG GTGGC TAAT TCTATGGTTGAAAAGGAAAGAGG GA GGGATGTTCTGTGATTGCTGGGCCCAACTTGGAGGTATGAGGAGGCCAAAGAGGTTCTGGGGTAGGAAGAGCTGC TCTTGTCCTGTGGAGGGCCTGGAAGTTGGGCCCCACTGGCCTCTGTGGTCTTTGTCAGAGGGAGTGGTAGCAGAG GAGG CTGGGGCTGTGGATTT GC AGGATTGC ATGTGAG AG AGG C AGT CCTGGC AGTGGTGG AGTAGC CCTGGATGC CC CTTAGTTCAG CAGGAGGGGAGGTATG GGGTGAACAAACAGTTGTGGGGAT TGACCTGGATTGGGGAACATTTG AAGCTGGCTCTC CCAACTTGTTATfiG CAGATGAG CCACG GfiGTTAACTGG CT CTCAAACAAATGGATG CCACCCT TATCTCCTGATATCCCTGGCTTCTCTACCTAAGATTGCATCACCCTTACCCCTTTCTTCTTATAG(N)xGAACTT TATTTCCTT CATAATATT CGTGG GAAAAGAAT CACTGGAAAG TC TTAATT CCTACTAATT TGGT CAGGAAG GGG C TT AAAGT CCCTC CAAATTG AACGTTAATGG AAATGTGAT CGT AT TTTCTATC CTTAACTAAT AAAG A CTGCCAT A TTGGTG ( N ) xTTGTTGCAGG CATAAT CATAGTAG CTGGTGTTAG GAAG CTGTGGGATAGC CTGATGAC CATGCAC GTG GGATGG GAATACAT CTTTCAAGGG GAT CAAGAAACTTAGG GAGTAT CCGGGGCCCTTCTCTCTCC C A T A T (N > xATTGGTTGAATGAGTTTATG ÍN)xACCAGGGCCCTGGTGGACAGTGCTGGAGACCTACTAACTACTGTTGCCA CTGGTCTTTTAC( N ) xTGGTCTTTTTTTTTTTTTCTAAACTATACGTGTATTCCAGCCCTTACAACACAGAGCTC TG CAAGTAAC AG ATGC CC CC AACTTAAAAAAAAAAAAAATGAAG CAAATC CC AGTTCC ACGTTCTT CTGG AGATA CCTCTTCCCCTTAATTTGACGTGTCTGGCTCCCAAGTCTCATCAGGAATAGGCTGTTATTCTCTAAATCGTAACT AT CC CTTGTAAAAGTAG CTACC ACTTTAATTCGAAAACCAGG CTAATATTTT AAGTGT AT CAAG AG AGGCTTTTT ATTGTCACTATCTGTTGAAACCTTCAATAAATCTGAGAGTATTCTTAGAATCAACACATAACTTTTGTGTTTTGT AAGT TTGGAACGTTGC CTGT TGGATTTT CTTAGTGTG GCACATGAT TGAAGG GAG CATGTTAAC CT TT TAACTAA AATAAAGTGTTTTTGC A C AAATTTAGTG AT TATAC C TG AGCTGGTT AATGTG AG AGAAATGATCTT AAAC ATGAG CTTACTCTCTTGTTGT GGAGGTAAGACAGACACAGG CTACAATT TGATAAC CAATAAAATTG CT TTTGATTAGTG CC AAAATGTATGGAGC AG AT ATTGAGTC CAGT AT TTGG ATAAGATAAAAT AT CTTGGAATATTTTGGG ACTAATT TTAG GATAGACATATCAT TTTTAGGAAT GTTTATAACAGTCC C CAT CAGGAGATAATC CCTT CTAGAAGGTTTCT TAGATTTGAACAGT CTTGTTTAAAAAGATAAAAG CT C CTAGAGGGAGCT CA CTAAAGG GAATGTGGTT C CCTCTA GTGGTGGTAGAAGAAGTTGTGAATTTTCTTATTTATTTCTTTTTTTCATATTTATTCATTGCATCCTGATACATA GTAGTTC CTGTATTAGAGATACAGAAATAAAAGACACAGTTTTATTTAT CAAACT CTTA CAAAG TTGGAATAGAG AAGAGCAAATAATTATGATACTGTATCTGAAAATAGTAAAAAGCCTCCTAGTCCAGATCTGGAGGGATGGGGTTA
g a c a a g g a a a g a c t t c c t g a a g a a t t a g a t g t t a t c t g a g t t t t t a t a a a a g g a g c a c c t a t t a t t t c t t c c t c t ATA CAATAGTAATGTATTATACG GGTATATTT GACGTTTATTTTGGAAAATTTGG GAAACAGAT GAGTTTTTAñA AAGTCCATTATTTTCTTCACCAAAAGATAACCACTTACAACATTTTGCTATATCTGCTACAAGCCACTTTTTATG GATAATTTATGTTTTAACAGAATTTATGCCCATGACATTTTGTAATGTACTTTAAACAAGTTAATATATTTTGAA AATTTTCTTGTATT ACTACATTCACT AGTATAA C ATTATTCGTT CT CAAAAT AGTTTT CC CCTTTT TC CC ACTGT TA CTTTTTCAGGTGAACTTTAGAAAAAT TTTGTT CATTTTTAAAATTC CATCGA( N ) xGGGAGGAATAGCCATAA CT AC ATTAAATAAT AT AG G AACTTTTGT CTTG CT TTTT ACTTTATT CTTGTTTT ACCTTAAA GTGT AG GATTGT C TGATTTTTTTTTTGACATAGATATTTTTCACATTATGCCTTATTTACTAAGTTCTCTTTTTAATTTGGAATAAGT ACTAAATTTGTCAGATGT CTT CTTGGTATT( N ) xTAGACATATTTTTGTGTGTGTGTGTATGGGCCflCAACACAA ACTACTACAGCATCTATTAAGAAAGGTA( N ) xGCCCAAGAAAGATATCTTTT( N ) xCCTGAAAGGTATCTTTTAA CTTGTTCTATAGATAAGTCTCTCCTCCCCCCTTCTCTCCTCTCATTTCTCTTTCTTCCAGAAAAAAAAAAATCCC TTTC TCTAAC CC CATT CT TT CTAACACTG(N ) xTATAGTATGTGCTCTTTGTCGCCCCTGAACTTCTATTATCAT AT TACAT CATAT CTTCTT TG GCCATTTTTGTGAC C C TGTAACTTGGAAT CATG GATATAT CG CT CC CTATTTTAT TCCCAGAGATTGATGCAT CAACCAACATATAAAAAGATTTATACATAT TTTGGTATAAAAAC{ N ) xTCTGTTGAA CTGTTCTGTTCTTGTAACAAATGGCAGTTAGTCATACTGTGTTATTCTTTTACTATAGTGTTTGAATTTGCATAT TAGACATTTTAT TAGATT TTTTGCATCTATATT CAAAAGCAAGTATGGTCTTTTGCTATT CATTA CTTTGGTTTA GGGAGCAGAGATAAGCTAGATAAATAAGTTGTCTGTCTTCTGGAAGTGTTAAGTAGCATGGGAGGTATGTGATAC TTGAGTGCCTAAAAGAGCTTCATCATATAGCCATCTAGGCCCACTAACCTATTTCAAGTCTGGCTAAACATCTCC CAACTTTTCATCATTATTGGTTTTTGTACTTCTGTCTTTTGAGTTTTTCTACTTCTTTCTTGAAGAAAGGTAGCT CGTT CATCATAT TT CGAT TT'I'CAAAAAT TTGATATGAGTTGTATATTGAAATCT CTTACAATTTAAAAAT CTATT TCATAGTT CCAATGATGT CTTCTTTCTCATTACTAATGGTGT CT TTGTTGTTTCAT CT TT TT CAAGTTTAATTTT CTGCAGATTT TT TT CATTAGATTTTTAAAAATTAGGTAATTAA(N ) xATCAGTTTTAGAATGTTTTCATGAGCCC ATAGAAAGAT CC CT CATGT CCATTAAAAGTTAATT C C CATTC CCGTCCCC
> H s 8 _ 120597158 -1206 15 223
TTGG GATAATAAGACATATGCATACAATAC CT CAACAATG CAAGATGAGTTAAATTTAAAAGATGCATTGTTAAC CAACATGC CTGGTTTTGG GCTACACTAATTTG CC CAACTGT CAT CATCTTTCACTTAACT TCTACT CAGCT CCCA TGTTTTTGCCATATGAACCTACTACCTGCGCCAGTGTGACCCCTTCCAAGCTGCCCCATACATAAAAGGTCCTTC AGGAGCACATAC CGGG CCAATTAATTTCAGTCACAT CC CAATTAATTTACCAGG CCAGTAAATT CT CCATGAGGA TTTC CCACAACTGTTT TAGAAGACTCCAATAA TTCCTGTT CTTCGTACATCTTT CTTCATAATC CCACATATCTA ACTCTAAGTACACC CTGGAAAATTACAG CATGATATTTATGTAT CTGTCAAATG TG TCTCTCTAAACATC CCATA CTGATACAAGTTT CTAGAGTTCTAAGTT CTTCTTAGAACTTGAGTTTCTAAATATTTT CTTGTTAT CTTCTTTCT CTTG CAAACTAGTACC CAGCTAGAGTACATGTGG CAGTTGTT CAATTATTGAGATAGACAAATT CACAGCACTTT TCTTAAGTAGAAAGTTGGAGCAAGAGTGAGTGGGAACAGCTACTTCCAGGTCTGTGTGAAAATGGATAAATCCTT AATAATTAACTCATTGGAAGTTGTTAGAGCCTCCAGACATGTCCTAGACCACATTCAAAAGCAAGGAACATTCTC TTCAACATTGGAGAGTGTTTTGTAGTTCAATAGAAGGCAAACATTGGGAATATTTAAAGCAATGTAAAAAGTGCC CACAATAACAGAGTTTTTAACATTAACAACATTT TCTAAAGT CC TCTATAAATATTGCTACAAG CAAAGT CTCAA CAACCGAGATCAAATCAGCCCAATATACTAAGGCAAGCTCCCATTTATACTTTTGACTTACTCAAGTAGTAAGAA TAGAGTTTAGAATGTAAATTCAAAGGATGAAAGAG(N ) xGATGCACTGACAGACTGCAAAGAGACACCACAATGC TGAGAC C CAGAACATAGAGATATATTGT CT CAGCACTGATGGTTTATGTAGCCCACAAAAT CTT TGGAAACAGTG AGAT TT C CAACTGATATTTCTTTCAGTGATGACTAAGGATGATAAAATATTTCCAGTTGC CAAAGGGGTG GTTGT GGAAATGAGAAACAGAAATAGCCCATATACCAGTTACCTTGCAACATGCCATCTGCGTTCCACCAATAAATGGAT ATCCTCAATTCTTCTGTTGTTGGCATAGTGCAAACGTTTGGGAAGGTGCTGTTTCAAGTAAGGCTTAAAGTGCTG ATCTGGTTTTTTACACTGAAATAGAAATGGAAAT CAGACTT CAGATGGAATGTCTTTT G GAAAAAT TCTTACAAA TTCTCTCTCTAGAAAGCTGAAGGAGATTTTAAGCCTAACCAAAGGTTAATG(N) xGACCATCGCTAAGAGCAATT CACACTCCAAGGCCTTTCCCAACTCCTTCAAGTCAAGCAGTTGGGGGTGGGGTGTCGGTGGGAGGGGAGGAGCAC ATG GGTTTAGGCTTAT CTGATTAGATGCTCTT GTAC CACTAAG GTCGCTACCCAG CAT CCGGGAA CAATGGAGCA GAGTGGGTGGGCAGTT CTGGTTCAGGGAAAGTAAAGATTTGTTTAAC CATGAAT GAATACTTTT CAGTTCTAATG GAAG GATCAGATATTCTC CATTTGTATCATTG CCAAGACCATTAGGGAGCCTGCAG CTGACCACAGG C CAGAAGT TTTGAAAGTT CTTAAGGAAACAAATATACA GGATGG CAG G GCAGTGAGGAAAGTGTTC CT CC CACC CCTC CCCGC CGGGGGAGGCACACACAGGAGGGAGGCTGCTCCAGGGGCAGAGGCCTGGGCAGGGCAGAGGCGGGACAACTGGAA ACACTTAC CGTGAGATTG GCAATAATGG CTTTGGGGTCAT CTGTT CAAAGAGAG GAGAAAGATTTCAAAAGAAAT AACAAC CATTAC CAAGAGAAATCATTTT TG CAG CAAGATACT CAATTCTTTCTCCTTCCATC CCACAGAAGACTA TCAC TTTAGTGTTTTGTT TGAAAGATAAATAATTGTGTAG GGGTATTTAACATTTCAGTCATTGA CATGCAGCTA AGCAAGTGGAACAAATCCTATAGTAAACTGCCCTCCTAGGTCCAGTTCTGCTCTTCTGGAGGGAATTCCTCCTTC CATAGCAC CAGGATACTG CTAGGTAAATTG CTTCTTGTTT TAAGTTCCCCCTAATGGTTC CC CT TGAGACATAAT TTATTT GCAGTT TGAC CT CAGGAACTAAAC TGG G CATTTGG G GC TTTTAATGTAACTT CT TATTTATCTG CAGAT TTTT GAAACAAGTCTT CACCCACTTTAAGTTGAAA CTT CTAATC CATTGAGTTATTTT TT TAAAAGGGAGTTTGT TTCCCTATTTATATCCTGGGTGTAATTACACAGGTAGCCGCACCTCTCCAGCCATAGACACACACGTGCATGCAC CTCCTTTATTTTCAATAGCATCAAGTAAAAGATTTTAGTCAATACTCTGAACTAGCTAAAAGCAAAATTATGTAA ATGATG GCTGTG CATAAACTGTGGACAG GATTATATTATCTGAAGCT CCCCTTTTGGG CG GAATGTGG GGAGAAA GAGGAGATTGTAGGTTGCTTGGATTTTGTTTTGTTTTCAGGAAGTTTGCTTCAGTATGTTATTACAGAATCAGCA GAAAAGGAAAGTGGTGTT CAGAAGAGATAG CAACTATCATGAGG C A G (N ) xATTTGTTCATATTTGTTAAAAATG TATGAATAAACT TG CCTTGGGATATAAATATACAGACT CCAAAGTCTATTTTCAGAAT CT TAGCT CTAAAGCTTG AAAT CCGAAACTAAAATGTGTATTCCATTGGC CT TAAT TACATCTCTTACTAGAAG GCAAATAAGTAATAATGTA ATAATAATAG CC CGAATAGATTTCATAC TATGTGCCTGT CTTTC TATAAGAGCT CTGCAGGT(N ) xAAGCCAATG GGTTTT CAGAAGGAGGAGGAGTTGGCACTCTGA CACTCAAG CAACCTGATTGAG CAAG CCAGGCCT CATAAATCA TATT CCT CAAAGGGAC GAAATAAAGTAAGTTCATGGTCTTTGACAATGAAGAGAAG TAAATAGCATGTACT CAGA TAAGTC CCAGAGAATTACTAGGAATATC CAATG C CTGAGT CCATACTGGGGACATCAATC CTTCAATT CTGAAGA ACTTAGTCTTTG CAGG CTAACAGTCCTCATTTT(N ) xAATAAGAGTGTATGTGTATGTGTGTGTGTCTCTATCTG AAATAG TGGTATGACTAAGACATCATCATCATAT CCAACCTTTATATAGCTTTGACTATGTG CTAAGCAC TGATC TAAATTATGTAAA CA CAATCTCTTGTTTAATCTT CCTAACAATC CTTACCTTGCAGAT GAGGAAG CAGAG GTACA GGGCGTCACATGGTGAGTACCGAGCAGAGCTGAGATATGAACCCAGACATCTGTGCCTGTCACCACCCCATGCTA CTGTCTCC CACAAAATGAAACCACATTG GAAAGTGT TTTTGTAAAGTACAAAAATT TTTATCATTGATAGTTATG AATT CATG GG CCATG GGAAGGTTTTTTTTT CT CCT C CAAACCAACCTCCCCATTACATGTACTAATGT CAGAAAT ATTGTTAAAAATAGAGAAAATCTTTTTTGCATAG CC CTGAAGTG CAAATACTAAAATCGAGGGGTG CCATAGTGA CTGCATCCAGTC TATG CGAGGGCAGAAAAGG GAGGACACAG G GACAGGAAAGGT CATG GAGGAGAGAGGG CAGAA ACACCCACATTT TGTTTATTTCACAACATG GTCTGGAGCTGACCCTGGAACAGTTCCAGTTGTGTTGACTCAGGA ACATAATTAG GGAAAAACTTAATTTTCCAAAG CTGTAAG GTTGGTGTCAATGGTTT CTTATT CC CAAGG CTAAAA ACTCTAAAGAATTTTTTTTTCAAAGTGCGG CATAA C CC CAATAAAACTATGGACTT CT CAAAAG CTAGGTAGACA GC AC CT AAAAAT CTTAGT C CACAAATTAAGTT TGTATC AGTGGT AGATCATAAC CTGAAC ATGT AAAACATAAGT GACAAGATTATTCCTGTTTATTTTTATTTCTAATCTTTCAGGACAGCTGCCACTTCCCAAGGGCTTAATATCTTT TCCTACTAGGTTCTACCAATCACTTTTACAAAGAGAATGGACTCCATTCCACAAGGCTCCTAGGAACCCCCACAG TCCAAATGTGGTTCTGCCAACCCTTACATAGACCCTCTAGTTTCATCTATAAAAATGATCAGCCTGAACTTCTTA TTGAAAAAATTC TTAT CTTATAAACTTC TGTTCATGGT CATGAA7ATG CTTCAAACTAAGTGCAATTTCCACATA ATAAG(N)xCCTTTGTATTACCACTCTAACCACTTAATATTTTAAGAATCAACTTACATTTAGCATTGTTGCTAA ATTTGGATCGAATTCTTCCTAGAGTTCCAGGCACTAAAGTAATATCATCCACATTAGTTAGGTAATTACTCAAGA ACTCAGTTCTAT CAOATGTGACAT CTTC CATTCCTATGGGGAGGGAGAAAAAGAATATTTTCAGGCAAGACTAAA
fiCGAAAATCTTAACATAGACACGAAAGATAACATCCAGGCATCTCTTTAGTTAGTTTTTAGTAACTTAATGGGAA GAGC CCTGACTT TTGAGATTTT TGTATT TGACTTTAAATG GTTTAAGG CT TCTCTGTATAAT CT TT TCTCTAGAC TT TTACTTAG CCATAT CATCTTGCAAGCACTAGAAGGT TGAAAGTGCATTAC CT CGATTACATTTTAAGTCACT C AG CCAGAATA G CAA GACAAAGGAAGT CACCATGATTTAAACAACAACAA CAACAAACATAACTATTGACAGAGT C TTTATGAAAAAGAT C TA TTC TTT(N ) xCCGTAGGTTGCATCAAACTCTTCCCAGCTGGGGGTGGTATAAATGGAA TATACAGTTCTGTGTCTCTTTTACAAAATGTCATTTTCCAACTCTCTAATATATCTCCTGTCTTGAGTACATAAT CACCAGTCCTCACTGTCTCCACTGGTGGCCTTGGATGTTACTGACAAAACAATACTGGCAACAAGAGATGAGAAA GAAAAATGTCAGTT CCAATGAC CACC CAAACTGAAAAACAAAATAACACTGAAAACAAGT TGTAAACAATGTAAT TAGT CAGAACTGAAAG CTAGCAAC TAAAGCAAGAAG GTTC TAGGTAGGGAATTTTTTTAAACAACAAATATTTCT CCTC CACCC CAAGATT TGTTGTGTGT CC CCATCAAGGAAAATTCTTTTATTAAAATGCTTTTAT TTAAAACATTG TTGTAAACTCAAATCTTAGCATCATAGACTATGGAGGCTACTAGAGATCTTATTGAAATGTATGAGATGACCTGG AACGTTTTAATATATTGCTCAGTACAATTCAGTATGTACATATTCTCTAACAGGGAAAATAACCCTGTTGGAGAA TACTCATATATTTATATCCTGTAAAGGATGCCCAAAGGTTTTAAAATAAGGAAATACAACCCAGAGTACCCCAGA TCTGGTAAAATTTAACTAATCACCTCCTAGGATCCCAGATCCAGAACCTTCCCTTGTTTAATACAAATTGGAATG AAAT GCAAGAAAAAA CACACAG CATAAG GATTATATAT CTTG CACTTTAACAGTTAAACC CTTAG GAAGAAAAAT AGAGCAATGAAGAGCTACTATGTTTTAGTACAATGTTAGCGTGGGAAGGGAAAGTCCCCAAATACCTATTAATAG AGGGCTGAAAGTCTTTCCCATTTGTTATATTTCTAGGAAGTCCCTTATGCCACCCCAAATGTGAGCAATCTCCTT TACTTTTGAAGTCCAT( N) xAAAACACTAAACCATACCATCTTTTAAGAGAGAGGTGCTGTA( N } xGAAGAAGTG CTGTAGTTGTCCCAAACCAAATGGATCTACTCATCTTGGAAGCCTACACACTGACAAGCCAAGGTTTTAACCTAT CCATGACCTTTACAATGCCTCAGCACATATCATTGGGACCCTTCTACTTAGATGTCTGGCTCCAGCATGTGAGCA AATG CTCTTGACAC CT CTGGGCAACTTAAATTGAAT CAGCTGTGACACATGCTC CAACTT CT CT GACCTGTGTGT AG CT CTCCACTAAG CC GAATGCAT GGAG CATGACTT CG CAAAG GGTTATATCTAG CCCACT CAAGAG GAGAAAG C TAATCCCTTTGGC(N)xCTAATTTCTCATAGATCTTCTAGGACATTCTGGAAAGACCTTCAGGACCAGAGCTAGG GAGATTTGCCTACCATTCTTAACAGAACTGCCAAAGAGTCñGCCAATTTTCATCCTCCGAACTCCACTAGAAGGG AGAAGCCCCAAATTCCAATGCTACTGTTTCAATCTAATAGTTCCGGAATACAGGAGATACCCTTTAGAACTCAGT CTGCATGTCT CTGAGC CCCCAC CC CCTG CACAGCTCTACAGT CATATAATGTTTATGATG CTCTTGAT CTCCTCA AAGAATTGATCATAAATTTTTCTACTTCATAAAGTGGGATTTAGAAAAACTACTTCCTATAATGCTTCTTCCTCT CTTTAACAGTCATCTGAATTACAGCAGGGCCTTCCGGTTTGGACCTTGACTTCCCTCAGTGGATTCGGGGAAATG ATTTGACTTACACGCCACTTCTTTGACAAGAATCCCATATCTCTGCTTCTTTATCATATTTAGATGTATTTTTTA ACCATATAAACTCCATGGGCTCGTTTTGGGGATGTGGCTGGCAGGGATAGGAGATTATCACCTGATATGTCACAT TCA CAGAAGAAG CTAT TAATGT CACAAG GAAACTGT CTGTGTTGCTTTAAAGAGAGACAGATTT TTGTAGAAAGG T C TTTCCTGCAAATAACAGAAATATATAAGAAGTCC CAGCTTTTGATCAAAATGAACCCAGGGAT C TCTCAGGT C ATCTCATACTTTTCAATCATGGCCTGATTTCTATCCCCATCTTAAAGAAAGCGGGGATGAGGTTAAGAATGGAGT CTTGGCTTCTAAGG G CAAGGATT C TTTT CCTCAATATGAT CAGAAACATTAT C CATCTAT CCTATTAG CAAAGAA AAAACAAAGCTTTACCATGGT CT C CGACAAAGATGACGTTGACACACCGATG CAGTTTTAGTTGTTT CAGTCCAT CCATTAATTGCCCCACAATTTTGTCGATTTCCCTCAGAGGATTTGTCATCTAGGAAAAAGAAGCAAGTTAGTCCC CACAG GGTTTTCTTTCTTTCCCTTTCAGTCTCTCATAGAT CTTCTACC CCTAGAACATTTTG GAAAGAAAACTT C CGGCCCAGAGGCAGAGCTTTGTCTGGGGAGATTTGCCTGCCATTCTTAATGAAATCTGGTCTAAGCACGTTATAT TTGTATTCTTTTGACAGAATTAAAACCTCTCCTCATCCAATCCCTTTAAAGATTCTTTGATATAAAAATGTCAAA AATATTGTTAATGAAAATTCTTTAAGAGAGAAAATCGACTTTCTAATATGTGACAGCATCCTTTAAATCCAAGAG TTTTGAGAGTAAACAGAAAGTCCCTAAGTGAAATCACATTAGATTTTGAACTATGGCAAGAAGATAGACGCTAAG TGAATTCAAAGCTGTTCTGAGGTTAAAATCACAACGTTGCACTGGATAATAAATTAGAATATCATTAATACAATG AGATCCATCTTGTTCTAAGTAATTTAACTGTTGGCATCTGAAAGCAATAGATTCTTAACAGAAGAACCTATTCTA AGAAACAAGTTATTGCTTCTCACTTTAGTTCTTTTAAAGCTGATTTTGAGTGAAACAAGAATCTATAAGAGTCCT AA CTTAAAAA GAAGTGAACCATGAGTTTGTTCTAGAAGGTTT CTTCTTTAA CAAAATATGAAAACT CAATTCTTT TCTGAAGACCATTTCCAAAGCCTAAAAAACATTACCCATATCTCATTTTCTTACTTCAAATGGAAGTTTCTAACA G CAT GAATGT CCT CACATACTAGAGG CT CTCCATCC CC CTTT CATCTTTAGCTGTGCAATGGAAAGATTTTTAAG AAAC ATCTG ATC AG AC AAAGAATAG AAAGATACAGGG AGT AT ATTTAG AT GAATTTTATAG CTTAG CT AACAAT C A CATAAATATAC CCAAGATGATTAAGATAAGAACAGAAAG GATGTGAATAT CATTTTTACTCG GACACATAATAT TATCTTCCACTAGCATCTTCTCAGATGAATCAAGACCTTTCCTCCTGTTATTTCATTGGGAAATCAGGTGTAAAC TTTTGTTTTTTTCTTCATGACGGGAGTCAGTATTTCAAGTCCTATCCCAAGGAGGAAGCCACACTTGAAAGTTTC AGAAAGACATGT GCAGAATAAT CGGAAGGGACTAACGG GTGTGGAACAAAGTGGTATTTG CTTCTGAGAAAATCA TGCTCTGCTCCAAGATCTCTGTCTAGGTAAAGAAATACCAAATGAACAATGTGTTTTGATCTTTGGCAATGACTT ATTTTCCTTTACTCAGCAGGATGACTCAGGTAAAATCCAGAACCTACTTCTTAGaGATCAGTTCAGAGGATCACA AATAAGATACTGCCTCAGAGGTATTCAAAGGTTTTAGGAAAAGCAACAACCGATAGGCTCTTCTGAACATACTGG GAAGAGTAGACATTACTTGTGGAAATTACATACAACTCTATCCGCCCATTATTTTCTCTTTCTTTCTGTCTTAGC TATTTCCTTCTCTATAACCTACACCTTTTCTGATCTCTGAAGGTACAGACCCCTAATCTAAGCCAACTTCCTATT GAGTTCTAATC CACAT TGT CTTTGAGAATACCTC CTTTTACCTT GCAGGGTTGTTC TCAAGAAC CAGG GC CACAA ACCATGTGACTGATTCTGTGAATAAGTGCACTCTTAATAGTCACTCCAGCCTAAATGTTAATACTAGTTTTAGGG CAA CTAAC TCTTAAAAAAAGGAAAAGAC TT CTTTTTTTC C TGTCATGTTGAGTT CAGGCAGT CTT C A CAT CAGTG TTAACCTCTGAAGTACCACATTCTTCTGGACTCAAAGCCAAGGTCAATGCACCAGAGTTTTTCTAAAACTTGCTC AG C CGATTTTCCAACCAAGG CACTGAAGGCAACCTT CAACTCAGA CCTGC CCGC CACAAAG CTTTGGAACAATAG ATAAAC TTACTTTGTC CTGACGAGTTTC CG CAGCATAATGAT CCATCCTATGTATTTTTCTT CTTCTTTT CTTTG GAGGAGCAACTGGTCTTTCCTGTCTCCTCTTAGGGGCAACTTTCCTCTTAGGTCTCTTAGCCGGAGTAAAAGGTG AGCCATAACTACTCTCCTGTACTGGCGGATAGAGATGTCTGGACTCAGAAGTCGTCTGTCCCTCTGAGTCAGATA TAAAGC TTACACTTTC CCAT CTGGGATCAG CAGAGT CTCAGAATAACTAAGGCT CCTTAAGTGAGAAAAAATTGG AATGAGGGATGGAT GATTAGAGGAACTCTATGGAGAGAAGCCTC CTAAAT TGATGTTGATACTG TTGT CACAGCT TCTTCG CTGAGAA CAAGAAT CCAATTTAAGAAAATCTAGT CATGTGGGTTTCTTTTGGCTGATA TTTATGGATGT AT GGGAAGGTGGCATTAGACAAGCTCTGTATTCCATAACT CCTG CATTAGGGAGAGACTGGT CT TTGGTTATGAA AAGCTCTC CATTTATAAAGAACTAACTG AAAAGT AGGCTAAATñCT AG CC ACTT GGTCATTT CAAG AGGACAATG AGGTTCATGTAAGTGATTCCAATTCAGAATCTGACAGTTATCTTTCTGTTCACGTCAGCAACTATGTCAAAGCAA CTGTCACCTTCATTGGCAAGGATCCAGTGAAAAATTTTAAAACCAAGGACGGAATGAAACATAAATAATAAGTAA TCCTCCTTAATCATAAACGTACTCCCACAAATCAGGACTTTCAGCAGGAGAGAGAACTCACTGACAATTAAGGTT TT CAGTACAGAGGAAC TCTAAGTTAAAGTC CTTTTAAAAT TAG GTAAAAT CTAAAAAGTCAT CAAAAAGT CATCT CTTTTTTGTCTTGGTATATACAGACCAAGAAAGAGTCTCCTTATGCTCTTGCTATCAGAAACCTGCTTTTAAGGG GCGTTGCATGCTATGGGTTTTGTTTTGAAAATCCAGAGAGCAGTGAAGTTGACACATAAACTGAAGATAACCTTT TGGGATTT CCAAATGCTGGAGG AAAAAC AG GCCC CTGATTTG CTGATGGTGCAAAGGTGAATTAAAGC AG CAAGC CTTGCTGTCTACCTCACTGACTTCTCTCGGAGGGTGGGTGGGGGGAATGTCTTGAGAGCCATTCAGATGCAGGAA GGGGCGGGAGGAGGAACAATGTTAGTGTATCCAAGAAGGGGAACATCTGCCATGCGATTTCACTGTTTCCAGGGC AAGCCATGGCAAAGGCACTTCTGTTTCTGTCTCATCCTCTCTCTCCACCAAAGCAATCAAAAGACACAGCCCCAA AGACTT TGGTTTTGTTTGTTTGTTTTTTAATGCT CCTAGTTTTAGGCAAATCTC CATAAGAAAAGCAG CAAAAGG AAAAGAGGATTTTCCGTTTCATGGTCACTGAATACATTCTGCTAGCGCTCAAGTCGGCCCATCAAACTCTATGCC AT TTGCAAGAGACCTCAACATT CCAAAGACTCCAAAGATGAG CC CTTC CAATTG CAGCAGGTTAGAGGAG CAGAA GAGTAGGGTTTAGTCAAACTGGGTTTTCATCCAGTGATTATATCGCATGTATTTCCTTCTTCAAATGTTAGAGGT ACATGAAAG CTAAGTTTTCACTGACCAG CCATA CTTTAG C CTGAATTTAGAAATTTTAGCCTGAGGTTATTAAGA GGATGGGAGGTAGAGGTGCAAATTTCCCTGCATTTCAGGATACTTTGCCATCTGAAGGTAGAGACTAGCCAGCCA AAG CTCTG CCTGTA GG CCCG GAAGAGCAGC CGAACAAGG G GT CACAGAAAAAGCATAGAAGGTCAAAAAGAAAAA GATTGCGCCAATAATCAAAAGCAAGCATATTTTTGAACATTCACCATTACACTCTGAAACCA3GAGCAATACACC ATTTCTTGAACTATTAACTCAATGTTTGGTCATCTCATTTAATAGCTTTTGAGAATCATTCTGAAAAAATGTGGA CTCAAATATTTAGCAAATAAAGCACAAATTTCAACTGAATTCATCTTATTGTAGCATATAGCATTTTGGTAGTTA AAATCATTTCAATGAAATTTCAAATATAGGTTATTTTCTTATATGTAAAAGATACGTTACTGATTTTAAAATAAT CTTTCAACTAGTATGTTATAATCTTTTCATGTATAATAAGTGTAAATGATGATATATAGGTAGGCATATAGGATA CACAG GTGTATAAACCAATTAG CCTACAATAGAAGAGAATTTTTGCTGACTTGCTTTGGTTT CATGTT TTAAAAG CATAAACTATTTTGTTTTTCATTCAGTTAAAATAAGATTCCTAGAACCAGATGAGAAATAATTGAAATCTGAAAA TATCCTTTGCCTTATGAGAAGAAAATGACCAGGTTATCAGATTACCATTACTGAAGATATACTCCAAGATACCCA AT TAAGAG TTTATC CAGTAG GCATATG AAAATA C CCTAGCTG CAAGAAGTAGTAATACTGAAAAAAA CATAACAT GAATACTT TCAACT CTTAATTCAAAGGTACTATAGGAAAAAT CT CTGTTTATATAT TTGTGAGCAAC C CTG GGCA TAGAAAAATTATTATAGCAAATTAAAGCAT CCTGTATGC CGCTT CATTTATTCC CTACTCAGTACTTT CAAAAAA ACTTATTTGTTTCTTAATTCACACAAACATATACTTTCATGCATTTAATTGTAACTGAACTACACATATTAGAAA CC CAAAGATGTTTATTGATTAAGAACAT TTATCATTAAGC CATATAA CTTTTAAGGTTTGAGTTATAC CT CATCT GTATTATTAAAAGC CAAACTTT TGCCATAAAGAAAATGTT CTGTATAA TT TGAACATAGTTATTAACT GACCACT TTGCTTATCTCCTGGACTGCAGGAGGGTTAAATGGTTCCATGGAAGGGAGTGTGCTCTGGGAGTCAAAAGTTCTG TAGTGCTATCTTCAGGTTCTCCCCTTAGCAGCCAAGCGAAAGTACCGGAGTCCTACTTTTAAAAGGATGGCTATA CCAATC CGTTTTGCCTACCCCACAGGTT CTTTGAGGGGATGCTATTAC CCAAGAAGTCAAAC TTTTAAGTAGGGA ACATGGAACTGCTTAGGTGTTTTGTTTTGTTTTGTGAACATTTTGATAAAATGTGGTTCCTTTTGTTCCCTTTAG GTTAATCTCAAAGAAAGTCAATCTAAAAATAAAACAGCCATTCTTGCTTTGGTGTTCTTTACTGGCATCATGTAT GCAGATGTTTATCTCATTTTATCATTTGTGTTTCACAATCATACAAGACGGGTTAAACTTCAAAGGGACAGAAAA AGAGCATC TGACTTATATAACTTCAGTGATTTAAGACTAAGA GTGGTTTTACGTAG CTATTA GTTACTAGTGCTT AAATAAAT CACCTCTGAATTGATGTCAAAG CATT CAGAAA CACT TATGACTCCTTGGTCACATG CTGCAAGTATT GTCAAAATACTTAAGTATTGTCAAAAAACTTAAGTATTGTCAAAAATACTTAAACCAAGTATGTTTAACCAAGTA TGTTTAAACCAAGATATGTTTACCTGTATTTCTAATTCATCTGAAAACAATGAACTTAATTATAGTCAGCAGTGT CCTATGGGTTACTAGGCTA(N)xAGGTTAGATTTGCAAACTCCATCATATTTTTTACTATCTATGCTGAAAATGT GGGAACATTTCTAACCATTCCTTTCCCACCTTCTGAATTTGAAATTATACTGGATACTACCTCACAGATTGTCTT CCACAATGATGGCTGAGTGTTTGATAAAAATATGTCACCCTC( N) xGTCAGCCTGCTCTGCTAAGCAGATTCATT CAGTGGTATATTTTTCT CAAGAATTTAT GTTACT TTGG{ N) xCAAAAGGGTCCATTACTATAACTTATGCCAAGT TGATTGTATTATAT CTGCCAAGGTTATTGGAGCTGCTTAG CTGATCTACCCTCT CTAAA CTG CCTAAT CATTT CA AGGGTG CTGAAATT CATTTACAATGTTTTG CATT GCAGGT TT CATCTCAAGATTTT TTTCCATG CCTAGC CCATT GAAGGCAAATGCCAACTCTTCCGAATTATTCACCCTCCAACTGACGCCCACAAGGCTAGAATACACCCCATCTCC TGTTAACATCAAGTTTA(N)xCAAATCAATCTGAAATGAAGCTCCTGTAACAGAAACAGGTCGTTTTAAGATTTT TTAAAAAATTTT TAGT CTTGTAGT'i'ATTTG ACTTTG CATAAACC TTGTAGATACAAACAT CC AT CCAGACTTTAT ACAATT C CTGAAATAAAG T CAGTAGATCAG GT AG CC CAGG GC CACAGATTCTTCTCAGGCAGATGCTAAAC CACT AACCTCAG GG CC GAAAGGGCCATATTTGTGTC CAGAGAAATCAGGTTGCTCAGAATAGAAGG CATñGACC GAAGG CCTATAAGAñAATTG GAAGGTTCAGAAT CT CT CCTAAAC CA CATGTAACAAAAAATAT CACATATTTTTAAAAAA CGCATTTAAAAAAAAATGTTCTCAAGTTTAATTTATCCTTTAACTACTATTCATGTTAGCATGAAAAACTTTTTT TTTT TTCTGTT G CTTCTGTGACCACCAC CC CATCTGTAGTAACTGTCTGGAACCAC CTTCAG GAAT CAAAAGTCA CAAAGGAAAACTTGAACCTGTACTCGGATCCTTGACCCTCAAGATAACAGAGCCTGGCGGTAATTCACATGGGCC GTGCAATCTG CAGC CTGAAACAACAACTAACTGCAT TACAGAGAAGCGTCTCTC TTTATACT TCTG CAAT CCTTG GGATGCTGCTGGTT CATT TATCAAAGCCAGAAACTC CCTTCGGC TCTGCTGTAGTG GACAGAAT TAATAG CTGCA GTAATCCAAGTAATTAAATGGTCCCAATAACCAAAGTCTATTGTATCCTTCTTAGGGAAGTAACACTTAAGGCTA AGAACCACAAAACTTTTAATTCCAAACATTGGCTGTGCTTCTATTCCACAGTCTAGTGTTTCACGCAATTCTAAA CTAAGATA GAAATGTA CT GAAGAAAGAAGAAAGAGAGAAATAAATACCTCTCATGATCTGGCAGGGTGAG CCACT GCAATATGGTTAATATTCTCCGCTCGTGAGGGATGACACTGGAGGGTAAAAGCAAGCAAAGCATGTTGTTAGGTT TCATTT CCGCAAA CTCAAGCCTGAGGAGTG CTCGCTCTTGCACC CAGGCCAAAT GC TCTCTATTTACCAC CATCT GTTTTCACTT CATTTGTAGTAAAAGGCC CT CT TT CGAATTAG CGTTTAAACTACTATTTC CATTTG CAACATATT ACGTTGCT CAAC CT CTT C CACAGAAGACTACACTGGAAACTCTGATTCACAGTCAGAATGAGGAAC CTGT CTTGC TAAT CAGAAACT CCACTACAAGCAGAAAGGAC CCTGGTTT CAGAATCATTGTCATT TATT CTGTCGAG CCTTTAG CTGTGGCCATAAAGGGATCAGTACTTAC(N)xATGATCATAATGCGTTATCAT(N)xTAAAAATCTACCAAGAAT A(M)xTAGGCACATATAATTAAGAGACTGACAAAAAGGGTATAATCTTCATCCACAATAACTTTTGTGTTTAAGT CTCTTTCC TCTGTTGAAATATAATTACC CTAAGT CTGGCACTAG CTTTAAGGGCTC TAAGGC CTCACAGCTAACA TGACAGTATTGCATCCCCGGCTGAGTTACATTCAGATGCTCACAAGCTGAGTGATATCTTTAGTCAGGGGAAATT CACCCACTCTTTCCCAACAGGGAGCACAAAATGCATTTTCACTCCCCGATCCAAGACCAGGCATTAGAGGCATGT TCAT GATGAATT CAAACTAAAGCACAAGTTTAAATGTG GT CAGGAAAAATTATT GTGGAACATTGGTGAG C CATT TAAACTTTTACT CT CAGTGGACATGAGCAGTGAGAGAG CTGGGAGAAGGATAAATC CT CAAG CCGGGGCT CTCAT CAAGTGGTTTCTCTTCCTCCCCCATCACCATCCAGCTCACAGGGATTTGTCATTCCCCAG
> H s 8 J L 20185767 -120 205 63 6
TGGCAAAGAGTT TATCTAAAAATAGGTCACTOA C CAG GATTGAAGAGGAGATAACT GA CTGAGAGATAGTAAAGA GACAAATT CAGTGATATT CCAACATGGATTAGATGGGG CTAGTAAGCACATCTC CC CAACTAGAGTTC CAGCTTC TGAGAAACGATGACTATTTCTTTATAATTCTGAATTCTACAACTGACATATTATAGAGACTTAGTGAATATATGT TGCATTAAAA GAGCAGGGTTTCTAACTTTATCTACT GAGT CA GTAATAGCGTTATC CACCAA GATATGTAATACA
{ N) xAATTGGGAAACTATCTGCATGCATGGCAGAGGCTGTCAAAATTGTGCAAACAGATAAGCTTTCTTAGGGAG CATAGACTGAAAAAAAAGTTGAGGCTCAAGAATAAA( N ) XCAAGGTTTTCTGATAAAGTTTTATGTCTGGAAAAC CGGC CCGGGGTGAAATACAGAGCTTGCATTAAATGAGGGG CCTT CAGGAAGAAAGT CT CAGG CACTGGGATGGGA GGCAG GGTGGATGCAGAAAATATATTAGAAAAAACAAGTG GGAGGGTAGTGGTTAAGGGC CTTGAGAAAAGTTGT GAAG GTTT CAAATG CAATGCACTGTACCAT CCATGTGG CAGATC TGTTCTTTCACATT CT CAAACAAT CACAGTG TCAGGAGTGCAG CTTCTT C CAGTAACAATGGAAAA CAGTT TTTCAATAGCCCAACAGAGTAT CACACT CTTAGCT TCAAACTGGATCATGAATGAGTGGAGATAAAATTGGATAGAAGAGTTCTGACATTTTCTTTGGCAATGTTGTCAC TTCATATACACAACAC CAGAATATAAGT GAAATGGATGGG GCAGTACAGGATTCAG GAAACTGAGCTTTTAACTG ACGTAAATAATAACAGTAATGAT(N) xCATTCTGAGTAATTTTAGTCATTGTTTTCATAATGCCAATAGGTTTAT ACGGAAGCATACACACATATATGTACATGCTCATAGAAGTCTGATTTTATAATGATACACTGCTACCTAATAAAG GCTTTCCTTGACATTC CATGCAACTTCAAATTATTTTTATTTGGAAGTTACTACTG CT CCAGGATATG CTTTATT GCTTTATT AG CAC C AAAATTGCTTTTGGTC ATTTTTTTTGTTTT CTGGGC ATTTTTTATC CC AT CC AAAGTCAGT CTG TTCC(N ) xACACCGAATGTGTGGGTTAGATACAGAAAGTTTTTCTGTCTTATTAAGGAGTTTGGATTTTATT CTTAGAT CACCGGTGT GACAAGGAGGG GATGT GG GATCAC CAAGACTTAAAACCACAG GAATGTATGATOATAAG TTTATTGTGTGTGTGTGCGTGTGTG(N)xCATAATTTAATTTTTGTAGAGGAAAGTT(N)xACTTGCTGTCACAC ACGTGTACACACCTCC CACCAGTGTTCCAT CTCCTTCTAG CAGAAACCCCATCAATGAAAAGAAGGT CAAGAGAT CACAAACAGAAACTGACAAGGATTTTTCAAAACAG TAAG TTATT TTATTTCAAAAT CAAAAATATGGCACAATAT TGATCCCAACAGGAATTTTTAGCTGGAAGCTGTTTCTAAAAGATGCCTACAATTTCAATAGACCACATCTCCCAG GTGATTGC CTAACC CCTGGGTTAGTGTGAG CCCTTCTCTC CTGCTTCTCACTTC CCTTTCAAAGTG CGTGAGTCA CACC CTTTTACCTGTTGT CTTCATTCCCTG CC CCAG CTGATA GTGAATAAGGAAGACñ GTTGTTGAAACGGAGAA AGGAAAG(N)xCATTTCTTTTTTCTCTGTGCCAGGCACTAGGGCATGGGCTAGAAGCTGAGCCATAGAAACCTGC TTTGGGTCACAGAGCATGATGGGGGAGGQAATTGAGGTCAGATTCTTCGGCTCATGCCTTTACCACTACATCATA AACAGGGC TGGGTCTC TGTTATCAAGGT CTTT CATTGTTGGCAC C CGTGTTCACACAG CTTG CCGAGTTT CTAGT TCGCTGGCTGGCTCTGCCTTTCCCTCCCAGTGGCATTCTCAGTTTATGGCTGCCTGCCCTCTCCACTTGGGAGTC CTCATGTTACTTTGCGCTTGGGTGTCTGTAGCCTCCTGATTCCTGCCTACCCACCTGGATACACTTCTTCTTTGG TTTGGGGCCAGC TGAGAACTGCTCACCTGG CTGTAGTGTC TAAG CTCCATTTTGTC CC CTAAAC CG CCTATTTGC CAAATATAGC CC TGGCAGGACTCAGTCT CTTCCCAGTTC CGAAG CACCTGTCACACTCTC CAAT CCAAAG C CTGC CAGT CATTTT TATCATAT CCTTTACCGAGAGAAAT CTTTAGAAG CTGCTCTGTTTT CCTTATCTCTAC CATTTAG TATTTTTTTAAATATTGACTTATTTTTCCAAATTAAGTCATAAAGAGATGCATGATGAAGTACTCATGCTCTCTG TTTTTATTAAAGACTTAGGATCAAATTAGATCTACTTTTTTTTTTTTAACAAATTGAATACAGACAGTAAAAAAT ACATTT GTGGGTAAAT TTGGTTTAAGCATATTGT CAGAAAATTACAAGACTATGAGATTC CTGAATGG GCATTTA GC CTAC CTTTTACC TC CAGG CAGGAACATGGCTAACGTTTACTAAAG GAGAGAGAGAGAAAGAGAG CGATTC CAT TA T G TT A (N ) xACTTTAGGAGATAAACTATCTAGGCTATCCTTCACCTAGGGCCATTCGTACTGGTGATCACTGT AT TGAT GAAAGAGT CTGAGATAGACAGAAAGC GGGGTTTGAATAAGTT TTATAAGGAAAGAGTT TTTCCAGTGGC AGTTAC CATATTTTTAAAATT CAGTATGTAACG G C CAAAACTAC< N ) xCAAGCAAACAAGAGAGAGGGATTAAGA AAAC CACAATGCACATATAC CATGTAA CCCATCTTCCCTCCCTG GAACTC CT CACTGATTTC C CAATC CAGC CAC TC CAATTT CC CAAT CAAGGAGCTATGACTTTGAAATAG CGATGGTGTAGCTTTAGC CC CGATATATGAAGTG CT C CCACTTTTTGCCAGTCACTT CACAGTTCTC CCTTTATTTG CCAAGATA GACCTCCTCCCTCTAA GTAAAATAAAA GTACGGCT TTTCTAAGTTAAC C TAAA GAAAATAATACAATGC CTGTGTTAGTACATGACTTT TTTTAAGC AACAG AAGACTG GTAAAGAGG CTGTTATTTAAAA CTGAT TGAAATGGAT TCAGAGTACAG GCCCTGTGC CAGGTGGCATG AACTTGTAAC CCATGCTTAACT CAGATTTATTATATTAAT CATG GTAATC CAATACATAGTAAGTTTAAT TTTGT GTGTATGTGTGTGTTATTAAAGTTGTATTCAAATGAAAATACTAAAAG CTGAAGAACAAG CACAGAAATTGT( N ) xTTATATTTTTATTACTAATTATAATAAGTTT CT CCTT CTTGAAATTTGGATTTAGAAAT TTAGAGAAATTC CTA TACCCTTTCCAG CTAAAATATATATTTT TTGT GAATAG TG TATT CAGTAATAAGAG CCTCAATG CAGAAATT CCA GAATGAGT CTGCTTAT CATTTT GCAGTTA CTTAAAAAATAAA CATT CTTGTG CCATAAGATT CCTTGGCT GATAA GT TAGATC TGTAGTT(N)xCAATAATAATTAAGACATTCAATAGCTCGACAATTAGAAAAAAATAAAGTATT(N) x T TAAAAT TTCTTTATATTT TAAAAT{ K ) xCAAGACTTACAATCAAGTTTAAAAAACCAAGACAGTAGGATGTTA TAAGAATAAACCATTGATTAATAAAG CAGAATAAA CTGATTCTATTTCTCTCTCTCTCTC(N)xTATTGTTAAAA GC CAAAAGAAC(N)xGTCTAGCTGCAAAAAAAGTCATCTGGCAAGATTTGTACTAAATGGAATGAGATAATAGGA G GTGAAAGGAAAAGAGAGAGAAGAATGC TGGT G GGAAT GAGAGT TGACG GGGAAAGGTTTGAG GG GAAAT GGAAT CTAGAGAATGAAAAGGAATTAACC CTGGATGTTGA CATTACCACTCTT CAA CTGATATGAGAAGGAAGAAACAAA GAACACATGTTCTACCCTTGTGTTGCTCACAACTCAAGGAGTAGCAGTGTGTATCTTGCTGAGGAGACATTACTC GGAGAT CT CTAAAT CT CACAGCATTTGAATA CTATC CACC CCGACAC CTGGTGTGGTTGCATGCGTGTGCATGTG TGTGTG CATTGGTGTG TTGTAG TGAGGGAAACTATC CTTCTCAATGAGATGTTACTATGAAAAAATGACATTTCT CATGAG GAAACCTGACAG CTTT GAAGGTATAG CATCTATG GGAGATTCTCAAAC CAAACTTAGAGATTTTAAAAC AGGG GC TACCAACT CAATGC CTACAAGGTT CAGATGGTTAATAA TAACAC CCACAGTGTATTGAGCA(N) xCTTT AAA CAGAACTGTAGTCTG CTAAG G GT GT CCAAGATAAACAGGAGTATACATT CAAAACAG GAAGTT TGATAACTT CAGAAG CTTGATGTAT GTGAAAACAGAATTTT TAAATGTACAATATAAGGGGTAAAAG TACACACATTAAGAAAC AAGCAACCTAACAAATATGCAT CAGGAAGAAAACTTGGTAATAG CC CAATACAAAGAGAAAAGGAAGACATTTTA AAATAACAAA CACCAC CAACAAGAATGCTGGG CTGCTG CACTGACAAG GACAGTTGAGATGAGACATCACT CTC C AAGAAAGTTAAATGTGAT CTGGAAGAATACAT CT CTTG CTGTGAAAAT CCTGTCCCACTTACAT CAGACATACTG TATT CAGTGA GACATTAACAAGTAAACAGCAAAT CCACATCT CACAGGG C CATTG GAA GAAGATGGGGAAAAACG CAGATCGATGTGGG CCTAAATACATC CCAT CTTTAATTATTTTAAAACTCTATCAGACTTTCTC CCAGGCAGATG CAAC CAGGAAATGTAAATGGTT CTAGAAAT CA TTAC CAAAATTT T T T T ( N ) xTAAAATACCATTTTGACTATTGA GT TATTATAG CAATTCTTTTAAATGG CAT CAT GTAATACAAGTTAT CATG CAGT GAAACTGG CATT CG CACC CAT G ACTG AC AC(N )x CAGTAGGTAAT CT CATT CATTTTTTAAAAAAATAG CACAGGAAATGCTTAAAATATCTCATA AAAGAGGAAAGAAAAAGGATGTTCAGAT CTAT TAC CGTGGCTGTG(N)xATCACACGCCATTATCTAACATGAAA CCTGAAATGC CATAATTAGATG CATTTATTTACAT C TTGAAAAATTTGATAATCAACT CACATATAA(N )xTTCA TGTAAAGATGGC TTGACAATGGAT GAATGGAT GC CTTAGT TATATAAATG GCAAGTGGAAAGTGGTAGGTAG CAG GTAAGTAG CAG AAATTTT CTTT GAG GATGTAT CAATTAAGGACC CCATTTGCAATGAATAG AAACACT CTATTTA CCTGCTGCATA CATAATGAAAG CTGTCACTGGAGCAGCTGCCACTG CAAGTT GGGTATTTTTTCTGCAGAGCA CA AAT(N)xGAAGATGTATCACTCAGGACC<N)XTCCAACCTAAGCCAGCTCAAACAAGAAAATTTATTGAGCTGTT GAA CTGAAAA( N ) xAGTTTGGCTTTTCTCTCCAGAAAGTCTAGCTCTACAAGATGGTCTCTGGCATATCTAGGTT CAAGAAAT CCTTAGAGCAC CAATC CAGTGGAAAGAGTTTATCTTTC C CAAACATTG CTA CAG CACAGGAAC CTCA GAATTTAATCTCT CAGGATGGATCTGGAT CATGAATAGAATATGAAAACTGGTCAG GG CT C CACTATGTT CTTAC CCTTGGAGGCAACAACAGAATT TTTCCTACCC{ N ) xCTCAAAGGATAACTTCTGAGGTTGTTACCAGCTTACATC ATGTTGAGGT GCAGTT GAAAGGGAGACTGTGT TGAACTTTACAG GAAT TAGATGTAGTAAATAATAGGTGG GGGT GGA CATGAAAGAATGAGGAGGAGAGAGAAGAAGAAGAAG G CTAATGAATAAGTAG GAAGC CAGAAAA CACTGACT TTATAAATATTC CCAATGT CTG CTGG CACTGACCTAGACTAACAGAAG GGGT CAAGGTAC CA CT CATT CCATGAT G CACATG TAAGAGC CTAT CCTAAT G CAGGTAAATAGGTAGATAT TTGAGAAGTCAGAAATAAAACATATG CACCA GG CAAACATTTG CAAGAGATGC CTTATAAT C A < K ) xAACTTAGATGTGCTATTCGTTATTCACAGCTTCACAGGT GATG CAAATCAAAT CCTGCTTTTTGTTGTTGCTGTTTTTTATCTTTCTGCTTGGGATTCATAGACCCAAGAGCAC AGATATGATTGG CCTAA CTTGGGT CATATGGT CACCTCTTGG CTAA GGAG CATTAATGAATG CATGGATAG GAGA AAGGTATTTC CC CAAAGG CAAAATTG CT CTGCTGGTACAATGAGAAGAAATAGTGG CTACTGAATGGTTCTGCAG GG GCATAGGTTGTAAGGTGCACAG CAGGAATCTGAAAACAGATCAC CAGACCAAGACT CTTC CACAGCTG CC CGA GAATAT CATACCTAGAGTGATATGATGTAT CTAAAACAGTACTG CATGAC CTATTCACAGCC CATTTT CAAACAT TT CTGT CTGCACTACTGAAG GGG CTAAAAGGCA CAAACTG CCTTTGAAGTAC CCAG CAAGAATTA CAAAC CAAT C A CGATGACAGGACCAAAGGTAC CTGCATCTACA CAAGCGTGCAC TGTTAC CACACTGGACTGG GTAACAGTT CAC TGGGTTGTTCCTGTCT GGAAGAAACTGT TCATTGCCCC CAGGAGAGTT CTATTTTGTACT CATAAG GCAAAATTG TCTTTTGCGG CTGTAAAGAAACTCTCTTAGTT GC CAGCTTAACCATAT GC TACAC CTCCTACCTCT TACAACTCT TTGTTAAATC CCATGAC CTTCTCAGG CAAAGGTT CCATCTGG CAGAATTGCTAAAACCACAAAAA CAAAAAAGC C AGCAGCAACATTAAAGAAGGACAGTTTTATGGATGTATCTTGTGTGTAGATGCTTCTCTAAGATTTTACATGTAT TGTTTCTT TTACGATG CAATA T { N ) xTGAGCTGGGTCTTCACTTCCCAGATTCCTTTACCCATATAGTTCCAGTT TAAGTT CAGCCACGGG CAAATCTGAGTGAGAT TTGGTGAGGTG(N)xAAAGAGTTTATCTTTCCCAAACATTCCA ACAGGACAAGAACC TCAGAATTTAAT CT CTCAG GATGGAT CTGGAT CATGAATAGAATATGAAA ( N) xATGTAAT AGCTGAATTAACTTCCATAGCATATTGCCATAGCAGTTTTTATATGTAGACACAAGTATTATATAACTGTCATAT CAGTATATATAAAGATATAACTAATGTATAT CATATATAT CATATAATGCAT GCATTT TGTTAGTAGTATATAAG TTTGTATATATAAACTATAGAACTGATTAGATAGATATAAACATATAAATATTTACCCTATATATAAAATTTAGC TC CAGAATTTCATAGAAACT GG TTTTTCAATG TTTTCC CTGGTG GATATCAAAAAATG C CAACTGTGTGCAATTG AAAATTCTCAATTGTATACAGTTAGTATTTTGATATCTGCTAGTGACAACAGTTGAAAAACCAGTTTCTATAAAA CACTGGAGCCAAACTGTCATATAAACAAAAAGATGAGTGGATCTGCTTCTTTGGATAATATTGGTTATTC( N) xA AAGAAAAACAGAAAGA GATT GAGCTCATTAAC GT TCAAATGAGGAT C CATTT GAACAGAGAAGGTTAATGAGTAC ACAGTGTG GGAG GAGC CCTGAGATAG TT TAAG GC CATGAAACAAATGCAATGTT TAGAAGAAAATGATAGATAAT TC CACT TAATGAGTACACAG CAGACTAATTATTG CAATGAAAAATGTTTAATTAAAAAGAAGTC TG CTTT CTAAT GATTATAC TGACTGTAAACCAGTTTAAAAAAT CC CAAAGACAGCTAAAGACAATATATATAAAGACTGCAG G GAA AGTTTACACAAAGAAGATAAGTATATTTGTTTATTATTATGGTT TAAAGTTAGCATTCTAGCAA TCCTGGCCTCT TTTAATACAATGTTTC CTGATCTCTTAAGAAAACACCCAG TTGAACATGGGAGAAGAGAT TACT TGGAAACAAAA GAAAGT CATAGCAGAATC(N)xCAGAGAAAATATATTTTAACACAAATTTGTTTTAAAAATGTGAAATTATAGCA AATTCCATTCAGGTCATTAGTGACAGAATCCATTCATGGAATCAAAATATCAAGGAAGCCTATACAGCTCCAGTT TGCGGTGGGACTTGAGAAAGACAGATAAAGAATATTTTCACATTCTGCTTCACTTCAACAACCACCAGTGTCTCG ATGGACAGGGCATGCAACTGGAAGAGAGTAGAACTAGGAGTTTCAAGACAGGGCTGCAAAATGTGACTCTTTCTG CCTCTGATTTCTCTGTGAGCATAA( N > xTATGGATAGTGTTATCAGTTATCAATTTCAGTCTGGGGGAATCCATA AT CATT CATTTT CTG GGCCTTTTCTT CC CTTGAC TTGC C CACTT CTCTTCCC CTTCC CAGACAAATAGAGAG( N) xGATAGAGGCCATCATGGGGAACTGAGCCATGGTGGGTAGATTCTATGTTTAGAGGGTGGCACTTAAGCTCTGAC TGCTGTTGGAGCAGAGGAAACTGATAAGAGATGAGTCTGCAGAAGAAAACCCAAACAGATGTGAGATCAGTGCAC TGGTATT CTGAAGAAG CCAGTGAGAGAGAAGT CAAGGCTCTGGCTAGGAGCT GTTGAGAT GAGAATTTAGAAAAC CCAACAAAAGCAGAATGT
> H s 9 _ 65803788 -6581 54 73
GTTATCTGTTTATCTTTGTG CTTGTAAT ATAAATG AAC CAGTGG CACTTTGG AATGCC AC ATGC AG AATG AATAG AAAAAGTTTAAAATATTGGTGCAGGAGGTCAATCAGAAAGATATATATCCATTAGAATATCTAGATATGAGATGA TAAAGACATGGAATAAAGTAATAGCTCTTTATTTCTTCCCAAAAATGTTCAATTGCTCTCCTTGAAGTATATCCñ TTTTCTAAATACTT CCTCTTTATTCT CCTGTAATTCAGTAGGAT TTAAGACCAACAGAAGTATT GTGTATTTCTA GTGTAGTCAGTCTGGATTCAAGTCATTTAGAATGTTATACACAGTTGGTTCAACCACCCTACTATGTCTGGAATT GT CTTCTC CCCAGTTG CAACTTTTCT CT CACTTATTCT CTGCCT CCATTGGGAACACA GCTGGGGTGAGAATGGG AATGTTTG CAGGGTGATCCAAACCAGGC CTGAGCATGG TT CTGC CATGGCTTTACCCCACGAAGGCTCCAGT CAC AC CCACGC CCAAGGAATCTGTAGGGGACATGGAAATCCAG CTTAAAGAAGTGGCATTTATGTTT CTGCATGTTTG TT TACTAGGGATGTAACACGGC CATTTAGAGGAGACTAGCACTT TG CAGCAGGCTCCTTGAAAGAAGAGATGGAC CCCTTCATCGCAGTTAACCTGCCATGCCAAACAAACATGTTCATGGGCATCACTTTCTCAGTCCAACAGGGCCGT TGGAGCACAGAAGC CAGGCAGACACAAG CGGTATGTTGAAGTCAGAAGTGGT CTGATT CAGGGC CAGGGTGG GGG TGATGTAATGAAGTGC CGAGGG CACAGACCTTAGGGAGACACTCAC TGTCACGGTTGTGCAAGT GCAGCATCTCC TTAAAT CTTGTC CC CTGTGTGT CTTGTC TTAC CCTAGTTC CAAAACA CATCTAACTTTTCAGCAAGAGAGAACAG ATGTCAGTCATATAAATAGCAGAAATGAGAGAGCCAAGAAAAAGAACAGCTTTGTGTTGAAAAGAAGTGTGGTTA AGAATACACAGCATTTGAGGTAACAAGACTTCCTCTGGATAGGGACTAGGGTTTTCTCTGCATCCAGGCTGGGAA TCCCGAGCTCAGAAGAGATGGTGGAGATGTAGGGTTCAGAGACTTTTGCTTGCTTTTCCTAACTCTTGCTAGCCA GAAACACT CTGCAGTT TCACTTTTTTAACCTT TGGATT CATCCCATTTTAGT C CTCTT CTATGC CTTAAAAACAC TGAAGATGAGAAGATCAAAC CT GATGAAAGTCATGTTCTGTTTT CTGTGCTTTCATCATC CAACAT CAATAGACT CTGACG CTGTGG CCTTGTGT GGTGCGTCTGCGTT CGTTGTGAGTTT CAAAGC TGAAGAAAGTTA GTGGGTGTTGG CATGTTC(N)xAAGTTAGTGGGTGTCTTTCCCACTCCTGTTCTTCAGTTGCGGTTTTTCAAAAGTTGGCAGACAT GTTTGATAAAATTCTATCAAAGAAGTCATCATGATGACATTAGGGTGTCTTCCTTTATGTTTTTCGTCTTTATTC AGTGGTTGTGTGTGAT GTT CCTTCTGCTCCATTT TTTTGTTTTGTTTTGAATTT CATGATGTAC CATGCGGGAGT GACCCACAGTTCCAGATGATTGATTAAAACCCCTCAGGCTCACACTGCTCACTGTTGACACATCCCTTGAGTTTT GAATGAGATCAGTTACGTTCATCTCCACTGCATTCTGCCTTCTTGTCTTGCTCCCTTCATTATTTACTTTTCTGC CCATTCTTTTATGTTGATAACTAATCTGTGTTCTTTAATCTTATTACCCACACTAACACAGAATAACATTTTAAT GTTCAATTTTTAAAAC CATT CTACTTTAAGGTGTAAACTCTTTT CACCTGCTflAACAGAG CGCAG C CGTATTTCA GCTAATTAACCCACAGCATGTATACACCTGATAATTCAATCGCACATTTGGCTTACTGCCTTTTAAAGGCTGTCT TAGAGAGGGATT C CTATTAAAATTAAGAAGAGTT TTGAATTTGAAATATGAT CAAGTC CTAAAT GTTAGAGC TGG AGGAAAGTGTGTGCATTATTTTATTTGAACTC TTGCGTGGTATATATGGAGACCAAAGTG CTGGCACGTGCT CAC T C CAGGTCTGTCAACT CAGTAGAGCCACTACCAGACTT CCAGTCTC CCAACT CCTATT CCAGGAGT TTAGCTCTG AGAAAAGAAAAACAAAACAAGACAGAAAATTGTAGTCAATGTATATAATGAACTAATTGTTCATACATTTATGCA CAACATGTAAAAAGAAAGTACAATGAGC TTGACCTTAATTATAT TAAATAAGTAGAAAAGAGTT CTTAAAATGTA CCAAATGGCAGAATGCATCATACTGGAAAATGTATTTATTTAGTACATACCTAGGAGAAGCTTTTCAGAGAATTT GCTTAAAAGGCAGACT CTTAATACAT TT TGAACATATTT CA CAAAACAAGTTTATCCGAAGAAAAT CTTATAGTA AAATACAAAACAGGACATTGTTAACAACACTGAACACACTGAGATAACATT CTATTA CTG CTGC CGTGGAGTAGA CCAGAAAAATCTGCTTATGC CATTGC CCATGATATTTATGATAATTATTTAAGAGAAG CATTAACTGGGGGAATG GACAGCAT CTTATCTACTTTGTAATAGCAGTG TTTAACGGGATGTC CCTGGGTAATACAGAGGAGACACT CTAAC AACAATTATTGAAT CGATAG GATAAAAAACATAGAGAAACAGACATCAAGAGCT C AGAA CAAAT G GAAGAGCAAG GGGCCAGGATGTAAGGAGATTAATAAGCTTTGGGGATTGGGGTGGGAAGTAAAAATAAATTTCATCTGCTTCCTC ATTAATTTTCAGTAGAACCAACCCTGCTCATGAAACTGGATTAAGATTTCACGTGGAAACTATCAAGTTAGAGGA CTTCAAGAGTGTCAGTGTGGTGATGCACTCACAGGACAGGGCAAAGCTAGCACTCATGTTCAACACTTGAGATAG TCTCTACTGGGAAAATGTGGGATAATGT CCT C AATC CAATTTGAATGGGAACCATGTTGAGAATTCCTGG TACAT TGTATATAGAAAAATAATTAATACTTCCTAGTAGGCATTTAATATCTTATAGATATTATAGTAGAGGTTTTGAAT ACTCATTTATGTCTGA C AATAT TCAAATAATATATT TAAATATAGCAG CTATAATTTATTGC CCGTGTGC CCAGA CAGTGGACAGTGTACCAAGAAGCACTGTAATCAATATTTTATATGAATAATCTGATTTATTTTTCTCACTAAACT CCATGAAATATTTTAT T CA CTT TATTTTGAAGAAGAGGGATGAAA CAGTTGCAAAG CATCTATT TGTTTATAAAT GTATCACAAAGATATT CAGC CCACACTCAG CAGAAGTTTGAACCACTG GGGGACAAGATGAGAAGAT CTT TCATT TTGAGACGTGTCCAGGAGCATTTCCCTGGACTGTGAGAAGAACCATGCTGCCTGTCTGTGCTGGGGAGGCCAGAG TTTGTCTGAACCATGAACTTGCCAGGTGCAGGCCTAGGGCTACACTCAGAGCACTCTCCAGACAGGAGAGAGAGC AGCCCTGCTCTGCCTGAACTCTGCCCAGCAGCATAGTCCAGCTGTCCCAAGAGTGCAGCTCTCAACAGTTATTCC TC CTAATG CTGTCACT GCTTTT CCAAGTGTTGGT CCTTTGGAAATTAGTT CCTG CTGTCTTATACACTTGAAGGC ACATGATTTTATTTCTATAAAGAGAGGCAAAGTATAGAGTTTATACCTAGTTAAGTTGGCTTTTTCTCCAATTAA TTTCACTTAAACTTTATTTTGTACTAGCTTCGTATTAAAATAAATAACTGTTTCTGGAATTTTTCACACTCAGAA AAAAATGGCATTCATATTAGAAATAACATTAGAGTATTTTGATTTAAAATAGTTTGAAACTACATTTTATTAAAA TATTTGAATCCTTTAAGGCACAGCCTAGCACCTTAATAAAAAAATTGAATATCTCCCAATATTGCTGGTTATGCA ACATAAATGAATAAAAGCTTACTGTAAGAAAAAAAA ( N) xACCTGCCATGTTTGTTTTATGTTTTTGGAATCAGA AACACCAATCAGTCAGGGCCCTTTCTCTGCACTGTTTCTAATCATCCATCCCCATCATGGCAATACAGCCTCATC TGGCCC CAGTAACCTC CAGTTTTCA CAGAGCTT CTTTTCC CATGTATC CC CTCT CATGAGAG GCT CAG CGTTAAA TTCAAATGCTCTGTGTTACTGCCCACCTCCTTTGAGAGGGAAGATAGCATGAGCTCATTGAATCCTCTTCTGCGG GGTGGC C CAAAACCTGAAAATATTACGGAATATC CATAATAGGTGATT TATTGATACACCATTC CAGTTCGTGAT ATTACATG GCATGAGATACGTAATGGTGTTGATCTG CAATAG CAA CAAAAAGTC CAAGATA CTGTGCAAAACTAC AGTCACCATGGTGCCTTAAGACACACCTTCATCAACCCCTCAGATGTTCCCCATTGAGACATTGCCTCAGATGAT TGGTGC CT CTGATTTT CATTGAATAAAC CCGCACTTTTTG CATATTCCAGAGAAAGTAATCACGAGGGTT CTGAG CTGAGGAGCATGCAGCCCACTCCTCAGGGCTACATTGTCCATCTCAGTTTTGGAAATTACCAACCATCTGGTTCC TCCCTCCCTTGAAGGAACGGGTGGTATATCTTCTGCGGGAAGCACTCCAGGAGGTGTCCCTTAGCTCATCAACAA TGAGTGTTATTGCAACTTAATATGTGAAATATATTAAACACTTGTTTATTTTTCCAAACTTTTGTCCACCCTGTG TTAAGTAGAATGCCCATCTGATCAATGAAGTTCAATCATGATTTTCCAAGTAAATGAAAGACTCAGTGAGTTAGT TCAGCAGTCAGCCAATTCATCAGAAGGCAAGGTCTGTGTTTCTTCCATAAAAGTTAGACCTAAGGGGTAACAGAA CTGGACCCATTCATTAGTGAGTTTTATGAAACTGTTTTCATTATGTGTGGTGATGACCACCTCACTTGCTACCTC ACAGGTTTTACATACGGCAGCAACAAATGCAGCACCACACTTTTCTCTAC( N) xTCTCACATCTCCCCTAGACAG ATATTAGTGGAGAATCTTAGTTTTACACTTGGCTGCCCTCACACGGAGCTACAGTGTGATGACTGTCATAAGCAG TGGATT CACAGGCTAC TGAC CCATTAGGAAGTG CTC GATTTAAT CTGAAAGCAAAG CTGAATACGTGAGCACACA TTACACAGGAATTTATGCCCTGCAGCATGAACAACAACAACCAAAAACAAACCTAACTTCAAATGTTTAAGTCAT TGGCAATTTAGCTGACTGTATGTTCTGAGGAATCAGGCCATGAGTGAGTAAAATATAATCATAAAATATTTGATA AAATAT C CAATCTC TATCTTAAAAGCAATGAAATTCTGATATTT TAGGAGAGTTTTGTTTGTTT TTTCAAATAAC CAACTATAGATTGTTTGGATACAGACCTTTCGTAATACAAAATAGTTCATTCATTTTTGAATATTTCTGCTTATT
T (N ) xTAAAATGGCTAAGTCTTTAGGAGAGGAAGAGTGAATAGTCATTCCTCCTTTCATCTCATATTTTAAACAA TACTTTCCTTCCTTAAGTTAAAAACAAATCTTTTCCCCCTTAAGTTAAAGATTTAGCAGGTGTCGTGGTTAGGAC ACAGGGTGATTGATGTGTGAGAGGAGATGAGTGGGAAGACXCAGGGTGGGGCCGACTCTTTTCCAGACTGTGTGC
a g g t a g a a a t g c c a t c a g t c a t c g c t t c c t c t g g a c a g a c t a a t t t t t g t c t g t c t c t t c a t a g a t g c c c a t g a g GTTACTCAAAGCCCCATATCGGATTTCCCTGCCCAGGACCTCTGTTTCCAGGACTCTTCAGCCCACATCTCTGAC
c c a c c g t t g c c c t t g c a c c g g g t t t g t a a t g c t g a t c a c t t c c a a t c a g c a a a c t g t a g t g c t t g a g g g c c t c c t GTCCTCTGCCCTTGAGTCAGTCGTTTCCATCTCAAAAATAATCACAAGGGAACTGAGACACACAGCCAGTGTCTT ATGACATCATTTTCCCCCTTCCTTCCTATTTCCCCACCAGGAACAGGCCAATATAGTTGCTGCTTATTAAAAGTT TGTGGGAAGAAAAATAAACGCAGACGGTAGTTTCCCTAAATCCTAAATGTTTCTACAGACTCTCTTTGAATTATC AATTTGGGAAGAGCGTTTAGAAGCTGGACCTGGAACTTTCCAGGAAGCTGGCTTAAACCATCCTTACCTGTCTGT CCACTACTAGAGCAGCCTCCAGAACATCTGCTTCAGAGGCCAGGAGCGCTCTGCGCAGCGGTGCAATCTTCAGGT TGTTGG CC CAGGAAAT CTTC CAG GACTCGC CATT CC CATT CTTCAAACAC CTTCAG GCAGTGGCTGCATATTGTT CC CACT CAACTTGC CCTCCT AT CTTGGAAATATATTTTCATTTT CTTATG CTTTTATTTAATTGTCTGTGTTC AC GCTGTTTTACCTCAGTACGTAGCTGCAAACCTTGTAGAACGATACCATATAAAATAGTTAACATCCTCTTGGCAG CACTCTCAACCTCTCCATCTCACTGAGAAACAAGTTGACATACCAGGTTGGAGATCAACACAAACTAACCAAGAA GCTACACTGTTC(N) xATCACACTTTAACCGAGATTGCTACCTAAGTAACTTTAATTGCTGAAGTTGCCTTCATG GAAAGC CC TCAGGTAACAGCAATTATAAAGGACACATTTATGAC CAGGTAAGAGTCTATCCACAT CAG CAGAGAA CACAATGACTGTTGTTTAAGTACTTCAGTGTAACAGAAACAGGACACACACTGATGCATGCATTAAAATGCTTAT CATCCATGACCTGTGACGGTATCTGATTTGGGCAGCTACATTTGGGGATTGGAGGGTAGGTGACTCAACCAATAT GTGGCTTAATCGGGGCACGATAGAGTAGAGAAACACCCTGAAAGGCCAGAGTGAAGTGAATCAAAAGACGTTCTG TGATTTTGGGGGCACCAGGATTCTCCCATGGTCCCTGGCTTCACTTTGACACTGAAGTCCAATGGTTCTGATATT CCCAGAACATCTGTTGCAGCAGTTAGAATCACCTATCTTAGTTCTCCCTAATCCACAGCATGTGCCTGCTGAATT AA CTG CTGTTGACATCGAAGTG CTTATTTCAAAAGACTCT TCTGTGTG CAAAGTT CAATCAGAAAAA CACAGCGT AGACTTCTGCCTTCCTTCACCAACCTCAGTCTAAGGATAGGTTATTTCTGATCGAAAAATCTCCTCAGACTTGTG ACTC CT CCAGTGTAAACAAATGTTGCTGTT CCTTTAGGAGAAAT GATGGATTGACTTTATGTTTGATG CTATCGG GCCAGAAAATACTTGCTTGTAAATGTGACATTCAAGGTGTTATCTTGGTCAGCCTATGGATAAGATGGGGACAGA AGGCGTACCAGGCTTTCCTCTTTTCTACAGATCCTTTTTGCTAGAGCTATTCCTTTTCCTTGGGATCCTGGCAGT TCAC CAT CA C CATTTAGTTTT GTAGAAAAT CCATTAATTATCACAAATATTATTTACTTT CT CAGGGTTATGCTA GAATATTGTTTAAAT CTGTAACAAAAGAAACT CAGG CT CCTTGTGACGCTTACATGTGTAACAGflGGAGTTTTGT AAAAAAAT CAAT CTGTAGGCATTGATCAGAAATG CAATATGCTACAAAGTTTAC CT CTACTGTCTGATAC C CTAT CATTAATCTACAACAGGATTGTTTAAAAACTGTTACAAACTATTAGTACCCCATTCTGTAA(K) xGCTAACCCCA TGACAT TTATTCAGTAATGACAGCCACAAT CACC CTTGGTTCTTTTGGCTGCAGTTTAAAAT CAAC CGTGACTGA AGGTAGGATTAT TTAG CAGGAGAAACGT TGGC CTAGAAAATG TGTGTATCCAGATATTTAA CTT CCTAATGTTTT CCATTCATTAACAAAACTTATATACATTATTT GC CACAAC CTATTAGTTTTTGCAGTCTGAGAAATAGAGACAAG AGTTATATGATCAAATCCCGACTGCCAGGACTTGAATAAGTTATTTAGAACCTCAAGTTAGTTTGTTTCTTTTAA ATTTTTGTAGAATTAG CTGACTTTCGGGTTGC CC GTGCACATTAGAACAAAGGAGTATAAAATTGG CAGAGGAAA GAATAGAA CATGAGGCAG CACTGTCTCATT CTTATTTACT CAGGGCACAGCCCAGTGACT TTAGGCACT CAATAA TCATTTTC TGTATAAATGAATCAATATTTGTGGTTAACAC TTTATAAATGTCAACTGTTATTAT CGTTTAATACA TCAT TGAATGTCTTAT TACTCTTAATAGAATT TCAAGAAAGATACAATTAAGTTAAAT CAGGATTTAT TTGGGAT AAGC TAAGAATTATTATTATTATTA(H) xATTATTGGCCTTAATTTACAATCATGGATATACCATAAAAAGAGTA TTAGTGTT TTAGAAGTACTGTATCCATT TGAAGAATGACGGGTGACTAAATGGT CATGGGTCTT CCTC CT CATGT CTTGCTGTGTGCTGTT CCGACTCTCCTG CAATATGGAT CAGACT CTGAGCCTAT CGTATGGTG AGACT CC CAGGT G GATGCTATGTC TAAATACGATGGGTGGTTAT GTGTGCACAACC T
>Hs9_123059233-123077867
TTTGTTAG TGTAAAATAAGTCACTGAAATGGAGCAAAG CAAT CCTTTCTGTCATATTAC CAAAT C CAAGACAGTT CTCATCCAGGAATTTACTGGGCCAACCTCACTGTCGGGAAAATGCCTAAATTCTCAGGAGACGAGCAGTGTGCTA TGCCACATAATCTCAGCCTCTAATACTTAACACTATAAGCTCTGCTAATAACTCCAGGTCACTTGTTTAGGAAAC AGAGATAAAGATTTGAAAATTTTGCATATTAACACATACAATCTACCACCCAATGTTGTTGACTCTGAGTCCTAG ATATTTCAATTCACTTCCCTCCTGCTTGCGTTCTGAGAACACAGTTATTTCAGGACATGTAAATGGAGAGGGATC ATTGCTATATGGTTTCATT CTGGAACTTTTAT GTCATTCCACATAGAGCCAGTTTCCACACACATTGC CAAGATA AGCCCCTTCCCTGCAG GAAGGAATGTTT CTGGAG CTAAACATGAA CAATGATAC CCTTAAGTTT GAG T CTGACTC TCCTAGTGATCTGTGACTTTCTGCAAATTGTATAAATTCTCAATG(N)xCCTGCCTTCTTCCCCTCTCTTAATCC TGCTAATTTAATACTGAAACATAGACGTACTCTTT(N ) xAACAGCAGTAGATGACCAAAACATCTTTTCTACAGA GAGCAT CAGA G CACATTGTGATTGAATCTAAAGGTGTGTTTG CTGTTGAGTTCTTCTTGTTTGTTGTT CATTTGG GGAATT CA CGTT TGTAAACAAAAATCCAGAGAGAAGACAG CACCTAGAAAGAAAGGGAAGAT CAAACACAAACAG GCCGTTTTAAAAAGAGGTAATGAGT(N)xTTAAAATAAAATA( N } xGTGTTTTCTTTTTCTTTTAGCAAGAAAAT GGAT TGTCAACAGTGGTAAATCTTATGT CCAAGT TAGAAGGTAT CGGGAGATTTTTAAGAGT CT CTGGGATTGAA AACCTGTGGTAATACAGACCTCTCAGGGTGAAGTTTACTTTTATCCCTGTCTATGATAGGAACAATATTTGCTGA GTGCATGAATGGTGCAAACACAGACTCTGCCTCCACTTCAGGGTTTGATTTGATAAGTAAACACCCCTCACTCCC AGGATGTAGTTC TGGG CT CACCATATGAGACT CGTTTGGGACTGAGAAACTTAAAACCAC TGAGGCAAATGAAAA TAGGTCAG AATC CAG GTAAATTGTCCTTAGA CT CAAAATGACAAAGTCTTTGAATTTCAGACGC CAGGAATTGCT TCTGTGGCTCCCTGGG CACCTGTTTTTGTG GCAACTTCATA CGATGTTATTTAT GAAGAAAT GAATGG GAAAGCA GAACTCCTCTAGTA( N ) xGTTGCTGCACCACTCAGTGCTGCCTTCTTCTGGGCCTGACTGGCATCAACTCACTCG GTGATACTCAGC { K ) xGGATGGATGAATGGATAATAGATAGATGGAC (l'J) xTCTCTTCTCCTCCCATGAAAGGAA GTCTAAAGCATGGCTATTGTCACCTAAGTCTTGACATGGAGGTACCAGTTTCACACTGGCTTCAGATGTTTGTGT GTGATG GACT CACCTG CATATTCTGGGACAATGT CTTCTCATATTGTGTCCATCAAAATAAGTGTCAGGT CAATT CAATAG CATTTATTAAGTATCTAGGTTAATGCTAAGAT CTAGAGAGGTCACAGAGAAAAGTT TT CAAG CC TTCAA GTTTTTGTAGAGAAAGAACTGATATATATT CAGACAAT TAAGAAGTTTATGTAT CCAT GATT CCTCAGCT CAGGA GAGGGGTAACAATATCCCTAGACCAGGCACAGATGGCTATAAGACAAGAGCTCCAGTCCGGCTCAAAATCTCCCC TTGTATAGGAAT CT CTAGGAGATATGAC CCATAAACTCAAACTATTAGATGTCC TGGCTGAGATGTAG CACCTGC CATTTCCTCCTCACTCGGAGTATTGCTGTGTGGATTCACCTTCCTCCAAGTTCAGTGGTGATGTCAATATTTAAA AGGCCTCAAAGAAAGGAGAGCTCACTGTGTGGTGAGCAGCTAAGGATATCTCCTGGAAGAGGTGGCCCTGGGCTT AGAACAG G GACATACATACACACCATTTGTGC CCTTACTCCC CTACCTCATCCAAT CAATTATGG CACAO CTTAC ACTGAACT CA GAAC C C T (N ) xAAATTTAAAGAGTTGATCCATTTCTCTAATTCAAGACAGCTTATTAAGTAAAGA TGCT CATGGAAT CCTAAACTTACCCTAAATTGGGATAAACTGAGGTCCTCTTTTTC C TTTTTTTTTTTTTG (N ) x AAGGTC CACTTTTT CTAC CAGGAAGCTAAGGATC CTAGAGGGTCAAAGATGCCATCACTAATATTTTTGAGTAAA TGAAGAAGGAAAGAAGTAGAATGAATGAACAGAGAAACACACTGCAATTGTATGGGGAAAAGAAAATAGAGGGAA GACGTTG CAGGCAAAAT CATCCACAATTATACAAAT TC CACCTTAGAGTAGAGC CCAG CACG CCTGGGAAGGGCA GAGAGAAGCTGATCGTACAGGCTTTCTGGGAGCCTTAACATCATTGTGTCTTAAAAGGGTGAGGAGCAGGGGAAA GGAGGAGGAAAGAGACTTTGGGACTCAGCATTTCCATATCAGATAATGGACTTTAAGAGCATTTGGTTTATGACT GAAT CTTATACC CTA CAC TCCCTCCCAC CTCTATTCTCTC CACT GAA CAAGTAAAAGTGC CT CTTAGCACAGCAG ATTCAAGAGCAGAGATTCACTTGAGATGACTGTACCACAGTGTGCTCATTCCCGATTGGAGTGAAATGGTCTCAT CGCCTTTGAGTTGGGTCAGCTGCTGAGATGATCAGGGATCTGCTGGAAAGAAGCAGGGGGACAGAGGCTGATCCA CATCCCACCTCTGCTCTGAGCTCTCAGGGAGACTGACATGAGGTCAGTCACCATCACCGTGGGAGTTCACAGAAA CAGT CAGCATAATGGAGAGAAAGCCAGT GGAAGGGGAG CTTCGAAGATGGAGACTC CAGC CTTCTCTTTC C CCAT GGGAGTGATGGGCCCAAGGTTACCAAGCAACAAAAGGCCTTCTCTCCCCCTGCCACTCTCAACTGGGCTGTGTTG GGTGGAATATGGGCAGAGCTCAGTTCATCACTGACTTTATTTATTGAACAGGTTTTCTACCTGAACTCCTCACCT CCTTTCCAATCCTGAAATGAATTAGGTCATATGAAGTCATAAGTGCTCTTCTGAGCCTCCCGTATTCTCCAACTT TCCAGTCATTCATTCACTCGTTCAACAAACACTTACCCAGCTGGTCGAAGTGGAAAGGGCACAAGCTTTAGAATT GTGCTATTTATGATTAAGTCAT{ N) xCTTGAAAGACTAAGCCTGAAAGGTAGTAAGGATTAGATTCTTTTGCCAG TCATTAAAACAACCACTC (N) xAAGGTAGTAAGGATTAACTGTGAACTTACTCATAAAGCTTATAATGCAGTGAC TGGCAT( N ) xAGAGACAGGATGGGGGTAGCACTTGGGCAGACAGTGTGGACAATTCACTGCAGCAAAACTGTGGA AGCAGGAAGGCCCAGGGCATGAACCAGGGTGGAGCAGTGTGTGGTCTGTATGGTGGAGGGTCTCAATGAAGAAGT GCAGTGGAGATGAAGCTG GAAGT C TG CAACGACC CTGTAGGG CC CTGAATGC CAAGTTG GAAGTTT TGGCCACG T CTGGGTAAATGATGACATTAAACATGGACAAT GAGGTCACTTAG GATCTGCTAC TAAGGTAATGTTTGAGGC CAT GCCTTCCCTTCTCCTTCTCTTTGGATAGACGCTGTAAGGACCAAAGTGCCGGGAGGTATTCCATGCGCCCCACCG TCACGGCATATCATTAACCCACCATCATTCCCAGCCTCCACAGACTTCTGACTCCTCTGTTTCCCAGGGGGAAAC TCATTCTCAACAATGTGTCACCCACATAAGGGGACCAAGTCTGGGAGGTGGTGTGGGCCACGGGCTTGCTCACAA CTCAGTGTCAGACTCTCGTGAGGCCCCAGTCACTGACGGCCACAACAGCAACCAGACAñAACAGAAACTGCCCTT CACCAG CTGAGCTT CC CT CTTAAACATC T CTGAAAC CCATCAGCTT CTGCATGACTGCCTCCCTGCTAGCTCAGG CCCCCACCTTCCCAGCCAAAGGGTTCTATCCAAAACCTCTGGATTGTAACACACCCTGTGTGAAACCTTCACTGG CTC C CATTGTCCAAGGACAAACTG CAGACTCCTGTG CATGGACTTTAAGATG CT CC C A ( N) xGAATGAGTAGAGG CTAATAACAGCCAGGGCTAAGATAATGTATTCAAAAAGAAGCAACCAGAAACTCAAAATACTCAGTGTTCCCAAA AGTGTG GAGAGCTCAGTT TGAAC CGG GGGCAGGT CTGAAAGCTC CCAGCAGG CT GAG GGGAG CC CTGTTCAATCA GGCT CACC CCAGCGTGGAT CAGGG TATT CTGTGGAGAATTGTG CAAAATGGTTT GGAGTTTTTGTTGTTTGG TGG G GGTGGGCAGGAGC CAACT CATGAAG GGTGG GGACTGCTGAGAG CTG GGTATGACGGCTGGAGCAGGAT CC C CC C AAGCAAGAAGGGCTCCATCAGGATTCAAATTCCAGCTTCA( N) xCTAGCTGAGGGGGAAATCCTTTCATGCCAAG CACAGAAACCAAATCCAAGCCCTACAGATTGAGCTGGGTAAGTGATGTTTAGGGCAGAGTTGCAGCCTGGTGAGC ACCCAGGAAGCAGACACCCTGAAGGGTGAGCTCAAAGCAAAGACCAACCCAGACTCCACCCACTGGAGCATGGAT ATGATG CC CACAGC CTGGACTGGCTGGGTGGG CAGGGCTGTG CCTCTTGAGG GAGGAAGGAGGAGCTCTGAGGGT GGGGAAGCTTCCTGGACCCCTGGTTCCCTCTAGGCCTCCTGATGTTGACGCAATCACGTATGCAAGCAGCTGGGG CCA CAG CG GTAGATAAGCAATTCC CAGC CAGGTAGT CACAGTGCTG CCTGTGAGTG CAGGTGGGAACTGCAG CCT GGGTGAACAGAGTAGGTG CCTGACTTCCTGGGGGTGGGGGGTGGCT CCAAGTGTGCTG( N) xAATGTTATGATCC GGTTACATTGTAAGAGAATGACTGGCTGCTGGGTAGAAAGAGACTGGAAAGGGTCACACTAATATTCTGAAAAGC ATGTAAGC TCTGGAAACA CATTTGGGTT CACATTTCTGCTTCAAAT CAGCAAAATC CAGACTG(N) xAATTTTTT TTTTAATATATAC(N)xGCAAGGGCTATGAGACCAGTCCTTGTCTGGACGGAGTCCCTCCTGGAACCCACATCTT CTCACTGACCTGGCAG CC CCAGGGATGG CCTCTCTG GGG GAGTCTCAAAGAAGAAC CTGAATTC CAAGGGGAGCT CACTGAGAGCTCAGACTTAñAGTCTCTC CAGA CCAGGATTCCAG CC CTAGGATGAC CCTCAGGC CAAGGCCAAGT CTGCCCCTATAAACCTCTCCCAAGCTCAGCCTCCCTATCAGAAAGCCAAGGCAGGAGGGTGGCATCCACATATGA AATCCCCTAGATGGCAGCTCCTCCCCACTCCCTAGGACCCCATGCCAACCCACACAGTCCCCCGGCGACACAGTG CAGC CATC CATGGG CAGCAGCCAG CCTG CTG GTGTGATTATAATTAAGT CGGTTTC CAGCAT CCTCTAG GGG CTG CTCTGC CC TTTTCT CC CT TTG GTCATTTTACCTGTCATATGGTC CC CAGGGT CACAAACAATTCTCAACACG GTT TTTGATTAAAATCCAGTGAAGCTGAAAACCAAGCCAGATTCATAGGATAAAAAAATGTTAGCCACTACCTCAGGG AAGT CAACTTAATT CACTGTTTGCAC CAGGGAAAGG CTGGTATCAGAAGTAGGC CTTAAAGT CCAAGGACTGACA TCTTTGAGCACCTACCATACTCCAGATCTGTTAATTACATTCTTTCACTTGATTTACCAGGAGTAGGTGGTCTTC ATGTGTGCCTCTCTCCTGGGAGCCCCTAAAGTCAGAACAGAAACTCCAGGCTTCTGAGCCCTCTGCTCCCTCTAA CTGGGATGTCATTCCCCTGTATCTCTGCCCTGAACATCATCTTTCCATGACAGCTAAATGTGACAAACCCCAAGT C CAG CAAAAAT CACCCCCTTTTCT CATCTCC CAC CACCCACATC CTTTACAGTACTTAC CACACAG CATGATGAT AAGC CATACACATG CT CACCTCCC CCTCAATC CC CAGACT AC AG ATTCCTTG AAG GTAGC AG GG AAAAAGGCATT TTTTTAAACTGGAGAATGTCCAAAGGAGAGCGGGAATAGTGGCAGCCCATAAAGAACAGTAGGAGGAACTGGGGT GGAGGCAATGAATAATAAGAAAAATAATAACTAACATTGTTAAATAATTCCAACATCCTGGGCACTGAGCCAAGT GCTT CACC TTTGGCAGTG CTTTCTAAAG C (N ) XCGAATGCTCCCAACATAGATGAAATTGAGAGCCAGAGTTTAG TTAAGGGACACAATTAAC CCCCTG CC CCACTTTG CCAATTATAT CC CCAGGACAAATTAGCCAAGTTAGAAGGAC ACATATTGTCAGGAGCAGGGAGGGGTTGCTGAAATAGTTTCACCTGCTGTTCTGCTCAAGTTGGTGGCTGCCATT TATTTTGGTAGTTAACACATTAATTTTGGTAGAACACACTGTGTTGCTCGGATTTTGTTTTATCAGTTTTAATCA ACCAAATTATTATTTTGAAGTGCAATGACAAAATAATCTCTAGTGCTAATGTTTAAAACTCCCTCATGATTTAAA CTACAGTTTGCGAGTTTAAAAGAAGT CTTCAG CCTT CTTTTTAC CC CTTTTTAC CCGTTTAC CCTCGTTTTAGAA ATGTTTCTGAAAGACAATTTTTGCGAATGTGAGTTTCTTAGGAATGAAGTTGTCATGTGATAGCTGATCAGGCTG TAG C( N) xAGCAGTAAAAGCCATGTTCCCAGTGCCTAGCACTCTGCCTAGCCGCCGCACCCCCCTTCCTGCTGTG CTCAATCGATTATTAGAAGAATTAAGACGTTAAATTATGAAACGATTGTCAGAATAATATAGAAATTTGGACTCC CAGGGAAGCGTTCTTTTCTTTCTCAGAAACAGCCAAAATGCCTCCAACACCCTTTCTCCAGATCAGGTCACATGT TAACTCTTTTCATCAT { N) xATCATATTAAACTCTCTACCCATTTGCTTTATGTATGGCCTCACCTGATTTGATC AGGAAACCAATTTATTCATAATAATTACTTACAGAGCAGGTGGTACAGGAAAAATCTTTCCTGTTTTTCCCAGAA TTGG CAATAAGCAT CG CAACTTAGAAAAAAAGGTTG CTACG GACAACCTTAGTTTGGAGACCAG CC CCTGTGAG C TAGT GGTGCTGGGGCCTGTTTCCC CATC CTATGGACAGACCGCTGGTGGG( N) xATTAGGAATGGAAAGGGAGGA TAACTCTATACACTAGAAATTTAATAAAAACCTTAGATAATAGTGAACATTTTATATTAATAAATGTGAAAACAA ACTGGACAGTGTTCTGGAAAAAAAAAATGAACACACTACAAAAATT CACTC CA CAAAAGTGGAAAACATGAC CAC ATCAGAAAAGAAAACTTAAGCTAGTATCATTTACATTAATGTTAAAAGCTTGGAAAACTAAATAACTTTAAAAGT TAACTAGCTTAACTAAAAAGCTAAATAACTAAAAGCTGTTTTAAAAATTAAACAACTAAAATCTTTAAAAACTAG AAGCTTTATTAACTAAAGCTAG CAATTT T T T T C (N ) xAGTAGGCTGAAGAATAGCTATGTGGTGACTCCATAAAT GCAGAGAAAAGTATTTAATAAGATTAAT GCACATCAAGATGAAGAAAAAT T CAG CAGGCAAGA CATAAAAAAACT TTGGCAATCTGCTTAAGTGTGTGTCTCTAAAAAACCAACAGCAAGTATCTCACATA{ N) xGATTTAGTTGACATT TTCAGTCAGTAAAAGGGAAATGATTATATGTGAAAAGCAACATTTTAAACTTTTTTTAAACAGTCTATCTGTTAC CTACAACTTTGTTTTTTTTT( N ) xAAGTCCCTACTGAAGATAACGGGTTTCAAATGACTCCAAGTGCTCCACATC TTAAAAGTAGTC CCACAAAG
>H slO _1526769 6 -152 87 05 0
TGTT GCAACCAGGAAGGGCGCC CTTT CATCATGCGGTGAGAG CCTCGGG CACTG CTGAGACA GAG CAAGGGAGGA ACCAGGTTGTTCCTGCTGCACGAACCAAAGAAAGCTCCACTGCAAGGCCTCTCAAGTCGGCCCACCTGCCCCTAA GCATGGAGGC CT TGGGATGCTC CTGGGTGAGGACAT CTGC CCAGAGGG CT CT CCTTCTTAAG CCTGTCTCCTTGG GGTCTGACAGACAACAGTCCCTAATGTGACAGCAGCCCAGCTGGGCTACCTGCTCAAGCCCCCTTCACCCTGAGC TTCTCTTTCTGCCCTTCCCAGCCTCTTCCAGGGGCTTACCACTGCCTCTTAGTGTCCTCTGCACAAGCAAGCAGC TGAGGTGGCCGCTGGGACTTGCAGACTCAAGGCAAGGTCATGTGGTGAAGGCAAGCAGGACCCTACAGCCTCCTC CTGCCTCACCCCAGAAACCCAGGCTTCTTCCTGCAGAAGCCGCACCACTGCAGCTGCGCCAACCTGGTCGTGCAG CCGGCATTTTAGGAGGTGGCTGGACAATGACATGTGCCTCTCTGTTCCAATCAACGCCTATGGCTGGGCCTGTTA CCGGGACTGCATGTGT GTGAAG GTTACGGGGTGATCTCTCTG GCATCAT CAAGG GATGAAATTCTG CGTTACTGA AATTTCTACATCATTGATTGTTACATAG( N) xTGAACCATCTGGGAGCTACTCTTCTACGTAACCTGACTCTTTG C CAAAACTTGAAAAG G CCATTTATAAAAATTCTAAAAAGCAT CATTTC CT C CTG CTCTGTGAATG(N)xCCCTAC CCTCATGTTTATTTCCCTTTTCTTTCTTCCTTATAACGTGTTATCAGTGTTCTGTTATCAACTCTGCTCCCCCAC CACAACCACCCCCCAGCTTCCCACCTGGAGCTGTATGCTATTGGGAAACACACAACCTTATTACAGTTCTGTCTG AAATAAGATACACAGGCAGTGTGAGGGACAGGAGGACAAAATCAGATACGGAACAGTGATACATATA(N) xGGAA CAGT GATATTTGAAAT CCAAGT TCAGTATTGTTTTCTT CCACACCACACTCT CAGGTCAGTT TTTGTCTGAACTG T C A (N) xACCTTGAAGTATCATTCTTTAGAGTTAGACCTTACACAATGGAAAGACTCGGAGATATTTTCAAGTAT AAA CAATTTG TTGAACTATCTTAC TAAT TTTTTGAAAATGAG TAATGCAT CCAG CTGGCAAGGTTCA CTGGGTG T AAAG CATCAT GCATTC TTAATT TGGAAG CTACATTTAACTACAATGATTATTACACATAACAAC TGACAGGCAGG GAAAGATGTGTC CTGCAAGATCTGATGCTGTCTCCCAG CAAT CGCCAAGTGTGCTGCTTT CCTG CTTGGTTACCA AC CAGTAATGAAAT CTACTTTTATTTGCAAAGTATAAAGAAAAGTGAGTT CAAAGATTTAACAAATGG CCATCAA AAGAGAAAAGAACT CTAAAGTATGAAAAACGGCCCAAAACAATGTCTAAT TT GAATATGG CCTAAG GAGATCACA GT TACCTGGCTTTT CCTGCCTGGCACATTTACTCTG CCTG GAAACTCC CC CGTAGCAGCCAATTAC CTTCTCAC C ATTGGGTAAGAAGCAGACAGACCTGATATTACAAGCACTATTGACTTTGAAAAGATCGGACCTGAATCTCACATG TTAAGAGATTCTTAAAATCTCTTTTTGAGAAAGAGATTAGCCAAAGTACAAGAACAAAGTGGGAGTA(N)xAGCC CAAGTGACTGTTACTATGCTGAGATTTGTTAGCATGTCAGACCTCAGAACACTGGGCAGATAAACACCCAGCAGC ATAAAGGGTCAATGGACCATACAGATGACTTCCCTGGATTGACATGATCCTAAATGTGATACCCAAGTCTGCCAC TT C CTAGTCGTGTG CG CTGAGG(N ) xGTATTTTTCCAGAAAGAGGCTGCGGTGAATTCTACTAGGTCTATAAGCT GGGACTCGGAAAGAAATGCCACAGCCACTCAGAAATCCCATTGCCAAAAGTTCTATCAATGAAGACGTGATTGTC CACACTGTGAAACATAGCCATTTCTACTCCTACAGCTACATTCACACTGTGAGAGATTTGGGAACATTTAGGGTG GGGGGCAAGGTCACTTGTGCTGGTGCAGGCATGGCCTT GGATTGGGGAA C CT GCTGTTGTTT CATT TTTCAGCTG TGGGCTCTGCCTAATCACTAGCTTTCTTGGGCTGTCATGGCTCTGGATGAATAATTAGCATGGAGTGAGCACTAG CT CGTTTGAG GAATAATCTTCTTAAGAGGTATGCA C CT CT CAGATCCATGGCAAACACTCAT CATCAT CATCAT C GTCATCA tN)xCCATCAGGCGTTTCCAGATCCCTGAATATGCTAGATTTCTCTAGAAAACAGATGACTTGCTTCC TAGAGCTTTCATCTGCATTAAGCAATAATCTCCATAAGTTCCTACCAACTGTATATATAGAATTATCAATACTTA AGTCTATGTTATTGTATACTTTAACAAAATCAGCATAGAATCACATTAAAATCAAATAAAAAGCCATCAGCCCTT AAA C CACATT CT TCAGTATGGCAGAAACAGTGACAATT T C TACATTTATGA CACTGTGGGATAGT C CT CTTTTTT TCCAAATTAAAGTCTACGGCTTTGAGTAGCTAGGTTTCCTTCTTTGGAATTGTTTTTCCATGAAATATGTCAGAA GTAAACATTC CTTGTTAATTATATAG CAAAGTATTTGC CCGGTGGTCAAT TATCTAATCATT CTTGTTGAAAGCA AT CAAACTGAAGAG CAAAAGTTATGTGCAGATGGTTG GATTCTCATAG CTG TAGG CAAGTA CATTT CAG GACTGG TTTAAACAAGAATAAAATTAAGGGCAACTCATTGTAACCAACGACCTTCTGTTTGAAGCATGAAATCATGAATGT GT TGAATATTAGAAATTAACACGATGACTGTATTCAGCTG CAACACATTC CCTGG GTATTTTTAGG GTAGTTTTT GAGAAAGATT CTTAAATTTAAG CAACTGATTTTTTTTT CTTTAAGCTTTAG CTGAGCTAT CGTAAGACTTACTGA AACAAGTTTTAGGAAAGAAGCAGTTATGGGAAACCACAAAATACTGGCAGACTTATAGAAACTTTATTCCTTAAA TAGATAGCTAGACGTATTAGGATACATAATAGCATTAATAAAATTTAATGAGAGACTTCTGAATTTACAATAGAT ATATATACATTT CCTGAAAGTAGGTCAGAATACAAG CTAATTAGCCCCAT CTGGAGAGAAATTTTGTT CATTAC C CACTGAGGCCAATACCCATCTTGCAATTACATGTCAGAAGGGACTGGGTGGGGAGCACACAAGGATTGTGGGTGG CACAGCCATTAGTCCCCTGCATTGGAAAAATCCTCAGGATGAAAGACAGCATAACAGATAATTTGCAGAGAATAA CTGAGCGTTCTCATAACAGAGAAATTACAAATCTGGAGACATGTGGTAGGATGCTTG tN) xCGCCTCTCTAAATT ATTTGCCCTTTTGTG GGTGTGG CAACTC TGTCAGGCAGAGTTGCCACACC CG CAAAATGC CC TAATGC CATTTC C TACCATGTGG CTGTAG CGGGTT CACTAGGTGGCCAGTG CTTGAAGCAACT T C TGAAATAT TT CTGATATATTCAA GAAGGAATCTGGTCAATTGTCAGACTATAGGCCAGGAGTCAGCGCACTCAAATGCCXACAGAT(N)xTTTCAAAT GC CTACAGATGC CATTGTGTC CTTGC CTAGAGGGATGTGT TG CTTAGTGAC CTGGCTGGTTC CAAG CTTGAGA C C T CTG CACTGACCACTG TTGCTG C CAGGGATGTCCTGTC CC CGAGATAAGC CCACGTTAGGATT CTGGGTGTTGTT AG GATCTCAG CT CAAATGTCACTT CT C CAAAGGC TCTGAC CAGC TAAT GACT CTCT CCAC CATCACTCTGCTTAA TTGTTT CT CCGC CAT CTGAAATAAACTAGCT CAT T CTC GG TCTC C T ( N) xGAAACAGGTGGTTGTTCTGCATTCC CAGATCAG CC CATCATTCTATT CAGC CACATTAAGT CTTG GCAT TGAAGCGG GAAT CCACAAAAATCT GATAAAC TGGGAAAGGGGATTTGGAGTCTGAAGAAATCACGTTTTGTGTCTCCCCCTTCCCTAGGATGTCCCATGAGTGTGG GAAACACT CAACTCTACTGAGCTAAAGTTC CT TT CT GATAAAGACGGGAGTC CCCCACCT CCAGGGTT GCTTTGG GAATAATGGTGTAAAATGTTGGTTTATAGGAATACCTGTTTTTCTAATGGATAAGCTCAGCTAAGAATGAATAGA TAAAAACATTGTTT T T T C A (N) xCATTTTTTTTCCCTAAAGGATCTAAGTTGTGATCCCATTTTTCTTCCCCTTC AGTT TAGGAAAACT TCAAAC TG CATACTGGAAATGGCTGGGCTG CCAGATACAAATACGTGTACATGGCGTACTG CAGGGACAAG CCAGA CACTT CTGTGGAAGCTT CTAGTGAAGGACATTCTGCACATG CCGG GTGTATTCAGG(N) x AAAAAT AATTTTAAAG GGGTGATT CAAAGGAAGGGG CAAAAAGAACCACGAT( N) xCCACGACATCAAAAATGTG ATACATGGCTGGAG(N) xACGGGCCCTCTCCTAACATTTACAGAACTTGAGAAGTTTCAGGCGTGATGAAGGCAC AAAACTACATTCGAAAGGAAAAAAAAAACATACAAAGTGTTGATCTTCTTTTAAAAAATCATGTGTTAAGTGAGT GCTAACATAAAAGAGATTATAG CT CATACAAAAGTATAAAACCAAAACAAACTTACGAAAGAAA CAAGAAACAAA GCTACAGTTT CAAAATAAATTT CGAATTAAAAACAT CAAC TTAATGGT GTAC CTGATGACATAT(N)xATATTAC TGAACTGAACTTGT CACTTG CAACATACATACACATGACATCCATGGC TAGGTGGGGCTT CTTT CGAG CTTGGCA TGAC TATGATTCAATT CT T C CACTAC CTTT CT CAAACTGAACTGTAAAACTT CCCATCTC TCAACATACTTTAAT TC CACCTGTC CTTATTATGCTGGTGTTGTATCATTTGTATATTACATT TAC CATGGTTCATTCT CATTTGTC CTA TT CTAT C CAACAGTGT GTTCATAAATGCAGAC TCAGGCAATAACATTATGTGATCGTTTTAAAT TCTGTATTGAA TT CAGATGATGCAGAATATTTT CATGTTAAGTTGTAAAAGATCATGAT CAAACCC CAAAC TCCAAATTTGAAAAT CCTT GCAGATGATT CC CAGG CCT CAAAGAAGC GGT CTGGATCTACAGG CTT CAGTG GGAAACGCTGAATCT CACT GGGGAAACCTTGGATACCTGGAACCTTCATCTCTCCCCAGTACCCATCCTCGGGGTCCTCACCTACAAGGTGAGG ATAACACAGGTTGCTGGAAAGAGC CACG GTGT TCAG CCAGGAGATAGGAGTGAGCAGAGC CAGGGAGCACATTAG GCTCACACACTAGACGAACACT CG CTTTGGA CAT CAGAGACTTACGTT CCTG CTGTACC CAAAT TAAAAGGCACC ACTTATTGTGAACTTTAAAGTTATTTGGAAGTTCAATCCCAAACAATGATGT( N) xTCCAGCTAAATAGCAACCT TGGGTGAGACTGGGATGTTAAATTTAAAAACAGTAAGGCTTTCCTGTACCTGTCCTTTTATATCATGCTTCCAAA AAAAAAAAAAAAATCACACACTGCATATGGGCTAAATAATAGCAGCGATGATGATGATAAGAGCGGCAAACACAT A C G (N) xCGGTGGCTGGGCTGCACACTGCCTCTATCCTGTTCACGAATGCTATCAGCTGTGTCTCATCCAGTGCA ATATTCACAT CTAT CAATAATATT CTAC CTTC TAATTACC CACT CAAGTGGAGGCAACTGAAAC CAGGTGATACT TAAAAGTACTGAATTCTGAAATATAGTGGCTTTACTTTCCTAGTTCTTTTTCACCCAGGTTAATATTAGAATGAT TGTGTTAGATGCTGTC CTTAAAAT CCAGGC C A (N ) xCTTGAACCTGGGAGGCAGAGTGAGACTCTGACTTGAAAA AGAAAATCCAGGCCAAATAGCAGCAAACTAAGAGGGCCTTGAGGCTCCTGGGTGATGGGGAAAGCCCGCAGGAAG ACACAAATTTGC CACAGATAAT CCACAAAGTGAATACT CCATAAGTGG TAACTAAGAATGAAGTGTAAAATAAAA CCTCTACAACTC CAAGAT CACAAAAG CAGG CTATAC CAAGACGGGAAATTCTAGTT CTACATAAAATGATACGTT AAGAGACCTAAGTCCCTAAGAATGGGGTTTTTAAACAAAAGTGCCCCCTGAACCCGATCATACTAGAACAAATGT GATG CTATTATT TTATGAAAGT TC TCAT TTTGGGGGGAGACAAACACAAAAGGCTTAATGATTC CTAGGAATGGA TTGATAATACAAGATTGT TT TTTGAGTTGC TAGG CAACACTGAAAAACAAATGTCGATGTTCTGAGAT CACAGCT TGGTTGTT CT CC CATGGG CAGGAC CAGTGCTACC CG GC CTTCCTGCACTCCATTGGAGAACAG GAGTG GTACAGA GC CACTGCGATG CTGAGT C CACAG TGGCAACAGACACCAACTGG CATCTCCATGAAAAGT CACAACTGACCG CCT GGCTGCCAAGATCTTTGGTGAGACAAAGTTTTCACTGAACTGTCATCACACCCAAACAAGTCCCATGCTCTCCGG GTTCTTTGCTATCTGTGATCACTTCAATCGCTTTTTTTTTTTGCAGGTACATTACCCCCTTTTCCCCTACCGGTC TCATTAAAGC CC CATT CAC C CCTAAGAGTGATTTTGACAT CTGGGAAAGTA CAGACACTTTTCCTGAGAGTTATT TCTTTATTTCTAAGCCATAGCCTTGACTCTGAATATTTCCACTGTAAAAAGGCCTATTTCCAATAACAGCTTTCT TAGGGATATGTTGATATTACCTTAGGTAAGTTGAATTCCTCCTTTGCTGAATGCCCTGAACTACGGCATCTATTC TTTTTCAAAAAATAAAAGATGATGCCCAAGGTTCTAAAATCACGTCACACATCTACGCACAGACTTGGTAAAGAA ATGAAC TCCCTTCTTTCATT CTAATGTTTGGGGG CACAGTGTAGTACTGTTTATCC CTATAATC CAAT CACC CAC CT CAGTATTC CAAT CT GG GTTACTTCAC CTAGTCATGTG G CTTGTGATATATAGAGACAAAGATAAGCTAATATT GTAATAATGATTTCACTGAAGACTATTTTTTAAAGAGCA(N)xGTTCAATAATAGTTTCATTTTCTTTACGTTGT AAACATAATTTTGTGCTTTAAGAACTCCCACTCTTTGCCGCCCTTCCCCCCCACCCCCAATTAAAAACCCAATGA AC CTGAAAAA CAAT CTTAGATCTGTGTT TTAAATGñGT CT CCAC CTTCAAG GGCAAGTTGTCCGTCACTTAGGTñ AAAGGACTAATTAAATGAGCTTGTCATAGCCGTAACATTCACCGTCTGAAGTGCAGACGAATCTACTACAGGCCA ATTGTGAGTGGGTATTCCTGAGAGCCTGACACGAATGCATCGTACGTGGTAAGAGGCTGGTTCAGTAACCACCAT TT CTATAG CAAC CC CCACACACTC TCAGGCTG TGTTTTGAAGAGGCAG TGAAAACTGGATTCTACAAT GTAGCTT CACT CTTAGAATATGCTTTTATTT CTAT CATCTCATTAGAATGAGAGTATCCAAAT CATATAAT CCAGGTTG CCA TTAAACAGGAGAAACTTATCAAATACAGGATCCGTAAATCTTTATTTATTCTTTTTGCCACTTGATGTCCATCGC TTTAAATT CAAACCAC TGAC CTGGTTTG CCTGTGAC CATT CCCCAGGG CAACAAGT CAGATGTC CGTC TGACAAC CTGTACAT CAGACAGC CAT C TG CTGC CC CAGTTACAGGATAAAGAAAGGGTAAACGTCTGTAAAATAGAGCTGAC ATAATCAT CT CTTACTGT C CAGAAAGGCTCAG CTAAGAGG CATT CGCTATGACTAACAAAACTT CCTTTGACAGA CATGGGAAGGAAAACT GCTT CTTT CAAAGGTCAAATTGGC CACTGCACAATAAATATTCACTTTATTTA(N ) xAC AAATTCATTTTAAC CCAT TC CC CT CACATACATC TTGACACCCGGGA CACAGAGAGG GCACAATTGGATCATTAC ATATTC CG CTGATT CT TTAAGGAAAAGAGATC CACTAAATAGAGTGCT CCTT CGCT CTA C GAAT CACAGCATTTT AACAAAATAGCTTTAACAAAACATGTCAATGTGACAAAAGTCTCGATTCCAGTATG(N)xATATATGCAAGTCGT GCTT CTTAAAAAATTTATTTTA( N) xCACACACACACATAAAATTATTTTTATTGAGGGTCCATATAGCAATTAG CCAACT CATG CTGTTG TAA CAT GCTAAACAAATT CAATGTTATTTGTATTCTAAGATGGTGTTAATCTAGTT GAT CAC CTT CTAATGAATATATAGGTT GAAATATGAAGATGC CAATTTCAAGAATGG GAAATT CTTT GCTTGTTC CAA AAGAAAAAAATTAAAAGACAACTGTGAAAAACGGCAGCATCAAAAATTGATCCCACTCTCACGGGGTTTGAAAGA GAG CATCTCTAGTTCTCT CTCCTAAGAGTAAACTTT CAATTTCCTG GCAGTCAAGGAT TAAAGTGTGT GAGTTAG AGTTAAAACATGGATATACTGAGTTAGTGTTAAAACGTGATTTTGAGGCAGG( N) xAACTACCATGTGATTGTAT CATTTGAACAGAAGTCCCATCAAACCATAAAATAATTTCATCTTTCTTTACAGAGTTTGAGGTCTGGCCCTACAA TGAAGGTG CTGTCACTCTGAGTAT CAGTTGTCAACT CTG G CAAAATACAGGGGAGCTGTCAGCTAACAG GAAATA TGGACATTGGGGCTGTCTGGGCACCCCAGTTTGCTCCTGTAGAGTGTGGAATACACTGGCACTGTAATATTAGAA TGGAAAGAACTTCTGGAAGGCATTTAGACCAGAGGTCA( N) xACAGTTGGCCAAACCCAATTTAGATGAAAAATT ATTAATAGATTCACAAGTTGCTATCTGTGCCATTTCTGGCTGTTAAACAATTTGGAATTGGCAGTTATTTTGATT TACCTCAAAAATAGATAATTTAAAGTATTAAACTATTTGAGATGAT CGAT TACTGTAATCATAAATAG CAGCTAA TTATTAAATTGTTTTATGTGCTTAAGATGTGTTGGCCCATTCCCCATTTTTTTCCTGATCTAATACCGTCTTTAG
( N) xTAACTAAAATATTTTAATTTCTAGTTAGATAATATTACAATTAATGTAATAGCAACAAAAGTCTTATGAAA TATGCAGGCTTGTATTGAGTATAATAAGTGAAAAAACATATGGTTTCTTTGAAATGTCCATCTTAAAGTTTTCAG AAAACATTTTAATTCATTTGGTTATTTTTGAATGACTATCCACAAACGTTGGCTTTTTTCTGCATACATGCAATC TGTTACAACCAGGAATTTGGCAGTTACATTTTTCATAGTAAATGTGAAGTTAAAAAGACCATTTC(N) xGGAAGC GGAAATAGGTCAACTTCACAATGAAGATTCCCAATGTACTGTCATGGTTATTTTTTCAGAGAGGAAACAGAGAAT GGGCTAAC TTATATTTCAGAGG CT TCCTAACAATGCACACAC CTCTGTTTTCACCAATTTCTGTAT( N) xAGTTC CATCTG C CATG GACATCAAAAACT TCTTTAAGTTTGTTAT TTGTGAACTGTATATAAAGT CCAC CGAGGAAAGCA AGGATGCTGCTTTCATACACCTGTGCCATATAGTTACAGTGCACACAACCCATACAGGAATCCCTAGGGAAGGCT TCCTTATATTAGGAGCC C CAGC CTTATT CCTC CATATTCAAATATACTCATTAAAAAAGAGAAGGCAAAAAT GAG AAGCAAAAAATAAACAGCAGGGATGGAGGCTCCAAGATAGAAAAAAAATCCAATAGAACTGTCCTCCCTCTAAAC AAGAGCTTAAA C AT CATC CTTTTATAA CGTGTACATATACATATATA CATAAAG GTAAATTCTATGTATACAGTC AATGTATACATTGACTGAG(N)xATCAGAGTATCTTAAATTGCCTTTTAAAGATATCTAAATT(N)xCCCTAAAA TTATTCTCATGTG (N) xATTGATCAC AAGC ATTCAG AATAAATAGACTTGGTGAG GGCATAGGGAGTAACTGGTA CCCACACAGACTTGGTGAAGGTGCAGGGAGTGGCTGGTACCTGCACAGACTCGGTGAGGGTGCAGGGAGTAACAG GTCCCCGCACAGACTCAGGGAGGGCACAGGGAGTAACTGGCACCCGCATACCGTGCCGTTCAAGCTGATAGGACC ATTG CAACGT GCA CAATGTGCT CAGGATGAGG CCGGGGAG GAGACTGGAAAGGATCGG CTGTGCTCGTGAAAGTG ACAAGC TGTCATTTACTTT(N)xGACTGGGGAGGTATTTTGTTTAGAAGAGATAAACTGTTCAGCCAGTTGACTT TGC
> H s l2 _ 113063 11 - 11335781
CCAGCAGGCCCCAAATGACAAGATACAAAATGTGATACAGGACAAAGTAAAATTCAGACCAAAAGA(N)xACCCC CAAAAAAATGGCAAACAAGAGAGCGAAACAAGTGGTTTGTGTCCAGAGCAAAATAGTGGAATGTTTTTTCTACTT TAATAT TGAAATAAAGAAG(N)xGAGTTTTGAAAAATAGTATATAGTATTCAACATACTCAATACTGGAGACAAA TTTGTTAGAATAAAGTCACTCAAATAAATCTTATGATACATCAGCCAACTTTGGCAGTTTACGGATACAATCAAA CAATGC CAAAATGTTTTTTATC AATAATTCTTTTAATTCT CC CATC AGTAGCAT ATACTC CAATT CAACTGAAAT ACATTAAAAGAGAATCAAGTATTGTA{ N ) xGTATACAGTTGTAGTTAGTTTCATGGTGCCATATTTCTTGCAGTG TGTCACAT CTTCTG CTTAATTGTAGAGT TTTAATAGAAAAAT CCTT CATACATGTGG G CACAAATTTTATATTAA CATAAAAACATTTT CATT CCTATCAAA CAGTTCTGCCAATCC CAGG CAGTAACACTCGAT CTGTAATTTTGATTG AAGGGCTGGATTCAACTTTGATTACATG CAAATAG GAGAAAAAGTAACTTGCACTGATTTGCAATTTT CATT CTG CCCTAAAG CTTGGGTCACTTGGAT CCAAATGC CAA CTTT C CTAAAT CTCTGCT CAAAAAAGAGAG GTGGAAAACT TTC CAG TATAATT CATGGGGGTGCTTAGGTGTAGCT TTTCTCACTTGAAAACCAGAGAAC CAGAGAAG GTACAGA AACCTGGAAAAATGATTATAAGTCTTTTCTAGAACATTAGAAACCAATTAGGTCAGTTGTTAAAAATCAAAACTA GAATTTTATGTGACATATAGAAAGAAAATTTCCTGACTTTAACACAGGAAGTTGTGAAGTTGTAAATACAATTTT CAAGATGATT CGTTTATTGCAGAAATTGATTT GAG(N)xAGTATATGCATTGAATTTGATACTCAGACTCT(N)X ACTCCTTTGAAAC(N)xTTTTCAGTCATTATTTTGGTGGTAGAAAGAAACTTTCACTCTATTTCTGTAAGTTATT TCTATGGATAAATAAAATACTCATAATACCAAATTCTTAAAGATTTTTTAAAAACTTTTAGTAAACTTGACAAGA AAATAC CAATAATT CTCT TCTTTTACCCACACTTTGGTCAAAGAAAACA C CAA(N)xCCACACTTTGGTCAAGAA TAGAATAAAAGTGTGAGT CTAATCATTCTTAACCAAGTTGAAAAGAACTAAAAATAAATT TGAT GCAAGTAG CAG CTTCTCTGTTGCTGCATCTTCACCACAGACTCTTATTACATGAAAATTCAAGGAGCAGAACTTGATCTCAAAATG CCATTACCATAAATAAAAGATAATACAT CAAT CACAGCAACACCAT CTTATAACATGG CTTAATATTTAAGTAGT CCTTTTACTCTCCTTACACTGT TAGACATGGGGCTTTGTG CCTTA CAAGT CCAT CATCAAGAGAGTATGATATTG AAGATGTGAGGGCTTATTTCTGTTTGGTTCATTGACTTTTAAAGCCTCTCTTTTGAAATAAGAGTTTAATTATAA AATGGTTTATTGA CTTTATCTCTC TTAT TATACAAATGCTATTTA CACTAAAAACTAAATATTT CTGGTAGT TAG CCTCTGATGAGTTTAAGATTATTTATAAATCATTTGTTCAAGCAATGCATTTTAAGTAAAAATACACTCAACTTC TAACTCTAAATGTATTAAGTTTCACATAAGCCTTGCTCTTTAATAATGCAACTTACCCACAATAATAATCTTTAG GAAATTACCTAGACATTCTCCAAATCGTGCATGTAAACATTTAAAAAACAAATAGTTAAATGAAATATATGATGA TGATAGTGGTGGTGGTG GTGAT GATGATATTAATAG CTAAATGAGAGAA C CCTACCTGATAAAT GTGATAGATGT GACCTCACTCTAATTTC CTGAATG GAAGAGTT CATTTTGT CTGGAGAAAACAA CAATT CTTCCCTATGGCCTTAT CATTCTTACACCATTCTCTGCATCAGGGCTTAACCTGGAAGAATATTTTCACCTACTACTTCATGAAGACTAATA TAAAAC CAA CATACTTCC CATGAT TTAC CAGCTATTTTTT CC CTTTATTC CATTTTTAAAATTATGGTAAAAGTT ACCTCAAACAACAGTGCTGGGTTTTTTTTCTTTCATATGGACATATCCATTTCCTCTATTAAAATAATACATGAT CTTCATTTCATATTTTAGTAACATCATTTGCAAGCAAAATTCTCTTAACTGCTTCATTACATTCCCAGCTACTGT GATTACATCTGGGTTATTTTATTACAGTAGTCAAAGTAGCACTACACATCCACAAACATATCCAAGCATTACACT GATATATATTATGCTTTAA CAAACTTACTTAATTATACTTAACATTTTATAATTTTTT CC CTA CTGTATTTATAA TTTTTTGTTCAAAAAATTACATTACACTTTTTTCTGTGATTTTATATATATTTATTCATATTGCCTTGTATATCA TATGTG CATTTATTTATATT C CATTATAAATG TTTATCTTTTTCTC CAAGTC TCACATA CTCTATTGTTATTAGC AGGCATTATTATAGTAGAGACATTTC CTTGAATCAGACATTATGT C CGGCAACTTTGAGAAAATATGGTTAGTGA ATATTTTAAGGAAAATTTCAGAATTATTTAACAACACTTAAAAGGACTCCAAAAAAGAAATTGTTAAATATAAAA TTAACT TATGAGGT CTTCTATTAAGT TATTTCTTTCATAGATGCAGAAAGTTAAATTTTGTCTATGAAGT TAAAG GT TTTG CTTTTT TCAGAGAGAATTTAAGATGC CACAGTAGT CTCAAGGAGCT CTGTCTTAGCTT GTTGTTTCCCA AAATTATAATAAATGAGTGG CC TGAGG GAAAG GCAAGTAAAATCAT CATAATAAACGT CATGAC CTGATAATTCT CTACCTCAAGGAAGATCCAATCTCCTATTAAAGTGGAAATAAAGTTAATAAAAAAAAGGAGGAGGAAGGCTATCA CCATTTTCATGGCCCT TTTATGGGCCTGTGTG CTGGAGTC CCTTGAGCCCAGAGAGTTGAGCTG CAAATT CTTGG TGTGTCTCACTAAG GATATAAATAAAAGGAGCAAAGAGGT CAGAGT CAGAAGAAAGGGAATAACGTATGT CAAGC TGAGAAGCCTAAGGCCTTGAAGACAGACTATTTTACTTGTATCTGAGTGCAAAGTCATGTTTTTTTCACAACTCA CTAATGGCATTGGACATAAAAAGGTAAACAAACGATAAGAACAAAGACCGCAGGAAAAGCACAAGAACCACTCTG TT CATT CT CCACTT CAGCCAG G CAAAAAATAAGCGGGAGAAATTGG CTATCT TAAGCAAGTAGAAAATGCTTAGG CAGGTGGCAAAC TAG ATACTTAACTGATTAAT TAATGC C CAAAGAATAGTAAACAGTTTTACTAGTTTACGGATG GTATATAGATGAGGGGACAATACCATTATAAATGAATCAAATAGTATTAAATACAGTCGAATGATTCTGACTATA GC CAAG CTG GTGAGGATGAAAT CAGCTGATGAGACCTT C CAACT CTTGGCC CACTCAATG CAGTTTACCAGT CCA ATGAGC C CGTTT CC CAGCATTC CTAAGACAAGT CCTCTTGTTGC CACCACCAGAAAGAAAATAT TAATTC CAACG AACATTT C TATGAAAATATTT C CGATATTCTACTTCACTGA CAG CTTTATAGT CAAA CAGTTGCAGATGGG CATG CATTTATGATGCTTTCTATCTATGTTTT CAT CACAATTT CAGAAGG CATAGC CAAATT CAGATATATGTT CAGAG AT CTTCATGAAAAAAATAGT TT GTTCTATTTATATTGTAACTCCGGTACTAAAC CAAATTGTTAAGAGATACAAA GCTTCATGAATAC CTTTCTTAC CACTTT CAGT CAGCTACTATATAATTTCTAGAACAAACACTAATGATGTCTAT TTATCTTCAGCATCAG CCACAATTTT CCATTCATGGTAATGACTAAAGAAGC GAAGTCGAGTATGTAAA CATGCA AAAATGAGAGAT CATT TTTCAC TAATTTGCAT TTTCATCT CCCC TGAGCCATAC CACTTGAATACAAATAT C TTA AATTTTGAATAGAAATTGTTAGAAATAACATTTATTTCAGTAATTATTTGTCTCTCATTCAGTAACATCCTTCTA CATACCAAGCATTTTTGTTGATGCAATATGATTTATAGAAATAAGTCCCTAAAATTATTCATAATTCAAGAATTT CACCCAAGTCCTAAAACTATTAATATAAATAATT TGTATG CAATGTTACAG GAG CTñAAAAGAATAAACAAAATñ TA CCAGAATTAAAGAC GAGGAAGATACAAATG CTTCAGTGAAGAATTGACTATAGTTCTCTGAGGTTAGGGACAT TCAGTTTGAACTGGAATT( K ) xCTGGAATTCTTTAGAATTTATGAAAAAGAAGAAAAGAACTAATATTCAAGACA GGTTTAAGGGCACTTTCAAACTACAAAAGTGACATAAACAAGCTATACCATGCATCTATAGGCAGGAACTTTTTT GTTTCACATTTGTATGGCTATTTTAAGCAAAA TG CACT CT CCAACTTCCATGTC CTAAAT TTCATGTTCTA(N) x GAAATTGGAACTGAAAGTGGGAGAATTTGTTTAGAGGATTT(M)xTATGGTGATTTAAAAATAGGCATGAAGTC( N)xCAGTGTCTGAGTTCTTGCCACTGGAGCTTGAGAAGTGATGTGTGAAAACTCTATGTCCCCAACAAATACAGC CT CACC CCAAACACACACATAAATACACATGCA CACTTGTAATGAAGAATAC CC CCTGGG CAAC CATGGAAAAAT GTGTTAATATTTGCGAAGACTTCTTTAGCCTGAGTCAATCAATAATTGCAGAGAGAAGGG( N) xTGGTATTGTCA AAAGCTGACTTTAAAAAATGGTTACGTATA( N ) xTCCATATCTTAAATTTTTAAAAGCAATTAAAAAAGGAAAAA AAATTGGTTACCTATAGGAAAGGAAACACAGAGCATATGACAGGAAAGAAGCTAGGCTTCTCTGAATACATTTTG TT TTGAAG TTTTAC CC CTGGTACCATGAACAT TTATTAAATAAT CACTAAAAATTAAAATAAAGGAATATTAAT( N)xTGAACCTGTCTGATGGATGATTTAACTAAAAGAGGAATTGTTTCAAATGATCCTACAACATAGTTTTTTTAT CT CCAGAAGGATATATGCTAAGATAAAT CTGAAAGTGTTT TAATAATTAAG CTGTTAGAATAAT CTAAT CTGAGT ACTGTTAGTTTGAAA CTATTATATGCGCTGATGCACACAATTAATTATGTGAAT GTTGTGTGGT TACGGAC CAAG ATTTTCAG CATGAGAAAAAAAGAAGTAAGAATATAAAATCTGAATTGGAAATAT CAACAT GAACTTATTTTTAAA AT CTAT TCTGGTAGTTTAAG CACAAAAGACATTTATTAAAGTGTAT CAGAGAAT CATTA CAAAGTCTGCAGAAGC AGATTCAAAGCTAAGCATCTAGATACAATGTGTGTATCAGATTTTTTACTCTACCATATACCATCCCCAAGTTTC TAAAAT CT CACATTTG CCA CAGG GACATAAAAAAGAAG CT GGAATAA CATTT CC CAGACC CCAT GGTCAATTAAA TT CCCC CTTAAAAAAAGACACT T T G (N ) xGTAGCCTGCATAGAGGATGCAAGTAGCTGACACAAATGTTATTGCC TTCATGTACAACCCCTACAATTCTGGATTCCTGAAAGTAACCCTCCAGAATTAGTCTCACTTCCCCCAAAGCCTT CTAATGTTTGTATAGTGTCTAACTCCCTGATTAAACTTCTTCTAACTCAGAACATCTAAAATGTCTTGTTTTCTA CCTAATGTGGTCTGAGGCTTCTCAAAAAAAAAAATGCTTC <N )xTTCCTCCATCTCTTCTCTCC(N )xTATTTTC TCTTfiACAATGC CTGGGTTTTTGTATGTGTGTATACATATTTACAGTAAATC CTGAGATAAATATTACCC CT CAA AAATGGGCTGCCTT CT TATT CTGTCAGACTACTAAGTG TG TGAAATTGTATCAATCCAAT CTGTAGTTTATCTGT ATTT(N)XACAACAAGAAAAACTGCCTAAAATCAGTGAAGGATCCTGGCTGAAAGCCGGTTTTCTCCCCCTCCTC CAGTAGCAACAGATGGCTTTCACCCAGTGTCAGCATGGGAGTCCAAGTCAGGCAATCTTCCTGCCCCTCCTTGAG TGTCAGTCAATCAATGTCTTGTATCAGTGGAGGGTCTTGATATCATCAGGTTAACTGCCCCCTTCCTGCAGCAAA TGGGTTTTTCCTGGTGTTCATGCAGAGTCCAGGATTGGTGGGTTTTCTGCCTTCCTTCACTGGCAGATCCTTTCT GT TTAG C CAGCATG CAGCC CAGAGCAAGTGAGTTTCCTGGACTT CC CCTGTGACAACAGACTTCTAATATGTATT AATACAGACCTAGGAGTGCAGGCAGGTTTCTTACCCCACTCCTTGTGCCAATCAGCTATTGCCCAATATCAGAGT AGGGTCCAGGTTACAGTGTATTCCCTGCCCCTCACTCAGCAGCAGGCAGATTTTGCATAATAAAGATCCAGAACA TGTGTAGTTTTCTTGCCTGTCCCCTAGCACTGGCCAACAACCACCTTTTACTCATGCAGGGTCCAGTGTAAATGG GCTTCTCCAGTTTCTATTACTCCACTTTAAAACTTAAGAAGATTTTGTTCCCATT(N)xCTCAAGCTGGTTTTCA a a a t g c a t g c a t a a a t g c a t g c t a t a t c a t a t t c t t c c a c a t g c a t g g t t t a a a a a a t g t c t c a a t t a a a a a a t a GATTCATGTATAGGTGATTTTTTATGCACCTATGAAATCTGGCCTT3TAATGAAGAATACAATAGATCTTAATAA TCTGACAATTATTAAATGTGTTTCATTGTCTAAAAGAGAGGTTTGAGGGGAGAAAAAGAAAAGTCTTTCTTGAAA
TñTG GAAATAAGAGTATATATTTAGCTCTGAGTATGTT CTGGTAAGAATTA CAATGATGT CAAACTATAT GTA CA TATTAAAGCCATGGC(N)xAATTAAAGAGTTTTTATGTGGGCTTCTACTTGATTTTATTTTTTCCTAATACAAAA AAGTTTTCGT CAAGGCTTTCAAGGAG CCAAGG TTTC CCATCATAT CAAGAAT CTAATCTT CCACAGCATTATCAG TGAGGTCTGACTCAGCTTGCTCTTCCCAAGAATCAGAATCAGAGTGATCAGAAAGGCAGAAGGCTCCTACTATCT GGAAAAGCATCATGCCCAGCAGGCTGTTTACCATAAAGTGAATCAAATCTGCTATAAAAGCTGAAAAACAGTACA TGATAAAAAGAAAATAAGAAAGATTACGACTC CAAAAT CATTTATT TC CCAATGGAATGTTAATAAACCAAGGAC T C CATCAGAATCATAATATT CACATAATTAAT T CTACACTTTATCCAAAGGAAAAATG GC TGAGAAAAAT TAG CT ATTGTGAACCACCCCCACCTCCAAAAATTATCAGGCAGATAGCAAACCAAAGGGTTGAGTACTGATCAGTGTCCA AAAG GAAACGTACTTTTTTT CTAGTTTGGCATTAG CACAAGTTAATGGTGTG GTGTGAACATAAGTATAAAAATA TTAAGCAG TATTAAACACAG CTGTTAAATT CTGGAG CTGGCTAACCTGATGAAAAGTCAGTTAAAGGAAG CTTC C AATGCCCAAGCCAGTCAATGCAGTTAATCTGTACAATCAAGCCATTTCCAATCATGAATTTTCTAATTTGTACAA TCAAGCCATTTCCAAT CATGAATACT CTAATCATGAATTCTC CTAATATCAC CCTCAG CAAAAGAAGTGT GGATT AGGCATTCAAATCTAAACATCAAGGGGCAGATAAAAACTACAACCACCTAAATTATTGCTCCTATCCCACTCTCA GGTG GAAAGAAGGAAAT CTTTCTT CAGCTC TAGAGCATTAAGGAGTAGGATTTGGAGCTTTGTGACTGTT CTTTG CCTATTCCCGTCCCATTGGAATAGGCCCTCAATCTGGCCATGTTATCAGTGGGTGGCTGCTATTTTCCCCACGGT CT CAGGGTGGAAGATAAGGTTTT CATG GAGTTACC CAGAAACG GAG CAAGTAGGGCGTAAAGGGTGGCACAAAAG AGGAATAGAAGACAGAATGAGTCCACATAGCTTTACATATTTGGAAATGAAAAATCCAGGACGTAATGACTCCTG TGACTCTGAATGGCTCTGTGTATTAGTCTTTCAGTTTTCAAAGGGAGTAAATTATATGTGTGAACTGAAACTATA TATCTTCTCCCAGTTCTCAGAATTGATCATTTATTTTACATCGCTCTTCAGTCTCTATACCAGTATTTTAGAAGT TGTCATAAGTAATT CCTCTAAGTG CTACCT CG CTGGAG TGGTACTGGAGCAG CAAAGTAG CAAAAGAATACTAG C TGGGAACACAAAATTAATTTTTACAAATTTAT CAGT GG CACGTTATTT CTAC CAGTCATATAAT CAGTAGACTAT ATAGAñACAGAGGTACTCTAG GGTAC CGTACTTGTG CT CTGAATAG CTGCTAGACATG CAGTGCACGGTG CGATG TGTGGTATCTCAACAATGCTTTTCAGAACAAGACTGTAACCTTAAGGTCATGATCAGAAATCACACATTTGAATA AT CCATGAAT GAAGACGGTGTGTG CGATTAAAAAATACAGTG CTTT CTAGTTAAAATG GGAGATAAGTGAAGGTA ATG AAATGTG CACC ATTT AAATAACAAAAGTATTGTGG CTATTAGAGATTCACTGTGCTGTT ACTCTAATG GTAT TAAGAGTTTGGAAGTACAGG CCGGGC GCTGTG GCT CAC( N) xGAGTTTGGAAGTACAGAGTAGTATTCCAGCTCT TGGAGCCTAGCCACAGGTTTCACAGATGCATCTCCATGCAGAAATGGCTGAAGCTTCCACAGCTAATCTGAACTT TTTAAGGGTTTACATGGCTGGGAATATAATTTTGTATGTATGTCCTTATGTATATACGCGTGTGTGTCCACTAGT TTTTTGCTTGAATGTTTATCATTCCTGTCACTTGCAATTTAAATATTCCTAATAAATAGAAAAATCCAAAGACTA AGATTGTGATCCTTTTTATGTCTATGGTATATTATGAATACAGAGACCTTGTGCTTTTATTGGCGTAGGAATCAT CTTGCCCTAATATTAATATTATGAAATCAAAAGGGTAAGATTCCTAACATCCCACATAACACAG( N) xAACTATC TGCATGGATGTTCGTACATGCTATATAACACATCCTATTCCAGAGCCACCTACCATCAGTTGAGCTCTAAATGGT AAATGCCACAGTGAGAAGCAGACAGGAAGGGGCGGAGCGCCAGCGGCGCCCGGGGCTACGCGCCGCACTGCACCG AG CG GCGG CAGCGG CAAGCT TGGGTGTGAGCCCGGGAGCCGCTTTGCTTACCGTCCTGCCGGTCCCAGCCGTCGC TAGGAGGTCCGCGGGCCCTGCGGCAACCCTCGCTACAGACGCTGGGCGGGCGGCGACACCTGGCTCATGGCCCCC GCGGCGGCTCCGTCCTCCTTGGCCGTCAGGGCCTCAAGCCCCGCCGCGACACCCACCTCGTACGGCGTCTTCTGC AAGGGGCTCTCCCGCACCCTGCTCGCCTTCTTCGAGCTGGCCTGGCAGCTGCGCATGAACTTCCCGTACTTCTAC GTCGCGGGCTCGGTGATCCTCAACATCCGATTGCAGGTACATATTTAGAGCCATGACTAAGCTAACGGCCTCCGG GGCCAGCATGATGGCCGACTCCCAGGGTCCGTTGCGGCGCGGCGGAGCAGCCAATGGCGAGCCCCACAGTCTCGC GAGAGTGCTCAGGCGCTCTTCGTGGCTGCCCTCTTAGCTGCTAGCGGAGCTCCTCAGGGGGCGGCCGGGAGCCTA CAATCCCTAGAAAGAGAATACGCTGTTCCGGAAACAGAACTGCAGTTAAGACCCTCGAAAACATCTAAGAAAGTG TG CATCCTAAAACACCTGACGAATTT CAGAATGTGACAAAG CGCAGAGGATG CATTATTT CAAAACAAAACAGAA GG CTAAAATTTGCAGGAAAAAGAAAATCAGTAAACC GGGAAT CCTCGGACTGGATTGTAAGCAA GATTTCAATGA ATAAGAAG CT GAAGGTATTAAGGC TG TGATATAGAAGGTACATATTTCATCC CACAAGAGAAAACAATAATAAT C AGAAATTTTCGGTGAAAAAAACGCAAAACTGTACAGGAAAATCATCCTCCAAGTACCAGACATAAAATGCTGCAA GCTTTTGAACTAATGG CGAGAGTGTAAGAAAATGGG CT CTACTTCAGT GATC CTGTGG CAGGACGTGGAT CAAGA CT TGGAAC CG CAGAAAA CGAAAT C CCATAGTAG CACAAAGCTTGG CTGTTCAGTGAATAACATT TAAATAATCGT AAAATACAAATGTTGT TTATGGTTTTTATTGT TTAAGGGCATACTTAATTATGGTTACAAAGTGGAGTG CAAATG TTATTTACCATGTTTTAAAAATACAGCCGGAAAATACAAGTGGGAATGTTGAAGGAGGGCGGGGGAAGTAAATGG AATG GGG GTTATGT CCTTATAAAGTG GAAACTTGAAAGGTACTGTCTGTTGTTGAGTG GGGAAAGACATTTTTAT TACTTACAGGGTAGCCATAAAGTTTCTAAAACGGTAATATATTAAAGAGGGAGAGTGGTAGGGGAGAACAGTATG AAGTCAACAGGAAATGGCTAAAGATGGAGAGCTCAGGTAGAATAGTTT( M} xATACAGCCTAGAGAGCTTTCATT TTCCAAAGAGTGTGGGGATAAATTAGTGAATGCATATTTTTCAAAATTAAGAAAACAGAACATATAATTTTGAAC GCTGAGAAAAATAAAAATTTAAAAACTACAC(N)xTTCTTCGGATTAAAAAAACTAATAATAAATCACCAAGAGT GGTAAAGT TTTAAATCAAGATCTGATATAGAAAAGT GCAGAAAAAGTAGTATACG CATTTTTTTTTT CAGGAAAA AATTAACTAATAAACCACTAAGAGGGGTAAAGTGTTAAAATAGTTG CATTGTTCTATATT CCACTCATTGTGTCC AGCAAACAGTGTTCTATGGCTTTAAGTAGTAAGCATCTTGCCCCGTCCAACACCATGTCTGGCCTAGGGTAATTG TTCTTTCTAGTCTTTATTCTTTGTCTCCAGCCAAACTCCATTTGGGCACATTCTCCTATAGCCTTTTCTACCTGA AGATACT C TGTGTTTTAAAG CTAAGCTTAG GTGGCG CTTTTT CCATTAAATTTTT CCTGGATTC CACTGACCATA TAGGAGCTCACTTCTTTTAATCCATAAGGCCATTTTCATAGGTTGCCTTATTTTTCCCTAATCGTGCATCAACTG TCTGTTTTATATACCCAAGACAGGTTTCCTAGACTGCGATAAGCCAAAACATTTTAGTCTAAAATATCAGAAGTG TAGTTTAATCAATGAAATAGTAATACCAAGGGATTTAGAATCGTGGACATCACTGTTTCCCAGAGCACTGATGTC CCAATTTGTAACACAAAAGACTGTCTAGTCTTAATCCTGAAATGGTGACAGAGTAGGATGCTCCATTTGGGTGAC TATGTGAACATATT CCTATAACTTTTTT CGTCA CAT CAGTCATT TGTTAAA C CGAAGATGAAGAAACAGACTTTG TATTT ATT A CA(N)xGGTAGTAACAGTATGTGTGTGTACTG(N)xGTTACTACTTACAATATATGCATAAAGTTA CAGCTTACATTTACCTCGTAATATATGTATATATATATATTAGGAGTTAAATGTATTAGGAGGTAAATGTAAACT CTGTG GTTTATCATTT ATTC AC AATTGCTCTT AATT CTTGGGGT AT AGTTTG CCTC CT AC ATGATG CAATTCAGT GAAATT CC CAGGAGACAAAACAAGATTTTGACCTGAATAAAC CACTTñTTGTTGTAATTCACAAGTGAGT TTATT CTTCCTGGCACACTTCCAG(N) xGAATGGCAATTTTGTTGGTATGCTCTACATAGTCAGCAGTCAATAAAGGAAA CATATATTCGTTCCTGATAATAAGATGATTCACTTTATTGCCCATAAATTGGGCTTTTTCAGGCCATCATGCACG TTGATTGATCCAGAGATCCCTCAAGCATGAAAAAGCAGAGACATAGGTGGTATAGTTATTATCAATAGAGATAAG AAATGTAGCTGGCAAAAAAAAAAATGGGAAACGATGCGTTACAGCTTATTTTAACTATAAAAGGAAAACAGTGAA CAT AGC AA C ATATGTAAGCT AAGGGATAGGGG AAAAGTTGTTTT TTTT TTTTTAATTCTGGGTAAAAAT ACATGT GATTAC TTGACAAATAGCATAT CATTAC CATCAG CAT CATTTTTAAA(N) xTGATAACCAACCAGACCTCAAATG AATCTCCAGTAATTCCTAAAGTTATATTTCATAAGGAGTTCAATTTTCTTAAATGTGTAGATAATTACAAATAAT TCTGTTTCTCAAATTTCTGGACAGAAATATTACCTGGGTAATAGTTATTTTTTTTAACATAACTATTGAGGGTCT TTCATACTAAAATATAAATATTTAACATGATTTGATATGTAAGTTCAAGTATTTTATACTCTCTGCAGCCTTAAA CCT ATTGG CTT ATTTT ATTAGAGGGTTGGTGGGATTGAGGGATATGGG AG CAAAGG GAG AGAG AGG AAAG AACGT TGTCCTCAGAATGGAAGGGAAACACTTTCCTGTGCTACCCAGAGTAGAGATCAGACAATGTTAAATTTTGAAGCC ATGTCTGTTCCTGTCCTGAGGATCCCTAACTAAATTTTGTTAGAAAATCTTGCAGTTTGCTTTTAATGGGAGATC TTTCTGGC AGGACAT C AATAGCTTTTGATAGCGT CAAG AAG AAAAT AAAGTT ACATGC AG AAGC AT AAAGGG GT A GG ATGAAAGTTGAG GAATAT AGGG A CAAAAGATG AGGG AAAG GAAT AT CCTAATAC TTGG AAGTTTGAGC ATGG A AAGTAG TCTAAAGACGACTT GG GAAGTT CCAAGAAAATGAAC CAAGAAAAGAACCT CTGCAAGAACTATGAGAAT TCC CAACAATTGGGTTGGATTGATTCATATTAGT CTATTACTTG GGTTAC CTTGAGTGTGGGGATT CTTCTTGCA GTTTTAACACTTG(N)xGACTTAATAGGATTTGCTCAGCAGGATTACCACCAGGTCAAGTCATGAGATCTTTTCA AAATATAAGTAAACCTCCCCTTACCCCTACTCTCCACAAATACAAATTTCAGCAGTGCTCTTGGGACAACTAAAA GGCCCAAGGACCTTCCCAAAAGCCTACTACAAATGAAGAAACTATGGAAGTCTTAGTAGCTTAATTGCAGGACAT CCCTGCCAGTTTCTATCCTGATTAGGGATCTGCAATGAACATTACAAAATCTTCACCATCTCCATGGCTTTCAAC ACCAAG CATATGGA CTTAGC CACTGTAAAATAGTGG CATGCATCAATGAACAATAAAGAGAAAT GGAGAT CATTC ATATTATTAGTGAGGGAAGTTGGGTACAACTTTTCATCAGCCAAAAAGAGGGAGAAAAAAATCACACCACCAAAT AATGAAAT ACTGTAAC TAAG AT CATTGG CAGAGAATTAGAGATT ATGT TC AG CAGATT AAAAAAAAAAAG AAAAC
( N) xGC CAAAAATACG AGATGCTT AAAAAAAG AACTGTTGTT CAAG AGAT CATTTT CACTTTTT AATTGGTTTTT GAAAACTGTGAAAGTATCTCAAACTGAAGTCTAAAAGGAAAATAATAGAAGCTTCAAAGTTCAGGGGCCAGGTTA T ATCT ATATAAAAGGG AAGCTAACTTAT TT CC AG AACTGATT CATTTA TT AAACATGGGGCTAAGG AATAGAGGT GGCAATATACAAGTCAGGATAATGCCTTTGTCTTCTGAAGCTGCATCTACATTGGTGCCCCTCAAAATAGCTAAA ATTGAAAATAGTTTATCTGTCACCTGGGATTTCTTCTACAAATAACCTTCCCAAAAAAATATTGCCTAACATTTT T CTGAATT TATCAAGC CAAC CATCTGAT CAGTTT TC TAAATGAAGATT TATT CTTG GCAATAAT CTAGCT CAGGG CCTAGGTCTTGTACACAGCATTAAGCAATGTAACAGATGAGTGCACTGGAAGTATGGATTTGCAGATGGCCTCCT TGTTTCTGTATTTTAATTAAGGAAGTCTCTAGCTCAAAGGACATGAGATTAAGGGCAGCAGCAAACCATGTGAAC TATCAGAGATTCAACAGAGACAATCACGCCCTCTGAAATATGTATTGATCCATGCATATTTTCACTAGGATGTTT TCTTTGAGTAAAGCTAAGGACAACATGGCTTGCGCACCCAAAATATCTTCTCATCTTAAGTTCTAGGCCTAAGGC AGGGCTCAGGCATCAGCCGTTTTTGGCATTAGAAACAGGTTATTCAGGTTATTGGCAACTCTTCTATAAATCAAC TGTGTGAC CAAACTGGGTTGGGGGGAGGGCAGGGGCGGGAAACT CAAAAACT CTCATGACATCAAGA CTATAAAC TTACTTTAGGGGAAAGCAATTGGCAAAAAAAAAAAAAAAAAAATGTCGATCATTTTCTTTTCCCAATTATTTGTT CCCAGGATAGGATACAGTTCTCTCCAGATTGCCTGAAACTAGGTTTGGATGTAAGAGTCGTGGTTTTCTTGGATC GTGGCTTGCATCTCCAACCCAAGCTTTCATTATGGAAAATGATGGTGCAGAGCTTAATCTCAAAATGCCATTGTA ATAGGGAGACTGCAATACAGTGGTAGCAACAGTCCCACACTTTCTCCCAGCCTTCATCATTTAATAGTCATTTTC TGTGCTTTTATTATCTGGAATAGGTCTCTGCTCCATATGGCTTCATCTCTGCAAGGATAGTACGTGAGATGTGAA GACTTAAT C CTAATGAGACATT TACTTC TAAGTC CCACT CACTAGAAG CAAGGGTTGT GTATGGTTGTTTAAATC TTTTGAAT CTTGAT CATTAAAGAAAAGT TTAGT(N) xTTTAACTTATTTACTGAAAGGAGTAATTCCAAACCATA TTATCCTCACTATCTGAACCTAAATATAGCCAGTGTGATTTTAATGTTGCCCTTTCTAAAGCAAGATAATGCCCA TGATAATCATTAGGAAATTTATTATACAGTATCTAACAATGAATGGAGTTTTAAAAATCTAAATGTCTGTATAGG CCTGGACTGGCCTTAAGATTATATATTATATACATTTTCAGTTACTTTATTTTTTTGTAAAAGTCAACCTAGAAC TATACACAAATACAAA CATG CCTAAAAT TTATG CAAATATGTGACATT CT CTTATAAATTTTCCTTTATG TT CTT ATTCTTAGTGAGTTAGTTGGTT GTGTTAGTTTTCTCTGGTGT GACACACTAGTATT CATGGTAT GACAAAAATAT GTCATTACACATGTGCACACACACAGAGATCCTGTGTCAAAGTGTGGTTTTGATTCAAAAAAGTAAAAATATAAG CAT CATTATGGACATAGGGATAAGTATT CAACTT CT TTTTAGAAATTCAATT GATT TTGCACTTAT C CTACAAAG GT AGGATAGCTTGAATT CCT AG CTGGAGTC CT AATG AG AGAATG CTATTTTG TTTT AAAAGTTACCT CTATTGCT ATGAGATTATATGCTTTCAATATTTTTAGTTCTTAATGTATTTGACTTAGTGTTATAAAAATATGCCACCTACTC CGGATTGGAGTAGGTATTAGGTTCTAATGGCATTTAGAACCATTTCCAAATATTACTGTTTTTTTCCTCACATGT GAATGT AATC ACAC AT TTGC CAGC ACAT ATAAAATAAATACATGTTGCAT TATATATT TT ACAT TT ATATTAAGT CTCAAATATTTCCTAGTTACTAGCAAATATTAAAATGACAATGATACTTACTTTTTTAAAAAAATCCGGTATTAC ATTGGCTG A CTTTATG AGTAAAT CGTTC ATTT AT AGG CCATT CT AGG G AAATTTTGTG ATGCTT AATTA C AC CTT AAAATAATCTGGGTGAAAAACACATGCTAAAATACATCTTCAATGTGTTCTCATTAATTCCCCTGGAATACAGAA AAAT CCAGAAAAGT CTA CAAAACT AAAGTTTC A CTCTTTTTAGGTGGC ACTT AAGATG CCACAGTGGT CC CACAG CAGT CTGT CT CAGCATGCAGTTTT CCAAAATTAGGACAAATGAGTG GCTTGAAGGAAAAATAAT TG A C GTTAA C A AGACAAAAAAATTG GC CTGCTATTTCTGCTGTATAAGGAAAATC CAAC CTGTTGGT( N ) xCACCATCATTTTCAT t t t c a t t t t c a t g g c c c t c t t a t g g g c c t t c a t t c t g a g g t c c c t c g g g c c c a t g g t g t t g a g c t g c a a a t t c c t GATATGTCTCATCAAGGACAGAATTAAAAGGAGCAGTGAGATCAGGGACAGAAGAAAGGAAATTAAGTAGATCAA A CTGACAAGAATC CAGAGGCTACAATATAGAATTTTACTTACAT CTAAAGAC CAAGTTGAGTTT CTTT CATACAT TGTGTAGATGTTAATC CACAAGTCACTAAGTC CACCTGTTAATG CAAGGTTGAAAAACAGTAAGAG CAAGGACC C CAGT GGGAGTTCAAGTAGCGTT CTGCTAAT TCTCCATC TCAG CCAGATGAAG CATGGGTGGGAAAAATAGGCTAT TT TAAAGAAG TAGAAAACA CTAAG GC AGCAAG CC AGGT AACT AAGTGATTGGTC AGTG CC CAAAAAAT ATGAAT A AATTTTATTAGTTTATTGAAGG CATATAGATGTGGC CATAAT GC TGTTACAAATGAAT CAAGTAGTATTAT CCAA AGTT GACTGATTGT GGAGATAGCCAGG CTGGTGAGGATG CAGTTGT CTAATAAGATCATTTGACTC CTCACCAGT CAATGCAG TTAACTAGTACAATGAAC CCATTC CCTAACATTCAGATTA TGAATT CTCCCATCATTACTAT CAGAA AGGTATTT CCAATT CCAGGTGG CATTTCTGGAGAAACAAATC CAGAAAGCTAATACTATGTTTCACTGATACTTT TGCTGTCAAAACGTTGCAGAATAATATCCAACTTGATGATTTTCAGATTTñTATTTAAAATGTTCAGAGTAACTC AGAATGAT CTTCTGTT TGATGCCACCTCACTTATCT TCACAAAAGATC TTTC TGTTTTTT TTATAT TCAT CTCAC TAGTAGG C CTGAAATATGTG CAAGAAGATC CAGTCT CACATATC TTAATTTAAAAAAAAC TTTAA CAT TATTTTT AATCAGCTAC TGAC TTA CAAATTAAATAGTAAATGTAGTAAATGTG CCCACTCATCATTGGCACTGAGAGCAATT TCCCACTTGTGAGGACATCTAGAAGGGCGAAGTCAGGTATGCAAATATCAAACAGATTTCTTTACACTGATTTGT AATTTTCT CCTCAACT GAAAGT CAATACCATTTGGATT TGAATATC TTAATTTTA CAAGAATTAT C TAAAAATAA CCTCTACAATATATGTATTACTCATTAGTTTCATTAAATAAACTCCATAATATTTCCAATTCTACTTTAGATGAA TTAAGTGTAATAAAAAGAAG CAT CAAAAAG CT CACAGAGTAGTGAAAAGAAG CAACATTAAATGTG CTTTTACTA CTGAATGTAAC CACAGTTTGATCAAAATGCAATTACAACACAGAGGAGGGTGTTATAAAATCTATGGCCAATAAC TAATTCTAGTTTCATGAAGATAATGACAGTTGGCTTGGGTCTTGAAGCTTTTAGGATTTCTTAAAGAAATAGACA AAAAGGGAGATGATTTTACATAATTATGTAAAAGCATATGTAAAGAAACAAAAAGTATTCCTTTATCTGAGCTAT TT T ACTG AACTTTTGAGCTTGGTTTC CAATGC CTTC CTGGCT ATTTGATTTT G AAT AAGCTCTT AT CAAC TG AAA TGAAAAAAAAATAGTAAAAAAGGATG GAAAAATAGCAAAAAAACATAAGAGGGAGAATAG GAGG GAGATT CTCC T TTTCATTAATTCTTCCTTTGGTAAAATTCTTTGTCTCATAAAAATGTATCCACAGTAAAATACATTTTATAATTT AT TTGTATTATTTTGTAACT C TA ATA<N ) xATTGCTGGCACTCACTGAGAGGCTCTTGATGTATGTCTTACAGTG ACTO CC AAAC CC AGGC AAAAAT CTTT CTAC CC AAAAGCTGCAAATCTATG ATTCTGCAAGGACAGGñC AGGATTT TGCCTGCCTTGTTCTC TAACAATGTGTTTCTCATTCAG CCAATT TTGACGCTGACTGG CAAACT GG CCAC CAACT CT CTTAACAACCACATA(N ) xTAGTGCTAACTTAGTTGGCAGTCCCACTAGCAGCACACAAAAGGAGAAAATGAA TT AG GCTTGACTGTGAAACAAGTATC AG AG CAAG AAAAGAAA GT AC CT CCTT AC AG AAAATGTT A CATTCATGTA TGAG CTTATGAAAGAT TTCTGTTCTAATCCTGAAAAAC CCAACACTGCTCTGTAATCTTT CTAGTT CAAT CAAGA AATCATACAGAATCAAAAGGAACATGGTCTGCCACCTAAGATGGGGGAGGCTTGGTACTTTTCTTCATGAAAAGG AACTACTT CTGTTGAAGATGGCTAGAAAA CTAGATT TCATGACC CAAGAAGCAAGATAGC CCTAGAGATCACTAA TGTT TTCAGAAATG CT CTGCAAGT CATGTAAAGGCTAATTGAGAAGAAGAT T CCATTGATAGGCATGGTTTAGCT GTGAAAAACACCGAAAA CAAGT CAGT CAGGAGATAG TGATTAAAGAACATCTAAGTTTGTAGTTA CGC TGAAGAA ACT AAAAATAAAAG AT GCAATAAC AAGT AT AAAAA C CACAAGTC CC AT AT A CTATG AGTTGTAT TCAAAGTCC AA GTTAATAATCACAATTTTTTAAATCTACATAATTAATCTATGTCCTACTCTAGACAAAATATCATCCAGTGTCTA GAAT GAAAGCAAAATGA CTGTG CAAATATACAAACACC CCCT(N)xAAAAACACCCCTTAAAGCATGATCTGGCT CAGAGAAGATGCTGTCTAATGAAGATTTTG CTAGAAAAAGCCAC CTACTAAGACATAG CAGATAAAC CACO CAAG ACAGTGAGGAAATTGGTGCT CCTTTTTCTTG GAAAT CAGTATTGGTATGGGTTT CTAC CTGATGTTA CATTCAGT ACAGTTTAAGAAAACATGC CTT GATATACTTGATTT GT GTCTTATTTG CCTT CAGAACTAGAG CTATG GG TTGGA GATAAGC CAATTT CAG CAGTAGAAACAGGTTATTTGTT CCTGTC CTATAAATTAC CTGACTGTATG CATT GAAAT AACCA
> H s l3 _ 614526 26 -614 67 31 7
GTG GATGAAT GAAATGATTCTGAGAGTACT CCAATTATGAAAGTGTAGAACTGACATGGT TTGGCTGTGTC(N ) x GC CC CAGGTAGCAGAC CACTGGATTCAGTTAAGACT CATAAGGACCAACAAAAGGTGGAT CCAGGAAGGAGAAGG TG CATGCTTGACCCTCTTAT CTTC CT CATCACAATACTAAAAACAC C A (N ) xTTAGTAAGCACAATCAGGTGAAG G C ATTTGAGACTTTTATGGCGTGACT CT AACAACCTTG ATAGTAAGTAATTG GG AGCCTT AAAAATTATAAACC C CC CTG AAAACTC AGGC ATTAC AGC AACCAT CT CC ATTAGTGG C AATTAATTAT ACTTTGAATTT TCTCACCTTGA AGTCAGGTTTAAAAATACCCTTTACCTGTGTGTGTGTGACAAGGCATGGTATGAAGTAGGGATGAGATCAGCAGA GGTGAAGTCAAGGACAGAGTAGGGACAAGGAAATGAACCCAGGGAGAAGGTTTCACTTGAATAGGGAATGACAGT AGTATTGT TTAGAGGGATC C CTAAATGTAT CAGTGGATGTTC CGAGAG CACTG GGTATTTTGCAGTACATGAAAC CTCTACAG CCTCATAT TTCACTACACAATT CGGGAGAAAGTAAAATATGAGAAAAAAGATTAGTGGAT TTAGGGA GATAAGGCAGTGAATTAACAGCTCACTGTTTGGAGAGTATCATATATAA CAG CT CAGGACACAAATTCTTTGATA AATGATAT CTTCAGGGAATTTT CAGACAAAATAAAAGT TTTC CAATTAGTTT CC TTGAGATAAT GT CACTTTTC C CCATAAACACACTGTAGTTGATTTCATTTTTCATATGTTGGTTTGCTGATGATTTAGACAAAGGTGAATGAACCT GGCTTTACTCGCAGGATACATATTTTAAAATGGTATGCTGGTTCTTAGACAGGTAACTTCCACCAGCCAGCCCTC AAG GTGAATAAGAAACACCTTTTTGG CTCTTTG GAACACTAACT CTTT CAG GACATTACATTTCAT CT CCAGTAC ACTAATTGGCAATTTGGGAG CCTGAAGCTGTAATCTAT CTCCTAGAAGAGTT GTTACCTCACAC CTAGGG GGTAG TTTACTTT CC CGGTGATAT CA CAGAATAGCAGAATTTGAAG GAATTTTGAGCAGTGAAATATA CAGGT CTTGGC C AGAG3AACGCAGGAAAGACAGGCATATTACACGCAGTTTCAACCCCAGTAGCAGCCCCTGCATAGTCTTGGGATC ATCTTAAACTCCCTTGCTTCCGTTTTCTAACCAGGAAACAAATTCCAGATCAATTCTAGATGGAAGCTGACAAGT CTTTATAAAACACTGTCTTATTAGATGAGTGGACTTTGTGATGTCTTCTGTTATGTCAACAGAGAGATGGTACGA TCCT GACCTTTTTTCGGGAAGCACTT CCACTT T CAGAAGGTTTTGTTCAAC C CATCTCTAAGATGCAC CAAT CTA TCTGATTT CAATATTT CTTG CCAGAATAGCTATATATTTACCTTATGTAACAGGTTCCAC CCTCAGGTATTCTT C CTAAAT TACTAATGCTATGT CTTAAATT TATTATGT CT TT TTTACTTñ GGAAGCACTGG(N ) xTCTAGTTTTTTT TACTAGATAGTTATTATCATTATTACTGTTGTACAGATATAGTCAGGGTTGAAGTTTACTTCATGATAAAACTGT CTGTAAAAAATTAGTAACATACATTAAT TCTAGGGGAAGAGTTTTAAATT TT CATAAACT TTGGAAGG CTGCATA AAATAGTCAACATTTTTATTATAAATTTGCTAATGTAACCATACAAGTATAATTTAAAAACTTGTTTTTAAACAA GTAAGTATTATACTTAAAAT GGCAGAAAGCAAACTG GATTTCTTTGAAAG CATCTGTTGTTTTTGACT CCTCACT GATTTAAA CTAACTTTAT CT CAGTAC CAATGAATGATCTTTC CCATAGTTTCTTATTTAC CCATCTTTCTTTCAT GTTAAC CACC CTAAAT TG CAGGATTTTT AAATGGCGGGGAGTGGGT AAGG AT ATTG AC CTGCTT ATTAC ACG AC A AAA(N)xCATAGTATTGAAAAGAAAATAGCTGGGTTCAATTGGCTTGACAAAAAGCTATTAAGAGCTATTTTAAT GTAAAG CAAGTTACCC TT CTGAAGTC CAAATTG CTC CTTGTGGGGAAGAT GAAGCAAG GACC CTGAAG CT CTGC C CAAAGTTGCAGGGCTGGCTTCAGCAGAGTCTGGGCATATCCTGGCTCTGCTACAGCACAGCTAATGACCATTTTC TTTATGAGATGCTCTATGGAAATTG(N)xACTGATGCCCCTTAGGAGGCATATGCTAGAGAGAGCTGATGGGCTA GCTGAGGTTGGAGATGATTATTGTGATCTCAGTTCATGCAGTTGTCTCCAGCAGAGCCCGATTGATCATATACAG ACACA CA CAATGAGTT CAGCTATGGACC TGGCTAATTGAGG GATTACTTT TCAGAAG G GC CAGT CACAAAGACAG AGGACATGAG CAGAGAAG CGTGACTGGGGAAAAGACAT GTTT TGGACT T CAT GAATTT TGACTATCTCTGAGAGA GCTGAGTGAGGGTATGTGATTGGACGTCCGATGCACAGGTATGGCTTTCAAGAGAGAGAGAAACA(N)xAATATA GATGATTTTTGTTTA CTGATACATTTAAT CAACAAATTTGGATATTGAAG CTTAGAGGGAACGATGGGAACTTTG GCAGTT GATGTAATTG TATAGGAAGAATGTTT TAGATTAAGAAGAGAATGGAACCACACTAGGC C CAC CAGACAG ACACATTG GAATCACTGT CATTTAAAGAATAAGCAAAG GACAGAA(N)xAATAATAATAATAACAACAACAACAA CCATAGGCAAAGAGAGAGAAAGAGGACAACTTTTTAATAGATGACTTTGTTCTCTTTTGCATTATTGTTATTGAT TACTACTACTTTTTCT CT TAATAAACAG CAAAAGTT CAG GAAGGTTGTAATGTTTTTGAAATAAAGCT CATG GG T GGCAGATGTCGGCTTCCAACACAGGCCTCTTGGCTCATGCTCAATTCATCCTTTTCCTAATACACACTGTTAAAT TTCTCTTCTGCACACAGCCTTCTATTAGATTCTTGTTCTCATAGAACTTACAGCCAAGATACAGAATACTGTAAA TATCAGGATTGAATACTAAAATATCAGTTATAAATCTTATACGAGTGAAGAGAAGGTTATCAACAGGTATTTTCA TCTCTG CTACAGAGC CAATAAAATTAATAAGT GTCTTAG C TGAGAGTAATGAGCCTATTA CTTT GAAC CAT CATA AGAAGTTAGAGTAAGTTAGTATTTTA GGAAATAAAG CAG GfiG CTGTTGAT TAATTTTTAAGTTTGTCATGATAG C ATAATTTC TCTAAGATAT TAAATTTTAG TATTTTATATAG GAAATTAAG CAAAACTGTTTGTGAAATACATACAT AAGTATTCACTTACATATATACTCCCATATGTTTATATTACAATAATGGAAATAAAGTTCAATAATCAAGCCAAG AACCAAAC TATTGCTGGATTTTATCTTT CT CACTTACATCAATATTGT CAGTGCTTAGATTTTGTTTTTGATTGT TTCCAAAATAGTTATT TT CCATCTACAATAAAAGAGAAATGTTCATAT TT GC TGAAGAAATGTTGAAGATGC TTA TGCTGGATGAAGTTTGTGTCTACAGTGAAAAGCATCAGATCAAAACCAGTTGTTTTGGTGTTTTGAGCTGTCTTA GGTGAGATGCATGCTGAT CTAATTAGTG GTGCAAAAAATATTTTTAAAAGGAAAAATAGCTGG G CTGATTGTTGT ACATTGTCAT CAATTAGCAATAACTAGACAAT TCATTT CAAAGGAAAGAAAAATATAT CACTAATTGACTGT TAC TAATAAA CATTGTGCCAAGCAAGCTCAACTA CTTAGAGTG CAGACTTTAATGTTGCTTTGTGTGT CTCTTACTC C TTGCATAGTTTTCCTT CAA C CACTGTGAACTTTGTCAGAACACAATAATC CC CATTTTTAAC( N ) xACAAACATT GGAATTTTA C CTATAAAT CG CTAAAAGTGAGATCCTTTT C CTTTTCCT CC C CAGGACTTC CT CCTAACTGAGAAT AGGTTGGCAGATCACCTGGCTTTTGCATTTGGCACCTCTCTCATTCCAATAAGAGACCAGAGGTTTCCTTCAGCT TTTT CATATGAGTTGT TATCTGCTTGTTGGGT TATT G (N ) x ACATTAGGAATAATACTGAATTGATTGTTTGTTG ATCCCACATTCTTTCAATACTTTTTCTCCTCTAAGATTACTATTATATTTTTTATTACATTCTAGAGCCTTTTGA AATACTTTAGCTTATGTGCTTTAGAATAATCATTAAGAAGCCTGGATGAATAACACAGGGGAGAAAAGATAGGCA TGCATGAACAGTCATGCTGGCTAGGGAAAAAATACTGTTTGAGTCCTAGTTACAAGGGGGCCAAAATGAATTGAT CATTTG CC TTTTGGCT TTAGAAGTGG GC TTAT CTGAGACAAAGAGAAG GG CAAACTCAAAAAACTGGGTC CTGCA AAGTGC TGGAGATGTCAGAGAGCAACAG GC CCAGAGATAATG CCATGAAT CTGATAACAAAAGAGGGGGAGGAAA GGAAG GTG CATGTCTTTT TTTAGTAA CATGTGAACACTAAGAATGAGAGGG TTATTGT CT CTTTTTTTAAAAG TA TGTTTGAAATTTATCATGGCTTACAAAATAAGAGCACTGAAATAATCATTTGAGAAATAATGAAGTTATTATATA TGAGGGGTGCTACTGGGGTAATACTCTGTACATTGTTTGC CATGTTTGTTTC CAACATTTATTT CCTC CTTTGAG AATGGGAA( N ) xTTCAATACCCTGAAGCTATCATTGGTTATATTCCAACACTGTCTTAACTTTCCTTCCTTTTCT ATACAATTGACAAGGAGTTAATATATTGAACCAAATGATATTATCATT TT GAAGTATTTG CC CAAGGC CATGAG C ATTATATT CT CCATTT CAACAATGACTTAGGAGAAGGCAG CT CTGAAAATGTAGGGTCAAAG CAGGACAAAACT C AACTCAGAACAATGACTGAAGACTAGTTTTATTTGATTGATGAGCTGGAGTATTGGTTAAGATTATCTAACATTT TTTACACC CCTAACCCATGG TATAAAACATGTTTAT G T CAAT CCTAAT CACAGTTCCCAACTATTAGG TTAT TTA ACAATAACACTGCAGACATTTAAATCATGATTTATTCAAAGATGCCTTTTGCACATAAAGTTTATTTTGTTAGTT TGGTTAGC TCTGCATT GTTTTAAAGATAATTCTCTT GCAAAGGATAAT TCATTGAAAAGTATTTTGCTAAGACT C CAGAAAAAAAATGGAT CACAGGCAAGTT CC CAGAAGAAGCATATTAATAAAGTGTTG GAAAACCT CTGAAGGATT ATATTCT C TAGAGATGA C CATCTATTGGACATATTAGC CTTTAAAGTCAT TT TATTTT TAAGTT TAAAGAT C TAT GTGAAATACTGTTTTGAAT C CTGGAATT CAAATACAAC CTTT CATATTGT TAATCTTT CTTGAAGTTTAAAGAGA TGG GAATAAAGAAAAAATTATTTCAT CTTAAATTTAG CTC CGTGTATACTTTTTTCTTTTTAA CT CAC CTAG CTT TTATGAACACTGCCTATGGACTAGGAAGTATAGAGAATACAAAACAACTGACATAATCCCTAAATATAGGAGTCT TTTGTATATATGAAGTGG CTGCATATGCATAATACC CT GTTTAGGAAAATGCTG CAG G GT CAGCTGTATTTTAAA TAAT CAATGG CT TATT TTAAGT CACTAATCTTAAAT C G ( N >xCAAGAGGCCAAAGAGGAAGTTTTAAGCTTTAAG GAGATT CCAGATGTT CTGAGAAGGAGGATT GAAATTTAAAATGGAACACT TG CTAAAAGCAACCAGAAAAAATT T GAAAATTAATAGTTATTTTGAAGAATGGATAGAACTTGGACAAAATAAAAGGGAGTGGGATGTGTATAGGTGTAT ATGAATGTGAGTATATGTGTGCTGACACATGTATGTAGAAACAACCTGAGAACATTAAAGTGGCAGGGTGGATGT GTGG CAAATAGAGAACATAATAAG CATG CATOA CTG CAACAC CCTTTTGGGAGTTAAAAAAGAAATAAAG CTGAG TAGGTTGAGT GAAAC CAACC TC TTGGAAAG C CTTGATC TTCTCTTT TAGAAGTT TATAGATAAAT AAATACATTT ATTTATATAT GAAC{ N ) xACAATTTCTTTTTATCTTTTTATCATCAAGTAGTTTCTAAGATAGñCTTGAGGGGAG CAATTC CTAAACAAAT TAA CGAGTTTGTGGGG CT TTTG CAGT CATGTAAGATGCAT CTAT TAAAAACTGAAC CAG GTAATG CAGAAGTAGT GGGGAGAAAAG GAGGAATGAAT C AACATAGTTTAAC CAGAAGGATATGAAGTAAAAGGG AGAGAAAGAA G CAAAATATT CCTG CAAGTTTACATGTTG G GTGATGAGGAAATGATGG CTTCATTCTTCCTTTTG TCAATATCATTT CT CAAATTGT CTTTCCCTTCTC CTTGGATGAT TTGGTCAGTGTTTCCT CCTTGACTTT CT CTT GCTTTTTCTAAAGG CT TAGTT CACAAGTCTCCTTCTGTTT CCTGTCTGGATTTAGTGT CCCTCTTACACAATTTC TTGGTTTCCTTTGCAGGC CAAT GT CATAA CACTCATCACACTACAT CACATTATTTGTTCA CACCTGTTTCTTTT ACTACATTGGATTCCT TGAGTAGAAAGCA CAGAAATGT CTAGTT CTTAG CACGACGTTTG GCATGTGGTTGATTT AAGT TGATAACCATAGAAAG TTTAATGAATGAGT TTCTTCTTTT CTTTGCAATAAT TAATAGTCTAAGAC CTGGA TATTTACTACTTGGAAAGTTAGTT CTAGAGTGCT CTACAATGGTGGAATAGCTTTC TTCTCTTT TTTATATAAC C TGTGTGACACTGTTCCAG CATTAGATAAAATATTAGATTTAAAGTGGAGTGAGAGGAAGTTTTAAG CTTATT TTA TTGTTCTCTTTGGGGTTC TATGAAAT CT CAGAAAGGAGAAATGCAGAAGATGAT GGTTGCTGCCACATTTTTCGA TT TT CAAAGAT CACTCTGGC CATTTATT TCTGGGGTATCT GGAAAATT TAGTTGTG GT CC CCAACATG CT GAAT T TTTACATT CTAGGAAACACTAGTGAGAATTAGACATGAAATGATGAACTAAGAAAGAAAC TCAAATT CT CTTAAT ATTTAAGTATTTAAAT TTAGACATGAAGATTT TTAAAAAG TCGTATCCCTTTC CTTGAAT CCTTATATAATAAG C ATTTAATTTACCTGAG CTTCAG CTAAAAATGAGGATAAGAGCTGTTGTAT CAAG GAATTTAAAG TGCTCTGTGTT AAAT CC CAGTGTGTATGACATTTACGTTG CTAAAATAATTTGAGAG CT CTGGTCTCTAAAGTAGATTG CTAAAAT AAAATGTACTATGATTAACAAAAC C CAAATATTACAGAGACAGATT TTAGGGATAGTATTGT TGGCTTTAGT AAG AAGTATG G GAGATACTAGATTTGT CATTAACGGT TTTT CTTTAACCTG GAA CAT CCTACT TTTATT CT CT CT CAG TCTGAAGTTT CTGAGT GGATATGTAC CTAACT GTGTTGCTTGTTTT CAAAGAGGGATATTAAAAAGTTTCATTAT GTAGCCCATG AAAAATGTGTATTTTCTTAGTAGTAAATATTGATCATTATCAATTTATGAAGAAG(N)xGGGTTA CT CACAAACAGAAAATA CTATTAAAGAAAGTTTGGCAAAT GT CT TTAAA CTTTT CACTGC CTTAAGTATGTAAGA TT CATAGTTTTTATTC T C CCACAATGATGTAGTTACAGTTAATATCTT GTTGTATATATCAT C C GATATGTAGGA ATGTGG CATTTT CTTTTTTCTTTC TTAAAT CTTT CTACATTTTAAGTCAATTATTAAAAGACTATTAGACTAGT C TTTTAAAATTCTTCCACAATACCGAAAAACCTGAGGATAAGCTTTTTGTTTTTGTCAAGCTGCTTTATCTCTAAT GTTAAACAACTTATTAGTTTAGGGATAGTGTTTATTTTTCTGGTAAATAATG( K > xTAATTTTAAGAAGAACTGT GGG(N)xAAGGAAATTACTGGTTAATTTCTAATTATAATACATGAATACTAAGTACAAAATGTTAATAAAAATTG TCAGTAGTATTT TAACATACTG CAATAT T CAT CTTCAAAT TCATATTATAAC CTACAAAG CAAT CATGTTGGGGA GCTT TACT CC CTTT C CAAGAGTG CTTCCATGGAC CAAAATGATTGCAAAAAAG(N ) xCCTAGAACACAAGGATTT TTGAAACAGAAAGGACTTGAGAGATCAT CT CTA CAAACTCATTCAGTTAAGTTTTT CCCTCTAGTTTCTGATTCT TATTTACATGGCAT CATAGTTATTGATG GAG AAGTCAGGAAC TCAGGTAGTTTTTCAAAATAAATATCTTAATAC ATAG CAGT GAATACAAG GGGTTTTGAGT GACTTATCTT C CGATTACATT CTCGTCTTCAC CAAT GC TCATAGAAT GTGACCATTAA CATGTTACAGTC CACACAAAATT TT GCAGATAACACACAAATGAC CAAAGTTACATTAT GAATA TCTAGCAGTGTGGTGGCTGCCATCTTCCTCAGCTCTGCTCTGTTCTTAATTTTCCAAGAACCCTCCAGGGATCAT TAAAAGAAGGAACTGTAAATAT CCACAGATAGTTTTGTAT TATATAAATGTAAACTGTGTAT CAGAAAGTG GGCA AGAAACTATTAAAAAAT CATAATTTATT CAGOATAC CTATGCATAAACAAAATTAATACTTATATGATTTTCTGG TTTCAGTATGTATAA CA CTAACAT CTTTTGTTTGC CAAACATAATGATATAT CACTTTAGATTGAGAAACATGC( N)xGCTTCCTGTTGTCTCCCTTCTCACCTTCCTCACCCATTCTCGTTCAGAAGAGAATATAATACTTATTCTTTG AT TTTATTAATACATATTGATGTT CT GAATTGTAATATGCATTT TAATAACTAAATTGGGATTT GCTAAAAATAG CATTTTAATATCTTTTGCAGTTCTTCAAACATCTG(N)xGTTTAATCACTATATTCATGCATTTTAAAAGTCTTG GATTAAATGAATAG CTTTACATCTAG TTTTAATATT CAT CACAAGTTTTT TGGGTATTAACTAAGGACTT CTGTT TG CATGTAATTTAACAAT TTGATTAATATG CTTTGCAGTTTTACTTTGTATTTTAT CTTGGATCTGAGAT CTTGG AGGCAAATTCTAGCTAATGC CT CTTTGCAGAGATTTTACAAAGC CATAAGAAGAGAAATGAAAATT CT C AAAATT AT TACAG GAA G CTTTTGAAAAGTGCAAT CTAGTTGGATCGGTAG CATTAGGATACAACTTTTTGACACTTTGTTA GGGTT(N)xTAGTTTGGGGCTG CTGAAGTGGCTAATAGTTTAAGGCAGGGTACTG GAAAGGAGGAAG CTGTACAG AGGGAAGCAT CAGAAGTCTGTG TGGGGTATACTACATG TTCTTGGC CAAAGG CGAGGGTC CATT TATACAGGGTA AGACTC CATGTGTTGTAATCAAGAGTGG CTGCTGGTTGCC TGAAAATGAAACTGAAATGC CAGGGGTTGTTGGAA TTGCAAAG CAGT CAGAG CAGAGAGACCATT CAGAGCTCTC CAGG CATT CAGCTGAGAT CC CAGAAATG CCATGAC TTTG GTGTAAGGAAAACACACTAAATATGGAT CACATCATATGACTAAGGATAAAC TCAAG
> H s l3 _ 4217418 2 -421 95 70 8
TGTTAGTGACATTCTGTTAGCTCTCATG CAGT TACT CATGAATAATGCTATTTCT CAGGAAG GT TT T CAAGTTGT CTTTGAGAGG CGTC CCAACAAAGTATATTTAG CAAAAATT CTTCAAAG CTGTTTGCGTGT CACTAG CGCTCTTCA TTAATTT C TC CTGACGTGGGGTTT TCAATATGTAAAGGTT TTGACTACTTAAATAATTTTAATGGT CAC CTGGCA AAGACT GAGTTAGAGATG C C CATAGT CATG CAGC C CAGAAGT CAGGACTATATTTCTAGG CTGCACCTGGATGAA GTTGTC CT CT CT TGGACACTGACACCATGT CTTGAGTGAT GAGC CATTCATGATGTGCTAAGAAAGAGTAACTGG CACTTT GCTG CT CT CCTGTGACATGTAGACGTTC CTTGACTCGG CTGTTCTCACCCTT CC CCTCTCTTTG CTCTG CACTCTTG CCTTAGTTGACACATCTTCCTTTGGCTT GGGCTGTC CCTCTATTCAGATGA C CACAAAAATACATAT ACAGATAG CCATATAAA CATGAAACCCTAT CTAGTCCT CACCTTGCTCCTGAACT C CAAATT CAG GTTTTGAAAA CTTTAAACAT CGTG CT TGATATTTTTACATAGGTGT CC CAATAGTGAGAAAACATACTATAACAT CCAAAGCAAA CATCTGCTTTCTTCACAAGTCTGTCCTCTTCCTGTGTCTCTTTCTGTCCTCCAGGGCTTGAAAACCTCCCATTTC CCTTCC CT CG CATT CAGTGT CACTTGTCTT TTTGTC CTAC CT C CAAATTGCTTCTTGAATTTGT CT CATT CTTTC CATTTT CATTTCAATTACGCTGGCTCAGGTGC CTAT CACTTCTCAC C C ( N ) xTGAAACTTTCTCCTGCATGGCTA CAAAACTT CCTAAAGAAGATAGAACCAGCT CT CAGCTCATGC CTGAGACCACACAGTG T C A T ( N) xAAGCACTAT TTATCTGAACAAAAGCTTATTTTGAAACACCACAGGGGAAAAAAGGAACATAATTGTTTGTTCATGGAGATACCA TGCTAG CCAAAG CAAT CAGAGAAAGGTTAATTTAGTTTAG CCTGAGAATGAAATTATGATTATAGATCAGAAAAA CATTGGAGAGAAGATA TGATTAAAATCAAGTGGTTTTTTG TAAGAATGGATGTTTT CTTTTGTTAGAGCATGCAT GGTGCAATAAGAGATTGAGATTCTGAACCAAGTATAATTGAAGTATTTATGGACTGATGAGAAGATTCATGCAAT
c g a t t c a c a a a a g g g a a a a g g t t c a a t a a c t c a a a g a a g c c a a g g a a a g t c a c c g g t g a t a g c t a t c t t a a t c g a GAGGAAAGACATGTAG TG CAACATAGGCA CATGTTAAAGCAGACATTAACATGGGTGG CAA CAG CATTTC CTGCT CCTCAAATCCCACCTGGAATCAATGAGTGGGCCTGAGGCTGTGCTTTGGAAAGTCATCTCAAGTGATGATTTGCT AGTTCCTGGCCAAACTTTGTGCCGTTCACCAAGGAAACACCCATGGTTACTTGGATTACATAGCAACCTCAAAAC
a g c t g g a a g a t a t t a g t c a a c t g g a g c c a c a c a g g a c t g g c a g a a g a t g g c a a a a t g g g a c c a a a t c a a a g g t t c CTTGCTTT GTTT TCAAT C TAAAGTTCCATATT CTACACAAG G CCGTGTCCATGCTTG GTGAGTAGC TAGGTTTTA
a a c t a g c c c t a t c t t c c t a a a t g g a g g a c t a g a a g g c c c a c c t g a g a a t a a c t c c c c t a t g g a t t a a a a a t a g a t TCAATAAGGAAAACACTTGAAAAATGCTGTGTTGGCATTTGGATAATGATGCAAAATGTTTTATGAGTATATGTA
a g c a t a t g c t t t t t a t g g a a t t g t t g a g t t g g a a a a a g a g a a a a g a g a c c c t a a c a a a c a a t a a t (N )x AAGTAC a a t g a a g g t a a c a a t g a a g t g a a t c c t t c t a t a g a g a a t g c t g c a t g c t a g g c a t t g t t t a a g c a t a t c a t c t g a GTTCTTCACTAACATACTGGTATCTGTCATTCCCñTTTTACAGAACCCAGGCTGTCTGGCTGCAGAGAAT3ACAA t a a t g a t g g a g g t g a t c t t g g t { N) xAGATAATAAAAACTCCCCTGAAATAATTACTCTCATTGAGGATATTTAG TAAGTTATTGACTCCAACCCCCTTAAGGATTTTTAAAAAATGGGTGCTACTTTG( N ) xTGCTAACTCTAATTCTA ACCAAGGCTT CATTTGTGTGTGATTGTGAGAAGT CAGAGCAG TG GACCTGACTTTT CATO TTTT GG CTGATTGTA TA A A C (N ) xCCAATTAAGACATCAACTAATACCAGATGAGGAATCTGATACCTTCAATTAAAAATGTATTCTTAT ATCCCTTTGTAGCACTGTG(N)xCATTTCGTCTCATAGCATTGTGTGCAAAATTCTCTACAGAACTCAGGAAATT T A G T T (N ) xCCATACTCATTCTCAAGTCTTAGGATAGGCACCATCTCCCATTACTCTCACCTCCTCTGAATCATC CATTTTG GCC CTTA GC CAACTATTGCCTTACC CTGAAG CT CA GAGC CAACCTGGCAGTGT CCTGGTG CTCTCAAT GTCCAGGGAAAGGAACACTTGTCAAATGAGTGACTATGTAAGAAGATCCAATCATTCAATATCATGTTGTTCACA TATAGT TTATATAAAA TCTTATGCTGTTTTAT CCTAAATAACTG CATATAGATTTTTTTTAAAG TACATGGTTTA ATTTTTACATTT CTTGGATTTTGAAATTAGACAT CTGTTTAATTATTATAAAGATC TTTC CT CCTG GAAATGACA GAACTTTT ATT AAT CT CTGG AT ACGGATTAG ATCTTTT GAAAAAAT AAGGCTTTTC CAGAGTTTTGGGTT AAAAT GGTGGATTAAAAACG G CAT CACAGAAGTTC TAAT C C CCAATACTTAGCAAAAATAATAAAAC TT CC C C CAGAAGG ACAATTACAAATAAGAACCAAAAAGGGAAAGGCAATCACCTTGAAATACTTCATGGAAAAGAAACACAAGATTCC AGAGTAGAATGGAGAAACAGGGCACAGCTTGGCTCCCAACCTTCTTCCTTTGTCCTGAAATGAGGCTTTGCTTTC TCTTTAGACAAAAAATCTGATTTTGTTTCTGATTTTAAGGAACACTCAGCCCCTCCTAAATTATTTGTGCAAACT GACCACAGAG CACAAT TT CTAATTTGAAAT CT C CAAA CAATAGT CATGCTTAGGAAATA CTGAAGTGT CTTAATA TTTTCAG GAG GAGTAG CAGTACCAACTTGTTC CT CAG CATTGAGAG CTTTGTGAAACTGCAAAAG GGCAGG CAAT GTGAAAG CAAGGGGAGG G CAGAAAGAGGAG GAGACAGGG GA CATACAGAATACATACACTGGAT GAT CAAAAAGA AAGCCATGGGATAGTCCAGTTTTATGCAGACATTTTCACACTGAGACATTTTTTCGGACTGGCTAATTAGGGTTC AGATAATTTT CT C CAAGACTTTAGGAATGTGT CCAGGTTTGGGC CCAGGTTATTTT CTTGGT CCATAAGTAGAAG ATTAC C TGTAAA TTATCCAG GATGATTCGGAGGGAGTG CACCTGTCGCCGAACAGCAC CTGAAAAC CT TT CATAG GTTGCAGCAT CGTATT CACT CATTTGGATCTC CTTTAGCC TGAAAT CAGAAGAGTATAAAAAGT TAA CAG CTTAA ATCACGTATATGTGGGTTTTTACTTCCATTCGCCTCCTGCTGGAACTTTAGCAAACTCCTTGGAAACTCAGATTC ATGTTGAGGAGAAAAGGCAAGAAGTGATGT CTCCAGCTGTGCTT CCAGATTCAAAGGAG GTAACAT C (N) xCTAT ATCCCAAAG C CATATT GTGATGTTTGCAG C CGAT CAGTT CATGTGG CAGATTCCTAAGAGGGTT CC CATGG GTTG CTTTTTAAGATCGAGATCATGTATTTGGTTGACCAAGATAACAGTTCAAGCCTAAAGAATGTAAGTTAATGTGCT GCTATGTATC CCTATGTGAGAGTTTTCTAAAATTG CTGACAGTGTAATTTAAACAG GACAATTG TATTTTAATAT TATTGTGGCTTTGAAATTCTATCTTCCCAGTATGCTGCTCCTTTTTATCAGTTTTTAT( N ) XCTGAAATAAAAAT ACAACT CAGCAT TACT TAAAATGCAGCATCTTAAAC CG TCAGTAGAAAGTTCAGAG TGAACT CCCTTGGGTTTTA CAGTGTTGTATC CATAATACATTAACTTAATAATACATTT CTATAAATTACTATAAATATAAAAGGTACT GATAT TAGGACAAGGAATAATAAGTGCATATCTCAAAAGCCCAAGCATAGCATAATCTTCTAAACTAGTGTGTTATGTAT ATGCATG CAATGTTG GAGATATGAATCCTAGGTGTATTG C TGAAAC TCTTATCTAATGTGTATTTT TTAT CACCT TGAGCTAAAACAGAAACCTCAAATACTGAATGAGAATAAATGTTCTGTATTCCTCCCTGTCAGGTTGAGCACTGG GAAAAATC CATT CACCATGGGGGAGAGCGC CAGAGG CTTATTACAATTATGTAATG CTAGGC CTAGACTACCATG GGGCAGACTGACAT( N) xCAGACATCTACCATGAAACTCGCATACTTTATTGCTTTAATAGATAGGCCTGTTGCC CTCTGTTATGGT( N) xGCACTTTGAAAACAGTGACTATTAAAAGGTAAATACAGACAAGGCCACTATAATATCTG AATGGTTGTATC TTGAG GTGACATGCCAAG GTGCCTTT CTAAAT CTAATTAACTCT CAGT CCTGAAGAATAAAAA GAAATATGTGTATAGAACTTGACCTCCTGACAATACCTCTTGTATTTAGTTTCTGGCAGTCTCTGAAATCTCTCT ATGACTTTTTATTAGATTAAGTTATGTCT CAAGTAT CAAGGCTGACAGAAGCTGGG CACAATGAACAATG CAGGA CATGCCACCTACCCCTTCATACCTAAACCGGTGTGTGTTTATATGTCAATATGCTTCTTCTCAAATAAGTGAGAT TAGCACTGATTAGAAACAAATGTAGACACAGCTGTCTGTGGGGTGGGGAGGGGATGGAGGAGAGAATATATAGAA ATATGTGATTAAATTAGCGCATTATGGATACGACTCTGTAAAACCCAAGAGAGGTCACCCAGCAAATGAACTTTC TACTTACAGTTAGTTTAAGATG CTGCATTTTAAATAAT C C TGAGTTGTATTTTTATTTCAGC CATT CAATTCATT TTCAGCATCCATTATATGTTAGGTAATATGCCATATACTTCTATAAATATAAAAGGTACTGATATTAGGACAGGG AATAATGAACATACGTCTCAAAAGCCTAACCACAACACAATCTTCTAAACTACTGTGTGTGTGTGTGTGTGCACG CACACACACATGCAATGTTGGAGGTATGGGTTGTAGGTGT( N) xTCAAGGAAGGAGCTAGCCATTTGGTAAGCCA CCTCGAGCAGACTGAT CTTTGCAGAGTGATCAGAAT TAGCTT CAGTTA TAGG C C TACCACTGGTGCGATCATGTT GCAC CCAAAACACAC CACATGCTGTTTTAATTATGAAGAAAT CTGGAAAC CATATTCCAGATTGTGTCGGTTTT C AGAAAGATCACC TA CAAGGCAAT CAAAG CTGATAAATT TTGAAAGATGATGACTGAAGCAAGGGATGTAGAAAGA AAAGAAATTTTAGC CT CTGGAT CAAATGAGGTACCCATTGCTAATACT CT CAGG CCCACAAAACTAG CTGGAAAC ATAT CTTTCTGCTT CT TTAACC CTGAT CAAAACTGCAGGG CACAGCTG CTTCACATCCTAGTAA TCTTTGTGGAC TAACTGCATTTTGTTGGCACAGTATATTTTTAAAGGTTATATTTATTGTTACGTAAGAGTCTGGGAATTTTAGTT CTCATTTTACTTTTTCTTATAGGTCACACTTAAATACTATAATTTTGAATCAATTATAAGGAAAACAAACATAAG CTTAGATCCACTGTGTGAGGCTGATTAGTTTTGCAAATGGTAGTGGATGCAATCTCCACAGCTGGTGCCTACCCT AG ACGCTTTT CT AAGTGTGTAC ATGT AC CTCATTGT CT AT TT CATTTG CT TT TC ACTGGT AT AATGTT TCTGTT C TATTTTATTCATGTTTTATGCTTGTCTCTCTCTGTTAGTCTGCCTGAGTTAATACATTCTTGATATGTAAGGGCT GGTATCTAGC TTAAAACCAGATATAAAGAGATGCTATCAAAATGCTGT GTTGATGATGTCAGGAAC C CACTTAG C AGCATTAGGCTCTCCAAGGCACTACACAAACTTTCTGGCTCACAGCTAGGACTGCCTAGTTTCAGCCAGTCTCTG AATTAAGTA CTAAAG GATGATT C CAT TACTTGTATG CACGTGTGTG( N ) xGACCTCATTCTGCATATGCCTAAGT GGCAAAACCGTTTTAAAACCTAT(N)xTTTTCCTCTTGAAGTTGGGGGGTCGGGTCCATCGCATTCATTATATGT T C CT CTCAACTT TTATGTTTGAAAAGTT CCTAGTAAAAGG TTGAAAAATTTT CAGCAGCATTAAATGAAATATT C ACCAAATGGGATGAAAGTCCCATTTCTACATGAAAGTAGAAAGCAACATCTTTCTCTTGGTGTTTTGCTGTGCCC CATCTTTCTCAGGATCTCCATGTGTTTCTATTTCTTTCTCTC CTTTCT CACACG CTTGGC CAACTC CT CTGTCTT TCAACCACCTTACGGTATTCTCTTTTACTTCAACTGAGACTCTTCTCTGCAGCATTTACTAGAGGTGTACATTTT CCTCATCCCTTCCCCATGTCCTCTCTTTCCTCTCTTACATCCGCTAAATTCATTTGCGTTAAGGGCACAGATGCC CCTTACTTCTTC CC CT GTAGGG CAATTGTCCTGAAGAGAACCACACTC CT TACT CCTATTATGG CAA CAGGAGAA GGGCGTACATAC TTGT GGCCAGGCACTGTTACCTTT CTGC CT CCCTGT GAACTATGTGTGTG CCAGAACACTCTT AT CTACCTGT CCAC TT GGGCAAAAAATTAGACTGAACAATTA CTTGGCAGGTTTTAAGGGTGGCAACAACAGAA C T CATTAATTG CGGAGGGGAGAATGACACATTTCCAATTGTTTAGTTCT TTTT CTAAATGGGTAC TTTATAATTCA TAAGAAATGCAGAGAACTTTCATTCTAACAAGACTGTTCACTACACTATTCTCTACCCTTTTTTGAGAGGCA( N ) xGAG GGTCTGAG CTGG CAGGCTTT CCTGAGGGTGTG GCGGCAGGGCCTGCATTCTTGTCTCTATTGTC CTGTACT GTTTACTTGCAGGATGTGGAAG CA CAGCATGTAATGTAATAC CAGATGTT CTACGTGGGAGGAA TTTCATAACTC ATACATTTTAGCATGCTTTTTTCTAACTGCGCCCATCTCCTTTCATGCAGCACAAAAATGTAAATCTGTTTCCCA AACTACTACCAG GT CC TGGAAT CTAAAAGGAAATACTGGTGGTTCTTTGGTATTTTTCACTTTTAG TTGAAGCAC A CATGTGGATGT CTAGGGTAACAT CATGTGGATTCT TCTTATTTTTTAAATG GCTCCTATTGTAGTGG CCCAGAA CAGCTGGGGAATGTCCGAGGGCCACAGGCTCACTGGCCTAGATTAAGGCCTCTACTCACTGATGAGGCCTCCCTA TCCAAACTCACTGTCCCTCCTCCCGAGCTGGTACCCATTATCCTGAGTGGG(N)xAGACTTACCTGAATTCCCTA TGTTGCCTAGGAACAGATAGACATGGATGGGGCGATTTGAGCTTTAGATTTTAAGGAGCCTGAGTAGAAAGAACA GGTGATTTTGTT TATTTGGCTGTTTC CGTTAGGGA C GGGGATGGCTGTGGATTT CAGTTTGAGGTC CGAGTTGCT GCATGAGGTG GAGGTGGAAACACACACAAATCTAGGTTT C CC CACAGT CCAT CATGACATTT CACCATAGCAGTA GTGCTGGTCTAAATGTGATGGGAGCACTGACCTCTGCTGGAATGCTCTCTGGCCCATCTCTCTAGCAGCTCTCTT GACCTCTTCGGGAACTGCATCTTTCTCAGCCTGAGAGACCTGGTACACCGTATGGCCTGCATCCAGCCGGTAAGG GCCTCCTTTGCCACCCAGGCCTGCCGTGTCTCTTCCCCCTGGAAGGAAAACGTGGAAATTCAGATGATAAATCTG AACTTGACCACAGAACAAAAGTTGGGTCATGGTTTTAGGACAGCCTTGTGGAGGATATTAATTGTACTTAACTAG TAATTAAACAGTCAAATTACCTTCTGGAAAAAGATGTATTCTCTTGTTTAGGATGCTTTTCCTTTTTCCCATCCA TTAAAATACTACCTATATATCAAGGCTTACTCCAGCTTCTATGACACCTTAGCTAATGCTGCTACCTCATAAGGA TC CATACTCTTGTGTACTATTTGTTATC CTTTCATATT TATTTTTCTCAC CAAC TATATGATGGATGAGA( N ) xA G C CAAGATTAGAAG CCAATTATTACA CCAACTTTGACTT CTGGTTTGAA CAATATATAACATAAA CAC CATCAG C TTTAGAACAGTCTTT CAATAGAAACT CAAGTTTACATAATACATATTACATATACATATTAC GT TTTATAACATA A CATTGTATGAACTGTTCATATTGTAAATGCCTGTAAGGC CA GGTAGG TCATATGATAAAAA GTGGTG GGGACAT TGG CTAACTTAGGATTTACCTTGT CTAAATACATGAAAATTG( N) xATTTAGGGGTAGATTAGAATACTAGTAAC CTGTTTCTTTTGTT CA TTCACG TATAACTTGAGAACTTAGAAGGGAAT CCAATATTGGAGGGGCAGTATAGTATA ATACACTGCTATTGTAGAGTATATTATGATATAGTTA(N)xATTAAACAAAGCCTAGTTACACAGAATTAGGCAA TTTCTAGTTTTATAAAGTGGAAAATTTGAAGGCTTTTAGACTTTGGGTAAATTATGTTACATTTTTCTTTCACCA T CAC CACCCC CACCTC CACCTTAC CACTGAGCTTTAACTG GT CTAACAAAACTAAAACCAAGAACT CAATATGAA CTGCTTTCCTTATGAAAGTCCAGGTAGGTCATTTCCTTTTGGGAAATACCATGAGAACTGCCTATTAATTACACT ACTTTAATCTTTTGAT TGGAACATATTT CACTTTTATTTAGAAATGGGTGGT GGAAAATT CCTT CTGGGATGTGT GTGACTCCCATTTCTTACTCATGGCCAGCCAATAGGCAGAGGGGTACAGTAACTGGACAGTGACAATGAGCTATA GGAG GAAAAGGT GTTT CTGGGAT C TAACTCCTTTCCTACAGGAGGGTATC CT GAGTATGGGTGGAT CATGAATT T GAG CTGAAAG CTTC CTAGGTG CTGAGATGTTATCATT CTGGTGAAAGGGAGAAAGTGGTGGAAC CAT CAGAAGCT GGTGTGAGTGACTGGCCACCTGAGTCGCAGGGCCCCAGGCAAAGTCACACTGGGGTGGAGCAGGAGGTGCTCTTC ATGATCTCCACTTC C CAGAGCCTTTAATTTGTACTATTTAAATGAAAG TG CT TACCAGAG GTT CTAAGACTGCAG GGCATAAGCCTGGACATTCCAAACCACTTCAGAAGATAATGGGGAGGGTGGGCAAGGGAGGAGCATTGATGAAGA AAGGGGAATCAAATGACTTGTTGGGCCAGAGCTCAGTTCAAAGTAAAGAATTCTGAGGGAAAAGAATGGCAGAGA GAAAAAGAGAAATTGAAG CCAAGGAG CCTTGATCAATTGAAAA CTAAAAG CCAGGT GAAACAAATGTAGC CT CAA T CTGGACAGATACGTGAG CTAGTAAATT CAAGTTTGGTAACAAATAG GAATTT CTC CC CCAGAGTAGTC CAC TGC TGGGAAACCACATCTGGC CAGCTCGT CC CACAATGCATTC CATATGGATTAAATCTTGTTGG CT TGACGCTAACT TCCAGATCAAAGGACATTTCTAAAAATAG CCTTACTTTAAAG CTGATG CTTGATGAATAAA CAGAGTAG G CAAGA AGTCCACAGGCCTATTTGATAGCACATTGTCTAAGCAGAGAAACCCATTTCCTCCCGAGGCTGTCAAGGCCAAGA GGCACAGACCATCCTAAGGAAGGTGAAGTACAGAGCTCCTGCCAGCACTCCTCATGGTGAGATAAGCTCAGACAG TGCGCACACAGTCTGGCCCTATTCCTGTGGACTCGCAGATGTGTCTGCTCCAGCTGCCGCAGAGGAAAAAGGAAG ACGG CAGTC CTG GG AAC AATGC CCTAGTTGGGCAGC CAAGAGGAAGGAGT CGGCC ATC GGTTTAG ACTGC AAC C A GCCTTATCTCGAAATCTTGCATCCTCAACGTACTTCGAAGATGTGGAAGACAAGGACCATTTCTTTTCAGTCAGT TGCTGAAGAAAGAATTTACTGTGGATTTCTGGCATGATCCCAAGGTCATGGAATTTAAAAGAAGGGATTAACACt N)xCTCTTCACATGCCTTGATGTCACGCTGGGGCAGAGGGCAAGACAACAATCCAACTGAGTAATGAGGGAGGCT GAGAAG GAAAGC CCGTCT CTGAGTCTTCT CAGGG GCGATGTG CTGGCCTAATGGGC TTGGGAAACCTGGG GAAGG CTGACT CAGGAGAATGCCTGTG CTAC CAA CCTGTTC CGCCAGCC CAAGTGTTGCCG CC CACGTGAGGCATGTTGT CTGG GT CCTCCTTC CCGTGT TTGGGGGAGCTTACAT CTTCAC CACTGT CT CTGTTGATTGTTAT CTAAAAATGAA CAATTGAGACATTTTTACTATC CTGTGACAAACACCAGGGACTG CfiTGACAGAAAAAAAAATTATTTTCT CC TAA GTCCTTAAGGTATCACATAACCTCAGGCCACTGGACTTGATAAATGCTGTGAGTATTTGGCAAAAATTCCATCAG ATGG ATTCCAACAT CC AT CT AGTTG G GAAG AAGAACTAATTTTT TAAAAAACTCTT AAAG AAAAGT AAAG GAGCT GGA CTG CATTAT AG CT A C AG AATTTTTCTGGTGC AAACCAAG CTGTGAGC ATTTAC AC ATGG CATG AGACTT ACC AGATGCAGGGAAAATGGATCTCACTGGAGAGGGGTATTTTGCCAGTGGTTTAAAAGGATCACCACCTAGCTGATC CAGCTGAGTCATTATATACACACTTACCCATTCTGTTTTCAAAGAAATCCCATTACTCTGAGAGATGCCATTTTA CATATAATATATGGAAAAGAAAAACAAAAAGTTATGTATGTATGCCTTAGTGATAGATTTATGTTATATACTAAA AACTGCTTATTTATTTTTTTGACCCACTTCAATTTTTATTATGGTATTTTGTAGCTTTCCTAATTCATCTTTATT GTAAAACTAGATTTATGATTCCTGCAATTCAAATAGAATAAGAAGTACTTTAGCCTTCCTCTGTGAGAGCCTGAA GTTCATTGCTTCCTATTTCATTCAGTACAGTGTCACCAAAAGGAGTGACAACCACACAGAAATTTCAATCCTAGT AGTC AGTTCAGT AG AT ACTACT CCAC AATAAATC AAAGCATT CACATTTT ATGCAC AC C ATG AG CCTGCT CT CAG ATGCAATGATTT CC CAGAAGACAAGATCAAATTCTC( N) xCTGAAGTCAGTCTTTAGAGTTAACCATTCCATGTT GGTATATGGGATTTTCATTCTTGATGTTGGGATTAATGCAGTTAGGTAGGTAATTTGAGGTCTTTGCAAATAAAA TGGCCATTCAGAATAAAATCAAGACAACTTCTACTGTCTTACGTCCAAGTTGGTTCATTTCAAGTGTTATGACAT CGTAGGGATTTT AATT AG CTGG ACTTTT CTGCAAAT AGCTTCGG CAGG AT CAACAAAGTTTTTG CC AAAACTTC A G CAT ACTCC AAG AAAATTTAGC AGCC AG AC ATGACAATTG AT AT AAATTC CACACACATAAT CATG CTCT AAAAA TGGGTGAAAATAATTTTT CCTC CTGTTT TCTTAAACAAGCAAGATAGCTAGGAACATTTAGAGAAAAAAAAAACA TTTAAAAT ACTTTC CCAAAAACTTGC CACCATTTTT CAAC TCGTTTAT AAAT AATGTAAG AG AT AAGCAGTAAAT TAAAATACATATACTGG CTACAAGAATT C CAGAAG(N) xCTCAGGCACAAGCTTGCCTCAGACCTGGAGATCCTG TTTAGAGATCTTTG CAGCAC CCTCTAGATGTACT CCATGAGAAATGAGAAGGACAAAGAACATTT CAAAACTGGT ACAGTTTTATTCTGTTGCTAATTTAATAAATCTATATGAGT (N) xGCGAAAAACAAAACAAAACAAAAAAACTAT ATGAAAGGTATGATTACAGTCTAAATTTTTTTAAAACAAACAGAATGGAACTTACATATGGAGATACTA(N)xAA GATACATTTTTT TTTAAATAATAAGAAGATA CTT TATTTTTTAATCATAAAGA CACAATTTATTTT TACTTTTTA
( N) xTGTAAGTCAAGGAGCATTTGTAGTTTGCTTATATGCGTTAAGACTCTTCTGTAGGCCAATTCTAAAAGAGA AGTT CCAAAAATAGTCTAAACAATTG GAAGAACCATT CTGAACACCATTC CTCACT TTGTGG G G CATGTGTGTGT ATTCTGGTACACTATTTAAAAATAAATTTTAAAAA
> H s l5 _ 5879473 6 -58803474
AAA CGTATCATT TAGCCTGC CT TAACTC TTGCTT TGAGGT CACAAATGTCAAAGTG GTTAACTC CAAGTCAC CTC CAAGAACATTAT TC CTAT TGGGTTAAGG CTCCAC CCTTTTGAGC TCTAAT TATCTC CTTAATGT GTGAGAAC TGG GGGAATATGTTCCCAAATTAGAAATAAGTTCTCTAAAACTAAGGCTTCCAATGCTGAAGAGAATGTACAAAATGA GAAAGGCATACTCAGTCCTTCACTTACACAGTCAAAGAGCAATTTTTAGGGCACTCCCATGTGCCAGACAGCCCA
A(N)xCTTTCCTAATCATTCAACAGGTATTTGAGGTGCTGAAACATTGTTTAAAAAAAAAAAAAAAAGCCCAGGT GCAAACACTTCTATCAGATGTGCCATGCATCATTGACAATAGTATACACATATTATTTATAATATGTATAGGTGC TTTATAAATATTAACTCATTTTATCTCCATAATAACTGAGCTATGGACAA{ N)xGTGTAAGAGAACTTGGTGGGT TCCTTAAACCCTGTTCAGGATCTGTCACTGGATTTGCCTGTTGTGCATTCAGAGGCTGGAGAAGGAGTGGAGGGG GCAGAGTCCAAGGTAGGACACTAAGAGT CAGGGAGAGGTGATTTGTGAAAGGCTTTAGGGG CAG CG CCAG CC TCT GGATAACAGGGACCCTGGTGGTACCAAGGAATCATTGGCAGGACTCTGGGAGATGAGGACTTGGTCTGCATGGGG CAGCAC CTCGCATC TGATACTGG CGCATATCAGCAC CCTACTGGACACAT T CAAACAGGCAG CACC CTGTGAGTG TCTCAGCACCAGCACATCAACCTTCCAATTCACCAACTCACTCCAGGGAATGTGAAGGGCAGGGTTCCACTGGGC CGCACCACCCAGCCCGGGGTGGCCCCAAAGGCAGAGGGGATGTGAGTGCTGTCTCTGTGAGCCTAAGTAGGCCCT GCAC CT GCCAGAGACAG GGCACAGAG CAAGTGTC TGAGGG CAAGTCAC CTGGACACAG CAGTG G CAGAAACCAAG GAGTAGGCATGCCAGGCAGGTCTCCAGCAGCCTCTGAGTTTCCAGGCACAGGGCCCACACCAGGGTGGACTTTTT ATATGTACATACACACCCACACACCCACACCCCCACCCAAACACACACACTGCACCAGTCCCGTTTCCTCAGATG GACATTTTTCAGAAGAAAGCCAATGACACTAGCAAGTCCATGACCCATTTCAAGATATGACAGGACATCATTTTC CAAGTGGAGAAAACAAAAATTCTCAGAAAGCCCTGTATACCACAGAAATCCATACACCAGTTTCAAGTCTCATCC AACAGACCAGCCAGAGTGCAGGGCCACAGGGCACCCTGGCTGGAGTGAACAAGCTCCCTCCAAACCCCCTCAAGA TTCTAGGCCCACCAGCATAAAAGTCCTCTTGATGGTGGGGTCATTCCAACCCCATCTGGGGTTGTAACCTCTCAG AG CAGAGGAAAATAAACAGG CT GTTGTCAGGCAG CCCAGG GCAGAAGTGATGACACCTGCTCAT T C TGAATTTAT CTTTATTATACATTATGGCCACCTGTGGACCATTCAGAAAAGATGCTTTTTGAGTCTCAAGCAAGTGACTCTTTA AGCCGAATCACAAAGAAATTCCCATTTTGGCCAGTTCTGTTTCTGGACGTCAACACCCCCCTCTCTGTATGAATA AGGGGTCTTCCAGATGGCCAGGAACAGCACTCAACAGGCCAAAACAAAGCAGAGGGTGGCTTGAGAACTACACTG CTACATTCAGCCTGAGGCCCAG CATCGAGCTGGC CTTCTCCCACTTC CAACAAC TCATTTTGTAGCTTTT TGGTT AAGAACAAATTTGGATTTCTTC TTTTTCTCCTCC CAGTAATCTCAAATGTAT CAGAAGAAAGGAAATTTC TACCA TTATTGTCAGAAACAAGACAAGTAAAAGGCCATCCTCAAATACTAGTGTTCTCTTCACCAGACAGCAGCACACGT GGAGAGTAGCAGAT CT CTAAGCACGACC CAGT GTGTAACTC CAAATGGCCCCAT TATC CTATCT CGAGGAGAGCT G CGCATGC TGTACC CTGTTTTACATGGC CTGC CATGCTTC TGGTAG CAAAG CAGTAAT CT CCTGGT TATGTAGCA CTGGGGATGCCAATAA CAGC CAGATAAAGAATAGAACT CATGAGGC CAGTTATT TTCAGT CAAA CCAAGCTA CAA AAACCACAGTCATGCAGAAGCTGGAGGAACCACCAAGGCAAAGAGAATGGAAATTCCCTTATGCTAGACAGCACA GGTCCC CAGTTTT CTGGGGCAT CTGAAACTTGATGTGACAG CACTGGGAGAGGC CAACAGTCCACGAATAGG OCA GGACCTAAGGGGAAAGGCTACCACTCTACAGCCGGCCATCAGGGTGCTGGGCTACAGCAGGGCTGGCAGGAGCAG ACGGAGGCCACATGCCAACTCCAGCCTCTGCTACATATGGTCCCAGCCCCCCTACTCACCAGCCATGCGGCCTCA GG CAAGTGGGATAATC CCCGTGTCTCAGTTTC CT CATCTT TAAAAAGTGGGATAATAAAAGATT CATGTACT TTA TAGGGTTG TTGTGAGGATCAAATGATTTAATATACAAAAAAATGTTG ACAA CAGGGCT CAACACAAAGTGTG CTC AACAGCTGCTGTTGGTATTGGTGGTGGTGGTGAATGGTCTGCCAGCTTGGTAGGGATGTCTATGAAGTGGGACAA A CATTT CATGAGTATGAAGATAGACTGT TCAGAT GATACT CATCTAATGCCATT GGCAG CTCCT CT CTTTATTGT ACCAAGATCTGTTTCCTTCACGTTTCACCCAACGTCAACCAGCCCTTCCCTTCTTTGGACTAAGTCTCCCTTCAA CACATC CT CAAATGA CACAGTTTCCAGT CCCCTCACTCTTAGTCACCCTCCTTG GAACAAGCT CAAGTTG CTAAA CAACTATCTCTAAAG CCAGTTCCCACCCGAGCCAACATTCCAAGGCCAAAAACCATCACCTCTATCATGAGTTGA GTATCCTG CTTT CATTAGGGGACTGCACTGACGTGGCT GCAGGTTG CTGGTCAG CCCTAGGGAG CCAGTAC CATA CAGCCAGTACCCGCGAGCTCCCTCCTCACAAGAGGCGGCTGTTGCCTCCTTGTGCCAGAGCTCCTGGGCTTTGGA TAGAGGAATTCCACTG CTTGGGTCTCAT CCTTACTCAC CAACTTGGAGGAA CTT CCATAGAGTGGATGGCAG CCG CC CCTC CC CACT CTGT CTCTGTG GTGTGTGCC CCACCTAGGCTGTGTTTGGGTG CCTGTAGTAC CTCTTCCTGTT CTTGTCT CAAACACACACCTGTGGGG CT CCTTTC CCAGTACCGTGCTTCACAACGGGG CTCTCCTTAGCC CAGGA GGG GAGAG G CTGTG CAGAAAG GAGACTT CTAT GACACC CTTGGGGCAAGGG TGTTGTTTTGCTC CT TCCATAGAT CT CTGAAG CCACAG CTATGCAGGAGAACAGAAATAGAAGC CAGC CACACCCAGT CTTCTT TGGCATGGGCACATT GAACGGGC TCAGTT CT CCTG CCACGACTGAAAGGGCAC CC TCCCACAGAAGG CACATGAC CACTGTTCCAAGACA GT CCAGAT CAGGAGAGAGG C CTTTGACCAGGG CAAAGCAG GTGCTGATGAGAAGGCACAG CAAAGGTGGC CCAAA CCACTCTGCCTATTCCCTATCCCTCCCCTGTTAACTCCACTCACAATTCCAAACCCAGAGCCCTAGCTTCTGAGA TATCCAGGTAGGCGACAAGATTTTCTTTCTCTACATGATACAGACTGCTGAGAGGCTCCCCTAACAAAGCACCAG TATTCTCCCTGAAGACACAGGCTCAAATTATGCTCTATTCAAAGCTGGCTGTGTTCCCAACTCCTAAGCCCTGCT GGCTCCCTCTGCCCCCCACCCTGACGTG GGGGAAGATG CTACAG GTGTGGGG G (N) xGGAGTGGGGAAACCAGGA GAACAGGGAAATAAAATTGGTTTTCACAATAGCTAAACTTTCCCTATCCTCTTGAGAGTATTTCTAAGCAAAAGA GAAATATACTTTTCTTTTTG CTTTCT CTAAAA(N) xTCAAACTGACTTGATTTAGCTCATTATTAAACATGCAAA TAAGAAAATTCC TGTT CTTGGAATGTGATCAGT CATCT GCATAAGGTCTTGCGACATATC TACGAT TGAATTATT TCTTTGAATCAAGGATTCCCCCGGCCACACACAAATCAGTACTTGCCTTACTTTGGAAAACGAAAAGAGCTCACT AAAGTGGT CCCT CTTATAATAAG GTGAGAGTGGC CTCTAGGACAAATTTCAG CTTGTACAATTCTGGAATACAGA AAGGGTAGCTGGCTAGCGACCTTTTTCCATACTCCACCAGGCACTG( N) xACTGCCACCCCATAGTCAAATTAAG C CGATCAT CCCCAAAG CAATATTAGT CCAGT C CAGTTC CT CTCATGA CAAAT GACCTGAACACAG GATTCAGCGA GT CTTACT CGCTGTGGTCATTTTGGAGGTAGGGAGATCATTAAC C CACATACTTTGATAAAATACGAAAC GC CTT TCCTAGCTGGTTCTTTTCCTCT CCTATAGGC CGACCCC CCTCCATC C CTTTATTAGTATTGAACTGAACAAATTA GT TAATATGGGAA CAT TTGGATGTAT CAACTT TG CTCTGAAACAAAAATGTATT CAT C CC CAAACT CTATGCTGG GGGCTGAGGGAGAGTCCTTACGACTGACATAGTCCCCAACAAGGTTCTCATTAGTCCCAGTTTTTTTTTGGACAC CTTTTC CT C CGCATTCACAT CCTTCTGCCCTCTC CAGCACTGTG CAGGAAGAAC CAAC CAACCTGACCTTGGTGA AAGGGCTGTGGGTTTTTCAACCCTTCATTCCTGCTACTTTTTTTTTTTAATTTTTTAGTATTCTTCTGTGCATCC TTTCTTCTCTCCTAGG CTTCTCACCCAG CCTC T CTGTTGAGTACTACTTGTAGG CAGAGCTCCC T CATTCACAAC AT CCCAATGTATTTGT CCTGTATCAAAG CAAACCTTATACAGTAGCTCTTTTAAGAACTG CTAAAG G AATGAAGG AACAAAACTGAGCCAAATAAAGCAAAATGAAGAGGATATGTATGCA3GCAATTGTGGGAACATATCACCAGAATC CT CTCTTG TTAT TGTA CCCAAATCCACACATT CTCCTCAG TGTTGT CTGGGAAATACATCAATC CACTGTGAATG GT TTAGATAGAATAGAATCCAT CAATGT CACCAG CTTGATTCCT C CTCAGAT GATGAG CCACTTTCTCCACT CCA CCTCCAGGGCTGGCCCCAGTCTTCCTAAATGTTCATTAACTCCTATTAGGTATTCCGAAACTCTCAGGTTTTCT{ N)xCCACGTGCTAACCTAATAGGTGTTCAGTAAATAAATATAATTTTCTATTTTTCCCTGCAAAATTCATTTCCC AAACACAATCTTTATTCTTTTTTTTAACATATAATAGCCATACTATGGCTTACCAACTCCCCAGTAAGCCAGGAC TCTCTGTATACG CAGT GTTTTG CCACAT CATT TACATGGATGAAATTGTTTC CTATAC TC CCAAAT TTACCCTGC TTGGTGACCAAGAATCACTTCTGTCCCTATTACTAGTGATATCATGTCTGCATTTAGAAACAGACAAGTGTTATA CAAAATTACATAATATGATAGGAATG CTGTCAAT GGATATCTAT TATTATATTTACCAGT CAAAAT TTTAGTTGT TATAACTGGTATCACCAATTAAAAAATTTTATAAGAAAGGGGTTTTTGTACCATTGGATCTATTGAATCATTAAA AAAAAAAAGAGTTTAGAAAAAGAAGAAGCTATTTATATACTATTTCATTCCATAAGACTTTATCCAGTTCCACTG TTAATAAAATGTA CATGCGCAGAATTGTATTG TTTTCTC(N)xAAGGACACAGCTCCCTGGTATGGGAAACAGAT AC TTTATGTCTT CAGTTGTAAGGTC CAC TGTTGT TCCTATTTAATATCTTTGAATGTG GAATATAT CTTACAATG CTGTGGGCTAGGGCATGGTTGTGACATTGATAAAAAGGGCTCTTA(N)xTATAATAAACAAGTAAGCAAAACATA AGATCATAA CAC GT CCTGAAAGGACCT CTCAG CAAGAGTGACA CATTG CCAAGTAATG CACATTGT TCCACATGG AGTTGGTGAG GATAACAG CACTT CCTTC CAGA ( N) xCTTCTCGGCTACATGAAATGGTACAAAATACGAATTAAA TGATTATCCATGGTCTTGTGTAATTCTGGCATGTTTCCAGCTAAGACTTCTCAGCTGTTCATTCAACTCAAAGGC ATCCTTCAGTCCTCTTCCCC
>Hsl5_58818514-58831712
GTGTTTTGCAGTCAGACTGATTTCCTCATTAGCAGAGAGTGTTAACCGGTAGCCTAGTTTGGCTCCTCGAGGTGC CTGATTCAGACAGA3CATCCCACAGCTGGA3CACCCTAGCTGGTGGAGGTAGGGACTCAGCCAAGTAGGCTCCAG AG CTGCCACCTAAG CC CAGTAGATTT CT CTTG CC CAGAACAACCTGGC CAAATGGAAGATGGGAGG CTGAAATC C TCACT CCAAGGAGGGG CCTCCTTTTCTTGGCA CAAAGGCACC GCGC CAAG CTACACAG CTAAGG CAGTGTGCTAC CTAAAGCAGGTG CACAGAATTCACAGAGAGGTAC CAAATAGG CTGG CAAAGACCTTGGTGGATAAGTACCCACflA ATATCTGGAAGGAACTATGTTGGGTTGAGACATGAGAAGGAAGAGATCTTAGGCAAGTTTGGAAAGGGAAACAAC A C ( N) xCCATTGCTCCCTATGGCAACCTGGAAACTGTCGGGTACCTGCCCGTGGTCTGAAGTGGGCAGGCAGCAG TCAGCTGGGCCTCCAGATATGGGTGGTGAAGGGAGGATACAGGTCCAAAAGAATATCTTAGGGGAAATAGTGCAC ATGCCCAGAAAACTTTTATAGTCTCA( N) xGAGTCACCAAGCGCCTCAGGAAAGCTGAAGGGCACAGGGTGCATG CTGCAGCCATGACCTCAACCTCTGTGAGTCTAGAGGTGACCATATTGGCCACACATAAGCTCACCTTCCCCCTAC ATTTCATAATCCTTCCCTCCCATCAGGGCCAGCCTTTGATTAAAATTCCCAAGATTATCTGAGGAGCTGGCGGTA CCACCCACCAATGGAGTTCATCTGGCTGCCTTTACACCAGCCACCTTATGGCTGGGTGG(N)xGGGATCAGGGTG GGCCTTCAGTCTCTAAAAGCATACCCAAGAGCCAGCCCTCTCAGCCGCAGGTCCGCACATGGGGAAGCCCTAGGG AGGTTCAAGGACTAGATCAGGCCCATTTTCTCATCAGGCAGCCCCCAAACCTCTGCCCCAAAGAGCACATCCAAC ATGCTTTCAG CC CACACT CCCCACCCACGGTTTACT{ N) xATAACCTTGGGCTGTTTGGCCAAGTTAATGCTTAA TGCTTTCCTCATGCTGCAAGCGTTTATCTTTGCCCAGATTCCAGCTCCAGGGAGGGGAACATTCCTGCAGAACAG CCAGGCGTTC CCAGGT CACCTCTGCT CT CACT CT CACTTTTCTTTTTG CATGATTAAATC CGGC CTAAAAACACG TCTCAGGGGCACATATCTGTCA( N) xGAGCCCAAGCTCCTflACCGCTTCTGCTGTGCAGTAACAAGCCACGCTCA GGGCATGGGCCCAACTTGAGTTTCTTCCATCCACGGGTCCAGAC TGGAAAGGAAGCAG CAAT CTTCTCTTCTTTT CT CACTGAGTATAAGG CACCCAATTCAT CTTCTGGCAGCTGACCTGGAGGACG GAGTGGC CCTGATTTTCCTTTT ACAGGTACAATCACTG CCAATACTAAAG CAGT GCCTGTCCCTACCTGC CAGGGCACCCAGGATC CTTGTCTCCAG CAGTAACTTCATGA GG TAGGTGGCTCTGGG CTGAGACAG GACTTGCATATTCCTAGAT CC CTTGACTGTGATGT C GAAGGTCCTCTGCCCCCACCCCTCCTTCTATTCCTTGTAAGCTTCACTTGGCAATTGTGTAGTTTATGTCTGGTA CAGGAATGAAGGATAAGGTCAAATGAAGCTCAAAGCCAGGAGCTAGATAGTCATGATATATCAAATAAAGTTTTG GAACAAGAACACAATGGGATTAATTTGCCAATCAAAACAGCGCTTGGCAACTTGACTGTAAGCCAAGATCTCTCA TGGCAAGAGCAG CTGAAGGGCCTGTGAT TGTC CAGCTCCTGGGGAG CCTCCCTGGCCTAACCGTGGTCAGATGCC AG CATCACAG CC CC CTTAGCTGGGTAAG GATT CT C CTAGTGCTACCTCTGGGAGAGAAAGAAAAAAATCTAGTCT GAGAATGACACCAGCAGACAGCAGCTCTGCCAGGTGTTTGAAGGGTGAAGGTCATGGGACTCTGTTTTTACTACC CTGGTCACCCCAATTGTTTCGGGGTATTTTCCATAGACAAGAGGATGCCCCAGTCAGCTGTCTTCCGTGATTCAA GACCCAGTCTACTGTCTTCTGTGATTCAAGACCTGTTTCCTCTTGTGCCGATCAGAGGCCTTATTTGGAGGCATC GG CCTTTATGTGGCATGGAATGTAAG CC CT CTGAGCTTTAA CAGAGATAGGAAGAGTC CTAGAACATGGCAGGGA AGAGGGTCGGAACCTCTGTCCTCCA(N) xGGTCCTTGGGGAGCAGTGGTTGGAGTGGGAGAAGGGCATCGCCCAG CTATTTTCCAACACATCGTAAAAGATATC(N)xACTGGAAGTGTAGATGGGTGGTGTTACCTATATCCTTGATTG ATAAGAGAAGGAAAATGAATTTTTCCTCAATAAGTACAGGTTGTTTGAAATGCAGGCTGGGTGCAGCGACGCTGC ATGGTCTCCAGGCGCCTTGTGTTTCTAACCCTCTTGCTGGACAACTTGGCCTCCACAGGGTAGTCCCCAGGGAAC CCTTGGCCAT CT CCGG CCTGAAGGAATCAGGGAAGACCTTCCGGACATGC CAAGTCTC CCTGAGAC CCTGCTCCA TGAGGGCTGG CT CC CT CC CCGACCTGGT CTTTTGGTTCAAAGTCAGTC CTGGAGTTAT GT CTGC CAATTGACCC C
AfiCCTCCAGAAAACTCAGAAACAGCCTGTCCTCTCAGGCCTGCAAACCAGGAGCAGCCAGGCCCAGCCCTGACTT ATGCAAGTAACTAC CACAGAAGGCCT GC TGTG CAGCCTATGC CG CT CAAC CTGGATCAAGGG CAATGTGGAGAAA CCACATTATTGCTTGACTATAGTGCAGTATTCCAACAATCATTTT(N)xTCTGTTCAGTTCTCTCACCCTACCTT GAACCTACACAGGGTGGAATGTGCCTATGACCTCGGAAGGACTTTAGGGAAGCAAGCCCCACTTTCCCCAGAGAG GATTCCAGTACAAACGGAAAGCTCTT CTGAATGG CATACTCTTT CATAAC CAGATTTCTTAAAAGAAAAAAATCA TT CTGATTCT AG CCTG AAGTGCAGTA GAATTC AAGAGAATTT AAGAGAAAAGTTTAGAAGGAAAAAATTTTTGG C AAGAAAAAAGACTCTA CTTACTTTTTTC CCTGGGATCCACAAAAGAAGAAGTTGTGGC CT CAGG CTACAGTAGTA TGGAGCCAAAATGGACTCAAAAAAAAAAAAAAAAAACCCCAGAATAAATCAGACATGAAAATAACTTTCCTTCCA TCCCGTATCTCATTTTCTTATTTGACAAAAAAATAGTAAAGAGCAACTACAACAACAAAGAAGAGGTGAAACAAG AATGCAAAAAATGATGGGAAGATGGCAACCTCCAAAATGTAATAAATTAATTTCAGCAGCTTTAAACCTAAAGCT ATTTGTAAAGAATAGCTAGAAAAGAAACCCACTCAAGACAAACTTAGGAATTTCTTCTGAGTTCA (N) xTAATTG AGGTTCGACATTATAGTGAAGTGATGCTAGGGTGTCACAAGAGGTGAAGGACAATACAAAGTTTAGTAACTATTG TG CACATTCAGAAATT GAGACACAGAAT TACTTT CAAGAAAAAAGTAAATGGGAGGGT CAGGTGTC CAGGTCAG G GCAAAATTCAAGATGGTGGGAGTGGGAGAAACATTCCTCACCATGGCCTTGGGCAGAATCTGCTTATCTCCAGAC AAAGCTCAGCAACAGCTAGGAGCTGAATCTGTAAGGTCCAGCAGTGACTAAATTCCCACACAAATGTTTCCATCT TTCCATCGTGCAAGAAATTCTGAAGCCAACATTGCCCAGCTACAGATCATTGGCCTGACCACAGAAGCCAGGGAC AGAGCTGCGGTTACAT TTATTACATAGGATTG TTCCATCCTGAG CAGAGTACATGGCCTG GAGAGTTTTCAGGG C AATTAT GAfiATT TCG GTCTTCCAGAGGACC CTTGCCAGGT CTAGGAATATTTTCAG CAGTGGATGT TTACAAAGC AAAAGC CCCCCTACACACA CATTAGCAAATTAAATAAT GCATGT TGTATGAAAAAC CT TGTGAGGGG CATAGCAA TACC CGAGGCTTCTTT CTTATGCACTTGTGTG CCCAGTACCAGC CAATACATGGAAAGGG CT CAA CAAACACTGA ACTGAGCAACAGCTTCTAGCCCAGGCATCATGGGCGGGAGGCAGATCCTGACTCCAATCGGGCCTCCTATGGTAG AATC CCTGGTTCTCTC CCATGGAAGGTG CCCCACCT CAATTG CCTGCAGTCACC CATCTG CTGATGTC CTTACCC AAAAGAAAATGAAATAATTTCTTCTTGTATGAAATATCAG GAAC C CTCAAACCTAG GAGT TAACTAG GAATAGAA AAAT CACTTG CTGTAA GAAATAATATTAGAAATGTAATGG CTGGACC CAGCTTC TC CTTCTAACTAAT TT TAAGG ATTGTTTTGCTGCT CTAGGAACTTACTGTCATACAGAGGACAAC CCCAAGCCTC CCTAGT CCAT CAAC CC CTTTT CCTGGATACAGGGGCTGAATCCCTTGATGGATTTGATCACTTACCTCAATTTAATATTCCTCCTCCTAAAACCAC AGTATCAT CC CCAGATTCTTATCACTG GTGTGACTACC T CTT CTGAAGACATGAGAAAAG CT CCAGTTTCAGAAG GTATGCTTGGAGAATGTG CTGCTTCTAG CTTATC CAATATG GAACCAATTTCTT CACT CATA GC TAAAGCATATC CTTTCCTCACAGCTTA( N ) xAAAATGCTGAAGATCAATTTGAATATATATGTATATATAAAGAGATATAAATAGG ATCACACGTAAGGAAC <N)xATATGAAACAGTTAAAAGTCTATTGCAACACTATAATTTGAAGGGGAAAAAAAAG GCTAAG GACATACAGTAG GCCACTGGGCAGA CAAAGGAGATCATT CTGAGCCTGAGTC CT CAGTATTTGGTGACC C A G T A (N )x CTT CC CTTAGGATGCCCAAGTGTCCTAAGTGTTCTAGGAATAAGCGACAGACTCTGATCTCATGGA GGCTCAGACCCCTGGGCTTCAGGGCTCTATTCCTTCCACCCCTAAATGTGACCTGAAGTGGCCCCTGCCTCCCTC AGTCCTAGAGGCTGCT GCTCAATTACAGTGAAGAAT CT CT TATAGACTCACGGTGACTTAAACG CTATTTGGTGC TGTGGCTGCTATTT CACTGAAGATTACAGGGC TCTCAGAG TTAACCACATAAAACT TG CTGC TATATATTTGTAA CCTCCTTTTT CAATAACACATAGGGAAACCAT CT TGGGTTTCAGGACCAAAAAAGTGTGAGTGC CTGG CCACCAG
{ N) xATAAGATCACATAGTGTACATTTTTGGGTGTTAATATTGACATTCCTATCTTTATTGTTTTACTGCCCTTA TTTTCTCTTTGTCC CAAGTTCTTCAACTATGAATTTTTG C CAAC TACTTCAAGTTAAAA(N ) xTAGACTTGACTG ATGCTTTATCTTCTCTCGACCTCTTCCCTAAAGCCCCTACGCAGGGCATTCCTCCGCTGCAACACAGCAACCACT TTTAATAGTTGCTTTTCAA(N)xAATAAGTGTTTGTTGAGGAAGTGAATAAATGAATGAGCACATACCTGTATAC AGTATACAGCTTCCTCGCTCATACACATGTTAAAGAGAGCTAAAGCCTCCGGGAAGGGAGAAAAGAGAGGGCCCT GTTCAGTCCAGTTGTTTGTGGGGCTTTCCATCTGACAACTGTTTTCCATGGTCTCTCCTGAACAGCCTCGAAGCC TCAGCCACCTGG GCTGGAGCGTATCTCACAGCTAAATC CAGAGTAAATATACACAAGACAGAATTG GT CCTATGA GGACAACTGGGGTGAATT TCCACAGAAAGAAAGGATTT CACT GATCAGAAATGTAAGATG CTAGATGCTC CAATT GGTACTTATTCTAACAACGCTCTCTTCTCTAGGTT(N ) xTTTTCATATGACCCTCTTAACATTTGAGGCAGCTAT CCTAAAAT GAAGAAAG GTGTAGTGTTTCAGTCT CTGAATAAAAGAAAATGAGCTACATTAGGTC CAATGTATAAA TGGCTG AAGATTGAGTGC C AGCCTTTG ATC CAAG AT CTGG AT CTGGCAGG AC AC CT C ACCGCTC ACT AAC ATTTA TTGAGCAC CCAC CATTGTG CCCAGGG{ N) xACGAAAAGAGAGGAGGAGGTGGGGGATGAGGGATCATCACTCAGG TGGTGGCCTTGG CTGCAATACCATGAAG CTGAAAGT CACTAACAGGTATAGTCATTTCTGTTTTAGACTCTTTGA GCAGCTGTG(N ) xCACAGTGTGCACGGGAACTAAGGCACAGGATTCTAGGGATATTTAGGAAACCTGAAGCTTTG GACTGATCGAGAAG CTGG CATAGCAGTG CAAGGTGACACC CCGG CTTCTGACTTAGACAGGAGGGCAG CCAGTGA AATCAGCAGGCTAGGTTGGAAGCAGGCTTCCAGGAGTGAGGGCAAGCTGAGACTCTCTATAAGCTCAGAAAAGCC AAAGAGAGCGGCAGAAATTTTCTTCATTATCTCAGAAAAGTTATTCAAATTCTTGCTCACACCTCCTTTTAATAA AATATGATGCATTT CATT CTGTCTTCATAGTGAAAAAT CATTGC CCTTTGCTGAGAATTAATAACCTT TTGAGTT AATACC CT TTGGGTAGGAAAATTGGTTT CATTAACACCTC CCCTTGCACTTTGTAGGTTCAT CATCTCTTGGTCC TGCTGTCCCCTGAAACCAAAAGCCCAAGACATATTATTTGAGAAGCCTGAATGATTTTATTCTCAGAAAGAATGA TCAGGGTCCT TATTAAATGGGCTGAATT GAGTTCA CTGAC CAAAAAGGTCAGAG CGGCCTTGTGTCTGCT C CTCA GCCGCAGGGGAGCCCACCTTCTACCAGAAGGAAGGGATGAGAGCTTCCTCTCCCAGCTCTTGGCCTGATCCCTAA TCTTGGCAGAAAAGATGCGCCCATCTTGACATGCCACGCCATGGAAACGCGCCACCTTGCCAGGCTCTACCAGGC CTCTTGGCAGAGGTGTCCGTCTCTGGTGCAGCACCACAGACAGGACAGCTGTCGGGCTGCCTGGCAGAGTTGGAT GGCCACTCACCCACCCCTCCAAAGTGGCTCATGGTCATAGTGAGGCTCCCCCAGAAGAGAGGCTGGAAGGCCCTT CGGT CAAACAACAGAAGGTGATATGTTAAAAACAAT CTTCCTCT CCTTTTCCAC CTCCCACGGCCTTG GAAAGCC TCCGAATG GACATT CCTCTACT(N ) xATGTATGCAAGGTTGCATGCCTGTGAGCCTCGACTCATGTTGCTCCCTC AGCCACCCCT GACC CC CG CAACAAAACACACACACACACT CAAT CATOCTGCAAG G CCAG GGTCAACATCACATT CCCTGAAAAG CC CT CC CTGAATGCCACACC CT CC CT CAAG CGGCATCCACTACT CC CT CTGTGATT CC CC CAACC CTGC CC CACA CT CACC CCTGTACCTTGT CC CT G CAAG G CTTA CCTGATATAATCTGTCTG CACGTGAGGACAAGT TGAG CCTCAGGAGCTAATACATAGTGTCAC CCAG CTAAGATG CT CTATTCACTGTC TCTCTAGGTAGGGATGTAG GGA(N)XTTCCTTATGACCTAGAGGGTGACTCTAGTGCTTGATTGGCTGGTCACTGGGATAACAGGCTAGAGGAT CTGGTGATCT CAAC CT CCTAGGCCACCCTC CATCACTTG GCCATTCCACTGGAAACAGTCTTTGGGG CTC CCCAC AGTTGTGGCTCATT CTGATCT CCAGCAT CT CC CAGTAACCTCTT TGGCTTGTGCTTGTAGAAGCAG CCTT TGAGA AGACGGAGGGCTTCAGAT GAAGCAGATG CCAGGCTAAG CACCGT CCC CAATCTTATATTG CAGAG C CATTTGGAA GAAGAG CT CAAG CTGTTGAAACAAACAAAACG CTGCATGAGATGAAGACCAGATTC CTGCT CTTTGGAGAAACCA ATCAG GG C TGTCAGATT CGAATCAATCATC CGGACACGTTACAG GAGTGCGGCTTCAACT CCTCCCTG CCTCTGG TGATGATAAT CCACGGGTGGTCGGTAGGAAATGCTGACATGC CGTTTTTCTCTC CGATTT CACATT TT CTTTTTT TCTTTCTAGCGTGTTCATGTTCATAAAAAGATAGGGAGCTGG( N ) xCCACACAGCCCATTGGGGCCCACAAGGGA TTGCAGTCCGTG CAG GAAGAAGCCTCCT TC CT GAAT CT GCATGT TACACACAGG GGGACATAG GAAGC CAGAAGA ATGGACTC CCAATGAAATGT( N ) xGGCTTTTGGTATAAAAGAGTGTTTTTTAGGTGTCCCAGAGCCTCTGTGAAA TTCT CC CAGTTACATC CATTGACCATAT CTAAGTGGAAAGGTGG CTGAGAAGTATAGG CTGTTAAAGAAAATGTA GCCTTTTAGTGATGCTAAAAATTCTATTACTATGGAAGAAGGAGAGAAAGGATATTATTGGCAATAGATGAAGTC AAACCATGTGAGAACTTGTCTC CAAGAGGAAATC CTGC CT
> H S 16 _ 15314189 -15328734
CATGAGGACAATCACACCCACCTTACTGGGAGACTGGAAGATTAAAAAAGAAAAATGACATAAACG( N) xTAGCA TGCAGTCAGCAAACTTATCCCCTTCCGTCACAGTGCCATAACTCAGTCTACAGACGAGTAACTCGGGGAGGGGGT TTGTTTGGCTCATCTGTAAAACAGGGCTGCTGGAAGCCGACATTGCATAACCCATATGTACAACAACTGGCACAC AGTGAGTGCTCAGCTGTTAATAAAGGGAAGGAAAAGAAACTGTAAATCTGGCTCTGTTATAGGTCCTGAGGTTAA GCAAATAGAGAGACAGTTATAAAGATAAGTAAGACAAGAGAGCCAAATGTAAATAATAATATCAGGTAATTAATT AGATGTATGAACAAAGTT CAAAT CAGTGTAGCTT CCAGGCTATATCAATTAC CT CG CTTAAGCC CTCTACGCCCT TTACAACACA GGACTGTCACCC CTGC CCAGTTTACGGG CCAG CCAAGG CGGGTT CAGAAACAGCAGAGAATGTCC TTTGGGCTTACGCTGTTTGTGACCCTGCTTCTTCTCGGATCCTAGTCCATCTTCATCTTTTTTTTCCCAAAATGG TTTTCT CATTTCTG CCACTGAAGTTGAACAAATT CAACAT CTGATTTT TC CTTT CC CTGT CTCC CCTCTGCC CAC ACCCTCCCCCAGTACTTCATCTTCCTGAGGTCGGCTCGATCCCAACTTCATGTCTGTCCCATCCGCCCTCTTCAA TTCCACTCTCCTCTCTCCAGTTATGACCCCTGGATGCCTGAAGAAGCGTTCTAACTGTGCTCTGCACTGTTCAGC TCTGCTCCCTCCAATCTGCCCTGCATGAGCCCCTGAGGGATCTCACCCTTTCAGCAGGTCCACCTACAAGTATAA CC CCTCTAGGTGACATAC CTTC CTTCGGGAAGCTTCTC C CAG CAAC CACTGACC CACAAGAATCTCTG CTTT CCT CAAACACAAGTAGACTTCGCAACTCTGCCTACCTGTTAGAATAACCTGAAGAAGCTTCATACCACACTGTTCA(N ) xAAATAAACCTCCCATTCCTATCACTTCCTCATTGTTCAATATGGCTTCATATATTTCTTCATAAATAA (N) xA GTGCCAGTGTGTGAATATATGAGTACCTGTAAGAATCAGCAAAAAT( N) xCTGATCCACAGCAGCTGTCATCTAT AACCCTAACCCTGTGTCACCCCTGCAGATGAGGCTTCAGTAGCTCACCATTACCTCTACAGACATTCTAAATTCA AGTTAAATAACGCAGATGAAGGACTCAGGTGAGT CTGG GACTTT CTGTAT CAGAAT CGTCACAGATGC CTCAAAA AATCAGATTC CCAGACCC CGTTGGAGACATT CAGATCCTTGCAGAC CC CTGC CAGGGAATTAATGñGTTCAGGTT CATTCCACATGCAGATTCTCGGGACTCCCTCCAGGAAATTAAATTGAGATTCCCGGATTTAACCCCCTGGAAGGG GTCCCAAGAATTCACGTGTTGAACAAGTGTCTTTTCTCAGACACGCTGAAGGTAGGGATCTACAGAGTCACATTC ATGCAAGGTCTGTGGCACAAGGCCTGGCTCTGCCTGCATCTCTGGCTTCATCTCTCAACATTCATGCCCCCACCC TGTGCAG GTC CACCAACC CCGG CTACTCCACGTAA CAGAAATGG CCTTTC CCTACT CCTT CTCTTGGCTTCTGCA ACCAGCCTCCCAACCTTCACTCAGGTATCAAAGTGGCAAGCTTTTCTGGGTTCCGTGTCTATTCTGAGCTCCCAA GCCTCCTA( N) xACTGAACTTGAAAAACTCTGCTATAGTCTATTGTCATGTGAGTTCCTCGAAGGCAGGGAATGT GTAACACTCACCTG CATAACCT CAGATACCACATACAGAG CCTGACACAC CGT(N) xTATCCGTACCTTACATCT GG CATT CAGGTGTGTTCATTTCTGCC CTATTTTATTCCAGAAAAGACGTGTAGATACTTAAAAAAACTTACT CAA TGCAACAGGATAAAATCAGTGAAGGTATCGTAGAAGCAGCAGCTATGATGAAGCTATAGGTGGGTTGAGAGCAGT TATTAGGATTATTTTTATAGCCACAAGGTAGAGATAAGCTGAAATTTGACTCTGGGCTTCTGGTTGGCATCCATG CAATAAAAATGTCC CAGTTGCT CAAGGGAAG CACAGCT CT CT CTAAAGTTGAGAGGTGTGGGGCACTGGGCCATA GTGAGAGTGAAGTGGACAACCC{ N) xATCTGGACAACCCTTTCAACAGCACCAATACACCAGGAGTCTTTCACAA CTAGTTGTTATAACTTCCTCTGAGTTCTTTTCCTCTTCCAGAACTCATTCTCTTTCTCTTTCCCATGTAATTTGA
T(N)xGTTTCTTTCTATCTCTTAAGTTTGAGCTTGGCCACATGACTTGTTTTGGTGAATGAAATGATGATAGACA TG ATAC AAGTAGGAGCTTTAGATGTGTTTGTGCAGCTGGATGTG AATT CCTG AATC TCTC CC AT CAC C ATGAGAA GAGCATGCCCTGGCTAGCCCTCTGGTTCAAAGAAGGTAAAAAAATACCTGGAGTCAAGCTCAT (N) xCCCATTAT TG A CAATGGGTAAC CAAAATAT CTCC CAAT AGCAAGTT AAAG AATT ACGC C AAAGCGTGG CT AAAT AAAT AAATG AAAGCAATCTTTTG G GGGGCTAAGATAGAC CAAATTATT C CATGTTACTTAAGATT CCTC CCAAACTAAGTAGTA ACTCCTTGGTGATAGATCATATTCTTCCACCAAGTGACCACCACTGAGCTGGGCACATAAGAATGCTTGATGCCA ATGTGT CAAT CAGTATTTGATT GGGAATATAGCCAACC CATTTCAAGAGAAATG CTGTGAACTAAAGT CAAAATG TTGGCAACACTGAACAAT CAAT CGACTGAATTGG ACAGTCAATAGATT CAGATG CT CT AAATGC CTTC CATT ATA TC CGAT CTGC CAAAGGTC CAAAAACCAATGATGCAGTGGCAAATTT CCTGGT CT CATCTC CCACGCAG GATC CAC TCTCCC CCAGATGGTTGAGTTCTTTCTGGT CATCTTAGAT CC CT CCAGAATGGATTTTTT TTAC CTCT CATG CTT GCGCAAGGCTCAACTTCAATTCGAAGAAAGCCTTCCCATACTTCCTTTTATGAAATTACAAGCCTGGGATCCCTA CCTTGT CTGCTCAGAAATAAAACAGACTCAGCCC CATC TACACTTGAGTTAGAGTTGAGAGTTG CATACTCT CAA CTCCTCATGTGGCTGAGTTGCTAAAGGTGT( N)xCTCTTATCACTTGATTTTGAATTCAGAGAGTAGACCCATCA CAGCAGAGCTACAGTGGGGTGGAAAGGGGAGGTAACGGAGGTAACAGGACCCACTA{ N) xTAGCAGCTGTGCTAA GCACTTTAAATACACTCTCTCATTTAATCCTCACCACCACCTACTTTGTCACCATTGCTTCTATAACAGGACATC AAGTAGGAATTCTCCAGTTGTTCTGGATCAATGACTACAGCAGACAGTTGGTACCTAGCTCAGGGTGAGAGATCC CCAAGAGAAATTGCTCCAAAAGAGGGTGATCA( N) xTGTCATGGTCACCAGCCAATTATATGGTCACCTGCCATG GAGCCATGGGGGCTGATTGCAAGAAGGAGGGGCTAGCATCTCTATCCCTGAGCTGCAGAGCCATGAGCCCCTCTA GTGCGATCATCTGAGTAAGATGGAGTGAGGACACTCTTGGAAAAGCTGACTCTAAGCAAAGCTTAATGTAGGAAA AGGACAGGGGCAGGCATCTCCCTCTCATTCCCTGTGCCCTAAGAGTGCGTCACAGATGCCTGTGATTTCATGGTA GAAGAGGAAG CCAG AGAT ACAG ATGAGAGG CTAT C CAAGG AG CC ACTAAATG AGGGGCTTGGGG CCAC AG ATGAT TGAAAGGGGC CAAAGGGAGGCAAGGCTGGC CTTC CCCTAAGGGT CT CAGTGACAGATGGGGCACATAT GACATTG AAACCAAGCACACGATCTACACCAGAACCATCTCATTTGCAAAGGTTTTGAGCAGCAAAACCAGTGTTACACAGA ACAAGATCAGAAATGCCACCCATAACTACGACCTTGCCAAGTTAAGTAGAAATAAGCCTAAACTCCCCGACCCAC TTTCCTCCCCATCTCCACACAC{ N) xTTTAATAGCTGAAAGTGGCCAGAAAGTTAAGGGACCTAGCACCCACTGG GGTAGAAAGACTGAACATACAACTGATAATAATATTTAAGAAAAAAAACAAAACTAGTTCATGCTTGTACCCC (N ) xTGTCGTTTAATAGACACTTGGTGCTGAGTGCTTTACATATTTTCCTTTTAGGGAAAACTCTGAGACACATGGG CTCAGGGGAAATATTTGTTGACTTGGATGGAAAAGGCCCATCCTCAGCTTTTGTTTTGCCCTGTCCTGCTGTCTG TCTGTGTTAAGT TCAATGCCTGAG GCTT TG CGAACTTATATTCC CTGTAAGG GCAGAAATTCAG CTCATGTAAAC CTCACAGCAATCCTTATGCCCAT(N)xTGCCTTATTGAGCTGTCTCCTCAATTTCATCTTCACCAGCCCCAATTC TCAGATGTCCCCTGGGCCTCCTCAATACCATGTTGCAGGCCATAGCCCCAGGCTGCTAAATGGCACGCAGCCTCT GTGGGAGAAGACTCTAATTGTGCCTGACCACAAGGTCTGCAGGGAGTTAATAAATTATTCCAAGAGAATATTTGC A C TGAAG G CCTTATTGACTGGAGT CCATTT CCTGAGTT G G CATAATTTTTTTTCTC CAGCAATTGTTATAAAGCA AGTGCAAGTCTTGAAGCCTCAACTGTAGAATCTCTTGATTATCACCAAGTATATGGGCTTCCAAGCCTACCTACC TT CAAAGT CAAGGT CATCACAAAGGGATAATC CCTCAGGCAAAAAGAAAGAT GAGT TTTAGGGGTCCC CTGGTTT CCATAGAAAAGGCACTCCA(N)xAGAAAAGGCACTCCAGTTCCATTCTATTCTTTATGGAACTATTCATAATTTA CAGATTAATGTCACTTTTCTCCCTCCCACCCTCATCTTTCTCCCTAAGAAAACCACGACTG(N ) xCCTCTCTAAG GCTGAGAAG(N)xTAAAATTATTCCAAAACAACGTGGTGATTTCTAAAAATATATGAATTTCATACACGAGTATT AT GACATGAACAAAAAGCAATTAAAGGT C A T C ( N ) xCACTCCTCCAACCAGGGCAACTAACCACTCTGTACCTAC TCTGAGGATGGCTTTTTTATTTAG TTAT TAGT TTGGGACATTGT TAATGTTCTGTAAAGACAAA TATAAAACAAA ATTTGAAAACAAAAACAGAAAGAG CCACAATATTGGGCATGAATAATCCAAATCTGAG CACTGT CACCACGTGCT CCTCTCAAATTCAGGGTTTTAA{ N ) xTCAAGTTCAGGTTTGAGCAGCTCTCACCAAAATCCCACTGCAGTAAAAT TCGCAGTGGGCTTCAAATTCAGGTCCATCTGAGTCCAGATCCCAAAAGCTTTCTGACAGATCTTCTTGAGGGTGA CAAAACAGAAATAAAAGATCAGAATGG GAACT TG CAGTTC CTCTTCTGTC CTTTATAGAAGTGT CAACGGGGATG C AAAGATT TCATTATCAACGGGAC TGGCACTGTGCCCTTC TATC AGGCGT CTGCTC TC ACAGGG AAAT C AACTTG GCACAGCCTCAAGCATTGAACCTAACACTGAGAGGCAGCCGGGCAGAGCAGAAAGAAAGCTGAAGACAGGCCCTT TCCATCCAAGTCAACAAATATTTCCTTTGAGTACGTGTACCTCAGGGTTTTTCCCAAAAAGAAAAATCCCTCCCC ATATAAACCCCCTAGAAAGAAGTCATTGCTGCCTCAAATGCATGTATTCAGCTGCAAGGCTTCTATTGCTACATG CTTTCTGATTATAGCCAC CATTTCTAAG CC CTGCAGGT GTTTG(N)xCATCTCCAGAG CCTACTATTTTTTTTCG GTGTAAACAGTA( N ) xTTCCAAAATCAAGTACAAAATCAAAAGGCAGATTTTCAAACAAAATCTTTGAAAAGTGG T T C T { N) xATATATAAATAAAAATAAAAGTGGTTCTGTCCTGCTTACACCCTCTTCTCACGCTTTTCAGGCACAA AGAAGGGC CAAGAG CAAGAGAGGATTTCTT TGGAGAGAGT CTGTTGAAGACACGCCTTCATGTTA CAAGATCTTG AAA CTCATGGGCAC CATG GT CTGG GATC TGAATC CTGCAG CAAAGCTCTT CCAGGCTTCTTG CTATGCTG CCTGG GGTAACTG CACAAGGCCC CATAATTAGACGAC CC CACTGT GATG CTGGCATTTTG(N ) xCCCAAGATGCTTATTT T A T T T C A (N ) xGATGGTTAATGGGTACAAGAAAAATAGTTACAAAGAATGAGTAACACTTACTACTTGGTCAGGT GTGGTGG CTCACAC CTGTAAGCCCAG CT CTTTAGGAG G CCA(N)xCTAAGTCCACTTAGACAATCAGAGGTGCTT GTTTTAAACTCAGGTTCCTGGACCCCACTGTTTTTAACAGTTCAGCTTGATATCCAAGCCTGTAAGAGACACTTT GATTCT CTGGATTAAAATGAGAGACA GTGA GAGGAGAGGAAGAAGAATAGAG GAG GAGAGATTTCTCT CTAAAAA TGTAAGG G ATTC CAATTAAAAATC AC CT CT CATTGT AC ATGC AG AAAATGTACAGTGA GACACAAAGTTGG GTTG GAAGACAAAACTCTGAAACTAGTCTGGGAGGGTTAAGTGACTTCATGGAAATAAATACTAAAATATCAAAATGCT CTAAACTTTCCATTTTTACTTCTAAATGTGCTGAGAAAAAAATCTGATTATTTGACAGGAATTTTAAAAAGAACA ATTTTTATAAAACTAAAATCACATTC CTGAACAC TCAAATGCTTAG CATAGGGGATGAAATTGACCTGGAflTTTT CTTTGTGAGAGGAAGGACCAAGTTCCCAATGTGGATACATTACCAGTTAGATTTTCCAAGCACAGTTCATACTGC AGAATT CACCTAAC TTGATT TAGGTTTC TC CTTTATAATCAGTTTC CACT TGAACT TCAACTAC CACGAAAAAGT AAGT AAGT AAAG CAAAGACGTAAATT CT TGGG AACTTTGT T ATG AG AGGAA CTTTC TAG ACTGC ACTT AGTTGT C TTATTTAATTAGCTACCAGGGAAACAAAACATATAGTATGTCCAAGGAGTTGACTCAATGGAATGCTGT
> H s l 6 _ 1526742 1 - 152844 86
TGCGAAAACTCAGGAATCGTTGAAAGGGACTTGTTATGAACAACAAAAGATGCTCCCCTAAACCCATC( N ) xATA CTTCTT C CTTCAT CGCAGAGGTTA( N ) xAATTTATAATTCATTCAACACCGTTCTCAAGGCTAGGAATATGGCAG TAAACAACAGATTGGCCCA(N) xGGCCAGTGGTGCCTCTGGCATCCCCAAAAGCAAGCCTCCAATACCACAAAGG TACACAG GGCCACTTTCC CTGCAC CT CT CGGCTC CTGAT C TACC CAACTC CAGACC CCAAGGTT CACCTCTAGCT CTGAAAAG GAGCACAGCT TCTCCAGC CACATCTG CCTCATTTTG CCTCAACTGCAATTGGAATCGTTCTCTGATT GGCTCCCCTGGGTACGTCCCCTAGCCTGACCTTCTCAATCCCCCACTTCTGTGCCCTACCATCCCCTTAAGTGAC ACTGTCAGCCAGGTTTGCCTGGCTCTCAAGCTGCAAATCTCTAAGACAGAAGTCAATGGGTAATGAGAATAAGAA AATGTG AAGGACTAGAACTC CAGATCTTGATG CT ACTGGT ATTAGTTAGGTC AG AG AAG ATAAT AATTTG ACAT C CTAGGAAC C CAAATTTCCAT CCCCGT CTAACT CCACTATC CAATGCTAAATT CAATGG CTTATACCAATG CTATT TGATTCTTAATCAGTGTCTG CTCCAT CTGGTC CCTTTACAGTAC( N ) xCTTTTAAGTAACACAGCTCACCCTGCA TAAACATTA(N)xTTGAGCCACCGTGCCTGGCCAGAAGTAACATTTTAGATGTACAAATAGCAATAGAGATGGAC CTTACAGC TTTAAT GAGAGTGAGGAGGGGGAGTTAAAAAAAAAAAAA CAAAAACAG CAT(N )xCATG CATCAATC TT TTTCAAAGTGTACACAT(N)xCTACAAATAAAACTTAGGATGGAGGACTTTCAAAAGATATGGATTCA(N)xA GC CGAATC C A T C ( N)xACTTAAAGTATAAAATAAATGAGATATTGATTCAGAGTCATATTCCACAATTAATAACT ATATTAGGAATAAAC(N)xTGAACCCAGAGACAAATCAGTGAGAGA(N)xCCTGGGTCCTTGCAGAGACCCATGT GGATGTGCCACGAGCCTGTGATCC( N > xCTAGGATCTGCTTCTGGTTTTCAATTCTTCTATTATTTAAGTCATGC AACAGT GTTAAC CAG GCAGG CTCACTAG CAT(N)xTGGACACATGGATGGATAGAAGCAAAATAAAATCAATACT GTATGAGC CAAACAAAACAATGGC TTGAGCATTCTGTTT CTAAT GATGCT CT CAGGAATGCAGC CACAAGGAAAC TCTGCCCTCACCTCCACCCCATGAGGCAGAAACTGGCCACCAGTGACAGGGAAACAAAGTGGATTCCTCGAATGT CATTGTTATTAAAACAAAACTAGTCCCAGAAAACTGACCTGAATGTATTTTTTCTCTCACTTTTGGCCTGACTGT CTAATAACCAAAAGTCACCAGAAAAAAAAAATTTTTTCTCAGCAAATGCTTAAAAACAGAGAAGACCCCCAAAGT A CACAG GGGCAGAGAACAACTCACAAAT CTCAAGAATAAAATAAAGA CGGAAGATTCT C ACC CT AC TG C AAT GAC TCTCCCAGCCCACTGCCACCAAAGTTTCTGAGATTAAAAAGAGAAATTATTTAATATATGCAGTTTCAATTTTTA AAGTTT CAGG CATGAAAGAG CTTTTGCT CCCTAATGATGT CC CT CAGATGTAGCAAGACCTG GT CT CT CTTT CAC TGAAGAATTGCCTTCATTTTCACCACAGACTTCCCTTGCCTCAAAGACAGATGCTCAGTCAGAGAAAGGAGCCTG AGGATC CATAGAAACT CT CT GATATTAATTGATCTG CTCCTAGTTTT CAGTTATTCTATTTTTTAAAACATG CGA CAGTGTTAACTAGG CAGG CT CACT AGCATGATGGATG GATGAGTGAGTGGGTGG GTAGGTT(N)xTGGGTGGATG GATGGATGGATGGAAAAATGGATGGATGTATAGATGGAGATAGATATAGACAGAACTAGATACAGACAAATAGAG ACATCGTTAAAGAGAGAGAGGTAATTATTTATTTTTGCAAGAAGTATTTATTATCTACTTTAGGTCAGGCACTGC TC CAGCTTATAACTGT TTGTTATAACAGTAA C CAAAACAAACAAACATCCA CTACC( N) xTGTTCAGGTTCAATA TTACTT TTTAAATAGACAGACT TAACTACCTGAGAAAAAAACAG GCATAT CCCAAAAATACTAG GACAAGGGAAA AT TATAAAAGTTGATTTAACAC CAACAGGTATGTAAAGGAGATGAT GACATAATGCTTAGTT CC TG GCAAGATTT AGATTGGAGAAAAATAG CATGACATGGG CACA CTTGGAAGAAAC CAATGGGCTATCAC CAAGAAGGAC CTGCTCG AGCTCAAGATAACCAGGAAACATAACCACCCCTTATGTTATAATAATGTGGTAACACAAAAAGGTGGGATGGCAT GT CACATATT GACT CGGAAGTAGG CAAAAACGTGAGACCAACTTGGGCTTAGATACACATA C CTAG GAATATACT AGGTGAAGTCAAAGACAGCAGGAGATTGATATTCTACATAAATATGGAAATGCACTTTATGAGTTCTTGAGAATC TCTGGGGATTTTTCATAGGTCTCACCCATCACTGGTCATGTAAATATGTCATTTTTAGCAAAATCACATTCTCTT CACTTCTATCTGTT CT GTATTG CTGCAAGGACTGCTTTTTGAAC TAGGGTATGACAATGATC CGTCTTGATAGTG TTTGTGAGTCAATT CAGAATTACAGTGATTACTAAAGTTGGT CAC(N)xGCTAGACTCCTTGAGGGTGAGAAACT ATGTGAAGCAAGGTCTGGCCAATAGACAGCCCCAAATGCCAGAATTGGGAGTGAAGTCATCATAGACAATCTGTC TCTAAT GGAGTCCC CAGATGTGTG CAGAATAGGAATAGCC CCAGGAAAGACCAACAGAA(N ) xTTTTGGATAATT TGTTATGCGG GAATAAATAACTAATACAGATGGTAG CTA CAAGT CT CTT C CTATG CCTATGGTTGT CAAGTAGAC AGGTGAAGCAATTAGTAGTGGAAACTATCCATTCATTAAACAAGTA( N) xACACACAGCCTAGTGGCTCACAATA AG CAAC CTTTACAG CCACTG CTTGATGAAATATCTC CAGACAAAAGTATT CCTCCACCATAG CATAAG GTGATGG TCTGCC CTTGTGTATC CATC CTTGTTTAAGAAACAAACTAATGT CC CTTAACTAAAGATTCT CCTT CCAGTGAGG A CAAATGCAACTCCTCGTGAC CACTGTTTCTC CTCTGGTG CC CTTC CAATAACT CAGAAAAT CTAGAT TATTGGT TGTTTATGACTTCACTGGATAGCACCTCTATTGTCATCAGGCCTTCAGGAGGTTGGGTCTTCTGAGAAATGCCTG CCCCTCACCCAGCCCTAG CAGG CCAATC CATT CCTCTTTGTGTG GTGGAGTCACGCCAGTGAAGTG GATGAG GAG TAGGATGGGAGTCAGAAAACTTAGGTTT CATTTGTG CATT CT CAGA CTTGTGGGAAGATACTTCAC CAGCCC CAG CCTTAGAAATGAGAGTAACAGAGCTGTACAATCTCTGAGGTGTTATGTATTCTTAAGGATA(N)xTTTAACTCAG GC CCAT CCAGTCTTAAAAACATGGACTGAAC CAGTGTACATC CAACTGACAGATGAGTTTAGAAGCTATCTAGTC CAGACAAGCTTGAACT CAGCTAGT CCAG CCTTTCATGCAC CT CACCTATT TAGTGACA GGGAAG TCATTACTTCT CCAAGCAACTTACATTACGTTAGGCTGAAAATTGCCTGCCTGAAATTCCTGCTCACTAATCCCTCTTGGCCCTCT GTTAACATCAATAATT CCAC CGATAATTA CAAAGCACCTA CTAGGCACCAGACATTAT CCTAAG TG CTTGGGATA CACGTATGAAATTG CT CATT CCAACTTGGTTTGGCC CAGTTTTCAAGGGTTT( K ) xACTTCCCTAGCCTTTCCTC AGAGCCCAGCTGCTGTCATTTTTGGGGGGTCCTCTACCAAGTCTCTTGAATTCCACATTCCCTAGCCTTTCCTCA GAGTCCAACTGCTGTCATTTTGGGGGGATCCTCTACTAAGTCTCTTGAATCCTACATTCCCCAACCTTTCCTTAA AGCCCAACTGCTGTCATTTTGGTGGGGGGTGGGGGGGTCCTCCACCAAGTCTCTTCTCTATTTCCTTCTAATTTT TCGCAATGGT CAGTACAC CCAAGACCCT CTTTCCCT GAGC TGG GAAAAGACCTC(N) xTCTATTAAATAATAGTA ATGTCAACTATTAAACCTGCACCTTGGCATATGATATGCTACATTCATT(N)xAGTCCCCAAACCAACACGCCTT TC(N)xGAGCAGGGTACCTAGAATCCAAGTTCAGGCAAGTGC(N)xTACCTTCTCTTCCACAACCCTGAAATGCA TGCTGGACAGTGACCATCCTGACTGGGACACTAGCCTGGCCCTCTGGGCTTGCATAGCAATTGATGCCTGGCTTT CCCTGTGCAGGTGGTGCTGATTAATGGTGCTGTTAAGAAGAGATCCAGCTGCTGCTTTCAGTTTCCTCTTCCTGC AGAGATCCTTAATTACTGGGCTTCACCTGTTGAGTTTCCTTAAATGACATGAGTGTGTATTTTTAAGTGGGAAAA CAATTCATTAGGAAGTGCCAACATCTCAGAGGAGGTAAATCCTCCAGTTAATTTCAGAACCACACTGCTTCAGCC ACATAAGAGAGCGGTCCAAGGGGGTACTGCTTTTTTCTGAACGTCACCAAGCCAAACAACCAACCTTCAGCAGAT GG CAGGGAGCGGCCTT GAGAAACCACAG GGAG GGGT CATGAGAT CAAACCA CCCTTAG CCAT CAAC CTTTAG CAT CCTCCCTGGTTGCCCCTTTAGCTCAGCACCCTGGGAATGATGGGTCCACACAGCCATTTAGGTACTGCTTCCACA CGGCTGAAATTGCATTTG CATT TTATTAAATAAAAG CAAT CT GTGCAAAAGCCTAGC CTAAAGTTACTAACAGAT AT TTGAG TCTAAGATAATACAT CTTACAGTAT TAGAATT CAGGT TCTAAGACAATCAC CCT CAACCATTGT CAAT CTTTACATTAGGGTAGAT( N ) xGCTAGTTGTTTTATACATAGAGAGAGAGATTACAAAATACTTTCTAAAGGATA GATCCTGGGTCTTATCACTGTATTGCTCATACCAAGCAAAATGCCTGGCACAGAGTGTGCCACCGATACATGTTT TGTGCATAAAAATAAATAAT GT CACTTGA(N) xTGTTAATATGTCCCTTGATTCTTGTTTGTACCTTTTTCCAAT TATTTAGCCTTACATA TATAT C CATGTATATA TAAATTTTACATAAATG CATAAAAAGTGAG CA TT CCTT(Es)xC GGCAAACTTTTTTTGCTACCAATTTTGGCTCATTTGCCCTGCCCCTCTATTCTCACCCCCATTGCCATCTCCCCA CCTGCTCCAACACCTGCCCCACCAGAAAACCTGTACTGACAGTCCAGTCAGTGGTCTTCTATATAGTGTCCATCC T CATTTAATT CTAGACAG CCATACAGACACAC GAGTGTGG GCAATCATTGTTTTATAGAAATGGGATCATAT TCT ACCAACCGTTCCCATGTATTCAACAATTCCTTGTTGTAAGCCCTCCAAGTT(N) xGCCCCCAATACCCCTGATAT CACTCACCTTCACTGTCCTTATTGGGCACGTACCAGTCCATAGGTGGAAGCCCCATTGCTTAGTCAGCCACTCCC CCATGGATGAGCAC TCAC TTTG CTTCCCATTT CTGC CCGT CAG AAATAA C TCAG CCA CAAACAGGAGGTCAT CTG ATGCCTGCAACCACCCTATACAGCACAAATATCCATGTGAATCTTTCTTCCCATCCGCAGACACATAGCCAAATG CCTAG CAGGC CTTACCTGAC CC CACTAAAACAGTCTTTCTGG CCAG G GAAAATTATAAAGAATTATATGAGATCA CACATACAAAAAAAAAAAAAC CTT CACAGTGTGCAGAAG G CTTTGTGTCAAAGT CGAG CTCG CACGTGGCCGTGG CCGGGTTCTCTGTGGCCAGCCCCATTCACAACACAGCTCTGGCAGAGCACTCTTCAATAGCATCATTAATAGCCC AT TGATC CAATTACATTTCTTT CTGTTCCACTT CAGAATTA CAATC TCCTGTTC CCCCTGAAAGAG C C CAGATAG ATTC CCAGATGAGTAT CGGAAGAG GTAGCTGC CAT CAGGACTTG CTTGATGGACC CAAGCACTCAT CTAATAGTG GCACTGGAAGAATTAATCTG GCTGATATAT CAGACCTCT CTC CT CCGTGGGAACTTGC CATTTT CT CCAG C CTCT GAGTTTTT CT AGTG AG ATT CAGAGATACAG CC AAATGTGAGTTAAC CC AGG G AC ATCCTAACACTG C C CT CATTG TGTGTTCC CAGCATGGGGTAGG CACT CCATAAATAATCGTTGAATTGAAAAATG CCTAAG GAGGGGTATATTGGG GCTCAAAGCAGACCAGGTCGGCTGAAGAAATGGGCTTTATCCAGATTGTGACTTTGCAGGTTGGACCAGAATGAG AGTT TGAATCTGAT TCCATAGGTAAT GCATAAAATGAAAATATATACTATAAAAAGGCAAAGTCAAGACC TTGAA TGATAGCTGAGAAAAAAGTAGAGAGAGCTAAATACATATTCCAAGTCTAATGGGCAGGATAATGAGGGACAGAGG TAGG CAGCAATAGGAC CAAAATAG C (N ) xAGGATGAGGAAGAAT TAGTAATGTTGAATAC TGGCAG GTGAACAAT TCATACTGTGAGAGGCTGTGGCCCCAGTACCCTTGCATCTGCTC TAGACTTTGACTTCTACAGT GAGACT CCTGG CCAATGAG GATCTATG TCCACAGT CC CTGGGGGTCATCAGCC CTG CA(N ) xGATTATGTTTACCAGGTTCCAGGT AAGTGGAAGGAAATTAAAAGACAGCCCAGCTGGAAACAGTGGGCAGGCATTACAGGCAGGCCCAGCTTTCTAACA GCTAT C AT AAAACTGG CTGCTCGCTGGAGCTGTG AATTTCCC AT CG CTGGTT AG AG AAAT TCTT TAAAAGGG AG A TTCTTTTAAAAATGGCTGTCCTAGATGATGTCTAAGCTCATCTAATTTTCACTAGGTTACTACAATGGAGAATGG TT GATGAGTAAGCAAAGACAGCAGAAACC CAC TACACAGTGAAATAGGAATAGATGTGTGTTAT TGTCAATGTGA TTGC TCTTGGTTGAATGATGAGAT CAGGGG( N}xTTTAATCAATTAAACTAAAAGATCCATTCTCAGAGAAATAA TGACAGTATGTCATGAGCAAGGATTATAACTAACCTAATTCTAAACATCCGAGGAT< N) xTGAGAATCATTTTTA AAGTTTGATTTAAACAATCGCCCATCCCCAGATCTGATTTGGGGCATCAATGGGGCCAAATGACATGGGTTTCCT AGAGAGCAAATCTCCGGATGTCATACAGATACTTAAAACCTTCAATGGCTCCCCATTTCCCTACAACACATTCCT AGTC CTCAGCTTGATGTTTGAGGCTTTTAATAATATAG CTCTTGCCAGCTGTAC CTTACACAACGC CAGACACAC T CAGACCAGAGGTTTCAAT CT CAAGG GCATATAGGGACT CAGTGGAGTGTGCTGG GCCAGGAC AAAA C CACAAG G GCATGCCC CAACTCAACAACTGAG CAA CTT CACAGGTC C A G (N ) xCTCTATGATTTACTTAGACTCTATTTTTTA ATTACTTAGATTTACTAAAAGG TGCTATGCTCGCACCC CTTCAAATAT CCCT CC CCGATGTTT CCTCTCTCTAGG TTACTCCT CT CTGT CT CCTTTC CTGG ATTT ATTC CAGC C CAACCTTTAAG ACTAGCTT CTAGTTGC AGTC ATTC C TG CAGAATTACCAATTTCTTTTTACT CTTC CATATT GT CACAATGT TTTACAATGAAC C TG A ( N ) xCCACAGTTA TCTTCTGCTAGACTTTGTTTGCTATCACACACCTGCTTGGGTTTTTCCTGGAAACACTTCCTATTAAACTGGTTT TAAATGAG CC CTCATTT CAGGAT CTG CTTCTAGGA C CT TTGATAGACAGCTTTTGACATGTTCT CCAATGATCT C CAGCATCTTCTCTGTTAATGCCCTTATACAATCCCCT
> H s l6 _ 773055 82 -7732 08 31
TGGGTTGATAAGATTATCTAGATCACTGCTAAATCTCCCTGCCACTTTATCCTGTTGATACTGAGACACCTAACT CT CAAATATAATCTTC CAAAAAGCAATCCTTTACAAGG CGACTCTGAAAGACTCTTTGTG CTAG CAAACAATAC C CAAAGTGTGC CTT C CAGAGAGACAATATAGTAAAGCAG CATATTAATCTTGC CC CAAATC CTATTT CATGTCTT C CGGACATTGCAGAGGGGAGATGGGTGGAAACAATGGATCCATTGAGAATTACAGGCTGACCTCCAACTGAGACCT CTACCTGG GTGACTACATTTGGATGGAGAACAAGTACTTTGG CC CTAG CCACATTTTGTTTCCTGT T C CCTGACT GAGGTTGG CTGCAGTT CAGT TATGAACTCTGCTAGCAAGACCGTGACACTGGAAAAGACGTTAAAAGTGC CCAC C CATT CAGGATTTAATTAATGGTTG CACAGGAACCAACTTCTT CC CACT CGGCAGAGAGGAAAGGGT TGGCAGCTG AC CACTC CTCAGGC TG CCAG CT CT CAGTCC TTTGAGGAC CTCAG CCATGCACCTGTGC CAAGAGGT TGGCAGCTG AGGTTCGATGATCCAAGGCCTCACAAAATCTCATCCACGAATCAACCTGCTTATCGTTATTTCCTGCAAAACTTC AAAACCATAAAATTCTTTC(N ) x CATAAACCCTTCTTTTACCCTCAAAATATCCTTTGTTGTTGCTAGAGCATCT GTTGCCTT CTTC AGTAACTGT CTTCCTTTCTCTTGT CATTTT AT AGTT ACCATTTGTTGAGTATTACT AGGT A C C AGCTACATCC CTTTAGAAAACAAAAC CTTATAAAAAATTCAAAACAAT T CTGGTTACTAAAGTTACTC CT GACTA GGTCAATTGC CAA CGAATGAAT GACA( N ) xATCATACATCTTTTCTGCTTAGGCTTCTCTTGGTTGTAATTTTGT AGGATTCCTTGGTTAGTGATGGTCCACATCTGTGAGTCATGAGCCCAGTGAGGGCAGAAAACTTGTGTATATCTA TTAAACTTTG TAGAAT CCAGTATAAATGGCGCTAA CTT T CATTGAATAAATGAT TAA (N ) xAGT CAA CATAATAG GCTCTTTACG AATC CCGTGC CATTGT ATGAAATGTCTGTG AG TCTTGGTTCACGTGCATAATTAGG AG AGT CAG T AATTATTTCATGTCTTAAGATGCAGATAATTACAAAAATTCAGATTAATATGTGGAAAATACAAAGTATATCCCT TG CT CATAGCAGATGAG CATTGAT G T CAGTTGTTAT CATTATAT CATCATTG CTT CTTTCTGTT CTGCCTACTTC TGTTGCTAGTCTTTGGGAACTCTAACATCTTAATACATTGTTATTCTTGGTTTAACAGACAAGTCCTAGGAAAAT TT TCTG AAAATAAG CAAAAC ATTAAAACTT CATC AG ATGATT CTTAAAG CAG ATTTTAAAAAAT ATTT CATAAAA TC CACAAGACATGAAAGAACAGTGTGTCCACAATTCTTTCCT CTAA GTATTTTTTCCTATTAAC CAGTAGGATA G AAAGAAGGGGATAATfi C CAAATTAGTTTTT CCTACCTTCTGCCCTCTC CAAC CCTAGC CT TATCATGGATATTTT TC CCTCTTGGGATAGAGATGACTG CAAAAACAGATGTT GCTGCTCCTG CCAAACTCTTAAAAGTAG GTTGAACAA GT CACGTGTAATGC CAATGG CAGAGC CCCATCACTG CGACTGTT TTATGTAACCTGAGTT CAGAAT CTTGAACT C TTGT CC AC AG AAAC AGGCTTGCTC CTTTGAG G ACC C TGTCATTG AC AT CACAAC AACAAT AGGTGTGAGAAT AG A TAGT CCTCATTGCAAG CATATT C TT TT TA TT TA ( N ) xCATTCAACGTTATTCTTAATTCATGGTTCAAAGCAGAG TAAGCAATAGTTCAGTTCCCTGAAGATATTTGACTTTTGGAAAGGGAATGGGTCAGATAGTTTTAAAAATGGATA ATTTCATAATGTCACCTTTTCCCTACACAGAGGGAGGATGATAGATTTCATAATGTCACCTTTTCCCTACACAGA GGGAGGAT GATAGATT T CATAATG TCACCTTTTCCCTACACAGAGG GAGGAT GACAATAT TCTTGTGTTTTTTAT GGTAGCAGAGCATTTTTTTGGGGGGAAAGGGAACTTCAGATAATATCACATATATGAATTAAGAAATGTTTCTTA CTAGGCTT TTATTTTAAACT CACTACTCAT CAAACTTGT CCTTAAG CT CAA CAAAAATAT CTAC CTGT CAGGCAA AAATCAATATTTTTATAATACAGCAAAAAGTATGCTTACCTAAAAATTGTCCCTTAATTGACCTAACTTT <N )xC AGTTTCTATACC TGTG AAATGGGGGAGATAAATG AGCT CTAAATTCTCTT CCAGTTCTTCATTT CTAT CñGTTTT CAAGTGGACT CCTTTCTGCTTAATGCCATAATTT TGTGATACTATTAT TGTGGAGGAGGT TAAAAACACTTTTG C ATGATCCTGCTGTGCATATATGATATTCACAATTACAGACTGAAACAAGCATTTTCAAATGCTAACCGGATCAAT GTAAAAGTGAGGTT C (N) xAATATTAGGACTATATGACCTTGTAACATTGCCTTCTGTATTTTTAAGTAAAGTCC ATTT GGGT CAGGAACAAGATATTTTTCTGTGCTTGGAG CT CACATTGCAGTTGACACT CAATA CATACTC GTCAA AAGAATAAATGCATTTTAGTTAGCAAAAATGCAAGAAAAACATTAGTTAAGGCAGACCAAACTATGAAAGGGTGA CTCTGCATGT CAGAGGGACTTATAGCACTAAAA CAAATATAACGTCT AAGAAAATGTGAAATGT CTAT CTTCTCA GCAGGCCTAG CTGTGAGTAATAAAAGCC CT CACAATATGGGCAT CATAA CAAAGAAAGAAATTTGCTGAAAACAA TAA CAGTTATTT CTTTAT TGGTTT CAGACTATGTGGATATATACATAAA CAACACGTTAATATT TAAATATCTGG AAT C AAT CTTAAAAAATACAAAGACTGTGAATTAATAAAATTAG CCATTAAACAGAAATCAGAG GCCAAGTAAAG CACAAACAATGAACCTAACTTCTTGAGCTGGGCCAAAAACATCATTCTAGTGGCTTTTCCCTATATAAC(N)xGG TTTT CTGTTGTACAAAAC CTGAAG TTAC CATTTCATTTAG TCAAGGCAAC CAATGCATTT CTCG CAATTTATAAG GATGTTGACAGTTGGCACCATCCATTAAAAAATGATTAATTCACCTCTTGGCACCTAATGACCAAGCAGGAAATA GGATAGCAGCTGTCATTCTGTCTTGACTGCACATTTCTCTGTAAACACTCACACATGAAGCCTTGGGAGTAGCTT GCGAAAGG CATATT CCAATGGGTATTTCTT CAG CTTTT CTATTATACACT CTTTTTTTAAAAAAAGTATTTTTTT AAAG CACAGTAATC TAGCT C TTG CATGTGGGTGCTTTACTGAGAAAAATGGTCAATTGTTTTCAATGT TT CTGCT CCCCACCCTCCAAACCTGAGTGAGTGCAGGTGCTCCTGGCTTGCCTGAGAATGAGAATAAACTAAGATCACCAGC ACGTGGGGTCTCTGCCTGTGAATTCTTATCTTGACCAACTAATACTTTACTGGTGAAATGTACATGCAGGTAAAA TGTACATGCAGAAGATGAGAGAGGCAAAGGGTCACTTGATTATTAGCTAAATTTGGCCCAACCACTGCACAGCTG CATG GGATGGCATG CAA CCACAGGCACT CCTTTGTCTAAGAAAACACATC CAAG CACTTC CTAT TTTATTTTTTG GCCTCAGAC(N)xTACTTAGAAACCATTGTCCTGCATCTTACTCTGCAGCCAGCTTGGCTTTCAAAAGGCATGGG GACAGGGGAGAGAAGACCTC CTTT CTAG GAAGAGGAGACT CTCAGCTG CTAGGATATGGAGAGT CTGCAG GAACT GAGGGACACACTGAAGAACAGTAGAGCTAGGCCTTCTGCCAAGTGGGCGCTGCCCGCTCCTTCCTGTAGTCCAGC CACATGAAAGAAGG GATAG GAAGAGACAAAGGGGGAACTG GGCCTGGG CCTTGT CTCCATGAGGAACAGC CCATG TCTCTGGTTCATCT TCAGAGATCATCTC CC CAAGTGCTGTGACTGTTAATGGAC CACT CCTTGTACTG CACACCA GT TTTAAAAAGT CTATTTGC CAT CAGAATGGAGG GTTGAG GAAGACCACGTGGT TTGATC TAGAATAG CATTTT C CATTGTGTGGCATGGGCATTGCAGTGCAACATGAGACAATGTCAAGCGATCTCCTAACCTGTATGGGGCAGAGAT GGCCTTAATGGGTAGGCCAAGGCTGTGTGAAGAAGAGAAGAACAGGGGTTGAATGAAACAGGGGTGGGAAGAAGA GACATGCAGGAT CTCCTTTT CCTAGATATTGGTGTCTCATTTTAATCC CAATTATAATGT TGGGAACAATTTTTA AACCTAATTTGT TCATAAAACCAACCAACAATTA CAAAAATGACTTTT TGTTATTGAATGAACT GATTACACACT TGACTGTGTCCCTCATTAAACAAAGTTTAAGAGC( W)xTAATAGTATAAGGAAACACTTAGGACACTCTGCAAGT AT CT CTTCATGACGTTCAAACAATATGGTGGTTGTAGATGAGTAAGGCACTATG CCACTGTGGAAACTAACAAAT AGCTGGGCAGATTCAGAGAAACACACTGTAGATATTGCTTTTGAAGCCTTTTCTACAAAAATTGGTTGAGCAAAT GTCACTTACATCATGATCCTCAATGTCGAGCTCTATCAGCAAAGGCACAATTAAGTGGACATAATTTGCTTTTTC TCTGTGCAATTGTGAATTGGGATTCAAGATAATCTTACATAACATCCCATCTTTATTCTAAACTTAGTGATATTA AGCTGACTATTCTTGAGAAACTATGAACAGGTCACTTAGTGTCATTGAGGAATTCTTTTATTTTCAGATGAGTTT TGTAAACCATTTGTAGTGAC TGTATTTTATTTTGTAATTAAAAT TAACAAATGTAAATATTTG GCTTTTGCGGCT ACCCTTTATGCCATGTTCTATCGATTCTATGTTTAAAGGAGTTATAAAACAGTTGATTTTAAAATAATTTCTTTT TT TAGAGAAAATAATTT C TT TT A C T(N) xGAAATATTTTTTACAGCAAATCATATTTTTACTTAAATAGATTTAA AAGTTGGTAGAGGTGATACGTGCTTATGGGGAAAACTCATGATGATAGCAATTGAGTGACAACAGTTTGAGAATC ACCTCTCAGGAGTCTGTGGTCCTCTCAGGAGGAAGAAGGTGTCAGTCACCCAGATCCAGGGGTGACTGACGAAAA CCGTCTTAGGTGTTGACCCTCAGCACCAGTGGAGACCATAGAGGTATTCAAGGAATTAAGATCTTAGGGGAAAGA TAAG GTAAAACAGACAAAGCATGGAATC CTTATG CATATTTTTACATT TT CAAAGATAAAACCATATAGCACATA ATTAGATGAC CTTTTCCACATCACACTTAG CAATAATG G GTTCTGTGTTATAAATTTT CATAAACCATGT CTTAC ATAATACATG CTGGTGTATT GTGAGGTATTGTGGTGTATT( N ) xCCAAAAGCTATAAAGAAAGACAACTTTTTTT TT TT CCTG GTGTGGTATCGT CTTT CTAC CGTTAG CATATAGAAC CCAT GC CAAGAGAATT CTCG CTCATG CTTT C CCTC CATCTTAATGGACT TGGAGTTGCAGG CTCCAGATACTCTGGTGT GATTGC CCAG GAAATACTAT CATCCCA GAGCAAAGCAATTATTACAGATAAAGACTTAGATATGCAACAAGTCTTTAAGTATACTTTGTTGAAAAGACTAGG TAAGGATTTATTATTTCTCAACCCCAGTTCCCCATATGAAATTCCCAGCTCTGCCCGTAATTTATCATTGTTAAA TCACTTATTGTCAAACT C CATTCACATG CATCT CAGGGGG CAAAAAGTAGAGTT CTAAGG CCTG CATG CC CTTCT TTTTCTTTCTCTATCCCAGGGTAACGC(N)xCCAAAAGGTATAAAGACAATTTTTTTTT(N)x TATCTTTTTTTC TGTCACGTATTTTATCTG CATCTATTCCATTTTC CATT CTTAAT CATAAT TTTTTGTTTG CTAAATAG CAATAAA TTATATAAAAATTCTCTTAACTATAAAGATACGCATATGAACTTATGTAGACACACATGTAATCACAGATGCAGG GAAAAATACAGTATTTTCAGATCAATTGGACATAACTTTGCCCAGTTCTTATTCTCCACTGGTTTAATGATTCAA AATTTCAAAAAGCCAAGAACAATTACTTTTTATTTTTTTCATGTTTTCCTGCCCATCCTTCCCTTCCCCCTCTTT AT TT TTACAAAAATATTT TACTAGAATGAC TTTTAAATGTAAAAATACAT TTATGGTAGTTTTAT CTTAGATGAT TT C CACAAATGTTCTCATGACAAT TACAGGGGAACAAAATAGTGTGTC CT CAAGAGGGTAAATGTGT CAT CTGTT TCAT CCTAGCAT TT CATACTTTGATTTC TT CAAAGGCAAACAAGAAAAAT CACTAGTTGTTGAATAGCTTATCTA TCTTTCCTCTTCTCAATGAAGTTACTCTGTAGA(N)xTGAACCGCTGCACCCAACCATTCTCTAAGAGTCCTGAG TGA CATTTATACAGTTAAATTAC CTTTGAAATTñTGGACTTTTCACTAA CTTATTCAAGAACTTAACAAGCTGTC AAAGATAAAGAAATG GAATAC(N ) xCACATATATTTTGACTTTTCTCTATAATCCATGGTTTTATTCAACATTTT AT CT CAAAATATTACTTAGAGAAG CAAGTGAGAACACATAATAGGC{ N ) xGAAATAACTTGCTGTTTTTCAGAAG GCAACGGGGTTGTAGTAGAGAAAATTTTTTTTTTTCCTGTTGATAAAATGGCCACTTCTCTAGGCCAAGCCAAGC AGATTGAGAGAAGTTCCTAAGCATTATTTCTCTGGGATTCAAGAACTCATGACTTTAATTAGAATGGGAACGAAG TATG CT GCTAAATTTAA CAAAC CACTGTCTAGTTGATTATGCTGAG CTGGCTAAT C CAG CTTGAAT TGAGACTTT TT AAAG AAGTAAAATTGGGTT ACATT CATCAAAG AAGTTGAGGAAT AAAGTAGAATTT AT GT AAAT ATGC C CTTT ATAACCACTTCTCAAAAAATTACAGGAGGACACAGTATTATAATTACTTTGGTTTTGGCAAAATAGTGTGATGTT TTTT CAATTGAGAACAATTTTAAACT CATTAATG C CTAC CAACAGCTGGCTT CT GATGA CTGAAAATACT CTTAT TCAGTGAG GGTCTTGT CATATTAT GATTTATTAATATTAACATAG CAAGAAGACAGATGGATTTTTTTATTACCA CTGTTTACTCC(N)xGCTGTTAGCTCAATCCTAGGGCAAATTTAAACGTATCAGTTGGTAGGAAAAAGCCACAAG TGGC CT TTAATTTT CCAAGCAG CT CTGGTAGTATGTA CAGAG CATTATAAAT GATGGTCCTTTCTC CCTCAAAAC ACTG CTGTGTTTTGTC AT TTAG CT CCTCCTAG AT ATTTTTTAAATAC C AGAAATGG GT ATGT AG CATTCC CAAAG AGCGGGCTGTAGCTTTTCATGGACTGATTCAGCAAG CAC CAAAAGTTACGAGGT TGATAACATGTGAAGGGCAT C CATATTTATCTTTT CT ATGAGTTGTTTCCTTG CATG AATTTC AATT AC ATGG AG AT CTTAAACAGT CC ATGTTG C CCTATTTT CATAATAT TC CTTGGAATACTTTT CAAGTTGTCAAT TTTAGTCT CAGAGACT CT CTATAAGCAGAAT ATAGTT TA CACAAAATAATAAAGA CACACGCA CACCAAATAAACATAACATGAAGCTTGC CAGT CCAGCAGCAAG CACAATTGCGAGCACCATTTGGCTTCTCATAGAACTCCTGGCTTGGGTGTTCACAGGGTTAGTCGAACTGTGAAC TC CACG CAACAGTTAGG CAAGGTATT CCATCCAGATTTGAAAGT TCTGAGTCTCAGTCTTTGAC CTGAAGTCATA AAGGACCTTTTTAAAACAGATGTCTTTGCCCTTGAACCCTTTGAAGTTCAAAAGGGGGTCTCTCCCCAAATCGAC GTAT CT CAGTGTGATT CAC CACTT CTTGCATATAACAGTGCAAATCAAAGATTTTT TTTAAAGT CAGATTTGTCA TCGCTTCCCATTTT CAAGTGCTTCAGAGTACCACGTGCTTCAG G GAATTGAG CTAGAAGC CTACTG GCAAC C TGT CTGTTC CT CAGAGCAGGC TCCTTCAT CACAGCG G CAGCTCACAGATG GTTCTCGGTGCTCAGCTCCTGGTCT CAA AGGCAG CTGGTCTCTCTAGAGGTT GAAAGGTAAG CCCCTGGC CCTAAGGTGCTGGG GAGGACAC CAAGAT CAGAT CTTCCTTGTGCATGACTTGCAGCATTGTTTTCCGTAAAACTTGTGGTTGCAGACACCATGCTGAGGAACTAGGTG ACAC CAGTTGAAGAAAT CTACG CAGGATGGAT CCTCTAAAATAAGAAAATATATTTAGCATGTTGG CTAG GGTGT CTATCC ( N ) xTTTGGTGATTTG CATATTATGATTTATTAATATTAACATAGCAC CTTTTT CTTAAGGTGAGGGC C TACAGAACAAAGGT CTTGGGCCCACCTCCCTTGTGCCTCTATGTGGTTTTTC CCATTAAACATTGGGACTTTGGT AGGG GGGTGAGGTAGGGG CAGGATTTAGATGC CAAGAAAAGACT CTGACCAT TAGCAAAATTAAATGATT TGTTT TTAAAAAATGTCACATCTAATCCAAGATCAATGTCTGATCCTCCCTATGTAGCCTGGGAGCTACCATGGAATTCA GTGTCCAGGGAAATACCCTAAAATGGTGCTTTAGGGATCCCCTAAGGATATTGTTGTACCACTAGTACCGCTGGT TCTG AT TGAGAAGGGC TGT CAG AGGGGAGCATTAAGGG AGGGTGTG CTGCTGGTC C ACGAAC TATAATTT CAGG A G C AAGAAG CCTG ATGT AT TGG GAGTC AAG CCACCTT ATTTG C CTG C AAATG GGTTGAGAT TAAC TG AAAAAG CCG GGTCTC CAACAGTCTGTGTAGGTTTAAGTCAATCTC AACCAGTT TG CAAGT AAATAAATT TAAG T C AAC ACC AGT CCTTTCTAATTTCTTTAGTTGTTCAATGAGCAGGACAGATGTGATTATTTCACTGTACATCAAGAATTCTTACTC AG ATGGGC CTCAGG AG AT AGGAAATG CAAGTG AATG GGTATGTAAACATAAGTG CATGTTGT AGGAG CAAG G GC C CAGT CATTTCACTG( N) xATTT TA CTGTATTTTTAAGGCACAGTTTTGACTTTAAAAATATTTAGAGTATACACT ACACATAATTCAGAAATGCTTAGAGTTCCTTCTTCAAGCCTTGATTTTTCTGCCTAAAATTCCTTTTTATACATC CC CTTC ATGTAAGT AAGT CAGG CAGG AG AAGTGAGATGTATT ACTATTCTGT C AGT AT AAGTGC TTTTTTTGATT ATAATTACTAACAGCTGGTTGTTTGAGCTTCTACTGTGAGTCAGGTATTTCATTTACACTA(N)XTTCCACTGTA CAATGCTG CCTG CC CC CAT CTG AAGATGTGG G AT T C CATGAGGAGC AGCTTATT AGTATTGAGTGC CTGCTC AT C CTTAAC TTAATATGAC TAGTCTGCAG CCCAGGTAGCTTC CAGACACGTAAA CACCCAGTT TCAATG CAGTGTGAC A CAAGT CT CTGGGTGGTATGACTG CTTTAGG GCCTTCTCTCTTTTC TGAAGAGAATTTTATGTATAATAT GTACA GT GCAAC CATAAGGATGG CTTCCCCAAGGTGGAGAT CCAACCAAG
> H s l 7 _ 53361889 -533 84 43 4
GCTCAT CAACTCAG CTGCTCCATAATTAG CTGGTGATGGCTGATTG CCTGCCCCATGGTTGG GAAG CCTG G G CAC TGGGCGACTGCAGGGGAGGGGGACCAAGGGATGACTTCTTCCAGGACCTTCTACTCTCTCCAACCAGAGGTGCCC TCTAGCCCTGTAAACTTCTGCATTCTCCAGCACAGTTCCTGAGAGGTTCAGAAGTTACCTGGGTCCTGGTGCTCT GCTC CTGC TACC TTTCAG CTCCTGCC CCCAA CAC CCTGTTCAAATTGGGAA CAAGTTGAC CACAGG GCACAGTGA AT TTGAAAGTCCGGAGATTAAGTCATATGTTGAAAGAGGGC C CAGTGAGCAGGGGC CACTAATC CCCTTTCTTCC T CAACCTAG GAACCTC CT CCCTTC CCAAGATATAGATATATATAGATTGAGACC CT GAGCAATG GATCTCTTTTG AGGC CAGTGGAGAC CAAGACCACTGAAGCAAGGAGT CAGGGACTGC C CA CAACAG CAAAG CTCCCCAGAGTCCAG GCCACCCCATGGTGATTGTTTGGGTTCCCTGTGATTGCCTCCCATGGCTGCCCTCGTTCATAGAGAGTTGACTGC ATTTGCCTGGGTCC CTGTGGGGTGAGAGTATGGAAGTATGTGTATTAGTCACAC CTTACACCAGAC CAGAAGATA T ACAGTTT AACCTAGG AGTCAAGCTC CTGCTA T ATTTGGAATGGTCTC AGTTTTGAGACGTGGT AC CTTGGATC C CAGGTATATCCAAAGTCTTGAGAGTGTCCATTGAGGACTTTGGTCTGTAGCCACCAAGAAAGAGTAAACATTTGT G CAGGATCATGT TTGATGGGATTC CATGGATTGT TTTCATTTCT TAAGCAGCATTC CCCTGAGC CTTTGACATGT CTTTCCATCTCTGGCTGTGTTCTCACTTGCATTTGC CAGGCCTG CAATGCCT CTGC CTTGATACAACACAGTGTG CCAGCAGCTGCAGTCCCTTTTCAGCCAGGGCCATGGGTTTCCTTGGGTCCTTCTCTCTGGCTCACGTTTTCCAGG GCTGTTGGTGTGTACAGT CTAGAGAT CACATT TTATTAACACAAAGAAATCTGCTTGGGC TGCCTGGCTCTTGCA GTCACGTGGAGAGAAATGTGTTCCTTTGGTCCTTCAGCCTGCAGCACTCTGTGCAAAATACAGCAGAGAGCTGGG CACTGGGCCAAG CTTACATTTCAC TT CGGAA CACACTTTCGCATCACTCCTTGGCAGAACATCCACGGTGGTGTC TATTTGG GG AGT AAGTTAGAGAAGTGTTGTTTGC CT CTTATG AGTGTCGAGAGATAGG AC CTTT AG AAAATGTTT TACTTTTG CGCC TGTCTT CAT CTT CTTCGCGGTG C C CAGGTACTGAG GTGAT CC AG CT AAATGTGG CAGTGATG C CTGAGAGAAGCC CCTTCCTTCCTCGGGCCACC CTTTTCTCTTGCAAAAT CAGTAAACAGAGC CAGGACTG CCTGG GACfiGG GAAACTTAG G CCAAGGTAATAACATCAAAACCAAAGTC CTG GAACT CTTGAGTAAG CTAGTTAC CCTC C CAGATAAAA CAACTTTGT TGTC CAAATCTT CT TT CCAAAGAT CTAAGAC CTTTCTAATGT TG GT C C AAGG C C AG C CATG GAAG GAAGGAACTT CAGCAAGC CG CCTGGAAT GG CAAACACTGT TAAAATAAA CAAATGGA CCTTTGCCAT TTGGTTTTCCCTGACTTGTCTATTTTTATTTCTTTTCTCCCCCCACCCTTGTACTTTACCTCTCGATAGTACCCC CCTTTCTC CACC CCTACT CC CATT CT CATCT CACTTTTCT CAGG GG C CAAGCTG GCTGTTTTGC CTTCAG TAAAA CAAATATCTAAT TT CAAAAAGAG GAATTAG GAGAAAGAG GTGGTATAATT CATTTCTTGGGGTAAT CTCGCTTCC CTCAGCTCCTCTATTGTCTGTT CTTGTAGGTGGC CACC TGAAGCTTGACTTG CT TT TGAAAATAAAGTTGGTTTT GGAATT TTAAA C CT TC CTTAGGAAAGA CTTGAATGGTTAGAAAACAAA GATT TATTGGAATAATAATGACGT TT C
AAATTC CATAAG GATTTC CCAG CG C CAAGC CTTATT CAGAGT CTTATATT TACTGCTTTG GAAT GTTACTTTTGC CATCAGTATCAT CCAG CCAAGCAACAAC CAAT CAAGACTTACAGAAACTT TTCCAGCTAGATGCATGTGC CAGG C TGCTTCTTCCTT CCATGATGGAAGGC CT CC CT TGTCATAGAGGATG CC CAAAAT CT CAGGGACTGGGATACAGC C TC CTGC CAGT CTGTGGGTACAGTCTTTGCTTACT CTGCAGAAGGGAGGTGACAGAAAC CCATTCCACCTGTCTCA GAAACCTG CTGGTAATTAACTTTAAAAATAATGC CAAGTT CT CACGTGGCTGGTAGATGCATTAGCTCAATCAGG GGGGAGGCAGGG GAGAAG CATG CTGCTT CAAAGAGTAATACTGT TG CAAT CACATTAGTCATTC CAGGGGAGGTT AGATAAGGTTATTGAAAAGGTGAAGT CACC CTGTACATTGAAAGAGGT CTTC TGAAAGTGTTGAGAAATAGT TGA AGTT CAGAAGTTTTAT TTGGAGCTTTGCTCCCCCTTCCCC TTGTAAATAAGACCAG TATT TTTTAAAAGC CACAT TTTCTTTGTTTGAT CGATGAACAC TC TCATAGTTGATGAT TGAGAGTCATCTACAG CCAGTATCAAGGGT CAGTT CTGCAT TACT CATGTGTAGATTAT CT CT TTG GTATACTAG CCATGGTCAC TGTGTAAG CCCTGGCCAGTTGC CAC C CAG CATAA CAACACCATAAGG CATCTACC CCCTGCCTGCACTTGG GAAGTGTT CAGGGAATGTAATTTCATTT T ATAATTAAGTGAAACAATGCACAGTGCATTTCCTTGGTGGATTCTAACAGCAGGACAGTTATTATTCAGAAAGGA CTGT GT CTGGACTACT TTGAGAAT CAGTAGTT TAAGTT GTTGCACCTGTGTTTGGCCTATCCTCATACCAGTGTA
a a t a c t t c a a t c c t g c a a g a t t t t c t c a t t t t g c a t a a a a c a g g t g a a g g t t t g t t g a a g t c c a t c t t c c a a a c t CCCAAAGTACTCTGGATACCTTGGTAGGTCCTTGTGGACTCCCTGTTGTTGACTTGCAAGTTTCCCAGAGGTGGG GACTGGCTTT( N ) xGTAACAAAGATACTAATTTCATTAGAAATGGTTTGAAATGAGTGCTTGTGTTGGCAGAATT TTGT CCAACATG GTTGGT CTTTGCAAAG GGTTATGATAGTAT GATG CACG CCTTTGGG CCAGAT GTGAGT CAGGA TGGTCTTG GAG GAGGCGAAGTG CAGATATACC CCTG CTGATAG G CT C CT CATAATACTTCAGTGAG CTGTTTTTC TCGTGACTGTTT CA CTA(N)xAGTTTCACTGTTTTCTGATTGGTAAACATTCAA( N ) xTATTCAAGAATGAAGAG TGTTTT CAGAGAAGAG GATTTAGATG CT TACAAATAAAAGTGGATATT GACC CC CCAATT TT CCA C CCTTTAAT C ACAGAGACAT GATTGGAAGT CAGCAGCTCCAGCCTG CATGTT CC CAGAT CAC CTGTAAGAGTTC TTAAAACT CAT CC CACAG GTT CTTAATTGGGATTTTTAACTGGGTAAGTAGAGAAGCTC CATGTTG GGTGTGGGGAAGG GAG GATG CAGATGG(N ) xCATCAACTTATGTTTCTTTGGTTAATAGATGCAGGAGAATGCCTCTTCTCTCCATGTGGCATGT CAGGGTATAG CTTCTTTGAG CACAGGAT TTGGTTTACAGAATGG CATGTCTC CC CATG CCAG CCAGTGATAAGGG G CACTAAAACAGA CTTGGCCTTGC TAAAGG CTTC CAAG CACCAGG C CATGAG CAGTTGACCCCACCACAGAGGTA TGTACAGC TGGCAGGAAG CTGCCCTTCTGGTTCACTGACT CAAGAGTTGGAG CAGATTGATC CC TGGATTTG CAG TG C C CT CAATAT CG CTGCACAGAATT CATT CTATGTACCC CAGG CCCTATGTGC CCACAG CAG GAAATATAGTGT GAGC CT CTGG CAATAT CCAGAGTTGCTTGAGC CTCCTCACATTTGCTTCTCC TATT CAGTTTAGTGTG CCTCAGT TTA CTAATATGC TGGGGAAAATAACACC CCTCTTGC CAAGGTGAGATAGCAGAGTT CT CTGCATAAAATAGTGCA TAAAAT TATT CC TATGA CAAGG CATTTGAAAAAT TACT TT GAGAATGACT TAAAATA CAGAACAAT TTTTTTAAA TGTG GCAAA CATCCTTATTCTTATCACC CAGAAT TAA CAATT CAGTACAGTT GC CATATATT CATATATAAATTT ACCTATATATTCATATGT GAAT CTAGACATTG CAGATT CACTTGGG GGTCTCAGG GAACTTTTTATAAAAAGAAA TG CATTTAGTTCGTAAGG C CTGAT TGAGATAATT C CAAGT CTGTTTTCTCAGCTCT TAAT CCTT CCTAG CAC TTT GCAT TCATTGTATTAATAAATAAG CCTGTTTT CAGC CTAG TTAGAAAAAAAAAG GC CAGC CT CATT CATTAGAAT TCTGCGTTCC CTTTGGTTATTCTTTGTTGGAGAGAAG GGGGCTCA CTGGTACAGAATT CCAAGATATTTC GCTTT GGAATGTGAAAGGAGT CT CT CT TGACAACC CAAGGAGAAC CTGCCTCA GAAT GATCTTTCTT CCTCAGTG CTGGA AAATGAAAAC CAGACAAATAAG CACAAC CAACAT TAACAGAAGG CCTT GGTACAGC CT C A A (N )xT T T T T T T A A T TAAGATTTGAG GTGCTATATGTGCCT GAGGAGTTATAGG G GATG CCGGTG GAACGACCAT TACATGTGGCAG CTG GACTGGGCAACG GCCGATCGCCACTGTGCTCT GAGGAGAGAAG G CAGAGCTT CCCCCTTTATCTGCTGGCTGGTG AAG T CCTGAGAACCGGG CAGTCAG CTAAGCAC CAAGTTTT CT CTGACTA CAGAG CT CCTATCAAA CGGATGGGTA GAAAAGGAATGT TG T C < K ) xTCATGGAGCCCCAACTCAATGGGCTGCTTCTCAGTAGAAGGCTTTGCCTTGTGGA AGGTAGTAGAATGT CTAAAATCAT CA GAACAC CTGTGTG GATGGTACAA CACGGTGACAGATATGTAC CAG GGAC CTAGATGTGGAGGGAGTGAG GAG GAAAGGAGCAACATATG CACCTCTGGGGAGG CC CTAAGGTCTC CT CAGC CAA TT CCTAGGGAAT TTAA TTTTAAAGGACACAGAGAGGGTAG TTTT TAACACAGGT TTTACT TTTAAGTCATGAGGG AT GACTTT CTAAGC CACAGTTC CTTCTGATAACGGT CTTAAAAGAGAAG CAC CATGAAGTGATAAACACATCAGT CCAGGAAGATTTAGAGACAG CCTAGTAGAGGAGGATTGGAAACAGTGTGG CTGTGTCC CTGAGGGTGCTGGTGCT AAAC CCACGG CACAGT CCATAT CAGT CAAAGCTT CATT CTCCTTGTGC CCACACTATG CTGATCTG C C CCAGTG C AACAG GGAGACG CATATGGT CAAGAGAATGTGT CTT TCAC GAATAGATTT T C TC CTTCGACCATGGTGAGAAGT C CTTGTAAAGAAG CATACATGAATATATG GATGTGTTGGGCAT CAGCTGTG CGTGTGGCCCTCAG CCAGGTGAGTG GAGGAAGTTG GAG G CATT CAGT GTTCACCTCCATGTGCTTTT CAGAGACATACCTGGTGAGC TGT C TG CAGATAC ACAG CGTAGTGGGAGAGATACG CACGTGAAAG CAGCACAG GG CATGTAGCTAGCACTTGGTAAATATTAGTTTC C TTCCCCTT TCTGTGTGTAAGG CAAGTTGTACGAG CAAAG CATCCGCAGCT CAGAATGCAAAAATAGAT CT C CATG TGGGGATGTG GCCTCTTTTCCC CAAGTGTG GACCAGATGT CAGG CCGC CTTGTGAGTACT CAAGGAATGTTACG G CTTCCTTCTT CAGGACTT CACTGCTTCTCTCCATAT GGAGAAGG CTCCCTTTCCATGCACTTC CTGGATCTGAAA AACAAGAACAAGAAAGAG GATG GCTCTAAAGT CT CAGGAAATGGACCTTT CTATATCCTC CC TTTTAAAAAGCAT TTGTATGTAGGT TTTTTAGGTCTT GCTGTCAAGT CGGATACACAGCTGT CAT CT GTTATATCGGTGATTTAATAA CATAAT AAAAAT TTATAGAGAAC CA CAAG CTCATGGATGTTTGT TTTGGAGT GCATTTATTCAT CATTAGTATGG GTATTTTTAATCATTTACTGAGGGCCTAGCAAGGTGCTGGTAATTCAGAGATAAGGAAGATGTCATCCTTTAAAC TTGAGAAG CTTATGAATGGC GAAGAGATATAATTAATT GT GAATGGAG TGTG GTAGATAG CAT CAGGTTG TATAT GAGGTAGAATCAGGAAAG CAAGAATGGC CAGCTCTGAATAGAGTGAGGAGGGAAGATGGGAGATGATC CTTGAGT TGTGTCTTGAAGAATGAG CAAAAATTATAT TCATGGAAAGAAAGGATACAATATAAGT GACT TGTTGAGT TTTT C TTGG CTGATACTAAGAATGAAT TT TCAT TAAA TATGCAC CAGTAATTCACAG GAAATGAT CAGGTCAGAATGTTA TGGGACATTGGT CTGAAATATAATGAGAGGGGAAAAAAACAGAAAATTTAAAACGTTTAT TTAT GATGGCTAATA TT CAGAGGGGCTAA CTCCCTCCTG C CCAGATAAGGAGAACTTTG CCGC CTGG CAAGGGAC CTGCGGCTCTGCACT GC CAGGG CAACAGAACCACACGTCTCAAACGTGGTGTTTAAA GAAGATGT TG GAAGGTTAGA GG CGGTGTAATGG ATGGGATTTGACAGTTGCTGTGGCAACACACCCACTTCCAGAACATTAGTGTTAGTGGGAGGGCAGGCAAACTCG TGGTACTTGCCCTGGGCTGC CCTATGTC CCGTAC CACCTG TT CTATGC CAAAGG TATGAACAAA TTCAAAAGGAC AAATGCCT CACATGGCTGATGG CTGCCTGC CTTTGTAGAACTGACCCT GGAATAAGATAAGAAGATGAGTAAAT C AGAGAAAAGACAA CAGGGAAATAG CGTG GAGGAGGAGGGGGAGG C CAGAG CAAG CAGTGT TAAC TCCTTT CAGGT CAACATTTCCCAGGGTTATATTAGAATGTATTGACTAACTGGTAGATTTTTATGGGGAAAAAATGCATATGGCAT CTGTGTGT CTAAAATGCC CTTGATAGAAGCAGTAGTGTAGAATG CTTTGGGG CATAAATAGT CCTTAGAGTAGCA AAAAGACAGGTTAATGCAAAATTGGTTG GACAGGTTTTGG TTG GAGGC CTGAGTGATTGC CTCCGGAGCACCACC TT CTGGGG AGTCATGGGT ATGACATGGTGTGAAT ATTC CTGAGGTGTGGGTGGTTGGTTT CC TT ACAAGGTAG G G TTTGCAAGCTCCTTACATCCACAGGGGCTGTTTCCTCTGGAC( N ) xGAAATTTAAACATTTTATTACAAAATTAA TATACG AT TGTTGT AAAATAGG AC TGTATAAAGAGT AACT TGAGTTCC CATATT C CATTT CC AG ATTT AACTGT C CACAGTTT CTGCGTACT CTTCCAGACGTTCCCTTTGTG TTTATAAGCATT( N ) xTTACATATGTTCTTATCTATG GC CTTAGATAGAGGGTGTAAAATTAGGAGACG CAGCTC CCACTGATGATG CCTG GGTCATGTCCTTTCAGATCCA AATTGCAAGATCTC CTTTAATGTGGTTAATTGGGAGAAATTGAATGAC CCATTT CTTCTGAAATTTCTGTTAAAC TGGAGGTCTTTA TT TAGACAAATGTAAACATTAATCCATCTTTA TTAA GATAATGTAT GATTAAAGCAA CAAGGT AC CATCTAGGTT CTGTTTGATACTTTTC TTGT TATTCTGTAG TTATTT GT TCATAAAC CATTTTTTCTTCTTATA GG CTTTCAAGGAAGAATTAT CTAATTCC TTTT GGTGTGGC CT CAGATT T CATTAGCAG TAGGTAGAGTTTGTAG T TATTGAAAGTCTTT CTCTTATT T CAGTTTGAAATGTTC CCAGGTTGGCACATAGCTTTGG GAAC TGGAAGTGTCT CTTTTCCCTTGCTTAGGTflGCCTCAATAGATTGAAAGAAAATGGACATCTCTTCTTCAGAAAACCTGTTCTAAGA ATTATCCATCTTAGGAAATTAGGGTGGTTATTGCTCTTCTTTACTGCCTGCCTGAAAAAAATTTCCCAGCCCTCA TTTCTTTCATCCTTTCCT TT GAG G CCATGACTTGTCAC CCTCTG CTGTGTTC CT CTGT CC CCTGAGCAGCTGTAA GGGGTGAGAGTCAC CTACCCTCAC CCCTTACACT CCCACCAACCTGAGGACCACGTGT CTTCAT CTAACAGCA(N ) xAGCATTTCTTTCTTTTCCTGTTTGAAGAAACTGAGTCCTGTCTACTCCCACCCAGCATTTGCCTGGCATATCT TCACAGTTGGTCT C CTTATCAACCAACTGACT CC CATGATTAGGT CAGATTAGATCCC CTGTTCAGGGGATTTAT GTGGGG AACTAC A CACTGATTGGGACAG GATT TTGTCAATGTACT CAAAT GAC AAAGGAC TT CAGAGATGTTGGA ATAT TCTTGGTCTACCCAAAGGTTGCAGAC CACGGTG G CATTTC CTTCTTATATAGGT CCTGGTGGGCATGGGCT TG AG AAG C TGAAAG AAGACG CATG GAAC TC AG AAT CTG AT CTTTTGATGATG ACTTCATATATATGGT CTGTTT A CC CACGAG GCACACAAAC CTTAC CTTATGCCT GTGAAATATAAAGGCAGATTTTTGTC CCATGAAGGTTTCTCCA TAGACT ACATTAAG CAG G AG AC AG ATGGGTTT AG CTTG GCTAAT CTTAGTGC AAGCTG GAAAATTTCAGTTTTT C TG AAA C AAC ATATTT CTAAG AAATTGTCTG GT CT AG AATG TT AT TCTAGTGT AT GTTAGGGATCTGTATG ACTGT AAAGCAAGGCTGAATTACTGCAGGCCTATAAATATGTGTGAAAATTTACCTACTTCTCCCTACCAACATGGAGGT G G AAAG AAGTAGGTTT AATAAAC C CC ATTTTT ATTACGTGTGTAAATC AGTGG CTGTTTGGC C ATTTTT CTCCT A TGTAGGACTTGAATTTATAT GAGAAAACTTGAGTGTAT GC CTTTTCCCAAGT CTTGGCTTGCAT CTGAAT GTAAA AATC AG ATGCATG CTC AATG AATGTT AATT CAGATT AT AG AT AG CTAT AAACTATATAGG CTGCTCTGTGGTCTT TCCTTTCCCTCTGCCTGCAT GACTGAAGAC CAAACTGT CACTTC CTTA TAAT CTAGTTCTTCC CATGATACCTCA TCA CAAAG GTATATATCAGTTGGT CAATAACAGGA CTTGTGCAGTGAATACTTACATGAAGCAGACACTGAAGT G CC CAGCAAGAACTG CAAAGG CTGG CAGTATTAGGAGAT TG CT CACCGTAC CC CAAATGAGATGC CAGAGG CATAA AGTC CATTTTGTTTAGGTAC CAGATGAC TC TGTGGAAT TAACTTGATTTGTC CC CAGTAATC CATATT CT TTAG C ATACACTT CCAGG C AACACTTATT GCTTAG CAA C ACTT ACTT AGTAAC CT TT CAGAATGATC ATTAACTAAT AGG GAGACAGC CTGTGGTGCCGATCGTAGGCTGTT GAGGCTAT CTCTACGGGCATTTTCACCTATCCTGCCTACTTCC GTAACCTATGCC CCACAAAACATACACT CT CCAGACATGAAATTTAAAAATAATGAAA GATGATG CAG GAACTT C AG AC ACTAGATG CAAT AG AC AG CAAAAGGAAG CACCTATGTT AG CCTTGGTT CT CAAC AT AC CAAGATGCTCTT A TGAACACTTCTAACAAAATGTT CC CAAGGAGAACACAAAG CAGG CACCTAACAT CCAAGAAG CATGTCATCACCT CAGCATGC CTGC CACACGTC(N)xCAGTCATATTGTTCCTCATTTGC(N)xTGCTTGGCCTTCCTCATTTGTTTA AC AGAT AT AGT AATGC AG CC ACTTG ATT AGGAGG CAC AGTGACCG CTC TTGAGGGATC AACAGTGTAACC AGGG A AACATCTC TTTG GT CAGT C C CGTTTCACTAGC CTGTTCAAGAAC( N ) xGTGAATCTTGCTCTCTCCTCCTCTTTC CC CATGTT CAGTTGGCfiACAATTCATGTTG CC CC CATCTCATTAATAACT CT TGAATC CACCTAGTCC TCTGCTG TTGATGTAGGTGAG CTCTTACCATTCCTGGTTCCC CTGTAAG CCAAGC TCTGCCTGGTTG CCACAACAAT CTTT C TGAAA CGAGTCATC TGAT CAAGTGAGTC CTTT GC TCTAATTG CATCGATAGTTC C CCGTT CT CTTGAG GATAGAA CCTGAACATCTAAG( N ) xGTCTTAATCAACAAAGACATAGTTTATCATTTTCTGTGTAAGGCAGTGGCATGGCCT TTGC CTG G AAAG ATTTGC AATCT CGTTG ATTAAAAATT AT AG AGTTACTT AAT CTGAT AAAAAAATTAT C AGTAT AG A C ATTGTGAG AG AAGTTAGG AGG AATTGTCGC CAAC AT TG GT ATAGT C AGGAGTCG GAGG GCTTGAATTAG AC CTTG ATATGGATGTTAGGAC AAAT AGAAGTGG CGAAG ACG CAAAGCAAGATT GT CTAG AAAC AAAGCAGACC ATT GG CTTAAGCAGAGAGCATGAGATAG GAACAATATGC CT CC CTGTGGAGGT CT CT CCCCAGTAGACAAAGATTTCA GACCAGGAGG CAATGGGAAG CCATTGGCTTCCTTTGAT CATGGTGG C G { N ) xT T CCTCAT GACTGCA CCT GT CTT GTGACAGGGATCTGAAA CA CAAAG GAAAGC CTCCACTGCACTCCA(N)xATCTCAAGGTTCCCATAACTGATGGG GTCTTTGGATTCCAATCCTTGTCTCATTTTCCCAACCTGCCTGGGAAGTGGGTGGTTAAAGTTTAAAGCCTGGAA GAAT GG GGGAAATAGT TC AC AGGCATGAGGTGACCT GC AGGAGAGG CTGATGGT CT AT AG GAGATTGGAG G ATG A GTTTG G CTTAAACATGTTGATTTAGAAATGACAGAGG GñTTGGAGG CAATGACCAGCAGT C CACAGflTCAGTTTA GGACTC TGTGAACGTTAGTG GAAGAGATTACAGAGAACACTGGCTAGGAG CCAAGGCAAGAAGG CTTTGGAAAGG CATGTTTGCCTTTGAGTGCTTAACTTAGAGACTCATGCGGGAAAGAATCTGTGGATCTCTTTGAGAAGCTGCAGA ACAG CAGATACACCTGAATC CTATT CATGTTCATCCAGAAACGCAGACTT TAfiAACAACCTTGT TATTCTT C CTT TGTTTTGTTT CGTTTG GT TTTAGGGGGTTGGAGGGGAGGGAAG G GTGTCACCGGGGGAGCATAT GGGTCATTTG T TTG CAGTTCACAGGGG TT CTTAGTGAACAT CAG CGAATATTGTCGCTGATTT CGTCTC CTGCTT CACTCT CTTTT CTAAAAACAAAATATT TAATGTAGCC CAAAGGAGAATAGT CAGT TTAATAACTAGCCATGGCAG CT CTTTGAAAA CTGCAGGTTTCAAATAACTAGGCCTGCACATTTTCACCAACTCAGACAATTAAAAAGACTCCAGGGACCTAGGTG GTGCTG ATTC AAGTGG CG CATATTGT CATTTAAAAT GG AAAGTC TATTTCTG AAAATAGACTCCAG GTAT GGGAG AACTTGTTTCTCTGCTGGAGAAGGTAGCCCATCCTAGGGGGACCGAGGTAAATTGCTTAGTGGCTCAACATCTAC ATAG TTAAAATGGGCC CATT T G ( N) xGCTATAGATAGGCAGCATGTCATTGGACTAGGTCCGCCAGATGCAGAAT GTTGACAAAAGCAGAAGTCCCCTCCCCTCAGACTCCTCTGGGGAAAGCTGGCAGCCTTGACCACCATCACCGCTC TGCACACCATAG CAGGGATATGAGAAGGGG CT GAGGTGAG CATTGGACTGGAGG CTCCTC GAGCTGTCTCTTGTT GCCTGTCCTCTAGTGTGTAGAGTTGCAGTGTGGCTTGAAATGGTCATCACACCCACCATCACCATGGCAGCCCTG GGAG GAGGGACTGGCTGCTACTCT CT CAAGAAGGGATAAC TGTT GCA CAAAACACTGCAGACTT TGAGCTGGCCA GCAGTTGACCAG CTGACCTAACCT CC CGTTTT CCAGGT CT TAAC CTACCAT C CATTCC CTGCAAGTA CAGAGGAA ACTAGAAAGCTACCCTTCTCCCACCATAGTTCTGTAACTTTGGGATGGAGTGAGAAGCATGGCAT(N)xATATGA GTGTGGGAAAGGGGAAGCCATTGCCTCTTGTTTCCCATTCGTGACCATCCCTACCCACTTCAGTGGGCCCAGTGT TG CCAGACCTGAAATATACCTCñTGTGGGTGTTTGT CT CTGGTACT CTGGGTTCATTT CTGGTCAC CCTATGATA TTTCCTTGCC CTAAGGTACT GGCACAGCTATCGGGTAGGAGGAGGC CAAGGTAGAAGG CACAGAGC CATT CTTG C CTCACCTTCTGC CACACGGTGACC CCACATGT CTGGGGAGAGGCTC CCGG CC CAGGGAGGAGGAGGTGGAAG GTG CACTGT CATAAC CATCACAGGGG G CAATGG CGAGAAGG CT CATG GG CCACGTGC CTCGGC CATTAGTTGT CATG T CTGTTGTACC CCAACTTAACTGACTT CTAA CCGGGTGTGGTTATGAGTTC CGCTGGTCTCACCTTT CAGTGGGTT CAAGG GGGCAGATTTT G CTGACAC CATAGG CAAGGAAG CATGATGGTAAACACCACTGGAATGAGAG GCCTT GTT AATC CAGCCCAAGCTC CTGC CAAGTTAGAAGC TGAGGAGT CTAGTCTGGG CCACTTTGAGTACT GATGCATGAT C CC AG GG CG ATGTGCTGTT CTGGGCACAGGGTCA CACTGG GTG AC AT CTAACAGG CTTTGCTTGGG CTGTAGCTA G AGGGAAATGGGAGCTGGGGAGTGTGACAGGAGGGAAGTGTGGACTCTCCACCTACCAAGGGGCCTTCCCACATAC TGTCCATCCCAT TCAG CGCC CTGAGGTGGG CGTCATTT CT CCCACTTGACAGATAAGGGAATCAGACCATGAGG C AGAGTAACTGGC C CCAGG CCACACACAACTA(N ) xACACCTTCTGATGAACTGAGCTACCTCGTGCAGGACAGGC CCAAGGAAAGGG G CTCAGGC CAGAGTGGGAGCATTGAAACAGGGTT CAAAATGT CTGT CTGCCCAAACCAAACGT TAAT TTGCATACTAATAGTT TAAG GG CATAGATATAATACAGAC CT CCCC CAAGAAGTGACCTTTGAAGC CAACT TCTTCTTCATGTGACGGAATACATCAGACACCCTGTCTGCCGGGAATCCTTACCCTTCTCCTAGTGAGCCGTGGT CT CTGG C CTATC CAGAAG CT CCTTAAAAATACTAGTGCATGAGATGTAGCATTGACAGGAGCCCTGGCCTGTGGG TCAACAGTTG CTTTGT CAAATGCAAT CTTG CC CTTGGCACAGCT CG CCAT TCAG GAG GATGGAGGAAGTG GACGA GAGAGAGGGAGGTAAAAGAAGGGAGGGAAAGGAAGCAGGATGCCTGGCTGCCTGGAGCTTCCTTGGGCTCTTGTC CATGAGAGGAAGCCATACCTACCTCTCTGGGGTTTGCCAACTCTTGTCCAGGAACCTATTGGGCTAGAGTGGAGG GGAACTGCAGAGTCTCCCCTTCCTTTCTCTTCTGCCCCCAT(N)xGCTGATGGTTCTGTGGAGTGTTGTGCACAA GGCTCCAGCC CAGGTC CTGATGCAGGACTCTCfiTTCTG GAAGGCTGGGTGCCTGGCTCTC CTTGGGGCACAATTA GG CTTCCTGGCTGCATTCCTGGGGCATTCGCATGAA( N ) xCTCTGAAGAACCTTACTATCAGTATTACTGATATT ACCAGAGA( N) xTTGCTGATATTGATTATTCAGTAGGCCAGGGAAGCCCGTCTACCAGCCTACAGGACGTCATTG CATTCACCTTCTCAGTAAATGGCAACAGACTCTTTTCCTAATGTTTACACGTAATGGATTGAACTGGCTTAGCAT CATG CTTTAC CCATATTT CTGTTGACAGCCTGTTACTGTGTGGAGAGAATTG CTGGATAAGAGC CACTGAGAT CA AAGGGAT CTTTACAAT GGGC CTTT CAGTATATTTGTTC TGTAAT CATGGACTGAAAACTTACTGAG CACT TG CCA TCTGAATGAGGCAAAAAGGAA(N) xTATGTGTATGCATGATTTGTTTTTGCCTTAAAATTAGTACATCAACCGTG TAATGTG CAGGACATT TAAAATATGT CAGAGC TATGACAGGTTC CCAGCTACTGGGAT CCAGGACACCTT CCAAT TTCATCTTTGGCTAAGACTTTCTATCTCTTTTTTCTTCTTCTTTCTTTCT{ N)xGCCCAATACACTTCTGACCTC AG GG CATAGATG CATTGAAG GGCATCTGCTGCTGTG GC CTTTTGGATGGGAACC CTGTAGATCT CAG CTAG GTTG GTTTAT CAGCTATAATGTATTCCTTGATTTGGTAAAACAG CCAG CAGCAGAG CAACATGCTTTTAC CTCCTCTTT T T T T T T 4 p11^
> H s l 9 _ 4176228 - 4211314
GGCGACAGGCCGAGTCTGGATTCGGGATTAGAGAAGGCGATGTCCACTTTACTTTTCCTGACTTTAATCGTTATA CTGGGCCGGGCGTGGTGGCTCAC(N)xGGATGTACCCCCCACAGGAGGGAGGCACGGCCCCCCCAAATCCCTCCA GGAGGAAGACATGCCCCCCAAATCCCTCCAGGAGGGAGACATACTCCTTATATCCCCCCAAGAGGGAGACACGCC CCCAGATCCCCCCAGGAGGGCCACACCCCAGTCCCCCCATACCCTCTGGGTCTCTGGGACATCGGATTCGACCCC CAACCCCCTCTGGCAGAGCCCCCACCCCTGCACCCAGGTGGCACTCACGTCTTACACTTGGCACATTCTTCCACA AACATG TTCC CGTGGAGCTCTGCCAGTTTGTC CCTGTG GGAAGAGCAGGAAGAG GCTTGATGGTGGGAAAGGAT C CCTGGATGCCCAGGCTTTGGAGCTGGTGCTAAGCCCCTCTCCTCCAGGAAGGCTTCCCGGACTCCTCTCCCACAA CATTGATCAACTCCTTCTTTCTGGCTC(N)xG AATCATTAACCCCCAAGGCTGAACCCACCTTAGATAACGGGAA GCCCTTTCCACTCACAAAATGCCCGTCCCCCTCCGAGAGCATATGGGAACCTCTTGATGTCACCAGGTCAGAAGC ACAGGAAT TCAAATTC CCAGTTTATAATGGAAGAAAC CGAGACATTGAATGG CC CTGAAG CATGAGATC CAATC C CAGC( N ) xCTGCAACTTCTAGACCCTGACAGCTCTCCTCCCCTTAGCCCCATCTCGAGGGAAATTCTGCCTCTGG CCTCAG C CGGGACAGCTCTGAG CCATGGGACT CAGAGATCTC CACACACTGGGC CAAGAACT CAGGGCTAGAAT C TTATAGAAC CAGGAAAñGGGGACT CT CCCCCTAAAAATGCAAACACGGTTTGTGGGGCCC CAAG CTCTTTTGTGG GGGC GGGG CCAGGGTGTTACCTGGGGAAGCCTGAGC GCACATGGAGCCCGTCCACGTTCTGG CTGACCAGGAAG C GGAGGAG GCCCACGCGCTCCAG CTGCACCAGCGC CATGTGGG TCTGCGTGGGCCGCGCGC TCTCAAAGGTG GTG T CGAACT TGGGGGCCAGACCTCGCTC CTCCATG GTCCAGACTCCGTGGGGACCCC TGAAG GTGGCAGGCCGGGAGA GATGGACGGG( N ) xAGAGAGACAGGGAGACAGAGAGAGGGAATCAGAAACAGAAATAGAGGGTGAGTTGCAAGAA CAGGGAGAGGGGGATAGAACAAGACACAGACAGAGAGATAAAGAGTTAAAGACAAAGCAAGTCAGAGACAGAATT
G(N)xTGAATTCACACGAAGGCAAAAATG(N)xAAGGAGGAGAAATTCACGCCTTGGC( N ) xAGTGGAATGCTGG TTGTGACAñGGGACACA(N ) xATGCCCAGCCAAGGGACACATCTTAAAGGGAGTACCCAGGACACATCCAGGAGG AAAG CAGATGGATGTGTT CTCATT CC CAGATGTGTGTGGCTCGCAGGGAAGC CTCTGGCAGCCTGGTAGGCCTTG CCTCGACTTCCCCTGCACAATCACAGACCTGAAGTCGGGGATGCCAGAGGCAGTGCTGATGCCGGCACCCGTGTG GAACAC CACACTGGAAGACTGC CAGACCAGCCTCGC CAGTTCCCACACCTTCCGCTCCAGCTCCTCCGGGGGGTC GAAGATCTGTGGGGGGAGAGAG CAGACGGAGGGGTCAAAACAGTTC CCCCAAGTGG CAAGGCCACG CACGAG CCA CAGACT GAGCTGTC CCTGTCTT GC CTGAG GGAG GAAAATGGATT GGAAAAGG CAGCTTGTCC( N ) xCCATGCAAC CCCTAGTCAGTCTCCCTTA C T A ( N ) xGTGACTAACAGCAGCAGTGAflGAGTGGGGCTTCTGGACGGGCGCACACC ATC (N )xCCGATCCCTCCACTTCCCCTCTG <N)xGTTCTCCAGGTGTGTTTATTCCCACCTCCCTTTGCGCCTGT CGTC CC CT CTTC CTGGAACTACAT CCCCTCTCTCCCCGATCAGCCCCGACAGGG TGACCACAGAGTTGGG CGTGT TGGCGTCCCCCGGCGCGCAGGGGCGTCACTTCCCGCCCCTTCCT CACTGGGAAG TCCCTCCCATTGTCTAGCCTC AGTGCCCCCTGATATTCCCACAATGCCCCCCTGCCATCCGGCCGCTCCCAGCCCGCGGCCCTGGGGCGCGATGCT CGGGACCCTCAGACGCGCTCACCT CCGGGAGG CCGCACTTGCCCTTGTCCGCGTACGGCGACAGCCCCGCCGCGT AATT CACCGACATC CT CGA CTG CC CCACGGGAACAATAAAGTTT CC CTTGTTGAGG CCGC TT CCGC CGGAAG CG G GG CGGGGCGGG GAAGAAG GAGGAACCGCGTTC CAGT CCTGCGCACCCGGCCTCC CACGGCAAGG CG CATG CG CCT CT TG CTGACGCCGCAG GC GACATGTTATCTGC TGT CAGAAGGAAGC CTGCCTCTTTGCATGCAGGTGTTTGCGGG GCTTGGGAAGGGGCTCCCCGATGACCCGGGCGGGAAATTGGGGCGGCCGCCTACGTGAGAGTTCTCCAGTCACCT CTAAAATG CGGGACACAGGCTATCCATGTCCCCACAACCCCTTT TACGGATTAGTAAATT CC GTAAGGACAGAG C CG GACAG CAGGGACCC CAG CCTAAGGGTTTTGAATG CCACGCAGAGTTACCT GAATATCT C A T T (N ) xAAGTTAT CTGAGT AT C ATC CAACTGGAGGTT CTG C ATTCTG CAGG ACAGGT CCTGCTCAAGGT C AATGGGG CTGGTGGC CTG GAGG( N ) xTCTTCCCGCCAGAGCCAGTTGGCCAACAGGCAGTCCGCTGCCAGCCTGTCTCCTCCCTCGGCTCATC CCCGGCCCCCTGGGGCGGCCGCCTGCTCAGAG CCGATCAGGTGT CAGAGTGGAG CGGCTGGAGCTG CTCGTGGCC CAGCTGGGAGTG GGTAAC CGGAGCCGGCAGTGGGCACTGTGG CCGGGAG CTCGGGG CACTGGAG CTGCAGGAGGT AGGG GAGGAAGGAGGTGGGAGAGG CC CAGGAC GGAG GATCCC CTTATGGCC C CAGG CCCCTGAGCC CTTGTATGT GGGAGAAAGAAG CCGTGCTGAGTC CTAGCCAGGCTG CGGAGCTACC CAGGTACC CTGACACACACATGCACT CGT TCACCCACGCACTTCCCACTTGCCGGCCTGTGCCCTGCCTAGCTTCCTCCAAGGCAGGTGCTCAACATCCCACCC GGCCATCAGGCCCTTCTTATTTACCATCACCTCTTCCAGCAAGCAGCTTCCTAAATTTAGCAGGCGTTTCTCCCT CCCCCAAGCCCACGGATTGTCTTCTCCCAAGCAGCCTGTGAACTCCTCTGTTGGGTGTCTTATTACCCCCTGCTT TGGGTGTG ATTG AATC CTC CAGTC AG AGGTAG AAGAAAAAA (N ) xATCTGATT AAAGGGTGATG AAGAAC CATG A GAAC AT GAATTGGGTGG GTGGGTTTGTGGGTGGTGGCCAGTGGGTG GTTATACAGTGCTT AG GTGGGAGG ATGG A TGTC CAGG TGAGTG CATAGGTG GTAAATGCGTGGAAGGGTGG CTTGGAGAAT TCACGAGTAGATAGGTGGACAGG TAGAGG C CATGC CAAC CC CTACTC CTTGGTAAGAGG CAAATACAGATTCTCCAT CACAGAAATC CCTTTCACACA TAGGGCCTCACAGTCACACCCCTCAAAAAACACACCCCCTCAAGATAGGCCTGCCTGACCTCCGTGCCTGCCCAT GCGGAACGTAGAACAG GC CCAGGTGGACAGAAGTGCACGTGT GCACACGCCC CACACTCAACGT CACACACATG C ATGCAG CCATTG CCCCAGACCTCCAC CCACTATG GGTGAATTGC TGTCACATGGAG GTTC CCTCTGA CTATGTAG CAAACATG C CTC TGTG CTGAGAGC C G TC ( N ) xAGGTCTGTGTCTTGCTCTAGGGGCCAGGACTGGTCAGGAAGGA GGAAGCG GTCCT TTTGAGGAG GG CGAGGCAAC GGACCCTGCCGCCCACCAGGACTGGTGTCCCTCACCTTACCCC CACCCTTGCCCTCCTCAGGTGGCC TGTGGAGAGGAGAAACACAGGG CACCAACTATGAAGACTC TCAGGGCGCGA TT TAAGAAGACAGA GG TGAGTGTGAGGCCCTAGATG CCCGATACACCCCTG CAACTTCAG CCTTCTGTCTCCTGG GG CTTGGC TGAAGATC CACCCTTTCATTCCAGTACC CACCTCTT CTCCTTTCCTCTCTGCTGCCCCCTAGCGTCA TGACCTCCCCACTGCTCCCCGTACAAGCTTTC CAGC CCAGGCCTTTACCCAGGCCCTAGT CCGCTCTCCATGGAA TTCCTTCTCGCCGCATTGCCACCAATCAGACCCTCCTAATTCCTCACAATTTGGCCCCAGTACCACCTCCCCGTC CAAT CT CCGCTCTC CC CGGCCACAGTAACCCC TTGC TTTTCCAAGGACCCAC TTTC CCGAGGAT CCTGATCTTTT GCTTTGGTGTG(N)xATCCAGCTGTCTTCTCTTGCATCCGTGTCTTTGGCTATGAACTCCTAGAAAGACATGCTT CTTCCTCCATTGTCCCCATTCTTCTCAGCCCCCTCAAGTCCCTGTGGCGGCCAATTCAGGGCACAGATGGGGAGG ATGAAAGG GAG GGCTCACCTCCAG GGACAGGCAATCAGC CTT CCTGAGCACCTCTTTGTGCCTGGCCTGGGTGGG CAAG CT TGA CTT G GAG CC TTTAGTTTTCGTAT CATG GGATA CAACATCCC{N } xTCTTTGAAAGCCATTTTTGAA AAGCCAAAAAGAGAGGCCTGTGGAACAAGCCTGGATTTCATT ( N ) xCATTTTATTTTGTTTTTAAACTATTTATT AATTAT CATGTACC CATT C A T TTCA TT(N )xTTATTTATTTTC ATTTTTGTTG (N )xATG AACATG ATTGG CC TG GAGGTGGGAGGGGAGTGCATTGATTGAAAAGAGAATGGGTGGAGACAGACAGCGGGAGAGAAAGAGGTGAAAAAC GGTC CATT AACTGC CT AC AAAATGTGGC ATTGGGTG AGCTTC ATTC ACTGTGG AGTGATC CT AAAATTTTTCTT C CCACGT GGACTCAATT CCTCGATG ATAGTTGACC CTTG AfiCAATTACAGC C C ACCCTC CTGC AC AGTC AAAAAT C CATGTATA( N) xCCTGAATGCTTTTTTTCCCCAAGCATTCCTCAGAAGTAGTAGCCACTGGTAAACCAAGCCATA TGATTG CTTAATTTCCAAAG CATAGT TATATT TAATATTTAAGTAAATCT TT CAAGAAGTGTGGGTGG CTAAAAA AGAGATACCT CATTCGG GATGTG GGTGTGTTTTGATATGT TTGCAATGGGAGGGGT CG GGACAAAGT CAAAATTG GATGGGGACAGGATAGGTGTTAAAACCTCAGACGGGTGTGAGGGCCTCTGAGAGTGTCTGGAGCTGGAGGGGCCT GAGGTTAGCCCATAGCCTGGGCTGGAACTGTTGCAGCAGATGGGAGGCCTGGCTGTTGCTGGGAAGCTGCAGGGC CTCCATCCTCCCCTCCCAGAGCACCTTGCCATCCATCCCTCCACCTGCCCTTCTCCAGTTTCGCAGCAGGCAGCC GGAGTCAAGTTCAGCTTTTGTTCAGCAGCCCCGAGGGAAGAAAGCAAACACCCATGACTGCAGGGGCCAGGCCCC CCCCTCCTGTCACTGGGACCTTGTGTTCCGTGTCTTCTAGCCACTGCTACCCCTCTCCTCCTCTCCTTAGGGCAC TCTCCGGCCTTTCCTGGTCCCTAACACCGGCTTTTTCTTATTTATGTATTTACTTTTATTTTATTTTCTTTCTTf N)xAACACCGTTTCTTTCTTTGGGGTCCCAACAGTTGGCATTGCTTACTCTGATCCCCCTTGGGCTG(N)xTTAC CTGGAG CTGTTCACTTGGGG CTTCACTGACCT CCTGCCTTTACT TTAGGTTT CAGGAAAATCTC CCAAGTGGGAA GATTGGGGTTTCTAGAGAGTCACAGACTCCATACATCAATGCTTCCTGGAAAGTCCTCTTTCTCTTTCTGTCTTT TTT(N)xAAACTCCATGAGGTCAGGGGTCAGGCACTGGCAGAGTGGGGCTGCTCAGAG( N}xGGTACACACCTCA GTGGGCTCCGAGGGGCATTTGCTCTGATTGATTGATTGA(N)xGTGAGCCACCGCGCCCAGGTT(N)xGTGAGCC ACCATG CCTGGCCATTTGCT CTGATCTTGTATGG CATO CAGGAG GACAAGTGTAGGAGACAGTTGGAAACTGAAG CCCTGAGCTGGGAGATCCGTTTCGGTCTCTTCTCTGGAGAAGCTCAGAGGACCTCCCCACCCCCAAGGGTGGAAG GAGAGAAGAGTTCCAAGGACACCCTCCTTCCCAAATGCCTGTCACTTTCCATTTTCCTGCCTTGGTTCCCACTCC TCCCTCCTCTTGTCCAAACATGTCTAAGAG( N ) xGTCTAAGAATAATCACGGACCACACCGACGTGTAAATGCTC AATG CT CACC CAGCCCTAAG CCAGTGGT CAATGTGTCAAT GGA(N)xTTCCTTGGTGCCT(N>xATTCTGGCTCT GGAAGACCTCGGTT{ N) xTCCAGGGCTCCTTCTAAAATGCCAATTACCTGCAAGG(N)xTGGCTAGAGAATGTCA GAGCTGCGCCGGGACCAGCTAGCTCTTTCTGTGAGGAACGAGTCACAGT(N) xTGTGTAGGATTGCAGCTCCCCA GTGGC(N) xAGTGTGAGTGGCGAGAGCCCTGCCGGCCGCGTGGAGGTGCGAGGCTGCGGACGTCGCGGGCCCGGA GGCACCTGCGCGCCCTTGGCCGACTCGGAGGAGGTGGAGATGGACGCCCGCGGGTCCCCTGGAGATGCAGCCOGC GGCCTG CGCTGGTGAGGGAG CCGGGCCCCCGGCGCCGCGTCCTCCTCATC CT CCAGGCGACAAGGTCAGGAGGGG CCGGGGCGGCGCCCCTTCCCTCAGCCCCCAGCCCCCAGCCCCCTACCTGGGTCTTCCCATTCCATCCCGT( N) xC TCCGGCAGCGCCCAGCCCCGCCCTCCGGCCGCTCCCCGCGGTCCCTCCAGACCCTCTGGCCGCCGCCTCCTCTTG GAACCCCGTGC(N) xCGCCATGAAGCAGCTGTGTCTGTGCGCAGCCGCCTCCTTCGCGGTAGGGCCCGGGGAGGG GGCGCAGGAGCGGGCGGGGCGCGGGTACCTCCTCTCCCCTCCCTTCCCCGTCCCCGGCTGACCTGGACCACCCCC CCATTC CAGACCCGGGAAAGATGGTCGG CGGC GGGGGGTGGGGGGGAACAGAGGTTGGGGCAGC TTTTGGGGGAA TGGAAGAGACATTTGGGGAGACATGTGGACACGTTTTGGAAGACTCTTTGAAACAAATGGGGACATTTAGGAACA TTTAGAGGGCATTGGGGGGTGCAGTTTTGGGAAAACATTTTGGAGACAGATGGGGTCATTGAGGGGACAACTTGG AAGCTATTTTGGGAGAGCCGATTTTGCGAAGAGGTAATGATTTGGAGAACAGTTTGGGAGAACATTTGAGGGATG TTTTTG3AGGATATTTTTGGGACATGACGGTATCATTGGGAAGGATGGTTTTACAGAACACTTCAGGAATTTGGG GGAGACATTT CGTAATG CATTT CAGG CTATAGTGTTGAGGGGAGAAGCCT CGGGATGTTTAGGGGACAGTGGTGG CATT TCAGGG CAGTTGTGGATAAACAGG GCAGAGGATGGT CCTGAGAGGCAT TTAAAGGGGACTTTTAG GGGTT C AGATG GGACTTTGGAGGGTAG TTTGGAAAAGACT CATGGATCCATTTGCTGG G GG CGATGTTGCAGGG GTGGCTT TCACTAAGGGATGAATTGGGGGACAAC(N)xCAGGGGACAACGTTTTGGAAATAGATGCCAGGGGGCTGCTCAGG GGATGATGCTGGTGGTGGGCTTTTGTTGGACAACTGGGGTGATGGGCCTGGGGGCAGATGCTGGGGGTCGCAGGC TTGGGGGACACTGTCTGAGGACCCCCTCGCCCAGGACCACCTCCCCCTGCAGCTGCGGCTCAGCCCCACTGACCT TGGCTCCTGCCCGCCCTGCGGCCCCTGCCCCATCCCGAAGCCGGCAGCCAGAGGCAGGCGCCAGGCAAGTGCCCA GGGGCAGGTGGTGAGAGCCCGGAGCCCCTGTCGGGGGCGTGGGGAGGGGACAGCAGCCAACACTGCCCCACGCAC TTCTGGGCGTGCCCTGCAGAGTCAAGACTGGGGCAAGAGTGACGAGAGGCTGCTACAAGCCGTGGAAAACAACGA TGCACCTCGGGTGGCCGCCCTCATCGCCCGCAAGGGGCTGGTGCCCACGAAGCTAGACCCCGAGGGCAAGTCCGC GTGAGTGCCCGCGACC CGGGAGTGAGATGGCTGAGGGGTGGCAACCTTGCGG CTGAAC CCTTGTCTCTCACCTCC AGGTTC CACCTGGCGG CCATGCGGGGTGCGGC CAGCTGT C TGGAGGTGATGATAGC TCATGG CAGCAATGTCAT G AGCG CGGACGGGGCAGGTACTG CCAGCTGGGCCC CGGGGAGGGAGGAGGAA CTAAG CC CAGGTG CCCAGC CTGAG GGTCCAGCCAGACCCTGCTCCCAGGTCTCAATGTTCCCCGCTGAAAAAAGGGGAGGCTCCCTCTGGTGCTCCTCC CCTG CC CCCAG GGAGACACACAG GGG CATGTTAGGATGG GGTGG C CAAAGATAGAGACTGTGGCTGGCAC CGAG C AAAGCCAGTGGACAGCTTGTGGATGGGGAGTGACTAGGATTCAGGTATTGGTTCAGTTGGATGGGGAAGTAGCTG GCATGGGGAAGTAGCTGGGGCTTGGGGGATAGCTGCAAAGTTTCTGGGG(N)xTGGGGAAGTTTCTAGAAAGTGA CTGAGG CAGA GGAAGAACTGGGGCTTGGGAGATAACTGG GACCATTGGACGAACTAAG CTTTGAGGTG CATCTGG GATATAGGAGTATCCGAGATAAAGGAAGTGACTGAAGGGAACTGCTTGGAGTTTGGAGGAGGAACTGGATTCATG GGAGTAGCTGAGCTTGCAAAGTATCTGGGGCAAGAGAATAACAAGTAGAGTTTGGGGATATCTGAGGTATATGGG ATAGCTGGGATCTAGGAGTAGCTGGAGCTGTGGGAGTAGCTTGAGCTCTTGGAGTAGCTGGGTTACGAGGAGAAC TTGTATGTATGTATGACTTAACTGGAGACCTGGGGGGAATAACTGAGATAAAAATGTGTGGCCAAAAGGAGTAGT TGGG GCTCAGGAGGCAACTAG G CATGGG GACACAGTCAAATCTGAAGCTT TAG GGT CACAGG CATGGTG GGCAGG GAATATGGAATAGCTGGGGATTCATGGAGCATCAGACAGGGACAGAGTTGGCATTTGGAAGAGGAAGGGTATATG GATTGG CAAGAACATATACAGCAACTGTG GGTTGTGAAGGGCAGAG CTGAAACTTTG GTGTAGT G GAC CCTGTAG AGTATCTGGGGACATGGGGAGTAGCTGAGGCAATAGGGCTTGAGGAATAGGTGGGTGTGGACAGTCGCTGAGATG GGATTGTCTGGAATACAGACTAGCCATGGCTTAGAGAGTCTCTGGGTACAGGGAATAGCATGGTTTTGAGAAGTA CCTGAT{ N ) xTACTAGGGTTTACGGACTTTGCTAGACTCAGAAACTCATCACAAGTAGTTGACATGGCTTGGGGA GCAG CTGGGAGCTACT TGAATAGCTGGAGCTC CAGTAAAT CTTCTTTCCTTC CCCAGGTTA CAATGC C CT CCAC C TGGCCG C CAAATACG GGCACCCACAGTG CTTGAAGCAACTACTG CAG GT CATTTACTGTC TTAT CTCAGCTACT C CCTTGGCCCCTACTACTCCTGAGTTCCAGCAATTCCTTGAACCCCCAGATGCTCTATGAATCCTAGATATTCCTT GGCCCCTTTTACTC CAGAA CCCTAGCTACT CC CTGGATAACCAGGTACTCT CAGACCATC CACATGCTTCTC CTA TTTAAGATGTACTCCCA(N ) xTATATATATATACTCCCAATGCCAGGTATTTCCTGAGCTCTGTTTCTCCCCAAG CCCTGGGTCCTTCCTTAGCCCCAGACACTCCCAGAGTTCCCAGCCACTCCTCAAATTCTCAGCTGCTTCTCAGCT CT CAGCTATTTTCTTTTCTCAC CAATTTTCATGAAAAT G GfiGTC C CTTGGGGGCCACGGAAGGGTGAGGCTGAAG TATAGCAGAGCCCAGGGTAGGGAAGGCAGTCCTAGAGAAATCAGGCAGCAAGAAGGATGGCTCCGCCTCAAAGGA CTCGCCCCATCCCCAGGCTTCCTGCGTGGTGGACGTCGTGGACAGCAGCGGGTGGACTGCCCTACACCATGCAGG TGGGTG CAGC CCAGCCCTGCCCTGACCC CGAAGC CCAGAGGGTGCTAGACTTGGGG GTTATTCT GCCCAGGGCCC TAGGGAC CTGACCTGCTGTGAAGCTT CCTGTCTCCTCC(N ) xAGCAGTCGAATGTACTCCTGGGCTTTGTGAGGA TGAGGAGAGGTCCCTAGGGTTTAATTTC CTTTGTGGAG CC CCTAGGAGATAC CAG GAAAACCATAGCT CCAG GCA GCTGGTACCCTCATAATCACCTAATC CAAACT CCTT CC CA GAGGGAGGGAAAGGGA CCCACCGA CTTGAACCAGA ACTCAGCCCCCGACCTGTCCGCTGTCCC CGTGAAATTCAGGCATGG CTTTCGCTGACGAC TCAAGAAACATCTGT TCTTTC GGAAACT CAGAATAAAATAGGG CTTTGG GGGTTTTAGT CACTGGGTGAGAATTCTCTG CAGTTGGAGTA TG CTGC CACCTAGTGGCGGCAATGCTTCTGGT CACAATATAG GAGT CCGTTGGTTTTTGTCACT CAACAGTTAAC TG (N ) XCTCATACTGCCTCAGACATACAGCAGGTATTCAAACAGAGTTTGTTGACTGAAAAATGGATTTTTGTGC TTAATCAGAC TGGATGTCAACCTGCTG GAATT CATACC CAGGGATT CCAGAGACCATGCC CTTGTGTCAG{ N) xA TTAGCCAGGCACGGTTGCACATGCCTTGTAGTGTGGGT(N } xAGGAGCAGGGTCACACCCAGGCAGAAAGTGCCC AGAAAATGTCGAAGTCT CACCCTACC CC TACT CT TC CC C C T (N ) xGGACAACCTTTTCTCTTTTGCAGCGGCTGG TGGCTGTCTCTCCTGCTCAGAGGTGCTCTG CT CCTTTAAG GCACAT CTAAA C CCC CAAGATCGGGTAAGCTT CTG GGATCTCTTCAGGGAAGATGTTATGCTTGAGGTATGCTGCCACCAAGTGGCAGTATCTCAGGCACCCTTTGAAAG T CTGAGAAAGTTTGAGGTGAAACTCAAG GG CAGTTATATAGG CTTCTG CAC CTCCCGGGG GTCAGTTT CTCATG C CATCGCATCCCTCCTCCTCCCTACCAGTCAGGCGCAACACCCCTCATTATAGCAGCTCAGATGTGTCACACAGAC CTGTGCCGTCTCCTACTG CAGCAAGGGG CTGC CG CGAACGAT CAGGAC CTG CAAG G CAGGTGAG CATCTCCC C (N ) xAAGTAAGACGATCTAGCCAGCCTCCTCTTCCCAGCCTGGCCTGGGTGCTGAGGTGGGCATGGGGGCTTGGGGG ATGTTCTCATCTCCTCAGTAGCCCCCTCCCCTGGTAGGACGGCCCTGATGCTGGCCTGTGAGGGGGCCAGCCCCG AAACAGTGGAGGTCCTGCTGCAGGGCGGAGCCCAGCCGGGCATCACCGATGCGCTGGGGCAGGACGCGGCTCACT ATGGCGCCCTGGCGGGGGACAAACTCATCCTGCACCTTCTGCAAGAGGCGGCCCAGCGCCCCTCCCCACCCAGCG GTATGCAAGCCCCACCTCCCCAATGCATTTGCTTCTTGGCAGCTTCTTGTCACTCCCCTTCTCTTTATCGTGAAT AGTTTCAAGGTACC CCCGATTGGCTGCATT CTAGGAGGT CCTAGAGCTTACC CAATTCTACTCAGAACAGTTTCA AG GAGC CCCAGGGCATTTAGAGTAGT CTGGGAGGGG GT CTGCTT CTGCTTTCCTGG GTCATCTG TACAGTAAGAA CTTTGCCTGT CCTAAGAAGGGCACCC C C T (N ) xTATCCTCTCTTTTTAA CAGTCCCAAGTCCCTCTCTCAGCACT CTACCCAAGCGCCAGATTTCTACCTTAAACGCCCTGATTCTTGGGGAAATCGGGAGCCCCCTACAAGTCCCTGGG GCCCACCAATGCCCTTCCTGGGCCTGGGAACAGAATGCCTGACCCTGCTTCCCTCTCTCCCCAGCCCTCACAGAG GATGATTCAGGCGAGGCGTCATCTCAGGTATGGACCCCTAAGCAGTGAAGGAGCCCCTCCTCCCTGCATCCACAC TAACTTCTCCCCTGCCCCAAGCTCTGTGAGAAGAGAAGGAAGTTAGACCTGGGAAAACCTGTTTGGAAAGAATGC TAC CTT CAGGGT ATGCTG C CAC CAAGTG GCG G CAGTGTG CTC AG CCTAA CT C CC AG AACTGGGTGAGAATAC TGG ATGGGG C C AAATTATGGGGCTGGGGTG GGAG AGTGAGT C C TTTG CT CT AATT GCATCGATAGTTCCCCGTTCTCT TGAGGATAGAACCTGAACATCTAAG(N ) xGTCTTAATCAACAAAGACATAGTTTATCATTTTCTGTGTAAGGCAG TGGCATGGCCTTTG CCTGGAAAGATT TG CAAT CT CGTT GATTAAAAAT TATAGAGTTACT TAAT CTGATAAAAAA ATTAT C AGTATAGAC ATTGTGAGAG AAGTT AGGAG G AATTGT CG CC AACATTGGTATAGT CAG G AGTCGGAGGG C TT GAATTAGACCTTGATATGGATGTTAGGAC AAATAGAAGTG GCGAAGACG CAAAG CAAGATTGT CTAGAAACAA AGCAGACCATTGGCTTAAGCAGAGAGCATGAGATAGGAACAATATGCCTCCCTGTGGAGGTCTCTCCCCAGTAGA CAAAGATTTCAGAC CAGGAGGCAATGGGAAGC CATTGG CTTC CTTTGATCATGGTGGCG(N ) xTTCCTCATGACT GCACCTGTCTTGTGACAGGGATCTGAAACACAAAGGAAAGCCTCCACTGCACTCCA( N ) xATCTCAAGGTTCCCA TAA CTGATGGG GTCTTTG GATT CCAATC CTTGTCTC AT TTTC CC AAC CTGCCTGG G AAGTGGGT GGTT AAAGTTT AAAGCCTGGAAGAATGGG GGAAATAG TT CACAGG CATGAG GT GACCTG CAGGAGAG GCTGATGGTCTATAG GAGA TTGG AGGATG AGTTTGGCTTAAA CAT GTTG ATTT AG AAATGAC AGAGG G ATTGG AGGC AATGAC CAGCAGTCCAC AGATCAGTTTAGGACTCTGTGAACGTTAGTGGAAGAGATTACAGAGAACACTGGCTAGGAGCCAAGGCAAGAAGG CTTTGGAAAGGCATGTTTGCCTTTGAGTGCTTAACTTAGAGACTCATGGGGGAAAGAATCTGTGGATCTCTTTGA GAAGCTGCAGAACAGCAGATACACCTGAATCCTATTCATGTTCATCCAGAAACGCAGACTTTAAAACAACCTTGT TATTCTTCCTTTGTTTTGTTTCGTTTGGTTTTAGGGGGTTGGAGGGGAGGGAAGGGTGTCACCGGGGGAGCATAT GGGTCATTTGTTTG CAGTTCACAGGGGT TCTTAGTGAACATCAG CGAATATTGTCG GTGATTTCGTCT CCTG CTT CACTCTCTTTTCTAAAAACAAAATATTTAATGTAGCCCAAAGGAGAATAGTCAGTTTAATAACTAGCCATGGCAG CT CTTTGAAAACTG CAGG TTTCAAATAACTAGGC CTGCACATTT T CAC CAAC TCAGACAATTAAAAAGACTC CAG GGACCTAGGTGGTGCTGATTCAAGTGGCGCATATTGTCATTTAAAATGGAAAGTCTATTTCTGAAAATAGACTCC AGGTATGGGAGAACTTGT TTCT CTGCTG GAGAAGGTAG CC CATC CTAGGGGGACCGAGGTAAATTGCT TAGTGG C TCAACATCTACATAGTTAAAATGGGCCCATTTG(N ) xGCTATAGATAGGCAGCATGTCATTGGACTAGGTCCGCC AGATGCAGAATGTTGACAAAAG CAGAAGTC CC CT CC CCTCAGAC TC CT CTGGGGAAAGCTGGCAGCCTTGAC CAC CAT CACCGCTCTGCACAC CATAGCAGGGATATGAGAAGGG GCTGAG GTGAG CATTGGACTGGAGGCTCCTCCAGC TGTCTCTTGTTGCCTGTCCTCTAGTGTGTAGAGTTGCAGTGTGGCTTGAAATGGTCATCACACCCACCATCACCA TGGCAG C CCTGGGAG GAGGGACTGGCTG CTACTCT CTCAAGAAGGGATAACTGTTG CACAAAACACTG CAGACTT TGAGCTGGCCAGCAGTTGACCAGCTGACCTAACCTCCCGTTTTCCAGGTCTTAACCTACCATCCATTCCCTGCAA GTACAGAGGAAACTAGAAAG CTAC CCTT CT CC CAC CATAGTTCTGTAACTTTGGGATGGAGTGAGAAG CATGGCA T(N)xATATGAGTGTGGGAAAGGGGAAGCCATTGCCTCTTGTTTCCCATTCGTGACCATCCCTACCCACTTCAGT GGGC CCAG TGTTGC CAGACCTGAAATATAC CT CATGTGGG TGTT TGT CTCTGGTACTCTGGGTT CATTTCTG GTC AC CCTATGATATTT CCTTGC CCTAAG GTACTGGCA CAG CTATCGGGTAG GAGGAG G CCAAGGTAGAAGGCACAGA GC CñTT CTTG CCTCAC CTTCTG CCACACGGTGAC CC CACATGTCTGGGGAGAGGCT CCCG GCC CAGGGAGGAGGA GGTGGAAG GTGCACTGTCATAAC CAT CACAGGGGGCAATG GCGAGAAGGCTCATG G GCCACGTGCCTCGGCCATT AGTT GT CATGTCTGTTGTAC CC CAACTTAACTGACTTC TAACCGGCTGTGGTTATGAGTT CCGCTGGT CTCACCT TT CAGTGGGTTCAAGGGGGCAGATTTTG CTGACACCATAGGCAAGGAAGCAT GATGGTAAACAC CACT GGAATGA GAGG CCTTGTTAAT CCAG CC CAAG CT CCTG CCAAGTTAGAAGCTGAGGAGTCTAGT CTGG GCCACTTTGAGTACT GATG CATGAT CC CAGG GC GATGTG CT GTTCTG GG CACAGGGTCACACT GGGTGACATCTAACAG GCTTTGCTTGG GCTGTAGCTAGAGGGAAATGGGAGCTGGGGAGTGTGACAGGAGGGAAGTGTGGACTCTCCACCTACCAAGGGGCC TT CC CACATACTGT CCAT CC CATT CAGCGC CCTGAGGTGGGCGT CATT TCTC CCACTTGACAGA TAAGGGAATCA GACCATGAGG CAGAGTAACTGG CC CCAGGC CA CACACAACTA( N) xACACCTTCTGATGAACTGAGCTACCTCGT GCAGGACAGGCCCAAGGAAAGGGGCTCAGGCCAGAGTGGGAGCATTGAAACAGGGTTCAAAATGTCTGTCTGCCC AAAC CAAACGTTAATT TG CATACTAATAGTTTAAGGGCATAGATATAATACAGACC TCCC CCAAGAAGTGAC CTT TGAAGCCAACTTCTTCTTCATGTGACGGAATACATCAGACACCCTGTCTGCCGGGAATCCTTACCCTTCTCCTAG TGAG CCGTGGTC TCTGGC CTAT C CAGAAGCTC CTTAAAAATACTAGTG CATGAGATGTAG CATT GACAGGAG CCC TGGC CTGTGGGT CAACAGTT GCTT TGTCAAAT GCAATCTTGCCC TTGG CACAGCTCGCCATTCAGGAGGATGGAG GAAGTGGACGAGAGAGAGGGAGGTAAAAGAAGGGAGGGAAAGGAAGCAGGATGCCTGGCTGCCTGGAGCTTCCTT GGGCTCTTGTCCATGAGAGGAAGCCATACCTACCTCTCTGGGGTTTGCCAACTCTTGTCCAGGAACCTATTGGGC TAGAGTG GAG GGGAACTG CAGAGT CTCCCCTTCCTTTCTCTTCTGCCC CCAT( N) xGCTGATGGTTCTGTGGAGT GTTGTGCACAAGGCTCCAGCCCAGGTCCTGATGCAGGACTCTCATTCTGGAAGGCTGGGTGCCTGGCTCTCCTTG GGGCACAATTAGGCTTCCTGGCTGCATTCCTGGGGCATTCGCATGAA(N)xCTCTGAAGAACCTTACTATCAGTA TTACTGATATTACCAGAGA(N) xTTGCTGATATTGATTATTCAGTAGGCGAGGGAAGCCCGTCTACCAGCCTACA GGACGT CATTGCAT TCAC CTTC TCAGTAAATGGCAACAGACTCTTTTC CTAATGTTTACACGTAATGGATTGAAC TGGCTTAG CATCATGCTT TACC CATATT TCTGTTGACAGC CTGT TACT GTGTGGAGAGAATTGCTGGATAAGAGC CACTGAGATCAAAGGGAT CTTTACAATG GG CCTT TCAGTATATT TGTT CTGTAAT CATG GACTGAAAACTTACTG AGCACTTGCCAT CTGAATGAGG CAAAAAGGAA( N ) xTATGTGTATGCATGATTTGTTTTTGCCTTAAAATTAGTA CAT CAACCGTGTAATGTG CAGGACATTTAAAATATGTCAGAGCTATGACAG GTTCC CAGCTACT GGGATCCAG GA CA CCTT CCAATT TCAT CTTTGG CTAAGACTTT CTAT CT CTTTTT TCTT CTTCTTTC T T T C T (N ) xGCCCAATACA CTTCTGAC CT CAGGGCATAGATGCATTGAA GGGCATCTGC TGCTGTGG CCTTTTG GATGGGAAC CCTGTAGATCT CAGCTAG GTTGGTTTATCAG CTATAATGTATT CCTTGATTTGGTAAAACAG C CAGCAGCAGAGCAACATGCTTTT ACCTCCTCTTTTTTTTTTT
> H s l9 _ 4176228 -421 13 14
GG CGACAG GC CGAGTCTGGATT CGGGATTAGAGAAG GCGATGTC CACTTTACTTTT CCTGACTTTAAT CGTTATA CTGGGC CGGG CGTGGTGG CT C A C (N) xGGATGTACCCCCCACAGGAGGGAGGCACGGCCCCCCCAAATCCCTCCA GGAGGAAGACATGCCCCCCAAATCCCTCCAGGAGGGAGACATACTCCTTATATCCCCCCAAGAGGGAGACACGCC CCCAGATCCCCCCAGGAGGGCCACACCCCAGTCCCCCCATACCCTCTGGGTCTCTGGGACATCGGATTCGACCCC CAA C CC C C TCTGGCAGAG C C CC CACC CC TG CACC CAGGTGGCAC TCACGTCTTACACTTGGCACATTCTTCCACA AACATGTT CC CGTG GAGCT CTG C CAGTTTGTC CCTGTGG GAAGAGCAGGAAGAGGC TTGATGGTG GGAAAG GATC CCTG GATG CC CAGG CT TTGGAG CTGGTG CTAAGC C C CT CT CCTG CAGGAAGG CTTCCCGGACTCCTCT CCCACAA CATTGATCAACTCCTTCTTTCTGGCTC(N)xGAATCATTAACCCCCAAGGCTGAACCCACCTTAGATAACGGGAA GC CCTTTC CACT CACAAAATGC CCGT CC CC CT CCGAGAGCATATGGGAACCT CTTGATGT CAC CAGGT CAGAAGC ACAGGAATTCAAATTCCCAGTTTATAATGGAAGAAACCGAGACATTGAATGGCCCTGAAGCATGAGATCCAATCC CAGC{ N ) xCTGCAACTTCTAGACCCTGACAGCTCTCCTCCCCTTAGCCCCATCTCGAGGGAAATTCTGCCTCTGG CCTCAGCCGGGACAGCTCTGAGCCATGGGACTCAGAGATCTCCACACACTGGGCCAAGAACTCAGGGCTAGAATC TTATAGAACCAGGAAAAGGGGACTCTCCCCCTAAAAATGCAAACACGGTTTGTGGGGCCCCAAGCTCTTTTGTGG GGGCGGGGCCAGGGTGTTACCTGGGGAAGCCTGAGCGCACATGGAGCCCGTCCACGTTCTGGCTGACCAGGAAGC GGAG GAG G CC CACG CG CT CCAG CTGCAC CA GCGC CATGTG GGTCTGCGTGGG CCGCGCG CTCTCAAAG GTG GTGT CGAACTTGGGGG CCAGAC CT CG CT CCTC CATGGT CCAGACTCCGTGGGGACC CCTGAAGGTGGCAGGC CGG GAGA GATGGACGGG( N) xAGAGAGACAGGGAGACAGAGAGAGGGAATCAGAAACAGAAATAGAGGGTGAGTTGCAAGAA CAGGGAGAGGGGGATAGAACAAGACACAGACAGAGAGATAAAGAGTTAAAGACAAAGCAAGTCAGAGACAGAATT
G(N)xTGAATTCACACGAAGGCAAAAATG(N)xAAGGAGGAGAAATTCACGCCTTGGC ( N ) xAGTGGAATGCTGG TTGT GACAAGGGACACA(N ) xATGCCCAGCCAAGGGACACATCTTAAAGGGAGTACCCAGGACACATCCAGGAGG AAAG CAGATGGATGTGTT CT CATT CC CAGATGTGTGTGGCTCGCAGG GAAGC CTCTGGCAGCCTGGTAG GCCTTG CCTCGACT TC CC CTGCACAATCACAGAC CTGAAGTCGGGGATGC CAG AGGCAGTGC TGATGCCGGCACCCGTGTG GAACACCACACTGGAAGACTGCCAGACCAGCCTCGCCAGTTCCCACACCTTCCGCTCCAGCTCCTCCGGGGGGTC GAAGATCTGTGGGGGGAGAGAGCAGACGGAGGGGTCAAAACAGTTCCCCCAAGTGGCAAGGCCACGCACGAGCCA CAGACTGAGCTGTC CCTGT CTTGC CTGAGG GAGGAAAATGGATTGGAAAAGG CAGCTTGT C C { N) xCCATGCAAC CCCTAGTCAGTCTCCCTTACTA( N) xGTGACTAACAGCAGCAGTGAAGAGTGGGGCTTCTGGACGGGCGCACACC ATC(N)xCCGATCCCTCCACTTCCCCTCTG( N) xGTTCTCCAGGTGTGTTTATTCCCACCTCCCTTTGCGCCTGT CGTC CC CT CTTC CTG GAACTA CATCCCC TC TCTC CC CGAT CAGC CCCGACAGGGTGAC CACAGAGTTGGG CGTGT TGGCGTCCCCCGGCGCGCAGGGGCGTCACTTCCCGCCCCTTCCTCACTGGGAAGTCCCTCCCATTGTCTAGCCTC AGTGCCCCCTGATATTCCCACAATGCCCCCCTGCCATCCGGCCGCTCCCAGCCCGCGGCCCTGGGGCGCGATGCT CGGGACCCTCAGACGCGCTCACCTCCGGGAGGCCGCACTTGCCCTTGTCCGCGTACGGCGACAGCCCCGCCGCGT AATTCACCGACATCCTCGACTGCCCCACGGGAACAATAAAGTTTCCCTTGTTGAGGCCGCTTCCGCCGGAAGCGG GGCGGGGCGGGGAAGAAGGAGGAACCGCGTTCCAGTCCTGCGCACCCGGCCTCCCACGGCAAGGCGCATGCGCCT CTTGCTGACGCCGCAGGCGACATGTTATCTGCTGTCAGAAGGAAGCCTGCCTCTTTGCATGCAGGTGTTTGCGGG GCTT GGGAAGGGGCTC CC CGATGACCCGGG CGGGAAAT TGGGGCGGC CGCCTACGTGAGAGT TCTC CAGT CACCT CTAAAATG CGGGACACAG GCTATCCATGTC CC CACAAC CC CTTT TACGGATTAGTAAATT CCGTAAGGACAGAGC CGGACAGCAGG GAC CC CAGCCTAAGGGTTTTGAATG CCACGCAGAGTTACCTGAATAT CT C A T T ( K ) xAAGTTAT CTGAGTATCATCCAACTGGAGGTTCTGCATTCTGCAGGACAGGTCCTGCTCAAGGTCAATGGGGCTGGTGGCCTG GAGG( N) xTCTTCCCGCCAGAGCCAGTTGGCCAACAGGCAGTCCGCTGCCAGCCTGTCTGCTCCCTCGGCTCATC CCCGGCCCCCTGGGGCGGCCGCCTGCTCAGAGCC GAT CAGGTGT CAGAGTGGAG CGGCTGGAGCTG CT CG TGGCC CAGCTG GGAGTGGGTAAC CGGAGCCGGCAGTGGG CACT GTGG CCGGGAGCTCGG GGCACTGGAGCTGCAG GAGGT AGGGGAGGAAGGAGGTGGGAGAGGCCCAGGACGGAGGATCCCCTTATGGCCCCAGGCCCCTGAGCCCTTGTATGT GGGAGAAAGAAG CCGTGCTGAGTCCTAG CCAGGCTG CGGAGCTACCCAGGTACC CTGACACACACATG CACTCGT TCACCCACGCACTTCCCACTTGCCGGCCTGTGCCCTGCCTAGCTTCCTCCAAGGCAGGTGCTCAACATCCCACCC GGCCATCAGGCCCTTCTTATTTACCATCACCTCTTCCAGCAAGCAGCTTCCTAAATTTAGCAGGCGTTTCTCCCT CCCCCAAGCCCACGGATTGTCTTCTCCCAAGCAGCCTGTGAACTCCTCTGTTGGGTGTCTTATTACCCCCTGCTT TGGGTGTGAT TGAATC CT CCAGTCAGAGGTAGAAGAAAAAA(N ) xATCTGATTAAAGGGTGATGAAGAACCATGA GAACATGAATTGGGTGG GTGGGTTTGTGGGTGGTGG C CAGTG GGTGGTTATACAGTGCTTAG GTG GGAGGATGGA TGTCCAGGTGAGTG CATAGGTGGTAAATGC GTGGAAGG GTGG CT TGGAGAATTCACGAGTAGATAGGT GGACAGG TAGAGG C CATGC CAA C CC CTACTCCTTG GTAAGAGG CAAATACAGATTCTCCAT CACAGAAATC CCTTTCACACA TAGGGCCTCACAGTCACACCCCTCAAAAAACACACCCCCTCAAGATAGGCCTGCCTGACCTCCGTGCCTGCCCAT GCGGAACGTAGAACAGGCCCAGGTGGACAGAAGTGCACGTGTGCACACGCCCCACACTCAACGTCACACACATGC ATGCAGCCATTGCC CCAGA CCTCCACCCACTATGGGTGAATTGC TGTCACATGGAG GTTCCCTCTGA CTATGTAG CAAACATG CCTCTGTG CTGAGAGCCGTC( N) xAGGTCTGTGTCTTGCTCTAGGGGCCAGGACTGGTCAGGAAGGA GGAAGCGGTC CTTTTGAGGAGGGCGAG G CAACGGAC CCTGCCGC CCACCAGGAC TGGTGTCCCTCACCTTACCCC CACCCTTGCCCTCCTCAGGTGGCCTGTGGAGAGGAGAAACACAGGGCACCAACTATGAAGACTCTCAGGGCGCGA TTTAAGAAGACAGAGGTGAGTGTGAGGC CCTAGATG CC CGATACACCCCTGCAACTTCAG CCTTCTGTCT CCTGG GGCTTGGCTGAAGATCCACCCTTTCATTCCAGTACCCACCTCTTCTCCTTTCCTCTCTGCTGCCCCCTAGCGTCA TGACCTCCCCACTGCTCCCCGTACAAGCTTTCCAGCCCAGGCCTTTACCCAGGCCCTAGTCCGCTCTCCATGGAA TTCCTTCTCGCCGCATTG CCACCAATCAGACC CTCCTAATTCCT CACAATTTGG CC CCAGTACCAC CT CC CCGTC CAAT CTCCGCTCTCCCCG GCCACAGTAACC CCTTGCTTTT CCAAGGACCCACTTTC CCGAGGAT CCTGAT CTTTT GCTTTG G TG TG(N )x ATCCAGCTGTCTTCTCTTGCATCCGTGTCTTTGGCTATGAACTCCTAGAAAGACATGCTT CTTCCTCCATTGTCCCCATTCTTCTCAGCCCCCTCAAGTCCCTGTGGCGGCCAATTCAGGGCACAGATGGGGAGG ATGAAAGGGAGGGCTCACCTCCAGGGACAGGCAATCAGCCTTCCTGAGCACCTCTTTGTGCCTGGCCTGGGTGGG CAAGCTTGACTTGGAGCCTTTAGTTTTCGTATCATGGGATACAACATCCC(N ) xTCTTTGAAAGCCATTTTTGAA AAGC CAAAAAGAGAG G CC TGTGGAACAAGC CTGGATTT C A T T ( N ) xCATTTTATTTTGTTTTTAAACTATTTATT AATTATCATGTACCCATTCATTTCATT(N) xTTATTTATTTTCATTTTTGTTG( N)xATGAACATGATTGGCCTG GAGGTGGGAGGGGAGTGCATTGATTGAAAAGAGAATGGGTGGAGACAGACAGCGGGAGAGAAAGAGGTGAAAAAC G GTC CATTAACTGC CTA CAAAATGTGGCATTGG GTGAG CT TCATTCACTGTGGAGTGATC CTAAAATTTTT CTTC CCAC GT GGACTCAATT CCTCGATGATAGTTGACC CT TGAACAATTACAGCCCAC CCTC CTGCACAGTCAAAAATC CATGTATA( N) xCCTGAATGCTTTTTTTCCCCAAGCATTCCTCAGAAGTAGTAGCCACTGGTAAACCAAGCCATA TGATTG CTTAAT TT C CAAAGCATAGTTATATTTAATAT TTAAGTAAATCTTTCAAGAAGTGTGGGTGG CTAAAAA AGAGATACCTCATTCG GGATGTGGGTGTGTTT TGATATGTTT GCAATGGGAGGGGT CG G GACAAAGTCAAAATTG GATGGGGACAGGATAGGTGTTAAAACCTCAGACGGGTGTGAGGGCCTCTGAGAGTGTCTGGAGCTGGAGGGGCCT GAGGTTAGCC CATAGC CTGGGCTGGAAC TGTTGCAG CAGATGG GAGGCCTGGCTGTTG CTGGGAAG CTGCAGGGC CTCCATCCTCCCCTCCCAGAGCACCTTGCCATCCATCCCTCCACCTGCCCTTCTCCAGTTTCGCAGCAGGCAGCC GGAGTCAAGTTCAGCTTTTGTTCAGCAGCCCCGAGGGAAGAAAGCAAACACCCATGACTGCAGGGGCCAGGCCCC CCCCTCCTGTCACTGGGACCTTGTGTTCCGTGTCTTCTAGCCACTGCTACCCCTCTCCTCCTCTCCTTAGGGCAC TCTCCGGCCTTTCCTGGTCCCTAACACCGGCTTTTTCTTATTTATGTATTTACTTTTATTTTATTTTCTTTCTT{ N)xAACACCGTTTCTTTCTTTGGGGTCCCAACAGTTGGCATTGCTTACTCTGATCCCCCTTGGGCTG(N)xTTAC CTGGAG CTGTTCACTTGG GGCTTCACTGAC CT CCTG CCTTTA CTTTAGGTTTCAGGAAAAT CTC CCAAGT GGGAA GATTGGGGTTTCTAGAGAGTCACAGACT CCATACAT CAATGCTTC CTGGAAAGT CCTCTTTCTC TTTCTGTCTTT TTT(N)xAAACTCCATGAGGTCAGGGGTCAGGCACTGGCAGAGTGGGGCTGCTCAGAG( N ) xGGTACACACCTCA GTGGGCTCCGAGGGGCAT TTGCTCTGATTGATTGAT TGA(N)xGTGAGCCACCGCGCCCAGGTT( N ) xGTGAGCC ACCATG C C TGGC CATT TG CTCTGATCTTGTAT GG CATC CAGGAG GACAAGTGTAGGAGACAGTT GGAAACTGAAG CCCTGAGCTGGGAGATCCGTTTCGGTCTCTTCTCTGGAGAAGCTCAGAGGACCTCCCCACCCCCAAGGGTGGAAG GAGAGAAGAGTTCCAAGGACACCCTCCTTCCCAAATGCCTGTCACTTTCCATTTTCCTGCCTTGGTTCCCACTCC TCCCTCCTCTTGTCCAAACATGTCTAAGAG < N) xGTCTAAGAATAATCACGGACCACACCGACGTGTAAATGCTC AATG CT CACC CAGC CCTAAGC CAGTGGT CñATGTGT CAATGGA(N )xTTCCTTGGTGCCT{ N ) xATTCTGGCTCT G GAAGACCTCGGTT(N)xTCCAGGGCTCCTTCTAAAATGCCAATTACCTGCAAGG(N)xTGGCTAGAGAATGTCA GAGCTGCGCCGGGACCAGCTAGCTCTTTCTGTGAGGAACGAGTCACAGT(N)xTGTGTAGGATTGCAGCTCCCCA GTGGC( N)xAGTGTGAGTGGCGAGAGCCCTGCCGGCCGCGTGGAGGTGCGAGGCTGCGGACGTCGCGGGCCCGGA GG CACCTG CGCG CC CTTGGC CGACTCGGAG GAGGTGGAGATGGACG CC CGCGGGTCCC CTGGAGATGCAG CCGGC GGCCTGCGCTGGTGAGGGAGCCGGGCCCCCGGCGCCGCGTCCTCCTCATCCTCCAGGCGACAAGGTCAGGAGGGG CCGGGGCGGCGCCCCTTCCCTCAGCCCCCAGCCCCCAGCCCCCTACCTGGGTCTTCCCATTCCATCCCGT( N) xC TCCGGCAGCGCCCAGCCCCGCCCTCCGGCCGCTCCCCGCGGTCCCTCCAGACCCTCTGGCCGCCGCCTCCTCTTG GAACCCCGTGC(N)xCGCCATGAAGCAGCTGTGTCTGTGCGCAGCCGCCTCCTTCGCGGTAGGGCCCGGGGAGGG GGCGCAGGAGCGGGCGGGGCGCGGGTACCTCCTCTCCCCTCCCTTCCCCGTCCCCGGCTGACCTGGACCñCCCCC CCATTCCAGACCCGGGAAAGATGGTCGGCGGCGGGGGGTGGGGGGGAACAGAGGTTGGGGCAGCTTTTGGGGGAA TGGAAGAGACATTTGGGGAGACATGTGGACACGTTTTGGAAGACTCTTTGAAACAAATGGGGACATTTAGGAACA TTTAGAGGGCATTGGGGGGTGCAGTTTTGGGAAAACATTTTGGAGACAGATGGGGTCATTGAGGGGACAACTTGG AAGCTATTTTGGGAGAGCCGATTTTGCGAAGAGGTAATGATTTGGAGAACAGTTTGGGAGAACATTTGAGGGATG TTTTTGGAGGATATTTTTGGGACATGACGGTATCATTGGGAAGGATGGTTTTACAGAACACTTCAGGAATTTGGG GGAGACATTTCGTAATGCATTTCAGGCTATAGTGTTGAGGGGAGAAGCCTCGGGATGTTTAGGGGACAGTGGTGG CATTTCAGGGCAGTTGTGGATAAACAGGGCAGAGGATGGTCCTGAGAGGCATTTAAAGGGGACTTTTAGGGGTTC AG ATG GGACTTTGG AGGGTAGTTTGG AAAAGACT CATGGATC CATTTG CTGG GGGCGATGTTGC AGGGGTGGCTT TCACTAAG GGATGAATTG GGGGACAAC (N) xCAG GGGACAACGTTTTGGAAATAGATG CCAGGGGGCTGCTCAGG GGATGATGCTGGTGGTGGGCTTTTGTTGGACAACTGGGGTGATGGGCCTGGGGGCAGATGCTGGGGGTCGCAGGC TTGGGGGACACTGTCTGAGGACCCCCTCGCCCAGGACCACCTCCCCCTGCAGCTGCGGCTCAGCCCCACTGACCT TGGCTCCTGCCCGCCCTGCGGCCCCTGCCCCATCCCGAAGCCGGCAGCCAGAGGCAGGCGCCAGGCAAGTGCCCA GGGGCAGGTGGTGAGAGCCCGGAGCCCCTGTCGGGGGCGTGGGGAGGGGACAGCAGCCAACACTGCCCCACGCAC TTCTGGGCGTGCCCTGCAGAGTCAAGACTGGGGCAAGAGTGACGAGAGGCTGCTACAAGCCGTGGAAAACAACGA TGCACCTCGGGTGGCCGCCCTCATCGCCCGCAAGGGGCTGGTGCCCACGAAGCTAGACCCCGAGGGCAAGTCCGC GTGAGTGCCCGCGACCCGGGAGTGAGATGGCTGAGGGGTGGCAACCTTGCGGCTGAACCCTTGTCTCTCACCTCC AGGTTCCACCTGGCGGCCATGCGGGGTGCGGCCAGCTGTCTGGAGGTGATGATAGCTCATGGCAGCAATGTCATG AGCGCGGACGGGGCAGGTACTGCCAGCTGGGCCCCGGGGAGGGAGGAGGAACTAAGCCCAGGTGCCCAGCCTGAG GGTCCAGCCAGACCCTGCTCCCAGGTCTCAATGTTCCCCGCTGAAAAAAGGGGAGGCTCCCTCTGGTGCTCCTCC CCTGCCCCCAGGGAGACACACAGGGGCATGTTAGGATGGGGTGGCCAAAGATAGAGACTGTGGCTGGCACCGAGC AAAGCCAGTGGACAGCTTGTGGATGGGGAGTGACTAGGATTCAGGTATTGGTTCAGTTGGATGGGGAAGTAGCTG GCATGGGGAAGTAGCTGGGGCTTGGGGGATAGCTGCAAAGTTTCTGGGG(N)xTGGGGAAGTTTCTAGAAAGTGA CTGAGG CAGAGGAAGAACTG GGGC TTGGGAGATAACTGG GAC CATTGGA CGAACTAAG CTTTGAGGTGCATCTGG GATATAGGAGTATCCGAGATAAAGGAAGTGACTGAAGGGAACTGCTTGGAGTTTGGAGGAGGAACTGGATTCATG GGAGTAGCTGAGCTTGCAAAGTATCTGGGGCAAGAGAATAACAAGTAGAGTTTGGGGATATCTGAGGTATATGGG ATAGCTGGGATCTAGGAGTAGCTGGAGCTGTGGGAGTAGCTTGAGCTCTTGGAGTAGCTGGGTTACGAGGAGAAC TTGTATGTATGTATGACTTAACTGGAGACCTGGGGGGAATAACTGAGATAAAAATGTGTGGCCAAAAGGAGTAGT TGGGGCTCAGGAGGCAACTAGGCATGGGGACACAGTCAAATCTGAAGCTTTAGGGTCACAGGCATGGTGGGCAGG GAATATGGAATAGCTGGGGATTCATGGAGCATCAGACAGGGACAGAGTTGGCATTTGGAAGAGGAAGGGTATATG GATTGGCAAGAACATATACAGCAACTGTGGGTTGTGAAGGGCAGAGCTGAAACTTTGGTGTAGTGGACCCTGTAG AGTATCTGG GGACATGG GGAGTAG CTGAG G CAAT AGGGCTTG AGGAAT AGGTGG GTGTGGACAGT CGCTG AG ATG GGATTGTCTGGAATA CAGACTAGCCATGGCTTAGAGAGTC TCTGGGTACAGG GAATAG CATGGT TTTGAGAAGTA CCTGAT( N) xTACTAGGGTTTACGGACTTTGCTAGACTCAGAAACTCATCACAAGTAGTTGACATGGCTTGGGGA GCAGCTGGGAGCTACTTGAATAGCTGGAGCTCCAGTAAATCTTCTTTCCTTCCCCAGGTTACAATGCCCTCCACC TGGCCGCCAAATACGGGCACCCACAGTGCTTGAAGCAACTACTGCAGGTCATTTACTGTCTTATCTCAGCTACTC CCTTGGCCCCTACTACTCCTGAGTTCCAGCAATTCCTTGAACCCCCAGATGCTCTATGAATCCTAGATATTCCTT GGCCCCTTTTACTCCAGAACCCTAGCTACTCCCTGGATAACCAGGTACTCTCAGACCATCCACATGCTTCTCCTA TT TAAGATGTACTC CCA(N)xTATATATATATACTCCCAATGCCAGGTATTTCCTGAGCTCTGTTTCTCCCCAAG CC CTGGGT CCTT CCTTAG CC CCAGACACTC CCAGAGTTCC CAGCCACT CCTCAAATTCTCAGCT GCTTCT CAGCT CTCAGCTATTTTCTTTTCTCACCAATTTTCATGAAAATGGAGTCCCTTGGGGGCCACGGAAGGGTGAGGCTGAAG TATAGCAGAGCCCAGGGTAGGGAAGGCAGTCCTAGAGAAATCAGGCAGCAAGAAGGATGGCTCCGCCTCAAAGGA CTCGCCCCATCC CCAGGCTT CCTG CGTGGTGGACGT CGTGGACAGCAG CGGGTGGACTGC CCTACACCATGCAGG TGGGTGCAGCCCAGCCCTGCCCTGACCCCGAAGCCCAGAGGGTGCTAGACTTGGGGGTTATTCTGCCCAGGGCCC TAGGGACCTGAC CTGCTGTGAAGCTTCCTGTCTC CT C C ( N} xAGCAGTCGAATGTACTCCTGGGCTTTGTGAGGA TGAGGAGAGGTCCCTAGGGTTTAATTTCCTTTGTGGAGCCCCTAGGAGATACCAGGAAAACCATAGCTCCAGGCA GCTGGTAC CCTCATAATCAC CTAATCCAAACTCCTT CCCAGAGGGAGGGAAAGG GACC CAC CGACTTGAACCAGA ACTCAGCCCCCGACCTGTCCGCTGTCCCCGTGAAATTCAGGCATGGCTTTCGCTGACGACTCAAGAAACATCTGT TCTTTCGGAAACTCAGAATAAAATAGGGCTTTGGGGGTTTTAGTCACTGGGTGAGAATTCTCTGCAGTTGGAGTA TG CTGC CACCTAGTG GCGGCAATG CTTCTGGTCACAATATAGGAGT CCGTTG GTTTTTGT CACT CAACAGTTAAC TG ( N) xCTCATACTGCCTCAGACATACAGCAGGTATTCAAACAGAGTTTGTTGACTGAAAAATGGATTTTTGTGC TTAATCAGA CTGGATGTCAACCTG CTG GAATTCATACCCAGG GATT CCAGAGAC CATG CC CTTGTGTCAG< N) xA TTAGCCAGGCACGGTTGCACATGCCTTGTAGTGTGGGT( N) xAGGAGCAGGGTCACACCCAGGCAGAAAGTGCCC AGAAAATGTCGAAGTCTCACCCTACCCCTACTCTTCCCCCT(N) xGGACAACCTTTTCTCTTTTGCAGCGGCTGG TGGCTGTCTCTCCTGCTCAGAGGTGCTCTGCTCCTTTAAGGCACATCTAAACCCCCAAGATCGGGTAAGCTTCTG GGATCTCTTCAGGGAAGATGTTATGCTTGAGGTATGCTGCCACCAAGTGGCAGTATCTCAGGCACCCTTTGAAAG
TCTGAGAAAGTTTGAGGTGAAACTCAAGGGCAGTTATATAGGCTTCTGCACCTCCCGGGGGTCAGTTTCTCATGC
CATCGCATCCCTCCTCCTCCCTACCAGTCAGGCGCAACACCCCTCATTATAGCAGCTCAGATGTGTCACACAGAC
CTGTGCCGTCTCCTACTGCAGCAAGGGGCTGCCGCGAACGATGAGGACCTGCAAGGCAGGTGAGCATCTCCCC(N
) xAAGTAAGACGATCTAG CCAGCCTCCTCTTCCCAGCCTGGCCTGGGTGCTGAGGTGGGCATGGGGGCTTGGGGG
ATGTTCTCATCTCCTCAGTAGCCCCCTCCCCTGGTAGGACGGCCCTGATGCTGGCCTGTGAGGGGGCCAGCCCCG
AAACAGTGGAGGTCCTGCTGCAGGGCGGAGCCCAGCCGGGCATCACCGATGCGCTGGGGCAGGACGCGGCTCACT
ATGGCGCCCTGGCGGGGGACAAACTCATCCTGCACCTTCTGCAAGAGGCGGCCCAGCGCCCCTCCCCACCCAGCG
GTATGCAAGCCCGACGTCCCCAATGCATTTGCTTCTTGGCAGCTTCTTGTCACTCCCCTTCTCTTTATCGTGAAT
AGTTTCAAGGTACC CCCGATTGGC TG CATT CTAGGAGGT C CTAGAG CT TACC CAATT C TACTCAGAA CAGTTT CA
AGGAGC CC CAG G GCATTTAGAGTAGT CTGGGAGGGGGTCTGCTTCTGCTTT C CTGGGT CATCTGTACAGTAAGAA
CTTTGCCTGTCCTAAGAAGGGCACCCCCT(N)xTATCCTCTCTTTTTAACAGTCCCAAGTCCCTCTCTCAGCACT
CTACCCAAGCGCCAGATTTCTACCTTAAACGCCCTGATTCTTGGGGAAATCGGGAGCCCCCTACAAGTCCCTGGG
GCCCACCAATGCCCTTCCTGGGCCTGGGAACAGAATGCCTGACCCTGCTTCCCTCTCTCCCCAGCCCTCACAGAG
GATGATTCAGGCGAGGCGTCATCTCAGGTATGGACCCCTAAGCAGTGAAGGAGCCCCTCCTCCCTGCATCCACAC
TAACTT CT CCCCTG CCCCAAGCTC TGTGAGAAGAGAAGGAAGTTAGAC CTGGGAAAAC CTGTTTGGAAAGAATG C
TACCTTCAGGGTATGCTGCCACCAAGTGGCGGCAGTGTGCTCAGCCTAACTCCCAGAACTGGGTGAGAATACTGG
ATGGGG C CAAATTATGGG GCTGGGGTG GGAG
TABLA C
>Hs1_16927708-16936899
AGGCAGGGTTAGTCTGATATGATT TATT CT AACAGACA.GAAG CAGAAAT CTGTTATACTC TTTTAATTACTGTGT CTTTATAATñTTATGGTAGACAGAA(N)xAATAAAATGGAGAAGGCTTTGGAGTSGGGACAAGAAGGAAACGGTG GGAGAGGGATGCCTGTATGCTGATATGGTTGATGCCTGTATGGTTGAATTGGGTCTACCGTTCCTCATCTAATTA GCTATGGTCTATTAAGGTGCATAG CTACACACAAATAT TGGTACTACGTT CAAT TCAGAGGAATAAGATATTGCA TT CTTGACAGTAGACAAGAACACC CT GAATTTGGGGTCACTGTATCATAAGT CATGTTAT CAGGTC CCTCTAGGA AGGCTTAGAG GAAGAT TTCCAGGATACACTTGTGA CAACATTGAAGGCTT CTTTTTTCCC CAAAGG GACCCGAT C TCCCCTCAGTCGAGAAGCTCCAAGTCTCTGAACTGGATGCCAGGTTATAAATTCCCCCTATACTGACTCCATCAG
G(N)xAGTAAAAGATAGACTCATGGGAGTCTAGGCATTTATTCTCTTATTTTATATAAATCAGTTAATGTGCAGG AACAAAACAGACTTTGAAGAAAGA CACT CACAGTTG CCACAGGAAAACAC CTTCAACATCCTCATGAGTCATCAT GGGTGTTCTGTTGGGAGGACTTGATAGGAGGCTTTCCTCCTCACGGGCTAGTGCAGATCCAGGGGAAATGTCATC AAGTCCTCC(N)xGGGCCCTCAGTTTAGCATTCT(N)xGTGTAACGTAAGTTGATTTCTTAGTAGATGTCCCATC CATTACATTCCCAGACACCTCACAATGATTCGAATGATTAGTAACCACCACATATCCCTGCCTCTCAGGGAAATC CCTCCCGCCTTGTCTCTAGATGGCCAAGTCCCACGGCCTGTCCTCTACTCTTCCAGAACCCTGTTGTTCTCACTG A CAG CAGGGAGGGCAAATCCATGCAG CAGCTCCCGC CATGAC CTCCAG CCTG CAGAGGATGGGCGC CACAGGACT TTTAAACGCATGCCG(N)xTGGGAAGAAGGAGGGGATGTTATGGGAAAACAAAAGGAGAATACTAGCTAAGAACG CTAGGTGACATTAATATTCCGAAGTCTGTGCTCATATTCAGCAAAGAAAGTTCAGCATAAAGCACTAAATAAGGA GT CAAGATATTGTACTTCCAACTG TTGTTCCAACAG CT GTATTATGAAGGGC CACTTTAT TT CATGCCTTTCTAA TTTGACCTAAAGTGCCAGGTGGCACTGGGGCTGGCACAGCCTTGCTCAATTATGTGTTGCAGAGTACACAGAGAC TG C CAGGCTGAG GGAAGATGCAAGAGAATAGAAGAGAT GCTCTCAGGGAACAAGAGACCACATGGC CC CAGAGT C AGGGGCAGCATCAGCCACTGTCAGCTGCTCATTTTCCCAGACAGAGCCCACAAGCCTCAGCCATGCTTTGCTTCT GCAAGACGCTTCTTCACCTTTTCAATAAACCTGCCTGAATTTAAGCTGACAGGGTTTATTTCTCCTTCATCATAA ATGAAATTCTTCAC CACAACAATCTC CAATGAATTTTGGG CACAGCAGGCAG GC CCATTT CTGCTTCTGTTCCAC TATCTC ( N ) xA T TCTGTT ATTCTG GTTC CTTTTTGG CT AC TTTGTTTTTG GT AG CGTGTATC CT AAGG CGTCCAG TTGAACAACTTT TGTCTACTGTGT CCAG GCATTCCTGGTG GTATTTCAGATAAGACTCTC TTGGGTTG CTGAACT CACAACCACT GAAC CAATTCTATGAC CATCTGTTTCATGGC CACATGTTTGCTCATTTTATATGTACATAAAGGG AGG GGACAGA CAG CAAACTTGCG TGTTACAAATTGTAT CATCTTAAAAAGGAAACAAGGCAACACT TTGCAATAA AACCTTAAGATGCATGAAATTTGAGCCTAATGCAATAAAGGATGCCCATAAAATTCTTATCTAAAGAATGTTTCG AAAATTGTTGTACAAGGACATCAT CATTTAAAGTGATATGAA GAAACCTT CT CAGCTAAG CATATGGG CTAGATT AGAGAGAAAAATAAAGGACCCATC TCTG CCCTGGAAAAAC TG CTGGTAG CAT CTTTCAAAAA GC TCTCTGTGTTT GAGT ACGCAC CT TG AT CC ATAGGCTC AC ATTTGATC CC AA CTGGCAGCTG CTTCTTGGCATTAA CATTGG ATTC C CAACTAGTAAAT CTTACCAAGATC TGACTTTCTGCAGATATAATATTATTTTGTTTGACCAT CCTTAT CTTCAAG GG CTACCAAGAAGGAACCAAGAAT TTAT TTACCTCC CCAAGGGAAAAGGTTTTACCAATGAGAC CCTTTCTCAC C ATGACCCCAGGACC CCATATGC CCTGTT CACTTGAGTG CC CTGTGTGG CCTGATAGAAGC TCATGCTG GTCACAG GATTCCTTATATGACTAGCCTCCTTCCTGAATCCCAATTTCATGGTGGTGGTCATGACAGGTGTCCTGTATCCCA TG CT CATGTC CC TGAAGTCACCAG CCTATCTCCAGTTAGAAAAAATTACATGTATATAGAGAGG CCTCTTTGGAA G GAG CAAAAG CT TT CT C (N) x T CGTACACTAATGGTTG GAAG GTACAACAGCATATGCAC TTTGG GAAAAAATAT CTGG CATATT CT TACAGAAACAAACAA CTACCTATT CTAT GACTCAGTAATT CCTAAGCATTTATC CAAGAGAAA CTAAAAC(N)xGTGAGAAAAAAAGATAAATAATAATGGTTCCAAGAAATGCACAGCAGACAGCCCAGAGGCAAAG ACC CACAGGACGGCG GGCCGG(N ) xTTAGGATGCAGCAGCCCCATATCAAGGTTTTGGTGGCATCCTGTAATTGT GTGGTTAGTACTTGGCATTGAAGTGCACCAACCTGGAGTCAGAGCAGTTGGAGATTTCAAGGCCTGTGCCATTTA CCTCTAACCCTGGGGTGCCCCTGGAATACAGATAGCAGATCGGTTAAGGAGAAGCAGCCTCAGCAATCTAGACAG TG CAGGTTTCTGGTGA GGACAGGTAAAAACCATCTG GGTGGG CAGAACTTGGTGAAGACCAGAAAC CACTGAGAC TCAGCAGCTGCCGCAGTGGCACCCACAAATCAAAGGAGGGGGCTGGGAAGAGCTAAGGGCTACTGGATGAGCTCT CTGC CTGCAAGACAGAAGCAGAT C CAGAGATTTTGGAAAATAATGTAG GT TT CAGTACAGTGTGAT CT CTTCAAA AAAGT(N)xGGAAGGAAGGAAGGAAGGAAATGAACAAATTTACATGAAGATGAGAACAGTGGGGAAACTTACACC A C CAATATTTTC CATTAACAGGAACA CG CTAAGTAGTTATTAGAGAAAGACACG CTACTGTAAAA CAATATACTG TTTCCATGGGGTACAACAACCCCTTCCTCCTCCTCTGAAACACATTCTATCTCTGGCTCACTGTTGCCAGAGACA CTGAGTCTTGTCTTTGGATACGTTCTGGTGCCCACAAGAATGAGATGAGACAGTGGATCCCAGAACACCAGGCCA CGAACTTCCCTGTTGCTCCTTGTC CACT CCAGAAG CTACC CAGCTGCAGTTG GGGACCTCAG CC CCTGGGTCTGA TGTCATCCATTTGCCTTTCTCAATGGACTTCTCTCCTTGCACTGGCTCCTACTCCCCCAGGACCTGTGGGTGACC ACATGAGAAGAACACAAACAGGCCATGCCCCTTTCTTTCTCCCCCTCTCAATGCCTGCAGTAGTGGGTTCCATGG GGTAGTGACCTGAGATTTACTCATTGTGGGGCCTCTAGCCCAGAGCAGGGCCTACTACCTCACAGTCACCCCATG AATG CTCAGTGAAAGAAGACGT CCAC CACAAGGTC C TGGG GAACCAAGAAT T CCACTGTGGC CCATAAATTCTAA GTCTACAGGATTCTGGAATGGGAGATGGGAAAGGCCTTCAAAAGTGGCCACTTTTAACCCATTATACTGGCAACT GAGCCATGTTTCCCCATCCTGGACACATCCAGAGGGCACTGCCTAAAACCAGACACATCTCCCCACCCAGGACAG TGTAGGAGCCTTAG CCTGGGGGATGCAGGTGGACAGGGAGG G GGTGAG CCAC CAAAGCTGAAGAGCAGAAAGCAG GTGAAAGGGGACAG CAGGGTGGAAA CAGAGAGAAAT GGGG GCAGAGAATGGG GGGTGAGAGG GGAAGAGTGAGGA GAGGGATGCAGATCTAGCTAGTAAGGAAAAGTCCTGGAGAGAACACTGTCCTCTCCTGAAGTAAAATCACTTCTA CCTGACCACGGCACTGCAGCTCATGGGCAGCACATGCTGTGGATATTTG(N)xAGGCCCTGCAATGTTTAGGGAC CTTGACATCTTC CCTT CACATCTGAGTCATAATACAAAGAGGACTCTCTGAC CC CACTGAGCTGGCAATGCCTCG GGATTTTTAC CTGTTGGATCTGGCAG CT CTTGATGT CAGCC CACAC CATGTGAG GCTGCT CTTGGTGCAC C CAAT GGGGAAGTTTCTACATCAGGGCCTCGGAGAATCCACTGGAAGCCCTGGACAGTGGGAGTCAGCGGCATCCCCAGT GTGGAGGCCAAGAG CACACAGTGCTTAAGCTC CAGG CACCCT CAGGAGGACG GCAAGGGACAñTTGGCTGGT GAG AGC CCGG GTCAC CG G GAAC CTTCG CCTGGGTCTA{ N ) xTGGTGGGAACTAGAGGTGGTTGGGTTTCTGTCATATG TAA TCAACAGTCCT( N ) xGGGAAAAAAATAAAGAGTCCTGñCTAAATACTAGAGTAGCCAGGGAAGTTTTCACAA AGTAAGTAATATTTGAGGCAGATCTTAGTGAACAAGAATTCCATTATTTCTGTTAGGGAATTAAGAGAGTGTGGG TGTCGT TAGTTAATGCTTATTAAAGTAG CT TTGGAATCTCAT CTflCTGGTCTAG CTGGTCTATCTGTACACGTAT ATTGTATATG CTGT CT CT CTGAGCTTTCGCTAGGTTATGCTACGGTAACAAAAG CCC CAAAATCTTAG C A G C T(N ) xTACAAGCTACTTTATTTGTTAGATGGTGAAAACTGTGATACTCGGAGGTTGTTGAATATGGTATTAGTATGTT CATTCATTCATTCATTTAAGAAATATTTATTCAATATCTGTTTCATGCCAGGCAAGGTCAAGTACTGAGAATACA CTGGTGAATCAAAGAGA CAAAATC TCTAATTG CCAG GAGCTTATATTGAAAATCAGATTAAACACATACAAAATC ATCATAATAACAACAA TGAATACTATAT TCATAAATAATAGCTGTAAGAGATTTTAGTñ CñT CT TTTAAATTAGA A A A A T A TA A (N ) xTTATTTGATGTAGTCCTAAAACTATTATGTAGAATACTATTGTTTATATCACAGCACGTGAG CCCCTTAAATGG CTTAACACTTATTTAGGTAT GATC CATAAAGCTTTT CTGGTAATTAAGTATACTTAAGAACAA TTAAGTATAAAAGAGTTACTGCCTTGACAGGAAGATTGTAAAAATTTTAAAAAGACAAATAAATAAAAGAGTAAA AACTGTAG CT CTGTGAGG CTCAAATAACAT CTAATT CAAGTCACAATGAACATCTAG CAATCATTCTGAACACCA TATAATTCACTTAATACGTTTTG
> H s l _ 163925 04 - 16901 69 8
G GGAGAGAGAGACAGAG GAGAAAGTGAG CT CAGCGAATTGGC CGGGTGACACACTGA CGAAG GGGT CAAAG GA CA CTCTGAGTTAGTGCCCTCGGGACACACAGAGAACAGTGATCATGAAAAGAGTGGGCTCAATAATTTTCCATAAAC TTGCTTAAGATT CCATGCAGTTGC CATACAGC CTTT GAGGTATG GT CAAC CTACAGTAAGTTAG TAAATGATAAG GGGAG GAAGAAATG GAAACCTAAACATCTACTGCAAG GAAAACCAACAGCAATGTCAGTAGGAGTAATT CAACCT TCGTTGAAAACATGAAAT TGAACATACT CT TGTTTT CCCTGGAC CTGG CATCTC CAGGTGTCAACACAGAATTAA GCATCCATAATTGCTCAAAGTTAC CTGG GG CATGATGGGTCTTG GT CTTCTT CCACTTCTTGGTACTTTT CAATT T CTGCAATAAGT TCAGACATGGACAGACATAT TAAG CTG GTT CT CCTACACACATAACAATC CACTGT CTAAT CC TCACGCAGGGACTTCAGGCTCCTCñG CATGAGAATAGGACACTGTGAGAGAT CT T CTTCAGGAG GC CTGAAGGCT GATCATGATAGAGATT CCTGG GTTTTTGTC CCAGAAACTGTGG GTAAAATTC CCTATTCTGGTAGATCGTTATCC CAAGAT CATTTGTC CCAAGTTTGTGCAAATGGTTATGCCATATT TTTC CAAT CGATTT AAAG CAAATG CC CC CAA ATGGTTG C TGGGAGAAAAACTGCAATATTCAG CC CTGTCTCATCAAATACTCAGATTCTT CATGGTAG CGAG GAT TTTAGATG CTGAAATTAGAGTGAAGGATGAAATCTACAAGAT CTA CAAAATT GAGACAAAAT CAGAGTTGTGTGA ATTTGTCACATCTGCC CAGAT CCAACAT CTTGAGAGTGGGñTTAGG GTGCCACAGGCATGGC CTGAGACTAGGAA GAGAGC CCTGCTCACT GACCCATC CCTTGCCTGG GCTTCCAAGTGGAACTAGAGTTTCATT CAACCTACATGTGC CTATAGGT CCTC CCTGTGGCAATGACAT CT CT CAGC TCAGTAAGGG CCATTTGCAGTAGGAATATGAC CCTAACC AGAAGACT CAGTGGAT CCTTATCACCTT CATAGAAAG GTACT CACCAT CCATGT CAAGAG CC CAGC CAACACGCT GTTGCTCCAATATGTAAAAGGCACTTCTGTAGGGCTGGCATGAGTCAGTCAGTTCAAGATAACCTGAAGGAGTTG AATAACATCTATCCAGTGAGTCCTGCAAGACTTCAGGCCCTTTCTCATCCAGCAGCTCCCTGCTGAGCCTGGAAC AGTGGGAAAAAGTAAAGAATAAGC CAGGGGGAAT CAGAAAC CACACAGCCCCAGCTAGATTT CATGGCTAA CATA AGGAAGAG TT TGAAAAGAAAAAGGñ CAGAT CCATTAATGAGGTAA CAAAT TATT GCCTTTATATTGGGATAGACT AGGGCCAGGTAGAAAAGGATGAAAGAGAAAG(N)xGAGTGAGCTCAGTGAATTGGCCAGGTGACACACTGATGAG G GAGT CAACG GT CATT CTCTATTTGTGC TCTCAGGACACACAGTGAACAGTGAT CATGAAAAG CATGG C CT CAAT AATTTT GCATAAAATG TG CTCAAGTTTC CCTG CAGC CAC CATGAGAATACAG CTTTTGAG GTATGGTCAAC CTTC A CTAG GTT AGTAAATG AT AAG GGT AG GAAG AAATGG AAACCTAAA C ATTT ACTCTAATGAGAAC CAAAAAGCAAT GTAGTAGG CATAAT TTAGACTTGTCT GACAAGACAAAATCATTA TTTTCAGCATGTACTGTTTTCCCTGGACTTG GCATCTCCAGGTGTCAACATCAAATTAACTGTCCACAATTTCTCAGACTCACCTGGGACCTGTTGCCTCTTGGTC CTCCTTTTTCAC TTGATC CCACCGAT GT CCTG CAAATAAATT CAGATGGG GC CT CTTACATTAAGCAGTT CTTCC TTGCACACfiGAAACATTC CTCTGT C CAATC CTAACACAGGTACATCAGTCTGGT CAGTGTGAGAACAGGAGACTT TGAGAGAAATATTCCAGCAGGCCTGAGGTCAAGTCTTGAGAAAACTGGCTTGGGTTCTTTCATGAGCCTTGGGCA AAATTACC CTGT TTTGGAATGTTATCTT CCCTATGTGCTCTGTCCTAGGTTTGTGTACA CAAAT GAGCAACTTTT TCCCCAATAAATTGTAGG CAAATAGTTCTAACAC CT CATAG GAGAGATACTT CAATATTAAG CTTT CT CT CATCA AATACC CAGAATTT GATAGTTTATGA GATTGTGGACACAGAGATTTGATGAAG GG GTGCAATGTAC CAGCTCTTG AGTCAAAATGAAAC TTGGTTCTACACAGAAGCAT CAGCTATTATGG CT TTTGTGGGTGAAAAGT CAGC CATTTAT CTAGAAAACATACCAGGAACATGACGGACAGATGAG CTAAAG CAAG CGAACTTAGAAGACA CAGAAAATG GGAAT AAATTCAGTGAAACCTGGGCCACATCTTTCACTGAGAGGTñGACAAGGGTGACACTTGCCTTGGGCAGGTAAAGA ACCACACAGACATGCTTTGGGAACAAAACTCATAAGGAATTTTGTAGCTGGCAAGAGACATTTAATTCAGATGAG CTGATCTGACAGACAACT C CTGGT CATGTG CTGCATAGTTTGGTGTGAGCTTGC CACACCTG CCTTGAGTTCAAT GTCGTGACAGTCAGTCCAGGTTGGCACGGG CATG GC CTGAGACTAGGAAGAGAG CAAAGC TCAC TCAC CCAC CCC ATGCCTGTGCTT CAGACT CGACTC CAGAGTGATTGAAATCTACATTGATATATAGGTTCAGC CCACAGTGATG GC AAATCT CAGC C CAACAAGGGGCACAAGG CC CAAAGATTATGGGGTCTACCTGGG C CATGAACTGGAGCTTTATCA
c c t t c a c a a t g g a g t a c t c a c c g c c t a t g t c a a c a g c c a t g c a g a c t t g c t g t t c c t c t a a t g a g t g a a a t g t g c CGCTGTAAGACTTGTACGAGG CCAñ CATTT CAG GAGGAATTGñGAGAGTCGAATAACCTT CATC CCAGGACT CCT GGGGGACTTCCTCCTCTTCAGACTCCTGCAGATTCCTGATGAGCCAGGCAGGACAGGGñTGATAGAAGATTTAAC CAACAGACATTAGACAA CAAAACCTCCCAGAT GATCTGAT GG GAGACAGAATGGAGTG GT CACAGAAACCAAAG G CATTTTT CCT T C AAGAGAAATAAAACTAGCCTTCTAAATACAGGGTGGAGGGTG ACTG CT CTGGGGA C AG AG C AA AAAT GGGCAG CATGTGCT CAGTACATTTGCCACAGATGAG CCAACT CAGGGCAC CCAGACTCTCC CTGTAAACTA CC AT CATGACTTGC AGCACAGAGAA CTG ACAC AGGG CTT C AACTACTTTG CAT AAATTGGGTTG AATTTT AC ATG CAGCATT CAAGTGAAGAGAGTT CTTGACA CAG TGCAGACACAGATCTTGTGTATTAAGGG CCCCfiTTT TCC CAAT ATTTTGATATAATATATTTACCTTTTCAATTT CTTTTCTT GCAAAAATA CTAGC CAACATA CTACCAACAGATAG GAAGAAAGCATATATACATCTCTCCCTGGATTTAAACACATGGGAGAGAATAGGCAACACCAAGAAATCCCTGTT TG ( N ) xAAACCCTGTTTGGCTAGTTCACCTGGCTCATCTGATGGCAAGTTCCTATCTTGAGAGGACTATGAAATT AAAACCAATACAAGTGCCACAAATAACATACAACATTGTAAATCAGCACAATTTGTAGCTGGGTGAATGGAAGAA ATAGTTCTATTCATGACTTCCT CATTTT CCCTAAAT CTACAATCTC CAGATGTCACTACTGAATTAACAGC CAAC AATTCCACAACATTACCTGGGAGACACTGGCCCTTTTTCTTCCTCTTCCTCATCATCACTTTCATTTTCTGTAAA TAAATT CAGAGAAG CAGGTCACATTAAG CAAT TCATAC TT CACATATGAC CAAATCACTGTCCAGTCATAGCACA AGGACATAACTATTCTCAGTGCAAGAATAAGGATTCTGACAGGAATATTCTAGGGTGCCCTAGATTAACTTTGGT GA3AATTAGATGAC CCTG CTTT CC AGAC C CACAGGC CAAAAT CTCC CTCTACGT GT AG AC CA'i’AATGC CATATT C CCTG CCTGAGTCAAAGTTAAACAAAATT TTTT CCCCAAAAAAATCT CCAAAAATTGGT CCATTTTCTAAGAGTGT TG CTGCAATACGGACTTATATCAC CAGATAACATGGACATTAAATGTTTAGAG G CATCTATACATGAAACACACA TGATAGATAAATTTGAACAACTCTTGCTTTAAAAAGAATCTGTGA(N)xAATCCACGATGCTACAAAGAAACATT GGAT CAGCCATTGCATTGACAGGG TGGAGAAG CAGGGT C C AG CCTTGCTTTATGGAAATATATCAGCAAAGTAAA GAAGAAAAGTTTCC GTCC TGAT TT CAGGGTGACTGTGOAG CTAAGCAAGC TGACTTAAAGGAGAT CCG GATGAAA GCTGAGAGCAGTGAAGCCTGGGGAACAATATTTCCAAATACAAAGGCAAGGCTGCCAGCTTCCTGAAACAGGCAT AGAAACT CCATGGACATTGTT CAGGGACAGATGACTTAAT CACAGATGACAAGAGATACTGAAT CGAAG CTAGGñ GGCCTGACAGATACTGCCTGTGCACCTCCTGCACTCAGGTGACTATGAGATTGTCACACTTGCCTGGGGTCGAGT AACTTGATACTGGGGACTGGCAGACAAAGGCATGACATTAGCTGAGAAGGACAAAAAAACTCCCTGATATCTGTT TAGAAACCCATCATAGTTTTTTATTCAAATGAATTTGTGTTTATAGAGCCTGTCTTCAGAGTTTATCTTCCTCAG CCTAGAGAGAGGTATGAGACA CAAGGAAAACAGAGG CTAC CT GGGATAATGTGTACAG CATCCT CCCATTCAACA TGAGAGGATGAGCCAATGAGAGTTGAGT CGACTTTGTCTT CCTCAAATGTGATT TTGG TT TTCCTATGTGGC TGG TTGGAGT CATAAGGGCCATGGCTATTTGAACAAGTGATG G CACATT CCTC CAGTGAGT CCTCAGGGA CTTCCTTT TCTTCAGCCTTCGGCATCTCCCTGATGAGCCAGGTGGGACAGAGATGACAGAAGATTAAACACAGAGGGATTGGA CCCCAGGGAGTCCTAGCTGGTTTTGACAGGCGGCATTAAGAGAGTGGTCCCAGAAAGCAAAATGGAGGTTCCCTT TAAGGGGGAACATGCAATCCTGTTCTCTCTGCAACAGAGCATGGCTGCCATGGGAACCAGAGAGGAAGAGAGCAG CTGGTGTTCATTGCAGTGGACAGATAGGAGCTGAGGAGGATGAAGACTCAGCTATCCCTGTATGGTGCAGACATG ACACTCGGCACACATAGAGAAACATGACAGCTGCCGCACCCTGTGTCTAAGCTGGGTTATATTTCACATACTGTG GC CAAG CAAATGCGGGTTTTTGGC CCAT CATAGATG CCAGAGAGGGTGTACCTC CTAGATATTCTTCATATG TTA CCATCCATTACTTGTTCCTGAGTATTCAGTGTTACCTGGGGGCAGACGATTTCTGCACTTTCTCAGCCACCTCAA CTTGAAC
> H s l_989 4 73 89 -98957038
TTGTGGTAGTGTGTGGTTGTAACTTATTGAGAGATTATAACCTGTCAGCTGCTGTGCTCGGCAGGTTCTTCATCA TCATCATCATCATCATTATCATCATCATCATCAGTCCCAGGCAATGTTTATTAACTGGTTATCTGGAAATGTACT CTATATGTGCCATCCATTTAAAATCCCACAAACAGAAGTTCAAAAAGGCTCATAACTAGTGAGAAGTGGACCCTA AATT CAAATC CAGACCTATCTATTATAATTTAGCAG CT CTGAAATAGTGCATAATAGGTTTTAAAATT CAATA CA TTTCACAGTCTTCCCAAATAGTCCGTTTTCAAAACTGAACACACAATCTAATATATTCTGTACCTATTATCTGAT GG CTGT ATCTGTTTGAA C CG ACTCTCTGGATATACAAGTT AC CTCTTAAG AGCATCTT CACATATAAC AATTTT C AAGT TT TCAAAGCT TTTC CTAATTGATTAGAGACAGAAGAAGAGTT CACATCACACTACCGCAT CAGCTCCAGC C CCATTC CCCTCCCTCTTCCTCATTCTTACCTCTTTCCTGTGACACTGAAAGTTCCCACCCCTTATCGTTGAATGA GGAATACTCTACACATCAGGGAATTTCTTTGGGGACTGAACTTTGGAGACTGTACCGAAGTCCCAAGTGGGTTTA TGAAAGAATTTTTTAGAGAAAATTTATTGTTCTCCCTGGAGGAAATTGTGGTAAATATAATTTTCTTTATCTCTC CTG TTAAATT AACTTG AAATTATT CAAAAAGT AATGGC ATTGTAGTTACTGAGAGT AGGC AG AG AACC CATGT AT TCTTTTCTCAAAACCCATGTATTTTTAAGCCATGGTGGCTAAGAAATTCTTCCCACGTAATCTGATTAAAGGCTT ACCACGTACACAGTTTTTCTGTTAAGGGTAGTTGAAGAAAGCAAGAGAATGGAAGGATTGAAGAAGCAGTGCTTG TG AG AGGTTAACTGTGCATT GAAAATGG CTAG TG ACTG AAGACTTA GGAATGCACC AC ATGGTC AGGT AG AG AAT AAGAAAGAAGTTGATCATTTGCTATTTTTACCCCTTGGTTGTTGTTTTTTGTTTTGTTTTG(N)xTACACCTTGG TTTTGATACAGAAGTGCATTGAAGGATTAGAGAAGCAATCTTCCCAAATGATTCCACAAGTGTCTTAGAATGTAA GAGATGGGATTCTAGAGGGT CTGG AGAAAGAC TG AATGGTGGGGAAGATG AAC AAAAT AT CTTG AAATTTGGTT C CAGC TT CTAGTCTGAATTGAAG CT CTTTTTCTAAGAGCGTAT CTGAAATAGAAGTGTAAAGGGAAAAAAGCT CAA TTAG TACAAGATTCATT C CAAGTTGAGC TAGAAGTGAAAGAACAGAAAGCTAAAAG CATC CCATATCT CAAATAG TC CT GAAAAGTGTACTCAATTTTGGCCATAG ACACGGAAT CGAGCACATCTGTCAT TT CC CATT GTTGTGAATGT CT TAAG GTTACCCAAACAAATG GT CAAAGAATATAGAAAAAT CATACCAGTTTAGCAGAGTCTGAAAAGTCTAC C TCAGATTTGGAAAG CAGCACAATG TTGT CAGGGGAAATT CGTTATTATTAGATTTAAACATTGCA CAT GAT CATT ATAT CAAAAGACAAGGCATT CAGTAAAACTTTATAAAT CT CATGAAATGCTTGTTG CGAG CAGATACTTTGTGGA AACAAGTGAAGGAACAAAATCACGGCAGAGTACCAGAGACAGACAGAGGTGTCTAGGTAGGTGGAGAGGAAGCGT AAATTCAGTTGCCTA CATTACT GACAAAAGGACTGCTTTGGCAAAATGTAAAAG TATATCTTAT TATT GCTTTCT TTGGTTCAATCATGAAGGAAAAGTGTACCATCTTAGCAATTAAAATAAGTATGCCAATTTTTCTCTGAATTCTTT TACAACAAGTGCCCTAGCAATTACTTCCTCTTTTCTTTATATCTCAAATTTCTATGGGTTTTACTTCTGCTAGCT ACGAACATGGTCTGGAGAATTTTTCTTCTGTCATCCTACTCTATCAAGCTACTTTCCTCCCATAGCCATCTCTCT TCCTTTCATTATCACCATACTTCCCAAATAAGGACTCATTGCTTTCTCTACTACAGTACCCATCCATCCCTTTGC CATTTTTTGACTGTTATATAC CAATGATGCTT TTCTTTTAAATGGTAATACTAC CCTGGTATTAGC CAAG TTTAA TGGCC(N)xTAAGACTACTTTGAATAACAGCAACAGTATCTC(N)xATCATTTCCTTCTTATTTTGTTAATATTT TCTT CC TAATTAGAAGTAAT TT C CTCAAATAG CAAGAATGATGCGCTTTATTTCAT TTACACCCTC CACAACATT ATCCAGGGCTGGTCACAAAATTAAAATAATCATTATAATGACAATAATAATAC(N)xGGATAATAAAATTGTACT TTGTGTAAGTTATTTCT CAATTAT CATTATAGTAAAAG CAAACATATG CATTTGTATTTATTTC CATATCGATGA GTTTCATGCTCAGAATTTTACTTTTAAAGAAAGGCTGTATTGGAGGAGTGGGAAGACAAGTCAGAGAAAGCTTTT TAGG GTTT CCAT CC CTAAAACCTCAGCT CTTTTAAAAATAGACTAT CACTTCTC CCTCTTTC CTAGGTGC CTTTT GAGAAACAATTAAATTATTGGACTACAACATC CAAAGTGT CT CAGAAAATAAAAGTGC CTTATGAGTGTGATAAT GTTAAATT CACCTAATAAGG CAAC CTAAAATAATTACTGCTAAAATATGTACTTGATAATTT CTACTTACACTAG ATTCAAAACCCTTTAAGCATTTTTACTATATAAAAAGTATATCCATTTTGTTTAAATAACTACAGTAATACCAGA TTCTTTTAAG AAAATC CTTC TC AAATTT TTTTTAATTT AACTGACTGT AG AAC AAGGCTTGAGT AAT AAT CAG AT TAAC TGTG CAAAATG G CATT TTGG CCTT CTGTATTTAAAAAAAAAAG GAAAACTTATTTCAAACT C CAATAATG C CTCTAG C C TACACATTTGAATT CTATGTTGTTTCAATGAATATATAAG CAATTAAACATAGAAAATGTATTT TTA GAGAAACAGCAACTTTATATAC CTAATGTGGG CACAGT CTGAATTGTC TTTGACACAGAGATAGAC CAAATGAC T TGATATGTTTTAGAAGAGAACAAGAGGTAGATGATCAACTGTATGTAGATAAATACTAAATATACGTTGATATAT TTCAGGTTATAATGGTACCCTCTACCCTTTCCCTCCTTCATTTGGCCACTGAGGGAAACATAATAACTGCTGATA ATGAAAGT CTTATC C ATGGTGAAAAGTC CT AT CTTTACTGTACATTG GG G ACT AGC TTTTAT ATTG CTTGGATAA CTCTAAAC CTTTAAACTGCTATTT CTAG GG CAAATTCTGC CCTCCAAAACTTAATTGGGCAGAATC CTATGTTGA CTCAGAGGGAATAAAATGCTCTCA( N ) xTTTGTAGGAGCTACTGACTGAGCAGTTTACTCCTAGGGAGAATGTAA CTCACC CTGGTTCC CTGCAGAAACAGCCT C CTGTTATAAT CAGGATTATTATCCTCTGTTTACT CC CTTT CT CTG CCTGAGTCTCTCAGTAACAGCTGGGGGCCTCATTTATAAACAACCTGTGCTATAAAATGCCCATGATATACACAG TATGAAAACATAGGAACATGGT TTATCC T T ( N ) xGTTTTATATGTATTCAGTGCATTTTCTCCAGAGAGCTGAAG GCATTT CATAAACTTTAAATAACCAGTAGGTATAAATCTATG CCACTGTAAGTTATAG TTATTTAATAGTAATAA AACAAC TTGAG G GTGAAGAATTGGTATTATTTATTTATAATTAGCA CATGAGAAGGAAAGGA CGTT T CAATAAT C ATAGAAAAATGCTAATGTAAATTTTTCCCCTGATTTTGTTTTTTACTATGGAATTGACTAAGTTTATGAGTTCAA GACTTT CATTGC CTAGAGTACGACAGTCTG CT TATAAAGTATACTCAATAAATATTATTGAATGAC CACTGAAAT TATATTTTAATTGCAGAAAATTATATAAAATG TGTTGAAATTAACCTCAATTGCTATA G CCTAG TTATTTATAGA TTTC CAAGCTTAGGAG TACAAC CTTTTCTT GAATATTTTG G C ( N ) xCTCTAATTTCTTGCTTCTGGTGAATTTAC AATTT CGGATAGGTCTTAGTTT CTGAGCTTACTGAAT CATTTTCCAGGTG CATTGA CTCTACCCCTCACTAGAGA CAACTCTTAC{ N)xCATGCCTTTATTCCCTCCCTCCAGTCCTCCAAAATTATAACCAAAGAACACTCCAGATTTC GCGAAGTTTGGG C CATG CTG CTTGTGGATAGGGACAGTAAAAAAGCAC TGTTCAGGTGAAGTTTAAAATCTTGAT CCGCTGTAACTGTACCT CCACTGGAAATTT CTAACAGACCATGTCCGT CCTCACCTGG CCCAG GTT CTAGG CATT AGAT TACTGCCATTTTT CAAG GTATCTCTTTC CAACTCTG GG CTCTTCTT TTTC CTACTTTT CAGO CAACTTACA TCCACTGAATTT CTATTTATTTCT TGAATATAACAATTTAAGTCTTGT TTGTTTAAACTCTC TGAT CTAAGC TAG GACCAAGGAAATGTATCTTTTATAAGTAAGTATATACAGTAGGTAAAAGGTTTTGCCCTGTCCTCTTCCCATCCC AATCAAAGAAACATTTTTATCTGAGAAGAATTCAACTTTTCTTCTGTAAAGTG( N)xGTGTTTTACCACCCACAT GGCAAATATTTT CTTATAAT CTTT TAAGGGAAAAGTCTTTGAAGACATATAACAATGTATATATT CATATATAT T CACT CACATGTGTGTAATG GAGTAATATGTTTAACTGATT TTACAATG CAGATGATGTGTACAC CTTAAACT TAA CCAATT CTAATAATGTAAAACACT C CTTTGAT TGCATT GGTTACTTTGATTACTTGGT CTTT CT CAC CTC TGAAA CACC CT AC CTCC AAfl C AAGAGTTCATTTTG AAGATG AT GTTTGGGTGTTTGT AAC CTC ATGATACAGGTG AAAAT GTTC TTGGATAAGATGTTTG CAAGTCTTTT TAGTTTTT CACAGGTGGC CCCTCTCCACCCATGCACTGCTTACTA GCCAGAAATTGTGT CT GTAATTGTTGGC TGGACTGATCTAAAGTAGTCAT CTCT CT CAGCTTGGTTGAAG CC CAG TGGT CCT C CATG GCTGTCTT CACC TGAT TTGG GGTTAT TT CAGGCCAT CG GTGGTTTTTTGCTGTG CACACG GTT CAGCCCCTGCCTAGATTGATTGCCCTGTAGCCTCACTGAAGAACCACTACATCCCTTTTATATGTCCCCTCTGGC AAGCACTACAGACAGATTCTCCCATACTTCTGTGTGCCTTCTATGAACACAGAAAACAAGAATGCCTCTTGAAGT TAAAAAGCATAT TTGT GTGATGTTGACT TCAAGCTGGCTG CTTCCTGGAT CCCCAACTGGGGAGT CAGGACAAGA ACTCCCCACTCTTCTCCCCACATTGAGTTACCAGCATCTTTTTGTCCCTCTTTGATCTTTTTCTCCCAACCCCTC CCCCAC CAATACATTCACTTTGAACTGAAGTC CTTGGAGAGGATTA GGAATCACACAC CTGACCACACCCTT CTC CTATTG CTACTTAAGA TTGAAAGTTCCT TGTT CTTGCTGC CTTAGGTGGTATTGGCAT CTTCTCTACACATTGGG AGAAGCTC TCCAGGTGTAAACT GTGATTG(N)xTGTGATTTTTTTTATGACAA(N)xACCTCTACCACATCATCT CCCCCACCTCCATCTCCTGTATAGACAGATGAGCCTTAAGAACAACAATTACTATATACTGAAGCTCAGATGTTA ATCAG GTAATCCTAGT CTTAAATGTCTGTG CATTTTTT CCTC CTAGATTTGAGAAAATAAAAG G CACAAGAAAT C CTGG CCTGAATGGT CAT CAGAACTTCTTGTGTGTTGCTTC CAAAATTTTAGAAG CACTTTTGAG CAGACTG C TTA TAAAACATGGCTAC TTTCTAGATT CAGC TGTCAACCACAGAATTTT CTAC CACTAAGGACAGTATCAGTT C C TAA AATCATAGAAGT GATGGTGT CTAATTGC C CAGAGAAAGAATTAAAAGTTAATGT GGGTTTGGGGAT TTTTTGGTG TGTGTGTG GCCAAAT CAGCAGTTG TTCATAñ CATAAAAGTTT CATGATGGTACATG TGGTACTACT TTTTTTTAA CGATTG( N ) x GCTCATAGAGTTTTACAGCAACTTTATAGAAATATACCTTTTAGGTATAAAACTACATTCAAAAA ACATT CTT CAGTTGATTAGAACAAATTC TCATAAATCTT C TGATTC CATTTATGAATTTCCCACAT GAAAGAGC C AAAAAAGAGAATGTCTGCTCTGCTAAAATTCTCTGGAACATCCTCTCAGTACTCAGTGGAAGATGGATGAGATCC TT CTGCAGCAGATGAT GAACAATAATTATAAGTAAATGGTAGTGTATTTTAGAG GACACACATTTAG GAAATATT TGAAATAAGAAGTAGGGAAAGGAG GGA CTTCTGAA CAAATAAAAAATGAT CTAGAAAGCAAACCTG CAATCCAG C ATGG GAATTT TT AT A CAGGCAAAC TAAAAAATAGGG CCATTGAGGTTT TT TAAAATACAAGAGGGAG CAAGGAAT AACAACAATCTTATGCCAGTGAGGTATTATCTTGAATGGAAAAAAGCAATTGTGATAAATAAATTATTAGTTAAA GATATGAAACAATAATAGGGTCAGCTGTAGAACACGTTAAAATTCAGAAATCTATAGAATGATCCTTTACATGTA TTCTGCTTCTGTTGAGTTCAGATCAAAATCTACAGCCTTCATTAAATTAATGTCTTATTTTTAAAATATTCTCCA TGGCACCATGGAGAATTAAATTAGGCATTAAATTAATGCCTTATTTTTAAAATATTCTCCATGGTGCCTGAATAA GACAGAGGAGTAAGTCTAAATTGGGC CCTTATGCCATACATATTTATTTCAT TAAAGTAGGAGAAT CT GAAATAA AATTTAAATTACAAATATGGAAGTGGCTTTAGATTTTCATTTTGGTTTTGGC( N) xTTTTTCTTTTTCTTTTTTT TTTTTGTTCTACTGATAATTCAATCAAATGCAAACTATTAATATGAATAAAATATATTGAGGGTAGTTATGTTTT TG CT CATCTACTTG CT CATTCATT CAAAACAAT
> H s l_ 156983 20 0 -156 99 58 19
TGGCAAAAACTC CTGC TTATTC CT CAAG CCCCTATGAATT CCTCAAACATTAATGCTTCT TGGTGAAG CATTTC C CTACCTCTCCCCCAGCATGGCTGCTCACACCTTCCTTTTGGCCCCTACTTTACTTGGTATTCTCCCTCCCTCTAT TACAACACCTTTCATTCTGTATAGCAAATCTCCATTTATGGGTGTCTCTCCTTCAACCCACTTTGAGTTCCTTTT CTTC CCTTAATT TT CATGACTTTCACTC CCATCCCCACCTCCACTACTCACACC CAGAAC TACATTTT GAATCTT GACATTGCTCA CAATACTTCCAC CTT CAAAAACACAGG CT CCAGTTTC CT CT CATACATT TGGT CT CACCACATT TGCTCTTCTTGGTGACATCACCAGACCCTGGTATTTTCCTCCCAGGTATCAACGTCCTCTCAGCTTCTCTTCCTT TTCGCCTGAATTCCATAGCTGATCACCTACGGCAGCAGTCCTGATTCCTTTGTATCCCCAGTCCTCCCATCAAAT TTGTCCAACAAACTAGCTCTGGAGAAAAGCTCCCATTTTCGGGCTTTACTTCTACTTTGTTAGCTGCACTCAGCT AGGGAAAAATCACACTGCTGTGTGATCTAACATGCAGATTCAAGGTCTCCAGCCTTGCTGAGGCCTTTGGTGCCT CCAATCACGGTTTTATTAGTGTTT CCATGACCTTTT CCAC CACAACGCTAAAACTTTACAG CTCTC CT CAGCCT C CATATTTAAAGC CCAC CCTCTC CCTATCATTAGATT GTATTG CCTCTTAC CACATAAAAA TAACTGGAGCCTTTA AG CACTGTTCTT CTTCAATTTC CTAC CT CTACCTA CAAGC TC CTCACACT CATCTTAGCCTTTT CC CT CTAGTT C AAGCTCAAATCTCACTCATGCCTCTCCTAAATCTTCAAGATCTAATTGTATTGGCTCTTTCTCCTCAGCCCAGAA ATTTATTCAATTC(N)xAGACTTGCTACTTTATCCTACCCTTCCTTCTTAGGAGACAGTGCGTACTCTCCTTTTA ACCAACTTTTGTAACAATGCGGACTCTCATTCTTGTCTTCAGCCTTCAGTCTCTTTTAGATTTCAGGCTCCATAT GGAACTCTCTAGGGGACAACACAGCCAACAGATGCTCAAACTCAACACACTGACAAACTCATCTCCATGCACCAC AC CT CTGTCCTC CT CT CGTATT CT CT CAAGC (N) xG ACAATTTG ATAC AAATTGTCTATG CTTCTC ATTTTCCC C TCTACTTCAGGAATTTCAGAGACTGCTTGAT(W)xCAAGCAACCCCACTCTCCTCCCCTGGCTGGGTTATCTTTA CC CTGCTCCTAACT CA CAGGGCACAT CG CCATCACTGCATTCATCTTG CACTGTGGTCAT CTAT TTAATGGT( N ) XTGAAGGTCTACTGCCATAATTACACTCTAGGTCCTCAGTAATGCGCAGTGTGTGTCTGCTAATGGTCACAATAT CCTGAAGGGCCTTACTGCATGCAGCACAGGCATGTGTGGATTAGTGCCTCTTTCCACTGGAGGAGGAAGCACTGG GATG GAAGGT CAGC CATGGGACTCTTGG CGACCTGG CC CAACGAGCATGT CT CC CAGGGG GAGO CAGATACACT C ATTCACCCAATCAAAAACATTTACTGAATTTCCACCAGGTGCCAGTCACTTTGCTATTATGGTCCCTGCCCTCTG GGAATTCTCAACTGAGGAGGGGGAGAGATACCTTTTCACAATATGGGGATTGGAAGAGTGATGTGATGACTACTG CAGTCTTAAAAATAAACCAGGAGTGTGTGGATATCTCTAAGATTTTAATCCGTAATGAAACAACAAAAGAACAGT AGGAAAAACG CT TG CCAGGAGT CAGACTACCCAGCCAAT CA CAAATAAGACCTAAACCCC CAGGTGTCTATTGTG AGGACTGAATGAGG CACCCATAG CA C CCATTACAGG CCTG( N) xTAATATTCCTGTTTTTAAAATTATTTTTCTC TGTTGTTAGGTTGTTTGGCGTC<N)xTTCCTTATCTCATCCTACTCTTGTTTGAAATTTTCCATAATAGAG(N ) x ACTAGTGAGATTTAAGGCTAGAAG CTAAGGCTGCTG C CATAC CATTAAGATTTAAAAAAGAAAATAGGAAGGAGA AGAACTATTGATAAAGATGTATTCCTTTCAGGCTCCCCATAATAATAAAGCAAATATGGACAATCAGGATGGCGA AGAGGTGGAACTTC CC CAGATAGCTCAGTTTAAATACT TT CTTATCAATGAC CT CCAAGTAT TC CT CAGGCAGT C ATAC CCTTCCAGAACAGCAGGGT C TG CAGAGATCAT CT CACTGCCCAGAT TCTGAATG( N ) xTGAATCACTAGTG TCTGATACAGTGGGTACTAAATGTTCACTGAATTAATAAAGAAATGGAGAA(N) xCGGAGAAGAGGATTTAATAA TTTCTTTTGTTTTGATTTTTAAAGAATTTCTGGTAAGAGAGAATCTAACATCTGCTCCAAAGGTCTACCAAGAAG TT CTAAAATC CTGATATTCTCCAAAA CT C(N)xAAGCATTTTACTGTATGCAAATTTAAACAAGTTTTAAAAGAC AC T G ( N) xAAAACAGCATTAAGAAAAAAAATTATTCTGGGTCCATATTAGAAACTAAACATTCCCTTACTCAATT ATAGAGATGCAGGTACTCGCTCTTTTTAGTGGTAATATAAGGCTGTCCCAAAACATTCTGTTTATTGTCCAGGTC ATAGTGCTGCTAGATCCTAAAATGAGGCTATGGTCTTTGAATTTAAAATGGAAGAAAATCACAATATGTACATAC AGTCTTGCATTT TATCTGTTTG TA CATACATATGATTATTTAATCAAT TAATGCATAGGCAAGT TC CAAACACAC ATATTAATGGAGAAGCAACACAAATAATCAGCAAAGACAGATAATTTTTACAAAGCACACTTTCATTTGCCTTTG TAATATAATACGGGGTTTTGTTTTGCAGCAGATATATTCTTAGGGATGTTGGGCTTTGACTAACATTAAAATGAA TGGTCATTTTTTTCCTATTTTATACTAGAGCAAGAGCTAAGGAGTTCACAGCAAGGATAAATTACAATGCTGATA T CGATAAAGCAACTGCATTAAATGTAGG CATTGAGAATTTAAAAGCATT CAT TTGAATTCAAAAGC CAGCAAAAT GTTATATACTTCCAGGTAAATGTCTGTCCATGTGCTCCCCACCCACTCTTCCAAGCTGCCCCAAAGAGCTTTTCT GGATATCTAAATGTGAAATGTGTAATAGCCTGTGCACACATCAGAAAAGGTGGAGAAAGCAAAAGATTTAATGTT GAGATTCTGTCAGTAAATTCTCAGCAATTACAGTCTCTGACATGCCAGTTGCTCTGTAGGAAGATGGGAAAGAGA GT GATGTGATGGGGTACACAGTACAG GC TGAAGAGAAGAACACAGAGATTT CAGGCCAAGTG GAT C CTTGGCCAA CAGAACCTGTCCTTAGATGCTTTCAGGTGGGCACTATAGACATGCATCCTTCCAATCCTAAAGCCTTTATCAAGT ATGT GTCAAGTATTTGAGGATG GCAAATGCTGAACAC CAAAAAAGAAAGCAGAATCACAG GCATGTCCTTGCCCT GTAATG CTCCTGAATCAAAAAAGAGACAATGACAAGGATATATTTAAAATCAAAATAATAACCAT CAAGGATGTA AATTTCATTATCAC CT CAAATGGATCCAATAT TAAGTC CTGATTAG GTGGTCAT GTAATGAGAG CATTAACAAGA CCA C CCT CTGGAAGTTTTCAGT C CAAGACAGTAATGAT GG TGTTTCATAAAAGGAAAT CGTTTAAAT AAATG GAA TCAACTGTGGGTTATAGATGCTAAACGA ( N ) XTCAAAAAGCTGTTATAAAAGAAGAATATACATGAAAATTCTCA AACTATAAAACTCCAAGTGCTGCACATACACAAGGGATTATTATGATACAAAATAAATGGCTATGGATTGTGAAG TCATGAAGGTTAGATGAAGTAG CAATTAAATCAG CCAACT CG CTAACCTCAAAAAACTTGTCTG CAGGTATAGGT GGTGCC CAGCCTGCTATGCTATGTAAATAT CTTTTAATGTATTTAT CT CTTGAC TTGAGGGTCCATTTATAATAA TCTC CTTTTCAT CT CTGACCAAAAGCCGTATTAAAGATAAAATGAATGGGTGAAAGGCAAAAAAAAAAAACAGAA ACCTATCTTTTTCCACAGTTAAAGAAGTTTGTATCTATGCCATAGTCAAAACAACAGATTTTTAACTCCATAATT ATG C( N >xGC ACCTGGAATGACTATTACTTTCCTATTTTTTGAACACATACCTCATGTTAAGTATATTCTTATAA ATTAAATCCCCACAACCTCTGTCTAGCACAGTGCAGGCACATCAACTTAGGATCAATGAACATCTCTTGCTGAAG TCAATT AATTGC CAAA TAAGACTTGCAAGTAAAATCAGAGAGAAGGGGAGATTTGTTATCTCTA CCAAACTGATG GTACAGACAGTAAG CA CTCCTG CTGTGGGCTGTGGTGC CTTTACTCTT CCTTTCAAGTGATGTA CT CTGCTG CTT CCAGGTTCTTAGGTCTTTTGCAGGTGCCTTTCCAACAGGCCTGAGGTTCCCTAAAGGCAAGTACTACTTGTCTCC ATTATCTTCCCCGATGCATGTCGAAGATAAATAAGCACACAATACAAAGTAGGTGTAGATGCTGACTGAGACAAT GGTAGCTGAAATGAGACAGGGCATAAATCTGGTAGAGAGGGAAAAAGGACCTTGGATTCCAGGAGGATCTGTGCT GCCAGCTAGTTACCTCATTCTTCCCTCCTCAGCACCCCTCCCCACCACATCTTCCTGAACATGCGGGTTCCTAGC CTGAGTTGGTAATCTTTCTGCC CTAGCCAATT TATTTT CTAATCTTGTAG CTTT CATATAAAGGGCTCTCCG TGA GATTGATGATGACAGTCTGGGCAGCCACACAGATGAGCGGGCTCCCCTGTTAATTACTACCAGGGAGGGAAGGCA CTCTATT CAAGGGAGGATAGAT CAAGGAGCTAAGTTATAT CAGAAG GGGGCAATTT CAAAAGCTGGAAAGAT CTG TCAAAAAAGTGTGT CCTATGGGAACCCAGG CAGAAGGAAGGAGAGGAGAAAAGGGAAACTAAGT CACTTGGAGG G AGATGGCTCCTTCTT CATGCCCAGGTTCTT C CTTACAAGT CTGTGATGAGGGTAGGTATAACAATAGATGGAAAC TGCTTTGAAGAATAAAAACACTGTATAATGCTTGGTACTGCAAGGATTATTGTATTAAGTGGATTCTTATCCTTT GCCTGGTTTATGTCCTGGTTCTTGCAACA
> H s l_ 245718 410 -2457 27 86 4
CAGTGT TT CCTAGT CTTCTTTGAGAATGCC CCTGTTAAGATAG GTGAGAAGCAC CAGGG GTAGG CGAGATTCTTG GATAATG TGTGT CCTG GAAGGTGGTGTATT CT GGAATGGGGATGGGAG GAG GGGAAAAG GATAAAGAAGGGAATT CTTTTCGTTTC AAG CAATAGAAATTGAAACTGTGATGC CT CATGGAAC CAGACCATGAGAAGGT CATGAGCG CCT GGAACTAAGATCTCTCGAGTGCTC CGGGACGCTT CTCGCCTCGTCT CTTA CGTGTGCTTCTTCT CACATTTTGG C TTAATTCTTCATTTCAGGCCAGCTTTCTCTCTTACAGCTGCCACCAGCAGACTGGACAGACTCCTGTCCTCTCTG GCCCTAAACTCAAAACTTCCAGGAGGGAGCTCAGATGAGCGGGCGGTTGGGGGTCAGGAAGCGGCTGCTCAGATG AAGGGGGTGCTGGGGTTGGGGTGCTCAGGAGCATCTGCCTGTGTGGAAGACGCTATCCTACAAGAAGGCTGGAAA GACAGACAAACGTCACCCCCTGCATTTTTTGATGGTTTGTAGGAAGTGTCAGTGAGCATTTAGGAGAAAAACCCA AGACCAGGTTACACAGAGCATGTCAGGTCTCAAATCCCGACATGACTCCTGTCAAACCCAGTGCCAGGGAGGAGG GGAGCCCTTCACCCAATCCGTATTTACCGAGCATTGACTGGGTGCCTGTGAAATGAGGACTTGGGGGGGTTCATT TTGTGGAC CACTGATT TCACAATCATTTAG CAGTTTCCAGGGGGAGAAAAGTGG CCTG CGTATT CAACAGAG CTT AGAGAAGAGACTGTGTAGGGATTTGCCCAGG GAGTGCAGTAG GGAGAGGTGCAT CACCAAGAATAT CGGAGACTT CTCCTTAG CAGG CT CATCGTTT C CAAAGAGATGGAGAT GCAGTCCCGAGATAGC GCTC CGGCTTGCTGGGGACAC TGGCT G CAGGTGGACAGTCTTACGTCTGCCTGGCAATG CTGG CTATGCTGGCTATG CTATGACCTG CTTCCT CTG TGAAACAGCTGCTGAACAAAAATGCCCAAATTTTAGCACATCAAT(N)x TGCAAGCCACCATCCCCAGCCAAGCA CATTAATCTTTATGAAAATGTTTTGTTTCTGCCTTTCCCCACTCTTGCAATGGCTCACCACAGTCATCCCATTTT CCCAGCAG CGTGGAACTCACAGTCAGAAGG GC CCATTCTC CAAAATTAA C CCCACCTGAGAGTTA CATTATGTAA CACACTGGGAGTGGGGGACAAGGAACATCTGT CT CCAT CTGTGCCAGGTG CTAACTTC CTGTGT CG CCCAGGAAC ACTCAC CTAACACCTC CGAGTG CCTCCACAAAAGTGCCAG GC CATGTATGTGGGACACTTGCTCTG CAGTGTAAG CAAAAATG GGTG CAGTTTTAAAAATATTGTAGAATTCTAGAACTTT C CAAATAAGTTG GATAATAT CAGCAGTT G GTATTCATATAGGTTGACCTTCACTTGGCAGAACAGTCTGAACAGGTGCGTAGATTTTAAGTAGACTTGAATTTG AAGG CTAAGGTCAT CCAGATCT CTTTGCGGAT GTGGAG CT GGAATGAAG CCCTCCTGG CTTTGC CT CCTCTG CCG TCGT CG CAGCCGGCAGG(N) xCTGTGCTGGAGAAAGTGGGGTATGAGGCCCGGTTTGGGGCTGTGGGAGCACCTC TTCCTTCTGTTGGTCCTAGGAGCTCAGCGCTTGGTGGGATGTTCCAGACCAGGAGAGGCTGGGACTAAAATTCCA GAGT GATCTATGGGTG CAGAAG CTTCAGACGC CTGTTCAAGA GATGGGTATCAGGTGA GGTGAGTC CATATTTA G GTCTCCAGCTAGGATTTAAACCTATAAGGGGAAGAAGCACAGGTGAGCTACAGTAATTTTGGCTAAAGGTCCAAA CTGAGGCCAATCTCAGGGCATGTCCCTCTAACTAGAGGTAAGGCATTTAGTCAACAGCCCTCCAAGCTCAñATñG GCAGAGGGTAGAACTGAGGGGGAATAGCAGCTGTTACTAGGAGGCTGGTCACTGGCAACCTTGGGTTTCAGGAGC GAGAAATC CATC CTG GATACTG CGAACGCC CTTGATTCTTAGAAAAGTAGGGTC CTGG CATCAAAG CTGGAG CAG TATGGAACGGATGACTGCTTAGACACAGGGCCAGTGGGTGATCAACTACAGCCTCGTAGATTTCTCTTTGGAGGA CTGGAGTGACTC CCAGGCTCAGTAGGGGAC TCTGTAGGGGACTGAG CT TGGAGAAG CC CACGGC CACCCTGC CT C CATTCCAAACAAGATTATTCACTCCACTCCTGTTTAGCTGGTCACATGGCTCCCCCATTCCTTGCTCTCCATTGC CCTTAGTCCAAAGTCCAAATTCCCCAGCCCAAGGCTGCAGGGGGCTGG( N) xCCAGTGAGTAGTTTTTGGTGCTG CAAAGAGGACACACTTTCTCTCACCTTCGGGCCTTTGACCTGCCCGGGCAGGGCCCTCCCCATCTTCCTCCTTTG CTCTGAATTCATCCTTCTTCTTCAAGCTTTAATTTGGATGTTATCCCTTCACCCCGCTTCCCCAAGGAA(N)xGC AGAGTTTGTCCTTCTTTGAACCAGGTCCCTCCTGGACAAGGAAGGTGCTAGATACATGTTTGTCTAGCCACAGGG AT CAGAGCTCAGAAGGTTGCGG GGACAGG G CAT C CTGC CTGCCCTC TCTGAATGTT CACT TTGAGT TACTTCAGT TACT CTTTACCT GGACAACATAAC GT CGGG GAAAA CTCATTTAT TTGCAAAAGG CAAAATACATGACACT GTTT C A C T CAGAAGCCC CT CAGACC CAGCAT CT CG CCAACCTTTG GG CTTAAGAAGTGGTGAAAAGATGGGGAGAGAGTG ACTGAATG CATTTTTCTGT CTAG G CAATTTAGTATTACAGGAGATG CT CT CAAAT C CTTGACCGGAGTGAAGGCA GAGTTCTTGTCAGTTATGTAAGAAAAACAGCTCATTGTTATATGATGCACACTGAACACCATCGTGTGTTTTCTC AGTCTTTGTGTGTGCGGACACGTGGC( N ) xG G A ATTCTG TTTTTACTACG G TG AG TA ACAAG G (N )xTAACAAG G TTTTTTG TTTTG CTCCTG TAG G TCTATTTTTTTTTTTTTTCCAA TCAG TATTTTCAATG TTTTG G CTCC AG ACAG A TT T CAGTGGTG CATC AC AT GT T A ( N ) xT TG TT G AGCT ATGCGTGGAGTG CTGC CTGAC AGCTTGCT A TT TT CTT TC T TT C CTGTGAGATAAAAATGTACCTT A TG G TATTACTTTATT TT AATT CTGC CATG AC A T T T GTGGCGGTTGT T T T AAGATTCAGAGATTACTGGGTGTTGAT CT CCTTATGATACT CT CG GT T CCTAATTATGTTAGAAAAAAAAG C AAAG CGG GAACTTGGCTTTTGTGTTG CCTT CACAGAGTGTGGAGGTTGAAAACATC TCAG CGCACTTCACG GAGG T T CT CAGAGACC CCATAAGTTT CTGGAAAAGTGGGAAACG CAAACCTGTCAACCTG TCAGTCTCTCCCCTCTCAC GG AAGC CTGAC ATGTAAATCTCAAGTTGTTAAAC CTGCTG TC AAATGAGCTGCCTC CACTGAAG ACTC CC CT CC G CTCCTCGCACTCT CAT CACTGC CC CT CACT CTCTGTCCCCACAG CACAGG GACAGC CTCCGCAGACTCCCGTGTG TGTG CT CTGG AG AG AAAAGATT CTGGATTTGGG CGGTCTC AC CT C C AAAATTGC AG CC CCTAAGGGGAGC CGTG C TAAG GTGTGATACAGG CCATGC CC CTTCTAGC CC CATGGGACAC CTGACTGATAG CAG CC TGTG CAGATGTGAGG AACAGAAG GTGGCCCCAGACCCCT TAG GAGAAAC CAAAGTGC CT CAGAGACAGGACAAGTGCTACC CCGGAGGCC AGTCAC CAGGTGAC CTGG GAGCTGGGAACCTTTT CAGC CTATGGGTGGCTGTTTAT CACGAGGGGCATTAGACAG A CAGGAGG CAGT GC TGAAATTCAACAGCACAAAG CAATTT CCAGCTTCATCTTG GCTTAT TAATAATGAAACAG C AATAGC GCAGG CA CAAAT CAACAG TAAGTAGGTGGGTGGG TGAGA CAAGT TGGACATT TGGGGAA CAGCCTGGGC
a g c t a c c g a t a c a g a g g g a a g c c a t t t g g t g t g t g t a c c t c c c c t c t t c c c t g g g a t a g g a g c a a a c c t g a a a t g AGAACG GCAGTCAT CTGCACACAAAAACTAGG TTAGGAAACTGTTTTCTAAGGG GC CAGGTTGGGT CACT CTTTG
t g a c a a g a a c c g t g t g a a c c a c t t c t t a c c t g t a a t t c t g a c t c a g c a t t g a t a t t t g a a a a t t c t t c c c t a g g g CTGGGTACTGAGTACTGTTCTGAGTG CATTACATACAACAGCTCATTTA CATACGTAATACATT CACATACATCA
a t c a t t t g c a t g t a t c a a c t c a t t t a t g t a c a t c a a c t c a t t ( N ) x t c a t g t g c a t g c t g c a t c t c t g c a t g c t t CCTCTGATGGTT TGAT GTGGGT CAAT CC TACGTCTGTG CACATC TTAACAA CTAT CGTAG GTGTAG GGTGCCTTT
g g t t t a c t a g a g g t t t c a t c a g t c c a c c t g a t t c t c a c t a c t g a c g t a a a t g t c a g a g g t g g t c c t g c c t c t t c t c a g t c t c c a t c c c a a g g c t c t t t a c a g c t t t c t g t g g c c t g a t t t t c t t g c t g t g g a t g c t g a a g g g c a c t c t t c a g t g g t t t c t c c c a g g g c c g t t g c g g a t t g c a g c t g t t t c a t a c c c a g a t g a c c t g g g g g g c a t c t t c a g t g t g t g c a g a t c a g a c t c a g c a c t g a t a c t c g t c c t c c a g c c g t a c t t t g t c t c c g c g g t g g t g c t c a g a c t t a a a g c a c t t c t c a g c t g t g c t a g t g g t c c c t t c a g a g a a c g a a t t a t g c t g a g g c a g a g a g g g a c c t g g g g a g g g t a g g g a a g g t c a c a a a t t a a a c a g c a g a t t a g a g g a c a g a a a t t c g g t t t g t g g c t t t a a t a t t t t a t a c t c a g t a t g a c a g g c a g a c t c t t c a a a a a a a c a t a a c c t a g t g g c c c a a a t c g c a a t t t t a c g t t a t a c t t t c a a t a t a g g a g g a c c c t c a c a t t g c t t a a a a g g g g t c t t c a g g g a g a a a t t t a c c c a a a a t t t c a t c -c t t g g c t c c g t g g t g a c g g c a c a t c c t c a c t c t g a g a a c c a g g a c c g g c c t c g g g a g t c c a g g t t c t g t t c c c a g t t t g t a t g c a c a c c g a t t t c c c c a c t g t c t g t c a c a a c c a g t c a a c t t c c c a t g t g t t a a c a g t t a c t g a a t c t a c c t g t c c g c a c g t g g c c a a g c c c c g c c c c c c c c t t c c c c c a t g c t c t t c t c t c t g c c t g g c c a a c t c c t c a a t c t t g g c a a a g t c c t t c a g a a c c c t t g g g t a g t g t g a g t t a c c t t c c t t t g t a t a g a a t c t g a a a a a t c a t t g c c c (N ) xTTAGTACATGTTTGTTTGGGGA ATGAAT GTGAGGTCACTAGT CAAT GATT CTGAAATGAGTCTATTGAAGGTGGAAAC CASCA CAATT CAAATG CTT TG CGTTAAATGC CAGT CAGT GTCTCTTC CCAAAGA CCC C CA CTGAACC CAT CAGAT GTAAAATC C C CCATTTTAT
c a c c g a a c a a c c t g t t t c c a c c c a t t t c c c t t a t t c a c t a a a c a a c c c g t t t c c a c c c a t t t c t c t t g t t c c t t c t t a t t g c a g t g g g g g a c g t g t c t t t g c c t t c t t t c a a g g g t t a a t g t t t c c a a g t g g g t t c t a t c t t c t c a a g g a t c a c t g g t c t t t c g c a t a t a c c c t c t t g c t c c c g a a t c a t t a a g a a c t c c t g c c c a t t g g a t g a t g t t g c t g t a t c t c c a a t t a t a t t t t g a a a c t t t c t c c c t c t c a c t t a c t c c t c c a c g t g t c c c a t t t c t c t g c c t t c t t a a t g g c c a c t t c t c a a a c a c a t g g c t g a c a g a t t c c g g a g g c c t t c a a c t c c c a c c t a t g a a g c a g c c a g t g g c c t c g g g a g t g g g g c t g c t t g a g a c t g t g g g (n ) x t t a g a a t a t g t a a c g c a a t t a c c g c t g c g t t <n ) x Gt g t t g a a c t t g a c c t t g c t c t c c t c c t c g g g t c a g g c c t g c c g a c c c t g c a c t a c a g g t t c c t t t t t c t c t c c c t g t c c g t g c a t c c a t c t g t t t c t g c a c t c t c t t c g g t g c a t c c t c t g a a c c a c c a g a a t t a g g t t c c t c c a c a c c a a a t g t g a t c a t t t c a t t t c g c a g c t a g a t t g c t c g t g a a c a g c t a a g c a g g t c a c g c t g a g c t c g a c a c a g t c c a a t c c c a a g t a c c AG TCA GG CTCCCACCG TCCCTTCCTATCCCAAAATG TCACCTCCC(N )x CCTGGGAACAGATTCGACCTGGGCAG a a t g c t c t t c a g t t t g t t c c t a t c t c t g c c g g g a t c t a t a t g t t c t g g a c t c t c t t c t a c g a c a c g g a g a c c t t t c c c g g a g c a g g a a c g g g g c c a c g g c c g t g t t t g c a t t t t c t g a a c c c a g g a t g g t g c c a c t g t g g t t t g t t c t c c c c g t t c a a g t g t t g c t g a c t g g t g g g g a g g g a c a g c a t c g c c t c a g t g a a a g t t t t g g g t t a a a c a c t g t c t g c a g t a a t t c a a a a g g a t a a g a a g g a c t c c t t t c a c t g t g c a t c
>Hs2 1944023 -1953149
TTTC CTTAG GGAGC CAGCTGCT GAACATGAATTCTGAAAC CAGGCACC CCACAATTAACTATGAATTCTTTCATG GTCCTGCTACCTCTTCACAAGTGTTTGCTTTCAGCACCTCCTGTCTTCCGTGCTCTGGACTCAGCATTTGCTGAC CACATGAGTGGTGGGCACCATGCCAGGTCTGGCCGCAGAGTTAGAAATCCTGGGTCCTGAGGCACGATCAGCCAG TCGCTC CTTTGG CG GTGCTGTCACACATCCACTGGGTG CC CATTTACTTG CATTTT TCTGGCAAAAATGTCTTTA ATTCTAGTTTGCTTTCCAGCGTTTAAAAACAATTATATCCTAAGTGTTCAAGTGAGAATAGACAAACCCTATGCA GTCACATTCTAAAGCTATTTCAATAAGGCTTCTGACTACCTTTCAAAGTACATCACTTTTTATTTTTAAAGAAAT ATTTA CAT TTGAGCATAAATAAACTAAAAT TGAATTTT TAATTCTAGAAAATGAAGAG CAAAACTTAAAACCTGG TTAAGG CC CCTGGTGTAAACGATACCTATT Tñ CAGATT TCCTATCT TAAT CCGT CG CACCAAACATGAATGC CT C CTGCTTTCCAAACGGGTGAGAACGTGAAAGAGCCACATTGTATACCATCTTTCTAGTAAGCCCTGGGCAAGTTAC AAAT CATTAATCAG CCTCCACAATTAATGAGT CAGTCAAAAT GACA CT CAGTTAATACAGTGTC CAGAAATAATT TGCCAGGTTGATCTACTTTCCAAATATTTTACTACACAAATCTATTATTTTAATTCTTAGCATCTCATGAATGAC ACTGGTGCCTTGTGTCCCTGGTAGATTTAATAGCGATGATGGTAAGTCTGTTGATAGGTGTTCTGATAGGCCTTC TCTACC CAGCAT TCTA CCATAG CGAGACAAAAAATCTTGGGAGACTAA TT TT A A TT( N) xGTGTATATATGTACA TGTAAGTAAGTGAT GGATTGTTTG CATAGGAATCTTAAG C CA CTGTAACAATCTAGGATTATTT CATAAAATTCA GGTTTCCTAGTACAAACTCCTCTGACTTCTGAAGGAGGAGTGGTAATTTAAGCCTGCACATGAGATTAGCAACAA GCAACTCACATCTCCCTGATGTATGGAGGCCCTACCCACTGGCTGAAACAACTACCCTTTGTGTCCAGAAATTAG TTCT CCTGGTAATCAGATTCTTGAAGAACTTTTGAGCC CAAGG GAGAGGG GCAAGAAAGAAAAT TCCCTTCACTT ACAACATGTTTGGG CATTTCTATGAGGAAAAC CCTAAG C CAATAAAG ATGATAG CTAGGCTTTG(N) xACCACAA AACT CC CACCTC TTTC CTCAGAAGAAATGACTTCTATGAAAG GGTTAGTT CTGAAGTG GTATTGTG CGTCGGATA GTAACAGAACACTG GACATGAAAG CCATGT CACCGTAAACGCACTG CC GGGAACATTC CCAACACACAGCATGG C GGCCTTCTCCCTGGCATGTCTCAGAGCACCGGAAACCCCAGATCAGAAACAGCTCTCACATATGGAAGCAGATGA CCGGCTGATTATCTTG TAGCAATACTACTT C CTTTCAT CC CAATTATG TGACCATCACTCAAGCAAAGCTTTGG C AATCTACAGACAAAA C CCACTTTG CTGGTTGATGTAC CAGTCATTAGG CAGGACTT C CAACTC C CCTGAGGAGGT ACCTGGGCTAAGCCATTCCAACAACACTGTTTTGCCCTTTGCCCAATCATTGTATCTCATGGATACAGCATTCAG GAAAATGTAAAA TTAAAGACCAGGTCGGCTAA CTTAA C TT CCTTGATACATTTTGTTAAGCTAA TCAAAAAAGTA CACCATAAATAT CT CAGTAAAT TACTGACAAC CAAACTGAATAAAAAT TAGACTGACATAAAAT CTGAAGTGTAA CAGAGAATCTGAAGAATTCTTTAACAGAAGTGTTCAGTTCATATAAGAGGTTAATTTTCTCAGTATTGACCAAGA AGAACAATTTCTTTATATGCCAGT CATTCGTAGTAACAGC CGGCAATT CACCTTGAACAGTGGACC CAAAT CACG ACTTGTTACTTTGCTAATATTTTAAAGATTACCGTTT(N)xCCTGGCTCATCATTGTCCTCGGAGTA(N)xAGTC CCAT CACTGTCGTCACACTCAT CCACTGAGGAGCTGTCTG CTTTCACGG CAAATGG CTTTCGTT TAGGAGCAGGT TCCTGGG GCTGTTTAT CTTGTGTTTTTCTTTTTTTCGC CAAGGGACAACCATATACACTAATTAAAAAAATAGA G AAGG CAG G GGAGAGAGAGAAAAAAAATATCTGTGTTACTGTCTTTTAAAATCAGAC CAATGGGGG C CTTGAT CAA GGTACTTAAAGG CTTA TCATGAATGTCATCAG CACAAACA CCAGAGAGACATAACATACATGTT CC CCGAAG TGT GCTGAAATCTGTTAATGAAGTGATAGTGCAAATAAATACCGCTCACATTTGTCTAGTTACCATTCACATTTAGTC CAAGTCTC CTAGTT CG TCGTTC CTATAATT CCAGAGAT CC TACGAAAATT CATGAGATTGGTTTG CATAATGTGT ATAACATGTAATTGACAATACAAATTTATGAAACATTTTACCAAATATATACTATGTATAATATACATCTGAAGC AAG(N)xGTCTATGTCGTGAAAGAGCTTTGGATATATTTGGATTTGTATATTTTATGGATTAAATTATATCTTTA GACTTGATAACC TC CT GAAACAAGATGAGC CCTAGGGCTTGT CAATGT TGTATGAAAG CAACTG CATCATT CAAA GTTCTG CAGAAC TT CCTCAGTCTTAGATGTTATTAGTC C CACGCCAGCATTCCAGAAAACAACTTCAGTGTTAG C GGTAATCACCACTAAATCACGCTGAAATTGGCCCAGGCTTGGATGAGACCCTTCCTTACAGCAGAAGCTCACAAT CTTT C CTTGAGAAACAACGATTTT C T { N ) xAGAAAGAATGATTTTCAAAGCTACATTACAAATTCACTTTTGTTG TAGAAATAATATATATACTACAAAAAAATAGGTGGGTTAGGGGAGACATAGAGGCAAACTACCGCAGGCAAATAC TCCATTTATGCAGAACGAGATGTGTGGCCTCTCTCATTGGCCCATTATTCTGCAGGGTGAGATGTGTGGCCTCTC TCACTGGC CCATTATT CTGCAGG GTGAGAG GTGTGGCCTCTCTCACTGGC CCATTATT CTGCAGGGGGGCTC CTG CACACCTTGACTAGCTTGGCTTTGGATCTGGGGTAGTGAATGATCATTCAGTCATGACCCTGACCAGTGTGTACT ATTTGGGCAACT CCTCTATTTTG G GTCATGGG CTATGCTG GAGTTCAGAGATGAGTGG CACAGATCTTCACACCA AGGCTTTATGCT CTAACAAGGGATTTAGA CAAGT CACAAATACCATTT TAGAAC CT C (N) xATCAACTTCGTGGA TGAATGATGGGACT CT GGATTT TGACCCAGGAAACAGGGAGT GATA CGAGTTCTTT CAAGCACT CAATAATG CAA TTAAAATG CATGTTGG AT AGGTG CGAAGCGGAGG ACAG AAGAGAGG AG GTGC AGG CTAGAGATG CGG GGAGC AGA AGACAG CCAGCGCCAT GTTCAGGTGGGGACTGGGATGAGAAAGTGAGC TGAGGAGGGTGCCCAGAT CATGAGACA GATGAGAGAGGCCAGGAGGACCAAGGGCCGTGAGGGAGCTCCTTGGTACAGTCAGACCCTGGGTGAGAAGGTGCT GGGAGAGAGCGTGAGAGCAGGACACAGTGTGTGTGGACCAGGAGGAGCTTGCATTGCCCATGGCAGGTCTTGTTG GTCAAC( N) xGAACTGCTGTGTAATAGTGATAATAACAAGAAACTTTTGCTATTTTTTATTTTTATTCAGGTATC CAATTATCCAACAAACATGTCGCTCCTTAAAGTTTTGGAAACTAATGATTGTCCTGTTTGTTTCTTOTATCAATA CTTCTAAAACGATTTAGACCAATTGCACTAGCGAAAGGGAAGAGAAACAATTTTATGTTTCTTAGGAAGCCAGTT TTTCTTGATAGGTTGT ( M) xCCCAGGAAGACACAAGCATGGGGTAGGGGCAAGAGCTGAGGGACTTCAT(N)xGG GTGAAACGTTATCATCCCTTAAAAACATAATAAGCAAAACAAGAATAACAATTTTAATCACCAGATAATTCAAGA AAAAAAAT TCAT CTATAGTCTACATAACATTT CAAATATGAAATAAACAGATTT CTGGGATTCC TTTTATAT TTT CATTTGTGAGGGGATAATCTGAGATAATAATCT CAGAG C CAG CGACAC CT TCAT GCTGAATCAC CACCGGACAAC CTGTCAAGTCTAAGCCCAACACCAACCAACCTCCTCGCAGGGCCACCTGTATCTGAGTAGACTTGTTCTGGTGGA TAC CTT CTGTGTTTGTTCAGTTAG CAGATAATAGATGCAG CTGTGCACG GATGG CT CCTCCAGAGC CTGCC C TTG GTGTCGCGGGTCATAATGACCGGAGCTTTTCCCAGCTTCCTTCTCCTTCATCAGTGCATCAATATGTTTGTCTGA CACAGT CTGTTCTG CACTGAGG CCACCTGCACAG CAC CATTCACTAAAACAGATAACTAGGCC CAC CGTGAC CAT CTATGAGTCCCGATTGCTGGGAAGTGATGTCACCCCACTTGGCTTGGTGCTATTCAGTGAGGTGACTGGGGGTAA GGTGGGGCTCCTGTTTCATTTTCTCCCTGCTCATTCACACGCACAGCCGCCCTCCTTTTCAAGATGGCTCACTTG TTGTTCAGACTGTGCGATGTTGGCCCCATGAGGTTTAGAG GAGGGGAGTGGAGATTTT CCTTAGAATGGATG CTG CAGTTAGAAGTG AG C ACAACTGG GCTTC ACTCATGAT C AC CTGGGAC AGCTG AGAGTTTGGAG G A C AATGTATCC GGCTAGGAGGACGTTTGAGAGGATGGGGTCTGGGCCGTAGGGCATTGCTGTGGTGGGTGTCTGAGTCCCAACCTT GGAGAACATGTCTGGGCCATCGAGGCGTGGGAGGCGGGCTGCCTCCTGCTCACATGTCCTCTGAACTCCCCTGCA CCTG CAGCTGGCTGGCTC CCTTGCAGAGGGCCCG CC CACTTT CATTGCTTTTCCAT CTCCTGGCAG CTGC CATGG CTCCTACTCCATTAGAACTTGCTGGGCATGGGAAGGTAGTGGAGGAAGAGGGACAATAAATAAGCAAGAGTAGAA AAAAAATAGTTG AT CAAG AT AGTTGG AG AAGGTG AGTGTGTG TG GAAAGT CAAGTACAC ATGGAAAAGAAAAGTG AGGAGT T C ATCTGGTCTCTG CC AGGACAAT ATTGGT AT AATC CT ATC AAG CAGGGTTT GGTTTAAT ATGTGAAAC ATTT CACAAACAGC CTGAGCGTATCCAGGGTGGTGATGTGTA GAGCACAC CACACGAC CAAGGG CGTGAG CT CCA GGCTTGGTGTTGTTAACACCTCTTTGAG CACCCGGACTCCGG CTGGGC CTTAGTTTAC CTAT CGATGTGATA GAA GGGT CGGCTGTT CCTTTACT CT CTAAAG CTTTACATGTTTTAAATTTC CTAAATGATT CCCTTCCG GTGG TGGGA ACTTTCTGATGAAGTGTACGTTTCAGTACCTTCTAGTCTTCTATTTTCACTTGAATTGGTTTTACTTTCCTATCC CTACATTTTGCAAGCTCTTGGATTCAGCGTAGCAACTTTTGGAATCATCTGGCGTCAAACTTCCAGTGCTTCCTG CCAT TTGACATG GTTC ATTG CT CACG ATTTGCTCTCTATTTGTT AATGTAAAGCGTGAAT AC AGT CGTAT CC CC A ACTACAAACAAGTACCCTCTGTCCCTTCTCTCTCTGATCCATTCTCCTACTGCAGTTCCTGGTATCTACTTCCAT GGTGGCAACACATTCAACCACCAGGCACATTCCATTACATCTGTGTTCAACTCCACGTGGCTCGAGGCCCTGCTT AAAT CAACACAG CC CTCTTGGT CTCC AAAGGTTT AG AATATTTCAAATGG AGGCCTTG GAAAGC AATTCTGGGAC ATGAGTTTGGGGAGTGTGTATC CTCCTG CCCTCGTATTCTGTGAACAG CGTGTCTG CTGATG GGGCGGTGAT CTG GAAAGAGAGGAGGATGTGGGGG CACTGGTGGTGC CG CCCAGCAACACCAGTGAAGGGCTCAG CC CAGGAC CT CAG CTCACCCTGTCAGTGCCTCAGAGATGCTGTGCACTCTGGTCACCTCCATGGCTCCGCAGCTGGCTACTGCAAACC TCTGAG GCACAAAGGCAG CC CTTTGTATTTGAACAT CATT CTGCAAGACTGñGTTCAACñTC CñCAATGCACTGA CAGAATGATGTTTACTCACATATCCTCTCCTAGATTTATATCTCTTTTTATGATATAAATCTAAAAAGATATTTT TATATCTTTAGGATATTTTTAGGATGATGATATTTTAAGATTCCAAAAATAATGCACTGTCTCTTTTTTCACACA GTTGAAACTTCAAAGAAGTCGTAAATATGATCACTAGTAAGATCACGGTAGCTCAATAGTTCTCTGTTGACAAAA TACC CCTCCCGT CATTTT TC CAGAGTGTGTCCAT AACATG ATTC TCAATG CAGAAG CAAGGC AGGG AñGCTAGC A GTGGCCACATCTCACAGAAAACCATGCTGTCTTATCCTTGGACTTCTCACAATTCAGCAGCAGCCAGCGACAGGA GTGGAAGTCCAAGACACTTTGGGTAGAGTGAAAAGGCAAACGCATGCTGGATTAGAAATGAGAAAACCTGGCCTT TCCATGTTTTCACCCTATTTAGGTTACGCAGGGTTACACATGGAGGCCTGGCATCCCGTCACCCCCACC
> H s 2 _ 15861648 -1587 3720
GAGCAAGAAGGGTGTATTAGTAATTATT CTGGGACACCAAGGGTAACCTAGGGCTT CC CTGGACAAAACCTACCC TGGAAAAATCCTGCCCAGCATCCCTTTCAGACACTCGGGAATATTAGGGAGCCTCTGACTTGGAACAGGCAGGAA TCGTGTTTCACTTTGACTCCATATGGGGGCTGCAAGGGTAGAGATCGGGGTCTGAGGATTTGGCTAACGGGCCAA ATGTTTTGGGTTTG CTAAAG CTTCAG CCACATCATG CCTCGAGCATTTTC CTATCACT CTCAAATTTGTGTACTT TAGGGCAAAGTCATTTCATATGGCCCTAGCACCTTCACACACGGCTCAGTGAAGCCATTCCTGGACAGGCTTCTT GCTGTCGAATGAGCCCTCCCTGTGGAAGGCGCCTCTCTTTGAACTCAGACGCCTGTTTATTTTTGAACTGGAAAC CAGAAACTTAGAATTCCCCAGAGCCACAGGCTCAGATCCTTTTTTTCAATGATAACCTGTGGAAC(N ) xTTGAAA GAGAAGAGAGGGATTTGCTGAAGGTCACCTCGTGTTTTCATTCCTTTGCCATTCTCAAGAACATTACCATGCCCG GCT C CTATAGGAAATGAAGCAATGCGTGAGAATAAAAGC CATTTGATCACA CACAG CCTCAAGT CACTCACAATT CCGAAAAGC CTTGAGGCTGT GTGCAGACCCGCGCGG CAACTTTTAGAGGT CTCACCTC CATTGC CAAGTG CAGAG CCGAGTTAAAGACTGTTATT AT G GAGTC CAAGTGAGGACAAGAAGAGT CACACTCACCATC CAG CAAAGC CAGAG AAGGTTCTCGTAGCACCCAAGGATGCACATTGTGGTCCCCTGGGACGCTCTCTGTGACGATGCCCCCACTGCTCT CCCTACCTTGCTCAGCTCCCTTCTCCCAGGAACATGGTTGCCTGAGTGTCTCTCCCCACCAAGGACCCCTTAAGC AAGG CCT CAGGGAGGTGG CGTGGTGTTT TGGGCT TGAAAC CT GATTCT TT C (N) xCTCTTCTAT GTGAGT CGTGG ATGCTACGGGACATGATGTATGTAAGAGTGTCTG CTCGCTGTACCCAGTTCCCTGT CAGTGC CTAGTGAGGTTAT TATTGCT CTCACTTTCAT ATGTG GTTTCTG AACGTTGGCACT AT TGGC AT TCTCTGTTGCGGGT GT TATT CC ATG CCTTTTAGGAGGTT{ N >xCTTTTAGTATTACCTGGAGTGCCTAAGATTCCC( li ) xCAGGCCTGGGGAAGCTGTTA GACCTATGTTCATTTCACTTGTTTTC CT CCTGGAGGTGCT CACCAGGCAT TAAATC TCACCAGG CATTAAAT CTC ACGTGTCCCTGTGTCATTGCAAGGCAGTGCGGATTTCAGAATCCTGAAGCTGAAGCATGTCAAATTGAAACTAGG AGCAAAATGTTTCGAATCTTAATTTTTTGTTTCAATTCTTGAAATATTGCTGTTGTTCAAAAGCATGCATTTCTA CATTGCTCATGTGT GG { N) xGC GTGAGC CACTGAGC CCGGTGCT CATGTGTTTGAATG CATTTGGTTACTTCGTT TATT CT CTTCCCTGTAG CTT CACAAGTAATATCT CT CTGT CG CAATTC TTGTGCTC TTTAATAAACTGTTGTGGT ATCTGAGTCAA (N) xTGTGC CTGGTG CCTCG GGACTTGGTGGTG 3CTGTGGTGAGG CGTGTGG GATGGCCACGGT GTGGTG( N) xTCTGGGACACAAGGGTGACCCTGGTGAGCTGGGGGCGAGTGGGGGGTGAGAGGGCCCGGCAGGGG CTGGCAGTTTCTTTCTCTGTCATCAATTTCTCTTCCCAGGAGTCTCCTTCCCAGCCGCTTGCTCCTTAGGTTTGA ATCTCTCCTCTTTCTTTGCCTTTCCCAGCTCTCTCCTTCTGCTGTCTTTGATTCGGTCTCTCTAGAACTCTCAGC CTCTCTCTCTACAAGACAGCAGAAATCTCGGGGGAGTTGGAGAGGGAGGCAGTGAGGATTTTCTGTGTGTGATTA TTTTAGTAAGCTAGCCCAGGTTTTCACCCAAACCCACTGTAAATAAATGCTGTCCTAGAAGCTGGATCCGCTGAG AGCGCCTATCTATCCCAGCTCTGCCTGATCCTCTTTCCTCGGCTTGGCTTCTCCGTCTCCAGGCCTCATATCXCC CAGTGCAGAGCCAAGCAACATCGTCCTTCCCACCTCCACCATCCTGCCCCAAGGTCAGTACGTTGTGGCTGTAAA GGAAGGAGGGTGTAAAGGAAGCAAGGCGACTCTGTCCATGTCTGTCCTCCCTGATAGCAGACGCTGTGGGCCTTC CATGCCACCAGGACA(N)xAGACACCCACACCCGTCAGCTGACCCATCCCCTCTCTCCTCTCTCTCCCTGAGACC ACCACCCTGCACGTTCCCTCCCTCTGCTCCCTTCCTCCGTCTCCTGGCTCATGCAGCAGGTCAAGCAGCCGTGGA CGGGCGGCTGTGGGCTGCTGTGTTCTGCAATGTGCACCCAGCATTCAGGGCCCAGAGCCTGCTGCCCAGACCCCA GGGCCAGACTTCCAGGAACATC CCTGGAAAGGGAGG CAGC CTTGAñAAGT C CTTGATTTñCTGAAGTTGG TGCCC AGAAGGCAGTGCTTTCTGCAGGAAGAGAGCTTGTCCAGGCTGTGAGCAGGTGAGCAGAGACGTCTGCAGGAGAAA AGGCATAGGCCAGGCCTCAAGGACTGTGTCCCACAGGCTGCTGGGAGAGGAGCAGGGAGCAGAGAGAGAGGGGGG CCTGGAGCTTGGAAACCGGCAGAAGAAGTCAGGAAGGATTGTGAAGAGGGGGTC{ N) xTTCCAAATAAGGTTGCA TT CTGCATTCCTGGGAAGAACATGAATT TGGGGGTG CTCTT CAACTCAAT CCAGAGTCTAAGGGAAGC CAG GTGG GTGCAGT CTTGCTCAG GCTT CT CTGGACGTGCAAAAATAAGGTC CCAGAAGACAAAGCTCACATTGGGGC CAGGC TGACATAGGTGTCTGGGCTTTCCCTGCAGCAGGTAGATACCTGCAGAGTCTGTGGGTCTCCACACAGGGACGCAC ACTCCAAGAAAGGCCCCAGGGCAGCCATGCCTCCTTGTGATTCTTTGCGAGGGGAGGGAGCTGGTCAGTACTCTG TCTTGTCTGTGATCTGTTGCTTGGGATAAAGGGATCAAAACACCATCTGTGCCCCGAGCTTGTGTTTGGTACCAA CCTGCAGGCCCCGGAGGATCTGTGTGGTTTGGTTGGATGTGCTCAGCGTGTGCCCACCTGGAATAAGACTTACCA CTCCTGGGGATAAGAACCCATTTCCTGAAATGAGAACAGTCTGGCCAAGGTCCATGTGCTGCCTCACTTTCCTGA GTAGCCTG GGAACT CACCAATG CCTGCAGAGCT CAG CACCTCATGAGCACAGATGAAGGGTGGC CATG GTGAGCC ACGTGACTCTGGGTGAGGCAGAGCCCCTCTCTGGGCCTCCATAGTTTGGGGGTTCTGGAGGTGGCTTCTAAGCCT CC CAC CAACTTGAGTAGTAGAACCTTGGAT CATTGCAACAGTAC CATT TTTTTTTTCTCTGCAGTGGG CTATAAG TTTGCAAAGCCTTTTTAAACATTTTGTCTTATTGAATCATCTTATTATTCTCGGGCCACAGTGGACAGGTGCAAT TT T CAT GTTTCTCAAGATAGACAGCCTGAG GCCCCGGGTCTCACTGTCAGGTAGGT CAAAGCTT TGAGATTAAGT CCTGTAATGCCTTATTTCAGTGTCCCAGGTCCCCAGATATTTCCCTATTAAG( N) xCACACTGAGAGTCTACACT TGAGAC CTGTAAAG CAGGGCTC( N) xTGCAAAAGAAAACAGGTGTATCTTTTATTACATATTCCAAACACCACAG ATATTTTGAAAATATAAGTATGTCAACCATTGCTTTAAAACTGTGCCATCTATTGCTCTCTGGGAACTTCCCCTT CTTGAGACTCAGCT CTGGTC CATCATATTG GAGGATGCCGAATC CTGATAAAG GATTTCTGGGCAGCTCT GAACA GAGGACAAAAGAGATGGAGATGAGCCTCCTGTGGTTGGAGTCACTAACACAGTCACATGGTGTCATTGAATGCCT GCTGAAAACACTTGTC CGAG CAGCGCTC TGGAGCATAGAC TGAG CCTTTGTGATGGTCATAG CT GGTGGC CTGTG CAGGAATGAGTTAGCGTCCCTCCAGGTGGGTGCGGAGGAGGAAATCCAGCCCCACCCTGGGAGACCTCCTGGGTT CATCCTGGTCTAGCATGAATGACTCTGTCTTGGATACACCAGGCTTCCTTCCCCTTTCTGCAGTAGAATTCATCG AATTCTGC CAGTG CATTAGAGATGTGGG CTTAAGAAAAT C CAAATGACAT CAAT CAACTAGTGATGAACAAATAT TTGCAGAG CCCTTC CT CAATGT CACACC CT CAGTTTGGAG CTATAGTT TAGAAGñATATGGGGATTAATGAAACC TGGCTGCTTCTGTTACATTCAAGAGGAGAAAACCAACAATAAACACACAGATGGTACGTCAGATGGGGAAAAGAA GGCAGGGGGAGAGGGGTAGGGAGGAAGACTCCCCACTCCTGGGGGATTCGCTGCGGAGACAAGATTTATGTTCAG GAAAAGTTAACATGCAGGAAAAGTAGTTACAATATAGCTCCTTAGTCTGAACCATGACTAGATAATCATATGGTG AG CCTTTAAATGTACCTGTGGGATACGGAAAGAAAGGACC TC CTTGTGTC CTAACC CATGGAGCAGGGGGGTGCA ATAAAGTACTGTGGGTGACAGAGTTATTTGAAGGGGTGATGAACTTTGCGGGTGAGATGGCCTGAATTATCAAGT TCA(N)xGGCGTCCTCGGGTTCTTCCAGGCCTGCCTCTTGCCTCCTAAACATTTTCATCTGTTTCTGGATATTGG TGGCTCCATGAACTTTGGGCGATCTCAGCGTTTGGTTTTCAGGTCTGTGACTGGAAGGCAGAAGGGATCTGCTGA CT CCAC CTGCAGATGGACTGAGTCCTTC TCGTGC CCATCACCTTACCG CTAAGGTGTAAAGGAC TCAGGAAAGGG ACACAG GAGCTGGGTCAGGGTAGCTGC AAT CGAT TG CTTGGACAAGAATGGACCAATGACAC TCA CAG CT GACAT CATGGTGGTGACTTTTTATGAAAGTCAGGCCAGAGCAGACACAATGAAAGTTAAAGCATTGAGTGTGAAGTCTTG TCTTGGAGTCAAAAAAGCTGAACCAACTAGCTCAGATTGGCAGGGGGTGATGGGGATATATGGCAAGCACAAGGG TATGAGAGAAGGAC CCTAGAAGGTTCTAACTAAAATTAAC CT CAATTTTGTTTTAT T T T T T T T (N) xGTGGGCAT AC CAGGAAGAGGAGAT CTACTG TGAATATC CGTCAGGATGAGAG GTAACAGTTG TGTTTTGTACAGGC CAAGAGA AAGACAGCTTAGAGTT( N) xGGGAGTTCCCTGCTGGGACCATTCCCCAGTTATTTACTTCCTTGTAATAGGTCAT GCTTAGCGTAAGGCCAAGGAGACAAGACACTGCTGACTCATTCTTTCCCAGCTGTGCAGGCTGGGCACAGAGCCA GGCTGCTTGTCCAGCCCTGGGGCAGATGCCGGCTGCTGCGGTTTGGCAGTGGGTCAGCAGCTGGGATGAAGCCAC AGAGTCTGACCTGTGAGGAGGGTGGTGG GT CCACAGGAAC CCAG GCTCTG CAGC CC TTCTGAGATGGG CGGATCT GGGCATAGGAAAGAAATGCAATCAGGAATGCAGCAAGGCAGAGTTGAAAAGTCCTGTGAAGACTGTGCTGAGCAT CAGGGTCGGGTAAGCCTCCTCTGGGGGTTGCATAGGAGCTGCTCACAATCATCCCAGTATTATTCACATCATCTG TATTTAGTGGCACCTCCTTGGTGCCAAGGTGTGAGCCATGGCCCCGGGTAGCACAGCCTCCTGCTTCCCAGCTGC TGGAGGGATTGCCCTGATCCCATGCTTCTCCAAGGTGGTCTGGGGGAAGCAACCCAGCAGCAACTGTAGGA(N)x CCTAGATAAGCTCTGCTGCAAAACACAAAAGTGGCCATGTTCTCTCCCAGCACTTGAGCTTGCCTTGAGACTGAA AT CCAAGTGATGCC CCTCAGGCTATCACCTCTGGTGGATGCC CCTCCAGC CTGT CTTACCCATGTCCTTCTTGGG CCATCAGCCTTCTGTTACCTGCTGGGAT( N) xTTGGATAGAAACGCAATCGTTATTGG( N}xTGTGTTCCTTTCT CTGTCGTCACTGAGTG CTCACTCTAGGCAGCCCT CTGCTAGG CG CAAGTGGCAGGATGGAGGAG CGTG CCAAGGA CTAGGAGATTACATGGAGAAGCTAACATTAGCCTTAGTGCTTCTTTGCCATTTGGGGAAGCAGATTTTGGAAGCT GT CACTTC CAAGGTGGAGTT CTGGGAAGTGTTTGGG CTTC CC CTTAGCAGGTGCTC CCTAAATT CTAG GCACTCC CCTCCAGCACTAGAGCTTTGTATTCTGACCCCTGACCCTCACAGCCCTTTGCCTTTGATCAATGTATTTTCTCAG ACTGGAACCTCCTTCCTCTGCTTTTCCCTGTTTATCAACCTAGTTTAGTTGCCCCTTGCTTCAGACAACCTTCCC TGTCTCTTTCCCTGCAGCCCCCGGAGTACTGTAAGGCCCTCCACCAACAGACACCTTTCACTTTATTGTCCTGAA TCACAGGTTTAAGCTTAAGCTGGGTTCCCTCCATTAGATCTCTGGAGATCCAGAATTAGGGTCAGATTTATTTCT GTTTCC CTGCACAATAT CTGTTACATAG CAG GTATTGTGT TG GAGAGG<N > xGAAACTGAATTGAGTCCATCTCA GCTTCT C CAACAAGGACAGAAAAGAAA CAG CTCAGGAGACGTAATCTTATAACATGGGGTTT CT CTGTAATTTGC AT AAAT AACC CACT CTACTAGGTTTCACGC CTCTTGGGAGACCTAC CAQCTGCCTG CCTC TC ACTT CCAGTTTC C AAAGAGGAAGGAAGGTTTCTACCAGCCCAGCTCAGCTGGTCCAGGGACTAAATTACCTTCCCATCACTGAGCCCC TT T CTCATTATGCCAGAGGAAGAT CACTACTGC CTGTC CATTTGGG CGGCTTATC CACGTGGCCTGGTTC CAGGA GCCATCTGTGTGATGTCATTCCCAATACGATTTGAAACTTCCAGCATCTGCAGCCCCCACTGGTGTTG( N) xCTT ATCATGGTGAGGGTTCTAGAGATGGGGTGGGGCCGGGCCCTGTGTCTCCAAACGAGCCCAGTGTATCTCTTGAGG CTGGCCTCTCCCTCTCCCTGGTACAAGCCAGGGCTTCAAAATCCAAAAGATCTTTCTTCATCTTTTCCAACCCAC TATGACC CAAATATGCAT CTTCCTTTAAGGAATGTCTTAGATTTAAAAAAATTAAAAACT CAAAAG CTTC TGAGG CATCTTAC CTGAGCTTAT CAACAT CTTT CCCCCATTCCTT CTGATATAACTACT TGGGCTGACTTGAAGC CTCAC TTAG GTGCAG CTGCAATG GTGTGAGG{ N ) xGGTGTGCGGTTTTATTTGAATGGAGCTCCCGTTTGAAATAAAAGC TCTATGATCTAACCTCTCCTGGGCCTCCTGCCTTGCATCCCTAG GGGAAGTTTTGGGGCATAAAAGTCCAATTT C TTTT CGTATTGGACGCTTGGATGAGAGAAA GAGAAACGAAATCCTGGCA CTGAAAAAAG CAGAGA CAACCAAAC C CAATTATT CCTACTAACTGGGAAGGAAT TCCATTTGAGTTAGTGGTTC CCAAAC T
> H s 2 _ 7908924 - 7919887
AAATCCTCCCCATGCAATACCCTCAAAGGGCCAAATCCAGTCCCAAGCATCCCTGTGCCCTCAGGTCCCCTGGGA GC CATACAGTGGAG CCCCTAGATATGTGGG CCAC CACC CTTTCTCCTTCTCTAG CCACGTGGAAGT CTGAGGGGA CATCGCCCCAGACACGGAATCTTCTGCCTGTGCCTCTGTGGCAGGAAGAACAAGGGCTTAGTCCCACTCTTGACA CCAC CAGG CCGGGCAGGT C CTTGC CCGG CTGCGC CTGGAGAAAAGGACAGTGCAGGTAGAAGAAAC CAAC CTTT C GGGCGCCAAGTCCATCTCCCAGGCACAGTCCACCGCGTGTGCCCAAGGCTTCGTTCATTGAAGCTTCTAAATAGG AGATGGAGGGAGATGCATTTCTGGCTCCTGGTTGGGAGGTAGCGAGGGGAAAAAGAGAGAAAAAAGAGACAGAGC AAAATGTT CATAGACACC CACAGCACAGTG CTGAGGTG TG CAGG GGGAAGTTGTTCATCTACTTTGTGGG CCCAC ACGTGGGAGCAGGCAGCTGGCAGGGGCACAAATGGGCGCATCGTTTCTATCGCTCATGCTGCATAGGCCTGGAGG AG CAG GGCTGGTGAGTATGCATAAGAAAA CTGTTGAGC CC CACAC CAGAAAACATG CCCG CATTGCATTC CAGT C AAGTGGTGTTCAAGGCAGCAAAAATGTCTGAGCTAATTCTAAGGACCAGAATTCTAATAGAGCCGGCCCCTTCTA TTGGCTGGGAATGTTTGTTTTGAAACAGTGGGCTCATTTAAAGGGATCATTGAAGAGATTCTTCAGGGCCCAGTT CAAATGTCATGACCTGTCTAAAATTATTTCTTCTGCATTTTTGTATCACCTTATATATAACCCCAGTATAATATT TACTATGAATTCTCTATG CAATACATTGTT CAGCGGTGTG CCTTGGAGACAGCAGGTGTTTGTT CAGTGT TTTTA
T (N)XCTTTGCAGGAAGGGTACCCAAGTGGGAAGATGGAGCTTTGATCAGCTTGCACATTACTGTTTTAATTAGC TTA(N)xCACAGAGCCAAACCATATOAGCTCATGAAGAGAGCTGTGTAACCCAACAAAACATACTGGGCAAGAAA CAACAAATTCTCAAATTAGTTTCC TAAAAATCAAAATATTAATGGTAG CAATGCTCAAGG CCAAAG CTAAGATG C CTGTGCAGAAACAGATAAAAAGTTAACTTGGCAATGCCTGCCGTGGTATAATTTCACATCTGTCAATTGTTTTTG TTTG CTTAAAGTTGAAGAAATAGA CCAAAGACCAATTG TAAAGT TGTGAAGAAAATTCACTAGA TGAAAAAAAGT AAAAACCTCAGATTATGTAGTGCTTTCCCCTAAATATTGGTAACTCTGCAGCTGCCAACAAACAGGACAGCATGT GAACATTTTTGCATTTAGAAAGTATTTGAATTTTAACT TG CAAAAGAAAACAAAATTATATCTGTTGCATTTTCA AGACAATTTTTAAGT(N) x CAGCCATGGAAAAGGCTGTGAAAAGCTATGATTGCTGCTCTGAGGATCTGGGCTTT TTCTTCTCAGTACTGTAGTGAAAACAGAAATGAAAACCTTGAAAAGTAGGGGAAAACCCACATTCCATTAAGATC CTGG CAGCGGGTGCAACAATTATT CTAGAACCCG CAGCAG CCTT TT CATTTGTATC TAAG GAAT GGTGGATGGAC TC C C T C T (N) xTGGGTCCCCGCTGGCCAAGGGTCAGCAGAGCAAGCCCTCTGTCAGGTTCTACCCCTCCATAGCC TC CACACT GCAATAATTC CT GTGAGATATT CCAAAGCT CACTTTGT CT TATAAT CTATAT TGA CAT TAACTAAGA AAAGTAAAAACATATAAAGAAAATAATAAAATGATCAACTCAATCGCAAAAGTCACTATACTCAAAAACCACAAG TTAGCGACCTTACTGTACAGATCACTCTGCTAGATTGCTTTTGTTTGTCCCCATCCTATGGATGATGAAGATAAG GTGCAAATAATGATGAAATAAGGCTTAAAATTTCTAACCTAACCTTTGGGTTTTCCTTCACACACCCTTTAGAAA TATATTTGTTAGATGTGATTTTGC CATAT C CTAGTTTT CCTTTGTTGT CTAAAGTATTTTGAAATT CTAAACTGT CC CT TGATGACAGCACTCAG GGAACTGG CGAAGGGACATGAGAT TTGGACAGAGGGAGGACGATGTGGC CAGGT G CTGG CCTT CCATGG CAAG GG CTGATGGACATAAATACAAAGGCTAGAAAACTGAAGAGGC TACATGATGAAGATT CATTAAGC CAGCA CATAGA CAAACAAAG CTAGGTAAGAACAATG CAGCACACCC CAGAGCAATTAAAGCT CATG C ACAGCCTTGTTGGG CAGGGCAATTGATTTTAGCAGGCT TGGAACAAGAACGGGC CT CAG G CATG C T { N) xACCTG ATTT CACAAAACAGACTGGAATG TGCTGAGAGGCTGCCAGGCAGGTGGTCAGCAT C CATG CAGTGGGTGCTCTCA CCAATGCCACACCTGGAGAGGAGGAGGG CAAAG CAGATGT CGTC CGGGTGATTTT CAGCC CTGGGC CCTT CCCCG TT CACACC TCAGGCTTCT CAGCATGTATTG CTTTG CCATG CTTT CCTCAG CTTACAGTAATAACTC CTTTTGCCT GCCACCCACACTGACATGAGCCCCCACTTAGAGATGGCTTCGTGAATATTTCAAATTTTATCAGGTGCTTTAATT AAATCATTGACATTTATGTCAGAGAGTACA( N) xTAGAGGATAACAGCCAAGACTTCTGATTTCCTACCAGTTTC AGAAAATATGGAGCTCCCACAGCATAGGAATTCTCTCCATAACAGAATGTTATGGCTAAAAGCATGACCCATCTA AAATTCAACTTCGAGTATTTATTTAACAAATAACTACAGTACTTCATTAACTACT(NixTAGTTGAGCCTTCTGC TT AT AATT CAAGGTGACC CACATG AATTTGGAACATTACATCCACT AC AG CATG CCTAAT ACTACC CTCAGACAT TTTAGATTGCAAGAGTAAATTTCCCCCAAGACTGGCCC( N) xTTCCCTCAGCGACTATGGCTATCTATATATATT AT TCATGACCTTGC TTTAAAAGGAATTATC TGGTTCTGTGTCTCTG CAAGATTATC TTGC CTTCTAACACAGCAA AACACATCAGCTTTATGACAGCAATACTTTCTCAAAATTTTTTTCTAGACCCAGTGGCATGTCCCAGCTTGATGT GG CT CCATAAGTGTGTGATATTGTTTAAGATTCTG CTCAAATGGACTC C CTGATT CTGAACCTTGGGAAGGAAGG GTATCAGCAGCATCGGAAGGGAGGCTCTCTGGAGTTTTTCTATGGAGGAAGTCGCCACACACACAAATGCCGCAT TGTCAACTCTCTAGAACACAGTCATCTCACAACTGTGGAATGTCCCTCCTTGCCAAACCCTCAGGACGACAGCAG TTGAAACTTTCATATACGGAGGAGTGGAGGCTCCTACGGAAGAAAAGAGGAAGAGCCAGTGCTTGATGCAGGCTT TTGACACTTCAAGTTGG G CAAT GTTTTTG G TTT CAGCAAAGTTG CTAAAAAC CACAATTC CCAAG GTTGG C CTGA ACTTTG ATT CAGAG CT TGAATT GC TAAGAAAGGTGGGG CTGGCCACAGGTTT CACTTGG CAG CAGCT CGCTTCTT CCCTGTGGGCGGCTACCCCTGCTGTCCTTTTCCATGCGGAGTTCCTGGCCATGGGTCCATGGTCAGTAGGTGCTT CTCTCTTACACCCTGGGTCACAGCACAG <N ) xTGACAATAGAAGAAAAGGAAATGGAGACAAATGAGAAGAAAAA AGTGAATTTAGAAATAAAGG GAAGATTCAAGCCCAGTGAGTGAC CC CTGT CATCAGAGGG GTGTGGCAGGTGGCC CTACAAGAG CTT CACAGC GCAAGT CAAC TT CCCATGAG CC CAGACAGGTCTACAGATGTGG CTCTC CAGACT CTA TG TT GAGAGGAATGTTGAGG CCATACCT CAGAAGGAAATCACACAGAGACAAATAAGCAAGG CTGTGTGGGCCAG GAGGAG CAGGGGACAA CCTG C CAATTTG CC C CTAAGGG CGAGAAAT TGAAAC TGGGGTTAGAGAGT TAGACAAGC AC CTTGGATAGAGC CCTATAG TG TTTTT TG TTTT TGTT TG TTTG TTTG TT TGTT TTTTTACTGAGCAGGT CTTAC TATGGTAACACC TGGT CATAAAATTCTAAAATCATTTGTTTAGAAAAG GATATGGCAATT CA CT GAGC CCAGTTC CC CAGC CATGGT CT CCAG CACACTTC CACTAAGAAGCC CCTA G CAATGGAAG GT CAGCTGTCTCTAAAAG CTGTG CATT CTAT CTTTGGAGAAATGTGGAC CT GAAACGTTTCTG CAGACATC CTGG CC CT CC CTGAAAGTTTTACC CTA TGAGGGACTCAACAGAGGTGATTTGACATTAGGACACT CTGATG CACA CATG CACA CAGTAGTT CT CAGAAATGT G A ( N ) xATGACAGTCCTTCCTCTAGGGTGCTGGGCATGCCTTCGTTTCTCTATTCTGTGAG( N ) xCTAG AAAAAT ACATATTAAATT TT CAGTGTGACAAGTATATAAAATAAAAAATT C CACAAGAAATC TTAT CAAGTT CCTTTC CAT GAAAAGAACGAACTTGAATT GTTATTGT CTGTCT TGCGTTGAGT CTTTGGACTCAT TGTTAATTGAT C CAATGTC ATTTTAGC CAGC TC TTAAC CTC CACACTTTTTCTTCCTTCATAG GAAAATTC CACTGC CACAAAAAGTACAAGGA AATATG CTATGCAATC CTA C CTTTATG AAATTTACAAT CTAGTATAGGAGAT GAAC TTAAAATAATTGTAAG GAA GC TGTGAACCAT TT CT GCAATACATGTT TAAGCGTGGAAGG GAGAGTTGCGAAGAAGG CATTAACAG CAA C( N ) x TGTGAAAG GTAAG CAAAACAGAGCAG CCTCTG CTG TG TG T( N ) xTTTCTATCTTAATAATAG AG ATG TTA TAG TG AAAATGG CA CACAAAATG CAC CAACAAGAT C TG T TT TTT T TTTTTAG G CC CTTCAACTATTTTTTTA GG C C C TTC TTG CTTG TATTT TA TG ATTTACAAA TTTT CATACTTTT GTAATCAGGAAAAATAAATATGATAT CAAACTTAAAG TTAAAAAATG CATC CAAAGG CACTGACGTACGCCAGGCAGGCAACTT CATTCTCC CATGTGT C CAACACAGAACT CACAAGACTCTTGG CTGGATGTCCTCTTTT CAGGAAAGAAGGTT CATTTTTAGAGATAATTACACACTGTGTAAG CATAG AAGTCTC CCTTTTATTC TGAATGGT TTCCTGAAAACG CTGTAA CTAC TG CATTTAGT TGTGGCAAACTTG GGTGTT CAAGAG GCAT GGAACAGC CAGAAAGGGT TCATAAAG CC TGGG CCAGAT T CAG CACAAAGATTTT T C TAG GCTCAAAAAAGG GT CT GTAACTG GAC CAGACTGATCTGAG CACCTCCT CATACT TTCTGGCTGTTT CTGAT C TAT CCT CGACATG TTACACCGTATT CTTG TTTAATC T TTATAG CACAGAGG GGATAAAGAACACAGAGG CTTAGGAAG ATGAG GG T GGTCGGTG GGTATCACAT CGCTACCT TGAT CCAGGCCTGTCTCCGCTCTCCGCTGAACCAAGCTTAC ACCATGGGGTGCTCATCCTGGCCTTGCTCT CTGAGGAACTTT CCAATAGTAGGTG CATTTATTT TAAGG CAAAAC AAGAAT CTGT CTGTAAGTACAT CTTGGAGTGTGAGGGT CGAGACGG CTGATG CTGTCTGTGTCTATTGGTTCTGT CAGT GC CTGTTTGAAAAAGAG G CTGGTATTTT AT CAGTTTTATCTTTG TTGATT CC AA GATGTTTGTCTCGTGAT TG CATT TT CTTC CATGGCTAAATATTTCTGAGGGGTTT TAAAGT CATAATAATTGATATC CTTT CAAACATG CTT TCTC CT G TTTTT CTGTTTTTGACTGAAACT GAAT C T C T G C C T T C T T T T T C T T ( N ) xA A T TT C TG TC T TC T TA T A A G CTCATTTTGGG GGTGAAAG C C TCTTTCATTTCT CCTCTGGCTT CTGGGCGCTCTCAT CTTCCTCCCTGACGACC AGGCATGCATGAGAAG CAGAGACTGC CCAG CGTGAACCTGAT CTGTGTCTGCTG CTGGATTCTGA C C CAACTGGG GATG CCTGTG CACTGGAGGG CAAG GATACC TGAAGGTGTT TT CACAG GGGATTTGGGGACTG CTTAGAAAGAGCT AAGCTGGAGCAG GG CCTAA CATAAAGAG CACAAACTAGAGATTC TC TCTGGTTGTT TAG GAGG AAAAGTGACGGA AGTAGATGTTGC TTAACT CGCTCCAGACCT CACCTCCACTGTGCAATGACACAGTG CT CACAAATGGCTG GTG GA GACC CGAACTGGAG CCAGAGGC CAGG CAGGATA C TGCTT CGTGGGAGGATGAAGAAGGG CTTTGTACTA C CTGCC CCAACTTCGACACCATTT GCAGATGAAG CT CCAAGAAACATGAAAACCAGA CAG CTATAG CGCTTTGGGTGGAGT GTGCTGCTGGGCACAG CAAATATAAATGGC CTTAAAGAAAAAGCAGAATGGACAATGAATGC CATC C CTTAAAGC GACGTCTTTG CTGCAGCCTCTT CATTTG CC CTCGGATTAATG CGTGATAGTT TC CTGT CC CTGG TACTTG GGGCT GTGC AGTGAGGTGAGT AGGACGTT AC CT CTGGGGGGTG GAGG AG AATC AGATAAACTCAG C C AG A C CAAAGAGTG AGTGAC TGAAGTTCTCTGGAAGGACAGG CT TCAGTCCTGGGATTGAGGGTAG CAGTTAAGGGTC CATTTGGCTTA AG GCAGAGAAAG GTAT GT TAGC C T G T G ( N ) xGAAAATGGGACACCAATGAAAAAATGCCAGGCATCCAGAATAGA ACTAGATAAC AAAAGT GC TAAC AG AACT TAGAAGTGAG AG GG AG G ( N ) x AAAGTGAGAG G GAGAGAAG AGG G GGC TCCTTGTCCC CACA CGACCACTGCACCT CAATTT CTCC T CTAGAGATGTC CCGTGTGCAG GCATGTGCACCTTGT GTAC CACT GG CCACGACAGCTATATTGCAAATA CTGTGATTTTAGT CAACTGTG CCAC CCTTTC CTAGAAATAGT TTAATCAGATAGCAAAAATATATATACAACAACGAAACATAAATATTAAAATTCAAAGTGAAGTAAAAATAAAGG AAAACAGTAAAAGGTAA CGCAGTGGA GAAAATGATCATAGTACAGAAAAATT CATACfi GAGA GCTTTCTCATAGT TACCAACTGTTT CCTC CAACGTTCTGTCTAAGCC CCTCATAACCAAAG CAATAGGACCACATAACCAGTTTTAAG CCTTGTATTGTT CATGGGAACAACTGGAAG CTCC GCTGAGGGTT CACCATGTTTGT CATCGC CTCCCCAGCACTT CT CACAGC CC CTGACATG GCACTAGATGG CGGTTAATAGATGGACGAT GAAGAACATGAATGAT G CA T A A G TA C ( N) JíACAAGTTAATT CATTTTTTA G CACT CAGATGTGAGGGACAT TTCTCTTG CACATCTAAAAGACAG CACAATG ATGAAGTGAAATGTATGTGGCTGTTATTGTTCTAATAGATTAGAAGGTATTAGTAAACAA
>Hs2 107547731 -107556647
CAAAACAAAAAACAAGAGCACCTGTTTCACACAGATTCCTGTTTCCCTAAGGTAGGTTGATTCAGGTAGGTTAAC CTCGCTGGGTCTCAAGTCCCTCATCTTTTAAATGGGGGTCATTTAATTGGTTGCCAT(N)xCACTCTCTAAGTCA TAGCTACTATTATTAGTAGTATTATTAGCAGTA(N)xGCCTCTCTTTGGTGGTGCACCCATTCTATTTCTTAAAA GCAGCACCTTGGGAGGAGGCTCTCTGCCTCCCATCTTCTCTGCTAGGGGATACATCAGCCCCAAGAACATCCTCT CCCTCACTCTCCCACTCCTCTCTAAGTGATTTCATTCTGTACATGAGACACTGTGATTTTTATGTGAGAAGCACA AAGCATCTTTCATAGAATTTTAGCTTCAAAGACAACATCAAAAGGTTGTGTTTGATGTTTCCGAAGGCAGCATTT CACTTCATGACTATTCAACGACAGGTGATTTGACTGATTTTTCTTTGTCAGTGTTCACCCAGATGTTTGGCACAG
C’rTGACAAGGAGAGTGTCTGAGCGCTGCTCTTCTGTCGTTGTCAAGAGCCATTTGGGAATATACTTGAAATCCAA CAACAAAGATACACTCCTCCTCAGCTTTTCTGTTCACTATATGGTCATATCAGTATCCTTGAATGACAAACACAC TCCAGCATTCATCCTGA’TGGTAACTTAGATCATGTGAGTCATGAGTTCAAGGAAAGTCTAATCTGCGGCATTCTA ATCTGAACTCCAAACAGCCCTGGAGATTATTATCCAGTAATGTTATTTCCTTACTTCCCTTCTCTATATTCTAGG TATTATAGGAGGCT(N)XTAAAGTTGAAACTTTAAATGTGTTCACAAAGTGGTGGTTAAAAAAAATACAAAACAC CAACCCAAATAGACAAGTGGGTGTTTTTTTGTTATTGTTGTTGCCTCTTGTGCTCAAAGTTGTCTGTCTCTATTT GGTACAATGAGGCAAAATCACAGTTAAAGAGTAAATGCAAATATCACAATTTCTGACTTCATTTAGCCTCACTGC ATTTACCAAAGCTGTGTATAACATCTTGCCTATTCAAAATATAACATATGGACCATGTGGGTGAGAGTGCTAAAA GAGCCTTCATTTGTGCAGTCTTTGACTGGGGAGGCTTCAAAACAACCTGAATGTTAAGTTGCTATTTTAAAAAAA GAGATAAAATTTTGACTTTTAAAAACCATTGAAAAGAAAGGAAAAGATACCATAGGTGAAGAGTTTCCTAATGTC TGCAGGGAAGGTTAATGGAGACCTCTGGGAGAGCAGGAGTATTCTGAGGTGCCCCTGCCACAAGAGGGCTCCCTG GGGGCGGGGGTGGGTACAGAGTCAGTGAGAAGTCTCCAGAGCAGGAGGGTGCAGGCTTGGAAGCGTTTGAATTCT AACAAAGAATGTGTTTGAACTATTGAGGTGTGTATATATGCTTTC(N)XGTGTGTGTGTATGTGTGTGTGTGTTT GCAAGAAAACCCCTCCTGATTAGCATCATTTTGTTTGCAACAAAACCCCTCTTTACTAGCATCTCAAAAAGAAAA GTTCCAGTAGTACAACTGGGAATCTAGCATTTGCTTCTGCATTGCTTCTATAAAGTTGCATTCTTCAAAAAGGAA CTCCTTTTCAGTTTAGG’rCAAAATAGATTGGAGAACAAACTAACTTTCAGAGATATTTTAGTGAATTTTGGGCCA GTTATATAACAGTTGTATTTGTGTGTCTGTGTGTGTGTGTGTTTGCAAGAAAAATCCTCATGCTTCTTTGTAGTA GAATCTGTATCAAGGAAGACCATGGATAGGAAAGGAAAAATTTTAGAAATCTACCAGAAGTGGAGTACATGGCAA GTTGTATGAATGTTATATTTTAAAAGTTCAATCTCTCACTAGTGAGAATTTGCAAAGTAGGACTTTTAAACACCT TTCAAATGTGTTTAAATAAAAATAGTCTGATTCATTATAGAAAACTGTATTTTTTATTTTAGAAATAGCTAGAAC TATTTTAATATGAAATCTTCTTTATTTAAATATGTGCAACTTGTTGCATTTTGTATAGATAATGAATAGAATTTC AATGTAGTTGTTAAATTAATTCCTGAGAGTTTCATTTCATCATATACGTTTTCCTTCCAATCTTTGGGTGACAAA TTTCAAGGTCTAAAAGAAAATTAAATATCACATTTTCAGTCGAAGCTATGTTTATTTGATCAATGTAGGTCATTG TTTTAATATACACACGTTCCACGACACCAGCTCTCCCCTCTCCTGCTGAGGGATC(N)xGTTCTTTGGGGATTGA ATAAATGATGTATATTAAAAGTTTGGTTCATAGTAAGTTGTCCTTCAATAAACAGTGTTATTATGGAGAAAAATG GTAAATTCCATTCCTCCTAGAGCAGGGGTTATTTAGCCAAGGTCCATCTGTATTTCCACGAAGCGCAAGCTGAAA TTTAACATTTCCCTCCATTTTGAATATAGGCAGCAAATTGCATCACATTTGGGAACAGTTTGGGGGCCCCAATTA TGGCTTTGTGAAGTATTGCTTATATTTGGATTTACAGAGTATATACAATTCCTATCTCTCTTTGAACTATGGATA TTCCATGTTCATGGGTAGGCATATTGTAG(N)xAAATGCAAAATGCACCAGTTGCTACCTAGGTCAGATCACTGC TGCTTGATATACAATTCTTTTTCTGGTAGGAGTGGACAGCAAACAGGACATTGAA(N)xACCGTGATGTAGGTAA TCATAGTTGTATCATAAGTACATGTGACAGTGGTTCCTTCTGAAGTCAAACTTTATGACTGACTTAATTAAACTG TGTTGTTACTAAGCAAGTAGTTCGGGAAATGTACCTTCTATGTTTTTGTCCTCAGAGAATTAACATTTGCTTGAG GGAGCTGAAATTATTAACTCATGAAACACCAGCAAGACCCATGGGCCATTATTCACTCATTCATGCATCATATGC AACATGGAGTATAGGTCATGTGAGACCATGGTCTATAAGGGCAGTATAGATGGTAGGCAGAGCACTGAAAAGATC TGGAACAACCTCATGGAAGAGGATTGGGACCAGAGTCAACCCTCATATATCTTGGTTTCTGTATGTCACTATGGA GTTACTACTTTACTGGTCTCAATCAAATTTTTTATTGATGGCCTGATTTTGGAGGAAACAGAAGTAACACTGTAT ACAGTAATATCGACTTTACAAGTAATTTCTTAAGAATAGGTTTCTTTATGGCTATATGATTTATTTACTTTGTTA TAATCATTTTCACAGGGAACATCTCCTGACCTTGACTTTAATCTTAAGAGACACTGAGTAACTTATTACATCCTC ATCAAAGAACAAAATTTATTGTTTAAATCTCTGCTCTCCTCATGCAGATCATTTATGATTTTATGGATATTTGTC ATCTTACCTTTTTTCT(N)xGACTTTCCAATCTGAGTAGTTTCAAGTGTTTAGCCCATCCCATCTCCATAACCTT TTCAGTTGCTCTTATTCTGATGATCTTCAGTTTTATTTTTTCTTGAACTGCAGGGGGAAGAGCTGGCCATCTATT ACAGAAACAGATTTTGTCTACGTTTATACATAGCGCAGATAATCAAAATTTTTGTTTTGTTTGTTTTTTCTATAT GAAGGTGATGTTTAACCTGTCCTACTGAAACACCATGTTTTGATGGATGTGTTGGCTTTTTAATTTAATTTAGTT TAAGTTTAGTTTTCCATTATAACATATCGAGTCTGTAGTGGCTCCCTCTGTTAGTCAGTGCTCTCTAG(N)xCTC CTCATTTTTATTTTGGATTGAATAGGTAGCTAAAAACCTGTAGTCTAAGTAGAGGTGATTTTTTTCTTTCTACAC TTATAACTGGCAGAATAATCCTGGTCCAGGCCTAGTGCTTCCTCAAGGGAGATGTCCCTGCTCCTGñCCCTTTTG TCTCCAGTCCATTGTGTGCCTAGGGAAGACATTATTCCTCAGGTTCTGACTGATT(N)xTGATGTAGGGTGAATG T'rTGGATAGAAATAGAAGAAGTCAGAAGGGAGGAAACAGTATAATTAAACTAAAGAAGCAGAAGCACAGAGGAAC TGAGACACATAAGGAGCAGAGGAAAACTCAGATTTTTGATGACTTACAATGTTCTAGGGTCTTACCATCTTATAT ATGTTATACTATATAAATACATTTATTATCTGACTCCTCTGTAAGAATGCAAGAAAGCTGCAGGCAATACCATGA GCCTGAAAAGAAAAAAAACAGAATTTGGGCCCACCAAGGTAGCCAAGATAGG(N)xGAATTATGTTAATTAGTAT ATATTGAATTAAGAAATAAGGTTTATAATTTTATGATTCTATAAAAGTTATTTGCATTGATTTGGAACTTTACTA TTAACAGATCTCAGTGGCTGGTACAGGCTAAAAGTTTTTATTCTGTAATTTACAGATCATTGTTAAAAATATAGA
tatgttggaatccaataatatgccttactaagcccaatatttaacctttttcattgacaaaatgtgattttaata actttcttgtctttgagtaataaggcaatatatagactacctatgggatatacatattaagtagcctttcccaaa actttccttttgatgaattagtttttttctaagaatagatttttttagattgaatattgagaatgaaaggtgtaa TTATAAGAGGACTAGACAGAAAGCACTAATGATAACCCAAAGCATGTGATTCTACATATTAACACAATAATTTCT
gaagctaaaaaattaaattctatgttgagtaaagttgagtatgaaagtacaaatcagaccatgaatattggcaaa ATATAACACAACCATGTTTTAGAAGAAACTCAAATTATGTGTGTCTTTGTGTGTTTGTCTGCACACAAAGCTGTA AAATCTGCATTATCTCACC(N)xATTGTATTTTATTACAATATGTGTAGTGTATACTAACTTAAGGGGATATATG TAAACAAAATTAAAGTCATGGGGAAAATGGCATCTTGCTTCAATCTTCAACTTAAAGTTACTCTTAACAATCAAT TTATAC CATTATGT CAAATTTTAG TCAT CACTGCAGAATTTTAGACAACTGAAAAGAGACAAAGTAA CACCAAAG AATTAAGCACATAAAGTGATATTGATTAAAAAGTTGAAAG TAAAAT CTAGCTTGAC TAGAACTG AACATTCAGAT CTAT CT CTCCAGAG GAAAAT CTAACTTGAATCATAACGGT TCATATTTTGA CTAGT TCATACCATGT CAATTAG C CACTTATAACTTGAAAATAC CTTT CCTCAGATGCAT TT GACTATCTAAAATCCTACTGAG CATG CTGTTTGG CAT GTCTTATTCCTCTGAAATGAAATGAGATGAAAGTCATCTACTTTCTAAAAACCAGAAAATCAGTTTGCTTGTGAT TTAAAT TTCAAAAAATAGTT TGAGGAAAACACAAAAAGAATCAACTGTTTAAAGTCTTAT CTTTTCTT CATTGCA CAACAAGCTACTTCTACCAAAAACAAGGAGTATGTGGATGCTTTCTAAGAACTCAAAAATGGGAAAACCAATATC GGAAGTCTGGGTGCATGAATACATGTGCCCACATATGTACAGACTTAATCTCCATATCTGCCAAAACAGATTTTA ACAGGTAATC GCAA CATT CTAAATTCAGAAAG CAGAGATAAACAGTTTTGTTTC TAAATCAGTG GTATTACTAGA TG AAATGTTT AGTA GAAT ACTG CAC ATATAGTTC AG CAGT ACTTTG ATTATATC CC ATTT AAAAAATC AAAATAA TAAG CATATT CTTCTAACAG CAATGAAT TCTC CCGCTTTTTATTTATTTT GACATA CTGATATT TCTGTAAA CTT GCAAGTGGAAGATAAGCTGTTCAATAAAAGCCTTCTTATATATAGAGTATACAGAAATTATTTTAAAAGTCTGTT TATGTAACAGATTATTTTGGTACTAACAAAAATTAAGATACAAAGAATTGGCTAGAGAGGAAACCATCACTAAAC CAAGACACACAGGG CTTT CC TG CACTTCATTT CAGGAAAAAAATTT CCAAGTAATT CTTACTGTGTTAGAAGAAT AAAGTACGTTTGTCATAGTATACATTATTGTATTC C CTTAAAGCGGG GACTATTTAAAATTTTTAAAT TAAACAA TGTC CAGGCTTACTTCTGTCTGTACATT CATGAATAAT CGTATCAC TGGTTACACACAATTCTCTCCT CATG CAA AAAAAAC CCCTCCAAAAAAACAACAACCAAAAAAAC CT CAGTTCGTTGTTTTCTTAAGTC TAAGTAAG C CAAACA AACTAATAATAGCAATTTAATTAG CAAG CTGTAAAT CAGGGAGGTATAGAAATT CAGCAGTTAAATTATTTC CTG TCTATAGTACTGCTGCTACT C AATTTATTTTCTTC ATGTATT AGAAGAAT TAAT AG GC AT TGATGGT C AAAATAA GAATTT CAAT ATCG CAGC AAATGACAGAAGAGTGAGAG AAAG AGTT CCTAATGTTGTG AC AATCTTAATG AT CCT TTAAAAGGTAAAGGACTGTGTGCGTATGTGTGGAAAGGAGTAGAAAATAAAAGAAGAAGGTTAAGACAGATATTT AAAG GGAATG CCGAGATAGCTC CATTAGAATATTTATTT CAAAAAAACTG CTCT GAAGTCTGCC CAGT GTAC CAA AAACATA
> H e 2 _ 121682895 -1216 91 79 9
AATTATGAAGATGGAGTTGAACATGTAAAAATTAAAGT TAAAATTT CAGGAGAAAAATTT CTTC CCATTAAGAAT CCATTGTGTT CAAC CCAAGCAT CCATTGA CAGATGAGTG GACAAAAAAAATGTGGT CTAT CCAATACA( N )xA A A GAATTCATAGTATTTATGGTATTGTACCAGAAT CTGTTTG CACAGAAACAGTTAGTAG CATTAT TTTTCTATATT TTTTTGTTTGTCTGTTTCTTTTTTGGTTTTTTGAGGTGAAGTTTCGCTCTTT ( N ) xTTGTCTGCATTTCTTAGCC TAG CAGTCAG CTGACCTGTACATACCTATCACAAAC CCAT CCACATTTCTTTAAAATATT CAGAAATC CAAACTT AACC CTGCTTCCCACTGGCCTGTGGCCT TCAGACTGAGGGAGAGTTGACTAAGAAATC CCCACTTCTTAGCCCTT TT CC CAGTATAACTACTT TGTTAAAAGC CTTCTTTT CCTAAAGCCTATAATTAT GTTTGC CTCCAGGGTACC CAG GAATTTTTTCTTTTAATAAACAAAGAGCATCC TACAATTATCATGATGATTTGT CGAGGG CAGG CCAAGTTCAGA CAGGAGGCTGACTCAGCTCTGGAAACTGACACGCACGTGCTGGCGTTTTCGATCCTGGGGGCACCCGACAGCGTT TAGGGGTCAAGAATTAATATTAAGAGCTGGAGAACTAACAGATCCACATAGATGTCCACTTTATACTGCCAGCAA AACGGGACAG CAGTTGGGTGGGAT TTGGCCTTCCTGAGGCTGGCGGCACCGTCTGTGGGCCGTGCCAT CACAAAT GGCTCTAGGTCAACACCTCCAGCCTGCGTGCACTGCAGCATCCGGATGCCGTAACCTGGTGCTGCTGGAGCCACA CG CAGC CAGGGCCG GCATGGGCAGAAGC CGGCGGAGTT GGAG CCTGTCGCTCTGTCAGCCCTGATTTGCGGGCTG AG C C CAGTTTTGTG GCCCTG CCTGTAAT CTCC CCGAGTT C AAAGAGTGCTTAGCTG CTTGTCTTTGT C AAGAGCG CAGTTGG GGATCTTTTATGTGAAAGATGTAGAATT CTCGGGCAGACTTTGATTATT TATT CATC CCCCTTCGTGG GTGTGTGATCGCGCGGGCACTG CGGAGC CCCTTGTCCTGG CTGCTCTTGCTATGAAATTCATTGAGCTTTAAAG C CCTTTG AAAGTAG CTTTTTG AG GG AGGGGGAAAGTTTTTGAAGTCTTGTTTCTCTCTC CC C CTCTGCAGTGC CG C AG CATC TCTTGCCAC CATTC CATG CGCC CCTACCGATTGACATGCGACAC CAGGAAGGAAGGTACCAT TACGAG C CTCATTCTGTCCACGGTGTGCACGGGTAAGTCCTGCCCTCTGCCTGCTGCTCCTGGCGTGCAGTCACCTGCCATG GG GAGG CTGGGCCGGCAG CCTCAG CCACATCT CCTG CCT C TG TCTTTCTT TTGGGG GTTC CTGAT CTACATTGT C CTGAGCGGGC GATCACCTTTGC TATCATGGCC TGGGAC CCTGTGTGAGCATGTG CGTGGG CAGT GTATAAACAC C AACCACCCCCGAGCCCACATCACCACTTATAAGGCTCTGGGCTTCCTTGGTGTATCTATACATGGTTTGGAGCTC TTTCTT CATC CATGAAGTGGGAAT CCTCTCTAGGTCTAAGAT CCCATATAAGTAAGGTGATCTTAGGTATCTGTT GTTC CAGCATAATAATTCAGAG CACCCTTTTT CACTTCTT CAAGTGTCCC CCTTTGAATAGTAAATTGTATAGT C AC CATACTTAAAAGGACAAG CT CAAAGTGATG CTTTGGG GCC CTTC CTATTCCCAG CTTTGAAG TCCC CAG GTA G AAGGTG TGGGGTCACCCTGTGGTCTCCACCTG CCTAAC CCTGTCCCTGTCATACCCACCTGGGGCCTCTAAGCCC ATTGGCTGGTGTCATTTCTTGGCCTTTAGGGCTCAAAAGTCTTGGTCTCTGGCTATCCCATCCGTCCTCCTTCCA GGAAAGAAGGCCCCTCCTTCTCCCTACCCCCAAACTCTCCACTGCTCCCCGCACCTGCCTCCAGGAGAGCTCTTA CAAAGGTCAGCCAGCCCCAAGACCCCTCTCGACCCAGCATCTTCATGCCCACACCCTCCAGCCCCCTCACCTGTT CATGCCAAGGTGCTGTCTGTGGTTCTGTCCCATGTCTCTGTGCCTGGGGCCAGCTGCCTGCCCAGTGTGCCTGGA ATCTCCCCCCAACAGGCCACCTCCAAGCCTCAACCGAGGAGTCACCTCCTCAGGGGCAGTGTGATCCTTCACACT GAGCGGGGGCTGCTGCCCTCAATCCACAGCATCATCATCTGTTTTTGCACCTGCCTCCCAGCCAGCTCTGAGCTC TC CT CT CACCTCTGTGTCTCAATG CTGACATGAATAGGTG TACAAC CAGTGCTC GCTGAGTGGAGAGC CTCT CC C CGCCTGCCGCCACCTATGAAGGCACATCCTCCTTCCTCTACCATCTCCCATCCCAGTGAAGTCAGTCAACACTTG CTGGGCCCTCACTGTGCTCACAGCATTGTGCCTCAGCAGAGTCCTGTTTCCGGACACCCATTGCCTCCCCAAGAC GGAGCAAACCTGGGACTTCTACTTTCTTTTTAAAAACTGTTTTAC (N ) xTTGGGAACTTTTTGTTTTATTTCGGT GTGCATAC CTQGTT CTCACTTGACTAGG AT CCGCTGCCGCAGCC TT ATGT CTGC CTGGTGGGTGTTGGAGTCATT TCCACCCTCTGCTCACCT CCAGGG CCCCTCCACCTGCAGG TGGCTGAGA C CACAGCTGGAGAAAC CT CTGGAAGG TG CATGTG TTTGGAACTAGTTGGG CCA C CCAAAAATTG CC CAGT CAGTGGTGTCTGAGACTCCTAGGAGCATCCA GGGAGCT CAGTC CCTATTTGGGAGGGGGTTGCTGTTG CTGATTT CTTG CT CATGAGTCATGTTTGGCTGGTTCCA TGACACGGATCCTGGGCATAGCAGGCCTGCCTCAAAGTGCTCCCTGCACGGGATGTTGGTCAGGAGACCTGGGCA ATGCTGAGAGCTTTGTGCAGAGACAGTCCATGCTGGAGTGCTTCTGCCTGCAGGTGAATGTCCTGGTTCACCTCT CCCTATACACCTGAAGTGTGTAGAGGCACCACCAGAAGTGTAGGAAGACCCCAACAGCGATGACTGCCTTCCTCA TCATAACTGACCTTGTGAACC(N)xTAGAAATCAAGTTCCAGCCCCCTGAGCTCAGGCCCTCCGCTGCATGCTGG CCTTTCTTGTGG GAGCGTAAGTAT GGAC CATGGGTCCCT C TGACGAGT CCAGCCGAGC CCTTCTTGACAG CACO T CGGCCCTTCGTTAGGGCACTGCCCCGAGTC CAGT CGCT CC CAGAAC( N ) xCTGGCTACAGGCAGAGCCCCCACTG AGAAGGGAGG CC CT CAGTG C CGTC CTTC CCTGTGTAGGTG GGAG CCCTGGGCTGGGAAGGGGCCTGGGTCTTCAC CAGG CTTTGCCATGCTCCAGCTCTGGGTAGGGCCTGCTTC CTTC CCTTGC CAGTGAGGGTGGGGTGTG CG CAGAG CACTTGGAGAAGGGGGCCGTGGATGCTGGCATGCAGCAGGGAGGAGTGGCCCAGCCAGAGGCCCACAGAAATGGC CTTGTCCCTCATGGCCTACACAGCTCCTTCACTGGTTGGATTCCTGAAGAGATTCCTAAGGCTGCTGTA(N)xTT TGGGATCACTGAGCCCCAGTGTGGAGTCTGGCATCAGCCTGTAGAATACAGTAAATCATATTTATAATTTGCATG GGTCTGCAGCAG{ N)xCTGTGTGACATCCCTGCAAGGTAGTTACTTCCGTGAGGGCAAGGAGACCAGGCTCTGGG GTGCAGATCTCCTCCCCAGCCAAGGAGGGGCTAGTCTGGTTAAGCCCCAGTGATGGGCAGCAGCATGGTGCTCCC AGAGCCCCCCGTCC CTACGCGGTC CTGC TGGGGTGGGCTT CCAG CGGCTC CACACTAT CC TCAAGACAGTTGGTT CCCGCCAGACTGAGTGGGGGACACAGCAAGAGCCTCGGGGCTGGCCCTGGGCTCTGGAAAGCCCATGCCCTCACC TCTTCCGCCCCCAGTGTCCCAGGTGAGAACCAGGGCAAAGCTTGTGAGGAAAATGGCCCCGTGGCTCTGGCTTCC ATGATGAATGTGGTTGGAGCCCGGTGGAGAATCCCGATCCATCAAGCCCATGTTGTAGCTCGCTTCTCCCTGCCA GCAC CAAATGTGTCTAATTñCACAGATGTTTGCACAGCAAATGAGGTfiCGGTGT CATCTCATTT CTGTGT CAGCT GAGCCTGAGCCTGAGCCTGCTGCCCTGAGCTGGTGACCATTTCGTGGGTGTGCCCGCCCCTCAGCCCTTGGACCT CCTGCTTTCCACTGCTCAAGCACCCAGACCCCCTCCCCGTTCCCTCTCAGGATCCTGATAGCTCCAGGGAGGGCG CACC CAGTGTGTGCAGGCTC CAGAGAGGTGTTCC CTGCACTCTCTCATTCTGCC CTGC CCTCTG CCCCTCCGGAG TACCTTGGTCATCACCCAAGAGCCTGCCATTGTCAGTTCTCAGCCTGCCAGGCTACCCGCACTGGGACCCATTGC CGAACCCTCCCTAGCTGTGATTAACATGAATTAACTCA( N) xCTTTGGAAACATTCAATGGATATGAGCAACAAT CGTGATCATT GTTGTTAT CCTCACTTGC TGATTTGAGAAGGGG G CCACGG CAGG CTGGGAGACT GGGTGG CAGAG QCCGGCCACAGCATGTCCTTQGGAGAGTTGACTGTCTGGGGACTGTGGCACAGAAGGTGGGGGATGGGGCTGAAG GAGTGCTGTGGTGC GGTGTGGTGGGGGT CTTTCTTTCCCCCAGT TGAGGAATTTGGACTT CCTC CTTTTGGCAAC CAGGAGCAGCGGGGAGTGGTCAGATTTATTTTCAGGCAGCTACAGCAGTAGTTAGGGAGTAGACAGGAGAAGGGC GGGAACTGTGGTGAGAGGAC CTGC CAGAGCTGCCACAGTGGCCCTGGGGAGAGGG GAG CAGGAC CAGG GCTGAC C ACCCGGCTGTAAACAGGCAGCTGCACGTGGGGAAGGCAGGGCATGGAGGTTGGTGTGAGGACAGCCTTGCAGCCC AGCCCAGCTCGATGGCCTCCTCATAGACACCCAAGCCCACTCCTCTGCAGAGCACCCTTCCCGGTCACTTAAGGT TCAAATGCACAG CAGGCTTG CATGGCTTA CTGATGCTCACGTGGTTCCAGGGGAGGAAAGTCAC CACC CACCCGT GGAC TCCTGCTG CAGTGT CTTTGG GCTGTG CAAACCACAAGTGATGGGGAGACAAATGGGGCAGGGGTGTGTGAT TGCTGGTCAAGCAAGGTTCTCTTGAAGCCCCCATGGGGACATCCCTAAGTCTCCAAGAGTCTGTACAAGGGCGAG AAGAGAAGGCCGGCTGTGAATAGGGCCACACACCCCCAGCAGTCTCTCCAACCTTGTGACGCTAACAGGAAAGAG AAGCTCTTTGGATGCCCTCTGGGCCCCCTGCCTCCCAGAAATAGGCTTGCCAGGGTGTCATGGGGAGCAAGGCTG AGGTGTGGCGAGGAAGGGTCCCTAAGTTGGTGCTCCCCATTTCCTCAGTGTATGTGGTCACACAAGCATGAGTCC GGCCTGCCCTGGGCCAGCGTTAACTTGTGTCCTGGAGAGGAATGTAGAAGTCCATGCTCCTGGTTCGGGGGGCTG GGGTGGAC CTTCAG CTCC CTATGT TGCTTTTGAGGAC CTT TCC A( N) xGGGCTTCAGAGAAGACCAATTTCAGCT GCTTCCTGCC CAGAAGGG CGTACAGAAACTGCCC CATG( N) xACAGGCGCCCATAGGCCAGAGTTTGGGGACCAC TGCAGGCGCCGCTGCTTGCTTGACGTCCGGGAAAGCAGCCCCAGAGCTGCTTGTTGGCTTTCTGGACTCATCACT GACAGCGC CT CG CCGCCAAAACACGGGT
> H S 4 _ 82734215 - 82744864
GGG GTTGTACTG CTAATTATTACT GTTT C CAATATATC CCTGAACCAGAGACTACAAAG GTCATATTCTTGAAAC TTGTTTTCAGGCAGATGCAGCCACAATAATTACAGCTCAAAGAGGATAAGAGCAACATATTCCATGACATTTGCC CTTAACTACCTTTCTTATTAGCAATGATACTGCTACAGACAATGACTATATTATACATCCATGTATCTAAATGTG CTGTTTATTCTGAAGTGACAGTGACAATTATAGCAGCTGCCAGGAAGGAATAATGGGTATACTCCAGCATTGGCA GTGGTGTGGTTAGCAGTAATGCCAATTCACAAAAAATCCCACACCCCAAAAGGCACCAGGTAATGCTACTGATGT GATT CTATTG CTAAA(N) xTATGATTATTATTTATTTCTTGATTCTGGCTTAGGGTAAGTGTTACACATAAAAAC GCAATGCATTTCTTCTGGCCATGAGACTATTTTCCACAATGAATTCTAATCCTATGAACTATGACTACTTGGTTT GACCTTGGGTATATGTGACTTTCCATATGAAAATTTCATTATCATACGCTTAAGAATAAAAAATTCCTAGCCTAG AGATGTAATG CTAT CACACG GAGGAACT CACCAGTATT CACTCTACTCAATTAT CTCCATGGCTT CTACACTGCA AGCAAAAAAGTGCACATTTCCATAGCACTTCATATTCATCAGCCCAAACCATCCATATGTTGAAATGCTTCCTTT CAATCCAAAGAAAAGCCCACAAATACACTCCACCCACAGTATTGACTCTACCTAGCCCATACCACATTAATAAAT GACACCTC CT CTAAAATATTATAATGCTAATGGTGAAAAGAGAT GAA CAT TTAGAAAAAG CAATGGATATTTATA TGAGGGAGAG GAATATTTTGTATT TTATTCTGTACCT CTAAGCAGGTAATAAGGTACTTTTTTC CTA C CTGTGAA TGTTCCAGGTGCTGGCATATGAAGACATACCAAGAGGGGACTCTTCCCCTTCCTTTCTTTTCTACCTCAAGCCTT ACTCTGGAGGCCTTTGGAGAAATAATATCATGCCAAAGTTAGGAA(N) xAAAAGTAGATTCATTGTACAGTACTG TTTAGAAGGTTTTATGCTTCATTTATTAAGAAATGCATTGCTTCTATAGAACTAGAACTTATAACAAAACTGAAC TTGAGT CTAT TTCAATTATTAAGAATTATCTTAAGATTTAAAGAAC CATCTGAGGG TATT TATC CAGT GCAACTA CAGTCCAAGTTTTTTCCAAATGCTAATGTTGGCCCAAGTCTACTGATGTACTTTCCTTGATATTGTTGCTCTGTT AGAG CAAAAAGATAAAGTTGGT CT CAAAGCTACCAAGG CT CAGACCATT CTCTGTGAG CG CAG GGATGAAATAAT AATGACGAACACCCACTG{ N ) xCTTAACCCCCACACCATACTGCCTCTCAGGAGAGCAGGATGGAAAAATGCAAC AATAAAAACCAAAG CCGGGCAATAAGGC CTGAAAGGAGAG GACATG GAAT C CAGAACCACAAAT CCTTTTCC CAA ACAC TG CTAGGAGCTCTTTC TT TG CAGAAACA GTGACAGGAGAAGAGGGAGGAAAGGAGGGATGTGGC GGGGGT G AAGG GAGGGGAGAAGAGTATAG CAACTC CCTT CTCT TAACAGTGATGCAACTTC CCTT CC CATAAATTAGTGGC C TGTATTG GCACATATGAAAAAAGATAG GAACT TTGCAATGTG TATCTAACATCCTGTATC CAGCAGAGAGCC CTfi CAGCAAGTCAGCAAAAACAG CACT CTGC CAAAAATGAATGTT CACTGCTT CCAAAAAT CTATTTTAGAAGTAAA C AGTGTTTAAATTTG CTGG AAATGATATT CTTTG CTGTG AC TG CTCC AGAACATGGC ATTCTGTC ATAC CAGAGG A GAACTGAGGAATTT CCTC CCGCTGACACGGCTGACTTTGT CTTCATGCCTTTTCTCAGGG CATGGGAAGAAG TC C CTGTGACCTACCAG CCTGAAAATT CTGC CAAGAGGCTGAGAGTCATGGGAATCTGC CCAACCAG CTTTGTGGAGG TCACTGAACACCATCTAGACATTAAAAGAGGAAGCCAGACACAATATAGTTTCAAAGGACTGTGACAGTAAATGT CTTAAC CACTGATATTAT TACCTTGAGT TGTGATTACCTAGAGACCTATT CATCAACATTGATGAAAAGGTTGAG TCTT CCAATAGCCCATTACATT CT CTATTCTTAGTTAC CT CTTCCTGAAGGAAAATTGTC CATGTGAATGAAGCA TTA CTGATGT CTAGTAAAAATGAAAACTTCAT CCAGTGGG CAACGGACCACATGGG CACATTCTCCGCACACTTA CACACTCACATCATTAGTAACAGGAAAGGGAACCCAAGTCAAAGCCAACATTTGAAAGGAAGAGATAACACAGCC ACCAATAATGCCATTACAGTGGCCAACAGATTTTCTTTAGGAAATCCCAGATATTCTAAAACCAACACTCTTTTA CACGATAGGAATACGTTTTCTAAATACTGTCTTTAAATTATTTTTAAAAATCGTCTTTTAAATGAAATTTTTAAG CT ATTTTTTAAAAGTTGT CATTTATATGGATT AAGATAT CAGAAGTTCACAG AGAG AGAG CAG GG GAAGACACAG AGAGAAACAGAGTC CCCTCCCCCAC CAAAAAGGTAAAATAAACCAT CAAAATGG TAGCTGAAGT TTAAAAATAAG ATTCCAACCTCAGGAACCTATTTCTATGACTACATACAGGGACTCCAGATGGAATTCTAGGTTACCATATGTAGA CAGAAATTACAGCAGGATGAAG CACAGCATTTTGATGC CAAACTG GAAA CACCACTATTTTTTT CATTT CAAGTA ATTT GTGAAACAGACATC CCTAAACTTTAATC CATC CC TGGGATTCATCCTTTC CATGAC CCTCACCC CATCTAT CACACT TACCTGTCTAT CACTC CTATCC TACCT CTAAAATAT GCCCTAGG GGTGGGAG TCA CAAGACCTCG GAGA AG GGGTGAGGATfiT CCTT TTTG CACCTT CTCT GTA CTTT CAAATAATTAAAACATTGGAG GTTCAACAGTGC TAT GT CAGAATGTTCCTGCACAAAAATTGCTGAGT CAAAATAACT G GGT CCAAAAGT CC CG CACCAAGAGATGTAAAA ATATCAAGATACAAAGATATAAACATCAGTAACATAAAACAGGAGGCATGAGACACTATGGAGCAGACTTGAGGA GCAATCTCTGGTGCTCATCAACGCAACACACATCTTCTAGATGTGAGTTTTAGTTAAACAGCATAGGATGCAGAG GT TGTACAG G GAATAAATGAGAAAATAGGAATTAGG CTTATC CTCTAAGAG(N ) xTAATGGGAGGCACTTAATAA AT AATT ATTAATGAATGAGT AAATTCATGTATGTCC CCTAGATGTT CCTAGGTTTT C C AAATTAAGG AAT AAACT GGTCTTTGACTGAAAATAACAACAAGGGTTCTTATAACTCTCTAAAGTGGTGTTTGGAGGCAGGCCTTGGGGATT TGCTACCTCAGGCCAAACAAAACACTTCAATCTCCCCAGCTATCCATTAACACCTTATTTCAGTGTGGAAATTCC CTTCACGAGTAAATGCTGAGTATAAATAGTCTTTTCTCAACTCCTAGGAAAAGGGAAAAAAAGATTTTTCTAATA AC AAAC AAAATG AT ATTAGAAT AATCTC ATACTAAT TG AT AATCAAGTAT AT AG CAAG ATTGTTT CTAAT AACAA AATGATATTAGAATAAT CTCATAC CAATTGATAAT CATGTATATAG CAGGATTGTTTCTAATAACAAACAAAATG AT ATTAGAAT AATCGCACAC C AAT TGAT AATC ATGT GT AT AG C AA (N ) xAGTGTGTGATTGTTGTTAG ATGT TAT AT TAAGTACAAAATATTACGAATC CAGTAGGTAAAACCTCTCATCTTTA C TGGAGAAATCGAGATAGT CTTT GAA AAGAAAGAGACAATATAAGG GACCTCTC CAGGAAAGTGTAAAAGTTTTC CAGA CAGAC CAGAGGGAGAGGAAAG C ATTCCAGAAAAAAGAAAAAGTTTGTACACCAGGCACAGTTATCCAAGATTTTCACGTTCAGGAGCAGTCTGCTCñ AGGC CTTTAG ACTC C AGCTT C C AGGGCAGTAC CAG G AG ATGCGGGTG GAG AG AATGGG AATGACTTCT CAAGTTG AGTT GAGACTGGAGA CAACAGGAATGGTAATGACTT CT CAAGAGGAGTTGGGATTTATTT TGAATAAAATTGAAA TCCCTCAGGTGTTTCTTAAGTACAGGTT ( N ) xGATTTGACATAAACGGATGTGTTTGTTTGTAGTTTATTTGTTT TCCAGAAAGTAACTTTCATAGTGTTTTGAAAAATGGAGTGGATATGGGAAACGTGGTATAAGAAAAGAGTGAGAA GG G AGT TGTT G G AAACTT CT AAAG GAGAGATG AGG AGG AAAG G GAAGAAATG AC AG GAAG ATCC CAGG AATGTTT CACAATGGAATTTACGGGATTTGACAGTTAGTTGGATGTGAGAAGTAAAAGAAAGGGAGAAACCAGGGATAATAT TAAAGTATGG TGAAGCTGGCTGGG CACGGTGTCTCAC(N) xGTCTGTGGACCGTGGAAAAGACCTCTAGTTAGTC AGCAGCATGGATCCTTGGCTCAGAAGATATTTCATTATTTGTGACACAGACTCTAGAATAATTTTCTTAAGATAG AT CT CATTTGAACAAACTAGATTG CCCAGAGATTGCTGATTATGTGAATGGTG CTATC CATATAAAATATTATGG GG GAGT AGCTGT AG ACTAAATAAATACTGTGAAAAT AGTGTAACTTGACATT AATATTTACCAATGTC CCTC TTG ATTTTT CTGC CTATAAAATAGAAAAAATAGTATTTGTGTAAATAACTTTAAACATTTCGTTTTTAAACGTTC TAC CT CATGAATGGATAACATAGTTGTGCAGACTCAAAAGAGG TTGAACAATAATCC CA CTTC CCTG CAAAAAAAAAA AAAAAGATAAATTGATCAGAGTCATCTGATGGTATGGCCACATTTTGGCTCCCTGCTTCCATAACTC(N ) xTTCC ATATTAACACATGAATTCTATC CC T (N ) xGAT CCTT AC CC CTTTCC CTTC CTATTGTTTT CTTT CTT ACAGC ATT TAAACACACTCTAATGCCTCATTGTAAGATACCATATCTCAACCCATAACCCTACCCAACTAC(N ) xTTG CTTTT AGAAGACTCTATTT TACCAT TTTTATTT CTTTATC C CTAC TAAGTC CAT CTTGCCCTG CAATAGGAAG GAAG GAA GAAGGGATAGAATT CATGTGAGTAAGAATTGT CAAAGCAG GGAGACTTTTTATTTC TTTATCCCAGCTAAGT CCC AGTC CCACTACAAT CTGG CTTCTGTGTC CATCA CTT CGAT CT CAATAAAAC CAAAAAG CC TATGGTATTAAGAAA GCCAACGGATACTTTTC(N)xCCTCTTTTTTTTATAGATAATGTTTTATTGACTTGATTTCTCAGACATCACATT CTTCTGGTGTCTTCCAACCTCCGTGGCCTCTCAGTTTCCCTCTTCCCTCCAAAATCCATAGTCTTCTCCCAGCTC TTACATGTTAACTTTCCTCAAGACTCTTGAAAACCCTCCTCCTTTTTGCCTTACACACCATCTC( N } xGTGGTGC TACACTTAAGAAAACCTGTAGCTTGCCTTCACATCATAGCCCACACCTTATAACATACCTGGCATACACCAGGTG TGTAAACAAAGCTTGCTGAATGTAGAGTGAGA( N ) xATAACAAAGATTGTGTGAGCCAAAAAAAAAAAAAAAAGG ATAATCTTGTCTAGAAGC( N) xGGAAATCAATCATATCAATGTCCCATTGAAATGAAAAAGGAAAATTGTTAGCC TGGAGATG GGGTAGAAAGGAGATT CAGAAAAACATATTAACTGGCTT CATA CTTAAATATCTA(N ) xGAGCCACC ACACCTAG CCAAGATCTATAATT C TAAAAATAATACATG GTCTTCATT CT CAGG CAGCATACAACT CAATGCAGA GAACAAGACAAATATAGCATAATT CAGT GAAGATGGAGATGTAATAAAATAGATTTTAAAATTAT CAAAATAATT AAATGAATGGTACTTAGAGCAGTAGGCC CT CACACCAC CTTTAAAG GTAAGTAACT CTGCTACCT
> H s 4 _ l872672 86 -18 72 78 66 1
GGGGTATACAGACCTT GGTTCTACC CTTTATGATCTAAAACAGCTGAAAGTATTGACATAAGGATACTAAAAACT AAATAGAGGCTGGGTGTGGTGGCTC(N)xCTAAATAATTTATTGACATTACAGATATAATTTAATTTTAACTACT AAGAGCACAATCAGAA C A A T T (N ) xAGCACTACACTGTAGAGAAGGCAAGCCTCGCCTCTGTTGAGGCTATTCTG AGAT CT CAGGGTGC CCAGCT CTGAAGTGGAGCAGGGATAAAT CCATGGTT CTGGACATGACATGTATTTCATGTA AGTCTGATTGACAATTAAGGATTTTGTGTT CATTTGATTTAAAAGT CACAGAAAAATC CAAC CTTCAATTGT CTT CACAOAAAGGAATGAC TCTAATTG CACAT CTGACTGGAAAACTGTAAAG CTGGGATATGGCACTCTTGCCGCTGT GTGGACGCGGGCAC CCTGGAGCTG GTTCTC CACTCTGCGATTACCTG G CT CTCTTCGGATTCTACTAAGCGTGGA AGAAGTTACTTAGAGCGAATACTTTACATTTACTAGTAAGATAGCTGCACAGGCTGGGACTTTTCTGACAACAGT AATG ATTAT C AT TAGT TATTTT AATAGT TT AAATAGTG C C TTGG AG AGTAAG AAGATCT ACTGT TAAGGG AAGAA AAAACACAGCTGAGTTGTCTTATTAATGAATT GAGCTG CT CATTAG CTAATTTTTT CCACAGTCTT TAAT GATCT CCTGGTTAATGG CTGCTAGAAATACAGAAG CACTTTAATAAGGGCT CT CAGTCTAAAACATTTG CAGCTGTAA CA TTGACTACTCTGCACAT(N)xATGTGGGAACAGAGAATACCTTTATCTGAAAAGACAAACAGCCTTAATTCATTT GGCTAC CTAACG GGTT TCTAACT CAAGTTACACATAAATTTATTAAAT CGGGTCAACTGATGTAATTAGC CAACT GTTAAAAATTGTGCAGACGGAGAGAGCCGGTGGATCCCGATGGTCATGACTTATGAGAAGGCATACATCACACCT TGCCTCTATGGGGAAGAAAATTCATTTTTCACAGAGAGTGTAAGTCTCCATCCTGAGGTAATAGAGAAGAGAGTT TGTGGGTGTCTTGT TGTTAT TGTTGTTT CTGT CCTTAATCTT CCCACCTTGCAT TTAAAAACTCAATAAAAAAAA AAACAGTGACTATAAGTCACCATGTTCTTTGTTATGGAAGAGACAATGTCTTACAAGCCCTTAAAGTTGAAGTCT CTGATG CC CTCCAG CAGTTGACTG GGGATACCTGGTTACTTG TCAG CT CCTAAC CG GGACACAGAGTAGGT CAGA GAAGTCGACCCGGG(N ) xTCCAACGCGATGATACATACGATGTACTGGTGTGCAACGGGTATCCAGTGGATGGCA GCTATGATTTTTGTGTTTATTATTAATAATGGGAGTACAACTTGGGCAGGTTCATCTCGGTGATATGAAGCTTCG GTTT CCTC TGCCTAAGATTTGTTGACCCTC CTGCCAAC CAGGACTTGAAC CTTG CC CAAATTTC TGTGTATC TTT TTTAAG CTTTGC CAAACTCC CC CCACAC CC CCTACTCC CATC CAGAGG CTGGATCTGC CTTTTACC CCTGTCAAC AATACCGTTGCCCACTCCCCTCTTCCCACCTGCCCCCCCTCCCCAGCCACTGGTTGTATTTCTGTCATGGACAAA ATCAT CAAAATGATGCTGATGC CA CTTG CTTT TTAGAATCAG CGGACA CAGTGACTTC CTTCCGTATCCCTGCCC CACTTATCAGCTGAG CACCT CG GATCTCAGAAATTGCCTCAG CGGGG CACTAGG CTTGACTGTT GTACAATGAAG CACTGAGC TGCCACAGCTGGGG CACCAT TGTACTACGACGGTGTGTTT CC CAGG CTCCCTCCCCAG CTTCTGTCT CCTT CCTGTTGG CTTCGGTC CTGG CGTG CCTCACATATGAGCACTTTGTGTTAGGCTACGCAAAGAAATAAGAAC AGCAAC CG CATCAAAGGAGG TTACAATTTACCTTTCAACACAATCATCAT CTACTC CTAATTTC CC CACTTGGG C GATTAGTATAAAAC TTATTATTTTACTACTTT TTATGTAT CAGTAGAATT CTATTAATAAATGC CAGTAAAG CAT ACAATTCAGTAATACAAACCTTGTTTTATCCTATCTTATGTTGGCTGGCTGGTGTAGATCACACCACTGAACTCC AGCCTG( N ) xAAGTCTCACAGCGATTGTTAATCATAATTTGTAATTATTAATTAGATAACTCCAGAAATATACAT GAACTTG GTGATAATTTACT CTGATAATACAT CATACAAAGATAATATAAATTCTG CCAGAATACT CTGTG G CTT CTTCTGCTTCCCAAATGTCCTACATAAAACTATTAATACTTGTAAAAGACTTCAGTGAAGAATGAATTTCCCCCA GCTGAGCAAGCGTAATTTGGCTATGGCCAGATGAAGGAGCGTTCTGAAGCAATTCTTGGCATGGAAGCAAATGGA AAGGCCTTATGCATAGGCAACCAAGAAAAAAAAATCTAAACAATTTAAGATGCAGTGACTTCCCAGGTGCAGGTG CATCAGTCACATACTCCATCTACCATCGGCATGTCTTGCTTTTGTAACAAGAGTCTCTTTCTATAATACGGGCAG TTGGTCTTGTACTCAG CTGGGCATT CAG CATGGCTCCTGT CT CAGCAAACGGAGT CGGAGGGGT TAGCTCTG GTG TAGGATGC CTGC CTGGGTGG CAAT CCCTATTG GACAGGTGAAAATAGAT CATTATTTTAAAGTCTTGATC CG GC C TTAGAGGTTCCCCTGTGTTTCCTGTTCTGCTAGAGCTGACflGCTAGGACTCCTCAAGCTGTGCACGATTGCTTTC TGCACTG TGTGTGGCT GGAG CCT CTAGGTTAG CAATGTAT CTACTCACT C CAGAACGGTCTTTT CATAAAGAAAA CCTG ( N > xCCTGATATGTTAAATCACTTTAAAT(N)xCTTTCAATGCTTCTCGGTGCTCTTAGGATAAAGACCCA GGTT CTTC CCATGGACTCTG CACC CGATACAG CCTGGC CCGCACTC CCTCTCCAGC CC CTCT CCTACCGCATGC C GCTGCCCCCGGC CTTGTGTC CT CCAGCATGGTGACCTC CT CT CAGTGT CCAGGGTTCC CTAC CC CCACAAG GCCT TTACCATCTCTT CTAGAACT CCGCTCTG CCTT CCAGTTGTGC CTCCACTGTCACTT CATCTGGAAG CTGTTT CTG CCCTTT CAGACCAACTGGGTGAAATTGCTATGTGTGCTTT CTTATG CCATGTGT CACT CCTC CATG CCACTGGC C A C A (N ) xCACTCAGCACGTTTCAATATGACACGTATCTGTGCAATGACGCCGTTCATATGTGCCTCATCTCACAA ACTCGAAGCGTCATCAAGCCAGAGACAATGTTGACTTTGTTCTTCATAGCATTTCCAGGGCAAGCATGGCTCCTA GGATAAAG CAAGTC CC CACTGACACATTTCATTAAAAG CAGAGAAGG GTTAGAAGT TCATCAGC CT TG G CT(N ) x CCAGAGGAGCCAGGTCATCCCCAGTCAGCACCTCCCAGGTCAGGTCCTATTTCTCCAATTGCTCAGCTTTCTACA GCTGCGCCCACTGTGGGCTGCTCACCCTCAGAGCCCAGGACAGGTCCTATTCCTCGAATGGCTCCCCGAATGGCT CAGCTTT C TACAGCTG CGCCTG C CGTGAGC CG CTCACC CT CAGAGC CCTCATGT CTGCAGCTGATTTTTGTTTG C AGTTGGCCCACTTCTGTTCCTTATTCTGCCTGGAACCCTGCAAGTCATTGATCCCTCATTCAACCTGCTCCTGGG TGCT CAGCA CAGGCTCAGTGGGGGGCCTGG CT CCAGT C CAGGTCAC CCTGTGTAAG CACCAACCTTGACACT TCA TAATTAATGTTGTTGCCAGGAAGCTCTGATAAAGCCATTCACAAGCCCAGGTGATGGTGGAGGAAATGTCATTCT TCCTATAAATGCTTCCCACATCTCACTTATTTTCATGACAGGGAGCTTTTTGCTGTCAGGTTTTTGCCTCAGACA GAAATTCATCGCTTTCAGCGCATTATCCCTATATCAACCCTTCTGTCACTGTTGCTTTGCATTGCTGCATTAATA ATAATAAAAAGGAACTCATGTTTATCAAGAAAAAA CTT CAGAGGGATACATT GAAAAAAACATAAAAAAT GCT CA GAGATTGAGATTGCTTCTTCAAAGCTCTGGACAGACGAAAAACACATTGGTTATTTACATATCGACAGGTTCTTC GGTAAGGT CAAGAGTACATCATTT C CAACTAAATGTTCTAAGATATTAACTGTCTTTTTTAC C CATGT GAATAT C ATAGGACCGCCAAGTGAGGCCCAAAGCTATCCGTTTTTAGAAATGTATTTGTAGTGATAGTCCTGGCACAGCTCA AATATATCCTGAACTTGACCTGAATTGTAAATTAATTCTTAACGCTTTTTGCAAAACAATCCCCTGCCTTTGGGA GATGGGAGAATGGGTGCTCCTAAATCACTCGATCATCACGAGATTTGCTGTTTGCAAACCGGGGCAGACTGCCAT CTTTACAGACTACCGAGGAAGAAACCCAGGAAAGGGCTTATTGTTTTTGAGATTTATTTCAATCAAGATCTGAAC CC CTGAGG C CTTTGTTAATTTT CCTCTGACTCTT CATGTATTTCACCC TTGCCTTCTTTTTGATTTCC CATCTGT CTTGCTCTTTTTAGTTTCCTATCGAATTTTGTTTGTTTTGTACACTAATTCTTTCTTCTATATGTGTCTTTGATT TGTC GCCT AAGT CCTGAGCTGTTTCAGG CTGT ATTCTCTATT ACTTTC AT CCTGGGAC AGTTTC CTTTG AGTTG C CCTGATGGGTGAACTCAGAC CACAGAAGGT CAAG CTCAGA CACATCTTGTGGTAGATAAT CTACTGGTTACAATT TTTTTTTTTAATGGTTCCTACTTAGAAACT CAGT CCTC CAGCAC T CATACACATAAACTC TCAGTTACTTTTTCA AC AAAAGT TTTC TTAGTGGTGAAT T CAT TTTACCAAGAAGTATT CAGTATGT CACATAAATTTG CAGATG{ N ) xC CAATAGTACAACATAAAT T CAATAGTACTATAAAT CACAT TT CAACTTAATACTATTCTGATG GGTATAGATCG C CATTATTGATGAGTAGATTATCTAACTCAACACTTCTCATAGGTGTTGTCATGAAATGCTGGTGAGTGAGTCATA GTACAAAA{ N) xCGGCATGTGTGGTCACCTTAGCGACAACCATCCTGAAGTGCGCTACAGATCACTTGAAAAATA ACCACAAAACTGACATGCAAATGAGGCGGGTTCGACGAGATTGCTCATTTACAACCCAGCATGCTTTCCTGAGTG AAGGAAAATTAT TACAAATATTTTTGGGG(N ) xAATATGTTTGGTTTTTAAGTTATAAATATTCCTCGGTAGCTT TATATGAACCTT CTTACAAAGTGC CAGCATTATGGGGG C CAATACCTTAAAGTTAAGAGCAGATGAGCTAGACAA AGTTAAC CGGAATTTAATTTTTAGGAAATT CAAAGTGAATGT CTGTGAAAACTAGGGTTTATAATTTG CACTAGT TCACAAGTTCTCTTTCCTGAATTC C CAG GGAAGC CAGACT CC CACTTACCAC CTTTTGAATCTTGAAACACCCT C TTCCTCCT CATC CAGGA C CAAG GTTTCCTAGAAAGCTCTATCATTTTT CCAGATTAACTTGAAATAGACAATGT C CATCTGAGATCATTTCAATC TTAAAACT CTGG CTTCAAACTAAGGTTCAGGTAATAGT CTTGCACCTGAGTCACT TCTAATT C ATTTGGTCCC CAGT T C TG AT AGGT AAGAACTG GATG GAAG GTGC CCT CTG TTGGGC CAGC CCTATG C GATCTTCCTCTCCCTCAAACTCCCTTCACGTGGTCTCCACCGCCCAGTTCCATGAGCAGAACCCAGGAGCAGAGC ATGGGGTT CTTTAGATATGT CACCAAG GGACGTGCCACTTGGGGTTAG GAAGTGACAAAT CTGTAAGCAG CAGGA ATTTCTTTTCCCTCCAATTTTTCACGCAGTATCTGGAGAAGCAAAGATCTAGATCTGGGCCCAGGGCAAAATAAT ATGT CTAGTTTTGGGAGTTGTGT CCTACCTTTCT CAAAAAAGAAGTGCAAGGATTTGACTGGTC CTAGGCTTCCA GAGG GAATTTACAACAAA CC CAGAGAGAAG CAGGG CCTTG GGG C CTGAAG GAGAATTATCGG CAG CACACAATCT AT AG GAGT AAAG AAAGTGTG AC AC ATG CAT CT AGGT AT AATTTT CTGG CAAATG ATTT CTTT CTGGTATG AAATT GTTGTAAATTAT GAATATTGTTAAAAGGGGGAAT CATAATGCACTTATGCTGGCTCTATT CC CAGAAATAGTATA ATATATAATATATGTTGTGGAAGGGTTCAGAAACACCAGTGTTATATTTCTAGTTATATACTTCCACGTATTATT CTAGTAGT TTCATG CATT TTGATT TTTT TAAC CTACAAGTGGAT CAGAG C CC CTGGGAGGAGGAACCACATTTCA TT CACCACTGGG CAACAAAGAAAAGCCAGACTGAGACT CTGCACATGCA CATO GTCACTG CC CTT CCATAGCTTT CAGTCCGGTCATTTCATAGAGTTGTCTACTATGCAAATAATCATCTACTATAAGACTATACATTGAGGATATGCT AAAGAATATCAG GTAACTGT CTAAAAAGATATATATGCAATTTT TTAGTG CAATGAGTATTC TAAGATTCAGCAG TGTATGCTGTTCA CAATGTT CATGGCAAAAGTTAGAGAGTGTGATAATT C CCTACAGCAC TCATACTAAAGTGC C AGGCATCAAAATAAAAAGGTAAAG{ N ) xAATTCAACCTGTGACAGATGGTGTTGCTCTTCCGTTAGTTGACATCC TGGGAACCATCGCCTTCTCCCATTTATCTTAATATTCTGGTCCAAGATCAAGAAATGGCTGAACAACAGGAAAAT ATGAAAAAAATGGTAGCTAAACTAGGAAAAGAAATTTTAGTAAATAATAATCATCCTGGAAATGAAGGTAAACTG TAAGTTGCTCAAAAGAGAATTAAATATTGCT(N)xAGGTGTGTGAACAATATCTTCA(N)xTATATATATATATA TATAATCAAAGC CAAT AAAC CAAG CAGTGG GAA C CT AAGT AG GGAAAAGGAAATTAAG AATTTT ATTC AAGGTAT TAACAGGAAGGTTAC CAT CAAAAAGTGAATACAAAACACACCTñCTATAATT TTA TTTTG A (N) xTTCTGGGATG ATGGAGCG GTGGTG CATC CCGATTGTG G CAAC CTATGCATGT GT TAGAAT TTATAGAACTGTA CACCAAACACAA AGTCAATGTTAC TGTTGTATTTGT CTAT TATAAATATTATTT GAAAACACAACAAAAG CATACT TGTCAAGTAAC TGAATAAATAAATGAATAGACAAC C CTC CT CCAACCTC TACCAGAAAGAG CAAAGGCAATAAAT GACAAGGTGAA AAACA
> H S 4 _ 157820520 - 15783 14 01
TCACAGAT CCTAAG GGATGACGATATTGCCTTACTGTCCTGCTGTTCTTGTTT CAAGCTT CTTAAACACAGAGTT TCTTCTCATTGTGAATATGTCTTCATGGACCATTCACTTACTAACTCCGATGACTTCTATTCTTAAC(N ) xTTAA (N) xCTGGTAATCTGACTTATCATAATTCATTTATAAACTAAGATCATGACCTGCATCTCTAAATAAATATTATT CAT C TATATTAAAGTAATAAACAGAAAACTTGAAT CAAAGATATGTGAAACT TAAAGT CT CATC CTCC TCATCTA CC CAGTCATCTCTACTTT CAACTAATCC CAAT CTC CAATC TTGATCAACTAAATATCTAATTTTATGAATTCTTA CTAGAATT CACCAT CTCCAGTTTTAATT CTCTCTTTTTCC CGAAATTC CTGAATGTCACACCTAAAAT CCAGGCT CAAAATC CTTAGTTGGTT CC CT GT CCTC TAAAGAGCAATG CCAT GCATTTAATGAACTTT TTAAAAGGTC CACCA CATG CCAGTTACCTTGATACAT GTTGTGTAAACAAGAAT CTC CTAACCTGGGAAT CATGATCAT TTAAA C GAAGG AATGGGCAGCAAAACTAGGACACGTAGAGTAACAGACATAATGGGTGGAGTCAGAGAAGTAATAGGTAAGCCTGT GAGATTAAATGC TG CTGTATTCAGTCTTAAATGATGAACAAGAACGAACTAGTTAAAGAAGAG GATAGGAAG{ N ) xGGAATGAAAGGAAGGAACGAAGCAAGGAT(N)xAAAAAAAGAGAAGAGGGCAGAGAAGTGAGTGTACAAAAGAG AGAATGCAAAAGATGAAACTGAAAAGTGAAGAGTAAGCTTAC CATACT CAGOAAAAGAGC TTGTATTTTCTCTG C TAGATTGATGGTTCTCAAATTGTATCAT( N) xAGTATCCATATGAGGGAATCCACATCTTTTAAATTGACCATGG GTGGCTTTGATATTGAGACACCCACGAATTCTACAAATGCTCCTAGAGATACTGAGGAGTCATGAGAAGTTTCTC AGCAAGGAGCAAGCACATGTAAACGTCAGATTGGCTTTTAACGAAGCTCCCTAGAAAGCATGTGAAGGATTAATA TGAAGAGGGTAAGTCCAAGTCAGGGATACCATCAGGGACACAGAGGGAGACCAGGAAAGAATGACTGACTCAAGT AATGCAGGCAAGGATGGTCACAGAGTACATGACATACAGCAATGGGAAATGAAATAAGAAGTTGGAGAACAACGC CTACCTATTTCTTATATATTGGTAGCGAGAGAGAGAGAGCAGGGCAATGGCCGGGTATCTATTTC(N) xAGGTAT GTAGATTAAGGTATA(N)xCCAAAAACCTAGCAGGCTACACAGAAGATACTTTGCAATCTGACTCCTGACTACTT ATTGAGCTTATTTTTAACCGCTGTCTTCTACATATCTTGTTATAAAATATATTTGTGAT(N)xCGGTGGTTAACA ACCCATTGCATTCTATACCCCAGCCAAACCAACCACATGTGGAACTTAAATATTCATGTGGCACAGATGTTTAGC TTTGGTTCATTTGATTGCTCTGCCTGAAATATTTTTCCTAGCATTAGCACATTAGCACTCAATATTTCTTTCTTT TTTACGTCCCTGTAAAATTCTTGATCTT CAACTATG CGGAGCAAAAAAACAGGGACTTGATAAT TATTTATTGGA T AAAT AAAAT AT CT AG CTTCTG AAAATCGATTGATAG AGC CG CT ACTAAGTTTC AGTTTT TAGAGT AAC ATAAAA GT CATGTCAAATAAATTCCACC CT CATGACACTAATAATGAAGAAAAAGAACAT CAAAAC CAATAAGGACAAAGA GAGAGAGAGAGAGAGA < N) xATAAACTAGATCACATATGTATTACAGTGAAAAGGGCAGTTATTTCAAAGAAACC TTAATTTTGTAGTGACATGAAATATTAAACTTACATACAAATACAGAGAATATGATTATGAAAATAATTTATAGA ACAAATTAAACAATAACATTTTCTAATAAAATGAACAAATTAATAAAATATACATGTATACATTTAAGTCCTAGT GG GAAAACATGCAC TTAATGAAAACAACAAGTAGAAGAC CAAATTAAG CAAAGGTGATGG CACAAAGAGCTGAC C ACAAGGATTTATAATGTTCCAAGACTGTAAAATTAAGTGCTTCTCCCCATCTAGACATATTACAAATATTACACA GTAACAAATAA CACAG CAGATGGTTT CC CCTATGAAAATTAGAAAGAATGATTTAAAATT TAAAA C TACAAAATA ACA CATAGCAAG CT CAGAATAGATA CTAAGAATAAAAGAGAT CCAATAATTAAAGAATGAATTCTGA CTAATGAA ACTAAAATAAGATAAACATAGCAATTCAAGACAGCATACCTGCCAGTGAAATAATCAGAAACAAAAATAAATCCA AGTGTGCAATTAGTGTGTCTTCCCAAATACCAGCCAACAATCCATTTCTGTAAAACGAGTAATTCTGTAGAATCT CTTGATTAATTT CAAG CTTAATTTTAAAATATAGCAAAGTGTAGGAAGA CAAAC CATTAC CTATAGTTTGGCAGG AAGAGGAAAGGAAG CAAGACTC TAGACC CAAAGATCATGATAGAGCTGTACTGG CCTTTAGG C CAATT TTAATCT CACTAGTGTGATTTAATGTTCATAGTGATGCCTGCCTTTGAATGCTTGTTAGTATCCAAAACCAGGTTCCCTGTA AATTTAAAAC CACCACAAAAAATACAATAATTGATTTTGTAAATGAAAGTGGGC CTAAGTGC CATATT CTTGCAA TGATATTACAGCATACAATTTACAGAAAAATTGGATTAAGTTGGGATTGAAATGGGATGAGGGAAATAAGGCAAT AT CAGTGAAT CT TT CATTGGACAGAAAACGGAGTTC CTTATCAACTGACT TGGATTTTCATC CC CTGGTGCCCTA ATTTAACGTGAAAAGT CATGGATAAAAG CTGGAAAAAGGAAAGAGAT CATAATTAAAAG C CTTT TATGGTCAGT C ATTAATAC( N ) xTTATTTCTCTTGTCCATAGAAATTTATCTTTCACATAATAAAAATAGCCTTCATTTCTCTATT GCTCGAAACACTAGTTCCCTCATTTACAAGATCCAAATAAGAATGAATAATCTCTTAAGATTTTTATGTGACATA TCTTAGTAGATACAAAGACATATAAGAGAATTTCACCATCAAGTGTATATTTGTGCTTGTGTATTAGTTGTTCCA ACTAGCTACTCTCACCTGAA( N ) xACAAGCACATGTTAAAAATAAAATTACCTG3AGATCATCTCTTAAAAAGAA AATTATCTTTTGAGCTGATGGAACACTTGCTTAAATTCCTGTATCTTAGCATTACACTGGACACTATGGGATTGG TAAAACCAGATTCTGTGCCTCTTGCACATAAGGGAAATACAAAAATGATCAAGAAGCTGGAGCTGGGATCATCTT GAGATAGGCTTAGATTGAGAAATAAATACTAATTT CAAATATAACCTT CAAG CT CTATTGAATGAAT TAATCATA ATTC CTTAGCAAATAAAAGTTTGG CAAT CTGAGAGCAC CTAC T G { N ) xCATAGTGACCATAATATCACTTTTGTA TTATATGTGCCAATTTATAATCATTTATGTTTATTTTTTTCTCAAAGATGAGATAGTTG(N) xAAATACATTTCT GATCAACATTTTCCAAACTATCAAGCAGTTCAACCTGGAGTAGAAAATATGTCTCTAGTTCTCTCTCCTGTCAAC CATT CAGAATAT CTGCATCTAAT CA CTG CTTCATT CT CAAAGAAACACTATTTG CCCAGATACCAAAGTAAGTCT AT CTAGAGTTTGAAG GGCTCAG CT TTATTTTAATTACATTAGAAAGATTATG CTGTGTTACC CAAGGG CTTACAT CTACACGCAAGACCTACATTCTAGAT CT GAAACAATAAAG CAACATAATTGC CATGTACTATAT CACATTTAATA TACTGTAGATACATCACCTCAAAAATACATCCTAGCTCTGCTTACAAGCCTGTTATGTAAAGTGTCAGTGACCAA AAGCAACAGATGAAACCCAGTATTAAAAGATGCTTCATCAGATACTTGTGTAAGTATACACAGAGATATACATTT ATAATCTATACATATG CATGTT TATG TATATATGTTATAGAT GACATATAA CACACATTT TACATACTAGCACTT CTGAAGTTAT CAGAAATTTTCAAAACAAGTAAGAATAT TTTAAAAACAAGAATATTTCAGAAGT T CGGTGTACC C AAACAATTAAATACAAATCACTTAAAAATACTGCTTGCTCAGGTTTCTTAATGTCATAAATTTTGAATACGACAA ATTTAAGCTTTCTCTGCCCTTTTTTCCATTTCAATTATATCCATTGTTAAATATAGTAAATTCAATTTCTATTCC ATTGTTTAAAAT TCTT GAATGTTTATGG CTCATGGTGAGGAGTGGTTT GG CTTATTCTTT TATGTAGTGTAAAGT AGGAATAATCACTTTCACCCTTTGATTTACTATATTTTTATAATCACTTGTAAAACGTGATACAATGGGATGTAA AACTATAATTTTAAGAGTAGACAGATGAGTAATAGGAGTTGACTTCCTGGTTTTTCACTACATTGTAATTAAGAA TT CCTGGAAGGTAG CT TAGGGGGA CAGT TTTAAAAAAGATTTATGATGTACTTTTAAAAATCTTGAGCACGGACA AAGGTTTATCTTTGAAGTTTTG GGATATAATTACAG CATAAATAGACTTCTGGTTCATATTG CT CTTACAAGAG C ATTTCCTCTTTGCAGACTTAGGTTTCTGGCTACTCTAGTCATTGTTCTGAGGGAGTCACAACAAAAATTAATAAA AGTTAATCATACGTTTCCCTTAATTTCTAGAATAATATCTGATATTCTTTAATTTTACACTCACTTCCAAAGAAA GC CT CACTGGTAATAGAAGGCTGG CTTATGCTGGGATCTCAG CTCTGC CC CT TC CATTTCAC CTTG CATGGATAC ACTGATGAAC TAACAT GGATGCAG C CAAAACATGTTTTATAACAGAAATTAAGT CTTGCAGGAAGAAAATGTAC T TT TGTCTAATAAA CAAATAACAAT CTAC TGTTTATGAAAT CTGTTTATTT TACT CAGAACATGTAAATACTTGGA GC CCAGTATCTTAGAT TAGGAGAATATACCACCACCAT GACT TAAAAATAAAAATGGAC CAAAAAGAGATTACTG ATAGACTTTTAAAGAGATGTTTAAAATGCATGCCTTGTTAAGTGACACCTGTCTTTCACATCTTTGTAATGTAAA AAATAAAACCACTGTTTTTATACATGTTGATTTTTAGCCAGATTTTCTTACATTAGCACCTAATTCTAATCATAA AACACACATTTG CTAACAAATGAG TGAGATTACATAATATAAAATGCATG C CAAAAATGCAAA CAAAAGAAAAT C CAAGGATACAAAGAGTTTACTTTAGCAATC TACACCTG GAGAAAGGGAACATAAAGGTTTAAG GGT TT CATCTCT TAAGATGAATGAGATAAAGATTGTTTCACCTGAATTTGCATCTTTCAATTCACCTCAATGTCTAGAGACATTGAG GGCC CC C CACTCACTAGT CCTCAT GGTT GAAAAGAATCTT CGAGGGAGTTGG GACCTTCAGACT TGTGTGTCAAA T CT CAGT CGGTGGGTGGAGCTCTTGC CAGG CCTCTGGGTGTG GGCATCT CAAGATGGATGTG GTGGTTACCTATñ AG CAGG CATGTG CTAGGCATTCCTGG CTGTGACTGGGG GT CCTTCACCTGGGAT GAAGTG GC CCTAAC CTCTTGG ATTGAGCCTCCATGCTGGCTAGGGCTTGTGTCCTGAAGCAGATGCCCCCACACTCCAGGTCACCAAACATGTCCA GT CCTT CCTACTGT CT CCTCCAGTGTTA TATAAAAG CAGCAG C T T (N) xGCAGCAGCTTTCCTTTATATCTCATT TAACAAAACAAAGCTTAGCTTTAAGGC(N)xGAGATTTATGATGCATTATCACATTATTCTAATTACCTAGAGTA GT TGGTGTCAATATATAATACTTTGT GAAAGGCAATTTACTAGATAATTTA C CT CCTTTATTTATG CT CATTTGT
G(NIxCAATCTAGAAATGCCAAATTAGAATTAGAATATTTATATAAATGATAATAGCTGCCAATTAGTAAAACAG AAGTATGGCATTTTTTCTTATTCTAAGTAATACAAAGAACAAACTTGGCACTAGGAAATTGTCATCACAACATTG GACAAAATAACCTATGTGGCAAAATGTTAGTAGAACTG TGATGATATCTTAC CTCTGCTT CTGCAAACACCCAAT CCTTTCAATGAC CT CT TT CCCATC CAGTAATTCTGTGT TGACTTCTGGCTAC CTAAACCTAC CA TACATTTAATT ATTC CCATCCAAGCCAGTTATCTCTTñATATGACTTGTGCTTGAAATTATAGTTCAATAGACTTATGTAACAATC AGACAAGTGCAAG(N)xCAGACAAGAAATTCCTAAAATAAAGTGAAATTTGCTTTTTATAAAAAGCCATTCCACA AGTTCATCTACCTAAACTTCTCTAAAGTTGGGACAAACAGAATTCTCTCTTAGCATTGCAAATATTATGGCCAGA AAATAAACTTCAGAGGAGAAAGTAAC CAAACAAGTGAG C CAGAAATACAAA CACATTTG GGAGAGTGTAATTTAC AT CTGTATAATTATAAGTA CATGT CACT CAGTGAACTACTAC TATATT TT C C ( N ) xAATAGGAAAGATATAAAAC AT TAGATATAATATGAT C T CATGTATA CACA CATGTAT TT C CA CAAATAT C CA CATA CAAATA CAT TC CAATGTT AAC(N)xTACCACAGCCCTAGAAAACAATTATATTTTGTGGTTTTTAACTTTTTAAAATTCTGTAATGATTGTAG ACT CA CAAAGTTGC
> H s 5 _ 37802351 -378 12 02 7
GTTT CAGACTTCAT TCATATTTTTGTGT CT TTCTGAACACAAATATTTAT CTAGTACTTATT CTTC CATAACCTA TT TCTT CCGAGATG CTGC CTTTATTT CC CCATATTT CATGTC CCACAG CCAATC CC TTGC CTTTTCTACTTCCAC AGCAAGTCTGAAATTCCAGAATTCTAGACCAGCTCAGAGCCCAGAGTCCTTCTTACTGACTGGGAAAGAAGTTTG GACAAC TTTCAGAA CATG CCATGATCAGTCATTCACAT TT CT GTATCTTTGCTCAGTTG G GGTC CC CTATGGAG T AGGGAACTAGCTGAAGTTCTAGCTGTAGCCACATCACTTGGCCTGAACTTCTTCCCAGAGACATCAAAGTTACCA AGGCCACACTACTCCCTTCTTCCTCTAAATTCCAAAAACATTTGTCTGGACCTATGATCCAGGCACTAAATCCTA GGAACCTTTGATTTGTGGTGGCAATACAATGGTTCATAACTAACTAAGGACCTACATAATGGTTTTATGTTTTAA GTTTTAATTTTGAA(N)xGAGAGAGAGAGATTTCACTCATTTTTCCACTAATATT(N)xCTGTGAGAAATACATT TTTGTTGTTTATAAGC CACCTAGTGTAT TTTGCTAGAG TAGC CCAAGGATTAATGCATGT TATT TTCCTCCTCTG GGATCCA(N)xACATATAACAGGTGTATGTTAGTAAATGTTTATTTATGTGAAAAAAGTAAACCCTCATTAAGCC ACTGCAGTAGGATTTCCTATTAATTGAGCCTTCTTTGAACTGTCTTTATAATGCAGTCCATTGTAACAGGATTTT AAT C TAAATATAATG G CAAAATTAAATAAATAGAACACTG CTT(N )xATGCTTCTTTTCTTTGTAAAATGAAACT GAAATG CCAATATT CATATTTCTC CCAATTATGCCTAACCATGCTACCTACT CC CTACTG CT CTGTTTGAACGTT ATGT GAACTTGC TGTTATGGCCAAATAAGT CAGCCAAGATGGGAACAT TAATTT CT TTGAGCAGATGGTTGTAAA G GTTGAC CTGTAAAAAGAGGTGCCAATT TGTTTTGCTGTAGTGTGCTGT CACAGAAAAAGGACT CTGTTAG(N) X CTGTAAGCCAAAGTCCTGACAAATACAGATGCTTTCTGAGCATCTGCTACCTAATTCTTCCATTAGAGATTCATT CCTTTGGCTCTAGAATAGCATTTTCCACATTTTGTCCAGCAGAACACTTCCGTTAGATGCCAACAGGCGCTCCTT GGT CAAATGAGGAG GCAGGTGCTCAG GG CT C T T C A (N) xATCATAAGATCGGTGCATAACTTATTTACATATCTñ CAAAAAAAATCTAAAAGAAAAAGTTCTCATGTCATCAGCCTGACCCACACATAAAGCACTTTGGCTATAGAATCA CAGTTAAGTG AGTGATAAATACCG CCACGG GAGTATTTG GAG CATTGAAGATGT GAGGGACAGTATTT CCAGCAG AGAAAATGGCTGGGGCAAGGGTACAACACTGAGAAATTGCCACAAGCTCAACATGTATTTTGAAGTTCTGTAGGA CATC CACAGTGACTGAAG GGTTGAGTTT TGGTGTAAAAG GAAGATTTCGG GGACAAAAAC CAAACAAGAACTAGA GAAAAGTTTTGT TTGGAACAGATTGT GGAGGTTCTGATTG GCAGTCTGAGTGAT TTGATTGG CAGCAAGATCAGG TAACCCTGTGAGGAGGCCAATGCCGTGGTCTGGTCAAACCTTACTGCAGGACTGGGAAGGACTGGGTGGAAAAAG AAG G CTGGAGGTGGTT GGAATCCCAG CCTC CGAGAACATT CTGCAGAGAAACTTTG GCCATATTTG CT CAAGAAT CTTCCT CGTGGTAT CATTTGAAAG CCATACTTTTTATTT CATGCTAAACATTGT CAGAGGGAGC GGTGGCAGATT CATCCCGGAATGTGAGCCAACGCACCGGGCATGCACTTTAAATGGCATTCCAGACCTCGGGCTGGCAGGCTCCCT CAGATCCTCGCCCTGCACTGGTTCCTGCCTTACCCTGAGTTAGGATGTCAGGAGAGAGGGTCTGTTTTCTTACCT TC CCAG CAGAGAACAT C CAGAATGAGACAC CAACCCTC CCTATTTCAC CT CGTAGAGATATCACTG CACGGGCGT CAGC CT CGGTGCTGTT CTGATGATGCATTTTCTCTT CTTC TCTCCTGT CCTC CTTATGCCTGAT CTTTGGGTA(N ) xATCATTTTGGGCCTCTCTGCTTGCTTTTTTCCCCATCTCTACGGGACTGCTCTCTTCATGGCCCATAATGCAC CT CT TCACAATAGAAG C T (N ) xACTGTGCCATGGCAGGGTCAGCCAGGTGGAGTGAGGGTAGAGGATGGTGGAGA GTGAG G CTGAGGGAG GAGGGTTGGTCAT CT CAGCTCAAG CTCAGGGG GAAATGAGAACA(N) xATTAAAAAATGT TTAAAAAAAAGAAATGTGAACATGCCTGTGTTTAATGCTCACTGGCCCAGTCTGATGTGCCATGTGTGAAAGAAA TGATGTGTGAAAGAGAACTTGGCCATTTGTCGAGTCCTTCTGGCCAAGTGTGCCTTTCCACAAATGGTGGGTTGG GAAGAT CCCCCTGG CCAGAAAGAATT CT GATGTGTGTAATTGTTCCGTTACAGATT CATT CCTTTGGCTCTAGAA
T (N ) xTGTGTGATGTCGTTGCTGCTCCGTGAGAGGGGACTGAAGTATGTGACATTTCTCCCATGTTTAAGATCAA GGAATTCCAGCCTACACAGATGCTTCCTAGGGTTCCACCGGGCAGCTTGGGAAACGCTTGCAGCTTGGGAAACAG CCTT GCAGTCCTACTGAGTCTCCTGTGG GAAGCAAATG TTCCTGGTAGGCCAGTCTGTAG GAGAATGAGATCAGT GCAGACAAG GCC TTGAAAGCATGGGCTCGAGAG GGGCTGCAGAGGCG GTGGGTCTGAAAGGGG GGGGTGGATGTG AAGATCCAAAGTGATCCTTTTCCCAAGGAGCTTGGAGCCTGGCAGAGCAGAGCTCCCTGGTCCCCCACCTCATGG CTGCCTTTCTCCATGGGAGGCCTCTCTCCCCTCTCATTCCTCCCAGTGAAGCTTAGCCGCCCTGCCATCCAGAGC TCTGTCCTCTCCCCATCTCCACCTTCCCAAGCCAAC( N ) xGCAGGATTCCTTGTCCCTCCCGGGAGCTCCATGAC ACCCTTCACTGCACGTCTTCATCGCTGCCCCTCCCCACGTTCTTATGCTCCATTTCTGTCCCTCTAATCTGGGAG CCCCTTGCAGCATTGTCTCTGTATCTTTACATTCTCCCCTGGCAGGAGATAGCTTCTCAGAAATTAATTAAATGA ATGAGACC CAGACC CGGAGCAGGTGAGCAGGGAGTGGGTTAAAATC CCCTACATGTAC CACTTTAAAGAGTTTC C CAAGTTAAAACAATCGATTCCACATGTTCTGAAACCTCATCCTTCTTCACCATTGCTCAGCTCTATGGTTCTCTG GTTT CCTG GAGG CTGAAGTCTT CT CTCCAT CC CT CTCACGGT CTGC CCACCCCGTC CACTTTCCAT CTGACGTG C TCTTCCCTCTGTTGCCATGCCCTTGGGCCACTGATTCATATTGTCGCTTCTCAGAAATCATGAATGAGCATTGGA ATAT CAATGGCATGGC CTTAAGTTTATGCTTC CCTAAGTAGAAAGTAAATGCAAATAT CTTGAGAAAACAAT CTA ACTTGAAGAATGGAAAGAAGGGTTGTTTTTCCA(N) xTTTTTTTTTTAAGGACAAAGTTGGCTTTAATAATTCAT GAAAAAGAGAAAACTAGAAGGAAA CTGGAAGCAGAGGACA CAGAAAAT TTGCTGTTAG CAGTGCAC C (N )xA G C C CACCAC CC TGCC CTTTTGACACTTTGGGT CG CACTGTT CC TC CTG CTT GGAGTGTC TGTCGTCCAT CCCAGC CTG TGTTGACCTGGC CCACGTGACAGATTCCTACT CACTTG CAAGGTCCAG CT CCAATGTAACTTTCA(N) xATCTGT AGCTGTTTCTGATTTGTTTTCCTTCCGGCTCTTTATCCCACCTGCGGCTCTCTCTGAGGCATTTTAGCGCTCTCT CCACACCTCTCCCTGCACGTCTCGCTTTCGGGGTATCACGGACTCCTCCCTGCAGCTGCAGTCCCATCTTCATCC CACTCAGCTGCCGTGG CCAGGGACGTGACAGTGTATTC CT TGTCTTAGATGAGTAAATATGAGTAAATGTGTGCT TGCC CTTCGTTT CC CCAACTGT CTGTTTTCAGGGATGACTTG GCAG CATTTGGTTTTCATTTCATT CAGGATGTT GTGAGGCTGAGACTAAAGATGAGTTAACTATTGCCTGGGATTCCAGCAGCGCTTCCGTTAACAAACGTGTGTCCC AAGAAGAC CCTTGG G G CTTTTT CATTACCTGATGGAAAAAAG CCATAGAC CCT CGG CTAAAACC CACGCCC C CT C CTTGGG TGGACTATGAGTGGAG GCAACAGGAT TCCCCCTC CCAAGGGATT CTGG TGTCAGCTAG TACATCAG CCG TGGC CAACGGAG GAT CTTGTGTTCGAAGGGAAGGTCTTTGTTTTTCAGACGTAG GTTCAGGAAATT CTGATTTT C CACCTCTATCTCGG GAATGGATTGATTCTTTTAAAAGTTCACTTTAA C CTACTT CACT CCACACTCACGGTC CC C CAGAGTGCCTCCTTAACACAGCAGAAAGTTGGCCTGAGATGATCTCATGGGCACAAGCCACATAAGGCGGTCGGG ACCAGTATCCAAGCTCCTCACCCAGTGCTCTTCCACCCTTCTCTGCAGCGCGGATTTCAGAGCTGGAGGCCAACA GCATGG CCTTCCCTCT GGGGAT CG CAGTGAGGGGTGGCGCGCACTCAGGG CTGCAGAC CAGCAAAG CACCAG CC C AAGAAGGACGGGGGAGCAAGGTGCAGGCGCCCCCAGGCTGCTGGGGACTTCCCAGCTCTCAGCACATTCTCCTTT TCTTGGTTAAGGAAGTTTCTAAAGTTGACCACTGGTTAATCGTTTTGTTATTCATGCTGAGCAAGAAACCAGGGT CCAAATACC(N) xTGTGTGTGTGTGTGTGTGTTGTGTGTTTGGGGAGGGGTGCCCTCCCAGTAGGGAGCTGCCCA GTAGTTTCTGCCTCTTATTAAT CT CTAGTACAG CTCGTGACT CACCATGAAACT CACA GGAACACG CAAGATGG C CCAAAATAGTTTCCAGGGTGGTCACCAGGTGTGATGAGAAAGAAGACAAAACAATGAACCCATGGATCCATGGTC ACGTCTACATGGGGAATTCTGCACACGACACCCTCCCCTGGACTGTGAAACAATCATTGGCCCAATAAAGGCTCT GATAAG( N} xGAGGAGCATCTCACTGGGTAAGTAATCCTGGACACACTTTGGGAAAATAGCTCTAAAAAGGGAGA GGAGGTAAGCAG GAGTTTTACCAT CTTATGTT CC CGCT CCAAGTCACTGACTTTGGACTGGAATTTTGGCTTGGA GAAAACGCAGCTGGCCTCAG
> H s 5 _ 38337804 - 383510 03
TTCTAGATATCTGTCCTTCCATATCAATAACTGCCCCATTCTCCCTCCATAAAGCTAGAAGAAATGTCCCGCTTT GGAGGGTATTTGTAAATGCCCTGAATGTCACATCACATCTGCATTACTCACTTAGGATCTATCCAGTCCTTCCTC TGTTCAGCTGTGGGAAAATATAAACTAAAAGGACATGGTTTTTGCTATTCAGTTGCCTATAGTCAAATCTGGAGA ATAAAG TAGGAACACGGGAAACAGGAAATCACTGTGGAATGTTGTCAACAAGTACAAAATAATG TAAAGTAAGAA GGTGGGGAGGCAAAATGG AGGAAG CACCAG AGAATG CATT CTTATCTATC AG AGGCTG CAAGTTGGGCTGAT AT C ATTTCACCAAAA{ N ) xGGAGGCTCATGCATTTTGGGAAAAATAACACGAGCCATGTGCAGACCTCTGCATTTCAC TTCATTGATATGTGTCTTCCTCCTGGACTCCTCAGGCTTCTGAGTTTGCAACACCCCCCACCCAACCCAAGTAGG TCTGATGTGTGTTC CCGCAGAACTAGTGCACGTGATGTTAGG CAACGCAC CTCAGAGC CTGCTG CCACTGTGAG T GCAATTACAACCTTTGATCTTACACAAGCACGGGCTCTGCTCTCACCAGGGTGTCTGCATCTTTTCAAGGTCTTT TACTCTGAGGTTGGCGCAGATAAATCCCTGCAGGAGCAGTTGCACAGCGT3CCTCTCAGCCGGGACATCCCGACC ACGGTGAGTCTTTCCATCCTGGCAGCCCACTCAAAAGCATGTGGGCACTATTCGGGCGGATTGCTGATTTGGTGT
3TGTGCTACTGTACAT3TAATAATAATAACTAATGCTTGTAATAGTCCTTTAGGATTTCCAAATGTCATTCATGT ACATTTCCTCTGAACCATGCCATCTACATCTCTGGAAAACAGGCATCTGACGAAAGAGTCCTGGCGTAAACGCTT CCATCCTGGACTCAACCCCATTTTATCATGCAAGAAAACTGCCACCTTACAGCTGGTTTTTCCCCTTCTTTTCTT CATAAAGTTCAGTT CTGTTTTTTTTAATCC CAATAAAAGC TTTATAAATAAATGTGATACATTT GTATCTTC CCA GCTGCCTGAAGACACAATGCAGAAGAAGAAGATGTAATTTGAAATGTATATTTAAAAGGTCCAAATTGTTTATTT TTTTTTCAAGCCCAGTGCCCCAT CCTTCCACTAACCGATTGTTGGGTCCT CATCTATTTTCTTTTCTGTTTCTCA TCAAGGTCTCTTGAGTGAATTTTTGGCCTTAGGCATCTTGAATTTTCTTTTCTTTCACTCCCCTCCTCCTTCCTA ATTCTGCCACTACT CCAGGCTAGAGCCTTTTGAATTGCTC TACAAGGAGTTAGCAC CAGTGCTCATAACACTATT ATAATG GTG GTGGTG GTGCTAACT CCTGTTTGAT CCGTGC CTTCTGGCAG CAGCAC CTGAGAGAAACCTGTTTGT CAT C CATGATAAGGAACTATCACTT(N)xG CTGGTTCTACCAAAGCTCTCATGACTCAGAAGGAATAGATGCACA TTTCCAGCTTGCTTTTATTTTGTATGACATGCTTTTAAGOTCAAAATAGACTTTTGCTTCGGCACCCAGCATTGG CCGCAGAAGCTGTCAACCCCAGCCTCTGTAGTTTACTCTCTCAAGAAAGGGTGATGTGTTGGGTTGGAACTTTCA ATAAGCTCATTGAACTCACCAAGGAGTCTGATTGTCCCTAGTAGTATCCCCAGCCACCCCAGCCTGGTGTCCTGC AGGGCTCACTGCTTGGCCTGGAACCCAGTCCAGGTATCAAGAGTGATACATGCCTTTGAGTCTAGACAGTGCCTC TGAAATTGATATCGTGCC CATCACAGAACCTGGG CT GAATAT TGGCATCTCT CTGAñGGT CACTGCTGTATATAC CTAAACAA( N ) xGCCATGTGGATGGTAGTTATCATTCCCAATGATTTñTTTGTTAAATATATTCTCTCCATTATC AACAT CTAGG CATTAGAGTATG CTAT CAAATT CTGCATTTGATCAT CTTGC CAATT TTGACAAACTTG CATAGG G GCTACT CTATGCC CAACACAAGACAACACACAGT CC CTGTTTGGAGGCAGCTTTCAGTCTTCTGAAAAC CACAT C CACACAGATAAAACAATTAGGGAGCAAAGTAAGATCAGATGTCGTTATTAACATGCTGCTTAGTAGACAGGGTTT GAAGAGGGATAGGTAGGTGAGTGTGAGAACAAATGGTCAAGAGGTGAAAGTT GGGACTGAGGAC CAAAGCAG GAA CTTTCTGCAAAGGCAGACTCAAGGGAGGGACATCCCAGTTGGAATGGTTTGAGCCCAGGGAATGAATGAGCATGA ATGTGTCCACACATGCCTCTCTCTCTCT CAAT CT C (N ) xTTGGGGGGGTGGGGGGTGGGG GGTGGGGAGAGTTG C AG AATAAAGAGAATGAAAAGAGGAAATC AAGACT AG AC AT AAAG GCAGTGATTCAG C AATGCAATGGG AGGAGT C TG AT ATAGC A GAATGGGT CCTGGATT CTGATG ACTTGTGTTC AG ATTT CTGCATTT C AAC AAAACAGCTGCTTAA CCTTACTAGGTCTCAGTTTCCT CAGTGACATA TATCATAG TGTGTCTTT(N)xGCAACATCTGTGTGAACTAC(N ) xACTATATCATATCATTCTATAGATAAAGATGTGAGTTATTATATCGTATCATTCTGTAAATGAAGA(N)xTAG TATGATTCGTATTG CAAAGATGTGTGTTTG CTGC CCTT GG CTGTG CAT GGCAAAAGGGCAGGTG CTGTA CAGATT TATTTT ACTC ATG C AAGTTTTTTTTTTT TTTT ACTC TT CATGTTG CCTCTGTTTTTTCCTCTCT CCAGGGAC AAC CTGATCATCAGGCAATTTTTGTAAGCATGTTTTAGAAAAGTTAAACCCCTGTCCCAATCCAGGATTCAAGGGATC TCTTTTCAT CAAAATGTTGATT CTCCTG CATCTC C CTCTTGCTGATTAAGGCTCAG TGAAGAGT GCCTAGTTCTA GTGTAAGCACAGACAGCTGAATCTACAAGGTGAATTTTTAATCGTTTAGAGGAAGCCACTTTTATCCCTGCATGT A C TCCT TCAT CTCACATGGCAGAGTAG GGTTAAGGG GAAC CT CT CTGAAGGTTGAAGTAATTTAAGCT CTGGTT C AAGAGTAGGTAAAACAGATTGAAAAGGAGAGAAAATATTTAACATGTGAGCACCTCTGTGGAGGTAGCTCCCCCT CCTTTTACCTCCTCTTTTATCAATCCCTTATTTCCCCTAACAATTACAGATTGTGTTTTCAGTAAGATAGGACCG ATTACCTTAAAAGGTAGATTTGGTTTTCTGGATTGAATGCTAAACCGGGGAGTGGGTTGACAAATGGTTAATCTG ATTTAACTGATAAAATGAC CAAAATACC CTGG CTGATAATGTGGAGTTGGGCTGCATTGTTTGATAAT CTGC CTT AT AGTAGCTTTC AATTTGGCAG CATTTC AAGC CC AC CTTATCTTTTTCTT AAAATAAT AC CCTC CAGATTTC CT C CTGGAAAACAGAATAGT AAAACAAAT CAGGGCAATT TGGTAAAT TT TTTAGAGTTAGAAA TAGAAAAGAATGTTT AGGGTG(N)xTGAGGGGATGGACTAGCAATGGTTCTCAACTGGGGGTACCTTTGTCCCCTCCTGGGCGT(N)xCC TCCAAAGCCAATGCTGCCCCTGTAACAAAACTCTGGAAAATAACTTCCTGTGACCAGGGAATAGAAATCCTTACA ACTCCTCCCTCCACCCTGCTCTGCACAGCCGTCCTCAGTG CATTGGAG GAAGGATG CTTG GAAGATAG CACTTTT CTGTTTCTCTGGTGCCTTCCACTGGACAATGCTAGGTGGTCCCAACTGCCACTAAGGCAACTGAGGCCCAGCCTT GACTTGTGCGGTCAACTGTGAGGTTGAATCCAAGTTTACTAGCTATTAATGAAGGTTGTTTTAGAAGAATCATAG A CTTTCAGTGTTGAATCT TTGGCCCA GAGAA(N ) xATTCATGCCTCCTCTCAGCTGTGCTCATCTGAACTAGTGA ATAGGACAGCCTCTAGCCTTATTTCTGAAGCCAAATGATCTGTATTTCTAGCAGTAGGAGGAGAAAGAACAGCAT CATGTAAGGAGGTAAAGG CAAG GTCTGAAATCAGAGTCAGAGACTT CAGTTCGGAA C AAT T CAA CATGATTTTGT CAATGCAGAGTGAAAAGTATGCAGTAAAGAATAGTCTT( N ) xGAGTCTCATCTTTTCTTGGGCTCTTCTTACATG GT AACC CAGGTTGCTTTTGTAT ATTGTGTC ATGT ATTTG ATAGATATAAGTCTAAC AGGC CAG (N ) xG AGTC ATC CAGGTGACAGACTGTGAATGGAGGGAATGGAAAAGGCAGTTGAACATCTGAAAAACCAGGTGCTCATTTCAGAAC TAGTTAGTGATGCTGTGT CTACGAGT CAGATAAT TAGAAC CGTATAAAGTCATGGTGTGTTTCATGAG G GAAAT C AGACCAACTGTTGC CCACAAGGGCAGTT AC AG AAACTCTC TG { N ) xACTATAATGATGATACCAATTGTAAGATG AAAGTTTATTGTTCTGGCAATCTGTGAAGAAGGCAAGAGTCCTGAAAGTCGCCTGAGCTTTCTCACCGGGCTCAG TGACCAGCCTTGATTTTCTCTTCTATAGATGTGAATTCACTGCTCGCATAAAGCCACCAGCGGTGTTGAGCAGAC AAAGGGGTTTTACATTCCAGAAAGAACCAG G CAAA CAC CAAGTAGG GGGTATTTTG CAGCAAAG CACT CAG C TT C AC C CAAAAAATGGTG GAGGAAGG GCCTAGG G GAGA CTGTTGAGGTCACT CTGGCCTCTAC TCCAAGAAAGCAATT CC CACGACTCTGTTT CT CACTAACTTGTGTAGAAAT GAAGTG CAGGAGAGATAAACATCTTGA CTTGTCTTCAGA CTTCCACAAGGCAACAAAACAAAGCAAAAGACTGTTTCTCTACTTTGAAAAGAGCATGTTCAGCATCTGCCCTAG TTTGATGCCAGGCTGAAGTGGTGATGGTAAGACCTTTAAGGACAGAAATACAAAGATTTATGCTGTGTGGTATTT TACTACAAATTGCAAGGGTTAT CACAAGTTA CAATACATT GAG GATTC CATTTGAGAGGAGAGATGTCAGG G GAG GAAGTGAAGG TGAAATCACTGAATTCAACACGTT TCTGTG GC CCTGAC CACT CAGC CCCC TTGATTTTTACCATT ATGCCACTTTCCGGAGTGTCAGTGCAGGAAAAGAGACAGACGGGAAACAGCCAGCTTCAGGGAAGGTCACCCTCT A C CCCT CAGAATG CACCCAAGCTGAAG GTAGCACTAAT CTTC CATATC CAG TAAAGAGTT CCAGAATTTTTT GAT CTATATATGGAACTACAAATACGATTTTGCTTGGTTTTCAGTGGTAACATTTAAATTTCTAATACGTTAACTTCC ATGGTCGTAAAGTAATAATAATCTAGTTTCCAGTGATTATGGTAAATATTGTTTGTTGTTGATTTAAACATTATT TAATATTTTTTGTACATGTACTGGCCAATT( N ) xAGCCTTCTTTCCCTTTGTTCATGAAACAGAGGAAACACCCA TCACTGGTAATTAATGGTAGCTGTGTTTGGTATTAAGTGATATCTACAATACTGTCTGAATCCATATCACCACCT ATACTTCATCCAGCATCATGACTTCCCAGGAGTAAAGGATCCTAAAGCAAAACTGGGAGAGTGCTGTGTTGGTTT GAAAGTGTTTCTGCTTCCTTGAGGAAAGCCTGGAGTGTTAGTCGGGAGAGGAGTGAGAACAGAGGGTTGGATCCA CCTGTTGCCT CCAGTCT CTTAACCAC CAAACTTGACGTGAAACCTGGAGACAGAAG GTGAAGCT CAGGAGCACAG ATGTGAGGAAGTAGAGCTGGAGAGGGGATCTGAAGGCAGGTCCACATGTGCAGCAAAGGCCCCTTGTGAAGCCAC ATGTAGGGTGGAGTGGAGGTGGGTGGGAGGG AGG GG CAGACTG AAGAC CCAGGCAAAGAGTCAAAGGGGCCG TAA CTTGGG CAC CAAGGTGGAAATG GGATGAGGGAACAGGT CTGAAAGG CATGGTGGGGTGGC TGGAG GGAGGTGG G C AGGAGAGTCAGTCCTGTC CCTGGAGGTG CTAATC CAAGGC CTGGGG CAGGAGTACT GATG CTGTGAGT CAAGATG CAGGGCTCAGTAGTATCGGCTGGCTTCTTGATGAGGTCATCTGTTTACATGAGACATCTTGGCTTGATTTCTTTC TTTAAT CATT CATTAGTT CATTTATT CATT CATCAAGAAG C A T T ( N ) xGACAAAAACACGCTCTTCTAAGGGGTT TTAAAG TGTGAAGTA CGTGATCAGATTG CATATTTAGAAAGATAAGTAGTATAGCACTGTGGAG TCCAGACTGGA GGTACAGGCAGTGCAGCATTGGTGGCAGGGGAATCAGAGCCCATTGTCATATTTATGAGGACTTGGTCTTCAAGG GCTGTAAGGATGAAATGACAGAAACACATT CAAGTGATAT GAAAGAGGTGGAAT CTGCTG CATAGGTAACTGATG AGATGGTGGTGAGGGGTGTAGAGTGAACTCATAGAGGAAAGCATCAAGAATTCTGGCCTTGGGTAAT(N)xAGGC AAGCAGTGAAGCTATGTTTTGGTTTGGTTTTGGCAGTAG(N) xTAGGGGTGCAGGTATGGTAGGGAATTACTGTT TTTAAGGTGTGTGAGTCTTGAGCATAGTAAATACAGAAGGATCTGGTAGATACTGGATCTAGGGGGATCAGGACA ATGAGAGGAGGGGGAACAGATGG G GCAGTT CAGGGCCT GAGAGATGAG CCAAGCTCTGGAA(N) xTATTAAGTAT TCATTAATTATTTGTTGAATGCTTTAAAAAATTGTAGGGATGCCCTTAATTCTTCTTAGAAACAATGGACGTTCA AACATCTGCAAAAGAGCTTGTGAAGAAAGT<N)xTAACAGTTTTTAATAAATACAATGTAAATGTGTTAAAGTAG TG GCACAATTTT CATTAG GGTAAGAT CACCAGACCTGCATGGTCAGGGAAGGTGTCAAAG GGAGGG CTTGGC CTT TGGAGTAGTCAG GGAG CCTAGGTAAG CAGAGAAGAGGAAGGTTC CCACATGC CCAAGACAATATCATGACCACT C AT TT CCTGAG CACATCTT CCATAC TGGCTG(N) xTCCCACCGATCAGACCCAGGCTCCCAATTACTATTTAGTGC TGTAGCTTTGGGGACA CACCACTTTTGCCAAAGCTCAAGG TGGAGAAAGCCCTCTTCC GCTGGGGCTT CCTG CT C AGGGCCCTTTGTGCAGGGGAAGT(N) xAAATTTCAGTCCCTACTTTGTAGGTACAAATCAAGACCTCAGTTTCTT CAGCTGGGGCTT CCTG CT CAGGGC CGTTTGTG CAGGGGAAGT( N ) xTTTTAAGAAAGTCTTCCTGGTAGTTCTAA TG CTGTTTAAAATTTGAGAACTAAAATTC CAGTCCCTACTTTCTAGGTACAAAT CAAGAC CT CAGTGTGCAG CTG CAGC TATTCT CTGAGCAT CTGAAATCTGCAAATGGAAAGAAGGAACTGGGGAATTACAGC CTTAGAAAGAAT CAT TAC CTGCCTT CCAGCAACGTAGTACTTTGATCAAAGAGTT CTGATGTT CACAGTAACATT TG CTAGTGTGAATGT ATCTAGTTCTAGGACCATGGATAGATTTCAATAAAGGGCCATCCTCCAGTCTTGGAAAATATGTCATGGCTATTT TT TT CTTGGCAAAGTTTAAAACTGG G CCAGTC CTAGA CAAAGCAAAACATTCTCTGTG CCAAGCAGTTTCACAG G GGCCTCGAGATATAAACCTCATGGAAATTGTACAATTTGCCCAAACAGATGTACTGATTGTTAGCATCTTTCTTG TTTGGTCATTGACAGGAGGAAGTGATTGGAGATTTGAAAC CAGG CACTGAATAT CGTGTGAG CATAGCAGCT TAC AGCCAGGCTGGCAAAGGGCGGCTGAGCTCTCCTCGGCATGTCACCACTTTGTCCCAAGGTAAAGTAGGTTCAAAT T CAT TAATAGGTGGCAGG CTGCTATC CATG CATCCTTCATTCAG CAAATATCAATAGGGACTTACTAT GTGC CAA GCACTGTGCTAGGCTCTGGGAATTCAGTAGAGAACAAAAGTAAGCACAGCTCCCGCCCTCTCAGAACTTAGTGGG GTGAGTCTAGACATTCCCTCATTAGTGTGTAGTTGCAGAGTGTGATAGATGCTCCAGCACTGTAGGAATGTACAA CACAGGGGTAAGGGAAGAATTCCCTGAAGAGATGACAACTA
> H s 5 _ 154171630 -154 18 35 62
GTGAAGTAGGAGAGTGGT CATTAGGC CAAGCACCGACCTC CAGACCACATGT CTTGGGGAAG CTTATAAAATTCT CTTGGGGCCTAGCAAAGAGAGGACTCCTAGTCAGTTCTCTTCTTCCCAGCACTTTTGTTCTGGCCAGTGGCTTAG AGTCTTTACT CACTGG CTGTGGCCGCTTTTAC TTCACAGACAGAGTTG TTGTTGATAGG C CA GGGGG CATCT CTT CCTT CCCACAGACTAC CTAGACAA CATGGC CCTGTTGCAGGGAATAAT TTTG CCCCTGTGAG CCTCAG CACCGCT GTCCTGAAGGTTGTCTCTGTCACTTAGACTTGGAAGCTGGAGAGATAAGTTGCCTTCCCACCTTCTGGCCTATTT GACCATGCTTCC CACTGAAACC( N ) xTGTGGCTCCTGCCTGCCCTTCTAGTGACAGGGAGTGCCTTCCCTCCTAT ACCATGAGGCACTCACACTCACCTCACTGTTACTTTACTTCCTGCTATAGCCACAGTCCCACAAGCCTCAGCCTA CC CGTAAACTGC CACC CAAGAAGGACATGAAGGAACAGGAGAAAGGAGAAGG GAGTGATAGTAAGGAGAGTC CAA AAA C CAAATCAGATGAAT CAGGGGAGGAAAAGAATGGAGATGAG GATT GCCAGCGAGG CGGG CAGAAGAAGAAAG GTGAGTGACTGGGAGCAGGACTTGGGAGAAGGTGGCAGGGTAGAACTGTGAAATGTCAGGACTCCAGGGGACTGC TGGAGGAGCT TTAGGTGC CACCT C TGAAAAGT CAATGGGGAAACTGATGGAT GTAGGCTGGTGAGATAGCAG TCT TCAACCATCTGTCATTAATCAGTCCCTCCTTTCAAAAAACCTGCTGAAATTTTGGCTTCTCTAATGCTG(N)xTG AATCCCTTCAGCTTGCTGTGGTACACCGGTTCTTCATATCATCACTGATGGCTTCCTATTTCCTTCTTC(N)xAC a c t g a t c t t g g g a g t t a a c g a g g a g t c t c t a a c a c a c t g a c g g c c a c t g a c c a g c t t a t t t t a g a g a t g a a a a a g a g g g c t a a t g t g t g g g a a a g g g a t c t g c c c a a g g t a a c c a g a t t g t g a t t t g g g c c c a a g t c c a g g g c a g a a g a a g a g g a g t t g g g t a g a g c t g g c t a g g a g t g g c c t c t c c c t c a a g c c t t t c t c a t g t a c c c a t g c t g t c t c c c a g g a AACAAACACAAGTGGGTT CC ATTAC AAATAGACATGAAGC CTGAAGTG CCCAGAGAGAAACTGGCT TCACGCCCC ACTCGCCCACCGGAGCCTAGACACATACCTGCCAATCGCGGAGAGATCAAAGGTATGCACTACCCACTATGGAGG GCCTGGACTTGGGAGACACCCTCAGGCCTGACCTGGTACCCCTCTCCCCATAGGGTCTGAGTCTGCCACCTACGT GCCCGTGGCCCCCCCCACCCCAGCCTGGCAACCAGAGATCAAACCGGAGCCTGCCTGGCACGACCAGGATGAGAC AT CGAGTGTGAAGAGT GATGGGGC TGGTGGGG CGCGGG CTTCCTTC CGTGGC CGTGGACGGGGGCGTGGTCGCGG CCGG GGACGCGG CCGGGGTGGCAC TCGAAGTACGTGAGGC CCCTTTGGGCTCCGGGATGTCCAAGGGTGTGCAGC AGGG GTGGTGGG CAGGAATCTCCT CT CCCT CATGGCAC CCGTTTCCCC CATAGC CCATTTTGACTA CCAGTTTGG CTAC CGAAAGTT TGATGGTGTGGAGGGGCCTCGTACGC CCAAGTA CATGAACAACATCAC CTACTACTTTGA CAA TGTCAGCAG CAC CGAG CTTTACAGTGTGGATCAGGAACTG CTCAAAGACTACAT CAAG CG C CAGATGTGAGTGTG CGGAAGCCTT CT TACC CTGGAGAAATGAGTGAGAGGTTAGGGGT TG CTAGAG TGTCATAG CTGGG CAGGCCCTTC TGGTGCTTAG CAGCAGTT T CAGCC TCAGGC TGAAAAGATAGCTTTT CT CATTGTTTATTAAAGAAGTGATAC CAC ATTC TCATTGTTTATTA CAGAAGTGGTACAACATTTAC CTTAAGAAAGTTAGAAACTATACCATTACAAAAAGGA AG GAAAATATTTAGAATCTACCC C CTTCTGTTCTTCCATT TGTAATGACAA CATTGACAGAAT(N)xCTCTCAGT CCAGTAAGGAAGTCATA CAAGAAAATAAATG GTTGCAATAGAGTGTTTATAAGT GTTTTAGGAGCAGAAGAGAAA GTGACCAACAATATTTGGA CTGGAAT TTAAAGGAGGAT GAGGAGTTTGACAGGCATGTCGAG GGGCTTAGGG CAG AGGGAGTGATTTGACC CAAGCAAGGTAGGGTGT CATGAAGGGAT CACATCATTT CTTTTCAGCCCT CAACAC TGT TGGTATATAG GATAAGGATGGGGTAG GTCAGAGGCTGTGTGAGATTGCTGGGGTGTGTAG CAGAACAAGGAAGG C ATAC TCATAAAACTGATG CACCAATC CCTCGG TGATAACTACTT CTTC CCTC CACTTAGT GAATACTACTTCAG C GTGGñCAATTTAGAGCGAGACTTCTTCC TG CGAAGGAAAATG GATGCTGATGGTTT CCTACC CAT C AC CCTTATT GCTTCCTTCCACCGAGTGCAGGCCCTTACCACTGACATTTCACTCATCTTTGCGGTATGTCTTCCTCCCTGGAGC TGGGATGCAGGGGAAAGAAAGGTCCTTAGGGAGCTGGAGTTTGGGCCAAGGCAGAAAGCCCTGGAAGAGTCATCA TCTGATGACTTCTGCTGAAGTCTCAAGGCTTGGCTAGTCTCCTCCTCTCCCTCCTTCCTTCCTTTCTCTCCCTCT CTTGCTC(N)xTTTATATATATATATAGTTTGTGAGAATCAGAATCCAAATAATACCTATAG(N)xGCCTGATCA GATTCA3GTTTTGAAGTTTTTTCCCCTCCTTGTCAAGAATACTTTATCATAATGATGTATTTTCTCTCAGGTGGC ACATAACATTTCATTTGATTGTCTCTCTTGTGCTATAAGCTGCCAG{ N ) xTTGTTTTTTGGTGAGATAAAATACA TCATGGAGTTATTCTGTTTTGTTGTTGTTG( N) xTGTTCTGGTATTTTCTATTAAAGGGTCTTTACGTAGACTTG TGGATCTTTTAT TTTACCTACTGCTTTCTT C CATGC CCATAATC TTAATTCCTAAT GATACCAACATAAT TACTO ATTTTGTTTTATTTCACAGTATGTCTACCGTC( N) xAAGCTCTACATTCTTACAATACCTCCACTACCACTGAAA ACAGTTTTTCCACTTATACAGTATCTTCAAGTCATTTGAAAGAATCTTTGTGTGGTTTCACATTTAAATTGATAT ATATATTAGTTT TATT TTGCTTCCAAGT TTTAGAGATTGCTTTCTCCCTACCCCTATAAT TTTCTAAGTTGTTTT ACATTTGTGTAAAACATTTACGTGGTTCTAGAGTCTCAGCATGGTATAGTCATAGAAGTCTGACTTAAATTCTGT CACTTACACC CTTT CTTCTCTTACAGGCAAACTTTTTAGAAC CTTTCCATTTAAGAAAAAAG CAAATG CCTCTAC ATGTATTTGTATCCTTGGTAAATGGAAATATACAACACATGGCATTTCTTCAATT(N) x TGCTTTTTTGATTAAT ATTTTGTG GGTTTATC CAC CTCAGCACATGTAAAG(N ) xCCTCCTACAAGTTTGAACATCTCACATTATCTACCA GTCGTGGAAGATGGTGTCAATTTTCTTATATTACTGGTAG CATTGGACTCATAC CAAG CT CAGGGGAACC CAGTT GTCTACCATTTCCAAGATCTTTCTGTCCAAGATTTTCTTCTTAATCCCCTCTTCCTTCTCCTCCCCTTCAGGCCC TAAAGGACAG CAAGGT G GTGGAGATCGT TGATGAGAAAGTTCGTAGGAGGGAGGAACCAGAAAAGTGG CCTCTTC CCCCAATAGTGGATTATT CACAGACTGATTTCTC CCAGCTTCT CAACTGCCCTGAATT TGTTCCCCGT CAGCACT ACCAAAAGGAGACAGGTAGGTACCTGCTGGCATGAAGATTGCCCTTGTCCTCTGGGCAACAGTCCTCCTCAGGGC TGGCATCAGGAG GAGGACTGGAGGGATGAGGACTTC CC CTTT CCACC CTTTAGAGT CGGCACCTGGCTCTC CTCG TGCAGTCACCCCAGTG CCAAC CAAAACAGAGGAGGT CAGCAACCTAAAGACACTAC C CAAGGGC CTGT CTGCCAG CCTG CCTGAC CTGGATTCTGAGAACTGGATTGAAGTGAAGAAGAGGCCTCGGCCAT CC CCAG CACGGC CCAAGGT GGGTGAGGCCTTGTCCCTTGCCTTGGTTCTAGCACT CTGAGCTAGGGTGCTTGAAGGGGATAACACAT GGGCACA GCACTCTAGCTCTGAGGGGCCAGCAGGAAATCTGGGTAGTTCAGGATCATAGTGTGAGTTGAAGTTACCAGTAGT GGCTACAGAAAGGTTGATTTGGCTTGTTTCCATTTCCTGTGGGGAACCACCCTTTAGGGCGCCCTACTGTGTGAG TTCCTCGGTGACATTTTT CTTAAGGGAT CATG CAGCATCTGG CAGCGATGTTTC TGGCTTCTTGATTCTTGGCTC TTCTTTTTGCTCCATCTAAATTTCTGGGGCTGCGTTTTTGGTGTCTTTTCCCTGAGTATGTCCCCACTATCATGT ATGTAAG C CC CT TAGGT C CTCAGAAGAAGGAAGGGAGAGGGAGCAGATGCATGG CC CAAATATGGAGT( N } xAAG GAGACATTAGAATATT CTAGAAATAGTTTATATATATAAACA GATGAAGTTCACTGTTAAAGAAGTATTG CAATC TCTAACTAAAATTACTACTTCCTTTTTAAGTG CTGAATTTTT CT CCACATACCTGC TCACTGTCAC CCACTTTGG AGACCTCTGG CCTGGGGCTTGTGAGGAGACTG CAGAAT TGAC CT TGAACTTTGGGT CTATGGAATT CTATACTCC CTTATCTATTTTCCCACAGTGATTACCTCTCATCTCCTTTCTCTTGCCTGCCTGTTCATACACAGAATTCCCCAC AGTTGTGGCTTCAGCCCT CAAAAAGTGT TGGAGATC CAGC CAGAGACATGAGGAGAAAGGTAGTAAGTAGATGAG GGTTTTACAAAGTCCAGGCCTACAAAGTTTTGTTTGAAACAGGGGTCAGACTTAGCTCCCATTCCAGAATTAGAG CCCCTACTCTGACCTGGTCTATCTTAAACCTTAAAAAAAGATAGGTTTAGAGTTTATGTGCTCTTGGCACAGCAG GGAATCTACTTTTGAATAAATACTGTG GGTTTAGA CAAATGCTT CTCTCTCAGCGT CT CATATT CAGT CTATCAA AACAAATAGTTTTTC(N)xGGAGCTTCCAGTTTATTGGGAGATGTTGAATAGATGGATAAGTGGATGAATCAACC TGCT GAACTT CTGTGTGT CCACTCCTAAGCTAGGTG CTATGTAGG(N)xAGTTGTTTGTGCATGACTGTGATATG AGCTGTGAGTGGAGGAACCTGGACTTCTATGAAACATCCTTTCTCATAGTCATAAAGCTGACTCCCAGGATGACG CTTGTGACCTGCTGGTGTTACCAGGGTCATAGTTGAACAGTCTTATGTTGAGAGGCTGGAGCTTCCCTCTCATGG TATAGCTACAGAAT CTGGGCCAAGGATCTCTGGTGATT CCTTTT CCC CTTCGTCTGTC CTATTT TAGAAG TCAGA GGAGTC CAGATTTT CC CACCTGACCTCT CTGC CT CAGCAG CTGC CTTCCCAGCAGCTGATGT CCAAGGAT CAGGA TGAGCAAGAGGAACTGGATTTTCTGTTTGACGAGGAGATGGAGCAGATGGATGGGCGGAAGAACACCTTCACTGC CTGGTCTGAT GAGGAAT CTGACTATGAGATTGATGACAG G GATGTCAACAAGAT CCTCATTGTCAC CCAGACACC ACATTACATGCGCCGGCACCCAGGGGGGGACCGCACAGGCAACCACACCTCGCGTGCCAAGATGAGCGCCGAACT GGCCAAGGTCATTAATGATGGCCTCTTCTACTATGAGCAGGACCTGTGGGCTGAAAAGTTTGAACCTGAGTATTC CCAGAT CAAGGTGAGG CT TGGACATAGCAGTGAGTGTGGAG C CTGGTGTGCCTGTATTGTA CGGAGAAGAGGAAG
C(N )xCTG ACATCTAG CTTGGGCATTAGGAGT GAGGGGTGATGTGTAAACACTG CAAACT CTTCAGGTGAGGCGG TGGTGATTATTATCCTATCTGCGGATCCATAGTTAATATTCTGACAGTTGGCTAGACACTTTTGTGGGGAACAGA AAAAGGGTTA GATTGTTATTTCTAGTCTTAAATATTTAG G CATCTAAAAATTTAGAAG CCTTTTAAGAAG CTTGT GTTGAGTATGGAAAATGGAGAAAAGATAAGTCTTTTGAAAAATTGCTGCTGCCTGGAAATAGTCAAAACATCTCT GTTGTGGGATTT TAGAGT GGGTATGTTTGGTGGTAGGTTGTC CTAGGGTGAGCCTTTGTAATAAGTAG CAGTGTG TTTC CTGGAAGTAAGGAGGAAGGAGAGT TG CC CAGC CACTGTGTGATAAGGCAGTAGCAGAAAGAGAGATGGGGA AACACTGGAGCTTTCAGGGGGGAGTGTGTAGTGAGTTCCTGAGGAGCTACTGTCAGCCTGATCTCAGTCTTACTA CAGG GAGG GCAGGG CAAG CACTGTCTTCTT CTGGGCTAG CACACTTGTGCCAAGGTAACTGGG GTGAG GAATGGT GACTGGGCAG CTGAGAGC CTGGGGACCAGAATTC CACGTATGTCGACGTGGGAG CTGCCCTCTCCAACTTCCTGC CAGTCTTGATTCCTTAAACAGGCTAGAGCAGCCTGCTTACTTACATCTTTCCCCACTCATCTCATGACCTCTGCA GCAAGAAGTCGAGAACTTCAAAAAGGTCAATATGATCAGCCGGGAGCAGTTTGACACACTGACCCCTGAGCCCCC TGTGGATCCCAACCAGGAAGTTCCTCCTGGGCCACCTCGGTTCCAGCAAGGTGAGAAGCAGACACCTGAGATCCT GACATGGGTGAGAGGATCTAGGGCCCTTGGACTGGGGGCTATCCTGGGGTGGATGCCACAGGCCTTTCCCTGCTT CCTGACTCCTCTCTCTGCCTCTGCAGTTCCTACGGATGCCCTGGCCAACAAGTTGTTTGGTGCTCCTGAGCCCTC CACCATCGCCCGCTCTCTACCAACCACTGTCCCAGAGTCACCAAACTACCGCAACACCAGGACCCCTCGCACTCC CCGGACAC CACAGCTCAAAGACT CAAGCCAGACATCAC GG TTTTAC CCAGTGGT GAAAGAAGGACGGACACTGGA TGCCAAGGTGAGGCATTCCTGTCGGGCTGCTCAGAGTCTTGGGTCTACTTCATTGCATTCCAGTGCTTTGCTTCT CTCCCTTGCCTTGTCTGAGCTAAGGAAGTGCTAAATCCTTCACCTGCTCTGTGTTCTGAGGCTGGTGGGCTTACC TAGGATTAGGTAGC CACATCTG CAAAATATACTGGGTG TGGGAGTGGTAACCCCATGTTGAAAGGC CTAAGGAGG ATTGAC CAGGTATGAGTCCTTGGG CAAAT CACTG CATT C
> H s 8 _ l 001452 - 1011206
GTGTGCTGTCAC TT CT GAAGTGAAGGCACAGACTATAT TGAT GTTAATAC CACT CAGCATAAAGATGACATACG C AGAGTATTTTGTCAAAAAGACACAACAGAGAATTTACACATGCACAAATAGAAATAGACTTGCTACTATGTAATT TTAT CTAATAGAGCAT TTTCTGGGGTCTTAAA TGGACTGATTTTGGGATC CAGAGTAAAGTCACGT CTGGTC CAC ATCACAGTTTAGACTCACGGAAGCTGCTTCTTCTCATGCAGATCCCGCCACGCTCAGCACAGTGGCTGTATGAGG CTGGGGGGCTTCACCTCTGTCTCATGCCTGCTCAGTCACCTGCTGAGTCTCCTGCCTGGGCCCCTCAGGGCTTAG CTCTACTCCTCT CTAAGTGGAGAAGTCGACTGGGAGTCTG CTACGGGTTTTTTCCTTT CAATTT CAACATGC CAT GGATTCATGAGAGCAAATGAGCTTAAATGCACCATCTATAAGCCTATTAATTTTTGAATTTGCACACCAGAGGAG TGCCACCTGCCCACATTCTACC CACAATGGGATG CCCCACAAGGATGCAGAGGGAGAAACCAGG CT CAACACGC C TGCCGCCAGCACGCGCCTGCCAACTGCACACTGACGTTGTTCTCCTGTGCCTGGTGTTGCAGCCCCAGAGTGCCT ATTGTGGG CACTGAACTCCCGTGT CATGAGGTGGGCCTGGGC CCGG CCGCCCTT CCTGGGCCTG CCTCTGAC CAC CCCTTCCCCCCATTCTCATGCATGTAGACAGTTCATACATAGCCCAGGGGGTTTTACCTCATTCAGCTGTTTCTT CCTCCTTTCTACAAACTCTGAGGACATACTTTACATGCAAAACCATGCCAAATGCCATGGGGTATTTAAAGCTCA GTAGCCTGTAATCCCTGACTCAAAGGCAGTAAAAAGGAAATAATTTTTGCAGGTAGTTTTAACACATGGCATCAC AGCATACACTAGGAATGGGGAAATTTTGCTGTAAGGCGTGGAAAAGGCAGTGGGGAGAGGAGGGAGTATCATCTT TGGAG GAGTTACTATATTCCAG GCATTGTGTAGGAGGAAT CCTTTGAC( N)xATCATTAGCTTCTGTAGGAACAA ACCC CATAGATGTTGCATTGGC CTAGTAGGGGGG CAGT TTGTTTTTTCTGGTTGGC CTGGTCAT CAGAATAACT C ATCTGGGTAATACAGGGAGGGGTTTTCTAGGGGCTCGGTTACACACACTGGACTTGTTTTTAACATATAAGAGAG GCTGGGAG CATT TCTGAAAAATG G CCTATAGAAACACACG GCAATGACAACCTGTAAG GAGGATAGAACTGG CTG CAATCCATACCTGTAATATTCAAACTCATTATTAATCTTCCCCCAAATGTCTATATAGAGAACCAGGTGATATCC AGGTTAAG CAAATATT CAAGAGAAATGTAAAT GTGACCTT CCTTAATTAACAAATTTG CTAACATTTCTCAT CA C TGCGGAGTTGCTGTAGGATGTTCAGCTGCCCCTTGAATTTACATGTGGAGTAACAAACAAGGAAGCATATGTAAT GTGCTTTGCCGTTTGCAATTTCGCAATGGAAAAGAGGTGCTGTACGTTTTGCTTTGTAATTTTGGGTTATTTAAA AGAAAT CAAAAT( N } xCCTACACAGGCAGTACAAGTGATGAGTTCATACCCTTGGAAGGGAAGGAGCTTGTGCAA GCTTG ATCTCCACAGAGAAGG AAC C AGAAGGA CC AGGTGTGT CCCAGT CAGCTC CGTC ATCCCAGT CAGTTCGC A GTCTCCGAGGCCAGGGGTTTTGAAACCTGGGAAGTGAAATGTGTGCATGCAGCTGAAGAAGAGACAGACGCGTGA GTGATGAAGACTGACACAGAAACACACCAAGCAGAGGC CCGATAACTG CGAGGAATTC CGTGGATGTGTACCGTG TTTGGAGACGGGATAATTCCTGGATTTAATTGCAAAGGACCCAGTCTTTCTGCCTGACATATTGGGTGCTAGCAG AATGTTGC CATAACAACCAAGCAAGTAACTGTGG CAGGTTAG CAAG CAGC CAAAAATCATTATAATATTTAAGAA CCAAAACT TTAATATT TTATTTTCATTACCGT CACTGGTT TT CCTT CCTAGCAGAATGGATTCCTGTGCTAG TTA TGGTAACCTTTTCTGCAGTGCTTTTAGAGCAAGTACTGTTGAGTTTATATTTTCCAAAAGATTAAAAATGATCCC TTTGATGAATTTCACTTTTATTTGGGTCTTTTTGTGTAAAGAAATCTGTGGTTAGCAGTACAACAAATGTGTTAC ACCGGCTTCCTTTTCCCGGTCAGATTCAGCCTTCCGCTGCTGGGTGCTGTTCATCATCATGAGTGCTGAAATGGG AGCCTCTGAGCCTGTGTGCGGCACCCGGCGTTTCTGGGAAGCTTCCTATAGATTTTTCCTCTCCTGGTTTGAGTA CAGGCATGAATGCACGCACACATGTACACGCACGCACACACAGATCCACACGCATGCGCACACACATGTAT(N)x CATCCATGCATAAAAGTCATTCGCAGTCATGTCTATGCTCAAACCCATTAAAACATTTTTGTGTCCCTGTTTCAG TGTC CGTATTAAATGG CTTGGACTTTACTCTTGG CTCCAAGCATCT TAGATTTGTGGT CACCAGTTACACTT CGG CTTTGTAGACGTCTATCATTGTCAACATAAGTTTTTACATAAGTGCTGTCATTAAAGTAGTCTAACTTACTGTAT ATTAGAATATATGATATTTCTG CAAATATTTATATTTT CTATGGCGTT CATTAT TAATATAGAAAT TAAATAAAA CATGCCCAGTAATTTATCATCCCTCTCTTGGGTTGTGTACCTAGGAATGATCAGATAACAATTGACTGCTTAATT TAAATAAAACAATGCACTATTCTCAGCAGGAATTAACATTATTTGGAAGGTTTGATTCATGGTTTTTGCATTGTA TTGGTGTCACGCTGTTGCACTGGACAAGAC CT TAAAAAAACT CTTGGAGG CTGAGATT CAATTACT CTCAAT CAA ATTGTAAGGAAAAACCATAACTTATCTAAAACATTTTTTATTATCAACTGTTTTCATTAATGCTTTGTGTTTGTT TTATAAAAAGCACTAGAACAAAATAATAACAGCCTTCATAGACAGTGGCACACAAAAATAGTTTTTCCTCATGAA GCAATAAT CGCATATAGCAAAT TGAATATACT TC CACAGCAAGGAATGTTGAGG CCACATGGTCAGAAAACAGTT CCATGTTACGACG CACATTTAAGATGTTTATAAG GATCAACTTCTTAAAAATTTTACCAAGAGT GAAATCTATGG AATAGACGTTTTGTAAAGATGGATGCTATCAGTGTGTGTTTTGAGACCACTTTTTCTCAATTGTCCAATTGTGTG ATTATGTGATGACATTCTCTAAACATAAAGCGCTTTTGCTGGTGGTTTCGGCCTGATGAGCTCTATTTTCTTGAA ACAGAAGATGAGTGCATTAAGAACTGTAGCTTCAACTTATCAAACTAGCAAATTGATTTTCTGTTTTCCATTTCA AATCAGAG TCACAGTGATTTGGAG TCATCCGTCC CCTGTAAGTCTC TTCTTCCTGC TCTAAATATTAATGTAATA AGATAT CTAGGT TC CTTACCTATGAATGTGTTAGGACTTTAT CTGGTTTACTGTTAG GATGAGATAAAATATTTT TCTACTTAAAGT CTGT TGTTAT CACTTATAAGAGATTTTACATGTG GACTATTTATTTAATGGC C CTACTAATGT TTT C CAT CAAATAT CTAAAGC CAAGGACGCTCGT CTT CT CA CTGTTG GACTTCCGGGTAGGACG CTGGAAACAG C CGTCGGATAGGCCGTTGGTCACCGTCTTCCAGCTCCTCCACAAAGCAGGTTCCAGAGCTGCATCCCCTGCCATAA TAGCCCGGGAATCGGCAGCCAGCACCGTTGCCATGCTGAGTTTTCACGGCTCATTCCACTGAGGAGGCCTCGCCA GC(N)xGGCCAGCTAAAGAAAAGAGTGGGTTGGATTGTGGTCACCTTGGTGTGCCCTGGGGGGTGTTGAGATGCA GGAAGCTGCTCCTTATGGAGGCAGCTCAGGGAGTTGAGAGCAGAGAGGGCTTGGTAGCTCCCAGGTAGCAATATT TCACCACTGAGTAAGCATTCAGTTGTATGACTTCCTCAGGCAATTCAGTGTAACACTGGAGGGCTTTTAGAGAAA AATGTAATTCTCGTTTTAGGAAAAAATACCCGTTTGTTCCTGGTTATAACTTTTGATCAAACGACCTCAGTTGAG TGTGTTGTGTGGCATAGTTTTACAAGGGCCTGGTCACGTGGGCCTCTCCACTCGTGGCATCTGTGGCCCCTCTAA CGCATCCCATAGTTTCACATAACTGTTCCGTGCATGTGGTGGAGGGTGGAATTCCTCCAGCTCTGTGTGTCCTCT GTCCTGGTTTACCTCAGCCTTCTCCCATTTTTCCCATCTCTTTTTCCTAGTTTGCTTATTTACATGTCTTAATTT TGGAACTTTTTTTCATGTTTTGGAACTGTGTTTCAAAGAGCCATTCTTTAGTCTCTTGTTTGCTGTAGAACTATT AAAGAAAATTTTATTTTTCCCCTAAGCCAGCAGGACTCCAACTAATGACTTCCAAAATGTGTGTAATGAAACACA GCTC CGGTTGAAAGGTGC CAGGACACCG GCACACCCGTGC CAAAATGTGGATTTATACAG GGAT GCGTTAí N) xC CCCTTAATCCTCTACCTGCATTTTCTGGTGAATACAACCCAAATAGGGAGGTGAGAATTTCCTAACGATAAATCA CCTCGAAGAGACAGAGACAGAC CGTGTTCGTTAGTTTTCG TTAACC CCAAGAACTCATGGACCCACCAGAATAAT CCACAC CCGCAGTTTGAGGAAC TA CTGTGAGA TACATATTTTTGTG CATGAAAATGAGTTGTAAAATTATTA CTG CAGAGTTTTA CCTTTGAAACCT GTGAAAC CTTTCAGTGTGAGAATACTTTTTAC CCTAGGA CACACAGTTCC CTT GAGATATGCATCGGTTACTACACGTAGAAGTTAGCTGCCTGTATTAACATAGGTTATTAAGAAACAGGAGTTTTA GCACAGGTAAATTTAATTACCTACATTGAAGTTAGCCATCTGTATTAACAGGTTATTAAGAAACAGGAGTTTTAA CACAGGAAAATTTGAATTACATACAGTTCATGTTAATCCAAACATTCAAATGCTGCACGTATTTTCCATTTTACT AAATCATTATTCACATTAGGGTTATGCACACAAGGGCTAAGTCTTTAATTGATAATTCTTTCCTTCTATGGATAT AAATGGCATTGCCTAAGAATTCAGCTCTTCGTTTGTTGTCTCTCTAACTTACTGGCTGTGCTATCTCTTGATTTC TTAT GT CTAT CTATGGAAGAAGATTAGAAACCTCTATTCTAAA CAC CGA CAAAGTCTT TCTGGAGAAT GTAAGCA GGGACCAGGCAAACAAGAAAAC CT CCGGTGTAATTTTCT CTGAATTAGATTTGTTTGCGAACAG CTTGTGGC CTT TCATCTTTGAGAATTGTGACTATGTTCATTGATACTAACACCAGAGGTTAAAATGTGTTACCAAGAGAACAAAAG GGAGATAATCTTTAGTTCAATGTAGCAAAACAAAGTACTTGTTGAAGCTTTGATGTTTAAGGCTGGAGAAATAAA AAAAAAAGAGGTTGCATTTCTGGCTCTTATTTTCCAGTTCTTAAATCCTTCATTTTACATTAGGACCCAGAGACA TCAAGCTGCATAAACAAGAAAACTTGTTTTGCTTTATTTGTGGCTCTACATGAACCTAAGATTATGGGAAAACAT TCTATCAATAGTCTGCTTCTGGTGAACTTAACAGTTTTGAGAAGAAAAGATATTACCCTGCTCTGCCAAATTCCC CGAGTGAGGATCACGTGGGGTTGCATGCGAGTGTATTCCCACAGGGGGCTTGGCTGGATTGACTAGCGGCGGGTC TGGGGG CG CGGCCGGATTGACTAG CGGCGGGT CTGGGGGTGTG GCCGGAT TGAC TAGCAGTGAGTCTTGATTTCA GCATTTTGCTAAAGACCTTCTTTTGGAGATGTTGCTTTACAAATAGATAATCACTCACTTGGCACTGAGAGAATA AGCCCGAATTTCTGTAGGAACCCAGCACCTGCCACGGCTCCGACGGGCGACCTGACTGCTGCAGGCCATGCTCTG CTGTGAGATGCGGTGTGATATGGGTTCAAAGTTTTGCATCGTCCAGTTCAATCATCAATAGAACTCTAAGGGTGG CTTCTAATACCTGCCCATGCACCCATTCTCAGGTCAGCCTCGAGGGGCCATGCAACAACGTAGCTGGCTAATGTG GTTCAGCAATGGGGAGCACTGTGTTGGCAAAGAGGAGATCAACTTGGAGGGAGCAGGGGGAAGCCTGGCTCATCA CGGCTGGTTAAAATCCAAGTTTCTAGCCATCTACCAAGTAGGTTGTTTAACCTGTTGTCATGTATGTGGTTGGTA AATTAGAGTTGTGGTTGT CACCTC TTAAGACAGATTGGCC TTATATTATATGGACAAG CTA CAAAGAT CAAGTGA TCTTAAAAGTGATATTTGAGGCCCACCCCAGATTATTGGATTAGGAATCAGGAAACACCTTGTGGTGTGACTTCA TGTGAGTT CCTGGACACCTGTG CGTGCT CGTTGCC CTTGATGGCCACACC CACCTGTG CCTACCT CAAAAGACCT CATCAAAACTGGTGTTAGAAGCTTGAGGTGGAATGCA(N ) xTCTGCTTCTGAGAAGACATAAACGGAAGTCTCCA GGGCTCCCCCCATGTGTCCATGTGTCCATGGGTAAACCTCAGGTTCGCCCATCTCTCTGTGTCCAGTGGGAGTCC TAAAGCCAGCAGTGCTGCTCTCTGCTCTGGGGCTCTGTGACAGCTCTCTTTGGGCCCTGGATCCTCTTGGCTTCA CTTATAGGGAGAGGCTCCATCACCAGCTGGGCCTCATCCCCTGCCCACTCACCCCTCGGGGTCCCAGCATACTCC ATCCCCGAGGGCCTCCTGGGACCCCTGCATCCCCTGGGTATGGACTCTCCCTGCCTTCTCCGTGGAGAAGGCATG AGGGGTGGCATATTCCTTAGGAACACAGCTCTGGAGCCGCACTCCTGAGCCCGAAGGCTTGGGCTCCTCCCTCAG
(N)xCAGAGCACGTGGTGTGTGGCGAGTGTGGGC(N)xTGAATGCTTGTGGGTAGATATGCTGTGGTCAGCCTCC TTGGACCAAGGCGCCCTCTTTGCTGTGGGAAGGCTGAGGGGCGTGAGGCAGGTGAGGGAATTCCACGTCTGTGCA GGCGCAGTGGTGACCGCAGGAACCACCAGGACAGACAGAGACACGCCTCTTGGGAAGGCAGCCAGGCAGTGGGAC CGACCATATGGGGCAGTGCAGAGAGTCGGGGGCGGTCGCGATGCAAATCGCATTCTGTGTGGCCCCGGGGGCATG GATCGGGGGTATTCACAG CTGGTGGTTTTGAAGTATGAGTGGGCTC CAGAGGGC CAGAAAAATG GTAGCTGGGAC GTCTTGACAGCAAGCAAGTGGTCGCTGGGATGGCTGGGTGAACTGGAAGTGGAATCTCAGGACCTGAGTCTGGGG CTTCAAGTTCCTCCACAGGGTCAGATGCGAACTGTTCCCAGGGTCAGATGTGAACCACGGAGTGGGTGCTTGGAA AAG C CT CAGGG CGTGCGTGGC(N) xGAAACCAAGGTGGTGAGTGCTTGTGGATAGATCCCCTCCCAAGGCGGGCT CCCTCTGGAGGCCCTGAGGGCGTTGGTCCCTCCGCTGAGTGCTTGTGGATAGATACCCTCCCAAGGCGGGCTCCC TCTGGAGG CC CTGAGGGC GTTGGT CCCT CCGCTGTGTCCTTG CTGTGTCCTTGGTGAT CCAGGC CCATGGAAGCG CGTCACCGACCCCTGCCCCTGCTT{ N) xATCCCAGTGGCCCTACATTTTGGGGGATGATGTTCATCCCGTAACTC CTGGTGATGGTGGTGTCCTCACTGAATGACTGCCCCACATTTCAGGGCATCCTGTGGGCTGCCTTTTCCTCAGAG AGGG GC CTGCGGGTTCCACATG CTGACCTCCCTGTG G GCCTGGGAGGAGG CAG GAGGC CACAGC CAGG GCAG CCT GGGATGCTGTTGAGGAAGAGGAGGTGGGCTCTGCCCCCTCACAGTCAAGGGATCTTAGGGAAGGGAGACTTACTT GCATTCTGAGTGTGTCTCCCAGCCTTCAATGTGTTGATATTTCGATGTTGTGTAATAAACACAGGCATTGGTGAA CCATGGATA CTTCT CTATGTTG GG CAAC CATGGCAGTTCC CC CACGTGGG CAGAGGAACGAATT CCCTGATG CAC CCTACACCTTTCCAGCATGAAGAACCCATCCACCAGGTCTGTTTTCCAGGAAGATTCTCATCTGATTTGGTAGAC CTGGCGCTGTCCTTTGCCAGCCAGCCTTGTTCGTGGTGAGCCATTCTGTGGAATCGTCG
> H s 8 _ 61370490 -61382562
AAGAAGTGTGTATCTTTAAATAGTCTATGGGTTGCAAGGCAGTGAGATATCATGTGTAGACAGAAACAAGACTTA
actggaagaggtatggaattcaccaagaaatcctgaactctaatcctgaataattattccataaaagctcttgín ) xTCTTTTGAATGCAGTGGCAGAACATAAATTTGGGATATCCTGTCTAAGGAAGAGGTGTATAATGAATAAATTG ATATTTATATCAATTGTATCAGTTAGATCATGTTAGTGGGGAATTTTTAAACACATAAGGAAAAAAACAAGGTGT CTGTTTGTGCAAGGCAAAGAGGGTAGTCTTGTAGTTTGCGCAATTCAAACTCATTTACCCAGCTCTGGGAATGCC CTATCTGTACCCCTAATCCCATTGCCATCCACATGGCCAGCAGGACTCCTAGCAGCCGTTTTTGTAGTGTGATCC CTCCCCCAAACTATGTTTGATTAGATCAAGAGTGCACATGTGTCCCAAGCTGAGCTAAAACACCCCTTCTCAGAA GAAACTGGAAATGCAGTTTATCTGTATCTATATCTATATCTATATCTATATCTATATCTATATCTATATCTATAT CT AAAT CTATCT CACTTC AATGTGGCTCTCTC CTGAATGC AG GAAACAAAACCC CAGGACTATGG CCATGTTTTC CCATACAGAGAATGAGAAGCAACGGAAATCTGTGTGAGAGAGAGGAAAGAGCAAGACCAGCAGGGGCCAACACAG AG CGAT CAGCCGTGGATTTC CACTCCCTGCTT CCAG CTTGAG CATGGCTTGCATTTCTGCACTTGGATTTCCAGA
gacactcccttacctttaa(N)xaagagatccatcccaggatttgcgatgttccttttcttcttatagataacct CTGTTTTGAGACTTTTCTGCCCACAGGAATGGTTTGGTGATTGTAAACACAGATGAACTGAAGAGACTGAATGAG CACATGGATCATTAATTAGATTATTTCTCATTCTGGCCATGTCTATATCAATCTATTTCTGTCAGGGAATCATAA TTTCCTTACATAGTCAGCCGCTGTAAAAGCAGATCTCACTTTTCCCCAAAACAGTTTTCCTCAGTGCTGGCATGA GAAAGGGGTGAGATTTTGGCGGTGGGGAATAGATAGGTTTGGTCCCACACAGCCCTGCGCTTAGTTCAACTCTTC AGCAATGTCCCAAGCAAGGTTGCCGGTCTTCAGGGTGGGTTTTCAGTAGCTGCACTTCTCAGGAAGTGTGGGCAC AGGAGGGAGCAGTTCAAGTTCATTCTGGGAAGGCTGAAGATGCAATCTGTGAGCAGCATGGAAGACAGAAACGCA
gagcctttgcaaggctatgcacagagcaggaaaaagcagaggtctccaaacacatgaggagcagagagatcccct TATTGCCTGCCTTGGAAGGTGTTTCGTTTTAGAAGGCTGGGAAGCAAAAGGAATCATCTGACCTGGAAAATCGCA GGGAGCCCAAGTGAGGAAGAGTCCAATGCACAGCATCTATGGAGAAGCACAAGGTAAACTGGGAAGGTGCAGATC CTTAGGGAGGACTCAGTCTCTCAGCAGTTTGTCCCAAGGAGCATTCTAGGGGAAATTTAGTCACACTTGGAGGGT GT(N)xTGGCTCAAGAACCAATTTAAACAAGAAAAATGTAAGGAATTCCCTTTCTCCTGCCCAAAAGAAGTCAGC ATCTATTGACTGCTCACCATGGCAAAAAACCTGTTTGATCAAGTAATGAGGTCAACATTTGTATAGAATCCATGT
G(N)xTCTTCTTAAATTATTTTAATGTAGCAATTATGTATTTAAGGAACTAATAAAGCTTAGTTGTTGAAAAAAA GAGGTTGTTGTGTTTTATTTTCCATAATTTTGATTAGGTGAACTTGAGGTGCTGATGAAGAGTAGCTGTTGAAGT GTTACATAACAACTTCTCCTTAGTTCAAAGTGAAAATGAACTCAATAAATAGGAGTCAACGTTAATCTAGGGAAT TAAAATTCAATTTGTTTTTAAACATTGCCTATAATTATTATAATACTGAAATTTACATAGAAATATACATGAACC
ttaaattatttccatccacttacctagtaaatactgacttaagaaattgctatgctgaattcagtcaattatctc TAGGTCATCTAGTGAGAGCTTAATTATATTTGTGGGGCTTTGACTTTTCAGCACAAACACCATACTCATTTTCCC ACATACATTATACCAAGAACATATGGTACAGTGTGTGATTCACTAGCAGAGAGCTGCTTGCAATTGAGATCTGTC
aaatgaggacttctcagacagctatgaggattctttgactcctgttgttaaaacacattttgtagatttaaaata TATTCATAGAACCAAGTCTTCTGAGTTTTCATCTTCTAGGACTAGCTTTCGCTGTGTGCCAAGTA(N)xGCTGCT CATTTCAGTAAACCTTCTTGCGTCCCCTAAGTTCTCAATGGAATAACACCCGTCCCATCTGTTCCTCTGTCTATC TCCTCTGTAAGTCCCCTCCTATGTTGCCATGGCAACAGTACCTGCTACAACATCATAACGAGTTTACTATATTAT ACTTCTGTTAGCAAATGCAATACTTGGCTTTCAAAACCATGGCTTTTTATCCTTTTTATCCTTCTTTAAGAGATA GTTGGGAAAGG(N)xGGTAGTTGCCTGGTGAGGCAGCTAGCATTCTCAAGGCTCATCTTCCCACTGTTTCCTTAG CCCTTGTTCCCACACTTAGGAAGCCAAGAGCTGTAAGCCACTACCATCCCATCTTTCACCCTAGCAGACTCCATC TCCCAGCAAAACATCCAGAAGTGCTTAGGATAGGAAACCACTGGCAGGAAGACTCACACTAGTGGTTCCCTCAAA
ctacactgtgcctccaattcctccaggctgcacatcaacaatacatccaacttcatttttctcag(n)xacattc AACAGAGTTTGAGACACAACGGTCTAGATTTTCTAATTCAGGTGCCCCAGCATCTAATAGT'TAGACAGTGTTTTT CAATGCATAGGACTTTGCTGGGGAAACTGA{K)xACACCAGAGAGACCAATCAGGGCTGGATCCATATCCCAGCT CTTCCACCTAAGTAGCCATAAGCATATATATATATTTTTTTTGACCTGGTGATAATATCTACCTTATGAAGGGAT TGTGCACAGTAAACAGAAGCCATGCATTTTTTACTGTTTCTGAATAAGGCATGATACAGCTAGAGGAGATGTTTA AGAGTAAGACAAAAAATACCACAATCTGGCCATCATTTCTGAGTCATGATGGAGTCTTTCTAAGGACAGACAACC AGGGGTAACTTTTTCCAACCCAAAAGTCATGCTATAAAACTCCAACACAGTTTCGATAGACAGAACAGAGGGCCA CAGAAATAAAGATACAATTTTGATCATTTCAAAAATCTATTGATCCATGCATATTTCTGTATTTCATCAAAATCT GGACACTATAGAATGTGTCACATACACTTAGTTCCAGAAAGGAGGAGAGCCACTGCCTTGGAGAATGGTGCTTGG ACAAGCTTTCTTCCTCTCTGGGTACATTCCACCAGCAGAATATTTCCTGGACTTAAAAGAAGAACACTTTG (N)x TAGGATGAACACTTAACACTTCTCTCCACCTGAGATTAGTCAAGTTAAAAATCCATTTTAAGAATGCTATTATA( N ¡xTGCTATTATAATGACACATTTATAATTGAGGATGAGAATTACATCCCACAATTTGCACTGGCTAATTCAGAG ACTTTGGGATGCAAAATCAAGGTTATAGTCCAGCTTATCTGTATCTGAAATTAGGGGAGAAGTAAAGCATAATAT TCTGCTTCATGTTTCCAATTTCAGGGTGACAGGGTCCTGTTCAGTTGAAGGCTGCTGTGCATTTCTAAATGGCCC TCAGAAAATAAACAACTCACAGAAAAGAACTCCCTGAAAGTGGCAGTGGTGACTTTAAATGCCCATGACCCTTCA CACTGATCAGCCTTCTCAACTTGAGGTGTGTGGAGACAGGGATCCTGGTATTG(N)xACTGGTTTTTGAGGGTCA CCAAGGTTAAGAGGAGATCGACTCTATTACTTAGGGAGGTGACAGTCACTCCTTTTCTGAGGTTTCCTGGTTTGG ATTGGTTTGTTCAAGTGCATTGAGGGGGAAAGAGCTTTATTCTGGGGTGGTGAAGGCCAGCTCAGTCCAATGGCT ATTCATTACACCTGCGTGTTTTAGCCATTTGGCTTCTCATTATTGTATTTTTGTTTTCCATAAGTCATCATGAAC ACACTGAGTGGCATTACCAAGTGCAAAATGTGTGCTGGGCATGATGAAAGAAAACTATATAAATTTCACTTTTTA GCCTTCAAGCCTTTGCCATTTGACAATT(N)xTGTTAGCAACAACTGTGGA(N )xTGTGTGTATGTGTGTGTGTG AAAGAGAGACAGAGGAGAGAGAGTTGGGGAGGTAGCAAGAAAGATAATGCCATAATCAATTGAGGAACTTAAAAA CATACACACCTAGC CCCT CC CTC CAGAGACTCTAATCTAC CTGGAGTGGTATGACATT CGGATACATT CTGCCAA AT CATAATTATCAC CTGCAAGTTGAAAACACCTñTTATTTTACC CCAAAGTATAATGAG(N) xCACTGTGGGGGA AGGTGGCAGCTTTGTTCTAGGCCTAAAGGAAAAAAACACGTATTTTCATGCCTCGGTCACAGGGCAAAAAGTAGG TGTGAGTTAAGGTTAACAAAATAGGCCAAGAGTTGTT CAT TCAT CCTAGT CCTTAAA CTAAAAGG GTGAAAAATA ATCCTGGAGGTTTATGTGTGCTCTATCCCAAGTCACCAAGGTACAAGAGACGGCAACAAAAGCCCTCATTGATGA TAAACACGTTTG CTGGGATTGTCCTCTGTG CTCCGCGAAT CCAT CAGGGTGCTT G GGGTGATGAGAGTTGACCGG TGAAGGTTACAGGGGCCCAGGGTTCTAACAATTATGCATTTCAGACACATTAGAAGTCAAGAAGACACCAAGCA( N)xATGACACAAAGCAGCAGAATCAGCTGCAAATGACTGCAGGCCTTGCCAGCCCCTGTGAGTGCCACAAATCAA AG GT CATT CT CC CT CAGOATAGAGAGAAGAGAGGACTG CAGGGC TATOAGTCAC CTCGAC CTGAAATAATGACAC TGTCGGTCAGCATTTTCT CTTGATACCTGAGGAG CCCT CT CTAAAATCATGACTGAGGGC TGAG CATAGAGGTTT TTGTAATG GA GG CAACAG CAAACGTTAGGTTGCAAGGG CTACCTGGTTAT CCAT CAATGAGTTG CAGATGAAGAA T A ( N) xTAATAACAGGAGCAGATTCTGGCTCAGGGCTGGGCCGTGAGTCACACGAGGGCCAGCGTGGGTGACGTC AAAG CCTCTGCTTTGTTT CACTACACCTGCTGAAGGACAC CTGGTGTA TCACAG TGGAGAAGAA CAGG CTGTGGA GG CTGAGCAATC CAACAG GGAGAAAAATA CACACATGCACACACACGT GTGTATTCAGATTAGATCATA CACTCA AACAAATC TTGACC CCCACT CAGGACACT GACAGGTTTAC CGAAGGAGG CAGAAGTGAAAATAAACAATCAAAAC AAAG CATC TTGCAGGCTATGACGGAAAT CAGGT CAATAAAAATGA(N) xTGGTTCTCATTTGGAGGTTGCACTGC ACCTTTCCCG CAAGGGGGTATCAATGCATTGCATGATTTTGATCGTTAGATGACAAGGAGGTGTTGCTGG CATGT CC CATGGC CACTG(N)xTCCCAGGAGAAAGAACCAGGCGCTAGTTAATAATGTCAAATGCTC(K)xTAGTGTCAA CTGCTAAGGAGAGGGGGCTGGGACCACCCACCCCCCCCAGCCCCAAGGAAGTCTGCCATAGCTCCCCTATTTCCC ATAACATAGACCTTAGCATGCTGTTTTCTGAGGGCT( K ) xTACCAAATAGATGACTGAGCTGCAAACAGGCAATT ATTCATGATCTCTGTCAGAGCATTTAGAGTGGAACAGTGAGAACAAGTCTAATCACAATTAGCTGAAGTGTAAAT GAGAAATGAGAAAAGGACGAAATCAGGCATATAACTTTCTTCAAAGAAGGGTAAGTAATTACAAGGTACTCTAGG GTTTAGTTTGAGAGAGAGACAGAGTTGGCAGGAGTTGAG(N)xGTAAAGTTAATTCTTCACTACACCCAAGGTAC TATGTTGATAAAGAAGAAGGATAGAAAGCTTGAAAGGAATACAGTTTTAAATAGTAGTATATAGGAAGCAAAAAG GTGGTCAATGACAAAGAGCACTGCATAATAATGGAAGGTGGCCAGTTGAGATGATTTGCCTTGGTCAAGAAAGAG TACTGATATGACTG CTGT TCAGAC TAGGTG CTT CTCATAACACAGCCAAAAAGCTGTG CATTGCATTTTCATAGA CCATGAAATAAATGACTT TTTCTG GAGTAGTACCACACAG CATTGCTGGTGCCTTAGG TG GCTC CCA CAACAGAA TACTACCTATTT CATGAGTATGCCTGCT CC CTAAGTACAACTAATCATAGGAAG CATGATTTACGTTTGC CTACT GTAAATGTCAAGTTTGGCCAGTTCGGATGGACTCTCCTAAATCATACCAACAGAGGCTGAGTCAAATCAATAAAC ATTT CTTAAGAACCTACCAAGAGTAATC CATTATGCTAGG CTCTGATTA CAAAGGTGAAAATTATAGAATAGCTA CC CT CAAC TA GTTCACTAGTGAG GAAGATA GCA CATGAACAACTTGGGAT TTTATAAA GAAAGCA CAACATTTAA CTTCTTTTAAAAAACTTACTATGGAATGAGGCATAGGTGATGAGATTCAGGAAGACATTTCAAAGATTTATTCAC CACCA
> H s8 _ 134 828201 -134 8 414 76
GGAGATGG GTGTATAACACGATGG CAGGACAGTAATGGAG GTGTGGTGGAAGG GATGAAG GGGTTTT C CTACCAA CAAGGCTGTCCAGTGCCTTTATTTGTCAGTCAAAGCCTTTCTCCAGCTAATGTCAATTTGCATTTCCATGTATCT TAC CTTGT C CAAAC TTTATC CTACATGACC CAGC CAGG C CTTCAGTAG CAGATGTTACACATAAAGTT CCTTTTG CC CAAAGAGT CT CCTGTGTC CTCTAAGTAG CAAATCCT CC CCACAGCCTGCACAATGG CTATCACCTATCTTGCT TC CT CTGACGTG CCTGGACAGTCAATCC TT CTGC CCTGTGGGCTTCTTTGTTACTTGGTTTACT CGT CTGTTGAA GCTACCTGTGCACAAAGGTGGAGCCTCACTCATTGCTCAATCAGTAGCTCCCAGCGTAGAGCTCAGGGTTGGCCA GT CATTC CACAGA C CACTGG G AAT GAATGATTGTTAGAAGGAAT TAGTGAGTGAGCAT CT CAT CGTGC CACTGAT CTAG CTCT CAATGGACTCAT CAAT GAAAGGACTG CACAGAGACTGGTGGATGAGATTAGTGGCT CTCACT G GGGA TGCCCACCAGTGGAACCAGAGGGAATTTGGGGAAGTCTGGGCTCACTTAGAATTAAAAGAATCAGCCTCTCTGGG TATTTTTAGGGGACAGTGGGATGC CACC CT CTGTGTTAG GAATCAACGTCTGGG CAGG TC TCTCGAAGTC CTTT C ATTTTGTACATCTñATGAñTTCTATTTTGTTCTGTGGGTTTCAG CCTGAAAAAGTGGTGCTCTGATA C CAG GATG CATTGAGCCTTCCCTGAGCGGCTGCCCCAATTAGCCTGAGCAGTGACATGCAAATCCCCGAGATGTTTTGCATTT TT TCTGGCATGGTC CTTACCACAG CCTGCTCTGTGTTGTCTTTGTTTGTGTTAAGAGCTGAATCCCAGCTGTTTA GGAGTCT CAACAGC CAGGGG CTGATGTACT CATTTCTGTTTCTC CTGCAGACCTAGTTAGTAAC TGACTCACAAT GTGT CCTCTGTT CACTTTTTGTTTGCTTTTTTTCTCATTGAAATGAAATTGAATTCAT CTGGGTAGTG CC CTCCT ACCCTTAGTTACAATTTTGACCTTTAGAAGTATAATCAGCTGTGTACAAATTTTCAAATTAACAAATTAACCGTT TATCTGGAAAGGTTTTACAG CCCAGAAG CT CCAGGTAAATGCTACACTGTTA( N) xATGCAACCTTATTTTTTAT GGAAGTGGG CACAG CACT CCCAGTAGCCTG CAGT CACC GC TGGG CAGT GG GGCACAAG CCAGGGGAAGTAATCC C ATTCTCTGAGCACCATGCTGGGTGCTTCCCCACCATCAGCCTGAAAGGAGGGTGCTAATGGCTCTCCTCTCATAG ATGAAGAATTTGAGGCTCGGGGATGAAGGTCACTCCTGCTTAACATGTGTTCTGGTGATGAGAAAGCCCCCTCTT TGTAGAGAATCCCTCCCCCTTGGCCTCATCCTCAAGGCCATTGACTTCCAAGTAGCTACCTGGTCCTATTGAGAG AGGGCTGTCTTTGGATTG CTAAGG GTCC TGGGATGGTGAGAGAAGTACACACTTGGAAAAGGAT CTACAGACACC TCCTGAAAATGAGCCAGAAAGAGCCCTCCATGCAGTCGGAGAAGGATGAATCCATCATGGAAGAAGCTAGCTCAG TTTT CCTT CAGTTT CAGCTC CTGACAGT CC CTGCATTCAG TGCTGCACTCTAGñTTTACC CAGAGGCT GT GGTT C TCCCTCACACCCAACTCTTTCTCCATCTCAGACTTTGCTCACCCTGTTTCCTCTTCCTGGAAAGGCCTCCTGATA TCCTTGGCTCTTTCTGGCTGGTACCTCTTCACAC( N) xTGGGGGGCGGGGGGCAGGGGGCGGGGGGTGCAGGGAG AGAGTACTAA{ N ) xTTAAAAGAAGATACTCATTAAGGTCCACTCACACCTTATCTGGTGACCCCATCCCAGGTGA GA3CTGGT3AGGGTCTCCTCCTCA3AGGGCCCAGGAATCTGTGGTTTCCCCT3TCCCAGCACAGAGCAGGATGGA G(N)xGTTTACTTTTCAGCTCAGTAATTCACCATGTGGOTCAGTTTCTCCATTTATGAGCTTGTCTCTAGGGCCC TGGACCTTGTGAGTCTTCATGAGGACAGGATGTGGGGTGGGCCAGTTGTCACGTGGGGCTCCTGCCACCAGGCAA GGTC CTGATATAACTACGGTTATAGTTGCG GTGTGGGG CAAG GGCT CC CCAACT C CAGATGAG CAGATTTG GGT C AATTTCTTAATATTTTTTAAGGGCTGTGATGTATCTTGGCTATAAATGGTCTTTTCTGCACCTGGGGCATTTTCT TTTAGAAATACTGCTTTGGAAGTCAGATTAAGATGCGTTTAAATGGAACATTTAATGATTGTTTTAflAACAATCC AGGT CATAGCATACAAA CACACACACACTGATTCTGTG CTCTGTATCCTACGGCCTAGAACCAAGACTTGCTGAG TGAGTCCTCACCCTCCAGTTTAGAGGAAACAGGTGAGGACGTGGCCACCACAAAGCAAGAGGACAGGGCAGGGCA GAACAGAATGGAGGGCGCCCTGTCCCACTGCACTGCACTCCGGGGCTCCTACAACCCATTCAAATATGAACAGTA AGTCGCTCACCAAGGGCTGGATCAGCCACTGTTCCCAGCTCCCACCCAAACATTCCTCTACAAGTATTTCCTCTC AGAGAAAAAATCAAAAA CAC CTGTAACTGGATGAGG GGAAATGAGTGTAGATTT CAGACTAAAC CAATTCAGAC C CTAT CATCTT CCCAGA CTCT CCAATCTGGCAAAGCC CTTTGAGAGTAT TTCTGTGAAACC CACT TTTCTTAATñT TTATTGTA CC CAGATCTCCC CTTTGGAATACCATCAGACCTG CAGTTT CTTT CCTTGGGGAAAGGCAGGACGCAC GCAC CCAG CñATCAGACACATCCT CAGGGATG CCTGGATCTC CTGAAACCTAGTGGGT CCTGAATGGAGCTACCT CCAGATGAGAAGTACAGGCCAAGG CT CTGGGAAGGAAGGGGC CCGGCCACTCACACAT CTAAGT CAGCGGGAGG C TTAGACATGAAGCCAGGAAGCCTGGTGCCTATCTCATCAGAGAGAGAAAGATCAGTTAATTCACCAACAACACAA CCATTGATATTTAATGTGGTGTCCAGGTTCAGGGCCAATTGTATATATA(N)xGTATTTTTAAAATAGTCTTTTT T(N)xGACCCAGGGCCAATTGTTTGAAGTGGTA(N)xAGAAGAGACACTAGGAGTTGAGGTCCTGTGGGGAGGAC TGGCTGGGTTTCTATATGACAACCTGTTTTAGTGTCTATCCATTTTTAATGGTGAGATGACCCATATGCAGGGGC TCCAAGTTAGCTGGAGAGGACAGGGCGAGTATCTGGTGGTTCTTTCTACTTTAGAGGCAAGGGCTGGCCAAGTCT GC CTTTTT CCTCCTTACCGAGTC C TTGGCACATTT CTT CTGAAAACAACGGGGATTTTTGGTTGTTTAATGTATT A G T (N) xTTCAAGCTGTAGGACTGTGGGATCTTCTCCCCCCACAAAGCTATACTTCTTTTGTGCTTGACTCATCT GAATTCAAAATCTTGAATATCCTTATTCTCATCCCAGACTCAAGGAAAGAGATCCTCATGATACTCTTCCCTTTG ATGTAGG GTCTGGG GTGGAGTTGATGGGGTGATTGG CTG GGCTATTGGTGGTGG GTTATTGACGGGGCAATTGG C AAGGCCAGGCATCCCATAGGCAGCCTTGAGGGGATGTATGTGTTTTATGTGTTATCCTTCCTGACCCAGGTTCAG CCTGAACC TT G GCTGAGAGCAAG G CAGAGTTG GAGCACAAGG CGGTGC CCAGGAG GACAC TGGAAGGCTGGAGC C CACCCCAAAGATAGGCTTTGGAGAGCCCAGAGAGCACCATGTTCCAGGGTCCTCCCAAACGGTCTTCAGATTACC AAGGACTGTTACGTGGGAGAAAGTATCTGCACTCCATGAGCTTCCAGTGCGCTGGACAAAAACCAGTGTTCCTGA GTCATAGCTTGTGGACAACAGGAGGGGGCTGTGGCATCATGAGCCACTGGAAAAGGGACTCGGGTCTGGCTAGTG TGCCAGTCCCGACAGGTGATTCAGATCCCAATTAGGAATTTAAGTATAATCCTGGCCATGTCAACTGGGAAGCCT TGGCTTG G GCTGGGAAGGAG GGG G CTGTATTCATGC CTTAAG CTCACTTCTG CTATGTAGACAC CAGGCTGAGGG CATG GAGACACTAGGG TTCTGCCTGT CTGTGGTAG CAACTGACTTATGAAAT GGGCATGG CTGATCCTTCTTGGG CAGGGCTC CTGGGT CTACCATTGAGTGAGCTGAAGGAAGTTAGGGTTAATTT CTTTTTTGTCACTCTAAG TATCT TCCCATTTCCCTAAGCTCTGGGCTTTGGGGTTAAACTACTGGGGTTAAGTAATAAGAAAGGAGTAAATTCATGAG TCATTCTCATGTCTTT CTGTTGTC CT CAGCTC TATTAGTCCC CAGTAAGTCCTC CTTAATATTC CCAGCATTATT GAGG CAGT TTAATGGCT CCTATAT TTAAAAGGGACAGTGAACTTGC CTTACAAT CATT CTTACAAGATCCTTTAT GAATAGCAACAATGTT CACATTAC GATTTCACATT C CG CTTTAATAAGAATG GAATTG CCATGGGCGATT CTGAA TGAG CTGGTCTTGG CTTCTAATTG CTGTAGGT CTTCTGGCAG CATC CTGCTCTATTTC CTGGCT CTTCACTAGAT TAAATTAACTTTCCTCACAGTTCAGCCCTTAACCCTTCTTTGGGCTAGCAATCTGGAGAGACAGATCAAAAGATT ACAGAATTTTTGGCACTCAAGGGTCCCCCTGACCCCAACAAAGTGCAGAAATCTTCAAAATGGCAGAACAGACCT ATGGATGGTCCAATTTCAACTTGAATCTACCAAGGATAGGTCCCCTGTTTTAGCCCCAGATGGTTCTTTTGCTGA GTAG CTC CAGAGAACT CCC CTTAACATT{ N) xAACAGGTGCTTAACAAGCTATTAATTATATTTGTGAGTAATAG TAACGCCCTGGCAGTTACTAAAGACAGACTTCTTCAAGATCTATTCCAAACCACTTCAAACTCACCTTTACAAGA AAGGCTAGAAAGCTCAAACTAAAACAAAATTAAAGAAAGCTGAGATTCCCTGTTCTCTCCTGCATCCAGAGTTCT GTGTGTGTCTCAGCTTCTGC CAGAGATTGGAG CAGCAGTGAGAAAGTGGAGGTGATGG CCTCGATGGGATAACTG GATCTTT C TG GCGGGAGTGGTGT C TG CATTTC CTGGAGTAGT CCCTGTAGAGTTT CTAGCATCT GGTTCC CAAG C AC CACAGAAG CTAAGT GGAT CCCCAG TATGGGATTCTC CACT GGAGTCTCAG CTATGG TTAAGCATCTTT CCTG C CTA(N)xCTTCTTTTAAAATCTGTAGCCTAAAACCATGATCAACGTAATTGCTTGTAACAATTCCTCCACTTCTA AGAG CTGT CTAAAATG CTTTTCTAAG( N} xTAAAACGCTATTTGGAGCAGCACAGACCACACCTTTGTGCAGAGA GCCAGCAAC(N) xGATGACATGGATTTTCATATCACCCCTTCATAATCCTAACTAACTTGGAGTGATTTTAAGGT GATTAGAAAGGGAATGTAAAATG GCATTGGTG CCTCTT CTGCACCCAGTGGTGCTCGCTGGGTG CTGAAACAAA 3 G CAATGTTGTG CAAATGTTAATTCTCTGAGACTAGATGAAGG CTTCACTGTCAAG GACAAATTAGGTTCACTATG AG CACAGAGAATGG CTACCTTCATGGAACAAA TAGTAT CTTCTATG CA TGTAGAGGAAGGACTC CTGCCATTCCT TC CCACCC TTTCAC CTTCTT CCCACCATC CAACCAACT CCCTTATATAGGAGGACAGTTG CTAATATAAT GGAT C TGGTCTTGTGACTTGGATTGGACCACCTTTTACCAGTGCCTGGACAGGGACTTTCTGTTCAGGGGCCTTGGTGAG
A(N)XGATTGCAACAGTATGGGTCATGAATTCATGAGAAACGACAGGGGTGCTGAATACCATTCATGTTTGTAGA A CTACAAGTAGATT CAGATGACTGAGAGTGTGGTGTATTATAGAAGGAGACG GC CAGAAGTTGGAGCAGATCATA TG GAGCTC TGAGTAGCTTTAAGGCAC CCAGGT GTTCAATCC CTGTTAC CCAG CCAAAC CC CATATGTAAC TAAAA CTACATGGAAGTCCTACTTCAACTTCTCCTTACATTGCCCTAGCTTCTCTCTACCTTCCTTCCTCTCCAGTCTCT GATTCCCTTTTGATGCCTTTTCCCTTCAAATACACTTAGCATGGTGATTTTAATGGGCTTTCACTCAGACCCAAC TATCAAAGGTAGATGAACCCTGTAGACCTATATCCTACTCCCATTCCAGAATGTCACACAAACGAAACACCTACT TTAAC CAGAGTCCATG CCTGTAGC TTTCCTAT CATGATAACTTTTGATG GT CTGAGCACC CAGTGAGTTAGTAGT CATCAGCCTCTTTAATAGGTTCGGGAGGCCCAGAGAGACTTGGCAAAATGGCCAGGGTTGCTCAAAGTCTTAGTA GCAGAG CC CAAAAT CTTGACAGTTGTGACC CTAATGATGGTG CT CC TACTGGGTTGG G GGTTTATTTTAT CTTC C TCATCCTTGGCCTTAGC CAAAT GCTC CTGAAATCTATGGCAATGTC CTGCTGAACTTGGC CC CACAGAATAAGT C AGGTCTGGTCCTGGGCTTGGTTAT CTTTCCTCTTGTGG CC CTGAGC CCCCTCATGTTGCTGGAGAGCT CTAAAGA ACAGCATGAAAAAC CGAGGTTG CTGTTAGTG GGTGGGAAT TTAATG GGTTTGGTTGGTTGGTAGATGATAGCAGA AG CCTTGCTGGAGGAT CTTGCTTCTG C T T T < N ) xACACAGCCTGGOTTAGCTCTCATTGCTCATGGGTCTTTAGT CCAAGAAATGGAAAGC C ( N ) xGTCCTGCAG GAAGGCCACTTC CTGCTTTCCTGGGTCCACCC CTGAGCTT CAGC C ATGCGGTTTCTGAG CTGATGAGGG CCAGAGGC CAACATGAAT C (N ) xCAGGATTTGGAGGTGGTGGCAGGCATGC CACAAG CAGC CAGCTCTAGCCAGTGGTAGAAT CAGCTCAT CAGCTGTGGAGGAAAGAATGACTTTATT CC CAATG GC CTTGAATCAGTTTTTC CAGAAGTAAGCT CTAGGTCT CCTTTCAT CCTCAAAC CATATTTCAT GGACAGATATC ATGATT C TTA TT TTA TA TA C A A ( N ) xTATGTTTCCTCTCTGGCAAGGGCTGAATTAAAGTTTTATAAGTATGCAT GTATTTTTTCAAAAGACTATCAACAAATGAAAAGTTGAAATAAT CTTTAAAAAAGATTTCTCAG CT CAAGACTAA AAATCCTGATTTGAAAAATTAAAGTCATAGAA CAGTTG CAGAGG CAGT CATTAGGT CCTT CAAAAT CT CCTAAAC TTGAGAGGGAACAG CTAGTGAGAG CTAATAGAGATGCTTG CAAAGGTGAC CT CAGG CCTTTC CAGAGATTAG CC C AAAGCAGATAAGGATAAAATACTTTCATTAAGGCAATTAGAG CTAG CATGAGAGGAATGTTC CCAGGAGACATCA CTTCTGGGACACTTTAG CTATAAG CC CCAGGTGTTTGACAGC CACAGTAC CTTAAAGCAGAGAAAGAG CTAGGT C TATTTCTAACTGCAAAAG CAG GAAAACCCT CCATTGG GTT CAGCAAAGAGTT CATC CATGGGTGGT CTATTCTGA TATCAGATGTAAATTTTTTTGTAT TC CTTCTCAATGTCTGATTTAT CCAGCGGCATAT CACATGACTAAACACAA AG CGAGTAGAGATGAGAGAGTTAC CAAGTGACGC CCAAGGGTTTATAGACTTGAGG CCATTCTCTGAAAT CATAG AACTCTTTAG GAGC TATTATACTGATATCTGTTTTTAGATACAATC TGGG CATGGTAG GTGATAGC CATGAGAGA TGGGGG CTAGATGATGGGGGCC CTTGAGTG GTGTGCTAAGATAATTGGA CTTTATCTTATGG GCACTCTCTT CAA ACAGTTTGAAGGGGTAGATTGATATCATCAGAATGGCATTGCAAATGGCCAAACCAGTGAAGTGTGGAATGTGGA CAGAGG GAAACCAGGCTGGAGGAAGGGAGATAGT CAAAAAGTAG GTGGAAAAG(N ) xGATGTGATTTCAGTAATC CTATTG GGAAAAGAGAGTAAAG TCTT CACCAAATTCTTAATAAAGTGAACTGAGAGGAGAGGATGGATTTGAGAA ATGGGATAAAGTTG GGAGACTTTGGTAATG GACAAAAGTGGGATGTGGGGGCAAAAAAGAGTGTTAGAAG CCTG C AGTTTT CTGG CTTGGGGACTTGGGGACTCT CACTGAGTGTGATGGAAACAGATTTGGGGAGATGGTGGGT CAGGG TGAGGTTGGGATGC CCTTGAAACATG CATGGGGG CATATT CT CATATCACTGAG CAGC CAACTTTCTC CT CTGCA CTGTCC CTTTAGCATG CACTGTTCTCTTCAAACCAGAATC CCTT CCTT CC CTTT CATACAGTTAACTCTC CTGCG CTTTGGACCCAGCTCTAC CTCACTTCTCCAGT GAAGACTTTC CTAGAC CCAG CTAGGC CAGT CAGATT CC CAGG C TCTAGTAGACTGCCTCCTTCTCTTCTAGCTGTAGATCTCAATTGGCAATGGGCCCCTTATGTGTGGTTCATGGAT TAATGT CTACCTCCCCTTTTAGAGAGGGTCAGCT CCATAAGTAC CAGGAG CATGTC TCTTTGGTAGTCAC CACTG ACTCCCAGTA
> H s9 66 74 42 7 6 - 6674 93 5 9
TTTGTTTTTACCAAAATTCAGGGATTCC CCC CTAATC CCAATGAGAATAATGTCTTAAATAAATGGAATTAAGAA
GCTTCAAAA(N)XCAAAACTCTTTAAAGTACTCAACCTATCATAAGACTAGTAGGTGTTTCTCAACTCTCCTTCT
CAAAACATGAGATCATGAGGTCTTAATG CTGAAGAT TATGGATTATTTTATGTAGAGAACATGGAACAAAGCTGG
ACTGTGCTCTGGCT CAATCAGCT CCACTTCCACACCT CGCAG CAATTCTCAATT AT CAC CAC CAGAGAGAAGAAG
GCAGTCCCTACCTGACCATCGCCTGGCCTGGCTACCTTATCTGTCTAAACAGTGCATTGCAGGAGGCCCTTTCCT
TTCTGTGCTGTTTTATTTTTTCTCCCCCAGCACTAGTTTTCTAATTAGAATAATTGAGGAGAAATACAAGATTTT
CACAGAATGAAAGCAAAAGT CTT CCAAGTATGAGAAAAATAGAACAAAGG CTGTTTATATCCATATTGATTATAG
TGGGAAGGTATTATTGTAATAAATGTGATGGTTATTAAAGCAAGT CTTGATTTTAAGGAAATGTTTTGTC TTGGG
ACTGTGACAGGAGATTATGTGATGTTCATGAGATGAT CATACTGT CTCTGTCCAGGTCT CTGTAACATGTAGAAC
ACAACCAACGTAGAGTTTCAGTCCCATTTGATTTACAGAAACAGCTTGTTCCTCAGTAGTATTCTATATATAAAA
AGTAAACACACACACTATAGTTCATAGAGAATACATTCACATGGATTACTAGGAAAACTAAAGTAGCCTTTCAAA
ATAATTAAAAAAG(N)XTAATAAAAATGAATTCTTCTCAAAAAACTACCTTTCTATTAGATATTTTTTATCTCCA
AACATAAAAGGTAAATTTTGAAATAACTAATAGTTAATAAGAAAAACATGTAAATAGTATGTAACTAGAATTGTT
AATTTCTTGAAACTTAAGGTTTGTTTTTATGTTATGTTAGCAGGATATGTAAATCTAATGACATTCAAAAAACAT
AAAGTGAGAAAAGAAGAGGCTGGTATTGAACATGTATATGTTTACTATATTCTAATAATGGTCGAGTTGTTTAAT
TTCTGCATGAATGTCTAGGCTTCACTATTAGTCTTCAGCTAAAATTCTGCCCATTTTGGTGAAGGAACTGTGTGT
GGCACATAAGGAGAGCTTAATGTTAGTATTTACCATAGCCTTACTAATATAATCCTATAATATTGTTCTCCATGT
TCC CAAAATTAAAAAGTAGT CTAATTCTACATACGAACTCAAAATAAATGTATACTGTTATGTGG CATAACTCAA
AATATTTAGAATTAAACACAGCTCTTATTTTCTCACACATACTGAGTATGGAATTACTATTATTTTTGTTTCCTT
GATCAGAGGTAGATGAAACTTGATAATATAGTCACTGACTGAAGGGCATTGTTTTGCCAAAGTTCGTTAAGTTAA
AAAATATTTTGGAT CAATAAATGTCTTT CATTAAAATATTCACCTACATAAAATAAGTATTCAGAATTGCATAGG
ACATATGAATGTCATCTTTCTTTTG CTGTTCATTTAC CCAACATTTTT CTT CTTTTTATATGTGC CAGGTTGAAA
ATTGTCATCCCATACTCTTTTCAGCTAAGAATATTTCTGTGTTGGGAATATCTTGAATTTCCCTCAGGGGTTGAT
GATATTGCC CACAAAGACTTTAGGCAATTTTGATGGC CAGAATGAAAACAGGCAGCTAAGATATTTATTTTTAAA
AGAAAAGAATAGAAAACTTTGTAATCTAGTCATACATTGTAGGAAG(N) xTGCCCCACTCTGGAAGTAATTATTA
ATAATGTTTCTCCTTCCAGTTCTTCAGTTTGTTAAGCGCAACTGTTGAAAACATTTATTTGTCCCAGGAGATTAT
ACGAAAAAATTAAGTGAATCTACTTAAACATAAAATTCCCAGTTTTAGAATAGA(K)xTGGCCTCCCAAAGTACT
GGGATTACAGTCGTGAG CCACCGTG CCTGGC CAGAT TCAGAT TTTTATTAGGGATGTCATTTAG CTAATTTAATG
TATTTCTATGCCATAGTTCATATTTTCTGTGCTGTAAGTATTTTCTATATGTCTGATTTATGTTGAAGAATAGTA
TCTCTTTAGACG CAGGGG CATACTGTGAGCATCTACAC CATGGCAAATGG CTTTGCATTTTGCACT CTATAC CAT TGCCAAAAAAAGGTTTGATTATGCTACTAAAAATCCGACATGAATCAGTGAACACCGAAAGCCAATAATTTTAGA ATCCCAAAATAGTGGAATGTTTAGAACTGTGATATTTTCAGTTTATTTCCTCCATCAATTTTCATTAGAAAGCGA GTAAGGAAAATAGTAGAATTTAGGAGTTAAAAAATCAAATTAGCGGTCA(N)xCAACAACAACAACAAATTAACA CCTTGCTTATTATAATAAAGAACGAAAGTAATCCAATACCTGTAACCTAGAGAATAAGGTTCACATCACTTAAAA TGGCATTATATATAAAAGAAAAGATTGGTTCTACTACATACATCAGGCTTATTAAGTCACACTAGACAACTGAGG TTGCCTAAGACGTCATTGTCCTGGACTTGCAAGGGCCTCCAACTCCAGTTAAGGTGATAAGAAAAAATGAAAATG AG CAAC CTAGCAATTTTCACAGGC CTCAGGAAACTGGGGC CACATCGAAC CCAGTCACAGGTTATACCAATTTGA TT TACAACAATGAAAGAACCAT CTGCTTATACTTGAGTTACAGCACAT CC CTTTGTAGACATCCATA CAATAGAA AGCCCCAGAAGTTGATTCACAGTAGAACAAAGTGTTTGTGTCAAGCAAAGGAAATTCTGCACTTAGAAATTTTAT CGGCAC ATTTGATTAAAAGATGGTGT AGGTAGG C ATAAGT AAGAAAAC CATAAG CAGACATAGT CAAC ATACTAG GTAAGTAAAACCAAATCCAGAAGGTTGTATTTAGTAATATTCATTTCAGATTTGCTTGGATAATTGTGGTTTGCC TTACATCTTTTGACTATGACATAGAGATTTTATTCACTTTTTAAAATATCTCCTAGAATGTAGCTTGAGTCAGTC AAACAG GAAAACATATTG CAGCATTCTCTCTTTTACCT CC CCCAAAGCACTGTAATATGG CTTTTGGAAATATTT CCTCATTGTCCTTTAGTTTTTGGAATGATGTCTGAGATACTGCTGTATGTAAATACAATGATATTCCATATTTCC TGTAAGTTTGCCTATTCAGAAAGCTGCATTTACATACATGCTGAAACACAATCGCTGGTGATGTATCAACCAGAA ATTTTATGTGTGAGCCTAAAAGGAAGTGTTGAGT CTTT CT CTAC TTAAATAATTAGAAATTAAAAGTACCTC TTC TG CATT CTACAGTTTATGTAATAT TAAAAGAAGGAATTTTGGCAAAGTAACTGAGGTTAT CAGTTCAGTT CAGTT TATAAATTTAGTGACATCAACTTGTCCCAGCAAA
>Hs9 123 940 66 4 -1239 51 579
TGGG CCTGTGTTGAATGAGGATAAATTATCTT TT CT CTAAAATG CCACATGAAC CCTCTCTATATT C C CACATGA AGAGGAATGGAAGGTAATTATTTGGT CTTTTCTT CTGTTTAGGGGAATGAACTGAACCACT CATTTTTTTAAAAT CACACT TAAAAGACACATGGGCAAAAAAGT TC CC CAAAACTACTGT CTTACC GAATTTGAGAAGGGAG GTAATGT ATGAAG CTTAACAG CT GG CTTCAAAAGACACCTT TC CAAAGAAATTGTACTACCT CTATTAACGTGTAAACCAC C AACCAAAAAAAAATAATAAGTTACTC CATOAAACA CGTTATTAT CCATAAAAAAGACTTCAACATTGTACTGGAA GAT CTATTTAAG CATAAATAGTACTAAG CA CCAATTAC TAAT CTGAAG GCCTCCTCACAGGT CCAAGGGCAATGA G CAACCTCAAGAGG CAGGTGACTG CA C AAG CAGTAAGCTATGGATTAAAAATTAAAAGGATTTCA CATTCTTTCC AAAGTGTACTGC CCGGTGTCTGGCACACGCAT GTTACAATATGACAAT CTGCTCTATTTGTGAGCA CCTGAGTGT
a t t a c a g g g g a t t a c a c a t g c a t a t g a t a g a a t c t g g c t c c c g g t a c a g t c a a a g g a g a a c a g c a t c a g c c a c c a AATGTG CTGATCTTACACTGAAAAGGGTTGACTGAAAACATTTCAAGGGTAATGTAAACACAAGAATAAAGC TGT GGGT CTATTACT TAGTGATGTGGTTTTATATG CTACATACGTAGAC CT CT CTTTATACAATGAATATGGACAGTG CTGCAATAAAAACTGATGACACGT CAAATT GTTCACCT GAAGAAAAAC CCTATTATAGTT CAGAAAATAATG CAA CCAATTTTAATT TAAT CTAGATACAGGTACTT TATTTACAAATATTTAGATTAATAGCAT TTTGTTACAT CAAAT GAAGAGTACAGCAGTCTGAATAAATC CTTCAGT CACAAAAACAATAAAA C C CACTGTAACTAACTTTG G GAGTCA GGGTATTG CACCTCAACAAG CCGCAGTCTTTAGTTTGTGATTGCTACCTTATAT CC CAATGGGTGGTTTGTTTGT TTTGTTTT TGTAAATATACACACACACCAG CAGGTCATGGTC CCTGGGTGAGTCCC TTGTGATGCAACAGTG TAA GCAAAATGGATCAC TGTAAGTCTTTAACAGAATATACCAATCTACT CCAGAAAT CTTATT TTTT AAAAAGTT AAA CAAAGACAAAAATAAAAATGAAT C CACAAATTAACCAAAG CCTACTTT CTGCACATTC CAGTTTTGGCTTTTAT T
t a a c a t t g a c t a t a c a a t a c t c t g g t a c t a c c a c a t g t t t a c a a c c c a g a a a g a t g t a c t t t t a t g t t a g t g t c t GTAAAGAGGGATTTAAAATGTGTATTTTAAACACAG CAGT TGAG CTGAGTGCATTT TCTATAGTACGCTGAGGTG TTACCTATTCTATTTCAAATAAATTCTCAATTCCCAGCCACTGAATCATAAATGCAATAAAAAAAATCAACAGAA ATGAAGAACTTAATAAAACATGTTGT CCAAAAAAATAAGATTGTTT CT CTTG CTATACAGTATTAATT CAGTGG C CAAACCAC CTGGTG CAAAGTAATAACTTACTTTGTATCAG CACAAG CG CTGAAACACCTTTAGAAACACTTT CC C TT TTACAAAACAATTTATGC CAACAT GACATAAAA CAG CC CCTCACACTGTGAACACAGG GATATCTTAAGTTAT TT CACTGTAGGGTTAAAAATGCACAATT TAAAAT CC CTTAACAG CAGACCTGTGGTTCTGACTGTC CAGTT CAGA AT CTGACCATTC CAAGAAGATAAAGG TATAAAAG CTTAAAAT GTGCAATAGTAAAC CCAGCCTTTTTTCTTTTTC TATACAG TAAGG CAAGAATG CAG C CTGTAATG CAAAAACGTTTACAAAAAGAGAAAAG CAGGTTAG CACATTGT C GATTGCACAAGAACAGTTAAGAAAAT CAGCAGGTAAG CAACAGTGCAAAGATGGAACAGAAT CTGCTAGTGTTAA TC CT CTGATG CTAGGAG CTCTTTCCAG CATAATGTC CC CAAACACTGC CAG CAC CAAGGGGTGGAG CCAGTACTA CTTGTGAACTGCAGTTGTGT CTATTT CTTTGTGTGAAATG GAAGGGAG TAACATG GTCACATATAGGT CATACTG TACAAACTGGTATTTTATACTG TT CCAATG CCAGTAAT CAATTTATTT TCTT CATTAAAATAATATACACAGAAT GTATTGTTAGTTCGATTC CTTCAAATTTTATACATATTTACTTT CT GTTAAAGAGAAAAGGATAAAATGGTATAA AAAAAGATAAAG CTAT TAATTAAG CACGAGAGAGAAGATAAATG GATATTTT CC CTGTGTGAGGCTAAGACAGAA GCAAAT CT CGTTAAGAAAAATG CCAC CCACACAACAG GAAATTTAT CCAAAACAAAACAAAAGCAGTTATAGAAC CC CT TCTCTACCAT CAGAAGTAATTT CACAGCAATAAACTTATTGGTTACAACAGACATACTTGAACAGTTAAGG ATGGGAAGAAAG GC TTAAGATATCAC CAAATTAAAC CGTACAGTGAGACAAAGC CTTG CCAAAGGGAGGGTAAAA AT CATGAAGT CCAG CATCAGTG CT CG GT TTAAAT CATATATTGGTGACATACTTAT CACGAGGACAAGGG GGAAA AAAAGT CTAGATTTAC CATG CAGGAGAGATTTAT TACT CTACTTGC CTTTGATAAC CTGATTACAT CTAGTTGTT TAGCAGTT TAGTATTGTGTTAAAC TGTTTTTACAA CAGAGTTTTTT CTTTTTTTTTAATTAAACCCAGTAAGATG TACAGAAGA CAATGAGGCAGTAAAAAGTACTG CTTC CAACAGACAGAGGTGAAAG GTCAAATGAGGGG C CACAG C AAAGAG GT CACTAG CAGC CACAGC CTTCTCTCTGGGGTTGGGGTTCACTGGTTAGC CGGCCTCCCTGCGGGGCTG AAGGTTTGTGTTGT AC AC CAGACTCAGCAG CATT CAGATCCAAG CTTC CATC CTGAATGT TC TGATAGATTTTCT TGGCAGCCTCAAGGAAGGCATCTTCTACATTCTCTCCCCTAAGAGGCAATTGATAACTTTATTGGAGAACCACAG TT TT CTACAAAAGACAAGA CACTGACCTTTTG CTAATCTTTAGT TAACTG CCATGATGTCTC CAA C TTAAC CACT GTCATCTAATAAGAGATTACCAGAACACTGAGCTAAGAGAACATGGAAAACTA (N) xATGAGCCACCACACCCGG CCGGAAAACTACTTTCTATGGAAAGCATACATACATACGTGTACTGAAAGCCCTTTCTACAAACCCTATTTGGAC AACTACATGTGTTATATGCAATCATTAAAATACTGTATGTTAATACACACCATGACAATTCAGATAGGTCATTTC TGTGATAACAGAGAGGATACTCCAAACAGAAATGAAATCTCTCTGAACATGCTGAAAAAAATTCCAAGAATAATA TGGTCTGCTCAGCCAAGCATGAACTGAG CTGTTTTTAATCGGGTAATGGTGACATAATCATG CTTAGCCACGTTA ACA C CTCCATTCAGTTTT CTTTG GACTAATAG CACAGCTGAGACTAAACGAAAACAGGTTAG CCTCTCCTCACCA CAAGAAATTTTTGG CCTAATAAATGAAGAA CAATGATGGGTGTACCACAGTT CAATAGCAGA CATGGAAGAACAA TTAGAAAATGTTCACTATGCTTCTGAGTAGACACTATTCCCTTAAATCATGCTTTTTAATGTGATCAGAAGTATT ACAGGAAGAACAGATTGACCAAGCTTGTCTGAGATGCCAAACTCAACCTCACTTGTGAAAAGTCAAACACTGTCA TTTGGGAAAAGT CAAA CACTTTTGAAATGTAAACAAAGTTTCAT TTAT TAAC CTGGGTTACCAA CAGGCATAATC AAGGTACAATCTTTTAAGTAACAAAAATTCATATTATTTTGAAATGTAAAAAAGGAAGCAAAGAGATGTTTGCTG TTCTTCCTGCAGTAAGCATTACACATTTATAGATAGTACTTTATATGTGTATATACATTATATACTTTTCATATA TACGGTTCTACTTTGAGAAACTTGAATATAATTGAAATATCTGTATTTTGGTCACACTTACGTTTTTGCACTCGC TT CGAGGAACAATAAG CCTAAAAATAAAATTCAGACTATAATTAGGACACTT CAAACTAACT CACAAGTGCACTT CTTTAGTAACATCTGAAAAAGTTTCCCATCAGTTTTTAGTACTCCTCCCATAGTATTTTATGAAAAAGTATGTAA GT TTGAGGTGGAGAGCAT CTTTATATTTGT CACTAAATAACTGTAC TG CT C CAACATAAATCACA CAGAAATAGA AACACATTGTTACTTCCTCATTCTTGAGATTTCTGAGTCTGCATTTAAAAAAAAAATGCAAAAAAAAACCAACAA AACTGTCAAGCC CACCTTT C CTTAAAAATT CT CC CAATTGTGAC CATTAAGT CTTAAAGAAACT CAC CATTTTCT TCAGCAAACTGTTTGGCTTCTTCATATGTAACATCTCTCTGTGCCTCCAAATCTGCTTTATTTCCTATGAGAATT ATTACCTAATTTGTGATCAAAAGAAAGACACCTTGTAACAAAAGCAGTAATTTAAACCAACCATGCAAATTCTGA AATC CAAAAATCTTGATAGAAAAAATGACTGGGC CTAAAAAGAAAACCAAGTACTAG CAATTAT CCACTAGACAG GGCTACAGATTCATGAAGTCATTACTACAAAGGACCTTAGCAATGATCTAGGT(N) xGTCTAGTCCAAAGATTAC GATATAGCTGAGTAAGATTACTAA ( N) xAG AAAC CC AAACTGAG AT CTTG AG AC AAATACTATC CT A CCCCACAC TATCCCTAGAAAGGATTGAGTTTATAGTTAACAGTAACTGATAAGAGATGTCTATGTGTATACACAAGGGGCAGC TGGGAGTAGGG GTTTG GGGG CACGTATG CAGTAAGTTCAAATCCAAGG CAAAATATCACC CA CCAACCTATATCC AC CTTGGAACCATT CATACAAGTGAAGT TAGG CCAAAGTTAAAGGCAG CT CTGCAAAGTTGTAAGñAAAAAATAT GTAC CCTTGGGAAACAGCAAAGGGCTTTGG CC TTTACCTGGGGTGGGGAGGGGGGTTATGGATAAA CTGTCTTTG TACATTTATGAGCTTATTTGACACCAACTGGTAAAATCCTTCCCTTTTTCCCCCTTCCAAACAGATATGGTAATC TAGATATGAACACTTTAG GAATG GATGGTTTCAACTAAAAG GCACTTCAG CCATTAACTTTTTTTCATGTAAAAT TACAGCTCCTGGCTCTTCCACTTTCAAAAATGTGTGTCCATAAACCAAATAATCATTTTTATCTGAATGTAAACC TCATGCAAGGACAGTTAAGTAGTACAACAAAAGTGAGCATTCTTTAAACAGTGTGGACAAAGTGCCCACTGTGAA GGGG AAG AAACT TT CATAT ACT ATCCAT TAGT ATTTTAAAAGAATAAAAT AATG ATACTT AAAAAGGAAATCAAT TTATAAAAAATCAAGT CTGG TAAAGCCACAATGACTAGCATAGG GC CATTACAAGATAGGTACT CAAAACAAGAA ATACTGCCCTGCTCCTGATTCCCTATGAATCTCCAAATAAGGCTTCTTATCTCCCTGAAAGGGAAGACATAAAGT GG CATGCTAATTACAGAGATACAAAACATG CC CACAACAAAATG GTAGAGAACACAACTT CTACACAGAAATCAA GGGCAATTCTATAGAAAATGGAGCACTTAAGAAAAACTCAAGAAATTTTTTTTCTCCTGATCTGGTTCTATTTCA AAG CACATAAAT GñGG CAGGAAGGACACAG GG CGGTAAGGTTAATTTTATTATACTCCTC CACGGGCC CTCTACA ATGAGTTTTGTTTTTAATA(N) xCTTGGGTTTAATGCGATAATGGAAATGAAAGTTTCAGGCACAGTCCCTAGCA TGATAGGCAAACAATGTTAACTGACTCTCAATATGATTAAACCACCATTTCCTGATAAAAGCTCATCTTACCACT GATAACACAGTTCTTGAAGGAGGCCTC(N)xCATCTCAGTCTGGTTAAGAAGCTAATGTTTTAACACATATAGAA TCCTTTTTATTTTTGACTGAAATTTTTATCCTTAATTCTCCTCCTGTAG(N ) xATGCAGATCCCTTTTAAAATCA TAATTTCAGATTTCATAGTCTTCACATGAGATCAGCACTACATTCATAATAGTAATGACAAACATGAACAAAAAA CCTAGGTATAATAAAATGCAAAACCTGGTTTAATAAGAACTGAAAAATAACTGTTAGGTTTTCTTCAACTAATTG AAGAAT AC AATAAAAATT CT CTCATTCTTAGT AC CTGAAGACAAAAAT CT CACC AGTAAATGGCTT CC CTTTTTQ GGTTAGCTTAGTTCTTAAATTTTCTGTGTAATAAGATGCCATTTAACTTACAGTATTTGGATTGGTGAGATTCCT TG CATCTGTCAACCAG CTGCTTAAG TG GTTATATGTACTTCTTCTG CAAAAATAAAAGTT TAAATT TGTGACTGC
C (N) xAATACATATTTAATCAACATATCCTAAAATAATAATTTTGCTTTGACCAGTTACTCAGAACAGTAGTTGT GAAATACAACTG CATACT TTTCAAAAAT CTAG CCAAACCTAATATTTTTAAATAAACAACTCAATATC CAACAAC TTTGATAATTACAGGTCAGGAGTAAATTGTTTACTCCTAAAAGCATACCGTAAACAGACCAATAAGCAAGCAAGG AACAACACAGTAAATTGATGGAAATGCCAATGTGGAGTCAAAACTCAAATTACTTTGAAATCATACAGACTTTCA ATCACAAAGCACAAAGTTGTATCAAAATGCCTAAACATGACACATTTAATGGAAGACACTTTTGCTTGTCTTTAA ATTTTAATTTCATTAAAAACCAAGTCACACTATTATAAGCTGGCATACAATTCACCTTTCCTTAATAATTTAAAG AGCAAGCTAACTATAACAGCATAAAATTGACTTTGAAGACATTTGACAAGAATTCAAGTCCCCAGATATCAGTGA AAAAGTTCGTCCTAACTTTGTTCTCTTATCTATCAGGCAGCTAAATTATTCTGGAATTCTAACTGCCCAAGCAAG AGTTTAAAAACT CATAGG CTCATTTGCACTAG CCAAGTAAGCTC TAAGATAAGTAAACAAAC CCAAATATTTGTA ATATCATACACAGTAGAATCTACCAAAGGGGTCAAAGATAATAAAATTTTTGAGGTTAAAAATGTTTCTGATTTA TAGACTCACAGAATGT{ N) xACTTAACTGGTGTCAAAATCAGGCCTAGAACTCTTTTACAAATñCCACATGTCCT TTAAGTGTTTCTAGTTTAGTCCTGTGAATATATTCCATGATCTCTAGCTGCCAGTTCTCATAGAAAATCTGTTAT TCAAAGTATATAGTTCTTTCCCAATGAGAACTGAAAAGAACTCCTTAGTCAGACTTTTATTGTGTTACATCATAT TATT CATAAATCAC CACTTAGATGTCAAAAAAGT CATATATATCAAAATAGC CCCTTTCC CC CCAATGTTTTAGA AT TTTCTAAGTAAACAATGATAAAATA CAATTTT TAAAAAGCA CAAGCCT TATCTGTGTTTGATATTCGTATTAT TGGCATAT CTTGACTTTATACAACAGGTATGT TT TACTAC TTCATG CATAAACCTTAAACAAGT C CGAACATTT C TACTGACTTTGATGAACT TTAAAATT T CAAAG CTGACACAGAGAGTGAAGTACACAGAAAAG CCTAGTAC CAAAA TCATATAAACCATATTACTTACATQTACTATAACTTCTGCCAATGATTCTGGTAATTG( N ) xGGTGGTACTAGTA CTAAAAAAAAAGTG CTAAAATACTAGATATGT CAAAATGTTATG GCAAAACAACTGTCACGACAAGCTAC GCTGT CAATTTAGAAAAGCACCCTGTAACAATGAGAGTTAATGACAAAAATAAAAAGGGACTTTTACGTTTGTAAAATGT CAAAGAATAACAGAAAAAAATTGCTGTCATGACTGCAACACCAAAGTCACTATAAAGTATCACTTTAGCGCCTGA CTAAAGCCTCGGGTACTC( N ) xTGTTCCCAGCTGAAGTACTTTTTTTGATGACTTGAATAGACACAGATTTTACC AAAAGTATTAAC TATGCT TAAAAGATATGCTTAAAA
>H sl0_106523167-106534967
GGTTTCTTTTTGCAGATTGAATCATGTGGCTTACAAATAGTCAAGGATCTAGGATTTACTGTTCATTCAGCTATG TG GAAACACACCTATTACATTAAACAAAGTAGAGTTTATT CATT C CACAAATATGTATTGATAG CTGGGAATAAT TTGTTATTAGTC CñTCTT CC CTTATGATGG CCACTCTC CATCAG GG CAGAACAGGCGTGGACTT CTATGCTCCAC TTGTTACTTACAAGCTAGATGATCTTGTGGCCAACGCTTCTTTAGACCATATTTTTTCAAAGTTATTAAT( N ) xA TT AATTTTACATAGTTTT CTGTTT TGGC TTGATG CAGGATATCTAAAAAACTTATAATAAAC TT CATAAATAATG ATGAAAATAGTGATACATTCTTAT TACATT CAGAGCAAAAAAAC CTGATACCTTTACTACTT CT GCTTTTTCAAC AGTAGCAGTGCTATGTTTAT CAAACTTTAATGTG C A T (N ) xTATGCTTTACACATTCTGAGGTGCCCAGCTGCCT GGTGAGGTAGTTGTAGGGT C AAGTG ATAGT AATT TGTAAATC AT AAAGTATT CT AG ATGCTTGAGCC AAG AAGGT AATGAAGTATTAGCTTCTCAAAGCATGGTTAATTGCACTACATTGCCTGGTCCATAGTACTGGGGTAATGTTTTT TTCAGAATGCTTAACTATGGTTCACCCAGCAGGCTCTAGGAATATATTTTTCACCTGTTTCCTGTGTCATATGAA TCAGTTCCCTAAGTAGATTATTCCAGTTTTCAGAGACAGGGAGTTTCTTAAACTCCCTTTGGCAACGTGGTGAAG T G A (N) xGCTGGTCCCTAGAGCTGGACTAACTTAGGTTTGAATCTCTACCAACAGCCAGAGACTCCTATTATTAA GCTCTGAGACCTCTTTTTAGGTCTGTGGGAGTGC CCTATATTTAGGTTTCTTTCCT TTGGACCCCGTGGAACCTA GTCTTTTC TCTC TACCTTAAC CATACTT CCTCTGTCCC CTGTAATTTCTCTCTCCTTGAGTGTGATGAGACAGGA AGACAGTTTCCTCAACTCTCCTTTTCCTCATAAGCCCCTTGCCCTTATGGGCCTGACTCCCAGTCTTTTGTAGAA ACAAACAAATAC CAACATGAAGAAAT GAGGAG GTTTATGGAGCC TTACAAAG CTGTATGGTCGTGGAG CTAGTCT CCAACT CCTAAG TAGTTTAGG GGAA(N ) xCTTCTTCCAGCCAGCCCCCTGTGCACTTTGTCAACAAGGCTGACAT AGAGTGGACGTTGTGACTAGATAG CTTT CTTGGCATAGAATCTAGATGAC CTTCCTACATGC CGTTGGATGGGGA GAAATAAC( N ) xGTTCCCACATCTAGGCCTGTGTGGGGAACATCTTGGTATCCACAGCTACTAACTTGCCCTTTG GGGCTAGAGAGGTTAGGGATTTGCTTTGTTTCTTGGGGGATTCCTTATCTTCTGGACTTTCTTGAAACAATACTT CTTCTTGACTCTGG CATTATTCTG CTGCATTCAG CAGGGTTCCC CAAATGAC GCAGGAAAAATGTCAG CATGGGñ GAGAGG GGAGTTGGGACTAAGTCTGGGACAGACC CAGT GAATCTGTTAAATCAGAC CAGAGT GAACTG CCAGTGT GAGGCCATGGTCTGACTG CTGTGGTATT CATGAG CAGAGC TGAGAAAGTGATTGAGGAGAGT CTTGGTGGAACAG AGACAGTCACCACATCTGTATTTGTCTCTTTGGCCAATACTGAAGCCTTCACTCTCACTCCCACGTGTGGGGTTG TGAGAGGAACCATTGCATGGAGCAAATAGTGGCATCTGTACTTACCT(N)xAATGAATCATGAATGAATGACTCA TTCAAGAAAAGGCCCTCCATACTGCTGACTGTTGGAGGATGCTGTCCTCCTTCACAGTGGCCAGTTGAGCATTCA CATTGACTTTCTGTTGCTGC CACTGTATGGGC CATGTT GGAGACAGGATT TGGAAAAGTGGTAGATGGAAGGAAA AGCAGAGAGCACACACTGGGTCCAAGAAGATGTGGGGCCACGGTACCTCTGAATGTGCTGTGTGTGGTGGAGTAT AATTCTCCATTGTAACTTGAATTGTTTTTGAGAAAATGGCAACTTCCTTTCTGTTTCCACCCAAAGGGTGCATAT TGGCCTTCTGTGACATATTGTCCCATTATCTTCTCCCCCATGñCTATGTGGTTTGATTTAACATAAATATTATCT GCATTG CAGTATTATAGACAAGGTTCATTTAT CC CTGG GTAATGAT CTTTTAGGTTTCTAATGT TTCAGG GGTTA GAGGAAG GATGGGAAAAGTGGTTGTT CT CAGATT CACATT CCTT CAATACTT CTG GTGACTAGC CTCACATCCC C AAGTTC TC CTTG CCTTCTGATTAAATGTGGTT CTGACT CTCAACATCTTGTTTTCC CAGCTTAAGTGTAAGCCCT CTG AGAGC TGAT TCTGTG A CTTGTGCTG AC AC AG CTCAG G ATTT A C AT AG AG AAAATG ATCAGT AAAT ACTTGAG TAGTGGGCAGTGTTTGTC CACTTAAT CTGTGCTG CAGCAATGGAAAAAATGACCTT CT CAAGAGGGTGGCAAGT C CTGCTTTG CAAATCACTGAGTAAAGGAAAAGC CC CGAGGAGCCTTGGGACTG CTAG C CTACCTG CTCAATAAAG T T C CACCTTAAGATACAGG CTAAGC CTTACCTC CTGGGC CTGCAGTAACAACATATTTTGTAATG CTGATT CCAGA GG ACTTGñCTGTGG CAAG CTTGGGTAAGTT AGGT ATCAGT AC ATGT CTTTGG AAATTG C AG (N ) xGGTTTTTGAT AAATTTATGAGTCTATTAGAGCTTGCTGGTAGTATAAAACATGGTCAATAGGTGCCACTCTTTGAATAGTTTTAC TGTAGGCATTGCTCATCGTACTCCAGTACAGTGCATACTAAGGGATCTAGAACCCAGATGTTTAACAGAATAAGC AGAATTGGCACTTGAGTCAAATGTTTTCTGAAAATAAACCTCTAAATATATTCTCCTTGACCTGGGTCAGAACTG TAG AG (N ) xTAGAGCAGTCACTTTTCCTTCTTTAGAGGGAGATAAGTAGGCACATTGGATTGGAATTCAGTATTA AGGAGGTGAACAA CAGATAGTTTT TGAGACAT CTGAGAGTTTGT TATACT GAAAAAACTGTGTC CTACTTTTTTT TT CTAAAT TGTATTTTATAACACAAAGTATATTTAAG GAGAAGTTGTGGT TGAGACGGGGCCAGGGAGAATGGGA GAAAG GAAACAAAAAATTAGATAAGCAAAAAGAT TTTTTT CTGTCTATTGTTTTCT TCAATGGTGTGGTATCCAT TAC CAT CAAACAGAGCAC CTGTAATAAGTC AC CAGGGCTC TTG AAG AAATTGGTGAAG CTTC AT TCTCTTGCAGT GC TTATTAGTGATT CAGTGTTTGTTCAG CTTCTG CTGATGGAGAATTTTC TCTTCAGAAGCTTCA CAAGT CAGAG GTTACCCCTTGCTCCCTCCAACCCAATTTTCTATGCAAGAAGCATCTACTTTCTCTGCCTAATTAGAGACACTCC T C CTCCTGAATGACACTGGAGTGACCTGTCATG GAGG CAAAATGAGAGCACATAGAGAGTGTG GGTAAGTATGTA AAGACCTGGGCC CAGAG GGAGGTACAGAGG GCAGGGG CGAAGTGTG GAGCTGG CTATG CAGACCGAAT C CAAATG GC CTTAGAGAGTAG CTCCACTTTC CAGCACCCTCTCTGCAGAATATTGACACAGGCTTCACTTTGTTTCTTCCTT GCTTCTGTGTCTCCCCTCTCC CAC CACTCTAT CTTC C AAAC CTTC CAC CTCTCCTGTTTT CATGAGTGGAAGAAG AGAGGGT CAAAGAACT G C CAACTTA CTTG CAT TTAAGAAAGTG GAGC CAACATGTAGTGTGAGAAATGTGTG GG C T CTTCACAATTGCACAGGGAAGAG CAGAGATT CAGGAGAAATGA CACTCCTTC CAG GGGCTCTAGACTGCTG CAT GTTAGGT C CCTAC CAG C C CACAG C CAGAGGAAAAGCCAGG GTGAGGC CACTTCACAGGGCTGG CTCTGGAGGCCA TC CCAGAT GCTGTGCC CACGGAGGTGTTGTGGG CTGTTTCTGTTT CTGGAAACAGT TT CAGAAC CTGAGTAT CTG AGGAGATATC CGGAGGAC CACAGT GAACATT CCTACTCCT TGAATGGAGG CTGGC CATGGT CCTTGCTGTGGGGA ATGTGCCTAGGTTTGG GGAGATG C CTGCTGAG GAGCTTGG GACC CAGGTGTAAGAT TCAGCATC TGATGGAAGAA GCTGG CAAGCATTAC CTG CAT CCTGTTCTGGTAC CTAAAGGGAGGGGGGAATTTT CAAGACTTGTTAGAGC CAGG GGAGATC CAC CAAGAATTAATGC C CAGCCTC CAGGAAGT CTGAGACAT CTGCTTGGAG CCAGGGA CATTTAG CTG CTGGTGTC TAGGGTGATG CTGTCAGGAGTGAG CACTGACT CTGAGGCT TAAGTGT C CC CTCTCT GTG(N) xTCTT GGCTTAGT CTTATAAATTATACAC TTTTGTCAGATGACTT TTCTTTTGGGGGC CAT TTGGGATAATGATAAT GAT GACTGTTATGGCAGTGTTGG CTG C C CGAGAG C GAAA CAGGCTCC CTAT GAGGT CAT GAGC CCTTTGTCACTAGGG GTATT CAT GTAGT CAT CAAAGAG CA CTTATGAGTGATTT CAAGA CAAT TT CAT CAGGAATATC CAACTATTAAAA AGTTAGAT TCCTGGGCTG CAACCTA CTCT CAAGGACT CCAATTCGGCAGG CTTGGGATGGGAC C (N) xGAGGAAC AAAT CTGGAT CAATGT GCTGCTGTAGGGAGGACATG(N) xAGAATTGAGTGTGGGACTTTTGTGTATAAGTGGAA AAG GGGAC T C CTATTT CGAGG CCACAGAAGCAGTTCCATG GTATCTGTGGG GATG C CAGTTTTGTGGGTTTTAC T GTATTTCATGGGAGGATCTGTGG(N) xTGTCAGATGCTCTCTTTTCCAGTTTCAAGATTTTCTGTTCCTTGAATT CTGTGTGGATGTTGTTAG CTAATAAG CAGTG C TGATGTGAGAAAGATGGG CCCCTGTGTG CAAGAGTAGAGGTGG CT CATGGAAT CTCTGCTTGGCCCCTGACCCTC TAAAAGGGC CCTGGGATC CCTGGGCAGAAGAT GTGCC CAT CAC AGTGTAAT GAATT CTAATTTGAAT CTTGGCCAGGGTT CAT CCTCGGGAC CTCAGTGTG CTGAAC TTTTGGGGTGG CAGGCTAT CACTGAGACAGCCTAGC CAGG CAGCAGGAGGAC CTTGAG CATGAGATGATGTTTG CA CAGG CAT GG C CTCTCCTTGCTCACTATG CCGTGAAGGTAAAATATCCTAC TT C T CAGATAGAGGGT G CAAAGTT TGGTACAAGTG CCAATGCACT CAAAGGATAAGGAG TAG TC CAG GACCACAACAG GGATT TTAGACAGTTTCT CAT GCATGCCACTG GTGCTGGTTGACCCCTT CAAG CAACCTGCAT CA CACTAGATACAATGAGAAGGACGATGGGGGAGGGAAAGT GAG TTGAC CAAGG CTGACT CCAGGAGGTTCTCCTCTTCTCCTC TGAGAGC CCTAGCCTTGGCTTCCT CTTGGAT CATG TGTTCCGACTGTTGTC TTTTT CCAATTGG CCT C CTTGAAAGGTTTTTT GGGGATG C TTAGATT CATTCT CTGAGT TAGGAATGTGTAT CTGGGTGCAGGGAGGGGC CAATGCTTAGG CTAAGACTTGTGTT T CGATAGGTGGGG CTT GGG TGTGGGGATGACCCCAGGATGGGCGTCTTGGGTGTTG CAACCTGGACGGGAACACT CCCTGCCTGTCATCTC CAC CTCCTTTC CAGAATC C TCTGT CAT TGTTTCTTAGG GAGCT TCTTAGAAA CTCTGCT TCTAGGGGA CTGGTGGGGT GGTGG CAG GGGATTGT GCAGCAGGGGTGGGTGGG CAAGGAT CTGCATCTG CAGGT CTCCCTGGACCTCACCTGTC CACTG CCCTGGCCCTTCT CACACT C C CAG CTGCTTGCTCTCC CACAACAGGGACC CTGTGTCTCA CAGTTTT GG C ATTGATC C TAGGGGC CAGCTTCCTTGCTTCTCTGGTGTCT CTGATTAAGTTCAGGAAAGGAGC C T CAG CAAT T C C ACCAGGTTGGCTG CAGCC CAAAT C T CTGT CAT TTTCTTGT TACTCTTAA CAATGT CTGCTGGGTAC TTTGGGCAC TGTGGGGTTTCATTCATCAGC CCAGATCAAG CCGTTTTTCCTCTTGGATTCACTTATTGT CTCACA{N)xTTGCC AATCT CTT TTTATAC CAGACTTCTGTTGCATGTATGGTCAGT CAATT CAT CAAAAAG CACATAT TGGCACAT GAG T CAAGGGAGATG C CAAGGACACAGAGATGAAC TAGAC CTGA C CTCTG C CTTATGGT TTAGGGAG GAAGACATAT T CAGTAAA C CATTACC AATAATTAG TAAATTAC TTGCAAAT GTGAATGGGCTGG(N} xTCTAAAGCAGGGGAGCAG CACACATGTGAGT CC C GAGGAGGT CAGTGAGGCTGGAGAGGTGACAGT CATGCAG CAGGGACAGGTGCACGAGTA TCTGAGAC CACAGAGG CATGG CCTTTCCTTGCAG CATGCTAG CATGTGTGTTAAT C TTTC CTAGTC CTTCTAGTT AGGCTATGAGTTC CTTGTAGG CAAGGGCTGCACCAAGAG C C CTTGGAAA CTCGGAT TCATTCCCAGCTG{N) xCC AG CTGACGCAGGG CTAAG CATACAGAGCAGAAGG CAG CAGGGTGGAGGAGGAGGAACT CAGAATAGGGAGG CAGA CACT(N)xTGAAAGGATGTACATGAGAAAGAAGTGATAAGAACTGCTCAAACTAGAAGCCCCAGCTTAGGTCAGC AGAAT CAGTC CAGGCT GGTTTCCTCCACCTCT GTTATTCAAGTGT CCT CAAGAACT GGACAGTGCAG GTCTCTGA GATAT(N)xCTATCCTCTGTTCCTTCTGGA(N)xTTAAAAAGTGTAATATTCTCTGTTGTGTTTAATTTCACAAG CCATCTCC CATC CTTT CTGGATGAAGATGGGATACATGC CTT CCTATAAAACATAAATTGTGCATATC CTGT TTA CTTGCAGTCATCACCCCTGACTTCCTACCCCAAATCTCGCAGCTGTCAGTGGGACATCAGTCTTTACTTTTGCCT GAATGTT C TATCC CAG CCCTTCCC TGGGACAGCAG GAAGAGGG CTTTCCTCCTTCTAGAGGCTCT CTGAGAT T CA TGACT CAACTTTG CAAGG CCATGT TGTTAAT CCCCATGCC TCAGAGGTATTCAGCT GCACCACAGCCCCTTCTTT CTCTG CAAC CTGTGTCGACGCTGC CATTT CC C CGGGGGAAAGGGGAAG CAGTGTTGTC CCTTG CATGGTGTCTGC CATGTGTT GAT(N) xCATGGCAGCTTCTGTCTGTGGCCATGACTCCCAATTACCTGACAAGAAAGATAGATTTGA CCAGGAGTTCCTCTTC GAG CCTTG TTTCTGAGGG CTG CAG GTAGTGAGAAATAACAC C CAGGACATTTAGAACTG GAATC(N) xAACTTCACCAACGAGCTGAGACATTCCTATTTAGGCTTGTTTTATGTTCTAATTCCCAAGGACATA TTTCT CAT GGACTGTACT CCAGAGA CTCTAAT TATTT CTT GTTAGTCAAGAATAAT TTTAC CAAG CATCTGTAAG CTATGAGGAG CCTGGGATGTGTGT GTGTT CAGTTTTGGAGTGTT CTGG GAATT CAGTTTAACT CAAGCATTTAT T GAGTGTT CACCCTTGG GATGAAATATGGTATGCAACATGGGATTTGCAGTTGGAAAC CTGAGTT T CAGT CTGAGA TTTGCTGTTAGTTAGCTTTGGCCTTGGGCACACA(N} x
> H s l l 18383367 -18393825
TTTGGCATCAGCTTCCCTGATT GT TTACAGTGGGGAAAGT CAGTGGATGAGGTATTGAGAAGTGT CACTTTATG C CCAGCATTTTTAGCAT TGTATT TAAAACAAGGTTAGAGCAAC CACATCAGAT CATTTGTATGTG CAGTTATACAA GCCCTCCCAGAGGTACCTTG GAAAGAATAG TACAG T CACGTG TTG CTTAA(N ) xCTAGTTACAAATTTAAGATTC AGTATTCAATAAGTAGGATAAATAAAAGGGGAGTTTTACAATTGAAAGCTGAAAAATCATCAGCAAGATTTTTCA TATTTATTTCCTAAATATTCTAAGTGGGTTTTCTTAAAATACAAAACAGCATTACTTATAATCAGATAGAAGCTT GGAGTGTCTTCTCACAGCAGGG <N>xATCTCAAAAAAAAAAGAAAAGGATGCTGAGGCCATGCATAGCCTTGCAG AAGGATATGGTGTTCATGTCTTCCAATTGATAATATAGATACTGCTGCCCATCAAACAGAATAAGTAATTTTTAT AGTTTCCAGACAGAGCTAAATATTACTATCCCCCATAATGATAAACAGGTATGGCAGTAAGGGTAATTTTTTTAA CTCTGTACCCCTCAGACCTAAACATTGACATGCCAAAGGAATTTAGGACTTCCTGATCTCCATCTCCATCCTCCC AGACTTGGCAAAACTTTACTTCTAAACACATAAACACTGTCCACCTCCAAGGAAACATATTCCAGAGACTTCTAC CTCAAAAGTCAGTATAA(N) xGATTCCATATGATCCTGTTGACTTCTAGCTATGTCCATGATGAGGGCAAAATTC TTAATCTACTCAAAGATCTATAGTCCTGACCCAAGACATGACATAGCACATTGTGTGAAGCATTGAAACCATTTA GGTTTTCTGTAATCTGCTGAGTTGTATTGAACTCCTGAAATGCTGGAGTTGGAAAAACCCTAAAAATAGCAAACT GTTTCACAAAATTATGC(N> xAATAAGGCATTTTTCCAGTGTTCTGCCAAGATGTATATACGG(N)xGTTTATAC TACCAT CCTGGCCTTAGTTGGCTGAC CAGTATTCAAAT CTAGAAAATCACAG(N) xGTCTTTCCTTTCCACACAA AATGTGTCTGACAGCACCTTCTCAGAGGGCTTGCACATTACCTCATTGACTGGGGAAATACAGAGGAGCTCCGAA ACTACACTCTGAAGTGTGTGCCAGATTTGGGTTGGAAGGATTAACTTCCCCTGCTTTTGCCCAGTTGTTAAGTGT CTTGCTGTGTTTTT CAGTTGGTAAGT CACATAGAAGAGATGCTC CAGACAGC CTACAA CAAGCT CCACACATGGC AGTCACGGCGTCTGATGAAGAAAACGTGAGGTGGCCATGATGCTTACAGGTTTTGTGAGATTGAGAGAACTATGA CCTGCAGCAACTCTGGAAACCTGGCCTGACAGACAAGCAGATGACCTCACAGGAGTGATAAGAAACATCTGCTCC ACGCCAACTCCCAGAGCTGATGCTATTGTACTTGCACATTGGAGACTGAAAGGAAAGAAGGGACTAAATGCTGGG GAGGTAAATTAAGACAGAACCAAATGAGCTAAGTTGCAA(N)xTTTAAAAGACTGTTTACTGGAGTTGCTCAGGA ACTGCTTTTGATTCACATTAAGCTGCTTTCAGAAATTAAAAAAACACTTTTTAAAGGGTGCATTGATAAAATCTG AGGTTTTTTGGTTGTCGTTTTTTTCTGTGTACATTTTTTTCCTAAGTTTATGGCACAGGGTAGACCTTAAGTATT CCTCCTCCATCCTTCATTCTTCACCCTCCATTGGATCCTCAAGTTTTAATGAATTCCAATTATACCTTACATCAG CAAGTTAAAAAAAGTACTTTAAAATAAAGCAAAGGGAGACTGTTGCTCAACCATCAGGAAACAGTTGTCAGAAGA CATCATTGGTTCTGTGTTTCCTACGGAAATAAGAAACGATAAATATTGCACTGAATGTTTGTGGTTTGGAGTCCC TGAATAATAAAGAGGGAATATATTTGCAGAAAGTCGCATAGGGTTTTTTAATGCAGAATTTTGTCAGAAGACAAT GGCGCTGC ATGTTT TTCTTTG AGTGC AAATGT ACATTG CTAAGATT TTTTT AAGATGG C ATGTG CTTTG AAAAG A AGATATTGCATTTTTAAGAGTTTAAAAATCTTATGAGTGAGAAATATTAAAAAAATCTTATTTTCACCTCTTTAG AAGAAATAAAAGATGTTTCTCCTATCTCCTTTTCTCTAGTATTTGACTGTTACTGTCCTTGGCGAATCGATAATC ATTGCATAGTGACTGAAAAGCCTAAGTGCAAAAAAAAAAAAAAAAGATGTTCTTGTTTCTGAACTTCGTGCCATA TTTTGTTCCTGATGGGATCAACTTAATGTTTAAGACTTTAGATGTCTTGTATTAAAAATTACACAAAAAAAGTAA AACTTTTTATACTTACCCTTTTAACTCTAACACATCTCTGGTTTCTCATTATGTGTGAATTTCCTTGGGTGGGCT GCCTGAGAACTGTTCATTTTATTTTAGGCCATGTTACAAACTGGTTATCCTTCATATAATGACT3CTAAGGTCAG AATCTAA(N)xTTTATGAAGACCTTGAACTGGTAAAGGGAAAGAAGCCACATCATTTTCTGCCTTCAGTCTTCCC ATTTCTCTTTCAGATCAGTATATGTACTGGTATAATCCCTTAAAATTTACACTTTCACAAAGTACTTTCAAATGT GTTACTGCTACAGTGGGGCAAATAGAAAAATGATTGAAATAAAAAAAAGTGGCTATTGGGCAAGGCAGACATAAG AAATAGAATGTTCTGTTGAGTGCTGTGTTGAAAATTAAGTTGTAAGCTGGGCATGGTGAGTCATA(M)xCAGGAA AGATGTGTTAGATTGGAGAGTTGGAAGTGACAAGTAAGGCTCAGAAAACCATGTTAATTCAGGTTGTGAATGGCC TTGTATGCTAAAGTAAAATACTCAGACCTTACCTACTAGGCATGGGAGACTTGAGAGAAAGTTTTAAGCAAGTTG CTTTTATGGGGGGGTTTTCTGGTGGTGGGGGGTTAGGCAGGGGTTAAATAACTTGGCAATTAGGACAGATTTAGA AGACTTTCTGAACAAACCACACAGGAAAGCGTCTACATTAAAATGAAGTCCTGGTAAATAGGAAGGGCCTGAGTA ACAGCATTTCTACAGTGTATGTGACTAACTCACAGGGAGTGAAGGAATCAAGGGTGATTCACAGAGATAGCTGTG AATTTCATGTCTAGCTTGCACTTGAAATATTTGAGCCTTGGATACCTTTAGCAATGAACTTCAGCAGCTGTATAG GCCTTTTATGGCCCTTCACCTTCTTGCGGAA(N)xTCTCTAGTCTGGAGAAGGGATGAGATGTCTGTGAAGTTGC AAGTTGTTATTCCGCAGCCCAGGCTTCCATTGCCTAGTTTTTCCTTTGACTCCTTTGTCATTCCTGCCTGACTCT ATTCTGTGCTTTACTTGTGGCAAATCGCATCATGGGATTAGTCTGCCTAGTAGGCCCTGAGGGCTATAAAATGTA ATAATGTAACAACAGCATGTCCTGCCTTAGGAGAATGAGTCTTTTCTGGAAAGAAAGTCAGTAGT(N)xGTGAGT AGTGTTAGTCTGCTTTAGACATACTACAAAGTGGGTGAGGTACCTTGTAAAATCCACTTCAGGATGGAAAACAAA AGAATGTTTTAACATCTGTAACTAGAAAGGTAACCCCACTCATCGAGTCTGAAGAGTTTAATCCAGAAAGTGATT TCCTGTTGCATATACCATCTTTCTAGAGCTGACAGTGTCTGGAATGGAAAGCTGTGTGTTTCAAACTTAGGTTTG CTGTCTCCAGTGTCAAGACTTGCATGGGATTCCTTAGGATTACCTCTGCCCTTTCCCAATTTAGCTCCCTCAAGA CTCAGCTGTTCTCCCAGTTCTTGAGGCCAGGGGAGTCTTAGTTATTTTCAGCTATTAAAATGTCCAGAACTGGAG TATTGCCTGGAACCTGGTTCAGGAGTGACCAGCCTGAGTTAGTAGTCCATCTCCTTGTTTGGGTGTATATTAAAG TCACAGAAGAGACTGGAAATCTTGTCATCTCGTGGCCACTACTGACTTTCTTTCCATCCTGGGTGGAGTTACATA
( N) xCAATACATATAGTGAAAACATAATTCGTGGGGTTTTTTTTTCCCTGAGTTTTGAATAGTGCTGAAGTATAT AAAATAGAAAGTGTCTCTGTACCCTTACTTCCAAGTCTTTCCTGAGGTGATGGTCAGTGCATCTTCTTCTGGACA TTTGTCTATGCCCTTAAATTCTTTTCTAACCTGCAACCTGAGATTTTTCTAACAAAGAGATACCTGTGCACCTGA ACATGCCCTTTAAGATAATCCTT(N)xGATAATCCTTAACAAAGGGGCTCATATCCTTAGAGAAGAATCCATTAG TGATCCTATTTCTGCCAAATTGTTGTAAATTTTGGTCCTACCCATTTGTATCTGTTTCTGACACATGGTTATTCT TTTCCtN)xTGCTGGGATTACAGGCATGAGCCATCATGCCCGGCCACCTGGTTATTCTTGAGAACAGGTTATATA CCTGTGCTGAATACACATCTTCAAGACTTCACATATATCAACCATATCCTTACTGCCCTTGAACATTTAGACCTA CAGAATAAGGAATGGAGAGTAAGTCAAGTGTTCTGACAGCCCCTTCACAGGATTTATCCACATCCCTGCAGCAGT GGACAC { N) xAGCATTTAACTGATAGGTTGTTGTGAGGATAGCTGAGATGTCTATAAGCCACTTAGAGCAGAGTC CAGCACACCACAAGTGCTATGCAAGTGTTTGCTGTAATGAATGTTGATAAGGCTTCCCCTGAGTCACGCAGTGAA AGACTGCAGCACCTTCTTGTTTTTATTGTTTAACCTTAGAAACCTAAATGGAGGAAAATTCCCTCAAAGAAAAGT AAAAGTGCTTTCTTCTGTGCCTTCATTTTCCCCACAAAACCTGTACGTATTTGATTTTCTTAGATCATCTTTACT AGATTGATTAGTAGACACTTTTTTGTGGTGACCTGCATTCTCCAAATTTTGAGCTC ( N) xTAAAATTTTTTCTTT AAATAGAAGT TGGGAT CAAACTTGAGTTAAAAGAAAAAGCCATTTAGAGTATGCTGAGTTAG CAGGAGAG CAAC C TTAAAACT TG ATTC AATAGGTG CGAAGGGTTTATCTT CAAAACAAAGTCACTAACAGGTTTCAGTTAG CATGTC C TAAATATGGTGGTTTGGCATTG TAACAGTTA CATATCACAATTATAA CTTAGATAAGAAG GAAT TT CCTTAAAAA ATACAAC
> H s l l _ 12017838 8 -120 187 90 9
GCCCAGGCCCCTTGGCACCAGCCCCTCCGCACCTCACACCTGCTCTGGGACACGATCAGAAAACACATTATCCCA CT CCAGGCGC CCAGGTATTG GAGTGCAAAGCTG CCCTCCATCTCAGGTTCCAG GAACCTC CTCCTCCTCCACTGA CCCTTCTTGCCGTGACATCTATTTAGGGCCCTTTGTTTTTCTCACTGGGAGATTTAAAGAATGGGCTATAGGGTC CCAGAACA CAGTGTAT CTCCGTAT CC AGTTGAGGTAAAGGGAGCAAAGTCAGGGGATTGT ATGT AAGTTCTCTAT TTTCTATG CAGTA CATTTTGTA [ N) xCGCCCATCGCCAAATGCCATCTGACCTCATTTCCTTCTACTCCTTCTTT TCTATG CACATT TAGATACTTGTGAT CGTGTTATACACCTACGTTTTTTGGAACTTGTAG CAAAAG CATGTTTTT GAGC CAGTATGTGGAAGCTGTGAG CC CCGTGTGGGCTGGGGAAGAGAAGAGGGAGAAACAGATGTGGC CTAG CCA GCCTCTAGCCAGGCTCCCTCAGATCCCAGCCTCCAGAGCCGCCATGGGTACCACGCGTCTCCAAAGGCCCCAGGC CTCCAGGACTGCTGCCCCTCCAGCAGATTCCTCGCACATTTAGCTCAAAGATTTGGTGCCCATATCCAGCTAGCA GAATGACAGGAACTGGCTTTTCCTTCTTTTTAGGCTTTCTAGAGTGGCAAGGTTAAGGGAGGGTGTAGGAGTAGG GGAAAGTGGCCGGGGCTAGAGACCAGGTGTGAAGGATCAGAACAAGGGAGTGGCGCCCGTGTAGCATGTATTGCT ATGCCACAGTCCTGATGGGTCCACATGTTTTAGCTTTCTCATTTTCTGCTTTGAGACAGGAGGACAGGAAGAAAT GACAAGGCA(N)xAAAAAAAAGAAATGACAAGGGAGAAGGACATCAGGTAATAAGGAAGAACACAGAAGGGAGCA GCAGCTCTGGGGACTCTACCTGCCCCACATAACTCATCCATGGACTTCTCTCCTCTGAACTCATGAATGAAGGAA AGACAGCCTA( N) xACTCTAAAGCTTTTGATCCATGTATAGTTGTATCCTGTTTCTTTGGGACCTCTGTTCCCTG CCCCTGATTCCCTTCTCTTGGCCATACTCTGTCCTTTCGGTGCCTATAGAACCCAAAACCCAGCTCGGAGGAGAT CTCCATGATTGCAGAGCAGTTGTCCATGGAGAAGGAGGTGGTGAGGGTCTGGTTCTGCAACCGACGCCAAAAGGA GAAG CGAATCAACTGC CCTGTGGC CACACCCAT CAAACCACCTGTCTACAA CTCCCGGCTGGTGAGTGGC CAGGA ACCAAGCTGTCTGCCAAGCACTGGAGGGACATGATGCTTGGAGGGGAAGGTGGTGGCTGCAGGGAAGAGGGGAGC AT CCAGTAAATAAATAAG CAAAG(N) xTTGACACAGGCTGCTATGTGGGGAATAATCTGTGTGGAAGCAGGAGAC CAGCTAGGAGAATGACAATGGTCAGGCTGTGTGATGGAGGCTTGGCCCAGGGTGGTAGCCATGGAGGTG(N) X.AA GTTCTAGGGAGACAATGAAGTTTAGGGGGCAGGAAATACCTTCTCCATGCAGGGAGAAATGGTATCTGGAGGGTG GGTGTGGGAAGGTTCTTCTCTGGGGATGGTGATGGTTGAAGGTTAAGGGAGTTGGGGAGAATGATGTTGCCCTTC AGTCTTTCAG CACTGTAAGATGGG TGAAATGCAGCCTTCTCAGGGTGGGACACAGCTGTT TC CCAAAAACTA CAC CATTGT{ N} xGAAACTGAGAAGGAGGGACCAGAAAAGACAAGAAACTACAACCACGTAGATGTGCGGAGTGTTGG GCAGGAATGAGCCCCATAGGAGTGCCATGGGAACCAAGAGGAAAGACACCCAAATCAGAATGGGTCCCAGGAGGG TGAATATCTAATTTGTTTGGAGAACAGATTAGTAATTAGCAAGAGTAGTGTACAAGGCAG( N) xTGAGTGTGGGA TCATAATGGGCCATGGATGCTGAAGTACCCTAAACTCTCTAAGGGATCATTGAACATGTTAAGTTGGAGGCAACA TGATATTCATTCA(N) xGAGGAGGCTGGAATGGTAGGCAGGGGCTGGATTATGTTGGGAATCATAGGCCTTGTTA AGAAACTTGGGTCATGTCATAGGAACTTTGGAATCTTTAAAGATTGGGCTTTGTTTAGAGCAGATGGGAAGTAGT AGATTCTCCCTTTGGTTCAGTGCCAACCCAGGACATCAACATGGTGAGAAATGGAGGGGGCCTAGAGGAGCCATG CAGGGG CC CCATTC CC CAT CAG C T ( N) xTTTTCTGATCCTATCTTTGGGTCTCATTGAGACAGTCCCTTGGGACT AGGACATAGTTTCTTCTGCTTCTGGAATAAAATGATTAGCAGGTAGAAAGGCAAAATGGTGCAGATCAGTGTGGG CAGAATGCTACAATATATCTAGAAAATAAAAAAGGGGAACCACGTATATGTCTGTTCGCATATGCCTAGCCTGTC TCCAGAAGGAAACAAGCTGGCCGCACTGACCAAATACATTATCACTTATTTAAAAACAACAACAA (N)xCATGTC TTGGGTGTGACAGCAGGGAGGGAGGAGAAGAGGTAGGGAGATTATTGATAACATCTCTTACCACTCTGCCACCCC TC CCAACCTGTG CCCGTCTC CACC CTTCACAGGCCCCCAGCCTACTGAGGAGGCT CATTGTATAAGGAAAAGTAA ATG AAAAC AG GAAAAC CATTTCTGATTAATGACG AAAT AAGT ACATGCTGCATCCTGTTAGGGT GGGATCTT CC A AACTGGGGAGAGCAGAACTACCTTTCAATAGCCTTTGCGAACACCTGGTCAGAGCTTCCCCTGTTAGCATGAGAA GAATAAGGGGCAGTGAAAATACACAGATTTTAGCTCAAAAGAAGAGTCGAATAACATCTAGAAATATACAGCAGT TGACAG CTGCTT CTTC CAGAAAGGGAG(N) xGGTGGAGCTGTCTTACTCCCAGACCTGTTGGGCTCCAAAGTCTA TT CTTT CTATTACATCTTGTTCTACATTGTTCTAGGATAGTAACAGTGAGGGGGAGGTTC CTGAATTG CATGGGA GGGTAAACAAAGTTATCCCAGGTCCCTTTCTAATCTCAGAGCTGATGATGCTATGAGGTCTCCAGGGCAAAATAA AAAG CT TCAGAGAC CAAAGTT CAGTG CCTCCAGTCACCTCTGGAGAGTCCATGCTCCATT CATC CATCACAG TGG GAAAGCCTGCGAGAGGCTGCTAGGCTGAGCAACCCTGCTGTCCTCAGCAAAAGCAGGGTTCTTTTAGGGAAAAAG CAATGGAGAC CAGAAAATGACAGG CAACCAGCAGTATCTG( N)xGCAGGGCCCAAGAGGCTTGTCTTGCTCACCT CCAGGGCTCTCAGGCAAGGCAGCCTTCGGCTTGGCAGTGTCAAAGAAACCTTATCAGGCAATACACATTGTCAGG GGCTGGAGGGCCCTGGTGTTGACACCAGGGAACTTCCGGGGCAGACTGGACCTGGTTCTGGATGCCAGTATGTGG TC CAGAGG CCTT CC CATAGAAAGGAGGATGGCATGTAGGCTAG GAAATCAC CAAGGAAGC CAAG CCCTGGTCTCT GTAG AGTCTG GGGC CTACTGTCTTTG GAGAGTGATCTG AAGTTGGAGAAGGGAGCAGAAATGGACATTTACAAG C TTCTGTTGTTTTTCAGGTATCTCCCTCAGGGTCTCTGGGCCCCCTCTCTGTCCCTCCTGTCCACAGTACCATGCC TGGAACAGGTGAGCTTTAAAATGAGATTTCTGAAGAACACCAGAATTCTTTTAAAAAATGGGATATTAGAACTGA AAGGTATCTACCTAAGTCAGGCTTCCCTAAATGCA(N) xAGCGAATTGTTTTGTTTTGTTTTGCTTTAACTGCAG CACTTCTAAG CACTTAGGTG CATAAAATTCCAAAATGAACTGTGTGGAAGGCCCAGGGGñGG CAAGGCTT CC CAA AGTAATACTCTAGT CCT CAAAAAG CACC TAATTATTTCAACG CACTAGAC( N)xCAGGCGTGAGCCACCGCACTC AGCCCCTTCCACTTTTCAAAGTGTCTTAGGGTCTTAGTTTGGATGGTAAAATATATGTCACCCTATCCATAGAAC A CAGTTTG GGAAACACAAA C A C A ( N ) xAGTTTCTAGAAAACCAAGCACCTTTTCTTTGTGGAGACTGGAGAACTA GAGCAGCTGGCAATCTTTCATCGTGCCCTCAAGTTCTCTTTCTCCCCTATGCACTTGCCTTTCTTTCCTCCTCTC TTCCTTCTTGTTCTTTTT CTC CTACCGC CTCTTGGTCTGGGGCTGCATGTGCC CAGC CTAGGGTGTGT CCTAGAG TAGGATGATTTTGGCATCTGTCCTTGGATTTGGCAAGTCTTAGACTTGCAGAACCAGCCACACCACTCTCCAGGG CCTGGG CTGCAGAGAG CATTTGTGACGACAGC CC CT CTGGGTGTGGGCAGAAGT TGATGCTAATGAGG CAGGAGG TAGTCAGGTGGGTGTGGCTAGGGAGAGGGTGAGGTCCCTTTGGGGATACACCAGGAAATTTGTGTCTGAGGGGCT ATGTCCCGATGTCCCGGGATCCAG CAGCATTATT TC CC
> H s l2 _ l04286879-104296769
TGGTTGATTCTATATTCTTTCAATTCATTGGCTAGGATTGTGAGTTTTAAAGGACTCAGGTCTGGTTTTTTCCTC AACGCCTGTGTG CT CTGTTCGAAGGGGT CCAT C'ATGGGAATGGAGAGATG CTTT CTAAAGATGTTTAATTTTGTT TCTTCATCTCCTGTGC TAATGAGTTC CACTGTCCTCACATCCGG GCGGGCACACAACGACTTGTAGGACAGG CTG GAAGAGACAGTG CT CATGATAAAT TTGCAATGAGGG GAGAGTGAGTGTGG CAGC CAGGAAAAAT CT TT CACC TGA TGGAGAGAAT CAAAG ACATTTGAAAACATTAAAGACTTTAAAAT TACT TTGTTGATGGTTTTTCAACTTAGAGTA GGTACAATATGGGGAGTAAGTAGAGAATAAGGGTTGGATTGAAGGTTATTAATAGTAACTTGAATTGTAGGCATT ATGGTGCTAAGAAAATATCAGGTCACACTCAATAGAACCGTATTTGACTCATGTGTCAGTTATGAATTTGATTGT GCAAAT GCAAACTCTAAAAAGAGGAGGT TC CTAGGAGCAAGTAATTTAATAAAAT CACGTAATT CTGAAGAATGA CCTACAG GAATAAC CAGAG GAAAC TG GAGAGTGCAATTG GAACCATTCAGTTGAAAAAGAGGATGTTATTTTATG CTGAGAAGTTGT CACTTTGTGCAG CAATACTATCAT GAGTTAAAT CAT TTTATTTTGGAACAAAATAAATACTTA ATAAGAT CAAT CTTTT CT CAATTC CTTTAATAGC CTTTCCCACAATAC CT CTTT GCCTAATAA CAC CAAC CATCA CATACTCACTCTGAGCCAGGCACTGTGTTGAATGTGTCTGTAGGTACTAACTGTATTAATTATGGTTTTTAATTT CTGCAT CTTAGGATGATTTAA CATTCCTCAGGACCT TACAGACC CAGC CCTT CC CAAGGC{ N ) xTGAACTCATTC TATCCT CATGACAACACT CTAGAAAGG(N)xCTTCAGGTAACTGTCTGGCCATCCTTCCTCCTTCTAAAGAGCTC TGCCTCAGGC CATT CTTCTAGAAAG C CATC CCTT CATGAGAG GC TGTCACAATACTACATAC CAAT CTGTAG C (N ) xTGAGAAACAGATCTTATTCACATAACTGATTGATTACATAGAGTTTTAAGTTTTAAAGAAGCTGCTTCCCAAA GTAATAAC TAGTGC CAAGTATTAGTAGAGGTT CCACTGATCCAATATTAT TT CAAACAG GAGAAAAACTGAAGAT GACAACTCAT GAATGT GATTTTGTATGTGTAAAA CCAGTGAAGATGAGGTAGG C CAAACTATAAAA CACAAAAAA CAGTAAGACTGATCAAAGTATAAAATAG <N)xATTATAGAAGTATTTGTTCAATAAAATATAGAGCAG í N ) xT T A ATTAATTAAAATATAAAG CAGTAACTAAAA GCAGTGTTG GCTAAAACCAGAGATGTGTGATACAATGGAAAG GTG TTATATATAATACATC CTAAATAATGAAGTTT T T A T T C T T T T T A { N ) xGTTGTGATAAAATAGGGGTTATTTTAA AAAGA CATTA CC TT CT GACCTGAAAT CC CATAAATG CCAATCAGTT CTTCAATT CCATCTAGTACAAGTATG CAT GGCTTTAAACTGATGGAAGCAATGAATACTTC CACAAGAAATGAGAAGAC CAAG CCATCTGAGT CTTCGTTGAGA ATATCTGTTT CAAG CTGAGTACCTACAG CATAGAAATTTGTT CAAATTAGGTAGAAATGATT CCGT CATTGCTAA AACATGTT TATG GG CTGGGGAAG CAGAAGAG GTTGG GAAAAATGAGTG CTAC TCACAGCTGTAC GT CTAACACAC ATATGTATACATGTGATTACCATCTTCAAAATATTCAGCCCAGAAGAGATAATAGCGATGTTTACCACCAGATTG TCTCCAATACATTTTCAGGATTTGATATAAAAAAATTTTTGTGTGATCCAAAACATTACTCTCATCTTCCAGAAT GCAAAAACAGGT CATGTTA CAAAGAGTTGGATACGAAGAGAAG CCCTGTGATCTGTGAAGTTTCTGACACTCCTT AACATTGGGTGCGTGCTTAGGGTCACAGCCACCACCCTGGGCTCTTCCAGTAATGCTCAAATATAGAAGCTCCCA CCCACTGT CT CGTTTC CTTGG CC C TAGC CCAT CTT CAACTTAACAGAAATAAT C CTAAAC TCTAGC CACACTATT GGGCCCTAGACTTTGGATTCTGGAAAGTATACATTCTTAGAAAGAACTAAGCTATGAACCCCCAAAAGGCAAAGG AAGTCATTAGAGATGATTGACTTTCAGACTGTGATAAAGTGACCTAGCAGCAACCCCAAGTTGTCATCAACTACC TAATAC CCTAAGAACATTTTACTG CAAATT CCGTCTTGCTTTTATT TTTCACAAGCAATA TT TT CCACAGAAATT TAAGCACATT GAGGTCTG GAAAATACAGTCACATGTAGAACCTT TAAT GGATTAG GTCTCTCAGATGCAT TAAGA TTTTGTTAGAGTCGTTGAGAGTAATTATG(N)xAGGAGATAAAATGATTCATATATGAAGTCAACCACCTAAAGG AAGTGGAAATTG GTTC TGTATACAAATAGC CAAATTTCAAGG CAGAATATGAGT CTGGAGGCTG CTGAAG CTTGA TGTAGAAC CATCACAATAAAGAAC CATCACAATAAAGAAAAG CT CTGT GTGAAAGACACT T CAAATAAGC CTTTA AGGGTACATAGG CAGAGA CGAGGAGAAGGG CATATCATCAAGACAG CACAGC CAAAGGCTGGGACATAGGAAGGT ACCATGAACA GTTGAAAAACAGGACATTGCTTAGAG CAGTAGTT CTTAACAG GAGAGGTT( N ) xAAAGAGATTTT TATAACACAAAG CGAT CATAAAAT CGATTACTTGAAACTTTTTTTATGAATGTGG GACATTAAGAAAGTGAAGGT GTCAGAAAAGATAGAGATTGAGCTTATGTCAAACAATGAAAAATGATTTTCAAGAAAGGGAAGAGAAAGAAATTC AGCTGATGATTCAT CC CTTGAGTTTTTATTTAAAAACCTTTAAAATTATG GG CATCGTTGAGATTT CAGAAC TGA A A C G ( N)xñTGTGAACACACAGATCATCAGGAGCTTTTAAAATACTATTTTTTCTAATGATTTGTTTTCAAGCCA TGTGTGGC CC CCATTT C T T T T T T T T T { K ) xCGTGAGCCACCGCGCCCGGCCCCCCATTTCTTAGTGTAGATGGCT CACAAG CAGC CC CTGT CAGAGCCCACAGAGGTGTGGATCCTCTG CCTCTTCCACACGGTTGG CATAGAG TGTAAA TGA CAAGG TGAATGTGAATGAAC C TCAG GAAGTATATACTGC CTGACAGT CACAGAGCATAGAG CACTAATATAA TCAGCAAAAATTAATTCTAAGCAAAAAAAAAAATACTCATGGCTGATGCTTATAGATATATATCTTTATATATGA AGGAAAATATATATGTTGAGTGGTTTTAAAATCACATTTTTGGTGACTATAATTATTTTTTCTGAAAATAATGTT
g a c a a t a a a g t c a g t g t t a c t a g t a t t g g a a a a t g a t t c t a a a c a t t t c t c c t t a t t t t g g c a a g a a c a a t c t g t c t c t a c t a a a t g a c a a t g g c c a a t t a c c t c a t t t a a a t g g t a a t a g c t c a c g t g t t t a a a c a g c c a c t t t t t a g c AGTGTTATCTCTCA( N ) xCAGGGTGAAAAGCTTGGCAAGCTCTGCCAAGGATTTCTGTCCñTTCCACATCACCAA CTTGAGCAGAGACTTTCCCGATGCTAGGAGAGGAGACTGATAAAAAGCATCCACAGGAGGGAAATTTTCCTTTGC TGGCTCAAG(N)xGGATGGCAGTGTAGTGATGAGGACTAGGGGCTTAGATCCCAACAGAGACCTGAGTTTGCACT CCCTGCTATGCCCTTGCTTAGCTGTGTGGCTCTGACATTCTGAGCCCCATTCTTCCTA( N) xCAAACCTGCCCCC AGATTCCATGTGGTCACATGGTAAGGAAACATGCAAATATGAAAATCAGTGACCAT( N)xGTCAGTGACTACTCT GACCTTCCGTGAAGCATGTCTTCTGTGACACCAGTTGCCTGCAAGGCAGGCAAAGGGTGTGGGAGTCGGTAGAGC TGGCTACAGGCAGGTC CCAT CTACAAGATC CCTTTATC CTGACAAG CAGGGATTGAGGACCGAAGAGTAGAAACA TGAGTGTCACTGGGTGTGATTAGGTGCCCCAGATAGAAAGGGTCCCTGGCTCACAGCCCACGCAGGATCTGCAGC TTCAGACAGCTGAGGTGGGGTGCAGAAAAATGCTCTCTGCAAAGAGGCTGAATCTGCTCTGAGCACCTGGCCTCT CAGGCTGGCTGTGTTCACAGCAGCAGCTCTTCCTCCTCTGCCCACATCCACCTCACCCAGGTGAAACCAGGTCTC AGCAAAGGCAGCCACAATAAATGACT CTGGGG CCAG CCTT CTCCGCTCAG CTGCAGGCTTTCTGGGCTGCAGAGA
t g a c a g c t g c a a a c t c t t g t t t t t g c a g t c c t t c c t c t c c c t a c t o a g c t a a t g a a c a a g a a t g t c t t g t t t c c c CTAGCAGCCCCTACACCCTGCTTCTTTCTCTAGCTCAGAAAGG (N) xCGCAGCTGCGTGCCATGTACTACTTTAA ATTCCTTACTTGTATTTTGTATTATGTTTTTTGGGTTTTTTTAACGCAGCTTTTTTAAATTT( N) xGTGGGCCAA g t c c a g g a c c a g c c c t t c c t c t t g t t a a t g a g g a g g g g t g t t c t c t c a g g g t t a g a a a g t g t c a a g t a t t c g t a g GCTGTTCCTGAACTTATTCTACTGTTGTCAGCTTAAAAATCTGGTTTTTTTTGAAAGACAGCAGCTGTTTTTATT AGGAGTTCTTTGCATCCAAACACACGTTCATTAAAATAACAATTCACTTTGGGAATAAAATCGTAATTGAGTGTG TAGCACGGAGTTTACATCATCAGCCTGGCCAGACTGAACACCTTTGGGATGTGCCAGAATAGGATTAGTTGCTGT GTAGACACAGTCACACCTGCTTCTACTGATTTTGACTTTATCCATTTTATATGCCCTGAGAAATGCAACCATAAA GACCTCATCCACACGTGTTCTTCCCTCTGTTTACTCACAGACACCCACCACATGCACAGAAAGAAGCTCAAATCC CTGTACCCTTTTATAGAGATAAGATG CACC CATC AAAAT
> H sl4 1 06 780 49 9 -1067 86 136
CC CCTCACTGTGTTTCTCGCACAGTAATA CACGGCCGTGT CCACGG CGGT CACAGAGCTCAG CTTCAGG GAGAAC TGGTTCTTGGACGTGTCTAC TGACATGGTGACTCGACT CTTGAGGGACGGGT TGTAGTAGGTGCTC CCAC T AT AA TAGATGTACC CAAT C CACTC CAGT CCCTTCCCTGGGGGCTGCCG GATC CAGC CC CACCAGTTA CTACTGCTGAT G GAGTAACCAGAGACAG CG CAGGTGAG GGACAGGGTGTC CGAAG G CTTCAC CAGT CCTGGGCCCGACTC CTGCAG C TG CACC TGGGACAGGACCACTGTGAACAGAGAGACC CACAGT GAGC CCTGGGCT CAGAG GCACCTCCCATATCTC CATGTCTG CAGC CTTGAGACACT CACAT CTGG GAGCTG CCAC CAGCAG GAG GAAGAAC CACAGGTGTTTCATGTT CTTG CACAGGAGGT C CAGGACT CT CAGAAAGTATTT CC CATGTGAG CAGGAC CCTGAATT TAAGGAAATGTGTGA TGGTTTCCCTTGGGTG CCTAAGTGAGATTTGCATGTGGGTGGTG CCTCTGTATGGAGA GG TGAAAAGG GATGAG G GAGG CCCCAGTCTTTTAGGCTCTC CCTGGAAGGAGGATGC TGGTTGTGCCCT CTGAGAATTCAGTTAT CTTC CTG GGGC CT CAACTCACTA TGTCCTGGCTCCTCTTTTCCCAGG TGAGGAAA CAGATTGCAACAGCAGCTTAATGTAAC AATCAT GTGAGTT CAGACACAC CAGGATT CACTTAACGTTATTTGTAGTT CAGAAC CT CTAT CAGGTTTAGAGGG AATCGCTC TGTG C CAGGGAGTGGGTCTTAAATAG CAAAACGG CC TCAGAAAACC CAACATAATCTACAGCGAGAC CT CAGCATGG CAAG CAAGGAAT CC CTAAAG CCAC CAG GGAGCTC CGGATG CACTGATACG GCCCAGACACATGGC GAGT CCAG GAAC TGATGGGGACTTTGGGGGAGCCTC TTTTATTATTTTTTAG GATT CTGTGGTTGAAG GT CACAA CACTGGGCCTGACTGCTTC CTGAAC CAAGC CCAG CACAATATG GTT CACC CCAGTGACATTTTCAGATGTTTCTT CC TGTAAT GAAAGC GCG GTGTGATGTGTATGCAC CTATGTGTTTAC CTAATGAATGTAAAGAGAAG CACATTTCA TGCAGCTG TATT TT CATAAATGT CAGTCATTCAT CATGTTAGTGTCTATT TTTC CATAAATCTGTACAGAACAAA TTATTCAT TCATTGATTTGTAATAGT CTTATTGAGACATTATTACTATA CATTAAATATTGCAAAAAGTGTG CG G TTTAATAAGTAC TCAA{N) xATGTTATTCCACTCGTGCTCACCTGTGCATCCCCAAAAGCCATCCAAATAATGAA TAAACAAATTA CA CTTAAAAGTTT CCTTGTGTTCCT CTACAATT CCTCCTTCC CAATTATTTT CTT C CTC CAC CA TATT CT GAGT CAGT CCTT CACATTTAATA C C T(N) xAGTATGTGTTTTATTTGTGTGGCTTCTCGCACATAACTA CTTATGATGCACACATGTTGAG CATCAACAATGTAT TGAT TCTAATGATGGATGTTATTACCATAAAT TAGCAT G CCATTACTGTTTAT CTATGTATCTT CTTGGTATT TGTACCATTT CTAGTTGCTTAGTATTA CACAGAGGTGCTTC TCAGCTTG GAGATGTAAG CACC CCATAAAAATATGTTACTATTC TTATAACAAATACTAAA CTTACTGATCTGCA AATTAG CATATATA CAT CAAAT TTTT GATGTTATAGGACAAACAATATAT CTGAAATCTGAA CAGACATAAAGAC TTGCAGG GAAATAAACAGGAGCAGATGATAAT CTTTTCTGGGACAGAGGCTG CCAAATGT CATTTAAGTTAG CA C A CGATTAAAGTAGACATATT CATTGGGTGGTTT CAATTTGAGTGTGATAGAGAAGTTATTGTTTAAATTCT CAGA GTGTATGCAGTTGA GGAATT CCTC CTG CTATTGAAG CCTTTTTCTT CAGTACTGG G GATACATCA CAAAATGCT C CAG CCTCTACCCCTTGGGATGGTGCTGTCT GGGAAAGCAAAACAGCAACTACAG CTGAAATG CATC CAGACACAC CTCCCCAT CACCACTA CATTGCAAGAGAAATTAT CTGCAGAGGTAAAG C CAT CAAAAC CACTGTTCTACAGA CAC TG GAGAAACCAATAAGAACTGG GAGGGGAGAGAG GAAT GCAC CAAGTC CCTGTC CAGG CC CACCTC CCATCTTCC CT CAGGAGTAACAG CC TTAT TCAAAAGGAAAAGG CAGAAACTGAAGAAAT CAGT GGGAAGA CATAGTGGCTGCTG AAGGAATATATGAATAAAAACGAAGA{N) xCTAAAAAAAAAAAAAAAAAGAATGTAAAGAACCGGCCAAGTGTAG ATGAAGGG CC CTGGAAA C CTAG CTACTTGC TAGT CAG GA(N)xATCTCTTGGCATTGGCATGGTGTCGGCTACAA TG GATG CTGGGAGC TTGGTGTCACGGCTCCTT CCAAAGGACATG CCTGCTCC CTGAGTTAAC TCCCAGACGCAGT TGGACAGG CCTCCTGGG GTCTGAGAAGC TC CTTT CATGTACTGAAATC CTGT CATTATGT TTTTGTATTCTAGTG TCT C CCTAAAAG TACAGTGAGACC CAGGGTCCATTCATGTGTGTATT CAGGACT CT CTGATTTTTATGTATTTTA
t t c a t c t c t c t c t a c t a c c t t t t c t a c c a a a c t a g a c a t t t a a a a a a t t g c a a t a t ( n ) x c t t a g c t t a g a g a a g TGA CAAG C TGAGAAAACATGGTT CTTATTTTTA CATAACATGAATAG CAGGCAAAG CAGAAAAGCTG CA CAC TAA T CAATTTG CTTCAATACATCACATAATTAAAG TTGGGAAGCTCTGTGTGTGTGT CAGTTCACGTGTTTTT GTGTG A CAGAAGAGAGAAGGCAGGAGGAGAGAC CATG CCAAAAGGAGAC CC TACC TGTTTT GACCATAATGTGTGAGGTA CT CAAGTAAATACAGGGACTTAGT GCTT GATGGACAAGGT CCTC CT CAGATG GAGAAGA CAACTGGAT GCAC CT C CATATGGG TACATATTAGTATTTACATAAATG CCATTTTC TAAT CATAT CAACACT CCGACACATTATAAGAGAT GTATTGGAGGGTGTCTGGTGGTGAAATATTATGGTGAGAACAACCCACATCTACAGCCCCTTTTCTGCCCTGTTG CACTTG CC CTGATG CGAACT TGATCCTGCT CATC CT GACC CCTAA CAAT CAT CCTAAG CC CC CATACT GC CC CGA
a t g c c c c c t g c t g c t c c t a t t c a c c c c t g c a g g g a g g g t t g t g t c t a g g c t c a c a a t g a a g g c c c t t c a t t g c g t CTTTTG CTTAAAAATG CGTAGTTGTGTGTTCACTGGGCACAGAGCT CAGCTGTAAGAACTGT TTCTTGGATCTGG ATAT GGACTCTTGAGCAGTG GGTTGTAATT TGTGCTCCCT TCACAACC CATG CACCTGAT CCACTCCTGTCCATC TTCTAGGGGCAAGCAGATACAATTCTAGCAGGAAACACTGGTTATGATGGGGAATCCAGAGACAGTGCAGGTGAG GAGAGGGT CT GCAAGGAGT CTCAAGC CAGAAGTGTG CTGAGAAACATAGT TGTTGATG TTAACAGGTTCTGGGCA A CACAGTGAAATTC CCAAAACCACACATTTTTATGAGAATAAAGAG CT CACTTTGT C CAATTTG TGAGTCTCCTA GAAC AATT CAGTAGAT TT CGAGGTTAGGTTAAAAAGTATTAT CACATGTT CCTTTCCT CAAACT TG CAAC CAAAT
> H s l5 71935511 -71943475
AT CATCTGGTCCGGTGGTCCTññCAC'TTTTTCATGAAAATGATCTTGTTTCCAAAGAAAAGAGATATAAGTAATC CCAACACATAAAACAAATTTTTTAAAAC CACTAATGTGTT TTGTTAGCATAAATTAATTT CTTTTTTGAGACAAT
(N) xAGTATAAATTAATTTCTAATTAAAACATTAACAT CTAACTCTTATACAGGCATTTAGTGAACATGTGGCTA TT TTGT CCATTAGAAAATTTAG CACAAGTAATATGGAAAATGAT TTGGATGC CTTAGCGT TTACTT TATTTCTGA AT TGTGTTTCAATTTACAA CCTTTGAAGAGTTCT TAATTAA CATAG GCAGGAAATTACAGACAGTCT CTTTGGTT GAATAAAAATGGGAGCAGTTTGTT CCAAAG GCTG CATAAAAATGAGTTAATG CAGAGATGTCTGGTGTGTTACAC GTAGGCGTATACTGTGGTTGAG TATATATGATGTTTAT CACAGTTCTT CACATTTTGAAAATATAT CTTAAAGAT TATGAAAATACTGAAG TAGAGC CTTGGAATAG CTTGAAAG C CACTGAC CTGT CT CATTTAGCAGACAT GATTTAT AGGTTAAATTAG CCCCAGTGACTGTC CAAG GT CT CAACAGAAAACAGTAGGACT CAGG CTAACACCAGG GTCTAA CTGATTGT{N) xTATGTTTAGTGGGAATACTTGCCAATAAGCCTGATTTTCTATGATAAGCCTCTTCTTCCTTCA AT TGGTTAGG CCATTTTGTTCTTC TCTATACTACAAAT CTGTTACGAAATGAAATTTCAGAG CTATATTTTCTGA AAAGGATAAATGTC CTTTGCTC CAAATT CATTTTATTTTTATTC CTATCCACTT CAGTTTGTATAAAATACAGAA TT CTAATCTT CACCAG CAAGAATTTGTG GTTTTT CCTAATA CAAACTG GTAACTGTAAGC TC CAGATTTACAAAA
a a a a a t t c a a a t t t g a c t t t c a c g g c c a g c t a g g t c a a t a c t c t a t g c c a c c t t t g t a t t c t g t t c c t c t g a c a t TTTAGCTTTTATTTTT TAAAAACTGGTT CAATATGTTG CTGACT TTGA{N) xATGCCTTGGTTAGTTCATGGAAT CCTTAGGCGGCAATAACGGTTCAAAATGAACATTATACTGCAGCTCCTAGTAAGTTATGGATTTATAGCCTGCCC
c t g c t c a a g a g c t c c a g a a g t t c c t t c a g c t t a a a g g t a a t a g g c t g t t t t t c t a t c a t t a t t t a a a a a c a c a a g TG GAACAG TGATTTCCTTGCAGCCCT CAAT CA TATACAGAATACAGTCTCAAAG CTGGTACCTCAGGG CT CAAAA TGGGGAGCCCCAGGGTCACTAGGGTGACTCAAACCCAGAGGAAAGATGGTCTTGGGG(N)xTGAGGAAATCCTGG CTTGTCAT CGAAAT CATACCATGGTTTTAGATTC CCTTTC TGATAATCGAAGAACATT CTGGAGGC CTGAGG CAG AGTGAG CCAC CAGAAGAAATAGGAGCTGTC CAAAATAATTTAAAGT CC CAGG CCCTTGGGTTTGCTTTG GGGAAA AAAG CTTT TATCACAG CCTT TTGGTG CAACAGAAAACAAAATGTTTG GGAGAG GTTTTAGTACCAG CACTTCAGG CAGTGCATGTAG GCAATG GATAAGAGAGAACT CAAAGCTT CTGGGAAATTGGAGGTGT CC CATGTTTATAAAGGG GAAATT CAAGATGCAATTAAGACGACTC CTTT CCAAGAAATACCAGGACAAAGCAAGTAC TC CAAAGC CAC C CAG TGCCACTTTCCCTCATCCCTCCACCTGT CCAACACGTG CACCTT TCAGTGGGTGCTCTGCACTTGCTGGCATCTG
c t a t t c a c t a c c a t a t t c c c t g t t t c a a t c a c t c c a t t a a t c t c c t t g c c t t t g t g g c t t c c a c c t t a a a a c t c t GCTTTTGAGTTTTTGAACTAGATAGATACGAG CACAGAT C CTAG CACTTGGCTGTG(N) xATACATTCATCCTAT c a g a t a a g c a a c a a a t a c c a g t t t c t t t t c t t t a t a t t g c t t t t c t t g g t g c t c a g g a g c a c c t c t g c a c a c c t c TCTGAATAGAGGGAGCA CACTTTT CTTATG GAA C CTAAAGTCAC CAGAGT CCATAAGTTGT CAAAATGGAAT CCT TGATTCTTTTTGACTTTGGCTC TTTCTCTCTTTTAAC CAGAGTTTGTT CT CTGC TGGAGAAATGCTTTTCAT TTT ACTTCCATATTTGCTGCCGTTT CACATT TTTAAAAGTATATT CAGG CCAT TCTCTAGCTAAATAACATTC CAAT C AG CTACTGTTTAAG CT GG GAAT TGATAT CATT TATTTGTGTTTC TTTGAATTTG CC CT CTATGTTATCTTTAACT TGACTT TAAATAAAACATAATATG CATTGAGTTC CTTC CATGTGTCAGATGCTT TTTACAAA CATAAACTGG CTT CATCAT CATAAAA C CAA C CCAGTAAATTAGGAGATATAATTAATAAATAGTGTCACTAAT TATTA CGTATTTTCA CTTTATGACAGCAAAGTATTTTTG CTGT CAAAGCATAAGGACACTGGAT CATTGT CAAAATC CCTACCTTCACTC AGTTGTCCACAC CTGGGTTAAGGC TTATTCTTGAG CGGCTGCAGCCCTGGGCACTCCCTT CCACCTGGAATCAGA GAAT CACATATAGG CATAATGC CTAG GAGGAAGTAATCAGCT CGTTTAAGTGATG GCTGACATCAGGAGG CC TTT TGAG CCAGAAGAATGA CATGATGATAGCTGTGTCTTTGGAAGATAAATGT CACAAC CACGTG CAGTATAGATTAC CTGAAACCACAACATTTAGC CTTTATGCTTACACAT CTGGGGGG CATGATTTAAAAG CTCATAAATACAATC TGG ATTTTAAAGAAG GGAC TGGTGAAAATTAAGAATTTCAAGGTT CTAC CAAAGAAGGTGAAG GATCGGTGTTTTTC C AACTG GTTTATGTGTC CTAAATATTTATGTATACAAGTG CATATTGTTAGTG CAGT CT CTTGGAAG C CATGTGTG GG CAGCAGAT CTGCTATTGC CCTACCTTTCAGTT TAAAGT TG CAAGAGAACG GAAATT CT CTGCCAGCATGAGAA TCATTCAT CTGG CAGC CAC CTGAATTAC TGTTAAGCAACACATTGAGT TTGACAGATAAC CCTGTT CTTAGC CAT CCTT CAAGAAAAGCAT GAAGGAGC CTTCATGTTTAAACATTT CAGC CAAATTTCTT CCTT CTGACAATGCATG GA ACA CAATG TCTT TATTTC CTAATGAAATATT CATATTGGATGTATCTATGGTTAGAAGAGAGAATAATAAGGAAT TAT CTTTATTTAATTT CTTAGT CATT CTGGTCTCAGAGAG CAAG CTCTCGGGTTG GGTAATGACGT CG CC CAAG G
a a c c t g a t g t t t t t c a t t g a g t t a t a a a t a t c t c t c a a c t t g g c a a a c c a a t a g t t c c c a t t c c c a g g a t g a c a a ATGAGAAAAATC GTTACTAT TTCAGTGGATAGTTTTGGATCTAG GAAT TTGTTGTGAGAG GAAGGGTGGT GTTAT AGATGT CC CTTCATG GG GGGTAAATCAG CACAATATGG GACTTCAGAGGACTTCAATATGG GATTGCAAG GGAAT CAAGAGAC GCTATAGGAGAT GCTGAAG CAGTGTGGCCAGG CCTCGTGGGAGG GGAGGCCTCG GATTGG GAGTAAA GC CTGT GTGTAAGC ATGT TG GGGAAAGC CCGAGAGC AG CCTGAGAGATATAAGCGTATAGGTGG C CCTTGGTCAG TCATCTACCAGGCCCTGTTCTTACAAGAAGGGTTTCTGTTGTCCCAAGAGACAATACAACTA3CAGAGGAAAAAG ATCCTAGACTTCGGGGCCAGAAGGCACTTATTTCAAACC(N)xGTAATTCTTACCTCCTGGTGTTGTTGAGAGAA TTAGGCAG GATAA CATATTAG GTAAG GTGC CAAG CACATG GTGAGTGGTC CAGCCGTAACAG CTGTCACATATAT CCATGGGGCTTCCTACTG CCTAAATC CCTAC CAGATGC TTGAATGC C CTGAT TGCATCTGTT TT CCAC CTACAAC TCATCTAAGACCATGACCCTACATGCCCAACCCATAGGCTCCATCCAGCAGTCTCCCTTGCTCTGTTTCTAAAAT GAAAATAAT CCC TTTCCAGTTTTTTAG G CTTT CCATAG CTGGTCTC CAAGA CAAAC CGGGAAAGACTACCAAAAG AAAGCATTTTCTTTAAGTTTTTTGTTGTTGTTGTTAATGAGAAAAGTTACAAAGATAAAAGAAATCTCTGGAAGC CTTGATCACCATTGTCCCAGAGCATTATTGAAATGCAAATAGATTGTGACAGTTTACCTGTTACAGCTCTGTCTT CAAACACCACTTGAGAATCATGTGAAAAGAAAGGGGGGTTTTGGTGGACAGTGCAAATACACAAGCCTCCCCTGT TGGTAGAGAAGTATATGTAG CTTG TGTGAAAAAGGGGATC CCATGG CCAGACTGCCAAGGGACG CGTCTCGAGGA GAAGGAGAGGCACAACATTTTACCAG CC CCTACTTCAT GC C A G TT(N ) xTGCGGGTGTAATGTGCATTATTAGGñ AGGCTCTTAGGTGAACATTCATGTGTCTAGAAGAAAAGACGTTTTGAACATATGGAACGTAGAAAAGAAATTTTT GT TAAATT CTTTAAATCC CAGCAAGTGCAC TT CAGATT CAAAGT CCTTAGGAGTCC CC CCAT CACTCAGAG CTT C TCTTTGGCTCTTCTCCACCATCTCCTAGTTTCTCCTCTCCCTTTCCTCTCTGTCCCTTTCCTCTCTCTCCAGCAC TG GAAG CAAAAC TAGGCTTTTCTATTTAGC CAGATCTCAGGGC(N ) xAGTTCTGTGATAGTTCATTTGCTCTTCA ACCATTTCCTGGTGAAAACAGCCAATAAGGAAGTCAGAAAAGGACTGCCAGCTGCAAGAAGTGACTTCTGACATC CAGAGGAG GGAC GG CAGAGACCTT CTGGTAGT TGGAACTACTTAATATAGAG GGTG GCAATCTCTCACAC CTCAG AATGTCTCTGAGTCATCTAGAG(N)xGGCCAGAGAAAATAAACTAAGGCTTAT(N)x CTTACTAGTCTTGATTTT TTTAAAAACTGAGAATTATAAAATATACTTTAGAAGGATTCCTATCCAAAAAGCTAAAATAAAAATGCATAGGGT TTTAGAGGAAAATT CTGATGAAAGAATG GGAGAGAGAGAATTAT TATAACTAAATT CGATGAATGAATTAATGAG TATCAAGATCTATG CTCAGACCCTTG CTGTAAGATGGCTC CATCAAAGGC CTGCCCCC CAGTGGAGTTTCTGCCT CA
> H s l5 _ 625405 07 - 6254 99 57
ATGGCAGGACCAGAACAAGGACTCAAATTTTCCAGCTCTTGGCTGGAGCCTTCCCATACCCCACCACCCCTAGGG CTGCAGCCTCTCAC CTGTTGAAGCGTGAGCTCAGTGAGTT CCAGTTTCTTTTTCAGCTCCTC CAAGTCACACTG C ATCCCTGC CTTGTCAATCAGTACAACTTGAAG CTGCTCTG CCAAAG CTGAGTTCTC CTG CTT CAG GTC CTTATTG CTGTTG CTACGG CCAGAGG CAGTAGA GAAA GGAATGAACAAAGAACAGAAAG GGCTGCTTTGGTGATCAG CCCT C TACCCTCACCCCAC( N ) xCCTGTGCACGCATTCAAACCTGTGTGACCACGTCACCATGCTCACATGCAGCCCTTC CACCTCCCAGCACATCACCCACACTAAGGGCCCCACACCTCCCATCCCACCCTCCCCCGTCCTACCTGTTATTGT ATAACTCCAGCCTGAGGGCATCGCTGTCTCTGGTTAACTGGTTGTTGTACTGAAAATACAGAAAGGTGAAGTCAG GATACAG CAGGCAGAGAAGCAGCTGG CAGACTAGGAACGACAGC TACAGTGACTATTC C (N ) xAAAAAGAGAATA TTACTATTGTTATTACCGTTATGACTGCCATTGTTGGAACC(N)xGCTGGGGGTGGGGGCACAGATGGAAAGGGG GATAAT CT TGTGTT CAGTTTTTGAAGG GTACATT CTCATAGTCCAAAACT CAGAAAATACAGAAGGGAAATATCT CC CAGC CACCCTGGTCCTTT CTCCTG AGTTTTTT AC AAAT CCTTGC AG AC ATGTTTTATGTATATTAT C ATAGTA CACACACACACGTGTTCCCTCTCTCTGCACAAATGTTAACATACTAAAGATACTCTTCTGTACCTTCACAGTGCA AGTACCATATCTCCCACCTAG(N)xCCCCAACTCACCCACAGCAGCCGACACAGCCCCAGGCTGACTCTAACAAG CACGCACAAAAG CAGCGAGAAATGGC CCATGCTG CTTT CTGGGCAG GACACT C CAT CC CGCAGAAGGGAC CTAAA GGTCCCTCACTCCTCCATCT GGAAAG CCGGGCTGCCAG GGTATG GGGCAGGCGGTTGGACTCAC CCTATCTGCCT TCCTCTGCTCCTGCTCCAACTCTCCCACATGCTGCCAGGAATACAGCAGATGGCTGGCCAGATCCTCGGCCTCTT CTAGAATGAGAGAGGTTGAGATGGGG CC CAAAGGACTC CC CCTAAAGGCCTGTCAAAG CACCAGGTTGAAGGATG ACGGGTGCCCAGATTCCCACATTCAAACTGCCTGGCAGCACGTTCATTGTGATACAGTGTTGTCTTCAATTCTGC TT TCTCAAACAT CAAGACTCTAATTATC TGAATTGAAC CTTTAG GAGAAAAG CCAAGCAAATGCTGAAAGAGGAG GAAAGCAACATT CT CCAGAGGACAGGAG GGAACTTCACAC C CTC CA CTCACCTGTAATTGCCTC TTTAGGGCTC C CCGGTTTTGCTGGCTTTCTTGCTTTTCCTATAGGAAGAGGAAGACAGAGCTCTTACTAGGGGGAGGCAGAGATGG CA CAGCAAGGGACATGCC CCTAGAATGC CAC CAATGCC CCAGGACAGGCC CACCCATGGGAC CAGGTTAT CAGGG ACCCTGTGGGGATGAGGTGGAATCTGGGGAGTGAGCCTTTTTCCCCAGGCTGGGGGTGGGCAAGACGAGACTGGG GCCTCTACATCTGAGTGCCCCCCAAACCCAGCAGTCATGCCGTGAGCAAACAAATCATTTCTTCTAGTTGCTTGA CAAGTTTTTGGT TGTGCTGTTTCTGC GGGGAGAGTCAAAGGAAGGTGACCAAGGATGG CCCC CT CCACTCTATT C CC CAGACCAGGAAG CGGTAGACAGGGGC CAGAAATGGATTTTAAAGG CAAAGTTCT CAGACC CACTAGGACCATG AACTGGTAAACT CT CCTCAAGCTC CCAAGGACAGAGGATTTGGGT CTTTGTTGGTTTTGGCCCACGGC CACAGAA CTGAAAGT CCGAAT CTGG AT TCTC CCGAAAG G AC AGTAAC AT AAAC CTTT AGAG ATGG AGTC TG AGAAAAG CTC A CC CTTCTACCAG CTTGTGAT TTAGAAAG G TG (N ) xAACCCCAGGACATGTGTGGCAAGGGCTGGAGCATGGGTAT CTGAAGAAGAGACAGTAG GCAAAGAGGG CAGCAACAGAAGAGCCATGATG CATGCT CCGTGCTCTGGGGTCCCTC TAGCTGAGGCCTCGGCCCCCCTGCTCCCCATTTGCCCTTGGCATCAGGGACCCTCAGCCCTTTCTTCAGGGCCCC AAGGGGAAACTGGAGCCCAG GACTTG CAGCGTGGAATT GG TGGACC CCAT TGAACT CTTACCAATGACTC GATGG TTTTATTC CGTCGACTGATTGTTATATAACTCGAGTC CAGGGCGACTGTTA C CTCT TG GTAT CTGCTCTGAGGCñ CGTGAAGAGAGGAG GAGTTGGAGGAG GATCGAG GGGAGAGGTAGAGAGAG CAATCATTAGGGTTG GGGGGAGGGT GTGAGAGGTCTCAGATGG CAGAGGGG CACC CAGC CCC CACTGTG GGAGGAGGTTGGAGGGCTGG CCTG CAGGGT C AC TGGGTCATGG CCCAGGQCCT CTTACT TCCAGATC CTTCAG GGTAGCG GATGATC CAGAGC CCT C CG CACAGAT ACCTGTTGCTGACTACAAGAGATGAGAGTGCGCATGGAAATCTTCTGTCCCCTCCATGTCTAAGCCCTCTGACTT CCTTTCTTCCCCAGCAACTGACAACATTTTCTTTTCTGCCTAACTTGGACCCTTCATCCCATAACCTCTTTGTGC
c a a c t t c t c t c a t g g t t t t t a t c t c c c c a c c a t c c c t t c c t c c c a a g c a g c t c t c a t c t g g t g t t t c t a c a g t a g GATAAAATGATGTAACCAGCCACGGGGTACAACTCCTTCATGTGAACAGCCTCAAGGAAGAAGCCTCAGGGAAGA GGCAACTTCCTCAACATGGCCCAGCAAATCGGCCAGTCCCAGGATCTTTCCTCTCTCTGGTGATTCCTGCCTCAG TTTCTTTCTATCATACTTTCCTTTCCCCCGCACTCCCCTCAATTTCTGAAGTTCCTCAGAAACAACGCCATTGAG CTGTTGGGAGAGTTGTTG CAGG CTCT CGGTTAATGAGAAGGT CCTAAGGGCAGAGACACAGG CGTCAG GGGTGCA GGTGGCTCAATGGGAAAGAAGGTGTTAGACAGTGGCCTCCCTGACTCCCTCAGCCTAGAGACTGCTGCCCTGATA AGAGACACTCACTTGCTTTCATCCAT GAGATT GGGG CTAGCATGATGATTCTGTGTGGGAAGAAACATATGAGAT GGTCATTCAATTTCTGTAAGAATCACCGAGGACAGGTGAGCCAAGTTCCCAAGCTCATTCCAACAATATTCTATC TCCCCAAACCTCAAGGAAAAGCCCAGTCTCAGCCTTCCCCCAGCCCAGTCCCCAGGAAGGGCACCTGGCCCTAGG GAACAAACTACT TGTGAAAAGAGAAGAGAATC CATAGTAGGTACAG GAGGGCAGTGGG CAGAGGAGGAGGAAGCA TGAACCTGAGGTGCCGCTATGCTAGCAAGACTGGCACCAGGGCAAGGGACACCACCAGGTAACACGGTGTCATTG G CAGGTGG CTAGACAGATGCAG CATTGT CTTTGGCGTCTGTCACAGAACAAAGCAGGG CATTAGGGGG CCATGCA GGAGAGAAGGTGCCTCAGTGGCATGGACACAAGAGTGTGGAACAGGTATGACAGATACTCACGCAATGACAGTGA CAGTCATG CATG CTTGGAGCCACTCGGGGGTGTGGATATGGTTTGC CCAGGCCTTGGGCTGC TAGG CAGCAT TTT GAATGGCCAAAGCTCAAAGCAGGGGCTGCACGGCCTCCTTCATGAATACATGAGGACCTCGCTCCCCCTGGAGTT CAGCAAAAGACAGCAG CAGGTT CTGACC TGTGGCAG CCACAG CAGG CTG GAGGAAGGTGTAC CAAAGT CAGGGAA GAGAAAGG CACTGTTCTTCCAGACGGCCAGTCTGTGCCACAGAGTGAC CCTAACTG GGCCCCTCCTGGGCAGAGG ACAGTGATGAGGAAGTAGTAGT CTGG GCAATGTACAATGTAG CTGC CT CTCT CTT CTCTCTT CCCTTCTCCCTTG CAGTGGCATAATCAGCAGGGGCTACCATCTCCTACAGCTCGCCCAGCTTCTCCTGCAGCTCCTCCTTGACATGCT GCTCAGACTAAAGTGCATTGTT GATCTC CATATTCT CACTGTTCTGGACAGAGAGAAG CAATCACGCCACCCACT GCAGCTGGAGACCGCAGAACTTGGTGTCTGCCTCCCATGGCGCCGGGAAGGATGGAGGCAGATTAGAAAAATGAT CCCCTCTCCCCCATAGCCATCAAACCAGGGCTCTGGCTCACAGGTCCCTTCAGAAGTGCCACTTCACATGAGGGC TAC(N)xGGGAGGAGGGCAGTCTCCCCAGGTGGGATGCACCAGTTTTACAAAGCTGCTCTCTGTAGCTGCTCCTT GAGCTCAGGGTTCTGGGAGAGTGTGGGGCTGATGGTGGTGAGGTCATTTTGCACGGTCCCCAGGGTTTGCCTGCA TACCTCCGCCTGCTCCCCTCAGAGCTCAGCCACCCGCCCCAGCTCCAGCAGCCTCTCCTGCTCCCAGTTCAGGCA TTTCAAGTCCTCATTGTCTTGCATGTGGGCCCACTGGAGCTGTCCTGCTAGACTTTCCAGCTCCTCCCACAGGTG CT CAGC CT CTGC CTGTAG CTGCTGCTGCAGCT GCAGAC CCCAGAACTTGGTGTCTG CCTCCCACGG CACTGGGAA GGATGGAG GCAGATTAGAAAAATCAT CG CCTCTCAT CCACAG CCAT CAAAGCAGG GCT CTGG CTCACAGGTC CCC TCAGAAGTGCCGCTTCACATGAGGGCTGCGCCCCCTGCTGGGGGCTCCAGGGGTGGGATTCAGCTGAGAAAGGAA GCAGACAATAACGGCCTCTGGATTCT CAAAAA CACC CT CCTCTTGGTA CACAGCTC CTCTCGGGCTCCCCAGACT TGGCCTCCCTGCTAATGATTCTCAAAAAAACCCTCTGCATTCTCAAAAAAAAAAAAACCCTCCTCTTGATCCACT GCACCCCTCAGGCTCCCCAAACTTGGCCTCCCTGCTAATGATTCCTTGCACCCTGATGGTAGCCAATCTTCCAAG CCACTTTCAGATAGAGACAACTGTGGGTGGCTGACAACACACACTT( N ) xGCCTCCTGGCACAGACCTCTTTCCC TCTGCCTCAAAGCCTTTCCATGCATCCACCTCTCTGGCATTCTAAGCCATCCCCACAGCCCTCTGATGCCAGTCC TGCTCCCAGGTCACCCCAGCCCCAGCTTACCCATCTGGTTCCTTAGTTCAGCCAAGATCGTCTCCAGCTCCTGTA CTCGACTCATGCTATGCACCGTCTCCTCCCTCAACGCGTGCACCTGCCCAAAGCACAGGGGGAAAGGGCCCTGCA GAGAGGGGCTGGCAGCTGGACAAGCTACCATCTCCCTCTCTGCCCCTACCTCCACAAAGCCCAGACCCATGACCA CCTCTGGCTGTGCTCCTCCCATTTCACAGATGCCTGGAACAATCAAGTGACCTATCTACGGTGGGGGCTGAAGGG TCAGGTCTCACCTGCTCTGACATCTCCCGCATCCTCTGCCACCACAAGGCGCTCTCTCCTTTCAGATTCTCAGCG TGTTGATCTTTCTCCATCTGTAGTTGTTTCAGTGACCCCATTACCTGCAAGAATGGGCACAGAAGTTAGGAAGGG CTGTCACTGGTCCTCACACGCTCCTGGCCACCTGGGGTCATCTTCCTTCCACATCCCTCCCTCTGCAAAGCCTCA CCTGCCCCAGGTGTACTTCCAGCTGTGCCCGCTCCTCCATGGCCTGCTGTAACTGCTGGTTAGCAT CAAGGG CTT CATAACCAGCTTCAACTCCTCCATCTGGGAAGCTGGGCTGCCAGGGGATGAGGGAGGCTGTAGGACTCACACTGT CCATGT TC TTCTGCTG CGTGGAGAGAGCAAAGAGAGTT CGTT CCAATT CTCC CGTAAACTGCTGGGAATACT GCA GG CAGC TGGCCAGATC TGTGGACTCTTC TGAAATGAGAGAGGTTGAGATGGGGCCCAAAGGACTAC CC CTAAAAT CCTGTCAAAGCAGCAGGTTGAAGGATGACGGGTGCCCAGATTCCCACCTTCAAACTGCCTGGCAGCAGCACGTTC AGTGTGATACAGTGCTGT CTTCATTT CTGCTT TCTCAAACAT CAAGAT T CCAATTGTCTGATTTGAACTTTTGGG AGAAAAGCCGAGCAGATGCTGAAAGAGAAGGAAAGCAACATTCTCCAGAGGACAGGAGGGAACTTCACACCCTCC A CTCAC CT CTAACTGC CT CTTTAGGG CT CTCC CTGGTTTTGCTGGCTTTCTTGCTTTT CCTATAGGAAGAGGAAG ACACAG CT CTTACTGGGGGAGG CAGAGATGGCACAG CAAGGGACATGC CCCCAGAATG CCAC CAATGC CCCAGGA CAGGCCCACCCATGGGACCAGGTTATCAGGGACCCTGTGGGGATGAGGTGGAACCTGGGGGGTGAGCCTTCTTCC CAGGCTGGGGGT CAGCAAGACGAGACTAG CAC CTCTACATCTGAGTGC CCCC CAAACC CAG CAGTCATGCTGTGA G CAAAGAAATTACATTACTAGTGTGATT CTAGTTGATC CACAATTT CCTGGT TGTG CTGTTT CCTTGG GAGAGTC AAAGGAAGGTGACCAAGGGTGGCCCCCTCCACTCTATTCCCCAGGCCATGAAGCAGTAGGCAGGGGCCAGGAGTG GATTTTAAAGGCAAAGTTATCAGACCCACTAGGACCATGAACTGGTAAACTCTCCTCAAGCTCCCAAGGACAGAG GATTTGGGACTTTGTTGGTTTTGGCCCACAGCCACAGAACTGAAAGTCTGAATCTGGATTCTCTCAAAAGGACAG TAACATAAAGCTCTATGAGGCAGGA(N)xGAGTGAGAAGTTCAGATCTGGGGATCCTGGGCCATTCCACACAGTG CCCTTTAAAAGGTCTAGAGCTGGGCTCAATGTACAACTTGGTCAATAAAGATCTCTACTGTGAAGTTGCTTTGCT TT A G ( N) xCTCTGCTATCTATTATCACCGTGGAATAGTTGAAGTGTTGGCTTGAACCTCAGAAGGAAATAAACAG GCTCAT GAGCTAGCCATATAAG TATAAT CTATATAATAATGGTTTT CATCCATGATGCAT{ N) xATGGCAGGACC AGAACAAGGACCCAAATTTTCCGGCTCTTGGCTGGAGCCTCCCCATAACCTGCATGATCCCTAGACCATGTCCCC AGCTGGATGGGGCTCCCACCACCCCTGGGGCTGCAGCCTCTTGCCAGAAGCAGGATCTTAGCCCTCTCCAGCTTC CTTTGCAGTTGCTT CAGATTGAGTCGAAAT CTTCGGTCACTAGGACTTGAAGCTTCTCTT CTAGTTCT GAGTTCT TCTGCTTCCGGT CCTCGTTGTTTTGGCTATG G CCAGAGGCAGTAGAGAAAAGAATGAACAAAGAACAGAAAGGAC TACTGTGGAA
> H s l 5 _ 92982464 - 92989 48 2
GGGGTGCCCTAC TAGT GACTTTATTACC CTGTT CTCAAAAT C TG GGAAGTGCGGGAGTAACAGAGAAACT TGAAT TTTGTACCTGACTATAATTTTCTCTAAAATATCAGCTCTCATTTTCCAGTTGCCTGGGACATTTGCACACAAGGT TGCCCATGGCCCTGCCACCTGACCCCCAACCCACTACAGTCATGGAAACAAGGCTTTGCTGTCTCCAGGTCAAAA TTAGGGCACG CGGAGC CT CGAGTTTCAT TC CAGC CGGCTCCC CAGCCCTGGCAGAA CC GCAAACGTTCT CATATT TCTGTGGGGACCAACT CGAGGCAAAGACAG CCTTGT CCTGGGGCTTTGGGAAGC CG CC CTATTGTAGAGTTCCCT AAACCTTGGGACATGGGAAGAAACCGTGTCTTTACTCTTGCATCCTCTGGGATTGAAACGCTACCACAGCCTGGC AGCCAGAGTTTTGTCATCCTCTGCCAGGTCTGTCTGGGCCTGGCATATCCAACAGCTCAGTCCCTGTTTTTCAAA GCCCTTTAGC CTAAAGA CTGGTGGAGGGAGG GTGAG G GGAGGGGAAAGGAAGAAGGAAGAGTGG CTGGTCTGTCT GCTTGGAAGC CAGAGAAG GAGTGACGGGTGGGTGTTAAAG GTGAGTGTGTGGGGGC CAAGGCCTCCTC CCAAGGG GTCGGATC CT CT CCAGGAGTGCCCAGCAGTAT CCGGGGGAGG CGGGGGGGTGGCTGTCTGAC CTAAAG C CACACA TGTAAG C CATGC CT CTT C CATTGTCACAAGTCATGAGT C C CCACAGTGAGCTTTACACAT CC CATGAAGT C CTGA CCCAGGAATT TT GAGAGGCAGATTTTGG CCACAGTTTCCTTTTGGGACCCTTTCACATAGTCAG CAGC GCACAGT CCAAAGCTGT(N)xATTATGAATACAGTCTGTACTCCAGCC(N)xTGTGACTGCACGTGGACAGGTGCAGCCGGG GTGTGCTCTGCCTCACTCCTGGGAGTCACGTCCACACCCATCCACACATACTTGGAAAATATGGGTCAAAAATGC CAAGGAGCAAGT TACT T CAAAAAGTGCAGTTG TTCTTTGTCC CTTAGGCCTATG TC CATG GCT CTGAAACTCCCA CGTGTCTCTGGGGTGGACCTGTGGGGTGTTTTAGGACAAGATTATGCAGCCCCACTGGGTACAGACTTACGTTTG ACAT CAGGGATACGATA C CCAGCAGTGG CACTAGCTCCTCCACC CCACTGGGAC TATTCCTCTGGGTTCT TCCCA TCTAGGAGTTTATTCCGTGCCCTGCACTTGGTAATATCCTACAGGCCAAAGCCCCACACTAACAGTAAAGAGAGC A TG A ( N) xCCCATGGACTCTGTCTCCACTOCACCCACCTCACCCCCAGGTGGTGAGCTCCTCCAGAGCAGAGTGG CTGTGTCCATCTCTGGGC CTCCCGTACAAGGCTGTG CCAGGC CAGAGGAGGGCC TC CATGTGGC TT TACTAGAAT GGCAGACGCTGTGAAGCTCCTGATGGAGGTTTCTGTCTCTGCCAATCCAAGTCTTCAATCCATTCACCCTACCTC CTCTGTGGGAGAAACACAAAGACACGCAAGAC CTCACCCCCTCT CAGAAAACTCTG TGTGGGGAACACAAGCTTG TAAGTC CTGGTGGGACACATGGCACGCAAAATG CAACTAT CTTG CAAAACCTAGTGCCTC CTAGGGAGTC CAGGG CAGG CTTC CG G CAGAAG CGGCATTGGAGTGTGG CTTGC CTTTGACCAGTAAAAGTG GTGAGGAGGAGGAAACAGC ATGAAG AGGGGTGTGG AAGCAAGAAGGC AG ATGC ATGG AT CC GG CAGGGC ACTG CT CG AG CAAG CGTC AG CTGCA CAAAGT GC TG CTGGGAATGTCTGAGGGG CTGGGCCCAG GAGAGC CCCACGATCCTG CAGTGC CAGG CTGAGGACA CTGTGTCTAT CTGGGATCGTGTGTGCAGAGTATAAG TT CC CAGATGTTTGGGGAACAGAACTTCACTGTG CATGC ACACACACACACACACACACACACACACACACACACACACACGCACTTCTTCCATAGGGAGAATCTGATGAACAA ATGCCCTT CCATGGATAAACCACATCAT CATCA CTGTTATCC CTTAC(N)xCCCAAGCTGCCTCTCTCACCCTGC AGAGTACT CTTCAATGTGTACCTGTAG GTGTGGTGGTTTC TGGAAGGGAGTGACTC CTCTCCTGCC CGGAGCCAA GCCCATTCAGAAGTCACTTTGGGAGGCCTGGCCTACTGTTGGACAAAAGAGAGCCTAGATCAGATTGAACCCCTT CCTAACGC CTCCTTCT CAA CTAAATGAAGCAATT CACAT CTGGT GGGGAAGGAAGACATCTAGACAGT CCACTTA GGGCAGAGCTCCCCCAACCCAGCATCCTCACTGAGGCACAGTCATTGGAAAAGCTTGGACTTCACAGCCCATCCC TGCCTGCCTCTTCTTGTGTCTCTGCAGGTAAGG(N)xCCTGTTCCCATTACCCACAGCATCTTCCAGGAGGTGGC ATCCCAGACTGGCCAACATTGCAGTTGGGAGTATATTTCCTGCTTCACCCCTCGTGAGCCGCCAGGGTCAGGTGC CACATC CC CTTCATTCTC CTTGCCTCTTTCACAG CCCCTCCTGC CCCATGGTTGAT CC CAAT CCAT CTGAACTGC ATGGAGTCCACTTGGCCCTGCTCTCAGGACCCAGAGCTTGGCCTAGAATTGGCCCCAGGACCCTCAGGAAGGCCT TGGAAG CC TGGC TCAG CCGGGTTTCAAG GCAATAAAAGTAACTGAGACCTGAGCAG CC CCACACTGGCTGATGTG CTGCAGTGACACCT CAGT CTCTGAGGCC TGTGGCAAGGGCTC CTGAAGGTGAAT TGAC CCAGGG CAAT CCTATCT GCTTACAATGTTTAGC CACAGTCAAAGATAGCAAGG CC TAGGG G CAGGGATGGGAAGACG CGATAT TTTAGGGGG TTCAGTCTGAAATCACAA( N ) xTCTGATTAACCCACTTTTTCTCCCCAGTTGTTGAAAAAGGAAAGTCCTTGCTT AAAGGGAGGATAAATGGTACGTTCCTGGATTT CTTCGTTTGATGATGACCTGTGAT CCCTTGTCTTCATGTCTCC AGCCTCTATTTTTACACTCTTAACTGGATTTCTCAGACTTATTTTCACTGGTCAAAGGGCTTTTCCACCATTAAA TCCTCTGTGCCAGACAGCAGCCAGACCTCCCCTGCAGTGCCCTCTGCTGCCATCCCAGTCCTAGGCACAGGCAGG AAAC CCAGGTGTGCACAC CCAGGCCGGGTCTC CCACAAGGAAGT CATCAGGTTAAATAGCAGAGGCTG CT CACCC AGGCAGCCTGATTGTCAGCATGAGTTTGCAGTGGCCTCTTAGTTTGCCCTCCAGTACCCAAATCCATCCCCTCCA ATCCATACTGCAACTT CATCTCATTGCT TT CTAATCGTGG CCAAAAATTCTAAACCGT CC C (N ) xAGCATGGTAT TCCTAACCTGCTTGGCTATAACACTACCTGAAGAGAAA( N } xTCTGACTTGGCAAACTGGAGAAGCACTCTTTTC GTGTACCTTGTC CACTflACAGACTTCTG CTTAGTGGTCAC CACCTCAGGGAAGT CCAC CTGACACT CCTC CTTCC CATAAGAGCGATGACAACAGTTAGCATTTATCAAG(N ) xGGGAGGGATGCTGTTGTGATCTCTATCAAAGCTAAA GTAATG GAAG GT CT CAG GAAGCACAGAGAGGTTGAG C (N ) xGTCACCTAATGGGCCAGATCACTCTTGGCAACAG CATCATCATCTTGTGGACTGAGGACTG(N)xGGCAGAAAAAGGAGGAGAAAGGATGATGGGAGGGGCCTGTGAGT GTGGAGAGATGCCCTGGGT CTTTGTTTGGG GATGAGAAAGAAGCAAGGAGAGGCAAAG GATG G GAT CGAGTTAAA CAGAGGAAGGGGTCTCAGCTTTCCCAAAAGGATCATGTTGCCTTTTTCTCCGGCAGGTGCAACCTGGCCCCAGTA CAGGAGTATGCCCGGGATGTGGGGCTCAAGACAGACCTGGTAACCATGAACCCCTCGGTCATCCAGCGGGCCTTT GAGGACTTGGTCAATGCCACGTGGCGGGAGAAGCTGCTGCAACGGCTGCACAGCCTCAATGGCAGCATCCTGTGG AT CC CTGC CTTCATGGCC CGGGGCGG CAAGGAGCGT GTTGAGTGGGTCAACGAG CTTATC CTGAAGCACCACGT C AACGTGCGCACTGCATACCCCTCGCTGCGCCTGCTGCACGCCGTTCGCGGGTGAGCGGCCTCCCTACAGGCCAGT AGGACCGTCACTAAGTGTGGGAGATTCTTGGTAGGCAGTATCCCTTTCCCAGGTGCAT ( N) xAGGTGTGTGATTT TCAAGAAAATCAGGATCAGATGCACT { N) xTGCGGTAGAAACATTTAATGTTATCATCAATCATCGCTATCTTTA TGTCTCAGTGCAATATTCTCTAACTCTACAGAQCACATCTTACTGATTGGGCTGAGATCTTCAAAGGAGCGTCTC TTAG GATATGTTTT TGTC CACAAGGT CTTCTCTCTAAGATGGAGGGGAACTTTACCTAGG CAATATGAAAGAGAT AGGGAGAGTTAAAGCAATGGAATCCACTCTCTTTAAGAAGAGTTTTGATGTAACTAATTAGGTAAGGATTTTCCT GGTTAGGC CTGGAT CTCTAATT CCAG GAAAGAGATT CCAAGAATTAAGGAAAACAAATAG GAAATGTTGAG GAGG GT CTGAACTATT CCTATCAAGAAGTC CATGTCAACCTGGATGGCATTCTTAAACAG C CCCAGCCAAACAAGTAGT AGTTCCCAATCTCCTCCTTGTTTCCTCCCATTCATGTGACCTGGCATAGATCTCCCTGGCACTTGGATCAGTAAA TTGG CTAGGAAT TTACTAGTGC CCATGATAGCAGCCACAAGAAAAATAGTTTAATATTAATCAT TCATTCACTCT CTTC C (N) xTGCAGTCAGACTGGCTAGAAGTAAAACAGTGGGGGGCTGAAGCAG CTGGGGTTGA CCATGCGTCT C TCTCTCTTTATGTTATATTAGG GTTT CT C CAAATGATCTC
> H s22_230 481 39 -23 058 946
GATGGCTGGCTATCACCAAAATAAAC TACCATAGAC CTGAGGTGGAAACAC CAG CGAATTTCCCAAATTT C CATT GTGAACAT C CAAATGCCCATTTATAT CTGAGT GTTC CTTGTATTCTTATAAAAT GAAAATGGAG CCTGACATAC C AGGTTTTTGGAGTCCTGAAGGAGATGTGGAGACAGATCCTTCTGCAAGGAATCCACCGAAAGGGGAGCAGCCTCA CCCCATGCAGGGGCTGGAGATGCTCCCACAGGGGCAGCTGTAGCATGGACGCTACCAACCTCCACCCTGCACATC CTGC ACAAAC AATC ACTCTTTC ACCC CT CCTT CAGCTT AT CAAAGTTGAC ACTT ACGAAG CC AG CACAGCTCAC C CTTGTCAATCAGACTCCTTCAAACCTCCATCAACCACACAATCTCAACATAAAGAATCTCCCCAACATAATAATA GGAACAGTAGCTATG (N) xGTCTTGTCTTACCCCTTGAGGTGGCAGGTGCTTTGGAGTAAC (N) xCAGTCCAGTA AACTGAAAACAAAACCCCAGAAGTAATCACACGTACTGCGGGAGAGGAGAGAGCTGTCTCAGAGGGCACAGTCTT GTTGAAATTCCTCTGCTCCTTGGGAAGAAACAGCCCAGAAAAGGCCCCAGGGGCTGCGTCTCATCTGGGTTCCTG GATGAGGCACACCCCCTACTGCACTGAGATTCCACAGAAGCTGGGGGTGAAGTGCAAATACCGTCTTTGGTTCAT CCTTGGGATTTT CTTTTTATCACACACT CGTCATTTTGT C CACAATGTCCAAAGTTTCTAAAG(N) xTATTGCAT GCTCAGGAAGCAAAGTAATTTTGCAATTATACAACTGTTAAAAATGGAAGAATTATGTACTGAGCACCATTTTCT CTGTTGTTTATGCTACAATTCAACTTTCTCTCTCTCTCACTCTTTCTCTAATTAGCAATGATCTTCTGTCTGGTC CAAGGTCACACATTTCCAGTCTTCCAGAAATGTCTGTTGAATCTGTTAATTTGTTGCCAGGAAGAATACTCAAAC TT CT CATGCTCATATTTGAGTGTTAG CTGTCT CCAG CCAGCATTTCTAAG CCAGAC CCTTTGTCAAGG CTTTTGA ATAATTTTGTGAAAAGTACCCCCAAAGATGACAGTGTTCACTGTTGGCAGCAGGACTGTTTCCATGCAGTAGGGC TATGCTGGGTGTGCAGTGGAGCTGTGCACACTGTGCTCACGGGGCAGATTCTGTGCTGCTGACTCTGATCCCTGT G CTTGGGAGGAC CGTGTC CACTGCTGAGTCTACATC TG AATTTGATTCTC AAG AAAAAGGT C ATTC AT AG CTATT GGACACTGAGTCACATAGAGGGACAGAAGGTGGCTGAGGACACCTTATCTCTGñTGAGCATAAGGGGATT ( N) xT AGAGGTGAAGGGGATTGGGAAGCTTTCATGTTCTCTCTCACCATCTCCCTTGCTTATCTAACACACTCTTACACA CCTACTTTATTGGAGCTTCTGCTCTACAAACATGGCTGAGCTCTAAATTAGAGACTTTTGAAAGTGCAGACGGAT GCAGGAACT (N) xATTTC AAAG AAñACTAAAT CTTTGG AT GAACCACGAC AG AAGAG GGAAGGTT C AC AC ATTT C CCTGCTGCTTGGGCCTCAGTGAGGACTGACTCAGCCAGTCTCAGGCTCCTGGAGACCGCCTGGGAGATCTGCACT AGGAAACCATTTTCAGCACAGCCTGGGGCTGGTAAGGGGCCCTTACTCTGTTATTCTGCTGCCTGATAATTTTGC ATCCAGAGAAGAATTAGGACCTGTGAGTCCAGGAATTTCAAAGCCCATAGACAAGGTCCCCTCATTCTGCCGTGA CAGCCTCAGCTCTGCTTTCCCAAAAAAAGAGCATCTTCCCAGCATGAAGAACCAGGAGCAAAGAGGGAGCTTCTC CCAGGGGTGAGCAGGGCAGAAGCCACCTTGGTCAGTGGGAAGGAAACATGCTCAGCTCACATCTGACCCAGAATG ACTCAGTCACACCTGTGCAGGGTCACCTGTGTGTGAGCATCATCTATTTGCTCCATGAGAAAATACCTTGTGTGG CTCTTGGGTAAAATCATACCATAGGTGACATTTCTCTGCCTATTGCTTCAGCCGATGCTTTTCGTGATTCCCTTT AGGACACTGGCATAAGGTAAGCATGTCAGGTGTGGGACGTATTGAACAGGAAAAGCAGAGGCTTCTCATATCAAT GCTCGTGG CTTGTATTTGGGCC CATACC TCAT GGAAATGGGAAAA(N) xTGTCAGCTTAAAGGATCATTAAAAGG CCTATGTTTGAT TGAAAC CTG GAGTA CTGTACTTTGAGT CTT CTAATCAG CCAATGAGTG CAGG GAGCTGG CTG C CCTC CCAGGTCCTGACTGGCTC CATGTC CAGGTAGAGCAG CG CTCC CCACAGGCAG CTGGAGGAACTGACGAGGG GAAG CTGTTGGCAT CAGGGCCT CAAAAATTTGACTCAGGñTTTTCC CACGTCCC CT CCCCAAGT CCAGGCG GGTT CCAGGACACTCAGAAGAGCTCATCATTATGGGTCTAGGACTTCCCTGGTCCTGCTCTTCTTTCTTCCTTCCCCAG TGGCCTTAAGATTGTCACTCCCTGTCACAGGCCCTTCCTTTCCTTGTCAAGTGCACTCCCTGAATTCTCTGCTCC TGGGTCAGGAAAGCTCTGGAACTTCAGGGAAAGCAGTGCTGGGTGACCCAATGGGGACCCAGAAAGCTGGCCCAG GG CC AGAT AC AC AGGCAG CAGGGTGGGT CCCT ACAGTGCTGC CTGGTGGG CATC AGGTAG AGGGGAGTGACCAGG AC CAACTC CCCTGAGTGT CTTCAGGG CCAGGTGAAC TAGG GAGGGT CTGGGACCTG GTGTGGAC CCATGCACTGA CC CAG GAATAGGACAGGAAACT CCTCAGAGCAAGGACTGTGTTTTC CTCATGCTGAGCTC CTGAAAAG CAGTCTT GCCAGGATAGAGTGGGCCAGGGTGGATATATGTTGGGTGAATGAGCTTCTGTCTCTGTTTTATTGCAGGAGTCAC TACGAAGGTACGCTGGCCCTGTATTTGTATCCAGAACCAGTCAAAGCCACCTGTGAGTCCTGCATCCCTCAGTAC CAAG CCCCAGTCCC CCTT CAGGACCACCCACCA CACAGAGAGACAGACGTGCAGGGAGCCTGAGGAAGAACCTGA AACCTCCTCTACTACATGGAGTGTTTGTATTTCCATCAGGAATCTTGAGCACCTTGTGCCGGGTCTTGAGGCAGG AAGGGGCCACACAGAGAAGAGGAGTTTCTTCCCTGAGGTCAGCAGTCCATGAACGAAGGGATCAGGGACTTGAGC AGCAGATTCTGAAGCAACACCTGGACCCAGGAGGCCCCTGAGCCTCCAGCAGCCCGCGTAGAGTGGCCACCAGGG G G CAGCAGAGAG TAA CCTGG CAGAAG CACO TGGAGATG GGGTG GGC CT G G GCAT CAG G G AG ATG CC CACATG CAG GGAG GGT C GGTGAC CATG A C CT GAGATCTG GAGGGAAAGAGGTT CTT CTT TT CTT AG AGT ATTTGT CC TT GAC AC CGATGTTTACATTCTTCTGGCTTCCTGGTTCT CAGATCTGA CTCTTGGAGAT GAT C CT TATTTT TCTGACACACA TT CAGG CACTTTGC CTCTTCCCATCGACCTGGCCCC CAT CTCACTGTCACACATGC CATC CCAT CG TGTAGT CAG A CGT GT C C CAC CTCCCTGCCTTTGCTCT GGGAGC CCCTGTCCTG CAG ACACAG CCACTTCTCCTTCCCTGGC CAC CC CT CAACGT CCACTGTCCCCGGTCTTTCCTGACTCTGTTCTGGG GAACC TG C CAATT CGTAGT TCTTGATTTCC TTAT GAGACA CACAATA CTTACTCATTAATCTTT TGTTGACTTAAGTTTT TATCCATTGTTACATTTTCC CAGCA A CAGAAGACAA CTTAGTTATAATAAA CACTTACTTCCTGGGTCTTGGAGTTTGCAGCCCCCTCTCATTTTCT CAT GAAG CAAACATT TCCTTCACCTCCTGATCTGTCCCTGGTCCT GACAGCAC CCTGGT GATACTGAGG CACAGCAC C CATGAC CTGACACTAGAACT CG CAGC CAAGGTAGAAAC TTAC CG CATC CCATGGATCTTC CAAAAT T CTATGTG C CCTT GG CAAC CAAGAATTG CATTCTCCTCTAGCA CAGAG GAGCTGTGCCCTG GAATG GGGCCTGTACCTGTC CAA GGCTTGTGCCGTCC CCTGTGGGAGATGAGAAG CGTCCCTGCATTGGGCTCTTGGGGACCCGTCT TGGACATGAG T GAGAATGAAGAGGGTCC CTG CATTGGGCTCTGGCATGTGA CTTTAAAT GGATTTAGGCCTGTAC CAGACATCTCA TG TCTGACATAAAATATT TACAAT CAGGA CAT TACTAGAGAAGCAGAAAAAAGCTAAC CAC CTCCCTCCT GAG C C AGGATGGAATGAAGGAGGGGAC TGTG GAC C CCAGATAATT CC CC TGTCAC CACTGTGACT CTAACAAC CT CT TAA AT CACGG C CAACAT CTAT C C CATAGGAAGGTC TT TATAT C CC CTAGAAAATACAGAGGAAGT CAG CTCTGAGCTT TT C CACGAC CAACC CAGC CAAGGAG CAAGG CT GG GCACAAC CTG GGTAAA GATG TGAG CC CAG ACCAT GGGAC CA GTGGGTGAAGGAAAATCG CATGGG CTGAGGGGGTGGGTAAGCAGGGGCCAGCCCTCCTCTCTCTGTTTCCTTTGG G G CT GAGT C CTT CT CTG GAAA C CACAG AT CTCCTCCAGCAGCAGCCTCTGACTCTGCTG ATT TG CAT CATGGGCC GCTCTCTCCAG CAAG GGGATAA GAGAGG CCTGGGAG GAAC CTGCTCAGTCTGGGCC TAAG GAAG CAG CACTGGTG G T G C CT CAGC CATG GCCTGGACCGTTCTCCTCCTCGGCCTCCTCTCTCACTG CACAGGTGAT CCCCCCAGGGTCT CAC CAAC CTG CC CAG CC CAAGG GTTCTGGGTC CAGC GTGTCCTT GATT CTGAGC T CAG GAGGG CCCTTCCTGTGG TGGGCAGGATGCT CATGAC CCTGCTGCAGGGTG GGAGG CTGGTGGGGC TGAACT C C C C C CAAA CTGTG CT CAAAG GC TTGT GAGAGC CTGAGGGACT GCACCTGC CAGGAGAGAGTAGT GAGT TTTCAGTT CAAAGT CT CCATACAACAG GAAAGT CATGGGCCACTGGGGCTGGGGCTGATTGCAGGGGATACCCTGAGGGTTCACAGACTCTCTGGAGCTTGT CTGGGACAGCAGGGCAAGGGATTTCATAAGAAGCATCTTTCACCTGCAAGCCAACCTCTCTC(N)xTTATCTTTG CAGG CT CTGT GACCTCCTATGTGC TGACTCAGCCACCCTCGGTGTCAGTGGCCC CAG GACAGACGG C CAGGATTA CCTGTGGG GGAAACAACATTGG AAGTAAAAGT GTGCACTGGTAC CAG CAGAAGC CAGG CCAGGCCCCTGTGCTGG T CG T CTAT GATGATAGCGAC CGGCCCTCAGGGATCC CTGAG CGATT CTCTGGCTC CAA CT CTGG GAACACG G CCA CC CTGACCAT CAGCAGGGTCGAAG CCGGGGAT GAGG CC GACTAT TACT GT CAGGTG TGGGATAGTAGTAGTGAT C AT CC CACG GTGACACAGG CAGATGAG GAAGTGAGA CAAAAACAC CCTCCCAGCCTCGGTCACCCTCTTGCTCCAG CC CC GGGAAG CC TGTTGATAAAG C CATGAGTGAATCTGG C C CAG TT CACCTG GATC TGAGCCTTTCAGGTTGCCC TTCCCTCCAGCCCCCTCCAGGAGTCTCTACAGAAGATACATCAGGCATAAATATGGCCTGGAAGGGCCAGAATCA TCTGGTGACTTGGGGCTGTTGT GTGAGT TAGAGAAT GAAGG CTTGGGT GGAAAGACAGACAGAG G CAACCTC TGT CCACTGTCCTACCC CTGGATGGTCATATGGTGGGGACAGGG CAAGT CCTTAGAC CAACTGT CTGGATCAGGC CC C AGAACTAC TG CC CAGTT CTG CTGAGGTC CTGGCCCC CAGG CTGTGTGG CAGC CTGT GATT CC CAA CAGAG CAAAC CAGAGGAATGGACACTGTGAAGTCTGCCCAGATCCCCTCCTCAATGTGACC(N)xGTTGCAGAAAGCTTCCTCAA GTTTGTGTCCCTTTTCAGAGGGGTTCGGTT TAAT CAAC CAAGAT CT CAAATC CTTG C C TCAATT TAAGATGC CAC
(N) xTGGCCAGCTGTCCCCAACATGGGTCATCAGGGACCTGAGTAGGCCACTATAAACTGAAAACTCTGGTTTCT GT CCAAAT TTG CAGAGTAAATGTT GAAA TG CC CAAT CT GATGGT TC CT TGAATTTTTATG GAATGAAAAGGGAG C CT GACATG CCAGGTGCTCTGGGTT GAGGGATT G T TGGAGT CAGATCTC CC TG CAG GAAAG CCCGGGGCAGGGGGA GCAGCCTCACCCCT CACAG GAAC CACAGATACACCCA CAAGGTGAG CT GCAGGATGGATG CTGCCCACCTCCACC CTCCACATCCT CTGTAAATGTT GCTCCTTT CTACAACT C CAAC CAGATATGTAGATGTGG CGAACTA C GTAAAAT ACG GAT CATT CAT CA CAT CAAAAC CCAC TG CAGGACAC CCTGGT CAA CAAAGAAC C CAAT CACATC C C CATCAAC T ACATAGT TT C CAAATTT TCCATCTC CAGAAAAATAA CAATAA CAATA T ACATGAAAAT CG(N) xCCTCACAACT CTTT CAG GTGACAGGTAC TGTTGAGTAACCTG CTGCAAGCAT CCCCATCTCCAC CAGACCATATAAGTGTGAAC C CAG GAAGAGG CACTGGAACAATAGAGAGAAAAAC CTGCTTGT GCAGAAGACG{N) xCCCTTGAGCCCTGCTCCTG CT CCAT C CTA CGGGTGCCACATTCATCT CATG GTGTAATATTTCGTGC CCTG CC TGAG CTTATGAC CGAGGG GAT ATG G CAG GTCTGACTGTGTGGTTACTGGTGTCT CATGAG GTTCTGGATG TAACAAAG C CCT CGAATATAGAAGAG GTTGTTTT CAAAAG GAAATAA(N) xAAACACTGGGTGACACAGTTCTCAGACCCATGATTTATAGTGTCAGTATT CAGG CCTCAGGGGT CCCTGATGG CTTCTCTGGCTC CAAGT CTGGAAA CACAGCCTC CATGAC CATCTCTGG GTT C CAGG CTGAG GATGA GGCTGATTATTA CTGCAACT CACATAG GAGAGGTGG CACT TTCCACCGTGGT CCAAGTTCA TGGGGAATTGAGA C C CAAAC CTGCCCTGGGCTCT CAGC CTCTCTCTTGTTCTGAAGATGCTTCCTCACCCTGTGC AAGG GGCTTCTTGCAGC
TABLA D
>Hs1_16892504-16901698
G GGAGAGAGAGACAGAG GAGAAAGTGAGCT CAGCGAATTGGC CGGGTGACA CACTGA CGAAGG GGT CAAAGGA C A CT CT GAGTTAGTGC CCT CGGGACACACAGAGAACAGTGATCATGAAAAGAGT GG GCTCAATAAT TTTC CATAAAC TTGCTT AAGATTC C ATGCAGTTGC CATACAGC CTTTGAG GTATGGT CAA CCT AC AGTAAGTTAGTAAATG AT AAG GGGAGGAAGAAATGGAAACCTAAACATCTACTGCAAGGAAAACCAACAGCAATGTCAGTAGGAGTAATTCAACCT TCGTTGAAAACATGAAATTGAACATACTCTTGTTTTCCCTGGACCTGGCATCTCCAGGTGTCAACACAGAATTAA GC AT CC AT AATTGCTC AAAGTT AC CTGGGG CATG ATGGGTCTTG GT CTTCTT CCACTT CTTGGT ACTTTT CAATT TCTG CAATAAGTTCAGACATGGACAGACATATTAAG CTGGTT CT CCTACACACATAA CAATCCACTGT CTAATC C TCACGCAG GGACTT CAGGCT CCTCAG CATGAGAATAGGACACTGTGAGAGAT CTTCTT CAGGAGG CCT GAAGGCT GATC ATGATAGAGATT CCTGGGTTTTTGTC CC AG AAACTGTGGGTAAAATTC CCTATT CTGGTAGATCGTTATC C CAAGATCATTTGTC CCAAGTTTGTGCAAATGGTTATGC CATATT TTTC CAAT CGATTTAAAGCAAATG CC CCCAA ATGGTTGC TGGGAGAAAAACTG CAATATT CAG CCCTGT CTCATCAAATACTOAGATTCTT CATGGTAG CGAGGAT TT T AG ATG CTGAAATT AGAGTG AAGG ATGAAATCT ACAAGAT CT A C AAAATTG AGACAAAATCAGAGTTGTGTG A ATTTGTCACATCTGCCCAGATCCAACATCTTGAGAGTGGGATTAGGGTGCCACAGGCATGGCCTGAGACTAGGAA GAGAGCC C TGCTCACTGACCCATCCCTTGCCTGGGCTTC CAAGTGGAACTAGAGTTTCAT TCAACCTACATGTG C CTATAGGTCCTCCCTGTGGCAATGACATCTCTCAGCTCAGTAAGGGCCATTTGCAGTAGGAATATGACCCTAACC AGAAGACT CAGTG GAT CCTTAT CACCTTCATAGAAAGGTACT CACCATCCATGT CAAGAG CCCAGC CAACACGCT GTTG CTC CAATATGTAAAAG GCACTTCTGTAGGGCTGG CATGAGTCAGTCAGTT CAAGATAACCTGAAGGAGTTG AATAACATCTATCCAGTGAGTCCTGCAAGACTTCAGGCCCTTTCTCATCCAGCAGCTCCCTGCTGAGCCTGGAAC AGTGGGAAAAAGTAAAGAATAAGCCAGGGGGAATCAGAAACCACACAGCCCCAGCTAGATTTCATGGCTAACATA AGGAAG AGTTTG AAAAGAAAAAGGAC AG AT CC ATTAAT G AGGTAA C AAATTATT GCCTTTATATTGGGATAGACT AGGGCCAG GTAGAAAAGGATGAAAGAGAAAG (N ) xGAGTGAG CT CAGTGAATTG GCCAGGTGACA CACTGATGAG GGAGTCAACGGTCATT CTCTAT TTGTGCTCTCAGGACACACAGTGAACAGTGAT CATGAAAAGCATGG CC TCAAT AATTTTGCATAAAATGTGCT CAAGTTTCCCTG CAGC CACCAT GAGAATACAG CTTTTGAG GTAT GGTCAACCTT C ACTAGGTTAG TAAATGATAAGGGTAG GAAGAAATGGAAACCTAAA CATTTA CTCTAAT GAGAAC CAAAAAGCAAT GTAGTAGG CATAATTTAGACTTGT CT GACAAGACAAAAT CATTATTTT CAGCATGTACTGTTTTCCCTGGACTTG GCATCTCCAGGTGTCAACATCAAATTAACTGTCCACAATTTCTCñGACTCACCTGGGACCTGTTGCCTCTTGGTC CT CCTTTTTCACTTGAT CCCAC CGATGTCCTG CAAATAAATT CAGATGGGGC CT CTTACATTAAG CAGTTCTTCC TTGCACACAGAAACATTCCTCTGTCCAATCCTAACACAGGTACATCAGTCTGGTCAGTGTGAGAACAGGAGACTT TGAGAGAAATATTC CAG CAG GC CTGAGGTCAAGTCTTGAGAAAACTGG CTTGGGTTCTTT CATGAG CCTTGGGCA AAATTACCCTGTTTTGGAATGTTATCTTCCCTATGTGCTCTGTCCTAGGTTTGTGTACACAAATGAGCAACTTTT TCCCCAATAAATTGTAGGCAAATAGTTCTAACACCTCATAGGAGAGATACTTCAATATTAAGCTTTCTCTCATCA AATACCCAGAATTTGATAGTTTATGAGATTGTGGACACAGAGATTTGATGAAGGGGTGCAATGTACCAGCTCTTG AGTCAAAATGAAACTTGGTTCTACACAGAAGCATCAGCTATTATGGCTTTTGTGGGTGAAAAGTCAGCCATTTAT CTAGAAAACATACCAGGAACATGACGGACAGATGAG CTAAAG CAAG CGAACT TAGAAGACACAGAAAATGGGAAT AAATTCAG TGAAAC CTGGGC CACATCTTTCACTGAGAGGTAGACAAGGGTGACACTTG CC TTGGGCAGGTAAAGA AC CACACAGACATG CTTTGGGAA CAAAACT CATAAGGAATTTTGTAGCTGGCAAGAGACATTTAAT TCAGATGAG CTGATCTGACAGACAACTCCTGGT CATGTGCTGCATAGTTTGGTGTGAGCTTGCCACACCTGCCTTGAGTTCAAT GTCGTGACAGTCAGTC CAG GTTGGCACGGGCATGGC CTGAGACTAG GAAGAGAG CAAAGC TCACTCACCCACCCC ATGCCTGTGCTTCAGACTCGACTCCAGAGTGATTGAAATCTACATTGATATATAGGTTCAGCCCACAGTGATGGC AAAT CTCAGC CCAACAAGGGGCACAAGGCC CAAAGATTATGGGGTCTACCTGGG CCATGAACTGGAGCTTTATCA CCTT CACAATGG AGTACTCACC GC CT ATGT CAA CAG CC ATG CAGA CTTGCTGTT CCTCTAATGAGTGAAATGTG C CG CTGTAAGACTTGTA CGAG GC CAACATTT CAGGAG GAATTGAGAGAGTCGAATAACCTT CATC CCAGGACTCCT GGGGGACTTCCTCCTCTTCAGACTCCTGCAGATTCCTGATGAGCCAGGCAGGACAGGGATGATAGAAGATTTAAC CAACAGACATTAGACAA CAAAACCTC CCAGAT GATCTGATGG GAGACAGAATGGAGTG GT CACAGAAACCAAAGG CATTTTT C CTTCAAGAGAAATAAAAC TAGC CTTCTAAATACAGG GTGGAGGGTGACTG CTCTGGGGACAGAGCAA AAATG GG CAG CATGTG CTCAGTACATTTG C CACAGATGAGCCAACT CAGGGCAC CCAGACTCTC CCTGTAAACTA CCAT CATGACTTGCAG CACAGAGAACTGACACAGGG CTT CAACTACTT TGCATAAATTGGGTTGAATTTTACATG CAGC ATTC AA GTGAAG AGAGTT CTTGACACAG TGCAGACAC A GATCTTGTGT ATTAAGGG CCCCATTTTC CC AAT ATTTTGATATAATATATTTACCTTTT CAATTT CTTTTCTTGCAAAAATA CTAGC CAACATACTACCAACAGATA G GAAGAAAG CATATATA CATCTCTC CCTGGATT TAAACACATGGGAGAGAATAGG CAACAC CAAGAAAT CC CTGTT T G { N ) xAAACCCTGTTTGGCTAGTTCACCTGGCTCATCTGATGGCAAGTTCCTATCTTGAGAGGACTATGAAATT AAAACCAATACAAGTG CCACAAATAACATACAACAT TGTAAATCAG CACAATTTGTAG CTGGGTGAATGGAAGAA ATAG TTCTATTCAT CACTT CCT CATTTTCC CTAAAT CTACAATCTC CAGATG TCACTACTGAAT TAACAG CCAAC AATT CCACAACATTAC CTGG GAGACACTGG CC CTTT TT CTTC CT CTTC CTCATCATCACTTTCATT TT CTGTAAA TAAATTCAGAGAAGCAGGTCACATTAAGCAATTCATACTTCACATATGACCAAATCACTGTCCAGTCATAGCACA AGGACATAACTATT CT CAGTGCAAGAATAAG GATTCTGACAG GAAT ATT CTAGG GTGC CC TAGATTAACTTTGGT GAGAATTAGATGAC CCTGCTTTCCAGACCCACAGGC CAAAAT CTCCCTCTACGT GTAGAC CATAATGC CATATT C CCTG C CTGAGTCAAAGTTAAA CAAAATTTTTT CCC CAAAAAAAT CT CCAAAAATTGGT CCATTTTCTAAGAGTGT
t g c t g c a a t a c g g a c t t a t a t c a c c a g a t a a c a t g g a c a t t a a a t g t t t a g a g g c a t c t a t a c a t g a a a c a c a c a TGATAGATAAATTTGAACAACT CTTG CTTTAAAAAGAATCTGTGA(N)xAATCCACGATGCTACAAAGAAACATT GGATCAGCCATTGCATTGACAGGGTGGAGAACCAGGGTCCAGCCTTGCTTTATGGAAATATATCAGCAAAGTAAA GAAGAAAAGTTTCCGTCCTGATTTCAGGGTGACTGTGCAGCTAAGCAAGCTGACTTAAAGGAGATCCGGATGAAA GCTGAGAGCAGTGAAGCCTGGGGAACAATATTTCCAAATACAAAGGCAAGGCTGCCAGCTTCCTGAAACAGGCAT AGAAACTCCATGGACATTGTTCAGGGACAGATGACTTAATCACAGATGACAAGAGATACTGAATCGAAGCTAGGA GGCCTGACAGATACTGCCTGTGCACCTCCTGCACTCAG GTGACTATGAGATT GTCACACTTGC CTGGGGTCGAGT AACTTGATACTGGGGACTGGCAGACAAAGGCATGACATTAGCTGAGAAGGACAAAAAAACTCCCTGATATCTGTT TAGAAACCCATCATAGTTTTTTATTCAAATGAATTTGTGTTTATAGAGCCTGTCTTCAGAGTTTATCTTCCTCAG CCTAGAGAGAGGTATGAGACACAAGGAAAACAGAGGCTACCTGGGATAATGTGTACAGCATCCTCCCATTCAACA TG AG AGGATG AGCC AATG AGAGTTGAGT CG ACTT TGTCTT CCTC AAAT GTG ATTTTGGTT TTCCTATGTGGCTGG TTGGAGT CATAAGGGCCATGGC TATTTGAA CAAGTGATGG CACATT CCTCCAGTGAGTCC TCAGGGACTTCCTTT TCTTCAGCCTTCGGCATCTCCCTGATGAGCCAGGTGGGACAGAGATGACAGAAGATTAAACACAGAGGGATTGGA CCCCAGGGAGTCCTAGCTGGTTTTGACAGGCGGCATTAAGAGAGTGGTCCCAGAAAGCAAAATGGAGGTTCCCTT TAAGGGGGAACATG CAAT CCTGTTCTCT CTGCAACAGAGCATGG CTGC CATGGGAA CCAGAGAGGAAGAGAG CAG CTGGTGTTCATTG CAGTG GACAGATAGGAG CT GAGGAG 3ATG AAGACT CAGCTATC CCTGTA-’GGTGC AGAC ATG ACACTCGGCACACATAGAGAAACATGACAG CTGCCGCACCCTGTGT CTAAGCTGGGTTATATTT CACATACTGTG GCCAAGCAAATGCGGGTTTTTGGCCCATCATAGATGCCAGAGAGGGTGTACCTCCTAGATATTCTTCATATGTTA CCATCCATTACTTGTTCCTGAGTATTCAGTGTTACCTGGGGGCAGACGATTTCTGCACTTTCTCAGCCACCTCAA CTTGAAC
> H e l_16927 7 Q 8 -16936 89 9
AGGCAGGGTTAGTCTGATATGATTTATTCTAACAGACAGAAGCAGAAATCTGTTATACTCTTTTAATTACTGTGT CTTTATAATATTATGGTAGACAGAA(N ) xAATAAAATGGAGAAGGCTTTGGAGTGGGGACAAGAAGGAAACGGTG GGAGAGGGATGCCTGTATGCTGATATGGTTGATGCCTGTATGGTTGAATTGGGTCTACCGTTCCTCATCTAATTA GCTATGGTCTATTAAGGTGCATAGCTACACACAAATATTGGTACTACGTTCAATTCAGAGGAATAAGATATTGCA TTCTTGACAG TAGACAAGAACACCCT GAATTT GG GGTCAC TG TATCATAAGT CATGTTAT CAGGTCCCTCTAGGA AGGCTTAGAGGAAGATTTCCAGGATACACTTGTGACAACATTGAAGGCTTCTTTTTTCCCCAAAGGGACCCGATC TCCCCTCAGT CGAGAAGCT CCAAGTCTC TGAACTGGATGC CAG GTTATAAATTCCC C CTATACT GACTCCATCAG
G(N)xAGTAAAAGATAGACTCATGGGAGTCTAGGCATTTATTCTCTTATTTTATATAAATCAGTTAATGTGCAGG AA CAAAACAGACTTTGAAGAAAGACACT CACAGTTG CCACAGGAAAACACCTTCAACATC CTCATGAGTCAT CAT GGGTGTTCTGTTGGGAGGACTTGATAGGAGGCTTTCCTCCTCACGGGCTAGTGCAGATCCAGGGGAAATGTCATC AAGTCCTCC(N)xGGGCCCTCAGTTTAGCATTCT(N)xGTGTAACGTAAGTTGATTTCTTAGTAGATGTCCCATC CATTACATTC CCAGACAC CTCACAATGATT CGAATGATTAGTAACCAC CACATATC CCTG CCTCTCAGGGAAAT C CCTCCCGCCTTGTCTCTAGATGGCCAAGTCCCACGGCCTGTCCTCTACTCTTCCAGAACCCTGTTGTTCTCACTG ACAGCAGGGAGGG CAAAT CCATGCAG CAGCTC CCGC CATGAC CTCCAGCCTGCAGAGGATGGGCGCCACAGGACT TT TAAACGCATGCCG(N ) xTGGGAAGAAGGAGGGGATGTTATGGGAAAACAAAAGGAGAATACTAGCTAAGAACG CTAGGTGACATTAATATT C CGAAGTCTGTG CT CATATT CAGCAAAGAAAGTT CAG CATAAAGCACTAAATAAGGA GT CAAGATATTGTACTTC CAACTGTTGTTC CAACAG CTGTATTATGAAGGGC CACT TTATTTCATGCCTTTC TAA TT TGAC CTAAAGTG CCAGGTGGCACTGGGG CT GG CACAGC CTTG CT CAATTATGTGTTGCAGAGTACACAGAGAC TG CCAGGCTGAGGGAAGATGCAAGAGAATAGAAGAGAT GC TCTCAG GGAACAAGAGACCACATGGCCC CAGAGT C AGGGGCAGCATCAGCCACTGTCAGCTGCTCATTTTCCCAGACñGAGCCCACAAGCCTCñGCCATGCTTTGCTTCT GCAAGACGCTTCTTCACCTTTTCAATAAACCTGCCTGAATTTAAGCTGACAGGGTTTATTTCTCCTTCATCATAA ATGAAATTCTTCAC CACAA CAATCTC CAATGAATTTTG GGCACAGCAGGCAGGCCCATTTCTGCTTCTGTTCCAC TATCTC ( K ) xATTCTGTTATTCTGGTTC CTTTTTGG CTACTTTGTTTTTGGTAGCGTGTATCCTAAGG CGTC CAG TTGAACAACTTTTGTCTACTGTGTCCAGGCATTC CTGGTGGTATTTCAGATAAGACTCTCTTGGGTTG CTGAACT CACAAC CACT GAAC CAAT TCTATGAC CATCTGTT TCATG G CCACATGTTTGCTCATTTTATATGTACATAAAGG G AGGGGACAGACAGCAAACTTGCGTGTTACAAATTGTATCATCTTAAAAAGGAAACAAGGCAACACTTTGCAATAA AACCTTAAGATGCATGAAATTTGAGC CTAATG CAATAAAG GATG CC CATAAAATT CTTAT CTAAAGAATGTTTC G AAAATTGTTGTACAAGGACATCATCATTTAAAGTGATATGAAGAAACCTTCTCAGCTAAGCATATGGGCTAGATT AGAGAGAAAAATAAAGGA CCCATCTCTG CC CTGGAAAAAC TG CTGGTAG CAT CTTT CAAAAAGC TCTCTGTGTTT GAGTACGCACCTTGATCCATAGGCTCACATTTGATCCCAACTGGCAGCTGCTTCTTGGCATTAACATTGGATTCC CAACTAGTAAATCTTACCAAGATCTGACTTTCTGCAGATATAATATTATTTTGTTTGACCATCCTTATCTTCAAG GG CTAC CAAGAAGGAACCAAGAATTTAT TTAC CT CC CCAAGGGAAAAGGTTTTACCAATGAGAC CCTTTCTCACC AT GACC CCAGGACC CCATATGCCCTGTT CACTTGAGTG CCCTGTGTGG CCTGATAGAAGC T CATGCTGGTCACAG GATTCCTTATATGACTAG CCTC CTTC CTGAAT CC CAATTT CATG GTGGTGGTCATGACAGGTGTCCTGTATCCCA TG CTCATGTC CCTGAAGT CACCAGCCTATCTC CAGT TAGAAAAAATTACATGTATATAGAGAG G CCTCTTTG GAA GGAGCAAAAGCTTTCTC(N ) xTCGTACACTAATGGTTGGAAGGTACAACAGCATATGCACTTTGGGAAAAAATAT CTGGCATATT CTTACAGAAACAAACAAC TACCTATT CTATGACT CAGTAATT CCTAAGCATTTATCCAAGAGAAA CTAAAAC(N)xGTGAGAAAAAAAGATAAATAATAATGGTTCCAAGAAATGCACAGCAGACAGCCCAGAGGCAAAG ACCCACAGGACGGCGGGCCGG(N)xTTAGGATGCAGCAGCCCCATATCAAGGTTTTGGTGGCATCCTGTAATTGT GT GGTT AGTACTTG GCATTGAAGTGC AC CAAC CTGG AGT C AG AG CAGTTGGAGATT TC AAGGCCTGTG C CAT TT A CCTCTAACCCTGGGGTGCCCCTGGAATACAGATAGCAGATCGGTTAAGGAGAAGCAGCCTCAGCAATCTAGACAG TG CAGG TTTCTGGTGAGGACAG GTAAAAAC CATCTGGG TGGG CAGAACTTGGTGAAGACCAGAAfiCCACTGAGAC TCAGCAGCTGCCGCAGTGGCACCCACAAATCAAAGGAGGGGGCTGGGAAGAGCTAAGGGCTACTGGATGAGCTCT CTGCCT GCAAGACAGAAG CAGATC CAGAGATTTTGGAAAATAATGTAGGTTT CAGTACAGTGTGATCT CTT CAAA AAAGT(N)xGGAAGGAAGGAAGGAAGGAAATGAACAAATTTACATGAAGATGAGAACAGTGGGGAAACTTACACC ACCAATATTTTCCATTAACAGGAACACGCTAAGTAGTTATTAGAGAAAGACACGCTACTGTAAAACAATATACTG TTTCCATGGGGTACAACAACCCCTTCCTCCTCCTCTGAAACACATTCTATCTGTGGCTCACTGTTGCCAGAGACA CTGAGT CTTGTCTTTGGATACGTT CTGGTGCCCACAAGAATGAGATGAGACAGT G GATCC CAGAACAC CAG G CCA CGAACT TC CCTGTTGCTC CTTGTCCACT CCAGAAGCTACCCAGCTG CAGTTGGGGACCTCAG GC CCTGGGTCTGA TGTCATCCATTTGCCTTTCTGAATGGACTTCTCTCCTTGCACTGGCTCCTACTCCCCCAGGACCTGTGGGTGACC ACATGAGAAGAACACAAACAGGCCATGC CCCTTT CTTT CTCC CC CT CTCAATGC CTGCAGTAGT GGGTTC CATGG GGTAGTGACCTGAGATTTACTCATTGTGGGGCCTCTAGCCCAGAGCAGGGCCTACTACCTCACAGTCACCCCATG AATGCT CAGTGAAAGAAGACGTCCAC CACAAGGT CCTGGGGAAC CAAGAATT CCACTGTGGC CCATAAATTC TAA GT CTACAGGATTCTGGAATGGGAGATGGGAAAGG CCTT CAAAAGTGGCCACTTTTAACCCATTA TACTGG CAACT GAGCCATGTTTCCCCATCCTGGACACATCCAGAGGGCACTGCCTAAAACCAGACACATCTCCCCACCCAGGACAG TGTAGGAGCCTTAGCCTGGGGGATGCAGGTGGACAGGGAGGGGGTGAGCCACCAAAGCTGAAGAGCAGAAAGCAG GTGAAAGGGGACAGCAGGGTGGAAACAGAGAGAAATGGGGGCAGAGAATGGGGGGTGAGAGGGGAAGAGTGAGGA GAGGGATGCAGATCTAGCTAGTAAGGAAAAGTCCTGGAGAGAACACTGTCCTCTCCTGAAGTAAAATCACTTCTA CCTGACCACGGCACTGCAGCTCATGGGCAGCACATGCTGTGGATATTTG(N)xAGGCCCTGCAATGTTTAGGGAC CTTGACATCTTCCCTT CACATCTGAGTCATAATACAAAGAGGAC TCTCTGAC CC CACTGAGC TGGCAATG C C TCG GGATTTTTACCTGTTGGATCTGGCAGCTCTTGATGTCAGCCCACACCATGTGAGGCTGCTCTTGGTGCACCCAAT GGGGAAGTTT CTACAT CAGGGCCT CGGAGAATC CA CTGGAAG CC CTGGACAGTGGGAGTCAG CGGCATCCCCAGT GTGGAGGCCAAGAGCACACAGTGCTTAAGCTCCAGGCACCCTCAGGAGGACGGCAAGGGACAATTGGCTGGTGAG AG CCCG GGTCACCGGGAAC CTTCGCCTG GGTCTA{ N) xTGGTGGGAACTAGAGGTGGTTGGGTTTCTGTCATATG TAATCAACAGTCCT<N)xGGGAAAAAAATAAAGAGTCCTGACTAAATACTAGAGTAGCCAGGGAAGTTTTCACAA AGTAAGTAATATTTGAGGCAGATCTTAGTGAACAAGAATTCCATTATTTCTGTTAGGGAATTAAGAGAGTGTGGG TGTCGT TAGTTAATGCTTATTAAAGTAG CTTTGGAATC TCAT CTACTGGTCTAG CTGGTCTATCTGTACACGTAT AT TGTATATG CTGTCT CT CTGAGCTTTCGCTAGGTTATGCTACG GTAACAAAAG CCCCAAAATCTTAG CAGC T (N ) xTACAAGCTACTTTATTTGTTAGATGGTGAAAACTGTGATACTCGGAGGTTGTTGAATATGGTATTAGTATGTT CATTCATTCATTCATTTAAGAAATATTTATTCAATATCTGTTTCATGCCAGGCAAGGTCAAGTACTGAGAATACA CTGGTGAATCAAAGAGACAAAATCTCTAATTGCCAGGAGCTTATATTGAAAATCAGATTAAACACATACAAAATC AT CATAATAACAACAATG AATACTATAT TCATAAATAATAGCTGTAAGAGATTTTAGTACAT CT TTTAAATTAGA AAAATATAA(N) xTTATTTGATGTAGTCCTAAAACTATTATGTAGAATACTATTGTTTATATCACAGCACGTGAG CC CCTTAAATGGCTTAA CACTTATTTAGGTATGATC CATAAAGCTTTTCTGGTAATTAAG TATACTTAAGAACAA TTAAGTATAAAAGAGTTACTGCCTTGACAGGAAGATTGTAAAAATTTTAAAAAGACAAATAAATAAAAGAGTAAA AACTGTAG CT CTGTGAGG CT CAAATAACATCTAATT CAAGTCACAATGAACATCTAGCAAT CATTCTGAACACCA TATAATTCACTTAATACGTTTTG
> H s l_ 989473 89 - 989570 38
TTGTGGTAGTGTGTGGTTGTAACTTATTGAGAGATTATAACCTGTCAGCTGCTGTGCTCGGCAGGTTCTTCATCA TC ATCAT C AT CATCATTAT C ATCATC AT CATCAGTC CC AGGC AATGTTTATT AACTGGTT AT CTGG AAAT GT ACT CTATATGTGCCATCCATTTAAAATCCCACAAACAGAAGTTCAAAAAGGCTCATAACTAGTGAGAAGTGGACCCTA AATTCAAATCCAGACCTATCTATTATAATTTAGCAGCTCTGAAATAGTGCATAATAGGTTTTAAAATTCAATACA TTTCACAGTCTTCCCAAATAGTCCGTTTTCAAAACTGAACACACAATCTAATATATTCTGTACCTATTATCTGAT GGCTGTATCTGTTTGAACCGACTCTCTGGATATACAAGTTACCTCTTAAGAGCATCTTCACATATAACAATTTTC AAGTTTTCAAAGCTTTTCCTAATTGATTAGAGACAGAAGAAGAGTTCACATCACACTACCGCATCAGCTCCAGCC CCATTCCCCTCCCTCTTCCTCATTCTTACCTCTTTCCTGTGACACTGAAAGTTCCCACCCCTTATCGTTGAATGA GGAATACTCTACACATCAGGGAATTTCTTTGGGGACTGAACTTTGGAGACTGTACCGAAGTCCCAAGTGGGTTTA TG AAAG AATTTTTTAG AG AAAATTTATTGTTCTC CCTGGAGG AAATTGTGGT AAAT ATAATTTT CTTT AT CT CTC CTG TTAAATTAACTTGAAATTATT CAAAAAGTAATGGCATTGTAGTTACTGAGAG TAGGCAGAGAACC CATGTAT T CTTTT CT C AAAACC CATGTATTTTTAAGCCATGGTGG CTAAGAAATTCTTC CCACGTAATCTGATTAAAGG CTT AC CACGTACACAGTTTTT CTGTTAAGGGTAGTTGAAGAAAGCAAGA GAATGGAAG GATTGAA GAAGCAGTGCTTG TGAGAGGTTAACTGTG CATTGAAAATGG CTAGTGACTGAAGACTTA GGAATG CACCACATGGTCAGGTAGAGAAT AAGAAAGAAGTTGATCATTTGCTATTTTTACCCCTTGGTTGTTGTTTTTTGTTTTGTTTTG(N)xTACACCTTGG TTTTGATACAGAAGTGCATTGAAGGATTAGAGAAGCAATCTTCCCAAATGATTCCACAAGTGTCTTAGAATGTAA GAGATGGGATTCTAGAGGGTCTGGAGAAAGACTGAATGGTGGGGAAGATGAACAAAATATCTTGAAATTTGGTTC CAGCTT CTAGTCTGAATTGAAGCT CTTTTTCTAAGAGCGTAT CTGAAATAGAAGTGTAAAGGGAAAAAAG CT CAA TTAGTACAAGATTCATT C CAAGTTGAGCTAGAAGTGAAAGAACAGAAAGCTAAAAGCATC CCATATCT CAAATAG TC CTGAAAAGTGTACT CAATTTTGGC CATAGACACGGAATCGAG CACATCTGTCATTTCC CATT GTTGTGAATGT CTTAAG GTTACCCAAACAAATGGT CAAAGAATATAGAAAAAT CATACCAGTTTAGCAGAGTC TGAAAAGT CTACC TCAGATTTGGAAAGCAGCA CAATG TTGT CAGGGGAAATTCGTTATTATTAGATTTAAACATTGCACAT GAT CATT ATATCAAAAGACAAGGCATTCAGTAAAACTTTATAAATCTCATGAAATGCTTGTTGCGAGCAGATACTTTGTGGA AACAAGTGAAGGAACAAAAT CACGGCAGAGTACCAGAGACAGACAGAGGTGT CTAGGTAGGTG GAGAGGAAG CGT AAATTCAGTTGCCTACATTACTGACAAAAGGACTGCTTTGGCAAAATGTAAAAGTATATCTTATTATTGCTTTCT TTGGTTCAAT CATGAAG GAAAAGT GTACCATCTTAG CAATTAAAATAAGTATGC CAATTT TTCT CTGAATTCTTT TACAACAAGTGCCCTAGCAATTACTT CCTCTTTTCTTTATATCT CAAATTTCTATGGGTT TTACTT CT GCTAGCT A CGAA CATGGTCTG GAGAATTTTT CT T CTGTCATCCTACTCTAT CAAG CTACTTTCCTCC CATAGC CATCTCTCT TCCTTTCATTATCACCATACTTCCCAAATAAGGACTCATTGCTTTCTCTACTACAGTACCCATCCATCCCTTTGC CATTTTTTGACTGTTATATAC CAATGATGCTT TTCTTTTAAATG GTAATACTAC CCTGGTATTAGC CAAG TTTAA TGGCC(N)xTAAGACTACTTTGAATAACAGCAA CAGTATCTC(N )xATCATTTCCTTCTTATTTTGTTAATATTT TCTTCCTAATTACAAGTAATTTCCTCAAATAGCAAGAATGATGCGCTTTATTTCATTTACACCCTCCACAACATT ATCCAGGG CTGGTCACAAAATTAAAATAAT CATTATAATGACAATAATAATAC(N)xGGATAATAAAATTGTACT TTGTGTAAGTTATTTCT CAATTAT CATTATAGTAAAAG CAAACATATG CATTTGTATTTATTTC CATATCGATGA GT TT CATG CT CAGAATTTTACT TTTAAAGAAAGGCTGTATTGGAGGAGTGGGAAGACAAGTCAGAGAAAG CTTTT TAGG GTTT CCATCC CTAAAACCTCAG CTCTTTTAAAAATAGACTAT CACTTCTC CCTCTTTCCTAGGTGC CTTTT GAGAAACAATTAAATTATTGGACTACAACATC CAAAGTGTCT CAGAAAATAAAAGTGC CT TATGAGTGTGATAAT GTTAAATT CACGTAATAAGG CAAC CTAAAATAATTACTGCTAAAATATGTACTTGATAAT TTCTACTTACACTAG AT TC AAAACC CTTT AAG CATTTTT ACTATATAAAAAGTATAT CCATTTTGTTTAAATAACTACAGTAATACCAGA TT CTTTTAAGAAAATC CTTCTCAAATTTTTTT TAAT TTAACTGACTGTAGAACAAGGCTTGAGTAATAAT CAGAT TAAC TGTG CAAAATGG CATT TTGG CCTTCTGTATTTAAAAAAAAAAG GAAAACTTATTTCAAACT C CAATAATG C CTCTAGCC TACACATT TGAATT CTATGTTGTT TCAATGAATATATAAG CAATTAAACATAGAAAATGTATTTTTA GAGAAACAGCAACTTTATATAC CTAATGTGGG CACAGT CTGAATTGTC TTTGACACAGAGATAGAC CAAATGAC T TGATATGTTTTAGAAGAGAACAAGAG GTAGAT GAT CAACTGTATGTAGATAAATA CTAAATATACGTTGATATAT TT CAGGTTATAATG GTA CCCTCTACC CTTTCCCTCCTT CATTTGGC CACTGAGGGAAACATAATAACT GCTGATA AT GAAAGT CTTATC CATGGTGAAAAGTCCTAT CTTTACTGTACATTG GGGACTAGCTTTTATATTGCTTGGATAA CT CTAAAC CTTTAAACTGCTATTT CTAGGG CAAATT CT GCCCTC CAAAACTTAATTGGGCAGAAT CCTATGTTGA CT CAGAGGGAATAAAATGCT CT C A { N ) xTTTGTAGGAGCTACTGACTGAGCAGTTTACTCCTAGGGAGAATGTAA CT CACCCTGGTTCC CTGCAGAAACAG CCT C CTGTTATAATCAGGATTATTAT CCTCTGTTTACT CCCTTTCTCTG CCTGAGTCTCTCAGTAACAG CTGGGGGCCT CATTTATAAACAAC CTGTGCTATAAAATGC CCATGATATACACAG TATGAAAACATAGGAACATGGT TT A T C C T T ( N) xGTTTTATATGTATTCAGTGCATTTTCTCCAGAGAGCTGAAG GCATTTCATAAACTTTAAATAACCAG TAGGTATAAATC TATG CCACTGTAAGTTATAG TTATTTAATAGTAATAA AACAACTTGAG GGTGAAGAATTGGTATTATTTATTTATAATTAG CA CATGAGAAGGAAAG GACGTT T CAATAAT C ATAGAAAAATGCTAATGTAAATTTTTCCCCTGATTTTGTTTTTTACTATGGAATTGACTAAGTTTATGAGTTCAA GA CTTTCATTGCCTAGAGTACGACAGTCTG CT TATAAAGTATAC TCAATAAATATTATTGAATGAC CACT GAAAT TATATTTTAATTGCAG AAAATT ATAT AAAATG TGTTGAAATT AACCTC AATTGCTATA GC CTAGTT ATTT AT AG A TT TC CAAG CTTAGGAG TACAAC CTTTTCTTGAATATTTTGGCí N ) xCTCTAATTTCTTGCTTCTGGTGAATTTAC AATTT CGGATAGGTCT TAGTTT CTGAGCTTACTGAATCATTTTC CAGGTGCATTGACT CTACCCCTCACTAGAGA CAACTCTTAC(M )xCATGCCTTTATTCCCTCCCTCCAGTCCTCCAAAATTATAACCAAAGAACACTCCAGATTTC G CGAAGTTTGGGC C ATG CTG CTTGTGGATAGGGACAGT AAAAAAG C AC TGTT C AGGTG AAGTTT AAAATCTTG AT CCGC TGTAACTGTACCT CCACTGGAAATTT CTAACAGACCATGT CCGTCCTCACCTGG CC CAG GTTCTAGGCATT AGAT TACTGC CATTTTT CAAGGTATCTCTTTC CAAC TCTGGG CTCTTCTTTTTCCTA CTTTTCAGC CAACTTA CA TC CACTGAATTTCTAT TTATTT CT TGAATATAA CAATTTAAGTC TTGT TTGTTTAAACTC TCTGAT CTAAGCTAG GACCAAGGAAATGTATCTTTTATAAGTAAGTATATACAGTAGGTAAAAGGTTTTGCCCTGTCCTCTTCCCATCCC AATCAAAGAAA CATTTTTATCT GAGAAGAATT CAA CTTTTCT TCTGTAAAGT G(N )xGTGTTTTACCACCCACAT G G CAAATATTTTCTTATAAT CTTT TAAGGGAAAAGT CTTTGAAGACATATAACAATGTATATATT CATATATATT CACTCACATGTGTGTAATGGAGTAATATGTTTAACTGATTTTACAATGCAGATGATGTGTACACCTTAAACTTAA C CAATTCTAATAATGTAAAACACT CCTTTGAT TGCATT GGTTA CTTTGATTACTTGGT CTTTCTCACCTC TGAAA CACC CTAC CT CCAAACAAGAGTTCATTTTGAAGATGATGTTTGGGTGTTTGTAAC CTCATGATACAGGTGAAAAT GTTCTTGGATAAGATGTTTG CAAGTCTTTT TAGTTTTTCACAGGTGGCCCCTCTCCACCCATGCACTGCTTACTA GC CAGAAATT GTGT CT GTAATTGTTG GCTGGACTGATCTAAAGTAGTCATCT CT CTCAGC TTGGTTGAAG CCCAG TGGTCCTC CATGGCTGTCTT CACCTGATTTGGGGTTATTTCAGG CCAT CGGTGGTTTTTTGCTGTG CACACGGTT CAGCCCCTGCCTAGATTGATTGCCCTGTAGCCTCACTGAAGAACCACTACATCCCTTTTATATGTCCCCTCTGGC AAGCACTACAGACAGATTCT CC CATA CTTCTGTGTG CCTTCTATGAACA CAGAAAACAAGAATG CCTCTTGAAGT TAAAAAGCATATTTGTGTGATGTTGACTTCAAGCTGGCTGCTTCCTGGATCCCCAACTGGGGAGTCAGGACAAGA ACTCCCCACTCTTCTCCCCACATTGAGTTACCAGCATCTTTTTGTCCCTCTTTGATCTTTTTCTCCCAACCCCTC CC CCACCAATACATTCACTTTGAACTGAAGTC CTTG GAGAGGATTA GGAATCACACAC CTGACCA CAC CCTTCT C CTATTGCTACTTAAGA TTGAAAGTTC CTTGTTCTTG CTGCCTTAGGTGGTATTGGCAT CTTCTCTACACATTGGG AGAAGCTC TC CAGGTGTAAACT G TG ATTG (N )xTG TG ATTTTTTTTATG ACAA(N )xACCTCTACCACATCATCT CC CC CACCTC CATCTC CTGTATAGACAGATGAG CCTTAAGAACAA CAATTACTATATACTGAAG CT CAGATGTTA AT CAG GTAAT CCTAGT CTTAAATGTCTGTG CATTTT TT CCTC CTAGATTTGAGAAAATAAAAG G CACAAGAAAT C CTGG CCTGAATGGT CAT CAGAACTTCTTGTGTGTTG CTTCCAAAATTTTAGAAG CACTTTTGAG CAGACTG CTTA TAAAACATGG CTAC TT TCTAGATT CAGCTGTCAAC CACAGAATT TT CTACCACTAAGGACAGTATCAGTT C CTAA AATC AT AG AAGTGATGGTGT CTAATTGCCCAGAG AAAG AATT AAAAGTTAATGT GGGTTTGGGGATTTTTTGGTG TGTGTGTGGC CAAATCAGCAGTTG TT CATAACATAAAAGTTT CATGATGGTACATGTGGTACTACT TTTTTTTAA CGATTG<N ) x GCTCATAGAGTTTTACAGCAACTTTATAGAAATATACCTTTTAGGTATAAAACTACATTCAAAAA ACATTCTT CAGTTGATTAGAA CAAATTCTCATAAAT CTT CTGATTC CATTTATGAATTTC CCACAT GAAAGAGC C
a a a a a a g a g a a t g t c t g c t c t g c t a a a a t t c t c t g g a a c a t c c t c t c a g t a c t c a g t g g a a g a t g g a t g a g a t c c TT CTGCAGCAGATGAT GAACAATAATTATAAGTAAATG GTAGTGTATTTTAGAG GACACACATTTAG GAAATATT TGAAATAAGAAGTAGGGAAAGGAGGGACTTCTGAACAAATAAAAAATGATCTAGAAAGCAAACCTGCAATCCAGC AT GGGAATTT TT AT A C AGG C AAAC TAAAAAAT AGGG CCATTGAGGTTTTT T AAAAT A C AAGAGGGAGC AAGGAAT AACAA CAATCTTATGC CAGTGAGGTATTAT CTTGAATGGAAAAAAG C AATTGTG AT AAAT AAATTA'i’T AGTT AAA GATATGAAACAATAATAGGGTCAG CTGTAGAACACGTTAAAATTCAGAAATCTATAGAAT GATC CTTTACATGTA TT CTGCTTCTGTTG AGTT CAG ATCAAAAT CTACAGC CTTCATTAAATTAATGTCTT ATTTTTAAAATATT CT CC A TGGCAC CATGGAGAATTAAATTAGGCATTAAATTAATG CCTTATTT TTAAAATATT CTCCATGGTGCC TGAATAA GACAGAGGAGTAAGTCTAAATTGGGCCCTTATGCCATACATATTTATTTCATTAAAGTAGGAGAATCTGAAATAA AATTTAAATTACAAATATGGAAGTGGCTTTAGATTTTCATTTTGGTTTTGGC( N ) xTTTTTCTTTTTCTTTTTTT TTTTTGTTCTAC TGATAATT GAAT CAAATG CAAACTAT TAATATGAATAAAATATATTGAGGGTAGTTATGTTTT TG CTCATCTACTTG CT CATT CATT CAAAA CAAT
> H s s l_ l 52507011 -152 51 43 45
TTTGTTTTTTGAGAAAGGGGCGCGAAGCAGGGTGCGAATGGATCCCTAAATTCAGGTCGTGGCTGGCTTTCCCAG CAT CCAGTTT CTGCTTAG CAAAGATCCAAGAGAG CT TC CCCGTAAC TTTTGGATTT TTGT CTGAGATAAGGGTT C AAAAGCAGAG CATTTTAATGTCATAGATAGTT CAGATGGATACATTAGAATTAAGT TATTTG AA(N ) xCAATAAC AACAGTAAAAGTGC TAAAATGCCAGTAG GAAGTG GT GAAAGCGGTG CTGCAAGTGGGTGGGGAC CAGAAACATCA A C TTTGAGATGAGGGCAGGCAGGGTAGC TTTAAT TTGTTCC CAGTGGAGCAGGC CC CTCCAGGAG GAG CT GC CAC CGGTTTGGGGAGCC CACAGGTCCT CCTCCCAGGTCTGGGGCAGGACCCAGGCAGGTTGCGCACCTGAGCGACGGC CCTTGTCCTCTTATTTTCTGGGTAAGGGGCTGCCCTTTTCCCGCACAACAGTGGTCAGGAGAGGCAGTCACTGAC TTGAGG TGAC CAGGGAAGAG CAGGGCCATC CG TTTCTT CGGAAAAAAAAAAT CAGAGTTTTAAAATCAGAGACAT GC C CAG GGCTGAAG C CTGAAGCTGTTTCTT CCTCTGTG C CAGATAAC CTCAG GATC CATAATT CTTTG GAAT CT C A C CCTG C CGAATTACTTT CTTTCAAAAATAAAATTATTTTCTGATAATTACGGTAATGTACATT CCCTAAATTTT TGAAAACCAT GGAAACATGAAAAAGGTGTGTGTGTACACACAATGGAGACACAC TT TACATTCTGTAAATGCTG C TT TTTCACAT CACATGATGACAGTTTGT CCACTT TTAAAAGTG GTTGTAATGAC TGAACAATAT TTTACC CATTT TAGGGATGCACAGTTCTTTAGTTAACTATTCCCTTCGTGTTGAACATTAAGGCAGTTCTACTTCATTGCTCCTGT AAAGAATG{ N) xCCTTCACAGAATACCCTGGGTCTCAGCTTCCACACTGGAGGAGAGGAAGTTGCTGGGAACATC AGACTGG GACAGGT CACC CATCCTTTGT TT CTT C CAGTATTG CTCCTCTG CAAACCTTAGGGCCA CCATCAG CAG CAGCCTCCAAACCA( N ) xATATCTGGCCTATGGGGAAAGTGGGGACTGTAATCCAGCTTGAATGTTGCTGGCCTA CCTGTGTGTAAG CATAAGTCTGTGTCCATT CTTCTAATTC ACTCAAAGGTG C CACATGGATTAG A CG AACGAGTG GT CCTGGTAG CAACTGGGGC CTGAGGGGGTGC AT AC AT AG AAATGG AATACT ACGATGTCGTTTTGATTTTTTA G TTTTATTTTAGGAATA TC CTGGCTTAGATTTGGCTATGATTTGTGCTTTACT CT CATTTCAACA CTTTACCATCA GTGGGAAATATC ACTGTTGCTTTATGTG CAAAGT GATG GTTTTAAT AAGG GGTAAAGAGG CATAGTTAGTTGAC C ACTGCACTTTCATCTGGGACAGCACCTTTCTTAC CCAAACAGGATCATATTGAAGC TATGACTTACATGGTTTGG GAT CTT AGTTGTAT ACAT TT CAAAAAGCGATGTT AAAC AC ATTGAG CTAGGC TG AAAAAAGTAATTTAGAAAGAA TTTTACTTCCTG GTTC GAAG TA A A T (N ) xTAAGTGCTAAGTAAATGTTAGATTTTATGATTATATCCATTTATAA AAC CAG CAGCAT TT CT CATTTCCACTTTTTTCTT CTTTCCAGTCTT CTTACATAAGATAC TTGT TCATAGAATC C CAGTCT CATTTT TC TC CCTG CATC CACAAC CT CT TCAT GTTTAACT CTCC CTAAGG CACT CCAGATATTTACTGA AAAAATTGAGTTAATAGT TATTC CTGGGTT CTTCCTTTCCCACCCTGGTCTCTCAGCAACTTTT CTTCTAGTTT T TGGATT CTTTGC CT CC CCATT CT C TGTC TTATAGGTGGGGAT CCTG GAATAT CTTT CATAACCTGATGGTAG CAG TTGCCTTCAACCAGAGCCTGGGCACTACATTACAGGGGAAAATCAGGCCTCATTGCTGTGGCTATGAACTGGTAC CCCTCCTCTTCCTGCCCCAACGTAATCGTCCTACAGGCCCCTTCAGAGATGTTAGAACTTCTACTTTTCTGGTCT GGAGTTGAGCTCTGTTTCTTGTGC CAGG CAGTGCTGGCTGGAGTAG CAGAG GGGAG GAGACACT CACAGAGCAG C CTGGGCTGTA GCAAAGGCAGAGAGTACCACGTGGTG CTGATAAAACAAAGGC TTTATTTTTTCATAGGGT CAGAC ACAGGTGATGACAACT GG CTGTGCAGATATTG CACACACTGG CAGATACCAGGGTTGATATCCT GTGGGATC CTG GAGAATGGACAGGAAGAGCCACTAGGGATACCTCAGAGCTCTCTGCGGGGGAACTTCTCAGTGCTGACCAAAGGA AAGGAGATTT GAGTAT GAGAGAGCAAAG CAGACCAGGATGCT CCTTGGACTTTCTTTCCT CTAAAGTTGC TTATT T CAGCATAAAGATC CAGGTCAGTGG CAG CCTGAG CC CC CACCTTGCTGAC CACTG C CCCTGCCACAGAAGTT GGA GCTGTGGCACTTGCAT TGGCGGGACCTGTGGCAC CTGTGGTGGCTCAGGGAG CACAGACACTTT GTGTCCCCTCA AAGTGCTCCTGTCACTCACTCTCCTCCTCCTGAGATCAGGTGTCCACAGTCCTCACACACACAGGATTTGTCTGT GATGTC CCCT CAAA GAGGGGACACAAAGTGTTGGAAAACACA GTGCTATT TTCCCCAGTTTTGCTGGT GATG GA G GCCAAGAGAATGTCAGCTGGGCAGAAAGTCAGTCTAATGAAGCTGTGTCATTGTGGGGCTCTGGGCCTTCTAAAG GGTCCACCTCAGTCCCTAGTCCATGGCTGTAGATGGAGACACCACCTTCTGTGTTTCTTTATCATTTGGCTTCAG CCTCCTCCCCTT CCTAATGG CCAGCACCAGCCCTCCTCACTC CATGAAATTC CT CC TACT T CAAGTCC TT CATT C CCATATTCCTTCATTAATAAATTTCTATTGAGACTTAG(H) XGCATCAATTATGGGCCAGGCATCATGGTCAGTA GT AGTG ATAGGGTAG GTAAG AG AAAATC ATG G CATCTGTCTT CAATGAGCTT AAAGTCTAGCAG ATCT AACAAAC ATAAAAGAGTAAATTGTTAGGGATAATGGACCCCAGAAGGAGTTCCATGTGCTATGGGCATGTAGAGAAAAGGTA TTGAAGTCTATACTTT GT TG CAAACTGAATATTTATTTTTTCATAT CTTG CAGTTATCATAAG CAAGC CTTATC C TC CAGACAGGG CAAA CATGG CAGC CATTTTCCTC CATGGCAG CCATTTTCCTCCATAGAT TGCAA CCCTGAG CCG TTGGCAGCAGGGGAGGCAAGAGGTAGGAGCCAGAATTAAAAAGGAAAATAAATCACACATTTTGGGCAAGTTTCA AAGAAT CAAAATAT CTTTT C CAATGACTTT CAGTACAT CGTGTTAC CTCTGTTTTATGAAACACAGTGGATGA CT ATTAGG CTTT CTTC CAGC C C CATT CTTGACTACCTGTCTTGAAAGC CTCC C CACAAGACCTGGAGAGCTCATTAA TCTCCTTT CTAC CTTAGTGATT GCCCTTGT CACT CACT CT CCTC CTCCTGAGAT CAG GAGTC CACAGT CCTCATA TG CACAGGATTT GT CCTGTGAT GACACATG GT AG CTATAAGAC CATGAT C TG GT CACT T C CCTTA CAG CACAAGG GCACCTCCCTGAGA( N ) xAGGAAAAATTTCATGACTTAGTCATTCCAATTGGTACTATAGGAACAGGCACTGCAG GAC CT CAGGTTTAG CTCTTTGGAGTACATTTTGGA CATT C CACATCAG CAAT CAAAGATGAGTAAACAAACAGAA T T C T T C T C ( N ) xCTCGGTAGCCAGAAAGTGCCATATACCTTCAATGTGCCTAAGTTTCTTATATGGATTCTAATT TT CC CCTCAGGTTAAGTAGTATTTTGAG CTATTTATAC CT CTTTAGTCTC CTGAGAAAGTTCTGGTTGGACCCAG TGTGTAAAAGAA TGTTGGACTCGAATTTTTTT CTTGTGTTGGGATGTTGG CT GGAGTTGAAAGATAAAAATAATA TGTGCCCCTTATCCCCTTATACTAAAGTAACCTCAGGATAAGCTGCTGAAAATCTCTGCAGGACTTGCTAAGAAA AAA C AT AACTCT ATTTCT CTGGT C CAGT TATATATCTGGAAG TT TG AGTT TTGATAAAAATTTTTTAGGT AGGAG AGTTATTGTCCTAGAAGACAGTACTATGGTTTCACCCTTATCTTGTATAGGAAGGGAACTGGTTGCTCTCTTGTA ACAGAGAT C AATGG AAAGGG GAG G ACTC AT GATT CG AAA (N ) x T AAAG AG AT AG AGC A GG ATGTTGAG AC AT AAT AGTAGACAACATCTGGGAAGGAAGACCAGCTTGCACATTCTTCCTAGTGTTGTGTTTAGAACG(N ) xTATTTTTA AGGTTCTAACAATGTCTTTATAACAAGGGATCAT CTTTGGGATT CTCCAC CTTTTCTAACAT CAATGAAATCAAT AAATGCCAAGATT C CATGTCACAAAGGACAAAAAAAAG CAGGTT CAGGGATGTT GGTTGAGGGCTTCAAAATGAC CACATGTGGGTAATGCAAAGGCAATAGGAAGTAGAAACTGGGAAGTTCTTGAGGGTGGGAGACATGTTGGGATTT TGGG GGCCAGATAGGAAGT C TT CTAGGGAGAGGAATAT CTTAGGTCAAGAGGTGGAGGAGAGGAAGTTG GGTGCT GGATGGTGAGAATACTGTGTGCTACTCACCTGTAGTCCATGGGGAGAAAGAATAAAGGAAAACTTTTGGGGAGTC AGTAGAAGACCCTGATCAATCTGACCCTTGCCAAGTTC
> H s l _ 2270551 77 - 227067 78 7
TGTGGTAACTCTGGAAAT CAGATC CTGT TAATGCTTATTAAGGG CTGCAGTCTT CTGTTTGTTTAGAT CATGCCT TAGTCTTTCTGGCCACCAGCCCCTA(N) xACTTCCTAACTCATTTATTGAGTCATTATGCTTCACCTTAGGGCAT ATGCAGCCTCTT CACATTAGGG GAAATT CAGC CTATCAAGTG CAA CACTGAAAG CTAAATAACAA CAACACAACA ACAACCAATCAGGGAT{ N ) xTATAATTCTGTTAAATGTAATTAATTTCTCAAATTTATCAAAGATCATTATTTGG AC CATTATTTTC TTAGCCAAAATG CTGG CCTG CCCCAGTT CCAT CCAG GGTTAAT CTG CATTTAAACAGCTCTTT GAAAATATTTGGTTTTC(N)xTTTCACCTTAAGTTAAAAAAAAAAAAGTAAAACTACTCAGACAACGCCAAATTA TTGACAAT CTCAAC CTAC CC CACAGACC CCAAAT CCTGGAAC CACAAC CC CCTAGGCCAATT CTCAGGTCAGGCA GCAATTTATTCCTGTTCAATTTTATGCAGAGCTCCTGGTGGCTCTGAAGCGCCCTAGGGAACAGACCAGGAACAT TCTCATGGTGTTAGCTCACATTGGGCCATTAGTGCTCCTTTAATGTGAGAACAACCGGGAGGAGGAGGGGATGTG GACC CAAAACTACAAGAAAGAGTGTCCT CGAAG C CTATGT CCTACCGC CC CACG CTGCTG CCAGG CCCGCAGGAA GATGACAGGCCCGGCCTCCACTCCTTCTAAGGTCGTCGCTTAGTTCCGACGTCGGGATGACCCTGTCATCCACGC GGCGTGAAGGCCACCCTCCCCGCGCGCCCGGGACTCCAGGTGGGGCCCCAGTGGACGAGGGAACGCGGCGTCGCC CACCGGGCGTGGCCT(N)xTCAGCCAGCTGCGTAAACTCCGCTGGAGCGCGGCGGCAGAGCAGGTGAGCGGGCGG TG CCGGGGGGTG CC CAGG CCAGGGCCCTGTCGCCTGCGGCGCTGAGGGCCCGGGGTGGGG CTGCG CCCTGAGGG C CCTGCCCTGCCCTCCGCACGCCTCTGGCCACGGTCCCTTCCCCGGCTGTGGGTCTGCGGCCCCTGCGTGCGCAGC GCTCCTGG CCTCTG CGGC CAGCG CGGGG GCGGAGAGAGGAGAGTGCCCGG CAG GCGGCGGCTGGGCCGGCCCGGA ACTGGGTCGTGGAAGGAT CG CGGGGAGCGG CC CT CAGG CCTTCGGCCTCACTGCGTCCCCACTTCCCTGCGCCCG CCTGCCGC CGAG CCCCGGCTGGGGGTGGGCGCGGCGCGAGCG GTTAAAG GGC CGGTGCATTTAAAGGAGC GGTG C ACGTGGGTCTCTGAGGCGTGTAGCAGGCGGGGGCGTTTTGTTCTTCTTCTCTCTCGCCGGAGACCTCCGTTGCGC CGAGTCCATTCGGCCTCTAGCACCGGGTCCTGGGCATGCTTTCCCCGGGAAGGAGGCGCGCGGGGGCTCTGCCCG CACGTGAGGGGCAGGGCCGCAGGCTCAAGCCTAGAGCCGGTTTCTGTTAGCAGCGGTGTTTGGCTGTTTTATCAG GCATTTCCAGCAGTGAGGAGACAGCCAGAAGCAAGCTTTTGGAGCTGAAGGAACCTGAGAC(N) xCATCCCTCCG AG CAGACCTCAC CC CTGTTT ATTG CCTT AATAAGTATT CC CTTTGAAAGGTATG AACGGTGTTG AGTG AAGT AAC TG CATCCCTATTTACAAATG GAGAACCTGAGAGCATTC CATAGAGACGATTGTAGACTAACT TAACTCAGAAGCG ACAGCCTG GGGT TG CCAAGG CTGT CTACGAAGTAA CTTGATTAG GACCGACC CCAGCTTC CAGTAAGGAAGCCT C TGATGCCTCTGTAGCCAATTCTGCAGACACCTGAGCCTCCAAGGCCTTCAGCCAAGACCTTTGGCGGTAATTGGA GT CT CGGGATAAGC TGCT T CAGGTGTGTGAGC CTCAGGTTCTTCTCTC CTGAATGTGGTTGTGG GCAGCCGGTGA CTGG CGCAGGTG CAGAAGG GGCCTGGTTCTTGGCCCCACCTCAGAGCTGCGTCCT CACGACG CC CACGTTGAG C C TTGGGTTC C AGGGC AG AG ACTGGAGTG AGGGC TTGGGGGC ATGT TGCT TTGAAGTGGG ATGG AT GTAT CAGGTTT TTGG GG AAAACT CTGTAC CCTTTGGTGTTGAAGTG CCCATGTGC CAAG TCTTGAGTCCAG CATG TTCACATGTGG GG AGTG AGTGGCTTGTTC CTGT CT ATTTGAAAGAG CAG CAAGGAGGAGGAGG AG CAAGGG CTAGGGGCTGCTGCT GGGGTGCCTGGAGCTGTGGTGCATAATGTCACAC CTGT CT CC CCTCCGTAGCTG CTCACCGTCC CCCCAAGGGGG GTTTGCCT CTTG C CTACTTTGG CC TTTCT CTGTTATCGATGTTAATAATGACATATAT CT CG CTTATGAGTTGGT CATAATAAAAAGCTATCTTGTACAGAATATTAGAATTTAAGATCTTAAGAATTT{ N ) xGTTTTTGCGCAGGAGGT TCCTGTAT TCAACT CCTAC C CGTGTCTCTC CACTACTG CTGGGAAAGTTT TGTGGAGT CC CCATGAGCAACTTC C TGACAAACAAACAAAATT TT TTTAAAGAAACCAAAGCAGTGTGTGTAGGT CACATGCAGTGTGT CTAATGAAAAC ATCTCTGGCGGGTTTTCAGCTGTTGCTTTGAC TTTCGGACACTGTTTAGTTGGGGACTGATAAGACAG CAAATAT TT CTGCAAGTAT TC CCAC CTGTTCTATT CC CAGC TGCCACAG CT GCG GAAAGGC GGG G GT GAG G CTGAGAGGC C C CGAGAGGAA CAT TTT CCACT GGGCTCCAAT CCTGGAGATG GGAT GACCAT CATGTTAATGTCTGGAGAAAAGAAT G ATTTCA(N ) x TGGTTTCACATCAGTCCTCAGGAAAGATCAGATGTCAGTGAGGGAGATCATTTCTTGAGAGCCT CTT CACTG AGTGGGAGAATGGG CTGCTTGTTC AT CTTTGTGAAAATTCTAGAACG GG AAG AAC AATTC AAAGGGT GT C CACCATTCTGCTGTAC CTTAAC CAGAAACTTACTG GACT CTTTTTAAAATAAAAGTAATT CATGTTTATTCT AGAAAATTAG GGAAAAAAAA( N ) xGAGCATTTGAATTATTTATCCTACCAGTCCCACAGCAGGACACTTTCCCAC CCTCCCTTTTGCCCCAGAGAAGCAGTGCCCTCTGTCCTCCCCATGCCCATATGTGGGCACTCCCCACCATGGAGC CAAACCTACCTGGGCAAGTAGCAGAGGGAGAGCAGAGTGAGCCCTGGGGGCAGGAGAGAGACTTGAGAGTTTTGA G GTGACAGATGAGCTGGTGAGTGAGTGATTAGG GAG CATT TCTTGACACATACGTG CC CGTG GTGAAGG CATGTG TCTTGTGAGTGTGCTCC CAGAAAGCCTGTGTAGTGT GTGGTGGG CCTGCCTGTG TGAC CAAAC C CTGG C CACTGG GTACGT GACC CT CACAAGTGCTGACTGGGCTGAGAAGAGCTC CTTGATGGGCAGTTTGGAGACTTGAGTT GTAAC TGTGGG TTTTGG CCATGGGAGATTAACTGATTA(N ) xGATATGTCCTTTCATTTGATAATATGTTTACGTGGCCA TAGT GCCTGGGGCTGGGC CGGGAATGGAAACTTGAT CTCTGGGG CCTGGCCTTTGAAG CCAGTT CATGTGTCTGG TGGTTCAGCAGATCCGTAACTTTCCAAGAGGCACATCCATAGGCTACCGTGTCCTTTCTCACTGTGTCCCTCCTC CATTTCATCTTCTT[H) x TAATCTTGCAAGGTATGC CTGTTCTGCTTTTTACAGCAGAGGAAATGAGCTGTGTCA GATTAGACTGTCTGAGGCCTCTTGGCCAGGGAGTATGTGGTTCAAATCACATAGGCAGGCGATCTGAACCCTGTC AGTCT C CAAAGC CT CTGCTTTTGACCGCTGACTTGCTG CTGCTTGTTTAAAAATAAATGTGTTT CTGGAG CCTAC TCCAGAGGGGCGTGCTAGGGGCTCCCTCTCCCACTTCCCCACAAACCACCCTTTTCCCTGGCTGCTTCAGGAAAT GAGAGAAC TCTGCCTGGG CCCCAGGCACTT CT GAGTGG GACAGGG CTGTTAGAG GTAAGT CTAGAGCCTG GCCCA AAATTCAGGAGG CC CCAT CAGAGGGCCC CTGGGGCCTGTGGT CCGGGAGGGTGGTAGGGCAGTACCTCACTTCCC TTTGAGACTCAGGCCCCAGCTCTGGCTTAGGCCAGGGAGAACCATCCCCAAGTGGTATGTGTTACTATATGAGCT GAGATGGATGGT CAGCTG GAC CAAATACATAGTCGGGTAC CCAGGGCCAGGGGGAGGAAG GTGAGCAGGGAAGCT GTGGGCAATTGT CTGGGTATCACCTGAC CTTAGCAAACTC TT CC TTGTTTTAAG CGAGGACGTGGGACTT CTCAG ACGT CAGGAGAGTGAT GTGAGGGAGCTGTGTGAC CATAGAAAGT GACGTGTTAAAAAC CAGCGCTG C C CT CTTTG AAAG CCAGGGAG CAT CATT CATTTAGCCTG CTGAGAAGAAGAAACCAAGTGTCC GG GATT CAGAC CTCTCTGCGG CCCCAAGTGTTCGTGGTAAGTGCAGTGACTCCCAACCTGCTTTTGAACCCTCTTTTTCCATTAGGATTTTCTCCG TGGAGG CAGATTT C CATGGGAGTTTGCTGTGG CATT TT GAAATCTGTTTCTTAC CTAGTTCCATTGGC CTTAAAT GTTAAG GC CAAAGC CT TTACATTTCTCTGTAATGAAAAGAAGGT CGAGGAAATT GGGTCATTGGGTTTC CATAAT GATTGCAGGAACTGCTGACACAAGCACGGCTGGGGAGATTCTCTAGGTCAGACTCCCTTGGTTTGGCTAATTCAG CAGTTTGATC CCATTCAG CTGATTAATGGGAATGTG CAGTGG CTTCTTTGGATGTT TGATTTTGCATC CTAATCC AAAG CAGC TATOAG CCTCAGCACTTCCT TGTT GGAAGG CTTT CCAGAACGTAGT CTAT GT TGGACACTTC CTTCT GCCTCTCTGCATTTTC CTGCCACTTCTCTAGAGAATGGG GTG CAGGGGGTGGGAGACGG G GAAAGCTGGT CGCTG AGTGGCTGATGGGACTTGACATCACCCAGCCCCACCCCCACCTGCCCGTGAGTCAGCCTCCGGGGAGAGTTCATC G CG T CACCGG CACT CTAATGTGGACAGACACCTAGCAGTGTTGT TTATCTGCACACGTTTGGGT GGTGATTTTTC CCTC CAAGGATT TCAGAG CACCAGCAGG CTTCAGAG CAGACTTAGGTGGCTTGCAAAG CAGG CC CT CAGGAATTC AGAG GGTAGCAGAAGT CCATCCCAGATG CTCTGTTTTC CTTCAGGAGCTAGGTAAATCAGAGGGG CTGAGGGACA AATGAAAAAA GTTACAG C CTTTGAGTCC CATCTG CTCCTCCTGGCCAATGAGAGGGGATCTGGGAGGGGCAGATG TAGAGGAAAATCTGTCTAAATGTTGATG CTCGTTATTTTC CTTTAAAGAATTAATAGC CTAAAA TAAACC CTACA GATACAGTC(N ) xTTTATAAGGACATTCAGATGATCAGGTTCGTAAAGTTTTATGTTCGGTTAAATTTAACAGCG TGTCATTGTTCAGGTTAT TAAATGTTTGAAATAAGATTTTTGGTGGTCCTGTCACAGT CT C CATGAAG TAGCATT TCAG GATCGAAAGGTATG CTGTGTTTAAAGTGTTGATT CTTACT CCTTTCAGTTAAGG CCAGTG CAGT TTGTCCA GGTAG TGACTGAGACC CAGTTTTTCCACACTCT CCTCCGCAGTG GGCATTGTTTTG GGCCTTTTTCAGCC CAAGA GCTCTCTTCTCCCCATGCCGCTCTGCTGGTCTGAGATTTTTCCACTCCTCCTCCTCCCTAGTTGCTCTCTGACCA GACT CTAGGTAT TCAGGAGAAAGTGTTCATTGT C TCACTCTCTCATGTGGCAAT CAAGTAGTGC CAAG CAGTGAG AGGGTGAAGGTGGGTGGGTGAGGGACACTCACCTTGCTGAGAAAGGGCCCCAGCCTGTTCGGGTGATTATAAAGC AGAGACAGTGCCAGGAAAAGTCTGACACTGGCTGAGAATCACCCGGGGACCAACCATCCCGAATGCGGATCCCTG ACACTGGGTGAGGATGGAGCTTGGAGATCTGCATTGTTAATAAGCAGCCTAGCAGAGTGGTGAAGAGTCCAGACA
{ N) xCAGGATGTTTAGTGCCGGCTAAGGGCTCAGCAGGTGCTGGTCCATCTCCACCAGCCCCCAGTGGCCTGGGC CACCTTTGAGAAA CAGTGATCCTAAGGGATTCAG CATTTC CTAAGTTGGTGCCT CC CACCTGTCA C CC CCACCCC ACCAGG CTAGGAGGGTTGTGATTAGAGGGTGC CCTTGC TGTGACAGCTGAGACTAG CTCTTCCCTGAT TATTCCT TAATGACAGCTC TCTC CT TCCCTGCTTT CTTGAAGT CTTGGTCC TCGTTGTTGTG G GCACAG CT TCAGGGGAGGC CTTGGAGGAATTTTTGAAAGTGGAATGAGGGAAGCAGCCTGCTCAAGGGAACACTTGTTTTCTGGTGAGGAGGCC GCATGTATGAATGA CGTTTGTGGGTTAGAAAG CATGTT TTGTAGTTTTTCCTTGTTTCTT CCTGAAGACATGTCA GGTCTTGATGAGAC CGGGCCTGGGCACAGGGCAGGCAGTCAG CGAGTGTGGATGAT GACGACAGTGGT CACCAGG TCACTGTCTAGACCAGGTCACTGTCTAGCGCAGTGTCACATGGAAAGGGTATGGTCCTTTAACCCTACCCTCCCC AGCACAACTATCACAGATGTCAGGGAACCTCTGCTCACAGAACTGCTTTCCAGGGATTGTCTTTTT{ N ) xACAGG CATGAGCACCGCACCCAGCCCCAGGGAGCGTCTTATTAGTGGTTGGCAACTGAATGGAGACGTGGGAATTGTAAG GAACTGATTCTACTTGAT CCTGGGTCCC CTGCTTCTCCAT CTTCACCCACCCAT CAGCTC CCTTTCTC CTTTAAA CAGG CACC TTTGCTCT CTGCTTATCCATTTTT GTTGTG CATTGC TATTTGGGAG CC TAAGAAA CACAACATCCTC TGAATG CT CCAG CTGTTGTGGGTCTGAAGGGTGAGC CTGC CCTCTGTCATTGGAGG CTGCAGCC TGTGGCTTTTT AGGTACAG GGACTC CCAGAACTGCTCCT CCAGTCATAG CAGAGATAAATCACAGGAGCTTAAGAGG CATGGGAAG AACAGAGG GAGGAGAT CGTAGCTTCCCTGTTCAT T CACAC CCAAAACAAAACTGTCATACTAGAAAAG GAGGTAT TAAAAG AG CC AC CTGT ACAGCCTCGT AT CT CATC CAGC AC ACTG CTGCAG ATGG AATATTATG ATTTAGCTTGAG AAAATG CAGCAACT CTTTGTTGTGGTGC CC CT CT TTGAGTAAGAGTGAATTCCC CATTGC CAGAGTGGATAGTGA GGGAAACCCTGGGTCCAGGCAGGAGTCTGTTTAGGATTTATCTAGTGAGGCTGAGCCAGAGGAGGACCTTACAGT TTTTTCTCTT CAATTT C TTTTA TTTA { N) xCTTCTCCTGCATTTCCACCTGATAATTTCTCTCATTTCCATAGAT GATGAAGGAACTAAAGCCAAGAACTTTCCñAGGTCCTGCAGCTCTTTGGGGGATGTGAAGCTGTGCTCTATTTGT ATGGATTTTGCTGGTTCC CAGAACTTCC CTGTGGCCCTGGGG CCTAGTCTGAGG GTACTCTGAG TGAAGAGGGAG GAGGGC CCACAC CT CTT CTGCAAAG G CTGC TTTTGTAAAGTT CACTTCAGTT CACATCTT CC TC CTGGTCAGAAA GCTTCGGGGGCTCTCCTCTGCTGCATTAAGCTCTTACTCCTCCATCAGGCACCAAACTCCTCCCTGGCATGGCCC ATCCTACCAGGTCCCCACACTTGAGCCACATCCAATTGCTCGATATTATCAGGATAGGTTATGTTATGTTCCCAA CTCATATGTTTACTTAAGTGGTTACCTCTTTCCAGAATGAGCCCCCTCCTCCAAACTCTGCCTGGTGAAATATTC CTAACCTTTGCAGCTTCACATCCCTCTTACTTCTTGTGACCTGAGGCATCTACTCCTGACAACTGATAGACTGTG TCCCCTCCTGTCGGGTGCATTGTCCTTGTCACTACCCTCCTGGCTTTTAGCTGGCTTTGCTTCCCGCTGTTGTTA CTCCTGTACTTGTCTCATCTATCCTAAACAGAAGGTGCTGCAGGCTGGGGAGTTTGTTCATGTTGAAATCCCTGT GATGGAGGTGAGCAGAGGCAGTCTCTGCCTGTGCCTCTTATTTGGGGATGAAGTTAAAGTCCCTGTAGGAATAAT CCAGGC CATAGC CGGGGTTGCTGT CTTCAGAAAGAAGGGCAG CCACAG GT CTTGTTAAGG GGATTGAAATTGGCT GACTTGGTGGAAGGAACCTGCCTGCTTTGTTTAAAAACCA
> H s 2 _ 15861648 -15873 720
GAGCAAGAAGGGTGTATTAGTAATTATTCTGGGACACCAAGGGTAACCTAGGGCTTCCCTGGACAAAACCTACCC TGGAAAAATC CTGC CCAGCATCCC TTTCAGACACTCGGGAATAT TAGGGAGC CTCTGACT TGGAACAGGCAGGAA TCGTGTTTCACTTTGACTCCATATGGGGGCTGCAAGGGTAGAGATCGGGGTCTGAGGATTTGGCTAACGGGCCAA ATGTTTTGGGTTTGCTAAAGCTTCAGCCACATCATGCCTCGAGCATTTTCCTATCACTCTCAAATTTGTGTACTT TAGGGC AAAGTG ATTTCAT ATGGC CCTAGC AC CTTC ACACACG G CT CAGTGAAGC C ATTC CTGG ACAGGCTTCTT GCTGTC GAATGAGC CCTCC CTGTGGAAGGCGC CTCTCTTTGAAC TCAGACGC CTGTTTAT TTTTGAACTG GAAAC CAGAAACTTñGAATTCCCCAGAGCCACAGGCTCAGATCCTTTTTTTCAATGATAACCTGTGGAAC(N)xTTGAAA GAGAAGAGAGGGATTTGCTGAAGGTCACCTCGTGTTTTCATTCCTTTGCCATTCTCAAGAACATTACCATGCCCG GCTCCTATAGGAAATGAAGCAATGCGTGAGAATAAAAGCCATTTGATCACACACAGCCTCAAGTCACTCACAATT CCGAAAAG CCTTGAGGCTGTGTGCAGAC CCGCGCGGCAACTTTTAGAGGT CT CACCTCCATTGC CAAGTG CAGAG CCGAGTTAAAGACTGTTATTATG GAGTC CAAGTG AGGACAAG AAGAGT CACACTCACCAT CC AG CAAAGC CAGAG AAGGTT CT CGTAGC ACCC AAGGATGC AC AT TGTGGTCCCCTGGG ACGCTCTCTGTG ACGñTG CC CC CACTGCTCT CCCTACCTTGCTCAGCTCCCTTCTCCCAGGAACATGGTTGCCTGAGTGTCTCTCCCCACCAAGGACCCCTTAAGC AAGGCCTCAGGGAGGTGGCGTGGTGTTTTGGGCTTGAAACCTGATTCTTTC(N)xCTCTTCTATGTGAGTCGTGG ATGCTACGGGACATGATGTATGTA&GAGTGTCTGCTCGCTGTACCCAGTTCCCTGTCAGTGCCTAGTGAGGTTAT TATTGCTCTCACTTTCATATGTGGTTTCTGAACGTTGGCACTATTGGCATTCTCTGTTGCGGGTGTTATTCCATG CCTTTTAGGAGGTT(N)xCTTTTAGTATTACCTGGAGTGCCTAAGATTCCC(N)xCAGGCCTGGGGAAGCTGTTA GACCTATGTTCATTTCACTTGTTTTCCTCCTGGAGGTGCTCACCAGGCATTAAATCTCACCAGGCATTAAATCTC ACGTGT CC CTGTGT CATTGCAAG GCAGTGCGGATTTCAGAAT CCTGAAG CTGAAGCATGT CAAATTGAAACTAGG AGCAAAATGTTTCGAATCTTAATTTTTTGTTTCAATTCTTGAAATATTGCTGTTGTTCAAAAGCATGCATTTCTA CATTGCTCATGTGT GG{ N) xGCGTGAGCCACTGAGCCCGGTGCTCATGTGTTTGAATGCATTTGGTTACTTCGTT TATTCTCTTCCCTGTAGCTTCACAAGTAATATCTCTCTGTCGCAATTCTTGTGCTCTTTAATAAACTGTTGTGGT ATCTGAGTCAA (N) xTGTGCCTGGTGCCTCGGGACTTGGTGGTGGCTGTGGTGAGGCGTGTGGGATGGCCACGGT GTGGTG( N) xTCTGGGACACAAGGGTGACCCTGGTGAGCTGGGGGCGAGTGGGGGGTGAGAGGGCCCGGCAGGGG CTGGCAGTTTCTTTCTCTGTCATCAATTTCTCTTCCCAGGAGTCTCCTTCCCAGCCGCTTGCTCCTTAGGTTTGA ATCTCTCCTCTTTCTTTGCCTTTCCCAGCTCTCTCCTTCTGCTGTCTTTGATTCGGTCTCTCTAGAACTCTCAGC CTCTCTCTCTACAAGACAGCAGAAATCTCGGGGGAGTTGGAGAGGGAGGCAGTGAGGATTTTCTGTGTGTGATTA TTTTAGTAAG CTAG CCCAGGTTTT CACC CAAACCCACTGTAAATAAATGCTGTCCTAGAAGCTGGATC CG CTGAG AGCGCCTATCTATCCCAGCTCTGCCTGATCCTCTTTCCTCGGCTTGGCTTCTCCGTCTCCAGGCCTCATATCTCC CAGTGCAGAG CCAAGCAACATCGT CCTT CC CACCTCCACCAT CCTG CC CCAAGGTCAGTACGTTGTGG CTGTAAA GGAAG GAGGGTGTAAAGGAAGCAAGG CGACTCTGTCCATGTCTGTC CT CC CTGATAGCAGACGCTGTGGG CCTTC CATGCCACCAGGACA(N) xAGACACCCACACCCGTCAGCTGACCCATCCCCTCTCTCCTCTCTCTCCCTGAGACC ACCACCCTGCACGTTCCCTCCCTCTGCTCCCTTCCTCCGTCTCCTGGCTCATGCAGCAGGTCAAGCAGCCGTGGA CGGGCGGCTGTGGGCTGCTGTGTTCTGCAATGTGCACCCAGCATTCAGGGCCCAGAGCCTGCTGCCCAGACCCCA GGGCCAGACTTCCAGGAACATCCCTGGAAAGGGAGGCAGCCTTGAAAAGTCCTTGATTTACTGAAGTTGGTGCCC AGAAG G CAGTGCTTTCTGCAG GAAGAGAGCTTGTCC AGG CTGTG AG CAGGTG AGCAGAGACGTCTG CAGG AGAAA AGGCATAGGC CAGG CCT CAAGGAC TGTGTC CCACAGGCTGCTGGGAGAGGAG CAGGGAGCAGAGAGAGAGGGGGG CCTGGAGCTTGGAAACCGGCAGAAGAAGTCAGGAAGGATTGTGAAGAGGGGGTC ( N) xTTCCAAATAAGGTTGCA TTCTGCATTCCTGGGAAGAACATGAATTTGGGGGTGCTCTTCAACTCAATCCAGAGTCTAAGGGAAGCCAGGTGG GTGCAGTCTTGCTCAGGCTTCTCTGGACGTGCAAAAATAAGGTCCCAGAAGACAAAGCTCACATTGGGGCCAGGC TGACATAGGTGTCTGGGCTTTCCCTGCAGCAGGTAGATACCTGCAGAGTCTGTGGGTCTCCACACAGGGACGCAC ACTCCAAGAAAGGC CCCñGGGCAG CCATGC CT CCTTGTGATT CT TTGCGAGGGGAGGGAG CTGGTCAGTACTCTG TCTTGTCTGTGATCTGTTGCTTGGGATAAAGGGATCAAAACACCATCTGTGCCCCGAGCTTGTGTTTGGTACCAA CCTGCAGG CC CCGGAGGAT CTGTGTGGTTTGGTTGGATGTGCTCAG CGTGTG CCCACCTG GAATAAGACTTACCA CTCCTGGGGATAAGAACCCATTTCCTGAAATGAGAACAGTCTGGCCAAGGTCCATGTGCTGCCTCACTTTCCTGA GTAGCCTGGGAACTCACCAATGCCTGCAGAGCTCAGCACCTCATGAGCACAGATGAAGGGTGGCCATGGTGAGCC ACGTGACTCTGGGTGAGGCAGAGCCCCTCTCTGGGCCTCCATAGTTTGGGGGTTCTGGAGGTGGCTTCTAAGCCT CCCACCAACTTGAGTAGTAGAACCTTGGATCñTTGCAACAGTACCATTTTTTTTTTCTCTGCAGTGGGCTATAAG TTTGCAAAGCCTTTTTAAACATTTTGTCTTATTGAATCATCTTATTATTCTCGGGCCACAGTGGACAGGTGCAAT TTTCATGTTTCTCAAGATAGACAGCCTGAGGCCCCGGGTCTCACTGTCAGGTAGGTCAAAGCTTTGAGATTAAGT CCTGTAATGCCTTATTTCAGTGTC CCAGGT CC CCAGATATTT CC CTATTAAG{ N) xCACACTGAGAGTCTACACT TGAGACCTGTAAAGCAGGGCTC( N > xTGCAAAAGAAAACAGGTGTATCTTTTATTACATATTCCAAACACCACAG ATATTTTGAAAATATAAGTATGTCAACCATTGCTTTAAAACTGTGCCATCTATTGCTCTCTGGGAACTTCCCCTT CTTGAGACTCAGCTCTGGTCCATCATATTGGAGGATGCCGAATCCTGATAAAGGATTTCTGGGCAGCTCTGAACA GAGGACAAAAGAGATGGAGATGAGCCTCCTGTGGTTGGAGTCACTAACACAGTCACATGGTGTCATTGAATGCCT GCTGAAAACACTTGTCCGAGCAGCGCTCTGGAGCATAGACTGAGCCTTTGTGATGGTCATAGCTGGTGGCCTGTG CAGGAATGAGTTAGCGTC CCTC CAGGTGGGTG CGGAGGAGGAAATC CAGC CC CACCCTGGGAGACCTCCTGGGTT CATCCTGGTCTAGCATGAAT GACTCTGTCT TGGATACACCAGGCTT CCTT CC CCTTTCTGCAGTAGAATTCATCG AATTCTGCCAGTGCATTAGAGATGTGGGCTTAAGAAAATCCAAATGACATCAATCAACTAGTGATGAACAAATAT TTGCAGAGCCCTTCCTCAATGTCACACCCTCAGTTTGGAGCTATAGTTTAGAAGAATATGGGGATTAATGAAACC TGGCTGCTTCTGTTACATTCAAGAGGAGAAAACCAACAATAAACACACAGATGGTACGTCAGATGGGGAAAAGAA GGCAGGGGGAGAGGGGTAGGGAGGAAGACTCCCCACTCCTGGGGGATTCGCTGCGGAGACAAGATTTATGTTCAG GAAAAGTTAACATGCAGGAAAAGTAGTTACAATATAGCTCCTTAGTCTGAACCATGACTAGATAATCATATGGTG AGCCTTTAAATGTACCTGTGGGATACGGAAAGAAAGGACCTCCTTGTGTCCTAACCCATGGAGCAGGGGGGTGCA ATAAAGTACTGTGGGTGACAGAGTTATTTGAAGGGGTGATGAACTTTGCGGGTGAGATGGCCTGAATTATCAAGT T C A (N) xGGCGTCCTCGGGTTCTTCCAGGCCTGCCTCTTGCCTCCTAAACATTTTCATCTGTTTCTGGATATTGG TGGCTCCATGAACTTTGGGCGATCTCAGCGTTTGGTTTTCAGGTCTGTGACTGGAAGGCAGAAGGGATCTGCTGA CTCCACCTGCAGATGGACTGAGTCCTTCTCGTGCCCATCACCTTACCG CTAAGGTGTAAAGGACTCAGGAAAGGG ACACAGGAGCTGGGTCAGGGTAGCTGCAATCGATTGCTTGGACAAGAATGGACCAATGACACTCACAGCTGACAT CATGGTGGTGACTTTTTATGAAAGTCAGGCCAGAGCAGACACAATGAAAGTTAAAGCATTGAGTGTGAAGTCTTG T CTTGGAGTCAAAAAAGCTGAACCAACTAG CT CAGATTGG CAGGGG GTGATGGGGATATATGGCAAGCACAAGGG TATGAGAGAAGGACCCTAGAAG GTTCTAAC TAAAATTAAC CT CAATTT TGTTTTATTTTTTTT(N) xGTGGGCAT ACCAGGAAGAGGAGAT CTACTG TGAATATC CG TCAGGATGAGAG GTAACAGTTGTGTTTTGTACAGGCCAAGAGA AAGACAGCTTAGAGTT( N)xGGGAGTTCCCTGCTGGGACCATTCCCCAGTTATTTACTTCCTTGTAATAGGTCAT GCTTAGCGTAAGGCCAAGGAGACAAGACACTGCTGACTCATTCTTTCCCAGCTGTGCAGGCTGGGCACAGAGCCA GGCTGCTTGTCCAGCCCTGGGGCAGATGCCGGCTGCTGCGGTTTGGCAGTGGGTCAGCAGCTGGGATGAAGCCAC AGAGTCTGACCTGTGAGGAGGGTGGTGGGTCCACAGGAACCCAGGCTCTGCAGCCCTTCTGAGATGGGCGGATCT GGGCATAGGAAAGAAATGCAATCAGGAATGCAGCAAGGCAGAGTTGAAAAGTCCTGTGAAGACTGTGCTGAGCAT CAGGGTCGGGTAAGCCTCCTCTGGGGGTTGCATAGGAGCTGCTCACAATCATCCCAGTATTATTCACATCATCTG TATTTAGTGGCACCTCCTTGGTGCCAAGGTGTGAGCCATGGCCCCGGGTAGCACAGCCTCCTGCTTCCCAGCTGC TGGAGGGATTGCCCTGAT CC CATGCTTCTC CAAGGTGGTC TGGGGGAAG CAACCCAGCAGCAACTGTAGGA(N) x CCTAGATAAGCTCTGCTGCAAAACACAAAAGTGGCCATGTTCTCTCCCAGCACTTGAGCTTGCCTTGAGACTGAA ATCCAAGTGATGCCCCTCAGGCTATCACCTCTGGTGGATGCCCCTCCAGCCTGTCTTACCCATGTCCTTCTTGGG CCATCAGCCTTCTGTTACCTGCTGGGAT( K ) xTTGGATAGAfiACGCAATCGTTATTGG(N)xTGTGTTCCTTTCT CTGTCGTCACTGAGTG CT CACT CTAGGCAG CC CT CTGCTAGG CG CAAGTGGCAGGATGGAGGAGCGTGCCAAGGA CTAGGAGATTACATG GAGAAGCTAACATTAGC CT TAGTGCTT CTTTGC CATTTGGGGAAGCAGATTTTG GAAGCT GTCACTTCCAAGGTGGAGTTCTGGGAAGTGTTTGGGCTTCCCCTTAGCAGGTGCTCCCTAAATTCTAGGCACTCC CCTCCAGCACTAGAGCTTTGTATT CTGACC CCTGAC CCTCACAG CCCTTTGCCTTTGATCAATGTATTTTCTCAG ACTGGAACCTCCTTCCTCTGCTTTTCCCTGTTTATCAACCTAGTTTAGTTGCCCCTTGCTTCAGACAACCTTCCC TGTCTCTTTCCCTGCAGCCCCCGGAGTACTGTAAGGCCCTCCACCAACAGACACCTTTCACTTTATTGTCCTGAA TCACAGGTTTAAGCTTAAGCTGGGTTCCCTCCATTAGATCrCTGGAGATCCAGAATTAGGGTCAGATTTATTTCT GTTTCCCTGCACAATATCTGTTACATAGCAGGTATTGTGTTGGAGAGG{ N) xGAAACTGAATTGAGTCCATCTCA GCTTCTCCAACAAGGACAGAAAAGAAACAGCTCAGGAGACGTAATCTTATAACATGGGGTTTCTCTGTAATTTGC ATAAATAACCCACTCTACTAGGTTTCACGCCTCTTGGGAGACCTACCAGCTGCCTGCCTCTCACTTCCAGTTTCC AAAGAGGAAGGAAGGTTTCTACCAGCCCAGCTCAGCTGGTCCAGGGACTAAATTACCTTCCCATCACTGAGCCCC TTTCTCATTATGCCAGAGGAAGATCACTACTGCCTGTCCATTTGGGCGGCTTATCCACGTGGCCTGGTTCCAGGA GCCATCTGTGTGATGTCATTCCCAATACGATTTGAAACTTCCAGCATCTGCAGCCCCCACTGGTGTTG( N) xCTT ATCATGGTGAGGGTTCTAGAGATGGGGTGGGGCCGGGCCCTGTGTCTCCAAACGAGCCCAGTGTATCTCTTGAGG CTGGCCTCTCCCTCTCCCTGGTACAAGCCAGGGCTTCAAAATCCAAAAGATCTTTCTTCATCTTTTCCAACCCAC TATGACCCAAATATGCATCTTCCTTTAAGGAATGTCTTAGATTTAAAAAAATTAAAAACTCAAAAGCTTCTGAGG CATCTTACCTGAGCTTATCAACATCTTTCCCCCATTCCTTCTGATATAACTACTTGGGCTGACTTGAAGCCTCAC TTAGGTGCAGCTGCAATGGTGTGAGG ( N) xGGTGTGCGGTTTTATTTGAATGGAGCTCCCGTTTGAAATAAAAGC TCTATGATCTAACCTCTC CTGGGCCT CCTG CCTTGCAT CC CTAGGGGAAGTTTTGGGGCATAAAAGTCCAATTTC TTTTCGTATTGGACGCTTGGATGAGAGAAAGAGAAACGAAATCCTGGCACTGAAAAAAGCAGAGACAACCAAACC CAATTATTCCTACTAACTGGGAAGGAAT TC CATTTGAGTTAGTG GTTC CCAAACT
>Hs2 10 340 96 91 -10341 69 73
TTGGCTCTGTGTCAGGTATCTGCTCCCATTTACTGATGTTTTAGGTTTGTAGGAGTAGAGTCATGTGATATGGAA CATGGCCACTACCTTTTTAGGAGGTAAATGGACAGAAAACGTTCCCATGAGAGATGGTATGAGAACTTCGTAGAA CTTTCTGTATCAGTAGTCTACTTTGCCCAAACATGACTACTGCACTCTAAGCTATCATTTAAAGAGATGTTCCAA CCTCAGTGGCACATCAGAAACAGTGTACAGCCATCCAGTTGTCCCTCAGCATCCACATATAATTGTTTCCCGGAC
(N)xAACCCAGATAGAGAGGGCTGACCATACAAGCTAAGGGAAAGAATACATGTTCCTATTGCTGAACACAAGAC CCCCTGCTGTGCAGTGATAATGCAATCTTGATAATAATAGCAATAAAAGCCACCACA(N)XATTTTTTAAATTTT AACAAATATCA(N)xAGAGAAAGGAGAAAAAACATAACCATGATTCCTCCCTCCTCTTTGCACAATGAGGGTTCC TTTTTGTGATTCCTCTATCTAGAGGGTTGGGTCATCATTTGGGATTTTAAGTGCCCACGCCTCCACTGGGGCAGC AGTGCAGACCTGTATTTAAAGCTGATCTGGAGGCAGGACTAAGGGAATAAAAAGAGGAAAAAATAAACATGGGGA TTTACTCTGGCTTTAGGGGGCTATTTTCCTAGATTTCTAGGAAATGAATTCAACTAAAGATTTTTGCTATTCGCA CCTTTTACACAGTTCCCCATTCGCTTGCCTTCAGG(N)xACAGTCGTAGAATGGATTTCTTATTCTCTGAGCTGC TTCTTGTTGTTAATGATTATACCCCTAGCACTTGGCACAGAAGCTGTCCTGTGAAAGATAACCTCAACCTATTTC TAAATGAAATAAATGAAAATGTAAAATGTGAGACTCACCAATTGCCTAATAAACTATGGTAGGA(N)xCATGTAT TCCTGTTTTCAGGACTTGGTTATTCACATAAAAATCCGAGCCTTCCTTTTCCATTTAACTGAAGTGACTCCTCTT CTCAAATTTCCATCCTCCCATCCTCTTTCCTGTAAAACTCCAGTTGATTGATAGATTTCTTCTTTGCCAGGAAGG CTTCTTGAGAAAAAAAAATATTCTTTTCCTTCTTATCAAGTTGTATATTTTCTTAAAATATAACTTGACTAATAT GTTTCACTGAGCGTTCCCCCATCTATACTATCAAGAATTTATTTTAGCTTTTTATCCTTTTGGAAATGATGAAAT TGGTTAAGATTGGTAATTATCCAACTCAAAAATGATCGTTAGAATTCCTTTTTGTTTGATTACCATATATCACCA AACCTACTATTTCTTACCAGCTGTTTCCATTAATTTAGCCTTGCTTGTCCTTGACTCCTATAGACTCCTATACTT CTCATTTTGAATCTAAGTGAAAATATTAATAGAAAGAGGTAAGCCTTAAATACATCTGATTTAAATTATCAGATT TGCTTGGAAAGCATAGCTAAACGAACTTCTTTTATGGTTTTGCTTGTCATTTTTTATTGCCTTAGACTGTTTTCT ATAGAGCCAACAATCTAAATAGAAGTGGATATTTTTGTTGTGGTTAGGGGCTTGGGGTGACTATTATCAGCTCTC AGAGATAATCATGGTACAAATATATCTTAGAAGAAGATGTGAGGTAGTGTTTGTAAATAATGTGAATGCATTTAC ATGTGTTTGGACA’rTCGGTATGAAACACTTCTAGGCTACCCAACTGGGTACTTAAATGGGAGAACTCTTTAGAAG CAAGAACCTTAGGTCTTCTTGCTTCTAAAATGGAAAATCGTGATGGAAAAAAGAGCTCCCACAGTATATTAAGCA TGTGTGTGGTACAAACATTGAGCAAGAGGAGTTGATTGAAGGTCACCAGCTTGGACATCACTGATTCCAAGGTCA TCTGATAAGATCTCCTGTTTGTTCAGACCCATCCCTGGCATTCTATCTAGCAGTGCCATAGCTGAAATTAGAAAA ATGAAGATGAAAGCAACTGACTGTGACAATCTGCTGTGTACACAATCTAAGTGTATTTTTCTTTCTTTTCTCTTT TCCTCCTGGGGTCTCCAGTTTACCGTGGTTTCTGGGCAGTCCTGATGCTCC’rGGGGGTAGTTGCTGTAGTCATCG CAAGCTTTTTGATCATCTGTGCAGCCCCCTTCGCCAGCCATTTTCTCTACAAAGCTGGGGGAGGCTCATATATTG CTGCAGGTACGTACGGTGCAATGGGTATGACTTTCAGCCACGGTTTTTCTTCATGTCTTTCTATTTTTCATTTCT ATATTCAGGCGTCAGCCTTCTAGTAACAGTAACACAATAATATTTGTATACAATACTTCTTGAACACTAACTAAT AATATTTGTATACAAT(N)xAAAACTCTGTCACTGACTCACTCCAAGGTGTGTAAGAGACTGTTAACCTCTTACC CTATTAATTTTCTGTCTGTAAATGGGGATCGTAACATAAGTCTCTGTAACATATCAGCATATGATATGGGTAGGC ATTTTCTGAGCTCTTCAAAGATAGCCCTGTGATGATfiTTTATGATAAGCATACACTCATGATTTATCATAGGTAC AAAAATATTTTTCTACATCACTCCCATTCATTTTCTTAATTGTTTGTTGAT(N)XTAATTAATTAATTGTGATCC ATATGAATTATCTGAATATATTATTTTCTGTGTGAAGCTTGCATTTATTTTAGTTAGGTTTCTAACTAATTTAAT TTCTGCTAAAATGATCACACTACAAATAGAAAAGTAATTGAAGAGGTGAGTCTTGTGAATTTTAAGCCCTTTTCA TTTAATTAGGCAGTGAAAGCCAGACTATTAACTTGGAAACATAAGCACAGGCTATTATTTGTAAAGATAATTCTA GAAAACTAGCCAAACTTCAGTATGACTGAGACCAAACTGTACATACTATGCTGTATTGTAACATCAAACCAGAGT GACAGAGTGCTCCTAGGTGACACTGTAGTGACAGTTGAAGGAAAGGAACAGAATGGAGAAGGTTACTGTCCTTCT AATCATTGGAAAGATGGCTTGTGATGCTGCCCAGTGTCATTTCATCAGTGCTCAGGTAACACAATACTTTCTGCA TAAGATTGTGCTTAATTCTGCAGAGGGAATAAAAATAATAATCAGATCTAGATCCTGTATTT(N)xGTTTAAAGfl GCTAAAAAAAAAAATCTCACTCTGAGAAGAACCCTGTTAAACCATTCTTTCTTTTGTCTTTTATTTGAACTGTTT TCTTTCACTGGATTTAGCCATTAGAGCTCTTCATTGTAAAAGCCTTT(N)xTGTGGACTTTTGTGTCTGGCTTTT TTCACTTAGATCTGATAAATTATCATCCCTCTGCTTGAATTCATAATTCCATAGGACCAAATCCCATATCCTTAT ATCCCTTAAAACTGAGCATCTTCTTGG
>Hs2_121682895-121691799
AATTATGAAGATGGAGTTGAACATGTAAAAATTAAAGTTAAAATTTCAGGAGAAAAATTTCTTCCCATTAAGAAT CCATTGTGTTCAACCCAAGCATCCATTGACAGATGAGTGGACAAAAAAAATGTGGTCTATCCAATACA(M)xAAA GAATTCATAGTATTTATGGTATTGTACCAGAATCTGTTTGCACAGAAACAGTTAGTAGCATTATTTTTCTATATT TTTTTGTTTGTCTGTTTCTTTTTTGGTTTTTTGAGGTGAAGTTTCGCTCTTT{N)xTTGTCTGCATTTCTTAGCC TAGCAGTCAGCTGACCTGTACATACCTATCACAAACCCfiTCCACATTTCTTTAAAATATTCAGAAATCCAAACTT AACCCTGCTTCCCACTGGCCTGTGGCCTTCAGACTGAGGGAGAGTTGACTAAGAAATCCCCACTTCTTAGCCCTT TTCCCAGTATAACTACTTTGTTAAAAGCCTTCTTTTCCTAAAGCCTATAATTATGTTTGCCTCCAGGGTACCCAG GAATTTTTTCTTTTAATAAACAAAGAGCATCCTACAATTATCATGATGATTTGTCGAGGGCAGGCCAAGTTCAGA CAGGAGGCTGACTCAGCTCTGGAAACTGACACGCACGTGCTGGCGTTTTCGATCCTGGGGGCACCCGACAGCGTT TAGGGGTCAAGAATTAATATTAAGAGCTGGAGAACTAACAGATCCACATAGATGTCCACTTTATACTGCCAGCAA AACGGGACAGCAGTTGGGTGGGATTTGGCCTTCCTGAGGCTGGCGGCACCGTCTGTGGGCCGTGCCATCACAAAT GGCTCTAGGTCAACACCTCCAGCCTGCGTGCACTGCAGCATCCGGATGCCGTAACCTGGTGCTGCTGGAGCCACA CGCAGCCAGGGCCGGCATGGGCAGAAGCCGGCGGAGTTGGAGCCTGTCGCTCTGTCAGCCCTGATTTGCGGGCTG AGCCCAGTTTTGTGGCCCTGCCTGTAATCTCCCCGAGTTCAAAGAGTGCTTAGCTGCTTGTCTTTGTCAAGAGCG CAGTTGGGGATCTTTTATGTGAAAGATGTAGAATTCTCGGGCAGACTTTGATTATTTATTCATCCCCCTTCGTGG GTGTGTGATCGCGCGGGCACTGCGGAGCCCCTTGTCCTGGCTGCTCTTGCTATGAAATTCATTGAGCTTTAAAGC CCTTTGAAAGTAGCTTTTTGAGGGAGGGGGAAAGTTTTTGAAGTCTTGTTTCTCTCTCCCCCTCTGCAGTGCCGC AGCATCTCTTGGCACCATTCCATGCGCCCOTACCGATTGACATGCGACACCAGGAAGGAAGGTACCATTACGAGC
CTCATTCTGTCCACGGTGTGCACGGGTAAGTCCTGCCCTCTGCCTGCTGCTCCTGGCGTGCAGTCACCTGCCATG
GGGAGG CT GGGCCGGCAGCCTCAGCCACATCTCCTGCCTCTGTCTTTCTTTTGGGGGTTCCT GAT C TACATT GT C
CTGAGCGGGCGATCACCTTTGCTATCATGGCCTGGGACCCTGTGTGAGCATGTGCGTGGGCAGTGTATAAACACC
AACCACCCCCGAGCCCACATCACCACTTATAAGGCTCTGGGCTTCCTTGGTGTATCTATACATGGTTTGGAGCTC
TTTCTTCATC CATGAAGTGG GAATCCTCTC TAG GTC TAAGAT CC CATATAAGTAAGGTGAT CTTAGGTATCTGTT
GTTCCAGCATAATAATTCAGAGCACCCTTTTTCACTTCTTCAAGTGTCCCCCTTTGAATAGTAAATTGTATAGTC
A C CATACT TAAAAG GA CAAG CT CAAA GTGATG CTTTGGGGCCCTTCCTATTCCCAG CTTTGAAGTC CCCAGGTAG
AAGGTGTGGGGTCACCCTGTGGTCTCCACCTGCCTAACCCTGTCCCTGTCATACCCACCTGGGGCCTCTAAGCCC
ATTGGCTGGTGT CATT TCTTGGCC TTTAGGGCT CAAAAGT CTTG GT CT CTGG CTAT CC CATC CGT C CT C CTT CCA
GGAAAGAAGG CCCCTCCTTCTCCCTACCCCCAAACT CT C CAC TG CT CC CC GCAC CTGC CT CCAG GAGAGC T CTTA
CAAAG GTCAGCCAGCC CCAAGACC CCTC TCGACC CAGCAT CTTCATGCCCACACCCTC CAGC CC CCTCAC CTGTT
CATGCCAAGGTGCTGTCTGTGGTTCTGTCCCATGTCTCTGTGCCTGGGGC CAGC TGCCTGCC CAGT GTGCCTGGA
ATCTCCCCCCAACAGGCCACCTCCAAGCCTCAACCGAGGAGTCACCTCCTCAGGGGCAGTGTGATCCTTCACACT
GAGO GG GGG CTG CT G C CCTCAATC CA CAG CAT CATCAT CTGTTT TT G CAC CTGC CT CC CAG C CAG CTCTGAGCTC
TCCTCTCACCTCTGTGTCTCAATGCTGACATGAATAGGTGTACAACCAGTGCTCGCTGAGTGGAGAGCCTCTCCC
CGCCTGCCGC CACC TATGAAG G CACATC CTCCTTCC TCTACCAT CT CC CAT C CCAGTGAAGT CAGT CAACAC TTG
CTGGGCCCTCACTGTGCTCACAGCATTGTGCCTCAGCAGAGTCCTGTTTCCGGACACCCATTGCCTCCCCAAGAC
GGAGCAAACCTGGGACTTCTACTTTCTTTTTAAAAACTGTTTTAC tN) xTTGGGAACTTTTTGTTTTATTTCGGT
GTGCATACCTGGTTCTCACTTGACTAGGATCCGCTGCCGCAGCCTTATGTCTGCCTGGTGGGTGTTGGAGTCATT
T C CACC CT CTG C TCAC CT CCAG GG CC C CTC CAC C TG CAGG TGG C TGAGA C CACAGC TG GAGAAAC C T C TG GAAGG
TGCATGTGTTTGGAACTAGTTGGGCCACCCAAAAATTGCCCAGTCAGTGGTGTCTGAGACTCCTAGGAGCATCCA
GGGAGCTCAGTCCCTATTTGGGAGGGGGTTGCTGTTGCTGATTTCTTGCTCATGAGTCATGTTTGGCTGGTTCCA
TGA CAC GGAT C CTGGG CATAG CAGGCCTGCCT CAAAGT GCTCCCTGCACGGGATGTTGGTCAGGAGACCTGGGCA
ATGCTGAGAGCTTTGTGCAGAGACAGTCCATGCTGGAGTGCTTCTGCCTGCAGGTGAATGTCCTGGTTCACCTCT
CCCTATACAC CTGAAGTGTGTAGAGG CACCAC CA GAAGTGTA GGAAGA CC CCAACAGC GATGACTGCCTTCCTCA
TCATAACTGACCTTGTGAACC(W) xTAGAAATCAAGTTCCAGCCCCCTGAGCTCAGGCCCTCCGCTGCATGCTGG
CCTTTCTTGTGGGAGCGTAAGTATGGACCATGGGTCCCTCTGACGAGTCCAGCCGAGCCCTTCTTGACAGCACCT
CGGCCCTTCGTTAGGGCACTGCCCCGAGTCCAGTCGCTCC CAGAAC(N) xCTGGCTACAGGCAGAGCCCCCACTG
AGAAG GGAG G CC CT CAG TGC CGT C CTTC C C TGTGTAG GTG GGAG CC CT GGGCTGG GAAG GGGCCTGGGTCTTCAC
CAGG CTTTGC CATGCTCCAGCTCTGGGTAGGGCCTGCTTCCTTCCCTTGCCAGT GAG GGTGGGGTGTGCG CAGAG
CA CTTGGAGAAGGGGG CCGTGGATGCTGGCATGCAG CAG GGAGGAGTGGCCCAGCCAGAGGCCCA CAGAAATG G C
CTTGTCCCTCATGGCC TA CACAG CTCCTTCACTGGT TGGATT CC TGAAGAGATT CC TAAGGCTG CTG TA(N )xTT
TGGGATCACTGAGCCCCAGTGTGGAGTCTGGCATCAGCCTGTAGAATACAGTAAATCATATTTATAATTTGCATG
GGTCTGCAGCAG(N) xCTGTGTGACATCCCTGCAAGGTAGTTACTTCCGTGAGGGCAAGGAGACCAGGCTCTGGG
GTGCAGATCTCCTCCC CAGC CAAGGAGGGGCTAGTCTGGTTAAGCCCCAGTGATGGGCAGCAGCATGGTGCTCCC
AGAGCCCCCCGTCCCTACGCGGTCCTGCTGGGGTGGGCTTCCAGCGGCTC CACACTAT CC TCAAGACAGTTG GT T
CCCGCCAGACTGAGTGGGGGACACAGCAAGAGCCTCGGGGCTGGCCCTGGGCTCTGGAAAGCCCATGCCCTCACC
TCTTCCGCCCCCAGTGTCCCAGGT GAGAAC CAGG GCAAAG CTTGTGAGGAAAATGG CCCCGTGGCTCTGGCTTCC
ATGATGAATGTGGT TGGAGC CC GGTG G AGAAT CCCGATCCAT CAAG CC CATGTT GTAGCTCGCTTCTCCCTGCCA
GCA C CAAATGTGTCTAAT TACACAGATGTTTG CACAGCAAATGAGGTACGGT GTCATCTCATTTCTGTGTCAGCT
GAGCCTGAGC CTGAGC CTGCTGCC CTGAGCTGGT GACCATTTCGTGGGTGTGCCCGCCCCTCAGCCCTTGCACCT
CCTGCTTTCCACTGCTCAAGCACCCAGACCCCCTCCCCGTTCCCTCTCAGGATCCTGATAGCTCCAGGGAGGGCG
CACCCAGTGTGTGCAGGCTCCAGAGAGGTGTTCCCTGCACTCTCTCATTCTGCCCTGCCCTCTGCCCCTCCGGAG
TAC CTTGGTCAT CACC CAAGAG CCTGCCATTGTCAGTTCT CAGC CTGC CAG GCTACCCGCACTGGGACCCATTGC
CGAAC CCTCCCTAG CTGTGATTAACATGAATTAACT C A {N} xCTTTGGAAACATTCAATGGATATGAGCAACAAT
CGTGAT CATT GTTGTTATCCTCACTTGCTGAT TTGAGAAG GGGGCCACGG CAGG CTGG GAGACT GG GTGG CAGAG
GCCGGCCACAGCATGTCCTTGGGAGAGTTGACTGTCTGGGGACTGTGGCACAGAAGGTGGGGGATGGGGCTGAAG
GAGTGCTGTGGTGCGGTGTGGTGGGGGTCTTTCTTTCCCCCAGTTGAGGAATTTGGACTTCCTCCTTTTGGCAAC
CAGGAGCAGCGGGGAGTGGTCAGATTTATTTTCAGGCAGCTACAGCAGTAGTTAGGGAGTAGACAGGAGAAGGGC
GGGAACTGTGGTGAGAGGACCTGCCAGAGCTGCCACAGTGGCCCTGGGGAGAGGGGAGCAGGACCAGGGCTGACC
ACCCGGCTGTAAACAGGCAGCTGCACGTGGGGAAGGCAGGGCATGGAGGTTGGTGTGAGGACAGCCTTGCAGCCC
AG CC CAG C TCGATGG CCTCCTCATAGACACC CAAG CCCACTCCTCTGCAGAG CACC CTTC CCGG TCACTTAAG GT
TCAAATGCACAGCAGGCTTGCATGGCTTACTGATGCTCACGTGGTTCCAGGGGAGGAAAGTCACCACCCACCCGT
GGACTCCTGCTGCAGTGTCTTTGGGCTGTGCAAACCACAAGTGATGGGGAGACAAATGGGGCAGGGGTGTGTGAT
TG CTGGTCAAGCAAGGTT CT CTTGAAGC CCCCATGGGGACATCC CTAAGT CT CCAAGAGT CTGTACAAGGGCGAG
AAGAGAAGGCCGGCTGTGAATAGGGCCACACACCCCCAGCAGTCTCTCCAACCTTGTGACGCTAACAGGAAAGAG
AAGC TCTT TGGATG CCCTCTGGGCCCCCTGCCTCC CAGAAATAG GC TTGC CAG GGTGT CATGG GGAGCAAGG CTG
AGGTGTGGCGAG GAAGGGTC C C TAAGTTGGTG CTCCCCATTTCCTCAGTGTATGTGGT CACACAAG CATGAG T C C
GGCCTGCCCTGGGCCAGCGTTAACTTGTGTCCTGGAGAGGAATGTAGAAGTCCATGCTCCTGGTTCGGGGGGCTG
GGGTGGACCTTCAGCTCCCTATGTTGCT TTTGAGGACCTTT C CA< N) xGGGCTTCAGAGAAGACCAATTTCAGCT GCTTCCTGCC CAGAAGG G CGTACAGAAACTG C CC CATG(N) xACAGGCGCCCATAGGCCAGAGTTTGGGGACCAC
TG CAGG CG CCGCTGCTTGCTTGACGTCCGGGAAAGCAGCCCCAGAGCTGCTTGTTGGCTTTCTGGACTCATCACT
GACAG CGCCTCGCCGC CAAAACACG GGT
>Hs2_168582125-168589072
TAAGT CCACTTCTTTT TGGACC CTTAATATAAGTGT GG TT AG A CTGT CTAAT GGACTGAAATAAATAT GG C C CTG GAAAAATATACC CGAAAACC CT CTTC CACACT CTGACCTCAT CTAGTTGAAACATT CCACAGTTGACC CCAT GAA CC CCAAACTC CAGCATCATAGTAGTGTT CAGGGAGATGAAGAC CTT C CTGACAGAACCATGTGTTAGT TCAG GAT CT CTAAGAAGGAAGGGTAAC C C TAAGT CAAAC CAAT TAAAGT GACAGTTGAC TTATTTCT CTGTATAGAAGAAC C AATT CAGAAGAC TT CT GTGCAGATATTATG C CACTG CTGACC{N) xATTAAAATTTTCTCTGGTGTTGGGGATCA AATATG CTTTAACAGATCTTGGGC CTTTAAGACAGAGATATGTTGGTTTCT C CATAAAAT CAGTTC GATAAGTCT GAGTTGTC TCCTTTACAGG CAGGAAGAAATGG CTTCCCTTGGAGCTATCCATG GAACTTATCTAAG GGACTCATG TTTTTAGC CTGCTTCTCCAG CTTAGGTCTG CCCTGCTT CCAAGTATTT CTAG CACA CCAAGC TTCTTTGTGAGT C TTGCTATGACTAGTTT TTGAAATGGAAC CTATGTTTTT CAAG CC CAGAGG CCTTACACTG CAGGTTAATATTTC C AGTCCTACCCTC TACCTATT CATCGC CTTCTTTGTGCTAGCCCCATCTTTGC CAGGAATC CG CCCACCTG CC CCT TGTCCAGG CATATTGC CCTG CC CAAGACAGAGAC TGACTTAAAGAGTAAC CAAGAC TAAC CTTAAAAG CAAC CAG TAAG CATGATTTAAG GATGATTGGAAAACATTT C CTGC CC CCAAAGTATT C C TC CATTTTAGACTTGGAATGTGA CAATCCTCCTATTTGGGAATGTAGCAACAAAGAACAATGTGACTGGATT(N)xGAATAGAATCTGAAATAACCTC AAGCAT CC TTAGAT TCATTGTTGT TTACATGTAAGCAACATTTG GCATCTGGCCTT TAATAT CTCTTTGCTT TAG GCACTT CATCAT TTTTATGT CACATG CT CATT CAC C CT TGAC CAAGGCTAG G GAAGATATATGTT C TTTG CAAAC TGA C TGATTGAGTACAGGTGTAAAGCTATGT CATGAAAAACCTT TGACCCCTTTCTTTGTCC CAGGT AA CAG GCA ATTAAATCAGAGTAGTTT G CTT CAGG CCAGAAGAT CATTGA C CAGTA CAAG G CAT C TTTACTTTTTTTTTTAGCT TAGAGATAC CAT TAATGT CACAGAGC CC CGTAGC CAAAACGAAG GACAGTTT TG CAAATGATGGATTT AAC CTTT GTCTTTTGGAAGCTAGATGTGATTATCTCCAT CAGC TTTTGTTCACACTCTTATAGCTCATTTTCTAGTGTTTTA ATCTTCAGTCGTTC TAATTT CAAAGTTC CAAGGC TCAGAGAACTAACT CACT CT GCATTAATAACTAACTTG G C C AG CCAACTAGTTAGATAATC GC TT CTTCTAAA GACC TGAATTTAAT CA CATATAAGATAA TACGCATTGGCTCCT AG GATTAGAAAGAAAATAGACCTTTG GGGTGTCGGG CACT GTTTAG CCTACCACAGT C CATCTTCTGATT CC C (N ) xGCACCTAAGAAGTCCAAGATGATTTAGAATTA(N)xAGCAAACCTTATGTATTGTAG(N)xATTGGGGCTTGC AGTATATAC CTTTGTACTGAAATGGAGTTCAGATGTTT TAA CATTTATTC CAGAAAGTAT CTTACTGTGAATTAA GAAGATAC CCTGGATTAATACACAAATGGAAATG CTTTTTTTATCT CAGATT CTGACATGTTTTCT TCTTAGCCC TTATTTTAATGTATTGGTTGAATAAGTTGACT GTCTATCCCATTAAGTCATGAT CACCTTACACATATACTTGTT TGTTATATGTGTTT CAGTACACTG GAAAGAAAGACTTCAG GAAGAAGG CT CAGC CCTGGTGAACTGTGAGG GAGG GAGT G GG GATGATGGGGAAAGT CCTG GACCATGAGT GAGTAT CAGGGAAAGG CAGT CTTG CT CACT CACAG G GAC AC CTAAAAAT CATTAAGGGGTAACTTGT TGAA TGAG CAGG TGAGGACAGGTG CTAA CAAGAATGAT GGAGGATGG TG GAAGACAAACATTGTCTG CTTC TTAATTTACC CTAG GGATTCAC CT GAATGG CACA CT CATTACTATG CTTC C CCTG CCAAAG GG CAAGTACAAATAGTTAA CAC TAGGACGATGTCAAGGAATTAGAGAAGAG CTGTGGGGAAATG G AGGAGCACATAG GT CC CTGAAACACCAG CT CAGAGATTGAGTGTTCATAAT TATGGAAAG GCGATAATAG CCTAG AAT TATGTATAATGTCA CAC CT CACCAG GG CCTG CAGTGG GAAC TT CTGCTGTGAGTAAAAC TTGT CAATTC TGT GTATTTCTTCCTCTCT CTATTATG TAAGATTT TTATTTAAAT CTTTAATAAAGT CTGGTTTCATTCTT CAACGTG TT TC CCTAC CAAATTTTAATGATACATT TC TAT CAATCAT CACATC CATC CGTCCTTT CTAGTGCATT CC CAA CA GTCTCCTGTTTCCTACCCACTGTC TGTATG CT TAGACT CAAAATAGAT T CAAGCAGAAC CAAAAGT CATGTGAAG
a a g t g a a g t g g c c a a a a t c c c t t t g a t a a a a g c a t g a a a a g a g c t a a g g g g c a t c c t g t t g a c c t g c c a a g c t g c AGGTGGTGGC CAAG CC CAC CACAG G TCTTCCTCCTC CAAT TAGACC CATTTCTCCTTCATAGGGG G AATT C CAG G GGGTGTGAGTTTGTGTGTGTAGTATGTGAGCCATCTTGGCTTATAGCTGACCTGCAGTGCCCAGAAGGGTGCACC A CAAGCATTGTT C CAG CT C CAAAAGAAAGAAAGGAGTC C ATTG T CT CTTTATGACAGC CAAAAATAGAACTG CC C T T TTAAATA CTG CACAA CAT TGTAAATGGT C CAATC TATTTTCTG G CTTACTATATCCG CCG G G G CG CTCATCCA CAATGT TG CTCTCTAG G CTTC TCT CAAT CTGT TTGAGT CC TT T C CACACATACAGCAAGC TA C A TT T T T T TT CAA GGTTGAATGG C A T T T C A T T T T T T T G T T T G T T T TAA ACTAAATTTAG CATAGTGTGT CT CAATTCTTAATGG TG T C TG CAACAACT CC CAAATTAGAAAG CT C TTC A A A C TT CACTAG CAAAGT G T T G T T T T C T T G C T T T T T T CAAAC C TT C TC A C G G TG C TA A C ( N ) xT TA C T A A TT C C TG TT TA T G TA TT TC C A C A C A A G C A G TT A C C A A T < N ) xGCAGGAAAC TG AATAA A A TTC A A TA A C TTG A A TA A A TG T CTAGAAAGATGCTCTGAC C A TA G A TTTT G G TCAG AAA TATTTTCT T A T G TC T C TG CCAACATGAAAACAAGACA CAAAATCAGTG GCAGGTACTT T T A A A G A TA TTT A TA A T T G T { N ) x A TTAAATA AG TG AACAAAA ATTG G CACCTAGCTAGTCAT CAGTAAAT GTTAG CTTGTAGTCTACTGATG CTACACA T T C T T C A T T T A T T T C T T T A T T C C T CAACTAGG TATTG AG CCCTTCCTTG CTGTG TG TG CC A G A TG A TA TTTC TC T G C A C A C TTA G TTTTA A A A G G TTTC A TA TTTTG TG TA C C CC CAGGAT CATCAG CG CC CAAAG CTAAC CAGG CACTT T T C C T T C T T T C C T G T G T T TG AACACCATATGGAATCAGGATTAGTCAC TTTGAGCC TTGTGTGAGTTG CTGT TGT T T CAAATGAGTTTG CAACTG CAG CAT C TT TT G TTT TG TT A C A G TA C T A C C A G T C TA T C TTT T CCCAATAT C T T T T T CTT CAAAATGC TGGAAT T C A T T T G A T T A T T T T CTAG GG CTATTTGAG CACACC CTCTG TG T GAATGAAGGAAAC CAGGTTTTACTG CAGG CATCTC CCATTTACATGG CATAAT TTAAAATTAAAG G CTT C CAT TG CAAG GCAGCCTCT G TT TC C T G TT CCAC CG TCTTTTCTG CTG ATAG G TTG GATCATGGAC TG TCCTCT CAAAGT CAGAGCAGTG GTGCT TG ATG G CTCTGTA CACTTAAGGAGAT TT G TTTC C A G A C C C C TC TTTC C C CAA C TA TTA A T T G T C T T T T C CAAGAT A C T G T T T T T C C C A T T C C T CAAC C C T T A G A A C TT A TT C TA C A G A C TT TC A C T TTG C T TT TG A C TT A T GACGAT TTG GT GAAACACTTTTGTTAAG C CAG G CTGTACTCACTTGGAAGG G C A G TT TT TTA T A C A CAG TATTA CAAC CTA TA T A A T A C A TT TC T A TG TG A A T A TG GC CAAGAAGAAGTACT CTAAG CTCAATGTCAAAC CTGATG C CATTGTGATAAA ACAAATACACAATGCCTTTAATTGCAACAGACGTTTTATGATCTCAATATTTCTGTAAACTCAGTCCACACAGCC AGTGTAGCCAAAATTAGAATACATTAAGAGGCATAAAATTGTAAATTGGACATGTGGATAAGCTTTATTGATATT GTAAAATATTAT GGTAAGA CTTT CTGTCGAGT CTCCTCATTTTTGG CACCA CATTAAAAAACAGAGTT GTGAGTA GTACTGACGTAATTAGAAGTTGAACCTGTTTGGATTCACAGTGGTCACTCCAGACAATTACATTTTTAAACCTTC AGCAGATGGCCAGAATTAAGTCCTGGTGAGCCTTTTCCACAGCACCGGAGCCAACATGCAGTGGATATGCCACTG GCACAGATTTAACTTCA C CTGGAAATAAG C TAGTGGGACG GAAACTGCTGCAGAACAACCG
> H s 3 _ 8393895 ~ 8402964
AGATAGGAAGG GAAAGAAACTGACAGGTAATAAGGACATAGTAAGCAC CAAATGTGTT GTGAATGT TAACTG CTT TAATCCCCAAAT(N)xATGGCTTCTGCTGGAATAAATAAATTTAATAACAGTAACAATAATATTTCTATTAAATA ATAGGAAAAGGAGTA(N)xCAGGGTTTGAACTTGGGCAGTATGACAAAAGAGCTTGTTACTTACTAACCACTATA CGATATTGCCTGTTTGTAACAACATTTGGGGAAATGCAAAGTCAGGTTCCCAGTTCACATGGAAGTGCTTCCAGC CTGAGCTTGATGGGGTGGGGGGGCTCTAAGGCTCCTCACAACTCTGAGATCTAAAGCGCCTCCCTTAGACCTTTC TCAC TCAAAC CT CAAG CT CCCACATG CTCTATG CTGTCAC TCTGTC CC TGTCGTGTGT CAGGGAGGATGCCCATT CAC CAGGCCTTCTGTG CTGTATGT CTTGTGTGACTTCC CTCTGCCT CCATGGTCATATGCATGTGCTTACAT CCA TTAGTAGTGCCACCACCTAATAGAAGCTCTTCAGGGCCCTGGGGAGTCCAACCTCCTCCATGAGG(N)xAAGCAT TTAT CTCACATCAGTGAGTCTGAGAAGCCAGAACCAAG C CAGGGTACT CCAGGACTAT CC CC TTTT C C CATAGTT TCCAGGAGTACTTCTTTGAGACCTTCCAAGGTCCAGAAGGGACTGGCAAGGCCATCTCGAAAGCCACTCTCCAGT TCCTTCTACCTC CAAATGTCCAT CATGGCGGG CTGCTC CCTGTGACTC CAGGAC CCTGGTACTCAGCC CACATG C CCAGTGCCCAGGTATGTGTTGTTAGCTGAA{ N) xGGGTATGGTCTGGCTTTTCTCCTGCTGCCCCACTTTGGGTT CCATATTGGCTC CTTGACAAAGAC CACATGGTAAAGAGAAAGAAACTTATC CTCTGGTGG GG CAGGAGACCCAGG AACTTGCCAAGTCAACTGTCTTCTCAAGAGGAAACCTGCCAAGTCACCATCTGTAAGTAGGGGCTTCGGGTTGAA TAAT ATTAAT AACGAT ATTACTAG CAGCCT CTTTTTCCTAAGTC CTTAGGCACT AT AGGGTG GATTGGGCTG CTT GGGAACACGTGATCTA GATTCAAATC CTGG CT CCACCACTTACT CATGAATGAC CTGGGCAAGTTATTTTTCTCA ATTTCTTTATCTGCAAAATGGAGATAATAATAATCCCTACCTCAAACTTTCCCTGCCAATTTAGTTAGTATGGAT AAAAGCACTTACAACT GCATTTGAATTTAGAT TTAGTTGC TGTGCTGT CTTTATAACCTT TG CCTTAATTAT CTT ATTTAATGCTCTCAACAACCCTCTAAAATAGAGGCTAT(N)xATAT CTCGGCTATACATCCTCCCACTGCAT<N) xGTTTTTCAGAAGCACTGCATGAGATGCTCCCGGGGGTCTCTTGAGCTCTAACATTCCACATTTCTACTTCTATT AGAATTTCAATCAGTTGACATCCTATAGGAGAACAGAGAGTTTGGTGATACTGTTTATACCAAGAAACAGACAAA AT CTTGTGGAAACCTT CC CTGGG GCTTACTAACCTCAC CGTCACAGGCTTG C CAACCT CATC CTCATC CTCATCT TCAT CTGCACTGGCAC CTTCACG GTTATTGAAGAAAG GTG CACACT CT CCAG CAGATGAATTACTG CATTGT( K ) xATAT CAGGATT TGGTTGGCAGCGTCTTGTGT TTCCTAGTTGTT CGTAAAATAG TGGGGCAT CCTATAATCAGCA GAAT CCTAAATG CAAT GATACACAAT CTG CAAAGCTCCACATCC CTGT CCCCACTACT GGAAAACACAAATAAGA AAAAATCACCACCTCTGACTGGTGGTGGTCCCAAAGGCCATCATTTTGTAAGTGGTCACCACATTCAGAATTTTT AAAAAATGCTTTTTCCTCCAAGACAG( N} xCTGTCTCTGACCAACTCTGCCTGGCCCCAGAGACTAAGAATTAAA ATGCAGTCAAATATCACTGGATTT CACATAGCTTCTCAGAGCCCGAAATAAT TT CAAG CTACAATT T C TTGAAAT GG GTTTTATTTT TTTTAAACCCCAACAAAG CAGATTTTTTACAGAC TAAAAATG CCAAACAGTTGC CTGCAACTC AAAGGCAAGTGATTAAACATCCATGATTTA( N} xGTTGCGGGGAGGATTACTCTCTCAACAGCAGACAAATGTAG CAAAACATCTATTAATATTGTTCCCTGAAGAAGAATGAGTTAGTATAAAACTGGGAAGGACAACAGCCATGCATC TTT CT CTGTCTT CATAAAATACAG CCTTCCTGAAAACTGATGTGATTACTTGTGAAGC CAATGTTT CT CCAT TTT TTTTTCTTCTTCAATAGCTGTTTGGGCCTTAATGTGCTTTAAATGTGTAATAAAAATATAATGAAAAACATACTT TTATTAAATGTG GCTG CTA CAACATACACTfiTAAAGC CTGAATC CTATAAAGTTGAATTGTATGGCATTTCTTAA A C CTGTTCCTTCTACTTATAGCTG CC CTA CAATATTG GTCAGAATTTAGTGCTT CTCTCTCC CACCTTTGCCTTG AATTACTATGCTTGAACTTGCAGTTCCCATTTCTGCTTTTTCCTTTTTCTTCCATCTTACCTCCCACACTATACC CTAATCCCCCTGTTCTTCACACTCTATCCCCCAATATCCCACCACAAAGAAGTACATACTACGTGTGACTTGTCA GAAAACCCACAC CAAAAC CTCACAATTGGG CT CCTGTAT C CAAAAACAACCTTTTATATAAAAATGTTAAGCAG T GGAGTTAGAG<N) xGGTCAAAGC CAGATCTCCAGAGCTGGCATGCCTGGGTTCACAGCCAGCTTTGAGATATCGT GGAC CACATT CCTCTCTGGCATG GACTTTC CT CCCTG GAACATC CATC CAGGG GTGTC CATTGACTGCAGTT G CT CCCAGTTCTCATGCTT TAGAGAACAT CAGTGCTTTTCC CATCTCAACC{ N ) xAACCCACCATAAAAGCCATTGCT GAGG CCTCAGGGGCAG CATATCCT CACGCATCTCCAGG CTACCCAC CCAACCTATGCCTGTGTGTTTC CCTGTC C TCACTGGGACACTTGC CTTCGTGATTTGAGGACATTTGATTAAC CCAG TCTAAGG GCTGAAAATAA CTTTTC TT C CTTGAAAACAACTGCTG(N) xCACACTGTGTCAAATAAAGCACCCTCAAAGGAAAGAGGGGATAGAATCAGAGTT AG GGAATATT GAGATTTTACATAGAAATGTAAATACCTTGTTAAAG CTGCCATGTTGCTGTTAGAATACAATTG C CT CAACCAAAGG CAAC TGAAAGTGTTTGAATT TGGAATACAGAGTATT TATT CTTCTGATAGAATTGC TCTGGGA TGTTTAAAGTAAGAGCAGCTCTCTACCTTCGAAGTTTATTAATGTTACTAGGCTGTCAATCAAGCTGTTTTCATT T CTT TTGTCC CAGATGAAAATGTT CTGCTTTCT CTGCACTTAAGACATATCATTTCTC CT CACTAGTACTGCAGA GAAAACTGTC CTTGTAATGCCATAT CAAGTTT TTCATACT TTTGTTTT CTTT CATGTAAACATGGTTGACAC TTT TAAGTTAGTTTGATCTATATTTTATAAGGCATCAAGTCAGAGACAAAGATATTTTTAACGTTAAGCCATTTCAAA TTTACTCATTTTGCTGCCATTGGTTCATGCACTCATTCAATCAACAACCAGTTCTGGAACATTTATTATGTCATT TGTTAAAGGTAGG(N)xAATAATTGTGTTGCAGAGACCATTGGCTATTTGGG(N)xGACACAGTGATTTGTTCAA AAA C CACACAAT CAAGTTTTTTCTAACAAAAT GAATTC CAGGGTTTATTTGGGACTTT CAGGA(N)xAGCTGAAA TAAGATATTTTTTACTGATG(N ) xTTATCTTAAGTTTGGGCCATTTTAAGGATCACAAAATCAATATTGCCTTCC TT TAAG ATGTAT TT CAAGTCAG CTCTTGGC TTTT TACTTTGG CTAAG GGACTTAGTAG CTGACAACTTGCTTTG C TGGCCTAT GGAAAGTAAG CC TAAAA CTAGAAT TTATCAAT GCTACTTGTAAGTTAATT C CATTACAGGTAATAGT GGAGTATTGGGT GT CATC CATT CATCCAGTGGTG CCTGGT CT CATTAAT CAC CAGAGGTAATAGTGCAGAAGGAG TT CT CAAGTCAG CACAGT CAGATT CCTTAG CAAACTGATACTTGAAG GAGTAGACACTGTAATGTAATACAGCTG CCTGATCAATGCAAT CACAGCTGGGCTCACAGCT CTTCAATCTTTTATAACTAAACAG CATAT CAAGCATTAGGG GATTTATTTAACAAACATTTTG GAATGT CTAATGATCTTAGGACATGG CGAAG GTAAACT CAAGAATCG G GTATA T A ( N ) xTCGGGTGTATAATTCAGCAGGATACAATTTATGCCTTTAAAGAACTTAACAATTAGTGTAAGTGACATT GCATA CATAAGAGAAGACAAAGGGAGAC CACAGAAGGCAT CTAGGCAACATT CATGTCTCAAT(N ) xAGTTGGAA TTATAGGAAGGCAAGACTGAAT CC CACTGTAGAGATTAG GAAAG TCAGTTGAACTGGCAC TTGAAGAGTAACTAG AATGTCTT CACATAAGGATTTGAGTAAC TTTC TCTGCATATG TAAGAAGTGACATGAG CCATGT CTGAGGTGTCA AAG AAAGAGTAGTGTGGT CTGG CTGAAG ACTATAGG AGTG GA GGTAAGGT TG GAGAC A GATGGC CAAAG G CTTGG ATGCTACA CTAGTATTTGGAAG TTACTTGGAGTG CAAATA GAAC CATT CACAGTATTATAATATAGAAGTGATTT TATAAATG TCAT TAGCTGGACT CT CTAGAAGGAACAGGAAGAGT TAAAAATAAATTAAAATAAGAGATAGAAGTT ATATAGTCAAAACAT
> H s 3 _ 45816265 - 4582 852 2
CATCATTAAGGCATAGAGAGAG CTATGTGGCTAC CAAAC CAAGACAAT GG GAAGTCTTGGGGTGAGGCT(N ) xTG G GTG CTCAAGCCGTGGTGGGTCTACTGT GAGAAGTGACAGTGAATGAGACAGGGTGTG CTAGGCT CCGAGACAT C TG CT GAAG CTTCAGGCAGGTGGCG GGTGATGGGAA CACTGAAAG CAAAC CTT CTACGTGGAG CAGACTAGAGATG AAG GAACATGTCAG CCCCAGTGAAAGTGTGAG CCTTGGGATG CACCAAGAAAGGATTATGGAGAT CAAG C CCGAC AG CT CTGGAGGGTGATTGGAAG CAGCTTAGAACAAA{ N) xTACCAATGTGCAGCTGAGTTGGAGAGCCAGTGGCT AGGGTGTCTCGAGGATACATACAATCTTAAAGAAAAGCAAACT CACTGTCTGGTTGTCTGGGTGGTATGGATTTG GGGCTGACTGCTA CAGAAAAACTTTTTTTTTCTAAGAAAGTC CTTGTTTTGAGATTTAGGGTTG CAACTTTAAC C ATCTCTGGCCCTCTCTATAGTCAT CTAGTAGGTGACTTGT CC CAGCAACACCAT CTGTGCCCTCTGCC CTAACAT AGCTGCTG CCTT TC CACTGCAGAG GGCACT CC CT TGGCATTG CCTAGG GT TGGAGACAGAGG CAGCTG CC CCTT G GTGCACCCTCCCAC CCAC CAGGAG CACCGGGCTT CTGTGC CT CACTGT CC CACCTTGC CAGTGGACTCGGTGCCA CG CAGGATGCACAGGTACAC CACCAGCCAGGC CAGGAGGAGG CACAGCGC CGGCTCCCACTGCACACCCCCGTTC TC CTGGAGGGACGG CGAGATATTGAGGGTTTT CCTGTAC CAGAAGTACTGTGTGGAGGACGC CTTCTCACACTCC TCATCGTAGCCCGTGTGGTTACCATTCAGTGGGCAGACAGACCACGGCAGGGGATCCTGTGGGACCAAAGCAAGT GTTATCCAGGGAGGTGAAGGCTGAGGACAAAGGCTGTGGCAGGGGTGAGGGAGGTGGCCCTGAGCGGAGACTCTT GGAAGAGAAGGGTG CTGGTCCCCGAAGGGCAGGTGGCTGGAGCC CCTGAAGG CAGACTGGGAGTTGGAGAGTTTG TCAGGGGG CACGGGTGAGGCAGGAAGGGGTTGGAGGAGGGGG CTCTACTCTCGTTGCAATTCTACGGGTGTTCCT TGAGGACTACTGAACTCTGTGGTGGGTCCC( N } xGGGCTCAGACATGCAGGACAGCTGTGGGGCCTGAACCTCAT CTCCTGCTAGCTTC CAAACC CAATGTACATTG CAGGAAATGC CATTCG CCAC CAGCACTTAGTGAGCACAGAAC C TGAGACTTACGCATCTCAGGACT(N)xTCACTTATGAACAAAGGAGCCTCAATGGTGCATGCATCATGACCATGA CCCGAGGC CAAGTGGT AAAC AGTG GTGCTG AG ATGGGAGG GAGG GGAGGAAAAGTTACTT CTGACTGTG GGGTAG GAGACCC CAACC CTGCTCTTTGAGGACCAG CCCCACATTGAT CAGTTTTG CACATATCTGGGTT CTACAAGCCCA GATTTAATTAGACAAAAACG CATCATTAATGGAAGCAAGT CTGACCAC CC CACT GGGCTCAATC CCATAT TTCAC AGGAGACAGAACCAAGGCCCGGGGTATCACCTAATGGGTCATGGCTACTGTGCCAGTAGGCAGCTGGAGCCCAGG TCTGCACACCCAGACAGTGCTCAGGACTGTCCTGGTGAGGACCTCCACGCCCACCCCTGCAATACCTCCCCAGGC CACCCTAGCCTATTGGAATGTGTGAGCTGGTTTCTTGCTTTGACCCATATCGTGTTCAAAGCTGGCCACCAACAG TCCTTCCC CTTC CCATGTGC( N ) xGCAGGTGACAGGGGTCCTGGTAGACCTGAAGCCCAGTTGAGCTGCCTAGGC TGACATCCCCTTGGGCCATTTACAGCCTGTGTGGGACTGACCTGAGTCTGTTCCCTGGACTACCTGCTGATGCCA GACCGGGCCTGCTTCCCAGGCTTGGCTACACGACTAAGGCACCCGGGGG(N)xCCCTGCACCCCGCACCCCGCAA CTTCATCTCCCCAGTGTGCAGCCAGGTTGAAGAAGGTTTAGCGTATAGGAGAACTTGTTAAATGGGCCAGTTCCC AGTCTGAC CTCC CT CACT CACCT CTGAACCAG CAGGTGAG GACAAGGATC CC CG CATTTGTGTG CATAACAGCC C AGGTGCTC CTGC TGAGAAGGGC TGGACACAAG GCTGGAAAGC CTGGCCAGGGAGTGAGTAGA CAGGAG CACAGAC TCCGGGGCCAGGTCCCTGAGTCCAACCCCACCAGCTATGAGCTACTCGGCTCCCCTCTGCTCGGGTGCTCAGTGT GCTCAG{ N ) xCCTCTCATTATTATGGGACCATGGAGGAGGGCTTCAAAGTAGAGGGACTAAGGGATAAGTCCACA GACCTATGAAGGAT GGAATGAATGAATATTTAAGTGATTTTCATGAAT CG CTGG CCCC CATACTTCTGACT(N) x TTTG CTTTTGGGGGATGAG CAAAT( N ) XCACAGAGGACAGGAAACAGTCACCAGGCTATAGAGTAAGAAGAGTTG CCCTGTTT CCACTG AGAAGG CTGGGGAGAGGCAGAGTTGTGGCCAGTGTT CATG AAACTCTAGGTTGAAC CAGGG AAAGAAGGAGGGA CTGCAACTAAG CTCAGGTTACAGCTTT CTTGCCAGAGATAC GTGCAATGGGGCAAGCTTGGT GGAG CTCATTGG GAAGCCAGAGTGGGGTATGC CAAATCAGGATTATTAAAGT CATTCACC CAAG CATT CCTCACG CCCC CACAGAGT CAAGGCTATG CC CAAG CACC CCAGAGG CAAAC CCAGTT CAGT CCATAGTC CATGCCAG CCCTA TCTTGCTTGCGGACCTTGGGGACAGGCATGTCCCATGAAGGCCTGGCTCAGGGGCAAAGGGTAAGGCATTGGCCT GGTTAGCT CCTG CCTCCTCCACTGTTCC GAAC TG GGAAGAAG CCAAGGTGAAGGTGGTTC CTGGAAGCAGGTGGG GGCAGTGGAGGATGGGGTTCCTGAGAGTGAAACGGTAGGAGCCAGGTAACAGAAATGAGCATGAGCAGTGGAAAG GAAAATTC CTCTTGTCCTGAAT GT A C AC CAGCTCT CTGTG TGGAATTGGAGTTCTTGCTT CT CT A CAATT CT AG C CCCTTCCACTTTTCTACAGAGTGTCAGCTGTAGTTATCATCAGAAATGTGTGGTTTAAAATGTTTAGGAATGGCC CTATTGTGTTACAGGGTGACAATGGTCCTTCTAGGAGAACTTAGGGGTCTGCAGGGAAAAGAAGAGGCTCAGGCT GGAGTCAGAAAC CTGGAAGCAC GAGTGGCC CTGAGGAAAG CGGGAGAAGGGGAC GGGGAG GAAGAGGACAGGCCG TTGCTGTC TTGCTGGATTGAGGGAATCC CT CC CC AC CAGTCCTCTCCACñGATTTCCTCCTAGGCACACTGTG(N JxACGTGGTTTGGGGTGCATGGCTTCCCCTCCACCTCACACCACCCACCGGCTCCCCGCCCCGAGGCTGCTCACC TTGCCCCACACCCCCTTCCCCGAGCGGGTGGCCCTGTTTTTCTCTCCCTTTCTCGCTCCTACTCCTGTTCTGGCA CGGG CC C C CCGGCTCACCTGGAAGGAGTGGAAGAG GTAC CAGAAGGCC CAGG CGTTGATGACGTTGTAGTACATG GAGAGGAAGAAAGAGACCAC CACGCTGGCGACCCCTGC GAGGAAGCAGAGGG CCGCGCTGAGGACTGAGGATGG C CCTTCCCCCT CCGCCAGG CC CAGCGTGTGCTC CCAGGACACACCTGTGGCGACCGCCC CGGG GTGGGCGAGG CCT CCCT C CAC CAGGTTTGTGTG GTGAGTAT TAAG CGAGAC CAGG CACCTGTGAACACTTGGGAAGGGTGATTTCTTT TTCTCCAACCTCCCTCCTCACCTTCTGGGACAGGAGGCTGATG(N)xTCATGCTGTGAGAAGCACTGTTGCTGTG AAGTGCTACCTTGTCCAGATGTACAG TTATGCTCCCTC CTTGGCTCAC CAGCGGTTGAGT CACCTGGTAG CCAGG AGAGGGGGTGTGAGGAGG CGGTGGGGAG GGGGCTGC CCGTAATTCGGT CAGG CCTGCACAAAAC CCAGGGTGCGG CCTGAGAGCACAGAGAGGGGTGGGAGG(N ) xCTGGTTGGCACAAATATCTGGTATCTTTCCTATGATGCAGAAGA TGCCTCTGAGAAGTCCTCTGTGATTG CT TATT CAGG CAGATTGTAAAA CTACACACTT CTGTGT TAAAAATTAAT AG CC( N) xTTAATAGCCATTCAAGACAAGGAGGCAGTCTGGGTTTGGAGATGATTTGTGAGAACACTGGACACAT TTCTTCTG CATCTGCAAC CAGGTCACATATAAG CTT CCAC CCTCTTTG CT CCTGAAACGT CACT CTGCTTTC CTG CCTG CCTT CCTTTCTGGGGTGGGAGGGGTGAG CATCACAGGCTATGGC TACTTCTGGT CAGGGTTCCTTCTG CAT GCAGAGGCAGCCCTGGGT CATAAGCAATGTGCAGGG CAGGTGGAGGTTGT C C TGGAAGTG CCTGGGGCTCTGGCT CAAGGT CACT CAGGAGAAGCATGAGATGAAACAGTC CTCTCCTGCTGT CCGTGCGG{ N) xTAGCAGGGAGAAGTT GGTGAGTTTCCAGAATGGTTGTTAGCTAGGACTTGGAGTTACAGC(N)xTGAGGTCCAGAGAAGAGAAGGAGGGT GGCATCTGTTCCCCAAGGGGCATGCATCCTCCCTGTCTTACCGCCCTGTGGGTAGGAAACCCAGCTCCTCTCAGC CCACATTCTATGTCATCCCCACCAGGCCATTTGTTCACCCCAGGCAAGAGCTCTCTCTTGGGCCTTGTGCTCGCC CCTTGATCCAGCAGCTTTGCTGATGGGATCCCACCCACACCCCCATGCTGCCCGGGTCTCTGGGTGGGAGAGGAC TGGAGGGGCCCCACGTACCGACACCACTGAGGTACGGGCTGATGGTCCTCCAGGCGCCGATGCTGCCCTGCCGCA TGCGCTGCCC CACAGC CAGTTCCAGGTACAAGAGCGGCATTC CCTCCACGATAAGCATGATGATGTAG GG GAC CA GGAAACTACCTGTGGATGTC CAGAGCTGAGAG CCAGGC CACCACCAAAAGGCTTTTCT CCA CGACCACTCAACCT CTGTGCAAA(N)xTTTCATCTTTTTATCTTCCTACTCAGCCACCTGTCTGTTCGTCCATCCATCTGTGGATGACA (N ) xTAGTTGATCAGGGCCAAGACAATTGTGTTGTTAAATATGTCAGACATCATGCCTGGATCCATGCATTCAGC TGTC CACC TCTATGCACT TC CTTTGG GC TTTGTGCACAGC CC TGATGGTG CT C CACAT GAGGTAGTGTATGA( N ) xCTCTGGGCCAGCCCCTGGCTGGTGTGGAATGGGAGAGTAGCAACTCTGCTCCCTGCCCTTGAGGTTCCCTGCTG TTG GTGAT GGTGGGGAG C CAGGGAGTGTGAATTTTGTCAGGGTGAGCCTAACAAAATGTC CAGT GGGAAAAGAGG GCAGGACTTCTGATCAAATAAAACCCAGACACTACAACTGCAAAGATTAAGGGCTGATTACAGAGCAAACTGGAT CCTCAGTTGTTAAGGTAAACAACAAAATAACTATGCTTACATGAACAT CTAATATTTACCAAAAATGT CT CCATT GTACACACGTG CTGTG CTGTAACCCACT CTTCTCAC CCTATG G CATTC CC CAGGCACT CT CC CAATCACAGACCT TTGCAT CACC C C { N ) xACCAGGCAGAGCCCCCATGGCCACACCACAGAAAGAAAGGCATGAGCCGCTTGCATGGG TGGATGAGTGCCCACACCACCAGGCTTGGAGTTCCTAGCTTCATCCTGCTCATGTGGGGGCCTAGAACAAATCCT GCCCTCTCATGTCCAGACACTGCTATAACTCCTCTAGGCCAGGCTCCCTTCCTTTCCTTCAGAACCTTGTTTCAG CACCTTGTGAGTTTTCTCTCAATATCAAAGGAGTCCAAGAGGTCATTGGTGGCTCTCAATCATGAAATATCTGAC AATAAAGATTGGGTTCA CACACTGAGACTTTGAAGC CACATT CAAACC CT CAGTGGTTATAAGATAAT CCTGTTT TTACGCGGGGTGTTAAA CAGTAACTATGTAAGGCCTGG CAAGGTCCAGGGACGACTGT CT CT CT GACAGGGT CTG ACTG CC CAGTGCCCATG GGTAGGGTG GAAATCTAAAAC CTGATGCCAAGTGGGAACATGGGGGCTGTTTG GC CCT AGGCATTAATCATTTCTGTCACTCCTGACCCAACCTATCAAATAAACACATAATCTGGGTGAATTTGCTCTGCAÍ
N)xTGGAAGAAAAGATTGAGAAATTGGATTTCGTTACTTCTTTGTGAAAGACACTGTTAAGAGAATAAAAACAAA AACTATAGA C(N)xGTAGTAATACAACTACATGCCTTAAAGGCATG(N)xTAAGTCCTGGAGTTTTCAAGTGGTT TGTTATGAATAAATAGGTAACTGAGGCACACATTGAGCCCTGTGGTAATGTCTCTGCCACTGTGAGTCCATATTA CCCCAGATGAGTGGGACG CTTACAGT CACAGAATTATT G GAAGATGTAGACT CAGCTCAG C CTGGTGAAT CCTTA CTCATCACCACCTCCCTGGCACCCACTGTCACATCTCCACTGCACCGCCCTCCAGGCAGGTCTTTGTGATTCCAC
C
> H s 4 _ 31023419 - 31032 80 6
TGTGGAAGGGGAAAAATATGGTTGGATGAG CAT CTAGTGAGTTGCTAT GA CAGAAGAAAATT CACTGAGGAAGTT ATTTTCTTTTGGAAATAAAAAGATACAAAATAAAAACAAAACAGCAACGTTCATCTTTGTAAAGTGTATCTTTAG TTTTGAAATTATAAGGTGAATGTACCTTTGGTTAGGTTAGGCAACTTAACACAGCTTATTTTGCTTAATGGATGT TCTAGGGATAATTATTGT CGTTTAAATATGTGAAATAAAGAACACTTC CTTTGAATTACCTTTTAAGGAAGT CAG TGTTTAAACTGCAAAGGCTT CTTAATAGTTAATAGTAGTGAGAGTTGATTTT CTGTAG CAGTAGACTCTGTTTG C AACAGATTGTATGAGAGTGCCCTTTGGGG(N)xCTCATGTGTTTAATGTGGAAAACACCTCCTAGGTTTACTGAT AAAG G TAT TCTTAGCACCATGGAGATTTTGTGATTT TT CACAGTTGATGATGATTATTTC TAAC GGTAGAGTAAG TAAT GACTTTAAAAATT CACACTATAAG CTTGAAAATT CCTAGTGCTTAGTT TTATTGTACATAATTT TG CTTAC TGTATCATTAAATCTATTTAGGTAATGAGGCAAAACATTTTCTGAAGCTTGGTGGGAAAGTAAAATACTTTAAGA ATAATG CATGACTGAATATTTCGTACAT TT CTTATATGTTTATTTAATTGAAACGTACTC TTG GATCAGAGATAG ACC AC AT AAATGTC AAAAAG AGATTT TAAC CCAGGTATTG C AAGTGTAAAT C ATATTT AT CATC AGGAGAGG TG A GG AG AATGTTTAAACCTT C CT CCATT CC TACTTTCAGTTT TT CCAT AG AAAAAAGT ATGAAT AC CAAATTTGTCT GTATTTTGATAACTGTTGATTGAAAACTATTATGAGTAG G CTACCTG GTTAGAAAAATñCAGAGTTGAGT CAAAC TTGG CC CTTGTCTTT CAAAAAGCTTACACATAGGGGTAGAAATTATGACTAAGTGTCATAAG CTGCTGTG CAAGG TAGAATAATACGGTGAGTGTGATAAG GAAAAT GAAAATAAAATAGACCAACAGTTAAGATGAAGGAAAAG CACGG GAAAGATAAAGGAGAAA CAG GAAATGGAAGTAA CATAAT CAGGTAAAT TATG CCAACTAT CTTTGCT CAGAACAA GGGGATTGCGTT CATAGGTAAAGGAAAT GAGAGCGAAACCGG CAAATGTT TGCAATGGA CATTGAAA CAAAACT T AAGACATGTTAGTATGGCA CTT CTGAATTTAAG GAGATAGTGGGCTAGGCAGTGAGGT CAAG GTTTAAAAGCAGA GCAT TTAATTGG CAAAGCATATGTGGAATGAAAAAGTACAGT CTTTGACT CT CC CATT CAGAC CTGATTATTGG C AACACCAGAATCG CAGTTTG CATG GAAG CTGGTTATG GTTTAAAGAGGTCTGGAG GTTGTAAAAATAGACTATGA CACT CAGATGAGGCTC CTGCGTGTATTT TATATT TCAC TATC CCTATT CTTGTC CTGACATCTAAAGCTT CC CCT TCTAAAAGAATTGC{ N ) xGTGCAGGTAAGAAACTAAAACAACGTAAGTATGATAATGATCTGACATGACTATTCA TTTCGCTGCTCT CCTACT TT CATGTAAG GGTCAATTTTT CCCTCCTTTTT TGTATT CCTCCTTCCTTCCTTCCTG TT TAAAAATTTTATTTTAGAAT TACAGAAGAATGATTGTACATAAAAG CTAGGCTACAGGATTTCTCTAC CTTAC AT TCTATTATAG CAAAGAAAGACTGAATA CAAAA CAAAG C TC CTAGAAAT CATACT CA GGTTTTAGGAGGAGTTG AATACAGAAGGAATGT TCAGAGACATGTAAAAGAGAAAGCGTTC CAAA CC CAGCAAGGACAAAGTAGTAGAA CT C AAGGT CAGGGAATGTAAT GAGACA CAGCATAAAAGAAG CA CTGT CAGA CGCTTTGGAGACCTTGATGTCCATTTG GGTGGAAT CAGGGATT GATG CTTTTCAGTAAGAAGAGCATTAAAA CTTTGAAAT GC TAGTTT CTGTGATGTGGAA GGGTAGTGATTTGTTTGAAGAGAGTT CTGTGT TTGAGGAAAGATATAC TC CAGAAAAGTG GAGTGGAATAGAGAA CTGTG GAGAATAATATTTGAAAAATATG CGCTTTTT TCAGAG CACT CAA CTAAT CATTGAATGAAACACAAT TT C CTTTAATACGTGAAAAAAC CTT CAGGGTAGATATTATT CA TA TT( N ) xAATTATTCTTGTCATTTCTATTTACGA AATACAGAACTTGAAG CT CAAGAAAATT TTAT GATT CTAGATTATGACATAAATAAATAT TAG AAT CAGTAC CAT AATTTAAACCTGTTTGTGTATAATACTGAAGATGGATATATT CTTT CCAATTTATCACAT TTATTTTTCACTTCA AAGACTTT CT CAAAGCAT TTTTTTCTTT GAAT CTACATAC TATAGAGATATTATT CAGAGAAA C AATGTGTGAT T TTA CTTATAATTACTAAAGAAATT CT CATAG GAAC CTGGGTTTT CTGTGAC(N ) xTGTGTGACTTTTTAAATGTT TAAAAG TTATT CAGTGAATAAC CTTATG GAGAAATC GTGATCCTGTTTATGGTG CTGGTAATAT TTGTTATTTTT TTAG CTGGTGTAATGAAACT CACAATTCTGTG TCATTTTCTCTCAAGAGATATC CAAGAAAAGGAAAG GTTAACA TACATTTTTT CAGTATTTTCAGAGAAATTGTCTCACGTATAGGACATGTTGAATTTAACAT CTTGCATAG CATCT GCCCTTTCATTTTTAGCTAC GAAAAG CTAAATGAAATGGCTGTTAGTATACAAACAGCATTAGGAGAGTGACTT G TTTCTTATGTAGTTCTT CTAATAATT GGAAGAAATC CCAACT CTGG CAGAGTGGAAGTAT TTGGAG GATT CT CGT TTTTTCTTTTCTTGTTGTTCAGCT GAAATGTTTTGATC CCAC TTAACAAG GATT CAGT CT TG CGTTGTTCTTTAA GTAC CAAGGAGATATATGAGAT TGTTTTGAAGTAGATGATTTATTTTC TAATTAT CATAG GTAGTTAACAGATG C ATGAAGAAAGAAATGTTT CCACTTGC CTATTATT TACTAGAAAATTAAT CTAAAAC CTATCTTTT CACACTATAA TATTCATCCACAAT CATG CATTTTAAGTGCTT CTTGTTCTTTCC CAAATATAT CAAGAAC CAGT TT CT CCATTAT TT CAGAGATT CCAAGC CTAGTGATTTTTAATTAAGATTGCATTTTG CAAGTC CAGT CTTCGTAAATGTAAAACTT A CAGAATATATTTTATGG CACC CACTGAAAGTATGAATATAACAAAAACC CTTT CT CTATTAAAAT CTTTGTGGA AAATGCTGGAAGTTTC CAGGACATACAGACTAAAAAAG TAACAATG CT TAACATATAAGACT CAAACGTTTTTAG ACAACCAC TTGTGGGTATACAT GTTTTTTGAATT TCACATAG CTATAATTACTAGTATGTGT CAAACAATTTAC C ACGCACTGTTATATGC CATGTCTTACACTG CCATTACTGAGAAATT CCAAATAACATTGAGGTCACATTGAC TTT GGTACAAC CAAAATATTTGC CAG CAT TAGACTTT TATCAATAAC TCAT TAAC CTTCTAACATTTT CT C CCTT TAT AC C CACTAAT CTATGACATT TTTCTTGAGTTC CT T C CAATTTTAGACATTTTATTTGAATTATCTTT C TAAGTAC AGTGTCATTTGCCTCTATTTTCTGTG CCACAAGTTGTAC CTTTGATGTTT TTAAAT TTAAACTTTG CATTTTGAT GTGGAAGTAG CAAGGATT CC CGAGGCTATGAT CTGATC GG CAACAG CCAGGCTAGCAG CAAGG GAAGTTGAGATA A CAACAGTGGG CATAC CGAAAT GAGCATACTGTTA CAACACGTGTT CGTAGTTTTATGAACATATTAT GGAG CTG ACAGATGC TATGATAAATTTGTATGCAGATATTGT CTG CAGCATAATTTTTATGTATT CC CTTTAAAG CTGAAAG TGTATATGTCTGTGAGCAAATATGTATTAGCATACATGCTTTCCTGCCTGCCTAGTTTTCTTTCTTTGGGACAAA ACATTCTGCCACCTTGGCTTCC CACATAATTAAAAGTTTCTACACATTATTGATATTTGTAACACCTCTTATTTT TGCCATAGAAATATGTTGCACTATTTGGCTTTCCTTTGGGGGGAAAAAACCTGCACACACACATAGACACACATA CACACCTGTGGACTCAAGTTTTAAGAGCATGGCCTAATGGTGCATGACTACCACATTAAAAGAAATGCATTCAGT GATGTCACAAAG CTGATTGC CT CAGTTGAT CTATAATTTG GTTCATTAATAAAAGT CAGC TTATTG GTAGTATT C TCGCAAAG GCTT CAAATTATTG CATTTTAAGTAAAAATG G GAGTAAGATG CTAACAT C TATTTCTAAACT TTTTA AAAAAC TAAT CTAATG CAATTATAACAC TAGAAT CCTCTGCT GAAGTC CCACAAACTTTCGTTCATGATT TG C CA TAAGGAAACC CAGC CC CAGCTTTG CTGACTAGTAAGATAAATTTAT CTTTflC CAGACC CAACTCTTTCTG CCATT CTGCTAAGTCTTGGTGGAGTAG CTAAAT TAAT CC CC CCAAGGACTCTT TAAATTGC CAGCTAAG CTTT CTTGGAA CCTCAT CACAAATA GG CATTATGTTCTTTGTGAGAAAAATTT CG CT CTG CATTTGATCAGACAC CTTAGTTGTTT GCTT GATACA GAAC CCTT CACTATTCTTTATAATATTCTACATTATTCAGTTTAAAACTC C (N ) xTTGACATTAA AATGGATTATTAAAAGTGATTATG TCTCTGACTGTTGGTTCCCT TGAAATTACTGTGTGTAACA TTTCCATGTGA ACTAAC CTGCTTTGTATCAATC CTGCTTAT CCTTTATT GGTATTTTAT CAACTT GATTTAATTTAACTGTAGAAfl TGCTTCAGTTTC CTGTGGAAATTAAATTAG CCAATTGAAT TTTAG CTG CTGCAACTTTTTGTATTTAGATGACTT AT TAATGACCATGGTCAT TT TGTACC{ N ) xCAAGAGAAATACTCTAGAAAGTATTCTTAGGTAAGCCAAATTATT TATT TTGTATGGTACTGACT TTGAGTAG CC TTAAATAATGTC CG CTGACATAGCTATAGTGATATTAG GATATAT AAAAAGTTATATGATATG CAGTGT TATACACAGGAATATT CCTAGAATAAGAGGAT TTAAGAAT GTTCTTTTATT TAAAAT CCCTGGGTTTTTTTGTTT CAAAGTTC CTGCAGTACACCACA CT CAAGCTTAATTATTTAAATGT GAAAA TCTT CCTTAAAACAGAATAGAGGAAGTTAAGTGT TT CATTT CGAAATTA C CCAAATTGT(N)xG CT TCAAAAAAA AAAAAAAATACC CAAACTTTAGGATTAATATTAAAGGTAT TGAGGTAACATTTTAAATTTAACCTTA CATGAGCT ACAGTTCT TCATGATGGATAATGCATGAGATGAAAAACAT CGAAAAACATGCTAAT CTTC CAAAAAGAAG CT CAT T CTAGAATAATTTGAGGCTTTG GGAGATAATATATT TATGAAAAAAATAGAAAAACAAATTATTGTGT CATAAAA GCGATTAT CC CAGTAATGAG GAGCAACCCCAAACTGGTTGTGTGACATTGATATATTTTGATGATTGACTATGC C CAGAACAG CTGATT CAGCAGGGCAGAAGAAATGTAGTTTTGG CAAAGGTGG CAAGAAATT TCAGAAA CAATTA CA AAATTTTATCTCTACCACCTGAAGTGTAGCAAATTTTAAATATTTCTTTGATTTTGAAAGGCCTTGTTAATCATT T CTTTGAT TTTGAAATGCCTTGTTAATTCTTTTTA CTG CTAATCATTTGTGCTCT CTGAC CAG CATAAAACAGCT ATAGAACATTAATG CAT CACAAGCAACTGTAAC CAAATTAAAGGCCTATAAGTC CATTTTTTTGGTGTTG GAGAT TCTGTGCCTATTTCACCTAATGTACTTTTAACTAACCATGAACTAAAAATAGTAGCTTGCTTCTGAAGATGGTGG GT CT CAAACAT CAAAACAGAGTCAGAAATGTC CTAGGTGCCCAGACAT CTAGAGGGGCGCTCAG CAGATGGGAGT CAATGGAGTCTAGAACAAAGTGTTGGATTAGAGTGGGTTCTGAGTGGAGCATGAAAACCAGATATTTGCAGGCAC AGCTGAAGGCTTTGAGGGCACATTACTCCAGCATTTTCCTGGGAAATAGCGGTGCTATTATAACGGTTTAGCACT GACAAGT CTT G GCT CATAGCTCTAGCAAGGAAGTTTATAGA(NixACTCCATTCATATTATACCCTAGATGGAGT GGTC CAGAATTCTAATAAGTGAGTAGAATACCAGAT GAAGAATCCGGG CAACATTTGGAAACTC CAGGCAG GCAG AT AGGCCTGATACTGT AG AT CTGGTñTGAAGC CCTT AG CTCTGGGTTGGGGTTC AGGC AT AATAAAGTTC CTAAG GATCATACACTAGT CTAGGC CTCAGT CTTAGAATGGAC CTTGGCAACT CTGTGGGTTCTGAGCA TAAAGACAGAG AAAGAAAGACTAATGCAGAAGTAGTTTAGAAAACAAAACAAACCCACTAGACTAGGCTGGGTTCTAACAAGTCTT GAAGAAAAGCTCAGTT TTCTAGTCATGAATGAATTAGCTACT CAAAAGACC CAATGCAGG G CTACAAATTGTGAA TTTG TAG CTCTGCAGT CATTAGTACTTCTAACTATGACTGACAGGACAGCCT TTGTGTGG G T(K )xTTTG TG TG T GTGCATGTGTGTGTGCATGTGTGTGTGCATGTGTGTATGGGAGTGGTGTTCAGCAGCAAGACTGCCAGTTATATT CATTTGTCCTATAAGACCTACCCTTGCTGCATCATCTCAGCAATCCTTCCTTGTATCCTCAAATTGCTTTGTCTT CT TTAGGG CCAGCTATTTAGTACTAG GATGTTATTCTT C CTCACAAATAATGGAGAAAGACATGTAGTTATTTG C TCAATTACCAATAATAACCCAAACTTTTGCATTTAGGAATGCTATCTGAAGACTAGCCAGGGTTAGCTATAGACA AAGAGGGAAAGAAAAAGAAAAAGAAGTTGAAGTTTGTT CTCCAAATTGTGT CTT CAGGATAAGAGCCAAGTTA CA CTTTTATTTACATATAATTTTAATCATGAACAGCATAATTTCATCTTATTTTAAATTATTGGTGCTAAGAAAAAA ACAAGTAGAAAAAAAGACTGAAAGTTTTAGTGAATTTCAATTACTTATGGCTTAGCAGCAGTAACTGAAGAGCTA ATTTGTTTAGTAATGAACTTTATCTGCAAAAGAAAAGACAAGACAGCTACTGAATTCAACAACAATGACTTACTC TCCAGTTTCAAGCAAAAAAAGACAATAGAACTTGTCTATACCATTCTTCAGATTGGTGAACATTAATCATGTTTT GCTTTTTTAAAAAACTACAAGACAGTTGTATAGGGG CAAAGAACATGT TGATGAT CATAATTTT GTTTATTGCAG AATGAATATTTGATGGTTTCTGAGCAAATAAGAGGTGTAGGGCATTTTCCCAAGAAAGCTCTCTTGAAGATACTG AGGCAGAAAATGCATCATCC CAGATTTCAA CCAGTTTCCTTT CTAGTGATGTAGGTAAGATCTG CACAAAAGTAC GTGTAATATAAAGACAAATGG GG TTATTTGAAGAAGAGTTACAAGTGGTGG TAATATGAGGAGGATG GAAAGAT C ATATTTGATGTAGG CC TGGAAGAGGAAAGATGGTTAAGATTTTATCAAGCACATATAAAGTGAT TAAAC
> H S 4 _ 187581788 -18759 023 9
GGCTGTGGACACCTGGAGAGGGTTTATTCCTTCTGAAGACAACAGCACAGACATCTCTGCAAACACCTTTTCTTT AGAGTTGT CAGAATAGAGAGATTTAAAAGGGAGAGG CAATGC CATATTAATCATGTTT CTA CAACAAGACATAAC ACCATAGCAGACACATCATCATTTCTGGGGTCCCAGGAAGAACCAAGGGTTTTCAAGGCTCCCCACAGCAATGCT AAGC TATATAATCT CCT CAGTTTCTTTCTC CT CCTACT CCTCAAACATATAT TATGCC CAGAC CTATGCCATGTG CT CAGATATC CCAAAGGATATTTC TTGCTTGCTCAT CTATCT CTTCACAGAGGACTTACT CTAATCTAAAAGAGA GACAGAAGACA CAAGAAAATTGAAAACACTGCTTAGTCTAATTCAT CAAACAAATTACTATAGGG GTAGTTCTG G TCTT GAATTG CCAC CTGAGATATGACATATACGTGT GTTGTGGGGAGGGG( N) xAAACATCTCTTTGAGGGTTTT TGTT CACACT CACATTGTGT CAAT CAAGGCTTTTA CTT CTAGATTT CT CTCT CTG GAG GT CTGGGAC CGC CACTA GAGAGGCATACATACATTCTGGACTCCAAATTCATCCTCACAATTCTTATATACCTCTGACTTTTCTCTTATGTT TGACAATGCCTTTAATTTTTCAGTACTTAGTTCTTAGCTCACTTTTGCTTTATTTCACAAGGAGGGTGAACGCAG TCACCACTTGTGCAGTAAAGGAACAAAGATTAGAAGTTTATAAAAATTTCGGAGGAGGATTTTGAGGTCATGGAG CTATCTTCTTGGAAGCAGGGAAGTCAGTCACAGACATCGATGGAAAAAAATATTAGAAGCAAAAGGCCTGGGGTG TCACTTCAGGAAAGTAACCACTTCCAATCTGAAAACTGTTCAATCCCAATCCCTTACCACCTGCAGATACAGCAC TG CT CTATAG GAAACACTTAAAACATTAAAAAACTAGC CAAC TGTGAAATAAACT CTTTTAATACTCTTTACAAA AAGC CGACTTAGGA CAATT CAAATTCTTAG CAGCAGTAGAATTTGCTGTAAGTTTAAGATATAT CACTAATTTAG GAGCTATTACTGAAATAACAATG TTTGCTTTTTAAC CTGAAAGTGAAACCCTACTAATGAG CATAATCGTAAAAT AG CTTTATAAAAGCAAACTCAAAGTTTAAAAAG CACATTTAAGCTAA(N)xTTATGCTAAATTTTTATCCAAAAT AAAGGCTTATTTTGAATAATTCAAAAGGAAAACAATTTATCAAGAAACAAAAGTATGTAACAGATAAATGCTAAT TCTTACTTTTCCCCCTAACAGAAAAGCATAAGCATAAACATTTCCTATAGCAGTATTAACACATACCTGTTTTAG GATGTATTGAAAAGAATCCTTGTGGATTTCCACTTGTAATTTTGTACATGAGCTTGTCATTAGAGCTCGAATCTG GATCAAATGC CTCGAT CTGGACCACAGATACATCTTTAGGAGAATTTT CCATGATTTCTGGGTAATAAA CAGGCT CTGATGTC TGTGGTGCATTGTCATTGACATCC TCAACCTCTATGTAGATCTCTATGAACGATGAAAGAGG CACGA CACCCTGATCGGTTGCAAAGACTGTTAGCCAATAATGGGAGGTCGATTCACGGTCCAGTCGATCTGACGTCTCTA TGACACCTACAGAGAAAAAAGAAAAG CGTAAAGCAG CACATCAACGAC CAAAACTGCCAG CTTCAGAAAGTGTCA A CAACTACGG CTTACT GAGAGCTT T A T A ( N) xTTAAAAATATTATCTCACTGAAGTCTCAAAATAGGTGAGCACT AAGAGATCATCTAATGGCCCAACACTTCATTTTTCCATAGGTAAAGTTTATAGTTGGTTGAGAGACGTGCTGACT TT CCTGGAGG CTACTCAGGAAGACAGGGA CAACACCAGAGCCATGG CC CAGATCTGAGAC C C C (N) xTGGCGGCC ACTGT CTAAG GAAAAAG GCAGGAG CAGCAT CCT CCAGAAAGCTCGGTCTGCACACACACC CCTT CTCACAGTGG C TAAGGAAAGAAACTGCAACCTTCAAAGTCCACAATAAGGGGCTGCTCAGATCTCCTGAGTGTTGAGGTCCTTTTA TTTCTTTCAATCGC CT CAAACAAATGGGAAGACAAC CAGCATGCTATAAAAATATAA(N)xACAATAACGTATTA AATACACAAGTGAAATATCTAAGTATCTTAATCAGGGTTGGTATAGAAAAGTCCTCAACTGGGACCCAAACTAGG ATACATTTCTTAGTAAGTCAACTAGCAAGA3ATGAAATTATATGATTACAAAGAAGAGGGTTAGAGTATCTGTAA TGTTTTGGTC CAGC CACTGAAACCAGTCAAAACTTAACTT CAAG CCAGGATTTACCTC CT CTGGGGATAAGAAGT AGAAATATCCATTTCTCTTTGGCAAGTCCCACCTTTTTATTCAATGGTGCAATCTCGCCCTTTGTCTACCAAAAA CATATCACTG CACTTT TT CTCATGGATTGAT CACTC GG GACTAAAGATACTTTCAGTGTT CTTCTACAC CATGGT ACAGGCTG GACT CTGGAAATGCTGAAGCTCTAGAAT CACTGAAGTTTCCCGTTAAACG CAAACTA CTGTGAGAAA AATAA CCAATTACCAATTTTACCTATGCTCAGCAATGAAACATTTGGTTGAGAAAAATACACTGTTGCAATTCGT AATTTTGATCACACGAAATTAAATATTAACTTATTCGAGGTTAATATCAGAAAAACTCATTTTTTGAATGAATGA TAATTATAT CTAATACT C CCAAAAGGAC TCACAACTTGAAAATAGAAAACTTAG CTAAATAAAATGTGAAATTAT CATAAAATGAATCTTTATTGCCATTCAGAAGCATTACAATGATTTGTTATTGCTTCATTATTTCATACTGAACCT TCAGAAAGGGGGTATAATACTGCTTCTAGAAACATGGAAGAAACCTCAGAAC(N)xAGTAAAAAAAAAAAAATGA AATAAAATAAAATAAACCTCAGAACCAAATAAAAATGAGCTAACTGGTTCCTCCGTACAGCAGACAACTGAAGAT GAACT CAAACATGCTT CC CTGTTCTCTAAC CCAAGCATAATG CACACACACTCAACTGGT CTGCTAAATTAGCAA AATTATAT CTGTCTGTAG GCTCAAATAATAAC CTTT CC CAGTTC CTCTGATTCATCACAT CTAT CCACAGATACA AAGACGTCAGGG GATAA CAGAATGTTCT CT CCGGGAGGAT TACACATTAACAAAATGTACA CAACT CCTC CTGTG TGCTGCTACTTCATTAAAATGCAATAAAATTCACTTGACAGGTGAAGTATTATTAGAAAATCTCAAGAAGACAGG TACGTGGTGCTTTTAT CTACCTCTTTTGAGTT CTAATC CT C CAACAATTCATGCTT CCAT CTACTGTTTTAAAAT GTGCCCCT CTAAATGT GACTAGAGGCAT TTACAAA CAGTGAT( N ) xTACCACAAATCTCATCTTCAGCGCCAGCA ATATCCTTGTCAGT CCTACGGCTGTTAT GACAAAAGAGTAAACAAAA CTTAATGTGGTAATTTGG GAAAGATGAG TCAAACTT CCTATTT CAT CCACCTGTTCTACATGT CAGTATT CAGCAGGAATGTATTAAATGTT CACT CATACAC AATAGCTTTCTCTAATCTCAAGGGAAAAAAGAATTATGAAATGGAAGTATATCACCCTCACATCTGAGGTTTGCA ATTACAGATAGAAACAGCTATGATTTTT CCGTCGTGCCTTATTTCTAGTTCGGG CCGTAAGC GTCCAGACTGCAG TGTGCAATTCTGAGATGGCATCTGCTGGTCCCCACCGACCCATCAGCTGTACAACCAGGGGGTGGGGGACAGTCA TGTC CATCAGGAGG GGATTGTCGGCCTC CC CGGTAATCTACGGATATTACTAAAGGTTAAGACTGAGG CAACAAA GCGACT CTAT T CAAATAT CGGAGATTGTGTAA TT CG CAAT TAAAATAGTAAGTATGAATGAGTGAGATGC TGGAT GTTCCTATTT CATGAATT CATGACTGTGTACTTT TATAATGC CTATCGTGATTG CTGAGATAAG CT TCTG CTTCG ATGCATTTTAATTATTAC CTCTTTGTAAGATAAAAAAT CACT GAATA CAACTTACG GAGAACTATAAAGTAAATT TCAACTGCCAAAGGGAGAGGAGGGCACACCGAAGAATCTTflTCTTCTAACCAAAATCCTCAGATCCCTCA( N ) xA GCGGTGGGCTGTTGCCA CTATCAATCAACAGCAACACTG T CGGACACCAAAGGTG CTGG GGACCTGGTGGGGCTA TAACCATGAGGAAAGGCCACAAGGAGTTTCCCAGTGTGGAAGGAAGGGTCTTCCTCATGCATGCAGGACAACAGA CGGGAGACGATCTGAGGATTCCGCTCAGAGTTTTGGGGTTCAGTACCTAGGCGCTATTATAGGCACTGCACAAAG TGGGTGGAAG GAAG CCTTAAGCGTGCCAAGTTATTTTGTTTT CCGCTTTTTAAT CTGT CAGC CCAGTTGCATGAT ACACTCCT CAAGAACC CTGGGCAGAAACTT CCAAAGTGGACCTATGGCACTAGC CCGAGATCTCTAGGTCAAAAC CCCGAAGC TT CC CATCTTGAATCAGAGG CAAGACGAAG GT CCAGGATGGCTGAGTGAGAGT CTAGATAAT GTGCA AAGAAAGACCAG GGGTAGAGCTAGCCTT CGCCATCT CAAAAT CAAGCATTAGAC TGGGAACG CCTT CAAAACCCT CCAGTG CAACTC CCTTGT T CTTAAAGAACGTTGAAATGATTTGT CCAACTCCAAGAAG CTAAACAGGTGTAGTTC AGAATC CAGAGTGAAGGACTTCCTTAAT CAG GGGTT CATCACGATCCAAACCATTC TTTAAGG GAGT C CTGGTTT CTGCAGGGAGTGACACAC CAGCTGTCTC CAAGTATGAC CGAAGC TGCCTTTACT CATTAAGT CATTAG CAGTTCT TGTT CCTACACT TCAT CATGCTTTCACCAGTC CT TC CTAAAG CCACGACTGGCATTTACAGATTA CAGATTGATT TCTTCTGTAACTTTAGCAGTTACCTCAATTCCATAGAGAGAAACAAATTTTCACATTTCCATTGGGAAGACTTCT AATTAGTG CCTTTAACACAGAGAAAAACAAAAATAAAACTAAAGACATCCAGTCAT CACTGGATGT GAAT CAAAC TGTGGT CACT CT CAAACCT CTATCCCTGTCTATTT CT CTT TATC CATGAATGTCTACCTATCAACTTT CAATAAG TACTTTTTTATATAGTGGGTTATAAACT CACC CTGGAGACAAAAGGAACCAACAAAAACTATAG TTTTTACAGGT ATCTTTCTATTTA CACATATTCACGACCTCAC CTTTTTG GGCAC CTATGTATAT C
> H s 5 _ 2303672 - 231243 1
TTTCAGTCTGGTTTCAATGCTTTTTGCAGGCACAGCCGCGGCTTCTCAAATTTCAGAGCTTTTAACCCAAAGAAA TAAGAGG C CAAG CATC CG CTGCTGAAGC CGTCCTTGAGTGCCCTGCATG CATCATC CTTAAAGC CT CAAAATAAT TTGAGAGT CTCATGTGGT CCAAAGCAAACACAGT CAGGATTTTCTCTATTTATCTTAATT TCTCCACCAGCTCTC AGATCTGATGGTCATTAACATGCACATTTAGGATAATCTGGAGGGCATTTCAGACTGCAGACAGGCTGGCCCCAG
q t t t t GCCTGGGAGGG CGGTCTGAGGCCAG CAGGAATACTGTAC CGGGGCTCAGGCCGTCACCCGCAG CCAGCAT GGAGAACG CTGT CCTGACACAAGTCACTGCAT CTTCT CATGGGCATGTTAAAAAAAGAAAAAGGAAAAGAAAGAA ATGGAATTAACATGTTTTCCCATTAAGAAGTTATGAACTGCTGTAGTTCCGACAGTGTTTGCTTTATACAGTTTT ATT ACC CATT CCATATTT CTTGAATTGT AT CT TG AAAT CT CAGC ATCAACTCTCTATTAC AATC TAGCTTTAAAA ATAAACAC CCACTTGTAGATATGTATCCATATGAAT CTGTGC CTGTGTACGTTCAT TCTTAATAAT TGATTTTTG ACATATCACATCAGTTATTTGGGGAAAACGTCATGTGGAAATCAAAGTGCATTTGGTCTCTGGCAAG(N)xCTTC TCAACTAGTTTCCATATCTTTAGCTTTTGAAAAAAGAGGTAATAACTGAAAACAAAATAATACAGAAACCTCAGA G GATTAAAATAGAGTGGAGACAGCAGACATGCA CAATGAC CAAAAATGTTTAACTCG GTAGAAGGTAGTT CACCA ACCCAAAGCAAATTCCTGGCAGCTCCTATCACTGGTTTATGTAACATGGTGAAAACTGTTAAATGTCTTTCAGAG GAAGACTCATGTAGGTGATTACAGGCAGGAACACAAAAGACATGTTGCATGTTCTTTTTCTTTTGAAGCAGGTAA TGTTGTG CAACAAG CAGCATAATATCAAAGTT CTGAG CTT CTTTTTTTGAACTACC CC CTTTGTCTTTT CT CTTA GATTATAGGC CAGAGAAAC CC CAAAACGTGAAAATGTG GCCTTCATT CGAGTTCAACTGTGG CCTCTGCT CATGC TTGGAACGTGAC CAAAGAGATG GTTTTCTATTTT TAGTTT GAGAAGTC CT TTACATTTTC TC TTTT TAGTATAGA CACAGT CAAAGATTT CATT CAT CATTTATATTCTAATGTTTGGCTTGTGT CAGC CATTCTGTAAGGTACAAGTTG AGCAACCTTCTGTTGGTGAACAGG TGAAATTTGGGT GGTCCACGGTAGGCCCAGTGGATTGGTC CATT GATAAAC TCAGTAAGTGGTCTTTGGTCTTTACGTCTCTTTGTGATGGAAACCCATCGTCAGTGATGCCAACACTACCTGCGT TAGTGTTTACTTGGCCCCCCCGCCAGCTCTTCCTAGGGAGTAACAGGCTTTGGAGCAGCTCTTTCCAGAGCTGTG GCTCTACTTGGCTATTTCCATGGTCAGAGAAGGCAGAGGGGTGGAGACAGAGGGTCGGGTAGCAAGGAGAGGACT GGAGAC CAGGAC CCAGCAAT CT CTGAAGGCTTCC CGGGGCTGCTCAC CAGGTTGTGAGAGTCAGGCGGGGTTTCA GTGGAAGC CC CATCTTGAGATC TGCTCTTCTGTC CCAGAATTTGGGGGAAAGAATTGTGTAT T T ( K ) xGAATTGT GTATTTGTGCGTTTCTTTTCCTGCATGAATTTAAAAAAGTTAGAATAAAAATTAAATCTTTAAAAAGGTCAGAGA TCCCCTCCCTGG CCTTTGAAAAGG CACCTCATCATATATGAAG GAAAGAGGC CT CACATOAGGATTGAAGAAATT CACT AT CT CT AG AGCATC AG GAATGATCTAGAAACC AACCTCAGTCTC AG AT CACTGTCTGC CAGG AG GAAAAG C CCAGCTGCTCAGCACCTCAGTGGC CAGTCCTGAACCAAAG CCTTGTTGAGGGCTGCCCTGTGCT TACAGAAAGAA GTCCGCACTGTAGACTCCTCCTGCTAGCTCCTCCCAACACAACGTCATTTAAATCCAAGGGTACAGGAGCCCCAT AGGACATCAñGATTAATTTATGACATAGCAGTCCCAGCCTCTCTTAñCTGTTCTTCCñGCTCTGGGCCTGAGTTA ATTTGTATTTTATTTTGTTTTTAAAATTTCTTCCTAAG( N) xTATATTTTACACATACATGTAAATGCATTTATA CTTAAC ATTATC ACATAAAC AT CTTCTATGTT ATTACAAATATTCTTAAATGTC CTGG AC CATG AG CTTCACTT C ACTCCTGTCTAACGCTATTTTACTTAGTGTTTTCTGGCATTAAGAATGTGAGTCCTTTGGCATCTCCCAGCCCCC CTTCTTTGCCCAGCCACCCCAGTCCAGGGGCGCACATGCAGTAGGGAGCCCCAGGACCATTCTTTCCAGAAGTTC AGGGAG GC CCAGAGCTTGGGTTTAGCTCCAG GGTAGGACAGACCTTTC CCAC CAGCCAGG CTGGATGGGGAACAA TTGCCAACTCTATAGCA CCAGC CAAC CTACTTCAT CTCTGCACGCTAGTTTTCATTTTACTTCT TAGG CCTTAAC AAGAAATAGCCACTTATTTTAAATAAAGAAAATGTCTTCCACAAAATCAAGTTGTGTGTCTCGTCCCTGACGTTG GCAAACATAACTTAGGCTTGAAGCAGGAAACAGAGTTTATGGTGTGAATAGTTCTCAAGCATGCAGCGTCTGTGC ATGCCCTGAATGTGCACCCATTTCCTCTGCCTCCTTCTCCTTCACCCATCAGGAACTGCAGCAAGCAAGCAGTCA CATCACAC CT CC CAACCACATG CAAAGAACCAAAAAAG CAG CTGC CAGAAAAGCTTCGGGAAATTTTCAAATTAG ATCTTCAACTGCAGTAAGAGTCATATTTAAGTAAAAGCACATCAGCCCGGCTGAGCTCTGTGCCACCACTGCACC CCA CAC CC CT CCTGCTTTCTCC CC CAAACAATTT CCAG CTGACGTTTñGTGAACAGñAAATGTTTTATGAGCTTA AGAGAACACAGAGA( N)xCATTGAGGTGTTTGGACCACAATATTCTGGAAAACCTTTCCTCTGGCATTATGCTCT ATACCTCTATCTTTTTAGGACCACACATGCCATTCAAAGAAAGGGCTGAAGCATGGAGTTCAGAAACAAGTGCTG GGCTGTGCAGACAGCCTGGGAATCCACAGGCCCACCCGGCTTCAGGGCTCTCAGGAGCAACTTCAGGGGCCTACA CAATTCGT CTTCTGAGGTTGTCTC CAAACTCACAGGAAGCAAGAGGGT TT CTATTCCAGCTC CC CT CAGGTAGCA A CAGGCAAGG CACGCAAAAGTGTGTGATGGGAGG CAGG CAGTTATTGCAGAAATG GGATAGGTC CC CATGACAGA AGGGAAGAAACCAACAGAGGATTCTTCCCGGGAGCCCCAGGTTCAGCAAGTCACACCTCTCCCATGGGGCAAGGC AAGGCAGATGGTCGTGGGCCTCCTGCAGCACAGACATCCTTCTGTCCACTCAGGAGTGGAGTCCAGGCCCTCAGG CAACTGAGGCTAGAG CACTGAATCAGGGGACTGAACAAGACCATG CCT GAGTTG CAGATACGGC CACCTGGCAC C TTGCAAACACACGCAAGTGACAGGTCCCCTTGCAGAGCAGAGCACTCAACTGCTCTGCCCAGGACCCCTGGGAAA CAGCTCCTCAGTCTGCCGGTGAGGTGGCCACCCACCATCTCAAACTCCACAAGGGAGCACTCTATTTGGGGGCAA ACCACACATAATTAACCCAAATGGACCTGCCATGGTGGTCGTCCCCCCACCCCCACCCCGAGCTGGCAGTCCCCG TCAGCAGATC CCAATGTTTGTTTCAATAAGGGGCAC CCAT TTGGTTGG CTGTAAAAC CCTGTGT GGGTTCTTTTG CAACAATGTTAAGATTGCACAGAAAC CGGCGTT C TC CT GT CCTCTGAGTG CTGT C CTCACGC TG CTGT CCTCAC C TGGGCCTGGGAATACCTGCACACATTGCTGAGTCATGTGTACAGGATAACGAGAGCTCCTCCCCTTTGTGCAGAC ACAAGGAAAGAGGCTGCACCCCATTCATTTGAACGCCTGGCACGTTCTTCATCAACTCCATGGAAA( N) xGTGAA CCA CCATG CCCGGCCTG GAAATTC TTAAGTGCTTAGAGTGAAATGCATTTTGTCAGAAGC CTATGAG GTTTTAAA ACTCCG TATGTCTTTTTAAT GACTAG CTGAATTCTCAC CGATTCGTGAAG CC CG CAGAGTATGATGTACATGTT C TAACAACTACTAGGGCCAAGCATTTTGGAAAAAAAAAAAAAAAACATGGACAGCTTACTTTCACCAAATGTTAAA AGAAGGCATAAAATGAAATGACATTTATTTTACAATGGCGGAAGTCCTCAGGAAATGAAATGTGCTAATACATTT TAGCTCGTATGTATCCTTCTGATCACACCCATGTCCAAACAAGTGGTTTTCAAGCTTAAATGGGAAGAAACAATC TATGAATATTAAAGAAAATATGTATT CTTTGTACA CTATTAAAGCAGG CT TAAAG GACAGATGAAC CT C C (N) xG ACTGACGAAC CT CCGCAGTGAT TCTGGTCACTCCAGGAGCATGTTGTC CATTTG CCAGTG GAGAGGGC CTTGC CA GGGTGGAGCAACCATATTGGAGTCACGCTGGAACCCCACAGCCCCTAACACGGCCCCTAACAGGAGGCTTCTGAA TGCCACAGTCCTCCCTCTCAGTGCAGTGTAGAGGCCACCAAATTCAGATCCTGGGGGTGTCGGGGAAGGCAGTCT TC (N ; xGAGAGTTACCAATGCACACCACGGGACTGGCTCATGCAAGATTTTCCATCTCTTCGGAATGCACAGAGT GTGTCTTGTTGGAGAGAGAATCGGAAGAGACTAAAAAAGGTTTCCCCAAACTGAATGTGCTCTCACATGACTGGA GCTCCGTGTCTT TGCGTGTGGGAC CGTTTCAGTTAGAGGAGGTTTATG CCTTTG CGGTGATGTT TTCCCACTGTA ATACAACGTCCCAGTGTCTACGACACATGCTGCCTTTCTGAGGTCCAC{ K ) xCACATTTGTGTCATCTTGGGACC CTCAGGGTTGCCTGTTCCTGTTCACGGTGCCTGAATTGATGGCTGATGGGAAGTCCCACTTCTCCTTCTGGACCT A CACAACAGACCTTG GCCTACGTCAGTCTTATCAAC CATTGCAAATAAAACAAG CCTCCATC CC CTAT TTACATA GTGGAC CAGGATGCTTGGT C TATGTCAATCTTAT CAAC CAATGCAAAC CAAACAAGCCTG C (K ) xTCTTGGTACT TTGGTTAGCAATGGGGATTGTAGCTATCTTGTACTTTGGTTAGCAATGGGGATTGTAGCTAAAGTGCCACTTTCA T CACTACGACGC TGATTGTCTT CAGTTGCCCAAGGATTAAAATGTAA C CATGAT GAATTT CGAGTCAGGAAACAC GTCACAGTGGGATTCTATTTCCCTGGATTCCCTTAAAGACACGGACACATACTCACTCAGGGCACTGATGAGGCT ACTAAAAGGAATAAAAGACGGTAT CACTATG GGG C C CTTGAGGAAACAGTTCTG CGAATC CT CCACGC CTCATT C TGTCACGGGAGAGAGTCTCTGCACTTCTATTTTCAGCAGGCCAACATTTCTCTGTGTTGCTTGTGATGATCTTAT GAGGGG C CTGGAGAAATATGTACACATAGATTTTAG CATGAGTCTTGCAG CACTGCGCTTAG CAGAGT GTCTTCT CTTACAGAGCAT CC CTGCGCTTGGAGTTGGG G CGTGGAGGACTGTC CACT TCAC CTGACC CCG GAATGTGGC CCA GAG CTACT GACAAG GG C CGT CCTTTGTGAAACAGAG GACTATAT TGTG CT CATCT C CAGCTAC(N ) xCCCCAACC T CA C CAAG CATAGCT CAGGGACCTCTGTGTGTTTTCTCT CTGTATGTTTT TTGTGCATAT GTGTGAGT GT GTTT T CTGTGTGTGTGTTTTC CT CT CT CT CT CT CT CT CT C (N ) xGTGTGTGTGTGTGTGTGAGTGTCTC CAA C CT CACCA AG CATAGCTCAG GAAC CT CAGTTGTC CCTTGAAAACAC CGTG CTGC CAGGTACTAGACACAAATGTGACTTTAAA CGTGAC CTTTACG CTCAAGCAT CTTGTAGT CTAGATGAG GAAGAGAGAGGAATG GAACAC CACATC CT CTTCAGA GC CAGGTG CTATGGGAAAGTTC CTGGCTGTCTTTGCAGACCACACAC CAAGACAGCTATTGTGATCAC CACTTTA TAGT GACCAGAATGTGTT CAGCAAGG CAGCCTGCCCACAG CCTGGAGAACTTA CAATTTTTTCCAGGCGTGATCA AAAG CAAC CAAACAAACAAATGTTTT CACT CCATAAACTGATATGTGC CAGGGAGACTTGTC CT GCTGTTTATGG AATGTGGGGG CACTTAGAGTGGGGGT GAAGGCTTGG CT CAGAG GAA CCAG CAGC TGTGTTAT CAGAGACAAAGGG AAGT GACAACACAGTAAT TTA CAATGAT CC CTG C CCAAATTATCTCAG CACT CCGACTAAGA GCAGTGTTCCTGG GCAAGGAGAACACACAATTT CT CA CCTG CTGAAGAATG TGGAAG CCACAGGCAGTG TGGCTGGAAGTT CC CATC G TCCCTCCATGAGTGACTT CATAAAGCTT CCAT CAGG CACCTGGGCCTGGCTGCCCTCCTGCT CGGATTATTCTAA ATATAAACAAACGA
> H s 5 _ 37802351 - 3781202 7
GTTT CAGACTTCATTCATATTTTTGTGT CTTT CTGAACACAAATATTTAT CTAGTACTTATT CTTC CATAAC CTA TT TCTT C CGAGATG CTGC CTTTATTT CCCCATATTT CATGTC CCACAG CCAATC CCTTGCCTTTTC TACTTC CAC AG CAAGT C TGAAATTC CAGAATT CTAGACCAGCTCAGAGCCCAGAGTCCTTCTTACTGACTG GGAAAGAAGTTTG GACAACTTTCAGAACATGCCATGATCAGTCATTCACATTTCTGTATCTTTGCTCAGTTGGGGTCCCCTATGGAGT AGGGAACTAG CTGAAG TTCTAGCTGTAGCCACATCACTTGGC CTGAACTT CTTCCCAGAGACAT CAAAGTTACCA AGGCCACACTACTCCCTTCTTCCTCTAAATTCCAAAAACATTTGTCTGCACCTATGATCCAGGCACTAAATCCTA GGAACCTTTGATTTGTGGTGGCAATACAATGGTTCATAACTAACTAAGGACCTACATAATGGTTTTATGTTTTAA GTTTTAATTTTGAA( N ) xGAGAGAGAGAGATTTCACTCATTTTTCCACTAATATT(N)xCTGTGAGAAATACATT TT TGTTGTTTATAAGC CACCTAGTGTAT TTTG CTAGAGTAGC CCAAGGAT TAATGCAT GT TATT TTCCTCCTCTG GGATCCA(N)xACATATAACAGGTGTATGTTAGTAAATGTTTATTTATGTGAAAAAAGTAAACCCTCATTAAGCC ACTG CAGTAGGATTTC CTATTAATTGAG CCTT CTTTGAACTGTC TTTATAATGCAGTC CATTGTAACAGGATTTT AATCTAAATATAATGG CAAAATTAAATAAATAGAACACTG CTT(N)xATGCTTCTTTTCTTTGTAAAATGAAACT GAAATG CCAATATT CATATTTCT C CCAATTAT GC CTAAGCATGCTACC TACTCCCTACTGCTCTGTTTGAACGTT ATGTGAACTTGCTGTTATGGCCAAATAAGTCAGCCAAGATGGGAACATTAATTTCTTTGAGCAGATGGTTGTAAA GGTTGACCTGTAAAAAGAGGTG CCAATTTGTTTTGCTGTAGTGTGCTGTCACAGAAAAAGGACT CTGTTAG(N ) x CTGTAAGC CAAAGT CCTGACAAATACAGATGCTTTCTGAG CATCTGCTAC CTAATT CTTC CATTAGAGAT TCATT CCTTTGGC TCTAGAATAG CATTTT CCACATTTTGTC CAGCAGAACACTTC CGTTAGATGC CAACAG GCGCTCCTT GGTCAAATGAGGAGGCAGGTGCTCAGGGCTCTTCA(N)xATCATAAGATCGGTGCATAACTTATTTACATATCTA CAAAAAAAAT CTAAAAGAAAAAGTTCTCATGT CAT CAGCCTGACCCACA CATAAAG CACTTTG G CTATAGAATCA CAGT TAAGTGAGTGATAAATAC CG CCACGGGAGTATTTGGAG CATTGAAGATGTGAGGGACAGTATTT CCAG CAG AGAAAATGGCTGGGGCAAGGGTACAACACTG AGAAATTGC CACAAG CT CAA CATGTATTTTGAAGTTCTGTAGGA CATC CACAGTGACTGAAGGGTT GAGTTT TGGT GTAAAAGGAAGATTTCG GGGACAAAAAC CAAACAAGAACTAGA GAAAAGTTTTGT TTGGAACAGATTGTGGAG GTTCTGATTGGCAGTCTGAGTGATTTGATTGG CAGCAAGAT CAGG TAACCCTGTGAGGAGGCCAATGCCGTGGTCTGGTCAAACCTTACTGCAGGACTGGGAAGGACTGGGTGGAAAAAG AAGGCTGGAGGTGGTTGGAATCCCAGCCTCCGAGAACATTCTGCAGAGAAACTTTGGCCATATTTGCTCAAGAAT CTTCCTCGTGGTAT CATTTGAAAG CCATACTT TTTATTT CATGCTAAACATT GT CAGAGG GAGCGGTGGCAGATT CATC CCG GAATGTGAG CCAACG CACCGGG CATGCACTTTAAATGG CATTC CAGAC CTCGGG CTGGCAGGCTC CCT CAGATC CTCGCCCTGCACTGGTTCCTGCCTTACC CTGAGTTAGGATGT CAGGAGAGAGGGTCTGTTTT CTTACCT TC CC AG CAGAGAAC AT CCAG AATG AGAC AC CAAC CCTC CC TATT TC AC CT CGTAGAGATATC ACTG CACGG G CGT CAGC CTCGGTGCTGTT CTGATGATGCAT TTTCTCTTCTTCTCTCCTGTCCTCCTTATGCCTGATCTTTGGGTA(N ) xATCATTTTGGGCCTCTCTGCTTGCTTTTTTCCCCATCTCTACGGGACTGCTCTCTTCATGGCCCATAATGCAC CTCTTCACAATAGAAGCT( N ) xñCTGTGCCATGGCAGGGTCAGCCAGGTGGAGTGAGGGTAGAGGATGGTGGAGA GTGAGG CTGAGGGAGGAGGGTTGGTCAT CT CAGCTCAAGC TCAGGGGGAAATGAGAACA(N ) xATTAAAAAATGT TTAAAAAAAA GAAATG TGAACATGCCTGTGTTTAATGCTCACTGGC CCAGTCTGATGTGC CATG TGTGAAAGAAA TGATGTGTGAAAGA GAACTTGG CCATTTGTCGAGTCCTTCTGGC CAAGTGTG CCTTTC CACAAATGGTGGGTTGG GAAGAT CCCCCTGG CCAGAAAGAATT CTGATGTGTGTAATTGTT CCGT TACAGATT CATTCCTTTGGCTCTAGAA
T ( N ) xTGTGTGATGTCGTTGCTGCTCCGTGAGAGGGGACTGAAGTATGTGACATTTCTCCCATGTTTAAGATCAA GGAATT CCAG CCTACACAGATG CTTCCTAGGGTTCCACCGGGCAGCTTGGGAAACGCTTGCAGCTTGGGAAACAG CCTTGCAGTC CTACTGAGTCTCCTGTGG GAAG CAAATGTT CCTGGTAGGC CAGT CTGTAG GAGAATGAGATCAGT GCAGACAAGG CCTTGAAAGCATGG GCTCGAGAGGGG CTGCAGAGGCG GTGGGTCTGAAAGGGG GGGGTGGATGTG AAGATC CAAAGTGATC CTTTTC C CAAGGAG CTTG GAGC CTGG CAGAGCAGAG CTCCCTGGTCCCCCACCT CATGG CTGCCTTTCT CCATG GGAGG CCTCTCTCCCCTCT CATT CCTC CCAGTGAAGCTTAG CCGCCCTGCCATCCAGAGC TCTGTCCTCTCCCCATCTCCACCTTCCCAAGCCAAC( N ) xGCAGGATTCCTTGTCCCTCCCGGGAGCTCCATGAC AC C CTT CACTGCACGT CTTCATCGCTGCCCCTCCCCACGTTCTTATGCTC CATTT C TGTC CCTCTAAT CTG GGAG CCCCTTGCAGCATTGTCTCTGTATCTTTACATTCTCCCCTGGCAGGAGATAGCTTCTCAGAAATTAATTAAATGA ATGAGACC CAGACC CGGAGCAG GTGAGCAGGGAGTGGG TTAAAATC CCCTACATGTAC CACTTTAAAGAG TTTC C CAAGTTAAAACAATCGATTCCACATGTTCTGAAACCTCATCCTTCTTCACCATTGCTCAGCTCTATGGTTCTCTG GTTTCCTGGAGGCTGAAGTCTTCTCTCCATCCCTCTCACGGTCTGCCCACCCCGTCCACTTTCCATCTGACGTGC TCTTCCCTCTGTTGCCATGCCCTTGGGCCACTGATTCATATTGTCGCTTCTCAGAAATCATGAATGAGCATTGGA ATAT CAATGG CATGGC CTTAAGTT TATGCTTC CCTAAGTAGAAAGTAAATGCAAATAT CTTGAGAAAACAAT CTA ACTTGAAGAATGGAAAGAAGGGTTGTTTTTCCA(N) xTTTTTTTTTTAAGGACAAAGTTGGCTTTAATAATTCAT GAAAAAGAGAAAACTAGAAGGAAACTGGAAGCAGAGGACACAGAAAATTTGCTGTTAG CAGTGCA C C (N )xA G C C CACCACCCTGCCCTTTTGACACTTTGGGTCGCACTGTTCCTCCTGCTTGGAGTGTCTGTCGTCCATCCCAGCCTG TGTT GACCTGGC CCACGTGACAGATT CCTACT CACTTG CAAGGT CCAG CTCCAATGTAACTT T C A (N ) xATCTGT AGCTGTTTCTGATTTGTTTTCCTTCCGGCTCTTTATCCCACCTGCGGCTCTCTCTGAGGCATTTTAGCGCTCTCT CCACACCTCTCCCTGCACGTCTCGCTTTCGGGGTATCACGGACTCCTCCCTGCAGCTGCAGTCCCATCTTCATCC CACT CAGCTG CCGTGG CCAGGGACGTGACA GTGTATTC CTTGTCTTAGATGAGTAAATATGA GTAAAT GTGTGCT TG CC CTTCGTTT CCCCAACTGTCTGTTTTCAGGGATGACTTGGCAG CA TTTGGT TTTCATTTCATT CAGGATGTT GTGAGGCTGAGACTAAAGATGAGTTAACTATTGCCTGGGATTCCAG CAGCGCTT CCGTTAA CAAAC GTGTGTCCC AAGAAGACCCTTGGGGCTTTTTCATTACCTGATGGAAAAAAGCCATAGACCCTCGGCTAAAACCCACGCCCCCTC CTTGGGTGGACTATGAGTGGAGGCAACAGGATTCCCCCTCCCAAGGGATTCTGGTGTCAGCTAGTACATCAGCCG TGGCCAACGGAGGATCTTGTGTTCGAAGGGAAGGTCTTTGTTTTTCAGACGTAGGTTCAGGAAATTCTGATTTTC CACCTCTATCTCGGGAATGGATTGATTCTTTTAAAAGTTCACTTTAAC CTA CTT CACT CCACACTCACGGTC CC C CAGAGTGCCT CC TTAACACAGCAGAAAGTTGG CCTGAGATGATC TCAT GGGCACAAGC CACATAAGGCGGTCGGG ACCAGTATCCAAGCTCCTCACCCAGTGCTCTTCCACCCTTCTCTGCAGCGCGGATTTCAGAGCTGGAGGCCAACA GCATGGCCTTCCCTCTGGGGATCGCAGTGAGGGGTGGCGCGCACTCAGGGCTGCAGACCAGCAAAGCACCAGCCC AAGAAGGACGGGGGAGCAAGGTGCAGGCGCCCCCAGGCTGCTGGGGACTTCCCAGCTCTCAGCACATTCTCCTTT TCTT GGTTAAGGAAGT TT CTAAAGTTGACCACTGGTTAAT CGTTTTGT TATT CATGCTGAGCAAGAAACCAGGGT C CAAATACC(N) xTGTGTGTGTGTGTGTGTGTTGTGTGTTTGGGGAGGGGTGCCCTCCCAGTAGGGAGCTGCCCA GTAGTTTCTG CCTCTT ATTAATCT CT AGTACAGCTCGTGACTCACC ATGAAACT CACAGG AñCACG CAAG ATGG C CCAAAATAGTTT CCAGGGTGGTCACCAGGTGTGATGAGAAAGAAGACAAAACAATGAACC CATGGATC CATGGT C ACGT CTACAT G G GGAATT CTGCACACGACACC CTCCCCTG GACTGTGAAACAAT CATTGG CC CAATAAAGGC TCT GATAAG( N ) xGAGGAGCATCTCACTGGGTAAGTAATCCTGGACACACTTTGGGAAAATAGCTCTAAAAAGGGAGA GGAGGTAAGCAG GAGTTT TACCAT CTTATGTT CCCGCT C CAAGT CACTGACTTTGGACTG GAATTTTGGCTTGGA GAAAACGCAGCTGGCCTCAG
>Hs5_6521932-6532055
TGGTTCCTGTAC CCTGGCGTTGTC CATCTGTAGGGATCTG TGGGGACAGGAGACAAGTTCGTGGTGAGATGC CT C CCCTACAGCCCACACACC CCAAGGACTCA C CT CCAGATACAGGGGAGT GAGGGAAACGGTT CAGAT GGATGTAGG TGTGAGACATGCTGGGAACAGCAGGGACGT CTGTAAGGAGGGAAAG CAGAGT CCGCAGGG CAGAAACAGAGAGG C TGAGCCAGGCTCAGAGGACTCCCAGCAGAGGCAGTCACGCAGGCCAGCAGGGGTGGCTCTGGAACCCCCGAGCAA GTTGTGGCAGCAAAGAATTCGGGATCTGAGGGGTTTCAGTTGTAGGTCTGAGGTCTGTACTGACAAGGGGCAGGA TGCCAGCTCAGCAGTAGTGGGGAGGTGAAGACCTCACAGGGGATAGGGGATAGGGATGGGACAAAAGAGGGAGGT CT CT CAGATACAAGAC CCATTTACAGAGTC TCG GCTCATC CAAAGAATATGAGG GGAG GGGG CTTTGTGCTCACA GATGATTGGGTTAGGGGAAAGATGATTAATCACAGGCTGTGGACGCATTTTCGAGGCAGGAGCCAAGCCTTATGG CAAGATAACGAAGGGGACGGCAAGGGCTGCTCTGTGGGGGTGCACGGGCTGGGCCCGCAGCACACCACCTGTTCA CTAGGTCTGGGCAGCAATTAGACCTGCACCCAGCAGGGCCCACAAGGTCAGAACAGGGACTGTGAAGATCAAACG GAAGCGCAGGAGGGTGTTCGAGCCTTCGGGCAGGGCTTGTGCTTCCCCCAAAATCAGTGCAGGTTTGCCGGTGGC TTTCAACCAGAAAGCACTAAGGAACGC(N) xCAGGCTGAGGCAGGGATGGGAGGTCGGGGTGAACTTGGGAGTGT TTGAAACTGAAGGAAATAAAGTATGGCCAGGGAACCGCCTTCCAGCATTCCAGCGGGTCCCTCTCAGGGCAGCAC TTCCTGGCCCTCAGGCTGTGGGTGGCCCCAAGTTAGACGTTTGGCTCAGCCTGCTCTAGACAaCGCTGGTGGCTG CACO CACAAACCTGTGGG CAGGACTCTGAC CCTCACGCTGGCCCGGGT GGGGGCGCCT CC CGTGCC CTTGGC TCA GCTGCTGCCCCATGGAAGCTGCTCCTGACACCCTTGCTACATGACCCTGTGGATTTGCCCTCGTCCCCTTCGCAG AAG GG GAACCAGAGAAAG CATTT C CA CCAGGCTTTTGAAGTGAGATTC GGGTG GG GGGGT TTGAGT TTTCTTTTG CTTTTATTAGTGTCTGGTGCTTTTTTTACC CT CTTTTCT C CATT TCAAATTGTATTTTTATT CTAATAAAATAT( N)xGTGATGGTTTTTCTCTCCTGAGCACATGAGGATGCCTGAGCCTGGCTGGTCCTCCTGCCCTTCTCAGGGAGA CGGATGGCGTGTTCCATCTTTGCT CT CTGCTACATTGTG CATTTGGGCGCAGAGACAAATTCAGAAAG GCTT CTA ATTATGCAAGGAATCGAGGTCTCATCTCCTCAGCACACTTGTTTACAGCACTGAGCCCTCCTCTCCCTCTCCCTC CG CCACACACTGGGGCTTTGTCCC CT CTGGAACAGACACTGGTCAACAGCCTGGTGTCAT CC CACAGACCTC CTG TGTGTGGACCAAAGGT CC TCAACATTTAGGGATGGTCC CC TCACAGGGGGTCGG CCCTCTGTGTCT GAACGTGG C CAAG CCACGACC CCTGATGGAAAGTTACAA(N)xCATGGCCTCGATACCCCAGACAGAGTGAAGACTATGAGGCC CCAAGTCCTCAAGAGGGAGAGGGGCTGCTGCCTGCAAGGGAGAAGGATAGAGCCAAGGAAGCAAGCCCAGGGGCC CCTGGGAGACAC TGGG CGAGATTC CAGGGGGCAGGGTCTATTCC CAG AAACGAGATGGAT TGGAACGG CCACAGG AGAAAGGCAATGGGATCGGTCACCTTTGCAGCATTTGTTGTTAATGGACAGAGGTGCTGCCCCTCCTGGCTCAGC CTTGTCTGCAAGACCAAGTGGGTGCAGGCGCTGGCCTGGTTGCTGGGAGGCACGGGGCAGGAGAAGCTGTGCCCC TCTCTCCACAGGGTTGGAGCTñGAGCTAATGAAGTCTAGCTCTGTGGCCTGCCATGCTTGGCCAGGTGTCCCTGC AGAGAAGGGAAGCTCTACTCAGTCCAGACAAGGCCACAGTGTCAACAAGGGGACGATGTGGATAAATCAGCACAA TCTTGTGACTATGGCTATTGGTGGGAATAACTAACAAACTAACAATGATGATGACTGTTAACGGTCAAGTGGAGG CAGATGTAAATTTTAGTCTAACAAACAAGAAAGTAAGACCAAGTCATTTCTTTATTTTCTT(HJ xCGATGGGAAC TCCAGAAGCCCTGCCTGGGGACTGGTCCTTCTGAGGGCTCAAAGTTCTTCCCTCTGCAGCCAGGCCATCCCCCCA GGGATC CC CAAGTCCC CT CCA CACCTGGAACCTACCTCTGAACATCCAGCAAACTTCCA C CCTTGGTTTCTAGT T TTCT CAAATACAGGGAG GGAAGTTTCTTACAAATGG CTT C CG CTTTTGAC CATGTAATATAACT CACACT CACCT CACC CGAAGAAACAAGGAACATAAAAA CCGGGTCACCC CATGAAATAG CGGGTGGGGGGAAAC CAGAGAGAAGAC ACTAAATCACGTGCCTGAATAAGCGCATCTCTCACGAGCACCCCACAGCAGCTGGCCAGGGGCAGGTACTGTCGG AGCCTCACCACGCCCAGC{ N ) xAGCAGGGAATGTGACATCAGAAACCCCATCCAAGCAGCAGAGGTCCAGGGAGA AAGG CTGC CC C C T (N) xTCC CGAAGAAGAAGGGATT CTGT TGAAGACA GCAACATGGACACC CTGACT CAGT TT C CAGATCACTGTGAGACT CAGACCCAAGACAA CAACAC CAACT CCTCCCTAATTTCCAG C C T (N ) XATTCTGTTTC TTTGAAGAACTCCAACTATAATACATCCCCATCTCTATGTGGACAGTGTGGC( N ) xTCCCCATCTCTATGTGGAC AGTGTGGCGA GATTTC CCAG GGAACT CCAGACTTCT GGTCflGAGAATT CAGGACCCCATCTGGCACCAGCTACAC CATCAG CCTCCGCCCCTTGC CACCTCTACTGC CACTTG CTTT CCCTGA TAGCATTTATTTACTTGGGGTT CA CGT GAAC CTTGAT CTCAAAACAT GCCTCTTAGCAGAGAAGTTC CAATGATG CACGAGTATGTGTATGTTGATT TGAGT CCTATTTATTTTTCTTTCAAAATATT GAAAAGGTTTTT CT CT CTAAAT GT CTGTCTACTGA CAAGTTTTGAT CTG TCTTCATTGTTCTGTAATTGCCGCTGAGATGTGGTATTCAGTGTGATGAAAATCCCCATTGCATTCCTCTAATTG AAGGTTTTAAGGAAAGAAACATGTTCAAGACAACGG GCAT CCATACTCAAGATTAGGCGT CACCACCCAGG C CAG CGAGCCAGGTAGGGACAGTGTGCGGACATGGGTATGTCGGCTACAAACAAGCGTCCCCAATCAGTTGGGTCACTC TGTGCAGAAAGGCAGCCAAGCCATTTACCTGGACCCAAATAGTCAAAGAGAGGTTTTACATCACAC(N) xGAAGA AAATAAAATAAAGTAGAGGCCATACCCTTCTGCTC(N) xCAACCCACCGAGTTAGCTGCAGGTAGTCATTACGCT AAGATAGAAAA CTAAAAAA CTAGAAAAAATTGTTTTTTAC TT TACCAACAA CAAAAGGA CATAG CTGAAAT C TTA GGGAGATGTACTTTCCCCAAAAGCTTCATTTTGACAAGAATCATGCATTCATGAGACCTCAGAACACTCTGGATC CCGñTC CACTATTAGGATTT GCATTTTAGGGCAGATTTGTTTTTATAGACTT GAACATGTTGGATCTTTC GGGC C TTTT GAATTTTATAAT CTTCTCTTATGC GGCGAGAGGAGT CAGTGTAGACAGGGGGTT AAGT AAGAAG GAAT CTG GAATAGACTTGTGGTTAATCAGGTCACCAGGGGCAGAAAGGGCCAGTCAGGGCCAGGTGGCTGAGCCAGACAAGA TCAGGT TC CTTGGACC CTTCAGGAAAGGACTCTCGG CC CAGT CACCTTGAAC TGTGCT GGGTGAGCCATGGAAGG CGGGTTTGTTTATTACAG GAGCTCTGAGAC CCTCTCAT TAGG GACTGT GTA CTAAGAATC CCCTGGCGAG CAGGA ATGCTGGAGTAAATAT TGTGGGAGG CAT TTGC CAAACT CAAG CAACAACAGTGAGCGT CAGAGAAGGAGC CAG CA TTTACCCTTGGGGAGCAGGTGCCAGCCAGCAGCTGTGGGTCTCCCTGCAGGCAGGATGGCTGAGCCAGCTGTGGT CTG G CC CT GATGCTCACTG GGTAGGGAAGGGATAGCAG CAGT CTCTGAATACGCAGAA CCTCCGGGGCCTTCATG GAAATTGT CTGGGTGC TTTGAGTGGGAACCAT GTCG CACC TTTTCTTGAAAGGTAAATTGAGTCAGGC CAT C TGA ATAAGTTTTC CCGCAAAAGT CAGCCC CTCTTTCTAGGGCT CCAGAGTC CCACAGACCT CCTTTTG CGTTCAGCAC CTTC CAAGTTGGTATCA CAAGCACATGAGC CACAAC CAAGCTAAGCACAAG C CCCAGA G CAC CGAGAAGATGCCT GCCTAGAA CACGGATG CAGTGAGAGTGC CTGCTAATGGGCTCAAGGTC CC GG CATGAACAGC CAGAAT CAGAGGG GCTATAAAGGAGTCGGCTGGGCATCCCCCACCCTGGGAAAGAGGGCCTCGGCATCCTTCCCTCCTCAGACAGCCC AGCGAGTCTCACCCTCAGCTTCCCGGGCCAGGGACCTGCTGCAGGCCATCACTGGCTCACCGAACACGCAGAGGA AGCACCGAGCACCCACATCTGTGCCCACCTCTAGGCTCTGAGCGCCCACCCCAAACACCACCCTTTATCTCCTCA CTCCCCCTTCCTTCCCATGGACCATGGTCCCCATCCTCATGGCAATCTCTGCCCTGATCTTCACCCCTTCCAGGT ACCCTCTCCC CAGGCC CCTT CAGGCGACACTCACTT CC CACC CACCTGGACT CAGGAACTGT CAGCTCTG CT CCT GGCAGCACCTCTAAGCCTTTGCCTCCTCCCTGCTGTGCCAGTGCCCATCACTTCCCTAAAACTTCCTTCCCATTT GGGATTTTTG CTAGAAAAATAAGTGTGGACAC CTGGCCAG GGGCTGCCTGAT GAACAGTAGCAG CTGGTGTTGCT GGCTGGCACCCACCCCGTCTCTCTGTCTGGGGAGGCTGCTCTGCTCTCTGTGTTCTCTCCCCTGTGCCTCCCACC ACACCACCCACTCCCCTGCTCCTCCTGGTTCCCTTCTTGCCTGCTAGTTGTAACTGATTACCATCTTACGTTGCA ACTTATACTCCTGGGTGATCTCACTCATTGTATGTCTCTACCTATTACTTCAGTGTAATCTCTACCCCAGGCCTA TTCCTTAATT CCCAGGTC CC CATTTC CAATAG CCTGAGTGTT CTCTGACC CAAACCAAGCT CAGTACATT CC CTG ATGTACTTCTCTCCCCACCTC(N)xCATAAAACTTGGGTGGCATCTGAACCTCTCTCTTTTCCTCACACCTCCTG CACCAACTACTCATCAATACTTGCATCTTGTGCCACCAGCATGGATCAACCAATAGTCTTAATCTCCCTTCCTTA AGTGTGTTAG GATTTG GT TCAGCTAC CTATAACAAAAAAC CT CCAAAC CACAGAAATTAAAACATAAAAATACAT TATGAT CT CACATAAAGT CACCTCAGATGGTTAGGCTC TGGAAGAAGT CCTGACTTAGTCTCTG GCATTAAAAG T CGG GAG CCTC CACATCAGAGGTTTATTTGGGAGACA(N)xCCACAGCATTTTCCCCAACTGAGCACCAAGATGAA G T A (N) xTGGGTATTTTTTTCTTTTTTTCCTTTTTTTCCTTACATTAAATACATCAGTAAATCAGCAGTCACATT CTGTTACCACGATTGTTATGTTAGGA GAGGAAGAAATGTAAATGCTTT CT TAGAAACAAAA CAAAAAGAAAAAAA CCAGAACACTGTGCCC CTG C CTCCACTG CCTTAAAAAAAC TGG CCAGAAGAGAAAAATGAGCTGGGAAATTTAAA AAAACT AA C ATCCCCAAG CC C AGC AGAG CT CAAAGG AAGG C AGAAG AT CTGGTAGG AAGT AG AGGGAAGG TAGC C GATTGC CCAGAAGCTAAG GCTAGATC CTGTTCCTATAG GGGTGGGATGGG GGAGAAGC CTGGATAAGTGGTTTC C GAGC CC CAAGACAATGAAACACCTTCACT C CTG CTC CATGGGGCAAAG CTTGACAGAGAAAGAACTTCATTT CTA AGAGACTT TAGAAAGC CAT C CCAGTT CTTC CACTTGTAAAAAGGATGCA CTAAGTGCT CTGGG GGGGC TAGAGG C TTCT CGGAAAGGCAG GGAGAAATGTGTGTGTTT CAG G G CCATGGGAACTAAG GCTGCCACTGGAAAAG GAAG GGA ACTGAGGC CTAAGTGGAAGAACCTAAGC TGGG CCCTTCAGAATCTCAG CTTCATAGATTT CT CAGCTGTTATGGA ATAGAAGG CAGTCTGCAAGC CAAAAG GG CAGAAAGGTT CTAATCCTAGGT CACCGTGAC(N ) xTCTCTCTCTCTC TCTCTCT CTCTCTCT CTTT CTTTACCAAAACT CTGGGCTGGAA CAAGGAACTTAAAGTTCTCTTTTGC CTTGATG GCCT C C CACAGACATCTTAAAACACGTGATAAGGGACATGAAGAAAG G CTAGAGATTTAGAAAGGACCAGTGTC C TCTGATAATGAGCCAGTTGTAAGGTACCTTCAGTGCCAGTCATTCCATGTCCACAGAAACAGGTGGTGTGATATG AAGGTG CC CCAAGGAGGC CT CTGTGGTACTTG CCCCTG GTGGTACATGGATTAAACCGTTGGTTGCAT GG TGGGG AGGGGTAGGGATTAAGGATGGAGGGACTAAATTCAAGATATTAACAAAGGAGGAAAGAAGCAGGGCCTGATAGGA GAGAGAAGACAG
>Hs9—66744276-66749359
TTTGTTTTTACCAAAATTCAG G GATT CC CC CCTAflT CCCAATGAGAATflATGTCTTAAATAAATGGAATTAAGAA GCTTCAAAA(N ) xCAAAACTCTTTAAAGTACTCAACCTATCATAAGACTAGTAGGTGTTTCTCAACTCTCCTTCT CAAAACATGAGATCATGAGGTC TTAATG CTGAAGAT TATGGATTATTTTATGTAGAGAACATGGAACAAAGCTGG ACTGTG CT CTGGCT CAATCAGCT CCACTTC CACACCTCGCAG CAATTCT CAATTAT CACCAC CAGAGAGAAGAAG GCAGTCCCTACCTGACCATCGCCTGGCCTGGCTACCTTATCTGTCTAAACAGTGCATTGCAGGAGGCCCTTTCCT TT CTGTGC TGTTTTATTTTTTCTCCC CCAG CACTAGTTTT CTAATTAGAATAATTGAGGAGAAATACAAGATTTT CACAGAATGAAAGCAAAAGT CTTCCAAGTATGAGAAAAATAGAACAAAGG CTGT TTATAT CCATATTGAT TATAG TGGGAAGG TATTATTGTAATAAATGTGATGGTTATTAAAG CAAGTCTTGATTTTAAGGAAATGT TTTGTCTTGGG ACTGTGACAGGAGATTATGTGATGTTCATGAGATGATCATACTGTCTCTGTCCAGGTCTCTGTAACATGTAGAAC ACAACCAACGTAGAGT TTCAGT CCCATT TGATTTACAGAAACAG CTTGTT CCTCAGTAGTATTC TATATATAAAA AGTAAACACACACACTATAG TT CATAGAGAATACAT TCACATGGATTACTAGGAAAACTAAAGTAGC C TTTCAAA ATAATTAAAAAAG(N)xTAATAAAAATGAATTCTTCTCAAAAAACTACCTTTCTATTAGATATTTTTTATCTCCA AACATAAAAGGTAAATTTTGAAATAACTAATAGTTAATAAGAAAAACATGTAAATAGTATGTAACTAGAATTGTT AATTTCTTGAAACTTAAGGTTT GTTTTTATGTTATGTTAG CAGGATATGTAAAT CTAATGACATTCAAAAAACAT AAAGTG AG AAAAGAAG AGGCTGGTAT TG AACATGTATATG TTTACT AT AT T CTAAT AATGGT CG AGTT GTTTAAT TTCTGCATGAATGTCTAGGCTTCACTATTAGTCTTCAGCTAAAATTCTGCCCATTTTGGTGAAGGAACTGTGTGT GG CACATAAGGAGAGCTT AATG TTAGTATTTACC AT AGCCTT A CTAAT ATAATC CT ATAATATTGTTCTC CATGT TC C CAAAATTAAAAAG TAGT CTAATT CTA CATACGAACTCAAAATAAATGTATACTGTTATGTGGCATAACTCAA AATATTTAGAATTAAACACAGCTCTTATTTTCTCACACATACTGAGTATGGAATTACTATTATTTTTGTTTCCTT GATCAGAGGTAGATGAAACTTGATAATATAGTCACTGACTGAAGGGCATTGTTTTGCCAAAGTTCGTTAAGTTAA AAAATATTTTGG AT CAAT AAATGTCTTT CATT AAAATATT CACCTACATAAAAT AAGTAT TC AG AATTGC ATAG G AC ATATGAAT GTCATCTTTCTTTTGCTGTT CATT TACCCAAC AT TTTT CT TCTTTTTATATGTG CC AGGTTGAAA ATTGTCATCC CATA CT CTTTTCAGCTAAGAATAT TTCTGTGTTG GGAATAT CTTGAATTT CC CT CAGGGGTTGAT GATATTG C CCACAAAGA CTTTAG GCAAT TTTGATGG CCAGAATGAAAACAGGCAG CTAAGATATTTATTTTTAAA AGAAAAGAATAGAAAACTTTGTAATCTAGTCATACATTGTAGGAAG < N) xTGCCCCACTCTGGAAGTAATTATTA ATAATGTTTCTCCTTC CAGTTCTTCA GTTTGTTAAG CGCAACTGTTGAAAACATTTATTTGT CC CAGGAGATTAT ACGAAAAAATTAAGTGAATCTACTTAAACATAAAATTCCCAGTTTTAGAATAGA( N ) xTGGCCTCCCAAAGTACT GGGATTACAGTCGTGAGCCACCGTGC CTGGCCAGATTCAGATTTTTAT TAGGGATG TCAT TTAG CTAATTTAATG TATTTCTATG CCAT AGTTCATATTTT CTGTGCTGTAAGTATTTT CT AT ATGTCT GATTTATGTT GAAGAATAGT A TCTCTTTAGACGCAGGGGCATACTGTGAGCATCTACACCATGGCAAATGGCTTTGCATTTTGCACTCTATACCAT TG C CAAAAAAAGGTTTGATTATGCTACTAAAAAT C CGACATGAATGAGTGAACACCGAAAGC CAATAATTTTAGA AT C CCAAAAT AGTGGAATGT TT AGAACTGTGATATT TTCAGTTT ATTT CCT CCATC AATTTT CATT AG AAAGCG A GTAAGGAAAATAGTAGAATTTAGGAGTTAAAAAATCAAATTAGCGGTCA(N)xCAACAACAACAACAAATTAACA CCTTGCTTATTATAATAAAGAACGAAAGTAATCCAATACCTGTAACCTAGAGAATAAGGTTCACATCACTTAAAA TGGCATTATATATAAAAGAAAAGATTGGTTCTACTACATACATCAGGCTTATTAAGTCACACTAGACAACTGAGG TTGCCTAAGACGTCATTGTCCTGGACTTGCAAGGGCCTCCAACTCCAGTTAAGGTGATAAGAAAAAATGAAAATG AG CAAC CTAG CAATTTT CACAGGCCT CAGGAAACTGGGGC CACATCGAAC C CAGT CA CAGGTTATAC CAATTTGA TTTACAACAATGAAAGAACCAT CTGCTTATACTTGAGTTACAGCACAT CCCTTTGTAGACATC CATACAATAGAA AGCCCCAGAAGTTGATTCACAGTAGAACAAAGTGTTTGTGTCAAGCAAAGGAAATTCTGCACTTAGAAATTTTAT CGGCACATTT GATTAAAAGATGGTGTAGGTAGGCATAAGTAAGAAAAC CATAAG CAGACATAGT CAACATACTAG GTAAGTAAAACCAAAT CCAGAAG GTTGTATTTAGTAATATT CAT TTCAGATTTC CTTGGATAATTGTGGTTTGC C TTA CAT CTTTTGACTATGACATAGAGATTTTATT CACTTTTTAAAATATCTCCTAGAATGTAGCTTGAGT CAGT C AAACAGGAAAACATATTGCAGCATTCTCTCTTTTACCTCCCCCAAAGCACTGTAATATGGCTTTTGGAAATATTT CCTCATTGTCCTTTAGTTTTTGGAATGATGTCTGAGATACTGCTGTATGTAAATACAATGATATTCCATATTTCC TGTAAGTTTG CCTATT CAGAAAG CTG CATTTACATACATG CT GAAACACAATCG CTGGTGATGTATCAAC CAGAA ATTTTATGTGTGAG CCTAAAAGGAAGTGTTGAGT CTTTCT CTACTTAAATAATTAGAAATTAAAAGTACCTCTT C TG CATT CTACAGTTTATGTAATATTAAAAGAAGGAATTTTGG CAAAGTAA CTGAG GTTAT CAGTTCAGTT CAGTT TATAAATTTAGTGACA T CAACTTGTC CCAG CAAA
>Hs9_123940664-123951579
TGGGCCTGTGTTGAATGAGGATAAATTATC TTTT CT CTAAAATG CCACATGAAC CCTCTCTATATTC C CACATGA AG AGGAATGG AAGGTAATT ATTTGGT CTTTTCTTCTGTTTAGGG GAAT GAA CTG AACCAC TCAT TTTTTT AAAAT CACACTTAAAAGACACATGGGCAAAAAAGTTCCCCAAAACTACTGTCTTACCGAATTTGAGAAGGGAGGTAATGT AT GAAG CTTAACAG CTGGCTTCAAAAGACACCTT TC CAAAGAAATTGTA C TACCT CTATTAACGTGTAAAC CACC AAC CAAAAAAAAATAATAAGTTACTC CATCAAACA CGTTATTAT CCATAAAAAAGACTTCAACATTGTACTGGAA GATCTATTTAAGCATAAATAGTACTAAGCACCAATTACTAATCTGAAGGCCTCCTCACAGGTCCAAGGGCAATGñ GGAACCT CAAGAGG CAGGTGACTGCACAAG CAGTAAGCTATGGATTAAAAATTAAAAGGATTT CACATTCTTTCC AAAGTGTACTGCCC Q GTGTCTG GCA CACGCAT GTTACAATATGACAATCTGCTCTATTTGTGAG CAC CTGAGTGT ATTACAGGGGATTACACATGCATATGATAGAATCTGGCTCCCGGTACAGTCAAAGGAGAACAGCATCAGCCACCA AATGTG CTGATCTTACAC TGAAAAGGGT TGA CTGAAAACATT TCAAGGGTAATGTAAACACAAGAATAAAGCTGT GGGT CTATTACTTAGTGATGTGGTTTTATATG OTA CATACGTAGAC CTCT CTTTATA CAATGAATATGGACAGTG CTGCAATAAARACTGATGACAC GT CAAATTGT TCAC CTGAAGAAAAA CCCTATTATAGTT CAGAAAATAATG CAA CCAATTTTAATTTAATCTAGATACAGGTACTTTATTTACAAATATTTAGATTAATAGCATTTTGTTACATCAAAT GAAGAGTACAGCAGTCTGAATAAATCCTTCAGTCACAAAAACAATAAAACCCACTGTAACTAACTTTGGGAGTCA GGGTATTGCACCTCAACAAGCCGCAGTCTTTAGTTTGTGATTGCTACCTTATATCCCAATGGGTGGTTTGTTTGT TTTGTTTTTG TAAATATACACACACACCAGCAGGTCATG GTC CCTGGGTGAGTC CCTT GTGATG CAA CAGTGTAA GCAAAATGGATCACTGTAAGTCTTTAACAGAATATACCAATCTACTCCAGAAATCTTATTTTTTAAAAAGTTAAA CAAAGACAAAAATAAAAATGAATCCACAAATTAACCAAAGCCTACTTTCTGCACATTCCAGTTTTGGCTTTTATT TAACAT TGACTATACAATACT CTGGTACTACCACATGTTTACAACC CAGAAAGATGTACT TTTATGTTAGTGTCT GTAAAGAGGGATTTAAAATGTGTATTTTAAACACAGCAGTTGAGCTGAGTGCATTTTCTATAGTACGCTGAGGTG TTAC CTATTCTATTTCAAATAAATTCTCAATT CCCAGCCACT GAAT CATAAATG CAATAAAAAAAATCAACAGAA ATGAAGAACTTAATAAAACATGTTGTCCAAAAAAATAAGATTGTTTCTCTTGCTATACAGTATTAATTCAGTGGC CAAACCAC CTGGTG CAAAGTAATAACTTACTTTGTATCAG CACAAG CGCTGAAACACCTTTAGAAACACTTT CCC TTTTACAAAACAATTTATGCCAACATGACATAAAACAGCCCCTCACACTGTGAACACAGGGATATCTTAAGTTAT TTCACTGTAGGGTTAAAAATGCACAATTTAAAATCCCTTAACAGCAGACCTGTGGTTCTGACTGTCCAGTTCAGA ATCTGACCATTCCAAGAAGATAAAGGTATAAAAGCTTAAAATGTGCAATAGTAAACCCAGCCTTTTTTCTTTTTC TATACAGTAAGGCAAGAATGCAGCCTGTAATGCAAAAACGTTTACAAAAAGAGAAAAGCAGGTTAGCACATTGTC GATTGCACAAGAACAGTTAAGAAAATCAGCAGGTAAGCAACAGTGCAAAGATGGAACAGAATCTGCTAGTGTTAA TCCTCTGATGCTAGGAGCTCTTTCCAGCATAATGTCCCCAAACACTGCCAGCACCAAGGGGTGGAGCCAGTACTA CTTGTGAACTGCAGTTGTGTCTATTTCTTTGTGTGAAATGGAAGGGAGTAACATGGTCACATATAGGTCATACTG TACAAACTGGTATTTTATACTGTTCCAATGCCAGTAATCAATTTATTTTCTTCATTAAAATAATATACACAGAAT GTATTGTTAGTTCGATTCCTTCAAATTTTATACATATTTACTTTCTGTTAAAGAGAAAAGGATAAAATGGTATAA AAAAAGATAAAGCTATTAATTAAGCACGAGAGAGAAGATAAATGGATATTTTCCCTGTGTGAGGCTAAGACAGAA GCAAAT CT CG TTAAGAAAAATG CCACCCACA CAACAGGAAATTTAT C CAAAACAAAACAAAAGCAGTTATAGAAC CCCTTCTCTACCATCAGAAGTAATTTCACAGCAATAAACTTATTGGTTACAACAGACATACTTGAACAGTTAAGG ATG GGAAGAAAGGC TTAAGATATCACCAAATTAAAC CGTACAGTGAGACAAAGC CTTG CCAAAGGGAGG GTAAAA ATCATGAAGT CCAG CAT CAGTGCTCGGT TTAAATCATATATTGGTGACATACTTATCACGAGGACAAGGGGGAAA AAAAQTCTAGATTTACCATGCflGGAGAGATTTATTACTCTACTTGCCTTTGATAACCTGATTACATCTAGTTGTT TAG CAGTTTA GTATTGTGTTAAAC TGTTTTTACAACAGAGTTTTTT CTTTTTTTTTAATTAAAC CCAGTAAGATG TACAGAAGACAATGAGG CAGTAAAAAGTACTG CTTC CAACAGACAGAGGTGAAAGGTCAAATGAGGGG CCACAGC AAAGAG GT CACTAG CAGC CACAGC CTTCTCTCTGGGGTTGGGGTTCACTGGTTAGCCGGC CTCC CTGCGGGG CTG AAGGTTTGTGTTGTACACCAGACTCAGCAGCATTCAGATCCAAGCTTCCATCCTGAATGTTCTGATAGATTTTCT TGGCAG CC TCAAGGAAGG CATCTT CTACATTCTCTC CCCTAAGAGG CAAT TGATAACTTTATTGGAGAACCACAG TTTTCTACAAAAGACAAGACACTGACCTTTTGCTAATCTTTAGTTAACTGCCATGATGTCTCCAACTTAACCACT GTCATCTAATAAGAGATTACCAGAACACTGAGCTAAGAGAACATGGAAAACTA(N)xATGAGCCACCACACCCGG CCGGAAAACTACTTTCTATGGAAAGCATACATACATACGTGTACTGAAAGCCCTTTCTACAAACCCTATTTGGAC AACTACATGTGTTATATGCAATCATTAAAATACTGTATGTTAATACACACCATGACAATTCAGATAGGTCATTTC TGTGATAACAGAGAG GATACT C CAAACAGAAATGAAATCT CT CTGAACAT GCTGAAAAAAATTC CAAGAATAATA TGGTCTGCTCAGCCAAGCATGAACTGAGCTGTTTTTAATCGGGTAATGGTGACATAATCATGCTTAGCCACGTTA ACACCTCCATTCAGTTTT CTTTGGACTAATAG CACAGCTGAGACTAAACGAAAACAGGTTAGC CTCTC CTCACCA CAAGAAATTTTTGGCCTAATAAATGAAGAACAATGATGGGTGTACCACAGTTCAATAGCAGACATGGAAGAACAA TTAGAAAATGTTCACTATGCTTCTGAGTAGACACTATTCCCTTAAATCATGCTTTTTAATGTGATCAGAAGTATT ACAGGAAGAACAGATTGACCAAGCTTGTCTGAGATGCCAAACTCAACCTCACTTGTGAAAAGTCAAACACTGTCA TTTGGGAAAAGTCAAACACTTTTGAAATGTAAACAAAGTTTCATTTATTAACCTGGGTTACCAACAGGCATAATC AAGGTA CAAT CTTTTAAG TAACAAAAATTCATATTATTTTGAAATGTAAAAAAGGAAG CAAAGAGATGTTTG CTG TTCTTCCTGCAGTAAGCATTACACATTTATAGATAGTACTTTATATGTGTATATACATTATATACTTTTCATATA TACGGTTCTACTTTGAGAAACTTGAATATAATTGAAATATCTGTATTTTGGTCACACTTACGTTTTTGCACTCGC TTCGAGGAACAATAAGCCTAAAAATAAAATTCAGACTATAATTAGGACACTTCAAACTAACTCACAAGTGCACTT CTTTAGTAACATCTGAAAAAGTTTCCCATCAGTTTTTAGTACTCCTCCCATAGTATTTTATGAAAAAGTATGTAA GTTTGAGG TGGAGAGCAT CTTTATATTTGTCA CTAAATAA CTGTACTGCT CCAACATAAATCACACAGAAATAGA AACACATTGTTACTTCCT CATT CTTG AG ATTT CTGAGTCTGC ATTT AAAAAAAAAATG CAAAAAAAAACCAACAA AACTGTCAAGCCCACCTTTCCTTAAAAATTCTCCCAATTGTGACCATTAAGTCTTAAAGAAACTCACCATTTTCT TCAGCAAACTGTTTGGCTTCTTCATATGTAACATCTCTCTGTGCCTCCAAATCTGCTTTATTTCCTATGAGAATT ATTACCTAATTTGTGATCAAAAGAAAGACACCTTGTAACAAAAGCAGTAATTTAAACCAACCATGCAAATTCTGA AATC CAAAAATCTTG ATAGAAAAAATGACTGGGCCTAAAAAGAAAACCAAGTACTAG CAATTAT CCACTAGACAG G GCTACAGATTCATGAAGTCAT TACTA CAAAGGAC CTTAG CAATGATCTAGGT(N) xGTCTAGTCCAAAGATTAC GATATAGCTGAGTAAGAT TACTAA( N) xAGAAACCCAAACTGAGATCTTGAGACAAATACTATCCTACCCCACAC TATCCCTAGAAAGGATTGAGTTTATAGTTAACAGTAACTGATAAGAGATGTCTATGTGTATACACAAGGGGCAGC TGGGAGTAGGGGTTTGGGGGCACGTATGCAGTAAGTTCAAATCCAAGGCAAAATATCACCCACCAACCTATATCC ACCTTGGAACCATTCATACAAGTGAAGTTAGGCCAAAGTTAAAGGCAGCTCTGCAAAGTTGTAAGAAAAAAATAT GTACCCTTG G GAAACAGCAAAG GG CTTTGG CCTT TACCTGG G GTGGG GAGGGGGGTTATGGATAAACTGTCTTTG TACATTTñTGAG CTTATTTGACAC CAACTG GTAAAATC CTTC CCTTTTTCCCCCTTCCAAACAGATATGGTAAT C TAGATATGAACACTTTAGGAATGGATGGTTTCAACTAAAAGGCACTTCAGCCATTAACTTTTTTTCATGTAAAAT TACAG CTCCTGGCTCTTC CACTTT CAAAAATGTGTGTC CATAAACCAAATAAT CATTTTTATCTGAATGTAAA C C TCATGCAAGGACAGTTAAGTAGTACAACAAAAGTGAGCATTCTTTAAACAGTGTGGACAAAGTGCCCACTGTGAA GGGGAAGAAACTTT CATATACTAT CCATTAGTATTTTAAAAGAATAAAATAATGATACTTAAAAAGGAAATCAAT TTATAAAAAAT CAAGT CTGGTAAAGC CACAATGACTAG CATAGGGC CATTACAAGATAGGTACTCAAAA CAAGAA ATACTGCCCTGCTCCTGATTCCCTATGAATCTCCAAATAAGGCTTCTTATCTCCCTGAAAGGGAAGACATAAAGT GGCATGCTAATTACAGAGATACAAAACATGCCCACAACAAAATGGTAGAGAACACAACTTCTACACAGAAATCAA GGGCAATTCTATAGAAAATGGAGCACTTAAGAAAAACTCAAGAAATTTTTTTTCTCCTGATCTGGTTCTATTTCA AAGCACATAAATGAGGCAGGAAGGACACAGGGCGGTAAGGTTAATTTTATTATACTCCTCCACGGGCCCTCTACA ATGAGTTTTGTTTTTAATA (N) xCTTGGGTTTAATGCGATAATGGAAATGAAAGTTTCAGGCACAGTCCCTAGCA TGATAGGCAAA CAATGTTAACTGA CT CT CAATAT GATTAAAC CACCATTTCCTGATAAAAGCTCATCTTACCACT GATAACACAGTTCTTGAAGGAGGCCTC(N)xCATCTCAGTCTGGTTAAGAAGCTAATGTTTTAACACATATAGAA TCCTTTTTATTTTTGACTGAAATTTTTATCCTTAATTCTCCTCCTGTAG(N)xATGCAGATCCCTTTTAAAATCA TAATTTCAGATTTCATAGTCTTCACATGAGATCACCACTACATTCATAATAGTAATGACAAACATGAACAAAAAA CCTAGGTATAATAAAATGCAAAACCTGGTTTAATAAGAACTGAAAAATAACTGTTAGGTTTTCTTCAACTAATTG AAGAATACAATAAAAATT CT CT CATT CTTAGTAC CT GAAGACAAAAATCTCACCAGTAAATGGCTTCCCTTT TTG G GTTAGCTTAGT TC TTAAATTTTCTGTGTAATAAGATG CCATTTAACTTACAGTATTTGGATTGGTGAGATT CCT TGCATCTGTCAACCAGCTGCTTAAGTGGTTATATGTACTTCTTCTGCAAAAATAAAAGTTTAAATTTGTGACTGC
C (N) xAATACATATTTAATCAACATATCCTAAAATAATAATTTTGCTTTGACCAGTTACTCAGAACAGTAGTTGT GAAATACAACTG CATACTTT TCAAAAAT CTAG CCAAAC CTAATATTTTTAAATAAACAACTCAATATCCAACAAC TTTGATAATTA CAGGT CAGGAG TAAATTGTTTACTC CTAAAAGCATA CCGTAAACAGAC CAATAAGCAAGCAAGG AACAAC ACAGTAAATTGATGGAAATG CCAATGTGGAGT CAAAACTC AAATTACTTTGAAATC AT A CAGACTTTCA ATCACAAAGCACAAAGTTGTATCAAAATGCCTAAACATGACACATTTAATGGAAGACACTTTTGCTTGTCTTTAA ATTTTAATTTCATTAAAAACCAAGTCACACTATTATAAGCTGGCATACAATTCACCTTTCCTTAATAATTTAAAG AGCAAGCTAACTATAACAGCATAAAATTGACTTTGAAGACATTTGACAAGAATTCAAGTCCCCAGATATCAGTGA AAAAGTTCGTCCTAACTTTGTTCTCTTATCTATCAGGCAGCTAAATTATTCTGGAATTCTAACTGCCCAAGCAAG AGTTTAAAAACTCATAGGCTCATTTGCACTAGCCAAGTAAGCTCTAAGATAAGTAAACAAACCCAAATATTTGTA ATATCATACACAGTAGAATCTACCAAAGGGGTCAAAGATAATAAAATTTTTGAGGTTAAAAATGTTTCTGATTTA TAGACTCACAGAATGT( N) xACTTAACTGGTGTCAAAATCAGGCCTAGAACTCTTTTACAAATACCACATGTCCT TTAAGTGTTTCTAGTTTAGTCCTGTGAATATATTCCATGATCTCTAGCTGCCAGTTCTCATAGAAAATCTGTTAT TCAAAGTATATAGTTCTTTCCCAATGAGAACTGAAAAGAACTCCTTAGTCAGACTTTTATTGTGTTACATCATAT TATTCATAAATCACCACTTAGATGTCAAAAAAGTCATATATATCAAAATAGCCCCTTTCCCCCCAATGTTTTAGA ATTTTCTAAGTAAACAATGATAAAATACAATTTTTAAAAAGCACAAGCCTTATCTGTGTTTGATATTCGTATTAT TGGCATATCTTGACTTTATACAACAGGTATGTTT TACTAC TT CATG CATAAACCTTAAACAAGTCCGAACAT TT C TACTGACTTTGATGAACTTTAAAATTTCAAAGCTGACACAGAGAGTGAAGTACACAGAAAAGCCTAGTACCAAAA TCATATAAAC CATATTACTTACATGTAC TATAACTT CTGC CAATGATTCTGGTAATTG( N) xGGTGGTACTAGTA CTAAAAAAAAAGTGCTAAAATACTAGATATGTCAAAATGTTATGGCAAAACAACTGTCACGACAAGCTACGCTGT CAATTTAGAAAAGCACCCTGTAACAATGAGAGTTAATGACAAAAATAAAAAGGGACTTTTACGTTTGTAAAATGT CAAAGAATAACAGAAAAAAATTGCTGTCATGACTGCAACACCAAAGTCACTATAAAGTATCACTTTAGCGCCTGA CTAAAGCCTCGGGTACTC{ N) xTGTTCCCAGCTGAAGTACTTTTTTTGATGACTTGAATAGACACAGATTTTACC AAAAGT ATTAA CTATG CTTAAAAG AT ATGCTT AAAA
>H slO _15859116-15871760
TTGTGCTTTGACATTAAAACAACAACAATATAGATTTAACACAGCTGGAGATAACTATTTAGCCCTAAGAGTCAT TCTCAGGTTTTG CTAAAGTT CCTAGATAAAAT GACCACTT CTTAGAATTTCAGCATGAAACCTCAAACCCTAAAA CTATAAACTTGACGATAATAACTTTATAAGGAAAATAATTATAGTTTATCAAATGAGTAGATTTATGAATAATCA AAGGGAAAGAACATCTTTGCTCCCAAATAATCAAGAAAAATTACAATTTAAACCATTTAAGTATGTAAGTGTCCT GTTATTTGACTCTTCATGATAGAAGGTAGGTTCAAAGTTTCACAGTATGAACAGTCTCTTTAAAATGTATAAAAA G CTGTATTTGTACT CTG CTCTC AAGGTT AC AT AATCTGTG AACATT ACTATACCAAAAACTTGAGAAC ACGATTG TGTGTGTGTGACTGAGAAAAAAAAAGTTTAATACTAAACAAACAGAATAGACTTATGAAACCCAACACGTTCACA CCACTCAATTCCACTCTCATATGAAGACAGAAACTTTAACTTAAlAAATGACGAATCTTAGCAATACAATTTGCTG CATTTTCCTTTT TCTTACAGACTG CCTATAAT TGGGGAATTC CAAGTGG CTGTTTGTGGCAGTCAGGCCAAAAAT TCTTTCCTAATCAGCCAGCAGCCCCTGCTTCAACCACCACCCTGGCCCTGTGGAATTTATTCTTATCACTAGGAG GGAAGTGGAAAATAGGTACTCTTTTGTATCCATGGACAGAGAAACTAGTATTTTTGTCCTATGATGGGAGGGATG AGTTAGATGTGC TAATGTGG CAATGTTGAG CCAGTAAAGTAGGAAGGATGGTTTGGGAACAGCAGAGAAGAATAA AAGAAAAGGAACAAAGAAAGAG GGAAAG(N)xTGTGGTGAAGAGATTAGATATTAACCTTTCTTTCAGAAAAAGT GTGTATTTAAATTAAGGTAGAAGTAAAAAAAAAAAACAAACAAACAAAGGATAAGAT CAAAGAATAAATGAATG C TAAACTCATGAAATGTTTTGTAATAGTAAAAATATATT( N) xAATCTATCTTTTTTAAAAAAGTAGAGTCAAAAG TAGCCATAGCCACCTTATTCTTTGTTGGGTTTGTACGTGTGTGTTGAGAGGGAGAGTGCAGGAGGGAGAAAGAGA TTATTTCATCACCTTAATTGTATTAATGCATGTATAATCAAAATGATATTGCAAAATGTTAATTTCATCTTTAAA G GGAAATCTG CTACACTTAT CAAAGCACGGG GAAATGTTCAGAC TT A CAAAATTAACACAAG CT CAATGACTTC C TTAATGCCATTCCTCC AAGC AT CTCATAGAAAGTAAATGTAAAATATACACAGAATTTGAGAAGTTAGAGAGAGA TGAACACTGTAG CGTAATATA( N) xCATAGTTAACTGCTAAAACACTACAGCAATATGTAAACAGAAGAATACAC AG CAG CGATTGACTTC CCA CTAAGAT CTTGAGG GGAAAAACGTT CTTCTGTTCTAC CTAAAAATTAAAG CAAAGT AAAG CCTACATCTAGAAAAC CAAACAATTCTGGTGAATACGT CCATTCATTCATT CAACAAATA( N ) xGTCAAGA AAAAAGTCTCTGAG GTA CGATCTG CAGT C CAG CC TGAAG GGGACAGTGAG CCATCCATCT CAAGAG CTG GACAAC TC CAGG CAGAGAGGACAG CAAG CACTGGAGAGGGCTCGGCGTGT CTGGGAAAGTGAGAGGAC CAGTGTGGCTGGA GCACAG CAGTAGAGAAAG CAACGCAACTGCAAAGAC CACCAGGGTCTTAAGGGATACACAGGAAATAT TT C CAAA TATAAAAT CTCCACCC CCTAGC CT CATACAAGACGCAGGCAT GCGTGCACATCCCACTTGCCCCACCCCCCGCCT CCATGC TTAACTTT CTTTAGGAT CTACGAATGTGTGTGTATATGTGTTAGGGGGAAGGG GACACACTTTGATTCT CTAGTTGCCAGGCTGTTT CAGAATAG CTTTAGTT TACACAATTAAA GT CACC GAAGATAAAA GT CACT GAAAATA CTGTTTAACCATATGA TCACTACTAAAGGAGTAAGGGCAGGACAAATC CAATTTGATTAT CATA TG CATATCAC C CAGTGT TCAAGT GAAA CATGAAAAAGGAA CGCTGTCTGACGAAAAAGAGGAT CAACCATGATTATC GTTATTACA TATAATTATCAC CATCTGAACC CC TG CTAT GGAACC CT GTTCTAAAATAGAAAAGGAA( N ) xTTTTTTTAATGAA AAAGAAAACTAGAT CTACTTCTTTCTCC CAAATCATGACATCTTA CGAAGTG CAAGTCAATAATGGTTTAAATT C CAT T TAAAACTATT TACAATAC CCATTT CAG GTTGT TT CAGATTGTGTAGGAG GAGAATGAAATAAACAG CAATA AATGAGAG TTCCATGGCAGTTC CTAACC CTAGTG CATGTG TCAAAGAGTTTAATGT TGTAAAAT GCTTGAAATTT TT TT CAGTGTGAAATCAT TTTT CAGATG GAATAGATAAAAGT CAGAAT TT CAGG CATCTGGCAATT CTTTCTCTT TG C CTAATAAAGTATTAC CAGTTGTTTTAGTGAAGCATATTCTT CGTTAGGTAATAAAGAAACAGTGCTTTCTT C ATCCTCATTTTG CCAAAAATAAATT CATATT CAATAATAGATGT CTGTGG CTTTTTTGAATTAT TT TGTACTGGG TTCTGCCCCTCCTCCGACAAAAATAACAACAGACCTAGCATAGGAAACTCAGCATAATCAGCTGGCTTAACCACA CTTAATAACC CTGCTGGTTTAGGGTGGG GGATAGGAAG GAAGAAGAGAGCGGTACTGAAC CATAGTGCTG TG CT C AAATGTGCTGTCAGAGGAACAAAACAGTATCCACAATAGTGGACATCTAAACAAAAAACATTCAAAGTACCTTAA AAATCAATTTACCTTACAGTATCTTAAAGCTTCCATTAGTGTTAAAAATCCTACTGCTGCTTGTTCATGTATACC AA GAAGTT CT GCAAAAAACAA CAACAA CAACAAAAAACGAAAACTTAG CATT GTAGTAGTGTGTATAAAT TTGAA ACGTACTTATATGTATATATAT TTATACTACACCAAAACTTTTAA CTTTTACC CAAAACTATACTT TAAAAC TTA GCATGTAAT CAT TACAGTA CTG CT CATTATAAAATACCATTAGATCAG GT TTGTG GTGGGTAAATGTGATGC TTT AGATGTATATTAAATGTATTTATAAAAC TTGATATT CAATGTTAACTTTT TAAATGAGAGAATTTACT CC CTGAA ATCCATGC CATTGTTGA CAAAAATACAAGAAGAAAAAGAGACTCAATACCTATGTAAAAGGG CTAACTTGAATTG ACTTTGATTTTGAAAAACAAATGGATATGCTAAT CACTGGTAAAATTTAT CATTAGGTAG CAGT GACTCCACTTG CCTAATACTTAAAACATGGACATTTAATGC CATC CCTCTTTC CTATAAA CACAGG(N ) xGGCAAAGTTTCCTTAA ATGTTT CTGGTCTGGGGGGACATGATAAATAG TGATTAACACAAAGAATTAAGAAGTAAGTT CTGGACTC( N } xT CAGAACAGACACAACG CAGGGAATAAAAATGTGATAATAATAAAGT CT TTTCATAAAC GATACCCTTCAG CTAAC ATACATTAATATAATAATATATTTGGGT C CATAATATATC TAAAAAG CATTT CTTT CTGGGTAAAAATTATGTT C ACTGTAAACATAATGCATATATAATTTTG CAT CCTTTTTACCAT CTGGTCATGTTTTGTC CATT TTTCCATTAGG ATATATTTTCTTAT CAATGAATGTAGTAAGAATAT TAACT CTGTTATTTTTGTGG C CAACAT TT TACTGTGTAG C CTTAACTACAACAT CTAGATG GTGTAAG GCAAGAGC TGACAGTTTAACTTT CTGGATTGGTAAAT C CAAAAT TAA TAGAAGGTGCATATACATA CAGTT TATACTTT GTCACTAGCTTCTTACAGGCACAGAT CAGGTAGAG(N ) xGAAA AACATT GATTAT TTGCTT TGGGAT CATC CGTAAGAGTGTAAAGGGCAT GAGATTAAAAGGTTTTAGAATCTT CAG GT TARAGCAAAACAAGGGAGAGGAGAAGAACAGC CCAAATAGGTATAAATGGATAC TT CATñTCTT CG CAAACTT TTCTCTCCCTTTCCTT CTTAAATGTAAG CAACACTAAATGTTTT CATGTAAAAAAGAATACATACATTGGGAAAC AGTTT CACAATTTAATTT CTAAAATCAGTTAC CAC CAC CTAGTGTTA CAATTAGATA CTACATCAT CTATTTCCC TGGACT TTTAAACAAATGTCTAATAATAATATCATGTTACTAAATACAGCAAACTAGTTGTTCATAATTCTTGGG TAATGAACTATT TAGATTTGAT CAGTAAA CACATGCTTAGACAT CTAG( N ) xGGATTTAATATTCTCACATGTTA CGAT TTATAGAATT TATTTGTAGGAGACTT TCTCAATAAG CCTTGAAAGTGT CTA TTTTCA(N ) xGTACTCTACA TT TG CAAAAAAATT CAAATTACTTTG TGATTATATT CTTGTTTTCTTTGC CAGAATTAAT TTACTATC( N ) xTAA CTGATTAT CTTATATAAAAGAGAATGAAA CAAAC CATCAAAACT CATT CAGTTT CTGACTGCAT TT TTAGGAGTT CCACATATTAAGTGGAATATAA C CAGACAAGCAC CAGAGAAACATGAC TATTTT TTGG CC TAAC TATAATAT CCG TT TC T G ( N ) xTTCTGTGTATGAGTCCATCATCTTTCAATTAGCTATGCATTTGACTGGCTGTAATACTGATTTCT TTTTATAAAAACAAAACAAAAACC CACAAAAAAC CAAGAAACTT CTGGGTTT CTGAATTA CAAGAGTTGACT CTT CTATCCCTATCC CAT(N)xTGTACAAATACTGAAGAATTCCTTTAAGATAAAAAAGAGACCATGACAAAATTAAT CTACAGAAAATCAAAATCAAAGAAAACTTTAG CT CAACATACATG(N)xAAAGAGTGGGCAAGAATGCATTTAAT TT GATTAG TTCTACTG CAGCTTATTT CTGTTCTATTGGAGAAGCAAAAGATGGT TT CAAAAACTGGGGTAGAAT C AAGTATATGGATT CAGGACCATATAATCTATATAñAACAC TGAATATTTTTCACTAAAGAATGTACñAAACAAT C ( N ) x TATC TGTG GAGAACTG ACTGG AAT TAGG AC ATTAG AT AAT T CTT TG AG AAAAAGTC AAAAGG CAG ATC CC A AACT CATACCATTCATA(N)xTCTGGGGGTGGGAGACACATAAAATTGATATAACAGGAGT(N)xGCCCAAATAG AAGC CATC TAAAAAAAAGAC CAAATGATTTAACCT CATAAATATGAAAGC C(N)xAAAAAAAATTAACAGATGCC TAAAATTT TAATACAC T(N)xCAAGGCAAAAAACAGACTAGGAAAAAAATACGTTTACTAACATGAGACATACAT TCATTCAG CATT TACT GGACTTGTTTCCTGCT CTGACAA CATAGTAG GAGGCAG GCAATATAGAAAGAAAAATTA CAATTAAATAG(N)xTTCCCTCAAAAGCCTAGGCACCTCGTTTACAGAAGCAAGAACATGTACACAAAGAACTAT GATG CGAGGTGTAATAGAATGAAG CCGACTGAG GTAAAG CAC CATAGATGAT CAGAGG GAGGAGAAAACTTT TC C TGCTGTCGGG GTTAAGTGGAGAGACATTAAATG GGAGTTAA CATTTTAATGGGG C CAGATGGAATGAG CAG GATG ACATTTTGGGGAAAGTAC CATT GTTTTTAACGAC CGGGAGGCAGAGCCGTCT GAATAGAA< N ) xGCCAAGTTTGA GAACCAGTG GATTAAAGCACTG GATTTGTGAAG GAGACTAGTAAAAGATTATAAA CATAGGGTT GAGTAC GAA CA CAGAGG CT CTTGAATG CTAAGAAGCTGAAACTTTAGTTGG C AAG CATTAG CAAGAGAGT CATTGAAGG CACTGAG CAGAGAAATAA CCTAACAGGAC GTGTAAAATAAAñT CATTAACT TGTCAGTTTG CAGAATATACTGGAGT GGAAT GACAGACAGAAAGCTATAGCATGATAGTCTAGACAGGTGAGAGTAATGAGACCACAACAG
>Hsll_35954290-35964889
TT TACAAATCAGGT TC CTCTTAATTTCTGGAAGACT CCAATTAAAAACTT CATGTAAGTTAT TTTTTGTC CGTTT AGACTGGAATTTATTAGAGACGACTTCCCCTCAGGACTGATCTTATGGTGTAATGACTTCGATAGGGTTATGTAA TGGAAAC CATATTATCTTTTTGATTGCCACAACC CTTCACAGTT CTTC CT CTTTGGGTATGTTAGAT CTT TCAAA CATATC CTAATGGTTCTCTTTTTTTTTTTTTCTT CT CCGAGAAGAAAAAT TATGGGTGTCAG CTTCCT GATTGTG ATTTACCAACTAGAGGCACAGTCTCTGCAGGAAATATGTCAGCTGTCTCATTCCTATCTTATTTAAAGTAGGACA GACACCAAGCTTGAGAAAACTTGGACAGGAGGACTCAGAGGTAGAAAATGCAAGCAAACATGTCAGTTTCAGTCA TGACTCACAAGCTCCAAATTCACTTATGGAATCATTTGGTGAAATCAAATTGATAGTGAGAACAGGGTGCAGCAA TGATCAGAAGGAG C CCAAAGAG CTCTAATGTCATTTTCGCAG CCTTTATT CTTGGTGTCTCC CTGAAGAC CATTT ACTTGGAACCTAGACAGGAGACTAAAGTCTGTGATCTCTGGGTTTGCTTAGCTCTGTAGACAGCTGGTACATCAC ATA CAAAATATTAG CC CACG CT CTCATAGTGTGATGAGTTGTTAATAAAT C CATTC CAAATT CT TGGTTCTCATT CCTCTGAATGTAGATAGACTGGCTGAATATGGATATATCAGCTCTGCTTATCACTCTAAATGTCATCTGTCTTTA AC TTTATC CAAAACAGTTTTATTTTGCT TCAGAGAAAGCAGTGT TCTCAACTTCTT GAAAAT CCAAGC CAAGAAG AATCATAGAATGAAGAATTATGCTAAATATTTGGTCCTCAGCCTTCCTGGAGCTGTGGCATGAAGCATCAGTGGC CTCAATATGTGTTGGGTTTTCCTCGGAATAATGAATAACACGTGTGCTTTAAAAAATGTATGTACATTTGAGAGT TTTGGCATAGGGAACTGTCCATAATTCTGTCTCCTCAAGTCATTACAAGACATTTTCCTGTCTTTCCATATTTCC ACAAAGAGAGAGTGAGTGTATAAAAAATTCCCTAATATTGTTTCTGCCTCGCTCTCACCCTGTCCCCAAATAGTT GCTTTTTATGGTTTGCTTTCTTTCTGAAGT CCCAGTATAG G CGATATTTTAACATG GTTTAAAG CCTAGACTAAG GATTTGGACAACACAGTTTGAATCTTCAGC CATTTC CTAACT CTGCAACTTTTAATAAGATACAGATT CTTCTTC TGAAAATGAGAAAAATAATAGGTG(N)xGTAGCATGCATGCTCCCCCCACCAAAGAATTGAGCAAATGAATAAAT GAATGAATGGTAGTTATTTG CTATTTTC CAGTAGTTGTAAT CAGAAAT GATATT CAAAAGACGAGGCATCAACTA GTTGCATGAGGCTTTAATCCTAAATGGTCACTGCAT( N) xCAGCCCCCAACTGGTCCGGAGGTGTTTTATTATTC GACAGTTGGCTGGCTGTATTGGTGTGCTCTGTCTGAATGTCAGCCCTGCTTGTCAACACTGGCAATAGGTCTTCC ACACTCTTTCCCAATCCCTTTTTGATCCCAAACACCAGCAT(Ni xGTGAGATTTGAGCCACTGGCCTATGCAGCC AG CTTCTAGTATTACCACTG CCTGATCTTTTTCTAC CTTG GAAATTTT CATTGTAACCACTC CATATTTTTGCTA TGATCAGGCTTATAATGATAAGGCAGGGACTTGAACATTGCAGGGTGGGTTGGCACACCTCTTCAGATGGACTAG CTAGGGCACATGTCCTCCTTCCAATTTTCT CTTAAGTTGAAT CACTCCTCTGGCTTGGCCAGGAGGATAC CAGCT TTGCCAAGGTGACTTTATTATGTGTATCCTGGTACACATTCCCTTTTGTGTATGTAGCTCTGTTTCCTGGCATCA TTATTTTTGGTAGGGTTGGGGTTGACTTCTGGCTATGTGCTATCTTTCCAGCTGAAGTGTTTTGTTCCTCCACTT CGACATTCCATGAAGCCTTGTGAAAACACTTGTCCTCTTGTCGGAAGGTTGTTTGTAGGGGGTTATATTGCCCTA AACTTTTCCTTCCAGGCCATGGTGCGATCCACAGGACTGA3ACGAGCTGCAGCTCTGCTGCTCATTCACTGCAGG ACTTAGGG CTTAAAAC TTGATTTCTCTGAG CCTTTATTAT CACAACTGTAAAGTGGAGATAAGGAGAGAACAAAA GAGATAAT GTGAAT CGAAGTATAAAGCAATTGAAGGTAGT CATT CATT CTACCACG C A ( N) xATAGAAAGAGTAG GAATTATTAATAGTAATGATAGTTAAAATTAGTGGGGTGCTT(N)xTAAATGAATAAAAAAATAGTGGGAT(N)x AAATCTGTTCATGC CATGC C CCTGCTTAAAACTGAC CATTTGT(N)xACAGGAAAAAACCCTGAGCATTTGTGAC TTTCCATTGCTCATAGCATAATGACTAAACTCCTTTACGTCCCTAAAACTTTCCTGCCCAACCCTCTGGTCTTAC TCTTCATCCTTTTCTCTGCCCCCTACCCCATGGTCATCAGCCTCAGCAGCAGATCATAAAGGATATACTCAGCTA AGGAGT CTGAGCTTGT CCTATCACCAACAG GGAG CCATG GAG CTTTTAAG CAG GTTGAACAG GAGTAAGACAAGA CCTGAAAAAGCCAGTGAGGAGG CTGTTGAAATAACC CAGG CACAGGTA CATGGGGGTGTGAACTAAGG GAAAGAA AGAGTGAG GGAGAGGGAGAGGGGTGAAT TGGATT TAAGAGACAT TAGCTTATGGAATCTACAAAT CTTGGTGACT GATGATGCAGGGGAGTCCAGCAGAGCGAGGACTCTAGGCTTATACCCACAGTTCTAGCTTAGACGACAAGGTAGA GGGCAC TAGCTATTTTTGAGGAGCTGCTAT CAATGC CTAACAAAAGCCAACAATACTCATGATT CTGCGTTTTCA CTACGTGTATGTTG CAAGGC CCTGAAAG CTGTAAGATAG TTTCCTTTC CCTGTC CCTTTTGTTCACTC CCTTCCA CACTGAGTGTGGGAATTCACTGAAAGCTCAGCAAAACCCATCACCTCTGCCACACCCCCGCCCCCCGCCTAAGTT GACTACAAACCCACACATCAGACCCAGGTTAGCAATCTAACAGTGCCTGAAATTAAGTGTTTAAACATTAAAACC AATCAAGCCCAAAGAACCTAACTTGAAATGAAGAGGTGTTTAAAAGAAGTAGGCTAAGGTACACCGAGAAAAACA ACTTAAATTATTAAAGTGGAGTTTTCACCATGCAGGAAAAATGTTTCAATAATGTCAGATTAAAAATGTAAATCT ATTTACCTTCCAGATGGGTTTTATGAGAGCTCTTATTAAAATCAGCTGCCTACCTGTTTGTGAAAGAGAATTTGT TACTTAGGGAAGGTAGTCAGACCCAGGCATGCTGGAATCATTTTCTCACATTTGTCGCTGACTGAATAGCTCCTG TGACTTCTGCTTTGTCAGTTAATTACTGTGTAAACTTTCCCACTGAAAACAGAGGCTTATTCAGTGTTGTGGTTT T T TT TTT TTT( N} xTCAGTGTTGTGTTTTTTTCTGAATCCAAGTGACAAATGTTCATGTTGTCCAAAGCGGATTT CTTCCAATTCATTTATCGCCTTCCTCTTGACCCTGTAGTGCATTTGTAATTCCGAAATTATGACATTAATATCAC TTCCATATCAATGAACTTTGTGTCTAGGTTATATGGTTAAGAGAGCTTTGAGACAGAATTAGTCAGATGGAGAAA CTAGGAGGGAAAAAAAGTCCTTCATAAGTCAGTGATGGACAAGAAGACTTATTTTCAACTATGATGCAAGAAACT CTAAAGGGTCATGTGTTCATTCATGAAATAAAATATTATCTATAT(N) xTGCAGTTCAGTGACAGCTTTCCTCAT CAAATTGATAGACCAAGAATAGGTTTATACATGTAATAAATCTACCACCTTTTTTTTTCCAGTTGAGAAAGAAAA TGAAATGTGACTATGGGAAAACCTGGCCAAAATTTATATTATATGCC(N)xAGTTACCTGAGTCTGTGGCCTCCT GCTTTTATATTGGGG GTAGAñAC(N]xAGATGTATTTGTGAGGGACTCAGTGATCTTCCACCAGGCAGCTTCTCA GTAGGCAGTTTGCACCACCCACTATAATTTGGAGTAGAGTAAGCCTTCCTTGGCTGAATCCTCAAGTGAATTTCT ATTTTCTTCTCTTATTGGAGAGAAGAATGAAAATCGGATGAAGGGAAGAGAGAGCAGAGAACAAGAAATATGGAA CCATAGAAACCATGAGTCACAGAAGTGACTGGGGGAGTCGAACGTCTCCAATTATCTCTGCTTCTTATGAAGTGA ATGAAGCTAAGGCAGACTTGGTGTCAGTAAACTTTTTGTAGGTAAGGTTTTTTTTTTCCTTGGTCCTTCTAGTGT CTCTGCTCCTTTTATCTT CTTGTCAGTTGC CACATGTT CCTACT CTTATT CAAGTTTAAATATTTTTTTTGAAAG TTTGAAAGGTCTGCGTCTTAGGTGCTTTGTGAGAGAGGTAATGATATTGACGATTAAAAAAAAACCTGTGATAAC AA(N)xAAGATCCTATGGAACCATTAAGTCAACCCTATGGATGACCTTAAAGGCAGA(N)xACAATTGCCTTGTA GTGTGTACCACGACCACACA( N) xAGTGAAATGAGCATAGGAGCACATCTCAGACACACCACAAGTTCAGTTTAT GCTATTTTCACCTGTAATGTTGCAGAGGCTGCTGTGGGGAGTAAATGAGAAAATGTTTGCAAATTACAGAGCATT GTATGGAGGCCCACAGAGAGACGACATT CTTT CTTC CCTC CTTCAG CAATGTGAAACCGTGTCAGAAGCACACAT GCATGGTCAGAGTGTCTCTTGGTAAATAAGAGACATAAAGCACTTTGTTTTTTATTTTTATTAAACCAATCTTAA AAAAGATTGATTGGCTCCATAGACCTAAACGATTAGCACTGAAACCTCTCACATTTCCTGGAAACTGAATGTTTT GCGCTTGGAATTCTAAAGAACTGGGAAAAAGGAGAGTAATAAAGTATAAAGAATAGTTTCATCTCCTACCCAGCT TATACGGTGAGGAGGAGGTTGGCAGTGATTTAGTATAAGTATAAAAAGAACAGTCG( N)xCAACAGTTGACCTTA AATAAGGTTTTACATTAGTC CAAATGATTT CAAAGA( N) XGTGTGTGTGCCTGTGTGTATGTGAGAGAGAGAGAA CG AGAGAGAGAGTCAG AGTC AAAC TTGC AAAGTAAGGAAGTTGATGTGTC CAAAGAAACCTAAGGAAATAAGAAC TATAGCAGTGGGTCAGTTGCATTATCTTTCCGTATCCCTAATGTTTGGAGGCAAATGAGCAGCTGTAGGAGAAAT TATTTGCTTATTTGCCTCCTCAGATATTATTATCAAAGGCAGGAGGCCTCTTCCCAACAAGCTTCCCGGAACAAT G CA(W) xGGGACGATCTCCñTAATATTCATCAATCCCAGGATAAAAGCCCCGAATGCCAAATCATTTAGATAATT GATTTATATTTTCCATGTGTACTATGAACTATGGAACTTAACCTTTCCTCCCTCACATCACATTTTCTCATTACC TTGAACAATCATTGGTATTTTTAAATTTTGACCTGATCCTTTTTAGCCTTTATGTTTCCAGACTACAGCACGCAA TTACAGAGTAAAGTAATTGAAT CTTAACAC CCTTGAGG GTGTCTCCAGGG CATCC CAAAATCTGACGCAGGCATC CCTTTTATAACTGCCC CTGTAC CCATAC CTTC CC CTTT CTTT CAGC CAGGTGGAGTTTGTGTAAATAAACTCAAC CCTTTTGAGGGGTGGA( N) xTTTGCGTTAGTCTTTTGCTTTTTTTGGGTGGGGGATCAGAGAGAGTGGAGCACAG AAGTGTCATCCCCTTCTTCCACCATCTAAGTCTTTTGGGGCTTTTCACCAGTGCCCTCTGGAAGCAAGTGCATTG GTGCAGCTATATTTTTTGCATTTGGGTATCTGCCTTTCATGACCTTTTCCTTTTTTTAATTTTATTTTTT
> H sl4 1 067 80 499 -1067 861 36
CCCCTCACTGTGTTTCTCGCACAGTAATACACGG CCGTGT CCACGG CGGT CACAGAGCTCAGCTTCAGGGAGAAC t g g t t c t t g g a c g t g t c t a c t g a c a t g g t g a c t c g a c t c t t g a g g g a c g g g t t g t a g t a g g t g c t c c c a c t a t a a TAGATGTACCCAATCCACTCCAGTCCCTTCCCTGGGGGCTGCCGGATCCAGCCCCACCAGTTACTACTGCTGATG
g a g t a a c c a g a g a c a g c g c a g g t g a g g g a c a g g g t g t c c g a a g g c t t c a c c a g t c c t g g g c c c g a c t c c t g c a g c TGCACCTGGGACAGGACCACTGTGAACAGAGAGACCCACAGTGAGCCCTGGGCTCAGAGGCACCTCCCATATCTC
c a t g t c t g c a g c c t t g a g a c a c t c a c a t c t g g g a g c t g c c a c c a g c a g g a g g a a g a a c c a c a g g t g t t t c a t g t t c t t g c a c a g g a g g t c c a g g a c t c t c a g a a a g t a t t t c c c a t g t g a g c a g g a c c c t g a a t t t a a g g a a a t g t g t g a TGGTTTCCCTTGGGTGCCTAAGTGAGATTTGCATGTGGGTGGTGCCTCTGTATGGAGAGGTGAAAAGGGATGAGG
g a g g c c c c a g t c t t t t a g g c t c t c c c t g g a a g g a g g a t g c t g g t t g t g c c c t c t g a g a a t t c a g t t a t c t t c c t g GGGCCTCAACTCACTATGTCCTGGCTCCTCTTTTCCCAGGTGAGGAAACAGATTGCAACAGCAGCTTAATGTAAC AATCATGTGAGTTCAGACACACCAGGATTCACTTAACGTTATTTGTAGTTCAGAACCTCTATCAGGTTTAGAGGG AAT CGCT CTGTGCCAGGGAGTGG GTCTTAAATAG CAAAACGG CC TCAGAAAACCCAA CATAATCTACAG CGAGAC CTCAGCATGGCAAGCAAGGAATCCCTAAAGCCACCAGGGAGCTCCGGATGCACTGATACGGCCCAGACACATGGC GAGTCCAGGAACTGATGGGGACTTTGGGGGAGCCTCTTTTATTATTTTTTAGGATTCTGTGGTTGAAGGTCACAA CACTGGGCCTGACTGCTT CCTGAACCAAGC CCAG CACAATATGGTT CACC CCAGTGACATTTTCAGATGTTTCTT CCTGTAATGAAAGCGCGGTGTGATGTGTATGCACCTATGTGTTTACCTAATGAATGTAAAGAGAAGCACATTTCA TGCAGCTGTATTTTCATAAATGTCAGTCATTCATCATGTTAGTGTCTATTTTTCCATAAATCTGTACAGAACAAA TTATTCATTCATTGATTTGTAATAGTCTTATTGAGACATTATTACTATACATTAAATATTGCAAAAAGTGTGCGG TTTAATAAGTACTCAA( N) xATGTTATTCCACTCGTGCTCACCTGTGCATCCCCAAAAGCCATCCAAATAATGAA TAAACAAATTACACTTAAAAGTTT CCTTGTGT TC CT CTACAATT CCTCCTTC CCAATTATTTTCTTCCTCCACCA TATTCTGAGTCAGTCCTTCACATTTAATACCT( M) xAGTATGTGTTTTATTTGTGTGGCTTCTCGCACATAACTA CTTATGATGCACACATGTTGAGCATCAACAATGTATTGATTCTAATGATGGATGTTATTACCATAAATTAGCATG CCATTACTGTTTATCTATGTATCTTCTTGGTATTTGTACCATTTCTAGTTGCTTAGTATTACACAGAGGTGCTTC TCAGCTTGGAGATGTAAGCACCCCATAAAAATATGTTACTATTCTTATAACAAATACTAAACTTACTGATCTGCA AATTAGCATATATACATCAAATTTTTGATGTTATAGGACAAACAATATATCTGAAATCTGAACAGACATAAAGAC TTGCAGGGAAATAAACAGGAGCAGATGATAATCTTTTCTGGGACAGAGGCTGCCAAATGTCATTTAAGTTAGCAC ACGATTAAAGTAGACATATTCATTGGGTGGTTTCAATTTGAGTGTGATAGAGAAGTTATTGTTTAAATTCTCAGA GTGTATGCAGTTGAGGAATT CCTCCTGC TATT GAAG CCTT TTTC TT CAGTACTGGGGATACATCACAAAATGCTC CAGCCTCTACCCCTTGGGATGGTGCTGTCTGGGAAAGCAAAACAGCAACTACAGCTGAAATGCATCCAGACACAC CTCCCCATCACCACTACATTGCAAGAGAAATTATCTGCAGAGGTAAAGCCATCAAAACCACTGTTCTACAGACAC TGGAGAAACCAATAAGAACTGGGAGGGGAGAGAGGAATGCACCAAGTCCCTGTCCAGGCCCACCTCCCATCTTCC CTCAGGAGTAACAGCCTTATTCAAAAGGAAAAGGCAGAAACTGAAGAAATCAGTGGGAAGACATAGTGGCTGCTG AAGGAATATATGAATAAAAACGAAGA( N) xCTAAAAAAAAAAAAAAAAAGAATGTAAAGAACCGGCCAAGTGTAG ATGAAGGGCCCTGGAAACCTAGCTACTTGCTAGTCAGGA(N)xATCTCTTGGCATTGGCATGGTGTCGGCTACAA TGGATGOTGGGAGCTTGGTGTCACGGCTCCTTCCAAAGGACATGCCTGCTCCCTGAGTTAACTCCCAGACGCAGT TGGACAGGCCTCCTGGGGTCTGAGAAGCTCCTTTCATGTACTGAAATCCTGTCATTATGTTTTTGTATTCTAGTG TCTCCCTAAAAGTACAGTGAGACCCAGGGTCCATTCATGTGTGTATTCAGGACTCTCTGATTTTTATGTATTTTA TTCATCTCTCTCTACTAC CTTTTCTACCAAACTAGACATTTAAAAAAT TG CAATAT( N) xCTTAGCTTAGAGAAG TGACAAGCTGAGAAAACATGGTTCTTATTTTTACATAACATGAATAGCAGGCAAAGCAGAAAAGCTGCACACTAA TCAATTTGCTTCAATACATCACATAATTAAAGTTGGGAAGCTCTGTGTGTGTGTCAGTTCACGTGTTTTTGTGTG ACAGAAGAGAGAAGGCAGGAGGAGAGAC CATG CCAAAAGGAGAC CCTACCTGTTTTGACCATAATGTGTGAGGTA CTCAAGTAAATACAGGGACTTAGTGCTTGATGGACAAGGTCCTCCTCAGATGGAGAAGACAACTGGATGCACCTC CATATGGGTACATATTAGTATTTACATAAATGCCATTTTCTAATCATATCAACACTCCGACACATTATAAGAGAT GTATTGGAGGGTGTCTGGTGGTGAAATATTATGGTGAGAACAACCCACATCTACAGCCCCTTTTCTGCCCTGTTG CACTTGCCCTGATGCGAACTTGATCCTGCTCATCCTGACCCCTAACAATCATCCTAAGCCCCCATACTGCCCCGA ATGCCCCCTGCTGCTCCTATTCACCCCTGCAGGGAGGGTTGTGTCTAGGCTCACAATGAAGGCCCTTCATTGCGT CTTTTGCTTAAAAATGCGTAGTTGTGTGTTCACTGGGCACAGAGCTCAGCTGTAAGAACTGTTTCTTGGATCTGG ATATGGACTCTTGAGCAGTGGGTTGTAATTTGTGCTCCCTTCACAACCCATGCACCTGATCCACTCCTGTCCATC TTCTAGGGGCAAGCAGATACAATTCTAGCAGGAAACACTGGTTATGATGGGGAATCCAGAGACAGTGCAGGTGAG GAGAGGGTCTGCAAGGAGTCTCAAGCCAGAAGTGTGCTGAGAAACATAGTTGTTGATGTTAACAGGTTCTGGGCA ACACAGTGAAATTCCCAAAACCACACATTTTTATGAGAATAAAGAGCTCACTTTGTCCAATTTGTGAGTCTCCTA GAACAATTCAGTAGATTTCGAGGTTAGGTTAAAAAGTATTATCACATGTTCCTTTCCTCAAACTTGCAACCAAAT
> H sl4 6 254 32 43 -625 55 478
GGAGGG GTATGCATA CACAT CTGTGC CC CTGTGTAGATGT CT CTGCTGAGGAC CAGAAATGGTAGCTCACATATG TGGTTTGTGG CTTGTTAAAAAG CAGC CCTT CATATGGAG G CACO CAAAGC CATTAACTTTATACATTTTGTTTAA AAAAAGAAAAAACT CTGCTTAC CCGTTAGT TTACTACCAAGC CGTATCAAAT CC CACACACTAACT CT TAAAAAA TATTTTGAAT CTCTCTCC CAAAAACAAAACAGAC CAAATGATTTGC CT GC CTATTGATACTCTGTCTC CTTG GTG AT TGTG CCATGAGG CTTGGAAGTGACTT CTCCAGGT CACCTC CCATTTACCTGCCTGCTGGAGCTTTAATAGAAG ACT CGGTATT GAGGGTACTAAATGTT CT CAGACT CCAAAAATAC CCAGACATCT CTGTGTGCTCCATT CAAC CT C CTAT CTATTTTGAGAAAAAACTG TAATT TGGTGCTC CTACAGTAGAAC CT CAAAT C CAACTCGGAGGCAGTT CC C GTAACTGC CTGCTTTG CAGG CTTAGCTTGC CT TTTAAGAAATGATTTATTTG CCTCTTAG CTGTATTGAGAACT C TGTCTTCCCCTGTGTATCTGGCATCAGGAACATATTGTAGGCCTGTTTACTTTAGGCCACACAGATGACATCAAC A CTATCTTACTAAAACAGAAAGGC TTTCAATTTGACAGAAAATCTTTAAACCTACC TGGGTTTTGACAATTTGAC ACAAGTATTCTCAT CT TAAGGAAAAG CC CAGGATATTGGAGCTCAAGGAATCATTTATTT TGTCTCAG CAGGAAA CTGGTCGTGCAGTGTGACTACCTGTTAGGCTGATGGATTTTCGAGCTCTGCTCTTCTCTGCTGGCGAAGGCCTTT GGCTTTTTCCTG CAGACC CAGGGCTTGG CAGGAAAGACAATGGG{N) xAATCTGCCAAATGGGGATAAGAATGAA GAGATTTTTACCTTGCAGGAAGGT CAGATGGATTAGTGGTATTAACAGTAC C CAAGTGAGGAGCT C CTTCAGTTG CCTG GTGAGAGGTAAAGTGTGTTGAAATAATTAATTGCTATGAAAATACCAG CATT CACAAATCTTTTTCTCAAA TCAACC CAGGGAGGGG CTACAAAG CC CAGTTTTCATGGGC CACAGC TT GAATTGGC CTAT CCAAAG C C CAGC CAT TGGCAG GATCTGTTTATT CAGAGGAGTT CTTAG GACAGTCAG GAGC TCTAC C CAGGACTC TC TTAGAAAC CTGCA GGTG GAGTGAGCATTTAT CC GAAGAGACAG CCTGATACATAAGT GGAGAGAGCTC CTCATTCACTAAAATATTT T
t a c c t t g g c a t t g a a g t t g a c a a a a c t a a g a g c t a t t a a c t a a c a t t a a g a g c a g g g g c c t c a a a g g a t g t g a a a ACCACAGAGAGTGTGATTCCAACCCCCACCTTCCGTCTTCTCCCCCTTCATCTTGTGCGCTTG(N)xTATGTCTA TATGTAGAAATAGTTATGCCTAAAGGTTTGGGGAAGAAGAAAGGAGACTGTATGTTTCAAAATAAGGTTTGAAAT TTTGAGTATTGGATACGAGTTAAGGCAGGCCTGGGACTCAAACTGCCAGGGCATGAGTCCTGGATAATCTTGTCT GGGAAT TTGATCTGGG CTTGGGGGGAGAGAGCGTGTTAGTTATTTAGT CTTT CTAAGCTTGAGTTT CG{N) xATA GGA CAACTGACTACGGGG CAAAAAGTAGACTT(K ) xTTTAAGAGCCAATTTTTATAGTCTGTAAAGGATTTGATA TGGTATTTGG CATAAGAAATAGTTTAGTAGAT GT TAGCTATTAT TACTTATGGT CAGAAATG CAATA CAGAGATG ATCAAGAGTAGGCTAAAATAGATGGGAGTTTTAGGAAGAGACAAGGGAGTTTGAGAGAAGATTCTGACTTGATTT TTGAGTTGGC CTAAAT CCTTG GAGACTTGCTT CTAATATATATACACTAGTCAGTTGTAT TCAAAAG TCTCTATA G CAACCAC TTTTAT CACTTGAAAC TAAGTACC T CTGAATAGAAACACT CA CG CATGATTG CATGCT CACAGGAAA GGATATAAGAAG CTTCACTTATTTTACAAGAAAT CTAAATTTTCTTGCAGG CTAATTTGAAT CTTAAAAGAAAAG GATATTTATTAGATGT CTACTGTC(N) xTGGGAAAGTTTCATTGATGGATGCACATTTTGAGGCATTGTATTATA ATTTGAAA CGAGAT CT CTAAATTAGCAAGTTCTGGTGTTCAATTGCATAAATAAGAACTGGTTTGCAATAAGAAT CAGATCAAAGTTTAGTTACAGGTAAGGATCATGCCTTCAGAGGAAAATAGAATTATTATTGACATTATTTATAAT TTGTTAGTGGGGTGGTTCATTATAATAAATGCAG CCTAGC TCAGTTTTCC CAGTAAGGTCAT CAGATCTGAGTGG CTGCATGTAAATTG CTAC C CAAGG GTGACATT CCATAAAGAGGGATGTGTGT TTGAGAATAGGTAG CACCAGTC C AGTGGTGGGAATAGATATTTATGCAGT(N)xGCTGTTTACTAAAGGGCTGTGGAGGTGTCCTCCTAAAT(N)xTC AG CC CTAAAC CTTTTCTCCCACTTAGAT TGAAAATATACTGAAG CATCAC CTTC CTGGGACT CCCATCAAAATT C TTGT CT GTAGAAAG CTCTTGCCTGTG GAGAACAGGCTCAGGCTCAACCAT CTGTATTTAGG GACACAAGGAAAAG TTGC CTATGGACAATTGC CT GTTGACTTAATT TTAGATTTAG CATCT CAGTT GGTTTCAATGATC CTGGG CAAT T GTGCTTTT TGAGAG GCAGTATATATGAAAACACTTTG CAAAAATATAAAAGCTTTAGTTATC CATT C CAAAAA CA TT CACGAAGG CG CACCATATGCTAGG CA{N) xAAAATGTAATTGGGTATCATCATTTCCATTTCTAATTGTTTCC TCTGCCTG CAACAGGAACAGGACAGGAC CAAT TTGCAGGTGC CATC CGGGGTCT CAGAGC CCATCT CAAAGTGTG
GTGACCTAGATGTCATCTTTGAATATAGAGCCGCCAGCCAGAAGCTCACAGTGACCATTGTGAGGGCACAGGGCC
TCCCAGATAAGGACCGAAGTGGTGTCAACTCCTGGCAAGTTCATGTAGTGCTGCTGCCTGGTAAGAAACACAGGG
GCAG GACGAACATACAGAGAGGGC CCAACG CCGTCTTCAG GGAGAAGGTCACCTTTGC CAAG CTGGAGCCCAGAG
ATGTGG CTGC CTGTGCTGT CCG CT TC CG CCTGTACGCTGC CCGGAAGATGACCCGAGAGAGAATGATGGGAGAGA
AACTATTCTATCTCAGCCACCTGCACCCAGAAGGGGAAATGAAAGTGACTCTGGTTCTGGAGCCAAGAAGTAATA
TAAG CGTGAGTATGTTAAATG GTG CTGCTAATGATGTG GTGTGTT CGAAGACCT G GGATTGT CAGCC CTTGATTT
AGTACGTTCAGTCTGTGAATTCAGAGACTTTAATTTAATTTTCCATTTGGCTCTCCTCACCCTTGTAAGCCATGA
TTTGCACTTCATGGTACAATGGGTCTTTCATGCCTGTCTTCCTGATAACCTGATGGATAAATGCCCTGTCCATTT
TAGGACAAAATAGCAGCAGTGTGGTAGAGGGCATGGAGAAAAACAGGGAAGGTGGCTTGGCCCTCTGGATTGTGG
ATAACTGATCCTACTCCTCCTGCCCCCAATCTGCATCATGATGAATCTCAGAGATGGGACTTTCTGGGCAAGGTT
CTTAACGGGTATGCTGGGGCAGACTGAGAGAGCAGCCCAGAACTTCAGCTGCTTCTGAAACCTTTCATTTGTGTT
ACTACCGT CTTCCCTCATACCCCATTTC CCTGCTTGCCACAGGGTTTCAAGGAAAGGTGT CATAATTTACATGC C
TACAGGATGGGCAGCTCACTACTGCTGTGGGTTATATGTCATGTAGTATGCCCATCTCCTTTTACAAAACCACGT
TAACAGAT CAGGGAGCTCTGTAGG CT CTGGATTCAATCAG{ K } xTTTTTTCTTAATATCTGAGTAAAGCACATCG
GAAATG ATGG AAAACAGAATGC AGTGGAGATC AGAAGGGATC CTGCGTGAT AG AGC TTGG GAATGGGGATG ATC A
CACCCATATTTATGCT (N) xTGGGGAAAATGAGCTAGTGTAAAAGAACTTTGCAGGTGGAAG {N} xCTGAGAGAT
TTATACGGATTCAGAATTTGATGGGAGAGGCAAGCAGTGAGTGACAGGTGCCACAGCGGGAAGCTTTTATAGGAG
CACAGGAGGGGGAGTGAGTCTCACTTAAGAGTAAGCAAATATTTCATTGTTGAATTTTTATCTTCTAGACAGCAG
GGTCTGTAAGACATCTATCACGACCACCTAGCCCTGCCGTTTACAAACTATGTTCCATACAGCCTCAGGTTAGGT
GGGGGGCGCTCTGCAGAAATGTCTGTGGTTTGGTAAACAGGATACTTTCCTGCAGATGGGCAGAATATTTGATTC
CTTTTATTGGTTCTAATTATTGAGTTTCTGGTCAGAAATTTGAAGAAATGATTTCAAACATTGAGAAGAAAGGTT
TAAGTCAAGTCTAATGCCCTTGTTTCTGGTGGAGGAGATGGTGTTTCAGGGGGACAGGATTGCCTGCCCCCACAG
GGTCGATTCCTACTCCCGTGCCTTGAGGGACAATCAGTGTCCTTCTCAGTTTTGTGCTCCTCCTCCCGATGTGCC
CGTG CCTG GTGGGGAAGTGTAG GTGCTCAG GAGAAGCGTG CC CATGAATAAACATGACAGAAAGCTCAG GTT CT C
CTGGGCTCACTTGGTGCCTGCCAGGTCGCTGAGACCCACAACCCATGTGTTGTCTTTTTTCTCAAACAGTTGTAA
ACGTGCTCATTTGTAATGTAAGTTATTTCTTTAAAAGGCAAAATTTGGCACCTGGCATATATTTAACAACCTTAA
GCAGAACTAGGAGCCAAATGTTCTATGTGGGGGTAGACGTCCAGTGGTCATTTCTTTTATAGTTAAGACACAACG
CCTCCTGTATTGCCATATTCATCAAGGAGCTGCAGAGAACAGAAACAAACCTCAGGCCTGGAGTGGAGGGGTTCC
TTAGAAACAT CCAAGCA CAGATTACA CAGGGGGGAATTTGAGTCTATATTGAATG TATTATTAACTCTCCTTGTA
TCTTAGAGGGTTTGATTTACACTAGTAATAACGTCTTTGTCCGTTTTGCCTTTAGAGAGGAAATATTGAATGGAG
A CAG CT CTTGAAA CAAATATGT CAGTTC CTAATGCAG GGATCTCAT CCGTGACTGTGCTC CCTGGGCCCATTGTA
ATTCACGCCGAGTGCGCCCTTAGTCATCTTTGGCAGTCATTGGGGATTGGGTCACAATTTCACCAAACATAATAT
CGGCTTTCTCTAAAAGAGAGTAGTGCAG CGTGGGCCAATGGATT CTTTGTTGTT CTTC CC CT CCAGAGTGGAGGG
TCTC CG CT CAGC C CATCTGCGGTTTCTCA CAGTGATAGTACTTCAT CCACGCAGTCGCTGT CTCATGGAGGGGCG
CCAGAGCTGTTGGTGGGGCTCTCGTACAATGCCACAACGGGGCGATTATCTGTGGAAATGATCAAAGGCAGCCAT
TTCCGAAACCTCGCTGTTAACCGAGCACCTGGTAAGTGTGAGTCTGTTCTCCCAGCTCTGGTTCTTCCAGAGGCA
AGTGGAAAGCCCTGTCTGCTTTCATCTTAGTTCTTTAATTTTTACCATAAAAATGATCTTATTATGGCTTAAGCA
AGCTTT CACAAAGAG ATAC CTCTGTATT TTTTAAATGG CT CT CTATTTAATTAGAC TT CTTTGCTATCTAGATGT
TTCTGGACATGAAATAGACTGGCCTGTGTGTCTATATTGTTTAAAGGAAAAGAATGAACAGCATTTTCAGCAACA
TAAT GT GTATGGGCAATTAAAATCTG CTAATñCGTCCTAG GC CCTCAACTTTAAAGAA( N) xATGGTTGTGGTGG
TGA CAGAGAAAGGAAGTAAAGGAGGCAGAT CTGGGGCAATGCAGGACTGGCTTT CTAG GGGAACTGC CACTT T T {
N)xGGGCACCAAACTGTTATTGAAAGAAGGATGAAATCAGTATATTTGTGGCAGGCTGAAAGAGTCAGTGGGCTC
AGGC CT CAGGTATCAGCAGCAG GTGAAGAACTAGGAGCAC CACTGGAAAAGAGT CATCTC CTA CAACTGAAT CTG
GGCTCTCACTCTTTTTTTGCCCTTAACTATGCTGCAGCCCACAAAGGATCTGTGGAATGGGGGTATCTTGTTAGT
CAACTTGTATGAAAACCATTAGGAGGGCAAATTAGTAAGGAATGCCCCAGCACTTTGGATAACAGAATAGACTGG
GTTTATCAGCCATATTCTATGTCGACACTCAGAGCCACATGAGGGCTATTTAGAATGCATAATAGATGCTTATTA
TTAATTGC AT AG CT AG ACTCTTTTTG CT AG AT ACCAATTTGTTTTC CAGAATAG CT CT GAATTT ACTAGAGATCT
GTCTGGATTTACTACTTTGCCTTCTGAACTTTTGCTAACCTCTTTTGCTGTCCTTCTGAGATCAGGTAAAGTCTG
AACAATTTTTTAGGAACAGGGTAGAGATGCAGAGAATGATGGGGTTGTGGATCACACAATCATGGCCTATTCAAG
TCCATGCTGGGAAGCTTTATTTGGTATATTGGGTGGTACCGGGCAATCCAGAGG( N) xGTGTATACTTTATAAGT
TTGCATTCCTGTAATACATGCAAAACTGTGATGTTGAGGATGGAACTATTATATCTTGATAAATATCTACATAAA
TATGCTAAAGATGAAATTCCTGTAACAAATATTTAAAACCAAATAGGAAAAAA(N)xGGACAGGAGCCAGGAGAG
GAATGGAGCTGTGAGTGGGGAAAGGCGGAGGCCGCATGCCAAGTGCACCGCTCCCCACGGCAGAGGGAGAGCCGG
CGGCAG CC TGGGGCCTGCTGCAGAGT CCAC CACAGGGT CG CCGC CGGCCAGGGGAACTTGTACCACTGAGGG CCA
TGACAG CTTGTG GTTGAGGCTCTCGG CT CAGGTGCAGTTACCTCTT CCGTGACCTCAATCATGCCCGACGGGGCG
GTCATGAT CACATAAAGAATG GTT CTTAGGGAAGTGTGAAGGAAAATAGGTTGAGC CTGAGTGTTAGGAACT TTT
CTTC TCTTGAAT ATTGTTAAG GTTTTACATGTTGCTG G AAAG AGAGAC AGCGGCTAGG ATTT AGAAGTGAGAAT C
TTTTTGCCTCAAAATATATCATAAAACAAAAGGCAACTTTGAATTCACCAACCTGACCCGGTTAGATAATCGGAT
ATTTTTTAAACTTTTCCTCCTAAGTAAGACAGAGGAACTGAACAAACAGTAGATGAAGTTCTTCTCCTCTGTTGG
GGTTTAGAAGAAATAAGCCACT GAGATAGAAACCCCCT CC CCTT CTGAGGTAAAAT CCTTGG GCGAGAGGGCTC C
TTTCTCTGATGGGCTGGAGGTAGTGGGGCCAATAATAGAAATGTGAGTCTCTGCCGGCTCCAACCAGAGCCT( N)
xGAG CAAC AATGGCTACAATT AAAATGG ATTAATGGATGAAT AG AG GC AC AGAT ATGTGAGAAAAAATAAAA
> H s15 _62540507 -62549957
ATGGCAGGACCAGAACAAGGACTCAAATTTTCCAGCTCTTGGCTGGAGCCTTCCCATACCCCACCACCCCTAGGG c t g c a g c c tc tc a c c tg ttg a a g c g tg a g c tc a g tg a g ttc c a g tttc ttttt c a g c t c c tc c a a g tc a c a c tg c ATCCCTGCCTTGT CAAT CAGTACAACTTGAAG CTGCTCTG CCAAAG CTGAGTTCTC CTGCTT CAGCTC CTTATTG CTGTTG CTACGG C CAGAGGCAGTAGAGAAAGGAATGAACAAAGAA CAGAAAGG GCTGCTTTG GTGATCAG CC CTC TAC CCTCACC CCAC( N ) xCCTGTGCACGCATTCAAACCTGTGTGACCACGTCACCATGCTCACATGCAGCCCTTC CACCTCCCAGCACATCACCCACACTAAGGGCCCCACACCTCCCATCCCACCCTCCCCCGTCCTACCTGTTATTGT AT AACT CC AG CCTG AGGG CATCGCTG TCTCTGGT TAACTG GTTGTTGTAC TG AAAATACAGAAAGGTG AAGT CAG GATACAGCAGGCAGAGAAGCAG CTGG CAGACTAGGAACGACAGC TA CAGTGACTATTC C (N ) xAAAAAGAGAATA TTACTATTG TTATTAC CGTTATGACTG C CATTGT TG GAAC C ( N ) xGCTGGGGGTGGGGGCACAGATGGAAAGGGG GATAAT CTTGTGTT CAGTTTTTGAAGGGTACA TT CT CATA GTCCAAAA CT CAGAAAATA CAGAAGGGAAATATCT CC CAGC CACC CTGGTCCTTTCTCCTGAGTTTT TTACAAAT CCTTGCAGACATGTTTTATGTATATTAT CATAGTA CACACACACA CGTGTT CCCTCTCTCTGCA CAAATGTTAACATACTAAAGATACT CTTCTGTACCTT CACAGTG CA A G TA C C A TA T C TC C C A C C TA G (N )x C CCCAACTCACCCACAGCAGCCGACACAGCCCCAGGCTGACTCTAACAAG CACG CACAAAAG CAGC GAGAAATG GC CCATGCTG CT TT CTGGGCAGGACACTCCAT CC CG CAGAAGGGAC CTAAA GGTCCCTCACTCCTCCAT CTGGAAAG CCGGGCTG C CAGGG TATG GGGCAG GCGGTTGGAC TCñCCCTATCTG CCT TCCTCTGCTCCTGC TC CAACTCTC C CACATGCTG C CAGGAATACAG CAGATGGCTGGC CAGATCCT CGGC CT CTT CT AG AATG AG AG AGGTTG AG AT GGG G CC CAAAG G ACTC CC CCTAAAG G CC TGTC AAAG CACCAGGT TG AAGG ATG ACGGGTGCCCAGATTCCCACATTCAAACTGCCTGGCAGCACGTTCATTGTGATACAGTGTTGTCTTCAATTCTGC
tt t c t c a a a c a t c a a g a c t c t a a t t a t c t g a a t t g a a c c t t t a g g a g a a a a g c c a a g c a a a t g c t g a a a g a g g a g GAAAGCAACATT CTC CAGAGGACAGGAG GGAACTTCACAC CCTC CACT CAC CTG TAATTG CCT CTTTAGG GCTCC CCGGTTTTGCTGGCTTTCTTGCTTTTCCTATAGGAAGAGGAAGACAGAGCTCTTACTAGGGGGAGGCAGAGATGG
cacag caag g g ac atg cc cctag aatg ccaccaatg cccc ag g acag g ccc ac ccatg g g accag g ttatcag g g AC CCTGTGGGGATGAGGTGGAATCTGGGGAGTGAGC CTTTTTCCCCAGGCTGGGGGTGGG CAAGACGAGACTGGG
g c c tc ta c a tc tg a g tg c c c c c c a a a c c c a g c a g tc a tg c c g tg a g c a a a c a a a tc a tttc ttc ta g t tg c ttg a CAAGTT TTTGGTTGTG CTGTTT CTGCG G GGAGAGTCAAAG GAAG GTGACCAAGGATGG CCCCCTCCA CTCTATTC CCCAGACCAGGAAGCGGTAGACAGGGGCCAGAAATGGATTTTAAAGGCAAAGTTCTCAGACCCACTAGGACCATG AACTG GTAAA CT CT CCT CAAGCT C CCAAGGACAGAGGATTTGG GTCTTTGTTGGTTTTGGCC CACGGC CACAGAA CTGAAAGTCCGAAT CT GGATTCT C CCGAAAGGACAGTAACATfiAA CCT TTAGAGATGGAGTCTGAGAAAAGC TCA CCCTTCTACCAGCTTGTGATTTAGAAAGGTG(N)xAACCCCAGGACATGTGTGGCAAGGGCTGGAGCATGGGTAT CTGAAGAAGAGACAGTAGGCAAAGAGGGCAGCAACAGAAGAGCCATGATGCATGCTCCGTGCTCTGGGGTCCCTC TAGCTGAGGC CTCGGCCCCCCTGCTCCC CATT TG CC CTTGGCAT CAGGGACCCT CAGC CCTTTCTT CAGGGC CCC AAGGGGAAACTGGAG G CCAGGACTTG CAG CGT GGAATT GGTGGACC CCAT TGAACT CTTAC CAATGACTC GATGG TTTTATTCCGTCGACTGATTGTTATATAA CTCGAGT CCAGGGCGA CTGTTACCTCTTGGTATCTGCTC TGAGGCA CGTG AAGAGAGGAGGAGT TG GAGGAGGATCGAGGGGAGAG GTAGAGAGAG CAAT CATTAG GGTTGGGGG GAG GGT GTGAGAGGTCTCAGATGGCAGAGGGGCACCCAGCCCCCACTGTGGGAGGAGGTTGGAGGGCTGGCCTGCAGGGTC ACTGGGTCATGGCC CAGG GC CT CT TACT TCCAGATC CTTCAGG GTAGCGGATGATC CAGAGC CCTC CG CACAGAT AC CTGTTGCTGACTA CAAGAGATGAGAGTGCG CATGGAAATCTT CTGT CC CCTC CATGTC TAAGCC CT CT GACTT CCTTTCTTCCCCAG CAA CTGA CAACATT TTCTTTTCTGC CTAACTTGGA C CCTT CATC C CATAAC CTCTTTGTGC
c a a c t t c t c t c a t g g t t t t t a t c t c c c c a c c a t c c c t t c c t c c c a a g c a g c t c t c a t c t g g t g t t t c t a c a g t a g GATAAAATGATGTAAC CAGC CACGGG GTACAACT CCTT CATGTGAACAGC CTCAAGGAAGAAGCCT CAGGGAAGA GGCAACTTCCTCAACATGGCCCAGCAAATCGGCCAGTCCCAGGATCTTTCCTCTCTCTGGTGATTCCTGCCTCAG TT TC TTTC TA TC A TA C TTTC CTTTCCCG CGCACT CC CCTCAATTT CTGAAGTTC CT CAGAAACAA CGC CATT GAG CTGTTGGGAGAGTTGTTGCAGGCTCTCGGTTAATGAGAAGGTCCTAAGGGCAGAGACACAGGCGTCAGGGGTGCA GGTGGCT CAATG GGAAAGAAGGTGTTAGACAGTG GC CT CC CTGACT CC CT CAGC CTAGAGACTGCTGC CCTGATA AGAGACACTCACTTGCTTTCATCCATGAGATTGGGGCTAGCATGATGATTCTGTGTGGGAAGAAACATATGAGAT GGT CAT T CAATTTCTGTAAGAAT CAC CGAGGACAGGTGAG CCAAGTTC CCAAGC T CATTC CAACAATATT CTATC TC CC CAAACCTCAAGGAAAAGC CCAGTCTCAG CCTTCCCC CAGC CCAGT C CCCAG GAAG GGCACCTGGCC CTAGG GAACAAACTACT TGTGAAAAGAGAAGAGAATC CATAGTAGGTACAGGAGGGCAGTGGG CAGAGGAGGAGGAAGCA TGAACCTGAGGTGC CG CTATGCTAGCAAGACTGG CACCAG GGCAAGGGA CACCACCAGGTAACACGGTGT CATTG GCAGGTGGCTAGACAGATGCAG CATTGT CTTTGG CGTCTGTCACAGAACAAAGCAG GG CATTAGGGGG CCATGCA GGAGAGAAGGTG CCTCAGTGGCATGGACACAAGAGTGTGGAACAGGTA TGACAGATACTCACGCAATGACAG TGA CAGT CATGCATG CTTGGAGC CACT CGGGGGTGTG GATATGGTTTG C CCAG GCCTTGGGCTG CTAGG CAGCATTTT GAATGG CCAAAG CT CAAAGCAGGGGCTG CACGG C CT CCTT CATGAATACATGAGGACCTCGCTCCC CCTGGAGTT CAGCAAAAGACAGCAG CAGG TT CTGACCTGTGGCAG CCACAGCAGG CTGGAGGAAG GTGTAG CAAAGT CAGG GAA GAGAAAGGCACTGTTCTT CCAGACGG CCAGTCTGTG CCACAGAGTGAC CCTAACTG GGCCCCTCCTGGGCAGAGG A CAGTGATGAG GAAGTAGTAGT CTGGGCAATGTACAATGTAGCTGC CTCTCTCTTCTCTCTTCCCTTCTCCC TTG CAGTGG CATAAT CAGCAGGGGCTAC CAT CTCCTACAGCTCGCCCAG CTTCTCCT GCAG CTCCTCCTTGACAT GCT GCT CAGACTAAAGT GCAT TGTT GATCTC CATATT CT CACTGTT CTG GACAGAGAGAAG CAAT CACG CCAC C CACT GCAGCTGGAGACCG CAGAACTTG GTGTCTGCCTCCCATGGCGCCGG GAAGGATGGAGG CAGATTAGAAAAATGAT CCCCTCTCCCCCATAG CCAT CAAACCAGGGCT CTGG CT CACAGGTC CCTT CAGAAGTG CCACTTCACATGAGGGC T A C (N ) xGGGAGGAGGGCAGTCTCCCCAGGTGGGATGCACCAGTTTTACAAAGCTGCTCTCTGTAGCTGCTCCTT GAGCTCAGGGTTCTGGGAGAGTGTGGGGCTGATGGTGGTGAGGTCATTTTGCACGGTCCCCAGGGTTTGCCTGCA TACCTCCGCCTGCTCCCCTCAGAGCTCAGCCACCCGCCCCAGCTCCAGCAGCCTCTCCTGCTCCCAGTTCAGGCA TT TCAAGT CCTCATTGTCTTGCATGTGGGCCCACTG GAGCTGTC CTGCTAGACTTTCCAG CTCCTCCCACAGGTG CT CAG CCTCTGCCTGTAGCTGCTGCTGCAGCTGCAGAC C CCAGAACTTGGTGT CTGCCTCCCACGGCACTGGGAA GGATGGAGGCAGATTAGAAAAATCATCGCCTCTCATCCACAGCCATCAAAGCAGGGCTCTGGCTCACAGGTCCCC TCAGAAGTGCCGCTTCACATGAGGGCTGCGCCCCCTGCTGGGGGCTCCAGGGGTGGGATTCAGCTGAGAAAGGAA GCAGACAATAACGG C CTCTG GATT CTCAAAAACACC CTCCTCTTGGTACACAGCTCCTCTCGGGCTCCCCAGACT TGGC CT CC CTGCTAATGATT CT CAAAAAAACC CT CT GCATTCTCAAAAAAAAAAAAAC CCTCCTCTTGATCCACT GCACCCCTCAGGCTCCCCAAACTTGGCCTCCCTGCTAATGATTCCTTGCACCCTGATGGTAGCCAATCTTCCAAG CCACTT T CAGATAGAGACAACTGTGGGTGG CT GACAACACACAC T T ( N ) xGCCTCCTGGCACAGACCTCTTTCCC TCTG CCTCAAAGCCTTTC CATG CATCCACCTCTCTG GCATTCTAAG CCAT CC CCACAG CC CT CTGATG CCAGTC C TG CT CC CAGGTCAC CC CAGC CC CAGCTTAC CCATCTGGTTCCTTAGTT CAGC CAAGATCGTCTC CAGCTCCTGTA CT CGACTCATGCTATG CACCGT CT CCTC CCTCAACG CGTGCACCTGCC CAAAGCACAGGGGGAAAGGG CCCTGCA GAGAGGGGCTGGCAGCTGGACAAGCTACCATCTCCCTCTCTGCCCCTACCTCCACAAAGCCCAGACCCATGACCA CCTCTGGCTGTGCTCCTCCCATTT CACAGATG CCTGGAACAATCAAGTGACCTATCTACG GTGGGGGCTGAAGGG TCAG GT CT CACCTG CT CTGACATCTCCCGCAT CCT C TG CCACCACAAGGCGCTCTCTC CTTT CAGATT CTCAGCG TGTT GATC TTTCTC CATCTGTAGT TGTTTCAGTGACCC CATTAC CTGCAAGAATGGGC ACAG AAGTTAGG AAGGG CTGTCACTGGTCCTCACACGCTCCTGGCCACCTGGGGTCATCTTCCTTCCACATCCCTCCCTCTGCAAAGCCTCA CCTGCCCCAGGTGTACTTCCAGCTGTGCCCGCTCCTCCATGGCCTGCTGTAACTGCTGGTTAGCATCAAGGGCTT CATAAC CAGCTTCAACTC CT CCAT CTGGGAAG CTGGGCTGCCAGGG GATGAGGGAGGCTGTAGGA CTCACACTGT CCATGTT C TTCTGCTG CGTGGAGAGAGCAAAGAGAGTT CGTTCCAATT CT C C CGTAAACTGCTGGGAATACTGCA GG CAGCTGGCCAGATCTGTGGACT CTTCTGAAATGAGAGAGGTTGAGATGGGGCCCAAAG GACTAC CC CTAAAAT CCTGTCAAAGCAGCAGGTTGAAGGATGACGGG TG C C CAGATTCC CñCCTT CAAACTGC CTGG CAGCAG CACGTT C AGTGTGATACAGTGCTGTCTTCATTTCTGCTTTCTCAAACATCAAGATTCCAATTGTCTGATTTGAACTTTTGGG AGAAAAGC CGAGCAGATG CT GAAAGAGAAGGAAAGCAACATTCT CCAGAGGACAGGAGGGAACTTCACACCCTC C ACTCAC CT CTAACTGC CT CTTTAG GGCTCTCC CTGGTTTTGCTG GCTTTCTTGCTTTT CCTATAGGAAGAG GAAG ACACAGCT CTTACTGG GG GAGG CAGAGATGGCACAG CAAGGGACATGC CC CCAGAATG CCAC CAATGC CCCAGGA CAGG CCCACCCATGGGAC CAGGTTATCAGGGACC CTGTGGGGATGAGGTGGAACCTGGG GGGTGAGCCTTCTTCC CAGG CTG GGGGTCAGCAAGACGAGACTAG CAC CT CTACATCTGAGTGC CC CC CAAACC CAGCAGTCATGCTGTGA GCAAAGAAATTACATTACTAGTGTGATT CTAGTTGATC CACAAT TTCCTGGTTGTGCTGT TT CC TTGGGAGAGT C AAAG GAAGGTGACCAAGGGTGG CC CCCT CCACTCTATT CCCCAGGC CATGAAGCAGTA GG CAGGGG CCAGGAGTG GATTTTAAAGG CAAAG TTAT CAGACCCACTAGGACCATGAACTGGTAAACTCTCCTCAAG CT CC CAAG GACAGA G GATTTGGGACTTTGTTGGTTTTGG CCCACAGC CACAGAACTGAAAGTCTGAAT CTGGATT CT CT CAAAAGGACAG TAACATAAAGCTCTATGAGGCAGGA(N)xGAGTGAGAAGTTCAGATCTGGGGATCCTGGGCCATTCCACACAGTG CC CTTTAAAAGGT CTAGAGCTGGG CTCAATGTACAACTTGGTCAATAAAGAT CTCTACTGTGAAGTTG CTTTGCT T T A G ( N ) xCTCTGCTATCTATTATCACCGTGGAATAGTTGAAGTGTTGGCTTGAACCTCAGAAGGAAATAAACAG GCTCATGAGCTAGC CATATAAGTATAAT CTATATAATAATGGTTTT CATO CATGATGCAT( N ) xATGGCAGGACC AGAACAAGGACCCAAATTTTCCGGCTCTTGGCTGGAGCCTCCCCATAACCTGCATGATCCCTAGACCATGTCCCC AG CTGGATG GGGCT CC CACCAC CC CTGGGG CTGCAG CCTCTTGC CAGAAG CAGGATCTTAGC CCTCTC CAGCTT C CTTTGCAGTTGCTT CAGATT GAGT CGAAAT CTTCGGTCACTAGGACTTGAAG CTTCTCTTCTAGTTCT GAGTTCT TCTGCTTCCGGTCCTCGTTGTTTTGGCTATGGCCAGAGGCAGTAGAGAAAAGAATGAACAAAGAACAGAAAGGAC TACTGTGGAA
> H s l3 _ 685618 7 - 6866 41 6
GTTTTCTTCAACTTTATGCTCCAGATTTGAACATTTTTTCTTGGTCATTATTGTATTTTTATTTAGCTTGTCTCA CT CTTT TTATAATTTT TG GCAAACAAAT TATT TC C CAAAGCAGT TTTTTAAT CTCTAATATACC CATTATTATAG CAATATAATCTCAATT CAAAGCAG CAATGGCCCTTTTCAGTTTC TCAGAAGGTAAGAATTACATTT TTTTAGGGA AT CAAATAATTTTTAT GT TTTCTTAAAT TGAAGTATAACTTACAAACT GTAAGTG TCTACATTTTAGATGTATAA TT CAATAATTTGGGTTGTGTTTTTGTTT TT T T G T G T T T T (N ) xATTGTCATTTCCCTTAGGTAAAGATGCTACAT AATT CAGATTGTGA GC AAAC ACAG CCAC AT CT CC CC AT CACC CATA GCTGTT AAACTT AGTAAGGAAT AATTTAA TAGGTAAATCTAAAATAACTAAATTCTATCATTAAAAAAGATCTGCAGGCAGAAACAACAACAAAAAATCATTTT CTGTTGCATATCCTGTGTGGAGGATGTTGTCAACGAAAGCTGGGTCTTCATGGGCTCACACAGAAAAAATAACAG ACTC CT GTAT AAATATTTTTTT CAGTAT CAACATAGATATAAACGTTTAGTTTTTTCCAGTGATTTTG GAAATGT AAAC AT CG CGTGTGGAGAACGCGTT AGCTTTTGG AATGTGTGGC AATC TGTG ATGCG CACAGTG AAGTG AGTAC C ATGTGGTCAGGGAACTGCACGC CTCCCGCTGGGTGG CAG CGAGTGTTTTCAG GTTTTAGGTTGTGC CT CCAGTGG AGGGAGAGCTTCTGACCATGACTATCAATCTGAGGTGTTTCATCTTGTCCTCCTCCCCAGCCCCTCCCCCAGTCC CGTCTAT CAGAATC TTTTACAG CAGGAG GATGATAGGATTCACC CTACGC CTTCACCTAAGGACACAGTGTGTGA
a a g g t g t t c t g c a a g g c a g a a g c t g t c g g t a c a t t g c a g a t a g a a g g a a t t a c a g c t c a c c g g g c c t c t a a a g t t TTCTGCAGAGGCATAGAGCTGGAC TTTATTGATT TT CC GATTGTATAGGAAATTATTGTATGTT TCCTTTGCAGT
t g a g t a g a t c t a t t t t c c a g a a a g g c c a a a a t t g t t g a t t g t g t g t t t g t g t g a g t a g g g g g t g c t t a a t a a t g t g t g t a a a g g g g a t a g g t t g g t t a g g a t g c a g g c t c c t c a a a t g c a a a a c c a a a c t a a t g a g g g a g a t a a a a g c t g a t t t c a t t t c t t g t t c t c c c c t c c a t g t c t t t t g t t g t g g c a t c t g g a g t t c c t t c t c t c t t a g a c t t c t g t a c t CAAACTAGGAATTCATTCATCCCCAAGGCAACAATTTCATTTAATTTAGATCGCTTTTTTTTTGGTGTTTTTTCC TTGGGTTATATTTAGTTGCTGTCTACATATTTACGGCAAGAGAGGTCTGCCTGGCTCCTATTTTTCTAAACATGT CTTAATTATTTGTCTGCGGCCCTTTCTTATTCTTTGTTCATTCTACATTATTTGTTTTTCTCTTCCCTTCCTAGA AATTTTATTTTATGTTACATTTAGGAAGAGATTTTTATCTTATCAAACCAATCCCATTTGCTTTACACCTACAAA TTTCTCATTTGG CCTGCTATGGAATT CTTCGT TGGTTTCCAGTG CTAT T CAAAAC CTTTT CTTCTAAGAAATATA TATTTTTCTG CCATTTGTAGTTATTTGGGTGG CAGGG GAGGTAGATTGTCAGTTCACCAT CTTGAC TTAAA(N) x GACTTAAATTTCTTGTACTAGTTCTTGTTTCTTATTTGTTTCTTTAAAAAAAAAAAAAGATCTTTGTTCATTTTG TAAAATAATGTATAGTATCTCTAGAAAATTGAGAAAATATAGTCAAGCAAAATACATTCAATCAAATGGATTAAA TTTTGCAATAAGTTTCCTTGCTAAAGAAAATAATATTCACATCTTAATTTCAGTGGCACCATACTACACTGCAAC AAAAATCTCTTAGTTTCAGCTTTATTAA(N)xAAAACAAAATTTGGTACTATTGCTACTAAATTCTATTCACTAC AAAAAACAGT CATA GAATCTGCTAAAAAAAAATTATGACATTAGTC CAGT TAAGTTTGTGTTTGGATACATGTAA TCAAAAAGTTGGAGTTGTTTTGCTTCACATGGAGAGCCCTTCAGATGGGTGGCGCATGTCTGATAGGCCAGCCGC ATGGACACAGCGAGGCTTGCTGTTCTGTCCCTGACTTGTTCTTGGTGCCCATTGTCCTCATTTGTAGAACATGCC TGGCAGCTCCAGGCTGTCGTCTGTGGTCCTGTCATGAGCATGTATCAAATGAACCTATATTTTTTTCTTAATAAA TACACTTGGTAAAGTTTCACTTACAC CT TGTCGGCCACGTGACC CCTC TTAACCGCAAATGAGC CT GGAAATCAG GTTTTTTGATTTACCCCAATTCCCTGGAACAAAACCAAGATACTGTTAAGGCAAATGGAGTTTGTGGCAGGCACA TCAGTGTCTGCCACAGAGGCCAGTGTCCTTAGAGGGCTGGGCACCTACGTGGCACATAATAGTGCTGCACTACCA GTTGGATGCAATTGTCCATGCACAGGCAGTCATAATTGCATATATTGCCACACATATCTCAGTATGAACACAGAG CAGAATACACAAACACATTAGAAAGTATTTGCCTGAATAAGAATGGTACTTTTATTTCTCCATTTAGCCAGATGA TGCTTCTCTCAACAGTACTACCOTGTCTGACGCATCCCAGGATAAAGAAGGGAGTTTTGCGGTTCCCAGGAGTGA CTCTGTGGTAAGTCATCCATGTCAGCACAGTTACATGTCAAGGTACAGTTGCAAGTCAAAGCAACTAATGTAGGA AGGTAAGAAGAAGCAATTCTTTAAAACATTAAAAAATACTTGAG CTAAGATT GCAAGGAAAC CACAGCTG CTGGT AGAACTACTAACTTTGTAAAAATTAGGAAGAAAGAGGCTCAAAATAAATGAGGACTCCATGGATTCATGTCACTG CAGCCAGAAATCAAAGGATGTTTTTTATGTGTATCAATTTGGAGGAATTTGTTAACTTTGGTCTTCTCAGCTGAG CTCACTCAGAAAAATCCCAGAGGTTTGATGACGTGGCTTAACTACCGTCTTAACCCAGAAGTTCCTTTTACCCAC AGGATAAATTCAAAACCTTCATCTTGATGTTGAAGGCTCTCCACTATGCCTTAGTTCTCAGGACTCACTCATGCA AACTCTTTACCTTCAGATGAATTCATTCACTTGGTCTTAAGTAACAGAACTTGTACTTTTGTTTTGTCAAAAATT TTTGTCATTCCCCCTCCCGTCACCCCCACTGCGAATCCCCATTCATTGTCTCTGAGCAGGAGCTCTGCAGGGAGG ATATGATAGACCCTCCACAGAAGACAGTGAGAGACCCCATCAGATGAGGCAATTGTTTTCCAGTGGCTGGTTGAT GGAAGGCAGCAGTTAGCACAACAGCCACTCAGATGGAGGTGGCACATTCAAAGGTTCTGCTGGGGCAGTTTTTCC TCAACC CCTGTC CATTTCGAGAAACAAGGAGCTGATGACATTTACATG CATGACACATTC CTAGGAAATC CCTGT TGTTTAG G GA GATAATG CTGTTAGCC CCACATAAATTAACTTTG CTGTTGTGTTTTTAGATTTTTGTTTATGGTA TTGTTTTGTTGCACAGTGATCATG TAAGAG GAATAAAATAATATGAGTGACTTTTATCAT TGAT TAACTTACTAA TAACTCTGTAGGTTATGGCATAAATAGGCTGAGGGTTACCCACAATATCTGACCCACTGACTGTAAATCAGCTCT TACTTT CC CATT CAAGACCAGGCATTGG GATG CAAATGAGTGAGAGTGTCTCTGCTTTAGACGT CATC TCAAATT CAT CTCACTTATTAATGAAAACAAAGTC AGTC ATT C ACAAAGTCTGGT AACTGTTCGAATAAG G AGTG AC CTGC A GTAGAT CTTACACAGTTGTGCTATTC CCTTGTGACTG GGGACTCGTGC CCTG AGGACTTG CC CAGG AT CCTTTAT CTTCCACCACAGCTTGTGGCACCCATGTGCCCAACAGTCACCCCTGCATGGCAGCCTCAACTCTGCCTCCCTCCT CCCCTCTTCATGCGCATCTCTGAGACCTGTGAATTTCATCATGTGAATGGTTGTAAAATCTTTCTGCTCCCCTCC TGTCTTACAGTCTCT CAGTACAGAAACT CATTGTTA( N) xTCGCTTACAGCCTGCGGTGGTTCTCCAACTTTCTA GTTAACCTGGAAGCTCCTGACATGGTCAGACCCCTTGTCGGCCTCTGTAGCTGGCTTTAACACCCCCAGCATTTT GTA CAAG CAG CAGCACAGGACTGCGTGTGGTT CCATG CACA CAC CACCAGAACTTGGAGGAG GATGATTGTATGA TTGAGTTG CCAAAGGAGAGGATGGTAGG CATG CCTTCCTTTCTAGGTG TTTC CATGGCAG CT CCATGG C CACAAT TTTATTTCTGGTAAGACTGCTGGTCATATTTTTCTATTTCTTTATTCTTAGCTGCGAATTTCTTCGCATCGTTGG TTCATTCAG(N)xGGGTCTTGTGTGGCAGTTATAGAGCTGTCCAGGGTAAGTTAATCACCCAGATTTTATTTTTA TTTTCATTTTGTTTTTTTCTGAAGGCAGGTTGAATGCCAGTAATGACATAAATAAAAGTTTCAGCAGTATTGGGT CTAAAAAATATC TCACTGGCACTG GC CT TCGACTTGACTTAC CT TGGATTGTTTGTTACT CAGAGTAT CT TTTAG AAAAGGTAAACAAAATGATTATTTGCTAGAGTGGCTCACTGATGCCAAGTTATTGCTTTGGTAACTACATTTTGG GCCCTGTGCTTCTTGGTTATAAGGAACCACAGGGACCCCAGTTTGATTTCCCAGTCTCTTCTAGAAGGAGGGATA TCCGGTAATCAAATGAAGCAGGGGCCAGGGCTTCAGGGGTCCTGTCTGTGACACCTGTAAAAATAAACCTCTCCT GCCAGGCTAAATTCCCGGGGCCAACTTACTGTCACCCCCAAGACTTCAGGGGGCCCCGACAACTTGCCCTACTGG AATTTATGTTGTAAAATGGTAAAAGAAGGTAACCTAAATTAT CTTTTC CATAAGATAGCCAT GACCTAAC CCATA TCTTTC CC CCAGTCTTTAATCATTTCTT CATGTTTGAAATGC CAACTT TATGAGAAGTAAAT CCAC CAGG TTTGC AGATGT CTGT CT CTATGGTTT(N)xGTTTATGAAACCTCCAAAGCCTGATATCATTGTAGGCATAAAGCTTTGTT TAA(N)xGCGTGAGCCACCACGCCTGGCTTG(N)xTGTTTAATTTTTTAATGGATGTAATTTAAGGGTTAGATTA TTTTCTATCCCAATTTGTATGAATTTTGGTAATGTGTATTTGCTTTATCAAGATATACAGTTTATTTCAAATTAC AAGGTATGTTGTTATAAAT TTATTGTGTGTGTA CACATATACATGCATA CACATACA CAAATGT TTAT TTACATT ATAATTTTAAATTATCTGTGATGGTAAATATTACTCATTTTTGTTTCTAAAATGTGTGCGTTATATCTTAATCAG A CTCTAAATAGT TG CCCGTATCAG C (N) xAGCAGCAATATTTTTAAAGAGATAACTAACTTTATTTAATTGTGCC TATTTTTT CCTATTGCAATAATTTAT TCAAAT CTTATTTTTT TACATT CTTT CTTGGCTCTACT TGGATAATTTT TTCTCATCTCTTAGAGTATGAAGATATTTCCCTTAATTTTAGTCATTTGCATTAAAGTTñAGAATCTCTGAATAA AACTTTGGCTATTCTCATGGATATTTGGTATTTAATGTTCTCATTGTCTTAATATTTAGGTATTTAGGTTATAAT TTCAGATG TGAACAATGTTTTAATTCAAGACAT CATCAGAATAC CTTTAAAT TCATGTTGAATAGATTTT GCAAG TGTTATATCTTC TG CTGTAAAATG CATTGCATATTTGT CAGTGAATGTG G CTTC CTTAAGTTG GT C TCTGGGGAC T CTG GCACAGTGTTTT TTG CAAGTTGTGGATTTGTC CATT CTGATCTTGGTT GCAGAACAAGAAGATGAGGAGC C TTA CTGT CTTGGTTTAAT CAAGCACAGGTG GCTTTAGTACATGGAGTCATATAG CACAGAAACT CAAACCATGGG CAAAAAC CAAACAG CCAGT CAGAGAGAT CATAAACATGTT TTGAGTCAGT GAAATT TTAAGTAGTGAACAGAGCA TTGAAAATGTTTCGTATGGTCTTTATTGAATATGGAGGACATACCAGATAACTTTAGTCAAAGTCTAGGAGAAAA CTATAG CAGATAGTAC CAGGCTCCTACAATTTAAAATTTAACATAATAA C CC CT CATACT CTACTTTC CAATCCT AG CTTGGTGTAAGTAGGG CAGCAGAG CAACTTTCAG CAGAGGGAAAACAC CCGTTGTGATTCTGTCACTTCCCTT AG CAGTGAAAGC CCTGGTGAGAAACACATGTGGTTGAT CACATTAGGGAAACGGTGGGCT CAGTTATGAACAAG C TGGGGAAGGTTG CTTTAAATGGG G GTGGTCTTTATATTTAAAATACATTAAAAT CACTAGTACAAATCAAAAACG ATGCTAGTCGTAATATAATCCTTAATGAGGAACGGATGTTTTAAAATGATATATGTCAAGCATCTAGCCTTCCTG CTCAACCTACATCCCAACTCAGGTAACGTTTATTGGCTGCCACTGTTCCTTCATCTCCACCATGAATCATTCTTC TG CTGTAACACAAAGA TGTGCATTTGTT TGTCTTGCT CTG TTTTGTTT CTAACCTTTAGGATTCTAAGTCTCAAC TCTTTGACGTGGCTTTCAAAACCTTCCATAATTTGCACTGACTCTCCTGATTAATTTATTTCTTATGTTTTTCTA ATAC CATGACTTGACT CCTGGTAC CC CAGT CTCTTCAT GATC CCTGGT GCTGAAATGCTT CAGATATTTCCATG C CT GAAATTTCTG CTTC CATCT
>H sl8_41809704-41819480
TGGT CCAAGCTATG CTAAGTGGGAAGTTAC GTCTCTGG GAGAGACAAATAGAA CAAATGACT CATT TACTGGGC C TC CT GCTTTTCT TCACT CAAGGGTTC CT TAGAAAAATGTACCTGTCTGAGTGGAAG TGACTG GACATGTGACAGT TGAATAAAGCACAGAG CATGAATGGAACACTACACC C CAC CAGGCCCGGC CTGTTGCCTG CAG GAGGAG CTGACT TGGAGCAGGGTT CTTTGG CAGCAGGG CAATGCTAGAGCACAGGAAACT TGGCTTCCTCCCAG CACATC CCTGGCA GAAAACTCACTGTGAGAAAAGAGGGGAGGAATTCAAAGCTGAATCCAAATCAAGTTGCCAGAGCCCTTCGGCTAA GATGTG GGAGGT C CTGGGAAAGAAAC CAGG GACTTTAATATCAAAAGGAAAATT G G CTAGAAGAAATGACAGAG C CATTGGAAATGAAACTGAAGGAATCTCAGCTTGGGACCAATCTACTGGCAGGGACCACCCTTCCCACTGTGGCTT GCTGACATCAAACATGGAGACAATGAAAGGAGTGGACCATCACAAAACCACACATTTAACAAATTGTGCCAGGAA CAGTAGATAAATAGTCTATAATTACTTTGC CGTTTGAAATTATATTTCTATAGG CC CATC CTAAGT TAATATAAT AAATAT TTCTCTAAATACTTCATGAATGTCTCAGTTACAT C CTAGGACAAATTAT CTTCCCCAG GTAACTGCTT C AGGCAATTCACACACCATGATCTGTTCTTTTCCACACTGTTTCTTCATCTCGTCTCACCTTGATTCCCCACTTAC CTCAATGCTACTATTAAAATCAAATGTTAAGCCTTTGAAGAAGTCTTCCTCACTTATATCAGTGAACTTTAATGC AGAAATC(N ) xGAAATTGTTCATTTCCTCCAAGTAATGCAATGCCGAAATCCTAGGGAATTCTTTCACCCATTGA CCTACAAATAGTGAGTTA CTTTTTTGGATCTGACATTG CACT CATAAG TTTTATAC CCTATTTGG C A T T T { N ) x T ATTTTT CTAAGTATAT CCTGTGAATAGACT CCCCAG CTAGACATTGAAATAGAAGCTGTAGACT TT CTTTTTCTT TATACTTCAAAATGTACAGCACCTAGAAGATGCTTGATGTGGACTTATTTTATATTATTATCAGCTAATTGGGGT AGTC TGGGAGAATGGATCAACAAGTATT T T T A ( N ) xGCTCTCTCAAGGGATATGACATAAATCCCAGGGGTACCC CAAGTGGTAAG GATTCT C CTTCCATTAAAAGTCTCT TT CCAGAATATTTGATGTTC TCTAAAGAGATACATAAAT GAGAGATAGATAGATAA CTAGCCGTATAAAAAGGAATATC TATTTGCTGATG CTAATTACTC TTT C CT CAAATT C CTACCACTTTAC CCATATTCTATC TGTATTTCTTTGAATTATTCGGTGAATGTTTG TTTTATTTCTTT CTTTAG C AAGCTTTCTGAAAACCAG CTTTTCTTTG CTTTCCAT CATCTCTTCCCCAC CTGT CATGATATTAG G CT CAATACA TGCT GGTTTACTTAT CTTGAAATACTTCTTAGAGGG GACATC CAGATTGCATTCATGAGCTTACTT CAA CATGCT TG CTTTAAATCACCAATAT CTACCA C CACGTTAAGAAACTTACGTGTTATGCGG CCAGAGCCAT CTGAT CCATG G TGCCTTCAACTTTTGTTAGACTCTGGATTTCTCAATATAATAACTAGAaGTAGTGTCACAAAGTCCTTACGTTAC TCTCTATAACTATT CATTTCATTTTAATGTATGTTATG TTTGTTAAAT TTTT GAAGGTATAAGCTTTAGATGGGT CTGCAATTATTACAGATTTTTCTTGCATAGGATGAGGAGAGTTTCCTATGCATTGTCAAAGCCAAAAAATGGAGT ATTC C C TGTGCCAATAAACACATTTCTGGTAAATAATAATGAAAAAATATGACATTGATCTT CGGC CACAATACT GAA CTAAATGAAATAT GT CAGGG GT C CATAAATCTTGAAC CTAGGAAGAT CTTTTGTTCT TCAGAGAAATGTAAG GTGGCACCCACAATACGAAGGCAGGAAGTGAAGGCTGACAAGGAAGGAACCAGAGATGCTGAGACACCATACCAG CCT CTGGCCCAAAGGTTCAGTTAC CACAGGATTAAAACATGAGCTTTGAAGCTCGG CTGCCTAT GGGTAATTGT G GT CTTT CTTGTATTATTTGGTATT CAGT CAAGTTG TATTTATGTTTTTT CTC CTTT CTAACAGT GACACTCACCT TAACTGTACTAGGAACACAGTACGGGGTGCATGTCTCTGCTGGGCCTGAAGTTTTAAAGCAAAACAGTCTTCCTG AAGAATGTTCGAGACAGTGGCAGCAGAAAT CACAAT GAG G GGACCCTAAAGAGAAAGTAGAACCAAAAGAGAAG G CT TGTTGCTGTTTACT CAGACCCT CCTG CATACCTAGACTGCTCCTTGAT CAAATC TTGGAGAAGAATTGTGTAT GTTTGG TGAGGGAAACAC CAAACCATTTATGGAACCTT CTGCTGCCTT TTAAACAAAGTCAGACGAAGTATAATA TAAAACTGATAATAGAAGCATAGCATGCTCTTAAAGTTTTATAATTCAGTTTTGTGTGTATGTAAACCTTGTGGG GGTGGTTTGCTGTAGCAGGCTCTCAG CTTGCCTCCCTCCCTT CTGGCACT CTCTTCTACTCTTT CAGCAAACAGT TGATGCAGTCAT CT CCAT TAACAAGG CCTTTGTCATGT CATAAACTGC CAAAG T CATAAACTTTTATT TGGCTTT GT TT TCTACCACTT CACTTTTAAAAT CTTTTAAGAGACAC CTTGGGTCTTT CCTTTCTTC CCATTTAT CAGTTG C TT CC CAGCTCCTTATT CTTGCTTC TATTTTTGTAATTACATATGAATG CCTTCTCTCCTCTACTCTGG CTTTTTG AACTTTATGCAT CAA CAAAAACTG CAAAAATTTACACTTTTTTGAAGATG CTGATGACTC TACAGT CTTCCCTTC TCAGTACCATTCTTATAATCATGGATGCTAAAGCACTGGTGTCATAATATCATTATGTCAGGTTGAATTCTTTAA TTACAAGAAGATTGATGGCTGCCCCAAGAAGAGGCTTTGATTTATGTGCCTTTTGTATATTCCTCAGACCGTACC AAAGCATTAATATACAGCAGAGGTCCTCTCAGTGTTCTCTGAAATAATCTGCTTTTCTCCATCTCCAGGAAGGCA ATGGGCTATTAAGATTATGTGTGTTAGACCTAACTCTGACATGGATCTAAAAGCATTTGTTTGATGAGGAAAGAT CAGT CAGO CTGAGC CCTC ACATCCACTGGATC CAAATCTGTAAT TTTTAG'i'G AC ACAG AC TT CTGAATATATTAC AGACTAATGAAGCCCTTGATTTCCTAAATTTTCAAAAAGCTGAATGCTTCATGCATCGGTATAGAGCAAAAATTT CCAT GC CATT CCAGAT CñTTGGGAGTAAGG CTAATATGAG CTTCTAGCATTCCACACCTTGTATTCTT GT GTCTG TATAATCTTTATTGGGCAAAGATTTATCTCCAATGAGCAGGTCTACAAAACCTTGCACAGACTGTTCTTTTATTT TTAATT GTAGTAAAGAAT C CAGAGCTTGAGAG CC C CTG GTATAGTTCAATATCATC CCAT TATT C CAAATACTTC AGAAAC CT CAAAA C CT CTGATATTCTCAGAAGAATAATTGAC CAAGAGACAAAT GAAGGCAG GCTTTAACTACCT GTGGTAAGAAAT GAOATCAATGATAACAAAACAACT GAAGGTTGTTATAGACATAATT CAGC CT CT CCTGGTAAA CTAAAAAGAGAACTGCATTTCCTCTGAAAAGAAGAAGTTCTACCAGTGTCACAGGGAAGTCCTCG(N)xGATTCT CTCATCT CATTTGTAAAACAAAGAAACT TGAC CCAAATGTACTT TGTTTTTCTTTTAATT CTA C CATT CTTTATT TCTATCATATGC TAATTATTCTGCTTTC TTCCACCCACTGCCAC CAGATGCATT TTAAAAGTTATTTTTTGTTCT GCTGCTTTGTGTTCCAAGAGACAAA(N) xTCATATATGATTAGATAACATTAATAATACTTTTAAAGGAAAAACT ATCCAAGTATTATGTTATAAAATATTTGAATGGAATAGATAATTGAGAGTCTCATATTATTTCCCTAATAATGAC TGAATCAGTGAACTATACAGGAGTCAAGAGACAGTCCTTTTTCTTTCTTTCCCTGGCATTCAGATAGGCCTCTGC TAGTAAGGAATTGGTGGG CAGGGGGTGGGGGT GGAGTAGCAAATATAATGTAAGATATTC C CATAAAATAAGTGG CATT TGATAAACAATACTATTTGCTAATACTT CTACATAAAGTG GTGTGTTAGTTTTCTAC CAC TGATGTAAAAA ATTACCACAAACTTGGCAA(N)xGTTTGTTTTTGTTTTTTCTGTCTTCAGAGGAGACATT{ N ) xTACACATAAGT GACCTTTTAACC TGTTAT CTGAGTTTTC TTCTCTCTTGAT TAATGTAAATCTAGTT TGAAGATT TAGACAAAGAA GGGTCAAAATTGAACATATCGTAGAAGTATAAAAATGAATGGAACTAGAGTACGTGAAACTATGTCAACAGCTAA AACATACACT CCAATCAAGCA CATAAAAGAAG CCACTGTGAAAACTTCAGGATAAAG GTTATAT GCAATG GTTTT CACCAAGCGTAATGACTAGAATTATAGTGTTAAGATTAGTTGGAAAAATTTAATGGCCATATATATATATAAGCT TCTCCTATATTCAGCTGG CTGGAACCTCAG GACAATAG CACTGGAAATAAAAGAAAGAATAAAAAAGACACGTTT CTTTAAATTT CTTATG TTTATTTATTAAAAAT CTTCTGTG CAAATTGGGATGTAAG TGTATTT CTGTATAT CCCA CATTAACTACTG CT CT CACTAATTATTTGGTAAATTAG TTGT CT CCCACACCTG GCTAAT CAT CAAAATCACCCA AGCAGCTT CATGAG CT CTTGAG( N ) xAATCAGCTTAAAAGTGGGTTTTTCCCAGAGTCTCCTAAAGAGATCCCAA ACAGTCACAACCTTTATTTTAGTCTTGTGAAGTCTGTCACAGAGAACTCAA(N) xGAC CTGTGCTTCACTAAGAA TTTATATTTTATGTTAGGATATGAAAATGC C CAACAAATTTTTTAAATTTAGTTAT CATC TTTGATTCAGTAGAC CACTCGCTCTGATATGAGCTGCCAATCAATGAGAAAAATTTAGTCTCCATACAACAGCTGAATATAGTACTGGAT GTAAACATGGGGAAGACTAGATATTTTAGCTTTAAGAATATTATTTTTGGTTAAGAAGAGAAAACTCAAGTACAT AAAAAATTATCAAGTATTATCTGACAATATAAAAATATTGGGAAAAACTTCTATGTAAGTGTTGAGATAGGTTGG CTCTTAAATGAGGTATAGATGTCACACAGGCAATGGAAAGGGAGGATACCGTGTTAGAGAAAAAAAGCACAAACA TAAATCAGAG CTGAATGTAGCCTTCAGTATGT CGTCTCACCT CCAATGGAAGTGAAAGTATCTTTTGATT CATAT CCTAGAGAAATTAT CC CATCAAGATCTCTT CTGGAAAATC CCTGGCAGATTATCAAACTAGAAATATG CTTTTTC TAATAT TTTCAACTTGTGGCTGTTTAGAAAAATGAGTG TGTGGGCACATGTGTGTG TTGATGTGAGATTATGTGT TCAAGATTATGCTAGTCTCACTGGGAAGGTGGATTGGCATGGTAAGAAAGCCCTAGGGAAATGGATGTGGTGTTT CAGCAG CTATTTATTATTAGTGATAAAACATTGTATGAAAAC CC CTTCCTGCCTTTTTAAAT CAGAC CAAAAGAA AACAAAGT CCATATACAAGAAGGAATTAATTTTT CAAAGCTAATTCTTAAATACATAATTTTGAA C CTGAGAGGA TGGTGGGGAGGT CGAACATTTTAAATGT CATCT CAT TTTACTATTAACAGGAG GGAAGTTTTAGAG CT TAGGAGT CATGTTTT TAAG CATT CACAGATATAGAAAAAAATATATATAAC TTTAAAAAGACT TTGCTGTT TGAATATTGAA TCGTTTTGGTGCCTCACTGCTGCTTTAAAAATTCCATGTGGGCTGTATACTGCTAAGAGATTAGTCACTTCAGTG GAGAAT CTTAGAACTT C CTGTTCAGGGCAC CTGGGGAT GAGAAAGGAGACCATGGTAGAT GATACT TTTCTTATC TGACTAATAAGAGTGGGATCAGAGGAATGCTGGAAAAATATTTGAAACTGTAGACCATGGGTAGAGAAAAGACTT GAAGGAAAATTATCTTAGGACTTCAATACTCCAAAAAATCATGATCAACTTCATATGTACAGTACATTTTCAGTT CATC CG GTTC CCTGTGACTTCAAAGACATC C CTCAAGT CTAAGCTAC CATCATCAATTTATTACTTGCTTAGACT GACATGATACTGGT CC CTGAATTCATTC CATC CT CCTT CAAG CCATTTCCTAATAG CAGT CAGGAATATTACTGA CCCCACTTCC CATCTAGATTACGTCTTCTAGTGTATTC TTGGAAGTACCTATAT CTTTCCTTTACCTCTC TTATC ACAGTTTTTAATTATACATTCATCTGAGTCATTATCTAATATTTGCCAAAAA{ N) xCATAAATTTAAAAAATGAG GGCAGGAATGAACAAAAACAAAAGAAGTGGGCTGTCATCTATTAATGAAAATATAGTCTAATTGAGGGCCGTAGG AGTGGACACTGTCCTGGTGTAGTGCTCAACTGAAAATTAAGAATGGAGAAAAATCCATTGAAAAATGTGGATGGT GAAACATTAAAAGAAAAACCAGGGATATGACT GGGCTGTTAGTGTTAATCAATTG G CT CT CATG CC CACCTTTTA TCTATAAAATCACCAAATTCACTCTGAGAAATATGTAAACCAATGCA
> H s l9 _ 3484000 6 - 34851 07 9
TTGTTCTCCATGTTTTCAGGGCCTGACCTCGTTGCTGCTGTCAAGCAGAGAAGGTATGAAGGGATTTCTAAACCA AGGTGGGGAGAGAAGGG CATGTGAATGCAACC CATGGCTCACTC C CTGGGAAAG CCAGGAG CCCAGTCAC CTCCA GTCTCCCTGTGTGAGAGGAGGGGCTCTTCCAGCAATGCCTGGCTGAAGCCTCTGAAGAGCACCTGCAGGAGGCGG TTTCCACAAGGAAGAGCCGGCCTCATCTTGTGCAGATCAATTTGGGAGACGGGGTGGCCGGTGCTGGTCTGTGCC TTGCAGGAGGCACCCATTTCT(N)xCTGAATGGCGTTTCAAAGTTGGGAAAAGTAGTAATTAGCTTTCAAAAGTA GTCACAAAAATTAAAATGGGCTCTGAACTGAGAGGAGAGCTTGGTTAGAAATAAGTAGAGAAACTGCTGTGGATA TGAGAAAG CAAGGT CAGGGTGGCAGACTAAGGA C CATGGG CAGGAGTAGAGGGCACGCAGGG CTCCCAGGCACTC GAAGCCATGCTGTAGAGCAGAGTGGGAGGCCGGGTTACTTGGTCCCTGCTGGACAGGCCAGGCTCTGTCAGATAG GCAGTGG GGAGCGG GTAGGTGAAGACAGAATGGTA CTTG CA CAGAATAGTAGTGG GGTGGGGGGTGGCATTGACC AATCAC GAGCACACATT CAGATGCTTTCTGTC CACACTTGTAGT CCCAGCTA { N) xGAGGAGCAAGTTTGAACCG GGGGTGGAGTTTGAGACTTTTCTTTCAGACATATGACTGATTCGACTTCTAAGAAGTCACATCTGGCACCATCTC TGCAGAGACTGT TGTCTG CATG CAGGAGTGCTGG CT CT GATCTGGCCACATC CT GC CTTTTTCATT CAACCTTGT AACAGACTTT CATGTGAT TAGTAAAACGAATGGGCTTTAAGT CACATATC CTTAGTGCTTGGAAA CAG TA ÍN )xG GAAACAGTAT CT CTTAAATAAATTAACAAAAAGC C(N)xAGCCAAAGAAGGTGGACTCAGCAGCTGAGTAAACCA GTGC CTGT CTTTGCCTGC CACCTC CACC CCCAGCTC CT C CAC CTGCACTGGCAG TGCAGGAGCGCCGT CATT CAC TGCTGGCTCCCCTTAGCCGCCTTTATGCCATTGACCCAGGCAGGCCTCCTCCCCTCCTAGAACCAGGGCCTGGAG ATGC TGGAGC CTAGCACTAGTGTGTACC CTACAGGTGGGCTGGGCAGGAAGC CAAGCAATCACTGCCCTGAGCTC TTGGTT CT TG CAGGAAA CACAG CAGT GGAGAGCAAGACAC CAGCACGCTG CC CT CACCACCTCT CCTCAC CACGG TGGAGGATGTGAACCAGGTATTCAGGCAGGCTCTGTGGGCACAGACTTGGGCTAGCTTCCACAGCTGCAGTTTTC AGAGGG CCTCAAAGTAATGAGGGATCTT CTTTGTGGTGATTAATTTTGTGTAAATGATTAGCCACCT CAG CTGAC TCAG AG CC AC AGGAAGCT CT ACG G ( N) xGG ACAGGGGTGTGGGAGTGAGTGG GG CT CT CACCCGG CTT CCTAGG A GGGAATAAGGGCCAGGCATCTGGGGAGGTGGGAAATGATGATACATTGCCCGAAATGCCACTAAGAACTCTCTTT TGTTTCTCAGGATAACAAAACCAAAACGTGGCCACC CAAAGCACCCTGGCAG CACC CTTCCCCG CTTC CCAG CAC GCTG CC CAGC CC CAGCG CAC CACT CTATGCAGTCAC CAGC CCTGGCAGCCAGTG GAACGACACCATGCAGATGCT GCAG TC CC CAGTGTGGG C CG CAAC CAACGACTG CAGTGCCGCTGCCTTCT CCTATGTG CAGACC CCAC CC CAGC C CCCACCCCCACCAGCACACAAGGCAGCACCCAAGGGCTTCAAGGCCTTCCCTGGGAAGGGTGAGCGCAGGCCAGC CTATCTGCCCCAGTACTGACCCCAG(N)xCAGAGCTGTGGGGATGAGTGTCCCCACCCCAGGGCCACTTAGCTGA CACCAGCCC CTCAGAGGAC CAGTGCGCCCCATCC CAGGGAGGGTTCCTTG GGGACAAGG GTGGT TGGCAGCTCCA AGC CTTTAAACC TGGCTT CTGAAACGATGGCAT CAGAG CC CTGGAGAGCCAG CT GGAGACACAGGCGT CT GG CCT TCAGGGGCTT( N) xAGCTCTTGGGTTTTTAGTCCTGCTGTGTGTTGGGAAGACATCAGGCCTGAAAGCTGAGGGC ATAACTGACCATTTTTTGGAAAC C CT CT CCTCCCT C CT C CA CACCTTGAGTGATGACCACACCAAT CACTGTATT TTATAGCTTTTTTTTTCATGTAGGTTTTTAGTTAAAACATCTCCTGCCTAAAATGCATTGAATATTTTAAGATAA CAGATATACTGGCTGGAGGTTTGTTTAACACTATCTATATTTAAGCTTATACAAAATGGGCAAAATATAGAATAT TTGTGATTGGAAGCAGTCACCTGGGGTTTCTGGGGGTGGACAGTTCCTCGCCACCCAGCAGCACCCTGGGACTGC GGCCTTTCCCAG CTTTAT TGAAGCAGAA TGGTGGAACT TGTG CCGGAGGTACAC TG CT GAAGGTGCGT GG CGGTG GACCAG CCAG CTGCTGTC CATGTG CAGAGCAAGG CTGCAC CTGCTGCCCT TCGAT C CTTCCACACATGGC CAGGA CACTGC CACAAT CCTCGGGGTGTG GT CAAGGGGCA CTCAGAGACACCTGCA CTAGAAATTGCAT TGACATTGTGA GCTGGC T CAGAAGACAAACCAATTAAGATGTAGATAGAAATTAATTTAAGGT CT TTTCTTAAAAAAAAAAT C CAC CTCATTTT CAGT TAACATGTGC CATTAAATAGATAACAT C CTGTGGATTTAGG GATATTTTCCAGC CAGAAT GGA TCCAGAAGAATTGAATGGTG CTTAAATTGAGAAATAATAATAAATATAT CTATATAGAATAGACATAT CC CACTG TATATTAATTGAGGTTA CAGAAAGTT CTTTATATAAAACTTATTTAAATTTTTCATATTTCATC TTTGAAAAAGT CTGAGAAAAATCCATAATATTTTCTGGTATGAAAGTTTGACA( N) xTTAAATCATTAAACCATTTTAATATATGT TAACTACTAATAAATGGTTTATTCTTTCTAACTCCATATAAGCTTTTCCAGCAAAGATTGTAATAACACATTTAT GTTCTCGTTTTTCTACAGATATAAGTAAATTTATATATAAAAATACCAAAAAGA( N} xAACACACACACTAATTT GAGAGGACCCGTAGGGTGTCTGAGCCCAGCCACCTGAGTTTTTAGTACTGTGTTGTCAGGCTCTTCCCAGGCCTC AGGTGTTGTCTTTTGTGCTGTGTG GGGATGCATTGC TG CCTGTATTTATGAT CTTT TG CCGTGGTT CTGAG CATT CAC C TCAC CATGTTTACAAAGAAC TGTTTTGTATATAGACATTTTCAGGCACGTG C TTTGCAC CAAC CCTGCGTG GCTCTTGT CTGTGTTAGCTGTCACGGTGTGCACACTAATC TCTGTTAAAGTT GT CTATG GCTGT TCTACTTGTAA GATAGTTTTCTATTTCCT TCAGTAATGTGTCCA CAGTACC CTGTATTTCGAGTT C CATTATACT GAAGTACT CAT GTTTTAATAGTGCCTCTCCAAAGGCCTCACCTTGGACAGAGGTCAATCCTTGATGCTCCAGCACAGGTGACGTCA CTAATTGTCACTTTCCAGTTTGTTTTTCTCTATTAAGGAAGACATTTTCTAATTGCATCTCCATGGGCTGTGAGA CTGTGTGAAGCCGTTTGTGTGGTCTCCATGTAGGTGCTGTGTTCCCGGCACCGCCTTGCTCTGAACACTGGTAAT TCCAGG TG CTGCGCTTGG CAGAGGGGTCTCGCCAAAGC GCATGTGTGTGCAT GT GTGAACGTGTGTGT CCTTTG C ATGGTT G G GCGTGGGTGC CTTGCTAGGG CATCAG CAG GACATTGTGTGTATAGTTACAATGCTT CCAAACTG GAA CTCTA CAT TTTGTATCTTTTAAAG CT CCTATAAGTAAAATAACTATTGGCTT TATTAAAAATATACAT TTAATAA TACTTTGTAT CC TTGTTTAAAATGAACTGCAGAG GACATTAT GTCAAAAG CCCTGCCT GACTCATTGCTT TGAAA AGCTGCATACTGAGTTGGTTTTTAGGTCAGCCTGTCCTCTGATTCTAGCAAGGGGAAGACCACTTGGGATATTAC GGTGAG CTGAAGTGGATGGG CAGGAGG GAGAACAAG CAGCTACGCCTCCC CTAT TGGCTTCCCTAAGG CC CAAG C TGGTTTAGGCCCCTGTGCAGGGGGAGCCAGTGCAGTAGTGAAGGAGACGTCCTCTGCAAGGGTGCAAGCTCTCTG GGTTGGGGTCTACCAACA CC CATGTT CCACCAAATGTGGATG CTGTGACTTGCC CAGC CAGCTTGT CCAATGACG AGAC CCAAGAATGAGGG CTTGTTTTACC CTCTTC CCAAAC CA GGGACCCAAT CTATGG CAGAGATACTGTGC CTT ACAGGGATAAGAAGGACCTGTCTCT(N) xCAAAAACAGGACCTGTCTCACTGGATGATGCCATCGTGTTAAGCAG AAGAAAGATAGTATCATGAGAC CCACAAATGGAGGG CAGG TCACAGGAG CATTT CAGCTCATTT TCATTTACAGA AAGACAGCACCT CAGTGGAACCTTTTTTGACGAC CT GAAGAAAATGGAGTAATGTGGG CATTAATATATAAG CTG CTGGTGTAGGAACCACTCTTCTTTCTGCCAAGGTGCCCTGAAGTTGAGCATGGGGACCAAGTCCCAGAGCCTTCC AAATGCAC CT CAGGACAAAGTTG C TCGGTAACGG C CATGT TG CCTGAGAGTG CT CAAGAATACACTGGAAGT TCA TATTAAAAAGCTAGCAGG(N ) xGCCCATATTTTATTCCAGAGTGTTCCCATTTAACACAAGCACCCAGTTCTCTT GACTAGTTCTGATTCACAGCAGTGCTGGAGGCCAGAGGGCAGAAGTCCCGCAGCTCTGGCTGTGTGCAAATGAGT GGGGAGGGAAGCTGAGCAGATGCTACACCTGAGCTACTACTGCAAGGGCTTCACAGGCCTTGGCCCATCTGAGAA ATGG GGGAGGAT GCGCAGG CAT CTTGAGGTACTTGG GAGGCTTGTGACCTTCCTCTCC CACCTGGGGG CT CCTTG GATAGGG GGCTT CCAGTTTGGAGG CTGAGATGCTGCATGC CAACCATGCGGCTGTGAG GGGATGGC CTTTTGTGC ACTGTGTGGC CACTGCTGTGAC CTGC CT CCTGTTGTGGAGACAGGAAATAAAC CTT CC CAATGAT CAGCTTCTCT ACTGCC CAAAAG CATGTATGTG GACAGCATGACTGGCTAC CT CCCAGCGACAGG TACC CAGGAG TCTGAG GCAC C TCCAAG GGTAGACT CT G GAAATAACACATG CACCTAGC CTTACAGCACAGGGGTGAGCTC CTG GGC CCAGGCCTG TGTGTGT CAGTGAG GGGACTGGATGTGG CTTGTACT GGAATATT GACTTGGAGATG TGGG CCTAGGTGAGGGAGA AGGATT GTTT CAGG CGTGT CACATTCAT TT CAA CATTCACGAATGTCATATAAGGG CTAGGC CCTTGCTAGGAGC CAGGACTGCAAAGATACAGTCCCTGCCTTTAAAACTCAGTATTTTAGGGGAGACAACCCATCTTTCCCATCCTTG ATAAGAGAACCACCATCCACCAGCAACCCAGTTGCTCAGGCTGAAATCCCAGAAGATATTCTTCGTT(N)xTGCC GTGTGTCCTC CAA CTTGC CTGTTGCTCC CAAGGCTG CTTGGGCCT CTCCATCG GTTTTCTGG GGTCAGAG CCCAT C T T C < N) xTGACCACCACAGTGCAACTTGAGACAGATGAAAGGACGAGTTGCTGGGGCCAGAACTTACGAATTCT AATACTGAGCAAAAAGACAGAACTAGAG(N)xCAACCAGACGAAAACATTCTCGCGTTGGGCTAAAAGAATACTA TTAAATACAT CATCTGGG CTGTTCCAG GACTG GGAGGGGATTAG CACTACGGTGGAGCTG CGGATAGG CC CAAGC AGGC CT C AAATAAATGTAAAGCTTTTTATT CT CGTGGAAACAGC TATGTACAATGGGGAAAATGTT TTTAAGCAT CTAG CAAAGGATTCACGGGGCTCTGGTAATGAAG CC CCGAAACGACAAGACTGGAT CG CG CAGGGTAG GG CCAAC ACCAGGGACAAACCCCCGGCCTCTTGGGGGCGCGGTGAGTAGGTGGCCTCTCCAAGCACCACTCCCGATGTGCGC ATGAGCGCAG CCGC CC CTACGCAGCGCGTG CGCACGTGCACT CACCACGTCCAT CC CAGACGTG CGGACC CGGGT GTCTGCAAGGTTCAGTCTCCACACCCCAGCGCCCGACCCTGCGCGGGGACATGCGCACAAGCGCGCGTCCTGACC ACCCGGACGTGCTGGCCCACACGCACACGCGTGCGCATTACCCCCGCCCCATCCGCGCCTGCGCTCAACCCCGCC TACACCTGCTCCGTGGCCTCCCCGGAGGCGATGAGCCAACCCCGGTAGCTCCAGAGGCGTGGTCCCCTCGCCTTC GCCGGTCAACATGACTCAGCATCCTCGGTGGGCTCGCCTCCTCCCCCGGAACAGCTGTCTAAAATCCATGGGCGT GGAAACGCCACGCCACGCCCCGCTCCCGACTTCTTCAGCTCAGGCCCCAGAACTGACCACCCCACCAGTCCTCTG CCCCAGTCCCTCTGCCTTCACTCCCATGGCCACCTGTGCTCCAGGGCCTGCCTGACCCCTGTCCAGCGGCCCTGA AGTAAGGT CTGT GATGTCTAAATGCCGA( N) xCTCTTGTTTGGAACTGTGCCCGCCACAGCCTGCAGTAAGTGCT AGCTTCAT CAATACTCAGTGG CAGTCAATTGCTTTT C CAGGG CTGCCTTATGCTTGAAATTAAAT CTTTG CTATC CAGAGTTAGGACTGTC CC CGGACTTCCTAACT TGCCTCTGCTGT CAGGTGTACC CTCTGGTCCGCTC CTCTCTCA GGGCCACATCATCTTG CTGGGGGCACGTGAAGGTGT CCGG CT CTGCAGCATGTT CT CTTG GAAT TT CT GG GCTAG GTTT CCTTTCGATT CT CCTTGGTCAGTC CCTGGCCTG GGGGCAT
> H s 20 _ 314 281 '7 9 - 31440225
C AAAACAAGTTTTAGACT TTTAGACATTTT CC CACTAAT C CTTGTAATTGGTTGGTTT GCTTGCTTT CAGATGAT TCTTTTTGTTAATTCTATTTTTTAA(N) xTCCATTGAAAAAATCTGGGATGCTCTGTGATTCCAGTGAGTTTGGG ATTG CATC CTTT TACT TTTCTAAAAGCCAAGT CCTGAAAC TTGAGGGAATTGATTT CTAATG CAGAAC CT CTCAT GACC CACACT CAGACTTCTAAGAGATCAGGACAGTGAGAACCTG CCCAAAAATCACATTT CACTTTTTTATTTTT T A T T T T C T ( N) xCACATTTCACTTTTTCGTTCTTTCATTTGTGTGTGCCCCGTGGTCTTAGGTGCCTTGGGAGTG TCCAGGTGACTGGGTCACCGTCCTCATCCTTGGAGTGGGTGTGGCTGACTTGAGCACCCGCTGGGGCTGTTACTT TGAAGTACAGAAG C CT CG GAGAACAAGGAGAGACTT CCTGTAACTATTTCAAGACATGGATTTCAAACACAGGAA GCAGTTTCAGAAAGAAATACGAGAGAGTAAAGAGTGGTTAATGTTGAGGCTGGGTAGCAGCGACTTGATCTGTGC TCTCCGAGAGAGCGTGCCAGGTTCTGCAGTGTGGCTGCTCTCTGCTCAGCTCTTTTTTTTTTTTAATTA(N)xTG GAGCCCAGGCTTGGGGTTCCATAGATTGAGGGAGAAGTAGGAGTGTCTCTGTCAGTTAACTTCTATC(N)xTTGC CAATACCACTGGGATAAATGTCAGTGGATAAAAGACTGTGATGATTATGGTTCTTGTTCATGGTCTTGATAGAGT AGAATAATGTTC TTTAAAAGTTCACAG GAACCAGTTTTTTTTTT TGGCTTGTGT GAAATAATATGCGTAAATGGT CATAA CAAAAGTAG CCAACATTTTATTTATTTA(N ) xTAGATCAGTCAGTCCCTGCTGAATGGGGAATTGAGGAA GAG GTTAT TTGATTTAT CTAAGGTATACAGATGTTT CTTT TC CTTGCACCCACCTT TG GAAACTGTAAGTACTCA CCAGATAATGAC TCTGG G CAAATTGCTG CTAAGCGTTGATGAGAGTACCAAGGCATGG GAGATCTGT C CTAGACC TAGACAGGTTTGATACACTTGGAGAAGATGGACTTGGTGC CTGC CTGTGTTCAAGT CC CAGATGAGGGTTACGGA AGGCAAGAACCACAACTCTTATGGGGGAGAGTGTGATTGGGGTCTGAGGAGGCTTCCTTCCCTGAGAGGGTGACC AGAATTAG CTGCTT TGGATGGTCGGTGTTGTAGAAC TT CAACAGTCGAGGTTGGGAGGGATTGTGGTC TTGAAGA GGCTTATAACTC TGTT GCATGTCCATCG GTGCTG GGGAACATTGAGAGGCTAAGTTTG CAGAGG CTGAAGGTCAC TAAC CAG GTGCTTTGT TGATCCTGGAAG GCTGGGAT CAAATG CT CATACCCAGCTG GTTCAACC CTGACATAGAA GTACTGTGTTTGGCCT GCACAGAGT(N) xAAATATAAAGGATTTGTCTCCAGGTTGTCAAAGTCCTTCCCTTACC TGTTACCACCTCTCTTGGCTCATTTACATTATCTGGCCCCTCCTGAAAGTTTTTGCATTTGCAGCCCTTTGTAAA TATGGAAAATGGAGCCACGTGGGCTTGTCAGGCTGTGGGAAAAACAGTGTATCTGCACACACTACCTTGTGGTAT TCGTATCCTTGGGAGATCTGCAAAGATTCTACATTCCTAAGTCGTCCTGGAGAAATTTACTTTTAGTTGATTTTA GTCCAAAG( N) xGCCTGAAACACTGACTTGGGGCTTCTTAGGTTATAACTATTGCTTTACCTCAAGATAAGGTTC TAAGAGTGACCTTAATTACTAGCTACAGG(N)xAAGCCACAGATTTTTAATAACCTCATGAATACAGACTTGAAA GGAGTG CCACGTGAAAG C CAGTGTCTTAGTATATATTTTTTTAG TGATTTAGTGTTTGTTAT GT GTGTATAGTTA TGAGCAATAT CATGGACAGATGCAACTT TTGG CCTTTATAGGTG GGACCCAGAAGG CTGGGTAT CTGGTTTGGCC TTGATAATTTGTTGGACAGAGAAGGTTTTAAAGGGTTTGAGAGGGTAGCATTAGTGAAACTAGTGAAACTAGGTG GATACCAAGCTAGCATTCC(N) xAGCATTCCTTTTCATAAGCTCTTTCGACTTCCTTTCTTCCTTGGTCATTAAT GCCACCAGGCATGTGATTAGAAGGCCCCTTCCCTGCCTTTTGGATTGTGGAGCTGTAGCTCTGAGTCTGCTTCCC GCTGGGGCTGGTCGGCTC TTGGCCTGTGTATC CC CTGCTCTTTGGGGCTGTTGGTTTCAGGAAAG G CAGCAAACT GCAG CAAAAG CAGAGGTGGGGACAGTGAGG GARAAT GGAT GATT TAGCTTTGCC GG CCAC CAGG CAGTAG CTGTA
C(N)xCCTCTAGTACTTTATTGAAACTTTCAATTATTTTTTATTTCTGCGGCTGGTAAAGGAATAGTGAAAGGCT GTCTAGAACTGTGATTCTGGTTTGGTCTAACAAGACTTTGATGATGAACTTTGAAATTCGTTTTGATAAAATTGT CCTGGTGGTGTCAC CTGATCCTCATCTCTGGATG CTTTTTGACTGCAGGTTATAAAGAGG GTCCTTTCTC CTGGG ATGAGACTTGATCTCCTCTGTTGTGTTTGGTGTCTTCATCAGGCTGTGGGGAAGCTAGATAAGCTGTTTATTTCT GTTTTTAAATT (N)xATGCTTCAGTTTTTGTGGCTGTGAGTT( N ) xCTTCTGCCGAGGTCGTTCTCAGTGGCTGT ACCTTTGATTTTGTTGCTTTACTTTCCATGCTGCTCTGTGGCCCTGACCTGTAAATACAGGTTTTTGCAACTTGC ATT CATTGACAGTTAAAT GGAATT CACTTGTCATAGATGTGAATACAGTACACTG GAATGAAGATGC C C CATGGT TAAAGTACACACAGGACTATGAGTGTTATAACTTTTATTGGTCTTGGCTAGATAGAGTCCTGGCCTAAGGACTGA GACAGATTTCTTCATAGC CT CTTAATGGGAAT CAGAGACCCTCAGTGGGG CT CATG GAGGGTCAGAGCACTTTC C ATTGGGCTTCCAAAGAGTTCCATTGAGCCCTTTCCAGGGTCCTTTCTATTGACGCCCTCATGCAATCAGACCTGG ACTT GATTGTCCAGTCCCTTGGTAAGGACT CAGTTTATGATTGT CAGCTACCTTGCTAGGCTGTAATT GT CT CAG GCTTCCCTAGTTTTTTCTCCTTAATCCTTTTTCAGTCCCTGAGAAGCTTCTACATGTTTCAAGGGGTAGTCTGCT CT TT CTGGGAGCTGTTAC CGTCAAGG TTGCTATAAACAAATCCATGTT GT TTAT CTGAG GCTGAAGAAAATTGAC AT TAAACTGATGTGGTTTTTGT CTGTGTTGGGGTTTGATCAAAGAC CACATCTC CTTTTGGGTTAGAAGT CTGC C TGATATTTTATTGCCATACTAATGCCAAGCATCTCACCCTTTTTAATGTCTTGTGCAGGTCAACGTATTGAAACT TACTGT TGAAGACTTGGAGAAAGAGAGGGATT TCTACTTCGGAAAG CTAC GGAACATTGAATTGATTTGC CAGGA GAACGAGGGGGAAAACGACC CTGTATTG CAGAGGATTGTAGACATT CTGTATGC CACAGATGTATGTGTTTGACA TGAGGATATTTTCTTTCCATTTTACATAGAAGGGTTGGTGAACTCTGTGCTGATGCTTGTTGTATTCCAGTGTTG CAT T CATCAAAAGACTT CAT CTTTAACC CC TCAAAGTCAGCCAGAGG G CAT C TCTG CC CAGCTTTAGC TT CTGC C TGAGGT CTGTGAGCTTTTGAAGAAGGAATAGGACAAGGAGGTGG CTGG CTTG CC CAGCATCTGTAGTATGTGGC C ACTGATAG GTGATGAGTG CCACAAACTG CT CT TAGCCAGAAGCAAC CCATGT CCTCACTCCACCCCAC CT C C TAT TGCTTGGATCCCTCAGCTTCAGTTGCTGCCTCCATTTTATCAGGGCCCGGGGCATGCCCGGAAAAGCAGGCACAT GCTCCCTTTTTCACGGCGTGCCCACATATGACGTCATGTCTGGGTATGCCTTTCTCCTCCCTCCTAGGAGTTTGC CTGGTTCTCACTCCCTCAAAGTACTCTATGATCAAGTTTCTTTGGATCCATGTTTATTGCACAGTCAAATCTGTT GATATTAATCACGACATGTTAG TTGAT CAGGGAAGACTCATTTTTTT CTAGATTTAGGATTGTTATC C GG CTGTG TC CACTGTTTAATGGTGATGTTTGTAAT TCTG TGTGCCTAGCAAGGT CTGTAGGAT CAAACTACAAACTT CTGGT AGTATGTGTAGCACTATG CAAATACAAGGTAGTAATAATGCTGTTACTGT CACT GT CCTTGAGACATATCTTGTG AATTTCAGGGGATTAAGACAGATGACTGAAAAGTTGATAATCTGTTGAATAAATCTTTAATTTAGAGCTTGCTCT GTGCTATGTAATGAAAAAAAGACAACGAAGATATAGGGGGT CCT TGGTTTAGTCTGAG TATTATTTTAGTG GAG C CACATCTTAAGTAAGTT CAG CTATAGAGATTTAGTTAATAGAAATAAGATGGTGGTTGAñAGAGAAGT CTGATTT TACAGCAGAATAGTATAGTAAT TTCCTGCTCT GAGTCCATGCCAGAAT GC CCTGTAGT GAGAATGAGG( N ) xTGT GCACTCAATTTTTCTGCCACTGATGTTGGGTCAGTCTTGCTGTTGTGAATATTTTGGGTAAAGCATGGCTCCTAA GTATAACAAAAGGAAGGCATAAAAGGAAAGCTGGCTGCATAGACAGTTTATTGAGAGAAGATGTTAGAGTACTGA ACTTTACAGGGGACTTCAGGAATTTTTGAGGGAAA(N)xGAGGGTAAATTTGACTGTGCTCAGTTTTTGTCTTTG GTGAGATG CAATTCTACTAC CCAAGAAATAAAATAGCAGTCACCTGGC CTGTATAGAG CCAGGAAGAACCATTGT TTTTAAGAGGCTGTAAGTATGAGGAAAGTGAACTCACAGTAGAAATGGATCTTTCAGTGGCTTTCCCCTCTCATT TT CCTATTTCAGGAAGGCTTTGTGATAC CTGATGAAGGGGG CCCA CAGGAGGAG CAAGAAGAGTATTAACAG CCT GGACCAGCAGAGCAACATCGGAATTCTTCACTCCAAATCATGTGCTTAACTGTAAAATACTCCCTTTTGTTATCC TTAGAGGACTCACTGGTTTCTTTTCATAAGCAAAAAGTACCTCTTCTTAAAGTGCACTTTGCAGACGTTTCACTC CT TT TC CAATAAGTTTGAGT TAGGAG CTTTTACCTTGTAGCAGAGCAGTAT TAACACCTAGTTGGTT CAC CTGGA AAACAGAGAGGCTGACCGTGGGGCTCACCATGCGGATGCGGGTCACACTGAATGCTGGAGAGATGTTATGTAATA TGCTGAGGTGGCGACCTCAGTGGAGAAATGTAAAGACTGAATTGAATTTTAAGCTAATGTGAAATCAGAGAATGT TGTAATAAGTAAATGCCTTAAGAGTATTTAAAATATGCTTCCACATTTCAAAATATAAAATGTAACATGACAAGA GATTTTGCGTTTGACATTGTGTCTGGGAAGGAAGGGCCAGACCTTGGAACCTTTGGAACCTGCTGTCAACAGGTC TTACAGGGCTGCTTGAACCCTCATAGGCCTAGGCTTTGGTCTAAAAGGAACATTTAAAAAGTTGCCCTGTAAAGT TATTTG GTGTCATTGAC CAATT GCAT CC CAGCTAAAAAGCAAGAGG CAT CGTTG C CTGGATAATAGAG GATGTGT TT CAGC CCTGAGATGTTACAGTTGAAGAG CTTGGTTTTCATTGAG CATTT CT CTATTTTTCCAGTTAT CC C CGAA ATTT CTATGTATTATATT TT TTGGGGAAGTGAGGTGTGCCCAGTTT TTTAAT CTAACAACTACTTTTG GGGACTT GCCCACAT CTCTGGGATT TGAATG GGGATTGTATCCCATTTTAC TGTCTT TTAGGTTTACATTTACCACGTTTCT CTTCTCTGCTCCCCTTGCCCACTGGGGACTCCTCTTTGGCTCCTTGAAGTTTGCTGCTTAGAGTTGGAAGTGCAG CAGG CAGGTGATCATGCTGCAAGTTCTTTCTG GACCTCTGGCAAAGGGAGTGGT CAGT GAAGGCCAT CGT TAC CT TGGGATCTGCCAGGCTGGGGTGTTTTCGGTATCTGCTGTTCACAGCTCTCCACTGTAATCCGAATACTTTGCCAG TG CACTAATCTCTTTGGAGATAAAATTCATTAGTGTGTTACTAAATGT TAATTTT C TTTTGCGGAAAATACAGTA CCGTGTCTG(N) xCTTCATTCCTTAACTCTCCCTCATTTGCTTTGCCCACAGCCTATTCAGTTCCTTTGTTTGGC AG GATT CTGCAAAATGTGTCTCAC CCACTACTGAGATTGTTCAG CC CCTGATGTATTTGTATTGATTT GTTT CTG GTGGTAGCTTGTCCTGAAATGTGTGTAGAAAG CAAGTATTTTATGATAAAAATG TTGTGTAGTGCATG CT CTGTG TGGAATTCAGAGGAAAACCCAGATTCAGTGATTAACAATGCCAAAAAATGCAAGTAACTAGCCATTGTTCAAATG ACAGTGGTGCTATTTCTCTTTTGTGG CCTTTTAGACTTTTGTTG CC CTAAAATT CCATTTTATTGGGAAC CCATT TT C CAC CTGGTCTTTCTTGACAGGGTTTTTTT CTACTTTAAACAGTTT CTAAATAAAATTCTGTATTT CAAGAGT AT CATGTC TTCTGAAATTTGTCTTGC CCTGG GTATATGCTGTTAGGTT CAAG TGATGGGAAACCAGTG CTTC TTT CTTCAGTGAGGACTGATCTTTTCACATCCTTTACTGATTTTTCAGATGTGCTTATTTCTTC ÍN)xAGATGTGCTT ATTT CT AAGCTGACTTCT TTTTTCTT CATT CACATTATATTGTACAGC CT C CTG CTTTTTAAAATTCT CGTTGCT GTAAGAGGTTTTTCCTCTCGGAAGTCCAAGGCCTGGCCTATCTGCTGTGAAGCCTTTTCAGGGCATTTCCTTCTG AGAAATATAGCAGGACAGTGCTTGGCAGATGACTGTGGGGAGATTTTTTTTTTTTTTTTTTCTGAAGGTGTGA(N JxCTGAAAGTATGTTTCAAAGTCAGTTAACATGATCTTAATCTATAAAATAATCTAAAATTGTCACCATTTTTTT CTCCCTTATAAAA
TABLA E
>H sl_10675607-10681070
G CT CAGGGT GAAT TATC GG C C C CATGGAGGCAGGGTG GCAGTATT TC TCAAAACTGCAAAT GCA CACGC C CT GGGACCAC CCTCTGGGGGTT CTGAG G CT(N)xGATTGAGAGATGGGGAAACGTGAAGGAAGAGAAG A C A T CCTGTGCCACCTTAGGTGTGCACA CAGA CTCAC TG GAAG GATAAATGTAAGA C TACTAATG GCTG TTGGTGGTGGG GAAG C C T T T T T T CACAGACT GCCCTGTG CTATTCGATGAGTAAACTGTATGCGTG CTA C CT GTT CAAAAAG TGTAAAACAGAAATGGGAGT TGGGGACT GAGGGAGAGG C C CTATGTTGATTACAGT TTTGGGACCGAGATTCCAGGGCTAGGACACACCGGGGAGTCTCGCATGGAATCCGCAGGCAGGGAGGGA GTTTGGTGGC CAAGGAGG GACTO CAGG CC CC TC CAGC CCTGTCGTCTCTGCCCCCTGAGAGTGC CATTC AGCAG CCCTGGGGGAGGGGGTGTCTCCTTGCCCACTCGGAGGTCT TTGCGACTTGGAT C CG CCTG GTGG GGCTGCGTGCCAG CAGC CCCTTCCCCC TCATGC CACATO CTGCCCTGAG CTGAAGGAAAGAA CG GGCCT TCTCCTC CCCTG T GGTCATAAAACGCCAGGA CATGGAAGGTAC CT CTGAAAAC TTACAGGCACAGGTGA GAA CTGAGGGCAGAAGT CT CT CAAGCAG CAAGT CCTTCAGCTT CACT CGGCCGTGTTGT GACGTT CCTT CCCAGAGTTCA GAACG CTG CT GATTTT CCCTCTCCCC CTTTCTCC CAGATGAG CAGCACACTGTGGGGG CGC CTTC CGGATAG CTCTCCACTGGTGCTGT GAT CGCCTTCCCCGCTGGTCTGTGTCCCACCCACGCCC CTC CTGATGA CAGATCCTT CGGGGGGAGG CAGAGCAC CCGC CACGTG GC CT GGAGATGG C CG CTC CAGA AAGGTGTTCG CAGTGGAGACGGG GCTCTCCCGCGTGCCTTG GAGGAT CAGATGAG GT TGTGGACG TGGG AGCAC TTTGTGAACT GGGCATGG GAAAGAGT GAAGGTGGAAA CTCTCCCCACGT CAAGTAGAATTGCAT T CAGGAGGG CACTTTGC TATGTG CCGT TCTAAAGAGAAT C G (N )xTTA C T TA A T A C A A A T A A A G A G A A G AGAACGGGG CCAGGTGATCTT C CATGAAGACTGGCTG CCCTTCACAGCAGCAGCAATGTGTTCTC CTGA GACATTGCCCCACTCCCTTCTTGCCCTCTGTGCTGGTCCCAGTCCCTCTCCACCTCG TGAACC CTTGCA GCAGGCTCGCCCCAGCTCTTCTG CCTCTG CGGCCCCCACTTCT GAC CACCCTCAGAGCCTCCCCAGTGC T GCTG T C CAGG GT CATG GCACTCCCTGATTCACATCCCC CAGTAATT CC CATT GCTTAG GAAAGAATAT TCGGCTTCC TTGACCA CA CAGTC CACCTGGT GC CAC C TGTC TACCT CTTTCCTGCCCCGCCCCAGCCCC CACTGTACCTGCGGCTTCCTTGGCTCCTCTCCGTTCTCCCCAGCACCCTGCTCTTGGGGTCCTCTGGGC CAGAG CAGCT CTC CT CT CTAGTCAGG C CCAGG GTGCT CACC TG GTCTACACCCACCTACCGTCTCTCTT TTGTGCTTGCTCC CCTC CAGC CC C CAAGGGC CT CAGGGCTC CTC CAAGG CCCTGCTGACCCCAGCCCCT CTCCCCTGCCACCCCCATC CCTTTTG ATA GGCGTTCTCATTCT CTGAAG CCTTGCTCAGTCATTTGGAG A CTTG TG TTTTCTCTCCTCTG G CTCCAAG CGAGAGCAGG GCTGGGAGGT CTAGTTTACCA CAG CC CACC CCCGTGCCCCCACCCAG CGTAC CACGGTCCCCCTCTCCACTTCGCTGAGGCCCCGCTTGGTTCTGCTGT GGCCCCCAG C TTA T CGC CAGC TATAGTAG GG CATATATGG GTATATT TATG CT CC CCTAAACATACTCC TG TTATC CTCTCTGGAT CTTTTGTTCCTGAGGTAGAGAC CATG CTGTTGGTCTTCAGAT CACCTCTGCC ACCTGACTG CCAACT CAGGT C TCAGCT CACTGCTAGATACCTC CACC CACAG CCTGCAGT CAGGGTTGG GTGACTGGCATCCCTTGTGTCCCAGTT GAGG CAGTG G GAACAGGAGG GGTGGAGC CGGCTGCTGTTGTG ATGAT CATGTGCTGCCTTCCTGCGTTG GAGT GTTCCACTTGCCCATTGTTGGAGGCGAG GAGACTGTGC CACTGAGGCA C CT GTGC CAGC CC CCA C C CGAAGAAGGACGC CATGTG CGTCTT CTAAC CCTCCTCCTCT T CC CG CCTGTAGGTC C CGCAGG CTCC CGATGGCGAGATTACGG CGCCCTGGCCATCATCATGG CAGGCA TTG CATTTGGCTTTCACCAGC TCTACAAGGT GAGTCACC CC CAG CGGCTGCAGGTGCTGTGCCCCTGCC ACCCTGTGG CATT CAGCAC CC TTTCAC TGGT TCCCCTGCATGGAGGCACTGCCGTGCTCTGCCCTG GAG TGTGTTGGCAGTGAGAGGCTGTGGTAGAGGTCGGGGTTCACAGCTGGATCCAGGCAGACCCTGCCCTGC TGTGGGGCTTGGCAGGCTCTGCCCACC T T T T CTCCAGTGCT CAG CATATAC CACC CAGATCT C CACTTG CAGGGGCGTCGTGAGGTCTGTCCCTTG CCAG CC CATG TCAGGCACTTACAGTAGTGCCCGCACGG GGAA G CC GT GCGAG CAGTT TT G A G A C ( N ) x C A CCATTGTTCTGCCACAGCATTTAAGAGCGGCGGTGAGAACC T CAGTGT CATCAGAT GTATGGGAGCTGTTCTTTTTGCTCTGTCTGTTGTGCTGCCAT CTATAGAAATCT AGGATTTCC CAGG CTTC T CAAGT CAA C TT TCAG GAT CCCAGGTTCATTGCTTG CAA CAGAACTGCAGGC ATGACGTGAA CATTGACTCC CTTGTGAAGGC C CAACTTG GACGTGAAG CAAAT GAGT CTAGAG TCTAGA AAATG CAGGCCAGGGTAGGTGCCCAGTCCAGATCTTGGGACCCTGTAGAGAAGCCTCTTTTCTTTCGAG GCTGCCAGCTCCTCCACCCCTGATTGGTGA( N ) xGGTGATCCGCCCACCTCGGCCTCCCAAAGTGCCAG GAT TACAG GCTTGAGCCACCGTG CCCGGCTG GTGATT T T C T T T T T TAAACAAC CT GAATGAGCA CTCAG ACTCGCCATGGTCACTCTTAG CACATGAAACATGGT CAG CCATGGTATT CCAGATTC TT CATT TTTGAT GTAATA CAAAAGC TAATGACAG TTGTAGCAGAATATAATTT TTTAAGTGTAG CATGTATTCCTGGAGCT GGACATTTCCATGTTAGAAATTGAGAAACAAG <N )xTTGAGAAACAAAACATCCTGGGTTGGC < N )xC A CCTCTTGGGACCCGGTTCTTCTC CCTCTGAA CTGGTGCCCC CTTCAAAT CACTGACTGTATA CAAG CAC CAAGAAGAATGTGA CAC GAACGC CAGA GTCCTCAGAT GAATGC CC TG G CTGTAATGAGCAG TAGG CACT GTCACATGGCCACCCATGGAGGG CCTGTGGC CTG ACGGG A
> H s l 120440890 -120451513
CATGAACGATGTTATGACCAAAAGGTGAAATGCTGAGCAGAGTTCCAGAAACACTGCAATGAGCACAAA GGAGGTTATAGAAACTTTAGTGCAAACTGTT( N) xGTTAGAGGAAAGAGTCCCTGCTGGAGCCTTGCTC AGCCTCCAGGAGAGGTTGGGTGATGATGCAGCGTGATCCTCCTTGGGGGCATCAGAGCAGGAGGCAGGG GTGGAGCAGAGAGCAGGGTAGCAGGCAGTGAAGCTATGAGTGCAGTGAGAAAATAAGGGAGAGCTAGTC AGAAGGCTTCAGCTTTCTCTGTGACGTCAGCTGCTGGGAGTGTAGGGGTGATAGAGCTGAAAGAGATGA
AGAAGAGAGCTCAGGAGACAGAGGACAAGGGAAAAATAAAAGTATCACCAAGTGTGTTGAGGGCCTAAT T CAGGTTGGAAACCAGAAATTGCAGTTTC CTAT CTGTAAAT TAATTTGATTTTCT CAAGCAACAC CCAG C AGCC TAAATTGGGG AGGG CAATTTGAAT AT AT CTTC TGGATCTATT CTATTTAATC AAAT CATC ATTT TCATCTCTGTACCATTTTTCTTTGGGCTGTTATAGAATTTCTTGAAGAAAAACACTAATTCATCAACTC C CATTGCTACAATATAATTTGTGTG CTAAA CGCAGATGTGTAG CATCTGAGTGTTGGACCCTGAG CCTT GTGTAAACAGCCTGCTTTGAAATATTTCTCTTTATTTATGGAAATTATACATGTGTATCACAAAGGATA CAAATAACC CTGAATGATAATAGAGTAACAAAT GAAAGCCT CCATTATCCT C CAACC CTGCATGCTTGT CTAATACACTCCCCTCCGCAGATGTGACTGATAAGTATAATTCTTTCTCCATTTTCTGATATGTCTTTA CAAGTATAGAAACTATGTAATATTTAACGGCATTATGCCAAGTGCCTATAGCCAACAGTACTGTACTGT ATACTTAAAAT TTACTAAGAGGGTTGATCT CTTGTATTCTACACACACAC(N) xACACATGCCAATAAT GATAAAGAGGGTGCAAGGAAACTTTGGAGGTGATGGCTATGTTATGGCCTTGATAATGGTGATGGTTTC ACAGGTATGTATTCCTCCCCAAATT( N) xTAAATAAGTACATACTATTCATATTATTTTACCCTTGCCT TTTTTAGTGAACAACATATTGG(N) xTTTTTTTTTAAAAAGTGAACAACATATTGTAAGTAGTTTTCTG ATTTAGATTATTGCTTCCAATAGATGTATACTGTTCTATAGTATGGAAGCAAATATTTATTTAACAATT TTGCTCATGTAAGGATGATGTAATTATGTTGTTTAAAATATTACATTTATGTTGTTTATAAGGTTTTCT ATATTTTGACAGTAAATATATATGTGTATATTTACAACAATTTATACACACACACATAATAAATACACA TACATTTTCTTTGCCCCTGAAAAGCATTTACATTGTGATAGAGTCCTAGAATTAAAAGTGTTAGATCCA GGAGAATACAACGTCCCACTAAAAATCTTCTACTTCTTTGCAAACTATCTATAAAAT <N>xGTATAGTA G GCCCATATTC TTAAAACATTTC CAGTAGTATATAA(N)xGCAAATCTCTTCTTCCAGTCAGTTATTTG T CACTAACTTTTATAATGTTTTT CACCAT GCAGAACATTTTAAACACTCCTGGTTGC TGAGTTTCATGT CATTCTTCTGT CTTT CTTCATTCTATGATTGTAAAAATGTC TTTCTAATAT GTGCTT TAACAAATTTAT TTTGGTATGTCTCTGATTTTTCTTTTATATGGAATATAATTCTGTCTCCCATCTAAGGTAGTAATCTGT CTTATTATTGGATC(N) xTCAGGATCACTCAGCCCTGTGTCATTCATCAAGACAATGAAAGAATGACCC TGAACATATTTGGAGATCTTTGGGACTGCCATTTCCATCACAGGTCCACAATGGCAGAGCCATGAAGGC ATAATGGTTTAAAAGGAGGGGCCCAGAACATATGTGGGGCCTTTAACTCACTGCCCAGTGCCACTTCAA ATCTCTGCTCTTTGATTCAAATGCAGCACTTTTTGGCCACCCCCGGTGTGGCTCCAGTGGGCCCAGTTG TGGTGCCAGCAGCAGTGGCTGGCCCTCCAGAGGATGTGGGTAGTAAACCTTTGCAGCATTCACATGGTG CTAAATCTCCAGGCACTCAGAATGCAAGAACTGTGGGAGGATGGCCAGCTCCACATTGATTTCAAAGGA TACCTCACCACAGAGAGTCCCCAATAGGGCAATGTCTTATGGAGCCATGGGAGCAGGGCTGCCCCAAAA CCCCAGAACTGTGGAGCCTCCAGTGTAAAACGCCG(N)xTAGTTGGTAAAATAAATTGAGGAAATTATT ACTAAACCTGTCAGGGAGCTATTATTTCTAGGGTGAAAGGAAGGGGCTATACTCAGGAAAAGAAGGGAG GGAGGAAGAAGGGTATGCTTGGTGTAGCTTTTGCTGGGGTGTTAGCAATATATTTCTTGACTGCTAGGG TTTGCTGTACATATTTGTTAAAATAGTCCTGAAAGTTTTATCCTCACTTCTTTATATGCTGTATTTTAC AAT(K)xAGTAGTGGGACAACCTTCTTGTTTCCAAGCAGTCTGCACTQGGGTTGGCAGGGGCAGAAATT TGCTAGTTGGG CC AGTCCC CAAG CC CGTAGGTC ATGC ATGC AGGTGGGTGG CAGCTGTGGTGGTAGCAG CAGGTTGGATGGCCCATCCTTAGGCTCCTGGAAGGAGTGCTCACATGCCGAAGGTGGTGGATGGGGCAG CCTGGCAGAATGCTTGGGTGAGGGTGGCAGCAGATGCCCTGTGAGCCTGCTGCTAGTGGGCAGGGTTGT TTTCAGTGG CAGCAACCATGGGCAG GTGGTTGATGTGGAGATG CAGAGGCTTTTGAG CCTCATGACAGG ATTCATTCTGCTGAGAGATGGACTCTCAAAATG GAAC CTTG CTGTAG CTGCTTAGGATTTG CCAG GGGT GGTGTGTGCGGGACC CAGCGTGAGC TCCC ATTCTAGAGCAGTAGAGTTGTATGGT CTTC AGGCAG CTTC CTATGTTAGTTTTAAGGTCCTCAAGGGTCGAGGAGTTATCTCATGGCTAGGATTGCAAGAATCTACAGT GGGAATGTGGACTGCTGGGGGTCTCTCACTTACCCTTTCTCCACACTGGAGAGCCTCTCTGGGCTCCCA GCTGGTCATGGCTGAGCAGGCTGTCTCACTTCTCTCTTTCCTTGOCTTAGGTGTTTCTGGCCACTTTTC TGTTGAATT CCTGTGTA( N) xACTGGGTTTGAATTTTTTTAAAAACA{ N) xACTGGGTTTGAATTTTTT TAAGGTTATTGGGGTGTGTGTCTGTGTGTGTGCGTGCACGTGTGCACTATTGTAAATGTGGTCTTTTCA ATTACATCCTGCAAATGGTTCTTCTGTGAATTGGTTTTACATCATTTTATTTACTGAATCCCTAATAGT TT ATGGTAGTT CTCCTCTGTTTT CTTC AGTTGT CCAGGCAT AT AGTCGTATT AAC CTTCT AACT ATTAT ATGTAAAAC CCTAGTTTTATATGTAAAAC CTAAGCACAGTAACACACATTC TGATGTAACATGATTAGT GGTTGATGCCTATCCTCTTCCAAATCCACTTTCTTCAAACTTTTCTAGGAGGCAAGGGAAAGGGGTCAT GGAAGAGAGAAAAAGGAGAGAGACAGAAATGTGAGACAATTGCATGACAGCGGGACAAGGGCTGGGTCC CTACATCTCAGTGCTTGCCCAGCCTAAATTTTCTCAAATAGGCACGTGCAATAGACAGGTGATGTTTTA CAATGAGGCTCTTGAAGCT CTCAAAATTAT CGG CCAGGTTACAAAGC CCATATTCAATCAAACATATAT TTATGCTGAGGCACACAAGTGCTACCAGGTAGTGCCAGATGGCAGCCATGACCCAGATGGGTATGAATT ATT
> H sl 173 508 677 * 1735137 15
GCTGTCTAGCCACCACTGGATTCATGAGTCTAAAGAGCTGGTGTTGAAAGGTATGGAAGGAAGATTACA GATTTGTAGAGCATCTATGCATAGTGTTAAAATCATGAGTGTAGCTGGGACTAAAGAAGGAGAGACTAT AAAATAAGAACAGTGGAGAGAGTTAATTGTCCTGAGGTGCACCAACATTAAAGGGTTAAAAA(N)xTTC AAAAATAGCATGCAGCAAGTCCTTCCATGGGATTGTTGGTTTAAAAAAGTAAA (W) xGGTAAAAGTGAC ATTGTTCAGGCTCCAGAGCCTGGAACTTAAGAGGCTTTGTGGCTTCCCCTCTCACCCTGTTGGAACCCA GAGACCATGCTGTGAAGAAGCCCAGTCCAGCCAACTATACAGAATAGGCCATAGAGGGGAGAAACAAAG
CACCCTGGAAGACAACTAGCATCAGCTGCCAAACGTGCGAGTGAGGACAT (N) xGGAAAAATCAGTATC AATTG TGGAGAGC CT CCAGAAAT( N) xGAAGAGCAGTGGGGAAACTGTAGCAGGAGGCTGGAGGGAGGC AGCCATGCTATCAAAAGGCAGTACCACTGGCCAGGCTGTCACCTGAGGCAACTCGGAA(N)xACTATAA AGATACATG CAAfiCT CTGTTT ATTGAGGCATC
>H s2_2Q 9406837 -209420 259
c a t g c a g t g c a a a t t c g g g t t g t g a c a t c a g c a a g c t g g t t c t g t g c a g a a t c t a g t t g g c c a t a t t g a TTGCAACCAATTTTAGCTAGTTGTGTTATCT( N) xCTACATTCAATGACCTCACGTAAGAAGCAGAGGT AAATTTTCCAGTCTGTTAACTCTGCAATTAATGGAAAACTAATGGAAAACATTTTGGACAGCTTATTCC T CCATGTAñATGTTTAACACTATTAATTATTTCTGGAT (N) xTATATGAAGG GGAGTTT CTTGAGGAGA GCTGACTCACATGATTACAAGGTGAAGTCCCATGATATGCTGTCTGCAAGCTGAGGAAGAAGGAAGCCA ATAG t N) xCTGATGATCAGATTTTCAAAAACATGTTGTATTAGAAATTTGGCTGTATTTCTATGACTAT TCTTTTATGAAAACTGTCTCACCAAGATATTGCAATACATCTCCATGGGCAGTTTCTTACAATGCCCAC ATGTAGTGACAGTTTATTTCTTTATGTTTATTTTTCCATTCATTAATTTTTTGAAGAGAAAGACTAAAT TTAATACAAACTCACTAGTTCAATTCAAGTAGAGGTGATTGTTCAGAAGAAAACTTGGAAAAAGATGAG TGTTTTAGAGACTGTGGAACCAATATGAGAAGGAGCTCAGTCAAAAACTCTGACTCTATTTCTGACTGG AAGACAAñATGGTGGCCGAGACCGTCTCTGCCTCTCCCTTTTCTCTCTCTGGTTTGGTAGGACTfiTTTC CGGATTTTATCTGTTATCATCTAGGCTAAAGTTTAAATTCCCCAGAAAAGCTTCTTTTATAGGGTTGGG G AG AAATGGTGAAGC CC AT CTCAATGTGGAATT AAAG ATGTGTGAGT CATCTC AG AACTTCAAAT CCTA C ATTT AATATGGAGATTGT AT AT CACAAAAACAATGTTCAG AGGC AT TGGT AG AGTTGCTAT CTAGATA TAAGAACTGGGAAGTAACACAGAGGGAAGTAGAAACAAAGCAAGCTTGGTAGGAAGCAAGAAATTCCCA ACACACAGATACTTTGGGCAGTGTAAGAAGTATCTTTATTCAATCATTAGATTTGCCCGAGATGTGTCA ACTTCAAAGAAATGTGGGAAACTCATTTCTCTCTGATGTGGAGGAATTACTTATATTTAACAGAATCCC AGAGTTTTGTTTTATTGCACTTATAGTTCATTTGAAACACTTTCCTGTGTACTTTTAGAATTAGAGTTG CATCTGCCTGAGAATAATTAAGAGTGCCGGCATTATAATACTACTGCTGAAGCAATGGT(N) xATCCCT TTTACTGTGTTACTTTTACATGAATAATTGTGTATTCAACCATCAGATAAAAAGGAACAAACTTTAAAA CTATGACTCTGGGTAACAATGCAAATATAGGATTCCTGTGTCATATGTTATACTAGAATAAAGTCAAAA TTGATCAAATAATTATTTAAAAAGTTAAAGCATGTGGGGAATTTTTTTATTATTCAAAGTAAAAAAGGT CTTTCTAACTGATATAAAGGCCAGAACGACTCAAAGTAAAGGCTAATTTTCTCTATTACAAAAGTAATC TGTACTGCATGATCGATGATGAAGCAGAAAGTGAATfiTTGGCAACTCATACTAAAGACAAGTAATTCTT ATAATTAATTTTATATATTTCTGAATATACAAAAAAATCCAAAGCACTTCAAGAAAGAATCAAGTAAAA AATAAAAATGTAATAG GAAAATGGCAAAGGAGC CAT CATTT CTACTTAAAGTTAGAGGTAAAAACAAAT ACATGCAACAGTAATATATATTTTGTCTTCTTGTTTGTTTAAATAAATAAAAATAGTCTATTTTGATAG AAAAGAAGCTAAAACTGCCATGACTAAACCCTAATTTTCAGCGTGTTTGCCTAGAGATTCAGAATCCTA AAACCCGTCAGTTGCTGACACCATAGGTATATTATAAGGTAGATTACTTACCAAAAAACTATGTGTCAA AATTCAGGGAGACTGCAAATATCTGTGGTAGAGATGAAAGTGTTCTGATTAGGAATAAATAATGTATTT C CAGACAATTATTAAATGT CACATAAG CAACAGAAAGGCAACTCAAGATAC CAAT CTCTTTGATACAG ( N) xGATGGGAATAAATCACAAACATATAGATAGCTTCCATTTGACCAGGATGTAGCAGTTAAATGTTTC CAGATGTCACATGTAAAAGATGGATTACTAATATAAAATTTAAAACCCTATCAGTAAGAATCATATACA TACCTTTTCAATTTAGTTTGTATTATT CTT CAG GTTATACCAGTC CTAGCTGTCTGGAGTTTATG G GAA AAGCAATGAATCATATAATGCAAACATGAAAGAGATCTAGGAATTATCTAGCTGTTTTTTATTAATATT GCATGTGCTATTTAATGTGAATTTCAAGTCAAAAC<N)xAGAAGTAAAAAACCTGTAGCAGAGGCTGTC AGTTGCCTGCCTATGTCTCCTCTGGAGTACCCTTTCCTGTGTACAGTAATCACATGGCTTCCAGCTGGA GGCACCTGCGATTGTTTGCTTGCAGGCTTTTTCTGACTGCTGGGAGCGCTGGGGAGTAAACTCACCCCA GGAGCAGGACTTAAGCAAAGACAGAAGCTGGT(N)xAGGGAGTTTTTTTGTATTñCCTTTAGTTTTTCT CCCAAACAAAATATATTTTGAAAAACTAAGACACCGCTGCTTTTTACAAACATAAGTATTTACCAGTAA TTAAAATACCAAAATGCTGGACCTTTTTGGGATTTAATTTACTCTTATGTTTTTATCATCTCAAAGACT TTAAATATAAATCTTTCTCTCTACCAT TGACCT CTGTTACT CATAAAGTAC CTCACTAT CTTACATGTT GTTTACATTAGATAAAATTGCAATTACTCACTGACATTAATTACATGCTTTACAATGATGATAATTTAA AAACC TTGT TAGT TAATTTTGTG TTGATCAAACACAATTTC TGTTAGA CCACTAC CAAATGTGATGTAG AATGTTTTT CCTT GATTTATGTT GCTGTTTTTGTCACTAATTATTTT CATGTTAT GTGCAATCAAAATT TATAAAATTGGAAAAGGAAAGTTTTAACTGGCTGCATCCCCCAAAGAGACAGGAAAAGTGTACCTTTTT C CTGGTGGACCTTGAGACATGTTTACATGATACTTTGAGTAGTGC TGAAACTTATATTT CATTATTTTG TATTTCTTAAATTGTTTTGCATGTCAATATGACATGTTAATTGTCAAAGAAAAGTCATATGAAATGTCA GCTTACCCAATATCACATTTTATAATAATTAACTGGTTGGGCATTGATGAAATGTTCATTTCCTAGAGG ATCCTGAAACCATAAGACTCAGTGTCCCCAGTGTCCTCTTTTTTCTGCTTTACCTGTTTTGATTTTAGA ACCCTTGAGAA( K) xCCATTCTATACTTCACTAATCACACAGCTTACATGGCTACCACATTCTCCAGCC AGACAGACTCCACGAATCCAAAAAATTGTAGGTTTATGTACATGAA(N)xATAAAAGGGGTTGGTCAAG GTTATTCATTTACACTGGAGTTTGGATAGAGCAGAGAGGAAATACTTACGGGCCAATTTTGAAATCAAC AATGCTATTATCCCCñTCñGTGGATTTTTAGAAATTACTTTTTTGGñTACñTTTGATTTTCTTTTATCA TTTATACTTCCTTACAAAACAGAGATGCAGGTATTACTTCTTATCTAGGAAAGTAAGCCAGGCTGCATA TCCCAAGACATTGATATCCATTTTACCTCTTAAGTGTTTGATACCTACGTGACCAGTGAGTTTCTATGC CAAAGGCAATGATTT CT CATT AATC AATT CACT AT ATGATT CAAAATTTGAGTCAAAGTTCT CTGTGCT TTGTTTCCTTCAGGGAAGAAAAAAAAGAAAGGGAAATGAATCATTCTAAGAGCCCAGGGAAAGTTTCAC ACTGTGTAACATAGATT CTTGAAGACAAGGG CTTCGGAATC TTTGTGTTTTTGAGTAACATAAATAGTT CAAATCTAGACTAATGATTTATTGACCCAATATTTATAATCATGAATAGACAA(N)xTATGGCAACTGG AGTGAATATGAAAGATAGTGGGTATGAAGAGTTGGGGTCATCATTGCAATATACCAGTGTCCTCATGAT GATTTTCTCTTATTCAGCACTTATGATTTATACTTACAAAG{ N) XTTTTTATCTTTCATGTATGATAAA AACATCTTTTTTATAATAT CTAG CTAGTT TTTGGAAGGGTTAGTCGTTT CTAAAAGGAGCAAAGTTCTT TTCCTCTGAATATTTAAATGGTTAAAATATCATTTAAATAAATGTTTATATAATGGCCTGAAGCTGCTA CATCCTTCTATTC AAGT CTG GAATTGG CT CAGAAC CT AC AT AACACT AAGTTTGT AG A CGT AGATCATC AGTGAC CCCAGAGGGATAAGATGGTGC CT GTAAAAGTACATAGAT TAAGAAAAGGAAAATCTACTTCAG AAGG GG TGATTCACTAATGTGGGTAGT CATATT CCATGGTTGTAGAGGCTGG GAAAAAAGAGTGAGAAT GTAAATTTTTAAAAAGAGAATTTA(N)xGGAAAGAAACCTCAGCAGAGTGACTGGGGAGAAGTCTCTCT GATAAATGGAGGAGTAGTT CTATTT TTGC TCAAAGTAAAGGTCAGT C CAAGGTCT( N) xCAGGGCTTAA TAATAGGTAACTAGGñGTAGAA(N) xCCGCTATATCTTTAACATTTAGGCTTATAAACTAGTGTTTATA ATTGCC(N)xATCCAGATCATCTTTTCTTTATCCATTTAT(N)xATCCACTTACTATGTGCTGAGCCAT GGGGAGGGAATATTATTACTG CTAG CTTGAG CCAC CATCTCTAAT CT CACTGGACCCTAGGTGGCTAGA GTATGCAAGAAATCTGTGTTTCATTATTGAACACACATTCCATGGTGTGTTCATGGGGGAAAGAGGACT CCAGGGCTTCCTATTTCATCATTTTTGCTGACATTCATGTAGTTGTCTTTTAACTCAGACAGGAAAAAA AATACTTCTAAAACTCATATCTGCTGAAGGACTTGTATCCGAAATGTACAACAAACTCTTAAACTTGA ( N)xCATATTTTTGAAAATGACTATGTATTGACTCCTGTTAATTGAGCCTTGTGTAATCCCTTTCCCTCA AACCTGAGTCGACTCCATGATTCTCTTTTGATCAATAATATGCAACAGTAGAGATAAATCATGAATCTG GCAGCTTGTACCTGTCCCATAGCCCCAGCTGCCTATCAGTTATGACAATGAATATAGCACTGTTTAGTT CCACTCAACTGTTAAAAGGACTTTTGATGAAACATTGTGTCATTGCTCTAGAGAATAATAATTTTAGGT GAAT CA CTGATATTT AT ATTGTCTC ATTT ATTTTTTGTTTT AAAAAG CATCGT AAAAATTGTTT CAAAA TTTTCTGTTTTATTTAATCTTAAATGAGACATCATTTACTTAAAAGTACAATTCAAGAGAAGGTTTGAT TGAGTGACATTTGATAGGAAGAAAGTTAGGTTGGT GT CTAGGTTTGAAATGAAGAAAATAGAACTG GAG GGAAGTGAATGCTGTTGCTGGGCAACGTGGGTTTGCTGTGGAGAAGCCATTTATTAATTGAATTTAATC TAACCTGAGAGTTGTTTATTCTTTG CT TCTTTTTTGTTTTACC CTTTTATTGAAACAGAAGAACTAGTC TCTCCAGTGAATGAAGAAATGAACTAGTATGTTAGGCTTGTGACTCCAGGATTTCAACCCATGGGTCTT TCAATAGAAATTC CATT CTGAATG G CATTAGTG CTTGATTTTCAACTTGTGGTTATTCTGAGTCAAGGA GAGAGAAACAAGCTG CC CTAC CAAATGTAGCAAAACTACATGT
>Hs5_3306716-3307102 GTCACCTGGCACCTGCAACGCCATGCAGCTCACCACACCCCGCAGTTCCTGGGCATCTGCAGGCTCCTG TGGTTCCAGGGGGCCTGGGGAGCTCAGCTCTGCCATGTCTTTGGGAAGAGGCCTTCTCCCCACAGGGCT GCAGTGACCCTCAGGCCCACACATCTTAGTCACTGGTACATGTGCAGGTTACTAAGTGGGCCCTGGGGG ACACTGGCTGTTACAGGAG GTGTCCTGAGTGTCATGGTGAGATCTCCAG CATCCCATGCTTTTCGGGTT GTGTGGAGGGCCACAGGCCCTGGCATCTGACCTGGGGAAGGTGTGCCGGGTCCTGTGCCCTGCACAAGT GGTGTGTCCCTCAGCATGCTTGCTGAGAGGTAACACGTGACA
> H s 5 _ l67308741 -1673 135 70
GGGTAGTACAGGCTAAGTCCAGCATGTTATCTTTGCTGTGGAGGTCACAGAAACAGCCATCCCTCTTGC
CCCAAATGCAGAGCCACACAGGACTTGGGCTATCAGGTCATGGCCCTTTTTCTCCCTCAACT(W)x TAA
CCTCAGAATATTTGTTTGGCTGATTGAGATATACAACAGACAGTGAAAGCGTTGTATTAAAGGGCATGG
GTTACAATTTTTTAAAGATTTCTTAAAACGAAATACAGCATCACAAAATACCCTTTACGCCCACT(N)x
AACCATATCAATCACCATGAACCTCAGTTTCCCTATCTGATTAGATTGAGCATTGGCTTAAAAGGCCTC
t a a g c t t t c t t c t t g t t c t a g t a t t c t g t c t c t g t a a g c t c c t t g c a t c t g g c t g g a c c a a g c t g t c c c
AATGTTAGAGGCTCCCCAACTCCTCCTCTGTATAGTGTGGATTTCATCCCAATATCTGGGACATTGTCA
a a a c t a a g g c c t a c a c a g t c a t g c a a a g c a g a t c c t t g t a a c t t a c a t g c a a a a g t t g a g a a g a g t g g t t t t g g t a t g c t a t a a c a c t t g g c t g a a a a t g t t g a t g g c a a t c c c a t c a a g g a g a g a a a g t a c a g c t c c
ACCAACACAAACCTGGGCTCCCAAAGACCCTCACCCTGGCCTCCTAAACAAGGTCTCCCACCTCAATAA
t t g g g a a c a a a t a g t c c t c a g g a c t c t c t c g g c a t t g t t a a c t t g c c t a g c a a t g t a c c a a a t a g a g a t c a g a c t t t t c c t t c t t t t c c t t t g t g c c t t c t t g a a a c a g a c a a a t a c t g c a c a t g t g c a c g t g t g t g c a c a c g c a c a c a c a c a c a c a c g t a a a t g t g t a g t t t g t a g g t c c a g c a a t t t c c a g c c c a a c t a g g g t c a g g a c c a c t c t t c t g c a c c a t g g t g c a a a t g a g c a c a c a a t c a a g a t g g a c a g g t a a t t c c t t t g g a g a t g c a c a g c a t g g a a g t g a c a g a c g g c a g a g a a c t g c t c t t t c t c a g c a a a t a a t c t t g a t a t t g a c a a a a t t c t c a g g g t c g a t a g g c a t t g t g t a c a t t c t a c c t c t c c a t t t c t g t c c c t g g c t t c t t g g t a c a t a t
AAAAAGATGTCCTAGGCAAGCCCAGAGACTGGGAACTACAAGTGATATCTTAACAGTTTCGTGCCCTAG
t a a t t t t t a c c c a g a a c t t c a g c t t c t c g c a a a t t c a c t t a a a g a g t a c a t t c a a t t c t a a c t a g c t t c c t g c g t a a t a t t c a a t a g t a a t t t t t a a a a g t g t c c c c a g t c c c t t g t t a g g g a t g t c t c a a t t c t t g g g t t g a a t c a a a g g a t t t c a t t a t t t t a g c a g t g g g t a g t t t c a a c a g a a a t t a t a g c c c c a g c c c t g g g t a c a t c t c t t a g a c c c a t a c c c a a a a t a a t c a t g a a g a g t g t g g a c t c t g a c g t c c a t a c t a c a t g c t t TCAAATGATAGCTCAGCCACTTTCTAA (N) xACAATTGAAGCAATGATTGGGTTTGAGAGTACTTGTTT C CTAATTTAACAAACAATGTTCT(N)xGATAATAGCTATCCTATTGGTGTGAAATGGTATTTCTT CATG GTTTTGATTTGCATTTCTCTAATGATTAGTTCTTTTCATGTGCTTATTGCCTGTTTTT( N) XAGGCCAC T CCATGGCTGTGGTGACATGAAATAAGATGACAAGAAGACCACAC CATGATCTTG CCCCAATACAGACA AAACCAAAGTCACTGTCCAAAAAA(N)xGTCTTCTCCCTCATAGCAATGAGTTAATCAACTGTGTTTGG CCACAGTGTGCTCCTGCCTGGTGGTGTTT TGGCTAGAGGACATGGACACCATTAAG CTGTGAATTGGGA T TATTATC C CT GTTGTACAGG TGGGAAAACTGTGGCACAGTGAGGTTAAGTAATTTGCCATAGGATACA T AGTT ATT AAGCAGT A C AAG A (N) xGATATATAGAAATTTAAGTGAATTTTTAAAAGAC (N) xGAAAAT GCAAGAAATAG CAGCAAAG CATGT
> H s 6 _ 25097428 -251034 62
ACCTGAGAAATAAAAGCATTCCTATGTAAGAAAGTTTATTCTCAGGTGCATGGTCTGATGAGTACAGAT TTTTGTTTTATTTTCTC CACATTTTCACTTTCCAAT CTTCCT CCACT TATT CCATG GAAAAATAAAGAC TATATTTGGGCTAATGTCCCTAATTAGAGAAGAAGTGTCAGTCAAAGAAGGTTGTACTTATACTTTGTT TTTAAGTATAATTGCATGGTGATTGGAATTGAAGTTAGATGTAGCAAGACTGñCTACTGAAAATAGGTG G GAATTG CACACT CCTGTATTTAATGTTAAT ATATG GATTAACTATT ATTTG GCAGGCATT AT ACTAAG TGTTTAATATGAACTAATTGATTACCTTCACAATGCCCCATGAGGCACTATTATGATGCTCATTTTAGA GAGAGAGAAA CTAAG CCTTAGAGAAGACACATGACTTGCTTGAGG CCATAC CATTAG CAGT CAGC CCAG C CC CAGCTCTC CACC C A ( N)xAGCCAGTGAAGCCTCAGCCCTCACAGTTGCCACAAAACTTTGGGAGTA AAATCTGAAAAGCTTTTACATTCAAAACCATTAAAAATCTACTTGTT(N> xTCTACTTGTTACACACAC ATATC CT CAGTTG CAAC CCATCTTGAGATCCAA ( N) xGAGATCTAAT TT CTAACTGTGG CATTGATAGA AGCGTGGTGTCTCCCAAATCTAATTCCATACCACAATCTTTCTTGAAAAGCCTTAACTCTTCAATTTCA ATGTCATACAAGAAGGTGATTGTGGTTGTGTTGGAGCAGAAAGAGAAAGATCCAGAGAACGAGATTTGT CACTG TGAT TCAC CTAACATGGGTTTTATAGA CAGACATTTCTGTTAAAGGGAAT TT TACT TT CTTGAG CTACAAATTGGTTACACAGACGTGATAGATATTTTTAGTGAGATAAGTAAAAAGTACATAATCATTTGT ATTTCTATAGCACCAAGGTGACACATTTATGATCTTGGAATCAATTTTGTAGACAC’AGGTGAGAACGCT GTCACTG CACCAAAGGC GTCAGTAGCTTTTCTCTTTTTACACACATTAATATC CT CC CACTGTTACAGT CACCACCTTGTCTCATCTTCTTCCAGCCTTTAGTCTCTACATGGTTCACTAACAGGTTTCCCTGCCTCC AAC TATGTTACTATT CATTACTCTTCCTAAAACATAATGTGTGGTGGTCACA(N) xTCTTGATGTGTCT TTGTCTCTCTATAAAATAAAAATAAAATCAGTATCTAGATGGGGTATTGTGGGAAGTGAAAGTATTTGC CAACTGCTTACATGCAACAGTACAGCTTTTAATTGCTTTAAATGACAGCATCACATTAAGTGGTCAAGT ACC CACCAC CT CCAATT CTT CTCCACATCCTTGCAACTCAACCTACTTACACACAG G CATACAAACTAA C CC CT CCAGTT CACATCTGATTACATCCCTGTTCCTCACCCGACTAT CAG G CT CC CTAT CCTT CCAGGA AAAAATCCAAATCACGTAGCATGTCCTAAAGTGCTCATACAACCCCTGTCTCCTGTTCTGAACTGCAAT TTCTCATCCCAGCATCTTTTTATCTTTGCTGGTCAGTTCCTACTCCTTCTTTAAAATGCAGCATTCATA GACTCATCCAGTGTTCGTCTCCTCTCCTGC(N)xACAAATGAATTTACAAGTAACAAAGAGCAATGATC AAAAGAAATTATGTTTTAACCTTAGACCAATCTTTTCTGAAAAAATTTGGAAGATACAAGAGAGCTAAA TAATGTCAACAAGAAGTTTAAAAAGAAACACATCAGCAAGAGACCACATTAAGCAATTATACACATGTA GCTACGTTGCAAGAGAACGTAGCATGTCAAAATGATGACAGGTTTTTAACAATGAGACAAATGAAGTGC AGAGAAGTCAGATAGATGGTGGTAGAACCAAAACCACAATCCCATGTACAC CG CT CCTT CTACTGTACT G CTGCTT CC AGTTGATG AG AAGTCATGGGCAGTCCTAACAAGATT ACTGTC AC AAAAACTC GC CT CG AG G AC CG ACTT ATTATC CAAT AAATTC AG AC AAATACAGGTAAGTGT ACTTTCTTGGT AAAAT GACACATG CATTTCCTGCCTGCACATAAACAGTCCCTGCCCTTTGTTCACAAAATATCAGTAACTGTATGATAAAAT AAATG TTTTTGTG AAGC AG ATG ATTGCTC ATGAAAACGTTATAGCTAGGGG ATG AGT CACTGC AT AAGC T CAGGAT CAAATAAATTTTATATA CAAT CAAGGAAATCTTAGCAAAGTTTAAACATCTATAAACTTT CC TTTAGCATTTAAAATAAAATTGTACTCCATATAACACAAAATGGATAGATTTATTATATGTATTGCTTT TGCAAGGGATTATGAAGTCCATAAAATGTGCATGTAATATAACTTGATTGTATCACAAATATGATTTAA ACAAATATG CTT (N) X C CCCACAAACAAACAAACAAAAAACC CAATATG CTTG CCTT AACT AC CAAGGC TTAAGGTGCTCTTTAAGAATCACCATGTTTGCCTTTTTTGATCTTTAATTTTTCAAATTTACTGAGATC
C ( K ) xTAACACAAATATAGATTCATGTAACTGCCACCACAATCGGGATACAGGACAGTTCATCACCCCC AAAACTT CCTTGCATTG CACCTTTGTAGTAAAATCCGTCTTTCCCTAAC CCATGG TCAGGAATATTT CC CCTATATTTTCTTTTAAAAATTTTATAGTTTTATATTAACATTTATATCTATAATCCATTTTTACTTAC TTTTTGCATGTGACATAAGGTTCAGATCGAGGTAGGTTTTTTTTTTGGTTCAGATCGAGGTAGGTTTTT TTGTGTGTGTGTGTATGGGTGTGCAATTTCTCCAGTAACATTTGTTGAAAAGACTACCCTTTCTCCATT AAATTGC CT TTTT CCGGGCCGGGCACAGTGGCTTGCAGCTCATGC CTGTAATC CCAG CAC(N) xCACCC CCCAAAAAAAAGCCTTTTTTTCATTCACATTTCCCCAAA (N) XTAAAAT ATGAACTATATTGAAC AT CC AATTCATGA
> H s S _ 170477988 -17048 086 7
CATGGCAGCCCTCCTTCCTCTGAAGGATTTTTTCCCACTCTCCTCAGATGAAGACGCGGAGGCCTGAGC G CTGGGGGT CG CC CGGGGCTGCACAGCGGGTCAGAGGCCAACCCTGGGCGGAG CCAG GC CACACC CT CC CTCAAGGATTCACAGTCTCCCCAAAGTAACCGCAACGAGGGACCCTCCCGTCACCAAACTTCCTTTCAG TCACTTCCGCTTGTACTAATTCTGGCCCAACCGTTCATTATATTTTTACAATTCTTATCTTGGTGTGTA TTTTTTGACCCTAAAAATCCTAACTCCATTCCATCAATATTTTGTATATGCAGATTTCTAGCACAGTAA GTCATAAAATGTG CCTCTC CTGG CAACACAAAAATGT CTTCCTGTTG CTTTTAAG ACCACCGACTGATG CTGCTGCCTGGATAT CCACTTGCAAGCATTC CCGAGC CT CAGAACGTGAACTCGCAAACGCTCAGGCAC CAGCTGTCTTAAG CCTAAATC TG CT CT CCAC CAAAAGGGTGGCATTAGATGGCGTCTCACTTTACATAT TTTCATTTTTATAACCACATTCCTGGGGTTTGAAGCTACTGTAGGTGGTTGAAGCAAGGAGTGAAGAGC
c g g t g t t g a g g a t g g g c t g a t t t a c a g g g a c a g g a g c c t g c a g t c a c c a c a g c g g a g c c t g t a g g a a t g AGGCAGAAAAGCCAGGGAAAGCACTGTGGGGACGGCTCTGACTGCTGAAGAGTGTGCACACACACACGT
a c a c a c a c a c g t a c a c a c a g a c a c a c a c a c a c a g a c a c a c a c a c g t a c t a g c g c t g t g g g g g c g g c c c t GACTCCTGAAGCATGTACACACACGTACATGCACACACACATACAGACACACACACACAGACACACATG TACCAGCGCTGTGGGGGTGGCTGTGACTC CTGAAG CATGTG CACACACGTACACAGACACACACATGTA CACA CACACATGTACACACACACGTACACACACG(N) xGTACCAGCACTGTGGGGGTGGATCTGACTCC TGAAGCATATG<N)xGTACCAGTGCTGTGGGGCGGCTCTGACTCCTGAAGCATGTG(N)xCAGCGCTGT GGGGGTGGCTCTGACTC CTGAAG CT CATG( N) xTCCAGCGCTGTGGGGGTGGCCTCTGACTCTTGAAGC ATGTGCACACACACGTACACACACAGAGACACACACGTATACACACACAGAGACACACACACGTACCAG CGCTGTGGGGGTGAC CT CTGACT CCTGAAGTGTGCACACATACATACACACGTGCAGGCATATGTGCAT GGACATGTGTGTACACACAGCACACACACACGCATGCTAAAGTACCTGTTTCCCTCAGATAGGGCCACA CTCAGCACAGAGCTCACCAGGGGCCCCTCCCAGAGATGCGCTCCCCCCTACCACATGACCCCGGAATGG CCATGTGTGCAGGGTGCTCACAGCACACATGTGCTCATGGTGCCCTTGGCTTCGTGGCCGGCTGTGCCA ATCACGTTTGCTT CATCTC CATGGCTT CT CTGACCAT CATC CCACAC( N) xGTGGGAGAGGAAACAGGC AG GAATCGGGGGG ACGTGCGTGAGAAGGG CTGAGC CC CATGGCTGGCTCTGCAGGTGGAGGTGGCAGCC TCAAGCCACAGAA( N) xCCCCACTCCAAGGAGAACTTCAGACTTTGAGAAATCGTAAAGGAGGGTCTGT AAACGAGAGGTCAGAACGAACGAGCTTCCAGGAGTGACTCCCCAGGGTGCCCTTTTCCGGGGCCATCCC CACAGTGACCTGGCATGAGGTCCAGCCAGAGCCAGCACAAGGGACCCGGATGGCCACCTGGTTAGGCAC GGATCCAGTGCTC CCTACG CC CACTTC CAGCTACAGACT CCTGGGACTT CACAGGAACCCCTGCTGGCC ACCGCCTGCTCCTGCCTGGAAGCTGGAGTTGGGAGTCAGTCAGGGCTGCTCTGCACTCTGGCCAGACTT GCTGATTCACCCCATGAGAGTGGGCTCGTCCCAGTCCCGGCCACCCCCTGCCAGCCCCAGCCATGGTCG CCGCCCTCCCAGCAGGCCTGCCTCTCTCTGGGGATGGCTTGTGTTTCGGGGTGGGGTCATGCAGTCCGT CCAATCCCTCACACACATCACCCACCCCGGTGCTCTCTCCCCGCAGGGGCCACGCAGCCCCCATCCATC CCTCACACAGATCACCGCCCCCCTCCCAGTGCTCTCTCCCTGCATGGCCACACAGCCCATCCGTCCCCT CACAATCACACCC CATG CACT CT CT CCTCAC CTGAGCTT CG CAGC CCTTGAGATCACCAGCGTCGTCAT GC
>H s7 24 07 984 6 -240 84 079
AAACAGACACGAT CCATATGCTCATGGAACTTATGAG CTAGTGGGGCAGACGGGCATTAAACAAATAAA CCTAAAGAAAAAAGAACGGAGCCTTTTGATA( N) xAGAGATTTGATTGTTTCTCTAAGAATCTTCTCTT ACATTTTGTAGTCCTTGGAAAAGGGGAAAGTAATAAATCTATTCAGGACTATTTATAGAGATGTTGGCA AATAGCAGCTAAGATGGTCATGTTTAGAAAACACATATGCCTTTATAAAACATAACACTCTTAGTCGTC AGGGATATGGTATTACCTTTTAGTAATATGGTCACTTGGCTCGAAAGTAGGGAAGACATCTTGAGCGAG ACCACATTATACCCCCTTGCCCA{ N) xAGAACTAAGGAAATAATTTAAGAAAGCAGTACCACTAATAGA ATTTAAAATGGCAGCAGAAGCAATAACAAAAAGCAAAATTGACCTTATAGAAAACTAAATCATGATACA ATACCAGCATGATAAACTTAGATGGGTAATCCATCTAACCAAGTTTATGTCCAAGAATATATTTATCCC CTGGATATTTGCAGTTAAGGTATGGTAGTTGGGAAAGTACCAGGTATTTGGAGGTAGGAGTAAATGAGA GATAATCTTCTCTAATGACAGAAACATTCAGAGACTCAGAAACAGATGTTCAGAAAGTGAAGGTTGATT CTGTCTGTGTCTATT AG CAAG CT CCAC AC AATG AATG CC ATGC AAAT AATC ATGT AGGC AATGTGGACT GTAG(N)xTGATTTAAAGGCACTGGAACCAACACTTGAAAACAAGGTCCACTTACTTGTACTGTTTCCA AATCTAACTACCTCCACTAAGTTGATCACCAAACATCCTGTCAATATATCGATCCATGATTGTAAAAGT GCTTAGGGCAGTGGGAACTTAGAGAAGGCACTTCATGAATTTCGGCTCAATCTCAATCTGAAATTCATT TTGGGGAAGCGTAATAAATGTGCTTGCTAACAGTGGTCAACAAGGATGCTGTGCCTCGAAATTGTACAT ATTGTCCAAAACACTGTGAAATCAGTGTTCTATTCAGGATTCAAAACATAACGTGGAGTGGAAACAGAA AGCAGCATAATTATTTCTTTTGTCAGAGAAGTAACTAGGTGTTTACCATGGCCCAAAGTAGGGTATAAA TGCTAATCTAAGAATAACTACCCTCACTTTTCGAGGGTAATTATTGTAATCAAAGGGCACAACAAGAAC AGAGCAGTCAGCTGCCTGGTTACATAGTATAAATACGGTAAGTGGCAACTTGAGAAACAATGTGCAGGT ATCAGGAGAAAAGCTCTTTACAATATTCTGTTATGGTCAGCCAAGAAGAGAAATGGAAATTTAATGTTT GCAAATGGCTTTTATTCAGGTACCCAGTACGACTCGTTCTGAACATTGAAAGAAAGGGAACATTGAAAA TAAATTTCCATCACAGGAAAT CACAAATATG CTGAGGGATG CATGGATT CTGGATATGAGGGGGCATTG TTAGTAATGAAAC CCTTGGTGCTGGCCGC CATG AGTTGAGT CACGGGTTTTTTTC AAGGTGGATCATCT CTATTTAGTTTGAGG CACTTCTTTTGAAT CCTC AT ATTAAATATGTG CT CTTTTTTAGTTCCGGATCTG TGGTTGGTCAAAATCCTAACTCAATTAAACAGAAAACACTTAAAGAATAGTTTATTTTCTCTCTTTCGA GCAGTGTTTCTCAGCCTGTTAAATGCATGTTCCTCTGTAATTATCCTAAAAACCTCACTTGTTCCTGCA TATACTTGATATAGACCACTTTCAACCTTTT CACTTTTC CC CACCAGTCAAGAAGATTCTGTTGTTTGT TTTATAAAAGAGTAGATATTTTTCACCTTTTAGACCTAAAATTATGTTATAATACTGCATATGCTTCTT GAGCCAGTTCCAAAGATGATGATTTAGGGCTGTCGACTGTTGAGTTTCATTGCCGCCTCTGTGAACAGG AGCACTGACAGACTG CT CCAATTAAAACTTAAATAAGATGC TTCCACTT CGCTTGTTTAGCTGAGCTAA
AAGGTAATATTTCAACT CACATAAT CT CAGTGCTTTT CC CAGGAGTTGCTATCTGACCAATGGGCAGCA CCTTGACAATAAAGTTGAATATAAAAT CT CACTGTGG CCATTATT CTGATGTCAAGGAAAACATGACAA ACACACAACCTAC CACTGTTGTTATTCAT CT CCATAAATACTCTCTTTCTGGGTAGCACTAAAACAAAG CGAAAATTGCAATGCCAGCTCAAGTTCTATACAGTCTGTGTGñ (N) xAAAAATAAAATTGGAATCCACA TTTTCCTGATTGTCAATCCTCTAACAATATTTTCAATATAACAAGGACTTCCATGTATATTTTTATGTT
t c a t t t t g c t t a t t a a t a a g c a g t t a a g a t t t a a a t a g a t a t a t a c a t t t a c a a a a c a t g t c c a t a a a a AGTGCTT( N) xTGCATTGTTATAGGAAAATCAAGAAGATTGTTTCAGGCATCTTTGAACACAGTGTGTG TTTTCATTTGCAAATCAGCTGCATTTGTGAAAGAAGTAAGCAACTCTTCAACATGACTCTATGCAGTCT TATCTTATATTGTCACAACAAAGCATAAATTTGAAATAGACCCAGTACTGTTCAGTGGTTTAAACCCCC AGCACTGCCACATAC( N) xGACTTATTCCTGGTGCCTAGAAAAGGAAGGGAAGGGGGAGAGGGGAAGCT CCTGACCGGCGTTGCTTCTGGAGAGTATAGGCTGCTGGGGTGCTTATGGGTAAACTTTTTCTTCAGGCT GTGAATTCTTAGT CA CT CAAATGGTAAAAGAAC CCAAAGGCAATCATTATAAGAATATAAATAGAATTT GTAATCATTAAAT CGGCAAAAGAAAATTTGATC CATC CACTAAAAAG CAAGCAAATATAGTAAATGAAA AATATGAATAAAG TATAGTAAAGAAAG CACAGTGAATGAGAAT CATGAGATGAGGTGAGATGGTAGAAA GAAGTCATAATAT CATGGTAATGACAATT TGTT CATATGG
>Hs8 1 423 37 087 -1423 479 90
CATGAAGGAGAGGATGGCCCTCTGCAAGCCAAGGAGGCAGGCCTGGAACGGACCCTTCCCTCTGGCCCT CGGAAGAACCCAGCCTGCTGACACCTTGATT( N) xGGTGACAGGGGGTTGTTAAGGTTGTTTAGATGTG TGCGAGGCACGGAGCGACG CC CAGAAG ACGAGGGC CT CCGGTGAGTGTGAGTGATTCCTAAACACG GCA CACAGCCAACAAAACAAGACAACACCAAGACCGGGTGTGAGGTGGGCGGGAAAAGCAGGACTTGTAGTT GACCTCTGGGTCCTGGGGATGGGGGCGGGCACCAGGCATCATGATGGGTAGGAGCACAGCTGAAGGCCG GGGC(N)xAGCCCTGCCCTAAGTGAAGGAGCCCTCTCAGCACTAAGAGGCGCCGGGCCAGACCTGCTTA TCGATCACCTTTGGGCCCAGTGTGGGGCTGCTGCTCCGGAAACAATTTTTCTTTTTCTTTCAAGATTCA CCAGCTCTGTTCAGGAGGGTGGCCAGCTTGGCGACCTGGACATTTAATCCTCCAGCCAAAGATTCCCTT CTGTGCGGCAGCCCTGAGGGGGCTGTCTAGATGACCTCACTTGAGCTGAGTGACTGCCTGGGATGTGGG TCCCGTGAATGGGGCA(N)xCTCAATCCCCTGACCCCGGCACGGTGCCCCGCCTCTTGGAGCCCAAGCC AGCGGGGCTGAGGCAGGGGCTGCTGGGGGTGGACAGGGGAGGTGGGCTTGGGAGCACTGAAGAAAGGGT GACACTCCAGGCATCTGTGAAAG CC CAGAAGACAT CC CTAGACAAAGGAGGAAACAGGGTTAGGGAAGG GCACTCCTAGAGAAGGAGCAG CTTG CAGAGG CC CAGTAGTG GCAC CACAGGGGCAGCTGGAAGATGTG G CACCACGGGGGCAGCTGGAAGATGTGGCACCACGGGGCAGCTGGAAAGTGGGGTTGGTGGGCAGAGGTG AGACTGAGTCCTGGGAGAAGTCCACACCGGACATGGGAGACTTTCCCCTGACCTGATGGGCTTTATTTA AGCAAGGAAGGGACTGGTGTGAATTAGATTGGAGACACT CCCTCTGTCGGCACAGAGA(N) xATTTCAC AGTCTGTCTCATACTAGGTGGGGACATATTCCACGCTTACCGTACAACTCAATTCGGACTGGCTGCACC TCATGTACTCAGAAGACAGGCTGCAGTCTTGTATGACATCTTGGATGAGACTCAGGCCGCAGTCTTGGA TGACACATACGTGGACATCGGGCTGTGGTGGCCGGAGTAGAGTCAGAGAGGCTGGAAGGAGGCCCTTTT CCAGGGCAGGAATGATGGTGCCTGGCCTGGAGCTGTGGTGGGGTTCCTGGCCCTCCATTCAGCTGCCAG CCCATCCCAGCTTGCACCGGAGGGAAGCAGCGGGCCTGGGCTCTGCGGCATCCCAGGGGGGCAGGTCAC AGGGCAGCTACCCTACCCCAGCCTCCTCCCTGACCCCGCCGGTGTCTGCCCAGACAGCCGGCATCCCCA TCCCGCTGGGCTGGCTCCCGGCCCTGCCCTACCCCCAACTCCAGACTCATGTCAGGCTTCCTGTCCCAG CCGGAGGGTGACC CGAGGC CTGACC CCAGAGGC CCAGGCGATGGG CC CGGAAGGAAGGCCGCCTCCTTG ATGCAGCCACAGCAGGCGGGGCTGGGGCTTCCCCATGGACTCCCTCGaGAAGCACTTCACACTTTGGTG GGCGCTCAGCCAGGGCGGCCCCCAGCAGCGGCAGGAGGCGAGGGCCTCTGGGGCAGGGAGGACATGCAC CTGCGCCTGGGCCTGTGAGGGGCTCCCCCACCACAGCGTGGGGTCTCTGGGCATGCCAGGGTTGCCCCC ACTAGACAGCCAGGGTTCTCAGGGAACACAGAGCAGGCAAGCAGGCCGGCTTCTGCTTCAGCCTCAGCC CCTGTCCGCACCTGGGGCAGCACAGGACAGGTTTAGGAACAGACCCTAGCTGCACGCCTGGTGAGGACC TTCACCTCTGGGC CTCCCGCTGCTCCT CTGTGAAAATGGAGGTAT CGGGGTGTTCGTCCCTTCTAGGGC CAGAGAGAGATGG CAGCGAATGCACGTGGTC CCTAAAGGAGAG CCGT CCGTGGCCACGTAGGAAAGTGT CAGGCCCAGCTCCTCATTCCTGGGATGTGGCCTTTTCTCACCTGCGCTGAACTCTGCTCTCAGGACTCA AACCCTGAGCTGCTGI’GGAG G CTAACT CACT CATG CT CCAGGC CCAT CTTCTGAATAAGTTTCCCAGAC CCTCCAACCTGCCCTCCGTGTCGGCCCTGGGGTGCCTGCCTCTCCTTCCCTCTTCCTC(N)xATGGATG GGGGTCCCTGCCCAGGGCTGCCGGCAGGCGGTGCACCTTGCACTGCTCACTTAGTCCCACAGCCTCGCT TGGCCGCAGGGAGTGTGTCCTCTGCACACTTTCAGGGCTCCGTCCTTCTGGCCAGCGCCTCTGCAGGCT ACGCCTTCTGTGCTGGGCCTCTGGGCATAGTTACTAGGCAGCCAGGGCTGGGGCCAGCCCTTGGCCTTA GCTCAGCTCCAGGGAGGATCAAGAGGGTGTCCAGTGTGGCCGTACACCTGGGCACCTGCTTCAGAATCC AGGTCCTTGGGAAACAAGTGG CTTTGGTCTTGC CAATGC CT TTGCTCTGTTTTCAGCTGGGAATTAAGC TGGGGAACAGCAGGAGGGGCAGCATTTGCTGGCCCTGGTGCAGGAGGTTGTGGGTGAGAAGAGCCGAGT TCAGGTCCCCACCTGGGCAGCCTGTGGTGTGAGCTAGAGGTGCAGAAGCCTGCGGGGACACGGAGGGCA GCCGGGCAGGAGC CTGCGGGGACACGGAGGG CAGC CAGG CAGGAGGCTG CGGGGACGTGGAGGGCAGCC GGGCAGGAGCCTGCAGGGATGCGGAGGGCAGCAGGGCAGGCTGGGCTTTCTCCAGTGCTTGCTCCCCTC CCAAGGGAGTTCCTAATTTGGGTCCCCCTGCTCTTCCTTGGTTGAGTGGGCCCCACGCACTTGGCTATG ATCCAGCAGACTGCTGCGTCCTGGTGTGTCCCTGCCCCACCTTCCCACTGCCCTCGGCCTCTCTCCCAG C CC CTGGGAGCTGGGTT TTTTGCTT CCAATT TCAAATACAT CAGACT CCATCC CACTGTCCTAT C CC AA GGTGTCTTTTG CCTTTCTGGGAAAATAAAGTGG CAGGGGAGTGAACC CACCGCAATGGGACAG CC CT CC TCCTGTCCT CCGCGGGAGCAGCATC TCGCAGGGGCTCCG CCAAGC CTTCGAGG CC CAGCCAAAGCTGGG GTG GGG G CACTG GCCGGAAGCTTAGGGGAG GGGGAGC CTGCAGGACT TC ACTC CTGTGCCCGTGTGG CC GGGGGTACCTG CTTGA C CT CTGAGCTGTCA C CG GGAATAGT GTCCñGCCTGCCTG CCTGCTGGGACCTC TGCAGAGACCATGAGCTCAGTCCTCAGGGCCTAGGTCCTGGGTTAACACTGAGGGCTGTGCGGCCAAGG ACC CCGGGGTGGGCATGAC CGGGGTGTGTGGAG GAAGGAAG GGGG T CTG GGAAAGGATGCAGACAAGAG GTG GGTG CTGGGCCAGCAC CTGCTTCTGTTCTTGGCCTCAGGACCGT CTATGGAATTGGGAGTAATT CT CTG CAACATGG GCCCTG CAGCCGAGGT CTGGGCAGAGGCGT CAGGGCACAGAG CAGTGGTACCAC CATG C CAGGGAGATG CTCAGG CCAGGACCAAGCAGAAAGGGGAGCT CAC CATAAACT C C CTGGGGGGAGGGGA G CG CTTTAC CC CCA CAAGAATGGGCAG CAA CAGGT CATCTATCCT CAG GGCCCATAACCCCTT CACCTG CTT CCTGGTGACCTG CTGC CGGGAAGGGCCCTGGG CCTCAGGGCCTTTGTGGGGCTTCCTGTACCCCCC TGAATGCTGTCACTGTG CCTGAAAC CT CACCAGTGGGGC CT CATT TCTTTGGT CCTATAACACTC CTTG ATTTTCCAG CGTTAAG CACACAGTCATGAG CTGTTGC CTGAAGCTGACACTTACTGAGGGTGTTCTGTG CAG CAGG CTTTGTCCTGAG C C T TTT G A T( N ) xGGAGC CCTGCCTATGTGATGATGGGGCCAGGAGCACC CTGATCATCTTTAGAGCCCACCCCTAACCCAGGTGGGGGCAGCAGGTCCCTGTAAGCATGCTGGATGAA A C C A (U ) x CCTGCCGGAGTCTGGAGCCTGTCTATTTAGTTTGACAñTAAATAATTGGGCTAGTATTTAC ACTGATCTCTAGTAAGGGGGGCACAGAGCAGGGAAGGGTGTGCCGGGAGGGCTGAAGTGTGCCCCTGGG CAT CGGGGCGACTGTGC CAGCAC( N } xCCCACAGAAGAGACTCCTACCCAGTTCTCTTTTCTAGCCCAG AAGAATGGCATTTCTGGAC CATCTGGAACAT CT GGAT CATC TCCC CTGCTGATGCACAAACAG C CAGGC CTC CGGAGAAGG GG GGTGG GACAGGGTGTTG CAGG CAGAGGGAACGG CC CCTG CAAAG GAT CATCAG GG AGACTGAGCAG GTCAGGAT GGCTGCCG GAA CAAGCAG CGTCTCAGTGGACCTG TGGAGGGCAGGG GATG AGCAGGTGAAG GAGGAGAT GGGTGGAGAGCGTGTACCTGGGGCAGGCCTGGAGGCGGGAGGGAGGGGCT GGAGCCAGAGAAAG GGGCGGCTG GTGGAGGGAGGTGAGG CTGGAGGGGCTGGG CCTGGTACTTTGGATG AAG CCAGGT CACTGCAAAGATGAAATGTGATAG CACTGGGCAAAGGAAG CAAAGC CCATCACCAT GAGA C CAGCTC CCAATCCTTC CCTGGAATT CATTTAGGAAAAGAT CTAGAGAAGTCAAGGTGGGG CC CGGAG G GGATGGGATCAGGGTCCGTGGGAAG CAAGTT CATGTCTTTG CAC CGAGGATGGACAGGAGTGTGGAGTG
T ( N ) xCTGGAGGAAAGCCTGGGGTTGTGTGCAGGGACCCACAGAGGACTCAGGTGAGTGAGACCGCAGG CAGGCCG TGAACAGC CC CAATCC CACGACAGAGGCAG CCAAACCCTT TC CCAAAGATGTTTTATAGAGA CCTGGCGGGGCAGGGGGGGACTTTGGTCTTTTT CAGACACAGAAGTT C C T T C (N ) xTAGGATGCCACCT GTCTTTGGAATGTCTGCATGTCAGGTGCCCCAGCCCTCAGGATCCAGGACAGGTCTGGCTGCGGTGGGT G CTGGATGGATGCTGAGGAGGGGAGGGACAGTCACTG CATACCAG CCTCTGCCAAAATGGGGAGC CCAG GGGCAGGAGTAGACAGGAAGGACTTCCTGGAGGAGGAGGGCTGGCAGTTTGGGAAGATTTAGATGGGGA GAAAGTGGGGCAAGGG CATG GGGGTGGGAAGTCTGG GAAAAGATGTC CTGGCAGGGTGGAGGGGAACG G AAG TTAC GATGGTAGTGGGGAAAAG CATCTG CC CAGGAG CCAATGGT GCAAACACA(N )xCACACCCGT GTG CACAAG CACCTC CACCGCTTTC C CAGCCTC CTAG CCAGAGCC CTGGGGCT CAGCT CAGGC CCTTñC C CC CAGC CCCACCCGñC CACAGG CT TCACAACCAGAGGC CAAGAC CC CGGCAAATGCA CATCCCCTCGT CCCTCTCTGTCCCAGCCCCACTGCTCTCCCAGAGCCTGCTGCCATCCCCTCTCTGTCCTCCAGGCCTTG G GT CCTC CACACCCT CCTGCTGC CACCTGGATG CC CCAACCTCCCGTGGACTGTTGGGGTCTC C CAGGG G CC CCCG CACTTCTGTCTGTATGGC TCTTGTCCACTCTGCCCGGCTGACACCTCCGGCCAC CCGAGACT G CT CGAATG CC CAGCAC CC CACACAGC TGGCAC CTC CTCTG CAGACCGCAGC CTG GGAGAGAGGGGAAC CCACGGGCCTGTGAGTCTTTCAGAT CCTGGTAAA CAG TAGGAGCACGGGAAATAT TCTGGAGAGAAAT C G CC GCCCTCAG CTCCAGTCCCTTGTTC CAGC CTCCTGCTGGGTGCCCGGGGGGAG CGGACGTGAT GCAG CTCTCCACCCAGTACCTGGTACCCAGCTCCGTGCGGAGCACGGGGGCCAGCCCTGAGGACTCCTACTGA CCTGCAGGTTCGCGGTCCAGAGACGGGGCAGCT CACAGG CAGGGGACGCAGCT CTGACAT C CT CAGG TG TTG GAGCTGGG GCCAGGAG GTCATGGAGCTG CAAT CC CAACT CAAGGACTGGTTGAG CACTGG CACTGC C CAATAAAGGC CCCTGGTTGAGACTGTGGGTAGGG CCAGGGCTCAñCGTGTAT CAGGAGTGAC CAGGGC CAGGGCT CAGATTGTGT TGAGGC CAGGGCTCAACGTGTATCAGGAGTGAC CñGGGCCAGGGCTCAGATT GTGTTGAGG CCAGGG CT CAACGT GTAT CAGGAGTGAC CAGG GCCAGG GCTCAGATTGTGTT GAGG CCAG GGCT CAACGTGTATCAGGAGTGACCAGGGCCAGGG CT CAGATTGT GTTGAGGCCAGG GCTCAACGTGTA T CAG GAGTGAC CAGGGC CAG GG CTCAGATTGTGTTGAGG CCAGGG CT CAACG TGTflT CAGGAGTGAC CA GGGCCAGGGCT CAGATTGTGTTGAGGC CAGGGCTCAACGTGTATCAGGAGTGACCAGGGCCAGGG CT CA GATTGTGTTGAGGCCAGGG CTCAATGTGTAT CAGGAGTGAC CAGGGC CAGGGC TCAGATTGTGTTGAGG CCAGGGCTCAACGTGTATCAGGAGTGACCAGGGCCAGGGCTCAGATTGTGTTGAGGCCAGGGCTCAACG TGTATCAGGAGTGAC CAGGGCCAGGGCTCAGATTG TG TTGAGGCCAGGG CTCAAC GTGTAT CAGGAGTG ACCAGGG CCAGGGCT CAGATTGTGTTGAGGC CAGG GC TCAACGTGTATCAGGAGT GACCAGGGCCAGGG C TCAGATTGTGTTGAGG CCAGGG CT CAAGGTGTAT CAGGAGTGAC CAGGGCCAGGGCT CAGATTGTGAT TGAGGCCAG GG CTCAACGTGTñT CAGGAGTGAC CAGGGC CAGGGC TCAGATTGTGATTGAGGT CAGGGC T CAATGTGTAT CAGGAG TGACCAGG GT CAGGGC TCAG GAGGTAAGGGAG CTCAGGGTGCTCTCTGAAGT TAGGCCT CTTAGTGAC CATCTTCAGCTGGATCTTGCTGGAC CAG AñGCCAGAG CTTTGACC CCACAGGT GTGCCCTACCTGGGCCCTG GAG GGG CAG GT CAGTCTTTCTTGAGT GC CATCCTTT CACCCATGTTTTAA GCCTC CTGTTGACAAGGAT CTGGACACAGGATTACATTC CAAGTC CTTGCT CAAAGC CTGAGC CGTCTC CACC C CATCAG CCACGTTGGT TGTGTCCCTGAC CCGGAC CCGG CCTCCCCTGCAGAC CCTCGGGG CT CC CTCC C TAAT CC CACCCTTCTG CACGCACCTGGG CCAGGACATGGG CAGTCACAGC TGACAGATTGTTTA GGGGATCAGGAAGCCGGTGGGTGTGCAGCAGATGCCTGTGTGGACATGCACACATGCACACTCATGCCC GCTT C CAGG CAGGAAGACC GGAGGCTGCACGTGGGCAGCGGCG GGTGGTGGTAGT CCTTGACCAGGGTG TGAGT CCAGTTTGTCTT CAAC CTGGTGCTGGGAAGTGGG GTTG G GGAGGGTGTGAGG CATCCTGGGGT C CTTGGTGGGTTAGACTC CT GAAACCCA( N ) xGACTGGCATCTCCCCAGGGTTCCTATGTGGGTGACACA GATCAACGTGTCAATGGTTTCCTCACTGTCCCTATCTGACCGCATGGCCCTGCCTGTCTCTCCTCCTCC CTGGAGCCCATGTTTGGGACTTCCA{ N ) xGGCATAGGAGCTCTCCTGAGGAAAGG CATGGGAAACAGAG GGAATAG CAAGTGCGAAGACCTGGTATGGAAGTAAGAAGTGAC CGGAGTGT TG CTTG GGG(N ) xGAGGC AATGCATCAGGCAGGCGGAACCCTGCCCCTCCCAGGCTGACCCCAGCCAGACGGGGAAGGACCTGGGTC CCTGC TGTGGGTGGTGAGT CC CATACATGT CCCTGCCCACCCTGT CATCCTGCAGGGAGCAAA CTGATC CCAGAGGGGGTGCCCAG CC CACG CCGTGCAG CACCGCAGGGGGAACCCAGCGTGATAGAGAGCAGTGT C TTAGACCTGGCTCCAG CGC CGAGGCTGCGG C CCAATCGAGG CCGGGTCCTT CC CCTCGAGACTTCAG CC TCGGCTCTCCTGGACTTCAGGGTGACATCACCAGGGTGTTGGGGGTCTTCTGTGATCCTCCTACGAATA
C ( M) xCTATGTTTCTCAGGCTAATCTTGAACTCCTGGGCTCAAGTGATCTGCCTGCCTCAGCCTCCCAA AGCAC TGGGATTACATGTGTCAACCACCGTG CCCTACCCA
>H sl0_11383156O -1 13 8 40757
CCAAATATTACAATCTTTCCACCATGCCAAACTACCTTATGAATGAGCAACCAGAAGCGATGAACATAT TTGCT TTGCACTAATGCTCTAGAAAGGGCAAATGCTCAG CT CCTAGACTCACT TG CC CTAGGCTCACTC TGAAT CC CAGTAATAAAAT GAGT CATCTTTAAG GTAGAG GATAAAGAGCTTTGAGAG CCACATCCTGCT TTTGCACTTTCTTCTCT GGTTGCTATTTCAGAGTTTTCTTT TAAC CACACC GTTT TTAATATATT CT CT GTCTTGCTAATGATAGGAAACTTGCTGCTGCATGGTTCCATAAAACATTATCTCATTATCTCCTTTGCC TCTCACTGTAAAGTACTTTGTAG CATTGAAAGTGG CATATGAAAAAAATTAAT TT CATTAAATTAATG G AGAGTAT CATGACTGCT CAGTTACGGCTCAGAT GCATTTGTAGAGGGACTG TGTATG CATCAGAG CTTG TCCTTTATTTTGCCATTTTTCCTCCCACATGCACCCACCAATGTCCCTAGGAAGGAAGAAATAATTCTA AATGGCATGCTTTCAAGAAATATTGTTACCACTAACAACCTGGAAAATCCCAAATACAAAGCATGACTT TATATGG C CAATGAGAAAATTGCAGGGGCAATC CAGTTATCTT CACCATCTTC CTAAGAGATACAAAAG TTACAGCTGAATACATCATGC TATAATTCACAG CAATCCATAAGATTGATCATGT CT CTCCAAGTGCAC AGTGTTTCAGTTTATACAA(N)xAGAAAAAACAGTCTGGTCCCTCCAATCCTGGAGCACACTGTATCTG TACCACAG GTTACCCAC CCAACT CGGGGGAAACTGAGGCTCAG CAAATTTAAAAAAT TACCTC CCTC CA AAAAAAAAAAACTAGAATAT C TT CTGAGAAATAAG CTTCTG CT CAGTGAGCTTAACATAGTCTAAGA( N ) xGGGCCATTGGTCTAGGACTTGTAACATCTACCTTGTTTTAGCTTCTATAAATGAAAGCAAGACTCCC ACATTAA GTAATGCTGGTACC CC CATGAGG C CATC CTACATAT CTGCACTCTATTGTGGAACT CCTC CA GATGA CCTC CCACTGGCTCATTCAGTTACTT CTTGGTCT CAACTGTCCCAGAT CCTT TAATTCTCTG CT TTGGCTTTACAGCTCCACTCCAACTCAGGTCAAAGATTCACTGG(N)xTGCTTCCGGCATTCCATTATT CCACT TAAC T T T A C (N ) xTCATCTCTCCTATCGTCTGGGTTCTACCTGGTAGAGATCAATATCTAAAGC TAAATGGAGAT CAATGGATAG CTTTGCAAAT CATCACCTGATAAC TGGGGT CC CCTAGTAAAGAGGTGC TGCAGAATGTTGCTTGGACTGAAGGCCAAAATG GAACCTTCTTAATGAAGTTC CT CT CCAAAT CTAGAC TCAGTAT CTGC CTTTAGTAGAGT CACCTCTG CCAGTACACC CTGAGTGCATGT TGACGGTCAGAT CAGC CTGCCTCTAAAACCCAGACCTCCACAGAATGAGGAACCAAGAAAATCAGAGATGGGCCTTTGTTCCATT TTCAATG CAGTAACTA CACACATGAGGGAAAA C GT GCTT CAGT GGAAATTAAG TTAT CTACAAT CAACT GGATACTCACTTTCCCTCCAGGAAAACTAGTAAGAAATGGAGAAAACATCTCCTCCAGTATTTTCCCAT CGTTTGGAGGGAAGAGGAACTGCCCTCCCACCACCCTGTCCTGCAATAGAATCATGAGGAGGGAAAAAA ATCATACTT CC CCACAATAAAGATTTTAT CACAAAAGATGGAAAG TTCGGTACAAATAAAAGTTGTCAG TTTCATACCACTTTTTCATTACCGTTTCTATTGCACTCTACACTTTTATATTATATTGTCAGATAATTA GTTATTTTTGTATCTGACT CTTCTATCAGACTGAATTCCTT CATGAAAATCAT TTTTAAATTTAT CCTT AAATATC CATATCTGAT(K)xAGATTTAATTAGTGTATAAACACCAAGTCTCTATAGAACAGAAACTCT TACAATTACAAAGCATAGACATATTTATGCATCATTTTGCATTGTTTATCTTCGCTATTTACCTGTCTA CTAACAGAGTGAACTAGAT CCATAACTGTTT CCTTAAAT CAACTTATAATT CC C CAT CAATGCAATAGG TTCATGTTT CACAAAATTAGACCTATAAATT TC CTACTCAT TTAATTATGC CC CT CT TCAAAAAAAGAA TTTAAGACAGCATACCATATAGGAAAATAAAATTTTTTTAGAATAAGTAAGAAAAATGAGGCAAAGCCA CACATAAAT TTTTACCTTT CTTATAATGT T C CTTTAGGAGTGCTCATCTGT CTACCTTAGTGCCACAAG ATGGTATGGTGAACAACTTTT CAAGAACAA CATTTTACAATTTACTCATAAACAACC CAATAACTTT CC AAAATAAAC CCAGACAACATTGACAGAGG CTGGTGGTTAGTAAAG TAACATGATT CAGCCTTGGTAGCT ATTAG CAAGT CTAGCAGACTT CA CAGGGATT CT CT GAAGACA CTAGACAGC GATG CACTTAAAAAATAA ATCGACTTTTGGTCTGAAACCATTAGAAGAAAAAAATGAGGCAGAAGCAAAGGTCAAGGCTGAAGTAGG TTGATAAAGCACTGCTCCCTTGAAGATACCTTCTCTCAAGAGGCACTCCCTGCCCGATATTCCCACTGA AGAACAGTTCTCTGGCAGTTAATAATGGGCTAAGAGAAGCTGTGTTCTTTTGCTTACAGGCCCAGAATT GGAAC TCATGATGGTGCTACT TAGATCACAC C C CAGTGGGCAATC CCAGTTGAGAAATCCGGAAACACT AGAGAAG CTAAATGAAG CGTTTGACTCAAAGATGG GCTGAGTGTGTAGGTG GCTG GTGTTTGGTG CATA TTGCTTTCATAGTAAATAAAGCAGGGAGAAAAAAGAAATCAAAAT3GATTCTTTTCAACTTTTTTTTAA CTTCCCAAGGGTTGGCTGAGGATAAGGTATCACTGGGACGTAGAAAAAGCTAGGGAAAATGTTGGCTTC TGCTTGCTTATAC CAGTAACT TTATTATT TTATTTGT TTGAAAATAAGAGAAGGCAATCCAACTGCAGC CAGGACAATGACATCAGCACACTTAGGAAGGCCTGATCTGGGGATTGTCTTTTAAAATATTGAGTAACA TCTTTCTGACCAAGTGAAGAAAGGGAAAAAAGTTATACTTATCTTCTGTTCAATAAACACTCTTCCTTA ACCAGTTGCACTGCTTCCTTAAGACAATCTGGCAGTATCTAAAGCAATTGGAACAGTGATCTTAACCAA AACAAATACCTCAAGCATAAA( N) xATGTATGTCAGGTGCTATAAAGGTCCACTTCATTATCCTGCTTC TCTATCATAATGATGAAGATGGG(N)xCCCATCTTCATCATTATGATG(N)xTATATCATCATCATTAT CCTTTAAACACAGTGATTGGC( N) xTTTTTAAAAAATAAATAACTAAACATGCTGATTAGATCATTCTG AGATATCAAATCACTCTCTAATTTTCGATCACCTACTTTGATCTCAGGGACCCAGCCTTGTCTTCCCTC TGAATTGTGGAGACC CAGC CTGTAGGT CATAAAATAAATTTGAGGAGGGAGG CAATACTGAAGTTCCTT ATTCACAGAAGTCAATACATAAGTGATGTCATCATTACAGAGGTCAGACTTGGTACTTGCCAGATATAA CAGCCTGATCTATTTAATGTACTTTCTGTGTTTGCAAGAAATTAATGCAGGAATATTGTCAGATGAACA TCATATTATCTAATAGTTTACCTAAGTGAGACATCTCTGTGCTTCATTGTTCATACATTñTTAACTCTA CCTGTTGACCAAATTATTATTTACTAAAATGTTTTCATTGATTGATATTTACAATTTATTTGGAACTGC TCTCAACTAGCTAGAAGTAATCATCAAGAATTTCACAAAACCAACCTCCTGTTGGGTCATCATCGTGAA CTTGCAAGCATCAGAAGACTCCAGTTTGGAAAGTGTGCAAAGGAGAGGTGGTATCCCTTCAATCTTTCA GAAGCATGCTGGAAGAGCAATCTTTCCAAAATATGAAGGATGACTACTTCTCAAGAAAGCCGTAACAGC TCTCTATTGCCATAAGATCAAGCAACAGCTC CT CTGT CTGGCTTCC(N) xTAGGTTTTTAGGGGAGGCT GTTACATGAGAATGCTATATAAACAGCATGCTGTTTGCAAGTGGTTGCAGTTTTCCTGCCTAGTCTGCT GCCACTAGACTG(N)xGTGCTTTATAACCCTGGTTCTATCCCACCCACTATGAACCCTCACAAAAGCTA GAGCCAAGTCACTAGAGCTGGTTCCTTTTGTTCTCTTTTTATTCTTATGACTGTGTCCTGCCTTAAGGG TGTCTTCTGTTCACTCGTGGCTG( N) xGGGTGTCTTCTGTTAACTAATGCCCCAACTATCCCCCCAAGT CATTTCCCATGACCCCACTCCCAGCATGTTTTCCATCTCTCCAATGCCTTTCCTCTCCTCTGCTTAGCC GTGGTGATTGCCT CCATGT CTTTACTTGCAGAGCTTC CCCCATCT TGAACATCTTTCTGTCTAACTCTC TAATTCCCACCAATTTATCAAGACTCAGAAGAAGGACCACTTAGCTTTATAGTTAACCCCCGTTATTCC ATTTCCCTCCTGTTAAT CT CCTT CTTCTCTGAC CT CT CACAGCAC CAGTAGCTGCAATACATGATTTAG CACTTGATTACTTTCCGTCTAATAGGGTTGTCTAATTGTTTCAGCACACATTTGATCTGCAGTTGAACT GTAAAGTCTTCAAGGGAAGGACCCCATGTAGTCGACATCTTTGCTATGTCCCAGAGGGAGAGACATGGT CTGCATTGTAGGCAAACAAATATTTCCTTCCCCAGGGCTAACCTAGGTGAGGAAATTGACAAATACTTT CTGTGCACCTGCTGGAAGCAñGGCACTCTTCCCTGACCATTGCAGCCTCCTTGTTTACATAAAAGCAAC ACAGGTGAAAATATAAATGAACCTAAGATTAGGAAGACTGGGAAAATATTAATGCTCATGAATATTTAG CAGGGCTCCCTCATTAACTGGGTTTAGAGAGGTCTCCCTTACAATCACGAAGCTCTAATAAATGCTGGC ATTTAATTGTGCT CATC CAATAGGAACTAAG CTQCAGGGGTTTGTGAGC CAAGAAGTACAATATGGTAT AGAGTAGTGGTTT TAAAAC CAAG CCTCACACAñTT CCTAAACACT TT CñG(N)xTAGACAGAATACTGC TTGGAACCAGAAGACAAGATATTTTTTCATTCCCAACCAATTATTACCTATAGGAAACTAGACTGTGCA ATTTTTCTGAAGTTCAGTGTT CT CCACTT CAGG CGTAGGTAGGGCTT CCTCCACACA( N) xAAGGCAAA GAGGAGCCATTTAAAACATGTATCATGAAG(N)xACACCTGTAATCCCAGCACTTTGGAAGGCCAAGGC TGGAGGATCACTTGAGGTCAGGAGTTTGAGACCACCATGGGCTACATAGTGAGACACCACCAC
> H s l l 1209 97 333 -1210 01 541
CATGCATTTGñGAGTGCTAGGGAGGTGAGAAGATTTAAAAGTTTCTTTAAGTTCCñAGGCTAGAGGGAG AATTCTTCCCATCAGTATAAGTTGGAGTCACATAAACCATAATGTATCAGGTTCAATTCTTAAGAGTTT TGGTCAAACTACAATACAAGCAACTGTATTCTTGCTGTGTGACTGACCAGCCTCTCTTATGACCAGAGC ATCCATATTTGGATTCAAGTTGAGATTGAATATGGCTTCTCTTTGATCCTTACAAGAGCATCCAAATTA CCTGACTAAAATGTTACAATGGCCACAGATATATATTTTCTAGTGGCACTACTTTCCCATCCTCTACAT TTCGCTTTCTTTTCTGGCTCAATGGACTCTGTTTAATTCAGAGAGCTAGTTTGCTAGGCCCCTGAGCCC CTGCTTGTTGGATTCAACCAGACTCAAACATTTATGTAGACCCAGATGTTATTAAAGCAAGAACCGAAG ATTTCTAAGGCAAGAAGGG CCTTTAGAAGACAT CT TACCTTGT CT CCTGGAGAAACAGCCACACAAATT CTTTGAATTCTTTGGAGGTAGCCAGGGCGCTTTCTTATTTTTAATATCCCAGGGAATTCATCCCCACAG CCTTCCTTGGTCACACTTAGGTATTTTAAGTGCCTCCCTGTTACAAATTTCACGTTTTCAAATTACTCC TCTAATGGTTACATTTTTCATTTCTCCTGAAAATTGGATTTATTTCCTTCTAAGAATGGACCATGCTCT TTAAAATCAGATCAATTTATATGTCCATCAGTGGAAGGTCTTAAAATATCTCGGTAGTGTGGATCATCA AATAATAAACCTATATTGGGTAATCAGTAGTCTAGAAAGGATGGCATCCTGAAAGTTTAATTTTAGTCA AGATGATCAAATT CCAATAGAAGTT CACAGT CCAATCATGCTG CT CAGTAGTTATTAGGAGAGTCATTG AGGTGGGTTTTACAGATACAA CCTCAATT CTGT CTTC CC GGAGTGGG CG CTTG ACCCTC ACTCTAGGAA CTAAATGATCCAGGAGGAGCGTTAAGATTCTGGCGGGTTAGCACTCCGGGTCCATTCACCTTGTTATCG GCTCTCTTCCTTTCCCCGCGTCAGTGTCCACAGTGCAGTGCCCGAGCTTCAGCCACTACTCCGTGTGCA CAAGCAGCTGCCCCGACACATGCTCCGACCTGACGGCCTCGCGGAACTGCGCCACGCCGTGCACAGAGG GCTG CGAGTGCAACCAG GGCTTCGTCCTCAGCACCAGCCAGTGCGTCC CTCTGCACAAGTGCGGCTGCG ACTTCGACGGGCACTACTACACCATGGGGGAGTTCTTCTGGGGCACGGCCAACTGCACTGTGCAATGCC TGTGCGAGGAGG G CGGGGACGTCTACTGCTT CAACAAGACC TGCGGCAG CGGGGAGGTGTGCGCCGTGG AGGACGGCTACCAGGGCTGCTTCCCCAAGCGGGAGACCGTGTGCCTGCTCAGCCAGAACCAGGTGCTGC ACACCTTTGACGG CGCCTCCTACGC CTTC CC CT CCGAGTTC TC CTACAC CCTCCTGAAGACCTGCCCTG AGCGCCCAGAGTACTTGGAAATCGACATCAACAAGAAGAAGCCCGATGCAGGACCTGCTTGGCTGCGGG GACTTCGGATCCTGGTGGCCGACCAGGAGGTCAAGATAGGAGGCATCGGGGCTTCGGAAGTCAAGGTAA GGCTCCTTGCTCCTTTGGAGGGGTTCCTGGTACGTCCAGCCAGGAGGAGGAGCTCGCCATCCTCTTCAG GTTTGCCTTTCTGCCTCTGCCTATGAGTGGATTTTAGAAAGAAGATGGCTCCAAACGGGAGCGTTTTCT AATGATCTGCACCAAATAAGAATTATCAGGTGGAGAAATGGCATGCAGCTGGGATTTCTCCAATTAGAT AGAGCAGGACTAG CT CT CC GGAATAGCAAATAC TTACAGAACACT TACTATGTGCCAAGCACTATTCTT AACCGTGAAGGGAGATTCACACCTGCCTGGCTGGCTCCAGAATCTGCGCTCTTAACCCTTGCACCCTAC TAACTCCCCTACCTATACATGCCAGTCTATGTGGGGTGATGTAAGGGGTGACTCCCACCCCTCTGACTT GCTCTGAAAAACCCATCATTAGGCACCC(N)xTTGGAAAACAAAAGCCAACACAGTACTGCCCAAGGAG ATAATAAAATACG CATC CACAAACTGGAAGGGATGTGTTTT TATT CT CATTGCCACATTGGCAGAGATT TATTTATTAATAGACTAGTAGGAAGTAGATAGAAGAATGGACAGCTAGATCATTAGAGATATGCTATGT GCCCTCGAGGAGCTGGGACACAGAAAAAGTAAGACTGGAAGAGAAAAGCAGCCAGAAAGGCAGGGAAGG AGGGGAGCTATTTTGAAAGGTACCAAAACAGAGATGCCAGGCGGCAGGAGGGAGCCACGTATCTAAGAC GATTTTTGCT CACACTTTAGTTT CGTTTGTTTTGG GC TAGCTTTGGGTC CAGAACACA CAAATAAACAG CAAATGGGTCTAATCAGGGGAAT CCAAAAAGAAATAAATAAAGAAG C CCATGACAAATCGGTTGGTATC TTTGTGGGTTCCATCAGCTTCTTGACCTCTCTCAAAAGGCTAGCAATAGGGCAGACCGTGTCTTTATCC CACAGCCTAGGAAATGG AAAGTG CT CTGTGTGT CT CTGG AATTGATAAG AATG ACTTGATTTTT CAGTT GAATGGTCAGGAAGTGGAATTGC CTTTTTTC CATC CT TCGGGGAAGCTGGAAATTTATCGAAACAAAAA CAGTACGACAGTGGAGT CCAAG G GCGTGGTGACTGTC CAGTAC TCAGACATAGGTCTATTGTACATCCG GCTGTCCACCACATACTTCAATTGCACAGGGGGCTTGTGCGGCTTCTACAATGCCAACGCCAGTGACGA GTTCTGTCTCCCCAACGGCAAGTGCACGGACAACCTGGCAGTGTTCCTGGAAAGCTGGACAACTTTCGA GGAGATCTGCAATGGAGAGTGTGGGGACCTGCTGAAGGCCTGCAACAATGACTCGGAGCTGCTCAAGTT TTATCGAAGCCGCTCCAGGTGCGGCATCATCAACGACCCCTCCAACAGCTCCTTCCTGGAGTGCCATGG GGTG GTGAACGTCACTG CCTATTAC CG CACCTG CCTTTT CCGCCTGTGC CAGAGTGGGGGCAATGAGTC AGAGCTCTGTGACTCTGTGGC CCGGTATG CAAGCG CCTG CAAGAATG CGGACGTGGAGGTGGGGCCCTG GCGGACCTATGACTTCTGCCGTAAGTTGGGGTTGGATTCTGGGAGAGGCTTTCTAGCTGGGAAATAGGT GCCATGCTGGGCCTTACTCAGGACTTCCCTTCCAGGTGGGACTGAGCTCAAGGGTGATGTTAATTACTG TGTATACCTACCCTAGT CTGATGTGTC CT CC CAAAGACT CATCAG CC CAAACCGGAGAGCCTCTCATCT TTACTTTTTTTCCTAACTG CACAGTAC CT CCTCACTG CAGAA CAATTAACCATCCTATTTGCATGTGG T TTTATCTAGCAGAGGGTAGAGGCACCATGTCAAAGGACTAGAAATCTTTGTGATGGCTCTGTAGGTGCC TTCTCCCTTGGCAAAGAAGAGCCCCATTGATTTTGATAGGAAGTGATCGGCCAGGGCACCTTTGGGGCA TGGAGATTAGCATGCTCCCCAGGGCAAACCTGACAATGGGTATTAAGAGAGTCCTGTGTTGCTGGCTTT TAATTTTGTCCCGAAAGTATCTGAC CC CT CT CAGCAATG CAGTGG CAAGTACTAGTAGTAATTAGAAAA CTTTCTGGGAAGACCTGATACGTAACACCCTTGATTATCAGGGAAGCCACAGGGAGAAGTAACATGGCA GAAAGCAGGACTCACTCGTG
> H s l3 505 897 06 -50603 14 2
TTCCTGCTTCTCGTTTGGCACGCATGTTAGATGGCAGAGACCAAGAATTCAAGATGGTTGGTGGCCAGA TTTTTGTAGACAGAGATGGTGATTTGTTTAGTTTCATCTTAGATTTTTTGAGAACTCACCAGCTTTTAT TACCCACTGAATTTTCAGACTATCTTAGGCI'TCAGAGAGAGGCTCT’rTTCTATGAACTTCGTTCTCTAG TTGATCTCTTAAACCCATACCTGCTACAGCCAAGACCTGCTCTTGTGGAGGTACATTTCCTAAGCCGGA ACACTCAAGCTTTTTTCAGGGTGTTTGGCTCTTGCAGCAAAACAATTGAGATGCTAACAGGGAGGATTA CAGTGTTTACAGAACAACCTT CAGCGC CGAC CTGGAATG GTAACTTTTT CCCTCCTCAGATGAC CTTflC TTCCACTGCCTCCACAAAGACCTTCTTACCATGACCTGGTTTTCCAGTGTGGTTCTGACAGCACTACTG ATAACCAAACTGGAGTCAGGTATTTTGTACTTTGCAGTATTTCTCTTGTATACCAGTTTGTGATGTTTT CTCTAAAAACTTGAAGTTC CT CAGG CCTGTAACTT CTGGAAAAGATGATTATTCAAAATAATGTTTTGG GGTAACCAGTGGAGTTGGG( N) xATTAGGTGGGACAAAAGGAATAAATGAAGACTGCCCAGAAAAAACT GAGACTATGGACATTCAAATCATGGGAGAAAATAATTTTGTAGATTATGTTCCATTGCTAATGAATTTG ACTTAGAAAAGAATTGCCTTATTTTTAAGAGATTGTTTCAGTGGTTCACATAAAGGCTCGCTCACTGGT TTCTCTTGAGTTC CTTACACACTATATAAGT TGTT CTTT C AGTTTTATG ATTC AACTACTGTTTTTCCT TCAGCTGACTTTATTTTTAAACACCCTTAAAGACAGATATATCTCATGGCAAATTTGGTATCCTGTTAC AGCCTTGGCTCTTAAACAACTCAAAATATTGGGATAGGOTGTCAGTATGTTAAGGATAGTTGCTCCTGA GTCAATTCTTCACTTACTCCCTCTGTTGTTCTTGGCTGGATCCTAACGCTGATTTCCACTCTGCTGTCA CAAACATTTTTCCCCCCGTAAAATGTCTTAATGCTGTCCTACCATTATTTTACCAACTGTGAAAGCTGG CTTTAATTTTTAGGAGGAAAAGAAAAGCCTGCATGTGTTCTTTATTGGTATC(N)xGGTAAAGGTAGGC GTATTTTAAGATATTTT CTTAACTTGAGC AGTAGC CAAC AG GAAGGATACC AGTGTCTCTCTCTCTTAG CGACACACTCCTTGGTCTTGCTTACCAACTGGAGGACACTAGGTAGAATAACCGAGTATGACAATTCTT AATTGTTTACATTTTATAACTTC CTGT CCTT CAAAAGAGTTTGAAATGT CATTTTGGGAAAAGAGAGCC AGTCAAGCTAGTAGGCTGATTGTGAAGAAAATCTAATACCTTATCTTTATCTCAAACCTCTGTACAACT TTATTTTCATTGATGGGATACTTTAACAAAAATGAAATTTTTTTTGGTTTTTAAAATATGAGTGATTAT G AC CT CTTTGGG GAT CATGCT TCAAAAAGT C AG AAAC CTAGAGACAAAACTGT CATTGATTTTTAAGAA GAAACACAC TAGGTCAAAAGAAGATGT CC TG GAAATATGAAGTACTCTTTAAAAACCATGCATTTGGAG AAAGTAATTGTTTCCTTGAAAAACATGATTAAAAACTAAAACTGGGATGTTCCTGTGTGTACACAGTGC CAAATGGTTTTCCCTTTTTATGTTGTGTTTTAGAAACAGCACGAAAGTTTTTTCCATTTTAAAGTGAGA AAACATTATATTTAGA CTTCCATAATT CCAAAATCAGAAG CTATTTT TAAAATTAGCATTT TCTT GCAT CAC CAAATGGTATTCAATTGT TTGAAG CT CAAAATTT TTAC CATTCCATAAATGTTTGTGAATTTTTAG ACAGTGCCAATTTAAAAGTAGAGATAG C CAATCTGAATACG GTGAAATTATGGGGATCT CTGGTGATTG GGATGAAAACT CTGG C CTTAAAAGGTC CACTTTTAGTATATAATTGC CTAATTAG CAAT CATTTTTATT TTTTG CTCACT CCCTGGTCTGAATCTATCTGTCTATT CAGATATTTTTTGGTAGGTTTG GAAAATG GAG AAGTGAGCCTAATTGGTGCCTAATTGTCTGGTGTATCATTCACTTTATTCAGTTTGTTCTATCAATATG ATTTACCCCTCAAGGTTAACCTAGCAGGTTGCTCAGTTATTATCTCTCAAGGTCACAGTACTAGAAATA CTTGGCTTG CATCTTTCAGATGC CATT CATGTTAT CAAGCT CAAATTATAGTTGGTCACAGGATT CTAA AGTCTTTATTTGACTTCTCCTTTTTGAACTGGCTCAAATGGAAAAGTGTAGTTGCTTTTAAATGTTAAA AATAAGTTTAAACTT TATATTTC CCATTGGTTTCC CCTATTTTGTCCTTTCTTTGTGTG CTTGAAATAT TTTATTTTT CAGTTTGT CCTC AT AGGG AA TC AAGT ATTTTAGCTAGGTG ATGT CTTGCAAGT ACGTT CC ACTTTGTTACAATCTACTATCTGTATATACTATTTGTATCTTAATTCTTTTATGAGATGTTCTGTAACA TTTTTCTCACTTTGACAAATGTTTTTAGACTGTACAGTCAAGATCTGGCGCTTGGGGGTAAGTGGAATG ATTTGCTAATATTGAGAATCTGTTGTATCAAACATAATAAACTTTTTTTGAGATGTGCTTTGTAGTTTG GTAATTCTTAGATTTTACTATTGGTTTCTTTGCTTAGAATGAGTTTCTAAATGTGAACTACAAAAACCA CTCTACTTCAAGCATGATCAT TAG AAGAAGGTTAGGGATGT CC CCAAGCATCATTTTAT CC CTTAAAAC TACAGATAGTTTAG GTTTCTG C CTGAACTAATCTAAGTTAG CTTTTT CTTCACAT CTCATAAACTAATG GTGGGAGGACACAAAGAGCTTTATGAAAACATTTTTGATGGCAGATAAGGACGGAGTCATAAAGCTGTA TCCATGCAGATGTGTTATGCTGACATTATTGTTATTATTGAGATGGAGTTTCACTCTTGTTGCCCAGGC TGGTATGCAATGGCGTGATCTCGGCTCACCACAACCTCCGCCTCT<N} xTATGCTGACATTATTAAAGC CAACAGATAGCTAAT CT GAAAAGTT CATG TG C A T T ( N ) xTCATGTGCATTTTTCTTGAATTGGTTCTGT GTACTGCATTTGTTCTGTGGAGCAC CT CAGTGAATAGAAGATTTTGC CCTTTC CACTCT CCAGAGGCTT TTGACCCCAAGTTGTGCTCTTTGACAATGGAGAGTTATTGCGTAAAAAGTGACAATAAAATCTTGACAT TTCATATCTGCAAAATTTCACTTAAAAAAAGGTAAATTTCATAGGAGTAAGTCGCAAAGGAAAATCCAT AATGT CAGTGCTAATAATTGATAAGATTAGATCTGGAGGG GAAATTGTAAGTG CACAGCAT T CATTAAA ATATTTGATTCAATCAACTAATTAATAGCAAATTCAAAACAAAATTGGAAATAGTTATATAAAAATTAA TACAATTTGTTTGAT CAAATGTTTT CTTTAGAAAGTGATCT CT CAAAAATTGATGAGTGTT GGTTTACT TCTAAATATGGAATTGAAATGACTTTCAGAATCACAGCAAGGTCTCGATTTTCAGCACCTGATTACTGA T T T G G T (N ) xTCACTCATTTGGATTTGTAAAGTAAATGAAATTTCAAAAAAAAAATCTGTCTTGAGAAG GTATGATGC CTTTGAGGTTGTAA GT CAAATT TATGTGTATAGTTTTG CTAGTTATTAAA GGGATG CTTT ATAATTAAG CTTCTTTTTGTGTGTCCTAGGTATGTTT CTATAAAACCTGATAA CC GAAAAT TGGC CAAC GGAACAAATGT CCTCGG CTTACT GATT GACACTTTATTAAAGGAAGG CTTTCATTTGGT CAG CACTAGA ACAGTATCT T CTGAAGACAAAACTGAATG CTATAG CTTTGAAAGGATAAAAAG CC CTGAAGTGCT CATC ACGAATGAAA CACCAAAACCAGAGACTAT CATCATAC CAGAG CAATCTCAGATAAAGAAATGAAGTTGT CTATCCTCTTTTAAAGAGAAATTGCCATTTTTCTTGTTTCATTACGTATTTAGGGCATACATGTTAGCC AAATC TACACT CAGC CTAACT CTTGGC TT CATCTG CTGCCATG CCGT CT CTGGGCAACCAGGCCC CAAC TGT GCTTAAGC CATAATGCCTGCTG CT CT CTAGACAACTCCATGTACTTGGTG CTTTGGTATATGTT CT ACCTT CAATACATCTTTCCCTTTCTCTTCTT CATGAATGCT T A ( N ) xTAGTTGGGCCTTCCGTATTTTC CCACTTATGTTTGTTTCAGC(N)xCCAGCATGTTTTTATCAATCACAGTTTCATTTCTGACAAGGCACC (N)xAAGGCACCAAGATTTTAGCTTATGATTCTTGAACCTCAATAATTTACCTTATAAGTGATTAGAAG TTCTCCCTACAAATCTTTTCTGAAAAACAGCTCTAATGCTTAATTTTACTCTTGGATTTAATATACAAG TTGTAATTACAAAGAGTTCTTAGGGTATGGTAAAGTCAAGATTAATTTAGGCTGC(N)xTATTTAGGTG CTTTAAAAATGTATTTGAAATCAGGAGCCTGAAGTAGTATAAAATGGTCACAAGAAAGCAGCTTGAGAA ATACATGGTTTAGGAAATCAAAATCTGTTTCTAATTTGTCCTTTATACCATTTACAAGTGACAAACTTC AGCTTTGCT TTATAAA CTTTT GTAC TGAT T CTCTAT CATAAGTAATAGAAATGTT CACATCAATAGAGA ATTACACCACTAGAGACTATG CTGG GGTAATAAAAGATTTG CTAAGAGCACTCAATCTCTTATATTC TA CTG CTATTT TT CCTCTGTGCTAGG GAAGTTGACTGTAATG G CAAGTT TAGCAACGTAGAAG CTAC CGTG AGGAAATCT TCAAAAATAAATTTATGAAATTGTATTACCTT CCTGGAAC CAAAGGACCCAT GGGAGACA GCAATGTGG CATCAGACTTAG CATT TTTAAA CAAATG CTCAGAAATAAAGATAGGACATTT T AAAATTA CTAAAAATACTTAGCTTTCTGGAATATCAATGACCTAGTAAAAAACGTAAGTACTGGGCTGGCCTGGTG GT(N)xCATCTGTCTATGATTTAAAAACAACATCAACAAC AAAAAACCCCAAAAATCACAATTACCGGC TTT GC TGTAGAACTATATACC TCAAGACTTACTAG GATTTCT(N)xAAATTACAAAAATTGGCTGGGCA TGGTGGCACGTGCCTGTAGTC CC CAGTTACTTGG AAGGCTGAGGCAGGAGAAC TGCTTGAGTCTGGAAG GCGGAGGTTGCAGCCTCCGCCGAGATCATGCCATTGCACTCCAGCCTAGGTGACAGAGTGGGACTCTGT CTCAAAAAAAAAAAAAAAAAAAAAAGGATTTCTTTTGGGCCTATTATAAATGAAAAGTACAAATTTTAT CAAAACACTTCATCATCGAAAATACCTTAATATTTAACTAAACTATGAGATCCAATCACAAAAATATTT TCATGGATAAAAAGC TAAAAT CTTACATGG GTGAAAATGGAAAATGATATTACGCATGG CACAGAATAA AAACAGAACTAAACATG TAG CAAAATAAATAGGAC CTTG CT CCCAAAAC TAAAATGTGTTTAAGACTTG C CTAACCACACAAGG CTTTTAGGTAGGTAGT CTTCTGTACC TTCAGATTTCACAC CAACAC CTCAAAAT CCAGAAATCATTAAAGCAAATGGCTTTTTATTAAACAAATGGCTTTTTATCAGTAGGGTAAAACATACC AAATAAGTGTTTCAT TT CATATTAT TC TT CATCAT CT CACAACACTT CTGGAAAGAATGTGAATTTCTT CAGGTT CAT GAGTATGAGG C CAAATAG GAGAGATTTG TT TTACCC( N ) xCTTAAACTTCTGGGCTCAAG CAAT C CTTC CACCT CA C CCT C CCAAAGTGTTGGGATTACAGGCGTGTGC CACTGCACATG GT CTACG CT TTCTGTAATAAGAGAC CTC CAAGGAAATCTGGCTG CAATGTTTAAGGTT CCATGAGTTCTT TTAAGATG AGGACTGAGACTGCTATAG TGGTCTATGT CATACT CC CC CATAGTGCTGGG CATCTAACAAGCTCAAGT GAACTGAACTGAAA CATTT G GAAACTATAGT CTCAGTGAATAGATGCTTG GAAAAACAAGATTTATAGT AGATT CAAAGATGGAAGAGGATAATTTAAATAATACAAT CTGAAAGAGT TGA CTCAAGT TACAGATTAC AGAAACTAC CC CAGAG TATTTTC AATTAGTTTACTTGG (N ) xGTTTATTTGGGTAATGC CATATGGAAA GGAGATGAAAATATAGATGTATATGGGGATATAAA CTTGTTGCAGATTACACAAAAT TATATGTTTAAG C AG AG AATAGT C C AGTT AAAATTGGGATATA CTT AATTTGCTAAG AA CAATGTAATG CT AC CAGC ATGC T TAAAATTAAAAAAACTAAGAGAATGTAAGT TCAAAATGAGTGTCA CAT GACTATAATT GATAGTAAAT CTTCA CTTACCTCCATAAC TTAGGAGT CA TGAAAAGAAAGAAGTTAG CAGACCAGGCATGTAAAT CAGA TGTAAGTAGGATGGAATACAGAAATATTC CTAGTT CTGGGTAACC CAATTT GAGGTCATTTTAAGTTTA C TG AATTTG GATT AATC AATTGGTT CAAT AATC AAGGTAAAATGGTC CAAAAGAT CCTG AT ATAT CC AG AAG AC ATAT TACTGAGGTTTT AC AAGG CT AATTTG AAATGC AGAT TAAC ATTCT AGGTTTC CTT ACATA T CATTAAAATAGACCATATTAAAAATCAATT T CAGAAGCTG CTTTATGACTGGAATAAAAAA CAATTAA AGATG TAGAGTAACATTGTAT TTCTGT CT GATGCTGGGAGG CTAT TTTTTCATGG CAAGTAATAACTGG GTAAACAGAGAGATACTTTAACTTTACAAATAGGAAT CCTAGAAAACAAGGTTTTGC TC CTAGAAAACA AAAATAGTT CCTTT CAATTAATTTT CTAAAAATGGTTGCAT TGGT CACTTTTTAAA CTC CTA CAATCTA AGCAGCACT GAATTC CTATTTGATTTTTTTTTTTG CTAATAGTTTTCTT CAGAAG CAAACCTGCTTTAG GGTGAAAAAAAGTTTCAGACCTTAGACAGAACCTTCATTTCCTCTTAAGTATGTTTCTCCñTCAAAAAC AAGTATTCAAATAAAGAAT CT CAAT TATCTAAACCTATTTTTGGC CATCTT C CGG AATTT CTTGTTAAA T CTC CTTTGAAATGC CACAGTTGCACT CTTGGAAAACTTGT CTTC CTGGTTTTCAAATTTCTGTTTATT ATTACAGCTCTTCAG CTAGAATGCCAGGT CTTTGTGCTGTACTGTTT CT TGAAACTGGAGACTAG CACA CTTCC CCCAAT TCAATTACTTGCTTTC CATT TGAGAAATTAAAGC CAGGTCAATTTTATAAAGTT TTAA CTGAC TGAATTATTTGGACTTTAAG TT CT CAATTT CATT CC CTTAGC CTAATACT G G GAAAGATCTTGG TTTTATTGAGAGAAGGTAAGGTTTAAGAGTATTAACACATTTCAGTCATTGACTCTTATAATTGATATA AGCGTAGGGATTTTCTAACAGGCACACGCTTGAGATATGACACTCTAAGGAAAAAAGGGAGAGTTCTTA TTCAATTCTAGAAATGTTTAAGATTACATACTAATGATATAGTTTATTTACATAAATGTGTGTTCATAT TTTTT CTAAAAGAAAGGGACCGACTGT CC TT CATAAAGAGCTTCAGT CAGCTTTT CTAC CC TTAT CAGC C TT A TG (N ) xTTAAACTAGTTTCTTAACTATCTCCTTTCTTTCATTGGTAACTCCATTGTCTTCCAGTA TAGAAAATGTATAAT CTTTTACACCTCCCTTTTAT CCAACC CATCAT GATATTCAA CAACT TTTT CCTT TGAAATATCTTAAATTTATTCTTTCACCTCCATTCTCCTGCCACTATTGCTGTTCAGGTTTCTTCACCC CTAGATTACTACAGTAGTCACATGCCCAGTTTCCTTGCTACCATATTTTATTTCTTAGACATTTCTATA T AT AGT ATTGC CACATTTAATTCC AAACAAATC AATAGG CT ATAATG CCTC CCT ATT CC CTG CT ATATC AAGAC CAAAAT CCT CTAATTG GTTTTCTAGG CAAAGG CT CAATAG CCTAGTTCTAAC CTAGTAATACAA ATTCATGTAAC CTTTGC CTTAGGTT TTAT GG CACAGATC CCAGGT TACTTCTGGGGGAATGTTGCATTC AATTTTGGC TTAAAT TC CTAC CTA CAAAC CAGACAAGTC CATTACTTGACAACTCAATG CATTTTAATT TA CAAGCATGCAGGC CACAAATTATAAAATGAAGACTTTTTACTGAGTCAG CAGAAACAACTGTT CAAT T CTCCTGTAAGT CCC CAGTTACTTACAGTTT T CTG CTGAGGTGAT CATCTCATGTGG CATTTATT GC CC TCTACTAAAATGAGCTATTATCTGACACTCTGCTCTGTACAGACGTGTTTCTGTTTCCATTTTCAGTTC CTTCT C CACATGGT CTAAG TAC CAGTAGT TTAGTCAGTTCTCACCTTTTTT GAGGGGAAGG CGGCAATG GTGGTATCAAAATTTACACCTGGAAAACAGGTCATAAAGAAAATCAATATAGAAGGGAAAGTGCTTTAC CAC AGG AATTATT ATTT AATTGCTGGT TG AATTCT TG ACTTTTGGTT AAAAGGT AGATAT AA CAAAT AT GTAAACCAC CCTCAAAG CCAAACCAAAAC CAACAG CAACAGGCTTTT CAAT CAATGTAATTTATACATT GGGTTCTGGAAAACAGAAATGTCCTATAAATTTTATATAAATGCTATATAAGGGTACATAAAAATGAAT C CTTATTATAAATATGACT TAAATGTCATAG CAGTAAAGAAATGGTACTTATTTGAT CCACAAATTT CT TTGCCTTCT TAGATT TTTTTGAAAGTTTT GGTGTGGCTTGGTTTCTT TTTGAAGGGC CGAG CTG GGAGA AGGAGGAAAGG CATTTGGT CT CCTATAAAATTGTC CAAGAAAGTT TATGAGAAAG CCTATAT CACT CAA TAAGGTGGCTC CAGTAAGTTTAGGCATTCAAAAACATTC CATTAATTTTAGAGTTAC CTTAAAGAATTA C AGCAGATC CT CAATGCTTTTGTTT CTGTGATT ACG GTAATGGCATT TCTTGAAAAGTATTTTTAAATG TAAGGATTCTTTATT CTACTACCACCTCCTATCTCCT CAGAGATA CACAAATAAAGG CC CAAAAATAAC AGTTAAACTTTATAGAATTTCACACTTCATTGGAAAGGTTAGAAATGATCATTTAGATCACTAATGACC C CCCAAAAGATAACAAAATTT CACAGG CC TGATTATT CATAC CCC TACCCACACTTTGCTGGTTTTGCT G AAAAAAAAAAAAAAATGACATTTT AAGGTACCTG AG AC CTGAGG AATG AATGTT AATAGC TCAC AC AG AATGAGAAGAGCTCTGATCAGTGAAGTAGTAAAAGGTAGAGGGCACTCCTAAATTATCAGAAAATAACC TGCATATTT CTGTATTTAT CTAGCAGGTATG CAGC CTTTCTACCACTTTAC CATAAAGGTATTACTC CT G ATTTTCAG (N } xTTCCCATAAGACAGCATGCATTTGTTCTAAGTCAGGTATGGAAGCTTAAATGAGAG AAATGGTATTTTTGGCTGAGGAGGAAAAGTGAACTGGAAAAATTTCATGTAGGGGGTAAATAGCTTAAG TTTTAGACAATTGAAGATGAAGGAAAGGACATTCTAGGAG TAAAT AAC“T TGAGTTAAGGCTTGGATGAA AATCTGAGTGAGGTT TGAAAAAG TATTATTT CTCTTTGG CTTAACAACAGTTATTATGGTATT CATTGA CAATTACTCCCATAGTGGTAAAG TCAAT CTTACTTAAGGCAAAAAATATCCTAGATATAGCATTT TGCC TAGG CAACTCTTT C C TCTCAAGATTTCT CAC CTGATTTGCTGAGTTAAAAGCCACAGACTCC CAATTAT TTAGGGTAAACTTAT TTATG CCTAGGTTT CCTTCCAGTATTTCTTACAAGTTTTTTGTTTTTTTTTCTT CAAAG CAACACAC CATGG
>Hsl4_49987035-50000858
CATG CACAGACAGTGGCCACACTGATGCT CT CACTGAG C CATTGGGT CCTCAATT CTGCTGC CACATTG AGTCTGCTCTGGTCCTCCTCTAT CAAGGTT CAAAGTTCAGAGCAGAAGCAGCCAGTTAGCCAAGGTGAG GTTATC(N)xCTCAAAAATGTCTCTCTCTTTCTCTCTTÍN)XAATGGAAAGAAAAACAAAAAT(N)xGA CCCTTTGACAGAGCTGTGTTTTTATCGATCAACTTTTGGCTACTGGGGTACCAATCCAGGAAGGAGGAT GGTGGGGTGTAACATGGACTTTGCCAACAAC TCCTAT CATCTCTATGAAGATGAAAGCACTAGGAGAGC AGGGACCTTGTCTGTCGCACT CATGGCAAGATCTCAGAGATTAGAATTGGGCTTTGTAAA(N)xCCTGG CCCTTTTTTGCAGCTTTTGTTTAAGATGTT CTTTCCAAT CAGGATG C CTCCCCACTGC(N)xGTGTAAA AATCATATCTTC CTATCTTATC C CACTCAT CTGATGACTTCCAACAATGATGCCAAAATACAT CAGCTC AGATGAAGGCACTAGCTCAGGTG CCCTTGGGTTTTTGC CTTGCAACTGATCCACTTAAGACACAGCCTC TGAAAACACATTGAC TCTAAAAGCAAAG CCT CCTACACT CAATAAAT CC TTGTTGGATTTCAG CAAGAG ACTG CAGATAGAGAAGTATTGAT GCTGGAG CTGTAAAGTGCCTTAGAGAAAAGACAGATCTGG C CCAGT CTTTG C CCTCTTGGGTGTTCACATCCTTTCT TAGCTAT CACCCAAC C CAGTTGGATAAATATAT C TCTT TTTGT CTTTCCACATACTTCAAT CCCTT CCTTTCCTGAATCATTTCTGT TACTTTAC TCTCAAG CTTCC CCATTCTGTCTCTTCCTTTT CAATGCTTAATGTCAGGAC CATCCACAGGGTAGAAGGTAGGGGGGTATG CAGTGGAAATAGAGGGGTGAAGG CAATAGACAACAGTTAGTAAGACAGCAATATT CACTGAGCAT TGTC TGTG C CCTGAAAG CT GCAGTACATTTGGGAGTCAGATT CACAGAGCTGTCAACTACC TATGCGGTGAAT TTGAGGGCCTGAAGTAGGGT CT C CCGCTAAAAAATGGC C A ( N)xCTATT ATCTTTCATGGTGTCTGAAA TTCTACCTGTTAAAGACTTAAC C CTTGTGATTGAGGCAG CGACTCTATCCTTGGTGT TAGTCAAGGATA TTCTATTAAATATAC TCTATTATAGTCTATTCATATGCT CAAGTTAAGACATCAGTTGTGTT C CTGCCT GACT CTGTGATAAATGAGATT C CAACCTTGGGCCCCTGTAACTGTT C CTGCTTTTATAATCTG C CCAAT GTCCAATTCCTCTAC CACAGGCTCTATG CCTTTAGTCT C CATCTAGTTACTAAAATC CAGCTT CTGAAA TTCACACCCATGGAT GGAGTTCTTCCAGGTATGCAGTAG CTATACAACACATATT CATTGACTGACTGG CCAGCTACCTTCTACTGGTCTCCTTGATG CTGACCTAGAATGGTGC CCCATATACACCCTCCTCTGTGC TCTCT CACCTGCATTCAGCAG CGGCTTCCTTACTGCCCACACCTGAGCCCTCCTGCCACCCTC CGTGAG AAGACAGGTTCAAAATGTCAG C CAAACCAC CACCCTGTTTCCATCTTCTTGGCTGAT TAAACTAGGTCA TT AAAT CTTTGGG AGCTCTT CT C TTAAA C AC ATATTCC ATGAATC C C CTGCTCT AAAACTTC ACT TTTC CAGGGTCAGACAACAAGGAT C CACTAGG CACATTTAGCAACGTGTACCAGCTTCCTCCCTCTTCTGCAT TGCCCTCTGCAGCTCTCCATGGGTGTTC CAGTGAGAG C C CCCTACT C CTCCCCACTGCCCAGGATCCCT GCTGAGTACAGGAAAATAGT CAAAACTATGACAGAGGAGATAAGG CTACCAGAT CAAAGGGTT CAGGAC AAGAGACGAGTGTAAATTAAGG C TGGAAAAGTAGGATGGTACCAGAT CAAGTGGACT TGAAAGGAAGCA GACAAGTCTGAATGGCAGATTGGTATAAGAGGGATGTTT CAGGAGGGGTTCTGGCCCAGGTACACCATG TTGGGAT GAGAAGGGTTTGGAGG TGAGACGACCAGTGAGGGTAGCACAGACCCCTTTGAAACT CAGCAG AAAGGTAGACCTTTGGTCCAAGG GGAAAAAAATACTGAAAATT(N)xTCTGGGGCATCCCTGAACTTTT AAAAAT CCCTGGAG CAGAAT CTTACAACAAATCAACAC CTGACATAGGGTGGTCAC C TAAATGGTAGT( N} xC CCTCATAAGTGAGATTAGTGTCCTTACAAAAGGGCTCAAGAGAACTAGCTAGCCCCTTTTGTCTT TCTG C CATGTGAGGACACAGTGG GGATG CAGTGCCTAAAAAGAAAAAGAGAGAGÍN)xTAAAGCAGCAG TGTTTAATCC(N)xCCTACTTAATCTCTGCTGAGAGAAAACAGAAATTGATAAGCACGGTAGTAAATGG TACTAATGGAATGGATGATAGGAAGTAAACTCTAGGCCATTTAAGGAGGAGGG(N)xGGAGGGCAAAAA CCAAGACTTTGAATT TGGGGATT TTGGAAATGAAAAACATGAAAGTTTGAATATTT C TAAATAACCAGA TAGATGGTGATGTAT CCAAATGAGAGAGGATGCCTGTGTTTTGTTGATT TGGGTTTTTTGCATTGCTTA TGGAGAGGGATGTAAAGTTTG(N)XTTGGTATTACAGAGCCCTAAAGTTTGAGCAAGAGGAACTACATG TTATAGTTTTTC C C C CAGGC CCAGTAGC CCACATCTCCC CTTTATGTTCTTGGC CAAGTGCTGTGTATG GGAAATAGGAAGAAAACATC CAT CCAAATGTAGGAAAAT CACACTGACAACTGAGGAGAGGATGGGAAG CAGGAAAAGATTTGAGACACTAG GGAAAAAAAGCAAAACAGTAAGGAGGTTGTATTCATGAAC CTAAAT TCCAAG CTGTTT C C CCCAGTGATAAGACAGAGGAAAAAGACAAAATATGAATAT CTGGACTGGCACAGT CAGATAGACTCCAATTGGGTTAC CTCTGGGAAAATAGTAAAATCAC C CTGCAGGGAT CTCAAGGAGCCT GAGT CTTTGAATTGAGTTTTAACAGCAACAAAGCCAGATTTTTTTT CTCTGGTCTGCAGTTGTATTGGC CTCCATGATCTCTGAATAAGT CAAGCAAGGGATTTTT CCT GAAGTGGGAAGCTTTTCATGGCTGGTTTC TTGAGAGCCTTGAACACAATGCTTTGGCAGGTATGTGG CTGCCCAG CAGGGAAT CTTGGAGCTTTGAGC AGCTCTGGCAGTTG CAAGGAGG CAGGTCACACAGCTTCACAAATCTT CTAAAAGAAAAAGCAAGAGAGT CTAAATTTAAAGAGGAGAAG CAACAGCTATGGAAAAGAGGAGAAAAGTATCAAAGAT TTTTAGAAAACA GAATAGAAAAGATAAATTAACTT CAAGCTCAGAAAGAAG CAAAGTAAAAGGAGGAAT TGAG(N)xTGGA GGAGTTTAGAATGATGCCCAAAT TTCCTAT C CAGGGAACTTGGGTTTTGATAGTGTACTTCACAGAAAT AAGGACTGGTGGGAATT CC CGAAAG GTTGGGTG G GGATGGAGATCA CTAGAAGGAGAGATAATGAGCTC AGGTAAGTTTTAAAAGTGTTGTTTTCATGCACCTTTGCAATATACCACTAAAATATATATTAAATA(N) xGGAAGAGACTGGAG GGGTAG TACTTCAG CT CACAGATGAGCCGTGG GAAGGAGAGACT CATC CTGAAA G CATGGCAG CAGGAGGTGAAGGCTGATGCTGATGAGACAAATGTAGACATG GAGCAATTACTGTTTGTC TTTTC CCGGGAAAATGGTAGG TAAGATTGTC CATAGAAAGTGAGGTAGCTGGAGGAGAG CTGGGAGC CT GGAAAGACTTGGAGAGGTGGAAGAAAATGTTAAAGGATTGCCAGAGTTGTACATTGCCTTGACATTGGA T GCTATAAACCTT CATCTATG CCAGTATATGTGTATGTTGTGTGACTGCCT CCAG CATTTCT CAGACTC T CAGGTTTTGTAGATGGAG CT CC CTAGATGTTAAGAAGATATGAATTGGAAGTTG CAGGACAGGTCTAA CAAACAGACAAGGGAACAGAG GAATTAAGGATG CCAGCAAGGAAGATTGAAG GATTGATGTGC CATGAA ATT TCAT CTGGAATGATAAAGAAATGAAACCAGGAAGGGGCTGACAGAATT CAAAGAAACATGAGGGAT CAGGAAA CTGCAGAGTACAAAGAGCAAAAA CAAGCAGAT CTTTAAAACCAT CTTAAAGGACGTGTTAAG AGGTAGATCTTATGTTTTCTAATTTGGGGTACCCAAAATGTTTTTATTGTGATTATTCTACCCTGGGAA ATGAGGGAATGAGAG CAAAGACCTAGTAT GTTTATGAACTGCAAATA CTATTTTTAAAAAGT(N)xCAG G CATGTTGG CG CTTACCTGTAGT CC CAGCTACTTGGGAGGCTGAGG CAGGAGAAT CACTTGAACCTAG G AGG CAGAGGTTGCAGTGAG CCGAGATCA(N)xGTAACTTTCTACCCATCTAAAACTGCTAATTGCAAAA GACATGC CAGTTACTGG CATTGTTATATAAAAAAG CTGGAAG CAGTGAAGC CAAT GAGAAATAAGAT CC CACTTCTGATGAAAACGGACCAATGAGAAGTAAACTCCTGCTTCTAACAACAATTGAAAGCAAAACTTT ATACTAAGATTGGTCAAGGACATGCCAGTGAGTTGTAGATCTAGCTTTTTTAGGCATCAAGTTGTAATG TTATT CG ATTAAT AATTTC CT CCGTGACAGC CAAAGCCATGGT CCAAAAG G AGTG ATCTTCACTGCC CA G CCATGAGAGCAATCTG CTTGGAGATG C(N) xC CTACCTCTTCCTCTCACTGGCTGAACCCAATGGGAT G CCAGAAAATGGC CATGGATG CAGTACATTC CATACAACATAGTGTAAATATGTTTTCAAATCGGAG GT TTG CTGATATAACTGGAAAAT CCAAGAAAAAAAAT CAGATACAGTA CACGTCTCCA(N)xCTGGACTCC GTCTCAAAACAAAACAAACAAAAAATTTTTTTAATCTGGCAACACCAGATCTATTTTCTGTAGGGCAGC ACT CAA CTGGAACTGAGTAAC TG CAGCTG CT CTAGACAAGGCATGC
>H sl5_29402690-29413501
AG GAGAGGCTG GCTTTC CC CACCATGG GGTGGC CTGGGGGCTACCTACTCGTAGGGCCTGTGAATATAG C CCTG CC CGGGATGAGTTTGATG CC CC CT CAGT CTGCTGGTCT TCCC CAAACTATAACC CCATGAGGAG CACAGGAAC{N) xTGTGGAGCACTGGTGGGGAGAGGGGGAGAAGAGATGAGGAAGGGAAGGACTCCGGT TTGTGTTTTGTGGGGGCTAACAT GAATGGGAAAGGAGGG GACAGCCTGGAG CAGC CCAG CCAAGCCCAG GTGCAGGGTTCTGTGCCTTTCTTCCCAGAGAGCTGGAGAGTGCAATGGGCAGGTGTGAAGTCAAGGCGG AG GGCAG CAGACGGGGACCAGGG CT CG CT CCAG CTGCCTTCTGGGG CAGGTG CAGAGGGGTCTGGAGTA G CT CC CAAAGATGGGGCAAGGTGGGTC CGAGTGGC CAGGAAGCTGAGGGTGGTGGTGCGGAAGGGGGTT T CT CAGAAGAT CAGGGGGTGCAGGGTTGAGGGCAG CAGCAGACTTGCGGGG CCTG CAGTGAGGAAGGAG T CATGTG CAATTCTGAAGACAACAT CT CCGGGGTGGCAATACGAAAGAGAGG GTG CCAC CCACGGCT CG GTGTCTCTG CC CTGGGT CTAGGGTCTGAGTT CCGGAGTGATGGAGGGGGCCAAGC GTCTGTGAAACTTG GGG GGTGGT CTGGGT CACGAC TGGGGACGAG GAGG GTCTGGAGGTCC CAGC CCTG CTTGTGGATGCG CT GTTGTTCCACCCTCTGGCTCTGCCTCCCTGTCTGTGAAATGGGGTCCCACTTCCTTGAGGCCTCAGTGC TCAGGATGCAGGGGAGATGCTGGCAGCCGAGGAGAGGGGCAAATGCTGGCTGTCAGGGGCTTTCTCCAG GATACAGGACAGCTCAC CCTGAGAGTG CCAGGGAGGGTTAGATGAAG CAAATGAATAGG CATGGTTATG ATC CATCTC CC CAGTGATC CAGAAAATGG CATG CAGGCGGCCATATG GATCAGT CACCATCTCTGAGTG GCTCGGCTGCC TGGCGGTAAACACCGGGC CCTT CAGGCC CCCAG GTG CGGACCACACCC TCTG CACCTG CTC CATGAGTG CC CGTGGT CATGGTGG GGGT TGAGGGTAGGGGTCTGGCTGTCCC CATTTGGCCCTCTG T GCAC CTAGACACAGAGGCTG TCACAGTGGCTG CCACTTGTGC CCCTGCAGGCCAGGGCTCAT CCAC CC CAGGGCCATGTCCAGGCATCTGCCTGGCTGAGGTGCACTGGGGCTCCACTGCCCAGCCGGGACCTCCAG GGAGCAATGAG CCTGGGATAACATT CAGGAGAC CC GAGGGTGCTGGCTACT CTGGTCCAAGCC CCTC GT C CCTCACAG CGTG CT CAG GATAGGACC CG CAGT CC CAAGGGC C CTGG CCTCTACTTTCAGGAG CCTG GA ACC CCGGGCTC CTGCTGTC CC CGGC CCTG CAGCAGTGGGGCGCTTCC CTTGGTGT CACCTCTCTGGG CA CCCCCATCCCCACCATGCCCAGTGAAGCTCTCACCTGTGTCCTCCCGTCCTTGCCAGGCCCTTCGCCTG AAGTCCTAGCTCCAGCTGGCCTGGGGCAGCCACTCGGCACCTTGTATCCCGGGAGCTGGTGGGCAGGGA CTGGGGTTC CC CT CAGC CT CCATGTGC CT GGCCAC CTGGGGAG CATATGCAGGCG CTGGAGGATGCC TG G CTAGGAGT CTAT CAGCTC CTGT CC CGAC CTGCAG CGGTTCTC CCTGATGACCACAGAAACATTCCCTG TGG CAGGTGGCAATG CTTAGCAC CCAGAG CGAG CAGTCAGATGAGATATGGGCCAGCTG CAGAAATCTG GGTGC TGGTGTGTAGGAGACGGGGAGG CC CC CT CACAGTTGTCTGTGTCTG CATGTGTGTGTA CACG CA TGTGTGTTTGGGACAGGTGAGTCTGTG CTTCTC CCATCC CAGG CCACTCGC CTCCTCCCAGGG CTCAGC ACAGT CT CCTTTCTC CAGGATAAGAGGGC CCTGGGGTGG CGAAGGAG CTGC CTTG CGGTATGGAATCTG C CT CT CT CCAC CC CCTT CT CTTCTAGT CCTAATTCTCCTTGGACATC CATTTTTC CGGTTTTTGTTTTC ATTTAAAAAA(N)xAACTGTGGTAAAATATATCTGTAACTTAAAATCGAGCATCATAGCCATTTTGAAG TGCAGAGTT CTGTGG CAGT CAGT CCAT CTG CATGT CAGG CAGC CATGGCCACTGCGCTCTCCTGCACGT CTTCCTCCTCCCCAACAGGAACTTTGTCCCCACTAAGCACCACCGCCCATTTCTCCTTCCCAGCCCCTG GCGGCCC{N) xCGGCGCCTGTTTTCCATTCTGTCCAGTCTCGTGGTCACAGGCGCTCCCAGGCCCTCCT GTC CATCTGTTTGGCGT CACTTTTAGG CT G GAATG GAAG CCCC(N) xCCAGAACCAGTATATTTGGCAC AAAATGC CCTT ACGT CACTTGGCGGGGGGGGGATGTAGGAATAAAGTGACTCTTC CCTGTCQATGTAGA CCATTTGGGCATCCGGCAAACAGAGCCTTCCCTTGACAGAACAGATGCCCCACCTTTCTGGTGGCCAGG GGACAGGCTGCCTCTCCTTTCCCGCCCGCCCATCAGCTAAGTGCCTGTCTTCCAGGATGGTAACCAGGC TGTGCCGTAGGAGGC CAGGTGGCAGTGGC CTTGGGCTGTGT CACCAGTCTCTAGC CTGACCACAAAGGG AGCGGCCCTGGGGGAGGTGCAGCTGGGGGACAGGTTCCACCGTGGGAGTCAGAAAACTTGGTGGTTCCC CACACACCATGGCATTAACTCTCTTGTTTTAAAGAAACATACTGCATATCATAAGGATCTCTGTCTGTG AGTCAGCGAATTGCACGTGCACGTATTCATCATGTGGCGCTTTTCCCTCTGCATGTCCCATCTTTGGGG ACGTGGTGGAGCGCCTGTCGGGTGTGG CGGGAACACGTGTG CTGACCTGGCCATGTCTG CT TAGATCTG CAGCCTCATGAGAGGGGGCATTGCTGAGCGAGGGGGCGTCCGTGTGGGCCACCGCATCATCGAGATCAA CGGGCAGAGCGTGGTGGCCACAGCCCACGAGAAGATAGTCCAAGCTCTGTCCAACTCGGTCGGAGAGGT AAGGAGGGACTTT GAGTGTG CCT CTGCATG C CGGTTCCCACGTGCTC CCGCCTGC CCTC CATGAGCCTC CCCCGCTCCAGAGGACACAGGGCATCTGAAGGTCAGCCAGGCTGTGTCTCCCATCGGGGCTGCTGTGAC AAGGGTGAATGGAGCGGCAGGTGATGCGGAGCCATCCCGGTCTGGCTCTAACTGAGCAACCGGGAGGGC ACAGGCTGCTGGGGGTGCTCCCTGGACACTACTTTTGGGACTGGACCTGTATGTCCAGTTCAGAAGCTC TGCAGGGCACACAGCTCTGGCCACCCTGGCAGGCTTGCCTCTCTGCCCTTGCTGGCCATCCCCTCCCAC AGCCAGG CTGCTTGC CCTGTCCCACTT CACTGGCTGAAC CC CC CACAGTCCAGTT CTTT CTGG CATCGC TTCCACGGCAGTGTGAGGATGGTGTGGGGGAGGTGCAGGTTTTCCTCCAGAGCTCACAAGGAGGCAGGA TCATTGGGAGGGGGGATGGTGCTGGGGAGGTGGCACATACACCGGGGGAGGCCTGGCCAGTGCGCAGCG
G(N)xGGATGTGATGTATGAGTGTGGGTAAGTGTGGGTAGGTGTGGGAAATGA(N)xCTGTGGGTGTGT GTGAGCATGGGGTGTCAAGCATGAGTGTGTATGTGTGCACGGCTGTGGCTGTGGGTCTGTGAGTGGGTG TGAGGCATGGGGTGTGAGGAGAGTGTGAGTGGGTGTGTCTGTGTGTGTGTAGGGGTGTGTGAAGGGGTG TTCAGCACAAAGGGGCTGCAGGCTCTCTGGGAAGCGTGGTGGCCGCACGAGGCGGGAGGCTGAGAGCTG AGGAGCCCGGCCTACAGAGGGGCCCGGTGGGGACCGGGCAGGGAATTTGGGTCTGGGGACAGCAGCAGC AGGCCGGGCCGTCTGTGGTGCTCCCCCTGGTGGCTCTGTGAGGCTGAGTGCGTGGCGACCATCCTTGCA GCCTGTGGGTGGGCGGGTGGGTGGAGGGGTCTCTAGTTCCTGTTCTTCGGATGAGGTCTCCAGGAGCCA CATCGGGACACTGGTATTTTGTGCACTGCAGGGTCGGCTGGGAGCTGGGAGGCCTGTTTTCCCTGGACT GGTGGGGGCAGGGGTCTGGCTCAGGAGGAGGAGGGGCCAGGCGGACACCAGCCTCCCTTTGATGGCTTC AAAGAAAGGAAAAGCCAGTGTTCCCAAGTAGATGCTTTGAAGCGGCCGCCCCCACGGCCAGCAGCCCCA CACCTGT GAGGAAAT CCACCCGTGGAGGAGCGGGCG GGAGAACGGGGAAGAG GCAGGAGGCGG CCGGG G CGTTGGAATCCTTTCAGCCCTGCCGAGGTTATGGTGCCTGTTTCCAGCAGCTCTGAGCCCACGCGGAAC ATGAGTCGCTGATAAACAAGGCTTGCAGTTGCGAGGGGCCTCGGGGTGGGTTTCTGGTTGAGGAACGCC GGGGAATGG CTGATGGAGGAAGC CC CTGC CGGGAGCAGGAG CAGCGGAAACCTACAT CGGC CA CACAGA TCCCGGCGGGGGAGGGAGGGCAACGCTCAGAAGCAGAGCTGCGCGATGTGCCTTTCTGGGCTGCTCAGC CCAGGGCCCGAGACCTGCTTGCCTGAGGAGGGGCAGATGCACGGCGGCTGCACCCCGTGGGGAGGCCAG CATGCCTGCGGCAGCTGACCCTGGCAGTGCGAGGGCCCCCCATCCGGAGGCCGCAGCCTAGCACAGTCC CGGGCAGGACAGGGGAGAGCGTCGTGCTCAGGAGAGGAGAAGTGGGCAGTAGAGCTCCCTTTCCGTTCT TGAAGCAGACGTT TAGAGTCTGCTC CTTC CCAGTGTCCG CAAAGAAGGATTACAAGG CGCGCCACCCAC ACGGTTGGT CT CC CTGGGGCGGT CTGAGG CCTTGCTTGTATAC CCAG CTCCTTTGGAGT CAGGGCTCGA GGCCAGGCCCCAAAGGGTCATTTGCTCACCGATCCCCAAAGGCTAAGAACTTGAGGTGCCCTGGGCCCC AGCGAGCCTGTTCCCAGCCCCCAGCACACTTTCCTGGCCCGTGTGTGTTTTGTAAGAAGGGGTGTA(N) xGGGGTGCGTGTGTCTTCCCCCATTGAGCTCCTCCTCAGTGAAGCAGAGTGCATCCCTCGCAGTTTCTG CCGAGGCCTGGCCACCCCCAGCCAGCCCCAGTGGATGGGCAAACTGTTGTCACCACTAGAAGTCCAATT TCTCACCCCTCGGGTGTCAGCTGGACTGGAAGCAAAACTAGCACTCAACTGTATTGACTCCTCTGGAGC CAGGGGTGGGGACCTGGAGTCTCAGTCTGCTGCATCTGGAAGGCATCATAAGTGTCCCCAGGCCTGGGC AGGCTGGCCTTCCTCCTTCACTCTAGGAGATGGGCATTTGAAGCAGAACTCTGGGGGGTTTGCCTCTGT CCTTTGCTTTCAC CTGATTGTGGGAGG GGAGGTG GGAGGGCAG CGGCTCAGCCTC CTGTTT CTGTCCGC AGATCCACATGAAGACCATGCCCGC CG CCATGTTCAGGCTC CT CACGGGTCAGGAGACC CCGCTGTACA TCTAGGCCACCCCAGCCTGGCCACGCAGCCAGGACACCGGGCAGGGCCGCCCGGGCCCAGAGGAGCTGG GAGCCGGGC CG CAGACTTGACCC CGACGC CACAGCCCAG CCACGGACGCTGGCTC CC CAAAGGGTGTGC CCTCACCACCCACTTGATTTTTTTCATTTTGCCAAAAAGGGGTATGTCTTTATCAAAGGAGAGTCACAG AACAAATGT TTGTTTGTAAAGCGTT CCAAGTATTTTGCCACGTTCTGGACTG TCTTCTC CCTG CACAAG CCAGGGTGTGT CT CGGTAGCTGTGCGTGGTGTGGAGTGTGTGT CTTT CCTCCCTGAAGCTGTG CGGAGC GAACTGG CG CCTC CGAGGGACGCGG CT CC CGGGGCAGGG CAGC CGTCACCCCTGCCTCCCGCCCCCTTG GCTGGGACGTCTGGGGTCCTGTGGGGCCCCCACAATGGTCCCAAACAGCTGCCTCTGCCACTGACTGCA GGGACAC GGGCAG CCTGGCTCCCAGGACA CGACTTGTAATGAAAGTTTGGGGACATGTGAT TGATTGAT TGATTGTAAATAAAGGATGATGGCCACAACATGAAAACTCCATATTTATTTAGATGCTATTATTACTGT TTGGACTTTTATTTTGGCAGGCTTTTTTCCAGACTCTAGGGTTTTCCAATGTGACTAATGACCACACCT GCCTCTCCCGTCGTCTCTTCTGGGCACCCTCCCACCCGGCTGCATACCCGGGCAGGGCTCC CACAGAGA CAAGGAGGG CACAGGTGTCTGCC CC CT CTTTAAAATCGATCTACACACATCCACG CACATG CGACCCCG AGGAAACGAAACC CACTCTAGAAAACG CGAC CTTGGCCG CACCTAAAGCAGC CAG CCGTGAGTGCAGAC CCCTTG G CCAG CGTGGCGCAGTG GC CC TGAG CAGTAGTG GCATGT GTGTAGATCAAGTC GGAT CTAGTC C AGCT CGGTTC ATTAGCGATC CATGTAAT CTGACGTCAT CTTGTCT CGAAGTCTCTTTTTTTGGC CC AG GCCTTGAAGAATACACTGTGACTTAAGAAGCCTTACCACGCAGTAACTAAAGCTTTAGGATGACTGTAT T CGAGGAGTGC CGTGTGTTGCATGCAG CTACC CGTAGGAAGAC TT CGCG CATATCAC TAATAAAC CTGA AGTCGTGATGAAAAGCCGTGTGTGTGACTGGTCTGTTACCTCAGCGGCGGGTGCCCGCCTGTCCTTTCA TTCACAGCTTG GATGCG GCT C TG CAAATT CACTAT GCGGTGGCAGCCACAGCTGCTCTGTGCTCCTCCT GGAT C CTGAAC CTTGGAAG CTGT CACTAATGAGTT CGGTGGGTGGGTGCTCTGGGCACCAGGTGT CAGC TGGGCAACGCC CCGCTG CAACTGGAGGTG CCAG CAATGCTACCAGGTCACG GGGT CAGCGC CAGGTT CT TGACCCACTTGCCCCAG CAGGGAGACACG T CCG CAGAGCACTCACTAAT GGAATGAG GGAGCCAGGGAG C CTTGGG CTTCTT CAGATTTCAACGTAAATGGC CC CTGGGAAAGGTGTG CATGCGTGTGCGTGTGTAAA AGCCGGACCG CCCATCCCAGCACCCTGTCTGCACCTGTGGGTCCTTTGCAGCCTGTCTAGAGTGACAAT T CTG G CAGCTCTG CCAGCACTTA CCTTTCTTCATGG CCA CCGTTCTGG TGT CACCTGGTGT GGGAAAGC CAGGTAGCAAGCACGGGCTTTGTTCAGAGGCGGTGGCCTGGGGTC CAGCTGGGTT AGGCTGTGGGTCTT GGATGGG CAGGAGCCCTCCTCTCTGGC CTGCCCACAGCAGCGTGAGCTC CTTGCTGTGGGCACTGACCA CCTTCACGGCCCAGGATTCTTGG GGAACTG CAGAGTAAG GAGAAGACTTACTCTCAC CTAGCCGCCCAG AGGGTGTGGGC CAAGGG CTTGCTCCAGCATGACCTTGGC CATCAGGGGCATATAAAAGAGA CGGCCTTT CAAGG CG CACACCAGCTTT GTGC CTCC CT CTTC CCAGAñAGTATACATGGTAGAGGCAAAGAAAG CTGT T CCATAGGAGACTGTCACAGAGAGCTG CGTATATG CCGAAATGTATATG CACTCG CT CTGAAT GAATAT G C T T (U ) xTACAATTCTCGACGTGGCCAACAAACAAGAATGCCAGGGATGGCCACATCCTTGGGGAATG AAGAGAAATGT CG CCTGAGTCAAAGCTG GAGGGAACGGGATTC CACTCAG GAGGC CAAGG GGGACATTC CTGCCGTGCCCAGGACAAGGCGGGGAAGCAGGTCCCGGGGAGAGAGCCCTGGCTAGCTGAGGACATGCA GGCAGGACTCC CACCAAAT CATACACATCTTTAAAAAAC CTGTATTTAAGGAAACAT CTGAGC CATG GC CTGCATT CCTCCCAGCAGCCCTTGGAGGC CATGTC CATGGAGG GG CTGC CATGGC CAGCGTGGA C GGGC GTGGGTGTGTC CAGGCC CTGGCCCTCCCCACTCTG CCCCGGAGTTGACC CT CTCCGGAAGG CT T CTGGA AG CACAG CTGAGTGGCATGGGTATCCCAGGCT C CC CGGCTGGCTACTGCAAGGTC CTCTCAGGACCCTG CCCCTGGCCACTGGGGTACACAGCCTCTCAGCCTGCCTCTGTCCCAGTTAGACCT CAGAGACACAGAGG C CAGAAATACTTCTTTTTATTAAAACACGGATCATTATTAAAACACCTTGAG GTACATTAAATAAATAC AGCCTTT CCAGTTGTACAGACAGGTCT CTGGTG GC TTGAAAACAATTTC CTATAAATTCTG CCTTAG CA GCCTCTGAGAGTCAGCATGGGTGGGGAGAAGCATGTGTGTGTAGCGAGCTTTCTGTGGCCTGGTGTGTT ACAAG CATTTAAT GTGGAT CCTCATCT CCAGCCAAG GGAGACCTGAAGT CCTTGCGGATGATG CT CC CC CTTCTGC GATCTT GCTTGC CATAGAGTTG GGGTCCATGTTCTCCCCACT CAGCAACC TTGTGGCCCCCA GGCTGGGGTCAAC GGTGGTGTCACAGCAGGTGC CAACCC CCTTTG GGTCTCAGGGGAGGGCAGG GGATG C CCGGGG CCCAGC TGAGTTGGAGAAGGAG CGTGTCACCG CCAAAG CAGC CACAGTTGGCCCATTT CAGA T CCCAGATGGAACATGATTGGAG CGGGGGAAAGACGCATAGGGAGGGGAAG CAGC CAGCTT TCTCCTGG G CTGTGAGCTGACGTG CAGCCCGGGGCCTGGGCTCTTGCTG CTGGTACC CGGATG CCGTGGAGCCGGGG C CAAGGGGACCACAGCTTCTGGGTCAAGT TGCCAGCACATTCC CTAGTCTACCAG CACCTCTCTCTTCT GGGGACCTTCC CAGGAAGC CCAGAAC CTGGACC CT CCTAGAGT CCATGAGT GGCC CTGAG CAGCGGGGC TGATGGGGGCACC CCACCCCGTCCCTGAGTGCCTCATTT CGAAGGGGCAGAAGGTTC CAGC CCAGCGGC ATC AAGTTGGGTACAGTTT CCGT ATCC AC AAAACAGAGCTG C C AC CAGCGC AAAC CC AGAG CTGGGTGA AG GTG GGGTGCAGGTGAGGGG CCTACAG GGAGG CACTGCTGGGGCAGCAGC CCCATTGGCCAT TG G GAC CGGGC CCTGCTGCAGAGAAGGGAAGGATG CTGAGCAGCAGG CTGAGGGGGTC CAGTAGGCCTCTG CC CC GGCTGCATGA
>H sl5_98411055-98420737
TTCTG GAACTCTCTACCTAAG CC CATG GTGGAG GGGGGCTGGAGGA CA CAG CACAAAAATGTGAG GCAA ACCTGAGAAGTTAGAGGAGAACCACACTC CACACT CAATGCACTGGCAGAACATTTC CAG GAGGGAAGA AGGAGTCTCGTCT CAAGGACTTTGTTCTG TCAACGATGCTGAAGAGCTGGCTGCAACTATC CAC CTT CA CAGCTGACCCTGTTCCCGCCTGCTGGTGCCCTGTCCACACCCACTTGACAAACCACACCCTTTCCTGGT CACTAAC CAGGGC CTGAGGAG CAAACAGCTCTTTGTTCTTT CC CGACATTGTTGCAGGCTGAACACTGA ATCAAAACACATG CTGGAAGT CACAAAGGTGCT GCAGCACCTGGACCAGGT TAGGTT CAGGTCAGAT CC AGATG GAAGAG CCAGAC CT CCAC CCTGAC CTTCACAGAGAT CC CG CTTACAGTTC CTGCCACATTGT CT C CAACATTCAATGAAGTTT TT CT GAGA CT GAAAAAAAGACTGA CAGCCT CCAGCAAAGGTTTAGACTAA TTTGGATAAAG CTTAGAGAG CACAAAAATA CG TAAAGAA CAGAGCAATTTCG TGTGCTTTCAGGT CC CA AAACGAATATTAAAAACAGATAAAATTTACTTGGGTAATTCAC{ N ) xTTTGTCTTTAATCCCATCAATT ATGGAGGGTAAAAATAGGACTTCGTTCTAGTTATC CAGATAAC CAT CTCATTTCTTGGCTTCACTTGAC T GAG G AA GACCTGG CCT GAAG AAAATATGGGTC ATTCAGTACATTTTCCTTGT ATTAAACG ATTAGGGT GTTCATTTATTTT TCTAAAGAAATAA CATAGTTAACTGATGTGGA CAATTT TACTTTAAATGTGATC( N ) x GATCTAGTGAAATTCAAGATTCAAAAAGTAAGTTATTTCAGTATTTTTGTATGGATTTGCTTTCTTT ACATTTTTAACCATGACATCAGAGTTTAAGGTGTTTTTTTGTGTTATCTGCACCACATTTTCCAATAGA AATGT AGGCTTTAGACTTCTAG AAGGATATGGAAGTTTC AG AT CAGTCT CTCAATAGGGAT TC AGTATT T TTTTTGGTGAATATGG GCAGAAGGGG CAGTCACGTGCCAGAAACTTACAGGAGATG CAGAGATG GCTT ATGGAAATTTTTT TTTAACTG TTAAAATT TTCAAGT CC CAAAAGGTGCTACATCT CAAGG ATC CATATG AAAAAAAGGGAAGTGCAAGATGCTTTATGGCAGAGCGAAT(N)xTTCCTGGGGTATGGCAGACACTCGA TG GAAACTGATGTGGGTTTCT CTGGCTCG CTGTCTAACCAGGATCATTT CC CACGTGGCACAAC C CC AC CAGCACTACACTGCTAAACTTGCCCCGTCAGCAGGCAGCTTGGGGTGTATCTATTTAGCCATTTCATTC AACATAGGTGAAC CAAAACAAAG CCCATC CAGGAG CTA TCT ( N ) xCTCACCTGCTGGCCCCCTCTGACC AGCCACT CTGAGTGC CACATGG GTCCTGAGCATGACACATT CATT CCTCAC CTGCGTTG CCCTTACTGT TGCCCTG{ N) xTACAGAGCTGGGTGCAGGCTCTCCTACATACATTTGCAATTCACCCAGGCCTCCCCAT AGCATGCATCAACTCTAATGAGTAATATTTGGTTATCCCTAAACTTTAATATCTTGCTCCCTCTG(N)x T GAGTTT CTGACAAATAGGGGGT CCAACATGACTTTGTGGAATGG CTGAACAGAATTGGTTCTTTTAAT AAGGCTTTTCTCTAAAATAAGCAGCAGCCCCCTAAAGGGCAGGAAACTGATCACATGACTACAGCATGT T CCAGCTGCAGGAA CAGGAGT CAAAAAGACTCAGC CTGATGAACATAAAAT CAAAGC CTAGAAGAGAAA ACTAATGAGGGGCAAGAAGAACTTGAATCCAGGCCCAAGTCTTCAGGCCTTAGCTTGACTGTTGCCTTG TCCTTGCTGCCCCTTCTGGTGGACCTGATGCACCCTAAACACTCCTGGGCCATGAGCTAAGGCTCAAAG CATATTGGT CACATGAGTGT C TGAAGACGGCCTCTGTGAGT CC CT GCAGATTTCCTTACACATTT CACA AGGAAGGGCAATGCCTTCTTTGTGCCTCAGACCAGAAGAGTGAAGGTTGACAGCCTTGGAGTAATTCCC AGGAAAGCTGCCTGGGAGAAAATGCCACCATGGGCTTTCTTCCTGATGTTGGTTTGGTAGGCAATGGCC CCAAGATCCTCCTGAGGCCTCTAGCCAGGAGGGATGTGTGGGATGCCTCTGCCAGTGTTTGCCCCATGC AAGTTCCATGCAGAAAAGATTCATGGGTCCCAAGGAGACAGAACTGAGAGCACCCAGGCTAGAGAGCAG AGATGCAGAGAACCCAGCCCACGTGTAGAGAAAGCCCCCTCCCTGTCCTATGGTAAGGAGCCCTCATCC CCCACCAATGCAGGGTGGAAGATGGAGATGGAGATG(N) xCTCGGCCTCCCAAAGTGCTGGGATTACAG G CGTGAG CCACCGCG C CCGGC CGGAGATGTTTTTTGAGAAC CCAC CAAC CT CAAT CACAGACTGAGCTC TGAGGCATGGCAGTCAGAGGCTGAAAACATCCACAGAGAGTCACCAGAGTCAGGATACAGAAAGGAAAG ATCTCGC CACAGGTGTGTCGGGTTAAGGAGAGGGT CAG GGGAAG G CAGG CAATGCAAGGAGATGAGTCA T GCGGATGAGCCC CAA CCAGAAACCATGG CCTGGCAG CCTTGACCAGGAAAGCCTGT CAGAA CGC CATC GTGCTTCACTGAATATTCTCTGCCTGGGATTATATTCAAATTAACCTTATTATAGTAAACTGCATTTCT AGTGCACATTTCAGAGTTTGAGTTTAATCATTCACAAATAATCTCTTTTTCTACCTTAAATTTGAAGTT AAAAAAC CG TT( N ) xAGCCCTTCAAGCTACCCAGCATTTTCAAGAGCTATATGGGATTGATGGATATTT ACTGTCTGTATTATAATCAAATCCCAGTGCATTGCCAGGAATTGGTAGAACTAAGAACTTTCAGGATGT AGCCTTAAGTAGGCGTATGCCTGCCAAGGTGCATACACATTGCATTAAA( N) xCTATGACAGCAGCAGG ATTAGCAAAACAT CT CAAGAACAAAGAGATGGAGT TCAACT CTGTTCTACACGCCAAAT CTGC CTGCCC CCTCCTGCCCCTTAAAAACCTCCTTTGTAAAAATATATGCCCCTTCCCCCTACCCCAGGGGACTGCTTA C CTGGCGTTTTACAGGACATAGG TAAAGAAAGCCAGAAAGG CA CAGTGACTTGGGAG GCAACAGGTGGT GTACAGGACCTGCCACAGCCCCGAACTGCCTCTTGCCAATTTCAAAATGAAGAGATGCTGTTTGAATGA ACAGGAG CAAATGGGTCTTCT CCTTGTCCT CACTC CT CACT TT CGGCTT CACTGCTC CAAAATTAAAAA CAAAAGCAAAAGC CTGTATCCAGGAGGAC GCCATGAGAGTG CCTGACCGAT CTTGGTGGACGTTC CCGC CAGAGCT CTGTGC CAGGCTGC CGGTGCTGTTTATT CT CCCTTGGCAGG C CCGCAG CATATGGTGC CTCC CTCACCTCTTCGGCGTCTTCCACGTCACA TGCAGTGAGAGACT CCACTCTC CTGGGGGGTGTGTCGGGC C CCTCACTGGGCCTGTTGCTGTATCAT CT GACAGATTACAGG C CG CATATT GCTGGGCT CTGC CT CTGT TGGGAGCCAGCTTAGTCATTTAACAGTTCATTCAGAGAGGGAACCAGGGTGCCCTGTGACTGCTGACGT GATGGATTT TGTC TGGGGAGGAAAGGAGG CACAGC CC TGCAAAAGTCTACAAATC CCATGGGTGGGCCA AACTTTGATGACTTGACCTAGTCGGCCCCAATTTATCATAATAAAATGAAAATAAT(N)xCTTGACTCA CAGGCAGCAAGCGGCAAATTTGAACCCAGTTACGCCTCTCCCTCTGCAGCACTTGCTATCATGCTACCT CTTCCAG CATGTCTT C CAAC CGTTATTAGG GTAAG CTTTCTAAAACGTGTATTTT CTAAAATG CACCAA AACTAAT TAATA CTAATATGACTTTTGTATTTAAAAAAAAT CACTTATT CATATCTACAGTAAAG CATT CCAGAGTGCTAGTGGTTCCCAAAGAATTCTTCTTG( N ) xTTTTTTTTTAAATAAAATAATTCTTCTTTC ACAAAATATTAATAAATGTCCTGCCTTGGCGAGTCTTCCAACAAAAGTGGCTTTGTGGCCATCTGTGCT G GGAATACAGGGT TGTATTGGAAATG(N)xCACCTGGACAGCACCCGAAACGCTGGGCTCTCATAACCT CGAGCTT CCATCTTAATTTCCAGGCCCTG CCTGGT CAAGTTTC CC CCTTGTTATT CAG CAGGAACATTT TCTCTTGAAACGGACGGTTGGAATCCTGGCCTAAACTCAGGTTCTCCCTCCTCATCAATGTCTCTCTCC ACCTCTC CTACTAGAAGCCCTAC CTCCAAG(N)xGAGCATCCTTAACATCCCAGCACCCCCTCTCCTGC TGCAATT CAGTCCAATTAGCC CCTGGC C CAAAATCTAATTGTGTG CAAAGCACAAAT TAACACTAAACC GGATCAGTGTTGTTTCTTTTTAATGCTGAAACAGTTCAGGGCTCACTCATCCTACTTCATCAGCTAGAT TGCAAGCTCCATGTAGGGATTTTCTCCCTTTTATTTAAACCACCAGGCTGCCCCTTTTAGTGGACCTGA TGCGCCCTAAACACT CCTGGG CCATGAGC TAAGACT CAAAGAATATTGGTCACATGAGTGTCTGAAGAT GACTTCTGTGAGTGC CTGCAGATTTCCTCATACAT CC CGCAAGGAAGAG CAATGC CC CATGT C CATGAA CGGGGACCATGACTCCCCACTCTGGCTGCGACCTTCTTTTTTATTCTCACCTTGCTGAAAA(N ) XCCTG CCTGCCCTGCCCATAAATGAATTAAGGTTTTGCATCCTGAATCTTTATCCTCTGGTTGAAATGTTCTTA T CCCATGGTTAGC CTTTCAAGGC CAGGTT CAAATG CTCTTCCCCTCACTGGACATTC CCTTAT CACATC CTTTACATG CCTC CCTCCTGAAGAGTCTGG CCAAGTCACATGG CCTGGCAC CATTACTAATTG CC CACT GACCCATGG
>Hsl7_26267494-26272569
GACC CTATGGAGTGAGCAGGT CATG CTGCGT CT CTTTGGGCAG CATTTGGTCACAAGATGGTGACTGGG CACCTGTGTGCTTATCAGAAGTCACGGCAAAGTTTTGCAATGAGAAACAGGAAATGAAGGAGTCCGCTT
TTAATTTCTAGGGAGAGATGCTCAGTATT( K ) xATCTGAGTTTGTTGGGCTAGACACATTGGACATGTT ATAC CCATGAACC CTGG CTGACAAGGTGG CAGC CACACAGC CAAT CT CC GCTACTGATAAGATAAAGGA CTTGGCCTATGACATATGACT CT CC TGACAACCACTTGATGGTTACCTAGCGAAGGAGGTAATTAACAA AGAATCTTACAAGAAACTCGTGTTG CT CGGG CAAATG CC CGGC CTGGACATCCCAGGTCCAGAGGAGTG GAAAGAACCCAGC CAGTTC CAACGGGATCTG CAAC GTACTTTCTGACCT GTTGTTCTGACGGCAACAGA GCAGGACACTGTCATCGCCTGCCAGCCTCCAAGTCTCTGTGACACTGTCTCAAAACAGTCTAAAGCTTT AAGTCCCCTCAAACCTCCCACATCTGCCATGTGGCTGGATCACTCCAAAATCAGGACATGTCCCTAACC TTCTATTAAGGGCATTCGACTTTATACCATGTCACTGAGCATCACTCAATCCTGCCTATAAATATGGAT GCTATGCTAACAAGACTTATTGCTGATTTAATGGCAGTGAGTACAGGCTGGTGTGCTGCACACATACAC AGAGAATGGTATTGAAAAATGGGAATGATAATTAGAGTTTAATAAATGGTAATTTGCTCCTTGCTGCTG TGTG TGACAGGGGTTGGG GAGGGATGAAATAGAGGGG CC CTTC CAGAAAGTGAAATGTGGATAGAACG T ATGCCATGACAGCTTTATTTTAATTTTCACATTGCCCAGGATTGAAGTTAATGCTCATTTACCTACAGA TCCTTTTCCCTGGAAATCACCTCTTGTGGAACATGAGTGGAGAAGACAATTAGACAAGAAATATTTATC ACTG GGACCAAATTTTAACTAGAGT CATAGAATGAAGAGGT CAGGG CTC CAGAGACAAAGTGAAGCAGA GAATTAAAATAAT TATCTGGATCAT TCAGGGGAAATAAAAT CATTGTTTTTGCTTGAGTGAGTGTCTGA ATGATCTAAATAATTGTCCCCACCCTTTGTTTTGGTCTGGTAGTGAAGACCTGGCCAATCGATCCAGTA GCAAAGGATGGAAGTTGGAAGTTTGTGGCGAGAAAGGACAGGAAGGCTGAGTAAGCCTGAATCCCCCTG GTCCTGCAGGCCTGAGCATTGGG CAGAGG CTGG CC CAGT CCAGAG CTTGGGCTGGAGAGGGAAGGATGG TATTCCTGGAAGCTAGAGACCAGAAAGTAAGCAACGCTGAAAGTCGCATGGGAAACCAAGGTCGACAAA CCCAGAATACAGC CAGAAGGAGC CAGGTTTAGT GAGTGAAT CCAGGAAG CTAAAAGATTAAGCAGAACA GAATCATGAGGG (N) xAGTTAGGAAAGGTTAAACAATTGACTATACCAGAGGTTTGCAGCTTCTTGGAC TCCACTGGCCAATCAAGAGAAGTGGAATTTTTCTGGGGCAGTGGOGGTATGCTAGTGTGCAAATGGAAG GTATGGTGTTATTCAGCGGGCCCTGCAAGTCTCCCCTTCCCAGAAACAACTACTTTATTTCTAGATATA GAATAATAATAATAGTAAAAAGACAGCTACTTTATGTAAATGCATGAATGATCCATTGCTTGAGATTTA TTCGTGGCCTACTTGTGTGTTCATTTTTCAAACACAAATTTTCCTTGGAGGACAAATCAGTAGTTTCCA AGCTTACTATAGAAATTAACAATAACTAAT(N)xAGTGTTTTGTAGATTCCTCCTAGTTCCCTCTTTTG GATTGTTTGCTAAGGTAGAAGCCACCTGCCACATCACTAGGACACACAAGCAGCCAATGCAGATGTCCA TGGAGTAAGGAACTGAGGC CT CCTG CCAATAGCTG CGTGAGTGTG CCAC CTTGGAAGCAGACTGTCCAG CCCCAGTCAAACCTTCAAATGACTGCG(N)xTTATACATAGTGAATACTGTTAGTTCCCTTTCCTG(N) xTTCTTATAAAAGGTCCAGTTTTCCCCTTTTTTCTCTCTCTTATCCTCTTGCCTTCCTCCATGGGATGA TACAGCAAGAGGATCCTCACAAGATGCTGGCA( N i xAAATATCTCTGGAAGAGTAACAAATACCAG(N) xACTACTAAATTT TCAG CT CC TATTTTGG CATGAGACAAGTGGTGGGTGGTTCAGTTCGGCTTTTTGTA ACTGCTAGTTCTGTC CC CCACCCCGCC CCGC CAACACACAC CATGGATAGTACCACGTGAAAGCCAGTC CTCCTGCCCCTCATATTCATCCCTTACCTCCCACCCTAATTAATCTGGTCCAGATCCTCATTCTTGCCA TGAATTTTAGCCT CCTCTCTTAGATAGGA TTTACATGTT CT TG CCTATAGTGGGTGAAATTG(N) xCTG AAGATGAGATTATCCGGGATTTAGGGTGCGTCTAAATGCAATGACTAGTGTCCTTATAAAAGAAAGGAG AAATTGGAGA CAAAGAGACACAGGG GGAAAACCACATGAAGACAGGGTGTAGAGATCAGAGGGATACAT CTGCAAGCCACAG CATG CCAGGATTGGATTG CAGAGC CACAGAAG CGAGGAGATACACA( N)xCTTCCC TTCAGAGGCCCCTGTTCAT CT CAGC CCTAAACC CAGCAG CAGT CT CTGGAGTGAGGTTGGGGAATTACA AAAGCCGTA( N ) xTTTACAAAGCCACAGCCTTGTGGTTGTTATAGTTCCCA(N)xCTAGATCCCTCCCA TGTGCAGTTCACAGTAGGGTTTCCGCTCCTACGAGACTCTAATGACGCCGCTGCTCTGACAGGAGGCAG AGCTCAGGTGGTCATGCTCGCCTGCCCTCCACTCAGCTCCTGCTCTGCAGCCTGGTTCCTAACAAGCCA TGGACTGGTACTGGTCCCTCAGGG
>Hsl7_70072445-70073023
AGGCTGCTGCTGAGGCTTCACTCATGGGACAGCTTCCTGGAAGGGGCTGGTTACCTCAAGTCCAGATCA TTCAGAGGGA CATGAGG AT CACACT CCGC CAATGC AGATGTTG AGGG AC CTTCCCTCTAAG ATGTGAAA AAGACCCCAAGCAAAGTTC CCAATGTG CC CT CT CTGCAC CTTAGAAAAC CACAAGGAAATATGGATTGG TTCCAGAAAAAGGCAGATGCCACAGGCACTGGGCCCTGCCTTCATGATTGCATCAAAACTGTTCACGCA GCTGCCTTCTCTCAATTATCCAAGAGCTGTTAGTCTTCATGAGTCTTGTCCAAATCTTTTGCCTTTTCT CTTCCTTTGGCTGTT CC CTGGGATCGTGATGATTACTGCTC TCAT CCAAGGTGAACAGAAGGTGAAATT CAAACCGGTCAGGCTGCTGATAGAATGTTTGGAAAGAGAAGGCAGGTGAAGCTGCTGCCTCAGTGGAGA CCCTGACCTTCTACCTGCTGTCCCTCCCACTTTCTCAAAGGCACCAATCTATGCTACACCTCTGGGCTG TGTTCACGTTGCTTCATCATTCCATGG
>Hsl8 77058035-77064649
CACGCTTTCCCTAGGTCAGGAAGGAAGCGAGGGTGCTTGTCCAACAGTGCCAGCATTCCAGACAGTGCA
GTACGGTAAGAAAGGGGAATGCAAGGTATGG(N)xTTTTTCAATGGATACAGAACCTAGAAGTTGGCCT
TGCACCAAAGAGGATATGGGAACAGAACATCAGTATGAGGAAAGATGTTCCTCCTTCTCAGGAGGCAGA
GTGGCTGTCCCATAGCCACTTCAGACTCACCATCTGTGTGTTCTCCCCTTCTCCCAAAGCTGTCTACTT
TCTTAATTCCTGTTTCTTGTTGAGGGCCAGCCTCCCTGAGCAAGGCAGTCAGCCTGAGCTGCTCTTCGA
GGTCTTGGATCCGGAGCCTCCACATTCAGTTCCCAAGTCCAGTTGATGCTCTTTTCTAGACTTCTGAAG
atttcatgcatttgcctcttcctttctgtcttgatgcctgctgccacatcatccttcacgtagaccttg
ATCCCAGTTCTTTCCCCCACATTTGACCCCCACCACTCTGTCCTTAGAATAATGCCTCAGTTAACTCTG TTACCTGGACAGGGCCCATCCAGAATGAAGCCGGGACTCCTGAGCCTGGCTCTGCATGCCCTCTTGGTG GACAGGAGGCTCTCTGCTTCACTGTGCTCTCTTGATGGGACCTAGAGAGCTCCTCACCCCCTCCTCCCT CTCCfiGCCCTTCTCTTGTGTGTGTCTCTTCCATTCACAGGCATTTGTCTTTGCTCCCAGACCCATGTGC GAGCCACATACAGTTCCCTCTTCCTGGAGCGCTTCTCTTGTGGAAACCACTAGTAAA( N) xTCTGCAGT CTCCTGTGACTGCACCACATGGACCCTGCATTAGTGCCTCTCAGTAGCATGATCTCTTGCAGCCTCTCT CCCTGTTATGCTGTGGGATTCTTTAGGAAAGGAATCCTCACCCTCCTTGTGTTTGGG <N) xCCAAGCTC AATGAGTAGTGATGCATTTTCATGAGTTTTTGCCAAATGAATACACAAATTCTCAGTGGGAATGTGGGA GTCACTGTCTGTTTAGAGAACTGTAGCTCCTGCTGCAGATGGGCAGCTCTCCGGAGCCCTGTGCTGTGC CACGAGGAGCAGCCGTAGGCCGGGCTGTGCAGAGCCTTGTCTCCATCTAAGGAGTGGATGCTTTGATTG GTAAGCAGTGGCGAGGCACGGATGGTCACTAAAAGCAACGTACTAGAATGAGGAGGTGTTTGTCTTCGG AAATAGCACTGCTGGGGAAGGTAGGTTAAAGAATGTTGGAGCTGAGCTCAGGAAGCCACGGAAGAGGCT GCAGTCAGGACGCGTGGTAAGAGGGCAGAGCGAAGGAGCTTCAGGAGGGGGGCACTCCATGTCTGAAGT GGGGTTGAGAGGTTTGAGAGCGTTTGGGAGGTAGTTGAATCCAGAAAGTGAGGAAAGGTTGGTGGTGGT GATGATGGCTGTGGAAGAGTAGCTCTCGTTTATTTAGGACTGACTGGGTCAGGCATGTCACCTGGGTGC CCTCGTGTCATCCTTGGACCACCCTGCAGTGCTTGCAAAGTGAGGAGGCTGAGTGGGAGGTCTTTCCCT TCTCGGTTAAGGTATCCATAGGGAGAGGGCAGCGGGGCCAGACAGGCCCCTTGCAGATGTCTCAGCAGG TGCTGAGGGACAGGGAGGTGCAGGCTGTTGAGGGAGAGGCGAGGGTGGGCTGTGGGTCGCATGTGGGTT GCGGTGAGGACACAAACCCCTTTGTGAGGACACTGAGACTTGAGCCCGGGGCTGCGGGGCCGTGGACAA CAGCACCTCACATCCCAGAGGGAGCTCTGCACTTGCAGCTGACTCCACACCCAGGGCCTGTCACAGCCG CCCAGCAAACATACTGAGCAAGGACTGGATGGAAAATGAGATGCTATTTCTTTCTTTAGGGTTCTTTGC ATACTTGCACAGTCATTTAAAATCCTAAAGAAGTAATTGTTCGC(N)xTTAAAAAAGTGGCATTAATCA TAGTCCTGTTTACAAACTGCATAGGATTTATGCCTCATTACGATTCCCCACCCTAAATCATCATGCTTC ACACTAAGAGAGAAAAGTCTAAAGGAAATACTTTGTTATCTCAATCTGCTAAGTAGAATGTTTTTAACT TAGCACATTTGTTAAATAATTTTTCAGAGAGCTGCTTACTTGGGCACACATAATAGAGGCAAACTTGGT ACTTACCCTAATATGAATTAATGTAAGGTTTCATTATCTTTTAACTGTTGGCAGTGATAGAACTACCAT TTTTCATCTCACATTTTTGCTATTTñAAGCACTCATA( N) xTGTATGAAGCACAARTGTATAAAGCACT TGTAGAACTTCATGATGGTAGTCAGAGACTGAATTGTTGCTAACTGCATTATTTAATATAGCCTAGTGT TTTATGTTAATTCTACTTATTTAACAAGAGTAGCATGACAAAACAAAAAGGTTTAAACTGCCTCTCATG TAATTTTGATGATTTTGTTTCCACTTAGATTTATGAAACTTGGAGACATCTTTTCAAGGTATACAGAGC ACAGCCAGTTTAAAGGTGCTTCATAAAACTTTGTGGAAGCCAAAATCTATAGAACTGAGAAGCATCTCA GGTAGATAGTTAGCTGCTGTTTGGAATTTTTAAAACTTTATTATATTCACTCTTCTAAGAGCATTTATC AAAACAATCTCTAAAATAGTAATTGTAAGCTCTATTTTCCTTGTCTTTGAAAGTAAGCGCTGTGTCCTG ATGGAAGGCTCTGAGCCAGCTGACCTCTTGAGTTCTGATCCTAACTGATTCACTTTGTGCTCTAATCCT CACGCACTTAGAGGGTACCATAGACATGAAAATAGTTTGGGTTATATTCCAGTTTGGGTTATATTCCAG TACCATGTGTGCCAAAGAACATAAAAATGAGTGAATATTTTTATGGATATTTTGCAACCATAAAATTGG GTTTT ATGGTTTTATTG G AGT CAATTG CAATTTTGTG CAATGAATGG CTGTGT CATTGTTC AT ACTC AG ATTTATAACGTCAGCAGTATCCGTGGATAACCCTTCTTGGATGTGCTGTGAAATAAGTGCAGAAACCTC CCTGCCCCCACCCTCCCCGGGGACAAGGTGAAAGCAGCACGTTGTGAAATAAGTGCACACñCCCCCCGG GGACAAGGTGAAAGCAC CACG CGTGGCACCACGTTGTGAAATAAGTG CACACACCCC CGGGGACAAGGT GAGAGCACCACGCGTGGCAGCACGTTGTGAAATAAGTGCACACACCCCCGGGGACAAGGTGAAAGCACC ACACATCGCAG CACGCTACATA CATAT CGAGAGGGTATTTGAG CTTC CCTGTG CTACTTTGCTTCTATñ AAGCCAAATAAATAAAACATATGCACATGTAATATGTGTATTTTTAGGTTAAATGAGATCTCTTTGTCA CCTAATACATGGCCCAACTTTTCTGGCAGTCCTCCTAAGGGAGGACTGTGGTACTTGAGGAGCAGGCAT TAGGGAAGCACATTCAGATAGCCCAGTATTTGGAAAACACTAAAGAAAAGCCATCCTATCTTTCACAGA C CTCCTGTCTGTCATTC CAAGGGACAC ACAT ACATGCTAAGGATT AAACAC AT CTGG AATATC CATTTC TGTTCAGCTAAAAAGACTTAATTTTTTGAAATAGGTTTTTTTAAAAAATTAATGGACACCCTAATTTGG GGCTTGTTGATGTCTTTCTCAGGCAGTGTTCTATATTGTAATTATAAAGGTGGTGACTCAGTTTTCTAG TTTTATGTCCGCCTCTTTCAGAGCAGGCAAAAGTTGCATTCGTTCCAAATAACAGATCTGAGCTGAAAG GCAATGACGGACTTTCTTCATATAACTAGAGCCCTTCTCATTATTAAACGTTGTTAGAGGCTG(N) xTG TTATAAAGGTACCTTGACATGCTAGGGAGAAAATATCTTGTTCTTCCTflTTAAAATGAAAATAGTAGCA AGTAACAATCTTAACGCTACTTTGTATTATA CAGTTGGGCATC CCAG CTGCTT CAGC CCATGAATGTGC AGCATGCCTGCATGGTAGGAAAGCTGGGTGTTCCCGCAGGCCTCCTGCATTAATGTGTTTGCTGTTGTC CCCTTTATCCTAGGAACCCTCACCCAGAATGAAATGATATTTAAGCG3CTGCACCTGGGCACCGTGTCC TATGGCGCCGACACGATGGATGAGATCCAGAGCCATGTCAGGGAGTCCTACTCACAGGTAAGTGGGTTC CTCCTGCACGGGGTCTGCTTCCACACACATCCCGCGCCATGAGTCCAGCTGAGCCTCGTGTGTTCCCAT CTGTAAATTAGCACGCTTCTTGGTGGTCTTAACATCCTGCTACTTCAGTCGTCTGTTCGAATATACGAA TCAGGAAATCGTCTTTGATGAGCATTTGTACATGAGTGAAATCGTAGAAGGCATAGTGTGCTGAAGTCA TTTATTC AT AT CACATTTTACTAGTGAGTTATTTGTAAAAAATTATT CAGT AAAC ATTGTAGTTG AAGA ATGTTCAGTAGGATACTACATCAATATGTTTTCTAACAGTTCTGTTTTTCTCTATAATAAAATGTTCTC ATATTTTTAAAGGTAAATTCTTTTTTTGCCTTTTCGGTAGATAAAAGTAGACACCATCGTGTTGGGAAT CTCGTAT CTGT CATTCACCTACCTGTCAT CAGTGC CCAGGTGGTGATGTTT CATCTGTAGGTGTT CTTC ATTATTATTA(N)xCTGCCTTTCGTTGGGGCCAAAGCTCCCTAAAGTTCTGAGAACAGGAATTAG(N)x TTTGGAT CAGGGATACT CAAC CTGTATTT GCTTTTAAAACAGGAT TTAGCCTAGAGG CTGTAC CACTTG CCAGAGATT TTGCAGGCAGTGG GTTGTGTATAGAGTCTGTC CATGG
> H s l9 _ 1363386 7 - 13645 68 8
CATGTGTATCCTAAGAAGTTAGCAAGGGAAATTTAATCAAATGAGCTACACCTGCTTATGGAAATGCAG ATCTTAGTTTTTCTTTTTT CTTTTTTTTT T T { N) xGGTTTTTCTTGAGTAAGAACCGAGCAAGGTGTCC CCAGGTTGCAAAATAACAAGTTTTACCACACCTGGAGATAAAGTTACCAACGATAGGATCAAGTGCACT ATTTCACCTAGAAAGCTTGAGAAAAATTATCAGCTTCATCTGTTC( M)xGCAGGTAAGGGTCAGAGTGG GAGCTATTTTGCAA(N)xATTATGGGGCTGGCATTCCATGTTGGGCCTTTAT(N)xCACCTCAGCCTCC TGAATTGCTGGGACTACAGGCGTGTGCCACCATGCCTGGCTAATTTTTCCGGGTTTCTTTTGTAGAAAT AGGGATCTTGTTATGTTGTGAGATCATGC CA CTGCACTC CAGC CTGGGTGAGAGTGAGACTACAT CT CA AAAAA CAAACAAACGAG CAAAAAGACATGGTTAATACGTTGATTT TATGAAAC TTGC CTCAAAGATGTT TCCTCTTGTAAAGGCACATTGGGGCTCTTAAAATGTTTCGAGTTATAGGACCTCAAAGCTCCTGTCTGT CTTGTTCACTGCCTCAGCCTCAGCCATGAAGAGAGGTCAAAATAGAAACGCAACACCTCCAG(N)xCTG GGGACACAGGTAGGGGGCATTTGATTCTGATGCCTCCTTAGAAGGAGAATAGCATGATTTCCCCAGAAA ATGGAGACTTGGTCCTTTTATTTTCTTCTGTTTTGCCCAACGACTAAAAAAGAGCTTGTATGTATCCGG GGAAGACAACAGAGAATGTGGTAAGAGAGTTAGGCTTGGTGA CAACAAGAATGGGAG CACACG CACATG TTTGCAAGGTTACACATAAATATATGCATACATCTGTGATCTCCTTCCATAGCTGTTCTGCCTTTTCCG TCACTGAAATAACAACTAAAT CATCTCATGT CC CATATGGCAG CCAGACTGTCACAAAGTTAATGGACA TAAAGTTCACAAGGGCTTGTGGTACAAAAAGTAAACTTTTTGAATTATAAGGAAGTAGGTACTACACTG GTGCAGCAAACACTGT C CAGC CAT(N)xATGGAGTTGGACAGATTCTGTTAGCTACTTCAAAATATTTA GGCAGAAGGAAGGAACTTCAGGGCATTCAGGGCAGTTTTCCTGGATCCTCCACTGAA( N) xAGCTCAAG ACATGGGGCTGCTTCTGCTGGGGCAATACTTAATGGTGTTACATACAAAGGTTTTTTTTCCATATTGGG GGAAATATTTTTAGGAAGGA(N)xATCAAAAAAGAGTATGAAAGG( N) xGGGCAATTTATAAAGAAAAG AGGTTTGTTTGGTCAACAGTCCTGCAGGCTGTACAAGAAAGTTGGTACAGGCATCTGCCTCTGGTGAGG GCCTCAGGCTGCTTCCATCATGGTAGAAGGTGAAAGGGAGCTGGCATGTACAGAGACCACATGATGACA GAGGAAG CAAGAGAGAG CAGAGAGAAGTG CCAGAGTCTTTTG(N)xAAAGCAAAGGGTTTTGTATGACA AAACAAG CATATATCAGGTAC CC CCGATATGTTTACTAC CTGAGC CTTATTGTGT CACCAAAAAATACC AGAAATACT CGTCAAAGAT TTAGACCCACACA(N) xTGACCCCTTGTTGTATTTCTGTTGGACAGTGCT TATCGGATGACTACCAAGCCCTGCAC(N)xCTGATCCCTTGTTGTATTTCTGTTGGAGAGTGCTTGCCT GGATGGCTACC CCAGCC CTGCACTTCTGAGCTC CATGAC CC CTTAGAAATGGTTT CCTCATCTAGAAAC AATG CAACAGCAACAAAGAATAATAATAC CATG CTAACATC CATG CTGGGTT(N)xCCTCAGTTTCCTC ACCTG CAAAATGAGGAT GAG GACAGGACACACC CCATGGGATCACGAAAGGATGGAACAATG C CCTC CC TCTATATGTGACAGGCCAAGCACAGGGTAGTGGAAAATAAACACTGACTTCATTGGCAATCCTCTTTAC TGAGTG CAGTT GAAATTGC CT C TT A C (N ) xGAAATTGCTTCTCGTGTCCAGGAAAGCACCTGGTGCTCC AGAAACAGAGAAGGCAAAAACAGAAAAAGACCCACTGGA(N) xGACCCACTGGAGACACAATCAAATAA AGGTAATAGAAAAAGAGAC CC CAGCATCACTTATTGGTGGGAGTAAAAATTTCAC CTAAGAATTACATG TGGAGAAAAACATGAACGCATTTGTGTTCAGACTGTTCAAATGCATGTTGTCCCACAGAGAGCTTAATA GGTTGACTTGGCAGACCATTAAATCTCGCTTCTGTGTCTCTTGCCTTCCCTTCCTTGGCGTCGTCTGTG TCTTTTGTTGTG GAAAAAG CTTT CATCCCATGC CGACAGAAAT CACTCAACTCTCAC CTGAGG CACT CA TTTAAGAGCATGCCTCAGAGGATTGTGTGTTCCTTGGA(N)xGGAAATTAAGAGGAACAAATATAGAGA TTAATACTATTTGCATATTAAATGATGATCTAATTATAAAACATAGGAAGTTAAATAAAAATATTAGAA ACAATGAGATAATTCCATAAGGTGGCTAG(N)xTTATTATAAAAATTAATGCTGC(N)xCTATATATGA TGACCACTTA C TGAATAAAATGGGGAAAAAAGT CT CATTTACAAAAGCAAAC CAAGATAAAGTACTG TA TTGAAAATAAACTT(N)xTAATAATAATGCTAACTATATATGATGACCACTCAGTGAATAAAACGGGGG GAAAT CT CATT TACAAAAG CAAACCAGGATAAAGTACTGTA TTGAAAATAAACTTATGGAGTGGGAAGA ACCCTTATGAAAAGAATTTTAAAATTCTACTGAAGAATGTGGTATAATCCTCCAATACACGAAAATAAA TGCAGTTATTGAGAGGAAAACAATACTGCATAAAAATAAATTCTGTCCAAGTTACTCTACGAGTAAAAG TGTTATTTGTT CCCAAATCAAGAAAAAAC CAAGTCACCTTG GGGGACCAGAAT CAG G GGAAAGAGAT TT AACTGGG CT TAAAGATGTGTTACAAGGCG G CACG G CATATGTTGG( N)xGTCATGTTAGATAGGCTCAC AGCCTGGGTGGCGGTGTTGGTAATTTTCACTGCCTCCAACTTACAGTCACGTATTATCACCAAAGTGAC TCCCCTCCTCC CACATGAGT C TGAGAGGG CT CCTGGTATAT CAAAACCATGTGAAAAGCAAT CGTGT CC TTTGGACATTTTAGTGCATTAGCTCCCACTGAATGTAACAATTGTGGCCAGTGTGGTGGCTCTCATCTG CCAGGCATGGTGGCACATGCCTGTGGCCTCATACTTTGGAGGCTAAGGTGGGAGGATTGCTTGAGCCCA GGAGGTTGAGGTTGCAGTGAGTTATGATCGT(N)xGTGAGTCTGAAAGTTTTCTTTAAATTTCC(N)xC ATAAATCAAATTTCTTTCATTTGTTTTAAAAATTTTTGCAAAAGTAAAG(N)xGCACAATATTTTAAAG CCACTCTCC CATGACTT CT CCAATACAAATACTGGGTATTT TATTGTATTT CTGT CTGGTTCT CC CC CT CTAGGTTTTTATTTTTCATACACATGTAATTATAATTTAAAGATTTTATATACAATAGTTTTCCATCAT CTGAG TAGT CACTATACGG CT CATTATCCAAGTATTTACAC CATTATGTGTTTAGG GAAGCAAAAGT CA TCCTAACATAAAAATTGTCACTAATGAACTGACAACTTGACAATTATCCCCCTCAACCATCCATTTCGG AG TAATACAAATATTGC TTTG GAAG CCAT CAGATCAGTGTGTTTGTCAC TGTTTATCTCTGATTGTTGA TGGTTGCTTGAAACTCAATGACACAATTCAAAACTATGTGAAACAACTGCAGAGTGAACATTGAAAATC TTTGCAAG GGG CAGCAGGATT TCAT GAAGTCAATGAAAGTGATGTTG GAGAGC TACTTG CAATGCAGAG AATAACATTGTTAAGTGAGGAACTTGCAGTGAGACTGGTTAACAACAGAAAAGGAGATGACAGGTGGAT GATAGTGATGAACACTTTGAAATAAAATGAATG(N)xGGAATGAATGGATGATCAAATGGTTTAAAAAG CCCT C TGAGGCACCTTGTTTTGG CC CACAAAATGATT CCCAACAAAGTT CAAGATTACATAACTG C CTT TAG GTTTTTGAGGAATCACCACACTGT CT TC CACAATAGTTGAACTAATTTACATTCCCAC CAAAAG TG TAAAAGCATTC CTTTTTCTCC CATTAC CGGGTACATG C
> H s 20 _ 12067861 - 120748 79
C AG AAGTTGTGTTGAAG CAGAAT C ATGTC AAGGTT TGGGTGTG AG AGT ACATCTGGTTT TACATT A C TA AAGTT CATT TG CTTATAGATTGTAGAAGCA C ( N)xATGCTTTGGCCGAGGAACGGGGATGTTTGGGTCA CTGTAGGGGAAGCTGGCTGTAGCCATCATTGACTCCTGGGCCCACTTTGGCCGTCACCTGCAGCCTTGC TGTCTTCCTCC CTCAAGTGCC CGTG CCTC CCTACCTGAGGGAAGTCTGGAATTGCTGTGGCGCTGGAñC TCT CAGGCATGAGATTCATGAGGGACA GATGTTGATG GGTAAATAGT CCAGCATT CTTCTG CTTC CTTA GGACTGTTCTGAAACAGATTCTACT CGGT CC CCAAGCTGTC CACAGTAGT CACCTGGTCACTGATAGAC CTTTT CTTGGTTCCCTTTCCTTCTG CCTCGC CTTCTC CATT CT CATAAGTATT CCTGGTAT CACTTC CC AAATCAACAA CTTACAC CCAAAT CTTTGTCTCTAGATCTTCTT CTGAGGGGAT C CAAATTAAGACAATC ACCAAT CAATATGTTGTAAAGAGAGAACT CAGAAAAT TGATCTGGTGAG CGTACTGAATAGAAGCTC CA TAAGAAATGTT CCACTGTTTTTGGAAATCT CATTTGGTCTCTCTGTCTT CTGACTGTTAAG CAGT TTTC ATC CT TATATTTACACACAGGGT CC TTTCTCACAAC CAACATTTTGC CTTCTTCTTTGTGAGCCCATGG AAGACATGTGTTGG GGAACGTGTTACAAG CT TCTC GCTGGGGC CAGAGTTTGACTGGTGTCTTGCTTTC ATC CTTCTGCCCCAGTTCACCTTTTTAACTCTTCATTTGCCCTGAATTTTACGAACTGGTCATCCTGGG TCTTTGGTACCTTCTGTGGCTCTACTTTAGTCTCCTATCCTCTAGCTCTGCCTTACCTAGACCTACAAT TTACCTGTGGTTCTGGGCCCCTGCTATAGGAATCTACTCTAAGCTGAGATCCTGTCTTGGATTTATGCC CTGAAACTAC CATTC C CACTT CT CATT CTGTGTCTCTTTTCCTC CTAAC CCTTCCACTTCCCTGGACTG TGCTCATGGCCCTCTGGCCCATCTCTTGCCTGTGCACTTCAGGTGGTGTTTCTAATTCATGGCAAGTGG TAT TT CTAATG CGTGGCTTTCAAAATATTTT CCTGTAAAACTTACCTTC CTAC CACTTAGAGCCATC TA AATGTTTTTCT TCAAAGATACACAATATTGAATTAATGCTCTT TCAGAAACAGAACGAAACTATTGTAC TGTCTCTCCCC CATT CTTAAGAATGAGTGTT TGGATTGTAAAGTGAT TGAACCACACTCGATTCT CT CT TAT TAATG GAAAATTGG CCATAAATTG CT GAGGCGTGACTAGCTCAT TT CCCAATACAG CT GCAAATGC TTTATGTGGGGATCACTTTCGATTTCTGAGAAAGAAAGGAGAAAACTCTACAGAACTGACACCTCACTA AACAGTACCTCGTGT CACTCCCTACCGAGAA CTACTGTTCTGAAGAGGCAACGAATTGAAGATAGTG CC ACCTTCAGC CAAGGCTGTTTC CTTAGAGT GCATATTTTTAATGTTTTTATGGTGTGGTAGATTTGGCTG CAAGCTTTGAAAAGGCAGCCATTAAACATTTTCGTTTTTTTCCCACCATTTAAAAAACTAGTCTAAAAT GCATATATG CCTGAGAñGTGCTCTGTATT TAGCGCACAATT C (N ) xATATCTTCTGATACCATGTCTAG TATTCCAGTAGATATCCATCTTCTGTCTCCCCCACCAAATCTTCTTTTCTCCTGGGATCAGCCCTCCCT TTGGAAAAT TATTTTTT CCTG ACTT CAGATG AAAGTC AT AG AGTGGCTG CT AAAC ACTGTG CCCCACTG ACCCCATTTCC CAGGGT CTATAATC CATG CAATAAAAAGAGAGTTGCAGTCAG CATTTTTTTTTT CTTT TAGTGAGAACCAAGAGT CACTTCTT CC CT CT CAGTTAATGAACTTCAAAGGTCTT CTCTGT CATGTTTC C AGTC TCAT AAG AAAAG CAGT AAAGTC CATG AG AATG AAGC TG AC ATGGGAAG AGGAAT CAGAAATG AT
GñAAGGGAATC CTTATGGTTTGTAAGTAG CTTGTG C CAACC CATGAGGC CAGCTGATCC CATGCCTTAT ATAGC TTTCTTACCACGTCAATACAGT CACT CCTT CC CCTAAC CCTAGGACTTGGGGGT CTGCCATGTG TGTTAAAAAGAAACAGGTATTGGGT(N)x C CAGAGAGAGAGAGATACAG GTATTCCAGTGC(N )xTAGT ATñCCTACTGCTGGTGGATAAAGAAGGTGAAACAAGACAGAATTCTACCAGTTGACñAAGTATACTTTT CCATTGCTGTGGAAGTC AAAATCTC CATG TAGT ATTAACT C AG ATGATC CCGTTTTTAAAT TAAC AACC ATTTTTTCT C CAGCT CACTTTATAT CG CGAAAGTTTCTATTTAGT T TAATCATTTATTGAG CACCTGTT TCACCTAAGAATCAATATTATTGGTGGAAAGTTGAACAAATGTATAAGATATAATCCCTGAATTCTAAG CAT TCTATAGCTTTTTT TTTTATTT CT CTAATGTG CCTGTTATTTTT CATACCAGGAG(N) xTTGCCAA GCATCCCCATCTCAGGCTCTGCTTTTAAGGAACCAGACCTAAGACGCCCTCTTCTTAATGTCTCATCTA TATCTTCACATCCACGT CATGTTTGTCTTTT CTACTGGCTCTTGCAC CCACCCAATCACACATTTGC TA TGTTGGTCAACTGGAACAGAAGGAGCATCTTGGTGGTTTCAGGATTTGGTACCATATCCGTATACCCCA TGCTGCACAGGGCTACT CAGC TGGCTAAG CTAGTGGATGTAAAGAGTGAGGGATAGCCTGTATTT CCTA AAATAGCATAGCAGCAACTGCACTAATTTTAGATAAGGACACAAAATGCCTGGGGATATAAAGATGTGT GTGTC TGTGTTTTAATC CAAGTTGAAATTATACAAAGAAACTT GCAAGAAGCACAAGAAAAAAGACCTA TAT TATGAT TTTCACTTGAAG TGAAGTAAATGGAGGGGCTGGGAAATAGATGC CAGGCACCCCTCTCCA ACGTGGGTATG CCTG GCTGAGTC CCTGAGACTGCAGAGTGG CAGTGTTGAGTGAC CACCATGTGTTCAG CGAGGAGTGAATGCCAGAATATG CT CTAAGGAAAAGTGCAGATAATAAACTTAAAAAGACAACTTAG CA AAATTGGTAGTTTCT CAAGGAGTTG GGGGTC ACC AGGTGAGTAGTGAGGAC AGAG CCAG CC ATAGG ATA AGAAGAGAAAAGGCAGAAGAAGGGAGAAGAAGCTCAGTAAATGCCAGATGGGGTGCTTGTTGATACATG TATT C CAGAAAGTTTGG CTGGAAAATGTGGACAAAAAGAAT TGATCTTGATTGGG CACATATATTATAA AACAGGAAATAGTGCTTATCTTTTTACAATTACGTGAAGTATGGGCAATGT{ N) xATTATATGCTATGT TGCTTCCTGTGCCAACTATTTTTCTGCTCTATAACCTCACTGCATCCCTTTGAAATTTATTTTCTCCAT GTCAAAGAATTGGAAAT CTGCAACTGACTATTGAC CAGACT CTGT CACTGATATTGTACAGCTAATTAT ATATATCTATTTTATAACTGTAATGGTTGTTAAGGTAATGTTAAATAATACTTTCTAAGCATCTGTTAT ATGAAGAGTTATATCCTCTGCATAGTATAAGGTTGAGTTTCTTTTTACTTCTTCTCAGTTTTTCCCCTT TTTCAGCTTT(N)xAAAGATATGATAAATGTTTGATATGATGGATAGATATGCTAATTACCCTGATCTG ATCACTGGATCTTACATGTATGTAAACATCACTATGTACCTCATA{ N) xGATGCTTTCTAGCTATGGAG CAGCCAGAATTAGTAGTTCAAGCGCAGGGACTCATTAACATTGTGATCGGGCTTTTTTCTTATGATTTC TCCCAGATTCTACTTGTAGAAAAATGTAGAAAACAGACATTTTTAAAGTGTGTTCTCACTTGGGTCCTC GTCCTCTTCCTTCATTAATTGAGTTGTCCGCCTTTCAGTCATACTCAGCTCTAAGGTGGAACCACCACA ATTGCACCACAACCCTTCTAACTCCTGACCTCTGATCATTGCCTTTGTCCTTAACCAGTTCTGCATTTC TTTCATAGCAGTAAT CAAG CAAAAACCACTT CCTAGAAAAGTCTT CACCAACAGCTCTATAGGTTATAA CATTGCTATCTTTCTAGAGCCTTGTAGCGTATGTGAATATGTATTAATACCTCATTCATACACATGCGT ACGTGTAGTATATTATTTTAGATTAGGAAAAGTTCCCTGAGATCGCCACAGTCCCCATTTCCCCATTTT ATAAATGAGGCAC CGAAGC( N) xTCAACATCTGTGTTTCTTCCCATGACTCTGTCACCACTCAAGAATG CCAGGACAAGGCACATTACATGC CTGG CAGGTTTT CTTTCTGTGTG(N) xACCCTGATGGATTTTGTAT GACTCATGTTGGTGTGGAGGACATCCTG(N)xAAACCAGTGAGGTTACAGAAATAGCAGCTCAAGGAAG AAGAAGGACCAAGGAAGGATATGATTTCAGGCAAAATTTCAGCCTTGGCTTGATCCCATGAGAATCTCA GGAGTGCAAGTGG
?H s20 36 25 551 7 -362 60 170
CATGCTGTGATTCTGCACCCTTAGAGCAGAAGCACCACAGACAGCGCCAATACCATTCAGCACCACGGA CAGAGCCACGCTCTGACGTCTTCTGCAGAAAAGTCCCATCAACTATTACACTGACCCAAAGAATGAACG AGGCAAGACCACTGAGTGCCCAAAATCAGTGGGATAGGAAGTCAGAATCTGCAACAGACACAGATGGCT TTCCCTCTCCCCCCTGCCTTTTCTGTAAAAGAGTCCCAAATGTTGCAATTGTCCTCACAGTGACATTTC CCCTATCCTGGGGGCCTCCCGTCCCTCTCCCCACAACAGTGAAGAAGAGGGTATCGAGCCGGGCAGTGG AGTACCCTGGTGACAGATTAG CACTGC CAAC CACC GACACAAAA C CATTTGGTGTGGGGAAATGATAGA TGTTAA CACT CATTTTAAACACAGG CAACTG AATT CCTG CAGACAGG CG C CACGGCTGG AATTTAG GGC ACAGCTGCTAGGTTCAGATAAGGTG CAGAAT CAGC CCAG CATACCGCGATGTTACTTAATAATGAAAAT GTGGTGATTGATGCTGCTGTCTCGAGAATATAACCTCCCCATTTGGAGGTTTTCAGCTCCCAAATCATC CATT CCACCTTTTTT TTTTTCCC CCTTTCTT TCTTTTTTTG GAAAGG CAATGTGTGTGTGGTGGGGGTG TGGATACAGGGTT GATGGGGC CTG G CAACATGAAG CT CATGATGCTAAATAATTTTTTTAAAAGTCCAA AGAG CTTAGTTTGAAAGAATT CAAT TCTAGCAAGCTCAC CTGGGACAGCAAG TTCAACATGAAAAACTG GTTTGGACAAGTTAC CTTC CATATTTTTATGGT CAAAGG CTTCTT TTACTACACTACTCCCCATTTCTA TGATGTCACAAAGAGCCAATCAGAGTTTTTGATTCAGGACCTCAACTCCCACCTCTTCCTGAGCCCCAG GAAAGCTGGACCATTTATCTACTGCCTCCTGAAGAGCATCCCCTGGAGAAAACCTCAGATTCTGCATGT CCAAAATTAGTCTAGGTTAGAATATCGAAGCAAGCCCTTTTTGCTCTCTATGGCAAGTGACAGCACTCA AGGCAAACACCTGGGCATTATCCAAGACTCCTCCTTCCCCCTCATCTCCCAAGTCCAGCATCTCACACC TGCTATTTTTATCTC CTAAAC CACT CT CT CTTAAATT CACAATTCAC CTACTTTTTTTTTTTTTAGCCC TATAGCGTTAAAATACAACCTCACCCCCTTCCCCTAAATAGCCACCTGGGCTCCAGCCTTTTTCTCCAG TTCATCTTTCATCCTGCTCATCAGAATAATTTTCCTAAATGGAAATATATCATGCCACTCTTCTGCTTA AAACCCAAGATCCTGTAGGCATTCAATGTCCTGTAGGACAAAGTCCAGTTTCCTTACCATAGCATCAAA ATCTTTAAGGATCTGGTTTCAGTGTATCCCTCAATGTTACTCACAATCTTTACCAAAAGCACCTGAAAT AAATGCTTATTTCTGCCTC CTTG CATTTG CACATG CTGTTC CTTCTGTCTTGCAGGACCTTC CTCTCCT TTTTCTGCCTGAA( N) xTCCCAGTAGAGAAAAGTTGTTAATCTAGAGCCCATGGAAGACTTGGGAAGTC CC(N)xGTGTGCCTGCCAGTGGGAGGGGGTTTGAATGCGGTGTCAACAGAATCTACTAGTCCTGAGAGA GTTACTGCTTTAAGGAGGCTGAGAGGTTGTTTTGCATGGAATGGAGGCAGAGCCCAGACTTTATAGATA TCTCCATTTTCACTCATGCAATAATTCATGTATTCATTCAAAAGATCTTCATTGAACATCTACTCTGT( N) xTAATTAGGATAGGAAGGAGAGATGTGTGGCTCTCTTCTTGCTTGCTTTTGGAAGTCCTGCTAAAAA ATGTGAGAGAGCATCTCTAAGGCAGGGAGTATTGCCTTAGAGCTGTTGTCGGTCATTTACCCTTGAGTA TAAATGACATTTT AGTTTTTTTT CC CTGCAAGT AGTTTG TTGAAGGGGAATGAAG AAAAAAGCCTGAG G GGCACTGAATGCTTTGGAGAAAAGC CATAAAATGG CC CAAAATAAGAGTTGGGGAGACAAGGGCTTTGA GACAAACTTGCTG ATTTTG AATCTTGC AG CCAC CG CT CCTC CCTCTGTCAGGGCCTCTCTCCCTGTCTC TCACAGACCTTGGAGCTGGCGGGAAGAAGCTTGGCTTGGAGCCAGCCATGTCCAGCCTCTGCCTTTTTT GAAGCCAGAGGTAGCAGCTGCCTATGCAACATGAACTAAGTGCCCAGCTCTACTCTCTCTCTGCTCAAC CCTACTGCCAACAGCCACTGTTCATTGCCTGAAATTCCCATCATTAATGTCCCAAAGCCTAGGCTGGGT GTCCTAGGAGCTCCATTCTTATTCTGGCTGTCTTGTTCTGACCTTGGGCATGTTCCTTCATTTCTCTGG ATGCAAGGGCCAGATTGGATTCATCTCCTCTGAGGTCCCTGCAGCCCTAGCTGTCTATGTGTTTTTTCC TCACCACAGTTAACCTATTAAGGTTGCAAACTCAAGGTGGTGTTCTGGAGACCCCCAAGAGAGCTGGGG GATATCTGAGCAC CTGGAATCGAGG CTGAGCAC CTTC CT CAGT CTTCTGGCTGGCAGAAGCCGTGGCCC AG GAGTTGGGAGGGAGAAGGG CT CTGGAAGAGG CC CT CAGC CC CTTGTGGCCCTTTAAATGCTGCCAAA ATGACTCCTGCTCCGTAGATTGCTGCCTGGAGCTGTCAGACACGCGTCAAGGTCAGGGGTGTGCATCTG CCCCAGACACAGCAATAATGGCTTCTAATGGACTGTAAACAGGTGAAGTTTGCTGCAAAGACATTGCTG
CTGCTGGAACAATAAAAGCTT CTTT CTTCTTAACC CT CT CCTGGATTTTGATTATGCCACAACTACACT GTCTTTAAACAGGGC CTTCTC CAAAGACATT CTTG CCTGGA( N) xTGCAGCTACTGATGATTTGCGCTG GACTTTGCTAACAGATGTAAGAGAGATAAAAAGTGACCTTGGAGCTCAGA(N)xCAATAGGATGATTGT GAGCATTGAAAGAAATTAGATGCATGAAATGCCTGGCACAGAGGTCTCAGTAAAGGTTAGTCCCCTCTC CTACACCCACTCCCCT(N)xCCCAGAGCCCACCCCACTTCTCGGGGATGCTGTCCCCAAGGTCCTCATG TGCCAGGCAGGCAGCCAGCAGC
>Hs21_46129441-46139409
CTGACAACTCCAGAGAAGGCGCATGGGCCCCGTGGCAGACCCGAACCCCCAGCCTCGCGACCGCCTGTG ACCTGCGGGTCAACCAC CCGC CG CGGCTC CACG CCGTGGGCA CAGA CTCAGGGAGCAGGATGAGAAAGC TGAGACGGCGCAGCCACGGCCCGGTGCCTTCACGCGCACAGCGACACAGCCCCAGCCAGCGGGGCCCAC GCTAAGGCGGAATCCCACAGAAGCCTACAGAGCGAGCGCGCGCCTGTGCTTCCCAAAACGGAATGGAAC CAAGGTGACTTCTACAGAACGAT CTGAAG CC CTGG CTGG CC CTTATG CTAGTCTCTTGGGAGCGTTCCA AATGCAGCTCAATATTACTTACTTGACTT TTAT CTTT CGTC CCTGGTTCGTGGTATTTATAACTGGGTC ATCTTTTAACTATTTGCAACGTAGCTTCAGGGGAGAGGGGGAGGGCTTTATAAATAACCTGTATTATTA TTATGCAGGTTGATTCTGTTCCCTGAGCTAAAGGGAACATGAAAATACATGTCTGTGACTCATGCCCCC CCACCCCCACTCCAGGGTGTG CTGAGGAGTCTCTCAG CTGCCCCGGGGT CCTCGAGCAGGGGAGGGAGA AAGGCTGGCGCTG CGCCCTCCATCG CGTGAAGC CAGGGGAT TTTG CT CTGCGACAAGCTGACTTGGCTC TCGTATTGTTTGCAGAATCAC CCAGTT CCAAGG CAGT CCCTGCGGG CAGGTGCAGCTGTGCGGGAGCTT CAGTCCTGTCCCCAACACCCAGGCAGTAATGGTTCCAGCACGGAAGGTCTACCTACCTCCCACTGCACA GCCCGAGGGCTGT CCTGGAGG CACAGC CATC CGTC CCTGGGTGGG CAGG CACGTTTATG( N) xCACGCG AGTCAGCACGTTC CATACT CGG GTGAT CGTG CT CATC CC CTGGTCATGT CATCG GGATCTGAGTGCCAT CCGAGCAGAGAGCTGTGGC CCGGTG CCGGGGGT GGACTT CATCTATT CCAGGGAACCAAGGATGCATGA TTTGCAAACAAAACCAGAAGCGCAAGCCATCTCCTCGCCTCCCCTGATAGCCGTGCTGCGGAGCCTGAG TGCTGGAGTCACTAATTTACTGTCCAGTAACTGAGTTTCTAGCGACCAGGTCTAGTTGTCGTGTTACTT TATTAAATTTTGG TTTGATAAGTAAAATC CCTCAG TAGAAACT CTAGAAAAGTATTAGCGTGTCATGGT CATGAGCAAGCCAAACAGCCTTTCTAAAAGGTGGGAAGGAGACCCCCAGGCTTCGCCAGGACCCCCCAG TGAGCCAGCCCAGGGCTCTCCACTCAGACCAGATTACGCCCCAAAGAACCGAGCCCTTCACTCCAGAGG TGGTTGTGTTTGGGG CTGTGT CTGTGT CT CCAGCTTTTTGCTCATGTGGGGAAGGAGCAGGGCCGGTGT GGCCCTTCGCTTT GCTGAGTG CAGG CTGGGGGCAC CT CCAC CAGCTAAGGAAATGGCTCGTGCACAGTA GCCTCCCGGGCTGCTGCGACTTCATTCTTCATTCCCAAAGCAGGTGTCAGCCTTTCCCGGGAGGCCCAG CAGGTAAGCACTTGTGGAGGCCCCGGTGGCTGCTGGTTAGCTCTTGAAGCTCGTCCCCACCCTGCGTGC GTTCTAAAGAGCCGCGTTTCTATTGCAACTGCCTGCCCTGCGCTTTCATCTTCCCCACCTGTGCTCCTC CCGCCTCTGCCCATCTCCACAGGGTGCTACCTGCCAGTCCTGCCAAAGCGTCCTCGGGCACCGCGGCTT GAATCAGTGTTAGAAAGTGGCATTTGTGACTCGACACCCCTCCCAGCTCCCCGGCAAGATACCCCCGCC CGAGTTCCCATGCCCCTGCCTTACCTGTGCAGGGCTCCCAACCCTGCGTGCCGTGGCCGGGGGCCGCCA GGGGCAGCACAAAACACAGACTCAGCAGGGCAGACATGAGGGGCTTGGGTGCCAñGCTCCATCCAGGGC TCCGCTCAGCCTG CAGGGAAGTGGCTG CT CCTCAGACGT CTGCGT CAGC CTTTCCA(N) xTCACTGCGA CGGCCACATGCCAGCGGCCTAAGGCGGGGACAGAGGTGGCCCTGGCGGTCGCAGTGGGGAAGCTTCTGG TTGCTGCCTGCGGGAGG CAGGTGGC CC CTGG CAGGTTGGGCTG CAGC CG CTGGTTTATCTCTTCATTTC CCTGATGGGCTTGGGCCCTGCGTGCAGGTGACAGGTGATAACAATCTCGGCCCGGCCTGGGATTAGTCG CTGGCAAAGCACACTTTCCAAAAGGAAAGACAGAAATAGATCGTGCTGCGGAGAGCTGAGAACAAGCCA GAGAAACTTAGGAGACC CGTTTCCCTCAGGCGCAGGCTCCCTGGGCT CACCGCCTTTTAGGA CCAGGGG CACAGGCGACCATGGCAGGCCGGGAGCGGGTGGGGCGGGTGCACTCTGGCTGGGGGTTTGGGCAGCAGG TCCCCACAGTCTGCTGGAGGGTCCCAGAGGCAGGTCGGGGCTCTGCTAGGGGGTCCTAGAGGCAGGTCG GGGCTCTGCTGTGGTGGGGAGGGGG GGTC CCAGAG GTAGGT CCTGTG CT CTGCGGGGGGTGGGGGGTGG TCCTGGAGGCAGATCCCAGGCTCTGCTGGGGGTCTAGGCGGCAGGTCCCGGGCTCTGCTGGGTCCCCCG GCGGCAGCGGCCGTC CATC CC CAGGAGGGGCTGGG CT CC CT CGGAGGGCTTTTTATCTGGTGCCCAGCC CTCCCGAGGGTGTGTGGGTTTGTCAACTGTTGGGTTTCAGGAATTTCCCTCTGGAGGGAAGTCGGTCCT TAAAGGGAACAGCTAATGAGAAGGAGTGGGTGAGTGTCCCTCAGGGAAGGGGCACGGCCATGGTCACTC ATCCAGAGCCGAG GACCCTCG CATC TGAC CT CT CGACACTG CAAT GGGCATGCCACTCGAGGGGAGCAG CTATTAACTAG AAGTATTCTTTTTACAAAAGTG CC CTGC CC CC CTAC CCTCT CCAAACAGCACTGAACG CAGCATTCTTGCAGAAATCTCCAACGCAACTGGGTGCTGCGTTTCTGTGCCTGGTGTGGAGGCCCTGGC AAACTGGTCTGAGGC CGATGG CTTTCC CTGGTT CACAGG CC CACAAAGTGCACACACTCCTGCTAGTTT CAGAAGGAAGCCCCCACTCTGGGACCTCTGACACAATGAGTTTTCCAAGAGCATAGACCCTGTCATTAG GGCTTGTG(N)xCTCCTGGGGCT CTGGAGGCTCTT CC CATAAACAGC CC CTGCGATGGGACAGGCTCAT TCTGTCTCCCTCTCGCTTCCTCTTTTACCCGGCCCTGCCCCCAGGGATATTAATTCATAGTTAATATTA ATTCACAGATATTAATTCGTAGTCCTATTTATGAGGCTTGAGTTAACTGGCATCATATGAAGGGAGGTT GAAGGTCTCCCATTATATGTGACTC CT CTTC CAGCAC CCAAGTGAG C CCTGGGGTGGGGTGTGTCTCCC TCACCCCCCAGGACCCCCTGCCAGCTCCGGTGGGCATGCAGTGCCAGGACAGAGGCTTCCTGAGTGGGC TCCCCACCCCACTGGGGCTCAGCAGCTGGAAACGCTCCACATAGCACCATTCCTGACAACCCTAACCTA ATGAGG GACGACGTGGTTC CT CAGAGG AG CGAAAGGC CTGATCGTTGTGTAAATGGAAT AAATGGAGTC TTCCTCTTTTGTGTCCCAATCCATGAGAACTTTGTCCTGGGCCAGGCCGCTCCCAGGCCAGCTGCGGGT CACAGTGGCCTCTGAGACCACGC CC CTGAGGGGATGGGGAC TC CC CC CAAATGATGGGCAGGTGGCACT CCAGAGCTAATTTAAACAATCACAATTAAGCTGCCTTGAACCCTTCCGGCAAAATTTTTCTTATGTTAC ATTTGATGCAGTAGCACTAGTAATAGCCTCAAAGATAACTAAGGGTGAGCCAGTGGAAATCCTCCTCAC ATACGAAACTTCACAAAGATG TG CTGAGCGCAGGAGGAAAG CT CAG(N) xCTCTACAGCTCCCTGTGTG ACCCTGGACAAGT CACTTACATT CT CCAAGGTTTAACTT CCACGT CTGTAAAATGTGCCTAGGAGGACA CATGTGGTAGGG(N)x TCAGCATTTCTGGAACCACGACCCAGGGAACACCAGCTCCAAAATGCTCGATG AGAAAAAGCAGGTGCATTTGCAGGTGGACTC CACAGC CCCTTCCT CAGGGACACAAGCTTGGCTCACCC ACGCGGAGCTCTGAGACATCAGTGATCAGAAGCCTGCTTA(N) xGGAATTCTCATGAGATCTGTTTCTT CAAAGGTATAGGG CATT CCCCTTTG CT CT CT CT CT CT CTCTCCTGCT CCACCTTGGTAAGATGTGCTGG CTTCCCAATTACCCAGTCTCAGGTAGTTCTTTATAGCGGTGTGAAAATAGACTAAGCAAGAAAGAAAGA GGAAAAGAAAAGAGAGAAATC CT CCACAT CATGTGTATCAAAGTC T CATGGAGACGAATCCCCAGAACC CACTAAGAGAAAT GCATGTGCAAAGAAGAAAGCATGT CC CTGC CATAACATCAGACATCAGAACTGAGC TTCACTCCCGAGAAGCAGGTTGGGTAAGTGCTGCATTGTTAACAA3GGAATTCACGGGGCACCAGGCCC TCACAGCATAGGAAGGAACGGCAGCCCATCCCAGGTCTGCTAACTCTGCCTCCAAACGACAAAAGAGAA AATGTTCCAAGAACTTCCCCCAAACACATCCTGGTGCAGGCTGTCACTCAATTTGTCTGCTTCAAGGGA AACAGAGACGATCAATAGAAGAGGAGGAGGG CT CCAT CTAT CCTC TAAGGACACAGGTTGTTAATCAAG TCAGATCAGTAACAAAC CC CAGGAGGT CATCACGGGCTT CATCAAG CTG CAGAAGTAACCGTTAGCGTG TCATTGTTTGGCCAAAGGCGGCTGTGAAGTGTGTCCATGTCTTTAAATATTCAACATGCCCCAACCAGG GAACAATGCATTCTC CAAGAAAACAAGGAACAGAG CT CAAAGAGAAACTGAAAATGATAACTTTATGTT ACAAAAATAACACTCTTACATTCAC TC CT CCAGTTTTGTGTTTTATTAAACTAAACCATCACTTCTGTG ATCTATGAACCCCAGAAGGAC(N)xTTAATAATTTAAAGACACAAAAATTCTTCCAGGATACACCATCA CTTGGTATATAATGACCACTGTTGTATGACATTTGAAAAGGTATCTAAAAAAAAGAGAAGGCACACACG TGGTGTTTGACAATGAGTTTAGATAGñCTGGGGAGTACTTGGCAAGCCTGGACAATGAGCATTAGCCAA CTCTCTGGG<N) xCTGGGGTTATATGCTGTTGGGCCAGGCCATCCAAGGCTGGGCTCCCCATCATTGAG CAGTTGGCTCCAT CAGT CACTGAGT CAACTGAC CAGG CACAAGTGAGGT CCATTGTCT CCAGAAAATGC AGGCTAGAATTCCAATTTGAAAGTCATTTAGATCTCATTAGTTATTTAATAAGAGAATAAGAGATTATT TTCTCACATAAAGAGTT CT CTTTTTTT TCAAACAAGAAATACTGTAGGT CTTTTATATGACTAAGAACA ACATAATTATTAATCTTGACCTCCAAGGAGAGGAAATTTTGGCTCTTAAAATCTAACACTAGAATCTGA CCCTTGACCCATTACATTAGGGTTCCCATTTATTTTTTTTTTCTTTTGAAGAGATCCTTAAATAGTAGG GTATTACCCTTTGTCTGCTTTATCCTGATTGCTTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG TGTGTGTGTGTGTGTGTGTGTGTTTAGAGATGGGGTTTCGCCATGTTGCCCAGGCT (N) xTGCTTCCCT TTTTTGGATGCTT TGTTTTAC CTTGTCATTTTG CCAGGTGT TTAAAATGTTCATTTAGTCAGCAATATT TTCTTTGAGACTTTATGGGTCTTAAAATACAATTTTTATTCTAATGCTCACTAAACTTGGAGACAATTT TTGAATTACAATT TTTAGACTTTTT TG CTATTGAATTGTATGTGT C T ( N) xAAGTTACTATGTGATTTT AGAATCTCTTGTTTTTCTAGGATAAATAA(N ) xATGTCGCCTTTATACACATTCTTCTCTTCTTAGTGC TGTTTCCAGGCAGTC CC CT CAGG CT TTGGTCTT CC CTAAATGCAGAGATTAAGAAATGGGTGCCACAGC CCACCCCTCTCCACTGAGGTGGGA(N)xCAGAAAGACCCACTGAGTTTAACCTCAAGCACTCCTGCTCT CCTTCCTGGCCACCTTCCTGGTCCAGCCACTTTCGTTGCCCATAGCCTTCACTGCCCAATAGTCTCCCA CATCTCCTTCAGGTC CCCTGGAT CTATTCAGAG CAGT CT CTGCAAAACC CAGAACTGGTTAGGCCAGAA CTCCCTTCAAGGACTTCTGGTTGCCCCCAGAATAAAGACGACATTCCAGGATGTGACCTATAGGGCCCT GGAGTGGTCTCGC CTGCCACCCAGCTG CAATTT CAGC CCTGCACCTGCC CTCCCTTTTATGCTGCAGCC ACTAGGGTCTCCCAGGCACGCATCCATGATGAGCTTCCGCCTACCAAGGCTGCCCCCAGGCTGGGCCCT CTCCCTGAACACCCTTCCTCTGTCCCCATCCCAATTAACTCCTCTGCGGAAAGCCACCCCATTAGGAGC TCTCACTGCACTATGTGCTGCCCCTACTTCAGGGTGGTGGTTTTCATCC{ N) xGAACTCATTATCTTGA CTGTGGGC(N)xTCTTCACCTTGCAACATTTAAAACACTTTTCCAGCC(N)xAACTCTTTTTTGTTGTT GAATCATTTAGATGTAAATTATTAACATGATTTTCACTCTTAGATAT CT CTG
>H s22 4 98 379 92 -4984 342 7
GAGTTTTGCAGAATCAC CT CACATG CACGTGGT CT CTGAAC TCAGGACGAGG GTGACATTGCGAGGCAC AGGCAGATGAGATCTTCTTACTAAATTCTGTAACAAAAAAAGCTCACAGAGGAACGTGTGGGAGAACGC TCCT CCTGACCTT G CGTTGGACAGAGAACTT TG ACTTGG CC CC AG AAAG CACCTGCC AC ATGTAGAATG TAGACATTGGTAGAGACTCAG CTTAAGAACTTCTGTC CT CGAAAACA CAGCT CTAAGAGTGGACAGGTA AGCCCCAAGGAGAAGAAGACATCA(N)xTGTATGAGTCAATTTTCATAGG(N) xTAGGTAGGTTGGTTT AATG GACTCACAGTC CCACATGG CTGGGGAGAC CT CACAAT CATGG CAGAAAGTGAAGGAGAAG CAAAG GCATGTCTTATGGCAGCAGGCAAGAGGGCGTGTGCAAGGGAACTCCCTTTATAAAACCATCAGATCTCA GGAGACTTATTCACTATCACGAGAACAGCAGGGGAAAACCCGCCCCCATGATTCAATTACCTCCAGTGG GTCTCTTCCAAGACCCATGGGGAAGATGGGAGCTACAATTCAAGATAAGGTTTGACTGAGGACACAGCC AAACCATATCACC CAGCAAGT CCACTACTGAGTTTATAC CCAATAGATG CAACATGCCTAACAGCTAAA CCTAAAAATGTTCCCACTGAAATGGGCAGCAGTGGTAGGAAAGGTGGATCGAGTATAGCGTCTTCACAG CTGTGTATCATCCTGAGCTACAAACAGCTGCCGTGCACAGCAGCAGGCATTCACCCTCCACACCCAGTA TCAAAACACAGCAGACAATCCAAAGGTGCAAACTGCAAAATTCCGTTTACTTCTTTTTCAAAAGCAGGC
AAAGCTGAGGACTGGTGATGGCAGCCCAGATACTCAATGGGGCTGACTCTAGAGGTTCCCCAAGGGGCT GGGAATGCCAGGT CT CT CT CT CCAG CTGGTGGCTGGC TCTCAGGC CCTT CCGTCCGGCTAGTGGCTGGC TCTCGGGCCTCTGCGTCCAGCTGGCAGCTGGTTCTCGGGCTGCCCTCAGCAGGCCCTCCCTCTCCAGCT CCGCAGGGGGAAGACCACGTCTCTTCCCTGGCTCCACTGACTCCCCTTTTTTAACAAAAATTGGCACTA CTAAAAGCAAACAAAAATTTGCTTTCTATCCACTGATAGGAACAAGGTCATGTTTGCTTTCCCCCGACA GGAGGCCCCTGCGGGAAGTAACAGACACACGCTCACTCTCCGTTTCTCATCCTCTATCGAAAGGCGTCG GCAAATCTGGAAGGGGCGGTCTGCAGCTGAGCTCTGCTGATTCCTTAATCAGCACCCCTTTGTAGGACC CCCTGGGTTA CATCCATCTTCTC CCAT CATAAG CGACGTGTGTGT CAACAATGCTTGCCTGTTTGAAGA TTACGACTCAAGT AG AAAAGCTGGTGT CT AATGTT AT CT AT CCGGGTGCTCCCAGGAGG ATGATGAAAG CCAA CC CTGTGGG CC CTGAGT CAG G { N) xGCCAGGGCTGGTTTGAGCATAGACACCAGGGTGCTGTTCC GCCAACGCGTCCTCCACTATCGGTGGGTGGAAGGCCTGAGTGTTTCCTGGTCTGCTGAGGGCTCGTTTG TGAATATCCCTTT CCGAATTG CCAGACGGTT CC CTGTGC CCATTT TTAAGTATTTTTCATCACTTCCTT ATGGATTTGTAGAAG CACTTT CTACATAGGGAACGTT CACTCTTTAT CACATAAATCCGAAGTATTACC CCAGTTGGCAGCTTTTTAATTATATTTTGGCATAGAGAAGTCATAAATATTTATTTGTGCAAGCATCGA AAATGTTCATATATGGCTTTCGCCATCGTTGTGGGCTTCTGGAATGACTTCCCCACACAGTTATTGTGT AAGACTCAGCATCATCACACAAGAAAATGGCCATTTACTTTCTTCATGAGAATTCCAGAAATATCGTTT TCGTGTGTTAAATGCAGAATATCCTGTCACAGCTGTCTGACCAGCCTACTAGCTTTACGTGGCCCAGCC TCAGATCTCAGCTGTGCTTTTATGTGTGTCTGGGAGACACTGCATCTGAACAGGGTTAGTGGGGGGTCC TGAGTTTTGGAGTCTGCTGTGTCTGCTCCTCAAGGCCATCAGTGAACTTGAACTTGATCTCTCCTCTCT TTTGAGAGAGATCGG CATGAT CCGG CC CACC CTACAAGTGCAGGCAGTCGGAACAATGCTCCCACAAAG CCCCCAGCCCAGCGAGGAGGCAGCACCCCATTCCTGTGGGGCCACGGGGAGAAAGCCCCCTGCTGTGAT GGGCCGTGCTAGAGAGGCTGGAGGGCGCGGACCAGGCAGGGGTTGCGGACAGGGCAGGGGGCGCGGGGC ACCTTCCATCCTCCCTGGCTCAGCAGGGCTGTGTGTGGCCGGCCACCTTGGGCTCTTTCCCTGTCCATG ATAC CATGTAGTCAGGAACAGGGTGAG CTGT GAATGAGTTC CCTTGACTGAGGTGCTGGTGG GATGGAG AGGAGAGCGAGGGTGAAGTCTGAGAGCCCTGGGGTGGGGCACTCGGGGCTTCAGGTCCCCCTGGGCCCT GAGGACACTCAACTTCAGCTTTCCTGCCACTTCCCAGCACACCGGCTGCCATGGCCTCCCCGATGAAAT CAATTATCCCATTGCTACAGTCAGTTAAAAGTGAAATTAAATGAATTCTGAACATTAATAACAGGCTGT AATAAGACAAACTAC CT CTTAAT GATG CATAGAAC CCTGATGAAAATAAATTTCCTTGTGGGGTGTGTG TGAGTGTGTGCGAGT CAGGTG GGGAACTCAAGGGGGAGT CT TATC CG CGGAATTAAAATGAATATTGAT TACACAGCTGATAATTAATGATAGCAATTTTTGATTAAAAATGAGTTCATTTAGCAAACAAGTGGAGTG TGGTAACATTACATCTAAAAATAGAATCTAGAAGAAAGGGCCCTTGTTACACAGGCAGGAGTACAGTGA GCTCTTTGATCCTGAGGCCGACGCCTCTGGATTCGAAATGAGCTTCTCACTCAGCAGCCCCCGGAACAC ACAGGCCTCCCGGGAGACGACCGCCTCCGTCTCCCCAGCGGGAAGGCAACACGGTGCCGCGGAGGCCCA GCCTCCAGCCTTCCACAAGATCCAGGGGCTCAGGGAGAAGCGATTGGGCCTTCAGGTGCTTCTCTCCAA AATCCCATGGCTTAGACAGATTCAGCAAAAATGGGATAAAACGGGGCAGTAACCCTGGAAGTGCTGCTG CGCTGGGAAGGACTC CTCCACCC CGGG CAGGTTTGTT CTGTCTCCCTGT( N) xACAGACACATGCATAC ACCCAGACACACAGACACACGCATACACATACACAGACACAGACACACACATACACACATATACATACA CATACACACACGTAC(N> xCAAGGAAATGTTTATGTGGCTGAATTCAGAAAAAGCACCTGCGAAGCCCT TCTGACTCAGAGCAGCCGAAACGCCGGGTAAGATGTGGAAGCCTGTGGCTGCGCTGGTAATAAGGACAG AAACGCCGCCAGCCATGGGGGCTGCCCTCAGGAGAGCGGGACGGGCAGGAGGGCACCACAGGTGGCTCA GTCCTGGGCCTGAAGCTAGTTGCTGTGGGCAGCGTGCGGGGGCAGGAGCAGGGGCAGGACCCTAAAAGC GGGAGAATTCGGAATCCAGACATCCATAAAGCCAGGACACCCCAAACGGCCACAATTTGGAGCATGA
>H s22 3 58 699 39 * 358783 94
GGAGACAGAGGCAGAGAGAGGTCATGTTTCTTGCCTGAGATCACACAGGTCATAGGTGACAAAGGCAGA TTTCAGACCTGGGGCTGGCTGCAGAGTCCAT{ N) xCCCTGCTGGCATCCAGCAGCGATGCTGAGGGGGG CCCTGGACAGGCTGGTGGTGAGAGGGGAGGGGGGTTGTCAGCCTTATGGGTCAGAGGCAAGGGGAGTGA GTGAAGGGCTGGG CGGGGACTGAGACC CAGGGTGGGACAGAGC CT CCGT CCTGCCCTTGGAGCTCTGGG CAAGTATCTTTCCTCACTTGGTCTCAGCCTGGGCTCCATGAAAGCAAAGGTATTGGGTGAGAAGGGAGG GGAAGAATCTTCCTTTC CACCGGGC CCAAGC CCTACT CTOGACTTTCCTTCTCTTTG AGGTTTTCAAGA ACCAGAGTGTTAGGGGACTCTGCAAAGAGCACCGGGAAGACCACCCCTGCCTGCTCCTGGAGAACCTCC TCTGGCCCTGTGG CCGC CATC CAAC CTGCAG CC CACCTCTG CC CTGAAG CACAGGGGAGTG(N)xTCCC TGGCTGGAGCGGCAGAGCTCAGCGGATGGGGCCGTGGGTTCTGGAGGTGGACAGTCTCACTTTTAT(N) xGAGTCTCACTTT TAAT CC CAGCTCTG CC CAAGTGGAAC CCGGGAGT CCACTTGGGCTAGTGGACAAGT GTCAGGAAACGCTGCCCAGAGAGGACGTGGGCAGAGTCAAGAAGGGGGAGTTACAGTCATCTGAATAAA GACGGAGGGGAAAAGGGACAGTGCAAGGCTGGCAAGGTCAGTGGGAGGTGGGTGGATGGGGTGGCAGGA GCAGGTCCTGCCAGC CTGTGTGT CAGGACGT CTGACC TTGT CC CATGGGCAGTGGGAGTTACTACAAGG TTCTATGCAGGAGAGAGGCTGGT CAGATTTGTGTCTAAT CC CCACAAAC CCTGCCCTGTGAG CTTGAAC AAGGGGTCTTCCTTTCTAGGTGGGAACTCCTCCTCTGCACGATGAGGCACCCAGCCTTGGCAGTCCTTA GCTCCCATATAGTTCTGGTGCCGGATTAGGATGCAGAACAAGGCGTGGGCTCAGCAGAGCCACTTTGGG GGTAACTTTTGGG CATGGGAGGG CC CTGGGC CT CAAATTGC TCCCAGTG CAATTGGTCCCACAAGGCAA GATGTTTCCCTATAGATGCTTAGATGGCCTGGACCTCACCTTGACGTGCCTCCTTTGTTCCCAGCCACA CTTTG GC CT CACTGGGC CTGTGT CGAT CCTGTAGC CCCACGGCTCTT CT TGGG GAGG GAAGTCTT TGTG GGGTC CTTGTCGAGTGTGTGGGT CATAGG CCTGGGTGTGTGAAAGGG CTGC AAAG CATGAGGGAAGGGG GAGAAAC CGCTGTGCAGGGGG CAAAAC CACTGTGTAG AAGGCAAGGG GGAGAGTCCGGG CATAGCGC CA GGTGGG GAGGCACTGGGGGAGAAGACGG GG GAAAG GAAGCAG GAGCCGAGG GGAGACTGGGGGACTGGT GCGCTTTTTTTTTTT TTAACTTCAT GAAAATATTT GTTTGACTAAAATTAC CCTTTTTTTTTTTTTGCC CTTTCAGAAACTATGTTAGTAGTTCTGTGGACACACTTAATTGAAAATAAGA(N)xGCTGTGTATACAA CAAAAGGACGAAGAACACT GGTCAAGATG CAC CATTAGCGTAGGGCGGGTAAG TTTAGACTAGAGAATG GGTCTAAGGAG CT TAACTT GACTATGAAG CATGATTTTGTCAACAGGGT GGAAAG CAAC CG TGTATTTG GTTTT CCACTG GATCGCTG CTTC CGCCTCTG CAAAC CACCAGGAAATAGGCTCTT CTAAGC CT CCTCAG AT TT CAAAATG CACC GACCGTAT CACATTATTG TT T CTTCAGAG CAAGAGT TGCTTCTT TGTCTC CC CT ACTGTGCATTCCATAGGTGACTCAGAAAGACGCATTTTTAGTTTTTGTTGTTATTTGGAGTTTT(N )x G GAAGG CATT TGAG CA CAGAGATG CCTGTCCCCC CATTTTCGTCTCCTTGAAAACTTCACTTAAGG CTGC TGTGTTCATGAGG CTTTCTGTGGGTGGATGGAGGT CAACCTTGAAGAAG CAGC CGAT TT CAGCTGGAGG TTGGTGCTGCCCTGGACAT CAGG CC CC CAGAAG CAG GGGAGACCATGGGAACATGAAATGTTGGG GTGC TTTTGCAATTCCCGGAG CATGGAAGTGGGATTGGG CAGGAGGAGGGG CTACTGCACTGTGATGCTATTT AGCTGAT G ACCTTTA( N ) xTATCCCCTCACTGGACAGAGGAGGAAACTGAGGCCCAGAGAGTGGGGGCA TCTTGCTCAAGGC CATG CATCAAAT CAGT CAGAAC CCAAGGACCTGG CTG (N ) x GGTGT CCAACCAG CG TTTTAGAGC CCCCGCTG CAGT CAAT CG CTAT CAGG CACCCTGTGTCCTT CC CAGAGCATTG CTGAATAC CTGGAAGGAGG CAGAGAACAC CAGC CCAAGGAAG GAATACACTG GGAT C TTGAGGAAGG CAGGAG CTTC CTTGG TG CAGTGT CCCCAGCTGCCTATCTGC CTGGGAGTGGCCTGGG CACC CAGGAC CT CCTCTC CAAC TCCAGGCATCC CATGGAGGACTTGG CAGC CCAG CT GAG GGATACTCCAGGAGGAT CT TC CCTTGG CAGG TAGATCCATGT CAAACGGATC CCTCAT CCAAAG CCAGCGCTGGCTTCACGTGCTGTTTTGTCTTCTCTT CCTCT CACACGAACATACG CTGTGTTGTTCTTTCCTGAA( N ) xCTCATAAATTCTCCCTCTGGGCTTTT TGCCAAGGGGCTGAATAAACAGGAAGGAAGGGGCAAATAAATCACCAAGACAGGCTCTGGGAAGGTTTC TGACTCTGGAGAC CGTCACAGAAGAGCTGGC CGGCCTCCGCCTTGGGAGTTGCAG CGAGGCTGTGTAGA AG GG GTTATAG GAGCAGTGATATGGGGTCAG CAGCAGGCTGGTGCCCAGGCAGGCTTTCAT CAACGCTG CGACCTT { N ) xG ATCTT TT AT AC AACT TGGAGGGC TCTGGAGGGAGGGT CTTG CTGAGCTG CT CT GAT A GTGGGGATCATGCAGACCCCCGGCTTTCCCACTCTAGACGGCCTTGGCGAGAGCCCCCAGCTGCCCTTG AGTTG GGACATCCTG GGGGTGAATGAGGGTAGAAAAAGGCAGCTGCAGTTGGAGG CT TG CG GCTCAGAG CTAAGGC CTCCACCC CACAGC CTCCGCTGGCTCCC CAGGTGAGCTGGGACAC CAAGATCAGAGGCTGGA AACAAGAAGTTTTGGCTTTTAGCTGGCAGCAGCAGCAGCTGCCAACATCATCTCTCCTTCAAATGACAG GCTGGGGACTTTGGGAAAACAGCCACAATAAGCGTGTTCGGAGGGAAAGGAGGGGGAGAGGAAGGAAAG AGAGCAGAGAAGATGGTGAGGTCTGGATCGTGGGGTGCAGCGTGTGTGCGCACATGTGCTTCTGTGATC CTCAGTATGTCAGTGTTTCTGTGACTGTGTGTACACTTACGCTGTGT CT CTGTGATT CAAT CT CTGTGT GTCCC CAGGGAGA CAGTGTGCGT CT C T G T ( N ) xTAGTGTGAGTCTCTGTTTATGGTGATGGGTGGCAGC GCAATGC CTTGAAAATGTTTC CT CCAAGGTAGCAAÁGAAAAGTCCCTGTGACCAT CCAAAG CCTTGGTG AGCTCAGCGGCT(N )xTAGCAACACAGCCAACTCTTTTTTTTTTTTTTT{ N)xCAGATCTTGAAGAAAA CCGGAATTGTAAC TTAAGTTT TATCTG CGTATAAC CTTGCAG CGGCATGGCAAAGGAGACAGGAT CT CA CAGAATTTTACAAAT <N)xGCAAC(N)xGCCCAGCGATCCTGCCGGAGGGTCTTGACCACAGAGTGGGG GTGTGGGGGCTGT GATT CAGATT CT CCGAGGTG CTGATGCAGCTCTGTGGCGC CTACTGCCCCCGCCCC CCGACTGAGAACTGTGG CT GTGCTCAG CCAC CGTGACTGGCTGG GGTATGG CT CG GATC CCTTGGGT GT TGGCCAGGCCCCAGTTGATCTCCAAGGGCTCTCGGTGTTCCAGGAAGCTATGGCTTGGTGTCTGAAACA CAGT C TGGAGTAATAAATGTCGGGAGGAAGGAT GT GACTAGGCTGCACAGGGAAGAGAC CAAGACACTT TGAGCCAAATCTCTCTTTCTCTTGGGACCCCAAGAGCTGATTCACCTGGAACATCCTGGAGCCATGACA CAAGC CATG CT CTGATGGCAGAGGCTG CCAGCTGC CACCCAGGA CAGAGAG CTTATG GGAGGTGGTG GA GTTCCTGCTCC CCGCAAAG CCTGAGGG CTGGTAGAGATGTGAGGGTGTTTCAACGTTGCTTTC CAAGAT AAGGAATGAGT TCTTTTTCTGTCCTTCTT CTTCTGTCCCTGGACAGC CTGACTGCGT CAAAGAAG GAGG TGGGGGAGCCAGGGGGCAGCTGGGGTAGGAAGAAGTCCAGCTTGGTGACATTAATGGGAGGACAGAATT GCCTGGCTCTT CTGT TTTTGGACACGGGCACAGTTATGAAACTTTGAGCAACC CCAGGCCT GGGG CTGG TTCCTGGTGGCCCTGGCCCACCCCTGCCTGCTGTGTGAGGCCTGTGAGGTG CATGAG GC CACTGT CCAT AAATGGAAT TGTC AGTTTATT AC CCTCGGTACC CAG CCC AGTGCCTGGAAGTT CTTC CCGG ATGT CG AC TGCCGTGGGGAGAAGAGGGGC CCACTGGC CA CAGTGAGGGTGCTGAG CTGGAGACGAGTTACA CC CGAT CTCATCCTGGGAGGAGACTGTGGTCTGGAGGGGGTGAGAGCAAAAATGCAGAGGCTTAAAACAGAGGTG AGGGCAGAGGGAGGCAGGAGAGACCGGGAAGGATTCAGGA( N ) xCTCTAATGTTATCGTCTCCTCCCTG CTGTCCACTGTCCCGCCACACGGGTTTCTTC CAGTTCCTCGAACACACTAAGC CATGTC CA CC CTTGGG GCTTGG(TJ)xTACCGTATTTTGTCTCTCTCCCACTAGATTATCACCTAAAACCGAGCAGGGGCCATATA GGTCTTCTATAAATATCTCTTGTTAGTCAGCAAACATTTATTGAATGACTGTATGAAGGAGAAAATGGC TTACC TGGCTTTGGAAG CTTGGGGT T G ( N ) x
TABLA F
>Hs1_21461150-21461398 CTTTCACTGATCAGAGCATCGAGCCCCGGTCTTGCTGACCTCAGGGTGATGGCCCTGAAAGGGGGACGG GACCAGCTTGGGGCTAGTCTGAAGGAAGACACAGAGACAAGAGAGCCCCTTGAGAGACAGGGTCAGGGG CCTGAACGAGGGGCTATGGAGGGTTCGTACAGCCCTGGAGACCATGGGCTACTGGGTTGCTAAGTGCTG GAAACACCTGCATGTCCCCCAGGCACCACGGCAGCTCTGAGA
>Hs1_89223352-89223503 CTTTCTCTAAATCTGAAAGCTGATAACAGCTAAATACATGGCTGGAATGCCAGAAATGGAGTCAGGCCA AGTGGAGAACAAAAAGCCTGGTTTATTCTCTCAAAAAAAATTGGGTTCAAGCCAAGACCTTGGAAGAGT GAGTAAGTGAAGGG
>Hs2_37500136-37500260 GTTGAATTAAAGCTAGAAAGAAAAAGAGAGAAATTTCAGAATGCATTGAAGAGAGAGTAGGACCTACCC TTAAAGTAAGGGGAAAAATTGCAACATGTCAGAATTTCCTCTGGATCAAGGAAGGT
>Hs2_13103843-13104004 AAAATATTTAAGCCTTATTATCCTTTTTTATAACTTGGGATCTGTAAGACTGCTTTGTGAATTTCACTG
t Gt t a t t a t Ga a Ga t t a t t a t a a t a a Ca t a t g g Ga Ca t t t c t t t g t a a a CCa t a a t a t GGCCa t Ga a a a TATTAGTGTTTTTATATCTCTTGC
>Hs3_26316904-26317098 ACAATGTTAAAATCTGCTAAATGGTAGAGTAATTGGACCCATGGTGTAAGCAATTACTTATGACCGGGT CTCTAAATGTTGTTTGCTTATAGTCTTTAGTATTACTCAGTTTTTAAAACAAAAATTCTGCTATGCTAA ATATATTCATGAATGAATACTATAAAACCTTCTCCAAATACTCTATTTTTAAGTTAG
>Hs3_109045139-109045270 TTGGGCTAAGGCTTTCTTACATTTAGTGTCTTTTTAAATCATGTTTTACAGTCATTTACCCTTCAGTCC TTACCTAAGATGACATCTTTTCTGTAGCCTCCAATCAGGCAGATGATTCAACAGAACAGTTGA
>Hs3_54201491 -54201591
c a g t a t t c t t t g g t g t a a a t g t g a a a a c c a t a g c a g t a g c c a a a c a g a c c g t t g g g t t a t t g a a a a c c c a g a c a c c t a a g g g g g a a g a t a g t t g t c c a c a g
>Hs3_4567129-4567322
g g t c c a t a a t a a a t g a t t g c a g a c t a t t g t t a t c c t c a t g c c t g g g a c a t a a t a a a t a t c c c t a a a c t g t t a c a t g a a a a a t a g a a a t t c c c t g a g a g c a t a a t t a g g c c a g t t a t c t c c a a g t t a g a t t t c g t t c a t g g g a c a a a t g g c a g a c t t t t c c a g c a t g g g c t g a a a g a a g t a a a a a a a a g a a g a a a >Hs4_15922534-1592276 5TTTTCTTTACTTGATAAGAGAAGAGTGACATTTATGGCCTTCAGACATGGGCCAAACCTGA c a g t g a a c a g g a t c t g t c a g t c c c t g t g t g g a a c c c c a a c c a a g t c c t t a t g a a g a g a a a a t t c t a g a a g c t a a t a c t t t c a t t t a t g c t c a a c c t g g a g g g g a c a t t t a a a a a a c a c a g t a a a a a a g t c t c t a c a g a c t t t c c a c c a g c t t c t a c c c a a c t t a t t a c t g t
>Hs4_186017668-186017837
t c c a a a c t a c a t a g c c t t c c t t a g a g a a a a c t a c t a g a t c a t g a a c a a g t c a g a a c a g a g a g t a g a g a a g a a a t t t t c c a t g g a a g a g c t g a t a a t t g g a g a t t t t g g g g a g a g g g a g a t g a a a a g a a t t g a a a t g g a GAAGAAAAATCCGTTTTTGTGGGGGAAAATAT
>Hs4_44016974-44017119 AGAGAGAGAATTGATGCTCTAGTCTGCTTCCATGGGGCTTCAAATCCTGGAGCAAATCACTCCCATCCC
t t c c t a c a c t a t c t a t g t a c c t t t c c a g a c a a t t t c a a t a a a g a a c a t g a c c c c t c a t t t g a g a a c t c t c a g a t c t g
>Hs4_14175097-14175323
a g a g c t c t c t c c t g t g g a c c a t a a g g g c t g c a a a t t t a g g c t a a a c t c c t g g g c t t t c a c t t c c a t c t a a a c c t c a c g g g a a a c a t g t c t t c c t t t a a g t t t g c t g a c a a t g c a a a t t t c a t a a a a a t c t t t t c a a t a c a t a t t t c t t a c t t t g a a t t a t a a a a t a a t c t g t t t a a a g g g c a c a t a t t a a g g g a t a c t g c a t t t t g g g a a a c a g a c a g a g c t g g t g c
>He5_138546637-138546876 TGAGGCAGGCCCCCAGGAACTGAAACCTGGATGAACTAACTAAGTTTACAGGCCTGCAAAGCTGCACAC CAGTAAGGTCCTAAGTCATCAGCTTTCTAGGGCAAACCTCTCTGGCCCGGCACAGGTCAGGTCTGGGTA AGGAAAGAACCTAAGCATGACCTCTACATGCCCCCAAGTTCTCCTCAGAAATTACAAAGAAGCCTTCCC TACTCCATGCTATGTCAGCCATACACAGAATTC
>Hs5_ 1895609-1895744
CTñGTT TAAACACT TAC CAAAATAACAGAAAACATGAT G C C C TGATñGG CAACAAGGGGGAAAAGGTGT TT CT CAC CAATACAAAC C TGTGCAT CAC CT T CAGAAAT CT C CAC T CAGGAAAAGG C GGCAAT T C CAG
>Hs5_131556651-
13155686 9 CATGGAAC CACTTATT GGAGAC T GT GAAAAGAAAG CAAG CATG TTA CAAT CCTATTGGTC AGT TTAGTAG C CT GAAT GAAT GACTCCTTCAT GAT GACCCCCTG CACT GTGCCCTG GAAT CAGACACAA ATAAGACAAGACATGATCTCAGGCCTCAATTTGCTGAAGAATTGGTAAGGAAGACACATGTAAACAGA1 AATTGTATGAAATTCTGCCAG
>Hs5_108742923-108743062
AC TTAAGT GGAAAT CATAC CAATTAT TACTAC TT GAATACAT GAGTAAAT TTAT TTAC CAAGCACT T C T AT G C T C CT TAAC TGAGAATGCAGACT GATTAC T C TATGAGTT CATT TATGT CAATGTACAG C CT TT CAC TA
>Hs5_106949296-106949480
TAAAT C CT TTAAAAT CAAGT CAAT C C C CAAATAT GGAAAAGATAAT TCTCTTTC CATGAGAGAAGT TGA AGTGGT G C CATAACAGAAAT TAG C C C TAAATGTT T CAGAACT GT GGTTAGAAGT TAAGTTACATAAGAA CAT C TGGATGG CAAT CAAT CAGTGGAAGATAT TATATGGTATAAAGT
>Hs5_ 149653105-149653309
GCAGGATCCAGATAGCAGAGAGATGGCTGGAGAGGC (N) xTCCCCTCTGAAAGTCTTACACATGAGCAC ACACACACACCCCTCT GAG C CT T C CAGAG C CTAGCT CAG C GGTGTGGGCACT G C CT CGGACAGGCAC C C C C CAC C TT C C TGAGGGTAATAGC CACTCTGCT GAG
>Hs5_121461146-121461216
GAAGGT T C T CACAGAGGACAT CTTACCCGTTC TATTAGC CAT GTGCCCCT GAGGGT C CAGT CAT T C CTA AG
>Hs6_70348510-7034862
7 TT CT C C CAAACGAGGAGT TT TGGAGATTATAAT T C T CAT CATG TTAAGG CT CAGG TAAAAAAAAAAAA AGAAAAGAAAGAGTAAACGGAG C TTAAAAAAAAAA C CAGATACT TAGT GT
>Hs6_54736509-54736604
GGTTTCCCTT TAGT C CAT G C TT CTAAAAACAAGTACAT TGGTAC CAAT CTAGTGTT G C TGTGGAGAAAT TAGGGT CTAT CTTTGTTGTTCCACTTT
>Hs7_75395901-75396027
CTGCCCACAC TAGACGTT G C TTAGAGTT CAG C TT GAAGC CAT GT GATGTGGGT CAC TGC C CAGGAGGAC TTTTTT CTAGTGCT GT GGC C TGGAGGAC TTTCCCTC TAG C CT CT CATGCAGGTGTACT
>Hs7_3239026-3239202
ATAATGATAGCAAT C CAT TT GGAAGGTAAT TTAC TGATAACT TTAGAT TT TT CAAT CAAAAAC C TTAAA TT G CAAAATTACATAT TT TGGGCATGTATTAGGTAACAAAAGTAAAGAGGCAACAAAGAAT C TT TGAAC AACATT GAAC TTCCTGCTCT GAGGCAGAAT TATAGTAT T
>Hs7_12932213-12932446
ACATAT TACT GATAGATAAT T CAGT C CT TT T CAAAAAAT C TGTAGATAAGTT TGTT G(N)xGTGAAATT TGCAAAGAAGG CTTTTTTTTTTTCCTGT TAAT TT TAG C CT CT TATTAGAGAC TACT G C C C TACT TT C C T G C C C TGGGTAG C T C TAGT CAAC CT GGCAGTAT CT TT G CAT GAAAC C CAAAGAAAAAG CAGC CTGTGCCA
G
>Hs8_ 120684337-120684476
C T C T CAT CAAGTGGT TTCTCTTCCTCCCCCATCACCATCCAGCT CACAGG GATT TGTCATTCCC CAGGC ACCCTCTCTTTCT GGAACTAT CACACAAGAC CAGAAAAT CATGTGTGC CATAAAAT GAGT C CAAAT TTA AA
>Hs896945650-96945720
AAC CAGAAAAGCT GT G CAGTGAC C CAGC C C C T TAAT GAC CAT GAAT GAC CCTGTTTTCTCTCTCTTCTC AT
>Hs9_113608047-113608176
CTAC TAC C TTAC CTAT GGCAT CAGAGTT TTGTTTCCC CAT GTATAT C C TAT C T C GATAC C GT GGTAAAT GTAT GAGTAAAT GT CT GT TAAGAT T C TT T CAT GAGAAT CATGAGAGAT TTATTGGTCACCA
>Hs9_77788560-77788689
GGATAGACATATGGATAATAGCT TAAAAATATAAATAT GGTCTTTC CAGT CATAGAAAC C TT C C GAT C C AAGAGGTAC TTTTCCCACTTTC CAAAAAATA CAGGGCTATATACACATACCC GTAAGG CAC
>Hs9_4816508-4816623
AAT TCCCCTCTGTGGT CAGC C TAGT CT GAGAAT GTGCTGTTGGGAGAC TAGG GTGCTTAT TAGGAT CT C TTTTCTGTGCCCTGC CATGAC T C TGG CAACT CAGATGGCT CAGAGAG
>Hs9_ 125204413-125204594
CTGGGTGGGCTGGTTCC TTAAAGAC CGTACCCTCGCTAT CAAC CAGAAAAT CCTGGAGAGCT GAAGTGA ACT G CAACT TGGGTTAGG CCCCAGCTTGACAGCTC GAAAT CATT TTTTTTCCTTCTTGCTC CAACT CT G ACTGGGACCTGC CAT GTCCCTTACCTTCTTTCCCATTTTCTGAG
>Hs10_116886669-116886802
TT TGAAAAGAAC TGAT CT CT TAAAAT GGCT T C CAAACATT GGGATAGAAGAGCAAT TT TTATAACACAT GAAAAGCAAGATATAAGAAATT GAAT T CAT GT T CAT GT GTAAAGAACACAATAGAACAGTAATT T
>Hs10_120651143-120651387
ATGT CAGGGC CCTGCTAGT GGAAAT TCACTTCTGTGTG TTAGAACAGG TGTTTCTGTGAGATCCT CAGA GAAAC CAGAACAGCTAGAAG CAAGAACAT TCCCCCACCCCTTTT CAGAAAATAGAAG CAAGT TT CT G C C ATGAAAGC CATTTTTGC TTAAAACAC CAG G C TGAAATAATT G CATAT TTGTTGGGGATTGACAGT TGAG T TT TT T CAT GAAT TTTGCTATCTTTTGACGGTAG CAAA
> H s l0 _ 131665070 -131665215
TA C TT TG CAGAAT CTACTTGG TAAAGACACT GAGATTCT CAAGATAG GAAACAAACAGG TTATCTAG TA T TAAT TATGCAGT GACTAG GT GGAGAT GCTAGTGTCCCAGCCCTCCTCCCTGAGG CACTAAGAAT GAAG AGAGAGAA
>H S10_11335204 -11335346
C C C T C CAA C CT CACAAT TTGCTGGGGGTCTT TAAC G C CTAGAAC C CAGAAATG T CA C TTAAG C CT G C TG TGTGTGTCTCTCT CAAGG C CT C CAAAAC CTCCAGTGTCCAGGTGT CAAGTT CAGAGCTAGCAGCAGTCA AGATA
> H s ll_ 24655162 -24655310
TGAATATAAGCAAGG GACTAATAAAAT GTGTAGCTTTACCTTTTCTGCCTCCTAATGTATATAT CAAAT ATATATCATTTTTTGTCTATTTCCACTTT GACAAT TG TTAT GAAAAT CTCCACCTGTAGCTGCCTGGCA C TATT GTAACA
> H s l2 _ 123553236 -123553545 GTCTCCTGTGGCAAGTCTGCCTCTGGCGTCTCTGTGTTCAAAGGACACTGTAGCTGCCCCTGTGTCGAG GAGCACTGGGCACAGGTGTCTGAGTGTGGGGCACAGG TAAAAG GGCCCCTGTAACCCTGAATGATGACA GCCACGATTGTCAGGGCCCTAGA GAAACG GGCCTCCACCCATGCACAGCGGCAGGCCTGCAGG AATG AG GCCAGACTCCTTGGCCCTCACTT GGCAGAAAGGT CATTCCACTCC CTAAGG AAGAG CCAGGTACC CAA C CCCAGCCTGTCGTTGGCTCTGCCGGAGGGGCTGA
> H s l2 _ 7205280 -7205480
CAG GGATAG GGGCTGTGGG TAAGAAGGCAAGCT TGC CAT GAAC TCTGCTTCCCTCTT GATGCACACGC C CCTGCAGACTGCCACAAAGTAGCATATGTCCTTCTCTTAGTGTCTGCCCAGCCTCGGCCTGACTCTTGG CCTTCCTGGATCAGCTGCCTCTGGGGCTGGACATGGGTTTTTCTCTTAAATGCAACCACTCTA
> H s l2 _ 127539423 -127539595
CTGTTCGTCCCACCCAGGT TGATAAAG GT TGCGAT TCTTCATGC CAAGAAAAAAGTT TT TAT C CACAAA TACATATGGAGGATCCTTTCAAGGCTGTTCCTCCTGGGGTGCTTTAGAGAGTCCCAGATCACATAATTA AAGA C CAAG GG TG CAGACTAGCAGGA CTGGTCCCC
^ H s l 3 _ 10 S 98316 0 - 108983305
CTTTCCTGTAT GAAT G C TTAAGTAATT C CAACACAAG GAAGAT TATGGCTACTGATT TAAGTAGAATAT GGG TATGAGTAT CATGTATAGTGGTTCAG GAGGA CAG GGAT T CAGAGAGAT T C CATAAAAAAAAC TG GT GCACTTGT
> H s l3 _ 55470461 -55470661
AGC TAAT TTGTCCCCAGGAT CAGCT TTCTTGGAGTGTCATGGCTTTGGCTGACA CAGAAAT TGCAC C TA TAGATGATTCTCCTGGCCGCCCTGG GAC C CAG C TG CAGGCATG TGCTGATTTCTAGT CAAT GGAGAATT ATT T C TAGATT GT TT CAGGAGAAAGAAATAAAT GT CT GT CTAATT GGAGT CAAT C TT TTAG GA
> H s l3 _ 40513335 -40513451
AA C CTATACAGAGCATGAAG CACTTTTA CAG TAAGAAAAAAAG GAT CAAG CTGGTTTCAT CAAAAAAAA AAAAAAAAAAAC CTATTACACCATC CGAACT GTATAAG CAGACGACAT
>Hs14_71183079-71183311
CT T CAT CT CAAATATAG CCCTTGTCC CATT GGAC TGAC CAC CAT CACT TT TGTATGTACAAACATT TAT TGAGTGC C CAG CACTTCCCT CAGT G C TT CATATGAACAG C TGAT CT G CAGG CAT GAAT TATGGTAC CAT TGCTAAAT GAC C CAGAGGTTAT C CAAAGCAAAAT G C TAAACT GT GGATAGGAGGG CTTCTGGTGT CAT C AT AACT CT AT TT TGAT CATT AC CT TA
>Hs15_95868804-95869053
TGGT GGCAT C TATT CTCTTTTGTTTGGTCTTT TATAAC T CAT GT GT TGGATT C CAGGGCATGAGAGAAA ACCTACGTCCTTTCACTCACCT GGCGAGTGAAGAGAACAT TAAC TT GTAT GCTTCTCT CATT C CATAAA TGTT TGGGCTAT TTGCTTCTTTCTTCTCACT CAGAT CGAGAT TATATGAT GAAGATAACAG CCGGCCTT CATACACT TGGCCCACTGCT GAG C C CAC TGCAC C TGAGGGAGC
>Hs15_81851632-81851811
C TTATAT TTAT CAGCTGGTTCTGTACT TGTGAGAGAAAAAGAGACTAT CATT TATG GAT C C C C C TAAAA C CT GTAGGACATGCACT TAAAGT TTTGGGTGC TAAAT CAACT T C T C TAA C T CAAGT G CAG GT GGAGACA C CT TAGAACACAAGGAT CAAT GT G CAT CCTCTTTCTCTATCC
>Hs15_32072109-32072293
TT TGTAATAT GAGACTATAT TGG CACAAAC TATATAC CATACAAGAAGAGGGAAAAAAAT GAG C TGAAT CAAT GAGGACAAATAAAACT GTCTTGTTGCCT CATGGAAAGAAAAGGT T C GAGGTT TAAT TTAAAAGTA CAGGCT G C TAGAGTAATAT CCTTTCTTTTCCCTT CAGT CTAATT GT T
>Hs15_66763691-66763868
TGGACTAAC C CAAT GT C C C C CAGT C C T C TT TGGAAGG CTTTTGTTTACTT GATTAATT GGC CAAAGCAA G CAGGGT C CT GAAATAGGGGACAT GAGACAAGTAAGT C TT GATT GGGATGGAGGGAGAGAAAGAACACA G C CAGAGGC CAGGGCAGAAAGGAC CACAGGAGAGGAG C TT
>Hs16_26462674-26462938
GGC C TAAAC CAC TAGGAGTT T C TGGAAACAG C CACT T C C C CAAACT TGTCACTCTGC CAT TT CTAGCT T GT T C CATT T C TAG CTTGTTCCTTCC CAGGAATAAGAGGCACT TTAGCAGTAGAATT GAGGAGAAGAAGT CT CAGAC CAT GT C C TT CAGGAT GGAAAAGT GT GGT C CAAGAACAGAGAAGAT GACAC CAGAGGAGAGAC AATGCT CGTT GGAGTT GACTAGGT GGCAGAAAGGGT CAGGAGAGG CACAGGAG C CACA
>Hs19_49663834-49663995
AGTGAGTGAGTAAC TGAGCAT CAGAC CACT GGAC CTGTTTCCT CAAAT CAT CAT TT CAAGC CAGATAGA ATAT T C CAGT TAGAC CACAAAGAGAAAC TT TGAC CACT GGAGAGAAAAAT C C TAAGAGCT G CAT G CACA AGTTAGGAC C CT G CAGTTAATGAG
>Hs20_39516205-39516408
G CATAC TCTGGTTT CACT CT CAAGT C CATT CTCTTTCT CATGTT CAGAAT TAAGAAAACATGAGACAC T GAAAGAAT GGAGGTAAAGC CAACAAACT TT TGT CAT GCTTTTTCCCTC TATTAT GT GT GAAGTT T C T C T CACAGAAAGT GT TGAAATACAAATAAAATAGT CACATGC CAGGGATAGAATGTT GGTT TGAGG CAG
TABLA G
>Hs10_75892398-75892532
AAGATAGT CTCTTGGCATT CAAAAATT TAGCATACTT TAACT CATATT TG CT GT GATG TTAG GTAGTGA ATTAT T C TT GT TAAC TAT CTAGCGTC CAATGT GACAT GTGTTTGTTGT TAC C TACAA C TTAT CAAA
>Hs18_26012771-26012898
CACACACAGACAT CACAT C TTAATT CACAGCGTGCATC TAAG TT GATT TTAGTTAGTTGCATATGCTGT GGAC C TAAATGAC C C T CAC CAAGCAG CAT CA CAAAG T TA C CATT C CAT CAAATG GT TT T
>Hs3_10167419-10167564
GTAAC TTGCCATC CGCACAGAAAATAC GAGAAAAT C T G CATG TT TGAT TATAGTAT TAAT GGA CAAATA AGTTTTTGC TAAATGTGAGTATT TCTGTTCCTTTTT GTAAATAT GT GACATT CCTGATTGATTTGGGTT TTTTTGTT
>Hs17_60439625-60439754
CT GAT C CT TAGAAGCAT C CT TAAAACAT TTAAAATATAC C T CAAAAAAG C TGGACT TT CAACAACAACA AT TATACAAATGTATACATACT GTAT CT GT TAGAGACACAC CAAGAAT GAACAGGAAT CT C
>Hs13_110359320-110359567
G C AGAGTGCGGT AGAAGT TT C C AGG AGGTGT T A T C T AGAGG CAT C TG AG AG GAAG GT GG GAGAC C CAG C AG CACACT TTC TG C C T C A C TC C C T TT A T C*T AC T GAAAAT A C T G A A C T G C CATAAG A C A C TT TG AAAAGG GGGCTTCT TGAT TTAAAAAGGT CT GGAAAG CAGTGTTAGA CAATT T C T C TAGT C C T C T C TAA ATAAG AT G C T A T G TA A C A A TT TGAT GAAAAT TAAC C TA CAAAAAT GAT
>Hs16_ 13939660-13939769
T CAATAGAAT CACT CT GGGAAAATAT T T CT TAGGAAGT G TG TTG TTTG TC TG G ACATACACG TGTATG T TT TTAAG AAT GAGTAT CT G G TACTAT T C CAAAT C CAAGTAG
>Hs18_71535848-71535989
CCTTG G C C TTG TC C G TC TC CAT G TCAG TTG G ATC T GACAG C A G G G TC T G TTT TC T C TC T C T CACT CGG C G ACACATT TAG CC ACCTCTACTCC G C CTCATCTTG G TCCG G TAGG GAGG CAGGAAGG TG ATTCCCC TG G AGGG

Claims (7)

REIVINDICACIONES
1. Un método de detección del nivel de una molécula de ADN extracelular en circulación asociada con cáncer de mama en el ADN extracelular en circulación en una muestra de sangre, suero o plasma de un paciente que tiene cáncer de mama o se sospecha que tiene cáncer de mama, que comprende:
determinar el nivel de cada una de las regiones cromosómicas expuestas en la Tabla 7 en el ADN extracelular de la muestra, en donde la presencia de un nivel más alto que los niveles normales de un ácido nucleico de al menos 25 nucleótidos de longitud que se asigna de forma inequívoca a una región cromosómica expuesta en la Tabla 7 es indicativa de un riesgo aumentado de cáncer de mama o de recaída del cáncer de mama.
2. El método de la reivindicación 1, en donde la etapa de determinación del nivel de cada una de las regiones cromosómicas expuestas en la Tabla 7 comprende:
obtener la secuencia del ADN extracelular en circulación de la muestra;
comparar las secuencias que carecen de elementos repetitivos de las secuencias de ácido nucleico en circulación con las secuencias de cada una de las regiones cromosómicas expuestas en la Tabla 7, para determinar si una secuencia de al menos 25 nucleótidos contiguos en el ADN extracelular en circulación se encuentra dentro de una región cromosómica de la Tabla 7 y está presente en un nivel aumentado en comparación con un valor de índice.
3. El método de la reivindicación 2, que comprende adicionalmente comparar secuencias que carecen de elementos repetitivos de la secuencia de ácido nucleico en circulación con las secuencias de cada una de las regiones cromosómicas expuestas en una tabla seleccionada de la Tabla 2, la Tabla 3, la Tabla 4, la Tabla 5 o la Tabla 6, para determinar si una secuencia en el ADN extracelular en circulación de al menos 25 nucleótidos contiguos se encuentra dentro de una región cromosómica enumerada en la tabla.
4. El método de la reivindicación 1, 2 o 3, en donde se sospecha que el paciente tiene cáncer de mama.
5. El método de una cualquiera de las reivindicaciones 1 a 4, que comprende determinar la cantidad total de todos los ADN extracelulares en circulación en la muestra, teniendo cada uno una secuencia que se encuentra dentro de la región cromosómica; y correlacionar una cantidad total aumentada con una probabilidad aumentada de que dicho paciente tenga cáncer de mama.
6. El método de la reivindicación 1, en donde la etapa de determinación del nivel de cada una de las regiones cromosómicas expuestas en la Tabla 7 comprende poner en contacto una pluralidad de sondas que comprenden sondas que son selectivas para cada región cromosómica expuesta en la Tabla 7 con una muestra de ADN obtenida de la muestra de sangre, suero o plasma en condiciones en las que las sondas hibridan de forma selectiva con las secuencias diana presentes en la región cromosómica; y detectar la hibridación de una o más sondas.
7. El método de la reivindicación 6, en donde la pluralidad de sondas está unida a una superficie sólida.
ES11769758T 2010-04-16 2011-04-18 Biomarcadores de ácidos nucleicos en circulación asociados al cáncer de mama Active ES2703769T3 (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US32492710P 2010-04-16 2010-04-16
PCT/US2011/032931 WO2011130751A1 (en) 2010-04-16 2011-04-18 Breast cancer associated circulating nucleic acid biomarkers

Publications (1)

Publication Number Publication Date
ES2703769T3 true ES2703769T3 (es) 2019-03-12

Family

ID=44799071

Family Applications (1)

Application Number Title Priority Date Filing Date
ES11769758T Active ES2703769T3 (es) 2010-04-16 2011-04-18 Biomarcadores de ácidos nucleicos en circulación asociados al cáncer de mama

Country Status (6)

Country Link
US (3) US10047397B2 (es)
EP (1) EP2558854B1 (es)
CA (1) CA2796578C (es)
ES (1) ES2703769T3 (es)
PL (1) PL2558854T3 (es)
WO (1) WO2011130751A1 (es)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11939634B2 (en) 2010-05-18 2024-03-26 Natera, Inc. Methods for simultaneous amplification of target loci
SG10202008532PA (en) * 2010-11-30 2020-10-29 Univ Hong Kong Chinese Detection of genetic or molecular aberrations associated with cancer
WO2012129363A2 (en) 2011-03-24 2012-09-27 President And Fellows Of Harvard College Single cell nucleic acid detection and analysis
US20140303008A1 (en) * 2011-10-21 2014-10-09 Chronix Biomedical Colorectal cancer associated circulating nucleic acid biomarkers
US10214775B2 (en) 2011-12-07 2019-02-26 Chronix Biomedical Prostate cancer associated circulating nucleic acid biomarkers
CN104685064A (zh) * 2012-07-24 2015-06-03 纳特拉公司 高度复合pcr方法和组合物
CN110872617A (zh) 2012-09-04 2020-03-10 夸登特健康公司 检测稀有突变和拷贝数变异的系统和方法
US10876152B2 (en) 2012-09-04 2020-12-29 Guardant Health, Inc. Systems and methods to detect rare mutations and copy number variation
US11913065B2 (en) 2012-09-04 2024-02-27 Guardent Health, Inc. Systems and methods to detect rare mutations and copy number variation
US20160040229A1 (en) 2013-08-16 2016-02-11 Guardant Health, Inc. Systems and methods to detect rare mutations and copy number variation
US10706957B2 (en) 2012-09-20 2020-07-07 The Chinese University Of Hong Kong Non-invasive determination of methylome of tumor from plasma
US9732390B2 (en) 2012-09-20 2017-08-15 The Chinese University Of Hong Kong Non-invasive determination of methylome of fetus or tumor from plasma
WO2015100427A1 (en) 2013-12-28 2015-07-02 Guardant Health, Inc. Methods and systems for detecting genetic variants
TWI813141B (zh) * 2014-07-18 2023-08-21 香港中文大學 Dna混合物中之組織甲基化模式分析
WO2016183106A1 (en) 2015-05-11 2016-11-17 Natera, Inc. Methods and compositions for determining ploidy
CN117174167A (zh) 2015-12-17 2023-12-05 夸登特健康公司 通过分析无细胞dna确定肿瘤基因拷贝数的方法
CA3025708A1 (en) 2016-05-30 2017-12-07 The Chinese University Of Hong Kong Detecting hematological disorders using cell-free dna in blood
WO2018081130A1 (en) 2016-10-24 2018-05-03 The Chinese University Of Hong Kong Methods and systems for tumor detection
CA3039685A1 (en) 2016-11-30 2018-06-07 The Chinese University Of Hong Kong Analysis of cell-free dna in urine and other samples
TW202348802A (zh) 2017-01-25 2023-12-16 香港中文大學 使用核酸片段之診斷應用
CA3092998A1 (en) 2018-03-13 2019-09-19 Grail, Inc. Anomalous fragment detection and classification
JP2021520816A (ja) 2018-04-14 2021-08-26 ナテラ, インコーポレイテッド 循環腫瘍dnaの個別化された検出を用いる癌検出およびモニタリングの方法
US11211147B2 (en) 2020-02-18 2021-12-28 Tempus Labs, Inc. Estimation of circulating tumor fraction using off-target reads of targeted-panel sequencing
US11211144B2 (en) 2020-02-18 2021-12-28 Tempus Labs, Inc. Methods and systems for refining copy number variation in a liquid biopsy assay
US11475981B2 (en) 2020-02-18 2022-10-18 Tempus Labs, Inc. Methods and systems for dynamic variant thresholding in a liquid biopsy assay
CN114107510B (zh) * 2021-12-10 2023-10-03 湖南工程学院 基于dna三链介导构建多维dna酶矩阵的超灵敏循环核酸检测体系、试剂盒和方法

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100776214B1 (ko) * 2004-08-23 2008-01-17 주식회사 마크로젠 염색체 이상 검정방법 및 마이크로어레이 칩
US20090299640A1 (en) * 2005-11-23 2009-12-03 University Of Utah Research Foundation Methods and Compositions Involving Intrinsic Genes
ES2923759T3 (es) 2006-12-14 2022-09-30 Life Technologies Corp Aparato para medir analitos utilizando matrices de FET
US20100112590A1 (en) 2007-07-23 2010-05-06 The Chinese University Of Hong Kong Diagnosing Fetal Chromosomal Aneuploidy Using Genomic Sequencing With Enrichment
US8682597B2 (en) 2007-10-16 2014-03-25 Exxonmobil Research And Engineering Company Estimating detailed compositional information from limited analytical data
WO2009051842A2 (en) * 2007-10-18 2009-04-23 The Johns Hopkins University Detection of cancer by measuring genomic copy number and strand length in cell-free dna
US20100035252A1 (en) 2008-08-08 2010-02-11 Ion Torrent Systems Incorporated Methods for sequencing individual nucleic acids under tension

Also Published As

Publication number Publication date
CA2796578C (en) 2021-11-23
WO2011130751A1 (en) 2011-10-20
PL2558854T3 (pl) 2019-04-30
US20190078165A1 (en) 2019-03-14
US10047397B2 (en) 2018-08-14
US11377695B2 (en) 2022-07-05
EP2558854B1 (en) 2018-10-10
US20220333213A1 (en) 2022-10-20
CA2796578A1 (en) 2011-10-20
EP2558854A1 (en) 2013-02-20
US20130116127A1 (en) 2013-05-09
EP2558854A4 (en) 2014-03-05

Similar Documents

Publication Publication Date Title
ES2703769T3 (es) Biomarcadores de ácidos nucleicos en circulación asociados al cáncer de mama
US11326204B2 (en) Assays for single molecule detection and use thereof
US20210207130A1 (en) Methods and compositions for the making and using of guide nucleic acids
JP5236286B2 (ja) 肝線維症に関連する遺伝的多型、その検出方法および使用
EP1144684B1 (en) Enhanced sequencing by hybridization using pools of probes
GB2610100A (en) Antisense oligomers for treatment of non-sense mediated RNA decay based conditions and diseases
CA2801468C (en) Prostate cancer associated circulating nucleic acid biomarkers
JP2009521205A (ja) 発作に関連する遺伝的多型、その検出方法および使用
ABADÍA‐CARDOSO et al. Discovery and characterization of single‐nucleotide polymorphisms in steelhead/rainbow trout, Oncorhynchus mykiss
JP2010263894A (ja) 癌における治療標的
CA3050984A1 (en) Molecular subtyping, prognosis, and treatment of bladder cancer
US20220093208A1 (en) Compositions, methods, and systems to detect hematopoietic stem cell transplantation status
JP2009523006A (ja) 脈管疾患に関連する遺伝的多型、その検出方法および使用
AU2014317843A1 (en) Methods and kits for predicting outcome and methods and kits for treating breast cancer with radiation therapy
CA3231249A1 (en) Coronavirus rapid diagnostics
EP1476067A2 (en) Novel compositions and methods for cancer
JP2009523405A (ja) 冠動脈心疾患に関連する遺伝的多型、その検出方法および使用
AU2016224709B2 (en) Method for assisting in prognostic diagnosis of colorectal cancer, recording medium and determining device
US20240060136A1 (en) Methods for detecting and predicting grade 3 cervical epithelial neoplasia (cin3) and/or cancer
KR20220058616A (ko) 진단 염색체 마커
CA3186997A1 (en) Methods for detecting and predicting cancer and/or cin3
KR20050114099A (ko) 대장암 진단용 dna 칩
CN113817859B (zh) 用于小麦品种鉴定的mnp标记位点、引物组合物和试剂盒及其应用
US20080193935A1 (en) Detection of Dna Sequence Motifs in Ruminants
CUPERLOVIC-CULF i Patent Application Publication do Pub. No.: US 2011/0165582A1