CN1232504A - 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of order designated mononegavirales - Google Patents

3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of order designated mononegavirales Download PDF

Info

Publication number
CN1232504A
CN1232504A CN97198321A CN97198321A CN1232504A CN 1232504 A CN1232504 A CN 1232504A CN 97198321 A CN97198321 A CN 97198321A CN 97198321 A CN97198321 A CN 97198321A CN 1232504 A CN1232504 A CN 1232504A
Authority
CN
China
Prior art keywords
virus
leu
ile
ser
nucleotide
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN97198321A
Other languages
Chinese (zh)
Inventor
S·A·乌登
M·S·西迪
J·M·泰特姆
B·R·墨菲
V·B·伦道夫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Goverment Of United States, AS REPRESENTED BY SECRETARY D
Wyeth Holdings LLC
Original Assignee
Goverment Of United States, AS REPRESENTED BY SECRETARY D
American Cyanamid Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Goverment Of United States, AS REPRESENTED BY SECRETARY D, American Cyanamid Co filed Critical Goverment Of United States, AS REPRESENTED BY SECRETARY D
Publication of CN1232504A publication Critical patent/CN1232504A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/005Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N7/00Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2760/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
    • C12N2760/00011Details
    • C12N2760/18011Paramyxoviridae
    • C12N2760/18411Morbillivirus, e.g. Measles virus, canine distemper
    • C12N2760/18422New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2760/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
    • C12N2760/00011Details
    • C12N2760/18011Paramyxoviridae
    • C12N2760/18511Pneumovirus, e.g. human respiratory syncytial virus
    • C12N2760/18522New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2760/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
    • C12N2760/00011Details
    • C12N2760/18011Paramyxoviridae
    • C12N2760/18611Respirovirus, e.g. Bovine, human parainfluenza 1,3
    • C12N2760/18622New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes

Abstract

The invention discloses isolated, recombinantly-generated, attenuated, nonsegmented, negative-sense, single stranded RNA viruses of the Order Mononegavirales having at least one attenuating mutation in the 3' genomic promoter region and having at least one attenuating mutation in the RNA polymerase gene are described. Vaccines are formulated comprising such viruses and a physiologically acceptable carrier. The vaccines are used for immunizing an individual to induce protection against a nonsegmented, negative-sense, single stranded RNA virus of the Order Mononegavirales.

Description

In mononegavirale virales virus, cause 3 ' genomic promoter region of attenuation and the sudden change in the pol gene
Invention field
The present invention relates to mononegavirale virales (Order designated Mononegavirales), isolating, recombinate that produce, attenuation, Nonsegmented, negative justice (negative-sense), single strand RNA virus, it has the attenuation sudden change of at least one place in 3 ' genomic promoter region, have place attenuation sudden change in rna polymerase gene at least.The present invention has obtained the support of government, authorizes fund by Public Health Service.Government has certain right of the present invention.
Background of invention
The negative adopted single strand RNA virus of bag quilt is by the tissue of uniqueness and expression.The geneome RNA of this mononegavirale virus plays two kinds of template actions in virus nucleocapsid: the one, and as the template of synthetic messenger RNA(mRNA) (mRNA), it two is templates as synthetic anti-genome (+) chain.Negative adopted single strand RNA virus is encoded and is packed their RNA RNA-dependent polysaccharase.When virus is sloughed capsid in by the cell of its infection, only synthetic messenger RNA(mRNA).Virus replication occurred in after synthesizing of mRNA, and needed the synthetic continuously of viral protein.The new anti-genome of synthetic (+) chain plays a part further generation (-) geneome RNA copy template.
The polysaccharase mixture starts and implements by the cis acting signal in conjunction with (engage) genome 3 ' terminal (promoter region specifically) transcribes and duplicates.So the virogene uniaxially transcribes from genomic templates according to 3 ' to 5 ' direction.For its upstream neighbor (being nucleoprotein gene (N)), always less from the mRNA that downstream gene (as pol gene (L)) makes.So,, have the gradient of a mRNA abundance usually according to the position of the relative genome 3 ' of gene.
According to the classification that the international council of viral nomenclature rearranged in 1993, established an order, be called mononegavirale virales (Monogenavirales).This order comprises the enveloped virus of three sections, and strand, non-sections, negative polarity (negative justice) geneome RNA are all arranged.Described section is Paramyxoviridae, Rhabdoviridae and Filoviridae.Paramyxoviridae is further divided into two subfamilies again: paramyxovirus (paramytoxinae) and pneumonitis virus (pneumovirinae).The paramyxovirus subfamily contains three genus: paramyxovirus, rubella virus (Rubulavirus) and Measles virus (mobillivirus).The pneumonitis virus subfamily comprises pneumonitis virus and belongs to.
These two kinds of new classification are according to morphological criteria, virus genomic composition, proteinic biological activity and sequence relation.The distinctive morphological specificity of paramyxovirus subfamily is the size and the shape (diameter 18mm, long 1mm, pitch (pitch) 5.5nm) of its nucleocapsid in the enveloped virus, has the left hand helix symmetry.Biologic criteria is: the antigenic cross-reaction and 2 in 1) belonging between the member) paramyxovirus, rubella virus genus have the neuraminic acid enzymic activity, and Morbillivirus does not then have.In addition, also to consider the variation of P genes encoding potential (coding potential), because in rubella virus, there is an extra gene (SH).
Pneumonitis virus (pneumovius) can be distinguished with paramyxovirus (paramyxovirinae) on form mutually, because the former nucleocapsid is very narrow.In addition, the main difference of pneumonitis virus and paramyxovirus be the protein coding cistron number (have 10 in the pneumonitis virus, and paramyxovirus being 6) and with the visibly different attachment protein (G) of paramyxovirus.Though paramyxovirus and pneumonitis virus have 6 albumen (N, P, M, G/H/HN, F and L) seemingly corresponding on function, have only latter two albumen to show serial correlation tangible between two subfamilies.Some pneumonitis virus albumen does not have all corresponding parts of most of paramyxovirus, i.e. non-structural protein NS 1 and NS2, little hydrophobic proteins SH and secondary albumen M2.Some paramyxovirus albumen is C and V, lacks the corresponding part in the pneumonitis virus.But it is identical that the disease basi gene group of pneumonitis virus and paramyxovirus is formed.Rhabdovirus and Filovirus are too.Table 1 provided these three kinds of viruses name classification, and each example that belongs to.
Table 1
The classification of the non-sections of mononegavirale virales, negative adopted single strand RNA virus Paramyxoviridae
The paramyxovirus subfamily
Paramyxovirus genus
Sendai virus (mouse 1 type parainfluenza virus)
1 type and 3 type human parainfluenza viruses (PIV)
3 type bovine influenza viruses (BIV)
Rubella virus genus
SV 41 virus (SV) (2 type canine parainfluenza virus)
Mumps virus
New castle disease virus (NDV) (1 type avian paramyxoviruses)
2,4a and 4b type human parainfluenza virus
Morbillivirus
Measles virus (MV)
The dolphin Measles virus
Canine distemper virus (CDV)
For a short time ruminate beastly plague virus
The Phocine distemper virus
Rinderpest virus
The pneumonitis virus subfamily
Pneumonitis virus belongs to
Human respiratory syncytial virus (RSV)
Bovine respiratory syncytial virus
Mouse pneumonia virus
Turkey Coryzavirus (Turkey rhinotracheitis virus) Rhabdoviridae
Lyssavirus (Lyssavirus)
Rabies virus (Rabie virus)
Vesiculovirus genus
Vesicular stomatitis virus
Ephemeral fever virus belongs to (Ephemerovirus)
Bovine ephemeral fever virus Filoviridae
Filovirus belongs to
Marburg virus
For many above-mentioned viruses, also without any the vaccine that can get.So, be necessary to develop the vaccine that resists this class humans and animals pathogenic agent.This class vaccine should be able to cause protective immunological reaction in inoculator's body.The quality of this favourable reaction and quantity characteristics are to be inferred by the characteristics seen in the natural viral infection survivor, and such survivor generally speaking is not subjected to infection more identical or that high correlation is viral in the long duration afterwards.
Seeking to develop in the process of this class vaccine has several different methods to consider, comprises use: the single virus protein vaccine (subunit vaccine) of (1) purifying; (2) the totivirus preparation of deactivation; (3) attenuated virus of Huoing.
That the advantage of subunit vaccine is is pure, composition clearly and relatively easily ins all sorts of ways mass production, comprises the recombinant dna expression method.So far, except famous hepatitis B surface antigen(HBsAg), the viral sub-units vaccine generally only causes fugitive and/or inadequate immunizing power, especially in natural receptor.
The formalin deactivation preparation of full poliomyelitis (IPV) and hepatitis A virus is proved to be safe and effective.On the contrary, use similar inactivated whole virus immunity all to cause bad immune response and/or such reaction type, make vaccine inoculation person contact natural later on again or " wild-type " when virus, overreaction or abnormal diseases took place easily such as respiratory syncytial virus and Measles virus.
(1966) were in one's early years once attempted the RSV vaccine that the parenteral road gives the formalin deactivation and were inoculated to the child.Unfortunately, the unusual disease of serious characteristic (reference list number 1,2) has appearred having taken place behind the natural infection RSV of serious adverse effects-afterwards in the agent of this vaccine time test in place.Propose, the antigen of this formalin deactivation has caused unusual or unbalanced immune response, makes vaccine inoculation person to RSV disease-susceptible humans (3,4).
So, produced the attenuation candidate vaccine of living by cold going down to posterity (cold passage) or chemical substance mutagenesis.These RSV strains are found in virulence attenuation of among the seropositivity grownup.Unfortunately, when giving seronegative baby, it is excessive or not enough that they are found attenuation; Sometimes, they also are found and lack genetic stability (5,6).Another kind of parenteral road gives the inoculation method of live virus because of invalid the termination (7).It should be noted that these live RSV vaccines never with disease increase the weight of relevant, not as above-mentioned observed with the RSV vaccine of formalin deactivation.At present, though utilize coldly go down to posterity, the RSV virus of A 2 of chemomorphosis and the present well afoot of clinical experiment that the B-1 strain carries out, also be not approved for human RSV vaccine so far.
The derived virus alive of the suitable attenuation of wild virus warp provides the outstanding advantage as candidate vaccine.As the replicability factor of living; they cause infection in inoculator's body; during this time; the virogene product is expressed, is processed and offered with inoculator's specificity MHC I and II type molecule; induce body fluid and cell-mediated immune response thus; and synergetic property cytokine pattern, the latter is identical with natural infection survivor's protective immunity type.
This favourable immune response pattern is opposite with the limited response that has deactivation or subunit vaccine to excite, and deactivation or subunit vaccine mainly are confined to humoral immunization attenuation system usually.And; the whole virus vaccine of some formalin deactivation (for example Measles virus of developing the sixties and respiratory syncytial virus vaccines) inductive immunne response not only can not provide persistent protection, and unusual, hyperreactive or mortality disease takes place easily when in fact causing vaccine inoculation person to contact wild-type virus later on.
Though the attenuated virus of living has fabulous characteristic as candidate vaccine, they are proved to be and are difficult to development.The focus of difficulty is to isolate a kind of like this derived virus of wild-type strain, and it has been lost pathogenic (being virulence) but has kept enough replicatioies, to infect the inoculator and to cause enough strong required type of immune response.
In history, the fine balance between this virulence and the attenuation once reached like this, even the strain isolated of a wild-type virus is following to different host tissues or cell continuous passage at different growth conditionss (for example temperature).This process helps the growth that virus becomes strain (mutant strain) by inference, and some becomes strain and has good attenuation feature.Once in a while, also can realize further attenuation by chemomorphosis.
This propagation/the scheme that goes down to posterity cause usually occurring to virus temperature sensitive, acclimatization to cold and/or that host range changes derive strain-they one or the variation that is different from the wild-type pathogenic virus-promptly may the variation relevant with attenuation is arranged all.
Utilize this method to produce several live-virus vaccines, comprised vaccine prevention measles and parotitis (paramyxovirus) and opposing poliomyelitis and rubella (positive chain RNA virus), and become the main means of world today's immunization programs for children.
But the method for this generation attenuated live virus candidate vaccine expends time in, and, good again, be still unpredictalbely, mainly rely on those genome mutation strains that take place at random of selecting growth fast with required attenuation characteristic.Perhaps, the virus of gained has required phenotype external, even also to show as in animal model be attenuation.But in the human or animal host of hope used as candidate vaccine, all attenuation deficiency or attenuation are excessive often again for they.
Even using vaccine, still need more effective vaccine at present.For example, present Measles Vaccine provides goodish provide protection.But popular the showing of measles has defectiveness on the vaccine potency now recently.Although mother has been carried out immunity, the ratio that acute measles infection takes place one-year-old following child is still very high, and this shows that vaccine can not induce and the wild-type measles infects the suitable anti-measles antibody level (8,9,10) of institute's inductive.So the passive antibody that can not provide competent placenta to shift for the baby through mother of vaccine immunity is to protect the newborn infant in the follow-up continuation of insurance of a some months of birth.
The acute measles of accepting in the past to take place among immune teenager and the youth infects the new problem of having pointed out.The inefficacy of these secondary vaccines shows, existing vaccine has limitation (11,12,13) inducing and keep on the ability of abundant and persistent antiviral provide protection.Recently, another potential problems have been found again.In the past 15 years of the hemagglutinin of isolating wild-type Measles virus show and vaccine strain (14) distance more and more far away.This " antigenic drift " proposed a problem, and promptly the ideal antigenic characteristic that provides best protection required may be provided vaccine strain.So, need improved vaccine.
Rational vaccine design must specifically, promptly determine the evaluations that base and those genomes that cause attenuation change by means of the virulence to encoding viral by means of these vaccines are better understood.
The present invention's general introduction
Given this, one of the object of the invention is to identify that those cause the sudden change zone of viral attenuation in the mononegavirale virales rna virus cdna group.
Another object of the present invention is that reorganization is created in the virus of having mixed described attenuation sudden change in the genome.
Another object of the present invention is to make the vaccine preparation that contains attenuated virus.
Hereinafter will discuss above other purpose that reaches of the present invention, these purposes will be that produce with the reorganization that separates the mononegavirale virales, attenuation by producing, non-segmental, negative adopted single strand RNA virus reach, and the attenuation that such virus has at least one attenuation that is positioned at 3 ' genomic promoter region sudden change and at least one rna polymerase gene suddenlys change.
With the Measles virus is example, the attenuation of at least one sudden change is selected from the 3 ' genomic promoter region: and Nucleotide 26 (A → T), Nucleotide 42 (A → T or A → C) and Nucleotide 96 (G → A), above all is present on the normal chain with described other Nucleotide of the application (unless otherwise mentioned), being anti-genomic, promptly is signal (coding) meaning; And the attenuation of at least one sudden change is selected from the Nucleotide change that can cause following amino acid change in the rna polymerase gene, and described amino acid change is selected from: residue 331 (Isoleucine → Threonine), residue 1409 (L-Ala → Threonine), residue 1624 (Threonine → L-Ala), residue 1649 (arginine → methionine(Met)), residue 1717 (aspartic acid → L-Ala), residue 1936 (Histidine → tyrosine), residue 2074 (glutamine → arginine) and residue 2114 (arginine → Methionin).
With the human 3-type parainfluenza virus is example, and the attenuation of at least one sudden change is selected from the 3 ' genomic promoter region: and Nucleotide 23 (T → C), Nucleotide 24 (C → T), Nucleotide 28 (G → T) and Nucleotide 45 (T → A); And the attenuation of at least one sudden change is selected from the Nucleotide change that can cause following amino acid change in the rna polymerase gene, and described amino acid change is selected from: residue 942 (Methionin → Histidine), residue 992 (leucine → phenylalanine), 1292 (leucine → phenylalanines) and residue 1558 (Threonine → Isoleucine).
With B subgroup human respiratory syncytial virus is example, and the attenuation of at least one sudden change is selected from the 3 ' genomic promoter region: Nucleotide 4 (C → G) and insert an extra A in the continuous A of Nucleotide 6-11 position; And the attenuation of at least one sudden change is selected from the Nucleotide change that can cause following amino acid change in the rna polymerase gene, described amino acid change is selected from: residue 353 (arginine → Methionin), 451 (Methionin → arginine), 1229 (aspartic acids →), 2029 (Threonine → Isoleucines) and 2050 (l-asparagine → aspartic acids).
In another embodiment of the present invention, attenuated virus is used to prepare vaccine, and described vaccine can excite the protective immune response of the viral wild-type of opposing.
In another embodiment of the present invention, isolating, normal chain, an anti-genome courier nucleic acid molecule (or isolating, minus strand genomic nucleic acids molecule) with intact virus nucleotide sequence (wild-type or through non-recombination method attenuated virus) is processed, promptly introduce the described attenuation sudden change of one or more the application, to produce an attenuated virus isolating, that reorganization produces.Then such virus is used for preparing vaccine, described vaccine can excite the protective immunological reaction of antiviral wild-type.
In another embodiment of the present invention, such one section complete wild-type or vaccine virus nucleotide sequence are used to: (1) design PCR primer is used for the existence of corresponding virus in PCR test test sample; Or (2 designs and selection peptide are used for the existence of corresponding virus in the ELISA test sample).
The accompanying drawing summary
Fig. 1 has described the history that goes down to posterity of Edmonston Measles virus (15).The abbreviation meaning is as follows among the figure: HK-people's kidney; HA-people's amnion; CE (am)-Embryo Gallus domesticus; The CEF-chick embryo fibroblast; DK-dog kidney; The WI-38 human diploid cell, SK-sheep kidney; *-the plaque clone.Numbering immediately following each abbreviation is represented passage number.
Fig. 2 has described the Genome Atlas of Measles virus, has shown that one is positioned at and the cis acting controlling element of inferring of and anti-genome end terminal near genome.The Genome Atlas of this figure top-Measles virus, start from 3 ' end 52 Nucleotide leader sequence (1), end at 5 ' end 37 Nucleotide tailer sequence (t).Represent the gene border with vertical line; It under each gene the few nucleotide of cistron.This figure bottom-genome and anti-genome 3 ' end extends the enlarged view of promotor, shown position and the sequence of two sections high conservative zone A and B.Marked simultaneously and inserted intergenic trinucleotide.According to hypothesis, the new life 5 ' RNA that comprises A ' and B ' district contains regulating and controlling sequence and cause N albumen package action in this sequence.
Fig. 3 has described the gene mapping (last figure) of the B subgroup wild-type strain of the RSV that is called as 2B and 18537, and overlapping (figure below) of 68 Nucleotide arranged between the intergenic sequence of strain (middle figure) and M2 gene and the L gene.RSV 2B strain is compared few 6 Nucleotide in the G gene with 18537 strains, 2 amino-acid residues of therefore encoding less in G albumen.The 2B strain has 145 Nucleotide in 5 ' tail region, by contrast, there are 149 Nucleotide in this district of 18537 strains.Compare with 18537 strains, the 2B strain is each many Nucleotide in NS-1, NS-2 and N gene, and each lacks a Nucleotide in M and F gene.
Detailed description of the present invention
By the enzymic activity of a multimeric protein, obtained that negative adopted single strand RNA virus is genomic to be transcribed and duplicate to nucleoglucoprotein core (nucleocapsid) effect.Exposed geneome RNA can not be as template.But these genome sequences only just can be identified when being wrapping in the capsid structure fully by N albumen.This only occurs in following condition, and promptly the terminal promoter sequence of genome and anti-genome is identified and excites and transcribe or duplicate path.
For the carrying out of above-mentioned polysaccharase path, all paramyxovirus all need two kinds of viral proteins, L and P.In order to transcribe effectively carrying out of path, comprise that the pneumonitis virus of RSV also needs transcriptional elongation factor M2.Other cofactor also may work, and perhaps comprises the NS1 and the NS2 albumen of encoding viral, and host cell encoded protein matter.
But, considerable evidence shows, L albumen is carrying out great majority (if not full agent) and is transcribing and duplicating relevant enzymic process just, comprise the initiation and the termination of ribonucleotide polymerization, adding of mRNA transcription product emitted and polyadenylation, methylates and the proteic specificity phosphorylation of (perhaps) P.It is big that L albumen central role in genome is transcribed and duplicated has obtained its volume, to the support (16) responsive and its competent catalytic capability in the viral complex body of transcriptionally active that suddenlys change.
Following proposal has been drawn in above-mentioned consideration, and L albumen is made of a series of linearly aligned structural domains, and this cascaded structure has been incorporated into together (17) with each dispersive function.In fact, according to the dependency of the clear and definite proteinic functional domain of other characteristic, having identified that in the negative-sense viral L albumen 3 are this has demarcation (delimitad) dispersive element.They comprise: identification of RNA template and/or the phosphodiester bond that infer (1) form structural domain; (2) RNA binding member and (3) ATP binding domains.These functional element of inferring (17) have been disclosed for the negative justice of non-sections, proteic all the previous researchs of single strand RNA virus L.
Be not subjected to the constraint of the following stated, can reasonably suppose, the important determinative that these non-encoding histones, promotor and other adjusted and controlled territory of cis acting genome are duplicating efficiencies, transcribe and duplicate relevant with L albumen is accomplished because the Measles virus (MV) of these factor mononegavirale viraleses is with other virus, so they may be again these viral virulence determinatives.
Generally speaking, the present invention has been considered to comprise the one group of synergetic property that is positioned between cis acting adjustment signal (3 ' genomic promoter region) and the pol gene (L) and has changed, and these variations cause viral attenuation to make virus keep enough replicatioies simultaneously.Attenuation suddenlys change by the reasonableness of 3 ' genomic promoter region and pol gene and optimizes, such sudden change provides duplicating efficiency ideal balance: therefore, virus vaccines can not reproduce into disease, but still keep the ability that infects vaccine inoculation person's cell, to express that enough abundant gene product excites comprehensively and to produce the ideal immunne response of type and produce again and enlarge immunne response that it was caused greatly at utmost entirely.
Be not subjected to the constraint of the following stated, the attenuation sudden change in promotor of extension (3 ' genomic promoter region) and the pol gene it is believed that the effect that affects the cis acting signal and in conjunction with the conformation of the polysaccharase mixture of these signals.For example, be curled into helical pattern after promotor RNA is wrapped.Variation in the promoter sequence may have influence on the relative position of using the conservative signal that is relative to each other.Specifically, Measles virus wild-type 3 ' genomic promoter region respectively has a pyrimidine (uridylic) (anti-genome courier sequence is VITAMIN B4 at this) at the 26th and 42.And vaccine strain is that (anti-genome courier sequence correspondingly is pyrimidine to purine in described position; Referring to the table 3 of embodiment 1 hereinafter).The purine that volume is bigger may change distance and/or the angle between the promotor conserved domain (for example in Measles virus, position 1-11 and position 87-98), causes the change of cis acting signal three-dimensional conformation when polysaccharase is offered.
Zooscopy shows that the minimizing of virus replication will be enough to avoid disease but also be enough to excite required immunne response.The minimizing of duplicating may have been represented the minimizing of transcribing, the minimizing of the protein expression of encoding viral, and the minimizing of antisense template is so produced less new genome.The attenuated virus of gained is compared with wild-type, and its virulence significantly reduces.
By following two kinds of methods virus strain is introduced in attenuation sudden change as herein described:
(1) ordinary method, for example carry out chemomorphosis at virus growing period in the cell culture that is added with chemical mutagen, select the virus that under the suboptimal temperature degree, easily goes down to posterity to select the sudden change of temperature sensitive and/or acclimatization to cold, evaluation produces the mutated viruses of little plaque in cell culture, and goes down to posterity by heterologous host and to select host-range mutant.In animal model, filter out the virus that biological activity weakens then.Attenuated virus is carried out the nucleotide sequencing of its 3 ' genomic promoter region and pol gene, to search the position of attenuation sudden change.In a single day above process is finished, then manner of execution (2):
(2) the better method of introducing the attenuation sudden change comprises and adopts site-directed mutagenesis to make predetermined sudden change.The closely related virus of the known attenuation sudden change of these sudden change employing methods (1) or reference is identified.In 3 ' genomic promoter region and pol gene, respectively introduce one or more sudden changes.Also can estimate the storage effect of the various combination of coding and non-code change.
By the standard recombinant dna method, virus genomic DNA copy is introduced in the sudden change in 3 ' genomic promoter region and the pol gene.This may be wild-type or the genetic background of modifying C-type virus C (for example virus of modifying by method (1)), produces new virus thus.Produce infectious clone or the particle that comprises these attenuation sudden changes with cDNA " rescue " system, this system has been used to multiple virus, comprising Sendai virus (18); Measles virus (19); Respiratory syncytial virus (20); Rabies virus (21); Vesicular stomatitis virus (VSV) (15); And rinderpest virus (23); This paper quotes also with reference to above reference.Relevant Measles virus is saved system, referring to the disclosed International Patent Application WO 97/06270 (24) of the U.S.; Relevant PIV-3 saves system, referring to U.S. Provisional Patent Application 60/047575 (25); Relevant RSV saves system, referring to the disclosed International Patent Application WO 97/12023 (26) of the U.S.; These applications are included this paper in as a reference.
In brief, all mononegavirale virus rescue systems can be summarized as follows: all need portion to be equivalent to the genomic clone's of intact virus DNA, this genome is positioned between suitable dna dependent rna polymerase promoter (for example t7 rna polymerase promotor) and self the cutting ribozyme sequence (for example hepatitis δ ribozyme), and this part clone's DNA is inserted in the fertile bacterial plasmid.This transcription vector provides the dna profiling of easy handling, and RNA polymerase (for example t7 rna polymerase) can verily be transcribed down the single stranded RNA copy with accurate or intimate viral anti-genome (or genome) of 5 ' and 3 ' end accurately from this template.The orientation of genomic dna copy and flank promoter sequence and ribozyme sequence has determined whether anti-genome or genomic RNA equivalents transcribe.The filial generation of rescue new virus also needs virus-specific trans-acting albumen, be about to exposed strand anti-genome of virus or geneome RNA transcription product and wrap up the required albumen of into functional nucleocapsid template, viral nucleocapsid protein (N or NP), the relevant phosphoric acid albumen (P) of polysaccharase and polysaccharase (L) albumen are arranged.These albumen comprise active viral RNA RNA-dependent polysaccharase, and the latter must syncaryon capsid template transcribe and duplicate with acquisition.
The trans-acting albumen that the Measles virus rescue is required is parcel albumen N and polysaccharase complex proteins, P and L.The parcel albumen of PIV-3 is called NP, and the polysaccharase complex proteins is also referred to as P and L.With regard to RSV, virus-specific trans-acting albumen comprises N, P and L, adds another albumen M2 (transcriptional elongation factor of RSV coding).
Usually, these viral trans-acting albumen are to be produced by the material expression vector of one or more coding desirable proteins, though all or part of required trans-acting albumen can produce in genetic engineering modified mammalian cell, these cells contain as stable transformant and express these virus-specific gene and gene products.
Typical rescue environment (but definitely not getting rid of other) comprises suitable mammalian cell environment, wherein has the T7 polysaccharase to drive the transcription vector of self-contained viral genome cDNA, transcribes anti-genome (or genome) single stranded RNA.When transcribing, perhaps be right after thereafter, the anti-genome of this virus (or genome) rna transcription product is wrapping in the functional template by nucleocapsid protein, and by required polysaccharase component institute combination, this polysaccharase component is produced simultaneously from the proteic cotransfection expression plasmid of required virus-specific trans-acting of encoding.These incidents and process have caused as the transcribing of the virus mRNA of prerequisite, and newly genomicly duplicate and increase, and produce new daughter of virus thus, promptly obtain rescue.
With regard to the rescue of mad dog, VSV and Sendai virus, the T7 polysaccharase is provided by recombinant vaccinia seedling diseases poison VTF7-3.But, this system requirements, the virus of being rescued must be by physics and biochemical method or by going down to posterity repeatedly to separate with virus vaccinicum in the cell and the tissue that are not the good host of poxvirus.With regard to the rescue of MV cDNA, avoided above-mentioned requirements by producing an expression T7 polysaccharase and virus N and the proteic clone of P, in auxiliary cell line by genome expression vector and the transfection of L expression vector have been realized being rescued.The advantage of virus vaccinicum host range mutant MVA-7 can be expressed t7 rna polymerase but not duplicate in mammalian cell, is used to save RSV, rinderpest virus and MV.After essential parcel albumen was expressed simultaneously, the anti-genome virus RNA of synthetic total length was wrapped, duplicates and transcribed by varial polymerases albumen, and the genome after duplicating is wrapped in the infective virus particle.Except that described anti-genome, now, the genome analogue of Sendai virus and PIV-3 is also by successfully rescue (25,27).
The rescue system provides a kind of composition thus, it comprises a transcription vector, this carrier comprises an isolated nucleic acid molecule, the genome or the inverted defined gene group of a non-sections of this nucleic acid molecule encoding mononegavirale virales, negative adopted single strand RNA virus, they have at least one attenuation sudden change and at least one attenuation sudden change in rna polymerase gene in 3 ' genomic promoter region; Together also have at least one expression vector, this carrier comprises at least one isolated nucleic acid molecule, this nucleic acid molecule encoding parcel, transcribes and duplicate necessary trans-acting albumen (for example N of Measles virus, P and L; The P of PIV-3 and L; The N of RSV, P, L and M2).Transform or transfection host cell with at least two above-described expression vectors then, host cell is cultivated under the condition that allows these carrier co expression, to produce the infectivity attenuated virus.
The infective virus of being rescued is then at first tested its required phenotype (temperature sensitivity, acclimatization to cold, plaque form and transcribe and duplicate weaken) by in vitro method.Adopt minimum replicon system that the sudden change that is positioned at cis acting 3 ' genomic promoter region is also tested, wherein required trans-acting parcel and polymerase activity are provided by wild-type or vaccine helper virus, or the N, P gene that comprise gene specific attenuation sudden change by expression and different L gene plasmid provided.
There is the attenuation phenotype in virus if be rescued, and just attacks experiment with suitable animal model.Non-human primate is the preferred animal model of research human disease pathogenesis.These primates are at first used the recombinant virus immunity of attenuation, attack with the wild-type of virus then.Adopt number of ways to include but not limited to that nose is interior, tracheae is interior and subcutaneous vaccination is infected monkey, (29).The rhesus monkey of experimental infection and cynomolgus macaque are also as the vaccine-induced anti-measles provide protection (30) of Research of Animal Model for Study.Weigh provide protection by following standard: disease S﹠S, survival rate, virus are discharged (shedding) and antibody titer.If satisfactory standard, then this attenuation recombinant virus considers that can be used as candidate vaccine is used for the human test.The virus of " being rescued " is considered to " reorganization produces ", and filial generation that this is viral and offspring also are that so they have all introduced the attenuation sudden change.
Even can be used as the optimum level of vaccine relatively, the attenuation deficiency or the attenuation of " being rescued " virus are excessive, and this is still valuable information to developing best vaccine strain.
Best, the codon that contains the attenuation point mutation adds the 3rd sudden change by introducing second or second in this codon and comes stabilization, and does not change by the coded amino acid of the codon that only contains the attenuation point mutation.The infective virus clone who contains the sudden change of attenuation and stabilization also uses above-mentioned cDNA " rescue " system to produce.
Measles virus is the useful model of the present invention, because as described herein, and the sequence information of its pathogenic wild virus and proved that in history the sequence information of effectively anti-disease vaccine now obtains.
Measles virus was named as to separate infected patient's the tissue culture of David Edmonston from one early than 1954 and obtains (31).This Edmonston strain of Measles virus becomes the ancestors of many attenuated live Measles Vaccines, comprises Moratan, and it is the present vaccine (Attenuvax of the U.S. TMMerck Shap ﹠amp; Dohme, WestPoint, PA) effective in nineteen sixty-eight approval and proof.
The positive immune programme for children of determining in the middle and later periods sixties has caused the measles case reported rapidly to drop to 1500 of nineteen eighty-three by nearly 700,000 of nineteen sixty-five.Simultaneously, also developed other vaccine strain (Fig. 1) by the Edmonston strain, and Schwarz (Institut Merieux, Lyon, France), Zagreb (Zagreb, Yugoslavia) and AIK-C (Japan).These other vaccines also prove effectively and are extensive use of.In one's early years, the insufficient vaccine strain (Rubeovax of reactionogenicity attenuation TM: Merck Sharp ﹠amp; Dohme) in children, caused measles sample disease, its use thereby be interrupted.But, it through successfully further attenuation produced Moraten vaccine strain (see figure 1) (32).The measles virus vaccines that live provide the once successful experience of exploitation effective vaccine, and provide model for the virus vaccines attenuation molecule mechanism of understanding the negative adopted single strand RNA virus of non-sections.
Because as this importance of human morbidity and main cause of death, Measles virus (MV) is widely studied.MV be a kind of big, be close to the spheric bag by particle, it is made up of two components: lipoprotein membrane and ribonucleoprotein core, the distinctive biological functions of they each tools (33).The capsid of virion is a host cell deutero-cytoplasmic membrane, through three kinds of virus-specific albumen: hemagglutinin (H; About 80 kilodaltons (kD)) and fusion glycoprotein (F 1,2About 60kD; They are given prominence on the surface of virion and give the ability that virion adheres to and enter host cell) modification (16).Be considered to protectiveness at the antibody of H and/or F, virus causes the ability (34,35,36) that infects because they can neutralize.Stromatin (M, about 37kD) is the both sexes albumen that is arranged in the internal surface of film, thereby thinks duplicate (37) that virus takes place perfect the form of its perfect virion.The core of virion contains the geneome RNA of long 15,894 Nucleotide, and geneome RNA template activity (38,39,40) has been given in it and tight associating of nucleocapsid protein (N) of about 60kD of 2600 molecules.With it be about 1 micron volution ribonucleoprotein particle losse associations be the viral RNA RNA-dependent polysaccharase (L that enzymic activity is arranged; About 240kD), it and polysaccharase cofactor (P; About 70kD), may also have other virus-specific and the cooperation of host-encoded protein, transcribe and duplicate MV genome sequence (41).
So far, complete nucleotide sequence (only limiting to Edmonston B laboratory strain and AIK-C vaccine strain), coding potentiality and the genomic composition of MV (33) have been reported.6 kinds of virion structural protein are by 6 nonoverlapping coded by said gene of adjoining, and they arrange as follows: 3 '-N-P-M-F-H-L-5 '.Two other MV gene product that present function it be unclear that has also been made evaluation.These the two kinds Nonstructural Proteins that are known as C (about 20kD) and V (about 45kD) are all by the P genes encoding, and the former is by secondary reading frame coding among the PmRNA; The latter is by corotation record editor's P gene deutero-mRNA coding, and this mRNA coding has the hybridization albumen (16) that P N-terminal sequence and new zinc fingerprint sample are rich in the C-terminal structural domain of halfcystine.
Except the proteic sequence of coding virus-specific, the MV genome also contains unique non-encoding histone structural domain, and these structural domains are transcribed similar with the structural domain that duplicates path (16,42) to the guidance of correlated virus.
These adjustment signals are positioned at MV genomic 3 ' and 5 ' end, and between each cistron of cross-over connection in the short transcribed spacer on border.The former coding instructs that genome is transcribed, genome and anti-genome parcel and the promotor of inferring and/or the regulating and controlling sequence element that duplicate.The latter sends signal and makes Transcription Termination and carry out the polyadenylation of each monocistron (monocistronic) virus mRNA, and then starts next gene transcription.In general, as if though the RNA RNA-dependent polysaccharase of other non-segmented negative-strand RNA viruses is arranged, MV polysaccharase mixture is also to these signals react (16,42,43,44).
Transcription initiation in MV genomic 3 ' terminal or its near, carry out to 5 ' direction then, form monocistronic mRNA (40,42,45).Along with polysaccharase moves along the horizontal of MV genomic templates, it and stopping/the start signal experience of inferring, these signals by 3 ' to 5 ' order are: partly conservative Transcription Termination/polyadenylation signal (A/GU/G UA A/U NN A 4, wherein N can be any in 4 kinds of bases), each monocistron RNA finishes at this; Trinucleotide punctuate mark (CUU between the gene of not transcribed; But at H: it is CGU for the L border); Start half required conservative start signal (AGG A/G NN C/AAA/GGA/U, wherein N can be any in 4 kinds of bases) (45,46) with next genetic transcription.Because some polysaccharase mixture can not restart, the abundance of various MV mRNA is along with encoding gene reduces away from genome 3 ' end.This mRNA gradient is directly corresponding with the proteic abundance of each virus-specific.This shows that the MV protein expression finally is controlled (44) on transcriptional level.
3 ' and 5 ' MV genome end contains non-encoding histone sequence, has unique similarity (42) with the leader of SVS and tail RNA coding region.Nucleotide 1 to 55 has been determined the zone between genome 3 ' end and N gene starting point, simultaneously finds 37 additional nucleotides between the terminal point of L gene and genome 5 ' end.But, do not resemble VSV, even do not resemble Sendai virus and the NDV of paramyxovirus, MV is not transcribed into these end region (+) short, unmodified or (-) adopted leading RNA (47,48,49).But, be transcribed into leading company and read transcription product, comprise total length polyadenylation leader: N, leader: N: P, leader: N: P: M also has the anti-genome MV of total length RNA (48,49) certainly.So in MV, decision starts the leading transcription product of weak point that VSV strand, negative polarity genome are transcribed into reproduction switch, i.e. operating element (50,51,52) that seemingly do not have.This has caused the consideration of other pattern of this important reproduction process and exploration (42).
It seems that Measles virus and all other mononegavirale viruses except that rhabdovirus all prolong the restriction (42) that its terminal adjusted and controlled territory is not subjected to its leader and tail region encoding sequence.These zones of Measles virus comprise 107 3 ' end genome nucleotides (" 3 ' genomic promoter region ", claim again " prolongation promotor ", it comprises 42 Nucleotide of the leader of encoding, thereafter 3 genes are Nucleotide at interval, 52 Nucleotide with 5 ' the end non-translational region of coding N mRNA) and 109 5 ' terminal nucleotides (Nucleotide of 3 ' the end non-translational region of 69 coding L mRNA, gene is the Nucleotide of trinucleotide and 37 tail regions of encoding at interval).The total nucleotide sequence that two short districts are all arranged in about 100 Nucleotide of said gene group and anti-genome 3 ' end: 14 in 16 Nucleotide of genome and anti-genomic absolute 3 ' end is identical.Be positioned at these ends, also have the zone of 12 definitely identical Nucleotide of another sequence.Their position just or transcribe near the MV genome and to begin the position that begins certainly with anti-genome duplication certainly, show that these short sequence domains of holding comprise a prolongation promoter region.
These sequential elements that separate can be arranged the transcription initiation site at N gene initiation site place-internal structure territory instruction transcription initiation site among both and anti-genomic generation (42,48,53) is instructed in 3 ' end structure territory.Except their regulating and controlling effects as the cis acting determinative of transcribing and duplicating, encoding respectively new life's 5 ' end of anti-genome and geneome RNA of genome that these 3 ' ends prolong and anti-genomic promoter region.Exist the required signal of still undetermined N albumen nucleogenesis in these nascent RNAs, it is required and after this amplify and transcribe and duplicate another required crucial controlling element promptly to form the nucleocapsid template.Fig. 2 has shown the position and the sequence in the adjusted and controlled territory of cis acting of inferring of these high conservatives.
In other member's genome of paramyxovirus genus (Paramyxoviridae), exist the similar terminal non-encoding histone zone of position, size and spacing, though have only 8 to 11 identical (42,54) with MV in their the absolute ends Nucleotide.Morbillivirus rabies virus (canine distemper virus) genome end (CDV) shows the homology with its relatives MV higher degree: the Nucleotide of two virus leader sequences and tailer sequence 73% is identical, comprises 17 (55) in absolute 3 ' terminal 18 Nucleotide 16 and 18 Nucleotide of 5 ' end.Still do not find to prolong the attached inner CDV genome structure territory that promotor has homology with MV.But, between the genomic Nucleotide 85 of CDV and 103, the section of 20 Nucleotide of a segment length being arranged between 15,587 to 15,606,15 in these 20 Nucleotide is complementary (Gene Bank accession number AF 14953).This shows that CDV is the same with MV, contains another zone at its non-coding 3 ' genome and anti-genome end, and this zone may provide important cis acting promotor and/or adjustment signal (55).
In addition, the exact length (55 Nucleotide) of several members (MV, CDV, PIV-3, BPV-3, SV and NDV) 3 ' leader is identical in the Paramyxoviridae.Other evidence of relevant these prolongations, non-protein-coding region importance is from the analysis to a large amount of different copy protections (copy-back) defective interferingviruses (DI), and these viruses are separated to from subacute sclerosing panencephalitis (SSPE) cerebral tissue recently.Do not find that the stem district is shorter than the DI of 95 5 ' terminal gene group Nucleotide.This shows that MV DI rna replicon and the required minimum signal of parcel are longer than the long tailer sequence of 37 Nucleotide, comprise another adjusted and controlled territory, inside of inferring (56) and extend to.
Be example to a certain extent with the Measles virus, the present invention is directed to such notion, promptly important virulence/attenuation decision base is present in the trans-acting that must act on it with these cis-acting elements in the non-encoding histone regulation and control of viral genome the zone in transcribes/the replicative enzyme mixture in.MV genomic 3 ' and 5 ' end have all been found the cis acting structural domain, are positioned at 6 both sides of adjoining gene of coding virus structural protein; And be present in the MV genome as the short district that comprises border between internal gene.The former coding instructs that genome is transcribed, genome and anti-genome parcel and duplicate etc. significant process, infer promotor and/or regulating and controlling sequence element.The latter sends the signal of each monocistron virus mRNA Transcription Termination and polyadenylation, and then starts next gene transcription.Transcribe/replicative enzyme, promptly RNA RNA-dependent polysaccharase molecule can regulate transcribe and/efficient of duplicating, so determining the abundance of cytopathogenic effect virogene product and/or virion filial generation.
The present invention for the evidence of this viewpoint of Measles virus available from having determined Edmonston wild-type MV strain isolated for generations and the derive nucleotide sequence (referring to Fig. 1) of coding region (and aminoacid sequence of predicting) of the non-coding control region (3 ' genomic promoter region territory) of the Measles Vaccine strain that obtains and L gene of strain isolated thus first.Also other wild-type strain isolated is independently detected, for relatively.
The nucleotide sequence of vaccine measles strain in 4 kinds of wild-types and 5 (in normal chain, anti-genome, the messenger strand), and the putative amino acid sequence of the RNA polymerase of these Measles viruss (L albumen) is as described below, numbers (SEQ ID NO) referring to the corresponding sequence of this paper: Virus Nucleotide sequence The L protein sequence Wild-typeEdmonston SEQ ID NO:1 SEQ ID NO:21977 SEQ ID NO:3 SEQ ID NO:41983 SEQ ID NO:5 SEQ ID NO:6Montefiore SEQ ID NO:7 SEQ ID NO:8 vaccine RubeovaxTM SEQ ID NO:9 SEQ ID NO:10Moraten SEQ ID NO:11 SEQ ID NO:12Zagreb SEQ ID NO:13 SEQ ID NO:14AIK-C SEQ ID NO:15 SEQ ID NO:16
15,894 Nucleotide of each Measles virus gene group leader listed above.The translation of L gene starts from the codon at Nucleotide 9234-9236 place; Translation stop codon is positioned at Nucleotide 15783-15785.Translate long 2,183 amino acid of L albumen.
Notice that the Nucleotide 2499 of 1983 wild-type Measles viruss is expressed as " G " in SEQ ID NO:5.In fact, this base is that " G " and " C " use with.In addition, note Rubeovax TMThe Nucleotide 2143 of vaccine virus is expressed as " T " in SEQ ID NO:9.In quilt 9 clones that check order, this base of 7 clones is " T ", and 2 is " C "; So this base can be " T " or " C ".
In addition, except at Nucleotide 4917 and 4924 Schwarz of place being " C " but not " T ", Schwarz vaccine virus genome is identical with Moraten vaccine virus genome (SEQ ID NO:11).
Then, the nucleotide difference and the L gene and the Nucleotide and the amino acid difference of L protein sequence of distinguishing them of 3 ' end genomic promoter region of distinguishing the wild virus of Edmonston wild-type strain isolated, vaccine strain and other independent separate is compared and arranges (referring to the table 3 of embodiment 1 hereinafter to 5).
As shown in table 3, the sudden change (in anti-genome, messenger strand) of 3 places from the wild for generations MV strain isolated and vaccine strain 3 ' the end genomic promoter region of deriving arranged: at Nucleotide 26 places, " A " becomes " T "; Nucleotide 42 places, " A " become " C " or " A " becomes " T "; Only in Zagreb, Nucleotide 96 places, " G " becomes " A ".In addition, other tested wild-type strain isolated is " A " but not " G " at Nucleotide 50 places, and is therefore all different with wild for generations strain isolated and vaccine strain.
Measles Vaccine strain (Rubeovax TM, Moraten, Schwarz, AIK-C and Zagreb) with the predicted amino acid sequence of wild-type strain isolated (1977,1983 with Montefiore) L gene and wild for generations strain isolated (Edmonston) 49 places different (referring to hereinafter table 4 and the table 5 of embodiment 1) are arranged in long 2183 amino acid whose opening code-reading frames.
These amino acid differences can be divided into 4 classes:
(1) a certain vaccine strain is different from for generations, and the position that is different from other vaccine and wild strain, points out a possible attenuation site.
(2) specific differences between all wild-types and all the vaccine sequences; These also may constitute important attenuation site.
(3) newer wild-type is different from the residue of older wild-type on the time; They may cause the heredity drift.
(4) one or more vaccine strains and/or wild-type strain have the total amino acid whose position that is different from other toxic strain; These variations may have been represented the dependency between the pedigree specificity in the vaccine strain, the variation of potential attenuation and the wild-type strain isolated respectively.
(1) class that has 4 kinds of a certain vaccines to be different from other vaccine and wild-type strain changes.Two kinds are present in (amino acid 331 and 2114) among Moraten and the Schwarz, and two kinds are present in (amino acid/11 624 and 2074) among the AIK-C.These sudden changes have special meaning, because all these viruses all are good vaccines.So these positions are attenuation sites.
Have only a position, promptly 1717, belong to (2) class, promptly all wild-types are aspartic acid all at this, and all vaccines are L-Ala.What is interesting is that this position is arranged in of two zones, the L gene of Measles virus and rabies virus (otherwise, be the height homologous) do not show unusual conservative property in described zone.This difference makes 1717 to be likely the key position of attenuation sudden change in the Measles virus.
Newer wild-type on time (1983 with Montefiore) has 5 places different with older wild-type (Edmonston and 1977), and 149,636,720,2017 and 2119, so these differences belong to (3) class.These differences prompt for the heredity drift rather than point out the attenuation mutational site.In addition, Montefiore (1989 strain isolated) has 16 place's differences (seeing Table 5) in addition with all the other strains.These may be heredity drift ((3) class) or random variation ((4) class).Remaining 23 place is (4) class difference, promptly is different from one or more viral differences that have jointly.
It is the sudden changes of potential attenuation (4) class that 3 places (1409,1649,1936) are arranged in these positions.Described changing into, two kinds of vaccine strains have and are different from the common change of wild-type strain for generations.These changes may with produce Rubeovax TMRelevant with the vaccine pedigree of Moraten vaccine (Fig. 1).
The applicant finds that their AIK-C vaccine strain nucleotide sequence has 21 place's differences with disclosed sequence (33), comprises that a place inserts and place disappearance.Several codings that caused in these differences change, and comprise that intragenic 2 places of L (being positioned at amino acid/11 477 and 2008) change.
So, for the preparation living vaccine along with measles for generations strain constantly be attenuated obtaining best replication, in the L gene order naturally other change of increase it seems and be suppressed and be limited.According to hypothesis, this the limited tolerance of L gene varied number and position not only is subjected to keeping the influence of polysaccharase multi-function capability needs, and the influence that changes of original 3 ' end promotor that just has that the L albumen that is subjected to producing subsequently is inevitable interacts with it (transcribe and duplicate obtaining).In other words, Zui Jia viral attenuation needs polymerase protein and its collaborative (promptly chain) that applies in the cis acting controlling element of effect is changed.
3 '-leader shows the lowest tolerated that changes, and it only allows Nucleotide 26 in the attenuation process (after all being that " A " becomes " T ") and Nucleotide 42 (" A " becomes " C " or " A " becomes " T ") two place's high selectivities to change (in the anti-genome messenger strand).In Zagreb, only there is a place to change in addition, promptly " G " at the 96th place becomes " A ", may be very important when this and L gene specific sexually revise when combining.As if 3 '-leader has only experienced place heredity drift since 1954, promptly the 50th " G " becomes " A " (seeing Table 3).
3 ' end changes into only in all MV vaccine strains that two place's pyrimidines are replaced by purine in the genome meaning chain in the genomic promoter region in the attenuation process.The common evolution of L gene is considered to reflect to help the selection result of viral delicate variation of breeding in different host cells in these attenuation processes.All vaccine strains are all cultivated in chicken embryo (CE) or chick embryo fibroblast (CEF) in its attenuation process (Fig. 1).In addition, some vaccine strain contacts with special host cell; That is, the Zagreb vaccine is cultivated in Madin-Darby canine kidney(cell line) and in the human diploid cell, and the AIK-C vaccine adapts to the sheep nephrocyte.Moraten and Rubeovax TMOnly in CE and CEF, grow.
Some pedigree specificity L gene alteration (Rubeovax TM, in Moraten and the Schwarz vaccine the 1649th; With the 1717th change in all vaccines) represented the L gene of a subgroup to adapt to the change of 3 '-leader, regulate for the vaccine attenuation and transcribe/reproduction process.In addition, the vaccine individual specificity changes (change of (1) class) may provide meticulous and harmonious adjusting for the virus replication of each vaccine strain/transcribe.
According to table 3 and above argumentation, the sudden change of the key attenuation of MV 3 ' genomic promoter region is Nucleotide 26 (A → T), Nucleotide 42 (A → T or A → C) and Nucleotide 96 (G → A) (in anti-genome, the messenger strand).
According to table 4 and above argumentation, the proteic key attenuation of L mutational site is as follows: amino-acid residue 331 (Isoleucine → Threonine), 1409 (L-Ala → Threonines), 1642 (Threonine → L-Ala), 1649 (arginine → methionine(Met)s), 1717 (aspartic acid → L-Ala), 1936 (Histidine → tyrosine), 2074 (glutamine → arginine) and 2114 (arginine → Methionins).Should be appreciated that, cause the Nucleotide of above-mentioned amino acid change to change to be not limited to hereinafter among the embodiment 1 table 4 listed; Causing the codon change to become above-mentioned amino acid whose all Nucleotide variations after translation all is included in the scope of the present invention.
3 type human parainfluenza viruses (HPIV-3) are another kind of non-sections, negative justice, strand coating RNA viruses.HPIV-3 belongs to Paramyxoviridae (seeing Table 1).15,462 Nucleotide of the gene group leader of HPIV-3, and 6 the nonoverlapping protein coding genes (57) of encoding.A kind of virion structural protein of respectively encoding of 5 genes wherein, they are called as NP (corresponding to the N albumen of MV), M, F, HN (hemagglutinin-neuraminidase) and L.The 6th mRNA coding P albumen, and, go back encoding D albumen by rna editing mechanism by eclipsed 5 ' the immediate opening code-reading frame (ORF) coding C albumen.
The same with MV, HPIV-3 comprises 3 '-non-encoding histone leader of 55 Nucleotide, but is different from Measles virus (this district is 37 Nucleotide), and it has 5 '-tail region of 44 Nucleotide of a segment length.Polysaccharase is with linearity, succession, initial-the mode open gene group that stops, and this mode is by the signals direct of transcribing in the RNA template.
Wild-type virus JS strain has been obtained to have the result (7,57) of prospect through the go down to posterity trial of developing the active HPIV-3 vaccine of attenuation of cell cultures under the suboptimal temperature degree.The go down to posterity evaluation of level has separated (CP) mutant strain of several strains " cold going down to posterity " according to JS strain difference.One of them mutant strain goes down to posterity from 45 times, is called as cp45.
This virus shows 3 interesting characteristics: (1) acclimatization to cold (ca): be lower than effective replication under 20 ℃ of suboptimal temperature degree; (2) temperature sensitivity (ts): can not replication in vitro when temperature is equal to or higher than 39 ℃; (3) little plaque morphological specificity.It seems that this mutant strain be candidate vaccine likely, because: (a) its ca, ts and little plaque phenotype keep stable after cell cultures goes down to posterity; (b) it duplicates the upper respiratory tract and the lower respiratory tract that is confined to hamster; (c) it has induced the remarkable protection (58,59) that anti-wild-type HPIV-3 attacks once more in hamster.
Evaluation to this strain in rhesus monkey shows that the attenuation sudden change in the cp45 is the combination (60) of ts and non-ts sudden change.After this show that in the intravital evaluation of chimpanzee cp45 has obtained gratifying attenuation, the protection (61) that still can induce high-caliber anti-wild-type virus to attack simultaneously.In seronegativity baby and children's, cp45 has been made preliminary clinical evaluation again, pointed out this candidate vaccine strain to have suitable infectivity and suitable attenuation, and have medium immunogenicity (61).
The cp45 strain is as described below to be cultivated in rhesus monkey embryo lung (FRhL) cell and Vero cell: the PIV-3 cp45 virus of cultivating in the FRhL cell prepares this virus by the FRhL cell monolayer that the MOI inoculation with 0.1-1.0 is paved with in tissue culture flasks.Metainfective cell with the EMEM substratum 32 ℃ of cultivations.After about 7 days, when observing maximum cell pathology effect (synplasm (synctyia)), culture is taken turns freeze-thaw cycle through one, results virus is compiled viral liquid and is stored in-70 ℃.
The Vero cell forms on microcarrier bead and is paved with individual layer in the bio-reactor, produces virus on the Vero cell monolayer, and the continuously stirring microcarrier bead is cultivated, and preparation is grown in the PIV-3 cp45 virus in the Vero cell thus.Metainfective bioreactor culture thing is maintained 30 ℃.After 4 to 5 days, results virus when observing synplasm (Syncytial) CPE.The nutrient solution that will contain virus is stored in-70 ℃.
Below be HPIV-3 JS wild-type strain (89) and be grown in the FRhL cell and the Vero cell in the nucleotide sequence (in the normal chain, anti-genome, messenger strand) of cp45 vaccine strain, and the putative amino acid sequence of the RNA polymerase of these HPIV-3 viruses (L albumen), referring to the corresponding SEQ ID of this paper NO.: Virus Nucleotide sequence The L protein sequence Wild-typeJS SEQ ID NO:17 SEQ ID NO:18 vaccine FRhL cp45 SEQ ID NO:19 SEQ ID NO:20Vero cp45 SEQ ID NO:21 SEQ ID NO:22
Above listed each PIV-3 viral genome is all grown 15,462 Nucleotide.The proteic translation of L starts from Nucleotide 8646 to 8648 codons; Translation stop codon is positioned at Nucleotide 15345 to 15347.Long 2,233 amino acid of the L albumen of translating into.
As embodiment 2 hereinafter and wherein table 6 in detail as described in, according to the difference between the cp45 sudden change vaccine strain of cultivating in wild-type JS strain and the FRhL, the key attenuation sudden change of HPIV-3 3 ' end genomic promoter region is Nucleotide 23 (T → C), Nucleotide 24 (C → T), Nucleotide 28 (G → T) and Nucleotide 45 (T → A) (in anti-genome, the messenger strand).Hereinafter embodiment 2 and table 6 wherein also describe in detail, and the proteic key mutational site of HPIV-3L comprises following site: amino-acid residue 942 (tyrosine → Histidine), 992 (leucine → phenylalanines) and 1558 (Threonine → Isoleucines).
In addition, the cp45 sudden change vaccine strain of Vero cultivation also has another place's sudden change because of the intragenic coding of L changes: amino-acid residue 1292 (leucine → phenylalanine).
Should be appreciated that, cause the Nucleotide variation of above amino acid change to be not limited to hereinafter described in the embodiment 2; Cause codon to change and after translation, become above amino acid whose all Nucleotide and change all within the scope of the present invention.
Human respiratory syncytial virus (RSV) is another kind of non-sections, negative justice, strand coating RNA viruses.The pneumonitis virus that RSV belongs to the pneumonitis virus subfamily belongs to (seeing Table 1).
According to the reactivity of F and G surface glycoprotein and monoclonal antibody, identified two kinds of main people RSV subgroups, be called A and B (62).Recently, the A of RSV strain and B pedigree are confirmed (63,64) by sequential analysis.Also be separated to should virus ox, sheep, goat strain.This viral host specificity is relevant with the G attachment protein obviously, and this albumen height deviates from (65,66) between people and Niu/sheep strain, and may (at least in part) influenced by the receptors bind effect.
RSV be virus pneumonia serious among the infant and bronchitic main diseases because of.Very popular among the baby of serious disease (being lower respiratory illness (LRD)) below 6 monthly ages.It is the most normal to betide not immune baby and contacts RSV first.RSV is also relevant with the air flue hyperergy with asthma, and is broncho-pulmonary dysplasia and congestive heart disease (CHD) " high-risk " child patient's the important cause of the death.It still makes children easily suffer from one of common respiratory tract infection of otitis media.In the adult, RSV mainly represents the upper respiratory disease of a kind of no complication (uncomplecated); But in the elderly, it can be equal to influenza, is both the susceptibility factor that develops into particularly bacillary bronchitis of serious LRD and pneumonia.Disease is confined to respiratory tract usually, but when serious immunocompromised, the diffusion to other organ may occur.Virus contains viral respiratory secretions to other position diffuse pollution thing by having polluted, and infection x nasal cavity, oral cavity or connection mucous membrane begin.
The RSV disease is seasonal, and virus only is separated to usually in the winter time in month, is that November is to April at the north latitude degree for example.Virus distributes very wide, has 90% to infect once at least below 2 years old among the children.Multiple strain co-propagate.Do not have the direct evidence (as A type influenza virus finding) of antigenic drift, accumulated amino acid change, point out immune pressure may promote the evolution of virus but sequence studies show that in G albumen and the SH albumen hypervariable region.
In mouse and cotton vole (cotton rat) model, the F of RSV and G albumen have all excited neutralizing antibody, and single providing with these protein immunizations resists the long-effective protection power (67,68) that infects again.
In human body, do not produce complete immunizing power, all can take place to infect again all the life (69,70) RSV; But, evidence suggests that immune factor can be protected and not suffer from serious disease.The reduction of disease seriousness with in the past once or multilayer infect relevant; and evidence suggests; the children that infected by one of RSV two main subgroups have the protection (71) that anti-homology subgroup to a certain degree infects; many observation promptings, the attenuated live virus vaccines may provide the provide protection that is enough to prevent serious M ﹠ M.Rsv infection simultaneous excitation antibody and cell-mediated immunity power.In some researchs, find relevantly with anti-LRD protection at the proteic serum neutralizing antibody of F and G, do not reduce to some extent though also prove upper respiratory disease (URD).The intravital high-level serum antibody of baby is relevant with anti-LRD protection, and intravenously is used the tire immunoglobulin (Ig) of RSV neutralizing antibody of height and is presented at the protection (70,72,73) that anti-severe is provided in the high risk child.The effect of local immunity, nose antibody specifically is among studying.
The ribonucleoprotein that constitutes of RSV virion is wrapped in the lipoprotein envelope.The virion that pneumonitis virus belongs in size with similar to all other paramyxovirus in shape.Show through negative staining and electron microscope, the virion out-of-shape, diameter is from 150 to 300nm differences (74).This viral nucleocapsid is a symmetric volution, similar to other paramyxovirus, but screw diameter is 12 to 15nm, but not 18nm.Coating is made of the double-layer of lipoid from host cell membrane, and contain encoding viral stride the film surface glycoprotein.Viral glycoprotein mediation adhewsive action and penetration, and be formed in the virion furcella with being separated from each other.All members of paramyxovirus subfamily have blood coagulation activity, but this function is not the feature of pneumonitis virus, because blood coagulation activity is not present in RSV, but are present in PVM (75).The neuraminic acid enzymic activity is present in the member of paramyxovirus, rubella virus (rubulavirus) genus, but is not present in Measles virus and the mouse pneumonia virus (PVM) (75).
RSV has two subgroups, A and B.Wild-type RSV genome (strain 2B) is the negative adopted RNA single strand (SEQ ID NO:23) of long 15,218 Nucleotide, becomes 10 kinds of main subgenomic mRNAs through transcribing.10 kinds of a kind of main polypeptide chains of each own coding of mRNA: three kinds is to stride film surface protein (G, F and SH); Three kinds is to combine the protein (N, P and L) that forms virus nucleocapsid with geneome RNA; Two kinds is Nonstructural Protein (NS1 and NS2), and they are at infected cell inner accumulation, but also trace is present in the virion, and may work at regulatory transcription with duplicate the time; A kind of is non-glycosylated virion stromatin (M); At last a kind of is M2, and another kind of non-glycosylated protein is proved to be RSV specific transcriptional elongation factor (see figure 3) recently.These 10 kinds of viral proteins have illustrated most encoding viral ability.
Viral genome is wrapped up by main nucleocapsid protein (N), and combines with phosphoric acid albumen (P) and big (L) polymerase protein.These three kinds of albumen are proved to be to instruct necessary and be enough (76) by the rna replicon of the RSV minimal genome of cDNA coding.Further research to have proved in order transcribing and to have carried out needing M2 albumen (ORF1) (74) fully.When M2 lacked, the transcription product of brachymemma occupied the majority, and the rescue (74) of full-length gene group does not take place.
M (stromatin) and M2 albumen all are that the virion of inside is conjugated protein, and they are not present in the nucleocapsid structure.Similar because of with other non-segmented negative-strand RNA viruses, M albumen is considered to make before packing transcribes inertia in the nucleocapsid, and mediates it and combine with peplos.Only measured very small amount of NS1 and NS2 albumen in purified virus, and thought that at present they are Nonstructural Proteins, their effect is still uncertain, though they may be the instrumentalities of transcribing and duplicating.There are three kinds to stride film surface glycoprotein: G in the virion, F and SH.G and F (syzygy) are envelope glycoproteins, and known their mediation virus is adhered to and penetrated (77) host cell.The proteic Unknown Function of SH is though report hints that it relates to the fusion function (78) of virus recently.
At present after measured the genomic complete sequence of two strain wild-type RSV B subgroups (2B and 18537) (see below literary composition SEQ ID NO:23 and 25).Geneome RNA had not both added yet polyadenylation (79) not of cap.No matter in virion and in the cell, geneome RNA is all combined closely with N albumen.
3 ' end of geneome RNA has the outer leader of the gene of one section 44 Nucleotide, infers that it contains main viral promotors (Fig. 3).3 ' end is 10 virogenes after the genomic promoter region, its be in proper order 3 '-NS1-NS2-N-P-M-SH-G-F-M2-L-5 ' (Fig. 3).L gene back is tail region (Fig. 3) outside the gene of 145 to 149 Nucleotide.Each gene all starts from one section conservative property, 9 nucleotide gene start signals, 3 '-GGGGCAAAU, and (except the 10 Nucleotide start signals of L gene, it is 3 '-GGG ACAAA AU; The difference place has added underscore).Each gene transcription is all from first Nucleotide of this signal.Each gene all ends at gene end (the 3 '-AGU/GU/AANNNU/AA of one and half conservative propertys, 12 to 14 Nucleotide 3-5) (wherein N may be in 4 kinds of bases any one), this end instructs the termination and the polyadenylation (Fig. 3) of transcribing.Preceding 9 genes of RSVB strain all are nonoverlapping, are separated with the intergenic region (Fig. 3) of 3 to 56 Nucleotide.Intergenic region does not contain any conservative property motif or any tangible second structure characteristic, and intergenic region proof little replicon (minreplicon) system in in front and continue after genetic expression do not influence (Fig. 3).Latter two RSV gene has 68 Nucleotide overlapping (Fig. 3).The gene start signal of L gene is positioned within the M2 gene, rather than after it.The overlap of these 68 Nucleotide encoded last 68 Nucleotide (do not comprise poly-A tail) of M2 mRNA and preceding 68 Nucleotide of coding L mRNA.
The product that genome is transcribed is that 10 kinds of different subgene group polyadenylation mRNA and many poly-cistron polyadenylations are even read transcription product (74).The genome deactivation method of utilizing UV (ultraviolet) light to mediate is transcribed drawing and be studies show that, the RSV gene begins to be transcribed (80) by 3 ' to 5 ' order from the single promotor near 3 ' end.So as if RSV is synthetic has followed with regard to singlely entering of proposing of all mononegavirale virus, consecutive transcription pattern (16,81).According to this pattern, polysaccharase (L) contacts geneome RNA with the form of nucleocapsid at 3 ' end genomic promoter region, and from first Nucleotide transcriptional start.RSV mRNA is the collinearity copy of these genes, does not find that mRNA edits or montage.
The sequential analysis of RSV mRNA shows synthetic first Nucleotide (74) of gene start signal separately that originates in of each transcription product in the born of the same parents.The structure that mRNA 5 ' end is added cap is as follows: m7G (5 ') ppp (5 ') GP (G that wherein is added with underscore is first template nucleotide of mRNA), and, mRNA separately 3 ' end polyadenylation (82).These two kinds of modifications are considered to be carried out in corotation record mode by varial polymerases.3 area discovers of RSV3 ' end genomic promoter are important cis-acting elements (83).They are preceding 10 Nucleotide (supposing to play a part promotor), Nucleotide 21-25 and the gene start signal (83) that is positioned at Nucleotide 45-53.Different with other paramyxovirus such as measles, celestial platform and PIV-3, find that the rest part of RSV NS1 gene leader and non-coding region is found the tolerance (83) that insertion, disappearance and replacement is had height.
In addition, by to 3 ' end genomic promoter region preceding 12 Nucleotide saturation mutagenesis (promptly, each base is by respectively with a kind of displacement of its excess-three kind base and the efficient of relatively translating and duplicating), a succession of U that is positioned at Nucleotide 6-10 demonstrates the height inhibition (83) to replacement.On the contrary, preceding 5 multiple replacements of nucleotide pair tolerate relatively, and wherein two of the 4th is to raise sudden change, cause duplicating and transcribe and having improved 4 to 20 times of RSV-CAT RNA.By using the little replicon of bicistronic mRNA (minireplicon) system, prove that it is mRNA synthetic signal that gene initial sum gene stops motif, and seemingly self controls, with the characteristic irrelevant (84) of flanking sequence.
L gene start signal has occupied 68 Nucleotide in upstream of M2 gene termination signal, overlapping (Fig. 3) (74) of causing gene thus.M2 gene termination signal causes in the L gene that the L genetic transcription is regular to be finished in advance.Total length LmRNA is considerably less, only can not discern at polysaccharase just to produce when the M2 gene stops motif.This makes that transcribing of LmRNA is much lower.The overlapping linear precedence transcriptional profile that as if do not meet of gene.Do not know whether jump back to again outside the L gene start signal after polysaccharase leaves the M2 gene, perhaps whether exist second internal promoter (74) of L genetic transcription.Also may, it is approaching that the L gene can be aggregated small segment institute of enzyme, this enzyme can not be come L gene start signal place and slipped over the M2 gene at M2 gene start signal place transcriptional start.
The relative abundance of various RSV mRNA reduces with the distance of each gene apart from promotor, supposes that this is to cause (80) owing to polysaccharase fails in the consecutive transcription process.Gene overlap is to cause the synthetic second kind of mechanism that reduces of total length L mRNA.And some mRNA has the characteristic that possibility reduces translation efficiency.The initiator codon of SH mRNA is in the suitable Kozak sequence environment in an Asia, and G ORF starts from second methionyl codon in the mRNA.
Duplicating of RSV RNA is considered to the pattern (16,81) that (74) have followed institute's proposition of carrying out with vesicular stomatitis virus and Sendai virus.This relates to and changes anti-terminator into from mRNA synthetic termination-initial modes and connect reading mode.The result has synthesized justice to duplicate-middle (RI) RNA, and this is and the accurate complementary copy of geneome RNA.Then, it plays the effect of synthon for genomic templates.Relate to the mechanism that changes to the anti-terminator pattern and it is believed that relating to the record of N albumen corotation wraps up nascent RNA (16,81).Rna replicon in the RSV is the same with other non-segmented negative-strand RNA viruses, depends on ongoing protein synthesis (85).Detected the RI RNA (74,85) that standard virus and RSV-CAT minigene group were once foretold.In standard virus system and minigene group system, abundance is all than low 10 to 20 times of filial generation genome in the born of the same parents of RI RNA.The corresponding separately herein sequence numbering (SEQ.ID No) of following reference has provided the putative amino acid sequence of the RNA polymerase (L albumen) of the nucleotide sequence (with the demonstration of normal chain, anti-genome messenger strand) of various wild-types, vaccine type and answer type RSV strain and these RSV viruses: Virus Nucleotide sequence The L protein sequence Wild-type2B SEQ ID NO:23 SEQ ID NO:2418537 SEQ ID NO:25 SEQ ID NO:26 vaccine 2B33F SEQ ID NO:27 SEQ ID NO:282B20L SEQ ID NO:29 SEQ ID NO:30 reply type 2B33F TS (+) SEQ ID NO:31 SEQ ID NO:322B20L TS (-) SEQ ID NO:33 SEQ ID NO:34
Long 2,166 amino acid whose L albumen of various RSV virogene group codings.Genome length and other Nucleotide information are as follows: Virus Genome Wild-type Length The L initiator codon The L terminator codon2B 15218 8502-8504 15000-1500218537 15229 8509-8511 15007-15009 Vaccine2B33F 15219 8503-8505 15001-150032B20L 15219 8503-8505 15001-15003 The answer type2B33F TS (+) 15219 8503-8505 15001-150032B20L TS (-) 15219 8503-8505 15001-15003
As the hereinafter detailed description of embodiment 3 (particularly table 7 and table 8), the crucial attenuation sudden change of 3 ' end genomic promoter region of RSV B subgroup is Nucleotide 4 (C → G) and inserted an A (in the anti-genome messenger strand) in a succession of A of Nucleotide 6-11.Hereinafter embodiment 3 also describes in detail, the potential attenuation of the proteic key of the L of RSV site is as follows: amino-acid residue 353 (arginine → Methionin), 451 (Methionin → arginine), 1229 (aspartic acid → l-asparagines), 2029 (Threonine → Isoleucines) and 2025 (l-asparagine → aspartic acids).Should be appreciated that, cause the Nucleotide variation of above-mentioned amino acid change to be not limited to hereinafter those described in the embodiment 3; All cause codon to change and become above-mentioned amino acid whose Nucleotide change all within the scope of the present invention after translation.
Compare with infected person and animal host's wild-type virus, attenuated virus of the present invention shows significantly weakening of virulence.The degree of attenuation reaches in the individuality of great majority acceptance immunity infection symptoms can not occur, but virus has kept enough replicatioies and also can excite required type of immune response at infectious (infectious in) in having in vaccine inoculation person's body.
Attenuated virus of the present invention can be used to prepare vaccine.For this reason, attenuated virus is adjusted to suitable concentration and be equipped with suitable vaccine adjuvant, diluent or carrier.Can be with acceptable medium on the physiology as carrier.They include but not limited to: suitable isotonic solution, phosphate buffered saline buffer or the like.Suitable adjuvant includes but not limited to MPL TM(3-O-deacylated tRNA list phosphoric acid lipid A; RIBI ImmunoChem Research, Inc., Hamilton, MT) and IL-12 (Genetics Institute, Cambridge, MA).
In one embodiment of the invention, be intended to comprising that the preparation of attenuated virus is as vaccine.Can be with attenuated virus and freezing protection additive or stablizer, for example protein (as serum albumin, gelatin), sugar (as sucrose, lactose, sorbyl alcohol), amino acid (as Sodium Glutamate), salt or other protective material mix.This mixture is maintained liquid state, then or drying or lyophilize in order to transportation with preserve, mix with water more before use temporarily.
The preparation that comprises attenuated virus of the present invention can be used for immune human or animal, the protection that infects with the corresponding wild-type virus of reactance attenuated virus.So the present invention also provides a kind of immune body to induce the method for anti-mononegavirale virales picornavirus infection protection, promptly by giving the vaccine preparation of individual immunity significant quantity, described preparation has mixed the above viral attenuation variant of this paper.
In order to excite immunne response, the essential capacity vaccine of giving individual with suitable number of times.Those skilled in the art can easily determine above-mentioned total amount and dose.Administration can be by arbitrary conventional effective means, for example in the nose, parenteral, per os or be locally applied to mucomembranous surface (for example in the nose, oral cavity, intraocular, vagina or rectum surface), for example passes through aerosol spray.Preferred means of administration is an intranasal administration.
In another embodiment of the present invention, one section isolated nucleic acid molecule is used to produce oligonucleotide probe (from normal chain response gene group messenger strand or minus strand complementary gene group chain) and expression of peptides (only from the anti-genome messenger strand of normal chain), described nucleic acid molecule has the intact virus nucleotide sequence of wild-type virus or vaccine virus of the present invention, and described peptide is used for detection, body fluid or tissue sample and whether has described wild-type virus and/or vaccine strain.This nucleotide sequence is used for designing the diagnostic test of high specific and susceptibility, whether there to be virus in the test sample.
The primer that has synthesized polymerase chain reaction (RCR) according to the sequence of viral wild-type of the present invention or vaccine.Testing sample is carried out the RNA reverse transcription, the pcr amplification in the cDNA district that selectes then, described zone is corresponding to nucleotide sequence as herein described (the peculiar Nucleotide that promptly has certain virus strain).On gel, identify amplification PCR products, and by confirming their specificity with the hybridization of specificity nucleotide probe.
Test with ELISA and to detect the antigen that whether has wild-type or vaccine virus strain.Design is also picked out the peptide that contains one or more unique residues (according to wild-type as herein described or vaccine sequence).Then with these peptides and haptens (as keyhole _ hemocyanin (KLH)) coupling, and be used for immune animal (for example rabbit) with the manufacture order specific polyclonal antibody.Select polyclonal antibody or, be used for the antigen that " capturing ELISA " detects described virus generation then polyclonal antibody and monoclonal antibody combination.
Moraten measles virus vaccines strain sample has been used for the microbial preservation budapest treaty (" budapest treaty ") of patented procedure according to international recognition by the applicant, at American type culture collection (12301Parklawn Drive, Rockville, Maryland 20852, USA.) carried out preservation, the preserving number that obtains is ATCC VR2587.The sample that the Vero of HPIV-3 virus cultivates the cp45 vaccine strain by the applicant according to budapest treaty, (Maryland 20852 for 12301 Parklawn Drive, Rockville in American type culture collection, USA.) carried out preservation, the preserving number that obtains is ATCC VR2588.The sample of 2B wild-type RSV virus by the applicant according to budapest treaty, (Maryland 20852 for 12301 Parklawn Drive, Rockville in American type culture collection, USA.) carried out preservation, the preserving number that obtains is ATCC VR2586.
According to above three parts of strain of preservation and sequence informations of relevant they and other virus strain, can utilize previously described site-directed mutagenesis and rescue technology to be incorporated herein the sudden change (or keep wildtype phenotype) of described all virus strain, and hereinafter obtain these virus strain and carry out other sudden change in the listed sudden change group in the table 3,4 and 6 to 8.
For the present invention is understood better, provide following examples.They are for the present invention is described, it can not be understood as to limit the scope of the invention.
Embodiment
Standard molecular biological technique uses according to described program such as Sambrook (86).
Embodiment 1
Measles
Before preparation RNA is used for sequential analysis, in the Vero cell, directly from AttenuvaxTM vaccine bottle (lot number #0716B) Moraten MV vaccine virus is cultivated a generation, the Schwarz vaccine virus is cultivated a generation (lot number #96G04/M179 G41D), Zagreb and Rubeovax TMVaccine virus respectively cultivated for 2 generations.Before the extracting RNA material, MV wild-type strain isolated Montefiore (56) is gone down to posterity in the Vero cell 5 to 6 times, similarly, before the extracting RNA material is used for analyzing, MV wild-type strain isolated 1977,1983 (14) is gone down to posterity 5 to 7 times.From Dr.J.Beeler (CBER) receive Edmonston wild-type strain isolated (see figure 1) be before receiving at human kidney cells go down to posterity 7 times and in the Vero cell, go down to posterity 3 times former Edmonston strain isolated, before being used for sequential analysis, go down to posterity once at the Vero cell again.
Infection multiplicity with 0.1 to 1.0 (m.o.i.) vero cells infection prepares RNA, and just gathers in the crops after allowing it reach maximum cytopathogenic effect.Use Trizol TMTotal RNA of reagent (Gibco-BRL) extracting viral infection of measles cell.
From the total RNA of Vero passage material separation, with reversed transcriptive enzyme-PCR (Perkin-Elmer/Cetus) amplification, use be that Corticovirus genomic 3 ' and 5 ' promoter region and 5 ' hold measles (EdmonstonB strain (the 19)) Auele Specific Primer of L gene right.Table 2 provides the sequence of these primers.Primer SEQ IDNO:35-54,74,77 and 78 is anti-genome messenger strands.Primer SEQ ID NO:55-73,75,76 and 79 is genome minus strands.
Table 2
PCR primer and to the order-checking of MVL gene and genome end 9047CATATCACTC ACTCTGGGAT GGAG 9070(SEQ ID NO:35) 9371TCAGAACATC AAGCACCGCC 9390(SEQ ID NO:36) 9741ACAGTCAAGA CTGAGATGAG 9760(SEQ ID NO:37) 10001AAGAGTCAGA TACATGTGGA 10020(SEQ ID NO:38) 10351ACATGAATCA GCCTAAAGTC 10370(SEQ ID NO:39) 10674CCGAAAGAGT TCCTGCGTTA CGACC 10698(SEQ ID NO:40) 11083CAGTCCACAC AAGTACCAGG 11102(SEQ ID NO:41) 11461GTCAGAAGCT GTGGACCATC 11480(SEQ ID NO:42) 11841AATATTGCTA CAACAATGGC 11860(SEQ ID NO:43) 12196ACTCTTCATT CCTAGACTGG 12215(SEQ ID NO:44) 12542GTCCAATTAT GACTATGAAC 12561(SEQ ID NO:45) 12891AGAACAGACA TGAAGCTTGC 12910(SEQ ID NO:46) 13232CCAACAAGGA ATGCTTCTAG 13251(SEQ ID NO:47) 13551ACAGCACTAT CTATGATTGA CCTGG 13575(SEQ ID NO:48) 13930GCAACATGGT TTACACATGC 13949(SEQ ID NO:49) 14280AGATTGAGAG TTGATCCAGG 14299(SEQ ID NO:50) 14629AGGAGATACT TAAACTAAGC 14648(SEQ ID NO:51) 14981TAAGCTTATG CCTTTCAGCG 15000(SEQ ID NO:52) 15337TTAACGGACC TAAGCTGTGC 15356(SEQ ID NO:53) 15671GAAACAGATT ATTATGACGG 15690(SEQ ID NO:54) 9290CGGGCTATCT AGGTGAACTT CAGG 9267(SEQ ID NO:55) 9500ATTTGGATAT GGAATATGAG 9481(SEQ ID NO:56) 9840ACTCAACTGA ACTACCAGTG 9821(SEQ ID NO:57) 10181AAGAACATCA TGTATTTCAG 10162(SEQ ID NO:58) 10549TTATCAACGC ACTGCTCATG 10530(SEQ ID NO:59) 10919ATTTTCAGCA ATCACTTGGC ATGCC 10895(SEQ ID NO:60) 11280GCCTCTGTGC AAACAAGCTG 11261(SEQ ID NO:61) 11638TCTCTAGTTA CTCTAGCAGC 11619(SEQ ID NO:62) 12010AGGTCGTTGT TTGTGAGGAG 11991(SEQ ID NO:63) 12361TCGTCCTCTT CTTTACTGTC 12342(SEQ ID NO:64) 12689CCGTCCTCGA GCTAGCCTCG 12670(SEQ ID NO:65) 13052CTCCTCCAGG CTCACATTGG 13033(SEQ ID NO:66) 13420GGGTTGGTAC ATAGCTCTGC 13401(SEQ ID NO:67) 13767CACCCATCTG ATATTTCCCT GATGG 13743(SEQ ID NO:68) 14099TGGTTGACAG TACAAATCTG 14080(SEQ ID NO:69) 14460CTGAAATGGG AAGATTGTGC 14441(SEQ ID NO:70) 14820AGCAATCTAC ACTGCCTACC 14801(SEQ ID NO:71) 15180TCACAGATGA TTCAATTATC 15161(SEQ ID NO:72) 15530GATCCTAGAT ATAAGTTCTC 15511(SEQ ID NO:73) 1ACCAAACAAA GTTGGGTAAG G 21(SEQ ID NO:74) GGGGGATCC 100ATCCCTAATCCTGCTCTTGTCCC 78(SEQ ID NO:75) 200GATTCCTCTG ATGGCTCCAC 181(SEQ ID NO:76) 15221TAACAGTCAA GGAGACCAAA G 15741(SEQ ID NO:77) GGGAAGCTT 15801AACCCTAATCCTGCCCTAGGTGG 15823(SEQ ID NO:78) 15894ACCAGACAAA GCTGGGAATA GA 15873The genomic overlapping PCR fragment of (SEQ ID NO:79) intact virus directly check order and without cloning to obtain consensus sequence with dideoxy terminator cycle sequencing (ABI PRISM377 sequenator and ABI PRISM 377 sequencing kits) with double-stranded.In order to determine the sequence of absolute ends, adopted described linker (55) in the past.
In order to check hypothesis of the present invention, measured Edmonston wild-type MV strain isolated for generations, the nucleotide sequence of the vaccine strain that is derived from this strain isolated that obtains and non-encoding histone control region of other wild strain and L gene.To shown in 5 each Nucleotide (anti-genome messenger strand) and amino acid difference are compared and arrange (difference is sentenced italic and represented) as table 3 hereinafter then:
Table 3
Difference in MV3 ' the end genomic promoter region nucleotide sequence
The Nucleotide numbering: Virus
26 42 50 96Edmonston w-t A A G G vaccine: Rubeovax TMT C G GMoraten T C G GSchwarz T C G GZagreb T T G AAIK-C T C G G wild type: 1977 A A A G1983 A A A GMontefiore A A A G
Table 4
MV L Nucleotide and amino acid whose difference between Edmonston wild-type and the vaccine strain
331 1409 1624 1649 1717 1887 1936 2074 2114Edmonston w-t ATT GCA ACC AGG GAT AAC CAT CAA AGA sudden change ACT ACA GCC ATG GCT GAC TAT CGA AAAEdmonston w-t I A T R D N H Q RRubeovax TMvac. I A T M A D H Q RMoraten vac. T A T M A D H Q KSchwarz vac. T A T M A D H Q KZagreb vac. I T T R A N H Q RAIK-C vac. I T A R A N Y R R
Table 5
MV L Nucleotide and amino acid whose difference between the wild-type strain
81 122 149 252 331 441 447 500 513 570 613Edmonston w-t GCC GAT GTT ACA ATT AAA AAA GAT GTG AAA TAC sudden change ACC AAT ATT GCA GTT AGA AGA AAT ATG AAT CACEdmonston w-t A D V T I K K D V K Y1977 w-t A N V T V K K D M K Y1983 w-t T D I T I K K N M N HMontefiore w-t A D I A I R R D M K Y
618 621 623 626 628 632 636 637 641 645 650Edmonston w-t GTC AGT AGG AGA GCA ATA CAA GTA GAC GAT ATG sudden change GCC AAT AAG AAA GAA GTA CAT ATA AAT AAT ATAEdmonston w-t V S R R A I Q V D D M1977 w-t A S R R A I Q I D N M1983 w-t V N K R A I H V D D MMontefiore w-t V S R K E V H V N D I
652 720 723 794 914 970 1,044 1,294 1,569 1705 1745Edmonston w-t GCT ATC TAT CGG CGG GCC GGA AGC GTT ATC AAT sudden change ACC GTC TGC TGG CAG TCA AGA ACC ATT GTC AGTEdmonston w-t A I Y R R A G S V I N1977 w-t A I C W Q A G S V I N1983 w-t A V C R R S G T I I NMontefiore w-t T V C R R A R S V V S
1860 1,865 1,936 2,007 2,013 2,017 2,030 2,096 2119 2165Edmonston w-t GTA TTC CAT GAC GAT ACT AAT ATA AAG GTC sudden change ATA TAC TAT GGC GGT ATT AGT GTA CGG ATCEdmonston w-t V F H D D T N I K V1977 w-t V Y H D D T N I K V1983 w-t V F Y D G I N I R IMontefiore w-t I F H G D I S V R V
Embodiment 2
PIV-3
Table 6 is that the parental generation wild-type JS strain and the FRhL-sequence with the cp45 mutant strain Vero cultivation that cultivate of PIV-3 virus compares (in the anti-gene messenger strand).If the change of codon does not cause amino acid whose change, table 6 is with " non-" expression, and the back is unaltered amino acid name.
Table 6
The cp45 that Vero cultivates and FRhL-cultivates and the sequence of JS strain are relatively
Gene regions nucleotide position JS FRhL cp45 Vero cp45 codon changes amino acid change
(numbering in the L) 3 ' leader sequence, 23 T C C
24?????????C?????????T????????????T
28?????????G?????????T????????????T
45 T A ANP UTR, 62 A T TNP coding regions, 397 T C C GTA → GCC Val → Ala
1275 T G G TCT → GCT Ser → AlaP code area 2080 T C C AAT → AAC are non-/and AsnM code area 4347 C A A CCC → ACC Pro → ThrF code area 5536 C T T AAC → AAT are non-/Asn
6329?????????A?????????G????????????G??????????ATA→GTA???????Ile→Val
6419 G A A GCA → ACA Ala → ThrHN coding region 6847 T C C GGT → GGC are non-/Gly
7956 T C C GTT → GCT Val → AlaL coding region 9323 T C C TAT → TAC are non-/Tyr (226)
9971 A G G GAA → GAG are non-/Glu (442)
11469?????????T?????????C????????????C??????????TAC→CAC???????Tyr→His(442)
11621?????????G?????????T????????????T??????????TTG→TTT???????Leu→Phe(942)
11521?????????A?????????A????????????T *????????TTA→TTT???????Leu→Phe(1292)
12581 C T T TTC → TTT are non-/Phe (992)
13318 C T T ACT → ATT Thr → Ile (1558) # suddenlys change 20 20
The sequential analysis of the cp45 mutant strain that PIV-3 virus parent's wild-type JS strain and FRhL are cultivated shows that the latter has 20 places' Nucleotide and changes.Wherein 4 are in non-coding 3 ' the end leader, are: Nucleotide 23 (T → G), 24 (C → T), 28 (G → T) and 45 (T → A) (in the anti-genome messenger strands).From the genome minus strand, become bigger purine (" A ") by less pyrimidine (" C ") on the 28th and may change the size that is clipped in zone between interior two conserved regions of 3 ' end genomic promoter region, changed the cis acting signal thus and offered to the space of polysaccharase.
It is that the intragenic coding of NP, M, F, HN and L changes that 9 places change.The change of 7 places is non-coding or reticent change the in NP, P, F, HN and L gene or the NP non-translational region (UTR) in addition.Verified, since its ts phenotype, cp45 sudden change transcriptional activity very weak (87) under nonpermissive temperature.This ts phenotype is existing to cause (88) through atlas analysis by viral L gene.Because verified, with regard to the sudden change in HN and the F glycoprotein, cp45 virus can normally be exercised its function (87), and this has just supported this hint, that is, the intragenic sudden change of 3 ' leader and L has contribution to the attenuation phenotype of virus.
So, 4 place's specificitys in 3 ' the end leader of the cp45 that FRhL cultivates change, sexually revise with causing with intragenic 3 places of the L of upper/lower positions amino acid change coding, attenuation phenotype to the cp45 candidate vaccine strain has been made significant contribution, described amino acid change is: 942 (Tyr → His), 992 (Leu → Phe) and 1558 (Thr → Ile).
And the cp45 sudden change vaccine strain that Vero cultivates also has another place's sudden change because of the intragenic coding of L changes: amino-acid residue 1292 (leucine → phenylalanine) (marking with asterisk in the table 6).
Preceding two amino acid whose changes (the 942nd and 992) are arranged in one of high conservative region of all paramyxovirus L genes in the L albumen through atlas analysis.The 3rd amino acid change (the 1558th) is through the joining region of atlas analysis between two conservative pieces, and corresponding with it is the change of amino acid/11 717 in the MV vaccine strain.
Disclosed document (89) has only pointed out that 18 places change between the anti-genome messenger strand of cp45 strain that JS and FRhL cultivate.The applicant has found 16 places wherein.
Disclosed document not report is changed by 4 places that the applicant finds: and the Nucleotide 45 in the 3 ' leader (T → A), Nucleotide 62 in the NP UTR (A → T), or cause (the Nucleotide 397 of Val → the Ala) (T → A) and cause amino acid change (Nucleotide 1275 (T → C) (Nucleotide on the anti-genome messenger strand changes) of Ser → Ala) of amino acid change in the NP albumen.Disclosed document was not reported another place's potential attenuation sudden change that the inventor finds yet in the cp45 vaccine strain that Vero cultivates, it is because of the Nucleotide 12521 (amino acid/11 292 of A → T) cause (Leu → Phe).
Embodiment 3
The B subgroup of RSV
Temperature sensitivity phenotype (ts) is closely related with the interior attenuation of body; In addition, some non-ts sudden change may also cause attenuation.By sequential analysis and the ts to RSV mutant strain and revert strain, the evaluation to ts and the sudden change of non-ts attenuation has been finished in the evaluation of these phenotypes of growth in acclimatization to cold (ca) and the body.
Below 5 kinds of RSV 2B strains genome all the order-checking: 2B parent, 2B33F, revert strain 2B33F TS (+), 2B20L and revert strain 2B20L TS (+).2B33F and 2B20L strain are ts and ca, describe (90) to some extent in this paper reference U.S. Patent application of quoting 08/059,444.In having identified 2B33F and 2B20L the sudden change region after, measured the sequence in the following strain aforementioned region again: obtain in addition 9 kinds of 2B33F " answers " strain isolateds at 39 ℃ of subculture in vitro separately with behind cercopithecus aethiops or chimpanzee interior generation and obtain other 9 kinds of 2B20L " answer " strain isolated at 39 ℃ of subculture in vitro separately.
Table 7 is to 12 being summaries of these results.
Table 7
Sequence between RSV 2B and 2B33F virus strain relatively
Nucleotide position+ Nucleotide changes
Gene/zone 3 ' the end of vRNA ??RSV?2B ??RSV ??2B33F RSV 2B33F TS (+), 5a revert strain Amino acid change
Genomic promoter ???4 ???6 ????C ????- The A that G is extra The A that G is extra The non-coding of non-coding
??M ???4175 ???4199 ????T ????T ????C ????C ????C ????C The non-coding of non-coding
??SH ???4329 ???4409 ???4420 ???4442 ???4454 ???4484 ???4497 ???4505 ???4525 ???4526 ???4542 ???4561 ???4575 ???4598 ????T ????T ????T ????T ????T ????T ????T ????T ????T ????T ????T ????T ????T ????T ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C ????C The non-Thr (99 of the non-Ser of the non-Tyr of the non-Cys of the non-His of the non-Ile of Phe-Leu (10) (36) Ile-Thr (40) (47) (51) (61) termination-Gln (66) (68) Ile-Thr (75) Ile-Thr (75) termination-Gln (81) Leu-Pro (87) Trp-Arg (92)
??L ???9559 ???9853* ???12186 ???14587 ???15071 ????G ????A ????G ????C ????A ????A ????G ????A ????T ????G ????A ????A ????A ????T ????G The non-coding of Arg-Lys (353) Lys-Arg (451) * Asp-Asn (1229) Thr-Ile (2029)
+: the nucleotide position number number of M, the SH of 2B33F and 2B33F TS (+) and L gene than 2 bigger.
*: in 2B33F TS (+) strain, the 9853rd Lys → Arg change reverts back to Lys.
Table 8
Sequence between RSV 2B and 2B20L virus strain relatively
Nucleotide position+ Nucleotide changes
Gene/zone 3 ' the end of vRNA RSV?2B ???RSV ??2B20L RSV 2B20L TS (+), R1 revert strain Amino acid change
Genomic promoter ??4 ??6 ????C ????- The A that G is extra The A that G is extra Non-coding *Non-coding *
?L ??8963 ??13347 ??14587 ??14649 ??14650 ????C ????A ????C ????A ????A ????T ????A ????T ????G ????A ????T ????G ????T ????G ????T Non-Thr (154) Asn-AsP (1616) Thr-Ile (2029) *????Asn-Asp(2050) ????Asn-Asp-Val ????(2050) **
+: the nucleotide position number number of the L gene of 2B20L and 2B20L TS (+) is more bigger than 2B. *: the total sudden change in 2B33F and the 2B20L virus strain. *: at the 14650th, sudden change has suppressed the ts phenotype in 2B20L TS (+) the revert strain.
Table 9
RSV 2B, ts and revert strain
Sample The source External phenotype ts ca Grow cotton vole AGM in the body
39/32 ℃ of EOP plaque form 20/32 ℃ of yield Nasal mucus Lung The nose washing lotion Bronchial perfusate
RVS?2B Wild parent plant ????0.7 ????(WT) ???0.0001 ????5.5 a????3.9 b????(4/4) ????5.8 a????5.2 b????(4/4) ????5.8 e????(4/4) ????4.7 e????(4/4)
RVS?2B33F Ca, ts sudden change separates self cooling 33 times the 2B of going down to posterity ????0.00007 ????(sp/int/wt) ???0.04 ??≤1.6 a??<1.9 b????(1/4) ????<1.5 a????<1.2 b????(0/4) ????3.0 e????(4/4) ????<0.9 e????(0/4)
RVS?2B33F-5a TS(+) The 2B33F turn is gone down to posterity, at the plaque of 39 ℃ of pickings ????0.5 ????(WT) ???0.03 ??≤1.7 a????(1/4) ????3.5 a????(4/4) ????4.2 e????(4/4) ????4.0 e????(4/4)
RVS?2B33F-4a TS(+) The 2B33F turn is gone down to posterity, at the plaque of 39 ℃ of pickings ????0.7 ????(WT) ???0.01 ??≤1.7 a????(3/4) ????3.8 a????(4/4) ????ND ????ND
RVS?2B33F-3b TS(+) The 2B33F turn is gone down to posterity, at the plaque of 39 ℃ of pickings ????0.5 ???(WT) ???0.04 ??≤2.5 a????(3/4) ????2.9 a????(4/4) ????ND ????ND
AGM?pp2 The AGM#A2 that 2B33F infects, d7 nose washing lotion is at the plaque of 32 ℃ of pickings ????0.3 ???(sp,int) ???0.00002 ??≤2.0 b????(1/4) ????1.6 b????(4/4) ????ND ????ND
AGM?pp4 The AGM#A2 that 2B33F infects, d7 nose washing lotion is at the plaque of 32 ℃ of pickings ???0.1 ???(sp,int) ???0.008 ??<1.6 b????(0/4) ????1.2 b????(4/4) ????ND ????ND
AGM?pp6 The AGM#A4 that 2B33F infects, d12 nose washing lotion is at the plaque of 32 ℃ of pickings ???0.000004 ???(wt) ?≤0.00005 ??≤1.5 b????(1/4) ????<1.1 b????(0/4) ????ND ????ND
AGM?pp7 The AGM#A2 that 2B33F infects, d12 nose washing lotion is at the plaque of 32 ℃ of pickings ???0.000004 ???(sp/int/wt) ???0.007 ???≤1.4 b???(1/4) ????<1.0 b????(0/4) ????ND ????ND
Table 9 (continuing)
RSV 2B, ts and revert strain
Sample The source External phenotype ts ca Grow cotton vole AGM in the body
39/32 ℃ of EOP plaque form 20/32 ℃ of yield Nasal mucus Lung The nose washing lotion Bronchial perfusate
Chimpanzee pp1A The chimpanzee #1552 that 2B33F infects, d4 lavage of trachea liquid is at the plaque of 32 ℃ of pickings ?0.5 ?(WT) ???ND ??ND ??ND ??ND ??ND
Chimpanzee pp3A The chimpanzee #1560 that 2B33F infects, d6 lavage of trachea liquid is at the plaque of 32 ℃ of pickings ?0.7 ?(WT) ???ND ??2.4 c??(4/4) ??≤3.0 c??(3/4) ??ND ??ND
Chimpanzee pp5A The chimpanzee #1563 that 2B33F infects, the d6 nose swab is at the plaque of 32 ℃ of pickings ?0.7 ?(WT) ???ND ??≤2.3 c??(3/4) ??3.0 c??(4/4) ??ND ??ND
RVS?2B20L Ca, ts sudden change separates self cooling 20 times the 2B of going down to posterity ?0.0002 ?(int/wt) ???0.02 ??<1.9 d??(0/4) ??<1.3 d??(0/4) ??<0.7 f??(0/2) ??<0.7 f??(0/2)
RVS?2B20L?R1 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ?0.6 ?(WT) ???ND ??2.3 c??(4/4) ??3.5 c??(4/4) ??ND ??ND
RVS?2B20L?R2 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ?0.6 ?(WT) ???ND ??≤2.5 c??(3/4) ??2.7 c??(4/4) ??ND ??ND
RVS?2B20L?R9 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ?0.8 ?(WT) ???ND ??≤2.2 c??(3/4) ??4.0 c??(4/4) ??ND ??ND
RVS?2B20L?R10 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ?0.7 ?(WT) ???ND ??2.6 c??(4/4) ??3.2 c??(4/4) ??ND ??ND
*: growth, i.e. Log in the body of mensuration 10Average virus titer (# infected/# sum).ND=does not carry out.WT=wild-type plaque size.The little plaque of sp=.Int=median size plaque. aDosage=10 6.6PFU IN bDosage=10 5.6PFU IN cDosage=10 6.3PFU IN dDosage=10 5.9PFU IN eDosage=10 6.6PFU IN+IT fDosage=10 6.0PFU IN+IT
Table 10
2B33F revert strain
????ts(+)In?Vitro ?????????AGM Chimpanzee
????5a????4a????3b ?pp2??pp4??pp6??pp7 ??1A????3A????5A
Base number+
????M ????S?????S?????S ?S????S????S????S ??S?????S?????S
????4176,4200
????SH ????S?????S?????S ?S????S????S????S ??S?????S?????S
14 bases *
????L ????S?????S?????S ????2B????2B????2B ????S?????S?????S ????S?????S?????S ????S?????S?????S ?S????S????S????S ?2B???S????S????S ?S????S????S????S ?S????S????S????S ?S????S????S????S ??S?????S?????S ??ND????2B????2B ??S?????S?????S ??ND????S?????S ??S?????S?????S
????9560 ????9854 ????12187 ????14588 ????15072
Phenotype
Ts ca attenuation ????2B????2B????2B ????S?????S?????S ????r?????r?????r ?r????r????S????S ?2B???S????2B???S ?(r)??(r)??S????S ??2B????2B????2B ??ND????ND????ND ??ND????r?????r
+: the base number of these 2B33F revert strains M, SH and L gene is more bigger than 2B. *: base 4330,4410,4421,4443,4455,4485,4498,4506,4526,4527,4543,4562,4576,4599.The slight answer ND=that replys fully on answer moderate on the r=phenotype (r)=phenotype that the base 2B=that S=is identical with 2B33F is returned on 2B base or the phenotype does not carry out
Table 11
2B20L revert strain
TS (+) in-vitro separation thing
Base number+ R1???R2???R3A???R4A???R5A???R6A???67A???R8A???R9A???R10A
???L S????S????S?????S?????S?????S?????S?????S?????S?????S C *??S????ND????S?????S?????ND????S?????S?????S?????S S????S????S?????S?????S?????S?????S?????S?????S?????S S????S????2B????S?????2B????2B????S?????S?????2B????2B A *??A *??S?????A *???S?????S?????A *???A *???S?????S
???8964 ???13348 ???14588 ???14650 ???14651
Phenotype
The ts attenuation 2B???2B???ND????ND????ND????ND????ND????ND????2B????2B r????r????ND????ND????ND????ND????ND????ND????r?????r
+: the base number of these 2B20L revert strains L gene is more bigger than 2B.The base 2B=that S=is identical with 2B33F is returned to moderate answer on 2B base or the r=phenotype *=sequence change the ND=different with 2B or 2820L do not carry out
Table 12
RSV 2B, ts and revert strain: phenotype is summed up
Virus isolated strain The source External phenotype Attenuation in the body
??ts ????ca Cotton mouse ??AGM
?RSV?2B The wild-type parent strain ??- ????- ??- ???-
?RSV?2B33F Ca, ts sudden change separates self cooling 33 times the 2B of going down to posterity ??++++ ????++ ??++++ ???+++
?RSV?2B33F-5a ?TS(+) The 2B33F turn is gone down to posterity, at the plaque of 39 ℃ of pickings ??- ????++ ????++ ???+
?RSV?2B33F-4a ?TS(+) The 2B33F turn is gone down to posterity, at the plaque of 39 ℃ of pickings ??- ????++ ???++ ???ND
RSV?2B33F-3b TS(+) The 2B33F turn is gone down to posterity, at the plaque of 39 ℃ of pickings ???- ??++ ??++ ??ND
AGM?pp2 The AGM#A2 that 2B33F infects, d7 nose washing lotion is the phagocytosis of 32 ℃ of pickings ???+ ??- ??+++ ??ND
AGM?pp4 The AGM#A2 that 2B33F infects, d7 nose washing lotion is at the plaque of 32 ℃ of pickings ???+ ??++ ??+++ ??ND
AGM?pp6 The AGM#A4 that 2B33F infects, d12 nose washing lotion is at the plaque of 32 ℃ of pickings ???++++ ??- ??++++ ??ND
AGM?pp7 The AGM#A2 that 2B33F infects, d12 nose washing lotion is at the plaque of 32 ℃ of pickings ???++++ ??++ ??++++ ??ND
Chimpanzee pp1A The chimpanzee #1552 that 2B33F infects, d4 lavage of trachea liquid is at the plaque of 32 ℃ of pickings ???- ??ND ??ND ??ND
Chimpanzee pp3A The chimpanzee #1560 that 2B33F infects, d6 lavage of trachea liquid is at the plaque of 32 ℃ of pickings ???- ??ND ??++ ??ND
Chimpanzee pp5A The chimpanzee #1563 that 2B33F infects, d10 lavage of trachea liquid is at the plaque of 32 ℃ of pickings ???- ??ND ??++ ??ND
RSV?2B20L Ca, ts sudden change separates self cooling 20 times the 2B of going down to posterity ???- ??++ ??++++ ??++++
RSV?2B20L?R1 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ???- ??ND ??++ ??ND
RSV?2B20L?R2 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ???- ??ND ??++ ??ND
RSV?2B20L?R9 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ???- ??ND ??++ ??ND
RSV?2B20L R10 TS(+) The 2B20L turn is gone down to posterity, at the plaque of 39 ℃ of pickings ???- ??ND ??++ ??ND
ND=do not carry out-=the wild-type phenotype, that is, non temperature-sensibility, non-acclimatization to cold, non-attenuation+extremely ++ ++=temperature sensitivity, acclimatization to cold or attenuation level increase
Can draw several important insight according to above information:
A. shown in table 7 (2B33F) and table 8 (2B20L), the sequence that identifies in these two mutant strains changes quite few: the difference of RSV 2B33F and its parent RSV 2B is that two places in the 3 ' genomic promoter region change, the non-coding of M gene 5 ' two terminal places change, and sexually revise and the non-coding change in 1 place (poly-A primitive) at intragenic 4 places of the L of coding RNA RNA-dependent polysaccharase coding.In addition, through atlas analysis, the SH gene has 14 places separately and changes.RSV 2B20L and its parent RSV 2B are only variant on 7 nucleotide positions, and wherein 3 places and 2B33F virus is total, and this comprises that change of two places and the intragenic place of L coding in the 3 ' genomic promoter region change.The distinctive change of two 2B20L of place virus is through the coding region of atlas analysis at the L gene in addition.Identified the potential attenuation sudden change that is positioned at non-coding 3 ' genomic promoter region and RNA RNA-dependent pol gene.
B. in attenuated virus 2B33F and 2B20L strain, can identify intragenic two ts of the place sudden change of L:
(i) in 2B33F, cause L Argine Monohydrochloride 451 to change (9853 sudden changes of Nucleotide of Lys → Arg) (A → G) obvious and ts and attenuation phenotypic correlation.In 2B33F TS (+) the 5a strain only the answer of this position just cause recovering fully 39 ℃ of growths (table 9) and in animal the part of attenuation reply.The related of this and ts and attenuation phenotype obtains from chimpanzee with from isolating 6 kinds in addition " TS revert the strain fully " (4a of cell cultures, 3b, pp2,3A, 5a, 5A) the support of partial sequence analysis wherein has only Nucleotide 9853 sudden changes that answer (table 10-12) (note at 9853 places regressive isolate A GM (cercopithecus aethiops) taking place, part only takes place its ts phenotype replys) has taken place.(Lys → Arg) keep stable in cDNA infections clone construct promptly by stablizing this codon by inserting one second sudden change, reduces the possibility that it is returned to Lys whereby in described amino acid 451 sudden changes.
(ii) in 2B20L, cause that place coding sexually revises in the L albumen (amino acid 2,050, base 14, the 649 sudden change (A → G) it seems and ts and attenuation phenotypic correlation of Asn → Asp).In TS (+) revert strain, answer has all taken place this aspartic acid that is positioned at amino acid 2050 places, and (Asp → Asn), perhaps, (A → T) has become another amino acid (Asp → Val) (table 8,11) because the Nucleotide at Nucleotide 14,650 places replaces.Above suggestion is according to TS (+) being replied the complete sequence analysis of strain R1 and other several TS (+) being replied strain (R2, R4A, R7A, R8A) the partial sequence analysis (table 11) of selection area.In R1 revert strain, also found the sudden change of another relevant place, be positioned at (the amino acid/11 616Asn → Asp) of Nucleotide 13,347 places with above-mentioned answer.But, do not know what effect this sudden change has to the ts phenotype; The L gene of other revert strain is not order-checking fully as yet.
C.2B33F with the 2B20L strain the total sequence change in 3 places is arranged:
(i) all have in 2B33F and the 2B20L corresponding to amino acid 2029 and change (change (C → T) (table 7,8) of 14,587 place's Nucleotide of Thr → Ile).This " T " Nucleotide replaces to be found to be present in 10% the virus strain of the RSV2B for generations colony, and may obtain preferred in the attenuation process.In 2B33F and 2B20L virus, all do not find " C " base of wild-type.
(ii) in the 3 ' genomic promoter region of 2B33F and 2B20L, found the sudden change of two places: Nucleotide 4 (C → G) He among a succession of A in Nucleotide 6 to 11 places inserted an extra A (in anti-genomic information chain).When analyzing the sequence of selected TS (+) revert strain, find to have the sudden change of 3 places in 2B33F TS (+) 5a (table 7) and 2B20L TS (+) R1 (table 8) revert strain, to be retained.These non-codings that are retained, cis acting suddenly change still relevant with the viral attenuation of part.
Change the expression of using little replicon RSV-CAT system to carry out and show for analyzing these cis actings, as 2B when virus or 2B33F or 2B33F TS (+) provide auxiliary L gene function for generations (these viral N, P are identical with the M2 gene), in this vitro system, the Nucleotide of 3 ' genomic promoter 4 changes that (C → G) is the rise sudden change of a kind of transcribing/duplicate.
Also with the little replicon of this RSV-CAT system to the 3 ' genomic promoter of 2B33F and 2B for generations the subsidiary function that provides of virus or RSV2B33F or 2B33F TS (+) virus carried out complementation analysis.3 kinds of strains are all supported the transcribing/copy function of 3 ' genomic promoter mediation of 2B and 2B33F simultaneously.But 2B33F and 2B33F TS (+) virus are preferred their 2B33F 3 ' genomic promoter.This analysis clearly illustrates that in the attenuation process of vaccine, 3 ' genomic promoter is evolved jointly with RNA RNA-dependent pol gene.Proof clearly under the support of the transcribing of 37 ℃ of the little replicons of RSV-CAT/copy function, (Arg → answer Lys) has caused the answer of ts phenotype in the 2B33F mutant strain 5a to the proteic single amino acids 451 of the L that sequential analysis draws.2B33F virus does not provide subsidiary function for the little replicon of RSV-CAT (having 2B or 2B33F3 ' genomic promoter) at 37 ℃.
No matter d. phenotype how, the SH that bias is arranged that finds in the 2B33F is super, and sudden change is present in all 2B33F revert strains, but does not then have in the 2B20L of ts, ca and attenuation.So, do not have data this sudden change and any biology phenotypic correlation can be joined at present.
Another kind of wild-type RSV 18537 strains have also been accepted order-checking and have been compared with wild-type RSV 2B strain.Equally, in all above-mentioned Key residues positions, these two kinds of wild-type strains all are the same.2B is at the Thr at the codon ACA at Nucleotide 14568 to 14588 places coding L Argine Monohydrochloride 2090 places, and 18537 strains are at the Ile at codon ATT coded amino acid 2029 places at Nucleotide 14593 to 14595 places (compare with the 8502-8504 among the 2B, the initiator codon of L gene is positioned at Nucleotide 8509-8611 in 18537 strains).
Embodiment 4
Detect the PCR test of Measles virus
Patient of 21 years old because of continuous three weeks carrying out property dry cough, out of breath and high fever be admitted to hospital.After treating 7 days with clarithomycin, perhaps carrying out similar treatment with atovaquone, his symptom is not improved.And with the upper right abdomen pain of main suit, proving has not conformability to omeprazole and antacid.Relevant medical history of past comprise that the VIII factor lacks and this is admitted to hospital before 3-4 diagnose out the HIV infection.Before 1 year, he has accepted the booster immunization of a measles-mumps-rubella vaccine for college entrance.
Bronchoalveolar lavage and the transbronchial biopsy that carries out two days later of being admitted to hospital shows that anti-property hyperplasia and alveolar stave cell come off and chronic little inflammation.Microorganism is not all found in gram, argentiform or PAS dyeing.Thoracic CT scan shows at left lung base portion multiple pathology fusion tubercle is arranged.Although cause patient's HIV in late period pulmonary complication and used empirical antimicrobial drug that in order to prevent and treat opportunistic bacterium, mycobacterium and mycosis liquid the patient remains and maintains 39 ℃ high fever.Left side pleura generation sepage; The diagnostic pleurocentesis holds itself out to be exudative, but what is not diagnosed in others.The bronchoalveolar lavage that carries out after 3 weeks only shows alveolar tissue cell (wherein some has been full of iron content blood yellow), some lymphocytes and neutrophil cell.FITE, AFB and argentiform dyeing are still negative.
After 2 weeks, carried out the wedge excision of left lung by the small-sized throacotomy (minithoracotomy) under the CT guide.Many tissue slicies disclose the acute and chronic inflammation tuberal area with regional necrosis and fibrosis.Have many polykaryocytes, wherein some contains in the tenuigenin and intranuclear inclusion simultaneously, prompting Measles virus giant cell pneumonia.Bacterium, fungi, P.carinii and antiacid special stain for microorganism provide negative findings once more.Electron microscope microscopy to this lung biopsy slice shows particulate form and paramyxovirus, for example Measles virus unanimity.Measure the anti-measles IgM of the serum feminine gender of tiring through the solid phase ha test, IgM after this catches the immunity test result too.
After 2 weeks, rhesus monkey (RMK) the nephridial tissue culturing cell of having inoculated patient's lung biopsy material shows the characteristic cytopathy of viral infection of measles.And obtained using the confirmation of the immunofluorescent test of carrying out at the monoclonal antibody of Measles virus.According to this diagnosis, give oral sharp Barverine 1000mg B.I.D.14 days.Unfortunately, the patient is constantly worsened, and is finally dead after 2 months.
In order to prove conclusively the character of the Measles virus that exists in patient's body, will carry out reverse transcription and RCR amplification from the virus that infected tissue obtains, carry out sequential analysis then.With the isolating Measles virus of patient's lung biopsy inoculation RhMK, be interior 2 propagation that go down to posterity at continuous Vero (monkey kidney) tissue culture cells.According to the explanation of manufacturers, with TRIzol reagent (Life Technologies, Grand Island, total RNA of cells infected in NY) the extracting s-generation Vero cell.Equally, extracted total RNA from patient's lung biopsy material.Obtain measles virus vaccines strain (Moraten) (as a component of trivalent MMR the vaccine) (Attenuvax of the present U.S. use of unit price form TM, Merck, Sharpe , ﹠amp; Dohme).This virus goes down to posterity once in the Vero cell, then as mentioned above, and total RNA of extracting vaccine infection cell.
(Branchburg NJ) will above-mentioned each RNA preparation reverse transcription (RT) one-tenth cDNA for Perkin-Elmer/CetusRT-PCR test kit, Perkin-Elmer Cetus with six poly-primers and Maloney mouse leukovirus reverse transcriptase at random.Use Measles virus specificity oligodeoxynucleotide primer according to aforementioned Edmonston Measles virus sequences Design to carrying out PCR then, these cDNA increase.These PCR products have comprised across the genomic one group of overlapping DNA fragment of 15,894 Nucleotide Measles viruss of complete length.Adopt manufacturer (ABIPRISM 377 sequenators and ABI PRISM dna sequencing kit; Perkin-Elmer/Centus, FosterCity, CA) each PCR product of dideoxy terminator cycle sequencing direct analysis of Jian Liing and not cloning has been determined total genome sequence.The analysis that two chains of pcr amplified dna product have all been carried out is to eliminate possible sequencing error.
The nucleotide sequence that reaches the genomic selection area of Measles virus of sick lung tissue existence in patient's viral isolates is compared with the Moraten vaccine virus, and compared with the nucleotide sequence of other measles wild-type virus and vaccine strain.This sequential analysis shows identical with the Moraten vaccine strain, and does not show with in the past or the dependency of present popular wild virus or other Measles Vaccine strain.
Embodiment 5
Detect the ELISA of RSV
The ELISA test is used for detecting whether have RSV.Peptide is according to designing and select to all B subgroup virus strain or to the homology that each wild-type, vaccine or the B subgroup of replying RSV have a specific RSV sequence with described herein.Then, with these peptides and KLH coupling and be used for immunizing rabbit, to produce the polyclonal antibody of monospecific.Select these polyclonal antibodies, or, be used for " catching ELISA " then, detect whether there is RSV antigen polyclonal antibody and monoclonal antibody combination.
Reference
1.Kapikian,A.Z.,et?al., Am.J. Epidemol.,89,405-421(1969).
2.Chin,J.,et?al., Am.J.Epidemol.89,449-463(1969).
3.Fulginiti,V.A.,et?al., Am.J. Epidemol.89,435-448(1969).
4.Prince,G.A.,et?al., J.Virology57,721-728(1986).
5.Kim,H.W.,et?al., Pediatrics52,56-63(1973).
6.Hodes,D.S.,et?al., Proc.Soc.Exp. Biol.Med.145,1158-1164(1974).
7.Belshe, R.B., and Hissom, F.K., J.Med. Virol., 10, 235-242 (1982).
8.Black,F.L.,et?al., Am.J.Epidemiol.124,442-452(1986).
9.Lennon, J.L., and Black, F.L., J. Pediatrics, 108, 671-676 (1986).
10.Pabst,H.F.,et?al., Pediatr.Infect. Dis.J.11,525-529(1992).
11.Centers?for?Disease?Control, MMWR40,369-372(1991).
12.Centers?for?Disease?Control, MMWR41:S6,1-12(1992).
13.King,G.E.,et?al., Pediatr.Infect. Dis.J.10,883-887(1991).
14.Rota,J.S.,et?al., Virology188,135-142(1992).
15.Rota,J.S.,et?al., Virus?Res.31,317-330(1994).
16.Lamb, R.A., and Kolakosky, D., pp.1177-1204 of Vol. 1, Fields Virology, B.N. Fields, etal., Eds. (3rd ed., Raven Press, 1996).
17.Sidhu,M.S.,et?al., Virology193,50-65(1993).
18.Garcin,D.,et?al., EMBO?J.14,6087-6094(1995).
19.Radecke,F.,et?al., EMBO?J.14,5773-5783(1995).
20.Collins,P.L.,et?al., Proc.Natl.Acad. Sci.,USA92,11563-11567(1995).
21. disclosed european patent application No.702,085.
22. disclosed International Application No. WO 96/10400.
23.Baron, M.D., and Barrett, T., J. Virology, 71, 1265-1271 (1997).
24. disclosed International Application No. WO 97/06270.
25. U.S. Provisional Patent Application 60/047575.
26. disclosed International Application No. WO 97/12032.
27.Kato,A.,et?al., Genes?to?Cells1,569-579(1996).
28.Sidhu,M.S.,et?al., Virology208,800-807(1995).
29.Shaffer,M.F.,et?al., J.Immunol.41,241-256(1941).
30.Enders,J.F.,et?al., N.Engl.J.Med.263,153-159(1960).
31.Enders, J.F., and Peebles, M.E., Proc. Soc.Exp.Biol.Med., 86, 227-286 (1954).
32.Schwarz,A.J.F., Am.J.Dis.Child.103,216-219(1962).
33.Griffin, D.E., and Bellini, W.J., pp.1267-1312 of Vol.1, Fields Virology, B.N.Fields, etal., Eds. (3rd ed., Raven Press, 1996).
34.Birrer,M.J.,et?al., Viroloqy108,381-390(1981).
35.Birrer,M.J.,et?al., Nature293,67-69(1981).
36.Norby,E.,et?al.,p?p.?481-507, The Paramyxoviruses,D.Kingsbury,Ed.(Plenum?Press,1991).
37.Peebles,M.E.,pp.?427-456, The Paramyxoviruses,D.Kingsbury,Ed.(Plenum?Press,1991).
38.Egelman,E.H.,et?al., J.Virol.63,2233-2243(1989).
39.Udem,S.A.,et?al., J.Virol.Methods8,123-136(1984).
40.Udem, S.A., and Cook, K.A., J.Virol, 49, 57-65 (1984).
41.Moyer, S.A., and Horikami, S.M., pp. 249-274, The Paramyxoviruses, D.Kingsbury, Ed. (Plenum Press, 1991).
42.Blumberg,B.,et?al.,pp.?235-247, The?paramyxoviruses,D.Kingsbury,Ed.(Plenum?Press,1991).
43.Berrett,T.,et?al.,pp.83-102, The?Pramyxoviruses,D.Kingsbury,Ed.(Plenum?Press,1991).
44.Tordo,N.,et?al., Sem.in?Virology3,341-357(1992).
45.Cattaneo,R.,et?al., EMBO?J.6,681-688(1987).
46.Crowley,J.C.,et?al., Virology164,498-506(1988).
47.Banerjee, A.K., and Barik, S., et al., Virology, 188, 417-428 (1992).
48.Castaneda, S.J., and Wong, T.C., J. Virol., 63, 2977-2986 (1989).
49.Chan, J., et al., pp.221-231, Genetics and Pathogenicity of Negative Stranded Viruses, B.W.J.Mahy and D.Kolakofsky, Eds. (ElsevierBiomedical Press, 1989).
50.Blumberg,B.,et?al., Cell23,837-845(1981).
51.Blumberg,B.,et?al., Cell32,559-567(1983).
52.Kolakofsky, D., and Blumberg, B.M., pages 203-213, Virus Persistence, B.M.J.Mahy, etal., Eds. (Cambridge University Press, 1982).
53.Castaneda, S.J., and Wong, T.C., J. Virol., 64, 222-230 (1990).
54.Curran, J.A., and Kolakofsky, D., Virology, 182, 168-176 (1991).
55.Sidhu,M.S.,et?al., Virology193,66-72(1993).
56.Sidhu,M.S.,et?al., Virology202,631-641(1994).
57.Collins,P.L.,et?al.,pp.1205-1241of?Vol.1, Fields?Virology,B.N.Fields,et?al.,Eds.(3rd?ed.,Raven?PresB,1996).
58.Crookshanks, F.K., and Belshe, R.B., J. Med, Virol., 13,243-249 (1984).
59.Crookshanks-Newman, F.K., and Belshe, R.B., J.Med.Virol.., 18, 131-137 (1986).
60.Hall,S.L.,et?al., Virus?Res.22,173-184(1992).
61.Karron,R.A.,et?al., J.Inf.Dis.172,1445-1450(1995).
62.Anderson,L.J.,et?al., J.Infect.Dis.151,626-633(1985).
63.Collins, P.L., pp.103-162 of The Paramyxoviruses, D.W.Kingsbury, Ed. (Plenum Press, NY and London, 1991).
64.Sullender,W.M., J.Virology65,5425-5434(1991).
65.Lerch,R.A.,et?al., J.Virology64,5559-5569(1990).
66.Mallipeddi, S.K., and Samal, S.K., J. Gen Virol., 74, 2787-2791 (1993).
67.Johnson,P.R.,et?al., J.Virology61,3163-3166(1987).
68.Stott,E.J.,et?al., J.Virology61,3855-3861(1987).
69.Henderson,F.W.,et?al., N.Engl.J. Med.,300,530-534(1979).
70.Hall,S.L.,et?al., J.Infect.Dis.163,693-698(1991).
71.Mufson,M.A.,et?al., J.Gen.Virol.66,2111-2124(1985).
72.Glezen,W.P.,et?al., Am.J.Dis. Child.140,543-546(1986).
73.Hemming,V.G.,et?al., Clin.Microbiol. Res.,8,22-33(1995).
74.Collins,P.L.et.al.,pp.1313-1351of?vol.1, Fields?Virology,B.N.Fields,et?al.,Eds.(3rded.,Raven?Press,1996).
75.Ling, R., and Pringle, C.R., J.Gen. Virol., 70, 1427-1440 (1989).
76.Yu,Q.,et?al., J.Virology69,2412-2419(1995).
77.McIntosh, K., and Chanock, R.M., pp.1045-1072 of Virology, B.N.Fields, et al., Eds. (2nded., Raven Press, 1990).
78.Heminway,B.R.,et?al.,pp.167?ofAbstracts?of?the?IX?International?Congress?of?Virology,P17-2,(1993).
79.Mink,M.A.,et?al., Viroloqy185,615-624(1991).
80.Dickens,L.E.,et?al., J.Virology.52,364-369(1990).
81.Wagner, R.R., and Rose, J.K., pp. 1121-1135 of vol.1, Fields Virology, B.N.Fields, et al., Eds. (3rd ed., Raven Press, 1996).
82.Barik,S., J.Gen.Virol.74,485-490(1993).
83.Collins,P.L.,et?al.,pp.259-264?of Vaccines?93:modern?approaches?to?new?vaccines including?prevention?of?AIDS,F.Brown?et?al.,Eds.(Cold?Spring?Harbor?Laboratory?Press,NY,1993).
84.Kuo,L.,et?al., J.Virology.70,6892-6901(1996).
85.Huang, Y.T., and Wertz, G.W., J. Viroloogy, 43, 150-157 (1982).
86.Sambrook,J.,et?al., Molecular?Cloning: A?Laboratory?Manual,2nd?ed.,Cold?Spring?HarborLaboratory?Press,Cold?Spring?Harbor,N.Y.(1989).
87.Ray,R.,et?al., J.Virol.69,1959-1963(1995).
88.Ray,R.,et?al., J.Virol.70,580-584(1996).
89.Stokes,A.,et?al., Virus?Research30,43-52(1993).
90. U.S. Patent application No.08/059,444.
Sequence table (1) general information:
(i) applicant: Udem, Stephen A.
Sidhu,Mohinderjit?S.
Tatem,Joanne?M.
Murphy,Brian?R.
Randolph,Valerie?B.
(ii) denomination of invention: in mononegavirale virales (Order designated Mononegavirales) virus, cause 3 ' genomic promoter region of attenuation and the sudden change in the pol gene
(iii) sequence quantity: 79
(iv) contact address:
(A) address: American Home Products Corporation
(B) street: One Campus Drive
(C) city: Parsippany
(D) state: New Jersey
(E) country: United States
(F) postcode: 07054
(v) computer-reader form:
(A) media types: floppy disk
(B) computer: IBM PC compatible
(C) operating system: PC-DOS/MS-DOS
(D) software: PatentIn Release#1.0, Version#1.30
(vi) current application information:
(A) application number: US
(B) applying date:
(C) classification:
(viii) lawyer/proxy's information:
(A) name: Gordon, Alan M.
(B) accession designation number: 30,637
(C) reference/folder numbering: 33,294 PCT
(ix) telecommunication information:
(A) phone: 973/683-2157
(B) information of fax: 973/683-4117 (2) SEQ ID NO:1:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) collection of illustrative plates structure: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTG CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120 TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240 TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420 TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540 GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840 GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020 AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080 GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200 GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260 GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320 AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380 GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440 GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500 CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560 CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620 CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680 ACTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800 GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920 ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980 GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040 CGCGGCCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400 AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460 GGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520 TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640 GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820 CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060 CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120 CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360 CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600 TGCTGGGGGT TGTTGAGGGC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660 CCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AGAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960 GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020 GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AGATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380 AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500 GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560 GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA CACAAGGCCA 4620 CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680 TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740 ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800 GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860 ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920 CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGTTC 4980 CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040 AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160 GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220 ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280 CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340 CTCCTCCTTT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400 CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAGACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940 GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120 CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180 ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300 AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360 GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TCCACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600 TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720 GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780 TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080 ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140 CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200 TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320 TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTGTTTG TCATGTCTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440 CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620 CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA AGTCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980 TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040 AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100 GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160 AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220 CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520 GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640 GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC AACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820 ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880 TGTGGTTTAT TACGTTTACA GCCCAGGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120 ATAGGGCTGC TAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240 CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480 CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600 AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840 TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900 TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080 TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260 TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560 TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800 TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040 GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100 GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160 ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280 TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340 CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400 ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520 GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060 TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180 CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300 TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480 AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660 ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080 CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380 CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680 TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920 CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040 GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100 GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160 ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCCCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460 GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580 ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820 TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880 AGACCTTGCC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940 TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060 TATACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120 AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360 AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480 CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540 TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600 ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780 ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840 TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 2 of the message: ...
(i) sequence signature:
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:2:
Met?Asp?Ser?Leu?Ser?Val?Asn?Gln?Ile?Leu?Tyr?Pro?Glu?Val?His?Leu
1???????????????5???????????????????10??????????????????15
Asp?Ser?Pro?Ile?Val?Thr?Asn?Lys?Ile?Val?Ala?Ile?Leu?Glu?Tyr?Ala
20??????????????????25???????????????????30
Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40?????????????????45
Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60
Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro
65??????????????????70??????????????????75??????????????????80
Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95
Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105??????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Ile?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450????????????????455??????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Val?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Tyr?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865??????????????????870????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385????????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400????????????????1405Ala?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525?????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Arg?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650????????????????1655????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Asp?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asn?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?His
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys1985????????????????1990????????????????1995????????????????2000Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015Thr?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val??Leu?Ile?Asn?Cys?Gly
2020????????????????2025?????????????????2030Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met2065????????????????2070????????????????2075????????????????2080Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085????????????????2090????????????????2095Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110Asn?Arg?Lys?Leu?Ile?Asn?Lys?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????2135????????????????2140Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Val?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165????????????????2170????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:3:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: DNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTA CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTC 120 TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCGGGA GATTCCTCAA 240 TTACCACTCG ATCTAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420 TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CCAATATTTT TCACATGATG ATCCAAGTAG TAGTGATCAA TCCAGGTTCG 540 GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATCCTA GCTCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840 GGAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAT CTTTACCAGC 1020 AAATGGGGGA AACTGCACCA TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080 GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTCGA TCCAGCATAT TTCAGACTAG 1200 GGCAAGAGAT GGTGAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260 GTATCACTGC CGAAGATGCA AGGCTTGTTT CAGAGATCGC AATGCATACT ACAGAGGACA 1320 GGATCAGTAG AGCGGTTGGA CCCAGACAAT CCCAAGTGTC ATTCCTACAC GGTGATCAAA 1380 ATGAAAATGA GCTACCGAGA TGGGGGGGTA AGGAAGATAT GAGGGTCAAA CAGAGTCGGG 1440 GAGAAGCCAG AGAGAGCTAC AGAGAAACCA GGCCCAGCAG AGCAAGTGAC GCGAGAGCTA 1500 CCCATCCTCC AACCGACACA CCCTTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560 CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTCAGGCT GCAAGCCATG GCAGGAATCT 1620 CGGAAGAACA AGGCTCAGAC ACGGACACCC CTAGAGTGTA CAATGACAGA GATCTTCTAG 1680 ACTAGGTGCA AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCACCA ACCATCCACT CCCACGATTG 1800 GGGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920 ATATCAGACA ACCCAGGACA GGAGCGAGCC GCCTGCAAGG AAGAGAAGGC AAGCAGTCCG 1980 GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040 CGCGGTCAGG GATCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCTCAGGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400 AGAGGCAACA ACTTTCCAAA GCTTAGGAAA ACTCTCAATG TTCCCCCGCC CCCGGACCCT 2460 GGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520 TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640 GCCGTACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA TTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCACCAA GCTAGAATCA 2820 CTGCTGTTAT TGAAGGGGGA AGTTGAGTCA ATCAAGAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCTTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCTGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060 ATCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120 CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCGGA CACCGGCCCT 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGA TGACTCTCCT TGATGACATC AAAGGAGCCA ACGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360 CCAGTCGACC TAGCTAATAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGCGACA GAAAAGATGA ATGTTTTATG TACATGTTTC 3600 TGCTGGGGGT TGTTGAGGAC AGCGATCTCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660 CTCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAACCCGA AGAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACTCTC CTCATACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTGGATACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCAGATAA CGGGTATTAC ACCGTTCCTA 3960 GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020 GGATTGACAA GGCGATTGGC CATGGGAAGA TCATCGACAA TGCAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA AAGTGAAGTC TACTCTGCCG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAGAA GACCCTATGT TACCCACTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380 AAGAATTCCG CATTTACGAC GACGTTATCA TAAATGATGA CCAAGGATTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATGCCCGAA GACGACCCTC CTCACAATGA CAGCCAGAAG 4500 GCCCGGAAAA AAAGGCCCCC TCCGAAAGAC TCCACAGACC AAATGAGAGG CCAGCCAGCA 4560 GCTGACGGCA AGCACGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA CATAAGGCCA 4620 CCACCAGCCA TCCCAATCTG CATCCTCCTC GTAGGACCCC CGAGGACCAA CCCCCAAGGT 4680 TGCCCCCCAC CCAAACCACC AACCGCATCC CTACCACCCC CGGGAAAGAA ACCCCCAGCA 4740 ACTGGAAGAG CCCTTCCCCT TTCCCTCAAC ACAAGAACTC CACAACCGAA CCACACAAGC 4800 GACCGAGGTG ACCCAACCGC AGGCACCCGA CTCCCTAGAC AGATCCTCTC CCCCTGGCAA 4860 ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920 CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980 CCCCGGTGCC CACAGGCAGG CACACCAACC CCCGAACAGA CCCAGCACCC AGCCATCGAC 5040 AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CGACAACCAA ACCAGAACCC AGACCACCCT 5160 GGGCCACCAG TTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCTGCGC 5220 ACCCCAGCCC CGATCCGGCG GGCAGCCACC CAACCCTAAC CAGCACCCAA GAGCGATCCC 5280 CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340 CTCTTCCTCT TCTCGAAGGG ACTAAAAGAT CAATCCACCA CATCCGACGA CACTCAACTC 5400 CCCGTCCCTA AAGGAGACAC CGGGAATCCC GGAATTAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGTTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCAG GAGTAGTCCT GGCAGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 CTGAACTCTC AAGCCATCGA CAATCTGAGG GCAAGTCTGG AAACTACTAA TCAGGCAATT 5940 GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCTAGCTTA 6120 CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT CGGAGGAGAT 6180 ATCAATAAGG TGTTAGAAAA GCTCGGATAT AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300 AGTATAGCCT ACCCGACGCT GTCCGAGATC AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360 GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACGACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TCCACCAAGT CCTGTGCTCG TACACTCGTA TCTGGGTCTT TTGGGAACCG GTTCATTTTG 6600 TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720 GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780 TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTGTCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACAGGA 7080 ACATCGAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGGAAC ACAAATGTCC 7140 CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCATCCA GCATCAAGCC CACCTGAAAT 7200 TATCTCCGGC TCCCCTTTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320 TCCCAAGGGA AGTAGGATAG TTATCAACAG AGAACACCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTGTTCG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CAATTGCAGG 7440 CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TTGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620 CATCTCTGAC AAGATTAAAT TCCTTAACCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTGGA CTTGTATTTA AGTCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACTATGACAT CCCAGGGAAT GTACGGGGGA ACTTACCTAG TGGAAAAGCC 7980 TAATCTGAGC AGCAAAGGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040 AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATTTTGA 8100 GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160 AGCCCTTTGT CACGGGGGAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220 CAGCTTTCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CCCCTTCTCA ACGGATGACC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTAT CCCGACAACA AGAACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAGC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGAGTCTTGT CTGTTGATCT 8520 GAGTCTAACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAGTCCAA CCACAACAAT GAGTATTGGC TGACTATCCC 8640 GCCAATGAAG AACCTAGCCC TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC AACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATCCT 8820 ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880 TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAAG GGGATCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATAGCAG 9120 ATAGGGCTGC CAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240 CGCTATCTGT CAACCAGATC TTATACCCCG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ACAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480 CTCATATTCC ATATCCAAAC TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GAAATTCGCT GTACTCTAAA GTCAGTAATA 9600 AGGTTTTCCA ATGCTTGAGG GACACTAATT CACGGCTTGG TCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAATGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840 TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900 TAACATTTGA GCTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CTGCTATGAC CATTGATGCT AGATATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AATTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080 TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACGGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCACGAG TTAGTTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260 TACACCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC CTCAGGTGAA GGATTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560 TTGCTGGAGT GAAATTTGGC TGCTTCATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGACCCC CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGATA TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800 TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTGGCTG TCTCAGGAGT CCCTAAAGAT CTCAAAGAAA 11040 GTCACAGAGG GGGGCCAGTC CTAAAAACCT ACTCCCGAAG CCCAGCCCAC ACAAATACCA 11100 GGAACGTGAG GGCAGCAAAA GGGTTTATAG GGTTCCCTCA GATAATTCGG CAGGACCAAG 11160 ACACTAATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACAACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280 TAAATGAGAT TTACGGATTA CCCTCATTTT TTCAGTGGCT GCATAAGAGG CTTGAGACCT 11340 CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400 GCAAAGTCCC CAATGACCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATTTATA CCTGGCTGCT TATGAGAGCG 11520 GAGTAAGGAT TGCTTCATTA GTGCAAGGGG ACAATCAGAC CATAGCTGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCTTAC AACCTTAAGA AATGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ACATTGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTTTC ATCACATTTT TTTGTTTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAG ATTCTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCAGGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060 TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC GCATCACTGA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180 CACAGCAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTAA 12300 TCCACAGTCC AAACCCAATG TTAAAGGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGGGACTGGC AGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTA GATACCACAA 12480 AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTAACA GGAAGAAAGA 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCC CTAAGAAGCC 12660 ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CAAGGCAAAG GGCTAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080 CGACTAATTT AGCACATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCCG AGTGGCAAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTCCTA GGGTTGGGCG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CAATGATAGA TCATCCCAGG ATACCCAGCT 13380 CTCGCAAGCT AGAGCTGAGG GCAGAGCTGT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTAG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTCT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTTATAGAG CCAAGATTAT 13680 TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCACTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920 CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTTACA TTTCTTTTGT GTGAAAGTGA CGAGGATGTA GTACCGGACA 14040 GATTCGACAA CATCCAGGCA AAACACTTGT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100 GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160 ATATCAAGGC GGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTTCGGCG AGGATCGATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCCCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ATACAAGCAA GCACAATCTT CCCATTTCTG 14460 GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580 ACGGCTTATT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TCTCTGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAGGT GCTCTTTAAC GGGAGGCCCG AAGTCACATG GGTAGGCAGT GTAGATTGCT 14820 TCAATTACAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880 AGACCTTACC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940 TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060 TATACCCCAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120 AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGC 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAA CTGTGCAAAG 15360 AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAGGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480 CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGAATC ACTCGCAAAT 15540 TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600 ATCTCAAGTC CGGTTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAGTCAA GGAGACCAAG GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780 ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCCAGG TGGTTAGGCA TTATTTGTAA 15840 TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 4 Information: (I) SEQUENCE CHARACTERISTICS: ...
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity is molecule type (ii): protein (xi) sequence description: SEQ ID NO:4:Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu1 5 10 15Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20??????????????????25??????????????????30Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asn?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?G1u?Pro?Phe?Leu?Phe?Trp?phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Val?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Met?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Ala?His?Thr?Asn?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Ile?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asn?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Cys?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Trp?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Gln?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345???????????????1350????????????????1355?????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385????????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400????????????????1405Ala?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Arg?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650????????????????1655????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Asp?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Tyr?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asn?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?His
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950
Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965
Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980
Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys
1985????????????????1990????????????????1995????????????????2000
Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015
Thr?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Asn?Cys?Gly
2020????????????????2025????????????????2030
Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045
Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060
Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met
2065????????????????2070????????????????2075????????????????2080
Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085???????????????2090?????????????????2095
Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110
Asn?Arg?Lys?Leu?Ile?Asn?Lys?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125
Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????2135????????????????2140
Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Val?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165???????????????2170????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:5:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: DNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTA CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120 TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240 TTACCACTCG ATCCAGACTA CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTGTTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATCAGGCTG TTAGAGGTTG 420 TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAAGTAG TAGTGATCAA TCCAGGTCCG 540 GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATTCTA GCCCAAATTT GGGTCTTGCT CGCGAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TAGTTGGTGA ATTCAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGGACACCCG 840 GGAACAAACC AAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCCTA ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAT CTTTACCAGC 1020 AAATGGGAGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080 GTGCAGGATC ATACCCCCTG CTCTGGAGCT ATGCCATGGG AGTAGGGGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGTCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200 GGCAAGAGAT GGTGAGGAGG TCAGCTGGGA AAGTCAGTTC CACATTAGCA TCTGAACTCG 1260 GTATCACTGC TGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCACACT ACTGAGGACA 1320 GGACCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTGTC ATTTCTACAC GGTGATCAAA 1380 GTGAGAATGA GCTACCAGGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGGG 1440 GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGTCTAGCAG AGCAAGCGAT GCGAGAGCTG 1500 CCCATCTTCC AACCAGCGCA CCCCTAGACA TTGACACTGC ATCGGAGTCA GGCCAAGATC 1560 CGCAGGACAG TCGACGGTCA GCTGACGCCC TGCTCAGGCT GCAAGCCATG GCAGGAATCT 1620 TGGAAGAACA AGGCTCAGAC ACGGACACCC CTAGGGTGTA CAATGACAGA GATCTTCTAG 1680 ACTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCAACCA ACCATCCACT CCTACGACTG 1800 GGGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC GTCGAGGAAG CCATGGCAGC ATGGTCACAA 1920 ATATCAGACA ACCCAGGACA GGACCGAACC ACCCGCAAGG AAGAGGAGGC AGGCAGTTCG 1980 GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCAGTGC ACCTCGCATC 2040 TGCGGTCAGG GATCTGGAGA GAGCGATGAC AACGCTGAAA CTTTGGGAAT CCCCTCAAGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATCATG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTCT CAGGAGGAGA CGATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCT 2400 AGAGGCAACA ACTTCCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGAACCCC 2460 GGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGGA CAGACGCGAG ATTAGCCTCA 2520 TTTGGAGCGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CATCAGGGCC AGGTGCACCT GTGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640 GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA TTATTATGAT GATGAGCTGT TCTCCGATGT CCAAGACATC 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820 CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAAAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAACTCAATC CCGACCTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060 CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAA 3120 CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCC 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ACGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCTCATG 3360 CCAATCGACC TAATTAGTAC AGCCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTCGAT CGCTCCGATA CAACCTACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTACG TACATGTTTC 3600 TGCTGGGGGT TGTTGAGGAC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660 CCCTGCCCTT AGGTGTTGGT AGATCCACAG CAAAACCCGA AGAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTCGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTGGATACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960 GAAGAATGCT AGAATTCAGA TCGGTCAATG CAGTGGCTTT CAACCTGCTG GTGACCCTTA 4020 GGATTGACAA AGCGATTGGC CCTGGGAAGA TCATCGATAA TGCAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCTG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAAAA GACCTTATGT TACCCACTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCCC 4380 AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATACCCGAA AACGACCCCC CTCATAATGA CAGCCAGAAG 4500 GCCCGGACAA AAAAGCCCCC TCCAAAAGAC TCCACGGACC AAGTGAGAGG CCAGCCAGCA 4560 GCTGACGGCA AGCGTGAACA CCAGGCGGCC TGGGCACAGA ACAGCCCCGA CACAAGGCAA 4620 CCACCAGCCA TCCCAATCTG CGTCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGT 4680 CGCCCCCGAC CCAGACCACC AACCGCATCC CCACAGCCCC CGGGAAAGAG ACCCCCAGCA 4740 ACTGGAAGGC CCCTCCCCCT TTCCCTCAAC GCAAGAACTC CACAACCGAA CCGCACAAGC 4800 GATCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC CCCCCGGCAA 4860 ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCGAC AGAACCCAGA CCCCGGCCCA 4920 CGGCGCCGCG CCCCCACCTC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980 CCCCGGTGCC CACAGGCAGG CACACCAACC CTCGAACAGA CCCAGCACCC AGCCATCGAC 5040 AATTCAAGAC GGGGGGCCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CAGGAACCGA ACCAGAATCC AGACCACCCT 5160 GGGCCACCAG TTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220 ACCCCTGCCC TGATCCGGTG GGCGGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280 CGAAGGGCCC CCGAACCGCA AAAGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340 CTCCCCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAATTC 5400 CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGGAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCTG GAGTTGTCCT GGCGGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 TTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940 GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTAGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CACTATTTGG CCCCAGCTTA 6120 CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAT 6180 ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTACTC 6300 AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAAGGG 6360 GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TCCACCAAGT CCTGTGCTCG TACACTTGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600 TCACAAGGGA ATCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAGGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720 GTGGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGCGGTATCC GGACGCTGTG 6780 TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTTTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACAGGA 7080 ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140 CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCATCCA GCATCGAGCC CACCTGAAAT 7200 TGTCTCCGGA TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ACAACCCCCA 7320 TCCTAGGGGA AGTAGGATAG TTATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTATTCG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440 CATAAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 GATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACCGACC TAGTGAAATT 7620 CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAACTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGG CCAGGGTAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTGGA CTTGTATTTA AATCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACTATGACAT CCCAGGGAAT GTACGGGGGA ACTTACCTAG TGGAAAAGCC 7980 TAATCTGAGC AGTAAAGGGT CAGAGTTGTC ACAACTGAGC ATGCACCGAG TGTTTGAAGT 8040 AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATTTTGA 8100 GCAACCAGTC AGTAATGATT TCAGCAACTG CATGGTGGCT TTGGGGGAGC TCAAATTCGC 8160 AGCCCTTTGT CACAGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220 CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CCCCCTATCA ACGGATGATC CAGTGATAGA CAGGCTCTAC CTCTCATCTC ACAGAGGCGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGGACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAGC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTAATCT 8520 GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCAGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640 GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC TACCTCTTCA CTGTTCCAAT TAAGGAAGCA GGCGAGGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820 ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTATGAT ACTTCCAGAG TTGAACATGC 8880 TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAGG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGATATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACTCGG GAAGATGGAA CCAACCGCAG 9120 ATAGGGCTGC CAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240 CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ATAAGATAGT AGCTATCCTG GAGTATGCTC GAGTCCCTCA CGCATACAGC CTGGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGACCCACT 9480 CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CAAGGAAGAT CCGTGAGCTC CTCAAAAAGG GAAATTCGCT GTACTCCAAA GTCAGTGATA 9600 AGGTTTTCCA ATGCCTGAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAATT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAATGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CAGTATTCTT CACTGGTAGT TCAGTTGAGT 9840 TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900 TGACGTTTGA ACTGGTCTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CCGCTATGAC CATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTACCAAATT GTAGCCATGC 10080 TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCATGAG TTAATTGAAG CCCTAGATTA CATTTTCATA ACTGATGACA 10260 TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAAGGTCAT GCCATATTCT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCCCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560 TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGACCCT CCCAAAGGAA CTGGGTCACG GAGGCTTGTA AATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGACA TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800 TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AATTATTTTA AGGACAATGG GATGGCCAAG GACGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040 GTCACAGGGG GGGGCCAGTC TTAAAAACCC ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100 AGAACGTGAG AGCAGCAAAA GGGTTTATAG GATTCCCTCA TGTAATTCGG CAGGACCAAG 11160 ACACTGATCA TCCGGAGAAT ATGGAGGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTATTT GCACAAAGGC 11280 TAAATGAGAT TTACGGATTA CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAAACCT 11340 CTGTCCTCTA TGTAAGTGAC CCTCATTGCC CCCCTGACCT TGACGCCCAT GTCCCGTTAT 11400 GCAAAGTCCC CAATGACCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATTTATA CCTGGCTGCT TATGAGAGCG 11520 GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCTTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ACATAGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTCTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATCCTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGATCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATCGGGGGG ATGAATTATC 12060 TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC TCATCACTAA TGCCTGAAGA GACCCTTCAT CAAGTAATGA 12180 CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGCGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300 TCCATAGTCC AAACCCAATG TTAAAAGGGT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTA GATACCACAA 12480 AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTTAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCTAGAGCC CTAAGAAGCC 12660 ATATGTGGGC AAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GCCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAT GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CAAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080 CGACTAATTT AGCGCATAGG TTGAGGGATC GTACCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCCG AGTGGCAAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAGGG AATGCTTCTA GGGTTGGGTG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380 CCCGCAAGCT AGAGCTTAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTTATAGAG CCAAGATTAT 13680 TCACTATCTA CTTGGGCCAG TGTGCAGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCTTC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTACACA 13920 CAACTGTGTG CAACATGATT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTCACA TTTCTTCTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040 GATTCGACAA TATCCAGGCA AAACACTTGT GTGTTCTAGC AGATTTGTAC TGTCAACCAG 14100 GGACCTGCCC ACCAATTCGA GGTCTACGAC CTGTAGAGAA ATGTGCAGTT CTAACCGATC 14160 ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGGTCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTTG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCTCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460 GGGGTAATCT CGCCAATTAT GAAATCCACG CTTTCCGCAG AATCGGGTTA AACTCATCCG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580 ACGGCTTGTT CTTGGGTGAG GGGTCGGGTT CTATGTTGAT CACTTATAAG GAGATACTAA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGTC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820 TCAATTTCAT AGTCAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880 AGACCTTACC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTAGCAGCC ATCTTATCGA 14940 TGGCTCTGCT CCTTGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTTA TTATAGAGAA GTGAACCTTG 15060 TCTACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTAGTCATG ACAGATCTCA 15120 AAGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGC 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGACGCAGTT AGTAGAGGTG GTATCAACCC TATTCTGAAG AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAA CTGTGCAAAG 15360 AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAACTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCATG 15480 CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540 TTTGGGGGCA TATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATCGG TTTATCCAGA 15600 ATCTCAAGTC CGGTTACCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CTAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTAAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAATCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780 ATTAATTGGT TGGACTCCGG GACCCTAATC CTGCCCTAGG TAGTTAGGCA TTATTTGCAA 15840 TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 6 of the message: ...
(i) sequence signature:
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:6:Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu1 5 10 15Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20??????????????????25??????????????????30Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Thr?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Ile?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295??????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Ile?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465????????????????470??????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asn?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Met?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Asn?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?His?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Lys?Asn
610?????????????????615?????????????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?His?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Val705?????????????????710?????????????????715?????????????????720Pro?Leu?Cys?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ser?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125???????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Thr?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385????????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395?????????????????1400????????????????1405Ala?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Ile?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Arg?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650????????????????1655????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Asp?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asn?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?Tyr
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys1985????????????????1990????????????????1995????????????????2000Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Gly?Ile?Asn?Pro
2005????????????????2010????????????????2015Ile?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Asn?Cys?Gly
2020????????????????2025????????????????2030Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045
Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060
Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met
2065????????????????2070????????????????2075????????????????2080
Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085????????????????2090????????????????2095
Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100???????????????2105????????????????2110
Asn?Arg?Lys?Leu?Ile?Asn?Arg?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125
Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????2135????????????????2140
Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Ile?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165???????????????2170?????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:7:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: ACCAAACAAA GTTGGGTAAG GATAGATCAA TCAATGATCA TATTCTAGTA CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120 TGAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATTCCTGGA GATTCCTCAA 240 TTACCACTCG ATCCAGACTA CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATCAGGCTG TTAGAGGTTG 420 TTCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAAGCAG TAGTGATCAA TCCAGGTCCG 540 GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGATCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATTCTA GCCCAGATCT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC TTTACGCCGA TTCATGGTGG CTCTAATCCT GGATATCAAG AGGACACCCG 840 GGAACAAACC TAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCTTG ACTATTAAGT TTGGGATAGA AACTATGTATCCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAT CTTTACCAGC 1020 AAATGGGAGA AACTGCACCC TACATGGTAA TCCTAGAGAA CTCAATTCAG AACAAGTTCA 1080 GCGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGTCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200 GGCAAGAGAT GGTGAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCCGAACTCG 1260 GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320 GGATCAGTAG AGCGGTCGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380 GTGAGAATGA GCTACCAGGA TTGGGGGGCA AGGAAGACAG GAGGGTCAAA CAGAGTCGGG 1440 GAGAAGCCAG GGAGAGCTAC AGAGAAACCG AGTCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500 CCCATCCTCC AACCAGCATG CCCCTAGACA TTGACACTGC ATCGGAGTCA GGCCAAGATC 1560 CGCAGGACAG TCGAAGGTCA GCTGACGCTC TGCTCAGGCT GCAAGCCATG GCAGGAATCT 1620 TGGAAGAACA AGGCTCAGAC ACGGACACCC CTAGGGTATA CAATGACAGA GATCTTCTAG 1680 ATTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCAACCA ACCATCCACT CCCACGACTG 1800 GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC GTCGAGGAAG CCATGGCAGC ATGGTCAGAA 1920 ATATCAGACA ATCCAGGACA GGACCGAGCC GCCTGCAAGG AAGAGGAGGC AGGCAGTTCG 1980 GGTCTCAGCA AACCATGCTT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040 CGCGGTCAGG GATCTGGAGA AAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCTCAAGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATCATG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTCT CAGGAGGAGA CGATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAA ACTCCAATCC 2400 AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGAACCCC 2460 AGTAGGGCCA GCACTTCCGA GACACCCATT AAAAAGGGGA CAGACGCGAG ATTGGCCTCA 2520 TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CGTCAGGGCC AGATGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640 GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCCGATGT CCAAGACATC 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820 TTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TTGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAACTCAATC CCGACCTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AGCCCGTTGC CAGCCGACAA 3060 CTCCAGGGAA TGACTAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAA 3120 CTAAAGCCGA TCGGGAAAAA GGTGAGCTCA GCCGTCGGGT TTGTCCCTGA CACCGGCCCT 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGT TGACTCTCCT TGATGATATC AAAGGAGCCA ACGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360 CCAGTCGACC TAATTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCTAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTCGAT CGCTCCGATA CAACCTACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGTGATA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600 TGCTGGGGGT TGTTGAGGAC AGAGATCCCC TAGGGCCTCC AATCGGGCGA GCATTCGGGT 3660 CCCTGCCCTT AGGTGTTGGT AGATCCACAG CAAAACCCGA GGAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACCCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAATGCAAA CCAAGTGTGC AATGCGGTTA ATCTAATACC GCTGGACACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCCA 3960 GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTA GTGACCCTCA 4020 GGATTGACAA GGCGATTGGC CCTGGGAAGA TCATCGACAA TGCAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAGAA GACCTTATGT TACCCACTGA TGGATATCAA TGAAGACCTT AATCGGTTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380 AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATACCCGAA AACGACCCCC CTCATAATGA CAGCCAGAAG 4500 GCCCGGACAA AAAAGCCCCC TCCAAAAGAC TTCACGGACC AAGCGAGAGG CCAGCCAGCA 4560 GCCGACAGCA AGTGTGGACA CCAGGCGGCC CAAGCACAGA ACAGCCCCGA CACAAGGCCA 4620 CCACCAGCCA TCCCAATCCG CGTCCTCCTC GTAGGACCCC CGAGGACCAA CCCCCAAGGT 4680 CGCTCCGGAC ACAGACCACC AGCCGCATCC CCACAGCCCT CGGGAAAGGA ACCCCCAGCA 4740 ACTGGAAGGC CCCTTCCCCC CTCCCCCAAC GCAAGAACCC CACAACCGAA CCGCACAAGC 4800 GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGACCCTCCC TCCCCGGCAT 4860 ACTAAACAAA ACTTAGGGCC AAGGAACACA CACACCCGAC AGAACCCAGA CCCCGGCCCG 4920 CGGCACCGCG CCCCCACCCC CCGAAAACCA GAGGGAGCCC CCAACCAATC CCGCCGCCCC 4980 CCCCGGTGCC CACAGGTAGG CACACCAACC CCCGAACAGA CCCAGCACCC AGCCACCGAC 5040 AATCCAAGAC GGGGGGCCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCATCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAGCCC AGACCACCCT 5160 GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGAAAAAA GGAAAGGCCA CAACCCGCGC 5220 ACCCCAGGCC CGATCCGGCG GGAAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280 TGGGGGACCC CCAAACCGCA AAAGACATCA GTATCCCACC GCCTCTCCAA GTCCCCCGGT 5340 CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CATCCGACGA CACTCAATTC 5400 CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAATGTCTT TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGGAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTGG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAAATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTTGTCCT GGCAGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 CTGAACTCTC AAGCCATCGA CAATCTGAGA GCAAGCCTGG AAACTACTAA TCAGGCAATT 5940 GAGGCAATCA GGCAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTAGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120 CGGGACCCCA TATCTGCGGA GATATCCATC CAGGCTTTGA GCTATGCGCT TGGGGGAGAT 6180 ATCAATAAGG TATTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300 AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360 GTCTCGTACA ATATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TCCACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600 TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTCT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720 GTGGTCGAGG TGAACGGTGT GACCATCCAA GTCGGGAGCA GGAGGTATCC GGACGCGGTG 6780 TACCTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAAGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGCTGGAG GATGCCAAGG AATTGCTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTTTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GGGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACAGGG 7080 ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCCCTACAA CTCTTGAAAC ACAGATTTCC 7140 CACAAGTCTC CTCTCCGTCA TCAAGCAACC ACCGCATCCA GCATCAAGGC CACCCGAAAT 7200 TGTCTCCGGC TTCCCTCTGG CCGAACGATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC ACCGAGACCG AATAAATGCC TTCTACAAAG ACAACCCCCA 7320 TCCTAAGGGA AGTAGGATAG TTATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTATTCG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440 CATTAGACTC CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAGAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 GATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620 CATCTCTGAC AAAATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAACTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGG CCAGGGCAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTGGA CTTGTATTTA AGTCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACCATGACAT CCCAGGGAAT GTACGGGGGA ACTTACCTAG TGGGAAAGCC 7980 TAATCTGAGC AGTAAAGGGT CAGAGTTGTC ACAACTGAGC ATGCACCGAG TGTTTGAAGT 8040 AGGGGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATTTTGA 8100 GCAACCAGTC AGTAATGATT TCAGCAACTG CATGGTGGCT TTGGGGGAGC TCAGGTTCGC 8160 AGCCCTCTGT CACAGGGAAG ATTCTGTCAC GGTTCCCTAT CAGGGGTCAG GGAAAGGTGT 8220 CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CCCCCTATCA ACGGATGATC CAGTGATAGA TAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGGACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAGC AGGCGTGTAA GGGTAAAAAC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTAATCT 8520 GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCAGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAAACCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640 GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC AACCTCTTCA CTGTTCCAAT CAAGGAAGCA GGCGAGGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTAATTCT 8820 ACCTGGTCAG GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880 TGTGGTTTAT TATGTTTACA GCCCAGGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAAG GGGGTCCCAA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA TTCAGAATCT GGTGGACATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACTCGG GAAGATGGAA CCAATCGCAG 9120 ATAGGGCTGC CAGTGAACCG ATCACATGAT GTCACTCAGA CACCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTTCCC GTCATGGACT 9240 CGCTATCTGT CAACCAGATC TTGTACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ATAAGATAGT AGCTATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTTGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTCTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480 CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CAAGGAAGAT CCGTGAGCTC CTAAAAAAGG GAAATTCGCT GTACTCCAAA GTCAGTGATA 9600 AGGTTTTCCA ATGCCTGAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAATT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAATGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGC 9840 TGTTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAGGA GTCTCAACAT GTATATTACC 9900 TGACGTTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CCGCTATGAC CATTGATGCT AGGTATGCAG AACTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCTATGC 10080 TGGAGCCACT TTCACTTGCT TACCTGCAAC TGAGGGACAT AACAGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCATGAG TTAATTGAAG CCTTAGATTA CATTTTCATA ACTGATGACA 10260 TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTCAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAGGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCCCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAGATCAT 10560 TTGCTGGAGT GAGATTTGGC TGTTTTATGC CTCTTAGCCT GGACAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGATCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGATA TGATAATGTA TGTCGTAAGT GGAGCCTACC 10800 TCCATGACCC TGAGTTCAAT CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT CGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATC GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AAGTATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTGGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040 GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100 GGAACGTTAA AGCAGAAAAA GGGTTTGTAG GATTCCCTCA TGTAATTCGG CAGAATCAAG 11160 ACACTGATCA TCCGGAGAAT ATAGAAACCT ACGAGACAGT CAGCGCATTT ATCACGACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTATTT GCACAGAGGC 11280 TAAATGAGAT TTACGGATTA CCCTCATTTT TTCAGTGGCT GCATAAGAGG CTTGAAACCT 11340 CTGTCCTCTA TGTAAGTGAT CCTCATTGCC CCCCCGACCT TGACGCCCAT GTCCCGTTAT 11400 GCAAAGTCCC CAATGACCAA ATCTTCATCA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTACTTATA CCTGGCTGCT TATGAGAGCG 11520 GGGTAAGGAT TGCCTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCTTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ACATTGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATTGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA TCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTTTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGAGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGATCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060 TGAACATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC GCATCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180 CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGCGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTAA 12300 TCCATAGTCC AAACCCAATG TTAAAAGGGT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGAGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTA GATACCACAA 12480 AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTTAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCTAGAGCC CTAAGAAGCC 12660 ATATGTGGGC AAGACTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTA GATCCTTGCG ATCTGCCGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CAAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCGACTT 13080 CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCAG AGTGGCAAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA AGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACTGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380 CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTCT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTTATAGAG CCAAGATTAT 13680 TCACCATCTA CTTGGGCCAG TGTGCAGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCTTC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920 CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTTGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTCACA TTTCTTTTGT GTGAAAGCGA TGAGGATGTA GTACCGGACA 14040 GATTCGACAA CATCCAGGCA AAACACTTGT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100 GGACCTGCCC ACCGATTCGA GGTCTAAGGC CGGTAGAGAA ATGTGCAGTT CTAACCGATC 14160 ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGTCG AGGATCTATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTTG ATGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGGTCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGATTTCAGA CCTCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460 GGGGTAGTCT TGCCAATlAT GAAATCCATG CTTTCCGCAG AATCGGGTTA AACTCATCTG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAAG 14580 ACGGCTTGTT CTTGGGTGAG GGGTCGGGTT CTATGTTGAT CACTTATAAG GAGATACTAA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAGGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT ATAGATTGCT 14820 TCAATTTCAT AGTCAGTAAT ATCCCTACCT CTAGTGTGGG ATTTATCCAT TCAGATATAG 14880 AGACCTTACC CAACAAAGAT ACTATAGAGA AGTTAGAGGA ATTGGCAGCC ATCTTATCGA 14940 TGGCTCTACT CCTTGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGCTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060 TCTACCCTAG GTACAGCAAC TTCATATCTA CTGAATCTTA TTTAGTTATG ACAGATCTCA 15120 AAGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGC 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCTATCAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGGCGCAGTT AGTAGAGGTG ATATCAACCC TATTCTGAAA AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AGTTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360 AATTAATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAACTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480 CTTACCCCGT ATTGGTAAGT AGTAGGCAAC GAGAACTTGT ATCTAGGATC ACTCGCAAAT 15540 TTTGGGGGCA TATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATCGG TTTATCCAGA 15600 ATCTCAAGTC CGGTTATCTA ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTAAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGCGCT CTGATTAAGG 15780 ATTAATTGGT TGAACTCCGG AACCCTAATC CTACCCTAGG TAGTTAGGCA TTATTTGCAA 15840 TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 8 information about: ...
(i) sequence signature:
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:8:
Met?Asp?Ser?Leu?Ser?Val?Asn?Gln?Ile?Leu?Tyr?Pro?Glu?Val?His?Leu
1???????????????5???????????????????10??????????????????15
Asp?Ser?Pro?Ile?Val?Thr?Asn?Lys?Ile?Val?Ala?Ile?Leu?Glu?Tyr?Ala
20??????????????????25??????????????????30
Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45
Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60
Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val??Phe?Gln?Cys?Leu
115?????????????????120??????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Ile?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Ala?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265??????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295??????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Ile?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????390?????????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Arg?Ser?Phe?Ala?Gly?Val?Arg?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Met?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Lys?Ala?Glu?Lys?Gly?Phe?Val?Gly?Phe?Pro?His?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asn?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Ile?Glu?Thr?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Val705?????????????????710?????????????????715?????????????????720Pro?Leu?Cys?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010???????????????1015?????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Arg?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385????????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400????????????????1405Ala?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Arg?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650????????????????1655????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Val?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Asp?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Ser?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Ile?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asn?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?His
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys1985????????????????1990????????????????1995????????????????2000Ile?Gln?Ala?Ile?Val?Gly?Gly?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015Ile?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Ser?Cys?Gly
2020????????????????2025????????????????2030Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met2065????????????????2070????????????????2075????????????????2080Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Val
2085????????????????2090????????????????2095Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110Asn?Arg?Lys?Leu?Ile?Asn?Arg?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125
Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????2135????????????????2140
Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Val?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165????????????????2170????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:9:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TCTTCTAGTG CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120 TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240 TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420 TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540 GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TGGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840 GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020 AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080 GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200 GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260 GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320 AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380 GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440 GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500 CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC AACGGAGTCC AGCCAAGATC 1560 CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620 CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680 ACTAGGTGCG AGAGGCCGAG GGCCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800 GAGCCAATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920 ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980 GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040 CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTACG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400 AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460 GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520 TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640 GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820 CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060 CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120 CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360 CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCCAAG TTCCACAATG ACAGAGACCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600 TGCTGGGGGT TGTTGAGGAC AGCGATTCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660 CCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960 GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020 GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380 AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500 GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560 GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCCGA CACAAGGCCA 4620 CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680 TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740 ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800 GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860 ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920 CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980 CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040 AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160 GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220 ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280 CGAAGGACCC CCGAACCGCA AAGGACACCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340 CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400 CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940 GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGTTTA 6120 CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180 ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCGGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300 AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360 GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TACACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600 TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720 GTAGTCGAGG TGAACGGCGT GATCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780 TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080 ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140 CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200 TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320 TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440 CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620 CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980 TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040 AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100 GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160 AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220 CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520 GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640 GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC TACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820 ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880 TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120 ATAGGGCTGC TAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240 CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480 CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600 AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840 TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900 TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080 TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260 TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560 TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800 TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040 GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100 GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160 ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280 TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340 CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400 ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520 GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060 TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180 CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300 TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480 AAGGCTTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGC 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660 ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080 CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380 CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680 TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920 CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040 GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100 GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160 ATATCAAGGC AGAGGCTATG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460 GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580 ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820 TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880 AGACCTTGCC TGACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940 TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060 TATACCCTAG ATACAGCAAC TTCATCTCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120 AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360 AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480 CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540 TCTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600 ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780 ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840 TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 10 information about: (I) SEQUENCE CHARACTERISTICS: ...
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity is molecule type (ii): protein (xi) sequence description: SEQ ID NO:10:Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu1 5 10 15Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20??????????????????25??????????????????30Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55???????????????????60Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Ile?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Val?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
5l5?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Tyr?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015???????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385????????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400???????????????1405Ala?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val??Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525?????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Met?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650????????????????1655????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Ala?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asp?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?His
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965
Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980
Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys
1985????????????????1990????????????????1995????????????????2000
Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015
Thr?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Asn?Cys?Gly
2020????????????????????2025????????????????2030
Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045
Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060
Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met
2065????????????????2070????????????????2075????????????????2080
Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085????????????????2090????????????????2095
Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110
Asn?Arg?Lys?Leu?Ile?Asn?Lys?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125
Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????2135????????????????2140
Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Val?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165????????????????2170????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:11:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TCTTCTAGTG CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120 TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240 TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTGAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420 TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540 GATGGTTCGG GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840 GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020 AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080 GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200 GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260 GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320 AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380 GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440 GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500 CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC AACGGAGTCC AGCCAAGATC 1560 CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620 CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680 ACTAGGTGCG AGAGGCCGAG GGCCAGAACA ACATCCGCCT ACCATCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800 GAGCCAATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920 ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980 GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040 CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTACG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400 AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460 GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520 TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640 GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820 CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060 CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120 CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360 CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCCAAG GTCCACAATG ACAGAGACCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTCGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600 TGCTGGGGGT TGTTGAGGAC AGCGATTCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660 TCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960 GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020 GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380 AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500 GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560 GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA CACAAGGCCA 4620 CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680 TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740 ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800 GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860 ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGTCCA 4920 CGGTGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980 CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040 AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160 GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220 ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280 CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340 CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400 CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940 GAGACAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGTTTA 6120 CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180 ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCGGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300 AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360 GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TACACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600 TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720 GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780 TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080 ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140 CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200 TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATCAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320 TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440 CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620 AATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980 TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040 AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100 GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160 AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220 CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520 GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640 GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC TACCTCTTCA CTGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820 ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880 TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120 ATAGGGCTGC TAGTGAACCA ATCACATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240 CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480 CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600 AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840 TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900 TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080 TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCATGAG TTAACTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260 TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560 TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800 TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040 GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100 GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160 ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280 TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340 CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400 ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520 GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060 TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180 CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300 TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480 AAGGCTTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660 ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080 CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380 CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATGCA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680 TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920 CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040 GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100 GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160 ATATCAAGGC AGAGGCTATG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460 GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580 ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAA GAGATACTTA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820 TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880 AGACCTTGCC TGACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940 TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTCA TTATAGAGAA GTGAACCTTG 15060 TATACCCTAG ATACAGCAAC TTCATCTCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120 AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360 AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480 CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540 TCTGGGGGCA CATTCTTCTT TACTCCGGGA ACAAAAAGTT GATAAATAAG TTTATCCAGA 15600 ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780 ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840 TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 12 information about: ...
(i) sequence signature:
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:12:
Met?Asp?Ser?Leu?Ser?Val?Asn?Gln?Ile?Leu?Tyr?Pro?Glu?Val?His?Leu1???????????????5???????????????????10??????????????????15Asp?Ser?Pro?Ile?Val?Thr?Asn?Lys?Ile?Val?Ala?Ile?Leu?Glu?Tyr?Ala
20??????????????????25??????????????????30Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Thr?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Val?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Tyr?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030???????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090???????????????1095?????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105???????????????1110?????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175???????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195???????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1150????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385???????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400????????????????1405Ala?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????16l0????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Met?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650???????????????1655?????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Ala?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asp?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?His
1925????????????????1930???????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys1985????????????????1990????????????????1995????????????????2000Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015Thr?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Asn?Cys?Gly
2020????????????????????2025????????????????2030Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????????2040????????????????2045Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060
Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met
2065????????????????2070????????????????2075????????????????2080
Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085????????????????2090????????????????2095
Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110
Asn?Lys?Lys?Leu?Ile?Asn?Lys?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125
Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????????2135????????????????2140
Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Val?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165????????????????2170????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:13:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TTTTCTAGTG CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAAGGAT ATCCGAGATG GCCACACTTT 120 TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240 TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420 TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CCAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540 GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATCCTA GCTCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840 GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020 AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080 GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200 GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260 GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320 AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380 GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440 GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500 CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560 CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620 CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680 ACTAGGTGCG AGAGGCCGAG GGCCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800 GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920 ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980 GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040 CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTCT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400 AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC TCCGGACCCC 2460 GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520 TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTG TGTGAGCAAT 2640 GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820 CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060 CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120 CTAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360 CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTTGAT CGCTCCGATA CAACCCACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTTTC 3600 TGCTGGGGGT TGTTGAGGAC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660 CCCTGCCCTT AGGTGTTGGC AAATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAACGCAAA CCAAGTGTGC AGTGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960 GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020 GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAGAA GACCTTATGT TACCCGCTGA TAGATATCAA TGAAGACCTT AATCGATTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380 AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500 GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACGGACC AAGCGAGAGG CCAGCCAGCA 4560 GCCGACGGCA AGCGCGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA TACAAGGCCA 4620 CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680 TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740 ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800 GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860 ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920 CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980 CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCC AACCATCGAC 5040 AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160 GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220 ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280 CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340 CTCCTCCCCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400 CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940 GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120 CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180 ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300 AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360 GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TCCACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600 TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TCACTGCCCG 6720 GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGATGCTGTG 6780 TACTTGCACA GAATTGACCT CGGTCCTCCC ATATCATTGG AGAGGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTAGCA TAGTCTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080 ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140 CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200 TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320 TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440 CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620 CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980 TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040 AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100 GCAACCAGCC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160 AGCCCTTTGT CACGGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220 CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CCCCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520 GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640 GCCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC TACCTCTTCA ATGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820 ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880 TGTGGTTTAT TACGTTTACA GCCCAGGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120 ATAGGGCTGC TAGTGAACCA ATCTCATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240 CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480 CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600 AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840 TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900 TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080 TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260 TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560 TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800 TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040 GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100 GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160 ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280 TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340 CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400 ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520 GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060 TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180 CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300 TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480 AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660 ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080 CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380 CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATACA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680 TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920 CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040 GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100 GGACCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160 ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460 GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580 ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820 TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880 AGACCTTGCC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940 TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCCCA TTATAGAGAA GTGAACCTTG 15060 TATACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120 AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360 AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCAAAGAAG TCAACAAGGG ATGTTCCACG 15480 CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540 TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600 ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780 ACTAATTGGT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840 TAGATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 14 information about: ...
(i) sequence signature:
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:14:
Met?Asp?Ser?Leu?Ser?Val?Asn?Gln?Ile?Leu?Tyr?Pro?Glu?Val?His?Leu
1???????????????5???????????????????10??????????????????15
Asp?Ser?Pro?Ile?Val?Thr?Asn?Lys?Ile?Val?Ala?Ile?Leu?Glu?Tyr?Ala
20??????????????????25??????????????????30
Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45
Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60
Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro
65??????????????????70??????????????????75??????????????????80
Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85?????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Ile?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Val?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665??????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Tyr?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????8l5His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080?????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385????????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400????????????????1405Thr?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435???????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Thr?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Arg?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650????????????????1655????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Ala?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asn?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?His
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975????????????????1980Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys1985????????????????1990????????????????1995????????????????2000Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015Thr?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Asn?Cys?Gly
2020????????????????2025????????????????2030Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Gln?Arg?Ser?Gln?Gln?Gly?Met2065????????????????2070????????????????2075????????????????2080Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085????????????????2090????????????????2095Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110Asn?Arg?Lys?Leu?Ile?Asn?Lys?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????2135????????????????2140
Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Val?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165????????????????2170????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:15:
(i) sequence signature:
(A) length: 15894 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: ACCAAACAAA GTTGGGTAAG GATAGTTCAA TCAATGATCA TCTTCTAGTG CACTTAGGAT 60 TCAAGATCCT ATTATCAGGG ACAAGAGCAG GATTAGGGAT ATCCGAGATG GCCACACTTT 120 TAAGGAGCTT AGCATTGTTC AAAAGAAACA AGGACAAACC ACCCATTACA TCAGGATCCG 180 GTGGAGCCAT CAGAGGAATC AAACACATTA TTATAGTACC AATCCCTGGA GATTCCTCAA 240 TTACCACTCG ATCCAGACTT CTGGACCGGT TGGTCAGGTT AATTGGAAAC CCGGATGTGA 300 GCGGGCCCAA ACTAACAGGG GCACTAATAG GTATATTATC CTTATTTGTG GAGTCTCCAG 360 GTCAATTGAT TCAGAGGATC ACCGATGACC CTGACGTTAG CATAAGGCTG TTAGAGGTTG 420 TCCAGAGTGA CCAGTCACAA TCTGGCCTTA CCTTCGCATC AAGAGGTACC AACATGGAGG 480 ATGAGGCGGA CAAATACTTT TCACATGATG ATCCAATTAG TAGTGATCAA TCCAGGTTCG 540 GATGGTTCGA GAACAAGGAA ATCTCAGATA TTGAAGTGCA AGACCCTGAG GGATTCAACA 600 TGATTCTGGG TACCATCCTA GCCCAAATTT GGGTCTTGCT CGCAAAGGCG GTTACGGCCC 660 CAGACACGGC AGCTGATTCG GAGCTAAGAA GGTGGATAAA GTACACCCAA CAAAGAAGGG 720 TAGTTGGTGA ATTTAGATTG GAGAGAAAAT GGTTGGATGT GGTGAGGAAC AGGATTGCCG 780 AGGACCTCTC CTTACGCCGA TTCATGGTCG CTCTAATCCT GGATATCAAG AGAACACCCG 840 GAAACAAACC CAGGATTGCT GAAATGATAT GTGACATTGA TACATATATC GTAGAGGCAG 900 GATTAGCCAG TTTTATCCTG ACTATTAAGT TTGGGATAGA AACTATGTAT CCTGCTCTTG 960 GACTGCATGA ATTTGCTGGT GAGTTATCCA CACTTGAGTC CTTGATGAAC CTTTACCAGC 1020 AAATGGGGGA AACTGCACCC TACATGGTAA TCCTGGAGAA CTCAATTCAG AACAAGTTCA 1080 GTGCAGGATC ATACCCTCTG CTCTGGAGCT ATGCCATGGG AGTAGGAGTG GAACTTGAAA 1140 ACTCCATGGG AGGTTTGAAC TTTGGCCGAT CTTACTTTGA TCCAGCATAT TTTAGATTAG 1200 GGCAAGAGAT GGTAAGGAGG TCAGCTGGAA AGGTCAGTTC CACATTGGCA TCTGAACTCG 1260 GTATCACTGC CGAGGATGCA AGGCTTGTTT CAGAGATTGC AATGCATACT ACTGAGGACA 1320 AGATCAGTAG AGCGGTTGGA CCCAGACAAG CCCAAGTATC ATTTCTACAC GGTGATCAAA 1380 GTGAGAATGA GCTACCGAGA TTGGGGGGCA AGGAAGATAG GAGGGTCAAA CAGAGTCGAG 1440 GAGAAGCCAG GGAGAGCTAC AGAGAAACCG GGCCCAGCAG AGCAAGTGAT GCGAGAGCTG 1500 CCCATCTTCC AACCGGCACA CCCCTAGACA TTGACACTGC ATCGGAGTCC AGCCAAGATC 1560 CGCAGGACAG TCGAAGGTCA GCTGACGCCC TGCTTAGGCT GCAAGCCATG GCAGGAATCT 1620 CGGAAGAACA AGGCTCAGAC ACGGACACCC CTATAGTGTA CAATGACAGA AATCTTCTAG 1680 ACTAGGTGCG AGAGGCCGAG GACCAGAACA ACATCCGCCT ACCCTCCATC ATTGTTATAA 1740 AAAACTTAGG AACCAGGTCC ACACAGCCGC CAGCCCATCA ACCATCCACT CCCACGATTG 1800 GAGCCGATGG CAGAAGAGCA GGCACGCCAT GTCAAAAACG GACTGGAATG CATCCGGGCT 1860 CTCAAGGCCG AGCCCATCGG CTCACTGGCC ATCGAGGAAG CTATGGCAGC ATGGTCAGAA 1920 ATATCAGACA ACCCAGGACA GGAGCGAGCC ACCTGCAGGG AAGAGAAGGC AGGCAGTTCG 1980 GGTCTCAGCA AACCATGCCT CTCAGCAATT GGATCAACTG AAGGCGGTGC ACCTCGCATC 2040 CGCGGTCAGG GACCTGGAGA GAGCGATGAC GACGCTGAAA CTTTGGGAAT CCCCCCAAGA 2100 AATCTCCAGG CATCAAGCAC TGGGTTACAG TGTTATTATG TTTATGATCA CAGCGGTGAA 2160 GCGGTTAAGG GAATCCAAGA TGCTGACTCT ATCATGGTTC AATCAGGCCT TGATGGTGAT 2220 AGCACCCTAT CAGGAGGAGA CAATGAATCT GAAAACAGCG ATGTGGATAT TGGCGAACCT 2280 GATACCGAGG GATATGCTAT CACTGACCGG GGATCTGCTC CCATCTCTAT GGGGTTCAGG 2340 GCTTCTGATG TTGAAACTGC AGAAGGAGGG GAGATCCACG AGCTCCTGAG ACTCCAATCC 2400 AGAGGCAACA ACTTTCCGAA GCTTGGGAAA ACTCTCAATG TTCCTCCGCC CCCGGACCCC 2460 GGTAGGGCCA GCACTTCCGG GACACCCATT AAAAAGGGCA CAGACGCGAG ATTAGCCTCA 2520 TTTGGAACGG AGATCGCGTC TTTATTGACA GGTGGTGCAA CCCAATGTGC TCGAAAGTCA 2580 CCCTCGGAAC CATCAGGGCC AGGTGCACCT GCGGGGAATG TCCCCGAGTA TGTGAGCAAT 2640 GCCGCACTGA TACAGGAGTG GACACCCGAA TCTGGTACCA CAATCTCCCC GAGATCCCAG 2700 AATAATGAAG AAGGGGGAGA CTATTATGAT GATGAGCTGT TCTCTGATGT CCAAGATATT 2760 AAAACAGCCT TGGCCAAAAT ACACGAGGAT AATCAGAAGA TAATCTCCAA GCTAGAATCA 2820 CTGCTGTTAT TGAAGGGAGA AGTTGAGTCA ATTAAGAAGC AGATCAACAG GCAAAATATC 2880 AGCATATCCA CCCTGGAAGG ACACCTCTCA AGCATCATGA TCGCCATTCC TGGACTTGGG 2940 AAGGATCCCA ACGACCCCAC TGCAGATGTC GAAATCAATC CCGACTTGAA ACCCATCATA 3000 GGCAGAGATT CAGGCCGAGC ACTGGCCGAA GTTCTCAAGA AACCCGTTGC CAGCCGACAA 3060 CTCCAAGGAA TGACAAATGG ACGGACCAGT TCCAGAGGAC AGCTGCTGAA GGAATTTCAG 3120 CCAAAGCCGA TCGGGAAAAA GATGAGCTCA GCCGTCGGGT TTGTTCCTGA CACCGGCCCT 3180 GCATCACGCA GTGTAATCCG CTCCATTATA AAATCCAGCC GGCTAGAGGA GGATCGGAAG 3240 CGTTACCTGA TGACTCTCCT TGATGATATC AAAGGAGCCA ATGATCTTGC CAAGTTCCAC 3300 CAGATGCTGA TGAAGATAAT AATGAAGTAG CTACAGCTCA ACTTACCTGC CAACCCCATG 3360 CCAGTCGACC CAACTAGTAC AACCTAAATC CATTATAAAA AACTTAGGAG CAAAGTGATT 3420 GCCTCCCAAG TTCCACAATG ACAGAGATCT ACGACTTCGA CAAGTCGGCA TGGGACATCA 3480 AAGGGTCGAT CGCTCCGATA CAACCGACCA CCTACAGTGA TGGCAGGCTG GTGCCCCAGG 3540 TCAGAGTCAT AGATCCTGGT CTAGGCGACA GGAAGGATGA ATGCTTTATG TACATGTCTC 3600 TGCTGGGGGT TGTTGAGGAC AGCGATCCCC TAGGGCCTCC AATCGGGCGA GCATTTGGGT 3660 CCCTGCCCTT AGGTGTTGGC AGATCCACAG CAAAGCCCGA AAAACTCCTC AAAGAGGCCA 3720 CTGAGCTTGA CATAGTTGTT AGACGTACAG CAGGGCTCAA TGAAAAACTG GTGTTCTACA 3780 ACAACACCCC ACTAACTCTC CTCACACCTT GGAGAAAGGT CCTAACAACA GGGAGTGTCT 3840 TCAACGCAAA CCAAGTGTGC AATGCGGTTA ATCTGATACC GCTCGATACC CCGCAGAGGT 3900 TCCGTGTTGT TTATATGAGC ATCACCCGTC TTTCGGATAA CGGGTATTAC ACCGTTCCTA 3960 GAAGAATGCT GGAATTCAGA TCGGTCAATG CAGTGGCCTT CAACCTGCTG GTGACCCTTA 4020 GGATTGACAA GGCGATAGGC CCTGGGAAGA TCATCGACAA TACAGAGCAA CTTCCTGAGG 4080 CAACATTTAT GGTCCACATC GGGAACTTCA GGAGAAAGAA GAGTGAAGTC TACTCTGCCG 4140 ATTATTGCAA AATGAAAATC GAAAAGATGG GCCTGGTTTT TGCACTTGGT GGGATAGGGG 4200 GCACCAGTCT TCACATTAGA AGCACAGGCA AAATGAGCAA GACTCTCCAT GCACAACTCG 4260 GGTTCAAGAA GACCTTATGT TACCCGCTGA TGGATATCAA TGAAGACCTT AATCGATTAC 4320 TCTGGAGGAG CAGATGCAAG ATAGTAAGAA TCCAGGCAGT TTTGCAGCCA TCAGTTCCTC 4380 AAGAATTCCG CATTTACGAC GACGTGATCA TAAATGATGA CCAAGGACTA TTCAAAGTTC 4440 TGTAGACCGT AGTGCCCAGC AATGCCCGAA AACGACCCCC CTCACAATGA CAGCCAGAAG 4500 GCCCGGACAA AAAAGCCCCC TCCGAAAGAC TCCACTGACC AAGCGAGAGG CCAGCCAGCA 4560 GCCGACGGCA AGCACGAACA CCAGGCGGCC CCAGCACAGA ACAGCCCTGA TACAAGGCCA 4620 CCACCAGCCA CCCCAATCTG CATCCTCCTC GTGGGACCCC CGAGGACCAA CCCCCAAGGC 4680 TGCCCCCGAT CCAAACCACC AACCGCATCC CCACCACCCC CGGGAAAGAA ACCCCCAGCA 4740 ATTGGAAGGC CCCTCCCCCT CTTCCTCAAC ACAAGAACTC CACAACCGAA CCGCACAAGC 4800 GACCGAGGTG ACCCAACCGC AGGCATCCGA CTCCCTAGAC AGATCCTCTC TCCCCGGCAA 4860 ACTAAACAAA ACTTAGGGCC AAGGAACATA CACACCCAAC AGAACCCAGA CCCCGGCCCA 4920 CGGCGCCGCG CCCCCAACCC CCGACAACCA GAGGGAGCCC CCAACCAATC CCGCCGGCTC 4980 CCCCGGTGCC CACAGGCAGG GACACCAACC CCCGAACAGA CCCAGCACCT AACCATCGAC 5040 AATCCAAGAC GGGGGGGCCC CCCCAAAAAA AGGCCCCCAG GGGCCGACAG CCAGCACCGC 5100 GAGGAAGCCC ACCCACCCCA CACACGACCA CGGCAACCAA ACCAGAACCC AGACCACCCT 5160 GGGCCACCAG CTCCCAGACT CGGCCATCAC CCCGCAGAAA GGAAAGGCCA CAACCCGCGC 5220 ACCCCAGCCC CGATCCGGCG GGGAGCCACC CAACCCGAAC CAGCACCCAA GAGCGATCCC 5280 CGAAGGACCC CCGAACCGCA AAGGACATCA GTATCCCACA GCCTCTCCAA GTCCCCCGGT 5340 CTCCTCCTCT TCTCGAAGGG ACCAAAAGAT CAATCCACCA CACCCGACGA CACTCAACTC 5400 CCCACCCCTA AAGGAGACAC CGGGAATCCC AGAATCAAGA CTCATCCAAT GTCCATCATG 5460 GGTCTCAAGG TGAACGTCTC TGCCATATTC ATGGCAGTAC TGTTAACTCT CCAAACACCC 5520 ACCGGTCAAA TCCATTGGGG CAATCTCTCT AAGATAGGGG TGGTAGGAAT AGGAAGTGCA 5580 AGCTACAAAG TTATGACTCG TTCCAGCCAT CAATCATTAG TCATAAAATT AATGCCCAAT 5640 ATAACTCTCC TCAATAACTG CACGAGGGTA GAGATTGCAG AATACAGGAG ACTACTGAGA 5700 ACAGTTTTGG AACCAATTAG AGATGCACTT AATGCAATGA CCCAGAATAT AAGACCGGTT 5760 CAGAGTGTAG CTTCAAGTAG GAGACACAAG AGATTTGCGG GAGTAGTCCT GGCAGGTGCG 5820 GCCCTAGGCG TTGCCACAGC TGCTCAGATA ACAGCCGGCA TTGCACTTCA CCAGTCCATG 5880 CTGAACTCTC AAGCCATCGA CAATCTGAGA GCGAGCCTGG AAACTACTAA TCAGGCAATT 5940 GAGGCAATCA GACAAGCAGG GCAGGAGATG ATATTGGCTG TTCAGGGTGT CCAAGACTAC 6000 ATCAATAATG AGCTGATACC GTCTATGAAC CAACTATCTT GTGATTTAAT CGGCCAGAAG 6060 CTCGGGCTCA AATTGCTCAG ATACTATACA GAAATCCTGT CATTATTTGG CCCCAGCTTA 6120 CGGGACCCCA TATCTGCGGA GATATCTATC CAGGCTTTGA GCTATGCGCT TGGAGGAGAC 6180 ATCAATAAGG TGTTAGAAAA GCTCGGATAC AGTGGAGGTG ATTTACTGGG CATCTTAGAG 6240 AGCAGAGGAA TAAAGGCCCG GATAACTCAC GTCGACACAG AGTCCTACTT CATTGTCCTC 6300 AGTATAGCCT ATCCGACGCT GTCCGAGATT AAGGGGGTGA TTGTCCACCG GCTAGAGGGG 6360 GTCTCGTACA ACATAGGCTC TCAAGAGTGG TATACCACTG TGCCCAAGTA TGTTGCAACC 6420 CAAGGGTACC TTATCTCGAA TTTTGATGAG TCATCGTGTA CTTTCATGCC AGAGGGGACT 6480 GTGTGCAGCC AAAATGCCTT GTACCCGATG AGTCCTCTGC TCCAAGAATG CCTCCGGGGG 6540 TACACCAAGT CCTGTGCTCG TACACTCGTA TCCGGGTCTT TTGGGAACCG GTTCATTTTA 6600 TCACAAGGGA ACCTAATAGC CAATTGTGCA TCAATCCTTT GCAAGTGTTA CACAACAGGA 6660 ACGATCATTA ATCAAGACCC TGACAAGATC CTAACATACA TTGCTGCCGA TAACTGCCCG 6720 GTAGTCGAGG TGAACGGCGT GACCATCCAA GTCGGGAGCA GGAGGTATCC AGACGCTGTG 6780 TACTTGCACA GAATTGACCT CGGTCCTCCC ATATTATTGG AGAGGTTGGA CGTAGGGACA 6840 AATCTGGGGA ATGCAATTGC TAAGTTGGAG GATGCCAAGG AATTGTTGGA GTCATCGGAC 6900 CAGATATTGA GGAGTATGAA AGGTTTATCG AGCACTTGCA TAGTCTACAT CCTGATTGCA 6960 GTGTGTCTTG GAGGGTTGAT AGGGATCCCC GCTTTAATAT GTTGCTGCAG GGGGCGTTGT 7020 AACAAAAAGG GAGAACAAGT TGGTATGTCA AGACCAGGCC TAAAGCCTGA TCTTACGGGA 7080 ACATCAAAAT CCTATGTAAG GTCGCTCTGA TCCTCTACAA CTCTTGAAAC ACAAATGTCC 7140 CACAAGTCTC CTCTTCGTCA TCAAGCAACC ACCGCACCCA GCATCAAGCC CACCTGAAAT 7200 TATCTCCGGC TTCCCTCTGG CCGAACAATA TCGGTAGTTA ATTAAAACTT AGGGTGCAAG 7260 ATCATCCACA ATGTCACCAC AACGAGACCG GATAAATGCC TTCTACAAAG ATAACCCCCA 7320 TCCCAAGGGA AGTAGGATAG TCATTAACAG AGAACATCTT ATGATTGATA GACCTTATGT 7380 TTTGCTGGCT GTTCTGTTTG TCATGTTTCT GAGCTTGATC GGGTTGCTAG CCATTGCAGG 7440 CATTAGACTT CATCGGGCAG CCATCTACAC CGCAGAGATC CATAAAAGCC TCAGCACCAA 7500 TCTAGATGTA ACTAACTCAA TCGAGCATCA GGTCAAGGAC GTGCTGACAC CACTCTTCAA 7560 AATCATCGGT GATGAAGTGG GCCTGAGGAC ACCTCAGAGA TTCACTGACC TAGTGAAATT 7620 CATCTCTGAC AAGATTAAAT TCCTTAATCC GGATAGGGAG TACGACTTCA GAGATCTCAC 7680 TTGGTGTATC AACCCGCCAG AGAGAATCAA ATTGGATTAT GATCAATACT GTGCAGATGT 7740 GGCTGCTGAA GAGCTCATGA ATGCATTGGT GAACTCAACT CTACTGGAGA CCAGAACAAC 7800 CAATCAGTTC CTAGCTGTCT CAAAGGGAAA CTGCTCAGGG CCCACTACAA TCAGAGGTCA 7860 ATTCTCAAAC ATGTCGCTGT CCCTGTTAGA CTTGTATTTA GGTCGAGGTT ACAATGTGTC 7920 ATCTATAGTC ACTATGACAT CCCAGGGAAT GTATGGGGGA ACTTACCTAG TGGAAAAGCC 7980 TAATCTGAGC AGCAAAAGGT CAGAGTTGTC ACAACTGAGC ATGTACCGAG TGTTTGAAGT 8040 AGGTGTTATC AGAAATCCGG GTTTGGGGGC TCCGGTGTTC CATATGACAA ACTATCTTGA 8100 GCAACCAGTC AGTAATGATC TCAGCAACTG TATGGTGGCT TTGGGGGAGC TCAAACTCGC 8160 AGCCCTTTGT CACCGGGAAG ATTCTATCAC AATTCCCTAT CAGGGATCAG GGAAAGGTGT 8220 CAGCTTCCAG CTCGTCAAGC TAGGTGTCTG GAAATCCCCA ACCGACATGC AATCCTGGGT 8280 CACCTTATCA ACGGATGATC CAGTGATAGA CAGGCTTTAC CTCTCATCTC ACAGAGGTGT 8340 TATCGCTGAC AATCAAGCAA AATGGGCTGT CCCGACAACA CGAACAGATG ACAAGTTGCG 8400 AATGGAGACA TGCTTCCAAC AGGCGTGTAA GGGTAAAATC CAAGCACTCT GCGAGAATCC 8460 CGAGTGGGCA CCATTGAAGG ATAACAGGAT TCCTTCATAC GGGGTCTTGT CTGTTGATCT 8520 GAGTCTGACA GTTGAGCTTA AAATCAAAAT TGCTTCGGGA TTCGGGCCAT TGATCACACA 8580 CGGTTCAGGG ATGGACCTAT ACAAATCCAA CCACAACAAT GTGTATTGGC TGACTATCCC 8640 ACCAATGAAG AACCTAGCCT TAGGTGTAAT CAACACATTG GAGTGGATAC CGAGATTCAA 8700 GGTTAGTCCC TACCTCTTCA ATGTCCCAAT TAAGGAAGCA GGCGAAGACT GCCATGCCCC 8760 AACATACCTA CCTGCGGAGG TGGATGGTGA TGTCAAACTC AGTTCCAATC TGGTGATTCT 8820 ACCTGGTCAA GATCTCCAAT ATGTTTTGGC AACCTACGAT ACTTCCAGGG TTGAACATGC 8880 TGTGGTTTAT TACGTTTACA GCCCAAGCCG CTCATTTTCT TACTTTTATC CTTTTAGGTT 8940 GCCTATAAAG GGGGTCCCCA TCGAATTACA AGTGGAATGC TTCACATGGG ACCAAAAACT 9000 CTGGTGCCGT CACTTCTGTG TGCTTGCGGA CTCAGAATCT GGTGGACATA TCACTCACTC 9060 TGGGATGGTG GGCATGGGAG TCAGCTGCAC AGTCACCCGG GAAGATGGAA CCAATCGCAG 9120 ATAGGGCTGC TAGTGAACTA ATCTCATGAT GTCACCCAGA CATCAGGCAT ACCCACTAGT 9180 GTGAAATAGA CATCAGAATT AAGAAAAACG TAGGGTCCAA GTGGTTCCCC GTTATGGACT 9240 CGCTATCTGT CAACCAGATC TTATACCCTG AAGTTCACCT AGATAGCCCG ATAGTTACCA 9300 ATAAGATAGT AGCCATCCTG GAGTATGCTC GAGTCCCTCA CGCTTACAGC CTGGAGGACC 9360 CTACACTGTG TCAGAACATC AAGCACCGCC TAAAAAACGG ATTTTCCAAC CAAATGATTA 9420 TAAACAATGT GGAAGTTGGG AATGTCATCA AGTCCAAGCT TAGGAGTTAT CCGGCCCACT 9480 CTCATATTCC ATATCCAAAT TGTAATCAGG ATTTATTTAA CATAGAAGAC AAAGAGTCAA 9540 CGAGGAAGAT CCGTGAACTC CTCAAAAAGG GGAATTCGCT GTACTCCAAA GTCAGTGATA 9600 AGGTTTTCCA ATGCTTAAGG GACACTAACT CACGGCTTGG CCTAGGCTCC GAATTGAGGG 9660 AGGACATCAA GGAGAAAGTT ATTAACTTGG GAGTTTACAT GCACAGCTCC CAGTGGTTTG 9720 AGCCCTTTCT GTTTTGGTTT ACAGTCAAGA CTGAGATGAG GTCAGTGATT AAATCACAAA 9780 CCCATACTTG CCATAGGAGG AGACACACAC CTGTATTCTT CACTGGTAGT TCAGTTGAGT 9840 TGCTAATCTC TCGTGACCTT GTTGCTATAA TCAGTAAAGA GTCTCAACAT GTATATTACC 9900 TGACATTTGA ACTGGTTTTG ATGTATTGTG ATGTCATAGA GGGGAGGTTA ATGACAGAGA 9960 CCGCTATGAC TATTGATGCT AGGTATACAG AGCTTCTAGG AAGAGTCAGA TACATGTGGA 10020 AACTGATAGA TGGTTTCTTC CCTGCACTCG GGAATCCAAC TTATCAAATT GTAGCCATGC 10080 TGGAGCCTCT TTCACTTGCT TACCTGCAGC TGAGGGATAT AACAGTAGAA CTCAGAGGTG 10140 CTTTCCTTAA CCACTGCTTT ACTGAAATAC ATGATGTTCT TGACCAAAAC GGGTTTTCTG 10200 ATGAAGGTAC TTATCATGAG TTAATTGAAG CTCTAGATTA CATTTTCATA ACTGATGACA 10260 TACATCTGAC AGGGGAGATT TTCTCATTTT TCAGAAGTTT CGGCCACCCC AGACTTGAAG 10320 CAGTAACGGC TGCTGAAAAT GTTAGGAAAT ACATGAATCA GCCTAAAGTC ATTGTGTATG 10380 AGACTCTGAT GAAAGGTCAT GCCATATTTT GTGGAATCAT AATCAACGGC TATCGTGACA 10440 GGCACGGAGG CAGTTGGCCA CCGCTGACCC TCCCCCTGCA TGCTGCAGAC ACAATCCGGA 10500 ATGCTCAAGC TTCAGGTGAA GGGTTAACAC ATGAGCAGTG CGTTGATAAC TGGAAATCTT 10560 TTGCTGGAGT GAAATTTGGC TGCTTTATGC CTCTTAGCCT GGATAGTGAT CTGACAATGT 10620 ACCTAAAGGA CAAGGCACTT GCTGCTCTCC AAAGGGAATG GGATTCAGTT TACCCGAAAG 10680 AGTTCCTGCG TTACGACCCT CCCAAGGGAA CCGGGTCACG GAGGCTTGTA GATGTTTTCC 10740 TTAATGATTC GAGCTTTGAC CCATATGATG TGATAATGTA TGTTGTAAGT GGAGCTTACC 10800 TCCATGACCC TGAGTTCAAC CTGTCTTACA GCCTGAAAGA AAAGGAGATC AAGGAAACAG 10860 GTAGACTTTT TGCTAAAATG ACTTACAAAA TGAGGGCATG CCAAGTGATT GCTGAAAATC 10920 TAATCTCAAA CGGGATTGGC AAATATTTTA AGGACAATGG GATGGCCAAG GATGAGCACG 10980 ATTTGACTAA GGCACTCCAC ACTCTAGCTG TCTCAGGAGT CCCCAAAGAT CTCAAAGAAA 11040 GTCACAGGGG GGGGCCAGTC TTAAAAACCT ACTCCCGAAG CCCAGTCCAC ACAAGTACCA 11100 GGAACGTGAG AGCAGCAAAA GGGTTTATAG GGTTCCCTCA AGTAATTCGG CAGGACCAAG 11160 ACACTGATCA TCCGGAGAAT ATGGAAGCTT ACGAGACAGT CAGTGCATTT ATCACGACTG 11220 ATCTCAAGAA GTACTGCCTT AATTGGAGAT ATGAGACCAT CAGCTTGTTT GCACAGAGGC 11280 TAAATGAGAT TTACGGATTG CCCTCATTTT TCCAGTGGCT GCATAAGAGG CTTGAGACCT 11340 CTGTCCTGTA TGTAAGTGAC CCTCATTGCC CCCCCGACCT TGACGCCCAT ATCCCGTTAT 11400 ATAAAGTCCC CAATGATCAA ATCTTCATTA AGTACCCTAT GGGAGGTATA GAAGGGTATT 11460 GTCAGAAGCT GTGGACCATC AGCACCATTC CCTATCTATA CCTGGCTGCT TATGAGAGCG 11520 GAGTAAGGAT TGCTTCGTTA GTGCAAGGGG ACAATCAGAC CATAGCCGTA ACAAAAAGGG 11580 TACCCAGCAC ATGGCCCTAC AACCTTAAGA AACGGGAAGC TGCTAGAGTA ACTAGAGATT 11640 ACTTTGTAAT TCTTAGGCAA AGGCTACATG ATATTGGCCA TCACCTCAAG GCAAATGAGA 11700 CAATTGTTTC ATCACATTTT TTTGTCTATT CAAAAGGAAT ATATTATGAT GGGCTACTTG 11760 TGTCCCAATC ACTCAAGAGC ATCGCAAGAT GTGTATTCTG GTCAGAGACT ATAGTTGATG 11820 AAACAAGGGC AGCATGCAGT AATATTGCTA CAACAATGGC TAAAAGCATC GAGAGAGGTT 11880 ATGACCGTTA CCTTGCATAT TCCCTGAACG TCCTAAAAGT GATACAGCAA ATTCTGATCT 11940 CTCTTGGCTT CACAATCAAT TCAACCATGA CCCGGGATGT AGTCATACCC CTCCTCACAA 12000 ACAACGACCT CTTAATAAGG ATGGCACTGT TGCCCGCTCC TATTGGGGGG ATGAATTATC 12060 TGAATATGAG CAGGCTGTTT GTCAGAAACA TCGGTGATCC AGTAACATCA TCAATTGCTG 12120 ATCTCAAGAG AATGATTCTC GCCTCACTAA TGCCTGAAGA GACCCTCCAT CAAGTAATGA 12180 CACAACAACC GGGGGACTCT TCATTCCTAG ACTGGGCTAG CGACCCTTAC TCAGCAAATC 12240 TTGTATGTGT CCAGAGCATC ACTAGACTCC TCAAGAACAT AACTGCAAGG TTTGTCCTGA 12300 TCCATAGTCC AAACCCAATG TTAAAAGGAT TATTCCATGA TGACAGTAAA GAAGAGGACG 12360 AGGGACTGGC GGCATTCCTC ATGGACAGGC ATATTATAGT ACCTAGGGCA GCTCATGAAA 12420 TCCTGGATCA TAGTGTCACA GGGGCAAGAG AGTCTATTGC AGGCATGCTG GATACCACAA 12480 AAGGCCTGAT TCGAGCCAGC ATGAGGAAGG GGGGGTTAAC CTCTCGAGTG ATAACCAGAT 12540 TGTCCAATTA TGACTATGAA CAATTCAGAG CAGGGATGGT GCTATTGACA GGAAGAAAGA 12600 GAAATGTCCT CATTGACAAA GAGTCATGTT CAGTGCAGCT GGCGAGAGCT CTAAGAAGCC 12660 ATATGTGGGC GAGGCTAGCT CGAGGACGGC CTATTTACGG CCTTGAGGTC CCTGATGTAC 12720 TAGAATCTAT GCGAGGCCAC CTTATTCGGC GTCATGAGAC ATGTGTCATC TGCGAGTGTG 12780 GATCAGTCAA CTACGGATGG TTTTTTGTCC CCTCGGGTTG CCAACTGGAT GATATTGACA 12840 AGGAAACATC ATCCTTGAGA GTCCCATATA TTGGTTCTAC CACTGATGAG AGAACAGACA 12900 TGAAGCTTGC CTTCGTAAGA GCCCCAAGTC GATCCTTGCG ATCTGCTGTT AGAATAGCAA 12960 CAGTGTACTC ATGGGCTTAC GGTGATGATG ATAGCTCTTG GAACGAAGCC TGGTTGTTGG 13020 CTAGGCAAAG GGCCAATGTG AGCCTGGAGG AGCTAAGGGT GATCACTCCC ATCTCAACTT 13080 CGACTAATTT AGCGCATAGG TTGAGGGATC GTAGCACTCA AGTGAAATAC TCAGGTACAT 13140 CCCTTGTCCG AGTGGCGAGG TATACCACAA TCTCCAACGA CAATCTCTCA TTTGTCATAT 13200 CAGATAAGAA GGTTGATACT AACTTTATAT ACCAACAAGG AATGCTTCTA GGGTTGGGTG 13260 TTTTAGAAAC ATTGTTTCGA CTCGAGAAAG ATACCGGATC ATCTAACACG GTATTACATC 13320 TTCACGTCGA AACAGATTGT TGCGTGATCC CGATGATAGA TCATCCCAGG ATACCCAGCT 13380 CCCGCAAGCT AGAGCTGAGG GCAGAGCTAT GTACCAACCC ATTGATATAT GATAATGCAC 13440 CTTTAATTGA CAGAGATACA ACAAGGCTAT ACACCCAGAG CCATAGGAGG CACCTTGTGG 13500 AATTTGTTAC ATGGTCCACA CCCCAACTAT ATCACATTTT AGCTAAGTCC ACAGCACTAT 13560 CTATGATTGA CCTGGTAACA AAATTTGAGA AGGACCATAT GAATGAAATT TCAGCTCTCA 13620 TAGGGGATGA CGATATCAAT AGTTTCATAA CTGAGTTTCT GCTCATAGAG CCAAGATTAT 13680 TCACTATCTA CTTGGGCCAG TGTGCGGCCA TCAATTGGGC ATTTGATGTA CATTATCATA 13740 GACCATCAGG GAAATATCAG ATGGGTGAGC TGTTGTCATC GTTCCTTTCT AGAATGAGCA 13800 AAGGAGTGTT TAAGGTGCTT GTCAATGCTC TAAGCCACCC AAAGATCTAC AAGAAATTCT 13860 GGCATTGTGG TATTATAGAG CCTATCCATG GTCCTTCACT TGATGCTCAA AACTTGCACA 13920 CAACTGTGTG CAACATGGTT TACACATGCT ATATGACCTA CCTCGACCTG TTGTTGAATG 13980 AAGAGTTAGA AGAGTTCACA TTTCTCTTGT GTGAAAGCGA CGAGGATGTA GTACCGGACA 14040 GATTCGACAA CATCCAGGCA AAACACTTAT GTGTTCTGGC AGATTTGTAC TGTCAACCAG 14100 GGGCCTGCCC ACCAATTCGA GGTCTAAGAC CGGTAGAGAA ATGTGCAGTT CTAACCGACC 14160 ATATCAAGGC AGAGGCTAGG TTATCTCCAG CAGGATCTTC GTGGAACATA AATCCAATTA 14220 TTGTAGACCA TTACTCATGC TCTCTGACTT ATCTCCGGCG AGGATCGATC AAACAGATAA 14280 GATTGAGAGT TGATCCAGGA TTCATTTTCG ACGCCCTCGC TGAGGTAAAT GTCAGTCAGC 14340 CAAAGATCGG CAGCAACAAC ATCTCAAATA TGAGCATCAA GGCTTTCAGA CCCCCACACG 14400 ATGATGTTGC AAAATTGCTC AAAGATATCA ACACAAGCAA GCACAATCTT CCCATTTCAG 14460 GGGGCAATCT CGCCAATTAT GAAATCCATG CTTTCCGCAG AATCGGGTTG AACTCATCTG 14520 CTTGCTACAA AGCTGTTGAG ATATCAACAT TAATTAGGAG ATGCCTTGAG CCAGGGGAGG 14580 ACGGCTTGTT CTTGGGTGAG GGATCGGGTT CTATGTTGAT CACTTATAAG GAGATACTTA 14640 AACTAAACAA GTGCTTCTAT AATAGTGGGG TTTCCGCCAA TTCTAGATCT GGTCAAAGGG 14700 AATTAGCACC CTATCCCTCC GAAGTTGGCC TTGTCGAACA CAGAATGGGA GTAGGTAATA 14760 TTGTCAAAGT GCTCTTTAAC GGGAGGCCCG AAGTCACGTG GGTAGGCAGT GTAGATTGCT 14820 TCAATTTCAT AGTTAGTAAT ATCCCTACCT CTAGTGTGGG GTTTATCCAT TCAGATATAG 14880 AGACCTTGCC TAACAAAGAT ACTATAGAGA AGCTAGAGGA ATTGGCAGCC ATCTTATCGA 14940 TGGCTCTGCT CCTGGGCAAA ATAGGATCAA TACTGGTGAT TAAGCTTATG CCTTTCAGCG 15000 GGGATTTTGT TCAGGGATTT ATAAGTTATG TAGGGTCTTA TTATAGAGAA GTGAACCTTG 15060 TATACCCTAG ATACAGCAAC TTCATATCTA CTGAATCTTA TTTGGTTATG ACAGATCTCA 15120 AGGCTAACCG GCTAATGAAT CCTGAAAAGA TTAAGCAGCA GATAATTGAA TCATCTGTGA 15180 GGACTTCACC TGGACTTATA GGTCACATCC TATCCATTAA GCAACTAAGC TGCATACAAG 15240 CAATTGTGGG AGACGCAGTT AGTAGAGGTG ATATCAATCC TACTCTGAAA AAACTTACAC 15300 CTATAGAGCA GGTGCTGATC AATTGCGGGT TGGCAATTAA CGGACCTAAG CTGTGCAAAG 15360 AATTGATCCA CCATGATGTT GCCTCAGGGC AAGATGGATT GCTTAATTCT ATACTCATCC 15420 TCTACAGGGA GTTGGCAAGA TTCAAAGACA ACCGAAGAAG TCAACAAGGG ATGTTCCACG 15480 CTTACCCCGT ATTGGTAAGT AGCAGGCAAC GAGAACTTAT ATCTAGGATC ACCCGCAAAT 15540 TTTGGGGGCA CATTCTTCTT TACTCCGGGA ACAGAAAGTT GATAAATAAG TTTATCCAGA 15600 ATCTCAAGTC CGGCTATCTG ATACTAGACT TACACCAGAA TATCTTCGTT AAGAATCTAT 15660 CCAAGTCAGA GAAACAGATT ATTATGACGG GGGGTTTGAA ACGTGAGTGG GTTTTTAAGG 15720 TAACAGTCAA GGAGACCAAA GAATGGTATA AGTTAGTCGG ATACAGTGCC CTGATTAAGG 15780 ACTAATTGAT TGAACTCCGG AACCCTAATC CTGCCCTAGG TGGTTAGGCA TTATTTGCAA 15840 TATATTAAAG AAAACTTTGA AAATACGAAG TTTCTATTCC CAGCTTTGTC TGGT 15894 (2) SEQ ID NO: 16 information about: (I) SEQUENCE CHARACTERISTICS: ...
(A) length: 2183 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity is molecule type (ii): protein (xi) sequence description: SEQ ID NO:16:Met Asp Ser Leu Ser Val Asn Gln Ile Leu Tyr Pro Glu Val His Leu 15 10 15Asp Ser Pro Ile Val Thr Asn Lys Ile Val Ala Ile Leu Glu Tyr Ala
20??????????????????25??????????????????30Arg?Val?Pro?His?Ala?Tyr?Ser?Leu?Glu?Asp?Pro?Thr?Leu?Cys?Gln?Asn
35??????????????????40??????????????????45Ile?Lys?His?Arg?Leu?Lys?Asn?Gly?Phe?Ser?Asn?Gln?Met?Ile?Ile?Asn
50??????????????????55??????????????????60Asn?Val?Glu?Val?Gly?Asn?Val?Ile?Lys?Ser?Lys?Leu?Arg?Ser?Tyr?Pro65??????????????????70??????????????????75??????????????????80Ala?His?Ser?His?Ile?Pro?Tyr?Pro?Asn?Cys?Asn?Gln?Asp?Leu?Phe?Asn
85??????????????????90??????????????????95Ile?Glu?Asp?Lys?Glu?Ser?Thr?Arg?Lys?Ile?Arg?Glu?Leu?Leu?Lys?Lys
100?????????????????105?????????????????110Gly?Asn?Ser?Leu?Tyr?Ser?Lys?Val?Ser?Asp?Lys?Val?Phe?Gln?Cys?Leu
115?????????????????120?????????????????125Arg?Asp?Thr?Asn?Ser?Arg?Leu?Gly?Leu?Gly?Ser?Glu?Leu?Arg?Glu?Asp
130?????????????????135?????????????????140Ile?Lys?Glu?Lys?Val?Ile?Asn?Leu?Gly?Val?Tyr?Met?His?Ser?Ser?Gln145?????????????????150?????????????????155?????????????????160Trp?Phe?Glu?Pro?Phe?Leu?Phe?Trp?Phe?Thr?Val?Lys?Thr?Glu?Met?Arg
165?????????????????170?????????????????175Ser?Val?Ile?Lys?Ser?Gln?Thr?His?Thr?Cys?His?Arg?Arg?Arg?His?Thr
180?????????????????185?????????????????190Pro?Val?Phe?Phe?Thr?Gly?Ser?Ser?Val?Glu?Leu?Leu?Ile?Ser?Arg?Asp
195?????????????????200?????????????????205Leu?Val?Ala?Ile?Ile?Ser?Lys?Glu?Ser?Gln?His?Val?Tyr?Tyr?Leu?Thr
210?????????????????215?????????????????220Phe?Glu?Leu?Val?Leu?Met?Tyr?Cys?Asp?Val?Ile?Glu?Gly?Arg?Leu?Met225?????????????????230?????????????????235?????????????????240Thr?Glu?Thr?Ala?Met?Thr?Ile?Asp?Ala?Arg?Tyr?Thr?Glu?Leu?Leu?Gly
245?????????????????250?????????????????255Arg?Val?Arg?Tyr?Met?Trp?Lys?Leu?Ile?Asp?Gly?Phe?Phe?Pro?Ala?Leu
260?????????????????265?????????????????270Gly?Asn?Pro?Thr?Tyr?Gln?Ile?Val?Ala?Met?Leu?Glu?Pro?Leu?Ser?Leu
275?????????????????280?????????????????285Ala?Tyr?Leu?Gln?Leu?Arg?Asp?Ile?Thr?Val?Glu?Leu?Arg?Gly?Ala?Phe
290?????????????????295?????????????????300Leu?Asn?His?Cys?Phe?Thr?Glu?Ile?His?Asp?Val?Leu?Asp?Gln?Asn?Gly305?????????????????310?????????????????315?????????????????320Phe?Ser?Asp?Glu?Gly?Thr?Tyr?His?Glu?Leu?Ile?Glu?Ala?Leu?Asp?Tyr
325?????????????????330?????????????????335Ile?Phe?Ile?Thr?Asp?Asp?Ile?His?Leu?Thr?Gly?Glu?Ile?Phe?Ser?Phe
340?????????????????345?????????????????350Phe?Arg?Ser?Phe?Gly?His?Pro?Arg?Leu?Glu?Ala?Val?Thr?Ala?Ala?Glu
355?????????????????360?????????????????365Asn?Val?Arg?Lys?Tyr?Met?Asn?Gln?Pro?Lys?Val?Ile?Val?Tyr?Glu?Thr
370?????????????????375?????????????????380Leu?Met?Lys?Gly?His?Ala?Ile?Phe?Cys?Gly?Ile?Ile?Ile?Asn?Gly?Tyr385?????????????????390?????????????????395?????????????????400Arg?Asp?Arg?His?Gly?Gly?Ser?Trp?Pro?Pro?Leu?Thr?Leu?Pro?Leu?His
405?????????????????410?????????????????415Ala?Ala?Asp?Thr?Ile?Arg?Asn?Ala?Gln?Ala?Ser?Gly?Glu?Gly?Leu?Thr
420?????????????????425?????????????????430His?Glu?Gln?Cys?Val?Asp?Asn?Trp?Lys?Ser?Phe?Ala?Gly?Val?Lys?Phe
435?????????????????440?????????????????445Gly?Cys?Phe?Met?Pro?Leu?Ser?Leu?Asp?Ser?Asp?Leu?Thr?Met?Tyr?Leu
450?????????????????455?????????????????460Lys?Asp?Lys?Ala?Leu?Ala?Ala?Leu?Gln?Arg?Glu?Trp?Asp?Ser?Val?Tyr465?????????????????470?????????????????475?????????????????480Pro?Lys?Glu?Phe?Leu?Arg?Tyr?Asp?Pro?Pro?Lys?Gly?Thr?Gly?Ser?Arg
485?????????????????490?????????????????495Arg?Leu?Val?Asp?Val?Phe?Leu?Asn?Asp?Ser?Ser?Phe?Asp?Pro?Tyr?Asp
500?????????????????505?????????????????510Val?Ile?Met?Tyr?Val?Val?Ser?Gly?Ala?Tyr?Leu?His?Asp?Pro?Glu?Phe
515?????????????????520?????????????????525Asn?Leu?Ser?Tyr?Ser?Leu?Lys?Glu?Lys?Glu?Ile?Lys?Glu?Thr?Gly?Arg
530?????????????????535?????????????????540Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys?Met?Arg?Ala?Cys?Gln?Val?Ile?Ala545?????????????????550?????????????????555?????????????????560Glu?Asn?Leu?Ile?Ser?Asn?Gly?Ile?Gly?Lys?Tyr?Phe?Lys?Asp?Asn?Gly
565?????????????????570?????????????????575Met?Ala?Lys?Asp?Glu?His?Asp?Leu?Thr?Lys?Ala?Leu?His?Thr?Leu?Ala
580?????????????????585?????????????????590Val?Ser?Gly?Val?Pro?Lys?Asp?Leu?Lys?Glu?Ser?His?Arg?Gly?Gly?Pro
595?????????????????600?????????????????605Val?Leu?Lys?Thr?Tyr?Ser?Arg?Ser?Pro?Val?His?Thr?Ser?Thr?Arg?Asn
610?????????????????615?????????????????620Val?Arg?Ala?Ala?Lys?Gly?Phe?Ile?Gly?Phe?Pro?Gln?Val?Ile?Arg?Gln625?????????????????630?????????????????635?????????????????640Asp?Gln?Asp?Thr?Asp?His?Pro?Glu?Asn?Met?Glu?Ala?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Ala?Phe?Ile?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Thr?Ile?Ser?Leu?Phe?Ala?Gln?Arg?Leu?Asn?Glu?Ile?Tyr?Gly
675?????????????????680?????????????????685Leu?Pro?Ser?Phe?Phe?Gln?Trp?Leu?His?Lys?Arg?Leu?Glu?Thr?Ser?Val
690?????????????????695?????????????????700Leu?Tyr?Val?Ser?Asp?Pro?His?Cys?Pro?Pro?Asp?Leu?Asp?Ala?His?Ile705?????????????????710?????????????????715?????????????????720Pro?Leu?Tyr?Lys?Val?Pro?Asn?Asp?Gln?Ile?Phe?Ile?Lys?Tyr?Pro?Met
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Tyr?Cys?Gln?Lys?Leu?Trp?Thr?Ile?Ser?Thr?Ile
740?????????????????745?????????????????750Pro?Tyr?Leu?Tyr?Leu?Ala?Ala?Tyr?Glu?Ser?Gly?Val?Arg?Ile?Ala?Ser
755?????????????????760?????????????????765Leu?Val?Gln?Gly?Asp?Asn?Gln?Thr?Ile?Ala?Val?Thr?Lys?Arg?Val?Pro
770?????????????????775?????????????????780Ser?Thr?Trp?Pro?Tyr?Asn?Leu?Lys?Lys?Arg?Glu?Ala?Ala?Arg?Val?Thr785?????????????????790?????????????????795?????????????????800Arg?Asp?Tyr?Phe?Val?Ile?Leu?Arg?Gln?Arg?Leu?His?Asp?Ile?Gly?His
805?????????????????810?????????????????815His?Leu?Lys?Ala?Asn?Glu?Thr?Ile?Val?Ser?Ser?His?Phe?Phe?Val?Tyr
820?????????????????825?????????????????830Ser?Lys?Gly?Ile?Tyr?Tyr?Asp?Gly?Leu?Leu?Val?Ser?Gln?Ser?Leu?Lys
835?????????????????840?????????????????845Ser?Ile?Ala?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Ile?Val?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ala?Ala?Cys?Ser?Asn?Ile?Ala?Thr?Thr?Met?Ala?Lys?Ser?Ile?Glu865?????????????????870?????????????????875?????????????????880Arg?Gly?Tyr?Asp?Arg?Tyr?Leu?Ala?Tyr?Ser?Leu?Asn?Val?Leu?Lys?Val
885?????????????????890?????????????????895Ile?Gln?Gln?Ile?Leu?Ile?Ser?Leu?Gly?Phe?Thr?Ile?Asn?Ser?Thr?Met
900?????????????????905?????????????????910Thr?Arg?Asp?Val?Val?Ile?Pro?Leu?Leu?Thr?Asn?Asn?Asp?Leu?Leu?Ile
915?????????????????920?????????????????925Arg?Met?Ala?Leu?Leu?Pro?Ala?Pro?Ile?Gly?Gly?Met?Asn?Tyr?Leu?Asn
930?????????????????935?????????????????940Met?Ser?Arg?Leu?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Val?Thr?Ser?Ser945?????????????????950?????????????????955?????????????????960Ile?Ala?Asp?Leu?Lys?Arg?Met?Ile?Leu?Ala?Ser?Leu?Met?Pro?Glu?Glu
965?????????????????970?????????????????975Thr?Leu?His?Gln?Val?Met?Thr?Gln?Gln?Pro?Gly?Asp?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Ala?Asn?Leu?Val?Cys?Val?Gln?Ser
995?????????????????1000????????????????1005Ile?Thr?Arg?Leu?Leu?Lys?Asn?Ile?Thr?Ala?Arg?Phe?Val?Leu?Ile?His
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Met?Leu?Lys?Gly?Leu?Phe?His?Asp?Asp?Ser?Lys?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Gly?Leu?Ala?Ala?Phe?Leu?Met?Asp?Arg?His?Ile?Ile?Val
1045????????????????1050????????????????1055Pro?Arg?Ala?Ala?His?Glu?Ile?Leu?Asp?His?Ser?Val?Thr?Gly?Ala?Arg
1060????????????????1065????????????????1070Glu?Ser?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Gly?Leu?Ile?Arg?Ala
1075????????????????1080????????????????1085Ser?Met?Arg?Lys?Gly?Gly?Leu?Thr?Ser?Arg?Val?Ile?Thr?Arg?Leu?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Tyr?Glu?Gln?Phe?Arg?Ala?Gly?Met?Val?Leu?Leu?Thr?Gly1105????????????????1110????????????????1115????????????????1120Arg?Lys?Arg?Asn?Val?Leu?Ile?Asp?Lys?Glu?Ser?Cys?Ser?Val?Gln?Leu
1125????????????????1130????????????????1135Ala?Arg?Ala?Leu?Arg?Ser?His?Met?Trp?Ala?Arg?Leu?Ala?Arg?Gly?Arg
1140????????????????1145????????????????1150Pro?Ile?Tyr?Gly?Leu?Glu?Val?Pro?Asp?Val?Leu?Glu?Ser?Met?Arg?Gly
1155????????????????1160????????????????1165His?Leu?Ile?Arg?Arg?His?Glu?Thr?Cys?Val?Ile?Cys?Glu?Cys?Gly?Ser
1170????????????????1175????????????????1180Val?Asn?Tyr?Gly?Trp?Phe?Phe?Val?Pro?Ser?Gly?Cys?Gln?Leu?Asp?Asp1185????????????????1190????????????????1195????????????????1200Ile?Asp?Lys?Glu?Thr?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Ile?Gly?Ser?Thr
1205????????????????1210????????????????1215Thr?Asp?Glu?Arg?Thr?Asp?Met?Lys?Leu?Ala?Phe?Val?Arg?Ala?Pro?Ser
1220????????????????1225????????????????1230Arg?Ser?Leu?Arg?Ser?Ala?Val?Arg?Ile?Ala?Thr?Val?Tyr?Ser?Trp?Ala
1235????????????????1240????????????????1245Tyr?Gly?Asp?Asp?Asp?Ser?Ser?Trp?Asn?Glu?Ala?Trp?Leu?Leu?Ala?Arg
1250????????????????1255????????????????1260Gln?Arg?Ala?Asn?Val?Ser?Leu?Glu?Glu?Leu?Arg?Val?Ile?Thr?Pro?Ile1265????????????????1270????????????????1275????????????????1280Ser?Thr?Ser?Thr?Asn?Leu?Ala?His?Arg?Leu?Arg?Asp?Arg?Ser?Thr?Gln
1285????????????????1290????????????????1295Val?Lys?Tyr?Ser?Gly?Thr?Ser?Leu?Val?Arg?Val?Ala?Arg?Tyr?Thr?Thr
1300????????????????1305????????????????1310Ile?Ser?Asn?Asp?Asn?Leu?Ser?Phe?Val?Ile?Ser?Asp?Lys?Lys?Val?Asp
1315????????????????1320????????????????1325Thr?Asn?Phe?Ile?Tyr?Gln?Gln?Gly?Met?Leu?Leu?Gly?Leu?Gly?Val?Leu
1330????????????????1335????????????????1340Glu?Thr?Leu?Phe?Arg?Leu?Glu?Lys?Asp?Thr?Gly?Ser?Ser?Asn?Thr?Val1345????????????????1350????????????????1355????????????????1360Leu?His?Leu?His?Val?Glu?Thr?Asp?Cys?Cys?Val?Ile?Pro?Met?Ile?Asp
1365????????????????1370????????????????1375His?Pro?Arg?Ile?Pro?Ser?Ser?Arg?Lys?Leu?Glu?Leu?Arg?Ala?Glu?Leu
1380????????????????1385????????????????1390Cys?Thr?Asn?Pro?Leu?Ile?Tyr?Asp?Asn?Ala?Pro?Leu?Ile?Asp?Arg?Asp
1395????????????????1400????????????????1405Thr?Thr?Arg?Leu?Tyr?Thr?Gln?Ser?His?Arg?Arg?His?Leu?Val?Glu?Phe
1410????????????????1415????????????????1420Val?Thr?Trp?Ser?Thr?Pro?Gln?Leu?Tyr?His?Ile?Leu?Ala?Lys?Ser?Thr1425????????????????1430????????????????1435????????????????1440Ala?Leu?Ser?Met?Ile?Asp?Leu?Val?Thr?Lys?Phe?Glu?Lys?Asp?His?Met
1445????????????????1450????????????????1455Asn?Glu?Ile?Ser?Ala?Leu?Ile?Gly?Asp?Asp?Asp?Ile?Asn?Ser?Phe?Ile
1460????????????????1465????????????????1470Thr?Glu?Phe?Leu?Leu?Ile?Glu?Pro?Arg?Leu?Phe?Thr?Ile?Tyr?Leu?Gly
1475????????????????1480????????????????1485Gln?Cys?Ala?Ala?Ile?Asn?Trp?Ala?Phe?Asp?Val?His?Tyr?His?Arg?Pro
1490????????????????1495????????????????1500Ser?Gly?Lys?Tyr?Gln?Met?Gly?Glu?Leu?Leu?Ser?Ser?Phe?Leu?Ser?Arg1505????????????????1510????????????????1515????????????????1520Met?Ser?Lys?Gly?Val?Phe?Lys?Val?Leu?Val?Asn?Ala?Leu?Ser?His?Pro
1525????????????????1530????????????????1535Lys?Ile?Tyr?Lys?Lys?Phe?Trp?His?Cys?Gly?Ile?Ile?Glu?Pro?Ile?His
1540????????????????1545????????????????1550Gly?Pro?Ser?Leu?Asp?Ala?Gln?Asn?Leu?His?Thr?Thr?Val?Cys?Asn?Met
1555????????????????1560????????????????1565Val?Tyr?Thr?Cys?Tyr?Met?Thr?Tyr?Leu?Asp?Leu?Leu?Leu?Asn?Glu?Glu
1570????????????????1575????????????????1580Leu?Glu?Glu?Phe?Thr?Phe?Leu?Leu?Cys?Glu?Ser?Asp?Glu?Asp?Val?Val1585????????????????1590????????????????1595????????????????1600Pro?Asp?Arg?Phe?Asp?Asn?Ile?Gln?Ala?Lys?His?Leu?Cys?Val?Leu?Ala
1605????????????????1610????????????????1615Asp?Leu?Tyr?Cys?Gln?Pro?Gly?Ala?Cys?Pro?Pro?Ile?Arg?Gly?Leu?Arg
1620????????????????1625????????????????1630Pro?Val?Glu?Lys?Cys?Ala?Val?Leu?Thr?Asp?His?Ile?Lys?Ala?Glu?Ala
1635????????????????1640????????????????1645Arg?Leu?Ser?Pro?Ala?Gly?Ser?Ser?Trp?Asn?Ile?Asn?Pro?Ile?Ile?Val
1650????????????????1655????????????????1660Asp?His?Tyr?Ser?Cys?Ser?Leu?Thr?Tyr?Leu?Arg?Arg?Gly?Ser?Ile?Lys1665????????????????1670????????????????1675????????????????1680Gln?Ile?Arg?Leu?Arg?Val?Asp?Pro?Gly?Phe?Ile?Phe?Asp?Ala?Leu?Ala
1685????????????????1690????????????????1695Glu?Val?Asn?Val?Ser?Gln?Pro?Lys?Ile?Gly?Ser?Asn?Asn?Ile?Ser?Asn
1700????????????????1705????????????????1710Met?Ser?Ile?Lys?Ala?Phe?Arg?Pro?Pro?His?Asp?Asp?Val?Ala?Lys?Leu
1715????????????????1720????????????????1725Leu?Lys?Asp?Ile?Asn?Thr?Ser?Lys?His?Asn?Leu?Pro?Ile?Ser?Gly?Gly
1730????????????????1735????????????????1740Asn?Leu?Ala?Asn?Tyr?Glu?Ile?His?Ala?Phe?Arg?Arg?Ile?Gly?Leu?Asn1745????????????????1750????????????????1755????????????????1760Ser?Ser?Ala?Cys?Tyr?Lys?Ala?Val?Glu?Ile?Ser?Thr?Leu?Ile?Arg?Arg
1765????????????????1770????????????????1775Cys?Leu?Glu?Pro?Gly?Glu?Asp?Gly?Leu?Phe?Leu?Gly?Glu?Gly?Ser?Gly
1780????????????????1785????????????????1790Ser?Met?Leu?Ile?Thr?Tyr?Lys?Glu?Ile?Leu?Lys?Leu?Asn?Lys?Cys?Phe
1795????????????????1800????????????????1805Tyr?Asn?Ser?Gly?Val?Ser?Ala?Asn?Ser?Arg?Ser?Gly?Gln?Arg?Glu?Leu
1810????????????????1815????????????????1820Ala?Pro?Tyr?Pro?Ser?Glu?Val?Gly?Leu?Val?Glu?His?Arg?Met?Gly?Val1825????????????????1830????????????????1835????????????????1840Gly?Asn?Ile?Val?Lys?Val?Leu?Phe?Asn?Gly?Arg?Pro?Glu?Val?Thr?Trp
1845????????????????1850????????????????1855Val?Gly?Ser?Val?Asp?Cys?Phe?Asn?Phe?Ile?Val?Ser?Asn?Ile?Pro?Thr
1860????????????????1865????????????????1870Ser?Ser?Val?Gly?Phe?Ile?His?Ser?Asp?Ile?Glu?Thr?Leu?Pro?Asn?Lys
1875????????????????1880????????????????1885Asp?Thr?Ile?Glu?Lys?Leu?Glu?Glu?Leu?Ala?Ala?Ile?Leu?Ser?Met?Ala
1890????????????????1895????????????????1900Leu?Leu?Leu?Gly?Lys?Ile?Gly?Ser?Ile?Leu?Val?Ile?Lys?Leu?Met?Pro1905????????????????1910????????????????1915????????????????1920Phe?Ser?Gly?Asp?Phe?Val?Gln?Gly?Phe?Ile?Ser?Tyr?Val?Gly?Ser?Tyr
1925????????????????1930????????????????1935Tyr?Arg?Glu?Val?Asn?Leu?Val?Tyr?Pro?Arg?Tyr?Ser?Asn?Phe?Ile?Ser
1940????????????????1945????????????????1950Thr?Glu?Ser?Tyr?Leu?Val?Met?Thr?Asp?Leu?Lys?Ala?Asn?Arg?Leu?Met
1955????????????????1960????????????????1965
Asn?Pro?Glu?Lys?Ile?Lys?Gln?Gln?Ile?Ile?Glu?Ser?Ser?Val?Arg?Thr
1970????????????????1975???????????????1980
Ser?Pro?Gly?Leu?Ile?Gly?His?Ile?Leu?Ser?Ile?Lys?Gln?Leu?Ser?Cys
1985????????????????1990????????????????1995????????????????2000
Ile?Gln?Ala?Ile?Val?Gly?Asp?Ala?Val?Ser?Arg?Gly?Asp?Ile?Asn?Pro
2005????????????????2010????????????????2015
Thr?Leu?Lys?Lys?Leu?Thr?Pro?Ile?Glu?Gln?Val?Leu?Ile?Asn?Cys?Gly
2020????????????????2025????????????????2030
Leu?Ala?Ile?Asn?Gly?Pro?Lys?Leu?Cys?Lys?Glu?Leu?Ile?His?His?Asp
2035????????????????2040????????????????2045
Val?Ala?Ser?Gly?Gln?Asp?Gly?Leu?Leu?Asn?Ser?Ile?Leu?Ile?Leu?Tyr
2050????????????????2055????????????????2060
Arg?Glu?Leu?Ala?Arg?Phe?Lys?Asp?Asn?Arg?Arg?Ser?Gln?Gln?Gly?Met
2065????????????????2070????????????????2075????????????????2080
Phe?His?Ala?Tyr?Pro?Val?Leu?Val?Ser?Ser?Arg?Gln?Arg?Glu?Leu?Ile
2085????????????????2090????????????????2095
Ser?Arg?Ile?Thr?Arg?Lys?Phe?Trp?Gly?His?Ile?Leu?Leu?Tyr?Ser?Gly
2100????????????????2105????????????????2110
Asn?Arg?Lys?Leu?Ile?Asn?Lys?Phe?Ile?Gln?Asn?Leu?Lys?Ser?Gly?Tyr
2115????????????????2120????????????????2125
Leu?Ile?Leu?Asp?Leu?His?Gln?Asn?Ile?Phe?Val?Lys?Asn?Leu?Ser?Lys
2130????????????????2135????????????????2140
Ser?Glu?Lys?Gln?Ile?Ile?Met?Thr?Gly?Gly?Leu?Lys?Arg?Glu?Trp?Val
2145????????????????2150????????????????2155????????????????2160
Phe?Lys?Val?Thr?Val?Lys?Glu?Thr?Lys?Glu?Trp?Tyr?Lys?Leu?Val?Gly
2165????????????????2170????????????????2175
Tyr?Ser?Ala?Leu?Ile?Lys?Asp
The information of 2180 (2) SEQ ID NO:17:
(i) sequence signature:
(A) length: 15462 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: ACCAAACAAG AGAAGAAACT TGTCTGGGAA TATAAATTTA ACTTTAAATT AACTTAGGAT 60 TAAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120 TATTTGATAC ATTTAATGCA CGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180 TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240 ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGAAAC 300 AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT GCCAATCCAG 360 AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGTCAA GTATGTCATA TACATGATTG 420 AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA GAGATGATAT 480 ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG GAAACTATGT 540 TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT GGGTATCCAT 600 CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT ATCACTAGTA 660 TCTCAGGGTT AAGAAAAGGC TTTTTCACCC GATTGGAAGC TTTCAGACAA GATGGAACAG 720 TGCAGGCAGG GCTGGTATTG AGCGGTGACA CAGTGGATCA GATTGGGTCA ATCATGCGGT 780 CTCAACAGAG CTTGGTAACT CTTATGGTTG AAACATTAAT AACAATGAAT ACCAGCAGAA 840 ATGACCTCAC AACCATAGAA AAGAATATAC AAATTGTTGG CAACTACATA AGAGATGCAG 900 GTCTCGCTTC ATTCTTCAAT ACAATCAGAT ATGGAATTGA GACCAGAATG GCAGCTTTGA 960 CTCTATCCAC TCTCAGACCA GATATCAATA GATTAAAAGC TTTGATGGAA CTGTATTTAT 1020 CAAAGGGACC ACGCGCTCCT TTCATCTGTA TCCTCAGAGA TCCTATACAT GGTGAGTTCG 1080 CACCAGGCAA CTATCCTGCC ATATGGAGCT ATGCAATGGG GGTGGCAGTT GTACAAAATA 1140 GAGCCATGCA ACAGTATGTG ACGGGAAGAT CATATCTAGA CATTGATATG TTCCAGCTAG 1200 GACAAGCAGT AGCACGTGAT GCCGAAGCTC AAATGAGCTC AACACTGGAA GATGAACTTG 1260 GAGTGACACA CGAATCTAAA GAAAGCTTGA AGAGACATAT AAGGAACATA AACAGTTCAG 1320 AGACATCTTT CCACAAACCG ACAGGTGGAT CAGCCATAGA GATGGCAATA GATGAAGAGC 1380 CAGAACAATT CGAACATAGA GCAGATCAAG AACAAAATGG AGAACCTCAA TCATCCATAA 1440 TTCAATATGC CTGGGCAGAA GGAAATAGAA GCGATGATCA GACTGAGCAA GCTACAGAAT 1500 CTGACAATAT CAAGACCGAA CAACAAAACA TCAGAGACAG ACTAAACAAG AGACTCAACG 1560 ACAAGAAGAA ACAAAGCAGT CAACCACCCA CTAATCCCAC AAACAGAACA AACCAGGACG 1620 AAATAGATGA TCTGTTTAAC GCATTTGGAA GCAACTAATC GAATCAACAT TTTAATCTAA 1680 ATCAATAATA AATAAGAAAA ACTTAGGATT AAAGAATCCT ATCATACCGG AATATAGGGT 1740 GGTAAATTTA GAGTCTGCTT GAAACTCAAT CAATAGAGAG TTGATGGAAA GCGATGCTAA 1800 AAACTATCAA ATCATGGATT CTTGGGAAGA GGAATCAAGA GATAAATCAA CTAATATCTC 1860 CTCGGCCCTC AACATCATTG AATTCATACT CAGCACCGAC CCCCAAGAAG ACTTATCGGA 1920 AAACGACACA ATCAACACAA GAACCCAGCA ACTCAGTGCC ACCATCTGTC AACCAGAAAT 1980 CAAACCAACA GAAACAAGTG AGAAAGATAG TGGATCAACT GACAAAAATA GACAGTCCGG 2040 GTCATCACAC GAATGTACAA CAGAAGCAAA AGATAGAAAT ATTGATCAGG AAACTGTACA 2100 GAGAGGACCT GGGAGAAGAA GCAGCTCAGA TAGTAGAGCT GAGACTGTGG TCTCTGGAGG 2160 AATCCCCAGA AGCATCACAG ATTCTAAAAA TGGAACCCAA AACACGGAGG ATATTGATCT 2220 CAATGAAATT AGAAAGATGG ATAAGGACTC TATTGAGGGG AAAATGCGAC AATCTGCAAA 2280 TGTTCCAAGC GAGATATCAG GAAGTGATGA CATATTTACA ACAGAACAAA GTAGAAACAG 2340 TGATCATGGA AGAAGCCTGG AATCTATCAG TACACCTGAT ACAAGATCAA TAAGTGTTGT 2400 TACTGCTGCA ACACCAGATG ATGAAGAAGA AATACTAATG AAAAATAGTA GGACAAAGAA 2460 AAGTTCTTCA ACACATCAAG AAGATGACAA AAGAATTAAA AAAGGGGGAA AAGGGAAAGA 2520 CTGGTTTAAG AAATCAAAAG ATACCGACAA CCAGATACCA ACATCAGACT ACAGATCCAC 2580 ATCAAAAGGG CAGAAGAAAA TCTCAAAGAC AACAACCACC AACACCGACA CAAAGGGGCA 2640 AACAGAAATA CAGACAGAAT CATCAGAAAC ACAATCCTCA TCATGGAATC TCATCATCGA 2700 CAACAACACC GACCGGAACG AACAGACAAG CACAACTCCT CCAACAACAA CTTCCAGATC 2760 AACTTATACA AAAGAATCGA TCCGAACAAA CTCTGAATCC AAACCCAAGA CACAAAAGAC 2820 AAATGGAAAG GAAAGGAAGG ATACAGAAGA GAGCAATCGA TTTACAGAGA GGGCAATTAC 2880 TCTATTGCAG AATCTTGGTG TAATTCAATC CACATCAAAA CTAGATTTAT ATCAAGACAA 2940 ACGAGTTGTA TGTGTAGCAA ATGTACTAAA CAATGTAGAT ACTGCATCAA AGATAGATTT 3000 CCTGGCAGGA TTAGTCATAG GGGTTTCAAT GGACAACGAC ACAAAATTAA CACAGATACA 3060 AAATGAAATG CTAAACCTCA AAGCAGATCT AAAGAAAATG GACGAATCAC ATAGAAGATT 3120 GATAGAAAAT CAAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 3180 TATGACTGAG AGAGGAGGAA AGAAAGACCA AAATGAATCC AATGAGAGAG TATCCATGAT 3240 CAAAACAAAA TTGAAAGAAG AAAAGATCAA GAAGACCAGG TTTGACCCAC TTATGGAGGC 3300 ACAAGGCATT GACAAGAATA TACCCGATCT ATATCGACAT GCAGGAGATA CACTAGAGAA 3360 CGATGTACAA GTTAAATCAG AGATATTAAG TTCATACAAT GAGTCAAATG CAACAAGACT 3420 AATACCCAAA AAAGTGAGCA GTACAATGAG ATCACTAGTT GCAGTCATCA ACAACAGCAA 3480 TCTCTCACAA AGCACAAAAC AATCATACAT AAACGAACTC AAACGTTGCA AAAATGATGA 3540 AGAAGTATCT GAATTAATGG ACATGTTCAA TGAAGATGTC AACAATTGCC AATGATCCAA 3600 CAAAGAAACG ACACCGAACA AACAGACAAG AAACAACAGT AGATCAAAAC CTGTCAACAC 3660 ACACAAAATC AAGCAGAATG AAACAACAGA TATCAATCAA TATACAAATA AGAAAAACTT 3720 AGGATTAAAG AATAAATTAA TCCTTGTCCA AAATGAGTAT AACTAACTCT GCAATATACA 3780 CATTCCCAGA ATCATCATTC TCTGAAAATG GTCATATAGA ACCATTACCA CTCAAAGTCA 3840 ATGAACAGAG GAAAGCAGTA CCCCACATTA GAGTTGCCAA GATCGGAAAT CCACCAAAAC 3900 ACGGATCCCG GTATTTAGAT GTCTTCTTAC TCGGCTTCTT CGAGATGGAA CGAATCAAAG 3960 ACAAATACGG GAGTGTGAAT GATCTCGACA GTGACCCGAG TTACAAAGTT TGTGGCTCTG 4020 GATCATTACC AATCGGATTG GCTAAGTACA CTGGGAATGA CCAGGAATTG TTACAAGCCG 4080 CAACCAAACT GGATATAGAA GTGAGAAGAA CAGTCAAAGC GAAAGAGATG GTTGTTTACA 4140 CGGTACAAAA TATAAAACCA GAACTGTACC CATGGTCCAA TAGACTAAGA AAAGGAATGC 4200 TGTTCGATGC CAACAAAGTT GCTCTTGCTC CTCAATGTCT TCCACTAGAT AGGAGCATAA 4260 AATTTAGAGT AATCTTCGTG AATTGTACGG CAATTGGATC AATAACCTTG TTCAAAATTC 4320 CTAAGTCAAT GGCATCACTA TCTCTACCCA ACACAATATC AATCAATCTG CAGGTACACA 4380 TAAAAACAGG GGTTCAGACT GATTCTAAAG GGATAGTTCA AATTTTGGAT GAGAAAGGCG 4440 AAAAATCACT GAATTTCATG GTCCATCTCG GATTGATCAA AAGAAAAGTA GGCAGAATGT 4500 ACTCTGTTGA ATACTGTAAA CAGAAAATCG AGAAAATGAG ATTGATATTT TCTTTAGGAC 4560 TAGTTGGAGG AATCAGTCTT CATGTCAATG CAACTGGGTC CATATCAAAA ACACTAGCAA 4620 GTCAGCTGGT ATTCAAAAGA GAGATTTGTT ATCCTTTAAT GGATCTAAAT CCGCATCTCA 4680 ATCTAGTTAT CTGGGCTTCA TCAGTAGAGA TTACAAGAGT GGATGCAATT TTCCAACCTT 4740 CTTTACCTGG CGAGTTCAGA TACTATCCTA ATATTATTGC AAAAGGAGTT GGGAAAATCA 4800 AACAATGGAA CTAGTAATCT CTATTTTAGT CCGGACGTAT CTATTAAGCC GAAGCAAATA 4860 AAGGATAATC AAAAACTTAG GACAAAAGAG GTCAATACCA ACAACTATTA GCAGTCACAC 4920 TCGCAAGAAT AAGAGAGAAG GGACCAAAAA AGTCAAATAG GAGAAATCAA AACAAAAGGT 4980 ACAGAACACC AGAACAACAA AATCAAAACA TCCAACTCAC TCAAAACAAA AATTCCAAAA 5040 GAGACCGGCA ACACAACAAG CACTGAACAC AATGCCAACT TCAATACTGC TAATTATTAC 5100 AACCATGATC ATGGCATCTT TCTGCCAAAT AGATATCACA AAACTACAGC ACGTAGGTGT 5160 ATTGGTCAAC AGTCCCAAAG GGATGAAGAT ATCACAAAAC TTTGAAACAA GATATCTAAT 5220 TTTGAGCCTC ATACCAAAAA TAGAAGACTC TAACTCTTGT GGTGACCAAC AGATCAAGCA 5280 ATACAAGAAG TTATTGGATA GACTGATCAT CCCTTTATAT GATGGATTAA GATTACAGAA 5340 AGATGTGATA GTAACCAATC AAGAATCCAA TGAAAACACT GATCCCAGAA CAAAACGATT 5400 CTTTGGAGGG GTAATTGGAA CCATTGCTCT GGGAGTAGCA ACCTCAGCAC AAATTACAGC 5460 GGCAGTTGCT CTGGTTGAAG CCAAGCAGGC AAGATCAGAC ATCGAAAAAC TCAAAGAAGC 5520 AATTAGGGAC ACAAACAAAG CAGTGCAGTC AGTTCAGAGC TCCATAGGAA ATTTAATAGT 5580 AGCAATTAAA TCAGTCCAGG ATTATGTTAA CAAAGAAATC GTGCCATCGA TTGCGAGGCT 5640 AGGTTGTGAA GCAGCAGGAC TTCAATTAGG AATTGCATTA ACACAGCATT ACTCAGAATT 5700 AACAAACATA TTTGGTGATA ACATAGGATC GTTACAAGAA AAAGGAATAA AATTACAAGG 5760 TATAGCATCA TTATACCGCA CAAATATCAC AGAAATATTC ACAACATCAA CAGTTGATAA 5820 ATATGATATC TATGATCTGT TATTTACAGA ATCAATAAAG GTGAGAGTTA TAGATGTTGA 5880 CTTGAATGAT TACTCAATCA CCCTCCAAGT CAGACTCCCT TTATTAACTA GGCTGCTGAA 5940 CACTCAGATC TACAAAGTAG ATTCCATATC ATATAACATC CAAAACAGAG AATGGTATAT 6000 CCCTCTTCCC AGCCATATCA TGACGAAAGG GGCATTTCTA GGTGGAGCAG ACGTCAAAGA 6060 ATGTATAGAA GCATTCAGCA GCTATATATG CCCTTCTGAT CCAGGATTTG TATTAAACCA 6120 TGAAATAGAG AGCTGCTTAT CAGGAAACAT ATCCCAATGT CCAAGAACAA CGGTCACATC 6180 AGACATTGTT CCAAGATATG CATTTGTCAA TGGAGGAGTG GTTGCAAACT GTATAACAAC 6240 CACCTGTACA TGCAACGGAA TTGGTAATAG AATCAATCAA CCACCTGATC AAGGAGTAAA 6300 AATTATAACA CATAAAGAAT GTAGTACAAT AGGTATCAAC GGAATGCTGT TCAATACAAA 6360 TAAAGAAGGA ACTCTTGCAT TCTATACACC AAATGATATA ACACTAAACA ATTCTGTTGC 6420 ACTTGATCCA ATTGACATAT CAATCGAGCT CAACAAGGCC AAATCAGATC TAGAAGAATC 6480 AAAAGAATGG ATAAGAAGGT CAAATCAAAA ACTAGATTCT ATTGGAAATT GGCATCAATC 6540 TAGCACTACA ATCATAATTA TTTTGATAAT GATCATTATA TTGTTTATAA TTAATATAAC 6600 GATAATTACA ATTGCAATTA AGTATTACAG AATTCAAAAG AGAAATCGAG TGGATCAAAA 6660 TGACAAGCCA TATGTACTAA CAAACAAATA ACATATCTAC AGATCATTAG ATATTAAAAT 6720 TATAAAAAAC TTAGGAGTAA AGTTACGCAA TCCAACTCTA CTCATATAAT TGAGGAAGGA 6780 CCCAATAGAC AAATCCAAAT TCGAGATGGA ATACTGGAAG CATACCAATC ACGGAAAGGA 6840 TGCTGGTAAT GAGCTGGAGA CGTCTATGGC TACTCATGGC AACAAGCTCA CTAATAAGAT 6900 AATATACATA TTATGGACAA TAATCCTGGT GTTATTATCA ATAGTCTTCA TCATAGTGCT 6960 AATTAATTCC ATCAAAAGTG AAAAGGCCCA CGAATCATTG CTGCAAGACA TAAATAATGA 7020 GTTTATGGAA ATTACAGAAA AGATCCAAAT GGCATCGGAT AATACCAATG ATCTAATACA 7080 GTCAGGAGTG AATACAAGGC TTCTTACAAT TCAGAGTCAT GTCCAGAATT ACATACCAAT 7140 ATCATTGACA CAACAGATGT CAGATCTTAG GAAATTCATT AGTGAAATTA CAATTAGAAA 7200 TGATAATCAA GAAGTGCTGC CACAAAGAAT AACACATGAT GTAGGTATAA AACCTTTAAA 7260 TCCAGATGAT TTTTGGAGAT GCACGTCTGG TCTTCCATCT TTAATGAAAA CTCCAAAAAT 7320 AAGGTTAATG CCAGGGCCGG GATTATTAGC TATGCCAACG ACTGTTGATG GCTGTGTTAG 7380 AACTCCGTCT TTAGTTATAA ATGATCTGAT TTATGCTTAT ACCTCAAATC TAATTACTCG 7440 AGGTTGTCAG GATATAGGAA AATCATATCA AGTCTTACAG ATAGGGATAA TAACTGTAAA 7500 CTCAGACTTG GTACCTGACT TAAATCCTAG GATCTCTCAT ACCTTTAACA TAAATGACAA 7560 TAGGAAGTCA TGTTCTCTAG CACTCCTAAA TACAGATGTA TATCAACTGT GTTCAACTCC 7620 CAAAGTTGAT GAAAGATCAG ATTATGCATC ATCAGGCATA GAAGATATTG TACTTGATAT 7680 TGTCAATTAT GATGGTTCAA TCTCAACAAC AAGATTTAAG AATAATAACA TAAGCTTTGA 7740 TCAACCATAT GCTGCACTAT ACCCATCTGT TGGACCAGGG ATATACTACA AAGGCAAAAT 7800 AATATTTCTC GGGTATGGAG GTCTTGAACA TCCAATAAAT GAGAATGTAA TCTGCAACAC 7860 AACTGGGTGC CCCGGGAAAA CACAGAGAGA CTGTAATCAA GCGTCTCATA GTCCATGGTT 7920 TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTTGTTGAC AAAGGCTTAA ACTCAATTCC 7980 AAAATTGAAA GTATGGACGA TATCTATGCG ACAAAATTAC TGGGGGTCAG AAGGAAGGTT 8040 ACTTCTACTA GGTAACAAGA TCTATATATA TACAAGATCT ACAAGTTGGC ATAGCAAGTT 8100 ACAATTAGGA ATAATTGATA TTACTGATTA CAGTGATATA AGGATAAAAT GGACATGGCA 8160 TAATGTGCTA TCAAGACCAG GAAACAATGA ATGTCCATGG GGACATTCAT GTCCAGATGG 8220 ATGTATAACA GGAGTATATA CTGATGCATA TCCACTCAAT CCCACAGGGA GCATTGTGTC 8280 ATCTGTCATA TTAGACTCAC AAAAATCGAG AGTGAACCCA GTCATAACTT ACTCAACAGC 8340 AACCGAAAGA GTAAACGAGC TGGCCATCCT AAACAGAACA CTCTCAGCTG GATATACAAC 8400 AACAAGCTGC ATTACACACT ATAACAAAGG ATATTGTTTT CATATAGTAG AAATAAATCA 8460 TAAAAGCTTA AACACATTTC AACCCATGTT GTTCAAAACA GAGATTCCAA AAAGCTGCAG 8520 TTAATCATAA TTAACCATAA TATGCATCAA TCTATCTATA ATACAAGTAT ATGATAAGTA 8580 ATCAGCAATC AGACAATAGA CAAAAGGGAA ATATAAAAAA CTTAGGAGCA AAGCGTGCTC 8640 GGGAAATGGA CACTGAATCT AACAATGGCA CTGTATCTGA CATACTCTAT CCTGAGTGTC 8700 ACCTTAACTC TCCTATCGTT AAAGGTAAAA TAGCACAATT ACACACTATT ATGAGTCTAC 8760 CTCAGCCTTA TGATATGGAT GACGACTCAA TACTAGTTAT CACTAGACAG AAAATAAAAC 8820 TTAATAAATT GGATAAAAGA CAACGATCTA TTAGAAGATT AAAATTAATA TTAACTGAAA 8880 AAGTGAATGA CTTAGGAAAA TACACATTTA TCAGATATCC AGAAATGTCA AAAGAAATGT 8940 TCAAATTATA TATACCTGGT ATTAACAGTA AAGTGACTGA ATTATTACTT AAAGCAGATA 9000 GAACATATAG TCAAATGACT GATGGATTAA GAGATCTATG GATTAATGTG CTATCAAAAT 9060 TAGCCTCAAA AAATGATGGA AGCAATTATG ATCTTAATGA AGAAATTAAT AATATATCGA 9120 AAGTTCACAC AACCTATAAA TCAGATAAAT GGTATAATCC ATTCAAAACA TGGTTTACTA 9180 TCAAGTATGA TATGAGAAGA TTACAAAAAG CTCGAAATGA GATCACTTTT AATGTTGGGA 9240 AGGATTATAA CTTGTTAGAA GACCAGAAGA ATTTCTTATT GATACATCCA GAATTGGTTT 9300 TGATATTAGA TAAACAAAAC TATAATGGTT ATCTAATTAC TCCTGAATTA GTATTGATGT 9360 ATTGTGACGT AGTCGAAGGC CGATGGAATA TAAGTGCATG TGCTAAGTTA GATCCAAAAT 9420 TACAATCTAT GTATCAGAAA GGTAATAACC TGTGGGAAGT GATAGATAAA TTGTTTCCAA 9480 TTATGGGAGA AAAGACATTT GATGTGATAT CGTTATTAGA ACCACTTGCA TTATCCTTAA 9540 TTCAAACTCA TGATCCTGTT AAACAACTAA GAGGAGCTTT TTTAAATCAT GTGTTATCCG 9600 AGATGGAATT AATATTTGAA TCTAGAGAAT CGATTAAGGA ATTTCTGAGT GTAGATTACA 9660 TTGATAAAAT TTTAGATATA TTTAATAAGT CTACAATAGA TGAAATAGCA GAGATTTTCT 9720 CTTTTTTTAG AACATTTGGG CATCCTCCAT TAGAAGCTAG TATTGCAGCA GAAAAGGTTA 9780 GAAAATATAT GTATATTGGA AAACAATTAA AATTTGACAC TATTAATAAA TGTCATGCTA 9840 TCTTCTGTAC AATAATAATT AACGGATATA GAGAGAGGCA TGGTGGACAG TGGCCTCCTG 9900 TGACATTACC TGATCATGCA CACGAATTCA TCATAAATGC TTACGGTTCA AACTCTGCGA 9960 TATCATATGA AAATGCTGTT GATTATTACC AGAGCTTTAT AGGAATAAAA TTCAATAAAT 10020 TCATAGAGCC TCAGTTAGAT GAGGATTTGA CAATTTATAT GAAAGATAAA GCATTATCTC 10080 CAAAAAAATC AAATTGGGAC ACAGTTTATC CTGCATCTAA TTTACTGTAC CGTACTAACG 10140 CATCCAACGA ATCACGAAGA TTAGTTGAAG TATTTATAGC AGATAGTAAA TTTGATCCTC 10200 ATCAGATATT GGATTATGTA GAATCTGGGG ACTGGTTAGA TGATCCAGAA TTTAATATTT 10260 CTTATAGTCT TAAAGAAAAA GAGATCAAAC AGGAAGGTAG ACTCTTTGCA AAAATGACAT 10320 ACAAAATGAG AGCTACACAA GTTTTATCAG AGACACTACT TGCAAATAAC ATAGGAAAAT 10380 TCTTTCAAGA AAATGGGATG GTGAAGGGAG AGATTGAATT ACTTAAGAGA TTAACAACCA 10440 TATCAATATC AGGAGTTCCA CGGTATAATG AAGTGTACAA TAATTCTAAA AGCCATACAG 10500 ATGACCTTAA AACCTACAAT AAAATAAGTA ATCTTAATTT GTCTTCTAAT CAGAAATCAA 10560 AGAAATTTGA ATTCAAGTCA ACGGATATCT ACAATGATGG ATACGAGACT GTGAGCTGTT 10620 TCCTAACAAC AGATCTCAAA AAATACTGTC TTAATTGGAG ATATGAATCA ACAGCTCTAT 10680 TTGGAGAAAC TTGCAACCAA ATATTTGGAT TAAATAAATT GTTTAATTGG TTACACCCTC 10740 GTCTTGAAGG AAGTACAATC TATGTAGGTG ATCCTTACTG TCCTCCATCA GATAAAGAAC 10800 ATATATCATT AGAGGATCAC CCTGATTCTG GTTTTTACGT TCATAACCCA AGAGGGGGTA 10860 TAGAAGGATT TTGTCAAAAA TTATGGACAC TCATATCTAT AAGTGCAATA CATCTAGCAG 10920 CTGTTAGAAT AGGCGTGAGG GTGACTGCAA TGGTTCAAGG AGACAATCAA GCTATAGCTG 10980 TAACCACAAG AGTACCCAAC AATTATGACT ACAGAGTTAA GAAGGAGATA GTTTATAAAG 11040 ATGTAGTGAG ATTTTTTGAT TCATTAAGAG AAGTGATGGA TGATCTAGGT CATGAACTTA 11100 AATTAAATGA AACGATTATA AGTAGCAAGA TGTTCATATA TAGCAAAAGA ATCTATTATG 11160 ATGGGAGAAT TCTTCCTCAA GCTCTAAAAG CATTATCTAG ATGTGTCTTC TGGTCAGAGA 11220 CAGTAATAGA CGAAACAAGA TCAGCATCTT CAAATTTGGC AACATCATTT GCAAAAGCAA 11280 TTGAGAATGG TTATTCACCT GTTCTAGGAT ATGCATGCTC AATTTTTAAG AACATTCAAC 11340 AACTATATAT TGCCCTTGGG ATGAATATCA ATCCAACTAT AACACAGAAT ATCAGAGATC 11400 AGTATTTTAG GAATCCAAAT TGGATGCAAT ATGCCTCTTT AATACCTGCT AGTGTTGGGG 11460 GATTCAATTA CATGGCCATG TCAAGATGTT TTGTAAGGAA TATTGGTGAT CCATCAGTTG 11520 CCGCATTGGC TGATATTAAA AGATTTATTA AGGCGAATCT ATTAGACCGA AGTGTTCTTT 11580 ATAGGATTAT GAATCAAGAA CCAGGTGAGT CATCTTTTTT GGACTGGGCT TCAGATCCAT 11640 ATTCATGCAA TTTACCACAA TCTCAAAATA TAACCACCAT GATAAAAAAT ATAACAGCAA 11700 GGAATGTATT ACAAGATTCA CCAAATCCAT TATTATCTGG ATTATTCACA AATACAATGA 11760 TAGAAGAAGA TGAAGAATTA GCTGAGTTCC TGATGGACAG GAAGGTAATT CTCCCTAGAG 11820 TTGCACATGA TATTCTAGAT AATTCTCTCA CAGGAATTAG AAATGCCATA GCTGGAATGT 11880 TAGATACGAC AAAATCACTA ATTCGGGTTG GCATAAATAG AGGAGGACTG ACATATAGTT 11940 TGTTGAGGAA AATCAGTAAT TACGATCTAG TACAATATGA AACACTAAGT AGGACTTTGC 12000 GACTAATTGT AAGTGATAAA ATCAAGTATG AAGATATGTG TTCGGTAGAC CTTGCCATAG 12060 CATTGCGACA AAAGATGTGG ATTCATTTAT CAGGAGGAAG GATGATAAGT GGACTTGAAA 12120 CGCCTGACCC ATTAGAATTA CTATCTGGGG TAGTAATAAC AGGATCAGAA CATTGTAAAA 12180 TATGTTATTC TTCAGATGGC ACAAACCCAT ATACTTGGAT GTATTTACCC GGTAATATCA 12240 AAATAGGATC AGCAGAAACA GGTATATCGT CATTAAGAGT TCCTTATTTT GGATCAGTCA 12300 CTGATGAAAG ATCTGAAGCA CAATTAGGAT ATATCAAGAA TCTTAGTAAA CCTGCAAAAG 12360 CCGCAATAAG AATAGCAATG ATATATACAT GGGCATTTGG TAATGATGAG ATATCTTGGA 12420 TGGAAGCCTC ACAGATAGCA CAAACACGTG CAAATTTTAC ACTAGATAGT CTCAAAATTT 12480 TAACACCGGT AGCTACATCA ACAAATTTAT CACACAGATT AAAGGATACT GCAACTCAGA 12540 TGAAATTCTC CAGTACATCA TTGATCAGAG TCAGCAGATT CATAACAATG TCCAATGATA 12600 ACATGTCTAT CAAAGAAGCT AATGAAACCA AAGATACTAA TCTTATTTAT CAACAAATAA 12660 TGTTAACAGG ATTAAGTGTT TTCGAATATT TATTTAGATT AAAAGAAACC ACAGGACACA 12720 ACCCTATAGT TATGCATCTG CACATAGAAG ATGAGTGTTG TATTAAAGAA AGTTTTAATG 12780 ATGAACATAT TAATCCAGAG TCTACATTAG AATTAATTCG ATATCCTGAA AGTAATGAAT 12840 TTATTTATGA TAAAGACCCA CTCAAAGATG TGGACTTATC AAAACTTATG GTTATTAAAG 12900 ACCATTCTTA CACAATTGAT ATGAATTATT GGGATGATAC TGACATCATA CATGCAATTT 12960 CAATATGTAC TGCAATTACA ATAGCAGATA CTATGTCACA ATTAGATCGA GATAATTTAA 13020 AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG CTTAATCACT GAATTTTTGA 13080 CTCTTGACAT ACTTGTATTT CTCAAGACAT TTGGTGGATT ATTAGTAAAT CAATTTGCAT 13140 ACACTCTTTA TAGTCTAAAA ATAGAAGGTA GGGATCTCAT TTGGGATTAT ATAATGAGAA 13200 CACTGAGAGA TACTTCCCAT TCAATATTAA AAGTATTATC TAATGCATTA TCTCATCCTA 13260 AAGTATTCAA GAGGTTCTGG GATTGTGGAG TTTTAAACCC TATTTATGGT CCTAATACTG 13320 CTAGTCAAGA CCAGATAAAA CTTGCCCTAT CTATATGTGA ATATTCACTA GATCTATTTA 13380 TGAGAGAATG GTTGAATGGT GTATCACTTG AAATATACAT TTGTGACAGC GATATGGAAG 13440 TTGCAAATGA TAGGAAACAA GCCTTTATTT CTAGACACCT TTCATTTGTT TGTTGTTTAG 13500 CAGAAATTGC ATCTTTCGGA CCTAACCTGT TAAACTTAAC ATACTTGGAG AGACTTGATC 13560 TATTGAAACA ATATCTTGAA TTAAATATTA AAGAAGACCC TACTCTTAAA TATGTACAAA 13620 TATCTGGATT ATTAATTAAA TCGTTCCCAT CAACTGTAAC ATACGTAAGA AAGACTGCAA 13680 TCAAATATCT AAGGATTCGC GGTATTAGTC CACCTGAGGT AATTGATGAT TGGGATCCGG 13740 TAGAAGATGA AAATATGCTG GATAACATTG TCAAAACTAT AAATGATAAC TGTAATAAAG 13800 ATAATAAAGG GAATAAAATT AACAATTTCT GGGGACTAGC ACTTAAGAAC TATCAAGTCC 13860 TTAAAATCAG ATCTATAACA AGTGATTCTG ATGATAATGA TAGACTAGAT GCTAATACAA 13920 GTGGTTTGAC ACTTCCTCAA GGAGGGAATT ATCTATCGCA TCAATTGAGA TTATTCGGAA 13980 TCAACAGCAC TAGTTGTCTG AAAGCTCTTG AGTTATCACA AATTTTAATG AAGGAAGTCA 14040 ATAAAGACAA GGACAGGCTC TTCCTGGGAG AAGGAGCAGG AGCTATGCTA GCATGTTATG 14100 ATGCCACATT AGGACCTGCA GTTAATTATT ATAATTCAGG TTTGAATATA ACAGATGTAA 14160 TTGGTCAACG AGAATTGAAA ATATTTCCTT CAGAGGTATC ATTAGTAGGT AAAAAATTAG 14220 GAAATGTGAC ACAGATTCTT AACAGGGTAA AAGTACTGTT CAATGGGAAT CCTAATTCAA 14280 CATGGATAGG AAATATGGAA TGTGAGAGCT TAATATGGAG TGAATTAAAT GATAAGTCCA 14340 TTGGATTAGT ACATTGTGAT ATGGAAGGAG CTATCGGTAA ATCAGAAGAA ACTGTTCTAC 14400 ATGAACATTA TAGTGTTATA AGAATTACAT ACTTGATTGG GGATGATGAT GTTGTTTTAG 14460 TTTCCAAAAT TATACCTACA ATCACTCCGA ATTGGTCTAG AATACTTTAT CTATATAAAT 14520 TATATTGGAA AGATGTAAGT ATAATATCAC TCAAAACTTC TAATCCTGCA TCAACAGAAT 14580 TATATCTAAT TTCGAAAGAT GCATATTGTA CTATAATGGA ACCTAGTGAA ATTGTTTTAT 14640 CAAAACTTAA AAGATTGTCA CTCTTGGAAG AAAATAATCT ATTAAAATGG ATCATTTTAT 14700 CAAAGAAGAG GAATAATGAA TGGTTACATC ATGAAATCAA AGAAGGAGAA AGAGATTATG 14760 GAATCATGAG ACCATATCAT ATGGCACTAC AAATCTTTGG ATTTCAAATC AATTTAAATC 14820 ATCTGGCGAA AGAATTTTTA TCAACCCCAG ATCTGACTAA TATCAACAAT ATAATCCAAA 14880 GTTTTCAGCG AACAATAAAG GATGTTTTAT TTGAATGGAT TAATATAACT CATGATGATA 14940 AGAGACATAA ATTAGGCGGA AGATATAACA TATTCCCACT GAAAAATAAG GGAAAGTTAA 15000 GACTGCTATC GAGAAGACTA GTATTAAGTT GGATTTCATT ATCATTATCG ACTCGATTAC 15060 TTACAGGTCG CTTTCCTGAT GAAAAATTTG AACATAGAGC ACAGACTGGA TATGTATCAT 15120 TAGCTGATAC TGATTTAGAA TCATTAAAGT TATTGTCGAA AAACATCATT AAGAATTACA 15180 GAGAGTGTAT AGGATCAATA TCATATTGGT TTCTAACCAA AGAAGTTAAA ATACTTATGA 15240 AATTGATTGG TGGTGCTAAA TTATTAGGAA TTCCCAGACA ATATAAAGAA CCCGAAGACC 15300 AGTTATTAGA AAACTACAAT CAACATGATG AATTTGATAT CGATTAAAAC ATAAATACAA 15360 TGAAGATATA TCCTAACCTT TATCTTTAAG CCTAGGAATA GACAAAAAGT AAGAAAAACA 15420 TGTAATATAT ATATACCAAA CAGAGTTCTT CTCTTGTTTG GT 15462 (2) SEQ ID NO: 18 information about: ...
(i) sequence signature:
(A) length: 2233 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:18:
Met?Asp?Thr?Glu?Ser?Asn?Asn?Gly?Thr?Val?Ser?Asp?Ile?Leu?Tyr?Pro
1???????????????5???????????????????10??????????????????15
Glu?Cys?His?Leu?Asn?Ser?Pro?Ile?Val?Lys?Gly?Lys?Ile?Ala?Gln?Leu
20??????????????????25??????????????????30
His?Thr?Ile?Met?Ser?Leu?Pro?Gln?Pro?Tyr?Asp?Met?Asp?Asp?Asp?Ser
35??????????????????40??????????????????45
Ile?Leu?Val?Ile?Thr?Arg?Gln?Lys?Ile?Lys?Leu?Asn?Lys?Leu?Asp?Lys
50??????????????????55??????????????????60Arg?Gln?Arg?Ser?Ile?Arg?Arg?Leu?Lys?Leu?Ile?Leu?Thr?Glu?Lys?Val65??????????????????70??????????????????75??????????????????80Asn?Asp?Leu?Gly?Lys?Tyr?Thr?Phe?Ile?Arg?Tyr?Pro?Glu?Met?Ser?Lys
85??????????????????90??????????????????95Glu?Met?Phe?Lys?Leu?Tyr?Ile?Pro?Gly?Ile?Asn?Ser?Lys?Val?Thr?Glu
100?????????????????105?????????????????110Leu?Leu?Leu?Lys?Ala?Asp?Arg?Thr?Tyr?Ser?Gln?Met?Thr?Asp?Gly?Leu
115?????????????????120?????????????????125Arg?Asp?Leu?Trp?Ile?Asn?Val?Leu?Ser?Lys?Leu?Ala?Ser?Lys?Asn?Asp
130?????????????????135?????????????????140Gly?Ser?Asn?Tyr?Asp?Leu?Asn?Glu?Glu?Ile?Asn?Asn?Ile?Ser?Lys?Val145?????????????????150?????????????????155?????????????????160His?Thr?Thr?Tyr?Lys?Ser?Asp?Lys?Trp?Tyr?Asn?Pro?Phe?Lys?Thr?Trp
165?????????????????170?????????????????175Phe?Thr?Ile?Lys?Tyr?Asp?Met?Arg?Arg?Leu?Gln?Lys?Ala?Arg?Asn?Glu
180?????????????????185?????????????????190Ile?Thr?Phe?Asn?Val?Gly?Lys?Asp?Tyr?Asn?Leu?Leu?Glu?Asp?Gln?Lys
195?????????????????200?????????????????205Asn?Phe?Leu?Leu?Ile?His?Pro?Glu?Leu?Val?Leu?Ile?Leu?Asp?Lys?Gln
210?????????????????215?????????????????220Asn?Tyr?Asn?Gly?Tyr?Leu?Ile?Thr?Pro?Glu?Leu?Val?Leu?Met?Tyr?Cys225?????????????????230?????????????????235?????????????????240Asp?Val?Val?Glu?Gly?Arg?Trp?Asn?Ile?Ser?Ala?Cys?Ala?Lys?Leu?Asp
245?????????????????250?????????????????255Pro?Lys?Leu?Gln?Ser?Met?Tyr?Gln?Lys?Gly?Asn?Asn?Leu?Trp?Glu?Val
260?????????????????265?????????????????270Ile?Asp?Lys?Leu?Phe?Pro?Ile?Met?Gly?Glu?Lys?Thr?Phe?Asp?Val?Ile
275?????????????????280?????????????????285Ser?Leu?Leu?Glu?Pro?Leu?Ala?Leu?Ser?Leu?Ile?Gln?Thr?His?Asp?Pro
290?????????????????295?????????????????300Val?Lys?Gln?Leu?Arg?Gly?Ala?Phe?Leu?Asn?His?Val?Leu?Ser?Glu?Met305?????????????????310?????????????????315?????????????????320Glu?Leu?Ile?Phe?Glu?Ser?Arg?Glu?Ser?Ile?Lys?Glu?Phe?Leu?Ser?Val
325?????????????????330?????????????????335Asp?Tyr?Ile?Asp?Lys?Ile?Leu?Asp?Ile?Phe?Asn?Lys?Ser?Thr?Ile?Asp
340?????????????????345?????????????????350Glu?Ile?Ala?Glu?Ile?Phe?Ser?Phe?Phe?Arg?Thr?Phe?Gly?His?Pro?Pro
355?????????????????360?????????????????365Leu?Glu?Ala?Ser?Ile?Ala?Ala?Glu?Lys?Val?Arg?Lys?Tyr?Met?Tyr?Ile
370?????????????????375?????????????????380Gly?Lys?Gln?Leu?Lys?Phe?Asp?Thr?Ile?Asn?Lys?Cys?His?Ala?Ile?Phe385?????????????????390?????????????????395?????????????????400Cys?Thr?Ile?Ile?Ile?Asn?Gly?Tyr?Arg?Glu?Arg?His?Gly?Gly?Gln?Trp
405?????????????????410?????????????????415Pro?Pro?Val?Thr?Leu?Pro?Asp?His?Ala?His?Glu?Phe?Ile?Ile?Asn?Ala
420?????????????????425?????????????????430Tyr?Gly?Ser?Asn?Ser?Ala?Ile?Ser?Tyr?Glu?Asn?Ala?Val?Asp?Tyr?Tyr
435?????????????????440?????????????????445Gln?Ser?Phe?Ile?Gly?Ile?Lys?Phe?Asn?Lys?Phe?Ile?Glu?Pro?Gln?Leu
450?????????????????455?????????????????460Asp?Glu?Asp?Leu?Thr?Ile?Tyr?Met?Lys?Asp?Lys?Ala?Leu?Ser?Pro?Lys465?????????????????470?????????????????475?????????????????480Lys?Ser?Asn?Trp?Asp?Thr?Val?Tyr?Pro?Ala?Ser?Asn?Leu?Leu?Tyr?Arg
485?????????????????490?????????????????495Thr?Asn?Ala?Ser?Asn?Glu?Ser?Arg?Arg?Leu?Val?Glu?Val?Phe?Ile?Ala
500?????????????????505?????????????????510Asp?Ser?Lys?Phe?Asp?Pro?His?Gln?Ile?Leu?Asp?Tyr?Val?Glu?Ser?Gly
515?????????????????520?????????????????525Asp?Trp?Leu?Asp?Asp?Pro?Glu?Phe?Asn?Ile?Ser?Tyr?Ser?Leu?Lys?Glu
530?????????????????535?????????????????540Lys?Glu?Ile?Lys?Gln?Glu?Gly?Arg?Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys545?????????????????550?????????????????555?????????????????560Met?Arg?Ala?Thr?Gln?Val?Leu?Ser?Glu?Thr?Leu?Leu?Ala?Asn?Asn?Ile
565?????????????????570?????????????????575Gly?Lys?Phe?Phe?Gln?Glu?Asn?Gly?Met?Val?Lys?Gly?Glu?Ile?Glu?Leu
580?????????????????585?????????????????590Leu?Lys?Arg?Leu?Thr?Thr?Ile?Ser?Ile?Ser?Gly?Val?Pro?Arg?Tyr?Asn
595?????????????????600?????????????????605Glu?Val?Tyr?Asn?Asn?Ser?Lys?Ser?His?Thr?Asp?Asp?Leu?Lys?Thr?Tyr
610?????????????????615?????????????????620Asn?Lys?Ile?Ser?Asn?Leu?Asn?Leu?Ser?Ser?Asn?Gln?Lys?Ser?Lys?Lys625?????????????????630?????????????????635?????????????????640Phe?Glu?Phe?Lys?Ser?Thr?Asp?Ile?Tyr?Asn?Asp?Gly?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Cys?Phe?Leu?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Ser?Thr?Ala?Leu?Phe?Gly?Glu?Thr?Cys?Asn?Gln?Ile?Phe?Gly
675?????????????????680?????????????????685Leu?Asn?Lys?Leu?Phe?Asn?Trp?Leu?His?Pro?Arg?Leu?Glu?Gly?Ser?Thr
690?????????????????695?????????????????700Ile?Tyr?Val?Gly?Asp?Pro?Tyr?Cys?Pro?Pro?Ser?Asp?Lys?Glu?His?Ile705?????????????????710?????????????????715?????????????????720Ser?Leu?Glu?Asp?His?Pro?Asp?Ser?Gly?Phe?Tyr?Val?His?Asn?Pro?Arg
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Phe?Cys?Gln?Lys?Leu?Trp?Thr?Leu?Ile?Ser?Ile
740?????????????????745?????????????????750Ser?Ala?Ile?His?Leu?Ala?Ala?Val?Arg?Ile?Gly?Val?Arg?Val?Thr?Ala
755?????????????????760?????????????????765Met?Val?Gln?Gly?Asp?Asn?Gln?Ala?Ile?Ala?Val?Thr?Thr?Arg?Val?Pro
770?????????????????775?????????????????780Asn?Asn?Tyr?Asp?Tyr?Arg?Val?Lys?Lys?Glu?Ile?Val?Tyr?Lys?Asp?Val785?????????????????790?????????????????795?????????????????800Val?Arg?Phe?Phe?Asp?Ser?Leu?Arg?Glu?Val?Met?Asp?Asp?Leu?Gly?His
805?????????????????810?????????????????815Glu?Leu?Lys?Leu?Asn?Glu?Thr?Ile?Ile?Ser?Ser?Lys?Met?Phe?Ile?Tyr
820?????????????????825?????????????????830Ser?Lys?Arg?Ile?Tyr?Tyr?Asp?Gly?Arg?Ile?Leu?Pro?Gln?Ala?Leu?Lys
835?????????????????840?????????????????845Ala?Leu?Ser?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Val?Ile?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ser?Ala?Ser?Ser?Asn?Leu?Ala?Thr?Ser?Phe?Ala?Lys?Ala?Ile?Glu865?????????????????870?????????????????875?????????????????880Asn?Gly?Tyr?Ser?Pro?Val?Leu?Gly?Tyr?Ala?Cys?Ser?Ile?Phe?Lys?Asn
885?????????????????890?????????????????895Ile?Gln?Gln?Leu?Tyr?Ile?Ala?Leu?Gly?Met?Asn?Ile?Asn?Pro?Thr?Ile
900?????????????????905?????????????????910Thr?Gln?Asn?Ile?Arg?Asp?Gln?Tyr?Phe?Arg?Asn?Pro?Asn?Trp?Met?Gln
915?????????????????920?????????????????925Tyr?Ala?Ser?Leu?Ile?Pro?Ala?Ser?Val?Gly?Gly?Phe?Asn?Tyr?Met?Ala
930?????????????????935?????????????????940Met?Ser?Arg?Cys?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Ser?Val?Ala?Ala945?????????????????950?????????????????955?????????????????960Leu?Ala?Asp?Ile?Lys?Arg?Phe?Ile?Lys?Ala?Asn?Leu?Leu?Asp?Arg?Ser
965?????????????????970?????????????????975Val?Leu?Tyr?Arg?Ile?Met?Asn?Gln?Glu?Pro?Gly?Glu?Ser?Ser?Phe?Leu
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Cys?Asn?Leu?Pro?Gln?Ser?Gln?Asn
995?????????????????1000????????????????1005Ile?Thr?Thr?Met?Ile?Lys?Asn?Ile?Thr?Ala?Arg?Asn?Val?Leu?Gln?Asp
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Leu?Leu?Ser?Gly?Leu?Phe?Thr?Asn?Thr?Met?Ile?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Glu?Leu?Ala?Glu?Phe?Leu?Met?Asp?Arg?Lys?Val?Ile?Leu
1045????????????????1050????????????????1055Pro?Arg?Val?Ala?His?Asp?Ile?Leu?Asp?Asn?Ser?Leu?Thr?Gly?Ile?Arg
1060????????????????1065????????????????1070Asn?Ala?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Ser?Leu?Ile?Arg?Val
1075????????????????1080????????????????1085Gly?Ile?Asn?Arg?Gly?Gly?Leu?Thr?Tyr?Ser?Leu?Leu?Arg?Lys?Ile?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Leu?Val?Gln?Tyr?Glu?Thr?Leu?Ser?Arg?Thr?Leu?Arg?Leu1105????????????????1110????????????????1115????????????????1120Ile?Val?Ser?Asp?Lys?Ile?Lys?Tyr?Glu?Asp?Met?Cys?Ser?Val?Asp?Leu
1125????????????????1130????????????????1135Ala?Ile?Ala?Leu?Arg?Gln?Lys?Met?Trp?Ile?His?Leu?Ser?Gly?Gly?Arg
1140????????????????1145????????????????1150Met?Ile?Ser?Gly?Leu?Glu?Thr?Pro?Asp?Pro?Leu?Glu?Leu?Leu?Ser?Gly
1155????????????????1160????????????????1165Val?Val?Ile?Thr?Gly?Ser?Glu?His?Cys?Lys?Ile?Cys?Tyr?Ser?Ser?Asp
1170????????????????1175????????????????1180Gly?Thr?Asn?Pro?Tyr?Thr?Trp?Met?Tyr?Leu?Pro?Gly?Asn?Ile?Lys?Ile1185????????????????1190????????????????1195????????????????1200Gly?Ser?Ala?Glu?Thr?Gly?Ile?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Phe?Gly
1205????????????????1210????????????????1215Ser?Val?Thr?Asp?Glu?Arg?Ser?Glu?Ala?Gln?Leu?Gly?Tyr?Ile?Lys?Asn
1220????????????????1225????????????????1230Leu?Ser?Lys?Pro?Ala?Lys?Ala?Ala?Ile?Arg?Ile?Ala?Met?Ile?Tyr?Thr
1235????????????????1240????????????????1245Trp?Ala?Phe?Gly?Asn?Asp?Glu?Ile?Ser?Trp?Met?Glu?Ala?Ser?Gln?Ile
1250????????????????1255????????????????1260Ala?Gln?Thr?Arg?Ala?Asn?Phe?Thr?Leu?Asp?Ser?Leu?Lys?Ile?Leu?Thr1265????????????????1270????????????????1275????????????????1280Pro?Val?Ala?Thr?Ser?Thr?Asn?Leu?Ser?His?Arg?Leu?Lys?Asp?Thr?Ala
1285????????????????1290????????????????1295Thr?Gln?Met?Lys?Phe?Ser?Ser?Thr?Ser?Leu?Ile?Arg?Val?Ser?Arg?Phe
1300????????????????1305????????????????1310Ile?Thr?Met?Ser?Asn?Asp?Asn?Met?Ser?Ile?Lys?Glu?Ala?Asn?Glu?Thr
1315????????????????1320????????????????1325Lys?Asp?Thr?Asn?Leu?Ile?Tyr?Gln?Gln?Ile?Met?Leu?Thr?Gly?Leu?Ser
1330????????????????1335????????????????1340Val?Phe?Glu?Tyr?Leu?Phe?Arg?Leu?Lys?Glu?Thr?Thr?Gly?His?Asn?Pro1345????????????????1350????????????????1355????????????????1360Ile?Val?Met?His?Leu?His?Ile?Glu?Asp?Glu?Cys?Cys?Ile?Lys?Glu?Ser
1365????????????????1370????????????????1375Phe?Asn?Asp?Glu?His?Ile?Asn?Pro?Glu?Ser?Thr?Leu?Glu?Leu?Ile?Arg
1380????????????????1385????????????????1390Tyr?Pro?Glu?Ser?Asn?Glu?Phe?Ile?Tyr?Asp?Lys?Asp?Pro?Leu?Lys?Asp
1395????????????????1400????????????????1405Val?Asp?Leu?Ser?Lys?Leu?Met?Val?Ile?Lys?Asp?His?Ser?Tyr?Thr?Ile
1410????????????????1415????????????????1420Asp?Met?Asn?Tyr?Trp?Asp?Asp?Thr?Asp?Ile?Ile?His?Ala?Ile?Ser?Ile1425????????????????1430????????????????1435????????????????1440Cys?Thr?Ala?Ile?Thr?Ile?Ala?Asp?Thr?Met?Ser?Gln?Leu?Asp?Arg?Asp
1445????????????????1450????????????????1455Asn?Leu?Lys?Glu?Ile?Ile?Val?Ile?Ala?Asn?Asp?Asp?Asp?Ile?Asn?Ser
1460????????????????1465????????????????1470Leu?Ile?Thr?Glu?Phe?Leu?Thr?Leu?Asp?Ile?Leu?Val?Phe?Leu?Lys?Thr
1475????????????????1480????????????????1485Phe?Gly?Gly?Leu?Leu?Val?Asn?Gln?Phe?Ala?Tyr?Thr?Leu?Tyr?Ser?Leu
1490????????????????1495????????????????1500Lys?Ile?Glu?Gly?Arg?Asp?Leu?Ile?Trp?Asp?Tyr?Ile?Met?Arg?Thr?Leu1505????????????????1510????????????????1515????????????????1520Arg?Asp?Thr?Ser?His?Ser?Ile?Leu?Lys?Val?Leu?Ser?Asn?Ala?Leu?Ser
1525????????????????1530????????????????1535His?Pro?Lys?Val?Phe?Lys?Arg?Phe?Trp?Asp?Cys?Gly?Val?Leu?Asn?Pro
1540????????????????1545????????????????1550Ile?Tyr?Gly?Pro?Asn?Thr?Ala?Ser?Gln?Asp?Gln?Ile?Lys?Leu?Ala?Leu
1555????????????????1560????????????????1565Ser?Ile?Cys?Glu?Tyr?Ser?Leu?Asp?Leu?Phe?Met?Arg?Glu?Trp?Leu?Asn
1570????????????????1575????????????????1580Gly?Val?Ser?Leu?Glu?Ile?Tyr?Ile?Cys?Asp?Ser?Asp?Met?Glu?Val?Ala1585????????????????1590????????????????1595????????????????1600Asn?Asp?Arg?Lys?Gln?Ala?Phe?Ile?Ser?Arg?His?Leu?Ser?Phe?Val?Cys
1605????????????????1610????????????????1615Cys?Leu?Ala?Glu?Ile?Ala?Ser?Phe?Gly?Pro?Asn?Leu?Leu?Asn?Leu?Thr
1620????????????????1625????????????????1630Tyr?Leu?Glu?Arg?Leu?Asp?Leu?Leu?Lys?Gln?Tyr?Leu?Glu?Leu?Asn?Ile
1635????????????????1640????????????????1645Lys?Glu?Asp?Pro?Thr?Leu?Lys?Tyr?Val?Gln?Ile?Ser?Gly?Leu?Leu?Ile
1650????????????????1655????????????????1660Lys?Ser?Phe?Pro?Ser?Thr?Val?Thr?Tyr?Val?Arg?Lys?Thr?Ala?Ile?Lys1665????????????????1670????????????????1675????????????????1680Tyr?Leu?Arg?Ile?Arg?Gly?Ile?Ser?Pro?Pro?Glu?Val?Ile?Asp?Asp?Trp
1685????????????????1690????????????????1695Asp?Pro?Val?Glu?Asp?Glu?Asn?Met?Leu?Asp?Asn?Ile?Val?Lys?Thr?Ile
1700????????????????1705????????????????1710Asn?Asp?Asn?Cys?Asn?Lys?Asp?Asn?Lys?Gly?Asn?Lys?Ile?Asn?Asn?Phe
1715????????????????1720????????????????1725Trp?Gly?Leu?Ala?Leu?Lys?Asn?Tyr?Gln?Val?Leu?Lys?Ile?Arg?Ser?Ile
1730????????????????1735????????????????1740Thr?Ser?Asp?Ser?Asp?Asp?Asn?Asp?Arg?Leu?Asp?Ala?Asn?Thr?Ser?Gly1745????????????????1750????????????????1755????????????????1760Leu?Thr?Leu?Pro?Gln?Gly?Gly?Asn?Tyr?Leu?Ser?His?Gln?Leu?Arg?Leu
1765????????????????1770????????????????1775Phe?Gly?Ile?Asn?Ser?Thr?Ser?Cys?Leu?Lys?Ala?Leu?Glu?Leu?Ser?Gln
1780????????????????1785????????????????1790Ile?Leu?Met?Lys?Glu?Val?Asn?Lys?Asp?Lys?Asp?Arg?Leu?Phe?Leu?Gly
1795????????????????1800????????????????1805Glu?Gly?Ala?Gly?Ala?Met?Leu?Ala?Cys?Tyr?Asp?Ala?Thr?Leu?Gly?Pro
1810????????????????1815????????????????1820Ala?Val?Asn?Tyr?Tyr?Asn?Ser?Gly?Leu?Asn?Ile?Thr?Asp?Val?Ile?Gly1825????????????????1830????????????????1835????????????????1840Gln?Arg?Glu?Leu?Lys?Ile?Phe?Pro?Ser?Glu?Val?Ser?Leu?Val?Gly?Lys
1845????????????????1850????????????????1855Lys?Leu?Gly?Asn?Val?Thr?Gln?Ile?Leu?Asn?Arg?Val?Lys?Val?Leu?Phe
1860????????????????1865????????????????1870Asn?Gly?Asn?Pro?Asn?Ser?Thr?Trp?Ile?Gly?Asn?Met?Glu?Cys?Glu?Ser
1875????????????????1880????????????????1885Leu?Ile?Trp?Ser?Glu?Leu?Asn?Asp?Lys?Ser?Ile?Gly?Leu?Val?His?Cys
1890????????????????1895????????????????1900Asp?Met?Glu?Gly?Ala?Ile?Gly?Lys?Ser?Glu?Glu?Thr?Val?Leu?His?Glu1905????????????????1910????????????????1915????????????????1920His?Tyr?Ser?Val?Ile?Arg?Ile?Thr?Tyr?Leu?Ile?Gly?Asp?Asp?Asp?Val
1925????????????????1930????????????????1935Val?Leu?Val?Ser?Lys?Ile?Ile?Pro?Thr?Ile?Thr?Pro?Asn?Trp?Ser?Arg
1940????????????????1945????????????????1950Ile?Leu?Tyr?Leu?Tyr?Lys?Leu?Tyr?Trp?Lys?Asp?Val?Ser?Ile?Ile?Ser
1955????????????????1960????????????????1965Leu?Lys?Thr?Ser?Asn?Pro?Ala?Ser?Thr?Glu?Leu?Tyr?Leu?Ile?Ser?Lys
1970????????????????1975????????????????1980Asp?Ala?Tyr?Cys?Thr?Ile?Met?Glu?Pro?Ser?Glu?Ile?Val?Leu?Ser?Lys1985????????????????1990????????????????1995????????????????2000Leu?Lys?Arg?Leu?Ser?Leu?Leu?Glu?Glu?Asn?Asn?Leu?Leu?Lys?Trp?Ile
2005????????????????2010????????????????2015Ile?Leu?Ser?Lys?Lys?Arg?Asn?Asn?Glu?Trp?Leu?His?His?Glu?Ile?Lys
2020????????????????2025????????????????2030Glu?Gly?Glu?Arg?Asp?Tyr?Gly?Ile?Met?Arg?Pro?Tyr?His?Met?Ala?Leu
2035????????????????2040????????????????2045Gln?Ile?Phe?Gly?Phe?Gln?Ile?Asn?Leu?Asn?His?Leu?Ala?Lys?Glu?Phe
2050???????????????2055????????????????2060Leu?Ser?Thr?Pro?Asp?Leu?Thr?Asn?Ile?Asn?Asn?Ile?Ile?Gln?Ser?Phe2065????????????????2070????????????????2075????????????????2080Gln?Arg?Thr?Ile?Lys?Asp?Val?Leu?Phe?Glu?Trp?Ile?Asn?Ile?Thr?His
2085????????????????2090????????????????2095Asp?Asp?Lys?Arg?His?Lys?Leu?Gly?Gly?Arg?Tyr?Asn?Ile?Phe?Pro?Leu
2100????????????????2105????????????????2110
Lys?Asn?Lys?Gly?Lys?Leu?Arg?Leu?Leu?Ser?Arg?Arg?Leu?Val?Leu?Ser
2115????????????????2120????????????????2125
Trp?Ile?Ser?Leu?Ser?Leu?Ser?Thr?Arg?Leu?Leu?Thr?Gly?Arg?Phe?Pro
2130????????????????2135????????????????2140
Asp?Glu?Lys?Phe?Glu?His?Arg?Ala?Gln?Thr?Gly?Tyr?Val?Ser?Leu?Ala
2145????????????????2150????????????????2155????????????????2160
Asp?Thr?Asp?Leu?Glu?Ser?Leu?Lys?Leu?Leu?Ser?Lys?Asn?Ile?Ile?Lys
2165????????????????2170????????????????2175
Asn?Tyr?Arg?Glu?Cys?Ile?Gly?Ser?Ile?Ser?Tyr?Trp?Phe?Leu?Thr?Lys
2180????????????????2185????????????????2190
Glu?Val?Lys?Ile?Leu?Met?Lys?Leu?Ile?Gly?Gly?Ala?Lys?Leu?Leu?Gly
2195????????????????2200????????????????2205
Ile?Pro?Arg?Gln?Tyr?Lys?Glu?Pro?Glu?Asp?Gln?Leu?Leu?Glu?Asn?Tyr
2210????????????????2215????????????????2220
Asn?Gln?His?Asp?Glu?Phe?Asp?Ile?Asp
The information of 2,225 2230 (2) SEQ ID NO:19:
(i) sequence signature:
(A) length: 15462 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: ACCAAACAAG AGAAGAAACT TGCTTGGTAA TATAAATTTA ACTTAAAATT AACTTAGGAT 60 TTAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120 TATTTGATAC ATTTAATGCA GGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180 TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240 ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGAAAC 300 AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT GCCAATCCAG 360 AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGCCAA GTATGTCATA TACATGATTG 420 AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA GAGATGATAT 480 ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG GAAACTATGT 540 TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT GGGTATCCAT 600 CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT ATCACTAGTA 660 TCTCAGGGTT AAGAAAAGGC TTTTTCACCC GATTGGAAGC TTTCAGACAA GATGGAACAG 720 TGCAGGCAGG GCTGGTATTG AGCGGTGACA CAGTGGATCA GATTGGGTCA ATCATGCGGT 780 CTCAACAGAG CTTGGTAACT CTTATGGTTG AAACATTAAT AACAATGAAT ACCAGCAGAA 840 ATGACCTCAC AACCATAGAA AAGAATATAC AAATTGTTGG CAACTACATA AGAGATGCAG 900 GTCTCGCTTC ATTCTTCAAT ACAATCAGAT ATGGAATTGA GACCAGAATG GCAGCTTTGA 960 CTCTATCCAC TCTCAGACCA GATATCAATA GATTAAAAGC TTTGATGGAA CTGTATTTAT 1020 CAAAGGGACC ACGCGCTCCT TTCATCTGTA TCCTCAGAGA TCCTATACAT GGTGAGTTCG 1080 CACCAGGCAA CTATCCTGCC ATATGGAGCT ATGCAATGGG GGTGGCAGTT GTACAAAATA 1140 GAGCCATGCA ACAGTATGTG ACGGGAAGAT CATATCTAGA CATTGATATG TTCCAGCTAG 1200 GACAAGCAGT AGCACGTGAT GCCGAAGCTC AAATGAGCTC AACACTGGAA GATGAACTTG 1260 GAGTGACACA CGAAGCTAAA GAAAGCTTGA AGAGACATAT AAGGAACATA AACAGTTCAG 1320 AGACATCTTT CCACAAACCG ACAGGTGGAT CAGCCATAGA GATGGCAATA GATGAAGAGC 1380 CAGAACAATT CGAACATAGA GCAGATCAAG AACAAAATGG AGAACCTCAA TCATCCATAA 1440 TTCAATATGC CTGGGCAGAA GGAAATAGAA GCGATGATCA GACTGAGCAA GCTACAGAAT 1500 CTGACAATAT CAAGACCGAA CAACAAAACA TCAGAGACAG ACTAAACAAG AGACTCAACG 1560 ACAAGAAGAA ACAAAGCAGT CAACCACCCA CTAATCCCAC AAACAGAACA AACCAGGACG 1620 AAATAGATGA TCTGTTTAAC GCATTTGGAA GCAACTAATC GAATCAACAT TTTAATCTAA 1680 ATCAATAATA AATAAGAAAA ACTTAGGATT AAAGAATCCT ATCATACCGG AATATAGGGT 1740 GGTAAATTTA GAGTCTGCTT GAAACTCAAT CAATAGAGAG TTGATGGAAA GCGATGCTAA 1800 AAACTATCAA ATCATGGATT CTTGGGAAGA GGAATCAAGA GATAAATCAA CTAATATCTC 1860 CTCGGCCCTC AACATCATTG AATTCATACT CAGCACCGAC CCCCAAGAAG ACTTATCGGA 1920 AAACGACACA ATCAACACAA GAACCCAGCA ACTCAGTGCC ACCATCTGTC AACCAGAAAT 1980 CAAACCAACA GAAACAAGTG AGAAAGATAG TGGATCAACT GACAAAAATA GACAGTCTGG 2040 GTCATCACAC GAATGTACAA CAGAAGCAAA AGATAGAAAC ATTGATCAGG AAACTGTACA 2100 GAGAGGACCT GGGAGAAGAA GCAGCTCAGA TAGTAGAGCT GAGACTGTGG TCTCTGGAGG 2160 AATCCCCAGA AGCATCACAG ATTCTAAAAA TGGAACCCAA AACACGGAGG ATATTGATCT 2220 CAATGAAATT AGAAAGATGG ATAAGGACTC TATTGAGGGG AAAATGCGAC AATCTGCAAA 2280 TGTTCCAAGC GAGATATCAG GAAGTGATGA CATATTTACA ACAGAACAAA GTAGAAACAG 2340 TGATCATGGA AGAAGCCTGG AATCTATCAG TACACCTGAT ACAAGATCAA TAAGTGTTGT 2400 TACTGCTGCA ACACCAGATG ATGAAGAAGA AATACTAATG AAAAATAGTA GGACAAAGAA 2460 AAGTTCTTCA ACACATCAAG AAGATGACAA AAGAATTAAA AAAGGGGGAA AAGGGAAAGA 2520 CTGGTTTAAG AAATCAAAAG ATACCGACAA CCAGATACCA ACATCAGACT ACAGATCCAC 2580 ATCAAAAGGG CAGAAGAAAA TCTCAAAGAC AACAACCACC AACACCGACA CAAAGGGGCA 2640 AACAGAAATA CAGACAGAAT CATCAGAAAC ACAATCCTCA TCATGGAATC TCATCATCGA 2700 CAACAACACC GACCGGAACG AACAGACAAG CACAACTCCT CCAACAACAA CTTCCAGATC 2760 AACTTATACA AAAGAATCGA TCCGAACAAA CTCTGAATCC AAACCCAAGA CACAAAAGAC 2820 AAATGGAAAG GAAAGGAAGG ATACAGAAGA GAGCAATCGA TTTACAGAGA GGGCAATTAC 2880 TCTATTGCAG AATCTTGGTG TAATTCAATC CACATCAAAA CTAGATTTAT ATCAAGACAA 2940 ACGAGTTGTA TGTGTAGCAA ATGTACTAAA CAATGTAGAT ACTGCATCAA AGATAGATTT 3000 CCTGGCAGGA TTAGTCATAG GGGTTTCAAT GGACAACGAC ACAAAATTAA CACAGATACA 3060 AAATGAAATG CTAAACCTCA AAGCAGATCT AAAGAAAATG GACGAATCAC ATAGAAGATT 3120 GATAGAAAAT CAAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 3180 TATGACTGAG AGAGGAGGAA AGAAAGACCA AAATGAATCC AATGAGAGAG TATCCATGAT 3240 CAAAACAAAA TTGAAAGAAG AAAAGATCAA GAAGACCAGG TTTGACCCAC TTATGGAGGC 3300 ACAAGGCATT GACAAGAATA TACCCGATCT ATATCGACAT GCAGGAGATA CACTAGAGAA 3360 CGATGTACAA GTTAAATCAG AGATATTAAG TTCATACAAT GAGTCAAATG CAACAAGACT 3420 AATACCCAAA AAAGTGAGCA GTACAATGAG ATCACTAGTT GCAGTCATCA ACAACAGCAA 3480 TCTCTCACAA AGCACAAAAC AATCATACAT AAACGAACTC AAACGTTGCA AAAATGATGA 3540 AGAAGTATCT GAATTAATGG ACATGTTCAA TGAAGATGTC AACAATTGCC AATGATCCAA 3600 CAAAGAAACG ACACCGAACA AACAGACAAG AAACAACAGT AGATCAAAAC CTGTCAACAC 3660 ACACAAAATC AAGCAGAATG AAACAACAGA TATCAATCAA TATACAAATA AGAAAAACTT 3720 AGGATTAAAG AATAAATTAA TCCTTGTCCA AAATGAGTAT AACTAACTCT GCAATATACA 3780 CATTCCCAGA ATCATCATTC TCTGAAAATG GTCATATAGA ACCATTACCA CTCAAAGTCA 3840 ATGAACAGAG GAAAGCAGTA CCCCACATTA GAGTTGCCAA GATCGGAAAT CCACCAAAAC 3900 ACGGATCCCG GTATTTAGAT GTCTTCTTAC TCGGCTTCTT CGAGATGGAA CGAATCAAAG 3960 ACAAATACGG GAGTGTGAAT GATCTCGACA GTGACCCGAG TTACAAAGTT TGTGGCTCTG 4020 GATCATTACC AATCGGATTG GCTAAGTACA CTGGGAATGA CCAGGAATTG TTACAAGCCG 4080 CAACCAAACT GGATATAGAA GTGAGAAGAA CAGTCAAAGC GAAAGAGATG GTTGTTTACA 4140 CGGTACAAAA TATAAAACCA GAACTGTACC CATGGTCCAA TAGACTAAGA AAAGGAATGC 4200 TGTTCGATGC CAACAAAGTT GCTCTTGCTC CTCAATGTCT TCCACTAGAT AGGAGCATAA 4260 AATTTAGAGT AATCTTCGTG AATTGTACGG CAATTGGATC AATAACCTTG TTCAAAATTC 4320 CTAAGTCAAT GGCATCACTA TCTCTAACCA ACACAATATC AATCAATCTG CAGGTACACA 4380 TAAAAACAGG GGTTCAGACT GATTCTAAAG GGATAGTTCA AATTTTGGAT GAGAAAGGCG 4440 AAAAATCACT GAATTTCATG GTCCATCTCG GATTGATCAA AAGAAAAGTA GGCAGAATGT 4500 ACTCTGTTGA ATACTGTAAA CAGAAAATCG AGAAAATGAG ATTGATATTT TCTTTAGGAC 4560 TAGTTGGAGG AATCAGTCTT CATGTCAATG CAACTGGGTC CATATCAAAA ACACTAGCAA 4620 GTCAGCTGGT ATTCAAAAGA GAGATTTGTT ATCCTTTAAT GGATCTAAAT CCGCATCTCA 4680 ATCTAGTTAT CTGGGCTTCA TCAGTAGAGA TTACAAGAGT GGATGCAATT TTCCAACCTT 4740 CTTTACCTGG CGAGTTCAGA TACTATCCTA ATATTATTGC AAAAGGAGTT GGGAAAATCA 4800 AACAATGGAA CTAGTAATCT CTATTTTAGT CCGGACGTAT CTATTAAGCC GAAGCAAATA 4860 AAGGATAATC AAAAACTTAG GACAAAAGAG GTCAATACCA ACAACTATTA GCAGTCACAC 4920 TCGCAAGAAT AAGAGAGAAG GGACCAAAAA AGTCAAATAG GAGAAATCAA AACAAAAGGT 4980 ACAGAACACC AGAACAACAA AATCAAAACA TCCAACTCAC TCAAAACAAA AATTCCAAAA 5040 GAGACCGGCA ACACAACAAG CACTGAACAC AATGCCAACT TCAATACTGC TAATTATTAC 5100 AACCATGATC ATGGCATCTT TCTGCCAAAT AGATATCACA AAACTACAGC ACGTAGGTGT 5160 ATTGGTCAAC AGTCCCAAAG GGATGAAGAT ATCACAAAAC TTTGAAACAA GATATCTAAT 5220 TTTGAGCCTC ATACCAAAAA TAGAAGACTC TAACTCTTGT GGTGACCAAC AGATCAAGCA 5280 ATACAAGAAG TTATTGGATA GACTGATCAT CCCTTTATAT GATGGATTAA GATTACAGAA 5340 AGATGTGATA GTAACCAATC AAGAATCCAA TGAAAACACT GATCCCAGAA CAAAACGATT 5400 CTTTGGAGGG GTAATTGGAA CCATTGCTCT GGGAGTAGCA ACCTCAGCAC AAATTACAGC 5460 GGCAGTTGCT CTGGTTGAAG CCAAGCAGGC AAGATCAGAC ATCGAAAAAC TCAAAGAAGC 5520 AATTAGGGAC ACAAATAAAG CAGTGCAGTC AGTTCAGAGC TCCATAGGAA ATTTAATAGT 5580 AGCAATTAAA TCAGTCCAGG ATTATGTTAA CAAAGAAATC GTGCCATCGA TTGCGAGGCT 5640 AGGTTGTGAA GCAGCAGGAC TTCAATTAGG AATTGCATTA ACACAGCATT ACTCAGAATT 5700 AACAAACATA TTTGGTGATA ACATAGGATC GTTACAAGAA AAAGGAATAA AATTACAAGG 5760 TATAGCATCA TTATACCGCA CAAATATCAC AGAAATATTC ACAACATCAA CAGTTGATAA 5820 ATATGATATC TATGATCTGT TATTTACAGA ATCAATAAAG GTGAGAGTTA TAGATGTTGA 5880 CTTGAATGAT TACTCAATCA CCCTCCAAGT CAGACTCCCT TTATTAACTA GGCTGCTGAA 5940 CACTCAGATC TACAAAGTAG ATTCCATATC ATATAACATC CAAAACAGAG AATGGTATAT 6000 CCCTCTTCCC AGCCATATCA TGACGAAAGG GGCATTTCTA GGTGGAGCAG ACGTCAAAGA 6060 ATGTATAGAA GCATTCAGCA GCTATATATG CCCTTCTGAT CCAGGATTTG TATTAAACCA 6120 TGAAATAGAG AGCTGCTTAT CAGGAAACAT ATCCCAATGT CCAAGAACAA CGGTCACATC 6180 AGACATTGTT CCAAGATATG CATTTGTCAA TGGAGGAGTG GTTGCAAACT GTATAACAAC 6240 CACCTGTACA TGCAACGGAA TTGGTAATAG AATCAATCAA CCACCTGATC AAGGAGTAAA 6300 AATTATAACA CATAAAGAAT GTAGTACAGT AGGTATCAAC GGAATGCTGT TCAATACAAA 6360 TAAAGAAGGA ACTCTTGCAT TCTATACACC AAATGATATA ACACTAAACA ATTCTGTTAC 6420 ACTTGATCCA ATTGACATAT CAATCGAGCT CAACAAGGCC AAATCAGATC TAGAAGAATC 6480 AAAAGAATGG ATAAGAAGGT CAAATCAAAA ACTAGATTCT ATTGGAAATT GGCATCAATC 6540 TAGCACTACA ATCATAATTA TTTTGATAAT GATCATTATA TTGTTTATAA TTAATATAAC 6600 GATAATTACA ATTGCAATTA AGTATTACAG AATTCAAAAG AGAAATCGAG TGGATCAAAA 6660 TGACAAGCCA TATGTACTAA CAAACAAATA ACATATCTAC AGATCATTAG ATATTAAAAT 6720 TATAAAAAAC TTAGGAGTAA AGTTACGCAA TCCAACTCTA CTCATATAAT TGAGGAAGGA 6780 CCCAATAGAC AAATCCAAAT TCGAGATGGA ATACTGGAAG CATACCAATC ACGGAAAGGA 6840 TGCTGGCAAT GAGCTGGAGA CGTCTATGGC TACTCATGGC AACAAGCTCA CTAATAAGAT 6900 AATATACATA TTATGGACAA TAATCCTGGT GTTATTATCA ATAGTCTTCA TCATAGTGCT 6960 AATTAATTCC ATCAAAAGTG AAAAGGCCCA CGAATCATTG CTGCAAGACA TAAATAATGA 7020 GTTTATGGAA ATTACAGAAA AGATCCAAAT GGCATCGGAT AATACCAATG ATCTAATACA 7080 GTCAGGAGTG AATACAAGGC TTCTTACAAT TCAGAGTCAT GTCCAGAATT ACATACCAAT 7140 ATCATTGACA CAACAGATGT CAGATCTTAG GAAATTCATT AGTGAAATTA CAATTAGAAA 7200 TGATAATCAA GAAGTGCTGC CACAAAGAAT AACACATGAT GTAGGTATAA AACCTTTAAA 7260 TCCAGATGAT TTTTGGAGAT GCACGTCTGG TCTTCCATCT TTAATGAAAA CTCCAAAAAT 7320 AAGGTTAATG CCAGGGCCGG GATTATTAGC TATGCCAACG ACTGTTGATG GCTGTGTTAG 7380 AACTCCGTCT TTAGTTATAA ATGATCTGAT TTATGCTTAT ACCTCAAATC TAATTACTCG 7440 AGGTTGTCAG GATATAGGAA AATCATATCA AGTCTTACAG ATAGGGATAA TAACTGTAAA 7500 CTCAGACTTG GTACCTGACT TAAATCCTAG GATCTCTCAT ACCTTTAACA TAAATGACAA 7560 TAGGAAGTCA TGTTCTCTAG CACTCCTAAA TACAGATGTA TATCAACTGT GTTCAACTCC 7620 CAAAGTTGAT GAAAGATCAG ATTATGCATC ATCAGGCATA GAAGATATTG TACTTGATAT 7680 TGTCAATTAT GATGGTTCAA TCTCAACAAC AAGATTTAAG AATAATAACA TAAGCTTTGA 7740 TCAACCATAT GCTGCACTAT ACCCATCTGT TGGACCAGGG ATATACTACA AAGGCAAAAT 7800 AATATTTCTC GGGTATGGAG GTCTTGAACA TCCAATAAAT GAGAATGTAA TCTGCAACAC 7860 AACTGGGTGC CCCGGGAAAA CACAGAGAGA CTGTAATCAA GCGTCTCATA GTCCATGGTT 7920 TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTTGCTGAC AAAGGCTTAA ACTCAATTCC 7980 AAAATTGAAA GTATGGACGA TATCTATGCG ACAAAATTAC TGGGGGTCAG AAGGAAGGTT 8040 ACTTCTACTA GGTAACAAGA TCTATATATA TACAAGATCT ACAAGTTGGC ATAGCAAGTT 8100 ACAATTAGGA ATAATTGATA TTACTGATTA CAGTGATATA AGGATAAAAT GGACATGGCA 8160 TAATGTGCTA TCAAGACCAG GAAACAATGA ATGTCCATGG GGACATTCAT GTCCAGATGG 8220 ATGTATAACA GGAGTATATA CTGATGCATA TCCACTCAAT CCCACAGGGA GCATTGTGTC 8280 ATCTGTCATA TTAGACTCAC AAAAATCGAG AGTGAACCCA GTCATAACTT ACTCAACAGC 8340 AACCGAAAGA GTAAACGAGC TGGCCATCCT AAACAGAACA CTCTCAGCTG GATATACAAC 8400 AACAAGCTGC ATTACACACT ATAACAAAGG ATATTGTTTT CATATAGTAG AAATAAATCA 8460 TAAAAGCTTA AACACATTTC AACCCATGTT GTTCAAAACA GAGATTCCAA AAAGCTGCAG 8520 TTAATCATAA TTAACCATAA TATGCATCAA TCTATCTATA ATACAAGTAT ATGATAAGTA 8580 ATCAGCAATC AGACAATAGA CAAAAGGGAA ATATAAAAAA CTTAGGAGCA AAGCGTGCTC 8640 GGGAAATGGA CACTGAATCT AACAATGGCA CTGTATCTGA CATACTCTAT CCTGAGTGTC 8700 ACCTTAACTC TCCTATCGTT AAAGGTAAAA TAGCACAATT ACACACTATT ATGAGTCTAC 8760 CTCAGCCTTA TGATATGGAT GACGACTCAA TACTAGTTAT CACTAGACAG AAAATAAAAC 8820 TTAATAAATT GGATAAAAGA CAACGATCTA TTAGAAGATT AAAATTAATA TTAACTGAAA 8880 AAGTGAATGA CTTAGGAAAA TACACATTTA TCAGATATCC AGAAATGTCA AAAGAAATGT 8940 TCAAATTATA TATACCTGGT ATTAACAGTA AAGTGACTGA ATTATTACTT AAAGCAGATA 9000 GAACATATAG TCAAATGACT GATGGATTAA GAGATCTATG GATTAATGTG CTATCAAAAT 9060 TAGCCTCAAA AAATGATGGA AGCAATTATG ATCTTAATGA AGAAATTAAT AATATATCGA 9120 AAGTTCACAC AACCTATAAA TCAGATAAAT GGTATAATCC ATTCAAAACA TGGTTTACTA 9180 TCAAGTATGA TATGAGAAGA TTACAAAAAG CTCGAAATGA GATCACTTTT AATGTTGGGA 9240 AGGATTATAA CTTGTTAGAA GACCAGAAGA ATTTCTTATT GATACATCCA GAATTGGTTT 9300 TGATATTAGA TAAACAAAAC TACAATGGTT ATCTAATTAC TCCTGAATTA GTATTGATGT 9360 ATTGTGACGT AGTCGAAGGC CGATGGAATA TAAGTGCATG TGCTAAGTTA GATCCAAAAT 9420 TACAATCTAT GTATCAGAAA GGTAATAACC TGTGGGAAGT GATAGATAAA TTGTTTCCAA 9480 TTATGGGAGA AAAGACATTT GATGTGATAT CGTTATTAGA ACCACTTGCA TTATCCTTAA 9540 TTCAAACTCA TGATCCTGTT AAACAACTAA GAGGAGCTTT TTTAAATCAT GTGTTATCCG 9600 AGATGGAATT AATATTTGAA TCTAGAGAAT CGATTAAGGA ATTTCTGAGT GTAGATTACA 9660 TTGATAAAAT TTTAGATATA TTTAATAAGT CTACAATAGA TGAAATAGCA GAGATTTTCT 9720 CTTTTTTTAG AACATTTGGG CATCCTCCAT TAGAAGCTAG TATTGCAGCA GAAAAGGTTA 9780 GAAAATATAT GTATATTGGA AAACAATTAA AATTTGACAC TATTAATAAA TGTCATGCTA 9840 TCTTCTGTAC AATAATAATT AACGGATATA GAGAGAGGCA TGGTGGACAG TGGCCTCCTG 9900 TGACATTACC TGATCATGCA CACGAATTCA TCATAAATGC TTACGGTTCA AACTCTGCGA 9960 TATCATATGA GAATGCTGTT GATTATTACC AGAGCTTTAT AGGAATAAAA TTCAATAAAT 10020 TCATAGAGCC TCAGTTAGAT GAGGATTTGA CAATTTATAT GAAAGATAAA GCATTATCTC 10080 CAAAAAAATC AAATTGGGAC ACAGTTTATC CTGCATCTAA TTTACTGTAC CGTACTAACG 10140 CATCCAACGA ATCACGAAGA TTAGTTGAAG TATTTATAGC AGATAGTAAA TTTGATCCTC 10200 ATCAGATATT GGATTATGTA GAATCTGGGG ACTGGTTAGA TGATCCAGAA TTTAATATTT 10260 CTTATAGTCT TAAAGAAAAA GAGATCAAAC AGGAAGGTAG ACTCTTTGCA AAAATGACAT 10320 ACAAAATGAG AGCTACACAA GTTTTATCAG AGACACTACT TGCAAATAAC ATAGGAAAAT 10380 TCTTTCAAGA AAATGGGATG GTGAAGGGAG AGATTGAATT ACTTAAGAGA TTAACAACCA 10440 TATCAATATC AGGAGTTCCA CGGTATAATG AAGTGTACAA TAATTCTAAA AGCCATACAG 10500 ATGACCTTAA AACCTACAAT AAAATAAGTA ATCTTAATTT GTCTTCTAAT CAGAAATCAA 10560 AGAAATTTGA ATTCAAGTCA ACGGATATCT ACAATGATGG ATACGAGACT GTGAGCTGTT 10620 TCCTAACAAC AGATCTCAAA AAATACTGTC TTAATTGGAG ATATGAATCA ACAGCTCTAT 10680 TTGGAGAAAC TTGCAACCAA ATATTTGGAT TAAATAAATT GTTTAATTGG TTACACCCTC 10740 GTCTTGAAGG AAGTACAATC TATGTAGGTG ATCCTTACTG TCCTCCATCA GATAAAGAAC 10800 ATATATCATT AGAGGATCAC CCTGATTCTG GTTTTTACGT TCATAACCCA AGAGGGGGTA 10860 TAGAAGGATT TTGTCAAAAA TTATGGACAC TCATATCTAT AAGTGCAATA CATCTAGCAG 10920 CTGTTAGAAT AGGCGTGAGG GTGACTGCAA TGGTTCAAGG AGACAATCAA GCTATAGCTG 10980 TAACCACAAG AGTACCCAAC AATTATGACT ACAGAGTTAA GAAGGAGATA GTTTATAAAG 11040 ATGTAGTGAG ATTTTTTGAT TCATTAAGAG AAGTGATGGA TGATCTAGGT CATGAACTTA 11100 AATTAAATGA AACGATTATA AGTAGCAAGA TGTTCATATA TAGCAAAAGA ATCTATTATG 11160 ATGGGAGAAT TCTTCCTCAA GCTCTAAAAG CATTATCTAG ATGTGTCTTC TGGTCAGAGA 11220 CAGTAATAGA CGAAACAAGA TCAGCATCTT CAAATTTGGC AACATCATTT GCAAAAGCAA 11280 TTGAGAATGG TTATTCACCT GTTCTAGGAT ATGCATGCTC AATTTTTAAG AACATTCAAC 11340 AACTATATAT TGCCCTTGGG ATGAATATCA ATCCAACTAT AACACAGAAT ATCAGAGATC 11400 AGTATTTTAG GAATCCAAAT TGGATGCAAT ATGCCTCTTT AATACCTGCT AGTGTTGGGG 11460 GATTCAATCA CATGGCCATG TCAAGATGTT TTGTAAGGAA TATTGGTGAT CCATCAGTTG 11520 CCGCATTGGC TGATATTAAA AGATTTATTA AGGCGAATCT ATTAGACCGA AGTGTTCTTT 11580 ATAGGATTAT GAATCAAGAA CCAGGTGAGT CATCTTTTTT TGACTGGGCT TCAGATCCAT 11640 ATTCATGCAA TTTACCACAA TCTCAAAATA TAACCACCAT GATAAAAAAT ATAACAGCAA 11700 GGAATGTATT ACAAGATTCA CCAAATCCAT TATTATCTGG ATTATTCACA AATACAATGA 11760 TAGAAGAAGA TGAAGAATTA GCTGAGTTCC TGATGGACAG GAAGGTAATT CTCCCTAGAG 11820 TTGCACATGA TATTCTAGAT AATTCTCTCA CAGGAATTAG AAATGCCATA GCTGGAATGT 11880 TAGATACGAC AAAATCACTA ATTCGGGTTG GCATAAATAG AGGAGGACTG ACATATAGTT 11940 TGTTGAGGAA AATCAGTAAT TACGATCTAG TACAATATGA AACACTAAGT AGGACTTTGC 12000 GACTAATTGT AAGTGATAAA ATCAAGTATG AAGATATGTG TTCGGTAGAC CTTGCCATAG 12060 CATTGCGACA AAAGATGTGG ATTCATTTAT CAGGAGGAAG GATGATAAGT GGACTTGAAA 12120 CGCCTGACCC ATTAGAATTA CTATCTGGGG TAGTAATAAC AGGATCAGAA CATTGTAAAA 12180 TATGTTATTC TTCAGATGGC ACAAACCCAT ATACTTGGAT GTATTTACCC GGTAATATCA 12240 AAATAGGATC AGCAGAAACA GGTATATCGT CATTAAGAGT TCCTTATTTT GGATCAGTCA 12300 CTGATGAAAG ATCTGAAGCA CAATTAGGAT ATATCAAGAA TCTTAGTAAA CCTGCAAAAG 12360 CCGCAATAAG AATAGCAATG ATATATACAT GGGCATTTGG TAATGATGAG ATATCTTGGA 12420 TGGAAGCCTC ACAGATAGCA CAAACACGTG CAAATTTTAC ACTAGATAGT CTCAAAATTT 12480 TAACACCGGT AGCTACATCA ACAAATTTAT CACACAGATT AAAGGATACT GCAACTCAGA 12540 TGAAATTCTC CAGTACATCA TTGATCAGAG TCAGCAGATT TATAACAATG TCCAATGATA 12600 ACATGTCTAT CAAAGAAGCT AATGAAACCA AAGATACTAA TCTTATTTAT CAACAAATAA 12660 TGTTAACAGG ATTAAGTGTT TTCGAATATT TATTTAGATT AAAAGAAACC ACAGGACACA 12720 ACCCTATAGT TATGCATCTG CACATAGAAG ATGAGTGTTG TATTAAAGAA AGTTTTAATG 12780 ATGAACATAT TAATCCAGAG TCTACATTAG AATTAATTCG ATATCCTGAA AGTAATGAAT 12840 TTATTTATGA TAAAGACCCA CTCAAAGATG TGGACTTATC AAAACTTATG GTTATTAAAG 12900 ACCATTCTTA CACAATTGAT ATGAATTATT GGGATGATAC TGACATCATA CATGCAATTT 12960 CAATATGTAC TGCAATTACA ATAGCAGATA CTATGTCACA ATTAGATCGA GATAATTTAA 13020 AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG CTTAATCACT GAATTTTTGA 13080 CTCTTGACAT ACTTGTATTT CTCAAGACAT TTGGTGGATT ATTAGTAAAT CAATTTGCAT 13140 ACACTCTTTA TAGTCTAAAA ATAGAAGGTA GGGATCTCAT TTGGGATTAT ATAATGAGAA 13200 CACTGAGAGA TACTTCCCAT TCAATATTAA AAGTATTATC TAATGCATTA TCTCATCCTA 13260 AAGTATTCAA GAGGTTCTGG GATTGTGGAG TTTTAAACCC TATTTATGGT CCTAATATTG 13320 CTAGTCAAGA CCAGATAAAA CTTGCCCTAT CTATATGTGA ATATTCACTA GATCTATTTA 13380 TGAGAGAATG GTTGAATGGT GTATCACTTG AAATATACAT TTGTGACAGC GATATGGAAG 13440 TTGCAAATGA TAGGAAACAA GCCTTTATTT CTAGACACCT TTCATTTGTT TGTTGTTTAG 13500 CAGAAATTGC ATCTTTCGGA CCTAACCTGT TAAACTTAAC ATACTTGGAG AGACTTGATC 13560 TATTGAAACA ATATCTTGAA TTAAATATTA AAGAAGACCC TACTCTTAAA TATGTACAAA 13620 TATCTGGATT ATTAATTAAA TCGTTCCCAT CAACTGTAAC ATACGTAAGA AAGACTGCAA 13680 TCAAATATCT AAGGATTCGC GGTATTAGTC CACCTGAGGT AATTGATGAT TGGGATCCGG 13740 TAGAAGATGA AAATATGCTG GATAACATTG TCAAAACTAT AAATGATAAC TGTAATAAAG 13800 ATAATAAAGG GAATAAAATT AACAATTTCT GGGGACTAGC ACTTAAGAAC TATCAAGTCC 13860 TTAAAATCAG ATCTATAACA AGTGATTCTG ATGATAATGA TAGACTAGAT GCTAATACAA 13920 GTGGTTTGAC ACTTCCTCAA GGAGGGAATT ATCTATCGCA TCAATTGAGA TTATTCGGAA 13980 TCAACAGCAC TAGTTGTCTG AAAGCTCTTG AGTTATCACA AATTTTAATG AAGGAAGTCA 14040 ATAAAGACAA GGACAGGCTC TTCCTGGGAG AAGGAGCAGG AGCTATGCTA GCATGTTATG 14100 ATGCCACATT AGGACCTGCA GTTAATTATT ATAATTCAGG TTTGAATATA ACAGATGTAA 14160 TTGGTCAACG AGAATTGAAA ATATTTCCTT CAGAGGTATC ATTAGTAGGT AAAAAATTAG 14220 GAAATGTGAC ACAGATTCTT AACAGGGTAA AAGTACTGTT CAATGGGAAT CCTAATTCAA 14280 CATGGATAGG AAATATGGAA TGTGAGAGCT TAATATGGAG TGAATTAAAT GATAAGTCCA 14340 TTGGATTAGT ACATTGTGAT ATGGAAGGAG CTATCGGTAA ATCAGAAGAA ACTGTTCTAC 14400 ATGAACATTA TAGTGTTATA AGAATTACAT ACTTGATTGG GGATGATGAT GTTGTTTTAG 14460 TTTCCAAAAT TATACCTACA ATCACTCCGA ATTGGTCTAG AATACTTTAT CTATATAAAT 14520 TATATTGGAA AGATGTAAGT ATAATATCAC TCAAAACTTC TAATCCTGCA TCAACAGAAT 14580 TATATCTAAT TTCGAAAGAT GCATATTGTA CTATAATGGA ACCTAGTGAA ATTGTTTTAT 14640 CAAAACTTAA AAGATTGTCA CTCTTGGAAG AAAATAATCT ATTAAAATGG ATCATTTTAT 14700 CAAAGAAGAG GAATAATGAA TGGTTACATC ATGAAATCAA AGAAGGAGAA AGAGATTATG 14760 GAATCATGAG ACCATATCAT ATGGCACTAC AAATCTTTGG ATTTCAAATC AATTTAAATC 14820 ATCTGGCGAA AGAATTTTTA TCAACCCCAG ATCTGACTAA TATCAACAAT ATAATCCAAA 14880 GTTTTCAGCG AACAATAAAG GATGTTTTAT TTGAATGGAT TAATATAACT CATGATGATA 14940 AGAGACATAA ATTAGGCGGA AGATATAACA TATTCCCACT GAAAAATAAG GGAAAGTTAA 15000 GACTGCTATC GAGAAGACTA GTATTAAGTT GGATTTCATT ATCATTATCG ACTCGATTAC 15060 TTACAGGTCG CTTTCCTGAT GAAAAATTTG AACATAGAGC ACAGACTGGA TATGTATCAT 15120 TAGCTGATAC TGATTTAGAA TCATTAAAGT TATTGTCGAA AAACATCATT AAGAATTACA 15180 GAGAGTGTAT AGGATCAATA TCATATTGGT TTCTAACCAA AGAAGTTAAA ATACTTATGA 15240 AATTGATTGG TGGTGCTAAA TTATTAGGAA TTCCCAGACA ATATAAAGAA CCCGAAGACC 15300 AGTTATTAGA AAACTACAAT CAACATGATG AATTTGATAT CGATTAAAAC ATAAATACAA 15360 TGAAGATATA TCCTAACCTT TATCTTTAAG CCTAGGAATA GACAAAAAGT AAGAAAAACA 15420 TGTAATATAT ATATACCAAA CAGAGTTCTT CTCTTGTTTG GT 15462 (2) SEQ ID NO: 20 information about: ...
(i) sequence signature:
(A) length: 2233 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:20:
Met?Asp?Thr?Glu?Ser?Asn?Asn?Gly?Thr?Val?Ser?Asp?Ile?Leu?Tyr?Pro
1???????????????5???????????????????10??????????????????15
Glu?Cys?His?Leu?Asn?Ser?Pro?Ile?Val?Lys?Gly?Lys?Ile?Ala?Gln?Leu
20??????????????????25??????????????????30
His?Thr?Ile?Met?Ser?Leu?Pro?Gln?Pro?Tyr?Asp?Met?Asp?Asp?Asp?Ser
35??????????????????40??????????????????45
Ile?Leu?Val?Ile?Thr?Arg?Gln?Lys?Ile?Lys?Leu?Asn?Lys?Leu?Asp?Lys
50??????????????????55??????????????????60
Arg?Gln?Arg?Ser?Ile?Arg?Arg?Leu?Lys?Leu?Ile?Leu?Thr?Glu?Lys?Val
65??????????????????70??????????????????75??????????????????80
Asn?Asp?Leu?Gly?Lys?Tyr?Thr?Phe?Ile?Arg?Tyr?Pro?Glu?Met?Ser?Lys
85??????????????????90??????????????????95
Glu?Met?Phe?Lys?Leu?Tyr?Ile?Pro?Gly?Ile?Asn?Ser?Lys?Val?Thr?Glu
100?????????????????105?????????????????110
Leu?Leu?Leu?Lys?Ala?Asp?Arg?Thr?Tyr?Ser?Gln?Met?Thr?Asp?Gly?Leu
115?????????????????120?????????????????125
Arg?Asp?Leu?Trp?Ile?Asn?Val?Leu?Ser?Lys?Leu?Ala?Ser?Lys?Asn?Asp
130?????????????????135?????????????????140
Gly?Ser?Asn?Tyr?Asp?Leu?Asn?Glu?Glu?Ile?Asn?Asn?Ile?Ser?Lys?Val145?????????????????150?????????????????155?????????????????160His?Thr?Thr?Tyr?Lys?Ser?Asp?Lys?Trp?Tyr?Asn?Pro?Phe?Lys?Thr?Trp
165?????????????????170?????????????????175Phe?Thr?Ile?Lys?Tyr?Asp?Met?Arg?Arg?Leu?Gln?Lys?Ala?Arg?Asn?Glu
180?????????????????185?????????????????190Ile?Thr?Phe?Asn?Val?Gly?Lys?Asp?Tyr?Asn?Leu?Leu?Glu?Asp?Gln?Lys
195?????????????????200?????????????????205Asn?Phe?Leu?Leu?Ile?His?Pro?Glu?Leu?Val?Leu?Ile?Leu?Asp?Lys?Gln
210?????????????????215?????????????????220Asn?Tyr?Asn?Gly?Tyr?Leu?Ile?Thr?Pro?Glu?Leu?Val?Leu?Met?Tyr?Cys225?????????????????230?????????????????235?????????????????240Asp?Val?Val?Glu?Gly?Arg?Trp?Asn?Ile?Ser?Ala?Cys?Ala?Lys?Leu?Asp
245?????????????????250?????????????????255Pro?Lys?Leu?Gln?Ser?Met?Tyr?Gln?Lys?Gly?Asn?Asn?Leu?Trp?Glu?Val
260?????????????????265?????????????????270Ile?Asp?Lys?Leu?Phe?Pro?Ile?Met?Gly?Glu?Lys?Thr?Phe?Asp?Val?Ile
275?????????????????280?????????????????285Ser?Leu?Leu?Glu?Pro?Leu?Ala?Leu?Ser?Leu?Ile?Gln?Thr?His?Asp?Pro
290?????????????????295?????????????????300Val?Lys?Gln?Leu?Arg?Gly?Ala?Phe?Leu?Asn?His?Val?Leu?Ser?Glu?Met305?????????????????310?????????????????315?????????????????320Glu?Leu?Ile?Phe?Glu?Ser?Arg?Glu?Ser?Ile?Lys?Glu?Phe?Leu?Ser?Val
325?????????????????330?????????????????335Asp?Tyr?Ile?Asp?Lys?Ile?Leu?Asp?Ile?Phe?Asn?Lys?Ser?Thr?Ile?Asp
340?????????????????345?????????????????350Glu?Ile?Ala?Glu?Ile?Phe?Ser?Phe?Phe?Arg?Thr?Phe?Gly?His?Pro?Pro
355?????????????????360?????????????????365Leu?Glu?Ala?Ser?Ile?Ala?Ala?Glu?Lys?Val?Arg?Lys?Tyr?Met?Tyr?Ile
370?????????????????375?????????????????380Gly?Lys?Gln?Leu?Lys?Phe?Asp?Thr?Ile?Asn?Lys?Cys?His?Ala?Ile?Phe385?????????????????390?????????????????395?????????????????400Cys?Thr?Ile?Ile?Ile?Asn?Gly?Tyr?Arg?Glu?Arg?His?Gly?Gly?Gln?Trp
405?????????????????410?????????????????415Pro?Pro?Val?Thr?Leu?Pro?Asp?His?Ala?His?Glu?Phe?Ile?Ile?Asn?Ala
420?????????????????425?????????????????430Tyr?Gly?Ser?Asn?Ser?Ala?Ile?Ser?Tyr?Glu?Asn?Ala?Val?Asp?Tyr?Tyr
435?????????????????440?????????????????445Gln?Ser?Phe?Ile?Gly?Ile?Lys?Phe?Asn?Lys?Phe?Ile?Glu?Pro?Gln?Leu
450?????????????????455?????????????????460Asp?Glu?Asp?Leu?Thr?Ile?Tyr?Met?Lys?Asp?Lys?Ala?Leu?Ser?Pro?Lys465?????????????????470?????????????????475?????????????????480Lys?Ser?Asn?Trp?Asp?Thr?Val?Tyr?Pro?Ala?Ser?Asn?Leu?Leu?Tyr?Arg
485?????????????????490?????????????????495Thr?Asn?Ala?Ser?Asn?Glu?Ser?Arg?Arg?Leu?Val?Glu?Val?Phe?Ile?Ala
500?????????????????505?????????????????510Asp?Ser?Lys?Phe?Asp?Pro?His?Gln?Ile?Leu?Asp?Tyr?Val?Glu?Ser?Gly
515?????????????????520?????????????????525Asp?Trp?Leu?Asp?Asp?Pro?Glu?Phe?Asn?Ile?Ser?Tyr?Ser?Leu?Lys?Glu
530?????????????????535?????????????????540Lys?Glu?Ile?Lys?Gln?Glu?Gly?Arg?Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys545?????????????????550?????????????????555?????????????????560Met?Arg?Ala?Thr?Gln?Val?Leu?Ser?Glu?Thr?Leu?Leu?Ala?Asn?Asn?Ile
565?????????????????570?????????????????575Gly?Lys?Phe?Phe?Gln?Glu?Asn?Gly?Met?Val?Lys?Gly?Glu?Ile?Glu?Leu
580?????????????????585?????????????????590Leu?Lys?Arg?Leu?Thr?Thr?Ile?Ser?Ile?Ser?Gly?Val?Pro?Arg?Tyr?Asn
595?????????????????600?????????????????605Glu?Val?Tyr?Asn?Asn?Ser?Lys?Ser?His?Thr?Asp?Asp?Leu?Lys?Thr?Tyr
610?????????????????615?????????????????620Asn?Lys?Ile?Ser?Asn?Leu?Asn?Leu?Ser?Ser?Asn?Gln?Lys?Ser?Lys?Lys625?????????????????630?????????????????635?????????????????640Phe?Glu?Phe?Lys?Ser?Thr?Asp?Ile?Tyr?Asn?Asp?Gly?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Cys?Phe?Leu?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Ser?Thr?Ala?Leu?Phe?Gly?Glu?Thr?Cys?Asn?Gln?Ile?Phe?Gly
675?????????????????680?????????????????685Leu?Asn?Lys?Leu?Phe?Asn?Trp?Leu?His?Pro?Arg?Leu?Glu?Gly?Ser?Thr
690?????????????????695?????????????????700Ile?Tyr?Val?Gly?Asp?Pro?Tyr?Cys?Pro?Pro?Ser?Asp?Lys?Glu?His?Ile705?????????????????710?????????????????715?????????????????720Ser?Leu?Glu?Asp?His?Pro?Asp?Ser?Gly?Phe?Tyr?Val?His?Asn?Pro?Arg
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Phe?Cys?Gln?Lys?Leu?Trp?Thr?Leu?Ile?Ser?Ile
740?????????????????745?????????????????750Ser?Ala?Ile?His?Leu?Ala?Ala?Val?Arg?Ile?Gly?Val?Arg?Val?Thr?Ala
755?????????????????760?????????????????765Met?Val?Gln?Gly?Asp?Asn?Gln?Ala?Ile?Ala?Val?Thr?Thr?Arg?Val?Pro
770?????????????????775?????????????????780Asn?Asn?Tyr?Asp?Tyr?Arg?Val?Lys?Lys?Glu?Ile?Val?Tyr?Lys?Asp?Val785?????????????????790?????????????????795?????????????????800Val?Arg?Phe?Phe?Asp?Ser?Leu?Arg?Glu?Val?Met?Asp?Asp?Leu?Gly?His
805?????????????????810?????????????????815Glu?Leu?Lys?Leu?Asn?Glu?Thr?Ile?Ile?Ser?Ser?Lys?Met?Phe?Ile?Tyr
820?????????????????825?????????????????830Ser?Lys?Arg?Ile?Tyr?Tyr?Asp?Gly?Arg?Ile?Leu?Pro?Gln?Ala?Leu?Lys
835?????????????????840?????????????????845Ala?Leu?Ser?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Val?Ile?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ser?Ala?Ser?Ser?Asn?Leu?Ala?Thr?Ser?Phe?Ala?Lys?Ala?Ile?Glu865?????????????????870?????????????????875?????????????????880Asn?Gly?Tyr?Ser?Pro?Val?Leu?Gly?Tyr?Ala?Cys?Ser?Ile?Phe?Lys?Asn
885?????????????????890?????????????????895Ile?Gln?Gln?Leu?Tyr?Ile?Ala?Leu?Gly?Met?Asn?Ile?Asn?Pro?Thr?Ile
900?????????????????905?????????????????910Thr?Gln?Asn?Ile?Arg?Asp?Gln?Tyr?Phe?Arg?Asn?Pro?Asn?Trp?Met?Gln
915?????????????????920?????????????????925Tyr?Ala?Ser?Leu?Ile?Pro?Ala?Ser?Val?Gly?Gly?Phe?Asn?His?Met?Ala
930?????????????????935?????????????????940Met?Ser?Arg?Cys?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Ser?Val?Ala?Ala945?????????????????950?????????????????955?????????????????960Leu?Ala?Asp?Ile?Lys?Arg?Phe?Ile?Lys?Ala?Asn?Leu?Leu?Asp?Arg?Ser
965?????????????????970?????????????????975Val?Leu?Tyr?Arg?Ile?Met?Asn?Gln?Glu?Pro?Gly?Glu?Ser?Ser?Phe?Phe
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Cys?Asn?Leu?Pro?Gln?Ser?Gln?Asn
995?????????????????1000????????????????1005Ile?Thr?Thr?Met?Ile?Lys?Asn?Ile?Thr?Ala?Arg?Asn?Val?Leu?Gln?Asp
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Leu?Leu?Ser?Gly?Leu?Phe?Thr?Asn?Thr?Met?Ile?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Glu?Leu?Ala?Glu?Phe?Leu?Met?Asp?Arg?Lys?Val?Ile?Leu
1045????????????????1050????????????????1055Pro?Arg?Val?Ala?His?Asp?Ile?Leu?Asp?Asn?Ser?Leu?Thr?Gly?Ile?Arg
1060????????????????1065????????????????1070Asn?Ala?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Ser?Leu?Ile?Arg?Val
1075????????????????1080????????????????1085Gly?Ile?Asn?Arg?Gly?Gly?Leu?Thr?Tyr?Ser?Leu?Leu?Arg?Lys?Ile?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Leu?Val?Gln?Tyr?Glu?Thr?Leu?Ser?Arg?Thr?Leu?Arg?Leu1105????????????????1110????????????????1115????????????????1120Ile?Val?Ser?Asp?Lys?Ile?Lys?Tyr?Glu?Asp?Met?Cys?Ser?Val?Asp?Leu
1125????????????????1130????????????????1135Ala?Ile?Ala?Leu?Arg?Gln?Lys?Met?Trp?Ile?His?Leu?Ser?Gly?Gly?Arg
1140????????????????1145????????????????1150Met?Ile?Ser?Gly?Leu?Glu?Thr?Pro?Asp?Pro?Leu?Glu?Leu?Leu?Ser?Gly
1155????????????????1160????????????????1165Val?Val?Ile?Thr?Gly?Ser?Glu?His?Cys?Lys?Ile?Cys?Tyr?Ser?Ser?Asp
1170????????????????1175????????????????1180Gly?Thr?Asn?Pro?Tyr?Thr?Trp?Met?Tyr?Leu?Pro?Gly?Asn?Ile?Lys?Ile1185????????????????1190????????????????1195????????????????1200Gly?Ser?Ala?Glu?Thr?Gly?Ile?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Phe?Gly
1205????????????????1210????????????????1215Ser?Val?Thr?Asp?Glu?Arg?Ser?Glu?Ala?Gln?Leu?Gly?Tyr?Ile?Lys?Asn
1220????????????????1225????????????????1230Leu?Ser?Lys?Pro?Ala?Lys?Ala?Ala?Ile?Arg?Ile?Ala?Met?Ile?Tyr?Thr
1235????????????????1240????????????????1245Trp?Ala?Phe?Gly?Asn?Asp?Glu?Ile?Ser?Trp?Met?Glu?Ala?Ser?Gln?Ile
1250????????????????1255????????????????1260Ala?Gln?Thr?Arg?Ala?Asn?Phe?Thr?Leu?Asp?Ser?Leu?Lys?Ile?Leu?Thr1265????????????????1270????????????????1275????????????????1280Pro?Val?Ala?Thr?Ser?Thr?Asn?Leu?Ser?His?Arg?Leu?Lys?Asp?Thr?Ala
1285????????????????1290????????????????1295Thr?Gln?Met?Lys?Phe?Ser?Ser?Thr?Ser?Leu?Ile?Arg?Val?Ser?Arg?Phe
1300????????????????1305????????????????1310Ile?Thr?Met?Ser?Asn?Asp?Asn?Met?Ser?Ile?Lys?Glu?Ala?Asn?Glu?Thr
1315????????????????1320????????????????1325Lys?Asp?Thr?Asn?Leu?Ile?Tyr?Gln?Gln?Ile?Met?Leu?Thr?Gly?Leu?Ser
1330????????????????1335????????????????1340Val?Phe?Glu?Tyr?Leu?Phe?Arg?Leu?Lys?Glu?Thr?Thr?Gly?His?Asn?Pro1345????????????????1350????????????????1355????????????????1360Ile?Val?Met?His?Leu?His?Ile?Glu?Asp?Glu?Cys?Cys?Ile?Lys?Glu?Ser
1365????????????????1370????????????????1375Phe?Asn?Asp?Glu?His?Ile?Asn?Pro?Glu?Ser?Thr?Leu?Glu?Leu?Ile?Arg
1380????????????????1385????????????????1390Tyr?Pro?Glu?Ser?Asn?Glu?Phe?Ile?Tyr?Asp?Lys?Asp?Pro?Leu?Lys?Asp
1395????????????????1400????????????????1405Val?Asp?Leu?Ser?Lys?Leu?Met?Val?Ile?Lys?Asp?His?Ser?Tyr?Thr?Ile
1410????????????????1415????????????????1420Asp?Met?Asn?Tyr?Trp?Asp?Asp?Thr?Asp?Ile?Ile?His?Ala?Ile?Ser?Ile1425????????????????1430????????????????1435????????????????1440Cys?Thr?Ala?Ile?Thr?Ile?Ala?Asp?Thr?Met?Ser?Gln?Leu?Asp?Arg?Asp
1445????????????????1450????????????????1455Asn?Leu?Lys?Glu?Ile?Ile?Val?Ile?Ala?Asn?Asp?Asp?Asp?Ile?Asn?Ser
1460????????????????1465????????????????1470Leu?Ile?Thr?Glu?Phe?Leu?Thr?Leu?Asp?Ile?Leu?Val?Phe?Leu?Lys?Thr
1475????????????????1480????????????????1485Phe?Gly?Gly?Leu?Leu?Val?Asn?Gln?Phe?Ala?Tyr?Thr?Leu?Tyr?Ser?Leu
1490????????????????1495????????????????1500Lys?Ile?Glu?Gly?Arg?Asp?Leu?Ile?Trp?Asp?Tyr?Ile?Met?Arg?Thr?Leu1505????????????????1510????????????????1515????????????????1520Arg?Asp?Thr?Ser?His?Ser?Ile?Leu?Lys?Val?Leu?Ser?Asn?Ala?Leu?Ser
1525????????????????1530????????????????1535His?Pro?Lys?Val?Phe?Lys?Arg?Phe?Trp?Asp?Cys?Gly?Val?Leu?Asn?Pro
1540????????????????1545????????????????1550Ile?Tyr?Gly?Pro?Asn?Ile?Ala?Ser?Gln?Asp?Gln?Ile?Lys?Leu?Ala?Leu
1555????????????????1560????????????????1565Ser?Ile?Cys?Glu?Tyr?Ser?Leu?Asp?Leu?Phe?Met?Arg?Glu?Trp?Leu?Asn
1570????????????????1575????????????????1580Gly?Val?Ser?Leu?Glu?Ile?Tyr?Ile?Cys?Asp?Ser?Asp?Met?Glu?Val?Ala1585????????????????1590????????????????1595????????????????1600Asn?Asp?Arg?Lys?Gln?Ala?Phe?Ile?Ser?Arg?His?Leu?Ser?Phe?Val?Cys
1605????????????????1610????????????????1615Cys?Leu?Ala?Glu?Ile?Ala?Ser?Phe?Gly?Pro?Asn?Leu?Leu?Asn?Leu?Thr
1620????????????????1625????????????????1630Tyr?Leu?Glu?Arg?Leu?Asp?Leu?Leu?Lys?Gln?Tyr?Leu?Glu?Leu?Asn?Ile
1635????????????????1640????????????????1645Lys?Glu?Asp?Pro?Thr?Leu?Lys?Tyr?Val?Gln?Ile?Ser?Gly?Leu?Leu?Ile
1650????????????????1655????????????????1660Lys?Ser?Phe?Pro?Ser?Thr?Val?Thr?Tyr?Val?Arg?Lys?Thr?Ala?Ile?Lys1665????????????????1670????????????????1675????????????????1680Tyr?Leu?Arg?Ile?Arg?Gly?Ile?Ser?Pro?Pro?Glu?Val?Ile?Asp?Asp?Trp
1685????????????????1690????????????????1695Asp?Pro?Val?Glu?Asp?Glu?Asn?Met?Leu?Asp?Asn?Ile?Val?Lys?Thr?Ile
1700????????????????1705????????????????1710Asn?Asp?Asn?Cys?Asn?Lys?Asp?Asn?Lys?Gly?Asn?Lys?Ile?Asn?Asn?Phe
1715????????????????1720????????????????1725Trp?Gly?Leu?Ala?Leu?Lys?Asn?Tyr?Gln?Val?Leu?Lys?Ile?Arg?Ser?Ile
1730????????????????1735????????????????1740Thr?Ser?Asp?Ser?Asp?Asp?Asn?Asp?Arg?Leu?Asp?Ala?Asn?Thr?Ser?Gly1745????????????????1750????????????????1755????????????????1760Leu?Thr?Leu?Pro?Gln?Gly?Gly?Asn?Tyr?Leu?Ser?His?Gln?Leu?Arg?Leu
1765????????????????1770????????????????1775Phe?Gly?Ile?Asn?Ser?Thr?Ser?Cys?Leu?Lys?Ala?Leu?Glu?Leu?Ser?Gln
1780????????????????1785????????????????1790Ile?Leu?Met?Lys?Glu?Val?Asn?Lys?Asp?Lys?Asp?Arg?Leu?Phe?Leu?Gly
1795????????????????1800????????????????1805Glu?Gly?Ala?Gly?Ala?Met?Leu?Ala?Cys?Tyr?Asp?Ala?Thr?Leu?Gly?Pro
1810????????????????1815????????????????1820Ala?Val?Asn?Tyr?Tyr?Asn?Ser?Gly?Leu?Asn?Ile?Thr?Asp?Val?Ile?Gly1825????????????????1830????????????????1835????????????????1840Gln?Arg?Glu?Leu?Lys?Ile?Phe?Pro?Ser?Glu?Val?Ser?Leu?Val?Gly?Lys
1845????????????????1850????????????????1855Lys?Leu?Gly?Asn?Val?Thr?Gln?Ile?Leu?Asn?Arg?Val?Lys?Val?Leu?Phe
1860????????????????1865????????????????1870Asn?Gly?Asn?Pro?Asn?Ser?Thr?Trp?Ile?Gly?Asn?Met?Glu?Cys?Glu?Ser
1875????????????????1880????????????????1885Leu?Ile?Trp?Ser?Glu?Leu?Asn?Asp?Lys?Ser?Ile?Gly?Leu?Val?His?Cys
1890????????????????1895????????????????1900Asp?Met?Glu?Gly?Ala?Ile?Gly?Lys?Ser?Glu?Glu?Thr?Val?Leu?His?Glu1905????????????????1910???????????????1915????????????????1920His?Tyr?Ser?Val?Ile?Arg?Ile?Thr?Tyr?Leu?Ile?Gly?Asp?Asp?Asp?Val
1925????????????????1930????????????????1935Val?Leu?Val?Ser?Lys?Ile?Ile?Pro?Thr?Ile?Thr?Pro?Asn?Trp?Ser?Arg
1940????????????????1945????????????????1950Ile?Leu?Tyr?Leu?Tyr?Lys?Leu?Tyr?Trp?Lys?Asp?Val?Ser?Ile?Ile?Ser
1955????????????????1960????????????????1965Leu?Lys?Thr?Ser?Asn?Pro?Ala?Ser?Thr?Glu?Leu?Tyr?Leu?Ile?Ser?Lys
1970????????????????1975????????????????1980Asp?Ala?Tyr?Cys?Thr?Ile?Met?Glu?Pro?Ser?Glu?Ile?Val?Leu?Ser?Lys1985????????????????1990????????????????1995????????????????2000Leu?Lys?Arg?Leu?Ser?Leu?Leu?Glu?Glu?Asn?Asn?Leu?Leu?Lys?Trp?Ile
2005????????????????2010????????????????2015Ile?Leu?Ser?Lys?Lys?Arg?Asn?Asn?Glu?Trp?Leu?His?His?Glu?Ile?Lys
2020????????????????2025????????????????2030Glu?Gly?Glu?Arg?Asp?Tyr?Gly?Ile?Met?Arg?Pro?Tyr?His?Met?Ala?Leu
2035????????????????2040????????????????2045Gln?Ile?Phe?Gly?Phe?Gln?Ile?Asn?Leu?Asn?His?Leu?Ala?Lys?Glu?Phe
2050????????????????2055????????????????2060Leu?Ser?Thr?Pro?Asp?Leu?Thr?Asn?Ile?Asn?Asn?Ile?Ile?Gln?Ser?Phe2065????????????????2070????????????????2075????????????????2080Gln?Arg?Thr?Ile?Lys?Asp?Val?Leu?Phe?Glu?Trp?Ile?Asn?Ile?Thr?His
2085????????????????2090????????????????2095Asp?Asp?Lys?Arg?His?Lys?Leu?Gly?Gly?Arg?Tyr?Asn?Ile?Phe?Pro?Leu
2100????????????????2105????????????????2110Lys?Asn?Lys?Gly?Lys?Leu?Arg?Leu?Leu?Ser?Arg?Arg?Leu?Val?Leu?Ser
2115????????????????2120????????????????2125Trp?Ile?Ser?Leu?Ser?Leu?Ser?Thr?Arg?Leu?Leu?Thr?Gly?Arg?Phe?Pro
2130????????????????2135????????????????2140Asp?Glu?Lys?Phe?Glu?His?Arg?Ala?Gln?Thr?Gly?Tyr?Val?Ser?Leu?Ala2145????????????????2150????????????????2155????????????????2160Asp?Thr?Asp?Leu?Glu?Ser?Leu?Lys?Leu?Leu?Ser?Lys?Asn?Ile?Ile?Lys
2165????????????????2170????????????????2175Asn?Tyr?Arg?Glu?Cys?Ile?Gly?Ser?Ile?Ser?Tyr?Trp?Phe?Leu?Thr?Lys
2180????????????????2185????????????????2190Glu?Val?Lys?Ile?Leu?Met?Lys?Leu?Ile?Gly?Gly?Ala?Lys?Leu?Leu?Gly
2195????????????????2200????????????????2205
Ile?Pro?Arg?Gln?Tyr?Lys?Glu?Pro?Glu?Asp?Gln?Leu?Leu?Glu?Asn?Tyr
2210????????????????2215????????????????2220
Asn?Gln?His?Asp?Glu?Phe?Asp?Ile?Asp
The information of 2,225 2230 (2) SEQ ID NO:21:
(i) sequence signature:
(A) length: 15462 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: ACCAAACAAG AGAAGAAACT TGCTTGGTAA TATAAATTTA ACTTAAAATT AACTTAGGAT 60 TTAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120 TATTTGATAC ATTTAATGCA CGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180 TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240 ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGAAAC 300 AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT GCCAATCCAG 360 AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGCCAA GTATGTCATA TACATGATTG 420 AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA GAGATGATAT 480 ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG GAAACTATGT 540 TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT GGGTATCCAT 600 CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT ATCACTAGTA 660 TCTCAGGGTT AAGAAAAGGC TTTTTCACCC GATTGGAAGC TTTCAGACAA GATGGAACAG 720 TGCAGGCAGG GCTGGTATTG AGCGGTGACA CAGTGGATCA GATTGGGTCA ATCATGCGGT 780 CTCAACAGAG CTTGGTAACT CTTATGGTTG AAACATTAAT AACAATGAAT ACCAGCAGAA 840 ATGACCTCAC AACCATAGAA AAGAATATAC AAATTGTTGG CAACTACATA AGAGATGCAG 900 GTCTCGCTTC ATTCTTCAAT ACAATCAGAT ATGGAATTGA GACCAGAATG GCAGCTTTGA 960 CTCTATCCAC TCTCAGACCA GATATCAATA GATTAAAAGC TTTGATGGAA CTGTATTTAT 1020 CAAAGGGACC ACGCGCTCCT TTCATCTGTA TCCTCAGAGA TCCTATACAT GGTGAGTTCG 1080 CACCAGGCAA CTATCCTGCC ATATGGAGCT ATGCAATGGG GGTGGCAGTT GTACAAAATA 1140 GAGCCATGCA ACAGTATGTG ACGGGAAGAT CATATCTAGA CATTGATATG TTCCAGCTAG 1200 GACAAGCAGT AGCACGTGAT GCCGAAGCTC AAATGAGCTC AACACTGGAA GATGAACTTG 1260 GAGTGACACA CGAAGCTAAA GAAAGCTTGA AGAGACATAT AAGGAACATA AACAGTTCAG 1320 AGACATCTTT CCACAAACCG ACAGGTGGAT CAGCCATAGA GATGGCAATA GATGAAGAGC 1380 CAGAACAATT CGAACATAGA GCAGATCAAG AACAAAATGG AGAACCTCAA TCATCCATAA 1440 TTCAATATGC CTGGGCAGAA GGAAATAGAA GCGATGATCA GACTGAGCAA GCTACAGAAT 1500 CTGACAATAT CAAGACCGAA CAACAAAACA TCAGAGACAG ACTAAACAAG AGACTCAACG 1560 ACAAGAAGAA ACAAAGCAGT CAACCACCCA CTAATCCCAC AAACAGAACA AACCAGGACG 1620 AAATAGATGA TCTGTTTAAC GCATTTGGAA GCAACTAATC GAATCAACAT TTTAATCTAA 1680 ATCAATAATA AATAAGAAAA ACTTAGGATT AAAGAATCCT ATCATACCGG AATATAGGGT 1740 GGTAAATTTA GAGTCTGCTT GAAACTCAAT CAATAGAGAG TTGATGGAAA GCGATGCTAA 1800 AAACTATCAA ATCATGGATT CTTGGGAAGA GGAATCAAGA GATAAATCAA CTAATATCTC 1860 CTCGGCCCTC AACATCATTG AATTCATACT CAGCACCGAC CCCCAAGAAG ACTTATCGGA 1920 AAACGACACA ATCAACACAA GAACCCAGCA ACTCAGTGCC ACCATCTGTC AACCAGAAAT 1980 CAAACCAACA GAAACAAGTG AGAAAGATAG TGGATCAACT GACAAAAATA GACAGTCTGG 2040 GTCATCACAC GAATGTACAA CAGAAGCAAA AGATAGAAAC ATTGATCAGG AAACTGTACA 2100 GAGAGGACCT GGGAGAAGAA GCAGCTCAGA TAGTAGAGCT GAGACTGTGG TCTCTGGAGG 2160 AATCCCCAGA AGCATCACAG ATTCTAAAAA TGGAACCCAA AACACGGAGG ATATTGATCT 2220 CAATGAAATT AGAAAGATGG ATAAGGACTC TATTGAGGGG AAAATGCGAC AATCTGCAAA 2280 TGTTCCAAGC GAGATATCAG GAAGTGATGA CATATTTACA ACAGAACAAA GTAGAAACAG 2340 TGATCATGGA AGAAGCCTGG AATCTATCAG TACACCTGAT ACAAGATCAA TAAGTGTTGT 2400 TACTGCTGCA ACACCAGATG ATGAAGAAGA AATACTAATG AAAAATAGTA GGACAAAGAA 2460 AAGTTCTTCA ACACATCAAG AAGATGACAA AAGAATTAAA AAAGGGGGAA AAGGGAAAGA 2520 CTGGTTTAAG AAATCAAAAG ATACCGACAA CCAGATACCA ACATCAGACT ACAGATCCAC 2580 ATCAAAAGGG CAGAAGAAAA TCTCAAAGAC AACAACCACC AACACCGACA CAAAGGGGCA 2640 AACAGAAATA CAGACAGAAT CATCAGAAAC ACAATCCTCA TCATGGAATC TCATCATCGA 2700 CAACAACACC GACCGGAACG AACAGACAAG CACAACTCCT CCAACAACAA CTTCCAGATC 2760 AACTTATACA AAAGAATCGA TCCGAACAAA CTCTGAATCC AAACCCAAGA CACAAAAGAC 2820 AAATGGAAAG GAAAGGAAGG ATACAGAAGA GAGCAATCGA TTTACAGAGA GGGCAATTAC 2880 TCTATTGCAG AATCTTGGTG TAATTCAATC CACATCAAAA CTAGATTTAT ATCAAGACAA 2940 ACGAGTTGTA TGTGTAGCAA ATGTACTAAA CAATGTAGAT ACTGCATCAA AGATAGATTT 3000 CCTGGCAGGA TTAGTCATAG GGGTTTCAAT GGACAACGAC ACAAAATTAA CACAGATACA 3060 AAATGAAATG CTAAACCTCA AAGCAGATCT AAAGAAAATG GACGAATCAC ATAGAAGATT 3120 GATAGAAAAT CAAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 3180 TATGACTGAG AGAGGAGGAA AGAAAGACCA AAATGAATCC AATGAGAGAG TATCCATGAT 3240 CAAAACAAAA TTGAAAGAAG AAAAGATCAA GAAGACCAGG TTTGACCCAC TTATGGAGGC 3300 ACAAGGCATT GACAAGAATA TACCCGATCT ATATCGACAT GCAGGAGATA CACTAGAGAA 3360 CGATGTACAA GTTAAATCAG AGATATTAAG TTCATACAAT GAGTCAAATG CAACAAGACT 3420 AATACCCAAA AAAGTGAGCA GTACAATGAG ATCACTAGTT GCAGTCATCA ACAACAGCAA 3480 TCTCTCACAA AGCACAAAAC AATCATACAT AAACGAACTC AAACGTTGCA AAAATGATGA 3540 AGAAGTATCT GAATTAATGG ACATGTTCAA TGAAGATGTC AACAATTGCC AATGATCCAA 3600 CAAAGAAACG ACACCGAACA AACAGACAAG AAACAACAGT AGATCAAAAC CTGTCAACAC 3660 ACACAAAATC AAGCAGAATG AAACAACAGA TATCAATCAA TATACAAATA AGAAAAACTT 3720 AGGATTAAAG AATAAATTAA TCCTTGTCCA AAATGAGTAT AACTAACTCT GCAATATACA 3780 CATTCCCAGA ATCATCATTC TCTGAAAATG GTCATATAGA ACCATTACCA CTCAAAGTCA 3840 ATGAACAGAG GAAAGCAGTA CCCCACATTA GAGTTGCCAA GATCGGAAAT CCACCAAAAC 3900 ACGGATCCCG GTATTTAGAT GTCTTCTTAC TCGGCTTCTT CGAGATGGAA CGAATCAAAG 3960 ACAAATACGG GAGTGTGAAT GATCTCGACA GTGACCCGAG TTACAAAGTT TGTGGCTCTG 4020 GATCATTACC AATCGGATTG GCTAAGTACA CTGGGAATGA CCAGGAATTG TTACAAGCCG 4080 CAACCAAACT GGATATAGAA GTGAGAAGAA CAGTCAAAGC GAAAGAGATG GTTGTTTACA 4140 CGGTACAAAA TATAAAACCA GAACTGTACC CATGGTCCAA TAGACTAAGA AAAGGAATGC 4200 TGTTCGATGC CAACAAAGTT GCTCTTGCTC CTCAATGTCT TCCACTAGAT AGGAGCATAA 4260 AATTTAGAGT AATCTTCGTG AATTGTACGG CAATTGGATC AATAACCTTG TTCAAAATTC 4320 CTAAGTCAAT GGCATCACTA TCTCTAACCA ACACAATATC AATCAATCTG CAGGTACACA 4380 TAAAAACAGG GGTTCAGACT GATTCTAAAG GGATAGTTCA AATTTTGGAT GAGAAAGGCG 4440 AAAAATCACT GAATTTCATG GTCCATCTCG GATTGATCAA AAGAAAAGTA GGCAGAATGT 4500 ACTCTGTTGA ATACTGTAAA CAGAAAATCG AGAAAATGAG ATTGATATTT TCTTTAGGAC 4560 TAGTTGGAGG AATCAGTCTT CATGTCAATG CAACTGGGTC CATATCAAAA ACACTAGCAA 4620 GTCAGCTGGT ATTCAAAAGA GAGATTTGTT ATCCTTTAAT GGATCTAAAT CCGCATCTCA 4680 ATCTAGTTAT CTGGGCTTCA TCAGTAGAGA TTACAAGAGT GGATGCAATT TTCCAACCTT 4740 CTTTACCTGG CGAGTTCAGA TACTATCCTA ATATTATTGC AAAAGGAGTT GGGAAAATCA 4800 AACAATGGAA CTAGTAATCT CTATTTTAGT CCGGACGTAT CTATTAAGCC GAAGCAAATA 4860 AAGGATAATC AAAAACTTAG GACAAAAGAG GTCAATACCA ACAACTATTA GCAGTCACAC 4920 TCGCAAGAAT AAGAGAGAAG GGACCAAAAA AGTCAAATAG GAGAAATCAA AACAAAAGGT 4980 ACAGAACACC AGAACAACAA AATCAAAACA TCCAACTCAC TCAAAACAAA AATTCCAAAA 5040 GAGACCGGCA ACACAACAAG CACTGAACAC AATGCCAACT TCAATACTGC TAATTATTAC 5100 AACCATGATC ATGGCATCTT TCTGCCAAAT AGATATCACA AAACTACAGC ACGTAGGTGT 5160 ATTGGTCAAC AGTCCCAAAG GGATGAAGAT ATCACAAAAC TTTGAAACAA GATATCTAAT 5220 TTTGAGCCTC ATACCAAAAA TAGAAGACTC TAACTCTTGT GGTGACCAAC AGATCAAGCA 5280 ATACAAGAAG TTATTGGATA GACTGATCAT CCCTTTATAT GATGGATTAA GATTACAGAA 5340 AGATGTGATA GTAACCAATC AAGAATCCAA TGAAAACACT GATCCCAGAA CAAAACGATT 5400 CTTTGGAGGG GTAATTGGAA CCATTGCTCT GGGAGTAGCA ACCTCAGCAC AAATTACAGC 5460 GGCAGTTGCT CTGGTTGAAG CCAAGCAGGC AAGATCAGAC ATCGAAAAAC TCAAAGAAGC 5520 AATTAGGGAC ACAAATAAAG CAGTGCAGTC AGTTCAGAGC TCCATAGGAA ATTTAATAGT 5580 AGCAATTAAA TCAGTCCAGG ATTATGTTAA CAAAGAAATC GTGCCATCGA TTGCGAGGCT 5640 AGGTTGTGAA GCAGCAGGAC TTCAATTAGG AATTGCATTA ACACAGCATT ACTCAGAATT 5700 AACAAACATA TTTGGTGATA ACATAGGATC GTTACAAGAA AAAGGAATAA AATTACAAGG 5760 TATAGCATCA TTATACCGCA CAAATATCAC AGAAATATTC ACAACATCAA CAGTTGATAA 5820 ATATGATATC TATGATCTGT TATTTACAGA ATCAATAAAG GTGAGAGTTA TAGATGTTGA 5880 CTTGAATGAT TACTCAATCA CCCTCCAAGT CAGACTCCCT TTATTAACTA GGCTGCTGAA 5940 CACTCAGATC TACAAAGTAG ATTCCATATC ATATAACATC CAAAACAGAG AATGGTATAT 6000 CCCTCTTCCC AGCCATATCA TGACGAAAGG GGCATTTCTA GGTGGAGCAG ACGTCAAAGA 6060 ATGTATAGAA GCATTCAGCA GCTATATATG CCCTTCTGAT CCAGGATTTG TATTAAACCA 6120 TGAAATAGAG AGCTGCTTAT CAGGAAACAT ATCCCAATGT CCAAGAACAA CGGTCACATC 6180 AGACATTGTT CCAAGATATG CATTTGTCAA TGGAGGAGTG GTTGCAAACT GTATAACAAC 6240 CACCTGTACA TGCAACGGAA TTGGTAATAG AATCAATCAA CCACCTGATC AAGGAGTAAA 6300 AATTATAACA CATAAAGAAT GTAGTACAGT AGGTATCAAC GGAATGCTGT TCAATACAAA 6360 TAAAGAAGGA ACTCTTGCAT TCTATACACC AAATGATATA ACACTAAACA ATTCTGTTAC 6420 ACTTGATCCA ATTGACATAT CAATCGAGCT CAACAAGGCC AAATCAGATC TAGAAGAATC 6480 AAAAGAATGG ATAAGAAGGT CAAATCAAAA ACTAGATTCT ATTGGAAATT GGCATCAATC 6540 TAGCACTACA ATCATAATTA TTTTGATAAT GATCATTATA TTGTTTATAA TTAATATAAC 6600 GATAATTACA ATTGCAATTA AGTATTACAG AATTCAAAAG AGAAATCGAG TGGATCAAAA 6660 TGACAAGCCA TATGTACTAA CAAACAAATA ACATATCTAC AGATCATTAG ATATTAAAAT 6720 TATAAAAAAC TTAGGAGTAA AGTTACGCAA TCCAACTCTA CTCATATAAT TGAGGAAGGA 6780 CCCAATAGAC AAATCCAAAT TCGAGATGGA ATACTGGAAG CATACCAATC ACGGAAAGGA 6840 TGCTGGCAAT GAGCTGGAGA CGTCTATGGC TACTCATGGC AACAAGCTCA CTAATAAGAT 6900 AATATACATA TTATGGACAA TAATCCTGGT GTTATTATCA ATAGTCTTCA TCATAGTGCT 6960 AATTAATTCC ATCAAAAGTG AAAAGGCCCA CGAATCATTG CTGCAAGACA TAAATAATGA 7020 GTTTATGGAA ATTACAGAAA AGATCCAAAT GGCATCGGAT AATACCAATG ATCTAATACA 7080 GTCAGGAGTG AATACAAGGC TTCTTACAAT TCAGAGTCAT GTCCAGAATT ACATACCAAT 7140 ATCATTGACA CAACAGATGT CAGATCTTAG GAAATTCATT AGTGAAATTA CAATTAGAAA 7200 TGATAATCAA GAAGTGCTGC CACAAAGAAT AACACATGAT GTAGGTATAA AACCTTTAAA 7260 TCCAGATGAT TTTTGGAGAT GCACGTCTGG TCTTCCATCT TTAATGAAAA CTCCAAAAAT 7320 AAGGTTAATG CCAGGGCCGG GATTATTAGC TATGCCAACG ACTGTTGATG GCTGTGTTAG 7380 AACTCCGTCT TTAGTTATAA ATGATCTGAT TTATGCTTAT ACCTCAAATC TAATTACTCG 7440 AGGTTGTCAG GATATAGGAA AATCATATCA AGTCTTACAG ATAGGGATAA TAACTGTAAA 7500 CTCAGACTTG GTACCTGACT TAAATCCTAG GATCTCTCAT ACCTTTAACA TAAATGACAA 7560 TAGGAAGTCA TGTTCTCTAG CACTCCTAAA TACAGATGTA TATCAACTGT GTTCAACTCC 7620 CAAAGTTGAT GAAAGATCAG ATTATGCATC ATCAGGCATA GAAGATATTG TACTTGATAT 7680 TGTCAATTAT GATGGTTCAA TCTCAACAAC AAGATTTAAG AATAATAACA TAAGCTTTGA 7740 TCAACCATAT GCTGCACTAT ACCCATCTGT TGGACCAGGG ATATACTACA AAGGCAAAAT 7800 AATATTTCTC GGGTATGGAG GTCTTGAACA TCCAATAAAT GAGAATGTAA TCTGCAACAC 7860 AACTGGGTGC CCCGGGAAAA CACAGAGAGA CTGTAATCAA GCGTCTCATA GTCCATGGTT 7920 TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTTGCTGAC AAAGGCTTAA ACTCAATTCC 7980 AAAATTGAAA GTATGGACGA TATCTATGCG ACAAAATTAC TGGGGGTCAG AAGGAAGGTT 8040 ACTTCTACTA GGTAACAAGA TCTATATATA TACAAGATCT ACAAGTTGGC ATAGCAAGTT 8100 ACAATTAGGA ATAATTGATA TTACTGATTA CAGTGATATA AGGATAAAAT GGACATGGCA 8160 TAATGTGCTA TCAAGACCAG GAAACAATGA ATGTCCATGG GGACATTCAT GTCCAGATGG 8220 ATGTATAACA GGAGTATATA CTGATGCATA TCCACTCAAT CCCACAGGGA GCATTGTGTC 8280 ATCTGTCATA TTAGACTCAC AAAAATCGAG AGTGAACCCA GTCATAACTT ACTCAACAGC 8340 AACCGAAAGA GTAAACGAGC TGGCCATCCT AAACAGAACA CTCTCAGCTG GATATACAAC 8400 AACAAGCTGC ATTACACACT ATAACAAAGG ATATTGTTTT CATATAGTAG AAATAAATCA 8460 TAAAAGCTTA AACACATTTC AACCCATGTT GTTCAAAACA GAGATTCCAA AAAGCTGCAG 8520 TTAATCATAA TTAACCATAA TATGCATCAA TCTATCTATA ATACAAGTAT ATGATAAGTA 8580 ATCAGCAATC AGACAATAGA CAAAAGGGAA ATATAAAAAA CTTAGGAGCA AAGCGTGCTC 8640 GGGAAATGGA CACTGAATCT AACAATGGCA CTGTATCTGA CATACTCTAT CCTGAGTGTC 8700 ACCTTAACTC TCCTATCGTT AAAGGTAAAA TAGCACAATT ACACACTATT ATGAGTCTAC 8760 CTCAGCCTTA TGATATGGAT GACGACTCAA TACTAGTTAT CACTAGACAG AAAATAAAAC 8820 TTAATAAATT GGATAAAAGA CAACGATCTA TTAGAAGATT AAAATTAATA TTAACTGAAA 8880 AAGTGAATGA CTTAGGAAAA TACACATTTA TCAGATATCC AGAAATGTCA AAAGAAATGT 8940 TCAAATTATA TATACCTGGT ATTAACAGTA AAGTGACTGA ATTATTACTT AAAGCAGATA 9000 GAACATATAG TCAAATGACT GATGGATTAA GAGATCTATG GATTAATGTG CTATCAAAAT 9060 TAGCCTCAAA AAATGATGGA AGCAATTATG ATCTTAATGA AGAAATTAAT AATATATCGA 9120 AAGTTCACAC AACCTATAAA TCAGATAAAT GGTATAATCC ATTCAAAACA TGGTTTACTA 9180 TCAAGTATGA TATGAGAAGA TTACAAAAAG CTCGAAATGA GATCACTTTT AATGTTGGGA 9240 AGGATTATAA CTTGTTAGAA GACCAGAAGA ATTTCTTATT GATACATCCA GAATTGGTTT 9300 TGATATTAGA TAAACAAAAC TACAATGGTT ATCTAATTAC TCCTGAATTA GTATTGATGT 9360 ATTGTGACGT AGTCGAAGGC CGATGGAATA TAAGTGCATG TGCTAAGTTA GATCCAAAAT 9420 TACAATCTAT GTATCAGAAA GGTAATAACC TGTGGGAAGT GATAGATAAA TTGTTTCCAA 9480 TTATGGGAGA AAAGACATTT GATGTGATAT CGTTATTAGA ACCACTTGCA TTATCCTTAA 9540 TTCAAACTCA TGATCCTGTT AAACAACTAA GAGGAGCTTT TTTAAATCAT GTGTTATCCG 9600 AGATGGAATT AATATTTGAA TCTAGAGAAT CGATTAAGGA ATTTCTGAGT GTAGATTACA 9660 TTGATAAAAT TTTAGATATA TTTAATAAGT CTACAATAGA TGAAATAGCA GAGATTTTCT 9720 CTTTTTTTAG AACATTTGGG CATCCTCCAT TAGAAGCTAG TATTGCAGCA GAAAAGGTTA 9780 GAAAATATAT GTATATTGGA AAACAATTAA AATTTGACAC TATTAATAAA TGTCATGCTA 9840 TCTTCTGTAC AATAATAATT AACGGATATA GAGAGAGGCA TGGTGGACAG TGGCCTCCTG 9900 TGACATTACC TGATCATGCA CACGAATTCA TCATAAATGC TTACGGTTCA AACTCTGCGA 9960 TATCATATGA GAATGCTGTT GATTATTACC AGAGCTTTAT AGGAATAAAA TTCAATAAAT 10020 TCATAGAGCC TCAGTTAGAT GAGGATTTGA CAATTTATAT GAAAGATAAA GCATTATCTC 10080 CAAAAAAATC AAATTGGGAC ACAGTTTATC CTGCATCTAA TTTACTGTAC CGTACTAACG 10140 CATCCAACGA ATCACGAAGA TTAGTTGAAG TATTTATAGC AGATAGTAAA TTTGATCCTC 10200 ATCAGATATT GGATTATGTA GAATCTGGGG ACTGGTTAGA TGATCCAGAA TTTAATATTT 10260 CTTATAGTCT TAAAGAAAAA GAGATCAAAC AGGAAGGTAG ACTCTTTGCA AAAATGACAT 10320 ACAAAATGAG AGCTACACAA GTTTTATCAG AGACACTACT TGCAAATAAC ATAGGAAAAT 10380 TCTTTCAAGA AAATGGGATG GTGAAGGGAG AGATTGAATT ACTTAAGAGA TTAACAACCA 10440 TATCAATATC AGGAGTTCCA CGGTATAATG AAGTGTACAA TAATTCTAAA AGCCATACAG 10500 ATGACCTTAA AACCTACAAT AAAATAAGTA ATCTTAATTT GTCTTCTAAT CAGAAATCAA 10560 AGAAATTTGA ATTCAAGTCA ACGGATATCT ACAATGATGG ATACGAGACT GTGAGCTGTT 10620 TCCTAACAAC AGATCTCAAA AAATACTGTC TTAATTGGAG ATATGAATCA ACAGCTCTAT 10680 TTGGAGAAAC TTGCAACCAA ATATTTGGAT TAAATAAATT GTTTAATTGG TTACACCCTC 10740 GTCTTGAAGG AAGTACAATC TATGTAGGTG ATCCTTACTG TCCTCCATCA GATAAAGAAC 10800 ATATATCATT AGAGGATCAC CCTGATTCTG GTTTTTACGT TCATAACCCA AGAGGGGGTA 10860 TAGAAGGATT TTGTCAAAAA TTATGGACAC TCATATCTAT AAGTGCAATA CATCTAGCAG 10920 CTGTTAGAAT AGGCGTGAGG GTGACTGCAA TGGTTCAAGG AGACAATCAA GCTATAGCTG 10980 TAACCACAAG AGTACCCAAC AATTATGACT ACAGAGTTAA GAAGGAGATA GTTTATAAAG 11040 ATGTAGTGAG ATTTTTTGAT TCATTAAGAG AAGTGATGGA TGATCTAGGT CATGAACTTA 11100 AATTAAATGA AACGATTATA AGTAGCAAGA TGTTCATATA TAGCAAAAGA ATCTATTATG 11160 ATGGGAGAAT TCTTCCTCAA GCTCTAAAAG CATTATCTAG ATGTGTCTTC TGGTCAGAGA 11220 CAGTAATAGA CGAAACAAGA TCAGCATCTT CAAATTTGGC AACATCATTT GCAAAAGCAA 11280 TTGAGAATGG TTATTCACCT GTTCTAGGAT ATGCATGCTC AATTTTTAAG AACATTCAAC 11340 AACTATATAT TGCCCTTGGG ATGAATATCA ATCCAACTAT AACACAGAAT ATCAGAGATC 11400 AGTATTTTAG GAATCCAAAT TGGATGCAAT ATGCCTCTTT AATACCTGCT AGTGTTGGGG 11460 GATTCAATCA CATGGCCATG TCAAGATGTT TTGTAAGGAA TATTGGTGAT CCATCAGTTG 11520 CCGCATTGGC TGATATTAAA AGATTTATTA AGGCGAATCT ATTAGACCGA AGTGTTCTTT 11580 ATAGGATTAT GAATCAAGAA CCAGGTGAGT CATCTTTTTT TGACTGGGCT TCAGATCCAT 11640 ATTCATGCAA TTTACCACAA TCTCAAAATA TAACCACCAT GATAAAAAAT ATAACAGCAA 11700 GGAATGTATT ACAAGATTCA CCAAATCCAT TATTATCTGG ATTATTCACA AATACAATGA 11760 TAGAAGAAGA TGAAGAATTA GCTGAGTTCC TGATGGACAG GAAGGTAATT CTCCCTAGAG 11820 TTGCACATGA TATTCTAGAT AATTCTCTCA CAGGAATTAG AAATGCCATA GCTGGAATGT 11880 TAGATACGAC AAAATCACTA ATTCGGGTTG GCATAAATAG AGGAGGACTG ACATATAGTT 11940 TGTTGAGGAA AATCAGTAAT TACGATCTAG TACAATATGA AACACTAAGT AGGACTTTGC 12000 GACTAATTGT AAGTGATAAA ATCAAGTATG AAGATATGTG TTCGGTAGAC CTTGCCATAG 12060 CATTGCGACA AAAGATGTGG ATTCATTTAT CAGGAGGAAG GATGATAAGT GGACTTGAAA 12120 CGCCTGACCC ATTAGAATTA CTATCTGGGG TAGTAATAAC AGGATCAGAA CATTGTAAAA 12180 TATGTTATTC TTCAGATGGC ACAAACCCAT ATACTTGGAT GTATTTACCC GGTAATATCA 12240 AAATAGGATC AGCAGAAACA GGTATATCGT CATTAAGAGT TCCTTATTTT GGATCAGTCA 12300 CTGATGAAAG ATCTGAAGCA CAATTAGGAT ATATCAAGAA TCTTAGTAAA CCTGCAAAAG 12360 CCGCAATAAG AATAGCAATG ATATATACAT GGGCATTTGG TAATGATGAG ATATCTTGGA 12420 TGGAAGCCTC ACAGATAGCA CAAACACGTG CAAATTTTAC ACTAGATAGT CTCAAAATTT 12480 TAACACCGGT AGCTACATCA ACAAATTTAT CACACAGATT TAAGGATACT GCAACTCAGA 12540 TGAAATTCTC CAGTACATCA TTGATCAGAG TCAGCAGATT TATAACAATG TCCAATGATA 12600 ACATGTCTAT CAAAGAAGCT AATGAAACCA AAGATACTAA TCTTATTTAT CAACAAATAA 12660 TGTTAACAGG ATTAAGTGTT TTCGAATATT TATTTAGATT AAAAGAAACC ACAGGACACA 12720 ACCCTATAGT TATGCATCTG CACATAGAAG ATGAGTGTTG TATTAAAGAA AGTTTTAATG 12780 ATGAACATAT TAATCCAGAG TCTACATTAG AATTAATTCG ATATCCTGAA AGTAATGAAT 12840 TTATTTATGA TAAAGACCCA CTCAAAGATG TGGACTTATC AAAACTTATG GTTATTAAAG 12900 ACCATTCTTA CACAATTGAT ATGAATTATT GGGATGATAC TGACATCATA CATGCAATTT 12960 CAATATGTAC TGCAATTACA ATAGCAGATA CTATGTCACA ATTAGATCGA GATAATTTAA 13020 AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG CTTAATCACT GAATTTTTGA 13080 CTCTTGACAT ACTTGTATTT CTCAAGACAT TTGGTGGATT ATTAGTAAAT CAATTTGCAT 13140 ACACTCTTTA TAGTCTAAAA ATAGAAGGTA GGGATCTCAT TTGGGATTAT ATAATGAGAA 13200 CACTGAGAGA TACTTCCCAT TCAATATTAA AAGTATTATC TAATGCATTA TCTCATCCTA 13260 AAGTATTCAA GAGGTTCTGG GATTGTGGAG TTTTAAACCC TATTTATGGT CCTAATATTG 13320 CTAGTCAAGA CCAGATAAAA CTTGCCCTAT CTATATGTGA ATATTCACTA GATCTATTTA 13380 TGAGAGAATG GTTGAATGGT GTATCACTTG AAATATACAT TTGTGACAGC GATATGGAAG 13440 TTGCAAATGA TAGGAAACAA GCCTTTATTT CTAGACACCT TTCATTTGTT TGTTGTTTAG 13500 CAGAAATTGC ATCTTTCGGA CCTAACCTGT TAAACTTAAC ATACTTGGAG AGACTTGATC 13560 TATTGAAACA ATATCTTGAA TTAAATATTA AAGAAGACCC TACTCTTAAA TATGTACAAA 13620 TATCTGGATT ATTAATTAAA TCGTTCCCAT CAACTGTAAC ATACGTAAGA AAGACTGCAA 13680 TCAAATATCT AAGGATTCGC GGTATTAGTC CACCTGAGGT AATTGATGAT TGGGATCCGG 13740 TAGAAGATGA AAATATGCTG GATAACATTG TCAAAACTAT AAATGATAAC TGTAATAAAG 13800 ATAATAAAGG GAATAAAATT AACAATTTCT GGGGACTAGC ACTTAAGAAC TATCAAGTCC 13860 TTAAAATCAG ATCTATAACA AGTGATTCTG ATGATAATGA TAGACTAGAT GCTAATACAA 13920 GTGGTTTGAC ACTTCCTCAA GGAGGGAATT ATCTATCGCA TCAATTGAGA TTATTCGGAA 13980 TCAACAGCAC TAGTTGTCTG AAAGCTCTTG AGTTATCACA AATTTTAATG AAGGAAGTCA 14040 ATAAAGACAA GGACAGGCTC TTCCTGGGAG AAGGAGCAGG AGCTATGCTA GCATGTTATG 14100 ATGCCACATT AGGACCTGCA GTTAATTATT ATAATTCAGG TTTGAATATA ACAGATGTAA 14160 TTGGTCAACG AGAATTGAAA ATATTTCCTT CAGAGGTATC ATTAGTAGGT AAAAAATTAG 14220 GAAATGTGAC ACAGATTCTT AACAGGGTAA AAGTACTGTT CAATGGGAAT CCTAATTCAA 14280 CATGGATAGG AAATATGGAA TGTGAGAGCT TAATATGGAG TGAATTAAAT GATAAGTCCA 14340 TTGGATTAGT ACATTGTGAT ATGGAAGGAG CTATCGGTAA ATCAGAAGAA ACTGTTCTAC 14400 ATGAACATTA TAGTGTTATA AGAATTACAT ACTTGATTGG GGATGATGAT GTTGTTTTAG 14460 TTTCCAAAAT TATACCTACA ATCACTCCGA ATTGGTCTAG AATACTTTAT CTATATAAAT 14520 TATATTGGAA AGATGTAAGT ATAATATCAC TCAAAACTTC TAATCCTGCA TCAACAGAAT 14580 TATATCTAAT TTCGAAAGAT GCATATTGTA CTATAATGGA ACCTAGTGAA ATTGTTTTAT 14640 CAAAACTTAA AAGATTGTCA CTCTTGGAAG AAAATAATCT ATTAAAATGG ATCATTTTAT 14700 CAAAGAAGAG GAATAATGAA TGGTTACATC ATGAAATCAA AGAAGGAGAA AGAGATTATG 14760 GAATCATGAG ACCATATCAT ATGGCACTAC AAATCTTTGG ATTTCAAATC AATTTAAATC 14820 ATCTGGCGAA AGAATTTTTA TCAACCCCAG ATCTGACTAA TATCAACAAT ATAATCCAAA 14880 GTTTTCAGCG AACAATAAAG GATGTTTTAT TTGAATGGAT TAATATAACT CATGATGATA 14940 AGAGACATAA ATTAGGCGGA AGATATAACA TATTCCCACT GAAAAATAAG GGAAAGTTAA 15000 GACTGCTATC GAGAAGACTA GTATTAAGTT GGATTTCATT ATCATTATCG ACTCGATTAC 15060 TTACAGGTCG CTTTCCTGAT GAAAAATTTG AACATAGAGC ACAGACTGGA TATGTATCAT 15120 TAGCTGATAC TGATTTAGAA TCATTAAAGT TATTGTCGAA AAACATCATT AAGAATTACA 15180 GAGAGTGTAT AGGATCAATA TCATATTGGT TTCTAACCAA AGAAGTTAAA ATACTTATGA 15240 AATTGATTGG TGGTGCTAAA TTATTAGGAA TTCCCAGACA ATATAAAGAA CCCGAAGACC 15300 AGTTATTAGA AAACTACAAT CAACATGATG AATTTGATAT CGATTAAAAC ATAAATACAA 15360 TGAAGATATA TCCTAACCTT TATCTTTAAG CCTAGGAATA GACAAAAAGT AAGAAAAACA 15420 TGTAATATAT ATATACCAAA CAGAGTTCTT CTCTTGTTTG GT 15462 (2) SEQ ID NO: 22 information about: ...
(i) sequence signature:
(A) length: 2233 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:22:Met Asp Thr Glu Ser Asn Asn Gly Thr Val Ser Asp Ile Leu Tyr Pro1 5 10 15Glu Cys His Leu Asn Ser Pro Ile Val Lys Gly Lys Ile Ala Gln Leu
20??????????????????25??????????????????30His?Thr?Ile?Met?Ser?Leu?Pro?Gln?Pro?Tyr?Asp?Met?Asp?Asp?Asp?Ser
35??????????????????40??????????????????45Ile?Leu?Val?Ile?Thr?Arg?Gln?Lys?Ile?Lys?Leu?Asn?Lys?Leu?Asp?Lys
50??????????????????55??????????????????60Arg?Gln?Arg?Ser?Ile?Arg?Arg?Leu?Lys?Leu?Ile?Leu?Thr?Glu?Lys?Val65??????????????????70??????????????????75??????????????????80Asn?Asp?Leu?Gly?Lys?Tyr?Thr?Phe?Ile?Arg?Tyr?Pro?Glu?Met?Ser?Lys
85??????????????????90??????????????????95Glu?Met?Phe?Lys?Leu?Tyr?Ile?Pro?Gly?Ile?Asn?Ser?Lys?Val?Thr?Glu
100?????????????????105?????????????????110Leu?Leu?Leu?Lys?Ala?Asp?Arg?Thr?Tyr?Ser?Gln?Met?Thr?Asp?Gly?Leu
115?????????????????120?????????????????125Arg?Asp?Leu?Trp?Ile?Asn?Val?Leu?Ser?Lys?Leu?Ala?Ser?Lys?Asn?Asp
130?????????????????135?????????????????140Gly?Ser?Asn?Tyr?Asp?Leu?Asn?Glu?Glu?Ile?Asn?Asn?Ile?Ser?Lys?Val145?????????????????150?????????????????155?????????????????160His?Thr?Thr?Tyr?Lys?Ser?Asp?Lys?Trp?Tyr?Asn?Pro?Phe?Lys?Thr?Trp
165?????????????????170?????????????????175Phe?Thr?Ile?Lys?Tyr?Asp?Met?Arg?Arg?Leu?Gln?Lys?Ala?Arg?Asn?Glu
180?????????????????185?????????????????190Ile?Thr?Phe?Asn?Val?Gly?Lys?Asp?Tyr?Asn?Leu?Leu?Glu?Asp?Gln?Lys
195?????????????????200?????????????????205Asn?Phe?Leu?Leu?Ile?His?Pro?Glu?Leu?Val?Leu?Ile?Leu?Asp?Lys?Gln
210?????????????????215?????????????????220Asn?Tyr?Asn?Gly?Tyr?Leu?Ile?Thr?Pro?Glu?Leu?Val?Leu?Met?Tyr?Cys225?????????????????230?????????????????235?????????????????240Asp?Val?Val?Glu?Gly?Arg?Trp?Asn?Ile?Ser?Ala?Cys?Ala?Lys?Leu?Asp
245?????????????????250?????????????????255Pro?Lys?Leu?Gln?Ser?Met?Tyr?Gln?Lys?Gly?Asn?Asn?Leu?Trp?Glu?Val
260?????????????????265?????????????????270Ile?Asp?Lys?Leu?Phe?Pro?Ile?Met?Gly?Glu?Lys?Thr?Phe?Asp?Val?Ile
275?????????????????280?????????????????285Ser?Leu?Leu?Glu?Pro?Leu?Ala?Leu?Ser?Leu?Ile?Gln?Thr?His?Asp?Pro
290?????????????????295?????????????????300Val?Lys?Gln?Leu?Arg?Gly?Ala?Phe?Leu?Asn?His?Val?Leu?Ser?Glu?Met305?????????????????310?????????????????315?????????????????320Glu?Leu?Ile?Phe?Glu?Ser?Arg?Glu?Ser?Ile?Lys?Glu?Phe?Leu?Ser?Val
325?????????????????330?????????????????335Asp?Tyr?Ile?Asp?Lys?Ile?Leu?Asp?Ile?Phe?Asn?Lys?Ser?Thr?Ile?Asp
340?????????????????345?????????????????350Glu?Ile?Ala?Glu?Ile?Phe?Ser?Phe?Phe?Arg?Thr?Phe?Gly?His?Pro?Pro
355?????????????????360?????????????????365Leu?Glu?Ala?Ser?Ile?Ala?Ala?Glu?Lys?Val?Arg?Lys?Tyr?Met?Tyr?Ile
370?????????????????375?????????????????380Gly?Lys?Gln?Leu?Lys?Phe?Asp?Thr?Ile?Asn?Lys?Cys?His?Ala?Ile?Phe385?????????????????390?????????????????395?????????????????400Cys?Thr?Ile?Ile?Ile?Asn?Gly?Tyr?Arg?Glu?Arg?His?Gly?Gly?Gln?Trp
405?????????????????410?????????????????415Pro?Pro?Val?Thr?Leu?Pro?Asp?His?Ala?His?Glu?Phe?Ile?Ile?Asn?Ala
420?????????????????425?????????????????430Tyr?Gly?Ser?Asn?Ser?Ala?Ile?Ser?Tyr?Glu?Asn?Ala?Val?Asp?Tyr?Tyr
435?????????????????440?????????????????445Gln?Ser?Phe?Ile?Gly?Ile?Lys?Phe?Asn?Lys?Phe?Ile?Glu?Pro?Gln?Leu
450?????????????????455?????????????????460Asp?Glu?Asp?Leu?Thr?Ile?Tyr?Met?Lys?Asp?Lys?Ala?Leu?Ser?Pro?Lys465?????????????????470?????????????????475?????????????????480Lys?Ser?Asn?Trp?Asp?Thr?Val?Tyr?Pro?Ala?Ser?Asn?Leu?Leu?Tyr?Arg
485?????????????????490?????????????????495Thr?Asn?Ala?Ser?Asn?Glu?Ser?Arg?Arg?Leu?Val?Glu?Val?Phe?Ile?Ala
500?????????????????505?????????????????510Asp?Ser?Lys?Phe?Asp?Pro?His?Gln?Ile?Leu?Asp?Tyr?Val?Glu?Ser?Gly
515?????????????????520?????????????????525Asp?Trp?Leu?Asp?Asp?Pro?Glu?Phe?Asn?Ile?Ser?Tyr?Ser?Leu?Lys?Glu
530?????????????????535?????????????????540Lys?Glu?Ile?Lys?Gln?Glu?Gly?Arg?Leu?Phe?Ala?Lys?Met?Thr?Tyr?Lys545?????????????????550?????????????????555?????????????????560Met?Arg?Ala?Thr?Gln?Val?Leu?Ser?Glu?Thr?Leu?Leu?Ala?Asn?Asn?Ile
565?????????????????570?????????????????575Gly?Lys?Phe?Phe?Gln?Glu?Asn?Gly?Met?Val?Lys?Gly?Glu?Ile?Glu?Leu
580?????????????????585?????????????????590Leu?Lys?Arg?Leu?Thr?Thr?Ile?Ser?Ile?Ser?Gly?Val?Pro?Arg?Tyr?Asn
595?????????????????600?????????????????605Glu?Val?Tyr?Asn?Asn?Ser?Lys?Ser?His?Thr?Asp?Asp?Leu?Lys?Thr?Tyr
610?????????????????615?????????????????620Asn?Lys?Ile?Ser?Asn?Leu?Asn?Leu?Ser?Ser?Asn?Gln?Lys?Ser?Lys?Lys625?????????????????630?????????????????635?????????????????640Phe?Glu?Phe?Lys?Ser?Thr?Asp?Ile?Tyr?Asn?Asp?Gly?Tyr?Glu?Thr?Val
645?????????????????650?????????????????655Ser?Cys?Phe?Leu?Thr?Thr?Asp?Leu?Lys?Lys?Tyr?Cys?Leu?Asn?Trp?Arg
660?????????????????665?????????????????670Tyr?Glu?Ser?Thr?Ala?Leu?Phe?Gly?Glu?Thr?Cys?Asn?Gln?Ile?Phe?Gly
675?????????????????680?????????????????685Leu?Asn?Lys?Leu?Phe?Asn?Trp?Leu?His?Pro?Arg?Leu?Glu?Gly?Ser?Thr
690?????????????????695?????????????????700Ile?Tyr?Val?Gly?Asp?Pro?Tyr?Cys?Pro?Pro?Ser?Asp?Lys?Glu?His?Ile705?????????????????710?????????????????715?????????????????720Ser?Leu?Glu?Asp?His?Pro?Asp?Ser?Gly?Phe?Tyr?Val?His?Asn?Pro?Arg
725?????????????????730?????????????????735Gly?Gly?Ile?Glu?Gly?Phe?Cys?Gln?Lys?Leu?Trp?Thr?Leu?Ile?Ser?Ile
740?????????????????745?????????????????750Ser?Ala?Ile?His?Leu?Ala?Ala?Val?Arg?Ile?Gly?Val?Arg?Val?Thr?Ala
755?????????????????760?????????????????765Met?Val?Gln?Gly?Asp?Asn?Gln?Ala?Ile?Ala?Val?Thr?Thr?Arg?Val?Pro
770?????????????????775?????????????????780Asn?Asn?Tyr?Asp?Tyr?Arg?Val?Lys?Lys?Glu?Ile?Val?Tyr?Lys?Asp?Val785?????????????????790?????????????????795?????????????????800Val?Arg?Phe?Phe?Asp?Ser?Leu?Arg?Glu?Val?Met?Asp?Asp?Leu?Gly?His
805?????????????????810?????????????????815Glu?Leu?Lys?Leu?Asn?Glu?Thr?Ile?Ile?Ser?Ser?Lys?Met?Phe?Ile?Tyr
820?????????????????825?????????????????830Ser?Lys?Arg?Ile?Tyr?Tyr?Asp?Gly?Arg?Ile?Leu?Pro?Gln?Ala?Leu?Lys
835?????????????????840?????????????????845Ala?Leu?Ser?Arg?Cys?Val?Phe?Trp?Ser?Glu?Thr?Val?Ile?Asp?Glu?Thr
850?????????????????855?????????????????860Arg?Ser?Ala?Ser?Ser?Asn?Leu?Ala?Thr?Ser?Phe?Ala?Lys?Ala?Ile?Glu865?????????????????870?????????????????875?????????????????880Asn?Gly?Tyr?Ser?Pro?Val?Leu?Gly?Tyr?Ala?Cys?Ser?Ile?Phe?Lys?Asn
885?????????????????890?????????????????895Ile?Gln?Gln?Leu?Tyr?Ile?Ala?Leu?Gly?Met?Asn?Ile?Asn?Pro?Thr?Ile
900?????????????????905?????????????????910Thr?Gln?Asn?Ile?Arg?Asp?Gln?Tyr?Phe?Arg?Asn?Pro?Asn?Trp?Met?Gln
915?????????????????920?????????????????925Tyr?Ala?Ser?Leu?Ile?Pro?Ala?Ser?Val?Gly?Gly?Phe?Asn?His?Met?Ala
930?????????????????935?????????????????940Met?Ser?Arg?Cys?Phe?Val?Arg?Asn?Ile?Gly?Asp?Pro?Ser?Val?Ala?Ala945?????????????????950?????????????????955?????????????????960Leu?Ala?Asp?Ile?Lys?Arg?Phe?Ile?Lys?Ala?Asn?Leu?Leu?Asp?Arg?Ser
965?????????????????970?????????????????975Val?Leu?Tyr?Arg?Ile?Met?Asn?Gln?Glu?Pro?Gly?Glu?Ser?Ser?Phe?Phe
980?????????????????985?????????????????990Asp?Trp?Ala?Ser?Asp?Pro?Tyr?Ser?Cys?Asn?Leu?Pro?Gln?Ser?Gln?Asn
995?????????????????1000????????????????1005Ile?Thr?Thr?Met?Ile?Lys?Asn?Ile?Thr?Ala?Arg?Asn?Val?Leu?Gln?Asp
1010????????????????1015????????????????1020Ser?Pro?Asn?Pro?Leu?Leu?Ser?Gly?Leu?Phe?Thr?Asn?Thr?Met?Ile?Glu1025????????????????1030????????????????1035????????????????1040Glu?Asp?Glu?Glu?Leu?Ala?Glu?Phe?Leu?Met?Asp?Arg?Lys?Val?Ile?Leu
1045????????????????1050????????????????1055Pro?Arg?Val?Ala?His?Asp?Ile?Leu?Asp?Asn?Ser?Leu?Thr?Gly?Ile?Arg
1060????????????????1065????????????????1070Asn?Ala?Ile?Ala?Gly?Met?Leu?Asp?Thr?Thr?Lys?Ser?Leu?Ile?Arg?Val
1075????????????????1080????????????????1085Gly?Ile?Asn?Arg?Gly?Gly?Leu?Thr?Tyr?Ser?Leu?Leu?Arg?Lys?Ile?Ser
1090????????????????1095????????????????1100Asn?Tyr?Asp?Leu?Val?Gln?Tyr?Glu?Thr?Leu?Ser?Arg?Thr?Leu?Arg?Leu1105????????????????1110????????????????1115????????????????1120Ile?Val?Ser?Asp?Lys?Ile?Lys?Tyr?Glu?Asp?Met?Cys?Ser?Val?Asp?Leu
1125????????????????1130????????????????1135Ala?Ile?Ala?Leu?Arg?Gln?Lys?Met?Trp?Ile?His?Leu?Ser?Gly?Gly?Arg
1140????????????????1145????????????????1150Met?Ile?Ser?Gly?Leu?Glu?Thr?Pro?Asp?Pro?Leu?Glu?Leu?Leu?Ser?Gly
1155????????????????1160????????????????1165Val?Val?Ile?Thr?Gly?Ser?Glu?His?Cys?Lys?Ile?Cys?Tyr?Ser?Ser?Asp
1170????????????????1175????????????????1180Gly?Thr?Asn?Pro?Tyr?Thr?Trp?Met?Tyr?Leu?Pro?Gly?Asn?Ile?Lys?Ile1185????????????????1190????????????????1195????????????????1200Gly?Ser?Ala?Glu?Thr?Gly?Ile?Ser?Ser?Leu?Arg?Val?Pro?Tyr?Phe?Gly
1205????????????????1210????????????????1215Ser?Val?Thr?Asp?Glu?Arg?Ser?Glu?Ala?Gln?Leu?Gly?Tyr?Ile?Lys?Asn
1220????????????????1225????????????????1230Leu?Ser?Lys?Pro?Ala?Lys?Ala?Ala?Ile?Arg?Ile?Ala?Met?Ile?Tyr?Thr
1235????????????????1240????????????????1245Trp?Ala?Phe?Gly?Asn?Asp?Glu?Ile?Ser?Trp?Met?Glu?Ala?Ser?Gln?Ile
1250????????????????1255????????????????1260Ala?Gln?Thr?Arg?Ala?Asn?Phe?Thr?Leu?Asp?Ser?Leu?Lys?Ile?Leu?Thr1265????????????????1270????????????????1275????????????????1280Pro?Val?Ala?Thr?Ser?Thr?Asn?Leu?Ser?His?Arg?Phe?Lys?Asp?Thr?Ala
1285????????????????1290????????????????1295Thr?Gln?Met?Lys?Phe?Ser?Ser?Thr?Ser?Leu?Ile?Arg?Val?Ser?Arg?Phe
1300????????????????1305????????????????1310Ile?Thr?Met?Ser?Asn?Asp?Asn?Met?Ser?Ile?Lys?Glu?Ala?Asn?Glu?Thr
1315????????????????1320????????????????1325Lys?Asp?Thr?Asn?Leu?Ile?Tyr?Gln?Gln?Ile?Met?Leu?Thr?Gly?Leu?Ser
1330????????????????1335????????????????1340Val?Phe?Glu?Tyr?Leu?Phe?Arg?Leu?Lys?Glu?Thr?Thr?Gly?His?Asn?Pro1345????????????????1350????????????????1355????????????????1360Ile?Val?Met?His?Leu?His?Ile?Glu?Asp?Glu?Cys?Cys?Ile?Lys?Glu?Ser
1365????????????????1370????????????????1375Phe?Asn?Asp?Glu?His?Ile?Asn?Pro?Glu?Ser?Thr?Leu?Glu?Leu?Ile?Arg
1380????????????????1385????????????????1390Tyr?Pro?Glu?Ser?Asn?Glu?Phe?Ile?Tyr?Asp?Lys?Asp?Pro?Leu?Lys?Asp
1395????????????????1400????????????????1405Val?Asp?Leu?Ser?Lys?Leu?Met?Val?Ile?Lys?Asp?His?Ser?Tyr?Thr?Ile
1410????????????????1415????????????????1420Asp?Met?Asn?Tyr?Trp?Asp?Asp?Thr?Asp?Ile?Ile?His?Ala?Ile?Ser?Ile1425????????????????1430????????????????1435????????????????1440Cys?Thr?Ala?Ile?Thr?Ile?Ala?Asp?Thr?Met?Ser?Gln?Leu?Asp?Arg?Asp
1445????????????????1450????????????????1455Asn?Leu?Lys?Glu?Ile?Ile?Val?Ile?Ala?Asn?Asp?Asp?Asp?Ile?Asn?Ser
1460????????????????1465????????????????1470Leu?Ile?Thr?Glu?Phe?Leu?Thr?Leu?Asp?Ile?Leu?Val?Phe?Leu?Lys?Thr
1475????????????????1480????????????????1485Phe?Gly?Gly?Leu?Leu?Val?Asn?Gln?Phe?Ala?Tyr?Thr?Leu?Tyr?Ser?Leu
1490????????????????1495????????????????1500Lys?Ile?Glu?Gly?Arg?Asp?Leu?Ile?Trp?Asp?Tyr?Ile?Met?Arg?Thr?Leu1505????????????????1510????????????????1515????????????????1520Arg?Asp?Thr?Ser?His?Ser?Ile?Leu?Lys?Val?Leu?Ser?Asn?Ala?Leu?Ser
1525????????????????1530????????????????1535His?Pro?Lys?Val?Phe?Lys?Arg?Phe?Trp?Asp?Cys?Gly?Val?Leu?Asn?Pro
1540????????????????1545????????????????1550Ile?Tyr?Gly?Pro?Asn?Ile?Ala?Ser?Gln?Asp?Gln?Ile?Lys?Leu?Ala?Leu
1555????????????????1560????????????????1565Ser?Ile?Cys?Glu?Tyr?Ser?Leu?Asp?Leu?Phe?Met?Arg?Glu?Trp?Leu?Asn
1570????????????????1575????????????????1580Gly?Val?Ser?Leu?Glu?Ile?Tyr?Ile?Cys?Asp?Ser?Asp?Met?Glu?Val?Ala1585????????????????1590????????????????1595????????????????1600Asn?Asp?Arg?Lys?Gln?Ala?Phe?Ile?Ser?Arg?His?Leu?Ser?Phe?Val?Cys
1605????????????????1610????????????????1615Cys?Leu?Ala?Glu?Ile?Ala?Ser?Phe?Gly?Pro?Asn?Leu?Leu?Asn?Leu?Thr
1620????????????????1625????????????????1630Tyr?Leu?Glu?Arg?Leu?Asp?Leu?Leu?Lys?Gln?Tyr?Leu?Glu?Leu?Asn?Ile
1635????????????????1640????????????????1645Lys?Glu?Asp?Pro?Thr?Leu?Lys?Tyr?Val?Gln?Ile?Ser?Gly?Leu?Leu?Ile
1650????????????????1655????????????????1660Lys?Ser?Phe?Pro?Ser?Thr?Val?Thr?Tyr?Val?Arg?Lys?Thr?Ala?Ile?Lys1665????????????????1670????????????????1675????????????????1680Tyr?Leu?Arg?Ile?Arg?Gly?Ile?Ser?Pro?Pro?Glu?Val?Ile?Asp?Asp?Trp
1685????????????????1690????????????????1695Asp?Pro?Val?Glu?Asp?Glu?Asn?Met?Leu?Asp?Asn?Ile?Val?Lys?Thr?Ile
1700????????????????1705????????????????1710Asn?Asp?Asn?Cys?Asn?Lys?Asp?Asn?Lys?Gly?Asn?Lys?Ile?Asn?Asn?Phe
1715????????????????1720????????????????1725Trp?Gly?Leu?Ala?Leu?Lys?Asn?Tyr?Gln?Val?Leu?Lys?Ile?Arg?Ser?Ile
1730????????????????1735????????????????1740Thr?Ser?Asp?Ser?Asp?Asp?Asn?Asp?Arg?Leu?Asp?Ala?Asn?Thr?Ser?Gly1745????????????????1750????????????????1755????????????????1760Leu?Thr?Leu?Pro?Gln?Gly?Gly?Asn?Tyr?Leu?Ser?His?Gln?Leu?Arg?Leu
1765????????????????1770????????????????1775Phe?Gly?Ile?Asn?Ser?Thr?Ser?Cys?Leu?Lys?Ala?Leu?Glu?Leu?Ser?Gln
1780????????????????1785????????????????1790Ile?Leu?Met?Lys?Glu?Val?Asn?Lys?Asp?Lys?Asp?Arg?Leu?Phe?Leu?Gly
1795????????????????1800????????????????1805Glu?Gly?Ala?Gly?Ala?Met?Leu?Ala?Cys?Tyr?Asp?Ala?Thr?Leu?Gly?Pro
1810????????????????1815????????????????1820Ala?Val?Asn?Tyr?Tyr?Asn?Ser?Gly?Leu?Asn?Ile?Thr?Asp?Val?Ile?Gly1825????????????????1830????????????????1835????????????????1840Gln?Arg?Glu?Leu?Lys?Ile?Phe?Pro?Ser?Glu?Val?Ser?Leu?Val?Gly?Lys
1845????????????????1850????????????????1855Lys?Leu?Gly?Asn?Val?Thr?Gln?Ile?Leu?Asn?Arg?Val?Lys?Val?Leu?Phe
1860????????????????1865????????????????1870Asn?Gly?Asn?Pro?Asn?Ser?Thr?Trp?Ile?Gly?Asn?Met?Glu?Cys?Glu?Ser
1875????????????????1880????????????????1885Leu?Ile?Trp?Ser?Glu?Leu?Asn?Asp?Lys?Ser?Ile?Gly?Leu?Val?His?Cys
1890????????????????1895????????????????1900Asp?Met?Glu?Gly?Ala?Ile?Gly?Lys?Ser?Glu?Glu?Thr?Val?Leu?His?Glu1905????????????????1910????????????????1915????????????????1920His?Tyr?Ser?Val?Ile?Arg?Ile?Thr?Tyr?Leu?Ile?Gly?Asp?Asp?Asp?Val
1925????????????????1930????????????????1935Val?Leu?Val?Ser?Lys?Ile?Ile?Pro?Thr?Ile?Thr?Pro?Asn?Trp?Ser?Arg
1940????????????????1945????????????????1950Ile?Leu?Tyr?Leu?Tyr?Lys?Leu?Tyr?Trp?Lys?Asp?Val?Ser?Ile?Ile?Ser
1955????????????????1960????????????????1965Leu?Lys?Thr?Ser?Asn?Pro?Ala?Ser?Thr?Glu?Leu?Tyr?Leu?Ile?Ser?Lys
1970????????????????1975????????????????1980Asp?Ala?Tyr?Cys?Thr?Ile?Met?Glu?Pro?Ser?Glu?Ile?Val?Leu?Ser?Lys1985????????????????1990????????????????1995????????????????2000Leu?Lys?Arg?Leu?Ser?Leu?Leu?Glu?Glu?Asn?Asn?Leu?Leu?Lys?Trp?Ile
2005????????????????2010????????????????2015Ile?Leu?Ser?Lys?Lys?Arg?Asn?Asn?Glu?Trp?Leu?His?His?Glu?Ile?Lys
2020????????????????2025????????????????2030Glu?Gly?Glu?Arg?Asp?Tyr?Gly?Ile?Met?Arg?Pro?Tyr?His?Met?Ala?Leu
2035????????????????2040????????????????2045
Gln?Ile?Phe?Gly?Phe?Gln?Ile?Asn?Leu?Asn?His?Leu?Ala?Lys?Glu?Phe
2050????????????????2055????????????????2060
Leu?Ser?Thr?Pro?Asp?Leu?Thr?Asn?Ile?Asn?Asn?Ile?Ile?Gln?Ser?Phe
2065????????????????2070????????????????2075????????????????2080
Gln?Arg?Thr?Ile?Lys?Asp?Val?Leu?phe?Glu?Trp?Ile?Asn?Ile?Thr?His
2085????????????????2090????????????????2095
Asp?Asp?Lys?Arg?His?Lys?Leu?Gly?Gly?Arg?Tyr?Asn?Ile?Phe?Pro?Leu
2100????????????????2105????????????????2110
Lys?Asn?Lys?Gly?Lys?Leu?Arg?Leu?Leu?Ser?Arg?Arg?Leu?Val?Leu?Ser
2115????????????????2120????????????????2125
Trp?Ile?Ser?Leu?Ser?Leu?Ser?Thr?Arg?Leu?Leu?Thr?Gly?Arg?Phe?Pro
2130????????????????2135????????????????2140
Asp?Glu?Lys?Phe?Glu?His?Arg?Ala?Gln?Thr?Gly?Tyr?Val?Ser?Leu?Ala
2145????????????????2150????????????????2155????????????????2160
Asp?Thr?Asp?Leu?Glu?Ser?Leu?Lys?Leu?Leu?Ser?Lys?Asn?Ile?Ile?Lys
2165????????????????2170????????????????2175
Asn?Tyr?Arg?Glu?Cys?Ile?Gly?Ser?Ile?Ser?Tyr?Trp?Phe?Leu?Thr?Lys
2180????????????????2185????????????????2190
Glu?Val?Lys?Ile?Leu?Met?Lys?Leu?Ile?Gly?Gly?Ala?Lys?Leu?Leu?Gly
2195????????????????2200????????????????2205
Ile?Pro?Arg?Gln?Tyr?Lys?Glu?Pro?Glu?Asp?Gln?Leu?Leu?Glu?Asn?Tyr
2210????????????????2215????????????????2220
Asn?Gln?His?Asp?Glu?Phe?Asp?Ile?Asp
The information of 2,225 2230 (2) SEQ ID NO:23:
(i) sequence signature:
(A) length: 15218 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: ACGCGAAAAA ATGCGTACTA CAAACTTGCA CATTCGAAAA AAATGGGGCA AATAAGAACT 60 TGATAAGTGC TATTTAAGTC TAACCTTTTC AATCAGAAAT GGGGTGCAAT TCACTGAGCA 120 TGATAAAGGT TAGATTACAA AATTTATTTG ACAATGACGA AGTAGCATTG TTAAAAATAA 180 CATGTTATAC TGATAAATTA ATTCTTCTGA CCAATGCATT AGCCAAAGCA GCAATACATA 240 CAATTAAATT AAACGGCATA GTTTTTATAC ATGTTATAAC AAGCAGTGAA GTGTGCCCTG 300 ATAACAATAT TGTAGTGAAA TCTAACTTTA CAACAATGCC AATACTACAA AATGGAGGAT 360 ACATATGGGA ATTGATTGAG TTGACACACT GCTCTCAATT AAACGGTTTA ATGGATGATA 420 ATTGTGAAAT CAAATTTTCT AAAAGACTAA GTGACTCAGT AATGACTAAT TATATGAATC 480 AAATATCTGA CTTACTTGGG CTTGATCTCA ATTCATGAAT TATGTTTAGT CTAATTCAAT 540 AGACATGTGT TTATTACCAT TTTAGTTAAT ATAAAAACTC ATCAAAGGGA AATGGGGCAA 600 ATAAACTCAC CTAATCAATC AAACCATGAG CACTACAAAT GACAACACTA CTATGCAAAG 660 ATTGATGATC ACAGACATGA GACCCCTGTC AATGGATTCA ATAATAACAT CTCTTACCAA 720 AGAAATCATC ACACACAAAT TCATATACTT GATAAACAAT GAATGTATTG TAAGAAAACT 780 TGATGAAAGA CAAGCTACAT TTACATTCTT AGTCAATTAT GAGATGAAGC TACTGCACAA 840 AGTAGGGAGT ACCAAATACA AAAAATACAC TGAATATAAT ACAAAATATG GCACTTTCCC 900 CATGCCTATA TTTATCAATC ACGGCGGGTT TCTAGAATGT ATTGGCATTA AGCCTACAAA 960 ACACACTCCT ATAATATACA AATATGACCT CAACCCGTGA ATTCCAACAA AAAAACCAAC 1020 CCAACCAAAC CAAACTATTC CTCAAACAAC AGTGCTCAAT AGTTAAGAAG GAGCTAATCC 1080 ATTTTAGTAA TTAAAAATAA AAGTAAAGCC AATAACATAA ATTGGGGCAA ATACAAAGAT 1140 GGCTCTTAGC AAAGTCAAGT TGAATGATAC ATTAAATAAG GATCAGCTGC TGTCATCCAG 1200 CAAATACACT ATTCAACGTA GTACAGGAGA TAATATTGAC ACTCCCAATT ATGATGTGCA 1260 AAAACACCTA AACAAACTAT GTGGTATGCT ATTAATCACT GAAGATGCAA ATCATAAATT 1320 CACAGGATTA ATAGGTATGT TATATGCTAT GTCCAGGTTA GGAAGGGAAG ACACTATAAA 1380 GATACTTAAA GATGCTGGAT ATCATGTTAA AGCTAATGGA GTAGATATAA CAACATATCG 1440 TCAAGATATA AATGGAAAGG AAATGAAATT CGAAGTATTA ACATTATCAA GCTTGACATC 1500 AGAAATACAA GTCAATATTG AGATAGAATC TAGAAAGTCC TACAAAAAAA TGCTAAAAGA 1560 GATGGGAGAA GTGGCTCCAG AATATAGGCA TGATTCTCCA GACTGTGGGA TGATAATACT 1620 GTGTATAGCT GCACTTGTGA TAACCAAATT AGCAGCAGGA GACAGATCAG GTCTTACAGC 1680 AGTAATTAGG AGGGCAAACA ATGTCTTAAA AAACGAAATA AAACGATACA AGGGCCTCAT 1740 ACCAAAGGAT ATAGCTAACA GTTTTTATGA AGTGTTTGAA AAACACCCTC ATCTTATAGA 1800 TGTTTTCGTG CACTTTGGCA TTGCACAATC ATCCACAAGA GGGGGTAGTA GAGTTGAAGG 1860 AATCTTTGCA GGATTGTTTA TGAATGCCTA TGGTTCAGGG CAAGTAATGC TAAGATGGGG 1920 AGTTTTAGCC AAATCTGTAA AAAATATCAT GCTAGGACAT GCTAGTGTCC AGGCAGAAAT 1980 GGAGCAAGTT GTGGAAGTCT ATGAGTATGC ACAGAAGTTG GGAGGAGAAG CTGGATTCTA 2040 CCATATATTG AACAATCCAA AAGCATCATT GCTGTCATTA ACTCAATTTC CCAACTTCTC 2100 AAGTGTGGTC CTAGGCAATG CAGCAGGTCT AGGCATAATG GGAGAGTATA GAGGTACACC 2160 AAGAAACCAG GATCTTTATG ATGCAGCTAA AGCATATGCA GAGCAACTCA AAGAAAATGG 2220 AGTAATAAAC TACAGTGTAT TAGACTTAAC AGCAGAAGAA TTGGAAGCCA TAAAGCATCA 2280 ACTCAACCCC AAAGAAGATG ATGTAGAGCT TTAAGTTAAC AAAAAATACG GGGCAAATAA 2340 GTCAACATGG AGAAGTTTGC ACCTGAATTT CATGGAGAAG ATGCAAATAA CAAAGCTACC 2400 AAATTCCTAG AATCAATAAA GGGCAAGTTC GCATCATCCA AAGATCCTAA GAAGAAAGAT 2460 AGCATAATAT CTGTTAACTC AATAGATATA GAAGTAACTA AAGAGAGCCC GATAACATCT 2520 GGCACCAACA TCATCAATCC AACAAGTGAA GCCGACAGTA CCCCAGAAAC AAAAGCCAAC 2580 TACCCAAGAA AACCCCTAGT AAGCTTCAAA GAAGATCTCA CCCCAAGTGA CAACCCTTTT 2640 TCTAAGTTGT ACAAGGAAAC AATAGAAACA TTTGATAACA ATGAAGAAGA ATCTAGCTAC 2700 TCATATGAAG AGATAAATGA TCAAACAAAT GACAACATTA CAGCAAGACT AGATAGAATT 2760 GATGAAAAAT TAAGTGAAAT ATTAGGAATG CTCCATACAT TAGTAGTTGC AAGTGCAGGA 2820 CCCACTTCAG CTCGCGATGG AATAAGAGAT GCTATGGTTG GTCTAAGAGA AGAGATGATA 2880 GAAAAAATAA GAGCGGAAGC ATTAATGACC AATGATAGGT TAGAGGCTAT GGCAAGACTT 2940 AGGAATGAGG AAAGCGAAAA AATGGCAAAA GACACCTCAG ATGAAGTGTC TCTTAATCCA 3000 ACTTCCAAAA AATTGAGTGA CTTGTTGGAA GACAACGATA GTGACAATGA TCTATCACTT 3060 GATGATTTTT GATCAGCGAT CAACTCACTC AGCAATCAAC AACATCAATA AAACAGACAT 3120 CAATCCATTG AATCAACTGC CAGACCGAAC AAACAAACGT CCATCAGTAG AACCACCAAC 3180 CAATCAATCA ACCAATTGAT CAATCAGCAA CCCGACAAAA TTAACAATAT AGTAACAAAA 3240 AAAGAACAAG ATGGGGCAAA TATGGAAACA TACGTGAACA AGCTTCACGA AGGCTCCACA 3300 TACACAGCAG CTGTTCAGTA CAATGTTCTA GAAAAAGATG ATGATCCTGC ATCACTAACA 3360 ATATGGGTGC CTATGTTCCA GTCATCTGTG CCAGCAGACT TGCTCATAAA AGAACTTGCA 3420 AGCATCAATA TACTAGTGAA GCAGATCTCT ACGCCCAAAG GACCTTCACT ACGAGTCACG 3480 ATTAACTCAA GAAGTGCTGT GCTGGCTCAA ATGCCTAGTA ATTTCATCAT AAGCGCAAAT 3540 GTATCATTAG ATGAAAGAAG CAAATTAGCA TATGATGTAA CTACACCTTG TGAAATCAAA 3600 GCATGCAGTC TAACATGCTT AAAAGTAAAA AGTATGTTAA CTACAGTCAA AGATCTTACC 3660 ATGAAGACAT TCAACCCCAC TCATGAGATC ATTGCTCTAT GTGAATTTGA AAATATTATG 3720 ACATCAAAAA GAGTAATAAT ACCAACCTAT CTAAGATCAA TTAGTGTCAA GAACAAGGAT 3780 CTGAACTCAC TAGAAAATAT AGCAACCACC GAATTCAAAA ATGCTATCAC CAATGCAAAA 3840 ATTATTCCTT ATGCAGGATT AGTGTTAGTT ATCACAGTTA CTGACAATAA AGGAGCATTC 3900 AAATATATCA AACCACAGAG TCAATTTATA GTAGATCTTG GTGCCTACCT AGAAAAAGAG 3960 AGCATATATT ATGTGACTAC TAATTGGAAG CATACAGCTA CACGTTTTTC AATCAAACCA 4020 CTAGAGGATT AAACTTAATT ATCAACACTG AATGACAGGT CCACATATAT CCTCAAACTA 4080 CACACTATAT CCAAACATCA TAAACATCTA CACTACACAC TTCATCACAC AAACCAATCC 4140 CACTCAAAAT CCAAAATCAC TACCAGCCAC TATCTGCTAG ACCTAGAGTG CGAATAGGTA 4200 AATAAAACCA AAATATGGGG TAAATAGACA TTAGTTAGAG TTCAATCAAT CTTAACAACC 4260 ATTTATACCG CCAATTCAAC ACATATACTA TAAATCTTAA AATGGGAAAT ACATCCATCA 4320 CAATAGAATT CACAAGCAAA TTTTGGCCCT ATTTTACACT AATACATATG ATCTTAACTC 4380 TAATCTTTTT ACTAATTATA ATCACTATTA TGATTGCAAT ACTAAATAAG CTAAGTGAAC 4440 ATAAAGCATT CTGTAACAAA ACTCTTGAAC TAGGACAGAT GTATCAAATC AACACATAGA 4500 GTTCTACCAT TATGCTGTGT CAAATTATAA TCCTGTATAT ATAAACAAAC AAATCCAATC 4560 TTCTCACAGA GTCATGGTGT CGCAAAACCA CGCTAACTAT CATGGTAGCA TAGAGTAGTT 4620 ATTTAAAAAT TAACATAATG ATGAATTGTT AGTATGAGAT CAAAAACAAC ATTGGGGCAA 4680 ATGCAACCAT GTCCAAACAC AAGAATCAAC GCACTGCCAG GACTCTAGAA AAGACCTGGG 4740 ATACTCTTAA TCATCTAATT GTAATATCCT CTTGTTTATA CAGATTAAAT TTAAAATCTA 4800 TAGCACAAAT AGCACTATCA GTTTTGGCAA TGATAATCTC AACCTCTCTC ATAATTGCAG 4860 CCATAATATT CATCATCTCT GCCAATCACA AAGTTACACT AACAACGGTC ACAGTTCAAA 4920 CAATAAAAAA CCACACTGAA AAAAACATCA CCACCTACCC TACTCAAGTC TCACCAGAAA 4980 GGGTTAGTTC ATCCAAGCAA CCCACAACCA CATCACCAAT CCACACAAGT TCAGCTACAA 5040 CATCACCCAA TACAAAATCA GAAACACACC ATACAACAGC ACAAACCAAA GGCAGAACCA 5100 CCACTTCAAC ACAGACCAAC AAGCCAAGCA CAAAACCACG TCCAAAAAAT CCACCAAAAA 5160 AAGATGATTA CCATTTTGAA GTGTTCAACT TCGTTCCCTG CAGTATATGT GGCAACAATC 5220 AACTTTGCAA ATCCATCTGC AAAACAATAC CAAGCAACAA ACCAAAGAAG AAACCAACCA 5280 TCAAACCCAC AAACAAACCA ACCACCAAAA CCACAAACAA AAGAGACCCA AAAACACCAG 5340 CCAAAACGAC GAAAAAAGAA ACTACCACCA ACCCAACAAA AAAACTAACC CTCAAGACCA 5400 CAGAAAGAGA CACCAGCACC TCACAATCCA CTGCACTCGA CACAACCACA TTAAAACACA 5460 CAGTCCAACA GCAATCCCTC CTCTCAACCA CCCCCGAAAA CACACCCAAC TCCACACAAA 5520 CACCCACAGC ATCCGAGCCC TCCACACCAA ACTCCACCCA AAAAACCCAG CCACATGCTT 5580 AGTTATTCAA AAACTACATC TTAGCAGAGA ACCGTGATCT ATCAAGCAAG AACGAAATTA 5640 AACCTGGGGC AAATAACCAT GGAGTTGATG ATCCACAAGT CAAGTGCAAT CTTCCTAACT 5700 CTTGCTATTA ATGCATTGTA CCTCACCTCA AGTCAGAACA TAACTGAGGA GTTTTACCAA 5760 TCGACATGTA GTGCAGTTAG CAGAGGTTAT TTTAGTGCTT TAAGAACAGG TTGGTATACT 5820 AGTGTCATAA CAATAGAATT AAGTAATATA AAAGAAACCA AATGCAATGG AACTGACACT 5880 AAAGTAAAAC TTATGAAACA AGAATTAGAT AAGTATAAGA ATGCAGTAAC AGAATTACAG 5940 CTACTTATGC AAAACACACC AGCTGTCAAC AACCGGGCCA GAAGAGAAGC ACCACAGTAT 6000 ATGAACTACA CAATCAATAC CACTAAAAAC CTAAATGTAT CAATAAGCAA GAAGAGGAAA 6060 CGAAGATTTC TAGGCTTCTT GTTAGGTGTG GGATCTGCAA TAGCAAGTGG TATAGCTGTA 6120 TCAAAAGTTC TACACCTTGA AGGAGAAGTG AACAAGATCA AAAATGCTTT GTTGTCTACA 6180 AACAAAGCTG TAGTCAGTTT ATCAAATGGG GTCAGTGTTT TAACCAGCAA AGTGTTAGAT 6240 CTCAAGAATT ACATAAATAA CCAATTATTA CCCATAGTAA ATCAACAGAG CTGTCGCATC 6300 TCCAACATTG AAACAGTTAT AGAATTCCAG CAGAAGAACA GCAGATTGTT GGAAATCACC 6360 AGAGAATTTA GTGTCAATGC AGGTGTAACA ACACCTTTAA GCACTTACAT GTTGACAAAC 6420 AGTGAGTTAC TATCATTAAT CAATGATATG CCTATAACAA ATGATCAGAA AAAATTAATG 6480 TCAAGCAATG TTCAGATAGT AAGGCAACAA AGTTATTCCA TCATGTCTAT AATAAAGGAA 6540 GAAGTCCTTG CATATGTTGT ACAGCTGCCT ATCTATGGTG TAATAGATAC ACCTTGCTGG 6600 AAATTGCACA CATCGCCTCT ATGCACTACC AACATCAAAG AAGGATCAAA TATTTGTTTA 6660 ACAAGGACTG ATAGAGGATG GTATTGTGAT AATGCAGGAT CAGTATCCTT CTTTCCACAG 6720 GCTGACACTT GTAAAGTACA GTCCAATCGA GTATTTTGTG ACACTATGAA CAGTTTGACA 6780 TTACCAAGTG AAGTCAGCCT TTGTAACACT GACATATTCA ATTCCAAGTA TGACTGCAAA 6840 ATTATGACAT CAAAAACAGA CATAAGCAGC TCAGTAATTA CTTCTCTTGG AGCTATAGTG 6900 TCATGCTATG GTAAAACTAA ATGCACTGCA TCCAACAAAA ATCGTGGGAT TATAAAGACA 6960 TTTTCTAATG GTTGTGACTA TGTGTCAAAC AAAGGAGTAG ATACTGTGTC AGTGGGCAAC 7020 ACTTTATACT ATGTAAACAA GCTGGAAGGC AAGAACCTTT ATGTAAAAGG GGAACCTATA 7080 ATAAATTACT ATGACCCTCT AGTGTTTCCT TCTGATGAGT TTGATGCATC AATATCTCAA 7140 GTCAATGAAA AAATCAATCA AAGTTTAGCT TTTATTCGTA GATCTGATGA ATTACTACAT 7200 AATGTAAATA CTGGCAAATC TACTACAAAT ATTATGATAA CTACAATTAT TATAGTAATC 7260 ATTGTAGTAT TGTTATCATT AATAGCTATT GGTTTACTGT TGTATTGTAA AGCCAAAAAC 7320 ACACCAGTTA CACTAAGCAA AGACCAACTA AGTGGAATCA ATAATATTGC ATTCAGCAAA 7380 TAGACAAAAA ACCACCTGAT CATGTTTCAA CAACAATCTG CTGACCACCA ATCCCAAATC 7440 AACTTACAAC AAATATTTCA ACATCACAGT ACAGGCTGAA TCATTTCCTC ACATCATGCT 7500 ACCCACATAA CTAAGCTAGA TCCTTAACTT ATAGTTACAT AAAAACCTCA AGTATCACAA 7560 TCAACCACTA AATCAACACA TCATTCACAA AATTAACAGC TGGGGCAAAT ATGTCGCGAA 7620 GAAATCCTTG TAAATTTGAG ATTAGAGGTC ATTGCTTGAA TGGTAGAAGA TGTCACTACA 7680 GTCATAATTA CTTTGAATGG CCTCCTCATG CATTACTAGT GAGGCAAAAC TTCATGTTAA 7740 ACAAGATACT CAAGTCAATG GACAAAAGCA TAGACACTTT GTCTGAAATA AGTGGAGCTG 7800 CTGAACTGGA TAGAACAGAA GAATATGCTC TTGGTATAGT TGGAGTGCTA GAGAGTTACA 7860 TAGGATCTAT AAACAACATA ACAAAACAAT CAGCATGTGT TGCTATGAGT AAACTTCTTA 7920 TTGAGATCAA TAGTGATGAC ATTAAAAAGC TTAGAGATAA TGAAGAACCC AATTCACCTA 7980 AGATAAGAGT GTACAATACT GTTATATCAT ACATTGAGAG CAATAGAAAA AACAACAAGC 8040 AAACCATCCA TCTGCTCAAG AGACTACCAG CAGACGTGCT GAAGAAGACA ATAAAGAACA 8100 CATTAGATAT CCACAAAAGC ATAACCATAA GCAATCCAAA AGAGTCAACT GTGAATGATC 8160 AAAATGACCA AACCAAAAAT AATGATATTA CCGGATAAAT ATCCTTGTAG TATATCATCC 8220 ATATTGATCT CAAGTGAAAG CATGGTTGCT ACATTCAATC ATAAAAACAT ATTACAATTT 8280 AACCATAACT ATTTGGATAA CCACCAGCGT TTATTAAATC ATATATTTGA TGAAATTCAT 8340 TGGACACCTA AAAACTTATT AGATGCCACT CAACAATTTC TCCAACATCT TAACATCCCT 8400 GAAGATATAT ATACAGTATA TATATTAGTG TCATAATGCT TGACCATAAC GACTCTATGT 8460 CATCCAACCA TAAAACTATT TTGATAAGGT TATGGGACAA AATGGATCCC ATTATTAATG 8520 GAAACTCTGC TAATGTGTAT CTAACTGATA GTTATTTAAA AGGTGTTATC TCTTTTTCAG 8580 AGTGTAATGC TTTAGGGAGT TATCTTTTTA ACGGCCCTTA TCTTAAAAAT GATTACACCA 8640 ACTTAATTAG TAGACAAAGC CCACTACTAG AGCATATGAA TCTTAAAAAA CTAACTATAA 8700 CACAGTCATT AATATCTAGA TATCATAAAG GTGAACTGAA ATTAGAAGAA CCAACTTATT 8760 TCCAGTCATT ACTTATGACA TATAAAAGTA TGTCCTCGTC TGAACAAATT GCTACAACTA 8820 ACTTACTTAA AAAAATAATA CGAAGAGCCA TAGAAATAAG TGATGTAAAG GTGTACGCCA 8880 TCTTGAATAA ACTAGGATTA AAGGAAAAGG ACAGAGTTAA GCCCAACAAT AATTCAGGTG 8940 ATGAAAACTC AGTACTTACA ACCATAATTA AAGATGATAT ACTTTCGGCT GTGGAAAACA 9000 ATCAATCATA TACAAATTCA GACAAAAGTC ACTCAGTAAA TCAAAATATC ACTATCAAAA 9060 CAACACTCTT GAAAAAATTG ATGTGTTCAA TGCAACATCC TCCATCATGG TTAATACACT 9120 GGTTCAATTT ATATACAAAA TTAAATAACA TATTAACACA ATATCGATCA AATGAGGTAA 9180 AAAGTCATGG GTTTATATTA ATAGATAATC AAACTTTAAG TGGTTTTCAG TTTATTTTAA 9240 ATCAATATGG TTGTATCGTT TATCATAAAG GACTCAAAAA AATCACAACT ACTACTTACA 9300 ATCAATTTTT GACATGGAAA GACATCAGCC TTAGCAGATT AAATGTTTGC TTAATTACTT 9360 GGATAAGTAA TTGTTTAAAT ACATTAAACA AAAGCTTAGG GCTGAGATGT GGATTCAATA 9420 ATGTTGTGTT ATCACAATTA TTTCTTTATG GAGATTGTAT ACTGAAATTA TTTCATAATG 9480 AAGGCTTCTA CATAATAAAA GAAGTAGAGG GATTTATTAT GTCTTTAATT CTAAACATAA 9540 CAGAAGAAGA TCAATTTAGG AAACGATTTT ATAATAGCAT GCTAAATAAC ATCACAGATG 9600 CAGCTATTAA GGCTCAAAAG GACCTACTAT CAAGAGTATG TCACACTTTA TTAGACAAGA 9660 CAGTGTCTGA TAATATCATA AATGGTAAAT GGATAATCCT ATTAAGTAAA TTTCTTAAAT 9720 TGATTAAGCT TGCAGGTGAT AATAATCTCA ATAACTTGAG TGAGCTATAT TTTCTCTTCA 9780 GAATCTTTGG ACATCCAATG GTCGATGAAA GACAAGCAAT GGATTCTGTA AGAATTAACT 9840 GTAATGAAAC TAAGTTCTAC TTATTAAGTA GTCTAAGTAC ATTAAGAGGT GCTTTCATTT 9900 ATAGAATCAT AAAAGGGTTT GTAAATACCT ACAACAGATG GCCCACCTTA AGGAATGCTA 9960 TTGTCCTACC TCTAAGATGG TTAAACTACT ATAAACTTAA TACTTATCCA TCTCTACTTG 10020 AAATCACAGA AAATGATTTG ATTATTTTAT CAGGATTGCG GTTCTATCGT GAGTTTCATC 10080 TGCCTAAAAA AGTGGATCTT GAAATGATAA TAAATGACAA AGCCATTTCA CCTCCAAAAG 10140 ATCTAATATG GACTAGTTTT CCTAGAAATT ACATGCCATC ACATATACAA AATTATATAG 10200 AACATGAAAA GTTGAAGTTC TCTGAAAGCG ACAGATCGAG AAGAGTACTA GAGTATTACT 10260 TGAGAGATAA TAAATTCAAT GAATGCGATC TATACAATTG TGTAGTCAAT CAAAGCTATC 10320 TCAACAACTC TAATCACGTG GTATCACTAA CTGGTAAAGA AAGAGAGCTC AGTGTAGGTA 10380 GAATGTTTGC TATGCAACCA GGTATGTTTA GGCAAATCCA AATCTTAGCA GAGAAAATGA 10440 TAGCTGAAAA TATTTTACAA TTCTTCCCTG AGAGTTTGAC AAGATATGGT GATCTAGAGC 10500 TTCAAAAGAT ATTAGAATTA AAAGCAGGAA TAAGCAACAA GTCAAATCGT TATAATGATA 10560 ACTACAACAA TTATATCAGT AAATGTTCTA TCATTACAGA TCTTAGCAAA TTCAATCAGG 10620 CATTTAGATA TGAAACATCA TGTATCTGCA GTGATGTATT AGATGAACTG CATGGAGTAC 10680 AATCTCTGTT CTCTTGGTTG CATTTAACAA TACCTCTTGT CACAATAATA TGTACATATA 10740 GACATGCACC TCCTTTCATA AAGGATCATG TTGTTAATCT TAATGAGGTT GATGAACAAA 10800 GTGGATTATA CAGATATCAT ATGGGTGGTA TTGAGGGCTG GTGTCAAAAA CTGTGGACCA 10860 TTGAAGCTAT ATCATTATTA GATCTAATAT CTCTCAAAGG GAAATTCTCT ATCACAGCTC 10920 TGATAAATGG TGATAATCAG TCAATTGATA TAAGCAAACC AGTTAGACTT ATAGAGGGTC 10980 AGACCCATGC ACAAGCAGAT TATTTGTTAG CATTAAATAG CCTTAAATTG TTATATAAAG 11040 AGTATGCAGG TATAGGCCAT AAGCTTAAGG GAACAGAGAC CTATATATCC CGAGATATGC 11100 AGTTCATGAG CAAAACAATC CAGCACAATG GAGTGTACTA TCCAGCCAGT ATCAAAAAAG 11160 TCCTGAGAGT AGGTCCATGG ATAAACACGA TACTTGATGA TTTTAAAGTT AGTTTAGAAT 11220 CTATAGGCAG CTTAACACAG GAGTTAGAAT ACAGAGGAGA AAGCTTATTA TGCAGTTTAA 11280 TATTTAGGAA CATTTGGTTA TACAATCAAA TTGCTTTGCA ACTCCGAAAT CATGCATTAT 11340 GTAACAATAA GCTATATTTA GATATATTGA AAGTATTAAA ACACTTAAAA ACTTTTTTTA 11400 ATCTTGATAG CATTGATATG GCTTTATCAT TGTATATGAA TTTGCCTATG CTGTTTGGTG 11460 GTGGTGATCC TAATTTGTTA TATCGAAGCT TTTATAGGAG AACTCCAGAC TTCCTTACAG 11520 AAGCTATAGT ACATTCAGTG TTTGTGTTGA GCTATTATAC TGGTCACGAT TTACAAGATA 11580 AGCTCCAGGA TCTTCCAGAT GATAGACTGA ACAAATTCTT GACATGTGTC ATCACATTTG 11640 ATAAAAATCC CAATGCCGAG TTTGTAACAT TGATGAGGGA TCCACAGGCT TTAGGGTCTG 11700 AAAGGCAAGC TAAAATTACT AGTGAGATTA ATAGATTAGC AGTAACAGAA GTCTTAAGTA 11760 TAGCCCCAAA CAAAATATTT TCTAAAAGTG CACAACATTA TACTACCACT GAGATTGATC 11820 TAAATGACAT TATGCAAAAT ATAGAACCAA CTTACCCTCA TGGATTAAGA GTTGTTTATG 11880 AAAGTTTACC TTTTTATAAA GCAGAAAAAA TAGTTAATCT TATATCAGGA ACAAAATCCA 11940 TAACTAATAT ACTTGAAAAA ACATCAGCAA TAGATACAAC TGATATTAAT AGGGCTACTG 12000 ATATGATGAG GAAAAATATA ACTTTACTTA TAAGGATACT TCCACTAGAT TGTAACAAAG 12060 ACAAAAGAGA GTTATTAAGT TTAGAAAATC TTAGTATAAC TGAATTAAGC AAGTATGTAA 12120 GAGAAAGATC TTGGTCATTA TCCAATATAG TAGGAGTAAC ATCGCCAAGT ATTATGTTCA 12180 CAATGGACAT TAAATATACA ACTAGCACTA TAGCCAGTGG TATAATAATA GAAAAATATA 12240 ATGTTAATAG TTTAACTCGT GGTGAAAGAG GACCCACCAA GCCATGGGTA GGCTCATCCA 12300 CGCAGGAGAA AAAAACAATG CCAGTGTACA ACAGACAAGT TTTAACCAAA AAGCAAAGAG 12360 ACCAAATAGA TTTATTAGCA AAATTAGACT GGGTATATGC ATCCATAGAC AACAAAGATG 12420 AATTCATGGA AGAACTGAGT ACTGGAACAC TTGGACTGTC ATATGAAAAA GCCAAAAAGT 12480 TGTTTCCACA ATATCTAAGT GTCAATTATT TACACCGTTT AACAGTCAGT AGTAGACCAT 12540 GTGAATTCCC TGCATCAATA CCAGCTTATA GAACAACAAA TTATCATTTT GATACTAGTC 12600 CTATCAATCA TGTATTAACA GAAAAGTATG GAGATGAAGA TATCGACATT GTGTTTCAAA 12660 ATTGCATAAG TTTTGGTCTT AGCCTGATGT CGGTTGTGGA ACAATTCACA AACATATGTC 12720 CTAATAGAAT TATTCTCATA CCGAAGCTGA ATGAGATACA TTTGATGAAA CCTCCTATAT 12780 TTACAGGAGA TGTTGATATC ATCAAGTTGA AGCAAGTGAT ACAAAAGCAG CACATGTTCC 12840 TACCAGATAA AATAAGTTTA ACCCAATATG TAGAATTATT CTTAAGTAAC AAAGCACTTA 12900 AATCTGGATC TCACATCAAC TCTAATTTAA TATTAGTACA TAAAATGTCT GATTATTTTC 12960 ATAATGCTTA TATTTTAAGT ACTAATTTAG CTGGACATTG GATTCTGATT ATTCAACTTA 13020 TGAAAGATTC AAAAGGTATT TTTGAAAAAG ATTGGGGAGA GGGGTACATA ACTGATCATA 13080 TGTTCATTAA TTTGAATGTT TTCTTTAATG CTTATAAGAC TTATTTGCTA TGTTTTCATA 13140 AAGGTTATGG TAAAGCAAAA TTAGAATGTG ATATGAACAC TTCAGATCTT CTTTGTGTTT 13200 TGGAGTTAAT AGACAGTAGC TACTGGAAAT CTATGTCTAA AGTTTTCCTA GAACAAAAAG 13260 TCATAAAATA CATAGTCAAT CAAGACACAA GTTTGCGTAG AATAAAAGGC TGTCACAGTT 13320 TTAAGTTGTG GTTTTTAAAA CGCCTTAATA ATGCTAAATT TACCGTATGC CCTTGGGTTG 13380 TTAACATAGA TTATCACCCA ACACACATGA AAGCTATATT ATCTTACATA GATTTAGTTA 13440 GAATGGGGTT AATAAATGTA GATAAATTAA CCATTAAAAA TAAAAACAAA TTCAATGATG 13500 AATTTTACAC ATCAAATCTC TTTTACATTA GTTATAACTT TTCAGACAAC ACTCATTTGC 13560 TAACAAAACA AATAAGAATT GCTAATTCAG AATTAGAAGA TAATTATAAC AAACTATATC 13620 ACCCAACCCC AGAAACTTTA GAAAATATGT CATTAATTCC TGTTAAAAGT AATAATAGTA 13680 ACAAACCTAA ATTTTGTATA AGTGGAAATA CCGAATCTAT GATGATGTCA ACATTCTCTA 13740 GTAAAATGCA TATTAAATCT TCCACTGTTA CCACAAGATT CAATTATAGC AAACAAGACT 13800 TGTACAATTT ATTTCCAATT GTTGTGATAG ACAAGATTAT AGATCATTCA GGTAATACAG 13860 CAAAATCTAA CCAACTTTAC ACCACCACTT CACATCAGAC ATCTTTAGTA AGGAATAGTG 13920 CATCACTTTA TTGCATGCTT CCTTGGCATC ATGTCAATAG ATTTAACTTT GTATTTAGTT 13980 CCACAGGATG CAAGATCAGT ATAGAGTATA TTTTAAAAGA TCTTAAGATT AAGGACCCCA 14040 GTTGTATAGC ATTCATAGGT GAAGGAGCTG GTAACTTATT ATTACGTACG GTAGTAGAAC 14100 TTCATCCAGA CATAAGATAC ATTTACAGAA GTTTAAAAGA TTGCAATGAT CATAGTTTAC 14160 CTATTGAATT TCTAAGGTTA TACAACGGGC ATATAAACAT AGATTATGGT GAGAATTTAA 14220 CCATTCCTGC TACAGATGCA ACTAATAACA TTCATTGGTC TTATTTACAT ATAAAATTTG 14280 CAGAACCTAT TAGCATCTTT GTCTGCGATG CTGAATTACC TGTTACAGCC AATTGGAGTA 14340 AAATTATAAT TGAATGGAGT AAGCATGTAA GAAAGTGCAA GTACTGTTCT TCTGTAAATA 14400 GATGCATTTT AATTGCAAAA TATCATGCTC AAGATGACAT TGATTTCAAA TTAGATAACA 14460 TTACTATATT AAAAACTTAC GTGTGCCTAG GTAGCAAGTT AAAAGGATCT GAAGTTTACT 14520 TAATCCTTAC AATAGGCCCT GCAAATATAC TTCCTGTTTT TGATGTTGTA CAAAATGCTA 14580 AATTGACACT TTCAAGAACT AAAAATTTCA TTATGCCTAA AAAAACTGAC AAGGAATCTA 14640 TCGATGCAAA TATTAAAAGC TTAATACCTT TCCTTTGTTA CCCTATAACA AAAAAAGGAA 14700 TTAAGACTTC ATTGTCAAAA TTGAAGAGTG TAGTTAATGG AGATATATTA TCATATTCTA 14760 TAGCTGGACG TAATGAAGTA TTCAGCAACA AGCTTATAAA CCACAAGCAT ATGAATATCC 14820 TAAAATGGCT AGATCATGTT TTAAATTTTA GATCAGCTGA ACTTAATTAC AATCATTTAT 14880 ACATGATAGA GTCCACATAT CCTTACTTAA GTGAATTGTT AAATAGTTTA ACAACCAATG 14940 AGCTCAAGAA GCTGATTAAA ATAACAGGTA GTGTGCTATA CAACCTTCCC AACGAACAGT 15000 AGTTTAAAAT ATCATTAACA AGTTTGGTCA AATTTAGATG CTAACACATC ATTATATTAT 15060 AGTTATTAAA AAATATACAA ACTTTTCAAT AATTTAGCAT ATTGATTCCA AAATTATCAT 15120 TTTAGTCTTA AGGGGTTAAA TAAAAGTCTA AAACTAACAA TTATACATGT GCATTCACAA 15180 CACAACGAGA CATTAGTTTT TGACACTTTT TTTCTCGT 15218 (2) SEQ ID NO: 24 information about: ...
(i) sequence signature:
(A) length: 2166 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:24:
Met?Asp?Pro?Ile?Ile?Asn?Gly?Asn?Ser?Ala?Asn?Val?Tyr?Leu?Thr?Asp
1???????????????5???????????????????10??????????????????15
Ser?Tyr?Leu?Lys?Gly?Val?Ile?Ser?Phe?Ser?Glu?Cys?Asn?Ala?Leu?Gly
20??????????????????25??????????????????30
Ser?Tyr?Leu?Phe?Asn?Gly?Pro?Tyr?Leu?Lys?Asn?Asp?Tyr?Thr?Asn?Leu
35??????????????????40??????????????????45
Ile?Ser?Arg?Gln?Ser?Pro?Leu?Leu?Glu?His?Met?Asn?Leu?Lys?Lys?Leu
50??????????????????55??????????????????60
Thr?Ile?Thr?Gln?Ser?Leu?Ile?Ser?Arg?Tyr?His?Lys?Gly?Glu?Leu?Lys
65??????????????????70??????????????????75??????????????????80
Leu?Glu?Glu?Pro?Thr?Tyr?Phe?Gln?Ser?Leu?Leu?Met?Thr?Tyr?Lys?Ser
85??????????????????90??????????????????95
Met?Ser?Ser?Ser?Glu?Gln?Ile?Ala?Thr?Thr?Asn?Leu?Leu?Lys?Lys?Ile
100?????????????????105?????????????????110
Ile?Arg?Arg?Ala?Ile?Glu?Ile?Ser?Asp?Val?Lys?Val?Tyr?Ala?Ile?Leu
115?????????????????120?????????????????125Asn?Lys?Leu?Gly?Leu?Lys?Glu?Lys?Asp?Arg?Val?Lys?Pro?Asn?Asn?Asn
130?????????????????135?????????????????140Ser?Gly?Asp?Glu?Asn?Ser?Val?Leu?Thr?Thr?Ile?Ile?Lys?Asp?Asp?Ile145?????????????????150?????????????????155?????????????????160Leu?Ser?Ala?Val?Glu?Asn?Asn?Gln?Ser?Tyr?Thr?Asn?Ser?Asp?Lys?Ser
165?????????????????170?????????????????175His?Ser?Val?Asn?Gln?Asn?Ile?Thr?Ile?Lys?Thr?Thr?Leu?Leu?Lys?Lys
180?????????????????185?????????????????190Leu?Met?Cys?Ser?Met?Gln?His?Pro?Pro?Ser?Trp?Leu?Ile?His?Trp?Phe
195?????????????????200?????????????????205Asn?Leu?Tyr?Thr?Lys?Leu?Asn?Asn?Ile?Leu?Thr?Gln?Tyr?Arg?Ser?Asn
210?????????????????215?????????????????220Glu?Val?Lys?Ser?His?Gly?Phe?Ile?Leu?Ile?Asp?Asn?Gln?Thr?Leu?Ser225?????????????????230?????????????????235?????????????????240Gly?Phe?Gln?Phe?Ile?Leu?Asn?Gln?Tyr?Gly?Cys?Ile?Val?Tyr?His?Lys
245?????????????????250?????????????????255Gly?Leu?Lys?Lys?Ile?Thr?Thr?Thr?Thr?Tyr?Asn?Gln?Phe?Leu?Thr?Trp
260?????????????????265?????????????????270Lys?Asp?Ile?Ser?Leu?Ser?Arg?Leu?Asn?Val?Cys?Leu?Ile?Thr?Trp?Ile
275?????????????????280?????????????????285Ser?Asn?Cys?Leu?Asn?Thr?Leu?Asn?Lys?Ser?Leu?Gly?Leu?Arg?Cys?Gly
290?????????????????295?????????????????300Phe?Asn?Asn?Val?Val?Leu?Ser?Gln?Leu?Phe?Leu?Tyr?Gly?Asp?Cys?Ile305?????????????????310?????????????????315?????????????????320Leu?Lys?Leu?Phe?His?Asn?Glu?Gly?Phe?Tyr?Ile?Ile?Lys?Glu?Val?Glu
325?????????????????330?????????????????335Gly?Phe?Ile?Met?Ser?Leu?Ile?Leu?Asn?Ile?Thr?Glu?Glu?Asp?Gln?Phe
340?????????????????345?????????????????350Arg?Lys?Arg?Phe?Tyr?Asn?Ser?Met?Leu?Asn?Asn?Ile?Thr?Asp?Ala?Ala
355?????????????????360?????????????????365Ile?Lys?Ala?Gln?Lys?Asp?Leu?Leu?Ser?Arg?Val?Cys?His?Thr?Leu?Leu
370?????????????????375?????????????????380Asp?Lys?Thr?Val?Ser?Asp?Asn?Ile?Ile?Asn?Gly?Lys?Trp?Ile?Ile?Leu385?????????????????390?????????????????395?????????????????400Leu?Ser?Lys?Phe?Leu?Lys?Leu?Ile?Lys?Leu?Ala?Gly?Asp?Asn?Asn?Leu
405?????????????????410?????????????????415Asn?Asn?Leu?Ser?Glu?Leu?Tyr?Phe?Leu?Phe?Arg?Ile?Phe?Gly?His?Pro
420?????????????????425?????????????????430Met?Val?Asp?Glu?Arg?Gln?Ala?Met?Asp?Ser?Val?Arg?Ile?Asn?Cys?Asn
435?????????????????440?????????????????445Glu?Thr?Lys?Phe?Tyr?Leu?Leu?Ser?Ser?Leu?Ser?Thr?Leu?Arg?Gly?Ala
450?????????????????455?????????????????460Phe?Ile?Tyr?Arg?Ile?Ile?Lys?Gly?Phe?Val?Asn?Thr?Tyr?Asn?Arg?Trp465?????????????????470?????????????????475?????????????????480Pro?Thr?Leu?Arg?Asn?Ala?Ile?Val?Leu?Pro?Leu?Arg?Trp?Leu?Asn?Tyr
485?????????????????490?????????????????495Tyr?Lys?Leu?Asn?Thr?Tyr?Pro?Ser?Leu?Leu?Glu?Ile?Thr?Glu?Asn?Asp
500?????????????????505?????????????????510Leu?Ile?Ile?Leu?Ser?Gly?Leu?Arg?Phe?Tyr?Arg?Glu?Phe?His?Leu?Pro
515?????????????????520?????????????????525Lys?Lys?Val?Asp?Leu?Glu?Met?Ile?Ile?Asn?Asp?Lys?Ala?Ile?Ser?Pro
530?????????????????535?????????????????540Pro?Lys?Asp?Leu?Ile?Trp?Thr?Ser?Phe?Pro?Arg?Asn?Tyr?Met?Pro?Ser545?????????????????550?????????????????555?????????????????560His?Ile?Gln?Asn?Tyr?Ile?Glu?His?Glu?Lys?Leu?Lys?Phe?Ser?Glu?Ser
565?????????????????570?????????????????575Asp?Arg?Ser?Arg?Arg?Val?Leu?Glu?Tyr?Tyr?Leu?Arg?Asp?Asn?Lys?Phe
580?????????????????585?????????????????590Asn?Glu?Cys?Asp?Leu?Tyr?Asn?Cys?Val?Val?Asn?Gln?Ser?Tyr?Leu?Asn
595?????????????????600?????????????????605Asn?Ser?Asn?His?Val?Val?Ser?Leu?Thr?Gly?Lys?Glu?Arg?Glu?Leu?Ser
610?????????????????615?????????????????620Val?Gly?Arg?Met?Phe?Ala?Met?Gln?Pro?Gly?Met?Phe?Arg?Gln?Ile?Gln625?????????????????630?????????????????635?????????????????640Ile?Leu?Ala?Glu?Lys?Met?Ile?Ala?Glu?Asn?Ile?Leu?Gln?Phe?Phe?Pro
645?????????????????650?????????????????655Glu?Ser?Leu?Thr?Arg?Tyr?Gly?Asp?Leu?Glu?Leu?Gln?Lys?Ile?Leu?Glu
660?????????????????665?????????????????670Leu?Lys?Ala?Gly?Ile?Ser?Asn?Lys?Ser?Asn?Arg?Tyr?Asn?Asp?Asn?Tyr
675?????????????????680?????????????????685Asn?Asn?Tyr?Ile?Ser?Lys?Cys?Ser?Ile?Ile?Thr?Asp?Leu?Ser?Lys?Phe
690?????????????????695?????????????????700Asn?Gln?Ala?Phe?Arg?Tyr?Glu?Thr?Ser?Cys?Ile?Cys?Ser?Asp?Val?Leu705?????????????????710?????????????????715?????????????????720Asp?Glu?Leu?His?Gly?Val?Gln?Ser?Leu?Phe?Ser?Trp?Leu?His?Leu?Thr
725?????????????????730?????????????????735Ile?Pro?Leu?Val?Thr?Ile?Ile?Cys?Thr?Tyr?Arg?His?Ala?Pro?Pro?Phe
740?????????????????745?????????????????750Ile?Lys?Asp?His?Val?Val?Asn?Leu?Asn?Glu?Val?Asp?Glu?Gln?Ser?Gly
755?????????????????760?????????????????765Leu?Tyr?Arg?Tyr?His?Met?Gly?Gly?Ile?Glu?Gly?Trp?Cys?Gln?Lys?Leu
770?????????????????775?????????????????780Trp?Thr?Ile?Glu?Ala?Ile?Ser?Leu?Leu?Asp?Leu?Ile?Ser?Leu?Lys?Gly785?????????????????790?????????????????795?????????????????800Lys?Phe?Ser?Ile?Thr?Ala?Leu?Ile?Asn?Gly?Asp?Asn?Gln?Ser?Ile?Asp
805?????????????????810?????????????????815Ile?Ser?Lys?Pro?Val?Arg?Leu?Ile?Glu?Gly?Gln?Thr?His?Ala?Gln?Ala
820?????????????????825?????????????????830Asp?Tyr?Leu?Leu?Ala?Leu?Asn?Ser?Leu?Lys?Leu?Leu?Tyr?Lys?Glu?Tyr
835?????????????????840?????????????????845Ala?Gly?Ile?Gly?His?Lys?Leu?Lys?Gly?Thr?Glu?Thr?Tyr?Ile?Ser?Arg
850?????????????????855?????????????????860Asp?Met?Gln?Phe?Met?Ser?Lys?Thr?Ile?Gln?His?Asn?Gly?Val?Tyr?Tyr865?????????????????870?????????????????875?????????????????880Pro?Ala?Ser?Ile?Lys?Lys?Val?Leu?Arg?Val?Gly?Pro?Trp?Ile?Asn?Thr
885?????????????????890?????????????????895Ile?Leu?Asp?Asp?Phe?Lys?Val?Ser?Leu?Glu?Ser?Ile?Gly?Ser?Leu?Thr
900?????????????????905?????????????????910Gln?Glu?Leu?Glu?Tyr?Arg?Gly?Glu?Ser?Leu?Leu?Cys?Ser?Leu?Ile?Phe
915?????????????????920?????????????????925Arg?Asn?Ile?Trp?Leu?Tyr?Asn?Gln?Ile?Ala?Leu?Gln?Leu?Arg?Asn?His
930?????????????????935?????????????????940Ala?Leu?Cys?Asn?Asn?Lys?Leu?Tyr?Leu?Asp?Ile?Leu?Lys?Val?Leu?Lys945?????????????????950?????????????????955?????????????????960His?Leu?Lys?Thr?Phe?Phe?Asn?Leu?Asp?Ser?Ile?Asp?Met?Ala?Leu?Ser
965?????????????????970?????????????????975Leu?Tyr?Met?Asn?Leu?Pro?Met?Leu?Phe?Gly?Gly?Gly?Asp?Pro?Asn?Leu
980?????????????????985?????????????????990Leu?Tyr?Arg?Ser?Phe?Tyr?Arg?Arg?Thr?Pro?Asp?Phe?Leu?Thr?Glu?Ala
995?????????????????1000????????????????1005Ile?Val?His?Ser?Val?Phe?Val?Leu?Ser?Tyr?Tyr?Thr?Gly?His?Asp?Leu
1010????????????????1015????????????????1020Gln?Asp?Lys?Leu?Gln?Asp?Leu?Pro?Asp?Asp?Arg?Leu?Asn?Lys?Phe?Leu1025????????????????1030????????????????1035????????????????1040Thr?Cys?Val?Ile?Thr?Phe?Asp?Lys?Asn?Pro?Asn?Ala?Glu?Phe?Val?Thr
1045????????????????1050????????????????1055Leu?Met?Arg?Asp?Pro?Gln?Ala?Leu?Gly?Ser?Glu?Arg?Gln?Ala?Lys?Ile
1060????????????????1065????????????????1070Thr?Ser?Glu?Ile?Asn?Arg?Leu?Ala?Val?Thr?Glu?Val?Leu?Ser?Ile?Ala
1075????????????????1080????????????????1085Pro?Asn?Lys?Ile?Phe?Ser?Lys?Ser?Ala?Gln?His?Tyr?Thr?Thr?Thr?Glu
1090????????????????1095????????????????1100Ile?Asp?Leu?Asn?Asp?Ile?Met?Gln?Asn?Ile?Glu?Pro?Thr?Tyr?Pro?His1105????????????????1110????????????????1115????????????????1120Gly?Leu?Arg?Val?Val?Tyr?Glu?Ser?Leu?Pro?Phe?Tyr?Lys?Ala?Glu?Lys
1125????????????????1130????????????????1135Ile?Val?Asn?Leu?Ile?Ser?Gly?Thr?Lys?Ser?Ile?Thr?Asn?Ile?Leu?Glu
1140????????????????1145????????????????1150Lys?Thr?Ser?Ala?Ile?Asp?Thr?Thr?Asp?Ile?Asn?Arg?Ala?Thr?Asp?Met
1155????????????????1160????????????????1165Met?Arg?Lys?Asn?Ile?Thr?Leu?Leu?Ile?Arg?Ile?Leu?Pro?Leu?Asp?Cys
1170????????????????1175????????????????1180Asn?Lys?Asp?Lys?Arg?Glu?Leu?Leu?Ser?Leu?Glu?Asn?Leu?Ser?Ile?Thr1185????????????????1190????????????????1195????????????????1200Glu?Leu?Ser?Lys?Tyr?Val?Arg?Glu?Arg?Ser?Trp?Ser?Leu?Ser?Asn?Ile
1205????????????????1210????????????????1215Val?Gly?Val?Thr?Ser?Pro?Ser?Ile?Met?Phe?Thr?Met?Asp?Ile?Lys?Tyr
1220????????????????1225????????????????1230Thr?Thr?Ser?Thr?Ile?Ala?Ser?Gly?Ile?Ile?Ile?Glu?Lys?Tyr?Asn?Val
1235????????????????1240????????????????1245Asn?Ser?Leu?Thr?Arg?Gly?Glu?Arg?Gly?Pro?Thr?Lys?Pro?Trp?Val?Gly
1250????????????????1255????????????????1260Ser?Ser?Thr?Gln?Glu?Lys?Lys?Thr?Met?Pro?Val?Tyr?Asn?Arg?Gln?Val1265????????????????1270????????????????1275????????????????1280Leu?Thr?Lys?Lys?Gln?Arg?Asp?Gln?Ile?Asp?Leu?Leu?Ala?Lys?Leu?Asp
1285????????????????1290????????????????1295Trp?Val?Tyr?Ala?Ser?Ile?Asp?Asn?Lys?Asp?Glu?Phe?Met?Glu?Glu?Leu
1300????????????????1305????????????????1310Ser?Thr?Gly?Thr?Leu?Gly?Leu?Ser?Tyr?Glu?Lys?Ala?Lys?Lys?Leu?Phe
1315????????????????1320????????????????1325Pro?Gln?Tyr?Leu?Ser?Val?Asn?Tyr?Leu?His?Arg?Leu?Thr?Val?Ser?Ser
1330????????????????1335????????????????1340Arg?Pro?Cys?Glu?Phe?Pro?Ala?Ser?Ile?Pro?Ala?Tyr?Arg?Thr?Thr?Asn1345????????????????1350????????????????1355????????????????1360Tyr?His?Phe?Asp?Thr?Ser?Pro?Ile?Asn?His?Val?Leu?Thr?Glu?Lys?Tyr
1365????????????????1370????????????????1375Gly?Asp?Glu?Asp?Ile?Asp?Ile?Val?Phe?Gln?Asn?Cys?Ile?Ser?Phe?Gly
1380????????????????1385????????????????1390Leu?Ser?Leu?Met?Ser?Val?Val?Glu?Gln?Phe?Thr?Asn?Ile?Cys?Pro?Asn
1395????????????????1400????????????????1405Arg?Ile?Ile?Leu?Ile?Pro?Lys?Leu?Asn?Glu?Ile?His?Leu?Met?Lys?Pro
1410????????????????1415????????????????1420Pro?Ile?Phe?Thr?Gly?Asp?Val?Asp?Ile?Ile?Lys?Leu?Lys?Gln?Val?Ile1425????????????????1430????????????????1435????????????????1440Gln?Lys?Gln?His?Met?Phe?Leu?Pro?Asp?Lys?Ile?Ser?Leu?Thr?Gln?Tyr
1445????????????????1450????????????????1455Val?Glu?Leu?Phe?Leu?Ser?Asn?Lys?Ala?Leu?Lys?Ser?Gly?Ser?His?Ile
1460????????????????1465????????????????1470Asn?Ser?Asn?Leu?Ile?Leu?Val?His?Lys?Met?Ser?Asp?Tyr?Phe?His?Asn
1475????????????????1480????????????????1485Ala?Tyr?Ile?Leu?Ser?Thr?Asn?Leu?Ala?Gly?His?Trp?Ile?Leu?Ile?Ile
1490????????????????1495????????????????1500Gln?Leu?Met?Lys?Asp?Ser?Lys?Gly?Ile?Phe?Glu?Lys?Asp?Trp?Gly?Glu1505????????????????1510????????????????1515????????????????1520Gly?Tyr?Ile?Thr?Asp?His?Met?Phe?Ile?Asn?Leu?Asn?Val?Phe?Phe?Asn
1525????????????????1530????????????????1535Ala?Tyr?Lys?Thr?Tyr?Leu?Leu?Cys?Phe?His?Lys?Gly?Tyr?Gly?Lys?Ala
1540????????????????1545????????????????1550Lys?Leu?Glu?Cys?Asp?Met?Asn?Thr?Ser?Asp?Leu?Leu?Cys?Val?Leu?Glu
1555????????????????1560????????????????1565Leu?Ile?Asp?Ser?Ser?Tyr?Trp?Lys?Ser?Met?Ser?Lys?Val?Phe?Leu?Glu
1570????????????????1575????????????????1580Gln?Lys?Val?Ile?Lys?Tyr?Ile?Val?Asn?Gln?Asp?Thr?Ser?Leu?Arg?Arg1585????????????????1590????????????????1595????????????????1600Ile?Lys?Gly?Cys?His?Ser?Phe?Lys?Leu?Trp?Phe?Leu?Lys?Arg?Leu?Asn
1605????????????????1610????????????????1615Asn?Ala?Lys?Phe?Thr?Val?Cys?Pro?Trp?Val?Val?Asn?Ile?Asp?Tyr?His
1620????????????????1625????????????????1630Pro?Thr?His?Met?Lys?Ala?Ile?Leu?Ser?Tyr?Ile?Asp?Leu?Val?Arg?Met
1635????????????????1640????????????????1645Gly?Leu?Ile?Asn?Val?Asp?Lys?Leu?Thr?Ile?Lys?Asn?Lys?Asn?Lys?Phe
1650????????????????1655????????????????1660Asn?Asp?Glu?Phe?Tyr?Thr?Ser?Asn?Leu?Phe?Tyr?Ile?Ser?Tyr?Asn?Phe1665????????????????1670????????????????1675????????????????1680Ser?Asp?Asn?Thr?His?Leu?Leu?Thr?Lys?Gln?Ile?Arg?Ile?Ala?Asn?Ser
1685????????????????1690????????????????1695Glu?Leu?Glu?Asp?Asn?Tyr?Asn?Lys?Leu?Tyr?His?Pro?Thr?Pro?Glu?Thr
1700????????????????1705????????????????1710Leu?Glu?Asn?Met?Ser?Leu?Ile?Pro?Val?Lys?Ser?Asn?Asn?Ser?Asn?Lys
1715????????????????1720????????????????1725Pro?Lys?Phe?Cys?Ile?Ser?Gly?Asn?Thr?Glu?Ser?Met?Met?Met?Ser?Thr
1730????????????????1735????????????????1740Phe?Ser?Ser?Lys?Met?His?Ile?Lys?Ser?Ser?Thr?Val?Thr?Thr?Arg?Phe1745????????????????1750????????????????1755????????????????1760Asn?Tyr?Ser?Lys?Gln?Asp?Leu?Tyr?Asn?Leu?Phe?Pro?Ile?Val?Val?Ile
1765????????????????1770????????????????1775Asp?Lys?Ile?Ile?Asp?His?Ser?Gly?Asn?Thr?Ala?Lys?Ser?Asn?Gln?Leu
1780????????????????1785????????????????1790Tyr?Thr?Thr?Thr?Ser?His?Gln?Thr?Ser?Leu?Val?Arg?Asn?Ser?Ala?Ser
1795????????????????1800????????????????1805Leu?Tyr?Cys?Met?Leu?Pro?Trp?His?His?Val?Asn?Arg?Phe?Asn?Phe?Val
1810????????????????1815????????????????1820Phe?Ser?Ser?Thr?Gly?Cys?Lys?Ile?Ser?Ile?Glu?Tyr?Ile?Leu?Lys?Asp1825????????????????1830????????????????1835????????????????1840Leu?Lys?Ile?Lys?Asp?Pro?Ser?Cys?Ile?Ala?Phe?Ile?Gly?Glu?Gly?Ala
1845????????????????1850????????????????1855Gly?Asn?Leu?Leu?Leu?Arg?Thr?Val?Val?Glu?Leu?His?Pro?Asp?Ile?Arg
1860????????????????1865????????????????1870Tyr?Ile?Tyr?Arg?Ser?Leu?Lys?Asp?Cys?Asn?Asp?His?Ser?Leu?Pro?Ile
1875????????????????1880????????????????1885Glu?Phe?Leu?Arg?Leu?Tyr?Asn?Gly?His?Ile?Asn?Ile?Asp?Tyr?Gly?Glu
1890????????????????1895????????????????1900Asn?Leu?Thr?Ile?Pro?Ala?Thr?Asp?Ala?Thr?Asn?Asn?Ile?His?Trp?Ser1905????????????????1910????????????????1915????????????????1920Tyr?Leu?His?Ile?Lys?Phe?Ala?Glu?Pro?Ile?Ser?Ile?Phe?Val?Cys?Asp
1925????????????????1930????????????????1935Ala?Glu?Leu?Pro?Val?Thr?Ala?Asn?Trp?Ser?Lys?Ile?Ile?Ile?Glu?Trp
1940????????????????1945????????????????1950Ser?Lys?His?Val?Arg?Lys?Cys?Lys?Tyr?Cys?Ser?Ser?Val?Asn?Arg?Cys
1955????????????????1960????????????????1965Ile?Leu?Ile?Ala?Lys?Tyr?His?Ala?Gln?Asp?Asp?Ile?Asp?Phe?Lys?Leu
1970????????????????1975????????????????1980Asp?Asn?Ile?Thr?Ile?Leu?Lys?Thr?Tyr?Val?Cys?Leu?Gly?Ser?Lys?Leu1985????????????????1990????????????????1995????????????????2000Lys?Gly?Ser?Glu?Val?Tyr?Leu?Ile?Leu?Thr?Ile?Gly?Pro?Ala?Asn?Ile
2005????????????????2010????????????????2015Leu?Pro?Val?Phe?Asp?Val?Val?Gln?Asn?Ala?Lys?Leu?Thr?Leu?Ser?Arg
2020????????????????2025????????????????2030Thr?Lys?Asn?Phe?Ile?Met?Pro?Lys?Lys?Thr?Asp?Lys?Glu?Ser?Ile?Asp
2035????????????????2040????????????????2045Ala?Asn?Ile?Lys?Ser?Leu?Ile?Pro?Phe?Leu?Cys?Tyr?Pro?Ile?Thr?Lys
2050????????????????2055????????????????2060Lys?Gly?Ile?Lys?Thr?Ser?Leu?Ser?Lys?Leu?Lys?Ser?Val?Val?Asn?Gly2065????????????????2070????????????????2075????????????????2080Asp?Ile?Leu?Ser?Tyr?Ser?Ile?Ala?Gly?Arg?Asn?Glu?Val?Phe?Ser?Asn
2085????????????????2090????????????????2095Lys?Leu?Ile?Asn?His?Lys?His?Met?Asn?Ile?Leu?Lys?Trp?Leu?Asp?His
2100????????????????2105????????????????2110Val?Leu?Asn?Phe?Arg?Ser?Ala?Glu?Leu?Asn?Tyr?Asn?His?Leu?Tyr?Met
2115????????????????2120????????????????2125Ile?Glu?Ser?Thr?Tyr?Pro?Tyr?Leu?Ser?Glu?Leu?Leu?Asn?Ser?Leu?Thr
2130????????????????2135????????????????2140Thr?Asn?Glu?Leu?Lys?Lys?Leu?Ile?Lys?Ile?Thr?Gly?Ser?Val?Leu?Tyr2145????????????????2150????????????????2155????????????????2160Asn?Leu?Pro?Asn?Glu?Gln
The information of 2165 (2) SEQ ID NO:25:
(i) sequence signature:
(A) length: 15229 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: ACGCGAAAAA ATGCGTACTA CAAACTTGCA CATTCGGAAA AAATGGGGCA AATAAGAATT 60 TGATAAGTGC TATTTAAATC TAACCTTTTC AATCAGAAAT GGGGTGCAAT TCACTGAGCA 120 TGATAAAGGT TAGATTACAA AATTTATTTG ACAATGACGA AGTAGCATTG TTAAAAATAA 180 CATGTTATAC TGACAAATTA ATTCTTCTGA CCAATGCATT AGCCAAAGCA GTAATACATA 240 CAATTAAATT AAACGGCATA GTTTTTATAC ATGTTATAAC AAGCAGTGAA GTGTGCCCTG 300 ACAACAATAT TGTAGTGAAA TCTAACTTTA CAACAATGCC AATATTACAA AACGGAGGAT 360 ACATATGGGA ATTGATTGAG TTGACACACT GCTCTCAATC AAATGGTCTA ATGGATGATA 420 ATTGTGAAAT CAAATTTTCT AAAAGACTAA GTGACTCAGT AATGACTAAT TATATGAATC 480 AAATATCTGA TTTACTTGGG CTTGATCTCA ATTCATGAAT TATGTTTAGT CTAATTTAAT 540 AGACATGTGT TTATCACCAT TTTAGTTAAT ATAAAACCTC ATCAAAGGGA AATGGGGCAA 600 ATAAACTCAC CTAATCAGTC AAACCATGAG CACTACAAAT GACAACACTA CTATGCAAAG 660 ATTGATGATC ACAGACATGA GACCCCTGTC GATGGAATCA ATAATAACAT CTCTCACCAA 720 AGAAATCATA ACACACAAAT TCATATACTT GATAAACAAT GAATGTATTG TAAGAAAACT 780 TGATGAAAGA CAAGCTACAT TTACATTCTT AGTCAATTAT GAGATGAAGC TATTGCACAA 840 AGTAGGGAGT ACCAAATACA AGAAATACAC TGAATATAAT ACAAAATATG GCACTTTCCC 900 CATGCCTATA TTTATCAATC ATGACGGGTT TCTAGAATGT ATTGGCATTA AGCCTACAAA 960 ACACACTCCT ATAATATACA AATATGACCT CAACCCGTAA ATTCCAACAA AAAACTAACC 1020 CATCCAAACT AAGCTATTCC TCAAACAACA GTGCTCAACA GTTAAGAAGG AGCTAATCCA 1080 TTTTAGTAAT TAAAAATAAA GGCAGAGCCA ATAACATAAA TTGGGGCAAA TACAAAGATG 1140 GCTCTTAGCA AAGTCAAGTT AAATGATACA TTAAATAAGG ATCAGCTGCT GTCATCCAGC 1200 AAATACACTA TTCAACGTAG TACAGGAGAT AATATTGACA CTCCCAATTA TGATGTGCAA 1260 AAACACCTAA ACAAACTATG TGGTATGCTA TTAATCACTG AAGATGCAAA TCATAAATTC 1320 ACAGGATTAA TAGGTATGTT ATATGCTATG TCCAGGTTAG GAAGGGAAGA CACTATAAAG 1380 ATACTTAAAG ATGCTGGATA TCATGTTAAA GCTAATGGAG TAGATATAAC AACATATCGT 1440 CAAGATATAA ACGGAAAGGA AATGAAATTC GAAGTATTAA CATTATCAAG CTTGACATCA 1500 GAAATACAAG TCAATATTGA GATAGAATCT AGAAAGTCCT ACAAAAAAAT GCTAAAAGAG 1560 ATGGGAGAAG TGGCTCCAGA ATATAGGCAT GATTCTCCAG ACTGTGGGAT GATAATACTG 1620 TGTATAGCTG CACTTGTAAT AACCAAGTTA GCAGCAGGAG ATAGATCAGG TCTTACAGCA 1680 GTAATTAGGA GGGCAAACAA TGTCTTAAAA AACGAAATAA AACGCTACAA GGGCCTCATA 1740 CCAAAGGATA TAGCTAACAG TTTTTATGAA GTGTTTGAAA AACACCCTCA TCTTATAGAT 1800 GTTTTTGTGC ACTTTGGCAT TGCACAATCA TCCACAAGAG GGGGTAGTAG AGTTGAAGGA 1860 ATCTTTGCAG GATTATTTAT GAATGCCTAT GGTTCAGGGC AAGTAATGCT AAGATGGGGA 1920 GTTCTAGCCA AATCTGTAAA AAATATCATG CTAGGACATG CTAGTGTCCA GGCAGAAATG 1980 GAACAAGTTG TGGAAGTTTA TGAGTATGCA CAGAAGTTGG GAGGAGAAGC TGGATTCTAC 2040 CATATATTGA ACAATCCAAA AGCATCATTG CTGTCATTAA CTCAATTTCC TAACTTCTCA 2100 AGTGTGGTCC TAGGCAATGC AGCAGGTCTA GGCATAATGG GAGAGTATAG AGGTACACCA 2160 AGAAACCAAG ATCTATATGA TGCAGCCAAA GCATATGCAG AGCAACTCAA AGAAAATGGA 2220 GTAATAAACT ACAGTGTATT AGACTTAACA GCAGAAGAAT TGGAAGCCAT AAAGCATCAA 2280 CTCAACCCCA AAGAAGATGA TGTAGAGCTT TAAGTTAACA AAAAATACGG GGCAAATAAG 2340 TCAACATGGA GAAGTTTGCA CCTGAATTTC ATGGAGAAGA TGCAAACAAC AAAGCTACCA 2400 AATTCCTAGA ATCAATAAAG GGCAAGTTTG CATCATCCAA AGATCCTAAG AAGAAAGATA 2460 GCATAATATC TGTTAACTCA ATAGATATAG AAGTAACTAA AGAGAGCCCG ATAACATCTG 2520 GCACCAACAT CATCAATCCA ATAAGTGAAG CTGATAGTAC CCCAGAAGCT AAAGCCAACT 2580 ACCCAAGAAA ACCCCTAGTA AGCTTCAAAG AAGATCTCAC CCCAAGTGAC AACCCCTTTT 2640 CTAAGTTGTA CAAAGAAACA ATAGAAACAT TTGATAACAA TGAAGAAGAA TCTAGCTACT 2700 CATATGAAGA AATAAATGAT CAAACAAATG ACAACATTAC AGCAAGACTA GATAGAATTG 2760 ATGAAAAATT AAGTGAAATA TTAGGAATGC TCCATACATT AGTAGTTGCA AGTGCAGGAC 2820 CCACCTCAGC TCGCGATGGA ATAAGAGATG CTATGGTTGG TCTAAGAGAA GAAATGATAG 2880 AAAAAATAAG AGCGGAAGCA TTAATGACCA ATGATAGGTT AGAGGCTATG GCAAGACTTA 2940 GGAATGAGGA AAGCGAAAAA ATGGCAAAAG ACACCTCAGA TGAAGTGTCT CTTAATCCAA 3000 CTTCCAAAAA ATTGAGTAAT TTGTTGGAAG ACAACGATAG TGACAATGAT CTATCACTTG 3060 ATGATTTTTG ATCAGTGATC AACTCACTCA GCAATCAACA ACATCAATGA AACAGACATC 3120 AATCCATTGA ATCAACTGCC AGACTGAACA CACAAACGTC CATCAGCAGA ACTACCAACC 3180 AATCAATCAA CCAATTGATC AATCAGCGAC CTAACAAAAT TAACAATATA GTAACAAAAA 3240 AAGAACAAGA TGGGGCAAAT ATGGAAACAT ACGTGAACAA GCTTCACGAG GGCTCCACAT 3300 ACACAGCAGC TGTTCAGTAC AATGTTCTAG AAAAAGATGA TGATCCTGCA TCACTAACAA 3360 TATGGGTGCC TATGTTCCAG TCATCTGTGC CAGCAGACTT GCTCATAAAA GAACTTGCAA 3420 GCATCAACAT ACTAGTGAAG CAGATCTCCA CGCCCAAAGG ACCTTCACTA CGAGTCACGA 3480 TTAACTCAAG AAGTGCTGTG CTGGCACAAA TGCCTAGTAG TTTTATCATA AGTGCAAATG 3540 TATCATTAGA TGAAAGAAGC AAATTAGCAT ATGATGTAAC TACACCTTGT GAAATCAAAG 3600 CATGCAGTCT AACATGCTTA AAAGTAAAAA GTATGTTAAC TACAGTCAAA GATCTTACCA 3660 TGAAAACATT CAATCCCACT CATGAGATTA TTGCTCTATG TGAATTTGAA AATATTATGA 3720 CATCAAAAAG AGTAATAATA CCAACCTATC TAAGATCAAT TAGTGTCAAA AACAAGGACC 3780 TGAACTCACT AGAAAATATA GCAACCACCG AATTCAAAAA TGCTATCACC AATGCGAAAA 3840 TTATTCCCTA TGCAGGATTA GTATTAGTTA TCACAGTTAC TGACAATAAA GGAGCATTCA 3900 AATATATCAA GCCACAGAGT CAATTTATAG TAGATCTTGG GGCCTACCTA GAAAAAGAGA 3960 GCATATATTA TGTGACTACA AATTGGAAGC ATACAGCTAC ACGTTTTTCA ATCAAACCAC 4020 TAGAGGATTA AACTTAATTA TCAACACTAA ATGACAGGTC CACATATATC TTCAAACTAT 4080 ACATTATATC CAAACATCAT GAGCATTTAC ACTACACACT TTTACCATAT AAATCAATCT 4140 CATTTAAAAT CCAAAATTAC TTCCAGCTAT CATCTGTTAG ACCTAGAGTG CGAATAGGTA 4200 AATAAAACCA AAATATGGGG TAAATAGACA TTAGTTAGAG TTCAATCAAT CTCAACAACC 4260 ATTTATACCG CCAATTCAGT ACATATACTA TAAATCTCAA AATGGGAAAT ACATCCATCA 4320 CAATAGAATT CACAAGCAAA TTTTGGCCTT ATTTTACACT AATACATATG ATCTTAACTC 4380 TAATCTCTTT ACTAATTATA ATCACTATTA TGATTGCAAT ACTAAATAAG CTAAGTGAAC 4440 ATAAAACATT CTGCAACAAA ACTCTTGAAC TAGGACAGAT GTATCAAATC AACACATAGT 4500 GTTCTACCAT TATGCTGTGT CAAATTATAA TCTTGTATAT ATAAACAAAC AAATCCAATC 4560 TTCTCACAGA GTCATGGTGG CGCAAAACCA CGCCAACCAT CATGATAGCA TAGAGTAGTT 4620 ATTTAAAAAT TAACATAATG ATGAATTATT GGTATGAGAT CAGGAACAAC ATTGGGGCAA 4680 ATGCAGCCAT GTCCAAGCAC AAGAATCGGC GCACTGCCGG GACTCTAGAA AGGACCTGGG 4740 ATACTCTTAA TCATCTAATT GTAATATCCT CTTGTTTATA CAGATTAAAT TTAAAATCTA 4800 TAGCACAAAT AGCACTGTCA GTTTTGGCAA TGATAATCTC AACCTCTCTC ATAATTGCAG 4860 CCATAATATT CATCATCTCT GCCAATCACA AAGTTACACT AACAACGGTT ACAGTTCAAA 4920 CAATAAAAAA CCACACTGAA AAAAACATCT CCACCTACCT TACTCAAGTC CCACCAGAAA 4980 GGGTCAACTC ATCCAAACAA CCCACAACCA CATCACCAAT CCACACAAAT TCAGCCACAA 5040 TATCACCAAA TACAAAATCA GAAACACACC ATACAACAGC ACAAACCAAA GGCAGAATCA 5100 CCACTTCAAC ACAGACCAAC AAGCCAAGCA CAAAATCACG TTCAAAAAAT CCACCAAAAA 5160 AACCAAAAGA TGATTACCAT TTTGAAGTGT TCAATTTTGT TCCCTGTAGT ATATGTGGTA 5220 ATAATCAACT CTGCAAATCC ATCTGCAAAA CAATACCAAG CAACAAACCA AAGAAAAAAC 5280 CAACCATCAA ACCCACAAAC AAACCAACCA CCAAAACCAC AAACAAAAGA GACCCCAAAA 5340 CACCAGCCAA AATGCCAAAA AAAGAAATCA TCACCAACCC AGCAAAAAAA CCAACCCTCA 5400 AGACCACAGA AAGAGACACC AGCATTTCAC AATCCACCGT GCTCGACACA ATCACTCCAA 5460 AATACACAAT CCAACAGCAA TCCCTCCACT CAACCACCTC CGAAAACACA CCCAGCTCCA 5520 CACAAATACC CACAGCATCC GAGCCCTCCA CATTAAATCC TAATTAAAAA ACCTAGTCAC 5580 ATGCTTAGTT ATTCAAAAAC TACATCTTAG CAGAGAACCG TGATCTATCA AGCAAGAACA 5640 AAATTAAACC TGGGGCAAAT AACCATGGAG TTGCTGATCC ACAGGTCAAG TGCAATCTTC 5700 CTAACTCTTG CTGTTAATGC ATTGTACCTC ACCTCAAGTC AGAACATAAC TGAGGAGTTT 5760 TACCAATCGA CATGTAGTGC AGTTAGCAGA GGTTATTTTA GTGCTTTAAG AACAGGTTGG 5820 TATACCAGTG TCATAACAAT AGAATTAAGT AATATAAAAG AAACCAAATG CAATGGAACT 5880 GACACTAAAG TAAAACTTAT AAAACAAGAA TTAGATAAGT ATAAGAATGC AGTAACAGAA 5940 TTACAGCTAC TTATGCAAAA CACGCCAGCT GCCAACAACC GGGCCAGAAG AGAAGCACCA 6000 CAGTACATGA ACTACACAAT CAATACCACA AAAAACCTAA ATGTATCAAT AAGCAAGAAA 6060 AGGAAACGAA GATTTCTGGG CTTCTTGTTA GGTGTAGGAT CTGCAATAGC AAGTGGTATA 6120 GCTGTATCCA AAGTTTTACA CCTTGAAGGA GAAGTGAACA AAATCAAAAA TGCTTTGTTG 6180 TCTACAAACA AAGCTGTAGT CAGTCTATCA AATGGGGTCA GTGTTTTAAC CAGCAAAGTG 6240 TTAGATCTCA AGAATTACAT AAATAACCGA ATATTACCCA TAGTAAATCA ACAGAGCTGT 6300 CGCATCTCCA ACATTGAAAC AGTTATAGAA TTCCAGCAGA AGAATAGCAG ATTGTTGGAA 6360 ATCACCAGAG AATTTAGTGT TAATGCAGGT GTAACAACAC CTTTAAGCAC TTACATGTTA 6420 ACAAACAGTG AGTTACTATC ATTGATCAAT GATATGCCTA TAACAAATGA CCAGAAAAAA 6480 TTAATGTCAA GCAATGTTCA GATAGTAAGG CAACAAAGTT ATTCTATCAT GTCTATAATA 6540 AAGGAAGAAG TCCTTGCATA TGTTGTACAG CTACCTATCT ATGGTGTAAT AGATACACCT 6600 TGCTGGAAAT TACACACATC ACCTCTATGC ACCACCAACA TCAAAGAAGG ATCAAATATT 6660 TGTTTAACAA GGACTGATAG AGGATGGTAT TGTGATAATG CAGGATCAGT ATCCTTCTTC 6720 CCACAGGCTG ATACTTGCAA AGTACAGTCC AATCGAGTAT TTTGTGACAC TATGAACAGT 6780 TTAACATTAC CAAGTGAAGT CAGCCTTTGT AACACTGACA TATTCAATTC CAAGTATGAC 6840 TGCAAAATTA TGACATCAAA AACAGACATA AGCAGCTCAG TAATTACTTC TCTTGGAGCT 6900 ATAGTGTCAT GCTATGGAAA AACTAAATGC ACTGCATCCA ATAAAAATCG TGGGATTATA 6960 AAGACATTTT CTAATGGTTG TGACTATGTG TCAAACAAAG GAGTAGATAC TGTGTCAGTG 7020 GGCAACACTT TATACTATGT AAACAAGCTG GAAGGCAAAA ACCTTTATGT AAAAGGGGAA 7080 CCTATAATAA ATTACTATGA TCCTCTAGTG TTTCCTTCTG ATGAGTTTGA TGCATCAATA 7140 TCTCAAGTCA ATGAAAAAAT CAATCAAAGT TTAGCTTTTA TTCGTAGATC TGATGAATTA 7200 CTACATAATG TAAATACTGG CAAATCTACT ACAAATATTA TGATAACTAC AATTATTATA 7260 GTAATCATTG TAGTATTGTT ATCATTAATA GCTATTGGTT TACTGTTGTA TTGCAAAGCC 7320 AAAAACACAC CAGTTACACT AAGCAAAGAC CAACTAAGTG GAATCAATAA TATTGCATTC 7380 AGCAAATAGA CAAAAAACTA CTTAATCATG TTTCAACAAC AATCTGCTGA CCACCAATCC 7440 CAAATCAACT TAACAACAAA TATTTCAACA TCATAGCACA GGCTGAATCA TTTCCTCATA 7500 TCATGCTACC TACACAACTA AGCTAGATCT TCAACTCATA GTTACATAAA AACCCCAAGT 7560 ATCACAATCA AACACTAAAT CGACACATCA TTCACAAAAT TAACAACTGG GGCAAATATG 7620 TCGCGAAGAA ATCCTTGTAA ATTTGAGATT AGAGGTCATT GCTTGAATGG TAGAAGATGT 7680 CACTACAGTC ATAATTATTT TGAATGGCCT CCTCATGCAT TACTAGTGAG GCAAAACTTC 7740 ATGTTAAACA AGATACTTAA GTCAATGGAC AAAAGCATAG ACACTTTGTC GGAAATAAGT 7800 GGAGCTGCTG AACTGGATAG AACAGAAGAA TATGCTCTTG GTATAGTTGG AGTGCTAGAG 7860 AGTTACATAG GATCAATAAA CAACATAACA AAACAATCAG CATGTGTTGC TATGAGTAAA 7920 CTTCTTATTG AGATCAACAG TGATGACATT AAAAAACTGA GAGATAACGA AGAACCCAAT 7980 TCGCCTAAGA TAAGAGTGTA CAATACTGTT ATATCATACA TTGAGAGCAA TAGAAAAAAC 8040 AACAAGCAAA CCATCCATCT GCTCAAAAGA CTACCAGCAG ACGTGCTGAA GAAGACAATA 8100 AAGAACACAT TAGATATCCA CAAAAGCATA ACCATAAGCA ACTCAAAAGA GTCAACCGTG 8160 AATGATCAAA ATGACCAAAC CAAAAATAAT GATATTACCG GATAAATATC CTTGTAGTAT 8220 ATCATCCATA TTGATTTCAA GTGAAAGCAT GATTGCTACA TTCAATCATA AAAACATATT 8280 ACAATTTAAC CATAACCATT TGGATAACCA CCAGTGTTTA TTAAATCATA TATTTGATGA 8340 AATTCATTGG ACACCTAAAA ACTTATTAGA TGCCACTCAA CAATTTCTCC AACATCTTAA 8400 CATCCCTGAA GATATATATA CAGTATATAT ATTAGTGTCA TAATGCTTGA CCATAACAAT 8460 TTTATATCAT TCAACCATAA AACAACCTTA ATAAGGTTAT GGGACAAAAT GGATCCCATT 8520 ATTAATGGAA ACTCTGCCAA TGTGTATCTA ACTGATAGTT ATCTAAAAGG TGTTATCTCT 8580 TTTTCAGAAT GTAATGCTTT AGGGAGTTAC CTTTTTAACG GCCCCTATCT TAAAAATGAT 8640 TACACCAACT TAATTAGTAG ACAAAGCCCA CTACTAGAGC ATATGAATCT AAAAAAACTA 8700 ACTATAACAC AGTCATTAAT ATCTAGATAT CATAAAGGTG AACTGAAGTT AGAAGAACCA 8760 ACTTATTTCC AGTCATTACT TATGACATAT AAAAGTATGT CCTCGTCTGA ACAAATTGCT 8820 ACAACTAATT TACTTAAAAA AATAATACGA AGAGCTATAG AAATAAGTGA TGTAAAGGTG 8880 TACGCCATCT TGAATAAACT GGGACTAAAG GAAAAGGACA GAGTTAAGCC CAACAATAAT 8940 TCAGGTGATG AAAACTCAGT TCTTACAACC ATAATCAAAG ATGATATACT TTCAGCTGTG 9000 GAAAACAATC AATCATATAC AAATTCAGAC AAAAATCATT CAGTAAATCA AAATATCACT 9060 ATCAAAACAA CACTCTTGAA AAAATTGATG TGTTCAATGC AACATCCTCC ATCATGGTTA 9120 ATACACTGGT TCAATTTATA TACAAAATTA AATAACATAT TAACACAATA TCGATCAAAT 9180 GAGGTAAAAA GTCATGGGTT TATATTAATA GATAATCAAA CTTTAAGTGA TTTTCAGTTT 9240 ATTTTAAATC AATATGGTTG TATCGTTTAT CATAAAGGAC TCAAAAAAAT CACAACTACT 9300 ACTTACAATC AATTTTTGAC ATGGAAAGAC ATCAGCCTTA GCAGATTAAA TGTTTGCTTA 9360 ATTACTTGGA TAAGTAATTG TTTAAATACA TTAAATAAAA GCTTAGGGCT GAGATGTGGA 9420 TTCAATAATG TTGTGTTATC ACAACTATTT CTTTATGGAG ATTGTATACT GAAATTATTC 9480 CATAATGAAG GCTTCTACAT AATAAAAGAA GTAGAGGGAT TTATTATGTC TTTAATTCTA 9540 AACATAACAG AAGAAGATCA ATTTAGGAAA CGATTTTATA ATAGCATGCT AAATAACATC 9600 ACAGATGCAG CTATTAAGGC TCAAAAAAAC CTACTATCAA GAGTATGTCA CACTTTATTA 9660 GACAAGACAG TGTCTGATAA TATCATAAAT GGTAAATGGA TAATCCTATT AAGTAAATTT 9720 CTTAAATTGA TTAAGCTTGC AGGTGATAAT AATCTCAATA ACTTGAGTGA GCTTTATTTT 9780 CTCTTCAGAA TCTTTGGACA TCCAATGGTC GATGAAAGAC AAGCAATGGA TGCTGTAAGA 9840 ATTAACTGTA ATGAAACCAA GTTCTACTTA TTAAGTAATC TAAGTACGTT AAGAGGTGCT 9900 TTCATTTATA GAATCATAAA GGGGTTTGTA AATACCTACA ACAGATGGCC CACTTTAAGG 9960 AATGCTATTG TTCTACCTCT AAGATGGTTG AACTATTATA AACTTAATAC TTATCCATCT 10020 CTACTTGAAA TCACAGAGAA AGATTTGATT ATTTTATCAG GATTGCGGTT CTATCGTGAG 10080 TTTCATCTGC CTAAAAAAGT GGATCTTGAA ATGATAATAA ATGACAAAGC CATTTCACCT 10140 CCAAAAGATT TAATATGGAC TAGTTTTCCT AGAAATTACA TGCCATCACA TATACAAAAT 10200 TATATAGAAC ATGAAAAGTT GAAGTTCTCT GAAAGTGACA GATCAAGAAG AGTACTAGAG 10260 TATTACTTGA GAGATAATAA ATTCAATGAA TGCGATCTAT ACAATTGTGT GGTCAATCAA 10320 AGCTATCTCA ACAACTCTAA CCATGTGGTA TCACTAACTG GTAAAGAAAG AGAGCTCAGT 10380 GTAGGTAGAA TGTTTGCTAT GCAACCAGGT ATGTTTAGGC AAATTCAAAT CTTAGCAGAG 10440 AAAATGATAG CCGAAAATAT TTTACAATTC TTCCCTGAGA GTTTGACAAG ATATGGTGAT 10500 CTAGAGCTTC AAAAGATATT AGAATTAAAA GCAGGAATAA GCAACAAGTC AAATCGTTAT 10560 AATGATAACT ACAACAATTA TATCAGTAAA TGTTCTATCA TTACAGACCT TAGCAAATTC 10620 AATCAAGCAT TTAGATATGA AACATCATGT ATCTGCAGTG ATGTATTAGA TGAACTGCAT 10680 GGAGTACAAT CTCTGTTCTC TTGGTTGCAT TTAACAATAC CTCTTGTCAC AATAATATGT 10740 ACATATAGAC ATGCACCTCC TTTTATAAAG GATCATGTTG TTAATCTTAA TAAAGTTGAT 10800 GAACAAAGTG GATTATACAG ATATCATATG GGTGGTATTG AAGGCTGGTG TCAAAAACTG 10860 TGGACCATTG AAGCTATATC ATTATTAGAT CTAATATCTC TCAAAGGGAA ATTCTCTATC 10920 ACAGCTCTAA TAAATGGTGA TAATCAGTCA ATTGATATAA GTAAACCAGT TAGACTTATA 10980 GAGGGTCAGA CCCATGCTCA AGCAGATTAT TTGTTAGCAT TAAATAGCCT TAAATTGCTA 11040 TATAAAGAGT ATGCGGGCAT AGGCCACAAG CTCAAGGGAA CAGAGACCTA TATATCCCGA 11100 GATATGCAAT TCATGAGCAA AACAATCCAG CACAATGGAG TGTACTATCC AGCCAGTATC 11160 AAAAAAGTCC TGAGAGTAGG TCCATGGATA AATACAATAC TTGATGATTT TAAAGTTAGT 11220 TTAGAATCTA TAGGTAGCTT AACACAGGAG TTAGAATATA GAGGAGAGAG CTTATTATGC 11280 AGTTTAATAT TTAGGAACAT TTGGTTATAC AATCAAATTG CTTTGCAACT CCGAAATCAT 11340 GCATTATGTC ACAATAAGCT ATATTTAGAT ATATTGAAAG TATTAAAACA CTTAAAAACT 11400 TTTTTTAATC TTGATAGTAT TGATATGGCT TTAACATTGT ATATGAATTT GCCTATGCTG 11460 TTTGGTGGTG GTGATCCTAA TTTGTTATAT CGAAGCTTTT ATAGGAGAAC TCCAGACTTC 11520 CTTACAGAAG CTATAGTACA TTCAGTGTTT GTGTTGAGCT ATTATACTGG TCACGATTTA 11580 CAAGATAAGC TCCAGGATCT TCCAGATGAT AGACTGAACA AATTCTTGAC ATGTATCATC 11640 ACGTTTGATA AAAATCCCAA TGCCGAGTTT GTAACATTGA TGAGAGATCC ACAGGCTTTA 11700 GGGTCTGAAA GGCAAGCAAA AATTACTAGT GAGATTAATA GATTAGCAGT GACAGAAGTC 11760 TTAAGTATAG CTCCAAACAA AATATTTTCT AAAAGTGCAC AACATTATAC TACCACTGAG 11820 ATTGATCTAA ATGATATTAT GCAAAATATA GAACCAACTT ACCCTCATGG ATTAAGAGTT 11880 GTTTATGAAA GTTTACCTTT TTATAAAGCA GAAAAAATAG TTAATCTTAT ATCAGGAACA 11940 AAATCCATAA CTAATATACT TGAAAAAACA TCAGCAATAG ATTCAACTGA TATTAATAGG 12000 GCTACTGATA TGATGAGGAA AAATATAACT TTACTTATAA GGATACTTCC ACTAGATTGT 12060 AACAAAGACA AAAGAGAGTT ATTAAGTTTA GAAAATCTTA GTATAACTGA ATTAAGCAAG 12120 TATGTAAGAG AAAGATCTTG GTCGTTATCC AATATAGTAG GAGTAACATC GCCAAGTATT 12180 ATGTTCACAA TGGACATTAA ATATACAACT AGCACTATAG CCAGTGGTAT AATTATAGAA 12240 AAATATAATG TTAATAGTTT AACTCGTGGT GAAAGAGGAC CTACTAAGCC ATGGGTAGGT 12300 TCATCTACGC AGGAGAAAAA AACAATGCCA GTGTACAATA GACAAGTTTT AACCAAAAAG 12360 CAAAGAGACC AAATAGATTT ATTAGCAAAA TTAGACTGGG TATATGCATC CATAGACAAC 12420 AAAGATGAAT TCATGGAAGA ACTGAGTACT GGAACACTTG GACTGTCATA TGAGAAAGCC 12480 AAAAAATTGT TTCCACAATA TCTAAGTGTC AATTATTTAC ACCGCTTAAC AGTCAGTAGT 12540 AGACCATGTG AATTCCCTGC ATCAATACCA GCTTATAGAA CAACAAATTA TCATTTCGAT 12600 ACTAGTCCTA TCAACCATGT ATTAACAGAA AAGTATGGAG ATGAAGATAT CGACATTGTG 12660 TTTCAAAATT GCATAAGTTT TGGTCTTAGC TTAATGTCGG TTGTGGAACA ATTCACAAAC 12720 ATATGTCCTA ATAGAATTAT TCTCATACCG AAGCTGAATG AGATACATTT GATGAAACCT 12780 CCTATATTTA CAGGAGATGT TGATATCATC AAGTTGAAGC AAGTGATACA AAAACAGCAC 12840 ATGTTCCTAC CAGATAAAAT AAGTTTAACC CAATATGTAG AATTATTCCT AAGTAACAAA 12900 GCACTTAAAT CTGGATCTCA CATCAACTCT AATTTAATAT TAGTACATAA AATGTCTGAT 12960 TATTTTCATA ATGCTTATAT TTTAAGTACT AATTTAGCTG GACATTGGAT TCTGATTATT 13020 CAACTTATGA AGGATTCAAA AGGTATTTTT GAAAAAGATT GGGGAGAGGG GTATATAACT 13080 GATCATATGT TCATTAATTT GAATGTTTTC TTTAATGCTT ATAAGACTTA TTTGCTATGT 13140 TTTCATAAAG GTTATGGTAA AGCAAAATTA GAATGTGATA TGAACACTTC AGATCTTCTT 13200 TGTGTTTTGG AGCTAATAGA CAGTAGCTAC TGGAAATCTA TGTCTAAAGT TTTCCTAGAA 13260 CAAAAAGTCA TAAAATACAT AATCAATCAA GACACAAGTT TGCATAGAAT AAAAGGTTGT 13320 CATAGTTTTA AGTTATGGTT TTTAAAACGC CTTAATAATG CTAAATTTAC CGTATGCCCT 13380 TGGGTTGTTA ACATAGATTA TCACCCAACA CACATGAAAG CTATATTATC TTACATAGAT 13440 TTAGTTAGAA TGGGGTTAAT AAATGTAGAT AAATTAACCA TTAAAAATAA AAATAAATTC 13500 AATGATGAAT TTTACACATC AAATCTCTTT TACATTAGTT ATAACTTTTC AGATAACACT 13560 CATTTGCTAA CAAAACAAAT AAGAATTGCT AATTCAGAAT TAGAAAATAA TTATAACAAA 13620 CTATATCACC CAACCCCAGA AACTTTAGAA AATATGTCAT TAATTCCTGT CAAAAGTAAT 13680 AATAGTAATA AACCTAAATT TGGTATAAGT GGAAATACCG AATCTATGAT GACGTCAACA 13740 TTCTCCAATA AAACGCATAT TAAATCTTCC GCTGTTATTA CAAGATTCAA TTATAGTAAA 13800 CAAGACTTGT ACAATTTATT TCCAATTGTC GTGATAGACA GGATTATAGA TCATTCAGGT 13860 AATACAGCAA AATCTAACCA ACTCTACACT ACCACTTCAC ATCAGACATC TTTAGTAAGG 13920 AATAGTGCAT CACTTTATTG CATGCTTCCT TGGCATCATG TCAATAGATT TAACTTTGTA 13980 TTTAGTTCCA CAGGATGCAA GATCAGTATA GAGTATATTT TAAAAGATCT TAAGATTAAA 14040 GACCCCAGTT GTATAGCATT CATAGGTGAA GGAGCTGGTA ACTTATTATT ACGTACAGTA 14100 GTAGAACTTC ATCCAGACAT AAGATACATT TACAGAAGTT TAAAAGATTG CAATGATCAT 14160 AGTTTACCTA TTGAATTTCT AAGGTTATAC AACGGGCATA TAAACATAGA TTATGGTGAG 14220 AATTTAACCA TTCCTGCTAC AGATGCAACT AATAACATTC ATTGGTCTTA TTTACATATA 14280 AAATTTGCAG AACCTATTAG CATTTTTGTC TGCGATGCTG AATTACCTGT TACAGCCAAT 14340 TGGAGTAAAA TTATAATTGA ATGGAGTAAG CATGTAAGAA AGTGCAAGTA CTGTTCCTCT 14400 GTAAATAGAT GCATTTTAAT TGCAAAATAT CATGCCCAAG ATGATATTGA TTTCAAATTA 14460 GATAACATTA CTATATTAAA AACTTACGTG TGCCTAGGTA GCAAGTTAAA AGGATCTGAA 14520 GTTTACTTAG TCCTTACAAT AGGCCCTGCA AATATACTTC CTGTTTTTAA TGTTGTGCAA 14580 AATGCTAAAT TGATTCTTTC AAGGACTAAA AATTTCATTA TGCCTAAAAA AACTGACAAA 14640 GAATCTATCG ATGCAAATAT TAAAAGCTTA ATACCTTTCC TTTGTTACCC TATAACAAAA 14700 AAAGGAATTA AGACTTCATT GTCAAAATTG AAGAGTGTAG TTAGTGGAGA TATATTATCA 14760 TATTCTATAG CTGGACGTAA TGAAGTATTC AGCAACAAGC TTATAAACCA CAAGCATATG 14820 AATATCCTAA AATGGCTAGA TCATGTTTTA AACTTTAGAT CAGCTGAACT TAATTACAAT 14880 CATTTATATA TGATAGAGTC CACATATCCT TACTTAAGTG AATTGTTAAA CAGTTTAACA 14940 ACCAATGAGC TCAAGAAGCT GATTAAAATA ACAGGTAGTG TACTATACAA CCTTCCCAAC 15000 GAACAGTAAC TTAAAACATC ATTAACAAGT TTGATCAAAT TTAGATGCTA ACACATCATA 15060 ATATTATAGT TATTAAAAAA TATATATGCA AACTTTTCAA TAATTTAGCA TATTGATTCC 15120 AAAGTTATCA TTTTGGTCTT AAGGGGTTGA ATAAAAATCT AAAACTAACA ATTATACATG 15180 TGCATTTACA ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15229 (2) SEQ ID NO: 26 information about: ...
(i) sequence signature:
(A) length: 2166 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:26:
Met?Asp?Pro?Ile?Ile?Asn?Gly?Asn?Ser?Ala?Asn?Val?Tyr?Leu?Thr?Asp
1???????????????5???????????????????10??????????????????15
Ser?Tyr?Leu?Lys?Gly?Val?Ile?Ser?Phe?Ser?Glu?Cys?Asn?Ala?Leu?Gly
20??????????????????25??????????????????30
Ser?Tyr?Leu?Phe?Asn?Gly?Pro?Tyr?Leu?Lys?Asn?Asp?Tyr?Thr?Asn?Leu
35??????????????????40??????????????????45Ile?Ser?Arg?Gln?Ser?Pro?Leu?Leu?Glu?His?Met?Asn?Leu?Lys?Lys?Leu
50??????????????????55??????????????????60Thr?Ile?Thr?Gln?Ser?Leu?Ile?Ser?Arg?Tyr?His?Lys?Gly?Glu?Leu?Lys65??????????????????70??????????????????75??????????????????80Leu?Glu?Glu?Pro?Thr?Tyr?Phe?Gln?Ser?Leu?Leu?Met?Thr?Tyr?Lys?Ser
85??????????????????90??????????????????95Met?Ser?Ser?Ser?Glu?Gln?Ile?Ala?Thr?Thr?Asn?Leu?Leu?Lys?Lys?Ile
100?????????????????105?????????????????110Ile?Arg?Arg?Ala?Ile?Glu?Ile?Ser?Asp?Val?Lys?Val?Tyr?Ala?Ile?Leu
115?????????????????120?????????????????125Asn?Lys?Leu?Gly?Leu?Lys?Glu?Lys?Asp?Arg?Val?Lys?Pro?Asn?Asn?Asn
130?????????????????135?????????????????140Ser?Gly?Asp?Glu?Asn?Ser?Val?Leu?Thr?Thr?Ile?Ile?Lys?Asp?Asp?Ile145?????????????????150?????????????????155?????????????????160Leu?Ser?Ala?Val?Glu?Asn?Asn?Gln?Ser?Tyr?Thr?Asn?Ser?Asp?Lys?Asn
165?????????????????170?????????????????175His?Ser?Val?Asn?Gln?Asn?Ile?Thr?Ile?Lys?Thr?Thr?Leu?Leu?Lys?Lys
180?????????????????185?????????????????190Leu?Met?Cys?Ser?Met?Gln?His?Pro?Pro?Ser?Trp?Leu?Ile?His?Trp?Phe
195?????????????????200?????????????????205Asn?Leu?Tyr?Thr?Lys?Leu?Asn?Asn?Ile?Leu?Thr?Gln?Tyr?Arg?Ser?Asn
210?????????????????215?????????????????220Glu?Val?Lys?Ser?His?Gly?Phe?Ile?Leu?Ile?Asp?Asn?Gln?Thr?Leu?Ser225?????????????????230?????????????????235?????????????????240Asp?Phe?Gln?Phe?Ile?Leu?Asn?Gln?Tyr?Gly?Cys?Ile?Val?Tyr?His?Lys
245?????????????????250?????????????????255Gly?Leu?Lys?Lys?Ile?Thr?Thr?Thr?Thr?Tyr?Asn?Gln?Phe?Leu?Thr?Trp
260?????????????????265?????????????????270Lys?Asp?Ile?Ser?Leu?Ser?Arg?Leu?Asn?Val?Cys?Leu?Ile?Thr?Trp?Ile
275?????????????????280?????????????????285Ser?Asn?Cys?Leu?Asn?Thr?Leu?Asn?Lys?Ser?Leu?Gly?Leu?Arg?Cys?Gly
290?????????????????295?????????????????300Phe?Asn?Asn?Val?Val?Leu?Ser?Gln?Leu?Phe?Leu?Tyr?Gly?Asp?Cys?Ile305?????????????????310?????????????????315?????????????????320Leu?Lys?Leu?Phe?His?Asn?Glu?Gly?Phe?Tyr?Ile?Ile?Lys?Glu?Val?Glu
325?????????????????330?????????????????335Gly?Phe?Ile?Met?Ser?Leu?Ile?Leu?Asn?Ile?Thr?Glu?Glu?Asp?Gln?Phe
340?????????????????345?????????????????350Arg?Lys?Arg?Phe?Tyr?Asn?Ser?Met?Leu?Asn?Asn?Ile?Thr?Asp?Ala?Ala
355?????????????????360?????????????????365Ile?Lys?Ala?Gln?Lys?Asn?Leu?Leu?Ser?Arg?Val?Cys?His?Thr?Leu?Leu
370?????????????????375?????????????????380Asp?Lys?Thr?Val?Ser?Asp?Asn?Ile?Ile?Asn?Gly?Lys?Trp?Ile?Ile?Leu385?????????????????390?????????????????395?????????????????400Leu?Ser?Lys?Phe?Leu?Lys?Leu?Ile?Lys?Leu?Ala?Gly?Asp?Asn?Asn?Leu
405?????????????????410?????????????????415Asn?Asn?Leu?Ser?Glu?Leu?Tyr?Phe?Leu?Phe?Arg?Ile?Phe?Gly?His?Pro
420?????????????????425?????????????????430Met?Val?Asp?Glu?Arg?Gln?Ala?Met?Asp?Ala?Val?Arg?Ile?Asn?Cys?Asn
435?????????????????440?????????????????445Glu?Thr?Lys?Phe?Tyr?Leu?Leu?Ser?Asn?Leu?Ser?Thr?Leu?Arg?Gly?Ala
450?????????????????455?????????????????460Phe?Ile?Tyr?Arg?Ile?Ile?Lys?Gly?Phe?Val?Asn?Thr?Tyr?Asn?Arg?Trp465?????????????????470?????????????????475?????????????????480Pro?Thr?Leu?Arg?Asn?Ala?Ile?Val?Leu?Pro?Leu?Arg?Trp?Leu?Asn?Tyr
485?????????????????490?????????????????495Tyr?Lys?Leu?Asn?Thr?Tyr?Pro?Ser?Leu?Leu?Glu?Ile?Thr?Glu?Lys?Asp
500?????????????????505?????????????????510Leu?Ile?Ile?Leu?Ser?Gly?Leu?Arg?Phe?Tyr?Arg?Glu?Phe?His?Leu?Pro
515?????????????????520?????????????????525Lys?Lys?Val?Asp?Leu?Glu?Met?Ile?Ile?Asn?Asp?Lys?Ala?Ile?Ser?Pro
530?????????????????535?????????????????540Pro?Lys?Asp?Leu?Ile?Trp?Thr?Ser?Phe?Pro?Arg?Asn?Tyr?Met?Pro?Ser545?????????????????550?????????????????555?????????????????560His?Ile?Gln?Asn?Tyr?Ile?Glu?His?Glu?Lys?Leu?Lys?Phe?Ser?Glu?Ser
565?????????????????570?????????????????575Asp?Arg?Ser?Arg?Arg?Val?Leu?Glu?Tyr?Tyr?Leu?Arg?Asp?Asn?Lys?Phe
580?????????????????585?????????????????590Asn?Glu?Cys?Asp?Leu?Tyr?Asn?Cys?Val?Val?Asn?Gln?Ser?Tyr?Leu?Asn
595?????????????????600?????????????????605Asn?Ser?Asn?His?Val?Val?Ser?Leu?Thr?Gly?Lys?Glu?Arg?Glu?Leu?Ser
610?????????????????615?????????????????620Val?Gly?Arg?Met?Phe?Ala?Met?Gln?Pro?Gly?Met?Phe?Arg?Gln?Ile?Gln625?????????????????630?????????????????635?????????????????640Ile?Leu?Ala?Glu?Lys?Met?Ile?Ala?Glu?Asn?Ile?Leu?Gln?Phe?Phe?Pro
645?????????????????650?????????????????655Glu?Ser?Leu?Thr?Arg?Tyr?Gly?Asp?Leu?Glu?Leu?Gln?Lys?Ile?Leu?Glu
660?????????????????665?????????????????670Leu?Lys?Ala?Gly?Ile?Ser?Asn?Lys?Ser?Asn?Arg?Tyr?Asn?Asp?Asn?Tyr
675?????????????????680?????????????????685Asn?Asn?Tyr?Ile?Ser?Lys?Cys?Ser?Ile?Ile?Thr?Asp?Leu?Ser?Lys?Phe
690?????????????????695?????????????????700Asn?Gln?Ala?Phe?Arg?Tyr?Glu?Thr?Ser?Cys?Ile?Cys?Ser?Asp?Val?Leu705?????????????????710?????????????????715?????????????????720Asp?Glu?Leu?His?Gly?Val?Gln?Ser?Leu?Phe?Ser?Trp?Leu?His?Leu?Thr
725?????????????????730?????????????????735Ile?Pro?Leu?Val?Thr?Ile?Ile?Cys?Thr?Tyr?Arg?His?Ala?Pro?Pro?Phe
740?????????????????745?????????????????750Ile?Lys?Asp?His?Val?Val?Asn?Leu?Asn?Lys?Val?Asp?Glu?Gln?Ser?Gly
755?????????????????760?????????????????765Leu?Tyr?Arg?Tyr?His?Met?Gly?Gly?Ile?Glu?Gly?Trp?Cys?Gln?Lys?Leu
770?????????????????775?????????????????780Trp?Thr?Ile?Glu?Ala?Ile?Ser?Leu?Leu?Asp?Leu?Ile?Ser?Leu?Lys?Gly785?????????????????790?????????????????795?????????????????800Lys?Phe?Ser?Ile?Thr?Ala?Leu?Ile?Asn?Gly?Asp?Asn?Gln?Ser?Ile?Asp
805?????????????????810?????????????????815Ile?Ser?Lys?Pro?Val?Arg?Leu?Ile?Glu?Gly?Gln?Thr?His?Ala?Gln?Ala
820?????????????????825?????????????????830Asp?Tyr?Leu?Leu?Ala?Leu?Asn?Ser?Leu?Lys?Leu?Leu?Tyr?Lys?Glu?Tyr
835?????????????????840?????????????????845Ala?Gly?Ile?Gly?His?Lys?Leu?Lys?Gly?Thr?Glu?Thr?Tyr?Ile?Ser?Arg
850?????????????????855?????????????????860Asp?Met?Gln?Phe?Met?Ser?Lys?Thr?Ile?Gln?His?Asn?Gly?Val?Tyr?Tyr865?????????????????870?????????????????875?????????????????880Pro?Ala?Ser?Ile?Lys?Lys?Val?Leu?Arg?Val?Gly?Pro?Trp?Ile?Asn?Thr
885?????????????????890?????????????????895Ile?Leu?Asp?Asp?Phe?Lys?Val?Ser?Leu?Glu?Ser?Ile?Gly?Ser?Leu?Thr
900?????????????????905?????????????????910Gln?Glu?Leu?Glu?Tyr?Arg?Gly?Glu?Ser?Leu?Leu?Cys?Ser?Leu?Ile?Phe
915?????????????????920?????????????????925Arg?Asn?Ile?Trp?Leu?Tyr?Asn?Gln?Ile?Ala?Leu?Gln?Leu?Arg?Asn?His
930?????????????????935?????????????????940Ala?Leu?Cys?His?Asn?Lys?Leu?Tyr?Leu?Asp?Ile?Leu?Lys?Val?Leu?Lys945?????????????????950?????????????????955?????????????????960His?Leu?Lys?Thr?Phe?Phe?Asn?Leu?Asp?Ser?Ile?Asp?Met?Ala?Leu?Thr
965?????????????????970?????????????????975Leu?Tyr?Met?Asn?Leu?Pro?Met?Leu?Phe?Gly?Gly?Gly?Asp?Pro?Asn?Leu
980?????????????????985?????????????????990Leu?Tyr?Arg?Ser?Phe?Tyr?Arg?Arg?Thr?Pro?Asp?Phe?Leu?Thr?Glu?Ala
995?????????????????1000????????????????1005Ile?Val?His?Ser?Val?Phe?Val?Leu?Ser?Tyr?Tyr?Thr?Gly?His?Asp?Leu
1010????????????????1015????????????????1020Gln?Asp?Lys?Leu?Gln?Asp?Leu?Pro?Asp?Asp?Arg?Leu?Asn?Lys?Phe?Leu1025????????????????1030????????????????1035????????????????1040Thr?Cys?Ile?Ile?Thr?Phe?Asp?Lys?Asn?Pro?Asn?Ala?Glu?Phe?Val?Thr
1045????????????????1050????????????????1055Leu?Met?Arg?Asp?Pro?Gln?Ala?Leu?Gly?Ser?Glu?Arg?Gln?Ala?Lys?Ile
1060????????????????1065????????????????1070Thr?Ser?Glu?Ile?Asn?Arg?Leu?Ala?Val?Thr?Glu?Val?Leu?Ser?Ile?Ala
1075????????????????1080????????????????1085Pro?Asn?Lys?Ile?Phe?Ser?Lys?Ser?Ala?Gln?His?Tyr?Thr?Thr?Thr?Glu
1090????????????????1095????????????????1100Ile?Asp?Leu?Asn?Asp?Ile?Met?Gln?Asn?Ile?Glu?Pro?Thr?Tyr?Pro?His1105????????????????1110????????????????1115????????????????1120Gly?Leu?Arg?Val?Val?Tyr?Glu?Ser?Leu?Pro?Phe?Tyr?Lys?Ala?Glu?Lys
1125????????????????1130????????????????1135Ile?Val?Asn?Leu?Ile?Ser?Gly?Thr?Lys?Ser?Ile?Thr?Asn?Ile?Leu?Glu
1140????????????????1145????????????????1150Lys?Thr?Ser?Ala?Ile?Asp?Ser?Thr?Asp?Ile?Asn?Arg?Ala?Thr?Asp?Met
1155????????????????1160????????????????1165Met?Arg?Lys?Asn?Ile?Thr?Leu?Leu?Ile?Arg?Ile?Leu?Pro?Leu?Asp?Cys
1170????????????????1175????????????????1180Asn?Lys?Asp?Lys?Arg?Glu?Leu?Leu?Ser?Leu?Glu?Asn?Leu?Ser?Ile?Thr1185????????????????1190????????????????1195????????????????1200Glu?Leu?Ser?Lys?Tyr?Val?Arg?Glu?Arg?Ser?Trp?Ser?Leu?Ser?Asn?Ile
1205????????????????1210????????????????1215Val?Gly?Val?Thr?Ser?Pro?Ser?Ile?Met?Phe?Thr?Met?Asp?Ile?Lys?Tyr
1220????????????????1225????????????????1230Thr?Thr?Ser?Thr?Ile?Ala?Ser?Gly?Ile?Ile?Ile?Glu?Lys?Tyr?Asn?Val
1235????????????????1240????????????????1245Asn?Ser?Leu?Thr?Arg?Gly?Glu?Arg?Gly?Pro?Thr?Lys?Pro?Trp?Val?Gly
1250????????????????1255????????????????1260Ser?Ser?Thr?Gln?Glu?Lys?Lys?Thr?Met?Pro?Val?Tyr?Asn?Arg?Gln?Val1265????????????????1270????????????????1275????????????????1280Leu?Thr?Lys?Lys?Gln?Arg?Asp?Gln?Ile?Asp?Leu?Leu?Ala?Lys?Leu?Asp
1285????????????????1290????????????????1295Trp?Val?Tyr?Ala?Ser?Ile?Asp?Asn?Lys?Asp?Glu?Phe?Met?Glu?Glu?Leu
1300????????????????1305????????????????1310Ser?Thr?Gly?Thr?Leu?Gly?Leu?Ser?Tyr?Glu?Lys?Ala?Lys?Lys?Leu?Phe
1315????????????????1320????????????????1325Pro?Gln?Tyr?Leu?Ser?Val?Asn?Tyr?Leu?His?Arg?Leu?Thr?Val?Ser?Ser
1330????????????????1335????????????????1340Arg?Pro?Cys?Glu?Phe?Pro?Ala?Ser?Ile?Pro?Ala?Tyr?Arg?Thr?Thr?Asn1345????????????????1350????????????????1355????????????????1360Tyr?His?Phe?Asp?Thr?Ser?Pro?Ile?Asn?His?Val?Leu?Thr?Glu?Lys?Tyr
1365????????????????1370????????????????1375Gly?Asp?Glu?Asp?Ile?Asp?Ile?Val?Phe?Gln?Asn?Cys?Ile?Ser?Phe?Gly
1380????????????????1385????????????????1390Leu?Ser?Leu?Met?Ser?Val?Val?Glu?Gln?Phe?Thr?Asn?Ile?Cys?Pro?Asn
1395????????????????1400????????????????1405Arg?Ile?Ile?Leu?Ile?Pro?Lys?Leu?Asn?Glu?Ile?His?Leu?Met?Lys?Pro
1410????????????????1415????????????????1420Pro?Ile?Phe?Thr?Gly?Asp?Val?Asp?Ile?Ile?Lys?Leu?Lys?Gln?Val?Ile1425????????????????1430????????????????1435????????????????1440Gln?Lys?Gln?His?Met?Phe?Leu?Pro?Asp?Lys?Ile?Ser?Leu?Thr?Gln?Tyr
1445????????????????1450????????????????1455Val?Glu?Leu?Phe?Leu?Ser?Asn?Lys?Ala?Leu?Lys?Ser?Gly?Ser?His?Ile
1460????????????????1465????????????????1470Asn?Ser?Asn?Leu?Ile?Leu?Val?His?Lys?Met?Ser?Asp?Tyr?Phe?His?Asn
1475????????????????1480????????????????1485Ala?Tyr?Ile?Leu?Ser?Thr?Asn?Leu?Ala?Gly?His?Trp?Ile?Leu?Ile?Ile
1490????????????????1495????????????????1500Gln?Leu?Met?Lys?Asp?Ser?Lys?Gly?Ile?Phe?Glu?Lys?Asp?Trp?Gly?Glu1505????????????????1510????????????????1515????????????????1520Gly?Tyr?Ile?Thr?Asp?His?Met?Phe?Ile?Asn?Leu?Asn?Val?Phe?Phe?Asn
1525????????????????1530????????????????1535Ala?Tyr?Lys?Thr?Tyr?Leu?Leu?Cys?Phe?His?Lys?Gly?Tyr?Gly?Lys?Ala
1540????????????????1545????????????????1550Lys?Leu?Glu?Cys?Asp?Met?Asn?Thr?Ser?Asp?Leu?Leu?Cys?Val?Leu?Glu
1555????????????????1560????????????????1565Leu?Ile?Asp?Ser?Ser?Tyr?Trp?Lys?Ser?Met?Ser?Lys?Val?Phe?Leu?Glu
1570????????????????1575????????????????1580Gln?Lys?Val?Ile?Lys?Tyr?Ile?Ile?Asn?Gln?Asp?Thr?Ser?Leu?His?Arg1585????????????????1590????????????????1595????????????????1600Ile?Lys?Gly?Cys?His?Ser?Phe?Lys?Leu?Trp?Phe?Leu?Lys?Arg?Leu?Asn
1605????????????????1610????????????????1615Asn?Ala?Lys?Phe?Thr?Val?Cys?Pro?Trp?Val?Val?Asn?Ile?Asp?Tyr?His
1620????????????????1625????????????????1630Pro?Thr?His?Met?Lys?Ala?Ile?Leu?Ser?Tyr?Ile?Asp?Leu?Val?Arg?Met
1635????????????????1640????????????????1645Gly?Leu?Ile?Asn?Val?Asp?Lys?Leu?Thr?Ile?Lys?Asn?Lys?Asn?Lys?Phe
1650????????????????1655????????????????1660Asn?Asp?Glu?Phe?Tyr?Thr?Ser?Asn?Leu?Phe?Tyr?Ile?Ser?Tyr?Asn?Phe1665????????????????1670????????????????1675????????????????1680Ser?Asp?Asn?Thr?His?Leu?Leu?Thr?Lys?Gln?Ile?Arg?Ile?Ala?Asn?Ser
1685????????????????1690????????????????1695Glu?Leu?Glu?Asn?Asn?Tyr?Asn?Lys?Leu?Tyr?His?Pro?Thr?Pro?Glu?Thr
1700????????????????1705????????????????1710Leu?Glu?Asn?Met?Ser?Leu?Ile?Pro?Val?Lys?Ser?Asn?Asn?Ser?Asn?Lys
1715????????????????1720????????????????1725Pro?Lys?Phe?Gly?Ile?Ser?Gly?Asn?Thr?Glu?Ser?Met?Met?Thr?Ser?Thr
1730????????????????1735????????????????1740Phe?Ser?Asn?Lys?Thr?His?Ile?Lys?Ser?Ser?Ala?Val?Ile?Thr?Arg?Phe1745????????????????1750????????????????1755????????????????1760Asn?Tyr?Ser?Lys?Gln?Asp?Leu?Tyr?Asn?Leu?Phe?Pro?Ile?Val?Val?Ile
1765????????????????1770????????????????1775Asp?Arg?Ile?Ile?Asp?His?Ser?Gly?Asn?Thr?Ala?Lys?Ser?Asn?Gln?Leu
1780????????????????1785????????????????1790Tyr?Thr?Thr?Thr?Ser?His?Gln?Thr?Ser?Leu?Val?Arg?Asn?Ser?Ala?Ser
1795????????????????1800????????????????1805Leu?Tyr?Cys?Met?Leu?Pro?Trp?His?His?Val?Asn?Arg?Phe?Asn?Phe?Val
1810????????????????1815????????????????1820Phe?Ser?Ser?Thr?Gly?Cys?Lys?Ile?Ser?Ile?Glu?Tyr?Ile?Leu?Lys?Asp1825????????????????1830????????????????1835????????????????1840Leu?Lys?Ile?Lys?Asp?Pro?Ser?Cys?Ile?Ala?Phe?Ile?Gly?Glu?Gly?Ala
1845????????????????1850????????????????1855Gly?Asn?Leu?Leu?Leu?Arg?Thr?Val?Val?Glu?Leu?His?Pro?Asp?Ile?Arg
1860????????????????1865????????????????1870Tyr?Ile?Tyr?Arg?Ser?Leu?Lys?Asp?Cys?Asn?Asp?His?Ser?Leu?Pro?Ile
1875????????????????1880????????????????1885Glu?Phe?Leu?Arg?Leu?Tyr?Asn?Gly?His?Ile?Asn?Ile?Asp?Tyr?Gly?Glu
1890????????????????1895????????????????1900Asn?Leu?Thr?Ile?Pro?Ala?Thr?Asp?Ala?Thr?Asn?Asn?Ile?His?Trp?Ser1905????????????????1910????????????????1915????????????????1920Tyr?Leu?His?Ile?Lys?Phe?Ala?Glu?Pro?Ile?Ser?Ile?Phe?Val?Cys?Asp
1925????????????????1930????????????????1935Ala?Glu?Leu?Pro?Val?Thr?Ala?Asn?Trp?Ser?Lys?Ile?Ile?Ile?Glu?Trp
1940????????????????1945????????????????1950Ser?Lys?His?Val?Arg?Lys?Cys?Lys?Tyr?Cys?Ser?Ser?Val?Asn?Arg?Cys
1955????????????????1960????????????????1965Ile?Leu?Ile?Ala?Lys?Tyr?His?Ala?Gln?Asp?Asp?Ile?Asp?Phe?Lys?Leu
1970????????????????1975????????????????1980Asp?Asn?Ile?Thr?Ile?Leu?Lys?Thr?Tyr?Val?Cys?Leu?Gly?Ser?Lys?Leu1985????????????????1990????????????????1995????????????????2000Lys?Gly?Ser?Glu?Val?Tyr?Leu?Val?Leu?Thr?Ile?Gly?Pro?Ala?Asn?Ile
2005????????????????2010????????????????2015Leu?Pro?Val?Phe?Asn?Val?Val?Gln?Asn?Ala?Lys?Leu?Ile?Leu?Ser?Arg
2020????????????????2025????????????????2030Thr?Lys?Asn?Phe?Ile?Met?Pro?Lys?Lys?Thr?Asp?Lys?Glu?Ser?Ile?Asp
2035????????????????2040????????????????2045Ala?Asn?Ile?Lys?Ser?Leu?Ile?Pro?Phe?Leu?Cys?Tyr?Pro?Ile?Thr?Lys
2050????????????????2055????????????????2060Lys?Gly?Ile?Lys?Thr?Ser?Leu?Ser?Lys?Leu?Lys?Ser?Val?Val?Ser?Gly2065????????????????2070????????????????2075????????????????2080Asp?Ile?Leu?Ser?Tyr?Ser?Ile?Ala?Gly?Arg?Asn?Glu?Val?Phe?Ser?Asn
2085????????????????2090????????????????2095
Lys?Leu?Ile?Asn?His?Lys?His?Met?Asn?Ile?Leu?Lys?Trp?Leu?Asp?His
2100????????????????2105????????????????2110
Val?Leu?Asn?Phe?Arg?Ser?Ala?Glu?Leu?Asn?Tyr?Asn?His?Leu?Tyr?Met
2115????????????????2120????????????????2125
Ile?Glu?Ser?Thr?Tyr?Pro?Tyr?Leu?Ser?Glu?Leu?Leu?Asn?Ser?Leu?Thr
2130????????????????2135????????????????2140
Thr?Asn?Glu?Leu?Lys?Lys?Leu?Ile?Lys?Ile?Thr?Gly?Ser?Val?Leu?Tyr
2145????????????????2150????????????????2155????????????????2160
Asn?Leu?Pro?Asn?Glu?Gln
The information of 2165 (2) SEQ ID NO:27:
(i) sequence signature:
(A) length: 15219 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60 TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120 ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180 ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240 ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300 GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360 TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420 AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480 CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540 TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600 AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660 GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720 AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780 TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840 AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900 CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960 AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020 CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080 CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140 TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200 GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260 AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320 TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380 AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440 GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500 CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560 AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620 TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680 CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740 TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800 ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860 GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920 GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980 TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040 ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100 CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160 CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220 GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280 AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340 AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400 CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460 TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520 TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580 CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640 TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700 CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760 TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820 ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880 AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940 TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000 AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060 TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120 TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180 CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240 AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300 ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360 AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420 AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480 GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540 TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600 AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660 CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720 GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780 TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840 AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900 CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960 GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020 ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080 ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140 CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCCGCTA GACCTAGAGT GCGAATAGGC 4200 AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260 CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320 ACAATAGAAC TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380 CTAATCTTTT TACTAATTAT AATCACTATC ATGATTGCAA CACTAAATAA GCTAAGTGAA 4440 CACAAAGCAT TCTGCAACAA AACTCTTGAA CTAGGACAGA TGTACCAAAT CAACACACAG 4500 AGTTCCACCA TTATGCTGTG TCAAACCATA ATCCTGTATA TACAAACAAA CAAATCCAAT 4560 CCTCTCACAG AGTCACGGTG TCGCAAAACC ACGCTAACCA TCATGGTAGC ATAGAGTAGT 4620 TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680 AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740 GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800 ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860 GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920 ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980 AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040 ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100 ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160 AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220 CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280 ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340 GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400 ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460 ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520 ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580 TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640 AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700 TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760 ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820 TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880 TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940 GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000 TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060 ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120 ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180 AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240 TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300 CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360 CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420 CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480 GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540 AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600 GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660 AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720 GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780 ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840 AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900 GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960 ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020 CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080 AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140 AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200 TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260 CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320 CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380 ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440 CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500 TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560 ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620 AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680 AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740 AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800 GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860 ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920 ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980 AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040 CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100 ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160 CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220 CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280 TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340 TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400 TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460 TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520 GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580 GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640 AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700 ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760 TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820 AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880 ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940 GATGAAAACT CAGTACTTAC AACCATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000 AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060 ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120 TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180 AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240 AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300 AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360 TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420 AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480 GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540 ACAGAAGAAG ATCAATTTAA GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600 GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660 ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720 TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780 AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840 TGTAATGAAA CTAGGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900 TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960 ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020 GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080 CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140 GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200 GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260 TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320 CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380 AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440 ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500 CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560 AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620 GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680 CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740 AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800 AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860 ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920 CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980 CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040 GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100 CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160 GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220 TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280 ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340 TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400 AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460 GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520 GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580 AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640 GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700 GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760 ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820 CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880 GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940 ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000 GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060 GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120 AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180 ACAATGAACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240 AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300 ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360 GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420 GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480 TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540 TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600 CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660 AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720 CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780 TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840 CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900 AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960 CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020 ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080 ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140 AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200 TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260 GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320 TTTAAGTTGT GGTTTTTAAA ACGCCTTAAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380 GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440 AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500 GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560 CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620 CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680 AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740 AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800 TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860 GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920 GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980 TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040 AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100 CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160 CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220 ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280 GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340 AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400 AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460 ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520 TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580 AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640 ATCGATGCAA ATATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700 ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760 ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820 CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880 TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940 GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000 TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060 TAGTTATTAA AGAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120 TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180 ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219 (2) SEQ ID NO: 28 information about: ...
(i) sequence signature:
(A) length: 2166 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein (xi) sequence description: SEQ ID NO:28:Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp1 5 10 15Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
20??????????????????25??????????????????30Ser?Tyr?Leu?Phe?Asn?Gly?Pro?Tyr?Leu?Lys?Asn?Asp?Tyr?Thr?Asn?Leu
35??????????????????40??????????????????45Ile?Ser?Arg?Gln?Ser?Pro?Leu?Leu?Glu?His?Met?Asn?Leu?Lys?Lys?Leu
50??????????????????55??????????????????60Thr?Ile?Thr?Gln?Ser?Leu?Ile?Ser?Arg?Tyr?His?Lys?Gly?Glu?Leu?Lys65??????????????????70??????????????????75??????????????????80Leu?Glu?Glu?Pro?Thr?Tyr?Phe?Gln?Ser?Leu?Leu?Met?Thr?Tyr?Lys?Ser
85??????????????????90??????????????????95Met?Ser?Ser?Ser?Glu?Gln?Ile?Ala?Thr?Thr?Asn?Leu?Leu?Lys?Lys?Ile
100?????????????????105?????????????????110Ile?Arg?Arg?Ala?Ile?Glu?Ile?Ser?Asp?Val?Lys?Val?Tyr?Ala?Ile?Leu
115?????????????????120?????????????????125Asn?Lys?Leu?Gly?Leu?Lys?Glu?Lys?Asp?Arg?Val?Lys?Pro?Asn?Asn?Asn
130?????????????????135?????????????????140Ser?Gly?Asp?Glu?Asn?Ser?Val?Leu?Thr?Thr?Ile?Ile?Lys?Asp?Asp?Ile145?????????????????150?????????????????155?????????????????160Leu?Ser?Ala?Val?Glu?Asn?Asn?Gln?Ser?Tyr?Thr?Asn?Ser?Asp?Lys?Ser
165?????????????????170?????????????????175His?Ser?Val?Asn?Gln?Asn?Ile?Thr?Ile?Lys?Thr?Thr?Leu?Leu?Lys?Lys
180?????????????????185?????????????????190Leu?Met?Cys?Ser?Met?Gln?His?Pro?Pro?Ser?Trp?Leu?Ile?His?Trp?Phe
195?????????????????200?????????????????205Asn?Leu?Tyr?Thr?Lys?Leu?Asn?Asn?Ile?Leu?Thr?Gln?Tyr?Arg?Ser?Asn
210?????????????????215?????????????????220Glu?Val?Lys?Ser?His?Gly?Phe?Ile?Leu?Ile?Asp?Asn?Gln?Thr?Leu?Ser225?????????????????230?????????????????235?????????????????240Gly?Phe?Gln?Phe?Ile?Leu?Asn?Gln?Tyr?Gly?Cys?Ile?Val?Tyr?His?Lys
245?????????????????250?????????????????255Gly?Leu?Lys?Lys?Ile?Thr?Thr?Thr?Thr?Tyr?Asn?Gln?Phe?Leu?Thr?Trp
260?????????????????265?????????????????270Lys?Asp?Ile?Ser?Leu?Ser?Arg?Leu?Asn?Val?Cys?Leu?Ile?Thr?Trp?Ile
275?????????????????280?????????????????285Ser?Asn?Cys?Leu?Asn?Thr?Leu?Asn?Lys?Ser?Leu?Gly?Leu?Arg?Cys?Gly
290?????????????????295?????????????????300Phe?Asn?Asn?Val?Val?Leu?Ser?Gln?Leu?Phe?Leu?Tyr?Gly?Asp?Cys?Ile305?????????????????310?????????????????315?????????????????320Leu?Lys?Leu?Phe?His?Asn?Glu?Gly?Phe?Tyr?Ile?Ile?Lys?Glu?Val?Glu
325?????????????????330?????????????????335Gly?Phe?Ile?Met?Ser?Leu?Ile?Leu?Asn?Ile?Thr?Glu?Glu?Asp?Gln?Phe
340?????????????????345?????????????????350Lys?Lys?Arg?Phe?Tyr?Asn?Ser?Met?Leu?Asn?Asn?Ile?Thr?Asp?Ala?Ala
355?????????????????360?????????????????365Ile?Lys?Ala?Gln?Lys?Asp?Leu?Leu?Ser?Arg?Val?Cys?His?Thr?Leu?Leu
370?????????????????375?????????????????380Asp?Lys?Thr?Val?Ser?Asp?Asn?Ile?Ile?Asn?Gly?Lys?Trp?Ile?Ile?Leu385?????????????????390?????????????????395?????????????????400Leu?Ser?Lys?Phe?Leu?Lys?Leu?Ile?Lys?Leu?Ala?Gly?Asp?Asn?Asn?Leu
405?????????????????410?????????????????415Asn?Asn?Leu?Ser?Glu?Leu?Tyr?Phe?Leu?Phe?Arg?Ile?Phe?Gly?His?Pro
420?????????????????425?????????????????430Met?Val?Asp?Glu?Arg?Gln?Ala?Met?Asp?Ser?Val?Arg?Ile?Asn?Cys?Asn
435?????????????????440?????????????????445Glu?Thr?Arg?Phe?Tyr?Leu?Leu?Ser?Ser?Leu?Ser?Thr?Leu?Arg?Gly?Ala
450?????????????????455?????????????????460Phe?Ile?Tyr?Arg?Ile?Ile?Lys?Gly?Phe?Val?Asn?Thr?Tyr?Asn?Arg?Trp465?????????????????470?????????????????475?????????????????480Pro?Thr?Leu?Arg?Asn?Ala?Ile?Val?Leu?Pro?Leu?Arg?Trp?Leu?Asn?Tyr
485?????????????????490?????????????????495Tyr?Lys?Leu?Asn?Thr?Tyr?Pro?Ser?Leu?Leu?Glu?Ile?Thr?Glu?Asn?Asp
500?????????????????505?????????????????510Leu?Ile?Ile?Leu?Ser?Gly?Leu?Arg?Phe?Tyr?Arg?Glu?Phe?His?Leu?Pro
515?????????????????520?????????????????525Lys?Lys?Val?Asp?Leu?Glu?Met?Ile?Ile?Asn?Asp?Lys?Ala?Ile?Ser?Pro
530?????????????????535?????????????????540Pro?Lys?Asp?Leu?Ile?Trp?Thr?Ser?Phe?Pro?Arg?Asn?Tyr?Met?Pro?Ser545?????????????????550?????????????????555?????????????????560His?Ile?Gln?Asn?Tyr?Ile?Glu?His?Glu?Lys?Leu?Lys?Phe?Ser?Glu?Ser
565?????????????????570?????????????????575Asp?Arg?Ser?Arg?Arg?Val?Leu?Glu?Tyr?Tyr?Leu?Arg?Asp?Asn?Lys?Phe
580?????????????????585?????????????????590Asn?Glu?Cys?Asp?Leu?Tyr?Asn?Cys?Val?Val?Asn?Gln?Ser?Tyr?Leu?Asn
595?????????????????600?????????????????605Asn?Ser?Asn?His?Val?Val?Ser?Leu?Thr?Gly?Lys?Glu?Arg?Glu?Leu?Ser
610?????????????????615?????????????????620Val?Gly?Arg?Met?Phe?Ala?Met?Gln?Pro?Gly?Met?Phe?Arg?Gln?Ile?Gln625?????????????????630?????????????????635?????????????????640Ile?Leu?Ala?Glu?Lys?Met?Ile?Ala?Glu?Asn?Ile?Leu?Gln?Phe?Phe?Pro
645?????????????????650?????????????????655Glu?Ser?Leu?Thr?Arg?Tyr?Gly?Asp?Leu?Glu?Leu?Gln?Lys?Ile?Leu?Glu
660?????????????????665?????????????????670Leu?Lys?Ala?Gly?Ile?Ser?Asn?Lys?Ser?Asn?Arg?Tyr?Asn?Asp?Asn?Tyr
675?????????????????680?????????????????685Asn?Asn?Tyr?Ile?Ser?Lys?Cys?Ser?Ile?Ile?Thr?Asp?Leu?Ser?Lys?Phe
690?????????????????695?????????????????700Asn?Gln?Ala?Phe?Arg?Tyr?Glu?Thr?Ser?Cys?Ile?Cys?Ser?Asp?Val?Leu705?????????????????710?????????????????715?????????????????720Asp?Glu?Leu?His?Gly?Val?Gln?Ser?Leu?Phe?Ser?Trp?Leu?His?Leu?Thr
725?????????????????730?????????????????735Ile?Pro?Leu?Val?Thr?Ile?Ile?Cys?Thr?Tyr?Arg?His?Ala?Pro?Pro?Phe
740?????????????????745?????????????????750Ile?Lys?Asp?His?Val?Val?Asn?Leu?Asn?Glu?Val?Asp?Glu?Gln?Ser?Gly
755?????????????????760?????????????????765Leu?Tyr?Arg?Tyr?His?Met?Gly?Gly?Ile?Glu?Gly?Trp?Cys?Gln?Lys?Leu
770?????????????????775?????????????????780Trp?Thr?Ile?Glu?Ala?Ile?Ser?Leu?Leu?Asp?Leu?Ile?Ser?Leu?Lys?Gly785?????????????????790?????????????????795?????????????????800Lys?Phe?Ser?Ile?Thr?Ala?Leu?Ile?Asn?Gly?Asp?Asn?Gln?Ser?Ile?Asp
805?????????????????810?????????????????815Ile?Ser?Lys?Pro?Val?Arg?Leu?Ile?Glu?Gly?Gln?Thr?His?Ala?Gln?Ala
820?????????????????825?????????????????830Asp?Tyr?Leu?Leu?Ala?Leu?Asn?Ser?Leu?Lys?Leu?Leu?Tyr?Lys?Glu?Tyr
835?????????????????840?????????????????845Ala?Gly?Ile?Gly?His?Lys?Leu?Lys?Gly?Thr?Glu?Thr?Tyr?Ile?Ser?Arg
850?????????????????855?????????????????860Asp?Met?Gln?Phe?Met?Ser?Lys?Thr?Ile?Gln?His?Asn?Gly?Val?Tyr?Tyr865?????????????????870?????????????????875?????????????????880Pro?Ala?Ser?Ile?Lys?Lys?Val?Leu?Arg?Val?Gly?Pro?Trp?Ile?Asn?Thr
885?????????????????890?????????????????895Ile?Leu?Asp?Asp?Phe?Lys?Val?Ser?Leu?Glu?Ser?Ile?Gly?Ser?Leu?Thr
900?????????????????905?????????????????910Gln?Glu?Leu?Glu?Tyr?Arg?Gly?Glu?Ser?Leu?Leu?Cys?Ser?Leu?Ile?Phe
915?????????????????920?????????????????925Arg?Asn?Ile?Trp?Leu?Tyr?Asn?Gln?Ile?Ala?Leu?Gln?Leu?Arg?Asn?His
930?????????????????935?????????????????940Ala?Leu?Cys?Asn?Asn?Lys?Leu?Tyr?Leu?Asp?Ile?Leu?Lys?Val?Leu?Lys945?????????????????950?????????????????955?????????????????960His?Leu?Lys?Thr?Phe?Phe?Asn?Leu?Asp?Ser?Ile?Asp?Met?Ala?Leu?Ser
965?????????????????970?????????????????975Leu?Tyr?Met?Asn?Leu?Pro?Met?Leu?Phe?Gly?Gly?Gly?Asp?Pro?Asn?Leu
980?????????????????985?????????????????990Leu?Tyr?Arg?Ser?Phe?Tyr?Arg?Arg?Thr?Pro?Asp?Phe?Leu?Thr?Glu?Ala
995?????????????????1000????????????????1005Ile?Val?His?Ser?Val?Phe?Val?Leu?Ser?Tyr?Tyr?Thr?Gly?His?Asp?Leu
1010????????????????1015????????????????1020Gln?Asp?Lys?Leu?Gln?Asp?Leu?Pro?Asp?Asp?Arg?Leu?Asn?Lys?Phe?Leu1025????????????????1030????????????????1035????????????????1040Thr?Cys?Val?Ile?Thr?Phe?Asp?Lys?Asn?Pro?Asn?Ala?Glu?Phe?Val?Thr
1045????????????????1050????????????????1055Leu?Met?Arg?Asp?Pro?Gln?Ala?Leu?Gly?Ser?Glu?Arg?Gln?Ala?Lys?Ile
1060????????????????1065????????????????1070Thr?Ser?Glu?Ile?Asn?Arg?Leu?Ala?Val?Thr?Glu?Val?Leu?Ser?Ile?Ala
1075????????????????1080????????????????1085Pro?Asn?Lys?Ile?Phe?Ser?Lys?Ser?Ala?Gln?His?Tyr?Thr?Thr?Thr?Glu
1090????????????????1095????????????????1100Ile?Asp?Leu?Asn?Asp?Ile?Met?Gln?Asn?Ile?Glu?Pro?Thr?Tyr?Pro?His1105????????????????1110????????????????1115????????????????1120Gly?Leu?Arg?Val?Val?Tyr?Glu?Ser?Leu?Pro?Phe?Tyr?Lys?Ala?Glu?Lys
1125????????????????1130????????????????1135Ile?Val?Asn?Leu?Ile?Ser?Gly?Thr?Lys?Ser?Ile?Thr?Asn?Ile?Leu?Glu
1140????????????????1145????????????????1150Lys?Thr?Ser?Ala?Ile?Asp?Thr?Thr?Asp?Ile?Asn?Arg?Ala?Thr?Asp?Met
1155????????????????1160????????????????1165Met?Arg?Lys?Asn?Ile?Thr?Leu?Leu?Ile?Arg?Ile?Leu?Pro?Leu?Asp?Cys
1170????????????????1175????????????????1180Asn?Lys?Asp?Lys?Arg?Glu?Leu?Leu?Ser?Leu?Glu?Asn?Leu?Ser?Ile?Thr1185????????????????1190????????????????1195????????????????1200Glu?Leu?Ser?Lys?Tyr?Val?Arg?Glu?Arg?Ser?Trp?Ser?Leu?Ser?Asn?Ile
1205????????????????1210????????????????1215Val?Gly?Val?Thr?Ser?Pro?Ser?Ile?Met?Phe?Thr?Met?Asn?Ile?Lys?Tyr
1220????????????????1225????????????????1230Thr?Thr?Ser?Thr?Ile?Ala?Ser?Gly?Ile?Ile?Ile?Glu?Lys?Tyr?Asn?Val
1235????????????????1240????????????????1245Asn?Ser?Leu?Thr?Arg?Gly?Glu?Arg?Gly?Pro?Thr?Lys?Pro?Trp?Val?Gly
1250????????????????1255????????????????1260Ser?Ser?Thr?Gln?Glu?Lys?Lys?Thr?Met?Pro?Val?Tyr?Asn?Arg?Gln?Val1265????????????????1270????????????????1275????????????????1280Leu?Thr?Lys?Lys?Gln?Arg?Asp?Gln?Ile?Asp?Leu?Leu?Ala?Lys?Leu?Asp
1285????????????????1290????????????????1295Trp?Val?Tyr?Ala?Ser?Ile?Asp?Asn?Lys?Asp?Glu?Phe?Met?Glu?Glu?Leu
1300????????????????1305????????????????1310Ser?Thr?Gly?Thr?Leu?Gly?Leu?Ser?Tyr?Glu?Lys?Ala?Lys?Lys?Leu?Phe
1315????????????????1320????????????????1325Pro?Gln?Tyr?Leu?Ser?Val?Asn?Tyr?Leu?His?Arg?Leu?Thr?Val?Ser?Ser
1330????????????????1335????????????????1340Arg?Pro?Cys?Glu?Phe?Pro?Ala?Ser?Ile?Pro?Ala?Tyr?Arg?Thr?Thr?Asn1345????????????????1350????????????????1355????????????????1360Tyr?His?Phe?Asp?Thr?Ser?Pro?Ile?Asn?His?Val?Leu?Thr?Glu?Lys?Tyr
1365????????????????1370????????????????1375Gly?Asp?Glu?Asp?Ile?Asp?Ile?Val?Phe?Gln?Asn?Cys?Ile?Ser?Phe?Gly
1380????????????????1385????????????????1390Leu?Ser?Leu?Met?Ser?Val?Val?Glu?Gln?Phe?Thr?Asn?Ile?Cys?Pro?Asn
1395????????????????1400????????????????1405Arg?Ile?Ile?Leu?Ile?Pro?Lys?Leu?Asn?Glu?Ile?His?Leu?Met?Lys?Pro
1410????????????????1415????????????????1420Pro?Ile?Phe?Thr?Gly?Asp?Val?Asp?Ile?Ile?Lys?Leu?Lys?Gln?Val?Ile1425????????????????1430????????????????1435????????????????1440Gln?Lys?Gln?His?Met?Phe?Leu?Pro?Asp?Lys?Ile?Ser?Leu?Thr?Gln?Tyr
1445????????????????1450????????????????1455Val?Glu?Leu?Phe?Leu?Ser?Asn?Lys?Ala?Leu?Lys?Ser?Gly?Ser?His?Ile
1460????????????????1465????????????????1470Asn?Ser?Asn?Leu?Ile?Leu?Val?His?Lys?Met?Ser?Asp?Tyr?Phe?His?Asn
1475????????????????1480????????????????1485Ala?Tyr?Ile?Leu?Ser?Thr?Asn?Leu?Ala?Gly?His?Trp?Ile?Leu?Ile?Ile
1490????????????????1495????????????????1500Gln?Leu?Met?Lys?Asp?Ser?Lys?Gly?Ile?Phe?Glu?Lys?Asp?Trp?Gly?Glu1505????????????????1510????????????????1515????????????????1520Gly?Tyr?Ile?Thr?Asp?His?Met?Phe?Ile?Asn?Leu?Asn?Val?Phe?Phe?Asn
1525????????????????1530????????????????1535Ala?Tyr?Lys?Thr?Tyr?Leu?Leu?Cys?Phe?His?Lys?Gly?Tyr?Gly?Lys?Ala
1540????????????????1545????????????????1550Lys?Leu?Glu?Cys?Asp?Met?Asn?Thr?Ser?Asp?Leu?Leu?Cys?Val?Leu?Glu
1555????????????????1560????????????????1565Leu?Ile?Asp?Ser?Ser?Tyr?Trp?Lys?Ser?Met?Ser?Lys?Val?Phe?Leu?Glu
1570????????????????1575????????????????1580Gln?Lys?Val?Ile?Lys?Tyr?Ile?Val?Asn?Gln?Asp?Thr?Ser?Leu?Arg?Arg1585????????????????1590????????????????1595????????????????1600Ile?Lys?Gly?Cys?His?Ser?Phe?Lys?Leu?Trp?Phe?Leu?Lys?Arg?Leu?Asn
1605????????????????1610????????????????1615Asn?Ala?Lys?Phe?Thr?Val?Cys?Pro?Trp?Val?Val?Asn?Ile?Asp?Tyr?His
1620????????????????1625????????????????1630Pro?Thr?His?Met?Lys?Ala?Ile?Leu?Ser?Tyr?Ile?Asp?Leu?Val?Arg?Met
1635????????????????1640????????????????1645Gly?Leu?Ile?Asn?Val?Asp?Lys?Leu?Thr?Ile?Lys?Asn?Lys?Asn?Lys?Phe
1650????????????????1655????????????????I660Asn?Asp?Glu?Phe?Tyr?Thr?Ser?Asn?Leu?Phe?Tyr?Ile?Ser?Tyr?Asn?Phe1665????????????????1670????????????????1675????????????????1680Ser?Asp?Asn?Thr?His?Leu?Leu?Thr?Lys?Gln?Ile?Arg?Ile?Ala?Asn?Ser
1685????????????????1690????????????????1695Glu?Leu?Glu?Asp?Asn?Tyr?Asn?Lys?Leu?Tyr?His?Pro?Thr?Pro?Glu?Thr
1700????????????????1705????????????????1710Leu?Glu?Asn?Met?Ser?Leu?Ile?Pro?Val?Lys?Ser?Asn?Asn?Ser?Asn?Lys
1715????????????????1720????????????????1725Pro?Lys?Phe?Cys?Ile?Ser?Gly?Asn?Thr?Glu?Ser?Met?Met?Met?Ser?Thr
1730????????????????1735????????????????1740Phe?Ser?Ser?Lys?Met?His?Ile?Lys?Ser?Ser?Thr?Val?Thr?Thr?Arg?Phe1745????????????????1750????????????????1755????????????????1760Asn?Tyr?Ser?Lys?Gln?Asp?Leu?Tyr?Asn?Leu?Phe?Pro?Ile?Val?Val?Ile
1765????????????????1770????????????????1775Asp?Lys?Ile?Ile?Asp?His?Ser?Gly?Asn?Thr?Ala?Lys?Ser?Asn?Gln?Leu
1780????????????????1785????????????????1790Tyr?Thr?Thr?Thr?Ser?His?Gln?Thr?Ser?Leu?Val?Arg?Asn?Ser?Ala?Ser
1795????????????????1800????????????????1805Leu?Tyr?Cys?Met?Leu?Pro?Trp?His?His?Val?Asn?Arg?Phe?Asn?Phe?Val
1810????????????????1815????????????????1820Phe?Ser?Ser?Thr?Gly?Cys?Lys?Ile?Ser?Ile?Glu?Tyr?Ile?Leu?Lys?Asp1825????????????????1830????????????????1835????????????????1840Leu?Lys?Ile?Lys?Asp?Pro?Ser?Cys?Ile?Ala?Phe?Ile?Gly?Glu?Gly?Ala
1845????????????????1850????????????????1855Gly?Asn?Leu?Leu?Leu?Arg?Thr?Val?Val?Glu?Leu?His?Pro?Asp?Ile?Arg
1860????????????????1865????????????????1870Tyr?Ile?Tyr?Arg?Ser?Leu?Lys?Asp?Cys?Asn?Asp?His?Ser?Leu?Pro?Ile
1875????????????????1880????????????????1885Glu?Phe?Leu?Arg?Leu?Tyr?Asn?Gly?His?Ile?Asn?Ile?Asp?Tyr?Gly?Glu
1890????????????????1895????????????????1900Asn?Leu?Thr?Ile?Pro?Ala?Thr?Asp?Ala?Thr?Asn?Asn?Ile?His?Trp?Ser1905????????????????1910????????????????1915????????????????1920Tyr?Leu?His?Ile?Lys?Phe?Ala?Glu?Pro?Ile?Ser?Ile?Phe?Val?Cys?Asp
1925????????????????1930????????????????1935Ala?Glu?Leu?Pro?Val?Thr?Ala?Asn?Trp?Ser?Lys?Ile?Ile?Ile?Glu?Trp
1940????????????????1945????????????????1950Ser?Lys?His?Val?Arg?Lys?Cys?Lys?Tyr?Cys?Ser?Ser?Val?Asn?Arg?Cys
1955????????????????1960????????????????1965Ile?Leu?Ile?Ala?Lys?Tyr?His?Ala?Gln?Asp?Asp?Ile?Asp?Phe?Lys?Leu
1970????????????????1975????????????????1980Asp?Asn?Ile?Thr?Ile?Leu?Lys?Thr?Tyr?Val?Cys?Leu?Gly?Ser?Lys?Leu1985????????????????1990????????????????1995????????????????2000Lys?Gly?Ser?Glu?Val?Tyr?Leu?Ile?Leu?Thr?Ile?Gly?Pro?Ala?Asn?Ile
2005????????????????2010????????????????2015Leu?Pro?Val?Phe?Asp?Val?Val?Gln?Asn?Ala?Lys?Leu?Ile?Leu?Ser?Arg
2020????????????????2025????????????????2030
Thr?Lys?Asn?Phe?Ile?Met?Pro?Lys?Lys?Thr?Asp?Lys?Glu?Ser?Ile?Asp
2035????????????????2040????????????????2045
Ala?Asn?Ile?Lys?Ser?Leu?Ile?Pro?Phe?Leu?Cys?Tyr?Pro?Ile?Thr?Lys
2050????????????????2055????????????????2060
Lys?Gly?Ile?Lys?Thr?Ser?Leu?Ser?Lys?Leu?Lys?Ser?Val?Val?Asn?Gly
2065????????????????2070????????????????2075????????????????2080
Asp?Ile?Leu?Ser?Tyr?Ser?Ile?Ala?Gly?Arg?Asn?Glu?Val?Phe?Ser?Asn
2085????????????????2090????????????????2095
Lys?Leu?Ile?Asn?His?Lys?His?Met?Asn?Ile?Leu?Lys?Trp?Leu?Asp?His
2100????????????????2105????????????????2110
Val?Leu?Asn?Phe?Arg?Ser?Ala?Glu?Leu?Asn?Tyr?Asn?His?Leu?Tyr?Met
2115????????????????2120????????????????2125
Ile?Glu?Ser?Thr?Tyr?Pro?Tyr?Leu?Ser?Glu?Leu?Leu?Asn?Ser?Leu?Thr
2130????????????????2135????????????????2140
Thr?Asn?Glu?Leu?Lys?Lys?Leu?Ile?Lys?Ile?Thr?Gly?Ser?Val?Leu?Tyr
2145????????????????2150????????????????2155????????????????2160
Asn?Leu?Pro?Asn?Glu?Gln
The information of 2165 (2) SEQ ID NO:29:
(i) sequence signature:
(A) length: 15219 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60 TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120 ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180 ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240 ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300 GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360 TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420 AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480 CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540 TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600 AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660 GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720 AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780 TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840 AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900 CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960 AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020 CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080 CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140 TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200 GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260 AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320 TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380 AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440 GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500 CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560 AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620 TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680 CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740 TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800 ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860 GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920 GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980 TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040 ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100 CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160 CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220 GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280 AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340 AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400 CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460 TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520 TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580 CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640 TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700 CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760 TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820 ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880 AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940 TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000 AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060 TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120 TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180 CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240 AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300 ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360 AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420 AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480 GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540 TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600 AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660 CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720 GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780 TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840 AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900 CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960 GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020 ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080 ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140 CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCTGCTA GACCTAGAGT GCGAATAGGT 4200 AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260 CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320 ACAATAGAAT TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380 CTAATCTTTT TACTAATTAT AATCACTATT ATGATTGCAA TACTAAATAA GCTAAGTGAA 4440 CATAAAGCAT TCTGTAACAA AACTCTTGAA CTAGGACAGA TGTATCAAAT CAACACATAG 4500 AGTTCTACCA TTATGCTGTG TCAAATTATA ATCCTGTATA TATAAACAAA CAAATCCAAT 4560 CTTCTCACAG AGTCATGGTG TCGCAAAACC ACGCTAACTA TCATGGTAGC ATAGAGTAGT 4620 TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680 AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740 GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800 ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860 GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920 ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980 AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040 ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100 ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160 AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220 CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280 ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340 GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400 ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460 ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520 ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580 TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640 AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700 TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760 ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820 TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880 TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940 GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000 TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060 ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120 ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180 AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240 TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300 CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360 CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420 CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480 GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540 AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600 GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660 AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720 GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780 ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840 AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900 GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960 ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020 CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080 AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140 AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200 TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260 CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320 CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380 ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440 CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500 TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560 ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620 AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680 AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740 AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800 GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860 ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920 ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980 AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040 CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100 ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160 CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220 CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280 TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340 TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400 TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460 TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520 GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580 GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640 AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700 ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760 TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820 AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880 ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940 GATGAAAACT CAGTACTTAC AACTATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000 AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060 ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120 TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180 AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240 AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300 AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360 TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420 AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480 GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540 ACAGAAGAAG ATCAATTTAG GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600 GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660 ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720 TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780 AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840 TGTAATGAAA CTAAGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900 TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960 ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020 GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080 CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140 GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200 GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260 TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320 CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380 AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440 ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500 CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560 AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620 GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680 CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740 AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800 AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860 ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920 CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980 CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040 GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100 CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160 GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220 TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280 ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340 TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400 AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460 GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520 GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580 AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640 GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700 GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760 ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820 CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880 GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940 ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000 GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060 GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120 AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180 ACAATGGACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240 AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300 ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360 GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420 GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480 TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540 TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600 CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660 AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720 CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780 TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840 CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900 AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960 CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020 ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080 ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140 AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200 TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260 GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320 TTTAAGTTGT GGTTTTTAAA ACGCCTTAAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380 GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440 AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500 GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560 CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620 CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680 AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740 AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800 TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860 GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920 GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980 TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040 AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100 CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160 CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220 ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280 GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340 AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400 AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460 ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520 TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580 AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640 ATCGATGCAG ATATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700 ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760 ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820 CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880 TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940 GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000 TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060 TAGTTATTAA AAAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120 TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180 ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219 (2) SEQ ID NO: 30 information about: (I) SEQUENCE CHARACTERISTICS: ...
(A) length: 2166 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity is molecule type (ii): protein (xi) sequence description: SEQ ID NO:30:Met Asp Pro Ile Ile Asn Gly Asn Ser Ala Asn Val Tyr Leu Thr Asp1 5 10 15Ser Tyr Leu Lys Gly Val Ile Ser Phe Ser Glu Cys Asn Ala Leu Gly
20??????????????????25??????????????????30Ser?Tyr?Leu?Phe?Asn?Gly?Pro?Tyr?Leu?Lys?Asn?Asp?Tyr?Thr?Asn?Leu
35??????????????????40??????????????????45Ile?Ser?Arg?Gln?Ser?Pro?Leu?Leu?Glu?His?Met?Asn?Leu?Lys?Lys?Leu
50??????????????????55??????????????????60Thr?Ile?Thr?Gln?Ser?Leu?Ile?Ser?Arg?Tyr?His?Lys?Gly?Glu?Leu?Lys65??????????????????70??????????????????75??????????????????80Leu?Glu?Glu?Pro?Thr?Tyr?Phe?Gln?Ser?Leu?Leu?Met?Thr?Tyr?Lys?Ser
85??????????????????90??????????????????95Met?Ser?Ser?Ser?Glu?Gln?Ile?Ala?Thr?Thr?Asn?Leu?Leu?Lys?Lys?Ile
100?????????????????105?????????????????110Ile?Arg?Arg?Ala?Ile?Glu?Ile?Ser?Asp?Val?Lys?Val?Tyr?Ala?Ile?Leu
115?????????????????120?????????????????125Asn?Lys?Leu?Gly?Leu?Lys?Glu?Lys?Asp?Arg?Val?Lys?Pro?Asn?Asn?Asn
130?????????????????135?????????????????140Ser?Gly?Asp?Glu?Asn?Ser?Val?Leu?Thr?Thr?Ile?Ile?Lys?Asp?Asp?Ile145?????????????????150?????????????????155?????????????????160Leu?Ser?Ala?Val?Glu?Asn?Asn?Gln?Ser?Tyr?Thr?Asn?Ser?Asp?Lys?Ser
165?????????????????170?????????????????175His?Ser?Val?Asn?Gln?Asn?Ile?Thr?Ile?Lys?Thr?Thr?Leu?Leu?Lys?Lys
180?????????????????185?????????????????190Leu?Met?Cys?Ser?Met?Gln?His?Pro?Pro?Ser?Trp?Leu?Ile?His?Trp?Phe
195?????????????????200?????????????????205Asn?Leu?Tyr?Thr?Lys?Leu?Asn?Asn?Ile?Leu?Thr?Gln?Tyr?Arg?Ser?Asn
210?????????????????215?????????????????220Glu?Val?Lys?Ser?His?Gly?Phe?Ile?Leu?Ile?Asp?Asn?Gln?Thr?Leu?Ser225?????????????????230?????????????????235?????????????????240Gly?Phe?Gln?Phe?Ile?Leu?Asn?Gln?Tyr?Gly?Cys?Ile?Val?Tyr?His?Lys
245?????????????????250?????????????????255Gly?Leu?Lys?Lys?Ile?Thr?Thr?Thr?Thr?Tyr?Asn?Gln?Phe?Leu?Thr?Trp
260?????????????????265?????????????????270Lys?Asp?Ile?Ser?Leu?Ser?Arg?Leu?Asn?Val?Cys?Leu?Ile?Thr?Trp?Ile
275?????????????????280?????????????????285Ser?Asn?Cys?Leu?Asn?Thr?Leu?Asn?Lys?Ser?Leu?Gly?Leu?Arg?Cys?Gly
290?????????????????295?????????????????300Phe?Asn?Asn?Val?Val?Leu?Ser?Gln?Leu?Phe?Leu?Tyr?Gly?Asp?Cys?Ile305?????????????????310?????????????????315?????????????????320Leu?Lys?Leu?Phe?His?Asn?Glu?Gly?Phe?Tyr?Ile?Ile?Lys?Glu?Val?Glu
325?????????????????330?????????????????335Gly?Phe?Ile?Met?Ser?Leu?Ile?Leu?Asn?Ile?Thr?Glu?Glu?Asp?Gln?Phe
340?????????????????345?????????????????350Arg?Lys?Arg?Phe?Tyr?Asn?Ser?Met?Leu?Asn?Asn?Ile?Thr?Asp?Ala?Ala
355?????????????????360?????????????????365Ile?Lys?Ala?Gln?Lys?Asp?Leu?Leu?Ser?Arg?Val?Cys?His?Thr?Leu?Leu
370?????????????????375?????????????????380Asp?Lys?Thr?Val?Ser?Asp?Asn?Ile?Ile?Asn?Gly?Lys?Trp?Ile?Ile?Leu385?????????????????390?????????????????395?????????????????400Leu?Ser?Lys?Phe?Leu?Lys?Leu?Ile?Lys?Leu?Ala?Gly?Asp?Asn?Asn?Leu
405?????????????????410?????????????????415Asn?Asn?Leu?Ser?Glu?Leu?Tyr?Phe?Leu?Phe?Arg?Ile?Phe?Gly?His?Pro
420?????????????????425?????????????????430Met?Val?Asp?Glu?Arg?Gln?Ala?Met?Asp?Ser?Val?Arg?Ile?Asn?Cys?Asn
435?????????????????440?????????????????445Glu?Thr?Lys?Phe?Tyr?Leu?Leu?Ser?Ser?Leu?Ser?Thr?Leu?Arg?Gly?Ala
450?????????????????455?????????????????460Phe?Ile?Tyr?Arg?Ile?Ile?Lys?Gly?Phe?Val?Asn?Thr?Tyr?Asn?Arg?Trp465?????????????????470?????????????????475?????????????????480Pro?Thr?Leu?Arg?Asn?Ala?Ile?Val?Leu?Pro?Leu?Arg?Trp?Leu?Asn?Tyr
485?????????????????490?????????????????495Tyr?Lys?Leu?Asn?Thr?Tyr?Pro?Ser?Leu?Leu?Glu?Ile?Thr?Glu?Asn?Asp
500?????????????????505?????????????????510Leu?Ile?Ile?Leu?Ser?Gly?Leu?Arg?Phe?Tyr?Arg?Glu?Phe?His?Leu?Pro
515?????????????????520?????????????????525Lys?Lys?Val?Asp?Leu?Glu?Met?Ile?Ile?Asn?Asp?Lys?Ala?Ile?Ser?Pro
530?????????????????535?????????????????540Pro?Lys?Asp?Leu?Ile?Trp?Thr?Ser?Phe?Pro?Arg?Asn?Tyr?Met?Pro?Ser545?????????????????550?????????????????555?????????????????560His?Ile?Gln?Asn?Tyr?Ile?Glu?His?Glu?Lys?Leu?Lys?Phe?Ser?Glu?Ser
565?????????????????570?????????????????575Asp?Arg?Ser?Arg?Arg?Val?Leu?Glu?Tyr?Tyr?Leu?Arg?Asp?Asn?Lys?Phe
580?????????????????585?????????????????590Asn?Glu?Cys?Asp?Leu?Tyr?Asn?Cys?Val?Val?Asn?Gln?Ser?Tyr?Leu?Asn
595?????????????????600?????????????????605Asn?Ser?Asn?His?Val?Val?Ser?Leu?Thr?Gly?Lys?Glu?Arg?Glu?Leu?Ser
610?????????????????615?????????????????620Val?Gly?Arg?Met?Phe?Ala?Met?Gln?Pro?Gly?Met?Phe?Arg?Gln?Ile?Gln625?????????????????630?????????????????635?????????????????640Ile?Leu?Ala?Glu?Lys?Met?Ile?Ala?Glu?Asn?Ile?Leu?Gln?Phe?Phe?Pro
645?????????????????650?????????????????655Glu?Ser?Leu?Thr?Arg?Tyr?Gly?Asp?Leu?Glu?Leu?Gln?Lys?Ile?Leu?Glu
660?????????????????665?????????????????670Leu?Lys?Ala?Gly?Ile?Ser?Asn?Lys?Ser?Asn?Arg?Tyr?Asn?Asp?Asn?Tyr
675?????????????????680?????????????????685Asn?Asn?Tyr?Ile?Ser?Lys?Cys?Ser?Ile?Ile?Thr?Asp?Leu?Ser?Lys?Phe
690?????????????????695?????????????????700Asn?Gln?Ala?Phe?Arg?Tyr?Glu?Thr?Ser?Cys?Ile?Cys?Ser?Asp?Val?Leu705?????????????????710?????????????????715?????????????????720Asp?Glu?Leu?His?Gly?Val?Gln?Ser?Leu?Phe?Ser?Trp?Leu?His?Leu?Thr
725?????????????????730?????????????????735Ile?Pro?Leu?Val?Thr?Ile?Ile?Cys?Thr?Tyr?Arg?His?Ala?Pro?Pro?Phe
740?????????????????745?????????????????750Ile?Lys?Asp?His?Val?Val?Asn?Leu?Asn?Glu?Val?Asp?Glu?Gln?Ser?Gly
755?????????????????760?????????????????765Leu?Tyr?Arg?Tyr?His?Met?Gly?Gly?Ile?Glu?Gly?Trp?Cys?Gln?Lys?Leu
770?????????????????775?????????????????780Trp?Thr?Ile?Glu?Ala?Ile?Ser?Leu?Leu?Asp?Leu?Ile?Ser?Leu?Lys?Gly785?????????????????790?????????????????795?????????????????800Lys?Phe?Ser?Ile?Thr?Ala?Leu?Ile?Asn?Gly?Asp?Asn?Gln?Ser?Ile?Asp
805?????????????????810?????????????????815Ile?Ser?Lys?Pro?Val?Arg?Leu?Ile?Glu?Gly?Gln?Thr?His?Ala?Gln?Ala
820?????????????????825?????????????????830Asp?Tyr?Leu?Leu?Ala?Leu?Asn?Ser?Leu?Lys?Leu?Leu?Tyr?Lys?Glu?Tyr
835?????????????????840?????????????????845Ala?Gly?Ile?Gly?His?Lys?Leu?Lys?Gly?Thr?Glu?Thr?Tyr?Ile?Ser?Arg
850?????????????????855?????????????????860Asp?Met?Gln?Phe?Met?Ser?Lys?Thr?Ile?Gln?His?Asn?Gly?Val?Tyr?Tyr865?????????????????870?????????????????875?????????????????880Pro?Ala?Ser?Ile?Lys?Lys?Val?Leu?Arg?Val?Gly?Pro?Trp?Ile?Asn?Thr
885?????????????????890?????????????????895Ile?Leu?Asp?Asp?Phe?Lys?Val?Ser?Leu?Glu?Ser?Ile?Gly?Ser?Leu?Thr
900?????????????????905?????????????????910Gln?Glu?Leu?Glu?Tyr?Arg?Gly?Glu?Ser?Leu?Leu?Cys?Ser?Leu?Ile?Phe
915?????????????????920?????????????????925Arg?Asn?Ile?Trp?Leu?Tyr?Asn?Gln?Ile?Ala?Leu?Gln?Leu?Arg?Asn?His
930?????????????????935?????????????????940Ala?Leu?Cys?Asn?Asn?Lys?Leu?Tyr?Leu?Asp?Ile?Leu?Lys?Val?Leu?Lys945?????????????????950?????????????????955?????????????????960His?Leu?Lys?Thr?Phe?Phe?Asn?Leu?Asp?Ser?Ile?Asp?Met?Ala?Leu?Ser
965?????????????????970?????????????????975Leu?Tyr?Met?Asn?Leu?Pro?Met?Leu?Phe?Gly?Gly?Gly?Asp?Pro?Asn?Leu
980?????????????????985?????????????????990Leu?Tyr?Arg?Ser?Phe?Tyr?Arg?Arg?Thr?Pro?Asp?Phe?Leu?Thr?Glu?Ala
995?????????????????1000????????????????1005Ile?Val?His?Ser?Val?Phe?Val?Leu?Ser?Tyr?Tyr?Thr?Gly?His?Asp?Leu
1010????????????????1015????????????????1020Gln?Asp?Lys?Leu?Gln?Asp?Leu?Pro?Asp?Asp?Arg?Leu?Asn?Lys?Phe?Leu1025????????????????1030????????????????1035????????????????1040Thr?Cys?Val?Ile?Thr?Phe?Asp?Lys?Asn?Pro?Asn?Ala?Glu?Phe?Val?Thr
1045????????????????1050????????????????1055Leu?Met?Arg?Asp?Pro?Gln?Ala?Leu?Gly?Ser?Glu?Arg?Gln?Ala?Lys?Ile
1060????????????????1065????????????????1070Thr?Ser?Glu?Ile?Asn?Arg?Leu?Ala?Val?Thr?Glu?Val?Leu?Ser?Ile?Ala
1075????????????????1080????????????????1085Pro?Asn?Lys?Ile?Phe?Ser?Lys?Ser?Ala?Gln?His?Tyr?Thr?Thr?Thr?Glu
1090????????????????1095????????????????1100Ile?Asp?Leu?Asn?Asp?Ile?Met?Gln?Asn?Ile?Glu?Pro?Thr?Tyr?Pro?His1105????????????????1110????????????????1115????????????????1120Gly?Leu?Arg?Val?Val?Tyr?Glu?Ser?Leu?Pro?Phe?Tyr?Lys?Ala?Glu?Lys
1125????????????????1130????????????????1135Ile?Val?Asn?Leu?Ile?Ser?Gly?Thr?Lys?Ser?Ile?Thr?Asn?Ile?Leu?Glu
1140????????????????1145????????????????1150Lys?Thr?Ser?Ala?Ile?Asp?Thr?Thr?Asp?Ile?Asn?Arg?Ala?Thr?Asp?Met
1155????????????????1160????????????????1165Met?Arg?Lys?Asn?Ile?Thr?Leu?Leu?Ile?Arg?Ile?Leu?Pro?Leu?Asp?Cys
1170????????????????1175????????????????1180Asn?Lys?Asp?Lys?Arg?Glu?Leu?Leu?Ser?Leu?Glu?Asn?Leu?Ser?Ile?Thr1185????????????????1190????????????????1195????????????????1200Glu?Leu?Ser?Lys?Tyr?Val?Arg?Glu?Arg?Ser?Trp?Ser?Leu?Ser?Asn?Ile
1205????????????????1210????????????????1215Val?Gly?Val?Thr?Ser?Pro?Ser?Ile?Met?Phe?Thr?Met?Asp?Ile?Lys?Tyr
1220????????????????1225????????????????1230Thr?Thr?Ser?Thr?Ile?Ala?Ser?Gly?Ile?Ile?Ile?Glu?Lys?Tyr?Asn?Val
1235????????????????1240????????????????1245Asn?Ser?Leu?Thr?Arg?Gly?Glu?Arg?Gly?Pro?Thr?Lys?Pro?Trp?Val?Gly
1250????????????????1255????????????????1260Ser?Ser?Thr?Gln?Glu?Lys?Lys?Thr?Met?Pro?Val?Tyr?Asn?Arg?Gln?Val1265????????????????1270????????????????1275????????????????1280Leu?Thr?Lys?Lys?Gln?Arg?Asp?Gln?Ile?Asp?Leu?Leu?Ala?Lys?Leu?Asp
1285????????????????1290????????????????1295Trp?Val?Tyr?Ala?Ser?Ile?Asp?Asn?Lys?Asp?Glu?Phe?Met?Glu?Glu?Leu
1300????????????????1305????????????????1310Ser?Thr?Gly?Thr?Leu?Gly?Leu?Ser?Tyr?Glu?Lys?Ala?Lys?Lys?Leu?Phe
1315????????????????1320????????????????1325Pro?Gln?Tyr?Leu?Ser?Val?Asn?Tyr?Leu?His?Arg?Leu?Thr?Val?Ser?Ser
1330????????????????1335????????????????1340Arg?Pro?Cys?Glu?Phe?Pro?Ala?Ser?Ile?Pro?Ala?Tyr?Arg?Thr?Thr?Asn1345????????????????1350????????????????1355????????????????1360Tyr?His?Phe?Asp?Thr?Ser?Pro?Ile?Asn?His?Val?Leu?Thr?Glu?Lys?Tyr
1365????????????????1370????????????????1375Gly?Asp?Glu?Asp?Ile?Asp?Ile?Val?Phe?Gln?Asn?Cys?Ile?Ser?Phe?Gly
1380????????????????1385????????????????1390Leu?Ser?Leu?Met?Ser?Val?Val?Glu?Gln?Phe?Thr?Asn?Ile?Cys?Pro?Asn
1395????????????????1400????????????????1405Arg?Ile?Ile?Leu?Ile?Pro?Lys?Leu?Asn?Glu?Ile?His?Leu?Met?Lys?Pro
1410????????????????1415????????????????1420Pro?Ile?Phe?Thr?Gly?Asp?Val?Asp?Ile?Ile?Lys?Leu?Lys?Gln?Val?Ile1425????????????????1430????????????????1435????????????????1440Gln?Lys?Gln?His?Met?Phe?Leu?Pro?Asp?Lys?Ile?Ser?Leu?Thr?Gln?Tyr
1445????????????????1450????????????????1455Val?Glu?Leu?Phe?Leu?Ser?Asn?Lys?Ala?Leu?Lys?Ser?Gly?Ser?His?Ile
1460????????????????1465????????????????1470Asn?Ser?Asn?Leu?Ile?Leu?Val?His?Lys?Met?Ser?Asp?Tyr?Phe?His?Asn
1475????????????????1480????????????????1485Ala?Tyr?Ile?Leu?Ser?Thr?Asn?Leu?Ala?Gly?His?Trp?Ile?Leu?Ile?Ile
1490????????????????1495????????????????1500Gln?Leu?Met?Lys?Asp?Ser?Lys?Gly?Ile?Phe?Glu?Lys?Asp?Trp?Gly?Glu1505????????????????1510????????????????1515????????????????1520Gly?Tyr?Ile?Thr?Asp?His?Met?Phe?Ile?Asn?Leu?Asn?Val?Phe?Phe?Asn
1525????????????????1530????????????????1535Ala?Tyr?Lys?Thr?Tyr?Leu?Leu?Cys?Phe?His?Lys?Gly?Tyr?Gly?Lys?Ala
1540????????????????1545????????????????1550Lys?Leu?Glu?Cys?Asp?Met?Asn?Thr?Ser?Asp?Leu?Leu?Cys?Val?Leu?Glu
1555????????????????1560????????????????1565Leu?Ile?Asp?Ser?Ser?Tyr?Trp?Lys?Ser?Met?Ser?Lys?Val?Phe?Leu?Glu
1570????????????????1575????????????????1580Gln?Lys?Val?Ile?Lys?Tyr?Ile?Val?Asn?Gln?Asp?Thr?Ser?Leu?Arg?Arg1585????????????????1590????????????????1595????????????????1600Ile?Lys?Gly?Cys?His?Ser?Phe?Lys?Leu?Trp?Phe?Leu?Lys?Arg?Leu?Asn
1605????????????????1610????????????????1615Asn?Ala?Lys?Phe?Thr?Val?Cys?Pro?Trp?Val?Val?Asn?Ile?Asp?Tyr?His
1620????????????????1625????????????????1630Pro?Thr?His?Met?Lys?Ala?Ile?Leu?Ser?Tyr?Ile?Asp?Leu?Val?Arg?Met
1635????????????????1640????????????????1645Gly?Leu?Ile?Asn?Val?Asp?Lys?Leu?Thr?Ile?Lys?Asn?Lys?Asn?Lys?Phe
1650????????????????1655????????????????1660Asn?Asp?Glu?Phe?Tyr?Thr?Ser?Asn?Leu?Phe?Tyr?Ile?Ser?Tyr?Asn?Phe1665????????????????1670????????????????1675????????????????1680Ser?Asp?Asn?Thr?His?Leu?Leu?Thr?Lys?Gln?Ile?Arg?Ile?Ala?Asn?Ser
1685????????????????1690????????????????1695Glu?Leu?Glu?Asp?Asn?Tyr?Asn?Lys?Leu?Tyr?His?Pro?Thr?Pro?Glu?Thr
1700????????????????1705????????????????1710Leu?Glu?Asn?Met?Ser?Leu?Ile?Pro?Val?Lys?Ser?Asn?Asn?Ser?Asn?Lys
1715????????????????1720????????????????1725Pro?Lys?Phe?Cys?Ile?Ser?Gly?Asn?Thr?Glu?Ser?Met?Met?Met?Ser?Thr
1730????????????????1735????????????????1740Phe?Ser?Ser?Lys?Met?His?Ile?Lys?Ser?Ser?Thr?Val?Thr?Thr?Arg?Phe1745????????????????1750????????????????1755????????????????1760Asn?Tyr?Ser?Lys?Gln?Asp?Leu?Tyr?Asn?Leu?Phe?Pro?Ile?Val?Val?Ile
1765????????????????1770????????????????1775Asp?Lys?Ile?Ile?Asp?His?Ser?Gly?Asn?Thr?Ala?Lys?Ser?Asn?Gln?Leu
1780????????????????1785????????????????1790Tyr?Thr?Thr?Thr?Ser?His?Gln?Thr?Ser?Leu?Val?Arg?Asn?Ser?Ala?Ser
1795????????????????1800????????????????1805Leu?Tyr?Cys?Met?Leu?Pro?Trp?His?His?Val?Asn?Arg?Phe?Asn?Phe?Val
1810????????????????1815????????????????1820Phe?Ser?Ser?Thr?Gly?Cys?Lys?Ile?Ser?Ile?Glu?Tyr?Ile?Leu?Lys?Asp1825????????????????1830????????????????1835????????????????1840Leu?Lys?Ile?Lys?Asp?Pro?Ser?Cys?Ile?Ala?Phe?Ile?Gly?Glu?Gly?Ala
1845????????????????1850????????????????1855Gly?Asn?Leu?Leu?Leu?Arg?Thr?Val?Val?Glu?Leu?His?Pro?Asp?Ile?Arg
1860????????????????1865????????????????1870Tyr?Ile?Tyr?Arg?Ser?Leu?Lys?Asp?Cys?Asn?Asp?His?Ser?Leu?Pro?Ile
1875????????????????1880????????????????1885Glu?Phe?Leu?Arg?Leu?Tyr?Asn?Gly?His?Ile?Asn?Ile?Asp?Tyr?Gly?Glu
1890????????????????1895????????????????1900Asn?Leu?Thr?Ile?Pro?Ala?Thr?Asp?Ala?Thr?Asn?Asn?Ile?His?Trp?Ser1905????????????????1910????????????????1915????????????????1920Tyr?Leu?His?Ile?Lys?Phe?Ala?Glu?Pro?Ile?Ser?Ile?Phe?Val?Cys?Asp
1925????????????????1930????????????????1935Ala?Glu?Leu?Pro?Val?Thr?Ala?Asn?Trp?Ser?Lys?Ile?Ile?Ile?Glu?Trp
1940????????????????1945????????????????1950Ser?Lys?His?Val?Arg?Lys?Cys?Lys?Tyr?Cys?Ser?Ser?Val?Asn?Arg?Cys
1955????????????????1960????????????????1965
Ile?Leu?Ile?Ala?Lys?Tyr?His?Ala?Gln?Asp?Asp?Ile?Asp?Phe?Lys?Leu
1970????????????????1975????????????????1980
Asp?Asn?Ile?Thr?Ile?Leu?Lys?Thr?Tyr?Val?Cys?Leu?Gly?Ser?Lys?Leu
1985????????????????1990????????????????1995????????????????2000
Lys?Gly?Ser?Glu?Val?Tyr?Leu?Ile?Leu?Thr?Ile?Gly?Pro?Ala?Asn?Ile
2005????????????????2010????????????????2015
Leu?Pro?Val?Phe?Asp?Val?Val?Gln?Asn?Ala?Lys?Leu?Ile?Leu?Ser?Arg
2020????????????????2025????????????????2030
Thr?Lys?Asn?Phe?Ile?Met?Pro?Lys?Lys?Thr?Asp?Lys?Glu?Ser?Ile?Asp
2035????????????????2040????????????????2045
Ala?Asp?Ile?Lys?Ser?Leu?Ile?Pro?Phe?Leu?Cys?Tyr?Pro?Ile?Thr?Lys
2050????????????????2055????????????????2060
Lys?Gly?Ile?Lys?Thr?Ser?Leu?Ser?Lys?Leu?Lys?Ser?Val?Val?Asn?Gly
2065????????????????2070????????????????2075????????????????2080
Asp?Ile?Leu?Ser?Tyr?Ser?Ile?Ala?Gly?Arg?Asn?Glu?Val?Phe?Ser?Asn
2085????????????????2090????????????????2095
Lys?Leu?Ile?Asn?His?Lys?His?Met?Asn?Ile?Leu?Lys?Trp?Leu?Asp?His
2100????????????????2105????????????????2110
Val?Leu?Asn?Phe?Arg?Ser?Ala?Glu?Leu?Asn?Tyr?Asn?His?Leu?Tyr?Met
2115????????????????2120????????????????2125
Ile?Glu?Ser?Thr?Tyr?Pro?Tyr?Leu?Ser?Glu?Leu?Leu?Asn?Ser?Leu?Thr
2130????????????????2135????????????????2140
Thr?Asn?Glu?Leu?Lys?Lys?Leu?Ile?Lys?Ile?Thr?Gly?Ser?Val?Leu?Tyr
2145????????????????2150????????????????2155????????????????2160
Asn?Leu?Pro?Asn?Glu?Gln
The information of 2165 (2) SEQ ID NO:31:
(i) sequence signature:
(A) length: 15219 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60 TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120 ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180 ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240 ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300 GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360 TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420 AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480 CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540 TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600 AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660 GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720 AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780 TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840 AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900 CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960 AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020 CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080 CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140 TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200 GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260 AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320 TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380 AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440 GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500 CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560 AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620 TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680 CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740 TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800 ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860 GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920 GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980 TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040 ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100 CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160 CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220 GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280 AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340 AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400 CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460 TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520 TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580 CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640 TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700 CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760 TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820 ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880 AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940 TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000 AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060 TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120 TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180 CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240 AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300 ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360 AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420 AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480 GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540 TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600 AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660 CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720 GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780 TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840 AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900 CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960 GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020 ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080 ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140 CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCCGCTA GACCTAGAGT GCGAATAGGC 4200 AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260 CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320 ACAATAGAAC TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380 CTAATCTTTT TACTAATTAT AATCACTATC ATGATTGCAA CACTAAATAA GCTAAGTGAA 4440 CACAAAGCAT TCTGCAACAA AACTCTTGAA CTAGGACAGA TGTACCAAAT CAACACACAG 4500 AGTTCCACCA TTATGCTGTG TCAAACCATA ATCCTGTATA TACAAACAAA CAAATCCAAT 4560 CCTCTCACAG AGTCACGGTG TCGCAAAACC ACGCTAACCA TCATGGTAGC ATAGAGTAGT 4620 TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680 AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740 GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800 ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860 GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920 ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980 AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040 ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100 ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160 AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220 CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280 ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340 GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400 ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460 ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520 ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580 TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640 AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700 TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760 ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820 TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880 TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940 GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000 TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060 ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120 ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180 AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240 TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300 CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360 CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420 CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480 GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540 AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600 GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660 AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720 GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780 ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840 AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900 GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960 ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020 CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080 AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140 AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200 TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260 CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320 CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380 ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440 CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500 TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560 ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620 AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680 AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740 AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800 GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860 ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920 ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980 AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040 CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100 ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160 CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220 CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280 TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340 TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400 TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460 TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520 GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580 GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640 AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700 ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760 TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820 AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880 ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940 GATGAAAACT CAGTACTTAC AACCATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000 AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060 ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120 TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180 AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240 AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300 AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360 TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420 AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480 GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540 ACAGAAGAAG ATCAATTTAA GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600 GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660 ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720 TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780 AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840 TGTAATGAAA CTAAGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900 TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960 ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020 GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080 CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140 GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200 GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260 TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320 CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380 AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440 ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500 CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560 AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620 GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680 CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740 AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800 AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860 ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920 CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980 CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040 GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100 CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160 GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220 TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280 ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340 TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400 AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460 GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520 GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580 AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640 GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700 GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760 ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820 CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880 GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940 ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000 GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060 GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120 AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180 ACAATGAACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240 AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300 ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360 GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420 GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480 TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540 TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600 CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660 AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720 CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780 TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840 CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900 AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960 CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020 ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080 ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140 AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200 TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260 GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320 TTTAAGTTGT GGTTTTTAAA ACGCCTTAAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380 GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440 AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500 GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560 CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620 CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680 AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740 AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800 TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860 GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920 GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980 TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040 AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100 CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160 CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220 ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280 GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340 AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400 AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460 ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520 TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580 AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640 ATCGATGCAA ATATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700 ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760 ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820 CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880 TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940 GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000 TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060 TAGTTATTAA AGAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120 TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180 ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219 (2) SEQ ID NO: 32 information about: ...
(i) sequence signature:
(A) length: 2166 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:32:
Met?Asp?Pro?Ile?Ile?Asn?Gly?Asn?Ser?Ala?Asn?Val?Tyr?Leu?Thr?Asp
1???????????????5???????????????????10??????????????????15
Ser?Tyr?Leu?Lys?Gly?Val?Ile?Ser?Phe?Ser?Glu?Cys?Asn?Ala?Leu?Gly
20??????????????????25??????????????????30
Ser?Tyr?Leu?Phe?Asn?Gly?Pro?Tyr?Leu?Lys?Asn?Asp?Tyr?Thr?Asn?Leu
35??????????????????40??????????????????45
Ile?Ser?Arg?Gln?Ser?Pro?Leu?Leu?Glu?His?Met?Asn?Leu?Lys?Lys?Leu
50??????????????????55??????????????????60
Thr?Ile?Thr?Gln?Ser?Leu?Ile?Ser?Arg?Tyr?His?Lys?Gly?Glu?Leu?Lys
65??????????????????70??????????????????75??????????????????80
Leu?Glu?Glu?Pro?Thr?Tyr?Phe?Gln?Ser?Leu?Leu?Met?Thr?Tyr?Lys?Ser
85??????????????????90??????????????????95
Met?Ser?Ser?Ser?Glu?Gln?Ile?Ala?Thr?Thr?Asn?Leu?Leu?Lys?Lys?Ile
100?????????????????105?????????????????110Ile?Arg?Arg?Ala?Ile?Glu?Ile?Ser?Asp?Val?Lys?Val?Tyr?Ala?Ile?Leu
115?????????????????120?????????????????125Asn?Lys?Leu?Gly?Leu?Lys?Glu?Lys?Asp?Arg?Val?Lys?Pro?Asn?Asn?Asn
130?????????????????135?????????????????140Ser?Gly?Asp?Glu?Asn?Ser?Val?Leu?Thr?Thr?Ile?Ile?Lys?Asp?Asp?Ile145?????????????????150?????????????????155?????????????????160Leu?Ser?Ala?Val?Glu?Asn?Asn?Gln?Ser?Tyr?Thr?Asn?Ser?Asp?Lys?Ser
165?????????????????170?????????????????175His?Ser?Val?Asn?Gln?Asn?Ile?Thr?Ile?Lys?Thr?Thr?Leu?Leu?Lys?Lys
180?????????????????185?????????????????190Leu?Met?Cys?Ser?Met?Gln?His?Pro?Pro?Ser?Trp?Leu?Ile?His?Trp?Phe
195?????????????????200?????????????????205Asn?Leu?Tyr?Thr?Lys?Leu?Asn?Asn?Ile?Leu?Thr?Gln?Tyr?Arg?Ser?Asn
210?????????????????215?????????????????220Glu?Val?Lys?Ser?His?Gly?Phe?Ile?Leu?Ile?Asp?Asn?Gln?Thr?Leu?Ser225?????????????????230?????????????????235?????????????????240Gly?Phe?Gln?Phe?Ile?Leu?Asn?Gln?Tyr?Gly?Cys?Ile?Val?Tyr?His?Lys
245?????????????????250?????????????????255Gly?Leu?Lys?Lys?Ile?Thr?Thr?Thr?Thr?Tyr?Asn?Gln?Phe?Leu?Thr?Trp
260?????????????????265?????????????????270Lys?Asp?Ile?Ser?Leu?Ser?Arg?Leu?Asn?Val?Cys?Leu?Ile?Thr?Trp?Ile
275?????????????????280?????????????????285Ser?Asn?Cys?Leu?Asn?Thr?Leu?Asn?Lys?Ser?Leu?Gly?Leu?Arg?Cys?Gly
290?????????????????295?????????????????300Phe?Asn?Asn?Val?Val?Leu?Ser?Gln?Leu?Phe?Leu?Tyr?Gly?Asp?Cys?Ile305?????????????????310?????????????????315?????????????????320Leu?Lys?Leu?Phe?His?Asn?Glu?Gly?Phe?Tyr?Ile?Ile?Lys?Glu?Val?Glu
325?????????????????330?????????????????335Gly?Phe?Ile?Met?Ser?Leu?Ile?Leu?Asn?Ile?Thr?Glu?Glu?Asp?Gln?Phe
340?????????????????345?????????????????350Lys?Lys?Arg?Phe?Tyr?Asn?Ser?Met?Leu?Asn?Asn?Ile?Thr?Asp?Ala?Ala
355?????????????????360?????????????????365Ile?Lys?Ala?Gln?Lys?Asp?Leu?Leu?Ser?Arg?Val?Cys?His?Thr?Leu?Leu
370?????????????????375?????????????????380Asp?Lys?Thr?Val?Ser?Asp?Asn?Ile?Ile?Asn?Gly?Lys?Trp?Ile?Ile?Leu385?????????????????390?????????????????395?????????????????400Leu?Ser?Lys?Phe?Leu?Lys?Leu?Ile?Lys?Leu?Ala?Gly?Asp?Asn?Asn?Leu
405?????????????????410?????????????????415Asn?Asn?Leu?Ser?Glu?Leu?Tyr?Phe?Leu?Phe?Arg?Ile?Phe?Gly?His?Pro
420?????????????????425?????????????????430Met?Val?Asp?Glu?Arg?Gln?Ala?Met?Asp?Ser?Val?Arg?Ile?Asn?Cys?Asn
435?????????????????440?????????????????445Glu?Thr?Lys?Phe?Tyr?Leu?Leu?Ser?Ser?Leu?Ser?Thr?Leu?Arg?Gly?Ala
450?????????????????455?????????????????460Phe?Ile?Tyr?Arg?Ile?Ile?Lys?Gly?Phe?Val?Asn?Thr?Tyr?Asn?Arg?Trp465?????????????????470?????????????????475?????????????????480Pro?Thr?Leu?Arg?Asn?Ala?Ile?Val?Leu?Pro?Leu?Arg?Trp?Leu?Asn?Tyr
485?????????????????490?????????????????495Tyr?Lys?Leu?Asn?Thr?Tyr?Pro?Ser?Leu?Leu?Glu?Ile?Thr?Glu?Asn?Asp
500?????????????????505?????????????????510Leu?Ile?Ile?Leu?Ser?Gly?Leu?Arg?Phe?Tyr?Arg?Glu?Phe?His?Leu?Pro
515?????????????????520?????????????????525Lys?Lys?Val?Asp?Leu?Glu?Met?Ile?Ile?Asn?Asp?Lys?Ala?Ile?Ser?Pro
530?????????????????535?????????????????540Pro?Lys?Asp?Leu?Ile?Trp?Thr?Ser?Phe?Pro?Arg?Asn?Tyr?Met?Pro?Ser545?????????????????550?????????????????555?????????????????560His?Ile?Gln?Asn?Tyr?Ile?Glu?His?Glu?Lys?Leu?Lys?Phe?Ser?Glu?Ser
565?????????????????570?????????????????575Asp?Arg?Ser?Arg?Arg?Val?Leu?Glu?Tyr?Tyr?Leu?Arg?Asp?Asn?Lys?Phe
580?????????????????585?????????????????590Asn?Glu?Cys?Asp?Leu?Tyr?Asn?Cys?Val?Val?Asn?Gln?Ser?Tyr?Leu?Asn
595?????????????????600?????????????????605Asn?Ser?Asn?His?Val?Val?Ser?Leu?Thr?Gly?Lys?Glu?Arg?Glu?Leu?Ser
610?????????????????615?????????????????620Val?Gly?Arg?Met?Phe?Ala?Met?Gln?Pro?Gly?Met?Phe?Arg?Gln?Ile?Gln625?????????????????630?????????????????635?????????????????640Ile?Leu?Ala?Glu?Lys?Met?Ile?Ala?Glu?Asn?Ile?Leu?Gln?Phe?Phe?Pro
645?????????????????650?????????????????655Glu?Ser?Leu?Thr?Arg?Tyr?Gly?Asp?Leu?Glu?Leu?Gln?Lys?Ile?Leu?Glu
660?????????????????665?????????????????670Leu?Lys?Ala?Gly?Ile?Ser?Asn?Lys?Ser?Asn?Arg?Tyr?Asn?Asp?Asn?Tyr
675?????????????????680?????????????????685Asn?Asn?Tyr?Ile?Ser?Lys?Cys?Ser?Ile?Ile?Thr?Asp?Leu?Ser?Lys?Phe
690?????????????????695?????????????????700Asn?Gln?Ala?Phe?Arg?Tyr?Glu?Thr?Ser?Cys?Ile?Cys?Ser?Asp?Val?Leu705?????????????????710?????????????????715?????????????????720Asp?Glu?Leu?His?Gly?Val?Gln?Ser?Leu?Phe?Ser?Trp?Leu?His?Leu?Thr
725?????????????????730?????????????????735Ile?Pro?Leu?Val?Thr?Ile?Ile?Cys?Thr?Tyr?Arg?His?Ala?Pro?Pro?Phe
740?????????????????745?????????????????750Ile?Lys?Asp?His?Val?Val?Asn?Leu?Asn?Glu?Val?Asp?Glu?Gln?Ser?Gly
755?????????????????760?????????????????765Leu?Tyr?Arg?Tyr?His?Met?Gly?Gly?Ile?Glu?Gly?Trp?Cys?Gln?Lys?Leu
770?????????????????775?????????????????780Trp?Thr?Ile?Glu?Ala?Ile?Ser?Leu?Leu?Asp?Leu?Ile?Ser?Leu?Lys?Gly785?????????????????790?????????????????795?????????????????800Lys?Phe?Ser?Ile?Thr?Ala?Leu?Ile?Asn?Gly?Asp?Asn?Gln?Ser?Ile?Asp
805?????????????????810?????????????????815Ile?Ser?Lys?Pro?Val?Arg?Leu?Ile?Glu?Gly?Gln?Thr?His?Ala?Gln?Ala
820?????????????????825?????????????????830Asp?Tyr?Leu?Leu?Ala?Leu?Asn?Ser?Leu?Lys?Leu?Leu?Tyr?Lys?Glu?Tyr
835?????????????????840?????????????????845Ala?Gly?Ile?Gly?His?Lys?Leu?Lys?Gly?Thr?Glu?Thr?Tyr?Ile?Ser?Arg
850?????????????????855?????????????????860Asp?Met?Gln?Phe?Met?Ser?Lys?Thr?Ile?Gln?His?Asn?Gly?Val?Tyr?Tyr865?????????????????870?????????????????875?????????????????880Pro?Ala?Ser?Ile?Lys?Lys?Val?Leu?Arg?Val?Gly?Pro?Trp?Ile?Asn?Thr
885?????????????????890?????????????????895Ile?Leu?Asp?Asp?Phe?Lys?Val?Ser?Leu?Glu?Ser?Ile?Gly?Ser?Leu?Thr
900?????????????????905?????????????????910Gln?Glu?Leu?Glu?Tyr?Arg?Gly?Glu?Ser?Leu?Leu?Cys?Ser?Leu?Ile?Phe
915?????????????????920?????????????????925Arg?Asn?Ile?Trp?Leu?Tyr?Asn?Gln?Ile?Ala?Leu?Gln?Leu?Arg?Asn?His
930?????????????????935?????????????????940Ala?Leu?Cys?Asn?Asn?Lys?Leu?Tyr?Leu?Asp?Ile?Leu?Lys?Val?Leu?Lys945?????????????????950?????????????????955?????????????????960His?Leu?Lys?Thr?Phe?Phe?Asn?Leu?Asp?Ser?Ile?Asp?Met?Ala?Leu?Ser
965?????????????????970?????????????????975Leu?Tyr?Met?Asn?Leu?Pro?Met?Leu?Phe?Gly?Gly?Gly?Asp?Pro?Asn?Leu
980?????????????????985?????????????????990Leu?Tyr?Arg?Ser?Phe?Tyr?Arg?Arg?Thr?Pro?Asp?Phe?Leu?Thr?Glu?Ala
995?????????????????1000????????????????1005Ile?Val?His?Ser?Val?Phe?Val?Leu?Ser?Tyr?Tyr?Thr?Gly?His?Asp?Leu
1010????????????????1015????????????????1020Gln?Asp?Lys?Leu?Gln?Asp?Leu?Pro?Asp?Asp?Arg?Leu?Asn?Lys?Phe?Leu1025????????????????1030????????????????1035????????????????1040Thr?Cys?Val?Ile?Thr?Phe?Asp?Lys?Asn?Pro?Asn?Ala?Glu?Phe?Val?Thr
1045????????????????1050????????????????1055Leu?Met?Arg?Asp?Pro?Gln?Ala?Leu?Gly?Ser?Glu?Arg?Gln?Ala?Lys?Ile
1060????????????????1065????????????????1070Thr?Ser?Glu?Ile?Asn?Arg?Leu?Ala?Val?Thr?Glu?Val?Leu?Ser?Ile?Ala
1075????????????????1080????????????????1085Pro?Asn?Lys?Ile?Phe?Ser?Lys?Ser?Ala?Gln?His?Tyr?Thr?Thr?Thr?Glu
1090????????????????1095????????????????1100Ile?Asp?Leu?Asn?Asp?Ile?Met?Gln?Asn?Ile?Glu?Pro?Thr?Tyr?Pro?His1105????????????????1110????????????????1115????????????????1120Gly?Leu?Arg?Val?Val?Tyr?Glu?Ser?Leu?Pro?Phe?Tyr?Lys?Ala?Glu?Lys
1125????????????????1130????????????????1135Ile?Val?Asn?Leu?Ile?Ser?Gly?Thr?Lys?Ser?Ile?Thr?Asn?Ile?Leu?Glu
1140????????????????1145????????????????1150Lys?Thr?Ser?Ala?Ile?Asp?Thr?Thr?Asp?Ile?Asn?Arg?Ala?Thr?Asp?Met
1155????????????????1160????????????????1165Met?Arg?Lys?Asn?Ile?Thr?Leu?Leu?Ile?Arg?Ile?Leu?Pro?Leu?Asp?Cys
1170????????????????1175????????????????1180Asn?Lys?Asp?Lys?Arg?Glu?Leu?Leu?Ser?Leu?Glu?Asn?Leu?Ser?Ile?Thr1185????????????????1190????????????????1195????????????????I200Glu?Leu?Ser?Lys?Tyr?Val?Arg?Glu?Arg?Ser?Trp?Ser?Leu?Ser?Asn?Ile
1205????????????????1210????????????????1215Val?Gly?Val?Thr?Ser?Pro?Ser?Ile?Met?Phe?Thr?Met?Asn?Ile?Lys?Tyr
1220????????????????1225????????????????1230Thr?Thr?Ser?Thr?Ile?Ala?Ser?Gly?Ile?Ile?Ile?Glu?Lys?Tyr?Asn?Val
1235????????????????1240????????????????1245Asn?Ser?Leu?Thr?Arg?Gly?Glu?Arg?Gly?Pro?Thr?Lys?Pro?Trp?Val?Gly
1250????????????????1255????????????????1260Ser?Ser?Thr?Gln?Glu?Lys?Lys?Thr?Met?Pro?Val?Tyr?Asn?Arg?Gln?Val1265????????????????1270????????????????1275????????????????1280Leu?Thr?Lys?Lys?Gln?Arg?Asp?Gln?Ile?Asp?Leu?Leu?Ala?Lys?Leu?Asp
1285????????????????1290????????????????1295Trp?Val?Tyr?Ala?Ser?Ile?Asp?Asn?Lys?Asp?Glu?Phe?Met?Glu?Glu?Leu
1300????????????????1305????????????????1310Ser?Thr?Gly?Thr?Leu?Gly?Leu?Ser?Tyr?Glu?Lys?Ala?Lys?Lys?Leu?Phe
1315????????????????1320????????????????1325Pro?Gln?Tyr?Leu?Ser?Val?Asn?Tyr?Leu?His?Arg?Leu?Thr?Val?Ser?Ser
1330????????????????1335????????????????1340Arg?Pro?Cys?Glu?Phe?Pro?Ala?Ser?Ile?Pro?Ala?Tyr?Arg?Thr?Thr?Asn1345????????????????1350????????????????1355????????????????1360Tyr?His?Phe?Asp?Thr?Ser?Pro?Ile?Asn?His?Val?Leu?Thr?Glu?Lys?Tyr
1365????????????????1370????????????????1375Gly?Asp?Glu?Asp?Ile?Asp?Ile?Val?Phe?Gln?Asn?Cys?Ile?Ser?Phe?Gly
1380????????????????1385????????????????1390Leu?Ser?Leu?Met?Ser?Val?Val?Glu?Gln?Phe?Thr?Asn?Ile?Cys?Pro?Asn
1395????????????????1400????????????????1405Arg?Ile?Ile?Leu?Ile?Pro?Lys?Leu?Asn?Glu?Ile?His?Leu?Met?Lys?Pro
1410????????????????1415????????????????1420Pro?Ile?Phe?Thr?Gly?Asp?Val?Asp?Ile?Ile?Lys?Leu?Lys?Gln?Val?Ile1425????????????????1430????????????????1435????????????????1440Gln?Lys?Gln?His?Met?Phe?Leu?Pro?Asp?Lys?Ile?Ser?Leu?Thr?Gln?Tyr
1445????????????????1450????????????????1455Val?Glu?Leu?Phe?Leu?Ser?Asn?Lys?Ala?Leu?Lys?Ser?Gly?Ser?His?Ile
1460????????????????1465????????????????1470Asn?Ser?Asn?Leu?Ile?Leu?Val?His?Lys?Met?Ser?Asp?Tyr?Phe?His?Asn
1475????????????????1480????????????????1485Ala?Tyr?Ile?Leu?Ser?Thr?Asn?Leu?Ala?Gly?His?Trp?Ile?Leu?Ile?Ile
1490????????????????1495????????????????1500Gln?Leu?Met?Lys?Asp?Ser?Lys?Gly?Ile?Phe?Glu?Lys?Asp?Trp?Gly?Glu1505????????????????1510????????????????1515????????????????1520Gly?Tyr?Ile?Thr?Asp?His?Met?Phe?Ile?Asn?Leu?Asn?Val?Phe?Phe?Asn
1525????????????????1530????????????????1535Ala?Tyr?Lys?Thr?Tyr?Leu?Leu?Cys?Phe?His?Lys?Gly?Tyr?Gly?Lys?Ala
1540????????????????1545????????????????1550Lys?Leu?Glu?Cys?Asp?Met?Asn?Thr?Ser?Asp?Leu?Leu?Cys?Val?Leu?Glu
1555????????????????1560????????????????1565Leu?Ile?Asp?Ser?Ser?Tyr?Trp?Lys?Ser?Met?Ser?Lys?Val?Phe?Leu?Glu
1570????????????????1575????????????????1580Gln?Lys?Val?Ile?Lys?Tyr?Ile?Val?Asn?Gln?Asp?Thr?Ser?Leu?Arg?Arg1585????????????????1590????????????????1595????????????????1600Ile?Lys?Gly?Cys?His?Ser?Phe?Lys?Leu?Trp?Phe?Leu?Lys?Arg?Leu?Asn
1605????????????????1610????????????????1615Asn?Ala?Lys?Phe?Thr?Val?Cys?Pro?Trp?Val?Val?Asn?Ile?Asp?Tyr?His
1620????????????????1625????????????????1630Pro?Thr?His?Met?Lys?Ala?Ile?Leu?Ser?Tyr?Ile?Asp?Leu?Val?Arg?Met
1635????????????????1640????????????????1645Gly?Leu?Ile?Asn?Val?Asp?Lys?Leu?Thr?Ile?Lys?Asn?Lys?Asn?Lys?Phe
1650????????????????1655????????????????1660Asn?Asp?Glu?Phe?Tyr?Thr?Ser?Asn?Leu?Phe?Tyr?Ile?Ser?Tyr?Asn?Phe1665????????????????1670????????????????1675????????????????1680Ser?Asp?Asn?Thr?His?Leu?Leu?Thr?Lys?Gln?Ile?Arg?Ile?Ala?Asn?Ser
1685????????????????1690????????????????1695Glu?Leu?Glu?Asp?Asn?Tyr?Asn?Lys?Leu?Tyr?His?Pro?Thr?Pro?Glu?Thr
1700????????????????1705????????????????1710Leu?Glu?Asn?Met?Ser?Leu?Ile?Pro?Val?Lys?Ser?Asn?Asn?Ser?Asn?Lys
1715????????????????1720????????????????1725Pro?Lys?Phe?Cys?Ile?Ser?Gly?Asn?Thr?Glu?Ser?Met?Met?Met?Ser?Thr
1730????????????????1735????????????????1740Phe?Ser?Ser?Lys?Met?His?Ile?Lys?Ser?Ser?Thr?Val?Thr?Thr?Arg?Phe1745????????????????1750????????????????1755????????????????1760Asn?Tyr?Ser?Lys?Gln?Asp?Leu?Tyr?Asn?Leu?Phe?Pro?Ile?Val?Val?Ile
1765????????????????1770????????????????1775Asp?Lys?Ile?Ile?Asp?His?Ser?Gly?Asn?Thr?Ala?Lys?Ser?Asn?Gln?Leu
1780????????????????1785????????????????1790Tyr?Thr?Thr?Thr?Ser?His?Gln?Thr?Ser?Leu?Val?Arg?Asn?Ser?Ala?Ser
1795????????????????1800????????????????1805Leu?Tyr?Cys?Met?Leu?Pro?Trp?His?His?Val?Asn?Arg?Phe?Asn?Phe?Val
1810????????????????1815????????????????1820Phe?Ser?Ser?Thr?Gly?Cys?Lys?Ile?Ser?Ile?Glu?Tyr?Ile?Leu?Lys?Asp1825????????????????1830????????????????1835????????????????1840Leu?Lys?Ile?Lys?Asp?Pro?Ser?Cys?Ile?Ala?Phe?Ile?Gly?Glu?Gly?Ala
1845????????????????1850????????????????1855Gly?Asn?Leu?Leu?Leu?Arg?Thr?Val?Val?Glu?Leu?His?Pro?Asp?Ile?Arg
1860????????????????1865????????????????1870Tyr?Ile?Tyr?Arg?Ser?Leu?Lys?Asp?Cys?Asn?Asp?His?Ser?Leu?Pro?Ile
1875????????????????1880????????????????1885Glu?Phe?Leu?Arg?Leu?Tyr?Asn?Gly?His?Ile?Asn?Ile?Asp?Tyr?Gly?Glu
1890????????????????1895????????????????1900Asn?Leu?Thr?Ile?Pro?Ala?Thr?Asp?Ala?Thr?Asn?Asn?Ile?His?Trp?Ser1905????????????????1910????????????????1915????????????????1920Tyr?Leu?His?Ile?Lys?Phe?Ala?Glu?Pro?Ile?Ser?Ile?Phe?Val?Cys?Asp
1925????????????????1930????????????????1935Ala?Glu?Leu?Pro?Val?Thr?Ala?Asn?Trp?Ser?Lys?Ile?Ile?Ile?Glu?Trp
1940????????????????1945????????????????1950Ser?Lys?His?Val?Arg?Lys?Cys?Lys?Tyr?Cys?Ser?Ser?Val?Asn?Arg?Cys
1955????????????????1960????????????????1965Ile?Leu?Ile?Ala?Lys?Tyr?His?Ala?Gln?Asp?Asp?Ile?Asp?Phe?Lys?Leu
1970????????????????1975????????????????1980Asp?Asn?Ile?Thr?Ile?Leu?Lys?Thr?Tyr?Val?Cys?Leu?Gly?Ser?Lys?Leu1985????????????????1990????????????????1995????????????????2000Lys?Gly?Ser?Glu?Val?Tyr?Leu?Ile?Leu?Thr?Ile?Gly?Pro?Ala?Asn?Ile
2005????????????????2010????????????????2015Leu?Pro?Val?Phe?Asp?Val?Val?Gln?Asn?Ala?Lys?Leu?Ile?Leu?Ser?Arg
2020????????????????2025????????????????2030Thr?Lys?Asn?Phe?Ile?Met?Pro?Lys?Lys?Thr?Asp?Lys?Glu?Ser?Ile?Asp
2035????????????????2040????????????????2045Ala?Asn?Ile?Lys?Ser?Leu?Ile?Pro?Phe?Leu?Cys?Tyr?Pro?Ile?Thr?Lys
2050????????????????2055????????????????2060Lys?Gly?Ile?Lys?Thr?Ser?Leu?Ser?Lys?Leu?Lys?Ser?Val?Val?Asn?Gly2065????????????????2070????????????????2075????????????????2080Asp?Ile?Leu?Ser?Tyr?Ser?Ile?Ala?Gly?Arg?Asn?Glu?Val?Phe?Ser?Asn
2085????????????????2090????????????????2095Lys?Leu?Ile?Asn?His?Lys?His?Met?Asn?Ile?Leu?Lys?Trp?Leu?Asp?His
2100????????????????2105????????????????2110Val?Leu?Asn?Phe?Arg?Ser?Ala?Glu?Leu?Asn?Tyr?Asn?His?Leu?Tyr?Met
2115????????????????2120????????????????2125Ile?Glu?Ser?Thr?Tyr?Pro?Tyr?Leu?Ser?Glu?Leu?Leu?Asn?Ser?Leu?Thr
2130????????????????2135????????????????2140Thr?Asn?Glu?Leu?Lys?Lys?Leu?Ile?Lys?Ile?Thr?Gly?Ser?Val?Leu?Tyr
2145????????????????2150????????????????2155????????????????2160
Asn?Leu?Pro?Asn?Glu?Gln
The information of 2165 (2) SEQ ID NO:33:
(i) sequence signature:
(A) length: 15219 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: ACGGGAAAAA AATGCGTACT ACAAACTTGC ACATTCGAAA AAAATGGGGC AAATAAGAAC 60 TTGATAAGTG CTATTTAAGT CTAACCTTTT CAATCAGAAA TGGGGTGCAA TTCACTGAGC 120 ATGATAAAGG TTAGATTACA AAATTTATTT GACAATGACG AAGTAGCATT GTTAAAAATA 180 ACATGTTATA CTGATAAATT AATTCTTCTG ACCAATGCAT TAGCCAAAGC AGCAATACAT 240 ACAATTAAAT TAAACGGCAT AGTTTTTATA CATGTTATAA CAAGCAGTGA AGTGTGCCCT 300 GATAACAATA TTGTAGTGAA ATCTAACTTT ACAACAATGC CAATACTACA AAATGGAGGA 360 TACATATGGG AATTGATTGA GTTGACACAC TGCTCTCAAT TAAACGGTTT AATGGATGAT 420 AATTGTGAAA TCAAATTTTC TAAAAGACTA AGTGACTCAG TAATGACTAA TTATATGAAT 480 CAAATATCTG ACTTACTTGG GCTTGATCTC AATTCATGAA TTATGTTTAG TCTAATTCAA 540 TAGACATGTG TTTATTACCA TTTTAGTTAA TATAAAAACT CATCAAAGGG AAATGGGGCA 600 AATAAACTCA CCTAATCAAT CAAACCATGA GCACTACAAA TGACAACACT ACTATGCAAA 660 GATTGATGAT CACAGACATG AGACCCCTGT CAATGGATTC AATAATAACA TCTCTTACCA 720 AAGAAATCAT CACACACAAA TTCATATACT TGATAAACAA TGAATGTATT GTAAGAAAAC 780 TTGATGAAAG ACAAGCTACA TTTACATTCT TAGTCAATTA TGAGATGAAG CTACTGCACA 840 AAGTAGGGAG TACCAAATAC AAAAAATACA CTGAATATAA TACAAAATAT GGCACTTTCC 900 CCATGCCTAT ATTTATCAAT CACGGCGGGT TTCTAGAATG TATTGGCATT AAGCCTACAA 960 AACACACTCC TATAATATAC AAATATGACC TCAACCCGTG AATTCCAACA AAAAAACCAA 1020 CCCAACCAAA CCAAACTATT CCTCAAACAA CAGTGCTCAA TAGTTAAGAA GGAGCTAATC 1080 CATTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA AATACAAAGA 1140 TGGCTCTTAG CAAAGTCAAG TTGAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200 GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260 AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAAAT 1320 TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380 AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440 GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500 CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAGTC CTACAAAAAA ATGCTAAAAG 1560 AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620 TGTGTATAGC TGCACTTGTG ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680 CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAACGAAAT AAAACGATAC AAGGGCCTCA 1740 TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800 ATGTTTTCGT GCACTTTGGC ATTGCACAAT CATCCACAAG AGGGGGTAGT AGAGTTGAAG 1860 GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920 GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGACA TGCTAGTGTC CAGGCAGAAA 1980 TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040 ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCCAACTTCT 2100 CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACAC 2160 CAAGAAACCA GGATCTTTAT GATGCAGCTA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220 GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGCATC 2280 AACTCAACCC CAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340 AGTCAACATG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400 CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460 TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACT AAAGAGAGCC CGATAACATC 2520 TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CAAAAGCCAA 2580 CTACCCAAGA AAACCCCTAG TAAGCTTCAA AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640 TTCTAAGTTG TACAAGGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700 CTCATATGAA GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760 TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820 ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTAAGAG AAGAGATGAT 2880 AGAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGCAAGACT 2940 TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGT CTCTTAATCC 3000 AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTATCACT 3060 TGATGATTTT TGATCAGCGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAACAGACA 3120 TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAACG TCCATCAGTA GAACCACCAA 3180 CCAATCAATC AACCAATTGA TCAATCAGCA ACCCGACAAA ATTAACAATA TAGTAACAAA 3240 AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300 ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360 AATATGGGTG CCTATGTTCC AGTCATCTGT GCCAGCAGAC TTGCTCATAA AAGAACTTGC 3420 AAGCATCAAT ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480 GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540 TGTATCATTA GATGAAAGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600 AGCATGCAGT CTAACATGCT TAAAAGTAAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660 CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720 GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGATCA ATTAGTGTCA AGAACAAGGA 3780 TCTGAACTCA CTAGAAAATA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCAAA 3840 AATTATTCCT TATGCAGGAT TAGTGTTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900 CAAATATATC AAACCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960 GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020 ACTAGAGGAT TAAACTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080 ACACACTATA TCCAAACATC ATAAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140 CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCTGCTA GACCTAGAGT GCGAATAGGT 4200 AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTTAACAAC 4260 CATTTATACC GCCAATTCAA CACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320 ACAATAGAAT TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380 CTAATCTTTT TACTAATTAT AATCACTATT ATGATTGCAA TACTAAATAA GCTAAGTGAA 4440 CATAAAGCAT TCTGTAACAA AACTCTTGAA CTAGGACAGA TGTATCAAAT CAACACATAG 4500 AGTTCTACCA TTATGCTGTG TCAAATTATA ATCCTGTATA TATAAACAAA CAAATCCAAT 4560 CTTCTCACAG AGTCATGGTG TCGCAAAACC ACGCTAACTA TCATGGTAGC ATAGAGTAGT 4620 TATTTAAAAA TTAACATAAT GATGAATTGT TAGTATGAGA TCAAAAACAA CATTGGGGCA 4680 AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740 GATACTCTTA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800 ATAGCACAAA TAGCACTATC AGTTTTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860 GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920 ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC CTACTCAAGT CTCACCAGAA 4980 AGGGTTAGTT CATCCAAGCA ACCCACAACC ACATCACCAA TCCACACAAG TTCAGCTACA 5040 ACATCACCCA ATACAAAATC AGAAACACAC CATACAACAG CACAAACCAA AGGCAGAACC 5100 ACCACTTCAA CACAGACCAA CAAGCCAAGC ACAAAACCAC GTCCAAAAAA TCCACCAAAA 5160 AAAGATGATT ACCATTTTGA AGTGTTCAAC TTCGTTCCCT GCAGTATATG TGGCAACAAT 5220 CAACTTTGCA AATCCATCTG CAAAACAATA CCAAGCAACA AACCAAAGAA GAAACCAACC 5280 ATCAAACCCA CAAACAAACC AACCACCAAA ACCACAAACA AAAGAGACCC AAAAACACCA 5340 GCCAAAACGA CGAAAAAAGA AACTACCACC AACCCAACAA AAAAACTAAC CCTCAAGACC 5400 ACAGAAAGAG ACACCAGCAC CTCACAATCC ACTGCACTCG ACACAACCAC ATTAAAACAC 5460 ACAGTCCAAC AGCAATCCCT CCTCTCAACC ACCCCCGAAA ACACACCCAA CTCCACACAA 5520 ACACCCACAG CATCCGAGCC CTCCACACCA AACTCCACCC AAAAAACCCA GCCACATGCT 5580 TAGTTATTCA AAAACTACAT CTTAGCAGAG AACCGTGATC TATCAAGCAA GAACGAAATT 5640 AAACCTGGGG CAAATAACCA TGGAGTTGAT GATCCACAAG TCAAGTGCAA TCTTCCTAAC 5700 TCTTGCTATT AATGCATTGT ACCTCACCTC AAGTCAGAAC ATAACTGAGG AGTTTTACCA 5760 ATCGACATGT AGTGCAGTTA GCAGAGGTTA TTTTAGTGCT TTAAGAACAG GTTGGTATAC 5820 TAGTGTCATA ACAATAGAAT TAAGTAATAT AAAAGAAACC AAATGCAATG GAACTGACAC 5880 TAAAGTAAAA CTTATGAAAC AAGAATTAGA TAAGTATAAG AATGCAGTAA CAGAATTACA 5940 GCTACTTATG CAAAACACAC CAGCTGTCAA CAACCGGGCC AGAAGAGAAG CACCACAGTA 6000 TATGAACTAC ACAATCAATA CCACTAAAAA CCTAAATGTA TCAATAAGCA AGAAGAGGAA 6060 ACGAAGATTT CTAGGCTTCT TGTTAGGTGT GGGATCTGCA ATAGCAAGTG GTATAGCTGT 6120 ATCAAAAGTT CTACACCTTG AAGGAGAAGT GAACAAGATC AAAAATGCTT TGTTGTCTAC 6180 AAACAAAGCT GTAGTCAGTT TATCAAATGG GGTCAGTGTT TTAACCAGCA AAGTGTTAGA 6240 TCTCAAGAAT TACATAAATA ACCAATTATT ACCCATAGTA AATCAACAGA GCTGTCGCAT 6300 CTCCAACATT GAAACAGTTA TAGAATTCCA GCAGAAGAAC AGCAGATTGT TGGAAATCAC 6360 CAGAGAATTT AGTGTCAATG CAGGTGTAAC AACACCTTTA AGCACTTACA TGTTGACAAA 6420 CAGTGAGTTA CTATCATTAA TCAATGATAT GCCTATAACA AATGATCAGA AAAAATTAAT 6480 GTCAAGCAAT GTTCAGATAG TAAGGCAACA AAGTTATTCC ATCATGTCTA TAATAAAGGA 6540 AGAAGTCCTT GCATATGTTG TACAGCTGCC TATCTATGGT GTAATAGATA CACCTTGCTG 6600 GAAATTGCAC ACATCGCCTC TATGCACTAC CAACATCAAA GAAGGATCAA ATATTTGTTT 6660 AACAAGGACT GATAGAGGAT GGTATTGTGA TAATGCAGGA TCAGTATCCT TCTTTCCACA 6720 GGCTGACACT TGTAAAGTAC AGTCCAATCG AGTATTTTGT GACACTATGA ACAGTTTGAC 6780 ATTACCAAGT GAAGTCAGCC TTTGTAACAC TGACATATTC AATTCCAAGT ATGACTGCAA 6840 AATTATGACA TCAAAAACAG ACATAAGCAG CTCAGTAATT ACTTCTCTTG GAGCTATAGT 6900 GTCATGCTAT GGTAAAACTA AATGCACTGC ATCCAACAAA AATCGTGGGA TTATAAAGAC 6960 ATTTTCTAAT GGTTGTGACT ATGTGTCAAA CAAAGGAGTA GATACTGTGT CAGTGGGCAA 7020 CACTTTATAC TATGTAAACA AGCTGGAAGG CAAGAACCTT TATGTAAAAG GGGAACCTAT 7080 AATAAATTAC TATGACCCTC TAGTGTTTCC TTCTGATGAG TTTGATGCAT CAATATCTCA 7140 AGTCAATGAA AAAATCAATC AAAGTTTAGC TTTTATTCGT AGATCTGATG AATTACTACA 7200 TAATGTAAAT ACTGGCAAAT CTACTACAAA TATTATGATA ACTACAATTA TTATAGTAAT 7260 CATTGTAGTA TTGTTATCAT TAATAGCTAT TGGTTTACTG TTGTATTGTA AAGCCAAAAA 7320 CACACCAGTT ACACTAAGCA AAGACCAACT AAGTGGAATC AATAATATTG CATTCAGCAA 7380 ATAGACAAAA AACCACCTGA TCATGTTTCA ACAACAATCT GCTGACCACC AATCCCAAAT 7440 CAACTTACAA CAAATATTTC AACATCACAG TACAGGCTGA ATCATTTCCT CACATCATGC 7500 TACCCACATA ACTAAGCTAG ATCCTTAACT TATAGTTACA TAAAAACCTC AAGTATCACA 7560 ATCAACCACT AAATCAACAC ATCATTCACA AAATTAACAG CTGGGGCAAA TATGTCGCGA 7620 AGAAATCCTT GTAAATTTGA GATTAGAGGT CATTGCTTGA ATGGTAGAAG ATGTCACTAC 7680 AGTCATAATT ACTTTGAATG GCCTCCTCAT GCATTACTAG TGAGGCAAAA CTTCATGTTA 7740 AACAAGATAC TCAAGTCAAT GGACAAAAGC ATAGACACTT TGTCTGAAAT AAGTGGAGCT 7800 GCTGAACTGG ATAGAACAGA AGAATATGCT CTTGGTATAG TTGGAGTGCT AGAGAGTTAC 7860 ATAGGATCTA TAAACAACAT AACAAAACAA TCAGCATGTG TTGCTATGAG TAAACTTCTT 7920 ATTGAGATCA ATAGTGATGA CATTAAAAAG CTTAGAGATA ATGAAGAACC CAATTCACCT 7980 AAGATAAGAG TGTACAATAC TGTTATATCA TACATTGAGA GCAATAGAAA AAACAACAAG 8040 CAAACCATCC ATCTGCTCAA GAGACTACCA GCAGACGTGC TGAAGAAGAC AATAAAGAAC 8100 ACATTAGATA TCCACAAAAG CATAACCATA AGCAATCCAA AAGAGTCAAC TGTGAATGAT 8160 CAAAATGACC AAACCAAAAA TAATGATATT ACCGGATAAA TATCCTTGTA GTATATCATC 8220 CATATTGATC TCAAGTGAAA GCATGGTTGC TACATTCAAT CATAAAAACA TATTACAATT 8280 TAACCATAAC TATTTGGATA ACCACCAGCG TTTATTAAAT CATATATTTG ATGAAATTCA 8340 TTGGACACCT AAAAACTTAT TAGATGCCAC TCAACAATTT CTCCAACATC TTAACATCCC 8400 TGAAGATATA TATACAGTAT ATATATTAGT GTCATAATGC TTGACCATAA CGACTCTATG 8460 TCATCCAACC ATAAAACTAT TTTGATAAGG TTATGGGACA AAATGGATCC CATTATTAAT 8520 GGAAACTCTG CTAATGTGTA TCTAACTGAT AGTTATTTAA AAGGTGTTAT CTCTTTTTCA 8580 GAGTGTAATG CTTTAGGGAG TTATCTTTTT AACGGCCCTT ATCTTAAAAA TGATTACACC 8640 AACTTAATTA GTAGACAAAG CCCACTACTA GAGCATATGA ATCTTAAAAA ACTAACTATA 8700 ACACAGTCAT TAATATCTAG ATATCATAAA GGTGAACTGA AATTAGAAGA ACCAACTTAT 8760 TTCCAGTCAT TACTTATGAC ATATAAAAGT ATGTCCTCGT CTGAACAAAT TGCTACAACT 8820 AACTTACTTA AAAAAATAAT ACGAAGAGCC ATAGAAATAA GTGATGTAAA GGTGTACGCC 8880 ATCTTGAATA AACTAGGATT AAAGGAAAAG GACAGAGTTA AGCCCAACAA TAATTCAGGT 8940 GATGAAAACT CAGTACTTAC AACTATAATT AAAGATGATA TACTTTCGGC TGTGGAAAAC 9000 AATCAATCAT ATACAAATTC AGACAAAAGT CACTCAGTAA ATCAAAATAT CACTATCAAA 9060 ACAACACTCT TGAAAAAATT GATGTGTTCA ATGCAACATC CTCCATCATG GTTAATACAC 9120 TGGTTCAATT TATATACAAA ATTAAATAAC ATATTAACAC AATATCGATC AAATGAGGTA 9180 AAAAGTCATG GGTTTATATT AATAGATAAT CAAACTTTAA GTGGTTTTCA GTTTATTTTA 9240 AATCAATATG GTTGTATCGT TTATCATAAA GGACTCAAAA AAATCACAAC TACTACTTAC 9300 AATCAATTTT TGACATGGAA AGACATCAGC CTTAGCAGAT TAAATGTTTG CTTAATTACT 9360 TGGATAAGTA ATTGTTTAAA TACATTAAAC AAAAGCTTAG GGCTGAGATG TGGATTCAAT 9420 AATGTTGTGT TATCACAATT ATTTCTTTAT GGAGATTGTA TACTGAAATT ATTTCATAAT 9480 GAAGGCTTCT ACATAATAAA AGAAGTAGAG GGATTTATTA TGTCTTTAAT TCTAAACATA 9540 ACAGAAGAAG ATCAATTTAG GAAACGATTT TATAATAGCA TGCTAAATAA CATCACAGAT 9600 GCAGCTATTA AGGCTCAAAA GGACCTACTA TCAAGAGTAT GTCACACTTT ATTAGACAAG 9660 ACAGTGTCTG ATAATATCAT AAATGGTAAA TGGATAATCC TATTAAGTAA ATTTCTTAAA 9720 TTGATTAAGC TTGCAGGTGA TAATAATCTC AATAACTTGA GTGAGCTATA TTTTCTCTTC 9780 AGAATCTTTG GACATCCAAT GGTCGATGAA AGACAAGCAA TGGATTCTGT AAGAATTAAC 9840 TGTAATGAAA CTAAGTTCTA CTTATTAAGT AGTCTAAGTA CATTAAGAGG TGCTTTCATT 9900 TATAGAATCA TAAAAGGGTT TGTAAATACC TACAACAGAT GGCCCACCTT AAGGAATGCT 9960 ATTGTCCTAC CTCTAAGATG GTTAAACTAC TATAAACTTA ATACTTATCC ATCTCTACTT 10020 GAAATCACAG AAAATGATTT GATTATTTTA TCAGGATTGC GGTTCTATCG TGAGTTTCAT 10080 CTGCCTAAAA AAGTGGATCT TGAAATGATA ATAAATGACA AAGCCATTTC ACCTCCAAAA 10140 GATCTAATAT GGACTAGTTT TCCTAGAAAT TACATGCCAT CACATATACA AAATTATATA 10200 GAACATGAAA AGTTGAAGTT CTCTGAAAGC GACAGATCGA GAAGAGTACT AGAGTATTAC 10260 TTGAGAGATA ATAAATTCAA TGAATGCGAT CTATACAATT GTGTAGTCAA TCAAAGCTAT 10320 CTCAACAACT CTAATCACGT GGTATCACTA ACTGGTAAAG AAAGAGAGCT CAGTGTAGGT 10380 AGAATGTTTG CTATGCAACC AGGTATGTTT AGGCAAATCC AAATCTTAGC AGAGAAAATG 10440 ATAGCTGAAA ATATTTTACA ATTCTTCCCT GAGAGTTTGA CAAGATATGG TGATCTAGAG 10500 CTTCAAAAGA TATTAGAATT AAAAGCAGGA ATAAGCAACA AGTCAAATCG TTATAATGAT 10560 AACTACAACA ATTATATCAG TAAATGTTCT ATCATTACAG ATCTTAGCAA ATTCAATCAG 10620 GCATTTAGAT ATGAAACATC ATGTATCTGC AGTGATGTAT TAGATGAACT GCATGGAGTA 10680 CAATCTCTGT TCTCTTGGTT GCATTTAACA ATACCTCTTG TCACAATAAT ATGTACATAT 10740 AGACATGCAC CTCCTTTCAT AAAGGATCAT GTTGTTAATC TTAATGAGGT TGATGAACAA 10800 AGTGGATTAT ACAGATATCA TATGGGTGGT ATTGAGGGCT GGTGTCAAAA ACTGTGGACC 10860 ATTGAAGCTA TATCATTATT AGATCTAATA TCTCTCAAAG GGAAATTCTC TATCACAGCT 10920 CTGATAAATG GTGATAATCA GTCAATTGAT ATAAGCAAAC CAGTTAGACT TATAGAGGGT 10980 CAGACCCATG CACAAGCAGA TTATTTGTTA GCATTAAATA GCCTTAAATT GTTATATAAA 11040 GAGTATGCAG GTATAGGCCA TAAGCTTAAG GGAACAGAGA CCTATATATC CCGAGATATG 11100 CAGTTCATGA GCAAAACAAT CCAGCACAAT GGAGTGTACT ATCCAGCCAG TATCAAAAAA 11160 GTCCTGAGAG TAGGTCCATG GATAAACACG ATACTTGATG ATTTTAAAGT TAGTTTAGAA 11220 TCTATAGGCA GCTTAACACA GGAGTTAGAA TACAGAGGAG AAAGCTTATT ATGCAGTTTA 11280 ATATTTAGGA ACATTTGGTT ATACAATCAA ATTGCTTTGC AACTCCGAAA TCATGCATTA 11340 TGTAACAATA AGCTATATTT AGATATATTG AAAGTATTAA AACACTTAAA AACTTTTTTT 11400 AATCTTGATA GCATTGATAT GGCTTTATCA TTGTATATGA ATTTGCCTAT GCTGTTTGGT 11460 GGTGGTGATC CTAATTTGTT ATATCGAAGC TTTTATAGGA GAACTCCAGA CTTCCTTACA 11520 GAAGCTATAG TACATTCAGT GTTTGTGTTG AGCTATTATA CTGGTCACGA TTTACAAGAT 11580 AAGCTCCAGG ATCTTCCAGA TGATAGACTG AACAAATTCT TGACATGTGT CATCACATTT 11640 GATAAAAATC CCAATGCCGA GTTTGTAACA TTGATGAGGG ATCCACAGGC TTTAGGGTCT 11700 GAAAGGCAAG CTAAAATTAC TAGTGAGATT AATAGATTAG CAGTAACAGA AGTCTTAAGT 11760 ATAGCCCCAA ACAAAATATT TTCTAAAAGT GCACAACATT ATACTACCAC TGAGATTGAT 11820 CTAAATGACA TTATGCAAAA TATAGAACCA ACTTACCCTC ATGGATTAAG AGTTGTTTAT 11880 GAAAGTTTAC CTTTTTATAA AGCAGAAAAA ATAGTTAATC TTATATCAGG AACAAAATCC 11940 ATAACTAATA TACTTGAAAA AACATCAGCA ATAGATACAA CTGATATTAA TAGGGCTACT 12000 GATATGATGA GGAAAAATAT AACTTTACTT ATAAGGATAC TTCCACTAGA TTGTAACAAA 12060 GACAAAAGAG AGTTATTAAG TTTAGAAAAT CTTAGTATAA CTGAATTAAG CAAGTATGTA 12120 AGAGAAAGAT CTTGGTCATT ATCCAATATA GTAGGAGTAA CATCGCCAAG TATTATGTTC 12180 ACAATGGACA TTAAATATAC AACTAGCACT ATAGCCAGTG GTATAATAAT AGAAAAATAT 12240 AATGTTAATA GTTTAACTCG TGGTGAAAGA GGACCCACCA AGCCATGGGT AGGCTCATCC 12300 ACGCAGGAGA AAAAAACAAT GCCAGTGTAC AACAGACAAG TTTTAACCAA AAAGCAAAGA 12360 GACCAAATAG ATTTATTAGC AAAATTAGAC TGGGTATATG CATCCATAGA CAACAAAGAT 12420 GAATTCATGG AAGAACTGAG TACTGGAACA CTTGGACTGT CATATGAAAA AGCCAAAAAG 12480 TTGTTTCCAC AATATCTAAG TGTCAATTAT TTACACCGTT TAACAGTCAG TAGTAGACCA 12540 TGTGAATTCC CTGCATCAAT ACCAGCTTAT AGAACAACAA ATTATCATTT TGATACTAGT 12600 CCTATCAATC ATGTATTAAC AGAAAAGTAT GGAGATGAAG ATATCGACAT TGTGTTTCAA 12660 AATTGCATAA GTTTTGGTCT TAGCCTGATG TCGGTTGTGG AACAATTCAC AAACATATGT 12720 CCTAATAGAA TTATTCTCAT ACCGAAGCTG AATGAGATAC ATTTGATGAA ACCTCCTATA 12780 TTTACAGGAG ATGTTGATAT CATCAAGTTG AAGCAAGTGA TACAAAAGCA GCACATGTTC 12840 CTACCAGATA AAATAAGTTT AACCCAATAT GTAGAATTAT TCTTAAGTAA CAAAGCACTT 12900 AAATCTGGAT CTCACATCAA CTCTAATTTA ATATTAGTAC ATAAAATGTC TGATTATTTT 12960 CATAATGCTT ATATTTTAAG TACTAATTTA GCTGGACATT GGATTCTGAT TATTCAACTT 13020 ATGAAAGATT CAAAAGGTAT TTTTGAAAAA GATTGGGGAG AGGGGTACAT AACTGATCAT 13080 ATGTTCATTA ATTTGAATGT TTTCTTTAAT GCTTATAAGA CTTATTTGCT ATGTTTTCAT 13140 AAAGGTTATG GTAAAGCAAA ATTAGAATGT GATATGAACA CTTCAGATCT TCTTTGTGTT 13200 TTGGAGTTAA TAGACAGTAG CTACTGGAAA TCTATGTCTA AAGTTTTCCT AGAACAAAAA 13260 GTCATAAAAT ACATAGTCAA TCAAGACACA AGTTTGCGTA GAATAAAAGG CTGTCACAGT 13320 TTTAAGTTGT GGTTTTTAAA ACGCCTTGAT AATGCTAAAT TTACCGTATG CCCTTGGGTT 13380 GTTAACATAG ATTATCACCC AACACACATG AAAGCTATAT TATCTTACAT AGATTTAGTT 13440 AGAATGGGGT TAATAAATGT AGATAAATTA ACCATTAAAA ATAAAAACAA ATTCAATGAT 13500 GAATTTTACA CATCAAATCT CTTTTACATT AGTTATAACT TTTCAGACAA CACTCATTTG 13560 CTAACAAAAC AAATAAGAAT TGCTAATTCA GAATTAGAAG ATAATTATAA CAAACTATAT 13620 CACCCAACCC CAGAAACTTT AGAAAATATG TCATTAATTC CTGTTAAAAG TAATAATAGT 13680 AACAAACCTA AATTTTGTAT AAGTGGAAAT ACCGAATCTA TGATGATGTC AACATTCTCT 13740 AGTAAAATGC ATATTAAATC TTCCACTGTT ACCACAAGAT TCAATTATAG CAAACAAGAC 13800 TTGTACAATT TATTTCCAAT TGTTGTGATA GACAAGATTA TAGATCATTC AGGTAATACA 13860 GCAAAATCTA ACCAACTTTA CACCACCACT TCACATCAGA CATCTTTAGT AAGGAATAGT 13920 GCATCACTTT ATTGCATGCT TCCTTGGCAT CATGTCAATA GATTTAACTT TGTATTTAGT 13980 TCCACAGGAT GCAAGATCAG TATAGAGTAT ATTTTAAAAG ATCTTAAGAT TAAGGACCCC 14040 AGTTGTATAG CATTCATAGG TGAAGGAGCT GGTAACTTAT TATTACGTAC GGTAGTAGAA 14100 CTTCATCCAG ACATAAGATA CATTTACAGA AGTTTAAAAG ATTGCAATGA TCATAGTTTA 14160 CCTATTGAAT TTCTAAGGTT ATACAACGGG CATATAAACA TAGATTATGG TGAGAATTTA 14220 ACCATTCCTG CTACAGATGC AACTAATAAC ATTCATTGGT CTTATTTACA TATAAAATTT 14280 GCAGAACCTA TTAGCATCTT TGTCTGCGAT GCTGAATTAC CTGTTACAGC CAATTGGAGT 14340 AAAATTATAA TTGAATGGAG TAAGCATGTA AGAAAGTGCA AGTACTGTTC TTCTGTAAAT 14400 AGATGCATTT TAATTGCAAA ATATCATGCT CAAGATGACA TTGATTTCAA ATTAGATAAC 14460 ATTACTATAT TAAAAACTTA CGTGTGCCTA GGTAGCAAGT TAAAAGGATC TGAAGTTTAC 14520 TTAATCCTTA CAATAGGCCC TGCAAATATA CTTCCTGTTT TTGATGTTGT ACAAAATGCT 14580 AAATTGATAC TTTCAAGAAC TAAAAATTTC ATTATGCCTA AAAAAACTGA CAAGGAATCT 14640 ATCGATGCAG TTATTAAAAG CTTAATACCT TTCCTTTGTT ACCCTATAAC AAAAAAAGGA 14700 ATTAAGACTT CATTGTCAAA ATTGAAGAGT GTAGTTAATG GAGATATATT ATCATATTCT 14760 ATAGCTGGAC GTAATGAAGT ATTCAGCAAC AAGCTTATAA ACCACAAGCA TATGAATATC 14820 CTAAAATGGC TAGATCATGT TTTAAATTTT AGATCAGCTG AACTTAATTA CAATCATTTA 14880 TACATGATAG AGTCCACATA TCCTTACTTA AGTGAATTGT TAAATAGTTT AACAACCAAT 14940 GAGCTCAAGA AGCTGATTAA AATAACAGGT AGTGTGCTAT ACAACCTTCC CAACGAACAG 15000 TAGTTTAAAA TATCATTAAC AAGTTTGGTC AAATTTAGAT GCTAACACAT CATTATATTA 15060 TAGTTATTAA AAAATATACA AACTTTTCAA TAATTTAGCA TATTGATTCC AAAATTATCA 15120 TTTTAGTCTT AAGGGGTTAA ATAAAAGTCT AAAACTAACA ATTATACATG TGCATTCACA 15180 ACACAACGAG ACATTAGTTT TTGACACTTT TTTTCTCGT 15219 (2) SEQ ID NO: 34 information about: ...
(i) sequence signature:
(A) length: 2166 amino acid
(B) type: amino acid
(C) chain:
(D) topological framework: linearity
(ii) molecule type: protein
(xi) sequence description: SEQ ID NO:34:
Met?Asp?Pro?Ile?Ile?Asn?Gly?Asn?Ser?Ala?Asn?Val?Tyr?Leu?Thr?Asp
1???????????????5???????????????????10??????????????????15
Ser?Tyr?Leu?Lys?Gly?Val?Ile?Ser?Phe?Ser?Glu?Cys?Asn?Ala?Leu?Gly
20??????????????????25??????????????????30
Ser?Tyr?Leu?Phe?Asn?Gly?Pro?Tyr?Leu?Lys?Asn?Asp?Tyr?Thr?Asn?Leu
35??????????????????40??????????????????45Ile?Ser?Arg?Gln?Ser?Pro?Leu?Leu?Glu?His?Met?Asn?Leu?Lys?Lys?Leu
50??????????????????55??????????????????60Thr?Ile?Thr?Gln?Ser?Leu?Ile?Ser?Arg?Tyr?His?Lys?Gly?Glu?Leu?Lys65??????????????????70??????????????????75??????????????????80Leu?Glu?Glu?Pro?Thr?Tyr?Phe?Gln?Ser?Leu?Leu?Met?Thr?Tyr?Lys?Ser
85??????????????????90??????????????????95Met?Ser?Ser?Ser?Glu?Gln?Ile?Ala?Thr?Thr?Asn?Leu?Leu?Lys?Lys?Ile
100?????????????????105?????????????????110Ile?Arg?Arg?Ala?Ile?Glu?Ile?Ser?Asp?Val?Lys?Val?Tyr?Ala?Ile?Leu
115?????????????????120?????????????????125Asn?Lys?Leu?Gly?Leu?Lys?Glu?Lys?Asp?Arg?Val?Lys?Pro?Asn?Asn?Asn
130?????????????????135?????????????????140Ser?Gly?Asp?Glu?Asn?Ser?Val?Leu?Thr?Thr?Ile?Ile?Lys?Asp?Asp?Ile145?????????????????150?????????????????155?????????????????160Leu?Ser?Ala?Val?Glu?Asn?Asn?Gln?Ser?Tyr?Thr?Asn?Ser?Asp?Lys?Ser
165?????????????????170?????????????????175His?Ser?Val?Asn?Gln?Asn?Ile?Thr?Ile?Lys?Thr?Thr?Leu?Leu?Lys?Lys
180?????????????????185?????????????????190Leu?Met?Cys?Ser?Met?Gln?His?Pro?Pro?Ser?Trp?Leu?Ile?His?Trp?Phe
195?????????????????200?????????????????205Asn?Leu?Tyr?Thr?Lys?Leu?Asn?Asn?Ile?Leu?Thr?Gln?Tyr?Arg?Ser?Asn
210?????????????????215?????????????????220Glu?Val?Lys?Ser?His?Gly?Phe?Ile?Leu?Ile?Asp?Asn?Gln?Thr?Leu?Ser225?????????????????230?????????????????235?????????????????240Gly?Phe?Gln?Phe?Ile?Leu?Asn?Gln?Tyr?Gly?Cys?Ile?Val?Tyr?His?Lys
245?????????????????250?????????????????255Gly?Leu?Lys?Lys?Ile?Thr?Thr?Thr?Thr?Tyr?Asn?Gln?Phe?Leu?Thr?Trp
260?????????????????265?????????????????270Lys?Asp?Ile?Ser?Leu?Ser?Arg?Leu?Asn?Val?Cys?Leu?Ile?Thr?Trp?Ile
275?????????????????280?????????????????285Ser?Asn?Cys?Leu?Asn?Thr?Leu?Asn?Lys?Ser?Leu?Gly?Leu?Arg?Cys?Gly
290?????????????????295?????????????????300Phe?Asn?Asn?Val?Val?Leu?Ser?Gln?Leu?Phe?Leu?Tyr?Gly?Asp?Cys?Ile305?????????????????310?????????????????315?????????????????320Leu?Lys?Leu?Phe?His?Asn?Glu?Gly?Phe?Tyr?Ile?Ile?Lys?Glu?Val?Glu
325?????????????????330?????????????????335Gly?Phe?Ile?Met?Ser?Leu?Ile?Leu?Asn?Ile?Thr?Glu?Glu?Asp?Gln?Phe
340?????????????????345?????????????????350Arg?Lys?Arg?Phe?Tyr?Asn?Ser?Met?Leu?Asn?Asn?Ile?Thr?Asp?Ala?Ala
355?????????????????360?????????????????365Ile?Lys?Ala?Gln?Lys?Asp?Leu?Leu?Ser?Arg?Val?Cys?His?Thr?Leu?Leu
370?????????????????375?????????????????380Asp?Lys?Thr?Val?Ser?Asp?Asn?Ile?Ile?Asn?Gly?Lys?Trp?Ile?Ile?Leu385?????????????????390?????????????????395?????????????????400Leu?Ser?Lys?Phe?Leu?Lys?Leu?Ile?Lys?Leu?Ala?Gly?Asp?Asn?Asn?Leu
405?????????????????410?????????????????415Asn?Asn?Leu?Ser?Glu?Leu?Tyr?Phe?Leu?Phe?Arg?Ile?Phe?Gly?His?Pro
420?????????????????425?????????????????430Met?Val?Asp?Glu?Arg?Gln?Ala?Met?Asp?Ser?Val?Arg?Ile?Asn?Cys?Asn
435?????????????????440?????????????????445Glu?Thr?Lys?Phe?Tyr?Leu?Leu?Ser?Ser?Leu?Ser?Thr?Leu?Arg?Gly?Ala
450?????????????????455?????????????????460Phe?Ile?Tyr?Arg?Ile?Ile?Lys?Gly?Phe?Val?Asn?Thr?Tyr?Asn?Arg?Trp465?????????????????470?????????????????475?????????????????480Pro?Thr?Leu?Arg?Asn?Ala?Ile?Val?Leu?Pro?Leu?Arg?Trp?Leu?Asn?Tyr
485?????????????????490?????????????????495Tyr?Lys?Leu?Asn?Thr?Tyr?Pro?Ser?Leu?Leu?Glu?Ile?Thr?Glu?Asn?Asp
500?????????????????505?????????????????510Leu?Ile?Ile?Leu?Ser?Gly?Leu?Arg?Phe?Tyr?Arg?Glu?Phe?His?Leu?Pro
515?????????????????520?????????????????525Lys?Lys?Val?Asp?Leu?Glu?Met?Ile?Ile?Asn?Asp?Lys?Ala?Ile?Ser?Pro
530?????????????????535?????????????????540Pro?Lys?Asp?Leu?Ile?Trp?Thr?Ser?Phe?Pro?Arg?Asn?Tyr?Met?Pro?Ser545?????????????????550?????????????????555?????????????????560His?Ile?Gln?Asn?Tyr?Ile?Glu?His?Glu?Lys?Leu?Lys?Phe?Ser?Glu?Ser
565?????????????????570?????????????????575Asp?Arg?Ser?Arg?Arg?Val?Leu?Glu?Tyr?Tyr?Leu?Arg?Asp?Asn?Lys?Phe
580?????????????????585?????????????????590Asn?Glu?Cys?Asp?Leu?Tyr?Asn?Cys?Val?Val?Asn?Gln?Ser?Tyr?Leu?Asn
595?????????????????600?????????????????605Asn?Ser?Asn?His?Val?Val?Ser?Leu?Thr?Gly?Lys?Glu?Arg?Glu?Leu?Ser
610?????????????????615?????????????????620Val?Gly?Arg?Met?Phe?Ala?Met?Gln?Pro?Gly?Met?Phe?Arg?Gln?Ile?Gln625?????????????????630?????????????????635?????????????????640Ile?Leu?Ala?Glu?Lys?Met?Ile?Ala?Glu?Asn?Ile?Leu?Gln?Phe?Phe?Pro
645?????????????????650?????????????????655Glu?Ser?Leu?Thr?Arg?Tyr?Gly?Asp?Leu?Glu?Leu?Gln?Lys?Ile?Leu?Glu
660?????????????????665?????????????????670Leu?Lys?Ala?Gly?Ile?Ser?Asn?Lys?Ser?Asn?Arg?Tyr?Asn?Asp?Asn?Tyr
675?????????????????680?????????????????685Asn?Asn?Tyr?Ile?Ser?Lys?Cys?Ser?Ile?Ile?Thr?Asp?Leu?Ser?Lys?Phe
690?????????????????695?????????????????700Asn?Gln?Ala?Phe?Arg?Tyr?Glu?Thr?Ser?Cys?Ile?Cys?Ser?Asp?Val?Leu705?????????????????710?????????????????715?????????????????720Asp?Glu?Leu?His?Gly?Val?Gln?Ser?Leu?Phe?Ser?Trp?Leu?His?Leu?Thr
725?????????????????730?????????????????735Ile?Pro?Leu?Val?Thr?Ile?Ile?Cys?Thr?Tyr?Arg?His?Ala?Pro?Pro?Phe
740?????????????????745?????????????????750Ile?Lys?Asp?His?Val?Val?Asn?Leu?Asn?Glu?Val?Asp?Glu?Gln?Ser?Gly
755?????????????????760?????????????????765Leu?Tyr?Arg?Tyr?His?Met?Gly?Gly?Ile?Glu?Gly?Trp?Cys?Gln?Lys?Leu
770?????????????????775?????????????????780Trp?Thr?Ile?Glu?Ala?Ile?Ser?Leu?Leu?Asp?Leu?Ile?Ser?Leu?Lys?Gly785?????????????????790?????????????????795?????????????????800Lys?Phe?Ser?Ile?Thr?Ala?Leu?Ile?Asn?Gly?Asp?Asn?Gln?Ser?Ile?Asp
805?????????????????810?????????????????815Ile?Ser?Lys?Pro?Val?Arg?Leu?Ile?Glu?Gly?Gln?Thr?His?Ala?Gln?Ala
820?????????????????825?????????????????830Asp?Tyr?Leu?Leu?Ala?Leu?Asn?Ser?Leu?Lys?Leu?Leu?Tyr?Lys?Glu?Tyr
835?????????????????840?????????????????845Ala?Gly?Ile?Gly?His?Lys?Leu?Lys?Gly?Thr?Glu?Thr?Tyr?Ile?Ser?Arg
850?????????????????855?????????????????860Asp?Met?Gln?Phe?Met?Ser?Lys?Thr?Ile?Gln?His?Asn?Gly?Val?Tyr?Tyr865?????????????????870?????????????????875?????????????????880Pro?Ala?Ser?Ile?Lys?Lys?Val?Leu?Arg?Val?Gly?Pro?Trp?Ile?Asn?Thr
885?????????????????890?????????????????895Ile?Leu?Asp?Asp?Phe?Lys?Val?Ser?Leu?Glu?Ser?Ile?Gly?Ser?Leu?Thr
900?????????????????905?????????????????910Gln?Glu?Leu?Glu?Tyr?Arg?Gly?Glu?Ser?Leu?Leu?Cys?Ser?Leu?Ile?Phe
915?????????????????920?????????????????925Arg?Asn?Ile?Trp?Leu?Tyr?Asn?Gln?Ile?Ala?Leu?Gln?Leu?Arg?Asn?His
930?????????????????935?????????????????940Ala?Leu?Cys?Asn?Asn?Lys?Leu?Tyr?Leu?Asp?Ile?Leu?Lys?Val?Leu?Lys945?????????????????950?????????????????955?????????????????960His?Leu?Lys?Thr?Phe?Phe?Asn?Leu?Asp?Ser?Ile?Asp?Met?Ala?Leu?Ser
965?????????????????970?????????????????975Leu?Tyr?Met?Asn?Leu?Pro?Met?Leu?Phe?Gly?Gly?Gly?Asp?Pro?Asn?Leu
980?????????????????985?????????????????990Leu?Tyr?Arg?Ser?Phe?Tyr?Arg?Arg?Thr?Pro?Asp?Phe?Leu?Thr?Glu?Ala
995?????????????????1000????????????????1005Ile?Val?His?Ser?Val?Phe?Val?Leu?Ser?Tyr?Tyr?Thr?Gly?His?Asp?Leu
1010????????????????1015????????????????1020Gln?Asp?Lys?Leu?Gln?Asp?Leu?Pro?Asp?Asp?Arg?Leu?Asn?Lys?Phe?Leu1025????????????????1030????????????????1035????????????????1040Thr?Cys?Val?Ile?Thr?Phe?Asp?Lys?Asn?Pro?Asn?Ala?Glu?Phe?Val?Thr
1045????????????????1050????????????????1055Leu?Met?Arg?Asp?Pro?Gln?Ala?Leu?Gly?Ser?Glu?Arg?Gln?Ala?Lys?Ile
1060????????????????1065????????????????1070Thr?Ser?Glu?Ile?Asn?Arg?Leu?Ala?Val?Thr?Glu?Val?Leu?Ser?Ile?Ala
1075????????????????1080????????????????1085Pro?Asn?Lys?Ile?Phe?Ser?Lys?Ser?Ala?Gln?His?Tyr?Thr?Thr?Thr?Glu
1090????????????????1095????????????????1100Ile?Asp?Leu?Asn?Asp?Ile?Met?Gln?Asn?Ile?Glu?Pro?Thr?Tyr?Pro?His1105????????????????1110????????????????1115????????????????1120Gly?Leu?Arg?Val?Val?Tyr?Glu?Ser?Leu?Pro?Phe?Tyr?Lys?Ala?Glu?Lys
1125????????????????1130????????????????1135Ile?Val?Asn?Leu?Ile?Ser?Gly?Thr?Lys?Ser?Ile?Thr?Asn?Ile?Leu?Glu
1140????????????????1145????????????????1150Lys?Thr?Ser?Ala?Ile?Asp?Thr?Thr?Asp?Ile?Asn?Arg?Ala?Thr?Asp?Met
1155????????????????1160????????????????1165Met?Arg?Lys?Asn?Ile?Thr?Leu?Leu?Ile?Arg?Ile?Leu?Pro?Leu?Asp?Cys
1170????????????????1175????????????????1180Asn?Lys?Asp?Lys?Arg?Glu?Leu?Leu?Ser?Leu?Glu?Asn?Leu?Ser?Ile?Thr1185????????????????1190????????????????1195????????????????1200Glu?Leu?Ser?Lys?Tyr?Val?Arg?Glu?Arg?Ser?Trp?Ser?Leu?Ser?Asn?Ile
1205????????????????1210????????????????1215Val?Gly?Val?Thr?Ser?Pro?Ser?Ile?Met?Phe?Thr?Met?Asp?Ile?Lys?Tyr
1220????????????????1225????????????????1230Thr?Thr?Ser?Thr?Ile?Ala?Ser?Gly?Ile?Ile?Ile?Glu?Lys?Tyr?Asn?Val
1235????????????????1240????????????????1245Asn?Ser?Leu?Thr?Arg?Gly?Glu?Arg?Gly?Pro?Thr?Lys?Pro?Trp?Val?Gly
1250????????????????1255????????????????1260Ser?Ser?Thr?Gln?Glu?Lys?Lys?Thr?Met?Pro?Val?Tyr?Asn?Arg?Gln?Val1265????????????????1270????????????????1275????????????????1280Leu?Thr?Lys?Lys?Gln?Arg?Asp?Gln?Ile?Asp?Leu?Leu?Ala?Lys?Leu?Asp
1285????????????????1290????????????????1295Trp?Val?Tyr?Ala?Ser?Ile?Asp?Asn?Lys?Asp?Glu?Phe?Met?Glu?Glu?Leu
1300????????????????1305????????????????1310Ser?Thr?Gly?Thr?Leu?Gly?Leu?Ser?Tyr?Glu?Lys?Ala?Lys?Lys?Leu?Phe
1315????????????????1320????????????????1325Pro?Gln?Tyr?Leu?Ser?Val?Asn?Tyr?Leu?His?Arg?Leu?Thr?Val?Ser?Ser
1330????????????????1335????????????????1340Arg?Pro?Cys?Glu?Phe?Pro?Ala?Ser?Ile?Pro?Ala?Tyr?Arg?Thr?Thr?Asn1345????????????????1350????????????????1355????????????????1360Tyr?His?Phe?Asp?Thr?Ser?Pro?Ile?Asn?His?Val?Leu?Thr?Glu?Lys?Tyr
1365????????????????1370????????????????1375Gly?Asp?Glu?Asp?Ile?Asp?Ile?Val?Phe?Gln?Asn?Cys?Ile?Ser?Phe?Gly
1380????????????????1385????????????????1390Leu?Ser?Leu?Met?Ser?Val?Val?Glu?Gln?Phe?Thr?Asn?Ile?Cys?Pro?Asn
1395????????????????1400????????????????1405Arg?Ile?Ile?Leu?Ile?Pro?Lys?Leu?Asn?Glu?Ile?His?Leu?Met?Lys?Pro
1410????????????????1415????????????????1420Pro?Ile?Phe?Thr?Gly?Asp?Val?Asp?Ile?Ile?Lys?Leu?Lys?Gln?Val?Ile1425????????????????1430????????????????1435????????????????1440Gln?Lys?Gln?His?Met?Phe?Leu?Pro?Asp?Lys?Ile?Ser?Leu?Thr?Gln?Tyr
1445????????????????1450????????????????1455Val?Glu?Leu?Phe?Leu?Ser?Asn?Lys?Ala?Leu?Lys?Ser?Gly?Ser?His?Ile
1460????????????????1465????????????????1470Asn?Ser?Asn?Leu?Ile?Leu?Val?His?Lys?Met?Ser?Asp?Tyr?Phe?His?Asn
1475????????????????1480????????????????1485Ala?Tyr?Ile?Leu?Ser?Thr?Asn?Leu?Ala?Gly?His?Trp?Ile?Leu?Ile?Ile
1490????????????????1495????????????????1500Gln?Leu?Met?Lys?Asp?Ser?Lys?Gly?Ile?Phe?Glu?Lys?Asp?Trp?Gly?Glu1505????????????????1510????????????????1515????????????????1520Gly?Tyr?Ile?Thr?Asp?His?Met?Phe?Ile?Asn?Leu?Asn?Val?Phe?Phe?Asn
1525????????????????1530????????????????1535Ala?Tyr?Lys?Thr?Tyr?Leu?Leu?Cys?Phe?His?Lys?Gly?Tyr?Gly?Lys?Ala
1540????????????????1545????????????????1550Lys?Leu?Glu?Cys?Asp?Met?Asn?Thr?Ser?Asp?Leu?Leu?Cys?Val?Leu?Glu
1555????????????????1560????????????????1565Leu?Ile?Asp?Ser?Ser?Tyr?Trp?Lys?Ser?Met?Ser?Lys?Val?Phe?Leu?Glu
1570????????????????1575????????????????1580Gln?Lys?Val?Ile?Lys?Tyr?Ile?Val?Asn?Gln?Asp?Thr?Ser?Leu?Arg?Arg1585????????????????1590????????????????1595????????????????1600Ile?Lys?Gly?Cys?His?Ser?Phe?Lys?Leu?Trp?Phe?Leu?Lys?Arg?Leu?Asp
1605????????????????1610????????????????1615Asn?Ala?Lys?Phe?Thr?Val?Cys?Pro?Trp?Val?Val?Asn?Ile?Asp?Tyr?His
1620????????????????1625????????????????1630Pro?Thr?His?Met?Lys?Ala?Ile?Leu?Ser?Tyr?Ile?Asp?Leu?Val?Arg?Met
1635????????????????1640????????????????1645Gly?Leu?Ile?Asn?Val?Asp?Lys?Leu?Thr?Ile?Lys?Asn?Lys?Asn?Lys?Phe
1650????????????????1655????????????????1660Asn?Asp?Glu?Phe?Tyr?Thr?Ser?Asn?Leu?Phe?Tyr?Ile?Ser?Tyr?Asn?Phe1665????????????????1670????????????????1675????????????????1680Ser?Asp?Asn?Thr?His?Leu?Leu?Thr?Lys?Gln?Ile?Arg?Ile?Ala?Asn?Ser
1685????????????????1690????????????????1695Glu?Leu?Glu?Asp?Asn?Tyr?Asn?Lys?Leu?Tyr?His?Pro?Thr?Pro?Glu?Thr
1700????????????????1705????????????????1710Leu?Glu?Asn?Met?Ser?Leu?Ile?Pro?Val?Lys?Ser?Asn?Asn?Ser?Asn?Lys
1715????????????????1720????????????????1725Pro?Lys?Phe?Cys?Ile?Ser?Gly?Asn?Thr?Glu?Ser?Met?Met?Met?Ser?Thr
1730????????????????1735????????????????1740Phe?Ser?Ser?Lys?Met?His?Ile?Lys?Ser?Ser?Thr?Val?Thr?Thr?Arg?Phe1745????????????????1750????????????????1755????????????????1760Asn?Tyr?Ser?Lys?Gln?Asp?Leu?Tyr?Asn?Leu?Phe?Pro?Ile?Val?Val?Ile
1765????????????????1770????????????????1775Asp?Lys?Ile?Ile?Asp?His?Ser?Gly?Asn?Thr?Ala?Lys?Ser?Asn?Gln?Leu
1780????????????????1785????????????????1790Tyr?Thr?Thr?Thr?Ser?His?Gln?Thr?Ser?Leu?Val?Arg?Asn?Ser?Ala?Ser
1795????????????????1800????????????????1805Leu?Tyr?Cys?Met?Leu?Pro?Trp?His?His?Val?Asn?Arg?Phe?Asn?Phe?Val
1810????????????????1815????????????????1820Phe?Ser?Ser?Thr?Gly?Cys?Lys?Ile?Ser?Ile?Glu?Tyr?Ile?Leu?Lys?Asp1825????????????????1830???????????????1835?????????????????1840Leu?Lys?Ile?Lys?Asp?Pro?Ser?Cys?Ile?Ala?Phe?Ile?Gly?Glu?Gly?Ala
1845????????????????1850????????????????1855Gly?Asn?Leu?Leu?Leu?Arg?Thr?Val?Val?Glu?Leu?His?Pro?Asp?Ile?Arg
1860????????????????1865????????????????1870Tyr?Ile?Tyr?Arg?Ser?Leu?Lys?Asp?Cys?Asn?Asp?His?Ser?Leu?Pro?Ile
1875????????????????1880????????????????1885Glu?Phe?Leu?Arg?Leu?Tyr?Asn?Gly?His?Ile?Asn?Ile?Asp?Tyr?Gly?Glu
1890????????????????1895????????????????1900Asn?Leu?Thr?Ile?Pro?Ala?Thr?Asp?Ala?Thr?Asn?Asn?Ile?His?Trp?Ser1905????????????????1910????????????????1915????????????????1920Tyr?Leu?His?Ile?Lys?Phe?Ala?Glu?Pro?Ile?Ser?Ile?Phe?Val?Cys?Asp
1925????????????????1930????????????????1935Ala?Glu?Leu?Pro?Val?Thr?Ala?Asn?Trp?Ser?Lys?Ile?Ile?Ile?Glu?Trp
1940????????????????1945????????????????1950Ser?Lys?His?Val?Arg?Lys?Cys?Lys?Tyr?Cys?Ser?Ser?Val?Asn?Arg?Cys
1955????????????????1960????????????????1965Ile?Leu?Ile?Ala?Lys?Tyr?His?Ala?Gln?Asp?Asp?Ile?Asp?Phe?Lys?Leu
1970????????????????1975????????????????1980Asp?Asn?Ile?Thr?Ile?Leu?Lys?Thr?Tyr?Val?Cys?Leu?Gly?Ser?Lys?Leu1985????????????????1990????????????????1995????????????????2000Lys?Gly?Ser?Glu?Val?Tyr?Leu?Ile?Leu?Thr?Ile?Gly?Pro?Ala?Asn?Ile
2005????????????????2010????????????????2015Leu?Pro?Val?Phe?Asp?Val?Val?Gln?Asn?Ala?Lys?Leu?Ile?Leu?Ser?Arg
2020????????????????2025????????????????2030Thr?Lys?Asn?Phe?Ile?Met?Pro?Lys?Lys?Thr?Asp?Lys?Glu?Ser?Ile?Asp
2035????????????????2040????????????????2045Ala?Val?Ile?Lys?Ser?Leu?Ile?Pro?Phe?Leu?Cys?Tyr?Pro?Ile?Thr?Lys
2050????????????????2055????????????????2060Lys?Gly?Ile?Lys?Thr?Ser?Leu?Ser?Lys?Leu?Lys?Ser?Val?Val?Asn?Gly2065????????????????2070????????????????2075????????????????2080Asp?Ile?Leu?Ser?Tyr?Ser?Ile?Ala?Gly?Arg?Asn?Glu?Val?Phe?Ser?Asn
2085????????????????2090????????????????2095
Lys?Leu?Ile?Asn?His?Lys?His?Met?Asn?Ile?Leu?Lys?Trp?Leu?Asp?His
2100????????????????2105????????????????2110
Val?Leu?Asn?Phe?Arg?Ser?Ala?Glu?Leu?Asn?Tyr?Asn?His?Leu?Tyr?Met
2115????????????????2120????????????????2125
Ile?Glu?Ser?Thr?Tyr?Pro?Tyr?Leu?Ser?Glu?Leu?Leu?Asn?Ser?Leu?Thr
2130????????????????2135????????????????2140
Thr?Asn?Glu?Leu?Lys?Lys?Leu?Ile?Lys?Ile?Thr?Gly?Ser?Val?Leu?Tyr
2145????????????????2150????????????????2155????????????????2160
Asn?Leu?Pro?Asn?Glu?Gln
The information of 2165 (2) SEQ ID NO:35:
(i) sequence signature:
(A) length: 24 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:35:CATATCACTC ACTCTGGGAT GGAG 24 (2) SEQ ID NO:36:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:36:TCAGAACATC AAGCACCGCC 20 (2) SEQ ID NO:37:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:37:ACAGTCAAGA CTGAGATGAG 20 (2) SEQ ID NO:38:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:38:AAGAGTCAGA TACATGTGGA 20 (2) SEQ ID NO:39:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:39:ACATGAATCA GCCTAAAGTC 20 (2) SEQ ID NO:40:
(i) sequence signature:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:40:CCGAAAGAGT TCCTGCGTTA CGACC 25 (2) SEQ ID NO:41:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:41:CAGTCCACAC AAGTACCAGG 20 (2) SEQ ID NO:42:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:42:GTCAGAAGCT GTGGACCATC 20 (2) SEQ ID NO:43:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:43:AATATTGCTA CAACAATGGC 20 (2) SEQ ID NO:44:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:44:ACTCTTCATT CCTAGACTGG 20 (2) SEQ ID NO:45:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:45:GTCCAATTAT GACTATGAAC 20 (2) SEQ ID NO:46:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) topological framework: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:46:AGAACAGACA TGAAGCTTGC 20 (2) SEQ ID NO:47:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:47:CCAACAAGGA ATGCTTCTAG 20 (2) SEQ ID NO:48:
(i) sequence signature:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:48:ACAGCACTAT CTATGATTGA CCTGG 25 (2) SEQ ID NO:49:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:49:GCAACATGGT TTACACATGC 20 (2) SEQ ID NO:50:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:50:AGATTGAGAG TTGATCCAGG 20 (2) SEQ ID NO:51:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:51:AGGAGATACT TAAACTAAGC 20 (2) SEQ ID NO:52:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:52:TAAGCTTATG CCTTTCAGCG 20 (2) SEQ ID NO:53:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:53:TTAACGGACC TAAGCTGTGC 20 (2) SEQ ID NO:54:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:54:GAAACAGATT ATTATGACGG 20 (2) SEQ ID NO:55:
(i) sequence signature:
(A) length: 24 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:55:CGGGCTATCT AGGTGAACTT CAGG 24 (2) SEQ ID NO:56:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:56:ATTTGGATAT GGAATATGAG 20 (2) SEQ ID NO:57:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:57:ACTCAACTGA ACTACCAGTG 20 (2) SEQ ID NO:58:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:58:AAGAACATCA TGTATTTCAG 20 (2) SEQ ID NO:59:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:59:TTATCAACGC ACTGCTCATG 20 (2) SEQ ID NO:60:
(i) sequence signature:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:60:ATTTTCAGCA ATCACTTGGC ATGCC 25 (2) SEQ ID NO:61:
(i) sequence signature:
(A) length: 20 base pairs
(B) chain: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:61:GCCTCTGTGC AAACAAGCTG 20 (2) SEQ ID NO:62:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence signature: the information of SEQ ID NO:62:TCTCTAGTTA CTCTAGCAGC 20 (2) SEQ ID NO:63:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:63:AGGTCGTTGT TTGTGAGGAG 20 (2) SEQ ID NO:64:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linear r
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:64:TCGTCCTCTT CTTTACTGTC 20 (2) SEQ ID NO:65:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:65:CCGTCCTCGA GCTAGCCTCG 20 (2) SEQ ID NO:66:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:66:CTCCTCCAGG CTCACATTGG 20 (2) SEQ ID NO:67:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:67:GGGTTGGTAC ATAGCTCTGC 20 (2) SEQ ID NO:68:
(i) sequence signature:
(A) length: 25 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:68:CACCCATCTG ATATTTCCCT GATGG 25 (2) SEQ ID NO:69:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:69:TGGTTGACAG TACAAATCTG 20 (2) SEQ ID NO:70:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:70:CTGAAATGGG AAGATTGTGC 20 (2) SEQ ID NO:71:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:71:AGCAATCTAC ACTGCCTACC 20 (2) SEQ ID NO:72:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:72:TCACAGATGA TTCAATTATC 20 (2) SEQ ID NO:73:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:73:GATCCTAGAT ATAAGTTCTC 20 (2) SEQ ID NO:74:
(i) sequence signature:
(A) length: 21 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:74:ACCAAACAAA GTTGGGTAAG G 21 (2) SEQ ID NO:75:
(i) sequence signature:
(A) length: 32 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:75:GGGGGATCCA TCCCTAATCC TGCTCTTGTC CC 32 (2) SEQ ID NO:76:
(i) sequence signature:
(A) length: 20 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:76:GATTCCTCTG ATGGCTCCAC 20 (2) SEQ ID NO:77:
(i) sequence signature:
(A) length: 21 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:77:TAACAGTCAA GGAGACCAAA G 21 (2) SEQ ID NO:78:
(i) sequence signature:
(A) length: 32 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: the information of SEQ ID NO:78:GGGAAGCTTA ACCCTAATCC TGCCCTAGGT GG 32 (2) SEQ ID NO:79:
(i) sequence signature:
(A) length: 22 base pairs
(B) type: nucleic acid
(C) chain: strand
(D) topological framework: linearity
(ii) molecule type: RNA (genome)
(xi) sequence description: SEQ ID NO:79:ACCAGACAAA GCTGGGAATA GA 22

Claims (46)

1. a mononegavirale virales is isolating, reorganization is that produce, attenuation, Nonsegmented, negative justice, strand RAN virus, and it has at least one attenuation sudden change at 3 ' genomic promoter region, and has at least one attenuation sudden change in RNA polymerase.
2. virus according to claim 1, wherein said virus is from Paramyxoviridae.
3. virus according to claim 2, wherein said virus is from the paramyxovirus subfamily.
4. virus according to claim 3, wherein said virus is from Morbillivirus.
5. virus according to claim 4, wherein said virus is Measles virus.
6. Measles virus according to claim 5, wherein:
(a) have the attenuation sudden change of at least one place to be selected from 3 ' genomic promoter region: (A → T), (A → T or A → C) (G → A), these Nucleotide all exist in normal chain, anti-group group, the messenger strand Nucleotide 42 Nucleotide 26 with Nucleotide 96; With
(b) having the attenuation sudden change of at least one place to be selected from the Nucleotide that causes following amino acid change in rna polymerase gene changes: residue 331 (Isoleucine → Threonine), 1409 (L-Ala → Threonines), 1624 (Threonine → L-Ala), 1649 (arginine → methionine(Met)s), 1717 (aspartic acid → L-Ala), 1936 (Histidine → tyrosine), 2074 (glutamine → arginine) and 2114 (arginine → Methionins).
7. virus according to claim 3, wherein said virus is from paramyxovirus genus.
8. virus according to claim 7, wherein said virus are 3 type human parainfluenza viruses (PIV-3).
9. PIV-3 virus according to claim 8, wherein
(a) have the attenuation sudden change of at least one place to be selected from 3 ' genomic promoter region: (T → C), (C → T), (G → T) (T → A), these Nucleotide all exist in normal chain, anti-genome, the messenger strand Nucleotide 28 Nucleotide 24 Nucleotide 23 with Nucleotide 45; With
(b) having the attenuation sudden change of at least one place to be selected from the Nucleotide that causes following amino acid change in rna polymerase gene changes: residue 942 (tyrosine → Histidine), 992 (leucine → phenylalanines), 1292 (leucine → phenylalanines) and 1558 (Threonine → Isoleucines).
10. virus according to claim 3, wherein said virus is from rubella virus genus.
11. virus according to claim 2, wherein said virus is from the pneumonitis virus subfamily.
12. virus according to claim 11, wherein said virus belongs to from pneumonitis virus.
13. virus according to claim 12, wherein said virus are human respiratory syncytial virus's (RSV) B subgroups.
14. virus according to claim 13, wherein
(a) sudden change of at least one place attenuation in 3 ' genomic promoter region is selected from: and Nucleotide 4 (C → G) and in a succession of A at Nucleotide 6 to 11 places, inserted an extra A, these Nucleotide all exist in normal chain, anti-genome, the messenger strand; With
(b) at least one place attenuation sudden change in rna polymerase gene is selected from the Nucleotide change that causes following amino acid change: residue 353 (arginine → Methionin), 451 (Methionin → arginine), 1229 (aspartic acid → l-asparagines), 2029 (Threonine → Isoleucines) and 2050 (l-asparagine → aspartic acids).
15. virus according to claim 1, wherein said virus is from Rhabdoviridae.
16. virus according to claim 1, wherein said virus is from Filoviridae.
17. a vaccine, it comprise according to the described mononegavirale virales of claim 1 isolating, reorganization is that produce, acceptable carrier on attenuation, Nonsegmented, negative justice, strand RAN virus and the physiology.
18. vaccine according to claim 17, it comprises acceptable carrier on described Measles virus of claim 5 and the physiology.
19. vaccine according to claim 18, it comprises acceptable carrier on described Measles virus of claim 6 and the physiology.
20. vaccine according to claim 17, it comprises acceptable carrier on described PIV-3 of claim 8 and the physiology.
21. vaccine according to claim 20, it comprises acceptable carrier on described PIV-3 of claim 9 and the physiology.
22. vaccine according to claim 17, it comprises acceptable carrier on claim 13 described RSV B subgroup and the physiology.
23. vaccine according to claim 22, it comprises acceptable carrier on claim 14 described RSV B subgroup and the physiology.
24. an immune body is with the method for the non-sections of inducing anti-mononegavirale virales, negative justice, strand RAN virus protection, it comprises and gives the individual right requirement 17 described vaccines.
25. method according to claim 24, vaccine wherein are the described vaccines of claim 18.
26. method according to claim 25, vaccine wherein are the described vaccines of claim 19.
27. method according to claim 24, vaccine wherein are the described vaccines of claim 20.
28. method according to claim 27, vaccine wherein are the described vaccines of claim 21.
29. method according to claim 24, vaccine wherein are the described vaccines of claim 22.
30. method according to claim 29, vaccine wherein are the described vaccines of claim 23.
31. isolated nucleic acid molecule, it comprises the normal chain that is selected from following Measles virus group, the sequence on the anti-genome messenger strand: 1977 wild-type strains (SEQ ID NO:3), 1983 wild-type strains (SEQ IDNO:5), wherein the 2499th Nucleotide is G or C; Montefiore wild-type strain (SEQ ID NO:7), Rubeovax TMVaccine strain (SEQ ID NO:9), wherein Nucleotide 2143 is T or C; Moraten vaccine strain (SEQ ID NO:11), Schwarz vaccine strain (SEQ ID NO:11), wherein Nucleotide 4917 is that C and Nucleotide 4924 are C and Zagreb vaccine strain (SEQ ID NO:13), and their complementary gene group sequence
32. isolated nucleic acid molecule, it comprises the PIV-3 sequence on normal chain, the anti-genome messenger strand, this sequence is selected from: cultivate cp45 vaccine strain (SEQ ID NO:19) in rhesus monkey embryo pneumonocyte and cultivation at the intracellular cp45 vaccine of Vero (SEQ ID NO:21), and the complementary gene group sequence of strain.
33. composition, it comprises a transcription vector, this carrier comprises isolated nucleic acid molecule, the non-sections of this molecule encoding mononegavirale virales, negative justice, single strand RNA virus, this nucleic acid molecule has attenuation sudden change and the interior attenuation sudden change of at least one RAN pol gene at least one 3 ' genomic promoter region, they with at least one expression vector together, this expression vector comprises at least one isolated nucleic acid molecule, this nucleic acid molecule encoding wraps up, transcribes and duplicate necessary trans-acting albumen, expresses producing infectious attenuated virus by this.
34. composition according to claim 33, wherein said transcription vector comprises an isolated nucleic acid molecule and at least one expression vector, the described Measles virus of this isolated nucleic acid molecule coding claim 5, this expression vector comprise at least one isolated nucleic acid molecule of coding trans-acting albumen N, P and L.
35. composition according to claim 34, wherein said transcription vector comprise the isolated nucleic acid molecule of the described Measles virus of coding claim 6.
36. composition according to claim 33, wherein said transcription vector contains the isolated nucleic acid molecule of the described PIV-3 of coding claim 8, and containing at least one expression vector, this expression vector comprises at least one isolated nucleic acid molecule of coding trans-acting albumen NP, P and L.
37. composition according to claim 36, wherein said transcription vector comprise the isolated nucleic acid molecule of the described PIV-3 of coding claim 9.
38. composition according to claim 33, wherein said transcription vector comprises an isolated nucleic acid molecule of the described RSV B of coding claim 13 subgroup, and comprising at least one expression vector, this expression vector comprises at least one isolated nucleic acid molecule of coding trans-acting albumen N, P, L and M2.
39. according to the described composition of claim 38, wherein said transcription vector comprises an isolated nucleic acid molecule of the described RSV B of coding claim 14 subgroup virus.
40. method of producing the infectious attenuation of strand negative-sense viral order, non-sections, negative justice, single strand RNA virus, it comprises with described two kinds of carriers conversion of claim 33 or transfection host cell at least, and under the condition that allows these carrier coexpressions, cultivate host cell, to produce infectious attenuated virus.
41. according to the described method of claim 40, virus wherein is the described Measles virus of claim 5.
42. according to the described method of claim 41, virus wherein is the described Measles virus of claim 6.
43. according to the described method of claim 40, virus wherein is the described PIV-3 virus of claim 8.
44. according to the described method of claim 43, virus wherein is the described PIV-3 virus of claim 9.
45. according to the described method of claim 40, virus wherein is the described RSV B of claim 13 subgroup virus.
46. according to the described method of claim 45, virus wherein is the described RSV B of claim 14 subgroup virus.
CN97198321A 1996-09-27 1997-09-19 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of order designated mononegavirales Pending CN1232504A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US2682396P 1996-09-27 1996-09-27
US60/026,823 1996-09-27

Publications (1)

Publication Number Publication Date
CN1232504A true CN1232504A (en) 1999-10-20

Family

ID=21833976

Family Applications (1)

Application Number Title Priority Date Filing Date
CN97198321A Pending CN1232504A (en) 1996-09-27 1997-09-19 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of order designated mononegavirales

Country Status (8)

Country Link
EP (1) EP0932684A2 (en)
JP (1) JP2000517194A (en)
KR (1) KR20000048628A (en)
CN (1) CN1232504A (en)
AU (1) AU4427897A (en)
BR (1) BR9712138A (en)
CA (1) CA2265554A1 (en)
WO (1) WO1998013501A2 (en)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6887699B1 (en) 1990-05-22 2005-05-03 Medimmune Vaccines, Inc. Recombinant negative strand RNA virus expression systems and vaccines
US6410023B1 (en) 1997-05-23 2002-06-25 United States Of America Recombinant parainfluenza virus vaccines attenuated by deletion or ablation of a non-essential gene
BR9812232A (en) * 1997-09-19 2000-07-18 American Cyanamid Co Human respiratory syncytial virus (rsv) human subgroup b, isolated, recombinantly generated, attenuated, vaccine, process to immunize an individual to induce protection against rsv subgroup b, composition, process to produce attenuated infectious rsv subgroup b, and isolated acid molecule nucleic
CA2323927A1 (en) * 1998-03-26 1999-09-30 American Cyanamid Company Mutations responsible for attenuation in measles virus or human respiratory syncytial virus subgroup b
DK1090108T3 (en) * 1998-06-03 2011-04-26 Wyeth Corp New methods for salvaging RNA viruses
AU748416B2 (en) 1998-06-12 2002-06-06 Andrej Egorov Interferon inducing genetically engineered attenuated viruses
CA2334895C (en) 1998-06-12 2016-01-19 Mount Sinai School Of Medicine Attenuated negative strand viruses with altered interferon antagonist activity for use as vaccines and pharmaceuticals
US6544785B1 (en) 1998-09-14 2003-04-08 Mount Sinai School Of Medicine Of New York University Helper-free rescue of recombinant negative strand RNA viruses
US6146642A (en) 1998-09-14 2000-11-14 Mount Sinai School Of Medicine, Of The City University Of New York Recombinant new castle disease virus RNA expression systems and vaccines
US6764685B1 (en) 2000-03-21 2004-07-20 Medimmune Vaccines, Inc. Recombinant parainfluenza virus expression systems and vaccines
WO2001077394A1 (en) 2000-04-10 2001-10-18 Mount Sinai School Of Medicine Of New York University Screening methods for identifying viral proteins with interferon antagonizing functions and potential antiviral agents
PT1292615E (en) * 2000-06-23 2007-01-31 Wyeth Corp Modified morbillivirus v proteins
US7361496B1 (en) 2000-08-02 2008-04-22 Wyeth Rescue of mumps virus from cDNA
CN101921732A (en) 2001-01-19 2010-12-22 维洛诺瓦蒂夫公司 A virus causing respiratory tract illness in susceptible mammals
US8715922B2 (en) 2001-01-19 2014-05-06 ViroNovative Virus causing respiratory tract illness in susceptible mammals
AU2003219839B2 (en) 2002-02-21 2008-02-21 Medimmune, Llc Recombinant parainfluenza virus expression systems and vaccines comprising heterologous antigens derived from metapneumovirus
US7465456B2 (en) 2002-04-26 2008-12-16 Medimmune, Llc Multi plasmid system for the production of influenza virus
CN103540568A (en) 2002-04-26 2014-01-29 米迪缪尼有限公司 Multi plasmid system for the production of influenza virus
DE60233038D1 (en) * 2002-06-20 2009-09-03 Pasteur Institut Infectious cDNA of an approved measles virus vaccine strain. Use in immunogenic compositions
EP1375670B1 (en) 2002-06-20 2013-06-12 Institut Pasteur Recombinant measles viruses expressing epitopes of antigens of RNA viruses and use of the recombinant viruses for the preparation of vaccine compositions
CA2432738A1 (en) * 2003-02-26 2004-08-26 Philippe Despres New dengue and west nile viruses proteins and genes coding the foregoing, and their use in vaccinal, therapeutic and diagnostic applications
US7572904B2 (en) 2003-03-28 2009-08-11 Medimmune, Llc Nucleic acids encoding respiratory syncytial virus subgroup B strain 9320
EP2494986A1 (en) 2003-04-25 2012-09-05 MedImmune Vaccines, Inc. Metapneumovirus strains and their use in vaccine formulations and as vectors for expression of antigenic sequences and methods for propagating virus
US7566458B2 (en) 2003-06-16 2009-07-28 Medimmune, Llc Influenza hemagglutinin and neuraminidase variants
WO2005062820A2 (en) 2003-12-23 2005-07-14 Medimmune Vaccines, Inc Multi plasmid system for the production of influenza virus
JP4980895B2 (en) 2004-05-25 2012-07-18 メディミューン,エルエルシー Influenza hemagglutinin and neuraminidase variants
CN102727880A (en) 2004-06-01 2012-10-17 西奈山医学院 Genetically engineered swine influenza virus and uses thereof
EP1855713B1 (en) 2005-02-15 2016-04-27 Mount Sinai School of Medicine Genetically engineered equine influenza virus and uses thereof
CA2600730C (en) 2005-03-08 2014-11-25 Medimmune, Inc. Influenza hemagglutinin and neuraminidase variants
AU2006262380A1 (en) 2005-06-21 2007-01-04 Medimmune, Llc Methods and compositions for expressing a heterologous protease
US7790434B2 (en) 2005-06-21 2010-09-07 Medimmune, Llc Methods and compositions for expressing negative-sense viral RNA in canine cells
KR101492643B1 (en) 2005-12-02 2015-02-12 이칸 스쿨 오브 메디슨 엣 마운트 시나이 Chimeric viruses presenting non-native surface proteins and uses thereof
MX2008013388A (en) 2006-04-19 2009-03-02 Medimmune Llc Methods and compositions for expressing negative-sense viral rna in canine cells.
WO2008133701A1 (en) 2006-07-21 2008-11-06 Medimmune, Llc. Methods and compositions for increasing replication capacity of an influenza virus
CN101983069B (en) 2006-08-09 2014-07-16 米迪缪尼有限公司 Influenza hemagglutinin and neuraminidase variants
EP2099903A4 (en) * 2006-12-22 2010-07-28 Penn State Res Found Modified polymerases and attenuated viruses and methods of use thereof
JP5666905B2 (en) 2007-06-18 2015-02-12 メディミューン,エルエルシー Influenza B virus having alterations in hemagglutinin polypeptide
CA2730408A1 (en) 2008-07-11 2010-01-14 Chin-Fen Yang Influenza hemagglutinin and neuraminidase variants
WO2010053986A1 (en) 2008-11-05 2010-05-14 Wyeth Multicomponent immunogenic composition for the prevention of beta-hemolytic streptococcal (bhs) disease
ES2550179T3 (en) 2009-02-05 2015-11-05 Icahn School Of Medicine At Mount Sinai Chimeric Newcastle disease viruses and uses thereof
CN102361649A (en) 2009-02-12 2012-02-22 米迪缪尼有限公司 Influenza hemagglutinin and neuraminidase variants
JP2012521786A (en) 2009-03-30 2012-09-20 モウント シナイ スクール オフ メディシネ Influenza virus vaccine and use thereof
WO2011014504A1 (en) 2009-07-27 2011-02-03 Mount Sinai School Of Medicine Of New York University Recombinant influenza virus vectors and uses thereof
EP2459585A1 (en) 2009-07-30 2012-06-06 Mount Sinai School of Medicine Influenza viruses and uses thereof
US9708373B2 (en) 2010-03-30 2017-07-18 Icahn School Of Medicine At Mount Sinai Influenza virus vaccine and uses thereof
EP2420242A1 (en) 2010-08-20 2012-02-22 Lauer, Ulrich M. Oncolytic measles virus
CN103906843B (en) * 2011-06-08 2016-12-07 维什瓦斯·乔希 Double-mass model mammalian expression systems
US9441205B2 (en) 2011-06-08 2016-09-13 Vishwas Joshi Two plasmid mammalian expression system
BR112014006694A2 (en) 2011-09-20 2020-11-17 Mount Sinai School Of Medicine influenza vaccines and uses of these
US20150224181A1 (en) 2012-09-14 2015-08-13 The United States Of America As Represented By The Secretary Department Of Health And Human Se Brachyury protein, non-poxvirus non-yeast vectors encoding brachyury protein, and their use
CN105263516A (en) 2012-12-18 2016-01-20 西奈山伊坎医学院 Influenza virus vaccines and uses thereof
US20160015760A1 (en) 2013-03-14 2016-01-21 Icahn School Of Medicine At Mount Sinai Newcastle disease viruses and uses thereof
WO2014159960A1 (en) 2013-03-14 2014-10-02 Icahn School Of Medicine At Mount Sinai Antibodies against influenza virus hemagglutinin and uses thereof
US20170000832A1 (en) 2014-02-27 2017-01-05 Viralytics Limited Combination method for treatment of cancer
CA2974699A1 (en) 2015-01-23 2016-07-28 Icahn School Of Medicine At Mount Sinai Influenza virus vaccination regimens
WO2016137929A1 (en) 2015-02-26 2016-09-01 Boehringer Ingelheim Vetmedica Gmbh Bivalent swine influenza virus vaccine
WO2017024000A1 (en) 2015-08-03 2017-02-09 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Brachyury deletion mutants, non-yeast vectors encoding brachyury deletion mutants, and their use
WO2017031408A1 (en) 2015-08-20 2017-02-23 University Of Rochester Single-cycle virus for the development of canine influenza vaccines
US10973903B2 (en) 2015-08-20 2021-04-13 University Of Rochester NS1 truncated virus for the development of canine influenza vaccines
AU2016308917A1 (en) 2015-08-20 2018-03-15 Cornell University Live-attenuated vaccine having mutations in viral polymerase for the treatment and prevention of canine influenza virus
EP3463439B1 (en) 2016-06-03 2022-08-03 University of Rochester Equine influenza virus live-attenuated vaccines
CA3023143A1 (en) 2016-06-15 2017-12-21 Icahn School Of Medicine At Mount Sinai Influenza virus hemagglutinin proteins and uses thereof
US11254733B2 (en) 2017-04-07 2022-02-22 Icahn School Of Medicine At Mount Sinai Anti-influenza B virus neuraminidase antibodies and uses thereof
JOP20190256A1 (en) 2017-05-12 2019-10-28 Icahn School Med Mount Sinai Newcastle disease viruses and uses thereof
CN111989116A (en) 2018-02-27 2020-11-24 罗切斯特大学 Multivalent attenuated live influenza vaccine for prevention and control of Equine Influenza Virus (EIV)
US11166996B2 (en) 2018-12-12 2021-11-09 Flagship Pioneering Innovations V, Inc. Anellovirus compositions and methods of use
WO2020176709A1 (en) 2019-02-27 2020-09-03 University Of Rochester Multivalent live-attenuated influenza vaccine for prevention and control of equine influenza virus (eiv) in horses
US11103576B1 (en) 2020-06-15 2021-08-31 University Of Pittsburgh - Of The Commonwealth System Of Higher Education Measles virus vaccine expressing SARS-COV-2 protein(s)

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU7007491A (en) * 1990-02-02 1991-08-08 Schweiz. Serum- & Impfinstitut Bern Cdna corresponding to the genome of negative-strand rna viruses, and process for the production of infectious negative-strand rna viruses
JP3045581B2 (en) * 1991-10-14 2000-05-29 社団法人北里研究所 Measles vaccine virus strain identification method
EP0636172B1 (en) * 1992-04-14 2005-08-17 The Mount Sinai School of Medicine of the City University of New York Genetically engineered attenuated viruses
IL105456A (en) * 1992-04-21 1996-12-05 American Home Prod Attenuated respiratory syncytial virus vaccine compositions
TW275632B (en) * 1992-04-21 1996-05-11 American Cyanamid Co
ES2210273T5 (en) * 1994-07-18 2010-03-29 Conzelmann, Karl-Klaus, Prof. Dr. VIRUS WITH NEGATIVE CHAIN NON-SEGMENTED RECOMBINANT INFECTIVE.

Also Published As

Publication number Publication date
JP2000517194A (en) 2000-12-26
BR9712138A (en) 2000-01-18
WO1998013501A2 (en) 1998-04-02
EP0932684A2 (en) 1999-08-04
WO1998013501A3 (en) 1998-08-13
AU4427897A (en) 1998-04-17
CA2265554A1 (en) 1998-04-02
KR20000048628A (en) 2000-07-25

Similar Documents

Publication Publication Date Title
CN1232504A (en) 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of order designated mononegavirales
CN1273603A (en) Attenuated respiratory syncytial viruses
CN1250725C (en) Prodn of attenuated parainfluenza virus vaccines from cloned nucleotide sequences
CN101012454B (en) Production of attenuated chimeric respiratory syncytial virus vaccines from cloned nucleotide sequences
US6664066B2 (en) Modified Morbillivirus V proteins
AU2020203460B2 (en) Attenuation of human respiratory syncytial virus by genome scale codon-pair deoptimization
US7192593B2 (en) Use of recombinant parainfluenza viruses (PIVs) as vectors to protect against infection and disease caused by PIV and other human pathogens
CN1347453A (en) Use of recombinant parainfluenza viruses (PIVs) as vectors to protect against infection and disease caused by PIV and other human pathogens
CN1347458A (en) Production of attenuated negative stranded RNA virus vaccines from cloned nucleotide sequences
AU2001267014A1 (en) Modified morbillivirus V proteins
WO1999015631A1 (en) Recombinant rsv virus expression systems and vaccines
CN1364195A (en) Production of attenuated chimeric respiratory syncytial virus vaccines from cloned nucleotide sequences
CN1402792A (en) Production of attenuated, human-bovine chimeric respiratory syncytial virus vaccines
KR20110063863A (en) Live, attenuated respiratory syncytial virus
CN1370237A (en) Recombinant parainfluenza virus vaccines attenuated by deletion or ablation of non-essential gene
CN113293149A (en) Construction of F gene replaced chimeric measles attenuated strain
CN1177927C (en) Mutations responsible for attenuation in measles virus or human respiratory syncytial virus subgroup B
CN1369011A (en) Attenuated human-bovine chimeric parainfluenza virus (PIV) vaccines
WO2013154728A1 (en) Genetically stable live attenuated respiratory syncytial virus vaccine and its production
CN1224462A (en) Production of attenuated respiratory syncytial virus vaccines from cloned nucleotide sequences
CN1468301A (en) Respiratory syncytial virus vaccines expressing protective antigens from promotor-proximal genes
AU8933001A (en) 3' genomic promoter region and polymerase gene mutations responsible for attenuation in viruses of the order designated mononegavirales
MXPA00009256A (en) Mutations responsible for attenuation in measles virus or human respiratory syncytial virus subgroup b
Takeuchi et al. Toward understanding the pathogenicity of wild-type measles virus by reverse genetics
AU5592201A (en) Production of attenuated respiratory syncytial virus vaccines from cloned nucleotide sequences

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication