CN1257282C - Southern oligomycin biosynthetic gene cluster - Google Patents

Southern oligomycin biosynthetic gene cluster

Publication number
CN1257282C
Authority
CN
China
Legal status
Expired - Fee Related
Application number
CN 03150923
Other languages
Chinese (zh)
Other versions
CN1523034A (en
Inventor
邓子新
孙宇晖
周秀芬
涂国全
Current Assignee
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Application filed by Shanghai Jiaotong University
Priority to CN 03150923
Publication of CN1523034A
Application granted
Publication of CN1257282C
Legal status: Expired - Fee Related

Landscapes

  • Peptides Or Proteins (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

The present invention relates to a southern oligomycin (nanoligomycin) biosynthetic gene cluster and belongs to the field of genetic engineering. The complete cluster comprises 13 genes: (1) seven polyketide synthase genes, namely nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7; (2) two modification genes, namely nlmB and nlmOI; (3) two transposase genes, namely nlmTI and nlmTII; (4) one regulatory gene, namely nlmRI; and (5) one precursor synthesis gene, namely ccrA. The genes provided, together with their proteins and antibodies against those proteins, can also be used to search for and develop compounds or proteins for use in medicine, industry and agriculture.

Description

Nanoligomycin biosynthetic gene cluster
Technical field
The present invention relates to an antibiotic biosynthetic gene cluster, and in particular to the biosynthetic gene cluster of nanoligomycin, a 26-membered macrolide antibiotic. It belongs to the field of genetic engineering.
Background art
Streptomycetes and closely related actinomycetes are of great practical value because they produce a large number of natural antibiotics. Polyketides are one of the largest classes of these natural products and, owing to their considerable pharmaceutical value, are widely used in human medicine, veterinary medicine and agriculture. Examples include the antibacterial antibiotic erythromycin, the antifungal antibiotic amphotericin B, the antiparasitic antibiotic avermectin, the tumor inhibitor rapamycin and the antitumor antibiotic daunorubicin. Polyketides are assembled by polyketide synthases (PKSs). A modular PKS builds the polyketide through successive condensations of simple carboxylic acid units, in a manner similar to fatty acid biosynthesis; for details see David A. Hopwood and David H. Sherman, Molecular genetics of polyketides and its comparison to fatty acid biosynthesis, Annual Review of Genetics, 1990, 24: 37-66. Each module is responsible for only one condensation step during polyketide chain formation and contains at least a β-ketoacyl synthase (KS) domain, an acyltransferase (AT) domain and an acyl carrier protein (ACP) domain. A module may also contain a β-ketoacyl reductase (KR) domain, a dehydratase (DH) domain and an enoyl reductase (ER) domain, which determine the degree of reduction of the extender unit that has been added. In addition, a thioesterase (TE) domain is needed to catalyze cyclization and release of the polyketide chain. Finally, the chain undergoes modification steps such as hydroxylation, glycosylation, methylation and acylation, which are essential for the biological activity of most end products. The modular organization of PKSs has a certain plasticity, so that new polyketide derivatives can be obtained by genetic engineering, for example by changing the number of modules, altering the specificity of extension modules, or inserting or inactivating domains.
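The way the optional reductive domains set the state of the β-carbon can be sketched in a few lines of Python. This is only an illustration: the function name and the output labels are assumptions introduced here, and the sketch assumes that every domain listed for a module is catalytically active.

```python
# Illustrative sketch: infer the beta-carbon processing of a PKS extension
# module from its domain composition. Names and labels are hypothetical.

REQUIRED = {"KS", "AT", "ACP"}  # minimal domain set of an extension module


def beta_carbon_state(domains):
    """Return the expected beta-carbon state for a collection of domain names."""
    present = set(domains)
    missing = REQUIRED - present
    if missing:
        raise ValueError(f"not a complete module, missing: {sorted(missing)}")
    if "KR" not in present:
        return "keto"         # no reduction: beta-keto group retained
    if "DH" not in present:
        return "hydroxyl"     # KR only: keto group reduced to a hydroxyl
    if "ER" not in present:
        return "double bond"  # KR + DH: hydroxyl dehydrated to an alkene
    return "methylene"        # KR + DH + ER: fully reduced


if __name__ == "__main__":
    for doms in (["KS", "AT", "ACP"],
                 ["KS", "AT", "KR", "ACP"],
                 ["KS", "AT", "DH", "KR", "ACP"],
                 ["KS", "AT", "DH", "ER", "KR", "ACP"]):
        print("-".join(doms), "->", beta_carbon_state(doms))
```

A lookup of this kind is only a first approximation; inactive domains in a real synthase would change the outcome.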
Nanoligomycin (nanligomycin) is a 26-membered macrolide antibiotic produced by Streptomyces nanchangensis. It is a specific inhibitor of mitochondrial ATPase with strong antifungal activity, and is also a potential immunosuppressant and antitumor agent. Its structure resembles that of oligomycin, which is likewise a 26-membered macrolide antibiotic, but differs completely at C-14, C14-C15 and C-26, so it is a new structure. The composition and organization of its genes also differ markedly, so the cluster is an entirely new gene cluster.
Summary of the invention
The object of the present invention is to provide the nucleotide sequence, or its complementary sequence, of the nanoligomycin biosynthetic gene cluster (sequence 1), comprising 13 genes in total, so that it can be used to discover and develop compounds or proteins for medicine, industry and agriculture. Seven of the genes encode polyketide synthases (PKSs), which comprise 17 modules and 79 domains in total and catalyze the biosynthesis of the nanoligomycin polyketide aglycone. Two genes, nlmB and nlmOI, encode proteins involved in modification during nanoligomycin biosynthesis and catalyze oxidation of the polyketide chain. Two genes, nlmTI and nlmTII, encode proteins involved in transposition of the nanoligomycin biosynthetic gene cluster. One regulatory gene, nlmRI, takes part in regulating nanoligomycin biosynthesis. One gene, ccrA, takes part in the biosynthesis of a nanoligomycin precursor. The nucleotide sequences of these genes are located in sequence 1 as follows: nlmA1 (5469-23915), nlmA2 (23938-38337), nlmA3 (38626-50133), nlmA4 (93935-82242), nlmA5 (82170-76564), nlmA6 (50289-64196), nlmA7 (64243-75024), nlmB (76408-75191), nlmOI (94138-94641), nlmTI (255-1208), nlmTII (98193-96412), nlmRI (2168-4990), ccrA (94705-96048).
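The coordinates listed above can be turned into strand and length information with a short script. The sketch below reads a start position larger than the end position as a gene encoded on the complementary strand; that reading, and the names GENES and describe, are assumptions made for illustration.

```python
# Gene coordinates copied from the text (1-based, inclusive, in sequence 1).
# Assumption of this sketch: start > end means the complementary strand.
GENES = {
    "nlmA1": (5469, 23915),
    "nlmA2": (23938, 38337),
    "nlmA3": (38626, 50133),
    "nlmA4": (93935, 82242),
    "nlmA5": (82170, 76564),
    "nlmA6": (50289, 64196),
    "nlmA7": (64243, 75024),
    "nlmB": (76408, 75191),
    "nlmOI": (94138, 94641),
    "nlmTI": (255, 1208),
    "nlmTII": (98193, 96412),
    "nlmRI": (2168, 4990),
    "ccrA": (94705, 96048),
}


def describe(start, end):
    """Return (strand, length in bp, whole codons, leftover bases)."""
    strand = "+" if start < end else "-"
    length = abs(end - start) + 1          # inclusive coordinates
    codons, remainder = divmod(length, 3)  # codon count includes the stop codon
    return strand, length, codons, remainder


if __name__ == "__main__":
    for name, (start, end) in GENES.items():
        strand, length, codons, rem = describe(start, end)
        print(f"{name:7s} {strand} {length:6d} bp {codons:5d} codons (remainder {rem})")
```

Every interval has a length divisible by three, as expected for complete open reading frames.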
The present invention also provides a way to obtain novel polyketide synthases by constructing recombinant vectors in which at least one fragment of the polyketide synthase sequences in sequence 1 is combined with sequences from other polyketide synthase gene clusters.
The present invention also provides a way to increase nanoligomycin yield in genetically engineered microorganisms.
The present invention also provides a way to obtain recombinant DNA vectors that contain at least part of the DNA sequence of sequence 1.
The present invention also provides a way to produce microorganisms in which a nanoligomycin biosynthetic gene is disrupted or duplicated, at least one of the genes comprising a nucleotide sequence from sequence 1.
The complementary sequence of sequence 1 can be derived at any time from the rules of DNA base pairing. The nucleotide sequence of sequence 1, or part of it, can be obtained by polymerase chain reaction (PCR), by digesting the corresponding DNA with suitable restriction enzymes, or by other suitable techniques. Using the nucleotide sequence or partial sequence provided by the invention, genes similar to the nanoligomycin biosynthetic genes can be obtained from other organisms by PCR or by Southern hybridization with a probe DNA comprising a sequence of the invention.
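Deriving a complementary strand from the base-pairing rules is a simple character substitution. The following minimal sketch applies it to the first ten bases of sequence 1; the function names are illustrative, and characters other than a, c, g and t are left unchanged as a simplifying assumption.

```python
# Minimal sketch: complementary strand by Watson-Crick base pairing.
_PAIRS = str.maketrans("acgtACGT", "tgcaTGCA")


def complement(seq):
    """Complement each base, keeping the original reading direction."""
    return seq.translate(_PAIRS)


def reverse_complement(seq):
    """Complementary strand written 5' to 3'."""
    return complement(seq)[::-1]


if __name__ == "__main__":
    first_bases = "atggaacggt"  # first 10 bases of sequence 1
    print(complement(first_bases))
    print(reverse_complement(first_bases))
```

Genes whose coordinates run from a higher to a lower position in sequence 1 would be read from the reverse complement of the corresponding interval.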
Cloned genes or DNA fragments comprising the nucleotide sequence provided by the invention, or at least part of it, can be used to obtain new nanoligomycin derivatives by interrupting one or several steps of nanoligomycin biosynthesis. Such DNA fragments or genes can also be used to increase the yield of nanoligomycin or its derivatives.
Cloned DNA comprising the nucleotide sequence provided by the invention, or at least part of it, can be used to locate further library plasmids in a Streptomyces nanchangensis genomic library. These library plasmids contain at least part of the sequence of the invention and also contain previously uncloned DNA from adjacent regions of the Streptomyces nanchangensis genome.
The nucleotide sequences provided by the invention can be modified or mutated. Suitable approaches include insertion or replacement, polymerase chain reaction, error-prone polymerase chain reaction, site-specific mutagenesis, religation of different sequences, and mutagenesis by ultraviolet light or chemical agents.
The nucleotide sequences provided by the invention can be subjected to directed evolution (DNA shuffling) using different parts of the sequence or homologous sequences from other sources.
New polyketides can be produced by deleting or inactivating one or more polyketide synthase domains, modules or genes from the same or different polyketide synthase systems, or by adding one or more polyketide synthase domains, modules or genes.
Cloned genes comprising the sequence of the invention, or at least part of it, can be expressed in foreign hosts through a suitable expression system to obtain modified polyketide synthases, higher biological activity or higher yield. Such foreign hosts include streptomycetes, Escherichia coli, Bacillus, yeast, plants and animals.
Fragments, domains, modules or genes comprising the nucleotide sequence of the invention, or at least part of it, can be used to construct polyketide synthase libraries, derivative polyketide synthase libraries or combinatorial libraries.
The nucleotide sequences of the nanoligomycin biosynthetic modification genes provide a way to obtain nanoligomycin derivatives by deleting or altering these modification genes.
Genes or gene clusters comprising the nucleotide sequence of the invention, or at least part of it, can be expressed in heterologous hosts, and their function in the host's metabolic pathways can be studied with DNA chip technology.
Polypeptides comprising the amino acid sequence of the invention, or at least part of it, may retain biological activity, or even gain new biological activity, after certain amino acids are removed or replaced, or may show improved yield, optimized kinetic properties or other desired properties.
New proteins or enzymes can be obtained by deleting or joining amino acid sequences of the invention with suitable techniques, leading in turn to new polyketides or related products.
The amino acid sequences provided by the invention can be used to isolate the desired proteins and to prepare antibodies.
The amino acid sequences provided by the invention make it possible to predict the three-dimensional structure of the polyketide synthases.
The present invention has substantive features and represents notable progress. The genes provided by the invention, together with their proteins and antibodies, can also be used to search for and develop compounds or proteins for use in medicine, industry and agriculture.
Description of drawings
Fig. 1: Chemical structures of nanoligomycin (A) and oligomycin (B)
In Fig. 1, the substituents of the nanoligomycin components are:
Component   R1   R4
A           OH   H2
B           OH   O
C           H    H2
and the substituents of the oligomycin components are:
Component   R1   R2    R3   R4
A           OH   CH3   H    H2
B           OH   CH3   H    O
C           H    CH3   H    H2
D           OH   H     H    H2
E           OH   CH3   OH   O
Fig. 2: Organization of the nanoligomycin biosynthetic gene cluster
In Fig. 2, arrows A1-A7 represent the polyketide synthase genes, arrow B the cytochrome P450 gene, arrow OI the oxidase gene, arrows TI and TII the transposase genes, arrow RI the transcriptional regulator gene, and arrow ccrA the ccrA gene.
Fig. 3: Model of nanoligomycin polyketide biosynthesis
In Fig. 3, boxes represent type I PKS protein subunits, lines represent type I PKS modules, and circles represent individual domains. KS denotes the β-ketoacyl synthase domain; ATa, ATb and ATp denote acyltransferase domains that load acetate, butyrate and propionate extender units, respectively; KR denotes the β-ketoacyl reductase domain; DH the dehydratase domain; ER the enoyl reductase domain; ACP the acyl carrier protein domain; and TE the thioesterase domain.
Embodiment
A DNA fragment from the erythromycin biosynthetic gene cluster was used as a probe to hybridize against a total-DNA genomic library of Streptomyces nanchangensis, yielding positive cosmids containing homologous sequences. Four of these positive cosmids, 8D1, 6C2, 8G1 and 16C4, were selected and sequenced by the shotgun method. The DNA was sheared by sonication with a 550 Sonic Dismembrator (Fisher Scientific), and fragments of 1.6-2.0 kb were recovered from 0.7% low-melting-point agarose, purified with the Geneclean II kit (Bio 101, Inc.) and cloned into the SmaI site of pUC18 (pretreated with CIAP) to build a series of sequencing subclones. Plasmid DNA of the sequencing subclones was prepared with the Prep 96 Plasmid Kit (Qiagen). Sequencing was performed automatically on 377 DNA Sequencers (PE/ABD) with BigDye Terminator Cycle Sequencing Kits (Applied Biosystems Division, Perkin Elmer). The universal sequencing primers were 5'-GTA AAA CGA CGG CCA GT-3' (forward) and 5'-GCG GAT AAC AAT TTC ACA CAG G-3' (reverse). ORF analysis of the sequence was carried out with the online software FramePlot 2.3.2, provided on the server of Japan's National Institute of Health at http://watson.nih.go.jp/~jun/cgi-bin/frameplot.pl. Sequence homology comparisons were carried out with the PSI-BLAST software provided on the server of the US National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/BLAST).
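As a small aid for working with the universal sequencing primers quoted above, the following sketch strips the triplet spacing and reports each primer's length and GC fraction. The helper names are assumptions introduced for illustration and are not part of the sequencing protocol.

```python
# Primers as printed in the text, written in triplets.
PRIMERS = {
    "forward": "GTA AAA CGA CGG CCA GT",
    "reverse": "GCG GAT AAC AAT TTC ACA CAG G",
}


def normalize(primer):
    """Remove spaces so a primer written in triplets becomes one string."""
    return primer.replace(" ", "").upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    for name, raw in PRIMERS.items():
        seq = normalize(raw)
        print(f"{name}: {seq} ({len(seq)} nt, GC {gc_fraction(seq):.2f})")
```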
The present invention is described in further detail below with reference to Fig. 1, Fig. 2 and Fig. 3.
The complete nanoligomycin biosynthetic gene cluster of the invention comprises 13 genes in total, namely:
(1) polyketide synthase genes: nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7, 7 genes in total;
(2) nanoligomycin modification genes: nlmB and nlmOI, 2 genes in total;
(3) nanoligomycin transposase genes: nlmTI and nlmTII, 2 genes in total;
(4) nanoligomycin regulatory gene: nlmRI;
(5) nanoligomycin precursor synthesis gene: ccrA.
Polyketide synthase genes:
The following are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the 7 type I polyketide synthase open reading frames, nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7, which encode the enzymes required to catalyze biosynthesis of the polyketide aglycone of the 26-membered macrolide antibiotic nanoligomycin in Streptomyces nanchangensis NS3226.
Also included are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the modules and domains of these 7 type I polyketide synthase open reading frames, namely the ketosynthase, acyltransferase, ketoreductase, dehydratase, enoyl reductase, acyl carrier protein and thioesterase domains.
Seven genes in sequence 1 (nlmA1-nlmA7) encode polyketide synthases, which together comprise 17 modules and 79 domains and catalyze the biosynthesis of the nanoligomycin polyketide aglycone (Fig. 2, Fig. 3).
NlmA1 comprises 4 modules: the loading module and modules 1, 2 and 3. The loading module contains 3 domains (KS-L, AT-L, ACP-L); it initiates polyketide chain synthesis by introducing an acetate starter unit, which ultimately forms the C33-C34 carbon backbone of nanoligomycin. Module 1 contains 4 domains (KS1, AT1, KR1, ACP1) and introduces an acetate extender unit that ultimately forms the C31-C32 carbon backbone. Module 2 contains 4 domains (KS2, AT2, KR2, ACP2) and introduces a propionate extender unit that ultimately forms the C29-C30 carbon backbone, with a methyl branch on C30. Module 3 contains 6 domains (KS3, AT3, DH3, ER3, KR3, ACP3) and introduces an acetate extender unit that ultimately forms the C27-C28 carbon backbone. The amino acid positions of each domain are shown in Table 1.
Table 1. Domains of the polyketide synthase NlmA1 and their amino acid positions
Module           Domains and amino acid positions in sequence 2
Loading module   KS-L 19-428; AT-L 534-827; ACP-L 914-985
Module 1         KS1 1006-1418; AT1 1524-1819; KR1 2104-2283; ACP1 2372-2453
Module 2         KS2 2478-2902; AT2 3009-3312; KR2 3629-3808; ACP2 3896-3974
Module 3         KS3 4012-4418; AT3 4524-4814; DH3 4875-5063; ER3 5385-5689; KR3 5699-5879; ACP3 5980-6063
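The carbon assignments follow a regular pattern: the loading module supplies C33-C34 and each later module supplies the next two carbons toward C1, ending with C1-C2 at module 16. A minimal sketch of that arithmetic follows; the formula is inferred from the assignments stated in the module descriptions, and the function name and the convention of using 0 for the loading module are assumptions.

```python
def backbone_carbons(module):
    """Return the carbon pair (low, high) contributed by a module.

    module = 0 stands for the loading module; 1..16 are extension modules.
    The formula is inferred from the assignments given in the text.
    """
    if not 0 <= module <= 16:
        raise ValueError("module must be 0 (loading module) to 16")
    high = 34 - 2 * module
    return high - 1, high


if __name__ == "__main__":
    for m in range(17):
        low, high = backbone_carbons(m)
        label = "loading module" if m == 0 else f"module {m}"
        print(f"{label}: C{low}-C{high}")
```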
NlmA2 comprises 3 modules: modules 4, 5 and 6. Module 4 contains 4 domains (KS4, AT4, KR4, ACP4) and introduces an acetate extender unit that ultimately forms the C25-C26 carbon backbone of nanoligomycin. Module 5 contains 4 domains (KS5, AT5, KR5, ACP5) and introduces a propionate extender unit that ultimately forms the C23-C24 carbon backbone, with a methyl branch on C24. Module 6 contains 5 domains (KS6, AT6, DH6, KR6, ACP6) and introduces an acetate extender unit that ultimately forms the C21-C22 carbon backbone. The amino acid positions of each domain are shown in Table 2.
Table 2. Domains of the polyketide synthase NlmA2 and their amino acid positions
Module     Domains and amino acid positions in sequence 3
Module 4   KS4 33-450; AT4 564-866; KR4 1141-1292; ACP4 1379-1446
Module 5   KS5 1467-1887; AT5 2002-2301; KR5 2625-2804; ACP5 2912-2979
Module 6   KS6 3003-3429; AT6 3542-3839; DH6 3898-4050; KR6 4357-4534; ACP6 4662-4711
NlmA3 comprises 2 modules: modules 7 and 8. Module 7 contains 6 domains (KS7, AT7, DH7, ER7, KR7, ACP7) and introduces a butyrate extender unit that ultimately forms the C19-C20 carbon backbone of nanoligomycin, with an ethyl branch on C20. Module 8 contains 5 domains (KS8, AT8, DH8, KR8, ACP8) and introduces an acetate extender unit that ultimately forms the C17-C18 carbon backbone. The amino acid positions of each domain are shown in Table 3.
Table 3. Domains of the polyketide synthase NlmA3 and their amino acid positions
Module     Domains and amino acid positions in sequence 4
Module 7   KS7 166-366; AT7 476-779; DH7 831-1020; ER7 1343-1634; KR7 1641-1824; ACP7 1928-1995
Module 8   KS8 2026-2445; AT8 2548-2852; DH8 2908-3082; KR8 3396-3578; ACP8 3681-3748
NlmA4 comprises 2 modules: modules 9 and 10. Module 9 contains 5 domains (KS9, AT9, DH9, KR9, ACP9) and introduces an acetate extender unit that ultimately forms the C15-C16 carbon backbone of nanoligomycin. Module 10 contains 6 domains (KS10, AT10, DH10, ER10, KR10, ACP10) and introduces an acetate extender unit that ultimately forms the C13-C14 carbon backbone. The amino acid positions of each domain are shown in Table 4.
Table 4. Domains of the polyketide synthase NlmA4 and their amino acid positions
Module      Domains and amino acid positions in sequence 5
Module 9    KS9 33-459; AT9 585-886; DH9 941-1125; KR9 1423-1602; ACP9 1711-1771
Module 10   KS10 1797-2224; AT10 2324-2615; DH10 2670-2853; ER10 3171-3460; KR10 3468-3647; ACP10 3749-3806
NlmA5 comprises 1 module: module 11. Module 11 contains 5 domains (KS11, AT11, DH11, KR11, ACP11) and introduces a propionate extender unit that ultimately forms the C11-C12 carbon backbone of nanoligomycin, with a methyl branch on C12. The amino acid positions of each domain are shown in Table 5.
Table 5. Domains of the polyketide synthase NlmA5 and their amino acid positions
Module      Domains and amino acid positions in sequence 6
Module 11   KS11 33-459; AT11 569-871; DH11 925-1110; KR11 1430-1610; ACP11 1719-1780
NlmA6 comprises 3 modules: modules 12, 13 and 14. Module 12 contains 4 domains (KS12, AT12, KR12, ACP12) and introduces a propionate extender unit that ultimately forms the C9-C10 carbon backbone of nanoligomycin, with a methyl branch on C10. Module 13 contains 4 domains (KS13, AT13, KR13, ACP13) and introduces a propionate extender unit that ultimately forms the C7-C8 carbon backbone, with a methyl branch on C8. Module 14 contains 4 domains (KS14, AT14, KR14, ACP14) and introduces a propionate extender unit that ultimately forms the C5-C6 carbon backbone, with a methyl branch on C6. The amino acid positions of each domain are shown in Table 6.
Table 6. Domains of the polyketide synthase NlmA6 and their amino acid positions
Module      Domains and amino acid positions in sequence 7
Module 12   KS12 30-444; AT12 554-856; KR12 1170-1351; ACP12 1459-1524
Module 13   KS13 1547-1964; AT13 2072-2374; KR13 2688-2865; ACP13 2974-3041
Module 14   KS14 3064-3477; AT14 3592-3894; KR14 4199-4378; ACP14 4486-4547
NlmA7 comprises 2 modules: modules 15 and 16. Module 15 contains 4 domains (KS15, AT15, KR15, ACP15) and introduces a propionate extender unit that ultimately forms the C3-C4 carbon backbone of nanoligomycin, with a methyl branch on C4. Module 16 contains 6 domains (KS16, AT16, DH16, KR16, ACP16, TE) and introduces an acetate extender unit that ultimately forms the C1-C2 carbon backbone; the TE domain then catalyzes cyclization of the polyketide chain and its release from the polyketide synthase. The amino acid positions of each domain are shown in Table 7.
Table 7. Domains of the polyketide synthase NlmA7 and their amino acid positions
Module      Domains and amino acid positions in sequence 8
Module 15   KS15 32-458; AT15 572-874; KR15 1177-1357; ACP15 1470-1535
Module 16   KS16 1556-1980; AT16 2076-2363; DH16 2423-2610; KR16 2925-3104; ACP16 3213-3280; TE 3371-3588
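The totals quoted for the seven polyketide synthases (17 modules, 79 domains) can be cross-checked against the per-module domain lists in Tables 1 to 7. The sketch below does this tally; the data layout and variable names are illustrative assumptions, while the domain lists are taken from the module descriptions.

```python
from collections import Counter

# (protein, module label, domains) as listed in the module descriptions.
PKS = [
    ("NlmA1", "loading", "KS AT ACP"),
    ("NlmA1", "1", "KS AT KR ACP"),
    ("NlmA1", "2", "KS AT KR ACP"),
    ("NlmA1", "3", "KS AT DH ER KR ACP"),
    ("NlmA2", "4", "KS AT KR ACP"),
    ("NlmA2", "5", "KS AT KR ACP"),
    ("NlmA2", "6", "KS AT DH KR ACP"),
    ("NlmA3", "7", "KS AT DH ER KR ACP"),
    ("NlmA3", "8", "KS AT DH KR ACP"),
    ("NlmA4", "9", "KS AT DH KR ACP"),
    ("NlmA4", "10", "KS AT DH ER KR ACP"),
    ("NlmA5", "11", "KS AT DH KR ACP"),
    ("NlmA6", "12", "KS AT KR ACP"),
    ("NlmA6", "13", "KS AT KR ACP"),
    ("NlmA6", "14", "KS AT KR ACP"),
    ("NlmA7", "15", "KS AT KR ACP"),
    ("NlmA7", "16", "KS AT DH KR ACP TE"),
]

total_modules = len(PKS)
total_domains = sum(len(domains.split()) for _, _, domains in PKS)

# Number of domains carried by each protein.
domains_per_protein = Counter()
for protein, _, domains in PKS:
    domains_per_protein[protein] += len(domains.split())

if __name__ == "__main__":
    print("modules:", total_modules)
    print("domains:", total_domains)
    for protein, count in sorted(domains_per_protein.items()):
        print(protein, count)
```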
Nanoligomycin modification genes:
The following are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the 2 open reading frames, nlmB and nlmOI, that encode proteins involved in oxidative modification of the nanoligomycin polyketide chain.
Sequence 1 contains 2 genes responsible for oxidative modification of nanoligomycin, nlmB and nlmOI (Fig. 2), which encode a cytochrome P450 and an oxidase, respectively. During nanoligomycin biosynthesis they catalyze oxidation at C12 and C28, respectively, forming a hydroxyl group and a keto group at the corresponding positions. Their nucleotide positions, amino acid sequences and functions are shown in Table 12.
Table 12. Nucleotide positions, amino acid sequences and functions of the nanoligomycin modification genes
Gene    Base positions in sequence 1   Amino acid sequence   Function
nlmB    83059-81845                    Sequence 9            Cytochrome P450
nlmOI   101284-101787                  Sequence 10           Oxidase
Nanoligomycin transposase genes:
The following are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the 2 open reading frames, nlmTI and nlmTII (Fig. 2), that encode proteins involved in transposition of the nanoligomycin biosynthetic gene cluster. Their nucleotide positions, amino acid sequences and functions are shown in Table 13.
Table 13. Nucleotide positions, amino acid sequences and functions of the nanoligomycin transposase genes
Gene     Base positions in sequence 1   Amino acid sequence   Function
nlmTI    255-1208                       Sequence 11           Transposase
nlmTII   98193-96412                    Sequence 12           Transposase
Nanoligomycin regulatory gene:
The following is the nucleotide sequence, or complementary sequence, and the corresponding amino acid sequence of the 1 open reading frame, nlmRI (Fig. 2), that encodes a protein involved in regulating nanoligomycin biosynthesis. Its nucleotide positions, amino acid sequence and function are shown in Table 14.
Table 14. Nucleotide positions, amino acid sequence and function of the nanoligomycin regulatory gene
Gene    Base positions in sequence 1   Amino acid sequence   Function
nlmRI   2168-4990                      Sequence 13           Transcriptional regulator
Nanoligomycin precursor synthesis gene:
The following is the nucleotide sequence, or complementary sequence, and the corresponding amino acid sequence of the open reading frame encoding CcrA, namely ccrA (Fig. 2). Its nucleotide positions, amino acid sequence and function are shown in Table 15.
Table 15. Nucleotide positions, amino acid sequence and function of the nanoligomycin precursor synthesis gene
Gene   Base positions in sequence 1   Amino acid sequence   Function
ccrA   94705-96048                    Sequence 14           CcrA
Sequence 1 is the nucleotide sequence, or complementary sequence, of the nanoligomycin biosynthetic gene cluster. It is 104096 bases long and contains 13 genes in total: 7 genes encoding polyketide synthases (nlmA1-nlmA7), 2 genes involved in modification during nanoligomycin biosynthesis (nlmB, nlmOI), 2 genes involved in transposition of the nanoligomycin biosynthetic gene cluster (nlmTI, nlmTII), 1 nanoligomycin regulatory gene (nlmRI) and 1 nanoligomycin precursor synthesis gene (ccrA).
Sequence 2 is the amino acid sequence of the type I polyketide synthase NlmA1 encoded by the nlmA1 gene (bases 5469-23915 of sequence 1).
Sequence 3 is the amino acid sequence of the type I polyketide synthase NlmA2 encoded by the nlmA2 gene (bases 23938-38337 of sequence 1).
Sequence 4 is the amino acid sequence of the type I polyketide synthase NlmA3 encoded by the nlmA3 gene (bases 38626-50133 of sequence 1).
Sequence 5 is the amino acid sequence of the type I polyketide synthase NlmA4 encoded by the nlmA4 gene (bases 93935-82242 of sequence 1).
Sequence 6 is the amino acid sequence of the type I polyketide synthase NlmA5 encoded by the nlmA5 gene (bases 82170-76564 of sequence 1).
Sequence 7 is the amino acid sequence of the type I polyketide synthase NlmA6 encoded by the nlmA6 gene (bases 50289-64196 of sequence 1).
Sequence 8 is the amino acid sequence of the type I polyketide synthase NlmA7 encoded by the nlmA7 gene (bases 64243-75024 of sequence 1).
Sequence 9 is the amino acid sequence of the cytochrome P450 NlmB encoded by the nlmB gene (bases 76408-75191 of sequence 1).
Sequence 10 is the amino acid sequence of the oxidase NlmOI encoded by the nlmOI gene (bases 94138-94641 of sequence 1).
Sequence 11 is the amino acid sequence of the transposase NlmTI encoded by the nlmTI gene (bases 255-1208 of sequence 1).
Sequence 12 is the amino acid sequence of the transposase NlmTII encoded by the nlmTII gene (bases 98193-96412 of sequence 1).
Sequence 13 is the amino acid sequence of the transcriptional regulator NlmRI encoded by the nlmRI gene (bases 2168-4990 of sequence 1).
Sequence 14 is the amino acid sequence of CcrA encoded by the ccrA gene (bases 94705-96048 of sequence 1).
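The base ranges listed for sequences 2 to 14 are enough to lay out the cluster from left to right and to check that no two genes overlap and that all fall within the 104096 bases of sequence 1. The sketch below does this under the assumption that a range written from a higher to a lower position denotes the complementary strand; the function names are illustrative.

```python
SEQUENCE_1_LENGTH = 104096

# Base ranges as listed above (1-based, inclusive).
GENES = {
    "nlmA1": (5469, 23915), "nlmA2": (23938, 38337), "nlmA3": (38626, 50133),
    "nlmA4": (93935, 82242), "nlmA5": (82170, 76564), "nlmA6": (50289, 64196),
    "nlmA7": (64243, 75024), "nlmB": (76408, 75191), "nlmOI": (94138, 94641),
    "nlmTI": (255, 1208), "nlmTII": (98193, 96412), "nlmRI": (2168, 4990),
    "ccrA": (94705, 96048),
}


def cluster_map(genes):
    """Return (left, right, strand, name) tuples sorted by left coordinate."""
    rows = []
    for name, (start, end) in genes.items():
        left, right = min(start, end), max(start, end)
        rows.append((left, right, "+" if start <= end else "-", name))
    return sorted(rows)


def overlaps(rows):
    """Return name pairs of neighbouring genes whose intervals overlap."""
    return [(a[3], b[3]) for a, b in zip(rows, rows[1:]) if b[0] <= a[1]]


if __name__ == "__main__":
    rows = cluster_map(GENES)
    for left, right, strand, name in rows:
        print(f"{left:6d}-{right:6d} {strand} {name}")
    print("overlapping neighbours:", overlaps(rows))
```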
The sequences provided by the present invention are as follows:
SEQUENCE LISTING
<110> Shanghai Jiaotong University
<120> Nanoligomycin biosynthetic gene cluster
<160> 14
<170> PatentIn version 3.1
<210> 1
<211> 104096
<212> DNA
<213> Streptomyces nanchangensis n. sp. NS3226
<400> 1
atggaacggt gggtgcagac ctgcagacgg gaactcttgg accgcaccct gatctggaac 60
caccggcacc tgctccacgc cctgcgcgag ttcgagcagt tctacaacgc acaccggccg 120
caccagggca tcgcgaacgc cagaccgctg cacgccttac ccaggccgat cgacgatcct 180
gagcagatca gccgtctcga catacgacgc cgcgatcgac tcggcgggat cctccacgag 240
taccaacatg ccgcatgacc agcacggatg acattctcgg caagggcacg accatcatct 300
cgcgacgatc gacagccgcg cgagagcacg ggggcgagcg ccttccaacg agactcccta 360
caccctcgca caccacctcc tccagggcgg acgggttttc ggcaggcgca acgctgctga 420
cctggcaccg tcgtctggtg cgcgcgagag aacccgacct gggggtacgt caggttccag 480
ggcgagctgc gacggcttgg ccatcgggtt gccgccgcac tatccgccgc gctctgcgcc 540
gctccggttt accgcccgca ccgcagcgcg cctcccagca gacgtggcgt tccttcctgc 600
gctcccaggc ccatacgctg ctcgcctgcg acttcatgcg tgtggagacc gtcttcctca 660
aacgtctcta cgtcttcttc gtcatggaga tcaagactcg gcgcgtccat gtcctgggcg 720
tcaccgttcg ccctacgggc gcatgggtca cccagttcgc ccgcaacctg ctcaaggatc 780
tcgaggagag ggctgggtgc ttccggttcc tcatccgtga ccgggacagc aagttcaccg 840
ccgcgttcga cgccgtcttc gccgacaatg gcacagccgt catcccgacc ccgccgcaaa 900
gccctcggtc caacgcgttc gccgaacgat ggatacgcac agcccgcgct gaatgcaccg 960
accgaatcct catcaccggc gaacgacacc tccgtgccgt ccttaccacg tacgccgagc 1020
actacaacac cggacgggcc caccgcagcc tcgacctacg cgccccagac gaccgcccga 1080
gcgtcatccc cctgcctgct gcggtagtcc gacggcgacg gctacttggc ggcctgctca 1140
acgagtacca caccacgcca ccccaacgac ttctccatcc acaagaaaca cccagctcag 1200
cggcctgatc gggatattga cacccttcac gcctcctgct ctcggatcga cggcaggtcg 1260
tgcatgaaaa ccgcctacct gtcaccgctg aaccgctgca tgatgcgctt ggtcagctcg 1320
cctggcggcc ctggctgccg gcggacacag ccgacggctc cgccagcctt tcccggaaca 1380
ggtcgggtga aggagccgcc accgtgatac tgccgggttc cggctccttc agtaattcaa 1440
tgccggggga acgcacgggt ccgtccgcgc accagcatcc tcttgaggtc atgaagtaac 1500
aggggggtgg aaaaaggggg cgtcttttgt gcggtcatgg agcgctcgtt cgggaatgca 1560
cgctaccagc tgattctatg agtatgctgc ggattctcta tgcttcctct cgcgcgacga 1620
ccgatgggga acaatggaga tatagagcgg aagaaggaat tccgcagcgg tccgcgttag 1680
cgtccccgta ttcctgcgga ggccggcgtg cggaccggat accgcggctt gcgcaccgat 1740
ttttcgaagg gaccctgatg aacagcactg tgaccgcctg gaaggacgcc atgtgccggg 1800
agggtctcgg ggcgggcccg gaccacccgg cgggcgtggc ggacctcatc gccgaccttc 1860
cgatcgagga gggttccgcg gccgcggcca ccctttccac gcagccgttc ttcgactgct 1920
gcggctgacc cgctcgacca ccccgcggtc gccgcccgca cgggcggcca ccctgccgcc 1980
gacccgttcc cgcgcggatc ggcgccggtc tgcccgtaag aaacgggccg gacggatcgt 2040
ggcgtattca cggaatgtga cgactggaaa ggggggtaag ggtcgtgtcc aggtatgacg 2100
ctggatggtc tggcatatac ctcaaccgtg acgagcgatt ccagattcgg gcagaggggg 2160
tgatgggatg aaactctcgg aaccctccta ttatccggag attgtcgaac gctccgaaga 2220
aatctcgttg cttgcccaag acctcgcaaa caccaagcgc ggcgaaggtg ccgtggtcgt 2280
catccattct gggcccggag tcggtcgtac ggcactgctc gatgaattcc tgcggcagtc 2340
tgggaattcc ggggcccggg tgtgcgccgc cacgggatcg gccgcggaga ccggcaacga 2400
gttgggcgtc gtcacccagc tgttcccgga agacgggccg atcgccgctg cggtctggct 2460
ggcccgggcg ctcgacgacc accacggcga cccgtccccg gatgccgacc ggctcttcga 2520
catgctgcgc ggggagttcc ggcagggccc gctggtgctg gcggtcgacg acgttcagct 2580
ggccgacgcg gcgtctctgc ggttcctgct gcacctcata cgccggctgc gcaccactcc 2640
cgtgctgatc gtcctcactg agcccgtcgg atcgtgcgcc ctcccgctcg ctttccaggc 2700
ggaacttctc cggcatcccc ggtgccgacg tcttcggctg cagccgctgt ccgtggacgg 2760
cgtcacccgg atgatagagc cctacgtggc cgagaccgag gtggcgcggc tggccaccca 2820
gttccatgcc gtcagcggtg gcaaccccgt cctggtgcgc gggctgctgg ctgatcaccg 2880
ggccggacaa cggctggaag agcagggcat cggcgcacaa tacaacggat acccggcctt 2940
cactcaagcc gcactggtct cggcttaccg ggacgacccc gtacttttcg aggtcgtttg 3000
cggcattgcc gtcctcggcg agaacgcgtc tcccgccctc gtggcctgcc tggtcgaccg 3060
gggagccgat gtggtggccc gtgtcatgac cgcactgaac acggcaagcc tgctcaatgg 3120
ccccgccttc cgtagcccac tcgttgcgaa ggccctgctg gagctcctgg atgtggagac 3180
tcgcggagag ctgcaccggc gcgcggccga gctgctccat gccgacgcgg cacttcctgc 3240
cgacgtcgcg caccatctgc tcgctacccc gatcgccgaa tcctgggtgc tgccgacctt 3300
gctcgccgcg gccgagcagg ctgtccaggg cggcgggcag gacttcagac tcgactgcct 3360
ccggctggca ggccgacagg cggcaaccga ggaggaacgt gccgccgtcg tcgccgcccg 3420
ggtccggatc ggctgggaga tcgatccccg gctgatcacc ccatggctcg gcgaactcgg 3480
cgccgcactc cgccgaggac acgtcggcag ctcggacgct gcctggaccg tcaaacactt 3540
tgtatggcac gaccatgtcg aggaggccgc cgacatcctt tccgcactga tggagcgaac 3600
cgaggagaac agcgacgcgc acgccgaact cgagatcgtc cggcattggg tgcggtacac 3660
ctgccccact ttactcgagg gatcggtgga cgcagatgcg ccctccctgt ccggtccgtt 3720
cccgcagcgg ttccaactga gaccggcctc gtacgccgtc gagatgcttg gccggctttt 3780
caccgagggc ccctgcgatc aggcggcggc catggccgag gagatccttc gcggctgccg 3840
gttcggtgag accaccgtcg aagctgtcga aggagccctg ctggtcctcg tctatgccga 3900
acgtcccggc cgggcactgc actggtgtga ggcgctgctg gagcaggcag gagatcaccc 3960
caccggcaca gccgctgcga tcctgagcag tattcgcgcc gaaatcgccc ttcggcaagg 4020
cgcattggaa gaggccgaga cgtacgcgga ccgggccctc aacgccatct cacggctggg 4080
ctggggggtg gccatcggct cgcccctggc cgtccgggta cgggccgcga tggccgcggg 4140
tcgcaccggc ctggcagggg cctggctgaa tcaggacgta ccccagggga tgttccgcac 4200
ccgccacgga ctgctgtaca tgcacgcacg cggtcattac cacctggcca ccgaccgccc 4260
gactgtcgct ctggaggact tcctgacctg tggccggctg gccaaggagt ggggcatgga 4320
cgtgcccaca tttctgccgt ggcgcacctc agccgcgctg gcccacctgg ccctgggcaa 4380
cggcagccgg gccagtgcct tggcacggga gcagctgacc cggcccggcg gcggctggcc 4440
gaggtgccgg gcggtgtcgc tgcgggtgct cgccgccacg agcgaactcg accgccgccc 4500
tgctctgtta cgtgagtcgg tcaatctgct ggagagctgt ggcgatcacg tggagttgct 4560
gcattcgctg gccgaccagt tccaggcgct gtccgaagcg ggggcacccg cgaaggcacg 4620
gattgcggcc cggcatgcca gaaccgtcgc cgacaattgt ggcacggaga cgctctttcg 4680
caggctgttc aaggaggagg tgcccgagga caccgacgaa tcggccgact tcgggcagga 4740
ccaccagggg tttgccagcc tgaccgacgc ggagcggcgg gtcaccgccc tggccgccct 4800
cgggtactcc aaccgggaga tcggacgcaa gctcttcatc accaagagca ccgtcgagca 4860
gcacctcacc cgggtctacc ggaagctcgg ggtacgcaac cgggccgacc tcggcgacct 4920
gctcgccggg atcaacctcg cagcccagcc ccaggtgatg ggcaggacgt cctcggccgc 4980
cgtcggctga ggacgcaccc cgcggccgac cgtctcgccg ctacaaagag ctgtgcgcgg 5040
tgctggggga ccggcgtttc agctcggacg ccaggccaca ctcgcactgc cgcgcaaccc 5100
ggagcaactc gccctgctgc acgaggatcg tccctgatca aggtgccatc gaggaactgc 5160
tgcgctacat cacgatcgtg cagaacggcg tggaacgcgt cgccacggca gcggtatgag 5220
cagcgatgcg ggggcacccg gcgcggggtg tccccgcacg ctgtcccggc ccctgccccc 5280
aggacaccga ctctgcccgt gacacggaaa gcgcaagtca gcaggggcgg aggccttacg 5340
gcggtcccag ctaaggggtc gcctagtggt tgaggctagg ggccgcccgc tcggatattc 5400
ggtgtgacct gcggccgggg tgccgcattg aggcgcgctt caggttccta gcgacgtaag 5460
ggaaacgcat ggcgggtgga tccgagtcag aggccgctga gttcacggcg cgatccgccc 5520
agccgatcgc ggtggtggga atggcgtgcc ggctgcccgg tgcggcggga ccggcggaat 5580
ttcgcgccat tctccgcagc ggtacggaag ctgtcggcgc cgccgccccg gatcgtccgt 5640
acgccccgcc gcggggtggc ttcctggact cggtggaccg tttcgacgcc ggattcttcg 5700
gtgtctcgcc gcgcgaggcc gcggtcatgg acccgcagca gcggctgatg ctggaactgt 5760
gctgggaagc actcgaagac tcaggcatcg tgcccgcccg cctcgacggt agcgacgccg 5820
gtgtcttcgt gggcgccatc accgacgact acgccgtact gtcccgggcc gccggcgtgg 5880
acgccgccac cccggagacg agcaccggcc tcaaccgggg catgatcgcc aaccgggtct 5940
cctacaggct gggcctgcgg ggtccgagct ttacggtcga ctcgggacag tcgtcgtccc 6000
tggtcgccgt gcacctggcc accgagagcc tgcgccgggg cgagtgctcc ctggctctgg 6060
ccggcggggt gaacctgatc ctcgcagagg acagcacggc cgccgtcgaa cgcttcgggg 6120
cgctctcccc ggacggccgc tgctacacct tcgacgctcg cgccaacggc tatgtgcgcg 6180
gcgagggcgg cggtgtcgtc gtcctcaagc ggctcactga cgcggtcgcg gacggcgacg 6240
acatcctgtg cgtgctcgcg ggcagcgcgg tgaacaacga cggtggcggc gaaggcctga 6300
ccgtacccga ccgccagggt caggaggccg tgctcaccgc cgcgtacgag caggcgggga 6360
tctccccgaa cgccgtcgga tacgtggaac tgcacggcac gggaacccct gccggtgacc 6420
ccgtggaggc cgcggccgtc ggtgccgtgc tcggcgcggg ccgcagtgcg gaacagccgc 6480
tgctggtcgg ttcggtgaag accaacatcg gccacctcga aggcgccgcc ggtatcgccg 6540
gactcctcaa ggccgtgctg accgtacgcc accgcgagat ccacgcaagc ctcaacttca 6600
ccacccccag cactcgcatc cccatgaccg agctgggcct gagtgtcaac acggcactgc 6660
gtccctggct gagcgaggcc ggcccgctga tcgtgggcgt cagctccttc ggcatggggg 6720
gcaccaactg tcatgtcgtc ctcacggaat ggcacggcgt cgcaccggtg accgcacccg 6780
gcatccgccc caacgggaca gcggtgcccc tcctcatcac cggccgggac gagcaggcgc 6840
tgcgcgacca ggcgcaccac ctgggccggc acctcgacga gcacggtccg ctgcgcctga 6900
aggacgtcgc ccacaccttg gccgccggcc gcacggcgtt cgagcacagg gccgtgctac 6960
tcgtccgcga gccgcaggac atgaccgacg gcctcgcccg gctcgccgac ggcacgcccg 7020
gcccggacct cgtacgcgcc accgcgacct gtagctccct cgccttcctg ttcaccggac 7080
agggcagcca gcgccccggc atgaccgccg agttgtacca gtcctcgtcc gagtacgcgg 7140
ccgccctcga cgaggtctgc gcccatctcg atccccagtt gcgggtgccc ttgcgggagg 7200
tactcttcgc cgcggaagga acggcggaag cggtcctgct cgaccgtacg gagttcaccc 7260
agcctgcact gttcgccgtc gaagtcgccc tcttccggtt cgcggagcac tgtggcctcg 7320
tcccacggct gctgctcggc cactccgtcg gcgaactggc cgcggcgcac gtcgccggcg 7380
tcctgtccct cgccgacgcc tgcagcctgg tcgccgcgcg cggccgactg atgcaggccc 7440
agccggccac cggggcgatg gcggccatcc aggctacaga gaaggaactt gcgccgttcc 7500
tcgacgagtc ggtggcggcg gccgccctga acggcccggc ttctaccgtc cttgcgggcg 7560
acgaggaagc cgtcctggcc atcgccgcgc actgggcggc caagggccgc agaaccaagc 7620
ggctgagagt cagccacgcc ttccactcgc cgcacatgga cggcatgctc gaggagttcc 7680
accgggtcgc cgggcagctg accttcgagg ccccccgtgt cccgatcgtg tcgaacgaga 7740
cgggcgccct gctcaccgag gcggaagcgt gctcgccgga gtactgggta cggcaagccc 7800
gcgtgaccgt gcggttcctg gacggagtgc gcctgctgga ggagcagggt gtgaccaccc 7860
tgctcgaact cggccccgac ggcacgctgt cgtccctggc ccgggactgc ctgcgcggcg 7920
tcgacgccgt gtccgtgccc ctgctgcgcg gccgcaccga accggaggag gtggtcgccg 7980
ccctggccac cctccaggtc cgtggtgtgc cgatgcactg ggagcggctg gccaccgagg 8040
agggcgcccg gcgggtgccg ctgcccacat acccatttca gcggcggcgc cactggctgc 8100
ccgacctggt cgcccaggat tctgtgcctg cccccggccg ggctgccgga cagcggtccc 8160
gtcccgtcaa cgagccggcg ccgtcggcgc acgcaccgcg cggcgaccgt acgatgcggg 8220
agaccgtccg ggcagccgtg gcactggtgc tcgggcacga ctccccggac gacatccccg 8280
cgcacacgac gttcagggag ttggggctca gctccctgat gctggccgaa gtcggcgagc 8340
ggctcaccga ggcgaccggg cgccgggtcc ccacgaccct gctcttcgac cacccgactc 8400
cggacgcact cgtacgcgag ctgacgtccg ggggtgctga acggcccgcg gcgctcacca 8460
ctgctccctc ggcggcgcac gccgacgacc ccgtcgtggt cgtgggcatg gcctgccggc 8520
tgcccggagg gatccggtcg ccggaggagt tctggcagtt catggcggcg gacggcgacg 8580
ccatctctcc gctgcccacc gatcggggct gggccgtctc cggggacttc cccgccgagg 8640
gcggtttcct ggcggacgtg gccgggttcg acgcggcgtt tttcgggatc tcgccgcgtg 8700
aggcgttggc gatggatccg cagcagcggc tgctgctcga gacgtcgtgg gaggcgctgg 8760
agcgggccgg ggtggacgcg ctgtcgctgc gcggcagccg caccggcgtc ttcgtcggcg 8820
cgagcccctc ggaatacggc cccagactcc acgaaccttc gcaagccgac ggacacgtgc 8880
tgaccggtac ggcgcccagt gtgctgtccg gccgggtggc ctatgtgctg ggtcttgagg 8940
gtccggcgct gacggtggac acggcgtgct cgtcgtcgct ggtggcgctg catctggcgg 9000
cgcaggcgct gcggggcggt gagtgcgact tggccctcgc cggcggtgtg gcggtgatgg 9060
cgacggcggg catgttcgca gagttcgcgc ggcagggggg tttggctcgt gatggccggt 9120
gcaaggcgtt tgcggatggt gcggatggta ctgggtgggg tgagggtgtc ggggtgctgg 9180
tgctttcgcg tttgtcggag gcgcgtcggt gtggttacac ggtgttggcg gtggtgagtg 9240
gttcggcggt gaattcggat ggtgcgtcga atggtttgac ggcgccgaat ggtccgtcgc 9300
agcagcgggt gattcgtcag gcgttggcgt cggcggggtt gtcgccgggg gatgtggatg 9360
tggtggaggc gcatgggacg gggacggcgc tgggtgatcc gatcgaggcg caggcgttgc 9420
tggccacgta tgggcaggag cgtggggcgg ggcggccgtt gtatgtgggt tcggtgaagt 9480
cgaatattgg gcatgtgcag gcggctgcgg gtgtggcggg tgtgatcaag tctgtgctgg 9540
cgttgcggta tggggtgctg ccgcggacgc tgcatgtgga tgtgccgtcg cgggaggtgg 9600
actggtcggc gggtgcggtg gagttgctga ctgaggcggt ggagtggctg gcggggggcc 9660
gtccgcggcg ggtgggggtg tcggcgttcg ggatcagcgg taccaacgcc cacgtgatcc 9720
tggaggaggc gccggagggt gtcgaggaga gcgcggctgg tgaggttgcg ggtgtggtgc 9780
cgtgggtggt gtcggcgcgg tcggaggagg ggttgcgggc gcaggctgcc cggttggtgg 9840
agcatgtggt gggcgggtct gggctggggc cggtggatgt gggctggtcg ttggcccggt 9900
cgcgtgcggt gttggagcac cgggcggtgg tgttgggagg ggatggggag gagttggtgg 9960
cggggcttcg tgcgttgtgc gatggggtgt tggggccggg tgtggtgcgg ggtgtggctg 10020
gtgatggtgg gacggcgttg ttgttcacgg gtcagggtgc gcagcgtgtg ggtatgggcc 10080
gggagttgta tgaggcgttt ccggtgttcg cggcggcgtt tgatgcggtg tgtgccgggt 10140
tcgaggggat gttgcccggg tcgttgcggg gtgttgtttt tggtgatggt ggcggggttg 10200
tggaccgtac ggagtgggcg cagccggcgt tgtttgcgct ggaggtggcg ttgttcgagt 10260
tggtcgtgtc gtggggtgtg cgggcggatg tgctggtggg tcactcggtt ggtgagttgg 10320
tggcggctca tgtggcgggt gtgtggtcgt tggcggatgc gtgtcgggtg gtggcggcgc 10380
ggggtcggtt gatgcaggcg ctgcccgttg gtggggcgat ggttgcggtg cgggtgggtg 10440
agggggagtt gccggtgttg ccggaggggg tgtcggtggc ggcggtgaac gggccgcggt 10500
cgttggttct ctccggggat gaggggccgg tgcttgagct ggcggcgcgg ctggccgggg 10560
agggccggga taccaggcgg ttgagggtct cgcacgcgtt ccattcggcg cggatggagc 10620
cgatgctcgc tgagtttgcg caggtgctgg cggcggtgga gttccgtgcg ccgcggatcc 10680
cggtgatctc caacgtgacc ggtgaggtgg ccggcgagga gctgaccacg cccgagtact 10740
gggtgcgtca ggtacgcgag gccgtccgct tcgccgacgg agtgaacacc gcacacggct 10800
cgggcgtccg gcgttatctc gaactcggac ccgacggcgt cctgacctcc ctcgctcacg 10860
acatactggc cgagcagggc atcgaccggg atgtggccgt cgtacccgcg ctccgccatg 10920
accagcccga atcccgcacg ctgctgaccg ccctcggcca actgcacacc accggcatgg 10980
acgtgggctg ggcggccttc ctcgcgccgt acggcgcccg caccgtcgag ctgcccacct 11040
acgccttcga acaccaccgt tactggttgg accccgtcgc acccgcctcg gcacctgcgg 11100
atcctctccg ctaccgcgcc gagtgggcga gtgtgccgga ctgcgccacg ccgtcgctga 11160
gcggtgtcca ggccgtcgtc gtccccgcgg gcgggggcca cctggatgtc ctgccggacg 11220
ttacggccgc cctccgggag cacggtgcgc ggaccgtgct ggtcgaggtc gacccggagc 11280
gagccgatcg cgccgagatc gccgacgccc tgcgcgcggc gctcggcgag gaaggcggcg 11340
gcgtggtgtc gctgctcgcc ctggaccgcg ggcccttcgc gggcgtcgcc gcgaccgctg 11400
tgctgctgca ggccctcacc gggctcgacg gcggtggccg cctgtggtcg ctgacgcgtg 11460
gcgcggtgtc ggtgagccgc tccgacgcgc tgaccgaccc cgggcaggcc caggtgtggg 11520
ggatgggccg cgtcgcggca ctggagcacc ccgagcgctg gggcggcctc gtcgacctgc 11580
ccaccgagct ggacgaccgg gcgcgggctc ggctgtgtgc cgtactgtcg ggcagcaccg 11640
gtgaggatca ggtggccgtg cgggcggcgg ggctgtacgc ccggcggctg caccgcgtgg 11700
cgccccgggt gcccaccacc gaggacgcgg gcgccgcctc cggccagggg gtgggcgacc 11760
gccgggcgta tacgtacggc accgtgctgg tgaccggcgg caccggcgcc ctgggcgcgc 11820
acatcgccaa ctggctcgcc aggtccggca cccggcatgt actgctcacc agccgccgtg 11880
gcccggacgc cgagggcgct gcggacctca ccgcgcggct gcgggagctg ggcaccgagg 11940
tgaccgtcgc cgcatgcgac gtggccgacc ggcagcgcct ggcggacctg atcgccgcac 12000
tgtcggcgga ccgaccgctg acgggtgtcg tgcacgcggc cggtgtcctc gacgacgggg 12060
tgctcgactc gctcacccca gaccggttcg acgcggtcgc ccggcccaag gtgatcggcg 12120
cccggcacct gcacgaactc acgcgcgatc tcgacctgtc cctgttcgtg atgttctcgt 12180
ccgtcgtcgg cacggtcggc ctggccggac agggcaacta cgcggccgcc aacgcctacc 12240
tggacgccct cgccgtgcac cgggcccagc acggcctgcc ggcaacggcg gtggcctggg 12300
gctcctggtc cggcgctggc atggccggcg acacccgggc cgcccgtgac cggctggcgc 12360
gcgccggcct ggcgcccctc gaccccgccg ccgccctcgc cgtgctcgac cgggtcatcg 12420
ccgacggcga gaccgccgtg accgtcgccg atgtggactg ggagcggttc gcggccgggt 12480
tcgcccctgg caggccgcac ccgctgctcg ccgggatccc cgagctatgg cacgcccggc 12540
cgcaggagac cggccaggtc accgatgggc cggcggaccg gcttgccgga ctggcgggtg 12600
acgaactgcg ccaggcgctc gacgacatgg tgaccgtgga ggtcgccgct gtgctggggt 12660
tccgggccaa ggaccgggtg ccgaccgacc gcaccttcaa gtcgctcggc ttcgactcgc 12720
tgatcggcgt ggagttccgc aaccggctcg ccgccgcact cggcaggcgg ctgccgccca 12780
gcctgatcta cgaccacccc acgccaggca ggctggtaga gcacctggcc gccggagtgg 12840
acggcggcga ccagccctcg accgtcggcg ggcgaccggt tgcccccaca cgcacccacg 12900
acgaccccgt tgtgatcgtg tccgccgcct gccggttccc cggtggcgtg cgtaccccgg 12960
aggacctgtg gcagctcgtc ctcgacggcg gcgacgccat cggccccttc ccggtggacc 13020
ggggctggga cctcgaccgc ctctacgatc ccgaccccgg cgcgtcaggc accagttacg 13080
tccgcgaggg cggtttcctc accggcgtgg cggacttcga cgcggtgttc ttcgggatct 13140
cgccgcgtga ggcgctggcg atggatccgc agcagcggct gctgctcgag acctcgtggg 13200
aggcgctgga gcgggccggc atcgtcccgg gctccctggc cggcagccgg accggcgtgt 13260
tcgtcggctc caacggccag gactacgcga acctgctgca ctcctccgat gtcgaggggc 13320
atgtgctgac cggcacggcc tccagcgtcc tgtccggccg catcgcctac accctggggc 13380
tcgagggccc cgcgctgacc gtcgacaccg cctgctcctc ctcgctggtc gccctgcacc 13440
tcgccgtcca ggcgctcagc tccggggagt gcgacctcgc gctcgcgggc ggtgtgaccg 13500
tcatgtccgg atccgacata ttcgtggagt tctcccggca gcgcggcctg tccgccgacg 13560
ggcgctgcaa ggccttcggc cccgacgctg acggcaccgg ctgggccgag ggcgtgggca 13620
ccgtcgttct ggaacggctc tccgacgcgc gccgcctggg ccatgaggtg ctgggcgtcg 13680
tgcgcggcac cgccgtcaat caggacggcg cctccaacgg gctcagcgcc cccagcgggc 13740
gcgcgcagca gcgggttatc cgccaggcgc tggccgacgc cggctgcgca ccgtccgacg 13800
tggacgcggt ggaggcgcac ggcaccggca cccggttggg cgaccccatc gaggcgcaag 13860
ccctgctcac cacctacggt caggaccgcc ccgccgaccg gccgctgtat ctcggctcca 13920
tcaagtcgaa tatcgggcac gcccaggccg ccgccggact ggccggcgtg ctgaagatgc 13980
tgttcgcact gaggcacggg cagctcccga agaccctgca cgccccgcgg ccgaccccgg 14040
aggtcgactg gtccgagggc gcggtcgccc tgctcaccga ggaccggccc tggccggccg 14100
tcgaccggcc gcgccgcgct ggcgtctccg ccttcggcgt cagcggcacc aacgcccacg 14160
tgatcctgga gcaggcgccc ccgtcggccg cctccgaccc ggcacccacc gttcggccgc 14220
ccgcggtgga cagctccgtc cagccgtggg tgctgaccgc caggtcgggg gaggcgctgg 14280
gcgcgctcgc ggaccgcttg cgcgaggcgg cacccggcgc ggtcccggcc gacgtcgcac 14340
gctcccttgt gacgaccagg acgatctggg cggagcgcgc cgtgctgctc gccgacggcc 14400
gtgacgagta cgcctccggg ctcgccgcgc tggccactgg agagggcgac gcgcgggtcg 14460
tgcgcggcac cgccgacacc cgcggccggg tcgtcttcgt cttccccggc cagggcgcgc 14520
agtgggccgg catggccgcc cggctgtggg agtcgtcgcc ggagttcgcg cggtggatgg 14580
atcgctgtga caaggccctc ggggacctga ccgactggtc cctcgccgag gtgatccacc 14640
aggccgacgg agcgcccgga ctggaccgcg tggacgtgct ccagccggcg tcctgggccg 14700
tgagcgtctc gctggccgcc ctgtggcgtt cctgcggggt cgaaccggcc gccgtggtgg 14760
ggcactcgca gggggagata gccgcggcgt gtgtggcggg tgccctctcg ctggaggacg 14820
gcgccatgct ggtgacgctg cgcagccggc tcatccgcga ggagctgtcc gggcacggcg 14880
gcatgatgtc ggtggccctg tccccggccg gcacggcgga ccgcatagcc tgctgggagg 14940
gcaggatctg cgtcgcagcg cacaacagcc gccgctccac cgtcgtcgcc ggcgagccgg 15000
cggcgctggc cgaactgctc gccgcctgcg aggcggacgg catacgggcc cgccgcatcc 15060
ccgtggacta cgcttcccac tcaccgcagg tggagcggat cgagcggaag ctgaccgagc 15120
tggcggccgg gatcgtgtcg cgctcctcgg agatcccctt ccattccacc gtgaccggta 15180
ccaggctcca caccacgggc ctggacgccg ggtactggta ccgcaacctc cgcaagcccg 15240
tgctgttcgg gccggtcacc gaggagctcc tcacccaggg ccacgacgtg ttcctggaga 15300
tgagcccgca ccccgtactg gtgccggccg tgcaggaggc ctccgacgcg gtcaccgcga 15360
cagccgccgc ggtgggcagc ctgcgccgag gcgacggcgg cccggaacgg ttcctgctct 15420
cgctggccga ggccttcgtc cgcggtgccc acgtggactg ggcggccgtg ctgggcggca 15480
ccggtacccg cctggtcgag ctgcccacct atcccttcca gcgcacgcgg ttctggcccg 15540
agccggtcac cccggccacc gcgaccggcg gccaggacga tgcaccgctg tggcaggccg 15600
tggagcgcgg cgacgtggcc gccgtggccg ccgaactggc tgtgccggac ggccggtcat 15660
tgcgtgacct ggtgcccgcg ctgtccggtt ggcgccgccg ccggagggac tccgcgacgc 15720
tcgacatctg gcgctaccgc gtcacctgga cacaggtgaa cctgcccgtg tcggccgccg 15780
tgaccggcga ctggctgctg gtgaccgacg accccgacac ggcggtcccc cggtgggtga 15840
gcgcggcgct cggcgagggc ctcgccaccg tggtgcggcc ggcggacgtc cccgcatggt 15900
cgcgcacgcc ccagggcacc gggtggacgg gcgtggtgtc cctgctgggc ctcacagatc 15960
actcgcaccc gtgtcacccc gccctgtcga ctggggtggc cgcaaccgtg accctgctga 16020
ccgcgctgcg agaggccggg atcgaggcac ctctgtggtg tctgaccagc ggcgccgtcg 16080
gcaccggtgg cctggaccag gtcacggcac ccaaccaggc ccagctctgg gggctgggcc 16140
gggtcgccgg gctggagacc cccgcgacct ggggcggact tgtcgacctg cccgccgaac 16200
ccgacgagcg caccgcggcc ttgctgcggg ccgcgctcac cgccgacgga atcgagcagg 16260
agtacgccct tcggccttca ggaccgtacg tacgtcggct ggttcgggcg cccctggcgg 16320
gcgtggcggc gccgcgctcc tggcgcccgc gacccgacgg caccgtcgtc gtcaccggcg 16380
gcaccggagc actgggcgcc agggtggccc gctggctcgc ccgcgcgggc gccgggcacc 16440
tgctgctgac cagccggcgc ggcccggccg ccgacggtgc cgtcgaactc tccgaggaac 16500
tgcgggcgct gggggccgaa gtgacgatca ccgcctgtga tgtcgccgac cgggcccagc 16560
tggccgatgt gctggcagcc gtgcccacgg cgtttccggt cagcgcagtg atccacaccg 16620
cgggcgtgag cggcaacgcc ccgctcgccg ggaccaccct cgccgagctg gccgaggtcg 16680
tcgccgccaa ggccgccggc gcccgcaatc tggacgagct gctggccggc caggacctcg 16740
acgcgttcgt gctgttctcg tccggagccg ccgtctgggg cagcgcgggc cagggcggat 16800
atgccgcggc caacgcgtac gccgacgcgc tcgcggccga ccggcgccgg cgagggctcg 16860
tcgccacgtc ggtggcctgg ggcagctggg ccggcggcgg catggtcgac gacgacctcg 16920
cgcgtgagct ggcccgcggt ggcgttcgct cgatggaccc cgaccgggcg atcgccgctc 16980
tccagcaggc gctcgaccac gacgagaccg cgctgacggt ctccgacatg gactgggccc 17040
gcttcgccga gacattcacc gccgcccgcc cgcgcccgct gatcgacggc atcccggagg 17100
ccgcccccgc ctcggccgaa ccggccggcg atatccccgg cctggccgcg cggctcgcgc 17160
agctgcccga cggcgagcgc gaccgggaac tgttggatct ggtcaggaac gccgcggcgc 17220
ttgccctcgg gcacacgggc accgagccca tcacaccgtc gaagccgttc aaggaactgg 17280
gcttcgactc gctgaccgcg gtcgacctgc gcaaccggct gacggcggcc accgggctgc 17340
ggctgcccgc caccctcgtc ttcgactacc ccacgccccg cgcggcggcc gacgcgttgc 17400
gggccgtgct gttcgccgcg gacatgccgg tcgacacggc cgcacccgcc cggagcgcct 17460
ccgcccgacc ggcggacgac gacccggtcg tcgtcgtggc gatggcctgc cggtatcccg 17520
gcggggcgac aacacccgag aagttctggg acctgatcgc tgcgggcgag gacggaatcg 17580
gaggctttcc caccgaccgt ggctgggaga tcggccccgg cgcggccttc tcccggaccg 17640
gcggtttcct ggcggacgtg gccgggttcg acgcggcgtt tttcgggatc tcgccgcgtg 17700
aggcgctggc gatggatccg cagcagcggc tgctgctcga gacgtcgtgg gaggcgctgg 17760
agcgggccgg ggtggacgcg ctgtcgctgc gcggcagccg caccggcgtc ttcgtcggcg 17820
cgagcccctc ggaatacggc accttggtcg cttccctgga gggaggccag gactatgccc 17880
tcactggcgc cgtcggcagt gtgctgtccg gccgggtggc ctatgtgctg ggtcttgagg 17940
gtccggcgct gacggtggac acggcgtgct cgtcgtcgct ggtggcgctg catctggcgg 18000
cgcaggcgct gcggggcggt gagtgcgacc tggccctcgc cggcggtgtg gccgtgatgg 18060
ccacccccaa cgccttcgac gccttcgcgc ggcagggggg tttggctcgt gatggccggt 18120
gcaaggcgtt tgcggatggt gcggatggta ctgggtgggg tgagggtgtc ggggtgctgg 18180
tgctttcgcg tttgtcggag gcgcgtcggt gtggttacac ggtgttggcg gtggtgagtg 18240
gttcggcggt gaattcggat ggtgcgtcga atggtttgac ggcgccgaat ggtccgtcgc 18300
agcagcgggt gattcgtcag gcgttggcgt cggcggggtt gtcgccgggg gatgtggatg 18360
tggtggaggc gcatgggacg gggacggcgc tgggtgatcc gatcgaggcg caggcgttgc 18420
tggccacgta tgggcaggag cgtggggcgg ggcggccgtt gtatgtgggt tcggtgaagt 18480
cgaatattgg gcatgtgcag gcggctgcgg gtgtggcggg tgtgatcaag tctgtgctgg 18540
cgttgcggta tggggtgctg ccgcggacgc tgcatgtgga tgtgccgtcg cgggaggtgg 18600
actggtcggc gggtgcggtg gagttgctga ctgaggcggt ggagtggccg gcggggggcc 18660
gtccgcggcg ggtgggggtg tcggcgttcg ggatcagcgg taccaacgcc cacgtgatcc 18720
tggaggaggc gccggagggt gtcgaggaga gcgcggctgg tgaggttgcg ggtgtggtgc 18780
cgtgggtggt gtcggcgcgg tcggaggagg ggttgcgggc gcaggctgcc cggttggtgg 18840
agcatgtggt gggcgggtct gggctggggc cggtggatgt gggctggtcg ttggcccggt 18900
cgcgtgcggt gttggagcac cgggcggtgg tgttgggagg ggatggggag gagttggtgg 18960
cggggcttcg tgcgttgtgc gatggggtgt tggggccggg tgtggtgcgg ggtgtggctg 19020
gtgatggtgg gacggcgttg ttgttcacgg gtcagggtgc gcagcgtgtg ggtatgggcc 19080
gggagttgta tgaggcgttt ccggtgttcg cggcggcgtt tgatgcggtg tgtgccgggt 19140
tcgaggggat gttgcccggg tcgttgcggg gtgttgtttt tggtgatggt ggcggggttg 19200
tggaccgtac ggagtgggcg cagccggcgt tgtttgcgct ggaggtggcg ttgttcgagt 19260
tggtcgtgtc gtggggtgtg cgggcggatg tgctggtggg tcactcggtt ggtgagttgg 19320
tggcggctca tgtggcgggt gtgtggtcgt tggcggatgc gtgtcgggtg gtggcggcgc 19380
ggggtcggtt gatgcaggcg ctgcccgttg gtggggcgat ggttgcggtg cgggtgggtg 19440
agggggagtt gccggtgttg ccggaggggg tgtcggtggc ggcggtgaac gggccgcggt 19500
cgttggttct ctccggggat gaggggccgg tgcttgagct ggcggcgcgg ctggccgggg 19560
agggccggga taccaggcgg ttgagggtct cgcacgcttt ccattcggcg cggatggagc 19620
cgatgctcgc tgagttcgcg caggtgctgg cggcggtgga gttccgtgcg ccgcggatcc 19680
cggtgatctc caacgtgacc ggtgaggtgg ccggcgagga gctgaccacg cccgagtact 19740
gggtgcgtca ggtacgcgag gccgtccgct tcgccgacgg agtgaacacc gcactgggcc 19800
gaggtgtgga caagttcctg gagttggggc cgtcgggccc gctgaccgcg atggccgagg 19860
aggtcatcga acacaccggc acccgagcgg tctgtgtccc cgtgctccgc gccggacgcc 19920
ccgaggacgc caccctgctg cacgcgctcg cggccgtgtt cgtcaccggc gccacagtcg 19980
gctggacggc tccgctcgcc ggtaccggag cgcgggccgt ggacctgccg acgtacgcct 20040
tccagcacaa gcggtactgg ccgcagccgg cgaccgtcgg ccgggacctg gccgcggccg 20100
ggctcgccga ggccgggcat ccgttgctga cggcctggct cccctcgccg gagggcgagg 20160
atgtgctgtg caccggccgg atctccctgg cgacgcatcc ctggctggct gaccatgcgg 20220
tgctgggcac cgtgctcgtc cccggcaccg cgttcgtgga cctcgcctgc tgggcgggcc 20280
accgagtggg gtgcggcgcg ctgcgtgaac tgaccctcgc cacgccgctg gcgctcgcac 20340
aggacatggc ggtgcggctg cggttggtgc tcggcgcgcc cgacgacacc ggctgccgcc 20400
cggtcgcgct gtactcgcag caggaaggcg cggacgaagg gacggacggg acgggctgga 20460
cgcggcatgc cgagggcctg ctggccccgg gcggcgacgc gtccgtacag ccgcccacgg 20520
acttcgagac ctggccggtg acgggctgcg agcccatccc actggacggt ttctacgaag 20580
agctcgccga cgcgggcttc tcctacgggc cggtctttcg gggcctgcgg gccgcatggc 20640
ggcgcggcgg ccaggtcttc gccgaggtga gtctgcccgc cgacgagacc ggtggcttcg 20700
gcgtccatcc ggccctgctc gacgcggccc tgcacgcgct ggggccggtc tcacgggaca 20760
cggacgagcc cggctcggcc cggctgccgt tctcctgggg cgaggtacgg gtgcacgcgg 20820
ccggcgccga ccgcttgcgg gtctgtctgg tccgggccga ggacggcacc gtcacgttgc 20880
atggcgcgga cgccgcgggc cggccggtgg taaccgtcgg ctcgctggtc ctgcgcccga 20940
tctcgccaga gcgactgcac ggcggcgcag cggcttttga cgacgcgctc ttcaccactc 21000
gctggatgcc gctgagcgtc gccgacggca tcgcatatcc cactgccgac tgcgtactgc 21060
tcggtgaccc tctggaacgc gcctggcggc accaccccga cctcgactcg ttcgccgagg 21120
cactcgcggc cggcaaggaa aaaccgggta cggtgctcgc tcgctgtccg cgggacatcg 21180
cggccggcgt cgaccctgcc gaggcggccc ggcggtgtgc ggagtgggcg ctcgacctgc 21240
tcaagcggtg gctggacgac gaccggctga cggactgtca tctcgtgatc ggcacccggc 21300
acgcggtgac caccggcgcc gaggaccaga ccgccggccg gacggacgac cccgccgtgc 21360
tcgcccagtc cacgcttctc ggcctggtcc gctcggccca gaccgagaac cccggccgtg 21420
tcaccctggc cgacttcgac ggcaccgcac ccgacccggc gcacctcatc ctggccgtac 21480
ggcaggcgga gccggaggtg gctgtgcgcg ccggccggct ttacgcccgc agactcaccc 21540
gtccggacac cgggcgggcc ctggccgtcc cgccgggagc gggctcgtgg cggctggaga 21600
gcaccgggcg cggcaccctg gacaacctgg cgctggttcc ctgcgcccag gccgaggagc 21660
cgctgggcga gggcatggtg cggatcgccg tacgcgccgc cggggtgaac ttccgggacg 21720
tcctgatcgt cctggacatg tatcccggcc gcgcggacct gggcaccgag tgtgccggag 21780
tcgtggtgga gacggggcac ggcgtcaccg gactggtccc gggggaccgg gtgatgggca 21840
tggtggccgg ggccttcgcg cctaccgccg tggtcgatca gcggttcctg gttcggatac 21900
cggacggctg gtcctacgag acggccgctg ccatcccggt cgccttcttg accgcctact 21960
acggcctggt cgacctggcc gggctgagtg cgggggagtc ggtgctcgta cacgccgccg 22020
ccggcggggt gggcatggcc gccgtccagc tggcccggca cctgggcgcc gaaatgtacg 22080
gcaccgcgag cgagccgaag tgggacacgc tgctcgacag cgggctggac cgcgcgcaca 22140
tcgcctcctc acggacgacg gtcttcgccg actcggtgat ggaggcgacc gggggtgcgg 22200
gcgtggacgt ggtgctgaac tcgctcgcgg gcgagttcgt tgacgcctcg ctgcgggccc 22260
tgccgcgagg cggccggttc gtcgagatgg gcaagaccga cctgcgcgat cccgagcggg 22320
tcgctgccga gcaccccggg gttcggtacc ggcccttcga cctgggtgag gccggcgcgg 22380
accgcatcgc cgaggtcctt gcgcacctgg ccgagctctt cgcctccggt gagctcaccc 22440
cgctgcccgt gaccgtctgg gacatccggg acgccccggc cgccttccgt gcgctcagcc 22500
aggccgcact caccggcaag ggcgtactga ccgtccccgc cccttccttc gaggccggcg 22560
agacggtgct gatcacgggc ggcacaggaa ccctgggcac cctgctggcc cggcacctgg 22620
tgaccgagca cgggctgcgc cacgtcatcc tggccggacg ccgtggtacc gagaccgcgg 22680
aggtgcggca cctgcgcggc gacgtggcgg aactcggtgc gcgcatcgag gtggtggcct 22740
gtgacgccgg cgacgagcgg gccctgcgtc aggtgctgga cgccctcacc gccgagcacc 22800
gcctcgcggg cgtcgtgcac gccgccggcg tcaccgacga cggggttgtg tccgccctgg 22860
accgcggccg gctgtccgcc gtactccacc cgaaggtgcg cggagcgtgg aacctgcacc 22920
ggctcaccgc aggctcggaa ctccggatgt tcgtgctgtt ctcctccgcc tcggccaccc 22980
ttggcgcggc gggtcagggc aactacgccg cggccaacgc cttcctcgac gcgctcgccg 23040
agcaccggca cgcccttggg ctgcccgcca ccagcctcgc ctgggggctg tgggagcagg 23100
ccagcggaat gaccgggcgg ctcctcgacc gcgaccggca gcggatgagc cggtccggca 23160
tcgtgcccct cagctccgcg cacggtctcg cgctgttcga cgccgcgcgg ctggccggcc 23220
tccccacgct caccccggca cgcctggacc tggccgcgct tcgggtgcgg tacgcacacg 23280
agcaggtgcc cgcggtcctt cgggaactcg tccgcgtccg gccgtccgca gccgaggacc 23340
ccacgacagc cccggacacc acgaccgcac cgggaccgtc cggtgccatg acactggcgg 23400
accggctggc aggactgtcc gccccggagc ggcagcggca cgtcctggac ctggtgcgtc 23460
ggcacaccgc ggccgtactg ggccacggct cggccgacga tgtcgacccc gaccaggcgt 23520
tcaaggccct cgggttcgac tcgctcaccg cggtcgaact gcgcaaccac ctgcggacgg 23580
cgacctccct ggccgtcccc gcgaccctcg tcttcgacca tccgacaccg gccgcgctcg 23640
ccgcgcatct cctggaactc gccgcaccgc cggaacggga cccggcgctc cgggtcatgg 23700
gcggactcga ccggctcgag gccgacgtcg aggcgctggc ctccggcggc gccgggcacc 23760
aggaggaggt ggccacgagg ctgcgccgcg tgctgcgacg cctggagtcg ggcccggggg 23820
ccgcccactc cggcacggag gaaacctccc tcgacaccgc ctcggcgacg gaagtcctcg 23880
ccttcatcga cagcgaattc ggcgatctcg cctagtacag gtacggagtt gatcgcagtg 23940
gtcagtgacg acaagcttgt cgactacctg aagcgggtca ccgcggacct caaacggacc 24000
cgtcagcggg tccacgagct ggagtcgggc agcgccgaac cgatcgctgt cgtggcgatg 24060
gggtgccgct tccccggagg catcagctcc ccggaagacc tctgggagtt cgtgcgcctg 24120
ggcagtgacg ccatctcgga gttccctacc gaccgtggct ggcacaccag ccggctgagc 24180
gggaacttcc ggcgggccgg cggattcctt tatgacgcgg gcgacttcga cgcgggtctg 24240
ttcgggatct cgccgcgcga ggcgctggcg atggacccgc agcagcggct gctcctcgag 24300
acggcctggg agacgctgga gcgggccggt gtcgacccca cctcggtgcg gggcgccgac 24360
ggcggcgtgt tcatcggcat ggccgaccag aagtacggcc cccgcgacga cgaactgctc 24420
ggtgaggtca ggggactcgt cctgaccggc acgaccagca gcgttgcctc aggccggatc 24480
gcctactcac ttggtctgca agggcccgcg atcaccatcg acaccgcctg ctcgtcgtcg 24540
ctggtcgcac tgcacctcgc ggtgcgctcc ctgcgggccg gtgagtgccc gttcgccctc 24600
gtcggcggcg ccgcggtgat ggcggagccc accctgttcg cggagatggc cgagcagggt 24660
ggcatggccg gagacggccg ctgcaaggcc ttcgcggccg ctgcggacgg caccggctgg 24720
ggcgaaggtg tcggcgtact cctgctgcag ccgctgtcca ccgcacgcga gcagggcctg 24780
cccgtcctgg ccaccgtacg cggctccgca gtcaaccagg acggagcctc caacggcctc 24840
tcggccccca acggccccgc ccagtgccgc gtcatccgca aggcgctcgc cgacgcccag 24900
ctcgtcgccg gccagatcga cgccgtggag gcccacggca ccggcaccgc gctcggcgac 24960
ccgatcgagg cgcaggcgct gctggccaca tacggccagg accggcccgg cgacgagccc 25020
ttgtggctcg gctcggtcaa gtcgaacatc gggcacaccc aggccgcggc cggtatggcc 25080
ggcgtcatca agatggtgca ggcgatgcgg cacgggctgc tgccgcgcac cctgcacgtg 25140
gacgaaccga cccctgaggc cgactggtcg gccggtgatg tgcggctgct gacggaggag 25200
cgggaatggc cggacacggg acggccgcgg cgcgcggcgg tgtcgtcgtt cgggatcagc 25260
ggcaccaacg cgcatgtcgt gctggaactg cccaccggca ctgtcgggga gccagccgat 25320
gcggccgggc cggttccgga cccgtcggcc tgcgccccga ttccgtggct gctgtccgcc 25380
gcgagcgccg acgccctgcg cgcgcaggcc cgcagactgc accgcttcgt ggacacaccc 25440
ggtgccccgc gcccgatcga caccgccctg tcgctcacgg tcacccgggc ccgactcgac 25500
caccgcgcca tcgtgttcgg caccgaccag gcagaactgc gggccggact gggggcattg 25560
gccgcgggcg aaagcacccc gcggactgtg cacggacgga ccgtgccgag cgcgacgatc 25620
gcgttcctgt tcaccgggca gggggcgcag cgggcgggca tgggccgggc ggcgtacgcc 25680
gcgttccccg agttcgccgc ggcgttcgac gcggtgtgcg cggagctgga cggactgctg 25740
ccccggccgc tgaagtcggt gctcttcgcc gagccgaact cggccgacgc cgcactggtc 25800
gaccagaccc tgtacgccca gaccggcttg ttcgctttcg aagtggcgct gttccggctg 25860
ctggaggagt ggggtgtccg accgggagtg ctgctggggc attccgtcgg cgagctggcc 25920
gccgcgcacg tggcgggcgt ctggtcgctg ccggacgcct gccgtgtggt cgcggcgcgg 25980
gcgcggctga tgcaggccct gccggaagac ggggcgatgc tgtcggtggc cgccagcgag 26040
aagcatatcg ccgaactgct gggcgacctc gccgacgtgg atgtggccgc ggtcaacggc 26100
ccggccgtca ccgtgctgtc ggggccgacg ggtgccgtgg cggacgtcgg ggagcggctg 26160
gccggcgcgg ggctgcgcac gaaacacttg cgagtgagtc atgccttcca ctccgccctg 26220
atggagccga tgctcgcgga attcgcacgc gagatcgccg acgtgacctt ccagcagccc 26280
gagctgccga tcatctccaa tctgacgggc cagcaggccg acgcggccga gctgggctcc 26340
gccgcctact gggtgcgcca ggtccgtggc accgtacggt tcgccgacgg tgtcggccga 26400
ctcgccgcgc acggcgtcac cgcctgcctc gaactcggcc cggacggtgt gctgaccgcc 26460
ctggcccgcg actgcctcac ggccgccgcc gatgtggcct tggtgcccgc actgaggcgc 26520
gaccaggacg agccggccgc gctcctcgcc gcgctcgccg aactccatgt gcggggcgtc 26580
gaggtggact gggccgcgat gctgaccgca cgcggcggcc ggcgcgcagc cctgcccacc 26640
tacgccttcc agcgagagcg ctactggctg cccgccaccc cctccgtcgc ctccgccgtc 26700
tccgcgcccg ccgagcaggc agaccggctg ctgtaccgcg tcggctggtc gccggtcacc 26760
ggtttcgaca ccgaggcgag gccggagggc acctggctcg tcgtcgcttc cccggacgac 26820
gagggccgcc gcgtcgccca ggcgctcggc ccgcacaccg tgctcgtggc ccacgacccc 26880
gatgacccct ccgggtcggt ggcacggctg cgcggcgccc tgccggccga ccgcccggtg 26940
accggggttc tggccctgcc ggagcagacc ggcgcggcgg ccgtggcggc ccagctcgca 27000
ctccgcgagg ccctccgcga cgccgaagtg cgcgcccccc tgtggtgcgc gacccgtgcg 27060
gcggtctcgg tcggaggtga ggccaccccc ggcgccgcac aggcaccgct gtggggattg 27120
aacagggctc tggaaacctg cggggggatg gtggacctgc ctcagcggtt ggactcacgg 27180
tccctggggc tgctggccgc ggccctcacc aacccggccg acgccgacga actcgccgta 27240
cgcaccggtg ggctgttcgc ccggcggctg cacgccgtgc agcccgtgcc gcgcgctcct 27300
cgcccgtggc gggccgacgg gaccgtcctc gtcaccggcg acgtcgagtc ggcaaccgat 27360
gacctgctgc ggaggctgag cggtgacggc gagcgccccg tggtgctggc gcggcgaccc 27420
ggcaccgccc tgcagaacgg ggccgcgggc gacgggtcgt gcaccgtcgt ggagtgggac 27480
cccgcggccg gcgcacccga gacgccctcg ccggtgaccg ccgtcgtaca tctggacaac 27540
atccagccgt ccgccccacg ggacgacgcc gaccctttgg ccctggctgc cgcagtggcg 27600
gagcggctcc acaccgtcga ccggctgacc gagctgttcg gcaaccagga tctggacgcc 27660
ttcgtgctgt tgtcctcggt tgccgggatc tggggcggcg ccgaggacgt cgttcacacc 27720
gtcgtgcacg cggccctgga gtccgccgcc gaacgccggg ccgctgccgg cctgcgcggc 27780
gcctgcgtag gctggggccc ctgggccggc gccggcgacg ggccggacgt gcccggactc 27840
gtacccatgc gccccgagcc ggcactggcc gcgctgtggc acgcgctgga cgacgacgcg 27900
gccgtcttcg ccgtcgccga cgtcgactgg ccgcggttcc acccggtcct caccagccgg 27960
cgtccccggc ctgtcgtctc cggtctgccc gaggtacggg cgctcaggcc ggcgccatcg 28020
gcggcgcccg ccgtcggcat ggacgtcacc gacctggaac accggctgcg ggacctggtg 28080
ctcaccgagg ccgcaacggc gctcgggcat gccttccgcg actcgatgga cccgctgcgc 28140
cccttccgcg acgccgggtt cgagtcgctc accgccgtgc gtttccgcga ccggatcgcc 28200
tccgaaaccg ggctgaacct ctcggccacc ctcgtcttcg accaccccac gcccgaggcg 28260
gtcgtggccc acctgctggc cgaactgacc ggggggcggc ccgacgaggc ggagcaggtc 28320
agcacccgat cgcacgacga cccggtggtc atcatcggca tggcgtgccg ttaccccggc 28380
ggggtcagcg accccgaggg cctgtgggaa ctggtccact ccggacgcga aggcatcggt 28440
gacttcccca cggaccgcgg ctgggacctg gcggcgctgc gacgtgccgt cccccacctg 28500
gccctcaggg ccggcttcct ccccgacgcc gccgccttcg acgccgcctt cttcgggatc 28560
tcgccgcgcg aggcactggc tatggatccg cagcagcggc tcctactcga agcctcgtgg 28620
gaggctgtgg agaccgccgg catcgatccg gcgtcgctgc gcggcagccg caccggggtg 28680
ttcgcgggcg tggccggctc cgattacggt gccgcactcg ccggttcgcg tgaggcagag 28740
ggctatctga tgaccggaac ggccaccagc gtggtctccg gccggatcgc ctatgttttc 28800
ggtctgcagg gcccggcgct caccatcgac acagcctgct cctcctcgct ggtggcgctt 28860
cacacggccg tgggcgcgct gcgcaagggc gagtgcgacc tcgcctttgc caccggcgtc 28920
gccgtcatct ccaccccgga cgctttcgtg gacttcgcca agcaggacgg gctggcggca 28980
gacggccgct gcaaggcgtt cgccgtcggc gccgacggga ccaactgggc cgagggtgtg 29040
ggtgtgctgc tcgtcgagcg gctctccgac gcccgccgca acggtcatcg tgtgctggcc 29100
gtgctccgcg gcagcgcggt caactcggac ggcgcctcga acgggctcgc cgcacccaac 29160
ggcggcgcgc agcagcgggt catccgccag gccctggcgg acgcagggct gaccgccccg 29220
gacgtcgacg ccctggaggc gcacggcacc ggcacggcgc tcggcgaccc gatcgaggca 29280
caggccgtgc tggccaccta cggccagggt cgccccgccg atcggccgct gtggctgggc 29340
tcactgaagt cgaacatcgg ccacagcgcc gccgcagccg gtgtcggcgg cgtgatcaag 29400
atggtggagg cgatgcggca cggagtcctg ccgccgaccc tgcacgccga cgagccgacc 29460
cacgaggtgg actggtccgt gggcgcggtg gaactgctca ccacggcacg cgactggccc 29520
gagaccgggc ggccgcgccg cgccgcggtg tcgtcgttcg gcgtcagcgg caccaacgcg 29580
cacgtcatcc tggaacaggg ccccgacctg gccccgggcg gcgtgcccgg tgtccaggag 29640
gaccccgcgc ccagggccgc gggaggatgt gccggcaacg ccgtcccctg gctgctgtcc 29700
ggacgttccg cccgggctct gcgcgaccag gcggcccgtc tcgccgggca tctgacgcgc 29760
ggtgacccgt cggccgaagc gatcggacac gcgctgctca cctcccgtac cgccttcgag 29820
caccgggccg tcgtgctggg cggcggtacc gtcgatctcg tcgaaggact ggatgctctc 29880
gccgccgggg aaccggcccc gtcggtggtc gccggcgcac ctcgtccgac cggccgtgga 29940
cccgtcttcg tctttcccgg ccaaggtggt caatggtccg gcatggcgtc cgaactcctc 30000
gacacctgcc cggccttcgc cgcccgctgg gccgagtgcg agcgtgcgtt cgcgccgcac 30060
atggacgtct cgctcacgga ggcggtccgc gacgccgcgg ccctggagcg ggtcgatgtc 30120
gtccagcccg tgctcttcgc ggtcatggtg tcgctggtcg aggtgtggcg ttcgtacggg 30180
gtacggcctg ccgcggtgat cgggcactcg cagggcgaga tcgccgcggc ctgcgtcgcg 30240
ggagcgctgt ccctcgacga tgccgcccgc gtcgtcgcgc tgcgcgccag ggcgctcggc 30300
gtgctggccg gtgcgggcgg catggtctcc gtcgcactcc cgcccgccga gaccgagggc 30360
tggctgcggc gttgggagga ccgcatctcc gtggctgcgg tcaacggccc ctcctccgtc 30420
gtggtctcgg gtgaaccggc tgccctggag gaactggtgg agcaagcccg cacccgggac 30480
gtccgggtgc gccgcatcga ggtcgactac gcctcgcact cggcacaggt ggcccgtatc 30540
gaggacgagg tcctccgact gctggaaccg attcggccgc ggacgtccga ggtccccttc 30600
ttctccaccg tctccacgca gtggcaggac accaccgcga tggacgccgc ctattggtac 30660
cgcaacctgc gcgatccggt gctgttcgcc ccgtccgtcg gcgcgctcgt cgaccagggg 30720
cacacggtgt tcgtagaggt cagccctcac ccggtgctca cctccggcct gctggagacc 30780
gctgaacgcg ccgacgtgga cctgacggtc accggcaccc tgcgccgggg tgagggcggg 30840
ctcgcccgga tgcgtgcctc gctcgccgaa ctgtgggtgc acggcacgcc cgtcgactgg 30900
tcggccgcct tcgacccggc cccggcgggg ccggtgccgt tgcccactta cgccttccag 30960
cgcgaccgct attggcccga tccgcgcccg gcgtccgccg acccggtgta cgagaccttc 31020
tggcgggcgg tggacgaggc ggacctgccc gcgctgaccg gaaccctggg cgtcaccgac 31080
gaccagccgt tgcgcgaggt gctgcccgcc ctgtcggcct ggcggcgcag ccgtacggaa 31140
caagcggtca cagacagctg gcgctaccgc gtgtgctgga agcggctgcc ggacgcggcc 31200
cccgccgaac tgcccggcac ctggctcctg gtgaccaccg agggcgccgc cggggacccg 31260
tccgccgctg ccgccctgca ggcagtgcgg gacgctgccg ggcacaccgt gacccttgct 31320
gtcgacagcg acgacgagcc ggcttcactc gcggcggcgc tgcgcgagac gctgcgggga 31380
acgcatccgg ccggcgtggt cacactgacc ggcacggacg tctcaccgca cccggtcagc 31440
ccggtcgtcc cggtgggcac ggccctcacc gtcaccctgc tccaagccct ggacgcggcc 31500
gacgtcgatg cgccgctgtg gtgcctgaca cgcggcgccg tcgccaccga cgacgacacc 31560
gccgggcccg gcagcccgct ccagtcggcg ctctgggccc tgggccggat cgcggccgtg 31620
gagtctcccg gcaactgggg cggtctcgtc gacctgccgg acaccttcga cgacagcgcc 31680
gcgcggcgac tggtgtcggt cctcgccagc ctggacggcg aggatcaggt ggcgctgcgc 31740
gtctccggcg cgtacggccg tcggctgatg cgcgccaacc ccactgcctc acccggctcc 31800
ggctggcgcc cgcgcggcac cgtgctggtg accggcggca ccggagcgct cggcgggcgc 31860
gtcgcccgct ggctcgcccg ggacggcgcg gagcatatcg tgctcgccag ccgccgcggc 31920
tcgcaggcac cgggagtcga cgacctggtg gccgaactga gcggcctcgg ggcccaggtg 31980
acggtggact cctgcgacct gagcgtggcc tcggaagcgt tcgcgctggt cgaccggata 32040
cagcgcgacg gcgaccggat cggggcagtc atccacaccg cgggagcggg tggcctcgga 32100
ccgctcgtcg acgcgggact ggacgacatg gagctggcca tggccggcaa ggtcgccggc 32160
atcgacaacc tggagcgggc gctggacgac caacagctcg acgcggtcgt ctacttctcc 32220
tccatcagcg cgtcctgggg cgccggtgac cacggcatct acgcggcggc caacgccgtc 32280
ctggacgccc gcgccgaggc ccggcgtgcg gccggcgtgc acaccgtgtc ggtggcctgg 32340
gcgccctggg gcggaggcgg catgatcgac gacccggccg tggcggacac actgaaccgc 32400
atgggcctgc ctctggtgga ccccgacctc gcgatcagtg gtctcgccac gatcctcgcc 32460
gagggggagg agtcgctgct gctggtggac gtggactggg gcaggttcat cccccagttc 32520
accctgcgcc gccccagccg cctgttcgac gaactgcccg aggcacgggc ggcggaggcc 32580
gacacggggc ctgccaaggc cgacgcccct tccccgctgg ccggtcggct ggccgggctg 32640
agcaaggcca agcgcgccac ggcgctgcgc gacctcgtac gcgagcacgt cgccgcggtg 32700
ctgggccaca acgacccggc ggccgtcgat gccggccggg cgctgaagga cctcggcttc 32760
gactcactga cggcggtgga actgcgtgac cgactgagca ccgtggccgc aatgcgcctc 32820
ccggccaccc tggtcttcga ccatcccacc atcgccgaac tggccgactt cctggcccgt 32880
ggcctggagc cggagacggc ccggccgacc gccgcacccg ccaccgtcgt acgcgtcgac 32940
caggacgagc cggtcgccat cgtcgccatg gcctgccgct accccggcga catcgcctcc 33000
gcagaggagc tgtggcgtgc tgtccgcgac gagaaggacc tgatcagccc attccccatc 33060
aaccggggct ggccggttga ccgactgctg gacgccgatc ccgaccggcc cggcaccagc 33120
tacgtcgacc acggcggatt cctgcacgac gccggtgact tcgaccccgg cttcttcggt 33180
atctcgccgc gagaggcaca ggccatggac ccgcagcagc ggctgctgct cgaatcgtcc 33240
tgggaagtac tggaacgcgc cggtatggtc ccgaagtccc tgcggggcag ccggaccggg 33300
gtatacgtcg gtctgaccga ccaggcctac ggcactcgcc tgcgcggatc cttggacggc 33360
atggagggct tcctcgtcag cgcgtcgtcc aacgtggcct ccggccggat ctcgtactcg 33420
ctcgggctcc agggccctgc gctcaccgtg gacacggcct gctcgtcctc gctggtggcc 33480
ctgcacctgg ccacccaggc gctgcgcaac ggcgagtgcg acctcgccat cgcgggcgcc 33540
gccaccgtca tgccggaccc cacctccttc atggccttca gccggcagcg cggactggcc 33600
gccgacggcc gctgcaagcc gttcgccgcc gccgcggacg ggttctccct cggcgagggt 33660
gtcggcgtcc tgctggtgga gcggctgtcc gacgcccgcc ggctcggaca ccccgtgctg 33720
gcgctgatcc gtggctccgc ggtgaaccag gacggggcct ccaacggcat caccgcgccc 33780
aacggccctt cccaggagcg tgtcatccgg caggccctgg tcaacgccgc gctgcccgcc 33840
tccgcggtgg acgtggtgga ggcgcacggg accggcacca ccctcggcga cccgatcgag 33900
gcgcaggcgc tgctggccac gtacggccag gaccggcctg ccgaccggcc gctgcgactg 33960
ggctcggtca agtcgaactt cgggcacacg caggccgcgg ccggcatggc cggtgttatc 34020
aagatggtgc aggccatgcg gcacgagttg atgccgcgca ccctgcacgt ggacgcgccc 34080
agcccgcacg tggactggag ctccggggcg gtggagcttc tcgccgaggc gcgaccgtgg 34140
ccgcggggtg atgagccgcg ccgtgctggg gtctccgcct tcgggatcag cggcaccaac 34200
gcgcatgtcg tcctggaaga ggcatcgcag gagccgacgc ccgacgggag cgccggggcg 34260
ccggatacgc cggatacgcc ggacgcgccg gtcgaggcgg acaccggccg tcccctgccg 34320
ctcgtcgtct cggcccgcac cccggacgca ctgcgcgacc aggccgcccg cctgaccgcc 34380
ctcctggacc gggaagaaca cccggtctcc gacctcgcct actcgcttgc cacggcccgc 34440
ggtgtgttgg accgggccgc ggtcgtcgtc gccgcggacc cggacgaact gcgccggaac 34500
ctggccgacc tgaccacgag agcggtcgcc gagcggcggg ccgagggcgg cctcgccttc 34560
ctcttcaccg gtcagggcgc ccagcgcgcc ggcatgggac gctccctgta cgacgccttc 34620
cccgagttcg ccgcggcctt cgacgaggtg tgcgcggaac tcgaccggca cctgccccgc 34680
ccgctgcgca ccgtcgtgtg ggccgagccc gggacggacg aggccgcgct gctcgaccag 34740
accctgtaca cccagaccgg tttgttcgcc gtcgaggtgg cactgttccg gctgctggag 34800
cactggggcg tacggcccga cgccctgctc ggccactcgg tcggcgagct cgccgccgcc 34860
cacttggccg gcgtgtggtc gaccgaggac gcggcccggg tggtcgccgc ccgcgcccgg 34920
ctgatgcagg aactgccgga gggcggcgcg atgctgtccg tcgctgcggc cggggacgag 34980
gtgtccgccg tgctcggcga cgcgtccgcc gaagtcgccg tcgctgcggt caacggcccc 35040
gcgtcgctgg tcctgtccgg caccgaggag tccgtgacgg ccgccggcgc ccggctcgcc 35100
gaggcggggc tgcgcaccaa gcgacttacc gtcagccatg ccttccactc gtccctcatg 35160
gaaccgatgc tcgccgcgta cgagcatgaa ctcgcccagg tcgccttcgc cgagccggcg 35220
ttgcccgtcg tctccaacct caccggggag gtggccggcg ccgagctgtg cgaacccgcc 35280
tactgggtga ggcaggtgcg gcaggccgtg cggttcgcgg acggggtgcg caccgtgctc 35340
gacgagggcg tgaccaccct cctggaactg ggcccggatg gcgtcctgac cgccatggcg 35400
caggagtcgg ccggggagcg ggccaccggt atcgccgccc agcgccggga ccgtgaccag 35460
gtgcggaccc tgctcaccgc actcggcagg ctccacgtgc gcaccgaacg cgtggactgg 35520
gccgcgttct tccgcggcac cggggcccgc cgggtggacc tgcccaccta cgccttccag 35580
cgccggcgct actggctgga cacgtcgtcg ggtggtgccg aggcactggc cggcgcgggc 35640
ctggcgggta ccggacaccc gctgttgacg gcgtccgcga cgctgcccgg gacgggcgag 35700
tccctcttca gcggcagcct gcccggggcc ccggacggac ggcccctgtc gggcggcgag 35760
atcctcgaac tggtgctgtg ggcgggtggg aacttcggct gccaccggat cgccggactc 35820
gatgtcgccg gatcggtgcc ccacgctccc caggcgccgc tgcaactggt ggtggcagca 35880
cccgacgagt ccggaaaccg ggccttcacg ctccacctgg gcccggtggg cggcccgcac 35940
ggcccggtgg agggcccctg gacgcggatc gcccacggcg tactcggcgg cacccccacc 36000
cccctgccgc cggagcccgg caccgcggcc tggccgccgg ccgacgccga gcccgtcgga 36060
gccgacctcg tctggcgccg ggaggacgag ttgttcgccg aactggagct ggccgaacgg 36120
aacgcggcgg acgtcgaccg tttcgccctg caccccgggc tgctggcgga ggtgatggag 36180
ctgatcgccg gactggccgg agaaccggtc cacttcaccg gggtgacccg gtacgcgacc 36240
ggcgccaccg tgctgcgcgt ccatctgacc cgcgtcgccc ccgacaccgt caccgcgctg 36300
ctgacggacg cggaaggcga accggtgctc tcggtggacc gggtccaggt ccgtgccgac 36360
ggtgcggcgg ccgtgcgctc ggccacggcc gccgcaccgg acgccctgta cgagctgacc 36420
tggacaccgg tcggcgccga ggccctccca ccggacaccg gctgggcagt ggtgggcgtc 36480
cccgccggcg acctggccaa ggtcctggag gcgcagggcg ccgaggtggc aacccaccct 36540
gacctggcgt ccctcggcag cacggccgac cgcggtgaca tgcctggtct tgtcgtcctg 36600
tccgtggaga cggcacccgg cgcacccctg gagtccgcgc gactgaccgt tcaccacacc 36660
ctgcgtctgg tcggggaact cctcgcggac acccagctca ccggcacccg gttcgccttc 36720
gtcacgaggg cgtccgtgtc caccggcgac ggcgcggcgg tcgacccggc gcaggcggcg 36780
gtccgcgggc tgctgctctc cgcccaggcc gagcacccgg accggttcgt cgtcgtcgac 36840
ctgggcggcc gggaggagga cgccgatctg ctcacggcgg ccgtcggcac ctccctggcg 36900
gcggctgagc cgcacctcgc gatccgcgac ggccggctgc tcgtgccgcg gctggcccgg 36960
gtcaccgagc caccgcaggc ctttgcagcc gggcccgagg agcacggcac ggtgctggtc 37020
accggcgcca ccggaggcat cggcaccaag atcgtgccgc acctggtggc cgagcacggc 37080
gtgcgcaggc tgctgctgct cagccggaag ggccctgacg acccccgcgc ggccgaactg 37140
ggccgtgagc tggcagcgta cggcgccgag gcgacgttca cggcctgtga catcgctgac 37200
cgcgcggcac tcgaggccgt cctggccgag gtgccggccg agcacccggt gaccgcggtc 37260
gtgcacatcg ccggagtcgt ggacgacggc gtgctcacca cactgagccc cgagcgcgtc 37320
gacaccgtgc tgcggcccaa ggccgaggcc gcgcagcacc tgcacgagct gaccgccggc 37380
cttgagcttt cccatttcgt gctgttctcc tccggcgtcg gcgtgctcgg gggcgccggg 37440
caggccaact acgcggcggc caacgccttc ctggacgcgc tggcgcagac cagacaggct 37500
gccgggctcc cggcgtcctc gctcgcctgg ggcctgtggg agaccgacat gggcatgtcg 37560
gcgcgcctgt ccgaggtcga ccgccgtcga atggcccagg cgggcgtcct cgccctcacc 37620
ccgcagcagg gcatcgcgct gttcgaccgg gcgtggaact ccggtgcggc gaccctggta 37680
ccgatgagcc tggacacggc tgtgctgcgc aggaaggcag ccgactccgc cctgcccgcc 37740
ccgttccgcg cactggtccg cacaccgctg cgccgggccg ccgccggccc cgcacaggcg 37800
gcgggacagt ccttcgcgca gcggctggca gagcagcccg ggagcagtcg caggcggctg 37860
ctgctggagt tgatccagcg acaggtgggc accgtgctgg actacggggc cgacaccctg 37920
ctcgatgccc ggcgcacctt ccgggagctg ggcttcgact cactcaccgc ggtggagctg 37980
cgcaatcgcc tggtcgccgc gacgggcgtc cagctttccg ccgcgctggt cttcgaccac 38040
cccacggcgg acgcgctcgc cgaatacctg gagagcaagg tcctgcggtc acaggtcggg 38100
gcgcccctgc cggtgctcac ccagctcgac cacctggagg ccgcgctcgc ggcgcccccg 38160
gccgacaccg ccacccgcga gcagatcgcg gcccggttgc gcgccctggc ctccacctgg 38220
agcgcccagc ccgacgacgg ccatggagcc gatgacggcg acatcagcag caagctcgat 38280
tccgccacgg atgaagagct gttcgacttc atcagcgggg aattcggaga ggactgagtc 38340
cgatggccaa cgagcagcag ctgcgcgact acctcaagcg ggccggcgcc gaactgcacc 38400
gtacgcgtcg gcgcctggcc gacgtggagg cacggagcac cgagccggtc gcgatcatcg 38460
gaatggcctg ccgctatccc ggcggggcga gcacccccga ggacctttgg cggcttgtga 38520
tcgaggagac cgacgcgatc ggcccctacc ccaccgaccg gggctgggac ctggacggct 38580
tctaccaccc cgaccccggc aaccccggca cctgctacgc ggacggtggc gggttcatcg 38640
acgacatcgc ctcgttcgac gcggccttct ttggcatctc cccgcgtgag gcgcaggcca 38700
tggaccctca gcagcgcctg ctcctcgaga cctgctggga agcgctggaa caggccggtc 38760
tcgacatcca cgcgttacgc ggcagccgta ccggagtgtt cgccggcctc agccagcagg 38820
actacggcac tctgctggcc gccgcaccgg gcgggctgga cggctacgcc gccaccggca 38880
cctccaacag cgtcctgtcc ggccgcatct cgtacgtcct gggcctggag ggccccgccg 38940
tcaccgtcga caccgcctgc tcctcctcac tggtggccct gcacctcgcc gtgcaggcgc 39000
tgcgcaacgg cgagtgcgac ctcgcgctgg cgggcggggc gacgacgttg tccacctccg 39060
ccgtccacct ggccctgtcc ggtcagcgcg cactggcacc cgacggccgc tccaaggcgt 39120
tctcggcggc ggccgacggt gccggatgga gcgagggagt cggtgtcctc gccgtcgagc 39180
ggctgtccga cgcccgccgg ctcgggccac cgggtcctcg cggttctgcg gggcagcgcc 39240
gtcaaccagg acggcgcgtc caacgggctg accgcgccca acggcccctc gcagcagcga 39300
gtcatccgcc aggcgctggc caacgcgggc ctcacacccg ccgacgtgga catcgtcgag 39360
gcgcacggca ccgggaccag tctcggcgac ccgatcgagg cggatgcgct gctgtccacc 39420
tacggccagg ccaggccggc cgaccggccg ctgtggctgg gctcgctgaa gtccaacatc 39480
gggcacagcg gagccgcggc cggggtggcc ggcgtgatca agatggtgca ggctctgcgg 39540
cacggcgtca tgcccaggac gctgcatgcc gaggaaccca ccccgaacgt cgactggtcc 39600
tccggcgccg tggaactgct caaccgggcg cgcgactggc ccgcctccgg cacgcgccgc 39660
cgggccgccg tctcctcctt cggtatcagc ggcaccaacg cgcatgtcat cctcgaagag 39720
gctccgcagg acagtggtcc ggagaccggc gacgaggcgg acccatcacc cgagggaacc 39780
ccctggccgc tgctgccgtg ggtgctgtcc gcgcgcagcg aacacgccct gcgcggccag 39840
gcccgcgccc tgcacacgca cctgctggcc catcccgaac cggccgacac cgacgtggca 39900
ctctcgctcg ccaccacccg gaccggtctt gagtaccggg ccgccgtcct ggccgccgac 39960
cgggatggat tcctgaacgc gctggaggcc ctcgccgacg accgccccac caacggggta 40020
ctgcgcggaa ccgccgccga gggcaaggcc gtgttcgtct tccccggcca gggcgcgcag 40080
tggaccggca tggcccggga actcctcgac acctcgccgg tgttcgcggc caaggcggcc 40140
gagtgcgccg cggccatcga ggagttcgtg gacttcaagg tcctggacgt gctgcgcgac 40200
gagcccggcg ccgcgtccat ggaccgcatc gaggtcgtcc agcccgtgct gttcaccgtc 40260
atggtgtcgc tggccgagct gtggcggtcc ttcggcatcc agcccgacgc cgtggtcggc 40320
agctcccagg gcgagatcgc cgccgcccac gtcgcgggcg ggctgaccct cgaggacgca 40380
gcccgtgtga tctgcctgcg cagccgcttg ctggccgaga ccctggtcgg caagggcgcc 40440
gtcgcgtcgg tggcgctccc cgccgaccag gtgcgcgagc ggctgcggcg ctgggacggc 40500
cggctgtccg tcgccggcgt gaacgggccc cgcctcgtcg cggtggccgg ggacgacgcc 40560
gcgctcgccg agttcgtcga ggagtgcgcc cgcgacgaca tccgcgccag gaccgtggcc 40620
gccaccgtgc cgacgcactg cgccctcgtc gacccgctgc gcgagcggct gctggaactg 40680
ctggccccgg tccggccgcg caccggcacc gtgccgctgt actcgacggt gaccgggggc 40740
ctgctggaca ccgccaccat ggacgccggc tactggtacg acaacacacg ggcgccggtg 40800
ctcttcgaac ccgtcgtccg caccctgctc gccgagggac accacgcttt cgtcgagtcc 40860
agcgcgcatc cggtgctggc catggccgtc gagcagacgg tcgatgccac aggggcgccg 40920
ggcgtcgtcg tggagtccct gcggcgggac gagggcggcc ccggccggat gctgacctcc 40980
ctgaccaagg cccacctggg cggcgtccgc gtcgactggc ccacggtgtt cgccggcacc 41040
ggcgcccgca ccgtggatct gcccacctat gccttccagc gcacccggta ctgggccgag 41100
accgccgatc gcaccggcga cgtgggctcg gtcggcctgt cgccggtaga ccacccgctg 41160
ctcggcgccc tggtccggat ggccgacggc gacggggccg tgctgaccgg acggctctcc 41220
ttgcacactc acggctggct ggccgatcac ggcgtggccg accaggtgat cttccccggc 41280
accggcttcg tggagctggc ggtgcttgcc ggagaccagg tcggctgcgg ccggatcgag 41340
gaactgaccc tgcacacccc gctcgtcgtg ccccgcaccg gcgcgctcgt cgtccaggtg 41400
aacgtccagg cggccgatga caccggagca cgcgcgctcg gcgtgtactc ccgccctgac 41460
gacgccggcg ccgacatggt ctggacccgg cacgcctccg gtgtcctcgt ccccgaggac 41520
accgtggacg cggaggatac cgacgggctc agcggcgtct ggccccccga gggcgccgag 41580
cccgtggcca tctccggtct ctacgacggc atggccgcgg ccggctacca gtacgggccc 41640
gggttccgcg gtctgagccg ggcctggcac ctcgacggcg acgtctacgc cgaggtggcg 41700
ctgcccgccg accagacgtc agcggcggaa cgctacggcc tgcaccccgc cctcttcgac 41760
gccgccctgc acgccatgtt cacctgggac ggcgacgacg gcggcggggt cggcatgccg 41820
ttctcctgga ccggggttcg gctgcacgcc accggctgcg cacggctgcg ggtacggctc 41880
gcccggcgcg gcgagagcga cttcacggtg acactgacgg acgaggccgg tgaccccgtc 41940
gtatcggtcg actccctggt cgtgcgcagg atgaccgggg ccgcgccgga caccgtacgc 42000
accgacacgc tctaccggct cgactggaag accgtccggg ccggggagga gacatccgcg 42060
ccccgctgcg tcctgctggg caccgacccg ctgggcgtcg ccgccgccct gccaggcacc 42120
gcgcgcgtgg ccgacgtcga gcgactcgcc gagctggccg ccgcgggcgg ccccgtcacc 42180
gcactgctgc ccgtcgccgg cgacggctcc gccgagcgga tcggcgatcc ggtgatcgac 42240
accgtcgccg tgctgcagtc ctggatcgcc gacggccggc tcgacgacac ccggctcgtc 42300
gtgctcaccc ggggcgcggt ggcgaccgcc ccaagggagg acgtcaccga cctcgccgcc 42360
gccggcgtct ggggcctgat gcgctcggcg cagaacgagc atcccgggcg cttcggcctc 42420
atcgacctcg acaccgccga atcctccacc gcggcgctcg gcacggcgct cgcctcggag 42480
gaggagcaac tcgcgctgcg cgacggagta ctgcgcgggc cgagcctgac ccggtgggac 42540
ccgggcacga ccatcctgcc ccccgccggc gagagcgcct ggcggctgga gaacacccgg 42600
cccggcacga tcgagggcct ggacgcggcc ccctgcccgg agcttctcgc ccccctcgga 42660
ccccggcagg tacggatcgc cgtccgcgcc gccggcatca acttcaagga cgtggtcgtc 42720
gccctcgacc tggtgcccgg actgaccggc ctgggcggcg aggtcgccgg tgtgatcacc 42780
gccgtcggcg ctgaggtcac ctaccaccgg gtcggcgacc aggtgttcgg cctggccacc 42840
gaggtcttcg gcccggtgac cgtcgccgac gaacgcaccg tccaccggat acccgatggc 42900
tggaccttcg aggaggccgc ctccgtcgcc gtcacctata tgacggccta ctacggactt 42960
gtcgacctcg gcggcctgcg cgccgggcag agcgtgctca tccacgccgg agccggcggc 43020
gtcggcagcg ctgccgtcca gctggcccgc cacctcggag ccgaggtcta cgcgacggcc 43080
agccccggta agtggggcgc cctgcgcgcc cagggcctgg acggcgcgca catcgccaac 43140
tcgcggaccc tcgacttcga gcagtggttc ctgcactcca ccgacggccg gggcatggac 43200
gtcgtactcg actgtctggc aggcgagttc gtcgacgcgg gcctcagact gctgccccgc 43260
ggcgggcact tcctggagat gggcaagacc gacaagcgcg atgccgaaca ggtcggcgcg 43320
gcccacccgg gcgtcgtcta ccgggcgtac gacctgccgg aagccggtcc cgaccgcatc 43380
cacgagatgc tggtcaccct caccgggctg ttcgaggacg gtgtcctgcg gccaccgcac 43440
gtcaacgcct gggacatccg cgacgcccgg gccgccttcc gggctctgag ccaggccgcg 43500
ctcgtcggca aggccgtgct cacccttccc ggcgtcccgt tctcccccca cggcacggtc 43560
ctgatcaccg gtggcaccgg tatgctcggc gcactgctcg cccgccacct ggtcaccgcg 43620
cacaacgtga ccagcctgct gctcaccagc cggcgcggcc ccgacgcccc gggtgccgcg 43680
gagctcacgg cggaactgac cgaggccggt gcccgggtgg acgtcgtcgc gtgcgacgtc 43740
gccgaccgcg atcagctggc cgctctgctc gccggcattc ccgccgagcg cccgctgacc 43800
gccgtactgc acaccgcggc ggccctggac gacggtctcg tcgagtccct caccgccgag 43860
cgcacccgcg ccgttctgcg ccccaaggtc gacggtgccg tccaactgca cgaactcacc 43920
cgcgacctcg acctcggcgc gttcgtgctg ttctcgtccc tggccggcac catgggcgcc 43980
cccggccagg gcaactacgc cgccgccaat gtgatgctcg acgccctggc cgcacaccgg 44040
cgggcccagg ggctgcccgg gctatccctc gcctggggct tctgggatca gcgcagtgag 44100
atgtccggca acctcgatga ccgcgacata cagcgcatga gccgcggcgg catcgtgccc 44160
atgagcagcg aggagggcct cgccacattc gacctggcct gccgcaccga ccgcgctcaa 44220
ctggtccccg cgaggctcga ccccgctgcg ctcgccggga ccaccggtcg ggttcctccc 44280
gtcatgcgag ccctgatacc cgctcccgcc cggcgttccg gacgccgctc cgccgaggcc 44340
ggggacgact cgctgcgcgc acggctggtc ccgctcaccg gcaccgaacg cacgcgcatc 44400
ctgctccagt tggtccgctc gaatgccgcc accgtgcttg gccacactga ccccgacgcg 44460
gtcggcgcgg ccacaccctt ccgtgaactc ggcttcgact cgctgaccgc cgttgagttc 44520
cgcaaccggc tcaccggcgc cgtcggcttc cggctccctg tcaccgtggt cttcgaccac 44580
cccacccccg gcgcactgac cgacttcctc gccgccgaac tcctcggtgg cctggacgaa 44640
accgacgccc cggccggtcc gtcccgcgcc acgcccgcgg ccgtcgcccg caccgatgaa 44700
gaacccctgg tgatcgtcgg catggcctgc cgctacccgg gcggtatctc caccccggag 44760
gagctgtggg acttcgtcct cgcggagcgc gacgccatct ccggcttccc ggaggaccgc 44820
ggctggcgcc gcgagcggtc cgccgacggc tccgcgccgc agcagggcgg gttcctcgac 44880
cgcgtcgcgg agttcgacgc cgcgttcttc ggcatctccc cccgcgaggc actgaccatg 44940
gacccgcagc agcggctgct gctggagacc tcctgggagg ccctcgaacg cgccggaatc 45000
gcgccgggta ccctgcgcgg cagtcgtact ggcatcttcg tcggtgccgc cgcctccggg 45060
tacaccagtc tgttccgccg cggctcggaa gccctcgccg gatacggcgt gaccggcgcc 45120
tccaccagtg tggtgtccgg acgcgtggcc tacgtgctgg gcctggaggg gcccgccgtc 45180
accgtggaca cggcctgctc gtcctcgttg gtcgccctgc acaccgccgc gctgtcactg 45240
cgcgcgggcg actgcgacct tgccctcgcc ggcggtgtcg ccgtgatgac cagtccgttc 45300
ctcttcgacg acttcgccag gcagggcggc ctctcgcccg acggccgctg caaggccttc 45360
gccggttccg ccgacggcac cggctgggcg gagggcaccg gcatggtcct cctcgaacgt 45420
ctctccgacg cccgccggaa cgggcatccg gtcctcgcgg tgctgcgcgg cagcgccgtc 45480
aaccaggatg gcgcctccaa cgggctgacc gcgcccaacg ggccgtcaca gcaacgggtg 45540
atcaggcagg cactcgacag ggccgggctc acccctgcgg acatcgatgc cgtcgaggcg 45600
cacggcaccg gaaccgtcct cggtgacccc atcgaggcac aggccgtcct cgccacctac 45660
ggacgggacc gggatccgga ccgccccgtg ctgctcggtt ccctgaagtc caacatcggt 45720
cacagccagg ccgctgccgg tatcggcgga gtgatcaaga cggtgcaggc cctgctccac 45780
ggcatcctgc cccgtagcct gcacatcgac gagccgaccc cgcacgtcga ctggtccgcg 45840
ggcgccgtcg atctgctcac cgagacccgc tcctggccgg ccacggacca tccccgccgg 45900
gccggtgtgt cctccttcgg cgtcagcggc accaacgccc acgccatcct cgaacaggcc 45960
accgagcccg agcccccgat cgtcgatcag gcgcccctgc ccgtcactcc gtggctgctg 46020
tccgggcacg acgaacaagg cctgcgcgct caggccgaaa cactcgtgag ctggttgcgg 46080
gaacagccgg agggctctgt caccgacatc ggccacgccc tggccacccg ccgggccgca 46140
ctggaacacc gagcagccct gccggtcacc gatcgggacg aggcgctcgc ccggctcgcc 46200
gagttcgcgg ccggccgcgt ccccgacggg ctgctgcgcg gcacggccca agagggctgc 46260
ctcgccctgc tcttcgccgg acagggcacg cagcggcccg gcatggggcg tgacctgtac 46320
gcggccttcc ccgcgttcgc ccacgccttc gacgaggcct gcgcacatct cgaccccctg 46380
ctcggacggc ccttgcgtga caccgtgttc accgccgagg ccgccgaact cgaccggacc 46440
gccatcaccc agcccgccct cttcgccctc gaagtggccc tgtaccggct gctggagtcc 46500
tggggcgtgg agccggaata cgtcctcggc cactccgtcg gcgagatcgc cgccgcccat 46560
gcggccgggg tcctcgacct gcccgacgcc gcccggctgg tggccgcccg ggggcgcctg 46620
atgcaggccc tgccgcccgg cggagccatg ctggccgtgc aggtcgggga aacggaggcc 46680
accgaggcgc tcggcgcggt gctcggcgag agggcggcca ccgtggacct ggccgccgtc 46740
aacggccccc actcggtggt gttctccggc accgctcgat ccgtggacgc gctcgacgcg 46800
cacttcaccg cgcggggtcg gcggacccgc cggctcaccg tgagccacgc cttccactcg 46860
ccgctcatgg aaccgatgct cgacgagttc gccgaactgg tgtcccggct gaccttcgcc 46920
gcgcccagga tccccgttgt ctccgatctc accggatccg ttctcggcgc gggcgatctc 46980
gccgaccctc gccactgggt aaggcatgcc cggcacaccg tccgcttcgc agacggcatc 47040
gacaccctcg tcggcgcagg cgtcaccgac ttcctggaac tgggcccgga cgccacgctc 47100
gccacgatgg ccgaggactg cttcgccacc gcccccaccg gcgtgtgcac ttcgctgctg 47160
cgccgtgacg gatcggaacc ggtcaccctg ctgatggccc tggcccgtgc ccatgtgcac 47220
ggcgtcaccg tcgactggaa ggccgtcctc gccggcaccg gcgctcggtg ggtggacctg 47280
ccgacctacg ccttccagcg ggagtcctac tggcccgcgg agtccacggc cggacgcagc 47340
gacccatcct cggccggctt cgacgacacc gggcaccccc tgctcggcgc catggtcggc 47400
gcagccggcg gcgacgtcct gttcaccggc gagctctcgc tggccgccca gccctggctg 47460
gctgaccacc gcgtcctgga cgccgtcctg tttcccggca ccggcttcct cgaactcgcc 47520
tcctgggcgg gcagccgcct ggacgccggc gacctggagg aactggtcgt ccaccgcccg 47580
ctggtgctgc ccgaacacgg cggcgtcacc gtgcaggtgg tcgtcggcga ggccaccgat 47640
gaggaccgca ggccggtcgc cgtctactcc cgcgccgccg acgacgccgg atggaccagg 47700
catgcggagg gactgctcgc caccggaccc gcagcccagc cggccgaccc gtcggcccac 47760
tggccgccgc agggcgccga gcgcgtcgac ctggacgagt tctacgccgg tctggccgac 47820
gccgggaccg cctacgggcc ggtgttccag ggcctcaccg cggtctggcg gctggacggc 47880
gagatctacg ccgacgtggc gctgcccgcg caggcggccg acgacgccag gggcttcgga 47940
gttcaccccg cactgctgga cgccgctctg cacaccctcg cgttcctgcc cggcgccgac 48000
cggagcagcg gcccgttcct gccgttcgcc tggcgggacg tcaccgtccc cggccccggc 48060
gccacctctt gccggatccg cctcacgccc ggcaacggaa ccgacgaggt ggccgctacc 48120
ctctgggacg gcgacggtcg gccgctcgca gccgtgggcg gactgagcct gcgcagcgtc 48180
tcccgcactc aactgggcac gtccgcggtc gcgtcgtccc tgttccgtat ggactggaca 48240
cccgcctcac agcccagggc cgtcggcgcc ccgacggtcc gctgggccgt ggtgggcccg 48300
gacgcccccg gaacacccga catcgaccac tacgccgacc tcgtggccct gcgccggcac 48360
ctcgccgacg gcggcccggt acccgaccag gtactcctgc cgtgtgcccc ctccgccggc 48420
ggcgccgacg ccggcgcagc ccgcgacgcc gtgcacgcgg cgctgcacac tctgcgtacc 48480
tgggcggagg acgagcactt cgccaagagc cggctggtgc tgtgcacccg cggcgccgtc 48540
gtggcacaac cgggcgaagg cgtacgcgac ctggcgcacg ccgcggtctg gggcctcgcc 48600
cgcagcgcac agctggaaca ccccgaccgg ttcgtcctgg tcgacctcga caccggcacc 48660
acgctcgacg acctcacccg gtcgcagctc ctggcccgga ccgagtccac cgacgcggcc 48720
cagttcgcga tccgcggcgc cctgaccctc gtaccggccg tcacccgtca ggccggacag 48780
gtcccggcgc cggaagcacc gtggccggcc gacggcacca ccctgatcac cggagccggc 48840
ggcatgatcg gcggactgct cgcccggcat ctcgtccgcg aacacggcgt acgccacctg 48900
ctactcctcg gccgccgcgg cgaggacacc ccgggcatgg ccgagctgcg ccgggaactc 48960
accgacgcgg gagccgacgt ccacgtcacc gcctgcgacg ccgccgaccg ggaggccctg 49020
gctgccgtac tcggccgcat cccgtccact gcccccttga ccgccgtcgt gcacgccgcg 49080
ggcgtcgtcg acgacggggt gctcggctcg gtcactgacg agcaggtcga ccgcgtccta 49140
cgccccaaga tcgacgccgc ggtgaacctg caccacctca ccgcccccct cggtctgcgc 49200
gccttcgtcg tctgctcctc cctcgccgga gccctcggcg gcggcggcca gtccgcctac 49260
gccgctgcca acgcctacct ggacgccctg tgcctgcgac ggcgggccga tggcctgccc 49320
gcgctctcgc tcgcctgggg tccttgggag agcagcgccg gcatgaccgc ccagctcgcc 49380
gcggccgacc tgcgacgcat ctcccgcgcg ggcatgcagc cgctcacccc ggacgacggg 49440
ttggccctct tcgacgcggc ccacgccacc ggggaagcgg tgctgctgcc cttccgcttc 49500
gaacccggcg gcctgtccac cgccgaccgg gcgtccctgc cccccgccct gcgccccctg 49560
gtgccccgac cccgacgccg gcctggcgac cccgtccccg gcctgtccgg tctccgcgac 49620
cgcctgcgcc ccctgtcgca ggacgaccgg accggcgccc tggagaatct cgtccgcgcc 49680
gaggtggcct cggtgctcgc cctgccttcg gcggacgcgg taccggtcac caaggcgttc 49740
aagaccctcg gcttcgactc cctgatggcc gtcgacctcc gcaatcgcct cagcgccctc 49800
accggtgtca ggctccccgc gaccctggtc ttcgaccacc ccaccccacg ggccctggcc 49860
acccgcctgc tcaccggcat ggaactggac accgccaccg ccaccgaccc ggccctgctc 49920
gccctgcgcg aactcgaaac cgcggtccgc tcgatggcgc ccggtgccga cgaccgcgga 49980
gcgatggcga cccggctgcg ggtgctgctg acagcgctcg aggagaccgc ggacgacacc 50040
gacggtgcgg acacggacgg cgataccgac ctcgactcgg tgagcaccga ggaactcgtc 50100
aacctgctcg gcgacgagtt cggcctcacc tgagaaccac ccctgcctgc accacccgac 50160
ccgaacttag gggtgttcgc ggtcctgaac tggggccggg atccgcgtcc tggcccccta 50220
gcctgcaaac aggcctgtcc ttgcgcattg acgaaacacc tgagtgggag ttgagcatga 50280
gcagttccat gtcggagatc gtcgacgcgc tgcgggcctc actgctggag aacgagcggc 50340
tgcgccagca gaaccagcgg ctcagcgcgg catcctcgga gcccctcgcc atcgtgggca 50400
tcggctgccg ctatcccggc ggagtccgtg ataccgaggg cctgtggcag ctcatcgccg 50460
agggccgtga cgccatgtcg gacttcccca ccgaccgtgg atgggaggac cgggatgtcc 50520
ccgccgcccg caccggcgct ttcctccacg acgcgggcga cttcgacccc gcgttcttcc 50580
gcatctcgcc gcgggaggcg atggcgatgg acccccagca gcggctgctg ctggaaacct 50640
cctgggaagc cctcgaacgc gccggtatcg acccggtctc gctcaagggc agccgcaccg 50700
gcgtgttcat cggcggcgcc ccccaggagt acggcgcgct cgtgatgaac tcagcccagg 50760
gcgccggagg ctacgcactc accggcgccc ccggcagtgt cctgtccggc cggatctcct 50820
acgtgctggg cctggagggc ccggcggtca cggtggatac cgcgtgctcg tcctccctcg 50880
tcgccctgca cctcgcgatc aagtcgctgc gcaccggcga gtgcgacctc gcgctggccg 50940
gcggcgttct tgtcctgatc acgccgacca tcttcaccga gttctccgcc accggcggat 51000
cggccggcga cggccgctgc aaggcgttct cctcggacgc ggacggcacc ggctggggcg 51060
aaggcgcggg cgtcctcgcg atccaacgcc tgtccgacgc gcgccgggac ggcaaccccg 51120
tcctcgcggt gatccgcggg tcggccgtga accaggacgg tgcgtcgaac ggtctgagtg 51180
ctccgaacgg tccgtcgcag cagcgggtga tccggcaggc gatcgccaat gccgggctga 51240
ccctcgcgga cgtcgacatg gtcgaggcgc atggcaccgg caccacgctc ggcgacccca 51300
tcgaagccga ggcgctgctc gccacctacg gccaggaacg gcacgacggc cggcctctct 51360
ggctcggcac cctcaagtcg aacgtcggtc acacccaggc tgcggccggc atctccggcg 51420
tcatcaaggc cgccctcgcg ctccagcacg gcatcatgcc caagacgctg cacgtggacg 51480
agccgacgcc ggaggtcgac tggtcggcgg gtgcggtgga gctgctgacc gaggcacgtc 51540
agtggccgga gaccgggcag ccgcgtcgcg tgggtgtgtc gtccttcggg atcagcggca 51600
cgaacgccca cgtcatcctg gagcaggccc ccgaggccgc cccggcggaa caggcggacg 51660
gggacgcccc ggcggagctg ccggtgacac cgtgggtggt caccggccgg aacgaggcgg 51720
cgctgcgcga gcaggccgca cggctgctgg accacctcac gcagcagccc gacctgagcc 51780
cgcgggacgt gggcttctcg ctggtaggga cacgctcggc gttcgagcag cgcgcggtcg 51840
tgctgggcgg cgacatggcg gcgctgaccg agggggtccg cgccctggcg gcccaggagc 51900
cgaacaccca tgtgatcgcc ggcacggccg aggtccgcag cggcatcgtc ttcgtgttcc 51960
cgggtcaggg gtcgcagtgg gttggtatgg ggagggagtt gtgggatgcc tcgccggtgt 52020
tcgcggagtc gatggtggcg tgtgagcgtg cgctggcgcc gttcgtggac tggtcgctga 52080
aggatgtggt gttccggggc gcggaggatc cgctgtgggc ccgtgtggat gtggtgcagc 52140
cggtgttgtg ggcggtgatg gtgtcgctgg ctgcggtgtg gcggtccttc ggggtggagc 52200
ctgttgcggt ggtggggcat tcgcagggtg aggtggcggc tgcgtgtgtg gctggtgggt 52260
tgtcgttgga ggatggtgcg cgggtggttg cggtgcggtc gcggctggtg cgggagaagt 52320
tgtcgggtct gggtgggatg ggttcggtgg cgcttcctgt ggaggcggtg gaggtgcgtc 52380
tgggccggtt cgggggccgg gtcggggtgg cggcggtgaa cgggccgacg tcggtggtgg 52440
tctccggtga ggtcgaggcg ctggacgcgc tgctggcgga gtgtgaggag gcgggggtgc 52500
gggcccgtcg tatcgcggtg gactacgcct cgcattcggc gcaggtggat gcgctcaccg 52560
acgacctgct ggcggagctg gccgagttgc ggccgcagtc ctcgtcggtg gcgttctatt 52620
cgacggtgac cggtgagcgg ttggacacgg ccgggctgga cgccaggtac tgggtgacga 52680
acctgcgcga gcgggtcaac ttcgagcccg tcacgcgact gctggccgag aagggggccg 52740
gtgtcttcgt cgagtccagc ccgcacccgg tgctgacggt cgcggtgacg gagaccggcg 52800
aggccgcgga ccggtcggtg gtggccgtgg gttcgctgcg gcgcgaggag ggcggtcttc 52860
ggcggttcct ggcatcgctg gccgaggcgt acgtcgctgg tgtcccggtg gactggtcgg 52920
tgacgttcgc cgggagtggc gcccgtcggg tggacctgcc cacctacgcc ttccagcacc 52980
aacgctactg gctggacgac gtggtgttgc cggggcaggg cggtggcggt tcgtccgatc 53040
cggcggacgc ggcgttctgg ggcgccgtcg agcgcgcgga cgccgagagt gtggtctcgc 53100
tggtggacgg ggcggacgcg caggtgtggg agtcggtgct tccggcgttg tcggcctggc 53160
gcaaggggcg tcgtacgcag tcgacgctcg actcgtggcg gtaccggacg gtgtggcgtt 53220
cggtgacggt gtcgtcggcg gcttcgctat gtggtgtgtg gctggtggtc agctctggtc 53280
cgggtgctcc ggtggagcag gtcacgctgg cgctgacggc tgcgggggct gaggtgcggg 53340
tgctggatgt gcctgtggag cgtggggctt tggcggagtg gtttgccgaa gcgggtgagg 53400
tcgcgggtgt ggtgtcgctg ctggcgtggg acgaggatga ggcgttggcg tcgtcgctcg 53460
cgttggtgca ggcgcatggg gatgccgggt tgtcggcgcc ggtgtgggtg ctgacgcggg 53520
gtgcggcggc tgtgggctcg gatgatgccg tatgcgcgac gcagacgtcg ctgtgggcgt 53580
ggggtcaggt cgtcggcttg gagctgcccg ctgtgtgggg cggtctggtg gacgttcctg 53640
ccgagtggga tgggcgggtg tcgtccgcgc tggctgcggt gctggcggct ggtgagggcg 53700
aggaccaggt cgcggtgcgg tcctcgggtg tgtacgcgcg tcgtctggtg tgggcgccgc 53760
tgggcgcggg tgcggctgcg gtgcgggagt tcaagccgca gggcaccgtg ctgatcaccg 53820
gtggcaccgg tggtgtcggc ggtcatctgg cgcgctggct ggcgagggag ggcgccgagc 53880
acctgctgct ggtcaaccgc actggtgaag gagctgctga acttctcgaa gagctgcgtg 53940
gctcgggtgc ggaggtgacg gtggcggcgt gtgatgtgac cgatcgggcg gctttggcgg 54000
aactgcttgc tggaatccct gccgaacgtc ctttgaccgc cgtgttccat gctgcggggg 54060
tcgcgggcta cggtctggtc cgcgaactgg acgtggcgga tctggatgtc gagatggccg 54120
ccaggaccct cggtgcccgt catcttgacg agctgaccgc cgaactcggc ctggatctgg 54180
atgcgttcgt ggtgttctcc acgggggctt cggtgtgggg gagtgcgggg aacggggcga 54240
atgcggctgc gggtggttat ctggatggtc tgatccgtgg tcgtcgggcg cgtgggctgg 54300
tgggttcgtc ggtgtcgtgg ggtggctggg gggccacggc tatggcggtg ggggagacgg 54360
cggagcggtt gtcgcgtcgt ggggtgcggt tgctggagcc ggagttggcg gttcgggcgt 54420
tgcgtcaggt gctggagcag gatgaggtgt cggtgacggt ggccgacctg gactggtcgt 54480
tgttcacgcc ggggtacgcg atggcgcggc gccggccgct gatcgaggac atccccgaag 54540
ccgcccgggc actgcgtgac atcaccgaga ctgacgagac ccaggacgcg gcggccggag 54600
gactgcggga gcggctggcc gggctggcgg agtcggagca gcaggcgttg ctgctggggc 54660
tggtgcgggg tgaggccgcg caggtgctgg cgcacgggtc gacggcggag atcacgccga 54720
gcaggccgtt caaggagctc gggttcgact cgctgaccgg gatggagctg cgcaaccgac 54780
tgtccaaggc caccggactc cggctgcccg ccaccctcgt cttcgactac cccaacctcc 54840
agcaactggc ttccctgctg cgtacggcgc tcatcgacgg tcttccgggg gccggcgccg 54900
tcgcgacgac ggtccggctg gtggacgacg aaccgctcgc aatcatcggt atggcctgcc 54960
gctacccggg tgacgtccgc gatcccgagg acctgtggcg actggtctcc gaaggccgcg 55020
acgaactgtc ggacttcccc accgaccgcg gctgggaacg ttggggtacg cccgcggtcg 55080
gtcaggccgg attcctgcac gaggccgggg acttcgacgc tgccttcttc gggatctcgc 55140
cccgtgaggc cgcgagcatg gacccgcagc agcggcttct gctggaggtg tcgtgggagg 55200
ccttcgagca ggccggcatc gacccctggt cgctgcgcaa cagccccacc ggggtcttcg 55260
tcggcggcgg cccgcaggac tatcccacgg tgctgatggg ctcggccgag gccgccagcg 55320
gctacggcat gaccggcgcg ctcggcagtg tgatgtccgg ccgggtctcc tacatgctgg 55380
gcctggaggg gccggcggtc acggtggaca ccgcgtgctc gtcctccctg gtcgccctcc 55440
acctggccgc gcagtccctg cacaacggcg agtgcggtct ggccgtggcc ggcggcgtga 55500
ccatcatggc cacgccgggc gcgttcctcg ggttcgacac gttgggcggc ttggctgagg 55560
acggccgctg caaggccttc gcggcgtccg cggacggcac cggctgggcc gaaggcgtcg 55620
gcatggtcgt cctcgagcgc ctgtcggacg cgcgccgcaa cgggcacgag gtgctggcgg 55680
tggtccgcgg gtcggccgtg aaccaggacg gtgcgtcgaa cggtctgagt gctccgaacg 55740
gtccgtcgca gcagcgggtg atccgccagg cgctggcgaa cgccggcctg tccgccgcgg 55800
acgtcgacat ggtcgaggcg catggcaccg gcaccacgct cggcgacccc atcgaggcgc 55860
aggcgctgct ggccacctac ggccaggacc gcccggccga ccggccgctg tggctcggct 55920
cggtgaagtc caacttcggt cacacgggtg ccgccgccgg tgtcgcgggc gtcatcaagt 55980
ccgtactggc gctgcggcac ggactgatgc cgaagaccct gcatgtcgac gagccgacgc 56040
ctgaggtcga ctggtcggcg ggtgcggtgg agctgctgac cgaggcacgt cagtggccgg 56100
agacggagca gccgcgtcgc gtgggtgtgt cgtccttcgg gatcagcggg acgaacgcgc 56160
acttgatcct ggaggaggct ccgcaggccg cggccgtgga ggacgagcgg gacgggtccg 56220
tggccccggt gtcgtcgccg gtggtgccgt gggtcgtgtc gggccgctcg gagaccgcgt 56280
tgcgggcgca ggcggcgcga ctggcggagc atctggcgca gcggccggaa gcgggcgcgc 56340
tggacgtggg cttctcgctg gtggagtcgc ggtcggcgtt cgaacagcgt gcggtggtgc 56400
tgggcgcgga ccgggaggag ttgctggccg gggtacgcgc ggtgggggag ggcgcccagg 56460
cgtccggtgt ggtcaccggg cgggccgctc aatccggtgt ggtgttcgtg ttcccgggtc 56520
aggggtcgca gtgggttggt atggggaggg agttgtggga tgcctcgccg gtgttcgcgg 56580
agtcgatggt ggcgtgtgag cgtgcgctgg cgccgttcgt ggactggtcg ctgaaggatg 56640
tggtgttccg gggcgcggag gatccgctgt gggcccgtgt ggatgtggtg cagccggtgt 56700
tgtgggcggt gatggtgtcg ctggctgcgg tgtggcggtc cttcggggtg gagcctgttg 56760
cggtggtggg gcattcgcag ggtgaggtgg cggctgcgtg tgtggctggt gggttgtcgt 56820
tggaggatgg tgcgcgggtg gtcgcggtgc ggtcgcggct ggtgcgggag aagttgtcgg 56880
gtctgggtgg gatgggttcg gtggcgcttc ctgtggaggc ggtggaggtg cgtctgggcc 56940
ggttcggggg ccgggtcggg gtggcggcgg tgaacgggcc gacgtcggtg gtggtctccg 57000
gtgaggtcga ggcgctggac gcgctgctgg cggagtgtga ggaggcgggg gtgcgggccc 57060
gtcgtatcgc ggtggactac gcctcgcatt cggcgcaggt ggatgcgctc accgacgacc 57120
tgctggcgga gctggccgag ttgcggccgc agtcctcgtc ggtggcgttc tattcgacgg 57180
tgaccggtga gcggttggac acggccgggc tggacgccag gtactgggtg acgaacctgc 57240
gcgagcgggt caacttcgag ccggtgacac gtctgctggc cgaacgggaa caccaattct 57300
tcgtcgagtc cagcccgcac ccggtgctga cggtcgcggt gacggagacc ggcgaggccg 57360
cggaccggtc ggtggtggcc gtgggttcgc tgcggcgcga ggagggcggc gtccagcgcc 57420
tgttgacgtc gctggccgag gcgtacgtcg ctggggtgcc cgtcgactgg tcgaagacct 57480
tccacggcac cggtgcccag tccgtggacc tgcccaccta cgccttccag caccagcact 57540
actggctgga cgacgtggtg ttgccggggc agggcggtgg cggttcgtcc gatccggcgg 57600
acgcggcgtt ctggggcgcc gtcgagcgcg cggacatcga cagcgtggcc tcgatcgtcg 57660
acggggtcga ccagcaggcc tgggaaagcg tcgtcccggc gctgtcggcc tggcgcaagg 57720
ggcgtcagga gcgagcgcta ctggattcct ggcggtaccg gacggtgtgg cgttcggtga 57780
cggtgtcgtc ggcggcttcg ctatgtggtg tgtggctggt ggtcagctct ggtccgggtg 57840
ctccggtgga gcaggtcacg ctggcgctga cggctgcggg ggctgaggtg cgggtgctgg 57900
atgtgcctgt ggagcgtggg gctttggcgg agtggtttgc cgaagcgggt gaggtcgcgg 57960
gtgtggtgtc gctgctggcg tgggacgagg atgaggcgtt ggcgtcgtcg ctcgcgttgg 58020
tgcaggcgca tggggatgcc gggttgtcgg cgccggtgtg ggtgctgacg cggggtgcgg 58080
cggctgtggg ctcggatgat gccgtatgcg cgacgcagac gtcgctgtgg gcgtggggtc 58140
aggtcgtcgg cttggagctg cccgctgtgt ggggcggtct ggtggacgtt cctgccgagt 58200
gggatgggcg ggtgtcgtcc gcgctggctg cggtgctggc ggctggtgag ggcgaggacc 58260
aggtcgcggt gcggtcctcg ggtgtgtacg cgcgtcgtct ggtgtgggcg ccgctgggcg 58320
cgggtgcggc tgcggtgcgg gagttcaagc cgcagggcac cgtgctgatc accggtggca 58380
ccggtggtgt cggcggtcat ctggcgcgct ggctggcgag ggagggcgcc gagcacctgc 58440
tgctggtcaa ccgcactggt gaaggagctg ctgaacttct cgaagagctg cgtggctcgg 58500
gtgcggaggt gacggtggcg gcgtgtgatg tgaccgatcg ggcggctttg gcggaactgc 58560
ttgctggaat ccctgccgaa cgtcctttga ccgccgtgtt ccatgctgcg ggggtcgcgg 58620
gctacggtct ggtccgcgaa ctggacgcgg cggatctgga tgccgagatg gccgccaaga 58680
ccctcggtgc ccgtcatctt gacgagctga ccgccgaact cggcctggac ctggaggcgt 58740
tcgttctctt ttcctccggc gccgctgtgt ggggaagtgc gggaagcggt ggttacgcgg 58800
cggcgaacgg gtacttggat ggtctggcgc aggagcgtcg ggcgcgtggt ctggcggcga 58860
cgtcggtgtc gtggggcaac tggaaggaca ccggtctggc gaccgatacg accgcggagc 58920
agttggcacg tctcggtgtc cggccgatgg atccggcgct ggcggtagcg gccctccggc 58980
aggtgctgga gcacgacgag atcgcgctga ccgtgaccga catggactgg gcgcgcttcg 59040
cccccggcta cacgctggcc cgccgccgcc cgctgatcga ggacatcccc gaagccaccc 59100
gcgcgctcag cgaggactcc gccgacccgg cgaacgacat ggccggagcc gccctgcggg 59160
ccgagctgga aggactgggc cgggccgagc agctcgccgt gctcatggac ctggtgcgta 59220
gtgaggtcac ccgcatcctg gcgggtgcct ccgcggccga catcacgccg gagaggccgt 59280
tcaaggagct cgggttcgac tcgctgaccg cgatggaact gcgcaacctg ctcaccatcg 59340
ccaccggact gcgcctgccc gccacccttg tcttcgacta ccccaatccg cgacagcttg 59400
ccgcccatct gtgcgacgaa ctgatcggcg ttggcgcgga tcccgtgggg gccgacgtcg 59460
tcgtacgcgg ctcgtccgac gaaccgctgg ccgtcgtcgg catggcctgc cgttacgcgg 59520
gcggcgtgtc gacccccgag gacctgtggc agatggtggc ggagaacagg gaagggctca 59580
ccgacgtccc ctcctatcga gggtgggagg ggtggaacgt cgccagcctt cgtcgcgccg 59640
gcttcctgca cgaggcgggt gacttcgacg ccggtttctt cgggatctcg ccgcgtgagg 59700
ccgcgaccat ggacccgcag cagcggcttc tgctggaggt gtcgtgggag gccgtggagc 59760
gggccggtat cgaccccaag tcgctgcggg gcagtgacac aggcgttttc gtgggcggta 59820
cggccgtcga gtacggcgca ctgctgatga actcgccgac cggccagggc tacgcagtca 59880
ccagctcctc cggcagcgtc ttgtcgggtc gtgtctccta caccctcggc ctggaagggc 59940
ccgccgtcag cgtggacacg gcatgctcgt cctccctcgt cgccctgcac ctcgccgccc 60000
aggcgttgcg caacggcgag tgcggcctcg cgctcaccgg tggtgtcggt ctgatggcca 60060
cacctggcgg gttcgtggag ttcgacacgc ttggcggact gtcgtccgac ggccatacca 60120
aagcctttgc agcgtccgcc gacggtatcg gctggggcga aggcgtcggc atgatcgtgc 60180
tggaacgttt gtcggacgcg cgccgcaacg ggcacgaggt gctggcggtg gtccgcgggt 60240
cggcggtcaa ccaggacggt gcgtcgaacg gtctgagtgc tccgaacggt ccgtcgcagc 60300
agcgggtgat ccggcaggcg gtcgccaatg ccgggctgac cctcgcggac atcgacatgg 60360
tcgaggcgca cggcaccggc accacgctcg gcgaccccat cgaggcgcag gccctgctga 60420
acacctacgg tcaggaacga cacgacggcc aaccgctgtg gctcggctcc gtgaaaacca 60480
acatcgggca cacgggcgct gccgcgggtg tggcgggcat catcaagtcc gtcctcgccc 60540
tgcgcaacgg cgtcatgccg atgaccctga acgtggacgg gccgacaccg aaggtcgact 60600
ggtcggcggg agcggtggag ttgctgaccc aggggcggga atggccccag acggaccgta 60660
cgcggcgtgc gggtgtgtcc tcgttcggga tcagcggcac caacgcccat gtgatcatcg 60720
aggaggcacc cccggccgag gaacccccgg cccagcccgg gaccgacctt ccggcggccc 60780
ccgcactcgc gacaccggtc gttccgtggg tgttctccgg acggtcgaac ggagccctgc 60840
gcggccaggc cgagcgcctg tcagcactgg cggagaacga acccggcctc gacctcaccg 60900
acgcggcgtt ctccctggcg acgacgcgag ccagtctgga acaccgcgcc gtggtgctcg 60960
gccgtgacac gtcggaaatg ctcgacggcc tgcgcgggct caccgcacag ggctcggtcg 61020
ccggcgtggt ctccggtgtt accgctgccg acagccgtgc tgtctttgtg tttcctggtc 61080
aggggtcgca gtgggtgggg atggggcggg agttgtggga ggtttcgtct gtttttgctg 61140
agtcgatggt ggcgtgtgag cgggcgttgg tgccgtttgt ggattggtcg ttgcgggatg 61200
tggtgttcgg gggtgggggt gatgggttgt gggagcgggt ggatgtggtg cagccggtgt 61260
tgtgggcggt gatggtgtcg ttggcggcgg tgtggcggtc gtttggtgtg gagccggctg 61320
cggtggtggg gcattcgcag ggtgaggttg cggcggcgtg tgtggcgggg gggttgtcgt 61380
tggaggatgg tgcccgggtg gtggcggtgc gttcgcgtct ggtgcgggat gggttgtcgg 61440
ggcggggtgg gatggtgtcg gtggggttgt cggtgggtga ggtggaggag tggttggccg 61500
ggttgggggg tcgggtgggg gtggcggcgg tgaatgggcc gtcgtcggtg gtggtttcgg 61560
gtgaggcgga ggtgttggag gggttgttgg cggggtttga gggtgcgggg gtgcgggcgc 61620
gtcggatcgc ggtggattat gcgtcgcatt cggtgcaggt ggatgcgctc ggtgatgatc 61680
tgctggcggg gctggcgggt attcggccgg tgtcgtcgtc ggtggcgttc tattcgacgg 61740
tgtccgggga gcggatggac acggcggggc tggatgcggg gtactgggtg gcgaatttgc 61800
gggagcgggt gttgttcgag ccggtggtgc ggatgctggt ggagcggggc agtgcggtgt 61860
tcgtggagtc cagtccgcat ccggtgcttg ccatggcggt ccaggagacc ggtgaggctg 61920
tgggccggtc ggtggtcgcg gtggggtcgt tgcggcggga tgacggcggt gctggacggt 61980
ttttggcgtc gttggcggag gcgtatgtgg tgggtgcgcc ggtggactgg tcggtgttgt 62040
tcgcgggcgc gggtgcgcgg cgggtggatc tgccgacgta tgccttccag caccagcgct 62100
actggctgga gggtgtcacc gtcggaggcg agccccagga cacggtggag gatgacacgg 62160
atgccgcgtt ctgggacgcc gtggagcgcg agagcctgtc cgacctcgct gaggtactcg 62220
acgtctccga tgccggcgct gcggccgagg cctggctgcc cacgctgtcg gcctggcgca 62280
agggccgccg taggcagatg accctcgatt cgtggcgcta ccggactact tggcgcgcgt 62340
acagcctgcc ctcaggaacc cgcctgtcgg ggatgtgggt ggtggtggct tctggtgggg 62400
atgcgccggt ggtggaggtg cggcgggcgt tggaggcggc tggtgcggag gtgtccgttc 62460
gggaggttct cgacggtgtg gcactcgcgg atgtgtcggg tgtggtgtcg ttgctggcgt 62520
gggatgaggg gtccgcgttg gagtcgatgt tgcggttggt gcgggcggtt ggtggtggtg 62580
aggtgccgtt gtgggtgctg acgcggggtg ccgcggtggt gggtgtggat gatccggtgt 62640
cggcggtgca gtcgcaggtg tgggcgttgg ggcaggtggt ggggttggaa cagccccagg 62700
gttggggtgg tctggtggat gttcccgggg tgtgggatga gcgggtggcg tcgttgttgg 62760
ctggtgtgct ggcggctggt gagggtgagg atcaggtcgc ggtgcgttcg tcgggtgtgt 62820
atgggcgtcg tctggtgcgt gctccgcttg gtgggagtcc ggtgccggtg cgggagtggg 62880
gtccgtcggg cacggtcctg gtcaccggtg gtactggtgg gatcggtggg catctggcgc 62940
ggtggctggc gaaggagggt gccgagcacc tgttgttggt cagccgtggt gagcgggccc 63000
agggtgcggc cgaactggtc gaggaggtgc gcgggctggg cgcggaggtg acggtcgccg 63060
cgtgtgatgt gaccgaccgg gcggctctcg cggaactgct cgccgagcat cccgtcacct 63120
cgatcttcca caccgccggg atcgccgcgc acggcttcct gaccgacctc gacccggctg 63180
agctcgggga ccagatgggg gcccgtgtgg tcggggcgcg tcacctggat gagctgtccg 63240
ttgagttggg cttggatctg gatgcgttcg tggtgttctc cacgggggct tcggtgtggg 63300
ggagtgcggg gaacggggcg aatgcggctg cgggtggtta tctggatggt ctgatccgtg 63360
gtcgtcgggc gcgtgggctg gtgggttcgt cggtgtcgtg gggtggctgg ggggccacgg 63420
ctatggcggt gggggagacg gcggagcggt tgtcgcgtcg tggggtgcgg ttgctggagc 63480
cggagttggc ggttcgggcg ttgcgtcagg tgctggagca ggatgaggtg tcggtgacgg 63540
tggccgacct ggactggtcg ttgttcacgc cggggtacgc gatggcgcgg cgccggccgc 63600
tgatcgagga catccccgaa gccgcccggg cactgcgtga catcaccgag actgacgaga 63660
cccaggacgc ggcggccgga ggactgcggg agcggctggc cgggctggcg gagtcggagc 63720
agcaggcgtt gctgctgggg ctggtgcggg gtgaggccgc gcaggtgctg gcgcacgggt 63780
cgacggcgga gatcacgccg agcaggccgt tcaaggagct cgggttcgac tcgctgaccg 63840
ggatggagct gcgcaaccga ctgtccaagg ccaccggact ccggctgccc gccaccctcg 63900
tcttcgacta ccccaacccg caacgcgtca ccgatctctt gctcaccgat ctcgaccagc 63960
aggatggccg accgggcatc gccgacgttc tcgacatcaa gcgggaactg tcccggatcg 64020
gtgaggcact cgagggcgtc gcacccgatc aacaggcccg tgaggacatc gtcgcccacc 64080
ttcgcgatct gatcacccag ctcagcgcta ccgagcagca cggtgccacc gatctcgaag 64140
ccgccacgga cgacgagatc ttcgacttca tcgaccgcga cctaggcgtg tcctgaacag 64200
gcacctgccg ggttttcaac tgcttcggag tggggtttca cgatgaccga ggacaaactt 64260
cgtacctatc tgcgcagggt tacggccgaa ctgcagcaga cccgccagca gctcaaggac 64320
agccaggacc gagggcggga gccgctcgcc atcgtgggaa tggcctgtcg acttcccggc 64380
ggggccgact cgccggagca actgtggcag atggtgaggg acggcgccga cggggtgggc 64440
ggattcccgg acgaccgcgg ctgggacctt acctcgctcc tcagcgacga tcccgaccgt 64500
ccgggcacga cgtacaccca ggagggcgcg ttcctgaagg gggcgggtga cttcgacgcc 64560
gggctcttcg gtatctcgcc gcgtgaggcc gcgaccatgg acccgcagca gcgactgctt 64620
ctggagacct cgtgggaggc gttggaacgg gccgggatcg acccgcactc gctgcggggc 64680
agccggaccg gggtattcgt cggcggtacg gccatcgagc acatcgtcaa gctgatgaac 64740
tcgccgaccg atcaggggta cgccatcacc ggcggctcgg ggagcatcat gtccggccgg 64800
atctcctacg tcctgggctt ggaagggccg gcggtcacca tcgacaccgc gtgctcctcg 64860
tctctcgtcg cactgcactc ggccgtacag tcgctccggc agggtgactg ctctctggcg 64920
ctggccggcg gcgttgcggt gatggccaca ccctctgcct tcgtgacctt cgcccggcag 64980
cgcggactgg ccgcagacgg ccgctgcaaa gcgttctccg acgacgcgga cgggatcggc 65040
tggggtgaag gcgtcgccgt cgtgctgctg gaacgtctgt cggacgcgcg gcgcaatggg 65100
catgaggtgc tggcggtggt ccgtgggtcg gcggtcaacc aggatggtgc gtcgaacggt 65160
ctgagtgctc cgaacggtcc gtcgcagcag cgggtgatcc ggcaggcggt cgccaatgcc 65220
gggctgaccc tcgcggacgt cgacatggtc gaggcgcacg gcacggggac cacgctcggc 65280
gaccccatcg aggcgcaggc cctgctgaac acctacggtc aggaacgaca cgacggccaa 65340
ccgctgtggc tcggctcgct gaaatcgaac atcgcacaca cccaaggcgt ctcaggcgtc 65400
gccggcgtca tcaagaccgt gctggccctg cgccacggca ttctgcccaa aaccctgcat 65460
gtgggcgagc ggagcagcca ggttgactgg tccgtcggcg cggtggaact gctcactgag 65520
gcacgggagt ggccggagac ggggcgtccg cggcgggcgg gtgtgtcgtc gttcgggatc 65580
agcggcacca acgtacacgt gatcatcgaa caggccccgc aggaagagtc tgccgagcca 65640
cggacggacg aggcgccctc gttggagtcc cccttcgcca cgaagcccgc cacactgccc 65700
tggctgatct ccggcaacac cgaggccgca ctgcgtgaac aggccgcccg cctgcgggcc 65760
cacctcaatg cccaccccgg cctcgcggca gccgacatcg gtcactccct gctgacgagc 65820
cgcaccagat tcgcccaccg cgcggtgctg ttgaccgagc aggacggcga ccggcgcacc 65880
gcactgaccg ccctcgccga cggactcgac gcccccggcc tgattcgagg caccggtgac 65940
actggcgcgg gtgtggtgtt tgtgtttcct ggtcaggggt cgcagtgggt ggggatgggg 66000
cgggagttgt gggaagtctc gtctgtgttt gctgagtcga tggtggcgtg tgagcgggcg 66060
ttggcgccgt ttgtggggtg gtctttgcgg gatgtggtgt tcgagggtgg gggtgagggg 66120
ctgtggggtc gggtggatgt ggtgcagccg gtgttgtggg cggtgatggt gtcgcttgct 66180
gcggtgtggc ggtcgtttgg tgtggagccg gtgggggtgg tggggcattc gcagggtgag 66240
gtggcggcgg cgtgtgtggc cgggggcttg tcgctggagg acggcgcccg ggtggtggcg 66300
gttcggtcac gcctggtggg agagaggctg tccgggcggg gcgggatggt gtcggtgacg 66360
ttgccggtgg cccaggtgga ggagtggctg gcgggctctg ggggccgggt tggggtggcg 66420
gcggtgaacg ggccgtcgtc ggtggtggtc tcgggtgagg tggaggcgct ggacggcctg 66480
ctggtcgagc tcgatggcgc gggggtgcgg gcgcgccgga tcgcggtgga ctacgcctcg 66540
cattcggcgc aggtggatgc gctcaacgat gatctcctgg cggggttggc ggacattcgg 66600
ccggtgtcgt cgccggtggc gttctactcg acggtgaccg gcgagcggat ggacacggca 66660
gggctggacg ctgcgtattg ggcggcgaat ctgcgggagc gggtgttgtt cgagccggtg 66720
gtccggacgc ttgccgagct ggagcaccag gtgtttgtgg agtccagtcc gcatccggtg 66780
cttgcgatgg cggtccagga gacgttggag agcgcgtccg gggccggtgc tgcagtgggg 66840
tcgctgcggc gggacgatgg cggtgctgga cggttcttgg cgtcgttggc ggaggcgtat 66900
gttgcggggg cgccggtgga ctggtcggtg ttgttcgagg gtacgggtac gcggcgggtg 66960
gatctgccga cgtatgcctt ccagcaccag cgttactggc tcgaagacgc ttccgcaccg 67020
ggtgcggagg gtgtggtgga tccggtggat gcggcgttct ggggtgcggt agagcgagcg 67080
gatgtgcagg gtgttgcggc acttgtggat ggttcggtgc cgggtgtgtg ggagccggtg 67140
gtgccggtgc tgtcggcctg gcgcaagggg cgtgaagaac ggtcggtcct ggattcgtgg 67200
cgttaccgga ctacttggcg tgcgttcagc ctgccctcag gaacccggct gtcggggatg 67260
tggctggtgg tggcttccgg tggggatgcg ccggtggatg aggtgcggca ggcgcttgag 67320
gcggctggtg cggaggtgtg tgttcgggcg gatctcgacg gtgcggcact ggcgggtgtg 67380
tcgggtgtgg tgtcgttgtt ggcgtgggat gaggggtcgg cggtggtgtc gacggtgggg 67440
ttggtgcagg cgtgtggcgg tggtggtgag gtgccgttgt gggtgttgac gcggggtgct 67500
gcggtggtgg gtgtggatga tccggtgtcg gcggtgcagt cgcaggtgtg ggcgttgggg 67560
caggtggtgg ggttggagca gcccggtggt tggggtggtc tggtggatgt tcccggggtg 67620
tgggatgagc gggtggcgtc cttgttggcc ggtgtgctgg cggctggtgg gggtgaggat 67680
caggtggcgg tgcgttcgtc gggtgcgtac gggcgtcgtc tggtgcgtgc tccactgggt 67740
gcgagcccgg tgcgggtgcg ggagtggagt ccgtcgggca cagcgctggt caccggtggt 67800
acgggtggga tcggtgggca tctggcgcgt tggttggcga gggagggtgt cgggcatctg 67860
ctgctggtca gccgccgtgg tccggaggcc gagggcgtgg ccgagctggt cgaggagctg 67920
ggcggcctgg gtgtggaggt gacggttgtc gcgtgtgatg tgaccgatcg ggcggctctc 67980
gcggaactgc tcgccacaat ccccgccgag tatcccctca cgagcgtgtt ccatgctgcg 68040
gggatcgcgg gttacggtct ggttcgcgaa ctggatgccg cggggctgga tgccgagatg 68100
gccgccaaga ctctcggtgc ccgtcatctc gacgagctga ccgccgaact tggcctggat 68160
ctggatgcgt tcgtggtgtt ctcctccggt gccgctgtgt gggggagtgc cggtagcggt 68220
ggttacgcgg cggcgaacgc gtatctggat ggtctggcgc gggagcgccg ggcgcgtggt 68280
ctggtggcga catcggtgtc gtggggcaac tggaagaaca ccggtctggc gaccgacacc 68340
accgcggagc agctgacgcg catcggtgtc cggccgatgg agccggagtt ggcggttcgg 68400
gcgttgcggc aggcgctgga gcaggacgag gtgtcaatga cggtggccga catggactgg 68460
tcgttgttca cgccggggta cgcgctggcc cgccgccgtc cgctgatcga ggagatcccc 68520
gaagccgccc gcgcgctcag cgaggactcc gccgacccgg cgaacgacac ggtcggtggc 68580
gactccccct tgcggcagtc cctcgccgca ctgaccgagt ccgagcagca cgaacggctc 68640
ctcggtgcgg tccgtacgga agcggcggct gttctcaccc actcgacgac cgacgagatc 68700
acggccggca agccgttccg tgacttggga ttcgactccc tgaccgcgat ggaactgcgc 68760
aaccggctca acgccgccac cggactccgc ctgcccgcca cgatcgtctt cgactacccc 68820
acgccccgcc ggctcgcagg acacctgcac gacaagctct tcgacagtgg tgccgaggtc 68880
gcgcttccgc agctgcgggc aacggacgac gacccgatcg tgatcgtggg catggcctgc 68940
cgcttccccg gcggggtgcg cggtcccgag gacctgtgga ggctgctcgc cgaggggcgc 69000
gacgagatga cggagttccc cgcggaccgg ggctggcaag gaccggccat gaacgccttc 69060
gtggaggagt tcggcggcgc ccgacaaggt gccttcctcg cggacgcggc ggagttcgac 69120
gctgcgttct tcgggatctc gccgcgtgag gcgcgggcga tggatccgca gcagcggctg 69180
ctgcttgaga cctcctggga ggtgcttgaa cgcgccggct acgacccggt ctccctgcgc 69240
ggcagccgca ccggcgtgtt tgtcggcggt acgccgcagg aatacacgac ggtcctcatg 69300
aactcggccg aggccggtag cggctacgcg ctcaccggta cctccggcag cgtgatgtcg 69360
ggccgggtcg cctacaccct gggcctggag ggaccggccg tgacgattga cacggcgtgt 69420
tcgtcctcgc ttgtcacgct gcatctggcg gcgcaggcgc tgcgaggcgg agagtgtgac 69480
ctcgccctgg tcggtggcgt gacggtcatg gccacacccg gggcctttgt ggagttcgcc 69540
cgacagggcg gtctggcggg agacgggcgg tgcaaggcgt tcgccgcggg tgccgacggc 69600
accggctggg gcgagggcgt cgggatgctg gccgtccagc ggctctcgga cgcggtgcgg 69660
gacggacgtc gggtgctggc ggtggtgcgg ggctcggcgg tgaactccga cggtgcgtcg 69720
aacgggctga cagcgccgaa cggtccgtcg cagcagcggg tgatccggca ggcgttggcc 69780
tcggcggggc tttcggcggc ggatgtcgat gtggtggagg ggcacgggac gggtacggcg 69840
ctgggtgatc cgatcgaggc gcaggcgctg ctggccacct acggtcagga ccgtccggcg 69900
gaccggccgt tgtggctcgg ttcggtgaag tccaacatcg gacacaccca gtacgccgcc 69960
ggagtcgccg gtgtgatcaa ggccgtactc gcgctccagc accgtctgct gccgaagacg 70020
ctgcatgtgg aggagccgac gccggaggtg gactggtcgt cgggtgcggt gggagtgctg 70080
acagaggcgc gggagtggcc ggagacggga cgtccgcggc gtgcgggggt gtcggcgttc 70140
gggatcagcg ggacgaacgc gcacgtgatt ctggagcagg ctccggaagc cgtagaggag 70200
agcgcgtctg gtgagaccgg ttcggtgctg gtgccgtggg tgatctcggc gcggtcggag 70260
caggcgttgc gagagcaggc gcggcggctg gccggacacc tgcgcgcaca tgacctgcgc 70320
cccgtcgatg tggggttctc gctggccacg acacgggcgg ggctggagca ccgggcggtg 70380
ctggtgggac gggagacgtc ggagttcctg gcccagctgg agacggtggc cggggacggg 70440
ccggtgtcgg agggcgggac ggcgtttctg ttctccgggc agggctcgca gcgggcgggg 70500
atgggcaggg aattgtatga ggcatatccg gtgttcgcgg ccgctttcga tgaggtgtgc 70560
gggcatctgg acgtgctcct ggagcgtccg gtgaaggaag tggtcttcgc cggtggcaag 70620
gcgctggacc ggacggtgtt cacccaggcg ggtctgtttg cgcttgaggt ggcgttgttc 70680
gagctggtgg gttcgtgggg ggtgcgggcg gatgtgctgc tggggcactc catcggcgag 70740
ctggccgcgg cgtacgcggc gggcgtgtgg tcgctcgagg acgcgtgccg ggtggtggcg 70800
gcgcggggcc ggctgatgca ggccctgccg gagggcgggg tcatggtcgc ggtggaagcc 70860
gcggaggagg agctgcccca gttgccggcg ggggtgtcgg tggcggcggt gaacgggccg 70920
cgttcgctgg tgctctccgg cgacgacgaa ccggtgaccg cgctcgcgca gaccttcgcg 70980
gggcagggcc ggcgcaccag acggctgacc gtgagccacg ccttccactc cgcgtggatg 71040
gagccgatgc tggcggactt cgccgaggtg ctgggctccg tggagttccg tgcaccgcgc 71100
atccctgtgg tgtccaacgt gaccgggcag gtcgcgggcg aggagctggc cacccctgat 71160
tactgggtgc ggcatgtgcg ggaggcggtc cgattcgctg acggggtgac caccgtgctg 71220
gggcggggtg tcgacaagtt cctggagctg ggcccgggtg gcgcactgac cgcgatggcc 71280
gaggaggcgc tggaccacac cggtaccgac gccgtctgcg cccccgtcct gcaccccgag 71340
catcccgaag cgtcgagcgc cgtccgtggc ctcggacgga tctacgccgt cggcgccccg 71400
gccgactggt ccgcgctctt cgccggtacc ggcgcacgcc gtgtcgacct gcccacctac 71460
gccttccaac gacggcgctt ctggctcgac tcgctcgcta ccggtagcgg cgatccggcg 71520
agcctcggac tcacgaccac cggtcatccg ctgctcggcg ccggcgtgag gctgcccgat 71580
tcggacggct tcctgttcac cggcagactt tctctggcca cgcagccgtg gatcgcccag 71640
cacgcgctgc tgggcaccgc gctgctgcct ggtaccgcgt tcgtggagct ggcgctgcgc 71700
gccggcgccg agtcgggctg cgaggtgatc gaggaactca ccctggaagc ccccctggtg 71760
ttggaggagc atggcggtcg cgcggtccac gtgacggtcg gcgggctcga cgagtccggc 71820
cggcgcacga tcacgctcca ctcacggccc gacggcgcgg acgacgacga gtcctggctt 71880
cggcacgcca ccggcgtact ggtcgagcgg cgcgagacgg agtccgccga tgcgccgacg 71940
gagggtgtgt ggccgcccga cggcgccaca cagatctccg tccaggactt ctacccggac 72000
atggccgagg ccggattcac ctacgggccg gtcttccagg gcctgcgagt cctgtggagc 72060
aaggacggcg agctgttcgc cgaggttcgg ctgccggacg aggcgggcga ggcgggcgat 72120
gagggcagcg ggttcggtgt gcacccggca ctgctggacg cggccctgca gcccctcgcc 72180
ctcagtgtcc tcggcgggac ggacggccgg caaccggtca agggcggcat gcccttcgtc 72240
tggaccgggg tccggctgca cgccacccac gccacggtcg cccgggtcaa gctggccccg 72300
gtgggacgca gcgaggtgtc cgtcgtggtg accgacgact cggggctgcc gatcgccacg 72360
gtcgactcgc tggccatgcg cgacccgatt ctggaacagt tcactgcctc cgcgccccgg 72420
caggatgcgc tgttcggcgt gcggtggacg cccatacccc tcgcggcgca cgctgagccc 72480
ggtgagtggg cgatgctcgg cttcgacccg ctggagatcc gccagcgtct cgtcgaggcc 72540
ggcctcaccg gtacgccgta tctcgatccg cagtccctga tcgacaccgt ggaatcgggc 72600
aagcccgttc cgccagtcgt ggcggtgtcc tgcttcggcg gtgggggcag taccgtcaca 72660
gccactcacg aggccgtcgg acgggctctg ggagtgcttc agcactggct cgcggacgcc 72720
cgcctcatga gttcccggct ggttctactg acccgaggtg cggttccggc cgtcgacacc 72780
gaccggatcg aggacctggc ggcctcggcc gtctggggtc tggtgcgggc ggcccagtcc 72840
gagcatccgg accggatcgt gctcatcgac ctcgatgacg accccacgtc gtaccgggcg 72900
ctgcccgcgg ccctcggcac cggtgaacca caactcgccc tgcgcacggg cgccgccagc 72960
gcgcctcgcc tggcccggca caccggcgcg ccggaggtca ccccgggctt cggccctgac 73020
ggcaccgtgc tggtcaccgg gggcaccggg gcgctcggcg cggtcgtcgc ccggcacctc 73080
gcggccgcgc atggcgtccg gcacctggta ctggccagcc gcagcggagc cgaagcttca 73140
ggcgcggacg cgctgctggc cgacctgacc gagctgggcg ccgacgccac gatcgtggcc 73200
tgcgacgtct cggaccgcgc cgcgctggcc gctctgctgg acgccatccc agccgagcgg 73260
ccgctgaccg gcgtcgtgca cacggcgggg gtactggcgg acgggacagt cgagtccctc 73320
accccggacc aggccgacac ggtgctgcgg gccaaggccg acgcggcctg gcatctgcac 73380
gaactgaccg cgctcacgcc ggtgcgggag ttcgtcctct tctcctccgc cgccggactg 73440
ctgggcagtc aggggcaggg caactacgcg gccgccaacg ccttcctgga cgccctcgcc 73500
gcccaccggc gagccgcggg actggccggt acctcgctgg cctggggctg gtgggacctg 73560
cccggcggca tggccgcgga cctcggccgt gccgaacgcg cccggatggc ccgtggtggg 73620
ctcaccccct tcacagccga gaccggaatg gacgccttcg accagaccct cgccgccggc 73680
accgagcccc tgctcgtccc gatgcgtatg aacaccgcgg tggcgcgggc ttcggccggg 73740
cagcagatac cgtcggtgct gcgcgggctg gtccgggccc cccggcgacg ggccgtccga 73800
tcggacgagg ggagcgcctc gcggctgcgc gagcggctgg ccggagcgaa cgcggacgag 73860
cggctggcca tgctcaccga gctggtccgt gtcgaggccg ctcaggtgct cgggcacagc 73920
ggggccgagg ccgtcgagga cggcagcagc ttcgccgagc tgggcttcga ctcgctcacc 73980
tcggtcgagt tgcgcaaccg catcggcgag cgcacaggac tgcggctggc gtccacggtc 74040
gtcttcgacc accccacacc ggccgccctc gccgccgaac tcggtgaccg gctgggcgat 74100
acggccgact tcgtgtcggc cgcgcagccg tccgaggccc ccggagccgg cggctccggc 74160
gtcgagacga ccgcggacac ggcggtgatc aacggggtgg aggcgctcta ccggcgctcc 74220
atcgagctgg gccggctcga cctggggcac agcgtgctga agaactcggt cgacctgcgg 74280
gcgagtttct ccgttcccga cgaggtccgg aatggaccgg agctcgtcag gctcgtcgag 74340
ggagcacagc acccgaagat catctgcttc ccgtcgcagt cggtgtgggc gagcaaccag 74400
gaactggtcg gcatggccgt accgctgcgc ggagtccgtg acctgtggtc cctgatgctc 74460
cccggcttcg tgaccggcca gcccgtcgcc gccgatgtgg acgcggcggc cgagtacgcc 74520
gtacgactca tcgaagaact ggtccaggac gagcccttcg tcctggccgg gcgttcctcc 74580
ggcggcagga tcgcccatga ggtcgccgtc aggctggagg gacgaggccg tgccccgaag 74640
ggactggtgc tgatcgacag ctacatggcc ggctatgagg cgacttccta catcacgccg 74700
gtgatggagt ccaaggccct ggagctggag aaggacttcg gtcagatgac cgggacccgg 74760
ctcaccgcga tggccgccta cttcgccatg ttcgaggcat ggcagcctga ggagacctcg 74820
gttccgacgc tgctggtgcg ggcttcggag cgttacggca tcgagccggg gcaggagcag 74880
cccccggccg aggaatggca gtccgcctgg ccgctgccgc acgacgcgat cgacgtgccg 74940
ggtaaccact actccatgat cgaaggcagc ggggacgtca cggcggcggc cgtgcaccgg 75000
tggctggtgg agcgtgacgc gtaggaccgc tcaccacgac gggccgtgct ccggcaacgg 75060
gagcatggcc cgtcgcacgc gtgcggaggc ggcgccgccg acgccggacc cgccggacga 75120
aagaagacga cgggcccagc aggtgtcggc ctgctgggcc cgtcgtccgt gcggtggggc 75180
ggatgccgtg tcaccaggtg atgggcaggc tctccagtcc gcggaggatg gaagccggct 75240
tcagccggag ttccgactcg ggcaccgcga gccgtacctc cggcatccgc tccatgatgg 75300
cccggaacgc ctcctggagt tcgatacggg cgagctgcgc gccgaggcag tagtggatgc 75360
ccgtactgaa ggccaggtgc gggttctgct cacgcgccag gtccagccgg tccccgtcct 75420
cgaacacgtc ggggtcacga ttggcggtgg cgaccgcggg caggaccacg acaccggcgg 75480
gcagcacctt gccgttgctc agctccacct ccgcggtggt gagccggggg gtgatgccgc 75540
cggtcgcggt gagcgggacg aaccgcagca gctcgtcgat ggccttcgga agcgcctcgg 75600
ggttcgcccg cagcttgtcg aactcctcgg gatggtgcag cagggtgagc aggaacatgc 75660
tgatcaggtt ggcggtcgtt tcgtgacccg cgctgaggat gccgatactc agcgtgatga 75720
tctcacgctc ggtgagcgta ctgtcttctt cttcgctgac cgcgatcagc tcgctgatca 75780
tgtcgtcggc gggcttctgc cgcttgaccg cgatgaggtc accgaagtag ttgaccaacg 75840
cgaccgtcgc cgcttccttc tccgcgacct gatgccagtc gccgagcaac gcgttcgacc 75900
aggcgtgaaa cgtgtcctgg tcgccggccg gcacgccgag gagctcgcag accacgcgga 75960
ccggcagcgg aacggcgaag ttcttcacca gatccaccgg acggggcagg gtctggagct 76020
cgtcgaggag ttcgaccacc agctccacga tccggggccg cagctgctcc acccgccgtg 76080
cggtgaaagc cttgctgacc agcttgcgca gccgggtgtg ctccggcggg tccatgccga 76140
cgagagattc gttcatcagc ttgccggtct cggtctccga catggcggcg gccgcggtcg 76200
cgatgacccg gctgctgaac cgggggtcca ggagtacctt gcggacatcg gcgtgcttgg 76260
tcaccatcca gccggtgatg ccgtcgggaa acttcacctc gaccacggac tcgccgtccc 76320
ggacctcggc cagctccggg ggcagttcac agaccgaggg cgggtccggg aacgggaacg 76380
gtatgggttc ggaaggcgct tcggccatgg atggctctcc agattcgtga gggtttctcg 76440
ggcgcggcgg aacgacgcgt gggggtgggc aggccgacct tcctcgcagg ctatgcacga 76500
tcggccccta caccctcccc ctagctcgcc ccactcgcgt gacgtgcccg gtaccgagcc 76560
ccgtcaccgc gtgctggtac tcctggccat cgcgagcagc gccgtcacgt ccatggactt 76620
gatctcctcg ctgcggtcgg ccacctccgg ctccgccctc tcgtcgccgg tcagcgccag 76680
cagcgcctcc agcagaccgt gttcgcgcag ccggtccagt gggacggaca tcagggcgcg 76740
gcgtacggcc gcttcctgct cggcctcggg cgtcgtggcg ctctgcgctc cgtcgggggc 76800
cagccgggcg accagcaggg ccaccagtgc gccgagcgtc ggatagtcga agaccaggct 76860
gacggggagt cgcagaccgg tggccgcgtt gagccggttg cgcagttcca cggcggtcag 76920
cgaggtgaag ccgaggtcgc ggaacggccg gtcggtctcg atctggtccg aggacgcgtg 76980
ccccagcacg gcggcgatcc ggtcacggac cagcgccagc agcgtctcct gacgttcggc 77040
gggccccatc tccgcgagcc ggtgccgcag ttggctctcc ggggcctggc tcccggtctg 77100
cgccgaccgc tgggccggga cccggaccag gccgcgcagc actggcggca gggagccggc 77160
ggcggcctgc gcgcgcaggg cggcgctgtt gaggcggacg gggaacacga cgggctcggc 77220
gcggcgccag cccaggtcga acaaggcgag gccctggtcc gccggcattg cggacacccc 77280
gcccatcgga gagcccgccg aggccccctc ggcgaggtgg ctgtccatac cccgctcggt 77340
cgcccacagc ccccagccca gtgaggtggc cgcgagcccg ttgcggcgtc tgtactgggc 77400
gagagagtcc aggaaggtgt tggcggccgc gtagttgccc tgtccggtgc cgccgagggt 77460
gcccgcgatc gaggagaaca gcacgaaggc ggacaggtcc agctccgctg tcagctcatg 77520
gagatgtacc gcggcgtcca ccttggcgcg gaacacccgg tcgatcagct ccggcgccag 77580
cgaggccagc agcgcattct cggcagtgcc cgcgcagtgc accaccgcgg tcagcggatg 77640
cccggcggaa accccgtcga gtacggccgc cagcgcagcg cggtccgaca cgtcgcaggc 77700
caccagcgtc accgtggccc ccagctcctc caagtcgtgg acgagttcac cggcgccctc 77760
ggcatccgga ccctgtcggc tgagcagcag cagatggtgt acgccgtgca gctttgcgag 77820
acgggtggcg accagcgcgc cgatcccacc ggtcgccccg gtcaccagca ccgtgccgga 77880
cgggtcgagc gcgtcgagct ccgtcccggg cacggtgtcc gggacgtcgg cggccggcgg 77940
ccggttgagg cgaggcatca gcgcgacccc gtcgcgcagg gcgagttgag cctcccccga 78000
cgcgaccgcg gcccgcaccg cccgcccgga ctcgacgctc ccgtcggtgt ccagcaggac 78060
cagccggccg ggattctccg actggatcga acgcaccagc ccccacagcg gcgcggagcc 78120
gagcccggcc acccggtcat cggctgagat gcccacggcg tccgtggtcg tcaccaccag 78180
cggtatcgag gcgagccgct cgtcgagttc ctctgagacc caggtgcgcg ccgcctccag 78240
caccagcccg gcgcgttccc gggtaagcgc cgggagctcc gcgaaccccg gaccggccgg 78300
ttcaccgtcg tccggtgtga gtacgacgaa cgcgggcacc ggatcacccg cggcgaccga 78360
cgcgctcagc gccagcaggt ccgggtggtg cacggcaccg ctgtcggggc ctgcccatgc 78420
cggcgccgtg cccaccaccg cccacgccgc cgtcttgacc gtgggcagcg gcaccggtgt 78480
ccacaccatg ctcagcagcg aggaatgcgt ggcggtccgc ccggaccctg gatcgccgtc 78540
ggtgaggtcg gccggccgca gcagcagcga accgatcgag acgaccgggg tgcccgtgcc 78600
gtccacggca cgcagcgaca cggcgctctt gcccgctggg gagagggcta cccgcagggc 78660
ggtggcgccg gtggcctgca gggacaccgc gttccaggcg aacgggcgca gcggtgcgct 78720
cggggcgtcc agttcaccga ggaagcccac cgcgtgcagg gccgcgtcca gcagcgcggg 78780
gtgcagcagg aactcgcccg ccgcctcgtg cagctcctgc ggcagctcga tctcggcgaa 78840
gacctcccgg ccacgggacc aaacggcccg cagcccccgg aaggcgggcc cgtaggcgag 78900
cgggccgccc gccaggtcct cgtacaggct gccgatgtcg atccgctcgg cgccgcgcgg 78960
cggccagtcg gccggttccg ccttcggtgc cggaccggcc gggcacagga cacccgtggc 79020
gtgccgcgtc cactcggcgg ccgtgaactc ttcgtccgtg cgggcgaaga agccgacttc 79080
gcgccgcccg gcgtcgtccg gggcgcccac agtgagctgc aggtccacac cgccggtctc 79140
gggcagcgtg agcggagtct gaagggtcag ctcctcgatg taggggcact cggcgagatc 79200
gcccgccacg gcggccagtt ccacgaacgc ggttcccggc agcggaaccg tccccgccac 79260
tctgtggtca gccagccagg gatggtcacg cagcgaaaga cggccggtca ggacgaggcc 79320
gtcggactcg gcgggctcca cggcggcccc gagcagcgga tgcccggggg aacgcagacc 79380
ggcggcagtc acatcaccgg atccggacac cggcatacgc agccagtagc gctggtgctg 79440
gaaggcgtac gtgggcaggt ccaccggccg tgcaccggtg ccctcgaaca ccaccgacca 79500
gttcacgggc gcgcccgcga cgtacgcctc ggccagtgac gtgagcagcc gcccggcacc 79560
cgcgtcgtcg cgtcgtatgg tcccggacac atgggccgag acccccgcac cgtcgatgat 79620
ctcctggagg gccatcgcga cgacagggtg aggactgacc tccagcaggg tccggtgtcc 79680
ctcgtcgagc gtggcccgca ctgcggcctc gaaggcgacg gtctcccgca tgttccgcag 79740
ccagtacccg gcatccagcc ccgccgtgtc catccgctcc ccggacaccg tcgaatagaa 79800
cgccaccgac gacgacaccg gccgaatacc cgccagcccc gccagcagat catcaccgag 79860
cgcatccacc tgcaccgaat gcgacgcata atccaccgcg atccgacgcg cccgcacccc 79920
cgcaccctca aaccccgcca acaacccctc caacacctcc gcctcacccg aaaccaccac 79980
cgacgacggc ccattcaccg ccgccacccc cacccgaccc cccaacccgg ccaaccactc 80040
ctccacctca cccaccgaca accccaccga caccatccca ccccgccccg acagcctctc 80100
tcccaccagg cgtgaccgaa ccgccaccac ccgggcacca tcctccaacg acaacccccc 80160
cgccacacac gccgccgcaa cctcaccctg cgaatgcccc accaccgcag ccggctccac 80220
accaaacgac cgccacaccg ccgccaacga caccatcacc gcccacaaca ccggctgcac 80280
cacatccacc cgctcccaca acccatcacc cccacccccg aacaccacat cccgcaacga 80340
ccaatccaca aacggcacca acgcccgctc acacgccacc atcgactcag caaaaacaga 80400
cgaaacctcc cacaactccc gccccatccc cacccactgc gacccctgac caggaaacac 80460
aaagaccaca cccataccgc cgtcaccagc agcgcgccca ctcaccaccc cggcccctgg 80520
cacgccctcg gccaccgacc gcaaccccgc caacaactcc tcccggtcac cccccagcac 80580
caccgcccgt tgctcaaacg ccgaccgcgt ccccaccagc gagaaaccca catccaccgg 80640
acccagcccc ggccgctcca caagccactc caacaaccgg ccagcctgcg cccgcaacgc 80700
cccctcacca cgagccgaca ccacccacgg caccgacccc atcagcacag gacgatccgc 80760
ctcgtccacc tccaccggct ccgcctccgg agcttgctcc aggatcacat gcgcgttggt 80820
accgctgacg ccgaaggagg acaccccggc tcggcgcgga cgccccgtct ccggccactc 80880
ccgcgcctcg gtcagcagcc gtaccgcacc cgaggaccag tcgacgtgcg gtgtcggctc 80940
gtccacatgc agcgtcttcg gcatgacgcc gtgccgcagc gccatgaccg tcttgatcac 81000
accgctgacg ccggccgcgt actgggtgtg accgacgttc gacttgatgg aacccagcca 81060
cagcggccgg tcggccgggc ggtcctggcc gtaggtggcc agcagggccc gtgcctcgat 81120
ggggtcgccg agcgtcgtac ccgtgccgtg cgcctcgacc atgtcgacgt ccgcggtggc 81180
caggccggcg ttgtccagga cccgcaggat gaggtggcgc tgggccggcc cgctgggcgc 81240
ggtgaggccg ttgctggcac cgtcctggtt gatgccggtt tctcgcacca cggccagcac 81300
cgggtgcccg gtccgctggg cgtcggagag ccgctgaaga accagtacgc cgacgccctc 81360
gccccagccg gtgccgtcgg cgtccgagga gaacgccttg cagcggccgt cggatgccag 81420
cccgccctgc cgggagaact ccgggaacgc gccgggcgtc gccagtactg tgaccccgcc 81480
ggccaaggcc agcgagatct cgtccttgcg cagcgcctgc accgccagat gaagggagac 81540
cagcgacgac gagcacgccg tgtcgacggt gaccgcggga ccttccaggc cgagcgtgta 81600
cgccacccgg ccggagagga cgctgctgga tgtgccggtc atcagatagc cgtccagcgc 81660
gtgcggcgac gcggcgagca ggccggcgaa atctccgctc gtgccgccga agaacacccc 81720
ggtgtcactg ccccgcaagg accgtggatc gatcctggcg cgctccagcg cctcccagga 81780
ggatgccaag agcaaccgct gctgcggatc ggtggtgagc gcctcgcgtg gtgcgatgcc 81840
gaagaaggcc gcgtcgaact cgccggcgca gcgcaggaaa ccaccttcca gcggggtcga 81900
ggtgccggcc cccgcggcat cggtttcgta cagcccgtcc aggttccagc cgcggtcgtc 81960
ggggaagccc gccacggcgt ccgcttcctc cgacagcatc cgccacacgt cttccggcgt 82020
gtccgcaccg cccggcagcc ggcacgccat gccgatgatt gcgatcggct cgcgggcccg 82080
gtcctcgacc tcacgcagcc ggcgctgaag cccacgtgcc tcggtgacgg cccgcttgag 82140
gtattcgcgg tacttctcgt cgtcggacat gacgttctca gctccttggt gatgtgttcc 82200
cggcggaatc gcgccgatcg ggcaagacct gacccggccg gtcagagacc gaagccctgg 82260
tcgatcgcct cgaagaggtc gtcggtggtc gcggtggcga ggtcgtccag ggcggccggt 82320
ccggtatccg gctcacgcca ctccgacagc agcccgcgca gccgtcgcac gacccggtcg 82380
cggtcgccgt tgtccgcgtc gatcgtcttg agggcagcct ccagggcgtc cacctccgcg 82440
agcgcggcct cgacgccgga cagcccggcc gggacgagag cggtgagcag ttccgtggcg 82500
agcgcggtcg gcgtcgggtg ttcgaagacg agcgtggccg gcagacgcag tccggtggcg 82560
gcaccgagcc ggttgcggaa ctccacggcg gtcagcgagc tgaagccgag ctcactgaag 82620
gaccgctccg ccgtgaccgc ggccaccgag gcgtgcccca gcaccgtcgc cgcgtgggag 82680
cgcaccagtt ccagcaccct gcggtgctgc tcgacggcgg gcagccgcgc caggcgcgtg 82740
cgcagcccgg aaccgttgct gccggaccgc gccgtacgcc gcgaggtccc ttgcaccaga 82800
cctcggagga gatgcggcac ctcgtccgcg ccggaccgca gcgtcgacgg cgtgagccgt 82860
gtgatcacct gcaacggcac gtccgccccg caggccaggt cgaagagggc gagcgcctcc 82920
gcgtcctcca gcggtaccac gccggctcgt tccatgcggt gcagctcttt gtcgttgagg 82980
tgtccggtca tggtgctgcg ggtgttccac aggccccagg cgagggaggt ggcggggagg 83040
ccgtgggtgc ggcggtggtg ggcgagggcg tcgaggtagg tgttggcggc ggcgtagttg 83100
gcttgtcctg gggcgccgag ttgtccggcg gcggaggagt agaggatgaa ggtggtgacg 83160
gggtggttga gggtgaggtg gtgcaggtgg gtggctgcgt cgatcttggg gcgcaggacg 83220
ttgtggagtt gggtgttggt gagggtggtg atcgttgcgt cgtcgaggat gccggcggtg 83280
tggatgacgg tggtcagggt gtggtgggag aggagtgcgg cgagttggtc ggggtcggcg 83340
gtgtcgcagg cggtgatggt gacgtgggcg ccggccttgg tgagttcggc cgtgagttcg 83400
gtggcgccgg gggcgtcggg tccgcgtcgg ctggtgagca gcaggtcggt gacgccgtgc 83460
tggtggacga ggtggcgggc gaggagagcg ccgagggtgc cggtgccgcc ggtgatcagg 83520
gcggtgccgt tcggatcgat cccccgcggt acggtgagga cggccgggcc ggccggcgtc 83580
gcccggttcg catgccggga tgcctcccgg gcacgccgca ggtcgcggac ggtgatccgc 83640
ggcggcgtca gcttcccgtc gtggcacagc gacatcaccg cgaccagcat ttcccggatc 83700
cggtcgggat cggcctccgc caggtcgtag gtccggtagg ccacgcccgc gtgggcctcg 83760
gccacctgcg ccgggtcgcg cctgtcggcc ctgcccgtct ccaggaaccg gccgccgcgc 83820
ggcagcaggc gcagcccggc gtccacggac tcaccggcca gacagttcag gacgacgtcc 83880
accccgcagc cgccgcttgt ctccatgaac cggtgttcga actccggcgt tcgggagtcc 83940
gcgatgtgtg catcgtccag cccgtacttc ctcaacgtcg gccacttgcc ggtggcggcg 84000
gtgccataca cctcggctcc cagttggtgg gccagctgca cggccgccag ctgggagccg 84060
tccgccgcgt cgtgtaccag caccgacatc ccgggctgta cgtccgccag gtccaccagg 84120
ccgtagtaag ggatgagata ggtgagggga acggccgccg cctgctcgta ggaccagccg 84180
tcgggcatgg gggccaggca gcggtggtcg tacaccgcga agggcccgga cgagccggag 84240
cgcagtgcca gcacccggtc gcccacggcc agatcggtca cctcggcacc cgtctcctgc 84300
accgtgccgg ccgcgtagcc gctgatgccc ttgctcccct cggccgggtc gagcgaggcc 84360
acgacgtcct ggaagtcgag cccgaccgcg tgcaccgcga tccggacctg accgggcccg 84420
agcgggtcca gaacctccgg gcagggcacc agcgccatgt cctcgatcga gccgcggcct 84480
gtgctctcca gccgccacgg caccggcccg gacggcggca gcaacgccgg atgcgaacgt 84540
gcccgggcca gccgaggcgc ggcgactgtg ccgcgacgaa cggcgatccg gggctcggca 84600
caggccagcg cgtccccgaa gacctctcgt gacgtgtcca gcccgtccag gtctaccagg 84660
aagaagcggt ccggatgctc ggtctgcgcc gagcccacca gaccccagct ggtggagttc 84720
gccaagtcgg tgacgtcctc ggcgtcgccc gcggcgaccg cgcccgaggt gacgaggacc 84780
aggcgcgcgg aggcgaaccg gtcatcggcc agccaggcac ggatcacgcc gagcgtcgcc 84840
tcggccgccg cgtgcgcgtc agcggcgggg tcatccgaga cacggggggc gagacagcgg 84900
accacgatac ccgggaccgg accgttcgcg gccagttcct cgaaggtggc cgccacccgc 84960
agtgccgcgc cggtcagccc gaaggggtcg tcgccgacga gggcatagga ctgagcggcc 85020
gccacgggca gagccgtcca gtcgagatgg agcagtgcct gctccagcgt gccgcccgag 85080
ggctggtcga acacggcggg tttggtcacc acggatccca ccgagacgac cggacgcccg 85140
gcctcgtcgg tgaccagcag ggcgagctgg tcccccgagg gggtaagacg gacccgagcc 85200
gtcgtcgcgc cggacgcgta cagccgcacc gcggcccagg aggacggcag ccgtacgccc 85260
tcgtcccgcc cgccgacaaa cgccatggcg tgcagcgcgg catccagcag ggccgggtgg 85320
acgccgtagc gcgtgccctc ctcccgctcc gactcgggca accggacctc ggcgaacagc 85380
tcgtcatgct gccgccacag cgcgtgcagg cactggaagg cgggtccgta gtggtagccg 85440
ctcgcggcga gctcctggta gacaccgtcc gtctcgactg cctctgcccc ggccggcggc 85500
caggacggca ggggagccgg ctctgcgggc gcgtcggcac tcagggtgcc ggtcgcgtgc 85560
agcgtccagt cgccgtcact ggtgcgggaa tgcagggtga ccgtgcagcg gcccgagtca 85620
ctgtccgggg tggcccggac ctgcacggtg acctcgtcct cggggccggc gcccagcaac 85680
agtggtgtgt gcggggtcag tacctccaaa tacgggcggt tcaggagatc gcccgcccgg 85740
atcaccaggt cgacgaaggc gctgccgggc agcaggacct ggcccaagag cgcgtggtcg 85800
agcagccatg gctgggcgcg cccggacaac cggccggtca gcagcacccc ctcgccgtcg 85860
gccagcgtca cagccgctcc cagcagcgga tggtctgccg cctgaagccc catcgcggtc 85920
aggtcaccgg cgccacgacg gtgttccagc cagaaccggc gtcgctggaa ggcgtaggtg 85980
ggcaggtcga cgcggcgtgc gccggttccg gtgaagaacg cggaccagtc ggccggggcg 86040
ccggcggcgt agatccggcc gagggcatgg acgacgctct gcgcttcggg gcgttcggga 86100
tgcaggatgg gggtgcagac ggtgccggtg ccggtgtggt ccagcgtctc ctcggccatc 86160
gcggtcaggg aacctcccgg acccagttcc aggaacttgt cgaccccgcg ggagtgggcg 86220
gtggtgaccc cgtcggcgaa acggaccgct tcccgcacat gccgtaccca gtaggcaggg 86280
gtggccagtt cctcgtccgc gatctgcccg gtcacgttgg acaccaccgc aatccgcggc 86340
cgacggaact ccaccccagc cagtacctcg gcgaactgag ccagcatcgg ctccatccgc 86400
gccgagtgga aagcatgacc gaccgccagc cgcttgatcc gccgcccccg cccggccagc 86460
tcctgcgcaa cagcggtgac cggctcctcg tccccggaca gcaccagtga acgcggcccg 86520
ttcaccgccg ccaccgacac ccccgccggc agctccggca actcctcctc cgccgcctgg 86580
accgcgacca tcaccccacc cgacggcaac gcctgcatca gccggccccg cgccgccacg 86640
acccggcacg cgtcctgaag cgaccacacc cccgccacat acgccgccgc cagctcaccg 86700
atcgaatgcc ccagcagcac atcggcccgc accccccacg agcccaccag ctcgaacaga 86760
gccacctcca acgcgaacag accagcctgg gtccattcgg tccggtccag cacctcacca 86820
ccggcgaaga ccgcatcgcg caacgacccg ggcagcatcc cgtcgaaccc ggcgcacacc 86880
tcgtcgaagg cagccgcgaa caccgggaac gcctcataca gctcccggcc catactcgcc 86940
cgctgcgagc cctgcccgga gaacaggaac gccgtcccgc catccgtgac cgacccgccg 87000
gccacccggc cctcggccag cgcctccaac tgtgccagga gctcggacgt ctcccgcccg 87060
accagcaccg cccggtgccc cagccccgcc cgcgccgccg ccagcgaaaa ccccacatcc 87120
accggacgca gcccatgccg cgcaacatgc ccggccaggt tccgtgcctg ctcccgcagc 87180
gcctgcgccg accgcgccga gatcacccac ggcaccagca ccgaaccggt ctcaccaggc 87240
gcgctctcct cgacagtctc cggcgcctgc tccaggatca cgtgcgcgtt ggtcccgctg 87300
atcccgaacg ccgacacccc cgcacgacgc ggacgccccg tctccggcca ctcccgcgcc 87360
tgggccagca actccaccgc acccgaagac cagtccacct caggagtcgg ctcctccaca 87420
tgcagcgtcc tgggcagcac cccgtgccgc atcgccagca ccgacttgat cacacccgcc 87480
acacccgcgg ccgcctgcgt atgaccgatg ttcgacttta ccgaccccag ccacaacggc 87540
cggtccaccg gacgatcctg cccatacgtg gccagcagcg cctgagcctc gatcggatca 87600
cccagcgccg tacccgtccc atgcgcctcc accacatcga catccaccgc ggccagcccc 87660
gcgctcgcca gcgcctgacc gatcacccgc tgctgcgacg gaccgttcgg cgccgtcaga 87720
ccattcgacg caccgtcctg gttgaccgcc gatccacgca ccaccgccag caccgggtgg 87780
ccgtggcgac gcgcatccga caaccgttcc accagcagca tcccgacgcc ctcgccccag 87840
ccggtgccgt cggcaccggc ggcgaacgcc ttgcaccgcc cgtccggcgc caagccgccc 87900
tgccgggaca gttccacgaa caccccgggc gtcgccatca cggtcacccc cccggccagg 87960
gccaggtcgc actcaccgcc ccgcagcgcc tgagccgcaa gatgcagcgc caccagcgac 88020
gacgagcacg ccgtgtccag cgtgaccgcg ggtccctcca gacctagcgt gaaggccacc 88080
cggccggagg cgacactgcc cgcggacccg gtgccgatga acccttcgac ctcgtcgggc 88140
agggtgccgg ggccggaacc gtagtcgtgg tacatcaatc cggcgaacac gccggtccgg 88200
ctgccgcgca ccgcgcgcgg atcgatgccg gcccgctcca gcgcctccca cgacgtctcc 88260
agcaacagcc gctgctgcgg atccatcgcc agcgcctcac gcggcgagat cccgaagaac 88320
tccgcgtcga agtcggcggc ctcgtacagg aacccgcccc gccgcacata cgacgtcccc 88380
ggccgggcca gttccgggtc gtacagctcc tccacatccc agccgcggtc ctcgggcatg 88440
ccggagatcg cgtctcgctc ggcggccacc aactcccata gcgcatcggg agtagccacc 88500
ccgccggggt agcggcaggc catgcccacg atcacgatcg gatcgtcgtc cgtcgccctc 88560
agctccggca gggcgacctc ggcaccacgg tcgaaaagct tctcgtgcag gtgttcggcg 88620
agccgctggg gcgtggggtg gtcgaacacc agggtggcgg gcaacccgag cccggtcgcg 88680
gcccccaccc cgttacgcag ctcgatcgcg gccagcgagt cgaagcccag ctcccggaag 88740
ctccggcgcg cttccagacc gtcggtgttc gcatgcccca acgcccgggc cgcgtgcgtc 88800
cgtacgaggt cgagcagcag tcgcaggcgt tccgcctcgg gcagctctcg cagcccggcc 88860
gcgaacgtca tacccgcata ggccccctcc gctgcgttgg ccgcccggcg caccgggacc 88920
cgcacgaacc cacgcagaag cggcggcaaa gtgcccgccg cggcctggga gcgcagcgcg 88980
ccgggctcca gcggcagtgg cacggcgacc ggcgcgtcca gcccgactgc cgcgtcgagc 89040
agcgccatcc cctgctcttc ggagataggt accacgccgg ctcgttccat gcggtgcagc 89100
tctttgtcgt tgaggtgtcc ggtcatggtg ctgcgggtgt tccacaggcc ccaggcgagg 89160
gaggtggcgg ggaggccgtg ggtgcggcgg tggtgggcga gggcgtcgag gtaggtgttg 89220
gcggcggcgt agttggcttg tcctggggcg ccgagttgtc cggcggcgga ggagtagagg 89280
atgaaggtgg tgacggggtg gttgagggtg aggtggtgca ggtgggtggc tgcgtcgatc 89340
ttggggcgca ggacgttgtg gagttgggtg ttggtgaggg tggtgatcgt tgcgtcgtcg 89400
aggatgccgg cggtgtggat gacggtggtc agggtgtggt gggagaggag tgcggcgagt 89460
tggtcggggt cggcggtgtc gcaggcggtg atggtgacgt gggcgccggc cttggtgagt 89520
tcggccgtga gttcggtggc gccgggggcg tcgggtccgc gtcggctggt gagcagcagg 89580
tcggtgacgc cgtgctggtg gacgaggtgg cgggcgagga gagcgccgag ggtgccggtg 89640
ccgccggtga tcagggcggt gccgttcgga tcgatccccc gcggcggccg agggcttccc 89700
gccgatgccc gcgtcagccg cggcgccgtg gccctgccct cccgcagtgc cagccgtggc 89760
tcgtccaccc cgaccgtcga agccaacgcc gccagggact cggccgtacc gtcgaggtcc 89820
acgagcgcga tccggccggg atgttcggtc accgccgagc tgagcaaacc ccacaccgcc 89880
gcctgtaccg ggtccggtgc gtcgccatcc gccacggccg ccgcgttcca ggtcaccatc 89940
accagccgcg cggtgccgag ccgttcgccg gcgagccagg accgcagcag agccagggcc 90000
cgctcggcgg cctcgtgcgc cgcgtcggcg tcgacggagc cgtcagacgt gccgaccgtg 90060
ccgacgatga agtccggcac ctcggccccc gcctccaccg cggcttcgag cccggccaga 90120
tcggcatgga cgtcggcgtg gggcaggccg gccagcggcc ccggacccag cacagcccag 90180
cgccccggcg tctcgcagcc gccgaccggg acccaggacg gcatgagcag cgcgtccacg 90240
gccccgtcca gggcctcgac cgctccggcg ggccgcaccg ccagcgactc caccgtggcc 90300
acggtgcgcc cggccgcgtc ggcgaagagc agtgcgaggg tgtcgggacc agtgggggcc 90360
agccgtaccc gcaggaacga ggccccggcc gcgtgcagcg acaccccgcg ccacgcgaac 90420
ggcagcctgg cccgctgctc cctgccgaac tcacccggca cgaatgccga cgcgtgcagc 90480
gcggcgtccg caagcgccgg atgcaggcag aactccccgg cgtcaccggt gacgtcctcg 90540
ggcagccgga cctcggcgta cacctcctcg ccgtgccgcc acaccgcgcg cagcccctgg 90600
aaggccggcc cgtagcgcac accggccgcc gcgaagtccg cgtagcagcc atcggtgtcc 90660
agtttctcgg cccccgccgg cggccacacc gtcagctcct ccgcggaagc cgtacggacc 90720
gcggtgaggg tgcccccggc gtgccgcgtc cacacgtcgt cggcggaccc ctcgggctgc 90780
gtgtacagcc cgaacccgcg gtcgccggag gcgtccgggg agtccacggc gagctgaacg 90840
cgcagcccac cgccgcgcgg cagcgccagc ggaacctcca gaacgaggtc tgcgacatgg 90900
ttgagcccga cgtgccgggc cgcccacagc gccagttcgg ccacggcggt gcccggcagc 90960
aggacagcgt cgtcgattgc gtgatccgcc agccacggct gcgcctcaac ggacagcctg 91020
ccggtgaaca gacggccgcc gccgaccgcc ggcaccagcg tggcccccag cagcggatgt 91080
tccgcggcga ccaggccggc cgcgcccatg tctcccgccc cgggggcggt ctccagccag 91140
taccgggtcc cctggaaggc gtacgtcggc agcggcaccc gcgacgtcgg gcgaccgccg 91200
aagaaggtcc gccagtcgat cggcacaccg gccgtgtgag cctgggccag catggccgtc 91260
agtgtgcgga cctcggggcg ctgacggcgc agtgccgggg cgaccaccgt gggggtaccg 91320
gaggcggcga gggtctcctc ggtcatggac gtgagcgcgg cgtccgcacc cagctccagg 91380
aacacggtga cctcctcggc gacgagggtg tgcaccacgt cgtggaaccg gacgggctcg 91440
cgtgcctgcc gagcccagta cgccggatcc gcccagtcgg cggcgtccag gacggtgccc 91500
gtcacactgg acacgaccgg gatgcgaggc ggctcgaacg acagccgtcc ggccacctcc 91560
ccgagcggct cgaccaccgg atccatcagc ggggagtgga aggcgtggct caccggtatc 91620
cgccgggtcc tgcgacccag ctccgtgaag tgcgccgcga tcccggtgac gacgtcctcg 91680
tcccccgaca cgaccacgga accgggcccg ttgacggccg cgacaccggc caccgactcg 91740
tgcccggcca gcagcggcag tacctcgtcg gccgtggccg ccaccgacac catcgccccg 91800
ccctcgggca gctcctgcat cagccgcccc cgggtggcga ccagttcaca ggcgtcggcc 91860
agcgacagca tgcccgacac atgcgtggcg gtcagttcgc cgaccgagtg cccggacagc 91920
agccgcggac gcacccccca ctcctccagc agccggaaca gtgccacccc gagcgcgaac 91980
gtcgccgact gggtgtacag cgtccggtcc agcagccgtg cctcggcgct gttcgcgtcg 92040
gccagcagca ggggcagcag cggcatgtcc agccgcgcgc ccagttcggc acagacctcg 92100
tccagagccc gggcgtacgc tgggaaggtc tcgtacaact ccctgcccat gccgggccgc 92160
tgggcaccct ggccggtgaa caggaaggcg cagcgcacct cgtccgcgac cccgtccacc 92220
agcccggcag gacgctcccc gcgcgccagc tcggcgagcc cgcccagcag ttcctcgcgg 92280
ctctccgtga gcaacacggc gcggtgctcg aaggcggtac gggtggtcgc cagcgcggcc 92340
gccaggtcac cgagcggcag atccggacgg tcgagcagat gggcgtgcag ccgggcggcc 92400
tgggcgggca aggccgcggg cgtggccgca gacaacggca cgggcacgac cggcagcacg 92460
gcggaaacca tcgaatcgga cacgtcggcc cgctccgctc cgtgagccgg aagatctgcg 92520
cccttcgccg tatcggtcgg tggatcggca gtgggcggct cctccaggat gacgtgcgcg 92580
ttggtgccgc tgatcccgaa cgaggacaca ccggcccgcc gaggacgccc ggttcgcggc 92640
catgcatggt tctccgtcag cagctccacc gtgcccgctg accagtccac ttcgggcgtc 92700
ggccggtcga cgtgcagcgt ccggggcaac tgctggtggc gcatcgccat gaccatcttg 92760
atgattccgg cagcgcccgc ggcggcctgg gtgtgaccga tattggactt gacggagccg 92820
agccgcagag ggctgctgtc cgggcgctcc cggccgtagg tggcgatcag ggcctgcgcc 92880
tcgatcgggt cgccgagcgt cgtgccggtg ccgtgcgctt ccaccacgtc gatgtccgcg 92940
aagcccagcc gggccgaggc cagcgcctgc tcgatgactc gctcctgcga ggggccgttg 93000
ggggaggtca gcccgttgga cgcgccgtcc tggttgaccg cgctgccgcg gacgatcgcc 93060
aggacgtcgt gacccaggcg acgggcgtcc gagagccgct cgactacgag caacgcggag 93120
ccctcgcccc agccgacccc gtcggcggcg gccgcgaacg ccttgcaccg gccgtcggtt 93180
gccaggccgc gctgacggga gttctccacg aaggtggcag gggtggccat gacggtcgcg 93240
ccgcctgcca gcgccaggtc gcattccccg tcgcgcagcg cctgcaccgc gaggtgcagc 93300
gccaccagcg acgacgagca ggcggtgtcg agggtgacgg ccggaccctc cagctccagg 93360
aagtaggaga tccggcccga ggcgacgctg ccggccatgc cggtgctcaa atagccctcg 93420
acctcgtcgg gcagggtgcc gggaccggac gcgtagtcgt ggtacatcag cccggtgaag 93480
atgccggtcc gtgagccggc cagcgaccgc gggtccactc cggtgttctc cagcgcctcc 93540
caggcggtct ccagcagcag gcgctgctgc ggatccatcg ccagcgcctc acgcggcgag 93600
atcccgaaga actccgcgtc aaactcggcg gccccgcgca ggaacccgcc ctgggtggca 93660
taggtggtgc ccggccggtc ggggtcgacg tcgtagagtt caccgagcgc ccagccgcgg 93720
tccgacggca tgtcggtgac ggcgtcctcg ccccgcagga caaggtccca cagctcttcg 93780
ggcgaaccga caccgccggg cagtcggcag gccatgccga cgattgctat gggctcggtt 93840
tcgccggcct cgagcttctt gatgcgtcga tgggcctgcc gcagatcggc tgccaacttc 93900
ttgaggtaat caaccagctg cgcttcgttc gtcacctgag aacctgcctg agagattggc 93960
aaaccgcgcc cttcggggcg aagctacgaa cctcaccccc ctaaccgcct cccctcagcc 94020
accccggaag gtgtggatgg gcgcatatgg tcgggtaggg gttggcggta ggggcgcccc 94080
ctgcctagcc tctgcatgaa ttcccgtgcc gtgccaagga ctggagtata acgagcaatg 94140
ggcgttttcg agcaggaagc cgcagaatca acgggggaga aatttgtcag gcccgcggcg 94200
ccggaaagga tgcgtgacct cgactttctg ctcggtgatt ttcgtgtgga atggacgaac 94260
ttcaccgcag acccgcccgt gaagggcacg gctgcttgga acaccgtgtc gaccttcgcc 94320
ggtcacgcgt acgagatgac ccagctggta ccgaaagacg acctcactgg ccgcttcgtc 94380
atccagtggg tcgagtcgga gtcgtcattc tccggctatt attacgacga ctggggaaac 94440
cgcaccctgc tgaccgcgaa gggatggcag gatgggtacc tttccttcac aggtgaatgc 94500
atcgggtttg gccgctggtt cctgctcaaa gagcggtacc aggttatcga cgagaaccac 94560
tacctgaaat gcggattcat cagattcgag gcagacggcg aatgggttcc tgcggacgag 94620
gtccactgct accgcgtctg aacatgtcga accacccggg aaatcgacgc tcgggttcct 94680
gactcccggg aaggtgaacc aaccatgact ctgctgtccg aagcggtacg cgcgggtgcg 94740
tcgccacagg aactggagcg ggcggaaccg cccagggagt acaccgccgc gtacatccac 94800
tccgaggaca cccggatgtt cgagggggtc gcggacaagg acgtgcgcaa gtcgctgcgg 94860
gtcggccggg tgccgatgcc tgaactggcg ccggacgagg tgctggtcgc cgtcatggcc 94920
agtgccgtca actacaacac cgtgtggtcg gcgatcttcg agccgctgcc caccttccgc 94980
tttctgaggc agttcgccgc gcagggcggc tgggcctcgc ggcacgacct tccctaccac 95040
gtgctgggct cggacggcgc cggcgtggtg gtgcgcacgg ggcccggggt gcggcactgg 95100
aagaccggcg accacgtggt ggtcagctgc gtccaggccg acgaccagga agcggccacg 95160
caggcagacg ggatgctcgg cgccgagcag cgcatatggg gcttcgagac caacttcgga 95220
ggcctcgccc attacgcggt ggtccgggcc agtcagctga tccccaagcc cggccatctc 95280
agttgggagg aggcagcctg caacccgctg tgcggaggta cggcgtaccg gatgctggtc 95340
ggtgaccgtg gcgcccggct taagcagggt gagatcgtgc tgatctgggg tgcggccggc 95400
ggcctcggcg cctacgcggt gcaactggtc aagaacggcg gcggcatccc agtcggtgtc 95460
gtcagctccc ccgccaaggc ggaggcggct cggcggctcg gctgcgacgt ggtgatcgac 95520
cgtcaggaga tcggtctcga cgaccgtacg gcgtacgacc cggccgcggt gatcgagaca 95580
ggcaagcagc tggggcgcat catccggcgg gaggtggggg aagacccgca catcgtcttc 95640
gagcacgtcg gccggtccac cttcccggtc tccgtttttg cggtacgccg cggcggcacg 95700
gtggtgacct gcggctcgag cacgggttat cagcacacct acgacaaccg ctacctgtgg 95760
atgaagctga agcggattat cggcagccac gccgccaacc ttcaggagca gtgggaactg 95820
aaccgactgg tgtcccgcgg ccaaatcgtg ccgacccttt ccgcggtcta ccctctggcg 95880
gaggtggctg cggccacccg gtcggttcag accaaccgcc acataggaaa ggtcggtgtt 95940
ttgtgtctgg ccgaggcacc cgggcagggc gtcaccgacc ccgccctgcg tgcccgggtg 96000
ggcgaggagc gcctcagcct cctccgcgac ctttctccca ctgcctgagc cagggaagag 96060
gtggtcgagg acctcgcggc gatgctgccg atgacgcgtt cgcagtgggc gttcatccga 96120
ggcgccgggg gcgctcttga tcacctccag ttcctccgcc ttgaaggccg catcgaacgc 96180
gtccccgtac ttggcgtcgc gatcgcgcag caggaaccgc agggactcca tgcgtacgcc 96240
gaggccggcg gcgaggttcc gccggtggtg aggaactcac gccaggttgg actggagtga 96300
cgcggtgccg ggtcgatgcc cgctgcgttc aggataacga tctacaacac actccattcg 96360
cccagctcaa cgccactcgt gaacactgcc cgatcctgtt gcccattgtg gttatgcggt 96420
gagggtgtgc tcgagtcggg tgaggtggct ggtgcgggtg cggtccaggg ggcggtcgtt 96480
ccaccaggca ttgaggcgga tcaggttgag tgcgacggcg gagtagatgt gttcgagatg 96540
ggtcttcgcc aggcctcggt agcgagcgcg gcgtgtgccg gtgacagcgg tggcctgccg 96600
gatggtgccc tcgatgccgg aacgtaaggc gtagtcggtg ttccagtcct tggtcttctg 96660
ctgggtgcgg gtgtgccgga gtgcctcggt catctgtctg aggtggaggg agagttggcg 96720
gcggttcttc ttcgcggtgg tgcactgggg tttgaagggg caggggatgc agtcgagggc 96780
ggcgaagctg accacggtct tggggatacc ttcgctcacg acggggttcc aggtggcgct 96840
ggtatgtccg gcggggcagg tggccttccc ggcttcccgg tcgatggtga agtcggtggc 96900
agcgaagcct gcctgggctt tggcctggcg ggaggtgtcc agcaggaccg gcgtgatcag 96960
tgcgattccg taggtcttca ccgagccgtg gatgagttcg gcggtggcgt agccggagtc 97020
gggatagtgc tcatcgggaa gcagcccgcg ttgctggagt gcgtggtgaa tggcattgag 97080
tgtcttgctg tccggcactg tggagtgggt ggtggcgatg ttcgtgatca ggttcgggtg 97140
tgtgcgtgcc ttctcgggag cggaggtgca ggtttcgctg atgtggagct tgtagccgtt 97200
ccagaacatg tcgcgtttgg cggaccagcg agcatcggtg tcataaggcg aggacagacg 97260
taggtggccg ggcgggcggc cgtcaccgcc ctcatcggtc ttctcccgcc gcttgacgac 97320
ctcgcggccg cctcgggtga tggtgcgggt gtagttctgc acaagcacac accacaggac 97380
ctgcaccgcg ggcagctcgc gcagccagac cggagagctg gagtggtaga ccgcgcccag 97440
cagggcgaag ccgtcccgcg cgaagtccac ggccagtttc tgctgcctgg cccgggaagt 97500
gggcaggcgc cagctgtcca cccgcggtcc gtaccgcctg ctccacgagg ccacgtccac 97560
tgcctgtgcg acccagtccg gacccgcgca ggtcagcgct tccagtgcgg cgcgaaccgc 97620
ttccccggcc agctccagcc ggttcaggtc ccgcaccgcc gcgaccacgt gggtggagtc 97680
cgtgcgctgc ttgcctcctg cggccagcag gccttgttcg gtcagcctgg ccaccaacaa 97740
gtccagtacc ttctcttcca ggccatgggc ggcaacccgg ctacggaact gggacaggac 97800
actgaagtcg aagccgggat cctccaggcc cagtccgagc gcgtaggacc acgagagttt 97860
gtcccgcacc gcctcggcag cctgccggtc agtcaggttc tccgccatct gcagcaccgt 97920
gaccaatgcc aagcggcccg gtgaccagcc acgcggcccc gtcaccgcga acgcttccgc 97980
gaactcggca tccgcgaaca actcaccgag ccgatcacgc accacgaccg gcaacggcac 98040
ctgacgaccg gagtacttcg cccgcaccgc ccgagccacc tccggcgccg gctccggcca 98100
cgaccgcggc tccatcggca cgagcccctc ccactccgtg accgggaacg agaaggaccg 98160
gccaccagac cagcctgccg caacaatcaa caccccgacc agcacaaaca gcaaatgggc 98220
aacaggatcg ggcagcgttg aaccatggca ggatcgacac agacgccgcg ggtaacctgc 98280
tggtttccct gccgatgccg cgctttgacg ggcggatcgt gctcgccgtg gacgtctccc 98340
cgtggctgcg ctcggacgcg gcctgctcgc ctgagcgact gctctgtcac gtccacggcc 98400
gttctcggga ggccgcgcag atcatcccgg gttggccgta ctccttcatc gccgcgctga 98460
caccggaccg tacttcgtgg acgcagatcc tggacgtggt ccggctcggg cccgccgacg 98520
atgccgcggc cgtcaccgcc gaccagcttc gggctgtggt cgagaggctg atcgcggccg 98580
gtcaatggca gccgcgaggc catgtcgatc ggggtgccgc ccgcgacgaa cctgaatgtg 98640
ggacatccgg tcatggcgtc gaagatccgg ccgatgccgt cggacagctt gctcgcgctg 98700
tcgctgtaca gctcctcggc gccgtccgtg aggtacagtc tcgccgcccc cgggccgtcc 98760
cgttgaacct cactgtccac ccctctccgt agctcgcatt cgatgcgaga aaatcgcatc 98820
gaatgcgagg cggcagcgaa accgcagtcg tccatccgga cgagtgacag cggctgacta 98880
tcgggtctcg ccagccgcta ccgtcgcgcg acgcaacgtt ccggacatcc ctgttcagcg 98940
aggtgttgtc cgccctcgcc ggtgccgaca cagcaactgt cgccagttcg catccgatgc 99000
aattggcggc gcccgtgccg tttgcgctgg gccgaccacc ggtgtctcat caggtggcca 99060
gtgctgcctg ggtgaagcgc gctattcctc ggctggggtt ggggcttggg ggagacgtag 99120
ttggaagcat gctccgaggg tccggggggt gagggtgagg gtgcctttgt ggcggtgggc 99180
caggtcgcgg gcgatggcga ggcccaggcc ggtgccgccg tggtcgcggg agcgggcgtc 99240
gtcgagtcgg acgaagcgtt cgaagatgcg ctcggcgtct tcagtgggca cgcctggtcc 99300
gtcgtcgtgc accgtgaggt cgacccaagc gtcctggttt cggatggtga tgtggatgcg 99360
gtgtgcggcg tggcgggcgg cgttgtcgat gaggttgcgc agtagtcgtt cgtattcgtc 99420
ggggtttccg tgtgcgtgtg cgggggcggt gctgtcgcag gtgagggtca gcggtcgttc 99480
ggtgaggggg tattgctcgg tcagccggga ggcgagggct gtcaggtcga cggtttcggg 99540
gccggctgtg ggggtgcggg tgtcgaggcg ggcgaggagc agcaggtctt cggcgagggc 99600
gtggaggcgg cgggtctggc gtgcggcggt ggtgaccgcg gcgggccagt cggtgcgctc 99660
cgggtaggcg agcgcgactt ccaggctggc cagcagtgtg gtgagggggc tgcgtagttc 99720
gtgggcggcg tccgcgacga agcggcgttg ctgggcggcg gcgctgtcga ggcgttggag 99780
ggtggtgttg atggtggtgg ccagggcggt gatctcgtgt cccgtggcgg ggacggtgac 99840
gcgttcgcgg gggtcgctcg cggtgaccga ggcggtgagg acgcggatgg cttcgaccgg 99900
ccgcagcgcg atgcggacgg cgaagtaggc gacggcggcg atcagtacga ggctgacgag 99960
cccggctcgc agcagcaggc ggtcggtggc ctcggtgatt gtttcggcga tgtcctcggc 100020
tgcgtgcggc agcaccacca catagacccg cagttgggcg tcggcggcga cacccagagc 100080
ggcgactctg tcgctgctca gttcatcggc cctgacatcg tggtacatga ccaggtaggt 100140
cccaccgtcc ttgccgaacc ggtccccggg ctcggagtcg gggcgcgccg gcatgcgcgt 100200
gggaatggtc gtatagcccc agcctgacga ctcgtcatct ggcggggcgg gcaactcggc 100260
cttgggcggg gcgggcagca catggcgggt gccgggatcg aactcctcca tgcctccgcc 100320
gtaggcgaca gcaccgcggc ggtcggtcgc gacgacctcg tacggcacag tgcttcggcg 100380
aacaggaacc acaccctcct ccacctgatc gacgagagcc cggaattgtg cctgcgcctg 100440
cccttcggcg atctgcgtgc tctcgcggta gacgtcgtgg tgtacccacc agccgatgcc 100500
gaccaggatg acggcggcgg ccgaggcggc ggccagggcc gtgcgggctc gtaccgaacg 100560
cggccaccag cggcgtcgcg gcccggtcgc gttcacggtc gtccaccagc ggcgtcgcgg 100620
ctcagtcgcg ttcacggtcg tccaccagtc ggtagccggt accccgcacg gtctgcaggg 100680
actgccggtg gaacgcggcg tccaccttct tgcgtagcgc gctgacgcgt gcctccacca 100740
ggttgggatc ctcggcttcg tcgggccacg cgtgatagag cagatccgtt ttggagaccg 100800
cctggcccgc ccggcgggcc agcagctcca gcacggcgaa ctcccggggg tgtgagttcc 100860
acccggaccc cggcccggcg gcagacccgg ccggcgacat ccagcgagag gtcgcccacg 100920
gcaaggacgg gcggggcgac cgtggcggct cgtctgacca gggcccgcag ccgtgcgacg 100980
agcaccacgt aggagaaggg cttggccagg tagtcatcgg cccccgtgtc cagggcctcc 101040
gcctgatccc actccccgtc cttggcggtg aggaccagga tgggggtcgc gttgttctcc 101100
cggcgcaact gggcgcagac cttgtagccg ttgagtccgg gcagcatcaa gtccaggacg 101160
accagggcgt attcgccggt ccgggccatc cacagtccct gccggccgtc atgggcgagg 101220
tcgacgctgt agccctcggc ggtcagaccg gtgtgcaggg tgtgggcaag gtccacctcg 101280
tcttcgacca ccagaatgcg catgcggtgc agcctcgcac agcgccggcc cgccttcctg 101340
atcggccggt caggttggat cagcatccgg tcaacgaggt cccggcatgc tggccgagtc 101400
tcttcgccac tactgaaagg ccctgccacc gatgtccgtt gctgaacgtg cccccgccgc 101460
ggccaggacg gtttcccctc ccgcacgcgc ccgtcgtcag tcaccgctcc ggcctgtgac 101520
cgatggcggt ccccagccgc gggcgcgtct gcggtgacgc gggggtgctg agccgagttg 101580
gtcaagctca agggcacgac gggccatcat cacgccacaa gcgatgggga ggtccgagcg 101640
ggcgagcgaa agcgaaccct tgcccttacg agccgacatg aggaacaccg cgtcgatggc 101700
gaagcggtcg gcggcagatt tacgcctgac cagcagaaac ccagtgatta cacccggaag 101760
acaacaagtt atcagatagc ctccaggggg agacttgcgc agacaagtga aaagagcgtg 101820
cgcggccacg atcgccacgg cggcagccgt ggccctggcg gcggacatga ccagcccggc 101880
gtcggcggag cccgagcgta cggccggtga ccaggccgta cagaccacgc ccaagcaccg 101940
cgtcaccctg atcaccggcg accgtgtcgt cgtcgacgcc aagggccgcg tcatcggcct 102000
ggagcgggcc aagggccgcg aggggatacc cgtccagatc cgcaaggccg acgggcacac 102060
cctcgtcgtg ccggccgacg cggcacggct gatagccgac ggcagactcg accagcggct 102120
cttcgacgtc accgagctca acaagtcggc caaccgcaag gcccagaagc agggcctcaa 102180
gctgatcgtc ggctacagcg gcacggccgc cgcggcgaag gcggacgtcc gggaggccgg 102240
cgacaccaag gtccgcagga ccctgaagtc gctgaacgcg gacgcggtgc tgacgcccaa 102300
gggcgacgcg cccgacctgt gggccgcggt caccgacacg ccgtccggcg gcgcgaagac 102360
cgcctccggc atcgcccacg tgtggctcga cggggtccgc aaggccagcc tcgacaagtc 102420
cgttccgcag atcggcgccg acaaggcgtg ggccgccggg tacaacggca agggcgtcaa 102480
gatcgccgtc ctcgacaccg gtgtcgacgc gacccacccg gacctcaagg agcaggtggt 102540
cggagagaag aacttctcca cgtcccccga cgcgaccgac aagtacggcc acggcacgca 102600
cgtcgcgtcc atcgcggccg gtacgggagc caagtcggcg ggcaagtaca agggcgtcgc 102660
accgggcgcc aagctgctca acggcaaggt gctcggcgac gacggctccg gcgacgactc 102720
cggcatcctc gccggcatgg agtgggcggt cgagcagggc gccgacgtcg tgaacctcag 102780
cctcggcggc ggggacaccc ccgacatcga cccgcttgag gcccaggtca acaagctgtc 102840
caaggagaag ggcgtcctct tcgccatcgc cgcgggcaac gacggcgact tcggcgagca 102900
gacgatcggc tccccgggca gcgcggaggc cgcgctcacc gtgggcgccg tcgacgacac 102960
cgacaagctg gcctcgttct ccagcacggg ccccggcctc gacgggcaga tcaagcccga 103020
cgtgaccgca cccggtgtgg acaccacggc cgcctcggcc ccgggcagcg tcatagccca 103080
ggaggtcggc gagaagccgc ccggctacgt gagcatctcg ggtacgtcga tggccacccc 103140
gcatgtcgcg ggcgccgcgg cgatcctgaa gcagcagcac cccgactgga cgtacaccca 103200
gctcaagggt gcgctgaccg gctccgcgaa gggcggcaag tacacgccgt tccagcaggg 103260
ttcgggtccg gagtccaggg tcgacaatgg cacagtcaag ccagatccgg tgggtcgctc 103320
gatcctcgaa cggtcgggtg agtcggtcgg cccgtccatg cacgtggctc gcacaccgac 103380
gacaagccgg tcaccgacaa ggtcacgtac aagaacctcg ggaagaccga tgtcacgctg 103440
accctcgcgg tgacggccac cgacccgaag gggcaggccg caccggccgg cttcttcacg 103500
ctcggcacca agacgctgac cgtcccggcg ggcggctcgg cctccgccga cctcacggtc 103560
aacacgaagc agggcggcac gctcgacggc gcctactccg cgtacgtgac cgccaccggc 103620
ggcggccaga gcgtacgcac ggcggcgacg gtgcagcgcg aggtggagtc gtacgacgtc 103680
acgctcaagt tcatcgaccg tgacggcaac ccggcgaagt actacaacgc cgaactggac 103740
ggtgtcaccg ggctcgcaca gggcaagtgg tactcgccct acgacgagtc cggcaccgtc 103800
aaggtccgct ttcccaaggg cggttacatc ttcaactcgg ccgtccacgt cgacccggat 103860
gaccccgcca agggtttcga ctgggtgacg cagccgaagc tgagcatcac caagaaggcc 103920
acgatcacgg tggacgcgcg gaccgcgaag ccggtggaca tcaccgtgcc cgacgcggcg 103980
gcgaagtcgg aggtcgctac gccgttgtac accgtcggcg tgccggacgg cagcaactcg 104040
tacggctggt ggctggactc gtacgccaac ttccgtaccg cgcacgccgg tccgca 104096
<210>2
<211>6148
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n.sp.NS3226)
<400>2
Met Ala Gly Gly Ser Glu Ser Glu Ala Ala Glu Phe Thr Ala Arg Ser
1 5 10 15
Ala Gln Pro Ile Ala Val Val Gly Met Ala Cys Arg Leu Pro Gly Ala
20 25 30
Ala Gly Pro Ala Glu Phe Arg Ala Ile Leu Arg Ser Gly Thr Glu Ala
35 40 45
Val Gly Ala Ala Ala Pro Asp Arg Pro Tyr Ala Pro Pro Arg Gly Gly
50 55 60
Phe Leu Asp Ser Val Asp Arg Phe Asp Ala Gly Phe Phe Gly Val Ser
65 70 75 80
Pro Arg Glu Ala Ala Val Met Asp Pro Gln Gln Arg Leu Met Leu Glu
85 90 95
Leu Cys Trp Glu Ala Leu Glu Asp Ser Gly Ile Val Pro Ala Arg Leu
100 105 110
Asp Gly Ser Asp Ala Gly Val Phe Val Gly Ala Ile Thr Asp Asp Tyr
115 120 125
Ala Val Leu Ser Arg Ala Ala Gly Val Asp Ala Ala Thr Pro Glu Thr
130 135 140
Ser Thr Gly Leu Asn Arg Gly Met Ile Ala Asn Arg Val Ser Tyr Arg
145 150 155 160
Leu Gly Leu Arg Gly Pro Ser Phe Thr Val Asp Ser Gly Gln Ser Ser
165 170 175
Ser Leu Val Ala Val His Leu Ala Thr Glu Ser Leu Arg Arg Gly Glu
180 185 190
Cys Ser Leu Ala Leu Ala Gly Gly Val Asn Leu Ile Leu Ala Glu Asp
195 200 205
Ser Thr Ala Ala Val Glu Arg Phe Gly Ala Leu Ser Pro Asp Gly Arg
210 215 220
Cys Tyr Thr Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly
225 230 235 240
Gly Gly Val Val Val Leu Lys Arg Leu Thr Asp Ala Val Ala Asp Gly
245 250 255
Asp Asp Ile Leu Cys Val Leu Ala Gly Ser Ala Val Asn Asn Asp Gly
260 265 270
Gly Gly Glu Gly Leu Thr Val Pro Asp Arg Gln Gly Gln Glu Ala Val
275 280 285
Leu Thr Ala Ala Tyr Glu Gln Ala Gly Ile Ser Pro Asn Ala Val Gly
290 295 300
Tyr Val Glu Leu His Gly Thr Gly Thr Pro Ala Gly Asp Pro Val Glu
305 310 315 320
Ala Ala Ala Val Gly Ala Val Leu Gly Ala Gly Arg Ser Ala Glu Gln
325 330 335
Pro Leu Leu Val Gly Ser Val Lys Thr Asn Ile Gly His Leu Glu Gly
340 345 350
Ala Ala Gly Ile Ala Gly Leu Leu Lys Ala Val Leu Thr Val Arg His
355 360 365
Arg Glu Ile His Ala Ser Leu Asn Phe Thr Thr Pro Ser Thr Arg Ile
370 375 380
Pro Met Thr Glu Leu Gly Leu Ser Val Asn Thr Ala Leu Arg Pro Trp
385 390 395 400
Leu Ser Glu Ala Gly Pro Leu Ile Val Gly Val Ser Ser Phe Gly Met
405 410 415
Gly Gly Thr Asn Cys His Val Val Leu Thr Glu Trp His Gly Val Ala
420 425 430
Pro Val Thr Ala Pro Gly Ile Arg Pro Asn Gly Thr Ala Val Pro Leu
435 440 445
Leu Ile Thr Gly Arg Asp Glu Gln Ala Leu Arg Asp Gln Ala His His
450 455 460
Leu Gly Arg His Leu Asp Glu His Gly Pro Leu Arg Leu Lys Asp Val
465 470 475 480
Ala His Thr Leu Ala Ala Gly Arg Thr Ala Phe Glu His Arg Ala Val
485 490 495
Leu Leu Val Arg Glu Pro Gln Asp Met Thr Asp Gly Leu Ala Arg Leu
500 505 510
Ala Asp Gly Thr Pro Gly Pro Asp Leu Val Arg Ala Thr Ala Thr Cys
515 520 525
Ser Ser Leu Ala Phe Leu Phe Thr Gly Gln Gly Ser Gln Arg Pro Gly
530 535 540
Met Thr Ala Glu Leu Tyr Gln Ser Ser Ser Glu Tyr Ala Ala Ala Leu
545 550 555 560
Asp Glu Val Cys Ala His Leu Asp Pro Gln Leu Arg Val Pro Leu Arg
565 570 575
Glu Val Leu Phe Ala Ala Glu Gly Thr Ala Glu Ala Val Leu Leu Asp
580 585 590
Arg Thr Glu Phe Thr Gln Pro Ala Leu Phe Ala Val Glu Val Ala Leu
595 600 605
Phe Arg Phe Ala Glu His Cys Gly Leu Val Pro Arg Leu Leu Leu Gly
610 615 620
His Ser Val Gly Glu Leu Ala Ala Ala His Val Ala Gly Val Leu Ser
625 630 635 640
Leu Ala Asp Ala Cys Ser Leu Val Ala Ala Arg Gly Arg Leu Met Gln
645 650 655
Ala Gln Pro Ala Thr Gly Ala Met Ala Ala Ile Gln Ala Thr Glu Lys
660 665 670
Glu Leu Ala Pro Phe Leu Asp Glu Ser Val Ala Ala Ala Ala Leu Asn
675 680 685
Gly Pro Ala Ser Thr Val Leu Ala Gly Asp Glu Glu Ala Val Leu Ala
690 695 700
Ile Ala Ala His Trp Ala Ala Lys Gly Arg Arg Thr Lys Arg Leu Arg
705 710 715 720
Val Ser His Ala Phe His Ser Pro His Met Asp Gly Met Leu Glu Glu
725 730 735
Phe His Arg Val Ala Gly Gln Leu Thr Phe Glu Ala Pro Arg Val Pro
740 745 750
Ile Val Ser Asn Glu Thr Gly Ala Leu Leu Thr Glu Ala Glu Ala Cys
755 760 765
Ser Pro Glu Tyr Trp Val Arg Gln Ala Arg Val Thr Val Arg Phe Leu
770 775 780
Asp Gly Val Arg Leu Leu Glu Glu Gln Gly Val Thr Thr Leu Leu Glu
785 790 795 800
Leu Gly Pro Asp Gly Thr Leu Ser Ser Leu Ala Arg Asp Cys Leu Arg
805 810 815
Gly Val Asp Ala Val Ser Val Pro Leu Leu Arg Gly Arg Thr Glu Pro
820 825 830
Glu Glu Val Val Ala Ala Leu Ala Thr Leu Gln Val Arg Gly Val Pro
835 840 845
Met His Trp Glu Arg Leu Ala Thr Glu Glu Gly Ala Arg Arg Val Pro
850 855 860
Leu Pro Thr Tyr Pro Phe Gln Arg Arg Arg His Trp Leu Pro Asp Leu
865 870 875 880
Val Ala Gln Asp Ser Val Pro Ala Pro Gly Arg Ala Ala Gly Gln Arg
885 890 895
Ser Arg Pro Val Asn Glu Pro Ala Pro Ser Ala His Ala Pro Arg Gly
900 905 910
Asp Arg Thr Met Arg Glu Thr Val Arg Ala Ala Val Ala Leu Val Leu
915 920 925
Gly His Asp Ser Pro Asp Asp Ile Pro Ala His Thr Thr Phe Arg Glu
930 935 940
Leu Gly Leu Ser Ser Leu Met Leu Ala Glu Val Gly Glu Arg Leu Thr
945 950 955 960
Glu Ala Thr Gly Arg Arg Val Pro Thr Thr Leu Leu Phe Asp His Pro
965 970 975
Thr Pro Asp Ala Leu Val Arg Glu Leu Thr Ser Gly Gly Ala Glu Arg
980 985 990
Pro Ala Ala Leu Thr Thr Ala Pro Ser Ala Ala His Ala Asp Asp Pro
995 1000 1005
Val Val Val Val Gly Met Ala Cys Arg Leu Pro Gly Gly Ile Arg
1010 1015 1020
Ser Pro Glu Glu Phe Trp Gln Phe Met Ala Ala Asp Gly Asp Ala
1025 1030 1035
Ile Ser Pro Leu Pro Thr Asp Arg Gly Trp Ala Val Ser Gly Asp
1040 1045 1050
Phe Pro Ala Glu Gly Gly Phe Leu Ala Asp Val Ala Gly Phe Asp
1055 1060 1065
Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp
1070 1075 1080
Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu
1085 1090 1095
Arg Ala Gly Val Asp Ala Leu Ser Leu Arg Gly Ser Arg Thr Gly
1100 1105 1110
Val Phe Val Gly Ala Ser Pro Ser Glu Tyr Gly Pro Arg Leu His
1115 1120 1125
Glu Pro Ser Gln Ala Asp Gly His Val Leu Thr Gly Thr Ala Pro
1130 1135 1140
Ser Val Leu Ser Gly Arg Val Ala Tyr Val Leu Gly Leu Glu Gly
1145 1150 1155
Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala
1160 1165 1170
Leu His Leu Ala Ala Gln Ala Leu Arg Gly Gly Glu Cys Asp Leu
1175 1180 1185
Ala Leu Ala Gly Gly Val Ala Val Met Ala Thr Ala Gly Met Phe
1190 1195 1200
Ala Glu Phe Ala Arg Gln Gly Gly Leu Ala Arg Asp Gly Arg Cys
1205 1210 1215
Lys Ala Phe Ala Asp Gly Ala Asp Gly Thr Gly Trp Gly Glu Gly
1220 1225 1230
Val Gly Val Leu Val Leu Ser Arg Leu Ser Glu Ala Arg Arg Cys
1235 1240 1245
Gly Tyr Thr Val Leu Ala Val Val Ser Gly Ser Ala Val Asn Ser
1250 1255 1260
Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln
1265 1270 1275
Gln Arg Val Ile Arg Gln Ala Leu Ala Ser Ala Gly Leu Ser Pro
1280 1285 1290
Gly Asp Val Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Leu
1295 1300 1305
Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln
1310 1315 1320
Glu Arg Gly Ala Gly Arg Pro Leu Tyr Val Gly Ser Val Lys Ser
1325 1330 1335
Asn Ile Gly His Val Gln Ala Ala Ala Gly Val Ala Gly Val Ile
1340 1345 1350
Lys Ser Val Leu Ala Leu Arg Tyr Gly Val Leu Pro Arg Thr Leu
1355 1360 1365
His Val Asp Val Pro Ser Arg Glu Val Asp Trp Ser Ala Gly Ala
1370 1375 1380
Val Glu Leu Leu Thr Glu Ala Val Glu Trp Leu Ala Gly Gly Arg
1385 1390 1395
Pro Arg Arg Val Gly Val Ser Ala Phe Gly Ile Ser Gly Thr Asn
1400 1405 1410
Ala His Val Ile Leu Glu Glu Ala Pro Glu Gly Val Glu Glu Ser
1415 1420 1425
Ala Ala Gly Glu Val Ala Gly Val Val Pro Trp Val Val Ser Ala
1430 1435 1440
Arg Ser Glu Glu Gly Leu Arg Ala Gln Ala Ala Arg Leu Val Glu
1445 1450 1455
His Val Val Gly Gly Ser Gly Leu Gly Pro Val Asp Val Gly Trp
1460 1465 1470
Ser Leu Ala Arg Ser Arg Ala Val Leu Glu His Arg Ala Val Val
1475 1480 1485
Leu Gly Gly Asp Gly Glu Glu Leu Val Ala Gly Leu Arg Ala Leu
1490 1495 1500
Cys Asp Gly Val Leu Gly Pro Gly Val Val Arg Gly Val Ala Gly
1505 1510 1515
Asp Gly Gly Thr Ala Leu Leu Phe Thr Gly Gln Gly Ala Gln Arg
1520 1525 1530
Val Gly Met Gly Arg Glu Leu Tyr Glu Ala Phe Pro Val Phe Ala
1535 1540 1545
Ala Ala Phe Asp Ala Val Cys Ala Gly Phe Glu Gly Met Leu Pro
1550 1555 1560
Gly Ser Leu Arg Gly Val Val Phe Gly Asp Gly Gly Gly Val Val
1565 1570 1575
Asp Arg Thr Glu Trp Ala Gln Pro Ala Leu Phe Ala Leu Glu Val
1580 1585 1590
Ala Leu Phe Glu Leu Val Val Ser Trp Gly Val Arg Ala Asp Val
1595 1600 1605
Leu Val Gly His Ser Val Gly Glu Leu Val Ala Ala His Val Ala
1610 1615 1620
Gly Val Trp Ser Leu Ala Asp Ala Cys Arg Val Val Ala Ala Arg
1625 1630 1635
Gly Arg Leu Met Gln Ala Leu Pro Val Gly Gly Ala Met Val Ala
1640 1645 1650
Val Arg Val Gly Glu Gly Glu Leu Pro Val Leu Pro Glu Gly Val
1655 1660 1665
Ser Val Ala Ala Val Asn Gly Pro Arg Ser Leu Val Leu Ser Gly
1670 1675 1680
Asp Glu Gly Pro Val Leu Glu Leu Ala Ala Arg Leu Ala Gly Glu
1685 1690 1695
Gly Arg Asp Thr Arg Arg Leu Arg Val Ser His Ala Phe His Ser
1700 1705 1710
Ala Arg Met Glu Pro Met Leu Ala Glu Phe Ala Gln Val Leu Ala
1715 1720 1725
Ala Val Glu Phe Arg Ala Pro Arg Ile Pro Val Ile Ser Asn Val
1730 1735 1740
Thr Gly Glu Val Ala Gly Glu Glu Leu Thr Thr Pro Glu Tyr Trp
1745 1750 1755
Val Arg Gln Val Arg Glu Ala Val Arg Phe Ala Asp Gly Val Asn
1760 1765 1770
Thr Ala His Gly Ser Gly Val Arg Arg Tyr Leu Glu Leu Gly Pro
1775 1780 1785
Asp Gly Val Leu Thr Ser Leu Ala His Asp Ile Leu Ala Glu Gln
1790 1795 1800
Gly Ile Asp Arg Asp Val Ala Val Val Pro Ala Leu Arg His Asp
1805 1810 1815
Gln Pro Glu Ser Arg Thr Leu Leu Thr Ala Leu Gly Gln Leu His
1820 1825 1830
Thr Thr Gly Met Asp Val Gly Trp Ala Ala Phe Leu Ala Pro Tyr
1835 1840 1845
Gly Ala Arg Thr Val Glu Leu Pro Thr Tyr Ala Phe Glu His His
1850 1855 1860
Arg Tyr Trp Leu Asp Pro Val Ala Pro Ala Ser Ala Pro Ala Asp
1865 1870 1875
Pro Leu Arg Tyr Arg Ala Glu Trp Ala Ser Val Pro Asp Cys Ala
1880 1885 1890
Thr Pro Ser Leu Ser Gly Val Gln Ala Val Val Val Pro Ala Gly
1895 1900 1905
Gly Gly His Leu Asp Val Leu Pro Asp Val Thr Ala Ala Leu Arg
1910 1915 1920
Glu His Gly Ala Arg Thr Val Leu Val Glu Val Asp Pro Glu Arg
1925 1930 1935
Ala Asp Arg Ala Glu Ile Ala Asp Ala Leu Arg Ala Ala Leu Gly
1940 1945 1950
Glu Glu Gly Gly Gly Val Val Ser Leu Leu Ala Leu Asp Arg Gly
1955 1960 1965
Pro Phe Ala Gly Val Ala Ala Thr Ala Val Leu Leu Gln Ala Leu
1970 1975 1980
Thr Gly Leu Asp Gly Gly Gly Arg Leu Trp Ser Leu Thr Arg Gly
1985 1990 1995
Ala Val Ser Val Ser Arg Ser Asp Ala Leu Thr Asp Pro Gly Gln
2000 2005 2010
Ala Gln Val Trp Gly Met Gly Arg Val Ala Ala Leu Glu His Pro
2015 2020 2025
Glu Arg Trp Gly Gly Leu Val Asp Leu Pro Thr Glu Leu Asp Asp
2030 2035 2040
Arg Ala Arg Ala Arg Leu Cys Ala Val Leu Ser Gly Ser Thr Gly
2045 2050 2055
Glu Asp Gln Val Ala Val Arg Ala Ala Gly Leu Tyr Ala Arg Arg
2060 2065 2070
Leu His Arg Val Ala Pro Arg Val Pro Thr Thr Glu Asp Ala Gly
2075 2080 2085
Ala Ala Ser Gly Gln Gly Val Gly Asp Arg Arg Ala Tyr Thr Tyr
2090 2095 2100
Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Ala His
2105 2110 2115
Ile Ala Asn Trp Leu Ala Arg Ser Gly Thr Arg His Val Leu Leu
2120 2125 2130
Thr Ser Arg Arg Gly Pro Asp Ala Glu Gly Ala Ala Asp Leu Thr
2135 2140 2145
Ala Arg Leu Arg Glu Leu Gly Thr Glu Val Thr Val Ala Ala Cys
2150 2155 2160
Asp Val Ala Asp Arg Gln Arg Leu Ala Asp Leu Ile Ala Ala Leu
2165 2170 2175
Ser Ala Asp Arg Pro Leu Thr Gly Val Val His Ala Ala Gly Val
2180 2185 2190
Leu Asp Asp Gly Val Leu Asp Ser Leu Thr Pro Asp Arg Phe Asp
2195 2200 2205
Ala Val Ala Arg Pro Lys Val Ile Gly Ala Arg His Leu His Glu
2210 2215 2220
Leu Thr Arg Asp Leu Asp Leu Ser Leu Phe Val Met Phe Ser Ser
2225 2230 2235
Val Val Gly Thr Val Gly Leu Ala Gly Gln Gly Asn Tyr Ala Ala
2240 2245 2250
Ala Asn Ala Tyr Leu Asp Ala Leu Ala Val His Arg Ala Gln His
2255 2260 2265
Gly Leu Pro Ala Thr Ala Val Ala Trp Gly Ser Trp Ser Gly Ala
2270 2275 2280
Gly Met Ala Gly Asp Thr Arg Ala Ala Arg Asp Arg Leu Ala Arg
2285 2290 2295
Ala Gly Leu Ala Pro Leu Asp Pro Ala Ala Ala Leu Ala Val Leu
2300 2305 2310
Asp Arg Val Ile Ala Asp Gly Glu Thr Ala Val Thr Val Ala Asp
2315 2320 2325
Val Asp Trp Glu Arg Phe Ala Ala Gly Phe Ala Pro Gly Arg Pro
2330 2335 2340
His Pro Leu Leu Ala Gly Ile Pro Glu Leu Trp His Ala Arg Pro
2345 2350 2355
Gln Glu Thr Gly Gln Val Thr Asp Gly Pro Ala Asp Arg Leu Ala
2360 2365 2370
Gly Leu Ala Gly Asp Glu Leu Arg Gln Ala Leu Asp Asp Met Val
2375 2380 2385
Thr Val Glu Val Ala Ala Val Leu Gly Phe Arg Ala Lys Asp Arg
2390 2395 2400
Val Pro Thr Asp Arg Thr Phe Lys Ser Leu Gly Phe Asp Ser Leu
2405 2410 2415
Ile Gly Val Glu Phe Arg Asn Arg Leu Ala Ala Ala Leu Gly Arg
2420 2425 2430
Arg Leu Pro Pro Ser Leu Ile Tyr Asp His Pro Thr Pro Gly Arg
2435 2440 2445
Leu Val Glu His Leu Ala Ala Gly Val Asp Gly Gly Asp Gln Pro
2450 2455 2460
Ser Thr Val Gly Gly Arg Pro Val Ala Pro Thr Arg Thr His Asp
2465 2470 2475
Asp Pro Val Val Ile Val Ser Ala Ala Cys Arg Phe Pro Gly Gly
2480 2485 2490
Val Arg Thr Pro Glu Asp Leu Trp Gln Leu Val Leu Asp Gly Gly
2495 2500 2505
Asp Ala Ile Gly Pro Phe Pro Val Asp Arg Gly Trp Asp Leu Asp
2510 2515 2520
Arg Leu Tyr Asp Pro Asp Pro Gly Ala Ser Gly Thr Ser Tyr Val
2525 2530 2535
Arg Glu Gly Gly Phe Leu Thr Gly Val Ala Asp Phe Asp Ala Val
2540 2545 2550
Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln
2555 2560 2565
Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala
2570 2575 2580
Gly Ile Val Pro Gly Ser Leu Ala Gly Ser Arg Thr Gly Val Phe
2585 2590 2595
Val Gly Ser Asn Gly Gln Asp Tyr Ala Asn Leu Leu His Ser Ser
2600 2605 2610
Asp Val Glu Gly His Val Leu Thr Gly Thr Ala Ser Ser Val Leu
2615 2620 2625
Ser Gly Arg Ile Ala Tyr Thr Leu Gly Leu Glu Gly Pro Ala Leu
2630 2635 2640
Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu
2645 2650 2655
Ala Val Gln Ala Leu Ser Ser Gly Glu Cys Asp Leu Ala Leu Ala
2660 2665 2670
Gly Gly Val Thr Val Met Ser Gly Ser Asp Ile Phe Val Glu Phe
2675 2680 2685
Ser Arg Gln Arg Gly Leu Ser Ala Asp Gly Arg Cys Lys Ala Phe
2690 2695 2700
Gly Pro Asp Ala Asp Gly Thr Gly Trp Ala Glu Gly Val Gly Thr
2705 2710 2715
Val Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Glu
2720 2725 2730
Val Leu Gly Val Val Arg Gly Thr Ala Val Asn Gln Asp Gly Ala
2735 2740 2745
Ser Asn Gly Leu Ser Ala Pro Ser Gly Arg Ala Gln Gln Arg Val
2750 2755 2760
Ile Arg Gln Ala Leu Ala Asp Ala Gly Cys Ala Pro Ser Asp Val
2765 2770 2775
Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro
2780 2785 2790
Ile Glu Ala Gln Ala Leu Leu Thr Thr Tyr Gly Gln Asp Arg Pro
2795 2800 2805
Ala Asp Arg Pro Leu Tyr Leu Gly Ser Ile Lys Ser Asn Ile Gly
2810 2815 2820
His Ala Gln Ala Ala Ala Gly Leu Ala Gly Val Leu Lys Met Leu
2825 2830 2835
Phe Ala Leu Arg His Gly Gln Leu Pro Lys Thr Leu His Ala Pro
2840 2845 2850
Arg Pro Thr Pro Glu Val Asp Trp Ser Glu Gly Ala Val Ala Leu
2855 2860 2865
Leu Thr Glu Asp Arg Pro Trp Pro Ala Val Asp Arg Pro Arg Arg
2870 2875 2880
Ala Gly Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Val
2885 2890 2895
Ile Leu Glu Gln Ala Pro Pro Ser Ala Ala Ser Asp Pro Ala Pro
2900 2905 2910
Thr Val Arg Pro Pro Ala Val Asp Ser Ser Val Gln Pro Trp Val
2915 2920 2925
Leu Thr Ala Arg Ser Gly Glu Ala Leu Gly Ala Leu Ala Asp Arg
2930 2935 2940
Leu Arg Glu Ala Ala Pro Gly Ala Val Pro Ala Asp Val Ala Arg
2945 2950 2955
Ser Leu Val Thr Thr Arg Thr Ile Trp Ala Glu Arg Ala Val Leu
2960 2965 2970
Leu Ala Asp Gly Arg Asp Glu Tyr Ala Ser Gly Leu Ala Ala Leu
2975 2980 2985
Ala Thr Gly Glu Gly Asp Ala Arg Val Val Arg Gly Thr Ala Asp
2990 2995 3000
Thr Arg Gly Arg Val Val Phe Val Phe Pro Gly Gln Gly Ala Gln
3005 3010 3015
Trp Ala Gly Met Ala Ala Arg Leu Trp Glu Ser Ser Pro Glu Phe
3020 3025 3030
Ala Arg Trp Met Asp Arg Cys Asp Lys Ala Leu Gly Asp Leu Thr
3035 3040 3045
Asp Trp Ser Leu Ala Glu Val Ile His Gln Ala Asp Gly Ala Pro
3050 3055 3060
Gly Leu Asp Arg Val Asp Val Leu Gln Pro Ala Ser Trp Ala Val
3065 3070 3075
Ser Val Ser Leu Ala Ala Leu Trp Arg Ser Cys Gly Val Glu Pro
3080 3085 3090
Ala Ala Val Val Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys
3095 3100 3105
Val Ala Gly Ala Leu Ser Leu Glu Asp Gly Ala Met Leu Val Thr
3110 3115 3120
Leu Arg Ser Arg Leu Ile Arg Glu Glu Leu Ser Gly His Gly Gly
3125 3130 3135
Met Met Ser Val Ala Leu Ser Pro Ala Gly Thr Ala Asp Arg Ile
3140 3145 3150
Ala Cys Trp Glu Gly Arg Ile Cys Val Ala Ala His Asn Ser Arg
3155 3160 3165
Arg Ser Thr Val Val Ala Gly Glu Pro Ala Ala Leu Ala Glu Leu
3170 3175 3180
Leu Ala Ala Cys Glu Ala Asp Gly Ile Arg Ala Arg Arg Ile Pro
3185 3190 3195
Val Asp Tyr Ala Ser His Ser Pro Gln Val Glu Arg Ile Glu Arg
3200 3205 3210
Lys Leu Thr Glu Leu Ala Ala Gly Ile Val Ser Arg Ser Ser Glu
3215 3220 3225
Ile Pro Phe His Ser Thr Val Thr Gly Thr Arg Leu His Thr Thr
3230 3235 3240
Gly Leu Asp Ala Gly Tyr Trp Tyr Arg Asn Leu Arg Lys Pro Val
3245 3250 3255
Leu Phe Gly Pro Val Thr Glu Glu Leu Leu Thr Gln Gly His Asp
3260 3265 3270
Val Phe Leu Glu Met Ser Pro His Pro Val Leu Val Pro Ala Val
3275 3280 3285
Gln Glu Ala Ser Asp Ala Val Thr Ala Thr Ala Ala Ala Val Gly
3290 3295 3300
Ser Leu Arg Arg Gly Asp Gly Gly Pro Glu Arg Phe Leu Leu Ser
3305 3310 3315
Leu Ala Glu Ala Phe Val Arg Gly Ala His Val Asp Trp Ala Ala
3320 3325 3330
Val Leu Gly Gly Thr Gly Thr Arg Leu Val Glu Leu Pro Thr Tyr
3335 3340 3345
Pro Phe Gln Arg Thr Arg Phe Trp Pro Glu Pro Val Thr Pro Ala
3350 3355 3360
Thr Ala Thr Gly Gly Gln Asp Asp Ala Pro Leu Trp Gln Ala Val
3365 3370 3375
Glu Arg Gly Asp Val Ala Ala Val Ala Ala Glu Leu Ala Val Pro
3380 3385 3390
Asp Gly Arg Ser Leu Arg Asp Leu Val Pro Ala Leu Ser Gly Trp
3395 3400 3405
Arg Arg Arg Arg Arg Asp Ser Ala Thr Leu Asp Ile Trp Arg Tyr
3410 3415 3420
Arg Val Thr Trp Thr Gln Val Asn Leu Pro Val Ser Ala Ala Val
3425 3430 3435
Thr Gly Asp Trp Leu Leu Val Thr Asp Asp Pro Asp Thr Ala Val
3440 3445 3450
Pro Arg Trp Val Ser Ala Ala Leu Gly Glu Gly Leu Ala Thr Val
3455 3460 3465
Val Arg Pro Ala Asp Val Pro Ala Trp Ser Arg Thr Pro Gln Gly
3470 3475 3480
Thr Gly Trp Thr Gly Val Val Ser Leu Leu Gly Leu Thr Asp His
3485 3490 3495
Ser His Pro Cys His Pro Ala Leu Ser Thr Gly Val Ala Ala Thr
3500 3505 3510
Val Thr Leu Leu Thr Ala Leu Arg Glu Ala Gly Ile Glu Ala Pro
3515 3520 3525
Leu Trp Cys Leu Thr Ser Gly Ala Val Gly Thr Gly Gly Leu Asp
3530 3535 3540
Gln Val Thr Ala Pro Asn Gln Ala Gln Leu Trp Gly Leu Gly Arg
3545 3550 3555
Val Ala Gly Leu Glu Thr Pro Ala Thr Trp Gly Gly Leu Val Asp
3560 3565 3570
Leu Pro Ala Glu Pro Asp Glu Arg Thr Ala Ala Leu Leu Arg Ala
3575 3580 3585
Ala Leu Thr Ala Asp Gly Ile Glu Gln Glu Tyr Ala Leu Arg Pro
3590 3595 3600
Ser Gly Pro Tyr Val Arg Arg Leu Val Arg Ala Pro Leu Ala Gly
3605 3610 3615
Val Ala Ala Pro Arg Ser Trp Arg Pro Arg Pro Asp Gly Thr Val
3620 3625 3630
Val Val Thr Gly Gly Thr Gly Ala Leu Gly Ala Arg Val Ala Arg
3635 3640 3645
Trp Leu Ala Arg Ala Gly Ala Gly His Leu Leu Leu Thr Ser Arg
3650 3655 3660
Arg Gly Pro Ala Ala Asp Gly Ala Val Glu Leu Ser Glu Glu Leu
3665 3670 3675
Arg Ala Leu Gly Ala Glu Val Thr Ile Thr Ala Cys Asp Val Ala
3680 3685 3690
Asp Arg Ala Gln Leu Ala Asp Val Leu Ala Ala Val Pro Thr Ala
3695 3700 3705
Phe Pro Val Ser Ala Val Ile His Thr Ala Gly Val Ser Gly Asn
3710 3715 3720
Ala Pro Leu Ala Gly Thr Thr Leu Ala Glu Leu Ala Glu Val Val
3725 3730 3735
Ala Ala Lys Ala Ala Gly Ala Arg Asn Leu Asp Glu Leu Leu Ala
3740 3745 3750
Gly Gln Asp Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Ala
3755 3760 3765
Val Trp Gly Ser Ala Gly Gln Gly Gly Tyr Ala Ala Ala Asn Ala
3770 3775 3780
Tyr Ala Asp Ala Leu Ala Ala Asp Arg Arg Arg Arg Gly Leu Val
3785 3790 3795
Ala Thr Ser Val Ala Trp Gly Ser Trp Ala Gly Gly Gly Met Val
3800 3805 3810
Asp Asp Asp Leu Ala Arg Glu Leu Ala Arg Gly Gly Val Arg Ser
3815 3820 3825
Met Asp Pro Asp Arg Ala Ile Ala Ala Leu Gln Gln Ala Leu Asp
3830 3835 3840
His Asp Glu Thr Ala Leu Thr Val Ser Asp Met Asp Trp Ala Arg
3845 3850 3855
Phe Ala Glu Thr Phe Thr Ala Ala Arg Pro Arg Pro Leu Ile Asp
3860 3865 3870
Gly Ile Pro Glu Ala Ala Pro Ala Ser Ala Glu Pro Ala Gly Asp
3875 3880 3885
Ile Pro Gly Leu Ala Ala Arg Leu Ala Gln Leu Pro Asp Gly Glu
3890 3895 3900
Arg Asp Arg Glu Leu Leu Asp Leu Val Arg Asn Ala Ala Ala Leu
3905 3910 3915
Ala Leu Gly His Thr Gly Thr Glu Pro Ile Thr Pro Ser Lys Pro
3920 3925 3930
Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Asp Leu Arg
3935 3940 3945
Asn Arg Leu Thr Ala Ala Thr Gly Leu Arg Leu Pro Ala Thr Leu
3950 3955 3960
Val Phe Asp Tyr Pro Thr Pro Arg Ala Ala Ala Asp Ala Leu Arg
3965 3970 3975
Ala Val Leu Phe Ala Ala Asp Met Pro Val Asp Thr Ala Ala Pro
3980 3985 3990
Ala Arg Ser Ala Ser Ala Arg Pro Ala Asp Asp Asp Pro Val Val
3995 4000 4005
Val Val Ala Met Ala Cys Arg Tyr Pro Gly Gly Ala Thr Thr Pro
4010 4015 4020
Glu Lys Phe Trp Asp Leu Ile Ala Ala Gly Glu Asp Gly Ile Gly
4025 4030 4035
Gly Phe Pro Thr Asp Arg Gly Trp Glu Ile Gly Pro Gly Ala Ala
4040 4045 4050
Phe Ser Arg Thr Gly Gly Phe Leu Ala Asp Val Ala Gly Phe Asp
4055 4060 4065
Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp
4070 4075 4080
Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu
4085 4090 4095
Arg Ala Gly Val Asp Ala Leu Ser Leu Arg Gly Ser Arg Thr Gly
4100 4105 4110
Val Phe Val Gly Ala Ser Pro Ser Glu Tyr Gly Thr Leu Val Ala
4115 4120 4125
Ser Leu Glu Gly Gly Gln Asp Tyr Ala Leu Thr Gly Ala Val Gly
4130 4135 4140
Ser Val Leu Ser Gly Arg Val Ala Tyr Val Leu Gly Leu Glu Gly
4145 4150 4155
Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala
4160 4165 4170
Leu His Leu Ala Ala Gln Ala Leu Arg Gly Gly Glu Cys Asp Leu
4175 4180 4185
Ala Leu Ala Gly Gly Val Ala Val Met Ala Thr Pro Asn Ala Phe
4190 4195 4200
Asp Ala Phe Ala Arg Gln Gly Gly Leu Ala Arg Asp Gly Arg Cys
4205 4210 4215
Lys Ala Phe Ala Asp Gly Ala Asp Gly Thr Gly Trp Gly Glu Gly
4220 4225 4230
Val Gly Val Leu Val Leu Ser Arg Leu Ser Glu Ala Arg Arg Cys
4235 4240 4245
Gly Tyr Thr Val Leu Ala Val Val Ser Gly Ser Ala Val Asn Ser
4250 4255 4260
Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln
4265 4270 4275
Gln Arg Val Ile Arg Gln Ala Leu Ala Ser Ala Gly Leu Ser Pro
4280 4285 4290
Gly Asp Val Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Leu
4295 4300 4305
Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln
4310 4315 4320
Glu Arg Gly Ala Gly Arg Pro Leu Tyr Val Gly Ser Val Lys Ser
4325 4330 4335
Asn Ile Gly His Val Gln Ala Ala Ala Gly Val Ala Gly Val Ile
4340 4345 4350
Lys Ser Val Leu Ala Leu Arg Tyr Gly Val Leu Pro Arg Thr Leu
4355 4360 4365
His Val Asp Val Pro Ser Arg Glu Val Asp Trp Ser Ala Gly Ala
4370 4375 4380
Val Glu Leu Leu Thr Glu Ala Val Glu Trp Pro Ala Gly Gly Arg
4385 4390 4395
Pro Arg Arg Val Gly Val Ser Ala Phe Gly Ile Ser Gly Thr Asn
4400 4405 4410
Ala His Val Ile Leu Glu Glu Ala Pro Glu Gly Val Glu Glu Ser
4415 4420 4425
Ala Ala Gly Glu Val Ala Gly Val Val Pro Trp Val Val Ser Ala
4430 4435 4440
Arg Ser Glu Glu Gly Leu Arg Ala Gln Ala Ala Arg Leu Val Glu
4445 4450 4455
His Val Val Gly Gly Ser Gly Leu Gly Pro Val Asp Val Gly Trp
4460 4465 4470
Ser Leu Ala Arg Ser Arg Ala Val Leu Glu His Arg Ala Val Val
4475 4480 4485
Leu Gly Gly Asp Gly Glu Glu Leu Val Ala Gly Leu Arg Ala Leu
4490 4495 4500
Cys Asp Gly Val Leu Gly Pro Gly Val Val Arg Gly Val Ala Gly
4505 4510 4515
Asp Gly Gly Thr Ala Leu Leu Phe Thr Gly Gln Gly Ala Gln Arg
4520 4525 4530
Val Gly Met Gly Arg Glu Leu Tyr Glu Ala Phe Pro Val Phe Ala
4535 4540 4545
Ala Ala Phe Asp Ala Val Cys Ala Gly Phe Glu Gly Met Leu Pro
4550 4555 4560
Gly Ser Leu Arg Gly Val Val Phe Gly Asp Gly Gly Gly Val Val
4565 4570 4575
Asp Arg Thr Glu Trp Ala Gln Pro Ala Leu Phe Ala Leu Glu Val
4580 4585 4590
Ala Leu Phe Glu Leu Val Val Ser Trp Gly Val Arg Ala Asp Val
4595 4600 4605
Leu Val Gly His Ser Val Gly Glu Leu Val Ala Ala His Val Ala
4610 4615 4620
Gly Val Trp Ser Leu Ala Asp Ala Cys Arg Val Val Ala Ala Arg
4625 4630 4635
Gly Arg Leu Met Gln Ala Leu Pro Val Gly Gly Ala Met Val Ala
4640 4645 4650
Val Arg Val Gly Glu Gly Glu Leu Pro Val Leu Pro Glu Gly Val
4655 4660 4665
Ser Val Ala Ala Val Asn Gly Pro Arg Ser Leu Val Leu Ser Gly
4670 4675 4680
Asp Glu Gly Pro Val Leu Glu Leu Ala Ala Arg Leu Ala Gly Glu
4685 4690 4695
Gly Arg Asp Thr Arg Arg Leu Arg Val Ser His Ala Phe His Ser
4700 4705 4710
Ala Arg Met Glu Pro Met Leu Ala Glu Phe Ala Gln Val Leu Ala
4715 4720 4725
Ala Val Glu Phe Arg Ala Pro Arg Ile Pro Val Ile Ser Asn Val
4730 4735 4740
Thr Gly Glu Val Ala Gly Glu Glu Leu Thr Thr Pro Glu Tyr Trp
4745 4750 4755
Val Arg Gln Val Arg Glu Ala Val Arg Phe Ala Asp Gly Val Asn
4760 4765 4770
Thr Ala Leu Gly Arg Gly Val Asp Lys Phe Leu Glu Leu Gly Pro
4775 4780 4785
Ser Gly Pro Leu Thr Ala Met Ala Glu Glu Val Ile Glu His Thr
4790 4795 4800
Gly Thr Arg Ala Val Cys Val Pro Val Leu Arg Ala Gly Arg Pro
4805 4810 4815
Glu Asp Ala Thr Leu Leu His Ala Leu Ala Ala Val Phe Val Thr
4820 4825 4830
Gly Ala Thr Val Gly Trp Thr Ala Pro Leu Ala Gly Thr Gly Ala
4835 4840 4845
Arg Ala Val Asp Leu Pro Thr Tyr Ala Phe Gln His Lys Arg Tyr
4850 4855 4860
Trp Pro Gln Pro Ala Thr Val Gly Arg Asp Leu Ala Ala Ala Gly
4865 4870 4875
Leu Ala Glu Ala Gly His Pro Leu Leu Thr Ala Trp Leu Pro Ser
4880 4885 4890
Pro Glu Gly Glu Asp Val Leu Cys Thr Gly Arg Ile Ser Leu Ala
4895 4900 4905
Thr His Pro Trp Leu Ala Asp His Ala Val Leu Gly Thr Val Leu
4910 4915 4920
Val Pro Gly Thr Ala Phe Val Asp Leu Ala Cys Trp Ala Gly His
4925 4930 4935
Arg Val Gly Cys Gly Ala Leu Arg Glu Leu Thr Leu Ala Thr Pro
4940 4945 4950
Leu Ala Leu Ala Gln Asp Met Ala Val Arg Leu Arg Leu Val Leu
4955 4960 4965
Gly Ala Pro Asp Asp Thr Gly Cys Arg Pro Val Ala Leu Tyr Ser
4970 4975 4980
Gln Gln Glu Gly Ala Asp Glu Gly Thr Asp Gly Thr Gly Trp Thr
4985 4990 4995
Arg His Ala Glu Gly Leu Leu Ala Pro Gly Gly Asp Ala Ser Val
5000 5005 5010
Gln Pro Pro Thr Asp Phe Glu Thr Trp Pro Val Thr Gly Cys Glu
5015 5020 5025
Pro Ile Pro Leu Asp Gly Phe Tyr Glu Glu Leu Ala Asp Ala Gly
5030 5035 5040
Phe Ser Tyr Gly Pro Val Phe Arg Gly Leu Arg Ala Ala Trp Arg
5045 5050 5055
Arg Gly Gly Gln Val Phe Ala Glu Val Ser Leu Pro Ala Asp Glu
5060 5065 5070
Thr Gly Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu
5075 5080 5085
His Ala Leu Gly Pro Val Ser Arg Asp Thr Asp Glu Pro Gly Ser
5090 5095 5100
Ala Arg Leu Pro Phe Ser Trp Gly Glu Val Arg Val His Ala Ala
5105 5110 5115
Gly Ala Asp Arg Leu Arg Val Cys Leu Val Arg Ala Glu Asp Gly
5120 5125 5130
Thr Val Thr Leu His Gly Ala Asp Ala Ala Gly Arg Pro Val Val
5135 5140 5145
Thr Val Gly Ser Leu Val Leu Arg Pro Ile Ser Pro Glu Arg Leu
5150 5155 5160
His Gly Gly Ala Ala Ala Phe Asp Asp Ala Leu Phe Thr Thr Arg
5165 5170 5175
Trp Met Pro Leu Ser Val Ala Asp Gly Ile Ala Tyr Pro Thr Ala
5180 5185 5190
Asp Cys Val Leu Leu Gly Asp Pro Leu Glu Arg Ala Trp Arg His
5195 5200 5205
His Pro Asp Leu Asp Ser Phe Ala Glu Ala Leu Ala Ala Gly Lys
5210 5215 5220
Glu Lys Pro Gly Thr Val Leu Ala Arg Cys Pro Arg Asp Ile Ala
5225 5230 5235
Ala Gly Val Asp Pro Ala Glu Ala Ala Arg Arg Cys Ala Glu Trp
5240 5245 5250
Ala Leu Asp Leu Leu Lys Arg Trp Leu Asp Asp Asp Arg Leu Thr
5255 5260 5265
Asp Cys His Leu Val Ile Gly Thr Arg His Ala Val Thr Thr Gly
5270 5275 5280
Ala Glu Asp Gln Thr Ala Gly Arg Thr Asp Asp Pro Ala Val Leu
5285 5290 5295
Ala Gln Ser Thr Leu Leu Gly Leu Val Arg Ser Ala Gln Thr Glu
5300 5305 5310
Asn Pro Gly Arg Val Thr Leu Ala Asp Phe Asp Gly Thr Ala Pro
5315 5320 5325
Asp Pro Ala His Leu Ile Leu Ala Val Arg Gln Ala Glu Pro Glu
5330 5335 5340
Val Ala Val Arg Ala Gly Arg Leu Tyr Ala Arg Arg Leu Thr Arg
5345 5350 5355
Pro Asp Thr Gly Arg Ala Leu Ala Val Pro Pro Gly Ala Gly Ser
5360 5365 5370
Trp Arg Leu Glu Ser Thr Gly Arg Gly Thr Leu Asp Asn Leu Ala
5375 5380 5385
Leu Val Pro Cys Ala Gln Ala Glu Glu Pro Leu Gly Glu Gly Met
5390 5395 5400
Val Arg Ile Ala Val Arg Ala Ala Gly Val Asn Phe Arg Asp Val
5405 5410 5415
Leu Ile Val Leu Asp Met Tyr Pro Gly Arg Ala Asp Leu Gly Thr
5420 5425 5430
Glu Cys Ala Gly Val Val Val Glu Thr Gly His Gly Val Thr Gly
5435 5440 5445
Leu Val Pro Gly Asp Arg Val Met Gly Met Val Ala Gly Ala Phe
5450 5455 5460
Ala Pro Thr Ala Val Val Asp Gln Arg Phe Leu Val Arg Ile Pro
5465 5470 5475
Asp Gly Trp Ser Tyr Glu Thr Ala Ala Ala Ile Pro Val Ala Phe
5480 5485 5490
Leu Thr Ala Tyr Tyr Gly Leu Val Asp Leu Ala Gly Leu Ser Ala
5495 5500 5505
Gly Glu Ser Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met
5510 5515 5520
Ala Ala Val Gln Leu Ala Arg His Leu Gly Ala Glu Met Tyr Gly
5525 5530 5535
Thr Ala Ser Glu Pro Lys Trp Asp Thr Leu Leu Asp Ser Gly Leu
5540 5545 5550
Asp Arg Ala His Ile Ala Ser Ser Arg Thr Thr Val Phe Ala Asp
5555 5560 5565
Ser Val Met Glu Ala Thr Gly Gly Ala Gly Val Asp Val Val Leu
5570 5575 5580
Asn Ser Leu Ala Gly Glu Phe Val Asp Ala Ser Leu Arg Ala Leu
5585 5590 5595
Pro Arg Gly Gly Arg Phe Val Glu Met Gly Lys Thr Asp Leu Arg
5600 5605 5610
Asp Pro Glu Arg Val Ala Ala Glu His Pro Gly Val Arg Tyr Arg
5615 5620 5625
Pro Phe Asp Leu Gly Glu Ala Gly Ala Asp Arg Ile Ala Glu Val
5630 5635 5640
Leu Ala His Leu Ala Glu Leu Phe Ala Ser Gly Glu Leu Thr Pro
5645 5650 5655
Leu Pro Val Thr Val Trp Asp Ile Arg Asp Ala Pro Ala Ala Phe
5660 5665 5670
Arg Ala Leu Ser Gln Ala Ala Leu Thr Gly Lys Gly Val Leu Thr
5675 5680 5685
Val Pro Ala Pro Ser Phe Glu Ala Gly Glu Thr Val Leu Ile Thr
5690 5695 5700
Gly Gly Thr Gly Thr Leu Gly Thr Leu Leu Ala Arg His Leu Val
5705 5710 5715
Thr Glu His Gly Leu Arg His Val Ile Leu Ala Gly Arg Arg Gly
5720 5725 5730
Thr Glu Thr Ala Glu Val Arg His Leu Arg Gly Asp Val Ala Glu
5735 5740 5745
Leu Gly Ala Arg Ile Glu Val Val Ala Cys Asp Ala Gly Asp Glu
5750 5755 5760
Arg Ala Leu Arg Gln Val Leu Asp Ala Leu Thr Ala Glu His Arg
5765 5770 5775
Leu Ala Gly Val Val His Ala Ala Gly Val Thr Asp Asp Gly Val
5780 5785 5790
Val Ser Ala Leu Asp Arg Gly Arg Leu Ser Ala Val Leu His Pro
5795 5800 5805
Lys Val Arg Gly Ala Trp Asn Leu His Arg Leu Thr Ala Gly Ser
5810 5815 5820
Glu Leu Arg Met Phe Val Leu Phe Ser Ser Ala Ser Ala Thr Leu
5825 5830 5835
Gly Ala Ala Gly Gln Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu
5840 5845 5850
Asp Ala Leu Ala Glu His Arg His Ala Leu Gly Leu Pro Ala Thr
5855 5860 5865
Ser Leu Ala Trp Gly Leu Trp Glu Gln Ala Ser Gly Met Thr Gly
5870 5875 5880
Arg Leu Leu Asp Arg Asp Arg Gln Arg Met Ser Arg Ser Gly Ile
5885 5890 5895
Val Pro Leu Ser Ser Ala His Gly Leu Ala Leu Phe Asp Ala Ala
5900 5905 5910
Arg Alu Ala Gly Leu Pro Thr Leu Thr Pro Ala Arg Leu Asp Leu
5915 5920 5925
Ala Ala Leu Arg Val Arg Tyr Ala His Glu Gln Val Pro Ala Val
5930 5935 5940
Leu Arg Glu Leu Val Arg Val Arg Pro Ser Ala Ala Glu Asp Pro
5945 5950 5955
Thr Thr Ala Pro Asp Thr Thr Thr Ala Pro Gly Pro Ser Gly Ala
5960 5965 5970
Met Thr Leu Ala Asp Arg Leu Ala Gly Leu Ser Ala Pro Glu Arg
5975 5980 5985
Gln Arg His Val Leu Asp Leu Val Arg Arg His Thr Ala Ala Val
5990 5995 6000
Leu Gly His Gly Ser Ala Asp Asp Val Asp Pro Asp Gln Ala Phe
6005 6010 6015
Lys Ala Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn
6020 6025 6030
His Leu Arg Thr Ala Thr Ser Leu Ala Val Pro Ala Thr Leu Val
6035 6040 6045
Phe Asp His Pro Thr Pro Ala Ala Leu Ala Ala His Leu Leu Glu
6050 6055 6060
Leu Ala Ala Pro Pro Glu Arg Asp Pro Ala Leu Arg Val Met Gly
6065 6070 6075
Gly Leu Asp Arg Leu Glu Ala Asp Val Glu Ala Leu Ala Ser Gly
6080 6085 6090
Gly Ala Gly His Gln Glu Glu Val Ala Thr Arg Leu Arg Arg Val
6095 6100 6105
Leu Arg Arg Leu Glu Ser Gly Pro Gly Ala Ala His Ser Gly Thr
6110 6115 6120
Glu Glu Thr Ser Leu Asp Thr Ala Ser Ala Thr Glu Val Leu Ala
6125 6130 6135
Phe Ile Asp Ser Glu Phe Gly Asp Leu Ala
6140 6145
<210>3
<211>4799
<212>PRT
<213>nanchang streptomycete NS3226 (Streptomyces nanchangensis n.sp.NS3226)
<400>3
Val Val Ser Asp Asp Lys Leu Val Asp Tyr Leu Lys Arg Val Thr Ala
1 5 10 15
Asp Leu Lys Arg Thr Arg Gln Arg Val His Glu Leu Glu Ser Gly Ser
20 25 30
Ala Glu Pro Ile Ala Val Val Ala Met Gly Cys Arg Phe Pro Gly Gly
35 40 45
Ile Ser Ser Pro Glu Asp Leu Trp Glu Phe Val Arg Leu Gly Ser Asp
50 55 60
Ala Ile Ser Glu Phe Pro Thr Asp Arg Gly Trp His Thr Ser Arg Leu
65 70 75 80
Ser Gly Asn Phe Arg Arg Ala Gly Gly Phe Leu Tyr Asp Ala Gly Asp
85 90 95
Phe Asp Ala Gly Leu Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met
100 105 110
Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Ala Trp Glu Thr Leu Glu
115 120 125
Arg Ala Gly Val Asp Pro Thr Ser Val Arg Gly Ala Asp Gly Gly Val
130 135 140
Phe Ile Gly Met Ala Asp Gln Lys Tyr Gly Pro Arg Asp Asp Glu Leu
145 150 155 160
Leu Gly Glu Val Arg Gly Leu Val Leu Thr Gly Thr Thr Ser Ser Val
165 170 175
Ala Ser Gly Arg Ile Ala Tyr Ser Leu Gly Leu Gln Gly Pro Ala Ile
180 185 190
Thr Ile Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala
195 200 205
Val Arg Ser Leu Arg Ala Gly Glu Cys Pro Phe Ala Leu Val Gly Gly
210 215 220
Ala Ala Val Met Ala Glu Pro Thr Leu Phe Ala Glu Met Ala Glu Gln
225 230 235 240
Gly Gly Met Ala Gly Asp Gly Arg Cys Lys Ala Phe Ala Ala Ala Ala
245 250 255
Asp Gly Thr Gly Trp Gly Glu Gly Val Gly Val Leu Leu Leu Gln Pro
260 265 270
Leu Ser Thr Ala Arg Glu Gln Gly Leu Pro Val Leu Ala Thr Val Arg
275 280 285
Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro
290 295 300
Asn Gly Pro Ala Gln Cys Arg Val Ile Arg Lys Ala Leu Ala Asp Ala
305 310 315 320
Gln Leu Val Ala Gly Gln Ile Asp Ala Val Glu Ala His Gly Thr Gly
325 330 335
Thr Ala Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr
340 345 350
Gly Gln Asp Arg Pro Gly Asp Glu Pro Leu Trp Leu Gly Ser Val Lys
355 360 365
Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Met Ala Gly Val Ile
370 375 380
Lys Met Val Gln Ala Met Arg His Gly Leu Leu Pro Arg Thr Leu His
385 390 395 400
Val Asp Glu Pro Thr Pro Glu Ala Asp Trp Ser Ala Gly Asp Val Arg
405 410 415
Leu Leu Thr Glu Glu Arg Glu Trp Pro Asp Thr Gly Arg Pro Arg Arg
420 425 430
Ala Ala Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Val Val
435 440 445
Leu Glu Leu Pro Thr Gly Thr Val Gly Glu Pro Ala Asp Ala Ala Gly
450 455 460
Pro Val Pro Asp Pro Ser Ala Cys Ala Pro Ile Pro Trp Leu Leu Ser
465 470 475 480
Ala Ala Ser Ala Asp Ala Leu Arg Ala Gln Ala Arg Arg Leu His Arg
485 490 495
Phe Val Asp Thr Pro Gly Ala Pro Arg Pro Ile Asp Thr Ala Leu Ser
500 505 510
Leu Thr Val Thr Arg Ala Arg Leu Asp His Arg Ala Ile Val Phe Gly
515 520 525
Thr Asp Gln Ala Glu Leu Arg Ala Gly Leu Gly Ala Leu Ala Ala Gly
530 535 540
Glu Ser Thr Pro Arg Thr Val His Gly Arg Thr Val Pro Ser Ala Thr
545 550 555 560
Ile Ala Phe Leu Phe Thr Gly Gln Gly Ala Gln Arg Ala Gly Met Gly
565 570 575
Arg Ala Ala Tyr Ala Ala Phe Pro Glu Phe Ala Ala Ala Phe Asp Ala
580 585 590
Val Cys Ala Glu Leu Asp Gly Leu Leu Pro Arg Pro Leu Lys Ser Val
595 600 605
Leu Phe Ala Glu Pro Asn Ser Ala Asp Ala Ala Leu Val Asp Gln Thr
610 615 620
Leu Tyr Ala Gln Thr Gly Leu Phe Ala Phe Glu Val Ala Leu Phe Arg
625 630 635 640
Leu Leu Glu Glu Trp Gly Val Arg Pro Gly Val Leu Leu Gly His Ser
645 650 655
Val Gly Glu Leu Ala Ala Ala His Val Ala Gly Val Trp Ser Leu Pro
660 665 670
Asp Ala Cys Arg Val Val Ala Ala Arg Ala Arg Leu Met Gln Ala Leu
675 680 685
Pro Glu Asp Gly Ala Met Leu Ser Val Ala Ala Ser Glu Lys His Ile
690 695 700
Ala Glu Leu Leu Gly Asp Leu Ala Asp Val Asp Val Ala Ala Val Asn
705 710 715 720
Gly Pro Ala Val Thr Val Leu Ser Gly Pro Thr Gly Ala Val Ala Asp
725 730 735
Val Gly Glu Arg Leu Ala Gly Ala Gly Leu Arg Thr Lys His Leu Arg
740 745 750
Val Ser His Ala Phe His Ser Ala Leu Met Glu Pro Met Leu Ala Glu
755 760 765
Phe Ala Arg Glu Ile Ala Asp Val Thr Phe Gln Gln Pro Glu Leu Pro
770 775 780
Ile Ile Ser Asn Leu Thr Gly Gln Gln Ala Asp Ala Ala Glu Leu Gly
785 790 795 800
Ser Ala Ala Tyr Trp Val Arg Gln Val Arg Gly Thr Val Arg Phe Ala
805 810 815
Asp Gly Val Gly Arg Leu Ala Ala His Gly Val Thr Ala Cys Leu Glu
820 825 830
Leu Gly Pro Asp Gly Val Leu Thr Ala Leu Ala Arg Asp Cys Leu Thr
835 840 845
Ala Ala Ala Asp Val Ala Leu Val Pro Ala Leu Arg Arg Asp Gln Asp
850 855 860
Glu Pro Ala Ala Leu Leu Ala Ala Leu Ala Glu Leu His Val Arg Gly
865 870 875 880
Val Glu Val Asp Trp Ala Ala Met Leu Thr Ala Arg Gly Gly Arg Arg
885 890 895
Ala Ala Leu Pro Thr Tyr Ala Phe Gln Arg Glu Arg Tyr Trp Leu Pro
900 905 910
Ala Thr Pro Ser Val Ala Ser Ala Val Ser Ala Pro Ala Glu Gln Ala
915 920 925
Asp Arg Leu Leu Tyr Arg Val Gly Trp Ser Pro Val Thr Gly Phe Asp
930 935 940
Thr Glu Ala Arg Pro Glu Gly Thr Trp Leu Val Val Ala Ser Pro Asp
945 950 955 960
Asp Glu Gly Arg Arg Val Ala Gln Ala Leu Gly Pro His Thr Val Leu
965 970 975
Val Ala His Asp Pro Asp Asp Pro Ser Gly Ser Val Ala Arg Leu Arg
980 985 990
Gly Ala Leu Pro Ala Asp Arg Pro Val Thr Gly Val Leu Ala Leu Pro
995 1000 1005
Glu Gln Thr Gly Ala Ala Ala Val Ala Ala Gln Leu Ala Leu Arg
1010 1015 1020
Glu Ala Leu Arg Asp Ala Glu Val Arg Ala Pro Leu Trp Cys Ala
1025 1030 1035
Thr Arg Ala Ala Val Ser Val Gly Gly Glu Ala Thr Pro Gly Ala
1040 1045 1050
Ala Gln Ala Pro Leu Trp Gly Leu Asn Arg Ala Leu Glu Thr Cys
1055 1060 1065
Gly Gly Met Val Asp Leu Pro Gln Arg Leu Asp Ser Arg Ser Leu
1070 1075 1080
Gly Leu Leu Ala Ala Ala Leu Thr Asn Pro Ala Asp Ala Asp Glu
1085 1090 1095
Leu Ala Val Arg Thr Gly Gly Leu Phe Ala Arg Arg Leu His Ala
1100 1105 1110
Val Gln Pro Val Pro Arg Ala Pro Arg Pro Trp Arg Ala Asp Gly
1115 1120 1125
Thr Val Leu Val Thr Gly Asp Val Glu Ser Ala Thr Asp Asp Leu
1130 1135 1140
Leu Arg Arg Leu Ser Gly Asp Gly Glu Arg Pro Val Val Leu Ala
1145 1150 1155
Arg Arg Pro Gly Thr Ala Leu Gln Asn Gly Ala Ala Gly Asp Gly
1160 1165 1170
Ser Cys Thr Val Val Glu Trp Asp Pro Ala Ala Gly Ala Pro Glu
1175 1180 1185
Thr Pro Ser Pro Val Thr Ala Val Val His Leu Asp Asn Ile Gln
1190 1195 1200
Pro Ser Ala Pro Arg Asp Asp Ala Asp Pro Leu Ala Leu Ala Ala
1205 1210 1215
Ala Val Ala Glu Arg Leu His Thr Val Asp Arg Leu Thr Glu Leu
1220 1225 1230
Phe Gly Asn Gln Asp Leu Asp Ala Phe Val Leu Leu Ser Ser Val
1235 1240 1245
Ala Gly Ile Trp Gly Gly Ala Glu Asp Val Val His Thr Val Val
1250 1255 1260
His Ala Ala Leu Glu Ser Ala Ala Glu Arg Arg Ala Ala Ala Gly
1265 1270 1275
Leu Arg Gly Ala Cys Val Gly Trp Gly Pro Trp Ala Gly Ala Gly
1280 1285 1290
Asp Gly Pro Asp Val Pro Gly Leu Val Pro Met Arg Pro Glu Pro
1295 1300 1305
Ala Leu Ala Ala Leu Trp His Ala Leu Asp Asp Asp Ala Ala Val
1310 1315 1320
Phe Ala Val Ala Asp Val Asp Trp Pro Arg Phe His Pro Val Leu
1325 1330 1335
Thr Ser Arg Arg Pro Arg Pro Val Val Ser Gly Leu Pro Glu Val
1340 1345 1350
Arg Ala Leu Arg Pro Ala Pro Ser Ala Ala Pro Ala Val Gly Met
1355 1360 1365
Asp Val Thr Asp Leu Glu His Arg Leu Arg Asp Leu Val Leu Thr
1370 1375 1380
Glu Ala Ala Thr Ala Leu Gly His Ala Phe Arg Asp Ser Met Asp
1385 1390 1395
Pro Leu Arg Pro Phe Arg Asp Ala Gly Phe Glu Ser Leu Thr Ala
1400 1405 1410
Val Arg Phe Arg Asp Arg Ile Ala Ser Glu Thr Gly Leu Asn Leu
1415 1420 1425
Ser Ala Thr Leu Val Phe Asp His Pro Thr Pro Glu Ala Val Val
1430 1435 1440
Ala His Leu Leu Ala Glu Leu Thr Gly Gly Arg Pro Asp Glu Ala
1445 1450 1455
Glu Gln Val Ser Thr Arg Ser His Asp Asp Pro Val Val Ile Ile
1460 1465 1470
Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Ser Asp Pro Glu Gly
1475 1480 1485
Leu Trp Glu Leu Val His Ser Gly Arg Glu Gly Ile Gly Asp Phe
1490 1495 1500
Pro Thr Asp Arg Gly Trp Asp Leu Ala Ala Leu Arg Arg Ala Val
1505 1510 1515
Pro His Leu Ala Leu Arg Ala Gly Phe Leu Pro Asp Ala Ala Ala
1520 1525 1530
Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala
1535 1540 1545
Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Ala Ser Trp Glu Ala
1550 1555 1560
Val Glu Thr Ala Gly Ile Asp Pro Ala Ser Leu Arg Gly Ser Arg
1565 1570 1575
Thr Gly Val Phe Ala Gly Val Ala Gly Ser Asp Tyr Gly Ala Ala
1580 1585 1590
Leu Ala Gly Ser Arg Glu Ala Glu Gly Tyr Leu Met Thr Gly Thr
1595 1600 1605
Ala Thr Ser Val Val Ser Gly Arg Ile Ala Tyr Val Phe Gly Leu
1610 1615 1620
Gln Gly Pro Ala Leu Thr Ile Asp Thr Ala Cys Ser Ser Ser Leu
1625 1630 1635
Val Ala Leu His Thr Ala Val Gly Ala Leu Arg Lys Gly Glu Cys
1640 1645 1650
Asp Leu Ala Phe Ala Thr Gly Val Ala Val Ile Ser Thr Pro Asp
1655 1660 1665
Ala Phe Val Asp Phe Ala Lys Gln Asp Gly Leu Ala Ala Asp Gly
1670 1675 1680
Arg Cys Lys Ala Phe Ala Val Gly Ala Asp Gly Thr Asn Trp Ala
1685 1690 1695
Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg
1700 1705 1710
Arg Asn Gly His Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val
1715 1720 1725
Asn Ser Asp Gly Ala Ser Asn Gly Leu Ala Ala Pro Asn Gly Gly
1730 1735 1740
Ala Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asp Ala Gly Leu
1745 1750 1755
Thr Ala Pro Asp Val Asp Ala Leu Glu Ala His Gly Thr Gly Thr
1760 1765 1770
Ala Leu Gly Asp Pro Ile Glu Ala Gln Ala Val Leu Ala Thr Tyr
1775 1780 1785
Gly Gln Gly Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly Ser Leu
1790 1795 1800
Lys Ser Asn Ile Gly His Ser Ala Ala Ala Ala Gly Val Gly Gly
1805 1810 1815
Val Ile Lys Met Val Glu Ala Met Arg His Gly Val Leu Pro Pro
1820 1825 1830
Thr Leu His Ala Asp Glu Pro Thr His Glu Val Asp Trp Ser Val
1835 1840 1845
Gly Ala Val Glu Leu Leu Thr Thr Ala Arg Asp Trp Pro Glu Thr
1850 1855 1860
Gly Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val Ser Gly
1865 1870 1875
Thr Asn Ala His Val Ile Leu Glu Gln Gly Pro Asp Leu Ala Pro
1880 1885 1890
Gly Gly Val Pro Gly Val Gln Glu Asp Pro Ala Pro Arg Ala Ala
1895 1900 1905
Gly Gly Cys Ala Gly Asn Ala Val Pro Trp Leu Leu Ser Gly Arg
1910 1915 1920
Ser Ala Arg Ala Leu Arg Asp Gln Ala Ala Arg Leu Ala Gly His
1925 1930 1935
Leu Thr Arg Gly Asp Pro Ser Ala Glu Ala Ile Gly His Ala Leu
1940 1945 1950
Leu Thr Ser Arg Thr Ala Phe Glu His Arg Ala Val Val Leu Gly
1955 1960 1965
Gly Gly Thr Val Asp Leu Val Glu Gly Leu Asp Ala Leu Ala Ala
1970 1975 1980
Gly Glu Pro Ala Pro Ser Val Val Ala Gly Ala Pro Arg Pro Thr
1985 1990 1995
Gly Arg Gly Pro Val Phe Val Phe Pro Gly Gln Gly Gly Gln Trp
2000 2005 2010
Ser Gly Met Ala Ser Glu Leu Leu Asp Thr Cys Pro Ala Phe Ala
2015 2020 2025
Ala Arg Trp Ala Glu Cys Glu Arg Ala Phe Ala Pro His Met Asp
2030 2035 2040
Val Ser Leu Thr Glu Ala Val Arg Asp Ala Ala Ala Leu Glu Arg
2045 2050 2055
Val Asp Val Val Gln Pro Val Leu Phe Ala Val Met Val Ser Leu
2060 2065 2070
Val Glu Val Trp Arg Ser Tyr Gly Val Arg Pro Ala Ala Val Ile
2075 2080 2085
Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys Val Ala Gly Ala
2090 2095 2100
Leu Ser Leu Asp Asp Ala Ala Arg Val Val Ala Leu Arg Ala Arg
2105 2110 2115
Ala Leu Gly Val Leu Ala Gly Ala Gly Gly Met Val Ser Val Ala
2120 2125 2130
Leu Pro Pro Ala Glu Thr Glu Gly Trp Leu Arg Arg Trp Glu Asp
2135 2140 2145
Arg Ile Ser Val Ala Ala Val Asn Gly Pro Ser Ser Val Val Val
2150 2155 2160
Ser Gly Glu Pro Ala Ala Leu Glu Glu Leu Val Glu Gln Ala Arg
2165 2170 2175
Thr Arg Asp Val Arg Val Arg Arg Ile Glu Val Asp Tyr Ala Ser
2180 2185 2190
His Ser Ala Gln Val Ala Arg Ile Glu Asp Glu Val Leu Arg Leu
2195 2200 2205
Leu Glu Pro Ile Arg Pro Arg Thr Ser Glu Val Pro Phe Phe Ser
2210 2215 2220
Thr Val Ser Thr Gln Trp Gln Asp Thr Thr Ala Met Asp Ala Ala
2225 2230 2235
Tyr Trp Tyr Arg Asn Leu Arg Asp Pro Val Leu Phe Ala Pro Ser
2240 2245 2250
Val Gly Ala Leu Val Asp Gln Gly His Thr Val Phe Val Glu Val
2255 2260 2265
Ser Pro His Pro Val Leu Thr Ser Gly Leu Leu Glu Thr Ala Glu
2270 2275 2280
Arg Ala Asp Val Asp Leu Thr Val Thr Gly Thr Leu Arg Arg Gly
2285 2290 2295
Glu Gly Gly Leu Ala Arg Met Arg Ala Ser Leu Ala Glu Leu Trp
2300 2305 2310
Val His Gly Thr Pro Val Asp Trp Ser Ala Ala Phe Asp Pro Ala
2315 2320 2325
Pro Ala Gly Pro Val Pro Leu Pro Thr Tyr Ala Phe Gln Arg Asp
2330 2335 2340
Arg Tyr Trp Pro Asp Pro Arg Pro Ala Ser Ala Asp Pro Val Tyr
2345 2350 2355
Glu Thr Phe Trp Arg Ala Val Asp Glu Ala Asp Leu Pro Ala Leu
2360 2365 2370
Thr Gly Thr Leu Gly Val Thr Asp Asp Gln Pro Leu Arg Glu Val
2375 2380 2385
Leu Pro Ala Leu Ser Ala Trp Arg Arg Ser Arg Thr Glu Gln Ala
2390 2395 2400
Val Thr Asp Ser Trp Arg Tyr Arg Val Cys Trp Lys Arg Leu Pro
2405 2410 2415
Asp Ala Ala Pro Ala Glu Leu Pro Gly Thr Trp Leu Leu Val Thr
2420 2425 2430
Thr Glu Gly Ala Ala Gly Asp Pro Ser Ala Ala Ala Ala Leu Gln
2435 2440 2445
Ala Val Arg Asp Ala Ala Gly His Thr Val Thr Leu Ala Val Asp
2450 2455 2460
Ser Asp Asp Glu Pro Ala Ser Leu Ala Ala Ala Leu Arg Glu Thr
2465 2470 2475
Leu Arg Gly Thr His Pro Ala Gly Val Val Thr Leu Thr Gly Thr
2480 2485 2490
Asp Val Ser Pro His Pro Val Ser Pro Val Val Pro Val Gly Thr
2495 2500 2505
Ala Leu Thr Val Thr Leu Leu Gln Ala Leu Asp Ala Ala Asp Val
2510 2515 2520
Asp Ala Pro Leu Trp Cys Leu Thr Arg Gly Ala Val Ala Thr Asp
2525 2530 2535
Asp Asp Thr Ala Gly Pro Gly Ser Pro Leu Gln Ser Ala Leu Trp
2540 2545 2550
Ala Leu Gly Arg Ile Ala Ala Val Glu Ser Pro Gly Asn Trp Gly
2555 2560 2565
Gly Leu Val Asp Leu Pro Asp Thr Phe Asp Asp Ser Ala Ala Arg
2570 2575 2580
Arg Leu Val Ser Val Leu Ala Ser Leu Asp Gly Glu Asp Gln Val
2585 2590 2595
Ala Leu Arg Val Ser Gly Ala Tyr Gly Arg Arg Leu Met Arg Ala
2600 2605 2610
Asn Pro Thr Ala Ser Pro Gly Ser Gly Trp Arg Pro Arg Gly Thr
2615 2620 2625
Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly Arg Val Ala
2630 2635 2640
Arg Trp Leu Ala Arg Asp Gly Ala Glu His Ile Val Leu Ala Ser
2645 2650 2655
Arg Arg Gly Ser Gln Ala Pro Gly Val Asp Asp Leu Val Ala Glu
2660 2665 2670
Leu Ser Gly Leu Gly Ala Gln Val Thr Val Asp Ser Cys Asp Leu
2675 2680 2685
Ser Val Ala Ser Glu Ala Phe Ala Leu Val Asp Arg Ile Gln Arg
2690 2695 2700
Asp Gly Asp Arg Ile Gly Ala Val Ile His Thr Ala Gly Ala Gly
2705 2710 2715
Gly Leu Gly Pro Leu Val Asp Ala Gly Leu Asp Asp Met Glu Leu
2720 2725 2730
Ala Met Ala Gly Lys Val Ala Gly Ile Asp Asn Leu Glu Arg Ala
2735 2740 2745
Leu Asp Asp Gln Gln Leu Asp Ala Val Val Tyr Phe Ser Ser Ile
2750 2755 2760
Ser Ala Ser Trp Gly Ala Gly Asp His Gly Ile Tyr Ala Ala Ala
2765 2770 2775
Asn Ala Val Leu Asp Ala Arg Ala Glu Ala Arg Arg Ala Ala Gly
2780 2785 2790
Val His Thr Val Ser Val Ala Trp Ala Pro Trp Gly Gly Gly Gly
2795 2800 2805
Met Ile Asp Asp Pro Ala Val Ala Asp Thr Leu Asn Arg Met Gly
2810 2815 2820
Leu Pro Leu Val Asp Pro Asp Leu Ala Ile Ser Gly Leu Ala Thr
2825 2830 2835
Ile Leu Ala Glu Gly Glu Glu Ser Leu Leu Leu Val Asp Val Asp
2840 2845 2850
Trp Gly Arg Phe Ile Pro Gln Phe Thr Leu Arg Arg Pro Ser Arg
2855 2860 2865
Leu Phe Asp Glu Leu Pro Glu Ala Arg Ala Ala Glu Ala Asp Thr
2870 2875 2880
Gly Pro Ala Lys Ala Asp Ala Pro Ser Pro Leu Ala Gly Arg Leu
2885 2890 2895
Ala Gly Leu Ser Lys Ala Lys Arg Ala Thr Ala Leu Arg Asp Leu
2900 2905 2910
Val Arg Glu His Val Ala Ala Val Leu Gly His Asn Asp Pro Ala
2915 2920 2925
Ala Val Asp Ala Gly Arg Ala Leu Lys Asp Leu Gly Phe Asp Ser
2930 2935 2940
Leu Thr Ala Val Glu Leu Arg Asp Arg Leu Ser Thr Val Ala Ala
2945 2950 2955
Met Arg Leu Pro Ala Thr Leu Val Phe Asp His Pro Thr Ile Ala
2960 2965 2970
Glu Leu Ala Asp Phe Leu Ala Arg Gly Leu Glu Pro Glu Thr Ala
2975 2980 2985
Arg Pro Thr Ala Ala Pro Ala Thr Val Val Arg Val Asp Gln Asp
2990 2995 3000
Glu Pro Val Ala Ile Val Ala Met Ala Cys Arg Tyr Pro Gly Asp
3005 3010 3015
Ile Ala Ser Ala Glu Glu Leu Trp Arg Ala Val Arg Asp Glu Lys
3020 3025 3030
Asp Leu Ile Ser Pro Phe Pro Ile Asn Arg Gly Trp Pro Val Asp
3035 3040 3045
Arg Leu Leu Asp Ala Asp Pro Asp Arg Pro Gly Thr Ser Tyr Val
3050 3055 3060
Asp His Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Pro Gly
3065 3070 3075
Phe Phe Gly Ile Ser Pro Arg Glu Ala Gln Ala Met Asp Pro Gln
3080 3085 3090
Gln Arg Leu Leu Leu Glu Ser Ser Trp Glu Val Leu Glu Arg Ala
3095 3100 3105
Gly Met Val Pro Lys Ser Leu Arg Gly Ser Arg Thr Gly Val Tyr
3110 3115 3120
Val Gly Leu Thr Asp Gln Ala Tyr Gly Thr Arg Leu Arg Gly Ser
3125 3130 3135
Leu Asp Gly Met Glu Gly Phe Leu Val Ser Ala Ser Ser Asn Val
3140 3145 3150
Ala Ser Gly Arg Ile Ser Tyr Ser Leu Gly Leu Gln Gly Pro Ala
3155 3160 3165
Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His
3170 3175 3180
Leu Ala Thr Gln Ala Leu Arg Asn Gly Glu Cys Asp Leu Ala Ile
3185 3190 3195
Ala Gly Ala Ala Thr Val Met Pro Asp Pro Thr Ser Phe Met Ala
3200 3205 3210
Phe Ser Arg Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Pro
3215 3220 3225
Phe Ala Ala Ala Ala Asp Gly Phe Ser Leu Gly Glu Gly Val Gly
3230 3235 3240
Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His
3245 3250 3255
Pro Val Leu Ala Leu Ile Arg Gly Ser Ala Val Asn Gln Asp Gly
3260 3265 3270
Ala Ser Asn Gly Ile Thr Ala Pro Asn Gly Pro Ser Gln Glu Arg
3275 3280 3285
Val Ile Arg Gln Ala Leu Val Asn Ala Ala Leu Pro Ala Ser Ala
3290 3295 3300
Val Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp
3305 3310 3315
Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asp Arg
3320 3325 3330
Pro Ala Asp Arg Pro Leu Arg Leu Gly Ser Val Lys Ser Asn Phe
3335 3340 3345
Gly His Thr Gln Ala Ala Ala Gly Met Ala Gly Val Ile Lys Met
3350 3355 3360
Val Gln Ala Met Arg His Glu Leu Met Pro Arg Thr Leu His Val
3365 3370 3375
Asp Ala Pro Ser Pro His Val Asp Trp Ser Ser Gly Ala Val Glu
3380 3385 3390
Leu Leu Ala Glu Ala Arg Pro Trp Pro Arg Gly Asp Glu Pro Arg
3395 3400 3405
Arg Ala Gly Val Ser Ala Phe Gly Ile Ser Gly Thr Asn Ala His
3410 3415 3420
Val Val Leu Glu Glu Ala Ser Gln Glu Pro Thr Pro Asp Gly Ser
3425 3430 3435
Ala Gly Ala Pro Asp Thr Pro Asp Thr Pro Asp Ala Pro Val Glu
3440 3445 3450
Ala Asp Thr Gly Arg Pro Leu Pro Leu Val Val Ser Ala Arg Thr
3455 3460 3465
Pro Asp Ala Leu Arg Asp Gln Ala Ala Arg Leu Thr Ala Leu Leu
3470 3475 3480
Asp Arg Glu Glu His Pro Val Ser Asp Leu Ala Tyr Ser Leu Ala
3485 3490 3495
Thr Ala Arg Gly Val Leu Asp Arg Ala Ala Val Val Val Ala Ala
3500 3505 3510
Asp Pro Asp Glu Leu Arg Arg Asn Leu Ala Asp Leu Thr Thr Arg
3515 3520 3525
Ala Val Ala Glu Arg Arg Ala Glu Gly Gly Leu Ala Phe Leu Phe
3530 3535 3540
Thr Gly Gln Gly Ala Gln Arg Ala Gly Met Gly Arg Ser Leu Tyr
3545 3550 3555
Asp Ala Phe Pro Glu Phe Ala Ala Ala Phe Asp Glu Val Cys Ala
3560 3565 3570
Glu Leu Asp Arg His Leu Pro Arg Pro Leu Arg Thr Val Val Trp
3575 3580 3585
Ala Glu Pro Gly Thr Asp Glu Ala Ala Leu Leu Asp Gln Thr Leu
3590 3595 3600
Tyr Thr Gln Thr Gly Leu Phe Ala Val Glu Val Ala Leu Phe Arg
3605 3610 3615
Leu Leu Glu His Trp Gly Val Arg Pro Asp Ala Leu Leu Gly His
3620 3625 3630
Ser Val Gly Glu Leu Ala Ala Ala His Leu Ala Gly Val Trp Ser
3635 3640 3645
Thr Glu Asp Ala Ala Arg Val Val Ala Ala Arg Ala Arg Leu Met
3650 3655 3660
Gln Glu Leu Pro Glu Gly Gly Ala Met Leu Ser Val Ala Ala Ala
3665 3670 3675
Gly Asp Glu Val Ser Ala Val Leu Gly Asp Ala Ser Ala Glu Val
3680 3685 3690
Ala Val Ala Ala Val Asn Gly Pro Ala Ser Leu Val Leu Ser Gly
3695 3700 3705
Thr Glu Glu Ser Val Thr Ala Ala Gly Ala Arg Leu Ala Glu Ala
3710 3715 3720
Gly Leu Arg Thr Lys Arg Leu Thr Val Ser His Ala Phe His Ser
3725 3730 3735
Ser Leu Met Glu Pro Met Leu Ala Ala Tyr Glu His Glu Leu Ala
3740 3745 3750
Gln Val Ala Phe Ala Glu Pro Ala Leu Pro Val Val Ser Asn Leu
3755 3760 3765
Thr Gly Glu Val Ala Gly Ala Glu Leu Cys Glu Pro Ala Tyr Trp
3770 3775 3780
Val Arg Gln Val Arg Gln Ala Val Arg Phe Ala Asp Gly Val Arg
3785 3790 3795
Thr Val Leu Asp Glu Gly Val Thr Thr Leu Leu Glu Leu Gly Pro
3800 3805 3810
Asp Gly Val Leu Thr Ala Met Ala Gln Glu Ser Ala Gly Glu Arg
3815 3820 3825
Ala Thr Gly Ile Ala Ala Gln Arg Arg Asp Arg Asp Gln Val Arg
3830 3835 3840
Thr Leu Leu Thr Ala Leu Gly Arg Leu His Val Arg Thr Glu Arg
3845 3850 3855
Val Asp Trp Ala Ala Phe Phe Arg Gly Thr Gly Ala Arg Arg Val
3860 3865 3870
Asp Leu Pro Thr Tyr Ala Phe Gln Arg Arg Arg Tyr Trp Leu Asp
3875 3880 3885
Thr Ser Ser Gly Gly Ala Glu Ala Leu Ala Gly Ala Gly Leu Ala
3890 3895 3900
Gly Thr Gly His Pro Leu Leu Thr Ala Ser Ala Thr Leu Pro Gly
3905 3910 3915
Thr Gly Glu Ser Leu Phe Ser Gly Ser Leu Pro Gly Ala Pro Asp
3920 3925 3930
Gly Arg Pro Leu Ser Gly Gly Glu Ile Leu Glu Leu Val Leu Trp
3935 3940 3945
Ala Gly Gly Asn Phe Gly Cys His Arg Ile Ala Gly Leu Asp Val
3950 3955 3960
Ala Gly Ser Val Pro His Ala Pro Gln Ala Pro Leu Gln Leu Val
3965 3970 3975
Val Ala Ala Pro Asp Glu Ser Gly Asn Arg Ala Phe Thr Leu His
3980 3985 3990
Leu Gly Pro Val Gly Gly Pro His Gly Pro Val Glu Gly Pro Trp
3995 4000 4005
Thr Arg Ile Ala His Gly Val Leu Gly Gly Thr Pro Thr Pro Leu
4010 4015 4020
Pro Pro Glu Pro Gly Thr Ala Ala Trp Pro Pro Ala Asp Ala Glu
4025 4030 4035
Pro Val Gly Ala Asp Leu Val Trp Arg Arg Glu Asp Glu Leu Phe
4040 4045 4050
Ala Glu Leu Glu Leu Ala Glu Arg Asn Ala Ala Asp Val Asp Arg
4055 4060 4065
Phe Ala Leu His Pro Gly Leu Leu Ala Glu Val Met Glu Leu Ile
4070 4075 4080
Ala Gly Leu Ala Gly Glu Pro Val His Phe Thr Gly Val Thr Arg
4085 4090 4095
Tyr Ala Thr Gly Ala Thr Val Leu Arg Val His Leu Thr Arg Val
4100 4105 4110
Ala Pro Asp Thr Val Thr Ala Leu Leu Thr Asp Ala Glu Gly Glu
4115 4120 4125
Pro Val Leu Ser Val Asp Arg Val Gln Val Arg Ala Asp Gly Ala
4130 4135 4140
Ala Ala Val Arg Ser Ala Thr Ala Ala Ala Pro Asp Ala Leu Tyr
4145 4150 4155
Glu Leu Thr Trp Thr Pro Val Gly Ala Glu Ala Leu Pro Pro Asp
4160 4165 4170
Thr Gly Trp Ala Val Val Gly Val Pro Ala Gly Asp Leu Ala Lys
4175 4180 4185
Val Leu Glu Ala Gln Gly Ala Glu Val Ala Thr His Pro Asp Leu
4190 4195 4200
Ala Ser Leu Gly Ser Thr Ala Asp Arg Gly Asp Met Pro Gly Leu
4205 4210 4215
Val Val Leu Ser Val Glu Thr Ala Pro Gly Ala Pro Leu Glu Ser
4220 4225 4230
Ala Arg Leu Thr Val His His Thr Leu Arg Leu Val Gly Glu Leu
4235 4240 4245
Leu Ala Asp Thr Gln Leu Thr Gly Thr Arg Phe Ala Phe Val Thr
4250 4255 4260
Arg Ala Ser Val Ser Thr Gly Asp Gly Ala Ala Val Asp Pro Ala
4265 4270 4275
Gln Ala Ala Val Arg Gly Leu Leu Leu Ser Ala Gln Ala Glu His
4280 4285 4290
Pro Asp Arg Phe Val Val Val Asp Leu Gly Gly Arg Glu Glu Asp
4295 4300 4305
Ala Asp Leu Leu Thr Ala Ala Val Gly Thr Ser Leu Ala Ala Ala
4310 4315 4320
Glu Pro His Leu Ala Ile Arg Asp Gly Arg Leu Leu Val Pro Arg
4325 4330 4335
Leu Ala Arg Val Thr Glu Pro Pro Gln Ala Phe Ala Ala Gly Pro
4340 4345 4350
Glu Glu His Gly Thr Val Leu Val Thr Gly Ala Thr Gly Gly Ile
4355 4360 4365
Gly Thr Lys Ile Val Pro His Leu Val Ala Glu His Gly Val Arg
4370 4375 4380
Arg Leu Leu Leu Leu Ser Arg Lys Gly Pro Asp Asp Pro Arg Ala
4385 4390 4395
Ala Glu Leu Gly Arg Glu Leu Ala Ala Tyr Gly Ala Glu Ala Thr
4400 4405 4410
Phe Thr Ala Cys Asp Ile Ala Asp Arg Ala Ala Leu Glu Ala Val
4415 4420 4425
Leu Ala Glu Val Pro Ala Glu His Pro Val Thr Ala Val Val His
4430 4435 4440
Ile Ala Gly Val Val Asp Asp Gly Val Leu Thr Thr Leu Ser Pro
4445 4450 4455
Glu Arg Val Asp Thr Val Leu Arg Pro Lys Ala Glu Ala Ala Gln
4460 4465 4470
His Leu His Glu Leu Thr Ala Gly Leu Glu Leu Ser His Phe Val
4475 4480 4485
Leu Phe Ser Ser Gly Val Gly Val Leu Gly Gly Ala Gly Gln Ala
4490 4495 4500
Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Gln Thr
4505 4510 4515
Arg Gln Ala Ala Gly Leu Pro Ala Ser Ser Leu Ala Trp Gly Leu
4520 4525 4530
Trp Glu Thr Asp Met Gly Met Ser Ala Arg Leu Ser Glu Val Asp
4535 4540 4545
Arg Arg Arg Met Ala Gln Ala Gly Val Leu Ala Leu Thr Pro Gln
4550 4555 4560
Gln Gly Ile Ala Leu Phe Asp Arg Ala Trp Asn Ser Gly Ala Ala
4565 4570 4575
Thr Leu Val Pro Met Ser Leu Asp Thr Ala Val Leu Arg Arg Lys
4580 4585 4590
Ala Ala Asp Ser Ala Leu Pro Ala Pro Phe Arg Ala Leu Val Arg
4595 4600 4605
Thr Pro Leu Arg Arg Ala Ala Ala Gly Pro Ala Gln Ala Ala Gly
4610 4615 4620
Gln Ser Phe Ala Gln Arg Leu Ala Glu Gln Pro Gly Ser Ser Arg
4625 4630 4635
Arg Arg Leu Leu Leu Glu Leu Ile Gln Arg Gln Val Gly Thr Val
4640 4645 4650
Leu Asp Tyr Gly Ala Asp Thr Leu Leu Asp Ala Arg Arg Thr Phe
4655 4660 4665
Arg Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn
4670 4675 4680
Arg Leu Val Ala Ala Thr Gly Val Gln Leu Ser Ala Ala Leu Val
4685 4690 4695
Phe Asp His Pro Thr Ala Asp Ala Leu Ala Glu Tyr Leu Glu Ser
4700 4705 4710
Lys Val Leu Arg Ser Gln Val Gly Ala Pro Leu Pro Val Leu Thr
4715 4720 4725
Gln Leu Asp His Leu Glu Ala Ala Leu Ala Ala Pro Pro Ala Asp
4730 4735 4740
Thr Ala Thr Arg Glu Gln Ile Ala Ala Arg Leu Arg Ala Leu Ala
4745 4750 4755
Ser Thr Trp Ser Ala Gln Pro Asp Asp Gly His Gly Ala Asp Asp
4760 4765 4770
Gly Asp Ile Ser Ser Lys Leu Asp Ser Ala Thr Asp Glu Glu Leu
4775 4780 4785
Phe Asp Phe Ile Ser Gly Glu Phe Gly Glu Asp
4790 4795
<210>4
<211>3835
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>4
Val Ala Gly Ser Ser Thr Thr Ser Pro Arg Ser Thr Arg Pro Ser Leu
1 5 10 15
Ala Ser Pro Arg Val Arg Arg Arg Pro Trp Thr Leu Ser Ser Ala Cys
20 25 30
Ser Ser Arg Pro Ala Gly Lys Arg Trp Asn Arg Pro Val Ser Thr Ser
35 40 45
Thr Arg Tyr Ala Ala Ala Val Pro Glu Cys Ser Pro Ala Ser Ala Ser
50 55 60
Arg Thr Thr Ala Leu Cys Trp Pro Pro His Arg Ala Gly Trp Thr Ala
65 70 75 80
Thr Pro Pro Pro Ala Pro Pro Thr Ala Ser Cys Pro Ala Ala Ser Arg
85 90 95
Thr Ser Trp Ala Trp Arg Ala Pro Pro Ser Pro Ser Thr Pro Pro Ala
100 105 110
Pro Pro His Trp Trp Pro Cys Thr Ser Pro Cys Arg Arg Cys Ala Thr
115 120 125
Ala Ser Ala Thr Ser Arg Trp Arg Ala Gly Arg Arg Arg Cys Pro Pro
130 135 140
Pro Pro Ser Thr Trp Pro Cys Pro Val Ser Ala His Trp His Pro Thr
145 150 155 160
Ala Ala Pro Arg Arg Ser Arg Arg Arg Pro Thr Val Pro Asp Gly Ala
165 170 175
Arg Glu Ser Val Ser Ser Pro Ser Ser Gly Cys Pro Thr Pro Ala Gly
180 185 190
Ser Gly His Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val Asn Gln
195 200 205
Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln
210 215 220
Arg Val Ile Arg Gln Ala Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp
225 230 235 240
Val Asp Ile Val Glu Ala His Gly Thr Gly Thr Ser Leu Gly Asp Pro
245 250 255
Ile Glu Ala Asp Ala Leu Leu Ser Thr Tyr Gly Gln Ala Arg Pro Ala
260 265 270
Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly His Ser
275 280 285
Gly Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Gln Ala Leu
290 295 300
Arg His Gly Val Met Pro Arg Thr Leu His Ala Glu Glu Pro Thr Pro
305 310 315 320
Asn Val Asp Trp Ser Ser Gly Ala Val Glu Leu Leu Asn Arg Ala Arg
325 330 335
Asp Trp Pro Ala Ser Gly Thr Arg Arg Arg Ala Ala Val Ser Ser Phe
340 345 350
Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Gln
355 360 365
Asp Ser Gly Pro Glu Thr Gly Asp Glu Ala Asp Pro Ser Pro Glu Gly
370 375 380
Thr Pro Trp Pro Leu Leu Pro Trp Val Leu Ser Ala Arg Ser Glu His
385 390 395 400
Ala Leu Arg Gly Gln Ala Arg Ala Leu His Thr His Leu Leu Ala His
405 410 415
Pro Glu Pro Ala Asp Thr Asp Val Ala Leu Ser Leu Ala Thr Thr Arg
420 425 430
Thr Gly Leu Glu Tyr Arg Ala Ala Val Leu Ala Ala Asp Arg Asp Gly
435 440 445
Phe Leu Asn Ala Leu Glu Ala Leu Ala Asp Asp Arg Pro Thr Asn Gly
450 455 460
Val Leu Arg Gly Thr Ala Ala Glu Gly Lys Ala Val Phe Val Phe Pro
465 470 475 480
Gly Gln Gly Ala Gln Trp Thr Gly Met Ala Arg Glu Leu Leu Asp Thr
485 490 495
Ser Pro Val Phe Ala Ala Lys Ala Ala Glu Cys Ala Ala Ala Ile Glu
500 505 510
Glu Phe Val Asp Phe Lys Val Leu Asp Val Leu Arg Asp Glu Pro Gly
515 520 525
Ala Ala Ser Met Asp Arg Ile Glu Val Val Gln Pro Val Leu Phe Thr
530 535 540
Val Met Val Ser Leu Ala Glu Leu Trp Arg Ser Phe Gly Ile Gln Pro
545 550 555 560
Asp Ala Val Val Gly Ser Ser Gln Gly Glu Ile Ala Ala Ala His Val
565 570 575
Ala Gly Gly Leu Thr Leu Glu Asp Ala Ala Arg Val Ile Cys Leu Arg
580 585 590
Ser Arg Leu Leu Ala Glu Thr Leu Val Gly Lys Gly Ala Val Ala Ser
595 600 605
Val Ala Leu Pro Ala Asp Gln Val Arg Glu Arg Leu Arg Arg Trp Asp
610 615 620
Gly Arg Leu Ser Val Ala Gly Val Asn Gly Pro Arg Leu Val Ala Val
625 630 635 640
Ala Gly Asp Asp Ala Ala Leu Ala Glu Phe Val Glu Glu Cys Ala Arg
645 650 655
Asp Asp Ile Arg Ala Arg Thr Val Ala Ala Thr Val Pro Thr His Cys
660 665 670
Ala Leu Val Asp Pro Leu Arg Glu Arg Leu Leu Glu Leu Leu Ala Pro
675 680 685
Val Arg Pro Arg Thr Gly Thr Val Pro Leu Tyr Ser Thr Val Thr Gly
690 695 700
Gly Leu Leu Asp Thr Ala Thr Met Asp Ala Gly Tyr Trp Tyr Asp Asn
705 710 715 720
Thr Arg Ala Pro Val Leu Phe Glu Pro Val Val Arg Thr Leu Leu Ala
725 730 735
Glu Gly His His Ala Phe Val Glu Ser Ser Ala His Pro Val Leu Ala
740 745 750
Met Ala Val Glu Gln Thr Val Asp Ala Thr Gly Ala Pro Gly Val Val
755 760 765
Val Glu Ser Leu Arg Arg Asp Glu Gly Gly Pro Gly Arg Met Leu Thr
770 775 780
Ser Leu Thr Lys Ala His Leu Gly Gly Val Arg Val Asp Trp Pro Thr
785 790 795 800
Val Phe Ala Gly Thr Gly Ala Arg Thr Val Asp Leu Pro Thr Tyr Ala
805 810 815
Phe Gln Arg Thr Arg Tyr Trp Ala Glu Thr Ala Asp Arg Thr Gly Asp
820 825 830
Val Gly Ser Val Gly Leu Ser Pro Val Asp His Pro Leu Leu Gly Ala
835 840 845
Leu Val Arg Met Ala Asp Gly Asp Gly Ala Val Leu Thr Gly Arg Leu
850 855 860
Ser Leu His Thr His Gly Trp Leu Ala Asp His Gly Val Ala Asp Gln
865 870 875 880
Val Ile Phe Pro Gly Thr Gly Phe Val Glu Leu Ala Val Leu Ala Gly
885 890 895
Asp Gln Val Gly Cys Gly Arg Ile Glu Glu Leu Thr Leu His Thr Pro
900 905 910
Leu Val Val Pro Arg Thr Gly Ala Leu Val Val Gln Val Asn Val Gln
915 920 925
Ala Ala Asp Asp Thr Gly Ala Arg Ala Leu Gly Val Tyr Ser Arg Pro
930 935 940
Asp Asp Ala Gly Ala Asp Met Val Trp Thr Arg His Ala Ser Gly Val
945 950 955 960
Leu Val Pro Glu Asp Thr Val Asp Ala Glu Asp Thr Asp Gly Leu Ser
965 970 975
Gly Val Trp Pro Pro Glu Gly Ala Glu Pro Val Ala Ile Ser Gly Leu
980 985 990
Tyr Asp Gly Met Ala Ala Ala Gly Tyr Gln Tyr Gly Pro Gly Phe Arg
995 1000 1005
Gly Leu Ser Arg Ala Trp His Leu Asp Gly Asp Val Tyr Ala Glu
1010 1015 1020
Val Ala Leu Pro Ala Asp Gln Thr Ser Ala Ala Glu Arg Tyr Gly
1025 1030 1035
Leu His Pro Ala Leu Phe Asp Ala Ala Leu His Ala Met Phe Thr
1040 1045 1050
Trp Asp Gly Asp Asp Gly Gly Gly Val Gly Met Pro Phe Ser Trp
1055 1060 1065
Thr Gly Val Arg Leu His Ala Thr Gly Cys Ala Arg Leu Arg Val
1070 1075 1080
Arg Leu Ala Arg Arg Gly Glu Ser Asp Phe Thr Val Thr Leu Thr
1085 1090 1095
Asp Glu Ala Gly Asp Pro Val Val Ser Val Asp Ser Leu Val Val
1100 1105 1110
Arg Arg Met Thr Gly Ala Ala Pro Asp Thr Val Arg Thr Asp Thr
1115 1120 1125
Leu Tyr Arg Leu Asp Trp Lys Thr Val Arg Ala Gly Glu Glu Thr
1130 1135 1140
Ser Ala Pro Arg Cys Val Leu Leu Gly Thr Asp Pro Leu Gly Val
1145 1150 1155
Ala Ala Ala Leu Pro Gly Thr Ala Arg Val Ala Asp Val Glu Arg
1160 1165 1170
Leu Ala Glu Leu Ala Ala Ala Gly Gly Pro Val Thr Ala Leu Leu
1175 1180 1185
Pro Val Ala Gly Asp Gly Ser Ala Glu Arg Ile Gly Asp Pro Val
1190 1195 1200
Ile Asp Thr Val Ala Val Leu Gln Ser Trp Ile Ala Asp Gly Arg
1205 1210 1215
Leu Asp Asp Thr Arg Leu Val Val Leu Thr Arg Gly Ala Val Ala
1220 1225 1230
Thr Ala Pro Arg Glu Asp Val Thr Asp Leu Ala Ala Ala Gly Val
1235 1240 1245
Trp Gly Leu Met Arg Ser Ala Gln Asn Glu His Pro Gly Arg Phe
1250 1255 1260
Gly Leu Ile Asp Leu Asp Thr Ala Glu Ser Ser Thr Ala Ala Leu
1265 1270 1275
Gly Thr Ala Leu Ala Ser Glu Glu Glu Gln Leu Ala Leu Arg Asp
1280 1285 1290
Gly Val Leu Arg Gly Pro Ser Leu Thr Arg Trp Asp Pro Gly Thr
1295 1300 1305
Thr Ile Leu Pro Pro Ala Gly Glu Ser Ala Trp Arg Leu Glu Asn
1310 1315 1320
Thr Arg Pro Gly Thr Ile Glu Gly Leu Asp Ala Ala Pro Cys Pro
1325 1330 1335
Glu Leu Leu Ala Pro Leu Gly Pro Arg Gln Val Arg Ile Ala Val
1340 1345 1350
Arg Ala Ala Gly Ile Asn Phe Lys Asp Val Val Val Ala Leu Asp
1355 1360 1365
Leu Val Pro Gly Leu Thr Gly Leu Gly Gly Glu Val Ala Gly Val
1370 1375 1380
Ile Thr Ala Val Gly Ala Glu Val Thr Tyr His Arg Val Gly Asp
1385 1390 1395
Gln Val Phe Gly Leu Ala Thr Glu Val Phe Gly Pro Val Thr Val
1400 1405 1410
Ala Asp Glu Arg Thr Val His Arg Ile Pro Asp Gly Trp Thr Phe
1415 1420 1425
Glu Glu Ala Ala Ser Val Ala Val Thr Tyr Met Thr Ala Tyr Tyr
1430 1435 1440
Gly Leu Val Asp Leu Gly Gly Leu Arg Ala Gly Gln Ser Val Leu
1445 1450 1455
Ile His Ala Gly Ala Gly Gly Val Gly Ser Ala Ala Val Gln Leu
1460 1465 1470
Ala Arg His Leu Gly Ala Glu Val Tyr Ala Thr Ala Ser Pro Gly
1475 1480 1485
Lys Trp Gly Ala Leu Arg Ala Gln Gly Leu Asp Gly Ala His Ile
1490 1495 1500
Ala Asn Ser Arg Thr Leu Asp Phe Glu Gln Trp Phe Leu His Ser
1505 1510 1515
Thr Asp Gly Arg Gly Met Asp Val Val Leu Asp Cys Leu Ala Gly
1520 1525 1530
Glu Phe Val Asp Ala Gly Leu Arg Leu Leu Pro Arg Gly Gly His
1535 1540 1545
Phe Leu Glu Met Gly Lys Thr Asp Lys Arg Asp Ala Glu Gln Val
1550 1555 1560
Gly Ala Ala His Pro Gly Val Val Tyr Arg Ala Tyr Asp Leu Pro
1565 1570 1575
Glu Ala Gly Pro Asp Arg Ile His Glu Met Leu Val Thr Leu Thr
1580 1585 1590
Gly Leu Phe Glu Asp Gly Val Leu Arg Pro Pro His Val Asn Ala
1595 1600 1605
Trp Asp Ile Arg Asp Ala Arg Ala Ala Phe Arg Ala Leu Ser Gln
1610 1615 1620
Ala Ala Leu Val Gly Lys Ala Val Leu Thr Leu Pro Gly Val Pro
1625 1630 1635
Phe Ser Pro His Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Met
1640 1645 1650
Leu Gly Ala Leu Leu Ala Arg His Leu Val Thr Ala His Asn Val
1655 1660 1665
Thr Ser Leu Leu Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly
1670 1675 1680
Ala Ala Glu Leu Thr Ala Glu Leu Thr Glu Ala Gly Ala Arg Val
1685 1690 1695
Asp Val Val Ala Cys Asp Val Ala Asp Arg Asp Gln Leu Ala Ala
1700 1705 1710
Leu Leu Ala Gly Ile Pro Ala Glu Arg Pro Leu Thr Ala Val Leu
1715 1720 1725
His Thr Ala Ala Ala Leu Asp Asp Gly Leu Val Glu Ser Leu Thr
1730 1735 1740
Ala Glu Arg Thr Arg Ala Val Leu Arg Pro Lys Val Asp Gly Ala
1745 1750 1755
Val Gln Leu His Glu Leu Thr Arg Asp Leu Asp Leu Gly Ala Phe
1760 1765 1770
Val Leu Phe Ser Ser Leu Ala Gly Thr Met Gly Ala Pro Gly Gln
1775 1780 1785
Gly Asn Tyr Ala Ala Ala Asn Val Met Leu Asp Ala Leu Ala Ala
1790 1795 1800
His Arg Arg Ala Gln Gly Leu Pro Gly Leu Ser Leu Ala Trp Gly
1805 1810 1815
Phe Trp Asp Gln Arg Ser Glu Met Ser Gly Asn Leu Asp Asp Arg
1820 1825 1830
Asp Ile Gln Arg Met Ser Arg Gly Gly Ile Val Pro Met Ser Ser
1835 1840 1845
Glu Glu Gly Leu Ala Thr Phe Asp Leu Ala Cys Arg Thr Asp Arg
1850 1855 1860
Ala Gln Leu Val Pro Ala Arg Leu Asp Pro Ala Ala Leu Ala Gly
1865 1870 1875
Thr Thr Gly Arg Val Pro Pro Val Met Arg Ala Leu Ile Pro Ala
1880 1885 1890
Pro Ala Arg Arg Ser Gly Arg Arg Ser Ala Glu Ala Gly Asp Asp
1895 1900 1905
Ser Leu Arg Ala Arg Leu Val Pro Leu Thr Gly Thr Glu Arg Thr
1910 1915 1920
Arg Ile Leu Leu Gln Leu Val Arg Ser Asn Ala Ala Thr Val Leu
1925 1930 1935
Gly His Thr Asp Pro Asp Ala Val Gly Ala Ala Thr Pro Phe Arg
1940 1945 1950
Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Phe Arg Asn Arg
1955 1960 1965
Leu Thr Gly Ala Val Gly Phe Arg Leu Pro Val Thr Val Val Phe
1970 1975 1980
Asp His Pro Thr Pro Gly Ala Leu Thr Asp Phe Leu Ala Ala Glu
1985 1990 1995
Leu Leu Gly Gly Leu Asp Glu Thr Asp Ala Pro Ala Gly Pro Ser
2000 2005 2010
Arg Ala Thr Pro Ala Ala Val Ala Arg Thr Asp Glu Glu Pro Leu
2015 2020 2025
Val Ile Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Ile Ser Thr
2030 2035 2040
Pro Glu Glu Leu Trp Asp Phe Val Leu Ala Glu Arg Asp Ala Ile
2045 2050 2055
Ser Gly Phe Pro Glu Asp Arg Gly Trp Arg Arg Glu Arg Ser Ala
2060 2065 2070
Asp Gly Ser Ala Pro Gln Gln Gly Gly Phe Leu Asp Arg Val Ala
2075 2080 2085
Glu Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu
2090 2095 2100
Thr Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu
2105 2110 2115
Ala Leu Glu Arg Ala Gly Ile Ala Pro Gly Thr Leu Arg Gly Ser
2120 2125 2130
Arg Thr Gly Ile Phe Val Gly Ala Ala Ala Ser Gly Tyr Thr Ser
2135 2140 2145
Leu Phe Arg Arg Gly Ser Glu Ala Leu Ala Gly Tyr Gly Val Thr
2150 2155 2160
Gly Ala Ser Thr Ser Val Val Ser Gly Arg Val Ala Tyr Val Leu
2165 2170 2175
Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser
2180 2185 2190
Ser Leu Val Ala Leu His Thr Ala Ala Leu Ser Leu Arg Ala Gly
2195 2200 2205
Asp Cys Asp Leu Ala Leu Ala Gly Gly Val Ala Val Met Thr Ser
2210 2215 2220
Pro Phe Leu Phe Asp Asp Phe Ala Arg Gln Gly Gly Leu Ser Pro
2225 2230 2235
Asp Gly Arg Cys Lys Ala Phe Ala Gly Ser Ala Asp Gly Thr Gly
2240 2245 2250
Trp Ala Glu Gly Thr Gly Met Val Leu Leu Glu Arg Leu Ser Asp
2255 2260 2265
Ala Arg Arg Asn Gly His Pro Val Leu Ala Val Leu Arg Gly Ser
2270 2275 2280
Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn
2285 2290 2295
Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Asp Arg Ala
2300 2305 2310
Gly Leu Thr Pro Ala Asp Ile Asp Ala Val Glu Ala His Gly Thr
2315 2320 2325
Gly Thr Val Leu Gly Asp Pro Ile Glu Ala Gln Ala Val Leu Ala
2330 2335 2340
Thr Tyr Gly Arg Asp Arg Asp Pro Asp Arg Pro Val Leu Leu Gly
2345 2350 2355
Ser Leu Lys Ser Asn Ile Gly His Ser Gln Ala Ala Ala Gly Ile
2360 2365 2370
Gly Gly Val Ile Lys Thr Val Gln Ala Leu Leu His Gly Ile Leu
2375 2380 2385
Pro Arg Ser Leu His Ile Asp Glu Pro Thr Pro His Val Asp Trp
2390 2395 2400
Ser Ala Gly Ala Val Asp Leu Leu Thr Glu Thr Arg Ser Trp Pro
2405 2410 2415
Ala Thr Asp His Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val
2420 2425 2430
Ser Gly Thr Asn Ala His Ala Ile Leu Glu Gln Ala Thr Glu Pro
2435 2440 2445
Glu Pro Pro Ile Val Asp Gln Ala Pro Leu Pro Val Thr Pro Trp
2450 2455 2460
Leu Leu Ser Gly His Asp Glu Gln Gly Leu Arg Ala Gln Ala Glu
2465 2470 2475
Thr Leu Val Ser Trp Leu Arg Glu Gln Pro Glu Gly Ser Val Thr
2480 2485 2490
Asp Ile Gly His Ala Leu Ala Thr Arg Arg Ala Ala Leu Glu His
2495 2500 2505
Arg Ala Ala Leu Pro Val Thr Asp Arg Asp Glu Ala Leu Ala Arg
2510 2515 2520
Leu Ala Glu Phe Ala Ala Gly Arg Val Pro Asp Gly Leu Leu Arg
2525 2530 2535
Gly Thr Ala Gln Glu Gly Cys Leu Ala Leu Leu Phe Ala Gly Gln
2540 2545 2550
Gly Thr Gln Arg Pro Gly Met Gly Arg Asp Leu Tyr Ala Ala Phe
2555 2560 2565
Pro Ala Phe Ala His Ala Phe Asp Glu Ala Cys Ala His Leu Asp
2570 2575 2580
Pro Leu Leu Gly Arg Pro Leu Arg Asp Thr Val Phe Thr Ala Glu
2585 2590 2595
Ala Ala Glu Leu Asp Arg Thr Ala Ile Thr Gln Pro Ala Leu Phe
2600 2605 2610
Ala Leu Glu Val Ala Leu Tyr Arg Leu Leu Glu Ser Trp Gly Val
2615 2620 2625
Glu Pro Glu Tyr Val Leu Gly His Ser Val Gly Glu Ile Ala Ala
2630 2635 2640
Ala His Ala Ala Gly Val Leu Asp Leu Pro Asp Ala Ala Arg Leu
2645 2650 2655
Val Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro Pro Gly Gly
2660 2665 2670
Ala Met Leu Ala Val Gln Val Gly Glu Thr Glu Ala Thr Glu Ala
2675 2680 2685
Leu Gly Ala Val Leu Gly Glu Arg Ala Ala Thr Val Asp Leu Ala
2690 2695 2700
Ala Val Asn Gly Pro His Ser Val Val Phe Ser Gly Thr Ala Arg
2705 2710 2715
Ser Val Asp Ala Leu Asp Ala His Phe Thr Ala Arg Gly Arg Arg
2720 2725 2730
Thr Arg Arg Leu Thr Val Ser His Ala Phe His Ser Pro Leu Met
2735 2740 2745
Glu Pro Met Leu Asp Glu Phe Ala Glu Leu Val Ser Arg Leu Thr
2750 2755 2760
Phe Ala Ala Pro Arg Ile Pro Val Val Ser Asp Leu Thr Gly Ser
2765 2770 2775
Val Leu Gly Ala Gly Asp Leu Ala Asp Pro Arg His Trp Val Arg
2780 2785 2790
His Ala Arg His Thr Val Arg Phe Ala Asp Gly Ile Asp Thr Leu
2795 2800 2805
Val Gly Ala Gly Val Thr Asp Phe Leu Glu Leu Gly Pro Asp Ala
2810 2815 2820
Thr Leu Ala Thr Met Ala Glu Asp Cys Phe Ala Thr Ala Pro Thr
2825 2830 2835
Gly Val Cys Thr Ser Leu Leu Arg Arg Asp Gly Ser Glu Pro Val
2840 2845 2850
Thr Leu Leu Met Ala Leu Ala Arg Ala His Val His Gly Val Thr
2855 2860 2865
Val Asp Trp Lys Ala Val Leu Ala Gly Thr Gly Ala Arg Trp Val
2870 2875 2880
Asp Leu Pro Thr Tyr Ala Phe Gln Arg Glu Ser Tyr Trp Pro Ala
2885 2890 2895
Glu Ser Thr Ala Gly Arg Ser Asp Pro Ser Ser Ala Gly Phe Asp
2900 2905 2910
Asp Thr Gly His Pro Leu Leu Gly Ala Met Val Gly Ala Ala Gly
2915 2920 2925
Gly Asp Val Leu Phe Thr Gly Glu Leu Ser Leu Ala Ala Gln Pro
2930 2935 2940
Trp Leu Ala Asp His Arg Val Leu Asp Ala Val Leu Phe Pro Gly
2945 2950 2955
Thr Gly Phe Leu Glu Leu Ala Ser Trp Ala Gly Ser Arg Leu Asp
2960 2965 2970
Ala Gly Asp Leu Glu Glu Leu Val Val His Arg Pro Leu Val Leu
2975 2980 2985
Pro Glu His Gly Gly Val Thr Val Gln Val Val Val Gly Glu Ala
2990 2995 3000
Thr Asp Glu Asp Arg Arg Pro Val Ala Val Tyr Ser Arg Ala Ala
3005 3010 3015
Asp Asp Ala Gly Trp Thr Arg His Ala Glu Gly Leu Leu Ala Thr
3020 3025 3030
Gly Pro Ala Ala Gln Pro Ala Asp Pro Ser Ala His Trp Pro Pro
3035 3040 3045
Gln Gly Ala Glu Arg Val Asp Leu Asp Glu Phe Tyr Ala Gly Leu
3050 3055 3060
Ala Asp Ala Gly Thr Ala Tyr Gly Pro Val Phe Gln Gly Leu Thr
3065 3070 3075
Ala Val Trp Arg Leu Asp Gly Glu Ile Tyr Ala Asp Val Ala Leu
3080 3085 3090
Pro Ala Gln Ala Ala Asp Asp Ala Arg Gly Phe Gly Val His Pro
3095 3100 3105
Ala Leu Leu Asp Ala Ala Leu His Thr Leu Ala Phe Leu Pro Gly
3110 3115 3120
Ala Asp Arg Ser Ser Gly Pro Phe Leu Pro Phe Ala Trp Arg Asp
3125 3130 3135
Val Thr Val Pro Gly Pro Gly Ala Thr Ser Cys Arg Ile Arg Leu
3140 3145 3150
Thr Pro Gly Asn Gly Thr Asp Glu Val Ala Ala Thr Leu Trp Asp
3155 3160 3165
Gly Asp Gly Arg Pro Leu Ala Ala Val Gly Gly Leu Ser Leu Arg
3170 3175 3180
Ser Val Ser Arg Thr Gln Leu Gly Thr Ser Ala Val Ala Ser Ser
3185 3190 3195
Leu Phe Arg Met Asp Trp Thr Pro Ala Ser Gln Pro Arg Ala Val
3200 3205 3210
Gly Ala Pro Thr Val Arg Trp Ala Val Val Gly Pro Asp Ala Pro
3215 3220 3225
Gly Thr Pro Asp Ile Asp His Tyr Ala Asp Leu Val Ala Leu Arg
3230 3235 3240
Arg His Leu Ala Asp Gly Gly Pro Val Pro Asp Gln Val Leu Leu
3245 3250 3255
Pro Cys Ala Pro Ser Ala Gly Gly Ala Asp Ala Gly Ala Ala Arg
3260 3265 3270
Asp Ala Val His Ala Ala Leu His Thr Leu Arg Thr Trp Ala Glu
3275 3280 3285
Asp Glu His Phe Ala Lys Ser Arg Leu Val Leu Cys Thr Arg Gly
3290 3295 3300
Ala Val Val Ala Gln Pro Gly Glu Gly Val Arg Asp Leu Ala His
3305 3310 3315
Ala Ala Val Trp Gly Leu Ala Arg Ser Ala Gln Leu Glu His Pro
3320 3325 3330
Asp Arg Phe Val Leu Val Asp Leu Asp Thr Gly Thr Thr Leu Asp
3335 3340 3345
Asp Leu Thr Arg Ser Gln Leu Leu Ala Arg Thr Glu Ser Thr Asp
3350 3355 3360
Ala Ala Gln Phe Ala Ile Arg Gly Ala Leu Thr Leu Val Pro Ala
3365 3370 3375
Val Thr Arg Gln Ala Gly Gln Val Pro Ala Pro Glu Ala Pro Trp
3380 3385 3390
Pro Ala Asp Gly Thr Thr Leu Ile Thr Gly Ala Gly Gly Met Ile
3395 3400 3405
Gly Gly Leu Leu Ala Arg His Leu Val Arg Glu His Gly Val Arg
3410 3415 3420
His Leu Leu Leu Leu Gly Arg Arg Gly Glu Asp Thr Pro Gly Met
3425 3430 3435
Ala Glu Leu Arg Arg Glu Leu Thr Asp Ala Gly Ala Asp Val His
3440 3445 3450
Val Thr Ala Cys Asp Ala Ala Asp Arg Glu Ala Leu Ala Ala Val
3455 3460 3465
Leu Gly Arg Ile Pro Ser Thr Ala Pro Leu Thr Ala Val Val His
3470 3475 3480
Ala Ala Gly Val Val Asp Asp Gly Val Leu Gly Ser Val Thr Asp
3485 3490 3495
Glu Gln Val Asp Arg Val Leu Arg Pro Lys Ile Asp Ala Ala Val
3500 3505 3510
Asn Leu His His Leu Thr Ala Pro Leu Gly Leu Arg Ala Phe Val
3515 3520 3525
Val Cys Ser Ser Leu Ala Gly Ala Leu Gly Gly Gly Gly Gln Ser
3530 3535 3540
Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Cys Leu Arg
3545 3550 3555
Arg Arg Ala Asp Gly Leu Pro Ala Leu Ser Leu Ala Trp Gly Pro
3560 3565 3570
Trp Glu Ser Ser Ala Gly Met Thr Ala Gln Leu Ala Ala Ala Asp
3575 3580 3585
Leu Arg Arg Ile Ser Arg Ala Gly Met Gln Pro Leu Thr Pro Asp
3590 3595 3600
Asp Gly Leu Ala Leu Phe Asp Ala Ala His Ala Thr Gly Glu Ala
3605 3610 3615
Val Leu Leu Pro Phe Arg Phe Glu Pro Gly Gly Leu Ser Thr Ala
3620 3625 3630
Asp Arg Ala Ser Leu Pro Pro Ala Leu Arg Pro Leu Val Pro Arg
3635 3640 3645
Pro Arg Arg Arg Pro Gly Asp Pro Val Pro Gly Leu Ser Gly Leu
3650 3655 3660
Arg Asp Arg Leu Arg Pro Leu Ser Gln Asp Asp Arg Thr Gly Ala
3665 3670 3675
Leu Glu Asn Leu Val Arg Ala Glu Val Ala Ser Val Leu Ala Leu
3680 3685 3690
Pro Ser Ala Asp Ala Val Pro Val Thr Lys Ala Phe Lys Thr Leu
3695 3700 3705
Gly Phe Asp Ser Leu Met Ala Val Asp Leu Arg Asn Arg Leu Ser
3710 3715 3720
Ala Leu Thr Gly Val Arg Leu Pro Ala Thr Leu Val Phe Asp His
3725 3730 3735
Pro Thr Pro Arg Ala Leu Ala Thr Arg Leu Leu Thr Gly Met Glu
3740 3745 3750
Leu Asp Thr Ala Thr Ala Thr Asp Pro Ala Leu Leu Ala Leu Arg
3755 3760 3765
Glu Leu Glu Thr Ala Val Arg Ser Met Ala Pro Gly Ala Asp Asp
3770 3775 3780
Arg Gly Ala Met Ala Thr Arg Leu Arg Val Leu Leu Thr Ala Leu
3785 3790 3795
Glu Glu Thr Ala Asp Asp Thr Asp Gly Ala Asp Thr Asp Gly Asp
3800 3805 3810
Thr Asp Leu Asp Ser Val Ser Thr Glu Glu Leu Val Asn Leu Leu
3815 3820 3825
Gly Asp Glu Phe Gly Leu Thr
3830 3835
<210>5
<211>3897
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>5
Val Thr Asn Glu Ala Gln Leu Val Asp Tyr Leu Lys Lys Leu Ala Ala
1 5 10 15
Asp Leu Arg Gln Ala His Arg Arg Ile Lys Lys Leu Glu Ala Gly Glu
20 25 30
Thr Glu Pro Ile Ala Ile Val Gly Met Ala Cys Arg Leu Pro Gly Gly
35 40 45
Val Gly Ser Pro Glu Glu Leu Trp Asp Leu Val Leu Arg Gly Glu Asp
50 55 60
Ala Val Thr Asp Met Pro Ser Asp Arg Gly Trp Ala Leu Gly Glu Leu
65 70 75 80
Tyr Asp Val Asp Pro Asp Arg Pro Gly Thr Thr Tyr Ala Thr Gln Gly
85 90 95
Gly Phe Leu Arg Gly Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly Ile
100 105 110
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125
Glu Thr Ala Trp Glu Ala Leu Glu Asn Thr Gly Val Asp Pro Arg Ser
130 135 140
Leu Ala Gly Ser Arg Thr Gly Ile Phe Thr Gly Leu Met Tyr His Asp
145 150 155 160
Tyr Ala Ser Gly Pro Gly Thr Leu Pro Asp Glu Val Glu Gly Tyr Leu
165 170 175
Ser Thr Gly Met Ala Gly Ser Val Ala Ser Gly Arg Ile Ser Tyr Phe
180 185 190
Leu Glu Leu Glu Gly Pro Ala Val Thr Leu Asp Thr Ala Cys Ser Ser
195 200 205
Ser Leu Val Ala Leu His Leu Ala Val Gln Ala Leu Arg Asp Gly Glu
210 215 220
Cys Asp Leu Ala Leu Ala Gly Gly Ala Thr Val Met Ala Thr Pro Ala
225 230 235 240
Thr Phe Val Glu Asn Ser Arg Gln Arg Gly Leu Ala Thr Asp Gly Arg
245 250 255
Cys Lys Ala Phe Ala Ala Ala Ala Asp Gly Val Gly Trp Gly Glu Gly
260 265 270
Ser Ala Leu Leu Val Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly
275 280 285
His Asp Val Leu Ala Ile Val Arg Gly Ser Ala Val Asn Gln Asp Gly
290 295 300
Ala Ser Asn Gly Leu Thr Ser Pro Asn Gly Pro Ser Gln Glu Arg Val
305 310 315 320
Ile Glu Gln Ala Leu Ala Ser Ala Arg Leu Gly Phe Ala Asp Ile Asp
325 330 335
Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu
340 345 350
Ala Gln Ala Leu Ile Ala Thr Tyr Gly Arg Glu Arg Pro Asp Ser Ser
355 360 365
Pro Leu Arg Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala
370 375 380
Ala Ala Gly Ala Ala Gly Ile Ile Lys Met Val Met Ala Met Arg His
385 390 395 400
Gln Gln Leu Pro Arg Thr Leu His Val Asp Arg Pro Thr Pro Glu Val
405 410 415
Asp Trp Ser Ala Gly Thr Val Glu Leu Leu Thr Glu Asn His Ala Trp
420 425 430
Pro Arg Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile
435 440 445
Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Pro Pro Thr Ala Asp
450 455 460
Pro Pro Thr Asp Thr Ala Lys Gly Ala Asp Leu Pro Ala His Gly Ala
465 470 475 480
Glu Arg Ala Asp Val Ser Asp Ser Met Val Ser Ala Val Leu Pro Val
485 490 495
Val Pro Val Pro Leu Ser Ala Ala Thr Pro Ala Ala Leu Pro Ala Gln
500 505 510
Ala Ala Arg Leu His Ala His Leu Leu Asp Arg Pro Asp Leu Pro Leu
515 520 525
Gly Asp Leu Ala Ala Ala Leu Ala Thr Thr Arg Thr Ala Phe Glu His
530 535 540
Arg Ala Val Leu Leu Thr Glu Ser Arg Glu Glu Leu Leu Gly Gly Leu
545 550 555 560
Ala Glu Leu Ala Arg Gly Glu Arg Pro Ala Gly Leu Val Asp Gly Val
565 570 575
Ala Asp Glu Val Arg Cys Ala Phe Leu Phe Thr Gly Gln Gly Ala Gln
580 585 590
Arg Pro Gly Met Gly Arg Glu Leu Tyr Glu Thr Phe Pro Ala Tyr Ala
595 600 605
Arg Ala Leu Asp Glu Val Cys Ala Glu Leu Gly Ala Arg Leu Asp Met
610 615 620
Pro Leu Leu Pro Leu Leu Leu Ala Asp Ala Asn Ser Ala Glu Ala Arg
625 630 635 640
Leu Leu Asp Arg Thr Leu Tyr Thr Gln Ser Ala Thr Phe Ala Leu Gly
645 650 655
Val Ala Leu Phe Arg Leu Leu Glu Glu Trp Gly Val Arg Pro Arg Leu
660 665 670
Leu Ser Gly His Ser Val Gly Glu Leu Thr Ala Thr His Val Ser Gly
675 680 685
Met Leu Ser Leu Ala Asp Ala Cys Glu Leu Val Ala Thr Arg Gly Arg
690 695 700
Leu Met Gln Glu Leu Pro Glu Gly Gly Ala Met Val Ser Val Ala Ala
705 710 715 720
Thr Ala Asp Glu Val Leu Pro Leu Leu Ala Gly His Glu Ser Val Ala
725 730 735
Gly Val Ala Ala Val Asn Gly Pro Gly Ser Val Val Val Ser Gly Asp
740 745 750
Glu Asp Val Val Thr Gly Ile Ala Ala His Phe Thr Glu Leu Gly Arg
755 760 765
Arg Thr Arg Arg Ile Pro Val Ser His Ala Phe His Ser Pro Leu Met
770 775 780
Asp Pro Val Val Glu Pro Leu Gly Glu Val Ala Gly Arg Leu Ser Phe
785 790 795 800
Glu Pro Pro Arg Ile Pro Val Val Ser Ser Val Thr Gly Thr Val Leu
805 810 815
Asp Ala Ala Asp Trp Ala Asp Pro Ala Tyr Trp Ala Arg Gln Ala Arg
820 825 830
Glu Pro Val Arg Phe His Asp Val Val His Thr Leu Val Ala Glu Glu
835 840 845
Val Thr Val Phe Leu Glu Leu Gly Ala Asp Ala Ala Leu Thr Ser Met
850 855 860
Thr Glu Glu Thr Leu Ala Ala Ser Gly Thr Pro Thr Val Val Ala Pro
865 870 875 880
Ala Leu Arg Arg Gln Arg Pro Glu Val Arg Thr Leu Thr Ala Met Leu
885 890 895
Ala Gln Ala His Thr Ala Gly Val Pro Ile Asp Trp Arg Thr Phe Phe
900 905 910
Gly Gly Arg Pro Thr Ser Arg Val Pro Leu Pro Thr Tyr Ala Phe Gln
915 920 925
Gly Thr Arg Tyr Trp Leu Glu Thr Ala Pro Gly Ala Gly Asp Met Gly
930 935 940
Ala Ala Gly Leu Val Ala Ala Glu His Pro Leu Leu Gly Ala Thr Leu
945 950 955 960
Val Pro Ala Val Gly Gly Gly Arg Leu Phe Thr Gly Arg Leu Ser Val
965 970 975
Glu Ala Gln Pro Trp Leu Ala Asp His Ala Ile Asp Asp Ala Val Leu
980 985 990
Leu Pro Gly Thr Ala Val Ala Glu Leu Ala Leu Trp Ala Ala Arg His
995 1000 1005
Val Gly Leu Asn His Val Ala Asp Leu Val Leu Glu Val Pro Leu
1010 1015 1020
Ala Leu Pro Arg Gly Gly Gly Leu Arg Val Gln Leu Ala Val Asp
1025 1030 1035
Ser Pro Asp Ala Ser Gly Asp Arg Gly Phe Gly Leu Tyr Thr Gln
1040 1045 1050
Pro Glu Gly Ser Ala Asp Asp Val Trp Thr Arg His Ala Gly Gly
1055 1060 1065
Thr Leu Thr Ala Val Arg Thr Ala Ser Ala Glu Glu Leu Thr Val
1070 1075 1080
Trp Pro Pro Ala Gly Ala Glu Lys Leu Asp Thr Asp Gly Cys Tyr
1085 1090 1095
Ala Asp Phe Ala Ala Ala Gly Val Arg Tyr Gly Pro Ala Phe Gln
1100 1105 1110
Gly Leu Arg Ala Val Trp Arg His Gly Glu Glu Val Tyr Ala Glu
1115 1120 1125
Val Arg Leu Pro Glu Asp Val Thr Gly Asp Ala Gly Glu Phe Cys
1130 1135 1140
Leu His Pro Ala Leu Ala Asp Ala Ala Leu His Ala Ser Ala Phe
1145 1150 1155
Val Pro Gly Glu Phe Gly Arg Glu Gln Arg Ala Arg Leu Pro Phe
1160 1165 1170
Ala Trp Arg Gly Val Ser Leu His Ala Ala Gly Ala Ser Phe Leu
1175 1180 1185
Arg Val Arg Leu Ala Pro Thr Gly Pro Asp Thr Leu Ala Leu Leu
1190 1195 1200
Phe Ala Asp Ala Ala Gly Arg Thr Val Ala Thr Val Glu Ser Leu
1205 1210 1215
Ala Val Arg Pro Ala Gly Ala Val Glu Ala Leu Asp Gly Ala Val
1220 1225 1230
Asp Ala Leu Leu Met Pro Ser Trp Val Pro Val Gly Gly Cys Glu
1235 1240 1245
Thr Pro Gly Arg Trp Ala Val Leu Gly Pro Gly Pro Leu Ala Gly
1250 1255 1260
Leu Pro His Ala Asp Val His Ala Asp Leu Ala Gly Leu Glu Ala
1265 1270 1275
Ala Val Glu Ala Gly Ala Glu Val Pro Asp Phe Ile Val Gly Thr
1280 1285 1290
Val Gly Thr Ser Asp Gly Ser Val Asp Ala Asp Ala Ala His Glu
1295 1300 1305
Ala Ala Glu Arg Ala Leu Ala Leu Leu Arg Ser Trp Leu Ala Gly
1310 1315 1320
Glu Arg Leu Gly Thr Ala Arg Leu Val Met Val Thr Trp Asn Ala
1325 1330 1335
Ala Ala Val Ala Asp Gly Asp Ala Pro Asp Pro Val Gln Ala Ala
1340 1345 1350
Val Trp Gly Leu Leu Ser Ser Ala Val Thr Glu His Pro Gly Arg
1355 1360 1365
Ile Ala Leu Val Asp Leu Asp Gly Thr Ala Glu Ser Leu Ala Ala
1370 1375 1380
Leu Ala Ser Thr Val Gly Val Asp Glu Pro Arg Leu Ala Leu Arg
1385 1390 1395
Glu Gly Arg Ala Thr Ala Pro Arg Leu Thr Arg Ala Ser Ala Gly
1400 1405 1410
Ser Pro Arg Pro Pro Arg Gly Ile Asp Pro Asn Gly Thr Ala Leu
1415 1420 1425
Ile Thr Gly Gly Thr Gly Thr Leu Gly Ala Leu Leu Ala Arg His
1430 1435 1440
Leu Val His Gln His Gly Val Thr Asp Leu Leu Leu Thr Ser Arg
1445 1450 1455
Arg Gly Pro Asp Ala Pro Gly Ala Thr Glu Leu Thr Ala Glu Leu
1460 1465 1470
Thr Lys Ala Gly Ala His Val Thr Ile Thr Ala Cys Asp Thr Ala
1475 1480 1485
Asp Pro Asp Gln Leu Ala Ala Leu Leu Ser His His Thr Leu Thr
1490 1495 1500
Thr Val Ile His Thr Ala Gly Ile Leu Asp Asp Ala Thr Ile Thr
1505 1510 1515
Thr Leu Thr Asn Thr Gln Leu His Asn Val Leu Arg Pro Lys Ile
1520 1525 1530
Asp Ala Ala Thr His Leu His His Leu Thr Leu Asn His Pro Val
1535 1540 1545
Thr Thr Phe Ile Leu Tyr Ser Ser Ala Ala Gly Gln Leu Gly Ala
1550 1555 1560
Pro Gly Gln Ala Asn Tyr Ala Ala Ala Asn Thr Tyr Leu Asp Ala
1565 1570 1575
Leu Ala His His Arg Arg Thr His Gly Leu Pro Ala Thr Ser Leu
1580 1585 1590
Ala Trp Gly Leu Trp Asn Thr Arg Ser Thr Met Thr Gly His Leu
1595 1600 1605
Asn Asp Lys Glu Leu His Arg Met Glu Arg Ala Gly Val Val Pro
1610 1615 1620
Ile Ser Glu Glu Gln Gly Met Ala Leu Leu Asp Ala Ala Val Gly
1625 1630 1635
Leu Asp Ala Pro Val Ala Val Pro Leu Pro Leu Glu Pro Gly Ala
1640 1645 1650
Leu Arg Ser Gln Ala Ala Ala Gly Thr Leu Pro Pro Leu Leu Arg
1655 1660 1665
Gly Phe Val Arg Val Pro Val Arg Arg Ala Ala Asn Ala Ala Glu
1670 1675 1680
Gly Ala Tyr Ala Gly Met Thr Phe Ala Ala Gly Leu Arg Glu Leu
1685 1690 1695
Pro Glu Ala Glu Arg Leu Arg Leu Leu Leu Asp Leu Val Arg Thr
1700 1705 1710
His Ala Ala Arg Ala Leu Gly His Ala Asn Thr Asp Gly Leu Glu
1715 1720 1725
Ala Arg Arg Ser Phe Arg Glu Leu Gly Phe Asp Ser Leu Ala Ala
1730 1735 1740
Ile Glu Leu Arg Asn Gly Val Gly Ala Ala Thr Gly Leu Gly Leu
1745 1750 1755
Pro Ala Thr Leu Val Phe Asp His Pro Thr Pro Gln Arg Leu Ala
1760 1765 1770
Glu His Leu His Glu Lys Leu Phe Asp Arg Gly Ala Glu Val Ala
1775 1780 1785
Leu Pro Glu Leu Arg Ala Thr Asp Asp Asp Pro Ile Val Ile Val
1790 1795 1800
Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Ala Thr Pro Asp Ala
1805 1810 1815
Leu Trp Glu Leu Val Ala Ala Glu Arg Asp Ala Ile Ser Gly Met
1820 1825 1830
Pro Glu Asp Arg Gly Trp Asp Val Glu Glu Leu Tyr Asp Pro Glu
1835 1840 1845
Leu Ala Arg Pro Gly Thr Ser Tyr Val Arg Arg Gly Gly Phe Leu
1850 1855 1860
Tyr Glu Ala Ala Asp Phe Asp Ala Glu Phe Phe Gly Ile Ser Pro
1865 1870 1875
Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu
1880 1885 1890
Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp Pro Arg Ala
1895 1900 1905
Val Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met Tyr His
1910 1915 1920
Asp Tyr Gly Ser Gly Pro Gly Thr Leu Pro Asp Glu Val Glu Gly
1925 1930 1935
Phe Ile Gly Thr Gly Ser Ala Gly Ser Val Ala Ser Gly Arg Val
1940 1945 1950
Ala Phe Thr Leu Gly Leu Glu Gly Pro Ala Val Thr Leu Asp Thr
1955 1960 1965
Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ala Gln Ala
1970 1975 1980
Leu Arg Gly Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr
1985 1990 1995
Val Met Ala Thr Pro Gly Val Phe Val Glu Leu Ser Arg Gln Gly
2000 2005 2010
Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe Ala Ala Gly Ala
2015 2020 2025
Asp Gly Thr Gly Trp Gly Glu Gly Val Gly Met Leu Leu Val Glu
2030 2035 2040
Arg Leu Ser Asp Ala Arg Arg His Gly His Pro Val Leu Ala Val
2045 2050 2055
Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu
2060 2065 2070
Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Gly Gln Ala
2075 2080 2085
Leu Ala Ser Ala Gly Leu Ala Ala Val Asp Val Asp Val Val Glu
2090 2095 2100
Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro Ile Glu Ala Gln
2105 2110 2115
Ala Leu Leu Ala Thr Tyr Gly Gln Asp Arg Pro Val Asp Arg Pro
2120 2125 2130
Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala
2135 2140 2145
Ala Ala Gly Val Ala Gly Val Ile Lys Ser Val Leu Ala Met Arg
2150 2155 2160
His Gly Val Leu Pro Arg Thr Leu His Val Glu Glu Pro Thr Pro
2165 2170 2175
Glu Val Asp Trp Ser Ser Gly Ala Val Glu Leu Leu Ala Gln Ala
2180 2185 2190
Arg Glu Trp Pro Glu Thr Gly Arg Pro Arg Arg Ala Gly Val Ser
2195 2200 2205
Ala Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln
2210 2215 2220
Ala Pro Glu Thr Val Glu Glu Ser Ala Pro Gly Glu Thr Gly Ser
2225 2230 2235
Val Leu Val Pro Trp Val Ile Ser Ala Arg Ser Ala Gln Ala Leu
2240 2245 2250
Arg Glu Gln Ala Arg Asn Leu Ala Gly His Val Ala Arg His Gly
2255 2260 2265
Leu Arg Pro Val Asp Val Gly Phe Ser Leu Ala Ala Ala Arg Ala
2270 2275 2280
Gly Leu Gly His Arg Ala Val Leu Val Gly Arg Glu Thr Ser Glu
2285 2290 2295
Leu Leu Ala Gln Leu Glu Ala Leu Ala Glu Gly Arg Val Ala Gly
2300 2305 2310
Gly Ser Val Thr Asp Gly Gly Thr Ala Phe Leu Phe Ser Gly Gln
2315 2320 2325
Gly Ser Gln Arg Ala Ser Met Gly Arg Glu Leu Tyr Glu Ala Phe
2330 2335 2340
Pro Val Phe Ala Ala Ala Phe Asp Glu Val Cys Ala Gly Phe Asp
2345 2350 2355
Gly Met Leu Pro Gly Ser Leu Arg Asp Ala Val Phe Ala Gly Gly
2360 2365 2370
Glu Val Leu Asp Arg Thr Glu Trp Thr Gln Ala Gly Leu Phe Ala
2375 2380 2385
Leu Glu Val Ala Leu Phe Glu Leu Val Gly Ser Trp Gly Val Arg
2390 2395 2400
Ala Asp Val Leu Leu Gly His Ser Ile Gly Glu Leu Ala Ala Ala
2405 2410 2415
Tyr Val Ala Gly Val Trp Ser Leu Gln Asp Ala Cys Arg Val Val
2420 2425 2430
Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro Ser Gly Gly Val
2435 2440 2445
Met Val Ala Val Gln Ala Ala Glu Glu Glu Leu Pro Glu Leu Pro
2450 2455 2460
Ala Gly Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Leu Val
2465 2470 2475
Leu Ser Gly Asp Glu Glu Pro Val Thr Ala Val Ala Gln Glu Leu
2480 2485 2490
Ala Gly Arg Gly Arg Arg Ile Lys Arg Leu Ala Val Gly His Ala
2495 2500 2505
Phe His Ser Ala Arg Met Glu Pro Met Leu Ala Gln Phe Ala Glu
2510 2515 2520
Val Leu Ala Gly Val Glu Phe Arg Arg Pro Arg Ile Ala Val Val
2525 2530 2535
Ser Asn Val Thr Gly Gln Ile Ala Asp Glu Glu Leu Ala Thr Pro
2540 2545 2550
Ala Tyr Trp Val Arg His Val Arg Glu Ala Val Arg Phe Ala Asp
2555 2560 2565
Gly Val Thr Thr Ala His Ser Arg Gly Val Asp Lys Phe Leu Glu
2570 2575 2580
Leu Gly Pro Gly Gly Ser Leu Thr Ala Met Ala Glu Glu Thr Leu
2585 2590 2595
Asp His Thr Gly Thr Gly Thr Val Cys Thr Pro Ile Leu His Pro
2600 2605 2610
Glu Arg Pro Glu Ala Gln Ser Val Val His Ala Leu Gly Arg Ile
2615 2620 2625
Tyr Ala Ala Gly Ala Pro Ala Asp Trp Ser Ala Phe Phe Thr Gly
2630 2635 2640
Thr Gly Ala Arg Arg Val Asp Leu Pro Thr Tyr Ala Phe Gln Arg
2645 2650 2655
Arg Arg Phe Trp Leu Glu His Arg Arg Gly Ala Gly Asp Leu Thr
2660 2665 2670
Ala Met Gly Leu Gln Ala Ala Asp His Pro Leu Leu Gly Ala Ala
2675 2680 2685
Val Thr Leu Ala Asp Gly Glu Gly Val Leu Leu Thr Gly Arg Leu
2690 2695 2700
Ser Gly Arg Ala Gln Pro Trp Leu Leu Asp His Ala Leu Leu Gly
2705 2710 2715
Gln Val Leu Leu Pro Gly Ser Ala Phe Val Asp Leu Val Ile Arg
2720 2725 2730
Ala Gly Asp Leu Leu Asn Arg Pro Tyr Leu Glu Val Leu Thr Pro
2735 2740 2745
His Thr Pro Leu Leu Leu Gly Ala Gly Pro Glu Asp Glu Val Thr
2750 2755 2760
Val Gln Val Arg Ala Thr Pro Asp Ser Asp Ser Gly Arg Cys Thr
2765 2770 2775
Val Thr Leu His Ser Arg Thr Ser Asp Gly Asp Trp Thr Leu His
2780 2785 2790
Ala Thr Gly Thr Leu Ser Ala Asp Ala Pro Ala Glu Pro Ala Pro
2795 2800 2805
Leu Pro Ser Trp Pro Pro Ala Gly Ala Glu Ala Val Glu Thr Asp
2810 2815 2820
Gly Val Tyr Gln Glu Leu Ala Ala Ser Gly Tyr His Tyr Gly Pro
2825 2830 2835
Ala Phe Gln Cys Leu His Ala Leu Trp Arg Gln His Asp Glu Leu
2840 2845 2850
Phe Ala Glu Val Arg Leu Pro Glu Ser Glu Arg Glu Glu Gly Thr
2855 2860 2865
Arg Tyr Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu His Ala
2870 2875 2880
Met Ala Phe Val Gly Gly Arg Asp Glu Gly Val Arg Leu Pro Ser
2885 2890 2895
Ser Trp Ala Ala Val Arg Leu Tyr Ala Ser Gly Ala Thr Thr Ala
2900 2905 2910
Arg Val Arg Leu Thr Pro Ser Gly Asp Gln Leu Ala Leu Leu Val
2915 2920 2925
Thr Asp Glu Ala Gly Arg Pro Val Val Ser Val Gly Ser Val Val
2930 2935 2940
Thr Lys Pro Ala Val Phe Asp Gln Pro Ser Gly Gly Thr Leu Glu
2945 2950 2955
Gln Ala Leu Leu His Leu Asp Trp Thr Ala Leu Pro Val Ala Ala
2960 2965 2970
Ala Gln Ser Tyr Ala Leu Val Gly Asp Asp Pro Phe Gly Leu Thr
2975 2980 2985
Gly Ala Ala Leu Arg Val Ala Ala Thr Phe Glu Glu Leu Ala Ala
2990 2995 3000
Asn Gly Pro Val Pro Gly Ile Val Val Arg Cys Leu Ala Pro Arg
3005 3010 3015
Val Ser Asp Asp Pro Ala Ala Asp Ala His Ala Ala Ala Glu Ala
3020 3025 3030
Thr Leu Gly Val Ile Arg Ala Trp Leu Ala Asp Asp Arg Phe Ala
3035 3040 3045
Ser Ala Arg Leu Val Leu Val Thr Ser Gly Ala Val Ala Ala Gly
3050 3055 3060
Asp Ala Glu Asp Val Thr Asp Leu Ala Asn Ser Thr Ser Trp Gly
3065 3070 3075
Leu Val Gly Ser Ala Gln Thr Glu His Pro Asp Arg Phe Phe Leu
3080 3085 3090
Val Asp Leu Asp Gly Leu Asp Thr Ser Arg Glu Val Phe Gly Asp
3095 3100 3105
Ala Leu Ala Cys Ala Glu Pro Arg Ile Ala Val Arg Arg Gly Thr
3110 3115 3120
Val Ala Ala Pro Arg Leu Ala Arg Ala Arg Ser His Pro Ala Leu
3125 3130 3135
Leu Pro Pro Ser Gly Pro Val Pro Trp Arg Leu Glu Ser Thr Gly
3140 3145 3150
Arg Gly Ser Ile Glu Asp Met Ala Leu Val Pro Cys Pro Glu Val
3155 3160 3165
Leu Asp Pro Leu Gly Pro Gly Gln Val Arg Ile Ala Val His Ala
3170 3175 3180
Val Gly Leu Asp Phe Gln Asp Val Val Ala Ser Leu Asp Pro Ala
3185 3190 3195
Glu Gly Ser Lys Gly Ile Ser Gly Tyr Ala Ala Gly Thr Val Gln
3200 3205 3210
Glu Thr Gly Ala Glu Val Thr Asp Leu Ala Val Gly Asp Arg Val
3215 3220 3225
Leu Ala Leu Arg Ser Gly Ser Ser Gly Pro Phe Ala Val Tyr Asp
3230 3235 3240
His Arg Cys Leu Ala Pro Met Pro Asp Gly Trp Ser Tyr Glu Gln
3245 3250 3255
Ala Ala Ala Val Pro Leu Thr Tyr Leu Ile Pro Tyr Tyr Gly Leu
3260 3265 3270
Val Asp Leu Ala Asp Val Gln Pro Gly Met Ser Val Leu Val His
3275 3280 3285
Asp Ala Ala Asp Gly Ser Gln Leu Ala Ala Val Gln Leu Ala His
3290 3295 3300
Gln Leu Gly Ala Glu Val Tyr Gly Thr Ala Ala Thr Gly Lys Trp
3305 3310 3315
Pro Thr Leu Arg Lys Tyr Gly Leu Asp Asp Ala His Ile Ala Asp
3320 3325 3330
Ser Arg Thr Pro Glu Phe Glu His Arg Phe Met Glu Thr Ser Gly
3335 3340 3345
Gly Cys Gly Val Asp Val Val Leu Asn Cys Leu Ala Gly Glu Ser
3350 3355 3360
Val Asp Ala Gly Leu Arg Leu Leu Pro Arg Gly Gly Arg Phe Leu
3365 3370 3375
Glu Thr Gly Arg Ala Asp Arg Arg Asp Pro Ala Gln Val Ala Glu
3380 3385 3390
Ala His Ala Gly Val Ala Tyr Arg Thr Tyr Asp Leu Ala Glu Ala
3395 3400 3405
Asp Pro Asp Arg Ile Arg Glu Met Leu Val Ala Val Met Ser Leu
3410 3415 3420
Cys His Asp Gly Lys Leu Thr Pro Pro Arg Ile Thr Val Arg Asp
3425 3430 3435
Leu Arg Arg Ala Arg Glu Ala Ser Arg His Ala Asn Arg Ala Thr
3440 3445 3450
Pro Ala Gly Pro Ala Val Leu Thr Val Pro Arg Gly Ile Asp Pro
3455 3460 3465
Asn Gly Thr Ala Leu Ile Thr Gly Gly Thr Gly Thr Leu Gly Ala
3470 3475 3480
Leu Leu Ala Arg His Leu Val His Gln His Gly Val Thr Asp Leu
3485 3490 3495
Leu Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Thr Glu
3500 3505 3510
Leu Thr Ala Glu Leu Thr Lys Ala Gly Ala His Val Thr Ile Thr
3515 3520 3525
Ala Cys Asp Thr Ala Asp Pro Asp Gln Leu Ala Ala Leu Leu Ser
3530 3535 3540
His His Thr Leu Thr Thr Val Ile His Thr Ala Gly Ile Leu Asp
3545 3550 3555
Asp Ala Thr Ile Thr Thr Leu Thr Asn Thr Gln Leu His Asn Val
3560 3565 3570
Leu Arg Pro Lys Ile Asp Ala Ala Thr His Leu His His Leu Thr
3575 3580 3585
Leu Asn His Pro Val Thr Thr Phe Ile Leu Tyr Ser Ser Ala Ala
3590 3595 3600
Gly Gln Leu Gly Ala Pro Gly Gln Ala Asn Tyr Ala Ala Ala Asn
3605 3610 3615
Thr Tyr Leu Asp Ala Leu Ala His His Arg Arg Thr His Gly Leu
3620 3625 3630
Pro Ala Thr Ser Leu Ala Trp Gly Leu Trp Asn Thr Arg Ser Thr
3635 3640 3645
Met Thr Gly His Leu Asn Asp Lys Glu Leu His Arg Met Glu Arg
3650 3655 3660
Ala Gly Val Val Pro Leu Glu Asp Ala Glu Ala Leu Ala Leu Phe
3665 3670 3675
Asp Leu Ala Cys Gly Ala Asp Val Pro Leu Gln Val Ile Thr Arg
3680 3685 3690
Leu Thr Pro Ser Thr Leu Arg Ser Gly Ala Asp Glu Val Pro His
3695 3700 3705
Leu Leu Arg Gly Leu Val Gln Gly Thr Ser Arg Arg Thr Ala Arg
3710 3715 3720
Ser Gly Ser Asn Gly Ser Gly Leu Arg Thr Arg Leu Ala Arg Leu
3725 3730 3735
Pro Ala Val Glu Gln His Arg Arg Val Leu Glu Leu Val Arg Ser
3740 3745 3750
His Ala Ala Thr Val Leu Gly His Ala Ser Val Ala Ala Val Thr
3755 3760 3765
Ala Glu Arg Ser Phe Ser Glu Leu Gly Phe Ser Ser Leu Thr Ala
3770 3775 3780
Val Glu Phe Arg Asn Arg Leu Gly Ala Ala Thr Gly Leu Arg Leu
3785 3790 3795
Pro Ala Thr Leu Val Phe Glu His Pro Thr Pro Thr Ala Leu Ala
3800 3805 3810
Thr Glu Leu Leu Thr Ala Leu Val Pro Ala Gly Leu Ser Gly Val
3815 3820 3825
Glu Ala Ala Leu Ala Glu Val Asp Ala Leu Glu Ala Ala Leu Lys
3830 3835 3840
Thr Ile Asp Ala Asp Asn Gly Asp Arg Asp Arg Val Val Arg Arg
3845 3850 3855
Leu Arg Gly Leu Leu Ser Glu Trp Arg Glu Pro Asp Thr Gly Pro
3860 3865 3870
Ala Ala Leu Asp Asp Leu Ala Thr Ala Thr Thr Asp Asp Leu Phe
3875 3880 3885
Glu Ala Ile Asp Gln Gly Phe Gly Leu
3890 3895
<210>6
<211>1868
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>6
Met Ser Asp Asp Glu Lys Tyr Arg Glu Tyr Leu Lys Arg Ala Val Thr
1 5 10 15
Glu Ala Arg Gly Leu Gln Arg Arg Leu Arg Glu Val Glu Asp Arg Ala
20 25 30
Arg Glu Pro Ile Ala Ile Ile Gly Met Ala Cys Arg Leu Pro Gly Gly
35 40 45
Ala Asp Thr Pro Glu Asp Val Trp Arg Met Leu Ser Glu Glu Ala Asp
50 55 60
Ala Val Ala Gly Phe Pro Asp Asp Arg Gly Trp Asn Leu Asp Gly Leu
65 70 75 80
Tyr Glu Thr Asp Ala Ala Gly Ala Gly Thr Ser Thr Pro Leu Glu Gly
85 90 95
Gly Phe Leu Arg Cys Ala Gly Glu Phe Asp Ala Ala Phe Phe Gly Ile
100 105 110
Ala Pro Arg Glu Ala Leu Thr Thr Asp Pro Gln Gln Arg Leu Leu Leu
115 120 125
Ala Ser Ser Trp Glu Ala Leu Glu Arg Ala Arg Ile Asp Pro Arg Ser
130 135 140
Leu Arg Gly Ser Asp Thr Gly Val Phe Phe Gly Gly Thr Ser Gly Asp
145 150 155 160
Phe Ala Gly Leu Leu Ala Ala Ser Pro His Ala Leu Asp Gly Tyr Leu
165 170 175
Met Thr Gly Thr Ser Ser Ser Val Leu Ser Gly Arg Val Ala Tyr Thr
180 185 190
Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser
195 200 205
Ser Leu Val Ser Leu His Leu Ala Val Gln Ala Leu Arg Lys Asp Glu
210 215 220
Ile Ser Leu Ala Leu Ala Gly Gly Val Thr Val Leu Ala Thr Pro Gly
225 230 235 240
Ala Phe Pro Glu Phe Ser Arg Gln Gly Gly Leu Ala Ser Asp Gly Arg
245 250 255
Cys Lys Ala Phe Ser Ser Asp Ala Asp Gly Thr Gly Trp Gly Glu Gly
260 265 270
Val Gly Val Leu Val Leu Gln Arg Leu Ser Asp Ala Gln Arg Thr Gly
275 280 285
His Pro Val Leu Ala Val Val Arg Glu Thr Gly Ile Asn Gln Asp Gly
290 295 300
Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gln Arg His Leu
305 310 315 320
Ile Leu Arg Val Leu Asp Asn Ala Gly Leu Ala Thr Ala Asp Val Asp
325 330 335
Met Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu
340 345 350
Ala Arg Ala Leu Leu Ala Thr Tyr Gly Gln Asp Arg Pro Ala Asp Arg
355 360 365
Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Val Gly His Thr Gln Tyr
370 375 380
Ala Ala Gly Val Ser Gly Val Ile Lys Thr Val Met Ala Leu Arg His
385 390 395 400
Gly Val Met Pro Lys Thr Leu His Val Asp Glu Pro Thr Pro His Val
405 410 415
Asp Trp Ser Ser Gly Ala Val Arg Leu Leu Thr Glu Ala Arg Glu Trp
420 425 430
Pro Glu Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val
435 440 445
Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Glu Ala Glu
450 455 460
Pro Val Glu Val Asp Glu Ala Asp Arg Pro Val Leu Met Gly Ser Val
465 470 475 480
Pro Trp Val Val Ser Ala Arg Gly Glu Gly Ala Leu Arg Ala Gln Ala
485 490 495
Gly Arg Leu Leu Glu Trp Leu Val Glu Arg Pro Gly Leu Gly Pro Val
500 505 510
Asp Val Gly Phe Ser Leu Val Gly Thr Arg Ser Ala Phe Glu Gln Arg
515 520 525
Ala Val Val Leu Gly Gly Asp Arg Glu Glu Leu Leu Ala Gly Leu Arg
530 535 540
Ser Val Ala Glu Gly Val Pro Gly Ala Gly Val Val Ser Gly Arg Ala
545 550 555 560
Ala Gly Asp Gly Gly Met Gly Val Val Phe Val Phe Pro Gly Gln Gly
565 570 575
Ser Gln Trp Val Gly Met Gly Arg Glu Leu Trp Glu Val Ser Ser Val
580 585 590
Phe Ala Glu Ser Met Val Ala Cys Glu Arg Ala Leu Val Pro Phe Val
595 600 605
Asp Trp Ser Leu Arg Asp Val Val Phe Gly Gly Gly Gly Asp Gly Leu
610 615 620
Trp Glu Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Val Met Val
625 630 635 640
Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro Ala Ala Val
645 650 655
Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val Ala Gly Gly
660 665 670
Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg Ser Arg Leu
675 680 685
Val Gly Glu Arg Leu Ser Gly Arg Gly Gly Met Val Ser Val Gly Leu
690 695 700
Ser Val Gly Glu Val Glu Glu Trp Leu Ala Gly Leu Gly Gly Arg Val
705 710 715 720
Gly Val Ala Ala Val Asn Gly Pro Ser Ser Val Val Val Ser Gly Glu
725 730 735
Ala Glu Val Leu Glu Gly Leu Leu Ala Gly Phe Glu Gly Ala Gly Val
740 745 750
Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Val Gln Val
755 760 765
Asp Ala Leu Gly Asp Asp Leu Leu Ala Gly Leu Ala Gly Ile Arg Pro
770 775 780
Val Ser Ser Ser Val Ala Phe Tyr Ser Thr Val Ser Gly Glu Arg Met
785 790 795 800
Asp Thr Ala Gly Leu Asp Ala Gly Tyr Trp Leu Arg Asn Met Arg Glu
805 810 815
Thr Val Ala Phe Glu Ala Ala Val Arg Ala Thr Leu Asp Glu Gly His
820 825 830
Arg Thr Leu Leu Glu Val Ser Pro His Pro Val Val Ala Met Ala Leu
835 840 845
Gln Glu Ile Ile Asp Gly Ala Gly Val Ser Ala His Val Ser Gly Thr
850 855 860
Ile Arg Arg Asp Asp Ala Gly Ala Gly Arg Leu Leu Thr Ser Leu Ala
865 870 875 880
Glu Ala Tyr Val Ala Gly Ala Pro Val Asn Trp Ser Val Val Phe Glu
885 890 895
Gly Thr Gly Ala Arg Pro Val Asp Leu Pro Thr Tyr Ala Phe Gln His
900 905 910
Gln Arg Tyr Trp Leu Arg Met Pro Val Ser Gly Ser Gly Asp Val Thr
915 920 925
Ala Ala Gly Leu Arg Ser Pro Gly His Pro Leu Leu Gly Ala Ala Val
930 935 940
Glu Pro Ala Glu Ser Asp Gly Leu Val Leu Thr Gly Arg Leu Ser Leu
945 950 955 960
Arg Asp His Pro Trp Leu Ala Asp His Arg Val Ala Gly Thr Val Pro
965 970 975
Leu Pro Gly Thr Ala Phe Val Glu Leu Ala Ala Val Ala Gly Asp Leu
980 985 990
Ala Glu Cys Pro Tyr Ile Glu Glu Leu Thr Leu Gln Thr Pro Leu Thr
995 1000 1005
Leu Pro Glu Thr Gly Gly Val Asp Leu Gln Leu Thr Val Gly Ala
1010 1015 1020
Pro Asp Asp Ala Gly Arg Arg Glu Val Gly Phe Phe Ala Arg Thr
1025 1030 1035
Asp Glu Glu Phe Thr Ala Ala Glu Trp Thr Arg His Ala Thr Gly
1040 1045 1050
Val Leu Cys Pro Ala Gly Pro Ala Pro Lys Ala Glu Pro Ala Asp
1055 1060 1065
Trp Pro Pro Arg Gly Ala Glu Arg Ile Asp Ile Gly Ser Leu Tyr
1070 1075 1080
Glu Asp Leu Ala Gly Gly Pro Leu Ala Tyr Gly Pro Ala Phe Arg
1085 1090 1095
Gly Leu Arg Ala Val Trp Ser Arg Gly Arg Glu Val Phe Ala Glu
1100 1105 1110
Ile Glu Leu Pro Gln Glu Leu His Glu Ala Ala Gly Glu Phe Leu
1115 1120 1125
Leu His Pro Ala Leu Leu Asp Ala Ala Leu His Ala Val Gly Phe
1130 1135 1140
Leu Gly Glu Leu Asp Ala Pro Ser Ala Pro Leu Arg Pro Phe Ala
1145 1150 1155
Trp Asn Ala Val Ser Leu Gln Ala Thr Gly Ala Thr Ala Leu Arg
1160 1165 1170
Val Ala Leu Ser Pro Ala Gly Lys Ser Ala Val Ser Leu Arg Ala
1175 1180 1185
Val Asp Gly Thr Gly Thr Pro Val Val Ser Ile Gly Ser Leu Leu
1190 1195 1200
Leu Arg Pro Ala Asp Leu Thr Asp Gly Asp Pro Gly Ser Gly Arg
1205 1210 1215
Thr Ala Thr His Ser Ser Leu Leu Ser Met Val Trp Thr Pro Val
1220 1225 1230
Pro Leu Pro Thr Val Lys Thr Ala Ala Trp Ala Val Val Gly Thr
1235 1240 1245
Ala Pro Ala Trp Ala Gly Pro Asp Ser Gly Ala Val His His Pro
1250 1255 1260
Asp Leu Leu Ala Leu Ser Ala Ser Val Ala Ala Gly Asp Pro Val
1265 1270 1275
Pro Ala Phe Val Val Leu Thr Pro Asp Asp Gly Glu Pro Ala Gly
1280 1285 1290
Pro Gly Phe Ala Glu Leu Pro Ala Leu Thr Arg Glu Arg Ala Gly
1295 1300 1305
Leu Val Leu Glu Ala Ala Arg Thr Trp Val Ser Glu Glu Leu Asp
1310 1315 1320
Glu Arg Leu Ala Ser Ile Pro Leu Val Val Thr Thr Thr Asp Ala
1325 1330 1335
Val Gly Ile Ser Ala Asp Asp Arg Val Ala Gly Leu Gly Ser Ala
1340 1345 1350
Pro Leu Trp Gly Leu Val Arg Ser Ile Gln Ser Glu Asn Pro Gly
1355 1360 1365
Arg Leu Val Leu Leu Asp Thr Asp Gly Ser Val Glu Ser Gly Arg
1370 1375 1380
Ala Val Arg Ala Ala Val Ala Ser Gly Glu Ala Gln Leu Ala Leu
1385 1390 1395
Arg Asp Gly Val Ala Leu Met Pro Arg Leu Asn Arg Pro Pro Ala
1400 1405 1410
Ala Asp Val Pro Asp Thr Val Pro Gly Thr Glu Leu Asp Ala Leu
1415 1420 1425
Asp Pro Ser Gly Thr Val Leu Val Thr Gly Ala Thr Gly Gly Ile
1430 1435 1440
Gly Ala Leu Val Ala Thr Arg Leu Ala Lys Leu His Gly Val His
1445 1450 1455
His Leu Leu Leu Leu Ser Arg Gln Gly Pro Asp Ala Glu Gly Ala
1460 1465 1470
Gly Glu Leu Val His Asp Leu Glu Glu Leu Gly Ala Thr Val Thr
1475 1480 1485
Leu Val Ala Cys Asp Val Ser Asp Arg Ala Ala Leu Ala Ala Val
1490 1495 1500
Leu Asp Gly Val Ser Ala Gly His Pro Leu Thr Ala Val Val His
1505 1510 1515
Cys Ala Gly Thr Ala Glu Asn Ala Leu Leu Ala Ser Leu Ala Pro
1520 1525 1530
Glu Leu Ile Asp Arg Val Phe Arg Ala Lys Val Asp Ala Ala Val
1535 1540 1545
His Leu His Glu Leu Thr Ala Glu Leu Asp Leu Ser Ala Phe Val
1550 1555 1560
Leu Phe Ser Ser Ile Ala Gly Thr Leu Gly Gly Thr Gly Gln Gly
1565 1570 1575
Asn Tyr Ala Ala Ala Asn Thr Phe Leu Asp Ser Leu Ala Gln Tyr
1580 1585 1590
Arg Arg Arg Asn Gly Leu Ala Ala Thr Ser Leu Gly Trp Gly Leu
1595 1600 1605
Trp Ala Thr Glu Arg Gly Met Asp Ser His Leu Ala Glu Gly Ala
1610 1615 1620
Ser Ala Gly Ser Pro Met Gly Gly Val Ser Ala Met Pro Ala Asp
1625 1630 1635
Gln Gly Leu Ala Leu Phe Asp Leu Gly Trp Arg Arg Ala Glu Pro
1640 1645 1650
Val Val Phe Pro Val Arg Leu Asn Ser Ala Ala Leu Arg Ala Gln
1655 1660 1665
Ala Ala Ala Gly Ser Leu Pro Pro Val Leu Arg Gly Leu Val Arg
1670 1675 1680
Val Pro Ala Gln Arg Ser Ala Gln Thr Gly Ser Gln Ala Pro Glu
1685 1690 1695
Ser Gln Leu Arg His Arg Leu Ala Glu Met Gly Pro Ala Glu Arg
1700 1705 1710
Gln Glu Thr Leu Leu Ala Leu Val Arg Asp Arg Ile Ala Ala Val
1715 1720 1725
Leu Gly His Ala Ser Ser Asp Gln Ile Glu Thr Asp Arg Pro Phe
1730 1735 1740
Arg Asp Leu Gly Phe Thr Ser Leu Thr Ala Val Glu Leu Arg Asn
1745 1750 1755
Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Pro Val Ser Leu Val
1760 1765 1770
Phe Asp Tyr Pro Thr Leu Gly Ala Leu Val Ala Leu Leu Val Ala
1775 1780 1785
Arg Leu Ala Pro Asp Gly Ala Gln Ser Ala Thr Thr Pro Glu Ala
1790 1795 1800
Glu Gln Glu Ala Ala Val Arg Arg Ala Leu Met Ser Val Pro Leu
1805 1810 1815
Asp Arg Leu Arg Glu His Gly Leu Leu Glu Ala Leu Leu Ala Leu
1820 1825 1830
Thr Gly Asp Glu Arg Ala Glu Pro Glu Val Ala Asp Arg Ser Glu
1835 1840 1845
Glu Ile Lys Ser Met Asp Val Thr Ala Leu Leu Ala Met Ala Arg
1850 1855 1860
Ser Thr Ser Thr Arg
1865
<210>7
<211>4635
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n.sp. NS3226)
<400>7
Met Ser Glu Ile Val Asp Ala Leu Arg Ala Ser Leu Leu Glu Asn Glu
1 5 10 15
Arg Leu Arg Gln Gln Asn Gln Arg Leu Ser Ala Ala Ser Ser Glu Pro
20 25 30
Leu Ala Ile Val Gly Ile Gly Cys Arg Tyr Pro Gly Gly Val Arg Asp
35 40 45
Thr Glu Gly Leu Trp Gln Leu Ile Ala Glu Gly Arg Asp Ala Met Ser
50 55 60
Asp Phe Pro Thr Asp Arg Gly Trp Glu Asp Arg Asp Val Pro Ala Ala
65 70 75 80
Arg Thr Gly Ala Phe Leu His Asp Ala Gly Asp Phe Asp Pro Ala Phe
85 90 95
Phe Arg Ile Ser Pro Arg Glu Ala Met Ala Met Asp Pro Gln Gln Arg
100 105 110
Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp
115 120 125
Pro Val Ser Leu Lys Gly Ser Arg Thr Gly Val Phe Ile Gly Gly Ala
130 135 140
Pro Gln Glu Tyr Gly Ala Leu Val Met Asn Ser Ala Gln Gly Ala Gly
145 150 155 160
Gly Tyr Ala Leu Thr Gly Ala Pro Gly Ser Val Leu Ser Gly Arg Ile
165 170 175
Ser Tyr Val Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala
180 185 190
Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ile Lys Ser Leu Arg
195 200 205
Thr Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Leu Val Leu Ile
210 215 220
Thr Pro Thr Ile Phe Thr Glu Phe Ser Ala Thr Gly Gly Ser Ala Gly
225 230 235 240
Asp Gly Arg Cys Lys Ala Phe Ser Ser Asp Ala Asp Gly Thr Gly Trp
245 250 255
Gly Glu Gly Ala Gly Val Leu Ala Ile Gln Arg Leu Ser Asp Ala Arg
260 265 270
Arg Asp Gly Asn Pro Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn
275 280 285
Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gln
290 295 300
Gln Arg Val Ile Arg Gln Ala Ile Ala Asn Ala Gly Leu Thr Leu Ala
305 310 315 320
Asp Val Asp Met Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp
325 330 335
Pro Ile Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Gln Glu Arg His
340 345 350
Asp Gly Arg Pro Leu Trp Leu Gly Thr Leu Lys Ser Asn Val Gly His
355 360 365
Thr Gln Ala Ala Ala Gly Ile Ser Gly Val Ile Lys Ala Ala Leu Ala
370 375 380
Leu Gln His Gly Ile Met Pro Lys Thr Leu His Val Asp Glu Pro Thr
385 390 395 400
Pro Glu Val Asp Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Glu Ala
405 410 415
Arg Gln Trp Pro Glu Thr Gly Gln Pro Arg Arg Val Gly Val Ser Ser
420 425 430
Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro
435 440 445
Glu Ala Ala Pro Ala Glu Gln Ala Asp Gly Asp Ala Pro Ala Glu Leu
450 455 460
Pro Val Thr Pro Trp Val Val Thr Gly Arg Asn Glu Ala Ala Leu Arg
465 470 475 480
Glu Gln Ala Ala Arg Leu Leu Asp His Leu Thr Gln Gln Pro Asp Leu
485 490 495
Ser Pro Arg Asp Val Gly Phe Ser Leu Val Gly Thr Arg Ser Ala Phe
500 505 510
Glu Gln Arg Ala Val Val Leu Gly Gly Asp Met Ala Ala Leu Thr Glu
515 520 525
Gly Val Arg Ala Leu Ala Ala Gln Glu Pro Asn Thr His Val Ile Ala
530 535 540
Gly Thr Ala Glu Val Arg Ser Gly Ile Val Phe Val Phe Pro Gly Gln
545 550 555 560
Gly Ser Gln Trp Val Gly Met Gly Arg Glu Leu Trp Asp Ala Ser Pro
565 570 575
Val Phe Ala Glu Ser Met Val Ala Cys Glu Arg Ala Leu Ala Pro Phe
580 585 590
Val Asp Trp Ser Leu Lys Asp Val Val Phe Arg Gly Ala Glu Asp Pro
595 600 605
Leu Trp Ala Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Val Met
610 615 620
Val Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro Val Ala
625 630 635 640
Val Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val Ala Gly
645 650 655
Gly Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg Ser Arg
660 665 670
Leu Val Arg Glu Lys Leu Ser Gly Leu Gly Gly Met Gly Ser Val Ala
675 680 685
Leu Pro Val Glu Ala Val Glu Val Arg Leu Gly Arg Phe Gly Gly Arg
690 695 700
Val Gly Val Ala Ala Val Asn Gly Pro Thr Ser Val Val Val Ser Gly
705 710 715 720
Glu Val Glu Ala Leu Asp Ala Leu Leu Ala Glu Cys Glu Glu Ala Gly
725 730 735
Val Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Ala Gln
740 745 750
Val Asp Ala Leu Thr Asp Asp Leu Leu Ala Glu Leu Ala Glu Leu Arg
755 760 765
Pro Gln Ser Ser Ser Val Ala Phe Tyr Ser Thr Val Thr Gly Glu Arg
770 775 780
Leu Asp Thr Ala Gly Leu Asp Ala Arg Tyr Trp Val Thr Asn Leu Arg
785 790 795 800
Glu Arg Val Asn Phe Glu Pro Val Thr Arg Leu Leu Ala Glu Lys Gly
805 810 815
Ala Gly Val Phe Val Glu Ser Ser Pro His Pro Val Leu Thr Val Ala
820 825 830
Val Thr Glu Thr Gly Glu Ala Ala Asp Arg Ser Val Val Ala Val Gly
835 840 845
Ser Leu Arg Arg Glu Glu Gly Gly Leu Arg Arg Phe Leu Ala Ser Leu
850 855 860
Ala Glu Ala Tyr Val Ala Gly Val Pro Val Asp Trp Ser Val Thr Phe
865 870 875 880
Ala Gly Ser Gly Ala Arg Arg Val Asp Leu Pro Thr Tyr Ala Phe Gln
885 890 895
His Gln Arg Tyr Trp Leu Asp Asp Val Val Leu Pro Gly Gln Gly Gly
900 905 910
Gly Gly Ser Ser Asp Pro Ala Asp Ala Ala Phe Trp Gly Ala Val Glu
915 920 925
Arg Ala Asp Ala Glu Ser Val Val Ser Leu Val Asp Gly Ala Asp Ala
930 935 940
Gln Val Trp Glu Ser Val Leu Pro Ala Leu Ser Ala Trp Arg Lys Gly
945 950 955 960
Arg Arg Thr Gln Ser Thr Leu Asp Ser Trp Arg Tyr Arg Thr Val Trp
965 970 975
Arg Ser Val Thr Val Ser Ser Ala Ala Ser Leu Cys Gly Val Trp Leu
980 985 990
Val Val Ser Ser Gly Pro Gly Ala Pro Val Glu Gln Val Thr Leu Ala
995 1000 1005
Leu Thr Ala Ala Gly Ala Glu Val Arg Val Leu Asp Val Pro Val
1010 1015 1020
Glu Arg Gly Ala Leu Ala Glu Trp Phe Ala Glu Ala Gly Glu Val
1025 1030 1035
Ala Gly Val Val Ser Leu Leu Ala Trp Asp Glu Asp Glu Ala Leu
1040 1045 1050
Ala Ser Ser Leu Ala Leu Val Gln Ala His Gly Asp Ala Gly Leu
1055 1060 1065
Ser Ala Pro Val Trp Val Leu Thr Arg Gly Ala Ala Ala Val Gly
1070 1075 1080
Ser Asp Asp Ala Val Cys Ala Thr Gln Thr Ser Leu Trp Ala Trp
1085 1090 1095
Gly Gln Val Val Gly Leu Glu Leu Pro Ala Val Trp Gly Gly Leu
1100 1105 1110
Val Asp Val Pro Ala Glu Trp Asp Gly Arg Val Ser Ser Ala Leu
1115 1120 1125
Ala Ala Val Leu Ala Ala Gly Glu Gly Glu Asp Gln Val Ala Val
1130 1135 1140
Arg Ser Ser Gly Val Tyr Ala Arg Arg Leu Val Trp Ala Pro Leu
1145 1150 1155
Gly Ala Gly Ala Ala Ala Val Arg Glu Phe Lys Pro Gln Gly Thr
1160 1165 1170
Val Leu Ile Thr Gly Gly Thr Gly Gly Val Gly Gly His Leu Ala
1175 1180 1185
Arg Trp Leu Ala Arg Glu Gly Ala Glu His Leu Leu Leu Val Asn
1190 1195 1200
Arg Thr Gly Glu Gly Ala Ala Glu Leu Leu Glu Glu Leu Arg Gly
1205 1210 1215
Ser Gly Ala Glu Val Thr Val Ala Ala Cys Asp Val Thr Asp Arg
1220 1225 1230
Ala Ala Leu Ala Glu Leu Leu Ala Gly Ile Pro Ala Glu Arg Pro
1235 1240 1245
Leu Thr Ala Val Phe His Ala Ala Gly Val Ala Gly Tyr Gly Leu
1250 1255 1260
Val Arg Glu Leu Asp Val Ala Asp Leu Asp Val Glu Met Ala Ala
1265 1270 1275
Arg Thr Leu Gly Ala Arg His Leu Asp Glu Leu Thr Ala Glu Leu
1280 1285 1290
Gly Leu Asp Leu Asp Ala Phe Val Val Phe Ser Thr Gly Ala Ser
1295 1300 1305
Val Trp Gly Ser Ala Gly Asn Gly Ala Asn Ala Ala Ala Gly Gly
1310 1315 1320
Tyr Leu Asp Gly Leu Ile Arg Gly Arg Arg Ala Arg Gly Leu Val
1325 1330 1335
Gly Ser Ser Val Ser Trp Gly Gly Trp Gly Ala Thr Ala Met Ala
1340 1345 1350
Val Gly Glu Thr Ala Glu Arg Leu Ser Arg Arg Gly Val Arg Leu
1355 1360 1365
Leu Glu Pro Glu Leu Ala Val Arg Ala Leu Arg Gln Val Leu Glu
1370 1375 1380
Gln Asp Glu Val Ser Val Thr Val Ala Asp Leu Asp Trp Ser Leu
1385 1390 1395
Phe Thr Pro Gly Tyr Ala Met Ala Arg Arg Arg Pro Leu Ile Glu
1400 1405 1410
Asp Ile Pro Glu Ala Ala Arg Ala Leu Arg Asp Ile Thr Glu Thr
1415 1420 1425
Asp Glu Thr Gln Asp Ala Ala Ala Gly Gly Leu Arg Glu Arg Leu
1430 1435 1440
Ala Gly Leu Ala Glu Ser Glu Gln Gln Ala Leu Leu Leu Gly Leu
1445 1450 1455
Val Arg Gly Glu Ala Ala Gln Val Leu Ala His Gly Ser Thr Ala
1460 1465 1470
Glu Ile Thr Pro Ser Arg Pro Phe Lys Glu Leu Gly Phe Asp Ser
1475 1480 1485
Leu Thr Gly Met Glu Leu Arg Asn Arg Leu Ser Lys Ala Thr Gly
1490 1495 1500
Leu Arg Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro Asn Leu Gln
1505 1510 1515
Gln Leu Ala Ser Leu Leu Arg Thr Ala Leu Ile Asp Gly Leu Pro
1520 1525 1530
Gly Ala Gly Ala Val Ala Thr Thr Val Arg Leu Val Asp Asp Glu
1535 1540 1545
Pro Leu Ala Ile Ile Gly Met Ala Cys Arg Tyr Pro Gly Asp Val
1550 1555 1560
Arg Asp Pro Glu Asp Leu Trp Arg Leu Val Ser Glu Gly Arg Asp
1565 1570 1575
Glu Leu Ser Asp Phe Pro Thr Asp Arg Gly Trp Glu Arg Trp Gly
1580 1585 1590
Thr Pro Ala Val Gly Gln Ala Gly Phe Leu His Glu Ala Gly Asp
1595 1600 1605
Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Ala Ser
1610 1615 1620
Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Val Ser Trp Glu Ala
1625 1630 1635
Phe Glu Gln Ala Gly Ile Asp Pro Trp Ser Leu Arg Asn Ser Pro
1640 1645 1650
Thr Gly Val Phe Val Gly Gly Gly Pro Gln Asp Tyr Pro Thr Val
1655 1660 1665
Leu Met Gly Ser Ala Glu Ala Ala Ser Gly Tyr Gly Met Thr Gly
1670 1675 1680
Ala Leu Gly Ser Val Met Ser Gly Arg Val Ser Tyr Met Leu Gly
1685 1690 1695
Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser
1700 1705 1710
Leu Val Ala Leu His Leu Ala Ala Gln Ser Leu His Asn Gly Glu
1715 1720 1725
Cys Gly Leu Ala Val Ala Gly Gly Val Thr Ile Met Ala Thr Pro
1730 1735 1740
Gly Ala Phe Leu Gly Phe Asp Thr Leu Gly Gly Leu Ala Glu Asp
1745 1750 1755
Gly Arg Cys Lys Ala Phe Ala Ala Ser Ala Asp Gly Thr Gly Trp
1760 1765 1770
Ala Glu Gly Val Gly Met Val Val Leu Glu Arg Leu Ser Asp Ala
1775 1780 1785
Arg Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser Ala
1790 1795 1800
Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly
1805 1810 1815
Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asn Ala Gly
1820 1825 1830
Leu Ser Ala Ala Asp Val Asp Met Val Glu Ala His Gly Thr Gly
1835 1840 1845
Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr
1850 1855 1860
Tyr Gly Gln Asp Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly Ser
1865 1870 1875
Val Lys Ser Asn Phe Gly His Thr Gly Ala Ala Ala Gly Val Ala
1880 1885 1890
Gly Val Ile Lys Ser Val Leu Ala Leu Arg His Gly Leu Met Pro
1895 1900 1905
Lys Thr Leu His Val Asp Glu Pro Thr Pro Glu Val Asp Trp Ser
1910 1915 1920
Ala Gly Ala Val Glu Leu Leu Thr Glu Ala Arg Gln Trp Pro Glu
1925 1930 1935
Thr Glu Gln Pro Arg Arg Val Gly Val Ser Ser Phe Gly Ile Ser
1940 1945 1950
Gly Thr Asn Ala His Leu Ile Leu Glu Glu Ala Pro Gln Ala Ala
1955 1960 1965
Ala Val Glu Asp Glu Arg Asp Gly Ser Val Ala Pro Val Ser Ser
1970 1975 1980
Pro Val Val Pro Trp Val Val Ser Gly Arg Ser Glu Thr Ala Leu
1985 1990 1995
Arg Ala Gln Ala Ala Arg Leu Ala Glu His Leu Ala Gln Arg Pro
2000 2005 2010
Glu Ala Gly Ala Leu Asp Val Gly Phe Ser Leu Val Glu Ser Arg
2015 2020 2025
Ser Ala Phe Glu Gln Arg Ala Val Val Leu Gly Ala Asp Arg Glu
2030 2035 2040
Glu Leu Leu Ala Gly Val Arg Ala Val Gly Glu Gly Ala Gln Ala
2045 2050 2055
Ser Gly Val Val Thr Gly Arg Ala Ala Gln Ser Gly Val Val Phe
2060 2065 2070
Val Phe Pro Gly Gln Gly Ser Gln Trp Val Gly Met Gly Arg Glu
2075 2080 2085
Leu Trp Asp Ala Ser Pro Val Phe Ala Glu Ser Met Val Ala Cys
2090 2095 2100
Glu Arg Ala Leu Ala Pro Phe Val Asp Trp Ser Leu Lys Asp Val
2105 2110 2115
Val Phe Arg Gly Ala Glu Asp Pro Leu Trp Ala Arg Val Asp Val
2120 2125 2130
Val Gln Pro Val Leu Trp Ala Val Met Val Ser Leu Ala Ala Val
2135 2140 2145
Trp Arg Ser Phe Gly Val Glu Pro Val Ala Val Val Gly His Ser
2150 2155 2160
Gln Gly Glu Val Ala Ala Ala Cys Val Ala Gly Gly Leu Ser Leu
2165 2170 2175
Glu Asp Gly Ala Arg Val Val Ala Val Arg Ser Arg Leu Val Arg
2180 2185 2190
Glu Lys Leu Ser Gly Leu Gly Gly Met Gly Ser Val Ala Leu Pro
2195 2200 2205
Val Glu Ala Val Glu Val Arg Leu Gly Arg Phe Gly Gly Arg Val
2210 2215 2220
Gly Val Ala Ala Val Asn Gly Pro Thr Ser Val Val Val Ser Gly
2225 2230 2235
Glu Val Glu Ala Leu Asp Ala Leu Leu Ala Glu Cys Glu Glu Ala
2240 2245 2250
Gly Val Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser
2255 2260 2265
Ala Gln Val Asp Ala Leu Thr Asp Asp Leu Leu Ala Glu Leu Ala
2270 2275 2280
Glu Leu Arg Pro Gln Ser Ser Ser Val Ala Phe Tyr Ser Thr Val
2285 2290 2295
Thr Gly Glu Arg Leu Asp Thr Ala Gly Leu Asp Ala Arg Tyr Trp
2300 2305 2310
Val Thr Asn Leu Arg Glu Arg Val Asn Phe Glu Pro Val Thr Arg
2315 2320 2325
Leu Leu Ala Glu Arg Glu His Gln Phe Phe Val Glu Ser Ser Pro
2330 2335 2340
His Pro Val Leu Thr Val Ala Val Thr Glu Thr Gly Glu Ala Ala
2345 2350 2355
Asp Arg Ser Val Val Ala Val Gly Ser Leu Arg Arg Glu Glu Gly
2360 2365 2370
Gly Val Gln Arg Leu Leu Thr Ser Leu Ala Glu Ala Tyr Val Ala
2375 2380 2385
Gly Val Pro Val Asp Trp Ser Lys Thr Phe His Gly Thr Gly Ala
2390 2395 2400
Gln Ser Val Asp Leu Pro Thr Tyr Ala Phe Gln His Gln His Tyr
2405 2410 2415
Trp Leu Asp Asp Val Val Leu Pro Gly Gln Gly Gly Gly Gly Ser
2420 2425 2430
Ser Asp Pro Ala Asp Ala Ala Phe Trp Gly Ala Val Glu Arg Ala
2435 2440 2445
Asp Ile Asp Ser Val Ala Ser Ile Val Asp Gly Val Asp Gln Gln
2450 2455 2460
Ala Trp Glu Ser Val Val Pro Ala Leu Ser Ala Trp Arg Lys Gly
2465 2470 2475
Arg Gln Glu Arg Ala Leu Leu Asp Ser Trp Arg Tyr Arg Thr Val
2480 2485 2490
Trp Arg Ser Val Thr Val Ser Ser Ala Ala Ser Leu Cys Gly Val
2495 2500 2505
Trp Leu Val Val Ser Ser Gly Pro Gly Ala Pro Val Glu Gln Val
2510 2515 2520
Thr Leu Ala Leu Thr Ala Ala Gly Ala Glu Val Arg Val Leu Asp
2525 2530 2535
Val Pro Val Glu Arg Gly Ala Leu Ala Glu Trp Phe Ala Glu Ala
2540 2545 2550
Gly Glu Val Ala Gly Val Val Ser Leu Leu Ala Trp Asp Glu Asp
2555 2560 2565
Glu Ala Leu Ala Ser Ser Leu Ala Leu Val Gln Ala His Gly Asp
2570 2575 2580
Ala Gly Leu Ser Ala Pro Val Trp Val Leu Thr Arg Gly Ala Ala
2585 2590 2595
Ala Val Gly Ser Asp Asp Ala Val Cys Ala Thr Gln Thr Ser Leu
2600 2605 2610
Trp Ala Trp Gly Gln Val Val Gly Leu Glu Leu Pro Ala Val Trp
2615 2620 2625
Gly Gly Leu Val Asp Val Pro Ala Glu Trp Asp Gly Arg Val Ser
2630 2635 2640
Ser Ala Leu Ala Ala Val Leu Ala Ala Gly Glu Gly Glu Asp Gln
2645 2650 2655
Val Ala Val Arg Ser Ser Gly Val Tyr Ala Arg Arg Leu Val Trp
2660 2665 2670
Ala Pro Leu Gly Ala Gly Ala Ala Ala Val Arg Glu Phe Lys Pro
2675 2680 2685
Gln Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Gly Val Gly Gly
2690 2695 2700
His Leu Ala Arg Trp Leu Ala Arg Glu Gly Ala Glu His Leu Leu
2705 2710 2715
Leu Val Asn Arg Thr Gly Glu Gly Ala Ala Glu Leu Leu Glu Glu
2720 2725 2730
Leu Arg Gly Ser Gly Ala Glu Val Thr Val Ala Ala Cys Asp Val
2735 2740 2745
Thr Asp Arg Ala Ala Leu Ala Glu Leu Leu Ala Gly Ile Pro Ala
2750 2755 2760
Glu Arg Pro Leu Thr Ala Val Phe His Ala Ala Gly Val Ala Gly
2765 2770 2775
Tyr Gly Leu Val Arg Glu Leu Asp Ala Ala Asp Leu Asp Ala Glu
2780 2785 2790
Met Ala Ala Lys Thr Leu Gly Ala Arg His Leu Asp Glu Leu Thr
2795 2800 2805
Ala Glu Leu Gly Leu Asp Leu Glu Ala Phe Val Leu Phe Ser Ser
2810 2815 2820
Gly Ala Ala Val Trp Gly Ser Ala Gly Ser Gly Gly Tyr Ala Ala
2825 2830 2835
Ala Asn Gly Tyr Leu Asp Gly Leu Ala Gln Glu Arg Arg Ala Arg
2840 2845 2850
Gly Leu Ala Ala Thr Ser Val Ser Trp Gly Asn Trp Lys Asp Thr
2855 2860 2865
Gly Leu Ala Thr Asp Thr Thr Ala Glu Gln Leu Ala Arg Leu Gly
2870 2875 2880
Val Arg Pro Met Asp Pro Ala Leu Ala Val Ala Ala Leu Arg Gln
2885 2890 2895
Val Leu Glu His Asp Glu Ile Ala Leu Thr Val Thr Asp Met Asp
2900 2905 2910
Trp Ala Arg Phe Ala Pro Gly Tyr Thr Leu Ala Arg Arg Arg Pro
2915 2920 2925
Leu Ile Glu Asp Ile Pro Glu Ala Thr Arg Ala Leu Ser Glu Asp
2930 2935 2940
Ser Ala Asp Pro Ala Asn Asp Met Ala Gly Ala Ala Leu Arg Ala
2945 2950 2955
Glu Leu Glu Gly Leu Gly Arg Ala Glu Gln Leu Ala Val Leu Met
2960 2965 2970
Asp Leu Val Arg Ser Glu Val Thr Arg Ile Leu Ala Gly Ala Ser
2975 2980 2985
Ala Ala Asp Ile Thr Pro Glu Arg Pro Phe Lys Glu Leu Gly Phe
2990 2995 3000
Asp Ser Leu Thr Ala Met Glu Leu Arg Asn Leu Leu Thr Ile Ala
3005 3010 3015
Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro Asn
3020 3025 3030
Pro Arg Gln Leu Ala Ala His Leu Cys Asp Glu Leu Ile Gly Val
3035 3040 3045
Gly Ala Asp Pro Val Gly Ala Asp Val Val Val Arg Gly Ser Ser
3050 3055 3060
Asp Glu Pro Leu Ala Val Val Gly Met Ala Cys Arg Tyr Ala Gly
3065 3070 3075
Gly Val Ser Thr Pro Glu Asp Leu Trp Gln Met Val Ala Glu Asn
3080 3085 3090
Arg Glu Gly Leu Thr Asp Val Pro Ser Tyr Arg Gly Trp Glu Gly
3095 3100 3105
Trp Asn Val Ala Ser Leu Arg Arg Ala Gly Phe Leu His Glu Ala
3110 3115 3120
Gly Asp Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala
3125 3130 3135
Ala Thr Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Val Ser Trp
3140 3145 3150
Glu Ala Val Glu Arg Ala Gly Ile Asp Pro Lys Ser Leu Arg Gly
3155 3160 3165
Ser Asp Thr Gly Val Phe Val Gly Gly Thr Ala Val Glu Tyr Gly
3170 3175 3180
Ala Leu Leu Met Asn Ser Pro Thr Gly Gln Gly Tyr Ala Val Thr
3185 3190 3195
Ser Ser Ser Gly Ser Val Leu Ser Gly Arg Val Ser Tyr Thr Leu
3200 3205 3210
Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser
3215 3220 3225
Ser Leu Val Ala Leu His Leu Ala Ala Gln Ala Leu Arg Asn Gly
3230 3235 3240
Glu Cys Gly Leu Ala Leu Thr Gly Gly Val Gly Leu Met Ala Thr
3245 3250 3255
Pro Gly Gly Phe Val Glu Phe Asp Thr Leu Gly Gly Leu Ser Ser
3260 3265 3270
Asp Gly His Thr Lys Ala Phe Ala Ala Ser Ala Asp Gly Ile Gly
3275 3280 3285
Trp Gly Glu Gly Val Gly Met Ile Val Leu Glu Arg Leu Ser Asp
3290 3295 3300
Ala Arg Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser
3305 3310 3315
Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn
3320 3325 3330
Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Val Ala Asn Ala
3335 3340 3345
Gly Leu Thr Leu Ala Asp Ile Asp Met Val Glu Ala His Gly Thr
3350 3355 3360
Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Asn
3365 3370 3375
Thr Tyr Gly Gln Glu Arg His Asp Gly Gln Pro Leu Trp Leu Gly
3380 3385 3390
Ser Val Lys Thr Asn Ile Gly His Thr Gly Ala Ala Ala Gly Val
3395 3400 3405
Ala Gly Ile Ile Lys Ser Val Leu Ala Leu Arg Asn Gly Val Met
3410 3415 3420
Pro Met Thr Leu Asn Val Asp Gly Pro Thr Pro Lys Val Asp Trp
3425 3430 3435
Ser Ala Gly Ala Val Glu Leu Leu Thr Gln Gly Arg Glu Trp Pro
3440 3445 3450
Gln Thr Asp Arg Thr Arg Arg Ala Gly Val Ser Ser Phe Gly Ile
3455 3460 3465
Ser Gly Thr Asn Ala His Val Ile Ile Glu Glu Ala Pro Pro Ala
3470 3475 3480
Glu Glu Pro Pro Ala Gln Pro Gly Thr Asp Leu Pro Ala Ala Pro
3485 3490 3495
Ala Leu Ala Thr Pro Val Val Pro Trp Val Phe Ser Gly Arg Ser
3500 3505 3510
Asn Gly Ala Leu Arg Gly Gln Ala Glu Arg Leu Ser Ala Leu Ala
3515 3520 3525
Glu Asn Glu Pro Gly Leu Asp Leu Thr Asp Ala Ala Phe Ser Leu
3530 3535 3540
Ala Thr Thr Arg Ala Ser Leu Glu His Arg Ala Val Val Leu Gly
3545 3550 3555
Arg Asp Thr Ser Glu Met Leu Asp Gly Leu Arg Gly Leu Thr Ala
3560 3565 3570
Gln Gly Ser Val Ala Gly Val Val Ser Gly Val Thr Ala Ala Asp
3575 3580 3585
Ser Arg Ala Val Phe Val Phe Pro Gly Gln Gly Ser Gln Trp Val
3590 3595 3600
Gly Met Gly Arg Glu Leu Trp Glu Val Ser Ser Val Phe Ala Glu
3605 3610 3615
Ser Met Val Ala Cys Glu Arg Ala Leu Val Pro Phe Val Asp Trp
3620 3625 3630
Ser Leu Arg Asp Val Val Phe Gly Gly Gly Gly Asp Gly Leu Trp
3635 3640 3645
Glu Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Val Met Val
3650 3655 3660
Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro Ala Ala
3665 3670 3675
Val Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val Ala
3680 3685 3690
Gly Gly Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg
3695 3700 3705
Ser Arg Leu Val Arg Asp Gly Leu Ser Gly Arg Gly Gly Met Val
3710 3715 3720
Ser Val Gly Leu Ser Val Gly Glu Val Glu Glu Trp Leu Ala Gly
3725 3730 3735
Leu Gly Gly Arg Val Gly Val Ala Ala Val Asn Gly Pro Ser Ser
3740 3745 3750
Val Val Val Ser Gly Glu Ala Glu Val Leu Glu Gly Leu Leu Ala
3755 3760 3765
Gly Phe Glu Gly Ala Gly Val Arg Ala Arg Arg Ile Ala Val Asp
3770 3775 3780
Tyr Ala Ser His Ser Val Gln Val Asp Ala Leu Gly Asp Asp Leu
3785 3790 3795
Leu Ala Gly Leu Ala Gly Ile Arg Pro Val Ser Ser Ser Val Ala
3800 3805 3810
Phe Tyr Ser Thr Val Ser Gly Glu Arg Met Asp Thr Ala Gly Leu
3815 3820 3825
Asp Ala Gly Tyr Trp Val Ala Asn Leu Arg Glu Arg Val Leu Phe
3830 3835 3840
Glu Pro Val Val Arg Met Leu Val Glu Arg Gly Ser Ala Val Phe
3845 3850 3855
Val Glu Ser Ser Pro His Pro Val Leu Ala Met Ala Val Gln Glu
3860 3865 3870
Thr Gly Glu Ala Val Gly Arg Ser Val Val Ala Val Gly Ser Leu
3875 3880 3885
Arg Arg Asp Asp Gly Gly Ala Gly Arg Phe Leu Ala Ser Leu Ala
3890 3895 3900
Glu Ala Tyr Val Val Gly Ala Pro Val Asp Trp Ser Val Leu Phe
3905 3910 3915
Ala Gly Ala Gly Ala Arg Arg Val Asp Leu Pro Thr Tyr Ala Phe
3920 3925 3930
Gln His Gln Arg Tyr Trp Leu Glu Gly Val Thr Val Gly Gly Glu
3935 3940 3945
Pro Gln Asp Thr Val Glu Asp Asp Thr Asp Ala Ala Phe Trp Asp
3950 3955 3960
Ala Val Glu Arg Glu Ser Leu Ser Asp Leu Ala Glu Val Leu Asp
3965 3970 3975
Val Ser Asp Ala Gly Ala Ala Ala Glu Ala Trp Leu Pro Thr Leu
3980 3985 3990
Ser Ala Trp Arg Lys Gly Arg Arg Arg Gln Met Thr Leu Asp Ser
3995 4000 4005
Trp Arg Tyr Arg Thr Thr Trp Arg Ala Tyr Ser Leu Pro Ser Gly
4010 4015 4020
Thr Arg Leu Ser Gly Met Trp Val Val Val Ala Ser Gly Gly Asp
4025 4030 4035
Ala Pro Val Val Glu Val Arg Arg Ala Leu Glu Ala Ala Gly Ala
4040 4045 4050
Glu Val Ser Val Arg Glu Val Leu Asp Gly Val Ala Leu Ala Asp
4055 4060 4065
Val Ser Gly Val Val Ser Leu Leu Ala Trp Asp Glu Gly Ser Ala
4070 4075 4080
Leu Glu Ser Met Leu Arg Leu Val Arg Ala Val Gly Gly Gly Glu
4085 4090 4095
Val Pro Leu Trp Val Leu Thr Arg Gly Ala Ala Val Val Gly Val
4100 4105 4110
Asp Asp Pro Val Ser Ala Val Gln Ser Gln Val Trp Ala Leu Gly
4115 4120 4125
Gln Val Val Gly Leu Glu Gln Pro Gln Gly Trp Gly Gly Leu Val
4130 4135 4140
Asp Val Pro Gly Val Trp Asp Glu Arg Val Ala Ser Leu Leu Ala
4145 4150 4155
Gly Val Leu Ala Ala Gly Glu Gly Glu Asp Gln Val Ala Val Arg
4160 4165 4170
Ser Ser Gly Val Tyr Gly Arg Arg Leu Val Arg Ala Pro Leu Gly
4175 4180 4185
Gly Ser Pro Val Pro Val Arg Glu Trp Gly Pro Ser Gly Thr Val
4190 4195 4200
Leu Val Thr Gly Gly Thr Gly Gly Ile Gly Gly His Leu Ala Arg
4205 4210 4215
Trp Leu Ala Lys Glu Gly Ala Glu His Leu Leu Leu Val Ser Arg
4220 4225 4230
Gly Glu Arg Ala Gln Gly Ala Ala Glu Leu Val Glu Glu Val Arg
4235 4240 4245
Gly Leu Gly Ala Glu Val Thr Val Ala Ala Cys Asp Val Thr Asp
4250 4255 4260
Arg Ala Ala Leu Ala Glu Leu Leu Ala Glu His Pro Val Thr Ser
4265 4270 4275
Ile Phe His Thr Ala Gly Ile Ala Ala His Gly Phe Leu Thr Asp
4280 4285 4290
Leu Asp Pro Ala Glu Leu Gly Asp Gln Met Gly Ala Arg Val Val
4295 4300 4305
Gly Ala Arg His Leu Asp Glu Leu Ser Val Glu Leu Gly Leu Asp
4310 4315 4320
Leu Asp Ala Phe Val Val Phe Ser Thr Gly Ala Ser Val Trp Gly
4325 4330 4335
Ser Ala Gly Asn Gly Ala Asn Ala Ala Ala Gly Gly Tyr Leu Asp
4340 4345 4350
Gly Leu Ile Arg Gly Arg Arg Ala Arg Gly Leu Val Gly Ser Ser
4355 4360 4365
Val Ser Trp Gly Gly Trp Gly Ala Thr Ala Met Ala Val Gly Glu
4370 4375 4380
Thr Ala Glu Arg Leu Ser Arg Arg Gly Val Arg Leu Leu Glu Pro
4385 4390 4395
Glu Leu Ala Val Arg Ala Leu Arg Gln Val Leu Glu Gln Asp Glu
4400 4405 4410
Val Ser Val Thr Val Ala Asp Leu Asp Trp Ser Leu Phe Thr Pro
4415 4420 4425
Gly Tyr Ala Met Ala Arg Arg Arg Pro Leu Ile Glu Asp Ile Pro
4430 4435 4440
Glu Ala Ala Arg Ala Leu Arg Asp Ile Thr Glu Thr Asp Glu Thr
4445 4450 4455
Gln Asp Ala Ala Ala Gly Gly Leu Arg Glu Arg Leu Ala Gly Leu
4460 4465 4470
Ala Glu Ser Glu Gln Gln Ala Leu Leu Leu Gly Leu Val Arg Gly
4475 4480 4485
Glu Ala Ala Gln Val Leu Ala His Gly Ser Thr Ala Glu Ile Thr
4490 4495 4500
Pro Ser Arg Pro Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr Gly
4505 4510 4515
Met Glu Leu Arg Asn Arg Leu Ser Lys Ala Thr Gly Leu Arg Leu
4520 4525 4530
Pro Ala Thr Leu Val Phe Asp Tyr Pro Asn Pro Gln Arg Val Thr
4535 4540 4545
Asp Leu Leu Leu Thr Asp Leu Asp Gln Gln Asp Gly Arg Pro Gly
4550 4555 4560
Ile Ala Asp Val Leu Asp Ile Lys Arg Glu Leu Ser Arg Ile Gly
4565 4570 4575
Glu Ala Leu Glu Gly Val Ala Pro Asp Gln Gln Ala Arg Glu Asp
4580 4585 4590
Ile Val Ala His Leu Arg Asp Leu Ile Thr Gln Leu Ser Ala Thr
4595 4600 4605
Glu Gln His Gly Ala Thr Asp Leu Glu Ala Ala Thr Asp Asp Glu
4610 4615 4620
Ile Phe Asp Phe Ile Asp Arg Asp Leu Gly Val Ser
4625 4630 4635
<210>8
<211>3593
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n.sp. NS3226)
<400>8
Met Thr Glu Asp Lys Leu Arg Thr Tyr Leu Arg Arg Val Thr Ala Glu
1 5 10 15
Leu Gln Gln Thr Arg Gln Gln Leu Lys Asp Ser Gln Asp Arg Gly Arg
20 25 30
Glu Pro Leu Ala Ile Val Gly Met Ala Cys Arg Leu Pro Gly Gly Ala
35 40 45
Asp Ser Pro Glu Gln Leu Trp Gln Met Val Arg Asp Gly Ala Asp Gly
50 55 60
Val Gly Gly Phe Pro Asp Asp Arg Gly Trp Asp Leu Thr Ser Leu Leu
65 70 75 80
Ser Asp Asp Pro Asp Arg Pro Gly Thr Thr Tyr Thr Gln Glu Gly Ala
85 90 95
Phe Leu Lys Gly Ala Gly Asp Phe Asp Ala Gly Leu Phe Gly Ile Ser
100 105 110
Pro Arg Glu Ala Ala Thr Met Asp Pro Gln Gln Arg Leu Leu Leu Glu
115 120 125
Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp Pro His Ser Leu
130 135 140
Arg Gly Ser Arg Thr Gly Val Phe Val Gly Gly Thr Ala Ile Glu His
145 150 155 160
Ile Val Lys Leu Met Asn Ser Pro Thr Asp Gln Gly Tyr Ala Ile Thr
165 170 175
Gly Gly Ser Gly Ser Ile Met Ser Gly Arg Ile Ser Tyr Val Leu Gly
180 185 190
Leu Glu Gly Pro Ala Val Thr Ile Asp Thr Ala Cys Ser Ser Ser Leu
195 200 205
Val Ala Leu His Ser Ala Val Gln Ser Leu Arg Gln Gly Asp Cys Ser
210 215 220
Leu Ala Leu Ala Gly Gly Val Ala Val Met Ala Thr Pro Ser Ala Phe
225 230 235 240
Val Thr Phe Ala Arg Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys
245 250 255
Ala Phe Ser Asp Asp Ala Asp Gly Ile Gly Trp Gly Glu Gly Val Ala
260 265 270
Val Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Glu
275 280 285
Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser
290 295 300
Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg
305 310 315 320
Gln Ala Val Ala Asn Ala Gly Leu Thr Leu Ala Asp Val Asp Met Val
325 330 335
Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln
340 345 350
Ala Leu Leu Asn Thr Tyr Gly Gln Glu Arg His Asp Gly Gln Pro Leu
355 360 365
Trp Leu Gly Ser Leu Lys Ser Asn Ile Ala His Thr Gln Gly Val Ser
370 375 380
Gly Val Ala Gly Val Ile Lys Thr Val Leu Ala Leu Arg His Gly Ile
385 390 395 400
Leu Pro Lys Thr Leu His Val Gly Glu Arg Ser Ser Gln Val Asp Trp
405 410 415
Ser Val Gly Ala Val Glu Leu Leu Thr Glu Ala Arg Glu Trp Pro Glu
420 425 430
Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser Gly
435 440 445
Thr Asn Val His Val Ile Ile Glu Gln Ala Pro Gln Glu Glu Ser Ala
450 455 460
Glu Pro Arg Thr Asp Glu Ala Pro Ser Leu Glu Ser Pro Phe Ala Thr
465 470 475 480
Lys Pro Ala Thr Leu Pro Trp Leu Ile Ser Gly Asn Thr Glu Ala Ala
485 490 495
Leu Arg Glu Gln Ala Ala Arg Leu Arg Ala His Leu Asn Ala His Pro
500 505 510
Gly Leu Ala Ala Ala Asp Ile Gly His Ser Leu Leu Thr Ser Arg Thr
515 520 525
Arg Phe Ala His Arg Ala Val Leu Leu Thr Glu Gln Asp Gly Asp Arg
530 535 540
Arg Thr Ala Leu Thr Ala Leu Ala Asp Gly Leu Asp Ala Pro Gly Leu
545 550 555 560
Ile Arg Gly Thr Gly Asp Thr Gly Ala Gly Val Val Phe Val Phe Pro
565 570 575
Gly Gln Gly Ser Gln Trp Val Gly Met Gly Arg Glu Leu Trp Glu Val
580 585 590
Ser Ser Val Phe Ala Glu Ser Met Val Ala Cys Glu Arg Ala Leu Ala
595 600 605
Pro Phe Val Gly Trp Ser Leu Arg Asp Val Val Phe Glu Gly Gly Gly
610 615 620
Glu Gly Leu Trp Gly Arg Val Asp Val Val Gln Pro Val Leu Trp Ala
625 630 635 640
Val Met Val Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro
645 650 655
Val Gly Val Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val
660 665 670
Ala Gly Gly Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg
675 680 685
Ser Arg Leu Val Gly Glu Arg Leu Ser Gly Arg Gly Gly Met Val Ser
690 695 700
Val Thr Leu Pro Val Ala Gln Val Glu Glu Trp Leu Ala Gly Ser Gly
705 710 715 720
Gly Arg Val Gly Val Ala Ala Val Asn Gly Pro Ser Ser Val Val Val
725 730 735
Ser Gly Glu Val Glu Ala Leu Asp Gly Leu Leu Val Glu Leu Asp Gly
740 745 750
Ala Gly Val Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser
755 760 765
Ala Gln Val Asp Ala Leu Asn Asp Asp Leu Leu Ala Gly Leu Ala Asp
770 775 780
Ile Arg Pro Val Ser Ser Pro Val Ala Phe Tyr Ser Thr Val Thr Gly
785 790 795 800
Glu Arg Met Asp Thr Ala Gly Leu Asp Ala Ala Tyr Trp Ala Ala Asn
805 810 815
Leu Arg Glu Arg Val Leu Phe Glu Pro Val Val Arg Thr Leu Ala Glu
820 825 830
Leu Glu His Gln Val Phe Val Glu Ser Ser Pro His Pro Val Leu Ala
835 840 845
Met Ala Val Gln Glu Thr Leu Glu Ser Ala Ser Gly Ala Gly Ala Ala
850 855 860
Val Gly Ser Leu Arg Arg Asp Asp Gly Gly Ala Gly Arg Phe Leu Ala
865 870 875 880
Ser Leu Ala Glu Ala Tyr Val Ala Gly Ala Pro Val Asp Trp Ser Val
885 890 895
Leu Phe Glu Gly Thr Gly Thr Arg Arg Val Asp Leu Pro Thr Tyr Ala
900 905 910
Phe Gln His Gln Arg Tyr Trp Leu Glu Asp Ala Ser Ala Pro Gly Ala
915 920 925
Glu Gly Val Val Asp Pro Val Asp Ala Ala Phe Trp Gly Ala Val Glu
930 935 940
Arg Ala Asp Val Gln Gly Val Ala Ala Leu Val Asp Gly Ser Val Pro
945 950 955 960
Gly Val Trp Glu Pro Val Val Pro Val Leu Ser Ala Trp Arg Lys Gly
965 970 975
Arg Glu Glu Arg Ser Val Leu Asp Ser Trp Arg Tyr Arg Thr Thr Trp
980 985 990
Arg Ala Phe Ser Leu Pro Ser Gly Thr Arg Leu Ser Gly Met Trp Leu
995 1000 1005
Val Val Ala Ser Gly Gly Asp Ala Pro Val Asp Glu Val Arg Gln
1010 1015 1020
Ala Leu Glu Ala Ala Gly Ala Glu Val Cys Val Arg Ala Asp Leu
1025 1030 1035
Asp Gly Ala Ala Leu Ala Gly Val Ser Gly Val Val Ser Leu Leu
1040 1045 1050
Ala Trp Asp Glu Gly Ser Ala Val Val Ser Thr Val Gly Leu Val
1055 1060 1065
Gln Ala Cys Gly Gly Gly Gly Glu Val Pro Leu Trp Val Leu Thr
1070 1075 1080
Arg Gly Ala Ala Val Val Gly Val Asp Asp Pro Val Ser Ala Val
1085 1090 1095
Gln Ser Gln Val Trp Ala Leu Gly Gln Val Val Gly Leu Glu Gln
1100 1105 1110
Pro Gly Gly Trp Gly Gly Leu Val Asp Val Pro Gly Val Trp Asp
1115 1120 1125
Glu Arg Val Ala Ser Leu Leu Ala Gly Val Leu Ala Ala Gly Gly
1130 1135 1140
Gly Glu Asp Gln Val Ala Val Arg Ser Ser Gly Ala Tyr Gly Arg
1145 1150 1155
Arg Leu Val Arg Ala Pro Leu Gly Ala Ser Pro Val Arg Val Arg
1160 1165 1170
Glu Trp Ser Pro Ser Gly Thr Ala Leu Val Thr Gly Gly Thr Gly
1175 1180 1185
Gly Ile Gly Gly His Leu Ala Arg Trp Leu Ala Arg Glu Gly Val
1190 1195 1200
Gly His Leu Leu Leu Val Ser Arg Arg Gly Pro Glu Ala Glu Gly
1205 1210 1215
Val Ala Glu Leu Val Glu Glu Leu Gly Gly Leu Gly Val Glu Val
1220 1225 1230
Thr Val Val Ala Cys Asp Val Thr Asp Arg Ala Ala Leu Ala Glu
1235 1240 1245
Leu Leu Ala Thr Ile Pro Ala Glu Tyr Pro Leu Thr Ser Val Phe
1250 1255 1260
His Ala Ala Gly Ile Ala Gly Tyr Gly Leu Val Arg Glu Leu Asp
1265 1270 1275
Ala Ala Gly Leu Asp Ala Glu Met Ala Ala Lys Thr Leu Gly Ala
1280 1285 1290
Arg His Leu Asp Glu Leu Thr Ala Glu Leu Gly Leu Asp Leu Asp
1295 1300 1305
Ala Phe Val Val Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Ala
1310 1315 1320
Gly Ser Gly Gly Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Gly Leu
1325 1330 1335
Ala Arg Glu Arg Arg Ala Arg Gly Leu Val Ala Thr Ser Val Ser
1340 1345 1350
Trp Gly Asn Trp Lys Asn Thr Gly Leu Ala Thr Asp Thr Thr Ala
1355 1360 1365
Glu Gln Leu Thr Arg Ile Gly Val Arg Pro Met Glu Pro Glu Leu
1370 1375 1380
Ala Val Arg Ala Leu Arg Gln Ala Leu Glu Gln Asp Glu Val Ser
1385 1390 1395
Met Thr Val Ala Asp Met Asp Trp Ser Leu Phe Thr Pro Gly Tyr
1400 1405 1410
Ala Leu Ala Arg Arg Arg Pro Leu Ile Glu Glu Ile Pro Glu Ala
1415 1420 1425
Ala Arg Ala Leu Ser Glu Asp Ser Ala Asp Pro Ala Asn Asp Thr
1430 1435 1440
Val Gly Gly Asp Ser Pro Leu Arg Gln Ser Leu Ala Ala Leu Thr
1445 1450 1455
Glu Ser Glu Gln His Glu Arg Leu Leu Gly Ala Val Arg Thr Glu
1460 1465 1470
Ala Ala Ala Val Leu Thr His Ser Thr Thr Asp Glu Ile Thr Ala
1475 1480 1485
Gly Lys Pro Phe Arg Asp Leu Gly Phe Asp Ser Leu Thr Ala Met
1490 1495 1500
Glu Leu Arg Asn Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Pro
1505 1510 1515
Ala Thr Ile Val Phe Asp Tyr Pro Thr Pro Arg Arg Leu Ala Gly
1520 1525 1530
His Leu His Asp Lys Leu Phe Asp Ser Gly Ala Glu Val Ala Leu
1535 1540 1545
Pro Gln Leu Arg Ala Thr Asp Asp Asp Pro Ile Val Ile Val Gly
1550 1555 1560
Met Ala Cys Arg Phe Pro Gly Gly Val Arg Gly Pro Glu Asp Leu
1565 1570 1575
Trp Arg Leu Leu Ala Glu Gly Arg Asp Glu Met Thr Glu Phe Pro
1580 1585 1590
Ala Asp Arg Gly Trp Gln Gly Pro Ala Met Asn Ala Phe Val Glu
1595 1600 1605
Glu Phe Gly Gly Ala Arg Gln Gly Ala Phe Leu Ala Asp Ala Ala
1610 1615 1620
Glu Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Arg
1625 1630 1635
Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu
1640 1645 1650
Val Leu Glu Arg Ala Gly Tyr Asp Pro Val Ser Leu Arg Gly Ser
1655 1660 1665
Arg Thr Gly Val Phe Val Gly Gly Thr Pro Gln Glu Tyr Thr Thr
1670 1675 1680
Val Leu Met Asn Ser Ala Glu Ala Gly Ser Gly Tyr Ala Leu Thr
1685 1690 1695
Gly Thr Ser Gly Ser Val Met Ser Gly Arg Val Ala Tyr Thr Leu
1700 1705 1710
Gly Leu Glu Gly Pro Ala Val Thr Ile Asp Thr Ala Cys Ser Ser
1715 1720 1725
Ser Leu Val Thr Leu His Leu Ala Ala Gln Ala Leu Arg Gly Gly
1730 1735 1740
Glu Cys Asp Leu Ala Leu Val Gly Gly Val Thr Val Met Ala Thr
1745 1750 1755
Pro Gly Ala Phe Val Glu Phe Ala Arg Gln Gly Gly Leu Ala Gly
1760 1765 1770
Asp Gly Arg Cys Lys Ala Phe Ala Ala Gly Ala Asp Gly Thr Gly
1775 1780 1785
Trp Gly Glu Gly Val Gly Met Leu Ala Val Gln Arg Leu Ser Asp
1790 1795 1800
Ala Val Arg Asp Gly Arg Arg Val Leu Ala Val Val Arg Gly Ser
1805 1810 1815
Ala Val Asn Ser Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn
1820 1825 1830
Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Ser Ala
1835 1840 1845
Gly Leu Ser Ala Ala Asp Val Asp Val Val Glu Gly His Gly Thr
1850 1855 1860
Gly Thr Ala Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala
1865 1870 1875
Thr Tyr Gly Gln Asp Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly
1880 1885 1890
Ser Val Lys Ser Asn Ile Gly His Thr Gln Tyr Ala Ala Gly Val
1895 1900 1905
Ala Gly Val Ile Lys Ala Val Leu Ala Leu Gln His Arg Leu Leu
1910 1915 1920
Pro Lys Thr Leu His Val Glu Glu Pro Thr Pro Glu Val Asp Trp
1925 1930 1935
Ser Ser Gly Ala Val Gly Val Leu Thr Glu Ala Arg Glu Trp Pro
1940 1945 1950
Glu Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Ile
1955 1960 1965
Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Glu Ala
1970 1975 1980
Val Glu Glu Ser Ala Ser Gly Glu Thr Gly Ser Val Leu Val Pro
1985 1990 1995
Trp Val Ile Ser Ala Arg Ser Glu Gln Ala Leu Arg Glu Gln Ala
2000 2005 2010
Arg Arg Leu Ala Gly His Leu Arg Ala His Asp Leu Arg Pro Val
2015 2020 2025
Asp Val Gly Phe Ser Leu Ala Thr Thr Arg Ala Gly Leu Glu His
2030 2035 2040
Arg Ala Val Leu Val Gly Arg Glu Thr Ser Glu Phe Leu Ala Gln
2045 2050 2055
Leu Glu Thr Val Ala Gly Asp Gly Pro Val Ser Glu Gly Gly Thr
2060 2065 2070
Ala Phe Leu Phe Ser Gly Gln Gly Ser Gln Arg Ala Gly Met Gly
2075 2080 2085
Arg Glu Leu Tyr Glu Ala Tyr Pro Val Phe Ala Ala Ala Phe Asp
2090 2095 2100
Glu Val Cys Gly His Leu Asp Val Leu Leu Glu Arg Pro Val Lys
2105 2110 2115
Glu Val Val Phe Ala Gly Gly Lys Ala Leu Asp Arg Thr Val Phe
2120 2125 2130
Thr Gln Ala Gly Leu Phe Ala Leu Glu Val Ala Leu Phe Glu Leu
2135 2140 2145
Val Gly Ser Trp Gly Val Arg Ala Asp Val Leu Leu Gly His Ser
2150 2155 2160
Ile Gly Glu Leu Ala Ala Ala Tyr Ala Ala Gly Val Trp Ser Leu
2165 2170 2175
Glu Asp Ala Cys Arg Val Val Ala Ala Arg Gly Arg Leu Met Gln
2180 2185 2190
Ala Leu Pro Glu Gly Gly Val Met Val Ala Val Glu Ala Ala Glu
2195 2200 2205
Glu Glu Leu Pro Gln Leu Pro Ala Gly Val Ser Val Ala Ala Val
2210 2215 2220
Asn Gly Pro Arg Ser Leu Val Leu Ser Gly Asp Asp Glu Pro Val
2225 2230 2235
Thr Ala Leu Ala Gln Thr Phe Ala Gly Gln Gly Arg Arg Thr Arg
2240 2245 2250
Arg Leu Thr Val Ser His Ala Phe His Ser Ala Trp Met Glu Pro
2255 2260 2265
Met Leu Ala Asp Phe Ala Glu Val Leu Gly Ser Val Glu Phe Arg
2270 2275 2280
Ala Pro Arg Ile Pro Val Val Ser Asn Val Thr Gly Gln Val Ala
2285 2290 2295
Gly Glu Glu Leu Ala Thr Pro Asp Tyr Trp Val Arg His Val Arg
2300 2305 2310
Glu Ala Val Arg Phe Ala Asp Gly Val Thr Thr Val Leu Gly Arg
2315 2320 2325
Gly Val Asp Lys Phe Leu Glu Leu Gly Pro Gly Gly Ala Leu Thr
2330 2335 2340
Ala Met Ala Glu Glu Ala Leu Asp His Thr Gly Thr Asp Ala Val
2345 2350 2355
Cys Ala Pro Val Leu His Pro Glu His Pro Glu Ala Ser Ser Ala
2360 2365 2370
Val Arg Gly Leu Gly Arg Ile Tyr Ala Val Gly Ala Pro Ala Asp
2375 2380 2385
Trp Ser Ala Leu Phe Ala Gly Thr Gly Ala Arg Arg Val Asp Leu
2390 2395 2400
Pro Thr Tyr Ala Phe Gln Arg Arg Arg Phe Trp Leu Asp Ser Leu
2405 2410 2415
Ala Thr Gly Ser Gly Asp Pro Ala Ser Leu Gly Leu Thr Thr Thr
2420 2425 2430
Gly His Pro Leu Leu Gly Ala Gly Val Arg Leu Pro Asp Ser Asp
2435 2440 2445
Gly Phe Leu Phe Thr Gly Arg Leu Ser Leu Ala Thr Gln Pro Trp
2450 2455 2460
Ile Ala Gln His Ala Leu Leu Gly Thr Ala Leu Leu Pro Gly Thr
2465 2470 2475
Ala Phe Val Glu Leu Ala Leu Arg Ala Gly Ala Glu Ser Gly Cys
2480 2485 2490
Glu Val Ile Glu Glu Leu Thr Leu Glu Ala Pro Leu Val Leu Glu
2495 2500 2505
Glu His Gly Gly Arg Ala Val His Val Thr Val Gly Gly Leu Asp
2510 2515 2520
Glu Ser Gly Arg Arg Thr Ile Thr Leu His Ser Arg Pro Asp Gly
2525 2530 2535
Ala Asp Asp Asp Glu Ser Trp Leu Arg His Ala Thr Gly Val Leu
2540 2545 2550
Val Glu Arg Arg Glu Thr Glu Ser Ala Asp Ala Pro Thr Glu Gly
2555 2560 2565
Val Trp Pro Pro Asp Gly Ala Thr Gln Ile Ser Val Gln Asp Phe
2570 2575 2580
Tyr Pro Asp Met Ala Glu Ala Gly Phe Thr Tyr Gly Pro Val Phe
2585 2590 2595
Gln Gly Leu Arg Val Leu Trp Ser Lys Asp Gly Glu Leu Phe Ala
2600 2605 2610
Glu Val Arg Leu Pro Asp Glu Ala Gly Glu Ala Gly Asp Glu Gly
2615 2620 2625
Ser Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu Gln
2630 2635 2640
Pro Leu Ala Leu Ser Val Leu Gly Gly Thr Asp Gly Arg Gln Pro
2645 2650 2655
Val Lys Gly Gly Met Pro Phe Val Trp Thr Gly Val Arg Leu His
2660 2665 2670
Ala Thr His Ala Thr Val Ala Arg Val Lys Leu Ala Pro Val Gly
2675 2680 2685
Arg Ser Glu Val Ser Val Val Val Thr Asp Asp Ser Gly Leu Pro
2690 2695 2700
Ile Ala Thr Val Asp Ser Leu Ala Met Arg Asp Pro Ile Leu Glu
2705 2710 2715
Gln Phe Thr Ala Ser Ala Pro Arg Gln Asp Ala Leu Phe Gly Val
2720 2725 2730
Arg Trp Thr Pro Ile Pro Leu Ala Ala His Ala Glu Pro Gly Glu
2735 2740 2745
Trp Ala Met Leu Gly Phe Asp Pro Leu Glu Ile Arg Gln Arg Leu
2750 2755 2760
Val Glu Ala Gly Leu Thr Gly Thr Pro Tyr Leu Asp Pro Gln Ser
2765 2770 2775
Leu Ile Asp Thr Val Glu Ser Gly Lys Pro Val Pro Pro Val Val
2780 2785 2790
Ala Val Ser Cys Phe Gly Gly Gly Gly Ser Thr Val Thr Ala Thr
2795 2800 2805
His Glu Ala Val Gly Arg Ala Leu Gly Val Leu Gln His Trp Leu
2810 2815 2820
Ala Asp Ala Arg Leu Met Ser Ser Arg Leu Val Leu Leu Thr Arg
2825 2830 2835
Gly Ala Val Pro Ala Val Asp Thr Asp Arg Ile Glu Asp Leu Ala
2840 2845 2850
Ala Ser Ala Val Trp Gly Leu Val Arg Ala Ala Gln Ser Glu His
2855 2860 2865
Pro Asp Arg Ile Val Leu Ile Asp Leu Asp Asp Asp Pro Thr Ser
2870 2875 2880
Tyr Arg Ala Leu Pro Ala Ala Leu Gly Thr Gly Glu Pro Gln Leu
2885 2890 2895
Ala Leu Arg Thr Gly Ala Ala Ser Ala Pro Arg Leu Ala Arg His
2900 2905 2910
Thr Gly Ala Pro Glu Val Thr Pro Gly Phe Gly Pro Asp Gly Thr
2915 2920 2925
Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Ala Val Val Ala
2930 2935 2940
Arg His Leu Ala Ala Ala His Gly Val Arg His Leu Val Leu Ala
2945 2950 2955
Ser Arg Ser Gly Ala Glu Ala Ser Gly Ala Asp Ala Leu Leu Ala
2960 2965 2970
Asp Leu Thr Glu Leu Gly Ala Asp Ala Thr Ile Val Ala Cys Asp
2975 2980 2985
Val Ser Asp Arg Ala Ala Leu Ala Ala Leu Leu Asp Ala Ile Pro
2990 2995 3000
Ala Glu Arg Pro Leu Thr Gly Val Val His Thr Ala Gly Val Leu
3005 3010 3015
Ala Asp Gly Thr Val Glu Ser Leu Thr Pro Asp Gln Ala Asp Thr
3020 3025 3030
Val Leu Arg Ala Lys Ala Asp Ala Ala Trp His Leu His Glu Leu
3035 3040 3045
Thr Ala Leu Thr Pro Val Arg Glu Phe Val Leu Phe Ser Ser Ala
3050 3055 3060
Ala Gly Leu Leu Gly Ser Gln Gly Gln Gly Asn Tyr Ala Ala Ala
3065 3070 3075
Asn Ala Phe Leu Asp Ala Leu Ala Ala His Arg Arg Ala Ala Gly
3080 3085 3090
Leu Ala Gly Thr Ser Leu Ala Trp Gly Trp Trp Asp Leu Pro Gly
3095 3100 3105
Gly Met Ala Ala Asp Leu Gly Arg Ala Glu Arg Ala Arg Met Ala
3110 3115 3120
Arg Gly Gly Leu Thr Pro Phe Thr Ala Glu Thr Gly Met Asp Ala
3125 3130 3135
Phe Asp Gln Thr Leu Ala Ala Gly Thr Glu Pro Leu Leu Val Pro
3140 3145 3150
Met Arg Met Asn Thr Ala Val Ala Arg Ala Ser Ala Gly Gln Gln
3155 3160 3165
Ile Pro Ser Val Leu Arg Gly Leu Val Arg Ala Pro Arg Arg Arg
3170 3175 3180
Ala Val Arg Ser Asp Glu Gly Ser Ala Ser Arg Leu Arg Glu Arg
3185 3190 3195
Leu Ala Gly Ala Asn Ala Asp Glu Arg Leu Ala Met Leu Thr Glu
3200 3205 3210
Leu Val Arg Val Glu Ala Ala Gln Val Leu Gly His Ser Gly Ala
3215 3220 3225
Glu Ala Val Glu Asp Gly Ser Ser Phe Ala Glu Leu Gly Phe Asp
3230 3235 3240
Ser Leu Thr Ser Val Glu Leu Arg Asn Arg Ile Gly Glu Arg Thr
3245 3250 3255
Gly Leu Arg Leu Ala Ser Thr Val Val Phe Asp His Pro Thr Pro
3260 3265 3270
Ala Ala Leu Ala Ala Glu Leu Gly Asp Arg Leu Gly Asp Thr Ala
3275 3280 3285
Asp Phe Val Ser Ala Ala Gln Pro Ser Glu Ala Pro Gly Ala Gly
3290 3295 3300
Gly Ser Gly Val Glu Thr Thr Ala Asp Thr Ala Val Ile Asn Gly
3305 3310 3315
Val Glu Ala Leu Tyr Arg Arg Ser Ile Glu Leu Gly Arg Leu Asp
3320 3325 3330
Leu Gly His Ser Val Leu Lys Asn Ser Val Asp Leu Arg Ala Ser
3335 3340 3345
Phe Ser Val Pro Asp Glu Val Arg Asn Gly Pro Glu Leu Val Arg
3350 3355 3360
Leu Val Glu Gly Ala Gln His Pro Lys Ile Ile Cys Phe Pro Ser
3365 3370 3375
Gln Ser Val Trp Ala Ser Asn Gln Glu Leu Val Gly Met Ala Val
3380 3385 3390
Pro Leu Arg Gly Val Arg Asp Leu Trp Ser Leu Met Leu Pro Gly
3395 3400 3405
Phe Val Thr Gly Gln Pro Val Ala Ala Asp Val Asp Ala Ala Ala
3410 3415 3420
Glu Tyr Ala Val Arg Leu Ile Glu Glu Leu Val Gln Asp Glu Pro
3425 3430 3435
Phe Val Leu Ala Gly Arg Ser Ser Gly Gly Arg Ile Ala His Glu
3440 3445 3450
Val Ala Val Arg Leu Glu Gly Arg Gly Arg Ala Pro Lys Gly Leu
3455 3460 3465
Val Leu Ile Asp Ser Tyr Met Ala Gly Tyr Glu Ala Thr Ser Tyr
3470 3475 3480
Ile Thr Pro Val Met Glu Ser Lys Ala Leu Glu Leu Glu Lys Asp
3485 3490 3495
Phe Gly Gln Met Thr Gly Thr Arg Leu Thr Ala Met Ala Ala Tyr
3500 3505 3510
Phe Ala Met Phe Glu Ala Trp Gln Pro Glu Glu Thr Ser Val Pro
3515 3520 3525
Thr Leu Leu Val Arg Ala Ser Glu Arg Tyr Gly Ile Glu Pro Gly
3530 3535 3540
Gln Glu Gln Pro Pro Ala Glu Glu Trp Gln Ser Ala Trp Pro Leu
3545 3550 3555
Pro His Asp Ala Ile Asp Val Pro Gly Asn His Tyr Ser Met Ile
3560 3565 3570
Glu Gly Ser Gly Asp Val Thr Ala Ala Ala Val His Arg Trp Leu
3575 3580 3585
Val Glu Arg Asp Ala
3590
<210>9
<211>405
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>1
Met Ala Glu Ala Pro Ser Glu Pro Ile Pro Phe Pro Phe Pro Asp Pro
1 5 10 15
Pro Ser Val Cys Glu Leu Pro Pro Glu Leu Ala Glu Val Arg Asp Gly
20 25 30
Glu Ser Val Val Glu Val Lys Phe Pro Asp Gly Ile Thr Gly Trp Met
35 40 45
Val Thr Lys His Ala Asp Val Arg Lys Val Leu Leu Asp Pro Arg Phe
50 55 60
Ser Ser Arg Val Ile Ala Thr Ala Ala Ala Ala Met Ser Glu Thr Glu
65 70 75 80
Thr Gly Lys Leu Met Asn Glu Ser Leu Val Gly Met Asp Pro Pro Glu
85 90 95
His Thr Arg Leu Arg Lys Leu Val Ser Lys Ala Phe Thr Ala Arg Arg
100 105 110
Val Glu Gln Leu Arg Pro Arg Ile Val Glu Leu Val Val Glu Leu Leu
115 120 125
Asp Glu Leu Gln Thr Leu Pro Arg Pro Val Asp Leu Val Lys Asn Phe
130 135 140
Ala Val Pro Leu Pro Val Arg Val Val Cys Glu Leu Leu Gly Val Pro
145 150 155 160
Ala Gly Asp Gln Asp Thr Phe His Ala Trp Ser Asn Ala Leu Leu Gly
165 170 175
Asp Trp His Gln Val Ala Glu Lys Glu Ala Ala Thr Val Ala Leu Val
180 185 190
Asn Tyr Phe Gly Asp Leu Ile Ala Val Lys Arg Gln Lys Pro Ala Asp
195 200 205
Asp Met Ile Ser Glu Leu Ile Ala Val Ser Glu Glu Glu Asp Ser Thr
210 215 220
Leu Thr Glu Arg Glu Ile Ile Thr Leu Ser Ile Gly Ile Leu Ser Ala
225 230 235 240
Gly His Glu Thr Thr Ala Asn Leu Ile Ser Met Phe Leu Leu Thr Leu
245 250 255
Leu His His Pro Glu Glu Phe Asp Lys Leu Arg Ala Asn Pro Glu Ala
260 265 270
Leu Pro Lys Ala Ile Asp Glu Leu Leu Arg Phe Val Pro Leu Thr Ala
275 280 285
Thr Gly Gly Ile Thr Pro Arg Leu Thr Thr Ala Glu Val Glu Leu Ser
290 295 300
Asn Gly Lys Val Leu Pro Ala Gly Val Val Val Leu Pro Ala Val Ala
305 310 315 320
Thr Ala Asn Arg Asp Pro Asp Val Phe Glu Asp Gly Asp Arg Leu Asp
325 330 335
Leu Ala Arg Glu Gln Asn Pro His Leu Ala Phe Ser Thr Gly Ile His
340 345 350
Tyr Cys Leu Gly Ala Gln Leu Ala Arg Ile Glu Leu Gln Glu Ala Phe
355 360 365
Arg Ala Ile Met Glu Arg Met Pro Glu Val Arg Leu Ala Val Pro Glu
370 375 380
Ser Glu Leu Arg Leu Lys Pro Ala Ser Ile Leu Arg Gly Leu Glu Ser
385 390 395 400
Leu Pro Ile Thr Trp
405
<210>10
<211>167
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>1
Met Gly Val Phe Glu Gln Glu Ala Ala Glu Ser Thr Gly Glu Lys Phe
1 5 10 15
Val Arg Pro Ala Ala Pro Glu Arg Met Arg Asp Leu Asp Phe Leu Leu
20 25 30
Gly Asp Phe Arg Val Glu Trp Thr Asn Phe Thr Ala Asp Pro Pro Val
35 40 45
Lys Gly Thr Ala Ala Trp Asn Thr Val Ser Thr Phe Ala Gly His Ala
50 55 60
Tyr Glu Met Thr Gln Leu Val Pro Lys Asp Asp Leu Thr Gly Arg Phe
65 70 75 80
Val Ile Gln Trp Val Glu Ser Glu Ser Ser Phe Ser Gly Tyr Tyr Tyr
85 90 95
Asp Asp Trp Gly Asn Arg Thr Leu Leu Thr Ala Lys Gly Trp Gln Asp
100 105 110
Gly Tyr Leu Ser Phe Thr Gly Glu Cys Ile Gly Phe Gly Arg Trp Phe
115 120 125
Leu Leu Lys Glu Arg Tyr Gln Val Ile Asp Glu Asn His Tyr Leu Lys
130 135 140
Cys Gly Phe Ile Arg Phe Glu Ala Asp Gly Glu Trp Val Pro Ala Asp
145 150 155 160
Glu Val His Cys Tyr Arg Val
165
<210>11
<211>317
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>1
Met Thr Ser Thr Asp Asp Ile Leu Gly Lys Gly Thr Thr Ile Ile Ser
1 5 10 15
Arg Arg Ser Thr Ala Ala Arg Glu His Gly Gly Glu Arg Leu Pro Thr
20 25 30
Arg Leu Pro Thr Pro Ser His Thr Thr Ser Ser Arg Ala Asp Gly Phe
35 40 45
Ser Ala Gly Ala Thr Leu Leu Thr Trp His Arg Arg Leu Val Arg Ala
50 55 60
Arg Glu Pro Asp Leu Gly Val Arg Gln Val Pro Gly Arg Ala Ala Thr
65 70 75 80
Ala Trp Pro Ser Gly Cys Arg Arg Thr Ile Arg Arg Ala Leu Arg Arg
85 90 95
Ser Gly Leu Pro Pro Ala Pro Gln Arg Ala Ser Gln Gln Thr Trp Arg
100 105 110
Ser Phe Leu Arg Ser Gln Ala His Thr Leu Leu Ala Cys Asp Phe Met
115 120 125
Arg Val Glu Thr Val Phe Leu Lys Arg Leu Tyr Val Phe Phe Val Met
130 135 140
Glu Ile Lys Thr Arg Arg Val His Val Leu Gly Val Thr Val Arg Pro
145 150 155 160
Thr Gly Ala Trp Val Thr Gln Phe Ala Arg Asn Leu Leu Lys Asp Leu
165 170 175
Glu Glu Arg Ala Gly Cys Phe Arg Phe Leu Ile Arg Asp Arg Asp Ser
180 185 190
Lys Phe Thr Ala Ala Phe Asp Ala Val Phe Ala Asp Asn Gly Thr Ala
195 200 205
Val Ile Pro Thr Pro Pro Gln Ser Pro Arg Ser Asn Ala Phe Ala Glu
210 215 220
Arg Trp Ile Arg Thr Ala Arg Ala Glu Cys Thr Asp Arg Ile Leu Ile
225 230 235 240
Thr Gly Glu Arg His Leu Arg Ala Val Leu Thr Thr Tyr Ala Glu His
245 250 255
Tyr Asn Thr Gly Arg Ala His Arg Ser Leu Asp Leu Arg Ala Pro Asp
260 265 270
Asp Arg Pro Ser Val Ile Pro Leu Pro Ala Ala Val Val Arg Arg Arg
275 280 285
Arg Leu Leu Gly Gly Leu Leu Asn Glu Tyr His Thr Thr Pro Pro Gln
290 295 300
Arg Leu Leu His Pro Gln Glu Thr Pro Ser Ser Ala Ala
305 310 315
<210>12
<211>593
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>1
Val Leu Ile Val Ala Ala Gly Trp Ser Gly Gly Arg Ser Phe Ser Phe
1 5 10 15
Pro Val Thr Glu Trp Glu Gly Leu Val Pro Met Glu Pro Arg Ser Trp
20 25 30
Pro Glu Pro Ala Pro Glu Val Ala Arg Ala Val Arg Ala Lys Tyr Ser
35 40 45
Gly Arg Gln Val Pro Leu Pro Val Val Val Arg Asp Arg Leu Gly Glu
50 55 60
Leu Phe Ala Asp Ala Glu Phe Ala Glu Ala Phe Ala Val Thr Gly Pro
65 70 75 80
Arg Gly Trp Ser Pro Gly Arg Leu Ala Leu Val Thr Val Leu Gln Met
85 90 95
Ala Glu Asn Leu Thr Asp Arg Gln Ala Ala Glu Ala Val Arg Asp Lys
100 105 110
Leu Ser Trp Ser Tyr Ala Leu Gly Leu Gly Leu Glu Asp Pro Gly Phe
115 120 125
Asp Phe Ser Val Leu Ser Gln Phe Arg Ser Arg Val Ala Ala His Gly
130 135 140
Leu Glu Glu Lys Val Leu Asp Leu Leu Val Ala Arg Leu Thr Glu Gln
145 150 155 160
Gly Leu Leu Ala Ala Gly Gly Lys Gln Arg Thr Asp Ser Thr His Val
165 170 175
Val Ala Ala Val Arg Asp Leu Asn Arg Leu Glu Leu Ala Gly Glu Ala
180 185 190
Val Arg Ala Ala Leu Glu Ala Leu Thr Cys Ala Gly Pro Asp Trp Val
195 200 205
Ala Gln Ala Val Asp Val Ala Ser Trp Ser Arg Arg Tyr Gly Pro Arg
210 215 220
Val Asp Ser Trp Arg Leu Pro Thr Ser Arg Ala Arg Gln Gln Lys Leu
225 230 235 240
Ala Val Asp Phe Ala Arg Asp Gly Phe Ala Leu Leu Gly Ala Val Tyr
245 250 255
His Ser Ser Ser Pro Val Trp Leu Arg Glu Leu Pro Ala Val Gln Val
260 265 270
Leu Trp Cys Val Leu Val Gln Asn Tyr Thr Arg Thr Ile Thr Arg Gly
275 280 285
Gly Arg Glu Val Val Lys Arg Arg Glu Lys Thr Asp Glu Gly Gly Asp
290 295 300
Gly Arg Pro Pro Gly His Leu Arg Leu Ser Ser Pro Tyr Asp Thr Asp
305 310 315 320
Ala Arg Trp Ser Ala Lys Arg Asp Met Phe Trp Asn Gly Tyr Lys Leu
325 330 335
His Ile Ser Glu Thr Cys Thr Ser Ala Pro Glu Lys Ala Arg Thr His
340 345 350
Pro Asn Leu Ile Thr Asn Ile Ala Thr Thr His Ser Thr Val Pro Asp
355 360 365
Ser Lys Thr Leu Asn Ala Ile His His Ala Leu Gln Gln Arg Gly Leu
370 375 380
Leu Pro Asp Glu His Tyr Pro Asp Ser Gly Tyr Ala Thr Ala Glu Leu
385 390 395 400
Ile His Gly Ser Val Lys Thr Tyr Gly Ile Ala Leu Ile Thr Pro Val
405 410 415
Leu Leu Asp Thr Ser Arg Gln Ala Lys Ala Gln Ala Gly Phe Ala Ala
420 425 430
Thr Asp Phe Thr Ile Asp Arg Glu Ala Gly Lys Ala Thr Cys Pro Ala
435 440 445
Gly His Thr Ser Ala Thr Trp Asn Pro Val Val Ser Glu Gly Ile Pro
450 455 460
Lys Thr Val Val Ser Phe Ala Ala Leu Asp Cys Ile Pro Cys Pro Phe
465 470 475 480
Lys Pro Gln Cys Thr Thr Ala Lys Lys Asn Arg Arg Gln Leu Ser Leu
485 490 495
His Leu Arg Gln Met Thr Glu Ala Leu Arg His Thr Arg Thr Gln Gln
500 505 510
Lys Thr Lys Asp Trp Asn Thr Asp Tyr Ala Leu Arg Ser Gly Ile Glu
515 520 525
Gly Thr Ile Arg Gln Ala Thr Ala Val Thr Gly Thr Arg Arg Ala Arg
530 535 540
Tyr Arg Gly Leu Ala Lys Thr His Leu Glu His Ile Tyr Ser Ala Val
545 550 555 560
Ala Leu Asn Leu Ile Arg Leu Asn Ala Trp Trp Asn Asp Arg Pro Leu
565 570 575
Asp Arg Thr Arg Thr Ser His Leu Thr Arg Leu Glu His Thr Leu Thr
580 585 590
Ala
<210>13
<211>940
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>1
Met Lys Leu Ser Glu Pro Ser Tyr Tyr Pro Glu Ile Val Glu Arg Ser
1 5 10 15
Glu Glu Ile Ser Leu Leu Ala Gln Asp Leu Ala Asn Thr Lys Arg Gly
20 25 30
Glu Gly Ala Val Val Val Ile His Ser Gly Pro Gly Val Gly Arg Thr
35 40 45
Ala Leu Leu Asp Glu Phe Leu Arg Gln Ser Gly Asn Ser Gly Ala Arg
50 55 60
Val Cys Ala Ala Thr Gly Ser Ala Ala Glu Thr Gly Asn Glu Leu Gly
65 70 75 80
Val Val Thr Gln Leu Phe Pro Glu Asp Gly Pro Ile Ala Ala Ala Val
85 90 95
Trp Leu Ala Arg Ala Leu Asp Asp His His Gly Asp Pro Ser Pro Asp
100 105 110
Ala Asp Arg Leu Phe Asp Met Leu Arg Gly Glu Phe Arg Gln Gly Pro
115 120 125
Leu Val Leu Ala Val Asp Asp Val Gln Leu Ala Asp Ala Ala Ser Leu
130 135 140
Arg Phe Leu Leu His Leu Ile Arg Arg Leu Arg Thr Thr Pro Val Leu
145 150 155 160
Ile Val Leu Thr Glu Pro Val Gly Ser Cys Ala Leu Pro Leu Ala Phe
165 170 175
Gln Ala Glu Leu Leu Arg His Pro Arg Cys Arg Arg Leu Arg Leu Gln
180 185 190
Pro Leu Ser Val Asp Gly Val Thr Arg Met Ile Glu Pro Tyr Val Ala
195 200 205
Glu Thr Glu Val Ala Arg Leu Ala Thr Gln Phe His Ala Val Ser Gly
210 215 220
Gly Asn Pro Val Leu Val Arg Gly Leu Leu Ala Asp His Arg Ala Gly
225 230 235 240
Gln Arg Leu Glu Glu Gln Gly Ile Gly Ala Gln Tyr Asn Gly Tyr Pro
245 250 255
Ala Phe Thr Gln Ala Ala Leu Val Ser Ala Tyr Arg Asp Asp Pro Val
260 265 270
Leu Phe Glu Val Val Cys Gly Ile Ala Val Leu Gly Glu Asn Ala Ser
275 280 285
Pro Ala Leu Val Ala Cys Leu Val Asp Arg Gly Ala Asp Val Val Ala
290 295 300
Arg Val Met Thr Ala Leu Asn Thr Ala Ser Leu Leu Asn Gly Pro Ala
305 310 315 320
Phe Arg Ser Pro Leu Val Ala Lys Ala Leu Leu Glu Leu Leu Asp Val
325 330 335
Glu Thr Arg Gly Glu Leu His Arg Arg Ala Ala Glu Leu Leu His Ala
340 345 350
Asp Ala Ala Leu Pro Ala Asp Val Ala His His Leu Leu Ala Thr Pro
355 360 365
Ile Ala Glu Ser Trp Val Leu Pro Thr Leu Leu Ala Ala Ala Glu Gln
370 375 380
Ala Val Gln Gly Gly Gly Gln Asp Phe Arg Leu Asp Cys Leu Arg Leu
385 390 395 400
Ala Gly Arg Gln Ala Ala Thr Glu Glu Glu Arg Ala Ala Val Val Ala
405 410 415
Ala Arg Val Arg Ile Gly Trp Glu Ile Asp Pro Arg Leu Ile Thr Pro
420 425 430
Trp Leu Gly Glu Leu Gly Ala Ala Leu Arg Arg Gly His Val Gly Ser
435 440 445
Ser Asp Ala Ala Trp Thr Val Lys His Phe Val Trp His Asp His Val
450 455 460
Glu Glu Ala Ala Asp Ile Leu Ser Ala Leu Met Glu Arg Thr Glu Glu
465 470 475 480
Asn Ser Asp Ala His Ala Glu Leu Glu Ile Val Arg His Trp Val Arg
485 490 495
Tyr Thr Cys Pro Thr Leu Leu Glu Gly Ser Val Asp Ala Asp Ala Pro
500 505 510
Ser Leu Ser Gly Pro Phe Pro Gln Arg Phe Gln Leu Arg Pro Ala Ser
515 520 525
Tyr Ala Val Glu Met Leu Gly Arg Leu Phe Thr Glu Gly Pro Cys Asp
530 535 540
Gln Ala Ala Ala Met Ala Glu Glu Ile Leu Arg Gly Cys Arg Phe Gly
545 550 555 560
Glu Thr Thr Val Glu Ala Val Glu Gly Ala Leu Leu Val Leu Val Tyr
565 570 575
Ala Glu Arg Pro Gly Arg Ala Leu His Trp Cys Glu Ala Leu Leu Glu
580 585 590
Gln Ala Gly Asp His Pro Thr Gly Thr Ala Ala Ala Ile Leu Ser Ser
595 600 605
Ile Arg Ala Glu Ile Ala Leu Arg Gln Gly Ala Leu Glu Glu Ala Glu
610 615 620
Thr Tyr Ala Asp Arg Ala Leu Asn Ala Ile Ser Arg Leu Gly Trp Gly
625 630 635 640
Val Ala Ile Gly Ser Pro Leu Ala Val Arg Val Arg Ala Ala Met Ala
645 650 655
Ala Gly Arg Thr Gly Leu Ala Gly Ala Trp Leu Asn Gln Asp Val Pro
660 665 670
Gln Gly Met Phe Arg Thr Arg His Gly Leu Leu Tyr Met His Ala Arg
675 680 685
Gly His Tyr His Leu Ala Thr Asp Arg Pro Thr Val Ala Leu Glu Asp
690 695 700
Phe Leu Thr Cys Gly Arg Leu Ala Lys Glu Trp Gly Met Asp Val Pro
705 710 715 720
Thr Phe Leu Pro Trp Arg Thr Ser Ala Ala Leu Ala His Leu Ala Leu
725 730 735
Gly Asn Gly Ser Arg Ala Ser Ala Leu Ala Arg Glu Gln Leu Thr Arg
740 745 750
Pro Gly Gly Gly Trp Pro Arg Cys Arg Ala Val Ser Leu Arg Val Leu
755 760 765
Ala Ala Thr Ser Glu Leu Asp Arg Arg Pro Ala Leu Leu Arg Glu Ser
770 775 780
Val Asn Leu Leu Glu Ser Cys Gly Asp His Val Glu Leu Leu His Ser
785 790 795 800
Leu Ala Asp Gln Phe Gln Ala Leu Ser Glu Ala Gly Ala Pro Ala Lys
805 810 815
Ala Arg Ile Ala Ala Arg His Ala Arg Thr Val Ala Asp Asn Cys Gly
820 825 830
Thr Glu Thr Leu Phe Arg Arg Leu Phe Lys Glu Glu Val Pro Glu Asp
835 840 845
Thr Asp Glu Ser Ala Asp Phe Gly Gln Asp His Gln Gly Phe Ala Ser
850 855 860
Leu Thr Asp Ala Glu Arg Arg Val Thr Ala Leu Ala Ala Leu Gly Tyr
865 870 875 880
Ser Asn Arg Glu Ile Gly Arg Lys Leu Phe Ile Thr Lys Ser Thr Val
885 890 895
Glu Gln His Leu Thr Arg Val Tyr Arg Lys Leu Gly Val Arg Asn Arg
900 905 910
Ala Asp Leu Gly Asp Leu Leu Ala Gly Ile Asn Leu Ala Ala Gln Pro
915 920 925
Gln Val Met Gly Arg Thr Ser Ser Ala Ala Val Gly
930 935 940
<210>14
<211>447
<212>PRT
<213>Nanchang streptomycete NS3226 (Streptomyces nanchangensis n. sp. NS3226)
<400>1
Met Thr Leu Leu Ser Glu Ala Val Arg Ala Gly Ala Ser Pro Gln Glu
1 5 10 15
Leu Glu Arg Ala Glu Pro Pro Arg Glu Tyr Thr Ala Ala Tyr Ile His
20 25 30
Ser Glu Asp Thr Arg Met Phe Glu Gly Val Ala Asp Lys Asp Val Arg
35 40 45
Lys Ser Leu Arg Val Gly Arg Val Pro Met Pro Glu Leu Ala Pro Asp
50 55 60
Glu Val Leu Val Ala Val Met Ala Ser Ala Val Asn Tyr Asn Thr Val
65 70 75 80
Trp Ser Ala Ile Phe Glu Pro Leu Pro Thr Phe Arg Phe Leu Arg Gln
85 90 95
Phe Ala Ala Gln Gly Gly Trp Ala Ser Arg His Asp Leu Pro Tyr His
100 105 110
Val Leu Gly Ser Asp Gly Ala Gly Val Val Val Arg Thr Gly Pro Gly
115 120 125
Val Arg His Trp Lys Thr Gly Asp His Val Val Val Ser Cys Val Gln
130 135 140
Ala Asp Asp Gln Glu Ala Ala Thr Gln Ala Asp Gly Met Leu Gly Ala
145 150 155 160
Glu Gln Arg Ile Trp Gly Phe Glu Thr Asn Phe Gly Gly Leu Ala His
165 170 175
Tyr Ala Val Val Arg Ala Ser Gln Leu Ile Pro Lys Pro Gly His Leu
180 185 190
Ser Trp Glu Glu Ala Ala Cys Asn Pro Leu Cys Gly Gly Thr Ala Tyr
195 200 205
Arg Met Leu Val Gly Asp Arg Gly Ala Arg Leu Lys Gln Gly Glu Ile
210 215 220
Val Leu Ile Trp Gly Ala Ala Gly Gly Leu Gly Ala Tyr Ala Val Gln
225 230 235 240
Leu Val Lys Asn Gly Gly Gly Ile Pro Val Gly Val Val Ser Ser Pro
245 250 255
Ala Lys Ala Glu Ala Ala Arg Arg Leu Gly Cys Asp Val Val Ile Asp
260 265 270
Arg Gln Glu Ile Gly Leu Asp Asp Arg Thr Ala Tyr Asp Pro Ala Ala
275 280 285
Val Ile Glu Thr Gly Lys Gln Leu Gly Arg Ile Ile Arg Arg Glu Val
290 295 300
Gly Glu Asp Pro His Ile Val Phe Glu His Val Gly Arg Ser Thr Phe
305 310 315 320
Pro Val Ser Val Phe Ala Val Arg Arg Gly Gly Thr Val Val Thr Cys
325 330 335
Gly Ser Ser Thr Gly Tyr Gln His Thr Tyr Asp Asn Arg Tyr Leu Trp
340 345 350
Met Lys Leu Lys Arg Ile Ile Gly Ser His Ala Ala Asn Leu Gln Glu
355 360 365
Gln Trp Glu Leu Asn Arg Leu Val Ser Arg Gly Gln Ile Val Pro Thr
370 375 380
Leu Ser Ala Val Tyr Pro Leu Ala Glu Val Ala Ala Ala Thr Arg Ser
385 390 395 400
Val Gln Thr Asn Arg His Ile Gly Lys Val Gly Val Leu Cys Leu Ala
405 410 415
Glu Ala Pro Gly Gln Gly Val Thr Asp Pro Ala Leu Arg Ala Arg Val
420 425 430
Gly Glu Glu Arg Leu Ser Leu Leu Arg Asp Leu Ser Pro Thr Ala
435 440 445
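
The records in the listing above share one layout: numeric tags (<210> sequence number, <211> declared length, <212> molecule type), then rows of three-letter residue codes alternating with position rulers. As a minimal sketch of how such a record could be read, the Python below collapses the residue rows into a one-letter string and keeps the declared length for comparison. The names `THREE_TO_ONE`, `parse_records` and `SAMPLE` are hypothetical, and `SAMPLE` is a truncated toy record built from the first residue row of sequence 10, so its declared length of 16 is illustrative and not taken from the listing.

```python
import re

# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Tag lines look like "<210>10"; the pattern also tolerates a missing "<"
# or a full-width closing bracket, both of which occur in scraped listings.
TAG_LINE = re.compile(r"<?(\d{3})[>〉]\s*(.*)")


def parse_records(text):
    """Return {sequence number: {"declared_length", "sequence"}}."""
    records = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        tag = TAG_LINE.match(line)
        if tag:
            code, value = tag.group(1), tag.group(2).strip()
            if code == "210":          # start of a new record
                current = {"declared_length": None, "residues": []}
                records[int(value)] = current
            elif code == "211" and current is not None:
                current["declared_length"] = int(value)
            continue                   # other tags are ignored in this sketch
        tokens = line.split()
        # Residue rows contain only known codes; position rulers do not.
        if current is not None and all(t in THREE_TO_ONE for t in tokens):
            current["residues"].extend(THREE_TO_ONE[t] for t in tokens)
    return {
        number: {
            "declared_length": rec["declared_length"],
            "sequence": "".join(rec["residues"]),
        }
        for number, rec in records.items()
    }


# Toy record: first residue row of sequence 10, with an illustrative length.
SAMPLE = """<210>10
<211>16
<212>PRT
<400>1
Met Gly Val Phe Glu Gln Glu Ala Ala Glu Ser Thr Gly Glu Lys Phe
1 5 10 15
"""

if __name__ == "__main__":
    for number, rec in parse_records(SAMPLE).items():
        ok = len(rec["sequence"]) == rec["declared_length"]
        print(number, rec["sequence"], "length matches" if ok else "length mismatch")
```

Because ruler rows are skipped rather than parsed, a damaged ruler does not affect the recovered sequence; comparing the residue count with the <211> value is a cheap check that no residue row was lost.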

Claims (7)

1. A nanoligomycin biosynthetic gene cluster having the nucleotide sequence shown in SEQ ID NO. 1, the complete cluster comprising 13 genes, specifically:
(1) polyketide synthase genes, namely nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7, 7 genes in total;
(2) nanoligomycin modification genes, namely nlmB and nlmOI, 2 genes in total;
(3) nanoligomycin transposase genes, namely nlmTI and nlmTII, 2 genes in total;
(4) a nanoligomycin regulatory gene, namely nlmRI;
(5) a nanoligomycin precursor synthesis gene, namely ccrA.
2. The nanoligomycin biosynthetic gene cluster according to claim 1, characterized in that the polyketide synthase genes comprise the 7 type I polyketide synthase open reading frames that encode the enzymes required to catalyze biosynthesis of the polyketide aglycone of the 26-membered macrolide antibiotic nanoligomycin in Streptomyces nanchangensis NS3226, namely the nucleotide sequences or complementary sequences of nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7 and their corresponding amino acid sequences.
3. The nanoligomycin biosynthetic gene cluster according to claim 2, characterized in that the 7 type I polyketide synthase open reading frames comprise modules or domains, namely the nucleotide sequences or complementary sequences, and the corresponding amino acid sequences, of the ketosynthase domain, acyltransferase domain, ketoreductase domain, dehydratase domain, enoylreductase domain, acyl carrier protein domain and thioesterase domain.
4. The nanoligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanoligomycin modification genes comprise the 2 open reading frames involved in oxidative modification of the nanoligomycin polyketide chain, namely the nucleotide sequences or complementary sequences of nlmB and nlmOI and their corresponding amino acid sequences.
5. The nanoligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanoligomycin transposase genes comprise the 2 open reading frames involved in transposition of the nanoligomycin biosynthetic gene cluster, namely the nucleotide sequences or complementary sequences of nlmTI and nlmTII and their corresponding amino acid sequences.
6. The nanoligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanoligomycin regulatory gene comprises the 1 open reading frame involved in regulating nanoligomycin biosynthesis, namely the nucleotide sequence or complementary sequence of nlmRI and its corresponding amino acid sequence.
7. The nanoligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanoligomycin precursor synthesis gene comprises the open reading frame of ccrA, namely the nucleotide sequence or complementary sequence of ccrA and its corresponding amino acid sequence.
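
The gene inventory recited in claim 1 can be written down as a small lookup table, which makes the stated total of 13 genes easy to verify. The sketch below is only an illustration: the gene names come from the claims, while the category labels and the names `NLM_CLUSTER`, `all_genes` and `category_of` are hypothetical choices made for this example.

```python
# Gene names are taken from claim 1; the category labels are paraphrased.
NLM_CLUSTER = {
    "polyketide synthase": [
        "nlmA1", "nlmA2", "nlmA3", "nlmA4", "nlmA5", "nlmA6", "nlmA7",
    ],
    "modification": ["nlmB", "nlmOI"],
    "transposase": ["nlmTI", "nlmTII"],
    "regulatory": ["nlmRI"],
    "precursor synthesis": ["ccrA"],
}


def all_genes(cluster):
    """Flatten the category table into a single list of gene names."""
    return [gene for genes in cluster.values() for gene in genes]


def category_of(gene, cluster=NLM_CLUSTER):
    """Return the functional category claimed for a gene, or None."""
    for category, genes in cluster.items():
        if gene in genes:
            return category
    return None


if __name__ == "__main__":
    genes = all_genes(NLM_CLUSTER)
    # 7 + 2 + 2 + 1 + 1 genes, matching the total stated in claim 1.
    print(len(genes), "genes in total")
    for category, members in NLM_CLUSTER.items():
        print(f"{category}: {', '.join(members)}")
```

Keeping the categories as dictionary keys mirrors the five numbered groups of claim 1, so each dependent claim (2 and 4 to 7) corresponds to exactly one entry.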
CN 03150923 2003-09-11 2003-09-11 Southern oligomycin biosynthetic gene cluster Expired - Fee Related CN1257282C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 03150923 CN1257282C (en) 2003-09-11 2003-09-11 Southern oligomycin biosynthetic gene cluster

Publications (2)

Publication Number Publication Date
CN1523034A CN1523034A (en) 2004-08-25
CN1257282C true CN1257282C (en) 2006-05-24

Family

ID=34286817

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 03150923 Expired - Fee Related CN1257282C (en) 2003-09-11 2003-09-11 Southern oligomycin biosynthetic gene cluster

Country Status (1)

Country Link
CN (1) CN1257282C (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101063140B (en) * 2007-01-26 2011-06-22 中国科学院上海生命科学研究院 Vancocin biological synthesis gene cluster

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee