CN114717239A - High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr - Google Patents

High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr Download PDF

Info

Publication number
CN114717239A
CN114717239A CN202111654874.5A CN202111654874A CN114717239A CN 114717239 A CN114717239 A CN 114717239A CN 202111654874 A CN202111654874 A CN 202111654874A CN 114717239 A CN114717239 A CN 114717239A
Authority
CN
China
Prior art keywords
leu
ser
glu
thr
val
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111654874.5A
Other languages
Chinese (zh)
Inventor
韦红英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
First Affiliated Hospital of Guangxi Medical University
Original Assignee
First Affiliated Hospital of Guangxi Medical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by First Affiliated Hospital of Guangxi Medical University filed Critical First Affiliated Hospital of Guangxi Medical University
Priority to CN202111654874.5A priority Critical patent/CN114717239A/en
Publication of CN114717239A publication Critical patent/CN114717239A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/745Blood coagulation or fibrinolysis factors
    • C07K14/755Factors VIII, e.g. factor VIII C (AHF), factor VIII Ag (VWF)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P7/00Drugs for disorders of the blood or the extracellular fluid
    • A61P7/04Antihaemorrhagics; Procoagulants; Haemostatic agents; Antifibrinolytic agents
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Biochemistry (AREA)
  • Hematology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Diabetes (AREA)
  • Toxicology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Peptides Or Proteins (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The invention relates to a high-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710 Thr. The mutant does not destroy or inhibit the blood coagulation activity of FVIII, the specific blood coagulation activity is 1.6-2 times of that of a wild type, and the mutant is superior in stability; the mutant obviously improves the drug efficacy of FVIII and has good clinical application prospect.

Description

High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr
Technical Field
The invention belongs to the field of hemophilia, and particularly relates to a blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr with high activity.
Background
Currently, the lifelong alternative treatment of FVIII is the only clinically effective treatment. Due to the fact that the content of FVIII in blood plasma is extremely low (100-200 ng/ml), the source of concentrated human FVIII is limited, potential virus transmission risks exist, and with the development of biotechnology, the genetically recombinant FVIII gradually replaces FVIII from blood plasma, and becomes a first choice preparation for replacement therapy. Fviii replacement therapy greatly improves the condition and quality of life of hemophiliacs.
Over time, the shortfalls of replacement therapy are increasingly significantly uncovered: first, induction of inhibitor production is considered to be the most serious complication of replacement therapy, with approximately 20% to 30% of hemophilia a patients developing fviii inhibitor during treatment, leading to reduced efficacy and even failure of replacement therapy; secondly, the F VIII has short half-life and frequent administration times: the half-life of the FVIII in vivo is only less than 12 hours, and in order to maintain the effective concentration of the blood plasma and achieve the purpose of preventing bleeding, the FVIII needs to be injected for 3 times every week; thirdly, the F VIII replacement therapy cost is high, the average cost of replacement therapy for severe hemophilia A patients per year reaches 10 million dollars, although the hemophilia is listed as major disease medical insurance in China at present, the reimbursement rate is also increased year by year, but the expensive drug cost still brings heavy economic burden to families and society. The heavy economic burden is the main reason that most severe hemophilia A patients in China are difficult to persist in long-term prevention and treatment. Gene therapy with HA is expected to solve the above problems, avoiding patients from life-long replacement therapy and the risk of inhibitor production, and HAs been well developed in recent years, but patients receiving this treatment have observed a decline in fviii expression over time, and some patients still need to be treated with replacement therapy to maintain fviii levels in vivo, and the success of gene therapy HAs previously solved the problem of maintaining stable expression of the introduced factor gene in vivo, while non-replacement therapy may possibly delay or at least reduce the exposure to exogenous fviii in the future, but replacement therapy will still be the mainstay of HA therapy.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a high-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr, the mutant does not destroy or inhibit the blood coagulation activity of FVIII, the specific blood coagulation activity is 1.6-2 times of that of a wild type, and the stability is more excellent; the mutant obviously improves the drug efficacy of the FVIII and has good clinical application prospect.
The invention provides a nucleic acid for coding a high-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710 Thr:
(1) hF VIII cDNA: the nucleotide sequence is shown in any one of SEQ ID NO 1-4;
or (2) BDD hF VIII-SQ: the nucleotide sequence is shown as any one of SEQ ID NO 5-8;
or (3) BDDhFVIII-N8: the nucleotide sequence is shown in any one of SEQ ID NO 9-12;
or (4) BDD hFVIII-SC: the nucleotide sequence is shown in any one of SEQ ID NO 13-16;
or (5) BDD hF VIII- Δ F: the nucleotide sequence is shown in any one of SEQ ID NO 17-20.
The invention also provides a mutant protein of the polypeptide variant Gly710Thr of the high-activity blood coagulation factor VIII or VIII a, the amino acid sequence is shown in any one of SEQ ID NO 21-29, and the 710th amino acid of the mutant is Thr instead of Gly of wild type VIII or VIII a. The amino acid sequence sequentially comprises the amino acid sequences of the corresponding mutant proteins of hFVIII cDNA, BDHFVIII-SQ, BDHFVIII-N8, BDHFVIII-SC and BDHFVIII-delta F and the amino acid sequences of the mutant proteins of BDHFVIII-V3, BDD human-cl rhFVIII, pFVIII and BDD pFVIII-OL.
The invention also provides a nucleic acid encoding the mutant protein, or a nucleic acid which has the same length as the encoding nucleic acid and is completely complementary with the encoding nucleic acid.
The invention also provides a carrier for expressing the mutant protein.
The invention also provides a preparation method of the mutant protein of the polypeptide variant Gly710Thr of the high-activity blood coagulation factor VIII or VIII a, which comprises the following steps:
BDD-hFVIII or full-length FVIII is taken as a template, a PCR site-directed mutagenesis method is used for introducing mutation sites into wild FVIII, Gly710 amino acid residues are replaced by Thr, and BDD-hFVIII or full-length FVIII eukaryotic expression plasmids which are correctly inserted into Gly710Thr mutation are screened after transformation.
The invention also provides application of the mutant protein coding nucleic acid of the high-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr, and the application of the nucleic acid in preparing gene therapy medicaments comprises the steps of connecting the nucleic acid with a promoter and a termination sequence, constructing an expression plasmid, a gene therapy virus or a non-viral vector and expressing the mutant protein.
The invention also provides application of the mutant protein of the high-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr in preparing a recombinant protein treatment medicament for hemophilia.
The invention also provides application of the mutant protein of the high-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr, which is applied to preparing the mutant fusion protein of the blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr and is used for preparing a recombinant protein treatment medicament for hemophilia.
The fusion protein is obtained by fusing one of human albumin, immunoglobulin Fc, transferrin or alpha 1 antitrypsin with the mutant protein of the polypeptide variant Gly710Thr mutant of the blood coagulation factor VIII or VIII a.
The nucleic acid or amino acid sequence pharmaceutical composition or gene therapy vector of the present invention is useful for preventing and/or treating diseases, wherein the diseases mainly include hemorrhagic diseases or hemorrhage due to various causes; among the most likely bleeding disorders are hemophilia a and B, i.e. bleeding disorders due to hereditary factor viii deficiency, and include hemophilia a and B in which inhibitory antibodies are produced, or acquired factor viii deficiency due to the production of inhibitors; and other hemorrhagic diseases using bypass preparations, such as neonatal coagulation disorders; severe liver disease; high risk surgery; traumatic blood loss; bone marrow transplantation; thrombocytopenia and platelet dysfunction; emergency reversal of oral anticoagulation; a deficiency in congenital factor viii; von willebrand disease, and acquired von willebrand disease caused by inhibitors of von willebrand factor, blood loss associated with extensive injury, cerebral hemorrhage, platelet dysfunction.
Advantageous effects
The mutant does not destroy or inhibit the blood coagulation activity of FVIII, the specific blood coagulation activity is 1.6-2 times of that of a wild type, and the mutant is superior in stability; the mutant obviously improves the drug efficacy of FVIII and has good clinical application prospect.
Drawings
FIG. 1 shows the time course detection of FVIII activity of BDD-hFVIII Gly710Thr mutant in the absence of FIXa.
FIG. 2 shows the time course detection of FVIII activity of BDD-hFVIII Gly710Thr mutant in the presence of FIXa.
Detailed Description
The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Further, it should be understood that various changes or modifications of the present invention may be made by those skilled in the art after reading the teaching of the present invention, and such equivalents may fall within the scope of the present invention as defined in the appended claims.
Example 1
A mutant protein of a polypeptide variant Gly710Thr of high-activity blood coagulation factor VIII or VIII a, the amino acid sequence of which is shown in any one of SEQ ID NO 21-29, and the 710th amino acid of the mutant is Thr instead of Gly of wild type VIII or VIII a. In this example, muteins of hFVIII cDNA were used as examples.
Expression and purification of FVIII Gly710Thr mutant
(1) Expression: site-directed mutagenesis kit (using QuikChange: (9))
Figure RE-GDA0003647079790000031
II XL sitte-Directed Mutagenesis Kit, Agilent, USA), using human B-region deletion type blood coagulation VIII factor (BDD-hFVIII) as a template, introducing mutation sites into wild type FVIII by a PCR site-Directed Mutagenesis method, replacing 710Gly with 710Thr, digesting amplification products by Dpn I, then transforming (adding 5-10 microliter of mutation products digested by Dpn I into 100 microliter of competent bacteria), coating transformed bacteria on a plate containing 1/2000 ampicillin, and culturing overnight to obtain clone; selecting a single colony, inoculating the single colony to 5ml LB culture solution containing 1/2000 ampicillin, carrying out overnight strain amplification on a shaking table at the temperature of 37 ℃ at the speed of 250 rpm, extracting plasmid DNA for sequencing, and screening BDD-hF VIII eukaryotic expression plasmids which correctly insert Gly710Thr mutation.
(2) And (3) purification: and transfecting CHO cells by using wild BDD-hFVIII and the constructed BDD-hFVIII Gly710Thr expression plasmids, and establishing a blank vector control. After the stable high-expression clone cell strain is screened out, the clone cell strain is transferred into a selective serum-free culture solution for large-scale culture, and the FVIII secreted into the culture solution is purified by an immunoadsorption method (immunoadsorption gel VIII-Select, GE Healthcare, USA). The product was concentrated using Centricon Plus Ultrafiltration centrifuge tubes (MILLIPORE, USA).
Detection of blood coagulation Activity of FVIII Gly710Thr mutant
(1) Coenzyme Activity assay of FVIII Gly710Thr mutant (FXa production assay): the coactivator activity of FVIII is manifested as the ability of FIXa to assist in the cleavage of FX to FXa. The wild-type BDD-hFVIII is used as a control, and the FXa generation experiment is used for evaluating the coenzyme activity of the FVIII Gly710Thr mutant: thrombin acts on FVIII for 1min (FVIII: thrombin: 20:1) to activate FVIII to FVIII a, hirudin inactivates thrombin; 0.5nM of FVIIIa is taken, 20 mu M, FIXa40nM of PSPC phospholipid particles are added, mixed and incubated for 1min at 37 ℃, FX with different concentrations of 0nM, 20nM, 40nM, 60nM, 120nM, 240nM and the like is respectively added, reaction is carried out for 10min at 37 ℃, and EDTA stops the reaction; adding FXa specific chromogenic substrate S-2765, carrying out color comparison at the wavelength of 405nm, and measuring the generated FXa. Michaelis-Menten enzyme kinetic analysis was performed using graphpad prism software, and the enzyme kinetic constants Km, Kcat, and Vmax were calculated.
(2) Effect of fviii Gly710Thr mutant on fviii specific activity: the specific activity of FVIII represents the coagulation activity of FVIII per mg of protein. The activity of the purified BDD-hFVIII mutant and wild type was determined using a first-stage and a second-stage method (Chromogenissay) according to
Figure RE-GDA0003647079790000041
SP4F VIII kit (Chromogenix, Italy) protocol. Protein quantification with enzyme-linked immunosorbent assay: the concentration of the sample was determined by sandwich ELISA using anti-C2 antibody GMA-8014 as the capture antibody, biotin-labeled anti-A2 antibody GMA-8016 as the detection antibody, and purified recombinant human FVIII as the standard. The specific activity of the FVIII mutant and the relative activity of the FVIII mutant relative to the wild type were calculated.
Effect of FVIII Gly710Thr mutants on FVIII stability
(1) Viii thermal stability test: the stability of the wild type FVIII was assessed by observing the degree of activity loss of the FVIII Gly710Thr mutant under high temperature conditions. Respectively taking 4nM of wild type FVIII and FVIII mutants, incubating at 55 ℃, respectively sampling at 0min, 10min, 20min, 30min and 40min after incubation, detecting residual FVIII activity by applying a chromogenic substrate method, and analyzing by graphpadprist software to obtain a one phase decay model fitting attenuation curve to obtain a half-life period.
(2) Fviia activity decay assay:
the time course of FVIII activity in the absence of FIXa: respectively taking 4nM of F VIII Gly710Thr and wild type BDD-hF VIII, inactivating thrombin by hirudin after 20nM activation of thrombin, adding FIXa40nM and PSPC 10 mu M, mixing, incubating at 23 ℃, respectively sampling at 0min, 5min, 10min, 15min and 20min after incubation, adding FX, detecting the FXa concentration of the sample according to an FXa generation test program, and indirectly determining the concentration of residual F VIII a.
Interval of FVIII activity in the presence of FIXa: the above procedure was repeated, and factor IXa40nM was added to the reaction before thrombin was added. Inactivating thrombin by hirudin after 20nM activation of thrombin, adding 10 μ M of PSPC, incubating at 23 ℃ after mixing, respectively sampling at 0min, 10min, 20min, 30min and 40min after incubation, adding FX, detecting FXa concentration of a sample by comparing with FXa generation test program, and indirectly determining the concentration of residual FVIIIa.
And thirdly, fitting an attenuation curve by using a one phase decay model of the graphpad prism to obtain a half-life period, and comparing the stability of the FVIIia.
4. Results of the experiment
(1) And (3) testing relative ratio of the BDD-hF VIII Gly710Thr mutant to activity:
transient transfection of plasmid into CHO cell, culturing for 48 hr, collecting culture liquid, taking supernatant for detection, detecting the activity of the mutant by APTT one-stage method, detecting the protein amount of the mutant by ELISA method, comparing with wild BDD-hF VIII, and obtaining the relative specific activity of the mutant primarily. The results show that: the relative specific activity of the BDD-hF VIII Gly710Thr mutant is 1.084 +/-0.004.
Screening cell strains stably expressed by the BDD-hFVIII Gly710Thr mutant for high-capacity cell culture, collecting culture solutions in 48h and 72h respectively, and extracting, purifying and concentrating at low temperature to obtain the target protein. And respectively detecting the activity of the mutant by a first-stage method and a second-stage method, detecting the protein amount of the mutant by an ELISA method, and comparing the protein amount with the wild BDD-hFVIII to calculate the relative specific activity of the mutant. The results show that: the relative ratio activity of the BDD-hFVIII Gly710Thr mutant in the first phase method is 2.07 +/-0.67, and the relative ratio activity in the second phase method is 1.62 +/-0.65.
(2) And (3) detecting the coenzyme activity and stability of the BDD-hFVIII Gly710Thr mutant:
the high-capacity cell culture product is adopted for detection, and the coenzyme activity and stability of the mutant are detected through FXa generation test, FVIII activity time-course test and the like, and the result shows that the mutant not only has the coenzyme performance and the thermal stability equivalent to those of a wild type but also shows better stability in the absence of FIXa under the same activity concentration.
The coenzyme action of the F VIII Gly710Thr mutant is detected by an FXa generation test. The results show that: after the FVIII Gly710 site is mutated into Thr, the coenzyme has the same coenzyme performance as the wild type.
② the coenzyme state and stability of the FVIII Gly710Thr mutant are detected by the activity time course of the FVIII in the absence of IXa. As shown in the results of fig. 1: in the reaction without FIXa, the half-life of the F VIII Gly710Thr mutant is 2 times of that of the wild type at 23 ℃, which indicates that the F VIII Gly710Thr mutant has better retention rate of A2 subunit and more excellent stability.
And thirdly, observing the influence of the FVIII mutant on the stability of the FVIII by detecting the time course of the FVIII activity in the presence of FIXa. As shown by the results in fig. 2: in the reaction containing FIXa, the half-life of the FVIII Gly710Thr mutant is equivalent to that of the wild type at 23 ℃.
And fourthly, BDD-hF VIII Gly710Thr mutant thermal stability detection: the activity of the FVIII Gly710Thr mutant and the wild-type BDD at 55 ℃ was attenuated by 50% for a comparable period of time, suggesting that both have similar thermostability at high temperature.
SEQUENCE LISTING
<110> first subsidiary Hospital of Guangxi medical university
<120> a highly active factor VIII or VIII a polypeptide variant Gly710Thr
<130> 1
<160> 29
<170> PatentIn version 3.3
<210> 1
<211> 7056
<212> DNA
<213> Artificial sequence
<400> 1
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgact tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaatg acatagagaa gactgaccct tggtttgcac acagaacacc tatgcctaaa 2400
atacaaaatg tctcctctag tgatttgttg atgctcttgc gacagagtcc tactccacat 2460
gggctatcct tatctgatct ccaagaagcc aaatatgaga ctttttctga tgatccatca 2520
cctggagcaa tagacagtaa taacagcctg tctgaaatga cacacttcag gccacagctc 2580
catcacagtg gggacatggt atttacccct gagtcaggcc tccaattaag attaaatgag 2640
aaactgggga caactgcagc aacagagttg aagaaacttg atttcaaagt ttctagtaca 2700
tcaaataatc tgatttcaac aattccatca gacaatttgg cagcaggtac tgataataca 2760
agttccttag gacccccaag tatgccagtt cattatgata gtcaattaga taccactcta 2820
tttggcaaaa agtcatctcc ccttactgag tctggtggac ctctgagctt gagtgaagaa 2880
aataatgatt caaagttgtt agaatcaggt ttaatgaata gccaagaaag ttcatgggga 2940
aaaaatgtat cgtcaacaga gagtggtagg ttatttaaag ggaaaagagc tcatggacct 3000
gctttgttga ctaaagataa tgccttattc aaagttagca tctctttgtt aaagacaaac 3060
aaaacttcca ataattcagc aactaataga aagactcaca ttgatggccc atcattatta 3120
attgagaata gtccatcagt ctggcaaaat atattagaaa gtgacactga gtttaaaaaa 3180
gtgacacctt tgattcatga cagaatgctt atggacaaaa atgctacagc tttgaggcta 3240
aatcatatgt caaataaaac tacttcatca aaaaacatgg aaatggtcca acagaaaaaa 3300
gagggcccca ttccaccaga tgcacaaaat ccagatatgt cgttctttaa gatgctattc 3360
ttgccagaat cagcaaggtg gatacaaagg actcatggaa agaactctct gaactctggg 3420
caaggcccca gtccaaagca attagtatcc ttaggaccag aaaaatctgt ggaaggtcag 3480
aatttcttgt ctgagaaaaa caaagtggta gtaggaaagg gtgaatttac aaaggacgta 3540
ggactcaaag agatggtttt tccaagcagc agaaacctat ttcttactaa cttggataat 3600
ttacatgaaa ataatacaca caatcaagaa aaaaaaattc aggaagaaat agaaaagaag 3660
gaaacattaa tccaagagaa tgtagttttg cctcagatac atacagtgac tggcactaag 3720
aatttcatga agaacctttt cttactgagc actaggcaaa atgtagaagg ttcatatgac 3780
ggggcatatg ctccagtact tcaagatttt aggtcattaa atgattcaac aaatagaaca 3840
aagaaacaca cagctcattt ctcaaaaaaa ggggaggaag aaaacttgga aggcttggga 3900
aatcaaacca agcaaattgt agagaaatat gcatgcacca caaggatatc tcctaataca 3960
agccagcaga attttgtcac gcaacgtagt aagagagctt tgaaacaatt cagactccca 4020
ctagaagaaa cagaacttga aaaaaggata attgtggatg acacctcaac ccagtggtcc 4080
aaaaacatga aacatttgac cccgagcacc ctcacacaga tagactacaa tgagaaggag 4140
aaaggggcca ttactcagtc tcccttatca gattgcctta cgaggagtca tagcatccct 4200
caagcaaata gatctccatt acccattgca aaggtatcat catttccatc tattagacct 4260
atatatctga ccagggtcct attccaagac aactcttctc atcttccagc agcatcttat 4320
agaaagaaag attctggggt ccaagaaagc agtcatttct tacaaggagc caaaaaaaat 4380
aacctttctt tagccattct aaccttggag atgactggtg atcaaagaga ggttggctcc 4440
ctggggacaa gtgccacaaa ttcagtcaca tacaagaaag ttgagaacac tgttctcccg 4500
aaaccagact tgcccaaaac atctggcaaa gttgaattgc ttccaaaagt tcacatttat 4560
cagaaggacc tattccctac ggaaactagc aatgggtctc ctggccatct ggatctcgtg 4620
gaagggagcc ttcttcaggg aacagaggga gcgattaagt ggaatgaagc aaacagacct 4680
ggaaaagttc cctttctgag agtagcaaca gaaagctctg caaagactcc ctccaagcta 4740
ttggatcctc ttgcttggga taaccactat ggtactcaga taccaaaaga agagtggaaa 4800
tcccaagaga agtcaccaga aaaaacagct tttaagaaaa aggataccat tttgtccctg 4860
aacgcttgtg aaagcaatca tgcaatagca gcaataaatg agggacaaaa taagcccgaa 4920
atagaagtca cctgggcaaa gcaaggtagg actgaaaggc tgtgctctca aaacccacca 4980
gtcttgaaac gccatcaacg ggaaataact cgtactactc ttcagtcaga tcaagaggaa 5040
attgactatg atgataccat atcagttgaa atgaagaagg aagattttga catttatgat 5100
gaggatgaaa atcagagccc ccgcagcttt caaaagaaaa cacgacacta ttttattgct 5160
gcagtggaga ggctctggga ttatgggatg agtagctccc cacatgttct aagaaacagg 5220
gctcagagtg gcagtgtccc tcagttcaag aaagttgttt tccaggaatt tactgatggc 5280
tcctttactc agcccttata ccgtggagaa ctaaatgaac atttgggact cctggggcca 5340
tatataagag cagaagttga agataatatc atggtaactt tcagaaatca ggcctctcgt 5400
ccctattcct tctattctag ccttatttct tatgaggaag atcagaggca aggagcagaa 5460
cctagaaaaa actttgtcaa gcctaatgaa accaaaactt acttttggaa agtgcaacat 5520
catatggcac ccactaaaga tgagtttgac tgcaaagcct gggcttattt ctctgatgtt 5580
gacctggaaa aagatgtgca ctcaggcctg attggacccc ttctggtctg ccacactaac 5640
acactgaacc ctgctcatgg gagacaagtg acagtacagg aatttgctct gtttttcacc 5700
atctttgatg agaccaaaag ctggtacttc actgaaaata tggaaagaaa ctgcagggct 5760
ccctgcaata tccagatgga agatcccact tttaaagaga attatcgctt ccatgcaatc 5820
aatggctaca taatggatac actacctggc ttagtaatgg ctcaggatca aaggattcga 5880
tggtatctgc tcagcatggg cagcaatgaa aacatccatt ctattcattt cagtggacat 5940
gtgttcactg tacgaaaaaa agaggagtat aaaatggcac tgtacaatct ctatccaggt 6000
gtttttgaga cagtggaaat gttaccatcc aaagctggaa tttggcgggt ggaatgcctt 6060
attggcgagc atctacatgc tgggatgagc acactttttc tggtgtacag caataagtgt 6120
cagactcccc tgggaatggc ttctggacac attagagatt ttcagattac agcttcagga 6180
caatatggac agtgggcccc aaagctggcc agacttcatt attccggatc aatcaatgcc 6240
tggagcacca aggagccctt ttcttggatc aaggtggatc tgttggcacc aatgattatt 6300
cacggcatca agacccaggg tgcccgtcag aagttctcca gcctctacat ctctcagttt 6360
atcatcatgt atagtcttga tgggaagaag tggcagactt atcgaggaaa ttccactgga 6420
accttaatgg tcttctttgg caatgtggat tcatctggga taaaacacaa tatttttaac 6480
cctccaatta ttgctcgata catccgtttg cacccaactc attatagcat tcgcagcact 6540
cttcgcatgg agttgatggg ctgtgattta aatagttgca gcatgccatt gggaatggag 6600
agtaaagcaa tatcagatgc acagattact gcttcatcct actttaccaa tatgtttgcc 6660
acctggtctc cttcaaaagc tcgacttcac ctccaaggga ggagtaatgc ctggagacct 6720
caggtgaata atccaaaaga gtggctgcaa gtggacttcc agaagacaat gaaagtcaca 6780
ggagtaacta ctcagggagt aaaatctctg cttaccagca tgtatgtgaa ggagttcctc 6840
atctccagca gtcaagatgg ccatcagtgg actctctttt ttcagaatgg caaagtaaag 6900
gtttttcagg gaaatcaaga ctccttcaca cctgtggtga actctctaga cccaccgtta 6960
ctgactcgct accttcgaat tcacccccag agttgggtgc accagattgc cctgaggatg 7020
gaggttctgg gctgcgaggc acaggacctc tactag 7056
<210> 2
<211> 7056
<212> DNA
<213> Artificial sequence
<400> 2
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgacc tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaatg acatagagaa gactgaccct tggtttgcac acagaacacc tatgcctaaa 2400
atacaaaatg tctcctctag tgatttgttg atgctcttgc gacagagtcc tactccacat 2460
gggctatcct tatctgatct ccaagaagcc aaatatgaga ctttttctga tgatccatca 2520
cctggagcaa tagacagtaa taacagcctg tctgaaatga cacacttcag gccacagctc 2580
catcacagtg gggacatggt atttacccct gagtcaggcc tccaattaag attaaatgag 2640
aaactgggga caactgcagc aacagagttg aagaaacttg atttcaaagt ttctagtaca 2700
tcaaataatc tgatttcaac aattccatca gacaatttgg cagcaggtac tgataataca 2760
agttccttag gacccccaag tatgccagtt cattatgata gtcaattaga taccactcta 2820
tttggcaaaa agtcatctcc ccttactgag tctggtggac ctctgagctt gagtgaagaa 2880
aataatgatt caaagttgtt agaatcaggt ttaatgaata gccaagaaag ttcatgggga 2940
aaaaatgtat cgtcaacaga gagtggtagg ttatttaaag ggaaaagagc tcatggacct 3000
gctttgttga ctaaagataa tgccttattc aaagttagca tctctttgtt aaagacaaac 3060
aaaacttcca ataattcagc aactaataga aagactcaca ttgatggccc atcattatta 3120
attgagaata gtccatcagt ctggcaaaat atattagaaa gtgacactga gtttaaaaaa 3180
gtgacacctt tgattcatga cagaatgctt atggacaaaa atgctacagc tttgaggcta 3240
aatcatatgt caaataaaac tacttcatca aaaaacatgg aaatggtcca acagaaaaaa 3300
gagggcccca ttccaccaga tgcacaaaat ccagatatgt cgttctttaa gatgctattc 3360
ttgccagaat cagcaaggtg gatacaaagg actcatggaa agaactctct gaactctggg 3420
caaggcccca gtccaaagca attagtatcc ttaggaccag aaaaatctgt ggaaggtcag 3480
aatttcttgt ctgagaaaaa caaagtggta gtaggaaagg gtgaatttac aaaggacgta 3540
ggactcaaag agatggtttt tccaagcagc agaaacctat ttcttactaa cttggataat 3600
ttacatgaaa ataatacaca caatcaagaa aaaaaaattc aggaagaaat agaaaagaag 3660
gaaacattaa tccaagagaa tgtagttttg cctcagatac atacagtgac tggcactaag 3720
aatttcatga agaacctttt cttactgagc actaggcaaa atgtagaagg ttcatatgac 3780
ggggcatatg ctccagtact tcaagatttt aggtcattaa atgattcaac aaatagaaca 3840
aagaaacaca cagctcattt ctcaaaaaaa ggggaggaag aaaacttgga aggcttggga 3900
aatcaaacca agcaaattgt agagaaatat gcatgcacca caaggatatc tcctaataca 3960
agccagcaga attttgtcac gcaacgtagt aagagagctt tgaaacaatt cagactccca 4020
ctagaagaaa cagaacttga aaaaaggata attgtggatg acacctcaac ccagtggtcc 4080
aaaaacatga aacatttgac cccgagcacc ctcacacaga tagactacaa tgagaaggag 4140
aaaggggcca ttactcagtc tcccttatca gattgcctta cgaggagtca tagcatccct 4200
caagcaaata gatctccatt acccattgca aaggtatcat catttccatc tattagacct 4260
atatatctga ccagggtcct attccaagac aactcttctc atcttccagc agcatcttat 4320
agaaagaaag attctggggt ccaagaaagc agtcatttct tacaaggagc caaaaaaaat 4380
aacctttctt tagccattct aaccttggag atgactggtg atcaaagaga ggttggctcc 4440
ctggggacaa gtgccacaaa ttcagtcaca tacaagaaag ttgagaacac tgttctcccg 4500
aaaccagact tgcccaaaac atctggcaaa gttgaattgc ttccaaaagt tcacatttat 4560
cagaaggacc tattccctac ggaaactagc aatgggtctc ctggccatct ggatctcgtg 4620
gaagggagcc ttcttcaggg aacagaggga gcgattaagt ggaatgaagc aaacagacct 4680
ggaaaagttc cctttctgag agtagcaaca gaaagctctg caaagactcc ctccaagcta 4740
ttggatcctc ttgcttggga taaccactat ggtactcaga taccaaaaga agagtggaaa 4800
tcccaagaga agtcaccaga aaaaacagct tttaagaaaa aggataccat tttgtccctg 4860
aacgcttgtg aaagcaatca tgcaatagca gcaataaatg agggacaaaa taagcccgaa 4920
atagaagtca cctgggcaaa gcaaggtagg actgaaaggc tgtgctctca aaacccacca 4980
gtcttgaaac gccatcaacg ggaaataact cgtactactc ttcagtcaga tcaagaggaa 5040
attgactatg atgataccat atcagttgaa atgaagaagg aagattttga catttatgat 5100
gaggatgaaa atcagagccc ccgcagcttt caaaagaaaa cacgacacta ttttattgct 5160
gcagtggaga ggctctggga ttatgggatg agtagctccc cacatgttct aagaaacagg 5220
gctcagagtg gcagtgtccc tcagttcaag aaagttgttt tccaggaatt tactgatggc 5280
tcctttactc agcccttata ccgtggagaa ctaaatgaac atttgggact cctggggcca 5340
tatataagag cagaagttga agataatatc atggtaactt tcagaaatca ggcctctcgt 5400
ccctattcct tctattctag ccttatttct tatgaggaag atcagaggca aggagcagaa 5460
cctagaaaaa actttgtcaa gcctaatgaa accaaaactt acttttggaa agtgcaacat 5520
catatggcac ccactaaaga tgagtttgac tgcaaagcct gggcttattt ctctgatgtt 5580
gacctggaaa aagatgtgca ctcaggcctg attggacccc ttctggtctg ccacactaac 5640
acactgaacc ctgctcatgg gagacaagtg acagtacagg aatttgctct gtttttcacc 5700
atctttgatg agaccaaaag ctggtacttc actgaaaata tggaaagaaa ctgcagggct 5760
ccctgcaata tccagatgga agatcccact tttaaagaga attatcgctt ccatgcaatc 5820
aatggctaca taatggatac actacctggc ttagtaatgg ctcaggatca aaggattcga 5880
tggtatctgc tcagcatggg cagcaatgaa aacatccatt ctattcattt cagtggacat 5940
gtgttcactg tacgaaaaaa agaggagtat aaaatggcac tgtacaatct ctatccaggt 6000
gtttttgaga cagtggaaat gttaccatcc aaagctggaa tttggcgggt ggaatgcctt 6060
attggcgagc atctacatgc tgggatgagc acactttttc tggtgtacag caataagtgt 6120
cagactcccc tgggaatggc ttctggacac attagagatt ttcagattac agcttcagga 6180
caatatggac agtgggcccc aaagctggcc agacttcatt attccggatc aatcaatgcc 6240
tggagcacca aggagccctt ttcttggatc aaggtggatc tgttggcacc aatgattatt 6300
cacggcatca agacccaggg tgcccgtcag aagttctcca gcctctacat ctctcagttt 6360
atcatcatgt atagtcttga tgggaagaag tggcagactt atcgaggaaa ttccactgga 6420
accttaatgg tcttctttgg caatgtggat tcatctggga taaaacacaa tatttttaac 6480
cctccaatta ttgctcgata catccgtttg cacccaactc attatagcat tcgcagcact 6540
cttcgcatgg agttgatggg ctgtgattta aatagttgca gcatgccatt gggaatggag 6600
agtaaagcaa tatcagatgc acagattact gcttcatcct actttaccaa tatgtttgcc 6660
acctggtctc cttcaaaagc tcgacttcac ctccaaggga ggagtaatgc ctggagacct 6720
caggtgaata atccaaaaga gtggctgcaa gtggacttcc agaagacaat gaaagtcaca 6780
ggagtaacta ctcagggagt aaaatctctg cttaccagca tgtatgtgaa ggagttcctc 6840
atctccagca gtcaagatgg ccatcagtgg actctctttt ttcagaatgg caaagtaaag 6900
gtttttcagg gaaatcaaga ctccttcaca cctgtggtga actctctaga cccaccgtta 6960
ctgactcgct accttcgaat tcacccccag agttgggtgc accagattgc cctgaggatg 7020
gaggttctgg gctgcgaggc acaggacctc tactag 7056
<210> 3
<211> 7056
<212> DNA
<213> Artificial sequence
<400> 3
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgaca tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaatg acatagagaa gactgaccct tggtttgcac acagaacacc tatgcctaaa 2400
atacaaaatg tctcctctag tgatttgttg atgctcttgc gacagagtcc tactccacat 2460
gggctatcct tatctgatct ccaagaagcc aaatatgaga ctttttctga tgatccatca 2520
cctggagcaa tagacagtaa taacagcctg tctgaaatga cacacttcag gccacagctc 2580
catcacagtg gggacatggt atttacccct gagtcaggcc tccaattaag attaaatgag 2640
aaactgggga caactgcagc aacagagttg aagaaacttg atttcaaagt ttctagtaca 2700
tcaaataatc tgatttcaac aattccatca gacaatttgg cagcaggtac tgataataca 2760
agttccttag gacccccaag tatgccagtt cattatgata gtcaattaga taccactcta 2820
tttggcaaaa agtcatctcc ccttactgag tctggtggac ctctgagctt gagtgaagaa 2880
aataatgatt caaagttgtt agaatcaggt ttaatgaata gccaagaaag ttcatgggga 2940
aaaaatgtat cgtcaacaga gagtggtagg ttatttaaag ggaaaagagc tcatggacct 3000
gctttgttga ctaaagataa tgccttattc aaagttagca tctctttgtt aaagacaaac 3060
aaaacttcca ataattcagc aactaataga aagactcaca ttgatggccc atcattatta 3120
attgagaata gtccatcagt ctggcaaaat atattagaaa gtgacactga gtttaaaaaa 3180
gtgacacctt tgattcatga cagaatgctt atggacaaaa atgctacagc tttgaggcta 3240
aatcatatgt caaataaaac tacttcatca aaaaacatgg aaatggtcca acagaaaaaa 3300
gagggcccca ttccaccaga tgcacaaaat ccagatatgt cgttctttaa gatgctattc 3360
ttgccagaat cagcaaggtg gatacaaagg actcatggaa agaactctct gaactctggg 3420
caaggcccca gtccaaagca attagtatcc ttaggaccag aaaaatctgt ggaaggtcag 3480
aatttcttgt ctgagaaaaa caaagtggta gtaggaaagg gtgaatttac aaaggacgta 3540
ggactcaaag agatggtttt tccaagcagc agaaacctat ttcttactaa cttggataat 3600
ttacatgaaa ataatacaca caatcaagaa aaaaaaattc aggaagaaat agaaaagaag 3660
gaaacattaa tccaagagaa tgtagttttg cctcagatac atacagtgac tggcactaag 3720
aatttcatga agaacctttt cttactgagc actaggcaaa atgtagaagg ttcatatgac 3780
ggggcatatg ctccagtact tcaagatttt aggtcattaa atgattcaac aaatagaaca 3840
aagaaacaca cagctcattt ctcaaaaaaa ggggaggaag aaaacttgga aggcttggga 3900
aatcaaacca agcaaattgt agagaaatat gcatgcacca caaggatatc tcctaataca 3960
agccagcaga attttgtcac gcaacgtagt aagagagctt tgaaacaatt cagactccca 4020
ctagaagaaa cagaacttga aaaaaggata attgtggatg acacctcaac ccagtggtcc 4080
aaaaacatga aacatttgac cccgagcacc ctcacacaga tagactacaa tgagaaggag 4140
aaaggggcca ttactcagtc tcccttatca gattgcctta cgaggagtca tagcatccct 4200
caagcaaata gatctccatt acccattgca aaggtatcat catttccatc tattagacct 4260
atatatctga ccagggtcct attccaagac aactcttctc atcttccagc agcatcttat 4320
agaaagaaag attctggggt ccaagaaagc agtcatttct tacaaggagc caaaaaaaat 4380
aacctttctt tagccattct aaccttggag atgactggtg atcaaagaga ggttggctcc 4440
ctggggacaa gtgccacaaa ttcagtcaca tacaagaaag ttgagaacac tgttctcccg 4500
aaaccagact tgcccaaaac atctggcaaa gttgaattgc ttccaaaagt tcacatttat 4560
cagaaggacc tattccctac ggaaactagc aatgggtctc ctggccatct ggatctcgtg 4620
gaagggagcc ttcttcaggg aacagaggga gcgattaagt ggaatgaagc aaacagacct 4680
ggaaaagttc cctttctgag agtagcaaca gaaagctctg caaagactcc ctccaagcta 4740
ttggatcctc ttgcttggga taaccactat ggtactcaga taccaaaaga agagtggaaa 4800
tcccaagaga agtcaccaga aaaaacagct tttaagaaaa aggataccat tttgtccctg 4860
aacgcttgtg aaagcaatca tgcaatagca gcaataaatg agggacaaaa taagcccgaa 4920
atagaagtca cctgggcaaa gcaaggtagg actgaaaggc tgtgctctca aaacccacca 4980
gtcttgaaac gccatcaacg ggaaataact cgtactactc ttcagtcaga tcaagaggaa 5040
attgactatg atgataccat atcagttgaa atgaagaagg aagattttga catttatgat 5100
gaggatgaaa atcagagccc ccgcagcttt caaaagaaaa cacgacacta ttttattgct 5160
gcagtggaga ggctctggga ttatgggatg agtagctccc cacatgttct aagaaacagg 5220
gctcagagtg gcagtgtccc tcagttcaag aaagttgttt tccaggaatt tactgatggc 5280
tcctttactc agcccttata ccgtggagaa ctaaatgaac atttgggact cctggggcca 5340
tatataagag cagaagttga agataatatc atggtaactt tcagaaatca ggcctctcgt 5400
ccctattcct tctattctag ccttatttct tatgaggaag atcagaggca aggagcagaa 5460
cctagaaaaa actttgtcaa gcctaatgaa accaaaactt acttttggaa agtgcaacat 5520
catatggcac ccactaaaga tgagtttgac tgcaaagcct gggcttattt ctctgatgtt 5580
gacctggaaa aagatgtgca ctcaggcctg attggacccc ttctggtctg ccacactaac 5640
acactgaacc ctgctcatgg gagacaagtg acagtacagg aatttgctct gtttttcacc 5700
atctttgatg agaccaaaag ctggtacttc actgaaaata tggaaagaaa ctgcagggct 5760
ccctgcaata tccagatgga agatcccact tttaaagaga attatcgctt ccatgcaatc 5820
aatggctaca taatggatac actacctggc ttagtaatgg ctcaggatca aaggattcga 5880
tggtatctgc tcagcatggg cagcaatgaa aacatccatt ctattcattt cagtggacat 5940
gtgttcactg tacgaaaaaa agaggagtat aaaatggcac tgtacaatct ctatccaggt 6000
gtttttgaga cagtggaaat gttaccatcc aaagctggaa tttggcgggt ggaatgcctt 6060
attggcgagc atctacatgc tgggatgagc acactttttc tggtgtacag caataagtgt 6120
cagactcccc tgggaatggc ttctggacac attagagatt ttcagattac agcttcagga 6180
caatatggac agtgggcccc aaagctggcc agacttcatt attccggatc aatcaatgcc 6240
tggagcacca aggagccctt ttcttggatc aaggtggatc tgttggcacc aatgattatt 6300
cacggcatca agacccaggg tgcccgtcag aagttctcca gcctctacat ctctcagttt 6360
atcatcatgt atagtcttga tgggaagaag tggcagactt atcgaggaaa ttccactgga 6420
accttaatgg tcttctttgg caatgtggat tcatctggga taaaacacaa tatttttaac 6480
cctccaatta ttgctcgata catccgtttg cacccaactc attatagcat tcgcagcact 6540
cttcgcatgg agttgatggg ctgtgattta aatagttgca gcatgccatt gggaatggag 6600
agtaaagcaa tatcagatgc acagattact gcttcatcct actttaccaa tatgtttgcc 6660
acctggtctc cttcaaaagc tcgacttcac ctccaaggga ggagtaatgc ctggagacct 6720
caggtgaata atccaaaaga gtggctgcaa gtggacttcc agaagacaat gaaagtcaca 6780
ggagtaacta ctcagggagt aaaatctctg cttaccagca tgtatgtgaa ggagttcctc 6840
atctccagca gtcaagatgg ccatcagtgg actctctttt ttcagaatgg caaagtaaag 6900
gtttttcagg gaaatcaaga ctccttcaca cctgtggtga actctctaga cccaccgtta 6960
ctgactcgct accttcgaat tcacccccag agttgggtgc accagattgc cctgaggatg 7020
gaggttctgg gctgcgaggc acaggacctc tactag 7056
<210> 4
<211> 7056
<212> DNA
<213> Artificial sequence
<400> 4
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgacg tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaatg acatagagaa gactgaccct tggtttgcac acagaacacc tatgcctaaa 2400
atacaaaatg tctcctctag tgatttgttg atgctcttgc gacagagtcc tactccacat 2460
gggctatcct tatctgatct ccaagaagcc aaatatgaga ctttttctga tgatccatca 2520
cctggagcaa tagacagtaa taacagcctg tctgaaatga cacacttcag gccacagctc 2580
catcacagtg gggacatggt atttacccct gagtcaggcc tccaattaag attaaatgag 2640
aaactgggga caactgcagc aacagagttg aagaaacttg atttcaaagt ttctagtaca 2700
tcaaataatc tgatttcaac aattccatca gacaatttgg cagcaggtac tgataataca 2760
agttccttag gacccccaag tatgccagtt cattatgata gtcaattaga taccactcta 2820
tttggcaaaa agtcatctcc ccttactgag tctggtggac ctctgagctt gagtgaagaa 2880
aataatgatt caaagttgtt agaatcaggt ttaatgaata gccaagaaag ttcatgggga 2940
aaaaatgtat cgtcaacaga gagtggtagg ttatttaaag ggaaaagagc tcatggacct 3000
gctttgttga ctaaagataa tgccttattc aaagttagca tctctttgtt aaagacaaac 3060
aaaacttcca ataattcagc aactaataga aagactcaca ttgatggccc atcattatta 3120
attgagaata gtccatcagt ctggcaaaat atattagaaa gtgacactga gtttaaaaaa 3180
gtgacacctt tgattcatga cagaatgctt atggacaaaa atgctacagc tttgaggcta 3240
aatcatatgt caaataaaac tacttcatca aaaaacatgg aaatggtcca acagaaaaaa 3300
gagggcccca ttccaccaga tgcacaaaat ccagatatgt cgttctttaa gatgctattc 3360
ttgccagaat cagcaaggtg gatacaaagg actcatggaa agaactctct gaactctggg 3420
caaggcccca gtccaaagca attagtatcc ttaggaccag aaaaatctgt ggaaggtcag 3480
aatttcttgt ctgagaaaaa caaagtggta gtaggaaagg gtgaatttac aaaggacgta 3540
ggactcaaag agatggtttt tccaagcagc agaaacctat ttcttactaa cttggataat 3600
ttacatgaaa ataatacaca caatcaagaa aaaaaaattc aggaagaaat agaaaagaag 3660
gaaacattaa tccaagagaa tgtagttttg cctcagatac atacagtgac tggcactaag 3720
aatttcatga agaacctttt cttactgagc actaggcaaa atgtagaagg ttcatatgac 3780
ggggcatatg ctccagtact tcaagatttt aggtcattaa atgattcaac aaatagaaca 3840
aagaaacaca cagctcattt ctcaaaaaaa ggggaggaag aaaacttgga aggcttggga 3900
aatcaaacca agcaaattgt agagaaatat gcatgcacca caaggatatc tcctaataca 3960
agccagcaga attttgtcac gcaacgtagt aagagagctt tgaaacaatt cagactccca 4020
ctagaagaaa cagaacttga aaaaaggata attgtggatg acacctcaac ccagtggtcc 4080
aaaaacatga aacatttgac cccgagcacc ctcacacaga tagactacaa tgagaaggag 4140
aaaggggcca ttactcagtc tcccttatca gattgcctta cgaggagtca tagcatccct 4200
caagcaaata gatctccatt acccattgca aaggtatcat catttccatc tattagacct 4260
atatatctga ccagggtcct attccaagac aactcttctc atcttccagc agcatcttat 4320
agaaagaaag attctggggt ccaagaaagc agtcatttct tacaaggagc caaaaaaaat 4380
aacctttctt tagccattct aaccttggag atgactggtg atcaaagaga ggttggctcc 4440
ctggggacaa gtgccacaaa ttcagtcaca tacaagaaag ttgagaacac tgttctcccg 4500
aaaccagact tgcccaaaac atctggcaaa gttgaattgc ttccaaaagt tcacatttat 4560
cagaaggacc tattccctac ggaaactagc aatgggtctc ctggccatct ggatctcgtg 4620
gaagggagcc ttcttcaggg aacagaggga gcgattaagt ggaatgaagc aaacagacct 4680
ggaaaagttc cctttctgag agtagcaaca gaaagctctg caaagactcc ctccaagcta 4740
ttggatcctc ttgcttggga taaccactat ggtactcaga taccaaaaga agagtggaaa 4800
tcccaagaga agtcaccaga aaaaacagct tttaagaaaa aggataccat tttgtccctg 4860
aacgcttgtg aaagcaatca tgcaatagca gcaataaatg agggacaaaa taagcccgaa 4920
atagaagtca cctgggcaaa gcaaggtagg actgaaaggc tgtgctctca aaacccacca 4980
gtcttgaaac gccatcaacg ggaaataact cgtactactc ttcagtcaga tcaagaggaa 5040
attgactatg atgataccat atcagttgaa atgaagaagg aagattttga catttatgat 5100
gaggatgaaa atcagagccc ccgcagcttt caaaagaaaa cacgacacta ttttattgct 5160
gcagtggaga ggctctggga ttatgggatg agtagctccc cacatgttct aagaaacagg 5220
gctcagagtg gcagtgtccc tcagttcaag aaagttgttt tccaggaatt tactgatggc 5280
tcctttactc agcccttata ccgtggagaa ctaaatgaac atttgggact cctggggcca 5340
tatataagag cagaagttga agataatatc atggtaactt tcagaaatca ggcctctcgt 5400
ccctattcct tctattctag ccttatttct tatgaggaag atcagaggca aggagcagaa 5460
cctagaaaaa actttgtcaa gcctaatgaa accaaaactt acttttggaa agtgcaacat 5520
catatggcac ccactaaaga tgagtttgac tgcaaagcct gggcttattt ctctgatgtt 5580
gacctggaaa aagatgtgca ctcaggcctg attggacccc ttctggtctg ccacactaac 5640
acactgaacc ctgctcatgg gagacaagtg acagtacagg aatttgctct gtttttcacc 5700
atctttgatg agaccaaaag ctggtacttc actgaaaata tggaaagaaa ctgcagggct 5760
ccctgcaata tccagatgga agatcccact tttaaagaga attatcgctt ccatgcaatc 5820
aatggctaca taatggatac actacctggc ttagtaatgg ctcaggatca aaggattcga 5880
tggtatctgc tcagcatggg cagcaatgaa aacatccatt ctattcattt cagtggacat 5940
gtgttcactg tacgaaaaaa agaggagtat aaaatggcac tgtacaatct ctatccaggt 6000
gtttttgaga cagtggaaat gttaccatcc aaagctggaa tttggcgggt ggaatgcctt 6060
attggcgagc atctacatgc tgggatgagc acactttttc tggtgtacag caataagtgt 6120
cagactcccc tgggaatggc ttctggacac attagagatt ttcagattac agcttcagga 6180
caatatggac agtgggcccc aaagctggcc agacttcatt attccggatc aatcaatgcc 6240
tggagcacca aggagccctt ttcttggatc aaggtggatc tgttggcacc aatgattatt 6300
cacggcatca agacccaggg tgcccgtcag aagttctcca gcctctacat ctctcagttt 6360
atcatcatgt atagtcttga tgggaagaag tggcagactt atcgaggaaa ttccactgga 6420
accttaatgg tcttctttgg caatgtggat tcatctggga taaaacacaa tatttttaac 6480
cctccaatta ttgctcgata catccgtttg cacccaactc attatagcat tcgcagcact 6540
cttcgcatgg agttgatggg ctgtgattta aatagttgca gcatgccatt gggaatggag 6600
agtaaagcaa tatcagatgc acagattact gcttcatcct actttaccaa tatgtttgcc 6660
acctggtctc cttcaaaagc tcgacttcac ctccaaggga ggagtaatgc ctggagacct 6720
caggtgaata atccaaaaga gtggctgcaa gtggacttcc agaagacaat gaaagtcaca 6780
ggagtaacta ctcagggagt aaaatctctg cttaccagca tgtatgtgaa ggagttcctc 6840
atctccagca gtcaagatgg ccatcagtgg actctctttt ttcagaatgg caaagtaaag 6900
gtttttcagg gaaatcaaga ctccttcaca cctgtggtga actctctaga cccaccgtta 6960
ctgactcgct accttcgaat tcacccccag agttgggtgc accagattgc cctgaggatg 7020
gaggttctgg gctgcgaggc acaggacctc tactag 7056
<210> 5
<211> 4374
<212> DNA
<213> Artificial sequence
<400> 5
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttact tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaagagg catcagcggg agatcacccg cacgaccctg 2340
cagtcggatc aggaggagat tgattacgac gacacgatca gtgtggagat gaagaaggag 2400
gacttcgaca tctacgacga agatgaaaac cagtcccctc ggtccttcca aaagaagacc 2460
cggcactact tcatcgccgc tgtggaacgc ctgtgggact atggaatgtc ttctagccct 2520
cacgttttga ggaaccgcgc ccagtcgggc agcgtgcccc agttcaagaa agtggtgttc 2580
caggagttca ccgacggctc cttcacccag ccactttacc ggggcgagct caatgaacat 2640
ctgggcctgc tgggacccta catcagggct gaggtggagg acaacatcat ggtgacattc 2700
cggaatcagg ccagcagacc atacagtttc tacagttcac tcatctccta cgaggaggac 2760
cagcgccagg gggctgaacc ccgtaagaac ttcgtgaagc caaacgaaac aaagacctac 2820
ttctggaagg tccagcacca catggcacct accaaggacg agttcgattg caaggcctgg 2880
gcctacttct ccgacgtgga cctggagaaa gatgtgcaca gcggcctgat tggccctctg 2940
ctggtgtgtc acacgaacac actcaaccct gcacacgggc ggcaggtcac tgtgcaggaa 3000
ttcgccctgt tctttaccat ctttgatgag acgaagtcct ggtatttcac cgaaaacatg 3060
gagaggaact gccgcgcacc ctgcaacatc cagatggaag atccgacatt caaggagaac 3120
taccggttcc atgccatcaa tggctacatc atggacaccc tgcctggcct cgtgatggcc 3180
caagaccagc gtatccgctg gtatctgctg tcgatgggct ccaacgagaa catccatagt 3240
atccacttca gcgggcatgt cttcacggtg aggaaaaagg aggagtacaa gatggcactg 3300
tacaacctct atcccggcgt gttcgagacc gtggagatgc tgccctccaa ggccggcatc 3360
tggagagtgg aatgcctgat cggcgagcac ctccacgctg ggatgtccac gctgttcctc 3420
gtttacagca ataagtgcca gacccctctg ggcatggcga gcggccacat ccgcgacttc 3480
cagattacag ccagcggcca gtacggtcag tgggctccaa agctggcccg tctgcactac 3540
tccggatcca tcaacgcctg gtccaccaag gaaccgttct cctggatcaa agtagacctg 3600
ctagccccca tgatcattca cggcatcaag acacaaggcg cccgacagaa gttctcgagc 3660
ctctatatct cccagttcat catcatgtat agcctggacg gaaagaagtg gcagacttac 3720
cgcggaaact cgacagggac cctgatggta ttcttcggta acgtggacag ctccggaatc 3780
aagcacaaca tcttcaaccc acccattatc gcccgctaca tccgcctgca ccccactcac 3840
tatagcatta ggtccaccct gcgaatggag ctcatgggct gtgacctgaa cagctgtagc 3900
atgcccctcg gcatggagtc taaggcgatc tccgacgcac agataacggc atcatcctac 3960
tttaccaaca tgttcgctac ctggtccccc tccaaggccc gactccacct gcaagggaga 4020
tccaacgcct ggcggccaca ggtcaacaat cccaaggagt ggctgcaagt ggactttcag 4080
aaaactatga aagtcaccgg agtgaccaca cagggagtga agtctctgct gaccagcatg 4140
tacgtgaagg agttcctcat ctccagttcg caggatggcc accagtggac gttgttcttc 4200
caaaacggta aagtcaaagt cttccaaggg aaccaggaca gctttacacc cgtcgtgaac 4260
tccctggacc ccccgcttct cactagatac ctccgcatcc accctcagag ctgggtgcac 4320
cagattgccc tgcgcatgga ggttctgggg tgtgaagccc aggacctgta ctaa 4374
<210> 6
<211> 4374
<212> DNA
<213> Artificial sequence
<400> 6
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttacc tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaagagg catcagcggg agatcacccg cacgaccctg 2340
cagtcggatc aggaggagat tgattacgac gacacgatca gtgtggagat gaagaaggag 2400
gacttcgaca tctacgacga agatgaaaac cagtcccctc ggtccttcca aaagaagacc 2460
cggcactact tcatcgccgc tgtggaacgc ctgtgggact atggaatgtc ttctagccct 2520
cacgttttga ggaaccgcgc ccagtcgggc agcgtgcccc agttcaagaa agtggtgttc 2580
caggagttca ccgacggctc cttcacccag ccactttacc ggggcgagct caatgaacat 2640
ctgggcctgc tgggacccta catcagggct gaggtggagg acaacatcat ggtgacattc 2700
cggaatcagg ccagcagacc atacagtttc tacagttcac tcatctccta cgaggaggac 2760
cagcgccagg gggctgaacc ccgtaagaac ttcgtgaagc caaacgaaac aaagacctac 2820
ttctggaagg tccagcacca catggcacct accaaggacg agttcgattg caaggcctgg 2880
gcctacttct ccgacgtgga cctggagaaa gatgtgcaca gcggcctgat tggccctctg 2940
ctggtgtgtc acacgaacac actcaaccct gcacacgggc ggcaggtcac tgtgcaggaa 3000
ttcgccctgt tctttaccat ctttgatgag acgaagtcct ggtatttcac cgaaaacatg 3060
gagaggaact gccgcgcacc ctgcaacatc cagatggaag atccgacatt caaggagaac 3120
taccggttcc atgccatcaa tggctacatc atggacaccc tgcctggcct cgtgatggcc 3180
caagaccagc gtatccgctg gtatctgctg tcgatgggct ccaacgagaa catccatagt 3240
atccacttca gcgggcatgt cttcacggtg aggaaaaagg aggagtacaa gatggcactg 3300
tacaacctct atcccggcgt gttcgagacc gtggagatgc tgccctccaa ggccggcatc 3360
tggagagtgg aatgcctgat cggcgagcac ctccacgctg ggatgtccac gctgttcctc 3420
gtttacagca ataagtgcca gacccctctg ggcatggcga gcggccacat ccgcgacttc 3480
cagattacag ccagcggcca gtacggtcag tgggctccaa agctggcccg tctgcactac 3540
tccggatcca tcaacgcctg gtccaccaag gaaccgttct cctggatcaa agtagacctg 3600
ctagccccca tgatcattca cggcatcaag acacaaggcg cccgacagaa gttctcgagc 3660
ctctatatct cccagttcat catcatgtat agcctggacg gaaagaagtg gcagacttac 3720
cgcggaaact cgacagggac cctgatggta ttcttcggta acgtggacag ctccggaatc 3780
aagcacaaca tcttcaaccc acccattatc gcccgctaca tccgcctgca ccccactcac 3840
tatagcatta ggtccaccct gcgaatggag ctcatgggct gtgacctgaa cagctgtagc 3900
atgcccctcg gcatggagtc taaggcgatc tccgacgcac agataacggc atcatcctac 3960
tttaccaaca tgttcgctac ctggtccccc tccaaggccc gactccacct gcaagggaga 4020
tccaacgcct ggcggccaca ggtcaacaat cccaaggagt ggctgcaagt ggactttcag 4080
aaaactatga aagtcaccgg agtgaccaca cagggagtga agtctctgct gaccagcatg 4140
tacgtgaagg agttcctcat ctccagttcg caggatggcc accagtggac gttgttcttc 4200
caaaacggta aagtcaaagt cttccaaggg aaccaggaca gctttacacc cgtcgtgaac 4260
tccctggacc ccccgcttct cactagatac ctccgcatcc accctcagag ctgggtgcac 4320
cagattgccc tgcgcatgga ggttctgggg tgtgaagccc aggacctgta ctaa 4374
<210> 7
<211> 4374
<212> DNA
<213> Artificial sequence
<400> 7
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttaca tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaagagg catcagcggg agatcacccg cacgaccctg 2340
cagtcggatc aggaggagat tgattacgac gacacgatca gtgtggagat gaagaaggag 2400
gacttcgaca tctacgacga agatgaaaac cagtcccctc ggtccttcca aaagaagacc 2460
cggcactact tcatcgccgc tgtggaacgc ctgtgggact atggaatgtc ttctagccct 2520
cacgttttga ggaaccgcgc ccagtcgggc agcgtgcccc agttcaagaa agtggtgttc 2580
caggagttca ccgacggctc cttcacccag ccactttacc ggggcgagct caatgaacat 2640
ctgggcctgc tgggacccta catcagggct gaggtggagg acaacatcat ggtgacattc 2700
cggaatcagg ccagcagacc atacagtttc tacagttcac tcatctccta cgaggaggac 2760
cagcgccagg gggctgaacc ccgtaagaac ttcgtgaagc caaacgaaac aaagacctac 2820
ttctggaagg tccagcacca catggcacct accaaggacg agttcgattg caaggcctgg 2880
gcctacttct ccgacgtgga cctggagaaa gatgtgcaca gcggcctgat tggccctctg 2940
ctggtgtgtc acacgaacac actcaaccct gcacacgggc ggcaggtcac tgtgcaggaa 3000
ttcgccctgt tctttaccat ctttgatgag acgaagtcct ggtatttcac cgaaaacatg 3060
gagaggaact gccgcgcacc ctgcaacatc cagatggaag atccgacatt caaggagaac 3120
taccggttcc atgccatcaa tggctacatc atggacaccc tgcctggcct cgtgatggcc 3180
caagaccagc gtatccgctg gtatctgctg tcgatgggct ccaacgagaa catccatagt 3240
atccacttca gcgggcatgt cttcacggtg aggaaaaagg aggagtacaa gatggcactg 3300
tacaacctct atcccggcgt gttcgagacc gtggagatgc tgccctccaa ggccggcatc 3360
tggagagtgg aatgcctgat cggcgagcac ctccacgctg ggatgtccac gctgttcctc 3420
gtttacagca ataagtgcca gacccctctg ggcatggcga gcggccacat ccgcgacttc 3480
cagattacag ccagcggcca gtacggtcag tgggctccaa agctggcccg tctgcactac 3540
tccggatcca tcaacgcctg gtccaccaag gaaccgttct cctggatcaa agtagacctg 3600
ctagccccca tgatcattca cggcatcaag acacaaggcg cccgacagaa gttctcgagc 3660
ctctatatct cccagttcat catcatgtat agcctggacg gaaagaagtg gcagacttac 3720
cgcggaaact cgacagggac cctgatggta ttcttcggta acgtggacag ctccggaatc 3780
aagcacaaca tcttcaaccc acccattatc gcccgctaca tccgcctgca ccccactcac 3840
tatagcatta ggtccaccct gcgaatggag ctcatgggct gtgacctgaa cagctgtagc 3900
atgcccctcg gcatggagtc taaggcgatc tccgacgcac agataacggc atcatcctac 3960
tttaccaaca tgttcgctac ctggtccccc tccaaggccc gactccacct gcaagggaga 4020
tccaacgcct ggcggccaca ggtcaacaat cccaaggagt ggctgcaagt ggactttcag 4080
aaaactatga aagtcaccgg agtgaccaca cagggagtga agtctctgct gaccagcatg 4140
tacgtgaagg agttcctcat ctccagttcg caggatggcc accagtggac gttgttcttc 4200
caaaacggta aagtcaaagt cttccaaggg aaccaggaca gctttacacc cgtcgtgaac 4260
tccctggacc ccccgcttct cactagatac ctccgcatcc accctcagag ctgggtgcac 4320
cagattgccc tgcgcatgga ggttctgggg tgtgaagccc aggacctgta ctaa 4374
<210> 8
<211> 4374
<212> DNA
<213> Artificial sequence
<400> 8
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttacg tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaagagg catcagcggg agatcacccg cacgaccctg 2340
cagtcggatc aggaggagat tgattacgac gacacgatca gtgtggagat gaagaaggag 2400
gacttcgaca tctacgacga agatgaaaac cagtcccctc ggtccttcca aaagaagacc 2460
cggcactact tcatcgccgc tgtggaacgc ctgtgggact atggaatgtc ttctagccct 2520
cacgttttga ggaaccgcgc ccagtcgggc agcgtgcccc agttcaagaa agtggtgttc 2580
caggagttca ccgacggctc cttcacccag ccactttacc ggggcgagct caatgaacat 2640
ctgggcctgc tgggacccta catcagggct gaggtggagg acaacatcat ggtgacattc 2700
cggaatcagg ccagcagacc atacagtttc tacagttcac tcatctccta cgaggaggac 2760
cagcgccagg gggctgaacc ccgtaagaac ttcgtgaagc caaacgaaac aaagacctac 2820
ttctggaagg tccagcacca catggcacct accaaggacg agttcgattg caaggcctgg 2880
gcctacttct ccgacgtgga cctggagaaa gatgtgcaca gcggcctgat tggccctctg 2940
ctggtgtgtc acacgaacac actcaaccct gcacacgggc ggcaggtcac tgtgcaggaa 3000
ttcgccctgt tctttaccat ctttgatgag acgaagtcct ggtatttcac cgaaaacatg 3060
gagaggaact gccgcgcacc ctgcaacatc cagatggaag atccgacatt caaggagaac 3120
taccggttcc atgccatcaa tggctacatc atggacaccc tgcctggcct cgtgatggcc 3180
caagaccagc gtatccgctg gtatctgctg tcgatgggct ccaacgagaa catccatagt 3240
atccacttca gcgggcatgt cttcacggtg aggaaaaagg aggagtacaa gatggcactg 3300
tacaacctct atcccggcgt gttcgagacc gtggagatgc tgccctccaa ggccggcatc 3360
tggagagtgg aatgcctgat cggcgagcac ctccacgctg ggatgtccac gctgttcctc 3420
gtttacagca ataagtgcca gacccctctg ggcatggcga gcggccacat ccgcgacttc 3480
cagattacag ccagcggcca gtacggtcag tgggctccaa agctggcccg tctgcactac 3540
tccggatcca tcaacgcctg gtccaccaag gaaccgttct cctggatcaa agtagacctg 3600
ctagccccca tgatcattca cggcatcaag acacaaggcg cccgacagaa gttctcgagc 3660
ctctatatct cccagttcat catcatgtat agcctggacg gaaagaagtg gcagacttac 3720
cgcggaaact cgacagggac cctgatggta ttcttcggta acgtggacag ctccggaatc 3780
aagcacaaca tcttcaaccc acccattatc gcccgctaca tccgcctgca ccccactcac 3840
tatagcatta ggtccaccct gcgaatggag ctcatgggct gtgacctgaa cagctgtagc 3900
atgcccctcg gcatggagtc taaggcgatc tccgacgcac agataacggc atcatcctac 3960
tttaccaaca tgttcgctac ctggtccccc tccaaggccc gactccacct gcaagggaga 4020
tccaacgcct ggcggccaca ggtcaacaat cccaaggagt ggctgcaagt ggactttcag 4080
aaaactatga aagtcaccgg agtgaccaca cagggagtga agtctctgct gaccagcatg 4140
tacgtgaagg agttcctcat ctccagttcg caggatggcc accagtggac gttgttcttc 4200
caaaacggta aagtcaaagt cttccaaggg aaccaggaca gctttacacc cgtcgtgaac 4260
tccctggacc ccccgcttct cactagatac ctccgcatcc accctcagag ctgggtgcac 4320
cagattgccc tgcgcatgga ggttctgggg tgtgaagccc aggacctgta ctaa 4374
<210> 9
<211> 4395
<212> DNA
<213> Artificial sequence
<400> 9
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgact tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca cccttctcaa aacccaccag tcttgaaacg ccatcaacgg 2340
gaaataactc gtactactct tcagtcagat caagaggaaa ttgactatga tgataccata 2400
tcagttgaaa tgaagaagga agattttgac atttatgatg aggatgaaaa tcagagcccc 2460
cgcagctttc aaaagaaaac acgacactat tttattgctg cagtggagag gctctgggat 2520
tatgggatga gtagctcccc acatgttcta agaaacaggg ctcagagtgg cagtgtccct 2580
cagttcaaga aagttgtttt ccaggaattt actgatggct cctttactca gcccttatac 2640
cgtggagaac taaatgaaca tttgggactc ctggggccat atataagagc agaagttgaa 2700
gataatatca tggtaacttt cagaaatcag gcctctcgtc cctattcctt ctattctagc 2760
cttatttctt atgaggaaga tcagaggcaa ggagcagaac ctagaaaaaa ctttgtcaag 2820
cctaatgaaa ccaaaactta cttttggaaa gtgcaacatc atatggcacc cactaaagat 2880
gagtttgact gcaaagcctg ggcttatttc tctgatgttg acctggaaaa agatgtgcac 2940
tcaggcctga ttggacccct tctggtctgc cacactaaca cactgaaccc tgctcatggg 3000
agacaagtga cagtacagga atttgctctg tttttcacca tctttgatga gaccaaaagc 3060
tggtacttca ctgaaaatat ggaaagaaac tgcagggctc cctgcaatat ccagatggaa 3120
gatcccactt ttaaagagaa ttatcgcttc catgcaatca atggctacat aatggataca 3180
ctacctggct tagtaatggc tcaggatcaa aggattcgat ggtatctgct cagcatgggc 3240
agcaatgaaa acatccattc tattcatttc agtggacatg tgttcactgt acgaaaaaaa 3300
gaggagtata aaatggcact gtacaatctc tatccaggtg tttttgagac agtggaaatg 3360
ttaccatcca aagctggaat ttggcgggtg gaatgcctta ttggcgagca tctacatgct 3420
gggatgagca cactttttct ggtgtacagc aataagtgtc agactcccct gggaatggct 3480
tctggacaca ttagagattt tcagattaca gcttcaggac aatatggaca gtgggcccca 3540
aagctggcca gacttcatta ttccggatca atcaatgcct ggagcaccaa ggagcccttt 3600
tcttggatca aggtggatct gttggcacca atgattattc acggcatcaa gacccagggt 3660
gcccgtcaga agttctccag cctctacatc tctcagttta tcatcatgta tagtcttgat 3720
gggaagaagt ggcagactta tcgaggaaat tccactggaa ccttaatggt cttctttggc 3780
aatgtggatt catctgggat aaaacacaat atttttaacc ctccaattat tgctcgatac 3840
atccgtttgc acccaactca ttatagcatt cgcagcactc ttcgcatgga gttgatgggc 3900
tgtgatttaa atagttgcag catgccattg ggaatggaga gtaaagcaat atcagatgca 3960
cagattactg cttcatccta ctttaccaat atgtttgcca cctggtctcc ttcaaaagct 4020
cgacttcacc tccaagggag gagtaatgcc tggagacctc aggtgaataa tccaaaagag 4080
tggctgcaag tggacttcca gaagacaatg aaagtcacag gagtaactac tcagggagta 4140
aaatctctgc ttaccagcat gtatgtgaag gagttcctca tctccagcag tcaagatggc 4200
catcagtgga ctctcttttt tcagaatggc aaagtaaagg tttttcaggg aaatcaagac 4260
tccttcacac ctgtggtgaa ctctctagac ccaccgttac tgactcgcta ccttcgaatt 4320
cacccccaga gttgggtgca ccagattgcc ctgaggatgg aggttctggg ctgcgaggca 4380
caggacctct actag 4395
<210> 10
<211> 4395
<212> DNA
<213> Artificial sequence
<400> 10
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgacc tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca cccttctcaa aacccaccag tcttgaaacg ccatcaacgg 2340
gaaataactc gtactactct tcagtcagat caagaggaaa ttgactatga tgataccata 2400
tcagttgaaa tgaagaagga agattttgac atttatgatg aggatgaaaa tcagagcccc 2460
cgcagctttc aaaagaaaac acgacactat tttattgctg cagtggagag gctctgggat 2520
tatgggatga gtagctcccc acatgttcta agaaacaggg ctcagagtgg cagtgtccct 2580
cagttcaaga aagttgtttt ccaggaattt actgatggct cctttactca gcccttatac 2640
cgtggagaac taaatgaaca tttgggactc ctggggccat atataagagc agaagttgaa 2700
gataatatca tggtaacttt cagaaatcag gcctctcgtc cctattcctt ctattctagc 2760
cttatttctt atgaggaaga tcagaggcaa ggagcagaac ctagaaaaaa ctttgtcaag 2820
cctaatgaaa ccaaaactta cttttggaaa gtgcaacatc atatggcacc cactaaagat 2880
gagtttgact gcaaagcctg ggcttatttc tctgatgttg acctggaaaa agatgtgcac 2940
tcaggcctga ttggacccct tctggtctgc cacactaaca cactgaaccc tgctcatggg 3000
agacaagtga cagtacagga atttgctctg tttttcacca tctttgatga gaccaaaagc 3060
tggtacttca ctgaaaatat ggaaagaaac tgcagggctc cctgcaatat ccagatggaa 3120
gatcccactt ttaaagagaa ttatcgcttc catgcaatca atggctacat aatggataca 3180
ctacctggct tagtaatggc tcaggatcaa aggattcgat ggtatctgct cagcatgggc 3240
agcaatgaaa acatccattc tattcatttc agtggacatg tgttcactgt acgaaaaaaa 3300
gaggagtata aaatggcact gtacaatctc tatccaggtg tttttgagac agtggaaatg 3360
ttaccatcca aagctggaat ttggcgggtg gaatgcctta ttggcgagca tctacatgct 3420
gggatgagca cactttttct ggtgtacagc aataagtgtc agactcccct gggaatggct 3480
tctggacaca ttagagattt tcagattaca gcttcaggac aatatggaca gtgggcccca 3540
aagctggcca gacttcatta ttccggatca atcaatgcct ggagcaccaa ggagcccttt 3600
tcttggatca aggtggatct gttggcacca atgattattc acggcatcaa gacccagggt 3660
gcccgtcaga agttctccag cctctacatc tctcagttta tcatcatgta tagtcttgat 3720
gggaagaagt ggcagactta tcgaggaaat tccactggaa ccttaatggt cttctttggc 3780
aatgtggatt catctgggat aaaacacaat atttttaacc ctccaattat tgctcgatac 3840
atccgtttgc acccaactca ttatagcatt cgcagcactc ttcgcatgga gttgatgggc 3900
tgtgatttaa atagttgcag catgccattg ggaatggaga gtaaagcaat atcagatgca 3960
cagattactg cttcatccta ctttaccaat atgtttgcca cctggtctcc ttcaaaagct 4020
cgacttcacc tccaagggag gagtaatgcc tggagacctc aggtgaataa tccaaaagag 4080
tggctgcaag tggacttcca gaagacaatg aaagtcacag gagtaactac tcagggagta 4140
aaatctctgc ttaccagcat gtatgtgaag gagttcctca tctccagcag tcaagatggc 4200
catcagtgga ctctcttttt tcagaatggc aaagtaaagg tttttcaggg aaatcaagac 4260
tccttcacac ctgtggtgaa ctctctagac ccaccgttac tgactcgcta ccttcgaatt 4320
cacccccaga gttgggtgca ccagattgcc ctgaggatgg aggttctggg ctgcgaggca 4380
caggacctct actag 4395
<210> 11
<211> 4395
<212> DNA
<213> Artificial sequence
<400> 11
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgaca tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca cccttctcaa aacccaccag tcttgaaacg ccatcaacgg 2340
gaaataactc gtactactct tcagtcagat caagaggaaa ttgactatga tgataccata 2400
tcagttgaaa tgaagaagga agattttgac atttatgatg aggatgaaaa tcagagcccc 2460
cgcagctttc aaaagaaaac acgacactat tttattgctg cagtggagag gctctgggat 2520
tatgggatga gtagctcccc acatgttcta agaaacaggg ctcagagtgg cagtgtccct 2580
cagttcaaga aagttgtttt ccaggaattt actgatggct cctttactca gcccttatac 2640
cgtggagaac taaatgaaca tttgggactc ctggggccat atataagagc agaagttgaa 2700
gataatatca tggtaacttt cagaaatcag gcctctcgtc cctattcctt ctattctagc 2760
cttatttctt atgaggaaga tcagaggcaa ggagcagaac ctagaaaaaa ctttgtcaag 2820
cctaatgaaa ccaaaactta cttttggaaa gtgcaacatc atatggcacc cactaaagat 2880
gagtttgact gcaaagcctg ggcttatttc tctgatgttg acctggaaaa agatgtgcac 2940
tcaggcctga ttggacccct tctggtctgc cacactaaca cactgaaccc tgctcatggg 3000
agacaagtga cagtacagga atttgctctg tttttcacca tctttgatga gaccaaaagc 3060
tggtacttca ctgaaaatat ggaaagaaac tgcagggctc cctgcaatat ccagatggaa 3120
gatcccactt ttaaagagaa ttatcgcttc catgcaatca atggctacat aatggataca 3180
ctacctggct tagtaatggc tcaggatcaa aggattcgat ggtatctgct cagcatgggc 3240
agcaatgaaa acatccattc tattcatttc agtggacatg tgttcactgt acgaaaaaaa 3300
gaggagtata aaatggcact gtacaatctc tatccaggtg tttttgagac agtggaaatg 3360
ttaccatcca aagctggaat ttggcgggtg gaatgcctta ttggcgagca tctacatgct 3420
gggatgagca cactttttct ggtgtacagc aataagtgtc agactcccct gggaatggct 3480
tctggacaca ttagagattt tcagattaca gcttcaggac aatatggaca gtgggcccca 3540
aagctggcca gacttcatta ttccggatca atcaatgcct ggagcaccaa ggagcccttt 3600
tcttggatca aggtggatct gttggcacca atgattattc acggcatcaa gacccagggt 3660
gcccgtcaga agttctccag cctctacatc tctcagttta tcatcatgta tagtcttgat 3720
gggaagaagt ggcagactta tcgaggaaat tccactggaa ccttaatggt cttctttggc 3780
aatgtggatt catctgggat aaaacacaat atttttaacc ctccaattat tgctcgatac 3840
atccgtttgc acccaactca ttatagcatt cgcagcactc ttcgcatgga gttgatgggc 3900
tgtgatttaa atagttgcag catgccattg ggaatggaga gtaaagcaat atcagatgca 3960
cagattactg cttcatccta ctttaccaat atgtttgcca cctggtctcc ttcaaaagct 4020
cgacttcacc tccaagggag gagtaatgcc tggagacctc aggtgaataa tccaaaagag 4080
tggctgcaag tggacttcca gaagacaatg aaagtcacag gagtaactac tcagggagta 4140
aaatctctgc ttaccagcat gtatgtgaag gagttcctca tctccagcag tcaagatggc 4200
catcagtgga ctctcttttt tcagaatggc aaagtaaagg tttttcaggg aaatcaagac 4260
tccttcacac ctgtggtgaa ctctctagac ccaccgttac tgactcgcta ccttcgaatt 4320
cacccccaga gttgggtgca ccagattgcc ctgaggatgg aggttctggg ctgcgaggca 4380
caggacctct actag 4395
<210> 12
<211> 4395
<212> DNA
<213> Artificial sequence
<400> 12
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgacg tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca cccttctcaa aacccaccag tcttgaaacg ccatcaacgg 2340
gaaataactc gtactactct tcagtcagat caagaggaaa ttgactatga tgataccata 2400
tcagttgaaa tgaagaagga agattttgac atttatgatg aggatgaaaa tcagagcccc 2460
cgcagctttc aaaagaaaac acgacactat tttattgctg cagtggagag gctctgggat 2520
tatgggatga gtagctcccc acatgttcta agaaacaggg ctcagagtgg cagtgtccct 2580
cagttcaaga aagttgtttt ccaggaattt actgatggct cctttactca gcccttatac 2640
cgtggagaac taaatgaaca tttgggactc ctggggccat atataagagc agaagttgaa 2700
gataatatca tggtaacttt cagaaatcag gcctctcgtc cctattcctt ctattctagc 2760
cttatttctt atgaggaaga tcagaggcaa ggagcagaac ctagaaaaaa ctttgtcaag 2820
cctaatgaaa ccaaaactta cttttggaaa gtgcaacatc atatggcacc cactaaagat 2880
gagtttgact gcaaagcctg ggcttatttc tctgatgttg acctggaaaa agatgtgcac 2940
tcaggcctga ttggacccct tctggtctgc cacactaaca cactgaaccc tgctcatggg 3000
agacaagtga cagtacagga atttgctctg tttttcacca tctttgatga gaccaaaagc 3060
tggtacttca ctgaaaatat ggaaagaaac tgcagggctc cctgcaatat ccagatggaa 3120
gatcccactt ttaaagagaa ttatcgcttc catgcaatca atggctacat aatggataca 3180
ctacctggct tagtaatggc tcaggatcaa aggattcgat ggtatctgct cagcatgggc 3240
agcaatgaaa acatccattc tattcatttc agtggacatg tgttcactgt acgaaaaaaa 3300
gaggagtata aaatggcact gtacaatctc tatccaggtg tttttgagac agtggaaatg 3360
ttaccatcca aagctggaat ttggcgggtg gaatgcctta ttggcgagca tctacatgct 3420
gggatgagca cactttttct ggtgtacagc aataagtgtc agactcccct gggaatggct 3480
tctggacaca ttagagattt tcagattaca gcttcaggac aatatggaca gtgggcccca 3540
aagctggcca gacttcatta ttccggatca atcaatgcct ggagcaccaa ggagcccttt 3600
tcttggatca aggtggatct gttggcacca atgattattc acggcatcaa gacccagggt 3660
gcccgtcaga agttctccag cctctacatc tctcagttta tcatcatgta tagtcttgat 3720
gggaagaagt ggcagactta tcgaggaaat tccactggaa ccttaatggt cttctttggc 3780
aatgtggatt catctgggat aaaacacaat atttttaacc ctccaattat tgctcgatac 3840
atccgtttgc acccaactca ttatagcatt cgcagcactc ttcgcatgga gttgatgggc 3900
tgtgatttaa atagttgcag catgccattg ggaatggaga gtaaagcaat atcagatgca 3960
cagattactg cttcatccta ctttaccaat atgtttgcca cctggtctcc ttcaaaagct 4020
cgacttcacc tccaagggag gagtaatgcc tggagacctc aggtgaataa tccaaaagag 4080
tggctgcaag tggacttcca gaagacaatg aaagtcacag gagtaactac tcagggagta 4140
aaatctctgc ttaccagcat gtatgtgaag gagttcctca tctccagcag tcaagatggc 4200
catcagtgga ctctcttttt tcagaatggc aaagtaaagg tttttcaggg aaatcaagac 4260
tccttcacac ctgtggtgaa ctctctagac ccaccgttac tgactcgcta ccttcgaatt 4320
cacccccaga gttgggtgca ccagattgcc ctgaggatgg aggttctggg ctgcgaggca 4380
caggacctct actag 4395
<210> 13
<211> 4392
<212> DNA
<213> Artificial sequence
<400> 13
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgact tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaata ctactcttca gtcagatcaa gaggaaattg actatgatga taccatatca 2400
gttgaaatga agaaggaaga ttttgacatt tatgatgagg atgaaaatca gagcccccgc 2460
agctttcaaa agaaaacacg acactatttt attgctgcag tggagaggct ctgggattat 2520
gggatgagta gctccccaca tgttctaaga aacagggctc agagtggcag tgtccctcag 2580
ttcaagaaag ttgttttcca ggaatttact gatggctcct ttactcagcc cttataccgt 2640
ggagaactaa atgaacattt gggactcctg gggccatata taagagcaga agttgaagat 2700
aatatcatgg taactttcag aaatcaggcc tctcgtccct attccttcta ttctagcctt 2760
atttcttatg aggaagatca gaggcaagga gcagaaccta gaaaaaactt tgtcaagcct 2820
aatgaaacca aaacttactt ttggaaagtg caacatcata tggcacccac taaagatgag 2880
tttgactgca aagcctgggc ttatttctct gatgttgacc tggaaaaaga tgtgcactca 2940
ggcctgattg gaccccttct ggtctgccac actaacacac tgaaccctgc tcatgggaga 3000
caagtgacag tacaggaatt tgctctgttt ttcaccatct ttgatgagac caaaagctgg 3060
tacttcactg aaaatatgga aagaaactgc agggctccct gcaatatcca gatggaagat 3120
cccactttta aagagaatta tcgcttccat gcaatcaatg gctacataat ggatacacta 3180
cctggcttag taatggctca ggatcaaagg attcgatggt atctgctcag catgggcagc 3240
aatgaaaaca tccattctat tcatttcagt ggacatgtgt tcactgtacg aaaaaaagag 3300
gagtataaaa tggcactgta caatctctat ccaggtgttt ttgagacagt ggaaatgtta 3360
ccatccaaag ctggaatttg gcgggtggaa tgccttattg gcgagcatct acatgctggg 3420
atgagcacac tttttctggt gtacagcaat aagtgtcaga ctcccctggg aatggcttct 3480
ggacacatta gagattttca gattacagct tcaggacaat atggacagtg ggccccaaag 3540
ctggccagac ttcattattc cggatcaatc aatgcctgga gcaccaagga gcccttttct 3600
tggatcaagg tggatctgtt ggcaccaatg attattcacg gcatcaagac ccagggtgcc 3660
cgtcagaagt tctccagcct ctacatctct cagtttatca tcatgtatag tcttgatggg 3720
aagaagtggc agacttatcg aggaaattcc actggaacct taatggtctt ctttggcaat 3780
gtggattcat ctgggataaa acacaatatt tttaaccctc caattattgc tcgatacatc 3840
cgtttgcacc caactcatta tagcattcgc agcactcttc gcatggagtt gatgggctgt 3900
gatttaaata gttgcagcat gccattggga atggagagta aagcaatatc agatgcacag 3960
attactgctt catcctactt taccaatatg tttgccacct ggtctccttc aaaagctcga 4020
cttcacctcc aagggaggag taatgcctgg agacctcagg tgaataatcc aaaagagtgg 4080
ctgcaagtgg acttccagaa gacaatgaaa gtcacaggag taactactca gggagtaaaa 4140
tctctgctta ccagcatgta tgtgaaggag ttcctcatct ccagcagtca agatggccat 4200
cagtggactc tcttttttca gaatggcaaa gtaaaggttt ttcagggaaa tcaagactcc 4260
ttcacacctg tggtgaactc tctagaccca ccgttactga ctcgctacct tcgaattcac 4320
ccccagagtt gggtgcacca gattgccctg aggatggagg ttctgggctg cgaggcacag 4380
gacctctact ag 4392
<210> 14
<211> 4392
<212> DNA
<213> Artificial sequence
<400> 14
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgacc tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaata ctactcttca gtcagatcaa gaggaaattg actatgatga taccatatca 2400
gttgaaatga agaaggaaga ttttgacatt tatgatgagg atgaaaatca gagcccccgc 2460
agctttcaaa agaaaacacg acactatttt attgctgcag tggagaggct ctgggattat 2520
gggatgagta gctccccaca tgttctaaga aacagggctc agagtggcag tgtccctcag 2580
ttcaagaaag ttgttttcca ggaatttact gatggctcct ttactcagcc cttataccgt 2640
ggagaactaa atgaacattt gggactcctg gggccatata taagagcaga agttgaagat 2700
aatatcatgg taactttcag aaatcaggcc tctcgtccct attccttcta ttctagcctt 2760
atttcttatg aggaagatca gaggcaagga gcagaaccta gaaaaaactt tgtcaagcct 2820
aatgaaacca aaacttactt ttggaaagtg caacatcata tggcacccac taaagatgag 2880
tttgactgca aagcctgggc ttatttctct gatgttgacc tggaaaaaga tgtgcactca 2940
ggcctgattg gaccccttct ggtctgccac actaacacac tgaaccctgc tcatgggaga 3000
caagtgacag tacaggaatt tgctctgttt ttcaccatct ttgatgagac caaaagctgg 3060
tacttcactg aaaatatgga aagaaactgc agggctccct gcaatatcca gatggaagat 3120
cccactttta aagagaatta tcgcttccat gcaatcaatg gctacataat ggatacacta 3180
cctggcttag taatggctca ggatcaaagg attcgatggt atctgctcag catgggcagc 3240
aatgaaaaca tccattctat tcatttcagt ggacatgtgt tcactgtacg aaaaaaagag 3300
gagtataaaa tggcactgta caatctctat ccaggtgttt ttgagacagt ggaaatgtta 3360
ccatccaaag ctggaatttg gcgggtggaa tgccttattg gcgagcatct acatgctggg 3420
atgagcacac tttttctggt gtacagcaat aagtgtcaga ctcccctggg aatggcttct 3480
ggacacatta gagattttca gattacagct tcaggacaat atggacagtg ggccccaaag 3540
ctggccagac ttcattattc cggatcaatc aatgcctgga gcaccaagga gcccttttct 3600
tggatcaagg tggatctgtt ggcaccaatg attattcacg gcatcaagac ccagggtgcc 3660
cgtcagaagt tctccagcct ctacatctct cagtttatca tcatgtatag tcttgatggg 3720
aagaagtggc agacttatcg aggaaattcc actggaacct taatggtctt ctttggcaat 3780
gtggattcat ctgggataaa acacaatatt tttaaccctc caattattgc tcgatacatc 3840
cgtttgcacc caactcatta tagcattcgc agcactcttc gcatggagtt gatgggctgt 3900
gatttaaata gttgcagcat gccattggga atggagagta aagcaatatc agatgcacag 3960
attactgctt catcctactt taccaatatg tttgccacct ggtctccttc aaaagctcga 4020
cttcacctcc aagggaggag taatgcctgg agacctcagg tgaataatcc aaaagagtgg 4080
ctgcaagtgg acttccagaa gacaatgaaa gtcacaggag taactactca gggagtaaaa 4140
tctctgctta ccagcatgta tgtgaaggag ttcctcatct ccagcagtca agatggccat 4200
cagtggactc tcttttttca gaatggcaaa gtaaaggttt ttcagggaaa tcaagactcc 4260
ttcacacctg tggtgaactc tctagaccca ccgttactga ctcgctacct tcgaattcac 4320
ccccagagtt gggtgcacca gattgccctg aggatggagg ttctgggctg cgaggcacag 4380
gacctctact ag 4392
<210> 15
<211> 4392
<212> DNA
<213> Artificial sequence
<400> 15
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgaca tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaata ctactcttca gtcagatcaa gaggaaattg actatgatga taccatatca 2400
gttgaaatga agaaggaaga ttttgacatt tatgatgagg atgaaaatca gagcccccgc 2460
agctttcaaa agaaaacacg acactatttt attgctgcag tggagaggct ctgggattat 2520
gggatgagta gctccccaca tgttctaaga aacagggctc agagtggcag tgtccctcag 2580
ttcaagaaag ttgttttcca ggaatttact gatggctcct ttactcagcc cttataccgt 2640
ggagaactaa atgaacattt gggactcctg gggccatata taagagcaga agttgaagat 2700
aatatcatgg taactttcag aaatcaggcc tctcgtccct attccttcta ttctagcctt 2760
atttcttatg aggaagatca gaggcaagga gcagaaccta gaaaaaactt tgtcaagcct 2820
aatgaaacca aaacttactt ttggaaagtg caacatcata tggcacccac taaagatgag 2880
tttgactgca aagcctgggc ttatttctct gatgttgacc tggaaaaaga tgtgcactca 2940
ggcctgattg gaccccttct ggtctgccac actaacacac tgaaccctgc tcatgggaga 3000
caagtgacag tacaggaatt tgctctgttt ttcaccatct ttgatgagac caaaagctgg 3060
tacttcactg aaaatatgga aagaaactgc agggctccct gcaatatcca gatggaagat 3120
cccactttta aagagaatta tcgcttccat gcaatcaatg gctacataat ggatacacta 3180
cctggcttag taatggctca ggatcaaagg attcgatggt atctgctcag catgggcagc 3240
aatgaaaaca tccattctat tcatttcagt ggacatgtgt tcactgtacg aaaaaaagag 3300
gagtataaaa tggcactgta caatctctat ccaggtgttt ttgagacagt ggaaatgtta 3360
ccatccaaag ctggaatttg gcgggtggaa tgccttattg gcgagcatct acatgctggg 3420
atgagcacac tttttctggt gtacagcaat aagtgtcaga ctcccctggg aatggcttct 3480
ggacacatta gagattttca gattacagct tcaggacaat atggacagtg ggccccaaag 3540
ctggccagac ttcattattc cggatcaatc aatgcctgga gcaccaagga gcccttttct 3600
tggatcaagg tggatctgtt ggcaccaatg attattcacg gcatcaagac ccagggtgcc 3660
cgtcagaagt tctccagcct ctacatctct cagtttatca tcatgtatag tcttgatggg 3720
aagaagtggc agacttatcg aggaaattcc actggaacct taatggtctt ctttggcaat 3780
gtggattcat ctgggataaa acacaatatt tttaaccctc caattattgc tcgatacatc 3840
cgtttgcacc caactcatta tagcattcgc agcactcttc gcatggagtt gatgggctgt 3900
gatttaaata gttgcagcat gccattggga atggagagta aagcaatatc agatgcacag 3960
attactgctt catcctactt taccaatatg tttgccacct ggtctccttc aaaagctcga 4020
cttcacctcc aagggaggag taatgcctgg agacctcagg tgaataatcc aaaagagtgg 4080
ctgcaagtgg acttccagaa gacaatgaaa gtcacaggag taactactca gggagtaaaa 4140
tctctgctta ccagcatgta tgtgaaggag ttcctcatct ccagcagtca agatggccat 4200
cagtggactc tcttttttca gaatggcaaa gtaaaggttt ttcagggaaa tcaagactcc 4260
ttcacacctg tggtgaactc tctagaccca ccgttactga ctcgctacct tcgaattcac 4320
ccccagagtt gggtgcacca gattgccctg aggatggagg ttctgggctg cgaggcacag 4380
gacctctact ag 4392
<210> 16
<211> 4392
<212> DNA
<213> Artificial sequence
<400> 16
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgacg tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagcact aggcaaaagc aatttaatgc caccacaatt 2340
ccagaaaata ctactcttca gtcagatcaa gaggaaattg actatgatga taccatatca 2400
gttgaaatga agaaggaaga ttttgacatt tatgatgagg atgaaaatca gagcccccgc 2460
agctttcaaa agaaaacacg acactatttt attgctgcag tggagaggct ctgggattat 2520
gggatgagta gctccccaca tgttctaaga aacagggctc agagtggcag tgtccctcag 2580
ttcaagaaag ttgttttcca ggaatttact gatggctcct ttactcagcc cttataccgt 2640
ggagaactaa atgaacattt gggactcctg gggccatata taagagcaga agttgaagat 2700
aatatcatgg taactttcag aaatcaggcc tctcgtccct attccttcta ttctagcctt 2760
atttcttatg aggaagatca gaggcaagga gcagaaccta gaaaaaactt tgtcaagcct 2820
aatgaaacca aaacttactt ttggaaagtg caacatcata tggcacccac taaagatgag 2880
tttgactgca aagcctgggc ttatttctct gatgttgacc tggaaaaaga tgtgcactca 2940
ggcctgattg gaccccttct ggtctgccac actaacacac tgaaccctgc tcatgggaga 3000
caagtgacag tacaggaatt tgctctgttt ttcaccatct ttgatgagac caaaagctgg 3060
tacttcactg aaaatatgga aagaaactgc agggctccct gcaatatcca gatggaagat 3120
cccactttta aagagaatta tcgcttccat gcaatcaatg gctacataat ggatacacta 3180
cctggcttag taatggctca ggatcaaagg attcgatggt atctgctcag catgggcagc 3240
aatgaaaaca tccattctat tcatttcagt ggacatgtgt tcactgtacg aaaaaaagag 3300
gagtataaaa tggcactgta caatctctat ccaggtgttt ttgagacagt ggaaatgtta 3360
ccatccaaag ctggaatttg gcgggtggaa tgccttattg gcgagcatct acatgctggg 3420
atgagcacac tttttctggt gtacagcaat aagtgtcaga ctcccctggg aatggcttct 3480
ggacacatta gagattttca gattacagct tcaggacaat atggacagtg ggccccaaag 3540
ctggccagac ttcattattc cggatcaatc aatgcctgga gcaccaagga gcccttttct 3600
tggatcaagg tggatctgtt ggcaccaatg attattcacg gcatcaagac ccagggtgcc 3660
cgtcagaagt tctccagcct ctacatctct cagtttatca tcatgtatag tcttgatggg 3720
aagaagtggc agacttatcg aggaaattcc actggaacct taatggtctt ctttggcaat 3780
gtggattcat ctgggataaa acacaatatt tttaaccctc caattattgc tcgatacatc 3840
cgtttgcacc caactcatta tagcattcgc agcactcttc gcatggagtt gatgggctgt 3900
gatttaaata gttgcagcat gccattggga atggagagta aagcaatatc agatgcacag 3960
attactgctt catcctactt taccaatatg tttgccacct ggtctccttc aaaagctcga 4020
cttcacctcc aagggaggag taatgcctgg agacctcagg tgaataatcc aaaagagtgg 4080
ctgcaagtgg acttccagaa gacaatgaaa gtcacaggag taactactca gggagtaaaa 4140
tctctgctta ccagcatgta tgtgaaggag ttcctcatct ccagcagtca agatggccat 4200
cagtggactc tcttttttca gaatggcaaa gtaaaggttt ttcagggaaa tcaagactcc 4260
ttcacacctg tggtgaactc tctagaccca ccgttactga ctcgctacct tcgaattcac 4320
ccccagagtt gggtgcacca gattgccctg aggatggagg ttctgggctg cgaggcacag 4380
gacctctact ag 4392
<210> 17
<211> 4362
<212> DNA
<213> Artificial sequence
<400> 17
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttact tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaaggag atcacccgca cgaccctgca gtcggatcag 2340
gaggagattg attacgacga cacgatcagt gtggagatga agaaggagga cttcgacatc 2400
tacgacgaag atgaaaacca gtcccctcgg tccttccaaa agaagacccg gcactacttc 2460
atcgccgctg tggaacgcct gtgggactat ggaatgtctt ctagccctca cgttttgagg 2520
aaccgcgccc agtcgggcag cgtgccccag ttcaagaaag tggtgttcca ggagttcacc 2580
gacggctcct tcacccagcc actttaccgg ggcgagctca atgaacatct gggcctgctg 2640
ggaccctaca tcagggctga ggtggaggac aacatcatgg tgacattccg gaatcaggcc 2700
agcagaccat acagtttcta cagttcactc atctcctacg aggaggacca gcgccagggg 2760
gctgaacccc gtaagaactt cgtgaagcca aacgaaacaa agacctactt ctggaaggtc 2820
cagcaccaca tggcacctac caaggacgag ttcgattgca aggcctgggc ctacttctcc 2880
gacgtggacc tggagaaaga tgtgcacagc ggcctgattg gccctctgct ggtgtgtcac 2940
acgaacacac tcaaccctgc acacgggcgg caggtcactg tgcaggaatt cgccctgttc 3000
tttaccatct ttgatgagac gaagtcctgg tatttcaccg aaaacatgga gaggaactgc 3060
cgcgcaccct gcaacatcca gatggaagat ccgacattca aggagaacta ccggttccat 3120
gccatcaatg gctacatcat ggacaccctg cctggcctcg tgatggccca agaccagcgt 3180
atccgctggt atctgctgtc gatgggctcc aacgagaaca tccatagtat ccacttcagc 3240
gggcatgtct tcacggtgag gaaaaaggag gagtacaaga tggcactgta caacctctat 3300
cccggcgtgt tcgagaccgt ggagatgctg ccctccaagg ccggcatctg gagagtggaa 3360
tgcctgatcg gcgagcacct ccacgctggg atgtccacgc tgttcctcgt ttacagcaat 3420
aagtgccaga cccctctggg catggcgagc ggccacatcc gcgacttcca gattacagcc 3480
agcggccagt acggtcagtg ggctccaaag ctggcccgtc tgcactactc cggatccatc 3540
aacgcctggt ccaccaagga accgttctcc tggatcaaag tagacctgct agcccccatg 3600
atcattcacg gcatcaagac acaaggcgcc cgacagaagt tctcgagcct ctatatctcc 3660
cagttcatca tcatgtatag cctggacgga aagaagtggc agacttaccg cggaaactcg 3720
acagggaccc tgatggtatt cttcggtaac gtggacagct ccggaatcaa gcacaacatc 3780
ttcaacccac ccattatcgc ccgctacatc cgcctgcacc ccactcacta tagcattagg 3840
tccaccctgc gaatggagct catgggctgt gacctgaaca gctgtagcat gcccctcggc 3900
atggagtcta aggcgatctc cgacgcacag ataacggcat catcctactt taccaacatg 3960
ttcgctacct ggtccccctc caaggcccga ctccacctgc aagggagatc caacgcctgg 4020
cggccacagg tcaacaatcc caaggagtgg ctgcaagtgg actttcagaa aactatgaaa 4080
gtcaccggag tgaccacaca gggagtgaag tctctgctga ccagcatgta cgtgaaggag 4140
ttcctcatct ccagttcgca ggatggccac cagtggacgt tgttcttcca aaacggtaaa 4200
gtcaaagtct tccaagggaa ccaggacagc tttacacccg tcgtgaactc cctggacccc 4260
ccgcttctca ctagatacct ccgcatccac cctcagagct gggtgcacca gattgccctg 4320
cgcatggagg ttctggggtg tgaagcccag gacctgtact aa 4362
<210> 18
<211> 4362
<212> DNA
<213> Artificial sequence
<400> 18
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttacc tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaaggag atcacccgca cgaccctgca gtcggatcag 2340
gaggagattg attacgacga cacgatcagt gtggagatga agaaggagga cttcgacatc 2400
tacgacgaag atgaaaacca gtcccctcgg tccttccaaa agaagacccg gcactacttc 2460
atcgccgctg tggaacgcct gtgggactat ggaatgtctt ctagccctca cgttttgagg 2520
aaccgcgccc agtcgggcag cgtgccccag ttcaagaaag tggtgttcca ggagttcacc 2580
gacggctcct tcacccagcc actttaccgg ggcgagctca atgaacatct gggcctgctg 2640
ggaccctaca tcagggctga ggtggaggac aacatcatgg tgacattccg gaatcaggcc 2700
agcagaccat acagtttcta cagttcactc atctcctacg aggaggacca gcgccagggg 2760
gctgaacccc gtaagaactt cgtgaagcca aacgaaacaa agacctactt ctggaaggtc 2820
cagcaccaca tggcacctac caaggacgag ttcgattgca aggcctgggc ctacttctcc 2880
gacgtggacc tggagaaaga tgtgcacagc ggcctgattg gccctctgct ggtgtgtcac 2940
acgaacacac tcaaccctgc acacgggcgg caggtcactg tgcaggaatt cgccctgttc 3000
tttaccatct ttgatgagac gaagtcctgg tatttcaccg aaaacatgga gaggaactgc 3060
cgcgcaccct gcaacatcca gatggaagat ccgacattca aggagaacta ccggttccat 3120
gccatcaatg gctacatcat ggacaccctg cctggcctcg tgatggccca agaccagcgt 3180
atccgctggt atctgctgtc gatgggctcc aacgagaaca tccatagtat ccacttcagc 3240
gggcatgtct tcacggtgag gaaaaaggag gagtacaaga tggcactgta caacctctat 3300
cccggcgtgt tcgagaccgt ggagatgctg ccctccaagg ccggcatctg gagagtggaa 3360
tgcctgatcg gcgagcacct ccacgctggg atgtccacgc tgttcctcgt ttacagcaat 3420
aagtgccaga cccctctggg catggcgagc ggccacatcc gcgacttcca gattacagcc 3480
agcggccagt acggtcagtg ggctccaaag ctggcccgtc tgcactactc cggatccatc 3540
aacgcctggt ccaccaagga accgttctcc tggatcaaag tagacctgct agcccccatg 3600
atcattcacg gcatcaagac acaaggcgcc cgacagaagt tctcgagcct ctatatctcc 3660
cagttcatca tcatgtatag cctggacgga aagaagtggc agacttaccg cggaaactcg 3720
acagggaccc tgatggtatt cttcggtaac gtggacagct ccggaatcaa gcacaacatc 3780
ttcaacccac ccattatcgc ccgctacatc cgcctgcacc ccactcacta tagcattagg 3840
tccaccctgc gaatggagct catgggctgt gacctgaaca gctgtagcat gcccctcggc 3900
atggagtcta aggcgatctc cgacgcacag ataacggcat catcctactt taccaacatg 3960
ttcgctacct ggtccccctc caaggcccga ctccacctgc aagggagatc caacgcctgg 4020
cggccacagg tcaacaatcc caaggagtgg ctgcaagtgg actttcagaa aactatgaaa 4080
gtcaccggag tgaccacaca gggagtgaag tctctgctga ccagcatgta cgtgaaggag 4140
ttcctcatct ccagttcgca ggatggccac cagtggacgt tgttcttcca aaacggtaaa 4200
gtcaaagtct tccaagggaa ccaggacagc tttacacccg tcgtgaactc cctggacccc 4260
ccgcttctca ctagatacct ccgcatccac cctcagagct gggtgcacca gattgccctg 4320
cgcatggagg ttctggggtg tgaagcccag gacctgtact aa 4362
<210> 19
<211> 4362
<212> DNA
<213> Artificial sequence
<400> 19
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttaca tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaaggag atcacccgca cgaccctgca gtcggatcag 2340
gaggagattg attacgacga cacgatcagt gtggagatga agaaggagga cttcgacatc 2400
tacgacgaag atgaaaacca gtcccctcgg tccttccaaa agaagacccg gcactacttc 2460
atcgccgctg tggaacgcct gtgggactat ggaatgtctt ctagccctca cgttttgagg 2520
aaccgcgccc agtcgggcag cgtgccccag ttcaagaaag tggtgttcca ggagttcacc 2580
gacggctcct tcacccagcc actttaccgg ggcgagctca atgaacatct gggcctgctg 2640
ggaccctaca tcagggctga ggtggaggac aacatcatgg tgacattccg gaatcaggcc 2700
agcagaccat acagtttcta cagttcactc atctcctacg aggaggacca gcgccagggg 2760
gctgaacccc gtaagaactt cgtgaagcca aacgaaacaa agacctactt ctggaaggtc 2820
cagcaccaca tggcacctac caaggacgag ttcgattgca aggcctgggc ctacttctcc 2880
gacgtggacc tggagaaaga tgtgcacagc ggcctgattg gccctctgct ggtgtgtcac 2940
acgaacacac tcaaccctgc acacgggcgg caggtcactg tgcaggaatt cgccctgttc 3000
tttaccatct ttgatgagac gaagtcctgg tatttcaccg aaaacatgga gaggaactgc 3060
cgcgcaccct gcaacatcca gatggaagat ccgacattca aggagaacta ccggttccat 3120
gccatcaatg gctacatcat ggacaccctg cctggcctcg tgatggccca agaccagcgt 3180
atccgctggt atctgctgtc gatgggctcc aacgagaaca tccatagtat ccacttcagc 3240
gggcatgtct tcacggtgag gaaaaaggag gagtacaaga tggcactgta caacctctat 3300
cccggcgtgt tcgagaccgt ggagatgctg ccctccaagg ccggcatctg gagagtggaa 3360
tgcctgatcg gcgagcacct ccacgctggg atgtccacgc tgttcctcgt ttacagcaat 3420
aagtgccaga cccctctggg catggcgagc ggccacatcc gcgacttcca gattacagcc 3480
agcggccagt acggtcagtg ggctccaaag ctggcccgtc tgcactactc cggatccatc 3540
aacgcctggt ccaccaagga accgttctcc tggatcaaag tagacctgct agcccccatg 3600
atcattcacg gcatcaagac acaaggcgcc cgacagaagt tctcgagcct ctatatctcc 3660
cagttcatca tcatgtatag cctggacgga aagaagtggc agacttaccg cggaaactcg 3720
acagggaccc tgatggtatt cttcggtaac gtggacagct ccggaatcaa gcacaacatc 3780
ttcaacccac ccattatcgc ccgctacatc cgcctgcacc ccactcacta tagcattagg 3840
tccaccctgc gaatggagct catgggctgt gacctgaaca gctgtagcat gcccctcggc 3900
atggagtcta aggcgatctc cgacgcacag ataacggcat catcctactt taccaacatg 3960
ttcgctacct ggtccccctc caaggcccga ctccacctgc aagggagatc caacgcctgg 4020
cggccacagg tcaacaatcc caaggagtgg ctgcaagtgg actttcagaa aactatgaaa 4080
gtcaccggag tgaccacaca gggagtgaag tctctgctga ccagcatgta cgtgaaggag 4140
ttcctcatct ccagttcgca ggatggccac cagtggacgt tgttcttcca aaacggtaaa 4200
gtcaaagtct tccaagggaa ccaggacagc tttacacccg tcgtgaactc cctggacccc 4260
ccgcttctca ctagatacct ccgcatccac cctcagagct gggtgcacca gattgccctg 4320
cgcatggagg ttctggggtg tgaagcccag gacctgtact aa 4362
<210> 20
<211> 4362
<212> DNA
<213> Artificial sequence
<400> 20
atgcagatcg aactgagcac ttgcttcttc ctgtgtctcc tgcgcttttg cttctccgcc 60
acaaggagat actatctcgg tgccgtggag ctcagctggg actacatgca gagcgacttg 120
ggtgaactgc ctgtggacgc caggtttcca ccccgcgtgc ccaagagttt cccgttcaac 180
accagtgtcg tgtacaagaa aaccctcttc gtggaattca ccgaccacct gttcaacatc 240
gccaaaccgc gccctccctg gatggggctg ctcggcccga cgatccaggc tgaggtctat 300
gacacggtgg tgattaccct caagaacatg gctagccacc cggtgagcct gcacgccgtg 360
ggcgtgtcct attggaaagc gtccgagggt gcggagtacg atgaccagac ttcacagcgg 420
gagaaggaag acgacaaagt gttccccggg ggttcccaca cctatgtctg gcaggtcctg 480
aaggagaatg gtcctatggc ctccgaccca ttgtgcctca cctactctta cctaagccat 540
gtggatctcg tcaaggacct gaactcgggg ctgatcggcg ccctgctcgt gtgccgggag 600
ggctcactgg ccaaggagaa gacccaaact ctgcacaagt tcatcctgct gttcgcggta 660
ttcgacgagg ggaagtcctg gcactccgag accaagaaca gcctgatgca ggaccgcgac 720
gcagcctcgg cccgtgcgtg gccaaagatg cacaccgtga acggctacgt taacaggagc 780
ctacccggcc tgatcggctg ccaccgcaaa tcggtctact ggcatgtgat cggaatgggc 840
acaacgcccg aggtccacag tatcttcctc gagggccaca ctttcctggt ccggaatcac 900
cgccaggcca gcctggagat cagccccata acctttctga cggcgcagac cttactcatg 960
gatctcggcc agttcctcct gttctgccac atttcgtccc accagcacga tgggatggaa 1020
gcatatgtga aagtggactc ctgccccgag gaaccccagc ttaggatgaa gaacaatgag 1080
gaggccgagg actacgacga tgaccttacc gattcagaaa tggacgtagt acgctttgac 1140
gacgacaact ctccatcctt catacagatt cgctccgtcg ccaagaagca ccctaagact 1200
tgggtgcact acatcgcggc cgaggaggag gactgggatt atgctcccct ggtgctggcc 1260
cccgacgacc gcagctacaa gagccagtac ctgaataacg ggccccagcg catcggccgg 1320
aagtacaaga aagtgcggtt catggcttac acggacgaga ccttcaagac ccgggaggct 1380
atccagcatg agagcggcat cttggggccc ctcctgtacg gcgaagttgg agacacactg 1440
ctgatcatct tcaagaacca ggcgagcagg ccctacaaca tctaccccca cggcattacc 1500
gatgtccggc cgttgtacag ccgacggctg cccaagggcg tgaagcacct gaaggacttt 1560
ccgatcctgc cgggcgagat cttcaagtac aagtggactg tgaccgtgga ggatgggccg 1620
accaagagcg atccgcgctg cctgacccgt tactactcca gctttgtcaa tatggagcgc 1680
gacctcgcta gcggcttgat tggccctctg ctgatctgct acaaggagtc cgtggaccag 1740
agggggaatc agatcatgag tgacaagagg aacgtgatcc tgttctccgt gttcgacgaa 1800
aaccgcagct ggtatctcac cgagaatatc cagcgcttcc tgcccaaccc ggccggtgtg 1860
cagctggagg accccgagtt tcaggccagc aacatcatgc attctatcaa cggatatgtg 1920
tttgattccc tgcagctctc agtgtgtctg cacgaggtcg cctactggta tatcctcagc 1980
attggggcac agaccgactt cctgagcgtg ttcttctccg ggtatacctt caagcacaag 2040
atggtgtacg aggataccct gaccctgttc ccctttagcg gcgaaaccgt gtttatgtct 2100
atggagaacc ccgggctctg gatccttacg tgccataact ccgacttccg caaccgcgga 2160
atgaccgcgc tcctgaaagt gtcgagttgt gacaagaaca ccggcgacta ttacgaggac 2220
agttacgagg acatctctgc gtacctcctt agcaagaata acgccatcga gccaagatcc 2280
ttcagccaga accccccagt gctgaaggag atcacccgca cgaccctgca gtcggatcag 2340
gaggagattg attacgacga cacgatcagt gtggagatga agaaggagga cttcgacatc 2400
tacgacgaag atgaaaacca gtcccctcgg tccttccaaa agaagacccg gcactacttc 2460
atcgccgctg tggaacgcct gtgggactat ggaatgtctt ctagccctca cgttttgagg 2520
aaccgcgccc agtcgggcag cgtgccccag ttcaagaaag tggtgttcca ggagttcacc 2580
gacggctcct tcacccagcc actttaccgg ggcgagctca atgaacatct gggcctgctg 2640
ggaccctaca tcagggctga ggtggaggac aacatcatgg tgacattccg gaatcaggcc 2700
agcagaccat acagtttcta cagttcactc atctcctacg aggaggacca gcgccagggg 2760
gctgaacccc gtaagaactt cgtgaagcca aacgaaacaa agacctactt ctggaaggtc 2820
cagcaccaca tggcacctac caaggacgag ttcgattgca aggcctgggc ctacttctcc 2880
gacgtggacc tggagaaaga tgtgcacagc ggcctgattg gccctctgct ggtgtgtcac 2940
acgaacacac tcaaccctgc acacgggcgg caggtcactg tgcaggaatt cgccctgttc 3000
tttaccatct ttgatgagac gaagtcctgg tatttcaccg aaaacatgga gaggaactgc 3060
cgcgcaccct gcaacatcca gatggaagat ccgacattca aggagaacta ccggttccat 3120
gccatcaatg gctacatcat ggacaccctg cctggcctcg tgatggccca agaccagcgt 3180
atccgctggt atctgctgtc gatgggctcc aacgagaaca tccatagtat ccacttcagc 3240
gggcatgtct tcacggtgag gaaaaaggag gagtacaaga tggcactgta caacctctat 3300
cccggcgtgt tcgagaccgt ggagatgctg ccctccaagg ccggcatctg gagagtggaa 3360
tgcctgatcg gcgagcacct ccacgctggg atgtccacgc tgttcctcgt ttacagcaat 3420
aagtgccaga cccctctggg catggcgagc ggccacatcc gcgacttcca gattacagcc 3480
agcggccagt acggtcagtg ggctccaaag ctggcccgtc tgcactactc cggatccatc 3540
aacgcctggt ccaccaagga accgttctcc tggatcaaag tagacctgct agcccccatg 3600
atcattcacg gcatcaagac acaaggcgcc cgacagaagt tctcgagcct ctatatctcc 3660
cagttcatca tcatgtatag cctggacgga aagaagtggc agacttaccg cggaaactcg 3720
acagggaccc tgatggtatt cttcggtaac gtggacagct ccggaatcaa gcacaacatc 3780
ttcaacccac ccattatcgc ccgctacatc cgcctgcacc ccactcacta tagcattagg 3840
tccaccctgc gaatggagct catgggctgt gacctgaaca gctgtagcat gcccctcggc 3900
atggagtcta aggcgatctc cgacgcacag ataacggcat catcctactt taccaacatg 3960
ttcgctacct ggtccccctc caaggcccga ctccacctgc aagggagatc caacgcctgg 4020
cggccacagg tcaacaatcc caaggagtgg ctgcaagtgg actttcagaa aactatgaaa 4080
gtcaccggag tgaccacaca gggagtgaag tctctgctga ccagcatgta cgtgaaggag 4140
ttcctcatct ccagttcgca ggatggccac cagtggacgt tgttcttcca aaacggtaaa 4200
gtcaaagtct tccaagggaa ccaggacagc tttacacccg tcgtgaactc cctggacccc 4260
ccgcttctca ctagatacct ccgcatccac cctcagagct gggtgcacca gattgccctg 4320
cgcatggagg ttctggggtg tgaagcccag gacctgtact aa 4362
<210> 21
<211> 2351
<212> PRT
<213> Artificial sequence
<400> 21
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Ile Leu Thr Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Ser Arg His Pro
755 760 765
Ser Thr Arg Gln Lys Gln Phe Asn Ala Thr Thr Ile Pro Glu Asn Asp
770 775 780
Ile Glu Lys Thr Asp Pro Trp Phe Ala His Arg Thr Pro Met Pro Lys
785 790 795 800
Ile Gln Asn Val Ser Ser Ser Asp Leu Leu Met Leu Leu Arg Gln Ser
805 810 815
Pro Thr Pro His Gly Leu Ser Leu Ser Asp Leu Gln Glu Ala Lys Tyr
820 825 830
Glu Thr Phe Ser Asp Asp Pro Ser Pro Gly Ala Ile Asp Ser Asn Asn
835 840 845
Ser Leu Ser Glu Met Thr His Phe Arg Pro Gln Leu His His Ser Gly
850 855 860
Asp Met Val Phe Thr Pro Glu Ser Gly Leu Gln Leu Arg Leu Asn Glu
865 870 875 880
Lys Leu Gly Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys
885 890 895
Val Ser Ser Thr Ser Asn Asn Leu Ile Ser Thr Ile Pro Ser Asp Asn
900 905 910
Leu Ala Ala Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met
915 920 925
Pro Val His Tyr Asp Ser Gln Leu Asp Thr Thr Leu Phe Gly Lys Lys
930 935 940
Ser Ser Pro Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Glu Glu
945 950 955 960
Asn Asn Asp Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gln Glu
965 970 975
Ser Ser Trp Gly Lys Asn Val Ser Ser Thr Glu Ser Gly Arg Leu Phe
980 985 990
Lys Gly Lys Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn Ala
995 1000 1005
Leu Phe Lys Val Ser Ile Ser Leu Leu Lys Thr Asn Lys Thr Ser
1010 1015 1020
Asn Asn Ser Ala Thr Asn Arg Lys Thr His Ile Asp Gly Pro Ser
1025 1030 1035
Leu Leu Ile Glu Asn Ser Pro Ser Val Trp Gln Asn Ile Leu Glu
1040 1045 1050
Ser Asp Thr Glu Phe Lys Lys Val Thr Pro Leu Ile His Asp Arg
1055 1060 1065
Met Leu Met Asp Lys Asn Ala Thr Ala Leu Arg Leu Asn His Met
1070 1075 1080
Ser Asn Lys Thr Thr Ser Ser Lys Asn Met Glu Met Val Gln Gln
1085 1090 1095
Lys Lys Glu Gly Pro Ile Pro Pro Asp Ala Gln Asn Pro Asp Met
1100 1105 1110
Ser Phe Phe Lys Met Leu Phe Leu Pro Glu Ser Ala Arg Trp Ile
1115 1120 1125
Gln Arg Thr His Gly Lys Asn Ser Leu Asn Ser Gly Gln Gly Pro
1130 1135 1140
Ser Pro Lys Gln Leu Val Ser Leu Gly Pro Glu Lys Ser Val Glu
1145 1150 1155
Gly Gln Asn Phe Leu Ser Glu Lys Asn Lys Val Val Val Gly Lys
1160 1165 1170
Gly Glu Phe Thr Lys Asp Val Gly Leu Lys Glu Met Val Phe Pro
1175 1180 1185
Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn Leu His Glu
1190 1195 1200
Asn Asn Thr His Asn Gln Glu Lys Lys Ile Gln Glu Glu Ile Glu
1205 1210 1215
Lys Lys Glu Thr Leu Ile Gln Glu Asn Val Val Leu Pro Gln Ile
1220 1225 1230
His Thr Val Thr Gly Thr Lys Asn Phe Met Lys Asn Leu Phe Leu
1235 1240 1245
Leu Ser Thr Arg Gln Asn Val Glu Gly Ser Tyr Asp Gly Ala Tyr
1250 1255 1260
Ala Pro Val Leu Gln Asp Phe Arg Ser Leu Asn Asp Ser Thr Asn
1265 1270 1275
Arg Thr Lys Lys His Thr Ala His Phe Ser Lys Lys Gly Glu Glu
1280 1285 1290
Glu Asn Leu Glu Gly Leu Gly Asn Gln Thr Lys Gln Ile Val Glu
1295 1300 1305
Lys Tyr Ala Cys Thr Thr Arg Ile Ser Pro Asn Thr Ser Gln Gln
1310 1315 1320
Asn Phe Val Thr Gln Arg Ser Lys Arg Ala Leu Lys Gln Phe Arg
1325 1330 1335
Leu Pro Leu Glu Glu Thr Glu Leu Glu Lys Arg Ile Ile Val Asp
1340 1345 1350
Asp Thr Ser Thr Gln Trp Ser Lys Asn Met Lys His Leu Thr Pro
1355 1360 1365
Ser Thr Leu Thr Gln Ile Asp Tyr Asn Glu Lys Glu Lys Gly Ala
1370 1375 1380
Ile Thr Gln Ser Pro Leu Ser Asp Cys Leu Thr Arg Ser His Ser
1385 1390 1395
Ile Pro Gln Ala Asn Arg Ser Pro Leu Pro Ile Ala Lys Val Ser
1400 1405 1410
Ser Phe Pro Ser Ile Arg Pro Ile Tyr Leu Thr Arg Val Leu Phe
1415 1420 1425
Gln Asp Asn Ser Ser His Leu Pro Ala Ala Ser Tyr Arg Lys Lys
1430 1435 1440
Asp Ser Gly Val Gln Glu Ser Ser His Phe Leu Gln Gly Ala Lys
1445 1450 1455
Lys Asn Asn Leu Ser Leu Ala Ile Leu Thr Leu Glu Met Thr Gly
1460 1465 1470
Asp Gln Arg Glu Val Gly Ser Leu Gly Thr Ser Ala Thr Asn Ser
1475 1480 1485
Val Thr Tyr Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp
1490 1495 1500
Leu Pro Lys Thr Ser Gly Lys Val Glu Leu Leu Pro Lys Val His
1505 1510 1515
Ile Tyr Gln Lys Asp Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser
1520 1525 1530
Pro Gly His Leu Asp Leu Val Glu Gly Ser Leu Leu Gln Gly Thr
1535 1540 1545
Glu Gly Ala Ile Lys Trp Asn Glu Ala Asn Arg Pro Gly Lys Val
1550 1555 1560
Pro Phe Leu Arg Val Ala Thr Glu Ser Ser Ala Lys Thr Pro Ser
1565 1570 1575
Lys Leu Leu Asp Pro Leu Ala Trp Asp Asn His Tyr Gly Thr Gln
1580 1585 1590
Ile Pro Lys Glu Glu Trp Lys Ser Gln Glu Lys Ser Pro Glu Lys
1595 1600 1605
Thr Ala Phe Lys Lys Lys Asp Thr Ile Leu Ser Leu Asn Ala Cys
1610 1615 1620
Glu Ser Asn His Ala Ile Ala Ala Ile Asn Glu Gly Gln Asn Lys
1625 1630 1635
Pro Glu Ile Glu Val Thr Trp Ala Lys Gln Gly Arg Thr Glu Arg
1640 1645 1650
Leu Cys Ser Gln Asn Pro Pro Val Leu Lys Arg His Gln Arg Glu
1655 1660 1665
Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp Tyr
1670 1675 1680
Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp Ile
1685 1690 1695
Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys
1700 1705 1710
Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr
1715 1720 1725
Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Ser
1730 1735 1740
Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr
1745 1750 1755
Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu
1760 1765 1770
His Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp
1775 1780 1785
Asn Ile Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser
1790 1795 1800
Phe Tyr Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly
1805 1810 1815
Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr
1820 1825 1830
Tyr Phe Trp Lys Val Gln His His Met Ala Pro Thr Lys Asp Glu
1835 1840 1845
Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu
1850 1855 1860
Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu Val Cys His
1865 1870 1875
Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln Val Thr Val Gln
1880 1885 1890
Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser Trp
1895 1900 1905
Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn
1910 1915 1920
Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His
1925 1930 1935
Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met
1940 1945 1950
Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser
1955 1960 1965
Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Thr
1970 1975 1980
Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr
1985 1990 1995
Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly
2000 2005 2010
Ile Trp Arg Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly
2015 2020 2025
Met Ser Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro
2030 2035 2040
Leu Gly Met Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala
2045 2050 2055
Ser Gly Gln Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His
2060 2065 2070
Tyr Ser Gly Ser Ile Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser
2075 2080 2085
Trp Ile Lys Val Asp Leu Leu Ala Pro Met Ile Ile His Gly Ile
2090 2095 2100
Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr Ile Ser
2105 2110 2115
Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp Gln Thr
2120 2125 2130
Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn
2135 2140 2145
Val Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile
2150 2155 2160
Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg
2165 2170 2175
Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys
2180 2185 2190
Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln
2195 2200 2205
Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser
2210 2215 2220
Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp
2225 2230 2235
Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe
2240 2245 2250
Gln Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys
2255 2260 2265
Ser Leu Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser
2270 2275 2280
Ser Gln Asp Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys
2285 2290 2295
Val Lys Val Phe Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val
2300 2305 2310
Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu Arg Ile His
2315 2320 2325
Pro Gln Ser Trp Val His Gln Ile Ala Leu Arg Met Glu Val Leu
2330 2335 2340
Gly Cys Glu Ala Gln Asp Leu Tyr
2345 2350
<210> 22
<211> 1457
<212> PRT
<213> Artificial sequence
<400> 22
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Ile Leu Thr Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu
755 760 765
Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln
770 775 780
Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu
785 790 795 800
Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe
805 810 815
Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp
820 825 830
Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln
835 840 845
Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr
850 855 860
Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His
865 870 875 880
Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile
885 890 895
Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser
900 905 910
Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg
915 920 925
Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val
930 935 940
Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp
945 950 955 960
Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu
965 970 975
Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His
980 985 990
Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe
995 1000 1005
Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn
1010 1015 1020
Cys Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys
1025 1030 1035
Glu Asn Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr
1040 1045 1050
Leu Pro Gly Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr
1055 1060 1065
Leu Leu Ser Met Gly Ser Asn Glu Asn Ile His Ser Ile His Phe
1070 1075 1080
Ser Gly His Val Phe Thr Val Arg Lys Lys Glu Glu Tyr Lys Met
1085 1090 1095
Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met
1100 1105 1110
Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Glu Cys Leu Ile Gly
1115 1120 1125
Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val Tyr Ser
1130 1135 1140
Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly His Ile Arg
1145 1150 1155
Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp Ala Pro
1160 1165 1170
Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp Ser
1175 1180 1185
Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro
1190 1195 1200
Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe
1205 1210 1215
Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp
1220 1225 1230
Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu
1235 1240 1245
Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn
1250 1255 1260
Ile Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro
1265 1270 1275
Thr His Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly
1280 1285 1290
Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys
1295 1300 1305
Ala Ile Ser Asp Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn
1310 1315 1320
Met Phe Ala Thr Trp Ser Pro Ser Lys Ala Arg Leu His Leu Gln
1325 1330 1335
Gly Arg Ser Asn Ala Trp Arg Pro Gln Val Asn Asn Pro Lys Glu
1340 1345 1350
Trp Leu Gln Val Asp Phe Gln Lys Thr Met Lys Val Thr Gly Val
1355 1360 1365
Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Met Tyr Val Lys
1370 1375 1380
Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln Trp Thr Leu
1385 1390 1395
Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp
1400 1405 1410
Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr
1415 1420 1425
Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Gln Ile Ala
1430 1435 1440
Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr
1445 1450 1455
<210> 23
<211> 1464
<212> PRT
<213> Artificial sequence
<400> 23
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Ile Leu Thr Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Ser Arg His Pro
755 760 765
Ser Gln Asn Pro Pro Val Leu Lys Arg His Gln Arg Glu Ile Thr Arg
770 775 780
Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp Tyr Asp Asp Thr Ile
785 790 795 800
Ser Val Glu Met Lys Lys Glu Asp Phe Asp Ile Tyr Asp Glu Asp Glu
805 810 815
Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys Thr Arg His Tyr Phe Ile
820 825 830
Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser Ser Ser Pro His
835 840 845
Val Leu Arg Asn Arg Ala Gln Ser Gly Ser Val Pro Gln Phe Lys Lys
850 855 860
Val Val Phe Gln Glu Phe Thr Asp Gly Ser Phe Thr Gln Pro Leu Tyr
865 870 875 880
Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly Pro Tyr Ile Arg
885 890 895
Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe Arg Asn Gln Ala Ser
900 905 910
Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln
915 920 925
Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn Glu Thr
930 935 940
Lys Thr Tyr Phe Trp Lys Val Gln His His Met Ala Pro Thr Lys Asp
945 950 955 960
Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu
965 970 975
Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu Val Cys His Thr
980 985 990
Asn Thr Leu Asn Pro Ala His Gly Arg Gln Val Thr Val Gln Glu Phe
995 1000 1005
Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser Trp Tyr Phe
1010 1015 1020
Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn Ile Gln
1025 1030 1035
Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala Ile
1040 1045 1050
Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met Ala Gln
1055 1060 1065
Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu
1070 1075 1080
Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg
1085 1090 1095
Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly
1100 1105 1110
Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Trp
1115 1120 1125
Arg Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser
1130 1135 1140
Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly
1145 1150 1155
Met Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly
1160 1165 1170
Gln Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser
1175 1180 1185
Gly Ser Ile Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile
1190 1195 1200
Lys Val Asp Leu Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr
1205 1210 1215
Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe
1220 1225 1230
Ile Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg
1235 1240 1245
Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val Asp
1250 1255 1260
Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile Ile Ala
1265 1270 1275
Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg Ser Thr
1280 1285 1290
Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met
1295 1300 1305
Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr
1310 1315 1320
Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser
1325 1330 1335
Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro
1340 1345 1350
Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Lys
1355 1360 1365
Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu
1370 1375 1380
Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln
1385 1390 1395
Asp Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys
1400 1405 1410
Val Phe Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser
1415 1420 1425
Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln
1430 1435 1440
Ser Trp Val His Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys
1445 1450 1455
Glu Ala Gln Asp Leu Tyr
1460
<210> 24
<211> 1463
<212> PRT
<213> Artificial sequence
<400> 24
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Ile Leu Thr Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Ser Arg His Pro
755 760 765
Ser Thr Arg Gln Lys Gln Phe Asn Ala Thr Thr Ile Pro Glu Asn Thr
770 775 780
Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser
785 790 795 800
Val Glu Met Lys Lys Glu Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn
805 810 815
Gln Ser Pro Arg Ser Phe Gln Lys Lys Thr Arg His Tyr Phe Ile Ala
820 825 830
Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser Ser Ser Pro His Val
835 840 845
Leu Arg Asn Arg Ala Gln Ser Gly Ser Val Pro Gln Phe Lys Lys Val
850 855 860
Val Phe Gln Glu Phe Thr Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg
865 870 875 880
Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala
885 890 895
Glu Val Glu Asp Asn Ile Met Val Thr Phe Arg Asn Gln Ala Ser Arg
900 905 910
Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg
915 920 925
Gln Gly Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn Glu Thr Lys
930 935 940
Thr Tyr Phe Trp Lys Val Gln His His Met Ala Pro Thr Lys Asp Glu
945 950 955 960
Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys
965 970 975
Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu Val Cys His Thr Asn
980 985 990
Thr Leu Asn Pro Ala His Gly Arg Gln Val Thr Val Gln Glu Phe Ala
995 1000 1005
Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr
1010 1015 1020
Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn Ile Gln Met
1025 1030 1035
Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala Ile Asn
1040 1045 1050
Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met Ala Gln Asp
1055 1060 1065
Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn
1070 1075 1080
Ile His Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg Lys
1085 1090 1095
Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val
1100 1105 1110
Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg
1115 1120 1125
Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser Thr
1130 1135 1140
Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met
1145 1150 1155
Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln
1160 1165 1170
Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly
1175 1180 1185
Ser Ile Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys
1190 1195 1200
Val Asp Leu Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln
1205 1210 1215
Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile
1220 1225 1230
Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly
1235 1240 1245
Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val Asp Ser
1250 1255 1260
Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile Ile Ala Arg
1265 1270 1275
Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg Ser Thr Leu
1280 1285 1290
Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met Pro
1295 1300 1305
Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr Ala
1310 1315 1320
Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys
1325 1330 1335
Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln
1340 1345 1350
Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr
1355 1360 1365
Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu Leu
1370 1375 1380
Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp
1385 1390 1395
Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val
1400 1405 1410
Phe Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu
1415 1420 1425
Asp Pro Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser
1430 1435 1440
Trp Val His Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu
1445 1450 1455
Ala Gln Asp Leu Tyr
1460
<210> 25
<211> 1453
<212> PRT
<213> Artificial sequence
<400> 25
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Ile Leu Thr Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu
755 760 765
Lys Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp
770 775 780
Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp Ile
785 790 795 800
Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys Thr
805 810 815
Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met
820 825 830
Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Ser Gly Ser Val
835 840 845
Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr Asp Gly Ser Phe
850 855 860
Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu
865 870 875 880
Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe
885 890 895
Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser
900 905 910
Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe Val
915 920 925
Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gln His His Met
930 935 940
Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser
945 950 955 960
Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu
965 970 975
Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln Val
980 985 990
Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys
995 1000 1005
Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro
1010 1015 1020
Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg
1025 1030 1035
Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu
1040 1045 1050
Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met
1055 1060 1065
Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val
1070 1075 1080
Phe Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn
1085 1090 1095
Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys
1100 1105 1110
Ala Gly Ile Trp Arg Val Glu Cys Leu Ile Gly Glu His Leu His
1115 1120 1125
Ala Gly Met Ser Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln
1130 1135 1140
Thr Pro Leu Gly Met Ala Ser Gly His Ile Arg Asp Phe Gln Ile
1145 1150 1155
Thr Ala Ser Gly Gln Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg
1160 1165 1170
Leu His Tyr Ser Gly Ser Ile Asn Ala Trp Ser Thr Lys Glu Pro
1175 1180 1185
Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro Met Ile Ile His
1190 1195 1200
Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr
1205 1210 1215
Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp
1220 1225 1230
Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe
1235 1240 1245
Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro
1250 1255 1260
Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser
1265 1270 1275
Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn
1280 1285 1290
Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp
1295 1300 1305
Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr
1310 1315 1320
Trp Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn
1325 1330 1335
Ala Trp Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val
1340 1345 1350
Asp Phe Gln Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly
1355 1360 1365
Val Lys Ser Leu Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile
1370 1375 1380
Ser Ser Ser Gln Asp Gly His Gln Trp Thr Leu Phe Phe Gln Asn
1385 1390 1395
Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp Ser Phe Thr Pro
1400 1405 1410
Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu Arg
1415 1420 1425
Ile His Pro Gln Ser Trp Val His Gln Ile Ala Leu Arg Met Glu
1430 1435 1440
Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr
1445 1450
<210> 26
<211> 1474
<212> PRT
<213> Artificial sequence
<400> 26
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Ile Leu Thr Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Ala Thr Asn Val
755 760 765
Ser Asn Asn Ser Asn Thr Ser Asn Asp Ser Asn Val Ser Pro Pro Val
770 775 780
Leu Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp
785 790 795 800
Gln Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys
805 810 815
Glu Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser
820 825 830
Phe Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu
835 840 845
Trp Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala
850 855 860
Gln Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe
865 870 875 880
Thr Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu
885 890 895
His Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn
900 905 910
Ile Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr
915 920 925
Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro
930 935 940
Arg Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys
945 950 955 960
Val Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala
965 970 975
Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly
980 985 990
Leu Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala
995 1000 1005
His Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr
1010 1015 1020
Ile Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu
1025 1030 1035
Arg Asn Cys Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr
1040 1045 1050
Phe Lys Glu Asn Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met
1055 1060 1065
Asp Thr Leu Pro Gly Leu Val Met Ala Gln Asp Gln Arg Ile Arg
1070 1075 1080
Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn Ile His Ser Ile
1085 1090 1095
His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys Glu Glu Tyr
1100 1105 1110
Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val
1115 1120 1125
Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Glu Cys Leu
1130 1135 1140
Ile Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val
1145 1150 1155
Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly His
1160 1165 1170
Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp
1175 1180 1185
Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala
1190 1195 1200
Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Leu
1205 1210 1215
Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln
1220 1225 1230
Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser
1235 1240 1245
Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly
1250 1255 1260
Thr Leu Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys
1265 1270 1275
His Asn Ile Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu
1280 1285 1290
His Pro Thr His Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu
1295 1300 1305
Met Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Glu
1310 1315 1320
Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr Ala Ser Ser Tyr Phe
1325 1330 1335
Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala Arg Leu His
1340 1345 1350
Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln Val Asn Asn Pro
1355 1360 1365
Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met Lys Val Thr
1370 1375 1380
Gly Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Met Tyr
1385 1390 1395
Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln Trp
1400 1405 1410
Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn
1415 1420 1425
Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu
1430 1435 1440
Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Gln
1445 1450 1455
Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu
1460 1465 1470
Tyr
<210> 27
<211> 1459
<212> PRT
<213> Artificial sequence
<400> 27
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Ile Leu Thr Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Ser Arg His Gln
755 760 765
Ala Tyr Arg Tyr Arg Arg Gly Glu Ile Thr Arg Thr Thr Leu Gln Ser
770 775 780
Asp Gln Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys
785 790 795 800
Lys Glu Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg
805 810 815
Ser Phe Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg
820 825 830
Leu Trp Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg
835 840 845
Ala Gln Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu
850 855 860
Phe Thr Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn
865 870 875 880
Glu His Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp
885 890 895
Asn Ile Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe
900 905 910
Tyr Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu
915 920 925
Pro Arg Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp
930 935 940
Lys Val Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys
945 950 955 960
Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser
965 970 975
Gly Leu Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro
980 985 990
Ala His Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr
995 1000 1005
Ile Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu
1010 1015 1020
Arg Asn Cys Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr
1025 1030 1035
Phe Lys Glu Asn Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met
1040 1045 1050
Asp Thr Leu Pro Gly Leu Val Met Ala Gln Asp Gln Arg Ile Arg
1055 1060 1065
Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn Ile His Ser Ile
1070 1075 1080
His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys Glu Glu Tyr
1085 1090 1095
Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val
1100 1105 1110
Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Glu Cys Leu
1115 1120 1125
Ile Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val
1130 1135 1140
Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly His
1145 1150 1155
Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp
1160 1165 1170
Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala
1175 1180 1185
Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Leu
1190 1195 1200
Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln
1205 1210 1215
Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser
1220 1225 1230
Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly
1235 1240 1245
Thr Leu Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys
1250 1255 1260
His Asn Ile Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu
1265 1270 1275
His Pro Thr His Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu
1280 1285 1290
Met Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Glu
1295 1300 1305
Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr Ala Ser Ser Tyr Phe
1310 1315 1320
Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala Arg Leu His
1325 1330 1335
Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln Val Asn Asn Pro
1340 1345 1350
Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met Lys Val Thr
1355 1360 1365
Gly Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Met Tyr
1370 1375 1380
Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln Trp
1385 1390 1395
Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn
1400 1405 1410
Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu
1415 1420 1425
Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Gln
1430 1435 1440
Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu
1445 1450 1455
Tyr
<210> 28
<211> 2133
<212> PRT
<213> Artificial sequence
<400> 28
Met Gln Leu Glu Leu Ser Thr Cys Val Phe Leu Cys Leu Leu Pro Leu
1 5 10 15
Gly Phe Ser Ala Ile Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Arg Gln Ser Glu Leu Leu Arg Glu Leu His Val Asp Thr
35 40 45
Arg Phe Pro Ala Thr Ala Pro Gly Ala Leu Pro Leu Gly Pro Ser Val
50 55 60
Leu Tyr Lys Lys Thr Val Phe Val Glu Phe Thr Asp Gln Leu Phe Ser
65 70 75 80
Val Ala Arg Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile
85 90 95
Gln Ala Glu Val Tyr Asp Thr Val Val Val Thr Leu Lys Asn Met Ala
100 105 110
Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Phe Trp Lys Ser
115 120 125
Ser Glu Gly Ala Glu Tyr Glu Asp His Thr Ser Gln Arg Glu Lys Glu
130 135 140
Asp Asp Lys Val Leu Pro Gly Lys Ser Gln Thr Tyr Val Trp Gln Val
145 150 155 160
Leu Lys Glu Asn Gly Pro Thr Ala Ser Asp Pro Pro Cys Leu Thr Tyr
165 170 175
Ser Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu
180 185 190
Ile Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Thr Arg Glu Arg
195 200 205
Thr Gln Asn Leu His Glu Phe Val Leu Leu Phe Ala Val Phe Asp Glu
210 215 220
Gly Lys Ser Trp His Ser Ala Arg Asn Asp Ser Trp Thr Arg Ala Met
225 230 235 240
Asp Pro Ala Pro Ala Arg Ala Gln Pro Ala Met His Thr Val Asn Gly
245 250 255
Tyr Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Lys Lys Ser
260 265 270
Val Tyr Trp His Val Ile Gly Met Gly Thr Ser Pro Glu Val His Ser
275 280 285
Ile Phe Leu Glu Gly His Thr Phe Leu Val Arg His His Arg Gln Ala
290 295 300
Ser Leu Glu Ile Ser Pro Leu Thr Phe Leu Thr Ala Gln Thr Phe Leu
305 310 315 320
Met Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His His
325 330 335
His Gly Gly Met Glu Ala His Val Arg Val Glu Ser Cys Ala Glu Glu
340 345 350
Pro Gln Leu Arg Arg Lys Ala Asp Glu Glu Glu Asp Tyr Asp Asp Asn
355 360 365
Leu Tyr Asp Ser Asp Met Asp Val Val Arg Leu Asp Gly Asp Asp Val
370 375 380
Ser Pro Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ser Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Ala Val Pro Ser Pro Ser Asp Arg Ser Tyr Lys Ser Leu Tyr Leu Asn
420 425 430
Ser Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Ala Arg Phe Val
435 440 445
Ala Tyr Thr Asp Val Thr Phe Lys Thr Arg Lys Ala Ile Pro Tyr Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Lys Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Ser Ala Leu His Pro Gly Arg Leu Leu Lys
500 505 510
Gly Trp Lys His Leu Lys Asp Met Pro Ile Leu Pro Gly Glu Thr Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Ser Ile Asn Leu Glu Lys
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Met Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Gln Ser Trp Tyr Leu Ala Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Asp Gly Leu Gln Pro Gln Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Val Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Val Leu Thr Cys His Asn Ser Asp Leu Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Tyr Ser Cys Asp Arg Asp Ile Gly Asp
725 730 735
Tyr Tyr Asp Asn Thr Tyr Glu Asp Ile Pro Gly Phe Leu Leu Ser Gly
740 745 750
Lys Asn Val Ile Glu Pro Arg Ser Phe Ala Gln Asn Ser Arg Pro Pro
755 760 765
Ser Ala Ser Gln Lys Gln Phe Gln Thr Ile Thr Ser Pro Glu Asp Asp
770 775 780
Val Glu Leu Asp Pro Gln Ser Gly Glu Arg Thr Gln Ala Leu Glu Glu
785 790 795 800
Leu Ser Val Pro Ser Gly Asp Gly Ser Met Leu Leu Gly Gln Asn Pro
805 810 815
Ala Pro His Gly Ser Ser Ser Ser Asp Leu Gln Glu Ala Arg Asn Glu
820 825 830
Ala Asp Asp Tyr Leu Pro Gly Ala Arg Glu Arg Asn Thr Ala Pro Ser
835 840 845
Ala Ala Ala Arg Leu Arg Pro Glu Leu His His Ser Ala Glu Arg Val
850 855 860
Leu Thr Pro Glu Pro Glu Lys Glu Leu Lys Lys Leu Asp Ser Lys Met
865 870 875 880
Ser Ser Ser Ser Asp Leu Leu Lys Thr Ser Pro Thr Ile Pro Ser Asp
885 890 895
Thr Leu Ser Ala Glu Thr Glu Arg Thr His Ser Leu Gly Pro Pro His
900 905 910
Pro Gln Val Asn Phe Arg Ser Gln Leu Gly Ala Ile Val Leu Gly Lys
915 920 925
Asn Ser Ser His Phe Ile Gly Ala Gly Val Pro Leu Gly Ser Thr Glu
930 935 940
Glu Asp His Glu Ser Ser Leu Gly Glu Asn Val Ser Pro Val Glu Ser
945 950 955 960
Asp Gly Ile Phe Glu Lys Glu Arg Ala His Gly Pro Ala Ser Leu Thr
965 970 975
Lys Asp Asp Val Leu Phe Lys Val Asn Ile Ser Leu Val Lys Thr Asn
980 985 990
Lys Ala Arg Val Tyr Leu Lys Thr Asn Arg Lys Ile His Ile Asp Asp
995 1000 1005
Ala Ala Leu Leu Thr Glu Asn Arg Ala Ser Ala Thr Phe Met Asp
1010 1015 1020
Lys Asn Thr Thr Ala Ser Gly Leu Asn His Val Ser Asn Trp Ile
1025 1030 1035
Lys Gly Pro Leu Gly Lys Asn Pro Leu Ser Ser Glu Arg Gly Pro
1040 1045 1050
Ser Pro Glu Leu Leu Thr Ser Ser Gly Ser Gly Lys Ser Val Lys
1055 1060 1065
Gly Gln Ser Ser Gly Gln Gly Arg Ile Arg Val Ala Val Glu Glu
1070 1075 1080
Glu Glu Leu Ser Lys Gly Lys Glu Met Met Leu Pro Asn Ser Glu
1085 1090 1095
Leu Thr Phe Leu Thr Asn Ser Ala Asp Val Gln Gly Asn Asp Thr
1100 1105 1110
His Ser Gln Gly Lys Lys Ser Arg Glu Glu Met Glu Arg Arg Glu
1115 1120 1125
Lys Leu Val Gln Glu Lys Val Asp Leu Pro Gln Val Tyr Thr Ala
1130 1135 1140
Thr Gly Thr Lys Asn Phe Leu Arg Asn Ile Phe His Gln Ser Thr
1145 1150 1155
Glu Pro Ser Val Glu Gly Phe Asp Gly Gly Ser His Ala Pro Val
1160 1165 1170
Pro Gln Asp Ser Arg Ser Leu Asn Asp Ser Ala Glu Arg Ala Glu
1175 1180 1185
Thr His Ile Ala His Phe Ser Ala Ile Arg Glu Glu Ala Pro Leu
1190 1195 1200
Glu Ala Pro Gly Asn Arg Thr Gly Pro Gly Pro Arg Ser Ala Val
1205 1210 1215
Pro Arg Arg Val Lys Gln Ser Leu Lys Gln Ile Arg Leu Pro Leu
1220 1225 1230
Glu Glu Ile Lys Pro Glu Arg Gly Val Val Leu Asn Ala Thr Ser
1235 1240 1245
Thr Arg Trp Ser Glu Ser Ser Pro Ile Leu Gln Gly Ala Lys Arg
1250 1255 1260
Asn Asn Leu Ser Leu Pro Phe Leu Thr Leu Glu Met Ala Gly Gly
1265 1270 1275
Gln Gly Lys Ile Ser Ala Leu Gly Lys Ser Ala Ala Gly Pro Leu
1280 1285 1290
Ala Ser Gly Lys Leu Glu Lys Ala Val Leu Ser Ser Ala Gly Leu
1295 1300 1305
Ser Glu Ala Ser Gly Lys Ala Glu Phe Leu Pro Lys Val Arg Val
1310 1315 1320
His Arg Glu Asp Leu Leu Pro Gln Lys Thr Ser Asn Val Ser Cys
1325 1330 1335
Ala His Gly Asp Leu Gly Gln Glu Ile Phe Leu Gln Lys Thr Arg
1340 1345 1350
Gly Pro Val Asn Leu Asn Lys Val Asn Arg Pro Gly Arg Thr Pro
1355 1360 1365
Ser Lys Leu Leu Gly Pro Pro Met Pro Lys Glu Trp Glu Ser Leu
1370 1375 1380
Glu Lys Ser Pro Lys Ser Thr Ala Leu Arg Thr Lys Asp Ile Ile
1385 1390 1395
Ser Leu Pro Leu Asp Arg His Glu Ser Asn His Ser Ile Ala Ala
1400 1405 1410
Lys Asn Glu Gly Gln Ala Glu Thr Gln Arg Glu Ala Ala Trp Thr
1415 1420 1425
Lys Gln Gly Gly Pro Gly Arg Leu Cys Ala Pro Lys Pro Pro Val
1430 1435 1440
Leu Arg Arg His Gln Arg Asp Ile Ser Leu Pro Thr Phe Gln Pro
1445 1450 1455
Glu Glu Asp Lys Met Asp Tyr Asp Asp Ile Phe Ser Thr Glu Thr
1460 1465 1470
Lys Gly Glu Asp Phe Asp Ile Tyr Gly Glu Asp Glu Asn Gln Asp
1475 1480 1485
Pro Arg Ser Phe Gln Lys Arg Thr Arg His Tyr Phe Ile Ala Ala
1490 1495 1500
Val Glu Gln Leu Trp Asp Tyr Gly Met Ser Glu Ser Pro Arg Ala
1505 1510 1515
Leu Arg Asn Arg Ala Gln Asn Gly Glu Val Pro Arg Phe Lys Lys
1520 1525 1530
Val Val Phe Arg Glu Phe Ala Asp Gly Ser Phe Thr Gln Pro Ser
1535 1540 1545
Tyr Arg Gly Glu Leu Asn Lys His Leu Gly Leu Leu Gly Pro Tyr
1550 1555 1560
Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe Lys Asn
1565 1570 1575
Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr
1580 1585 1590
Pro Asp Asp Gln Glu Gln Gly Ala Glu Pro Arg His Asn Phe Val
1595 1600 1605
Gln Pro Asn Glu Thr Arg Thr Tyr Phe Trp Lys Val Gln His His
1610 1615 1620
Met Ala Pro Thr Glu Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr
1625 1630 1635
Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ile
1640 1645 1650
Gly Pro Leu Leu Ile Cys Arg Ala Asn Thr Leu Asn Ala Ala His
1655 1660 1665
Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile
1670 1675 1680
Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Val Glu Arg
1685 1690 1695
Asn Cys Arg Ala Pro Cys His Leu Gln Met Glu Asp Pro Thr Leu
1700 1705 1710
Lys Glu Asn Tyr Arg Phe His Ala Ile Asn Gly Tyr Val Met Asp
1715 1720 1725
Thr Leu Pro Gly Leu Val Met Ala Gln Asn Gln Arg Ile Arg Trp
1730 1735 1740
Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn Ile His Ser Ile His
1745 1750 1755
Phe Ser Gly His Val Phe Ser Val Arg Lys Lys Glu Glu Tyr Lys
1760 1765 1770
Met Ala Val Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Glu
1775 1780 1785
Met Leu Pro Ser Lys Val Gly Ile Trp Arg Ile Glu Cys Leu Ile
1790 1795 1800
Gly Glu His Leu Gln Ala Gly Met Ser Thr Thr Phe Leu Val Tyr
1805 1810 1815
Ser Lys Glu Cys Gln Ala Pro Leu Gly Met Ala Ser Gly Arg Ile
1820 1825 1830
Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp Ala
1835 1840 1845
Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp
1850 1855 1860
Ser Thr Lys Asp Pro His Ser Trp Ile Lys Val Asp Leu Leu Ala
1865 1870 1875
Pro Met Ile Ile His Gly Ile Met Thr Gln Gly Ala Arg Gln Lys
1880 1885 1890
Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu
1895 1900 1905
Asp Gly Arg Asn Trp Gln Ser Tyr Arg Gly Asn Ser Thr Gly Thr
1910 1915 1920
Leu Met Val Phe Phe Gly Asn Val Asp Ala Ser Gly Ile Lys His
1925 1930 1935
Asn Ile Phe Asn Pro Pro Ile Val Ala Arg Tyr Ile Arg Leu His
1940 1945 1950
Pro Thr His Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met
1955 1960 1965
Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met Gln Asn
1970 1975 1980
Lys Ala Ile Ser Asp Ser Gln Ile Thr Ala Ser Ser His Leu Ser
1985 1990 1995
Asn Ile Phe Ala Thr Trp Ser Pro Ser Gln Ala Arg Leu His Leu
2000 2005 2010
Gln Gly Arg Thr Asn Ala Trp Arg Pro Arg Val Ser Ser Ala Glu
2015 2020 2025
Glu Trp Leu Gln Val Asp Leu Gln Lys Thr Val Lys Val Thr Gly
2030 2035 2040
Ile Thr Thr Gln Gly Val Lys Ser Leu Leu Ser Ser Met Tyr Val
2045 2050 2055
Lys Glu Phe Leu Val Ser Ser Ser Gln Asp Gly Arg Arg Trp Thr
2060 2065 2070
Leu Phe Leu Gln Asp Gly His Thr Lys Val Phe Gln Gly Asn Gln
2075 2080 2085
Asp Ser Ser Thr Pro Val Val Asn Ala Leu Asp Pro Pro Leu Phe
2090 2095 2100
Thr Arg Tyr Leu Arg Ile His Pro Thr Ser Trp Ala Gln His Ile
2105 2110 2115
Ala Leu Arg Leu Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr
2120 2125 2130
<210> 29
<211> 1466
<212> PRT
<213> Artificial sequence
<400> 29
Met Gln Leu Glu Leu Ser Thr Cys Val Phe Leu Cys Leu Leu Pro Leu
1 5 10 15
Gly Phe Ser Ala Ile Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30
Trp Asp Tyr Arg Gln Ser Glu Leu Leu Arg Glu Leu His Val Asp Thr
35 40 45
Arg Phe Pro Ala Thr Ala Pro Gly Ala Leu Pro Leu Gly Pro Ser Val
50 55 60
Leu Tyr Lys Lys Thr Val Phe Val Glu Phe Thr Asp Gln Leu Phe Ser
65 70 75 80
Val Ala Arg Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile
85 90 95
Gln Ala Glu Val Tyr Asp Thr Val Val Val Thr Leu Lys Asn Met Ala
100 105 110
Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Phe Trp Lys Ser
115 120 125
Ser Glu Gly Ala Glu Tyr Glu Asp His Thr Ser Gln Arg Glu Lys Glu
130 135 140
Asp Asp Lys Val Leu Pro Gly Lys Ser Gln Thr Tyr Val Trp Gln Val
145 150 155 160
Leu Lys Glu Asn Gly Pro Thr Ala Ser Asp Pro Pro Cys Leu Thr Tyr
165 170 175
Ser Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu
180 185 190
Ile Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Thr Arg Glu Arg
195 200 205
Thr Gln Asn Leu His Glu Phe Val Leu Leu Phe Ala Val Phe Asp Glu
210 215 220
Gly Lys Ser Trp His Ser Ala Arg Asn Asp Ser Trp Thr Arg Ala Met
225 230 235 240
Asp Pro Ala Pro Ala Arg Ala Gln Pro Ala Met His Thr Val Asn Gly
245 250 255
Tyr Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Lys Lys Ser
260 265 270
Val Tyr Trp His Val Ile Gly Met Gly Thr Ser Pro Glu Val His Ser
275 280 285
Ile Phe Leu Glu Gly His Thr Phe Leu Val Arg His His Arg Gln Ala
290 295 300
Ser Leu Glu Ile Ser Pro Leu Thr Phe Leu Thr Ala Gln Thr Phe Leu
305 310 315 320
Met Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His His
325 330 335
His Gly Gly Met Glu Ala His Val Arg Val Glu Ser Cys Ala Glu Glu
340 345 350
Pro Gln Leu Arg Arg Lys Ala Asp Glu Glu Glu Asp Tyr Asp Asp Asn
355 360 365
Leu Tyr Asp Ser Asp Met Asp Val Val Arg Leu Asp Gly Asp Asp Val
370 375 380
Ser Pro Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400
Trp Val His Tyr Ile Ser Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415
Ala Val Pro Ser Pro Ser Asp Arg Ser Tyr Lys Ser Leu Tyr Leu Asn
420 425 430
Ser Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Ala Arg Phe Val
435 440 445
Ala Tyr Thr Asp Val Thr Phe Lys Thr Arg Lys Ala Ile Pro Tyr Glu
450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480
Leu Ile Ile Phe Lys Asn Lys Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495
His Gly Ile Thr Asp Val Ser Ala Leu His Pro Gly Arg Leu Leu Lys
500 505 510
Gly Trp Lys His Leu Lys Asp Met Pro Ile Leu Pro Gly Glu Thr Phe
515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Ser Ile Asn Leu Glu Lys
545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Met Met Ser Asp Lys Arg Asn Val
580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Gln Ser Trp Tyr Leu Ala Glu
595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Asp Gly Leu Gln Pro Gln Asp
610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655
Tyr Ile Leu Ser Val Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700
Gly Leu Trp Val Leu Thr Cys His Asn Ser Asp Leu Arg Asn Arg Gly
705 710 715 720
Met Thr Ala Leu Leu Lys Val Tyr Ser Cys Asp Arg Asp Ile Gly Asp
725 730 735
Tyr Tyr Asp Asn Thr Tyr Glu Asp Ile Pro Gly Phe Leu Leu Ser Gly
740 745 750
Lys Asn Val Ile Glu Pro Arg Ser Phe Ser Gln Asn Ser Arg Pro Pro
755 760 765
Ser Ala Ser Ala Pro Lys Pro Pro Val Leu Arg His Gln Arg Asp Ile
770 775 780
Ser Leu Pro Thr Arg Gln Pro Glu Glu Asp Lys Met Asp Tyr Asp Asp
785 790 795 800
Ile Phe Ser Thr Glu Thr Lys Gly Glu Asp Phe Asp Ile Tyr Gly Glu
805 810 815
Asp Glu Asn Gln Asp Pro Arg Ser Phe Gln Lys Arg Thr Arg His Tyr
820 825 830
Phe Ile Ala Ala Val Glu Gln Leu Trp Asp Tyr Gly Met Ser Glu Ser
835 840 845
Pro Arg Ala Leu Arg Asn Arg Ala Gln Asn Gly Glu Val Pro Arg Phe
850 855 860
Lys Lys Val Val Phe Arg Glu Phe Ala Asp Gly Ser Phe Thr Gln Pro
865 870 875 880
Ser Tyr Arg Gly Glu Leu Asn Lys His Leu Gly Leu Leu Gly Pro Tyr
885 890 895
Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe Lys Asn Gln
900 905 910
Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr Pro Asp
915 920 925
Asp Gln Glu Gln Gly Ala Glu Pro Arg His Asn Phe Val Gln Pro Asn
930 935 940
Glu Thr Arg Thr Tyr Phe Trp Lys Val Gln His His Met Ala Pro Thr
945 950 955 960
Glu Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp
965 970 975
Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys
980 985 990
Arg Ala Asn Thr Leu Asn Ala Ala His Gly Arg Gln Val Thr Val Gln
995 1000 1005
Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser Trp
1010 1015 1020
Tyr Phe Thr Glu Asn Val Glu Arg Asn Cys Arg Ala Pro Cys His
1025 1030 1035
Leu Gln Met Glu Asp Pro Thr Leu Lys Glu Asn Tyr Arg Phe His
1040 1045 1050
Ala Ile Asn Gly Tyr Val Met Asp Thr Leu Pro Gly Leu Val Met
1055 1060 1065
Ala Gln Asn Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser
1070 1075 1080
Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Ser
1085 1090 1095
Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Val Tyr Asn Leu Tyr
1100 1105 1110
Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Val Gly
1115 1120 1125
Ile Trp Arg Ile Glu Cys Leu Ile Gly Glu His Leu Gln Ala Gly
1130 1135 1140
Met Ser Thr Thr Phe Leu Val Tyr Ser Lys Glu Cys Gln Ala Pro
1145 1150 1155
Leu Gly Met Ala Ser Gly Arg Ile Arg Asp Phe Gln Ile Thr Ala
1160 1165 1170
Ser Gly Gln Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His
1175 1180 1185
Tyr Ser Gly Ser Ile Asn Ala Trp Ser Thr Lys Asp Pro His Ser
1190 1195 1200
Trp Ile Lys Val Asp Leu Leu Ala Pro Met Ile Ile His Gly Ile
1205 1210 1215
Met Thr Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr Ile Ser
1220 1225 1230
Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Arg Asn Trp Gln Ser
1235 1240 1245
Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn
1250 1255 1260
Val Asp Ala Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile
1265 1270 1275
Val Ala Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg
1280 1285 1290
Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys
1295 1300 1305
Ser Met Pro Leu Gly Met Gln Asn Lys Ala Ile Ser Asp Ser Gln
1310 1315 1320
Ile Thr Ala Ser Ser His Leu Ser Asn Ile Phe Ala Thr Trp Ser
1325 1330 1335
Pro Ser Gln Ala Arg Leu His Leu Gln Gly Arg Thr Asn Ala Trp
1340 1345 1350
Arg Pro Arg Val Ser Ser Ala Glu Glu Trp Leu Gln Val Asp Leu
1355 1360 1365
Gln Lys Thr Val Lys Val Thr Gly Ile Thr Thr Gln Gly Val Lys
1370 1375 1380
Ser Leu Leu Ser Ser Met Tyr Val Lys Glu Phe Leu Val Ser Ser
1385 1390 1395
Ser Gln Asp Gly Arg Arg Trp Thr Leu Phe Leu Gln Asp Gly His
1400 1405 1410
Thr Lys Val Phe Gln Gly Asn Gln Asp Ser Ser Thr Pro Val Val
1415 1420 1425
Asn Ala Leu Asp Pro Pro Leu Phe Thr Arg Tyr Leu Arg Ile His
1430 1435 1440
Pro Thr Ser Trp Ala Gln His Ile Ala Leu Arg Leu Glu Val Leu
1445 1450 1455
Gly Cys Glu Ala Gln Asp Leu Tyr
1460 1465

Claims (9)

1. A nucleic acid encoding a variant Gly710Thr of a highly active factor viii or villa polypeptide, wherein:
(1) hF VIII cDNA: the nucleotide sequence is shown in any one of SEQ ID NO 1-4;
or (2) BDD hFVIII-SQ: the nucleotide sequence is shown as any one of SEQ ID NO 5-8;
or (3) BDD hFVIII-N8: the nucleotide sequence is shown in any one of SEQ ID NO 9-12;
or (4) BDD hFVIII-SC: the nucleotide sequence is shown in any one of SEQ ID NO 13-16;
or (5) BDDhFVIII- Δ F: the nucleotide sequence is shown in any one of SEQ ID NO 17-20.
2. A mutein of highly active factor VIII or VIIIa polypeptide variant Gly710Thr characterized in that: the amino acid sequence is shown in any one of SEQ ID NO 21-29, and the 710th amino acid of the mutant is Thr instead of Gly of wild type VIII or VIII a.
3. A nucleic acid encoding the mutein of claim 2 or a nucleic acid of the same length as the encoding nucleic acid and being completely complementary to the encoding nucleic acid.
4. A vector expressing the mutein of claim 2.
5. A method for preparing a mutant protein of a high-activity blood coagulation factor VIII or VIIIa polypeptide variant Gly710Thr, which comprises the following steps:
BDD-hFVIII or full-length FVIII is taken as a template, a PCR site-directed mutagenesis method is used for introducing mutation sites into wild FVIII, Gly710 amino acid residues are replaced by Thr, and BDD-hFVIII or full-length FVIII eukaryotic expression plasmids which are correctly inserted into Gly710Thr mutation are screened after transformation.
6. Use of a nucleic acid encoding a mutein of a highly active factor viii or villia polypeptide variant Gly710Thr, wherein: use of the nucleic acid of claim 1 or 3 for the preparation of a gene therapy drug comprising linking it to promoter and termination sequences to construct an expression plasmid, gene therapy viral or non-viral vector expressing the mutein of claim 2.
7. The use of a mutein of highly active factor viii or villa polypeptide variant Gly710Thr according to claim 2 wherein: is applied to preparing the recombinant protein therapeutic drug for hemophilia.
8. The use of a mutein of highly active factor viii or villa polypeptide variant Gly710Thr according to claim 2 wherein: the fusion protein is applied to preparing the fusion protein of the polypeptide variant Gly710Thr mutant of the blood coagulation factor VIII or VIII a, and the fusion protein is used for preparing the recombinant protein therapeutic drug for hemophilia.
9. Use according to claim 8, characterized in that: the fusion protein is obtained by fusing one of human albumin, immunoglobulin Fc, transferrin or alpha 1 antitrypsin with the mutant protein of the polypeptide variant Gly710Thr mutant of the blood coagulation factor VIII or VIII a.
CN202111654874.5A 2021-12-30 2021-12-30 High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr Pending CN114717239A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111654874.5A CN114717239A (en) 2021-12-30 2021-12-30 High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111654874.5A CN114717239A (en) 2021-12-30 2021-12-30 High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr

Publications (1)

Publication Number Publication Date
CN114717239A true CN114717239A (en) 2022-07-08

Family

ID=82235712

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111654874.5A Pending CN114717239A (en) 2021-12-30 2021-12-30 High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr

Country Status (1)

Country Link
CN (1) CN114717239A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105567735A (en) * 2016-01-05 2016-05-11 华东师范大学 Site specific repairing carrier system and method of blood coagulation factor genetic mutation
CN112126636A (en) * 2020-09-09 2020-12-25 上海交通大学医学院附属瑞金医院 High-activity blood coagulation factor XI mutant Ala570Thr
CN114196677A (en) * 2021-12-30 2022-03-18 广西医科大学第一附属医院 High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Ala

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105567735A (en) * 2016-01-05 2016-05-11 华东师范大学 Site specific repairing carrier system and method of blood coagulation factor genetic mutation
CN112126636A (en) * 2020-09-09 2020-12-25 上海交通大学医学院附属瑞金医院 High-activity blood coagulation factor XI mutant Ala570Thr
CN114196677A (en) * 2021-12-30 2022-03-18 广西医科大学第一附属医院 High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Ala

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JACKY CHI KI NGO 等: "Crystal Structure of Human Factor VIII: Implications for the Formation of the Factor IXa-Factor VIIIa Complex", 《STRUCTURE》, pages 597 - 606 *
鲁丹 等: "长效重组人凝血因子Ⅷ研究现状及进展", 《生物工程学报》, pages 34 - 43 *

Similar Documents

Publication Publication Date Title
AU693837B2 (en) Hybrid human/animal factor VIII
Kornalik et al. Prothrombin activation induced by Ecarin-a prothrombin converting enzyme from Echis carinatus venom
Mann et al. [23] Prothrombin
Mohammed et al. An update on factor XI structure and function
Bertina et al. The use of a functional and immunologic assay for plasma protein C in the study of the heterogeneity of congenital protein C deficiency
EP0756638B1 (en) Means for treating disorders in the blood coagulation cascade
Brockway et al. Characterization of native streptokinase and altered streptokinase isolated from a human plasminogen activator complex
US6531298B2 (en) Factor IX antihemophilic factor with increased clotting activity
CA2374675C (en) Modified factor viii
Fujikawa et al. Mechanism of activation of bovine factor IX (Christmas factor) by bovine factor XIa (activated plasma thromboplastin antecedent)
WO1993020093A1 (en) Hybrid human/porcine factor viii
Thorelli et al. Cleavage requirements for activation of factor V by factor Xa
KR100490612B1 (en) Modified factor ⅷ
JP2007312777A (en) Thrombin mutant
Henriksen et al. Identification of the primary structural defect in the dysthrombin thrombin Quick I: substitution of cysteine for arginine-382
CN113248594B (en) Recombinant blood coagulation factor VIII and application thereof
EP0272928B1 (en) Compositions and methods for the synthesis and assay of enkephalinase
AU2008205445A1 (en) Method for producing gamma-carboxlated proteins
Rabiet et al. Molecular defect of prothrombin Barcelona. Substitution of cysteine for arginine at residue 273.
Dahlback Factor V and protein S as cofactors to activated protein C
US20040249134A1 (en) Factor viii c2 domain variants
CN114196677A (en) High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Ala
Oehlin et al. Proteolytic formation and properties of a fragment of protein C containing the. gamma.-carboxyglutamic acid rich domain and the EGF-like region
US20040005670A1 (en) Compositions and methods for the treatment of hemophilia A
CN114717239A (en) High-activity blood coagulation factor VIII or VIII a polypeptide variant Gly710Thr

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination