CN111808863B - Codon-optimized coagulation factor VIII gene and construct thereof - Google Patents

Codon-optimized coagulation factor VIII gene and construct thereof

Info

Publication number
CN111808863B
Authority
CN
China
Prior art keywords
factor viii
coagulation factor
codon
optimized
gene
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010581446.3A
Other languages
Chinese (zh)
Other versions
CN111808863A (en)
Inventor
吴昊泉
党颖
苏玲玲
叶青
牛琦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kanglin Bio Tech Hangzhou Co ltd
Original Assignee
Kanglin Bio Tech Hangzhou Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kanglin Bio Tech Hangzhou Co ltd
Priority to CN202010581446.3A
Publication of CN111808863A
Application granted
Publication of CN111808863B
Legal status: Active
Anticipated expiration

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/745Blood coagulation or fibrinolysis factors
    • C07K14/755Factors VIII, e.g. factor VIII C (AHF), factor VIII Ag (VWF)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/17Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • A61K38/36Blood coagulation or fibrinolysis factors
    • A61K38/37Factors VIII
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P7/00Drugs for disorders of the blood or the extracellular fluid
    • A61P7/04Antihaemorrhagics; Procoagulants; Haemostatic agents; Antifibrinolytic agents
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N7/00Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2740/00Reverse transcribing RNA viruses
    • C12N2740/00011Details
    • C12N2740/10011Retroviridae
    • C12N2740/15011Lentivirus, not HIV, e.g. FIV, SIV
    • C12N2740/15021Viruses as such, e.g. new isolates, mutants or their genomic sequences
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2740/00Reverse transcribing RNA viruses
    • C12N2740/00011Details
    • C12N2740/10011Retroviridae
    • C12N2740/15011Lentivirus, not HIV, e.g. FIV, SIV
    • C12N2740/15041Use of virus, viral particle or viral elements as a vector
    • C12N2740/15043Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/22Vectors comprising a coding region that has been codon optimised for expression in a respective host

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Hematology (AREA)
  • Biochemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Virology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Biophysics (AREA)
  • Immunology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Microbiology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Molecular Biology (AREA)
  • Veterinary Medicine (AREA)
  • Public Health (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Epidemiology (AREA)
  • Toxicology (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Diabetes (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

The invention relates to the field of biomedicine, in particular to a codon-optimized coagulation factor VIII gene and constructs thereof. The invention provides a codon-optimized coagulation factor VIII gene in which the GC content of the gene sequence is at least 54%, the CpG island content is at least 122, and the gene sequence of the B domain is a truncated sequence. The invention also provides a nucleic acid construct comprising the codon-optimized factor VIII gene. The codon-optimized coagulation factor VIII gene and its constructs have the following beneficial effects: the expression level of coagulation factor VIII is significantly increased; the activity level of coagulation factor VIII is significantly increased; and the required dose is small, so that immune response and immunogenicity are reduced and the risk is relatively controllable.

Description

Codon-optimized coagulation factor VIII gene and construct thereof
Technical Field
The invention relates to the technical field of medicine, in particular to a codon-optimized coagulation factor VIII gene and constructs thereof.
Background
Blood coagulation factors are the various protein components involved in the process of blood coagulation. Acting together, these factors maintain normal coagulation physiology during bleeding. If one or more coagulation factors are missing, the result is a group of hereditary hemorrhagic diseases known as hemophilia. The most common form clinically, hemophilia A, is an X-linked recessive genetic disease caused by a deficiency of coagulation factor VIII (antihemophilic globulin A, AHGA). Patients develop coagulation disorders of varying severity and spontaneous bleeding.
Coagulation factor replacement therapy is currently the only proven and effective treatment for hemophilia A, but it is expensive, and inhibitors can develop after long-term use. Gene therapy with AAV or lentiviral vectors as the delivery vehicle allows sustained in vivo expression of fully functional FVIII, either by permanent integration of the gene encoding the factor VIII (FVIII) protein into human cells or by long-term persistence in non-dividing cells. Because of its potential long-term efficacy and safety, it is the most promising therapeutic approach for hemophilia A. However, the FVIII protein has a complex gene structure with a total length of more than 2330 amino acids, which poses technical challenges both for in vitro expression and purification and for in vivo gene vector therapy. Most of the FVIII currently used clinically is fully functional truncated FVIII (as shown in Figure 1).
Improving the FVIII expression cassette of the gene therapy vector is a key factor in determining the efficiency of gene therapy. First, only optimized protein expression efficiency leads to higher protein levels, which is the greatest challenge facing most current gene therapy clinical studies for hemophilia A. The plasma level of functional FVIII protein, i.e. the restoration of procoagulant function, is one of the most important evaluation indicators in gene therapy studies at both the preclinical and clinical stages. Clinical data published to date for gene therapy of hemophilia show that no study has achieved 100% recovery of procoagulant function.
Second, optimized protein expression efficiency means that a lower vector dose is required to maintain the same protein level. When gene therapy works by permanently integrating a protein-encoding gene into human cells, the safety risk posed by random integration sites is one of the most important safety considerations in clinical research, especially where lentiviral vectors are the delivery vehicle. Current research tends to suggest that the number of random integration sites within a single cell (vector copy number, VCN) is positively correlated with the risk of neoplasia due to random integration. Higher protein expression per integrated copy means that fewer random integration sites are needed within a single cell, making the safety risk more controllable.
Likewise, the lower the vector dose required to maintain the same protein level, the lower the immune response and immunogenicity caused by the delivery vehicle itself. This is particularly evident when AAV is the delivery vehicle, since most AAV serotypes (e.g., AAV2) meet some degree of pre-existing immunity in the human population, and repeated dosing provokes immune responses to AAV. Lower vector doses result in a lower immune response and immunogenicity as well as more sustained protein expression.
The biggest technical challenge in improving the FVIII expression cassette of a gene therapy vector is codon optimization of the FVIII gene. Many general codon optimization principles exist for increasing protein expression from gene therapy vectors in vivo, but they cannot all be applied at once to a single gene therapy vector, because different principles are often mutually incompatible. For example, changing the CpG island composition or the GC content of the coding region as a whole necessarily affects the choice of preferred codons. Another consideration is that different codon optimizations give not only different protein expression levels but also different post-translational modifications of FVIII and different biological activities, so codon optimization must also aim at optimized biological activity in vivo. In addition, the prior art and conventional understanding hold that too high a GC content impairs the transcription efficiency of a gene, so avoiding an excessively high GC content is a repeatedly cited optimization principle.
Disclosure of Invention
In view of the above-mentioned drawbacks of the prior art, it is an object of the present invention to provide a codon optimized factor VIII gene and constructs thereof, which solve the problems of the prior art.
In order to achieve the above objects and other related objects, a first aspect of the present invention provides a codon-optimized factor VIII gene, wherein the GC content of the gene sequence of the factor VIII gene is at least 54%, the CpG island content of the gene sequence of the factor VIII gene is at least 122, and the gene sequence of the B domain of the codon-optimized factor VIII gene is a truncated sequence.
Preferably, the GC content in the codon-optimized sequence is 54-63%, and the CpG island content is 122-248.
A second aspect provides a nucleic acid construct comprising the codon-optimized factor VIII gene.
A third aspect provides a lentivirus virally packaged from the nucleic acid construct.
A fourth aspect provides a lentiviral vector system comprising the nucleic acid construct and a helper plasmid.
In a fifth aspect, a recombinant factor VIII is provided, which is secreted from a cell after the lentivirus infects the cell.
The sixth aspect provides a composition for preventing or treating a coagulation factor deficiency disease, wherein the effective substance of the composition contains one or more of the following substances: a codon optimized factor VIII gene; the nucleic acid construct; the lentivirus.
A seventh aspect provides a cell line that is infected with the lentivirus.
An eighth aspect provides the use of the codon-optimized factor VIII gene, the nucleic acid construct, the lentivirus, the recombinant factor VIII, the composition or the cell line for the preparation of a medicament for the prevention or treatment of a coagulation factor deficiency disease.
As described above, the present invention has the following advantageous effects:
1) the expression level of the blood coagulation factor VIII can be obviously improved while the GC content is higher;
2) the activity level of the blood coagulation factor VIII can be obviously improved while the GC content is higher;
3) the dosage is small, the immune response and the immunogenicity are reduced, and the risk is relatively controllable.
Drawings
Figure 1 shows a schematic of factor VIII, full length factor VIII on the left and truncated factor VIII on the right.
FIG. 2 shows a schematic design of a factor VIII nucleic acid construct of the present invention.
FIG. 3 shows a lentiviral backbone plasmid map of the invention.
FIG. 4 shows the expression level of coagulation factor VIII in the supernatant after infection of the HepG2 liver cell line with TTR promoter-driven lentivirus, as measured by Western blot on 20 μL of supernatant.
FIG. 5 shows the expression level of coagulation factor VIII in the supernatant after infection of the 293T cell line with EF1a promoter-driven lentivirus, as measured by Western blot on 20 μL of supernatant.
Detailed Description
Unless defined otherwise below, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
The term "nucleic acid construct" refers to an artificially constructed nucleic acid segment that can be introduced into a target cell or tissue, which can be a lentiviral vector comprising a vector backbone, i.e., an empty vector, and an expression framework.
The term "vector" refers to a nucleic acid fragment or polynucleotide fragment for introducing or transferring one or more nucleic acids or one or more polynucleotides into a target cell or tissue. Typically, the vector is used to introduce the foreign DNA into another cell or tissue. The vector may comprise a bacterial resistance gene for growth in bacteria and a promoter for expression of a protein of interest in an organism. The DNA may be generated in vitro by PCR or any other suitable technique or techniques known to those skilled in the art.
The term "expression cassette" refers to a sequence having the potential to encode a protein.
Through intensive research, the inventor provides a codon-optimized coagulation factor VIII gene in a first aspect, wherein the GC content in a codon-optimized sequence is at least 54%, the CpG island content is at least 122, and the gene sequence of a B structural domain in the codon-optimized coagulation factor VIII gene is a truncated sequence.
For example, the GC content may be 55% to 60%, 61% to 65%, 66% to 70%, 71% to 75%, 76% to 80%, 81% to 85%, 86% to 90%, 91% to 95%.
In one embodiment of the invention, the content of GC in the blood coagulation factor VIII gene sequence after codon optimization is 54-63%, and the content of CpG islands is 122-248.
The principles used in codon optimisation include:
1) preferentially selecting a mammalian cell biased codon according to codon usage preferences of different species;
2) avoiding cryptic splice sites, splice donor and acceptor sequences, premature polyadenylation signals, strong mRNA secondary structures, RNA instability sequences, and transcription termination signals.
Codon optimization of the gene of interest can be carried out with existing software such as DNAWorks, UpGene, or Benchling.
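The screening for unwanted sequence elements named in principle 2 can be sketched as a simple motif scan. The example below is only an illustration under stated assumptions: it uses the two canonical polyadenylation hexamers (AATAAA and ATTAAA) as stand-ins for tailing signals, the function name `find_motifs` is hypothetical, and the source does not disclose which motif list or software settings were actually used.

```python
import re

# Assumption: canonical poly(A) signal hexamers; the source does not list its motifs.
POLYA_SIGNALS = ("AATAAA", "ATTAAA")

def find_motifs(seq, motifs=POLYA_SIGNALS):
    """Return sorted (position, motif) pairs, 0-based, including overlapping hits."""
    seq = seq.upper()
    hits = []
    for motif in motifs:
        # A lookahead pattern reports every start position, even for overlapping hits.
        for match in re.finditer(f"(?={motif})", seq):
            hits.append((match.start(), motif))
    return sorted(hits)

# Toy coding fragment (not taken from any SEQ ID NO in the source).
example = "ATGGCCAATAAAGGCATTAAACTG"
print(find_motifs(example))
```

A candidate sequence with no hits would pass this particular filter; the other elements in principle 2 (splice sites, secondary structure, instability sequences) would need their own checks.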
Further, the coagulation factor VIII gene is a mammalian gene, for example the human coagulation factor VIII gene (Gene ID: 2157).
The GC content, also known as the G + C ratio or GC ratio, is usually expressed as a percentage. The GC content calculation formula is:
GC content = [(total number of G + C) / (total number of A + T + C + G)] × 100%
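The formula above can be written as a short function. This is a minimal sketch; the function name `gc_content` is hypothetical, and characters other than A, T, C and G are simply ignored, which is an assumption not stated in the source.

```python
def gc_content(seq):
    """GC content in percent: (G + C) / (A + T + C + G) * 100."""
    seq = seq.upper()
    counts = {base: seq.count(base) for base in "ATCG"}
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, T, C or G")
    return 100.0 * (counts["G"] + counts["C"]) / total

print(gc_content("ATGC"))  # two of four bases are G or C
```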
A sequence that meets all of the following conditions is called a CpG island:
1) GC content: the GC content is at least 55%;
2) dinucleotide frequency: the frequency of CpG dinucleotides (the ratio of the observed value to the expected value) is at least 65%;
3) sequence length: the length is not less than 200 bp.
Here, the "observed value" is the number of CpG sites actually contained in the fragment;
the "expected value" is calculated as (C × G)/LS, where C and G are the numbers of cytosines and guanines, and LS is the fragment length (length of sequence).
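The three CpG island conditions can be combined into one check. The sketch below follows the definitions given here, with the expected value taken as the number of C multiplied by the number of G, divided by the fragment length; the function names are hypothetical and the input is assumed to contain only A, T, C and G.

```python
def cpg_observed_expected(seq):
    """Ratio of observed CpG count to expected count (C * G / length)."""
    seq = seq.upper()
    observed = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    c_count, g_count = seq.count("C"), seq.count("G")
    if c_count == 0 or g_count == 0:
        return 0.0
    expected = c_count * g_count / len(seq)
    return observed / expected

def is_cpg_island(seq, min_gc=55.0, min_ratio=0.65, min_len=200):
    """Apply the three conditions: GC content, observed/expected ratio, length."""
    seq = seq.upper()
    if len(seq) < min_len:
        return False
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / len(seq)
    return gc_percent >= min_gc and cpg_observed_expected(seq) >= min_ratio

# Toy fragments, not taken from the source sequences.
print(is_cpg_island("CG" * 100))  # 200 bp of alternating C and G
print(is_cpg_island("AT" * 100))  # 200 bp without any C or G
```

How the source turns island calls into the single "CpG content" figure reported per sequence is not described, so this sketch stops at classifying one fragment.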
Specifically, the truncated B domain has an amino acid sequence length of 21 aa, and its sequence is shown in SEQ ID NO: 2.
Specifically, the nucleotide sequence of the codon-optimized coagulation factor VIII gene is SEQ ID NO: 3, SEQ ID NO: 8 or SEQ ID NO: 10.
Specifically, the promoter of the codon-optimized coagulation factor VIII gene is TTR or EF1a.
In a second aspect, the invention provides a nucleic acid construct comprising said codon-optimized factor VIII gene.
Further, the nucleic acid construct is a non-viral vector or a viral vector.
The non-viral vector mediates gene transfer by using the physicochemical properties of the non-viral vector material.
Further, the viral vector is a lentiviral vector or an adeno-associated viral vector.
The vector backbone in the lentiviral vector may be a vector backbone as described in the prior art.
Further, the vector backbone is pKL-CCL, and the nucleotide sequence of the lentiviral vector backbone is shown in SEQ ID NO: 1.
In a third aspect, the invention provides a lentivirus virally packaged from the nucleic acid construct.
In a fourth aspect, the invention provides a lentiviral vector system comprising the nucleic acid construct and a helper plasmid.
Further, the helper plasmids encode one or more nucleotide sequences of the gag and pol proteins, as well as other necessary viral packaging component nucleotide sequences, and may include packaging and envelope plasmids.
Further, the lentiviral vector system also includes a host cell, which may be selected from a variety of applicable host cells in the art, as long as it does not limit the object of the present invention. A particular suitable cell may be a lentivirus-producing cell, for example a 293T cell.
The fifth aspect of the invention provides a recombinant coagulation factor VIII, wherein the recombinant coagulation factor VIII is secreted by cells after the lentivirus infects the cells.
The sixth aspect of the present invention provides a composition for preventing or treating a coagulation factor deficiency disease, wherein the effective substance comprises one or more of the following substances: a codon optimized factor VIII gene; the nucleic acid construct; the lentivirus; recombinant factor VIII.
The composition may be a pharmaceutical composition.
The form of the composition is not particularly limited, and may be in the form of various substances such as solid, liquid, gel, semifluid, aerosol, etc.
When the composition is used for preventing or treating a coagulation factor deficiency disease, an effective dose of the composition needs to be administered to a subject. Using this method, the subject is capable of normally expressing a coagulation factor.
In a seventh aspect, the invention provides a cell line infected with the lentivirus.
The cell line can be used as a biological agent for preparing products for preventing or treating the blood coagulation factor deficiency disease.
The eighth aspect of the invention provides the use of the codon-optimized factor VIII gene, the nucleic acid construct, the lentivirus, the recombinant factor VIII, the composition or the cell line in the preparation of a medicament for preventing or treating a coagulation factor deficiency disease.
The use may be in the development, screening, or pharmacological and toxicological evaluation of drugs for coagulation factor deficiency diseases.
The coagulation factor deficiency disease in the sixth, seventh or eighth aspect of the present invention is an inherited coagulation factor deficiency disease, such as one or more of hemophilia A, hemophilia B and hemophilia C.
The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention.
Before the present embodiments are further described, it is to be understood that the scope of the invention is not limited to the particular embodiments described below; it is also to be understood that the terminology used in the examples is for the purpose of describing particular embodiments, and is not intended to limit the scope of the present invention; in the description and claims of the present application, the singular forms "a", "an" and "the" include plural referents unless the context clearly dictates otherwise.
When numerical ranges are given in the examples, it is understood that both endpoints of each range and any value between them can be selected, unless otherwise indicated. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. In addition to the specific methods, devices, and materials used in the examples, any methods, devices, and materials similar or equivalent to those described in the examples may be used in the practice of the invention, in keeping with the knowledge of one skilled in the art and with the description of the invention.
Example 1 coagulation factor VIII Gene expression framework design
An expression frame was designed as the wild-type expression frame based on the amino acid sequence of the recombinant human coagulation factor VIII for injection Xyntha, produced by Pfizer. The sequence mainly comprises the A1, A2, B, A3, C1 and C2 regions; after truncation the B domain lacks 887 amino acids and retains only 21. This expression frame is numbered P0001.
Starting from expression frame P0001, three further expression frames, P0002, P0003 and P0004, were designed, each codon-optimized under different conditions. The common optimization principle is to prefer codons favored by mammalian cells while avoiding cryptic splice sites, splice donor and acceptor sequences, premature polyadenylation signals, strong mRNA secondary structures, RNA instability sequences, transcription termination signals, and the like.
P0002 satisfies these conditions while keeping the GC content as uniform as possible, without regard to CpG content. P0003 satisfies these conditions while increasing GC% as far as possible and minimizing CpG content. P0004 satisfies these conditions while raising GC% to the maximum, with no regard to CpG. As shown in FIG. 2, the expression frames are driven by different promoters. The information for each expression frame is shown in Table 1.
TABLE 1 Expression frame optimization
Numbering | Optimization mode | CpG content | GC content
P0001 | Wild type | 55 | 44%
P0002 | Universal optimization | 204 | 47%
P0003 | Kanglin optimization | 122 | 54%
P0004 | Kanglin optimization | 248 | 63%
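As a worked check, the Table 1 values can be compared with the thresholds stated for the first aspect (GC content of at least 54% and CpG content of at least 122). The sketch below only restates the table; the variable and function names are hypothetical.

```python
# (numbering, optimization mode, CpG content, GC content in percent), from Table 1
FRAMES = [
    ("P0001", "Wild type", 55, 44),
    ("P0002", "Universal optimization", 204, 47),
    ("P0003", "Kanglin optimization", 122, 54),
    ("P0004", "Kanglin optimization", 248, 63),
]

def meets_first_aspect(cpg, gc_percent, min_cpg=122, min_gc=54):
    """True when both thresholds of the first aspect are satisfied."""
    return cpg >= min_cpg and gc_percent >= min_gc

selected = [name for name, _, cpg, gc in FRAMES if meets_first_aspect(cpg, gc)]
print(selected)  # ['P0003', 'P0004']
```

P0002 passes the CpG threshold but not the GC threshold, which is why only the two Kanglin-optimized frames fall within the 54-63% and 122-248 ranges.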
Example 2 construction of a factor VIII Gene-expressing Lentiviral vector
The coagulation factor VIII gene expression frame P0001 designed in Example 1 is driven by the TTR promoter, and the other expression frames are each driven by either the TTR or the EF1a promoter. The expression frames were cloned into a lentiviral backbone to form lentiviral vectors. The backbone is pKL-CCL, an HIV-1-based, third-generation, replication-defective, self-inactivating, pseudotyped lentiviral backbone prepared in-house by Kanglin Biotech (Hangzhou) Co., Ltd.; its nucleotide sequence is SEQ ID NO: 1 and its map is shown in FIG. 3. The lentiviral backbone comprises a chimeric LTR promoter, the HIV-1 packaging signal (psi), a central polypurine tract (cPPT), a Rev response element (RRE), a polypurine tract (PPT), a woodchuck hepatitis virus post-transcriptional regulatory element (WPRE), the SV40 polyadenylation signal (SV40 pA signal), the SV40 origin of replication (SV40 ori), and a self-inactivating long terminal repeat.
The coagulation factor VIII gene expression frame P0002 designed in Example 1 (driven by the TTR promoter; nucleotide sequence SEQ ID NO: 4) was synthesized by Nanjing GenScript Biotech Co., Ltd. and cloned by homologous recombination, a method known in the art, between the ClaI/SalI multiple cloning sites of the lentiviral backbone pKL-CCL. After cloning, the sequence was confirmed by sequencing, and the vector was named pKL-CCL-TTR-P0002 (nucleotide sequence SEQ ID NO: 5).
The wild-type coagulation factor VIII gene expression frame P0001 designed in Example 1 (nucleotide sequence SEQ ID NO: 6) was amplified by PCR, a method known in the art, in two fragments, using as the template cDNA obtained by reverse transcription of mRNA from the human cell line 293T (purchased from the American Type Culture Collection (ATCC), accession number CRL-3216). The fragments were cloned by homologous recombination, a method known in the art, between the AgeI/SalI multiple cloning sites of the lentiviral vector pKL-CCL-TTR-P0002. After cloning, the sequence was confirmed by sequencing, and the vector was named pKL-CCL-TTR-P0001 (nucleotide sequence SEQ ID NO: 7).
Reverse transcription system and procedure:
Figure BDA0002552455200000071
Incubate at 65 °C for 5 min, then cool rapidly on ice.
Figure BDA0002552455200000072
Program: 42 °C for 45 min; 70 °C for 15 min.
Wild-type FVIII (B region deleted) was amplified in two fragments with the following primers:
FVIII-F1:gtccactcattcttggatccaccggtgccaccatgcaaatagagctctccacctg;(SEQ ID NO:26)
FVIII-R1:cgtttcaagactggtgggttttggctagggtgtcttgaattctgggagaagc;(SEQ ID NO:27)
FVIII-F2:gcttctcccagaattcaagacaccctagccaaaacccaccagtcttgaaacg;(SEQ ID NO:28)
FVIII-R2:tccagaggttgattgtcgacgtttaaacgcggccgctcagtagaggtcctgtgcctcgc(SEQ ID NO:29)
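The two-fragment design relies on an overlap between the fragments, and the primer sequences above can be used to verify it: FVIII-R1 is the exact reverse complement of FVIII-F2, so the two PCR products share that stretch. The sketch below checks this directly; the helper name `reverse_complement` is hypothetical, and the reading of the overlap as the junction used for homologous recombination is an inference from the cloning description rather than something the source states outright.

```python
# Lower-case DNA complement table.
COMPLEMENT = str.maketrans("acgt", "tgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (lower case)."""
    return seq.lower().translate(COMPLEMENT)[::-1]

# Primer sequences as listed (SEQ ID NO: 27 and SEQ ID NO: 28).
fviii_r1 = "cgtttcaagactggtgggttttggctagggtgtcttgaattctgggagaagc"
fviii_f2 = "gcttctcccagaattcaagacaccctagccaaaacccaccagtcttgaaacg"

overlap_ok = reverse_complement(fviii_f2) == fviii_r1
print(len(fviii_r1), overlap_ok)
```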
Amplification program:
Figure BDA0002552455200000081
The coagulation factor VIII gene expression frame P0003 designed in Example 1 (nucleotide sequence SEQ ID NO: 8) was synthesized by Nanjing GenScript Biotech Co., Ltd. and cloned by ligation, a method well known in the art, between the AgeI/SalI multiple cloning sites of the lentiviral vector pKL-CCL-TTR-P0002. After cloning, the sequence was confirmed by sequencing, and the vector was named pKL-CCL-TTR-P0003 (nucleotide sequence SEQ ID NO: 9).
The coagulation factor VIII gene expression frame P0004 designed in Example 1 (nucleotide sequence SEQ ID NO: 10) was synthesized by Nanjing GenScript Biotech Co., Ltd. and cloned by ligation, a method well known in the art, between the AgeI/SalI multiple cloning sites of the lentiviral vector pKL-CCL-TTR-P0002. After cloning, the sequence was confirmed by sequencing, and the vector was named pKL-CCL-TTR-P0004 (nucleotide sequence SEQ ID NO: 11).
The EF1a promoter (nucleotide sequence SEQ ID NO: 12) was amplified by PCR, a method well known in the art, using pKL-CCL-EF1a (nucleotide sequence SEQ ID NO: 13) as the template, and then cloned by ligation, a method well known in the art, between the ClaI/BamHI multiple cloning sites of the lentiviral vector pKL-CCL-TTR-P0002. After cloning, the sequence was confirmed by sequencing, and the vector was named pKL-CCL-EF1a-P0002 (nucleotide sequence SEQ ID NO: 14).
EF1a amplification primer:
EF1a-F:acaaaaattcaaaattttatcgataagagcatgcgtgaggctccggtgcccgtcag;(SEQ ID NO:30)
EF1a-R:catggtggcaccggtggatccaagaatgagtcacgacacctgaaatggaag;(SEQ ID NO:31)
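The EF1a primers can be checked for the restriction sites used in the cloning step. The sketch below scans the two primer sequences for the standard recognition sequences of ClaI (ATCGAT), BamHI (GGATCC), AgeI (ACCGGT) and SalI (GTCGAC); the primers are keyed by SEQ ID number, and the function name `sites_in` is hypothetical.

```python
# Standard recognition sequences of the enzymes named in Example 2.
SITES = {"ClaI": "atcgat", "BamHI": "ggatcc", "AgeI": "accggt", "SalI": "gtcgac"}

# Primer sequences as listed for the EF1a promoter amplification.
PRIMERS = {
    "SEQ ID NO: 30": "acaaaaattcaaaattttatcgataagagcatgcgtgaggctccggtgcccgtcag",
    "SEQ ID NO: 31": "catggtggcaccggtggatccaagaatgagtcacgacacctgaaatggaag",
}

def sites_in(seq):
    """Names of the enzymes whose recognition sequence occurs in seq."""
    seq = seq.lower()
    return [name for name, site in SITES.items() if site in seq]

for label, seq in PRIMERS.items():
    print(label, sites_in(seq))
```

The first primer carries a ClaI site and the second a BamHI site, matching the ClaI/BamHI cloning described for the EF1a promoter; the second primer also contains an AgeI site.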
Amplification program:
Figure BDA0002552455200000082
The EF1a promoter was amplified by a PCR method well known in the art, using pKL-CCL-EF1a as the template, and then cloned between the ClaI/BamHI multiple cloning sites of the lentiviral expression vector pKL-CCL-TTR-P0003 by a ligation method well known in the art. After cloning, the sequence was confirmed by sequencing, and the construct was named pKL-CCL-EF1a-P0003 (nucleotide sequence SEQ ID NO:15).
The EF1a promoter was amplified by a PCR method well known in the art, using pKL-CCL-EF1a as the template, and then cloned between the ClaI/BamHI multiple cloning sites of the lentiviral vector pKL-CCL-TTR-P0004 by a ligation method well known in the art. After cloning, the sequence was confirmed by sequencing, and the construct was named pKL-CCL-EF1a-P0004 (nucleotide sequence SEQ ID NO:16).
Example 3: Packaging of lentivirus expressing the coagulation factor VIII gene
The factor VIII gene lentiviral expression vectors constructed in Example 2 (pKL-CCL-TTR-P0001, pKL-CCL-TTR-P0002, pKL-CCL-TTR-P0003, pKL-CCL-TTR-P0004, pKL-CCL-EF1a-P0002, pKL-CCL-EF1a-P0003, pKL-CCL-EF1a-P0004), the envelope plasmid (pKL-Kan-Vsvg, nucleotide sequence shown in SEQ ID NO:17) and the packaging plasmids (pKL-Kan-Rev, nucleotide sequence shown in SEQ ID NO:18; pKL-Kan-GagPol, nucleotide sequence shown in SEQ ID NO:19) were co-transfected into 293T cells (purchased from the American Type Culture Collection (ATCC), accession number CRL-3216), and the coagulation factor VIII gene therapy lentiviral vectors were packaged in this 293T cell line. Transfection was a PEI cationic polymer-mediated transient transfection of eukaryotic cells; the PEI cationic polymer was the PEI-Max transfection reagent (purchased from Polysciences, Cat. No. 24765-1). Transfection was carried out according to the standard procedure recommended by the manufacturer, at the scale of a 10 cm2 cell culture dish.
At 48 hours after transfection, the lentiviral vector (transfected cell culture supernatant) was harvested. It was first centrifuged at 4000 rpm for 5 minutes at room temperature in a benchtop swinging-bucket centrifuge to remove cell debris, and then centrifuged at 10000 g for 4 hours at 4 ℃ to pellet the virus particles. After the supernatant was removed, 1 ml of DMEM complete medium was added to the pellet, the virus particles were resuspended with a microsyringe, and the resuspended virus was aliquoted and stored at -80 ℃ until use.
Different volumes of lentiviral vector were added to MT4 cells, a human CD4+ T cell line (purchased from Shanghai Seoul Biotechnology Limited), pre-plated in 96-well cell culture plates. Cultures infected with the EGFP reporter lentiviral vector (a lentiviral vector packaged with pCCL-sin-EF1α-WPRE-EGFP as described above) were used as positive controls, and the infectious titers of the lentiviral vectors at initial harvest were calculated by quantitative PCR and by flow cytometry based on the GFP signal, using methods well known in the art. The primer and probe sequences used for quantitative PCR were:
LTR-For primer:CTGTTGTGTGACTCTGGTAACT(SEQ ID NO:20)
LTR probe:5’-AAATCTCTAGCAGTGGCGCCCG-3’(SEQ ID NO:21)
LTR-Rev primer:TTCGCTTTCAAGTCCCTGTT(SEQ ID NO:22)
HK Forward primer 5’-GCTGTCATCTCTTGTGGGCTGT-3’(SEQ ID NO:23)
HK probe 5’-CCTGTCATGCCCACACAAATCTCTCC-3’(SEQ ID NO:24)
HK Reverse primer 5’-ACTCATGGGAGCTGCTGGTTC-3’(SEQ ID NO:25)
wherein the LTR probe carries a 6FAM fluorophore at the 5' end and an MGB (minor groove binder) quencher moiety at the 3' end;
the HK probe carries a CY5 fluorophore at the 5' end and a BHQ2 quencher at the 3' end.
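As a quick in-silico check of the LTR primer and probe set, the sketch below locates the amplicon on an excerpt of the vector backbone and confirms that the probe lies inside it. The excerpt is copied from SEQ ID NO:1 of the sequence listing (nucleotides 2881 to 3010); the function names `revcomp` and `amplicon` are illustrative and not part of the patent, and exact string matching is assumed (no mismatches).

```python
LTR_FOR = "CTGTTGTGTGACTCTGGTAACT"    # SEQ ID NO:20
LTR_PROBE = "AAATCTCTAGCAGTGGCGCCCG"  # SEQ ID NO:21
LTR_REV = "TTCGCTTTCAAGTCCCTGTT"      # SEQ ID NO:22

# Excerpt of SEQ ID NO:1 (nucleotides 2881-3010) from the sequence listing.
LTR_REGION = (
    "gtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcaga"
    "cccttttagtcagtgtggaaaatctctagcagtggcgcccgaacagggacttgaaagcga"
    "aagggaaacc"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (upper case)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def amplicon(template: str, fwd: str, rev: str):
    """Return the template region bounded by fwd and the reverse
    complement of rev, or None if either primer has no exact match."""
    t = template.upper()
    start = t.find(fwd.upper())
    if start == -1:
        return None
    rev_site = revcomp(rev)
    end = t.find(rev_site, start)
    if end == -1:
        return None
    return t[start:end + len(rev_site)]


if __name__ == "__main__":
    amp = amplicon(LTR_REGION, LTR_FOR, LTR_REV)
    print("amplicon length:", len(amp) if amp else None)
    print("probe inside amplicon:", bool(amp) and LTR_PROBE in amp)
```

The same function can be pointed at any of the full vector sequences in the sequence listing; a return value of `None` would indicate that a primer does not match the template exactly.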
Example 4: Detection of protein expression efficiency and activity in cells transduced with lentivirus expressing the coagulation factor VIII gene
The crude lentiviral vectors packaged in Example 3 (pKL-CCL-TTR-P0001, pKL-CCL-TTR-P0002, pKL-CCL-TTR-P0003, pKL-CCL-TTR-P0004, pKL-CCL-EF1a-P0002, pKL-CCL-EF1a-P0003 and pKL-CCL-EF1a-P0004) were used, grouped by promoter, to inoculate HepG2 cells (purchased from Nanke Baibo organism) and 293T cells pre-plated in 24-well cell culture plates. HepG2 cell culture supernatants from 48 h to 72 h and 293T cell culture supernatants from 24 h to 72 h were then collected and frozen at -80 ℃ for Western blot detection of protein and detection of FVIII activity, and the cells were collected at 72 h for VCN detection:
HepG2 cells were seeded in 24-well cell culture plates at 2.50 × 10⁵ cells per well. After 2 h, the HepG2 cells were infected with the pKL-CCL-TTR-P0001, pKL-CCL-TTR-P0002, pKL-CCL-TTR-P0003 and pKL-CCL-TTR-P0004 lentiviral vectors in a volume gradient of 25 μl, 50 μl, 100 μl and 200 μl. The medium was changed after overnight transduction and changed again at 48 h. At 72 h, the cell culture supernatant was collected and frozen at -80 ℃ for Western blot and FVIII activity detection, and the cells were collected at the same time for VCN detection.
293T cells were seeded in 24-well cell culture plates at 1.00 × 10⁵ cells per well. After 2 h, the 293T cells were infected with the pKL-CCL-EF1a-P0002, pKL-CCL-EF1a-P0003 and pKL-CCL-EF1a-P0004 lentiviral vectors in a volume gradient of 10 μl, 20 μl, 40 μl and 80 μl. The medium was changed after 24 h of transduction; the cell culture supernatant was then collected and frozen at -80 ℃ for Western blot and FVIII activity detection, and the cells were collected at the same time for VCN detection.
The collected cells were washed with PBS and pelleted by centrifugation at 4200 rpm for 5 min, then resuspended in 50 μl of quick extraction solution (QuickExtract™ DNA Extraction Solution, purchased from Lucigen, Cat. No. QE09050). A PCR instrument running the program in Table 2 below was used to lyse the cells and extract total DNA.
TABLE 2 PCR program
Temperature    Time
65 ℃    15 min
68 ℃    15 min
95 ℃    10 min
The vector copy number (VCN) of the lentivirus in infected HepG2 and 293T cells was calculated from the quantitative PCR data by methods well known in the art; the results are shown in Tables 3 and 4.
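The patent does not spell out the VCN calculation. One common approach, sketched below purely as an assumption, converts the Ct values of the LTR (vector) and HK (housekeeping) assays into absolute copy numbers through standard curves and then divides vector copies by the number of cell equivalents. The function names, the standard-curve parameters and the default of two HK target copies per cell are all hypothetical; the two-copy assumption in particular may not hold exactly for cultured cell lines.

```python
def copies_from_ct(ct: float, slope: float, intercept: float) -> float:
    """Invert a standard curve of the form Ct = slope * log10(copies) + intercept."""
    return 10 ** ((ct - intercept) / slope)


def vector_copy_number(ltr_ct: float, hk_ct: float,
                       ltr_curve: tuple, hk_curve: tuple,
                       hk_copies_per_cell: float = 2.0) -> float:
    """Estimate vector copies per cell from duplex qPCR Ct values.

    ltr_curve and hk_curve are (slope, intercept) pairs from standard curves.
    hk_copies_per_cell is an assumption about the housekeeping target.
    """
    ltr_copies = copies_from_ct(ltr_ct, *ltr_curve)
    hk_copies = copies_from_ct(hk_ct, *hk_curve)
    cell_equivalents = hk_copies / hk_copies_per_cell
    return ltr_copies / cell_equivalents


if __name__ == "__main__":
    # Hypothetical standard curve and Ct values, for illustration only.
    curve = (-3.32, 38.0)
    print(vector_copy_number(ltr_ct=25.0, hk_ct=25.0,
                             ltr_curve=curve, hk_curve=curve))
```

With identical curves and identical Ct values the two targets are present in equal numbers, so the estimate equals the assumed number of HK copies per cell; real data would use separately fitted curves for each assay.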
TABLE 3 VCN after lentiviral transduction of HepG2 cells
Figure BDA0002552455200000111
TABLE 4 VCN after lentivirus transduction of 293T cells
Figure BDA0002552455200000112
Based on the VCN results, samples with similar VCN were selected for activity detection. The detection kit was Biophen FVIII:C (Chromogenic assay for measuring Factor VIII:C in plasma, or in concentrates), product number 221402. The standard was recombinant human coagulation factor VIII for injection (Xyntha) produced by Pfizer, 500 IU/vial, registration number S20120059. The coagulation factor VIII activity of normal human plasma was defined as 1 IU/mL, and comparative analysis was performed on this basis; the data are shown in Table 5.
TABLE 5 detection of coagulation factor VIII Activity
Figure BDA0002552455200000113
The data were compared on the basis of the 24 h expression activity equivalent of 1.00E+06 cells per unit VCN. For the TTR promoter series of lentiviruses after infection of HepG2 cells, the overall ranking was pKL-TTR-P0004 > pKL-TTR-P0003 > pKL-TTR-P0002 > pKL-TTR-P0001: pKL-TTR-P0004 was the best overall, and activity from wild-type pKL-TTR-P0001 was essentially undetectable. For the EF1a promoter series of lentiviruses after infection of 293T cells, pKL-EF1a-P0004 was similar to pKL-EF1a-P0003, and both were superior to pKL-EF1a-P0002. The codon optimization strategy of Kanglin thus markedly increased the activity level of coagulation factor VIII in the cell culture supernatant.
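The normalization described above (activity per 1.00E+06 cells, per 24 h, per unit VCN) is not given as a formula in the patent. The sketch below shows one plausible reading, stated as an assumption: the total activity in the collected supernatant is scaled to one million seeded cells, to a 24 h collection window, and divided by the VCN. The function name, parameter names and example numbers are hypothetical.

```python
def normalized_activity(activity_iu_per_ml: float, volume_ml: float,
                        cells: float, hours: float, vcn: float) -> float:
    """Return FVIII activity in IU per 1e6 cells per 24 h per vector copy.

    This is an assumed normalization, not a formula taken from the patent.
    """
    if cells <= 0 or hours <= 0 or vcn <= 0:
        raise ValueError("cells, hours and vcn must be positive")
    total_iu = activity_iu_per_ml * volume_ml   # activity in the whole sample
    per_million_cells = total_iu / (cells / 1e6)
    per_24h = per_million_cells / (hours / 24.0)
    return per_24h / vcn


if __name__ == "__main__":
    # Hypothetical values: 0.5 IU/mL in 0.5 mL from 2.5e5 cells over 24 h, VCN 1.
    print(normalized_activity(0.5, 0.5, 2.5e5, 24.0, 1.0))
```

Dividing by VCN is what allows constructs to be compared "at similar VCN": a sample with twice the vector copies but the same measured activity receives half the normalized value.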
Based on the VCN results and the activity detection results, a subset of the cell culture supernatant samples was selected for Western blot. The loading volume was 20 μl. The antibody was Factor VIII (R8B12) (sc-73597), purchased from Santa Cruz; its epitope binding site is A2 domain peptides 497-.
The detection results show that, at similar VCN, the codon optimization strategy of Kanglin markedly increased the expression level of coagulation factor VIII in the cell culture supernatant.
The above examples are intended to illustrate the disclosed embodiments of the invention and are not to be construed as limiting the invention. In addition, various modifications of the invention set forth herein, as well as variations of the methods of the invention, will be apparent to persons skilled in the art without departing from the scope and spirit of the invention. While the invention has been specifically described in connection with various specific preferred embodiments thereof, it should be understood that the invention should not be unduly limited to such specific embodiments. Indeed, various modifications of the above-described embodiments which are obvious to those skilled in the art to which the invention pertains are intended to be covered by the scope of the present invention.
Sequence listing
<110> Kanglin Biotech (Hangzhou) Co., Ltd.
<120> a codon optimized coagulation factor VIII gene and its construct
<160> 31
<170> SIPOSequenceListing 1.0
<210> 1
<211> 6594
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 1
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 3000
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 3060
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 3120
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 3180
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 3240
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 3300
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 3360
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 3420
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 3480
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 3540
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 3600
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 3660
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 3720
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 3780
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 3840
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 3900
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 3960
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 4020
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 4080
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 4140
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 4200
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 4260
ccgaggggac ccgacaggcc cgaaggaata gaagaagaag gtggagagag agacagagac 4320
agatccattc gattagtgaa cggatctcga cggtatcggt taacttttaa aagaaaaggg 4380
gggattgggg ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa 4440
actaaagaat tacaaaaaca aattacaaaa attcaaaatt ttatcgatga gtaattcata 4500
caaaaggact cgcccctgcc ttggggaatc ccagggaccg tcgttaaact cccactaacg 4560
tagaacccag agatcgctgc gttcccgccc cctcacccgc ccgctctcgt catcactgag 4620
gtggagaaga gcatgcgtga gggatccgtc gacaatcaac ctctggatta caaaatttgt 4680
gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct 4740
ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat 4800
aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg 4860
gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag 4920
ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc 4980
tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg 5040
tcggggaagc tgacgtcctt tccatggctg ctcgcctgtg ttgccacctg gattctgcgc 5100
gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc 5160
ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc 5220
tccctttggg ccgcctcccc gcctggaatt cgagctcggt acctttaaga ccaatgactt 5280
acaaggcagc tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa 5340
ttcactccca acgaagacaa gatctgcttt ttgcttgtac tgggtctctc tggttagacc 5400
agatctgagc ctgggagctc tctggctaac tagggaaccc actgcttaag cctcaataaa 5460
gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga 5520
gatccctcag acccttttag tcagtgtgga aaatctctag cagtagtagt tcatgtcatc 5580
ttattattca gtatttataa cttgcaaaga aatgaatatc agagagtgag aggaacttgt 5640
ttattgcagc ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag 5700
catttttttc actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg 5760
tctggctcta gctatcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc 5820
cgcccattct ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc 5880
ctcggcctct gagctattcc agaagtagtg aggaggcttt tttggaggcc tagggacgta 5940
cccaattcgc cctatagtga gtcgtattac gcgcgctcac tggccgtcgt tttacaacgt 6000
cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc 6060
gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc 6120
ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt 6180
acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc 6240
ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct 6300
ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat 6360
ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc 6420
acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc 6480
tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg 6540
atttaacaaa aatttaacgc gaattttaac aaaatattaa cgtttacaat ttcc 6594
<210> 2
<211> 21
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 2
Ser Phe Ser Gln Asn Ser Arg His Pro Ser Gln Asn Pro Pro Val Leu
1 5 10 15
Lys Arg His Gln Arg
20
<210> 3
<211> 4395
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 3
atgcagatcg aactgagtac ctgcttcttt ctttgcttgt tgcgcttttg cttttctgcg 60
acgcggcggt attacttggg ggcggttgag ttgagctggg attatatgca atctgacctg 120
ggtgaactgc cggtcgatgc gcggttcccg ccccgggttc ccaaaagttt tccattcaac 180
acatccgtcg tttacaaaaa aacgcttttc gtagaattca cagatcacct gtttaacata 240
gcgaagccac gacctccctg gatgggcctt ttgggaccta cgatacaagc tgaagtatac 300
gataccgttg ttatcacact gaagaacatg gcaagtcacc ctgtttccct tcatgccgtg 360
ggagtatcat attggaaggc ttctgaagga gcagaatatg atgatcaaac aagtcaaaga 420
gagaaggaag atgacaaagt gttccctggg gggagtcata cgtacgtgtg gcaagtattg 480
aaggaaaatg gtccgatggc gtctgacccg ctttgtctta cctattccta cctttctcac 540
gtggacctgg taaaagatct gaactcaggt ctcattggcg ccctgttggt ttgtcgcgag 600
ggttcattgg caaaagaaaa gactcaaacg cttcacaagt ttatccttct ctttgccgtc 660
ttcgatgaag ggaagtcttg gcatagtgag actaagaact ccctcatgca agacagggat 720
gctgcatccg cgcgagcgtg gcctaagatg cacacggtta acggctatgt gaacaggagc 780
ctgccagggc tcatcggttg ccacaggaag tccgtgtact ggcatgttat agggatgggg 840
actacacctg aagtccattc tatattcctc gaaggacaca cctttcttgt acgaaatcac 900
cgccaagcgt ctcttgaaat ttcccctatt accttcctca ctgcacaaac ccttctgatg 960
gacctgggcc aatttcttct gttctgtcac attagttcac atcaacatga cggtatggag 1020
gcttacgtca aggtggacag ctgcccagag gaaccccaat tgcgcatgaa aaacaatgag 1080
gaagctgaag actacgacga tgatcttacg gactccgaga tggacgtggt tcgctttgat 1140
gatgataatt ctccttcttt catccaaatc cgatctgtgg caaaaaaaca tcccaagacg 1200
tgggtgcatt atatcgctgc ggaggaagaa gattgggact atgctccttt ggtgcttgca 1260
cctgatgacc gcagttataa gtcacagtac ttgaataacg gccctcaaag aatcggaaga 1320
aaatataaga aggtccgatt catggcctac accgacgaga cgttcaaaac ccgagaagct 1380
attcagcacg aaagcggaat actggggccg ctgttgtatg gtgaagtcgg agatacactt 1440
ctcataatat ttaaaaacca ggcttcacgg ccatacaaca tctatccgca tggtatcacc 1500
gacgtgcggc ccctgtatag tcggagactg cctaaggggg taaaacatct caaggatttt 1560
ccgattctcc ccggagaaat tttcaagtat aaatggacgg tgacggtcga ggatggtcct 1620
accaaatccg atccccggtg tctcacaaga tactacagca gcttcgttaa tatggaaaga 1680
gacctcgctt ccggacttat cggaccgttg ctcatatgtt acaaagagtc cgtagatcaa 1740
aggggcaacc aaattatgtc cgataagcgg aatgttatat tgttcagtgt cttcgacgag 1800
aacaggtctt ggtatttgac tgaaaacatc cagcgatttc tgccgaaccc cgcaggggta 1860
caattggagg acccggaatt ccaagctagt aatatcatgc attctatcaa tggatacgta 1920
tttgattccc ttcagcttag cgtttgtctg catgaagtcg catattggta tatccttagt 1980
attggtgctc aaactgactt cctgtctgta tttttttctg gttacacctt caagcacaag 2040
atggtctacg aggacactct tacgcttttt cccttctctg gagagacggt gtttatgagc 2100
atggaaaacc ccgggctttg gattctcggg tgccataatt cagacttccg caacaggggt 2160
atgacagcct tgttgaaggt cagctcctgc gataaaaaca ccggggatta ctatgaagac 2220
tcctacgaag acatttctgc atatctcctc tccaaaaaca acgcgatcga accaaggtct 2280
ttttcccaga actcacggca tccaagccag aatccaccag tgttgaaacg ccatcagagg 2340
gaaattacgc gaacgacctt gcaaagcgat caggaagaaa ttgattatga cgacactata 2400
agtgtagaaa tgaaaaaaga ggactttgac atctatgatg aggatgagaa ccagtcccca 2460
agaagttttc agaagaagac ccgccactat tttattgctg ccgtcgaacg gttgtgggac 2520
tacggaatga gctcctcccc gcatgtgttg cggaatcgag cccaaagtgg ctctgtgcct 2580
cagttcaaaa aggtcgtatt ccaagaattc actgatggca gcttcactca gccactgtat 2640
cggggggagt tgaacgaaca tctcggcctc ttgggcccat acatacgcgc tgaggttgaa 2700
gataacataa tggtaacttt tcgaaatcag gcatcaaggc cttattcatt ttacagctct 2760
ctcatatctt acgaagagga ccaaagacaa ggagcggaac ctcgcaagaa ttttgtaaaa 2820
cccaatgaaa cgaaaacgta tttctggaag gttcagcacc acatggcccc aacaaaggat 2880
gaatttgatt gtaaagcgtg ggcgtatttt agtgacgtcg atctcgaaaa ggatgttcat 2940
tcagggctta tcggtcccct ccttgtgtgt catacaaaca cacttaatcc ggcgcacggt 3000
agacaagtaa ctgtgcagga atttgcgttg tttttcacga tctttgatga aactaagtca 3060
tggtatttta cggagaacat ggagcggaat tgtagggcac catgtaatat acagatggaa 3120
gacccaacct ttaaagaaaa ttacagattc catgccataa acgggtacat catggatact 3180
ctcccaggac tggtaatggc tcaggaccag cgaatacgat ggtacttgct tagcatgggg 3240
agtaacgaaa acatccattc tattcatttt tcaggccatg tgttcactgt cagaaaaaag 3300
gaggagtata agatggcgct ctacaatctg taccccggtg tgtttgagac ggtagaaatg 3360
ctgccctcca aagctggtat atggagagta gagtgtttga taggagaaca tctccacgcc 3420
ggcatgtcta cgctgtttct cgtttacagc aataagtgcc agacccccct ggggatggct 3480
agtgggcaca tccgcgattt tcaaattaca gcatctgggc aatacggtca atgggcgccg 3540
aaactggcta gactgcatta ttccgggtcc attaatgctt ggtccactaa ggagcccttc 3600
agctggatca aggtagacct tctcgcgcct atgattatac acggtataaa gacccaaggt 3660
gccagacaga agtttagtag cctttacata tcacagttta ttataatgta ctccttggat 3720
ggcaagaagt ggcaaaccta taggggtaac tccacgggaa ccctgatggt ctttttcggg 3780
aacgtagact cctcaggaat aaagcacaat attttcaatc ccccaatcat agcgcgctat 3840
atacgacttc atcctacgca ttactccata cgctctacgc tgcgaatgga gctgatgggc 3900
tgcgatctga acagttgctc catgcctctg ggcatggaat ctaaagccat cagcgatgca 3960
caaattaccg ctagtagcta cttcaccaat atgtttgcca catggtcccc gtctaaggct 4020
cgcctgcatc tgcaaggccg gtccaacgca tggcgacctc aggtcaataa cccaaaggaa 4080
tggttgcagg tagactttca gaagaccatg aaggttaccg gggtaactac tcagggggta 4140
aaatcactgt tgactagcat gtacgtgaaa gaattcctca ttagcagtag tcaagatggc 4200
catcagtgga cgctgttctt tcaaaacggg aaggtcaaag ttttccaggg gaatcaggac 4260
tccttcacac ccgtcgtcaa ctcactcgat ccaccactgt tgacccggta cctgagaatc 4320
cacccacaat cctgggttca ccaaatcgca ctcaggatgg aagtactcgg gtgcgaagcg 4380
caggacctct actga 4395
<210> 4
<211> 4395
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 4
atgcagatcg aactgagtac ctgcttcttt ctttgcttgt tgcgcttttg cttttctgcg 60
acgcggcggt attacttggg ggcggttgag ttgagctggg attatatgca atctgacctg 120
ggtgaactgc cggtcgatgc gcggttcccg ccccgggttc ccaaaagttt tccattcaac 180
acatccgtcg tttacaaaaa aacgcttttc gtagaattca cagatcacct gtttaacata 240
gcgaagccac gacctccctg gatgggcctt ttgggaccta cgatacaagc tgaagtatac 300
gataccgttg ttatcacact gaagaacatg gcaagtcacc ctgtttccct tcatgccgtg 360
ggagtatcat attggaaggc ttctgaagga gcagaatatg atgatcaaac aagtcaaaga 420
gagaaggaag atgacaaagt gttccctggg gggagtcata cgtacgtgtg gcaagtattg 480
aaggaaaatg gtccgatggc gtctgacccg ctttgtctta cctattccta cctttctcac 540
gtggacctgg taaaagatct gaactcaggt ctcattggcg ccctgttggt ttgtcgcgag 600
ggttcattgg caaaagaaaa gactcaaacg cttcacaagt ttatccttct ctttgccgtc 660
ttcgatgaag ggaagtcttg gcatagtgag actaagaact ccctcatgca agacagggat 720
gctgcatccg cgcgagcgtg gcctaagatg cacacggtta acggctatgt gaacaggagc 780
ctgccagggc tcatcggttg ccacaggaag tccgtgtact ggcatgttat agggatgggg 840
actacacctg aagtccattc tatattcctc gaaggacaca cctttcttgt acgaaatcac 900
cgccaagcgt ctcttgaaat ttcccctatt accttcctca ctgcacaaac ccttctgatg 960
gacctgggcc aatttcttct gttctgtcac attagttcac atcaacatga cggtatggag 1020
gcttacgtca aggtggacag ctgcccagag gaaccccaat tgcgcatgaa aaacaatgag 1080
gaagctgaag actacgacga tgatcttacg gactccgaga tggacgtggt tcgctttgat 1140
gatgataatt ctccttcttt catccaaatc cgatctgtgg caaaaaaaca tcccaagacg 1200
tgggtgcatt atatcgctgc ggaggaagaa gattgggact atgctccttt ggtgcttgca 1260
cctgatgacc gcagttataa gtcacagtac ttgaataacg gccctcaaag aatcggaaga 1320
aaatataaga aggtccgatt catggcctac accgacgaga cgttcaaaac ccgagaagct 1380
attcagcacg aaagcggaat actggggccg ctgttgtatg gtgaagtcgg agatacactt 1440
ctcataatat ttaaaaacca ggcttcacgg ccatacaaca tctatccgca tggtatcacc 1500
gacgtgcggc ccctgtatag tcggagactg cctaaggggg taaaacatct caaggatttt 1560
ccgattctcc ccggagaaat tttcaagtat aaatggacgg tgacggtcga ggatggtcct 1620
accaaatccg atccccggtg tctcacaaga tactacagca gcttcgttaa tatggaaaga 1680
gacctcgctt ccggacttat cggaccgttg ctcatatgtt acaaagagtc cgtagatcaa 1740
aggggcaacc aaattatgtc cgataagcgg aatgttatat tgttcagtgt cttcgacgag 1800
aacaggtctt ggtatttgac tgaaaacatc cagcgatttc tgccgaaccc cgcaggggta 1860
caattggagg acccggaatt ccaagctagt aatatcatgc attctatcaa tggatacgta 1920
tttgattccc ttcagcttag cgtttgtctg catgaagtcg catattggta tatccttagt 1980
attggtgctc aaactgactt cctgtctgta tttttttctg gttacacctt caagcacaag 2040
atggtctacg aggacactct tacgcttttt cccttctctg gagagacggt gtttatgagc 2100
atggaaaacc ccgggctttg gattctcggg tgccataatt cagacttccg caacaggggt 2160
atgacagcct tgttgaaggt cagctcctgc gataaaaaca ccggggatta ctatgaagac 2220
tcctacgaag acatttctgc atatctcctc tccaaaaaca acgcgatcga accaaggtct 2280
ttttcccaga actcacggca tccaagccag aatccaccag tgttgaaacg ccatcagagg 2340
gaaattacgc gaacgacctt gcaaagcgat caggaagaaa ttgattatga cgacactata 2400
agtgtagaaa tgaaaaaaga ggactttgac atctatgatg aggatgagaa ccagtcccca 2460
agaagttttc agaagaagac ccgccactat tttattgctg ccgtcgaacg gttgtgggac 2520
tacggaatga gctcctcccc gcatgtgttg cggaatcgag cccaaagtgg ctctgtgcct 2580
cagttcaaaa aggtcgtatt ccaagaattc actgatggca gcttcactca gccactgtat 2640
cggggggagt tgaacgaaca tctcggcctc ttgggcccat acatacgcgc tgaggttgaa 2700
gataacataa tggtaacttt tcgaaatcag gcatcaaggc cttattcatt ttacagctct 2760
ctcatatctt acgaagagga ccaaagacaa ggagcggaac ctcgcaagaa ttttgtaaaa 2820
cccaatgaaa cgaaaacgta tttctggaag gttcagcacc acatggcccc aacaaaggat 2880
gaatttgatt gtaaagcgtg ggcgtatttt agtgacgtcg atctcgaaaa ggatgttcat 2940
tcagggctta tcggtcccct ccttgtgtgt catacaaaca cacttaatcc ggcgcacggt 3000
agacaagtaa ctgtgcagga atttgcgttg tttttcacga tctttgatga aactaagtca 3060
tggtatttta cggagaacat ggagcggaat tgtagggcac catgtaatat acagatggaa 3120
gacccaacct ttaaagaaaa ttacagattc catgccataa acgggtacat catggatact 3180
ctcccaggac tggtaatggc tcaggaccag cgaatacgat ggtacttgct tagcatgggg 3240
agtaacgaaa acatccattc tattcatttt tcaggccatg tgttcactgt cagaaaaaag 3300
gaggagtata agatggcgct ctacaatctg taccccggtg tgtttgagac ggtagaaatg 3360
ctgccctcca aagctggtat atggagagta gagtgtttga taggagaaca tctccacgcc 3420
ggcatgtcta cgctgtttct cgtttacagc aataagtgcc agacccccct ggggatggct 3480
agtgggcaca tccgcgattt tcaaattaca gcatctgggc aatacggtca atgggcgccg 3540
aaactggcta gactgcatta ttccgggtcc attaatgctt ggtccactaa ggagcccttc 3600
agctggatca aggtagacct tctcgcgcct atgattatac acggtataaa gacccaaggt 3660
gccagacaga agtttagtag cctttacata tcacagttta ttataatgta ctccttggat 3720
ggcaagaagt ggcaaaccta taggggtaac tccacgggaa ccctgatggt ctttttcggg 3780
aacgtagact cctcaggaat aaagcacaat attttcaatc ccccaatcat agcgcgctat 3840
atacgacttc atcctacgca ttactccata cgctctacgc tgcgaatgga gctgatgggc 3900
tgcgatctga acagttgctc catgcctctg ggcatggaat ctaaagccat cagcgatgca 3960
caaattaccg ctagtagcta cttcaccaat atgtttgcca catggtcccc gtctaaggct 4020
cgcctgcatc tgcaaggccg gtccaacgca tggcgacctc aggtcaataa cccaaaggaa 4080
tggttgcagg tagactttca gaagaccatg aaggttaccg gggtaactac tcagggggta 4140
aaatcactgt tgactagcat gtacgtgaaa gaattcctca ttagcagtag tcaagatggc 4200
catcagtgga cgctgttctt tcaaaacggg aaggtcaaag ttttccaggg gaatcaggac 4260
tccttcacac ccgtcgtcaa ctcactcgat ccaccactgt tgacccggta cctgagaatc 4320
cacccacaat cctgggttca ccaaatcgca ctcaggatgg aagtactcgg gtgcgaagcg 4380
caggacctct actga 4395
<210> 5
<211> 11341
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 5
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ctgaaagcga 3000
aagggaaacc agagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 3060
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 3120
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 3180
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 3240
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 3300
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 3360
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 3420
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 3480
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 3540
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 3600
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 3660
gcagcaggaa gcactatggg cgcagcctca atgacgctga cggtacaggc cagacaatta 3720
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 3780
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 3840
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 3900
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 3960
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 4020
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 4080
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 4140
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 4200
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 4260
aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga cagagacaga 4320
tccattcgat tagtgaacgg atctcgacgg tatcggttaa cttttaaaag aaaagggggg 4380
attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact 4440
aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgatggggg aggctgctgg 4500
tgaatattaa ccaaggtcac cccagttatc ggaggagcaa acaggggcta agtccactct 4560
tgcatctaaa atgagagaca aaaaatctat aaaaatggaa aacatgcata gaaatatgtg 4620
agggaggaaa aaattacccc caagaatgtt agtgcacgca gtcacacagg gagaagacta 4680
tttttgtttt gttttgattg ttttgttttg ttttggttgt tttgttttgg tgacctaact 4740
ggtcaaatga cctattaaga atatttcata gaacgaatgt tccgatgctc taatctctct 4800
agacaaggtt catatttgta tgggttactt attctctctt tgttgactaa gtcaataatc 4860
agaatcagca ggtttgcagt cagattggca gggataagca gcctagctca ggagaagtga 4920
gtataaaagc cccaggctgg gagcagccat cacagaagtc cactcattct tggatccacc 4980
ggtgccacca tgcagatcga actgagtacc tgcttctttc tttgcttgtt gcgcttttgc 5040
ttttctgcga cgcggcggta ttacttgggg gcggttgagt tgagctggga ttatatgcaa 5100
tctgacctgg gtgaactgcc ggtcgatgcg cggttcccgc cccgggttcc caaaagtttt 5160
ccattcaaca catccgtcgt ttacaaaaaa acgcttttcg tagaattcac agatcacctg 5220
tttaacatag cgaagccacg acctccctgg atgggccttt tgggacctac gatacaagct 5280
gaagtatacg ataccgttgt tatcacactg aagaacatgg caagtcaccc tgtttccctt 5340
catgccgtgg gagtatcata ttggaaggct tctgaaggag cagaatatga tgatcaaaca 5400
agtcaaagag agaaggaaga tgacaaagtg ttccctgggg ggagtcatac gtacgtgtgg 5460
caagtattga aggaaaatgg tccgatggcg tctgacccgc tttgtcttac ctattcctac 5520
ctttctcacg tggacctggt aaaagatctg aactcaggtc tcattggcgc cctgttggtt 5580
tgtcgcgagg gttcattggc aaaagaaaag actcaaacgc ttcacaagtt tatccttctc 5640
tttgccgtct tcgatgaagg gaagtcttgg catagtgaga ctaagaactc cctcatgcaa 5700
gacagggatg ctgcatccgc gcgagcgtgg cctaagatgc acacggttaa cggctatgtg 5760
aacaggagcc tgccagggct catcggttgc cacaggaagt ccgtgtactg gcatgttata 5820
gggatgggga ctacacctga agtccattct atattcctcg aaggacacac ctttcttgta 5880
cgaaatcacc gccaagcgtc tcttgaaatt tcccctatta ccttcctcac tgcacaaacc 5940
cttctgatgg acctgggcca atttcttctg ttctgtcaca ttagttcaca tcaacatgac 6000
ggtatggagg cttacgtcaa ggtggacagc tgcccagagg aaccccaatt gcgcatgaaa 6060
aacaatgagg aagctgaaga ctacgacgat gatcttacgg actccgagat ggacgtggtt 6120
cgctttgatg atgataattc tccttctttc atccaaatcc gatctgtggc aaaaaaacat 6180
cccaagacgt gggtgcatta tatcgctgcg gaggaagaag attgggacta tgctcctttg 6240
gtgcttgcac ctgatgaccg cagttataag tcacagtact tgaataacgg ccctcaaaga 6300
atcggaagaa aatataagaa ggtccgattc atggcctaca ccgacgagac gttcaaaacc 6360
cgagaagcta ttcagcacga aagcggaata ctggggccgc tgttgtatgg tgaagtcgga 6420
gatacacttc tcataatatt taaaaaccag gcttcacggc catacaacat ctatccgcat 6480
ggtatcaccg acgtgcggcc cctgtatagt cggagactgc ctaagggggt aaaacatctc 6540
aaggattttc cgattctccc cggagaaatt ttcaagtata aatggacggt gacggtcgag 6600
gatggtccta ccaaatccga tccccggtgt ctcacaagat actacagcag cttcgttaat 6660
atggaaagag acctcgcttc cggacttatc ggaccgttgc tcatatgtta caaagagtcc 6720
gtagatcaaa ggggcaacca aattatgtcc gataagcgga atgttatatt gttcagtgtc 6780
ttcgacgaga acaggtcttg gtatttgact gaaaacatcc agcgatttct gccgaacccc 6840
gcaggggtac aattggagga cccggaattc caagctagta atatcatgca ttctatcaat 6900
ggatacgtat ttgattccct tcagcttagc gtttgtctgc atgaagtcgc atattggtat 6960
atccttagta ttggtgctca aactgacttc ctgtctgtat ttttttctgg ttacaccttc 7020
aagcacaaga tggtctacga ggacactctt acgctttttc ccttctctgg agagacggtg 7080
tttatgagca tggaaaaccc cgggctttgg attctcgggt gccataattc agacttccgc 7140
aacaggggta tgacagcctt gttgaaggtc agctcctgcg ataaaaacac cggggattac 7200
tatgaagact cctacgaaga catttctgca tatctcctct ccaaaaacaa cgcgatcgaa 7260
ccaaggtctt tttcccagaa ctcacggcat ccaagccaga atccaccagt gttgaaacgc 7320
catcagaggg aaattacgcg aacgaccttg caaagcgatc aggaagaaat tgattatgac 7380
gacactataa gtgtagaaat gaaaaaagag gactttgaca tctatgatga ggatgagaac 7440
cagtccccaa gaagttttca gaagaagacc cgccactatt ttattgctgc cgtcgaacgg 7500
ttgtgggact acggaatgag ctcctccccg catgtgttgc ggaatcgagc ccaaagtggc 7560
tctgtgcctc agttcaaaaa ggtcgtattc caagaattca ctgatggcag cttcactcag 7620
ccactgtatc ggggggagtt gaacgaacat ctcggcctct tgggcccata catacgcgct 7680
gaggttgaag ataacataat ggtaactttt cgaaatcagg catcaaggcc ttattcattt 7740
tacagctctc tcatatctta cgaagaggac caaagacaag gagcggaacc tcgcaagaat 7800
tttgtaaaac ccaatgaaac gaaaacgtat ttctggaagg ttcagcacca catggcccca 7860
acaaaggatg aatttgattg taaagcgtgg gcgtatttta gtgacgtcga tctcgaaaag 7920
gatgttcatt cagggcttat cggtcccctc cttgtgtgtc atacaaacac acttaatccg 7980
gcgcacggta gacaagtaac tgtgcaggaa tttgcgttgt ttttcacgat ctttgatgaa 8040
actaagtcat ggtattttac ggagaacatg gagcggaatt gtagggcacc atgtaatata 8100
cagatggaag acccaacctt taaagaaaat tacagattcc atgccataaa cgggtacatc 8160
atggatactc tcccaggact ggtaatggct caggaccagc gaatacgatg gtacttgctt 8220
agcatgggga gtaacgaaaa catccattct attcattttt caggccatgt gttcactgtc 8280
agaaaaaagg aggagtataa gatggcgctc tacaatctgt accccggtgt gtttgagacg 8340
gtagaaatgc tgccctccaa agctggtata tggagagtag agtgtttgat aggagaacat 8400
ctccacgccg gcatgtctac gctgtttctc gtttacagca ataagtgcca gacccccctg 8460
gggatggcta gtgggcacat ccgcgatttt caaattacag catctgggca atacggtcaa 8520
tgggcgccga aactggctag actgcattat tccgggtcca ttaatgcttg gtccactaag 8580
gagcccttca gctggatcaa ggtagacctt ctcgcgccta tgattataca cggtataaag 8640
acccaaggtg ccagacagaa gtttagtagc ctttacatat cacagtttat tataatgtac 8700
tccttggatg gcaagaagtg gcaaacctat aggggtaact ccacgggaac cctgatggtc 8760
tttttcggga acgtagactc ctcaggaata aagcacaata ttttcaatcc cccaatcata 8820
gcgcgctata tacgacttca tcctacgcat tactccatac gctctacgct gcgaatggag 8880
ctgatgggct gcgatctgaa cagttgctcc atgcctctgg gcatggaatc taaagccatc 8940
agcgatgcac aaattaccgc tagtagctac ttcaccaata tgtttgccac atggtccccg 9000
tctaaggctc gcctgcatct gcaaggccgg tccaacgcat ggcgacctca ggtcaataac 9060
ccaaaggaat ggttgcaggt agactttcag aagaccatga aggttaccgg ggtaactact 9120
cagggggtaa aatcactgtt gactagcatg tacgtgaaag aattcctcat tagcagtagt 9180
caagatggcc atcagtggac gctgttcttt caaaacggga aggtcaaagt tttccagggg 9240
aatcaggact ccttcacacc cgtcgtcaac tcactcgatc caccactgtt gacccggtac 9300
ctgagaatcc acccacaatc ctgggttcac caaatcgcac tcaggatgga agtactcggg 9360
tgcgaagcgc aggacctcta ctgagcggcc gcgtttaaac gtcgacaatc aacctctgga 9420
ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 9480
tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 9540
ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 9600
gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 9660
caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 9720
actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 9780
ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg ctgctcgcct gtgttgccac 9840
ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 9900
tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 9960
gacgagtcgg atctcccttt gggccgcctc cccgcctgga attcgagctc ggtaccttta 10020
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 10080
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 10140
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 10200
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 10260
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtagt 10320
agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat atcagagagt 10380
gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 10440
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 10500
gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccagt tccgcccatt 10560
ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 10620
ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcgtcgaga 10680
cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 10740
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 10800
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 10860
cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 10920
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 10980
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 11040
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta 11100
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 11160
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 11220
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 11280
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc 11340
c 11341
<210> 6
<211> 4395
<212> DNA
<213> Artificial Sequence
<400> 6
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgggg tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctcccaga attcaagaca ccctagccaa aacccaccag tcttgaaacg ccatcaacgg 2340
gaaataactc gtactactct tcagtcagat caagaggaaa ttgactatga tgataccata 2400
tcagttgaaa tgaagaagga agattttgac atttatgatg aggatgaaaa tcagagcccc 2460
cgcagctttc aaaagaaaac acgacactat tttattgctg cagtggagag gctctgggat 2520
tatgggatga gtagctcccc acatgttcta agaaacaggg ctcagagtgg cagtgtccct 2580
cagttcaaga aagttgtttt ccaggaattt actgatggct cctttactca gcccttatac 2640
cgtggagaac taaatgaaca tttgggactc ctggggccat atataagagc agaagttgaa 2700
gataatatca tggtaacttt cagaaatcag gcctctcgtc cctattcctt ctattctagc 2760
cttatttctt atgaggaaga tcagaggcaa ggagcagaac ctagaaaaaa ctttgtcaag 2820
cctaatgaaa ccaaaactta cttttggaaa gtgcaacatc atatggcacc cactaaagat 2880
gagtttgact gcaaagcctg ggcttatttc tctgatgttg acctggaaaa agatgtgcac 2940
tcaggcctga ttggacccct tctggtctgc cacactaaca cactgaaccc tgctcatggg 3000
agacaagtga cagtacagga atttgctctg tttttcacca tctttgatga gaccaaaagc 3060
tggtacttca ctgaaaatat ggaaagaaac tgcagggctc cctgcaatat ccagatggaa 3120
gatcccactt ttaaagagaa ttatcgcttc catgcaatca atggctacat aatggataca 3180
ctacctggct tagtaatggc tcaggatcaa aggattcgat ggtatctgct cagcatgggc 3240
agcaatgaaa acatccattc tattcatttc agtggacatg tgttcactgt acgaaaaaaa 3300
gaggagtata aaatggcact gtacaatctc tatccaggtg tttttgagac agtggaaatg 3360
ttaccatcca aagctggaat ttggcgggtg gaatgcctta ttggcgagca tctacatgct 3420
gggatgagca cactttttct ggtgtacagc aataagtgtc agactcccct gggaatggct 3480
tctggacaca ttagagattt tcagattaca gcttcaggac aatatggaca gtgggcccca 3540
aagctggcca gacttcatta ttccggatca atcaatgcct ggagcaccaa ggagcccttt 3600
tcttggatca aggtggatct gttggcacca atgattattc acggcatcaa gacccagggt 3660
gcccgtcaga agttctccag cctctacatc tctcagttta tcatcatgta tagtcttgat 3720
gggaagaagt ggcagactta tcgaggaaat tccactggaa ccttaatggt cttctttggc 3780
aatgtggatt catctgggat aaaacacaat atttttaacc ctccaattat tgctcgatac 3840
atccgtttgc acccaactca ttatagcatt cgcagcactc ttcgcatgga gttgatgggc 3900
tgtgatttaa atagttgcag catgccattg ggaatggaga gtaaagcaat atcagatgca 3960
cagattactg cttcatccta ctttaccaat atgtttgcca cctggtctcc ttcaaaagct 4020
cgacttcacc tccaagggag gagtaatgcc tggagacctc aggtgaataa tccaaaagag 4080
tggctgcaag tggacttcca gaagacaatg aaagtcacag gagtaactac tcagggagta 4140
aaatctctgc ttaccagcat gtatgtgaag gagttcctca tctccagcag tcaagatggc 4200
catcagtgga ctctcttttt tcagaatggc aaagtaaagg tttttcaggg aaatcaagac 4260
tccttcacac ctgtggtgaa ctctctagac ccaccgttac tgactcgcta ccttcgaatt 4320
cacccccaga gttgggtgca ccagattgcc ctgaggatgg aggttctggg ctgcgaggca 4380
caggacctct actga 4395
<210> 7
<211> 11341
<212> DNA
<213> Artificial Sequence
<400> 7
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ctgaaagcga 3000
aagggaaacc agagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 3060
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 3120
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 3180
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 3240
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 3300
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 3360
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 3420
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 3480
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 3540
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 3600
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 3660
gcagcaggaa gcactatggg cgcagcctca atgacgctga cggtacaggc cagacaatta 3720
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 3780
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 3840
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 3900
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 3960
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 4020
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 4080
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 4140
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 4200
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 4260
aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga cagagacaga 4320
tccattcgat tagtgaacgg atctcgacgg tatcggttaa cttttaaaag aaaagggggg 4380
attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact 4440
aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgatggggg aggctgctgg 4500
tgaatattaa ccaaggtcac cccagttatc ggaggagcaa acaggggcta agtccactct 4560
tgcatctaaa atgagagaca aaaaatctat aaaaatggaa aacatgcata gaaatatgtg 4620
agggaggaaa aaattacccc caagaatgtt agtgcacgca gtcacacagg gagaagacta 4680
tttttgtttt gttttgattg ttttgttttg ttttggttgt tttgttttgg tgacctaact 4740
ggtcaaatga cctattaaga atatttcata gaacgaatgt tccgatgctc taatctctct 4800
agacaaggtt catatttgta tgggttactt attctctctt tgttgactaa gtcaataatc 4860
agaatcagca ggtttgcagt cagattggca gggataagca gcctagctca ggagaagtga 4920
gtataaaagc cccaggctgg gagcagccat cacagaagtc cactcattct tggatccacc 4980
ggtgccacca tgcaaataga gctctccacc tgcttctttc tgtgcctttt gcgattctgc 5040
tttagtgcca ccagaagata ctacctgggt gcagtggaac tgtcatggga ctatatgcaa 5100
agtgatctcg gtgagctgcc tgtggacgca agatttcctc ctagagtgcc aaaatctttt 5160
ccattcaaca cctcagtcgt gtacaaaaag actctgtttg tagaattcac ggatcacctt 5220
ttcaacatcg ctaagccaag gccaccctgg atgggtctgc taggtcctac catccaggct 5280
gaggtttatg atacagtggt cattacactt aagaacatgg cttcccatcc tgtcagtctt 5340
catgctgttg gtgtatccta ctggaaagct tctgagggag ctgaatatga tgatcagacc 5400
agtcaaaggg agaaagaaga tgataaagtc ttccctggtg gaagccatac atatgtctgg 5460
caggtcctga aagagaatgg tccaatggcc tctgacccac tgtgccttac ctactcatat 5520
ctttctcatg tggacctggt aaaagacttg aattcaggcc tcattggagc cctactagta 5580
tgtagagaag ggagtctggc caaggaaaag acacagacct tgcacaaatt tatactactt 5640
tttgctgtat ttgatgaagg gaaaagttgg cactcagaaa caaagaactc cttgatgcag 5700
gatagggatg ctgcatctgc tcgggcctgg cctaaaatgc acacagtcaa tggttatgta 5760
aacaggtctc tgccaggtct gattggatgc cacaggaaat cagtctattg gcatgtgatt 5820
ggaatgggca ccactcctga agtgcactca atattcctcg aaggtcacac atttcttgtg 5880
aggaaccatc gccaggcgtc cttggaaatc tcgccaataa ctttccttac tgctcaaaca 5940
ctcttgatgg accttggaca gtttctactg ttttgtcata tctcttccca ccaacatgat 6000
ggcatggaag cttatgtcaa agtagacagc tgtccagagg aaccccaact acgaatgaaa 6060
aataatgaag aagcggaaga ctatgatgat gatcttactg attctgaaat ggatgtggtc 6120
aggtttgatg atgacaactc tccttccttt atccaaattc gctcagttgc caagaagcat 6180
cctaaaactt gggtacatta cattgctgct gaagaggagg actgggacta tgctccctta 6240
gtcctcgccc ccgatgacag aagttataaa agtcaatatt tgaacaatgg ccctcagcgg 6300
attggtagga agtacaaaaa agtccgattt atggcataca cagatgaaac ctttaagact 6360
cgtgaagcta ttcagcatga atcaggaatc ttgggacctt tactttatgg ggaagttgga 6420
gacacactgt tgattatatt taagaatcaa gcaagcagac catataacat ctaccctcac 6480
ggaatcactg atgtccgtcc tttgtattca aggagattac caaaaggtgt aaaacatttg 6540
aaggattttc caattctgcc aggagaaata ttcaaatata aatggacagt gactgtagaa 6600
gatgggccaa ctaaatcaga tcctcggtgc ctgacccgct attactctag tttcgttaat 6660
atggagagag atctagcttc aggactcatt ggccctctcc tcatctgcta caaagaatct 6720
gtagatcaaa gaggaaacca gataatgtca gacaagagga atgtcatcct gttttctgta 6780
tttgatgaga accgaagctg gtacctcaca gagaatatac aacgctttct ccccaatcca 6840
gctggagtgc agcttgagga tccagagttc caagcctcca acatcatgca cagcatcaat 6900
ggctatgttt ttgatagttt gcagttgtca gtttgtttgc atgaggtggc atactggtac 6960
attctaagca ttggagcaca gactgacttc ctttctgtct tcttctctgg atataccttc 7020
aaacacaaaa tggtctatga agacacactc accctattcc cattctcagg agaaactgtc 7080
ttcatgtcga tggaaaaccc aggtctatgg attctggggt gccacaactc agactttcgg 7140
aacagaggca tgaccgcctt actgaaggtt tctagttgtg acaagaacac tggtgattat 7200
tacgaggaca gttatgaaga tatttcagca tacttgctga gtaaaaacaa tgccattgaa 7260
ccaagaagct tctcccagaa ttcaagacac cctagccaaa acccaccagt cttgaaacgc 7320
catcaacggg aaataactcg tactactctt cagtcagatc aagaggaaat tgactatgat 7380
gataccatat cagttgaaat gaagaaggaa gattttgaca tttatgatga ggatgaaaat 7440
cagagccccc gcagctttca aaagaaaaca cgacactatt ttattgctgc agtggagagg 7500
ctctgggatt atgggatgag tagctcccca catgttctaa gaaacagggc tcagagtggc 7560
agtgtccctc agttcaagaa agttgttttc caggaattta ctgatggctc ctttactcag 7620
cccttatacc gtggagaact aaatgaacat ttgggactcc tggggccata tataagagca 7680
gaagttgaag ataatatcat ggtaactttc agaaatcagg cctctcgtcc ctattccttc 7740
tattctagcc ttatttctta tgaggaagat cagaggcaag gagcagaacc tagaaaaaac 7800
tttgtcaagc ctaatgaaac caaaacttac ttttggaaag tgcaacatca tatggcaccc 7860
actaaagatg agtttgactg caaagcctgg gcttatttct ctgatgttga cctggaaaaa 7920
gatgtgcact caggcctgat tggacccctt ctggtctgcc acactaacac actgaaccct 7980
gctcatggga gacaagtgac agtacaggaa tttgctctgt ttttcaccat ctttgatgag 8040
accaaaagct ggtacttcac tgaaaatatg gaaagaaact gcagggctcc ctgcaatatc 8100
cagatggaag atcccacttt taaagagaat tatcgcttcc atgcaatcaa tggctacata 8160
atggatacac tacctggctt agtaatggct caggatcaaa ggattcgatg gtatctgctc 8220
agcatgggca gcaatgaaaa catccattct attcatttca gtggacatgt gttcactgta 8280
cgaaaaaaag aggagtataa aatggcactg tacaatctct atccaggtgt ttttgagaca 8340
gtggaaatgt taccatccaa agctggaatt tggcgggtgg aatgccttat tggcgagcat 8400
ctacatgctg ggatgagcac actttttctg gtgtacagca ataagtgtca gactcccctg 8460
ggaatggctt ctggacacat tagagatttt cagattacag cttcaggaca atatggacag 8520
tgggccccaa agctggccag acttcattat tccggatcaa tcaatgcctg gagcaccaag 8580
gagccctttt cttggatcaa ggtggatctg ttggcaccaa tgattattca cggcatcaag 8640
acccagggtg cccgtcagaa gttctccagc ctctacatct ctcagtttat catcatgtat 8700
agtcttgatg ggaagaagtg gcagacttat cgaggaaatt ccactggaac cttaatggtc 8760
ttctttggca atgtggattc atctgggata aaacacaata tttttaaccc tccaattatt 8820
gctcgataca tccgtttgca cccaactcat tatagcattc gcagcactct tcgcatggag 8880
ttgatgggct gtgatttaaa tagttgcagc atgccattgg gaatggagag taaagcaata 8940
tcagatgcac agattactgc ttcatcctac tttaccaata tgtttgccac ctggtctcct 9000
tcaaaagctc gacttcacct ccaagggagg agtaatgcct ggagacctca ggtgaataat 9060
ccaaaagagt ggctgcaagt ggacttccag aagacaatga aagtcacagg agtaactact 9120
cagggagtaa aatctctgct taccagcatg tatgtgaagg agttcctcat ctccagcagt 9180
caagatggcc atcagtggac tctctttttt cagaatggca aagtaaaggt ttttcaggga 9240
aatcaagact ccttcacacc tgtggtgaac tctctagacc caccgttact gactcgctac 9300
cttcgaattc acccccagag ttgggtgcac cagattgccc tgaggatgga ggttctgggc 9360
tgcgaggcac aggacctcta ctgagcggcc gcgtttaaac gtcgacaatc aacctctgga 9420
ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 9480
tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 9540
ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 9600
gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 9660
caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 9720
actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 9780
ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg ctgctcgcct gtgttgccac 9840
ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 9900
tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 9960
gacgagtcgg atctcccttt gggccgcctc cccgcctgga attcgagctc ggtaccttta 10020
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 10080
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 10140
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 10200
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 10260
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtagt 10320
agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat atcagagagt 10380
gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 10440
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 10500
gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccagt tccgcccatt 10560
ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 10620
ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcgtcgaga 10680
cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 10740
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 10800
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 10860
cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 10920
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 10980
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 11040
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta 11100
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 11160
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 11220
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 11280
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc 11340
c 11341
<210> 8
<211> 4392
<212> DNA
<213> Artificial Sequence
<400> 8
atgcagattg agctgtccac ctgtttcttc ctgtgcctgc tgagattttg cttcagtgct 60
acaaggagat actacctggg ggctgtcgag ctgtcttggg attacatgca gtctgatctg 120
ggcgaactgc cagtggacgc gcggtttcct ccaagggtgc caaagtcctt cccctttaat 180
acatctgtgg tgtacaagaa gaccctgttt gtggagttta ccgaccacct gtttaacatc 240
gcgaagccta gaccaccctg gatgggcctg ctgggcccca caatccaggc cgaagtgtat 300
gacacagtgg taatcacact gaagaacatg gccagccacc cagtgtccct gcacgcggtg 360
ggcgtatcct actggaaggc cagcgaaggc gcggagtatg atgaccagac atcccagaga 420
gagaaggagg atgacaaggt gtttccaggc gggtcccaca cctatgtatg gcaggtgctg 480
aaggagaatg gccccatggc ctccgacccc ctgtgcctga catacagcta tctgtcccac 540
gtagacctgg tgaaggatct gaattccggg ctgatcgggg ccctgctggt gtgcagggag 600
ggctccctgg ccaaggagaa gacccagacc ctgcacaagt ttatcctgct gttcgcggtg 660
tttgatgagg gcaagtcctg gcacagcgaa acaaagaact ctctgatgca ggacagggac 720
gcggccagcg cgcgggcctg gccaaagatg cacaccgtaa atggctatgt gaacaggtcc 780
ctgccaggcc tgatcgggtg ccacagaaag tctgtatatt ggcacgtaat cgggatgggc 840
accacaccag aggtgcactc catctttctg gagggccaca ccttcctggt gagaaaccac 900
aggcaggcca gcctggagat cagccccatc acattcctga cagcccagac cctgctgatg 960
gacctgggcc agtttctgct gttttgtcac atcagctccc accagcacga tggcatggag 1020
gcctatgtga aggtggatag ctgccctgag gagccacagc tgaggatgaa gaataatgag 1080
gaggccgaag attatgatga tgacctgacc gacagcgaaa tggatgtggt gaggtttgac 1140
gatgacaact ccccatcctt catccagatc cgctccgtag ccaagaagca ccctaagaca 1200
tgggtgcact atatcgcggc cgaagaggag gactgggatt atgccccact ggtgctggcc 1260
cctgacgatc ggagctacaa gtcccagtat ctgaataatg gcccccagag aatcgggcgg 1320
aagtacaaga aggtgagatt catggcctat accgatgaga cattcaagac cagggaggcc 1380
atccagcacg aatctggcat cctgggccct ctgctgtatg gcgaagtggg cgacacactg 1440
ctgatcatct tcaagaacca ggccagcaga ccatacaaca tctacccaca cgggatcaca 1500
gacgtacggc ctctgtacag ccgccgcctg ccaaagggcg taaagcacct gaaggatttc 1560
cctatcctgc ccggggagat ctttaagtat aagtggacag tgacagtgga ggatggccca 1620
accaagtccg atccaaggtg cctgaccaga tactacagct cctttgtgaa catggagaga 1680
gatctggcca gcgggctgat cgggccactg ctgatctgct acaaggagag cgtagatcag 1740
agaggcaatc agatcatgtc cgacaagagg aatgtgatcc tgttctctgt attcgatgag 1800
aatagatctt ggtacctgac agagaacatc cagagatttc tgccaaatcc agccggggtg 1860
cagctggagg acccagagtt tcaggcctct aacatcatgc actccatcaa tggctatgta 1920
tttgactctc tgcagctgag cgtatgtctg cacgaagtgg cctactggta catcctgtcc 1980
atcggggccc agacagactt cctgtctgta ttcttttctg gctacacatt caagcacaag 2040
atggtgtatg aggacaccct gaccctgttt cctttctccg gggagaccgt attcatgtct 2100
atggagaatc ctggcctgtg gatactgggc tgtcacaatt ctgactttag aaatagaggc 2160
atgacagccc tgctgaaggt gtcctcctgt gacaagaata caggcgacta ctatgaggat 2220
tcctatgagg atatcagcgc gtatctgctg agcaagaata atgccatcga acctaggtct 2280
ttttctcaga actccagaca cccctcccag aaccctcctg tgctgaagag acaccagaga 2340
gagatcacaa ggaccaccct gcagtctgac caggaggaga tcgactatga tgatacaatc 2400
tctgtggaga tgaagaagga ggattttgat atctatgacg aagatgagaa tcagagccca 2460
agatccttcc agaagaagac ccgccactat ttcatcgcgg ccgtagagag actgtgggac 2520
tatggcatgt cttcctctcc ccacgtactg aggaacagag cccagtctgg ctctgtgcct 2580
cagtttaaga aggtggtgtt ccaggagttc acagacggga gctttaccca gccactgtac 2640
agaggcgaac tgaatgagca cctgggcctg ctgggcccat acatcagggc cgaagtggag 2700
gataacatca tggtgacctt cagaaatcag gccagcaggc catactcctt ttactcctcc 2760
ctgatctctt atgaggagga tcagagacag ggcgcggagc caagaaagaa ttttgtgaag 2820
ccaaatgaga caaagacata cttttggaag gtgcagcacc acatggcccc aaccaaggac 2880
gaatttgatt gcaaggcctg ggcctatttt tccgacgtgg atctggagaa ggatgtgcac 2940
tctggcctga tcgggcccct gctggtgtgc cacacaaaca ccctgaatcc tgcccacggg 3000
cgccaggtga cagtgcagga gtttgccctg ttctttacca tctttgacga aacaaagtcc 3060
tggtacttca cagagaacat ggagagaaat tgcagggccc catgcaatat ccagatggag 3120
gaccccacct tcaaggagaa ttatagattt cacgcgatca atggctacat catggatacc 3180
ctgccaggcc tggtaatggc ccaggatcag agaatcagat ggtacctgct gtctatgggc 3240
tctaatgaga acatccactc catccacttc tccgggcacg tattcaccgt aaggaagaag 3300
gaggagtaca agatggccct gtacaatctg tatccaggcg tatttgagac agtggagatg 3360
ctgcccagca aggccgggat ctggagagtg gagtgtctga tcggggagca cctgcacgcg 3420
ggcatgtcca cactgtttct ggtgtactcc aataagtgcc agacccctct gggcatggcc 3480
agcgggcaca tccgggactt ccagatcacc gcgtctggcc agtatggcca gtgggccccc 3540
aagctggccc gcctgcacta tagcgggtct atcaacgcgt ggtccaccaa ggagcccttt 3600
tcctggatca aggtggacct gctggcccca atgatcatcc acgggatcaa gacccagggc 3660
gcgcgccaga agttttcttc tctgtacatc tcccagttta tcatcatgta ctccctggat 3720
ggcaagaagt ggcagacata tcgcggcaac tccacaggca ccctgatggt gttctttggc 3780
aatgtggact cctccgggat caagcacaac atctttaatc caccaatcat cgcgcggtat 3840
atcaggctgc accccaccca ctactctatc agaagcacac tgaggatgga gctgatgggc 3900
tgtgacctga attcctgttc catgcctctg ggcatggaga gcaaggccat ctctgatgcc 3960
cagatcacag ccagctctta tttcacaaac atgtttgcca cctggtcccc atctaaggcc 4020
agactgcacc tgcagggcag atccaatgcc tggagaccac aggtgaataa tcccaaggag 4080
tggctgcagg tggacttcca gaagacaatg aaggtgaccg gggtgaccac ccagggcgta 4140
aagtccctgc tgacatccat gtatgtgaag gagttcctga tcagctcttc ccaggatggc 4200
caccagtgga ccctgttctt tcagaatggc aaggtgaagg tgtttcaggg caatcaggat 4260
tcctttaccc ctgtggtgaa ttccctggac ccacctctgc tgaccagata tctgagaatc 4320
caccctcaga gctgggtcca ccagattgcc ctgagaatgg aagtgctggg atgtgaagct 4380
caggatctgt at 4392
<210> 9
<211> 11341
<212> DNA
<213> Artificial Sequence
<400> 9
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ctgaaagcga 3000
aagggaaacc agagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 3060
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 3120
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 3180
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 3240
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 3300
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 3360
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 3420
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 3480
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 3540
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 3600
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 3660
gcagcaggaa gcactatggg cgcagcctca atgacgctga cggtacaggc cagacaatta 3720
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 3780
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 3840
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 3900
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 3960
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 4020
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 4080
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 4140
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 4200
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 4260
aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga cagagacaga 4320
tccattcgat tagtgaacgg atctcgacgg tatcggttaa cttttaaaag aaaagggggg 4380
attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact 4440
aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgatggggg aggctgctgg 4500
tgaatattaa ccaaggtcac cccagttatc ggaggagcaa acaggggcta agtccactct 4560
tgcatctaaa atgagagaca aaaaatctat aaaaatggaa aacatgcata gaaatatgtg 4620
agggaggaaa aaattacccc caagaatgtt agtgcacgca gtcacacagg gagaagacta 4680
tttttgtttt gttttgattg ttttgttttg ttttggttgt tttgttttgg tgacctaact 4740
ggtcaaatga cctattaaga atatttcata gaacgaatgt tccgatgctc taatctctct 4800
agacaaggtt catatttgta tgggttactt attctctctt tgttgactaa gtcaataatc 4860
agaatcagca ggtttgcagt cagattggca gggataagca gcctagctca ggagaagtga 4920
gtataaaagc cccaggctgg gagcagccat cacagaagtc cactcattct tggatccacc 4980
ggtgccacca tgcagattga gctgtccacc tgtttcttcc tgtgcctgct gagattttgc 5040
ttcagtgcta caaggagata ctacctgggg gctgtcgagc tgtcttggga ttacatgcag 5100
tctgatctgg gcgaactgcc agtggacgcg cggtttcctc caagggtgcc aaagtccttc 5160
ccctttaata catctgtggt gtacaagaag accctgtttg tggagtttac cgaccacctg 5220
tttaacatcg cgaagcctag accaccctgg atgggcctgc tgggccccac aatccaggcc 5280
gaagtgtatg acacagtggt aatcacactg aagaacatgg ccagccaccc agtgtccctg 5340
cacgcggtgg gcgtatccta ctggaaggcc agcgaaggcg cggagtatga tgaccagaca 5400
tcccagagag agaaggagga tgacaaggtg tttccaggcg ggtcccacac ctatgtatgg 5460
caggtgctga aggagaatgg ccccatggcc tccgaccccc tgtgcctgac atacagctat 5520
ctgtcccacg tagacctggt gaaggatctg aattccgggc tgatcggggc cctgctggtg 5580
tgcagggagg gctccctggc caaggagaag acccagaccc tgcacaagtt tatcctgctg 5640
ttcgcggtgt ttgatgaggg caagtcctgg cacagcgaaa caaagaactc tctgatgcag 5700
gacagggacg cggccagcgc gcgggcctgg ccaaagatgc acaccgtaaa tggctatgtg 5760
aacaggtccc tgccaggcct gatcgggtgc cacagaaagt ctgtatattg gcacgtaatc 5820
gggatgggca ccacaccaga ggtgcactcc atctttctgg agggccacac cttcctggtg 5880
agaaaccaca ggcaggccag cctggagatc agccccatca cattcctgac agcccagacc 5940
ctgctgatgg acctgggcca gtttctgctg ttttgtcaca tcagctccca ccagcacgat 6000
ggcatggagg cctatgtgaa ggtggatagc tgccctgagg agccacagct gaggatgaag 6060
aataatgagg aggccgaaga ttatgatgat gacctgaccg acagcgaaat ggatgtggtg 6120
aggtttgacg atgacaactc cccatccttc atccagatcc gctccgtagc caagaagcac 6180
cctaagacat gggtgcacta tatcgcggcc gaagaggagg actgggatta tgccccactg 6240
gtgctggccc ctgacgatcg gagctacaag tcccagtatc tgaataatgg cccccagaga 6300
atcgggcgga agtacaagaa ggtgagattc atggcctata ccgatgagac attcaagacc 6360
agggaggcca tccagcacga atctggcatc ctgggccctc tgctgtatgg cgaagtgggc 6420
gacacactgc tgatcatctt caagaaccag gccagcagac catacaacat ctacccacac 6480
gggatcacag acgtacggcc tctgtacagc cgccgcctgc caaagggcgt aaagcacctg 6540
aaggatttcc ctatcctgcc cggggagatc tttaagtata agtggacagt gacagtggag 6600
gatggcccaa ccaagtccga tccaaggtgc ctgaccagat actacagctc ctttgtgaac 6660
atggagagag atctggccag cgggctgatc gggccactgc tgatctgcta caaggagagc 6720
gtagatcaga gaggcaatca gatcatgtcc gacaagagga atgtgatcct gttctctgta 6780
ttcgatgaga atagatcttg gtacctgaca gagaacatcc agagatttct gccaaatcca 6840
gccggggtgc agctggagga cccagagttt caggcctcta acatcatgca ctccatcaat 6900
ggctatgtat ttgactctct gcagctgagc gtatgtctgc acgaagtggc ctactggtac 6960
atcctgtcca tcggggccca gacagacttc ctgtctgtat tcttttctgg ctacacattc 7020
aagcacaaga tggtgtatga ggacaccctg accctgtttc ctttctccgg ggagaccgta 7080
ttcatgtcta tggagaatcc tggcctgtgg atactgggct gtcacaattc tgactttaga 7140
aatagaggca tgacagccct gctgaaggtg tcctcctgtg acaagaatac aggcgactac 7200
tatgaggatt cctatgagga tatcagcgcg tatctgctga gcaagaataa tgccatcgaa 7260
cctaggtctt tttctcagaa ctccagacac ccctcccaga accctcctgt gctgaagaga 7320
caccagagag agatcacaag gaccaccctg cagtctgacc aggaggagat cgactatgat 7380
gatacaatct ctgtggagat gaagaaggag gattttgata tctatgacga agatgagaat 7440
cagagcccaa gatccttcca gaagaagacc cgccactatt tcatcgcggc cgtagagaga 7500
ctgtgggact atggcatgtc ttcctctccc cacgtactga ggaacagagc ccagtctggc 7560
tctgtgcctc agtttaagaa ggtggtgttc caggagttca cagacgggag ctttacccag 7620
ccactgtaca gaggcgaact gaatgagcac ctgggcctgc tgggcccata catcagggcc 7680
gaagtggagg ataacatcat ggtgaccttc agaaatcagg ccagcaggcc atactccttt 7740
tactcctccc tgatctctta tgaggaggat cagagacagg gcgcggagcc aagaaagaat 7800
tttgtgaagc caaatgagac aaagacatac ttttggaagg tgcagcacca catggcccca 7860
accaaggacg aatttgattg caaggcctgg gcctattttt ccgacgtgga tctggagaag 7920
gatgtgcact ctggcctgat cgggcccctg ctggtgtgcc acacaaacac cctgaatcct 7980
gcccacgggc gccaggtgac agtgcaggag tttgccctgt tctttaccat ctttgacgaa 8040
acaaagtcct ggtacttcac agagaacatg gagagaaatt gcagggcccc atgcaatatc 8100
cagatggagg accccacctt caaggagaat tatagatttc acgcgatcaa tggctacatc 8160
atggataccc tgccaggcct ggtaatggcc caggatcaga gaatcagatg gtacctgctg 8220
tctatgggct ctaatgagaa catccactcc atccacttct ccgggcacgt attcaccgta 8280
aggaagaagg aggagtacaa gatggccctg tacaatctgt atccaggcgt atttgagaca 8340
gtggagatgc tgcccagcaa ggccgggatc tggagagtgg agtgtctgat cggggagcac 8400
ctgcacgcgg gcatgtccac actgtttctg gtgtactcca ataagtgcca gacccctctg 8460
ggcatggcca gcgggcacat ccgggacttc cagatcaccg cgtctggcca gtatggccag 8520
tgggccccca agctggcccg cctgcactat agcgggtcta tcaacgcgtg gtccaccaag 8580
gagccctttt cctggatcaa ggtggacctg ctggccccaa tgatcatcca cgggatcaag 8640
acccagggcg cgcgccagaa gttttcttct ctgtacatct cccagtttat catcatgtac 8700
tccctggatg gcaagaagtg gcagacatat cgcggcaact ccacaggcac cctgatggtg 8760
ttctttggca atgtggactc ctccgggatc aagcacaaca tctttaatcc accaatcatc 8820
gcgcggtata tcaggctgca ccccacccac tactctatca gaagcacact gaggatggag 8880
ctgatgggct gtgacctgaa ttcctgttcc atgcctctgg gcatggagag caaggccatc 8940
tctgatgccc agatcacagc cagctcttat ttcacaaaca tgtttgccac ctggtcccca 9000
tctaaggcca gactgcacct gcagggcaga tccaatgcct ggagaccaca ggtgaataat 9060
cccaaggagt ggctgcaggt ggacttccag aagacaatga aggtgaccgg ggtgaccacc 9120
cagggcgtaa agtccctgct gacatccatg tatgtgaagg agttcctgat cagctcttcc 9180
caggatggcc accagtggac cctgttcttt cagaatggca aggtgaaggt gtttcagggc 9240
aatcaggatt cctttacccc tgtggtgaat tccctggacc cacctctgct gaccagatat 9300
ctgagaatcc accctcagag ctgggtccac cagattgccc tgagaatgga agtgctggga 9360
tgtgaagctc aggatctgta ttgagcggcc gcgtttaaac gtcgacaatc aacctctgga 9420
ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 9480
tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 9540
ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 9600
gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 9660
caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 9720
actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 9780
ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg ctgctcgcct gtgttgccac 9840
ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 9900
tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 9960
gacgagtcgg atctcccttt gggccgcctc cccgcctgga attcgagctc ggtaccttta 10020
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 10080
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 10140
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 10200
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 10260
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtagt 10320
agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat atcagagagt 10380
gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 10440
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 10500
gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccagt tccgcccatt 10560
ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 10620
ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcgtcgaga 10680
cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 10740
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 10800
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 10860
cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 10920
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 10980
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 11040
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta 11100
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 11160
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 11220
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 11280
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc 11340
c 11341
<210> 10
<211> 4392
<212> DNA
<213> Artificial Sequence
<400> 10
atgcagatcg agctgtccac ctgtttcttc ctgtgcctgc tgcgcttctg tttctccgcc 60
acccgacgat actacctggg ggccgtggag ctgagctggg actacatgca gagcgacctg 120
ggcgagctgc ccgtggacgc ccggttcccc cccagggtgc ccaagagctt ccccttcaac 180
accagcgtgg tgtacaagaa gaccctgttc gtggagttca ccgaccacct gttcaacatc 240
gccaagccca ggcccccctg gatgggcctg ctgggcccca ccatccaggc cgaggtgtac 300
gacaccgtgg tcatcaccct gaagaacatg gccagccacc ccgtgagcct gcacgccgtg 360
ggcgtgagct actggaaggc cagcgagggc gccgagtacg acgaccagac cagccagagg 420
gagaaggagg acgacaaggt gttccccggc ggcagccaca cctacgtgtg gcaggtgctg 480
aaggagaacg gccccatggc cagcgacccc ctgtgcctga cctacagcta cctgagccac 540
gtggacctgg tgaaggacct gaacagcggc ctgatcggcg ccctgctggt gtgcagggag 600
ggcagcctgg ccaaggagaa gacccagacc ctgcacaagt tcatcctgct gttcgccgtg 660
ttcgacgagg gcaagagctg gcacagcgag acgaagaaca gcctgatgca ggacagggac 720
gccgccagcg ccagggcctg gcccaagatg cacaccgtga acggctacgt gaaccggagc 780
ctgcccggcc tgatcggctg ccacaggaag agcgtgtact ggcacgtgat cggcatgggc 840
accacccccg aggtgcacag catcttcctg gagggccaca ccttcctggt gcggaaccac 900
aggcaggcca gcctggagat cagccccatc accttcctga ccgcccagac cctgctgatg 960
gacctgggcc agttcctgct gttctgccac atcagcagcc accagcacga cggcatggag 1020
gcctacgtga aggtggacag ctgccccgag gagccccagc tgcggatgaa gaacaacgag 1080
gaggccgagg actacgacga cgacctgacc gacagcgaga tggacgtggt gcggttcgac 1140
gacgacaaca gccccagctt catccagatc aggagcgtgg ccaagaagca ccccaagacc 1200
tgggtgcact acatcgccgc cgaggaggag gactgggact acgcccccct ggtgctggcc 1260
cccgacgaca ggagctacaa gagccagtac ctgaacaacg gcccccagcg gatcggcagg 1320
aagtacaaga aggtgcggtt catggcctac accgacgaga cattcaagac cagggaggcc 1380
atccagcacg agagcggcat cctgggcccc ctgctgtacg gcgaggtcgg cgacaccctg 1440
ctgatcatct tcaagaacca ggccagccgg ccctacaaca tctaccccca cggcatcacc 1500
gacgtgaggc ccctgtacag ccggaggctg cccaagggcg tgaagcacct gaaggacttc 1560
cccatcctgc ccggcgagat cttcaagtac aagtggaccg tgaccgtgga ggacggcccc 1620
accaagagcg acccccggtg cctgaccagg tactacagca gcttcgtgaa catggagagg 1680
gacctggcca gcggcctgat cggccccctg ctgatctgct acaaggagag cgtggaccag 1740
cggggcaacc agatcatgag cgacaagagg aacgtgatcc tgttcagcgt gttcgacgag 1800
aaccggagct ggtacctgac cgagaacatc cagaggttcc tgcccaaccc cgccggcgtg 1860
cagctggagg accccgagtt ccaggccagc aacatcatgc acagcatcaa cggctacgtg 1920
ttcgacagcc tgcagctgag cgtgtgcctg cacgaggtgg cctactggta catcctgagc 1980
atcggcgccc agaccgactt cctgagcgtg ttcttcagcg gctacacctt caagcacaag 2040
atggtgtacg aggacaccct gaccctgttc cccttcagcg gcgagacggt gttcatgagc 2100
atggagaacc ccggcctgtg gatactgggc tgccacaaca gcgacttccg gaacaggggc 2160
atgaccgccc tgctgaaggt gagcagctgc gacaagaaca ccggcgacta ctacgaggac 2220
agctacgagg acatcagcgc ctacctgctg agcaagaaca acgccatcga gccccggagc 2280
ttcagccaga acagcaggca ccccagccag aacccccccg tgctgaagag gcaccagagg 2340
gagatcacca ggaccaccct gcagagcgac caggaggaga tcgactacga cgacaccatc 2400
agcgtggaga tgaagaagga ggacttcgac atctacgacg aggacgagaa ccagagcccc 2460
cggagcttcc agaagaagac caggcactac ttcatcgccg ccgtggagag gctgtgggac 2520
tacggcatga gcagcagccc ccacgtgctg aggaacaggg cccagagcgg cagcgtgccc 2580
cagttcaaga aggtggtgtt ccaggagttc accgacggca gcttcaccca gcccctgtac 2640
aggggcgagc tgaacgagca cctgggcctg ctgggcccct acatcagggc cgaggtggag 2700
gacaacatca tggtgacctt ccggaaccag gccagcaggc cctacagctt ctacagcagc 2760
ctgatcagct acgaggagga ccagaggcag ggcgccgagc ccaggaagaa cttcgtgaag 2820
cccaacgaga cgaagaccta cttctggaag gtgcagcacc acatggcccc caccaaggac 2880
gagttcgact gcaaggcctg ggcctacttc agcgacgtgg acctggagaa ggacgtgcac 2940
tccggcctga tcgggcccct gctggtgtgc cacaccaaca ccctgaaccc cgcccacggc 3000
cggcaggtga ccgtgcagga gttcgccctg ttcttcacca tcttcgacga gacgaagagc 3060
tggtacttca ccgagaacat ggagcggaac tgcagggccc cctgcaacat ccagatggag 3120
gaccccacct tcaaggagaa ctacaggttc cacgccatca acggctacat catggacacc 3180
ctgcccggcc tggtcatggc ccaggaccag cggatcaggt ggtacctgct gagcatgggc 3240
agcaacgaga acatccacag catccacttc agcggccacg tgttcaccgt gcggaagaag 3300
gaggagtaca agatggccct gtacaacctg taccccggcg tgttcgagac ggtggagatg 3360
ctgcccagca aggccggcat ctggagggtg gagtgcctga tcggcgagca cctgcacgcc 3420
ggcatgagca ccctgttcct ggtgtacagc aacaagtgcc agacccccct gggcatggcc 3480
agcggccaca tccgggactt ccagatcacc gccagcggcc agtacggcca gtgggccccc 3540
aagctggcca ggctgcacta cagcggcagc atcaacgcct ggagcaccaa ggagcccttc 3600
agctggatca aggtggacct gctggccccc atgatcatcc acggcatcaa gacccagggc 3660
gccaggcaga agttcagcag cctgtacatc agccagttca tcatcatgta cagcctggac 3720
ggcaagaagt ggcagaccta caggggcaac agcaccggca ccctgatggt gttcttcggc 3780
aacgtggaca gcagcggcat caagcacaac atcttcaacc cccccatcat cgcccggtac 3840
atcaggctgc accccaccca ctacagcatc cggagcaccc tgaggatgga gctgatgggc 3900
tgcgacctga acagctgcag catgcccctg ggcatggaga gcaaggccat cagcgacgcc 3960
cagatcaccg ccagcagcta cttcaccaac atgttcgcca cctggagccc cagcaaggcc 4020
aggctgcacc tgcagggccg gagcaacgcc tggaggcccc aggtgaacaa ccccaaggag 4080
tggctgcagg tggacttcca gaagaccatg aaggtgaccg gcgtgaccac ccagggcgtg 4140
aagagcctgc tgaccagcat gtacgtgaag gagttcctga tcagcagcag ccaggacggc 4200
caccagtgga ccctgttctt ccagaacggc aaggtgaagg tgttccaggg caaccaggac 4260
agcttcaccc ccgtggtgaa cagcctggac ccccccctgc tgaccaggta cctgcgcatc 4320
cacccccaga gctgggtcca ccagatcgcc ctgaggatgg aggtgctggg gtgtgaggcc 4380
caggacctgt at 4392
<210> 11
<211> 11341
<212> DNA
<213> Artificial Sequence
<400> 11
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ctgaaagcga 3000
aagggaaacc agagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 3060
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 3120
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 3180
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 3240
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 3300
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 3360
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 3420
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 3480
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 3540
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 3600
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 3660
gcagcaggaa gcactatggg cgcagcctca atgacgctga cggtacaggc cagacaatta 3720
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 3780
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 3840
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 3900
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 3960
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 4020
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 4080
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 4140
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 4200
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 4260
aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga cagagacaga 4320
tccattcgat tagtgaacgg atctcgacgg tatcggttaa cttttaaaag aaaagggggg 4380
attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact 4440
aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgatggggg aggctgctgg 4500
tgaatattaa ccaaggtcac cccagttatc ggaggagcaa acaggggcta agtccactct 4560
tgcatctaaa atgagagaca aaaaatctat aaaaatggaa aacatgcata gaaatatgtg 4620
agggaggaaa aaattacccc caagaatgtt agtgcacgca gtcacacagg gagaagacta 4680
tttttgtttt gttttgattg ttttgttttg ttttggttgt tttgttttgg tgacctaact 4740
ggtcaaatga cctattaaga atatttcata gaacgaatgt tccgatgctc taatctctct 4800
agacaaggtt catatttgta tgggttactt attctctctt tgttgactaa gtcaataatc 4860
agaatcagca ggtttgcagt cagattggca gggataagca gcctagctca ggagaagtga 4920
gtataaaagc cccaggctgg gagcagccat cacagaagtc cactcattct tggatccacc 4980
ggtgccacca tgcagatcga gctgtccacc tgtttcttcc tgtgcctgct gcgcttctgt 5040
ttctccgcca cccgacgata ctacctgggg gccgtggagc tgagctggga ctacatgcag 5100
agcgacctgg gcgagctgcc cgtggacgcc cggttccccc ccagggtgcc caagagcttc 5160
cccttcaaca ccagcgtggt gtacaagaag accctgttcg tggagttcac cgaccacctg 5220
ttcaacatcg ccaagcccag gcccccctgg atgggcctgc tgggccccac catccaggcc 5280
gaggtgtacg acaccgtggt catcaccctg aagaacatgg ccagccaccc cgtgagcctg 5340
cacgccgtgg gcgtgagcta ctggaaggcc agcgagggcg ccgagtacga cgaccagacc 5400
agccagaggg agaaggagga cgacaaggtg ttccccggcg gcagccacac ctacgtgtgg 5460
caggtgctga aggagaacgg ccccatggcc agcgaccccc tgtgcctgac ctacagctac 5520
ctgagccacg tggacctggt gaaggacctg aacagcggcc tgatcggcgc cctgctggtg 5580
tgcagggagg gcagcctggc caaggagaag acccagaccc tgcacaagtt catcctgctg 5640
ttcgccgtgt tcgacgaggg caagagctgg cacagcgaga cgaagaacag cctgatgcag 5700
gacagggacg ccgccagcgc cagggcctgg cccaagatgc acaccgtgaa cggctacgtg 5760
aaccggagcc tgcccggcct gatcggctgc cacaggaaga gcgtgtactg gcacgtgatc 5820
ggcatgggca ccacccccga ggtgcacagc atcttcctgg agggccacac cttcctggtg 5880
cggaaccaca ggcaggccag cctggagatc agccccatca ccttcctgac cgcccagacc 5940
ctgctgatgg acctgggcca gttcctgctg ttctgccaca tcagcagcca ccagcacgac 6000
ggcatggagg cctacgtgaa ggtggacagc tgccccgagg agccccagct gcggatgaag 6060
aacaacgagg aggccgagga ctacgacgac gacctgaccg acagcgagat ggacgtggtg 6120
cggttcgacg acgacaacag ccccagcttc atccagatca ggagcgtggc caagaagcac 6180
cccaagacct gggtgcacta catcgccgcc gaggaggagg actgggacta cgcccccctg 6240
gtgctggccc ccgacgacag gagctacaag agccagtacc tgaacaacgg cccccagcgg 6300
atcggcagga agtacaagaa ggtgcggttc atggcctaca ccgacgagac attcaagacc 6360
agggaggcca tccagcacga gagcggcatc ctgggccccc tgctgtacgg cgaggtcggc 6420
gacaccctgc tgatcatctt caagaaccag gccagccggc cctacaacat ctacccccac 6480
ggcatcaccg acgtgaggcc cctgtacagc cggaggctgc ccaagggcgt gaagcacctg 6540
aaggacttcc ccatcctgcc cggcgagatc ttcaagtaca agtggaccgt gaccgtggag 6600
gacggcccca ccaagagcga cccccggtgc ctgaccaggt actacagcag cttcgtgaac 6660
atggagaggg acctggccag cggcctgatc ggccccctgc tgatctgcta caaggagagc 6720
gtggaccagc ggggcaacca gatcatgagc gacaagagga acgtgatcct gttcagcgtg 6780
ttcgacgaga accggagctg gtacctgacc gagaacatcc agaggttcct gcccaacccc 6840
gccggcgtgc agctggagga ccccgagttc caggccagca acatcatgca cagcatcaac 6900
ggctacgtgt tcgacagcct gcagctgagc gtgtgcctgc acgaggtggc ctactggtac 6960
atcctgagca tcggcgccca gaccgacttc ctgagcgtgt tcttcagcgg ctacaccttc 7020
aagcacaaga tggtgtacga ggacaccctg accctgttcc ccttcagcgg cgagacggtg 7080
ttcatgagca tggagaaccc cggcctgtgg atactgggct gccacaacag cgacttccgg 7140
aacaggggca tgaccgccct gctgaaggtg agcagctgcg acaagaacac cggcgactac 7200
tacgaggaca gctacgagga catcagcgcc tacctgctga gcaagaacaa cgccatcgag 7260
ccccggagct tcagccagaa cagcaggcac cccagccaga acccccccgt gctgaagagg 7320
caccagaggg agatcaccag gaccaccctg cagagcgacc aggaggagat cgactacgac 7380
gacaccatca gcgtggagat gaagaaggag gacttcgaca tctacgacga ggacgagaac 7440
cagagccccc ggagcttcca gaagaagacc aggcactact tcatcgccgc cgtggagagg 7500
ctgtgggact acggcatgag cagcagcccc cacgtgctga ggaacagggc ccagagcggc 7560
agcgtgcccc agttcaagaa ggtggtgttc caggagttca ccgacggcag cttcacccag 7620
cccctgtaca ggggcgagct gaacgagcac ctgggcctgc tgggccccta catcagggcc 7680
gaggtggagg acaacatcat ggtgaccttc cggaaccagg ccagcaggcc ctacagcttc 7740
tacagcagcc tgatcagcta cgaggaggac cagaggcagg gcgccgagcc caggaagaac 7800
ttcgtgaagc ccaacgagac gaagacctac ttctggaagg tgcagcacca catggccccc 7860
accaaggacg agttcgactg caaggcctgg gcctacttca gcgacgtgga cctggagaag 7920
gacgtgcact ccggcctgat cgggcccctg ctggtgtgcc acaccaacac cctgaacccc 7980
gcccacggcc ggcaggtgac cgtgcaggag ttcgccctgt tcttcaccat cttcgacgag 8040
acgaagagct ggtacttcac cgagaacatg gagcggaact gcagggcccc ctgcaacatc 8100
cagatggagg accccacctt caaggagaac tacaggttcc acgccatcaa cggctacatc 8160
atggacaccc tgcccggcct ggtcatggcc caggaccagc ggatcaggtg gtacctgctg 8220
agcatgggca gcaacgagaa catccacagc atccacttca gcggccacgt gttcaccgtg 8280
cggaagaagg aggagtacaa gatggccctg tacaacctgt accccggcgt gttcgagacg 8340
gtggagatgc tgcccagcaa ggccggcatc tggagggtgg agtgcctgat cggcgagcac 8400
ctgcacgccg gcatgagcac cctgttcctg gtgtacagca acaagtgcca gacccccctg 8460
ggcatggcca gcggccacat ccgggacttc cagatcaccg ccagcggcca gtacggccag 8520
tgggccccca agctggccag gctgcactac agcggcagca tcaacgcctg gagcaccaag 8580
gagcccttca gctggatcaa ggtggacctg ctggccccca tgatcatcca cggcatcaag 8640
acccagggcg ccaggcagaa gttcagcagc ctgtacatca gccagttcat catcatgtac 8700
agcctggacg gcaagaagtg gcagacctac aggggcaaca gcaccggcac cctgatggtg 8760
ttcttcggca acgtggacag cagcggcatc aagcacaaca tcttcaaccc ccccatcatc 8820
gcccggtaca tcaggctgca ccccacccac tacagcatcc ggagcaccct gaggatggag 8880
ctgatgggct gcgacctgaa cagctgcagc atgcccctgg gcatggagag caaggccatc 8940
agcgacgccc agatcaccgc cagcagctac ttcaccaaca tgttcgccac ctggagcccc 9000
agcaaggcca ggctgcacct gcagggccgg agcaacgcct ggaggcccca ggtgaacaac 9060
cccaaggagt ggctgcaggt ggacttccag aagaccatga aggtgaccgg cgtgaccacc 9120
cagggcgtga agagcctgct gaccagcatg tacgtgaagg agttcctgat cagcagcagc 9180
caggacggcc accagtggac cctgttcttc cagaacggca aggtgaaggt gttccagggc 9240
aaccaggaca gcttcacccc cgtggtgaac agcctggacc cccccctgct gaccaggtac 9300
ctgcgcatcc acccccagag ctgggtccac cagatcgccc tgaggatgga ggtgctgggg 9360
tgtgaggccc aggacctgta ttgagcggcc gcgtttaaac gtcgacaatc aacctctgga 9420
ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 9480
tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 9540
ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 9600
gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 9660
caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 9720
actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 9780
ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg ctgctcgcct gtgttgccac 9840
ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 9900
tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 9960
gacgagtcgg atctcccttt gggccgcctc cccgcctgga attcgagctc ggtaccttta 10020
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 10080
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 10140
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 10200
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 10260
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtagt 10320
agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat atcagagagt 10380
gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 10440
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 10500
gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccagt tccgcccatt 10560
ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 10620
ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcgtcgaga 10680
cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 10740
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 10800
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 10860
cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 10920
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 10980
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 11040
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta 11100
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 11160
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 11220
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 11280
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc 11340
c 11341
<210> 12
<211> 1182
<212> DNA
<213> Artificial Sequence
<400> 12
gctccggtgc ccgtcagtgg gcagagcgca catcgcccac agtccccgag aagttggggg 60
gaggggtcgg caattgaacc ggtgcctaga gaaggtggcg cggggtaaac tgggaaagtg 120
atgtcgtgta ctggctccgc ctttttcccg agggtggggg agaaccgtat ataagtgcag 180
tagtcgccgt gaacgttctt tttcgcaacg ggtttgccgc cagaacacag gtaagtgccg 240
tgtgtggttc ccgcgggcct ggcctcttta cgggttatgg cccttgcgtg ccttgaatta 300
cttccacgcc cctggctgca gtacgtgatt cttgatcccg agcttcgggt tggaagtggg 360
tgggagagtt cgaggccttg cgcttaagga gccccttcgc ctcgtgcttg agttgaggcc 420
tggcttgggc gctggggccg ccgcgtgcga atctggtggc accttcgcgc ctgtctccct 480
gctttcgata agtctctagc catttaaaat ttttgatgac ctgctgcgac gctttttttc 540
tggcaagata gtcttgtaaa tgcgggccaa gatctgcaca ctggtatttc ggtttttggg 600
gccgcgggcg gcgacggggc ccgtgcgtcc cagcgcacat gttcggcgag gcggggcctg 660
cgagcgcggc caccgagaat cggacggggg tagtctcaag ctggccggcc tgctctggtg 720
cctggcctcg cgccgccgtg tatcgccccg ccctgggcgg caaggctggc ccggtcggca 780
ccagttgcgt gagcggaaag atggccgctt cccggccctg ctgcagggag ctcaaaatgg 840
aggacgcggc gctcgggaga gcgggcgggt gagtcaccca cacaaaggaa aagggccttt 900
ccgtcctcag ccgtcgcttc atgtgactcc acggagtacc gggcgccgtc caggcacctc 960
gattagttct cgagcttttg gagtacgtcg tctttaggtt ggggggaggg gttttatgcg 1020
atggagtttc cccacactga gtgggtggag actgaagtta ggccagcttg gcacttgatg 1080
taattctcct tggaatttgc cctttttgag tttggatctt ggttcattct caagcctcag 1140
acagtggttc aaagtttttt tcttccattt caggtgtcgt ga 1182
<210> 13
<211> 7896
<212> DNA
<213> Artificial Sequence
<400> 13
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ttgaaagcga 3000
aagggaaacc agaggagctc tctcgacgca ggactcggct tgctgaagcg cgcacggcaa 3060
gaggcgaggg gcggcgactg gtgagtacgc caaaaatttt gactagcgga ggctagaagg 3120
agagagatgg gtgcgagagc gtcagtatta agcgggggag aattagatcg cgatgggaaa 3180
aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 3240
gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 3300
gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 3360
tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 3420
aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaccacc gcacagcaag 3480
cggccgctga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 3540
tataaatata aagtagtaaa aattgaacca ttaggagtag cacccaccaa ggcaaagaga 3600
agagtggtgc agagagaaaa aagagcagtg ggaataggag ctttgttcct tgggttcttg 3660
ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca ggccagacaa 3720
ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga ggcgcaacag 3780
catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat cctggctgtg 3840
gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg aaaactcatt 3900
tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga acagatttgg 3960
aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag cttaatacac 4020
tccttaattg aagaatcgca aaaccagcaa gaaaagaatg aacaagaatt attggaatta 4080
gataaatggg caagtttgtg gaattggttt aacataacaa attggctgtg gtatataaaa 4140
ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 4200
atagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaacc 4260
ccgaggggac ccgacaggcc cgaaggaata gaagaagaag gtggagagag agacagagac 4320
agatccattc gattagtgaa cggatctcga cggtatcggt taacttttaa aagaaaaggg 4380
gggattgggg ggtacagtgc aggggaaaga atagtagaca taatagcaac agacatacaa 4440
actaaagaat tacaaaaaca aattacaaaa attcaaaatt ttatcgatga gtaattcata 4500
caaaaggact cgcccctgcc ttggggaatc ccagggaccg tcgttaaact cccactaacg 4560
tagaacccag agatcgctgc gttcccgccc cctcacccgc ccgctctcgt catcactgag 4620
gtggagaaga gcatgcgtga ggctccggtg cccgtcagtg ggcagagcgc acatcgccca 4680
cagtccccga gaagttgggg ggaggggtcg gcaattgaac cggtgcctag agaaggtggc 4740
gcggggtaaa ctgggaaagt gatgtcgtgt actggctccg cctttttccc gagggtgggg 4800
gagaaccgta tataagtgca gtagtcgccg tgaacgttct ttttcgcaac gggtttgccg 4860
ccagaacaca ggtaagtgcc gtgtgtggtt cccgcgggcc tggcctcttt acgggttatg 4920
gcccttgcgt gccttgaatt acttccacgc ccctggctgc agtacgtgat tcttgatccc 4980
gagcttcggg ttggaagtgg gtgggagagt tcgaggcctt gcgcttaagg agccccttcg 5040
cctcgtgctt gagttgaggc ctggcttggg cgctggggcc gccgcgtgcg aatctggtgg 5100
caccttcgcg cctgtctccc tgctttcgat aagtctctag ccatttaaaa tttttgatga 5160
cctgctgcga cgcttttttt ctggcaagat agtcttgtaa atgcgggcca agatctgcac 5220
actggtattt cggtttttgg ggccgcgggc ggcgacgggg cccgtgcgtc ccagcgcaca 5280
tgttcggcga ggcggggcct gcgagcgcgg ccaccgagaa tcggacgggg gtagtctcaa 5340
gctggccggc ctgctctggt gcctggcctc gcgccgccgt gtatcgcccc gccctgggcg 5400
gcaaggctgg cccggtcggc accagttgcg tgagcggaaa gatggccgct tcccggccct 5460
gctgcaggga gctcaaaatg gaggacgcgg cgctcgggag agcgggcggg tgagtcaccc 5520
acacaaagga aaagggcctt tccgtcctca gccgtcgctt catgtgactc cacggagtac 5580
cgggcgccgt ccaggcacct cgattagttc tcgagctttt ggagtacgtc gtctttaggt 5640
tggggggagg ggttttatgc gatggagttt ccccacactg agtgggtgga gactgaagtt 5700
aggccagctt ggcacttgat gtaattctcc ttggaatttg ccctttttga gtttggatct 5760
tggttcattc tcaagcctca gacagtggtt caaagttttt ttcttccatt tcaggtgtcg 5820
tgaggatcta tttccggtga gacccaagct ggctagctaa acttacgcgt gcctcggatc 5880
ctccagtgtg gtgtgcagat atccagcaca gtcccgggcc gagtctagac gtttaaaccc 5940
gctgatcagg tcgacaatca acctctggat tacaaaattt gtgaaagatt gactggtatt 6000
cttaactatg ttgctccttt tacgctatgt ggatacgctg ctttaatgcc tttgtatcat 6060
gctattgctt cccgtatggc tttcattttc tcctccttgt ataaatcctg gttgctgtct 6120
ctttatgagg agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct 6180
gacgcaaccc ccactggttg gggcattgcc accacctgtc agctcctttc cgggactttc 6240
gctttccccc tccctattgc cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg 6300
acaggggctc ggctgttggg cactgacaat tccgtggtgt tgtcggggaa gctgacgtcc 6360
tttccatggc tgctcgcctg tgttgccacc tggattctgc gcgggacgtc cttctgctac 6420
gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg 6480
cctcttccgc gtcttcgcct tcgccctcag acgagtcgga tctccctttg ggccgcctcc 6540
ccgcctggaa ttcgagctcg gtacctttaa gaccaatgac ttacaaggca gctgtagatc 6600
ttagccactt tttaaaagaa aaggggggac tggaagggct aattcactcc caacgaagac 6660
aagatctgct ttttgcttgt actgggtctc tctggttaga ccagatctga gcctgggagc 6720
tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct tgagtgcttc 6780
aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc agaccctttt 6840
agtcagtgtg gaaaatctct agcagtagta gttcatgtca tcttattatt cagtatttat 6900
aacttgcaaa gaaatgaata tcagagagtg agaggaactt gtttattgca gcttataatg 6960
gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt tcactgcatt 7020
ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca tgtctggctc tagctatccc 7080
gcccctaact ccgcccatcc cgcccctaac tccgcccagt tccgcccatt ctccgcccca 7140
tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct ctgagctatt 7200
ccagaagtag tgaggaggct tttttggagg cctagggacg tacccaattc gccctatagt 7260
gagtcgtatt acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct 7320
ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 7380
gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatgggac 7440
gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct 7500
acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 7560
ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 7620
gctttacggc acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca 7680
tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 7740
ctcttgttcc aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa 7800
gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac 7860
gcgaatttta acaaaatatt aacgtttaca atttcc 7896
<210> 14
<211> 12061
<212> DNA
<213> Artificial Sequence
<400> 14
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ctgaaagcga 3000
aagggaaacc agagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 3060
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 3120
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 3180
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 3240
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 3300
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 3360
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 3420
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 3480
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 3540
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 3600
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 3660
gcagcaggaa gcactatggg cgcagcctca atgacgctga cggtacaggc cagacaatta 3720
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 3780
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 3840
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 3900
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 3960
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 4020
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 4080
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 4140
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 4200
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 4260
aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga cagagacaga 4320
tccattcgat tagtgaacgg atctcgacgg tatcggttaa cttttaaaag aaaagggggg 4380
attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact 4440
aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgataagag catgcgtgag 4500
gctccggtgc ccgtcagtgg gcagagcgca catcgcccac agtccccgag aagttggggg 4560
gaggggtcgg caattgaacc ggtgcctaga gaaggtggcg cggggtaaac tgggaaagtg 4620
atgtcgtgta ctggctccgc ctttttcccg agggtggggg agaaccgtat ataagtgcag 4680
tagtcgccgt gaacgttctt tttcgcaacg ggtttgccgc cagaacacag gtaagtgccg 4740
tgtgtggttc ccgcgggcct ggcctcttta cgggttatgg cccttgcgtg ccttgaatta 4800
cttccacgcc cctggctgca gtacgtgatt cttgatcccg agcttcgggt tggaagtggg 4860
tgggagagtt cgaggccttg cgcttaagga gccccttcgc ctcgtgcttg agttgaggcc 4920
tggcttgggc gctggggccg ccgcgtgcga atctggtggc accttcgcgc ctgtctccct 4980
gctttcgata agtctctagc catttaaaat ttttgatgac ctgctgcgac gctttttttc 5040
tggcaagata gtcttgtaaa tgcgggccaa gatctgcaca ctggtatttc ggtttttggg 5100
gccgcgggcg gcgacggggc ccgtgcgtcc cagcgcacat gttcggcgag gcggggcctg 5160
cgagcgcggc caccgagaat cggacggggg tagtctcaag ctggccggcc tgctctggtg 5220
cctggcctcg cgccgccgtg tatcgccccg ccctgggcgg caaggctggc ccggtcggca 5280
ccagttgcgt gagcggaaag atggccgctt cccggccctg ctgcagggag ctcaaaatgg 5340
aggacgcggc gctcgggaga gcgggcgggt gagtcaccca cacaaaggaa aagggccttt 5400
ccgtcctcag ccgtcgcttc atgtgactcc acggagtacc gggcgccgtc caggcacctc 5460
gattagttct cgagcttttg gagtacgtcg tctttaggtt ggggggaggg gttttatgcg 5520
atggagtttc cccacactga gtgggtggag actgaagtta ggccagcttg gcacttgatg 5580
taattctcct tggaatttgc cctttttgag tttggatctt ggttcattct caagcctcag 5640
acagtggttc aaagtttttt tcttccattt caggtgtcgt gactcattct tggatccacc 5700
ggtgccacca tgcagatcga actgagtacc tgcttctttc tttgcttgtt gcgcttttgc 5760
ttttctgcga cgcggcggta ttacttgggg gcggttgagt tgagctggga ttatatgcaa 5820
tctgacctgg gtgaactgcc ggtcgatgcg cggttcccgc cccgggttcc caaaagtttt 5880
ccattcaaca catccgtcgt ttacaaaaaa acgcttttcg tagaattcac agatcacctg 5940
tttaacatag cgaagccacg acctccctgg atgggccttt tgggacctac gatacaagct 6000
gaagtatacg ataccgttgt tatcacactg aagaacatgg caagtcaccc tgtttccctt 6060
catgccgtgg gagtatcata ttggaaggct tctgaaggag cagaatatga tgatcaaaca 6120
agtcaaagag agaaggaaga tgacaaagtg ttccctgggg ggagtcatac gtacgtgtgg 6180
caagtattga aggaaaatgg tccgatggcg tctgacccgc tttgtcttac ctattcctac 6240
ctttctcacg tggacctggt aaaagatctg aactcaggtc tcattggcgc cctgttggtt 6300
tgtcgcgagg gttcattggc aaaagaaaag actcaaacgc ttcacaagtt tatccttctc 6360
tttgccgtct tcgatgaagg gaagtcttgg catagtgaga ctaagaactc cctcatgcaa 6420
gacagggatg ctgcatccgc gcgagcgtgg cctaagatgc acacggttaa cggctatgtg 6480
aacaggagcc tgccagggct catcggttgc cacaggaagt ccgtgtactg gcatgttata 6540
gggatgggga ctacacctga agtccattct atattcctcg aaggacacac ctttcttgta 6600
cgaaatcacc gccaagcgtc tcttgaaatt tcccctatta ccttcctcac tgcacaaacc 6660
cttctgatgg acctgggcca atttcttctg ttctgtcaca ttagttcaca tcaacatgac 6720
ggtatggagg cttacgtcaa ggtggacagc tgcccagagg aaccccaatt gcgcatgaaa 6780
aacaatgagg aagctgaaga ctacgacgat gatcttacgg actccgagat ggacgtggtt 6840
cgctttgatg atgataattc tccttctttc atccaaatcc gatctgtggc aaaaaaacat 6900
cccaagacgt gggtgcatta tatcgctgcg gaggaagaag attgggacta tgctcctttg 6960
gtgcttgcac ctgatgaccg cagttataag tcacagtact tgaataacgg ccctcaaaga 7020
atcggaagaa aatataagaa ggtccgattc atggcctaca ccgacgagac gttcaaaacc 7080
cgagaagcta ttcagcacga aagcggaata ctggggccgc tgttgtatgg tgaagtcgga 7140
gatacacttc tcataatatt taaaaaccag gcttcacggc catacaacat ctatccgcat 7200
ggtatcaccg acgtgcggcc cctgtatagt cggagactgc ctaagggggt aaaacatctc 7260
aaggattttc cgattctccc cggagaaatt ttcaagtata aatggacggt gacggtcgag 7320
gatggtccta ccaaatccga tccccggtgt ctcacaagat actacagcag cttcgttaat 7380
atggaaagag acctcgcttc cggacttatc ggaccgttgc tcatatgtta caaagagtcc 7440
gtagatcaaa ggggcaacca aattatgtcc gataagcgga atgttatatt gttcagtgtc 7500
ttcgacgaga acaggtcttg gtatttgact gaaaacatcc agcgatttct gccgaacccc 7560
gcaggggtac aattggagga cccggaattc caagctagta atatcatgca ttctatcaat 7620
ggatacgtat ttgattccct tcagcttagc gtttgtctgc atgaagtcgc atattggtat 7680
atccttagta ttggtgctca aactgacttc ctgtctgtat ttttttctgg ttacaccttc 7740
aagcacaaga tggtctacga ggacactctt acgctttttc ccttctctgg agagacggtg 7800
tttatgagca tggaaaaccc cgggctttgg attctcgggt gccataattc agacttccgc 7860
aacaggggta tgacagcctt gttgaaggtc agctcctgcg ataaaaacac cggggattac 7920
tatgaagact cctacgaaga catttctgca tatctcctct ccaaaaacaa cgcgatcgaa 7980
ccaaggtctt tttcccagaa ctcacggcat ccaagccaga atccaccagt gttgaaacgc 8040
catcagaggg aaattacgcg aacgaccttg caaagcgatc aggaagaaat tgattatgac 8100
gacactataa gtgtagaaat gaaaaaagag gactttgaca tctatgatga ggatgagaac 8160
cagtccccaa gaagttttca gaagaagacc cgccactatt ttattgctgc cgtcgaacgg 8220
ttgtgggact acggaatgag ctcctccccg catgtgttgc ggaatcgagc ccaaagtggc 8280
tctgtgcctc agttcaaaaa ggtcgtattc caagaattca ctgatggcag cttcactcag 8340
ccactgtatc ggggggagtt gaacgaacat ctcggcctct tgggcccata catacgcgct 8400
gaggttgaag ataacataat ggtaactttt cgaaatcagg catcaaggcc ttattcattt 8460
tacagctctc tcatatctta cgaagaggac caaagacaag gagcggaacc tcgcaagaat 8520
tttgtaaaac ccaatgaaac gaaaacgtat ttctggaagg ttcagcacca catggcccca 8580
acaaaggatg aatttgattg taaagcgtgg gcgtatttta gtgacgtcga tctcgaaaag 8640
gatgttcatt cagggcttat cggtcccctc cttgtgtgtc atacaaacac acttaatccg 8700
gcgcacggta gacaagtaac tgtgcaggaa tttgcgttgt ttttcacgat ctttgatgaa 8760
actaagtcat ggtattttac ggagaacatg gagcggaatt gtagggcacc atgtaatata 8820
cagatggaag acccaacctt taaagaaaat tacagattcc atgccataaa cgggtacatc 8880
atggatactc tcccaggact ggtaatggct caggaccagc gaatacgatg gtacttgctt 8940
agcatgggga gtaacgaaaa catccattct attcattttt caggccatgt gttcactgtc 9000
agaaaaaagg aggagtataa gatggcgctc tacaatctgt accccggtgt gtttgagacg 9060
gtagaaatgc tgccctccaa agctggtata tggagagtag agtgtttgat aggagaacat 9120
ctccacgccg gcatgtctac gctgtttctc gtttacagca ataagtgcca gacccccctg 9180
gggatggcta gtgggcacat ccgcgatttt caaattacag catctgggca atacggtcaa 9240
tgggcgccga aactggctag actgcattat tccgggtcca ttaatgcttg gtccactaag 9300
gagcccttca gctggatcaa ggtagacctt ctcgcgccta tgattataca cggtataaag 9360
acccaaggtg ccagacagaa gtttagtagc ctttacatat cacagtttat tataatgtac 9420
tccttggatg gcaagaagtg gcaaacctat aggggtaact ccacgggaac cctgatggtc 9480
tttttcggga acgtagactc ctcaggaata aagcacaata ttttcaatcc cccaatcata 9540
gcgcgctata tacgacttca tcctacgcat tactccatac gctctacgct gcgaatggag 9600
ctgatgggct gcgatctgaa cagttgctcc atgcctctgg gcatggaatc taaagccatc 9660
agcgatgcac aaattaccgc tagtagctac ttcaccaata tgtttgccac atggtccccg 9720
tctaaggctc gcctgcatct gcaaggccgg tccaacgcat ggcgacctca ggtcaataac 9780
ccaaaggaat ggttgcaggt agactttcag aagaccatga aggttaccgg ggtaactact 9840
cagggggtaa aatcactgtt gactagcatg tacgtgaaag aattcctcat tagcagtagt 9900
caagatggcc atcagtggac gctgttcttt caaaacggga aggtcaaagt tttccagggg 9960
aatcaggact ccttcacacc cgtcgtcaac tcactcgatc caccactgtt gacccggtac 10020
ctgagaatcc acccacaatc ctgggttcac caaatcgcac tcaggatgga agtactcggg 10080
tgcgaagcgc aggacctcta ctgagcggcc gcgtttaaac gtcgacaatc aacctctgga 10140
ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 10200
tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 10260
ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 10320
gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 10380
caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 10440
actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 10500
ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg ctgctcgcct gtgttgccac 10560
ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 10620
tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 10680
gacgagtcgg atctcccttt gggccgcctc cccgcctgga attcgagctc ggtaccttta 10740
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 10800
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 10860
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 10920
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 10980
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtagt 11040
agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat atcagagagt 11100
gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 11160
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 11220
gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccagt tccgcccatt 11280
ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 11340
ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcgtcgaga 11400
cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 11460
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 11520
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 11580
cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 11640
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 11700
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 11760
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta 11820
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 11880
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 11940
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 12000
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc 12060
c 12061
<210> 15
<211> 12061
<212> DNA
<213> Artificial Sequence
<400> 15
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ctgaaagcga 3000
aagggaaacc agagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 3060
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 3120
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 3180
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 3240
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 3300
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 3360
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 3420
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 3480
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 3540
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 3600
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 3660
gcagcaggaa gcactatggg cgcagcctca atgacgctga cggtacaggc cagacaatta 3720
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 3780
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 3840
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 3900
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 3960
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 4020
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 4080
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 4140
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 4200
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 4260
aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga cagagacaga 4320
tccattcgat tagtgaacgg atctcgacgg tatcggttaa cttttaaaag aaaagggggg 4380
attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact 4440
aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgataagag catgcgtgag 4500
gctccggtgc ccgtcagtgg gcagagcgca catcgcccac agtccccgag aagttggggg 4560
gaggggtcgg caattgaacc ggtgcctaga gaaggtggcg cggggtaaac tgggaaagtg 4620
atgtcgtgta ctggctccgc ctttttcccg agggtggggg agaaccgtat ataagtgcag 4680
tagtcgccgt gaacgttctt tttcgcaacg ggtttgccgc cagaacacag gtaagtgccg 4740
tgtgtggttc ccgcgggcct ggcctcttta cgggttatgg cccttgcgtg ccttgaatta 4800
cttccacgcc cctggctgca gtacgtgatt cttgatcccg agcttcgggt tggaagtggg 4860
tgggagagtt cgaggccttg cgcttaagga gccccttcgc ctcgtgcttg agttgaggcc 4920
tggcttgggc gctggggccg ccgcgtgcga atctggtggc accttcgcgc ctgtctccct 4980
gctttcgata agtctctagc catttaaaat ttttgatgac ctgctgcgac gctttttttc 5040
tggcaagata gtcttgtaaa tgcgggccaa gatctgcaca ctggtatttc ggtttttggg 5100
gccgcgggcg gcgacggggc ccgtgcgtcc cagcgcacat gttcggcgag gcggggcctg 5160
cgagcgcggc caccgagaat cggacggggg tagtctcaag ctggccggcc tgctctggtg 5220
cctggcctcg cgccgccgtg tatcgccccg ccctgggcgg caaggctggc ccggtcggca 5280
ccagttgcgt gagcggaaag atggccgctt cccggccctg ctgcagggag ctcaaaatgg 5340
aggacgcggc gctcgggaga gcgggcgggt gagtcaccca cacaaaggaa aagggccttt 5400
ccgtcctcag ccgtcgcttc atgtgactcc acggagtacc gggcgccgtc caggcacctc 5460
gattagttct cgagcttttg gagtacgtcg tctttaggtt ggggggaggg gttttatgcg 5520
atggagtttc cccacactga gtgggtggag actgaagtta ggccagcttg gcacttgatg 5580
taattctcct tggaatttgc cctttttgag tttggatctt ggttcattct caagcctcag 5640
acagtggttc aaagtttttt tcttccattt caggtgtcgt gactcattct tggatccacc 5700
ggtgccacca tgcagattga gctgtccacc tgtttcttcc tgtgcctgct gagattttgc 5760
ttcagtgcta caaggagata ctacctgggg gctgtcgagc tgtcttggga ttacatgcag 5820
tctgatctgg gcgaactgcc agtggacgcg cggtttcctc caagggtgcc aaagtccttc 5880
ccctttaata catctgtggt gtacaagaag accctgtttg tggagtttac cgaccacctg 5940
tttaacatcg cgaagcctag accaccctgg atgggcctgc tgggccccac aatccaggcc 6000
gaagtgtatg acacagtggt aatcacactg aagaacatgg ccagccaccc agtgtccctg 6060
cacgcggtgg gcgtatccta ctggaaggcc agcgaaggcg cggagtatga tgaccagaca 6120
tcccagagag agaaggagga tgacaaggtg tttccaggcg ggtcccacac ctatgtatgg 6180
caggtgctga aggagaatgg ccccatggcc tccgaccccc tgtgcctgac atacagctat 6240
ctgtcccacg tagacctggt gaaggatctg aattccgggc tgatcggggc cctgctggtg 6300
tgcagggagg gctccctggc caaggagaag acccagaccc tgcacaagtt tatcctgctg 6360
ttcgcggtgt ttgatgaggg caagtcctgg cacagcgaaa caaagaactc tctgatgcag 6420
gacagggacg cggccagcgc gcgggcctgg ccaaagatgc acaccgtaaa tggctatgtg 6480
aacaggtccc tgccaggcct gatcgggtgc cacagaaagt ctgtatattg gcacgtaatc 6540
gggatgggca ccacaccaga ggtgcactcc atctttctgg agggccacac cttcctggtg 6600
agaaaccaca ggcaggccag cctggagatc agccccatca cattcctgac agcccagacc 6660
ctgctgatgg acctgggcca gtttctgctg ttttgtcaca tcagctccca ccagcacgat 6720
ggcatggagg cctatgtgaa ggtggatagc tgccctgagg agccacagct gaggatgaag 6780
aataatgagg aggccgaaga ttatgatgat gacctgaccg acagcgaaat ggatgtggtg 6840
aggtttgacg atgacaactc cccatccttc atccagatcc gctccgtagc caagaagcac 6900
cctaagacat gggtgcacta tatcgcggcc gaagaggagg actgggatta tgccccactg 6960
gtgctggccc ctgacgatcg gagctacaag tcccagtatc tgaataatgg cccccagaga 7020
atcgggcgga agtacaagaa ggtgagattc atggcctata ccgatgagac attcaagacc 7080
agggaggcca tccagcacga atctggcatc ctgggccctc tgctgtatgg cgaagtgggc 7140
gacacactgc tgatcatctt caagaaccag gccagcagac catacaacat ctacccacac 7200
gggatcacag acgtacggcc tctgtacagc cgccgcctgc caaagggcgt aaagcacctg 7260
aaggatttcc ctatcctgcc cggggagatc tttaagtata agtggacagt gacagtggag 7320
gatggcccaa ccaagtccga tccaaggtgc ctgaccagat actacagctc ctttgtgaac 7380
atggagagag atctggccag cgggctgatc gggccactgc tgatctgcta caaggagagc 7440
gtagatcaga gaggcaatca gatcatgtcc gacaagagga atgtgatcct gttctctgta 7500
ttcgatgaga atagatcttg gtacctgaca gagaacatcc agagatttct gccaaatcca 7560
gccggggtgc agctggagga cccagagttt caggcctcta acatcatgca ctccatcaat 7620
ggctatgtat ttgactctct gcagctgagc gtatgtctgc acgaagtggc ctactggtac 7680
atcctgtcca tcggggccca gacagacttc ctgtctgtat tcttttctgg ctacacattc 7740
aagcacaaga tggtgtatga ggacaccctg accctgtttc ctttctccgg ggagaccgta 7800
ttcatgtcta tggagaatcc tggcctgtgg atactgggct gtcacaattc tgactttaga 7860
aatagaggca tgacagccct gctgaaggtg tcctcctgtg acaagaatac aggcgactac 7920
tatgaggatt cctatgagga tatcagcgcg tatctgctga gcaagaataa tgccatcgaa 7980
cctaggtctt tttctcagaa ctccagacac ccctcccaga accctcctgt gctgaagaga 8040
caccagagag agatcacaag gaccaccctg cagtctgacc aggaggagat cgactatgat 8100
gatacaatct ctgtggagat gaagaaggag gattttgata tctatgacga agatgagaat 8160
cagagcccaa gatccttcca gaagaagacc cgccactatt tcatcgcggc cgtagagaga 8220
ctgtgggact atggcatgtc ttcctctccc cacgtactga ggaacagagc ccagtctggc 8280
tctgtgcctc agtttaagaa ggtggtgttc caggagttca cagacgggag ctttacccag 8340
ccactgtaca gaggcgaact gaatgagcac ctgggcctgc tgggcccata catcagggcc 8400
gaagtggagg ataacatcat ggtgaccttc agaaatcagg ccagcaggcc atactccttt 8460
tactcctccc tgatctctta tgaggaggat cagagacagg gcgcggagcc aagaaagaat 8520
tttgtgaagc caaatgagac aaagacatac ttttggaagg tgcagcacca catggcccca 8580
accaaggacg aatttgattg caaggcctgg gcctattttt ccgacgtgga tctggagaag 8640
gatgtgcact ctggcctgat cgggcccctg ctggtgtgcc acacaaacac cctgaatcct 8700
gcccacgggc gccaggtgac agtgcaggag tttgccctgt tctttaccat ctttgacgaa 8760
acaaagtcct ggtacttcac agagaacatg gagagaaatt gcagggcccc atgcaatatc 8820
cagatggagg accccacctt caaggagaat tatagatttc acgcgatcaa tggctacatc 8880
atggataccc tgccaggcct ggtaatggcc caggatcaga gaatcagatg gtacctgctg 8940
tctatgggct ctaatgagaa catccactcc atccacttct ccgggcacgt attcaccgta 9000
aggaagaagg aggagtacaa gatggccctg tacaatctgt atccaggcgt atttgagaca 9060
gtggagatgc tgcccagcaa ggccgggatc tggagagtgg agtgtctgat cggggagcac 9120
ctgcacgcgg gcatgtccac actgtttctg gtgtactcca ataagtgcca gacccctctg 9180
ggcatggcca gcgggcacat ccgggacttc cagatcaccg cgtctggcca gtatggccag 9240
tgggccccca agctggcccg cctgcactat agcgggtcta tcaacgcgtg gtccaccaag 9300
gagccctttt cctggatcaa ggtggacctg ctggccccaa tgatcatcca cgggatcaag 9360
acccagggcg cgcgccagaa gttttcttct ctgtacatct cccagtttat catcatgtac 9420
tccctggatg gcaagaagtg gcagacatat cgcggcaact ccacaggcac cctgatggtg 9480
ttctttggca atgtggactc ctccgggatc aagcacaaca tctttaatcc accaatcatc 9540
gcgcggtata tcaggctgca ccccacccac tactctatca gaagcacact gaggatggag 9600
ctgatgggct gtgacctgaa ttcctgttcc atgcctctgg gcatggagag caaggccatc 9660
tctgatgccc agatcacagc cagctcttat ttcacaaaca tgtttgccac ctggtcccca 9720
tctaaggcca gactgcacct gcagggcaga tccaatgcct ggagaccaca ggtgaataat 9780
cccaaggagt ggctgcaggt ggacttccag aagacaatga aggtgaccgg ggtgaccacc 9840
cagggcgtaa agtccctgct gacatccatg tatgtgaagg agttcctgat cagctcttcc 9900
caggatggcc accagtggac cctgttcttt cagaatggca aggtgaaggt gtttcagggc 9960
aatcaggatt cctttacccc tgtggtgaat tccctggacc cacctctgct gaccagatat 10020
ctgagaatcc accctcagag ctgggtccac cagattgccc tgagaatgga agtgctggga 10080
tgtgaagctc aggatctgta ttgagcggcc gcgtttaaac gtcgacaatc aacctctgga 10140
ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 10200
tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 10260
ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 10320
gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 10380
caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 10440
actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 10500
ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg ctgctcgcct gtgttgccac 10560
ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 10620
tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 10680
gacgagtcgg atctcccttt gggccgcctc cccgcctgga attcgagctc ggtaccttta 10740
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 10800
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 10860
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 10920
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 10980
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtagt 11040
agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat atcagagagt 11100
gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 11160
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 11220
gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccagt tccgcccatt 11280
ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 11340
ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcgtcgaga 11400
cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 11460
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 11520
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 11580
cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 11640
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 11700
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 11760
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta 11820
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 11880
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 11940
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 12000
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc 12060
c 12061
<210> 16
<211> 12061
<212> DNA
<213> Artificial Sequence
<400> 16
aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 60
tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 120
agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 180
cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 240
taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 300
tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 360
tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 420
cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 480
gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 540
cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 600
actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 660
ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 720
tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 780
tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 840
acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 900
ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 960
ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 1020
ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 1080
gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 1140
ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 1200
aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 1260
gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 1320
gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 1380
aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 1440
cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 1500
tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 1560
ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 1620
atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 1680
cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 1740
ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 1800
gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 1860
cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 1920
cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 1980
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 2040
aaacagctat gaccatgatt acgccaagcg cgcaattaac cctcactaaa gggaacaaaa 2100
gctggagctg caagcttggc cattgcatac gttgtatcca tatcataata tgtacattta 2160
tattggctca tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata 2220
gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 2280
tacggtaaat ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 2340
gacgtatgtt cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta 2400
tttacggtaa actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 2460
tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 2520
ggactttcct acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 2580
gttttggcag tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 2640
ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 2700
atgtcgtaac aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 2760
ctatataagc agagctcgtt tagtgaaccg gggtctctct ggttagacca gatctgagcc 2820
tgggagctct ctggctaact agggaaccca ctgcttaagc ctcaataaag cttgccttga 2880
gtgcttcaag tagtgtgtgc ccgtctgttg tgtgactctg gtaactagag atccctcaga 2940
cccttttagt cagtgtggaa aatctctagc agtggcgccc gaacagggac ctgaaagcga 3000
aagggaaacc agagctctct cgacgcagga ctcggcttgc tgaagcgcgc acggcaagag 3060
gcgaggggcg gcgactggtg agtacgccaa aaattttgac tagcggaggc tagaaggaga 3120
gagatgggtg cgagagcgtc agtattaagc gggggagaat tagatcgcga tgggaaaaaa 3180
ttcggttaag gccaggggga aagaaaaaat ataaattaaa acatatagta tgggcaagca 3240
gggagctaga acgattcgca gttaatcctg gcctgttaga aacatcagaa ggctgtagac 3300
aaatactggg acagctacaa ccatcccttc agacaggatc agaagaactt agatcattat 3360
ataatacagt agcaaccctc tattgtgtgc atcaaaggat agagataaaa gacaccaagg 3420
aagctttaga caagatagag gaagagcaaa acaaaagtaa gaccaccgca cagcaagcgg 3480
ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag tgaattatat 3540
aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc aaagagaaga 3600
gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg gttcttggga 3660
gcagcaggaa gcactatggg cgcagcctca atgacgctga cggtacaggc cagacaatta 3720
ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc gcaacagcat 3780
ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct ggctgtggaa 3840
agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa actcatttgc 3900
accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca gatttggaat 3960
cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt aatacactcc 4020
ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt ggaattagat 4080
aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta tataaaatta 4140
ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt actttctata 4200
gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct cccaaccccg 4260
aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga cagagacaga 4320
tccattcgat tagtgaacgg atctcgacgg tatcggttaa cttttaaaag aaaagggggg 4380
attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga catacaaact 4440
aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgataagag catgcgtgag 4500
gctccggtgc ccgtcagtgg gcagagcgca catcgcccac agtccccgag aagttggggg 4560
gaggggtcgg caattgaacc ggtgcctaga gaaggtggcg cggggtaaac tgggaaagtg 4620
atgtcgtgta ctggctccgc ctttttcccg agggtggggg agaaccgtat ataagtgcag 4680
tagtcgccgt gaacgttctt tttcgcaacg ggtttgccgc cagaacacag gtaagtgccg 4740
tgtgtggttc ccgcgggcct ggcctcttta cgggttatgg cccttgcgtg ccttgaatta 4800
cttccacgcc cctggctgca gtacgtgatt cttgatcccg agcttcgggt tggaagtggg 4860
tgggagagtt cgaggccttg cgcttaagga gccccttcgc ctcgtgcttg agttgaggcc 4920
tggcttgggc gctggggccg ccgcgtgcga atctggtggc accttcgcgc ctgtctccct 4980
gctttcgata agtctctagc catttaaaat ttttgatgac ctgctgcgac gctttttttc 5040
tggcaagata gtcttgtaaa tgcgggccaa gatctgcaca ctggtatttc ggtttttggg 5100
gccgcgggcg gcgacggggc ccgtgcgtcc cagcgcacat gttcggcgag gcggggcctg 5160
cgagcgcggc caccgagaat cggacggggg tagtctcaag ctggccggcc tgctctggtg 5220
cctggcctcg cgccgccgtg tatcgccccg ccctgggcgg caaggctggc ccggtcggca 5280
ccagttgcgt gagcggaaag atggccgctt cccggccctg ctgcagggag ctcaaaatgg 5340
aggacgcggc gctcgggaga gcgggcgggt gagtcaccca cacaaaggaa aagggccttt 5400
ccgtcctcag ccgtcgcttc atgtgactcc acggagtacc gggcgccgtc caggcacctc 5460
gattagttct cgagcttttg gagtacgtcg tctttaggtt ggggggaggg gttttatgcg 5520
atggagtttc cccacactga gtgggtggag actgaagtta ggccagcttg gcacttgatg 5580
taattctcct tggaatttgc cctttttgag tttggatctt ggttcattct caagcctcag 5640
acagtggttc aaagtttttt tcttccattt caggtgtcgt gactcattct tggatccacc 5700
ggtgccacca tgcagatcga gctgtccacc tgtttcttcc tgtgcctgct gcgcttctgt 5760
ttctccgcca cccgacgata ctacctgggg gccgtggagc tgagctggga ctacatgcag 5820
agcgacctgg gcgagctgcc cgtggacgcc cggttccccc ccagggtgcc caagagcttc 5880
cccttcaaca ccagcgtggt gtacaagaag accctgttcg tggagttcac cgaccacctg 5940
ttcaacatcg ccaagcccag gcccccctgg atgggcctgc tgggccccac catccaggcc 6000
gaggtgtacg acaccgtggt catcaccctg aagaacatgg ccagccaccc cgtgagcctg 6060
cacgccgtgg gcgtgagcta ctggaaggcc agcgagggcg ccgagtacga cgaccagacc 6120
agccagaggg agaaggagga cgacaaggtg ttccccggcg gcagccacac ctacgtgtgg 6180
caggtgctga aggagaacgg ccccatggcc agcgaccccc tgtgcctgac ctacagctac 6240
ctgagccacg tggacctggt gaaggacctg aacagcggcc tgatcggcgc cctgctggtg 6300
tgcagggagg gcagcctggc caaggagaag acccagaccc tgcacaagtt catcctgctg 6360
ttcgccgtgt tcgacgaggg caagagctgg cacagcgaga cgaagaacag cctgatgcag 6420
gacagggacg ccgccagcgc cagggcctgg cccaagatgc acaccgtgaa cggctacgtg 6480
aaccggagcc tgcccggcct gatcggctgc cacaggaaga gcgtgtactg gcacgtgatc 6540
ggcatgggca ccacccccga ggtgcacagc atcttcctgg agggccacac cttcctggtg 6600
cggaaccaca ggcaggccag cctggagatc agccccatca ccttcctgac cgcccagacc 6660
ctgctgatgg acctgggcca gttcctgctg ttctgccaca tcagcagcca ccagcacgac 6720
ggcatggagg cctacgtgaa ggtggacagc tgccccgagg agccccagct gcggatgaag 6780
aacaacgagg aggccgagga ctacgacgac gacctgaccg acagcgagat ggacgtggtg 6840
cggttcgacg acgacaacag ccccagcttc atccagatca ggagcgtggc caagaagcac 6900
cccaagacct gggtgcacta catcgccgcc gaggaggagg actgggacta cgcccccctg 6960
gtgctggccc ccgacgacag gagctacaag agccagtacc tgaacaacgg cccccagcgg 7020
atcggcagga agtacaagaa ggtgcggttc atggcctaca ccgacgagac attcaagacc 7080
agggaggcca tccagcacga gagcggcatc ctgggccccc tgctgtacgg cgaggtcggc 7140
gacaccctgc tgatcatctt caagaaccag gccagccggc cctacaacat ctacccccac 7200
ggcatcaccg acgtgaggcc cctgtacagc cggaggctgc ccaagggcgt gaagcacctg 7260
aaggacttcc ccatcctgcc cggcgagatc ttcaagtaca agtggaccgt gaccgtggag 7320
gacggcccca ccaagagcga cccccggtgc ctgaccaggt actacagcag cttcgtgaac 7380
atggagaggg acctggccag cggcctgatc ggccccctgc tgatctgcta caaggagagc 7440
gtggaccagc ggggcaacca gatcatgagc gacaagagga acgtgatcct gttcagcgtg 7500
ttcgacgaga accggagctg gtacctgacc gagaacatcc agaggttcct gcccaacccc 7560
gccggcgtgc agctggagga ccccgagttc caggccagca acatcatgca cagcatcaac 7620
ggctacgtgt tcgacagcct gcagctgagc gtgtgcctgc acgaggtggc ctactggtac 7680
atcctgagca tcggcgccca gaccgacttc ctgagcgtgt tcttcagcgg ctacaccttc 7740
aagcacaaga tggtgtacga ggacaccctg accctgttcc ccttcagcgg cgagacggtg 7800
ttcatgagca tggagaaccc cggcctgtgg atactgggct gccacaacag cgacttccgg 7860
aacaggggca tgaccgccct gctgaaggtg agcagctgcg acaagaacac cggcgactac 7920
tacgaggaca gctacgagga catcagcgcc tacctgctga gcaagaacaa cgccatcgag 7980
ccccggagct tcagccagaa cagcaggcac cccagccaga acccccccgt gctgaagagg 8040
caccagaggg agatcaccag gaccaccctg cagagcgacc aggaggagat cgactacgac 8100
gacaccatca gcgtggagat gaagaaggag gacttcgaca tctacgacga ggacgagaac 8160
cagagccccc ggagcttcca gaagaagacc aggcactact tcatcgccgc cgtggagagg 8220
ctgtgggact acggcatgag cagcagcccc cacgtgctga ggaacagggc ccagagcggc 8280
agcgtgcccc agttcaagaa ggtggtgttc caggagttca ccgacggcag cttcacccag 8340
cccctgtaca ggggcgagct gaacgagcac ctgggcctgc tgggccccta catcagggcc 8400
gaggtggagg acaacatcat ggtgaccttc cggaaccagg ccagcaggcc ctacagcttc 8460
tacagcagcc tgatcagcta cgaggaggac cagaggcagg gcgccgagcc caggaagaac 8520
ttcgtgaagc ccaacgagac gaagacctac ttctggaagg tgcagcacca catggccccc 8580
accaaggacg agttcgactg caaggcctgg gcctacttca gcgacgtgga cctggagaag 8640
gacgtgcact ccggcctgat cgggcccctg ctggtgtgcc acaccaacac cctgaacccc 8700
gcccacggcc ggcaggtgac cgtgcaggag ttcgccctgt tcttcaccat cttcgacgag 8760
acgaagagct ggtacttcac cgagaacatg gagcggaact gcagggcccc ctgcaacatc 8820
cagatggagg accccacctt caaggagaac tacaggttcc acgccatcaa cggctacatc 8880
atggacaccc tgcccggcct ggtcatggcc caggaccagc ggatcaggtg gtacctgctg 8940
agcatgggca gcaacgagaa catccacagc atccacttca gcggccacgt gttcaccgtg 9000
cggaagaagg aggagtacaa gatggccctg tacaacctgt accccggcgt gttcgagacg 9060
gtggagatgc tgcccagcaa ggccggcatc tggagggtgg agtgcctgat cggcgagcac 9120
ctgcacgccg gcatgagcac cctgttcctg gtgtacagca acaagtgcca gacccccctg 9180
ggcatggcca gcggccacat ccgggacttc cagatcaccg ccagcggcca gtacggccag 9240
tgggccccca agctggccag gctgcactac agcggcagca tcaacgcctg gagcaccaag 9300
gagcccttca gctggatcaa ggtggacctg ctggccccca tgatcatcca cggcatcaag 9360
acccagggcg ccaggcagaa gttcagcagc ctgtacatca gccagttcat catcatgtac 9420
agcctggacg gcaagaagtg gcagacctac aggggcaaca gcaccggcac cctgatggtg 9480
ttcttcggca acgtggacag cagcggcatc aagcacaaca tcttcaaccc ccccatcatc 9540
gcccggtaca tcaggctgca ccccacccac tacagcatcc ggagcaccct gaggatggag 9600
ctgatgggct gcgacctgaa cagctgcagc atgcccctgg gcatggagag caaggccatc 9660
agcgacgccc agatcaccgc cagcagctac ttcaccaaca tgttcgccac ctggagcccc 9720
agcaaggcca ggctgcacct gcagggccgg agcaacgcct ggaggcccca ggtgaacaac 9780
cccaaggagt ggctgcaggt ggacttccag aagaccatga aggtgaccgg cgtgaccacc 9840
cagggcgtga agagcctgct gaccagcatg tacgtgaagg agttcctgat cagcagcagc 9900
caggacggcc accagtggac cctgttcttc cagaacggca aggtgaaggt gttccagggc 9960
aaccaggaca gcttcacccc cgtggtgaac agcctggacc cccccctgct gaccaggtac 10020
ctgcgcatcc acccccagag ctgggtccac cagatcgccc tgaggatgga ggtgctgggg 10080
tgtgaggccc aggacctgta ttgagcggcc gcgtttaaac gtcgacaatc aacctctgga 10140
ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg 10200
tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt 10260
ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag 10320
gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc 10380
caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga 10440
actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa 10500
ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg ctgctcgcct gtgttgccac 10560
ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct 10620
tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca 10680
gacgagtcgg atctcccttt gggccgcctc cccgcctgga attcgagctc ggtaccttta 10740
agaccaatga cttacaaggc agctgtagat cttagccact ttttaaaaga aaagggggga 10800
ctggaagggc taattcactc ccaacgaaga caagatctgc tttttgcttg tactgggtct 10860
ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 10920
aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 10980
tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtagt 11040
agttcatgtc atcttattat tcagtattta taacttgcaa agaaatgaat atcagagagt 11100
gagaggaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 11160
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 11220
gtatcttatc atgtctggct ctagctatcc cgcccctaac tccgcccagt tccgcccatt 11280
ctccgcccca tggctgacta atttttttta tttatgcaga ggccgaggcc gcctcggcct 11340
ctgagctatt ccagaagtag tgaggaggct tttttggagg cctaggcttt tgcgtcgaga 11400
cgtacccaat tcgccctata gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca 11460
acgtcgtgac tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc 11520
tttcgccagc tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg 11580
cagcctgaat ggcgaatggc gcgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 11640
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 11700
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 11760
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgatta 11820
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 11880
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 11940
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 12000
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttc 12060
c 12061
<210> 17
<211> 5612
<212> DNA
<213> Artificial Sequence
<400> 17
ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 60
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 120
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 180
ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 240
ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 300
ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 360
actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 420
gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc 480
tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 540
caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 600
atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 660
acgttaaggg attttggtca tgaagcgctt ttgaagctcg gatccgaaca aacgacccaa 720
cacccgtgcg ttttattctg tctttttatt gccgatcccc tcagaagaac tcgtcaagaa 780
ggcgatagaa ggcgatgcgc tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc 840
ggtcagccca ttcgccgcca agctcttcag caatatcacg ggtagccaac gctatgtcct 900
gatagcggtc cgccacaccc agccggccac agtcgatgaa tccagaaaag cggccatttt 960
ccaccatgat attcggcaag caggcatcgc catgggtcac gacgagatcc tcgccgtcgg 1020
gcatgctcgc cttgagcctg gcgaacagtt cggctggcgc gagcccctga tgctcttcgt 1080
ccagatcatc ctgatcgaca agaccggctt ccatccgagt acgtgctcgc tcgatgcgat 1140
gtttcgcttg gtggtcgaat gggcaggtag ccggatcaag cgtatgcagc cgccgcattg 1200
catcagccat gatggatact ttctcggcag gagcaaggtg agatgacagg agatcctgcc 1260
ccggcacttc gcccaatagc agccagtccc ttcccgcttc agtgacaacg tcgagcacag 1320
ctgcgcaagg aacgcccgtc gtggccagcc acgatagccg cgctgcctcg tcttgcagtt 1380
cattcagggc accggacagg tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca 1440
gccggaacac ggcggcatca gagcagccga ttgtctgttg tgcccagtca tagccgaata 1500
gcctctccac ccaagcggcc ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa 1560
acgatcctca tcctgtctct tgatcagagc ttgatcccct gcgccatcag atccttggcg 1620
gcaagaaagc catccagttt actttgcagg gcttcccaac cttaccagag gcctgcgccg 1680
cggccagctg gctagcaatt cccgggttaa ctctagagac attgattatt gactagttat 1740
taatagtaat caattacggg gtcattagtt catagcccat atatggagtt ccgcgttaca 1800
taacttacgg taaatggccc gcctggctga ccgcccaacg acccccgccc attgacgtca 1860
ataatgacgt atgttcccat agtaacgcca atagggactt tccattgacg tcaatgggtg 1920
gagtatttac ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg 1980
ccccctattg acgtcaatga cggtaaatgg cccgcctggc attatgccca gtacatgacc 2040
ttatgggact ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg 2100
atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 2160
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 2220
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 2280
gaggtctata taagcagagc tcgtttagtg aaccgtcaga tcgcctggag acgccatcca 2340
cgctgttttg acctccatag aagacaccgg gaccgatcca gcctcccctc gaagcttaca 2400
tgtggtaccg agctcggatc ctgagaactt cagggtgagt ctatgggacc cttgatgttt 2460
tctttcccct tcttttctat ggttaagttc atgtcatagg aaggggagaa gtaacagggt 2520
acacatattg accaaatcag ggtaattttg catttgtaat tttaaaaaat gctttcttct 2580
tttaatatac ttttttgttt atcttatttc taatactttc cctaatctct ttctttcagg 2640
gcaataatga tacaatgtat catgcctctt tgcaccattc taaagaataa cagtgataat 2700
ttctgggtta aggcaatagc aatatttctg catataaata tttctgcata taaattgtaa 2760
ctgatgtaag aggtttcata ttgctaatag cagctacaat ccagctacca ttctgctttt 2820
attttatggt tgggataagg ctggattatt ctgagtccaa gctaggccct tttgctaatc 2880
atgttcatac ctcttatctt cctcccacag ctcctgggca acgtgctggt ctgtgtgctg 2940
gcccatcact ttggcaaagc acgtgagatc tgaattctga cactatgaag tgccttttgt 3000
acttagcctt tttattcatt ggggtgaatt gcaagttcac catagttttt ccacacaacc 3060
aaaaaggaaa ctggaaaaat gttccttcta attaccatta ttgcccgtca agctcagatt 3120
taaattggca taatgactta ataggcacag ccttacaagt caaaatgccc aagagtcaca 3180
aggctattca agcagacggt tggatgtgtc atgcttccaa atgggtcact acttgtgatt 3240
tccgctggta tggaccgaag tatataacac attccatccg atccttcact ccatctgtag 3300
aacaatgcaa ggaaagcatt gaacaaacga aacaaggaac ttggctgaat ccaggcttcc 3360
ctcctcaaag ttgtggatat gcaactgtga cggatgccga agcagtgatt gtccaggtga 3420
ctcctcacca tgtgctggtt gatgaataca caggagaatg ggttgattca cagttcatca 3480
acggaaaatg cagcaattac atatgcccca ctgtccataa ctctacaacc tggcattctg 3540
actataaggt caaagggcta tgtgattcta acctcatttc catggacatc accttcttct 3600
cagaggacgg agagctatca tccctgggaa aggagggcac agggttcaga agtaactact 3660
ttgcttatga aactggaggc aaggcctgca aaatgcaata ctgcaagcat tggggagtca 3720
gactcccatc aggtgtctgg ttcgagatgg ctgataagga tctctttgct gcagccagat 3780
tccctgaatg cccagaaggg tcaagtatct ctgctccatc tcagacctca gtggatgtaa 3840
gtctaattca ggacgttgag aggatcttgg attattccct ctgccaagaa acctggagca 3900
aaatcagagc gggtcttcca atctctccag tggatctcag ctatcttgct cctaaaaacc 3960
caggaaccgg tcctgctttc accataatca atggtaccct aaaatacttt gagaccagat 4020
acatcagagt cgatattgct gctccaatcc tctcaagaat ggtcggaatg atcagtggaa 4080
ctaccacaga aagggaactg tgggatgact gggcaccata tgaagacgtg gaaattggac 4140
ccaatggagt tctgaggacc agttcaggat ataagtttcc tttatacatg attggacatg 4200
gtatgttgga ctccgatctt catcttagct caaaggctca ggtgttcgaa catcctcaca 4260
ttcaagacgc tgcttcgcaa cttcctgatg atgagagttt attttttggt gatactgggc 4320
tatccaaaaa tccaatcgag cttgtagaag gttggttcag tagttggaaa agctctattg 4380
cctctttttt ctttatcata gggttaatca ttggactatt cttggttctc cgagttggta 4440
tccatctttg cattaaatta aagcacacca agaaaagaca gatttataca gacatagaga 4500
tgaaccgact tggaaagtaa ctcaaatcct gcacaacaga ttcttcatgt ttggaccaaa 4560
tcaacttgtg ataccatgct caaagaggcc tcaattatat ttgagttttt aatttttatg 4620
aaaaaaaaaa aaaaaaacgg aattcacccc accagtgcag gctgcctatc agaaagtggt 4680
ggctggtgtg gctaatgccc tggcccacaa gtatcactaa gctcgctttc ttgctgtcca 4740
atttctatta aaggttcctt tgttccctaa gtccaactac taaactgggg gatattatga 4800
agggccttga gcatctggat tctgcctaat aaaaaacatt tattttcatt gcaatgatgt 4860
atttaaatta tttctgaata ttttactaaa aagggaatgt gggaggtcag tgcatttaaa 4920
acataaagaa atgaagagct agttcaaacc ttgggaaaat acactatatc ttaaactcca 4980
tgaaagaagg tgaggctgca aacagctaat gcacattggc aacagcccct gatgcctatg 5040
ccttattcat ccctcagaaa aggattcaag tagaggcttg atttggaggt taaagttttg 5100
ctatgctgta ttttagtcga ccattactta ttgttttagc tgtcctcatg aatgtctttt 5160
cactacccat ttgcttatcc tgcatctctc agccttgact ccactcagtt ctcttgctta 5220
gagataccac ctttcccctg aagtgttcct tccatgtttt acggcgagat ggtttctcct 5280
cgcctggcca ctcagcctta gttgtctctg ttgtcttata gaggtctact tgaagaagga 5340
aaaacagggg gcatggtttg actgtcctgt gagcccttct tccctgcctc ccccactcac 5400
agtgacccgg aatccctcga catggcagtc tagcactagt gcggccgcag atctgcttcc 5460
tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 5520
aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 5580
aaaggccagc aaaaggccag gaaccgtaaa aa 5612
<210> 18
<211> 3387
<212> DNA
<213> Artificial Sequence
<400> 18
ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 60
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 120
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 180
ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 240
ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 300
ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 360
actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 420
gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc 480
tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 540
caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 600
atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 660
acgttaaggg attttggtca tgaagcgctt ttgaagctcg gatccgaaca aacgacccaa 720
cacccgtgcg ttttattctg tctttttatt gccgatcccc tcagaagaac tcgtcaagaa 780
ggcgatagaa ggcgatgcgc tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc 840
ggtcagccca ttcgccgcca agctcttcag caatatcacg ggtagccaac gctatgtcct 900
gatagcggtc cgccacaccc agccggccac agtcgatgaa tccagaaaag cggccatttt 960
ccaccatgat attcggcaag caggcatcgc catgggtcac gacgagatcc tcgccgtcgg 1020
gcatgctcgc cttgagcctg gcgaacagtt cggctggcgc gagcccctga tgctcttcgt 1080
ccagatcatc ctgatcgaca agaccggctt ccatccgagt acgtgctcgc tcgatgcgat 1140
gtttcgcttg gtggtcgaat gggcaggtag ccggatcaag cgtatgcagc cgccgcattg 1200
catcagccat gatggatact ttctcggcag gagcaaggtg agatgacagg agatcctgcc 1260
ccggcacttc gcccaatagc agccagtccc ttcccgcttc agtgacaacg tcgagcacag 1320
ctgcgcaagg aacgcccgtc gtggccagcc acgatagccg cgctgcctcg tcttgcagtt 1380
cattcagggc accggacagg tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca 1440
gccggaacac ggcggcatca gagcagccga ttgtctgttg tgcccagtca tagccgaata 1500
gcctctccac ccaagcggcc ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa 1560
acgatcctca tcctgtctct tgatcagagc ttgatcccct gcgccatcag atccttggcg 1620
gcaagaaagc catccagttt actttgcagg gcttcccaac cttaccagag gcctgcgccg 1680
cggccagctg gctagcaatt cccgggttaa ctctagagaa tgtagtctta tgcaatactc 1740
ttgtagtctt gcaacatggt aacgatgagt tagcaacatg ccttacaagg agagaaaaag 1800
caccgtgcat gccgattggt ggaagtaagg tggtacgatc gtgccttatt aggaaggcaa 1860
cagacgggtc tgacatggat tggacgaacc actgaattcc gcattgcaga gatattgtat 1920
ttaagtgcct agctcgatac aataaacgcc atttgaccat tcaccacatt ggtgtgcacc 1980
tccaagctcg agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt 2040
ttgacctcca tagaagacac cgggaccgat ccagcctccc ctcgaagcta gtcgattagg 2100
catctcctat ggcaggaaga agcggagaca gcgacgaaga cctcctcaag gcagtcagac 2160
tcatcaagtt tctctatcaa agcaacccac ctcccaatcc cgaggggacc cgacaggccc 2220
gaaggaatag aagaagaagg tggagagaga gacagagaca gatccattcg attagtgaac 2280
ggatccttag cacttatctg ggacgatctg cggagcctgt gcctcttcag ctaccaccgc 2340
ttgagagact tactcttgat tgtaacgagg attgtggaac ttctgggacg cagggggtgg 2400
gaagccctca aatattggtg gaatctccta caatattgga gtcaggagct aaagaatagt 2460
gctgttagct tgctcaatgc cacagctata gcagtagctg aggggacaga tagggttata 2520
gaagtagtac aagaagcttg gcactggccg tcgttttaca acgtcgtgat ctgagcctgg 2580
gagatctctg gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg 2640
cttcaagtag tgtgtgcccg tctgttgtgt gactctggta actagagatc aggaaaaccc 2700
tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 2760
cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 2820
cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag 2880
caaccatagt gtcgaccatt acttattgtt ttagctgtcc tcatgaatgt cttttcacta 2940
cccatttgct tatcctgcat ctctcagcct tgactccact cagttctctt gcttagagat 3000
accacctttc ccctgaagtg ttccttccat gttttacggc gagatggttt ctcctcgcct 3060
ggccactcag ccttagttgt ctctgttgtc ttatagaggt ctacttgaag aaggaaaaac 3120
agggggcatg gtttgactgt cctgtgagcc cttcttccct gcctccccca ctcacagtga 3180
cccggaatcc ctcgacatgg cagtctagca ctagtgcggc cgcagatctg cttcctcgct 3240
cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 3300
ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg 3360
ccagcaaaag gccaggaacc gtaaaaa 3387
<210> 19
<211> 9171
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 19
ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 60
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 120
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 180
ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 240
ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 300
ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 360
actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 420
gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc 480
tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 540
caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 600
atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 660
acgttaaggg attttggtca tgaagcgctt ttgaagctcg gatccgaaca aacgacccaa 720
cacccgtgcg ttttattctg tctttttatt gccgatcccc tcagaagaac tcgtcaagaa 780
ggcgatagaa ggcgatgcgc tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc 840
ggtcagccca ttcgccgcca agctcttcag caatatcacg ggtagccaac gctatgtcct 900
gatagcggtc cgccacaccc agccggccac agtcgatgaa tccagaaaag cggccatttt 960
ccaccatgat attcggcaag caggcatcgc catgggtcac gacgagatcc tcgccgtcgg 1020
gcatgctcgc cttgagcctg gcgaacagtt cggctggcgc gagcccctga tgctcttcgt 1080
ccagatcatc ctgatcgaca agaccggctt ccatccgagt acgtgctcgc tcgatgcgat 1140
gtttcgcttg gtggtcgaat gggcaggtag ccggatcaag cgtatgcagc cgccgcattg 1200
catcagccat gatggatact ttctcggcag gagcaaggtg agatgacagg agatcctgcc 1260
ccggcacttc gcccaatagc agccagtccc ttcccgcttc agtgacaacg tcgagcacag 1320
ctgcgcaagg aacgcccgtc gtggccagcc acgatagccg cgctgcctcg tcttgcagtt 1380
cattcagggc accggacagg tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca 1440
gccggaacac ggcggcatca gagcagccga ttgtctgttg tgcccagtca tagccgaata 1500
gcctctccac ccaagcggcc ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa 1560
acgatcctca tcctgtctct tgatcagagc ttgatcccct gcgccatcag atccttggcg 1620
gcaagaaagc catccagttt actttgcagg gcttcccaac cttaccagag gcctgcgccg 1680
cggccagctg gctagcaatt cccgggttaa ctctagagac attgattatt gactagttat 1740
taatagtaat caattacggg gtcattagtt catagcccat atatggagtt ccgcgttaca 1800
taacttacgg taaatggccc gcctggctga ccgcccaacg acccccgccc attgacgtca 1860
ataatgacgt atgttcccat agtaacgcca atagggactt tccattgacg tcaatgggtg 1920
gagtatttac ggtaaactgc ccacttggca gtacatcaag tgtatcatat gccaagtacg 1980
ccccctattg acgtcaatga cggtaaatgg cccgcctggc attatgccca gtacatgacc 2040
ttatgggact ttcctacttg gcagtacatc tacgtattag tcatcgctat taccatggtg 2100
atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 2160
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 2220
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 2280
gaggtctata taagcagagc tcgtttagtg aaccgtcaga tcgcctggag acgccatcca 2340
cgctgttttg acctccatag aagacaccgg gaccgatcca gcctcccctc gaagcttaca 2400
tgtggtaccg agctcggatc ctgagaactt cagggtgagt ctatgggacc cttgatgttt 2460
tctttcccct tcttttctat ggttaagttc atgtcatagg aaggggagaa gtaacagggt 2520
acacatattg accaaatcag ggtaattttg catttgtaat tttaaaaaat gctttcttct 2580
tttaatatac ttttttgttt atcttatttc taatactttc cctaatctct ttctttcagg 2640
gcaataatga tacaatgtat catgcctctt tgcaccattc taaagaataa cagtgataat 2700
ttctgggtta aggcaatagc aatatttctg catataaata tttctgcata taaattgtaa 2760
ctgatgtaag aggtttcata ttgctaatag cagctacaat ccagctacca ttctgctttt 2820
attttatggt tgggataagg ctggattatt ctgagtccaa gctaggccct tttgctaatc 2880
atgttcatac ctcttatctt cctcccacag ctcctgggca acgtgctggt ctgtgtgctg 2940
gcccatcact ttggcaaagc acgtgagatc tgaattcgag atctgccgcc gccatgggtg 3000
cgagagcgtc agtattaagc gggggagaat tagatcgatg ggaaaaaatt cggttaaggc 3060
cagggggaaa gaaaaaatat aaattaaaac atatagtatg ggcaagcagg gagctagaac 3120
gattcgcagt taatcctggc ctgttagaaa catcagaagg ctgtagacaa atactgggac 3180
agctacaacc atcccttcag acaggatcag aagaacttag atcattatat aatacagtag 3240
caaccctcta ttgtgtgcat caaaggatag agataaaaga caccaaggaa gctttagaca 3300
agatagagga agagcaaaac aaaagtaaga aaaaagcaca gcaagcagca gctgacacag 3360
gacacagcaa tcaggtcagc caaaattacc ctatagtgca gaacatccag gggcaaatgg 3420
tacatcaggc catatcacct agaactttaa atgcatgggt aaaagtagta gaagagaagg 3480
ctttcagccc agaagtgata cccatgtttt cagcattatc agaaggagcc accccacaag 3540
atttaaacac catgctaaac acagtggggg gacatcaagc agccatgcaa atgttaaaag 3600
agaccatcaa tgaggaagct gcagaatggg atagagtgca tccagtgcat gcagggccta 3660
ttgcaccagg ccagatgaga gaaccaaggg gatcagacat cgctggaact actagtaccc 3720
ttcaggaaca aataggatgg atgacacata atccacctat cccagtagga gaaatctata 3780
aaagatggat aatcctggga ttaaataaaa tagtaagaat gtatagccct accagcattc 3840
tggacataag acaaggacca aaggaaccct ttagagacta tgtagaccga ttctataaaa 3900
ctctaagagc cgagcaagct tcacaagagg taaaaaattg gatgacagaa accttgttgg 3960
tccaaaatgc gaacccagat tgtaagacta ttttaaaagc attgggacca ggagcgacac 4020
tagaagaaat gatgacagca tgtcagggag tggggggacc cggccataaa gcaagagttt 4080
tggctgaagc aatgagccaa gtaacaaatc cagctaccat aatgatacag aaaggcaatt 4140
ttaggaacca aagaaagact gttaagtgtt tcaattgtgg caaagaaggg cacatagcca 4200
aaaattgcag ggcccctagg aaaaagggct gttggaaatg tggaaaggaa ggacaccaaa 4260
tgaaagattg tactgagaga caggctaatt ttttagggaa gatctggcct tcccacaagg 4320
gaaggccagg gaattttctt cagagcagac cagagccaac agccccacca gaagagagct 4380
tcaggtttgg ggaagagaca acaactccct ctcagaagca ggagccgata gacaaggaac 4440
tgtatccttt agcttccctc agatcactct ttggcagcga cccctcgtca caataaagat 4500
aggggggcaa ttaaaggaag ctctattaga tactggtgct gacgacacag tattagaaga 4560
aatgaatttg ccaggaagat ggaaaccaaa aatgataggg ggaattggag gttttatcaa 4620
agtaagacag tatgatcaga tactcataga aatctgcgga cataaagcta taggtacagt 4680
attagtagga cctacacctg tcaacataat tggaagaaat ctgttgactc agattggctg 4740
cactttaaat tttcccatta gtcctattga gactgtacca gtaaaattaa agccaggaat 4800
ggatggccca aaagttaaac aatggccatt gacagaagaa aaaataaaag cattagtaga 4860
aatttgtaca gaaatggaaa aggaaggaaa aatttcaaaa attgggcctg aaaatccata 4920
caatactcca gtatttgcca taaagaaaaa agacagtact aaatggagaa aattagtaga 4980
tttcagagaa cttaataaga gaactcaaga tttctgggaa gttcaattag gaataccaca 5040
tcctgcaggg ttaaaacaga aaaaatcagt aacagtactg gatgtgggcg atgcatattt 5100
ttcagttccc ttagataaag acttcaggaa gtatactgca tttaccatac ctagtataaa 5160
caatgagaca ccagggatta gatatcagta caatgtgctt ccacagggat ggaaaggatc 5220
accagcaata ttccagtgta gcatgacaaa aatcttagag ccttttagaa aacaaaatcc 5280
agacatagtc atctatcaat acatggatga tttgtatgta ggatctgact tagaaatagg 5340
gcagcataga acaaaaatag aggaactgag acaacatctg ttgaggtggg gatttaccac 5400
accagacaaa aaacatcaga aagaacctcc attcctttgg atgggttatg aactccatcc 5460
tgataaatgg acagtacagc ctatagtgct gccagaaaag gacagctgga ctgtcaatga 5520
catacagaaa ttagtgggaa aattgaattg ggcaagtcag atttatgcag ggattaaagt 5580
aaggcaatta tgtaaacttc ttaggggaac caaagcacta acagaagtag taccactaac 5640
agaagaagca gagctagaac tggcagaaaa cagggagatt ctaaaagaac cggtacatgg 5700
agtgtattat gacccatcaa aagacttaat agcagaaata cagaagcagg ggcaaggcca 5760
atggacatat caaatttatc aagagccatt taaaaatctg aaaacaggaa agtatgcaag 5820
aatgaagggt gcccacacta atgatgtgaa acaattaaca gaggcagtac aaaaaatagc 5880
cacagaaagc atagtaatat ggggaaagac tcctaaattt aaattaccca tacaaaagga 5940
aacatgggaa gcatggtgga cagagtattg gcaagccacc tggattcctg agtgggagtt 6000
tgtcaatacc cctcccttag tgaagttatg gtaccagtta gagaaagaac ccataatagg 6060
agcagaaact ttctatgtag atggggcagc caatagggaa actaaattag gaaaagcagg 6120
atatgtaact gacagaggaa gacaaaaagt tgtcccccta acggacacaa caaatcagaa 6180
gactgagtta caagcaattc atctagcttt gcaggattcg ggattagaag taaacatagt 6240
gacagactca caatatgcat tgggaatcat tcaagcacaa ccagataaga gtgaatcaga 6300
gttagtcagt caaataatag agcagttaat aaaaaaggaa aaagtctacc tggcatgggt 6360
accagcacac aaaggaattg gaggaaatga acaagtagat aaattggtca gtgctggaat 6420
caggaaagta ctatttttag atggaataga taaggcccaa gaagaacatg agaaatatca 6480
cagtaattgg agagcaatgg ctagtgattt taacctacca cctgtagtag caaaagaaat 6540
agtagccagc tgtgataaat gtcagctaaa aggggaagcc atgcatggac aagtagactg 6600
tagcccagga atatggcagc tagattgtac acatttagaa ggaaaagtta tcttggtagc 6660
agttcatgta gccagtggat atatagaagc agaagtaatt ccagcagaga cagggcaaga 6720
aacagcatac ttcctcttaa aattagcagg aagatggcca gtaaaaacag tacatacaga 6780
caatggcagc aatttcacca gtactacagt taaggccgcc tgttggtggg cggggatcaa 6840
gcaggaattt ggcattccct acaatccgca gtcacaagga gtaatagaat ctatgaataa 6900
agaattaaag aaaattatag gacaggtaag agatcaggct gaacatctta aaacagcagt 6960
acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg ggtacagtgc 7020
aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat tacaaaaaca 7080
aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc cagtttggaa 7140
aggaccagca aagctcctct ggaaaggtga aggggcagta gtaatacaag ataatagtga 7200
cataaaagta gtgccaagaa gaaaagcaaa gatcatcagg gattatggaa aacagatggc 7260
aggtgatgat tgtgtggcaa gtagacagga tgaggattaa cacatggaat tccggagcgg 7320
ccgcaggagc tttgttcctt gggttcttgg gagcagcagg aagcactatg ggcgcagcgt 7380
caatgacgct gacggtacag gccagacaat tattgtctgg tatagtgcag cagcagaaca 7440
atttgctgag ggctattgag gcgcaacagc atctgttgca actcacagtc tggggcatca 7500
agcagctcca ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa cagctcctgg 7560
ggatttgggg ttgctctgga aaactcattt gcaccactgc tgtgccttgg aatgctagtt 7620
ggagtaataa atctctggaa cagatttgga atcacacgac ctggatggag tgggacagag 7680
aaattaacaa ttacacaagc ttccgcggaa ttcaccccac cagtgcaggc tgcctatcag 7740
aaagtggtgg ctggtgtggc taatgccctg gcccacaagt atcactaagc tcgctttctt 7800
gctgtccaat ttctattaaa ggttcctttg ttccctaagt ccaactacta aactggggga 7860
tattatgaag ggccttgagc atctggattc tgcctaataa aaaacattta ttttcattgc 7920
aatgatgtat ttaaattatt tctgaatatt ttactaaaaa gggaatgtgg gaggtcagtg 7980
catttaaaac ataaagaaat gaagagctag ttcaaacctt gggaaaatac actatatctt 8040
aaactccatg aaagaaggtg aggctgcaaa cagctaatgc acattggcaa cagcccctga 8100
tgcctatgcc ttattcatcc ctcagaaaag gattcaagta gaggcttgat ttggaggtta 8160
aagttttgct atgctgtatt ttacattact tattgtttta gctgtcctca tgaatgtctt 8220
ttcactaccc atttgcttat cctgcatctc tcagccttga ctccactcag ttctcttgct 8280
tagagatacc acctttcccc tgaagtgttc cttccatgtt ttacggcgag atggtttctc 8340
ctcgcctggc cactcagcct tagttgtctc tgttgtctta tagaggtcta cttgaagaag 8400
gaaaaacagg gggcatggtt tgactgtcct gtgagccctt cttccctgcc tcccccactc 8460
acagtgaccc ggaatccctc gacatggcag tctagcacta gtgcggccgc agatctgctt 8520
cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact 8580
caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 8640
caaaaggcca gcaaaaggcc aggaaccgta aaaagtcgac cattacttat tgttttagct 8700
gtcctcatga atgtcttttc actacccatt tgcttatcct gcatctctca gccttgactc 8760
cactcagttc tcttgcttag agataccacc tttcccctga agtgttcctt ccatgtttta 8820
cggcgagatg gtttctcctc gcctggccac tcagccttag ttgtctctgt tgtcttatag 8880
aggtctactt gaagaaggaa aaacaggggg catggtttga ctgtcctgtg agcccttctt 8940
ccctgcctcc cccactcaca gtgacccgga atccctcgac atggcagtct agcactagtg 9000
cggccgcaga tctgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg 9060
agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc 9120
aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa a 9171
<210> 20
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 20
ctgttgtgtg actctggtaa ct 22
<210> 21
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 21
aaatctctag cagtggcgcc cg 22
<210> 22
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 22
ttcgctttca agtccctgtt 20
<210> 23
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 23
gctgtcatct cttgtgggct gt 22
<210> 24
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 24
cctgtcatgc ccacacaaat ctctcc 26
<210> 25
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 25
actcatggga gctgctggtt c 21
<210> 26
<211> 55
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 26
gtccactcat tcttggatcc accggtgcca ccatgcaaat agagctctcc acctg 55
<210> 27
<211> 52
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 27
cgtttcaaga ctggtgggtt ttggctaggg tgtcttgaat tctgggagaa gc 52
<210> 28
<211> 52
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 28
gcttctccca gaattcaaga caccctagcc aaaacccacc agtcttgaaa cg 52
<210> 29
<211> 59
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 29
tccagaggtt gattgtcgac gtttaaacgc ggccgctcag tagaggtcct gtgcctcgc 59
<210> 30
<211> 56
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 30
acaaaaattc aaaattttat cgataagagc atgcgtgagg ctccggtgcc cgtcag 56
<210> 31
<211> 51
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 31
catggtggca ccggtggatc caagaatgag tcacgacacc tgaaatggaa g 51
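As a consistency check on the primer entries above, the following Python sketch parses two records copied from this listing (SEQ ID NO: 27 and SEQ ID NO: 28), compares each declared <211> length with the number of residues actually listed, and tests whether the two primers are reverse complements of each other. That relationship would be consistent with an overlapping primer pair, although the listing itself does not state the purpose of these primers. The names parse_listing and reverse_complement are illustrative assumptions and are not part of the patent; the parser handles only the simple record layout seen here, not the full sequence-listing standard.

```python
import re

# Two records copied verbatim from the sequence listing (SEQ ID NO: 27 and 28).
LISTING = """\
<210> 27
<211> 52
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 27
cgtttcaaga ctggtgggtt ttggctaggg tgtcttgaat tctgggagaa gc 52
<210> 28
<211> 52
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 28
gcttctccca gaattcaaga caccctagcc aaaacccacc agtcttgaaa cg 52
"""

# Translation table for complementing lowercase DNA bases.
COMPLEMENT = str.maketrans("acgt", "tgca")


def parse_listing(text):
    """Return {seq_id: (declared_length, sequence)} for the records in text.

    Hypothetical helper: only the <210> and <211> tags are interpreted;
    every untagged line is treated as residues.
    """
    records = {}
    seq_id, declared, residues = None, None, []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        tag = re.match(r"<(\d{3})>\s*(.*)", line)
        if tag:
            code, value = tag.group(1), tag.group(2)
            if code == "210":
                # A new record starts: store the previous one first.
                if seq_id is not None:
                    records[seq_id] = (declared, "".join(residues))
                seq_id, declared, residues = int(value), None, []
            elif code == "211":
                declared = int(value)
        else:
            # Residue line: keep letters only, dropping spaces and the
            # running position count at the end of the line.
            residues.append(re.sub(r"[^a-z]", "", line.lower()))
    if seq_id is not None:
        records[seq_id] = (declared, "".join(residues))
    return records


def reverse_complement(seq):
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


records = parse_listing(LISTING)
for seq_id, (declared, seq) in records.items():
    # Prints: 27 52 52 True, then 28 52 52 True
    print(seq_id, declared, len(seq), declared == len(seq))

# Prints: True (SEQ ID NO: 28 is the reverse complement of SEQ ID NO: 27)
print(reverse_complement(records[27][1]) == records[28][1])
```

The same two helpers can be pointed at any other record of the listing by pasting its lines into LISTING; longer entries that span many residue lines are concatenated before the length comparison.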

Claims (11)

1. A codon-optimized coagulation factor VIII gene, characterized in that the nucleotide sequence of the codon-optimized coagulation factor VIII gene is SEQ ID NO: 8 or SEQ ID NO: 10.
2. A nucleic acid construct comprising the codon-optimized coagulation factor VIII gene of claim 1.
3. The nucleic acid construct of claim 2, wherein the nucleic acid construct is a non-viral vector or a viral vector.
4. The nucleic acid construct of claim 3, wherein the viral vector is a lentiviral vector or an adeno-associated viral vector, and the codon-optimized coagulation factor VIII gene is located in the expression cassette of the lentiviral vector or adeno-associated viral vector.
5. A lentivirus produced by viral packaging of the nucleic acid construct of claim 2.
6. A lentiviral vector system, wherein the lentiviral vector system comprises a helper plasmid and the nucleic acid construct of claim 2.
7. The lentiviral vector system of claim 6, further comprising a host cell.
8. A composition for preventing or treating a coagulation factor deficiency disease, wherein the active ingredients of the composition comprise one or more of the following: the codon-optimized coagulation factor VIII gene of claim 1; the nucleic acid construct of claim 2; the lentivirus of claim 5.
9. A cell line infected with the lentivirus of claim 5.
10. Use of the codon-optimized coagulation factor VIII gene according to claim 1, the nucleic acid construct according to claim 2, the lentivirus according to claim 5, the composition according to claim 8 or the cell line according to claim 9 for the preparation of a medicament for the prevention and/or treatment of a coagulation factor deficiency disease.
11. The use of claim 10, wherein the coagulation factor deficiency disease comprises one or more of hemophilia A, hemophilia B, and hemophilia C.
CN202010581446.3A 2020-06-23 2020-06-23 Codon-optimized coagulation factor VIII gene and construct thereof Active CN111808863B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010581446.3A CN111808863B (en) 2020-06-23 2020-06-23 Codon-optimized coagulation factor VIII gene and construct thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010581446.3A CN111808863B (en) 2020-06-23 2020-06-23 Codon-optimized coagulation factor VIII gene and construct thereof

Publications (2)

Publication Number Publication Date
CN111808863A CN111808863A (en) 2020-10-23
CN111808863B true CN111808863B (en) 2021-05-28

Family

ID=72845924

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010581446.3A Active CN111808863B (en) 2020-06-23 2020-06-23 Codon-optimized coagulation factor VIII gene and construct thereof

Country Status (1)

Country Link
CN (1) CN111808863B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114410687B (en) * 2022-01-06 2024-04-19 上海本导基因技术有限公司 Lentiviral vector suitable for gene therapy of thalassemia and sickle-shaped anemia
CN115948408A (en) * 2022-09-23 2023-04-11 上海信致医药科技有限公司 Improved human coagulation factor VIII gene expression cassette and application thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108348458A (en) * 2015-11-05 2018-07-31 Novo Nordisk A/S FVIII preparations
CN109072214A (en) * 2016-02-01 2018-12-21 Bioverativ Therapeutics Inc. Optimized factor VIII gene
CN109929029A (en) * 2017-12-15 2019-06-25 Sunshine Lake Pharma Co., Ltd. Method for improving high-efficiency expression of recombinant human coagulation factor VIII
WO2019152692A1 (en) * 2018-02-01 2019-08-08 Bioverativ Therapeutics, Inc. Use of lentiviral vectors expressing factor viii

Also Published As

Publication number Publication date
CN111808863A (en) 2020-10-23

Similar Documents

Publication Publication Date Title
ES2805045T3 (en) Lentiviral vectors
KR101320489B1 (en) Serum-free stable transfection and production of recombinant human proteins in human cell lines
CA2523138C (en) Lentiviral vectors carrying synthetic bi-directional promoters and uses thereof
AU2020244473B2 (en) An improved fetal hemoglobin for genetic correction of sickle cell disease
KR20210139265A (en) Adenosine deaminase base editor for modifying nucleobases in target sequences and methods of using the same
CN111808863B (en) Codon-optimized coagulation factor VIII gene and construct thereof
US20220364110A1 (en) Methods and compositions for genomic integration
CN106978443B (en) Beta-globin recombinant lentiviral vector and application thereof
CN110551713A (en) Optimized genetic tools for modifying clostridium bacteria
WO2012156839A2 (en) New generation of splice-less lentiviral vectors for safer gene therapy applications
AU782960B2 (en) Conditional gene trapping construct for the disruption of genes
KR20190111966A (en) Mutants of Adeno-associated Virus (AAV) Capsid Proteins
KR20230129162A (en) RNA targeting composition and method for treating type 1 myotonic dystrophy
KR20230127221A (en) RNA targeting compositions and methods for treating CAG repeat disease
CN113388642B (en) Nucleic acid construct
CN113549654B (en) Nucleic acid construct
KR20240037192A (en) Methods and compositions for genome integration
KR20230146525A (en) Improved disease treatment and nucleic acid delivery
US20240082327A1 (en) Retroviral vectors
KR20210118826A (en) Genetically modified Clostridium bacteria, preparation and use thereof
TW202246508A (en) Retroviral vectors
WO2024062259A1 (en) Retroviral vector comprising rre inserted within an intron
KR20240029020A (en) CRISPR-transposon system for DNA modification
KR20220012324A (en) Genetic Tools Optimized for Transformation of Bacteria

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant